Merged Conference Papers
Generated from the Conference database.
ACL 2020
| Title | Authors | PDF Link | Code URL |
|---|---|---|---|
| 2kenize: Tying Subword Sequences for Chinese Script Conversion | Pranav A and Isabelle Augenstein | N/A | N/A |
| A Batch Normalized Inference Network Keeps the KL Vanishing Away | Qile Zhu, Wei Bi, Xiaojiang Liu, Xiyao Ma, Xiaolin Li and Dapeng Wu | N/A | N/A |
| A Call for More Rigor in Unsupervised Cross-lingual Learning | Mikel Artetxe, Sebastian Ruder, Dani Yogatama, Gorka Labaka and Eneko Agirre | N/A | N/A |
| A Comprehensive Analysis of Preprocessing for Word Representation Learning in Affective Tasks | Nastaran Babanejad, Ameeta Agrawal, Aijun An and Manos Papagelis | N/A | N/A |
| A Contextual Hierarchical Attention Network with Adaptive Objective for Dialogue State Tracking | Yong Shan, Zekang Li, Jinchao Zhang, Fandong Meng, Yang Feng, Cheng Niu and Jie Zhou | N/A | N/A |
| A Corpus for Large-Scale Phonetic Typology | Elizabeth Salesky, Eleanor Chodroff, Tiago Pimentel, Matthew Wiesner, Ryan Cotterell, Alan W Black and Jason Eisner | N/A | N/A |
| A Formal Hierarchy of RNN Architectures | William Merrill, Gail Weiss, Yoav Goldberg, Roy Schwartz, Noah A. Smith and Eran Yahav | N/A | N/A |
| A Generate-and-Rank Framework with Semantic Type Regularization for Biomedical Concept Normalization | Dongfang Xu, Zeyu Zhang and Steven Bethard | N/A | N/A |
| A Generative Model for Joint Natural Language Understanding and Generation | Bo-Hsiang Tseng, Jianpeng Cheng, Yimai Fang and David Vandyke | N/A | N/A |
| A Girl Has A Name: Detecting Authorship Obfuscation | Asad Mahmood, Zubair Shafiq and Padmini Srinivasan | N/A | N/A |
| A Graph Auto-encoder Model of Derivational Morphology | Valentin Hofmann, Hinrich Schütze and Janet Pierrehumbert | N/A | N/A |
| A Graph-based Coarse-to-fine Method for Unsupervised Bilingual Lexicon Induction | Shuo Ren, Shujie Liu, Ming Zhou and Shuai Ma | N/A | N/A |
| A Joint Model for Document Segmentation and Segment Labeling | Joe Barrow, Rajiv Jain, Vlad Morariu, Varun Manjunatha, Douglas Oard and Philip Resnik | N/A | N/A |
| A Joint Neural Model for Information Extraction with Global Features | Ying Lin, Heng Ji, Fei Huang and Lingfei Wu | N/A | N/A |
| A Methodology for Creating Question Answering Corpora Using Inverse Data Annotation | Jan Deriu, Katsiaryna Mlynchyk, Philippe Schläpfer, Alvaro Rodrigo, Dirk von Grünigen, Nicolas Kaiser, Kurt Stockinger, Eneko Agirre and Mark Cieliebak | N/A | N/A |
| A Mixture of h − 1 Heads is Better than h Heads | Hao Peng, Roy Schwartz, Dianqi Li and Noah A. Smith | N/A | N/A |
| A Monolingual Approach to Contextualized Word Embeddings for Mid-Resource Languages | Pedro Javier Ortiz Suárez, Laurent Romary and Benoît Sagot | N/A | N/A |
| A Multitask Learning Approach for Diacritic Restoration | Sawsan Alqahtani, Ajay Mishra and Mona Diab | N/A | N/A |
| A Novel Cascade Binary Tagging Framework for Relational Triple Extraction | Zhepei Wei, Jianlin Su, Yue Wang, Yuan Tian and Yi Chang | N/A | N/A |
| A Novel Graph-based Multi-modal Fusion Encoder for Neural Machine Translation | Yongjing Yin, Fandong Meng, Jinsong Su, Chulun Zhou, Zhengyuan Yang, Jie Zhou and Jiebo Luo | N/A | N/A |
| A Prioritization Model for Suicidality Risk Assessment | Han-Chin Shing, Philip Resnik and Douglas Oard | N/A | N/A |
| A Recipe for Creating Multimodal Aligned Datasets for Sequential Tasks | Angela Lin, Sudha Rao, Asli Celikyilmaz, Elnaz Nouri, Chris Brockett, Debadeepta Dey and Bill Dolan | N/A | N/A |
| A Reinforced Generation of Adversarial Examples for Neural Machine Translation | Wei Zou, Shujian Huang, Jun Xie, Xinyu Dai and Jiajun Chen | N/A | N/A |
| A Self-Training Method for Machine Reading Comprehension with Soft Evidence Extraction | Yilin Niu, Fangkai Jiao, Mantong Zhou, Ting Yao, Jingfang Xu and Minlie Huang | N/A | N/A |
| A Span-based Linearization for Constituent Trees | Yang Wei, Yuanbin Wu and Man Lan | N/A | N/A |
| A Study of Non-autoregressive Model for Sequence Generation | Yi Ren, Jinglin Liu, Xu Tan, Zhou Zhao, Sheng Zhao and Tie-Yan Liu | N/A | N/A |
| A Systematic Assessment of Syntactic Generalization in Neural Language Models | Jennifer Hu, Jon Gauthier, Peng Qian, Ethan Wilcox and Roger Levy | N/A | N/A |
| A Tale of Two Perplexities: Sensitivity of Neural Language Models to Lexical Retrieval Deficits in Dementia of the Alzheimer’s Type | Trevor Cohen and Serguei Pakhomov | N/A | N/A |
| A Top-down Neural Architecture towards Text-level Parsing of Discourse Rhetorical Structure | Longyin Zhang, Yuqing Xing, Fang Kong, Peifeng Li and Guodong Zhou | N/A | N/A |
| A Unified MRC Framework for Named Entity Recognition | Xiaoya Li, Jingrong Feng, Yuxian Meng, Qinghong Han, Fei Wu and Jiwei Li | N/A | N/A |
| Adaptive Compression of Word Embeddings | Yeachan Kim, Kang-Min Kim and SangKeun Lee | N/A | N/A |
| Addressing Posterior Collapse with Mutual Information for Improved Variational Neural Machine Translation | Arya D. McCarthy, Xian Li, Jiatao Gu and Ning Dong | N/A | N/A |
| AdvAug: Robust Adversarial Augmentation for Neural Machine Translation | Yong Cheng, Lu Jiang, Wolfgang Macherey and Jacob Eisenstein | N/A | N/A |
| Adversarial and Domain-Aware BERT for Cross-Domain Sentiment Analysis | Chunning Du, Haifeng Sun, Jingyu Wang, Qi Qi and Jianxin Liao | N/A | N/A |
| Adversarial NLI: A New Benchmark for Natural Language Understanding | Yixin Nie, Adina Williams, Emily Dinan, Mohit Bansal, Jason Weston and Douwe Kiela | N/A | N/A |
| Agreement Prediction of Arguments in Cyber Argumentation for Detecting Stance Polarity and Intensity | Joseph Sirrianni, Xiaoqing Liu and Douglas Adams | N/A | N/A |
| Aligned Dual Channel Graph Convolutional Network for Visual Question Answering | Qingbao Huang, Jielong Wei, Yi Cai, Changmeng Zheng, Junying Chen, Ho-fung Leung and Qing Li | N/A | N/A |
| Amalgamation of protein sequence, structure and textual information for improving protein-protein interaction identification | Pratik Dutta and Sriparna Saha | N/A | N/A |
| AMR Parsing via Graph-Sequence Iterative Inference | Deng Cai and Wai Lam | N/A | N/A |
| AMR Parsing with Latent Structural Information | Qiji Zhou, Yue Zhang, Donghong Ji and Hao Tang | N/A | N/A |
| An analysis of the utility of explicit negative examples to improve the syntactic abilities of neural language models | Hiroshi Noji and Hiroya Takamura | N/A | N/A |
| An Effective Transition-based Model for Discontinuous NER | Xiang Dai, Sarvnaz Karimi, Ben Hachey and Cecile Paris | N/A | N/A |
| An Effectiveness Metric for Ordinal Classification: Formal Properties and Experimental Results | Enrique Amigo, Julio Gonzalo, Stefano Mizzaro and Jorge Carrillo-de-Albornoz | N/A | N/A |
| An Online Semantic-enhanced Dirichlet Model for Short Text Stream Clustering | Jay Kumar, Junming Shao, Salah Uddin and Wazir Ali | N/A | N/A |
| Analysing Lexical Semantic Change with Contextualised Word Representations | Mario Giulianelli, Marco Del Tredici and Raquel Fernández | N/A | N/A |
| Analyzing analytical methods: The case of phonology in neural models of spoken language | Grzegorz Chrupała, Bertrand Higy and Afra Alishahi | N/A | N/A |
| Analyzing Political Parody in Social Media | Antonios Maronikolakis, Danae Sánchez Villegas, Daniel Preotiuc-Pietro and Nikolaos Aletras | N/A | N/A |
| Are Natural Language Inference Models IMPPRESsive? Learning IMPlicature and PRESupposition | Paloma Jeretic, Alex Warstadt, Suvrat Bhooshan and Adina Williams | N/A | N/A |
| Asking and Answering Questions to Evaluate the Factual Consistency of Summaries | Alex Wang, Kyunghyun Cho and Mike Lewis | N/A | N/A |
| Aspect Sentiment Classification with Document-level Sentiment Preference Modeling | Xiao Chen, Changlong Sun, Jingjing Wang, Shoushan Li, Luo Si, Min Zhang and Guodong Zhou | N/A | N/A |
| ASSET: A Dataset for Tuning and Evaluation of Sentence Simplification Models with Multiple Rewriting Transformations | Fernando Alva-Manchego, Louis Martin, Antoine Bordes, Carolina Scarton, Benoît Sagot and Lucia Specia | N/A | N/A |
| Attend, Translate and Summarize: An Efficient Method for Neural Cross-Lingual Summarization | Junnan Zhu, Yu Zhou, Jiajun Zhang and Chengqing Zong | N/A | N/A |
| Attentive Pooling with Learnable Norms for Text Representation | Chuhan Wu, Fangzhao Wu, Tao Qi, Xiaohui Cui and Yongfeng Huang | N/A | N/A |
| Autoencoding Pixies: Amortised Variational Inference with Graph Convolutions for Functional Distributional Semantics | Guy Emerson | N/A | N/A |
| Automated Evaluation of Writing – 50 Years and Counting | Beata Beigman Klebanov and Nitin Madnani | N/A | N/A |
| Automatic Detection of Generated Text is Easiest when Humans are Fooled | Daphne Ippolito, Daniel Duckworth, Chris Callison-Burch and Douglas Eck | N/A | N/A |
| Automatic Generation of Citation Texts in Scholarly Papers: A Pilot Study | Xinyu Xing, Xiaosheng Fan and Xiaojun Wan | N/A | N/A |
| Automatic Poetry Generation from Prosaic Text | Tim Van de Cruys | N/A | N/A |
| BabyWalk: Going Farther in Vision-and-Language Navigation by Taking Baby Steps | Wang Zhu, Hexiang Hu, Jiacheng Chen, Zhiwei Deng, Vihan Jain, Eugene Ie and Fei Sha | N/A | N/A |
| Balancing Objectives in Counseling Conversations: Advancing Forwards or Looking Backwards | Justine Zhang and Cristian Danescu-Niculescu-Mizil | N/A | N/A |
| Balancing Training for Multilingual Neural Machine Translation | Xinyi Wang, Yulia Tsvetkov and Graham Neubig | N/A | N/A |
| BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension | Mike Lewis, Yinhan Liu, Naman Goyal, Marjan Ghazvininejad, Abdelrahman Mohamed, Omer Levy, Veselin Stoyanov and Luke Zettlemoyer | N/A | N/A |
| Benchmarking Multimodal Regex Synthesis with Complex Structures | Xi Ye, Qiaochu Chen, Isil Dillig and Greg Durrett | N/A | N/A |
| BERTRAM: Improved Word Embeddings Have Big Impact on Contextualized Model Performance | Timo Schick and Hinrich Schütze | N/A | N/A |
| Beyond Accuracy: Behavioral Testing of NLP Models with CheckList | Marco Tulio Ribeiro, Tongshuang Wu, Carlos Guestrin and Sameer Singh | N/A | N/A |
| Beyond Possession Existence: Duration and Co-Possession | Dhivya Chinnappa, Srikala Murugan and Eduardo Blanco | N/A | N/A |
| Beyond User Self-Reported Likert Scale Ratings: A Comparison Model for Automatic Dialog Evaluation | Weixin Liang, James Zou and Zhou Yu | N/A | N/A |
| Bilingual Dictionary Based Neural Machine Translation without Using Parallel Sentences | Xiangyu Duan, Baijun Ji, Hao Jia, Min Tan, Min Zhang, Boxing Chen, Weihua Luo and Yue Zhang | N/A | N/A |
| Biomedical Entity Representations with Synonym Marginalization | Mujeen Sung, Hwisang Jeon, Jinhyuk Lee and Jaewoo Kang | N/A | N/A |
| Bipartite Flat-Graph Network for Nested Named Entity Recognition | Ying Luo and Hai Zhao | N/A | N/A |
| BiRRE: Learning Bidirectional Residual Relation Embeddings for Supervised Hypernymy Detection | Chengyu Wang and Xiaofeng He | N/A | N/A |
| BLEURT: Learning Robust Metrics for Text Generation | Thibault Sellam, Dipanjan Das and Ankur Parikh | N/A | N/A |
| Boosting Neural Machine Translation with Similar Translations | Jitao Xu, Josep Crego and Jean Senellart | N/A | N/A |
| Bootstrapping Techniques for Polysynthetic Morphological Analysis | William Lane and Steven Bird | N/A | N/A |
| BPE-Dropout: Simple and Effective Subword Regularization | Ivan Provilkov, Dmitrii Emelianenko and Elena Voita | N/A | N/A |
| Breaking Through the 80% Glass Ceiling: Raising the State of the Art in Word Sense Disambiguation by Incorporating Knowledge Graph Information | Michele Bevilacqua and Roberto Navigli | N/A | N/A |
| Bridging Anaphora Resolution as Question Answering | Yufang Hou | N/A | N/A |
| Bridging the Structural Gap Between Encoding and Decoding for Data-To-Text Generation | Chao Zhao, Marilyn Walker and Snigdha Chaturvedi | N/A | N/A |
| Building a User-Generated Content North-African Arabizi Treebank: Tackling Hell | Djamé Seddah, Farah Essaidi, Amal Fethi, Matthieu Futeral, Benjamin Muller, Pedro Javier Ortiz Suárez, Benoît Sagot and Abhishek Srivastava | N/A | N/A |
| Calibrating Structured Output Predictors for Natural Language Processing | Abhyuday Jagannatha and Hong Yu | N/A | N/A |
| CamemBERT: a Tasty French Language Model | Louis Martin, Benjamin Muller, Pedro Javier Ortiz Suárez, Yoann Dupont, Laurent Romary, Éric de la Clergerie, Djamé Seddah and Benoît Sagot | N/A | N/A |
| Can We Predict New Facts with Open Knowledge Graph Embeddings? A Benchmark for Open Link Prediction | Samuel Broscheit, Kiril Gashteovski, Yanjie Wang and Rainer Gemulla | N/A | N/A |
| Can You Put it All Together: Evaluating Conversational Agents’ Ability to Blend Skills | Eric Michael Smith, Mary Williamson, Kurt Shuster, Jason Weston and Y-Lan Boureau | N/A | N/A |
| CDL: Curriculum Dual Learning for Emotion-Controllable Response Generation | Lei Shen and Yang Feng | N/A | N/A |
| ChartDialogs: Plotting from Natural Language Instructions | Yutong Shao and Ndapa Nakashole | N/A | N/A |
| CH-SIMS: A Chinese Multimodal Sentiment Analysis Dataset with Fine-grained Annotation of Modality | Wenmeng Yu, Hua Xu, Fanyang Meng, Yilin Zhu, Yixiao Ma, Jiele Wu, Jiyun Zou and Kaicheng Yang | N/A | N/A |
| Climbing towards NLU: On Meaning, Form, and Understanding in the Age of Data | Emily M. Bender and Alexander Koller | N/A | N/A |
| Clinical Reading Comprehension: A Thorough Analysis of the emrQA Dataset | Xiang Yue, Bernal Jimenez Gutierrez and Huan Sun | N/A | N/A |
| CluBERT: A Cluster-Based Approach for Learning Sense Distributions in Multiple Languages | Tommaso Pasini, Federico Scozzafava and Bianca Scarlini | N/A | N/A |
| CluHTM - Semantic Hierarchical Topic Modeling based on CluWords | Felipe Viegas, Washington Cunha, Christian Gomes, Antônio Pereira, Leonardo Rocha and Marcos Goncalves | N/A | N/A |
| Code and Named Entity Recognition in StackOverflow | Jeniya Tabassum, Mounica Maddela, Wei Xu and Alan Ritter | N/A | N/A |
| CompGuessWhat?!: A Multi-task Evaluation Framework for Grounded Language Learning | Alessandro Suglia, Ioannis Konstas, Andrea Vanzo, Emanuele Bastianelli, Desmond Elliott, Stella Frank and Oliver Lemon | N/A | N/A |
| Compositionality and Generalization In Emergent Languages | Rahma Chaabouni, Eugene Kharitonov, Diane Bouchacourt, Emmanuel Dupoux and Marco Baroni | N/A | N/A |
| Conditional Augmentation for Aspect Term Extraction via Masked Sequence-to-Sequence Generation | Kun Li, Chengbo Chen, Xiaojun Quan, Qing Ling and Yan Song | N/A | N/A |
| Connecting Embeddings for Knowledge Graph Entity Typing | Yu Zhao, Anxiang Zhang, Ruobing Xie, Kang Liu and Xiaojie Wang | N/A | N/A |
| Contextualized Weak Supervision for Text Classification | Dheeraj Mekala and Jingbo Shang | N/A | N/A |
| Continual Relation Learning via Episodic Memory Activation and Reconsolidation | Xu Han, Yi Dai, Tianyu Gao, Yankai Lin, Zhiyuan Liu, Peng Li, Maosong Sun and Jie Zhou | N/A | N/A |
| Conversational Graph Grounded Policy Learning for Open-Domain Conversation Generation | Jun Xu, Haifeng Wang, Zheng-Yu Niu, Hua Wu, Wanxiang Che and Ting Liu | N/A | N/A |
| CorefQA: Coreference Resolution as Query-based Span Prediction | Wei Wu, Fei Wang, Arianna Yuan, Fei Wu and Jiwei Li | N/A | N/A |
| Coupling Distant Annotation and Adversarial Training for Cross-Domain Chinese Word Segmentation | Ning Ding, Dingkun Long, Guangwei Xu, Muhua Zhu, Pengjun Xie, Xiaobin Wang and Haitao Zheng | N/A | N/A |
| CraftAssist Instruction Parsing: Semantic Parsing for a Voxel-World Assistant | Kavya Srinet, Yacine Jernite, Jonathan Gray and Arthur Szlam | N/A | N/A |
| Cross-Lingual Semantic Role Labeling with High-Quality Translated Training Corpus | Hao Fei, Meishan Zhang and Donghong Ji | N/A | N/A |
| Cross-Lingual Unsupervised Sentiment Classification with Multi-View Transfer Learning | Hongliang Fei and Ping Li | N/A | N/A |
| Cross-Linguistic Syntactic Evaluation of Word Prediction Models | Aaron Mueller, Garrett Nicolai, Panayiota Petrou-Zeniou, Natalia Talmina and Tal Linzen | N/A | N/A |
| Cross-media Structured Common Space for Multimedia Event Extraction | Manling Li, Alireza Zareian, Qi Zeng, Spencer Whitehead, Di Lu, Heng Ji and Shih-Fu Chang | N/A | N/A |
| Cross-modal Coherence Modeling for Caption Generation | Malihe Alikhani, Piyush Sharma, Shengjie Li, Radu Soricut and Matthew Stone | N/A | N/A |
| Cross-modal Language Generation using Pivot Stabilization for Web-scale Language Coverage | Ashish V. Thapliyal and Radu Soricut | N/A | N/A |
| Cross-Modality Relevance for Reasoning on Language and Vision | Chen Zheng, Quan Guo and Parisa Kordjamshidi | N/A | N/A |
| Curriculum Learning for Natural Language Understanding | Benfeng Xu, Licheng Zhang, Zhendong Mao, Quan Wang, Hongtao Xie and Yongdong Zhang | N/A | N/A |
| Curriculum Pre-training for End-to-End Speech Translation | Chengyi Wang, Yu Wu, Shujie Liu, Ming Zhou and Zhenglu Yang | N/A | N/A |
| Data Manipulation: Towards Effective Instance Learning for Neural Dialogue Generation via Learning to Augment and Reweight | Hengyi Cai, Hongshen Chen, Yonghao Song, Cheng Zhang, Xiaofang Zhao and Dawei Yin | N/A | N/A |
| DeFormer: Decomposing Pre-trained Transformers for Faster Question Answering | Qingqing Cao, Harsh Trivedi, Aruna Balasubramanian and Niranjan Balasubramanian | N/A | N/A |
| Demographics Should Not Be the Reason of Toxicity: Mitigating Discrimination in Text Classifications with Instance Weighting | Guanhua Zhang, Bing Bai, Junqi Zhang, Kun Bai, Conghui Zhu and Tiejun Zhao | N/A | N/A |
| Dense-Caption Matching and Frame-Selection Gating for Temporal Localization in VideoQA | Hyounghun Kim, Zineng Tang and Mohit Bansal | N/A | N/A |
| Dependency Graph Enhanced Dual-transformer Structure for Aspect-based Sentiment Classification | Hao Tang, Donghong Ji, Chenliang Li and Qiji Zhou | N/A | N/A |
| DeSePtion: Dual Sequence Prediction and Adversarial Examples for Improved Fact-Checking | Christopher Hidey, Tuhin Chakrabarty, Tariq Alhindi, Siddharth Varia, Kriste Krstovski, Mona Diab and Smaranda Muresan | N/A | N/A |
| Detecting Perceived Emotions in Hurricane Disasters | Shrey Desai, Cornelia Caragea and Junyi Jessy Li | N/A | N/A |
| Dialogue Coherence Assessment Without Explicit Dialogue Act Labels | Mohsen Mesgar, Sebastian Bücker and Iryna Gurevych | N/A | N/A |
| Dialogue-Based Relation Extraction | Dian Yu, Kai Sun, Claire Cardie and Dong Yu | N/A | N/A |
| Dice Loss for Data-imbalanced NLP Tasks | Xiaoya Li, Xiaofei Sun, Yuxian Meng, Junjun Liang, Fei Wu and Jiwei Li | N/A | N/A |
| Differentiable Window for Dynamic Local Attention | Thanh-Tung Nguyen, Xuan-Phi Nguyen, Shafiq Joty and Xiaoli Li | N/A | N/A |
| Discourse as a Function of Event: Profiling Discourse Structure in News Articles around the Main Event | Prafulla Kumar Choubey, Aaron Lee, Ruihong Huang and Lu Wang | N/A | N/A |
| Discourse-Aware Neural Extractive Text Summarization | Jiacheng Xu, Zhe Gan, Yu Cheng and Jingjing Liu | N/A | N/A |
| Discrete Latent Variable Representations for Low-Resource Text Classification | Shuning Jin, Sam Wiseman, Karl Stratos and Karen Livescu | N/A | N/A |
| Discrete Optimization for Unsupervised Sentence Summarization with Word-Level Extraction | Raphael Schumann, Lili Mou, Yao Lu, Olga Vechtomova and Katja Markert | N/A | N/A |
| Distilling Annotations via Active Imitation Learning | Kianté Brantley, Hal Daumé III and Amr Sharaf | N/A | N/A |
| Distilling Knowledge Learned in BERT for Text Generation | Yen-Chun Chen, Zhe Gan, Yu Cheng, Jingzhou Liu and Jingjing Liu | N/A | N/A |
| Distinguish Confusing Law Articles for Legal Judgment Prediction | Nuo Xu, Pinghui Wang, Long Chen, Li Pan, Xiaoyan Wang and Junzhou Zhao | N/A | N/A |
| Diverse and Informative Dialogue Generation with Context-Specific Commonsense Knowledge Awareness | Sixing Wu, Ying Li, Dawei Zhang, Yang Zhou and Zhonghai Wu | N/A | N/A |
| Diversifying Dialogue Generation with Non-Conversational Text | Hui Su, Xiaoyu Shen, Sanqiang Zhao, Zhou Xiao, Pengwei Hu, Randy Zhong, Cheng Niu and Jie Zhou | N/A | N/A |
| Do Neural Language Models Show Preferences for Syntactic Formalisms? | Artur Kulmizev, Vinit Ravishankar, Mostafa Abdou and Joakim Nivre | N/A | N/A |
| Do Neural Models Learn Systematicity of Monotonicity Inference in Natural Language? | Hitomi Yanaka, Koji Mineshima, Daisuke Bekki and Kentaro Inui | N/A | N/A |
| Document Modeling with Graph Attention Networks for Multi-grained Machine Reading Comprehension | Bo Zheng, Haoyang Wen, Yaobo Liang, Nan Duan, Wanxiang Che, Daxin Jiang, Ming Zhou and Ting Liu | N/A | N/A |
| Document Translation vs. Query Translation for Cross-Lingual Information Retrieval in the Medical Domain | Shadi Saleh and Pavel Pecina | N/A | N/A |
| Document-Level Event Role Filler Extraction using Multi-Granularity Contextualized Encoding | Xinya Du and Claire Cardie | N/A | N/A |
| Don’t Say That! Making Inconsistent Dialogue Unlikely with Unlikelihood Training | Margaret Li, Stephen Roller, Ilia Kulikov, Sean Welleck, Y-Lan Boureau, Kyunghyun Cho and Jason Weston | N/A | N/A |
| Don’t Stop Pretraining: Adapt Language Models to Domains and Tasks | Suchin Gururangan, Ana Marasović, Swabha Swayamdipta, Kyle Lo, Iz Beltagy, Doug Downey and Noah A. Smith | N/A | N/A |
| DoQA - Accessing Domain-Specific FAQs via Conversational QA | Jon Ander Campos, Arantxa Otegi, Aitor Soroa, Jan Deriu, Mark Cieliebak and Eneko Agirre | N/A | N/A |
| Double-Hard Debias: Tailoring Word Embeddings for Gender Bias Mitigation | Tianlu Wang, Xi Victoria Lin, Nazneen Fatema Rajani, Bryan McCann, Vicente Ordonez and Caiming Xiong | N/A | N/A |
| DRTS Parsing with Structure-Aware Encoding and Decoding | Qiankun Fu, Yue Zhang, Jiangming Liu and Meishan Zhang | N/A | N/A |
| DTCA: Decision Tree-based Co-Attention Networks for Explainable Claim Verification | Lianwei Wu, Yuan Rao, Yongqiang Zhao, Hao Liang and Ambreen Nazir | N/A | N/A |
| Dynamic Fusion Network for Multi-Domain End-to-end Task-Oriented Dialog | Libo Qin, Xiao Xu, Wanxiang Che, Yue Zhang and Ting Liu | N/A | N/A |
| Dynamic Online Conversation Recommendation | Xingshan Zeng, Jing Li, Lu Wang, Zhiming Mao and Kam-Fai Wong | N/A | N/A |
| Dynamic Programming Encoding for Subword Segmentation in Neural Machine Translation | Xuanli He, Gholamreza Haffari and Mohammad Norouzi | N/A | N/A |
| ECPE-2D: Emotion-Cause Pair Extraction based on Joint Two-Dimensional Representation, Interaction and Prediction | Zixiang Ding, Rui Xia and Jianfei Yu | N/A | N/A |
| Effective Estimation of Deep Generative Language Models | Tom Pelsmaeker and Wilker Aziz | N/A | N/A |
| Effective Inter-Clause Modeling for End-to-End Emotion-Cause Pair Extraction | Penghui Wei, Jiahao Zhao and Wenji Mao | N/A | N/A |
| Efficient Constituency Parsing by Pointing | Thanh-Tung Nguyen, Xuan-Phi Nguyen, Shafiq Joty and Xiaoli Li | N/A | N/A |
| Efficient Dialogue State Tracking by Selectively Overwriting Memory | Sungdong Kim, Sohee Yang, Gyuwan Kim and Sang-Woo Lee | N/A | N/A |
| Efficient Pairwise Annotation of Argument Quality | Lukas Gienapp, Benno Stein, Matthias Hagen and Martin Potthast | N/A | N/A |
| Efficient Second-Order TreeCRF for Neural Dependency Parsing | Yu Zhang, Zhenghua Li and Min Zhang | N/A | N/A |
| Emergence of Syntax Needs Minimal Supervision | Raphaël Bailly and Kata Gábor | N/A | N/A |
| Emerging Cross-lingual Structure in Pretrained Language Models | Alexis Conneau, Shijie Wu, Haoran Li, Luke Zettlemoyer and Veselin Stoyanov | N/A | N/A |
| Empower Entity Set Expansion via Language Model Probing | Yunyi Zhang, Jiaming Shen, Jingbo Shang and Jiawei Han | N/A | N/A |
| Empowering Active Learning to Jointly Optimize System and User Demands | Ji-Ung Lee, Christian M. Meyer and Iryna Gurevych | N/A | N/A |
| End-to-End Bias Mitigation by Modelling Biases in Corpora | Rabeeh Karimi Mahabadi, Yonatan Belinkov and James Henderson | N/A | N/A |
| End-to-End Neural Pipeline for Goal-Oriented Dialogue Systems using GPT-2 | Donghoon Ham, Jeong-Gwan Lee, Youngsoo Jang and Kee-Eung Kim | N/A | N/A |
| End-to-End Neural Word Alignment Outperforms GIZA++ | Thomas Zenkel, Joern Wuebker and John DeNero | N/A | N/A |
| Enhancing Answer Boundary Detection for Multilingual Machine Reading Comprehension | Fei Yuan, Linjun Shou, Xuanyu Bai, Ming Gong, Yaobo Liang, Nan Duan, Yan Fu and Daxin Jiang | N/A | N/A |
| Enhancing Cross-target Stance Detection with Transferable Semantic-Emotion Knowledge | Bowen Zhang, Min Yang, Xutao Li, Yunming Ye, Xiaofei Xu and Kuai Dai | N/A | N/A |
| ERASER: A Benchmark to Evaluate Rationalized NLP Models | Jay DeYoung, Sarthak Jain, Nazneen Fatema Rajani, Eric Lehman, Caiming Xiong, Richard Socher and Byron C. Wallace | N/A | N/A |
| ESPRIT: Explaining Solutions to Physical Reasoning Tasks | Nazneen Fatema Rajani, Rui Zhang, Yi Chern Tan, Stephan Zheng, Jeremy Weiss, Aadit Vyas, Abhijit Gupta, Caiming Xiong, Richard Socher and Dragomir Radev | N/A | N/A |
| Estimating predictive uncertainty for rumour verification models | Elena Kochkina and Maria Liakata | N/A | N/A |
| Estimating the influence of auxiliary tasks for multi-task learning of sequence tagging tasks | Fynn Schröder and Chris Biemann | N/A | N/A |
| Evaluating and Enhancing the Robustness of Neural Network-based Dependency Parsing Models with Adversarial Examples | Xiaoqing Zheng, Jiehang Zeng, Yi Zhou, Cho-Jui Hsieh, Minhao Cheng and Xuanjing Huang | N/A | N/A |
| Evaluating Explainable AI: Which Algorithmic Explanations Help Users Predict Model Behavior? | Peter Hase and Mohit Bansal | N/A | N/A |
| Evaluating Explanation Methods for Neural Machine Translation | Jierui Li, Lemao Liu, Huayang Li, Guanlin Li, Guoping Huang and Shuming Shi | N/A | N/A |
| Evidence-Aware Inferential Text Generation with Vector Quantised Variational AutoEncoder | Daya Guo, Duyu Tang, Nan Duan, Jian Yin, Daxin Jiang and Ming Zhou | N/A | N/A |
| Exact yet Efficient Graph Parsing, Bi-directional Locality and the Constructivist Hypothesis | Yajie Ye and Weiwei Sun | N/A | N/A |
| Examining Citations of Natural Language Processing Literature | Saif M. Mohammad | N/A | N/A |
| Examining the State-of-the-Art in News Timeline Summarization | Demian Gholipour Ghalandari and Georgiana Ifrim | N/A | N/A |
| Exclusive Hierarchical Decoding for Deep Keyphrase Generation | Wang Chen, Hou Pong Chan, Piji Li and Irwin King | N/A | N/A |
| Expertise Style Transfer: A New Task Towards Better Communication between Experts and Laymen | Yixin Cao, Ruihao Shui, Liangming Pan, Min-Yen Kan, Zhiyuan Liu and Tat-Seng Chua | N/A | N/A |
| Explaining Black Box Predictions and Unveiling Data Artifacts through Influence Functions | Xiaochuang Han, Byron C. Wallace and Yulia Tsvetkov | N/A | N/A |
| Explicit Memory Tracker with Coarse-to-Fine Reasoning for Conversational Machine Reading | Yifan Gao, Chien-Sheng Wu, Shafiq Joty, Caiming Xiong, Richard Socher, Irwin King, Michael Lyu and Steven C.H. Hoi | N/A | N/A |
| Explicit Semantic Decomposition for Definition Generation | Jiahuan Li, Yu Bao, Shujian Huang, Xinyu Dai and Jiajun Chen | N/A | N/A |
| Exploiting Syntactic Structure for Better Language Modeling: A Syntactic Distance Approach | Wenyu Du, Zhouhan Lin, Yikang Shen, Timothy J. O’Donnell, Yoshua Bengio and Yue Zhang | N/A | N/A |
| Exploiting the Syntax-Model Consistency for Neural Relation Extraction | Amir Pouran Ben Veyseh, Franck Dernoncourt, Dejing Dou and Thien Huu Nguyen | N/A | N/A |
| Exploring Contextual Word-level Style Relevance for Unsupervised Style Transfer | Chulun Zhou, Liangyu Chen, Jiachen Liu, Xinyan Xiao, Jinsong Su, Sheng Guo and Hua Wu | N/A | N/A |
| Exploring Unexplored Generalization Challenges for Cross-Database Semantic Parsing | Alane Suhr, Ming-Wei Chang, Peter Shaw and Kenton Lee | N/A | N/A |
| Extracting Headless MWEs from Dependency Parse Trees: Parsing, Tagging, and Joint Modeling Approaches | Tianze Shi and Lillian Lee | N/A | N/A |
| Extractive Summarization as Text Matching | Ming Zhong, Pengfei Liu, Yiran Chen, Danqing Wang, Xipeng Qiu and Xuanjing Huang | N/A | N/A |
| Facet-Aware Evaluation for Extractive Summarization | Yuning Mao, Liyuan Liu, Qi Zhu, Xiang Ren and Jiawei Han | N/A | N/A |
| Fact-based Text Editing | Hayate Iso, Chao Qiao and Hang Li | N/A | N/A |
| Fast and Accurate Deep Bidirectional Language Representations for Unsupervised Learning | Joongbo Shin, Yoonhyung Lee, Seunghyun Yoon and Kyomin Jung | N/A | N/A |
| Fast and Accurate Non-Projective Dependency Tree Linearization | Xiang Yu, Simon Tannert, Ngoc Thang Vu and Jonas Kuhn | N/A | N/A |
| FastBERT: a Self-distilling BERT with Adaptive Inference Time | Weijie Liu, Peng Zhou, Zhiruo Wang, Zhe Zhao, Haotang Deng and Qi Ju | N/A | N/A |
| Feature Projection for Improved Text Classification | Qi Qin, Wenpeng Hu and Bing Liu | N/A | N/A |
| FEQA: A Question Answering Evaluation Framework for Faithfulness Assessment in Abstractive Summarization | Esin Durmus, He He and Mona Diab | N/A | N/A |
| Few-shot Slot Tagging with Collapsed Dependency Transfer and Label-enhanced Task-adaptive Projection Network | Yutai Hou, Wanxiang Che, Yongkui Lai, Zhihan Zhou, Yijia Liu, Han Liu and Ting Liu | N/A | N/A |
| Finding Universal Grammatical Relations in Multilingual BERT | Ethan A. Chi, John Hewitt and Christopher D. Manning | N/A | N/A |
| Fine-Grained Analysis of Cross-Linguistic Syntactic Divergences | Dmitry Nikolaev, Ofir Arviv, Taelin Karidi, Neta Kenneth, Veronika Mitnik, Lilja Maria Saeboe and Omri Abend | N/A | N/A |
| Fine-grained Fact Verification with Kernel Graph Attention Network | Zhenghao Liu, Chenyan Xiong, Maosong Sun and Zhiyuan Liu | N/A | N/A |
| Fine-grained Interest Matching for Neural News Recommendation | Heyuan Wang, Fangzhao Wu, Zheng Liu and Xing Xie | N/A | N/A |
| Fluent Response Generation for Conversational Question Answering | Ashutosh Baheti, Alan Ritter and Kevin Small | N/A | N/A |
| From Arguments to Key Points: Towards Automatic Argument Summarization | Roy Bar-Haim, Lilach Eden, Roni Friedman, Yoav Kantor, Dan Lahav and Noam Slonim | N/A | N/A |
| From English to Code-Switching: Transfer Learning with Strong Morphological Clues | Gustavo Aguilar and Thamar Solorio | N/A | N/A |
| From SPMRL to NMRL: What Did We Learn (and Unlearn) in a Decade of Parsing Morphologically-Rich Languages (MRLs)? | Reut Tsarfaty, Dan Bareket, Stav Klein and Amit Seker | N/A | N/A |
| From Zero to Hero: Human-In-The-Loop Entity Linking in Low Resource Domains | Jan-Christoph Klie, Richard Eckart de Castilho and Iryna Gurevych | N/A | N/A |
| Frugal Paradigm Completion | Alexander Erdmann, Tom Kenter, Markus Becker and Christian Schallhart | N/A | N/A |
| Gated Convolutional Bidirectional Attention-based Model for Off-topic Spoken Response Detection | Yefei Zha, Ruobing Li and Hui Lin | N/A | N/A |
| GCAN: Graph-aware Co-Attention Networks for Explainable Fake News Detection on Social Media | Yi-Ju Lu and Cheng-Te Li | N/A | N/A |
| Gender Bias in Multilingual Embeddings and Cross-Lingual Transfer | Jieyu Zhao, Subhabrata Mukherjee, Saghar Hosseini, Kai-Wei Chang and Ahmed Hassan Awadallah | N/A | N/A |
| Gender Gap in Natural Language Processing Research: Disparities in Authorship and Citations | Saif M. Mohammad | N/A | N/A |
| Gender in Danger? Evaluating Speech Translation Technology on the MuST-SHE Corpus | Luisa Bentivogli, Beatrice Savoldi, Matteo Negri, Mattia A. Di Gangi, Roldano Cattoni and Marco Turchi | N/A | N/A |
| Generalized Entropy Regularization or: There’s Nothing Special about Label Smoothing | Clara Meister, Elizabeth Salesky and Ryan Cotterell | N/A | N/A |
| Generalizing Natural Language Analysis through Span-relation Representations | Zhengbao Jiang, Wei Xu, Jun Araki and Graham Neubig | N/A | N/A |
| Generate, Delete and Rewrite: A Three-Stage Framework for Improving Persona Consistency of Dialogue Generation | Haoyu Song, Yan Wang, Wei-Nan Zhang, Xiaojiang Liu and Ting Liu | N/A | N/A |
| Generating Counter Narratives against Online Hate Speech: Data and Strategies | Serra Sinem Tekiroğlu, Yi-Ling Chung and Marco Guerini | N/A | N/A |
| Generating Diverse and Consistent QA pairs from Contexts with Information-Maximizing Hierarchical Conditional VAEs | Dong Bok Lee, Seanie Lee, Woo Tae Jeong, Donghwan Kim and Sung Ju Hwang | N/A | N/A |
| Generating Fact Checking Explanations | Pepa Atanasova, Jakob Grue Simonsen, Christina Lioma and Isabelle Augenstein | N/A | N/A |
| Generating Hierarchical Explanations on Text Classification via Feature Interaction Detection | Hanjie Chen, Guangtao Zheng and Yangfeng Ji | N/A | N/A |
| Generating Informative Conversational Response using Recurrent Knowledge-Interaction and Knowledge-Copy | Xiexiong Lin, Weiyu Jian, Jianshan He, Taifeng Wang and Wei Chu | N/A | N/A |
| Generative Semantic Hashing Enhanced via Boltzmann Machines | Lin Zheng, Qinliang Su, Dinghan Shen and Changyou Chen | N/A | N/A |
| GLUECoS: An Evaluation Benchmark for Code-Switched NLP | Simran Khanuja, Sandipan Dandapat, Anirudh Srinivasan, Sunayana Sitaram and Monojit Choudhury | N/A | N/A |
| GoEmotions: A Dataset of Fine-Grained Emotions | Dorottya Demszky, Dana Movshovitz-Attias, Jeongwoo Ko, Alan Cowen, Gaurav Nemade and Sujith Ravi | N/A | N/A |
| Good-Enough Compositional Data Augmentation | Jacob Andreas | N/A | N/A |
| Graph Neural News Recommendation with Unsupervised Preference Disentanglement | Linmei Hu, Siyong Xu, Chen Li, Cheng Yang, Chuan Shi, Nan Duan, Xing Xie and Ming Zhou | N/A | N/A |
| Graph-to-Tree Learning for Solving Math Word Problems | Jipeng Zhang, Lei Wang, Roy Ka-Wei Lee, Yi Bin, Yan Wang, Jie Shao and Ee-Peng Lim | N/A | N/A |
| Grounded Conversation Generation as Guided Traverses in Commonsense Knowledge Graphs | Houyu Zhang, Zhenghao Liu, Chenyan Xiong and Zhiyuan Liu | N/A | N/A |
| Grounding Conversations with Improvised Dialogues | Hyundong Cho and Jonathan May | N/A | N/A |
| Guiding Variational Response Generator to Exploit Persona | Bowen Wu, Mengyuan Li, Zongsheng Wang, Yifu Chen, Derek F. Wong, Qihang Feng, Junhong Huang and Baoxun Wang | N/A | N/A |
| Handling Rare Entities for Neural Sequence Labeling | Yangming Li, Han Li, Kaisheng Yao and Xiaolong Li | N/A | N/A |
| Hard-Coded Gaussian Attention for Neural Machine Translation | Weiqiu You, Simeng Sun and Mohit Iyyer | N/A | N/A |
| Harnessing the linguistic signal to predict scalar inferences | Sebastian Schuster, Yuxing Chen and Judith Degen | N/A | N/A |
| Harvesting and Refining Question-Answer Pairs for Unsupervised QA | Zhongli Li, Wenhui Wang, Li Dong, Furu Wei and Ke Xu | N/A | N/A |
| HAT: Hardware-Aware Transformers for Efficient Natural Language Processing | Hanrui Wang, Zhanghao Wu, Zhijian Liu, Han Cai, Ligeng Zhu, Chuang Gan and Song Han | N/A | N/A |
| He said “who’s gonna take care of your children when you are at ACL?”: Reported Sexist Acts are Not Sexist | Patricia Chiril, Véronique Moriceau, Farah Benamara, Alda Mari, Gloria Origgi and Marlène Coulomb-Gully | N/A | N/A |
| Heterogeneous Graph Neural Networks for Extractive Document Summarization | Danqing Wang, Pengfei Liu, Yining Zheng, Xipeng Qiu and Xuanjing Huang | N/A | N/A |
| Heterogeneous Graph Transformer for Graph-to-Sequence Learning | Shaowei Yao, Tianming Wang and Xiaojun Wan | N/A | N/A |
| Hierarchical Entity Typing via Multi-level Learning to Rank | Tongfei Chen, Yunmo Chen and Benjamin Van Durme | N/A | N/A |
| Hierarchical Modeling for User Personality Prediction: The Role of Message-Level Attention | Veronica Lynn, Niranjan Balasubramanian and H. Andrew Schwartz | N/A | N/A |
| Hierarchy-Aware Global Model for Hierarchical Text Classification | Jie Zhou, Chunping Ma, Dingkun Long, Guangwei Xu, Ning Ding, Haoyu Zhang, Pengjun Xie and Gongshen Liu | N/A | N/A |
| Highway Transformer: Self-Gating Enhanced Self-Attentive Networks | Yekun Chai, Shuo Jin and Xinwen Hou | N/A | N/A |
| Hiring Now: A Skill-Aware Multi-Attention Model for Job Posting Generation | Liting Liu, Jie Liu, Wenzheng Zhang, Ziming Chi, Wenxuan Shi and Yalou Huang | N/A | N/A |
| History for Visual Dialog: Do we really need it? | Shubham Agarwal, Trung Bui, Joon-Young Lee, Ioannis Konstas and Verena Rieser | N/A | N/A |
| Hooks in the Headline: Learning to Generate Headlines with Controlled Styles | Di Jin, Zhijing Jin, Joey Tianyi Zhou, Lisa Orii and Peter Szolovits | N/A | N/A |
| How Accents Confound: Probing for Accent Information in End-to-End Speech Recognition Systems | Archiki Prasad and Preethi Jyothi | N/A | N/A |
| How does BERT’s attention change when you fine-tune? An analysis methodology and a case study in negation scope | Yiyun Zhao and Steven Bethard | N/A | N/A |
| How Does NLP Benefit Legal System: A Summary of Legal Artificial Intelligence | Haoxi Zhong, Chaojun Xiao, Cunchao Tu, Tianyang Zhang, Zhiyuan Liu and Maosong Sun | N/A | N/A |
| How Does Selective Mechanism Improve Self-Attention Networks? | Xinwei Geng, Longyue Wang, Xing Wang, Bing Qin, Ting Liu and Zhaopeng Tu | N/A | N/A |
| How to Ask Good Questions? Try to Leverage Paraphrases | Xin Jia, Wenjie Zhou, Xu Sun and Yunfang Wu | N/A | N/A |
| Human Attention Maps for Text Classification: Do Humans and Neural Networks Focus on the Same Words? | Cansu Sen, Thomas Hartvigsen, Biao Yin, Xiangnan Kong and Elke Rundensteiner | N/A | N/A |
| Hyperbolic Capsule Networks for Multi-Label Classification | Boli Chen, Xin Huang, Lin Xiao and Liping Jing | N/A | N/A |
| HyperCore: Hyperbolic and Co-graph Representation for Automatic ICD Coding | Pengfei Cao, Yubo Chen, Kang Liu, Jun Zhao, Shengping Liu and Weifeng Chong | N/A | N/A |
| Image-Chat: Engaging Grounded Conversations | Kurt Shuster, Samuel Humeau, Antoine Bordes and Jason Weston | N/A | N/A |
| IMoJIE: Iterative Memory-Based Joint Open Information Extraction | Keshav Kolluru, Samarth Aggarwal, Vipul Rathore, Mausam and Soumen Chakrabarti | N/A | N/A |
| Improved Natural Language Generation via Loss Truncation | Daniel Kang and Tatsunori Hashimoto | N/A | N/A |
| Improving Adversarial Text Generation by Modeling the Distant Future | Ruiyi Zhang, Changyou Chen, Zhe Gan, Wenlin Wang, Dinghan Shen, Guoyin Wang, Zheng Wen and Lawrence Carin | N/A | N/A |
| Improving Chinese Word Segmentation with Wordhood Memory Networks | Yuanhe Tian, Yan Song, Fei Xia, Tong Zhang and Yonggang Wang | N/A | N/A |
| Improving Disentangled Text Representation Learning with Information-Theoretic Guidance | Pengyu Cheng, Martin Renqiang Min, Dinghan Shen, Christopher Malon, Yizhe Zhang, Yitong Li and Lawrence Carin | N/A | N/A |
| Improving Disfluency Detection by Self-Training a Self-Attentive Model | Paria Jamshid Lou and Mark Johnson | N/A | N/A |
| Improving Event Detection via Open-domain Trigger Knowledge | Meihan Tong, Bin Xu, Shuai Wang, Yixin Cao, Lei Hou, Juanzi Li and Jun Xie | N/A | N/A |
| Improving Image Captioning Evaluation by Considering Inter References Variance | Yanzhi Yi, Hangyu Deng and Jinglu Hu | N/A | N/A |
| Improving Image Captioning with Better Use of Caption | Zhan Shi, Xu Zhou, Xipeng Qiu and Xiaodan Zhu | N/A | N/A |
| Improving Massively Multilingual Neural Machine Translation and Zero-Shot Translation | Biao Zhang, Philip Williams, Ivan Titov and Rico Sennrich | N/A | N/A |
| Improving Multi-hop Question Answering over Knowledge Graphs using Knowledge Base Embeddings | Apoorv Saxena, Aditay Tripathi and Partha Talukdar | N/A | N/A |
| Improving Multimodal Named Entity Recognition via Entity Span Detection with Unified Multimodal Transformer | Jianfei Yu, Jing Jiang, Li Yang and Rui Xia | N/A | N/A |
| Improving Neural Machine Translation with Soft Template Prediction | Jian Yang, Shuming Ma, Dongdong Zhang, Zhoujun Li and Ming Zhou | N/A | N/A |
| Improving Segmentation for Technical Support Problems | Kushal Chauhan and Abhirut Gupta | N/A | N/A |
| Improving Transformer Models by Reordering their Sublayers | Ofir Press, Noah A. Smith and Omer Levy | N/A | N/A |
| Improving Truthfulness of Headline Generation | Kazuki Matsumaru, Sho Takase and Naoaki Okazaki | N/A | N/A |
| In Layman’s Terms: Semi-Open Relation Extraction from Scientific Texts | Ruben Kruiper, Julian Vincent, Jessica Chen-Burger, Marc Desmulliez and Ioannis Konstas | N/A | N/A |
| In Neural Machine Translation, What Does Transfer Learning Transfer? | Alham Fikri Aji, Nikolay Bogoychev, Kenneth Heafield and Rico Sennrich | N/A | N/A |
| Inflecting when there’s no majority: Limitations of encoder-decoder neural networks as cognitive models for German plurals | Kate McCurdy, Sharon Goldwater and Adam Lopez | N/A | N/A |
| Influence Paths for Characterizing Subject-Verb Number Agreement in LSTM Language Models | Kaiji Lu, Piotr Mardziel, Klas Leino, Matt Fredrikson and Anupam Datta | N/A | N/A |
| Information-Theoretic Probing for Linguistic Structure | Tiago Pimentel, Josef Valvoda, Rowan Hall Maudslay, Ran Zmigrod, Adina Williams and Ryan Cotterell | N/A | N/A |
| INFOTABS: Inference on Tables as Semi-structured Data | Vivek Gupta, Maitrey Mehta, Pegah Nokhiz and Vivek Srikumar | N/A | N/A |
| Injecting Numerical Reasoning Skills into Language Models | Mor Geva, Ankit Gupta and Jonathan Berant | N/A | N/A |
| INSET: Sentence Infilling with INter-SEntential Transformer | Yichen Huang, Yizhe Zhang, Oussama Elachqar and Yu Cheng | N/A | N/A |
| Integrating Multimodal Information in Large Pretrained Transformers | Wasifur Rahman, Md Kamrul Hasan, Sangwu Lee, AmirAli Bagher Zadeh, Chengfeng Mao, Louis-Philippe Morency and Ehsan Hoque | N/A | N/A |
| Integrating Semantic and Structural Information with Graph Convolutional Network for Controversy Detection | Lei Zhong, Juan Cao, Qiang Sheng, Junbo Guo and Ziang Wang | N/A | N/A |
| Interactive Classification by Asking Informative Questions | Lili Yu, Howard Chen, Sida I. Wang, Tao Lei and Yoav Artzi | N/A | N/A |
| Interactive Construction of User-Centric Dictionary for Text Analytics | Ryosuke Kohita, Issei Yoshida, Hiroshi Kanayama and Tetsuya Nasukawa | N/A | N/A |
| Interactive Machine Comprehension with Information Seeking Agents | Xingdi Yuan, Jie Fu, Marc-Alexandre Côté, Yi Tay, Chris Pal and Adam Trischler | N/A | N/A |
| Intermediate-Task Transfer Learning with Pretrained Language Models: When and Why Does It Work? | Yada Pruksachatkun, Jason Phang, Haokun Liu, Phu Mon Htut, Xiaoyi Zhang, Richard Yuanzhe Pang, Clara Vania, Katharina Kann and Samuel R. Bowman | N/A | N/A |
| Interpreting Pretrained Contextualized Representations via Reductions to Static Embeddings | Rishi Bommasani, Kelly Davis and Claire Cardie | N/A | N/A |
| Investigating the effect of auxiliary objectives for the automated grading of learner English speech transcriptions | Hannah Craighead, Andrew Caines, Paula Buttery and Helen Yannakoudakis | N/A | N/A |
| Investigating Word-Class Distributions in Word Vector Spaces | Ryohei Sasano and Anna Korhonen | N/A | N/A |
| iSarcasm: A Dataset of Intended Sarcasm | Silviu Oprea and Walid Magdy | N/A | N/A |
| It Takes Two to Lie: One to Lie, and One to Listen | Denis Peskov, Benny Cheng, Ahmed Elgohary, Joe Barrow, Cristian Danescu-Niculescu-Mizil and Jordan Boyd-Graber | N/A | N/A |
| It’s Morphin’ Time! Combating Linguistic Discrimination with Inflectional Perturbations | Samson Tan, Shafiq Joty, Min-Yen Kan and Richard Socher | N/A | N/A |
| Iterative Edit-Based Unsupervised Sentence Simplification | Dhruv Kumar, Lili Mou, Lukasz Golab and Olga Vechtomova | N/A | N/A |
| Joint Chinese Word Segmentation and Part-of-speech Tagging via Two-way Attentions of Auto-analyzed Knowledge | Yuanhe Tian, Yan Song, Xiang Ao, Fei Xia, Xiaojun Quan, Tong Zhang and Yonggang Wang | N/A | N/A |
| Joint Diacritization, Lemmatization, Normalization, and Fine-Grained Morphological Tagging | Nasser Zalmout and Nizar Habash | N/A | N/A |
| Joint Modelling of Emotion and Abusive Language Detection | Santhosh Rajamanickam, Pushkar Mishra, Helen Yannakoudakis and Ekaterina Shutova | N/A | N/A |
| Jointly Learning to Align and Summarize for Neural Cross-Lingual Summarization | Yue Cao, Hui Liu and Xiaojun Wan | N/A | N/A |
| Jointly Masked Sequence-to-Sequence Model for Non-Autoregressive Neural Machine Translation | Junliang Guo, Linli Xu and Enhong Chen | N/A | N/A |
| KdConv: A Chinese Multi-domain Dialogue Dataset Towards Multi-turn Knowledge-driven Conversation | Hao Zhou, Chujie Zheng, Kaili Huang, Minlie Huang and Xiaoyan Zhu | N/A | N/A |
| KinGDOM: Knowledge-Guided DOMain adaptation for sentiment analysis | Deepanway Ghosal, Devamanyu Hazarika, Abhinaba Roy, Navonil Majumder, Rada Mihalcea and Soujanya Poria | N/A | N/A |
| KLEJ: Comprehensive Benchmark for Polish Language Understanding | Piotr Rybak, Robert Mroczkowski, Janusz Tracz and Ireneusz Gawlik | N/A | N/A |
| Knowledge Distillation for Multilingual Unsupervised Neural Machine Translation | Haipeng Sun, Rui Wang, Kehai Chen, Masao Utiyama, Eiichiro Sumita and Tiejun Zhao | N/A | N/A |
| Knowledge Graph Embedding Compression | Mrinmaya Sachan | N/A | N/A |
| Knowledge Graph-Augmented Abstractive Summarization with Semantic-Driven Cloze Reward | Luyang Huang, Lingfei Wu and Lu Wang | N/A | N/A |
| Language (Re)modelling: Towards Embodied Language Understanding | Ronen Tamari, Chen Shani, Tom Hope, Miriam R L Petruck, Omri Abend and Dafna Shahaf | N/A | N/A |
| Language (technology) is power: The need to be explicit about NLP harms | Su Lin Blodgett, Solon Barocas, Hal Daumé III and Hanna Wallach | N/A | N/A |
| Language Models as an Alternative Evaluator of Word Order Hypotheses: A Case Study in Japanese | Tatsuki Kuribayashi, Takumi Ito, Jun Suzuki and Kentaro Inui | N/A | N/A |
| Language to Network: Conditional Parameter Adaptation with Natural Language Descriptions | Tian Jin, Zhun Liu, Shengjia Yan, Alexandre Eichenberger and Louis-Philippe Morency | N/A | N/A |
| Large Scale Multi-Actor Generative Dialog Modeling | Alex Boyd, Raul Puri, Mohammad Shoeybi, Mostofa Patwary and Bryan Catanzaro | N/A | N/A |
| Learning a Multi-Domain Curriculum for Neural Machine Translation | Wei Wang, Ye Tian, Jiquan Ngiam, Yinfei Yang, Isaac Caswell and Zarana Parekh | N/A | N/A |
| Learning and Evaluating Emotion Lexicons for 91 Languages | Sven Buechel, Susanna Rücker and Udo Hahn | N/A | N/A |
| Learning Architectures from an Extended Search Space for Language Modeling | Yinqiao Li, Chi Hu, Yuhao Zhang, Nuo Xu, Yufan Jiang, Tong Xiao, Jingbo Zhu, Tongran Liu and Changliang Li | N/A | N/A |
| Learning Constraints for Structured Prediction Using Rectifier Networks | Xingyuan Pan, Maitrey Mehta and Vivek Srikumar | N/A | N/A |
| Learning Dialog Policies from Weak Demonstrations | Gabriel Gordon-Hall, Philip John Gorinski and Shay B. Cohen | N/A | N/A |
| Learning Efficient Dialogue Policy from Demonstrations through Shaping | Huimin Wang, Baolin Peng and Kam-Fai Wong | N/A | N/A |
| Learning Interpretable Relationships between Entities, Relations and Concepts via Bayesian Structure Learning on Open Domain Facts | Jingyuan Zhang, Mingming Sun, Yue Feng and Ping Li | N/A | N/A |
| Learning Source Phrase Representations for Neural Machine Translation | Hongfei Xu, Josef van Genabith, Deyi Xiong, Qiuhui Liu and Jingyi Zhang | N/A | N/A |
| Learning to Ask More: Semi-Autoregressive Sequential Question Generation under Dual-Graph Interaction | Zi Chai and Xiaojun Wan | N/A | N/A |
| Learning to Contextually Aggregate Multi-Source Supervision for Sequence Labeling | Ouyu Lan, Xiao Huang, Bill Yuchen Lin, He Jiang, Liyuan Liu and Xiang Ren | N/A | N/A |
| Learning to Customize Model Structures for Few-shot Dialogue Generation Tasks | Yiping Song, Zequn Liu, Wei Bi, Rui Yan and Ming Zhang | N/A | N/A |
| Learning to Deceive with Attention-Based Explanations | Danish Pruthi, Mansi Gupta, Bhuwan Dhingra, Graham Neubig and Zachary C. Lipton | N/A | N/A |
| Learning to execute instructions in a Minecraft dialogue | Prashant Jayannavar, Anjali Narayan-Chen and Julia Hockenmaier | N/A | N/A |
| Learning to Faithfully Rationalize by Construction | Sarthak Jain, Sarah Wiegreffe, Yuval Pinter and Byron C. Wallace | N/A | N/A |
| Learning to Identify Follow-Up Questions in Conversational Question Answering | Souvik Kundu, Qian Lin and Hwee Tou Ng | N/A | N/A |
| Learning to Recover from Multi-Modality Errors for Non-Autoregressive Neural Machine Translation | Qiu Ran, Yankai Lin, Peng Li and Jie Zhou | N/A | N/A |
| Learning to Segment Actions from Observation and Narration | Daniel Fried, Jean-Baptiste Alayrac, Phil Blunsom, Chris Dyer, Stephen Clark and Aida Nematzadeh | N/A | N/A |
| Learning to Update Natural Language Comments Based on Code Changes | Sheena Panthaplackel, Pengyu Nie, Milos Gligoric, Junyi Jessy Li and Raymond Mooney | N/A | N/A |
| Learning Web-based Procedures by Reasoning over Explanations and Demonstrations in Context | Shashank Srivastava, Oleksandr Polozov, Nebojsa Jojic and Christopher Meek | N/A | N/A |
| Leveraging Graph to Improve Abstractive Multi-Document Summarization | Wei Li, Xinyan Xiao, Jiachen Liu, Hua Wu, Haifeng Wang and Junping Du | N/A | N/A |
| Line Graph Enhanced AMR-to-Text Generation with Mix-Order Graph Attention Networks | Yanbin Zhao, Lu Chen, Zhi Chen, Ruisheng Cao, Su Zhu and Kai Yu | N/A | N/A |
| Location Attention for Extrapolation to Longer Sequences | Yann Dubois, Gautier Dagan, Dieuwke Hupkes and Elia Bruni | N/A | N/A |
| Logical Natural Language Generation from Open-Domain Tables | Wenhu Chen, Jianshu Chen, Yu Su, Zhiyu Chen and William Yang Wang | N/A | N/A |
| LogicalFactChecker: Leveraging Logical Operations for Fact Checking with Graph Module Network | Wanjun Zhong, Duyu Tang, Zhangyin Feng, Nan Duan, Ming Zhou, Ming Gong, Linjun Shou, Daxin Jiang, Jiahai Wang and Jian Yin | N/A | N/A |
| Low-Dimensional Hyperbolic Knowledge Graph Embeddings | Ines Chami, Adva Wolf, Da-Cheng Juan, Frederic Sala, Sujith Ravi and Christopher Ré | N/A | N/A |
| Low-Resource Generation of Multi-hop Reasoning Questions | Jianxing Yu, Wei Liu, Shuang Qiu, Qinliang Su, Kai Wang, Xiaojun Quan and Jian Yin | N/A | N/A |
| Machine Reading of Historical Events | Or Honovich, Lucas Torroba Hennigen, Omri Abend and Shay B. Cohen | N/A | N/A |
| Mapping Natural Language Instructions to Mobile UI Action Sequences | Yang Li, Jiacong He, Xin Zhou, Yuan Zhang and Jason Baldridge | N/A | N/A |
| MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning | Jie Lei, Liwei Wang, Yelong Shen, Dong Yu, Tamara Berg and Mohit Bansal | N/A | N/A |
| Masked Language Model Scoring | Julian Salazar, Davis Liang, Toan Q. Nguyen and Katrin Kirchhoff | N/A | N/A |
| MATINF: A Jointly Labeled Large-Scale Dataset for Classification, Question Answering and Summarization | Canwen Xu, Jiaxin Pei, Hongtao Wu, Yiyu Liu and Chenliang Li | N/A | N/A |
| Max-Margin Incremental CCG Parsing | Miloš Stanojević and Mark Steedman | N/A | N/A |
| Measuring Forecasting Skill from Text | Shi Zong, Alan Ritter and Eduard Hovy | N/A | N/A |
| Meta-Reinforced Multi-Domain State Generator for Dialogue Systems | Yi Huang, Junlan Feng, Min Hu, Xiaoting Wu, Xiaoyu Du and Shuo Ma | N/A | N/A |
| MIE: A Medical Information Extractor towards Medical Dialogues | Yuanzhe Zhang, Zhongtao Jiang, Tao Zhang, Shiwan Liu, Jiarun Cao, Kang Liu, Shengping Liu and Jun Zhao | N/A | N/A |
| Mind the Trade-off: Debiasing NLU Models without Degrading the In-distribution Performance | Prasetya Ajie Utama, Nafise Sadat Moosavi and Iryna Gurevych | N/A | N/A |
| MIND: A Large-scale Dataset for News Recommendation | Fangzhao Wu, Ying Qiao, Jiun-Hung Chen, Chuhan Wu, Tao Qi, Jianxun Lian, Danyang Liu, Xing Xie, Jianfeng Gao, Winnie Wu and Ming Zhou | N/A | N/A |
| MixText: Linguistically-Informed Interpolation of Hidden Space for Semi-Supervised Text Classification | Jiaao Chen, Zichao Yang and Diyi Yang | N/A | N/A |
| MLQA: Evaluating Cross-lingual Extractive Question Answering | Patrick Lewis, Barlas Oguz, Ruty Rinott, Sebastian Riedel and Holger Schwenk | N/A | N/A |
| MMPE: A Multi-Modal Interface for Post-Editing Machine Translation | Nico Herbig, Tim Düwel, Santanu Pal, Kalliopi Maria Meladaki, Mahsa Monshizadeh, Antonio Krüger and Josef van Genabith | N/A | N/A |
| MobileBERT: a Compact Task-Agnostic BERT for Resource-Limited Devices | Zhiqing Sun, Hongkun Yu, Xiaodan Song, Renjie Liu, Yiming Yang and Denny Zhou | N/A | N/A |
| Modeling Code-Switch Languages Using Bilingual Parallel Corpus | Grandee Lee and Haizhou Li | N/A | N/A |
| Modeling Morphological Typology for Unsupervised Learning of Language Morphology | Hongzhi Xu, Jordan Kodner, Mitchell Marcus and Charles Yang | N/A | N/A |
| Modelling Context and Syntactical Features for Aspect-based Sentiment Analysis | Minh Hieu Phan and Philip O. Ogunbona | N/A | N/A |
| More Diverse Dialogue Datasets via Diversity-Informed Data Collection | Katherine Stasaski, Grace Hui Yang and Marti A. Hearst | N/A | N/A |
| Moving Down the Long Tail of Word Sense Disambiguation with Gloss Informed Bi-encoders | Terra Blevins and Luke Zettlemoyer | N/A | N/A |
| Multi-agent Communication meets Natural Language: Synergies between Functional and Structural Language Learning | Angeliki Lazaridou, Anna Potapenko and Olivier Tieleman | N/A | N/A |
| Multi-Agent Task-Oriented Dialog Policy Learning with Role-Aware Reward Decomposition | Ryuichi Takanobu, Runze Liang and Minlie Huang | N/A | N/A |
| Multi-Cell Compositional LSTM for NER Domain Adaptation | Chen Jia and Yue Zhang | N/A | N/A |
| Multidirectional Associative Optimization of Function-Specific Word Representations | Daniela Gerz, Ivan Vulić, Marek Rei, Roi Reichart and Anna Korhonen | N/A | N/A |
| Multi-Domain Dialogue Acts and Response Co-Generation | Kai Wang, Junfeng Tian, Rui Wang, Xiaojun Quan and Jianxing Yu | N/A | N/A |
| Multi-Domain Named Entity Recognition with Genre-Aware and Agnostic Inference | Jing Wang, Mayank Kulkarni and Daniel Preotiuc-Pietro | N/A | N/A |
| Multi-Domain Neural Machine Translation with Word-Level Adaptive Layer-wise Domain Mixing | Haoming Jiang, Chen Liang, Chong Wang and Tuo Zhao | N/A | N/A |
| Multi-Granularity Interaction Network for Extractive and Abstractive Multi-Document Summarization | Hanqi Jin, Tianming Wang and Xiaojun Wan | N/A | N/A |
| Multi-Hypothesis Machine Translation Evaluation | Marina Fomicheva, Lucia Specia and Francisco Guzmán | N/A | N/A |
| Multi-Label and Multilingual News Framing Analysis | Afra Feyza Akyürek, Lei Guo, Randa Elanwar, Prakash Ishwar, Margrit Betke and Derry Tanti Wijaya | N/A | N/A |
| Multimodal Neural Graph Memory Networks for Visual Question Answering | Mahmoud Khademi | N/A | N/A |
| MultiQT: Multimodal learning for real-time question tracking in speech | Jakob D. Havtorn, Jan Latko, Joakim Edin, Lars Maaløe, Lasse Borgholt, Lorenzo Belgrano, Nicolai Jacobsen, Regitze Sdun and Željko Agić | N/A | N/A |
| Multiscale Collaborative Deep Models for Neural Machine Translation | Xiangpeng Wei, Heng Yu, Yue Hu, Yue Zhang, Rongxiang Weng and Weihua Luo | N/A | N/A |
| Multi-Sentence Argument Linking | Seth Ebner, Patrick Xia, Ryan Culkin, Kyle Rawlins and Benjamin Van Durme | N/A | N/A |
| Multi-source Meta Transfer for Low Resource Multiple-Choice Question Answering | Ming Yan, Hao Zhang, Di Jin and Joey Tianyi Zhou | N/A | N/A |
| MuTual: A Dataset for Multi-Turn Dialogue Reasoning | Leyang Cui, Yu Wu, Shujie Liu, Yue Zhang and Ming Zhou | N/A | N/A |
| Named Entity Recognition without Labelled Data: A Weak Supervision Approach | Pierre Lison, Jeremy Barnes, Aliaksandr Hubin and Samia Touileb | N/A | N/A |
| NAT: Noise-Aware Training for Robust Neural Sequence Labeling | Marcin Namysl, Sven Behnke and Joachim Köhler | N/A | N/A |
| Negative Training for Neural Dialogue Response Generation | Tianxing He and James Glass | N/A | N/A |
| Neighborhood Matching Network for Entity Alignment | Yuting Wu, Xiao Liu, Yansong Feng, Zheng Wang and Dongyan Zhao | N/A | N/A |
| NeuInfer: Knowledge Inference on N-ary Facts | Saiping Guan, Xiaolong Jin, Jiafeng Guo, Yuanzhuo Wang and Xueqi Cheng | N/A | N/A |
| Neural CRF Model for Sentence Alignment in Text Simplification | Chao Jiang, Mounica Maddela, Wuwei Lan, Yang Zhong and Wei Xu | N/A | N/A |
| Neural Data-to-Text Generation via Jointly Learning the Segmentation and Correspondence | Xiaoyu Shen, Ernie Chang, Hui Su, Cheng Niu and Dietrich Klakow | N/A | N/A |
| Neural Generation of Dialogue Response Timings | Matthew Roddy and Naomi Harte | N/A | N/A |
| Neural Mixed Counting Models for Dispersed Topic Discovery | Jiemin Wu, Yanghui Rao, Zusheng Zhang, Haoran Xie, Qing Li, Fu Lee Wang and Ziye Chen | N/A | N/A |
| Neural Reranking for Dependency Parsing: An Evaluation | Bich-Ngoc Do and Ines Rehbein | N/A | N/A |
| Neural Syntactic Preordering for Controlled Paraphrase Generation | Tanya Goyal and Greg Durrett | N/A | N/A |
| Neural Topic Modeling with Bidirectional Adversarial Training | Rui Wang, Xuemeng Hu, Deyu Zhou, Yulan He, Yuxuan Xiong, Chenchen Ye and Haiyang Xu | N/A | N/A |
| NILE: Natural Language Inference with Faithful Natural Language Explanations | Sawan Kumar and Partha Talukdar | N/A | N/A |
| Norm-Based Curriculum Learning for Neural Machine Translation | Xuebo Liu, Houtim Lai, Derek F. Wong and Lidia S. Chao | N/A | N/A |
| Not All Claims are Created Equal: Choosing the Right Statistical Approach to Assess Hypotheses | Erfan Sadeqi Azer, Daniel Khashabi, Ashish Sabharwal and Dan Roth | N/A | N/A |
| Null It Out: Guarding Protected Attributes by Iterative Nullspace Projection | Shauli Ravfogel, Yanai Elazar, Hila Gonen, Michael Twiton and Yoav Goldberg | N/A | N/A |
| Obtaining Faithful Interpretations from Compositional Neural Networks | Sanjay Subramanian, Ben Bogin, Nitish Gupta, Tomer Wolfson, Sameer Singh, Jonathan Berant and Matt Gardner | N/A | N/A |
| On Faithfulness and Factuality in Abstractive Summarization | Joshua Maynez, Shashi Narayan, Bernd Bohnet and Ryan McDonald | N/A | N/A |
| On the Cross-lingual Transferability of Monolingual Representations | Mikel Artetxe, Sebastian Ruder and Dani Yogatama | N/A | N/A |
| On the Encoder-Decoder Incompatibility in Variational Text Modeling and Beyond | Chen Wu, Prince Zizhuang Wang and William Yang Wang | N/A | N/A |
| On The Evaluation of Machine Translation Systems Trained With Back-Translation | Sergey Edunov, Myle Ott, Marc’Aurelio Ranzato and Michael Auli | N/A | N/A |
| On the Inference Calibration of Neural Machine Translation | Shuo Wang, Zhaopeng Tu, Shuming Shi and Yang Liu | N/A | N/A |
| On the Limitations of Cross-lingual Encoders as Exposed by Reference-Free Machine Translation Evaluation | Wei Zhao, Goran Glavaš, Maxime Peyrard, Yang Gao, Robert West and Steffen Eger | N/A | N/A |
| On the Robustness of Language Encoders against Grammatical Errors | Fan Yin, Quanyu Long, Tao Meng and Kai-Wei Chang | N/A | N/A |
| One Size Does Not Fit All: Generating and Evaluating Variable Number of Keyphrases | Xingdi Yuan, Tong Wang, Rui Meng, Khushboo Thaker, Peter Brusilovsky, Daqing He and Adam Trischler | N/A | N/A |
| Optimizing the Factual Correctness of a Summary: A Study of Summarizing Radiology Reports | Yuhao Zhang, Derek Merck, Emily Tsai, Christopher D. Manning and Curtis Langlotz | N/A | N/A |
| Orthogonal Relation Transforms with Graph Context Modeling for Knowledge Graph Embedding | Yun Tang, Jing Huang, Guangtao Wang, Xiaodong He and Bowen Zhou | N/A | N/A |
| Out of the Echo Chamber: Detecting Countering Debate Speeches | Matan Orbach, Yonatan Bilu, Assaf Toledo, Dan Lahav, Michal Jacovi, Ranit Aharonov and Noam Slonim | N/A | N/A |
| ParaCrawl: Web-Scale Acquisition of Parallel Corpora | Marta Bañón, Pinzhen Chen, Barry Haddow, Kenneth Heafield, Hieu Hoang, Miquel Esplà-Gomis, Mikel L. Forcada, Amir Kamran, Faheem Kirefu, Philipp Koehn, Sergio Ortiz Rojas, Leopoldo Pla Sempere, Gema Ramírez-Sánchez, Elsa Sarrías, Marek Strelec, Brian Thompson, William Waites, Dion Wiggins and Jaume Zaragoza | N/A | N/A |
| Parallel Corpus Filtering via Pre-trained Language Models | Boliang Zhang, Ajay Nagesh and Kevin Knight | N/A | N/A |
| Paraphrase Augmented Task-Oriented Dialog Generation | Silin Gao, Yichi Zhang, Zhijian Ou and Zhou Yu | N/A | N/A |
| Paraphrase Generation by Learning How to Edit from Samples | Amirhossein Kazemnejad, Mohammadreza Salehi and Mahdieh Soleymani Baghshah | N/A | N/A |
| Parsing into Variable-in-situ Logico-Semantic Graphs | Yufei Chen and Weiwei Sun | N/A | N/A |
| Perturbed Masking: Parameter-free Probing for Analyzing and Interpreting BERT | Zhiyong Wu, Yun Chen, Ben Kao and Qun Liu | N/A | N/A |
| PeTra: A Sparsely Supervised Memory Model for People Tracking | Shubham Toshniwal, Allyson Ettinger, Kevin Gimpel and Karen Livescu | N/A | N/A |
| Phone Features Improve Speech Translation | Elizabeth Salesky and Alan W Black | N/A | N/A |
| Phonetic and Visual Priors for Decipherment of Informal Romanization | Maria Ryskina, Matthew R. Gormley and Taylor Berg-Kirkpatrick | N/A | N/A |
| PLATO: Pre-trained Dialogue Generation Model with Discrete Latent Variable | Siqi Bao, Huang He, Fan Wang, Hua Wu and Haifeng Wang | N/A | N/A |
| Politeness Transfer: A Tag and Generate Approach | Aman Madaan, Amrith Setlur, Tanmay Parekh, Barnabas Poczos, Graham Neubig, Yiming Yang, Ruslan Salakhutdinov, Alan W Black and Shrimai Prabhumoye | N/A | N/A |
| Posterior Control of Blackbox Generation | Xiang Lisa Li and Alexander Rush | N/A | N/A |
| Predicting Declension Class from Form and Meaning | Adina Williams, Tiago Pimentel, Arya D. McCarthy, Hagen Blix, Eleanor Chodroff and Ryan Cotterell | N/A | N/A |
| Predicting Depression in Screening Interviews from Latent Categorization of Interview Prompts | Alex Rinaldi, Jean Fox Tree and Snigdha Chaturvedi | N/A | N/A |
| Predicting Performance for Natural Language Processing Tasks | Mengzhou Xia, Antonios Anastasopoulos, Ruochen Xu, Yiming Yang and Graham Neubig | N/A | N/A |
| Predicting the Focus of Negation: Model and Error Analysis | Md Mosharaf Hossain, Kathleen Hamilton, Alexis Palmer and Eduardo Blanco | N/A | N/A |
| Predicting the Growth of Morphological Families from Social and Linguistic Factors | Valentin Hofmann, Janet Pierrehumbert and Hinrich Schütze | N/A | N/A |
| Predicting the Topical Stance and Political Leaning of Media using Tweets | Peter Stefanov, Kareem Darwish, Atanas Atanasov and Preslav Nakov | N/A | N/A |
| Predictive Biases in Natural Language Processing Models: A Conceptual Framework and Overview | Deven Santosh Shah, H. Andrew Schwartz and Dirk Hovy | N/A | N/A |
| Premise Selection in Natural Language Mathematical Texts | Deborah Ferreira and André Freitas | N/A | N/A |
| Pre-train and Plug-in: Flexible Conditional Text Generation with Variational Auto-Encoders | Yu Duan, Canwen Xu, Jiaxin Pei, Jialong Han and Chenliang Li | N/A | N/A |
| Pre-training Is (Almost) All You Need: An Application to Commonsense Reasoning | Alexandre Tamborrino, Nicola Pellicanò, Baptiste Pannier, Pascal Voitot and Louise Naudin | N/A | N/A |
| Pretraining with Contrastive Sentence Objectives Improves Discourse Performance of Language Models | Dan Iter, Kelvin Guu, Larry Lansing and Dan Jurafsky | N/A | N/A |
| Probabilistic Assumptions Matter: Improved Models for Distantly-Supervised Document-Level Question Answering | Hao Cheng, Ming-Wei Chang, Kenton Lee and Kristina Toutanova | N/A | N/A |
| Probabilistically Masked Language Model Capable of Autoregressive Generation in Arbitrary Word Order | Yi Liao, Xin Jiang and Qun Liu | N/A | N/A |
| Probing for referential information in language models | Ionut-Teodor Sorodoc, Kristina Gulordava and Gemma Boleda | N/A | N/A |
| Probing Linguistic Features of Sentence-Level Representations in Relation Extraction | Christoph Alt, Aleksandra Gabryszak and Leonhard Hennig | N/A | N/A |
| Probing Linguistic Systematicity | Emily Goodwin, Koustuv Sinha and Timothy J. O’Donnell | N/A | N/A |
| Programming in Natural Language with fuSE: Synthesizing Methods from Spoken Utterances Using Deep Natural Language Understanding | Sebastian Weigelt, Vanessa Steurer, Tobias Hey and Walter F. Tichy | N/A | N/A |
| PuzzLing Machines: A Challenge on Learning From Small Data | Gözde Gül Şahin, Yova Kementchedjhieva, Phillip Rust and Iryna Gurevych | N/A | N/A |
| Pyramid: A Layered Model for Nested Named Entity Recognition | Jue Wang, Lidan Shou, Ke Chen and Gang Chen | N/A | N/A |
| QuASE: Question-Answer Driven Sentence Encoding | Hangfeng He, Qiang Ning and Dan Roth | N/A | N/A |
| R^3: Reverse, Retrieve, and Rank for Sarcasm Generation with Commonsense Knowledge | Tuhin Chakrabarty, Debanjan Ghosh, Smaranda Muresan and Nanyun Peng | N/A | N/A |
| Rationalizing Medical Relation Prediction from Corpus-level Statistics | Zhen Wang, Jennifer Lee, Simon Lin and Huan Sun | N/A | N/A |
| Rationalizing Text Matching: Learning Sparse Alignments via Optimal Transport | Kyle Swanson, Lili Yu and Tao Lei | N/A | N/A |
| RAT-SQL: Relation-Aware Schema Encoding and Linking for Text-to-SQL Parsers | Bailin Wang, Richard Shin, Xiaodong Liu, Oleksandr Polozov and Matthew Richardson | N/A | N/A |
| Reasoning Over Semantic-Level Graph for Fact Checking | Wanjun Zhong, Jingjing Xu, Duyu Tang, Zenan Xu, Nan Duan, Ming Zhou, Jiahai Wang and Jian Yin | N/A | N/A |
| Reasoning with Latent Structure Refinement for Document-Level Relation Extraction | Guoshun Nan, Zhijiang Guo, Ivan Sekulic and Wei Lu | N/A | N/A |
| Reasoning with Multimodal Sarcastic Tweets via Modeling Cross-Modality Contrast and Semantic Association | Nan Xu, Zhixiong Zeng and Wenji Mao | N/A | N/A |
| (Re)construing Meaning in NLP | Sean Trott, Tiago Timponi Torrent, Nancy Chang and Nathan Schneider | N/A | N/A |
| Recurrent Chunking Mechanisms for Long-Text Machine Reading Comprehension | Hongyu Gong, Yelong Shen, Dian Yu, Jianshu Chen and Dong Yu | N/A | N/A |
| Recurrent Neural Network Language Models Always Learn English-Like Relative Clause Attachment | Forrest Davis and Marten van Schijndel | N/A | N/A |
| Reducing Gender Bias in Neural Machine Translation as a Domain Adaptation Problem | Danielle Saunders and Bill Byrne | N/A | N/A |
| Refer360°: A Referring Expression Recognition Dataset in 360° Images | Volkan Cirik, Taylor Berg-Kirkpatrick and Louis-Philippe Morency | N/A | N/A |
| ReInceptionE: Relation-Aware Inception Network with Joint Local-Global Structural Information for Knowledge Graph Embedding | Zhiwen Xie, Guangyou Zhou, Jin Liu and Jimmy Xiangji Huang | N/A | N/A |
| Relabel the Noise: Joint Extraction of Entities and Relations via Cooperative Multiagents | Daoyuan Chen, Yaliang Li, Kai Lei and Ying Shen | N/A | N/A |
| Relational Graph Attention Network for Aspect-based Sentiment Analysis | Kai Wang, Weizhou Shen, Yunyi Yang, Xiaojun Quan and Rui Wang | N/A | N/A |
| Relation-Aware Collaborative Learning for Unified Aspect-Based Sentiment Analysis | Zhuang Chen and Tieyun Qian | N/A | N/A |
| Representation Learning for Information Extraction from Form-like Documents | Bodhisattwa Prasad Majumder, Navneet Potti, Sandeep Tata, James Bradley Wendt, Qi Zhao and Marc Najork | N/A | N/A |
| Response-Anticipated Memory for On-Demand Knowledge Integration in Response Generation | Zhiliang Tian, Wei Bi, Dongkyu Lee, Lanqing Xue, Yiping Song, Xiaojiang Liu and Nevin L. Zhang | N/A | N/A |
| Review-based Question Generation with Adaptive Instance Transfer and Augmentation | Qian Yu, Lidong Bing, Qiong Zhang, Wai Lam and Luo Si | N/A | N/A |
| Revisiting the Context Window for Cross-lingual Word Embeddings | Ryokan Ri and Yoshimasa Tsuruoka | N/A | N/A |
| Rigid Formats Controlled Text Generation | Piji Li, Haisong Zhang, Xiaojiang Liu and Shuming Shi | N/A | N/A |
| RikiNet: Reading Wikipedia Pages for Natural Question Answering | Dayiheng Liu, Yeyun Gong, Jie Fu, Yu Yan, Jiusheng Chen, Daxin Jiang, Jiancheng Lv and Nan Duan | N/A | N/A |
| Robust Encodings: A Framework for Combating Adversarial Typos | Erik Jones, Robin Jia, Aditi Raghunathan and Percy Liang | N/A | N/A |
| Roles and Utilization of Attention Heads in Transformer-based Neural Language Models | Jae-young Jo and Sung-Hyon Myaeng | N/A | N/A |
| S2ORC: The Semantic Scholar Open Research Corpus | Kyle Lo, Lucy Wang, Mark Neumann, Rodney Kinney and Daniel Weld | N/A | N/A |
| SAS: Dialogue State Tracking via Slot Attention and Slot Information Sharing | Jiaying Hu, Yan Yang, Chencai Chen, Liang He and Zhou Yu | N/A | N/A |
| SCDE: Sentence Cloze Dataset with High Quality Distractors From Examinations | Xiang Kong, Varun Gangal and Eduard Hovy | N/A | N/A |
| schuBERT: Optimizing Elements of BERT | Ashish Khetan and Zohar Karnin | N/A | N/A |
| SciREX: A Challenge Dataset for Document-Level Information Extraction | Sarthak Jain, Madeleine van Zuylen, Hannaneh Hajishirzi and Iz Beltagy | N/A | N/A |
| Screenplay Summarization Using Latent Narrative Structure | Pinelopi Papalampidi, Frank Keller, Lea Frermann and Mirella Lapata | N/A | N/A |
| ScriptWriter: Narrative-Guided Script Generation | Yutao Zhu, Ruihua Song, Zhicheng Dou, Jian-Yun Nie and Jin Zhou | N/A | N/A |
| SEEK: Segmented Embedding of Knowledge Graphs | Wentao Xu, Shun Zheng, Liang He, Bin Shao, Jian Yin and Tie-Yan Liu | N/A | N/A |
| Selecting Backtranslated Data from Multiple Sources for Improved Neural Machine Translation | Xabier Soto, Dimitar Shterionov, Alberto Poncelas and Andy Way | N/A | N/A |
| Selective Question Answering under Domain Shift | Amita Kamath, Robin Jia and Percy Liang | N/A | N/A |
| Semantic Graphs for Generating Deep Questions | Liangming Pan, Yuxi Xie, Yansong Feng, Tat-Seng Chua and Min-Yen Kan | N/A | N/A |
| Semantic Parsing for English as a Second Language | Yuanyuan Zhao, Weiwei Sun, Junjie Cao and Xiaojun Wan | N/A | N/A |
| Semantic Scaffolds for Pseudocode-to-Code Generation | Ruiqi Zhong, Mitchell Stern and Dan Klein | N/A | N/A |
| Semi-supervised Contextual Historical Text Normalization | Peter Makarov and Simon Clematide | N/A | N/A |
| Semi-Supervised Dialogue Policy Learning via Stochastic Reward Estimation | Xinting Huang, Jianzhong Qi, Yu Sun and Rui Zhang | N/A | N/A |
| Semi-Supervised Semantic Dependency Parsing Using CRF Autoencoders | Zixia Jia, Youmi Ma, Jiong Cai and Kewei Tu | N/A | N/A |
| SenseBERT: Driving Some Sense into BERT | Yoav Levine, Barak Lenz, Or Dagan, Ori Ram, Dan Padnos, Or Sharir, Shai Shalev-Shwartz, Amnon Shashua and Yoav Shoham | N/A | N/A |
| SentiBERT: A Transferable Transformer-Based Architecture for Compositional Sentiment Semantics | Da Yin, Tao Meng and Kai-Wei Chang | N/A | N/A |
| Sentiment and Emotion help Sarcasm? A Multi-task Learning Framework for Multi-Modal Sarcasm, Sentiment and Emotion Analysis | Dushyant Singh Chauhan, Dhanush S R, Asif Ekbal and Pushpak Bhattacharyya | N/A | N/A |
| SeqVAT: Virtual Adversarial Training for Semi-Supervised Sequence Labeling | Luoxin Chen, Weitong Ruan, Xinyue Liu and Jianhua Lu | N/A | N/A |
| Should All Cross-Lingual Embeddings Speak English? | Antonios Anastasopoulos and Graham Neubig | N/A | N/A |
| Similarity Analysis of Contextual Word Representation Models | John Wu, Yonatan Belinkov, Hassan Sajjad, Nadir Durrani, Fahim Dalvi and James Glass | N/A | N/A |
| Simple, Interpretable and Stable Method for Detecting Words with Usage Change across Corpora | Hila Gonen, Ganesh Jawahar, Djamé Seddah and Yoav Goldberg | N/A | N/A |
| Simplify the Usage of Lexicon in Chinese NER | Ruotian Ma, Minlong Peng, Qi Zhang, Zhongyu Wei and Xuanjing Huang | N/A | N/A |
| SimulSpeech: End-to-End Simultaneous Speech to Text Translation | Yi Ren, Jinglin Liu, Xu Tan, Chen Zhang, Tao Qin, Zhou Zhao and Tie-Yan Liu | N/A | N/A |
| Single-/Multi-Source Cross-Lingual NER via Teacher-Student Learning on Unlabeled Data in Target Language | Qianhui Wu, Zijia Lin, Börje Karlsson, Jian-Guang Lou and Biqing Huang | N/A | N/A |
| SKEP: Sentiment Knowledge Enhanced Pre-training for Sentiment Analysis | Hao Tian, Can Gao, Xinyan Xiao, Hao Liu, Bolei He, Hua Wu, Haifeng Wang and Feng Wu | N/A | N/A |
| Slot-consistent NLG for Task-oriented Dialogue Systems with Iterative Rectification Network | Yangming Li, Kaisheng Yao, Libo Qin, Wanxiang Che, Xiaolong Li and Ting Liu | N/A | N/A |
| SMART: Robust and Efficient Fine-Tuning for Pre-trained Natural Language Models through Principled Regularized Optimization | Haoming Jiang, Pengcheng He, Weizhu Chen, Xiaodong Liu, Jianfeng Gao and Tuo Zhao | N/A | N/A |
| Social Bias Frames: Reasoning about Social and Power Implications of Language | Maarten Sap, Saadia Gabriel, Lianhui Qin, Dan Jurafsky, Noah A. Smith and Yejin Choi | N/A | N/A |
| Sources of Transfer in Multilingual Named Entity Recognition | David Mueller, Nicholas Andrews and Mark Dredze | N/A | N/A |
| Span Selection Pre-training for Question Answering | Michael Glass, Alfio Gliozzo, Rishav Chakravarti, Anthony Ferritto, Lin Pan, G P Shrivatsa Bhargav, Dinesh Garg and Avi Sil | N/A | N/A |
| Span-based Localizing Network for Natural Language Video Localization | Hao Zhang, Aixin Sun, Wei Jing and Joey Tianyi Zhou | N/A | N/A |
| SpanMlt: A Span-based Multi-Task Learning Framework for Pair-wise Aspect and Opinion Terms Extraction | He Zhao, Longtao Huang, Rong Zhang, Quan Lu and Hui Xue | N/A | N/A |
| Speak to your Parser: Interactive Text-to-SQL with Natural Language Feedback | Ahmed Elgohary, Saghar Hosseini and Ahmed Hassan Awadallah | N/A | N/A |
| Speaker Sensitive Response Evaluation Model | JinYeong Bak and Alice Oh | N/A | N/A |
| Speakers enhance contextually confusable words | Eric Meinhardt, Eric Bakovic and Leon Bergen | N/A | N/A |
| SPECTER: Document-level Representation Learning using Citation-informed Transformers | Arman Cohan, Sergey Feldman, Iz Beltagy, Doug Downey and Daniel Weld | N/A | N/A |
| Speech Translation and the End-to-End Promise: Taking Stock of Where We Are | Matthias Sperber and Matthias Paulik | N/A | N/A |
| SpellGCN: Incorporating Phonological and Visual Similarities into Language Models for Chinese Spelling Check | Xingyi Cheng, Weidi Xu, Kunlong Chen, Shaohua Jiang, Feng Wang, Taifeng Wang, Wei Chu and Yuan Qi | N/A | N/A |
| Spelling Error Correction with Soft-Masked BERT | Shaohua Zhang, Haoran Huang, Jicong Liu and Hang Li | N/A | N/A |
| Spying on your neighbors: Fine-grained probing of contextual embeddings for information about surrounding words | Josef Klafka and Allyson Ettinger | N/A | N/A |
| STARC: Structured Annotations for Reading Comprehension | Yevgeni Berzak, Jonathan Malmaud and Roger Levy | N/A | N/A |
| Stock Embeddings Acquired from News Articles and Price History, and an Application to Portfolio Optimization | Xin Du and Kumiko Tanaka-Ishii | N/A | N/A |
| Storytelling with Dialogue: A Critical Role Dungeons and Dragons Dataset | Revanth Rameshkumar and Peter Bailey | N/A | N/A |
| Structural Information Preserving for Graph-to-Text Generation | Linfeng Song, Ante Wang, Jinsong Su, Yue Zhang, Kun Xu, Yubin Ge and Dong Yu | N/A | N/A |
| Structured Tuning for Semantic Role Labeling | Tao Li, Parth Anand Jawale, Martha Palmer and Vivek Srikumar | N/A | N/A |
| Structure-Level Knowledge Distillation For Multilingual Sequence Labeling | Xinyu Wang, Yong Jiang, Nguyen Bach, Tao Wang, Fei Huang and Kewei Tu | N/A | N/A |
| Suspense in Short Stories is Predicted By Uncertainty Reduction over Neural Story Representation | David Wilmot and Frank Keller | N/A | N/A |
| Synchronous Double-channel Recurrent Network for Aspect-Opinion Pair Extraction | Shaowei Chen, Jie Liu, Yu Wang, Wenzheng Zhang and Ziming Chi | N/A | N/A |
| Syn-QG: Syntactic and Shallow Semantic Rules for Question Generation | Kaustubh Dhole and Christopher D. Manning | N/A | N/A |
| Syntax-Aware Opinion Role Labeling with Dependency Graph Convolutional Networks | Bo Zhang, Yue Zhang, Rui Wang, Zhenghua Li and Min Zhang | N/A | N/A |
| TaBERT: Pretraining for Joint Understanding of Textual and Tabular Data | Pengcheng Yin, Graham Neubig, Wen-tau Yih and Sebastian Riedel | N/A | N/A |
| TACRED Revisited: A Thorough Evaluation of the TACRED Relation Extraction Task | Christoph Alt, Aleksandra Gabryszak and Leonhard Hennig | N/A | N/A |
| TAG: Type Auxiliary Guiding for Code Comment Generation | Ruichu Cai, Zhihao Liang, Boyan Xu, Zijian Li, Yuexing Hao and Yao Chen | N/A | N/A |
| Tangled up in BLEU: Reevaluating the Evaluation of Automatic Machine Translation Evaluation Metrics | Nitika Mathur, Timothy Baldwin and Trevor Cohn | N/A | N/A |
| TaPas: Weakly Supervised Table Parsing via Pre-training | Jonathan Herzig, Pawel Krzysztof Nowak, Thomas Müller, Francesco Piccinno and Julian Eisenschlos | N/A | N/A |
| Target Inference in Argument Conclusion Generation | Milad Alshomary, Shahbaz Syed, Martin Potthast and Henning Wachsmuth | N/A | N/A |
| Taxonomy Construction of Unseen Domains via Graph-based Cross-Domain Knowledge Transfer | Chao Shang, Sarthak Dash, Md Faisal Mahbub Chowdhury, Nandana Mihindukulasooriya and Alfio Gliozzo | N/A | N/A |
| Tchebycheff Procedure for Multi-task Text Classification | Yuren Mao, Shuang Yun, Weiwei Liu and Bo Du | N/A | N/A |
| Temporal Common Sense Acquisition with Minimal Supervision | Ben Zhou, Qiang Ning, Daniel Khashabi and Dan Roth | N/A | N/A |
| Temporally-Informed Analysis of Named Entity Recognition | Shruti Rijhwani and Daniel Preotiuc-Pietro | N/A | N/A |
| Text and Causal Inference: A Review of Using Text to Remove Confounding from Causal Estimates | Katherine Keith, David Jensen and Brendan O’Connor | N/A | N/A |
| Text-Based Ideal Points | Keyon Vafa, Suresh Naidu and David Blei | N/A | N/A |
| That is a Known Lie: Detecting Previously Fact-Checked Claims | Shaden Shaar, Nikolay Babulkov, Giovanni Da San Martino and Preslav Nakov | N/A | N/A |
| “The Boating Store Had Its Best Sail Ever”: Pronunciation-attentive Contextualized Pun Recognition | Yichao Zhou, Jyun-Yu Jiang, Jieyu Zhao, Kai-Wei Chang and Wei Wang | N/A | N/A |
| The Cascade Transformer: an Application for Efficient Answer Sentence Selection | Luca Soldaini and Alessandro Moschitti | N/A | N/A |
| The Dialogue Dodecathlon: Open-Domain Knowledge and Image Grounded Conversational Agents | Kurt Shuster, Da Ju, Stephen Roller, Emily Dinan, Y-Lan Boureau and Jason Weston | N/A | N/A |
| The Paradigm Discovery Problem | Alexander Erdmann, Micha Elsner, Shijie Wu, Ryan Cotterell and Nizar Habash | N/A | N/A |
| The Right Tool for the Job: Matching Model and Instance Complexities | Roy Schwartz, Gabriel Stanovsky, Swabha Swayamdipta, Jesse Dodge and Noah A. Smith | N/A | N/A |
| The Sensitivity of Language Models and Humans to Winograd Schema Perturbations | Mostafa Abdou, Vinit Ravishankar, Maria Barrett, Yonatan Belinkov, Desmond Elliott and Anders Søgaard | N/A | N/A |
| The SOFC-Exp Corpus and Neural Approaches to Information Extraction in the Materials Science Domain | Annemarie Friedrich, Heike Adel, Federico Tomazic, Johannes Hingerl, Renou Benteau, Anika Marusczyk and Lukas Lange | N/A | N/A |
| The State and Fate of Linguistic Diversity and Inclusion in the NLP World | Pratik Joshi, Sebastin Santy, Amar Budhiraja, Kalika Bali and Monojit Choudhury | N/A | N/A |
| The Summary Loop: Learning to Write Abstractive Summaries Without Examples | Philippe Laban, Andrew Hsi, John Canny and Marti A. Hearst | N/A | N/A |
| The TechQA Dataset | Vittorio Castelli, Rishav Chakravarti, Saswati Dana, Anthony Ferritto, Radu Florian, Martin Franz, Dinesh Garg, Dinesh Khandelwal, Scott McCarley, Michael McCawley, Mohamed Nasr, Lin Pan, Cezar Pendus, John Pitrelli, Saurabh Pujar, Salim Roukos, Andrzej Sakrajda, Avi Sil, Rosario Uceda-Sosa, Todd Ward and Rong Zhang | N/A | N/A |
| The Unstoppable Rise of Computational Linguistics in Deep Learning | James Henderson | N/A | N/A |
| To Boldly Query What No One Has Annotated Before? The Frontiers of Corpus Querying | Markus Gärtner and Kerstin Jung | N/A | N/A |
| To Test Machine Comprehension, Start by Defining Comprehension | Jesse Dunietz, Greg Burnham, Akash Bharadwaj, Owen Rambow, Jennifer Chu-Carroll and Dave Ferrucci | N/A | N/A |
| Toward Gender-Inclusive Coreference Resolution | Yang Trista Cao and Hal Daumé III | N/A | N/A |
| Towards Conversational Recommendation over Multi-Type Dialogs | Zeming Liu, Haifeng Wang, Zheng-Yu Niu, Hua Wu, Wanxiang Che and Ting Liu | N/A | N/A |
| Towards Debiasing Sentence Representations | Paul Pu Liang, Irene Mengze Li, Emily Zheng, Yao Chong Lim, Ruslan Salakhutdinov and Louis-Philippe Morency | N/A | N/A |
| Towards Emotion-aided Multi-modal Dialogue Act Classification | Tulika Saha, Aditya Patra, Sriparna Saha and Pushpak Bhattacharyya | N/A | N/A |
| Towards Faithful Neural Table-to-Text Generation with Content-Matching Constraints | Zhenyi Wang, Xiaoyang Wang, Bang An, Dong Yu and Changyou Chen | N/A | N/A |
| Towards Holistic and Automatic Evaluation of Open-Domain Dialogue Generation | Bo Pang, Erik Nijkamp, Wenjuan Han, Linqi Zhou, Yixian Liu and Kewei Tu | N/A | N/A |
| Towards Interpretable Clinical Diagnosis with Bayesian Network Ensembles Stacked on Entity-Aware CNNs | Jun Chen, Xiaoya Dai, Quan Yuan, Chao Lu and Haifeng Huang | N/A | N/A |
| Towards Robustifying NLI Models Against Lexical Dataset Biases | Xiang Zhou and Mohit Bansal | N/A | N/A |
| Towards Transparent and Explainable Attention Models | Akash Kumar Mohankumar, Preksha Nema, Sharan Narasimhan, Mitesh M. Khapra, Balaji Vasan Srinivasan and Balaraman Ravindran | N/A | N/A |
| Towards Understanding Gender Bias in Relation Extraction | Andrew Gaut, Tony Sun, Shirlyn Tang, Yuxin Huang, Jing Qian, Mai ElSherief, Jieyu Zhao, Diba Mirza, Elizabeth Belding, Kai-Wei Chang and William Yang Wang | N/A | N/A |
| Towards Unsupervised Language Understanding and Generation by Joint Dual Learning | Shang-Yu Su, Chao-Wei Huang and Yun-Nung Chen | N/A | N/A |
| Toxicity Detection: Does Context Really Matter? | John Pavlopoulos, Jeffrey Sorensen, Lucas Dixon, Nithum Thain and Ion Androutsopoulos | N/A | N/A |
| Transition-based Directed Graph Construction for Emotion-Cause Pair Extraction | Chuang Fan, Chaofa Yuan, Jiachen Du, Lin Gui, Min Yang and Ruifeng Xu | N/A | N/A |
| Transition-based Semantic Dependency Parsing with Pointer Networks | Daniel Fernández-González and Carlos Gómez-Rodríguez | N/A | N/A |
| Translationese as a Language in “Multilingual” NMT | Parker Riley, Isaac Caswell, Markus Freitag and David Grangier | N/A | N/A |
| TransS-Driven Joint Learning Architecture for Implicit Discourse Relation Recognition | Ruifang He, Jian Wang, Fengyu Guo and Yugui Han | N/A | N/A |
| TVQA+: Spatio-Temporal Grounding for Video Question Answering | Jie Lei, Licheng Yu, Tamara Berg and Mohit Bansal | N/A | N/A |
| TXtract: Taxonomy-Aware Knowledge Extraction for Thousands of Product Categories | Giannis Karamanolakis, Jun Ma and Xin Luna Dong | N/A | N/A |
| Uncertainty-Aware Curriculum Learning for Neural Machine Translation | Yikai Zhou, Baosong Yang, Derek F. Wong, Yu Wan and Lidia S. Chao | N/A | N/A |
| Understanding Attention for Text Classification | Xiaobing Sun and Wei Lu | N/A | N/A |
| Understanding the Language of Political Agreement and Disagreement in Legislative Texts | Maryam Davoodi, Eric Waltenburg and Dan Goldwasser | N/A | N/A |
| Universal Decompositional Semantic Parsing | Elias Stengel-Eskin, Aaron Steven White, Sheng Zhang and Benjamin Van Durme | N/A | N/A |
| Unknown Intent Detection Using Gaussian Mixture Model with an Application to Zero-shot Intent Classification | Guangfeng Yan, Lu Fan, Qimai Li, Han Liu, Xiaotong Zhang, Xiao-Ming Wu and Albert Y.S. Lam | N/A | N/A |
| Unsupervised Alignment-based Iterative Evidence Retrieval for Multi-hop Question Answering | Vikas Yadav, Steven Bethard and Mihai Surdeanu | N/A | N/A |
| Unsupervised Cross-lingual Representation Learning at Scale | Alexis Conneau, Kartikay Khandelwal, Naman Goyal, Vishrav Chaudhary, Guillaume Wenzek, Francisco Guzmán, Edouard Grave, Myle Ott, Luke Zettlemoyer and Veselin Stoyanov | N/A | N/A |
| Unsupervised Domain Clusters in Pretrained Language Models | Roee Aharoni and Yoav Goldberg | N/A | N/A |
| Unsupervised Dual Paraphrasing for Two-stage Semantic Parsing | Ruisheng Cao, Su Zhu, Chenyu Yang, Chen Liu, Rao Ma, Yanbin Zhao, Lu Chen and Kai Yu | N/A | N/A |
| Unsupervised Morphological Paradigm Completion | Huiming Jin, Liwei Cai, Yihui Peng, Chen Xia, Arya McCarthy and Katharina Kann | N/A | N/A |
| Unsupervised Multimodal Neural Machine Translation with Pseudo Visual Pivoting | Po-Yao Huang, Junjie Hu, Xiaojun Chang and Alexander Hauptmann | N/A | N/A |
| Unsupervised Opinion Summarization as Copycat-Review Generation | Arthur Bražinskas, Mirella Lapata and Ivan Titov | N/A | N/A |
| Unsupervised Opinion Summarization with Noising and Denoising | Reinald Kim Amplayo and Mirella Lapata | N/A | N/A |
| Unsupervised Paraphrasing by Simulated Annealing | Xianggen Liu, Lili Mou, Fandong Meng, Hao Zhou, Jie Zhou and Sen Song | N/A | N/A |
| USR: An Unsupervised and Reference Free Evaluation Metric for Dialog Generation | Shikib Mehri and Maxine Eskenazi | N/A | N/A |
| Weight Poisoning Attacks on Pretrained Models | Keita Kurita, Paul Michel and Graham Neubig | N/A | N/A |
| What are the Goals of Distributional Semantics? | Guy Emerson | N/A | N/A |
| What determines the order of adjectives in English? Comparing efficiency-based theories using dependency treebanks | Richard Futrell, William Dyer and Greg Scontras | N/A | N/A |
| What Question Answering can Learn from Trivia Nerds | Jordan Boyd-Graber and Benjamin Börschinger | N/A | N/A |
| What Was Written vs. Who Read It: News Media Profiling Using Text Analysis and Social Media Context | Ramy Baly, Georgi Karadzhov, Jisun An, Haewoon Kwak, Yoan Dinkov, Ahmed Ali, James Glass and Preslav Nakov | N/A | N/A |
| When do Word Embeddings Accurately Reflect Surveys on our Beliefs About People? | Kenneth Joseph and Jonathan Morgan | N/A | N/A |
| “Who said it, and Why?” Provenance for Natural Language Claims | Yi Zhang, Zachary Ives and Dan Roth | N/A | N/A |
| WinoWhy: A Deep Diagnosis of Essential Commonsense Knowledge for Answering Winograd Schema Challenge | Hongming Zhang, Xinran Zhao and Yangqiu Song | N/A | N/A |
| Word-level Textual Adversarial Attacking as Combinatorial Optimization | Yuan Zang, Fanchao Qi, Chenghao Yang, Zhiyuan Liu, Meng Zhang, Qun Liu and Maosong Sun | N/A | N/A |
| XtremeDistil: Multi-stage Distillation for Massive Multilingual Models | Subhabrata Mukherjee and Ahmed Hassan Awadallah | N/A | N/A |
| You Impress Me: Dialogue Generation via Mutual Persona Perception | Qian Liu, Yihong Chen, Bei Chen, Jian-Guang Lou, Zixuan Chen, Bin Zhou and Dongmei Zhang | N/A | N/A |
| Zero-shot Text Classification via Reinforced Self-training | Zhiquan Ye, Yuxia Geng, Jiaoyan Chen, Jingmin Chen, Xiaoxiao Xu, Suhang Zheng, Feng Wang, Jun Zhang and Huajun Chen | N/A | N/A |
| Zero-Shot Transfer Learning with Synthesized Data for Multi-Domain Dialogue State Tracking | Giovanni Campagna, Agata Foryciarz, Mehrad Moradshahi and Monica Lam | N/A | N/A |
| ZeroShotCeres: Zero-Shot Relation Extraction from Semi-Structured Webpages | Colin Lockard, Prashant Shiralkar, Xin Luna Dong and Hannaneh Hajishirzi | N/A | N/A |
| A Complete Shift-Reduce Chinese Discourse Parser with Robust Dynamic Oracle | Shyh-Shiun Hung, Hen-Hsen Huang and Hsin-Hsi Chen | N/A | N/A |
| A Diverse Corpus for Evaluating and Developing English Math Word Problem Solvers | Shen-yun Miao, Chao-Chun Liang and Keh-Yih Su | N/A | N/A |
| A Frame-based Sentence Representation for Machine Reading Comprehension | Shaoru Guo, Ru Li, Hongye Tan, Xiaoli Li, Yong Guan, Hongyan Zhao and Yueping Zhang | N/A | N/A |
| A Large-Scale Multi-Document Summarization Dataset from the Wikipedia Current Events Portal | Demian Gholipour Ghalandari, Chris Hokamp, Nghia The Pham, John Glover and Georgiana Ifrim | N/A | N/A |
| A Multi-Perspective Architecture for Semantic Code Search | Rajarshi Haldar, Lingfei Wu, JinJun Xiong and Julia Hockenmaier | N/A | N/A |
| A negative case analysis of visual grounding methods for VQA | Robik Shrestha, Kushal Kafle and Christopher Kanan | N/A | N/A |
| A Probabilistic Generative Model for Typographical Analysis of Early Modern Printing | Kartik Goyal, Chris Dyer, Christopher Warren, Maxwell G’Sell and Taylor Berg-Kirkpatrick | N/A | N/A |
| A Re-evaluation of Knowledge Graph Completion Methods | Zhiqing Sun, Shikhar Vashishth, Soumya Sanyal, Partha Talukdar and Yiming Yang | N/A | N/A |
| A Relational Memory-based Embedding Model for Triple Classification and Search Personalization | Dai Quoc Nguyen, Tu Nguyen and Dinh Phung | N/A | N/A |
| A Relaxed Matching Procedure for Unsupervised BLI | Xu Zhao, Zihao Wang, Yong Zhang and Hao Wu | N/A | N/A |
| A Retrieve-and-Rewrite Initialization Method for Unsupervised Machine Translation | Shuo Ren, Yu Wu, Shujie Liu, Ming Zhou and Shuai Ma | N/A | N/A |
| A Simple and Effective Unified Encoder for Document-Level Machine Translation | Shuming Ma, Dongdong Zhang and Ming Zhou | N/A | N/A |
| A Tale of a Probe and a Parser | Rowan Hall Maudslay, Josef Valvoda, Tiago Pimentel, Adina Williams and Ryan Cotterell | N/A | N/A |
| A Three-Parameter Rank-Frequency Relation in Natural Languages | Chenchen Ding, Masao Utiyama and Eiichiro Sumita | N/A | N/A |
| A Transformer-based Approach for Source Code Summarization | Wasi Ahmad, Saikat Chakraborty, Baishakhi Ray and Kai-Wei Chang | N/A | N/A |
| A Two-Stage Masked LM Method for Term Set Expansion | Guy Kushilevitz, Shaul Markovitch and Yoav Goldberg | N/A | N/A |
| A Two-Step Approach for Implicit Event Argument Detection | Zhisong Zhang, Xiang Kong, Zhengzhong Liu, Xuezhe Ma and Eduard Hovy | N/A | N/A |
| Active Learning for Coreference Resolution using Discrete Annotation | Belinda Z. Li, Gabriel Stanovsky and Luke Zettlemoyer | N/A | N/A |
| An Empirical Comparison of Unsupervised Constituency Parsing Methods | Jun Li, Yifan Cao, Jiong Cai, Yong Jiang and Kewei Tu | N/A | N/A |
| Analyzing the Persuasive Effect of Style in News Editorial Argumentation | Roxanne El Baff, Henning Wachsmuth, Khalid Al Khatib and Benno Stein | N/A | N/A |
| Are we Estimating or Guesstimating Translation Quality? | Shuo Sun, Francisco Guzmán and Lucia Specia | N/A | N/A |
| Attend to Medical Ontologies: Content Selection for Clinical Abstractive Summarization | Sajad Sotudeh Gharebagh, Nazli Goharian and Ross Filice | N/A | N/A |
| Autoencoding Keyword Correlation Graph for Document Clustering | Billy Chiu, Sunil Kumar Sahu, Derek Thomas, Neha Sengupta and Mohammady Mahdy | N/A | N/A |
| Automated Topical Component Extraction Using Neural Network Attention Scores from Source-based Essay Scoring | Haoran Zhang and Diane Litman | N/A | N/A |
| Automatic Machine Translation Evaluation using Source Language Inputs and Cross-lingual Language Model | Kosuke Takahashi, Katsuhito Sudoh and Satoshi Nakamura | N/A | N/A |
| Bayesian Hierarchical Words Representation Learning | Oren Barkan, Idan Rejwan, Avi Caciularu and Noam Koenigstein | N/A | N/A |
| Benefits of Intermediate Annotations in Reading Comprehension | Dheeru Dua, Sameer Singh and Matt Gardner | N/A | N/A |
| Camouflaged Chinese Spam Content Detection with Semi-supervised Generative Active Learning | Zhuoren Jiang, Zhe Gao, Yu Duan, Yangyang Kang, Changlong Sun, Qiong Zhang and Xiaozhong Liu | N/A | N/A |
| Character-Level Translation with Self-attention | Yingqiang Gao, Nikola I. Nikolov, Yuhuang Hu and Richard H.R. Hahnloser | N/A | N/A |
| ClarQ: A large-scale and diverse dataset for Clarification Question Generation | Vaibhav Kumar and Alan W Black | N/A | N/A |
| Classification-Based Self-Learning for Weakly Supervised Bilingual Lexicon Induction | Mladen Karan, Ivan Vulić, Anna Korhonen and Goran Glavaš | N/A | N/A |
| Clinical Concept Linking with Contextualized Neural Representations | Elliot Schumacher, Andriy Mulyar and Mark Dredze | N/A | N/A |
| Closing the Gap: Joint De-Identification and Concept Extraction in the Clinical Domain | Lukas Lange, Heike Adel and Jannik Strötgen | N/A | N/A |
| Coach: A Coarse-to-Fine Approach for Cross-domain Slot Filling | Zihan Liu, Genta Indra Winata, Peng Xu and Pascale Fung | N/A | N/A |
| Code-switching patterns can be an effective route to improve performance of downstream NLP applications: A case study of humour, sarcasm and hate speech detection | Srijan Bansal, Vishal Garimella, Ayush Suhane, Jasabanta Patro and Animesh Mukherjee | N/A | N/A |
| Composing Elementary Discourse Units in Abstractive Summarization | Zhenwen Li, Wenhao Wu and Sujian Li | N/A | N/A |
| Content Word Aware Neural Machine Translation | Kehai Chen, Rui Wang, Masao Utiyama and Eiichiro Sumita | N/A | N/A |
| Contextual Embeddings: When Are They Worth It? | Simran Arora, Avner May, Jian Zhang and Christopher Ré | N/A | N/A |
| Contextual Neural Machine Translation Improves Translation of Cataphoric Pronouns | KayYen Wong, Sameen Maruf and Gholamreza Haffari | N/A | N/A |
| Contextualized Sparse Representations for Real-Time Open-Domain Question Answering | Jinhyuk Lee, Minjoon Seo, Hannaneh Hajishirzi and Jaewoo Kang | N/A | N/A |
| Contextualizing Hate Speech Classifiers with Post-hoc Explanation | Brendan Kennedy, Xisen Jin, Aida Mostafazadeh Davani, Morteza Dehghani and Xiang Ren | N/A | N/A |
| Contrastive Self-Supervised Learning for Commonsense Reasoning | Tassilo Klein and Moin Nabi | N/A | N/A |
| Controlled Crowdsourcing for High-Quality QA-SRL Annotation | Paul Roit, Ayal Klein, Daniela Stepanov, Jonathan Mamou, Julian Michael, Gabriel Stanovsky, Luke Zettlemoyer and Ido Dagan | N/A | N/A |
| Conversational Word Embedding for Retrieval-Based Dialog System | Wentao Ma, Yiming Cui, Ting Liu, Dong Wang, Shijin Wang and Guoping Hu | N/A | N/A |
| Crawling and Preprocessing Mailing Lists At Scale for Dialog Analysis | Janek Bevendorff, Khalid Al Khatib, Martin Potthast and Benno Stein | N/A | N/A |
| Crossing Variational Autoencoders for Answer Retrieval | Wenhao Yu, Lingfei Wu, Qingkai Zeng, Shu Tao, Yu Deng and Meng Jiang | N/A | N/A |
| DeeBERT: Dynamic Early Exiting for Accelerating BERT Inference | Ji Xin, Raphael Tang, Jaejun Lee, Yaoliang Yu and Jimmy Lin | N/A | N/A |
| Designing Precise and Robust Dialogue Response Evaluators | Tianyu Zhao, Divesh Lala and Tatsuya Kawahara | N/A | N/A |
| Dialogue State Tracking with Explicit Slot Connection Modeling | Yawen Ouyang, Moxin Chen, Xinyu Dai, Yinggong Zhao, Shujian Huang and Jiajun Chen | N/A | N/A |
| Do Transformers Need Deep Long-Range Memory? | Jack Rae and Ali Razavi | N/A | N/A |
| Do you have the right scissors? Tailoring Pre-trained Language Models via Monte-Carlo Methods | Ning Miao, Yuxuan Song, Hao Zhou and Lei Li | N/A | N/A |
| Does Multi-Encoder Help? A Case Study on Context-Aware Neural Machine Translation | Bei Li, Hui Liu, Ziyang Wang, Yufan Jiang, Tong Xiao, Jingbo Zhu, Tongran Liu and Changliang Li | N/A | N/A |
| Don’t Eclipse Your Arts Due to Small Discrepancies: Boundary Repositioning with a Pointer Network for Aspect Extraction | Zhenkai Wei, Yu Hong, Bowei Zou, Meng Cheng and Jianmin Yao | N/A | N/A |
| Dscorer: A Fast Evaluation Metric for Discourse Representation Structure Parsing | Jiangming Liu, Shay B. Cohen and Mirella Lapata | N/A | N/A |
| Dynamic Memory Induction Networks for Few-Shot Text Classification | Ruiying Geng, Binhua Li, Yongbin Li, Jian Sun and Xiaodan Zhu | N/A | N/A |
| Dynamic Sampling Strategies for Multi-Task Reading Comprehension | Ananth Gottumukkala, Dheeru Dua, Sameer Singh and Matt Gardner | N/A | N/A |
| Dynamically Adjusting Transformer Batch Size by Monitoring Gradient Direction Change | Hongfei Xu, Josef van Genabith, Deyi Xiong and Qiuhui Liu | N/A | N/A |
| Efficient strategies for hierarchical text classification: external knowledge and auxiliary tasks | Kervy Rivas Rojas, Gina Bustamante, Arturo Oncevay and Marco Antonio Sobrevilla Cabezudo | N/A | N/A |
| Embarrassingly Simple Unsupervised Aspect Extraction | Stéphan Tulkens and Andreas van Cranenburgh | N/A | N/A |
| Enabling Language Models to Fill in the Blanks | Chris Donahue, Mina Lee and Percy Liang | N/A | N/A |
| Encoder-Decoder Models Can Benefit from Pre-trained Masked Language Models in Grammatical Error Correction | Masahiro Kaneko, Masato Mita, Shun Kiyono, Jun Suzuki and Kentaro Inui | N/A | N/A |
| ENGINE: Energy-Based Inference Networks for Non-Autoregressive Machine Translation | Lifu Tu, Richard Yuanzhe Pang, Sam Wiseman and Kevin Gimpel | N/A | N/A |
| Enhancing Machine Translation with Dependency-Aware Self-Attention | Emanuele Bugliarello and Naoaki Okazaki | N/A | N/A |
| Enhancing Pre-trained Chinese Character Representation with Word-aligned Attention | Yanzeng Li, Bowen Yu, Xue Mengge and Tingwen Liu | N/A | N/A |
| Enriched In-Order Linearization for Faster Sequence-to-Sequence Constituent Parsing | Daniel Fernández-González and Carlos Gómez-Rodríguez | N/A | N/A |
| Entity-Aware Dependency-Based Deep Graph Attention Network for Comparative Preference Classification | Nianzu Ma, Sahisnu Mazumder, Hao Wang and Bing Liu | N/A | N/A |
| Estimating Mutual Information Between Dense Word Embeddings | Vitalii Zhelezniak, Aleksandar Savkov and Nils Hammerla | N/A | N/A |
| Evaluating Dialogue Generation Systems via Response Selection | Shiki Sato, Reina Akama, Hiroki Ouchi, Jun Suzuki and Kentaro Inui | N/A | N/A |
| Evaluating Robustness to Input Perturbations for Neural Machine Translation | Xing Niu, Prashant Mathur, Georgiana Dinu and Yaser Al-Onaizan | N/A | N/A |
| Every Document Owns Its Structure: Inductive Text Classification via Graph Neural Networks | Yufeng Zhang, Xueli Yu, Zeyu Cui, Shu Wu, Zhongzhen Wen and Liang Wang | N/A | N/A |
| ExpBERT: Representation Engineering with Natural Language Explanations | Shikhar Murty, Pang Wei Koh and Percy Liang | N/A | N/A |
| Exploiting Personal Characteristics of Debaters for Predicting Persuasiveness | Khalid Al Khatib, Michael Völske, Shahbaz Syed, Nikolay Kolyada and Benno Stein | N/A | N/A |
| Exploring Content Selection in Summarization of Novel Chapters | Faisal Ladhak, Bryan Li, Yaser Al-Onaizan and Kathy McKeown | N/A | N/A |
| Fact-based Content Weighting for Evaluating Abstractive Summarisation | Xinnuo Xu, Ondřej Dušek, Jingyi Li, Verena Rieser and Ioannis Konstas | N/A | N/A |
| Fatality Killed the Cat or: BabelPic, a Multimodal Dataset for Non-Concrete Concepts | Agostina Calabrese, Michele Bevilacqua and Roberto Navigli | N/A | N/A |
| Few-Shot NLG with Pre-Trained Language Model | Zhiyu Chen, Harini Eavani, Wenhu Chen, Yinyin Liu and William Yang Wang | N/A | N/A |
| FLAT: Chinese NER Using Flat-Lattice Transformer | Xiaonan Li, Hang Yan, Xipeng Qiu and Xuanjing Huang | N/A | N/A |
| GAN-BERT: Generative Adversarial Learning for Robust Text Classification with a Bunch of Labeled Examples | Danilo Croce, Giuseppe Castellucci and Roberto Basili | N/A | N/A |
| Geometry-aware domain adaptation for unsupervised alignment of word embeddings | Pratik Jawanpuria, Mayank Meghwanshi and Bamdev Mishra | N/A | N/A |
| Give Me Convenience and Give Her Death: Who Should Decide What Uses of NLP are Appropriate, and on What Basis? | Kobi Leins, Jey Han Lau and Timothy Baldwin | N/A | N/A |
| Glyph2Vec: Learning Chinese Out-of-Vocabulary Word Embedding from Glyphs | Hong-You Chen, Sz-Han Yu and Shou-de Lin | N/A | N/A |
| GPT-too: A language-model-first approach for AMR-to-text generation | Manuel Mager, Ramón Fernandez Astudillo, Tahira Naseem, Md Arafat Sultan, Young-Suk Lee, Radu Florian and Salim Roukos | N/A | N/A |
| How Can We Accelerate Progress Towards Human-like Linguistic Generalization? | Tal Linzen | N/A | N/A |
| Hypernymy Detection for Low-Resource Languages via Meta Learning | Changlong Yu, Jialong Han, Haisong Zhang and Wilfred Ng | N/A | N/A |
| Identifying Principals and Accessories in a Complex Case based on the Comprehension of Fact Description | Yakun Hu, Zhunchen Luo and Wenhan Chao | N/A | N/A |
| Implicit Discourse Relation Classification: We Need to Talk about Evaluation | Najoung Kim, Song Feng, Chulaka Gunasekara and Luis Lastras | N/A | N/A |
| Improved Speech Representations with Multi-Target Autoregressive Predictive Coding | Yu-An Chung and James Glass | N/A | N/A |
| Improving Entity Linking through Semantic Reinforced Entity Embeddings | Feng Hou, Ruili Wang, Jun He and Yi Zhou | N/A | N/A |
| Improving Low-Resource Named Entity Recognition using Joint Sentence and Token Labeling | Canasai Kruengkrai, Thien Hai Nguyen, Sharifah Mahani Aljunied and Lidong Bing | N/A | N/A |
| Improving Non-autoregressive Neural Machine Translation with Monolingual Data | Jiawei Zhou and Phillip Keung | N/A | N/A |
| Incorporating External Knowledge through Pre-training for Natural Language to Code Generation | Frank F. Xu, Zhengbao Jiang, Pengcheng Yin, Bogdan Vasilescu and Graham Neubig | N/A | N/A |
| Instance-Based Learning of Span Representations: A Case Study through Named Entity Recognition | Hiroki Ouchi, Jun Suzuki, Sosuke Kobayashi, Sho Yokoi, Tatsuki Kuribayashi, Ryuto Konno and Kentaro Inui | N/A | N/A |
| Interpretable Operational Risk Classification with Semi-Supervised Variational Autoencoder | Fan Zhou, Shengming Zhang and Yi Yang | N/A | N/A |
| Interpreting Twitter User Geolocation | Ting Zhong, Tianliang Wang, Fan Zhou, Goce Trajcevski, Kunpeng Zhang and Yi Yang | N/A | N/A |
| Is Your Classifier Actually Biased? Measuring Fairness under Uncertainty with Bernstein Bounds | Kawin Ethayarajh | N/A | N/A |
| It’s Easier to Translate out of English than into it: Measuring Neural Translation Difficulty by Cross-Mutual Information | Emanuele Bugliarello, Sabrina J. Mielke, Antonios Anastasopoulos, Ryan Cotterell and Naoaki Okazaki | N/A | N/A |
| Keyphrase Generation for Scientific Document Retrieval | Florian Boudin, Ygor Gallina and Akiko Aizawa | N/A | N/A |
| Knowledge Supports Visual Language Grounding: A Case Study on Colour Terms | Simeon Schüz and Sina Zarrieß | N/A | N/A |
| Language-aware Interlingua for Multilingual Neural Machine Translation | Changfeng Zhu, Heng Yu, Shanbo Cheng and Weihua Luo | N/A | N/A |
| Learning an Unreferenced Metric for Online Dialogue Evaluation | Koustuv Sinha, Prasanna Parthasarathi, Jasmine Wang, Ryan Lowe, William L. Hamilton and Joelle Pineau | N/A | N/A |
| Learning Implicit Text Generation via Feature Matching | Inkit Padhi, Pierre Dognin, Ke Bai, Cícero Nogueira dos Santos, Vijil Chenthamarakshan, Youssef Mroueh and Payel Das | N/A | N/A |
| Learning Low-Resource End-To-End Goal-Oriented Dialog for Fast and Reliable System Deployment | Yinpei Dai, Hangyu Li, Chengguang Tang, Yongbin Li, Jian Sun and Xiaodan Zhu | N/A | N/A |
| Learning Robust Models for e-Commerce Product Search | Thanh Nguyen, Nikhil Rao and Karthik Subbian | N/A | N/A |
| Learning Spoken Language Representations with Neural Lattice Language Modeling | Chao-Wei Huang and Yun-Nung Chen | N/A | N/A |
| Learning to Tag OOV Tokens by Integrating Contextual Representation and Background Knowledge | Keqing He, Yuanmeng Yan and Weiran Xu | N/A | N/A |
| Learning to Understand Child-directed and Adult-directed Speech | Lieke Gelderloos, Grzegorz Chrupała and Afra Alishahi | N/A | N/A |
| Let Me Choose: From Verbal Context to Font Selection | Amirreza Shirani, Franck Dernoncourt, Jose Echevarria, Paul Asente, Nedim Lipka and Thamar Solorio | N/A | N/A |
| Leveraging Monolingual Data with Self-Supervision for Multilingual Neural Machine Translation | Aditya Siddhant, Ankur Bapna, Yuan Cao, Orhan Firat, Mia Chen, Sneha Kudugunta, Naveen Arivazhagan and Yonghui Wu | N/A | N/A |
| Lexically Constrained Neural Machine Translation with Levenshtein Transformer | Raymond Hendy Susanto, Shamil Chollampatt and Liling Tan | N/A | N/A |
| Lipschitz Constrained Parameter Initialization for Deep Transformers | Hongfei Xu, Qiuhui Liu, Josef van Genabith, Deyi Xiong and Jingyi Zhang | N/A | N/A |
| Logic-Guided Data Augmentation and Regularization for Consistent Question Answering | Akari Asai and Hannaneh Hajishirzi | N/A | N/A |
| Low Resource Sequence Tagging using Sentence Reconstruction | Tal Perl, Sriram Chaudhury and Raja Giryes | N/A | N/A |
| Make Up Your Mind! Adversarial Generation of Inconsistent Natural Language Explanations | Oana-Maria Camburu, Brendan Shillingford, Pasquale Minervini, Thomas Lukasiewicz and Phil Blunsom | N/A | N/A |
| Masking Actor Information Leads to Fairer Political Claims Detection | Erenay Dayanik and Sebastian Padó | N/A | N/A |
| Meta-Transfer Learning for Code-Switched Speech Recognition | Genta Indra Winata, Samuel Cahyawijaya, Zhaojiang Lin, Zihan Liu, Peng Xu and Pascale Fung | N/A | N/A |
| Mitigating Gender Bias Amplification in Distribution by Posterior Regularization | Shengyu Jia, Tao Meng, Jieyu Zhao and Kai-Wei Chang | N/A | N/A |
| Modeling Label Semantics for Predicting Emotional Reactions | Radhika Gaonkar, Heeyoung Kwon, Mohaddeseh Bastan, Niranjan Balasubramanian and Nathanael Chambers | N/A | N/A |
| Modeling Long Context for Task-Oriented Dialogue State Generation | Jun Quan and Deyi Xiong | N/A | N/A |
| Modeling Word Formation in English–German Neural Machine Translation | Marion Weller-Di Marco and Alexander Fraser | N/A | N/A |
| MOOCCube: A Large-scale Data Repository for NLP Applications in MOOCs | Jifan Yu, Gan Luo, Tong Xiao, Qingyang Zhong, Yuquan Wang, Wenzheng Feng, Junyi Luo, Chenyu Wang, Lei Hou, Juanzi Li, Zhiyuan Liu and Jie Tang | N/A | N/A |
| Multimodal and Multiresolution Speech Recognition with Transformers | Georgios Paraskevopoulos, Srinivas Parthasarathy, Aparna Khare and Shiva Sundaram | N/A | N/A |
| Multimodal Quality Estimation for Machine Translation | Shu Okabe, Frédéric Blain and Lucia Specia | N/A | N/A |
| Multimodal Transformer for Multimodal Machine Translation | Shaowei Yao and Xiaojun Wan | N/A | N/A |
| Named Entity Recognition as Dependency Parsing | Juntao Yu, Bernd Bohnet and Massimo Poesio | N/A | N/A |
| Negated and Misprimed Probes for Pretrained Language Models: Birds Can Talk, But Cannot Fly | Nora Kassner and Hinrich Schütze | N/A | N/A |
| Neural Graph Matching Networks for Chinese Short Text Matching | Lu Chen, Yanbin Zhao, Boer Lv, Lesheng Jin, Zhi Chen, Su Zhu and Kai Yu | N/A | N/A |
| Neural Temporal Opinion Modelling for Opinion Prediction on Twitter | Lixing Zhu, Yulan He and Deyu Zhou | N/A | N/A |
| Neural-DINF: A Neural Network based Framework for Measuring Document Influence | Jie Tan, Changlin Yang, Ying Li, Siliang Tang, Chen Huang and Yueting Zhuang | N/A | N/A |
| Non-Linear Instance-Based Cross-Lingual Mapping for Non-Isomorphic Embedding Spaces | Goran Glavaš and Ivan Vulić | N/A | N/A |
| “None of the Above”: Measure Uncertainty in Dialog Response Retrieval | Yulan Feng, Shikib Mehri, Maxine Eskenazi and Tiancheng Zhao | N/A | N/A |
| On Exposure Bias, Hallucination and Domain Shift in Neural Machine Translation | Chaojun Wang and Rico Sennrich | N/A | N/A |
| On Forgetting to Cite Older Papers: An Analysis of the ACL Anthology | Marcel Bollmann and Desmond Elliott | N/A | N/A |
| On Importance Sampling-Based Evaluation of Latent Language Models | Robert L Logan IV, Matt Gardner and Sameer Singh | N/A | N/A |
| On the Importance of Diversity in Question Generation for QA | Md Arafat Sultan, Shubham Chandel, Ramón Fernandez Astudillo and Vittorio Castelli | N/A | N/A |
| On the Spontaneous Emergence of Discrete and Compositional Signals | Nur Geffen Lan, Emmanuel Chemla and Shane Steinert-Threlkeld | N/A | N/A |
| OpinionDigest: A Simple Framework for Opinion Summarization | Yoshihiko Suhara, Xiaolan Wang, Stefanos Angelidis and Wang-Chiew Tan | N/A | N/A |
| Opportunistic Decoding with Timely Correction for Simultaneous Translation | Renjie Zheng, Mingbo Ma, Baigong Zheng, Kaibo Liu and Liang Huang | N/A | N/A |
| Overestimation of Syntactic Representation in Neural Language Models | Jordan Kodner and Nitish Gupta | N/A | N/A |
| Parallel Data Augmentation for Formality Style Transfer | Yi Zhang, Tao Ge and Xu Sun | N/A | N/A |
| Parallel Sentence Mining by Constrained Decoding | Pinzhen Chen, Nikolay Bogoychev, Kenneth Heafield and Faheem Kirefu | N/A | N/A |
| Posterior Calibrated Training on Sentence Classification Tasks | Taehee Jung, Dongyeop Kang, Hua Cheng, Lucas Mentch and Thomas Schaaf | N/A | N/A |
| Predicting Degrees of Technicality in Automatic Terminology Extraction | Anna Hätty, Dominik Schlechtweg, Michael Dorna and Sabine Schulte im Walde | N/A | N/A |
| Pretrained Transformers Improve Out-of-Distribution Robustness | Dan Hendrycks, Xiaoyuan Liu, Eric Wallace, Adam Dziedzic, Rishabh Krishnan and Dawn Song | N/A | N/A |
| Quantifying Attention Flow in Transformers | Samira Abnar and Willem Zuidema | N/A | N/A |
| Query Graph Generation for Answering Multi-hop Complex Questions from Knowledge Bases | Yunshi Lan and Jing Jiang | N/A | N/A |
| R4C: A Benchmark for Evaluating RC Systems to Get the Right Answer for the Right Reason | Naoya Inoue, Pontus Stenetorp and Kentaro Inui | N/A | N/A |
| Recollection versus Imagination: Exploring Human Memory and Cognition via Neural Language Models | Maarten Sap, Eric Horvitz, Yejin Choi, Noah A. Smith and James Pennebaker | N/A | N/A |
| Recursive Template-based Frame Generation for Task Oriented Dialog | Rashmi Gangadharaiah and Balakrishnan Narayanaswamy | N/A | N/A |
| Regularized Context Gates on Transformer for Machine Translation | Xintong Li, Lemao Liu, Rui Wang, Guoping Huang and Max Meng | N/A | N/A |
| Relation Extraction with Explanation | Hamed Shahbazi, Xiaoli Fern, Reza Ghaeini and Prasad Tadepalli | N/A | N/A |
| Representations of Syntax [MASK] Useful: Effects of Constituency and Dependency Structure in Recursive LSTMs | Michael Lepori, Tal Linzen and R. Thomas McCoy | N/A | N/A |
| Returning the N to NLP: Towards Contextually Personalized Classification Models | Lucie Flek | N/A | N/A |
| Reverse Engineering Configurations of Neural Text Generation Models | Yi Tay, Dara Bahri, Che Zheng, Clifford Brunk, Donald Metzler and Andrew Tomkins | N/A | N/A |
| Revisiting Higher-Order Dependency Parsers | Erick Fonseca and André F. T. Martins | N/A | N/A |
| Revisiting Unsupervised Relation Extraction | Thy Thy Tran, Phong Le and Sophia Ananiadou | N/A | N/A |
| SAFER: A Structure-free Approach for Certified Robustness to Adversarial Word Substitutions | Mao Ye, Chengyue Gong and Qiang Liu | N/A | N/A |
| Self-Attention Guided Copy Mechanism for Abstractive Summarization | Song Xu, Haoran Li, Peng Yuan, Youzheng Wu, Xiaodong He and Bowen Zhou | N/A | N/A |
| Self-Attention with Cross-Lingual Position Representation | Liang Ding, Longyue Wang and Dacheng Tao | N/A | N/A |
| Sentence Meta-Embeddings for Unsupervised Semantic Textual Similarity | Nina Poerner, Ulli Waltinger and Hinrich Schütze | N/A | N/A |
| Shape of synth to come: Why we should use synthetic data for English surface realization | Henry Elder, Robert Burke, Alexander O’Connor and Jennifer Foster | N/A | N/A |
| Shaping Visual Representations with Language for Few-Shot Classification | Jesse Mu, Percy Liang and Noah Goodman | N/A | N/A |
| Showing Your Work Doesn’t Always Work | Raphael Tang, Jaejun Lee, Ji Xin, Xinyu Liu, Yaoliang Yu and Jimmy Lin | N/A | N/A |
| Simple and Effective Retrieve-Edit-Rerank Text Generation | Nabil Hossain, Marjan Ghazvininejad and Luke Zettlemoyer | N/A | N/A |
| Simultaneous Translation Policies: From Fixed to Adaptive | Baigong Zheng, Kaibo Liu, Renjie Zheng, Mingbo Ma, Hairong Liu and Liang Huang | N/A | N/A |
| Single Model Ensemble using Pseudo-Tags and Distinct Vectors | Ryosuke Kuwabara, Jun Suzuki and Hideki Nakayama | N/A | N/A |
| Smart To-Do: Automatic Generation of To-Do Items from Emails | Sudipto Mukherjee, Subhabrata Mukherjee, Marcello Hasegawa, Ahmed Hassan Awadallah and Ryen White | N/A | N/A |
| Social Biases in NLP Models as Barriers for Persons with Disabilities | Ben Hutchinson, Vinodkumar Prabhakaran, Emily Denton, Kellie Webster, Yu Zhong and Stephen Denuyl | N/A | N/A |
| Soft Gazetteers for Low-Resource Named Entity Recognition | Shruti Rijhwani, Shuyan Zhou, Graham Neubig and Jaime Carbonell | N/A | N/A |
| Span-ConveRT: Few-shot Span Extraction for Dialog with Pretrained Conversational Representations | Samuel Coope, Tyler Farghly, Daniela Gerz, Ivan Vulić and Matthew Henderson | N/A | N/A |
| Stolen Probability: A Structural Weakness of Neural Language Models | David Demeter, Gregory Kimmel and Doug Downey | N/A | N/A |
| Successfully Applying the Stabilized Lottery Ticket Hypothesis to the Transformer Architecture | Christopher Brix, Parnia Bahar and Hermann Ney | N/A | N/A |
| SUPERT: Towards New Frontiers in Unsupervised Evaluation Metrics for Multi-Document Summarization | Yang Gao, Wei Zhao and Steffen Eger | N/A | N/A |
| Supervised Grapheme-to-Phoneme Conversion of Orthographic Schwas in Hindi and Punjabi | Aryaman Arora, Luke Gessler and Nathan Schneider | N/A | N/A |
| Syntactic Data Augmentation Increases Robustness to Inference Heuristics | Junghyun Min, R. Thomas McCoy, Dipanjan Das, Emily Pitler and Tal Linzen | N/A | N/A |
| Tagged Back-translation Revisited: Why Does It Really Work? | Benjamin Marie, Raphael Rubino and Atsushi Fujita | N/A | N/A |
| tBERT: Topic Models and BERT Joining Forces for Semantic Similarity Detection | Nicole Peinelt, Dong Nguyen and Maria Liakata | N/A | N/A |
| Template-Based Question Generation from Retrieved Sentences for Improved Unsupervised Question Answering | Alexander Fabbri, Patrick Ng, Zhiguo Wang, Ramesh Nallapati and Bing Xiang | N/A | N/A |
| Tetra-Tagging: Word-Synchronous Parsing with Linear-Time Inference | Nikita Kitaev and Dan Klein | N/A | N/A |
| Text Classification with Negative Supervision | Sora Ohashi, Junya Takayama, Tomoyuki Kajiwara, Chenhui Chu and Yuki Arase | N/A | N/A |
| To Pretrain or Not to Pretrain: Examining the Benefits of Pretraining on Resource Rich Tasks | Sinong Wang, Madian Khabsa and Hao Ma | N/A | N/A |
| Topological Sort for Sentence Ordering | Shrimai Prabhumoye, Ruslan Salakhutdinov and Alan W Black | N/A | N/A |
| Toward Better Storylines with Sentence-Level Language Models | Daphne Ippolito, David Grangier, Douglas Eck and Chris Callison-Burch | N/A | N/A |
| Towards Better Non-Tree Argument Mining: Proposition-Level Biaffine Parsing with Task-Specific Parameterization | Gaku Morio, Hiroaki Ozaki, Terufumi Morishita, Yuta Koreeda and Kohsuke Yanai | N/A | N/A |
| Towards end-2-end learning for predicting behavior codes from spoken utterances in psychotherapy conversations | Karan Singla, Zhuohao Chen, David Atkins and Shrikanth Narayanan | N/A | N/A |
| Towards Faithfully Interpretable NLP Systems: How should we define and evaluate faithfulness? | Alon Jacovi and Yoav Goldberg | N/A | N/A |
| Towards Open Domain Event Trigger Identification using Adversarial Domain Adaptation | Aakanksha Naik and Carolyn Rose | N/A | N/A |
| Transformers to Learn Hierarchical Contexts in Multiparty Dialogue for Span-based Question Answering | Changmao Li and Jinho D. Choi | N/A | N/A |
| Treebank Embedding Vectors for Out-of-domain Dependency Parsing | Joachim Wagner, James Barry and Jennifer Foster | N/A | N/A |
| Tree-Structured Neural Topic Model | Masaru Isonuma, Junichiro Mori, Danushka Bollegala and Ichiro Sakata | N/A | N/A |
| TriggerNER: Learning with Entity Triggers as Explanations for Named Entity Recognition | Bill Yuchen Lin, Dong-Ho Lee, Ming Shen, Ryan Moreno, Xiao Huang, Prashant Shiralkar and Xiang Ren | N/A | N/A |
| Two Birds, One Stone: A Simple, Unified Model for Text Generation from Structured and Unstructured Data | Hamidreza Shahidi, Ming Li and Jimmy Lin | N/A | N/A |
| Uncertain Natural Language Inference | Tongfei Chen, Zhengping Jiang, Adam Poliak, Keisuke Sakaguchi and Benjamin Van Durme | N/A | N/A |
| Understanding Advertisements with BERT | Kanika Kalra, Bhargav Kurma, Silpa Vadakkeeveetil Sreelatha, Manasi Patwardhan and Shirish Karande | N/A | N/A |
| Unsupervised FAQ Retrieval with Question Generation and BERT | Yosi Mass, Boaz Carmeli, Haggai Roitman and David Konopnicki | N/A | N/A |
| Using Context in Neural Machine Translation Training Objectives | Danielle Saunders, Felix Stahlberg and Bill Byrne | N/A | N/A |
| Variational Neural Machine Translation with Normalizing Flows | Hendra Setiawan, Matthias Sperber, Udhyakumar Nallasamy and Matthias Paulik | N/A | N/A |
| Verbal Multiword Expressions for Identification of Metaphor | Omid Rohanian, Marek Rei, Shiva Taslimipoor and Le An Ha | N/A | N/A |
| Video-Grounded Dialogues with Pretrained Generation Language Models | Hung Le and Steven C.H. Hoi | N/A | N/A |
| What Does BERT with Vision Look At? | Liunian Harold Li, Mark Yatskar, Da Yin, Cho-Jui Hsieh and Kai-Wei Chang | N/A | N/A |
| What is Learned in Visually Grounded Neural Syntax Acquisition | Noriyuki Kojima, Hadar Averbuch-Elor, Alexander Rush and Yoav Artzi | N/A | N/A |
| Why Overfitting Isn’t Always Bad: Retrofitting Cross-Lingual Word Embeddings to Dictionaries | Mozhi Zhang, Yoshinari Fujinuma, Michael J. Paul and Jordan Boyd-Graber | N/A | N/A |
| Will-They-Won’t-They: A Very Large Dataset for Stance Detection on Twitter | Costanza Conforti, Jakob Berndt, Mohammad Taher Pilehvar, Chryssi Giannitsarou, Flavio Toxvaerd and Nigel Collier | N/A | N/A |
| Words aren’t enough, their order matters: On the Robustness of Grounding Visual Referring Expressions | Arjun Akula, Spandana Gella, Yaser Al-Onaizan, Song-Chun Zhu and Siva Reddy | N/A | N/A |
| Worse WER, but Better BLEU? Leveraging Word Embedding as Intermediate in Multitask End-to-End Speech Translation | Shun-Po Chuang, Tzu-Wei Sung, Alexander H. Liu and Hung-yi Lee | N/A | N/A |
| Would you Rather? A New Benchmark for Learning Machine Alignment with Cultural Values and Social Preferences | Yi Tay, Donovan Ong, Jie Fu, Alvin Chan, Nancy Chen, Anh Tuan Luu and Chris Pal | N/A | N/A |
| You Don’t Have Time to Read This: An Exploration of Document Reading Time Prediction | Orion Weller, Jordan Hildebrandt, Ilya Reznik, Christopher Challis, E. Shannon Tass, Quinn Snell and Kevin Seppi | N/A | N/A |
| “You Sound Just Like Your Father” Commercial Machine Translation Systems Include Stylistic Biases | Dirk Hovy, Federico Bianchi and Tommaso Fornaciari | N/A | N/A |
| ZPR2: Joint Zero Pronoun Recovery and Resolution using Multi-Task Learning and BERT | Linfeng Song, Kun Xu, Yue Zhang, Jianshu Chen and Dong Yu | N/A | N/A |
| ADVISER: A Toolkit for Developing Multi-modal, Multi-domain and Socially-engaged Conversational Agents | Chia-Yu Li, Daniel Ortega, Dirk Väth, Florian Lux, Lindsey Vanderlyn, Maximilian Schmidt, Michael Neumann, Moritz Völkel, Pavel Denisov, Sabrina Jenne, Zorica Kacarevic and Ngoc Thang Vu | N/A | N/A |
| BENTO: A Visual Platform for Building Clinical NLP Pipelines Based on CodaLab | Yonghao Jin, Fei Li and Hong Yu | N/A | N/A |
| Clinical-Coder: Assigning Interpretable ICD-10 Codes to Chinese Clinical Notes | Pengfei Cao, Chenwei Yan, Xiangling Fu, Yubo Chen, Kang Liu, Jun Zhao, Shengping Liu and Weifeng Chong | N/A | N/A |
| CLIReval: Evaluating Machine Translation as a Cross-Lingual Information Retrieval Task | Shuo Sun, Suzanna Sia and Kevin Duh | N/A | N/A |
| Conversation Learner - A Machine Teaching Tool for Building Dialog Managers for Task-Oriented Dialog Systems | Swadheen Shukla, Lars Liden, Shahin Shayandeh, Eslam Kamal, Jinchao Li, Matt Mazzola, Thomas Park, Baolin Peng and Jianfeng Gao | N/A | N/A |
| ConvLab-2: An Open-Source Toolkit for Building, Evaluating, and Diagnosing Dialogue Systems | Qi Zhu, Zheng Zhang, Yan Fang, Xiang Li, Ryuichi Takanobu, Jinchao Li, Baolin Peng, Jianfeng Gao, Xiaoyan Zhu and Minlie Huang | N/A | N/A |
| DIALOGPT: Large-Scale Generative Pre-training for Conversational Response Generation | Yizhe Zhang, Siqi Sun, Michel Galley, Yen-Chun Chen, Chris Brockett, Xiang Gao, Jianfeng Gao, Jingjing Liu and Bill Dolan | N/A | N/A |
| Embedding-based Scientific Literature Discovery in a Text Editor Application | Onur Gökçe, Jonathan Prada, Nikola Nikolov, Nianlong Gu and Richard Hahnloser | N/A | N/A |
| ESPnet-ST: All-in-One Speech Translation Toolkit | Hirofumi Inaguma, Shun Kiyono, Kevin Duh, Shigeki Karita, Nelson Yalta, Tomoki Hayashi and Shinji Watanabe | N/A | N/A |
| EVIDENCEMINER: Textual Evidence Discovery for Life Sciences | Xuan Wang, Yingjun Guan, Weili Liu, Aabhas Chauhan, Enyi Jiang, Qi Li, David Liem, Dibakar Sigdel, John Caufield, Peipei Ping and Jiawei Han | N/A | N/A |
| exBERT: A Visual Analysis Tool to Explore Learned Representations in Transformer Models | Benjamin Hoover, Hendrik Strobelt and Sebastian Gehrmann | N/A | N/A |
| GAIA: A Fine-grained Multimedia Knowledge Extraction System | Manling Li, Alireza Zareian, Ying Lin, Xiaoman Pan, Spencer Whitehead, Brian Chen, Bo Wu, Heng Ji, Shih-Fu Chang, Clare Voss, Daniel Napierski and Marjorie Freedman | N/A | N/A |
| Interactive Task Learning from GUI-Grounded Natural Language Instructions and Demonstrations | Toby Jia-Jun Li, Tom Mitchell and Brad Myers | N/A | N/A |
| jiant: A Software Toolkit for Research on General-Purpose Text Understanding Models | Yada Pruksachatkun, Phil Yeres, Haokun Liu, Jason Phang, Phu Mon Htut, Alex Wang, Ian Tenney and Samuel R. Bowman | N/A | N/A |
| Label Noise in Context | Michael Desmond, Catherine Finegan-Dollak, Jeff Boston and Matt Arnold | N/A | N/A |
| LEAN-LIFE: A Label-Efficient Annotation Framework Towards Learning from Explanation | Dong-Ho Lee, Rahul Khanna, Bill Yuchen Lin, Seyeon Lee, Qinyuan Ye, Elizabeth Boschee, Leonardo Neves and Xiang Ren | N/A | N/A |
| LinggleWrite: a Coaching System for Essay Writing | Chung-Ting Tsai, Jhih-Jie Chen, Chingyu Yang and Jason Chang | N/A | N/A |
| MixingBoard: a Knowledgeable Stylized Integrated Text Generation Platform | Xiang Gao, Michel Galley and Bill Dolan | N/A | N/A |
| MMPE: A Multi-Modal Interface using Handwriting, Touch Reordering, and Speech Commands for Post-Editing Machine Translation | Nico Herbig, Santanu Pal, Tim Düwel, Kalliopi Maria Meladaki, Mahsa Monshizadeh, Vladislav Hnatovskiy, Antonio Krüger and Josef van Genabith | N/A | N/A |
| Multilingual Universal Sentence Encoder for Semantic Retrieval | Yinfei Yang, Daniel Cer, Amin Ahmad, Mandy Guo, Jax Law, Noah Constant, Gustavo Hernandez Abrego, Steve Yuan, Chris Tar, Yun-hsuan Sung, Brian Strope and Ray Kurzweil | N/A | N/A |
| Nakdan: Professional Hebrew Diacritizer | Avi Shmidman, Shaltiel Shmidman, Moshe Koppel and Yoav Goldberg | N/A | N/A |
| NLP Scholar: An Interactive Visual Explorer for Natural Language Processing Literature | Saif Mohammad | N/A | N/A |
| NSTM: Real-Time Query-Driven News Overview Composition at Bloomberg | Joshua Bambrick, Minjie Xu, Andy Almonte, Igor Malioutov, Guim Perarnau, Vittorio Selo and Iat Chong Chan | N/A | N/A |
| OpusFilter: A Configurable Parallel Corpus Filtering Toolbox | Mikko Aulamo, Sami Virpioja and Jörg Tiedemann | N/A | N/A |
| Penman: An Open-Source Library and Tool for AMR Graphs | Michael Wayne Goodman | N/A | N/A |
| Personalized PageRank with Syntagmatic Information for Multilingual Word Sense Disambiguation | Federico Scozzafava, Marco Maru, Fabrizio Brignone, Giovanni Torrisi and Roberto Navigli | N/A | N/A |
| Photon: A Robust Cross-Domain Text-to-SQL System | Jichuan Zeng, Xi Victoria Lin, Steven C.H. Hoi, Richard Socher, Caiming Xiong, Michael Lyu and Irwin King | N/A | N/A |
| Prta: A System to Support the Analysis of Propaganda Techniques in the News | Giovanni Da San Martino, Shaden Shaar, Yifan Zhang, Seunghak Yu, Alberto Barrón-Cedeño and Preslav Nakov | N/A | N/A |
| pyBART: Evidence-based Syntactic Transformations for IE | Aryeh Tiktinsky, Yoav Goldberg and Reut Tsarfaty | N/A | N/A |
| Stanza: A Python Natural Language Processing Toolkit for Many Human Languages | Peng Qi, Yuhao Zhang, Yuhui Zhang, Jason Bolton and Christopher D. Manning | N/A | N/A |
| Stimulating Creativity with FunLines: A Case Study of Humor Generation in Headlines | Nabil Hossain, John Krumm, Tanvir Sajed and Henry Kautz | N/A | N/A |
| SUPP.AI: finding evidence for supplement-drug interactions | Lucy Wang, Oyvind Tafjord, Arman Cohan, Sarthak Jain, Sam Skjonsberg, Carissa Schoenick, Nick Botner and Waleed Ammar | N/A | N/A |
| Syntactic Search by Example | Micah Shlain, Hillel Taub-Tabib, Shoval Sadde and Yoav Goldberg | N/A | N/A |
| SyntaxGym: An Online Platform for Targeted Evaluation of Language Models | Jon Gauthier, Jennifer Hu, Ethan Wilcox, Peng Qian and Roger Levy | N/A | N/A |
| Tabouid: a Wikipedia-based word guessing game | Timothée Bernard | N/A | N/A |
| Talk to Papers: Bringing Neural Question Answering to Academic Search | Tiancheng Zhao and Kyusong Lee | N/A | N/A |
| TextBrewer: An Open-Source Knowledge Distillation Toolkit for Natural Language Processing | Ziqing Yang, Yiming Cui, Zhipeng Chen, Wanxiang Che, Ting Liu, Shijin Wang and Guoping Hu | N/A | N/A |
| The Microsoft Toolkit of Multi-Task Deep Neural Networks for Natural Language Understanding | Xiaodong Liu, Yu Wang, Jianshu Ji, Hao Cheng, Xueyun Zhu, Emmanuel Awa, Pengcheng He, Weizhu Chen, Hoifung Poon, Guihong Cao and Jianfeng Gao | N/A | N/A |
| Torch-Struct: Deep Structured Prediction Library | Alexander Rush | N/A | N/A |
| Trialstreamer: Mapping and Browsing Medical Evidence in Real-Time | Benjamin Nye, Ani Nenkova, Iain Marshall and Byron C. Wallace | N/A | N/A |
| Usnea: An Authorship Tool for Interactive Fiction using Retrieval Based Semantic Parsing | Ben Swanson and Boris Smus | N/A | N/A |
| What’s The Latest? A Question-driven News Chatbot | Philippe Laban, John Canny and Marti A. Hearst | N/A | N/A |
| Xiaomingbot: A Multilingual Robot News Reporter | Runxin Xu, Jun Cao, Mingxuan Wang, Jiaze Chen, Hao Zhou, Ying Zeng, Yuping Wang, Li Chen, Xiang Yin, Xijin Zhang, Songcheng Jiang, Yuxuan Wang and Lei Li | N/A | N/A |
| #NotAWhore! A Computational Linguistic Perspective of Rape Culture and Victimization on Social Media | Ashima Suvarna and Grusha Bhalla | N/A | N/A |
| A Geometry-Inspired Attack for Generating Natural Language Adversarial Examples | Zhao Meng and Roger Wattenhofer | N/A | N/A |
| A Simple and Effective Dependency parser for Telugu | Sneha Nallani, Manish Shrivastava and Dipti Sharma | N/A | N/A |
| Adaptive Transformers for Learning Multimodal Representations | Prajjwal Bhargava | N/A | N/A |
| AraDIC: Arabic Document Classification Using Image-Based Character Embeddings and Class-Balanced Loss | Mahmoud Daif, Shunsuke Kitada and Hitoshi Iyatomi | N/A | N/A |
| Building a Japanese Typo Dataset from Wikipedia’s Revision History | Yu Tanaka, Yugo Murawaki, Daisuke Kawahara and Sadao Kurohashi | N/A | N/A |
| Checkpoint Reranking: An Approach To Select Better Hypothesis For Neural Machine Translation Systems | Vinay Pandramish and Dipti Misra Sharma | N/A | N/A |
| Combining Subword Representations into Word-level Representations in the Transformer Architecture | Noe Casas, Marta R. Costa-jussà and José A. R. Fonollosa | N/A | N/A |
| Compositional generalization by factorizing alignment and translation | Jacob Russin, Jason Jo, Randall O’Reilly and Yoshua Bengio | N/A | N/A |
| Considering Likelihood in NLP Classification Explanations with Occlusion and Language Modeling | David Harbecke and Christoph Alt | N/A | N/A |
| Crossing the Line: Where do Demographic Variables Fit into Humor Detection? | J. A. Meaney | N/A | N/A |
| Cross-Lingual Disaster-related Multi-label Tweet Classification with Manifold Mixup | Jishnu Ray Chowdhury, Cornelia Caragea and Doina Caragea | N/A | N/A |
| Dominance as an Indicator of Rapport and Learning in Human-Agent Communication | Amanda Buddemeyer, Xiaoyi Tian and Erin Walker | N/A | N/A |
| Effectively Aligning and Filtering Parallel Corpora under Sparse Data Conditions | Steinþór Steingrímsson | N/A | N/A |
| Efficient Neural Machine Translation for Low-Resource Languages via Exploiting Related Languages | Vikrant Goyal, Sourav Kumar and Dipti Misra Sharma | N/A | N/A |
| Embeddings of Label Components for Sequence Labeling: A Case Study of Fine-grained Named Entity Recognition | Takuma Kato, Kaori Abe, Hiroki Ouchi, Shumpei Miyawaki, Jun Suzuki and Kentaro Inui | N/A | N/A |
| Enhancing Word Embeddings with Knowledge Extracted from Lexical Resources | Magdalena Biesialska, Bardia Rafieian and Marta R. Costa-jussà | N/A | N/A |
| Exploring Interpretability in Event Extraction: Multitask Learning of a Neural Event Classifier and an Explanation Decoder | Zheng Tang, Gus Hahn-Powell and Mihai Surdeanu | N/A | N/A |
| Exploring the Role of Context to Distinguish Rhetorical and Information-Seeking Questions | Yuan Zhuang and Ellen Riloff | N/A | N/A |
| Feature Difference Makes Sense: A medical image captioning model exploiting feature difference and tag information | Hyeryun Park, Kyungmo Kim, Jooyoung Yoon, Seongkeun Park and Jinwook Choi | N/A | N/A |
| Grammatical Error Correction Using Pseudo Learner Corpus Considering Learner’s Error Tendency | Yujin Takahashi, Satoru Katsumata and Mamoru Komachi | N/A | N/A |
| HGCN4MeSH: Hybrid Graph Convolution Network for MeSH Indexing | Miaomiao Yu, Yujiu Yang and Chenhui Li | N/A | N/A |
| How much complexity does an RNN architecture need to learn syntax-sensitive dependencies? | Gantavya Bhatt, Hritik Bansal, Rishubh Singh and Sumeet Agarwal | N/A | N/A |
| υBLEU: Uncertainty-Aware Automatic Evaluation Method for Open-Domain Dialogue Systems | Tsuta Yuma, Naoki Yoshinaga and Masashi Toyoda | N/A | N/A |
| Inducing Grammar from Long Short-Term Memory Networks by Shapley Decomposition | Yuhui Zhang and Allen Nie | N/A | N/A |
| Let’s be Humorous: Knowledge Enhanced Humor Generation | Hang Zhang, Dayiheng Liu, Jiancheng Lv and Luo Cheng | N/A | N/A |
| Logical Inferences with Comparatives and Generalized Quantifiers | Izumi Haruta, Koji Mineshima and Daisuke Bekki | N/A | N/A |
| Media Bias, the Social Sciences, and NLP: Automating Frame Analyses to Identify Bias by Word Choice and Labeling | Felix Hamborg | N/A | N/A |
| Multi-Task Neural Model for Agglutinative Language Translation | Yirong Pan, Xiao Li, Yating Yang and Rui Dong | N/A | N/A |
| Noise-Based Augmentation Techniques for Emotion Datasets: What do we Recommend? | Mimansa Jaiswal and Emily Mower Provost | N/A | N/A |
| Non-Topical Coherence in Social Talk: A Call for Dialogue Model Enrichment | Alex Lưu and Sophia A. Malamud | N/A | N/A |
| Pointwise Paraphrase Appraisal is Potentially Problematic | Hannah Chen, Yangfeng Ji and David Evans | N/A | N/A |
| Pre-training via Leveraging Assisting Languages for Neural Machine Translation | Haiyue Song, Raj Dabre, Zhuoyuan Mao, Fei Cheng, Sadao Kurohashi and Eiichiro Sumita | N/A | N/A |
| Preventing Critical Scoring Errors in Short Answer Scoring with Confidence Estimation | Hiroaki Funayama, Shota Sasaki, Yuichiroh Matsubayashi, Tomoya Mizumoto, Jun Suzuki, Masato Mita and Kentaro Inui | N/A | N/A |
| Reflection-based Word Attribute Transfer | Yoichi Ishibashi, Katsuhito Sudoh, Koichiro Yoshino and Satoshi Nakamura | N/A | N/A |
| Research on Task Discovery for Transfer Learning in Deep Neural Networks | Arda Akdemir | N/A | N/A |
| Research Replication Prediction Using Weakly Supervised Learning | Tianyi Luo, Xingyu Li, Hainan Wang and Yang Liu | N/A | N/A |
| RPD: A Distance Function Between Word Embeddings | Xuhui Zhou, Shujian Huang and Zaixiang Zheng | N/A | N/A |
| SCAR: Sentence Compression using Autoencoders for Reconstruction | Chanakya Malireddy, Tirth Maniar and Manish Shrivastava | N/A | N/A |
| Self-Attention is Not Only a Weight: Analyzing BERT with Vector Norms | Goro Kobayashi, Tatsuki Kuribayashi, Sho Yokoi and Kentaro Inui | N/A | N/A |
| Story-level Text Style Transfer: A Proposal | Yusu Qian | N/A | N/A |
| To compress or not to compress? A Finite-State approach to Nen verbal morphology | Saliha Muradoglu, Nicholas Evans and Hanna Suominen | N/A | N/A |
| Topic balancing with additive regularization of topic models | Eugeniia Veselova and Konstantin Vorontsov | N/A | N/A |
| Transferring Monolingual Model to Low-Resource Language: The Case of Tigrinya | Abrhalei Frezghi Tela, Abraham Woubie Zewoudie and Ville Hautamäki | N/A | N/A |
| Understanding Points of Correspondence between Sentences for Abstractive Summarization | Logan Lebanoff, John Muchovej, Franck Dernoncourt, Doo Soon Kim, Lidan Wang, Walter Chang and Fei Liu | N/A | N/A |
| Unsupervised Multilingual Sentence Embeddings for Parallel Corpus Mining | Ivana Kvapilíková, Mikel Artetxe, Gorka Labaka, Eneko Agirre and Ondřej Bojar | N/A | N/A |
| Unsupervised Paraphasia Classification in Aphasic Speech | Sharan Pai, Nikhil Sachdeva, Prince Sachdeva and Rajiv Ratn Shah | N/A | N/A |
| Why is penguin more similar to polar bear than to sea gull? Analyzing conceptual knowledge in distributional models | Pia Sommerauer | N/A | N/A |
| Zero-shot North Korean to English Neural Machine Translation by Character Tokenization and Phoneme Decomposition | Hwichan Kim, Tosho Hirasawa and Mamoru Komachi | N/A | N/A |
| A Joint Neural Model for Information Extraction with Global Features | Ying Lin, Heng Ji, Fei Huang and Lingfei Wu | N/A | N/A |
| A Methodology for Creating Question Answering Corpora Using Inverse Data Annotation | Jan Deriu, Katsiaryna Mlynchyk, Philippe Schläpfer, Alvaro Rodrigo, Dirk von Grünigen, Nicolas Kaiser, Kurt Stockinger, Eneko Agirre and Mark Cieliebak | N/A | N/A |
| A Mixture of h − 1 Heads is Better than h Heads | Hao Peng, Roy Schwartz, Dianqi Li and Noah A. Smith | N/A | N/A |
| A Monolingual Approach to Contextualized Word Embeddings for Mid-Resource Languages | Pedro Javier Ortiz Suárez, Laurent Romary and Benoît Sagot | N/A | N/A |
| A Multitask Learning Approach for Diacritic Restoration | Sawsan Alqahtani, Ajay Mishra and Mona Diab | N/A | N/A |
| A Novel Cascade Binary Tagging Framework for Relational Triple Extraction | Zhepei Wei, Jianlin Su, Yue Wang, Yuan Tian and Yi Chang | N/A | N/A |
| A Novel Graph-based Multi-modal Fusion Encoder for Neural Machine Translation | Yongjing Yin, Fandong Meng, Jinsong Su, Chulun Zhou, Zhengyuan Yang, Jie Zhou and Jiebo Luo | N/A | N/A |
| A Prioritization Model for Suicidality Risk Assessment | Han-Chin Shing, Philip Resnik and Douglas Oard | N/A | N/A |
| A Recipe for Creating Multimodal Aligned Datasets for Sequential Tasks | Angela Lin, Sudha Rao, Asli Celikyilmaz, Elnaz Nouri, Chris Brockett, Debadeepta Dey and Bill Dolan | N/A | N/A |
| A Reinforced Generation of Adversarial Examples for Neural Machine Translation | Wei Zou, Shujian Huang, Jun Xie, Xinyu Dai and Jiajun Chen | N/A | N/A |
| A Self-Training Method for Machine Reading Comprehension with Soft Evidence Extraction | Yilin Niu, Fangkai Jiao, Mantong Zhou, Ting Yao, Jingfang Xu and Minlie Huang | N/A | N/A |
| A Span-based Linearization for Constituent Trees | Yang Wei, Yuanbin Wu and Man Lan | N/A | N/A |
| A Study of Non-autoregressive Model for Sequence Generation | Yi Ren, Jinglin Liu, Xu Tan, Zhou Zhao, Sheng Zhao and Tie-Yan Liu | N/A | N/A |
| A Systematic Assessment of Syntactic Generalization in Neural Language Models | Jennifer Hu, Jon Gauthier, Peng Qian, Ethan Wilcox and Roger Levy | N/A | N/A |
| A Tale of Two Perplexities: Sensitivity of Neural Language Models to Lexical Retrieval Deficits in Dementia of the Alzheimer’s Type | Trevor Cohen and Serguei Pakhomov | N/A | N/A |
| A Top-down Neural Architecture towards Text-level Parsing of Discourse Rhetorical Structure | Longyin Zhang, Yuqing Xing, Fang Kong, Peifeng Li and Guodong Zhou | N/A | N/A |
| A Unified MRC Framework for Named Entity Recognition | Xiaoya Li, Jingrong Feng, Yuxian Meng, Qinghong Han, Fei Wu and Jiwei Li | N/A | N/A |
| Adaptive Compression of Word Embeddings | Yeachan Kim, Kang-Min Kim and SangKeun Lee | N/A | N/A |
| Addressing Posterior Collapse with Mutual Information for Improved Variational Neural Machine Translation | Arya D. McCarthy, Xian Li, Jiatao Gu and Ning Dong | N/A | N/A |
| AdvAug: Robust Adversarial Augmentation for Neural Machine Translation | Yong Cheng, Lu Jiang, Wolfgang Macherey and Jacob Eisenstein | N/A | N/A |
| Adversarial and Domain-Aware BERT for Cross-Domain Sentiment Analysis | Chunning Du, Haifeng Sun, Jingyu Wang, Qi Qi and Jianxin Liao | N/A | N/A |
| Adversarial NLI: A New Benchmark for Natural Language Understanding | Yixin Nie, Adina Williams, Emily Dinan, Mohit Bansal, Jason Weston and Douwe Kiela | N/A | N/A |
| Agreement Prediction of Arguments in Cyber Argumentation for Detecting Stance Polarity and Intensity | Joseph Sirrianni, Xiaoqing Liu and Douglas Adams | N/A | N/A |
| Aligned Dual Channel Graph Convolutional Network for Visual Question Answering | Qingbao Huang, Jielong Wei, Yi Cai, Changmeng Zheng, Junying Chen, Ho-fung Leung and Qing Li | N/A | N/A |
| Amalgamation of protein sequence, structure and textual information for improving protein-protein interaction identification | Pratik Dutta and Sriparna Saha | N/A | N/A |
| AMR Parsing via Graph-Sequence Iterative Inference | Deng Cai and Wai Lam | N/A | N/A |
| AMR Parsing with Latent Structural Information | Qiji Zhou, Yue Zhang, Donghong Ji and Hao Tang | N/A | N/A |
| An analysis of the utility of explicit negative examples to improve the syntactic abilities of neural language models | Hiroshi Noji and Hiroya Takamura | N/A | N/A |
| An Effective Transition-based Model for Discontinuous NER | Xiang Dai, Sarvnaz Karimi, Ben Hachey and Cecile Paris | N/A | N/A |
| An Effectiveness Metric for Ordinal Classification: Formal Properties and Experimental Results | Enrique Amigo, Julio Gonzalo, Stefano Mizzaro and Jorge Carrillo-de-Albornoz | N/A | N/A |
| An Online Semantic-enhanced Dirichlet Model for Short Text Stream Clustering | Jay Kumar, Junming Shao, Salah Uddin and Wazir Ali | N/A | N/A |
| Analysing Lexical Semantic Change with Contextualised Word Representations | Mario Giulianelli, Marco Del Tredici and Raquel Fernández | N/A | N/A |
| Analyzing analytical methods: The case of phonology in neural models of spoken language | Grzegorz Chrupała, Bertrand Higy and Afra Alishahi | N/A | N/A |
| Analyzing Political Parody in Social Media | Antonios Maronikolakis, Danae Sánchez Villegas, Daniel Preotiuc-Pietro and Nikolaos Aletras | N/A | N/A |
| Are Natural Language Inference Models IMPPRESsive? Learning IMPlicature and PRESupposition | Paloma Jeretic, Alex Warstadt, Suvrat Bhooshan and Adina Williams | N/A | N/A |
| Asking and Answering Questions to Evaluate the Factual Consistency of Summaries | Alex Wang, Kyunghyun Cho and Mike Lewis | N/A | N/A |
| Aspect Sentiment Classification with Document-level Sentiment Preference Modeling | Xiao Chen, Changlong Sun, Jingjing Wang, Shoushan Li, Luo Si, Min Zhang and Guodong Zhou | N/A | N/A |
| ASSET: A Dataset for Tuning and Evaluation of Sentence Simplification Models with Multiple Rewriting Transformations | Fernando Alva-Manchego, Louis Martin, Antoine Bordes, Carolina Scarton, Benoît Sagot and Lucia Specia | N/A | N/A |
| Attend, Translate and Summarize: An Efficient Method for Neural Cross-Lingual Summarization | Junnan Zhu, Yu Zhou, Jiajun Zhang and Chengqing Zong | N/A | N/A |
| Attentive Pooling with Learnable Norms for Text Representation | Chuhan Wu, Fangzhao Wu, Tao Qi, Xiaohui Cui and Yongfeng Huang | N/A | N/A |
| Autoencoding Pixies: Amortised Variational Inference with Graph Convolutions for Functional Distributional Semantics | Guy Emerson | N/A | N/A |
| Automated Evaluation of Writing – 50 Years and Counting | Beata Beigman Klebanov and Nitin Madnani | N/A | N/A |
| Automatic Detection of Generated Text is Easiest when Humans are Fooled | Daphne Ippolito, Daniel Duckworth, Chris Callison-Burch and Douglas Eck | N/A | N/A |
| Automatic Generation of Citation Texts in Scholarly Papers: A Pilot Study | Xinyu Xing, Xiaosheng Fan and Xiaojun Wan | N/A | N/A |
| Automatic Poetry Generation from Prosaic Text | Tim Van de Cruys | N/A | N/A |
| BabyWalk: Going Farther in Vision-and-Language Navigation by Taking Baby Steps | Wang Zhu, Hexiang Hu, Jiacheng Chen, Zhiwei Deng, Vihan Jain, Eugene Ie and Fei Sha | N/A | N/A |
| Balancing Objectives in Counseling Conversations: Advancing Forwards or Looking Backwards | Justine Zhang and Cristian Danescu-Niculescu-Mizil | N/A | N/A |
| Balancing Training for Multilingual Neural Machine Translation | Xinyi Wang, Yulia Tsvetkov and Graham Neubig | N/A | N/A |
| BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension | Mike Lewis, Yinhan Liu, Naman Goyal, Marjan Ghazvininejad, Abdelrahman Mohamed, Omer Levy, Veselin Stoyanov and Luke Zettlemoyer | N/A | N/A |
| Benchmarking Multimodal Regex Synthesis with Complex Structures | Xi Ye, Qiaochu Chen, Isil Dillig and Greg Durrett | N/A | N/A |
| BERTRAM: Improved Word Embeddings Have Big Impact on Contextualized Model Performance | Timo Schick and Hinrich Schütze | N/A | N/A |
| Beyond Accuracy: Behavioral Testing of NLP Models with CheckList | Marco Tulio Ribeiro, Tongshuang Wu, Carlos Guestrin and Sameer Singh | N/A | N/A |
| Beyond Possession Existence: Duration and Co-Possession | Dhivya Chinnappa, Srikala Murugan and Eduardo Blanco | N/A | N/A |
| Beyond User Self-Reported Likert Scale Ratings: A Comparison Model for Automatic Dialog Evaluation | Weixin Liang, James Zou and Zhou Yu | N/A | N/A |
| Bilingual Dictionary Based Neural Machine Translation without Using Parallel Sentences | Xiangyu Duan, Baijun Ji, Hao Jia, Min Tan, Min Zhang, Boxing Chen, Weihua Luo and Yue Zhang | N/A | N/A |
| Biomedical Entity Representations with Synonym Marginalization | Mujeen Sung, Hwisang Jeon, Jinhyuk Lee and Jaewoo Kang | N/A | N/A |
| Bipartite Flat-Graph Network for Nested Named Entity Recognition | Ying Luo and Hai Zhao | N/A | N/A |
| BiRRE: Learning Bidirectional Residual Relation Embeddings for Supervised Hypernymy Detection | Chengyu Wang and Xiaofeng He | N/A | N/A |
| BLEURT: Learning Robust Metrics for Text Generation | Thibault Sellam, Dipanjan Das and Ankur Parikh | N/A | N/A |
| Boosting Neural Machine Translation with Similar Translations | Jitao Xu, Josep Crego and Jean Senellart | N/A | N/A |
| Bootstrapping Techniques for Polysynthetic Morphological Analysis | William Lane and Steven Bird | N/A | N/A |
| BPE-Dropout: Simple and Effective Subword Regularization | Ivan Provilkov, Dmitrii Emelianenko and Elena Voita | N/A | N/A |
| Breaking Through the 80% Glass Ceiling: Raising the State of the Art in Word Sense Disambiguation by Incorporating Knowledge Graph Information | Michele Bevilacqua and Roberto Navigli | N/A | N/A |
| Bridging Anaphora Resolution as Question Answering | Yufang Hou | N/A | N/A |
| Bridging the Structural Gap Between Encoding and Decoding for Data-To-Text Generation | Chao Zhao, Marilyn Walker and Snigdha Chaturvedi | N/A | N/A |
| Building a User-Generated Content North-African Arabizi Treebank: Tackling Hell | Djamé Seddah, Farah Essaidi, Amal Fethi, Matthieu Futeral, Benjamin Muller, Pedro Javier Ortiz Suárez, Benoît Sagot and Abhishek Srivastava | N/A | N/A |
| Calibrating Structured Output Predictors for Natural Language Processing | Abhyuday Jagannatha and Hong Yu | N/A | N/A |
| CamemBERT: a Tasty French Language Model | Louis Martin, Benjamin Muller, Pedro Javier Ortiz Suárez, Yoann Dupont, Laurent Romary, Éric de la Clergerie, Djamé Seddah and Benoît Sagot | N/A | N/A |
| Can We Predict New Facts with Open Knowledge Graph Embeddings? A Benchmark for Open Link Prediction | Samuel Broscheit, Kiril Gashteovski, Yanjie Wang and Rainer Gemulla | N/A | N/A |
| Can You Put it All Together: Evaluating Conversational Agents’ Ability to Blend Skills | Eric Michael Smith, Mary Williamson, Kurt Shuster, Jason Weston and Y-Lan Boureau | N/A | N/A |
| CDL: Curriculum Dual Learning for Emotion-Controllable Response Generation | Lei Shen and Yang Feng | N/A | N/A |
| ChartDialogs: Plotting from Natural Language Instructions | Yutong Shao and Ndapa Nakashole | N/A | N/A |
| CH-SIMS: A Chinese Multimodal Sentiment Analysis Dataset with Fine-grained Annotation of Modality | Wenmeng Yu, Hua Xu, Fanyang Meng, Yilin Zhu, Yixiao Ma, Jiele Wu, Jiyun Zou and Kaicheng Yang | N/A | N/A |
| Climbing towards NLU: On Meaning, Form, and Understanding in the Age of Data | Emily M. Bender and Alexander Koller | N/A | N/A |
| Clinical Reading Comprehension: A Thorough Analysis of the emrQA Dataset | Xiang Yue, Bernal Jimenez Gutierrez and Huan Sun | N/A | N/A |
| CluBERT: A Cluster-Based Approach for Learning Sense Distributions in Multiple Languages | Tommaso Pasini, Federico Scozzafava and Bianca Scarlini | N/A | N/A |
| CluHTM - Semantic Hierarchical Topic Modeling based on CluWords | Felipe Viegas, Washington Cunha, Christian Gomes, Antônio Pereira, Leonardo Rocha and Marcos Goncalves | N/A | N/A |
| Code and Named Entity Recognition in StackOverflow | Jeniya Tabassum, Mounica Maddela, Wei Xu and Alan Ritter | N/A | N/A |
| CompGuessWhat?!: A Multi-task Evaluation Framework for Grounded Language Learning | Alessandro Suglia, Ioannis Konstas, Andrea Vanzo, Emanuele Bastianelli, Desmond Elliott, Stella Frank and Oliver Lemon | N/A | N/A |
| Compositionality and Generalization In Emergent Languages | Rahma Chaabouni, Eugene Kharitonov, Diane Bouchacourt, Emmanuel Dupoux and Marco Baroni | N/A | N/A |
| Conditional Augmentation for Aspect Term Extraction via Masked Sequence-to-Sequence Generation | Kun Li, Chengbo Chen, Xiaojun Quan, Qing Ling and Yan Song | N/A | N/A |
| Connecting Embeddings for Knowledge Graph Entity Typing | Yu Zhao, Anxiang Zhang, Ruobing Xie, Kang Liu and Xiaojie Wang | N/A | N/A |
| Contextualized Weak Supervision for Text Classification | Dheeraj Mekala and Jingbo Shang | N/A | N/A |
| Continual Relation Learning via Episodic Memory Activation and Reconsolidation | Xu Han, Yi Dai, Tianyu Gao, Yankai Lin, Zhiyuan Liu, Peng Li, Maosong Sun and Jie Zhou | N/A | N/A |
| Conversational Graph Grounded Policy Learning for Open-Domain Conversation Generation | Jun Xu, Haifeng Wang, Zheng-Yu Niu, Hua Wu, Wanxiang Che and Ting Liu | N/A | N/A |
| CorefQA: Coreference Resolution as Query-based Span Prediction | Wei Wu, Fei Wang, Arianna Yuan, Fei Wu and Jiwei Li | N/A | N/A |
| Coupling Distant Annotation and Adversarial Training for Cross-Domain Chinese Word Segmentation | Ning Ding, Dingkun Long, Guangwei Xu, Muhua Zhu, Pengjun Xie, Xiaobin Wang and Haitao Zheng | N/A | N/A |
| CraftAssist Instruction Parsing: Semantic Parsing for a Voxel-World Assistant | Kavya Srinet, Yacine Jernite, Jonathan Gray and Arthur Szlam | N/A | N/A |
| Cross-Lingual Semantic Role Labeling with High-Quality Translated Training Corpus | Hao Fei, Meishan Zhang and Donghong Ji | N/A | N/A |
| Cross-Lingual Unsupervised Sentiment Classification with Multi-View Transfer Learning | Hongliang Fei and Ping Li | N/A | N/A |
| Cross-Linguistic Syntactic Evaluation of Word Prediction Models | Aaron Mueller, Garrett Nicolai, Panayiota Petrou-Zeniou, Natalia Talmina and Tal Linzen | N/A | N/A |
| Cross-media Structured Common Space for Multimedia Event Extraction | Manling Li, Alireza Zareian, Qi Zeng, Spencer Whitehead, Di Lu, Heng Ji and Shih-Fu Chang | N/A | N/A |
| Cross-modal Coherence Modeling for Caption Generation | Malihe Alikhani, Piyush Sharma, Shengjie Li, Radu Soricut and Matthew Stone | N/A | N/A |
| Cross-modal Language Generation using Pivot Stabilization for Web-scale Language Coverage | Ashish V. Thapliyal and Radu Soricut | N/A | N/A |
| Cross-Modality Relevance for Reasoning on Language and Vision | Chen Zheng, Quan Guo and Parisa Kordjamshidi | N/A | N/A |
| Curriculum Learning for Natural Language Understanding | Benfeng Xu, Licheng Zhang, Zhendong Mao, Quan Wang, Hongtao Xie and Yongdong Zhang | N/A | N/A |
| Curriculum Pre-training for End-to-End Speech Translation | Chengyi Wang, Yu Wu, Shujie Liu, Ming Zhou and Zhenglu Yang | N/A | N/A |
| Data Manipulation: Towards Effective Instance Learning for Neural Dialogue Generation via Learning to Augment and Reweight | Hengyi Cai, Hongshen Chen, Yonghao Song, Cheng Zhang, Xiaofang Zhao and Dawei Yin | N/A | N/A |
| DeFormer: Decomposing Pre-trained Transformers for Faster Question Answering | Qingqing Cao, Harsh Trivedi, Aruna Balasubramanian and Niranjan Balasubramanian | N/A | N/A |
| Demographics Should Not Be the Reason of Toxicity: Mitigating Discrimination in Text Classifications with Instance Weighting | Guanhua Zhang, Bing Bai, Junqi Zhang, Kun Bai, Conghui Zhu and Tiejun Zhao | N/A | N/A |
| Dense-Caption Matching and Frame-Selection Gating for Temporal Localization in VideoQA | Hyounghun Kim, Zineng Tang and Mohit Bansal | N/A | N/A |
| Dependency Graph Enhanced Dual-transformer Structure for Aspect-based Sentiment Classification | Hao Tang, Donghong Ji, Chenliang Li and Qiji Zhou | N/A | N/A |
| DeSePtion: Dual Sequence Prediction and Adversarial Examples for Improved Fact-Checking | Christopher Hidey, Tuhin Chakrabarty, Tariq Alhindi, Siddharth Varia, Kriste Krstovski, Mona Diab and Smaranda Muresan | N/A | N/A |
| Detecting Perceived Emotions in Hurricane Disasters | Shrey Desai, Cornelia Caragea and Junyi Jessy Li | N/A | N/A |
| Dialogue Coherence Assessment Without Explicit Dialogue Act Labels | Mohsen Mesgar, Sebastian Bücker and Iryna Gurevych | N/A | N/A |
| Dialogue-Based Relation Extraction | Dian Yu, Kai Sun, Claire Cardie and Dong Yu | N/A | N/A |
| Dice Loss for Data-imbalanced NLP Tasks | Xiaoya Li, Xiaofei Sun, Yuxian Meng, Junjun Liang, Fei Wu and Jiwei Li | N/A | N/A |
| Differentiable Window for Dynamic Local Attention | Thanh-Tung Nguyen, Xuan-Phi Nguyen, Shafiq Joty and Xiaoli Li | N/A | N/A |
| Discourse as a Function of Event: Profiling Discourse Structure in News Articles around the Main Event | Prafulla Kumar Choubey, Aaron Lee, Ruihong Huang and Lu Wang | N/A | N/A |
| Discourse-Aware Neural Extractive Text Summarization | Jiacheng Xu, Zhe Gan, Yu Cheng and Jingjing Liu | N/A | N/A |
| Discrete Latent Variable Representations for Low-Resource Text Classification | Shuning Jin, Sam Wiseman, Karl Stratos and Karen Livescu | N/A | N/A |
| Discrete Optimization for Unsupervised Sentence Summarization with Word-Level Extraction | Raphael Schumann, Lili Mou, Yao Lu, Olga Vechtomova and Katja Markert | N/A | N/A |
| Distilling Annotations via Active Imitation Learning | Kianté Brantley, Hal Daumé III and Amr Sharaf | N/A | N/A |
| Distilling Knowledge Learned in BERT for Text Generation | Yen-Chun Chen, Zhe Gan, Yu Cheng, Jingzhou Liu and Jingjing Liu | N/A | N/A |
| Distinguish Confusing Law Articles for Legal Judgment Prediction | Nuo Xu, Pinghui Wang, Long Chen, Li Pan, Xiaoyan Wang and Junzhou Zhao | N/A | N/A |
| Diverse and Informative Dialogue Generation with Context-Specific Commonsense Knowledge Awareness | Sixing Wu, Ying Li, Dawei Zhang, Yang Zhou and Zhonghai Wu | N/A | N/A |
| Diversifying Dialogue Generation with Non-Conversational Text | Hui Su, Xiaoyu Shen, Sanqiang Zhao, Zhou Xiao, Pengwei Hu, Randy Zhong, Cheng Niu and Jie Zhou | N/A | N/A |
| Do Neural Language Models Show Preferences for Syntactic Formalisms? | Artur Kulmizev, Vinit Ravishankar, Mostafa Abdou and Joakim Nivre | N/A | N/A |
| Do Neural Models Learn Systematicity of Monotonicity Inference in Natural Language? | Hitomi Yanaka, Koji Mineshima, Daisuke Bekki and Kentaro Inui | N/A | N/A |
| Document Modeling with Graph Attention Networks for Multi-grained Machine Reading Comprehension | Bo Zheng, Haoyang Wen, Yaobo Liang, Nan Duan, Wanxiang Che, Daxin Jiang, Ming Zhou and Ting Liu | N/A | N/A |
| Document Translation vs. Query Translation for Cross-Lingual Information Retrieval in the Medical Domain | Shadi Saleh and Pavel Pecina | N/A | N/A |
| Document-Level Event Role Filler Extraction using Multi-Granularity Contextualized Encoding | Xinya Du and Claire Cardie | N/A | N/A |
| Don’t Say That! Making Inconsistent Dialogue Unlikely with Unlikelihood Training | Margaret Li, Stephen Roller, Ilia Kulikov, Sean Welleck, Y-Lan Boureau, Kyunghyun Cho and Jason Weston | N/A | N/A |
| Don’t Stop Pretraining: Adapt Language Models to Domains and Tasks | Suchin Gururangan, Ana Marasović, Swabha Swayamdipta, Kyle Lo, Iz Beltagy, Doug Downey and Noah A. Smith | N/A | N/A |
| DoQA - Accessing Domain-Specific FAQs via Conversational QA | Jon Ander Campos, Arantxa Otegi, Aitor Soroa, Jan Deriu, Mark Cieliebak and Eneko Agirre | N/A | N/A |
| Double-Hard Debias: Tailoring Word Embeddings for Gender Bias Mitigation | Tianlu Wang, Xi Victoria Lin, Nazneen Fatema Rajani, Bryan McCann, Vicente Ordonez and Caiming Xiong | N/A | N/A |
| DRTS Parsing with Structure-Aware Encoding and Decoding | Qiankun Fu, Yue Zhang, Jiangming Liu and Meishan Zhang | N/A | N/A |
| DTCA: Decision Tree-based Co-Attention Networks for Explainable Claim Verification | Lianwei Wu, Yuan Rao, Yongqiang Zhao, Hao Liang and Ambreen Nazir | N/A | N/A |
| Dynamic Fusion Network for Multi-Domain End-to-end Task-Oriented Dialog | Libo Qin, Xiao Xu, Wanxiang Che, Yue Zhang and Ting Liu | N/A | N/A |
| Dynamic Online Conversation Recommendation | Xingshan Zeng, Jing Li, Lu Wang, Zhiming Mao and Kam-Fai Wong | N/A | N/A |
| Dynamic Programming Encoding for Subword Segmentation in Neural Machine Translation | Xuanli He, Gholamreza Haffari and Mohammad Norouzi | N/A | N/A |
| ECPE-2D: Emotion-Cause Pair Extraction based on Joint Two-Dimensional Representation, Interaction and Prediction | Zixiang Ding, Rui Xia and Jianfei Yu | N/A | N/A |
| Effective Estimation of Deep Generative Language Models | Tom Pelsmaeker and Wilker Aziz | N/A | N/A |
| Effective Inter-Clause Modeling for End-to-End Emotion-Cause Pair Extraction | Penghui Wei, Jiahao Zhao and Wenji Mao | N/A | N/A |
| Efficient Constituency Parsing by Pointing | Thanh-Tung Nguyen, Xuan-Phi Nguyen, Shafiq Joty and Xiaoli Li | N/A | N/A |
| Efficient Dialogue State Tracking by Selectively Overwriting Memory | Sungdong Kim, Sohee Yang, Gyuwan Kim and Sang-Woo Lee | N/A | N/A |
| Efficient Pairwise Annotation of Argument Quality | Lukas Gienapp, Benno Stein, Matthias Hagen and Martin Potthast | N/A | N/A |
| Efficient Second-Order TreeCRF for Neural Dependency Parsing | Yu Zhang, Zhenghua Li and Min Zhang | N/A | N/A |
| Emergence of Syntax Needs Minimal Supervision | Raphaël Bailly and Kata Gábor | N/A | N/A |
| Emerging Cross-lingual Structure in Pretrained Language Models | Alexis Conneau, Shijie Wu, Haoran Li, Luke Zettlemoyer and Veselin Stoyanov | N/A | N/A |
| Empower Entity Set Expansion via Language Model Probing | Yunyi Zhang, Jiaming Shen, Jingbo Shang and Jiawei Han | N/A | N/A |
| Empowering Active Learning to Jointly Optimize System and User Demands | Ji-Ung Lee, Christian M. Meyer and Iryna Gurevych | N/A | N/A |
| End-to-End Bias Mitigation by Modelling Biases in Corpora | Rabeeh Karimi Mahabadi, Yonatan Belinkov and James Henderson | N/A | N/A |
| End-to-End Neural Pipeline for Goal-Oriented Dialogue Systems using GPT-2 | Donghoon Ham, Jeong-Gwan Lee, Youngsoo Jang and Kee-Eung Kim | N/A | N/A |
| End-to-End Neural Word Alignment Outperforms GIZA++ | Thomas Zenkel, Joern Wuebker and John DeNero | N/A | N/A |
| Enhancing Answer Boundary Detection for Multilingual Machine Reading Comprehension | Fei Yuan, Linjun Shou, Xuanyu Bai, Ming Gong, Yaobo Liang, Nan Duan, Yan Fu and Daxin Jiang | N/A | N/A |
| Enhancing Cross-target Stance Detection with Transferable Semantic-Emotion Knowledge | Bowen Zhang, Min Yang, Xutao Li, Yunming Ye, Xiaofei Xu and Kuai Dai | N/A | N/A |
| ERASER: A Benchmark to Evaluate Rationalized NLP Models | Jay DeYoung, Sarthak Jain, Nazneen Fatema Rajani, Eric Lehman, Caiming Xiong, Richard Socher and Byron C. Wallace | N/A | N/A |
| ESPRIT: Explaining Solutions to Physical Reasoning Tasks | Nazneen Fatema Rajani, Rui Zhang, Yi Chern Tan, Stephan Zheng, Jeremy Weiss, Aadit Vyas, Abhijit Gupta, Caiming Xiong, Richard Socher and Dragomir Radev | N/A | N/A |
| Estimating predictive uncertainty for rumour verification models | Elena Kochkina and Maria Liakata | N/A | N/A |
| Estimating the influence of auxiliary tasks for multi-task learning of sequence tagging tasks | Fynn Schröder and Chris Biemann | N/A | N/A |
| Evaluating and Enhancing the Robustness of Neural Network-based Dependency Parsing Models with Adversarial Examples | Xiaoqing Zheng, Jiehang Zeng, Yi Zhou, Cho-Jui Hsieh, Minhao Cheng and Xuanjing Huang | N/A | N/A |
| Evaluating Explainable AI: Which Algorithmic Explanations Help Users Predict Model Behavior? | Peter Hase and Mohit Bansal | N/A | N/A |
| Evaluating Explanation Methods for Neural Machine Translation | Jierui Li, Lemao Liu, Huayang Li, Guanlin Li, Guoping Huang and Shuming Shi | N/A | N/A |
| Evidence-Aware Inferential Text Generation with Vector Quantised Variational AutoEncoder | Daya Guo, Duyu Tang, Nan Duan, Jian Yin, Daxin Jiang and Ming Zhou | N/A | N/A |
| Exact yet Efficient Graph Parsing, Bi-directional Locality and the Constructivist Hypothesis | Yajie Ye and Weiwei Sun | N/A | N/A |
| Examining Citations of Natural Language Processing Literature | Saif M. Mohammad | N/A | N/A |
| Examining the State-of-the-Art in News Timeline Summarization | Demian Gholipour Ghalandari and Georgiana Ifrim | N/A | N/A |
| Exclusive Hierarchical Decoding for Deep Keyphrase Generation | Wang Chen, Hou Pong Chan, Piji Li and Irwin King | N/A | N/A |
| Expertise Style Transfer: A New Task Towards Better Communication between Experts and Laymen | Yixin Cao, Ruihao Shui, Liangming Pan, Min-Yen Kan, Zhiyuan Liu and Tat-Seng Chua | N/A | N/A |
| Explaining Black Box Predictions and Unveiling Data Artifacts through Influence Functions | Xiaochuang Han, Byron C. Wallace and Yulia Tsvetkov | N/A | N/A |
| Explicit Memory Tracker with Coarse-to-Fine Reasoning for Conversational Machine Reading | Yifan Gao, Chien-Sheng Wu, Shafiq Joty, Caiming Xiong, Richard Socher, Irwin King, Michael Lyu and Steven C.H. Hoi | N/A | N/A |
| Explicit Semantic Decomposition for Definition Generation | Jiahuan Li, Yu Bao, Shujian Huang, Xinyu Dai and Jiajun Chen | N/A | N/A |
| Exploiting Syntactic Structure for Better Language Modeling: A Syntactic Distance Approach | Wenyu Du, Zhouhan Lin, Yikang Shen, Timothy J. O’Donnell, Yoshua Bengio and Yue Zhang | N/A | N/A |
| Exploiting the Syntax-Model Consistency for Neural Relation Extraction | Amir Pouran Ben Veyseh, Franck Dernoncourt, Dejing Dou and Thien Huu Nguyen | N/A | N/A |
| Exploring Contextual Word-level Style Relevance for Unsupervised Style Transfer | Chulun Zhou, Liangyu Chen, Jiachen Liu, Xinyan Xiao, Jinsong Su, Sheng Guo and Hua Wu | N/A | N/A |
| Exploring Unexplored Generalization Challenges for Cross-Database Semantic Parsing | Alane Suhr, Ming-Wei Chang, Peter Shaw and Kenton Lee | N/A | N/A |
| Extracting Headless MWEs from Dependency Parse Trees: Parsing, Tagging, and Joint Modeling Approaches | Tianze Shi and Lillian Lee | N/A | N/A |
| Extractive Summarization as Text Matching | Ming Zhong, Pengfei Liu, Yiran Chen, Danqing Wang, Xipeng Qiu and Xuanjing Huang | N/A | N/A |
| Facet-Aware Evaluation for Extractive Summarization | Yuning Mao, Liyuan Liu, Qi Zhu, Xiang Ren and Jiawei Han | N/A | N/A |
| Fact-based Text Editing | Hayate Iso, Chao Qiao and Hang Li | N/A | N/A |
| Fast and Accurate Deep Bidirectional Language Representations for Unsupervised Learning | Joongbo Shin, Yoonhyung Lee, Seunghyun Yoon and Kyomin Jung | N/A | N/A |
| Fast and Accurate Non-Projective Dependency Tree Linearization | Xiang Yu, Simon Tannert, Ngoc Thang Vu and Jonas Kuhn | N/A | N/A |
| FastBERT: a Self-distilling BERT with Adaptive Inference Time | Weijie Liu, Peng Zhou, Zhiruo Wang, Zhe Zhao, Haotang Deng and Qi Ju | N/A | N/A |
| Feature Projection for Improved Text Classification | Qi Qin, Wenpeng Hu and Bing Liu | N/A | N/A |
| FEQA: A Question Answering Evaluation Framework for Faithfulness Assessment in Abstractive Summarization | Esin Durmus, He He and Mona Diab | N/A | N/A |
| Few-shot Slot Tagging with Collapsed Dependency Transfer and Label-enhanced Task-adaptive Projection Network | Yutai Hou, Wanxiang Che, Yongkui Lai, Zhihan Zhou, Yijia Liu, Han Liu and Ting Liu | N/A | N/A |
| Finding Universal Grammatical Relations in Multilingual BERT | Ethan A. Chi, John Hewitt and Christopher D. Manning | N/A | N/A |
| Fine-Grained Analysis of Cross-Linguistic Syntactic Divergences | Dmitry Nikolaev, Ofir Arviv, Taelin Karidi, Neta Kenneth, Veronika Mitnik, Lilja Maria Saeboe and Omri Abend | N/A | N/A |
| Fine-grained Fact Verification with Kernel Graph Attention Network | Zhenghao Liu, Chenyan Xiong, Maosong Sun and Zhiyuan Liu | N/A | N/A |
| Fine-grained Interest Matching for Neural News Recommendation | Heyuan Wang, Fangzhao Wu, Zheng Liu and Xing Xie | N/A | N/A |
| Fluent Response Generation for Conversational Question Answering | Ashutosh Baheti, Alan Ritter and Kevin Small | N/A | N/A |
| From Arguments to Key Points: Towards Automatic Argument Summarization | Roy Bar-Haim, Lilach Eden, Roni Friedman, Yoav Kantor, Dan Lahav and Noam Slonim | N/A | N/A |
| From English to Code-Switching: Transfer Learning with Strong Morphological Clues | Gustavo Aguilar and Thamar Solorio | N/A | N/A |
| From SPMRL to NMRL: What Did We Learn (and Unlearn) in a Decade of Parsing Morphologically-Rich Languages (MRLs)? | Reut Tsarfaty, Dan Bareket, Stav Klein and Amit Seker | N/A | N/A |
| From Zero to Hero: Human-In-The-Loop Entity Linking in Low Resource Domains | Jan-Christoph Klie, Richard Eckart de Castilho and Iryna Gurevych | N/A | N/A |
| Frugal Paradigm Completion | Alexander Erdmann, Tom Kenter, Markus Becker and Christian Schallhart | N/A | N/A |
| Gated Convolutional Bidirectional Attention-based Model for Off-topic Spoken Response Detection | Yefei Zha, Ruobing Li and Hui Lin | N/A | N/A |
| GCAN: Graph-aware Co-Attention Networks for Explainable Fake News Detection on Social Media | Yi-Ju Lu and Cheng-Te Li | N/A | N/A |
| Gender Bias in Multilingual Embeddings and Cross-Lingual Transfer | Jieyu Zhao, Subhabrata Mukherjee, Saghar Hosseini, Kai-Wei Chang and Ahmed Hassan Awadallah | N/A | N/A |
| Gender Gap in Natural Language Processing Research: Disparities in Authorship and Citations | Saif M. Mohammad | N/A | N/A |
| Gender in Danger? Evaluating Speech Translation Technology on the MuST-SHE Corpus | Luisa Bentivogli, Beatrice Savoldi, Matteo Negri, Mattia A. Di Gangi, Roldano Cattoni and Marco Turchi | N/A | N/A |
| Generalized Entropy Regularization or: There’s Nothing Special about Label Smoothing | Clara Meister, Elizabeth Salesky and Ryan Cotterell | N/A | N/A |
| Generalizing Natural Language Analysis through Span-relation Representations | Zhengbao Jiang, Wei Xu, Jun Araki and Graham Neubig | N/A | N/A |
| Generate, Delete and Rewrite: A Three-Stage Framework for Improving Persona Consistency of Dialogue Generation | Haoyu Song, Yan Wang, Wei-Nan Zhang, Xiaojiang Liu and Ting Liu | N/A | N/A |
| Generating Counter Narratives against Online Hate Speech: Data and Strategies | Serra Sinem Tekiroğlu, Yi-Ling Chung and Marco Guerini | N/A | N/A |
| Generating Diverse and Consistent QA pairs from Contexts with Information-Maximizing Hierarchical Conditional VAEs | Dong Bok Lee, Seanie Lee, Woo Tae Jeong, Donghwan Kim and Sung Ju Hwang | N/A | N/A |
| Generating Fact Checking Explanations | Pepa Atanasova, Jakob Grue Simonsen, Christina Lioma and Isabelle Augenstein | N/A | N/A |
| Generating Hierarchical Explanations on Text Classification via Feature Interaction Detection | Hanjie Chen, Guangtao Zheng and Yangfeng Ji | N/A | N/A |
| Generating Informative Conversational Response using Recurrent Knowledge-Interaction and Knowledge-Copy | Xiexiong Lin, Weiyu Jian, Jianshan He, Taifeng Wang and Wei Chu | N/A | N/A |
| Generative Semantic Hashing Enhanced via Boltzmann Machines | Lin Zheng, Qinliang Su, Dinghan Shen and Changyou Chen | N/A | N/A |
| GLUECoS: An Evaluation Benchmark for Code-Switched NLP | Simran Khanuja, Sandipan Dandapat, Anirudh Srinivasan, Sunayana Sitaram and Monojit Choudhury | N/A | N/A |
| GoEmotions: A Dataset of Fine-Grained Emotions | Dorottya Demszky, Dana Movshovitz-Attias, Jeongwoo Ko, Alan Cowen, Gaurav Nemade and Sujith Ravi | N/A | N/A |
| Good-Enough Compositional Data Augmentation | Jacob Andreas | N/A | N/A |
| Graph Neural News Recommendation with Unsupervised Preference Disentanglement | Linmei Hu, Siyong Xu, Chen Li, Cheng Yang, Chuan Shi, Nan Duan, Xing Xie and Ming Zhou | N/A | N/A |
| Graph-to-Tree Learning for Solving Math Word Problems | Jipeng Zhang, Lei Wang, Roy Ka-Wei Lee, Yi Bin, Yan Wang, Jie Shao and Ee-Peng Lim | N/A | N/A |
| Grounded Conversation Generation as Guided Traverses in Commonsense Knowledge Graphs | Houyu Zhang, Zhenghao Liu, Chenyan Xiong and Zhiyuan Liu | N/A | N/A |
| Grounding Conversations with Improvised Dialogues | Hyundong Cho and Jonathan May | N/A | N/A |
| Guiding Variational Response Generator to Exploit Persona | Bowen Wu, Mengyuan Li, Zongsheng Wang, Yifu Chen, Derek F. Wong, Qihang Feng, Junhong Huang and Baoxun Wang | N/A | N/A |
| Handling Rare Entities for Neural Sequence Labeling | Yangming Li, Han Li, Kaisheng Yao and Xiaolong Li | N/A | N/A |
| Hard-Coded Gaussian Attention for Neural Machine Translation | Weiqiu You, Simeng Sun and Mohit Iyyer | N/A | N/A |
| Harnessing the linguistic signal to predict scalar inferences | Sebastian Schuster, Yuxing Chen and Judith Degen | N/A | N/A |
| Harvesting and Refining Question-Answer Pairs for Unsupervised QA | Zhongli Li, Wenhui Wang, Li Dong, Furu Wei and Ke Xu | N/A | N/A |
| HAT: Hardware-Aware Transformers for Efficient Natural Language Processing | Hanrui Wang, Zhanghao Wu, Zhijian Liu, Han Cai, Ligeng Zhu, Chuang Gan and Song Han | N/A | N/A |
| He said “who’s gonna take care of your children when you are at ACL?”: Reported Sexist Acts are Not Sexist | Patricia Chiril, Véronique Moriceau, Farah Benamara, Alda Mari, Gloria Origgi and Marlène Coulomb-Gully | N/A | N/A |
| Heterogeneous Graph Neural Networks for Extractive Document Summarization | Danqing Wang, Pengfei Liu, Yining Zheng, Xipeng Qiu and Xuanjing Huang | N/A | N/A |
| Heterogeneous Graph Transformer for Graph-to-Sequence Learning | Shaowei Yao, Tianming Wang and Xiaojun Wan | N/A | N/A |
| Hierarchical Entity Typing via Multi-level Learning to Rank | Tongfei Chen, Yunmo Chen and Benjamin Van Durme | N/A | N/A |
| Hierarchical Modeling for User Personality Prediction: The Role of Message-Level Attention | Veronica Lynn, Niranjan Balasubramanian and H. Andrew Schwartz | N/A | N/A |
| Hierarchy-Aware Global Model for Hierarchical Text Classification | Jie Zhou, Chunping Ma, Dingkun Long, Guangwei Xu, Ning Ding, Haoyu Zhang, Pengjun Xie and Gongshen Liu | N/A | N/A |
| Highway Transformer: Self-Gating Enhanced Self-Attentive Networks | Yekun Chai, Shuo Jin and Xinwen Hou | N/A | N/A |
| Hiring Now: A Skill-Aware Multi-Attention Model for Job Posting Generation | Liting Liu, Jie Liu, Wenzheng Zhang, Ziming Chi, Wenxuan Shi and Yalou Huang | N/A | N/A |
| History for Visual Dialog: Do we really need it? | Shubham Agarwal, Trung Bui, Joon-Young Lee, Ioannis Konstas and Verena Rieser | N/A | N/A |
| Hooks in the Headline: Learning to Generate Headlines with Controlled Styles | Di Jin, Zhijing Jin, Joey Tianyi Zhou, Lisa Orii and Peter Szolovits | N/A | N/A |
| How Accents Confound: Probing for Accent Information in End-to-End Speech Recognition Systems | Archiki Prasad and Preethi Jyothi | N/A | N/A |
| How does BERT’s attention change when you fine-tune? An analysis methodology and a case study in negation scope | Yiyun Zhao and Steven Bethard | N/A | N/A |
| How Does NLP Benefit Legal System: A Summary of Legal Artificial Intelligence | Haoxi Zhong, Chaojun Xiao, Cunchao Tu, Tianyang Zhang, Zhiyuan Liu and Maosong Sun | N/A | N/A |
| How Does Selective Mechanism Improve Self-Attention Networks? | Xinwei Geng, Longyue Wang, Xing Wang, Bing Qin, Ting Liu and Zhaopeng Tu | N/A | N/A |
| How to Ask Good Questions? Try to Leverage Paraphrases | Xin Jia, Wenjie Zhou, Xu Sun and Yunfang Wu | N/A | N/A |
| Human Attention Maps for Text Classification: Do Humans and Neural Networks Focus on the Same Words? | Cansu Sen, Thomas Hartvigsen, Biao Yin, Xiangnan Kong and Elke Rundensteiner | N/A | N/A |
| Hyperbolic Capsule Networks for Multi-Label Classification | Boli Chen, Xin Huang, Lin Xiao and Liping Jing | N/A | N/A |
| HyperCore: Hyperbolic and Co-graph Representation for Automatic ICD Coding | Pengfei Cao, Yubo Chen, Kang Liu, Jun Zhao, Shengping Liu and Weifeng Chong | N/A | N/A |
| Image-Chat: Engaging Grounded Conversations | Kurt Shuster, Samuel Humeau, Antoine Bordes and Jason Weston | N/A | N/A |
| IMoJIE: Iterative Memory-Based Joint Open Information Extraction | Keshav Kolluru, Samarth Aggarwal, Vipul Rathore, Mausam and Soumen Chakrabarti | N/A | N/A |
| Improved Natural Language Generation via Loss Truncation | Daniel Kang and Tatsunori Hashimoto | N/A | N/A |
| Improving Adversarial Text Generation by Modeling the Distant Future | Ruiyi Zhang, Changyou Chen, Zhe Gan, Wenlin Wang, Dinghan Shen, Guoyin Wang, Zheng Wen and Lawrence Carin | N/A | N/A |
| Improving Chinese Word Segmentation with Wordhood Memory Networks | Yuanhe Tian, Yan Song, Fei Xia, Tong Zhang and Yonggang Wang | N/A | N/A |
| Improving Disentangled Text Representation Learning with Information-Theoretic Guidance | Pengyu Cheng, Martin Renqiang Min, Dinghan Shen, Christopher Malon, Yizhe Zhang, Yitong Li and Lawrence Carin | N/A | N/A |
| Improving Disfluency Detection by Self-Training a Self-Attentive Model | Paria Jamshid Lou and Mark Johnson | N/A | N/A |
| Improving Event Detection via Open-domain Trigger Knowledge | Meihan Tong, Bin Xu, Shuai Wang, Yixin Cao, Lei Hou, Juanzi Li and Jun Xie | N/A | N/A |
| Improving Image Captioning Evaluation by Considering Inter References Variance | Yanzhi Yi, Hangyu Deng and Jinglu Hu | N/A | N/A |
| Improving Image Captioning with Better Use of Caption | Zhan Shi, Xu Zhou, Xipeng Qiu and Xiaodan Zhu | N/A | N/A |
| Improving Massively Multilingual Neural Machine Translation and Zero-Shot Translation | Biao Zhang, Philip Williams, Ivan Titov and Rico Sennrich | N/A | N/A |
| Improving Multi-hop Question Answering over Knowledge Graphs using Knowledge Base Embeddings | Apoorv Saxena, Aditay Tripathi and Partha Talukdar | N/A | N/A |
| Improving Multimodal Named Entity Recognition via Entity Span Detection with Unified Multimodal Transformer | Jianfei Yu, Jing Jiang, Li Yang and Rui Xia | N/A | N/A |
| Improving Neural Machine Translation with Soft Template Prediction | Jian Yang, Shuming Ma, Dongdong Zhang, Zhoujun Li and Ming Zhou | N/A | N/A |
| Improving Segmentation for Technical Support Problems | Kushal Chauhan and Abhirut Gupta | N/A | N/A |
| Improving Transformer Models by Reordering their Sublayers | Ofir Press, Noah A. Smith and Omer Levy | N/A | N/A |
| Improving Truthfulness of Headline Generation | Kazuki Matsumaru, Sho Takase and Naoaki Okazaki | N/A | N/A |
| In Layman’s Terms: Semi-Open Relation Extraction from Scientific Texts | Ruben Kruiper, Julian Vincent, Jessica Chen-Burger, Marc Desmulliez and Ioannis Konstas | N/A | N/A |
| In Neural Machine Translation, What Does Transfer Learning Transfer? | Alham Fikri Aji, Nikolay Bogoychev, Kenneth Heafield and Rico Sennrich | N/A | N/A |
| Inflecting when there’s no majority: Limitations of encoder-decoder neural networks as cognitive models for German plurals | Kate McCurdy, Sharon Goldwater and Adam Lopez | N/A | N/A |
| Influence Paths for Characterizing Subject-Verb Number Agreement in LSTM Language Models | Kaiji Lu, Piotr Mardziel, Klas Leino, Matt Fredrikson and Anupam Datta | N/A | N/A |
| Information-Theoretic Probing for Linguistic Structure | Tiago Pimentel, Josef Valvoda, Rowan Hall Maudslay, Ran Zmigrod, Adina Williams and Ryan Cotterell | N/A | N/A |
| INFOTABS: Inference on Tables as Semi-structured Data | Vivek Gupta, Maitrey Mehta, Pegah Nokhiz and Vivek Srikumar | N/A | N/A |
| Injecting Numerical Reasoning Skills into Language Models | Mor Geva, Ankit Gupta and Jonathan Berant | N/A | N/A |
| INSET: Sentence Infilling with INter-SEntential Transformer | Yichen Huang, Yizhe Zhang, Oussama Elachqar and Yu Cheng | N/A | N/A |
| Integrating Multimodal Information in Large Pretrained Transformers | Wasifur Rahman, Md Kamrul Hasan, Sangwu Lee, AmirAli Bagher Zadeh, Chengfeng Mao, Louis-Philippe Morency and Ehsan Hoque | N/A | N/A |
| Integrating Semantic and Structural Information with Graph Convolutional Network for Controversy Detection | Lei Zhong, Juan Cao, Qiang Sheng, Junbo Guo and Ziang Wang | N/A | N/A |
| Interactive Classification by Asking Informative Questions | Lili Yu, Howard Chen, Sida I. Wang, Tao Lei and Yoav Artzi | N/A | N/A |
| Interactive Construction of User-Centric Dictionary for Text Analytics | Ryosuke Kohita, Issei Yoshida, Hiroshi Kanayama and Tetsuya Nasukawa | N/A | N/A |
| Interactive Machine Comprehension with Information Seeking Agents | Xingdi Yuan, Jie Fu, Marc-Alexandre Côté, Yi Tay, Chris Pal and Adam Trischler | N/A | N/A |
| Intermediate-Task Transfer Learning with Pretrained Language Models: When and Why Does It Work? | Yada Pruksachatkun, Jason Phang, Haokun Liu, Phu Mon Htut, Xiaoyi Zhang, Richard Yuanzhe Pang, Clara Vania, Katharina Kann and Samuel R. Bowman | N/A | N/A |
| Interpreting Pretrained Contextualized Representations via Reductions to Static Embeddings | Rishi Bommasani, Kelly Davis and Claire Cardie | N/A | N/A |
| Investigating the effect of auxiliary objectives for the automated grading of learner English speech transcriptions | Hannah Craighead, Andrew Caines, Paula Buttery and Helen Yannakoudakis | N/A | N/A |
| Investigating Word-Class Distributions in Word Vector Spaces | Ryohei Sasano and Anna Korhonen | N/A | N/A |
| iSarcasm: A Dataset of Intended Sarcasm | Silviu Oprea and Walid Magdy | N/A | N/A |
| It Takes Two to Lie: One to Lie, and One to Listen | Denis Peskov, Benny Cheng, Ahmed Elgohary, Joe Barrow, Cristian Danescu-Niculescu-Mizil and Jordan Boyd-Graber | N/A | N/A |
| It’s Morphin’ Time! Combating Linguistic Discrimination with Inflectional Perturbations | Samson Tan, Shafiq Joty, Min-Yen Kan and Richard Socher | N/A | N/A |
| Iterative Edit-Based Unsupervised Sentence Simplification | Dhruv Kumar, Lili Mou, Lukasz Golab and Olga Vechtomova | N/A | N/A |
| Joint Chinese Word Segmentation and Part-of-speech Tagging via Two-way Attentions of Auto-analyzed Knowledge | Yuanhe Tian, Yan Song, Xiang Ao, Fei Xia, Xiaojun Quan, Tong Zhang and Yonggang Wang | N/A | N/A |
| Joint Diacritization, Lemmatization, Normalization, and Fine-Grained Morphological Tagging | Nasser Zalmout and Nizar Habash | N/A | N/A |
| Joint Modelling of Emotion and Abusive Language Detection | Santhosh Rajamanickam, Pushkar Mishra, Helen Yannakoudakis and Ekaterina Shutova | N/A | N/A |
| Jointly Learning to Align and Summarize for Neural Cross-Lingual Summarization | Yue Cao, Hui Liu and Xiaojun Wan | N/A | N/A |
| Jointly Masked Sequence-to-Sequence Model for Non-Autoregressive Neural Machine Translation | Junliang Guo, Linli Xu and Enhong Chen | N/A | N/A |
| KdConv: A Chinese Multi-domain Dialogue Dataset Towards Multi-turn Knowledge-driven Conversation | Hao Zhou, Chujie Zheng, Kaili Huang, Minlie Huang and Xiaoyan Zhu | N/A | N/A |
| KinGDOM: Knowledge-Guided DOMain adaptation for sentiment analysis | Deepanway Ghosal, Devamanyu Hazarika, Abhinaba Roy, Navonil Majumder, Rada Mihalcea and Soujanya Poria | N/A | N/A |
| KLEJ: Comprehensive Benchmark for Polish Language Understanding | Piotr Rybak, Robert Mroczkowski, Janusz Tracz and Ireneusz Gawlik | N/A | N/A |
| Knowledge Distillation for Multilingual Unsupervised Neural Machine Translation | Haipeng Sun, Rui Wang, Kehai Chen, Masao Utiyama, Eiichiro Sumita and Tiejun Zhao | N/A | N/A |
| Knowledge Graph Embedding Compression | Mrinmaya Sachan | N/A | N/A |
| Knowledge Graph-Augmented Abstractive Summarization with Semantic-Driven Cloze Reward | Luyang Huang, Lingfei Wu and Lu Wang | N/A | N/A |
| Language (Re)modelling: Towards Embodied Language Understanding | Ronen Tamari, Chen Shani, Tom Hope, Miriam R L Petruck, Omri Abend and Dafna Shahaf | N/A | N/A |
| Language (technology) is power: The need to be explicit about NLP harms | Su Lin Blodgett, Solon Barocas, Hal Daumé III and Hanna Wallach | N/A | N/A |
| Language Models as an Alternative Evaluator of Word Order Hypotheses: A Case Study in Japanese | Tatsuki Kuribayashi, Takumi Ito, Jun Suzuki and Kentaro Inui | N/A | N/A |
| Language to Network: Conditional Parameter Adaptation with Natural Language Descriptions | Tian Jin, Zhun Liu, Shengjia Yan, Alexandre Eichenberger and Louis-Philippe Morency | N/A | N/A |
| Large Scale Multi-Actor Generative Dialog Modeling | Alex Boyd, Raul Puri, Mohammad Shoeybi, Mostofa Patwary and Bryan Catanzaro | N/A | N/A |
| Learning a Multi-Domain Curriculum for Neural Machine Translation | Wei Wang, Ye Tian, Jiquan Ngiam, Yinfei Yang, Isaac Caswell and Zarana Parekh | N/A | N/A |
| Learning and Evaluating Emotion Lexicons for 91 Languages | Sven Buechel, Susanna Rücker and Udo Hahn | N/A | N/A |
| Learning Architectures from an Extended Search Space for Language Modeling | Yinqiao Li, Chi Hu, Yuhao Zhang, Nuo Xu, Yufan Jiang, Tong Xiao, Jingbo Zhu, Tongran Liu and Changliang Li | N/A | N/A |
| Learning Constraints for Structured Prediction Using Rectifier Networks | Xingyuan Pan, Maitrey Mehta and Vivek Srikumar | N/A | N/A |
| Learning Dialog Policies from Weak Demonstrations | Gabriel Gordon-Hall, Philip John Gorinski and Shay B. Cohen | N/A | N/A |
| Learning Efficient Dialogue Policy from Demonstrations through Shaping | Huimin Wang, Baolin Peng and Kam-Fai Wong | N/A | N/A |
| Learning Interpretable Relationships between Entities, Relations and Concepts via Bayesian Structure Learning on Open Domain Facts | Jingyuan Zhang, Mingming Sun, Yue Feng and Ping Li | N/A | N/A |
| Learning Source Phrase Representations for Neural Machine Translation | Hongfei Xu, Josef van Genabith, Deyi Xiong, Qiuhui Liu and Jingyi Zhang | N/A | N/A |
| Learning to Ask More: Semi-Autoregressive Sequential Question Generation under Dual-Graph Interaction | Zi Chai and Xiaojun Wan | N/A | N/A |
| Learning to Contextually Aggregate Multi-Source Supervision for Sequence Labeling | Ouyu Lan, Xiao Huang, Bill Yuchen Lin, He Jiang, Liyuan Liu and Xiang Ren | N/A | N/A |
| Learning to Customize Model Structures for Few-shot Dialogue Generation Tasks | Yiping Song, Zequn Liu, Wei Bi, Rui Yan and Ming Zhang | N/A | N/A |
| Learning to Deceive with Attention-Based Explanations | Danish Pruthi, Mansi Gupta, Bhuwan Dhingra, Graham Neubig and Zachary C. Lipton | N/A | N/A |
| Learning to execute instructions in a Minecraft dialogue | Prashant Jayannavar, Anjali Narayan-Chen and Julia Hockenmaier | N/A | N/A |
| Learning to Faithfully Rationalize by Construction | Sarthak Jain, Sarah Wiegreffe, Yuval Pinter and Byron C. Wallace | N/A | N/A |
| Learning to Identify Follow-Up Questions in Conversational Question Answering | Souvik Kundu, Qian Lin and Hwee Tou Ng | N/A | N/A |
| Learning to Recover from Multi-Modality Errors for Non-Autoregressive Neural Machine Translation | Qiu Ran, Yankai Lin, Peng Li and Jie Zhou | N/A | N/A |
| Learning to Segment Actions from Observation and Narration | Daniel Fried, Jean-Baptiste Alayrac, Phil Blunsom, Chris Dyer, Stephen Clark and Aida Nematzadeh | N/A | N/A |
| Learning to Update Natural Language Comments Based on Code Changes | Sheena Panthaplackel, Pengyu Nie, Milos Gligoric, Junyi Jessy Li and Raymond Mooney | N/A | N/A |
| Learning Web-based Procedures by Reasoning over Explanations and Demonstrations in Context | Shashank Srivastava, Oleksandr Polozov, Nebojsa Jojic and Christopher Meek | N/A | N/A |
| Leveraging Graph to Improve Abstractive Multi-Document Summarization | Wei Li, Xinyan Xiao, Jiachen Liu, Hua Wu, Haifeng Wang and Junping Du | N/A | N/A |
| Line Graph Enhanced AMR-to-Text Generation with Mix-Order Graph Attention Networks | Yanbin Zhao, Lu Chen, Zhi Chen, Ruisheng Cao, Su Zhu and Kai Yu | N/A | N/A |
| Location Attention for Extrapolation to Longer Sequences | Yann Dubois, Gautier Dagan, Dieuwke Hupkes and Elia Bruni | N/A | N/A |
| Logical Natural Language Generation from Open-Domain Tables | Wenhu Chen, Jianshu Chen, Yu Su, Zhiyu Chen and William Yang Wang | N/A | N/A |
| LogicalFactChecker: Leveraging Logical Operations for Fact Checking with Graph Module Network | Wanjun Zhong, Duyu Tang, Zhangyin Feng, Nan Duan, Ming Zhou, Ming Gong, Linjun Shou, Daxin Jiang, Jiahai Wang and Jian Yin | N/A | N/A |
| Low-Dimensional Hyperbolic Knowledge Graph Embeddings | Ines Chami, Adva Wolf, Da-Cheng Juan, Frederic Sala, Sujith Ravi and Christopher Ré | N/A | N/A |
| Low-Resource Generation of Multi-hop Reasoning Questions | Jianxing Yu, Wei Liu, Shuang Qiu, Qinliang Su, Kai Wang, Xiaojun Quan and Jian Yin | N/A | N/A |
| Machine Reading of Historical Events | Or Honovich, Lucas Torroba Hennigen, Omri Abend and Shay B. Cohen | N/A | N/A |
| Mapping Natural Language Instructions to Mobile UI Action Sequences | Yang Li, Jiacong He, Xin Zhou, Yuan Zhang and Jason Baldridge | N/A | N/A |
| MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning | Jie Lei, Liwei Wang, Yelong Shen, Dong Yu, Tamara Berg and Mohit Bansal | N/A | N/A |
| Masked Language Model Scoring | Julian Salazar, Davis Liang, Toan Q. Nguyen and Katrin Kirchhoff | N/A | N/A |
| MATINF: A Jointly Labeled Large-Scale Dataset for Classification, Question Answering and Summarization | Canwen Xu, Jiaxin Pei, Hongtao Wu, Yiyu Liu and Chenliang Li | N/A | N/A |
| Max-Margin Incremental CCG Parsing | Miloš Stanojević and Mark Steedman | N/A | N/A |
| Measuring Forecasting Skill from Text | Shi Zong, Alan Ritter and Eduard Hovy | N/A | N/A |
| Meta-Reinforced Multi-Domain State Generator for Dialogue Systems | Yi Huang, Junlan Feng, Min Hu, Xiaoting Wu, Xiaoyu Du and Shuo Ma | N/A | N/A |
| MIE: A Medical Information Extractor towards Medical Dialogues | Yuanzhe Zhang, Zhongtao Jiang, Tao Zhang, Shiwan Liu, Jiarun Cao, Kang Liu, Shengping Liu and Jun Zhao | N/A | N/A |
| Mind the Trade-off: Debiasing NLU Models without Degrading the In-distribution Performance | Prasetya Ajie Utama, Nafise Sadat Moosavi and Iryna Gurevych | N/A | N/A |
| MIND: A Large-scale Dataset for News Recommendation | Fangzhao Wu, Ying Qiao, Jiun-Hung Chen, Chuhan Wu, Tao Qi, Jianxun Lian, Danyang Liu, Xing Xie, Jianfeng Gao, Winnie Wu and Ming Zhou | N/A | N/A |
| MixText: Linguistically-Informed Interpolation of Hidden Space for Semi-Supervised Text Classification | Jiaao Chen, Zichao Yang and Diyi Yang | N/A | N/A |
| MLQA: Evaluating Cross-lingual Extractive Question Answering | Patrick Lewis, Barlas Oguz, Ruty Rinott, Sebastian Riedel and Holger Schwenk | N/A | N/A |
| MMPE: A Multi-Modal Interface for Post-Editing Machine Translation | Nico Herbig, Tim Düwel, Santanu Pal, Kalliopi Maria Meladaki, Mahsa Monshizadeh, Antonio Krüger and Josef van Genabith | N/A | N/A |
| MobileBERT: a Compact Task-Agnostic BERT for Resource-Limited Devices | Zhiqing Sun, Hongkun Yu, Xiaodan Song, Renjie Liu, Yiming Yang and Denny Zhou | N/A | N/A |
| Modeling Code-Switch Languages Using Bilingual Parallel Corpus | Grandee Lee and Haizhou Li | N/A | N/A |
| Modeling Morphological Typology for Unsupervised Learning of Language Morphology | Hongzhi Xu, Jordan Kodner, Mitchell Marcus and Charles Yang | N/A | N/A |
| Modelling Context and Syntactical Features for Aspect-based Sentiment Analysis | Minh Hieu Phan and Philip O. Ogunbona | N/A | N/A |
| More Diverse Dialogue Datasets via Diversity-Informed Data Collection | Katherine Stasaski, Grace Hui Yang and Marti A. Hearst | N/A | N/A |
| Moving Down the Long Tail of Word Sense Disambiguation with Gloss Informed Bi-encoders | Terra Blevins and Luke Zettlemoyer | N/A | N/A |
| Multi-agent Communication meets Natural Language: Synergies between Functional and Structural Language Learning | Angeliki Lazaridou, Anna Potapenko and Olivier Tieleman | N/A | N/A |
| Multi-Agent Task-Oriented Dialog Policy Learning with Role-Aware Reward Decomposition | Ryuichi Takanobu, Runze Liang and Minlie Huang | N/A | N/A |
| Multi-Cell Compositional LSTM for NER Domain Adaptation | Chen Jia and Yue Zhang | N/A | N/A |
| Multidirectional Associative Optimization of Function-Specific Word Representations | Daniela Gerz, Ivan Vulić, Marek Rei, Roi Reichart and Anna Korhonen | N/A | N/A |
| Multi-Domain Dialogue Acts and Response Co-Generation | Kai Wang, Junfeng Tian, Rui Wang, Xiaojun Quan and Jianxing Yu | N/A | N/A |
| Multi-Domain Named Entity Recognition with Genre-Aware and Agnostic Inference | Jing Wang, Mayank Kulkarni and Daniel Preotiuc-Pietro | N/A | N/A |
| Multi-Domain Neural Machine Translation with Word-Level Adaptive Layer-wise Domain Mixing | Haoming Jiang, Chen Liang, Chong Wang and Tuo Zhao | N/A | N/A |
| Multi-Granularity Interaction Network for Extractive and Abstractive Multi-Document Summarization | Hanqi Jin, Tianming Wang and Xiaojun Wan | N/A | N/A |
| Multi-Hypothesis Machine Translation Evaluation | Marina Fomicheva, Lucia Specia and Francisco Guzmán | N/A | N/A |
| Multi-Label and Multilingual News Framing Analysis | Afra Feyza Akyürek, Lei Guo, Randa Elanwar, Prakash Ishwar, Margrit Betke and Derry Tanti Wijaya | N/A | N/A |
| Multimodal Neural Graph Memory Networks for Visual Question Answering | Mahmoud Khademi | N/A | N/A |
| MultiQT: Multimodal learning for real-time question tracking in speech | Jakob D. Havtorn, Jan Latko, Joakim Edin, Lars Maaløe, Lasse Borgholt, Lorenzo Belgrano, Nicolai Jacobsen, Regitze Sdun and Željko Agić | N/A | N/A |
| Multiscale Collaborative Deep Models for Neural Machine Translation | Xiangpeng Wei, Heng Yu, Yue Hu, Yue Zhang, Rongxiang Weng and Weihua Luo | N/A | N/A |
| Multi-Sentence Argument Linking | Seth Ebner, Patrick Xia, Ryan Culkin, Kyle Rawlins and Benjamin Van Durme | N/A | N/A |
| Multi-source Meta Transfer for Low Resource Multiple-Choice Question Answering | Ming Yan, Hao Zhang, Di Jin and Joey Tianyi Zhou | N/A | N/A |
| MuTual: A Dataset for Multi-Turn Dialogue Reasoning | Leyang Cui, Yu Wu, Shujie Liu, Yue Zhang and Ming Zhou | N/A | N/A |
| Named Entity Recognition without Labelled Data: A Weak Supervision Approach | Pierre Lison, Jeremy Barnes, Aliaksandr Hubin and Samia Touileb | N/A | N/A |
| NAT: Noise-Aware Training for Robust Neural Sequence Labeling | Marcin Namysl, Sven Behnke and Joachim Köhler | N/A | N/A |
| Negative Training for Neural Dialogue Response Generation | Tianxing He and James Glass | N/A | N/A |
| Neighborhood Matching Network for Entity Alignment | Yuting Wu, Xiao Liu, Yansong Feng, Zheng Wang and Dongyan Zhao | N/A | N/A |
| NeuInfer: Knowledge Inference on N-ary Facts | Saiping Guan, Xiaolong Jin, Jiafeng Guo, Yuanzhuo Wang and Xueqi Cheng | N/A | N/A |
| Neural CRF Model for Sentence Alignment in Text Simplification | Chao Jiang, Mounica Maddela, Wuwei Lan, Yang Zhong and Wei Xu | N/A | N/A |
| Neural Data-to-Text Generation via Jointly Learning the Segmentation and Correspondence | Xiaoyu Shen, Ernie Chang, Hui Su, Cheng Niu and Dietrich Klakow | N/A | N/A |
| Neural Generation of Dialogue Response Timings | Matthew Roddy and Naomi Harte | N/A | N/A |
| Neural Mixed Counting Models for Dispersed Topic Discovery | Jiemin Wu, Yanghui Rao, Zusheng Zhang, Haoran Xie, Qing Li, Fu Lee Wang and Ziye Chen | N/A | N/A |
| Neural Reranking for Dependency Parsing: An Evaluation | Bich-Ngoc Do and Ines Rehbein | N/A | N/A |
| Neural Syntactic Preordering for Controlled Paraphrase Generation | Tanya Goyal and Greg Durrett | N/A | N/A |
| Neural Topic Modeling with Bidirectional Adversarial Training | Rui Wang, Xuemeng Hu, Deyu Zhou, Yulan He, Yuxuan Xiong, Chenchen Ye and Haiyang Xu | N/A | N/A |
| NILE: Natural Language Inference with Faithful Natural Language Explanations | Sawan Kumar and Partha Talukdar | N/A | N/A |
| Norm-Based Curriculum Learning for Neural Machine Translation | Xuebo Liu, Houtim Lai, Derek F. Wong and Lidia S. Chao | N/A | N/A |
| Not All Claims are Created Equal: Choosing the Right Statistical Approach to Assess Hypotheses | Erfan Sadeqi Azer, Daniel Khashabi, Ashish Sabharwal and Dan Roth | N/A | N/A |
| Null It Out: Guarding Protected Attributes by Iterative Nullspace Projection | Shauli Ravfogel, Yanai Elazar, Hila Gonen, Michael Twiton and Yoav Goldberg | N/A | N/A |
| Obtaining Faithful Interpretations from Compositional Neural Networks | Sanjay Subramanian, Ben Bogin, Nitish Gupta, Tomer Wolfson, Sameer Singh, Jonathan Berant and Matt Gardner | N/A | N/A |
| On Faithfulness and Factuality in Abstractive Summarization | Joshua Maynez, Shashi Narayan, Bernd Bohnet and Ryan McDonald | N/A | N/A |
| On the Cross-lingual Transferability of Monolingual Representations | Mikel Artetxe, Sebastian Ruder and Dani Yogatama | N/A | N/A |
| On the Encoder-Decoder Incompatibility in Variational Text Modeling and Beyond | Chen Wu, Prince Zizhuang Wang and William Yang Wang | N/A | N/A |
| On The Evaluation of Machine Translation Systems Trained With Back-Translation | Sergey Edunov, Myle Ott, Marc’Aurelio Ranzato and Michael Auli | N/A | N/A |
| On the Inference Calibration of Neural Machine Translation | Shuo Wang, Zhaopeng Tu, Shuming Shi and Yang Liu | N/A | N/A |
| On the Limitations of Cross-lingual Encoders as Exposed by Reference-Free Machine Translation Evaluation | Wei Zhao, Goran Glavaš, Maxime Peyrard, Yang Gao, Robert West and Steffen Eger | N/A | N/A |
| On the Robustness of Language Encoders against Grammatical Errors | Fan Yin, Quanyu Long, Tao Meng and Kai-Wei Chang | N/A | N/A |
| One Size Does Not Fit All: Generating and Evaluating Variable Number of Keyphrases | Xingdi Yuan, Tong Wang, Rui Meng, Khushboo Thaker, Peter Brusilovsky, Daqing He and Adam Trischler | N/A | N/A |
| Optimizing the Factual Correctness of a Summary: A Study of Summarizing Radiology Reports | Yuhao Zhang, Derek Merck, Emily Tsai, Christopher D. Manning and Curtis Langlotz | N/A | N/A |
| Orthogonal Relation Transforms with Graph Context Modeling for Knowledge Graph Embedding | Yun Tang, Jing Huang, Guangtao Wang, Xiaodong He and Bowen Zhou | N/A | N/A |
| Out of the Echo Chamber: Detecting Countering Debate Speeches | Matan Orbach, Yonatan Bilu, Assaf Toledo, Dan Lahav, Michal Jacovi, Ranit Aharonov and Noam Slonim | N/A | N/A |
| ParaCrawl: Web-Scale Acquisition of Parallel Corpora | Marta Bañón, Pinzhen Chen, Barry Haddow, Kenneth Heafield, Hieu Hoang, Miquel Esplà-Gomis, Mikel L. Forcada, Amir Kamran, Faheem Kirefu, Philipp Koehn, Sergio Ortiz Rojas, Leopoldo Pla Sempere, Gema Ramírez-Sánchez, Elsa Sarrías, Marek Strelec, Brian Thompson, William Waites, Dion Wiggins and Jaume Zaragoza | N/A | N/A |
| Parallel Corpus Filtering via Pre-trained Language Models | Boliang Zhang, Ajay Nagesh and Kevin Knight | N/A | N/A |
| Paraphrase Augmented Task-Oriented Dialog Generation | Silin Gao, Yichi Zhang, Zhijian Ou and Zhou Yu | N/A | N/A |
| Paraphrase Generation by Learning How to Edit from Samples | Amirhossein Kazemnejad, Mohammadreza Salehi and Mahdieh Soleymani Baghshah | N/A | N/A |
| Parsing into Variable-in-situ Logico-Semantic Graphs | Yufei Chen and Weiwei Sun | N/A | N/A |
| Perturbed Masking: Parameter-free Probing for Analyzing and Interpreting BERT | Zhiyong Wu, Yun Chen, Ben Kao and Qun Liu | N/A | N/A |
| PeTra: A Sparsely Supervised Memory Model for People Tracking | Shubham Toshniwal, Allyson Ettinger, Kevin Gimpel and Karen Livescu | N/A | N/A |
| Phone Features Improve Speech Translation | Elizabeth Salesky and Alan W Black | N/A | N/A |
| Phonetic and Visual Priors for Decipherment of Informal Romanization | Maria Ryskina, Matthew R. Gormley and Taylor Berg-Kirkpatrick | N/A | N/A |
| PLATO: Pre-trained Dialogue Generation Model with Discrete Latent Variable | Siqi Bao, Huang He, Fan Wang, Hua Wu and Haifeng Wang | N/A | N/A |
| Politeness Transfer: A Tag and Generate Approach | Aman Madaan, Amrith Setlur, Tanmay Parekh, Barnabas Poczos, Graham Neubig, Yiming Yang, Ruslan Salakhutdinov, Alan W Black and Shrimai Prabhumoye | N/A | N/A |
| Posterior Control of Blackbox Generation | Xiang Lisa Li and Alexander Rush | N/A | N/A |
| Predicting Declension Class from Form and Meaning | Adina Williams, Tiago Pimentel, Arya D. McCarthy, Hagen Blix, Eleanor Chodroff and Ryan Cotterell | N/A | N/A |
| Predicting Depression in Screening Interviews from Latent Categorization of Interview Prompts | Alex Rinaldi, Jean Fox Tree and Snigdha Chaturvedi | N/A | N/A |
| Predicting Performance for Natural Language Processing Tasks | Mengzhou Xia, Antonios Anastasopoulos, Ruochen Xu, Yiming Yang and Graham Neubig | N/A | N/A |
| Predicting the Focus of Negation: Model and Error Analysis | Md Mosharaf Hossain, Kathleen Hamilton, Alexis Palmer and Eduardo Blanco | N/A | N/A |
| Predicting the Growth of Morphological Families from Social and Linguistic Factors | Valentin Hofmann, Janet Pierrehumbert and Hinrich Schütze | N/A | N/A |
| Predicting the Topical Stance and Political Leaning of Media using Tweets | Peter Stefanov, Kareem Darwish, Atanas Atanasov and Preslav Nakov | N/A | N/A |
| Predictive Biases in Natural Language Processing Models: A Conceptual Framework and Overview | Deven Santosh Shah, H. Andrew Schwartz and Dirk Hovy | N/A | N/A |
| Premise Selection in Natural Language Mathematical Texts | Deborah Ferreira and André Freitas | N/A | N/A |
| Pre-train and Plug-in: Flexible Conditional Text Generation with Variational Auto-Encoders | Yu Duan, Canwen Xu, Jiaxin Pei, Jialong Han and Chenliang Li | N/A | N/A |
| Pre-training Is (Almost) All You Need: An Application to Commonsense Reasoning | Alexandre Tamborrino, Nicola Pellicanò, Baptiste Pannier, Pascal Voitot and Louise Naudin | N/A | N/A |
| Pretraining with Contrastive Sentence Objectives Improves Discourse Performance of Language Models | Dan Iter, Kelvin Guu, Larry Lansing and Dan Jurafsky | N/A | N/A |
| Probabilistic Assumptions Matter: Improved Models for Distantly-Supervised Document-Level Question Answering | Hao Cheng, Ming-Wei Chang, Kenton Lee and Kristina Toutanova | N/A | N/A |
| Probabilistically Masked Language Model Capable of Autoregressive Generation in Arbitrary Word Order | Yi Liao, Xin Jiang and Qun Liu | N/A | N/A |
| Probing for referential information in language models | Ionut-Teodor Sorodoc, Kristina Gulordava and Gemma Boleda | N/A | N/A |
| Probing Linguistic Features of Sentence-Level Representations in Relation Extraction | Christoph Alt, Aleksandra Gabryszak and Leonhard Hennig | N/A | N/A |
| Probing Linguistic Systematicity | Emily Goodwin, Koustuv Sinha and Timothy J. O’Donnell | N/A | N/A |
| Programming in Natural Language with fuSE: Synthesizing Methods from Spoken Utterances Using Deep Natural Language Understanding | Sebastian Weigelt, Vanessa Steurer, Tobias Hey and Walter F. Tichy | N/A | N/A |
| PuzzLing Machines: A Challenge on Learning From Small Data | Gözde Gül Şahin, Yova Kementchedjhieva, Phillip Rust and Iryna Gurevych | N/A | N/A |
| Pyramid: A Layered Model for Nested Named Entity Recognition | Jue Wang, Lidan Shou, Ke Chen and Gang Chen | N/A | N/A |
| QuASE: Question-Answer Driven Sentence Encoding | Hangfeng He, Qiang Ning and Dan Roth | N/A | N/A |
| R^3: Reverse, Retrieve, and Rank for Sarcasm Generation with Commonsense Knowledge | Tuhin Chakrabarty, Debanjan Ghosh, Smaranda Muresan and Nanyun Peng | N/A | N/A |
| Rationalizing Medical Relation Prediction from Corpus-level Statistics | Zhen Wang, Jennifer Lee, Simon Lin and Huan Sun | N/A | N/A |
| Rationalizing Text Matching: Learning Sparse Alignments via Optimal Transport | Kyle Swanson, Lili Yu and Tao Lei | N/A | N/A |
| RAT-SQL: Relation-Aware Schema Encoding and Linking for Text-to-SQL Parsers | Bailin Wang, Richard Shin, Xiaodong Liu, Oleksandr Polozov and Matthew Richardson | N/A | N/A |
| Reasoning Over Semantic-Level Graph for Fact Checking | Wanjun Zhong, Jingjing Xu, Duyu Tang, Zenan Xu, Nan Duan, Ming Zhou, Jiahai Wang and Jian Yin | N/A | N/A |
| Reasoning with Latent Structure Refinement for Document-Level Relation Extraction | Guoshun Nan, Zhijiang Guo, Ivan Sekulic and Wei Lu | N/A | N/A |
| Reasoning with Multimodal Sarcastic Tweets via Modeling Cross-Modality Contrast and Semantic Association | Nan Xu, Zhixiong Zeng and Wenji Mao | N/A | N/A |
| (Re)construing Meaning in NLP | Sean Trott, Tiago Timponi Torrent, Nancy Chang and Nathan Schneider | N/A | N/A |
| Recurrent Chunking Mechanisms for Long-Text Machine Reading Comprehension | Hongyu Gong, Yelong Shen, Dian Yu, Jianshu Chen and Dong Yu | N/A | N/A |
| Recurrent Neural Network Language Models Always Learn English-Like Relative Clause Attachment | Forrest Davis and Marten van Schijndel | N/A | N/A |
| Reducing Gender Bias in Neural Machine Translation as a Domain Adaptation Problem | Danielle Saunders and Bill Byrne | N/A | N/A |
| Refer360°: A Referring Expression Recognition Dataset in 360° Images | Volkan Cirik, Taylor Berg-Kirkpatrick and Louis-Philippe Morency | N/A | N/A |
| ReInceptionE: Relation-Aware Inception Network with Joint Local-Global Structural Information for Knowledge Graph Embedding | Zhiwen Xie, Guangyou Zhou, Jin Liu and Jimmy Xiangji Huang | N/A | N/A |
| Relabel the Noise: Joint Extraction of Entities and Relations via Cooperative Multiagents | Daoyuan Chen, Yaliang Li, Kai Lei and Ying Shen | N/A | N/A |
| Relational Graph Attention Network for Aspect-based Sentiment Analysis | Kai Wang, Weizhou Shen, Yunyi Yang, Xiaojun Quan and Rui Wang | N/A | N/A |
| Relation-Aware Collaborative Learning for Unified Aspect-Based Sentiment Analysis | Zhuang Chen and Tieyun Qian | N/A | N/A |
| Representation Learning for Information Extraction from Form-like Documents | Bodhisattwa Prasad Majumder, Navneet Potti, Sandeep Tata, James Bradley Wendt, Qi Zhao and Marc Najork | N/A | N/A |
| Response-Anticipated Memory for On-Demand Knowledge Integration in Response Generation | Zhiliang Tian, Wei Bi, Dongkyu Lee, Lanqing Xue, Yiping Song, Xiaojiang Liu and Nevin L. Zhang | N/A | N/A |
| Review-based Question Generation with Adaptive Instance Transfer and Augmentation | Qian Yu, Lidong Bing, Qiong Zhang, Wai Lam and Luo Si | N/A | N/A |
| Revisiting the Context Window for Cross-lingual Word Embeddings | Ryokan Ri and Yoshimasa Tsuruoka | N/A | N/A |
| Rigid Formats Controlled Text Generation | Piji Li, Haisong Zhang, Xiaojiang Liu and Shuming Shi | N/A | N/A |
| RikiNet: Reading Wikipedia Pages for Natural Question Answering | Dayiheng Liu, Yeyun Gong, Jie Fu, Yu Yan, Jiusheng Chen, Daxin Jiang, Jiancheng Lv and Nan Duan | N/A | N/A |
| Robust Encodings: A Framework for Combating Adversarial Typos | Erik Jones, Robin Jia, Aditi Raghunathan and Percy Liang | N/A | N/A |
| Roles and Utilization of Attention Heads in Transformer-based Neural Language Models | Jae-young Jo and Sung-Hyon Myaeng | N/A | N/A |
| S2ORC: The Semantic Scholar Open Research Corpus | Kyle Lo, Lucy Wang, Mark Neumann, Rodney Kinney and Daniel Weld | N/A | N/A |
| SAS: Dialogue State Tracking via Slot Attention and Slot Information Sharing | Jiaying Hu, Yan Yang, Chencai Chen, Liang He and Zhou Yu | N/A | N/A |
| SCDE: Sentence Cloze Dataset with High Quality Distractors From Examinations | Xiang Kong, Varun Gangal and Eduard Hovy | N/A | N/A |
| schuBERT: Optimizing Elements of BERT | Ashish Khetan and Zohar Karnin | N/A | N/A |
| SciREX: A Challenge Dataset for Document-Level Information Extraction | Sarthak Jain, Madeleine van Zuylen, Hannaneh Hajishirzi and Iz Beltagy | N/A | N/A |
| Screenplay Summarization Using Latent Narrative Structure | Pinelopi Papalampidi, Frank Keller, Lea Frermann and Mirella Lapata | N/A | N/A |
| ScriptWriter: Narrative-Guided Script Generation | Yutao Zhu, Ruihua Song, Zhicheng Dou, Jian-Yun Nie and Jin Zhou | N/A | N/A |
| SEEK: Segmented Embedding of Knowledge Graphs | Wentao Xu, Shun Zheng, Liang He, Bin Shao, Jian Yin and Tie-Yan Liu | N/A | N/A |
| Selecting Backtranslated Data from Multiple Sources for Improved Neural Machine Translation | Xabier Soto, Dimitar Shterionov, Alberto Poncelas and Andy Way | N/A | N/A |
| Selective Question Answering under Domain Shift | Amita Kamath, Robin Jia and Percy Liang | N/A | N/A |
| Semantic Graphs for Generating Deep Questions | Liangming Pan, Yuxi Xie, Yansong Feng, Tat-Seng Chua and Min-Yen Kan | N/A | N/A |
| Semantic Parsing for English as a Second Language | Yuanyuan Zhao, Weiwei Sun, Junjie Cao and Xiaojun Wan | N/A | N/A |
| Semantic Scaffolds for Pseudocode-to-Code Generation | Ruiqi Zhong, Mitchell Stern and Dan Klein | N/A | N/A |
| Semi-supervised Contextual Historical Text Normalization | Peter Makarov and Simon Clematide | N/A | N/A |
| Semi-Supervised Dialogue Policy Learning via Stochastic Reward Estimation | Xinting Huang, Jianzhong Qi, Yu Sun and Rui Zhang | N/A | N/A |
| Semi-Supervised Semantic Dependency Parsing Using CRF Autoencoders | Zixia Jia, Youmi Ma, Jiong Cai and Kewei Tu | N/A | N/A |
| SenseBERT: Driving Some Sense into BERT | Yoav Levine, Barak Lenz, Or Dagan, Ori Ram, Dan Padnos, Or Sharir, Shai Shalev-Shwartz, Amnon Shashua and Yoav Shoham | N/A | N/A |
| SentiBERT: A Transferable Transformer-Based Architecture for Compositional Sentiment Semantics | Da Yin, Tao Meng and Kai-Wei Chang | N/A | N/A |
| Sentiment and Emotion help Sarcasm? A Multi-task Learning Framework for Multi-Modal Sarcasm, Sentiment and Emotion Analysis | Dushyant Singh Chauhan, Dhanush S R, Asif Ekbal and Pushpak Bhattacharyya | N/A | N/A |
| SeqVAT: Virtual Adversarial Training for Semi-Supervised Sequence Labeling | Luoxin Chen, Weitong Ruan, Xinyue Liu and Jianhua Lu | N/A | N/A |
| Should All Cross-Lingual Embeddings Speak English? | Antonios Anastasopoulos and Graham Neubig | N/A | N/A |
| Similarity Analysis of Contextual Word Representation Models | John Wu, Yonatan Belinkov, Hassan Sajjad, Nadir Durrani, Fahim Dalvi and James Glass | N/A | N/A |
| Simple, Interpretable and Stable Method for Detecting Words with Usage Change across Corpora | Hila Gonen, Ganesh Jawahar, Djamé Seddah and Yoav Goldberg | N/A | N/A |
| Simplify the Usage of Lexicon in Chinese NER | Ruotian Ma, Minlong Peng, Qi Zhang, Zhongyu Wei and Xuanjing Huang | N/A | N/A |
| SimulSpeech: End-to-End Simultaneous Speech to Text Translation | Yi Ren, Jinglin Liu, Xu Tan, Chen Zhang, Tao Qin, Zhou Zhao and Tie-Yan Liu | N/A | N/A |
| Single-/Multi-Source Cross-Lingual NER via Teacher-Student Learning on Unlabeled Data in Target Language | Qianhui Wu, Zijia Lin, Börje Karlsson, Jian-Guang Lou and Biqing Huang | N/A | N/A |
| SKEP: Sentiment Knowledge Enhanced Pre-training for Sentiment Analysis | Hao Tian, Can Gao, Xinyan Xiao, Hao Liu, Bolei He, Hua Wu, Haifeng Wang and Feng Wu | N/A | N/A |
| Slot-consistent NLG for Task-oriented Dialogue Systems with Iterative Rectification Network | Yangming Li, Kaisheng Yao, Libo Qin, Wanxiang Che, Xiaolong Li and Ting Liu | N/A | N/A |
| SMART: Robust and Efficient Fine-Tuning for Pre-trained Natural Language Models through Principled Regularized Optimization | Haoming Jiang, Pengcheng He, Weizhu Chen, Xiaodong Liu, Jianfeng Gao and Tuo Zhao | N/A | N/A |
| Social Bias Frames: Reasoning about Social and Power Implications of Language | Maarten Sap, Saadia Gabriel, Lianhui Qin, Dan Jurafsky, Noah A. Smith and Yejin Choi | N/A | N/A |
| Sources of Transfer in Multilingual Named Entity Recognition | David Mueller, Nicholas Andrews and Mark Dredze | N/A | N/A |
| Span Selection Pre-training for Question Answering | Michael Glass, Alfio Gliozzo, Rishav Chakravarti, Anthony Ferritto, Lin Pan, G P Shrivatsa Bhargav, Dinesh Garg and Avi Sil | N/A | N/A |
| Span-based Localizing Network for Natural Language Video Localization | Hao Zhang, Aixin Sun, Wei Jing and Joey Tianyi Zhou | N/A | N/A |
| SpanMlt: A Span-based Multi-Task Learning Framework for Pair-wise Aspect and Opinion Terms Extraction | He Zhao, Longtao Huang, Rong Zhang, Quan Lu and Hui Xue | N/A | N/A |
| Speak to your Parser: Interactive Text-to-SQL with Natural Language Feedback | Ahmed Elgohary, Saghar Hosseini and Ahmed Hassan Awadallah | N/A | N/A |
| Speaker Sensitive Response Evaluation Model | JinYeong Bak and Alice Oh | N/A | N/A |
| Speakers enhance contextually confusable words | Eric Meinhardt, Eric Bakovic and Leon Bergen | N/A | N/A |
| SPECTER: Document-level Representation Learning using Citation-informed Transformers | Arman Cohan, Sergey Feldman, Iz Beltagy, Doug Downey and Daniel Weld | N/A | N/A |
| Speech Translation and the End-to-End Promise: Taking Stock of Where We Are | Matthias Sperber and Matthias Paulik | N/A | N/A |
| SpellGCN: Incorporating Phonological and Visual Similarities into Language Models for Chinese Spelling Check | Xingyi Cheng, Weidi Xu, Kunlong Chen, Shaohua Jiang, Feng Wang, Taifeng Wang, Wei Chu and Yuan Qi | N/A | N/A |
| Spelling Error Correction with Soft-Masked BERT | Shaohua Zhang, Haoran Huang, Jicong Liu and Hang Li | N/A | N/A |
| Spying on your neighbors: Fine-grained probing of contextual embeddings for information about surrounding words | Josef Klafka and Allyson Ettinger | N/A | N/A |
| STARC: Structured Annotations for Reading Comprehension | Yevgeni Berzak, Jonathan Malmaud and Roger Levy | N/A | N/A |
| Stock Embeddings Acquired from News Articles and Price History, and an Application to Portfolio Optimization | Xin Du and Kumiko Tanaka-Ishii | N/A | N/A |
| Storytelling with Dialogue: A Critical Role Dungeons and Dragons Dataset | Revanth Rameshkumar and Peter Bailey | N/A | N/A |
| Structural Information Preserving for Graph-to-Text Generation | Linfeng Song, Ante Wang, Jinsong Su, Yue Zhang, Kun Xu, Yubin Ge and Dong Yu | N/A | N/A |
| Structured Tuning for Semantic Role Labeling | Tao Li, Parth Anand Jawale, Martha Palmer and Vivek Srikumar | N/A | N/A |
| Structure-Level Knowledge Distillation For Multilingual Sequence Labeling | Xinyu Wang, Yong Jiang, Nguyen Bach, Tao Wang, Fei Huang and Kewei Tu | N/A | N/A |
| Suspense in Short Stories is Predicted By Uncertainty Reduction over Neural Story Representation | David Wilmot and Frank Keller | N/A | N/A |
| Synchronous Double-channel Recurrent Network for Aspect-Opinion Pair Extraction | Shaowei Chen, Jie Liu, Yu Wang, Wenzheng Zhang and Ziming Chi | N/A | N/A |
| Syn-QG: Syntactic and Shallow Semantic Rules for Question Generation | Kaustubh Dhole and Christopher D. Manning | N/A | N/A |
| Syntax-Aware Opinion Role Labeling with Dependency Graph Convolutional Networks | Bo Zhang, Yue Zhang, Rui Wang, Zhenghua Li and Min Zhang | N/A | N/A |
| TaBERT: Pretraining for Joint Understanding of Textual and Tabular Data | Pengcheng Yin, Graham Neubig, Wen-tau Yih and Sebastian Riedel | N/A | N/A |
| TACRED Revisited: A Thorough Evaluation of the TACRED Relation Extraction Task | Christoph Alt, Aleksandra Gabryszak and Leonhard Hennig | N/A | N/A |
| TAG: Type Auxiliary Guiding for Code Comment Generation | Ruichu Cai, Zhihao Liang, Boyan Xu, Zijian Li, Yuexing Hao and Yao Chen | N/A | N/A |
| Tangled up in BLEU: Reevaluating the Evaluation of Automatic Machine Translation Evaluation Metrics | Nitika Mathur, Timothy Baldwin and Trevor Cohn | N/A | N/A |
| TaPas: Weakly Supervised Table Parsing via Pre-training | Jonathan Herzig, Pawel Krzysztof Nowak, Thomas Müller, Francesco Piccinno and Julian Eisenschlos | N/A | N/A |
| Target Inference in Argument Conclusion Generation | Milad Alshomary, Shahbaz Syed, Martin Potthast and Henning Wachsmuth | N/A | N/A |
| Taxonomy Construction of Unseen Domains via Graph-based Cross-Domain Knowledge Transfer | Chao Shang, Sarthak Dash, Md Faisal Mahbub Chowdhury, Nandana Mihindukulasooriya and Alfio Gliozzo | N/A | N/A |
| Tchebycheff Procedure for Multi-task Text Classification | Yuren Mao, Shuang Yun, Weiwei Liu and Bo Du | N/A | N/A |
| Temporal Common Sense Acquisition with Minimal Supervision | Ben Zhou, Qiang Ning, Daniel Khashabi and Dan Roth | N/A | N/A |
| Temporally-Informed Analysis of Named Entity Recognition | Shruti Rijhwani and Daniel Preotiuc-Pietro | N/A | N/A |
| Text and Causal Inference: A Review of Using Text to Remove Confounding from Causal Estimates | Katherine Keith, David Jensen and Brendan O’Connor | N/A | N/A |
| Text-Based Ideal Points | Keyon Vafa, Suresh Naidu and David Blei | N/A | N/A |
| That is a Known Lie: Detecting Previously Fact-Checked Claims | Shaden Shaar, Nikolay Babulkov, Giovanni Da San Martino and Preslav Nakov | N/A | N/A |
| “The Boating Store Had Its Best Sail Ever”: Pronunciation-attentive Contextualized Pun Recognition | Yichao Zhou, Jyun-Yu Jiang, Jieyu Zhao, Kai-Wei Chang and Wei Wang | N/A | N/A |
| The Cascade Transformer: an Application for Efficient Answer Sentence Selection | Luca Soldaini and Alessandro Moschitti | N/A | N/A |
| The Dialogue Dodecathlon: Open-Domain Knowledge and Image Grounded Conversational Agents | Kurt Shuster, Da Ju, Stephen Roller, Emily Dinan, Y-Lan Boureau and Jason Weston | N/A | N/A |
| The Paradigm Discovery Problem | Alexander Erdmann, Micha Elsner, Shijie Wu, Ryan Cotterell and Nizar Habash | N/A | N/A |
| The Right Tool for the Job: Matching Model and Instance Complexities | Roy Schwartz, Gabriel Stanovsky, Swabha Swayamdipta, Jesse Dodge and Noah A. Smith | N/A | N/A |
| The Sensitivity of Language Models and Humans to Winograd Schema Perturbations | Mostafa Abdou, Vinit Ravishankar, Maria Barrett, Yonatan Belinkov, Desmond Elliott and Anders Søgaard | N/A | N/A |
| The SOFC-Exp Corpus and Neural Approaches to Information Extraction in the Materials Science Domain | Annemarie Friedrich, Heike Adel, Federico Tomazic, Johannes Hingerl, Renou Benteau, Anika Marusczyk and Lukas Lange | N/A | N/A |
| The State and Fate of Linguistic Diversity and Inclusion in the NLP World | Pratik Joshi, Sebastin Santy, Amar Budhiraja, Kalika Bali and Monojit Choudhury | N/A | N/A |
| The Summary Loop: Learning to Write Abstractive Summaries Without Examples | Philippe Laban, Andrew Hsi, John Canny and Marti A. Hearst | N/A | N/A |
| The TechQA Dataset | Vittorio Castelli, Rishav Chakravarti, Saswati Dana, Anthony Ferritto, Radu Florian, Martin Franz, Dinesh Garg, Dinesh Khandelwal, Scott McCarley, Michael McCawley, Mohamed Nasr, Lin Pan, Cezar Pendus, John Pitrelli, Saurabh Pujar, Salim Roukos, Andrzej Sakrajda, Avi Sil, Rosario Uceda-Sosa, Todd Ward and Rong Zhang | N/A | N/A |
| The Unstoppable Rise of Computational Linguistics in Deep Learning | James Henderson | N/A | N/A |
| To Boldly Query What No One Has Annotated Before? The Frontiers of Corpus Querying | Markus Gärtner and Kerstin Jung | N/A | N/A |
| To Test Machine Comprehension, Start by Defining Comprehension | Jesse Dunietz, Greg Burnham, Akash Bharadwaj, Owen Rambow, Jennifer Chu-Carroll and Dave Ferrucci | N/A | N/A |
| Toward Gender-Inclusive Coreference Resolution | Yang Trista Cao and Hal Daumé III | N/A | N/A |
| Towards Conversational Recommendation over Multi-Type Dialogs | Zeming Liu, Haifeng Wang, Zheng-Yu Niu, Hua Wu, Wanxiang Che and Ting Liu | N/A | N/A |
| Towards Debiasing Sentence Representations | Paul Pu Liang, Irene Mengze Li, Emily Zheng, Yao Chong Lim, Ruslan Salakhutdinov and Louis-Philippe Morency | N/A | N/A |
| Towards Emotion-aided Multi-modal Dialogue Act Classification | Tulika Saha, Aditya Patra, Sriparna Saha and Pushpak Bhattacharyya | N/A | N/A |
| Towards Faithful Neural Table-to-Text Generation with Content-Matching Constraints | Zhenyi Wang, Xiaoyang Wang, Bang An, Dong Yu and Changyou Chen | N/A | N/A |
| Towards Holistic and Automatic Evaluation of Open-Domain Dialogue Generation | Bo Pang, Erik Nijkamp, Wenjuan Han, Linqi Zhou, Yixian Liu and Kewei Tu | N/A | N/A |
| Towards Interpretable Clinical Diagnosis with Bayesian Network Ensembles Stacked on Entity-Aware CNNs | Jun Chen, Xiaoya Dai, Quan Yuan, Chao Lu and Haifeng Huang | N/A | N/A |
| Towards Robustifying NLI Models Against Lexical Dataset Biases | Xiang Zhou and Mohit Bansal | N/A | N/A |
| Towards Transparent and Explainable Attention Models | Akash Kumar Mohankumar, Preksha Nema, Sharan Narasimhan, Mitesh M. Khapra, Balaji Vasan Srinivasan and Balaraman Ravindran | N/A | N/A |
| Towards Understanding Gender Bias in Relation Extraction | Andrew Gaut, Tony Sun, Shirlyn Tang, Yuxin Huang, Jing Qian, Mai ElSherief, Jieyu Zhao, Diba Mirza, Elizabeth Belding, Kai-Wei Chang and William Yang Wang | N/A | N/A |
| Towards Unsupervised Language Understanding and Generation by Joint Dual Learning | Shang-Yu Su, Chao-Wei Huang and Yun-Nung Chen | N/A | N/A |
| Toxicity Detection: Does Context Really Matter? | John Pavlopoulos, Jeffrey Sorensen, Lucas Dixon, Nithum Thain and Ion Androutsopoulos | N/A | N/A |
| Transition-based Directed Graph Construction for Emotion-Cause Pair Extraction | Chuang Fan, Chaofa Yuan, Jiachen Du, Lin Gui, Min Yang and Ruifeng Xu | N/A | N/A |
| Transition-based Semantic Dependency Parsing with Pointer Networks | Daniel Fernández-González and Carlos Gómez-Rodríguez | N/A | N/A |
| Translationese as a Language in “Multilingual” NMT | Parker Riley, Isaac Caswell, Markus Freitag and David Grangier | N/A | N/A |
| TransS-Driven Joint Learning Architecture for Implicit Discourse Relation Recognition | Ruifang He, Jian Wang, Fengyu Guo and Yugui Han | N/A | N/A |
| TVQA+: Spatio-Temporal Grounding for Video Question Answering | Jie Lei, Licheng Yu, Tamara Berg and Mohit Bansal | N/A | N/A |
| TXtract: Taxonomy-Aware Knowledge Extraction for Thousands of Product Categories | Giannis Karamanolakis, Jun Ma and Xin Luna Dong | N/A | N/A |
| Uncertainty-Aware Curriculum Learning for Neural Machine Translation | Yikai Zhou, Baosong Yang, Derek F. Wong, Yu Wan and Lidia S. Chao | N/A | N/A |
| Understanding Attention for Text Classification | Xiaobing Sun and Wei Lu | N/A | N/A |
| Understanding the Language of Political Agreement and Disagreement in Legislative Texts | Maryam Davoodi, Eric Waltenburg and Dan Goldwasser | N/A | N/A |
| Universal Decompositional Semantic Parsing | Elias Stengel-Eskin, Aaron Steven White, Sheng Zhang and Benjamin Van Durme | N/A | N/A |
| Unknown Intent Detection Using Gaussian Mixture Model with an Application to Zero-shot Intent Classification | Guangfeng Yan, Lu Fan, Qimai Li, Han Liu, Xiaotong Zhang, Xiao-Ming Wu and Albert Y.S. Lam | N/A | N/A |
| Unsupervised Alignment-based Iterative Evidence Retrieval for Multi-hop Question Answering | Vikas Yadav, Steven Bethard and Mihai Surdeanu | N/A | N/A |
| Unsupervised Cross-lingual Representation Learning at Scale | Alexis Conneau, Kartikay Khandelwal, Naman Goyal, Vishrav Chaudhary, Guillaume Wenzek, Francisco Guzmán, Edouard Grave, Myle Ott, Luke Zettlemoyer and Veselin Stoyanov | N/A | N/A |
| Unsupervised Domain Clusters in Pretrained Language Models | Roee Aharoni and Yoav Goldberg | N/A | N/A |
| Unsupervised Dual Paraphrasing for Two-stage Semantic Parsing | Ruisheng Cao, Su Zhu, Chenyu Yang, Chen Liu, Rao Ma, Yanbin Zhao, Lu Chen and Kai Yu | N/A | N/A |
| Unsupervised Morphological Paradigm Completion | Huiming Jin, Liwei Cai, Yihui Peng, Chen Xia, Arya McCarthy and Katharina Kann | N/A | N/A |
| Unsupervised Multimodal Neural Machine Translation with Pseudo Visual Pivoting | Po-Yao Huang, Junjie Hu, Xiaojun Chang and Alexander Hauptmann | N/A | N/A |
| Unsupervised Opinion Summarization as Copycat-Review Generation | Arthur Bražinskas, Mirella Lapata and Ivan Titov | N/A | N/A |
| Unsupervised Opinion Summarization with Noising and Denoising | Reinald Kim Amplayo and Mirella Lapata | N/A | N/A |
| Unsupervised Paraphrasing by Simulated Annealing | Xianggen Liu, Lili Mou, Fandong Meng, Hao Zhou, Jie Zhou and Sen Song | N/A | N/A |
| USR: An Unsupervised and Reference Free Evaluation Metric for Dialog Generation | Shikib Mehri and Maxine Eskenazi | N/A | N/A |
| Weight Poisoning Attacks on Pretrained Models | Keita Kurita, Paul Michel and Graham Neubig | N/A | N/A |
| What are the Goals of Distributional Semantics? | Guy Emerson | N/A | N/A |
| What determines the order of adjectives in English? Comparing efficiency-based theories using dependency treebanks | Richard Futrell, William Dyer and Greg Scontras | N/A | N/A |
| What Question Answering can Learn from Trivia Nerds | Jordan Boyd-Graber and Benjamin Börschinger | N/A | N/A |
| What Was Written vs. Who Read It: News Media Profiling Using Text Analysis and Social Media Context | Ramy Baly, Georgi Karadzhov, Jisun An, Haewoon Kwak, Yoan Dinkov, Ahmed Ali, James Glass and Preslav Nakov | N/A | N/A |
| When do Word Embeddings Accurately Reflect Surveys on our Beliefs About People? | Kenneth Joseph and Jonathan Morgan | N/A | N/A |
| “Who said it, and Why?” Provenance for Natural Language Claims | Yi Zhang, Zachary Ives and Dan Roth | N/A | N/A |
| WinoWhy: A Deep Diagnosis of Essential Commonsense Knowledge for Answering Winograd Schema Challenge | Hongming Zhang, Xinran Zhao and Yangqiu Song | N/A | N/A |
| Word-level Textual Adversarial Attacking as Combinatorial Optimization | Yuan Zang, Fanchao Qi, Chenghao Yang, Zhiyuan Liu, Meng Zhang, Qun Liu and Maosong Sun | N/A | N/A |
| XtremeDistil: Multi-stage Distillation for Massive Multilingual Models | Subhabrata Mukherjee and Ahmed Hassan Awadallah | N/A | N/A |
| You Impress Me: Dialogue Generation via Mutual Persona Perception | Qian Liu, Yihong Chen, Bei Chen, Jian-Guang Lou, Zixuan Chen, Bin Zhou and Dongmei Zhang | N/A | N/A |
| Zero-shot Text Classification via Reinforced Self-training | Zhiquan Ye, Yuxia Geng, Jiaoyan Chen, Jingmin Chen, Xiaoxiao Xu, Suhang Zheng, Feng Wang, Jun Zhang and Huajun Chen | N/A | N/A |
| Zero-Shot Transfer Learning with Synthesized Data for Multi-Domain Dialogue State Tracking | Giovanni Campagna, Agata Foryciarz, Mehrad Moradshahi and Monica Lam | N/A | N/A |
| ZeroShotCeres: Zero-Shot Relation Extraction from Semi-Structured Webpages | Colin Lockard, Prashant Shiralkar, Xin Luna Dong and Hannaneh Hajishirzi | N/A | N/A |
| A Complete Shift-Reduce Chinese Discourse Parser with Robust Dynamic Oracle | Shyh-Shiun Hung, Hen-Hsen Huang and Hsin-Hsi Chen | N/A | N/A |
| A Diverse Corpus for Evaluating and Developing English Math Word Problem Solvers | Shen-yun Miao, Chao-Chun Liang and Keh-Yih Su | N/A | N/A |
| A Frame-based Sentence Representation for Machine Reading Comprehension | Shaoru Guo, Ru Li, Hongye Tan, Xiaoli Li, Yong Guan, Hongyan Zhao and Yueping Zhang | N/A | N/A |
| A Large-Scale Multi-Document Summarization Dataset from the Wikipedia Current Events Portal | Demian Gholipour Ghalandari, Chris Hokamp, Nghia The Pham, John Glover and Georgiana Ifrim | N/A | N/A |
| A Multi-Perspective Architecture for Semantic Code Search | Rajarshi Haldar, Lingfei Wu, JinJun Xiong and Julia Hockenmaier | N/A | N/A |
| A negative case analysis of visual grounding methods for VQA | Robik Shrestha, Kushal Kafle and Christopher Kanan | N/A | N/A |
| A Probabilistic Generative Model for Typographical Analysis of Early Modern Printing | Kartik Goyal, Chris Dyer, Christopher Warren, Maxwell G’Sell and Taylor Berg-Kirkpatrick | N/A | N/A |
| A Re-evaluation of Knowledge Graph Completion Methods | Zhiqing Sun, Shikhar Vashishth, Soumya Sanyal, Partha Talukdar and Yiming Yang | N/A | N/A |
| A Relational Memory-based Embedding Model for Triple Classification and Search Personalization | Dai Quoc Nguyen, Tu Nguyen and Dinh Phung | N/A | N/A |
| A Relaxed Matching Procedure for Unsupervised BLI | Xu Zhao, Zihao Wang, Yong Zhang and Hao Wu | N/A | N/A |
| A Retrieve-and-Rewrite Initialization Method for Unsupervised Machine Translation | Shuo Ren, Yu Wu, Shujie Liu, Ming Zhou and Shuai Ma | N/A | N/A |
| A Simple and Effective Unified Encoder for Document-Level Machine Translation | Shuming Ma, Dongdong Zhang and Ming Zhou | N/A | N/A |
| A Tale of a Probe and a Parser | Rowan Hall Maudslay, Josef Valvoda, Tiago Pimentel, Adina Williams and Ryan Cotterell | N/A | N/A |
| A Three-Parameter Rank-Frequency Relation in Natural Languages | Chenchen Ding, Masao Utiyama and Eiichiro Sumita | N/A | N/A |
| A Transformer-based Approach for Source Code Summarization | Wasi Ahmad, Saikat Chakraborty, Baishakhi Ray and Kai-Wei Chang | N/A | N/A |
| A Two-Stage Masked LM Method for Term Set Expansion | Guy Kushilevitz, Shaul Markovitch and Yoav Goldberg | N/A | N/A |
| A Two-Step Approach for Implicit Event Argument Detection | Zhisong Zhang, Xiang Kong, Zhengzhong Liu, Xuezhe Ma and Eduard Hovy | N/A | N/A |
| Active Learning for Coreference Resolution using Discrete Annotation | Belinda Z. Li, Gabriel Stanovsky and Luke Zettlemoyer | N/A | N/A |
| An Empirical Comparison of Unsupervised Constituency Parsing Methods | Jun Li, Yifan Cao, Jiong Cai, Yong Jiang and Kewei Tu | N/A | N/A |
| Analyzing the Persuasive Effect of Style in News Editorial Argumentation | Roxanne El Baff, Henning Wachsmuth, Khalid Al Khatib and Benno Stein | N/A | N/A |
| Are we Estimating or Guesstimating Translation Quality? | Shuo Sun, Francisco Guzmán and Lucia Specia | N/A | N/A |
| Attend to Medical Ontologies: Content Selection for Clinical Abstractive Summarization | Sajad Sotudeh Gharebagh, Nazli Goharian and Ross Filice | N/A | N/A |
| Autoencoding Keyword Correlation Graph for Document Clustering | Billy Chiu, Sunil Kumar Sahu, Derek Thomas, Neha Sengupta and Mohammady Mahdy | N/A | N/A |
| Automated Topical Component Extraction Using Neural Network Attention Scores from Source-based Essay Scoring | Haoran Zhang and Diane Litman | N/A | N/A |
| Automatic Machine Translation Evaluation using Source Language Inputs and Cross-lingual Language Model | Kosuke Takahashi, Katsuhito Sudoh and Satoshi Nakamura | N/A | N/A |
| Bayesian Hierarchical Words Representation Learning | Oren Barkan, Idan Rejwan, Avi Caciularu and Noam Koenigstein | N/A | N/A |
| Benefits of Intermediate Annotations in Reading Comprehension | Dheeru Dua, Sameer Singh and Matt Gardner | N/A | N/A |
| Camouflaged Chinese Spam Content Detection with Semi-supervised Generative Active Learning | Zhuoren Jiang, Zhe Gao, Yu Duan, Yangyang Kang, Changlong Sun, Qiong Zhang and Xiaozhong Liu | N/A | N/A |
| Character-Level Translation with Self-attention | Yingqiang Gao, Nikola I. Nikolov, Yuhuang Hu and Richard H.R. Hahnloser | N/A | N/A |
| ClarQ: A large-scale and diverse dataset for Clarification Question Generation | Vaibhav Kumar and Alan W Black | N/A | N/A |
| Classification-Based Self-Learning for Weakly Supervised Bilingual Lexicon Induction | Mladen Karan, Ivan Vulić, Anna Korhonen and Goran Glavaš | N/A | N/A |
| Clinical Concept Linking with Contextualized Neural Representations | Elliot Schumacher, Andriy Mulyar and Mark Dredze | N/A | N/A |
| Closing the Gap: Joint De-Identification and Concept Extraction in the Clinical Domain | Lukas Lange, Heike Adel and Jannik Strötgen | N/A | N/A |
| Coach: A Coarse-to-Fine Approach for Cross-domain Slot Filling | Zihan Liu, Genta Indra Winata, Peng Xu and Pascale Fung | N/A | N/A |
| Code-switching patterns can be an effective route to improve performance of downstream NLP applications: A case study of humour, sarcasm and hate speech detection | Srijan Bansal, Vishal Garimella, Ayush Suhane, Jasabanta Patro and Animesh Mukherjee | N/A | N/A |
| Composing Elementary Discourse Units in Abstractive Summarization | Zhenwen Li, Wenhao Wu and Sujian Li | N/A | N/A |
| Content Word Aware Neural Machine Translation | Kehai Chen, Rui Wang, Masao Utiyama and Eiichiro Sumita | N/A | N/A |
| Contextual Embeddings: When Are They Worth It? | Simran Arora, Avner May, Jian Zhang and Christopher Ré | N/A | N/A |
| Contextual Neural Machine Translation Improves Translation of Cataphoric Pronouns | KayYen Wong, Sameen Maruf and Gholamreza Haffari | N/A | N/A |
| Contextualized Sparse Representations for Real-Time Open-Domain Question Answering | Jinhyuk Lee, Minjoon Seo, Hannaneh Hajishirzi and Jaewoo Kang | N/A | N/A |
| Contextualizing Hate Speech Classifiers with Post-hoc Explanation | Brendan Kennedy, Xisen Jin, Aida Mostafazadeh Davani, Morteza Dehghani and Xiang Ren | N/A | N/A |
| Contrastive Self-Supervised Learning for Commonsense Reasoning | Tassilo Klein and Moin Nabi | N/A | N/A |
| Controlled Crowdsourcing for High-Quality QA-SRL Annotation | Paul Roit, Ayal Klein, Daniela Stepanov, Jonathan Mamou, Julian Michael, Gabriel Stanovsky, Luke Zettlemoyer and Ido Dagan | N/A | N/A |
| Conversational Word Embedding for Retrieval-Based Dialog System | Wentao Ma, Yiming Cui, Ting Liu, Dong Wang, Shijin Wang and Guoping Hu | N/A | N/A |
| Crawling and Preprocessing Mailing Lists At Scale for Dialog Analysis | Janek Bevendorff, Khalid Al Khatib, Martin Potthast and Benno Stein | N/A | N/A |
| Crossing Variational Autoencoders for Answer Retrieval | Wenhao Yu, Lingfei Wu, Qingkai Zeng, Shu Tao, Yu Deng and Meng Jiang | N/A | N/A |
| DeeBERT: Dynamic Early Exiting for Accelerating BERT Inference | Ji Xin, Raphael Tang, Jaejun Lee, Yaoliang Yu and Jimmy Lin | N/A | N/A |
| Designing Precise and Robust Dialogue Response Evaluators | Tianyu Zhao, Divesh Lala and Tatsuya Kawahara | N/A | N/A |
| Dialogue State Tracking with Explicit Slot Connection Modeling | Yawen Ouyang, Moxin Chen, Xinyu Dai, Yinggong Zhao, Shujian Huang and Jiajun Chen | N/A | N/A |
| Do Transformers Need Deep Long-Range Memory? | Jack Rae and Ali Razavi | N/A | N/A |
| Do you have the right scissors? Tailoring Pre-trained Language Models via Monte-Carlo Methods | Ning Miao, Yuxuan Song, Hao Zhou and Lei Li | N/A | N/A |
| Does Multi-Encoder Help? A Case Study on Context-Aware Neural Machine Translation | Bei Li, Hui Liu, Ziyang Wang, Yufan Jiang, Tong Xiao, Jingbo Zhu, Tongran Liu and Changliang Li | N/A | N/A |
| Don’t Eclipse Your Arts Due to Small Discrepancies: Boundary Repositioning with a Pointer Network for Aspect Extraction | Zhenkai Wei, Yu Hong, Bowei Zou, Meng Cheng and Jianmin Yao | N/A | N/A |
| Dscorer: A Fast Evaluation Metric for Discourse Representation Structure Parsing | Jiangming Liu, Shay B. Cohen and Mirella Lapata | N/A | N/A |
| Dynamic Memory Induction Networks for Few-Shot Text Classification | Ruiying Geng, Binhua Li, Yongbin Li, Jian Sun and Xiaodan Zhu | N/A | N/A |
| Dynamic Sampling Strategies for Multi-Task Reading Comprehension | Ananth Gottumukkala, Dheeru Dua, Sameer Singh and Matt Gardner | N/A | N/A |
| Dynamically Adjusting Transformer Batch Size by Monitoring Gradient Direction Change | Hongfei Xu, Josef van Genabith, Deyi Xiong and Qiuhui Liu | N/A | N/A |
| Efficient strategies for hierarchical text classification: external knowledge and auxiliary tasks | Kervy Rivas Rojas, Gina Bustamante, Arturo Oncevay and Marco Antonio Sobrevilla Cabezudo | N/A | N/A |
| Embarrassingly Simple Unsupervised Aspect Extraction | Stéphan Tulkens and Andreas van Cranenburgh | N/A | N/A |
| Enabling Language Models to Fill in the Blanks | Chris Donahue, Mina Lee and Percy Liang | N/A | N/A |
| Encoder-Decoder Models Can Benefit from Pre-trained Masked Language Models in Grammatical Error Correction | Masahiro Kaneko, Masato Mita, Shun Kiyono, Jun Suzuki and Kentaro Inui | N/A | N/A |
| ENGINE: Energy-Based Inference Networks for Non-Autoregressive Machine Translation | Lifu Tu, Richard Yuanzhe Pang, Sam Wiseman and Kevin Gimpel | N/A | N/A |
| Enhancing Machine Translation with Dependency-Aware Self-Attention | Emanuele Bugliarello and Naoaki Okazaki | N/A | N/A |
| Enhancing Pre-trained Chinese Character Representation with Word-aligned Attention | Yanzeng Li, Bowen Yu, Xue Mengge and Tingwen Liu | N/A | N/A |
| Enriched In-Order Linearization for Faster Sequence-to-Sequence Constituent Parsing | Daniel Fernández-González and Carlos Gómez-Rodríguez | N/A | N/A |
| Entity-Aware Dependency-Based Deep Graph Attention Network for Comparative Preference Classification | Nianzu Ma, Sahisnu Mazumder, Hao Wang and Bing Liu | N/A | N/A |
| Estimating Mutual Information Between Dense Word Embeddings | Vitalii Zhelezniak, Aleksandar Savkov and Nils Hammerla | N/A | N/A |
| Evaluating Dialogue Generation Systems via Response Selection | Shiki Sato, Reina Akama, Hiroki Ouchi, Jun Suzuki and Kentaro Inui | N/A | N/A |
| Evaluating Robustness to Input Perturbations for Neural Machine Translation | Xing Niu, Prashant Mathur, Georgiana Dinu and Yaser Al-Onaizan | N/A | N/A |
| Every Document Owns Its Structure: Inductive Text Classification via Graph Neural Networks | Yufeng Zhang, Xueli Yu, Zeyu Cui, Shu Wu, Zhongzhen Wen and Liang Wang | N/A | N/A |
| ExpBERT: Representation Engineering with Natural Language Explanations | Shikhar Murty, Pang Wei Koh and Percy Liang | N/A | N/A |
| Exploiting Personal Characteristics of Debaters for Predicting Persuasiveness | Khalid Al Khatib, Michael Völske, Shahbaz Syed, Nikolay Kolyada and Benno Stein | N/A | N/A |
| Exploring Content Selection in Summarization of Novel Chapters | Faisal Ladhak, Bryan Li, Yaser Al-Onaizan and Kathy McKeown | N/A | N/A |
| Fact-based Content Weighting for Evaluating Abstractive Summarisation | Xinnuo Xu, Ondřej Dušek, Jingyi Li, Verena Rieser and Ioannis Konstas | N/A | N/A |
| Fatality Killed the Cat or: BabelPic, a Multimodal Dataset for Non-Concrete Concepts | Agostina Calabrese, Michele Bevilacqua and Roberto Navigli | N/A | N/A |
| Few-Shot NLG with Pre-Trained Language Model | Zhiyu Chen, Harini Eavani, Wenhu Chen, Yinyin Liu and William Yang Wang | N/A | N/A |
| FLAT: Chinese NER Using Flat-Lattice Transformer | Xiaonan Li, Hang Yan, Xipeng Qiu and Xuanjing Huang | N/A | N/A |
| GAN-BERT: Generative Adversarial Learning for Robust Text Classification with a Bunch of Labeled Examples | Danilo Croce, Giuseppe Castellucci and Roberto Basili | N/A | N/A |
| Geometry-aware domain adaptation for unsupervised alignment of word embeddings | Pratik Jawanpuria, Mayank Meghwanshi and Bamdev Mishra | N/A | N/A |
| Give Me Convenience and Give Her Death: Who Should Decide What Uses of NLP are Appropriate, and on What Basis? | Kobi Leins, Jey Han Lau and Timothy Baldwin | N/A | N/A |
| Glyph2Vec: Learning Chinese Out-of-Vocabulary Word Embedding from Glyphs | Hong-You Chen, Sz-Han Yu and Shou-de Lin | N/A | N/A |
| GPT-too: A language-model-first approach for AMR-to-text generation | Manuel Mager, Ramón Fernandez Astudillo, Tahira Naseem, Md Arafat Sultan, Young-Suk Lee, Radu Florian and Salim Roukos | N/A | N/A |
| How Can We Accelerate Progress Towards Human-like Linguistic Generalization? | Tal Linzen | N/A | N/A |
| Hypernymy Detection for Low-Resource Languages via Meta Learning | Changlong Yu, Jialong Han, Haisong Zhang and Wilfred Ng | N/A | N/A |
| Identifying Principals and Accessories in a Complex Case based on the Comprehension of Fact Description | Yakun Hu, Zhunchen Luo and Wenhan Chao | N/A | N/A |
| Implicit Discourse Relation Classification: We Need to Talk about Evaluation | Najoung Kim, Song Feng, Chulaka Gunasekara and Luis Lastras | N/A | N/A |
| Improved Speech Representations with Multi-Target Autoregressive Predictive Coding | Yu-An Chung and James Glass | N/A | N/A |
| Improving Entity Linking through Semantic Reinforced Entity Embeddings | Feng Hou, Ruili Wang, Jun He and Yi Zhou | N/A | N/A |
| Improving Low-Resource Named Entity Recognition using Joint Sentence and Token Labeling | Canasai Kruengkrai, Thien Hai Nguyen, Sharifah Mahani Aljunied and Lidong Bing | N/A | N/A |
| Improving Non-autoregressive Neural Machine Translation with Monolingual Data | Jiawei Zhou and Phillip Keung | N/A | N/A |
| Incorporating External Knowledge through Pre-training for Natural Language to Code Generation | Frank F. Xu, Zhengbao Jiang, Pengcheng Yin, Bogdan Vasilescu and Graham Neubig | N/A | N/A |
| Instance-Based Learning of Span Representations: A Case Study through Named Entity Recognition | Hiroki Ouchi, Jun Suzuki, Sosuke Kobayashi, Sho Yokoi, Tatsuki Kuribayashi, Ryuto Konno and Kentaro Inui | N/A | N/A |
| Interpretable Operational Risk Classification with Semi-Supervised Variational Autoencoder | Fan Zhou, Shengming Zhang and Yi Yang | N/A | N/A |
| Interpreting Twitter User Geolocation | Ting Zhong, Tianliang Wang, Fan Zhou, Goce Trajcevski, Kunpeng Zhang and Yi Yang | N/A | N/A |
| Is Your Classifier Actually Biased? Measuring Fairness under Uncertainty with Bernstein Bounds | Kawin Ethayarajh | N/A | N/A |
| It’s Easier to Translate out of English than into it: Measuring Neural Translation Difficulty by Cross-Mutual Information | Emanuele Bugliarello, Sabrina J. Mielke, Antonios Anastasopoulos, Ryan Cotterell and Naoaki Okazaki | N/A | N/A |
| Keyphrase Generation for Scientific Document Retrieval | Florian Boudin, Ygor Gallina and Akiko Aizawa | N/A | N/A |
| Knowledge Supports Visual Language Grounding: A Case Study on Colour Terms | Simeon Schüz and Sina Zarrieß | N/A | N/A |
| Language-aware Interlingua for Multilingual Neural Machine Translation | Changfeng Zhu, Heng Yu, Shanbo Cheng and Weihua Luo | N/A | N/A |
| Learning an Unreferenced Metric for Online Dialogue Evaluation | Koustuv Sinha, Prasanna Parthasarathi, Jasmine Wang, Ryan Lowe, William L. Hamilton and Joelle Pineau | N/A | N/A |
| Learning Implicit Text Generation via Feature Matching | Inkit Padhi, Pierre Dognin, Ke Bai, Cícero Nogueira dos Santos, Vijil Chenthamarakshan, Youssef Mroueh and Payel Das | N/A | N/A |
| Learning Low-Resource End-To-End Goal-Oriented Dialog for Fast and Reliable System Deployment | Yinpei Dai, Hangyu Li, Chengguang Tang, Yongbin Li, Jian Sun and Xiaodan Zhu | N/A | N/A |
| Learning Robust Models for e-Commerce Product Search | Thanh Nguyen, Nikhil Rao and Karthik Subbian | N/A | N/A |
| Learning Spoken Language Representations with Neural Lattice Language Modeling | Chao-Wei Huang and Yun-Nung Chen | N/A | N/A |
| Learning to Tag OOV Tokens by Integrating Contextual Representation and Background Knowledge | Keqing He, Yuanmeng Yan and Weiran Xu | N/A | N/A |
| Learning to Understand Child-directed and Adult-directed Speech | Lieke Gelderloos, Grzegorz Chrupała and Afra Alishahi | N/A | N/A |
| Let Me Choose: From Verbal Context to Font Selection | Amirreza Shirani, Franck Dernoncourt, Jose Echevarria, Paul Asente, Nedim Lipka and Thamar Solorio | N/A | N/A |
| Leveraging Monolingual Data with Self-Supervision for Multilingual Neural Machine Translation | Aditya Siddhant, Ankur Bapna, Yuan Cao, Orhan Firat, Mia Chen, Sneha Kudugunta, Naveen Arivazhagan and Yonghui Wu | N/A | N/A |
| Lexically Constrained Neural Machine Translation with Levenshtein Transformer | Raymond Hendy Susanto, Shamil Chollampatt and Liling Tan | N/A | N/A |
| Lipschitz Constrained Parameter Initialization for Deep Transformers | Hongfei Xu, Qiuhui Liu, Josef van Genabith, Deyi Xiong and Jingyi Zhang | N/A | N/A |
| Logic-Guided Data Augmentation and Regularization for Consistent Question Answering | Akari Asai and Hannaneh Hajishirzi | N/A | N/A |
| Low Resource Sequence Tagging using Sentence Reconstruction | Tal Perl, Sriram Chaudhury and Raja Giryes | N/A | N/A |
| Make Up Your Mind! Adversarial Generation of Inconsistent Natural Language Explanations | Oana-Maria Camburu, Brendan Shillingford, Pasquale Minervini, Thomas Lukasiewicz and Phil Blunsom | N/A | N/A |
| Masking Actor Information Leads to Fairer Political Claims Detection | Erenay Dayanik and Sebastian Padó | N/A | N/A |
| Meta-Transfer Learning for Code-Switched Speech Recognition | Genta Indra Winata, Samuel Cahyawijaya, Zhaojiang Lin, Zihan Liu, Peng Xu and Pascale Fung | N/A | N/A |
| Mitigating Gender Bias Amplification in Distribution by Posterior Regularization | Shengyu Jia, Tao Meng, Jieyu Zhao and Kai-Wei Chang | N/A | N/A |
| Modeling Label Semantics for Predicting Emotional Reactions | Radhika Gaonkar, Heeyoung Kwon, Mohaddeseh Bastan, Niranjan Balasubramanian and Nathanael Chambers | N/A | N/A |
| Modeling Long Context for Task-Oriented Dialogue State Generation | Jun Quan and Deyi Xiong | N/A | N/A |
| Modeling Word Formation in English–German Neural Machine Translation | Marion Weller-Di Marco and Alexander Fraser | N/A | N/A |
| MOOCCube: A Large-scale Data Repository for NLP Applications in MOOCs | Jifan Yu, Gan Luo, Tong Xiao, Qingyang Zhong, Yuquan Wang, Wenzheng Feng, Junyi Luo, Chenyu Wang, Lei Hou, Juanzi Li, Zhiyuan Liu and Jie Tang | N/A | N/A |
| Multimodal and Multiresolution Speech Recognition with Transformers | Georgios Paraskevopoulos, Srinivas Parthasarathy, Aparna Khare and Shiva Sundaram | N/A | N/A |
| Multimodal Quality Estimation for Machine Translation | Shu Okabe, Frédéric Blain and Lucia Specia | N/A | N/A |
| Multimodal Transformer for Multimodal Machine Translation | Shaowei Yao and Xiaojun Wan | N/A | N/A |
| Named Entity Recognition as Dependency Parsing | Juntao Yu, Bernd Bohnet and Massimo Poesio | N/A | N/A |
| Negated and Misprimed Probes for Pretrained Language Models: Birds Can Talk, But Cannot Fly | Nora Kassner and Hinrich Schütze | N/A | N/A |
| Neural Graph Matching Networks for Chinese Short Text Matching | Lu Chen, Yanbin Zhao, Boer Lv, Lesheng Jin, Zhi Chen, Su Zhu and Kai Yu | N/A | N/A |
| Neural Temporal Opinion Modelling for Opinion Prediction on Twitter | Lixing Zhu, Yulan He and Deyu Zhou | N/A | N/A |
| Neural-DINF: A Neural Network based Framework for Measuring Document Influence | Jie Tan, Changlin Yang, Ying Li, Siliang Tang, Chen Huang and Yueting Zhuang | N/A | N/A |
| Non-Linear Instance-Based Cross-Lingual Mapping for Non-Isomorphic Embedding Spaces | Goran Glavaš and Ivan Vulić | N/A | N/A |
| “None of the Above”: Measure Uncertainty in Dialog Response Retrieval | Yulan Feng, Shikib Mehri, Maxine Eskenazi and Tiancheng Zhao | N/A | N/A |
| On Exposure Bias, Hallucination and Domain Shift in Neural Machine Translation | Chaojun Wang and Rico Sennrich | N/A | N/A |
| On Forgetting to Cite Older Papers: An Analysis of the ACL Anthology | Marcel Bollmann and Desmond Elliott | N/A | N/A |
| On Importance Sampling-Based Evaluation of Latent Language Models | Robert L Logan IV, Matt Gardner and Sameer Singh | N/A | N/A |
| On the Importance of Diversity in Question Generation for QA | Md Arafat Sultan, Shubham Chandel, Ramón Fernandez Astudillo and Vittorio Castelli | N/A | N/A |
| On the Spontaneous Emergence of Discrete and Compositional Signals | Nur Geffen Lan, Emmanuel Chemla and Shane Steinert-Threlkeld | N/A | N/A |
| OpinionDigest: A Simple Framework for Opinion Summarization | Yoshihiko Suhara, Xiaolan Wang, Stefanos Angelidis and Wang-Chiew Tan | N/A | N/A |
| Opportunistic Decoding with Timely Correction for Simultaneous Translation | Renjie Zheng, Mingbo Ma, Baigong Zheng, Kaibo Liu and Liang Huang | N/A | N/A |
| Overestimation of Syntactic Representation in Neural Language Models | Jordan Kodner and Nitish Gupta | N/A | N/A |
| Parallel Data Augmentation for Formality Style Transfer | Yi Zhang, Tao Ge and Xu Sun | N/A | N/A |
| Parallel Sentence Mining by Constrained Decoding | Pinzhen Chen, Nikolay Bogoychev, Kenneth Heafield and Faheem Kirefu | N/A | N/A |
| Posterior Calibrated Training on Sentence Classification Tasks | Taehee Jung, Dongyeop Kang, Hua Cheng, Lucas Mentch and Thomas Schaaf | N/A | N/A |
| Predicting Degrees of Technicality in Automatic Terminology Extraction | Anna Hätty, Dominik Schlechtweg, Michael Dorna and Sabine Schulte im Walde | N/A | N/A |
| Pretrained Transformers Improve Out-of-Distribution Robustness | Dan Hendrycks, Xiaoyuan Liu, Eric Wallace, Adam Dziedzic, Rishabh Krishnan and Dawn Song | N/A | N/A |
| Quantifying Attention Flow in Transformers | Samira Abnar and Willem Zuidema | N/A | N/A |
| Query Graph Generation for Answering Multi-hop Complex Questions from Knowledge Bases | Yunshi Lan and Jing Jiang | N/A | N/A |
| R4C: A Benchmark for Evaluating RC Systems to Get the Right Answer for the Right Reason | Naoya Inoue, Pontus Stenetorp and Kentaro Inui | N/A | N/A |
| Recollection versus Imagination: Exploring Human Memory and Cognition via Neural Language Models | Maarten Sap, Eric Horvitz, Yejin Choi, Noah A. Smith and James Pennebaker | N/A | N/A |
| Recursive Template-based Frame Generation for Task Oriented Dialog | Rashmi Gangadharaiah and Balakrishnan Narayanaswamy | N/A | N/A |
| Regularized Context Gates on Transformer for Machine Translation | Xintong Li, Lemao Liu, Rui Wang, Guoping Huang and Max Meng | N/A | N/A |
| Relation Extraction with Explanation | Hamed Shahbazi, Xiaoli Fern, Reza Ghaeini and Prasad Tadepalli | N/A | N/A |
| Representations of Syntax [MASK] Useful: Effects of Constituency and Dependency Structure in Recursive LSTMs | Michael Lepori, Tal Linzen and R. Thomas McCoy | N/A | N/A |
| Returning the N to NLP: Towards Contextually Personalized Classification Models | Lucie Flek | N/A | N/A |
| Reverse Engineering Configurations of Neural Text Generation Models | Yi Tay, Dara Bahri, Che Zheng, Clifford Brunk, Donald Metzler and Andrew Tomkins | N/A | N/A |
| Revisiting Higher-Order Dependency Parsers | Erick Fonseca and André F. T. Martins | N/A | N/A |
| Revisiting Unsupervised Relation Extraction | Thy Thy Tran, Phong Le and Sophia Ananiadou | N/A | N/A |
| SAFER: A Structure-free Approach for Certified Robustness to Adversarial Word Substitutions | Mao Ye, Chengyue Gong and Qiang Liu | N/A | N/A |
| Self-Attention Guided Copy Mechanism for Abstractive Summarization | Song Xu, Haoran Li, Peng Yuan, Youzheng Wu, Xiaodong He and Bowen Zhou | N/A | N/A |
| Self-Attention with Cross-Lingual Position Representation | Liang Ding, Longyue Wang and Dacheng Tao | N/A | N/A |
| Sentence Meta-Embeddings for Unsupervised Semantic Textual Similarity | Nina Poerner, Ulli Waltinger and Hinrich Schütze | N/A | N/A |
| Shape of synth to come: Why we should use synthetic data for English surface realization | Henry Elder, Robert Burke, Alexander O’Connor and Jennifer Foster | N/A | N/A |
| Shaping Visual Representations with Language for Few-Shot Classification | Jesse Mu, Percy Liang and Noah Goodman | N/A | N/A |
| Showing Your Work Doesn’t Always Work | Raphael Tang, Jaejun Lee, Ji Xin, Xinyu Liu, Yaoliang Yu and Jimmy Lin | N/A | N/A |
| Simple and Effective Retrieve-Edit-Rerank Text Generation | Nabil Hossain, Marjan Ghazvininejad and Luke Zettlemoyer | N/A | N/A |
| Simultaneous Translation Policies: From Fixed to Adaptive | Baigong Zheng, Kaibo Liu, Renjie Zheng, Mingbo Ma, Hairong Liu and Liang Huang | N/A | N/A |
| Single Model Ensemble using Pseudo-Tags and Distinct Vectors | Ryosuke Kuwabara, Jun Suzuki and Hideki Nakayama | N/A | N/A |
| Smart To-Do: Automatic Generation of To-Do Items from Emails | Sudipto Mukherjee, Subhabrata Mukherjee, Marcello Hasegawa, Ahmed Hassan Awadallah and Ryen White | N/A | N/A |
| Social Biases in NLP Models as Barriers for Persons with Disabilities | Ben Hutchinson, Vinodkumar Prabhakaran, Emily Denton, Kellie Webster, Yu Zhong and Stephen Denuyl | N/A | N/A |
| Soft Gazetteers for Low-Resource Named Entity Recognition | Shruti Rijhwani, Shuyan Zhou, Graham Neubig and Jaime Carbonell | N/A | N/A |
| Span-ConveRT: Few-shot Span Extraction for Dialog with Pretrained Conversational Representations | Samuel Coope, Tyler Farghly, Daniela Gerz, Ivan Vulić and Matthew Henderson | N/A | N/A |
| Stolen Probability: A Structural Weakness of Neural Language Models | David Demeter, Gregory Kimmel and Doug Downey | N/A | N/A |
| Successfully Applying the Stabilized Lottery Ticket Hypothesis to the Transformer Architecture | Christopher Brix, Parnia Bahar and Hermann Ney | N/A | N/A |
| SUPERT: Towards New Frontiers in Unsupervised Evaluation Metrics for Multi-Document Summarization | Yang Gao, Wei Zhao and Steffen Eger | N/A | N/A |
| Supervised Grapheme-to-Phoneme Conversion of Orthographic Schwas in Hindi and Punjabi | Aryaman Arora, Luke Gessler and Nathan Schneider | N/A | N/A |
| Syntactic Data Augmentation Increases Robustness to Inference Heuristics | Junghyun Min, R. Thomas McCoy, Dipanjan Das, Emily Pitler and Tal Linzen | N/A | N/A |
| Tagged Back-translation Revisited: Why Does It Really Work? | Benjamin Marie, Raphael Rubino and Atsushi Fujita | N/A | N/A |
| tBERT: Topic Models and BERT Joining Forces for Semantic Similarity Detection | Nicole Peinelt, Dong Nguyen and Maria Liakata | N/A | N/A |
| Template-Based Question Generation from Retrieved Sentences for Improved Unsupervised Question Answering | Alexander Fabbri, Patrick Ng, Zhiguo Wang, Ramesh Nallapati and Bing Xiang | N/A | N/A |
| Tetra-Tagging: Word-Synchronous Parsing with Linear-Time Inference | Nikita Kitaev and Dan Klein | N/A | N/A |
| Text Classification with Negative Supervision | Sora Ohashi, Junya Takayama, Tomoyuki Kajiwara, Chenhui Chu and Yuki Arase | N/A | N/A |
| To Pretrain or Not to Pretrain: Examining the Benefits of Pretraining on Resource Rich Tasks | Sinong Wang, Madian Khabsa and Hao Ma | N/A | N/A |
| Topological Sort for Sentence Ordering | Shrimai Prabhumoye, Ruslan Salakhutdinov and Alan W Black | N/A | N/A |
| Toward Better Storylines with Sentence-Level Language Models | Daphne Ippolito, David Grangier, Douglas Eck and Chris Callison-Burch | N/A | N/A |
| Towards Better Non-Tree Argument Mining: Proposition-Level Biaffine Parsing with Task-Specific Parameterization | Gaku Morio, Hiroaki Ozaki, Terufumi Morishita, Yuta Koreeda and Kohsuke Yanai | N/A | N/A |
| Towards end-2-end learning for predicting behavior codes from spoken utterances in psychotherapy conversations | Karan Singla, Zhuohao Chen, David Atkins and Shrikanth Narayanan | N/A | N/A |
| Towards Faithfully Interpretable NLP Systems: How should we define and evaluate faithfulness? | Alon Jacovi and Yoav Goldberg | N/A | N/A |
| Towards Open Domain Event Trigger Identification using Adversarial Domain Adaptation | Aakanksha Naik and Carolyn Rose | N/A | N/A |
| Transformers to Learn Hierarchical Contexts in Multiparty Dialogue for Span-based Question Answering | Changmao Li and Jinho D. Choi | N/A | N/A |
| Treebank Embedding Vectors for Out-of-domain Dependency Parsing | Joachim Wagner, James Barry and Jennifer Foster | N/A | N/A |
| Tree-Structured Neural Topic Model | Masaru Isonuma, Junichiro Mori, Danushka Bollegala and Ichiro Sakata | N/A | N/A |
| TriggerNER: Learning with Entity Triggers as Explanations for Named Entity Recognition | Bill Yuchen Lin, Dong-Ho Lee, Ming Shen, Ryan Moreno, Xiao Huang, Prashant Shiralkar and Xiang Ren | N/A | N/A |
| Two Birds, One Stone: A Simple, Unified Model for Text Generation from Structured and Unstructured Data | Hamidreza Shahidi, Ming Li and Jimmy Lin | N/A | N/A |
| Uncertain Natural Language Inference | Tongfei Chen, Zhengping Jiang, Adam Poliak, Keisuke Sakaguchi and Benjamin Van Durme | N/A | N/A |
| Understanding Advertisements with BERT | Kanika Kalra, Bhargav Kurma, Silpa Vadakkeeveetil Sreelatha, Manasi Patwardhan and Shirish Karande | N/A | N/A |
| Unsupervised FAQ Retrieval with Question Generation and BERT | Yosi Mass, Boaz Carmeli, Haggai Roitman and David Konopnicki | N/A | N/A |
| Using Context in Neural Machine Translation Training Objectives | Danielle Saunders, Felix Stahlberg and Bill Byrne | N/A | N/A |
| Variational Neural Machine Translation with Normalizing Flows | Hendra Setiawan, Matthias Sperber, Udhyakumar Nallasamy and Matthias Paulik | N/A | N/A |
| Verbal Multiword Expressions for Identification of Metaphor | Omid Rohanian, Marek Rei, Shiva Taslimipoor and Le An Ha | N/A | N/A |
| Video-Grounded Dialogues with Pretrained Generation Language Models | Hung Le and Steven C.H. Hoi | N/A | N/A |
| What Does BERT with Vision Look At? | Liunian Harold Li, Mark Yatskar, Da Yin, Cho-Jui Hsieh and Kai-Wei Chang | N/A | N/A |
| What is Learned in Visually Grounded Neural Syntax Acquisition | Noriyuki Kojima, Hadar Averbuch-Elor, Alexander Rush and Yoav Artzi | N/A | N/A |
| Why Overfitting Isn’t Always Bad: Retrofitting Cross-Lingual Word Embeddings to Dictionaries | Mozhi Zhang, Yoshinari Fujinuma, Michael J. Paul and Jordan Boyd-Graber | N/A | N/A |
| Will-They-Won’t-They: A Very Large Dataset for Stance Detection on Twitter | Costanza Conforti, Jakob Berndt, Mohammad Taher Pilehvar, Chryssi Giannitsarou, Flavio Toxvaerd and Nigel Collier | N/A | N/A |
| Words aren’t enough, their order matters: On the Robustness of Grounding Visual Referring Expressions | Arjun Akula, Spandana Gella, Yaser Al-Onaizan, Song-Chun Zhu and Siva Reddy | N/A | N/A |
| Worse WER, but Better BLEU? Leveraging Word Embedding as Intermediate in Multitask End-to-End Speech Translation | Shun-Po Chuang, Tzu-Wei Sung, Alexander H. Liu and Hung-yi Lee | N/A | N/A |
| Would you Rather? A New Benchmark for Learning Machine Alignment with Cultural Values and Social Preferences | Yi Tay, Donovan Ong, Jie Fu, Alvin Chan, Nancy Chen, Anh Tuan Luu and Chris Pal | N/A | N/A |
| You Don’t Have Time to Read This: An Exploration of Document Reading Time Prediction | Orion Weller, Jordan Hildebrandt, Ilya Reznik, Christopher Challis, E. Shannon Tass, Quinn Snell and Kevin Seppi | N/A | N/A |
| “You Sound Just Like Your Father” Commercial Machine Translation Systems Include Stylistic Biases | Dirk Hovy, Federico Bianchi and Tommaso Fornaciari | N/A | N/A |
| ZPR2: Joint Zero Pronoun Recovery and Resolution using Multi-Task Learning and BERT | Linfeng Song, Kun Xu, Yue Zhang, Jianshu Chen and Dong Yu | N/A | N/A |
| ADVISER: A Toolkit for Developing Multi-modal, Multi-domain and Socially-engaged Conversational Agents | Chia-Yu Li, Daniel Ortega, Dirk Väth, Florian Lux, Lindsey Vanderlyn, Maximilian Schmidt, Michael Neumann, Moritz Völkel, Pavel Denisov, Sabrina Jenne, Zorica Kacarevic and Ngoc Thang Vu | N/A | N/A |
| BENTO: A Visual Platform for Building Clinical NLP Pipelines Based on CodaLab | Yonghao Jin, Fei Li and Hong Yu | N/A | N/A |
| Clinical-Coder: Assigning Interpretable ICD-10 Codes to Chinese Clinical Notes | Pengfei Cao, Chenwei Yan, Xiangling Fu, Yubo Chen, Kang Liu, Jun Zhao, Shengping Liu and Weifeng Chong | N/A | N/A |
| CLIReval: Evaluating Machine Translation as a Cross-Lingual Information Retrieval Task | Shuo Sun, Suzanna Sia and Kevin Duh | N/A | N/A |
| Conversation Learner - A Machine Teaching Tool for Building Dialog Managers for Task-Oriented Dialog Systems | Swadheen Shukla, Lars Liden, Shahin Shayandeh, Eslam Kamal, Jinchao Li, Matt Mazzola, Thomas Park, Baolin Peng and Jianfeng Gao | N/A | N/A |
| ConvLab-2: An Open-Source Toolkit for Building, Evaluating, and Diagnosing Dialogue Systems | Qi Zhu, Zheng Zhang, Yan Fang, Xiang Li, Ryuichi Takanobu, Jinchao Li, Baolin Peng, Jianfeng Gao, Xiaoyan Zhu and Minlie Huang | N/A | N/A |
| DIALOGPT: Large-Scale Generative Pre-training for Conversational Response Generation | Yizhe Zhang, Siqi Sun, Michel Galley, Yen-Chun Chen, Chris Brockett, Xiang Gao, Jianfeng Gao, Jingjing Liu and Bill Dolan | N/A | N/A |
| Embedding-based Scientific Literature Discovery in a Text Editor Application | Onur Gökçe, Jonathan Prada, Nikola Nikolov, Nianlong Gu and Richard Hahnloser | N/A | N/A |
| ESPnet-ST: All-in-One Speech Translation Toolkit | Hirofumi Inaguma, Shun Kiyono, Kevin Duh, Shigeki Karita, Nelson Yalta, Tomoki Hayashi and Shinji Watanabe | N/A | N/A |
| EVIDENCEMINER: Textual Evidence Discovery for Life Sciences | Xuan Wang, Yingjun Guan, Weili Liu, Aabhas Chauhan, Enyi Jiang, Qi Li, David Liem, Dibakar Sigdel, John Caufield, Peipei Ping and Jiawei Han | N/A | N/A |
| exBERT: A Visual Analysis Tool to Explore Learned Representations in Transformer Models | Benjamin Hoover, Hendrik Strobelt and Sebastian Gehrmann | N/A | N/A |
| GAIA: A Fine-grained Multimedia Knowledge Extraction System | Manling Li, Alireza Zareian, Ying Lin, Xiaoman Pan, Spencer Whitehead, Brian Chen, Bo Wu, Heng Ji, Shih-Fu Chang, Clare Voss, Daniel Napierski and Marjorie Freedman | N/A | N/A |
| Interactive Task Learning from GUI-Grounded Natural Language Instructions and Demonstrations | Toby Jia-Jun Li, Tom Mitchell and Brad Myers | N/A | N/A |
| jiant: A Software Toolkit for Research on General-Purpose Text Understanding Models | Yada Pruksachatkun, Phil Yeres, Haokun Liu, Jason Phang, Phu Mon Htut, Alex Wang, Ian Tenney and Samuel R. Bowman | N/A | N/A |
| Label Noise in Context | Michael Desmond, Catherine Finegan-Dollak, Jeff Boston and Matt Arnold | N/A | N/A |
| LEAN-LIFE: A Label-Efficient Annotation Framework Towards Learning from Explanation | Dong-Ho Lee, Rahul Khanna, Bill Yuchen Lin, Seyeon Lee, Qinyuan Ye, Elizabeth Boschee, Leonardo Neves and Xiang Ren | N/A | N/A |
| LinggleWrite: a Coaching System for Essay Writing | Chung-Ting Tsai, Jhih-Jie Chen, Chingyu Yang and Jason Chang | N/A | N/A |
| MixingBoard: a Knowledgeable Stylized Integrated Text Generation Platform | Xiang Gao, Michel Galley and Bill Dolan | N/A | N/A |
| MMPE: A Multi-Modal Interface using Handwriting, Touch Reordering, and Speech Commands for Post-Editing Machine Translation | Nico Herbig, Santanu Pal, Tim Düwel, Kalliopi Maria Meladaki, Mahsa Monshizadeh, Vladislav Hnatovskiy, Antonio Krüger and Josef van Genabith | N/A | N/A |
| Multilingual Universal Sentence Encoder for Semantic Retrieval | Yinfei Yang, Daniel Cer, Amin Ahmad, Mandy Guo, Jax Law, Noah Constant, Gustavo Hernandez Abrego, Steve Yuan, Chris Tar, Yun-hsuan Sung, Brian Strope and Ray Kurzweil | N/A | N/A |
| Nakdan: Professional Hebrew Diacritizer | Avi Shmidman, Shaltiel Shmidman, Moshe Koppel and Yoav Goldberg | N/A | N/A |
| NLP Scholar: An Interactive Visual Explorer for Natural Language Processing Literature | Saif Mohammad | N/A | N/A |
| NSTM: Real-Time Query-Driven News Overview Composition at Bloomberg | Joshua Bambrick, Minjie Xu, Andy Almonte, Igor Malioutov, Guim Perarnau, Vittorio Selo and Iat Chong Chan | N/A | N/A |
| OpusFilter: A Configurable Parallel Corpus Filtering Toolbox | Mikko Aulamo, Sami Virpioja and Jörg Tiedemann | N/A | N/A |
| Penman: An Open-Source Library and Tool for AMR Graphs | Michael Wayne Goodman | N/A | N/A |
| Personalized PageRank with Syntagmatic Information for Multilingual Word Sense Disambiguation | Federico Scozzafava, Marco Maru, Fabrizio Brignone, Giovanni Torrisi and Roberto Navigli | N/A | N/A |
| Photon: A Robust Cross-Domain Text-to-SQL System | Jichuan Zeng, Xi Victoria Lin, Steven C.H. Hoi, Richard Socher, Caiming Xiong, Michael Lyu and Irwin King | N/A | N/A |
| Prta: A System to Support the Analysis of Propaganda Techniques in the News | Giovanni Da San Martino, Shaden Shaar, Yifan Zhang, Seunghak Yu, Alberto Barrón-Cedeño and Preslav Nakov | N/A | N/A |
| pyBART: Evidence-based Syntactic Transformations for IE | Aryeh Tiktinsky, Yoav Goldberg and Reut Tsarfaty | N/A | N/A |
| Stanza: A Python Natural Language Processing Toolkit for Many Human Languages | Peng Qi, Yuhao Zhang, Yuhui Zhang, Jason Bolton and Christopher D. Manning | N/A | N/A |
| Stimulating Creativity with FunLines: A Case Study of Humor Generation in Headlines | Nabil Hossain, John Krumm, Tanvir Sajed and Henry Kautz | N/A | N/A |
| SUPP.AI: finding evidence for supplement-drug interactions | Lucy Wang, Oyvind Tafjord, Arman Cohan, Sarthak Jain, Sam Skjonsberg, Carissa Schoenick, Nick Botner and Waleed Ammar | N/A | N/A |
| Syntactic Search by Example | Micah Shlain, Hillel Taub-Tabib, Shoval Sadde and Yoav Goldberg | N/A | N/A |
| SyntaxGym: An Online Platform for Targeted Evaluation of Language Models | Jon Gauthier, Jennifer Hu, Ethan Wilcox, Peng Qian and Roger Levy | N/A | N/A |
| Tabouid: a Wikipedia-based word guessing game | Timothée Bernard | N/A | N/A |
| Talk to Papers: Bringing Neural Question Answering to Academic Search | Tiancheng Zhao and Kyusong Lee | N/A | N/A |
| TextBrewer: An Open-Source Knowledge Distillation Toolkit for Natural Language Processing | Ziqing Yang, Yiming Cui, Zhipeng Chen, Wanxiang Che, Ting Liu, Shijin Wang and Guoping Hu | N/A | N/A |
| The Microsoft Toolkit of Multi-Task Deep Neural Networks for Natural Language Understanding | Xiaodong Liu, Yu Wang, Jianshu Ji, Hao Cheng, Xueyun Zhu, Emmanuel Awa, Pengcheng He, Weizhu Chen, Hoifung Poon, Guihong Cao and Jianfeng Gao | N/A | N/A |
| Torch-Struct: Deep Structured Prediction Library | Alexander Rush | N/A | N/A |
| Trialstreamer: Mapping and Browsing Medical Evidence in Real-Time | Benjamin Nye, Ani Nenkova, Iain Marshall and Byron C. Wallace | N/A | N/A |
| Usnea: An Authorship Tool for Interactive Fiction using Retrieval Based Semantic Parsing | Ben Swanson and Boris Smus | N/A | N/A |
| What’s The Latest? A Question-driven News Chatbot | Philippe Laban, John Canny and Marti A. Hearst | N/A | N/A |
| Xiaomingbot: A Multilingual Robot News Reporter | Runxin Xu, Jun Cao, Mingxuan Wang, Jiaze Chen, Hao Zhou, Ying Zeng, Yuping Wang, Li Chen, Xiang Yin, Xijin Zhang, Songcheng Jiang, Yuxuan Wang and Lei Li | N/A | N/A |
| #NotAWhore! A Computational Linguistic Perspective of Rape Culture and Victimization on Social Media | Ashima Suvarna and Grusha Bhalla | N/A | N/A |
| A Geometry-Inspired Attack for Generating Natural Language Adversarial Examples | Zhao Meng and Roger Wattenhofer | N/A | N/A |
| A Simple and Effective Dependency Parser for Telugu | Sneha Nallani, Manish Shrivastava and Dipti Sharma | N/A | N/A |
| Adaptive Transformers for Learning Multimodal Representations | Prajjwal Bhargava | N/A | N/A |
| AraDIC: Arabic Document Classification Using Image-Based Character Embeddings and Class-Balanced Loss | Mahmoud Daif, Shunsuke Kitada and Hitoshi Iyatomi | N/A | N/A |
| Building a Japanese Typo Dataset from Wikipedia’s Revision History | Yu Tanaka, Yugo Murawaki, Daisuke Kawahara and Sadao Kurohashi | N/A | N/A |
| Checkpoint Reranking: An Approach To Select Better Hypothesis For Neural Machine Translation Systems | Vinay Pandramish and Dipti Misra Sharma | N/A | N/A |
| Combining Subword Representations into Word-level Representations in the Transformer Architecture | Noe Casas, Marta R. Costa-jussà and José A. R. Fonollosa | N/A | N/A |
| Compositional generalization by factorizing alignment and translation | Jacob Russin, Jason Jo, Randall O’Reilly and Yoshua Bengio | N/A | N/A |
| Considering Likelihood in NLP Classification Explanations with Occlusion and Language Modeling | David Harbecke and Christoph Alt | N/A | N/A |
| Crossing the Line: Where do Demographic Variables Fit into Humor Detection? | J. A. Meaney | N/A | N/A |
| Cross-Lingual Disaster-related Multi-label Tweet Classification with Manifold Mixup | Jishnu Ray Chowdhury, Cornelia Caragea and Doina Caragea | N/A | N/A |
| Dominance as an Indicator of Rapport and Learning in Human-Agent Communication | Amanda Buddemeyer, Xiaoyi Tian and Erin Walker | N/A | N/A |
| Effectively Aligning and Filtering Parallel Corpora under Sparse Data Conditions | Steinþór Steingrímsson | N/A | N/A |
| Efficient Neural Machine Translation for Low-Resource Languages via Exploiting Related Languages | Vikrant Goyal, Sourav Kumar and Dipti Misra Sharma | N/A | N/A |
| Embeddings of Label Components for Sequence Labeling: A Case Study of Fine-grained Named Entity Recognition | Takuma Kato, Kaori Abe, Hiroki Ouchi, Shumpei Miyawaki, Jun Suzuki and Kentaro Inui | N/A | N/A |
| Enhancing Word Embeddings with Knowledge Extracted from Lexical Resources | Magdalena Biesialska, Bardia Rafieian and Marta R. Costa-jussà | N/A | N/A |
| Exploring Interpretability in Event Extraction: Multitask Learning of a Neural Event Classifier and an Explanation Decoder | Zheng Tang, Gus Hahn-Powell and Mihai Surdeanu | N/A | N/A |
| Exploring the Role of Context to Distinguish Rhetorical and Information-Seeking Questions | Yuan Zhuang and Ellen Riloff | N/A | N/A |
| Feature Difference Makes Sense: A medical image captioning model exploiting feature difference and tag information | Hyeryun Park, Kyungmo Kim, Jooyoung Yoon, Seongkeun Park and Jinwook Choi | N/A | N/A |
| Grammatical Error Correction Using Pseudo Learner Corpus Considering Learner’s Error Tendency | Yujin Takahashi, Satoru Katsumata and Mamoru Komachi | N/A | N/A |
| HGCN4MeSH: Hybrid Graph Convolution Network for MeSH Indexing | Miaomiao Yu, Yujiu Yang and Chenhui Li | N/A | N/A |
| How much complexity does an RNN architecture need to learn syntax-sensitive dependencies? | Gantavya Bhatt, Hritik Bansal, Rishubh Singh and Sumeet Agarwal | N/A | N/A |
| υBLEU: Uncertainty-Aware Automatic Evaluation Method for Open-Domain Dialogue Systems | Tsuta Yuma, Naoki Yoshinaga and Masashi Toyoda | N/A | N/A |
| Inducing Grammar from Long Short-Term Memory Networks by Shapley Decomposition | Yuhui Zhang and Allen Nie | N/A | N/A |
| Let’s be Humorous: Knowledge Enhanced Humor Generation | Hang Zhang, Dayiheng Liu, Jiancheng Lv and Luo Cheng | N/A | N/A |
| Logical Inferences with Comparatives and Generalized Quantifiers | Izumi Haruta, Koji Mineshima and Daisuke Bekki | N/A | N/A |
| Media Bias, the Social Sciences, and NLP: Automating Frame Analyses to Identify Bias by Word Choice and Labeling | Felix Hamborg | N/A | N/A |
| Multi-Task Neural Model for Agglutinative Language Translation | Yirong Pan, Xiao Li, Yating Yang and Rui Dong | N/A | N/A |
| Noise-Based Augmentation Techniques for Emotion Datasets: What do we Recommend? | Mimansa Jaiswal and Emily Mower Provost | N/A | N/A |
| Non-Topical Coherence in Social Talk: A Call for Dialogue Model Enrichment | Alex Lưu and Sophia A. Malamud | N/A | N/A |
| Pointwise Paraphrase Appraisal is Potentially Problematic | Hannah Chen, Yangfeng Ji and David Evans | N/A | N/A |
| Pre-training via Leveraging Assisting Languages for Neural Machine Translation | Haiyue Song, Raj Dabre, Zhuoyuan Mao, Fei Cheng, Sadao Kurohashi and Eiichiro Sumita | N/A | N/A |
| Preventing Critical Scoring Errors in Short Answer Scoring with Confidence Estimation | Hiroaki Funayama, Shota Sasaki, Yuichiroh Matsubayashi, Tomoya Mizumoto, Jun Suzuki, Masato Mita and Kentaro Inui | N/A | N/A |
| Reflection-based Word Attribute Transfer | Yoichi Ishibashi, Katsuhito Sudoh, Koichiro Yoshino and Satoshi Nakamura | N/A | N/A |
| Research on Task Discovery for Transfer Learning in Deep Neural Networks | Arda Akdemir | N/A | N/A |
| Research Replication Prediction Using Weakly Supervised Learning | Tianyi Luo, Xingyu Li, Hainan Wang and Yang Liu | N/A | N/A |
| RPD: A Distance Function Between Word Embeddings | Xuhui Zhou, Shujian Huang and Zaixiang Zheng | N/A | N/A |
| SCAR: Sentence Compression using Autoencoders for Reconstruction | Chanakya Malireddy, Tirth Maniar and Manish Shrivastava | N/A | N/A |
| Self-Attention is Not Only a Weight: Analyzing BERT with Vector Norms | Goro Kobayashi, Tatsuki Kuribayashi, Sho Yokoi and Kentaro Inui | N/A | N/A |
| Story-level Text Style Transfer: A Proposal | Yusu Qian | N/A | N/A |
| To compress or not to compress? A Finite-State approach to Nen verbal morphology | Saliha Muradoglu, Nicholas Evans and Hanna Suominen | N/A | N/A |
| Topic balancing with additive regularization of topic models | Eugeniia Veselova and Konstantin Vorontsov | N/A | N/A |
| Transferring Monolingual Model to Low-Resource Language: The Case of Tigrinya | Abrhalei Frezghi Tela, Abraham Woubie Zewoudie and Ville Hautamäki | N/A | N/A |
| Understanding Points of Correspondence between Sentences for Abstractive Summarization | Logan Lebanoff, John Muchovej, Franck Dernoncourt, Doo Soon Kim, Lidan Wang, Walter Chang and Fei Liu | N/A | N/A |
| Unsupervised Multilingual Sentence Embeddings for Parallel Corpus Mining | Ivana Kvapilíková, Mikel Artetxe, Gorka Labaka, Eneko Agirre and Ondřej Bojar | N/A | N/A |
| Unsupervised Paraphasia Classification in Aphasic Speech | Sharan Pai, Nikhil Sachdeva, Prince Sachdeva and Rajiv Ratn Shah | N/A | N/A |
| Why is penguin more similar to polar bear than to sea gull? Analyzing conceptual knowledge in distributional models | Pia Sommerauer | N/A | N/A |
| Zero-shot North Korean to English Neural Machine Translation by Character Tokenization and Phoneme Decomposition | Hwichan Kim, Tosho Hirasawa and Mamoru Komachi | N/A | N/A |
ACL 2021
| Title | Author | PDF_Link | Code_URL |
|---|---|---|---|
| Vocabulary Learning via Optimal Transport for Neural Machine Translation | Jingjing Xu, Hao Zhou, Chun Gan, Zaixiang Zheng and Lei Li | N/A | N/A |
| Including Signed Languages in Natural Language Processing | Kayo Yin, Amit Moryossef, Julie Hochgesang, Yoav Goldberg and Malihe Alikhani | N/A | N/A |
| All That’s ‘Human’ Is Not Gold: Evaluating Human Evaluation of Generated Text | Elizabeth Clark, Tal August, Sofia Serrano, Nikita Haduong, Suchin Gururangan and Noah A. Smith | N/A | N/A |
| Intrinsic Dimensionality Explains the Effectiveness of Language Model Fine-Tuning | Armen Aghajanyan, Sonal Gupta and Luke Zettlemoyer | N/A | N/A |
| Mind Your Outliers! Investigating the Negative Impact of Outliers on Active Learning for Visual Question Answering | Siddharth Karamcheti, Ranjay Krishna, Li Fei-Fei and Christopher Manning | N/A | N/A |
| Neural Machine Translation with Monolingual Translation Memory | Deng Cai, Yan Wang, Huayang Li, Wai Lam and Lemao Liu | N/A | N/A |
| Scientific Credibility of Machine Translation Research: A Meta-Evaluation of 769 Papers | Benjamin Marie, Atsushi Fujita and Raphael Rubino | N/A | N/A |
| UnNatural Language Inference | Koustuv Sinha, Prasanna Parthasarathi, Joelle Pineau and Adina Williams | N/A | N/A |
| Semi-Supervised Text Classification with Balanced Deep Representation Distributions | Changchun Li, Ximing Li and Jihong Ouyang | N/A | N/A |
| Generalising Multilingual Concept-to-Text NLG with Language Agnostic Delexicalisation | Giulio Zhou and Gerasimos Lampouras | N/A | N/A |
| How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models | Phillip Rust, Jonas Pfeiffer, Ivan Vulić, Sebastian Ruder and Iryna Gurevych | N/A | N/A |
| TicketTalk: Toward human-level performance with end-to-end, transaction-based dialog systems | Bill Byrne, Karthik Krishnamoorthi, Saravanan Ganesh and Mihir Kale | N/A | N/A |
| Unified Dual-view Cognitive Model for Interpretable Claim Verification | Lianwei Wu, Yuan Rao, Yuqian Lan, Ling Sun and Zhaoyin Qi | N/A | N/A |
| Dual Graph Convolutional Networks for Aspect-based Sentiment Analysis | Ruifan Li, Hao Chen, Fangxiang Feng, Zhanyu Ma, Xiaojie Wang and Eduard Hovy | N/A | N/A |
| Defense against Synonym Substitution-based Adversarial Attacks via Dirichlet Neighborhood Ensemble | Yi Zhou, Xiaoqing Zheng, Cho-Jui Hsieh, Kai-Wei Chang and Xuanjing Huang | N/A | N/A |
| Inter-GPS: Interpretable Geometry Problem Solving with Formal Language and Symbolic Reasoning | Pan Lu, Ran Gong, Shibiao Jiang, Liang Qiu, Siyuan Huang, Xiaodan Liang and Song-Chun Zhu | N/A | N/A |
| Evaluating Evaluation Measures for Ordinal Classification and Ordinal Quantification | Tetsuya Sakai | N/A | N/A |
| SocAoG: Incremental Graph Parsing for Social Relation Inference in Dialogues | Liang Qiu, Yuan Liang, Yizhou Zhao, Pan Lu, Baolin Peng, Zhou Yu, Ying Nian Wu and Song-Chun Zhu | N/A | N/A |
| Meta-KD: A Meta Knowledge Distillation Framework for Language Model Compression across Domains | Haojie Pan, Chengyu Wang, Minghui Qiu, Yichang Zhang, Yaliang Li and Jun Huang | N/A | N/A |
| DeCLUTR: Deep Contrastive Learning for Unsupervised Textual Representations | John Giorgi, Osvald Nitski, Bo Wang and Gary Bader | N/A | N/A |
| A Sweet Rabbit Hole by DARCY: Using Honeypots to Detect Universal Trigger’s Adversarial Attacks | Thai Le, Noseong Park and Dongwon Lee | N/A | N/A |
| Cross-Lingual Abstractive Summarization with Limited Parallel Resources | Yu Bai, Yang Gao and Heyan Huang | N/A | N/A |
| Rewriter-Evaluator Architecture for Neural Machine Translation | Yangming Li and Kaisheng Yao | N/A | N/A |
| HERALD: An Annotation Efficient Method to Detect User Disengagement in Social Conversations | Weixin Liang, Kai-Hui Liang and Zhou Yu | N/A | N/A |
| How is BERT surprised? Layerwise detection of linguistic anomalies | Bai Li, Zining Zhu, Guillaume Thomas, Yang Xu and Frank Rudzicz | N/A | N/A |
| Unsupervised Extractive Summarization-Based Representations for Accurate and Explainable Collaborative Filtering | Reinald Adrian Pugoy and Hung-Yu Kao | N/A | N/A |
| Prefix-Tuning: Optimizing Continuous Prompts for Generation | Xiang Lisa Li and Percy Liang | N/A | N/A |
| Polyjuice: Generating Counterfactuals for Explaining, Evaluating, and Improving Models | Tongshuang Wu, Marco Tulio Ribeiro, Jeffrey Heer and Daniel Weld | N/A | N/A |
| KACE: Generating Knowledge Aware Contrastive Explanations for Natural Language Inference | Qianglong Chen, Feng Ji, Xiangji Zeng, Feng-Lin Li, Ji Zhang, Haiqing Chen and Yin Zhang | N/A | N/A |
| Multi-Label Few-Shot Learning for Aspect Category Detection | Mengting Hu, Shiwan Zhao, Honglei Guo, Chao Xue, Hang Gao, Tiegang Gao, Renhong Cheng and Zhong Su | N/A | N/A |
| Word Sense Disambiguation: Towards Interactive Context Exploitation from Both Word and Sense Perspectives | Ming Wang and Yinglin Wang | N/A | N/A |
| Crafting Adversarial Examples for Neural Machine Translation | Xinze Zhang, Junzhe Zhang, Zhenhua Chen and Kun He | N/A | N/A |
| Dual Reader-Parser on Hybrid Textual and Tabular Evidence for Open Domain Question Answering | Alexander Hanbo Li, Patrick Ng, Peng Xu, Henghui Zhu, Zhiguo Wang and Bing Xiang | N/A | N/A |
| PhotoChat: A Human-Human Dialogue Dataset With Photo Sharing Behavior For Joint Image-Text Modeling | Xiaoxue Zang, Lijuan Liu, Maria Wang, Yang Song, Hao Zhang and Jindong Chen | N/A | N/A |
| Comprehensive Study: How the Context Information of Different Granularity Affects Dialogue State Tracking? | Puhai Yang, Heyan Huang and Xian-Ling Mao | N/A | N/A |
| Control Image Captioning Spatially and Temporally | Kun Yan, Lei Ji, Huaishao Luo, Ming Zhou, Nan Duan and Shuai Ma | N/A | N/A |
| Super Tickets in Pre-Trained Language Models: From Model Compression to Improving Generalization | Chen Liang, Simiao Zuo, Minshuo Chen, Haoming Jiang, Xiaodong Liu, Pengcheng He, Tuo Zhao and Weizhu Chen | N/A | N/A |
| Named Entity Recognition with Small Strongly Labeled and Large Weakly Labeled Data | Haoming Jiang, Danqing Zhang, Tianyu Cao, Bing Yin and Tuo Zhao | N/A | N/A |
| Competence-based Multimodal Curriculum Learning for Medical Report Generation | Fenglin Liu, Shen Ge and Xian Wu | N/A | N/A |
| Locate and Label: A Two-stage Identifier for Nested Named Entity Recognition | Yongliang Shen, Xinyin Ma, Zeqi Tan, Shuai Zhang, Wen Wang and Weiming Lu | N/A | N/A |
| A Joint Model for Dropped Pronoun Recovery and Conversational Discourse Parsing in Chinese Conversational Speech | Jingxuan Yang, Kerui Xu, Jun Xu, Si Li, Sheng Gao, Jun Guo, Nianwen Xue and Ji-Rong Wen | N/A | N/A |
| OntoED: Low-resource Event Detection with Ontology Embedding | Shumin Deng, Ningyu Zhang, Luoqiu Li, Chen Hui, Tou Huaixiao, Mosha Chen, Fei Huang and Huajun Chen | N/A | N/A |
| Conditional Generation of Temporally-ordered Event Sequences | Shih-Ting Lin, Nathanael Chambers and Greg Durrett | N/A | N/A |
| ProtAugment: Intent Detection Meta-Learning through Unsupervised Diverse Paraphrasing | Thomas Dopierre, Christophe Gravier and Wilfried Logerais | N/A | N/A |
| Subsequence Based Deep Active Learning for Named Entity Recognition | Puria Radmard, Yassir Fathullah and Aldo Lipani | N/A | N/A |
| Adversarial Learning for Discourse Rhetorical Structure Parsing | Longyin Zhang, Fang Kong and Guodong Zhou | N/A | N/A |
| Hierarchical Context-aware Network for Dense Video Event Captioning | Lei Ji, Xianglin Guo, Haoyang Huang and Xilin Chen | N/A | N/A |
| BERTifying the Hidden Markov Model for Multi-Source Weakly Supervised Named Entity Recognition | Yinghao Li, Pranav Shetty, Lucas Liu, Chao Zhang and Le Song | N/A | N/A |
| Maria: A Visual Experience Powered Conversational Agent | Zujie Liang, Huang Hu, Can Xu, Chongyang Tao, Xiubo Geng, Yining Chen, Fan Liang and Daxin Jiang | N/A | N/A |
| R2D2: Recursive Transformer based on Differentiable Tree for Interpretable Hierarchical Language Modeling | Xiang Hu, Haitao Mi, Zujie Wen, Yafang Wang, Yi Su, Jing Zheng and Gerard De Melo | N/A | N/A |
| Discovering Dialog Structure Graph for Coherent Dialog Generation | Jun Xu, Zeyang Lei, Haifeng Wang, Zheng-Yu Niu, Hua Wu and Wanxiang Che | N/A | N/A |
| LayoutLMv2: Multi-modal Pre-training for Visually-rich Document Understanding | Yang Xu, Yiheng Xu, Tengchao Lv, Lei Cui, Furu Wei, Guoxin Wang, Yijuan Lu, Dinei Florencio, Cha Zhang, Wanxiang Che, Min Zhang and Lidong Zhou | N/A | N/A |
| Improving the Faithfulness of Attention-based Explanations with Task-specific Information for Text Classification | George Chrysostomou and Nikolaos Aletras | N/A | N/A |
| Text-Free Image-to-Speech Synthesis Using Learned Segmental Units | Wei-Ning Hsu, David Harwath, Tyler Miller, Christopher Song and James Glass | N/A | N/A |
| Directed Acyclic Graph Network for Conversational Emotion Recognition | Weizhou Shen, Siyue Wu, Yunyi Yang and Xiaojun Quan | N/A | N/A |
| Knowledge-Enriched Event Causality Identification via Latent Structure Induction Networks | Pengfei Cao, Xinyu Zuo, Yubo Chen, Kang Liu, Jun Zhao, Yuguang Chen and Weihua Peng | N/A | N/A |
| BACO: A Background Knowledge- and Content-Based Framework for Citing Sentence Generation | Yubin Ge, Ly Dinh, Xiaofeng Liu, Jinsong Su, Ziyao Lu, Ante Wang and Jana Diesner | N/A | N/A |
| Document-level Event Extraction via Heterogeneous Graph-based Interaction Model with a Tracker | Runxin Xu, Tianyu Liu, Lei Li and Baobao Chang | N/A | N/A |
| Psycholinguistic Tripartite Graph Network for Personality Detection | Tao Yang, Feifan Yang, Haolan Ouyang and Xiaojun Quan | N/A | N/A |
| Mention Flags (MF): Constraining Transformer-based Text Generators | Yufei Wang, Ian Wood, Stephen Wan, Mark Dras and Mark Johnson | N/A | N/A |
| Guiding the Growth: Difficulty-Controllable Question Generation through Step-by-Step Rewriting | Yi Cheng, Siyao Li, Bang Liu, Ruihui Zhao, Sujian Li, Chenghua Lin and Yefeng Zheng | N/A | N/A |
| OpenMEVA: A Benchmark for Evaluating Open-ended Story Generation Metrics | Jian Guan, Zhexin Zhang, Zhuoer Feng, Zitao Liu, Wenbiao Ding, Xiaoxi Mao, Changjie Fan and Minlie Huang | N/A | N/A |
| Improving Encoder by Auxiliary Supervision Tasks for Table-to-Text Generation | Liang Li, Can Ma, Yinliang Yue and Dayong Hu | N/A | N/A |
| Automated Concatenation of Embeddings for Structured Prediction | Xinyu Wang, Yong Jiang, Nguyen Bach, Tao Wang, Zhongqiang Huang, Fei Huang and Kewei Tu | N/A | N/A |
| Improving Factual Consistency of Abstractive Summarization via Question Answering | Feng Nan, Cicero Nogueira Dos Santos, Henghui Zhu, Patrick Ng, Kathleen McKeown, Ramesh Nallapati, Dejiao Zhang, Zhiguo Wang, Andrew O. Arnold and Bing Xiang | N/A | N/A |
| Explanations for CommonsenseQA: New Dataset and Models | Shourya Aggarwal, Divyanshu Mandowara, Vishwajeet Agrawal, Dinesh Khandelwal, Parag Singla and Dinesh Garg | N/A | N/A |
| IrEne: Interpretable Energy Prediction for Transformers | Qingqing Cao, Yash Kumar Lal, Harsh Trivedi, Aruna Balasubramanian and Niranjan Balasubramanian | N/A | N/A |
| Writing by Memorizing: Hierarchical Retrieval-based Medical Report Generation | Xingyi Yang, Muchao Ye, Quanzeng You and Fenglong Ma | N/A | N/A |
| Long-Span Summarization via Local Attention and Content Selection | Potsawee Manakul and Mark Gales | N/A | N/A |
| Claim Matching Beyond English to Scale Global Fact-Checking | Ashkan Kazemi, Kiran Garimella, Devin Gaffney and Scott Hale | N/A | N/A |
| A Large-Scale Chinese Multimodal NER Dataset with Speech Clues | Dianbo Sui, Zhengkun Tian, Yubo Chen, Kang Liu and Jun Zhao | N/A | N/A |
| Measuring Fine-Grained Domain Relevance of Terms: A Hierarchical Core-Fringe Approach | Jie Huang, Kevin Chang, JinJun Xiong and Wen-mei Hwu | N/A | N/A |
| Answering Ambiguous Questions through Generative Evidence Fusion and Round-Trip Prediction | Yifan Gao, Henghui Zhu, Patrick Ng, Cicero Nogueira Dos Santos, Zhiguo Wang, Feng Nan, Dejiao Zhang, Ramesh Nallapati, Andrew O. Arnold and Bing Xiang | N/A | N/A |
| Out-of-Scope Intent Detection with Self-Supervision and Discriminative Training | Li-Ming Zhan, Haowen Liang, Bo Liu, Lu Fan, Xiao-Ming Wu and Albert Y.S. Lam | N/A | N/A |
| SemFace: Pre-training Encoder and Decoder with a Semantic Interface for Neural Machine Translation | Shuo Ren, Long Zhou, Shujie Liu, Furu Wei, Ming Zhou and Shuai Ma | N/A | N/A |
| Consistency Regularization for Cross-Lingual Fine-Tuning | Bo Zheng, Li Dong, Shaohan Huang, Wenhui Wang, Zewen Chi, Saksham Singhal, Wanxiang Che, Ting Liu, Xia Song and Furu Wei | N/A | N/A |
| Improving Document Representations by Generating Pseudo Query Embeddings for Dense Retrieval | Hongyin Tang, Xingwu Sun, Beihong Jin, Jingang Wang, Fuzheng Zhang and Wei Wu | N/A | N/A |
| Improving Pretrained Cross-Lingual Language Models via Self-Labeled Word Alignment | Zewen Chi, Li Dong, Bo Zheng, Shaohan Huang, Xian-Ling Mao, Heyan Huang and Furu Wei | N/A | N/A |
| GhostBERT: Generate More Features with Cheap Operations for BERT | Zhiqi Huang, Lu Hou, Lifeng Shang, Xin Jiang, Xiao Chen and Qun Liu | N/A | N/A |
| Improving Zero-Shot Translation by Disentangling Positional Information | Danni Liu, Jan Niehues, James Cross, Francisco Guzmán and Xian Li | N/A | N/A |
| Diversifying Dialog Generation via Adaptive Label Smoothing | Yida Wang, Yinhe Zheng, Yong Jiang and Minlie Huang | N/A | N/A |
| Supporting Cognitive and Emotional Empathic Writing of Students | Thiemo Wambsganss, Christina Niklaus, Matthias Söllner, Siegfried Handschuh and Jan Marco Leimeister | N/A | N/A |
| Fast and Accurate Neural Machine Translation with Translation Memory | Qiuxiang He, Guoping Huang, Qu Cui, Li Li and Lemao Liu | N/A | N/A |
| Syntax-Enhanced Pre-trained Model | Zenan Xu, Daya Guo, Duyu Tang, Qinliang Su, Linjun Shou, Ming Gong, Wanjun Zhong, Xiaojun Quan, Daxin Jiang and Nan Duan | N/A | N/A |
| Towards Propagation Uncertainty: Edge-enhanced Bayesian Graph Convolutional Networks for Rumor Detection | Lingwei Wei, Dou Hu, Wei Zhou, Zhaojuan Yue and Songlin Hu | N/A | N/A |
| PLOME: Pre-training with Misspelled Knowledge for Chinese Spelling Correction | Shulin Liu, Tao Yang, Tianchi Yue, Feng Zhang and Di Wang | N/A | N/A |
| Concept-Based Label Embedding via Dynamic Routing for Hierarchical Text Classification | Xuepeng Wang, Li Zhao, Bing Liu, Tao Chen, Feng Zhang and Di Wang | N/A | N/A |
| Joint Verification and Reranking for Open Fact Checking Over Tables | Michael Sejr Schlichtkrull, Vladimir Karpukhin, Barlas Oguz, Mike Lewis, Wen-tau Yih and Sebastian Riedel | N/A | N/A |
| CoSQA: 20,000+ Web Queries for Code Search and Question Answering | Junjie Huang, Duyu Tang, Linjun Shou, Ming Gong, Ke Xu, Daxin Jiang, Ming Zhou and Nan Duan | N/A | N/A |
| BoB: BERT Over BERT for Training Persona-based Dialogue Models from Limited Personalized Data | Haoyu Song, Yan Wang, Kaiyan Zhang, Wei-Nan Zhang and Ting Liu | N/A | N/A |
| Position Bias Mitigation: A Knowledge-Aware Graph Model for Emotion Cause Extraction | Hanqi Yan, Lin Gui, Gabriele Pergola and Yulan He | N/A | N/A |
| Evaluation of Thematic Coherence in Microblogs | Iman Munire Bilal, Bo Wang, Maria Liakata, Rob Procter and Adam Tsakalidis | N/A | N/A |
| Human-in-the-Loop for Data Collection: a Multi-Target Counter Narrative Dataset to Fight Online Hate Speech | Margherita Fanton, Helena Bonaldi, Serra Sinem Tekiroğlu and Marco Guerini | N/A | N/A |
| Factorising Meaning and Form for Intent-Preserving Paraphrasing | Tom Hosking and Mirella Lapata | N/A | N/A |
| EnsLM: Ensemble Language Model for Data Diversity by Semantic Clustering | Zhibin Duan, Hao Zhang, Chaojie Wang, Zhengjue Wang, Bo Chen and Mingyuan Zhou | N/A | N/A |
| LearnDA: Learnable Knowledge-Guided Data Augmentation for Event Causality Identification | Xinyu Zuo, Pengfei Cao, Yubo Chen, Kang Liu, Jun Zhao, Weihua Peng and Yuguang Chen | N/A | N/A |
| Coreference Reasoning in Machine Reading Comprehension | Mingzhu Wu, Nafise Sadat Moosavi, Dan Roth and Iryna Gurevych | N/A | N/A |
| Evidence-based Factual Error Correction | James Thorne and Andreas Vlachos | N/A | N/A |
| StructFormer: Joint Unsupervised Induction of Dependency and Constituency Structure from Masked Language Modeling | Yikang Shen, Yi Tay, Che Zheng, Dara Bahri, Donald Metzler and Aaron Courville | N/A | N/A |
| Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language Models | Tyler Chang, Yifan Xu, Weijian Xu and Zhuowen Tu | N/A | N/A |
| Learning Faithful Representations of Causal Graphs | Ananth Balashankar and Lakshminarayanan Subramanian | N/A | N/A |
| The Art of Abstention: Selective Prediction and Error Regularization for Natural Language Processing | Ji Xin, Raphael Tang, Yaoliang Yu and Jimmy Lin | N/A | N/A |
| Ruddit: Norms of Offensiveness for English Reddit Comments | Rishav Hada, Sohi Sudhir, Pushkar Mishra, Helen Yannakoudakis, Saif M. Mohammad and Ekaterina Shutova | N/A | N/A |
| Learning Language and Multimodal Privacy-Preserving Markers of Mood from Mobile Data | Paul Pu Liang, Terrance Liu, Anna Cai, Michal Muszynski, Ryo Ishii, Nick Allen, Randy Auerbach, David Brent, Ruslan Salakhutdinov and Louis-Philippe Morency | N/A | N/A |
| Prosodic segmentation for parsing spoken dialogue | Elizabeth Nielsen, Mark Steedman and Sharon Goldwater | N/A | N/A |
| End-to-End Training of Neural Retrievers for Open-Domain Question Answering | Devendra Sachan, Mostofa Patwary, Mohammad Shoeybi, Neel Kant, Wei Ping, William L. Hamilton and Bryan Catanzaro | N/A | N/A |
| Using Meta-Knowledge Mined from Identifiers to Improve Intent Recognition in Conversational Systems | Claudio Pinhanez, Paulo Cavalin, Victor Henrique Alves Ribeiro, Ana Appel, Heloisa Candello, Julio Nogima, Mauro Pichiliani, Melina Guerra, Maira De Bayser, Gabriel Malfatti and Henrique Ferreira | N/A | N/A |
| TextSETTR: Few-Shot Text Style Extraction and Tunable Targeted Restyling | Parker Riley, Noah Constant, Mandy Guo, Girish Kumar, David Uthus and Zarana Parekh | N/A | N/A |
| I like fish, especially dolphins: Addressing Contradictions in Dialogue Modeling | Yixin Nie, Mary Williamson, Mohit Bansal, Douwe Kiela and Jason Weston | N/A | N/A |
| Implicit Representations of Meaning in Neural Language Models | Belinda Z. Li, Maxwell Nye and Jacob Andreas | N/A | N/A |
| Personalized Transformer for Explainable Recommendation | Lei Li, Yongfeng Zhang and Li Chen | N/A | N/A |
| Structural Knowledge Distillation: Tractably Distilling Information for Structured Predictor | Xinyu Wang, Yong Jiang, Zhaohui Yan, Zixia Jia, Nguyen Bach, Tao Wang, Zhongqiang Huang, Fei Huang and Kewei Tu | N/A | N/A |
| ERICA: Improving Entity and Relation Understanding for Pre-trained Language Models via Contrastive Learning | Yujia Qin, Yankai Lin, Ryuichi Takanobu, Zhiyuan Liu, Peng Li, Heng Ji, Minlie Huang, Maosong Sun and Jie Zhou | N/A | N/A |
| RepSum: Unsupervised Dialogue Summarization based on Replacement Strategy | Xiyan Fu, Yating Zhang, Tianyi Wang, Xiaozhong Liu, Changlong Sun and Zhenglu Yang | N/A | N/A |
| A Conditional Splitting Framework for Efficient Constituency Parsing | Thanh-Tung Nguyen, Xuan-Phi Nguyen, Shafiq Joty and Xiaoli Li | N/A | N/A |
| Reliability Testing for Natural Language Processing Systems | Samson Tan, Shafiq Joty, Kathy Baxter, Araz Taeihagh, Gregory A. Bennett and Min-Yen Kan | N/A | N/A |
| Multi-Head Highly Parallelized LSTM Decoder for Neural Machine Translation | Hongfei Xu, Qiuhui Liu, Josef Van Genabith, Deyi Xiong and Meng Zhang | N/A | N/A |
| N-ary Constituent Tree Parsing with Recursive Semi-Markov Model | Xin Xin, Jinlong Li and Zeqi Tan | N/A | N/A |
| Math Word Problem Solving with Explicit Numerical Values | Qinzhuo Wu, Qi Zhang, Zhongyu Wei and Xuanjing Huang | N/A | N/A |
| Improving Formality Style Transfer with Context-Aware Rule Injection | Zonghai Yao and Hong Yu | N/A | N/A |
| CDRNN: Discovering Complex Dynamics in Human Language Processing | Cory Shain | N/A | N/A |
| DeepRapper: Neural Rap Generation with Rhyme and Rhythm Modeling | Lanqing Xue, Kaitao Song, Duocai Wu, Xu Tan, Nevin L. Zhang, Tao Qin, Wei-Qiang Zhang and Tie-Yan Liu | N/A | N/A |
| CIL: Contrastive Instance Learning Framework for Distantly Supervised Relation Extraction | Tao Chen, Haizhou Shi, Siliang Tang, Zhigang Chen, Fei Wu and Yueting Zhuang | N/A | N/A |
| COSY: COunterfactual SYntax for Cross-Lingual Understanding | Sicheng Yu, Hao Zhang, Yulei Niu, Qianru Sun and Jing Jiang | N/A | N/A |
| A Knowledge-Guided Framework for Frame Identification | Xuefeng Su, Ru Li, Xiaoli Li, Jeff Z. Pan, Hu Zhang, Qinghua Chai and Xiaoqi Han | N/A | N/A |
| A Bidirectional Transformer Based Alignment Model for Unsupervised Word Alignment | Jingyi Zhang and Josef Van Genabith | N/A | N/A |
| Beyond Offline Mapping: Learning Cross-lingual Word Embeddings through Context Anchoring | Aitor Ormazabal, Mikel Artetxe, Aitor Soroa, Gorka Labaka and Eneko Agirre | N/A | N/A |
| Few-Shot Question Answering by Pretraining Span Selection | Ori Ram, Yuval Kirstain, Jonathan Berant, Amir Globerson and Omer Levy | N/A | N/A |
| Few-NERD: A Few-shot Named Entity Recognition Dataset | Ning Ding, Guangwei Xu, Yulin Chen, Xiaobin Wang, Xu Han, Pengjun Xie, Haitao Zheng and Zhiyuan Liu | N/A | N/A |
| Matching Distributions between Model and Data: Cross-domain Knowledge Distillation for Unsupervised Domain Adaptation | Bo Zhang, Xiaoming Zhang, Yun Liu, Lei Cheng and Zhoujun Li | N/A | N/A |
| Changes in European Solidarity Before and During COVID-19: Evidence from a Large Crowd- and Expert-Annotated Twitter Dataset | Alexandra Ils, Dan Liu, Daniela Grunow and Steffen Eger | N/A | N/A |
| SENT: Sentence-level Distant Relation Extraction via Negative Training | Ruotian Ma, Tao Gui, Linyang Li, Qi Zhang, Xuanjing Huang and Yaqian Zhou | N/A | N/A |
| BinaryBERT: Pushing the Limit of BERT Quantization | Haoli Bai, Wei Zhang, Lu Hou, Lifeng Shang, Jin Jin, Xin Jiang, Qun Liu, Michael Lyu and Irwin King | N/A | N/A |
| Modularized Interaction Network for Named Entity Recognition | Fei Li, Zheng Wang, Siu Cheung Hui, Lejian Liao, Dandan Song, Jing Xu, Guoxiu He and Meihuizi Jia | N/A | N/A |
| Few-Shot Text Ranking with Meta Adapted Synthetic Weak Supervision | Si Sun, Yingzhuo Qian, Zhenghao Liu, Chenyan Xiong, Kaitao Zhang, Jie Bao, Zhiyuan Liu and Paul Bennett | N/A | N/A |
| Breaking the Corpus Bottleneck for Context-Aware Neural Machine Translation with Cross-Task Pre-training | Linqing Chen, Junhui Li, Zhengxian Gong, Boxing Chen, Weihua Luo, Min Zhang and Guodong Zhou | N/A | N/A |
| MultiMET: A Multimodal Dataset for Metaphor Understanding | Dongyu Zhang, Minghao Zhang, Heting Zhang, Liang Yang and Hongfei Lin | N/A | N/A |
| OoMMix: Out-of-manifold Regularization in Contextual Embedding Space for Text Classification | Seonghyeon Lee, Dongha Lee and Hwanjo Yu | N/A | N/A |
| Breaking Down the Invisible Wall of Informal Fallacies in Online Discussions | Saumya Sahai, Oana Balalau and Roxana Horincar | N/A | N/A |
| Towards Quantifiable Dialogue Coherence Evaluation | Zheng Ye, Liucun Lu, Lishan Huang, Liang Lin and Xiaodan Liang | N/A | N/A |
| Contributions of Transformer Attention Heads in Multi- and Cross-lingual Tasks | Weicheng Ma, Kai Zhang, Renze Lou, Lili Wang and Soroush Vosoughi | N/A | N/A |
| Evaluation Examples are not Equally Informative: How should that change NLP Leaderboards? | Pedro Rodriguez, Joe Barrow, Alexander Miserlis Hoyle, John P. Lalor, Robin Jia and Jordan Boyd-Graber | N/A | N/A |
| Capturing Event Argument Interaction via A Bi-Directional Entity-Level Recurrent Decoder | Xi Xiangyu, Wei Ye, Shikun Zhang, Quanxiu Wang, Huixing Jiang and Wei Wu | N/A | N/A |
| Parameter-Efficient Transfer Learning with Diff Pruning | Demi Guo, Alexander Rush and Yoon Kim | N/A | N/A |
| Robust Knowledge Graph Completion with Stacked Convolutions and a Student Re-Ranking Network | Justin Lovelace, Denis Newman-Griffis, Shikhar Vashishth, Jill Fain Lehman and Carolyn Rosé | N/A | N/A |
| A Human-machine Collaborative Framework for Evaluating Malevolence in Dialogues | Yangjun Zhang, Pengjie Ren and Maarten De Rijke | N/A | N/A |
| ADEPT: An Adjective-Dependent Plausibility Task | Ali Emami, Ian Porada, Alexandra Olteanu, Kaheer Suleman, Adam Trischler and Jackie Chi Kit Cheung | N/A | N/A |
| Database reasoning over text | James Thorne, Majid Yazdani, Marzieh Saeidi, Fabrizio Silvestri, Sebastian Riedel and Alon Halevy | N/A | N/A |
| Generating Landmark Navigation Instructions from Maps as a Graph-to-Text Problem | Raphael Schumann and Stefan Riezler | N/A | N/A |
| A Sequence-to-Sequence Approach to Dialogue State Tracking | Yue Feng, Yang Wang and Hang Li | N/A | N/A |
| Verb Knowledge Injection for Multilingual Event Processing | Olga Majewska, Ivan Vulić, Goran Glavaš, Edoardo Maria Ponti and Anna Korhonen | N/A | N/A |
| Generating Relevant and Coherent Dialogue Responses using Self-Separated Conditional Variational AutoEncoders | Bin Sun, Shaoxiong Feng, Yiwei Li, Jiamou Liu and Kan Li | N/A | N/A |
| A Span-Based Model for Joint Overlapped and Discontinuous Named Entity Recognition | Fei Li, ZhiChao Lin, Meishan Zhang and Donghong Ji | N/A | N/A |
| Neural Stylistic Response Generation with Disentangled Latent Variables | Qingfu Zhu, Wei-Nan Zhang, Ting Liu and William Yang Wang | N/A | N/A |
| A Cognitive Regularizer for Language Modeling | Jason Wei, Clara Meister and Ryan Cotterell | N/A | N/A |
| Shortformer: Better Language Modeling using Shorter Inputs | Ofir Press, Noah A. Smith and Mike Lewis | N/A | N/A |
| An End-to-End Progressive Multi-Task Learning Framework for Medical Named Entity Recognition and Normalization | Baohang Zhou, Xiangrui Cai, Ying Zhang and Xiaojie Yuan | N/A | N/A |
| Making Pre-trained Language Models Better Few-shot Learners | Tianyu Gao, Adam Fisch and Danqi Chen | N/A | N/A |
| Counterfactual Inference for Text Classification Debiasing | Chen Qian, Fuli Feng, Lijie Wen, Chunping Ma and Pengjun Xie | N/A | N/A |
| Intent Classification and Slot Filling for Privacy Policies | Wasi Ahmad, Jianfeng Chi, Tu Le, Thomas Norton, Yuan Tian and Kai-Wei Chang | N/A | N/A |
| One2Set: Generating Diverse Keyphrases as a Set | Jiacheng Ye, Tao Gui, Yichao Luo, Yige Xu and Qi Zhang | N/A | N/A |
| Dual Slot Selector via Local Reliability Verification for Dialogue State Tracking | Jinyu Guo, Kai Shuang, Jijie Li and Zihan Wang | N/A | N/A |
| Integrating Semantics and Neighborhood Information with Graph-Driven Generative Models for Document Retrieval | Zijing Ou, Qinliang Su, Jianxing Yu, Bang Liu, Jingwen Wang, Ruihui Zhao, Changyou Chen and Yefeng Zheng | N/A | N/A |
| Enhancing Content Preservation in Text Style Transfer Using Reverse Attention and Conditional Layer Normalization | Dongkyu Lee, Zhiliang Tian, Lanqing Xue and Nevin L. Zhang | N/A | N/A |
| ChineseBERT: Chinese Pretraining Enhanced by Glyph and Pinyin Information | Zijun Sun, Xiaoya Li, Xiaofei Sun, Yuxian Meng, Xiang Ao, Qing He, Fei Wu and Jiwei Li | N/A | N/A |
| MLBiNet: A Cross-Sentence Collective Event Detection Network | Dongfang Lou, Zhilin Liao, Shumin Deng, Ningyu Zhang and Huajun Chen | N/A | N/A |
| Self-Attention Networks Can Process Bounded Hierarchical Languages | Shunyu Yao, Binghui Peng, Christos Papadimitriou and Karthik Narasimhan | N/A | N/A |
| Data Augmentation for Text Generation Without Any Augmented Data | Wei Bi, Huayang Li and Jiacheng Huang | N/A | N/A |
| Are Pretrained Convolutions Better than Pretrained Transformers? | Yi Tay, Mostafa Dehghani, Jai Prakash Gupta, Vamsi Aribandi, Dara Bahri, Zhen Qin and Donald Metzler | N/A | N/A |
| PRGC: Potential Relation and Global Correspondence Based Joint Relational Triple Extraction | Hengyi Zheng, Rui Wen, Xi Chen, Yifan Yang, Yunyan Zhang, Ziheng Zhang, Ningyu Zhang, Bin Qin, Xu Ming and Yefeng Zheng | N/A | N/A |
| ILDC for CJPE: Indian Legal Documents Corpus for Court Judgment Prediction and Explanation | Vijit Malik, Rishabh Sanjay, Shubham Kumar Nigam, Kripabandhu Ghosh, Shouvik Kumar Guha, Arnab Bhattacharya and Ashutosh Modi | N/A | N/A |
| Common Sense Beyond English: Evaluating and Improving Multilingual Language Models for Commonsense Reasoning | Bill Yuchen Lin, Seyeon Lee, Xiaoyang Qiao and Xiang Ren | N/A | N/A |
| Lightweight Cross-Lingual Sentence Representation Learning | Zhuoyuan Mao, Prakhar Gupta, Chenhui Chu, Martin Jaggi and Sadao Kurohashi | N/A | N/A |
| HateCheck: Functional Tests for Hate Speech Detection Models | Paul Röttger, Bertie Vidgen, Dong Nguyen, Zeerak Waseem, Helen Margetts and Janet Pierrehumbert | N/A | N/A |
| Robustifying Multi-hop QA through Pseudo-Evidentiality Training | Kyungjae Lee, Seung-won Hwang, Sang-eun Han and Dohyeon Lee | N/A | N/A |
| ERNIE-Doc: A Retrospective Long-Document Modeling Transformer | SiYu Ding, Junyuan Shang, Shuohuan Wang, Yu Sun, Hao Tian, Hua Wu and Haifeng Wang | N/A | N/A |
| LeeBERT: Learned Early Exit for BERT with cross-level optimization | Wei Zhu | N/A | N/A |
| Probing Toxic Content in Large Pre-Trained Language Models | Nedjma Ousidhoum, Xinran Zhao, Tianqing Fang, Yangqiu Song and Dit-Yan Yeung | N/A | N/A |
| Align Voting Behavior with Public Statements for Legislator Representation Learning | Xinyi Mou, Zhongyu Wei, Lei Chen, Shangyi Ning, Yancheng He, Changjian Jiang and Xuanjing Huang | N/A | N/A |
| Neural-Symbolic Solver for Math Word Problems with Auxiliary Tasks | Jinghui Qin, Xiaodan Liang, Yining Hong, Jianheng Tang and Liang Lin | N/A | N/A |
| BanditMTL: Bandit-based Multi-task Learning for Text Classification | Yuren Mao, Zekai Wang, Weiwei Liu, Xuemin Lin and Wenbin Hu | N/A | N/A |
| Verb Metaphor Detection via Contextual Relation Learning | Wei Song, Shuhui Zhou, Ruiji Fu, Ting Liu and Lizhen Liu | N/A | N/A |
| Improving Named Entity Recognition by External Context Retrieving and Cooperative Learning | Xinyu Wang, Yong Jiang, Nguyen Bach, Tao Wang, Zhongqiang Huang, Fei Huang and Kewei Tu | N/A | N/A |
| On Finding the K-best Non-projective Dependency Trees | Ran Zmigrod, Tim Vieira and Ryan Cotterell | N/A | N/A |
| Towards Robustness of Text-to-SQL Models against Synonym Substitution | Yujian Gan, Xinyun Chen, Qiuping Huang, Matthew Purver, John R. Woodward, Jinxia Xie and Pengsheng Huang | N/A | N/A |
| Point, Disambiguate and Copy: Incorporating Bilingual Dictionaries for Neural Machine Translation | Tong Zhang, Long Zhang, Wei Ye, Bo Li, Jinan Sun, Xiaoyu Zhu, Wen Zhao and Shikun Zhang | N/A | N/A |
| ConSERT: A Contrastive Framework for Self-Supervised Sentence Representation Transfer | Yuanmeng Yan, Rumei Li, Sirui Wang, Fuzheng Zhang, Wei Wu and Weiran Xu | N/A | N/A |
| Structurizing Misinformation Stories via Rationalizing Fact-Checks | Shan Jiang and Christo Wilson | N/A | N/A |
| PairRE: Knowledge Graph Embeddings via Paired Relation Vectors | Linlin Chao, Jianshan He, Taifeng Wang and Wei Chu | N/A | N/A |
| Positional Artefacts Propagate Through Masked Language Model Embeddings | Ziyang Luo, Artur Kulmizev and Xiaoxi Mao | N/A | N/A |
| Exploring the Representation of Word Meanings in Context: A Case Study on Homonymy and Synonymy | Marcos Garcia | N/A | N/A |
| Leveraging Type Descriptions for Zero-shot Named Entity Recognition and Classification | Rami Aly, Andreas Vlachos and Ryan McDonald | N/A | N/A |
| Learning from Perturbations: Diverse and Informative Dialogue Generation with Inverse Adversarial Training | Wangchunshu Zhou, Qifei Li and Chenle Li | N/A | N/A |
| Exploring Dynamic Selection of Branch Expansion Orders for Code Generation | Hui Jiang, Chulun Zhou, Fandong Meng, Biao Zhang, Jie Zhou, Degen Huang, Qingqiang Wu and Jinsong Su | N/A | N/A |
| Learning Dense Representations of Phrases at Scale | Jinhyuk Lee, Mujeen Sung, Jaewoo Kang and Danqi Chen | N/A | N/A |
| Modeling Bilingual Conversational Characteristics for Neural Chat Translation | Yunlong Liang, Fandong Meng, Yufeng Chen, Jinan Xu and Jie Zhou | N/A | N/A |
| Revisiting the Negative Data of Distantly Supervised Relation Extraction | Chenhao Xie, Jiaqing Liang, Jingping Liu, Chengsong Huang, Wenhao Huang and Yanghua Xiao | N/A | N/A |
| Rational LAMOL: A Rationale-based Lifelong Learning Framework | Kasidis Kanwatchara, Thanapapas Horsuwan, Piyawat Lertvittayakumjorn, Boonserm Kijsirikul and Peerapon Vateekul | N/A | N/A |
| Learning from Miscellaneous Other-Class Words for Few-shot Named Entity Recognition | Meihan Tong, Shuai Wang, Bin Xu, Yixin Cao, Minghui Liu, Lei Hou and Juanzi Li | N/A | N/A |
| Knowing the No-match: Entity Alignment with Dangling Cases | Zequn Sun, Muhao Chen and Wei Hu | N/A | N/A |
| Including Signed Languages in Natural Language Processing | Kayo Yin, Amit Moryossef, Julie Hochgesang, Yoav Goldberg and Malihe Alikhani | N/A | N/A |
| Label-Specific Dual Graph Neural Network for Multi-Label Text Classification | Qianwen Ma, Chunyuan Yuan, Wei Zhou and Songlin Hu | N/A | N/A |
| Joint Biomedical Entity and Relation Extraction with Knowledge-Enhanced Collective Inference | Tuan Lai, Heng Ji, ChengXiang Zhai and Quan Hung Tran | N/A | N/A |
| Unified Interpretation of Softmax Cross-Entropy and Negative Sampling: With Case Study for Knowledge Graph Embedding | Hidetaka Kamigaito and Katsuhiko Hayashi | N/A | N/A |
| Argument Pair Extraction via Attention-guided Multi-Layer Multi-Cross Encoding | Liying Cheng, Tianyu Wu, Lidong Bing and Luo Si | N/A | N/A |
| Mitigating Bias in Session-based Cyberbullying Detection: A Non-Compromising Approach | Lu Cheng, Ahmadreza Mosallanezhad, Yasin Silva, Deborah Hall and Huan Liu | N/A | N/A |
| Novel Slot Detection: A Benchmark for Discovering Unknown Slot Types in the Task-Oriented Dialogue System | Yanan Wu, Zhiyuan Zeng, Keqing He, Hong Xu, Yuanmeng Yan, Huixing Jiang and Weiran Xu | N/A | N/A |
| Generation-Augmented Retrieval for Open-Domain Question Answering | Yuning Mao, Pengcheng He, Xiaodong Liu, Yelong Shen, Jianfeng Gao, Jiawei Han and Weizhu Chen | N/A | N/A |
| A Systematic Investigation of KB-Text Embedding Alignment at Scale | Vardaan Pahuja, Yu Gu, Wenhu Chen, Mehdi Bahrami, Lei Liu, Wei-Peng Chen and Yu Su | N/A | N/A |
| Compare to The Knowledge: Graph Neural Fake News Detection with External Knowledge | Linmei Hu, Tianchi Yang, Luhao Zhang, Wanjun Zhong, Duyu Tang, Chuan Shi, Nan Duan and Ming Zhou | N/A | N/A |
| Nested Named Entity Recognition via Explicitly Excluding the Influence of the Best Path | Yiran Wang, Hiroyuki Shindo, Yuji Matsumoto and Taro Watanabe | N/A | N/A |
| COINS: Dynamically Generating COntextualized Inference Rules for Narrative Story Completion | Debjit Paul and Anette Frank | N/A | N/A |
| Bridge-Based Active Domain Adaptation for Aspect Term Extraction | Zhuang Chen and Tieyun Qian | N/A | N/A |
| The Limitations of Limited Context for Constituency Parsing | Yuchen Li and Andrej Risteski | N/A | N/A |
| Dynamic Contextualized Word Embeddings | Valentin Hofmann, Janet Pierrehumbert and Hinrich Schütze | N/A | N/A |
| Superbizarre Is Not Superb: Derivational Morphology Improves BERT’s Interpretation of Complex Words | Valentin Hofmann, Janet Pierrehumbert and Hinrich Schütze | N/A | N/A |
| RADDLE: An Evaluation Benchmark and Analysis Platform for Robust Task-oriented Dialog Systems | Baolin Peng, Chunyuan Li, Zhu Zhang, Chenguang Zhu, Jinchao Li and Jianfeng Gao | N/A | N/A |
| How Knowledge Graph and Attention Help? A Qualitative Analysis into Bag-level Relation Extraction | Zikun Hu, Yixin Cao, Lifu Huang and Tat-Seng Chua | N/A | N/A |
| InfoSurgeon: Cross-Media Fine-grained Information Consistency Checking for Fake News Detection | Yi Fung, Christopher Thomas, Revanth Gangi Reddy, Sandeep Polisetty, Heng Ji, Shih-Fu Chang, Kathleen McKeown, Mohit Bansal and Avi Sil | N/A | N/A |
| Bird’s Eye: Probing for Linguistic Graph Structures with a Simple Information-Theoretic Approach | Yifan Hou and Mrinmaya Sachan | N/A | N/A |
| A DQN-based Approach to Finding Precise Evidences for Fact Verification | Hai Wan, Haicheng Chen, Jianfeng Du, Weilin Luo and Rongzhen Ye | N/A | N/A |
| Topic-Driven and Knowledge-Aware Transformer for Dialogue Emotion Detection | Lixing Zhu, Gabriele Pergola, Lin Gui, Deyu Zhou and Yulan He | N/A | N/A |
| RAW-C: Relatedness of Ambiguous Words in Context (A New Lexical Resource for English) | Sean Trott and Benjamin Bergen | N/A | N/A |
| Intrinsic Dimensionality Explains the Effectiveness of Language Model Fine-Tuning | Armen Aghajanyan, Sonal Gupta and Luke Zettlemoyer | N/A | N/A |
| Optimizing Deeper Transformers on Small Datasets | Peng Xu, Dhruv Kumar, Wei Yang, Wenjie Zi, Keyi Tang, Chenyang Huang, Jackie Chi Kit Cheung, Simon J.D. Prince and Yanshuai Cao | N/A | N/A |
| Unsupervised Out-of-Domain Detection via Pre-trained Transformers | Keyang Xu, Tongzheng Ren, Shikun Zhang, Yihao Feng and Caiming Xiong | N/A | N/A |
| Learning to Ask Conversational Questions by Optimizing Levenshtein Distance | Zhongkun Liu, Pengjie Ren, Zhumin Chen, Zhaochun Ren, Maarten De Rijke and Ming Zhou | N/A | N/A |
| Multi-stage Pre-training over Simplified Multimodal Pre-training Models | Tongtong Liu, Fangxiang Feng and Xiaojie Wang | N/A | N/A |
| When Do You Need Billions of Words of Pretraining Data? | Yian Zhang, Alex Warstadt, Xiaocheng Li and Samuel R. Bowman | N/A | N/A |
| Bilingual Lexicon Induction via Unsupervised Bitext Construction and Word Alignment | Haoyue Shi, Luke Zettlemoyer and Sida I. Wang | N/A | N/A |
| xMoCo: Cross Momentum Contrastive Learning for Open-Domain Question Answering | Nan Yang, Furu Wei, Binxing Jiao, Daxing Jiang and Linjun Yang | N/A | N/A |
| Importance-based Neuron Allocation for Multilingual Neural Machine Translation | Wanying Xie, Yang Feng, Shuhao Gu and Dong Yu | N/A | N/A |
| DVD: A Diagnostic Dataset for Multi-step Reasoning in Video Grounded Dialogue | Hung Le, Chinnadhurai Sankar, Seungwhan Moon, Ahmad Beirami, Alborz Geramifard and Satwik Kottur | N/A | N/A |
| CTFN: Hierarchical Learning for Multimodal Sentiment Analysis Using Coupled-Translation Fusion Network | Jiajia Tang, Kang Li, Xuanyu Jin, Andrzej Cichocki, Qibin Zhao and Wanzeng Kong | N/A | N/A |
| Knowledgeable or Educated Guess? Revisiting Language Models as Knowledge Bases | Boxi Cao, Hongyu Lin, Xianpei Han, Le Sun, Lingyong Yan, Meng Liao, Tong Xue and Jin Xu | N/A | N/A |
| End-to-End AMR Coreference Resolution | Qiankun Fu, Linfeng Song, Wenyu Du and Yue Zhang | N/A | N/A |
| Neural Machine Translation with Monolingual Translation Memory | Deng Cai, Yan Wang, Huayang Li, Wai Lam and Lemao Liu | N/A | N/A |
| Can Sequence-to-Sequence Models Crack Substitution Ciphers? | Nada Aldarrab and Jonathan May | N/A | N/A |
| From Discourse to Narrative: Knowledge Projection for Event Relation Extraction | Jialong Tang, Hongyu Lin, Meng Liao, Yaojie Lu, Xianpei Han, Le Sun, Weijian Xie and Jin Xu | N/A | N/A |
| EarlyBERT: Efficient BERT Training via Early-bird Lottery Tickets | Xiaohan Chen, Yu Cheng, Shuohang Wang, Zhe Gan, Zhangyang Wang and Jingjing Liu | N/A | N/A |
| VoxPopuli: A Large-Scale Multilingual Speech Corpus for Representation Learning, Semi-Supervised Learning and Interpretation | Changhan Wang, Morgane Riviere, Ann Lee, Anne Wu, Chaitanya Talnikar, Daniel Haziza, Mary Williamson, Juan Pino and Emmanuel Dupoux | N/A | N/A |
| Fine-grained Information Extraction from Biomedical Literature based on Knowledge-enriched Abstract Meaning Representation | Zixuan Zhang, Nikolaus Parulian, Heng Ji, Ahmed Elsayed, Skatje Myers and Martha Palmer | N/A | N/A |
| SMedBERT: A Knowledge-Enhanced Pre-trained Language Model with Structured Semantics for Medical Text Mining | Taolin Zhang, Zerui Cai, Chengyu Wang, Minghui Qiu, Bite Yang and Xiaofeng He | N/A | N/A |
| Structural Guidance for Transformer Language Models | Peng Qian, Tahira Naseem, Roger Levy and Ramón Fernandez Astudillo | N/A | N/A |
| COVID-Fact: Fact Extraction and Verification of Real-World Claims on COVID-19 Pandemic | Arkadiy Saakyan, Tuhin Chakrabarty and Smaranda Muresan | N/A | N/A |
| Robustness Testing of Language Understanding in Task-Oriented Dialog | Jiexi Liu, Ryuichi Takanobu, Jiaxin Wen, Dazhen Wan, Hongguang Li, Weiran Nie, Cheng Li, Wei Peng and Minlie Huang | N/A | N/A |
| TAT-QA: A Question Answering Benchmark on a Hybrid of Tabular and Textual Content in Finance | Fengbin Zhu, Wenqiang Lei, Youcheng Huang, Chao Wang, Shuo Zhang, Jiancheng Lv, Fuli Feng and Tat-Seng Chua | N/A | N/A |
| Societal Biases in Language Generation: Progress and Challenges | Emily Sheng, Kai-Wei Chang, Prem Natarajan and Nanyun Peng | N/A | N/A |
| Weight Distillation: Transferring the Knowledge in Neural Network Parameters | Ye Lin, Yanyang Li, Ziyang Wang, Bei Li, Quan Du, Tong Xiao and Jingbo Zhu | N/A | N/A |
| UNIMO: Towards Unified-Modal Understanding and Generation via Cross-Modal Contrastive Learning | Wei Li, Can Gao, Guocheng Niu, Xinyan Xiao, Hao Liu, Jiachen Liu, Hua Wu and Haifeng Wang | N/A | N/A |
| What is Your Article Based On? Inferring Fine-grained Provenance | Yi Zhang, Zachary Ives and Dan Roth | N/A | N/A |
| MECT: Multi-Metadata Embedding based Cross-Transformer for Chinese Named Entity Recognition | Shuang Wu, Xiaoning Song and Zhenhua Feng | N/A | N/A |
| VisualSparta: An Embarrassingly Simple Approach to Large-scale Text-to-Image Search with Weighted Bag-of-words | Xiaopeng Lu, Tiancheng Zhao and Kyusong Lee | N/A | N/A |
| Are Missing Links Predictable? An Inferential Benchmark for Knowledge Graph Completion | Yixin Cao, Xiang Ji, Xin Lv, Juanzi Li, Yonggang Wen and Hanwang Zhang | N/A | N/A |
| Unleash GPT-2 Power for Event Detection | Amir Pouran Ben Veyseh, Viet Lai, Franck Dernoncourt and Thien Huu Nguyen | N/A | N/A |
| G-Transformer for Document-Level Machine Translation | Guangsheng Bao, Yue Zhang, Zhiyang Teng, Boxing Chen and Weihua Luo | N/A | N/A |
| Prevent the Language Model from being Overconfident in Neural Machine Translation | Mengqi Miao, Fandong Meng, Yijin Liu, Xiao-Hua Zhou and Jie Zhou | N/A | N/A |
| Stacked Acoustic-and-Textual Encoding: Integrating the Pre-trained Models into Speech Translation Encoders | Chen Xu, Bojie Hu, Yanyang Li, Yuhao Zhang, Shen Huang, Qi Ju, Tong Xiao and Jingbo Zhu | N/A | N/A |
| Energy-Based Reranking: Improving Neural Machine Translation Using Energy-Based Models | Sumanta Bhattacharyya, Amirmohammad Rooshenas, Subhajit Naskar, Simeng Sun, Mohit Iyyer and Andrew McCallum | N/A | N/A |
| PENS: A Dataset and Generic Framework for Personalized News Headline Generation | Xiang Ao, Xiting Wang, Ling Luo, Ying Qiao, Qing He and Xing Xie | N/A | N/A |
| Cross-modal Memory Networks for Radiology Report Generation | Zhihong Chen, Yaling Shen, Yan Song and Xiang Wan | N/A | N/A |
| On Compositional Generalization of Neural Machine Translation | Yafu Li, Yongjing Yin, Yulong Chen and Yue Zhang | N/A | N/A |
| A Novel Estimator of Mutual Information for Learning to Disentangle Textual Representations | Pierre Colombo, Pablo Piantanida and Chloé Clavel | N/A | N/A |
| Multimodal Sentiment Detection Based on Multi-channel Graph Neural Networks | Xiaocui Yang, Shi Feng, Yifei Zhang and Daling Wang | N/A | N/A |
| MPC-BERT: A Pre-Trained Language Model for Multi-Party Conversation Understanding | Jia-Chen Gu, Chongyang Tao, Zhenhua Ling, Can Xu, Xiubo Geng and Daxin Jiang | N/A | N/A |
| Mask-Align: Self-Supervised Neural Word Alignment | Chi Chen, Maosong Sun and Yang Liu | N/A | N/A |
| Reasoning over Entity-Action-Location Graph for Procedural Text Understanding | Hao Huang, Xiubo Geng, Jian Pei, Guodong Long and Daxin Jiang | N/A | N/A |
| From Paraphrasing to Semantic Parsing: Unsupervised Semantic Parsing via Synchronous Semantic Decoding | Shan Wu, Bo Chen, Chunlei Xin, Xianpei Han, Le Sun, Weipeng Zhang, Jiansong Chen, Fan Yang and Xunliang Cai | N/A | N/A |
| GWLAN: General Word-Level AutocompletioN for Computer-Aided Translation | Huayang Li, Lemao Liu, Guoping Huang and Shuming Shi | N/A | N/A |
| CogAlign: Learning to Align Textual Neural Representations to Cognitive Language Processing Signals | Yuqi Ren and Deyi Xiong | N/A | N/A |
| Distributed Representations of Emotion Categories in Emotion Space | Xiangyu Wang and Chengqing Zong | N/A | N/A |
| Transfer Learning for Sequence Generation: from Single-source to Multi-source | Xuancheng Huang, Jingfang Xu, Maosong Sun and Yang Liu | N/A | N/A |
| HieRec: Hierarchical User Interest Modeling for Personalized News Recommendation | Tao Qi, Fangzhao Wu, Chuhan Wu, Peiru Yang, Yang Yu, Xing Xie and Yongfeng Huang | N/A | N/A |
| Trigger is Not Sufficient: Exploiting Frame-aware Knowledge for Implicit Event Argument Extraction | Kaiwen Wei, Xian Sun, Zequn Zhang, Jingyuan Zhang, Guo Zhi and Li Jin | N/A | N/A |
| Can vectors read minds better than experts? Comparing data augmentation strategies for the automated scoring of children’s mindreading ability | Venelin Kovatchev, Phillip Smith, Mark Lee and Rory Devine | N/A | N/A |
| Multi-hop Graph Convolutional Network with High-order Chebyshev Approximation for Text Reasoning | Shuoran Jiang, Qingcai Chen, Xin Liu, Baotian Hu and Lisai Zhang | N/A | N/A |
| Conversations Are Not Flat: Modeling the Dynamic Information Flow across Dialogue Utterances | Zekang Li, Jinchao Zhang, Zhengcong Fei, Yang Feng and Jie Zhou | N/A | N/A |
| Determinantal Beam Search | Clara Meister, Martina Forster and Ryan Cotterell | N/A | N/A |
| Language Model Evaluation Beyond Perplexity | Clara Meister and Ryan Cotterell | N/A | N/A |
| Self-Guided Contrastive Learning for BERT Sentence Representations | Taeuk Kim, Kang Min Yoo and Sang-goo Lee | N/A | N/A |
| A Dataset and Baselines for Multilingual Reply Suggestion | Mozhi Zhang, Wei Wang, Budhaditya Deb, Guoqing Zheng, Milad Shokouhi and Ahmed Hassan Awadallah | N/A | N/A |
| Element Intervention for Open Relation Extraction | Fangchao Liu, Lingyong Yan, Hongyu Lin, Xianpei Han and Le Sun | N/A | N/A |
| BERTGen: Multi-task Generation through BERT | Faidon Mitzalis, Ozan Caglayan, Pranava Madhyastha and Lucia Specia | N/A | N/A |
| Semantic Representation for Dialogue Modeling | Xuefeng Bai, Yulong Chen, Linfeng Song and Yue Zhang | N/A | N/A |
| Selective Knowledge Distillation for Neural Machine Translation | Fusheng Wang, Jianhao Yan, Fandong Meng and Jie Zhou | N/A | N/A |
| Lexical Semantic Change Discovery | Sinan Kurtyigit, Maike Park, Dominik Schlechtweg, Jonas Kuhn and Sabine Schulte Im Walde | N/A | N/A |
| Text2Event: Controllable Sequence-to-Structure Generation for End-to-end Event Extraction | Yaojie Lu, Hongyu Lin, Jin Xu, Xianpei Han, Jialong Tang, Annan Li, Le Sun, Meng Liao and Shaoyi Chen | N/A | N/A |
| Mid-Air Hand Gestures for Post-Editing of Machine Translation | Rashad Albo Jamara, Nico Herbig, Antonio Krüger and Josef Van Genabith | N/A | N/A |
| Good for Misconceived Reasons: An Empirical Revisiting on the Need for Visual Context in Multimodal Machine Translation | Zhiyong Wu, Lingpeng Kong, Wei Bi, Xiang Li and Ben Kao | N/A | N/A |
| Pre-training Universal Language Representation | Yian Li and Hai Zhao | N/A | N/A |
| De-Confounded Variational Encoder-Decoder for Logical Table-to-Text Generation | Wenqing Chen, Jidong Tian, Yitian Li, Hao He and Yaohui Jin | N/A | N/A |
| Learning from the Worst: Dynamically Generated Datasets to Improve Online Hate Detection | Bertie Vidgen, Tristan Thrush, Zeerak Waseem and Douwe Kiela | N/A | N/A |
| Towards User-Driven Neural Machine Translation | Huan Lin, Liang Yao, Baosong Yang, Dayiheng Liu, Haibo Zhang, Weihua Luo, Degen Huang and Jinsong Su | N/A | N/A |
| Explaining Relationships Between Scientific Documents | Kelvin Luu, Xinyi Wu, Rik Koncel-Kedziorski, Kyle Lo, Isabel Cachola and Noah A. Smith | N/A | N/A |
| XLPT-AMR: Cross-Lingual Pre-Training via Multi-Task Learning for Zero-Shot AMR Parsing and Text Generation | Dongqin Xu, Junhui Li, Muhua Zhu, Min Zhang and Guodong Zhou | N/A | N/A |
| Tree-Structured Topic Modeling with Nonparametric Neural Variational Inference | Ziye Chen, Cheng Ding, Zusheng Zhang, Yanghui Rao and Haoran Xie | N/A | N/A |
| CLEVE: Contrastive Pre-training for Event Extraction | Ziqi Wang, Xiaozhi Wang, Xu Han, Yankai Lin, Lei Hou, Zhiyuan Liu, Peng Li, Juanzi Li and Jie Zhou | N/A | N/A |
| TWAG: A Topic-Guided Wikipedia Abstract Generator | Fangwei Zhu, Shangqing Tu, Jiaxin Shi, Juanzi Li, Lei Hou and Tong Cui | N/A | N/A |
| Towards Emotional Support Dialog Systems | Siyang Liu, Chujie Zheng, Orianna Demasi, Sahand Sabour, Yu Li, Zhou Yu, Yong Jiang and Minlie Huang | N/A | N/A |
| Contrastive Learning for Many-to-many Multilingual Neural Machine Translation | Xiao Pan, Mingxuan Wang, Liwei Wu and Lei Li | N/A | N/A |
| A Semantic-based Method for Unsupervised Commonsense Question Answering | Yilin Niu, Fei Huang, Jiaming Liang, Wenkai Chen, Xiaoyan Zhu and Minlie Huang | N/A | N/A |
| Rethinking Stealthiness of Backdoor Attack against NLP Models | Wenkai Yang, Yankai Lin, Peng Li, Jie Zhou and Xu Sun | N/A | N/A |
| A Neural Model for Joint Document and Snippet Ranking in Question Answering for Large Document Collections | Dimitris Pappas and Ion Androutsopoulos | N/A | N/A |
| Cascaded Head-colliding Attention | Lin Zheng, Zhiyong Wu and Lingpeng Kong | N/A | N/A |
| DialogueCRN: Contextual Reasoning Networks for Emotion Recognition in Conversations | Dou Hu, Lingwei Wei and Xiaoyong Huai | N/A | N/A |
| Challenges in Information-Seeking QA: Unanswerable Questions and Paragraph Retrieval | Akari Asai and Eunsol Choi | N/A | N/A |
| An In-depth Study on Internal Structure of Chinese Words | Chen Gong, Saihao Huang, Houquan Zhou, Zhenghua Li, Min Zhang, Zhefeng Wang, Baoxing Huai and Nicholas Jing Yuan | N/A | N/A |
| Understanding the Properties of Minimum Bayes Risk Decoding in Neural Machine Translation | Mathias Müller and Rico Sennrich | N/A | N/A |
| Language Model as an Annotator: Exploring DialoGPT for Dialogue Summarization | Xiachong Feng, Xiaocheng Feng, Libo Qin, Bing Qin and Ting Liu | N/A | N/A |
| Span-based Semantic Parsing for Compositional Generalization | Jonathan Herzig and Jonathan Berant | N/A | N/A |
| BASS: Boosting Abstractive Summarization with Unified Semantic Graph | Wenhao Wu, Wei Li, Xinyan Xiao, Jiachen Liu, Ziqiang Cao, Sujian Li, Hua Wu and Haifeng Wang | N/A | N/A |
| Missing Modality Imagination Network for Emotion Recognition with Uncertain Missing Modalities | Jinming Zhao, Ruichen Li and Qin Jin | N/A | N/A |
| Controversy and Conformity: from Generalized to Personalized Aggressiveness Detection | Kamil Kanclerz, Alicja Figas, Marcin Gruza, Tomasz Kajdanowicz, Jan Kocon, Daria Puchalska and Przemyslaw Kazienko | N/A | N/A |
| A Neural Transition-based Model for Argumentation Mining | Jianzhu Bao, Chuang Fan, Jipeng Wu, Yixue Dang, Jiachen Du and Ruifeng Xu | N/A | N/A |
| MulDA: A Multilingual Data Augmentation Framework for Low-Resource Cross-Lingual NER | Linlin Liu, Bosheng Ding, Lidong Bing, Shafiq Joty, Luo Si and Chunyan Miao | N/A | N/A |
| Discovering Dialogue Slots with Weak Supervision | Vojtěch Hudeček, Ondřej Dušek and Zhou Yu | N/A | N/A |
| Self-Training Sampling with Monolingual Data Uncertainty for Neural Machine Translation | Wenxiang Jiao, Xing Wang, Zhaopeng Tu, Shuming Shi, Michael Lyu and Irwin King | N/A | N/A |
| Parameter-efficient Multi-task Fine-tuning for Transformers via Shared Hypernetworks | Rabeeh Karimi Mahabadi, Sebastian Ruder, Mostafa Dehghani and James Henderson | N/A | N/A |
| Capturing Relations between Scientific Papers: An Abstractive Model for Related Work Section Generation | Xiuying Chen, Hind Alamro, Mingzhe Li, Shen Gao, Xiangliang Zhang, Dongyan Zhao and Rui Yan | N/A | N/A |
| Attention Calibration for Transformer in Neural Machine Translation | Yu Lu, Jiali Zeng, Jiajun Zhang, Shuangzhi Wu and Mu Li | N/A | N/A |
| Analyzing the Source and Target Contributions to Predictions in Neural Machine Translation | Elena Voita, Rico Sennrich and Ivan Titov | N/A | N/A |
| Accelerating BERT Inference for Sequence Labeling via Early-Exit | Xiaonan Li, Yunfan Shao, Tianxiang Sun, Hang Yan, Xipeng Qiu and Xuanjing Huang | N/A | N/A |
| Focus Attention: Promoting Faithfulness and Diversity in Summarization | Rahul Aralikatte, Shashi Narayan, Joshua Maynez, Sascha Rothe and Ryan McDonald | N/A | N/A |
| De-biasing Distantly Supervised Named Entity Recognition via Causal Intervention | Wenkai Zhang, Hongyu Lin, Xianpei Han and Le Sun | N/A | N/A |
| Beyond Sentence-Level End-to-End Speech Translation: Context Helps | Biao Zhang, Ivan Titov, Barry Haddow and Rico Sennrich | N/A | N/A |
| Every Bite Is an Experience: Key Point Analysis of Business Reviews | Roy Bar-Haim, Lilach Eden, Yoav Kantor, Roni Friedman and Noam Slonim | N/A | N/A |
| POS-Constrained Parallel Decoding for Non-autoregressive Generation | Kexin Yang, Wenqiang Lei, Dayiheng Liu, Weizhen Qi and Jiancheng Lv | N/A | N/A |
| Structural Pre-training for Dialogue Comprehension | Zhuosheng Zhang and Hai Zhao | N/A | N/A |
| Learning Language Specific Sub-network for Multilingual Machine Translation | Zehui Lin, Liwei Wu, Mingxuan Wang and Lei Li | N/A | N/A |
| AutoTinyBERT: Automatic Hyper-parameter Optimization for Efficient Pre-trained Language Models | Yichun Yin, Cheng Chen, Lifeng Shang, Xin Jiang, Xiao Chen and Qun Liu | N/A | N/A |
| PP-Rec: News Recommendation with Personalized User Interest and Time-aware News Popularity | Tao Qi, Fangzhao Wu, Chuhan Wu and Yongfeng Huang | N/A | N/A |
| Ultra-Fine Entity Typing with Weak Supervision from a Masked Language Model | Hongliang Dai, Yangqiu Song and Haixun Wang | N/A | N/A |
| Investigating label suggestions for opinion mining in German Covid-19 social media | Tilman Beck, Ji-Ung Lee, Christina Viehmann, Marcus Maurer, Oliver Quiring and Iryna Gurevych | N/A | N/A |
| Bridging Subword Gaps in Pretrain-Finetune Paradigm for Natural Language Generation | Xin Liu, Baosong Yang, Dayiheng Liu, Haibo Zhang, Weihua Luo, Min Zhang, Haiying Zhang and Jinsong Su | N/A | N/A |
| How Did This Get Funded?! Automatically Identifying Quirky Scientific Achievements | Chen Shani, Nadav Borenstein and Dafna Shahaf | N/A | N/A |
| Deep Differential Amplifier for Extractive Summarization | Ruipeng Jia, Yanan Cao, Fang Fang, Yuchen Zhou, Zheng Fang, Yanbing Liu and Shi Wang | N/A | N/A |
| AggGen: Ordering and Aggregating while Generating | Xinnuo Xu, Ondřej Dušek, Verena Rieser and Ioannis Konstas | N/A | N/A |
| Lexicon Enhanced Chinese Sequence Labeling Using BERT Adapter | Wei Liu, Xiyan Fu, Yue Zhang and Wenming Xiao | N/A | N/A |
| Multi-perspective Coherent Reasoning for Helpfulness Prediction of Multimodal Reviews | Junhao Liu, Zhen Hai, Min Yang and Lidong Bing | N/A | N/A |
| AugNLG: Few-shot Natural Language Generation using Self-trained Data Augmentation | Xinnuo Xu, Guoyin Wang, Young-Bum Kim and Sungjin Lee | N/A | N/A |
| Metaphor Generation with Conceptual Mappings | Kevin Stowe, Tuhin Chakrabarty, Nanyun Peng, Smaranda Muresan and Iryna Gurevych | N/A | N/A |
| Attend What You Need: Motion-Appearance Synergistic Networks for Video Question Answering | Ahjeong Seo, Gi-Cheon Kang, Joonhan Park and Byoung-Tak Zhang | N/A | N/A |
| Learning Syntactic Dense Embedding with Correlation Graph for Automatic Readability Assessment | Xinying Qiu, Yuan Chen, Hanwu Chen, Jian-Yun Nie, Yuming Shen and Dawei Lu | N/A | N/A |
| Turn the Combination Lock: Learnable Textual Backdoor Attacks via Word Substitution | Fanchao Qi, Yuan Yao, Sophia Xu, Zhiyuan Liu and Maosong Sun | N/A | N/A |
| NeuralWOZ: Learning to Collect Task-Oriented Dialogue via Model-Based Simulation | Sungdong Kim, Minsuk Chang and Sang-Woo Lee | N/A | N/A |
| A Unified Generative Framework for Aspect-based Sentiment Analysis | Hang Yan, Junqi Dai, Tuo Ji, Xipeng Qiu and Zheng Zhang | N/A | N/A |
| Hierarchy-aware Label Semantics Matching Network for Hierarchical Text Classification | Haibin Chen, Qianli Ma, Zhenxi Lin and Jiangyue Yan | N/A | N/A |
| UXLA: A Robust Unsupervised Data Augmentation Framework for Cross-Lingual NLP | M Saiful Bari, Tasnim Mohiuddin and Shafiq Joty | N/A | N/A |
| Space Efficient Context Encoding for Non-Task-Oriented Dialogue Generation with Graph Attention Transformer | Fabian Galetzka, Jewgeni Rose, David Schlangen and Jens Lehmann | N/A | N/A |
| Integrated Directional Gradients: Feature Interaction Attribution for Neural NLP Models | Sandipan Sikdar, Parantapa Bhattacharya and Kieran Heese | N/A | N/A |
| Engage the Public: Poll Question Generation for Social Media Posts | Zexin Lu, Keyang Ding, Yuji Zhang, Jing Li, Baolin Peng and Lemao Liu | N/A | N/A |
| Generating Query Focused Summaries from Query-Free Resources | Yumo Xu and Mirella Lapata | N/A | N/A |
| Hate Speech Detection Based on Sentiment Knowledge Sharing | Xianbing Zhou, Yang Yong, Xiaochao Fan, Ge Ren, Yunfeng Song, Yufeng Diao, Liang Yang and Hongfei Lin | N/A | N/A |
| UniRE: A Unified Label Space for Entity Relation Extraction | Yijun Wang, Changzhi Sun, Yuanbin Wu, Hao Zhou, Lei Li and Junchi Yan | N/A | N/A |
| Question Answering Over Temporal Knowledge Graphs | Apoorv Saxena, Soumen Chakrabarti and Partha Talukdar | N/A | N/A |
| MMGCN: Multimodal Fusion via Deep Graph Convolution Network for Emotion Recognition in Conversation | Jingwen Hu, Yuchen Liu, Jinming Zhao and Qin Jin | N/A | N/A |
| Introducing Orthogonal Constraint in Structural Probes | Tomasz Limisiewicz and David Mareček | N/A | N/A |
| PlotCoder: Hierarchical Decoding for Synthesizing Visualization Code in Programmatic Context | Xinyun Chen, Linyuan Gong, Alvin Cheung and Dawn Song | N/A | N/A |
| Can Generative Pre-trained Language Models Serve As Knowledge Bases for Closed-book QA? | Cunxiang Wang, Pai Liu and Yue Zhang | N/A | N/A |
| Poisoning Knowledge Graph Embeddings via Relation Inference Patterns | Peru Bhardwaj, John Kelleher, Luca Costabello and Declan O’Sullivan | N/A | N/A |
| On the Effectiveness of Adapter-based Tuning for Pretrained Language Model Adaptation | Ruidan He, Linlin Liu, Hai Ye, Qingyu Tan, Bosheng Ding, Liying Cheng, Jiawei Low, Lidong Bing and Luo Si | N/A | N/A |
| Multi-View Cross-Lingual Structured Prediction with Minimum Supervision | Zechuan Hu, Yong Jiang, Nguyen Bach, Tao Wang, Zhongqiang Huang, Fei Huang and Kewei Tu | N/A | N/A |
| Exploring the Efficacy of Automatically Generated Counterfactuals for Sentiment Analysis | Linyi Yang, Jiazheng Li, Padraig Cunningham, Yue Zhang, Barry Smyth and Ruihai Dong | N/A | N/A |
| Detecting Propaganda Techniques in Memes | Dimitar Dimitrov, Bishr Bin Ali, Shaden Shaar, Firoj Alam, Fabrizio Silvestri, Hamed Firooz, Preslav Nakov and Giovanni Da San Martino | N/A | N/A |
| Adapting Unsupervised Syntactic Parsing Methodology for Discourse Dependency Parsing | Liwen Zhang, Ge Wang, Wenjuan Han and Kewei Tu | N/A | N/A |
| A Closer Look at Few-Shot Crosslingual Transfer: The Choice of Shots Matters | Mengjie Zhao, Yi Zhu, Ehsan Shareghi, Ivan Vulić, Roi Reichart, Anna Korhonen and Hinrich Schütze | N/A | N/A |
| Towards Argument Mining for Social Good: A Survey | Eva Maria Vecchi, Neele Falk, Iman Jundi and Gabriella Lapesa | N/A | N/A |
| What Ingredients Make for an Effective Crowdsourcing Protocol for Difficult NLU Data Collection Tasks? | Nikita Nangia, Saku Sugawara, Harsh Trivedi, Alex Warstadt, Clara Vania and Samuel R. Bowman | N/A | N/A |
| Cross-language Sentence Selection via Data Augmentation and Rationale Training | Yanda Chen, Chris Kedzie, Suraj Nair, Petra Galuscakova, Rui Zhang, Douglas Oard and Kathleen McKeown | N/A | N/A |
| RedditBias: A Real-World Resource for Bias Evaluation and Debiasing of Conversational Language Models | Soumya Barikeri, Anne Lauscher, Ivan Vulić and Goran Glavaš | N/A | N/A |
| Transferable Dialogue Systems and User Simulators | Bo-Hsiang Tseng, Yinpei Dai, Florian Kreyssig and Bill Byrne | N/A | N/A |
| PASS: Perturb-and-Select Summarizer for Product Reviews | Nadav Oved and Ran Levy | N/A | N/A |
| Comparing Test Sets with Item Response Theory | Clara Vania, Phu Mon Htut, William Huang, Dhara Mungra, Richard Yuanzhe Pang, Jason Phang, Haokun Liu, Kyunghyun Cho and Samuel R. Bowman | N/A | N/A |
| A Pre-training Strategy for Zero-Resource Response Selection in Knowledge-Grounded Conversations | Chongyang Tao, Changyu Chen, Jiazhan Feng, Ji-Rong Wen and Rui Yan | N/A | N/A |
| Cascade versus Direct Speech Translation: Do the Differences Still Make a Difference? | Luisa Bentivogli, Mauro Cettolo, Marco Gaido, Alina Karakanta, Alberto Martinelli, Matteo Negri and Marco Turchi | N/A | N/A |
| Article Reranking by Memory-Enhanced Key Sentence Matching for Detecting Previously Fact-Checked Claims | Qiang Sheng, Juan Cao, Xueyao Zhang, Xirong Li and Lei Zhong | N/A | N/A |
| Unsupervised Neural Machine Translation for Low-Resource Domains via Meta-Learning | Cheonbok Park, Yunwon Tae, TaeHee Kim, Soyoung Yang, Mohammad Azam Khan, Lucy Park and Jaegul Choo | N/A | N/A |
| Taming Pre-trained Language Models with N-gram Representations for Low-Resource Domain Adaptation | Shizhe Diao, Ruijia Xu, Hongjin Su, Yilei Jiang, Yan Song and Tong Zhang | N/A | N/A |
| Crowdsourcing Learning as Domain Adaptation: A Case Study on Named Entity Recognition | Xin Zhang, Guangwei Xu, Yueheng Sun, Meishan Zhang and Pengjun Xie | N/A | N/A |
| ExCAR: Event Graph Knowledge Enhanced Explainable Causal Reasoning | Li Du, Xiao Ding, Kai Xiong, Ting Liu and Bing Qin | N/A | N/A |
| CLIP: A Dataset for Extracting Action Items for Physicians from Hospital Discharge Notes | James Mullenbach, Yada Pruksachatkun, Sean Adler, Jennifer Seale, Jordan Swartz, Greg McKelvey, Hui Dai, Yi Yang and David Sontag | N/A | N/A |
| Assessing Emoji Use in Modern Text Processing Tools | Abu Awal Md Shoeb and Gerard De Melo | N/A | N/A |
| Marginal Utility Diminishes: Exploring the Minimum Knowledge for BERT Knowledge Distillation | Yuanxin Liu, Fandong Meng, Zheng Lin, Weiping Wang and Jie Zhou | N/A | N/A |
| Measuring and Increasing Context Usage in Context-Aware Machine Translation | Patrick Fernandes, Kayo Yin, Graham Neubig and André F. T. Martins | N/A | N/A |
| Online Learning Meets Machine Translation Evaluation: Finding the Best Systems with the Least Human Effort | Vânia Mendonça, Ricardo Rei, Luisa Coheur, Alberto Sardinha and Ana Lúcia Santos | N/A | N/A |
| Chase: A Large-Scale and Pragmatic Chinese Dataset for Cross-Database Context-Dependent Text-to-SQL | Jiaqi Guo, Ziliang Si, Yu Wang, Qian Liu, Ming Fan, Jian-Guang Lou, Zijiang Yang and Ting Liu | N/A | N/A |
| Bad Seeds: Evaluating Lexical Methods for Bias Measurement | Maria Antoniak and David Mimno | N/A | N/A |
| Explaining Contextualization in Language Models using Visual Analytics | Rita Sevastjanova, Aikaterini-Lida Kalouli, Christin Beck, Hanna Schäfer and Mennatallah El-Assady | N/A | N/A |
| Evaluating morphological typology in zero-shot cross-lingual transfer | Antonio Martínez-García, Toni Badia and Jeremy Barnes | N/A | N/A |
| Improving Dialog Systems for Negotiation with Personality Modeling | Runzhe Yang, Jingxiao Chen and Karthik Narasimhan | N/A | N/A |
| Lower Perplexity is Not Always Human-Like | Tatsuki Kuribayashi, Yohei Oseki, Takumi Ito, Ryo Yoshida, Masayuki Asahara and Kentaro Inui | N/A | N/A |
| Understanding and Countering Stereotypes: A Computational Approach to the Stereotype Content Model | Kathleen C. Fraser, Isar Nejadgholi and Svetlana Kiritchenko | N/A | N/A |
| Obtaining Better Static Word Embeddings Using Contextual Embedding Models | Prakhar Gupta and Martin Jaggi | N/A | N/A |
| What Context Features Can Transformer Language Models Use? | Joe O’Connor and Jacob Andreas | N/A | N/A |
| From Machine Translation to Code-Switching: Generating High-Quality Code-Switched Text | Ishan Tarunesh, Syamantak Kumar and Preethi Jyothi | N/A | N/A |
| Best of Both Worlds: Making High Accuracy Non-incremental Transformer-based Disfluency Detection Incremental | Morteza Rohanian and Julian Hough | N/A | N/A |
| Reflective Decoding: Beyond Unidirectional Generation with Off-the-Shelf Language Models | Peter West, Ximing Lu, Ari Holtzman, Chandra Bhagavatula, Jena D. Hwang and Yejin Choi | N/A | N/A |
| Early Detection of Sexual Predators in Chats | Matthias Vogt, Ulf Leser and Alan Akbik | N/A | N/A |
| StereoRel: Relational Triple Extraction from a Stereoscopic Perspective | Xuetao Tian, Liping Jing, Lu He and Feng Liu | N/A | N/A |
| Do Context-Aware Translation Models Pay the Right Attention? | Kayo Yin, Patrick Fernandes, Danish Pruthi, Aditi Chaudhary, André F. T. Martins and Graham Neubig | N/A | N/A |
| Transition-based Bubble Parsing: Improvements on Coordination Structure Prediction | Tianze Shi and Lillian Lee | N/A | N/A |
| UnitedQA: A Hybrid Approach for Open Domain Question Answering | Hao Cheng, Yelong Shen, Xiaodong Liu, Pengcheng He, Weizhu Chen and Jianfeng Gao | N/A | N/A |
| ABCD: A Graph Framework to Convert Complex Sentences to a Covering Set of Simple Sentences | Yanjun Gao, Ting-Hao Huang and Rebecca J. Passonneau | N/A | N/A |
| Supporting Land Reuse of Former Open Pit Mining Sites using Text Classification and Active Learning | Christopher Schröder, Kim Bürgl, Yves Annanias, Andreas Niekler, Lydia Müller, Daniel Wiegreffe, Christian Bender, Christoph Mengs, Gerik Scheuermann and Gerhard Heyer | N/A | N/A |
| Reservoir Transformers | Sheng Shen, Alexei Baevski, Ari Morcos, Kurt Keutzer, Michael Auli and Douwe Kiela | N/A | N/A |
| KM-BART: Knowledge Enhanced Multimodal BART for Visual Commonsense Generation | Yiran Xing, Zai Shi, Zhao Meng, Gerhard Lakemeyer, Yunpu Ma and Roger Wattenhofer | N/A | N/A |
| CCMatrix: Mining Billions of High-Quality Parallel Sentences on the Web | Holger Schwenk, Guillaume Wenzek, Sergey Edunov, Edouard Grave, Armand Joulin and Angela Fan | N/A | N/A |
| Improving Paraphrase Detection with the Adversarial Paraphrasing Task | Animesh Nighojkar and John Licato | N/A | N/A |
| Structured Sentiment Analysis as Dependency Graph Parsing | Jeremy Barnes, Robin Kurtz, Stephan Oepen, Lilja Øvrelid and Erik Velldal | N/A | N/A |
| A Survey of Race, Racism, and Anti-Racism in NLP | Anjalie Field, Su Lin Blodgett, Zeerak Waseem and Yulia Tsvetkov | N/A | N/A |
| TIMEDIAL: Temporal Commonsense Reasoning in Dialog | Lianhui Qin, Aditya Gupta, Shyam Upadhyay, Luheng He, Yejin Choi and Manaal Faruqui | N/A | N/A |
| MATE-KD: Masked Adversarial TExt, a Companion to Knowledge Distillation | Ahmad Rashid, Vasileios Lioutas and Mehdi Rezagholizadeh | N/A | N/A |
| Meta-Learning with Variational Semantic Memory for Word Sense Disambiguation | Yingjun Du, Nithin Holla, Xiantong Zhen, Cees Snoek and Ekaterina Shutova | N/A | N/A |
| Diverse Pretrained Context Encodings Improve Document Translation | Domenic Donato, Lei Yu and Chris Dyer | N/A | N/A |
| Uncovering Constraint-Based Behavior in Neural Models via Targeted Fine-Tuning | Forrest Davis and Marten Van Schijndel | N/A | N/A |
| Lexicon Learning for Few Shot Sequence Modeling | Ekin Akyurek and Jacob Andreas | N/A | N/A |
| Employing Argumentation Knowledge Graphs for Neural Argument Generation | Khalid Al Khatib, Lukas Trautner, Henning Wachsmuth, Yufang Hou and Benno Stein | N/A | N/A |
| TAN-NTM: Topic Attention Networks for Neural Topic Modeling | Madhur Panwar, Shashank Shailabh, Milan Aggarwal and Balaji Krishnamurthy | N/A | N/A |
| LexFit: Lexical Fine-Tuning of Pretrained Language Models | Ivan Vulić, Edoardo Maria Ponti, Anna Korhonen and Goran Glavaš | N/A | N/A |
| OTTers: One-turn Topic Transitions for Open-Domain Dialogue | Karin Sevegnani, David M. Howcroft, Ioannis Konstas and Verena Rieser | N/A | N/A |
| ForecastQA: A Question Answering Challenge for Event Forecasting with Temporal Text Data | Woojeong Jin, Rahul Khanna, Suji Kim, Dong-Ho Lee, Fred Morstatter, Aram Galstyan and Xiang Ren | N/A | N/A |
| Enhancing the generalization for Intent Classification and Out-of-Domain Detection in SLU | Yilin Shen, Yen-Chang Hsu, Avik Ray and Hongxia Jin | N/A | N/A |
| Causal Analysis of Syntactic Agreement Mechanisms in Neural Language Models | Matthew Finlayson, Aaron Mueller, Sebastian Gehrmann, Stuart Shieber, Tal Linzen and Yonatan Belinkov | N/A | N/A |
| To POS Tag or Not to POS Tag: The Impact of POS Tags on Morphological Learning in Low-Resource Settings | Sarah Moeller, Ling Liu and Mans Hulden | N/A | N/A |
| Edited Media Understanding Frames: Reasoning About the Intent and Implications of Visual Misinformation | Jeff Da, Maxwell Forbes, Rowan Zellers, Anthony Zheng, Jena D. Hwang, Antoine Bosselut and Yejin Choi | N/A | N/A |
| Increasing Faithfulness in Knowledge-Grounded Dialogue with Controllable Features | Hannah Rashkin, David Reitter, Gaurav Singh Tomar and Dipanjan Das | N/A | N/A |
| Beyond Noise: Mitigating the Impact of Fine-grained Semantic Divergences on Neural Machine Translation | Eleftheria Briakou and Marine Carpuat | N/A | N/A |
| Continuous Language Generative Flow | Zineng Tang, Shiyue Zhang, Hyounghun Kim and Mohit Bansal | N/A | N/A |
| Recursive Tree-Structured Self-Attention for Answer Sentence Selection | Khalil Mrini, Emilia Farcas and Ndapa Nakashole | N/A | N/A |
| AdaTag: Multi-Attribute Value Extraction from Product Profiles with Adaptive Decoding | Jun Yan, Nasser Zalmout, Yan Liang, Christan Grant, Xiang Ren and Xin Luna Dong | N/A | N/A |
| Better than Average: Paired Evaluation of NLP systems | Maxime Peyrard, Wei Zhao, Steffen Eger and Robert West | N/A | N/A |
| Modeling Fine-Grained Entity Types with Box Embeddings | Yasumasa Onoe, Michael Boratko, Andrew McCallum and Greg Durrett | N/A | N/A |
| Value-Agnostic Conversational Semantic Parsing | Emmanouil Antonios Platanios, Adam Pauls, Subhro Roy, Yuchen Zhang, Alexander Kyte, Alan Guo, Sam Thomson, Jayant Krishnamurthy, Jason Wolfe, Jacob Andreas and Dan Klein | N/A | N/A |
| CoRI: Collective Relation Integration with Data Augmentation for Open Information Extraction | Zhengbao Jiang, Jialong Han, Bunyamin Sisman and Xin Luna Dong | N/A | N/A |
| Syntopical Graphs for Computational Argumentation Tasks | Joe Barrow, Rajiv Jain, Nedim Lipka, Franck Dernoncourt, Vlad Morariu, Varun Manjunatha, Douglas Oard, Philip Resnik and Henning Wachsmuth | N/A | N/A |
| Anonymisation Models for Text Data: State of the art, Challenges and Future Directions | Pierre Lison, Ildikó Pilán, David Sanchez, Montserrat Batet and Lilja Øvrelid | N/A | N/A |
| Selecting Informative Contexts Improves Language Model Fine-tuning | Richard Antonello, Nicole Beckage, Javier Turek and Alexander Huth | N/A | N/A |
| BERT is to NLP what AlexNet is to CV: Can Pre-Trained Language Models Identify Analogies? | Asahi Ushio, Luis Espinosa Anke, Steven Schockaert and Jose Camacho-Collados | N/A | N/A |
| Measure and Evaluation of Semantic Divergence across Two Languages | Syrielle Montariol and Alexandre Allauzen | N/A | N/A |
| Discriminative Reranking for Neural Machine Translation | Ann Lee, Michael Auli and Marc’Aurelio Ranzato | N/A | N/A |
| Probabilistic, Structure-Aware Algorithms for Improved Variety, Accuracy, and Coverage of AMR Alignments | Austin Blodgett and Nathan Schneider | N/A | N/A |
| Meta-Learning to Compositionally Generalize | Henry Conklin, Bailin Wang, Kenny Smith and Ivan Titov | N/A | N/A |
| Handling Extreme Class Imbalance in Technical Logbook Datasets | Farhad Akhbardeh, Cecilia Ovesdotter Alm, Marcos Zampieri and Travis Desell | N/A | N/A |
| W-RST: Towards a Weighted RST-style Discourse Framework | Patrick Huber, Wen Xiao and Giuseppe Carenini | N/A | N/A |
| Examining the Inductive Bias of Neural Language Models with Artificial Languages | Jennifer C. White and Ryan Cotterell | N/A | N/A |
| Factoring Statutory Reasoning as Language Understanding Challenges | Nils Holzenberger and Benjamin Van Durme | N/A | N/A |
| CitationIE: Leveraging the Citation Graph for Scientific Information Extraction | Vijay Viswanathan, Graham Neubig and Pengfei Liu | N/A | N/A |
| Changing the World by Changing the Data | Anna Rogers | N/A | N/A |
| Cross-replication Reliability - An Empirical Approach to Interpreting Inter-rater Reliability | Ka Wong, Praveen Paritosh and Lora Aroyo | N/A | N/A |
| Select, Extract and Generate: Neural Keyphrase Generation with Layer-wise Coverage Attention | Wasi Ahmad, Xiao Bai, Soomin Lee and Kai-Wei Chang | N/A | N/A |
| Automated Generation of Storytelling Vocabulary from Photographs for use in AAC | Mauricio Fontana De Vargas and Karyn Moffatt | N/A | N/A |
| DESCGEN: A Distantly Supervised Dataset for Generating Entity Descriptions | Weijia Shi, Mandar Joshi and Luke Zettlemoyer | N/A | N/A |
| An Empirical Study on Hyperparameter Optimization for Fine-Tuning Pre-trained Language Models | Xueqing Liu and Chi Wang | N/A | N/A |
| Keep It Simple: Unsupervised Simplification of Multi-Paragraph Text | Philippe Laban, Tobias Schnabel, Paul Bennett and Marti A. Hearst | N/A | N/A |
| Exploiting Document Structures and Cluster Consistencies for Event Coreference Resolution | Hieu Minh Tran, Duy Phung and Thien Huu Nguyen | N/A | N/A |
| A Targeted Assessment of Incremental Processing in Neural Language Models and Humans | Ethan Wilcox, Pranali Vani and Roger Levy | N/A | N/A |
| Data Augmentation with Adversarial Training for Cross-Lingual NLI | Xin Dong, Yaxin Zhu, Zuohui Fu, Dongkuan Xu and Gerard De Melo | N/A | N/A |
| Factuality Assessment as Modal Dependency Parsing | Jiarui Yao, Haoling Qiu, Jin Zhao, Bonan Min and Nianwen Xue | N/A | N/A |
| Accelerating Text Communication via Abbreviated Sentence Input | Jiban Adhikary, Jamie Berger and Keith Vertanen | N/A | N/A |
| End-to-End Lexically Constrained Machine Translation for Morphologically Rich Languages | Josef Jon, João Paulo Aires, Dusan Varis and Ondřej Bojar | N/A | N/A |
| Stereotyping Norwegian Salmon: An Inventory of Pitfalls in Fairness Benchmark Datasets | Su Lin Blodgett, Gilsinia Lopez, Alexandra Olteanu, Robert Sim and Hanna Wallach | N/A | N/A |
| Syntax-augmented Multilingual BERT for Cross-lingual Transfer | Wasi Ahmad, Haoran Li, Kai-Wei Chang and Yashar Mehdad | N/A | N/A |
| Language Embeddings for Typology and Cross-lingual Transfer Learning | Dian Yu, Taiqi He and Kenji Sagae | N/A | N/A |
| Breaking Down Walls of Text: How Can NLP Benefit Consumer Privacy? | Abhilasha Ravichander, Alan W Black, Thomas Norton, Shomir Wilson and Norman Sadeh | N/A | N/A |
| HiddenCut: Simple Data Augmentation for Natural Language Understanding with Better Generalizability | Jiaao Chen, Dinghan Shen, Weizhu Chen and Diyi Yang | N/A | N/A |
| ConvoSumm: Conversation Summarization Benchmark and Improved Abstractive Summarization with Argument Mining | Alexander Fabbri, Faiaz Rahman, Imad Rizvi, Borui Wang, Haoran Li, Yashar Mehdad and Dragomir Radev | N/A | N/A |
| Adapting High-resource NMT Models to Translate Low-resource Related Languages without Parallel Data | Wei-Jen Ko, Ahmed El-Kishky, Adithya Renduchintala, Vishrav Chaudhary, Naman Goyal, Francisco Guzmán, Pascale Fung, Philipp Koehn and Mona Diab | N/A | N/A |
| Learning Latent Structures for Cross Action Phrase Relations in Wet Lab Protocols | Chaitanya Kulkarni, Jany Chan, Eric Fosler-Lussier and Raghu Machiraju | N/A | N/A |
| Modeling Language Usage and Listener Engagement in Podcasts | Sravana Reddy, Mariya Lazarova, Yongze Yu and Rosie Jones | N/A | N/A |
| Improving Speech Translation by Understanding and Learning from the Auxiliary Text Translation Task | Yun Tang, Juan Pino, Xian Li, Changhan Wang and Dmitriy Genzel | N/A | N/A |
| On the Efficacy of Adversarial Data Collection for Question Answering: Results from a Large-Scale Randomized Study | Divyansh Kaushik, Douwe Kiela, Zachary C. Lipton and Wen-tau Yih | N/A | N/A |
| Learning Prototypical Functions for Physical Artifacts | Tianyu Jiang and Ellen Riloff | N/A | N/A |
| A unified approach to sentence segmentation of punctuated text in many languages | Rachel Wicks and Matt Post | N/A | N/A |
| Multilingual Speech Translation from Efficient Finetuning of Pretrained Models | Xian Li, Changhan Wang, Yun Tang, Chau Tran, Yuqing Tang, Juan Pino, Alexei Baevski, Alexis Conneau and Michael Auli | N/A | N/A |
| Instantaneous Grammatical Error Correction with Shallow Aggressive Decoding | Xin Sun, Tao Ge, Furu Wei and Houfeng Wang | N/A | N/A |
| Regression Bugs Are In Your Model! Measuring, Reducing and Analyzing Regressions In NLP Model Updates | Yuqing Xie, Yi-An Lai, Yuanjun Xiong, Yi Zhang and Stefano Soatto | N/A | N/A |
| Multi-TimeLine Summarization (MTLS): Improving Timeline Summarization by Generating Multiple Summaries | Yi Yu, Adam Jatowt, Antoine Doucet, Kazunari Sugiyama and Masatoshi Yoshikawa | N/A | N/A |
| Learning to Explain: Generating Stable Explanations Fast | Xuelin Situ, Ingrid Zukerman, Cecile Paris, Sameen Maruf and Gholamreza Haffari | N/A | N/A |
| StereoSet: Measuring stereotypical bias in pretrained language models | Moin Nadeem, Anna Bethke and Siva Reddy | N/A | N/A |
| Neural semi-Markov CRF for Monolingual Word Alignment | Wuwei Lan, Chao Jiang and Wei Xu | N/A | N/A |
| Privacy at Scale: Introducing the PrivaSeer Corpus of Web Privacy Policies | Mukund Srinath, Shomir Wilson and C Lee Giles | N/A | N/A |
| GL-GIN: Fast and Accurate Non-Autoregressive Model for Joint Multiple Intent Detection and Slot Filling | Libo Qin, Fuxuan Wei, Tianbao Xie, Xiao Xu, Wanxiang Che and Ting Liu | N/A | N/A |
| Explainable Prediction of Text Complexity: The Missing Preliminaries for Text Simplification | Cristina Garbacea, Mengtian Guo, Samuel Carton and Qiaozhu Mei | N/A | N/A |
| EmailSum: Abstractive Email Thread Summarization | Shiyue Zhang, Asli Celikyilmaz, Jianfeng Gao and Mohit Bansal | N/A | N/A |
| The R-U-A-Robot Dataset: Helping Avoid Chatbot Deception by Detecting User Questions About Human or Non-Human Identity | David Gros, Yu Li and Zhou Yu | N/A | N/A |
| Multi-Task Retrieval for Knowledge-Intensive Tasks | Jean Maillard, Vladimir Karpukhin, Fabio Petroni, Wen-tau Yih, Barlas Oguz, Veselin Stoyanov and Gargi Ghosh | N/A | N/A |
| Dependency-driven Relation Extraction with Attentive Graph Convolutional Networks | Yuanhe Tian, Guimin Chen, Yan Song and Xiang Wan | N/A | N/A |
| Benchmarking Scalable Methods for Streaming Cross Document Entity Coreference | Robert L Logan IV, Andrew McCallum, Sameer Singh and Dan Bikel | N/A | N/A |
| Compositional Generalization and Natural Language Variation: Can a Semantic Parsing Approach Handle Both? | Peter Shaw, Ming-Wei Chang, Panupong Pasupat and Kristina Toutanova | N/A | N/A |
| KaggleDBQA: Realistic Evaluation of Text-to-SQL Parsers | Chia-Hsuan Lee, Oleksandr Polozov and Matthew Richardson | N/A | N/A |
| The statistical advantage of automatic NLG metrics at the system level | Johnny Wei and Robin Jia | N/A | N/A |
| Style is NOT a single variable: Case Studies for Cross-Stylistic Language Understanding | Dongyeop Kang and Eduard Hovy | N/A | N/A |
| GTM: A Generative Triple-wise Model for Conversational Question Generation | Lei Shen, Fandong Meng, Jinchao Zhang, Yang Feng and Jie Zhou | N/A | N/A |
| Discontinuous Named Entity Recognition as Maximal Clique Discovery | Yucheng Wang, Bowen Yu, Hongsong Zhu, Tingwen Liu, Nan Yu and Limin Sun | N/A | N/A |
| Joint Models for Answer Verification in Question Answering Systems | Zeyu Zhang, Thuy Vu and Alessandro Moschitti | N/A | N/A |
| Language Model Augmented Relevance Score | Ruibo Liu, Jason Wei and Soroush Vosoughi | N/A | N/A |
| Dissecting Generation Modes for Abstractive Summarization Models via Ablation and Attribution | Jiacheng Xu and Greg Durrett | N/A | N/A |
| Rejuvenating Low-Frequency Words: Making the Most of Parallel Data in Non-Autoregressive Translation | Liang Ding, Longyue Wang, Xuebo Liu, Derek F. Wong, Dacheng Tao and Zhaopeng Tu | N/A | N/A |
| All That’s ‘Human’ Is Not Gold: Evaluating Human Evaluation of Generated Text | Elizabeth Clark, Tal August, Sofia Serrano, Nikita Haduong, Suchin Gururangan and Noah A. Smith | N/A | N/A |
| ReadOnce Transformers: Reusable Representations of Text for Transformers | Shih-Ting Lin, Ashish Sabharwal and Tushar Khot | N/A | N/A |
| Automatic ICD Coding via Interactive Shared Representation Networks with Self-distillation Mechanism | Tong Zhou, Pengfei Cao, Yubo Chen, Kang Liu, Jun Zhao, Kun Niu, Weifeng Chong and Shengping Liu | N/A | N/A |
| E2E-VLP: End-to-End Vision-Language Pre-training Enhanced by Visual Learning | Haiyang Xu, Ming Yan, Chenliang Li, Bin Bi, Songfang Huang, Wenming Xiao and Fei Huang | N/A | N/A |
| DynaEval: Unifying Turn and Dialogue Level Evaluation | Chen Zhang, Yiming Chen, Luis Fernando D’Haro, Yan Zhang, Thomas Friedrichs, Grandee Lee and Haizhou Li | N/A | N/A |
| Stance Detection in COVID-19 Tweets | Kyle Glandt, Sarthak Khanal, Yingjie Li, Doina Caragea and Cornelia Caragea | N/A | N/A |
| Bootstrapped Unsupervised Sentence Representation Learning | Yan Zhang, Ruidan He, Zuozhu Liu, Lidong Bing and Haizhou Li | N/A | N/A |
| Neural Bi-Lexicalized PCFG Induction | Songlin Yang, Yanpeng Zhao and Kewei Tu | N/A | N/A |
| Surprisal Estimators for Human Reading Times Need Character Models | Byung-Doh Oh, Christian Clark and William Schuler | N/A | N/A |
| Weakly Supervised Named Entity Tagging with Learnable Logical Rules | Jiacheng Li, Haibo Ding, Jingbo Shang, Julian McAuley and Zhe Feng | N/A | N/A |
| Intrinsic Bias Metrics Do Not Correlate with Application Bias | Seraphina Goldfarb-Tarrant, Rebecca Marchant, Ricardo Muñoz Sánchez, Mugdha Pandya and Adam Lopez | N/A | N/A |
| A Hierarchical VAE for Calibrating Attributes while Generating Text using Normalizing Flow | Bidisha Samanta, Mohit Agrawal and Niloy Ganguly | N/A | N/A |
| DYPLOC: Dynamic Planning of Content Using Mixed Language Models for Text Generation | Xinyu Hua, Ashwin Sreevatsa and Lu Wang | N/A | N/A |
| Scientific Credibility of Machine Translation Research: A Meta-Evaluation of 769 Papers | Benjamin Marie, Atsushi Fujita and Raphael Rubino | N/A | N/A |
| SpanNER: Named Entity Re-/Recognition as Span Prediction | Jinlan Fu, Xuanjing Huang and Pengfei Liu | N/A | N/A |
| Self-Supervised Multimodal Opinion Summarization | Jinbae Im, Moonki Kim, Hoyeop Lee, Hyunsouk Cho and Sehee Chung | N/A | N/A |
| Which Linguist Invented the Lightbulb? Presupposition Verification for Question-Answering | Najoung Kim, Ellie Pavlick, Burcu Karagol Ayan and Deepak Ramachandran | N/A | N/A |
| How to Adapt Your Pretrained Multilingual Model to 1600 Languages | Abteen Ebrahimi and Katharina Kann | N/A | N/A |
| A Training-free and Reference-free Summarization Evaluation Metric via Centrality-weighted Relevance and Self-referenced Redundancy | Wang Chen, Piji Li and Irwin King | N/A | N/A |
| PIGLeT: Language Grounding Through Neuro-Symbolic Interaction in a 3D World | Rowan Zellers, Ari Holtzman, Matthew Peters, Roozbeh Mottaghi, Aniruddha Kembhavi, Ali Farhadi and Yejin Choi | N/A | N/A |
| LNN-EL: A Neuro-Symbolic Approach to Short-text Entity Linking | Hang Jiang, Sairam Gurajada, Qiuhao Lu, Sumit Neelam, Lucian Popa, Prithviraj Sen, Yunyao Li and Alexander Gray | N/A | N/A |
| Refining Sample Embeddings with Relation Prototypes to Enhance Continual Relation Extraction | Li Cui, Deqing Yang, Jiaxin Yu, Chengwei Hu, Jiayang Cheng, Jingjie Yi and Yanghua Xiao | N/A | N/A |
| Controllable Open-ended Question Generation with A New Question Type Ontology | Shuyang Cao and Lu Wang | N/A | N/A |
| Measuring Conversational Uptake: A Case Study on Student-Teacher Interactions | Dorottya Demszky, Jing Liu, Zid Mancenido, Julie Cohen, Heather Hill, Dan Jurafsky and Tatsunori Hashimoto | N/A | N/A |
| Modeling Transitions of Focal Entities for Conversational Knowledge Base Question Answering | Yunshi Lan and Jing Jiang | N/A | N/A |
| Vocabulary Learning via Optimal Transport for Neural Machine Translation | Jingjing Xu, Hao Zhou, Chun Gan, Zaixiang Zheng and Lei Li | N/A | N/A |
| A Gradually Soft Multi-Task and Data-Augmented Approach to Medical Question Understanding | Khalil Mrini, Franck Dernoncourt, Seunghyun Yoon, Trung Bui, Walter Chang, Emilia Farcas and Ndapa Nakashole | N/A | N/A |
| Generating SOAP Notes from Doctor-Patient Conversations Using Modular Summarization Techniques | Kundan Krishna, Sopan Khosla, Jeffrey Bigham and Zachary C. Lipton | N/A | N/A |
| Aspect-Category-Opinion-Sentiment Quadruple Extraction with Implicit Aspects and Opinions | Hongjie Cai, Rui Xia and Jianfei Yu | N/A | N/A |
| DExperts: Decoding-Time Controlled Text Generation with Experts and Anti-Experts | Alisa Liu, Maarten Sap, Ximing Lu, Swabha Swayamdipta, Chandra Bhagavatula, Noah A. Smith and Yejin Choi | N/A | N/A |
| Topic-Aware Evidence Reasoning and Stance-Aware Aggregation for Fact Verification | Jiasheng Si, Deyu Zhou, Tongzhe Li, Xingyu Shi and Yulan He | N/A | N/A |
| Hidden Killer: Invisible Textual Backdoor Attacks with Syntactic Trigger | Fanchao Qi, Mukai Li, Yangyi Chen, Zhengyan Zhang, Zhiyuan Liu, Yasheng Wang and Maosong Sun | N/A | N/A |
| Multimodal Multi-Speaker Merger & Acquisition Financial Modeling: A New Task, Dataset, and Neural Baselines | Ramit Sawhney, Mihir Goyal, Prakhar Goel, Puneet Mathur and Rajiv Ratn Shah | N/A | N/A |
| Exploring Discourse Structures for Argument Impact Classification | Xin Liu, Jiefu Ou, Yangqiu Song and Xin Jiang | N/A | N/A |
| Evaluating Entity Disambiguation and the Role of Popularity in Retrieval-Based NLP | Anthony Chen, Pallavi Gudipati, Shayne Longpre, Xiao Ling and Sameer Singh | N/A | N/A |
| QASR: QCRI Aljazeera Speech Resource A Large Scale Annotated Arabic Speech Corpus | Hamdy Mubarak, Amir Hussein, Shammur Absar Chowdhury and Ahmed Ali | N/A | N/A |
| LGESQL: Line Graph Enhanced Text-to-SQL Model with Mixed Local and Non-Local Relations | Ruisheng Cao, Lu Chen, Zhi Chen, Yanbin Zhao, Su Zhu and Kai Yu | N/A | N/A |
| Mind Your Outliers! Investigating the Negative Impact of Outliers on Active Learning for Visual Question Answering | Siddharth Karamcheti, Ranjay Krishna, Li Fei-Fei and Christopher Manning | N/A | N/A |
| ARBERT & MARBERT: Deep Bidirectional Transformers for Arabic | Muhammad Abdul-Mageed, AbdelRahim Elmadany and El Moatez Billah Nagoudi | N/A | N/A |
| Glancing Transformer for Non-Autoregressive Neural Machine Translation | Lihua Qian, Hao Zhou, Yu Bao, Mingxuan Wang, Lin Qiu, Weinan Zhang, Yong Yu and Lei Li | N/A | N/A |
| Alignment Rationale for Natural Language Inference | Zhongtao Jiang, Yuanzhe Zhang, Zhao Yang, Jun Zhao and Kang Liu | N/A | N/A |
| Learning Event Graph Knowledge for Abductive Reasoning | Li Du, Xiao Ding, Ting Liu and Bing Qin | N/A | N/A |
| Towards Table-to-Text Generation with Numerical Reasoning | Lya Hulliyyatus Suadaa, Hidetaka Kamigaito, Kotaro Funakoshi, Manabu Okumura and Hiroya Takamura | N/A | N/A |
| Check It Again: Progressive Visual Question Answering via Visual Entailment | Qingyi Si, Zheng Lin, Ming Yu Zheng, Peng Fu and Weiping Wang | N/A | N/A |
| Assessing the Representations of Idiomaticity in Vector Models with a Noun Compound Dataset Labeled at Type and Token Levels | Marcos Garcia, Tiago Kramer Vieira, Carolina Scarton, Marco Idiart and Aline Villavicencio | N/A | N/A |
| Document-level Event Extraction via Parallel Prediction Networks | Hang Yang, Dianbo Sui, Yubo Chen, Kang Liu, Jun Zhao and Taifeng Wang | N/A | N/A |
| More Identifiable yet Equally Performant Transformers for Text Classification | Rishabh Bhardwaj, Navonil Majumder, Soujanya Poria and Eduard Hovy | N/A | N/A |
| Guiding Teacher Forcing with Seer Forcing for Neural Machine Translation | Yang Feng, Shuhao Gu, Dengji Guo, Zhengxin Yang and Chenze Shao | N/A | N/A |
| StructuralLM: Structural Pre-training for Form Understanding | Chenliang Li, Bin Bi, Ming Yan, Wei Wang, Songfang Huang, Fei Huang and Luo Si | N/A | N/A |
| A Mutual Information Maximization Approach for the Spurious Solution Problem in Weakly Supervised Question Answering | Zhihong Shao, Lifeng Shang, Qun Liu and Minlie Huang | N/A | N/A |
| Learning Relation Alignment for Calibrated Cross-modal Retrieval | Shuhuai Ren, Junyang Lin, Guangxiang Zhao, Rui Men, An Yang, Jingren Zhou, Xu Sun and Hongxia Yang | N/A | N/A |
| TGEA: An Error-Annotated Dataset and Benchmark Tasks for Text Generation from Pretrained Language Models | Jie He, Bo Peng, Yi Liao, Qun Liu and Deyi Xiong | N/A | N/A |
| Learn to Resolve Conversational Dependency: A Consistency Training Framework for Conversational Question Answering | Gangwoo Kim, Hyunjae Kim, Jungsoo Park and Jaewoo Kang | N/A | N/A |
| AdvPicker: Effectively Leveraging Unlabeled Data via Adversarial Discriminator for Cross-Lingual NER | Weile Chen, Huiqiang Jiang, Qianhui Wu, Borje Karlsson and Yi Guan | N/A | N/A |
| A Survey of Code-switching: Linguistic and Social Perspectives for Language Technologies | A. Seza Doğruöz, Sunayana Sitaram, Barbara E. Bullock and Almeida Jacqueline Toribio | N/A | N/A |
| Tail-to-Tail Non-Autoregressive Sequence Prediction for Chinese Grammatical Error Correction | Piji Li and Shuming Shi | N/A | N/A |
| Risk Minimization for Zero-shot Sequence Labeling | Zechuan Hu, Yong Jiang, Nguyen Bach, Tao Wang, Zhongqiang Huang, Fei Huang and Kewei Tu | N/A | N/A |
| Exploring Distantly-Labeled Rationales in Neural Network Models | Quzhe Huang, Shengqi Zhu, Yansong Feng and Dongyan Zhao | N/A | N/A |
| CLINE: Contrastive Learning with Semantic Negative Examples for Natural Language Understanding | Dong Wang, Ning Ding, Piji Li and Haitao Zheng | N/A | N/A |
| Learning Span-Level Interactions for Aspect Sentiment Triplet Extraction | Lu Xu, Yew Ken Chia and Lidong Bing | N/A | N/A |
| PHMOSpell: Phonological and Morphological Knowledge Guided Chinese Spelling Check | Li Huang, Junjie Li, Weiwei Jiang, Zhiyu Zhang, Minchuan Chen, Shaojun Wang and Jing Xiao | N/A | N/A |
| Learning to Perturb Word Embeddings for Out-of-distribution QA | Seanie Lee, Minki Kang, Juho Lee and Sung Ju Hwang | N/A | N/A |
| The Possible, the Plausible, and the Desirable: Event-Based Modality Detection for Language Processing | Valentina Pyatkin, Shoval Sadde, Aynat Rubinstein, Paul Portner and Reut Tsarfaty | N/A | N/A |
| WARP: Word-level Adversarial ReProgramming | Karen Hambardzumyan, Hrant Khachatrian and Jonathan May | N/A | N/A |
| BERTAC: Enhancing Transformer-based Language Models with Adversarially Pretrained Convolutional Neural Networks | Jong-Hoon Oh, Ryu Iida, Julien Kloetzer and Kentaro Torisawa | N/A | N/A |
| A Neural Transition-based Joint Model for Disease Named Entity Recognition and Normalization | Zongcheng Ji, Tian Xia, Mei Han and Jing Xiao | N/A | N/A |
| Annotating Online Misogyny | Philine Zeinert, Nanna Inie and Leon Derczynski | N/A | N/A |
| Search from History and Reason for Future: Two-stage Reasoning on Temporal Knowledge Graphs | Zixuan Li, Xiaolong Jin, Saiping Guan, Wei Li, Jiafeng Guo, Yuanzhuo Wang and Xueqi Cheng | N/A | N/A |
| Exploiting Language Relatedness for Low Web-Resource Language Model Adaptation: An Indic Languages Study | Yash Khemchandani, Sarvesh Mehtani, Vaidehi Patil, Abhijeet Awasthi, Partha Talukdar and Sunita Sarawagi | N/A | N/A |
| Interpretable and Low-Resource Entity Matching via Decoupling Feature Learning from Decision Making | Zijun Yao, Chengjiang Li, Tiansi Dong, Xin Lv, Jifan Yu, Lei Hou, Juanzi Li, Yichi Zhang and Zelin Dai | N/A | N/A |
| Enabling Lightweight Fine-tuning for Pre-trained Language Model Compression based on Matrix Product Operators | Peiyu Liu, Ze-Feng Gao, Wayne Xin Zhao, Zhi-Yuan Xie, Zhong-Yi Lu and Ji-Rong Wen | N/A | N/A |
| On Sample Based Explanation Methods for NLP: Faithfulness, Efficiency and Semantic Evaluation | Wei Zhang, Ziming Huang, Yada Zhu, Guangnan Ye, Xiaodong Cui and Fan Zhang | N/A | N/A |
| Length-Adaptive Transformer: Train Once with Length Drop, Use Anytime with Search | Gyuwan Kim and Kyunghyun Cho | N/A | N/A |
| H-Transformer-1D: Fast One-Dimensional Hierarchical Attention for Sequences | Zhenhai Zhu and Radu Soricut | N/A | N/A |
| The Curse of Dense Low-Dimensional Information Retrieval for Large Index Sizes | Nils Reimers and Iryna Gurevych | N/A | N/A |
| What’s in the Box? An Analysis of Undesirable Content in the Common Crawl Corpus | Alexandra Luccioni and Joseph Viviano | N/A | N/A |
| Uncertainty and Surprisal Jointly Deliver the Punchline: Exploiting Incongruity-Based Features for Humor Recognition | Yubo Xie, Junze Li and Pearl Pu | N/A | N/A |
| Parameter Selection: Why We Should Pay More Attention to It | Jie-Jyun Liu, Tsung-Han Yang, Si-An Chen and Chih-Jen Lin | N/A | N/A |
| DuReader_robust: A Chinese Dataset Towards Evaluating Robustness and Generalization of Machine Reading Comprehension in Real-World Applications | Hongxuan Tang, Hongyu Li, Jing Liu, Yu Hong, Hua Wu and Haifeng Wang | N/A | N/A |
| How effective is BERT without word ordering? Implications for language understanding and data privacy | Jack Hessel and Alexandra Schofield | N/A | N/A |
| Attentive Multiview Text Representation for Differential Diagnosis | Hadi Amiri, Mitra Mohtarami and Isaac Kohane | N/A | N/A |
| AligNarr: Aligning Narratives on Movies | Paramita Mirza, Mostafa Abouhamra and Gerhard Weikum | N/A | N/A |
| Exploration and Exploitation: Two Ways to Improve Chinese Spelling Correction Models | Chong Li, Cenyuan Zhang, Xiaoqing Zheng and Xuanjing Huang | N/A | N/A |
| Targeting the Benchmark: On Methodology in Current Natural Language Processing Research | David Schlangen | N/A | N/A |
| Towards a more Robust Evaluation for Conversational Question Answering | Wissam Siblini, Baris Sayil and Yacine Kessaci | N/A | N/A |
| TIMERS: Document-level Temporal Relation Extraction | Puneet Mathur, Rajiv Jain, Franck Dernoncourt, Vlad Morariu, Quan Hung Tran and Dinesh Manocha | N/A | N/A |
| ROPE: Reading Order Equivariant Positional Encoding for Graph-based Document Information Extraction | Chen-Yu Lee, Chun-Liang Li, Chu Wang, Renshen Wang, Yasuhisa Fujii, Siyang Qin, Ashok Popat and Tomas Pfister | N/A | N/A |
| Enhancing Entity Boundary Detection for Better Chinese Named Entity Recognition | Chun Chen and Fang Kong | N/A | N/A |
| Coreference Resolution without Span Representations | Yuval Kirstain, Ori Ram and Omer Levy | N/A | N/A |
| Sequence to General Tree: Knowledge-Guided Geometry Word Problem Solving | Shih-hung Tsai, Chao-Chun Liang, Hsin-Min Wang and Keh-Yih Su | N/A | N/A |
| Training Adaptive Computation for Open-Domain Question Answering with Computational Constraints | Yuxiang Wu, Pasquale Minervini, Pontus Stenetorp and Sebastian Riedel | N/A | N/A |
| Embedding Time Differences in Context-sensitive Neural Networks for Learning Time to Event | Nazanin Dehghani, Hassan Hajipoor and Hadi Amiri | N/A | N/A |
| Unsupervised Cross-Domain Prerequisite Chain Learning using Variational Graph Autoencoders | Irene Li, Vanessa Yan, Tianxiao Li, Rihao Qu and Dragomir Radev | N/A | N/A |
| Beyond Laurel/Yanny: An Autoencoder-Enabled Search for Polyperceivable Audio | Kartik Chandra, Chuma Kabaghe and Gregory Valiant | N/A | N/A |
| Modeling Discriminative Representations for Out-of-Domain Detection with Supervised Contrastive Learning | Zhiyuan Zeng, Keqing He, Yuanmeng Yan, Zijun Liu, Yanan Wu, Hong Xu, Huixing Jiang and Weiran Xu | N/A | N/A |
| Video Paragraph Captioning as a Text Summarization Task | Hui Liu and Xiaojun Wan | N/A | N/A |
| Entity Enhancement for Implicit Discourse Relation Classification in the Biomedical Domain | Wei Shi and Vera Demberg | N/A | N/A |
| A Mixture-of-Experts Model for Antonym-Synonym Discrimination | Zhipeng Xie and Nan Zeng | N/A | N/A |
| Attention Flows are Shapley Value Explanations | Kawin Ethayarajh and Dan Jurafsky | N/A | N/A |
| Learning Domain-Specialised Representations for Cross-Lingual Biomedical Entity Linking | Fangyu Liu, Ivan Vulić, Anna Korhonen and Nigel Collier | N/A | N/A |
| BERTTune: Fine-Tuning Neural Machine Translation with BERTScore | Inigo Jauregi Unanue, Jacob Parnell and Massimo Piccardi | N/A | N/A |
| SaRoCo: Detecting Satire in a Novel Romanian Corpus of News Articles | Ana-Cristina Rogoz, Gaman Mihaela and Radu Tudor Ionescu | N/A | N/A |
| Improving Arabic Diacritization with Regularized Decoding and Adversarial Training | Han Qin, Guimin Chen, Yuanhe Tian and Yan Song | N/A | N/A |
| Higher-order Derivatives of Weighted Finite-state Machines | Ran Zmigrod, Tim Vieira and Ryan Cotterell | N/A | N/A |
| Multi-Scale Progressive Attention Network for Video Question Answering | Zhicheng Guo, Jiaxuan Zhao, Licheng Jiao, Xu Liu and Lingling Li | N/A | N/A |
| Are VQA Systems RAD? Measuring Robustness to Augmented Data with Focused Interventions | Daniel Rosenberg, Itai Gat, Amir Feder and Roi Reichart | N/A | N/A |
| Efficient Passage Retrieval with Hashing for Open-domain Question Answering | Ikuya Yamada, Akari Asai and Hannaneh Hajishirzi | N/A | N/A |
| Saying No is An Art: Contextualized Fallback Responses for Unanswerable Dialogue Queries | Ashish Shrivastava, Kaustubh Dhole, Abhinav Bhatt and Sharvani Raghunath | N/A | N/A |
| An Empirical Study on Adversarial Attack on NMT: Languages and Positions Matter | Zhiyuan Zeng and Deyi Xiong | N/A | N/A |
| Doing Good or Doing Right? Exploring the Weakness of Commonsense Causal Reasoning Models | Mingyue Han and Yinglin Wang | N/A | N/A |
| Catchphrase: Automatic Detection of Cultural References | Nir Sweed and Dafna Shahaf | N/A | N/A |
| Towards Visual Question Answering on Pathology Images | Xuehai He, Zhuo Cai, Wenlan Wei, Yichen Zhang, Luntian Mou, Eric Xing and Pengtao Xie | N/A | N/A |
| Exposing the limits of Zero-shot Cross-lingual Hate Speech Detection | Debora Nozza | N/A | N/A |
| QA-Driven Zero-shot Slot Filling with Weak Supervision Pretraining | Xinya Du, Luheng He, Qi Li, Dian Yu, Panupong Pasupat and Yuan Zhang | N/A | N/A |
| Entity Concept-enhanced Few-shot Relation Extraction | Shan Yang, Yongfei Zhang, Guanglin Niu, Qinghua Zhao and Shiliang Pu | N/A | N/A |
| Enhancing Descriptive Image Captioning with Natural Language Inference | Zhan Shi, Hui Liu and Xiaodan Zhu | N/A | N/A |
| Cross-lingual Text Classification with Heterogeneous Graph Neural Network | Ziyun Wang, Xuan Liu, Peiji Yang, Shixing Liu and Zhisheng Wang | N/A | N/A |
| Preview, Attend and Review: Schema-Aware Curriculum Learning for Multi-Domain Dialogue State Tracking | Yinpei Dai, Hangyu Li, Yongbin Li, Jian Sun, Fei Huang, Luo Si and Xiaodan Zhu | N/A | N/A |
| Continual Learning for Task-oriented Dialogue System with Iterative Network Pruning, Expanding and Masking | Binzong Geng, Fajie Yuan, Qiancheng Xu, Ying Shen, Ruifeng Xu and Min Yang | N/A | N/A |
| On the Generation of Medical Dialogs for COVID-19 | Meng Zhou, Zechen Li, Bowen Tan, Guangtao Zeng, Wenmian Yang, Xuehai He, Zeqian Ju, Subrato Chakravorty, Shu Chen, Xingyi Yang, Yichen Zhang, Qingyang Wu, Zhou Yu, Kun Xu, Eric Xing and Pengtao Xie | N/A | N/A |
| Domain-Adaptive Pretraining Methods for Dialogue Understanding | Han Wu, Kun Xu, Linfeng Song, Lifeng Jin, Haisong Zhang and Linqi Song | N/A | N/A |
| Zero-shot Fact Verification by Claim Generation | Liangming Pan, Wenhu Chen, Wenhan Xiong, Min-Yen Kan and William Yang Wang | N/A | N/A |
| Hi-Transformer: Hierarchical Interactive Transformer for Efficient and Effective Long Document Modeling | Chuhan Wu, Fangzhao Wu, Tao Qi and Yongfeng Huang | N/A | N/A |
| Improving Compositional Generalization in Classification Tasks via Structure Annotations | Juyong Kim, Pradeep Ravikumar, Joshua Ainslie and Santiago Ontanon | N/A | N/A |
| Bilingual Mutual Information Based Adaptive Training for Neural Machine Translation | Yangyifan Xu, Yijin Liu, Fandong Meng, Jiajun Zhang, Jinan Xu and Jie Zhou | N/A | N/A |
| Improving Model Generalization: A Chinese Named Entity Recognition Case Study | Guanqing Liang and Cane Wing-Ki Leung | N/A | N/A |
| On Orthogonality Constraints for Transformers | Aston Zhang, Alvin Chan, Yi Tay, Jie Fu, Shuohang Wang, Shuai Zhang, Huajie Shao, Shuochao Yao and Roy Ka-Wei Lee | N/A | N/A |
| Improving Lexically Constrained Neural Machine Translation with Source-Conditioned Masked Span Prediction | Gyubok Lee, Seongjun Yang and Edward Choi | N/A | N/A |
| Is Sparse Attention more Interpretable? | Clara Meister, Stefan Lazov, Isabelle Augenstein and Ryan Cotterell | N/A | N/A |
| What Motivates You? Benchmarking Automatic Detection of Basic Needs from Short Posts | Sanja Stajner, Seren Yenikent, Bilal Ghanem and Marc Franco-Salvador | N/A | N/A |
| Constructing Multi-Modal Dialogue Dataset by Replacing Text with Semantically Relevant Images | Nyoungwoo Lee, Suwon Shin, Jaegul Choo, Ho-Jin Choi and Sung-Hyon Myaeng | N/A | N/A |
| Using Adversarial Attacks to Reveal the Statistical Bias in Machine Reading Comprehension Models | Jieyu Lin, Jiajie Zou and Nai Ding | N/A | N/A |
| Modeling Task-Aware MIMO Cardinality for Efficient Multilingual Neural Machine Translation | Hongfei Xu, Qiuhui Liu, Josef Van Genabith and Deyi Xiong | N/A | N/A |
| Men Are Elected, Women Are Married: Events Gender Bias on Wikipedia | Jiao Sun and Nanyun Peng | N/A | N/A |
| Learning to Generate Task-Specific Adapters from Task Description | Qinyuan Ye and Xiang Ren | N/A | N/A |
| Continual Quality Estimation with Online Bayesian Meta-Learning | Abiola Obamuyide, Marina Fomicheva and Lucia Specia | N/A | N/A |
| Relative Importance in Sentence Processing | Nora Hollenstein and Lisa Beinborn | N/A | N/A |
| PRAL: A Tailored Pre-Training Model for Task-Oriented Dialog Generation | Jing Gu, Qingyang Wu, Chongruo Wu, Weiyan Shi and Zhou Yu | N/A | N/A |
| WikiSum: Coherent Summarization Dataset for Efficient Human-Evaluation | Nachshon Cohen, Oren Kalinsky, Yftah Ziser and Alessandro Moschitti | N/A | N/A |
| Replicating and Extending “Because Their Treebanks Leak”: Graph Isomorphism, Covariants, and Parser Performance | Mark Anderson, Anders Sogaard and Carlos Gómez-Rodríguez | N/A | N/A |
| Distinct Label Representations for Few-Shot Text Classification | Sora Ohashi, Junya Takayama, Tomoyuki Kajiwara and Yuki Arase | N/A | N/A |
| Deep Context- and Relation-Aware Learning for Aspect-based Sentiment Analysis | Shinhyeok Oh, Dongyub Lee, Taesun Whang, IlNam Park, Seo Gaeun, EungGyun Kim and Harksoo Kim | N/A | N/A |
| Towards Generative Aspect-Based Sentiment Analysis | Wenxuan Zhang, Xin Li, Yang Deng, Lidong Bing and Wai Lam | N/A | N/A |
| Adaptive Nearest Neighbor Machine Translation | Xin Zheng, Zhirui Zhang, Junliang Guo, Shujian Huang, Boxing Chen, Weihua Luo and Jiajun Chen | N/A | N/A |
| VAULT: VAriable Unified Long Text Representation for Machine Reading Comprehension | Haoyang Wen, Anthony Ferritto, Heng Ji, Radu Florian and Avi Sil | N/A | N/A |
| When is Char Better Than Subword: A Systematic Study of Segmentation Algorithms for Neural Machine Translation | Jiahuan Li, Yutong Shen, Shujian Huang, Xinyu Dai and Jiajun Chen | N/A | N/A |
| On Training Instance Selection for Few-Shot Neural Text Generation | Ernie Chang, Xiaoyu Shen, Hui-Syuan Yeh and Vera Demberg | N/A | N/A |
| How Helpful is Inverse Reinforcement Learning for Table-to-Text Generation? | Sayan Ghosh, Zheng Qi, Snigdha Chaturvedi and Shashank Srivastava | N/A | N/A |
| Weakly-Supervised Methods for Suicide Risk Assessment: Role of Related Domains | Chenghao Yang, Yudong Zhang and Smaranda Muresan | N/A | N/A |
| MOLEMAN: Mention-Only Linking of Entities with a Mention Annotation Network | Nicholas FitzGerald, Dan Bikel, Jan Botha, Daniel Gillick, Tom Kwiatkowski and Andrew McCallum | N/A | N/A |
| Thank you BART! Rewarding Pre-Trained Models Improves Formality Style Transfer | Huiyuan Lai, Antonio Toral and Malvina Nissim | N/A | N/A |
| DefSent: Sentence Embeddings using Definition Sentences | Hayato Tsukagoshi, Ryohei Sasano and Koichi Takeda | N/A | N/A |
| OntoGUM: Evaluating Contextualized SOTA Coreference Resolution on 12 More Genres | Yilun Zhu, Sameer Pradhan and Amir Zeldes | N/A | N/A |
| An Improved Model for Voicing Silent Speech | David Gaddy and Dan Klein | N/A | N/A |
| A Simple Recipe for Multilingual Grammatical Error Correction | Sascha Rothe, Jonathan Mallinson, Eric Malmi, Sebastian Krause and Aliaksei Severyn | N/A | N/A |
| Pre-training is a Hot Topic: Contextualized Document Embeddings Improve Topic Coherence | Federico Bianchi, Silvia Terragni and Dirk Hovy | N/A | N/A |
| Explicitly Capturing Relations between Entity Mentions via Graph Neural Networks for Domain-specific Named Entity Recognition | Pei Chen, Haibo Ding, Jun Araki and Ruihong Huang | N/A | N/A |
| Bringing Structure into Summaries: a Faceted Summarization Dataset for Long Scientific Documents | Rui Meng, Khushboo Thaker, Lei Zhang, Yue Dong, Xingdi Yuan, Tong Wang and Daqing He | N/A | N/A |
| Unsupervised Pronoun Resolution via Masked Noun-Phrase Prediction | Ming Shen, Pratyay Banerjee and Chitta Baral | N/A | N/A |
| Anchor-based Bilingual Word Embeddings for Low-Resource Languages | Tobias Eder, Viktor Hangya and Alexander Fraser | N/A | N/A |
| Learning to Solve NLP Tasks in an Incremental Number of Languages | Giuseppe Castellucci, Simone Filice, Danilo Croce and Roberto Basili | N/A | N/A |
| Quantifying and Avoiding Unfair Qualification Labour in Crowdsourcing | Jonathan K. Kummerfeld | N/A | N/A |
| The Case for Translation-Invariant Self-Attention in Transformer-Based Language Models | Ulme Wennberg and Gustav Eje Henter | N/A | N/A |
| Question Generation for Adaptive Education | Megha Srivastava and Noah Goodman | N/A | N/A |
| Measuring and Improving BERT’s Mathematical Abilities by Predicting the Order of Reasoning. | Piotr Piękos, Mateusz Malinowski and Henryk Michalewski | N/A | N/A |
| MedNLI Is Not Immune: Natural Language Inference Artifacts in the Clinical Domain | Christine Herlihy and Rachel Rudinger | N/A | N/A |
| Lightweight Adapter Tuning for Multilingual Speech Translation | Hang Le, Juan Pino, Changhan Wang, Jiatao Gu, Didier Schwab and Laurent Besacier | N/A | N/A |
| Efficient Text-based Reinforcement Learning by Jointly Leveraging State and Commonsense Graph Representations | Keerthiram Murugesan, Mattia Atzeni, Pavan Kapanipathi, Kartik Talamadupula, Mrinmaya Sachan and Murray Campbell | N/A | N/A |
| Zero-shot Event Extraction via Transfer Learning: Challenges and Insights | Qing Lyu, Hongming Zhang, Elior Sulem and Dan Roth | N/A | N/A |
| AND does not mean OR: Using Formal Languages to Study Language Models’ Representations | Aaron Traylor, Roman Feiman and Ellie Pavlick | N/A | N/A |
| Can Transformer Models Measure Coherence In Text: Re-Thinking the Shuffle Test | Philippe Laban, Luke Dai, Lucas Bandarkar and Marti A. Hearst | N/A | N/A |
| An Exploratory Analysis of Multilingual Word-Level Quality Estimation with Cross-Lingual Transformers | Tharindu Ranasinghe, Constantin Orasan and Ruslan Mitkov | N/A | N/A |
| Towards more equitable question answering systems: How much more data do you need? | Arnab Debnath, Navid Rajabi, Fardina Fathmiul Alam and Antonios Anastasopoulos | N/A | N/A |
| nmT5 - Is parallel data still relevant for pre-training massively multilingual language models? | Mihir Kale, Aditya Siddhant, Rami Al-Rfou, Linting Xue, Noah Constant and Melvin Johnson | N/A | N/A |
| Input Representations for Parsing Discourse Representation Structures: Comparing English with Chinese | Chunliu Wang, Rik Van Noord, Arianna Bisazza and Johan Bos | N/A | N/A |
| Code Generation from Natural Language with Less Prior Knowledge and More Monolingual Data | Sajad Norouzi, Keyi Tang and Yanshuai Cao | N/A | N/A |
| Gender bias amplification during Speed-Quality optimization in Neural Machine Translation | Adithya Renduchintala, Denise Diaz, Kenneth Heafield, Xian Li and Mona Diab | N/A | N/A |
| Enforcing Consistency in Weakly Supervised Semantic Parsing | Nitish Gupta, Sameer Singh and Matt Gardner | N/A | N/A |
| Multilingual Agreement for Multilingual Neural Machine Translation | Jian Yang, Yuwei Yin, Shuming Ma, Haoyang Huang, Dongdong Zhang, Zhoujun Li and Furu Wei | N/A | N/A |
| Robust Transfer Learning with Pretrained Language Models through Adapters | Wenjuan Han, Bo Pang and Ying Nian Wu | N/A | N/A |
| mTVR: Multilingual Moment Retrieval in Videos | Jie Lei, Tamara Berg and Mohit Bansal | N/A | N/A |
| Reinforcement Learning for Abstractive Question Summarization with Question-aware Semantic Rewards | Shweta Yadav, Deepak Gupta, Asma Ben Abacha and Dina Demner-Fushman | N/A | N/A |
| Embracing Ambiguity: Shifting the Training Target of NLI Models | Johannes Mario Meissner, Napat Thumwanit, Saku Sugawara and Akiko Aizawa | N/A | N/A |
| Machine Translation into Low-resource Language Varieties | Sachin Kumar, Antonios Anastasopoulos, Shuly Wintner and Yulia Tsvetkov | N/A | N/A |
| On Positivity Bias in Negative Reviews | Madhusudhan Aithal and Chenhao Tan | N/A | N/A |
| A Semantics-aware Transformer Model of Relation Linking for Knowledge Base Question Answering | Tahira Naseem, Srinivas Ravishankar, Nandana Mihindukulasooriya, Ibrahim Abdelaziz, Young-Suk Lee, Pavan Kapanipathi, Salim Roukos, Alfio Gliozzo and Alexander Gray | N/A | N/A |
| Three Sentences Are All You Need: Local Path Enhanced Document Relation Extraction | Quzhe Huang, Shengqi Zhu, Yansong Feng, Yuan Ye, Yuxuan Lai and Dongyan Zhao | N/A | N/A |
| More than Text: Multi-modal Chinese Word Segmentation | Dong Zhang, Zheng Hu, Shoushan Li, Hanqian Wu, Qiaoming Zhu and Guodong Zhou | N/A | N/A |
| Unsupervised Enrichment of Persona-grounded Dialog with Background Stories | Bodhisattwa Prasad Majumder, Taylor Berg-Kirkpatrick, Julian McAuley and Harsh Jhamtani | N/A | N/A |
| Exploring Listwise Evidence Reasoning with T5 for Fact Verification | Kelvin Jiang, Ronak Pradeep and Jimmy Lin | N/A | N/A |
| Demoting the Lead Bias in News Summarization via Alternating Adversarial Learning | Linzi Xing, Wen Xiao and Giuseppe Carenini | N/A | N/A |
| SimCLS: A Simple Framework for Contrastive Learning of Abstractive Summarization | Yixin Liu and Pengfei Liu | N/A | N/A |
| Avoiding Overlap in Data Augmentation for AMR-to-Text Generation | Wenchao Du and Jeffrey Flanigan | N/A | N/A |
| Quotation Recommendation and Interpretation Based on Transformation from Queries to Quotations | Lingzhi Wang, Xingshan Zeng and Kam-Fai Wong | N/A | N/A |
| Don’t Let Discourse Confine Your Model: Sequence Perturbations for Improved Event Language Models | Mahnaz Koupaee, Greg Durrett, Nathanael Chambers and Niranjan Balasubramanian | N/A | N/A |
| eMLM: A New Pre-training Objective for Emotion Related Tasks | Tiberiu Sosea and Cornelia Caragea | N/A | N/A |
| Discrete Cosine Transform as Universal Sentence Encoder | Nada Almarwani and Mona Diab | N/A | N/A |
| Semantic Frame Induction using Masked Word Embeddings and Two-Step Clustering | Kosuke Yamada, Ryohei Sasano and Koichi Takeda | N/A | N/A |
| A Cluster-based Approach for Improving Isotropy in Contextual Embedding Space | Sara Rajaee and Mohammad Taher Pilehvar | N/A | N/A |
| Neural Retrieval for Question Answering with Cross-Attention Supervised Data Augmentation | Yinfei Yang, Ning Jin, Kuo Lin, Mandy Guo and Daniel Cer | N/A | N/A |
| Happy Dance, Slow Clap: Using Reaction GIFs to Predict Induced Affect on Twitter | Boaz Shmueli, Soumya Ray and Lun-Wei Ku | N/A | N/A |
| Don’t Rule Out Monolingual Speakers: A Method For Crowdsourcing Machine Translation Data | Rajat Bhatnagar, Ananya Ganesh and Katharina Kann | N/A | N/A |
| Automatic Fake News Detection: Are Models Learning to Reason? | Casper Hansen, Christian Hansen and Lucas Chaves Lima | N/A | N/A |
| Addressing Semantic Drift in Generative Question Answering with Auxiliary Extraction | Chenliang Li, Bin Bi, Ming Yan, Wei Wang and Songfang Huang | N/A | N/A |
| Counterfactuals to Control Latent Disentangled Text Representations for Style Transfer | Sharmila Reddy Nangi, Niyati Chhaya, Sopan Khosla, Nikhil Kaushik and Harshit Nyati | N/A | N/A |
| UMIC: An Unreferenced Metric for Image Captioning via Contrastive Learning | Hwanhee Lee, Seunghyun Yoon, Franck Dernoncourt, Trung Bui and Kyomin Jung | N/A | N/A |
| Issues with Entailment-based Zero-shot Text Classification | Tingting Ma, Jin-Ge Yao, Chin-Yew Lin and Tiejun Zhao | N/A | N/A |
| Difficulty-Aware Machine Translation Evaluation | Runzhe Zhan, Xuebo Liu, Derek F. Wong and Lidia S. Chao | N/A | N/A |
| A Span-based Dynamic Local Attention Model for Sequential Sentence Classification | Xichen Shang, Qianli Ma, Zhenxi Lin, Jiangyue Yan and Zipeng Chen | N/A | N/A |
| X-Fact: A New Benchmark Dataset for Multilingual Fact Checking | Ashim Gupta and Vivek Srikumar | N/A | N/A |
| Neural-Symbolic Commonsense Reasoner with Relation Predictors | Farhad Moghimifar, Lizhen Qu, Terry Yue Zhuo, Gholamreza Haffari and Mahsa Baktashmotlagh | N/A | N/A |
| In Factuality: Efficient Integration of Relevant Facts for Visual Question Answering | Peter Vickers, Nikolaos Aletras, Emilio Monti and Loïc Barrault | N/A | N/A |
| N-Best ASR Transformer: Enhancing SLU Performance using Multiple ASR Hypotheses | Karthik Ganesan, Pakhi Bamdev, Jaivarsan B, Amresh Venugopal and Abhinav Tushar | N/A | N/A |
| Explainable Inference Over Grounding-Abstract Chains for Science Questions | Mokanarangan Thayaparan, Marco Valentino and Andre Freitas | N/A | N/A |
| LV-BERT: Exploiting Layer Variety for BERT | Weihao Yu, Zihang Jiang, Fei Chen, Qibin Hou and Jiashi Feng | N/A | N/A |
| Few-Shot Event Detection with Prototypical Amortized Conditional Random Field | Xin Cong, Shiyao Cui, Bowen Yu, Tingwen Liu, Wang Yubin and Bin Wang | N/A | N/A |
| LUX (Linguistic aspects Under eXamination): Discourse Analysis for Automatic Fake News Classification | Lucas Azevedo, Mathieu d’Aquin, Brian Davis and Manel Zarrouk | N/A | N/A |
| Semantic Relation-aware Difference Representation Learning for Change Captioning | Yunbin Tu, Tingting Yao, Liang Li, Jiedong Lou, Shengxiang Gao, Zhengtao Yu and Chenggang Yan | N/A | N/A |
| The Authors Matter: Understanding and Mitigating Implicit Bias in Deep Text Classification | Haochen Liu, Wei Jin, Hamid Karimi, Zitao Liu and Jiliang Tang | N/A | N/A |
| From What to Why: Improving Relation Extraction with Rationale Graph | Zhenyu Zhang, Bowen Yu, Xiaobo Shu, Xue Mengge, Tingwen Liu and Li Guo | N/A | N/A |
| SyGNS: A Systematic Generalization Testbed Based on Natural Language Semantics | Hitomi Yanaka, Koji Mineshima and Kentaro Inui | N/A | N/A |
| Fully Non-autoregressive Neural Machine Translation: Tricks of the Trade | Jiatao Gu and Xiang Kong | N/A | N/A |
| Generate, Prune, Select: A Pipeline for Counterspeech Generation against Online Hate Speech | Wanzheng Zhu and Suma Bhat | N/A | N/A |
| REPT: Bridging Language Models and Machine Reading Comprehension via Retrieval-Based Pre-training | Fangkai Jiao, Yangyang Guo, Yilin Niu, Feng Ji, Feng-Lin Li and Liqiang Nie | N/A | N/A |
| CasEE: A Joint Learning Framework with Cascade Decoding for Overlapping Event Extraction | Jiawei Sheng, Shu Guo, Bowen Yu, Qian Li, Yiming Hei, Lihong Wang, Tingwen Liu and Hongbo Xu | N/A | N/A |
| Discovering Topics in Long-tailed Corpora with Causal Intervention | Xiaobao Wu, Chunping Li and Yishu Miao | N/A | N/A |
| WikiTableT: A Large-Scale Data-to-Text Dataset for Generating Wikipedia Article Sections | Mingda Chen, Sam Wiseman and Kevin Gimpel | N/A | N/A |
| Deep Cognitive Reasoning Network for Multi-hop Question Answering over Knowledge Graphs | Jianyu Cai, Zhanqiu Zhang, Feng Wu and Jie Wang | N/A | N/A |
| GoG: Relation-aware Graph-over-Graph Network for Visual Dialog | Feilong Chen, Xiuyi Chen, Fandong Meng, Peng Li and Jie Zhou | N/A | N/A |
| Joint Optimization of Tokenization and Downstream Model | Tatsuya Hiraoka, Sho Takase, Kei Uchiumi, Atsushi Keyaki and Naoaki Okazaki | N/A | N/A |
| How does Attention Affect the Model? | Cheng Zhang, Qiuchi Li, Lingyu Hua and Dawei Song | N/A | N/A |
| Contrastive Attention for Automatic Chest X-ray Report Generation | Fenglin Liu, Changchang Yin, Xian Wu, Shen Ge, Ping Zhang and Xu Sun | N/A | N/A |
| O2NA: An Object-Oriented Non-Autoregressive Approach for Controllable Video Captioning | Fenglin Liu, Xuancheng Ren, Xian Wu, Bang Yang, Shen Ge and Xu Sun | N/A | N/A |
| Enhancing Transformers with Gradient Boosted Decision Trees for NLI Fine-Tuning | Benjamin Minixhofer, Milan Gritta and Ignacio Iacobacci | N/A | N/A |
| Empirical Error Modeling Improves Robustness of Noisy Neural Sequence Labeling | Marcin Namysl, Sven Behnke and Joachim Kohler | N/A | N/A |
| Spatial Dependency Parsing for Semi-Structured Document Information Extraction | Wonseok Hwang, Jinyeong Yim, Seunghyun Park, Sohee Yang and Minjoon Seo | N/A | N/A |
| Entity-Aware Abstractive Multi-Document Summarization | Hao Zhou, Weidong Ren, Gongshen Liu, Bo Su and Wei Lu | N/A | N/A |
| XeroAlign: Zero-shot cross-lingual transformer alignment | Milan Gritta and Ignacio Iacobacci | N/A | N/A |
| Link Prediction on N-ary Relational Facts: A Graph-based Approach | Quan Wang, Haifeng Wang, Yajuan Lyu and Yong Zhu | N/A | N/A |
| GLGE: A New General Language Generation Evaluation Benchmark | Dayiheng Liu, Yu Yan, Yeyun Gong, Weizhen Qi, Hang Zhang, Jian Jiao, Weizhu Chen, Jie Fu, Linjun Shou, Ming Gong, Pengcheng Wang, Jiusheng Chen, Daxin Jiang, Jiancheng Lv, Ruofei Zhang, Winnie Wu, Ming Zhou and Nan Duan | N/A | N/A |
| AMBERT: A Pre-trained Language Model with Multi-Grained Tokenization | Xinsong Zhang, Pengshuai Li and Hang Li | N/A | N/A |
| Multimodal Incremental Transformer with Visual Grounding for Visual Dialogue Generation | Feilong Chen, Fandong Meng, Xiuyi Chen, Peng Li and Jie Zhou | N/A | N/A |
| Retrieve & Memorize: Dialog Policy Learning with Multi-Action Memory | YunHao Li, Yunyi Yang, Xiaojun Quan and Jianxing Yu | N/A | N/A |
| Adapt-and-Distill: Developing Small, Fast and Effective Pretrained Language Models for Domains | Yunzhi Yao, Shaohan Huang, Wenhui Wang, Li Dong and Furu Wei | N/A | N/A |
| DNN-driven Gradual Machine Learning for Aspect-term Sentiment Analysis | Murtadha Ahmed, Qun Chen, Yanyan Wang, Youcef Nafa, Zhanhuai Li and Tianyi Duan | N/A | N/A |
| OutFlip: Generating Examples for Unknown Intent Detection with Natural Language Attack | DongHyun Choi, Myeong Cheol Shin, EungGyun Kim and Dong Ryeol Shin | N/A | N/A |
| GeoQA: A Geometric Question Answering Benchmark Towards Multimodal Numerical Reasoning | Jiaqi Chen, Jianheng Tang, Jinghui Qin, Xiaodan Liang, Lingbo Liu, Eric Xing and Liang Lin | N/A | N/A |
| SIRE: Separate Intra- and Inter-sentential Reasoning for Document-level Relation Extraction | Shuang Zeng, Yuting Wu and Baobao Chang | N/A | N/A |
| KGPool: Dynamic Knowledge Graph Context Selection for Relation Extraction | Abhishek Nadgeri, Anson Bastos, Kuldeep Singh, Isaiah Onando Mulang, Johannes Hoffart, Saeedeh Shekarpour and Vijay Saraswat | N/A | N/A |
| Better Combine Them Together! Integrating Syntactic Constituency and Dependency Representations for Semantic Role Labeling | Hao Fei, Shengqiong Wu, Yafeng Ren, Fei Li and Donghong Ji | N/A | N/A |
| Keep the Primary, Rewrite the Secondary: A Two-Stage Approach for Paraphrase Generation | Yixuan Su, David Vandyke, Simon Baker, Yan Wang and Nigel Collier | N/A | N/A |
| Contrastive Fine-tuning Improves Robustness for Neural Rankers | Xiaofei Ma, Cicero Nogueira Dos Santos and Andrew O. Arnold | N/A | N/A |
| Cross-Lingual Transfer in Zero-Shot Cross-Language Entity Linking | Elliot Schumacher, James Mayfield and Mark Dredze | N/A | N/A |
| TellMeWhy: A Dataset for Answering Why-Questions in Narratives | Yash Kumar Lal, Nathanael Chambers, Raymond Mooney and Niranjan Balasubramanian | N/A | N/A |
| Dialogue in the Wild: Learning from a Deployed Role-Playing Game with Humans and Bots | Kurt Shuster, Jack Urbanek, Emily Dinan, Arthur Szlam and Jason Weston | N/A | N/A |
| Deep Learning against COVID-19: Respiratory Insufficiency Detection in Brazilian Portuguese Speech | Edresson Casanova, Lucas Gris, Augusto Camargo, Daniel Da Silva, Murilo Gazzola, Ester Sabino, Anna Levin, Arnaldo Candido Jr, Sandra Aluisio and Marcelo Finger | N/A | N/A |
| A Dialogue-based Information Extraction System for Medical Insurance Assessment | Shuang Peng, Mengdi Zhou, Minghui Yang, Haitao Mi, Shaosheng Cao, Zujie Wen, Teng Xu, Hongbin Wang and Lei Liu | N/A | N/A |
| Prediction or Comparison: Toward Interpretable Qualitative Reasoning | Mucheng Ren, Heyan Huang and Yang Gao | N/A | N/A |
| On Commonsense Cues in BERT for Solving Commonsense Tasks | Leyang Cui, Sijie Cheng, Yu Wu and Yue Zhang | N/A | N/A |
| Weakly Supervised Pre-Training for Multi-Hop Retriever | Yeon Seonwoo, Sang-Woo Lee, Ji-Hoon Kim, Jung-Woo Ha and Alice Oh | N/A | N/A |
| Meet The Truth: Leverage Objective Facts and Subjective Views for Interpretable Rumor Detection | Jiawen Li, Shiwen Ni and Hung-Yu Kao | N/A | N/A |
| Read, Listen, and See: Leveraging Multimodal Information Helps Chinese Spell Checking | Heng-Da Xu, Zhongli Li, Qingyu Zhou, Chao Li, Zizhen Wang, Yunbo Cao, Heyan Huang and Xian-Ling Mao | N/A | N/A |
| TransSum: Translating Aspect and Sentiment Embeddings for Self-Supervised Opinion Summarization | Ke Wang and Xiaojun Wan | N/A | N/A |
| Hashing based Efficient Inference for Image-Text Matching | Rong-Cheng Tu, Lei Ji, Huaishao Luo, Botian Shi, Heyan Huang, Nan Duan and Xian-Ling Mao | N/A | N/A |
| Rationalization through Concepts | Diego Antognini and Boi Faltings | N/A | N/A |
| Parallel Attention Network with Sequence Matching for Video Grounding | Hao Zhang, Aixin Sun, Wei Jing, Liangli Zhen, Joey Tianyi Zhou and Siow Mong Rick Goh | N/A | N/A |
| MusicBERT: Symbolic Music Understanding with Large-Scale Pre-Training | Mingliang Zeng, Xu Tan, Rui Wang, Zeqian Ju, Tao Qin and Tie-Yan Liu | N/A | N/A |
| CoMAE: A Multi-factor Hierarchical Framework for Empathetic Response Generation | Chujie Zheng, Yong Liu, Wei Chen, Yongcai Leng and Minlie Huang | N/A | N/A |
| UniKeyphrase: A Unified Extraction and Generation Framework for Keyphrase Prediction | Huanqin Wu, Wei Liu, Lei Li, Dan Nie, Tao Chen, Feng Zhang and Di Wang | N/A | N/A |
| As Good as New. How to Successfully Recycle English GPT-2 to Make Models for Other Languages | Wietse De Vries and Malvina Nissim | N/A | N/A |
| Can Cognate Prediction Be Modelled as a Low-Resource Machine Translation Task? | Clementine Fourrier, Rachel Bawden and Benoit Sagot | N/A | N/A |
| What if This Modified That? Syntactic Interventions with Counterfactual Embeddings | Mycal Tucker, Peng Qian and Roger Levy | N/A | N/A |
| COM2SENSE: A Commonsense Reasoning Benchmark with Complementary Sentences | Shikhar Singh, Nuan Wen, Yu Hou, Pegah Alipoormolabashi, Te-lin Wu, Xuezhe Ma and Nanyun Peng | N/A | N/A |
| Towards Knowledge-Grounded Counter Narrative Generation for Hate Speech | Yi-Ling Chung, Serra Sinem Tekiroğlu and Marco Guerini | N/A | N/A |
| SOLID: A Large-Scale Semi-Supervised Dataset for Offensive Language Identification | Sara Rosenthal, Pepa Atanasova, Georgi Karadzhov, Marcos Zampieri and Preslav Nakov | N/A | N/A |
| RealFormer: Transformer Likes Residual Attention | Ruining He, Anirudh Ravula, Bhargav Kanagal and Joshua Ainslie | N/A | N/A |
| Promoting Graph Awareness in Linearized Graph-to-Text Generation | Alexander Miserlis Hoyle, Ana Marasović and Noah A. Smith | N/A | N/A |
| Predicting cross-linguistic adjective order with information gain | William Dyer, Richard Futrell, Zoey Liu and Greg Scontras | N/A | N/A |
| A Survey of Data Augmentation Approaches for NLP | Steven Feng, Varun Gangal, Jason Wei, Sarath Chandar, Soroush Vosoughi, Teruko Mitamura and Eduard Hovy | N/A | N/A |
| Why Machine Reading Comprehension Models Learn Shortcuts? | Yuxuan Lai, Chen Zhang, Yansong Feng, Quzhe Huang and Dongyan Zhao | N/A | N/A |
| Handling Cross- and Out-of-Domain Samples in Thai Word Segmentation | Peerat Limkonchotiwat, Wannaphong Phatthiyaphaibun, Raheem Sarwar, Ekapol Chuangsuwanich and Sarana Nutanong | N/A | N/A |
| Sensei: Self-Supervised Sensor Name Segmentation | Jiaman Wu, Dezhi Hong, Rajesh Gupta and Jingbo Shang | N/A | N/A |
| Medical Code Assignment with Gated Convolution and Note-Code Interaction | Shaoxiong Ji, Shirui Pan and Pekka Marttinen | N/A | N/A |
| Dynamic Semantic Graph Construction and Reasoning for Explainable Multi-hop Science Question Answering | Weiwen Xu, Huihui Zhang, Deng Cai and Wai Lam | N/A | N/A |
| Addressing Inquiries about History: An Efficient and Practical Framework for Evaluating Open-domain Chatbot Consistency | Zekang Li, Jinchao Zhang, Zhengcong Fei, Yang Feng and Jie Zhou | N/A | N/A |
| Code Summarization with Structure-induced Transformer | Hongqiu Wu, Hai Zhao and Min Zhang | N/A | N/A |
| Scheduled Dialog Policy Learning: An Automatic Curriculum Learning Framework for Task-oriented Dialog System | Sihong Liu, Jinchao Zhang, Keqing He, Weiran Xu and Jie Zhou | N/A | N/A |
| Do Explanations Help Users Detect Errors in Open-Domain QA? An Evaluation of Spoken vs. Visual Explanations | Ana Valeria Gonzalez, Gagan Bansal, Angela Fan, Yashar Mehdad, Robin Jia and Srinivasan Iyer | N/A | N/A |
| OntoEA: Ontology-guided Entity Alignment via Joint Knowledge Graph Embedding | Yuejia Xiang, Ziheng Zhang, Jiaoyan Chen, Xi Chen, Zhenxi Lin and Yefeng Zheng | N/A | N/A |
| Learning Algebraic Recombination for Compositional Generalization | Chenyao Liu, Shengnan An, Zeqi Lin, Qian Liu, Bei Chen, Jian-Guang Lou, Lijie Wen, Nanning Zheng and Dongmei Zhang | N/A | N/A |
| Out of Order: How important is the sequential order of words in a sentence in Natural Language Understanding tasks? | Thang Pham, Trung Bui, Long Mai and Anh Nguyen | N/A | N/A |
| RevCore: Review-Augmented Conversational Recommendation | Yu Lu, Junwei Bao, Yan Song, Zichen Ma, Shuguang Cui, Youzheng Wu and Xiaodong He | N/A | N/A |
| Awakening Latent Grounding from Pretrained Language Models for Semantic Parsing | Qian Liu, Dejian Yang, Jiahui Zhang, Jiaqi Guo, Bin Zhou and Jian-Guang Lou | N/A | N/A |
| Enhancing Label Correlation Feedback in Multi-Label Text Classification via Multi-Task Learning | Ximing Zhang, Qian-Wen Zhang, Zhao Yan, Ruifang Liu and Yunbo Cao | N/A | N/A |
| Unsupervised Energy-based Adversarial Domain Adaptation for Cross-domain Text Classification | Han Zou, Jianfei Yang and Xiaojian Wu | N/A | N/A |
| Survival text regression for time-to-event prediction in conversations | Christine De Kock and Andreas Vlachos | N/A | N/A |
| Unsupervised Knowledge Selection for Dialogue Generation | Xiuyi Chen, Feilong Chen, Fandong Meng, Peng Li and Jie Zhou | N/A | N/A |
| Minimax and Neyman-Pearson Meta-Learning for Outlier Languages | Edoardo Maria Ponti, Rahul Aralikatte, Disha Shrivastava, Siva Reddy and Anders Sogaard | N/A | N/A |
| On-the-Fly Attention Modulation for Neural Generation | Yue Dong, Chandra Bhagavatula, Ximing Lu, Jena D. Hwang, Antoine Bosselut, Jackie Chi Kit Cheung and Yejin Choi | N/A | N/A |
| Enhanced Metaphor Detection via Incorporation of External Knowledge Based on Linguistic Theories | Chang Su, Kechun Wu and Yijiang Chen | N/A | N/A |
| Controlling Text Edition by Changing Answers of Specific Questions | Lei Sha, Patrick Hohenecker and Thomas Lukasiewicz | N/A | N/A |
| Manual Evaluation Matters: Reviewing Test Protocols of Distantly Supervised Relation Extraction | Tianyu Gao, Xu Han, Yuzhuo Bai, Keyue Qiu, Zhiyu Xie, Yankai Lin, Zhiyuan Liu, Peng Li, Maosong Sun and Jie Zhou | N/A | N/A |
| GCRC: A New Challenging MRC Dataset from Gaokao Chinese for Explainable Evaluation | Hongye Tan, Xiaoyue Wang, Yu Ji, Ru Li, Xiaoli Li, Zhiwei Hu, Yunxiao Zhao and Xiaoqi Han | N/A | N/A |
| Zero-shot Label-Aware Event Trigger and Argument Classification | Hongming Zhang, Haoyu Wang and Dan Roth | N/A | N/A |
| Incorporating Global Information in Local Attention for Knowledge Representation Learning | Yu Zhao, Han Zhou, Ruobing Xie, Fuzhen Zhuang, Qing Li and Ji Liu | N/A | N/A |
| MRN: A Locally and Globally Mention-Based Reasoning Network for Document-Level Relation Extraction | Jingye Li, Kang Xu, Fei Li, Hao Fei, Yafeng Ren and Donghong Ji | N/A | N/A |
| Adversary-Aware Rumor Detection | Yun-Zhu Song, Yi-Syuan Chen, Yi-Ting Chang, Shao-Yu Weng and Hong-Han Shuai | N/A | N/A |
| LICHEE: Improving Language Model Pre-training with Multi-grained Tokenization | Weidong Guo, Mingjun Zhao, Lusheng Zhang, Di Niu, Jinwen Luo, Zhenhua Liu, Zhenyang Li and Jianbo Tang | N/A | N/A |
| Detecting Hallucinated Content in Conditional Neural Sequence Generation | Chunting Zhou, Graham Neubig, Jiatao Gu, Mona Diab, Francisco Guzman, Luke Zettlemoyer and Marjan Ghazvininejad | N/A | N/A |
| K-Adapter: Infusing Knowledge into Pre-Trained Models with Adapters | Ruize Wang, Duyu Tang, Nan Duan, Zhongyu Wei, Xuanjing Huang, Jianshu Ji, Guihong Cao, Daxin Jiang and Ming Zhou | N/A | N/A |
| Global Attention Decoder for Chinese Spelling Error Correction | Zhao Guo, Yuan Ni, Keqiang Wang, Wei Zhu and Guotong Xie | N/A | N/A |
| Exploring the Role of Context in Utterance-level Emotion, Act and Intent Classification in Conversations: An Empirical Study | Deepanway Ghosal, Navonil Majumder, Rada Mihalcea and Soujanya Poria | N/A | N/A |
| Putting words into the system's mouth: A targeted attack on neural machine translation using monolingual data poisoning | Jun Wang, Chang Xu, Francisco Guzman, Ahmed El-Kishky, Yuqing Tang, Benjamin Rubinstein and Trevor Cohn | N/A | N/A |
| Semantic and Syntactic Enhanced Aspect Sentiment Triplet Extraction | Zhexue Chen, Hong Huang, Bang Liu, Xuanhua Shi and Hai Jin | N/A | N/A |
| PsyQA: A Chinese Dataset for Generating Long Counseling Text for Mental Health Support | Hao Sun, Zhenru Lin, Chujie Zheng, Siyang Liu and Minlie Huang | N/A | N/A |
| RiddleSense: Reasoning about Riddle Questions Featuring Linguistic Creativity and Commonsense Knowledge | Bill Yuchen Lin, Ziyi Wu, Yichi Yang, Dong-Ho Lee and Xiang Ren | N/A | N/A |
| Learning to Generate Questions by Learning to Recover Answer-containing Sentences | Seohyun Back, Akhil Kedia, Sai Chetan Chinthakindi, Haejun Lee and Jaegul Choo | N/A | N/A |
| Making Better Use of Bilingual Information for Cross-Lingual AMR Parsing | Yitao Cai, Zhe Lin and Xiaojun Wan | N/A | N/A |
| Pushing Paraphrase Away from Original Sentence: A Multi-Round Paraphrase Generation Approach | Zhe Lin and Xiaojun Wan | N/A | N/A |
| Few-shot Knowledge Graph-to-Text Generation with Pretrained Language Models | Junyi Li, Tianyi Tang, Wayne Xin Zhao, Zhicheng Wei, Nicholas Jing Yuan and Ji-Rong Wen | N/A | N/A |
| NAST: A Non-Autoregressive Generator with Word Alignment for Unsupervised Text Style Transfer | Fei Huang, Zikai Chen, Chen Henry Wu, Qihan Guo, Xiaoyan Zhu and Minlie Huang | N/A | N/A |
| HyKnow: End-to-End Task-Oriented Dialog Modeling with Hybrid Knowledge Management | Silin Gao, Ryuichi Takanobu, Wei Peng, Qun Liu and Minlie Huang | N/A | N/A |
| Target-oriented Fine-tuning for Zero-Resource Named Entity Recognition | Ying Zhang, Fandong Meng, Yufeng Chen, Jinan Xu and Jie Zhou | N/A | N/A |
| BERT-Defense: A Probabilistic Model Based on BERT to Combat Cognitively Inspired Orthographic Adversarial Attacks | Yannik Keller, Jan Mackensen and Steffen Eger | N/A | N/A |
| Event Detection as Graph Parsing | Jianye Xie, Haotong Sun, Junsheng Zhou, Weiguang Qu and Xinyu Dai | N/A | N/A |
| Toward Fully Exploiting Heterogeneous Corpus: A Decoupled Named Entity Recognition Model with Two-stage Training | Yun Hu, Yeshuang Zhu, Jinchao Zhang, Changwen Zheng and Jie Zhou | N/A | N/A |
| Discriminative Reasoning for Document-level Relation Extraction | Wang Xu, Kehai Chen and Tiejun Zhao | N/A | N/A |
| Meta-Learning Adversarial Domain Adaptation Network for Few-Shot Text Classification | Chengcheng Han, Zeqiu Fan, Dongxiang Zhang, Minghui Qiu, Ming Gao and Aoying Zhou | N/A | N/A |
| Documents Representation via Generalized Coupled Tensor Chain with the Rotation Group constraint | Igor Vorona, Anh-Huy Phan, Alexander Panchenko and Andrzej Cichocki | N/A | N/A |
| Improving Unsupervised Extractive Summarization with Facet-Aware Modeling | Xinnian Liang, Shuangzhi Wu, Mu Li and Zhoujun Li | N/A | N/A |
| Improving Gradient-based Adversarial Training for Text Classification by Contrastive Learning and Auto-Encoder | Yao Qiu, Jinchao Zhang and Jie Zhou | N/A | N/A |
| Multi-Granularity Contrasting for Cross-Lingual Pre-Training | Shicheng Li, Pengcheng Yang, Fuli Luo and Jun Xie | N/A | N/A |
| A Comparison between Pre-training and Large-scale Back-translation for Neural Machine Translation | Dandan Huang, Kun Wang and Yue Zhang | N/A | N/A |
| Bi-Granularity Contrastive Learning for Post-Training in Few-Shot Scene | Ruikun Luo, Guanhuan Huang and Xiaojun Quan | N/A | N/A |
| KACC: A Multi-task Benchmark for Knowledge Abstraction, Concretization and Completion | Jie Zhou, Shengding Hu, Xin Lv, Cheng Yang, Zhiyuan Liu, Wei Xu, Jie Jiang, Juanzi Li and Maosong Sun | N/A | N/A |
| A Query-Driven Topic Model | Zheng Fang, Yulan He and Rob Procter | N/A | N/A |
| Gaussian Process based Deep Dyna-Q approach for Dialogue Policy Learning | Guanlin Wu, Wenqi Fang, Ji Wang, Jiang Cao, Weidong Bao, Yang Ping, Xiaomin Zhu and Zheng Wang | N/A | N/A |
| CiteWorth: Cite-Worthiness Detection for Improved Scientific Document Understanding | Dustin Wright and Isabelle Augenstein | N/A | N/A |
| Counter-Argument Generation by Attacking Weak Premises | Milad Alshomary, Shahbaz Syed, Arkajit Dhar, Martin Potthast and Henning Wachsmuth | N/A | N/A |
| Template-Based Named Entity Recognition Using BART | Leyang Cui, Yu Wu, Jian Liu, Sen Yang and Yue Zhang | N/A | N/A |
| "Does it Matter When I Think You Are Lying?" Improving Deception Detection by Integrating Interlocutor's Judgements in Conversations | Huang-Cheng Chou, Woan-Shiuan Chien, Da-Cheng Juan and Chi-Chun Lee | N/A | N/A |
| High-Quality Dialogue Diversification by Intermittent Short Extension Ensembles | Zhiwen Tang, Hrishikesh Kulkarni and Grace Hui Yang | N/A | N/A |
| Structured Refinement for Sequential Labeling | Yiran Wang, Hiroyuki Shindo, Yuji Matsumoto and Taro Watanabe | N/A | N/A |
| End-to-End Construction of NLP Knowledge Graph | Ishani Mondal, Yufang Hou and Charles Jochim | N/A | N/A |
| Deciphering Implicit Hate: Evaluating Automated Detection Algorithms for Multimodal Hate | Austin Botelho, Scott Hale and Bertie Vidgen | N/A | N/A |
| Studying the Evolution of Scientific Topics and their Relationships | Ana Sabina Uban, Cornelia Caragea and Liviu P. Dinu | N/A | N/A |
| A Mixed-Method Design Approach for Empirically Based Selection of Unbiased Data Annotators | Gautam Thakur, Janna Caspersen, Drahomira Herrmannova, Bryan Eaton and Jordan Burdette | N/A | N/A |
| An Evaluation of Disentangled Representation Learning for Texts | Krishnapriya Vishnubhotla, Graeme Hirst and Frank Rudzicz | N/A | N/A |
| Knowing More About Questions Can Help: Improving Calibration in Question Answering | Shujian Zhang, Chengyue Gong and Eunsol Choi | N/A | N/A |
| Enhancing Metaphor Detection by Gloss-based Interpretations | Hai Wan, Jinxia Lin, Jianfeng Du, Dawei Shen and Manrong Zhang | N/A | N/A |
| Evaluating Word Embeddings with Categorical Modularity | Silvia Casacuberta, Karina Halevy and Damian Blasi | N/A | N/A |
| Attention-based Contextual Language Model Adaptation for Speech Recognition | Richard Diehl Martinez, Scott Novotney, Ivan Bulyko, Ariya Rastrow, Andreas Stolcke and Ankur Gandhe | N/A | N/A |
| Exploring Cross-Lingual Transfer Learning with Unsupervised Machine Translation | Chao Wang, Judith Gaspers, Thi Ngoc Quynh Do and Hui Jiang | N/A | N/A |
| Pipeline Signed Japanese Translation Focusing on a Post-positional Particle Complement and Conjugation in a Low-resource Setting | Ken Yano and Akira Utsumi | N/A | N/A |
| Language-Mediated, Object-Centric Representation Learning | Ruocheng Wang, Jiayuan Mao, Samuel Gershman and Jiajun Wu | N/A | N/A |
| Entheos: A Multimodal Dataset for Studying Enthusiasm | Carla Viegas and Malihe Alikhani | N/A | N/A |
| Are Rotten Apples Edible? Challenging Commonsense Inference Ability with Exceptions | Nam Do and Ellie Pavlick | N/A | N/A |
| GRICE: A Grammar-based Dataset for Recovering Implicature and Conversational rEasoning | Zilong Zheng, Shuwen Qiu, Lifeng Fan, Yixin Zhu and Song-Chun Zhu | N/A | N/A |
| Automatic Document Sketching: Generating Drafts from Analogous Texts | Zeqiu Wu, Michel Galley, Chris Brockett, Yizhe Zhang and Bill Dolan | N/A | N/A |
| Trade the Event: Corporate Events Detection for News-Based Event-Driven Trading | Zhihan Zhou, Liqian Ma and Han Liu | N/A | N/A |
| Language-based General Action Template for Reinforcement Learning Agents | Ryosuke Kohita, Akifumi Wachi, Daiki Kimura, Subhajit Chaudhury, Michiaki Tatsubori and Asim Munawar | N/A | N/A |
| MiniLMv2: Multi-Head Self-Attention Relation Distillation for Compressing Pretrained Transformers | Wenhui Wang, Hangbo Bao, Shaohan Huang, Li Dong and Furu Wei | N/A | N/A |
| Attending via both Fine-tuning and Compressing | Jie Zhou, Yuanbin Wu, Qin Chen, Xuanjing Huang and Liang He | N/A | N/A |
| Improving Event Causality Identification via Self-Supervised Representation Learning on External Causal Statement | Xinyu Zuo, Pengfei Cao, Yubo Chen, Kang Liu, Jun Zhao, Weihua Peng and Yuguang Chen | N/A | N/A |
| PAIR: Leveraging Passage-Centric Similarity Relation for Improving Dense Passage Retrieval | Ruiyang Ren, Shangwen Lv, Yingqi Qu, Jing Liu, Wayne Xin Zhao, QiaoQiao She, Hua Wu, Haifeng Wang and Ji-Rong Wen | N/A | N/A |
| Neural Combinatory Constituency Parsing | Zhousi Chen, Longtu Zhang, Aizhan Imankulova and Mamoru Komachi | N/A | N/A |
| Learning Shared Semantic Space for Speech-to-Text Translation | Chi Han, Mingxuan Wang, Heng Ji and Lei Li | N/A | N/A |
| Empowering Language Understanding with Counterfactual Reasoning | Fuli Feng, Jizhi Zhang, Xiangnan He, Hanwang Zhang and Tat-Seng Chua | N/A | N/A |
| Knowledge-Empowered Representation Learning for Chinese Medical Reading Comprehension: Task, Model and Resources | Taolin Zhang, Chengyu Wang, Minghui Qiu, Bite Yang, Zerui Cai, Xiaofeng He and Jun Huang | N/A | N/A |
| Correcting Chinese Spelling Errors with Phonetic Pre-training | Ruiqing Zhang, Chao Pang, Chuanqiang Zhang, Shuohuan Wang, Zhongjun He, Yu Sun, Hua Wu and Haifeng Wang | N/A | N/A |
| Multi-Lingual Question Generation with Language Agnostic Language Model | Bingning Wang, Ting Yao, Weipeng Chen, Jingfang Xu and Xiaochuan Wang | N/A | N/A |
| On the Interplay Between Fine-tuning and Composition in Transformers | Lang Yu and Allyson Ettinger | N/A | N/A |
| Lifelong Learning of Topics and Domain-Specific Word Embeddings | Xiaorui Qin, Yuyin Lu, Yufu Chen and Yanghui Rao | N/A | N/A |
| Leveraging Argumentation Knowledge Graph for Interactive Argument Pair Identification | Jian Yuan, Zhongyu Wei, Donghua Zhao, Qi Zhang and Changjian Jiang | N/A | N/A |
| Confidence-Aware Scheduled Sampling for Neural Machine Translation | Yijin Liu, Fandong Meng, Yufeng Chen, Jinan Xu and Jie Zhou | N/A | N/A |
| A Closer Look into the Robustness of Neural Dependency Parsers Using Better Adversarial Examples | Yuxuan Wang, Wanxiang Che, Ivan Titov, Shay B. Cohen, Zhilin Lei and Ting Liu | N/A | N/A |
| P-Stance: A Large Dataset for Stance Detection in Political Domain | Yingjie Li, Tiberiu Sosea, Aditya Sawant, Ajith Jayaraman Nair, Diana Inkpen and Cornelia Caragea | N/A | N/A |
| WIND: Weighting Instances Differentially for Model-Agnostic Domain Adaptation | Xiang Chen, Yue Cao and Xiaojun Wan | N/A | N/A |
| DocOIE: A Document-level Context-Aware Dataset for OpenIE | Kuicai Dong, Zhao Yilin, Aixin Sun, Jung-Jae Kim and Xiaoli Li | N/A | N/A |
| CONDA: a CONtextual Dual-Annotated dataset for in-game toxicity understanding and detection | Henry Weld, Guanghao Huang, Jean Lee, Tongshu Zhang, Kunze Wang, Xinghong Guo, Siqu Long, Josiah Poon and Caren Han | N/A | N/A |
| Adaptive Knowledge-Enhanced Bayesian Meta-Learning for Few-shot Event Detection | Shirong Shen, Tongtong Wu, Guilin Qi, Yuan-Fang Li, Gholamreza Haffari and Sheng Bi | N/A | N/A |
| Dynamic Connected Networks for Chinese Spelling Check | Baoxin Wang, Wanxiang Che, Dayong Wu, Shijin Wang, Guoping Hu and Ting Liu | N/A | N/A |
| A Multi-Level Attention Model for Evidence-Based Fact Checking | Canasai Kruengkrai, Junichi Yamagishi and Xin Wang | N/A | N/A |
| RealTranS: End-to-End Simultaneous Speech Translation with Convolutional Weighted-Shrinking Transformer | Xingshan Zeng, Liangyou Li and Qun Liu | N/A | N/A |
| Training ELECTRA Augmented with Multi-word Selection | Jiaming Shen, Jialu Liu, Tianqi Liu, Cong Yu and Jiawei Han | N/A | N/A |
| REAM$\sharp$: An Enhancement Approach to Reference-based Evaluation Metrics for Open-domain Dialog Generation | Jun Gao, Wei Bi, Ruifeng Xu and Shuming Shi | N/A | N/A |
| Relation Extraction with Type-aware Map Memories of Word Dependencies | Guimin Chen, Yuanhe Tian, Yan Song and Xiang Wan | N/A | N/A |
| PLATO-2: Towards Building an Open-Domain Chatbot via Curriculum Learning | Siqi Bao, Huang He, Fan Wang, Hua Wu, Haifeng Wang, Wenquan Wu, Zhen Guo, Zhibin Liu and Xinchao Xu | N/A | N/A |
| JointGT: Graph-Text Joint Representation Learning for Text Generation from Knowledge Graphs | Pei Ke, Haozhe Ji, Yu Ran, Xin Cui, Liwei Wang, Linfeng Song, Xiaoyan Zhu and Minlie Huang | N/A | N/A |
| OKGIT: Open Knowledge Graph Link Prediction with Implicit Types | Chandrahas and Partha Talukdar | N/A | N/A |
| Multimodal Fusion with Co-Attention Networks for Fake News Detection | Yang Wu, Pengwei Zhan, Yunjian Zhang, Liming Wang and Zhen Xu | N/A | N/A |
| Joint Multi-Decoder Framework with Hierarchical Pointer Network for Frame Semantic Parsing | Xudong Chen, Ce Zheng and Baobao Chang | N/A | N/A |
| H-FND: Hierarchical False-Negative Denoising for Distant Supervision Relation Extraction | Jhih-wei Chen, Tsu-Jui Fu, Chen-Kang Lee and Wei-Yun Ma | N/A | N/A |
| GEM: A General Evaluation Benchmark for Multimodal Tasks | Lin Su, Nan Duan, Edward Cui, Lei Ji, Chenfei Wu, Huaishao Luo, Yongfei Liu, Ming Zhong, Taroon Bharti and Arun Sacheti | N/A | N/A |
| Graph Relational Topic Model with Higher-order Graph Attention Auto-encoders | Qianqian Xie, Jimin Huang, Pan Du and Min Peng | N/A | N/A |
| Paths to Relation Extraction through Semantic Structure | Jonathan Yellin and Omri Abend | N/A | N/A |
| Dynamic and Multi-Channel Graph Convolutional Networks for Aspect-Based Sentiment Analysis | Shiguan Pang, Yun Xue, Zehao Yan, Weihao Huang and Jinhui Feng | N/A | N/A |
| Automatic Text Simplification for Social Good: Progress and Challenges | Sanja Stajner | N/A | N/A |
| Dialogue-oriented Pre-training | Yi Xu and Hai Zhao | N/A | N/A |
| GrantRel: Grant Information Extraction via Joint Entity and Relation Extraction | Junyi Bian, Li Huang, Xiaodi Huang, Hong Zhou and Shanfeng Zhu | N/A | N/A |
| Making Flexible Use of Subtasks: A Multiplex Interaction Network for Unified Aspect-based Sentiment Analysis | Guoxin Yu, Xiang Ao, Ling Luo, Min Yang, Xiaofei Sun, Jiwei Li and Qing He | N/A | N/A |
| Continual Mixed-Language Pre-Training for Extremely Low-Resource Neural Machine Translation | Zihan Liu, Genta Indra Winata and Pascale Fung | N/A | N/A |
| Two Parents, One Child: Dual Transfer for Low-Resource Neural Machine Translation | Meng Zhang, Liangyou Li and Qun Liu | N/A | N/A |
| Contrastive Aligned Joint Learning for Multilingual Summarization | Danqing Wang, Jiaze Chen, Hao Zhou, Xipeng Qiu and Lei Li | N/A | N/A |
| When Time Makes Sense: A Historically-Aware Approach to Targeted Sense Disambiguation | Kaspar Beelen, Federico Nanni, Mariona Coll Ardanuy, Kasra Hosseini, Giorgia Tolfo and Barbara McGillivray | N/A | N/A |
| Understanding Feature Focus in Multitask Settings for Lexico-semantic Relation Identification | Houssam Akhmouch, Gael Dias and Jose G. Moreno | N/A | N/A |
| Don't Miss the Labels: Label-semantic Augmented Meta-Learner for Few-Shot Text Classification | Qiaoyang Luo, Lingqiao Liu, Yuhao Lin and Wei Zhang | N/A | N/A |
| Detecting Harmful Memes and Their Targets | Shraman Pramanick, Dimitar Dimitrov, Rituparna Mukherjee, Shivam Sharma, Md. Shad Akhtar, Preslav Nakov and Tanmoy Chakraborty | N/A | N/A |
| ZmBART: An Unsupervised Cross-lingual Transfer Framework for Language Generation | Kaushal Kumar Maurya, Maunendra Sankar Desarkar, Yoshinobu Kano and Kumari Deepshikha | N/A | N/A |
| HacRED: A Large-Scale Relation Extraction Dataset Toward Hard Cases in Practical Applications | Qiao Cheng, Juntao Liu, Xiaoye Qu, Jin Zhao, Jiaqing Liang, Zhefeng Wang, Baoxing Huai, Nicholas Jing Yuan and Yanghua Xiao | N/A | N/A |
| Learning Sequential and Structural Information for Source Code Summarization | YunSeok Choi, JinYeong Bak, CheolWon Na and Jee-Hyong Lee | N/A | N/A |
| Energy-based Unknown Intent Detection with Data Manipulation | Yawen Ouyang, Jiasheng Ye, Yu Chen, Xinyu Dai, Shujian Huang and Jiajun Chen | N/A | N/A |
| Automatic Rephrasing of Transcripts-based Action Items | Amir Cohen, Amir Kantor, Sagi Hilleli and Eyal Kolman | N/A | N/A |
| MergeDistill: Merging Language Models using Pre-trained Distillation | Simran Khanuja, Melvin Johnson and Partha Talukdar | N/A | N/A |
| On Sparsifying Encoder Outputs in Sequence-to-Sequence Models | Biao Zhang, Ivan Titov and Rico Sennrich | N/A | N/A |
| FrameNet-assisted Noun Compound Interpretation | Girishkumar Ponkiya, Diptesh Kanojia, Pushpak Bhattacharyya and Girish Palshikar | N/A | N/A |
| Hypernym Discovery via a Recurrent Mapping Model | Yuhang Bai, Richong Zhang, Fanshuang Kong, Junfan Chen and Yongyi Mao | N/A | N/A |
| On the Interaction of Belief Bias and Explanations | Ana Valeria Gonzalez, Anna Rogers and Anders Sogaard | N/A | N/A |
| Combining Static Word Embeddings and Contextual Representations for Bilingual Lexicon Induction | Jinpeng Zhang, Baijun Ji, Nini Xiao, Xiangyu Duan, Min Zhang, Yangbin Shi and Weihua Luo | N/A | N/A |
| Exploring Unsupervised Pretraining Objectives for Machine Translation | Christos Baziotis, Ivan Titov, Alexandra Birch and Barry Haddow | N/A | N/A |
| Knowledge-Grounded Dialogue Generation with Term-level De-noising | Wen Zheng, Natasa Milic-Frayling and Ke Zhou | N/A | N/A |
| Inspecting the concept knowledge graph encoded by modern language models | Carlos Aspillaga, Marcelo Mendoza and Alvaro Soto | N/A | N/A |
| Latent Reasoning for Low-Resource Question Generation | Xinting Huang, Jianzhong Qi, Yu Sun and Rui Zhang | N/A | N/A |
| Probing Pre-Trained Language Models for Disease Knowledge | Israa Alghanmi, Luis Espinosa Anke and Steven Schockaert | N/A | N/A |
| AugVic: Exploiting BiText Vicinity for Low-Resource NMT | Tasnim Mohiuddin, M Saiful Bari and Shafiq Joty | N/A | N/A |
| Provably Secure Generative Linguistic Steganography | Siyu Zhang, Zhongliang Yang, Jinshuai Yang and Yongfeng Huang | N/A | N/A |
| Decoupled Dialogue Modeling and Semantic Parsing for Multi-Turn Text-to-SQL | Zhi Chen, Lu Chen, Hanqi Li, Ruisheng Cao, Da Ma, Mengyue Wu and Kai Yu | N/A | N/A |
| Adjacency List Oriented Relational Fact Extraction via Adaptive Multi-task Learning | Fubang Zhao, Zhuoren Jiang, Yangyang Kang, Changlong Sun and Xiaozhong Liu | N/A | N/A |
| Self-Supervised Document Similarity Ranking via Contextualized Language Models and Hierarchical Inference | Dvir Ginzburg, Itzik Malkiel, Oren Barkan, Avi Caciularu and Noam Koenigstein | N/A | N/A |
| How Good Is NLP? A Sober Look at NLP Tasks through the Lens of Social Impact | Zhijing Jin, Geeticka Chauhan, Brian Tse, Mrinmaya Sachan and Rada Mihalcea | N/A | N/A |
| IgSEG: Image-guided Story Ending Generation | Qingbao Huang, Chuan Huang, Linzhang Mo, Jielong Wei, Yi Cai, Ho-fung Leung and Qing Li | N/A | N/A |
| Probabilistic Graph Reasoning for Natural Proof Generation | Changzhi Sun, Xinbo Zhang, Jiangjie Chen, Chun Gan, Yuanbin Wu, Jiaze Chen, Hao Zhou and Lei Li | N/A | N/A |
| Dialogue Graph Modeling for Conversational Machine Reading | Siru Ouyang, Zhuosheng Zhang and Hai Zhao | N/A | N/A |
| IndoCollex: A Testbed for Morphological Transformation of Indonesian Word Colloquialism | Haryo Akbarianto Wibowo, Made Nindyatama Nityasya, Afra Feyza Akyurek, Suci Fitriany, Alham Fikri Aji, Radityo Eko Prasojo and Derry Tanti Wijaya | N/A | N/A |
| Effective Cascade Dual-Decoder Model for Joint Entity and Relation Extraction | Lianbo Ma, Huimin Ren, Zhiwei Lin and Xiliang Zhang | N/A | N/A |
| Learning to Bridge Metric Spaces: Few-shot Joint Learning of Intent Detection and Slot Filling | Yutai Hou, Yongkui Lai, Cheng Chen, Wanxiang Che and Ting Liu | N/A | N/A |
| Insertion-based Tree Decoding | Denis Lukovnikov and Asja Fischer | N/A | N/A |
| Detecting Bot-Generated Text by Characterizing Linguistic Accommodation in Human-Bot Interactions | Paras Bhatt and Anthony Rios | N/A | N/A |
| Defending Pre-trained Language Models from Adversarial Word Substitution Without Performance Sacrifice | Rongzhou Bao, Jiayi Wang and Hai Zhao | N/A | N/A |
| BERT-Proof Syntactic Structures: Investigating Errors in Discontinuous Constituency Parsing | Maximin Coavoux | N/A | N/A |
| Hyperbolic Temporal Knowledge Graph Embeddings with Relational and Time Curvatures | Sebastien Montella, Lina M. Rojas Barahona and Johannes Heinecke | N/A | N/A |
| Disfl-QA: A Benchmark Dataset for Understanding Disfluencies in Question Answering | Aditya Gupta, Jiacheng Xu, Shyam Upadhyay, Diyi Yang and Manaal Faruqui | N/A | N/A |
| Does Robustness Improve Fairness? Approaching Fairness with Word Substitution Robustness Methods for Text Classification | Yada Pruksachatkun, Satyapriya Krishna, Jwala Dhamala, Rahul Gupta and Kai-Wei Chang | N/A | N/A |
| A Joint Model for Structure-based News Genre Classification with Application to Text Summarization | Zeyu Dai and Ruihong Huang | N/A | N/A |
| Representing Syntax and Composition with Geometric Transformations | Lorenzo Bertolini, Julie Weeds, David Weir and Qiwei Peng | N/A | N/A |
| To Point or Not to Point: Understanding How Abstractive Summarizers Paraphrase Text | Matt Wilber, William Timkey and Marten Van Schijndel | N/A | N/A |
| AgreeSum: Agreement-Oriented Multi-Document Summarization | Richard Yuanzhe Pang, Adam Lelkes, Vinh Tran and Cong Yu | N/A | N/A |
| BERT Busters: Outlier Dimensions that Disrupt Transformers | Olga Kovaleva, Saurabh Kulshreshtha, Anna Rogers and Anna Rumshisky | N/A | N/A |
| "We will Reduce Taxes" - Identifying Election Pledges with Language Models | Tommaso Fornaciari, Dirk Hovy, Elin Naurin, Julia Runeson, Robert Thomson and Pankaj Adhikari | N/A | N/A |
| WeaQA: Weak Supervision via Captions for Visual Question Answering | Pratyay Banerjee, Tejas Gokhale, Yezhou Yang and Chitta Baral | N/A | N/A |
| How well do you know your summarization datasets? | Priyam Tejaswin, Dhruv Naik and Pengfei Liu | N/A | N/A |
| Multilingual Translation from Denoising Pre-Training | Yuqing Tang, Chau Tran, Xian Li, Peng-Jen Chen, Naman Goyal, Vishrav Chaudhary, Jiatao Gu and Angela Fan | N/A | N/A |
| Annotations Matter: Leveraging Multi-task Learning to Parse UD and SUD | Zeeshan Ali Sayyed and Daniel Dakota | N/A | N/A |
| Generating Informative Conclusions for Argumentative Texts | Shahbaz Syed, Khalid Al Khatib, Milad Alshomary, Henning Wachsmuth and Martin Potthast | N/A | N/A |
| Substructure Substitution: Structured Data Augmentation for NLP | Haoyue Shi, Karen Livescu and Kevin Gimpel | N/A | N/A |
| Towards Protecting Vital Healthcare Programs by Extracting Actionable Knowledge from Policy | Vanessa Lopez, Nagesh Yadav, Gabriele Picco, Inge Vejsbjerg, Eoin Carrol, Seamus Brady, Marco Luca Sbodio, Lam Thanh Hoang, Miao Wei and John Segrave | N/A | N/A |
| Not Far Away, Not So Close: Sample Efficient Nearest Neighbour Data Augmentation via MiniMax | Ehsan Kamalloo, Mehdi Rezagholizadeh, Peyman Passban and Ali Ghodsi | N/A | N/A |
| It's All in the Heads: Using Attention Heads as a Baseline for Cross-Lingual Transfer in Commonsense Reasoning | Alexey Tikhonov and Max Ryabinin | N/A | N/A |
| Biomedical Interpretable Entity Representations | Diego Garcia-Olano, Yasumasa Onoe, Ioana Baldini, Joydeep Ghosh, Byron Wallace and Kush Varshney | N/A | N/A |
| Learning Robust Latent Representations for Controllable Speech Synthesis | Shakti Kumar, Jithin Pradeep and Hussain Zaidi | N/A | N/A |
| How to Split: the Effect of Word Segmentation on Gender Bias in Speech Translation | Marco Gaido, Beatrice Savoldi, Luisa Bentivogli, Matteo Negri and Marco Turchi | N/A | N/A |
| On the Ethical Limits of Natural Language Processing on Legal Text | Dimitrios Tsarapatsanis and Nikolaos Aletras | N/A | N/A |
| Transforming Term Extraction: Transformer-Based Approaches to Multilingual Term Extraction Across Domains | Christian Lang, Lennart Wachowiak, Barbara Heinisch and Dagmar Gromann | N/A | N/A |
| ProofWriter: Generating Implications, Proofs, and Abductive Statements over Natural Language | Oyvind Tafjord, Bhavana Dalvi and Peter Clark | N/A | N/A |
| Probing Image-Language Transformers for Verb Understanding | Lisa Anne Hendricks and Aida Nematzadeh | N/A | N/A |
| Implications of Using Internet Sting Corpora to Approximate Underage Victims | Tatiana Ringenberg, Kathryn Seigfried-Spellar and Julia Rayz | N/A | N/A |
| Detecting Domain Polarity-Changes of Words in a Sentiment Lexicon | Shuai Wang, Guangyi Lv, Sahisnu Mazumder and Bing Liu | N/A | N/A |
| Analyzing Online Political Advertisements | Danae Sanchez Villegas, Saeid Mokaram and Nikolaos Aletras | N/A | N/A |
| Probing Multi-modal Machine Translation with Pre-trained Language Model | Kong Yawei and Kai Fan | N/A | N/A |
| The interplay between language similarity and script on a novel multi-layer Algerian dialect corpus | Samia Touileb and Jeremy Barnes | N/A | N/A |
| A Non-Autoregressive Edit-Based Approach to Controllable Text Simplification | Sweta Agrawal, Weijia Xu and Marine Carpuat | N/A | N/A |
| Investigating Transfer Learning in Multilingual Pre-trained Language Models through Chinese Natural Language Inference | Hai Hu, He Zhou, Zuoyu Tian, Yiwen Zhang, Yina Patterson, Yanting Li, Yixin Nie and Kyle Richardson | N/A | N/A |
| Using surprisal and fMRI to map the neural bases of broad and local contextual prediction during natural language comprehension | Shohini Bhattasali and Philip Resnik | N/A | N/A |
| Assessing the Syntactic Capabilities of Transformer-based Multilingual Language Models | Laura Perez-Mayos, Alba Taboas Garcia, Simon Mille and Leo Wanner | N/A | N/A |
| Are Larger Pretrained Language Models Uniformly Better? Comparing Performance at the Instance Level | Ruiqi Zhong, Dhruba Ghosh, Dan Klein and Jacob Steinhardt | N/A | N/A |
| Named Entity Recognition through Deep Representation Learning and Weak Supervision | Jerrod Parker and Shi Yu | N/A | N/A |
| Explaining NLP Models via Minimal Contrastive Editing (MiCE) | Alexis Ross, Ana Marasović and Matthew Peters | N/A | N/A |
| Differential Privacy for Text Analytics via Natural Text Sanitization | Xiang Yue, Minxin Du, Tianhao Wang, Yaliang Li, Huan Sun and Sherman S. M. Chow | N/A | N/A |
| Synthesizing Adversarial Negative Responses for Robust Response Ranking and Evaluation | Prakhar Gupta, Yulia Tsvetkov and Jeffrey Bigham | N/A | N/A |
| Leveraging Abstract Meaning Representation for Knowledge Base Question Answering | Pavan Kapanipathi, Ibrahim Abdelaziz, Srinivas Ravishankar, Salim Roukos, Alexander Gray, Ramon Fernandez Astudillo, Maria Chang, Cristina Cornelio, Saswati Dana, Achille Fokoue, Dinesh Garg, Alfio Gliozzo, Sairam Gurajada, Hima Karanam, Naweed Khan, Dinesh Khandelwal, Young-Suk Lee, Yunyao Li, Francois Luus, Ndivhuwo Makondo, Nandana Mihindukulasooriya, Tahira Naseem, Sumit Neelam, Lucian Popa, Revanth Gangi Reddy, Ryan Riegel, Gaetano Rossiello, Udit Sharma, G P Shrivatsa Bhargav and Mo Yu | N/A | N/A |
| Perceptual Models of Machine-Edited Text | Elizabeth Merkhofer, Monica-Ann Mendoza, Rebecca Marvin and John Henderson | N/A | N/A |
| Scaling Within Document Coreference to Long Texts | Raghuveer Thirukovalluru, Nicholas Monath, Kumar Shridhar, Manzil Zaheer, Mrinmaya Sachan and Andrew McCallum | N/A | N/A |
| LEWIS: Levenshtein Editing for Unsupervised Text Style Transfer | Machel Reid and Victor Zhong | N/A | N/A |
| Constructing Flow Graphs from Procedural Cybersecurity Texts | Kuntal Kumar Pal, Kazuaki Kashihara, Pratyay Banerjee, Swaroop Mishra, Ruoyu Wang and Chitta Baral | N/A | N/A |
| Cluster-Former: Clustering-based Sparse Transformer for Question Answering | Shuohang Wang, Luowei Zhou, Zhe Gan, Yen-Chun Chen, Yuwei Fang, Siqi Sun, Yu Cheng and Jingjing Liu | N/A | N/A |
| Multi-Task Learning and Adapted Knowledge Models for Emotion-Cause Extraction | Elsbeth Turcan, Shuai Wang, Rishita Anubhai, Kasturi Bhattacharjee, Yaser Al-Onaizan and Smaranda Muresan | N/A | N/A |
| The Utility and Interplay of Gazetteers and Entity Segmentation for Named Entity Recognition in English | Oshin Agarwal and Ani Nenkova | N/A | N/A |
| On the Cost-Effectiveness of Stacking of Neural and Non-Neural Methods for Text Classification: Scenarios and Performance Prediction | Christian Gomes, Marcos Goncalves, Leonardo Rocha and Sergio Canuto | N/A | N/A |
| Unsupervised Domain Adaptation for Event Detection using Domain-specific Adapters | Nghia Ngo Trung, Duy Phung and Thien Huu Nguyen | N/A | N/A |
| Learning Contextualized Knowledge Structures for Commonsense Reasoning | Jun Yan, Mrigank Raman, Aaron Chan, Tianyu Zhang, Ryan Rossi, Handong Zhao, Sungchul Kim, Nedim Lipka and Xiang Ren | N/A | N/A |
| Analyzing Stereotypes in Generative Text Inference Tasks | Anna Sotnikova, Yang Trista Cao, Hal Daume III and Rachel Rudinger | N/A | N/A |
| HySPA: Hybrid Span Generation for Scalable Text-to-Graph Extraction | Liliang Ren, Chenkai Sun, Heng Ji and Julia Hockenmaier | N/A | N/A |
| Who Blames or Endorses Whom? Entity-to-Entity Directed Sentiment Extraction in News Text | Kunwoo Park, Zhufeng Pan and Jungseock Joo | N/A | N/A |
| A Formidable Ability: Detecting Adjectival Extremeness with DSMs | Farhan Samir, Barend Beekhuizen and Suzanne Stevenson | N/A | N/A |
| Compositionality of Complex Graphemes in the Undeciphered Proto-Elamite Script using Image and Text Embedding Models | Logan Born, Kathryn Kelley, M. Willis Monroe and Anoop Sarkar | N/A | N/A |
| Unsupervised Label Refinement Improves Dataless Text Classification | Zewei Chu, Karl Stratos and Kevin Gimpel | N/A | N/A |
| Prompting Contrastive Explanations for Commonsense Reasoning Tasks | Bhargavi Paranjape, Julian Michael, Marjan Ghazvininejad, Hannaneh Hajishirzi and Luke Zettlemoyer | N/A | N/A |
| SMS Spam Detection Through Skip-gram Embeddings and Shallow Networks | Gustavo Sousa, Daniel Carlos Guimaraes Pedronette, Joao Paulo Papa and Ivan Rizzo Guilherme | N/A | N/A |
| Hierarchical Task Learning from Language Instructions with Unified Transformers and Self-Monitoring | Yichi Zhang and Joyce Chai | N/A | N/A |
| Marked Attribute Bias in Natural Language Inference | Hillary Dawkins | N/A | N/A |
| VLM: Task-agnostic Video-Language Model Pre-training for Video Understanding | Hu Xu, Gargi Ghosh, Po-Yao Huang, Prahal Arora, Masoumeh Aminzadeh, Christoph Feichtenhofer, Florian Metze and Luke Zettlemoyer | N/A | N/A |
| Corpus-Level Evaluation for Event QA: The IndiaPoliceEvents Corpus Covering the 2002 Gujarat Violence | Andrew Halterman, Katherine Keith, Sheikh Sarwar and Brendan O'Connor | N/A | N/A |
| Memory-Efficient Differentiable Transformer Architecture Search | Yuekai Zhao, Li Dong, Yelong Shen, Zhihua Zhang, Furu Wei and Weizhu Chen | N/A | N/A |
| On the Copying Behaviors of Pre-Training for Neural Machine Translation | Xuebo Liu, Longyue Wang, Derek F. Wong, Liang Ding, Lidia S. Chao, Shuming Shi and Zhaopeng Tu | N/A | N/A |
| Grounding 'Grounding' in NLP | Khyathi Raghavi Chandu, Yonatan Bisk and Alan W Black | N/A | N/A |
| MLMLM: Link Prediction with Mean Likelihood Masked Language Model | Louis Clouatre, Philippe Trempe, Amal Zouaq and Sarath Chandar | N/A | N/A |
| Effective Batching for Recurrent Neural Network Grammars | Hiroshi Noji and Yohei Oseki | N/A | N/A |
| Verb Sense Clustering using Contextualized Word Representations for Semantic Frame Induction | Kosuke Yamada, Ryohei Sasano and Koichi Takeda | N/A | N/A |
| Enhancing Chinese Word Segmentation via Pseudo Labels for Practicability | Kaiyu Huang, Junpeng Liu, Degen Huang, Deyi Xiong, Zhuang Liu and Jinsong Su | N/A | N/A |
| Logic-Consistency Text Generation from Semantic Parses | Chang Shu, Yusen Zhang, Xiangyu Dong, Peng Shi, Tao Yu and Rui Zhang | N/A | N/A |
| Inducing Semantic Roles Without Syntax | Julian Michael and Luke Zettlemoyer | N/A | N/A |
| Plot and Rework: Modeling Storylines for Visual Storytelling | Chi-yang Hsu, Yun-Wei Chu, Ting-Hao Huang and Lun-Wei Ku | N/A | N/A |
| Disentangled Code Representation Learning for Multiple Programming Languages | Jingfeng Zhang, Haiwen Hong, Yin Zhang, Yao Wan, Ye Liu and Yulei Sui | N/A | N/A |
| Exploring Self-Identified Counseling Expertise in Online Support Forums | Allison Lahnala, Yuntian Zhao, Charles Welch, Jonathan K. Kummerfeld, Lawrence C An, Kenneth Resnicow, Rada Mihalcea and Veronica Perez-Rosas | N/A | N/A |
| An Investigation of Suitability of Pre-Trained Language Models for Dialogue Generation - Avoiding Discrepancies | Yan Zeng and Jian-Yun Nie | N/A | N/A |
| Learning to Sample Replacements for ELECTRA Pre-Training | Yaru Hao, Li Dong, Hangbo Bao, Ke Xu and Furu Wei | N/A | N/A |
| Reordering Examples Helps during Priming-based Few-Shot Learning | Sawan Kumar and Partha Talukdar | N/A | N/A |
| Constrained Labeled Data Generation for Low-Resource Named Entity Recognition | Ruohao Guo and Dan Roth | N/A | N/A |
| He is very intelligent, she is very beautiful? On Mitigating Social Biases in Language Modelling and Generation | Aparna Garimella, Akhash Amarnath, Kiran Kumar, Akash Pramod Yalla, Anandhavelu N, Niyati Chhaya and Balaji Vasan Srinivasan | N/A | N/A |
| Using Social and Linguistic Information to Adapt Pretrained Representations for Political Perspective Identification | Chang Li and Dan Goldwasser | N/A | N/A |
| Modeling Event-Pair Relations in External Knowledge Graphs for Script Reasoning | Yucheng Zhou, Xiubo Geng, Tao Shen, Jian Pei, Wenqiang Zhang and Daxin Jiang | N/A | N/A |
| PROST: Physical Reasoning about Objects through Space and Time | Stephane Aroca-Ouellette, Cory Paik, Alessandro Roncone and Katharina Kann | N/A | N/A |
| Revisiting the Evaluation of End-to-end Event Extraction | Shun Zheng, Wei Cao, Wei Xu and Jiang Bian | N/A | N/A |
| HIT - A Hierarchically Fused Deep Attention Network for Robust Code-mixed Language Representation | Ayan Sengupta, Sourabh Kumar Bhattacharjee, Tanmoy Chakraborty and Md. Shad Akhtar | N/A | N/A |
| Semi-Supervised Data Programming with Subset Selection | Ayush Maheshwari, Oishik Chatterjee, Krishnateja Killamsetty, Ganesh Ramakrishnan and Rishabh Iyer | N/A | N/A |
| Fingerprinting Fine-tuned Language Models in the Wild | Nirav Diwan, Tanmoy Chakraborty and Zubair Shafiq | N/A | N/A |
| Automatic Construction of Sememe Knowledge Bases via Dictionaries | Fanchao Qi, Yangyi Chen, Fengyu Wang, Zhiyuan Liu, Xiao Chen and Maosong Sun | N/A | N/A |
| XL-Sum: Large-Scale Multilingual Abstractive Summarization for 44 Languages | Tahmid Hasan, Abhik Bhattacharjee, Md. Saiful Islam, Kazi Mubasshir, Yuan-Fang Li, Yong-Bin Kang, M. Sohel Rahman and Rifat Shahriyar | N/A | N/A |
| Investigating Memorization of Conspiracy Theories in Text Generation | Sharon Levy, Michael Saxon and William Yang Wang | N/A | N/A |
| A Text-Centered Shared-Private Framework via Cross-Modal Prediction for Multimodal Sentiment Analysis | Yang Wu, Zijie Lin, Yanyan Zhao, Bing Qin and Li-Nan Zhu | N/A | N/A |
| What Would a Teacher Do? Predicting Future Talk Moves | Ananya Ganesh, Martha Palmer and Katharina Kann | N/A | N/A |
| Cross-Domain Review Generation for Aspect-Based Sentiment Analysis | Jianfei Yu, Chenggong Gong and Rui Xia | N/A | N/A |
| On the Language Coverage Bias for Neural Machine Translation | Shuo Wang, Zhaopeng Tu, Zhixing Tan, Shuming Shi, Maosong Sun and Yang Liu | N/A | N/A |
| Named Entity Recognition via Noise Aware Training Mechanism with Data Filter | Xiusheng Huang, Yubo Chen, Shun Wu, Jun Zhao, Yuantao Xie and Weijian Sun | N/A | N/A |
| A Multi-Task Approach for Improving Biomedical Named Entity Recognition by Incorporating Multi-Granularity information | Yiqi Tong, Yidong Chen and Xiaodong Shi | N/A | N/A |
| EBERT: Efficient BERT Inference with Dynamic Structured Pruning | Zejian Liu, Fanrong Li, Gang Li and Jian Cheng | N/A | N/A |
| Sketch and Refine: Towards Faithful and Informative Table-to-Text Generation | Peng Wang, Junyang Lin, An Yang, Chang Zhou, Yichang Zhang, Jingren Zhou and Hongxia Yang | N/A | N/A |
| TILGAN: Transformer-based Implicit Latent GAN for Diverse and Coherent Text Generation | Shizhe Diao, Xinwei Shen, Kashun Shum, Yan Song and Tong Zhang | N/A | N/A |
| John praised Mary because he? Implicit Causality Bias and Its Interaction with Explicit Cues in LMs | Yova Kementchedjhieva, Mark Anderson and Anders Sogaard | N/A | N/A |
| Enhancing the Open-Domain Dialogue Evaluation in Latent Space | Zhangming Chan, Lemao Liu, Juntao Li, Haisong Zhang, Dongyan Zhao, Shuming Shi and Rui Yan | N/A | N/A |
| DocNLI: A Large-scale Dataset for Document-level Natural Language Inference | Wenpeng Yin, Dragomir Radev and Caiming Xiong | N/A | N/A |
| Are Multilingual Models the Best Choice for Moderately Under-resourced Languages? A Comprehensive Assessment for Catalan | Jordi Armengol-Estape, Casimiro Pio Carrino, Carlos Rodriguez-Penagos, Ona De Gibert Bonet, Carme Armentano-Oller, Aitor Gonzalez-Agirre, Maite Melero and Marta Villegas | N/A | N/A |
| Language Models Use Monotonicity to Assess NPI Licensing | Jaap Jumelet, Milica Denic, Jakub Szymanik, Dieuwke Hupkes and Shane Steinert-Threlkeld | N/A | N/A |
| Slot Transferability for Cross-domain Slot Filling | Hengtong Lu, Zhuoxin Han, Caixia Yuan, Xiaojie Wang, Shuyu Lei, Huixing Jiang and Wei Wu | N/A | N/A |
| Word Graph Guided Summarization for Radiology Findings | Jinpeng Hu, Jianling Li, Zhihong Chen, Yaling Shen, Yan Song, Xiang Wan and Tsung-Hui Chang | N/A | N/A |
| Generalized Supervised Attention for Text Generation | Yixian Liu, Liwen Zhang, Xinyu Zhang, Yong Jiang, Yue Zhang and Kewei Tu | N/A | N/A |
| Automatically Select Emotion for Response via Personality-affected Emotion Transition | Zhiyuan Wen, Jiannong Cao, Ruosong Yang, Shuaiqi Liu and Jiaxing Shen | N/A | N/A |
| Phrase-Level Action Reinforcement Learning for Neural Dialog Response Generation | Takato Yamazaki and Akiko Aizawa | N/A | N/A |
| Automatic Speech Recognition in Sanskrit: A New Speech Corpus and Modelling Insights | Devaraja Adiga, Rishabh Kumar, Amrith Krishna, Preethi Jyothi, Ganesh Ramakrishnan and Pawan Goyal | N/A | N/A |
| DialogSum: A Real-Life Scenario Dialogue Summarization Dataset | Yulong Chen, Yang Liu, Liang Chen and Yue Zhang | N/A | N/A |
| What Did You Refer to? Evaluating Co-References in Dialogue | Wei-Nan Zhang, Yue Zhang, Hanlin Tang, Zhengyu Zhao, Caihai Zhu and Ting Liu | N/A | N/A |
| Controllable Abstractive Dialogue Summarization with Sketch Supervision | Chien-Sheng Wu, Linqing Liu, Wenhao Liu, Pontus Stenetorp and Caiming Xiong | N/A | N/A |
| Elaborative Simplification: Content Addition and Explanation Generation in Text Simplification | Neha Srikanth and Junyi Jessy Li | N/A | N/A |
| Diagnosing Transformers in Task-Oriented Semantic Parsing | Shrey Desai and Ahmed Aly | N/A | N/A |
| More Parameters? No Thanks! | Zeeshan Khan, Kartheek Akella, Vinay Namboodiri and C V Jawahar | N/A | N/A |
| More than just Frequency? Demasking Unsupervised Hypernymy Prediction Methods | Thomas Bott, Dominik Schlechtweg and Sabine Schulte im Walde | N/A | N/A |
| CoDesc: A Large Code-Description Parallel Dataset | Masum Hasan, Tanveer Muttaqueen, Abdullah Al Ishtiaq, Kazi Sajeed Mehrab, Md. Mahim Anjum Haque, Tahmid Hasan, Wasi Ahmad, Anindya Iqbal and Rifat Shahriyar | N/A | N/A |
| Better Chinese Sentence Segmentation with Reinforcement Learning | Srivatsan Srinivasan and Chris Dyer | N/A | N/A |
| Reader-Guided Passage Reranking for Open-Domain Question Answering | Yuning Mao, Pengcheng He, Xiaodong Liu, Yelong Shen, Jianfeng Gao, Jiawei Han and Weizhu Chen | N/A | N/A |
| LenAtten: An Effective Length Controlling Unit For Text Summarization | Zhongyi Yu, Zhenghao Wu, Hao Zheng, Zhe XuanYuan, Jefferson Fong and Weifeng Su | N/A | N/A |
| Using Word Embeddings to Analyze Teacher Evaluations: An Application to a Filipino Education Non-Profit Organization | Francesca Vera | N/A | N/A |
| Relation Classification with Entity Type Restriction | Shengfei Lyu and Huanhuan Chen | N/A | N/A |
| Decoupling Adversarial Training for Fair NLP | Xudong Han, Timothy Baldwin and Trevor Cohn | N/A | N/A |
| GO FIGURE: A Meta Evaluation of Factuality in Summarization | Saadia Gabriel, Asli Celikyilmaz, Rahul Jha, Yejin Choi and Jianfeng Gao | N/A | N/A |
| Error Detection in Large-Scale Natural Language Understanding Systems Using Transformer Models | Rakesh Chada, Pradeep Natarajan, Darshan Fofadiya and Prathap Ramachandra | N/A | N/A |
| Benchmarking Robustness of Machine Reading Comprehension Models | Chenglei Si, Ziqing Yang, Yiming Cui, Wentao Ma, Ting Liu and Shijin Wang | N/A | N/A |
| Improving BERT with Syntax-aware Local Attention | Zhongli Li, Qingyu Zhou, Chao Li, Ke Xu and Yunbo Cao | N/A | N/A |
| Boundary Detection with BERT for Span-level Emotion Cause Analysis | Xiangju Li, Wei Gao, Shi Feng, Yifei Zhang and Daling Wang | N/A | N/A |
| Can the Transformer Learn Nested Recursion with Symbol Masking? | Jean-Philippe Bernardy, Adam Ek and Vladislav Maraev | N/A | N/A |
| Evaluating the Efficacy of Summarization Evaluation across Languages | Fajri Koto, Jey Han Lau and Timothy Baldwin | N/A | N/A |
| Investigating Text Simplification Evaluation | Laura Vasquez-Rodriguez, Matthew Shardlow, Piotr Przybyła and Sophia Ananiadou | N/A | N/A |
| Frustratingly Simple Few-Shot Slot Tagging | Jianqiang Ma, Zeyu Yan, Chang Li and Yang Zhang | N/A | N/A |
| Investigating the Reordering Capability in CTC-based Non-Autoregressive End-to-End Speech Translation | Shun-Po Chuang, Yung-Sung Chuang, Chih-Chiang Chang and Hung-yi Lee | N/A | N/A |
| Fusing Context Into Knowledge Graph for Commonsense Question Answering | Yichong Xu, Chenguang Zhu, Ruochen Xu, Yang Liu, Michael Zeng and Xuedong Huang | N/A | N/A |
| Grammar-Constrained Neural Semantic Parsing with LR Parsers | Artur Baranowski and Nico Hochgeschwender | N/A | N/A |
| Grammar-Based Patches Generation for Automated Program Repair | Yu Tang, Long Zhou, Ambrosio Blanco, Shujie Liu, Furu Wei, Ming Zhou and Muyun Yang | N/A | N/A |
| Exploiting Position Bias for Robust Aspect Sentiment Classification | Fang Ma, Chen Zhang and Dawei Song | N/A | N/A |
| Jointly Identifying Rhetoric and Implicit Emotions via Multi-Task Learning | Xin Chen, Zhen Hai, Deyu Li, Suge Wang and Dian Wang | N/A | N/A |
| Encouraging Neural Machine Translation to Satisfy Terminology Constraints | Melissa Ailem, Jingshu Liu and Raheel Qader | N/A | N/A |
| BertGCN: Transductive Text Classification by Combining GNN and BERT | Yuxiao Lin, Yuxian Meng, Xiaofei Sun, Qinghong Han, Kun Kuang, Jiwei Li and Fei Wu | N/A | N/A |
| UserAdapter: Few-Shot User Learning in Sentiment Analysis | Wanjun Zhong, Duyu Tang, Jiahai Wang, Jian Yin and Nan Duan | N/A | N/A |
| Learning Slice-Aware Representations with Mixture of Attentions | Cheng Wang, Sungjin Lee, Sunghyun Park, Han Li, Young-Bum Kim and Ruhi Sarikaya | N/A | N/A |
| Better Robustness by More Coverage: Adversarial and Mixup Data Augmentation for Robust Finetuning | Chenglei Si, Zhengyan Zhang, Fanchao Qi, Zhiyuan Liu, Yasheng Wang, Qun Liu and Maosong Sun | N/A | N/A |
| Fusing Label Embedding into BERT: An Efficient Improvement for Text Classification | Yijin Xiong, Yukun Feng, Hao Wu, Hidetaka Kamigaito and Manabu Okumura | N/A | N/A |
| How Reliable are Model Diagnostics? | Vamsi Aribandi, Yi Tay and Donald Metzler | N/A | N/A |
| Cross-Lingual Cross-Domain Nested Named Entity Evaluation on English Web Texts | Barbara Plank | N/A | N/A |
| Alternated Training with Synthetic and Authentic Data for Neural Machine Translation | Rui Jiao, Zonghan Yang, Maosong Sun and Yang Liu | N/A | N/A |
| End-to-End Self-Debiasing Framework for Robust NLU Training | Abbas Ghaddar, Phillippe Langlais, Mehdi Rezagholizadeh and Ahmad Rashid | N/A | N/A |
| Injecting Knowledge Base Information into End-to-End Joint Entity and Relation Extraction and Coreference Resolution | Severine Verlinden, Klim Zaporojets, Johannes Deleu, Thomas Demeester and Chris Develder | N/A | N/A |
| Annotation and Evaluation of Coreference Resolution in Screenplays | Sabyasachee Baruah, Sandeep Nallan Chakravarthula and Shrikanth Narayanan | N/A | N/A |
| RetroGAN: A Cyclic Post-Specialization System for Improving Out-of-Knowledge and Rare Word Representations | Pedro Colon-Hernandez, Yida Xin, Henry Lieberman, Catherine Havasi, Cynthia Breazeal and Peter Chin | N/A | N/A |
| Fusion: Towards Automated ICD Coding via Feature Compression | Junyu Luo, Cao Xiao, Lucas Glass, Jimeng Sun and Fenglong Ma | N/A | N/A |
| Is Human Scoring the Best Criteria for Summary Evaluation? | Oleg Vasilyev and John Bohannon | N/A | N/A |
| Assessing Dialogue Systems with Distribution Distances | Jiannan Xiang, Yahui Liu, Deng Cai, Huayang Li, Defu Lian and Lemao Liu | N/A | N/A |
| Structure-Aware Pre-Training for Table-to-Text Generation | Xinyu Xing and Xiaojun Wan | N/A | N/A |
| A Multi-Task Learning Framework for Multi-Target Stance Detection | Yingjie Li and Cornelia Caragea | N/A | N/A |
| MA-BERT: Learning Representation by Incorporating Multi-Attribute Knowledge in Transformers | You Zhang, Jin Wang, Liang-Chih Yu and Xuejie Zhang | N/A | N/A |
| Event Extraction from Historical Texts: A New Dataset for Black Rebellions | Viet Lai, Minh Van Nguyen, Heidi Kaufman and Thien Huu Nguyen | N/A | N/A |
| Zero-shot Medical Entity Retrieval without Annotation: Learning From Rich Knowledge Graph Semantics | Luyang Kong, Christopher Winestock and Parminder Bhatia | N/A | N/A |
| Stylized Story Generation with Style-Guided Planning | Xiangzhe Kong, Jialiang Huang, Ziquan Tung, Jian Guan and Minlie Huang | N/A | N/A |
| AdaST: Dynamically Adapting Encoder States in the Decoder for End-to-End Speech-to-Text Translation | Wuwei Huang, Dexin Wang and Deyi Xiong | N/A | N/A |
| A Neural Edge-Editing Approach for Document-Level Relation Graph Extraction | Kohei Makino, Makoto Miwa and Yutaka Sasaki | N/A | N/A |
| Enhancing Language Generation with Effective Checkpoints of Pre-trained Language Model | Jeonghyeok Park and Hai Zhao | N/A | N/A |
| Transformer-Exclusive Cross-Modal Representation for Vision and Language | Andrew Shin and Takuya Narihira | N/A | N/A |
| Progressive Multi-Granularity Training for Non-Autoregressive Translation | Liang Ding, Longyue Wang, Xuebo Liu, Derek F. Wong, Dacheng Tao and Zhaopeng Tu | N/A | N/A |
| Do Multilingual Neural Machine Translation Models Contain Language Pair Specific Attention Heads? | Zae Myung Kim, Laurent Besacier, Vassilina Nikoulina and Didier Schwab | N/A | N/A |
| Modeling the Influence of Verb Aspect on the Activation of Typical Event Locations with BERT | Won Ik Cho, Emmanuele Chersoni, Yu-Yin Hsu and Chu-Ren Huang | N/A | N/A |
| Language Tags Matter for Zero-Shot Neural Machine Translation | Liwei Wu, Shanbo Cheng, Mingxuan Wang and Lei Li | N/A | N/A |
| Retrieval Enhanced Model for Commonsense Generation | Han Wang, Yang Liu, Chenguang Zhu, Linjun Shou, Ming Gong, Yichong Xu and Michael Zeng | N/A | N/A |
| Improve Query Focused Abstractive Summarization by Incorporating Answer Relevance | Dan Su, Tiezheng Yu and Pascale Fung | N/A | N/A |
| Learning a Reversible Embedding Mapping using Bi-Directional Manifold Alignment | Ashwinkumar Ganesan, Francis Ferraro and Tim Oates | N/A | N/A |
| Enhancing Zero-shot and Few-shot Stance Detection with Commonsense Knowledge Graph | Rui Liu, Zheng Lin, Yutong Tan and Weiping Wang | N/A | N/A |
| Manifold Adversarial Augmentation for Neural Machine Translation | Guandan Chen, Kai Fan, Kaibo Zhang, Boxing Chen and Zhongqiang Huang | N/A | N/A |
| Is the Lottery Fair? Evaluating Winning Tickets Across Demographics | Victor Petren Bach Hansen and Anders Sogaard | N/A | N/A |
| SSMix: Saliency-Based Span Mixup for Text Classification | Soyoung Yoon, Gyuwan Kim and Kyumin Park | N/A | N/A |
| DoT: An efficient Double Transformer for NLP tasks with tables | Syrine Krichene, Thomas Muller and Julian Eisenschlos | N/A | N/A |
| Grammatical Error Correction as GAN-like Sequence Labeling | Kevin Parnow, Zuchao Li and Hai Zhao | N/A | N/A |
| Neural Entity Recognition with Gazetteer based Fusion | Qing Sun and Parminder Bhatia | N/A | N/A |
| Figurative Language in Recognizing Textual Entailment | Tuhin Chakrabarty, Debanjan Ghosh, Adam Poliak and Smaranda Muresan | N/A | N/A |
| An Exploratory Analysis of the Relation between Offensive Language and Mental Health | Ana-Maria Bucur, Marcos Zampieri and Liviu P. Dinu | N/A | N/A |
| Do Language Models Perform Generalizable Commonsense Inference? | Peifeng Wang, Filip Ilievski, Muhao Chen and Xiang Ren | N/A | N/A |
| Few-Shot Upsampling for Protest Size Detection | Andrew Halterman and Benjamin J. Radford | N/A | N/A |
| Modeling the Unigram Distribution | Irene Nikkarinen, Tiago Pimentel, Damian Blasi and Ryan Cotterell | N/A | N/A |
| On the Lack of Robust Interpretability of Neural Text Classifiers | Muhammad Bilal Zafar, Michele Donini, Dylan Slack, Cedric Archambeau, Sanjiv Das and Krishnaram Kenthapadi | N/A | N/A |
| Multimodal Graph-based Transformer Framework for Biomedical Relation Extraction | Sriram Pingali, Shweta Yadav, Pratik Dutta and Sriparna Saha | N/A | N/A |
| Summary Grounded Conversation Generation | Chulaka Gunasekara, Guy Feigenblat, Benjamin Sznajder, Sachindra Joshi and David Konopnicki | N/A | N/A |
| On the Gap between Adoption and Understanding in NLP | Federico Bianchi and Dirk Hovy | N/A | N/A |
| Learning Disentangled Latent Topics for Twitter Rumour Veracity Classification | John Dougrez-Lewis, Maria Liakata, Elena Kochkina and Yulan He | N/A | N/A |
| Minimally-Supervised Morphological Segmentation using Adaptor Grammars with Linguistic Priors | Ramy Eskander, Cass Lowry, Sujay Khandagale, Francesca Callejas, Judith Klavans, Maria Polinsky and Smaranda Muresan | N/A | N/A |
| Predicting in-hospital mortality by combining clinical notes with time-series data | Iman Deznabi, Mohit Iyyer and Madalina Fiterau | N/A | N/A |
| Sequence Models for Computational Etymology of Borrowings | Winston Wu, Kevin Duh and David Yarowsky | N/A | N/A |
| Improving Automated Evaluation of Open Domain Dialog via Diverse Reference Augmentation | Varun Gangal, Harsh Jhamtani, Eduard Hovy and Taylor Berg-Kirkpatrick | N/A | N/A |
| New Dataset and Strong Baselines for the Grammatical Error Correction of Russian | Viet Anh Trinh and Alla Rozovskaya | N/A | N/A |
| Effective Attention Sheds Light On Interpretability | Kaiser Sun and Ana Marasović | N/A | N/A |
| On the Distribution, Sparsity, and Inference-time Quantization of Attention Values in Transformers | Tianchu Ji, Shraddhan Jain, Michael Ferdman, Peter Milder, H. Andrew Schwartz and Niranjan Balasubramanian | N/A | N/A |
| Ethical-Advice Taker: Do Language Models Understand Natural Language Interventions? | Jieyu Zhao, Daniel Khashabi, Tushar Khot, Ashish Sabharwal and Kai-Wei Chang | N/A | N/A |
| Answer Generation for Retrieval-based Question Answering Systems | Chao-Chun Hsu, Eric Lind, Luca Soldaini and Alessandro Moschitti | N/A | N/A |
| Federated Chinese Word Segmentation with Global Character Associations | Yuanhe Tian, Guimin Chen, Han Qin and Yan Song | N/A | N/A |
| PSED: A Dataset for Selecting Emphasis in Presentation Slides | Amirreza Shirani, Giai Tran, Hieu Trinh, Franck Dernoncourt, Nedim Lipka, Jose Echevarria, Thamar Solorio and Paul Asente | N/A | N/A |
| Modulating Language Models with Emotions | Ruibo Liu, Jason Wei, Chenyan Jia and Soroush Vosoughi | N/A | N/A |
| Benchmarking Neural Topic Models: An Empirical Study | Thanh-Nam Doan and Tuan-Anh Hoang | N/A | N/A |
| Analysis of Tree-Structured Architectures for Code Generation | Samip Dahal, Adyasha Maharana and Mohit Bansal | N/A | N/A |
| How Does Distilled Data Complexity Impact the Quality and Confidence of Non-Autoregressive Machine Translation? | Weijia Xu, Shuming Ma, Dongdong Zhang and Marine Carpuat | N/A | N/A |
| Leveraging Topic Relatedness for Argument Persuasion | Xinran Zhao, Esin Durmus, Hongming Zhang and Claire Cardie | N/A | N/A |
| One Teacher is Enough? Pre-trained Language Model Distillation from Multiple Teachers | Chuhan Wu, Fangzhao Wu and Yongfeng Huang | N/A | N/A |
| Task-adaptive Pre-training of Language Models with Word Embedding Regularization | Kosuke Nishida, Kyosuke Nishida and Sen Yoshida | N/A | N/A |
| Do Grammatical Error Correction Models Realize Grammatical Generalization? | Masato Mita and Hitomi Yanaka | N/A | N/A |
| Domain-Aware Dependency Parsing for Questions | Aparna Garimella, Laura Chiticariu and Yunyao Li | N/A | N/A |
| Enhancing Dialogue-based Relation Extraction by Speaker and Trigger Words Prediction | Tianyang Zhao, Zhao Yan, Yunbo Cao and Zhoujun Li | N/A | N/A |
| Direct Simultaneous Speech-to-Text Translation Assisted by Synchronized Streaming ASR | Junkun Chen, Mingbo Ma, Renjie Zheng and Liang Huang | N/A | N/A |
| Analyzing Code Embeddings for Coding Clinical Narratives | Wei Shi, Jiewen Wu, Xiwen Yang, Nancy Chen, Ivan Ho Mien, Jung-Jae Kim and Pavitra Krishnaswamy | N/A | N/A |
| Rule-Aware Reinforcement Learning for Knowledge Graph Reasoning | Zhongni Hou, Xiaolong Jin, Zixuan Li and Long Bai | N/A | N/A |
| Use of Formal Ethical Reviews in NLP Literature: Historical Trends and Current Practices | Sebastin Santy, Anku Rani and Monojit Choudhury | N/A | N/A |
| As Easy as 1, 2, 3: Behavioural Testing of NMT Systems for Numerical Translation | Jun Wang, Chang Xu, Francisco Guzman, Ahmed El-Kishky, Benjamin Rubinstein and Trevor Cohn | N/A | N/A |
| BioGen: Generating Biography Summary under Table Guidance on Wikipedia | Shen Gao, Xiuying Chen, Chang Liu, Dongyan Zhao and Rui Yan | N/A | N/A |
| Multilingual Simultaneous Neural Machine Translation | Philip Arthur, Dongwon Ryu and Gholamreza Haffari | N/A | N/A |
| Strong and Light Baseline Models for Fact-Checking Joint Inference | Kateryna Tymoshenko and Alessandro Moschitti | N/A | N/A |
| Do It Once: An Embarrassingly Simple Joint Matching Approach to Response Selection | Linhao Zhang, Dehong Ma, Sujian Li and Houfeng Wang | N/A | N/A |
| Climbing the Tower of Treebanks: Improving Low-Resource Dependency Parsing via Hierarchical Source Selection | Goran Glavaš and Ivan Vulić | N/A | N/A |
| Adapting Monolingual Models: Data can be Scarce when Language Similarity is High | Wietse De Vries, Martijn Bartelds, Malvina Nissim and Martijn Wieling | N/A | N/A |
| BatchMixup: Improving Training by Interpolating Hidden States of the Entire Mini-batch | Wenpeng Yin, Huan Wang, Jin Qu and Caiming Xiong | N/A | N/A |
| Rule Augmented Unsupervised Constituency Parsing | Atul Sahay, Anshul Nasery, Ayush Maheshwari, Ganesh Ramakrishnan and Rishabh Iyer | N/A | N/A |
| How transfer learning impacts linguistic knowledge in deep NLP models? | Nadir Durrani, Hassan Sajjad and Fahim Dalvi | N/A | N/A |
| Uncertainty Aware Review Hallucination for Science Article Classification | Korbinian Friedl, Georgios Rizos, Lukas Stappen, Madina Hasan, Lucia Specia, Thomas Hain and Bjorn Schuller | N/A | N/A |
| Highlight-Transformer: Leveraging Key Phrase Aware Attention to Improve Abstractive Multi-Document Summarization | Shuaiqi Liu, Jiannong Cao, Ruosong Yang and Zhiyuan Wen | N/A | N/A |
| Constraint based Knowledge Base Distillation in End-to-End Task Oriented Dialogs | Dinesh Raghu, Atishya Jain, Mausam and Sachindra Joshi | N/A | N/A |
| Beyond Metadata: What Paper Authors Say About Corpora They Use | Nikolay Kolyada, Martin Potthast and Benno Stein | N/A | N/A |
| Knowledge Distillation for Quality Estimation | Amit Gajbhiye, Marina Fomicheva, Fernando Alva-Manchego, Frederic Blain, Abiola Obamuyide, Nikolaos Aletras and Lucia Specia | N/A | N/A |
| Cross-document Coreference Resolution over Predicted Mentions | Arie Cattan, Alon Eirew, Gabriel Stanovsky, Mandar Joshi and Ido Dagan | N/A | N/A |
| Could you give me a hint ? Generating inference graphs for defeasible reasoning | Aman Madaan, Dheeraj Rajagopal, Niket Tandon, Yiming Yang and Eduard Hovy | N/A | N/A |
| Characterizing Social Spambots by their Human Traits | Salvatore Giorgi, Lyle Ungar and H. Andrew Schwartz | N/A | N/A |
ACL 2023
| Title | Author | PDF_Link | Code_URL |
|---|---|---|---|
| The Elephant in the Room: Analyzing the Presence of Big Tech in Natural Language Processing Research | Mohamed Abdalla, Jan Philip Wahle, Terry Lima Ruas, Aurélie Névéol, Fanny Ducel, Saif Mohammad and Karen Fort | N/A | N/A |
| How About Kind of Generating Hedges using End-to-End Neural Models? | Alafate Abulimiti, Chloé Clavel and Justine Cassell | N/A | N/A |
| What are the Desired Characteristics of Calibration Sets? Identifying Correlates on Long Form Scientific Summarization | Griffin Adams, Bichlien Nguyen, Jake Smith, Yingce Xia, Shufang Xie, Anna Ostropolets, Budhaditya Deb, Yuan-Jyue Chen, Tristan Naumann and Noémie Elhadad | N/A | N/A |
| Generating EDU Extracts for Plan-Guided Summary Re-Ranking | Griffin Adams, Alex Fabbri, Faisal Ladhak, Noémie Elhadad and Kathleen McKeown | N/A | N/A |
| The CRINGE Loss: Learning what language not to model | Leonard Adolphs, Tianyu Gao, Jing Xu, Kurt Shuster, Sainbayar Sukhbaatar and Jason Weston | N/A | N/A |
| Multimodal Persona Based Generation of Comic Dialogs | Harsh Agrawal, Aditya Mishra, Manish Gupta and Mausam | N/A | N/A |
| Script Normalization for Unconventional Writing of Under-Resourced Languages in Bilingual Communities | Sina Ahmadi and Antonios Anastasopoulos | N/A | N/A |
| MPCHAT: Towards Multimodal Persona-Grounded Conversation | Jaewoo Ahn, Yeda Song, Sangdoo Yun and Gunhee Kim | N/A | N/A |
| On-the-fly Cross-lingual Masking for Multilingual Pre-training | Xi Ai and Bin Fang | N/A | N/A |
| Early Discovery of Disappearing Entities in Microblogs | Satoshi Akasaki, Naoki Yoshinaga and Masashi Toyoda | N/A | N/A |
| Prompter: Zero-shot Adaptive Prefixes for Dialogue State Tracking Domain Adaptation | Ibrahim Taha Aksu, Min-Yen Kan and Nancy Chen | N/A | N/A |
| RL4F: Generating Natural Language Feedback with Reinforcement Learning for Repairing Model Outputs | Afra Feyza Akyurek, Ekin Akyurek, Ashwin Kalyan, Peter Clark, Derry Tanti Wijaya and Niket Tandon | N/A | N/A |
| LexSym: Compositionality as Lexical Symmetry | Ekin Akyurek and Jacob Andreas | N/A | N/A |
| A Diverse Set of Freely Available Linguistic Resources for Turkish | Duygu ALTINOK | N/A | N/A |
| Query Refinement Prompts for Closed-Book Long-Form QA | Reinald Kim Amplayo, Kellie Webster, Michael Collins, Dipanjan Das and Shashi Narayan | N/A | N/A |
| Exploiting Biased Models to De-bias Text: A Gender-Fair Rewriting Model | Chantal Amrhein, Florian Schottmann, Rico Sennrich and Samuel Läubli | N/A | N/A |
| How Do In-Context Examples Affect Compositional Generalization? | Shengnan An, Zeqi Lin, Qiang Fu, Bei Chen, Nanning Zheng, Jian-Guang LOU and Dongmei Zhang | N/A | N/A |
| DisorBERT: A Double Domain Adaptation Model for Detecting Signs of Mental Disorders in Social Media | Mario Aragon, Adrian Pastor Lopez Monroy, Luis Gonzalez, David E. Losada and Manuel Montes | N/A | N/A |
| Topic-Guided Sampling For Data-Efficient Multi-Domain Stance Detection | Erik Arakelyan, Arnav Arora and Isabelle Augenstein | N/A | N/A |
| Unbalanced Optimal Transport for Unbalanced Word Alignment | Yuki Arase, Han Bao and Sho Yokoi | N/A | N/A |
| The KITMUS Test: Evaluating Knowledge Integration from Multiple Sources | Akshatha Arodi, Martin Pömsl, Kaheer Suleman, Adam Trischler, Alexandra Olteanu and Jackie Chi Kit Cheung | N/A | N/A |
| Direct Fact Retrieval from Knowledge Graphs without Entity Linking | Jinheon Baek, Alham Fikri Aji, Jens Lehmann and Sung Ju Hwang | N/A | N/A |
| Wukong-Reader: Multi-modal Pre-training for Fine-grained Visual Document Understanding | Haoli Bai, Zhiguang Liu, Xiaojun Meng, li wentao, Shuang Liu, Yifeng LUO, nian xie, Rongfu Zheng, Liangwei Wang, Lu Hou, Jiansheng Wei, Xin Jiang and Qun Liu | N/A | N/A |
| Syntax and Geometry of Information | Raphaël Bailly, Laurent Leblond and Kata Gábor | N/A | N/A |
| Rethinking the Role of Scale for In-Context Learning: An Interpretability-based Case Study at 66 Billion Scale | Hritik Bansal, Karthik Gopalakrishnan, Saket Dingliwal, Sravan Bodapati, Katrin Kirchhoff and Dan Roth | N/A | N/A |
| Controlling Learned Effects to Reduce Spurious Correlations in Text Classifiers | Parikshit Bansal and Amit Sharma | N/A | N/A |
| Target-Side Augmentation for Document-Level Machine Translation | Guangsheng Bao, ZHIYANG TENG and Yue Zhang | N/A | N/A |
| A Synthetic Data Generation Framework for Grounded Dialogues | Jianzhu Bao, Rui Wang, Yasheng Wang, Aixin Sun, Yitong Li, Fei Mi and Ruifeng Xu | N/A | N/A |
| CASN: Class-Aware Score Network for Textual Adversarial Detection | Rong Bao, Rui Zheng, Liang Ding, Qi Zhang and Dacheng Tao | N/A | N/A |
| Human Inspired Progressive Alignment and Comparative Learning for Grounded Word Acquisition | Yuwei Bao, Barrett Lattimer and Joyce Chai | N/A | N/A |
| Making More of Little Data: Improving Low-Resource Automatic Speech Recognition Using Data Augmentation | Martijn Bartelds, Nay San, Bradley McDonnell, Dan Jurafsky and Martijn Wieling | N/A | N/A |
| NEUROSTRUCTURAL DECODING: Neural Text Generation with Structural Constraints | Mohaddeseh Bastan, Mihai Surdeanu and Niranjan Balasubramanian | N/A | N/A |
| Span-Selective Linear Attention Transformers for Effective and Robust Schema-Guided Dialogue State Tracking | Björn Bebensee and Haejun Lee | N/A | N/A |
| ELQA: A Corpus of Metalinguistic Questions and Answers about English | Shabnam Behzad, Keisuke Sakaguchi, Nathan Schneider and Amir Zeldes | N/A | N/A |
| ByGPT5: End-to-End Style-conditioned Poetry Generation with Token-free Language Models | Jonas Belouadi and Steffen Eger | N/A | N/A |
| I2D2: Inductive Knowledge Distillation with NeuroLogic and Self-Imitation | Chandra Bhagavatula, Jena D. Hwang, Doug Downey, Ronan Le Bras, Ximing Lu, Lianhui Qin, Keisuke Sakaguchi, Swabha Swayamdipta, Peter West and Yejin Choi | N/A | N/A |
| CrossSum: Beyond English-Centric Cross-Lingual Summarization for 1,500+ Language Pairs | Abhik Bhattacharjee, Tahmid Hasan, Wasi Uddin Ahmad, Yuan-Fang Li, Yong-Bin Kang and Rifat Shahriyar | N/A | N/A |
| Simplicity Bias in Transformers and their Ability to Learn Sparse Boolean Functions | Satwik Bhattamishra, Arkil Patel, Varun Kanade and Phil Blunsom | N/A | N/A |
| DiffusEmp: A Diffusion Model-Based Framework with Multi-Grained Control for Empathetic Response Generation | Guanqun Bi, Lei Shen, Yanan Cao, Meng Chen, Yuqiang Xie, Zheng Lin and Xiaodong He | N/A | N/A |
| Prompting Language Models for Linguistic Structure | Terra Blevins, Hila Gonen and Luke Zettlemoyer | N/A | N/A |
| SIMSUM: Document-level Text Simplification via Simultaneous Summarization | Sofia Blinova, Xinyu Zhou, Martin Jaggi, Carsten Eickhoff and Seyed Ali Bahrainian | N/A | N/A |
| WikiHowQA: A Comprehensive Benchmark for Multi-Document Non-Factoid Question Answering | Valeriia Bolotova-Baranova, Vladislav Blinov, Sofya Filippova, Falk Scholer and Mark Sanderson | N/A | N/A |
| Multilingual Event Extraction from Historical Newspaper Adverts | Nadav Borenstein, Natália da Silva Perez and Isabelle Augenstein | N/A | N/A |
| Searching for Needles in a Haystack: On the Role of Incidental Bilingualism in PaLM’s Translation Capability | Eleftheria Briakou, Colin Cherry and George Foster | N/A | N/A |
| Measuring Progress in Fine-grained Vision-and-Language Understanding | Emanuele Bugliarello, Laurent Sartran, Aishwarya Agrawal, Lisa Anne Hendricks and Aida Nematzadeh | N/A | N/A |
| Convergence and Diversity in the Control Hierarchy | Alexandra Butoi, Ryan Cotterell and David Chiang | N/A | N/A |
| Peek Across: Improving Multi-Document Modeling via Cross-Document Question-Answering | Avi Caciularu, Matthew Peters, Jacob Goldberger, Ido Dagan and Arman Cohan | N/A | N/A |
| Query Enhanced Knowledge-Intensive Conversation via Unsupervised Joint Modeling | Mingzhu Cai, Siqi Bao, Xin Tian, Huang He, Fan Wang and Hua Wu | N/A | N/A |
| Generating User-Engaging News Headlines | Pengshan Cai, Kaiqiang Song, Sangwoo Cho, Hongwei Wang, Xiaoyang Wang, Hong Yu, Fei Liu and Dong Yu | N/A | N/A |
| A Systematic Study of Knowledge Distillation for Natural Language Generation with Pseudo-Target Training | Nitay Calderon, Subhabrata Mukherjee, Roi Reichart and Amir Kantor | N/A | N/A |
| What is the best recipe for character-level encoder-only modelling? | Kris Cao | N/A | N/A |
| PuMer: Pruning and Merging Tokens for Efficient Vision Language Models | Qingqing Cao, Bhargavi Paranjape and Hannaneh Hajishirzi | N/A | N/A |
| Bridging the Domain Gaps in Context Representations for $k$-Nearest Neighbor Neural Machine Translation | Zhiwei Cao, Baosong Yang, Huan Lin, Suhang Wu, Xiangpeng Wei, Dayiheng Liu, Jun Xie, Min Zhang and Jinsong Su | N/A | N/A |
| From Key Points to Key Point Hierarchy: Structured and Expressive Opinion Summarization | Arie Cattan, Lilach Eden, Yoav Kantor and Roy Bar-Haim | N/A | N/A |
| Improving Gradient Trade-offs between Tasks in Multi-task Text Classification | Heyan Chai, Jinhao Cui, Ye Wang, Min Zhang, Binxing Fang and Qing Liao | N/A | N/A |
| Zero-shot Approach to Overcome Perturbation Sensitivity of Prompts | Mohna Chakraborty, Adithya Kulkarni and Qi Li | N/A | N/A |
| LeXFiles and LegalLAMA: Facilitating English Multinational Legal Language Model Development | Ilias Chalkidis, Nicolas Garneau, Catalina Goanta, Daniel Katz and Anders Søgaard | N/A | N/A |
| Few-shot Adaptation Works with UnpredicTable Data | Jun Shern Chan, Michael Pieler, Jonathan Jao, Jérémy Scheurer and Ethan Perez | N/A | N/A |
| Composition-contrastive Learning for Sentence Embeddings | Sachin Chanchani and Ruihong Huang | N/A | N/A |
| Multi-CLS BERT: An Efficient Alternative to Traditional Ensembling | Haw-Shiuan Chang, Ruei-Yao Sun, Kathryn Ricci and Andrew McCallum | N/A | N/A |
| Data Curation Alone Can Stabilize In-context Learning | Ting-Yun Chang and Robin Jia | N/A | N/A |
| Characterizing and Measuring Linguistic Dataset Drift | Tyler Chang, Kishaloy Halder, Neha Anna John, Yogarshi Vyas, Yassine Benajiba, Miguel Ballesteros and Dan Roth | N/A | N/A |
| Learning Symbolic Rules over Abstract Meaning Representations for Textual Reinforcement Learning | Subhajit Chaudhury, Sarathkrishna Swaminathan, Daiki Kimura, Prithviraj Sen, Keerthiram Murugesan, Rosario Uceda-Sosa, Michiaki Tatsubori, Achille Fokoue, Pavan Kapanipathi, Asim Munawar and Alexander Gray | N/A | N/A |
| Ideology Prediction from Scarce and Biased Supervision: Learn to Disregard the “What” and Focus on the “How”! | Chen Chen, Dylan Walker and Venkatesh Saligrama | N/A | N/A |
| Weakly Supervised Vision-and-Language Pre-training with Relative Representations | Chi Chen, Peng Li, Maosong Sun and Yang Liu | N/A | N/A |
| Label-Aware Hyperbolic Embeddings for Fine-grained Emotion Classification | Chih Yao Chen, Tun Min Hung, Yi-Li Hsu and Lun-Wei Ku | N/A | N/A |
| mCLIP: Multilingual CLIP via Cross-lingual Transfer | Guanhua Chen, Lu Hou, Yun Chen, Wenliang Dai, Lifeng Shang, Xin Jiang, Qun Liu, Jia Pan and Wenping Wang | N/A | N/A |
| REV: Information-Theoretic Evaluation of Free-Text Rationales | Hanjie Chen, Faeze Brahman, Xiang Ren, Yangfeng Ji, Yejin Choi and Swabha Swayamdipta | N/A | N/A |
| Did the Models Understand Documents? Benchmarking Models for Language Understanding in Document-Level Relation Extraction | Haotian Chen, Bingsheng Chen and Xiangdong Zhou | N/A | N/A |
| Nonlinear Structural Equation Model Guided Gaussian Mixture Hierarchical Topic Modeling | HeGang Chen, Pengbo Mao, Yuyin Lu and Yanghui Rao | N/A | N/A |
| Say What You Mean! Large Language Models Speak Too Positively about Negative Commonsense Knowledge | Jiangjie Chen, Wei Shi, Ziquan Fu, Sijie Cheng, Lei Li and Yanghua Xiao | N/A | N/A |
| Learning In-context Learning for Named Entity Recognition | Jiawei Chen, Yaojie Lu, Hongyu Lin, Jie Lou, Wei Jia, Dai Dai, Hua Wu, Boxi Cao, Xianpei Han and Le Sun | N/A | N/A |
| Exploring How Generative Adversarial Networks Learn Phonological Representations | Jingyi Chen and Micha Elsner | N/A | N/A |
| TableVLM: Multi-modal Pre-training for Table Structure Recognition | Leiyuan Chen, Chengsong Huang, Xiaoqing Zheng, Jinshu Lin and Xuanjing Huang | N/A | N/A |
| CHEER: Centrality-aware High-order Event Reasoning Network for Document-level Event Causality Identification | Meiqi Chen, Yixin Cao, Yan Zhang and Zhiwei Liu | N/A | N/A |
| BLASER: A Text-Free Speech-to-Speech Translation Evaluation Metric | Mingda Chen, Paul-Ambroise Duquenne, Pierre Andrews, Justine Kao, Alexandre Mourachko, Holger Schwenk and Marta R. Costa-jussà | N/A | N/A |
| Alleviating Over-smoothing for Unsupervised Sentence Representation | Nuo Chen, Linjun Shou, Jian Pei, Ming Gong, Bowen Cao, Jianhui Chang, Jia Li and Daxin Jiang | N/A | N/A |
| Consistent Prototype Learning for Few-Shot Continual Relation Extraction | Xiudi Chen, Hui Wu and Xiaodong Shi | N/A | N/A |
| Improving the Robustness of Summarization Systems with Dual Augmentation | Xiuying Chen, Guodong Long, Chongyang Tao, Mingzhe Li, Xin Gao, Chengqi Zhang and Xiangliang Zhang | N/A | N/A |
| DSEE: Dually Sparsity-embedded Efficient Tuning of Pre-trained Language Models | Xuxi Chen, Tianlong Chen, Weizhu Chen, Ahmed Hassan Awadallah, Zhangyang Wang and Yu Cheng | N/A | N/A |
| A Close Look into the Calibration of Pre-trained Language Models | Yangyi Chen, Lifan Yuan, Ganqu CUI, Zhiyuan Liu and Heng Ji | N/A | N/A |
| Dynamic Transformers Provide a False Sense of Efficiency | Yiming Chen, Simin Chen, Zexin Li, Wei Yang, Cong Liu, Robby Tan and Haizhou Li | N/A | N/A |
| PMAES: Prompt-mapping Contrastive Learning for Cross-prompt Automated Essay Scoring | Yuan Chen and Xia Li | N/A | N/A |
| Exploring Lottery Prompts for Pre-trained Language Models | Yulin Chen, Ning Ding, Xiaobin Wang, Shengding Hu, Haitao Zheng, Zhiyuan Liu and Pengjun Xie | N/A | N/A |
| UniSumm and SummZoo: Unified Model and Diverse Benchmark for Few-Shot Summarization | Yulong Chen, Yang Liu, Ruochen Xu, Ziyi Yang, Chenguang Zhu, Michael Zeng and Yue Zhang | N/A | N/A |
| Revisiting Cross-Lingual Summarization: A Corpus-based Study and A New Benchmark with Improved Annotation | Yulong Chen, Huajian Zhang, Yijie Zhou, Xuefeng Bai, Yueguan Wang, Ming Zhong, Jianhao Yan, Yafu Li, Judy Li, Xianchao Zhu and Yue Zhang | N/A | N/A |
| DISCO: Distilling Counterfactuals with Large Language Models | Zeming Chen, Qiyue Gao, Antoine Bosselut, Ashish Sabharwal and Kyle Richardson | N/A | N/A |
| From the One, Judge of the Whole: Typed Entailment Graph Construction with Predicate Generation | Zhibin Chen, Yansong Feng and Dongyan Zhao | N/A | N/A |
| Causal Intervention and Counterfactual Reasoning for Multi-modal Fake News Detection | Ziwei Chen, Linmei Hu, Weixin Li, Yingxia Shao and Liqiang Nie | N/A | N/A |
| Multi-granularity Temporal Question Answering over Knowledge Graphs | Ziyang Chen, Jinzhi Liao and Xiang Zhao | N/A | N/A |
| Explainable Recommendation with Personalized Review Retrieval and Aspect Learning | Hao Cheng, Shuo Wang, Wensheng Lu, Wei Zhang, Mingyang Zhou, Kezhong Lu and Hao Liao | N/A | N/A |
| MDACE: MIMIC Documents Annotated with Code Evidence | Hua Cheng, Rana Jafari, April Russell, Russell Klopfer, Edmond Lu, Benjamin Striner and Matthew Gormley | N/A | N/A |
| Marked Personas: Using Natural Language Prompts to Measure Stereotypes in Language Models | Myra Cheng, Esin Durmus and Dan Jurafsky | N/A | N/A |
| OpenSR: Open-Modality Speech Recognition via Maintaining Multi-Modality Alignment | Xize Cheng, Tao Jin, Linjun Li, Wang Lin, Xinyu Duan and Zhou Zhao | N/A | N/A |
| Dissecting Transformer Length Extrapolation via the Lens of Receptive Field Analysis | Ta-Chung Chi, Ting-Han Fan, Alexander Rudnicky and Peter Ramadge | N/A | N/A |
| Can Large Language Models Be an Alternative to Human Evaluations? | Cheng-Han Chiang and Hung-yi Lee | N/A | N/A |
| CELDA: Leveraging Black-box Language Model as Enhanced Classifier without Labels | Hyunsoo Cho, Youna Kim and Sang-goo Lee | N/A | N/A |
| Advancing Multi-Criteria Chinese Word Segmentation Through Criterion Classification and Denoising | Tzu Hsuan Chou, Chun-Yi Lin and Hung-Yu Kao | N/A | N/A |
| A Method for Studying Semantic Construal in Grammatical Constructions with Interpretable Contextual Embedding Spaces | Gabriella Chronis, Kyle Mahowald and Katrin Erk | N/A | N/A |
| Increasing Diversity While Maintaining Accuracy: Text Data Generation with Large Language Models and Human Interventions | John Chung, Ece Kamar and Saleema Amershi | N/A | N/A |
| Rule By Example: Harnessing Logical Rules for Explainable Hate Speech Detection | Christopher Clarke, Matthew Hall, Gaurav Mittal, Ye Yu, Sandra Sajeev, Jason Mars and Mei Chen | N/A | N/A |
| A dynamic programming algorithm for span-based nested named-entity recognition in O(n^2) | Caio Corro | N/A | N/A |
| Laziness Is a Virtue When It Comes to Compositionality in Neural Semantic Parsing | Maxwell Crouse, Pavan Kapanipathi, Subhajit Chaudhury, Tahira Naseem, Ramon Fernandez Astudillo, Achille Fokoue and Tim Klinger | N/A | N/A |
| Decoder Tuning: Efficient Language Understanding as Decoding | Ganqu CUI, Wentao Li, Ning Ding, Longtao Huang, Zhiyuan Liu and Maosong Sun | N/A | N/A |
| Adaptive and Personalized Exercise Generation for Online Language Learning | Peng Cui and Mrinmaya Sachan | N/A | N/A |
| What does the Failure to Reason with “Respectively” in Zero/Few-Shot Settings Tell Us about Language Models? | Ruixiang Cui, Seolhwa Lee, Daniel Hershcovich and Anders Søgaard | N/A | N/A |
| Free Lunch for Efficient Textual Commonsense Integration in Language Models | Wanyun Cui and Xingran Chen | N/A | N/A |
| From Ultra-Fine to Fine: Fine-tuning Ultra-Fine Entity Typing Models to Fine-grained | Hongliang Dai and Ziqian Zeng | N/A | N/A |
| Long-Tailed Question Answering in an Open World | Yi Dai, Hao Lang, Yinhe Zheng, Fei Huang and Yongbin Li | N/A | N/A |
| Detecting and Mitigating Hallucinations in Machine Translation: Model Internal Workings Alone Do Well, Sentence Similarity Even Better | David Dale, Elena Voita, Loic Barrault and Marta R. Costa-jussà | N/A | N/A |
| Analyzing Transformers in Embedding Space | Guy Dar, Mor Geva, Ankit Gupta and Jonathan Berant | N/A | N/A |
| Improving Pretraining Techniques for Code-Switched NLP | Richeek Das, Sahasra Ranjan, Shreya Pathak and Preethi Jyothi | N/A | N/A |
| Dependency resolution at the syntax-semantics interface: psycholinguistic and computational insights on control dependencies | Iria de-Dios-Flores, Juan Garcia Amboage and Marcos Garcia | N/A | N/A |
| Subset Retrieval Nearest Neighbor Machine Translation | Hiroyuki Deguchi, Taro Watanabe, Yusuke Matsui, Masao Utiyama, Hideki Tanaka and Eiichiro Sumita | N/A | N/A |
| SPEECH: Structured Prediction with Energy-Based Event-Centric Hyperspheres | Shumin Deng, Shengyu Mao, Ningyu Zhang and Bryan Hooi | N/A | N/A |
| Counterfactual Active Learning for Out-of-Distribution Generalization | Xun Deng, Wenjie Wang, Fuli Feng, Hanwang Zhang, Xiangnan He and Yong Liao | N/A | N/A |
| Knowledge-enhanced Mixed-initiative Dialogue System for Emotional Support Conversations | Yang Deng, Wenxuan Zhang, Yifei Yuan and Wai Lam | N/A | N/A |
| Product Question Answering in E-Commerce: A Survey | Yang Deng, Wenxuan Zhang, Qian Yu and Wai Lam | N/A | N/A |
| Towards Faithful Dialogues via Focus Learning | Yifan Deng, Xingsheng Zhang, Heyan Huang and Yue Hu | N/A | N/A |
| Bidirectional Generative Framework for Cross-domain Aspect-based Sentiment Analysis | Yue Deng, Wenxuan Zhang, Sinno Jialin Pan and Lidong Bing | N/A | N/A |
| Mixture-of-Domain-Adapters: Decoupling and Injecting Domain Knowledge to Pre-trained Language Models’ Memories | Shizhe Diao, Tianyang Xu, Ruijia Xu, Jiawei Wang and Tong Zhang | N/A | N/A |
| Is GPT-3 a Good Data Annotator? | BOSHENG DING, Chengwei Qin, Linlin Liu, Yew Ken Chia, Boyang Li, Shafiq Joty and Lidong Bing | N/A | N/A |
| MasakhaPOS: Part-of-Speech Tagging for Typologically Diverse African languages | Cheikh M. Bamba Dione, David Ifeoluwa Adelani, Peter Nabende, Jesujoba Alabi, Thapelo Sindane, Happy Buzaaba, Shamsuddeen Hassan Muhammad, Chris Chinenye Emezue, Perez Ogayo, Anuoluwapo Aremu, Catherine Gitau, Derguene Mbaye, Jonathan Mukiibi, Blessing Sibanda, Bonaventure F. P. Dossou, Andiswa Bukula, Rooweither Mabuya, Allahsera Auguste Tapo, Edwin Munkoh-Buabeng, victoire Memdjokam Koagne, Fatoumata Ouoba Kabore, Amelia Taylor, Godson KALIPE, Tebogo Macucwa, Vukosi Marivate, Tajuddeen Gwadabe, Mboning Tchiaze Elvis, Ikechukwu Onyenwe, Gratien Atindogbe, Tolulope Adelani, Idris Akinade, Olanrewaju Samuel, Marien NAHIMANA, Théogène MUSABEYEZU, Emile Niyomutabazi, Ester Chimhenga, Kudzai Gotosa, Patrick Mizha, Apelete AGBOLO, SEYDOU TRAORE, Chinedu Uchechukwu, Aliyu Yusuf, Muhammad Abdullahi and Dietrich Klakow | N/A | N/A |
| Unsupervised Open-domain Keyphrase Generation | Lam Do, Pritom Saha Akash and Kevin Chen-Chuan Chang | N/A | N/A |
| Modeling What-to-ask and How-to-ask for Answer-unaware Conversational Question Generation | Xuan Long Do, Bowei Zou, Shafiq Joty, Tran Tai, Liangming Pan, Nancy Chen and Ai Ti Aw | N/A | N/A |
| Towards Leaving No Indic Language Behind: Building Monolingual Corpora, Benchmark and Models for Indic Languages | Sumanth Doddapaneni, Rahul Aralikatte, Gowtham Ramesh, Shreya Goyal, Mitesh M. Khapra, Anoop Kunchukuttan and Pratyush Kumar | N/A | N/A |
| ColD Fusion: Collaborative Descent for Distributed Multitask Finetuning | Shachar Don-Yehiya, Elad Venezian, Colin Raffel, Noam Slonim and Leshem Choshen | N/A | N/A |
| Generalizing Backpropagation for Gradient-Based Interpretability | Kevin Du, Lucas Torroba Hennigen, Niklas Stoehr, Alex Warstadt and Ryan Cotterell | N/A | N/A |
| A Measure-Theoretic Characterization of Tight Language Models | Li Du, Lucas Torroba Hennigen, Tiago Pimentel, Clara Meister, Jason Eisner and Ryan Cotterell | N/A | N/A |
| Towards Stable Natural Language Understanding via Information Entropy Guided Debiasing | Li Du, Xiao Ding, Zhouhao Sun, Ting Liu, Bing Qin and Jingshuo Liu | N/A | N/A |
| StoryWars: A Dataset and Instruction Tuning Baselines for Collaborative Story Understanding and Generation | Yulun Du and Lydia Chilton | N/A | N/A |
| Measuring the Instability of Fine-Tuning | Yupei Du and Dong Nguyen | N/A | N/A |
| To Adapt or to Annotate: Challenges and Interventions for Domain Adaptation in Open-Domain Question Answering | Dheeru Dua, Emma Strubell, Sameer Singh and Pat Verga | N/A | N/A |
| MAD-TSC: A Multilingual Aligned News Dataset for Target-dependent Sentiment Classification | Evan Dufraisse, Adrian Popescu, Julien Tourille, Armelle Brun and Jerome Deshayes | N/A | N/A |
| SpeechMatrix: A Large-Scale Mined Corpus of Multilingual Speech-to-Speech Translations | Paul-Ambroise Duquenne, Hongyu Gong, Ning Dong, Jingfei Du, Ann Lee, Vedanuj Goswami, Changhan Wang, Juan Pino, Benoît Sagot and Holger Schwenk | N/A | N/A |
| Automatic Annotation of Direct Speech in Written French Narratives | Noé Durandard, Viet Anh TRAN, Gaspard Michel and Elena Epure | N/A | N/A |
| NLPeer: A Unified Resource for the Computational Study of Peer Review | Nils Dycke, Ilia Kuznetsov and Iryna Gurevych | N/A | N/A |
| How do humans perceive adversarial text? A reality check on the validity and naturalness of word-based adversarial attacks | Salijona Dyrmishi, Salah GHAMIZI and Maxime Cordy | N/A | N/A |
| HuCurl: Human-induced Curriculum Discovery | Mohamed Elgaar and Hadi Amiri | N/A | N/A |
| Injecting knowledge into language generation: a case study in auto-charting after-visit care instructions from medical dialogue | Maksim Eremeev, Ilya Valmianski, Xavier Amatriain and Anitha Kannan | N/A | N/A |
| StoryARG: a corpus of narratives and personal experiences in argumentative texts | Neele Falk and Gabriella Lapesa | N/A | N/A |
| MANNER: A Variational Memory-Augmented Model for Cross Domain Few-Shot Named Entity Recognition | Jinyuan Fang, Xiaobin Wang, Zaiqiao Meng, Pengjun Xie, Fei Huang and Yong Jiang | N/A | N/A |
| Understanding and Bridging the Modality Gap for Speech Translation | Qingkai Fang and Yang Feng | N/A | N/A |
| Back Translation for Speech-to-text Translation Without Transcripts | Qingkai Fang and Yang Feng | N/A | N/A |
| Towards Domain-Agnostic and Domain-Adaptive Dementia Detection from Spoken Language | Shahla Farzana and Natalie Parde | N/A | N/A |
| Cross-lingual Science Journalism: Select, Simplify and Rewrite Summaries for Non-expert Readers | Mehwish Fatima and Michael Strube | N/A | N/A |
| Scene Graph as Pivoting: Inference-time Image-free Unsupervised Multimodal Machine Translation with Visual Scene Hallucination | Hao Fei, Qian Liu, Meishan Zhang, Min Zhang and Tat-Seng Chua | N/A | N/A |
| Mitigating Label Biases for In-context Learning | Yu Fei, Yifan Hou, Zeming Chen and Antoine Bosselut | N/A | N/A |
| Enhancing Grammatical Error Correction Systems with Explanations | Yuejiao Fei, Leyang Cui, Sen Yang, Wai Lam, Zhenzhong Lan and Shuming Shi | N/A | N/A |
| WinoQueer: A Community-in-the-Loop Benchmark for Anti-LGBTQ+ Bias in Large Language Models | Virginia Felkner, Ho-Chun Herbert Chang, Eugene Jang and Jonathan May | N/A | N/A |
| Joint Constrained Learning with Boundary-adjusting for Emotion-Cause Pair Extraction | Huawen Feng, Junlong Liu, Junhao Zheng, Haibin Chen, Xichen Shang and Qianli Ma | N/A | N/A |
| MMDialog: A Large-scale Multi-turn Dialogue Dataset Towards Multi-modal Open-domain Conversation | Jiazhan Feng, Qingfeng Sun, Can Xu, Pu Zhao, Yaming Yang, Chongyang Tao, Dongyan Zhao and Qingwei Lin | N/A | N/A |
| KALM: Knowledge-Aware Integration of Local, Document, and Global Contexts for Long Document Understanding | Shangbin Feng, Zhaoxuan Tan, Wenqian Zhang, Zhenyu Lei and Yulia Tsvetkov | N/A | N/A |
| From Pretraining Data to Language Models to Downstream Tasks: Tracking the Trails of Political Biases Leading to Unfair NLP Models | Shangbin Feng, Chan Young Park, Yuhan Liu and Yulia Tsvetkov | N/A | N/A |
| Generic Temporal Reasoning with Differential Analysis and Explanation | Yu Feng, Ben Zhou, Haoyu Wang, Helen Jin and Dan Roth | N/A | N/A |
| Schema-Guided User Satisfaction Modeling for Task-Oriented Dialogues | Yue Feng, Yunlong Jiao, Animesh Prasad, Nikolaos Aletras, Emine Yilmaz and Gabriella Kazai | N/A | N/A |
| DuNST: Dual Noisy Self Training for Semi-Supervised Controllable Text Generation | Yuxi Feng, Xiaoyuan Yi, Xiting Wang, Laks V.S. Lakshmanan and Xing Xie | N/A | N/A |
| When Does Translation Require Context? A Data-driven, Multilingual Exploration | Patrick Fernandes, Kayo Yin, Emmy Liu, André Martins and Graham Neubig | N/A | N/A |
| Explaining How Transformers Use Context to Build Predictions | Javier Ferrando, Gerard I. Gállego, Ioannis Tsiamas and Marta R. Costa-jussà | N/A | N/A |
| Don’t Forget Your ABC’s: Evaluating the State-of-the-Art in Chat-Oriented Dialogue Systems | Sarah E. Finch, James D. Finch and Jinho D. Choi | N/A | N/A |
| MASSIVE: A 1M-Example Multilingual Natural Language Understanding Dataset with 51 Typologically-Diverse Languages | Jack FitzGerald, Christopher Hench, Charith Peris, Scott Mackie, Kay Rottmann, Ana Sanchez, Aaron Nash, Liam Urbach, Vishesh Kakarala, Richa Singh, Swetha Ranganath, Laurie Crist, Misha Britan, Wouter Leeuwis, Gokhan Tur and Prem Natarajan | N/A | N/A |
| FairPrism: Evaluating Fairness-Related Harms in Text Generation | Eve Fleisig, Aubrie Amstutz, Chad Atalla, Su Lin Blodgett, Hal Daumé III, Alexandra Olteanu, Emily Sheng, Dan Vann and Hanna Wallach | N/A | N/A |
| Matching Pairs: Attributing Fine-Tuned Models to their Pre-Trained Large Language Models | Myles Foley, Ambrish Rawat, Taesung Lee, Yufang Hou, Gabriele Picco and Giulio Zizzo | N/A | N/A |
| EPIC: Multi-Perspective Annotation of a Corpus of Irony | Simona Frenda, Alessandro Pedrani, Valerio Basile, Soda Marem Lo, Alessandra Teresa Cignarella, Raffaella Panizzon, Cristina Marco, Bianca Scarlini, Viviana Patti, Cristina Bosco and Davide Bernardi | N/A | N/A |
| Conflicts, Villains, Resolutions: Towards models of Narrative Media Framing | Lea Frermann, Jiatong Li, Shima Khanehzar and Gosia Mikolajczak | N/A | N/A |
| On the Compositional Generalization in Versatile Open-domain Dialogue | Tingchen Fu, Xueliang Zhao, Lemao Liu and Rui Yan | N/A | N/A |
| Tackling Ambiguity with Images: Improved Multimodal Machine Translation and Contrastive Evaluation | Matthieu Futeral, Cordelia Schmid, Ivan Laptev, Benoît Sagot and Rachel Bawden | N/A | N/A |
| Learning Answer Generation using Supervision from Automatic Question Answering Evaluators | Matteo Gabburo, Siddhant Garg, Rik Koncel-Kedziorski and Alessandro Moschitti | N/A | N/A |
| Question-Answering in a Low-resourced Language: Benchmark Dataset and Models for Tigrinya | Fitsum Gaim, Wonsuk Yang, Hancheol Park and Jong Park | N/A | N/A |
| Annotating Mentions Alone Enables Efficient Domain Adaptation for Coreference Resolution | Nupoor Gandhi, Anjalie Field and Emma Strubell | N/A | N/A |
| LiveChat: A Large-Scale Personalized Dialogue Dataset Automatically Constructed from Live Streaming | Jingsheng Gao, Yixin Lian, Ziyi Zhou, Yuzhuo Fu and Baoyuan Wang | N/A | N/A |
| RARR: Researching and Revising What Language Models Say, Using Language Models | Luyu Gao, Zhuyun Dai, Panupong Pasupat, Anthony Chen, Arun Tejasvi Chaganty, Yicheng Fan, Vincent Zhao, Ni Lao, Hongrae Lee, Da-Cheng Juan and Kelvin Guu | N/A | N/A |
| Precise Zero-Shot Dense Retrieval without Relevance Labels | Luyu Gao, Xueguang Ma, Jimmy Lin and Jamie Callan | N/A | N/A |
| Reference Matters: Benchmarking Factual Error Correction for Dialogue Summarization with Fine-grained Evaluation Framework | Mingqi Gao, Xiaojun Wan, Jia Su, Zhefeng Wang and Baoxing Huai | N/A | N/A |
| Dialogue Summarization with Static-Dynamic Structure Fusion Graph | Shen Gao, Xin Cheng, Mingzhe Li, Xiuying Chen, Jinpeng Li, Dongyan Zhao and Rui Yan | N/A | N/A |
| PeaCoK: Persona Commonsense Knowledge for Consistent and Engaging Narratives | Silin Gao, Beatriz Borges, Soyoung Oh, Deniz Bayazit, Saya Kanno, Hiromi Wakaki, Yuki Mitsufuji and Antoine Bosselut | N/A | N/A |
| DSRM: Boost Textual Adversarial Training with Distribution Shift Risk Minimization | SongYang Gao, Shihan Dou, Yan Liu, Xiao Wang, Qi Zhang, Zhongyu Wei, Jin Ma and Ying Shan | N/A | N/A |
| Small Pre-trained Language Models Can be Fine-tuned as Large Models via Over-Parameterization | Ze-Feng Gao, Kun Zhou, Peiyu Liu, Wayne Xin Zhao and Ji-Rong Wen | N/A | N/A |
| Entailment as Robust Self-Learner | Jiaxin Ge, Hongyin Luo, Yoon Kim and James Glass | N/A | N/A |
| Compounding Geometric Operations for Knowledge Graph Completion | Xiou Ge, Yun Cheng Wang, Bin Wang and C.-C. Jay Kuo | N/A | N/A |
| The Benefits of Bad Advice: Autocontrastive Decoding across Model Layers | Ariel Gera, Roni Friedman, Ofir Arviv, Chulaka Gunasekara, Benjamin Sznajder, Noam Slonim and Eyal Shnarch | N/A | N/A |
| ACCENT: An Automatic Event Commonsense Evaluation Metric for Open-Domain Dialogue Systems | Sarik Ghazarian, Yijia Shao, Rujun Han, Aram Galstyan and Nanyun Peng | N/A | N/A |
| ACLM: A Selective-Denoising based Generative Data Augmentation Approach for Low-Resource Complex NER | Sreyan Ghosh, Utkarsh Tyagi, Manan Suri, Sonal Kumar, Ramaneswaran S and Dinesh Manocha | N/A | N/A |
| Multitask Pretraining with Structured Knowledge for Text-to-SQL Generation | Robert Giaquinto, Dejiao Zhang, Benjamin Kleiner, Yang Li, Ming Tan, Parminder Bhatia, Ramesh Nallapati and Xiaofei Ma | N/A | N/A |
| Interpretable Word Sense Representations via Definition Generation: The Case of Semantic Change Analysis | Mario Giulianelli, Iris Luden, Raquel Fernandez and Andrey Kutuzov | N/A | N/A |
| Model-Generated Pretraining Signals Improves Zero-Shot Generalization of Text-to-Text Transformers | Linyuan Gong, Chenyan Xiong, Xiaodong Liu, Payal Bajaj, Yiqing Xie, Alvin Cheung, Jianfeng Gao and Xia Song | N/A | N/A |
| MvP: Multi-view Prompting Improves Aspect Sentiment Tuple Prediction | Zhibin Gou, Qingyan Guo and Yujiu Yang | N/A | N/A |
| Factual or Contextual? Disentangling Error Types in Entity Description Generation | Navita Goyal, Ani Nenkova and Hal Daumé III | N/A | N/A |
| Massively Multilingual Lexical Specialization of Multilingual Transformers | Tommaso Green, Simone Paolo Ponzetto and Goran Glavaš | N/A | N/A |
| Rogue Scores | Max Grusky | N/A | N/A |
| GIFT: Graph-Induced Fine-Tuning for Multi-Party Conversation Understanding | Jia-Chen Gu, Zhenhua Ling, Quan Liu, Cong Liu and Guoping Hu | N/A | N/A |
| A Gradient Control Method for Backdoor Attacks on Parameter-Efficient Tuning | Naibin Gu, Peng Fu, Xiyu Liu, Zhengxiao Liu, Zheng Lin and Weiping Wang | N/A | N/A |
| Don’t Generate, Discriminate: A Proposal for Grounding Language Models to Real-World Environments | Yu Gu, Xiang Deng and Yu Su | N/A | N/A |
| Do language models have coherent mental models of everyday things? | Yuling Gu, Bhavana Dalvi Mishra and Peter Clark | N/A | N/A |
| Pre-Training to Learn in Context | Yuxian Gu, Li Dong, Furu Wei and Minlie Huang | N/A | N/A |
| Controllable Text Generation via Probability Density Estimation in the Latent Space | Yuxuan Gu, Xiaocheng Feng, Sicheng Ma, Lingyuan Zhang, Heng Gong, Weihong Zhong and Bing Qin | N/A | N/A |
| On the Evaluation of Neural Selective Prediction Methods for Natural Language Processing | Zhengyao Gu and Mark Hopkins | N/A | N/A |
| Optimal Transport for Unsupervised Hallucination Detection in Neural Machine Translation | Nuno M. Guerreiro, Pierre Colombo, Pablo Piantanida and André Martins | N/A | N/A |
| HiFi: High-Information Attention Heads Hold for Parameter-Efficient Model Adaptation | Anchun Gui and Han Xiao | N/A | N/A |
| Visually-augmented pretrained language models for NLP tasks without images | Hangyu Guo, Kun Zhou, Wayne Xin Zhao, Qinyu Zhang and Ji-Rong Wen | N/A | N/A |
| Decoding Symbolism in Language Models | Meiqi Guo, Rebecca Hwa and Adriana Kovashka | N/A | N/A |
| Dual Cache for Long Document Neural Coreference Resolution | Qipeng Guo, Xiangkun Hu, Yue Zhang, Xipeng Qiu and Zheng Zhang | N/A | N/A |
| Learning Optimal Policy for Simultaneous Machine Translation via Binary Search | Shoutao Guo, Shaolei Zhang and Yang Feng | N/A | N/A |
| Counterfactual Multihop QA: A Cause-Effect Approach for Reducing Disconnected Reasoning | Wangzhen Guo, Qinkang Gong, Yanghui Rao and Hanjiang Lai | N/A | N/A |
| Analyzing and Reducing the Performance Gap in Cross-Lingual Transfer with Fine-tuning Slow and Fast | Yiduo Guo, Yaobo Liang, Dongyan Zhao, Bing Liu and Nan Duan | N/A | N/A |
| Bi-Phone: Modeling Inter Language Phonetic Influences in Text | Abhirut Gupta, Ananya B. Sai, Richard Sproat, Yuri Vasilevski, James Ren, Ambarish Jash, Sukhdeep Sodhi and Aravindan Raghuveer | N/A | N/A |
| Don’t Retrain, Just Rewrite: Countering Adversarial Perturbations by Rewriting Text | Ashim Gupta, Carter Blum, Temma Choji, Yingjie Fei, Shalin Shah, Alakananda Vempala and Vivek Srikumar | N/A | N/A |
| Counterspeeches up my sleeve! Intent Distribution Learning and Persistent Fusion for Intent-Conditioned Counterspeech Generation | Rishabh Gupta, Shaily Desai, Manvi Goel, Anil Bandhakavi, Tanmoy Chakraborty and Md. Shad Akhtar | N/A | N/A |
| DiSCoMaT: Distantly Supervised Composition Extraction from Tables in Materials Science Articles | Tanishq Gupta, Mohd Zaki, Devanshi Khatsuriya, Kausik Hira, N M Anoop Krishnan and Mausam | N/A | N/A |
| Linguistic representations for fewer-shot relation extraction across domains | Sireesh Gururaja, Ritam Dutt, Tinglong Liao and Carolyn Rosé | N/A | N/A |
| Hybrid Knowledge Transfer for Improved Cross-Lingual Event Detection via Hierarchical Sample Selection | Luis Guzman Nateras, Franck Dernoncourt and Thien Nguyen | N/A | N/A |
| Improving the Detection of Multilingual Online Attacks with Rich Social Media Data from Singapore | Janosch Haber, Bertie Vidgen, Matthew Chapman, Vibhor Agarwal, Roy Ka-Wei Lee, Yong Keong Yap and Paul Röttger | N/A | N/A |
| Text Style Transfer with Contrastive Transfer Pattern Mining | Jingxuan Han, Quan Wang, Licheng Zhang, Weidong Chen, Yan Song and Zhendong Mao | N/A | N/A |
| SSD-LM: Semi-autoregressive Simplex-based Diffusion Language Model for Text Generation and Modular Control | Xiaochuang Han, Sachin Kumar and Yulia Tsvetkov | N/A | N/A |
| Understanding In-Context Learning via Supportive Pretraining Data | Xiaochuang Han, Daniel Simig, Todor Mihaylov, Yulia Tsvetkov, Asli Celikyilmaz and Tianlu Wang | N/A | N/A |
| bgGLUE: A Bulgarian General Language Understanding Evaluation Benchmark | Momchil Hardalov, Pepa Atanasova, Todor Mihaylov, Galia Angelova, Kiril Simov, Petya Osenova, Veselin Stoyanov, Ivan Koychev, Preslav Nakov and Dragomir Radev | N/A | N/A |
| Neural Unsupervised Reconstruction of Protolanguage Word Forms | Andre He, Nicholas Tomlin and Dan Klein | N/A | N/A |
| Z-Code++: A Pre-trained Language Model Optimized for Abstractive Summarization | Pengcheng He, Baolin Peng, Song Wang, Yang Liu, Ruochen Xu, Hany Hassan, Yu Shi, Chenguang Zhu, Wayne Xiong, Michael Zeng, Jianfeng Gao and Xuedong Huang | N/A | N/A |
| HAUSER: Towards Holistic and Automatic Evaluation of Simile Generation | Qianyu He, Yikai Zhang, Jiaqing Liang, Yuncheng Huang, Yanghua Xiao and Yunwen Chen | N/A | N/A |
| PAD-Net: An Efficient Framework for Dynamic Networks | Shwai He, Liang Ding, Daize Dong, Boan Liu, Fuqiang Yu and Dacheng Tao | N/A | N/A |
| On the Blind Spots of Model-Based Evaluation Metrics for Text Generation | Tianxing He, Jingyu Zhang, Tianle Wang, Sachin Kumar, Kyunghyun Cho, James Glass and Yulia Tsvetkov | N/A | N/A |
| HermEs: Interactive Spreadsheet Formula Prediction via Hierarchical Formulet Expansion | Wanrong He, Haoyu Dong, Yihuai Gao, Zhichao Fan, Xingzhuo Guo, Zhitao Hou, Xiao Lv, Ran Jia, Shi Han and Dongmei Zhang | N/A | N/A |
| Exploring the Capacity of Pretrained Language Models for Reasoning about Actions and Change | Weinan He, Canming Huang, Zhanhao Xiao and Yongmei Liu | N/A | N/A |
| Revisiting Event Argument Extraction: Can EAE Models Learn Better When Being Aware of Event Co-occurrences? | Yuxin He, Jingyue Hu and Buzhou Tang | N/A | N/A |
| Targeted Data Generation: Finding and Fixing Model Weaknesses | Zexue He, Marco Tulio Ribeiro and Fereshte Khani | N/A | N/A |
| DiffusionBERT: Improving Generative Masked Language Models with Diffusion Models | Zhengfu He, Tianxiang Sun, Qiong Tang, Kuanning Wang, Xuanjing Huang and Xipeng Qiu | N/A | N/A |
| UMRSpell: Unifying the Detection and Correction Parts of Pre-trained Models towards Chinese Missing, Redundant, and Spelling Correction | Zheyu He, Yujin Zhu, Linlin Wang and Liang Xu | N/A | N/A |
| Language of Bargaining | Mourad Heddaya, Solomon Dworkin, Chenhao Tan, Rob Voigt and Alexander Zentefis | N/A | N/A |
| DAMP: Doubly Aligned Multilingual Parser for Task-Oriented Dialogue | William Held, Christopher Hidey, Fei Liu, Eric Zhu, Rahul Goel, Diyi Yang and Rushin Shah | N/A | N/A |
| MultiTACRED: A Multilingual Version of the TAC Relation Extraction Dataset | Leonhard Hennig, Philippe Thomas and Sebastian Möller | N/A | N/A |
| Comparative evaluation of boundary-relaxed annotation for Entity Linking performance | Gabriel Herman Bernardim Andrade, Shuntaro Yada and Eiji Aramaki | N/A | N/A |
| Do Androids Laugh at Electric Sheep? Humor "Understanding" Benchmarks from The New Yorker Caption Contest | Jack Hessel, Ana Marasovic, Jena D. Hwang, Lillian Lee, Jeff Da, Rowan Zellers, Robert Mankoff and Yejin Choi | N/A | N/A |
| Backpack Language Models | John Hewitt, John Thickstun, Christopher Manning and Percy Liang | N/A | N/A |
| Empowering Cross-lingual Behavioral Testing of NLP Models with Typological Features | Ester Hlavnova and Sebastian Ruder | N/A | N/A |
| Large Language Models Are Reasoning Teachers | Namgyu Ho, Laura Schmid and Se-Young Yun | N/A | N/A |
| My side, your side and the evidence: Discovering aligned actor groups and the narratives they weave | Pavan Holur, David Chong, Timothy Tangherlini and Vwani Roychowdhury | N/A | N/A |
| Faithful Question Answering with Monte-Carlo Planning | Ruixin Hong, Hongming Zhang, Hong Zhao, Dong Yu and Changshui Zhang | N/A | N/A |
| Unnatural Instructions: Tuning Language Models with (Almost) No Human Labor | Or Honovich, Thomas Scialom, Omer Levy and Timo Schick | N/A | N/A |
| Instruction Induction: From Few Examples to Natural Language Task Descriptions | Or Honovich, Uri Shaham, Samuel R. Bowman and Omer Levy | N/A | N/A |
| Attributable and Scalable Opinion Summarization | Tom Hosking, Hao Tang and Mirella Lapata | N/A | N/A |
| MISGENDERED: Limits of Large Language Models in Understanding Pronouns | Tamanna Hossain, Sunipa Dev and Sameer Singh | N/A | N/A |
| Resolving Indirect Referring Expressions for Entity Selection | Mohammad Javad Hosseini, Filip Radlinski, Silvia Pareti and Annie Louis | N/A | N/A |
| ORGAN: Observation-Guided Radiology Report Generation via Tree Reasoning | Wenjun Hou, Kaishuai Xu, Yi Cheng, Wenjie Li and Jiang Liu | N/A | N/A |
| CONE: An Efficient COarse-to-fiNE Alignment Framework for Long Video Temporal Grounding | Zhijian Hou, Wanjun Zhong, Lei Ji, Difei Gao, Kun Yan, W.K. Chan, Chong-Wah Ngo, Mike Zheng Shou and Nan Duan | N/A | N/A |
| TAGPRIME: A Unified Framework for Relational Structure Extraction | I-Hung Hsu, Kuan-Hao Huang, Shuning Zhang, Wenxin Cheng, Prem Natarajan, Kai-Wei Chang and Nanyun Peng | N/A | N/A |
| AMPERE: AMR-Aware Prefix for Generation-Based Event Argument Extraction Model | I-Hung Hsu, Zhiyu Xie, Kuan-Hao Huang, Prem Natarajan and Nanyun Peng | N/A | N/A |
| InfoMetIC: An Informative Metric for Reference-free Image Caption Evaluation | Anwen Hu, Shizhe Chen, Liang Zhang and Qin Jin | N/A | N/A |
| Supervised Adversarial Contrastive Learning for Emotion Recognition in Conversations | Dou Hu, Yinan Bao, Lingwei Wei, Wei Zhou and Songlin Hu | N/A | N/A |
| A fine-grained comparison of pragmatic language understanding in humans and language models | Jennifer Hu, Sammy Floyd, Olessia Jouravlev, Evelina Fedorenko and Edward Gibson | N/A | N/A |
| Won’t Get Fooled Again: Answering Questions with False Premises | Shengding Hu, Yifan Luo, Huadong Wang, Xingyi Cheng, Zhiyuan Liu and Maosong Sun | N/A | N/A |
| In-Context Analogical Reasoning with Pre-Trained Language Models | Xiaoyang Hu, Shane Storks, Richard Lewis and Joyce Chai | N/A | N/A |
| MeetingBank: A Benchmark Dataset for Meeting Summarization | Yebowen Hu, Timothy Ganter, Hanieh Deilamsalehy, Franck Dernoncourt, Hassan Foroosh and Fei Liu | N/A | N/A |
| Hearing Lips in Noise: Universal Viseme-Phoneme Mapping and Transfer for Robust Audio-Visual Speech Recognition | Yuchen Hu, Ruizhe Li, Chen Chen, Chengwei Qin, Qiu-Shi Zhu and Eng Siong Chng | N/A | N/A |
| MIR-GAN: Refining Frame-Level Modality-Invariant Representations with Adversarial Network for Audio-Visual Speech Recognition | Yuchen Hu, Chen Chen, Ruizhe Li, Heqing Zou and Eng Siong Chng | N/A | N/A |
| Semantic Structure Enhanced Event Causality Identification | Zhilei Hu, Zixuan Li, Xiaolong Jin, Long Bai, Saiping Guan, Jiafeng Guo and Xueqi Cheng | N/A | N/A |
| Improving Translation Quality Estimation with Bias Mitigation | Hui Huang, Shuangzhi Wu, Kehai Chen, Hui Di, Muyun Yang and Tiejun Zhao | N/A | N/A |
| Knowledge Transfer in Incremental Learning for Multilingual Neural Machine Translation | Kaiyu Huang, Peng Li, Jin Ma, Ting Yao and Yang Liu | N/A | N/A |
| ParaAMR: A Large-Scale Syntactically Diverse Paraphrase Dataset by AMR Back-Translation | Kuan-Hao Huang, Varun Iyer, I-Hung Hsu, Anoop Kumar, Kai-Wei Chang and Aram Galstyan | N/A | N/A |
| Faking Fake News for Real Fake News Detection: Propaganda-Loaded Training Data Generation | Kung-Hsiang Huang, Kathleen McKeown, Preslav Nakov, Yejin Choi and Heng Ji | N/A | N/A |
| Zero-shot Faithful Factual Error Correction | Kung-Hsiang Huang, Hou Pong Chan and Heng Ji | N/A | N/A |
| More than Classification: A Unified Framework for Event Temporal Relation Extraction | Quzhe Huang, Yutong Hu, Shengqi Zhu, Yansong Feng, Chang Liu and Dongyan Zhao | N/A | N/A |
| AV-TranSpeech: Audio-Visual Robust Speech-to-Speech Translation | Rongjie Huang, Huadai Liu, Xize Cheng, Yi Ren, Linjun Li, Zhenhui Ye, Jinzheng He, Lichao Zhang, Jinglin Liu, Xiang Yin and Zhou Zhao | N/A | N/A |
| An Extensible Plug-and-Play Method for Multi-Aspect Controllable Text Generation | Xuancheng Huang, Zijun Liu, Peng Li, Tao Li, Maosong Sun and Yang Liu | N/A | N/A |
| Towards Higher Pareto Frontier in Multilingual Machine Translation | Yichong Huang, Xiaocheng Feng, Xinwei Geng, Baohang Li and Bing Qin | N/A | N/A |
| MVP-Tuning: Multi-View Knowledge Retrieval with Prompt Tuning for Commonsense Reasoning | Yongfeng Huang, Yanyang Li, Yichong Xu, Lin Zhang, Ruyi Gan, Jiaxing Zhang and Liwei Wang | N/A | N/A |
| REDFM: a Filtered and Multilingual Relation Extraction Dataset | Pere-Lluís Huguet Cabot, Simone Tedeschi, Axel-Cyrille Ngonga Ngomo and Roberto Navigli | N/A | N/A |
| Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages | Ayyoob ImaniGooghari, Peiqin Lin, Amir Hossein Kargaran, Silvia Severini, Masoud Jalili Sabet, Nora Kassner, Chunlan Ma, Helmut Schmid, André Martins, François Yvon and Hinrich Schütze | N/A | N/A |
| UnitY: Two-pass Direct Speech-to-speech Translation with Discrete Units | Hirofumi Inaguma, Sravya Popuri, Ilia Kulikov, Peng-Jen Chen, Changhan Wang, Yu-An Chung, Yun Tang, Ann Lee, Shinji Watanabe and Juan Pino | N/A | N/A |
| Byte-Level Grammatical Error Correction Using Synthetic and Curated Corpora | Svanhvít Lilja Ingólfsdóttir, Petur Ragnarsson, Haukur Jónsson, Haukur Simonarson, Vilhjalmur Thorsteinsson and Vésteinn Snæbjarnarson | N/A | N/A |
| DARE: Towards Robust Text Explanations in Biomedical and Healthcare Applications | Adam Ivankay, Mattia Rigotti and Pascal Frossard | N/A | N/A |
| HINT: Hypernetwork Instruction Tuning for Efficient Zero- and Few-Shot Generalisation | Hamish Ivison, Akshita Bhagia, Yizhong Wang, Hannaneh Hajishirzi and Matthew Peters | N/A | N/A |
| ContraCLM: Contrastive Learning For Causal Language Model | Nihal Jain, Dejiao Zhang, Wasi Uddin Ahmad, Zijian Wang, Feng Nan, Xiaopeng Li, Ming Tan, Ramesh Nallapati, Baishakhi Ray, Parminder Bhatia, Xiaofei Ma and Bing Xiang | N/A | N/A |
| Knowledge Unlearning for Mitigating Privacy Risks in Language Models | Joel Jang, Dongkeun Yoon, Sohee Yang, Sungmin Cha, Moontae Lee, Lajanugen Logeswaran and Minjoon Seo | N/A | N/A |
| SeeGULL: A Stereotype Benchmark with Broad Geo-Cultural Coverage Leveraging Generative Models | Akshita Jha, Aida Mostafazadeh Davani, Chandan K Reddy, Shachi Dave, Vinodkumar Prabhakaran and Sunipa Dev | N/A | N/A |
| Hierarchical Verbalizer for Few-Shot Hierarchical Text Classification | Ke Ji, Yixin Lian, Jingsheng Gao and Baoyuan Wang | N/A | N/A |
| In-sample Curriculum Learning by Sequence Completion for Natural Language Generation | Qi Jia, Yizhu Liu, Haifeng Tang and Kenny Zhu | N/A | N/A |
| Modeling Instance Interactions for Joint Information Extraction with Neural High-Order Conditional Random Field | Zixia Jia, Zhaohui Yan, Wenjuan Han, Zilong Zheng and Kewei Tu | N/A | N/A |
| Vision Language Pre-training by Contrastive Learning with Cross-Modal Similarity Regulation | Chaoya Jiang, Wei Ye, Haiyang Xu, Songfang Huang, Fei Huang and Shikun Zhang | N/A | N/A |
| Recall, Expand, and Multi-Candidate Cross-Encode: Fast and Accurate Ultra-Fine Entity Typing | Chengyue Jiang, Wenyang Hui, Yong Jiang, Xiaobin Wang, Pengjun Xie and Kewei Tu | N/A | N/A |
| LLM-Blender: Ensembling Large Language Models with Pairwise Ranking and Generative Fusion | Dongfu Jiang, Xiang Ren and Bill Yuchen Lin | N/A | N/A |
| A Cognitive Stimulation Dialogue System with Multi-source Knowledge Fusion for Elders with Cognitive Impairment | Jiyue Jiang, Sheng Wang, Qintong Li, Lingpeng Kong and Chuan Wu | N/A | N/A |
| Pruning Pre-trained Language Models Without Fine-Tuning | Ting Jiang, Deqing Wang, Fuzhen Zhuang, Ruobing Xie and Feng Xia | N/A | N/A |
| Discourse-Centric Evaluation of Document-level Machine Translation with a New Densely Annotated Parallel Corpus of Novels | Yuchen Eleanor Jiang, Tianyu Liu, Shuming Ma, Dongdong Zhang, Mrinmaya Sachan and Ryan Cotterell | N/A | N/A |
| Improving Domain Generalization for Prompt-Aware Essay Scoring via Disentangled Representation Learning | Zhiwei Jiang, Tianyi Gao, Yafeng Yin, Meng Liu, Hua Yu, Zifeng Cheng and Qing Gu | N/A | N/A |
| Patton: Language Model Pretraining on Text-Rich Networks | Bowen Jin, Wentao Zhang, Yu Zhang, Yu Meng, Xinyang Zhang, Qi Zhu and Jiawei Han | N/A | N/A |
| DarkBERT: A Language Model for the Dark Side of the Internet | Youngjin Jin, Eugene Jang, Jian Cui, Jin-Woo Chung, Yongjae Lee and Seungwon Shin | N/A | N/A |
| Multi-source Semantic Graph-based Multimodal Sarcasm Explanation Generation | Liqiang Jing, Xuemeng Song, Kun Ouyang, Mengzhao Jia and Liqiang Nie | N/A | N/A |
| MS-DETR: Natural Language Video Localization with Sampling Moment-Moment Interaction | Wang Jing, Aixin Sun, Hao Zhang and Xiaoli Li | N/A | N/A |
| Breeding Machine Translations: Evolutionary approach to survive and thrive in the world of automated evaluation | Josef Jon and Ondřej Bojar | N/A | N/A |
| U-CREAT: Unsupervised Case Retrieval using Events extrAcTion | Abhinav Joshi, Akshat Sharma, Sai Kiran Tanikella and Ashutosh Modi | N/A | N/A |
| Are Machine Rationales (Not) Useful to Humans? Measuring and Improving Human Utility of Free-text Rationales | Brihi Joshi, Ziyi Liu, Sahana Ramnath, Aaron Chan, Zhewei Tong, Shaoliang Nie, Qifan Wang, Yejin Choi and Xiang Ren | N/A | N/A |
| ArgAnalysis35K: A large-scale dataset for Argument Quality Analysis | Omkar Joshi, Priya Pitre and Yashodhara Haribhakta | N/A | N/A |
| A Compare-and-contrast Multistage Pipeline for Uncovering Financial Signals in Financial Reports | Jia-Huei Ju, Yu-Shiang Huang, Cheng-Wei Lin, Che Lin and Chuan-Ju Wang | N/A | N/A |
| Node Placement in Argument Maps: Modeling Unidirectional Relations in High & Low-Resource Scenarios | Iman Jundi, Neele Falk, Eva Maria Vecchi and Gabriella Lapesa | N/A | N/A |
| Your spouse needs professional help: Determining the Contextual Appropriateness of Messages through Modeling Social Relationships | David Jurgens, Agrima Seth, Jackson Sargent, Athena Aghighi and Michael Geraci | N/A | N/A |
| Evaluating Open-Domain Question Answering in the Era of Large Language Models | Ehsan Kamalloo, Nouha Dziri, Charles Clarke and Davood Rafiei | N/A | N/A |
| Distill or Annotate? Cost-Efficient Fine-Tuning of Compact Models | Junmo Kang, Wei Xu and Alan Ritter | N/A | N/A |
| LAMBADA: Backward Chaining for Automated Reasoning in Natural Language | Mehran Kazemi, Najoung Kim, Deepti Bhatia, Xin Xu and Deepak Ramachandran | N/A | N/A |
| DecompEval: Evaluating Generated Texts as Unsupervised Decomposed Question Answering | Pei Ke, Fei Huang, Fei Mi, Yasheng Wang, Qun Liu, Xiaoyan Zhu and Minlie Huang | N/A | N/A |
| Few-shot Reranking for Multi-hop QA via Language Model Prompting | Muhammad Khalifa, Lajanugen Logeswaran, Moontae Lee, Honglak Lee and Lu Wang | N/A | N/A |
| A Cautious Generalization Goes a Long Way: Learning Morphophonological Rules | Salam Khalifa, Sarah Payne, Jordan Kodner, Ellen Broselow and Owen Rambow | N/A | N/A |
| ExplainMeetSum: A Dataset for Explainable Meeting Summarization Aligned with Human Intent | Hyun Kim, Minsoo Cho and Seung-Hoon Na | N/A | N/A |
| Automatic Creation of Named Entity Recognition Datasets by Querying Phrase Representations | Hyunjae Kim, Jaehyo Yoo, Seunghyun Yoon and Jaewoo Kang | N/A | N/A |
| infoVerse: A Universal Framework for Dataset Characterization with Multidimensional Meta-information | Jaehyung Kim, Yekyung Kim, Karin de Langis, Jinwoo Shin and Dongyeop Kang | N/A | N/A |
| FactKG: Fact Verification via Reasoning on Knowledge Graphs | Jiho Kim, Sungjin Park, Yeonsu Kwon, Yohan Jo, James Thorne and Edward Choi | N/A | N/A |
| (QA)^2: Question Answering with Questionable Assumptions | Najoung Kim, Phu Mon Htut, Samuel R. Bowman and Jackson Petty | N/A | N/A |
| Entity Tracking in Language Models | Najoung Kim and Sebastian Schuster | N/A | N/A |
| Clinical Note Owns its Hierarchy: Multi-Level Hypergraph Neural Networks for Patient-Level Representation Learning | Nayeon Kim, Yinhua Piao and Sun Kim | N/A | N/A |
| miCSE: Mutual Information Contrastive Learning for Low-shot Sentence Embeddings | Tassilo Klein and Moin Nabi | N/A | N/A |
| PairSpanBERT: An Enhanced Language Model for Bridging Resolution | Hideo Kobayashi, Yufang Hou and Vincent Ng | N/A | N/A |
| Morphological Inflection: A Reality Check | Jordan Kodner, Sarah Payne, Salam Khalifa and Zoey Liu | N/A | N/A |
| Memory-efficient NLLB-200: Language-specific Expert Pruning of a Massively Multilingual Machine Translation Model | Yeskendir Koishekenov, Alexandre Berard and Vassilina Nikoulina | N/A | N/A |
| PromptRank: Unsupervised Keyphrase Extraction Using Prompt | Aobo Kong, Shiwan Zhao, Hao Chen, Qicheng Li, Yong Qin, Ruiqi Sun and Xiaoyan Bai | N/A | N/A |
| Improving the robustness of NLI models with minimax training | Michalis Korakakis and Andreas Vlachos | N/A | N/A |
| DITTO: Data-efficient and Fair Targeted Subset Selection for ASR Accent Adaptation | Suraj Kothawade, Anmol Mekala, D. Chandra Sekhara Hetha Havya, Mayank Kothyari, Rishabh Iyer, Ganesh Ramakrishnan and Preethi Jyothi | N/A | N/A |
| Downstream Datasets Make Surprisingly Good Pretraining Corpora | Kundan Krishna, Saurabh Garg, Jeffrey Bigham and Zachary Lipton | N/A | N/A |
| Multi-Row, Multi-Span Distant Supervision For Table+Text Question Answering | Vishwajeet Kumar, Yash Gupta, Saneem Chemmengath, Jaydeep Sen, Soumen Chakrabarti, Samarth Bharadwaj and Feifei Pan | N/A | N/A |
| An Inclusive Notion of Text | Ilia Kuznetsov and Iryna Gurevych | N/A | N/A |
| Language Detoxification with Attribute-Discriminative Latent Space | Jin Myung Kwak, Minseon Kim and Sung Ju Hwang | N/A | N/A |
| Vision Meets Definitions: Unsupervised Visual Word Sense Disambiguation Incorporating Gloss Information | Sunjae Kwon, Rishabh Garodia, Minhwa Lee, Zhichao Yang and Hong Yu | N/A | N/A |
| SWiPE: A Dataset for Document-Level Simplification of Wikipedia Pages | Philippe Laban, Jesse Vig, Wojciech Kryscinski, Shafiq Joty, Caiming Xiong and Chien-Sheng Wu | N/A | N/A |
| DrBERT: A Robust Pre-trained Model in French for Biomedical and Clinical domains | Yanis Labrak, Adrien Bazoge, Richard Dufour, Mickael Rouvier, Emmanuel Morin, Béatrice Daille and Pierre-Antoine Gourraud | N/A | N/A |
| Contrastive Error Attribution for Finetuned Language Models | Faisal Ladhak, Esin Durmus and Tatsunori Hashimoto | N/A | N/A |
| Exploring Better Text Image Translation with Multimodal Codebook | Zhibin Lan, Jiawei Yu, Xiang Li, Wen Zhang, Jian Luan, Bin Wang, Degen Huang and Jinsong Su | N/A | N/A |
| What about "em"? How Commercial Machine Translation Fails to Handle (Neo-)Pronouns | Anne Lauscher, Debora Nozza, Ehm Miltersen, Archie Crowley and Dirk Hovy | N/A | N/A |
| Improved Instruction Ordering in Recipe-Grounded Conversation | Duong Le, Ruohao Guo, Wei Xu and Alan Ritter | N/A | N/A |
| FormNetV2: Multimodal Graph Contrastive Learning for Form Document Information Extraction | Chen-Yu Lee, Chun-Liang Li, Hao Zhang, Timothy Dozat, Vincent Perot, Guolong Su, Xiang Zhang, Kihyuk Sohn, Nikolay Glushnev, Renshen Wang, Joshua Ainslie, Shangbang Long, Siyang Qin, Yasuhisa Fujii, Nan Hua and Tomas Pfister | N/A | N/A |
| Query-Efficient Black-Box Red Teaming via Bayesian Optimization | Deokjae Lee, JunYeong Lee, Jung-Woo Ha, Jin-Hwa Kim, Sang-Woo Lee, Hwaran Lee and Hyun Oh Song | N/A | N/A |
| On Complementarity Objectives for Hybrid Retrieval | Dohyeon Lee, Seung-won Hwang, Kyungjae Lee, Seungtaek Choi and Sunghyun Park | N/A | N/A |
| SQuARe: A Large-Scale Dataset of Sensitive Questions and Acceptable Responses Created through Human-Machine Collaboration | Hwaran Lee, Seokhee Hong, Joonsuk Park, Takyoung Kim, Meeyoung Cha, Yejin Choi, Byoungpil Kim, Gunhee Kim, Eun-Ju Lee, Yong Lim, Alice Oh, Sangchul Park and Jung-Woo Ha | N/A | N/A |
| Revealing Single Frame Bias for Video-and-Language Learning | Jie Lei, Tamara Berg and Mohit Bansal | N/A | N/A |
| TART: Improved Few-shot Text Classification Using Task-Adaptive Reference Transformation | Shuo Lei, Xuchao Zhang, Jianfeng He, Fanglan Chen and Chang-Tien Lu | N/A | N/A |
| BIC: Twitter Bot Detection with Text-Graph Interaction and Semantic Consistency | Zhenyu Lei, Herun Wan, Wenqian Zhang, Shangbin Feng, Zilong Chen, Jundong Li, Qinghua Zheng and Minnan Luo | N/A | N/A |
| Tell2Design: A Dataset for Language-Guided Floor Plan Generation | Sicong Leng, Yang Zhou, Mohammed Haroon Dupty, Wee Sun Lee, Sam Joyce and Wei Lu | N/A | N/A |
| Diverse Demonstrations Improve In-context Compositional Generalization | Itay Levy, Ben Bogin and Jonathan Berant | N/A | N/A |
| Few-Shot Data-to-Text Generation via Unified Representation and Multi-Source Learning | Alexander Hanbo Li, Mingyue Shang, Evangelia Spiliopoulou, Jie Ma, Patrick Ng, Zhiguo Wang, Bonan Min, William Yang Wang, Kathleen McKeown, Vittorio Castelli, Dan Roth and Bing Xiang | N/A | N/A |
| Understanding Client Reactions in Online Mental Health Counseling | Anqi Li, Lizhi Ma, Yaling Mei, Hongliang He, Shuai Zhang, Huachuan Qiu and Zhenzhong Lan | N/A | N/A |
| Toward Interactive Dictation | Belinda Z. Li, Jason Eisner, Adam Pauls and Sam Thomson | N/A | N/A |
| Python Code Generation by Asking Clarification Questions | Haau-Sing (Xiaocheng) Li, Mohsen Mesgar, André Martins and Iryna Gurevych | N/A | N/A |
| Do You Hear The People Sing? Key Point Analysis via Iterative Clustering and Abstractive Summarisation | Hao Li, Viktor Schlegel, Riza Batista-Navarro and Goran Nenadic | N/A | N/A |
| TeAST: Temporal Knowledge Graph Embedding via Archimedean Spiral Timeline | Jiang Li, Xiangdong Su and Guanglai Gao | N/A | N/A |
| Can Language Models Make Fun? A Case Study in Chinese Comical Crosstalk | Jianquan Li, XiangBo Wu, Xiaokang Liu, Qianqian Xie, Prayag Tiwari and Benyou Wang | N/A | N/A |
| Contextual Distortion Reveals Constituency: Masked Language Models are Implicit Parsers | Jiaxi Li and Wei Lu | N/A | N/A |
| Are Message Passing Neural Networks Really Helpful for Knowledge Graph Completion? | Juanhui Li, Harry Shomer, Jiayuan Ding, Yiqi Wang, Yao Ma, Neil Shah, Jiliang Tang and Dawei Yin | N/A | N/A |
| CATS: A Pragmatic Chinese Answer-to-Sequence Dataset with Large Scale and High Quality | Liang Li, Ruiying Geng, Chengyang Fang, Bing Li, Can Ma, Rongyu Cao, Binhua Li, Fei Huang and Yongbin Li | N/A | N/A |
| Text Adversarial Purification as Defense against Adversarial Attacks | Linyang Li, Demin Song and Xipeng Qiu | N/A | N/A |
| Symbolic Chain-of-Thought Distillation: Small Models Can Also "Think" Step-by-Step | Liunian Harold Li, Jack Hessel, Youngjae Yu, Xiang Ren, Kai-Wei Chang and Yejin Choi | N/A | N/A |
| Multi-modal Action Chain Abductive Reasoning | Mengze Li, Tianbao Wang, Jiahe Xu, Kairong Han, Shengyu Zhang, Zhou Zhao, Jiaxu Miao, Wenqiao Zhang, Shiliang Pu and Fei Wu | N/A | N/A |
| CITADEL: Conditional Token Interaction via Dynamic Lexical Routing for Efficient and Effective Multi-Vector Retrieval | Minghan Li, Sheng-Chieh Lin, Barlas Oguz, Asish Ghoshal, Jimmy Lin, Yashar Mehdad, Wen-tau Yih and Xilun Chen | N/A | N/A |
| CodeIE: Large Code Generation Models are Better Few-Shot Information Extractors | Peng Li, Tianxiang Sun, Qiong Tang, Hang Yan, Yuanbin Wu, Xuanjing Huang and Xipeng Qiu | N/A | N/A |
| To Copy Rather Than Memorize: A Vertical Learning Paradigm for Knowledge Graph Completion | Rui Li, Xu Chen, Chaozhuo Li, Yanming Shen, Jianan Zhao, Yujing Wang, Weihao Han, Hao Sun, Weiwei Deng, Qi Zhang and Xing Xie | N/A | N/A |
| Open-Domain Hierarchical Event Schema Induction by Incremental Prompting and Verification | Sha Li, Ruining Zhao, Manling Li, Heng Ji, Chris Callison-Burch and Jiawei Han | N/A | N/A |
| Sequence Parallelism: Long Sequence Training from System Perspective | Shenggui Li, Fuzhao Xue, Chaitanya Baranwal, Yongbin Li and Yang You | N/A | N/A |
| Few-shot In-context Learning on Knowledge Base Question Answering | Tianle Li, Xueguang Ma, Alex Zhuang, Yu Gu, Yu Su and Wenhu Chen | N/A | N/A |
| TREA: Tree-Structure Reasoning Schema for Conversational Recommendation | Wendi Li, Wei Wei, Xiaoye Qu, Xian-Ling Mao, Ye Yuan, Wenfeng Xie and Dangyang Chen | N/A | N/A |
| Contrastive Decoding: Open-ended Text Generation as Optimization | Xiang Lisa Li, Ari Holtzman, Daniel Fried, Percy Liang, Jason Eisner, Tatsunori Hashimoto, Luke Zettlemoyer and Mike Lewis | N/A | N/A |
| Unified Demonstration Retriever for In-Context Learning | Xiaonan Li, Kai Lv, Hang Yan, Tianyang Lin, Wei Zhu, Yuan Ni, Guotong Xie, Xiaoling Wang and Xipeng Qiu | N/A | N/A |
| Explicit Syntactic Guidance for Neural Text Generation | Yafu Li, Leyang Cui, Jianhao Yan, Yongjing Yin, Wei Bi, Shuming Shi and Yue Zhang | N/A | N/A |
| AtTGen: Attribute Tree Generation for Real-World Attribute Joint Extraction | Yanzeng Li, Bingcong Xue, Ruoyu Zhang and Lei Zou | N/A | N/A |
| Multi-target Backdoor Attacks for Code Pre-trained Models | Yanzhou Li, Shangqing Liu, Kangjie Chen, Xiaofei Xie, Tianwei Zhang and Yang Liu | N/A | N/A |
| Translation-Enhanced Multilingual Text-to-Image Generation | Yaoyiran Li, Ching-Yun Chang, Stephen Rawls, Ivan Vulić and Anna Korhonen | N/A | N/A |
| Making Language Models Better Reasoners with Step-Aware Verifier | Yifei Li, Zeqi Lin, Shizhuo Zhang, Qiang Fu, Bei Chen, Jian-Guang Lou and Weizhu Chen | N/A | N/A |
| TemplateGEC: Improving Grammatical Error Correction with Detection Template | Yinghao Li, Xuebo Liu, Shuo Wang, Peiyuan Gong, Derek F. Wong, Yang Gao, Heyan Huang and Min Zhang | N/A | N/A |
| Prompt Tuning Pushes Farther, Contrastive Learning Pulls Closer: A Two-Stage Approach to Mitigate Social Biases | Yingji Li, Mengnan Du, Xin Wang and Ying Wang | N/A | N/A |
| A New Direction in Stance Detection: Target-Stance Extraction in the Wild | Yingjie Li, Krishna Garg and Cornelia Caragea | N/A | N/A |
| Pre-training Multi-party Dialogue Models with Latent Discourse Inference | Yiyang Li, Xinting Huang, Wei Bi and Hai Zhao | N/A | N/A |
| EM Pre-training for Multi-party Dialogue Response Generation | Yiyang Li and Hai Zhao | N/A | N/A |
| Multiview Identifiers Enhanced Generative Retrieval | Yongqi Li, Nan Yang, Liang Wang, Furu Wei and Wenjie Li | N/A | N/A |
| DIONYSUS: A Pre-trained Model for Low-Resource Dialogue Summarization | Yu Li, Baolin Peng, Pengcheng He, Michel Galley, Zhou Yu and Jianfeng Gao | N/A | N/A |
| White-Box Multi-Objective Adversarial Attack on Dialogue Generation | Yufei Li, Zexin Li, Yingfan Gao and Cong Liu | N/A | N/A |
| PaCE: Unified Multi-modal Dialogue Pre-training with Progressive and Compositional Experts | Yunshui Li, Binyuan Hui, ZhiChao Yin, Min Yang, Fei Huang and Yongbin Li | N/A | N/A |
| A Neural Divide-and-Conquer Reasoning Framework for Image Retrieval from Linguistically Complex Text | Yunxin Li, Baotian Hu, Yuxin Ding, Lin Ma and Min Zhang | N/A | N/A |
| A Multi-Modal Context Reasoning Approach for Conditional Inference on Joint Textual and Visual Clues | Yunxin Li, Baotian Hu, Chen Xinyu, Yuxin Ding, Lin Ma and Min Zhang | N/A | N/A |
| Unifying Cross-Lingual and Cross-Modal Modeling Towards Weakly Supervised Multilingual Vision-Language Pre-training | Zejun Li, Zhihao Fan, Jingjing Chen, Qi Zhang, Xuanjing Huang and Zhongyu Wei | N/A | N/A |
| Learning to Substitute Spans towards Improving Compositional Generalization | Zhaoyi Li, Ying Wei and Defu Lian | N/A | N/A |
| FAA: Fine-grained Attention Alignment for Cascade Document Ranking | Zhen Li, Chongyang Tao, Jiazhan Feng, Tao Shen, Dongyan Zhao, Xiubo Geng and Daxin Jiang | N/A | N/A |
| The Best of Both Worlds: Combining Human and Machine Translations for Multilingual Semantic Parsing with Active Learning | Zhuang Li, Lizhen Qu, Philip Cohen, Raj Tumuluri and Gholamreza Haffari | N/A | N/A |
| Dual-Alignment Pre-training for Cross-lingual Sentence Embedding | Ziheng Li, Shaohan Huang, Zihan Zhang, Zhi-Hong Deng, Qiang Lou, Haizhen Huang, Jian Jiao, Furu Wei, Weiwei Deng and Qi Zhang | N/A | N/A |
| Dynamic and Efficient Inference for Text Generation via BERT Family | Xiaobo Liang, Juntao Li, Lijun Wu, Ziqiang Cao and Min Zhang | N/A | N/A |
| Open-ended Long Text Generation via Masked Language Modeling | Xiaobo Liang, Zecheng Tang, Juntao Li and Min Zhang | N/A | N/A |
| Summary-Oriented Vision Modeling for Multimodal Abstractive Summarization | Yunlong Liang, Fandong Meng, Jinan Xu, Jiaan Wang, Yufeng Chen and Jie Zhou | N/A | N/A |
| Disentangled Phonetic Representation for Chinese Spelling Correction | Zihong Liang, Xiaojun Quan and Qifan Wang | N/A | N/A |
| Graph-based Relation Mining for Context-free Out-of-vocabulary Word Embedding Learning | Ziran Liang, Yuyin Lu, HeGang Chen and Yanghui Rao | N/A | N/A |
| Prompts Can Play Lottery Tickets Well: Achieving Lifelong Information Extraction via Lottery Prompt Tuning | Zujie Liang, Feng Wei, Yin Jie, Yuxi Qian, Zhenghong Hao and Bing Han | N/A | N/A |
| Parameter-Efficient Fine-Tuning without Introducing New Latency | Baohao Liao, Yan Meng and Christof Monz | N/A | N/A |
| Large-Scale Correlation Analysis of Automated Metrics for Topic Models | Jia Peng Lim and Hady Lauw | N/A | N/A |
| Gloss-Free End-to-End Sign Language Translation | Kezhou Lin, Xiaohan Wang, Linchao Zhu, Ke Sun, Bang Zhang and Yi Yang | N/A | N/A |
| TECHS: Temporal Logical Graph Networks for Explainable Extrapolation Reasoning | Qika Lin, Jun Liu, Rui Mao, Fangzhi Xu and Erik Cambria | N/A | N/A |
| TAVT: Towards Transferable Audio-Visual Text Generation | Wang Lin, Tao Jin, Wenwen Pan, Linjun Li, Xize Cheng, Ye Wang and Zhou Zhao | N/A | N/A |
| An Inner Table Retriever for Robust Table Question Answering | Weizhe Lin, Rexhina Blloshmi, Bill Byrne, Adria de Gispert and Gonzalo Iglesias | N/A | N/A |
| Compositional Generalization without Trees using Multiset Tagging and Latent Permutations | Matthias Lindemann, Alexander Koller and Ivan Titov | N/A | N/A |
| What does a Text Classifier Learn about Morality? An Explainable Method for Cross-Domain Comparison of Moral Rhetoric | Enrico Liscio, Oscar Araque, Lorenzo Gatti, Ionut Constantinescu, Catholijn Jonker, Kyriaki Kalimeri and Pradeep Kumar Murukannaiah | N/A | N/A |
| DimonGen: Diversified Generative Commonsense Reasoning for Explaining Concept Relationships | Chenzhengyi Liu, Jie Huang, Kerui Zhu and Kevin Chen-Chuan Chang | N/A | N/A |
| MatCha: Enhancing Visual Language Pretraining with Math Reasoning and Chart Derendering | Fangyu Liu, Francesco Piccinno, Syrine Krichene, Chenxi Pang, Kenton Lee, Mandar Joshi, Yasemin Altun, Nigel Collier and Julian Eisenschlos | N/A | N/A |
| Learning with Partial Annotations for Event Detection | Jian Liu, Dianbo Sui, Kang Liu, Haoyan Liu and Zhe Zhao | N/A | N/A |
| Document-Level Event Argument Extraction With a Chain Reasoning Paradigm | Jian Liu, Chen Liang, Jinan Xu, Haoyan Liu and Zhe Zhao | N/A | N/A |
| RankCSE: Unsupervised Sentence Representations Learning via Learning to Rank | Jiduan Liu, Jiahao Liu, Qifan Wang, Jingang Wang, Wei Wu, Yunsen Xian, Dongyan Zhao, Kai Chen and Rui Yan | N/A | N/A |
| Towards Robust Low-Resource Fine-Tuning with Multi-View Compressed Representations | Linlin Liu, Xingxuan Li, Megh Thakkar, Xin Li, Shafiq Joty, Luo Si and Lidong Bing | N/A | N/A |
| Do Question Answering Modeling Improvements Hold Across Benchmarks? | Nelson F. Liu, Tony Lee, Robin Jia and Percy Liang | N/A | N/A |
| Character-Aware Models Improve Visual Text Rendering | Rosanne Liu, Dan Garrette, Chitwan Saharia, William Chan, Adam Roberts, Sharan Narang, Irina Blok, RJ Mical, Mohammad Norouzi and Noah Constant | N/A | N/A |
| RECAP: Retrieval-Enhanced Context-Aware Prefix Encoder for Personalized Dialogue Response Generation | Shuai Liu, Hyundong Cho, Marjorie Freedman, Xuezhe Ma and Jonathan May | N/A | N/A |
| kNN-TL: k-Nearest-Neighbor Transfer Learning for Low-Resource Neural Machine Translation | Shudong Liu, Xuebo Liu, Derek F. Wong, Zhaocong Li, Wenxiang Jiao, Lidia S. Chao and Min Zhang | N/A | N/A |
| Do CoNLL-2003 Named Entity Taggers Still Work Well in 2023? | Shuheng Liu and Alan Ritter | N/A | N/A |
| Annotation-Inspired Implicit Discourse Relation Classification with Auxiliary Discourse Connective Generation | Wei Liu and Michael Strube | N/A | N/A |
| MGR: Multi-generator Based Rationalization | Wei Liu, Haozhao Wang, Jun Wang, Ruixuan Li, Xinyang Li, YuanKai Zhang and Yang Qiu | N/A | N/A |
| Modeling Structural Similarities between Documents for Coherence Assessment with Graph Convolutional Networks | Wei Liu, Xiyan Fu and Michael Strube | N/A | N/A |
| Revisiting Commonsense Reasoning in Machine Translation: Training, Evaluation and Challenge | Xuebo Liu, Yutong Wang, Derek F. Wong, Runzhe Zhan, Liangxuan Yu and Min Zhang | N/A | N/A |
| One Cannot Stand for Everyone! Leveraging Multiple User Simulators to train Task-oriented Dialogue Systems | Yajiao Liu, Xin Jiang, Yichun Yin, Yasheng Wang, Fei Mi, Qun Liu, Xiang Wan and Benyou Wang | N/A | N/A |
| Uncovering and Categorizing Social Biases in Text-to-SQL | Yan Liu, Yan Gao, Zhe Su, Xiaokang Chen, Elliott Ash and Jian-Guang Lou | N/A | N/A |
| Towards Better Entity Linking with Multi-View Enhanced Distillation | Yi Liu, Yuan Tian, Jianxun Lian, Xinlong Wang, Yanan Cao, Fang Fang, Wen Zhang, Haizhen Huang, Weiwei Deng and Qi Zhang | N/A | N/A |
| A Crosslingual Investigation of Conceptualization in 1335 Languages | Yihong Liu, Haotian Ye, Leonie Weissweiler, Philipp Wicke, Renhao Pei, Robert Zangenfeind and Hinrich Schütze | N/A | N/A |
| Revisiting the Gold Standard: Grounding Summarization Evaluation with Robust Human Evaluation | Yixin Liu, Alex Fabbri, Pengfei Liu, Yilun Zhao, Linyong Nan, Ruilin Han, Simeng Han, Shafiq Joty, Chien-Sheng Wu, Caiming Xiong and Dragomir Radev | N/A | N/A |
| On Improving Summarization Factual Consistency from Natural Language Feedback | Yixin Liu, Budhaditya Deb, Milagro Teruel, Aaron Halfaker, Dragomir Radev and Ahmed Hassan Awadallah | N/A | N/A |
| PVGRU: Generating Diverse and Relevant Dialogue Responses via Pseudo-Variational Mechanism | Yongkang Liu, Shi Feng, Daling Wang, Yifei Zhang and Hinrich Schütze | N/A | N/A |
| Binary and Ternary Natural Language Generation | Zechun Liu, Barlas Oguz, Aasish Pappu, Yangyang Shi and Raghuraman Krishnamoorthi | N/A | N/A |
| XDailyDialog: A Multilingual Parallel Dialogue Corpus | Zeming Liu, Ping Nie, Jie Cai, Haifeng Wang, Zheng-Yu Niu, Peng Zhang, Mrinmaya Sachan and Kaiping Peng | N/A | N/A |
| RetroMAE-2: Duplex Masked Auto-Encoder For Pre-Training Retrieval-Oriented Language Models | Zheng Liu, Shitao Xiao, Yingxia Shao and Zhao Cao | N/A | N/A |
| Guiding Computational Stance Detection with Expanded Stance Triangle Framework | Zhengyuan Liu, Yong Keong Yap, Hai Leong Chieu and Nancy Chen | N/A | N/A |
| Towards Boosting the Open-Domain Chatbot with Human Feedback | Hua Lu, Siqi Bao, Huang He, Fan Wang, Hua Wu and Haifeng Wang | N/A | N/A |
| What Makes Pre-trained Language Models Better Zero-shot Learners? | Jinghui Lu, Dongsheng Zhu, Weidong Han, Rui Zhao, Brian Mac Namee and Fei Tan | N/A | N/A |
| Facilitating Fine-grained Detection of Chinese Toxic Language: Hierarchical Taxonomy, Resources, and Benchmarks | Junyu Lu, Bo Xu, Xiaokun Zhang, Changrong Min, Liang Yang and Hongfei Lin | N/A | N/A |
| DaMSTF: Domain Adversarial Learning Enhanced Meta Self-Training for Domain Adaptation | Menglong Lu, Zhen Huang, Yunxiang Zhao, Zhiliang Tian, Yang Liu and Dongsheng Li | N/A | N/A |
| Distantly Supervised Course Concept Extraction in MOOCs with Academic Discipline | Mengying Lu, Yuquan Wang, Jifan Yu, Yexing Du, Lei Hou and Juanzi Li | N/A | N/A |
| A Survey of Deep Learning for Mathematical Reasoning | Pan Lu, Liang Qiu, Wenhao Yu, Sean Welleck and Kai-Wei Chang | N/A | N/A |
| Toward Human-Like Evaluation for Natural Language Generation with Error Analysis | Qingyu Lu, Liang Ding, Liping Xie, Kanjian Zhang, Derek F. Wong and Dacheng Tao | N/A | N/A |
| Explanation-based Finetuning Makes Models More Robust to Spurious Cues | Josh Magnus Ludan, Yixuan Meng, Tai Nguyen, Saurabh Shah, Qing Lyu, Marianna Apidianaki and Chris Callison-Burch | N/A | N/A |
| Causality-Guided Multi-Memory Interaction Network for Multivariate Stock Price Movement Prediction | Di Luo, Weiheng Liao, Shuqi Li, Xin Cheng and Rui Yan | N/A | N/A |
| HAHE: Hierarchical Attention for Hyper-Relational Knowledge Graphs in Global and Local Level | Haoran Luo, Haihong E, Yuhao Yang, Yikai Guo, Mingzhi Sun, Tianyu Yao, Zichen Tang, Kaiyang Wan, Meina Song and Wei Lin | N/A | N/A |
| End-to-end Knowledge Retrieval with Multi-modal Queries | Man Luo, Zhiyuan Fang, Tejas Gokhale, Yezhou Yang and Chitta Baral | N/A | N/A |
| CAME: Confidence-guided Adaptive Memory Efficient Optimization | Yang Luo, Xiaozhe Ren, Zangwei Zheng, Zhuo Jiang, Xin Jiang and Yang You | N/A | N/A |
| DialoGPS: Dialogue Path Sampling in Continuous Semantic Space for Data Augmentation in Multi-Turn Conversations | Ang Lv, Jinpeng Li, Yuhan Chen, Gao Xing, Ji Zhang and Rui Yan | N/A | N/A |
| Envisioning Future from the Past: Hierarchical Duality Learning for Multi-Turn Dialogue Generation | Ang Lv, Jinpeng Li, Shufang Xie and Rui Yan | N/A | N/A |
| Z-ICL: Zero-Shot In-Context Learning with Pseudo-Demonstrations | Xinxi Lyu, Sewon Min, Iz Beltagy, Luke Zettlemoyer and Hannaneh Hajishirzi | N/A | N/A |
| Cross-lingual Continual Learning | Meryem M’hamdi, Xiang Ren and Jonathan May | N/A | N/A |
| AMR-based Network for Aspect-based Sentiment Analysis | Fukun Ma, Xuming Hu, Aiwei Liu, Yawen Yang, Shuang Li, Philip S. Yu and Lijie Wen | N/A | N/A |
| Chain-of-Skills: A Configurable Model for Open-Domain Question Answering | Kaixin Ma, Hao Cheng, Yu Zhang, Xiaodong Liu, Eric Nyberg and Jianfeng Gao | N/A | N/A |
| BUMP: A Benchmark of Unfaithful Minimal Pairs for Meta-Evaluation of Faithfulness Metrics | Liang Ma, Shuyang Cao, Robert L Logan IV, Di Lu, Shihao Ran, Ke Zhang, Joel Tetreault and Alejandro Jaimes | N/A | N/A |
| DICE: Data-Efficient Clinical Event Extraction with Generative Models | Mingyu Derek Ma, Alexander Taylor, Wei Wang and Nanyun Peng | N/A | N/A |
| Learning "O" Helps for Learning More: Handling the Unlabeled Entity Problem for Class-incremental NER | Ruotian Ma, Xuanting Chen, Zhang Lin, Xin Zhou, Junzhe Wang, Tao Gui, Qi Zhang, Xiang Gao and Yun Wen Chen | N/A | N/A |
| CoLaDa: A Collaborative Label Denoising Framework for Cross-lingual Named Entity Recognition | Tingting Ma, Qianhui Wu, Huiqiang Jiang, Börje Karlsson, Tiejun Zhao and Chin-Yew Lin | N/A | N/A |
| Few-shot Event Detection: An Empirical Study and a Unified View | Yubo Ma, Zehao Wang, Yixin Cao and Aixin Sun | N/A | N/A |
| World-to-Words: Grounded Open Vocabulary Acquisition through Fast Mapping in Vision-Language Models | Ziqiao Ma, Jiayi Pan and Joyce Chai | N/A | N/A |
| Training Models to Generate, Recognize, and Reframe Unhelpful Thoughts | Mounica Maddela, Megan Ung, Jing Xu, Andrea Madotto, Heather Foran and Y-Lan Boureau | N/A | N/A |
| LENS: A Learnable Evaluation Metric for Text Simplification | Mounica Maddela, Yao Dou, David Heineman and Wei Xu | N/A | N/A |
| Ethical Considerations for Machine Translation of Indigenous Languages: Giving a Voice to the Speakers | Manuel Mager, Elisabeth Mager, Katharina Kann and Ngoc Thang Vu | N/A | N/A |
| HyperMixer: An MLP-based Low Cost Alternative to Transformers | Florian Mai, Arnaud Pannatier, Fabio Fehr, Haolin Chen, Francois Marelli, Francois Fleuret and James Henderson | N/A | N/A |
| Small Data, Big Impact: Leveraging Minimal Data for Effective Machine Translation | Jean Maillard, Cynthia Gao, Elahe Kalbassi, Kaushik Ram Sadagopan, Vedanuj Goswami, Philipp Koehn, Angela Fan and Francisco Guzman | N/A | N/A |
| QUEST: A Retrieval Dataset of Entity-Seeking Queries with Implicit Set Operations | Chaitanya Malaviya, Peter Shaw, Ming-Wei Chang, Kenton Lee and Kristina Toutanova | N/A | N/A |
| When Not to Trust Language Models: Investigating Effectiveness of Parametric and Non-Parametric Memories | Alex Mallen, Akari Asai, Victor Zhong, Rajarshi Das, Daniel Khashabi and Hannaneh Hajishirzi | N/A | N/A |
| Benchmarking Large Language Model Capabilities for Conditional Generation | Joshua Maynez, Priyanka Agrawal and Sebastian Gehrmann | N/A | N/A |
| Logic-driven Indirect Supervision: An Application to Crisis Counseling | Mattia Medina Grespan, Meghan Broadbent, Xinyao Zhang, Katherine Axford, Brent Kious, Zac Imel and Vivek Srikumar | N/A | N/A |
| Resolving Ambiguities in Text-to-Image Generative Models | Ninareh Mehrabi, Palash Goyal, Apurv Verma, Jwala Dhamala, Varun Kumar, Qian Hu, Kai-Wei Chang, Richard Zemel, Aram Galstyan and Rahul Gupta | N/A | N/A |
| NOTABLE: Transferable Backdoor Attacks Against Prompt-based NLP Models | Kai Mei, Zheng Li, Zhenting Wang, Yang Zhang and Shiqing Ma | N/A | N/A |
| On the Efficacy of Sampling Adapters | Clara Meister, Tiago Pimentel, Luca Malagutti, Ethan Wilcox and Ryan Cotterell | N/A | N/A |
| From Dogwhistles to Bullhorns: Unveiling Coded Rhetoric with Language Models | Julia Mendelsohn, Ronan Le Bras, Yejin Choi and Maarten Sap | N/A | N/A |
| Human-in-the-loop Evaluation for Early Misinformation Detection: A Case Study of COVID-19 Treatments | Ethan Mendes, Yang Chen, Wei Xu and Alan Ritter | N/A | N/A |
| Naamapadam: A Large-Scale Named Entity Annotated Data for Indic Languages | Arnav Mhaske, Harshit Kedia, Sumanth Doddapaneni, Mitesh M. Khapra, Pratyush Kumar, Rudra Murthy and Anoop Kunchukuttan | N/A | N/A |
| What Do NLP Researchers Believe? Results of the NLP Community Metasurvey | Julian Michael, Ari Holtzman, Alicia Parrish, Aaron Mueller, Alex Wang, Angelica Chen, Divyam Madaan, Nikita Nangia, Richard Yuanzhe Pang, Jason Phang and Samuel R. Bowman | N/A | N/A |
| LAIT: Efficient Multi-Segment Encoding in Transformers with Layer-Adjustable Interaction | Jeremiah Milbauer, Annie Louis, Mohammad Javad Hosseini, Alex Fabrikant, Donald Metzler and Tal Schuster | N/A | N/A |
| Just Like a Human Would, Direct Access to Sarcasm Augmented with Potential Result and Reaction | Changrong Min, Ximing Li, Liang Yang, Zhilin Wang, Bo Xu and Hongfei Lin | N/A | N/A |
| Where’s the Point? Self-Supervised Multilingual Punctuation-Agnostic Sentence Segmentation | Benjamin Minixhofer, Jonas Pfeiffer and Ivan Vulić | N/A | N/A |
| Privacy-Preserving Domain Adaptation of Semantic Parsers | Fatemehsadat Mireshghallah, Yu Su, Tatsunori Hashimoto, Jason Eisner and Richard Shin | N/A | N/A |
| What is the Real Intention behind this Question? Dataset Collection and Intention Classification | Maryam Sadat Mirzaei, Kourosh Meshgi and Satoshi Sekine | N/A | N/A |
| PAL to Lend a Helping Hand: Towards Building an Emotion Adaptive Polite and Empathetic Counseling Conversational Agent | Kshitij Mishra, Priyanshu Priya and Asif Ekbal | N/A | N/A |
| ConvGQR: Generative Query Reformulation for Conversational Search | Fengran Mo, Kelong Mao, Yutao Zhu, Yihong Wu, Kaiyu Huang and Jian-Yun Nie | N/A | N/A |
| DecompX: Explaining Transformers Decisions by Propagating Token Decomposition | Ali Modarressi, Mohsen Fayyaz, Ehsan Aghazadeh, Yadollah Yaghoobzadeh and Mohammad Taher Pilehvar | N/A | N/A |
| Extrinsic Evaluation of Machine Translation Metrics | Nikita Moghe, Tom Sherborne, Mark Steedman and Alexandra Birch | N/A | N/A |
| Large-scale Lifelong Learning of In-context Instructions and How to Tackle It | Jisoo Mok, Jaeyoung Do, Sungjin Lee, Tara Taghavi, Seunghak Yu and Sungroh Yoon | N/A | N/A |
| Dynamic Regularization in UDA for Transformers in Multimodal Classification | Ivonne Monter-Aldana, Adrian Pastor Lopez Monroy and Fernando Sanchez-Vega | N/A | N/A |
| Randomized Smoothing with Masked Inference for Adversarially Robust Text Classifications | Han Cheol Moon, Shafiq Joty, Ruochen Zhao, Megh Thakkar and Chi Xu | N/A | N/A |
| UPPAM: A Unified Pre-training Architecture for Political Actor Modeling based on Language | Xinyi Mou, Zhongyu Wei, Qi Zhang and Xuanjing Huang | N/A | N/A |
| Decoupling Pseudo Label Disambiguation and Representation Learning for Generalized Intent Discovery | Yutao Mou, Xiaoshuai Song, Keqing He, Chen Zeng, Pei Wang, Jingang Wang, Yunsen Xian and Weiran Xu | N/A | N/A |
| How to Plant Trees in Language Models: Data and Architectural Effects on the Emergence of Syntactic Inductive Biases | Aaron Mueller and Tal Linzen | N/A | N/A |
| Crosslingual Generalization through Multitask Finetuning | Niklas Muennighoff, Thomas Wang, Lintang Sutawika, Adam Roberts, Stella Biderman, Teven Le Scao, M Saiful Bari, Sheng Shen, Zheng Xin Yong, Hailey Schoelkopf, Xiangru Tang, Dragomir Radev, Alham Fikri Aji, Khalid Almubarak, Samuel Albanie, Zaid Alyafeai, Albert Webson, Edward Raff and Colin Raffel | N/A | N/A |
| DIP: Dead code Insertion based Black-box Attack for Programming Language Model | CheolWon Na, YunSeok Choi and Jee-Hyong Lee | N/A | N/A |
| Efficient Transformers with Dynamic Token Pooling | Piotr Nawrot, Jan Chorowski, Adrian Lancucki and Edoardo Maria Ponti | N/A | N/A |
| DisentQA: Disentangling Parametric and Contextual Knowledge with Counterfactual Question Answering | Ella Neeman, Roee Aharoni, Or Honovich, Leshem Choshen, Idan Szpektor and Omri Abend | N/A | N/A |
| When Does Aggregating Multiple Skills with Multi-Task Learning Work? A Case Study in Financial NLP | Jingwei Ni, Zhijing Jin, Qian Wang, Mrinmaya Sachan and Markus Leippold | N/A | N/A |
| Finding the Pillars of Strength for Multi-Head Attention | Jinjie Ni, Rui Mao, Zonglin Yang, Han Lei and Erik Cambria | N/A | N/A |
| NLG Evaluation Metrics Beyond Correlation Analysis: An Empirical Metric Preference Checklist | Iftitahu Nimah, Meng Fang, Vlado Menkovski and Mykola Pechenizkiy | N/A | N/A |
| OD-RTE: A One-Stage Object Detection Framework for Relational Triple Extraction | Jinzhong Ning, Zhihao Yang, Yuanyuan Sun, Zhizheng Wang and Hongfei Lin | N/A | N/A |
| On "Scientific Debt" in NLP: A Case for More Rigour in Language Model Pre-Training Research | Made Nindyatama Nityasya, Haryo Wibowo, Alham Fikri Aji, Genta Winata, Radityo Eko Prasojo, Phil Blunsom and Adhiguna Kuncoro | N/A | N/A |
| Using counterfactual contrast to improve compositional generalization for multi-step quantitative reasoning | Armineh Nourbakhsh, Sameena Shah and Carolyn Rosé | N/A | N/A |
| Token-wise Decomposition of Autoregressive Language Model Hidden States for Analyzing Model Predictions | Byung-Doh Oh and William Schuler | N/A | N/A |
| Social-Group-Agnostic Bias Mitigation via the Stereotype Content Model | Ali Omrani, Alireza Salkhordeh Ziabari, Charles Yu, Preni Golazizian, Brendan Kennedy, Mohammad Atari, Heng Ji and Morteza Dehghani | N/A | N/A |
| Can LMs Learn New Entities from Descriptions? Challenges in Propagating Injected Knowledge | Yasumasa Onoe, Michael Zhang, Shankar Padmanabhan, Greg Durrett and Eunsol Choi | N/A | N/A |
| Efficient Semiring-Weighted Earley Parsing | Andreas Opedal, Ran Zmigrod, Tim Vieira, Ryan Cotterell and Jason Eisner | N/A | N/A |
| BLIND: Bias Removal With No Demographics | Hadas Orgad and Yonatan Belinkov | N/A | N/A |
| A Textual Dataset for Situated Proactive Response Selection | Naoki Otani, Jun Araki, HyeongSik Kim and Eduard Hovy | N/A | N/A |
| Songs Across Borders: Singable and Controllable Neural Lyric Translation | Longshen Ou, Xichu Ma, Min-Yen Kan and Ye Wang | N/A | N/A |
| WACO: Word-Aligned Contrastive Learning for Speech Translation | Siqi Ouyang, Rong Ye and Lei Li | N/A | N/A |
| Compositional Data Augmentation for Abstractive Conversation Summarization | Siru Ouyang, Jiaao Chen, Jiawei Han and Diyi Yang | N/A | N/A |
| On Prefix-tuning for Lightweight Out-of-distribution Detection | Yawen Ouyang, Yongchang Cao, Yuan Gao, Zhen Wu, Jianbing Zhang and Xinyu Dai | N/A | N/A |
| ThinkSum: Probabilistic reasoning over sets using large language models | Batu Ozturkler, Nikolay Malkin, Zhen Wang and Nebojsa Jojic | N/A | N/A |
| Socratic Pretraining: Question-Driven Pretraining for Controllable Summarization | Artidoro Pagnoni, Alex Fabbri, Wojciech Kryscinski and Chien-Sheng Wu | N/A | N/A |
| MultiTabQA: Generating Tabular Answers for Multi-Table Question Answering | Vaishali Pal, Andrew Yates, Evangelos Kanoulas and Maarten de Rijke | N/A | N/A |
| Using Neural Machine Translation for Generating Diverse Challenging Exercises for Language Learner | Frank Palma Gomez, Subhadarshi Panda, Michael Flor and Alla Rozovskaya | N/A | N/A |
| Fact-Checking Complex Claims with Program-Guided Reasoning | Liangming Pan, Xiaobao Wu, Xinyuan Lu, Anh Tuan Luu, William Yang Wang, Min-Yen Kan and Preslav Nakov | N/A | N/A |
| Cross-modal Attention Congruence Regularization for Vision-Language Relation Alignment | Rohan Pandey, Rulin Shao, Paul Pu Liang, Ruslan Salakhutdinov and Louis-Philippe Morency | N/A | N/A |
| Reward Gaming in Conditional Text Generation | Richard Yuanzhe Pang, Vishakh Padmakumar, Thibault Sellam, Ankur Parikh and He He | N/A | N/A |
| Attention as a Guide for Simultaneous Speech Translation | Sara Papi, Matteo Negri and Marco Turchi | N/A | N/A |
| MM-SHAP: A Performance-agnostic Metric for Measuring Multimodal Contributions in Vision and Language Models & Tasks | Letitia Parcalabescu and Anette Frank | N/A | N/A |
| GENEVA: Benchmarking Generalizability for Event Argument Extraction with Hundreds of Event Types and Argument Roles | Tanmay Parekh, I-Hung Hsu, Kuan-Hao Huang, Kai-Wei Chang and Nanyun Peng | N/A | N/A |
| Deep Model Compression Also Helps Models Capture Ambiguity | Hancheol Park and Jong Park | N/A | N/A |
| Do I have the Knowledge to Answer? Investigating Answerability of Knowledge Base Questions | Mayur Patidar, Prayushi Faldu, Avinash Singh, Lovekesh Vig, Indrajit Bhattacharya and Mausam | N/A | N/A |
| Beyond English-Centric Bitexts for Better Multilingual Language Representation Learning | Barun Patra, Saksham Singhal, Shaohan Huang, Zewen Chi, Li Dong, Furu Wei, Vishrav Chaudhary and Xia Song | N/A | N/A |
| Dating Greek Papyri with Text Regression | John Pavlopoulos, Maria Konstantinidou, Isabelle Marthot-Santaniello, Holger Essler and Asimina Paparigopoulou | N/A | N/A |
| When to Use What: An In-Depth Comparative Empirical Analysis of OpenIE Systems for Downstream Applications | Kevin Pei, Ishan Jindal, Kevin Chen-Chuan Chang, ChengXiang Zhai and Yunyao Li | N/A | N/A |
| FSUIE: A Novel Fuzzy Span Mechanism for Universal Information Extraction | Tianshuo Peng, Zuchao Li, Lefei Zhang, Bo Du and Hai Zhao | N/A | N/A |
| Are You Copying My Model? Protecting the Copyright of Large Language Models for EaaS via Backdoor Watermark | Wenjun Peng, Jingwei Yi, Fangzhao Wu, Shangxi Wu, Bin Bin Zhu, Lingjuan Lyu, Binxing Jiao, Tong Xu, Guangzhong Sun and Xing Xie | N/A | N/A |
| Neural Machine Translation for Mathematical Formulae | Felix Petersen, Moritz Schubotz, Andre Greiner-Petter and Bela Gipp | N/A | N/A |
| Dealing with Semantic Underspecification in Multimodal NLP | Sandro Pezzelle | N/A | N/A |
| Towards a Common Understanding of Contributing Factors for Cross-Lingual Transfer in Multilingual Language Models: A Review | Fred Philippy, Siwen Guo and Shohreh Haddadan | N/A | N/A |
| UniEX: An Effective and Efficient Framework for Unified Information Extraction via a Span-extractive Perspective | Yang Ping, JunYu Lu, Ruyi Gan, Junjie Wang, Yuxiang Zhang, Pingjian Zhang and Jiaxing Zhang | N/A | N/A |
| Learning Language-Specific Layers for Multilingual Machine Translation | Telmo Pires, Robin M. Schmidt, Yi-Hsiu Liao and Stephan Peitz | N/A | N/A |
| Multilingual Multifaceted Understanding of Online News in Terms of Genre, Framing, and Persuasion Techniques | Jakub Piskorski, Nicolas Stefanovitch, Nikolaos Nikolaidis, Giovanni Da San Martino and Preslav Nakov | N/A | N/A |
| No clues good clues: out of context Lexical Relation Classification | Lucia Pitarch, Jordi Bernad, Lacramioara Dranca, Carlos Bobed Lisbona and Jorge Gracia | N/A | N/A |
| Similarity-weighted Construction of Contextualized Commonsense Knowledge Graphs for Knowledge-intense Argumentation Tasks | Moritz Plenz, Juri Opitz, Philipp Heinisch, Philipp Cimiano and Anette Frank | N/A | N/A |
| Concise Answers to Complex Questions: Summarization of Long-form Answers | Abhilash Potluri, Fangyuan Xu and Eunsol Choi | N/A | N/A |
| Reanalyzing L2 Preposition Learning with Bayesian Mixed Effects and a Pretrained Language Model | Jakob Prange and Man Ho Ivy Wong | N/A | N/A |
| MeetingQA: Extractive Question-Answering on Meeting Transcripts | Archiki Prasad, Trung Bui, Seunghyun Yoon, Hanieh Deilamsalehy, Franck Dernoncourt and Mohit Bansal | N/A | N/A |
| Using Domain Knowledge to Guide Dialog Structure Induction via Neural Probabilistic Soft Logic | Connor Pryor, Quan Yuan, Jeremiah Liu, Mehran Kazemi, Deepak Ramachandran, Tania Bedrax-Weiss and Lise Getoor | N/A | N/A |
| Conjunct Lengths in English, Dependency Length Minimization, and Dependency Structure of Coordination | Adam Przepiórkowski and Michał Woźniak | N/A | N/A |
| Incorporating Distributions of Discourse Structure for Long Document Abstractive Summarization | Dongqi Pu, Yifan Wang and Vera Demberg | N/A | N/A |
| ClarifyDelphi: Reinforced Clarification Questions with Defeasibility Rewards for Social and Moral Situations | Valentina Pyatkin, Jena D. Hwang, Vivek Srikumar, Ximing Lu, Liwei Jiang, Yejin Choi and Chandra Bhagavatula | N/A | N/A |
| Limitations of Language Models in Arithmetic and Symbolic Induction | Jing Qian, Hong Wang, Zekun Li, Shiyang Li and Xifeng Yan | N/A | N/A |
| UniLG: A Unified Structure-aware Framework for Lyrics Generation | Tao Qian, Fan Lou, Jiatong Shi, Yuning Wu, Shuai Guo, Xiang Yin and Qin Jin | N/A | N/A |
| ParaLS: Lexical Substitution via Pretrained Paraphraser | Jipeng Qiang, Kang Liu, Yun Li, Yunhao Yuan and Yi Zhu | N/A | N/A |
| Reasoning with Language Model Prompting: A Survey | Shuofei Qiao, Yixin Ou, Ningyu Zhang, Xiang Chen, Yunzhi Yao, Shumin Deng, Chuanqi Tan, Fei Huang and Huajun Chen | N/A | N/A |
| Learning to Initialize: Can Meta Learning Improve Cross-task Generalization in Prompt Tuning? | Chengwei Qin, Shafiq Joty, Qian Li and Ruochen Zhao | N/A | N/A |
| WebCPM: Interactive Web Search for Chinese Long-form Question Answering | Yujia Qin, Zihan Cai, Dian Jin, Lan Yan, Shihao Liang, Kunlun Zhu, Yankai Lin, Xu Han, Ning Ding, Huadong Wang, Ruobing Xie, Fanchao Qi, Zhiyuan Liu, Maosong Sun and Jie Zhou | N/A | N/A |
| A Survey on Asking Clarification Questions Datasets in Conversational Systems | Hossein A. Rahmani, Xi Wang, Yue Feng, Qiang Zhang, Emine Yilmaz and Aldo Lipani | N/A | N/A |
| What Are You Token About? Dense Retrieval as Distributions Over the Vocabulary | Ori Ram, Liat Bezalel, Adi Zicher, Yonatan Belinkov, Jonathan Berant and Amir Globerson | N/A | N/A |
| Single Sequence Prediction over Reasoning Graphs for Multi-hop QA | Gowtham Ramesh, Makesh Narsimhan Sreedhar and Junjie Hu | N/A | N/A |
| A Comparative Study on the Impact of Model Compression Techniques on Fairness in Language Models | Krithika Ramesh, Arnav Chavan, Shrey Pandit and Sunayana Sitaram | N/A | N/A |
| Knowledge of cultural moral norms in large language models | Aida Ramezani and Yang Xu | N/A | N/A |
| Cross-Modal Attribute Insertions for Assessing the Robustness of Vision-and-Language Learning | Shivaen Ramshetty, Gaurav Verma and Srijan Kumar | N/A | N/A |
| FACTIFY-5WQA: 5W Aspect-based Fact Verification through Question Answering | Anku Rani, S.M Towhidul Islam Tonmoy, Dwip Dalal, Shreya Gautam, Megha Chakraborty, Aman Chadha, Amit Sheth and Amitava Das | N/A | N/A |
| Conjunct Resolution in the Face of Verbal Omissions | Royi Rassin, Yoav Goldberg and Reut Tsarfaty | N/A | N/A |
| Parallel Context Windows for Large Language Models | Nir Ratner, Yoav Levine, Yonatan Belinkov, Ori Ram, Inbal Magar, Omri Abend, Ehud Karpas, Amnon Shashua, Kevin Leyton-Brown and Yoav Shoham | N/A | N/A |
| Linear Guardedness and its Implications | Shauli Ravfogel, Yoav Goldberg and Ryan Cotterell | N/A | N/A |
| TOME: A Two-stage Approach for Model-based Retrieval | Ruiyang Ren, Wayne Xin Zhao, Jing Liu, Hua Wu, Ji-Rong Wen and Haifeng Wang | N/A | N/A |
| Retrieve-and-Sample: Document-level Event Argument Extraction via Hybrid Retrieval Augmentation | Yubing Ren, Yanan Cao, Ping Guo, Fang Fang, Wei Ma and Zheng Lin | N/A | N/A |
| Tailoring Instructions to Student’s Learning Levels Boosts Knowledge Distillation | Yuxin Ren, Zihan Zhong, Xingjian Shi, Yi Zhu, Chun Yuan and Mu Li | N/A | N/A |
| Exploring Large Language Models for Classical Philology | Frederick Riemenschneider and Anette Frank | N/A | N/A |
| Factually Consistent Summarization via Reinforcement Learning with Textual Entailment Feedback | Paul Roit, Johan Ferret, Lior Shani, Roee Aharoni, Geoffrey Cideron, Robert Dadashi, Matthieu Geist, Sertan Girgin, Leonard Hussenot, Orgad Keller, Nikola Momchev, Sabela Ramos Garea, Piotr Stanczyk, Nino Vieillard, Olivier Bachem, Gal Elidan, Avinatan Hassidim, Olivier Pietquin and Idan Szpektor | N/A | N/A |
| Finding the SWEET Spot: Analysis and Improvement of Adaptive Inference in Low Resource Settings | Daniel Rotem, Michael Hassid, Jonathan Mamou and Roy Schwartz | N/A | N/A |
| Distributed Marker Representation for Ambiguous Discourse Markers and Entangled Relations | Dongyu Ru, Lin Qiu, Xipeng Qiu, Yue Zhang and Zheng Zhang | N/A | N/A |
| A Dataset of Argumentative Dialogues on Scientific Papers | Federico Ruggeri, Mohsen Mesgar and Iryna Gurevych | N/A | N/A |
| Helping a Friend or Supporting a Cause? Disentangling Active and Passive Cosponsorship in the U.S. Congress | Giuseppe Russo, Christoph Gote, Laurence Brandenberger, Sophia Johanna Schlosser and Frank Schweitzer | N/A | N/A |
| Revisiting non-English Text Simplification: A Unified Multilingual Benchmark | Michael Ryan, Tarek Naous and Wei Xu | N/A | N/A |
| ArgU: A Controllable Factual Argument Generator | Sougata Saha and Rohini Srihari | N/A | N/A |
| IndicMT Eval: A Dataset to Meta-Evaluate Machine Translation Metrics for Indian Languages | Ananya Sai B, Tanay Dixit, Vignesh Nagarajan, Anoop Kunchukuttan, Pratyush Kumar, Mitesh M. Khapra and Raj Dabre | N/A | N/A |
| Hidden Schema Networks | Ramses Sanchez, Lukas Conrads, Pascal Welke, Kostadin Cvejoski and Cesar Ojeda Marin | N/A | N/A |
| Accelerating Transformer Inference for Translation via Parallel Decoding | Andrea Santilli, Silvio Severino, Emilian Postolache, Valentino Maiorca, Michele Mancusi, Riccardo Marin and Emanuele Rodola | N/A | N/A |
| NLPositionality: Characterizing Design Biases of Datasets and Models | Sebastin Santy, Jenny Liang, Ronan Le Bras, Katharina Reinecke and Maarten Sap | N/A | N/A |
| APOLLO: A Simple Approach for Adaptive Pretraining of Language Models for Logical Reasoning | Soumya Sanyal, Yichong Xu, Shuohang Wang, Ziyi Yang, Reid Pryzant, Wenhao Yu, Chenguang Zhu and Xiang Ren | N/A | N/A |
| VendorLink: An NLP approach for Identifying & Linking Vendor Migrants & Potential Aliases on Darknet Markets | Vageesh Saxena, Nils Rethmeier, Gijs van Dijck and Gerasimos Spanakis | N/A | N/A |
| Multilingual Conceptual Coverage in Text-to-Image Models | Michael Saxon and William Yang Wang | N/A | N/A |
| Tree-Based Representation and Generation of Natural and Mathematical Language | Alexander Scarlatos and Andrew Lan | N/A | N/A |
| Free Lunch: Robust Cross-Lingual Transfer via Model Checkpoint Averaging | Fabian Schmidt, Ivan Vulić and Goran Glavaš | N/A | N/A |
| Minding Language Models’ (Lack of) Theory of Mind: A Plug-and-Play Multi-Character Belief Tracker | Melanie Sclar, Sachin Kumar, Peter West, Alane Suhr, Yejin Choi and Yulia Tsvetkov | N/A | N/A |
| Ranking-Enhanced Unsupervised Sentence Representation Learning | Yeon Seonwoo, Guoyin Wang, Changmin Seo, Sajal Choudhary, Jiwei Li, Xiang Li, Puyang Xu, Sunghyun Park and Alice Oh | N/A | N/A |
| Training-free Neural Architecture Search for RNNs and Transformers | Aaron Serianni and Jugal Kalita | N/A | N/A |
| Trillion Dollar Words: A New Financial Dataset, Task & Market Analysis | Agam Shah, Suvan Paturi and Sudheer Chava | N/A | N/A |
| Causes and Cures for Interference in Multilingual Translation | Uri Shaham, Maha Elbayad, Vedanuj Goswami, Omer Levy and Shruti Bhosale | N/A | N/A |
| On Second Thought, Let’s Not Think Step by Step! Bias and Toxicity in Zero-Shot Reasoning | Omar Shaikh, Hongxin Zhang, William Held, Michael Bernstein and Diyi Yang | N/A | N/A |
| Cognitive Reframing of Negative Thoughts through Human-Language Model Interaction | Ashish Sharma, Kevin Rushton, Inna Lin, David Wadden, Khendra Lucas, Adam Miner, Theresa Nguyen and Tim Althoff | N/A | N/A |
| Learning Non-linguistic Skills without Sacrificing Linguistic Proficiency | Mandar Sharma, Nikhil Muralidhar and Naren Ramakrishnan | N/A | N/A |
| When and how to paraphrase for named entity recognition? | Saket Sharma, Aviral Joshi, Yiyun Zhao, Namrata Mukhija, Hanoz Bhathena, Prateek Singh and Sashank Santhanam | N/A | N/A |
| MEMEX: Detecting Explanatory Evidence for Memes via Knowledge-Enriched Contextualization | Shivam Sharma, Ramaneswaran S, Udit Arora, Md. Shad Akhtar and Tanmoy Chakraborty | N/A | N/A |
| Dense-ATOMIC: Towards Densely-connected ATOMIC with High Knowledge Coverage and Massive Multi-hop Paths | Xiangqing Shen, Siwei Wu and Rui Xia | N/A | N/A |
| PromptNER: Prompt Locating and Typing for Named Entity Recognition | Yongliang Shen, Zeqi Tan, Shuhui Wu, Wenqi Zhang, Rongsheng Zhang, Yadong Xi, Weiming Lu and Yueting Zhuang | N/A | N/A |
| DiffusionNER: Boundary Diffusion for Named Entity Recognition | Yongliang Shen, Kaitao Song, Xu Tan, Dongsheng Li, Weiming Lu and Yueting Zhuang | N/A | N/A |
| MultiEMO: An Attention-Based Correlation-Aware Multimodal Fusion Framework for Emotion Recognition in Conversations | Tao Shi and Shao-Lun Huang | N/A | N/A |
| MidMed: Towards Mixed-Type Dialogues for Medical Consultation | Xiaoming Shi, Zeming Liu, Chuan Wang, Haitao Leng, Kui Xue, Xiaofan Zhang and Shaoting Zhang | N/A | N/A |
| RADE: Reference-Assisted Dialogue Evaluation for Open-Domain Dialogue | Zhengliang Shi, Weiwei Sun, Shuo Zhang, Zhen Zhang, Pengjie Ren and Zhaochun Ren | N/A | N/A |
| Pivotal Role of Language Modeling in Recommender Systems: Enriching Task-specific and Task-agnostic Representation Learning | Kyuyong Shin, Hanock Kwak, Wonjae Kim, Jisu Jeong, Seungjae Jung, Kyungmin Kim, Jung-Woo Ha and Sang-Woo Lee | N/A | N/A |
| SLUE Phase-2: A Benchmark Suite of Diverse Spoken Language Understanding Tasks | Suwon Shon, Siddhant Arora, Chyi-Jiunn Lin, Ankita Pasad, Felix Wu, Roshan S Sharma, Wei-Lun Wu, Hung-yi Lee, Karen Livescu and Shinji Watanabe | N/A | N/A |
| Evaluate AMR Graph Similarity via Self-supervised Learning | Ziyi Shou and Fangzhen Lin | N/A | N/A |
| Measuring Inductive Biases of In-Context Learning with Underspecified Demonstrations | Chenglei Si, Dan Friedman, Nitish Joshi, Shi Feng, Danqi Chen and He He | N/A | N/A |
| READIN: A Chinese Multi-Task Benchmark with Realistic and Diverse Input Noises | Chenglei Si, Zhengyan Zhang, Yingfa Chen, Xiaozhi Wang, Zhiyuan Liu and Maosong Sun | N/A | N/A |
| Combo of Thinking and Observing for Outside-Knowledge VQA | Qingyi Si, Yuchen Mo, Zheng Lin, Huishan Ji and Weiping Wang | N/A | N/A |
| Learning to Generate Equitable Text in Dialogue from Biased Training Data | Anthony Sicilia and Malihe Alikhani | N/A | N/A |
| BIG-C: a Multimodal Multi-Purpose Dataset for Bemba | Claytone Sikasote, Eunice Mukonde, Md Mahfuz Ibn Alam and Antonios Anastasopoulos | N/A | N/A |
| Peeking inside the black box: A Commonsense-aware Generative Framework for Explainable Complaint Detection | Apoorva Singh, Raghav Jain, Prince Jha and Sriparna Saha | N/A | N/A |
| Forgotten Knowledge: Examining the Citational Amnesia in NLP | Janvijay Singh, Mukund Rungta, Diyi Yang and Saif Mohammad | N/A | N/A |
| EEL: Efficiently Encoding Lattices for Reranking | Prasann Singhal, Jiacheng Xu, Xi Ye and Greg Durrett | N/A | N/A |
| Language model acceptability judgements are not always robust to context | Koustuv Sinha, Jon Gauthier, Aaron Mueller, Kanishka Misra, Keren Fuentes, Roger Levy and Adina Williams | N/A | N/A |
| FERMAT: An Alternative to Accuracy for Numerical Reasoning | Jasivan Sivakumar and Nafise Sadat Moosavi | N/A | N/A |
| To Revise or Not to Revise: Learning to Detect Improvable Claims for Argumentative Writing Support | Gabriella Skitalinskaya and Henning Wachsmuth | N/A | N/A |
| A Simple and Flexible Modeling for Mental Disorder Detection by Learning from Clinical Questionnaires | Hoyun Song, Jisu Shin, Huije Lee and Jong Park | N/A | N/A |
| Peer-Label Assisted Hierarchical Text Classification | Junru Song, Feifei Wang and Yang Yang | N/A | N/A |
| MatSci-NLP: Evaluating Scientific Language Models on Materials Science Language Tasks Using Text-to-Schema Modeling | Yu Song, Santiago Miret and Bang Liu | N/A | N/A |
| Grounding Characters and Places in Narrative Text | Sandeep Soni, Amanpreet Sihra, Elizabeth Evans, Matthew Wilkens and David Bamman | N/A | N/A |
| Unsupervised Extractive Summarization of Emotion Triggers | Tiberiu Sosea, Hongli Zhan, Junyi Jessy Li and Cornelia Caragea | N/A | N/A |
| Local Byte Fusion for Neural Machine Translation | Makesh Narsimhan Sreedhar, Xiangpeng Wan, Yu Cheng and Junjie Hu | N/A | N/A |
| Why Did the Chicken Cross the Road? Rephrasing and Analyzing Ambiguous Questions in VQA | Elias Stengel-Eskin, Jimena Guallar-Blasco, Yi Zhou and Benjamin Van Durme | N/A | N/A |
| DEplain: A German Parallel Corpus with Intralingual Translations into Plain Language for Sentence and Document Simplification | Regina Stodden, Omar Momen and Laura Kallmeyer | N/A | N/A |
| An Ordinal Latent Variable Model of Conflict Intensity | Niklas Stoehr, Lucas Torroba Hennigen, Josef Valvoda, Robert West, Ryan Cotterell and Aaron Schein | N/A | N/A |
| A Causal Framework to Quantify the Robustness of Mathematical Reasoning with Language Models | Alessandro Stolfo, Zhijing Jin, Kumar Shridhar, Bernhard Schoelkopf and Mrinmaya Sachan | N/A | N/A |
| Unsupervised Selective Rationalization with Noise Injection | Adam Storek, Melanie Subbiah and Kathleen McKeown | N/A | N/A |
| NLP Reproducibility For All: Understanding Experiences of Beginners | Shane Storks, Keunwoo Yu, Ziqiao Ma and Joyce Chai | N/A | N/A |
| WikiBio: a Semantic Resource for the Intersectional Analysis of Biographical Events | Marco Antonio Stranisci, Rossana Damiano, Enrico Mensa, Viviana Patti, Daniele Radicioni and Tommaso Caselli | N/A | N/A |
| History Semantic Graph Enhanced Conversational KBQA with Temporal Information Modeling | Hao Sun, Yang Li, Liwei Deng, Bowen Li, Binyuan Hui, Binhua Li, Yunshi Lan, Yan Zhang and Yongbin Li | N/A | N/A |
| MoralDial: A Framework to Train and Evaluate Moral Dialogue Systems via Moral Discussions | Hao Sun, Zhexin Zhang, Fei Mi, Yasheng Wang, Wei Liu, Jianwei Cui, Bin Wang, Qun Liu and Minlie Huang | N/A | N/A |
| Dialect-robust Evaluation of Generated Text | Jiao Sun, Thibault Sellam, Elizabeth Clark, Tu Vu, Timothy Dozat, Dan Garrette, Aditya Siddhant, Jacob Eisenstein and Sebastian Gehrmann | N/A | N/A |
| Layer-wise Fusion with Modality Independence Modeling for Multi-modal Emotion Recognition | Jun Sun, Shoukang Han, Yu-Ping Ruan, Xiaoning Zhang, Shu-Kai Zheng, Yulong Liu, Yuxin Huang and Taihao Li | N/A | N/A |
| From Characters to Words: Hierarchical Pre-trained Language Model for Open-vocabulary Language Understanding | Li Sun, Florian Luisier, Kayhan Batmanghelich, Dinei Florencio and Cha Zhang | N/A | N/A |
| Uncertainty Guided Label Denoising for Document-level Distant Relation Extraction | Qi Sun, Kun Huang, Xiaocui Yang, Pengfei Hong, Kun Zhang and Soujanya Poria | N/A | N/A |
| Multitask Pre-training of Modular Prompt for Chinese Few-Shot Learning | Tianxiang Sun, Zhengfu He, Qin Zhu, Xipeng Qiu and Xuanjing Huang | N/A | N/A |
| Backdooring Neural Code Search | Weisong Sun, Yuchen Chen, Guanhong Tao, Chunrong Fang, Xiangyu Zhang, Quanjun Zhang and Bin Luo | N/A | N/A |
| Answering Ambiguous Questions via Iterative Prompting | Weiwei Sun, Hengyi Cai, Hongshen Chen, Pengjie Ren, Zhumin Chen, Maarten de Rijke and Zhaochun Ren | N/A | N/A |
| A Length-Extrapolatable Transformer | Yutao Sun, Li Dong, Barun Patra, Shuming Ma, Shaohan Huang, Alon Benhaim, Vishrav Chaudhary, Xia Song and Furu Wei | N/A | N/A |
| IDRISI-RA: The First Arabic Location Mention Recognition Dataset of Disaster Tweets | Reem Suwaileh, Muhammad Imran and Tamer Elsayed | N/A | N/A |
| Why Aren’t We NER Yet? Artifacts of ASR Errors in Named Entity Recognition in Spontaneous Speech Transcripts | Piotr Szymański, Lukasz Augustyniak, Mikolaj Morzy, Adrian Szymczak, Krzysztof Surdyk and Piotr Żelasko | N/A | N/A |
| Towards Benchmarking and Improving the Temporal Reasoning Capability of Large Language Models | Qingyu Tan, Hwee Tou Ng and Lidong Bing | N/A | N/A |
| VisText: A Benchmark for Semantically Rich Chart Captioning | Benny Tang, Angie Boggust and Arvind Satyanarayan | N/A | N/A |
| Enhancing Dialogue Generation via Dynamic Graph Knowledge Aggregation | Chen Tang, Hongbo Zhang, Tyler Loakman, Chenghua Lin and Frank Guerin | N/A | N/A |
| Understanding Factual Errors in Summarization: Errors, Summarizers, Datasets, Error Detectors | Liyan Tang, Tanya Goyal, Alex Fabbri, Philippe Laban, Jiacheng Xu, Semih Yavuz, Wojciech Kryscinski, Justin Rousseau and Greg Durrett | N/A | N/A |
| What the DAAM: Interpreting Stable Diffusion Using Cross Attention | Raphael Tang, Linqing Liu, Akshat Pandey, Zhiying Jiang, Gefei Yang, Karun Kumar, Pontus Stenetorp, Jimmy Lin and Ferhan Ture | N/A | N/A |
| Multilingual Knowledge Graph Completion with Language-Sensitive Multi-Graph Attention | Rongchuan Tang, Yang Zhao, Chengqing Zong and Yu Zhou | N/A | N/A |
| Learning to Imagine: Visually-Augmented Natural Language Generation | Tianyi Tang, Yushuo Chen, Yifan Du, Junyi Li, Wayne Xin Zhao and Ji-Rong Wen | N/A | N/A |
| Learning Dynamic Contextualised Word Embeddings via Template-based Temporal Adaptation | Xiaohang Tang, Yi Zhou and Danushka Bollegala | N/A | N/A |
| Enhancing Personalized Dialogue Generation with Contrastive Latent Variables: Combining Sparse and Dense Persona | Yihong Tang, Bo Wang, Miao Fang, Dongming Zhao, Kun Huang, Ruifang He and Yuexian Hou | N/A | N/A |
| Hybrid Transducer and Attention based Encoder-Decoder Modeling for Speech-to-Text Tasks | Yun Tang, Anna Sun, Hirofumi Inaguma, Xinyue Chen, Ning Dong, Xutai Ma, Paden Tomasello and Juan Pino | N/A | N/A |
| Multilingual LLMs are Better Cross-lingual In-context Learners with Alignment | Eshaan Tanwar, Subhabrata Dutta, Manish Borthakur and Tanmoy Chakraborty | N/A | N/A |
| CORE: Cooperative Training of Retriever-Reranker for Effective Dialogue Response Selection | Chongyang Tao, Jiazhan Feng, Tao Shen, Chang Liu, Juntao Li, Xiubo Geng and Daxin Jiang | N/A | N/A |
| UniEvent: Unified Generative Model with Multi-Dimensional Prefix for Zero-Shot Event-Relational Reasoning | Zhengwei Tao, Zhi Jin, Haiyan Zhao, Chengfeng Dou, Yongqiang Zhao, Tao Shen and Chongyang Tao | N/A | N/A |
| What’s the Meaning of Superhuman Performance in Today’s NLU? | Simone Tedeschi, Johan Bos, Thierry Declerck, Jan Hajič, Daniel Hershcovich, Eduard Hovy, Alexander Koller, Simon Krek, Steven Schockaert, Rico Sennrich, Ekaterina Shutova and Roberto Navigli | N/A | N/A |
| We Understand Elliptical Sentences, and Language Models should Too: A New Dataset for Studying Ellipsis and its Interaction with Thematic Fit | Davide Testa, Emmanuele Chersoni and Alessandro Lenci | N/A | N/A |
| Being Right for Whose Right Reasons? | Terne Sasha Thorn Jakobsen, Laura Cabello and Anders Søgaard | N/A | N/A |
| Dynamic Routing Transformer Network for Multimodal Sarcasm Detection | Yuan Tian, Nan Xu, Ruike Zhang and Wenji Mao | N/A | N/A |
| Unsupervised Melody-to-Lyrics Generation | Yufei Tian, Anjali Narayan-Chen, Shereen Oraby, Alessandra Cervone, Gunnar Sigurdsson, Chenyang Tao, Wenbo Zhao, Tagyoung Chung, Jing Huang and Nanyun Peng | N/A | N/A |
| A New Aligned Simple German Corpus | Vanessa Toborek, Moritz Busch, Malte Boßert, Christian Bauckhage and Pascal Welke | N/A | N/A |
| Are Fairy Tales Fair? Analyzing Gender Bias in Temporal Narrative Event Chains of Children’s Fairy Tales | Paulina Toro Isaza, Guangxuan Xu, Toye Oloko, Yufang Hou, Nanyun Peng and Dakuo Wang | N/A | N/A |
| Model-Based Simulation for Optimising Smart Reply | Benjamin Towle and Ke Zhou | N/A | N/A |
| CREST: A Joint Framework for Rationalization and Counterfactual Text Generation | Marcos Treviso, Alexis Ross, Nuno M. Guerreiro and André Martins | N/A | N/A |
| Interleaving Retrieval with Chain-of-Thought Reasoning for Knowledge-Intensive Multi-Step Questions | Harsh Trivedi, Niranjan Balasubramanian, Tushar Khot and Ashish Sabharwal | N/A | N/A |
| LayoutMask: Enhance Text-Layout Interaction in Multi-modal Pre-training for Document Understanding | Yi Tu, Ya Guo, Huan Chen and Jinyang Tang | N/A | N/A |
| Is Fine-tuning Needed? Pre-trained Language Models Are Near Perfect for Out-of-Domain Detection | Rheeya Uppaal, Junjie Hu and Yixuan Li | N/A | N/A |
| Curriculum Learning for Graph Neural Networks: A Multiview Competence-based Approach | Nidhi Vakil and Hadi Amiri | N/A | N/A |
| Transfer and Active Learning for Dissonance Detection: Addressing the Rare-Class Challenge | Vasudha Varadarajan, Swanie Juhng, Syeda Mahwish, Xiaoran Liu, Jonah Luby, Christian Luhmann and H. Andrew Schwartz | N/A | N/A |
| Post-Abstention: Towards Reliably Re-Attempting the Abstained Instances in QA | Neeraj Varshney and Chitta Baral | N/A | N/A |
| Hybrid Uncertainty Quantification for Selective Text Classification in Ambiguous Tasks | Artem Vazhentsev, Gleb Kuzmin, Akim Tsvigun, Alexander Panchenko, Maxim Panov, Mikhail Burtsev and Artem Shelmanov | N/A | N/A |
| Prompting PaLM for Translation: Assessing Strategies and Performance | David Vilar, Markus Freitag, Colin Cherry, Jiaming Luo, Viresh Ratnakar and George Foster | N/A | N/A |
| DataFinder: Scientific Dataset Recommendation from Natural Language Descriptions | Vijay Viswanathan, Luyu Gao, Tongshuang Wu, Pengfei Liu and Graham Neubig | N/A | N/A |
| Does GPT-3 Grasp Metaphors? Identifying Metaphor Mappings with Generative Language Models | Lennart Wachowiak and Dagmar Gromann | N/A | N/A |
| Revisiting Relation Extraction in the era of Large Language Models | Somin Wadhwa, Silvio Amir and Byron C. Wallace | N/A | N/A |
| Multi-Grained Knowledge Retrieval for End-to-End Task-Oriented Dialog | Fanqi Wan, Weizhou Shen, Ke Yang, Xiaojun Quan and Wei Bi | N/A | N/A |
| Joint Document-Level Event Extraction via Token-Token Bidirectional Event Completed Graph | Qizhi Wan, Changxuan Wan, Keli Xiao, Dexi Liu, Chenliang Li, Bolong Zheng, Xiping Liu and Rong Hu | N/A | N/A |
| Towards Understanding Chain-of-Thought Prompting: An Empirical Study of What Matters | Boshi Wang, Sewon Min, Xiang Deng, Jiaming Shen, You Wu, Luke Zettlemoyer and Huan Sun | N/A | N/A |
| Simple and Effective Unsupervised Speech Translation | Changhan Wang, Hirofumi Inaguma, Peng-Jen Chen, Ilia Kulikov, Yun Tang, Wei-Ning Hsu, Michael Auli and Juan Pino | N/A | N/A |
| Aggregating Multiple Heuristic Signals as Supervision for Unsupervised Automated Essay Scoring | Cong Wang, Zhiwei Jiang, Yafeng Yin, Zifeng Cheng, Shiping Ge and Qing Gu | N/A | N/A |
| DT-Solver: Automated Theorem Proving with Dynamic-Tree Sampling Guided by Proof-level Value Function | Haiming Wang, Ye Yuan, Zhengying Liu, Jianhao Shen, Yichun Yin, Jing Xiong, Enze Xie, Han Shi, Yujun Li, Lin Li, Jian Yin, Zhenguo Li and Xiaodan Liang | N/A | N/A |
| CoAD: Automatic Diagnosis through Symptom and Disease Collaborative Generation | Huimin Wang, Wai Chung Kwan, Kam-Fai Wong and Yefeng Zheng | N/A | N/A |
| Towards Unifying Multi-Lingual and Cross-Lingual Summarization | Jiaan Wang, Fandong Meng, Duo Zheng, Yunlong Liang, Zhixu Li, Jianfeng Qu and Jie Zhou | N/A | N/A |
| Easy Guided Decoding in Providing Suggestions for Interactive Machine Translation | Ke Wang, Xin Ge, Jiayi Wang, Yuqi Zhang and Yu Zhao | N/A | N/A |
| Plan-and-Solve Prompting: Improving Zero-Shot Chain-of-Thought Reasoning by Large Language Models | Lei Wang, Wanyu Xu, Yihuai Lan, Zhiqiang Hu, Yunshi Lan, Roy Ka-Wei Lee and Ee-Peng Lim | N/A | N/A |
| SimLM: Pre-training with Representation Bottleneck for Dense Passage Retrieval | Liang Wang, Nan Yang, Xiaolong Huang, Binxing Jiao, Linjun Yang, Daxin Jiang, Rangan Majumder and Furu Wei | N/A | N/A |
| Two-Stage Fine-Tuning for Improved Bias and Variance for Large Pretrained Language Models | Lijing Wang, Yingya Li, Timothy Miller, Steven Bethard and Guergana Savova | N/A | N/A |
| A Theory of Unsupervised Speech Recognition | Liming Wang, Mark Hasegawa-Johnson and Chang Yoo | N/A | N/A |
| KGA: A General Machine Unlearning Framework Based on Knowledge Gap Alignment | Lingzhi Wang, Tong Chen, Wei Yuan, Xingshan Zeng, Kam-Fai Wong and Hongzhi Yin | N/A | N/A |
| A Survey on Zero Pronoun Translation | Longyue Wang, Siyou Liu, Mingzhou Xu, Linfeng Song, Shuming Shi and Zhaopeng Tu | N/A | N/A |
| Automated Metrics for Medical Multi-Document Summarization Disagree with Human Evaluations | Lucy Lu Wang, Yulia Otmakhova, Jay DeYoung, Thinh Hung Truong, Bailey Kuehl, Erin Bransom and Byron Wallace | N/A | N/A |
| SCOTT: Self-Consistent Chain-of-Thought Distillation | Peifeng Wang, Zhengyang Wang, Zheng Li, Yifan Gao, Bing Yin and Xiang Ren | N/A | N/A |
| MUSTIE: Multimodal Structural Transformer for Web Information Extraction | Qifan Wang, Jingang Wang, Xiaojun Quan, Fuli Feng, Zenglin Xu, Shaoliang Nie, Sinong Wang, Madian Khabsa, Hamed Firooz and Dongfang Liu | N/A | N/A |
| Divide, Conquer, and Combine: Mixture of Semantic-Independent Experts for Zero-Shot Dialogue State Tracking | Qingyue Wang, Liang Ding, Yanan Cao, Yibing Zhan, Zheng Lin, Shi Wang, Dacheng Tao and Li Guo | N/A | N/A |
| Retrieval-free Knowledge Injection through Multi-Document Traversal for Dialogue Models | Rui Wang, Jianzhu Bao, Fei Mi, Yi Chen, Hongru Wang, Yasheng Wang, Yitong Li, Lifeng Shang, Kam-Fai Wong and Ruifeng Xu | N/A | N/A |
| ReCode: Robustness Evaluation of Code Generation Models | Shiqi Wang, Zheng Li, Haifeng Qian, Chenghao Yang, Zijian Wang, Mingyue Shang, Varun Kumar, Samson Tan, Baishakhi Ray, Parminder Bhatia, Ramesh Nallapati, Murali Krishna Ramanathan, Dan Roth and Bing Xiang | N/A | N/A |
| Better Simultaneous Translation with Monotonic Knowledge Distillation | Shushu Wang, Jing Wu, Kai Fan, Wei Luo, Jun Xiao and Zhongqiang Huang | N/A | N/A |
| Query Structure Modeling for Inductive Logical Reasoning Over Knowledge Graphs | Siyuan Wang, Zhongyu Wei, Meng Han, Zhihao Fan, Haijun Shan, Qi Zhang and Xuanjing Huang | N/A | N/A |
| CAT: A Contextualized Conceptualization and Instantiation Framework for Commonsense Reasoning | Weiqi Wang, Tianqing Fang, Baixuan Xu, Chun Yi Louis Bo, Yangqiu Song and Lei Chen | N/A | N/A |
| Elaboration-Generating Commonsense Question Answering at Scale | Wenya Wang, Vivek Srikumar, Hannaneh Hajishirzi and Noah A. Smith | N/A | N/A |
| Effective Contrastive Weighting for Dense Query Expansion | Xiao Wang, Sean MacAvaney, Craig Macdonald and Iadh Ounis | N/A | N/A |
| Code4Struct: Code Generation for Few-Shot Event Structure Prediction | Xingyao Wang, Sha Li and Heng Ji | N/A | N/A |
| Document-Level Multi-Event Extraction with Event Proxy Nodes and Hausdorff Distance Minimization | Xinyu Wang, Lin Gui and Yulan He | N/A | N/A |
| PESCO: Prompt-enhanced Self Contrastive Learning for Zero-shot Text Classification | Yau-Shian Wang, Ta-Chung Chi, Ruohong Zhang and Yiming Yang | N/A | N/A |
| Weakly-Supervised Spoken Video Grounding via Semantic Interaction Learning | Ye Wang, Wang Lin, Shengyu Zhang, Tao Jin, Linjun Li, Xize Cheng and Zhou Zhao | N/A | N/A |
| Element-aware Summarization with Large Language Models: Expert-aligned Evaluation and Chain-of-Thought Method | Yiming Wang, Zhuosheng Zhang and Rui Wang | N/A | N/A |
| Self-Instruct: Aligning Language Models with Self-Generated Instructions | Yizhong Wang, Yeganeh Kordi, Swaroop Mishra, Alisa Liu, Noah A. Smith, Daniel Khashabi and Hannaneh Hajishirzi | N/A | N/A |
| Dynamic Heterogeneous-Graph Reasoning with Language Models and Knowledge Representation Learning for Commonsense Question Answering | Yujie Wang, Hu Zhang, Jiye Liang and Ru Li | N/A | N/A |
| GreenKGC: A Lightweight Knowledge Graph Completion Method | Yun Cheng Wang, Xiou Ge, Bin Wang and C.-C. Jay Kuo | N/A | N/A |
| VSTAR: A Video-grounded Dialogue Dataset for Situated Semantic Understanding with Scene and Topic Transitions | Yuxuan Wang, Zilong Zheng, Xueliang Zhao, Jinpeng Li, Yueqian Wang and Dongyan Zhao | N/A | N/A |
| COLA: Contextualized Commonsense Causal Reasoning from the Causal Inference Perspective | Zhaowei Wang, Quyet V. Do, Hongming Zhang, Jiayao Zhang, Weiqi Wang, Tianqing Fang, Yangqiu Song, Ginny Y. Wong and Simon See | N/A | N/A |
| RMLM: A Flexible Defense Framework for Proactively Mitigating Word-level Adversarial Attacks | Zhaoyang Wang, Zhiyue Liu, Xiaopeng Zheng, Qinliang Su and Jiahai Wang | N/A | N/A |
| Rehearsal-free Continual Language Learning via Efficient Parameter Isolation | Zhicheng Wang, Yufang Liu, Tao Ji, Xiaoling Wang, Yuanbin Wu, Congcong Jiang, Ye Chao, Zhencong Han, Ling Wang, Xu Shao and Wenqiu Zeng | N/A | N/A |
| Faithful Low-Resource Data-to-Text Generation through Cycle Training | Zhuoer Wang, Marcus Collins, Nikhita Vedula, Simone Filice, Shervin Malmasi and Oleg Rokhlenko | N/A | N/A |
| On Evaluating Multilingual Compositional Generalization with Translated Datasets | Zi Wang and Daniel Hershcovich | N/A | N/A |
| DiffusionDB: A Large-scale Prompt Gallery Dataset for Text-to-Image Generative Models | Zijie J. Wang, Evan Montoya, David Munechika, Haoyang Yang, Benjamin Hoover and Duen Horng Chau | N/A | N/A |
| What social attitudes about gender does BERT encode? Leveraging insights from psycholinguistics | Julia Watson, Barend Beekhuizen and Suzanne Stevenson | N/A | N/A |
| Subjective Crowd Disagreements for Subjective Data: Uncovering Meaningful CrowdOpinion with Population-level Learning | Tharindu Cyril Weerasooriya, Sarah Luger, Saloni Poddar, Ashiqur KhudaBukhsh and Christopher Homan | N/A | N/A |
| Text Style Transfer Back-Translation | Daimeng Wei, Zhanglin Wu, Hengchao Shang, Zongyao Li, Minghan Wang, Jiaxin Guo, Xiaoyu Chen, Zhengzhe Yu and Hao Yang | N/A | N/A |
| Guide the Many-to-One Assignment: Open Information Extraction via IoU-aware Optimal Transport | Kaiwen Wei, Yiran Yang, Li Jin, Xian Sun, Zequn Zhang, Jingyuan Zhang, Xiao yu Li, Linhao Zhang, Jintao Liu and Guo Zhi | N/A | N/A |
| Tackling Modality Heterogeneity with Multi-View Calibration Network for Multimodal Sentiment Detection | Yiwei Wei, Shaozu Yuan, Ruosong Yang, Lei Shen, Zhangmeizhi Li, Longbiao Wang and Meng Chen | N/A | N/A |
| f-Divergence Minimization for Sequence-Level Knowledge Distillation | Yuqiao Wen, Zichao Li, Wenyu Du and Lili Mou | N/A | N/A |
| WebIE: Faithful and Robust Information Extraction on the Web | Chenxi Whitehouse, Clara Vania, Alham Fikri Aji, Christos Christodoulopoulos and Andrea Pierleoni | N/A | N/A |
| Trigger Warning Assignment as a Multi-Label Document Classification Problem | Matti Wiegmann, Magdalena Wolska, Christopher Schröder, Ole Borchardt, Benno Stein and Martin Potthast | N/A | N/A |
| Beyond Contrastive Learning: A Variational Generative Model for Multilingual Retrieval | John Wieting, Jonathan Clark, William Cohen, Graham Neubig and Taylor Berg-Kirkpatrick | N/A | N/A |
| BREAK: Breaking the Dialogue State Tracking Barrier with Beam Search and Re-ranking | Seungpil Won, Heeyoung Kwak, Joongbo Shin, Janghoon Han and Kyomin Jung | N/A | N/A |
| lilGym: Natural Language Visual Reasoning with Reinforcement Learning | Anne Wu, Kiante Brantley, Noriyuki Kojima and Yoav Artzi | N/A | N/A |
| Rethinking Masked Language Modeling for Chinese Spelling Correction | Hongqiu Wu, Shaohua Zhang, Yuchen Zhang and Hai Zhao | N/A | N/A |
| Connective Prediction for Implicit Discourse Relation Recognition via Knowledge Distillation | Hongyi Wu, Hao Zhou, Man Lan, Yuanbin Wu and Yadong Zhang | N/A | N/A |
| Multi-Level Knowledge Distillation for Out-of-Distribution Detection in Text | Qianhui Wu, Huiqiang Jiang, Haonan Yin, Börje Karlsson and Chin-Yew Lin | N/A | N/A |
| WSPAlign: Word Alignment Pre-training via Large-Scale Weakly Supervised Span Prediction | Qiyu Wu, Masaaki Nagata and Yoshimasa Tsuruoka | N/A | N/A |
| Ambiguous Learning from Retrieval: Towards Zero-shot Semantic Parsing | Shan Wu, Chunlei Xin, Hongyu Lin, Xianpei Han, Cao Liu, Jiansong Chen, Fan Yang, Guanglu Wan and Le Sun | N/A | N/A |
| Denoising Bottleneck with Mutual Information Maximization for Video Multimodal Fusion | Shaoxiang Wu, Damai Dai, Ziwei Qin, Tianyu Liu, Binghuai Lin, Yunbo Cao and Zhifang Sui | N/A | N/A |
| Information Screening whilst Exploiting! Multimodal Relation Extraction with Feature Denoising and Multimodal Topic Modeling | Shengqiong Wu, Hao Fei, Yixin Cao, Lidong Bing and Tat-Seng Chua | N/A | N/A |
| Cross2StrA: Unpaired Cross-lingual Image Captioning with Cross-lingual Cross-modal Structure-pivoted Alignment | Shengqiong Wu, Hao Fei, Wei Ji and Tat-Seng Chua | N/A | N/A |
| AD-KD: Attribution-Driven Knowledge Distillation for Language Model Compression | Siyue Wu, Hongzhan Chen, Xiaojun Quan, Qifan Wang and Rui Wang | N/A | N/A |
| SIMMC-VR: A Task-oriented Multimodal Dialog Dataset with Situated and Immersive VR Streams | Te-Lin Wu, Satwik Kottur, Andrea Madotto, Mahmoud Azab, Pedro Rodriguez, Babak Damavandi, Nanyun Peng and Seungwhan Moon | N/A | N/A |
| Learning Action Conditions from Instructional Manuals for Instruction Understanding | Te-Lin Wu, Caiqi Zhang, Qingyuan Hu, Alexander Spangher and Nanyun Peng | N/A | N/A |
| Towards Zero-Shot Multilingual Transfer for Code-Switched Responses | Ting-Wei Wu, Changsheng Zhao, Ernie Chang, Yangyang Shi, Pierce Chuang, Vikas Chandra and Biing Juang | N/A | N/A |
| Do PLMs Know and Understand Ontological Knowledge? | Weiqi Wu, Chengyue Jiang, Yong Jiang, Pengjun Xie and Kewei Tu | N/A | N/A |
| Estimating the Uncertainty in Emotion Attributes using Deep Evidential Regression | Wen Wu, Chao Zhang and Philip C. Woodland | N/A | N/A |
| WeCheck: Strong Factual Consistency Checker via Weakly Supervised Learning | Wenhao Wu, Wei Li, Xinyan Xiao, Jiachen Liu, Sujian Li and Yajuan Lyu | N/A | N/A |
| Self-Adaptive In-Context Learning: An Information Compression Perspective for In-Context Example Selection and Ordering | Zhiyong Wu, Yaoxiang Wang, Jiacheng Ye and Lingpeng Kong | N/A | N/A |
| Are Experts Needed? On Human Evaluation of Counselling Reflection Generation | Zixiu Wu, Simone Balloccu, Ehud Reiter, Rim Helaoui, Diego Reforgiato Recupero and Daniele Riboni | N/A | N/A |
| UniCoRN: Unified Cognitive Signal ReconstructioN bridging cognitive signals and human language | Nuwa Xi, Sendong Zhao, Haochun Wang, Chi Liu, Bing Qin and Ting Liu | N/A | N/A |
| Training Trajectories of Language Models Across Scales | Mengzhou Xia, Mikel Artetxe, Chunting Zhou, Xi Victoria Lin, Ramakanth Pasunuru, Danqi Chen, Luke Zettlemoyer and Veselin Stoyanov | N/A | N/A |
| Plug-and-Play Document Modules for Pre-trained Models | Chaojun Xiao, Zhengyan Zhang, Xu Han, Chi-Min Chan, Yankai Lin, Zhiyuan Liu, Xiangyang Li, Zhonghua Li, Zhao Cao and Maosong Sun | N/A | N/A |
| CFSum: A Coarse-to-Fine Contribution Network for Multimodal Summarization | Min Xiao, Junnan Zhu, Haitao Lin, Yu Zhou and Chengqing Zong | N/A | N/A |
| An Empirical Analysis of Parameter-Efficient Methods for Debiasing Pre-Trained Language Models | Zhongbin Xie and Thomas Lukasiewicz | N/A | N/A |
| Interpreting Positional Information in Perspective of Word Order | Zhang Xilong, Liu Ruochen, Liu Jin and Liang Xuefeng | N/A | N/A |
| Shrinking Embeddings for Hyper-Relational Knowledge Graphs | Bo Xiong, Mojtaba Nayyeri, Shirui Pan and Steffen Staab | N/A | N/A |
| Contrastive Novelty-Augmented Learning: Anticipating Outliers with Large Language Models | Albert Xu, Xiang Ren and Robin Jia | N/A | N/A |
| S2ynRE: Two-stage Self-training with Synthetic data for Low-resource Relation Extraction | Benfeng Xu, Quan Wang, Yajuan Lyu, Dai Dai, Yongdong Zhang and Zhendong Mao | N/A | N/A |
| CTC-based Non-autoregressive Speech Translation | Chen Xu, Xiaoqian Liu, Xiaowen Liu, Qingxuan Sun, Yuhao Zhang, Murun Yang, Qianqian Dong, Tom Ko, Mingxuan Wang, Tong Xiao, Anxiang Ma and Jingbo Zhu | N/A | N/A |
| Introducing Semantics into Speech Encoders | Derek Xu, Shuyan Dong, Changhan Wang, Suyoun Kim, Zhaojiang Lin, Bing Liu, Akshat Shrivastava, Shang-Wen Li, Liang-Hsuan Tseng, Guan-Ting Lin, Alexei Baevski, Hung-yi Lee, Yizhou Sun and Wei Wang | N/A | N/A |
| A Critical Evaluation of Evaluations for Long-form Question Answering | Fangyuan Xu, Yixiao Song, Mohit Iyyer and Eunsol Choi | N/A | N/A |
| A Universal Discriminator for Zero-Shot Generalization | Haike Xu, Zongyu Lin, Jing Zhou, Yanan Zheng and Zhilin Yang | N/A | N/A |
| Double-Branch Multi-Attention based Graph Neural Network for Knowledge Graph Completion | Hongcai Xu, Junpeng Bao and Wenbo Liu | N/A | N/A |
| Best-k Search Algorithm for Neural Text Generation | Jiacheng Xu, Caiming Xiong, Silvio Savarese and Yingbo Zhou | N/A | N/A |
| Can NLI Provide Proper Indirect Supervision for Low-resource Biomedical Relation Extraction? | Jiashu Xu, Mingyu Derek Ma and Muhao Chen | N/A | N/A |
| Learning New Skills after Deployment: Improving open-domain internet-driven dialogue with human feedback | Jing Xu, Megan Ung, Mojtaba Komeili, Kushal Arora, Y-Lan Boureau and Jason Weston | N/A | N/A |
| Towards Open-World Product Attribute Mining: A Lightly-Supervised Approach | Liyan Xu, Chenwei Zhang, Xian Li, Jingbo Shang and Jinho D. Choi | N/A | N/A |
| Enhancing Language Representation with Constructional Information for Natural Language Understanding | Lvxiaowei Xu, Jianwang Wu, Jiawei Peng, Zhilin Gong, Ming Cai and Tianxiang Wang | N/A | N/A |
| BERM: Training the Balanced and Extractable Representation for Matching to Improve Generalization Ability of Dense Retrieval | Shicheng Xu, Liang Pang, Huawei Shen and Xueqi Cheng | N/A | N/A |
| PeerDA: Data Augmentation via Modeling Peer Relation for Span Identification Tasks | Weiwen Xu, Xin Li, Yang Deng, Wai Lam and Lidong Bing | N/A | N/A |
| Counterfactual Debiasing for Fact Verification | Weizhi Xu, Qiang Liu, Shu Wu and Liang Wang | N/A | N/A |
| SESCORE2: Learning Text Generation Evaluation via Synthesizing Realistic Mistakes | Wenda Xu, Xian Qian, Mingxuan Wang, Lei Li and William Yang Wang | N/A | N/A |
| ManagerTower: Aggregating the Insights of Uni-Modal Experts for Vision-Language Representation Learning | Xiao Xu, Bei Li, Chenfei Wu, Shao-Yen Tseng, Anahita Bhiwandiwalla, Shachar Rosenman, Vasudev Lal, Wanxiang Che and Nan Duan | N/A | N/A |
| KILM: Knowledge Injection into Encoder-Decoder Language Models | Yan Xu, Mahdi Namazifar, Devamanyu Hazarika, Aishwarya Padmakumar, Yang Liu and Dilek Hakkani-Tur | N/A | N/A |
| Exploring and Verbalizing Academic Ideas by Concept Co-occurrence | Yi Xu, Shuqian Sheng, Bo Xue, Luoyi Fu, Xinbing Wang and Chenghu Zhou | N/A | N/A |
| Unsupervised Graph-Text Mutual Conversion with a Unified Pretrained Language Model | Yi Xu, Shuqian Sheng, Jiexing Qi, Luoyi Fu, Zhouhan Lin, Xinbing Wang and Chenghu Zhou | N/A | N/A |
| Hard Sample Aware Prompt-Tuning | Yuanjian Xu, Qi An, Jiahuan Zhang, Peng Li and Zaiqing Nie | N/A | N/A |
| MultiInstruct: Improving Multi-Modal Zero-Shot Learning via Instruction Tuning | Zhiyang Xu, Ying Shen and Lifu Huang | N/A | N/A |
| Constrained Tuple Extraction with Interaction-Aware Network | Xiaojun Xue, Chunxia Zhang, Tianxiang Xu and Zhendong Niu | N/A | N/A |
| Towards Identifying Fine-Grained Depression Symptoms from Memes | Shweta Yadav, Cornelia Caragea, Chenye Zhao, Naincy Kumari, Marvin Solberg and Tanmay Sharma | N/A | N/A |
| SLABERT Talk Pretty One Day: Modeling Second Language Acquisition with BERT | Aditya Yadavalli, Alekhya Yadavalli and Vera Tobin | N/A | N/A |
| GEC-DePenD: Non-Autoregressive Grammatical Error Correction with Decoupled Permutation and Decoding | Konstantin Yakovlev, Alexander Podolskiy, Andrey Bout, Sergey Nikolenko and Irina Piontkovskaya | N/A | N/A |
| Holographic CCG Parsing | Ryosuke Yamaki, Tadahiro Taniguchi and Daichi Mochihashi | N/A | N/A |
| Holistic Prediction on a Time-Evolving Attributed Graph | Shohei Yamasaki, Yuya Sasaki, Panagiotis Karras and Makoto Onizuka | N/A | N/A |
| UTC-IE: A Unified Token-pair Classification Architecture for Information Extraction | Hang Yan, Yu Sun, Xiaonan Li, Yunhua Zhou, Xuanjing Huang and Xipeng Qiu | N/A | N/A |
| Learning to Simulate Natural Language Feedback for Interactive Semantic Parsing | Hao Yan, Saurabh Srivastava, Yintao Tai, Sida I. Wang, Wen-tau Yih and Ziyu Yao | N/A | N/A |
| BITE: Textual Backdoor Attacks with Iterative Trigger Injection | Jun Yan, Vansh Gupta and Xiang Ren | N/A | N/A |
| BLEURT Has Universal Translations: An Analysis of Automatic Metrics by Minimum Risk Training | Yiming Yan, Tao Wang, Chengqi Zhao, Shujian Huang, Jiajun Chen and Mingxuan Wang | N/A | N/A |
| MultiCapCLIP: Auto-Encoding Prompts for Zero-Shot Multilingual Visual Captioning | Bang Yang, Fenglin Liu, Xian Wu, Yaowei Wang, Xu Sun and Yuexian Zou | N/A | N/A |
| Efficient Shapley Values Estimation by Amortization for Text Classification | Chenghao Yang, Fan Yin, He He, Kai-Wei Chang, Xiaofei Ma and Bing Xiang | N/A | N/A |
| Attractive Storyteller: Stylized Visual Storytelling with Unpaired Text | Dingyi Yang and Qin Jin | N/A | N/A |
| Learning Better Masking for Better Language Model Pre-training | Dongjie Yang, Zhuosheng Zhang and Hai Zhao | N/A | N/A |
| GanLM: Encoder-Decoder Pre-training with an Auxiliary Discriminator | Jian Yang, Shuming Ma, Li Dong, Shaohan Huang, Haoyang Huang, Yuwei Yin, Dongdong Zhang, Liqun Yang, Furu Wei and Zhoujun Li | N/A | N/A |
| ConFEDE: Contrastive Feature Decomposition for Multimodal Sentiment Analysis | Jiuding Yang, Yakun Yu, Di Niu, Weidong Guo and Yu Xu | N/A | N/A |
| DOC: Improving Long Story Coherence With Detailed Outline Control | Kevin Yang, Dan Klein, Nanyun Peng and Yuandong Tian | N/A | N/A |
| Fantastic Expressions and Where to Find Them: Chinese Simile Generation with Multiple Constraints | Kexin Yang, Dayiheng Liu, Wenqiang Lei, Baosong Yang, Xiangpeng Wei, Zhengyuan Liu and Jun Xie | N/A | N/A |
| Tailor: A Soft-Prompt-Based Approach to Attribute-Based Controlled Text Generation | Kexin Yang, Dayiheng Liu, Wenqiang Lei, Baosong Yang, Mingfeng Xue, Boxing Chen and Jun Xie | N/A | N/A |
| Measuring Consistency in Text-based Financial Forecasting Models | Linyi Yang, Yingpeng Ma and Yue Zhang | N/A | N/A |
| Local Interpretation of Transformer Based on Linear Decomposition | Sen Yang, Shujian Huang, Wei Zou, Jianbing Zhang, Xinyu Dai and Jiajun Chen | N/A | N/A |
| A New Dataset and Empirical Study for Sentence Simplification in Chinese | Shiping Yang, Renliang Sun and Xiaojun Wan | N/A | N/A |
| Unsupervised Discontinuous Constituency Parsing with Mildly Context-Sensitive Grammars | Songlin Yang, Roger Levy and Yoon Kim | N/A | N/A |
| Don’t Parse, Choose Spans! Continuous and Discontinuous Constituency Parsing via Autoregressive Span Selection | Songlin Yang and Kewei Tu | N/A | N/A |
| HistRED: A Historical Document-Level Relation Extraction Dataset | Soyoung Yang, Minseok Choi, Youngwoo Cho and Jaegul Choo | N/A | N/A |
| Prototype-Guided Pseudo Labeling for Semi-Supervised Text Classification | Weiyi Yang, Richong Zhang, Junfan Chen, Lihong Wang and Jaein Kim | N/A | N/A |
| Few-Shot Document-Level Event Argument Extraction | Xianjun Yang, Yujie Lu and Linda Petzold | N/A | N/A |
| Transforming Visual Scene Graphs to Image Captions | Xu Yang, Jiawei Peng, Zihua Wang, Haiyang Xu, Qinghao Ye, Chenliang Li, Songfang Huang, Fei Huang, Zhangzikang Li and Yu Zhang | N/A | N/A |
| An AMR-based Link Prediction Approach for Document-level Event Argument Extraction | Yuqing Yang, Qipeng Guo, Xiangkun Hu, Yue Zhang, Xipeng Qiu and Zheng Zhang | N/A | N/A |
| Gradient-based Intra-attention Pruning on Pre-trained Language Models | Ziqing Yang, Yiming Cui, Xin Yao and Shijin Wang | N/A | N/A |
| Are Human Explanations Always Helpful? Towards Objective Evaluation of Human Natural Language Explanations | Bingsheng Yao, Prithviraj Sen, Lucian Popa, James Hendler and Dakuo Wang | N/A | N/A |
| Modeling User Satisfaction Dynamics in Dialogue via Hawkes Process | Fanghua Ye, Zhiyuan Hu and Emine Yilmaz | N/A | N/A |
| Multi-Source Test-Time Adaptation as Dueling Bandits for Extractive Question Answering | Hai Ye, Qizhe Xie and Hwee Tou Ng | N/A | N/A |
| FiD-ICL: A Fusion-in-Decoder Approach for Efficient In-Context Learning | Qinyuan Ye, Iz Beltagy, Matthew Peters, Xiang Ren and Hannaneh Hajishirzi | N/A | N/A |
| CLAPSpeech: Learning Prosody from Text Context with Contrastive Language-Audio Pre-Training | Zhenhui Ye, Rongjie Huang, Yi Ren, Ziyue Jiang, Jinglin Liu, Jinzheng He, Xiang Yin and Zhou Zhao | N/A | N/A |
| How poor is the stimulus? Evaluating hierarchical generalization in neural networks trained on child-directed speech | Aditya Yedetore, Tal Linzen, Robert Frank and R. Thomas McCoy | N/A | N/A |
| Did You Read the Instructions? Rethinking the Effectiveness of Task Definitions in Instruction Learning | Fan Yin, Jesse Vig, Philippe Laban, Shafiq Joty, Caiming Xiong and Chien-Sheng Wu | N/A | N/A |
| Natural Language to Code Generation in Interactive Data Science Notebooks | Pengcheng Yin, Wen-Ding Li, Kefan Xiao, Abhishek Rao, Yeming Wen, Kensen Shi, Joshua Howland, Paige Bailey, Michele Catasta, Henryk Michalewski, Oleksandr Polozov and Charles Sutton | N/A | N/A |
| NUWA-XL: Diffusion over Diffusion for eXtremely Long Video Generation | Shengming Yin, Chenfei Wu, Huan Yang, Jianfeng Wang, Xiaodong Wang, Minheng Ni, Zhengyuan Yang, Linjie Li, Shuguang Liu, Fan Yang, Jianlong Fu, Ming Gong, Lijuan Wang, Zicheng Liu, Houqiang Li and Nan Duan | N/A | N/A |
| Consistency Regularization Training for Compositional Generalization | Yongjing Yin, Jiali Zeng, Yafu Li, Fandong Meng, Jie Zhou and Yue Zhang | N/A | N/A |
| BLOOM+1: Adding Language Support to BLOOM for Zero-Shot Prompting | Zheng Xin Yong, Hailey Schoelkopf, Niklas Muennighoff, Alham Fikri Aji, David Ifeoluwa Adelani, Khalid Almubarak, M Saiful Bari, Lintang Sutawika, Jungo Kasai, Ahmed Baruwa, Genta Winata, Stella Biderman, Edward Raff, Dragomir Radev and Vassilina Nikoulina | N/A | N/A |
| Rethinking Annotation: Can Language Learners Contribute? | Haneul Yoo, Rifki Afina Putri, Changyoon Lee, Youngin Lee, So-Yeon Ahn, Dongyeop Kang and Alice Oh | N/A | N/A |
| Robust Multi-bit Natural Language Watermarking through Invariant Features | KiYoon Yoo, Wonhyuk Ahn, Jiho Jang and Nojun Kwak | N/A | N/A |
| Towards standardizing Korean Grammatical Error Correction: Datasets and Annotation | Soyoung Yoon, Sungjoon Park, Gyuwan Kim, Junhee Cho, Kihyo Park, Gyu Tae Kim, Minjoon Seo and Alice Oh | N/A | N/A |
| Grounded Multimodal Named Entity Recognition on Social Media | Jianfei Yu, Ziyan Li, Jieming Wang and Rui Xia | N/A | N/A |
| Cross-Domain Data Augmentation with Domain-Adaptive Language Modeling for Aspect-Based Sentiment Analysis | Jianfei Yu, Qiankun Zhao and Rui Xia | N/A | N/A |
| Word sense extension | Lei Yu and Yang Xu | N/A | N/A |
| Personality Understanding of Fictional Characters during Book Reading | Mo Yu, Jiangnan Li, Shunyu Yao, Wenjie Pang, Xiaochen Zhou, Zhou Xiao, Fandong Meng and Jie Zhou | N/A | N/A |
| ALERT: Adapt Language Models to Reasoning Tasks | Ping Yu, Tianlu Wang, Olga Golovneva, Badr AlKhamissi, Siddharth Verma, Zhijing Jin, Gargi Ghosh, Mona Diab and Asli Celikyilmaz | N/A | N/A |
| Speech-Text Pre-training for Spoken Dialog Understanding with Explicit Cross-Modal Alignment | Tianshu Yu, Haoyu Gao, Ting-En Lin, Min Yang, Yuchuan Wu, Wentao Ma, Chao Wang, Fei Huang and Yongbin Li | N/A | N/A |
| Generating Hashtags for Short-form Videos with Guided Signals | Tiezheng Yu, Hanchao Yu, Davis Liang, Yuning Mao, Shaoliang Nie, Po-Yao Huang, Madian Khabsa, Pascale Fung and Yi-Chia Wang | N/A | N/A |
| CREPE: Open-Domain Question Answering with False Presuppositions | Xinyan Yu, Sewon Min, Luke Zettlemoyer and Hannaneh Hajishirzi | N/A | N/A |
| Cold-Start Data Selection for Better Few-shot Language Model Fine-tuning: A Prompt-based Uncertainty Propagation Approach | Yue Yu, Rongzhi Zhang, Ran Xu, Jieyu Zhang, Jiaming Shen and Chao Zhang | N/A | N/A |
| Augmentation-Adapted Retriever Improves Generalization of Language Models as Generic Plug-In | Zichun Yu, Chenyan Xiong, Shi Yu and Zhiyuan Liu | N/A | N/A |
| Discriminative Reasoning with Sparse Event Representation for Document-level Event-Event Relation Extraction | Changsen Yuan, Heyan Huang, Yixin Cao and Yonggang Wen | N/A | N/A |
| HyPe: Better Pre-trained Language Model Fine-tuning with Hidden Representation Perturbation | Hongyi Yuan, Zheng Yuan, Chuanqi Tan, Fei Huang and Songfang Huang | N/A | N/A |
| Distilling Script Knowledge from Large Language Models for Constrained Language Planning | Siyu Yuan, Jiangjie Chen, Ziquan Fu, Xuyang Ge, Soham Shah, Charles Jankowski, Yanghua Xiao and Deqing Yang | N/A | N/A |
| Causality-aware Concept Extraction based on Knowledge-guided Prompting | Siyu Yuan, Deqing Yang, Jinxi Liu, Shuyu Tian, Jiaqing Liang, Yanghua Xiao and Rui Xie | N/A | N/A |
| Synthetic Text Generation with Differential Privacy: A Simple and Practical Recipe | Xiang Yue, Huseyin Inan, Xuechen Li, Girish Kumar, Julia McAnallen, Hoda Shajari, Huan Sun, David Levitan and Robert Sim | N/A | N/A |
| MetaAdapt: Domain Adaptive Few-Shot Misinformation Detection via Meta Learning | Zhenrui Yue, Huimin Zeng, Yang Zhang, Lanyu Shang and Dong Wang | N/A | N/A |
| Zero- and Few-Shot Event Detection via Prompt-Based Meta Learning | Zhenrui Yue, Huimin Zeng, Mengfei Lan, Heng Ji and Dong Wang | N/A | N/A |
| Movie101: A New Movie Understanding Benchmark | Zihao Yue, Qi Zhang, Anwen Hu, Liang Zhang, Ziheng Wang and Qin Jin | N/A | N/A |
| Large Language Models Meet NL2Code: A Survey | Daoguang Zan, Bei Chen, Fengji Zhang, Dianjie Lu, Bingchao Wu, Bei Guan, Wang Yongji and Jian-Guang Lou | N/A | N/A |
| One Network, Many Masks: Towards More Parameter-Efficient Transfer Learning | Guangtao Zeng, Peiyuan Zhang and Wei Lu | N/A | N/A |
| Synthesize, Prompt and Transfer: Zero-shot Conversational Question Generation with Pre-trained Language Model | Hongwei Zeng, Bifan Wei, Jun Liu and Weiping Fu | N/A | N/A |
| Soft Language Clustering for Multilingual Model Pre-training | Jiali Zeng, Yufan Jiang, Yongjing Yin, Yi Jing, Fandong Meng, Binghuai Lin, Yunbo Cao and Jie Zhou | N/A | N/A |
| FutureTOD: Teaching Future Knowledge to Pre-trained Language Model for Task-Oriented Dialogue | Weihao Zeng, Keqing He, Yejie Wang, Chen Zeng, Jingang Wang, Yunsen Xian and Weiran Xu | N/A | N/A |
| Seen to Unseen: Exploring Compositional Generalization of Multi-Attribute Controllable Dialogue Generation | Weihao Zeng, Lulu Zhao, Keqing He, Ruotong Geng, Jingang Wang, Wei Wu and Weiran Xu | N/A | N/A |
| Cross-View Language Modeling: Towards Unified Cross-Lingual Cross-Modal Pre-training | Yan Zeng, Wangchunshu Zhou, Ao Luo, Ziming Cheng and Xinsong Zhang | N/A | N/A |
| Hints on the data for language modeling of synthetic languages with transformers | Rodolfo Zevallos and Nuria Bel | N/A | N/A |
| AlignScore: Evaluating Factual Consistency with A Unified Alignment Function | Yuheng Zha, Yichi Yang, Ruichen Li and Zhiting Hu | N/A | N/A |
| USSA: A Unified Table Filling Scheme for Structured Sentiment Analysis | Zepeng Zhai, Hao Chen, Ruifan Li and Xiaojie Wang | N/A | N/A |
| Contrastive Learning with Adversarial Examples for Alleviating Pathology of Language Model | Pengwei Zhan, Jing Yang, Xiao Huang, Chunlei Jing, Jingying Li and Liming Wang | N/A | N/A |
| Test-time Adaptation for Machine Translation Evaluation by Uncertainty Minimization | Runzhe Zhan, Xuebo Liu, Derek F. Wong, Cuilian Zhang, Lidia S. Chao and Min Zhang | N/A | N/A |
| Lifting the Curse of Capacity Gap in Distilling Language Models | Chen Zhang, Yang Yang, Jiahao Liu, Jingang Wang, Yunsen Xian, Benyou Wang and Dawei Song | N/A | N/A |
| DualGATs: Dual Graph Attention Networks for Emotion Recognition in Conversations | Duzhen Zhang, Feilong Chen and Xiuyi Chen | N/A | N/A |
| Dual Class Knowledge Propagation Network for Multi-label Few-shot Intent Detection | Feng Zhang, Wei Chen, Fei Ding and Tengjiao Wang | N/A | N/A |
| Understanding and Improving the Robustness of Terminology Constraints in Neural Machine Translation | Huaao Zhang, Qiang Wang, Bo Qin, Zelin Shi, Haibo Wang and Ming Chen | N/A | N/A |
| Reasoning over Hierarchical Question Decomposition Tree for Explainable Question Answering | Jiajie Zhang, Shulin Cao, Tingjian Zhang, Xin Lv, Juanzi Li, Lei Hou, Jiaxin Shi and Qi Tian | N/A | N/A |
| What Is Overlap Knowledge in Event Argument Extraction? APE: A Cross-datasets Transfer Learning Model for EAE | Kaihang Zhang, Kai Shuang, Xinyue Yang, Xuyang Yao and Jinyu Guo | N/A | N/A |
| Self-Edit: Fault-Aware Code Editor for Code Generation | Kechi Zhang, Zhuo Li, Jia Li, Ge Li and Zhi Jin | N/A | N/A |
| FC-KBQA: A Fine-to-Coarse Composition Framework for Knowledge Base Question Answering | Lingxi Zhang, Jing Zhang, Yanling Wang, Shulin Cao, Xinmei Huang, Cuiping Li, Hong Chen and Juanzi Li | N/A | N/A |
| A Needle in a Haystack: An Analysis of High-Agreement Workers on MTurk for Summarization | Lining Zhang, Simon Mille, Yufang Hou, Daniel Deutsch, Elizabeth Clark, Yixin Liu, Saad Mahamood, Sebastian Gehrmann, Miruna Clinciu, Khyathi Raghavi Chandu and João Sedoc | N/A | N/A |
| Span-level Aspect-based Sentiment Analysis via Table Filling | Mao Zhang, Yongxin Zhu, Zhen Liu, Zhimin Bao, Yunfei Wu, Xing Sun and Linli Xu | N/A | N/A |
| Learning Latent Relations for Temporal Knowledge Graph Reasoning | Mengqi Zhang, Yuwei Xia, Qiang Liu, Shu Wu and Liang Wang | N/A | N/A |
| Interpretable Math Word Problem Solution Generation via Step-by-step Planning | Mengxue Zhang, Zichao Wang, Zhichao Yang, Weiqi Feng and Andrew Lan | N/A | N/A |
| SafeConv: Explaining and Correcting Conversational Unsafe Behavior | Mian Zhang, Lifeng Jin, Linfeng Song, Haitao Mi, Wenliang Chen and Dong Yu | N/A | N/A |
| ESCOXLM-R: Multilingual Taxonomy-driven Pre-training for the Job Market Domain | Mike Zhang, Rob van der Goot and Barbara Plank | N/A | N/A |
| A Survey for Efficient Open Domain Question Answering | Qin Zhang, Shangsi Chen, Dongkuan Xu, Qingqing Cao, Xiaojun Chen, Trevor Cohn and Meng Fang | N/A | N/A |
| A Novel Table-to-Graph Generation Approach for Document-Level Joint Entity and Relation Extraction | Ruoyu Zhang, Yanzeng Li and Lei Zou | N/A | N/A |
| MixCE: Training Autoregressive Language Models by Mixing Forward and Reverse Cross-Entropies | Shiyue Zhang, Shijie Wu, Ozan Irsoy, Steven Lu, Mohit Bansal, Mark Dredze and David Rosenberg | N/A | N/A |
| Extractive is not Faithful: An Investigation of Broad Unfaithfulness Problems in Extractive Summarization | Shiyue Zhang, David Wan and Mohit Bansal | N/A | N/A |
| Towards Understanding and Improving Knowledge Distillation for Neural Machine Translation | Songming Zhang, Yunlong Liang, Shuaibo Wang, Yufeng Chen, Wenjuan Han, Jian Liu and Jinan Xu | N/A | N/A |
| Federated Learning for Semantic Parsing: Task Formulation, Evaluation Setup, New Algorithms | Tianshu Zhang, Changchang Liu, Wei-Han Lee, Yu Su and Huan Sun | N/A | N/A |
| Bridging The Gap: Entailment Fused-T5 for Open-retrieval Conversational Machine Reading Comprehension | Xiao Zhang, Heyan Huang, Zewen Chi and Xian-Ling Mao | N/A | N/A |
| A Cross-Modality Context Fusion and Semantic Refinement Network for Emotion Recognition in Conversation | Xiaoheng Zhang and Yang Li | N/A | N/A |
| MIL-Decoding: Detoxifying Language Models at Token-Level via Multiple Instance Learning | Xu Zhang and Xiaojun Wan | N/A | N/A |
| Continual Knowledge Distillation for Neural Machine Translation | Yuanchi Zhang, Peng Li, Maosong Sun and Yang Liu | N/A | N/A |
| VLN-Trans: Translator for the Vision and Language Navigation Agent | Yue Zhang and Parisa Kordjamshidi | N/A | N/A |
| XSemPLR: Cross-Lingual Semantic Parsing in Multiple Natural Languages and Meaning Representations | Yusen Zhang, Jun Wang, Zhiguo Wang and Rui Zhang | N/A | N/A |
| Plug-and-Play Knowledge Injection for Pre-trained Language Models | Zhengyan Zhang, Zhiyuan Zeng, Yankai Lin, Huadong Wang, Deming Ye, Chaojun Xiao, Xu Han, Zhiyuan Liu, Peng Li, Maosong Sun and Jie Zhou | N/A | N/A |
| Dialog-Post: Multi-Level Self-Supervised Objectives and Hierarchical Model for Dialogue Post-Training | Zhenyu Zhang, Lei Shen, Yuming Zhao, Meng Chen and Xiaodong He | N/A | N/A |
| ETHICIST: Targeted Training Data Extraction Through Loss Smoothed Soft Prompting and Calibrated Confidence Estimation | Zhexin Zhang, Jiaxin Wen and Minlie Huang | N/A | N/A |
| Fine-tuning Happens in Tiny Subspaces: Exploring Intrinsic Task-specific Subspaces of Pre-trained Language Models | Zhong Zhang, Bang Liu and Junming Shao | N/A | N/A |
| FEDLEGAL: The First Real-World Federated Learning Benchmark for Legal NLP | Zhuo Zhang, Xiangjing Hu, Jingyuan Zhang, Yating Zhang, Hui Wang, Lizhen Qu and Zenglin Xu | N/A | N/A |
| C-STANCE: A Large Dataset for Chinese Zero-Shot Stance Detection | Chenye Zhao, Yingjie Li and Cornelia Caragea | N/A | N/A |
| Infusing Hierarchical Guidance into Prompt Tuning: A Parameter-Efficient Framework for Multi-level Implicit Discourse Relation Recognition | Haodong Zhao, Ruifang He, Mengnan Xiao and Jing Xu | N/A | N/A |
| CHBias: Bias Evaluation and Mitigation of Chinese Conversational Language Models | Jiaxu Zhao, Meng Fang, Zijing Shi, Yitong Li, Ling Chen and Mykola Pechenizkiy | N/A | N/A |
| RE-Matching: A Fine-Grained Semantic Matching Method for Zero-Shot Relation Extraction | Jun Zhao, WenYu Zhan, Xin Zhao, Qi Zhang, Tao Gui, Zhongyu Wei, Junzhe Wang, Minlong Peng and Mingming Sun | N/A | N/A |
| Open Set Relation Extraction via Unknown-Aware Training | Jun Zhao, Xin Zhao, WenYu Zhan, Qi Zhang, Tao Gui, Zhongyu Wei, Yun Wen Chen, Xiang Gao and Xuanjing Huang | N/A | N/A |
| Actively Supervised Clustering for Open Relation Extraction | Jun Zhao, Yongxin Zhang, Qi Zhang, Tao Gui, Zhongyu Wei, Minlong Peng and Mingming Sun | N/A | N/A |
| Evaluating Open-Domain Dialogues in Latent Space with Next Sentence Prediction and Mutual Information | Kun Zhao, Bohao Yang, Chenghua Lin, Wenge Rong, Aline Villavicencio and Xiaohui Cui | N/A | N/A |
| Verify-and-Edit: A Knowledge-Enhanced Chain-of-Thought Framework | Ruochen Zhao, Xingxuan Li, Shafiq Joty, Chengwei Qin and Lidong Bing | N/A | N/A |
| Abductive Commonsense Reasoning Exploiting Mutually Exclusive Explanations | Wenting Zhao, Justin Chiu, Claire Cardie and Alexander Rush | N/A | N/A |
| Improving Continual Relation Extraction by Distinguishing Analogous Semantics | Wenzheng Zhao, Yuanning Cui and Wei Hu | N/A | N/A |
| Pre-trained Language Models Can be Fully Zero-Shot Learners | Xuandong Zhao, Siqi Ouyang, Zhiguo Yu, Ming Wu and Lei Li | N/A | N/A |
| RobuT: A Systematic Study of Table QA Robustness Against Human-Annotated Adversarial Perturbations | Yilun Zhao, Chen Zhao, Linyong Nan, Zhenting Qi, Wenlin Zhang, Xiangru Tang, Boyu Mi and Dragomir Radev | N/A | N/A |
| Generating Visual Spatial Description via Holistic 3D Scene Understanding | Yu Zhao, Hao Fei, Wei Ji, Jianguo Wei, Meishan Zhang, Min Zhang and Tat-Seng Chua | N/A | N/A |
| Incorporating Attribution Importance for Improving Faithfulness Metrics | Zhixue Zhao and Nikolaos Aletras | N/A | N/A |
| Knowledgeable Parameter Efficient Tuning Network for Commonsense Question Answering | Ziwang Zhao, Linmei Hu, Hanyu Zhao, Yingxia Shao and Yequan Wang | N/A | N/A |
| An Invariant Learning Characterization of Controlled Text Generation | Carolina Zheng, Claudia Shi, Keyon Vafa, Amir Feder and David Blei | N/A | N/A |
| Rethinking Multimodal Entity and Relation Extraction from a Translation Point of View | Changmeng Zheng, Junhao Feng, Yi Cai, Xiaoyong Wei and Qing Li | N/A | N/A |
| Preserving Commonsense Knowledge from Pre-trained Language Models via Causal Inference | Junhao Zheng, Qianli Ma, Shengjie Qiu, Yue Wu, Peitian Ma, Junlong Liu, Huawen Feng, Xichen Shang and Haibin Chen | N/A | N/A |
| Generating Structured Pseudo Labels for Noise-resistant Zero-shot Video Sentence Localization | Minghang Zheng, Shaogang Gong, Hailin Jin, Yuxin Peng and Yang Liu | N/A | N/A |
| IM-TQA: A Chinese Table Question Answering Dataset with Implicit and Multi-type Table Structures | Mingyu Zheng, Yang Hao, Wenbin Jiang, Zheng Lin, Yajuan Lyu, QiaoQiao She and Weiping Wang | N/A | N/A |
| Contextual Knowledge Learning for Dialogue Generation | Wen Zheng, Natasa Milic-Frayling and Ke Zhou | N/A | N/A |
| A Facial Expression-Aware Multimodal Multi-task Learning Framework for Emotion Recognition in Multi-party Conversations | Wenjie Zheng, Jianfei Yu, Rui Xia and Shijin Wang | N/A | N/A |
| Robust Representation Learning with Reliable Pseudo-labels Generation via Self-Adaptive Optimal Transport for Short Text Clustering | Xiaolin Zheng, Mengling Hu, Weiming Liu, Chaochao Chen and Xinting Liao | N/A | N/A |
| Jointprop: Joint Semi-supervised Learning for Entity and Relation Extraction with Heterogeneous Graph-based Propagation | Yandan Zheng, Anran Hao and Anh Tuan Luu | N/A | N/A |
| NatLogAttack: A Framework for Attacking Natural Language Inference Models with Natural Logic | Zi’ou Zheng and Xiaodan Zhu | N/A | N/A |
| Revisiting Token Dropping Strategy in Efficient BERT Pretraining | Qihuang Zhong, Liang Ding, Juhua Liu, Xuebo Liu, Min Zhang, Bo Du and Dacheng Tao | N/A | N/A |
| Causal-Debias: Unifying Debiasing in Pretrained Language Models and Fine-tuning via Causal Invariant Learning | Fan Zhou, Yuzhou Mao, Liu Yu, Yi Yang and Ting Zhong | N/A | N/A |
| CLCL: Non-compositional Expression Detection with Contrastive Learning and Curriculum Learning | Jianing Zhou, Ziheng Zeng and Suma Bhat | N/A | N/A |
| CASE: Aligning Coarse-to-Fine Cognition and Affection for Empathetic Response Generation | Jinfeng Zhou, Chujie Zheng, Bo Wang, Zheng Zhang and Minlie Huang | N/A | N/A |
| Facilitating Multi-turn Emotional Support Conversation with Positive Emotion Elicitation: A Reinforcement Learning Approach | Jinfeng Zhou, Zhuang Chen, Bo Wang and Minlie Huang | N/A | N/A |
| SimOAP: Improve Coherence and Consistency in Persona-based Dialogue Generation via Over-sampling and Post-evaluation | Junkai Zhou, Liang Pang, Huawei Shen and Xueqi Cheng | N/A | N/A |
| I Cast Detect Thoughts: Learning to Converse and Guide with Intents and Theory-of-Mind in Dungeons and Dragons | Pei Zhou, Andrew Zhu, Jennifer Hu, Jay Pujara, Xiang Ren, Chris Callison-Burch, Yejin Choi and Prithviraj Ammanabrolu | N/A | N/A |
| Bridging the Gap between Decision and Logits in Decision-based Knowledge Distillation for Pre-trained Language Models | Qinhong Zhou, Zonghan Yang, Peng Li and Yang Liu | N/A | N/A |
| Improving Self-training for Cross-lingual Named Entity Recognition with Contrastive and Prototype Learning | Ran Zhou, Xin Li, Lidong Bing, Erik Cambria and Chunyan Miao | N/A | N/A |
| Continual Contrastive Finetuning Improves Low-Resource Relation Extraction | Wenxuan Zhou, Sheng Zhang, Tristan Naumann, Muhao Chen and Hoifung Poon | N/A | N/A |
| CMOT: Cross-modal Mixup via Optimal Transport for Speech Translation | Yan Zhou, Qingkai Fang and Yang Feng | N/A | N/A |
| FLamE: Few-shot Learning from Natural Language Explanations | Yangqiaoyu Zhou, Yiming Zhang and Chenhao Tan | N/A | N/A |
| Non-Sequential Graph Script Induction via Multimedia Grounding | Yu Zhou, Sha Li, Manling Li, Xudong Lin, Shih-Fu Chang, Mohit Bansal and Heng Ji | N/A | N/A |
| Two Birds One Stone: Dynamic Ensemble for OOD Intent Classification | Yunhua Zhou, Jianqiang Yang, Pengyu Wang and Xipeng Qiu | N/A | N/A |
| A Probabilistic Framework for Discovering New Intents | Yunhua Zhou, Guofeng Quan and Xipeng Qiu | N/A | N/A |
| FIREBALL: A Dataset of Dungeons and Dragons Actual-Play with Structured Game State Information | Andrew Zhu, Karmanya Aggarwal, Alexander Feng, Lara Martin and Chris Callison-Burch | N/A | N/A |
| Weaker Than You Think: A Critical Look at Weakly Supervised Learning | Dawei Zhu, Xiaoyu Shen, Marius Mosbach, Andreas Stephan and Dietrich Klakow | N/A | N/A |
| Neural Machine Translation Methods for Translating Text to Sign Language Glosses | Dele Zhu, Vera Czehmann and Eleftherios Avramidis | N/A | N/A |
| HiTIN: Hierarchy-aware Tree Isomorphism Network for Hierarchical Text Classification | He Zhu, Chong Zhang, Junjie Huang, Junran Wu and Ke Xu | N/A | N/A |
| PAED: Zero-Shot Persona Attribute Extraction in Dialogues | Luyao Zhu, Wei Li, Rui Mao, Vlad Pandelea and Erik Cambria | N/A | N/A |
| Annotating and Detecting Fine-grained Factual Errors for Dialogue Summarization | Rongxin Zhu, Jianzhong Qi and Jey Han Lau | N/A | N/A |
| PEIT: Bridging the Modality Gap with Pre-trained Models for End-to-End Image Translation | Shaolin Zhu, Shangjie Li, Yikun Lei and Deyi Xiong | N/A | N/A |
| INK: Injecting kNN Knowledge in Nearest Neighbor Machine Translation | Wenhao Zhu, Jingjing Xu, Shujian Huang, Lingpeng Kong and Jiajun Chen | N/A | N/A |
| Solving Math Word Problems via Cooperative Reasoning induced Language Models | Xinyu Zhu, Junjie Wang, Lin Zhang, Yuxiang Zhang, Yongfeng Huang, Ruyi Gan, Jiaxing Zhang and Yujiu Yang | N/A | N/A |
| StoryTrans: Non-Parallel Story Author-Style Transfer with Discourse Representations and Content Enhancing | Xuekai Zhu, Jian Guan, Minlie Huang and Juan Liu | N/A | N/A |
| Pretrained Bidirectional Distillation for Machine Translation | Yimeng Zhuang and Mei Tu | N/A | N/A |
| WhitenedCSE: Whitening-based Contrastive Learning of Sentence Embeddings | Wenjie Zhuo, Yifan Sun, Xiaohan Wang, Linchao Zhu and Yi Yang | N/A | N/A |
| Modeling Appropriate Language in Argumentation | Timon Ziegenbein, Shahbaz Syed, Felix Lange, Martin Potthast and Henning Wachsmuth | N/A | N/A |
| NormBank: A Knowledge Bank of Situational Social Norms | Caleb Ziems, Jane Dwivedi-Yu, Yi-Chia Wang, Alon Halevy and Diyi Yang | N/A | N/A |
| Multi-VALUE: A Framework for Cross-Dialectal English NLP | Caleb Ziems, William Held, Jingfeng Yang, Jwala Dhamala, Rahul Gupta and Diyi Yang | N/A | N/A |
| Towards Understanding Omission in Dialogue Summarization | Yicheng Zou, Kaitao Song, Xu Tan, Zhongkai Fu, Qi Zhang, Dongsheng Li and Tao Gui | N/A | N/A |
| Tokenization and the Noiseless Channel | Vilém Zouhar, Clara Meister, Juan Gastaldi, Li Du, Mrinmaya Sachan and Ryan Cotterell | N/A | N/A |
| Soft Alignment Objectives for Robust Adaptation of Language Generation | Michal Štefánik, Marek Kadlcik and Petr Sojka | N/A | N/A |
| LM-CPPF: Paraphrasing-Guided Data Augmentation for Contrastive Prompt-Based Few-Shot Fine-Tuning | Amirhossein Abaskohi, Sascha Rothe and Yadollah Yaghoobzadeh | N/A | N/A |
| The Mechanical Bard: An Interpretable Machine Learning Approach to Shakespearean Sonnet Generation | Edwin Agnew, Michelle Qiu, Lily Zhu, Sam Wiseman and Cynthia Rudin | N/A | N/A |
| Learning Neuro-Symbolic World Models with Conversational Proprioception | Don Joven Agravante, Daiki Kimura, Michiaki Tatsubori, Asim Munawar and Alexander Gray | N/A | N/A |
| TeCS: A Dataset and Benchmark for Tense Consistency of Machine Translation | Yiming Ai, Zhiwei He, Kai Yu and Rui Wang | N/A | N/A |
| Is Anisotropy Truly Harmful? A Case Study on Text Clustering | Mira Ait-Saada and Mohamed Nadif | N/A | N/A |
| The Role of Global and Local Context in Named Entity Recognition | Arthur Amalvy, Vincent Labatut and Richard Dufour | N/A | N/A |
| Hexatagging: Projective Dependency Parsing as Tagging | Afra Amini, Tianyu Liu and Ryan Cotterell | N/A | N/A |
| Nichelle and Nancy: The Influence of Demographic Attributes and Tokenization Length on First Name Biases | Haozhe An and Rachel Rudinger | N/A | N/A |
| Split-NER: Named Entity Recognition via Two Question-Answering-based Classifications | Jatin Arora and Youngja Park | N/A | N/A |
| Faithfulness Tests for Natural Language Explanations | Pepa Atanasova, Oana-Maria Camburu, Christina Lioma, Thomas Lukasiewicz, Jakob Grue Simonsen and Isabelle Augenstein | N/A | N/A |
| A Simple and Effective Framework for Strict Zero-Shot Hierarchical Classification | Rohan Bhambhoria, Lei Chen and Xiaodan Zhu | N/A | N/A |
| Decomposed scoring of CCG dependencies | Aditya Bhargava and Gerald Penn | N/A | N/A |
| Efficient Diagnosis Assignment Using Unstructured Clinical Notes | Louis Blankemeier, Jason Fries, Robert Tinn, Joseph Preston, Nigam Shah and Akshay Chaudhari | N/A | N/A |
| An Open Dataset and Model for Language Identification | Laurie Burchell, Alexandra Birch, Nikolay Bogoychev and Kenneth Heafield | N/A | N/A |
| Evaluating Zero-Shot Event Structures: Recommendations for Automatic Content Extraction (ACE) Annotations | Erica Cai and Brendan O’Connor | N/A | N/A |
| Graph Propagation based Data Augmentation for Named Entity Recognition | Jiong Cai, Shen Huang, Yong Jiang, Zeqi Tan, Pengjun Xie and Kewei Tu | N/A | N/A |
| Substitution-based Semantic Change Detection using Contextual Embeddings | Dallas Card | N/A | N/A |
| XL-LEXEME: WiC Pretrained Model for Cross-Lingual LEXical sEMantic changE | Pierluigi Cassotti, Lucia Siciliani, Marco DeGemmis, Giovanni Semeraro and Pierpaolo Basile | N/A | N/A |
| Controllable Mixed-Initiative Dialogue Generation through Prompting | Maximillian Chen, Xiao Yu, Weiyan Shi, Urvi Awasthi and Zhou Yu | N/A | N/A |
| xSIM++: An Improved Proxy to Bitext Mining Performance for Low-Resource Languages | Mingda Chen, Kevin Heffernan, Onur Çelebi, Alexandre Mourachko and Holger Schwenk | N/A | N/A |
| Toward Expanding the Scope of Radiology Report Summarization to Multiple Anatomies and Modalities | Zhihong Chen, Maya Varma, Xiang Wan, Curtis Langlotz and Jean-Benoit Delbrouck | N/A | N/A |
| Text-to-SQL Error Correction with Language Models of Code | Ziru Chen, Shijie Chen, Michael White, Raymond Mooney, Ali Payani, Jayanth Srinivasa, Yu Su and Huan Sun | N/A | N/A |
| Task-Aware Specialization for Efficient and Robust Dense Retrieval for Open-Domain Question Answering | Hao Cheng, Hao Fang, Xiaodong Liu and Jianfeng Gao | N/A | N/A |
| PLUE: Language Understanding Evaluation Benchmark for Privacy Policies in English | Jianfeng Chi, Wasi Uddin Ahmad, Yuan Tian and Kai-Wei Chang | N/A | N/A |
| Latent Positional Information is in the Self-Attention Variance of Transformer Language Models Without Positional Embeddings | Ta-Chung Chi, Ting-Han Fan, Li-Wei Chen, Alexander Rudnicky and Peter Ramadge | N/A | N/A |
| Should you marginalize over possible tokenizations? | Nadezhda Chirkova, Germán Kruszewski, Jos Rozen and Marc Dymetman | N/A | N/A |
| Leveraging Prefix Transfer for Multi-Intent Text Revision | Ruining Chong, Cunliang Kong, Liu Wu, Zhenghao Liu, Ziye Jin, Liner Yang, Yange Fan, Hanghang Fan and Erhong Yang | N/A | N/A |
| Black-box language model explanation by context length probing | Ondřej Cífka and Antoine Liutkus | N/A | N/A |
| Scaling in Cognitive Modelling: a Multilingual Approach to Human Reading Times | Andrea Gregor de Varda and Marco Marelli | N/A | N/A |
| Zero-shot Cross-lingual Transfer With Learned Projections Using Unlabeled Target-Language Data | Ujan Deb, Ridayesh Parab and Preethi Jyothi | N/A | N/A |
| Context-Aware Transformer Pre-Training for Answer Sentence Selection | Luca Di Liello, Siddhant Garg and Alessandro Moschitti | N/A | N/A |
| When to Use Efficient Self Attention? Profiling Text, Speech and Image Transformer Variants | Anuj Diwan, Eunsol Choi and David Harwath | N/A | N/A |
| Improving Factuality of Abstractive Summarization without Sacrificing Summary Quality | Tanay Dixit, Fei Wang and Muhao Chen | N/A | N/A |
| Surface-Based Retrieval Reduces Perplexity of Retrieval-Augmented Language Models | Ehsan Doostmohammadi, Tobias Norlund, Marco Kuhlmann and Richard Johansson | N/A | N/A |
| Improving Gender Fairness of Pre-Trained Language Models without Catastrophic Forgetting | Zahra Fatemi, Chen Xing, Wenhao Liu and Caiming Xiong | N/A | N/A |
| Reasoning Implicit Sentiment with Chain-of-Thought Prompting | Hao Fei, Bobo Li, Qian Liu, Lidong Bing, Fei Li and Tat-Seng Chua | N/A | N/A |
| Using contradictions improves question answering systems | Etienne Fortier-Dubois and Domenic Rosati | N/A | N/A |
| Mind the Gap between the Application Track and the Real World | Ananya Ganesh, Jie Cao, E. Margaret Perkoff, Rosy Southwell, Martha Palmer and Katharina Kann | N/A | N/A |
| Analyzing Text Representations by Measuring Task Alignment | Cesar Gonzalez-Gutierrez, Audi Primadhanty, Francesco Cazzaro and Ariadna Quattoni | N/A | N/A |
| Morphological Inflection with Phonological Features | David Guriel, Omer Goldman and Reut Tsarfaty | N/A | N/A |
| Detoxifying Text with MaRCo: Controllable Revision with Experts and Anti-Experts | Skyler Hallinan, Alisa Liu, Yejin Choi and Maarten Sap | N/A | N/A |
| Modality Adaption or Regularization? A Case Study on End-to-End Speech Translation | Yuchen Han, Chen Xu, Tong Xiao and Jingbo Zhu | N/A | N/A |
| Ellipsis-Dependent Reasoning: a New Challenge for Large Language Models | Daniel Hardt | N/A | N/A |
| Characterization of Stigmatizing Language in Medical Records | Keith Harrigian, Ayah Zirikly, Brant Chee, Alya Ahmad, Anne R. Links, Somnath Saha, Mary Catherine Beach and Mark Dredze | N/A | N/A |
| BUCA: A Binary Classification Approach to Unsupervised Commonsense Question Answering | Jie He, Simon U, Victor Gutierrez-Basulto and Jeff Pan | N/A | N/A |
| ChatGPT for Zero-shot Dialogue State Tracking: A Solution or an Opportunity? | Michael Heck, Nurul Lubis, Benjamin Ruppik, Renato Vukovic, Shutong Feng, Christian Geishauser, Hsien-chin Lin, Carel van Niekerk and Milica Gasic | N/A | N/A |
| Contrastive Bootstrapping for Label Refinement | Shudi Hou, Yu Xia, Muhao Chen and Sujian Li | N/A | N/A |
| Multimodal Relation Extraction with Cross-Modal Retrieval and Synthesis | Xuming Hu, Zhijiang Guo, Zhiyang Teng, Irwin King and Philip S. Yu | N/A | N/A |
| MultiTool-CoT: GPT-3 Can Use Multiple External Tools with Chain of Thought Prompting | Tatsuro Inaba, Hirokazu Kiyomaru, Fei Cheng and Sadao Kurohashi | N/A | N/A |
| KNOW How to Make Up Your Mind! Adversarially Detecting and Alleviating Inconsistencies in Natural Language Explanations | Myeongjun Jang, Bodhisattwa Prasad Majumder, Julian McAuley, Thomas Lukasiewicz and Oana-Maria Camburu | N/A | N/A |
| An (unhelpful) guide to selecting the best ASR architecture for your under-resourced language | Robert Jimerson, Zoey Liu and Emily Prud’hommeaux | N/A | N/A |
| Discourse-Level Representations can Improve Prediction of Degree of Anxiety | Swanie Juhng, Matthew Matero, Vasudha Varadarajan, Johannes C. Eichstaedt, Adithya V Ganesan and H. Andrew Schwartz | N/A | N/A |
| Bring More Attention to Syntactic Symmetry for Automatic Postediting of High-Quality Machine Translations | Baikjin Jung, Myungji Lee, Jong-Hyeok Lee and Yunsu Kim | N/A | N/A |
| Table and Image Generation for Investigating Knowledge of Entities in Pre-trained Vision and Language Models | Hidetaka Kamigaito, Katsuhiko Hayashi and Taro Watanabe | N/A | N/A |
| Stop Pre-Training: Adapt Visual-Language Models to Unseen Languages | Yasmine Karoui, Rémi Lebret, Negar Foroutan Eghlidi and Karl Aberer | N/A | N/A |
| A Better Way to Do Masked Language Model Scoring | Carina Kauf and Anna Ivanova | N/A | N/A |
| Tracing Linguistic Markers of Influence in a Large Online Organisation | Prashant Khare, Ravi Shekhar, Mladen Karan, Stephen McQuistin, Colin Perkins, Ignacio Castro, Gareth Tyson, Patrick Healey and Matthew Purver | N/A | N/A |
| Transformed Protoform Reconstruction | Young Min Kim, Kalvin Chang, Chenxuan Cui and David R. Mortensen | N/A | N/A |
| Probing Physical Reasoning with Counter-Commonsense Context | Kazushi Kondo, Saku Sugawara and Akiko Aizawa | N/A | N/A |
| Do Models Really Learn to Follow Instructions? An Empirical Study of Instruction Tuning | Po-Nien Kung and Nanyun Peng | N/A | N/A |
| S3HQA: A Three-Stage Approach for Multi-hop Text-Table Hybrid Question Answering | Fangyu Lei, Xiang Li, Yifan Wei, Shizhu He, Yiming Huang, Jun Zhao and Kang Liu | N/A | N/A |
| NarrowBERT: Accelerating Masked Language Model Pretraining and Inference | Haoxin Li, Phillip Keung, Daniel Cheng, Jungo Kasai and Noah A. Smith | N/A | N/A |
| HiPool: Modeling Long Documents Using Graph Neural Networks | Irene Li, Aosong Feng, Dragomir Radev and Rex Ying | N/A | N/A |
| How Well Apply Simple MLP to Incomplete Utterance Rewriting? | Jiang Li, Xiangdong Su, Xinlan Ma and Guanglai Gao | N/A | N/A |
| Counterfactual reasoning: Testing language models’ understanding of hypothetical scenarios | Jiaxuan Li, Lang Yu and Allyson Ettinger | N/A | N/A |
| Prefix Propagation: Parameter-Efficient Tuning for Long Sequences | Jonathan Li, Will Aitken, Rohan Bhambhoria and Xiaodan Zhu | N/A | N/A |
| Diversity-Aware Coherence Loss for Improving Neural Topic Models | Raymond Li, Felipe Gonzalez-Pizarro, Linzi Xing, Gabriel Murray and Giuseppe Carenini | N/A | N/A |
| AutoConv: Automatically Generating Information-seeking Conversations with Large Language Models | Siheng Li, Cheng Yang, Yichun Yin, Xinyu Zhu, Zesen Cheng, Lifeng Shang, Xin Jiang, Qun Liu and Yujiu Yang | N/A | N/A |
| Metaphor Detection via Explicit Basic Meanings Modelling | Yucheng Li, Shun Wang, Chenghua Lin and Frank Guerin | N/A | N/A |
| Uncertainty-Aware Bootstrap Learning for Joint Extraction on Distantly-Supervised Data | Yufei Li, Xiao Yu, Yanchi Liu, Haifeng Chen and Cong Liu | N/A | N/A |
| LI-RAGE: Late Interaction Retrieval Augmented Generation with Explicit Signals for Open-Domain Table Question Answering | Weizhe Lin, Rexhina Blloshmi, Bill Byrne, Adria de Gispert and Gonzalo Iglesias | N/A | N/A |
| Linear Classifier: An Often-Forgotten Baseline for Text Classification | Yu-Chen Lin, Si-An Chen, Jie-Jyun Liu and Chih-Jen Lin | N/A | N/A |
| Are Sample-Efficient NLP Models More Robust? | Nelson F. Liu, Ananya Kumar, Percy Liang and Robin Jia | N/A | N/A |
| BOLT: Fast Energy-based Controlled Text Generation with Tunable Biases | Xin Liu, Muhammad Khalifa and Lu Wang | N/A | N/A |
| MolXPT: Wrapping Molecules with Text for Generative Pre-training | Zequn Liu, Wei Zhang, Yingce Xia, Lijun Wu, Shufang Xie, Tao Qin, Ming Zhang and Tie-Yan Liu | N/A | N/A |
| TwistList: Resources and Baselines for Tongue Twister Generation | Tyler Loakman, Chen Tang and Chenghua Lin | N/A | N/A |
| Improving Grammar-based Sequence-to-Sequence Modeling with Decomposition and Constraints | Chao Lou and Kewei Tu | N/A | N/A |
| Event Extraction as Question Generation and Answering | Di Lu, Shihao Ran, Joel Tetreault and Alejandro Jaimes | N/A | N/A |
| A Study on the Efficiency and Generalization of Light Hybrid Retrievers | Man Luo, Shashank Jain, Anchit Gupta, Arash Einolghozati, Barlas Oguz, Debojeet Chatterjee, Xilun Chen, Chitta Baral and Peyman Heidari | N/A | N/A |
| Parameter-efficient Weight Ensembling Facilitates Task-level Knowledge Transfer | Xingtai Lv, Ning Ding, Yujia Qin, Zhiyuan Liu and Maosong Sun | N/A | N/A |
| Focused Prefix Tuning for Controllable Text Generation | Congda Ma, Tianyu Zhao, Makoto Shing, Kei Sawada and Manabu Okumura | N/A | N/A |
| Improving Syntactic Probing Correctness and Robustness with Control Tasks | Weicheng Ma, Brian Wang, Hefan Zhang, Lili Wang, Rolando Coto-Solano, Saeed Hassanpour and Soroush Vosoughi | N/A | N/A |
| Bhasa-Abhijnaanam: Native-script and romanized Language Identification for 22 Indic languages | Yash Madhani, Mitesh M. Khapra and Anoop Kunchukuttan | N/A | N/A |
| Dataset Distillation with Attention Labels for Fine-tuning BERT | Aru Maekawa, Naoki Kobayashi, Kotaro Funakoshi and Manabu Okumura | N/A | N/A |
| Teaching Small Language Models to Reason | Lucie Charlotte Magister, Jonathan Mallinson, Jakub Adamek, Eric Malmi and Aliaksei Severyn | N/A | N/A |
| UniTRec: A Unified Text-to-Text Transformer and Joint Contrastive Learning Framework for Text-based Recommendation | Zhiming Mao, Huimin Wang, Yiming Du and Kam-Fai Wong | N/A | N/A |
| Exploring the Impact of Layer Normalization for Zero-shot Neural Machine Translation | Zhuoyuan Mao, Raj Dabre, Qianying Liu, Haiyue Song, Chenhui Chu and Sadao Kurohashi | N/A | N/A |
| AMRs Assemble! Learning to Ensemble with Autoregressive Models for AMR Parsing | Abelardo Carlos Martínez Lorenzo, Pere-Lluís Huguet Cabot and Roberto Navigli | N/A | N/A |
| Theory-Grounded Computational Text Analysis | Arya D. McCarthy and Giovanna Maria Dora Dore | N/A | N/A |
| A Natural Bias for Language Generation Models | Clara Meister, Wojciech Stokowiec, Tiago Pimentel, Lei Yu, Laura Rimell and Adhiguna Kuncoro | N/A | N/A |
| Trading Syntax Trees for Wordpieces: Target-oriented Opinion Words Extraction with Wordpieces and Aspect Enhancement | Samuel Mensah, Kai Sun and Nikolaos Aletras | N/A | N/A |
| Deep Active Learning for Morphophonological Processing | Seyed Morteza Mirbostani, Yasaman Boreshban, Salam Khalifa, SeyedAbolghasem Mirroshandel and Owen Rambow | N/A | N/A |
| mOKB6: A Multilingual Open Knowledge Base Completion Benchmark | Shubham Mittal, Keshav Kolluru, Soumen Chakrabarti and Mausam | N/A | N/A |
| MetaVL: Transferring In-Context Learning Ability From Language Models to Vision-Language Models | Masoud Monajatipoor, Liunian Harold Li, Mozhdeh Rouhsedaghat, Lin Yang and Kai-Wei Chang | N/A | N/A |
| Enhancing Event Causality Identification with Counterfactual Reasoning | Feiteng Mu and Wenjie Li | N/A | N/A |
| Grokking of Hierarchical Structure in Vanilla Transformers | Shikhar Murty, Pratyusha Sharma, Jacob Andreas and Christopher Manning | N/A | N/A |
| Considerations for meaningful sign language machine translation based on glosses | Mathias Müller, Zifan Jiang, Amit Moryossef, Annette Rios and Sarah Ebling | N/A | N/A |
| Simple Augmentations of Logical Rules for Neuro-Symbolic Knowledge Graph Completion | Ananjan Nandi, Navdeep Kaur, Parag Singla and Mausam | N/A | N/A |
| Class based Influence Functions for Error Detection | Thang Nguyen-Duc, Hoang Thanh-Tung, Quan Hung Tran, Dang Huu-Tien, Hieu Nguyen, Anh T. V. Dau and Nghi Bui | N/A | N/A |
| A Fast Algorithm for Computing Prefix Probabilities | Franz Nowak and Ryan Cotterell | N/A | N/A |
| Self-Distilled Quantization: Achieving High Compression Rates in Transformer-Based Language Models | James O’Neill and Sourav Dutta | N/A | N/A |
| The Ecological Fallacy in Annotation: Modeling Human Label Variation goes beyond Sociodemographics | Matthias Orlikowski, Paul Röttger, Philipp Cimiano and Dirk Hovy | N/A | N/A |
| Controlling the Extraction of Memorized Data from Large Language Models via Prompt-Tuning | Mustafa Ozdayi, Charith Peris, Jack FitzGerald, Christophe Dupuy, Jimit Majmudar, Haidar Khan, Rahil Parikh and Rahul Gupta | N/A | N/A |
| Token-Level Self-Evolution Training for Sequence-to-Sequence Learning | Keqin Peng, Liang Ding, Qihuang Zhong, Yuanxin Ouyang, Wenge Rong, Zhang Xiong and Dacheng Tao | N/A | N/A |
| Credible without Credit: Domain Experts Assess Generative Language Models | Denis Peskoff and Brandon Stewart | N/A | N/A |
| STT4SG-350: A Speech Corpus for All Swiss German Dialect Regions | Michel Plüss, Jan Deriu, Yanick Schraner, Claudio Paonessa, Julia Hartmann, Larissa Schmidt, Christian Scheller, Manuela Hürlimann, Tanja Samardžić, Manfred Vogel and Mark Cieliebak | N/A | N/A |
| Unsupervised Subtitle Segmentation with Masked Language Models | David Ponce, Thierry Etchegoyhen and Victor Ruiz | N/A | N/A |
| Multi-Document Summarization with Centroid-Based Pretraining | Ratish Surendran Puduppully, Parag Jain, Nancy Chen and Mark Steedman | N/A | N/A |
| Covering Uncommon Ground: Gap-Focused Question Generation for Answer Assessment | Roni Rabin, Alexandre Djerbetian, Roee Engelberg, Lidan Hackmon, Gal Elidan, Reut Tsarfaty and Amir Globerson | N/A | N/A |
| Improving Generalization in Language Model-based Text-to-SQL Semantic Parsing: Two Simple Semantic Boundary-based Techniques | Daking Rai, Bailin Wang, Yilun Zhou and Ziyu Yao | N/A | N/A |
| Do GPTs Produce Less Literal Translations? | Vikas Raunak, Arul Menezes, Matt Post and Hany Hassan | N/A | N/A |
| MIReAD: Simple Method for Learning High-quality Representations from Scientific Documents | Anastasiia Razdaibiedina and Aleksandr Brechalov | N/A | N/A |
| The Inside Story: Towards Better Understanding of Machine Translation Neural Evaluation Metrics | Ricardo Rei, Nuno M. Guerreiro, Marcos Treviso, Luisa Coheur, Alon Lavie and André Martins | N/A | N/A |
| Randomized Positional Encodings Boost Length Generalization of Transformers | Anian Ruoss, Grégoire Delétang, Tim Genewein, Jordi Grau-Moya, Róbert Csordás, Mehdi Bennani, Shane Legg and Joel Veness | N/A | N/A |
| RAMP: Retrieval and Attribute-Marking Enhanced Prompting for Attribute-Controlled Translation | Gabriele Sarti, Phu Mon Htut, Xing Niu, Benjamin Hsu, Anna Currey, Georgiana Dinu and Maria Nadejde | N/A | N/A |
| ACTC: Active Threshold Calibration for Cold-Start Knowledge Graph Completion | Anastasiia Sedova and Benjamin Roth | N/A | N/A |
| The Tail Wagging the Dog: Dataset Construction Biases of Social Bias Benchmarks | Nikil Selvam, Sunipa Dev, Daniel Khashabi, Tushar Khot and Kai-Wei Chang | N/A | N/A |
| Summarizing, Simplifying, and Synthesizing Medical Evidence using GPT-3 (with Varying Success) | Chantal Shaib, Millicent Li, Sebastian Joseph, Iain Marshall, Junyi Jessy Li and Byron Wallace | N/A | N/A |
| Class-Incremental Learning based on Label Generation | Yijia Shao, Yiduo Guo, Dongyan Zhao and Bing Liu | N/A | N/A |
| ScoNe: Benchmarking Negation Reasoning in Language Models With Fine-Tuning and In-Context Learning | Jingyuan S. She, Christopher Potts, Samuel R. Bowman and Atticus Geiger | N/A | N/A |
| NollySenti: Leveraging Transfer Learning and Machine Translation for Nigerian Movie Sentiment Classification | Iyanuoluwa Shode, David Ifeoluwa Adelani, Jing Peng and Anna Feldman | N/A | N/A |
| Detecting Contradictory COVID-19 Drug Efficacy Claims from Biomedical Literature | Daniel Sosa, Malavika Suresh, Christopher Potts and Russ Altman | N/A | N/A |
| Joint End-to-end Semantic Proto-role Labeling | Elizabeth Spaulding, Gary Kazantsev and Mark Dredze | N/A | N/A |
| Environmental Claim Detection | Dominik Stammbach, Nicolas Webersinke, Julia Bingler, Mathias Kraus and Markus Leippold | N/A | N/A |
| With a Little Push, NLI Models can Robustly and Efficiently Predict Faithfulness | Julius Steen, Juri Opitz, Anette Frank and Katja Markert | N/A | N/A |
| Modular Visual Question Answering via Code Generation | Sanjay Subramanian, Medhini Narasimhan, Kushal Khangaonkar, Kevin Yang, Arsha Nagrani, Cordelia Schmid, Andy Zeng, Trevor Darrell and Dan Klein | N/A | N/A |
| Balancing Lexical and Semantic Quality in Abstractive Summarization | Jeewoo Sul and Yong Suk Choi | N/A | N/A |
| Towards Fewer Hallucinations in Knowledge-Grounded Dialogue Generation via Augmentative and Contrastive Knowledge-Dialogue | Bin Sun, Yitong Li, Fei Mi, Fanhu Bie, Yiwei Li and Kan Li | N/A | N/A |
| Measuring the Effect of Influential Messages on Varying Personas | Chenkai Sun, Jinning Li, Hou Pong Chan, ChengXiang Zhai and Heng Ji | N/A | N/A |
| Are Pre-trained Language Models Useful for Model Ensemble in Chinese Grammatical Error Correction? | Chenming Tang, Xiuyu Wu and Yunfang Wu | N/A | N/A |
| Bootstrapping Neural Relation and Explanation Classifiers | Zheng Tang and Mihai Surdeanu | N/A | N/A |
| Typo-Robust Representation Learning for Dense Retrieval | Panuthep Tasawong, Wuttikorn Ponwitayarat, Peerat Limkonchotiwat, Can Udomcharoenchaikit, Ekapol Chuangsuwanich and Sarana Nutanong | N/A | N/A |
| Language Models Get a Gender Makeover: Mitigating Gender Bias with Few-Shot Data Interventions | Himanshu Thakur, Atishay Jain, Praneetha Vaddamanu, Paul Pu Liang and Louis-Philippe Morency | N/A | N/A |
| Deriving Language Models from Masked Language Models | Lucas Torroba Hennigen and Yoon Kim | N/A | N/A |
| Evaluating pragmatic abilities of image captioners on A3DS | Polina Tsvilodub and Michael Franke | N/A | N/A |
| On the Interpretability and Significance of Bias Metrics in Texts: a PMI-based Approach | Francisco Valentini, Germán Rosati, Damián Blasi, Diego Fernandez Slezak and Edgar Altszyler | N/A | N/A |
| Abstractive Summarizers are Excellent Extractive Summarizers | Daniel Varab and Yumo Xu | N/A | N/A |
| Evaluating Paraphrastic Robustness in Textual Entailment Models | Dhruv Verma, Yash Kumar Lal, Shreyashee Sinha, Benjamin Van Durme and Adam Poliak | N/A | N/A |
| Improving Automatic Quotation Attribution in Literary Novels | Krishnapriya Vishnubhotla, Frank Rudzicz, Graeme Hirst and Adam Hammond | N/A | N/A |
| Going Beyond Sentence Embeddings: A Token-Level Matching Algorithm for Calculating Semantic Textual Similarity | Hongwei Wang and Dong Yu | N/A | N/A |
| MOSPC: MOS Prediction Based on Pairwise Comparison | Kexin Wang, Yunlong Zhao, Qianqian Dong, Tom Ko and Mingxuan Wang | N/A | N/A |
| The Art of Prompting: Event Detection based on Type Specific Prompts | Sijia Wang, Mo Yu and Lifu Huang | N/A | N/A |
| Let Me Check the Examples: Enhancing Demonstration Learning via Explicit Imitation | Sirui Wang, Kaiwen Wei, Hongzhi Zhang, Yuntao Li and Wei Wu | N/A | N/A |
| Learning Multi-Step Reasoning by Solving Arithmetic Tasks | Tianduo Wang and Wei Lu | N/A | N/A |
| How to Distill your BERT: An Empirical Study on the Impact of Weight Initialisation and Distillation Objectives | Xinpeng Wang, Leonie Weissweiler, Hinrich Schütze and Barbara Plank | N/A | N/A |
| Zero-Shot and Few-Shot Stance Detection on Varied Topics via Conditional Generation | Haoyang Wen and Alexander Hauptmann | N/A | N/A |
| A Holistic Approach to Reference-Free Evaluation of Machine Translation | Hanming Wu, Wenjuan Han, Hui Di, Yufeng Chen and Jinan Xu | N/A | N/A |
| Listener Model for the PhotoBook Referential Game with CLIPScores as Implicit Reference Chain | Shih-Lun Wu, Yi-Hui Chou and Liangze Li | N/A | N/A |
| Debiasing Generative Named Entity Recognition by Calibrating Sequence Likelihood | Yu Xia, Yongwei Zhao, Wenhao Wu and Sujian Li | N/A | N/A |
| mPMR: A Multilingual Pre-trained Machine Reader at Scale | Weiwen Xu, Xin Li, Wai Lam and Lidong Bing | N/A | N/A |
| Exploring Continual Learning for Code Generation Models | Prateek Yadav, Qing Sun, Hantian Ding, Xiaopeng Li, Dejiao Zhang, Ming Tan, Parminder Bhatia, Xiaofei Ma, Ramesh Nallapati, Murali Krishna Ramanathan, Mohit Bansal and Bing Xiang | N/A | N/A |
| An Embarrassingly Easy but Strong Baseline for Nested Named Entity Recognition | Hang Yan, Yu Sun, Xiaonan Li and Xipeng Qiu | N/A | N/A |
| In and Out-of-Domain Text Adversarial Robustness via Label Smoothing | Yahan Yang, Soham Dan, Dan Roth and Insup Lee | N/A | N/A |
| A Weakly Supervised Classifier and Dataset of White Supremacist Language | Michael Yoder, Ahmad Diab, David Brown and Kathleen Carley | N/A | N/A |
| Gradient Ascent Post-training Enhances Language Model Generalization | Dongkeun Yoon, Joel Jang, Sungdong Kim and Minjoon Seo | N/A | N/A |
| Back to Patterns: Efficient Japanese Morphological Analysis with Feature-Sequence Trie | Naoki Yoshinaga | N/A | N/A |
| Target-Based Offensive Language Identification | Marcos Zampieri, Skye Morgan, Kai North, Tharindu Ranasinghe, Austin Simmons, Paridhi Khandelwal, Sara Rosenthal and Preslav Nakov | N/A | N/A |
| COGEN: Abductive Commonsense Language Generation | Rohola Zandie, Diwanshu Shekhar and Mohammad Mahoor | N/A | N/A |
| ReAugKD: Retrieval-Augmented Knowledge Distillation For Pre-trained Language Models | Jianyi Zhang, Aashiq Muhamed, Aditya Anantharaman, Guoyin Wang, Changyou Chen, Kai Zhong, Qingjun Cui, Yi Xu, Belinda Zeng, Trishul Chilimbi and Yiran Chen | N/A | N/A |
| A Simple Concatenation can Effectively Improve Speech Translation | Linlin Zhang, Kai Fan, Boxing Chen and Luo Si | N/A | N/A |
| Understanding Demonstration-based Learning from a Causal Perspective | Ruiyi Zhang and Tong Yu | N/A | N/A |
| Towards Adaptive Prefix Tuning for Parameter-Efficient Language Model Fine-tuning | Zhen-Ru Zhang, Chuanqi Tan, Haiyang Xu, Chengyu Wang, Jun Huang and Songfang Huang | N/A | N/A |
| Revisiting Automated Prompting: Are We Actually Doing Better? | Yulin Zhou, Yiren Zhao, Ilia Shumailov, Robert Mullins and Yarin Gal | N/A | N/A |
| Robust Learning for Multi-party Addressee Recognition with Discrete Addressee Codebook | Pengcheng Zhu, Wei Zhou, Kuncai Zhang, Yuankai Ma and Haiqing Chen | N/A | N/A |
ACL 2024
| Title | Author | PDF_Link | Code_URL |
|---|---|---|---|
| Quantized Side Tuning: Fast and Memory-Efficient Tuning of Quantized Large Language Models | Zhengxin Zhang, Dan Zhao, Xupeng Miao, Gabriele Oliaro, Zhihao Zhang, Qing Li, Yong Jiang, Zhihao Jia | N/A | N/A |
| Unsupervised Multimodal Clustering for Semantics Discovery in Multimodal Utterances | Hanlei Zhang, Hua Xu, Fei Long, Xin Wang, Kai Gao | N/A | N/A |
| MAGE: Machine-generated Text Detection in the Wild | Yafu Li, Qintong Li, Leyang Cui, Wei Bi, Zhilin Wang, Longyue Wang, Linyi Yang, Shuming Shi, Yue Zhang | N/A | N/A |
| PrivLM-Bench: A Multi-level Privacy Evaluation Benchmark for Language Models | Haoran Li, Dadi Guo, Donghao Li, Wei Fan, Qi Hu, Xin Liu, Chunkit Chan, Duanyi Yao, Yuan Yao, Yangqiu Song | N/A | N/A |
| GenTranslate: Large Language Models are Generative Multilingual Speech and Machine Translators | Yuchen Hu, Chen Chen, Chao-Han Huck Yang, Ruizhe Li, Dong Zhang, Zhehuai Chen, EngSiong Chng | N/A | N/A |
| Exploring Chain-of-Thought for Multi-modal Metaphor Detection | Yanzhi Xu, Yueying Hua, Shichen Li, Zhongqing Wang | N/A | N/A |
| BitDistiller: Unleashing the Potential of Sub-4-Bit LLMs via Self-Distillation | DaYou Du, Yijia Zhang, Shijie Cao, Jiaqi Guo, Ting Cao, Xiaowen Chu, Ningyi Xu | N/A | N/A |
| A Unified Temporal Knowledge Graph Reasoning Model Towards Interpolation and Extrapolation | Kai Chen, Ye Wang, Yitong Li, Aiping Li, Han Yu, Xin Song | N/A | N/A |
| Unsupervised Information Refinement Training of Large Language Models for Retrieval-Augmented Generation | Shicheng Xu, Liang Pang, Mo Yu, Fandong Meng, Huawei Shen, Xueqi Cheng, Jie Zhou | N/A | N/A |
| CSCD-NS: a Chinese Spelling Check Dataset for Native Speakers | Yong Hu, Fandong Meng, Jie Zhou | N/A | N/A |
| Evaluating Dynamic Topic Models | Charu Karakkaparambil James, Mayank Nagda, Nooshin Haji Ghassemi, Marius Kloft, Sophie Fellenz | N/A | N/A |
| How Abilities in Large Language Models are Affected by Supervised Fine-tuning Data Composition | Guanting Dong, Hongyi Yuan, Keming Lu, Chengpeng Li, Mingfeng Xue, Dayiheng Liu, Wei Wang, Zheng Yuan, Chang Zhou, Jingren Zhou | N/A | N/A |
| Through the Lens of Split Vote: Exploring Disagreement, Difficulty and Calibration in Legal Case Outcome Classification | Shanshan Xu, Santosh T.Y.S.S, Oana Ichim, Barbara Plank, Matthias Grabmair | N/A | N/A |
| Inference to the Best Explanation in Large Language Models | Dhairya Dalal, Marco Valentino, Andre Freitas, Paul Buitelaar | N/A | N/A |
| A Novel Cartography-Based Curriculum Learning Method Applied on RoNLI: The First Romanian Natural Language Inference Corpus | Eduard Poesina, Cornelia Caragea, Radu Tudor Ionescu | N/A | N/A |
| DeVAn: Dense Video Annotation for Video-Language Models | Tingkai Liu, Yunzhe Tao, Haogeng Liu, Qihang Fang, Ding Zhou, Huaibo Huang, Ran He, Hongxia Yang | N/A | N/A |
| MinPrompt: Graph-based Minimal Prompt Data Augmentation for Few-shot Question Answering | Xiusi Chen, Jyun-Yu Jiang, Wei-Cheng Chang, Cho-Jui Hsieh, Hsiang-Fu Yu, Wei Wang | N/A | N/A |
| SportsMetrics: Blending Text and Numerical Data to Understand Information Fusion in LLMs | Yebowen Hu, Kaiqiang Song, Sangwoo Cho, Xiaoyang Wang, Hassan Foroosh, Dong Yu, Fei Liu | N/A | N/A |
| SciMON: Scientific Inspiration Machines Optimized for Novelty | Qingyun Wang, Doug Downey, Heng Ji, Tom Hope | N/A | N/A |
| Expedited Training of Visual Conditioned Language Generation via Redundancy Reduction | Yiren Jian, Tingkai Liu, Yunzhe Tao, Chunhui Zhang, Soroush Vosoughi, Hongxia Yang | N/A | N/A |
| Confidence Under the Hood: An Investigation into the Confidence-Probability Alignment in Large Language Models | Abhishek Kumar, Robert Morabito, Sanzhar Umbet, Jad Kabbara, Ali Emami | N/A | N/A |
| Retrieval-Augmented Multilingual Knowledge Editing | Weixuan Wang, Barry Haddow, Alexandra Birch | N/A | N/A |
| Picturing Ambiguity: A Visual Twist on the Winograd Schema Challenge | Brendan Park, Madeline Janecek, Naser Ezzati-Jivan, Yifeng Li, Ali Emami | N/A | N/A |
| Subtle Biases Need Subtler Measures: Dual Metrics for Evaluating Representative and Affinity Bias in Large Language Models | Abhishek Kumar, Sarfaroz Yunusov, Ali Emami | N/A | N/A |
| Framing in the Presence of Supporting Data: A Case Study in U.S. Economic News | Alexandria Leto, Elliot E. Pickens, Coen D. Needell, David Rothschild, Maria Leonor Pacheco | N/A | N/A |
| Mementos: A Comprehensive Benchmark for Multimodal Large Language Model Reasoning over Image Sequences | Xiyao Wang, Yuhang Zhou, Xiaoyu Liu, Hongjin Lu, Yuancheng Xu, Feihong He, Jaehong Yoon, Taixi Lu, Fuxiao Liu, Gedas Bertasius, Mohit Bansal, Huaxiu Yao, Furong Huang | N/A | N/A |
| TTM-RE: Memory-Augmented Document-Level Relation Extraction | Chufan Gao, Xuan Wang, Jimeng Sun | N/A | N/A |
| Answer is All You Need: Instruction-following Text Embedding via Answering the Question | Letian Peng, Yuwei Zhang, Zilong Wang, Jayanth Srinivasa, Gaowen Liu, Zihan Wang, Jingbo Shang | N/A | N/A |
| Explore Spurious Correlations at the Concept Level in Language Models for Text Classification | Yuhang Zhou, Paiheng Xu, Xiaoyu Liu, Bang An, Wei Ai, Furong Huang | N/A | N/A |
| Every Answer Matters: Evaluating Commonsense with Probabilistic Measures | Qi Cheng, Michael Boratko, Pranay Kumar Yelugam, Tim O’Gorman, Nalini Singh, Andrew McCallum, Xiang Lorraine Li | N/A | N/A |
| GradSafe: Detecting Jailbreak Prompts for LLMs via Safety-Critical Gradient Analysis | Yueqi Xie, Minghong Fang, Renjie Pi, Neil Zhenqiang Gong | N/A | N/A |
| How Johnny Can Persuade LLMs to Jailbreak Them: Rethinking Persuasion to Challenge AI Safety by Humanizing LLMs | Yi Zeng, Hongpeng Lin, Jingwen Zhang, Diyi Yang, Ruoxi Jia, Weiyan Shi | N/A | N/A |
| Pouring Your Heart Out: Investigating the Role of Figurative Language in Online Expressions of Empathy | Gyeongeun Lee, Christina Wong, Meghan Guo, Natalie Parde | N/A | N/A |
| An Information-Theoretic Approach to Analyze NLP Classification Tasks | Luran Wang, Mark Gales, Vatsal Raina | N/A | N/A |
| Can Your Model Tell a Negation from an Implicature? Unravelling Challenges With Intent Encoders | Yuwei Zhang, Siffi Singh, Sailik Sengupta, Igor Shalyminov, Hang Su, Hwanjun Song, Saab Mansour | N/A | N/A |
| Wav2Gloss: Generating Interlinear Glossed Text from Speech | Taiqi He, Kwanghee Choi, Lindia Tjuatja, Nathaniel Romney Robinson, Jiatong Shi, Shinji Watanabe, Graham Neubig, David R Mortensen, Lori Levin | N/A | N/A |
| Leveraging Codebook Knowledge with NLI and ChatGPT for Zero-Shot Political Relation Classification | Yibo Hu, Erick Skorupa Parolin, Latifur Khan, Patrick Brandt, Javier Osorio, Vito D’Orazio | N/A | N/A |
| SPOR: A Comprehensive and Practical Evaluation Method for Compositional Generalization in Data-to-Text Generation | Ziyao Xu, Houfeng Wang | N/A | N/A |
| OPEx: A Component-Wise Analysis of LLM-Centric Agents in Embodied Instruction Following | Haochen Shi, Zhiyuan Sun, Xingdi Yuan, Marc-Alexandre Côté, Bang Liu | N/A | N/A |
| Multimodal Instruction Tuning with Conditional Mixture of LoRA | Ying Shen, Zhiyang Xu, Qifan Wang, Yu Cheng, Wenpeng Yin, Lifu Huang | N/A | N/A |
| DocLens: Multi-aspect Fine-grained Medical Text Evaluation | Yiqing Xie, Sheng Zhang, Hao Cheng, Pengfei Liu, Zelalem Gero, Cliff Wong, Tristan Naumann, Hoifung Poon, Carolyn Rose | N/A | N/A |
| FOFO: A Benchmark to Evaluate LLMs’ Format-Following Capability | Congying Xia, Chen Xing, Jiangshu Du, Xinyi Yang, Yihao Feng, Ran Xu, Wenpeng Yin, Caiming Xiong | N/A | N/A |
| Hyper-CL: Conditioning Sentence Representations with Hypernetworks | Young Hyun Yoo, Jii Cha, Changhyeon Kim, Taeuk Kim | N/A | N/A |
| Analysis of Multi-Source Language Training in Cross-Lingual Transfer | Seonghoon Lim, Taejun Yun, Jinhyeon Kim, Jihun Choi, Taeuk Kim | N/A | N/A |
| ABEX: Data Augmentation for Low-Resource NLU via Expanding Abstract Descriptions | Sreyan Ghosh, Utkarsh Tyagi, Sonal Kumar, Chandra Kiran Reddy Evuru, Ramaneswaran S, S Sakshi, Dinesh Manocha | N/A | N/A |
| The Belebele Benchmark: a Parallel Reading Comprehension Dataset in 122 Language Variants | Lucas Bandarkar, Davis Liang, Benjamin Muller, Mikel Artetxe, Satya Narayan Shukla, Donald Husa, Naman Goyal, Abhinandan Krishnan, Luke Zettlemoyer, Madian Khabsa | N/A | N/A |
| Learn from Failure: Fine-tuning LLMs with Trial-and-Error Data for Intuitionistic Propositional Logic Proving | Chenyang An, Zhibo Chen, Qihao Ye, Emily First, Letian Peng, Jiayun Zhang, Zihan Wang, Sorin Lerner, Jingbo Shang | N/A | N/A |
| Interactive Text-to-Image Retrieval with Large Language Models: A Plug-and-Play Approach | Saehyung Lee, Sangwon Yu, Junsung Park, Jihun Yi, Sungroh Yoon | N/A | N/A |
| IMBUE: Improving Interpersonal Effectiveness through Simulation and Just-in-time Feedback with Human-Language Model Interaction | Inna Wanyin Lin, Ashish Sharma, Christopher Michael Rytting, Adam S Miner, Jina Suh, Tim Althoff | N/A | N/A |
| Token-wise Influential Training Data Retrieval for Large Language Models | Huawei Lin, Jikai Long, Zhaozhuo Xu, Weijie Zhao | N/A | N/A |
| Tree-of-Counterfactual Prompting for Zero-Shot Stance Detection | Maxwell Weinzierl, Sanda Harabagiu | N/A | N/A |
| VisualWebArena: Evaluating Multimodal Agents on Realistic Visual Web Tasks | Jing Yu Koh, Robert Lo, Lawrence Jang, Vikram Duvvur, Ming Chong Lim, Po-Yu Huang, Graham Neubig, Shuyan Zhou, Ruslan Salakhutdinov, Daniel Fried | N/A | N/A |
| FineSurE: Fine-grained Summarization Evaluation using LLMs | Hwanjun Song, Hang Su, Igor Shalyminov, Jason Cai, Saab Mansour | N/A | N/A |
| Tuning Large Multimodal Models for Videos using Reinforcement Learning from AI Feedback | Daechul Ahn, Yura Choi, Youngjae Yu, Dongyeop Kang, Jonghyun Choi | N/A | N/A |
| Prompt Refinement with Image Pivot for Text-to-Image Generation | Jingtao Zhan, Qingyao Ai, Yiqun Liu, Yingwei Pan, Ting Yao, Jiaxin Mao, Shaoping Ma, Tao Mei | N/A | N/A |
| The Heuristic Core: Understanding Subnetwork Generalization in Pretrained Language Models | Adithya Bhaskar, Dan Friedman, Danqi Chen | N/A | N/A |
| Striking Gold in Advertising: Standardization and Exploration of Ad Text Generation | Masato Mita, Soichiro Murakami, Akihiko Kato, Peinan Zhang | N/A | N/A |
| AbsInstruct: Eliciting Abstraction Ability from LLMs through Explanation Tuning with Plausibility Estimation | Zhaowei Wang, Wei Fan, Qing Zong, Hongming Zhang, Sehyun Choi, Tianqing Fang, Xin Liu, Yangqiu Song, Ginny Wong, Simon See | N/A | N/A |
| Reflect-RL: Two-Player Online RL Fine-Tuning for LMs | Runlong Zhou, Simon Shaolei Du, Beibin Li | N/A | N/A |
| Can ChatGPT’s Performance be Improved on Verb Metaphor Detection Tasks? Bootstrapping and Combining Tacit Knowledge | Cheng Yang, Puli Chen, Qingbao Huang | N/A | N/A |
| Self-Distillation Bridges Distribution Gap in Language Model Fine-Tuning | Zhaorui Yang, Tianyu Pang, Haozhe Feng, Han Wang, Wei Chen, Minfeng Zhu, Qian Liu | N/A | N/A |
| An Information Bottleneck Perspective for Effective Noise Filtering on Retrieval-Augmented Generation | Kun Zhu, Xiaocheng Feng, Xiyuan Du, Yuxuan Gu, Weijiang Yu, Haotian Wang, Qianglong Chen, Zheng Chu, Jingchang Chen, Bing Qin | N/A | N/A |
| RORA: Robust Free-Text Rationale Evaluation | Zhengping Jiang, Yining Lu, Hanjie Chen, Daniel Khashabi, Benjamin Van Durme, Anqi Liu | N/A | N/A |
| Tell Me More! Towards Implicit User Intention Understanding of Language Model Driven Agents | Cheng Qian, Bingxiang He, Zhong Zhuang, Jia Deng, Yujia Qin, Xin Cong, Zhong Zhang, Jie Zhou, Yankai Lin, Zhiyuan Liu, Maosong Sun | N/A | N/A |
| Multimodal ArXiv: A Dataset for Improving Scientific Comprehension of Large Vision-Language Models | Lei Li, Yuqi Wang, Runxin Xu, Peiyi Wang, Xiachong Feng, Lingpeng Kong, Qi Liu | N/A | N/A |
| L-Eval: Instituting Standardized Evaluation for Long Context Language Models | Chenxin An, Shansan Gong, Ming Zhong, Xingjian Zhao, Mukai Li, Jun Zhang, Lingpeng Kong, Xipeng Qiu | N/A | N/A |
| DIALECTBENCH: An NLP Benchmark for Dialects, Varieties, and Closely-Related Languages | Fahim Faisal, Orevaoghene Ahia, Aarohi Srivastava, Kabir Ahuja, David Chiang, Yulia Tsvetkov, Antonios Anastasopoulos | N/A | N/A |
| InstructProtein: Aligning Human and Protein Language via Knowledge Instruction | Zeyuan Wang, Qiang Zhang, Keyan Ding, Ming Qin, Xiang Zhuang, Xiaotong Li, Huajun Chen | N/A | N/A |
| Causal-Guided Active Learning for Debiasing Large Language Models | Zhouhao Sun, Li Du, Xiao Ding, Yixuan Ma, Yang Zhao, Kaitao Qiu, Ting Liu, Bing Qin | N/A | N/A |
| ConSiDERS-The-Human Evaluation Framework: Rethinking Human Evaluation for Generative Large Language Models | Aparna Elangovan, Ling Liu, Lei Xu, Sravan Babu Bodapati, Dan Roth | N/A | N/A |
| Linguistically Conditioned Semantic Textual Similarity | Jingxuan Tu, Keer Xu, Liulu Yue, Bingyang Ye, Kyeongmin Rim, James Pustejovsky | N/A | N/A |
| Navigate through Enigmatic Labyrinth A Survey of Chain of Thought Reasoning: Advances, Frontiers and Future | Zheng Chu, Jingchang Chen, Qianglong Chen, Weijiang Yu, Tao He, Haotian Wang, Weihua Peng, Ming Liu, Bing Qin, Ting Liu | N/A | N/A |
| TimeBench: A Comprehensive Evaluation of Temporal Reasoning Abilities in Large Language Models | Zheng Chu, Jingchang Chen, Qianglong Chen, Weijiang Yu, Haotian Wang, Ming Liu, Bing Qin | N/A | N/A |
| BeamAggR: Beam Aggregation Reasoning over Multi-source Knowledge for Multi-hop Question Answering | Zheng Chu, Jingchang Chen, Qianglong Chen, Haotian Wang, Kun Zhu, Xiyuan Du, Weijiang Yu, Ming Liu, Bing Qin | N/A | N/A |
| ANALOGYKB: Unlocking Analogical Reasoning of Language Models with A Million-scale Knowledge Base | Siyu Yuan, Jiangjie Chen, Changzhi Sun, Jiaqing Liang, Yanghua Xiao, Deqing Yang | N/A | N/A |
| TaSL: Continual Dialog State Tracking via Task Skill Localization and Consolidation | Yujie Feng, Xu Chu, Yongxin Xu, Guangyuan Shi, Bo Liu, Xiao-Ming Wu | N/A | N/A |
| DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models | Damai Dai, Chengqi Deng, Chenggang Zhao, R.X. Xu, Huazuo Gao, Deli Chen, Jiashi Li, Wangding Zeng, Xingkai Yu, Y. Wu, Zhenda Xie, Y.K. Li, Panpan Huang, Fuli Luo, Chong Ruan, Zhifang Sui, Wenfeng Liang | N/A | N/A |
| Grounding Language Model with Chunking-Free In-Context Retrieval | Hongjin Qian, Zheng Liu, Kelong Mao, Yujia Zhou, Zhicheng Dou | N/A | N/A |
| Advancing Abductive Reasoning in Knowledge Graphs through Complex Logical Hypothesis Generation | Jiaxin Bai, Yicheng Wang, Tianshi Zheng, Yue Guo, Xin Liu, Yangqiu Song | N/A | N/A |
| Active Prompting with Chain-of-Thought for Large Language Models | Shizhe Diao, Pengcheng Wang, Yong Lin, Rui Pan, Xiang Liu, Tong Zhang | N/A | N/A |
| EasyGen: Easing Multimodal Generation with BiDiffuser and LLMs | Xiangyu Zhao, Bo Liu, Qijiong Liu, Guangyuan Shi, Xiao-Ming Wu | N/A | N/A |
| Rewriting the Code: A Simple Method for Large Language Model Augmented Code Search | Haochen Li, Xin Zhou, Zhiqi Shen | N/A | N/A |
| A Multidimensional Framework for Evaluating Lexical Semantic Change with Social Science Applications | Naomi Baes, Nick Haslam, Ekaterina Vylomova | N/A | N/A |
| Mitigating Catastrophic Forgetting in Large Language Models with Self-Synthesized Rehearsal | Jianheng Huang, Leyang Cui, Ante Wang, chengyiyang, Xinting Liao, Linfeng Song, Junfeng Yao, Jinsong Su | N/A | N/A |
| Enhancing Large Language Models in Coding Through Multi-Perspective Self-Consistency | Baizhou Huang, Shuai Lu, Xiaojun Wan, Nan Duan | N/A | N/A |
| Citation-Enhanced Generation for LLM-based Chatbots | Weitao Li, Junkai Li, Weizhi Ma, Yang Liu | N/A | N/A |
| Transitive Consistency Constrained Learning for Entity-to-Entity Stance Detection | Haoyang Wen, Eduard Hovy, Alexander G Hauptmann | N/A | N/A |
| Feature-Adaptive and Data-Scalable In-Context Learning | Jiahao Li, Quan Wang, Licheng Zhang, Guoqing Jin, Zhendong Mao | N/A | N/A |
| Probing the Multi-turn Planning Capabilities of LLMs via 20 Question Games | Yizhe Zhang, Jiarui Lu, Navdeep Jaitly | N/A | N/A |
| WaterBench: Towards Holistic Evaluation of Watermarks for Large Language Models | Shangqing Tu, Yuliang Sun, Yushi Bai, Jifan Yu, Lei Hou, Juanzi Li | N/A | N/A |
| Dependency Transformer Grammars: Integrating Dependency Structures into Transformer Language Models | Yida Zhao, Chao Lou, Kewei Tu | N/A | N/A |
| A Non-autoregressive Generation Framework for End-to-End Simultaneous Speech-to-Any Translation | Zhengrui Ma, Qingkai Fang, Shaolei Zhang, Shoutao Guo, Yang Feng, Min Zhang | N/A | N/A |
| PsychoGAT: A Novel Psychological Measurement Paradigm through Interactive Fiction Games with LLM Agents | Qisen Yang, Zekun Wang, Honghui Chen, Shenzhi Wang, Yifan Pu, Xin Gao, Wenhao Huang, Shiji Song, Gao Huang | N/A | N/A |
| Probing Language Models for Pre-training Data Detection | Zhenhua Liu, Tong Zhu, Chuanyuan Tan, Bing Liu, Haonan Lu, Wenliang Chen | N/A | N/A |
| Analyzing Temporal Complex Events with Large Language Models? A Benchmark towards Temporal, Long Context Understanding | Zhihan Zhang, Yixin Cao, Chenchen Ye, Yunshan Ma, Lizi Liao, Tat-Seng Chua | N/A | N/A |
| IBSEN: Director-Actor Agent Collaboration for Controllable and Interactive Drama Script Generation | Senyu Han, Lu Chen, Li-Min Lin, Zhengshan Xu, Kai Yu | N/A | N/A |
| Language Model Adaption for Reinforcement Learning with Natural Language Action Space | Jiangxing Wang, Jiachen Li, Xiao Han, Deheng Ye, Zongqing Lu | N/A | N/A |
| Evaluating Intention Detection Capability of Large Language Models in Persuasive Dialogues | Hiromasa Sakurai, Yusuke Miyao | N/A | N/A |
| LongLLMLingua: Accelerating and Enhancing LLMs in Long Context Scenarios via Prompt Compression | Huiqiang Jiang, Qianhui Wu, Xufang Luo, Dongsheng Li, Chin-Yew Lin, Yuqing Yang, Lili Qiu | N/A | N/A |
| Persuading across Diverse Domains: a Dataset and Persuasion Large Language Model | Chuhao Jin, Kening Ren, Lingzhen Kong, Xiting Wang, Ruihua Song, Huan Chen | N/A | N/A |
| HealMe: Harnessing Cognitive Reframing in Large Language Models for Psychotherapy | Mengxi Xiao, Qianqian Xie, Ziyan Kuang, Zhicheng Liu, Kailai Yang, Min Peng, Weiguang Han, Jimin Huang | N/A | N/A |
| Multimodal Prompt Learning with Missing Modalities for Sentiment Analysis and Emotion Recognition | Zirun Guo, Tao Jin, Zhou Zhao | N/A | N/A |
| An Effective Pronunciation Assessment Approach Leveraging Hierarchical Transformers and Pre-training Strategies | Bi-Cheng Yan, Jiun-Ting Li, Yi-Cheng Wang, Hsin Wei Wang, Tien-Hong Lo, Yung-Chang Hsu, Wei-Cheng Chao, Berlin Chen | N/A | N/A |
| Detection-Correction Structure via General Language Model for Grammatical Error Correction | Wei Li, Houfeng Wang | N/A | N/A |
| Generative Pre-trained Speech Language Model with Efficient Hierarchical Transformer | Yongxin Zhu, Dan Su, Liqiang He, Linli Xu, Dong Yu | N/A | N/A |
| Selene: Pioneering Automated Proof in Software Verification | Lichen Zhang, Shuai Lu, Nan Duan | N/A | N/A |
| Dissecting Human and LLM Preferences | Junlong Li, Fan Zhou, Shichao Sun, Yikai Zhang, Hai Zhao, Pengfei Liu | N/A | N/A |
| UniCoder: Scaling Code Large Language Model via Universal Code | Tao Sun, Linzheng Chai, Jian Yang, Yuwei Yin, Hongcheng Guo, Jiaheng Liu, Bing Wang, Liqun Yang, Zhoujun Li | N/A | N/A |
| AoE: Angle-optimized Embeddings for Semantic Textual Similarity | Xianming Li, Jing Li | N/A | N/A |
| InCharacter: Evaluating Personality Fidelity in Role-Playing Agents through Psychological Interviews | Xintao Wang, Yunze Xiao, Jen-tse Huang, Siyu Yuan, Rui Xu, Haoran Guo, Quan Tu, Yaying Fei, Ziang Leng, Wei Wang, Jiangjie Chen, Cheng Li, Yanghua Xiao | N/A | N/A |
| Does DetectGPT Fully Utilize Perturbation? Bridging Selective Perturbation to Fine-tuned Contrastive Learning Detector would be Better | Shengchao Liu, Xiaoming Liu, Yichen Wang, Zehua Cheng, Chengzhengxu Li, Zhaohan Zhang, Yu Lan, Chao Shen | N/A | N/A |
| AFaCTA: Assisting the Annotation of Factual Claim Detection with Reliable LLM Annotators | Jingwei Ni, Minjing Shi, Dominik Stammbach, Mrinmaya Sachan, Elliott Ash, Markus Leippold | N/A | N/A |
| Towards Faithful and Robust LLM Specialists for Evidence-Based Question-Answering | Tobias Schimanski, Jingwei Ni, Mathias Kraus, Elliott Ash, Markus Leippold | N/A | N/A |
| LoRAMoE: Alleviating World Knowledge Forgetting in Large Language Models via MoE-Style Plugin | Shihan Dou, Enyu Zhou, Yan Liu, Songyang Gao, Wei Shen, Limao Xiong, Yuhao Zhou, Xiao Wang, Zhiheng Xi, Xiaoran Fan, Shiliang Pu, Zhu Jiang, Rui Zheng, Tao Gui, Qi Zhang, Xuanjing Huang | N/A | N/A |
| Self-Alignment for Factuality: Mitigating Hallucinations in LLMs via Self-Evaluation | Xiaoying Zhang, Baolin Peng, Ye Tian, Jingyan Zhou, Lifeng Jin, Linfeng Song, Haitao Mi, Helen M. Meng | N/A | N/A |
| M-RAG: Reinforcing Large Language Model Performance through Retrieval-Augmented Generation with Multiple Partitions | Zheng Wang, Shu Xian Teo, Jieer Ouyang, Yongjun Xu, Wei Shi | N/A | N/A |
| AIR-Bench: Benchmarking Large Audio-Language Models via Generative Comprehension | Qian Yang, Jin Xu, Wenrui Liu, Yunfei Chu, Ziyue Jiang, Xiaohuan Zhou, Yichong Leng, Yuanjun Lv, Zhou Zhao, Chang Zhou, Jingren Zhou | N/A | N/A |
| Navigating the Metrics Maze: Reconciling Score Magnitudes and Accuracies | Tom Kocmi, Vilém Zouhar, Christian Federmann, Matt Post | N/A | N/A |
| ValueBench: Towards Comprehensively Evaluating Value Orientations and Understanding of Large Language Models | Yuanyi Ren, Haoran Ye, Hanjun Fang, Xin Zhang, Guojie Song | N/A | N/A |
| DM-BLI: Dynamic Multiple Subspaces Alignment for Unsupervised Bilingual Lexicon Induction | Ling Hu, Yuemei Xu | N/A | N/A |
| SparseFit: Few-shot Prompting with Sparse Fine-tuning for Jointly Generating Predictions and Natural Language Explanations | Jesus Solano, Mardhiyah Sanni, Oana-Maria Camburu, Pasquale Minervini | N/A | N/A |
| Handling Ambiguity in Emotion: From Out-of-Domain Detection to Distribution Estimation | Wen Wu, Bo Li, Chao Zhang, Chung-Cheng Chiu, Qiujia Li, Junwen Bai, Tara N Sainath, Phil Woodland | N/A | N/A |
| REANO: Optimising Retrieval-Augmented Reader Models through Knowledge Graph Generation | Jinyuan Fang, Zaiqiao Meng, Craig MacDonald | N/A | N/A |
| Learning Disentangled Semantic Spaces of Explanations via Invertible Neural Networks | Yingji Zhang, Danilo Carvalho, Andre Freitas | N/A | N/A |
| MoPS: Modular Story Premise Synthesis for Open-Ended Automatic Story Generation | Yan Ma, Yu Qiao, Pengfei Liu | N/A | N/A |
| Open-Set Semi-Supervised Text Classification via Adversarial Disagreement Maximization | Junfan Chen, Richong Zhang, Junchi Chen, Chunming Hu | N/A | N/A |
| ToolSword: Unveiling Safety Issues of Large Language Models in Tool Learning Across Three Stages | Junjie Ye, Sixian Li, Guanyu Li, Huangcaishuang, Songyang Gao, Yilong Wu, Qi Zhang, Tao Gui, Xuanjing Huang | N/A | N/A |
| A synthetic data approach for domain generalization of NLI models | Mohammad Javad Hosseini, Andrey Petrov, Alex Fabrikant, Annie Louis | N/A | N/A |
| Enhancing Contrastive Learning with Noise-Guided Attack: Towards Continual Relation Extraction in the Wild | Ting Wu, Jingyi Liu, Rui Zheng, Tao Gui, Qi Zhang, Xuanjing Huang | N/A | N/A |
| LRQuant: Learnable and Robust Post-Training Quantization for Large Language Models | Jiaqi Zhao, Miao Zhang, Chao Zeng, Ming Wang, Xuebo Liu, Liqiang Nie | N/A | N/A |
| VariErr NLI: Separating Annotation Error from Human Label Variation | Leon Weber-Genzel, Siyao Peng, Marie-Catherine de Marneffe, Barbara Plank | N/A | N/A |
| Towards Better Understanding of Contrastive Sentence Representation Learning: A Unified Paradigm for Gradient | Mingxin Li, Richong Zhang, Zhijie Nie | N/A | N/A |
| Benchmarking Knowledge Boundary for Large Language Models: A Different Perspective on Model Evaluation | Xunjian Yin, Xu Zhang, Jie Ruan, Xiaojun Wan | N/A | N/A |
| ListT5: Listwise Reranking with Fusion-in-Decoder Improves Zero-shot Retrieval | Soyoung Yoon, Eunbi Choi, Jiyeon Kim, Hyeongu Yun, Yireun Kim, Seung-won Hwang | N/A | N/A |
| Exploring the Potential of Large Language Models in Computational Argumentation | Guizhen Chen, Liying Cheng, Anh Tuan Luu, Lidong Bing | N/A | N/A |
| TaxoLLaMA: WordNet-based Model for Solving Multiple Lexical Semantic Tasks | Viktor Moskvoretskii, Ekaterina Neminova, Alina Lobanova, Alexander Panchenko, Irina Nikishina | N/A | N/A |
| CANDLE: Iterative Conceptualization and Instantiation Distillation from Large Language Models for Commonsense Reasoning | Weiqi Wang, Tianqing Fang, Chunyang Li, Haochen Shi, Wenxuan Ding, Baixuan Xu, Zhaowei Wang, Jiaxin Bai, Xin Liu, Cheng Jiayang, Chunkit Chan, Yangqiu Song | N/A | N/A |
| MEFT: Memory-Efficient Fine-Tuning through Sparse Adapter | Jitai Hao, Weiwei Sun, Xin Xin, Qi Meng, Zhumin Chen, Pengjie Ren, Zhaochun Ren | N/A | N/A |
| Surgical Feature-Space Decomposition of LLMs: Why, When and How? | Arnav Chavan, Nahush Lele, Deepak Gupta | N/A | N/A |
| Reasoning in Flux: Enhancing Large Language Models Reasoning through Uncertainty-aware Adaptive Guidance | Zhangyue Yin, Qiushi Sun, Qipeng Guo, Zhiyuan Zeng, Xiaonan Li, Junqi Dai, Qinyuan Cheng, Xuanjing Huang, Xipeng Qiu | N/A | N/A |
| Modality-Aware Integration with Large Language Models for Knowledge-Based Visual Question Answering | Junnan Dong, Qinggang Zhang, Huachi Zhou, Daochen Zha, Pai Zheng, Xiao Huang | N/A | N/A |
| Unlocking Data-free Low-bit Quantization with Matrix Decomposition for KV Cache Compression | Peiyu Liu, Ze-Feng Gao, Xin Zhao, Yipeng Ma, Tao Wang, Ji-Rong Wen | N/A | N/A |
| Emergent Word Order Universals from Cognitively-Motivated Language Models | Tatsuki Kuribayashi, Ryo Ueda, Ryo Yoshida, Yohei Oseki, Ted Briscoe, Timothy Baldwin | N/A | N/A |
| VerifiNER: Verification-augmented NER via Knowledge-grounded Reasoning with Large Language Models | Seoyeon Kim, Kwangwook Seo, Hyungjoo Chae, Jinyoung Yeo, Dongha Lee | N/A | N/A |
| Making Long-Context Language Models Better Multi-Hop Reasoners | Yanyang Li, Shuo Liang, Michael Lyu, Liwei Wang | N/A | N/A |
| TransliCo: A Contrastive Learning Framework to Address the Script Barrier in Multilingual Pretrained Language Models | Yihong Liu, Chunlan Ma, Haotian Ye, Hinrich Schütze | N/A | N/A |
| Extreme Miscalibration and the Illusion of Adversarial Robustness | Vyas Raina, Samson Tan, Volkan Cevher, Aditya Rawal, Sheng Zha, George Karypis | N/A | N/A |
| HyCoRec: Hypergraph-Enhanced Multi-Preference Learning for Alleviating Matthew Effect in Conversational Recommendation | Yongsen Zheng, Ruilin Xu, Ziliang Chen, Guohua Wang, Mingjie Qian, Jinghui Qin, Liang Lin | N/A | N/A |
| Co-training for Low Resource Scientific Natural Language Inference | Mobashir Sadat, Cornelia Caragea | N/A | N/A |
| RLHFPoison: Reward Poisoning Attack for Reinforcement Learning with Human Feedback in Large Language Models | Jiongxiao Wang, Junlin Wu, Muhao Chen, Yevgeniy Vorobeychik, Chaowei Xiao | N/A | N/A |
| Time is Encoded in the Weights of Finetuned Language Models | Kai Nylund, Suchin Gururangan, Noah A. Smith | N/A | N/A |
| Long-Context Language Modeling with Parallel Context Encoding | Howard Yen, Tianyu Gao, Danqi Chen | N/A | N/A |
| SirLLM: Streaming Infinite Retentive LLM | Yao Yao, Zuchao Li, Hai Zhao | N/A | N/A |
| IMO: Greedy Layer-Wise Sparse Representation Learning for Out-of-Distribution Text Classification with Pre-trained Models | Tao Feng, Lizhen Qu, Zhuang Li, Haolan Zhan, Yuncheng Hua, Reza Haf | N/A | N/A |
| Generative Pretrained Structured Transformers: Unsupervised Syntactic Language Models at Scale | Xiang Hu, Pengyu Ji, Qingyang Zhu, Wei Wu, Kewei Tu | N/A | N/A |
| MELA: Multilingual Evaluation of Linguistic Acceptability | Ziyin Zhang, Yikang Liu, Weifang Huang, Junyu Mao, Rui Wang, Hai Hu | N/A | N/A |
| Exploring Collaboration Mechanisms for LLM Agents: A Social Psychology View | Jintian Zhang, Xin Xu, Ningyu Zhang, Ruibo Liu, Bryan Hooi, Shumin Deng | N/A | N/A |
| CopyNE: Better Contextual ASR by Copying Named Entities | Shilin Zhou, Zhenghua Li, Yu Hong, Min Zhang, Zhefeng Wang, Baoxing Huai | N/A | N/A |
| Is Table Retrieval a Solved Problem? Exploring Join-Aware Multi-Table Retrieval | Peter Baile Chen, Yi Zhang, Dan Roth | N/A | N/A |
| Generalizing Conversational Dense Retrieval via LLM-Cognition Data Augmentation | Haonan Chen, Zhicheng Dou, Kelong Mao, Jiongnan Liu, Ziliang Zhao | N/A | N/A |
| ItD: Large Language Models Can Teach Themselves Induction through Deduction | Wangtao Sun, Haotian Xu, Xuanqing Yu, Pei Chen, Shizhu He, Jun Zhao, Kang Liu | N/A | N/A |
| MathGenie: Generating Synthetic Data with Question Back-translation for Enhancing Mathematical Reasoning of LLMs | Zimu Lu, Aojun Zhou, Houxing Ren, Ke Wang, Weikang Shi, Junting Pan, Mingjie Zhan, Hongsheng Li | N/A | N/A |
| MARVEL: Unlocking the Multi-Modal Capability of Dense Retrieval via Visual Module Plugin | Tianshuo Zhou, Sen Mei, Xinze Li, Zhenghao Liu, Chenyan Xiong, Zhiyuan Liu, Yu Gu, Ge Yu | N/A | N/A |
| Rethinking Task-Oriented Dialogue Systems: From Complex Modularity to Zero-Shot Autonomous Agent | Heng-Da Xu, Xian-Ling Mao, Puhai Yang, Fanshu Sun, Heyan Huang | N/A | N/A |
| On Context Utilization in Summarization with Large Language Models | Mathieu Ravaut, Aixin Sun, Nancy F. Chen, Shafiq Joty | N/A | N/A |
| INTERS: Unlocking the Power of Large Language Models in Search with Instruction Tuning | Yutao Zhu, Peitian Zhang, Chenghao Zhang, Yifei Chen, Binyu Xie, Zheng Liu, Ji-Rong Wen, Zhicheng Dou | N/A | N/A |
| Enhancing In-Context Learning via Implicit Demonstration Augmentation | Xiaoling Zhou, Wei Ye, Yidong Wang, Chaoya Jiang, Zhemg Lee, Rui Xie, Shikun Zhang | N/A | N/A |
| PRoLoRA: Partial Rotation Empowers More Parameter-Efficient LoRA | Sheng Wang, Boyang Xue, Jiacheng Ye, Jiyue Jiang, Liheng Chen, Lingpeng Kong, Chuan Wu | N/A | N/A |
| Distributional Inclusion Hypothesis and Quantifications: Probing for Hypernymy in Functional Distributional Semantics | Chun Hei Lo, Wai Lam, Hong Cheng, Guy Emerson | N/A | N/A |
| Improving Event Definition Following For Zero-Shot Event Detection | Zefan Cai, Po-Nien Kung, Ashima Suvarna, Mingyu Derek Ma, Hritik Bansal, Baobao Chang, P. Jeffrey Brantingham, Wei Wang, Nanyun Peng | N/A | N/A |
| Through the MUD: A Multi-Defendant Charge Prediction Benchmark with Linked Crime Elements | Xiao Wei, Xu Qi, Hang Yu, Qian Liu, Erik Cambria | N/A | N/A |
| Interpreting Conversational Dense Retrieval by Rewriting-Enhanced Inversion of Session Embedding | Yiruo Cheng, Kelong Mao, Zhicheng Dou | N/A | N/A |
| Stumbling Blocks: Stress Testing the Robustness of Machine-Generated Text Detectors Under Attacks | Yichen Wang, Shangbin Feng, Abe Bohan Hou, Xiao Pu, Chao Shen, Xiaoming Liu, Yulia Tsvetkov, Tianxing He | N/A | N/A |
| CausalGym: Benchmarking causal interpretability methods on linguistic tasks | Aryaman Arora, Dan Jurafsky, Christopher Potts | N/A | N/A |
| Training Language Models to Generate Text with Citations via Fine-grained Rewards | Chengyu Huang, Zeqiu Wu, Yushi Hu, Wenya Wang | N/A | N/A |
| Hypergraph based Understanding for Document Semantic Entity Recognition | Qiwei Li, Zuchao Li, Ping Wang, Haojun Ai, Hai Zhao | N/A | N/A |
| GSM-Plus: A Comprehensive Benchmark for Evaluating the Robustness of LLMs as Mathematical Problem Solvers | Qintong Li, Leyang Cui, Xueliang Zhao, Lingpeng Kong, Wei Bi | N/A | N/A |
| Synergetic Event Understanding: A Collaborative Approach to Cross-Document Event Coreference Resolution with Large Language Models | Qingkai Min, Qipeng Guo, Xiangkun Hu, Songfang Huang, Zheng Zhang, Yue Zhang | N/A | N/A |
| AutoAct: Automatic Agent Learning from Scratch for QA via Self-Planning | Shuofei Qiao, Ningyu Zhang, Runnan Fang, Yujie Luo, Wangchunshu Zhou, Yuchen Eleanor Jiang, Chengfei Lv, Huajun Chen | N/A | N/A |
| ChronosLex: Time-aware Incremental Training for Temporal Generalization of Legal Classification Tasks | Santosh T.Y.S.S, Tuan-Quang Vuong, Matthias Grabmair | N/A | N/A |
| Virtual Compiler Is All You Need For Assembly Code Search | Zeyu Gao, Hao Wang, Yuanda Wang, Chao Zhang | N/A | N/A |
| MELoRA: Mini-Ensemble Low-Rank Adapters for Parameter-Efficient Fine-Tuning | Pengjie Ren, Chengshun Shi, Shiguang Wu, Mengqi Zhang, Zhaochun Ren, Maarten de Rijke, Zhumin Chen, Jiahuan Pei | N/A | N/A |
| Can LLMs Learn from Previous Mistakes? Investigating LLMs’ Errors to Boost for Reasoning | Yongqi Tong, Dawei Li, Sizhe Wang, Yujia Wang, Fei Teng, Jingbo Shang | N/A | N/A |
| An Iterative Associative Memory Model for Empathetic Response Generation | Zhou Yang, Zhaochun Ren, Wang Yufeng, Haizhou Sun, Chao Chen, Xiaofei Zhu, Xiangwen Liao | N/A | N/A |
| Detoxifying Large Language Models via Knowledge Editing | Mengru Wang, Ningyu Zhang, Ziwen Xu, Zekun Xi, Shumin Deng, Yunzhi Yao, Qishen Zhang, Linyi Yang, Jindong Wang, Huajun Chen | N/A | N/A |
| LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding | Yushi Bai, Xin Lv, Jiajie Zhang, Hongchang Lyu, Jiankai Tang, Zhidian Huang, Zhengxiao Du, Xiao Liu, Aohan Zeng, Lei Hou, Yuxiao Dong, Jie Tang, Juanzi Li | N/A | N/A |
| Dr.Academy: A Benchmark for Evaluating Questioning Capability in Education for Large Language Models | Yuyan Chen, Songzhou Yan, Panjun Liu, Yanghua Xiao | N/A | N/A |
| UniBridge: A Unified Approach to Cross-Lingual Transfer Learning for Low-Resource Languages | Trinh Pham, Khoi M. Le, Anh Tuan Luu | N/A | N/A |
| VISTA: Visualized Text Embedding For Universal Multi-Modal Retrieval | Junjie Zhou, Zheng Liu, Shitao Xiao, Bo Zhao, Yongping Xiong | N/A | N/A |
| Black-Box Prompt Optimization: Aligning Large Language Models without Model Training | Jiale Cheng, Xiao Liu, Kehan Zheng, Pei Ke, Hongning Wang, Yuxiao Dong, Jie Tang, Minlie Huang | N/A | N/A |
| Open Ko-LLM Leaderboard: Evaluating Large Language Models in Korean with Ko-H5 Benchmark | Chanjun Park, Hyeonwoo Kim, Dahyun Kim, SeongHwan Cho, Sanghoon Kim, Sukyung Lee, Yungi Kim, Hwalsuk Lee | N/A | N/A |
| Unified Hallucination Detection for Multimodal Large Language Models | Xiang Chen, Chenxi Wang, Yida Xue, Ningyu Zhang, Xiaoyan Yang, Qiang Li, Yue Shen, Lei Liang, Jinjie Gu, Huajun Chen | N/A | N/A |
| Empowering Character-level Text Infilling by Eliminating Sub-Tokens | Houxing Ren, Mingjie Zhan, Zhongyuan Wu, Hongsheng Li | N/A | N/A |
| Landmark Embedding: A Chunking-Free Embedding Method For Retrieval Augmented Long-Context Large Language Models | Kun Luo, Zheng Liu, Shitao Xiao, Tong Zhou, Yubo Chen, Jun Zhao, Kang Liu | N/A | N/A |
| GrowOVER: How Can LLMs Adapt to Growing Real-World Knowledge? | Dayoon Ko, Jinyoung Kim, Hahyeon Choi, Gunhee Kim | N/A | N/A |
| Attribute First, then Generate: Locally-attributable Grounded Text Generation | Aviv Slobodkin, Eran Hirsch, Arie Cattan, Tal Schuster, Ido Dagan | N/A | N/A |
| T2S-GPT: Dynamic Vector Quantization for Autoregressive Sign Language Production from Text | Aoxiong Yin, Haoyuan Li, Kai Shen, Siliang Tang, Yueting Zhuang | N/A | N/A |
| OceanGPT: A Large Language Model for Ocean Science Tasks | Zhen Bi, Ningyu Zhang, Yida Xue, Yixin Ou, Daxiong Ji, Guozhou Zheng, Huajun Chen | N/A | N/A |
| Beyond Memorization: The Challenge of Random Memory Access in Language Models | Tongyao Zhu, Qian Liu, Liang Pang, Zhengbao Jiang, Min-Yen Kan, Min Lin | N/A | N/A |
| BIPED: Pedagogically Informed Tutoring System for ESL Education | Soonwoo Kwon, Sojung Kim, Minju Park, Seunghyun Lee, Kyuseok Kim | N/A | N/A |
| Timeline-based Sentence Decomposition with In Context Learning for Temporal Fact Extraction | Jianhao Chen, Haoyuan Ouyang, Junyang Ren, Wentao Ding, Wei Hu, Yuzhong Qu | N/A | N/A |
| Collaboration or Corporate Capture? Quantifying NLP’s Reliance on Industry Artifacts and Contributions | Will Aitken, Mohamed Abdalla, Karen Rudie, Catherine Stinson | N/A | N/A |
| Prompt Expansion for Adaptive Text-to-Image Generation | Siddhartha Datta, Alexander Ku, Deepak Ramachandran, Peter Anderson | N/A | N/A |
| Progressively Modality Freezing for Multi-Modal Entity Alignment | Yani Huang, Xuefeng Zhang, Richong Zhang, Junfan Chen, Jaein Kim | N/A | N/A |
| Llama2Vec: Unsupervised Adaptation of Large Language Models for Dense Retrieval | Chaofan Li, Zheng Liu, Shitao Xiao, Yingxia Shao, Defu Lian | N/A | N/A |
| Democratizing LLMs for Low-Resource Languages by Leveraging their English Dominant Abilities with Linguistically-Diverse Prompts | Xuan-Phi Nguyen, Mahani Aljunied, Shafiq Joty, Lidong Bing | N/A | N/A |
| Metaphor Understanding Challenge Dataset for LLMs | Xiaoyu Tong, Rochelle Choenni, Martha Lewis, Ekaterina Shutova | N/A | N/A |
| A Multi-Task Embedder For Retrieval Augmented LLMs | Peitian Zhang, Zheng Liu, Shitao Xiao, Zhicheng Dou, Jian-Yun Nie | N/A | N/A |
| Language Models Don’t Learn the Physical Manifestation of Language | Bruce W Lee, Jaehyuk Lim | N/A | N/A |
| Don’t Hallucinate, Abstain: Identifying LLM Knowledge Gaps via Multi-LLM Collaboration | Shangbin Feng, Weijia Shi, Yike Wang, Wenxuan Ding, Vidhisha Balachandran, Yulia Tsvetkov | N/A | N/A |
| What Does the Bot Say? Opportunities and Risks of Large Language Models in Social Media Bot Detection | Shangbin Feng, Herun Wan, Ningnan Wang, Zhaoxuan Tan, Minnan Luo, Yulia Tsvetkov | N/A | N/A |
| Self-Contrast: Better Reflection Through Inconsistent Solving Perspectives | Wenqi Zhang, Yongliang Shen, Linjuan Wu, Qiuying Peng, Jun Wang, Yueting Zhuang, Weiming Lu | N/A | N/A |
| Relying on the Unreliable: The Impact of Language Models’ Reluctance to Express Uncertainty | Kaitlyn Zhou, Jena D. Hwang, Xiang Ren, Maarten Sap | N/A | N/A |
| Mission: Impossible Language Models | Julie Kallini, Isabel Papadimitriou, Richard Futrell, Kyle Mahowald, Christopher Potts | N/A | N/A |
| Unity in Diversity: Collaborative Pre-training Across Multimodal Medical Sources | Xiaochen Wang, Junyu Luo, Jiaqi Wang, Yuan Zhong, Xiaokun Zhang, Yaqing Wang, Parminder Bhatia, Cao Xiao, Fenglong Ma | N/A | N/A |
| Semisupervised Neural Proto-Language Reconstruction | Liang Lu, Peirong Xie, David R Mortensen | N/A | N/A |
| When Good and Reproducible Results are a Giant with Feet of Clay: The Importance of Software Quality in NLP | Sara Papi, Marco Gaido, Andrea Pilzer, Matteo Negri | N/A | N/A |
| SBAAM! Eliminating Transcript Dependency in Automatic Subtitling | Marco Gaido, Sara Papi, Matteo Negri, Mauro Cettolo, Luisa Bentivogli | N/A | N/A |
| Speech Translation with Speech Foundation Models and Large Language Models: What is There and What is Missing? | Marco Gaido, Sara Papi, Matteo Negri, Luisa Bentivogli | N/A | N/A |
| StreamAtt: Direct Streaming Speech-to-Text Translation with Attention-based Audio History Selection | Sara Papi, Marco Gaido, Matteo Negri, Luisa Bentivogli | N/A | N/A |
| ARL2: Aligning Retrievers with Black-box Large Language Models via Self-guided Adaptive Relevance Labeling | LingXi Zhang, Yue Yu, Kuan Wang, Chao Zhang | N/A | N/A |
| Crayon: Customized On-Device LLM via Instant Adapter Blending and Edge-Server Hybrid Inference | Jihwan Bang, Juntae Lee, Kyuhong Shim, Seunghan Yang, Simyung Chang | N/A | N/A |
| FLEUR: An Explainable Reference-Free Evaluation Metric for Image Captioning Using a Large Multimodal Model | Yebin Lee, Imseong Park, Myungjoo Kang | N/A | N/A |
| MentalManip: A Dataset For Fine-grained Analysis of Mental Manipulation in Conversations | Yuxin Wang, Ivory Yang, Saeed Hassanpour, Soroush Vosoughi | N/A | N/A |
| MPCoder: Multi-user Personalized Code Generator with Explicit and Implicit Style Representation Learning | Zhenlong Dai, Chang Yao, WenKang Han, Yuanying, Zhipeng Gao, Jingyuan Chen | N/A | N/A |
| DataDreamer: A Tool for Synthetic Data Generation and Reproducible LLM Workflows | Ajay Patel, Colin Raffel, Chris Callison-Burch | N/A | N/A |
| Understanding and Addressing the Under-Translation Problem from the Perspective of Decoding Objective | Chenze Shao, Fandong Meng, Jiali Zeng, Jie Zhou | N/A | N/A |
| Identifying while Learning for Document Event Causality Identification | Cheng Liu, Wei Xiang, Bang Wang | N/A | N/A |
| OlympiadBench: A Challenging Benchmark for Promoting AGI with Olympiad-Level Bilingual Multimodal Scientific Problems | Chaoqun He, Renjie Luo, Yuzhuo Bai, Shengding Hu, Zhen Leng Thai, Junhao Shen, Jinyi Hu, Xu Han, Yujie Huang, Yuxiang Zhang, Jie Liu, Lei Qi, Zhiyuan Liu, Maosong Sun | N/A | N/A |
| Insert or Attach: Taxonomy Completion via Box Embedding | Wei Xue, Yongliang Shen, Wenqi Ren, Jietian Guo, Shiliang Pu, Weiming Lu | N/A | N/A |
| Semiparametric Token-Sequence Co-Supervision | Hyunji Lee, Doyoung Kim, Jihoon Jun, Se June Joo, Joel Jang, Kyoung-Woon On, Minjoon Seo | N/A | N/A |
| Instruction Fusion: Advancing Prompt Evolution through Hybridization | Weidong Guo, Jiuding Yang, Kaitong Yang, Xiangyang Li, Zhuwei Rao, Yu Xu, Di Niu | N/A | N/A |
| TimeArena: Shaping Efficient Multitasking Language Agents in a Time-Aware Simulation | Yikai Zhang, Siyu Yuan, Caiyu Hu, Kyle Richardson, Yanghua Xiao, Jiangjie Chen | N/A | N/A |
| Exploring Memorization in Fine-tuned Language Models | Shenglai Zeng, Yaxin Li, Jie Ren, Yiding Liu, Han Xu, Pengfei He, Yue Xing, Shuaiqiang Wang, Jiliang Tang, Dawei Yin | N/A | N/A |
| Towards Real-world Scenario: Imbalanced New Intent Discovery | Shun Zhang, Yan Chaoran, Jian Yang, Jiaheng Liu, Ying Mo, Jiaqi Bai, Tongliang Li, Zhoujun Li | N/A | N/A |
| M4GT-Bench: Evaluation Benchmark for Black-Box Machine-Generated Text Detection | Yuxia Wang, Jonibek Mansurov, Petar Ivanov, Jinyan Su, Artem Shelmanov, Akim Tsvigun, Osama Mohammed Afzal, Tarek Mahmoud, Giovanni Puccetti, Thomas Arnold, Alham Fikri Aji, Nizar Habash, Iryna Gurevych, Preslav Nakov | N/A | N/A |
| Instruct Once, Chat Consistently in Multiple Rounds: An Efficient Tuning Framework for Dialogue | Jian Wang, Chak Tou Leong, Jiashuo Wang, Dongding Lin, Wenjie Li, Xiaoyong Wei | N/A | N/A |
| SoftDedup: an Efficient Data Reweighting Method for Speeding Up Language Model Pre-training | Nan He, Weichen Xiong, Hanwen Liu, Yi Liao, Lei Ding, Kai Zhang, Guohua Tang, Xiao Han, Yang Wei | N/A | N/A |
| Rule or Story, Which is a Better Commonsense Expression for Talking with Large Language Models? | Ning Bian, Xianpei Han, Hongyu Lin, Yaojie Lu, Ben He, Le Sun | N/A | N/A |
| Learning Global Controller in Latent Space for Parameter-Efficient Fine-Tuning | Zeqi Tan, Yongliang Shen, Xiaoxia Cheng, Chang Zong, Wenqi Zhang, Jian Shao, Weiming Lu, Yueting Zhuang | N/A | N/A |
| CaMML: Context-Aware Multimodal Learner for Large Models | Yixin Chen, Shuai Zhang, Boran Han, Tong He, Bo Li | N/A | N/A |
| MAVEN-ARG: Completing the Puzzle of All-in-One Event Understanding Dataset with Event Argument Annotation | Xiaozhi Wang, Hao Peng, Yong Guan, Kaisheng Zeng, Jianhui Chen, Lei Hou, Xu Han, Yankai Lin, Zhiyuan Liu, Ruobing Xie, Jie Zhou, Juanzi Li | N/A | N/A |
| NPHardEval: Dynamic Benchmark on Reasoning Ability of Large Language Models via Complexity Classes | Lizhou Fan, Wenyue Hua, Lingyao Li, Haoyang Ling, Yongfeng Zhang | N/A | N/A |
| Can Watermarks Survive Translation? On the Cross-lingual Consistency of Text Watermark for Large Language Models | Zhiwei He, Binglin Zhou, Hongkun Hao, Aiwei Liu, Xing Wang, Zhaopeng Tu, Zhuosheng Zhang, Rui Wang | N/A | N/A |
| Speech vs. Transcript: Does It Matter for Human Annotators in Speech Summarization? | Roshan Sharma, Suwon Shon, Mark Lindsey, Hira Dhamyal, Bhiksha Raj | N/A | N/A |
| Multi-Level Feedback Generation with Large Language Models for Empowering Novice Peer Counselors | Alicja Chaszczewicz, Raj Sanjay Shah, Ryan Louie, Bruce A Arnow, Robert Kraut, Diyi Yang | N/A | N/A |
| D2LLM: Decomposed and Distilled Large Language Models for Semantic Search | Zihan Liao, Hang Yu, Jianguo Li, Jun Wang, Wei Zhang | N/A | N/A |
| In-context Mixing (ICM): Code-mixed Prompts for Multilingual LLMs | Bhavani Shankar, Preethi Jyothi, Pushpak Bhattacharyya | N/A | N/A |
| Respond in my Language: Mitigating Language Inconsistency in Response Generation based on Large Language Models | Liang Zhang, Qin Jin, Haoyang Huang, Dongdong Zhang, Furu Wei | N/A | N/A |
| Transferable Embedding Inversion Attack: Uncovering Privacy Risks in Text Embeddings without Model Queries | Yu-Hsiang Huang, Yuche Tsai, Hsiang Hsiao, Hong-Yi Lin, Shou-De Lin | N/A | N/A |
| Enhancing Reinforcement Learning with Label-Sensitive Reward for Natural Language Understanding | Kuo Liao, Shuang Li, Meng Zhao, Liqun Liu, Mengge Xue, Zhenyu Hu, Honglin Han, Chengguo Yin | N/A | N/A |
| Intuitive or Dependent? Investigating LLMs’ Behavior Style to Conflicting Prompts | Jiahao Ying, Yixin Cao, Kai Xiong, Long Cui, Yidong He, Yongbin Liu | N/A | N/A |
| CoCA: Fusing Position Embedding with Collinear Constrained Attention in Transformers for Long Context Window Extending | Shiyi Zhu, Jing Ye, Wei Jiang, Siqiao Xue, Qi Zhang, Yifan Wu, Jianguo Li | N/A | N/A |
| Arabic Diacritics in the Wild: Exploiting Opportunities for Improved Diacritization | Salman Elgamal, Ossama Obeid, MHD Tameem Kabbani, Go Inoue, Nizar Habash | N/A | N/A |
| InfoLossQA: Characterizing and Recovering Information Loss in Text Simplification | Jan Trienes, Sebastian Antony Joseph, Jörg Schlötterer, Christin Seifert, Kyle Lo, Wei Xu, Byron C Wallace, Junyi Jessy Li | N/A | N/A |
| Disinformation Capabilities of Large Language Models | Ivan Vykopal, Matúš Pikuliak, Ivan Srba, Robert Moro, Dominik Macko, Maria Bielikova | N/A | N/A |
| Learn or Recall? Revisiting Incremental Learning with Pre-trained Language Models | Junhao Zheng, Shengjie Qiu, Qianli Ma | N/A | N/A |
| CoGenesis: A Framework Collaborating Large and Small Language Models for Secure Context-Aware Instruction Following | Kaiyan Zhang, Jianyu Wang, Ermo Hua, Biqing Qi, Ning Ding, Bowen Zhou | N/A | N/A |
| DAPR: A Benchmark on Document-Aware Passage Retrieval | Kexin Wang, Nils Reimers, Iryna Gurevych | N/A | N/A |
| How to Handle Different Types of Out-of-Distribution Scenarios in Computational Argumentation? A Comprehensive and Fine-Grained Field Study | Andreas Waldis, Yufang Hou, Iryna Gurevych | N/A | N/A |
| Strengthened Symbol Binding Makes Large Language Models Reliable Multiple-Choice Selectors | Mengge Xue, Zhenyu Hu, Liqun Liu, Kuo Liao, Shuang Li, Honglin Han, Meng Zhao, Chengguo Yin | N/A | N/A |
| SAC-KG: Exploiting Large Language Models as Skilled Automatic Constructors for Domain Knowledge Graph | Hanzhu Chen, Xu Shen, Qitan Lv, Jie Wang, Xiaoqi Ni, Jieping Ye | N/A | N/A |
| Cendol: Open Instruction-tuned Generative Large Language Models for Indonesian Languages | Samuel Cahyawijaya, Holy Lovenia, Fajri Koto, Rifki Afina Putri, Wawan Cenggoro, Jhonson Lee, Salsabil Maulana Akbar, Emmanuel Dave, Nuurshadieq, Muhammad Ihza Mahendra, Rr Dea Annisayanti Putri, Bryan Wilie, Genta Indra Winata, Alham Fikri Aji, Ayu Purwarianti, Pascale Fung | N/A | N/A |
| Uncertainty-Guided Modal Rebalance for Hateful Memes Detection | Chuanpeng Yang, Yaxin Liu, Fuqing Zhu, Jizhong Han, Songlin Hu | N/A | N/A |
| Must NLP be Extractive? | Steven Bird | N/A | N/A |
| Spiral of Silence: How is Large Language Model Killing Information Retrieval?—A Case Study on Open Domain Question Answering | Xiaoyang Chen, Ben He, Hongyu Lin, Xianpei Han, Tianshu Wang, Boxi Cao, Le Sun, Yingfei Sun | N/A | N/A |
| Missci: Reconstructing Fallacies in Misrepresented Science | Max Glockner, Yufang Hou, Preslav Nakov, Iryna Gurevych | N/A | N/A |
| Uncovering the Full Potential of Visual Grounding Methods in VQA | Daniel Reich, Tanja Schultz | N/A | N/A |
| Small Models, Big Insights: Leveraging Slim Proxy Models To Decide When and What to Retrieve for LLMs | Jiejun Tan, Zhicheng Dou, Yutao Zhu, Peidong Guo, Kun Fang, Ji-Rong Wen | N/A | N/A |
| Favi-Score: A Measure for Favoritism in Automated Preference Ratings for Generative AI Evaluation | Pius von Däniken, Jan Milan Deriu, Don Tuggener, Mark Cieliebak | N/A | N/A |
| LLM-based Rewriting of Inappropriate Argumentation using Reinforcement Learning from Machine Feedback | Timon Ziegenbein, Gabriella Skitalinskaya, Alireza Bayat Makou, Henning Wachsmuth | N/A | N/A |
| Graph Language Models | Moritz Plenz, Anette Frank | N/A | N/A |
| Analyzing Semantic Change through Lexical Replacements | Francesco Periti, Pierluigi Cassotti, Haim Dubossarsky, Nina Tahmasebi | N/A | N/A |
| Exploiting Intrinsic Multilateral Logical Rules for Weakly Supervised Natural Language Video Localization | Zhe Xu, Kun Wei, Xu Yang, Cheng Deng | N/A | N/A |
| Latxa: An Open Language Model and Evaluation Suite for Basque | Julen Etxaniz, Oscar Sainz, Naiara Perez Miguel, Itziar Aldabe, German Rigau, Eneko Agirre, Aitor Ormazabal, Mikel Artetxe, Aitor Soroa | N/A | N/A |
| Interpretability of Language Models via Task Spaces | Lucas Weber, Jaap Jumelet, Elia Bruni, Dieuwke Hupkes | N/A | N/A |
| Using Synchronic Definitions and Semantic Relations to Classify Semantic Change Types | Pierluigi Cassotti, Stefano De Pascale, Nina Tahmasebi | N/A | N/A |
| Factual Confidence of LLMs: on Reliability and Robustness of Current Estimators | Matéo Mahaut, Laura Aina, Paula Czarnowska, Momchil Hardalov, Thomas Müller, Lluis Marquez | N/A | N/A |
| StepCoder: Improving Code Generation with Reinforcement Learning from Compiler Feedback | Shihan Dou, Yan Liu, Haoxiang Jia, Enyu Zhou, Limao Xiong, Junjie Shan, Huangcaishuang, Xiao Wang, Xiaoran Fan, Zhiheng Xi, Yuhao Zhou, Tao Ji, Rui Zheng, Qi Zhang, Tao Gui, Xuanjing Huang | N/A | N/A |
| One-Shot Learning as Instruction Data Prospector for Large Language Models | Yunshui Li, Binyuan Hui, Xiaobo Xia, Jiaxi Yang, Min Yang, Lei Zhang, Shuzheng Si, Ling-Hao Chen, Junhao Liu, Tongliang Liu, Fei Huang, Yongbin Li | N/A | N/A |
| Navigating the OverKill in Large Language Models | Chenyu Shi, Xiao Wang, Qiming Ge, Songyang Gao, Xianjun Yang, Tao Gui, Qi Zhang, Xuanjing Huang, Xun Zhao, Dahua Lin | N/A | N/A |
| Why are Sensitive Functions Hard for Transformers? | Michael Hahn, Mark Rofin | N/A | N/A |
| A Chain-of-Thought Is as Strong as Its Weakest Link: A Benchmark for Verifiers of Reasoning Chains | Alon Jacovi, Yonatan Bitton, Bernd Bohnet, Jonathan Herzig, Or Honovich, Michael Tseng, Michael Collins, Roee Aharoni, Mor Geva | N/A | N/A |
| Re3: A Holistic Framework and Dataset for Modeling Collaborative Document Revision | Qian Ruan, Ilia Kuznetsov, Iryna Gurevych | N/A | N/A |
| NextLevelBERT: Masked Language Modeling with Higher-Level Representations for Long Documents | Tamara Czinczoll, Christoph Hönes, Maximilian Schall, Gerard de Melo | N/A | N/A |
| FollowBench: A Multi-level Fine-grained Constraints Following Benchmark for Large Language Models | Yuxin Jiang, Yufei Wang, Xingshan Zeng, Wanjun Zhong, Liangyou Li, Fei Mi, Lifeng Shang, Xin Jiang, Qun Liu, Wei Wang | N/A | N/A |
| Talk With Human-like Agents: Empathetic Dialogue Through Perceptible Acoustic Reception and Reaction | Haoqiu Yan, Yongxin Zhu, Kai Zheng, Bing Liu, Haoyu Cao, Deqiang Jiang, Linli Xu | N/A | N/A |
| Learning to Edit: Aligning LLMs with Knowledge Editing | Yuxin Jiang, Yufei Wang, Chuhan Wu, Wanjun Zhong, Xingshan Zeng, Jiahui Gao, Liangyou Li, Xin Jiang, Lifeng Shang, Ruiming Tang, Qun Liu, Wei Wang | N/A | N/A |
| DolphCoder: Echo-Locating Code Large Language Models with Diverse and Multi-Objective Instruction Tuning | Yejie Wang, Keqing He, Guanting Dong, Pei Wang, Weihao Zeng, Muxi Diao, Weiran Xu, Jingang Wang, Mengdi Zhang, Xunliang Cai | N/A | N/A |
| IRCoder: Intermediate Representations Make Language Models Robust Multilingual Code Generators | Indraneil Paul, Goran Glavaš, Iryna Gurevych | N/A | N/A |
| When Only Time Will Tell: Interpreting How Transformers Process Local Ambiguities Through the Lens of Restart-Incrementality | Brielen Madureira, Patrick Kahardipraja, David Schlangen | N/A | N/A |
| SpaRC and SpaRP: Spatial Reasoning Characterization and Path Generation for Understanding Spatial Reasoning Capability of Large Language Models | Md Imbesat Hassan Rizvi, Xiaodan Zhu, Iryna Gurevych | N/A | N/A |
| Planning Like Human: A Dual-process Framework for Dialogue Planning | Tao He, Lizi Liao, Yixin Cao, Yuanxing Liu, Ming Liu, Zerui Chen, Bing Qin | N/A | N/A |
| Spectral Filters, Dark Signals, and Attention Sinks | Nicola Cancedda | N/A | N/A |
| DiffuCOMET: Contextual Commonsense Knowledge Diffusion | Silin Gao, Mete Ismayilzada, Mengjie Zhao, Hiromi Wakaki, Yuki Mitsufuji, Antoine Bosselut | N/A | N/A |
| Systematic Task Exploration with LLMs: A Study in Citation Text Generation | Furkan Şahinuç, Ilia Kuznetsov, Yufang Hou, Iryna Gurevych | N/A | N/A |
| The Echoes of Multilinguality: Tracing Cultural Value Shifts during Language Model Fine-tuning | Rochelle Choenni, Anne Lauscher, Ekaterina Shutova | N/A | N/A |
| Limits of Theory of Mind Modelling in Dialogue-Based Collaborative Plan Acquisition | Matteo Bortoletto, Constantin Ruhdorfer, Adnen Abdessaied, Lei Shi, Andreas Bulling | N/A | N/A |
| MYTE: Morphology-Driven Byte Encoding for Better and Fairer Multilingual Language Modeling | Tomasz Limisiewicz, Terra Blevins, Hila Gonen, Orevaoghene Ahia, Luke Zettlemoyer | N/A | N/A |
| Temporal Knowledge Question Answering via Abstract Reasoning Induction | Ziyang Chen, Dongfang Li, Xiang Zhao, Baotian Hu, Min Zhang | N/A | N/A |
| MultiLegalPile: A 689GB Multilingual Legal Corpus | Joel Niklaus, Veton Matoshi, Matthias Stürmer, Ilias Chalkidis, Daniel E. Ho | N/A | N/A |
| Who Wrote this Code? Watermarking for Code Generation | Taehyun Lee, Seokhee Hong, Jaewoo Ahn, Ilgee Hong, Hwaran Lee, Sangdoo Yun, Jamin Shin, Gunhee Kim | N/A | N/A |
| MapCoder: Multi-Agent Code Generation for Competitive Problem Solving | Md. Ashraful Islam, Mohammed Eunus Ali, Md Rizwan Parvez | N/A | N/A |
| RelayAttention for Efficient Large Language Model Serving with Long System Prompts | Lei Zhu, Xinjiang Wang, Wayne Zhang, Rynson W. H. Lau | N/A | N/A |
| Boosting Language Models Reasoning with Chain-of-Knowledge Prompting | Jianing Wang, Qiushi Sun, Xiang Li, Ming Gao | N/A | N/A |
| Open Grounded Planning: Challenges and Benchmark Construction | Shiguang Guo, Ziliang Deng, Hongyu Lin, Yaojie Lu, Xianpei Han, Le Sun | N/A | N/A |
| WebCiteS: Attributed Query-Focused Summarization on Chinese Web Search Results with Citations | Haolin Deng, Chang Wang, Li Xin, Dezhang Yuan, Junlang Zhan, Tian Hua Zhou, Jin Ma, Jun Gao, Ruifeng Xu | N/A | N/A |
| LLM Knows Body Language, Too: Translating Speech Voices into Human Gestures | Chenghao Xu, Guangtao Lyu, Jiexi Yan, Muli Yang, Cheng Deng | N/A | N/A |
| QueryAgent: A Reliable and Efficient Reasoning Framework with Environmental Feedback based Self-Correction | Xiang Huang, Sitao Cheng, Shanshan Huang, Jiayu Shen, Yong Xu, Chaoyun Zhang, Yuzhong Qu | N/A | N/A |
| PITA: Prompting Task Interaction for Argumentation Mining | Yang Sun, Muyi Wang, Jianzhu Bao, Bin Liang, Xiaoyan Zhao, Caihua Yang, Min Yang, Ruifeng Xu | N/A | N/A |
| Shifting Attention to Relevance: Towards the Predictive Uncertainty Quantification of Free-Form Large Language Models | Jinhao Duan, Hao Cheng, Shiqi Wang, Alex Zavalny, Chenan Wang, Renjing Xu, Bhavya Kailkhura, Kaidi Xu | N/A | N/A |
| Babel-ImageNet: Massively Multilingual Evaluation of Vision-and-Language Representations | Gregor Geigle, Radu Timofte, Goran Glavaš | N/A | N/A |
| Estimating Agreement by Chance for Sequence Annotation | Diya Li, Carolyn Rose, Ao Yuan, Chunxiao Zhou | N/A | N/A |
| What Languages are Easy to Language-Model? A Perspective from Learning Probabilistic Regular Languages | Nadav Borenstein, Anej Svete, Robin Chan, Josef Valvoda, Franz Nowak, Isabelle Augenstein, Eleanor Chodroff, Ryan Cotterell | N/A | N/A |
| Are Emergent Abilities in Large Language Models just In-Context Learning? | Sheng Lu, Irina Bigoulaeva, Rachneet Singh Sachdeva, Harish Tayyar Madabushi, Iryna Gurevych | N/A | N/A |
| WaveCoder: Widespread And Versatile Enhancement For Code Large Language Models By Instruction Tuning | Zhaojian Yu, Xin Zhang, Ning Shang, Yangyu Huang, Can Xu, Yishujie Zhao, Wenxiang Hu, Qiufeng Yin | N/A | N/A |
| Eliciting Better Multilingual Structured Reasoning from LLMs through Code | Bryan Li, Tamer Alkhouli, Daniele Bonadiman, Nikolaos Pappas, Saab Mansour | N/A | N/A |
| OLIVE: Object Level In-Context Visual Embeddings | Timothy Ossowski, Junjie Hu | N/A | N/A |
| Quantifying Uncertainty in Answers from any Language Model and Enhancing their Trustworthiness | Jiuhai Chen, Jonas Mueller | N/A | N/A |
| Marathon: A Race Through the Realm of Long Context with Large Language Models | Lei Zhang, Yunshui Li, Ziqiang Liu, Jiaxi Yang, Junhao Liu, Longze Chen, Run Luo, Min Yang | N/A | N/A |
| Beyond Scaling: Predicting Patent Approval with Domain-specific Fine-grained Claim Dependency Graph | Xiaochen Kev Gao, Feng Yao, Kewen Zhao, Beilei He, Animesh Kumar, Vish Krishnan, Jingbo Shang | N/A | N/A |
| PCAD: Towards ASR-Robust Spoken Language Understanding via Prototype Calibration and Asymmetric Decoupling | Xianwei Zhuang, Xuxin Cheng, Liming Liang, Yuxin Xie, Zhichang Wang, Zhiqi Huang, Yuexian Zou | N/A | N/A |
| Rethinking the Multimodal Correlation of Multimodal Sequential Learning via Generalizable Attentional Results Alignment | Tao Jin, Wang Lin, Ye Wang, Linjun Li, Xize Cheng, Zhou Zhao | N/A | N/A |
| UHGEval: Benchmarking the Hallucination of Chinese Large Language Models via Unconstrained Generation | Xun Liang, Shichao Song, Simin Niu, Zhiyu Li, Feiyu Xiong, Bo Tang, Yezhaohui Wang, Dawei He, Cheng Peng, Zhonghao Wang, Haiying Deng | N/A | N/A |
| PreFLMR: Scaling Up Fine-Grained Late-Interaction Multi-modal Retrievers | Weizhe Lin, Jingbiao Mei, Jinghong Chen, Bill Byrne | N/A | N/A |
| Triple-Encoders: Representations That Fire Together, Wire Together | Justus-Jonas Erker, Florian Mai, Nils Reimers, Gerasimos Spanakis, Iryna Gurevych | N/A | N/A |
| Improving Hateful Meme Detection through Retrieval-Guided Contrastive Learning | Jingbiao Mei, Jinghong Chen, Weizhe Lin, Bill Byrne, Marcus Tomalin | N/A | N/A |
| Agent-Pro: Learning to Evolve via Policy-Level Reflection and Optimization | Wenqi Zhang, Ke Tang, Hai Wu, Mengna Wang, Yongliang Shen, Guiyang Hou, Zeqi Tan, Peng Li, Yueting Zhuang, Weiming Lu | N/A | N/A |
| Tree-Averaging Algorithms for Ensemble-Based Unsupervised Discontinuous Constituency Parsing | Behzad Shayegh, Yuqiao Wen, Lili Mou | N/A | N/A |
| Your Transformer is Secretly Linear | Anton Razzhigaev, Matvey Mikhalchuk, Elizaveta Goncharova, Nikolai Gerasimenko, Ivan Oseledets, Denis Dimitrov, Andrey Kuznetsov | N/A | N/A |
| Noise Correction on Subjective Datasets | Uthman Jinadu, Yi Ding | N/A | N/A |
| Generative Explore-Exploit: Training-free Optimization of Generative Recommender Systems using LLM Optimizers | Lütfi Kerem Senel, Besnik Fetahu, Davis Yoshida, Zhiyu Chen, Giuseppe Castellucci, Nikhita Vedula, Jason Ingyu Choi, Shervin Malmasi | N/A | N/A |
| Instruction-tuned Language Models are Better Knowledge Learners | Zhengbao Jiang, Zhiqing Sun, Weijia Shi, Pedro Rodriguez, Chunting Zhou, Graham Neubig, Xi Victoria Lin, Wen-tau Yih, Srini Iyer | N/A | N/A |
| What Do Language Models Hear? Probing for Auditory Representations in Language Models | Jerry Ngo, Yoon Kim | N/A | N/A |
| Threads of Subtlety: Detecting Machine-Generated Texts Through Discourse Motifs | Zae Myung Kim, Kwang Hee Lee, Preston Zhu, Vipul Raheja, Dongyeop Kang | N/A | N/A |
| Jailbreak Open-Sourced Large Language Models via Enforced Decoding | Hangfan Zhang, Zhimeng Guo, Huaisheng Zhu, Bochuan Cao, Lu Lin, Jinyuan Jia, Jinghui Chen, Dinghao Wu | N/A | N/A |
| NICE: To Optimize In-Context Examples or Not? | Pragya Srivastava, Satvik Golechha, Amit Deshpande, Amit Sharma | N/A | N/A |
| CodeScope: An Execution-based Multilingual Multitask Multidimensional Benchmark for Evaluating LLMs on Code Understanding and Generation | Weixiang Yan, Haitian Liu, Yunkun Wang, Yunzhe Li, Qian Chen, Wen Wang, Tingyu Lin, Weishan Zhao, Li Zhu, Hari Sundaram, Shuiguang Deng | N/A | N/A |
| Digital Socrates: Evaluating LLMs through Explanation Critiques | Yuling Gu, Oyvind Tafjord, Peter Clark | N/A | N/A |
| SafeDecoding: Defending against Jailbreak Attacks via Safety-Aware Decoding | Zhangchen Xu, Fengqing Jiang, Luyao Niu, Jinyuan Jia, Bill Yuchen Lin, Radha Poovendran | N/A | N/A |
| ArtPrompt: ASCII Art-based Jailbreak Attacks against Aligned LLMs | Fengqing Jiang, Zhangchen Xu, Luyao Niu, Zhen Xiang, Bhaskar Ramasubramanian, Bo Li, Radha Poovendran | N/A | N/A |
| Multi-Task Inference: Can Large Language Models Follow Multiple Instructions at Once? | Guijin Son, SangWon Baek, Sangdae Nam, Ilgyun Jeong, Seungone Kim | N/A | N/A |
| ChatDev: Communicative Agents for Software Development | Chen Qian, Wei Liu, Hongzhang Liu, Nuo Chen, Yufan Dang, Jiahao Li, Cheng Yang, Weize Chen, Yusheng Su, Xin Cong, Juyuan Xu, Dahai Li, Zhiyuan Liu, Maosong Sun | N/A | N/A |
| Experiential Co-Learning of Software-Developing Agents | Chen Qian, Yufan Dang, Jiahao Li, Wei Liu, Zihao Xie, YiFei Wang, Weize Chen, Cheng Yang, Xin Cong, Xiaoyin Che, Zhiyuan Liu, Maosong Sun | N/A | N/A |
| Learning Geometry-Aware Representations for New Intent Discovery | Kai Tang, Junbo Zhao, Xiao Ding, Runze Wu, Lei Feng, Gang Chen, Haobo Wang | N/A | N/A |
| Speaker Verification in Agent-generated Conversations | Yizhe Yang, Palakorn Achananuparp, Heyan Huang, Jing Jiang, Ee-Peng Lim | N/A | N/A |
| Benchmarking Data Science Agents | Yuge Zhang, Qiyang Jiang, Xingyu Han, Nan Chen, Yuqing Yang, Kan Ren | N/A | N/A |
| Language-Specific Neurons: The Key to Multilingual Capabilities in Large Language Models | Tianyi Tang, Wenyang Luo, Haoyang Huang, Dongdong Zhang, Xiaolei Wang, Xin Zhao, Furu Wei, Ji-Rong Wen | N/A | N/A |
| Forgetting before Learning: Utilizing Parametric Arithmetic for Knowledge Updating in Large Language Models | Shiwen Ni, Dingwei Chen, Chengming Li, Xiping Hu, Ruifeng Xu, Min Yang | N/A | N/A |
| A Deep Dive into the Trade-Offs of Parameter-Efficient Preference Alignment Techniques | Megh Thakkar, Quentin Fournier, Matthew D Riemer, Pin-Yu Chen, Amal Zouaq, Payel Das, Sarath Chandar | N/A | N/A |
| Zero-Shot Cross-Domain Dialogue State Tracking via Dual Low-Rank Adaptation | Xiang Luo, Zhiwen Tang, Jin Wang, Xuejie Zhang | N/A | N/A |
| PRP-Graph: Pairwise Ranking Prompting to LLMs with Graph Aggregation for Effective Text Re-ranking | Jian Luo, Xuanang Chen, Ben He, Le Sun | N/A | N/A |
| RepCodec: A Speech Representation Codec for Speech Tokenization | Zhichao Huang, Chutong Meng, Tom Ko | N/A | N/A |
| Disentangled Learning with Synthetic Parallel Data for Text Style Transfer | Jingxuan Han, Quan Wang, Zikang Guo, Benfeng Xu, Licheng Zhang, Zhendong Mao | N/A | N/A |
| GumbelSoft: Diversified Language Model Watermarking via the GumbelMax-trick | Jiayi Fu, Xuandong Zhao, Ruihan Yang, Yuansen Zhang, Jiangjie Chen, Yanghua Xiao | N/A | N/A |
| PsySafe: A Comprehensive Framework for Psychological-based Attack, Defense, and Evaluation of Multi-agent System Safety | Zaibin Zhang, Yongting Zhang, Lijun Li, Jing Shao, Hongzhi Gao, Yu Qiao, Lijun Wang, Huchuan Lu, Feng Zhao | N/A | N/A |
| Event-Radar: Event-driven Multi-View Learning for Multimodal Fake News Detection | Zihan Ma, Minnan Luo, Hao Guo, Zhi Zeng, Yiran Hao, Xiang Zhao | N/A | N/A |
| Fine-Grained Modeling of Narrative Context: A Coherence Perspective via Retrospective Questions | Liyan Xu, Jiangnan Li, Mo Yu, Jie Zhou | N/A | N/A |
| Stealthy Attack on Large Language Model based Recommendation | Jinghao Zhang, Yuting Liu, Qiang Liu, Shu Wu, Guibing Guo, Liang Wang | N/A | N/A |
| Multi-Dimensional Optimization for Text Summarization via Reinforcement Learning | Sangwon Ryu, Heejin Do, Yunsu Kim, Gary Lee, Jungseul Ok | N/A | N/A |
| Masked Thought: Simply Masking Partial Reasoning Steps Can Improve Mathematical Reasoning Learning of Language Models | Changyu Chen, Xiting Wang, Ting-En Lin, Ang Lv, Yuchuan Wu, Xin Gao, Ji-Rong Wen, Rui Yan, Yongbin Li | N/A | N/A |
| SEER: Facilitating Structured Reasoning and Explanation via Reinforcement Learning | Guoxin Chen, Kexin Tang, Chao Yang, Fuying Ye, Yu Qiao, Yiming Qian | N/A | N/A |
| Towards Robust and Generalized Parameter-Efficient Fine-Tuning for Noisy Label Learning | Yeachan Kim, Junho Kim, SangKeun Lee | N/A | N/A |
| SparseFlow: Accelerating Transformers by Sparsifying Information Flows | Yeachan Kim, SangKeun Lee | N/A | N/A |
| ProtT3: Protein-to-Text Generation for Text-based Protein Understanding | Zhiyuan Liu, An Zhang, Hao Fei, Enzhi Zhang, Xiang Wang, Kenji Kawaguchi, Tat-Seng Chua | N/A | N/A |
| KIEval: A Knowledge-grounded Interactive Evaluation Framework for Large Language Models | Zhuohao Yu, Chang Gao, Wenjin Yao, Yidong Wang, Wei Ye, Jindong Wang, Xing Xie, Yue Zhang, Shikun Zhang | N/A | N/A |
| EmoBench: Evaluating the Emotional Intelligence of Large Language Models | Sahand Sabour, Siyang Liu, Zheyuan Zhang, June M. Liu, Jinfeng Zhou, Alvionna Shiergetya Sunaryo, Tatia M.C. Lee, Rada Mihalcea, Minlie Huang | N/A | N/A |
| Can Large Language Models be Good Emotional Supporter? Mitigating Preference Bias on Emotional Support Conversation | Dongjin Kang, Sunghwan Kim, Taeyoon Kwon, Seungjun Moon, Hyunsouk Cho, Youngjae Yu, Dongha Lee, Jinyoung Yeo | N/A | N/A |
| Are AI-Generated Text Detectors Robust to Adversarial Perturbations? | Guanhua Huang, Yuchen Zhang, Zhe Li, Yongjian You, Mingze Wang, Zhouwang Yang | N/A | N/A |
| FinTextQA: A Dataset for Long-form Financial Question Answering | Jian Chen, Peilin Zhou, Yining Hua, Loh Ying Xin, Kehui Chen, Ziyuan Li, Bing Zhu, Junwei Liang | N/A | N/A |
| On Measuring Faithfulness or Self-consistency of Natural Language Explanations | Letitia Parcalabescu, Anette Frank | N/A | N/A |
| $\infty$Bench: Extending Long Context Evaluation Beyond 100K Tokens | Xinrong Zhang, Yingfa Chen, Shengding Hu, Zihang Xu, Junhao Chen, Moo Khai Hao, Xu Han, Zhen Leng Thai, Shuo Wang, Zhiyuan Liu, Maosong Sun | N/A | N/A |
| Learning or Self-aligning? Rethinking Instruction Fine-tuning | Mengjie Ren, Boxi Cao, Hongyu Lin, Cao Liu, Xianpei Han, Ke Zeng, Wan Guanglu, Xunliang Cai, Le Sun | N/A | N/A |
| Rethinking the Bounds of LLM Reasoning: Are Multi-Agent Discussions the Key? | Qineng Wang, Zihao Wang, Ying Su, Hanghang Tong, Yangqiu Song | N/A | N/A |
| Soft Knowledge Prompt: Help External Knowledge Become a Better Teacher to Instruct LLM in Knowledge-based VQA | Qunbo Wang, Ruyi Ji, Tianhao Peng, Wenjun Wu, Zechao Li, Jing Liu | N/A | N/A |
| TasTe: Teaching Large Language Models to Translate through Self-Reflection | Yutong Wang, Jiali Zeng, Xuebo Liu, Fandong Meng, Jie Zhou, Min Zhang | N/A | N/A |
| Not All Experts are Equal: Efficient Expert Pruning and Skipping for Mixture-of-Experts Large Language Models | Xudong Lu, Qi Liu, Yuhui Xu, Aojun Zhou, Siyuan Huang, Bo Zhang, Junchi Yan, Hongsheng Li | N/A | N/A |
| Natural Language Satisfiability: Exploring the Problem Distribution and Evaluating Transformer-based Language Models | Tharindu Madusanka, Ian Pratt-Hartmann, Riza Batista-Navarro | N/A | N/A |
| UNIMO-G: Unified Image Generation through Multimodal Conditional Diffusion | Wei Li, Xue Xu, Jiachen Liu, Xinyan Xiao | N/A | N/A |
| The Fine-Tuning Paradox: Boosting Translation Quality Without Sacrificing LLM Abilities | David Stap, Eva Hasler, Bill Byrne, Christof Monz, Ke Tran | N/A | N/A |
| Political Compass or Spinning Arrow? Towards More Meaningful Evaluations for Values and Opinions in Large Language Models | Paul Röttger, Valentin Hofmann, Valentina Pyatkin, Musashi Hinck, Hannah Rose Kirk, Hinrich Schuetze, Dirk Hovy | N/A | N/A |
| AI ‘News’ Content Farms Are Easy to Make and Hard to Detect: A Case Study in Italian | Giovanni Puccetti, Anna Rogers, Chiara Alzetta, Felice Dell’Orletta, Andrea Esuli | N/A | N/A |
| Blinded by Generated Contexts: How Language Models Merge Generated and Retrieved Contexts When Knowledge Conflicts? | Hexiang Tan, Fei Sun, Wanli Yang, Yuanzhuo Wang, Qi Cao, Xueqi Cheng | N/A | N/A |
| Unveiling Linguistic Regions in Large Language Models | Zhihao Zhang, Jun Zhao, Qi Zhang, Tao Gui, Xuanjing Huang | N/A | N/A |
| Text-to-Song: Towards Controllable Music Generation Incorporating Vocal and Accompaniment | Zhiqing Hong, Rongjie Huang, Xize Cheng, Yongqi Wang, Ruiqi Li, Fuming You, Zhou Zhao, Zhimeng Zhang | N/A | N/A |
| FastFiD: Improve Inference Efficiency of Open Domain Question Answering via Sentence Selection | Yufei Huang, Xu Han, Maosong Sun | N/A | N/A |
| Discursive Socratic Questioning: Evaluating the Faithfulness of Language Models’ Understanding of Discourse Relations | Yisong Miao, Hongfu Liu, Wenqiang Lei, Nancy F. Chen, Min-Yen Kan | N/A | N/A |
| An Open Multilingual System for Scoring Readability of Wikipedia | Mykola Trokhymovych, Indira Sen, Martin Gerlach | N/A | N/A |
| Unlearning Traces the Influential Training Data of Language Models | Masaru Isonuma, Ivan Titov | N/A | N/A |
| Exploring Alignment in Shared Cross-lingual Spaces | Basel Mousi, Nadir Durrani, Fahim Dalvi, Majd Hawasly, Ahmed Abdelali | N/A | N/A |
| Not All Countries Celebrate Thanksgiving: On the Cultural Dominance in Large Language Models | Wenxuan Wang, Wenxiang Jiao, Jingyuan Huang, Ruyi Dai, Jen-tse Huang, Zhaopeng Tu, Michael Lyu | N/A | N/A |
| Self-Evolving GPT: A Lifelong Autonomous Experiential Learner | Jinglong Gao, Xiao Ding, Yiming Cui, Jianbai Zhao, Hepeng Wang, Ting Liu, Bing Qin | N/A | N/A |
| WRP: Weight Recover Prune for Structured Sparsity | Zhendong Tan, Xingjun Zhang, Zheng Wei | N/A | N/A |
| Error-preserving Automatic Speech Recognition of Young English Learners’ Language | Janick Michot, Manuela Hürlimann, Jan Milan Deriu, Luzia Sauer, Katsiaryna Mlynchyk, Mark Cieliebak | N/A | N/A |
| DiFiNet: Boundary-Aware Semantic Differentiation and Filtration Network for Nested Named Entity Recognition | Yuxiang Cai, Qiao Liu, Yanglei Gan, Run Lin, Changlin Li, Xueyi Liu, Da Luo, Jiaye Yang | N/A | N/A |
| Legal Case Retrieval: A Survey of the State of the Art | Yi Feng, Chuanyi Li, Vincent Ng | N/A | N/A |
| Same Task, More Tokens: the Impact of Input Length on the Reasoning Performance of Large Language Models | Mosh Levy, Alon Jacoby, Yoav Goldberg | N/A | N/A |
| Benchmarking and Improving Compositional Generalization of Multi-aspect Controllable Text Generation | Tianqi Zhong, Zhaoyi Li, Quan Wang, Linqi Song, Ying Wei, Defu Lian, Zhendong Mao | N/A | N/A |
| LLaMA Pro: Progressive LLaMA with Block Expansion | Chengyue Wu, Yukang Gan, Yixiao Ge, Zeyu Lu, Jiahao Wang, Ye Feng, Ying Shan, Ping Luo | N/A | N/A |
| Generating Contrastive Narratives Using the Brownian Bridge Process for Narrative Coherence Learning | Feiteng Mu, Wenjie Li | N/A | N/A |
| A Causal Approach for Counterfactual Reasoning in Narratives | Feiteng Mu, Wenjie Li | N/A | N/A |
| SIP: Injecting a Structural Inductive Bias into a Seq2Seq Model by Simulation | Matthias Lindemann, Alexander Koller, Ivan Titov | N/A | N/A |
| The Hidden Space of Transformer Language Adapters | Jesujoba Oluwadara Alabi, Marius Mosbach, Matan Eyal, Dietrich Klakow, Mor Geva | N/A | N/A |
| A Ship of Theseus: Curious Cases of Paraphrasing in LLM-Generated Texts | Nafis Irtiza Tripto, Saranya Venkatraman, Dominik Macko, Robert Moro, Ivan Srba, Adaku Uchendu, Thai Le, Dongwon Lee | N/A | N/A |
| Advancing Large Language Models to Capture Varied Speaking Styles and Respond Properly in Spoken Conversations | Guan-Ting Lin, Cheng-Han Chiang, Hung-yi Lee | N/A | N/A |
| RetinaQA: A Robust Knowledge Base Question Answering Model for both Answerable and Unanswerable Questions | Prayushi Faldu, Indrajit Bhattacharya, Mausam | N/A | N/A |
| GroundingGPT: Language Enhanced Multi-modal Grounding Model | Zhaowei Li, Xu Qi, Dong Zhang, Hang Song, Yiqing Cai, Qi Qi, Ran Zhou, Junting Pan, Zefeng Li, Vu Van Tu, Zhida Huang, Tao Wang | N/A | N/A |
| Automated Justification Production for Claim Veracity in Fact Checking: A Survey on Architectures and Approaches | Islam Eldifrawi, Shengrui Wang, Amine Trabelsi | N/A | N/A |
| Decoupled Vocabulary Learning Enables Zero-Shot Translation from Unseen Languages | Carlos Mullov, Quan Pham, Alexander Waibel | N/A | N/A |
| SwapMoE: Serving Off-the-shelf MoE-based Large Language Models with Tunable Memory Budget | Rui Kong, Yuanchun Li, Qingtian Feng, Weijun Wang, Xiaozhou Ye, Ye Ouyang, Linghe Kong, Yunxin Liu | N/A | N/A |
| PixT3: Pixel-based Table-To-Text Generation | Iñigo Alonso, Eneko Agirre, Mirella Lapata | N/A | N/A |
| Narrowing the Knowledge Evaluation Gap: Open-Domain Question Answering with Multi-Granularity Answers | Gal Yona, Roee Aharoni, Mor Geva | N/A | N/A |
| TAMS: Translation-Assisted Morphological Segmentation | Enora Rice, Ali Marashian, Luke Gessler, Alexis Palmer, Katharina von der Wense | N/A | N/A |
| Disambiguate Words like Composing Them: A Morphology-Informed Approach to Enhance Chinese Word Sense Disambiguation | Yue Wang, Qiliang Liang, Yaqi Yin, Hansi Wang, Yang Liu | N/A | N/A |
| XCodeEval: An Execution-based Large Scale Multilingual Multitask Benchmark for Code Understanding, Generation, Translation and Retrieval | Mohammad Abdullah Matin Khan, M Saiful Bari, Do Xuan Long, Weishi Wang, Md Rizwan Parvez, Shafiq Joty | N/A | N/A |
| ProxyQA: An Alternative Framework for Evaluating Long-Form Text Generation with Large Language Models | Haochen Tan, Zhijiang Guo, Zhan Shi, Lu Xu, Zhili Liu, Yunlong Feng, Xiaoguang Li, Yasheng Wang, Lifeng Shang, Qun Liu, Linqi Song | N/A | N/A |
| A Glitch in the Matrix? Locating and Detecting Language Model Grounding with Fakepedia | Giovanni Monea, Maxime Peyrard, Martin Josifoski, Vishrav Chaudhary, Jason Eisner, Emre Kiciman, Hamid Palangi, Barun Patra, Robert West | N/A | N/A |
| Muffin or Chihuahua? Challenging Multimodal Large Language Models with Multipanel VQA | Yue Fan, Jing Gu, Kaiwen Zhou, Qianqi Yan, Shan Jiang, Ching-Chen Kuo, Yang Zhao, Xinze Guan, Xin Eric Wang | N/A | N/A |
| WebVoyager: Building an End-to-End Web Agent with Large Multimodal Models | Hongliang He, Wenlin Yao, Kaixin Ma, Wenhao Yu, Yong Dai, Hongming Zhang, Zhenzhong Lan, Dong Yu | N/A | N/A |
| Translation-based Lexicalization Generation and Lexical Gap Detection: Application to Kinship Terms | Senyu Li, Bradley Hauer, Ning Shi, Grzegorz Kondrak | N/A | N/A |
| Leveraging Machine-Generated Rationales to Facilitate Social Meaning Detection in Conversations | Ritam Dutt, Zhen Wu, Jiaxin Shi, Divyanshu Sheth, Prakhar Gupta, Carolyn Rose | N/A | N/A |
| Robust Frame-Semantic Models with Lexical Unit Trees and Negative Samples | Jacob Devasier, Yogesh Gurjar, Chengkai Li | N/A | N/A |
| Do Llamas Work in English? On the Latent Language of Multilingual Transformers | Chris Wendler, Veniamin Veselovsky, Giovanni Monea, Robert West | N/A | N/A |
| Harnessing the Power of Large Language Models for Natural Language to First-Order Logic Translation | Yuan Yang, Siheng Xiong, Ali Payani, Ehsan Shareghi, Faramarz Fekri | N/A | N/A |
| Lightweight reranking for language model generations | Siddhartha Jain, Xiaofei Ma, Anoop Deoras, Bing Xiang | N/A | N/A |
| ARIES: A Corpus of Scientific Paper Edits Made in Response to Peer Reviews | Mike D’Arcy, Alexis Ross, Erin Bransom, Bailey Kuehl, Jonathan Bragg, Tom Hope, Doug Downey | N/A | N/A |
| The Unreasonable Effectiveness of Easy Training Data for Hard Tasks | Peter Hase, Mohit Bansal, Peter Clark, Sarah Wiegreffe | N/A | N/A |
| PLUG: Leveraging Pivot Language in Cross-Lingual Instruction Tuning | Zhihan Zhang, Dong-Ho Lee, Yuwei Fang, Wenhao Yu, Mengzhao Jia, Meng Jiang, Francesco Barbieri | N/A | N/A |
| MIDGARD: Self-Consistency Using Minimum Description Length for Structured Commonsense Reasoning | Inderjeet Jayakumar Nair, Lu Wang | N/A | N/A |
| ReConcile: Round-Table Conference Improves Reasoning via Consensus among Diverse LLMs | Justin Chen, Swarnadeep Saha, Mohit Bansal | N/A | N/A |
| Mirror: Multiple-perspective Self-Reflection Method for Knowledge-rich Reasoning | Hanqi Yan, Qinglin Zhu, Xinyu Wang, Lin Gui, Yulan He | N/A | N/A |
| Where Do People Tell Stories Online? Story Detection Across Online Communities | Maria Antoniak, Joel Mire, Maarten Sap, Elliott Ash, Andrew Piper | N/A | N/A |
| Large Language Models Are No Longer Shallow Parsers | Yuanhe Tian, Fei Xia, Yan Song | N/A | N/A |
| Dialogue Summarization with Mixture of Experts based on Large Language Models | Yuanhe Tian, Fei Xia, Yan Song | N/A | N/A |
| ChiMed-GPT: A Chinese Medical Large Language Model with Full Training Regime and Better Alignment to Human Preferences | Yuanhe Tian, Ruyi Gan, Yan Song, Jiaxing Zhang, Yongdong Zhang | N/A | N/A |
| An Investigation of Neuron Activation as a Unified Lens to Explain Chain-of-Thought Eliciting Arithmetic Reasoning of LLMs | Daking Rai, Ziyu Yao | N/A | N/A |
| Leveraging Large Language Models for Learning Complex Legal Concepts through Storytelling | Hang Jiang, Xiajie Zhang, Robert Mahari, Daniel Kessler, Eric Ma, Tal August, Irene Li, Alex Pentland, Yoon Kim, Deb Roy, Jad Kabbara | N/A | N/A |
| Intrinsic Task-based Evaluation for Referring Expression Generation | Guanyi Chen, Fahime Same, Kees Van Deemter | N/A | N/A |
| From Moments to Milestones: Incremental Timeline Summarization Leveraging Large Language Models | Qisheng Hu, Geonsik Moon, Hwee Tou Ng | N/A | N/A |
| End-to-end Learning of Logical Rules for Enhancing Document-level Relation Extraction | Kunxun Qi, Jianfeng Du, Hai Wan | N/A | N/A |
| Can We Achieve High-quality Direct Speech-to-Speech Translation without Parallel Speech Data? | Qingkai Fang, Shaolei Zhang, Zhengrui Ma, Min Zhang, Yang Feng | N/A | N/A |
| Enhancing EEG-to-Text Decoding through Transferable Representations from Pre-trained Contrastive EEG-Text Masked Autoencoder | Jiaqi Wang, Zhenxi Song, Zhengyu Ma, Xipeng Qiu, Min Zhang, Zhiguo Zhang | N/A | N/A |
| G-DIG: Towards Gradient-based DIverse and hiGh-quality Instruction Data Selection for Machine Translation | Xingyuan Pan, Luyang Huang, Liyan Kang, Zhicheng Liu, Yu Lu, Shanbo Cheng | N/A | N/A |
| CQIL: Inference Latency Optimization with Concurrent Computation of Quasi-Independent Layers | Longwei Zou, Qingyang Wang, Han Zhao, Jiangang Kong, Yi Yang, Yangdong Deng | N/A | N/A |
| Prompt Optimization via Adversarial In-Context Learning | Do Xuan Long, Yiran Zhao, Hannah Brown, Yuxi Xie, James Xu Zhao, Nancy F. Chen, Kenji Kawaguchi, Michael Shieh, Junxian He | N/A | N/A |
| StreamVoice: Streamable Context-Aware Language Modeling for Real-time Zero-Shot Voice Conversion | Zhichao Wang, Yuanzhe Chen, Xinsheng Wang, Lei Xie, Yuping Wang | N/A | N/A |
| Generate-then-Ground in Retrieval-Augmented Generation for Multi-hop Question Answering | Zhengliang Shi, Shuo Zhang, Weiwei Sun, Shen Gao, Pengjie Ren, Zhumin Chen, Zhaochun Ren | N/A | N/A |
| Multimodal Contextualized Semantic Parsing from Speech | Jordan Voas, David Harwath, Ray Mooney | N/A | N/A |
| LaMP: When Large Language Models Meet Personalization | Alireza Salemi, Sheshera Mysore, Michael Bendersky, Hamed Zamani | N/A | N/A |
| AboutMe: Using Self-Descriptions in Webpages to Document the Effects of English Pretraining Data Filters | Li Lucy, Suchin Gururangan, Luca Soldaini, Emma Strubell, David Bamman, Lauren Klein, Jesse Dodge | N/A | N/A |
| MT-Bench-101: A Fine-Grained Benchmark for Evaluating Large Language Models in Multi-Turn Dialogues | Ge Bai, Jie Liu, Xingyuan Bu, Yancheng He, Jiaheng Liu, Zhanhui Zhou, Zhuoran Lin, Wenbo Su, Tiezheng Ge, Bo Zheng, Wanli Ouyang | N/A | N/A |
| EFSA: Towards Event-Level Financial Sentiment Analysis | Tianyu Chen, Yiming Zhang, Guoxin Yu, Dapeng Zhang, Li Zeng, Qing He, Xiang Ao | N/A | N/A |
| Media Framing: A typology and Survey of Computational Approaches Across Disciplines | Yulia Otmakhova, Shima Khanehzar, Lea Frermann | N/A | N/A |
| What Evidence Do Language Models Find Convincing? | Alexander Wan, Eric Wallace, Dan Klein | N/A | N/A |
| Advancement in Graph Understanding: A Multimodal Benchmark and Fine-Tuning of Vision-Language Models | Qihang Ai, Jiafan Li, Jincheng Dai, Jianwu Zhou, Lemao Liu, Haiyun Jiang, Shuming Shi | N/A | N/A |
| LangBridge: Multilingual Reasoning Without Multilingual Supervision | Dongkeun Yoon, Joel Jang, Sungdong Kim, Seungone Kim, Sheikh Shafayat, Minjoon Seo | N/A | N/A |
| Can LLMs Reason with Rules? Logic Scaffolding for Stress-Testing and Improving LLMs | Siyuan Wang, Zhongyu Wei, Yejin Choi, Xiang Ren | N/A | N/A |
| SEGO: Sequential Subgoal Optimization for Mathematical Problem-Solving | Xueliang Zhao, Xinting Huang, Wei Bi, Lingpeng Kong | N/A | N/A |
| Unlocking the Power of Large Language Models for Entity Alignment | Xuhui Jiang, Yinghan Shen, Zhichao Shi, Chengjin Xu, Wei Li, Zixuan Li, Jian Guo, Huawei Shen, Yuanzhuo Wang | N/A | N/A |
| SPZ: A Semantic Perturbation-based Data Augmentation Method with Zonal-Mixing for Alzheimer’s Disease Detection | Fangfang Li, Cheng Huang, Puzhen Su, Jie Yin | N/A | N/A |
| Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents | Yifan Song, Da Yin, Xiang Yue, Jie Huang, Sujian Li, Bill Yuchen Lin | N/A | N/A |
| ReFT: Reasoning with Reinforced Fine-Tuning | Luong Quoc Trung, Xinbo Zhang, Zhanming Jie, Peng Sun, Xiaoran Jin, Hang Li | N/A | N/A |
| Cognitive Visual-Language Mapper: Advancing Multimodal Comprehension with Enhanced Visual Knowledge Alignment | Yunxin Li, Xinyu Chen, Baotian Hu, Haoyuan Shi, Min Zhang | N/A | N/A |
| FreeCtrl: Constructing Control Centers with Feedforward Layers for Learning-Free Controllable Text Generation | Zijian Feng, Hanzhang Zhou, Kezhi Mao, Zixiao Zhu | N/A | N/A |
| HD-Eval: Aligning Large Language Model Evaluators Through Hierarchical Criteria Decomposition | Yuxuan Liu, Tianchi Yang, Shaohan Huang, Zihan Zhang, Haizhen Huang, Furu Wei, Weiwei Deng, Feng Sun, Qi Zhang | N/A | N/A |
| Conundrums in Cross-Prompt Automated Essay Scoring: Making Sense of the State of the Art | Shengjie Li, Vincent Ng | N/A | N/A |
| Angry Men, Sad Women: Large Language Models Reflect Gendered Stereotypes in Emotion Attribution | Flor Miriam Plaza-del-Arco, Amanda Cercas Curry, Alba Cercas Curry, Gavin Abercrombie, Dirk Hovy | N/A | N/A |
| Label Augmentation for Zero-Shot Hierarchical Text Classification | Lorenzo Paletto, Valerio Basile, Roberto Esposito | N/A | N/A |
| STICKERCONV: Generating Multimodal Empathetic Responses from Scratch | Yiqun Zhang, Fanheng Kong, Peidong Wang, Shuang Sun, SWangLing, Shi Feng, Daling Wang, Yifei Zhang, Kaisong Song | N/A | N/A |
| EIT: Enhanced Interactive Transformer | Tong Zheng, Bei Li, Huiwen Bao, Tong Xiao, Jingbo Zhu | N/A | N/A |
| MARS: Meaning-Aware Response Scoring for Uncertainty Estimation in Generative LLMs | Yavuz Faruk Bakman, Duygu Nur Yaldiz, Baturalp Buyukates, Chenyang Tao, Dimitrios Dimitriadis, Salman Avestimehr | N/A | N/A |
| EXAMS-V: A Multi-Discipline Multilingual Multimodal Exam Benchmark for Evaluating Vision Language Models | Rocktim Jyoti Das, Simeon Emilov Hristov, Haonan Li, Dimitar Iliyanov Dimitrov, Ivan Koychev, Preslav Nakov | N/A | N/A |
| Order-Agnostic Data Augmentation for Few-Shot Named Entity Recognition | Huiming Wang, Liying Cheng, Wenxuan Zhang, De Wen Soh, Lidong Bing | N/A | N/A |
| Text Embedding Inversion Security for Multilingual Language Models | Yiyi Chen, Heather Lent, Johannes Bjerva | N/A | N/A |
| Large Language Models are Superpositions of All Characters: Attaining Arbitrary Role-play via Self-Alignment | Keming Lu, Bowen Yu, Chang Zhou, Jingren Zhou | N/A | N/A |
| Calibrating Large Language Models Using Their Generations Only | Dennis Thomas Ulmer, Martin Gubri, Hwaran Lee, Sangdoo Yun, Seong Joon Oh | N/A | N/A |
| PlatoLM: Teaching LLMs in Multi-Round Dialogue via a User Simulator | Chuyi Kong, Yaxin Fan, Xiang Wan, Feng Jiang, Benyou Wang | N/A | N/A |
| Synthesizing Text-to-SQL Data from Weak and Strong LLMs | Jiaxi Yang, Binyuan Hui, Min Yang, Jian Yang, Junyang Lin, Chang Zhou | N/A | N/A |
| Iterative Forward Tuning Boosts In-Context Learning in Language Models | Jiaxi Yang, Binyuan Hui, Min Yang, Bailin Wang, Bowen Li, Binhua Li, Fei Huang, Yongbin Li | N/A | N/A |
| STRUCTSUM Generation for Faster Text Comprehension | Parag Jain, Andreea Marzoca, Francesco Piccinno | N/A | N/A |
| Analysing The Impact of Sequence Composition on Language Model Pre-Training | Yu Zhao, Yuanbin Qu, Konrad Staniszewski, Szymon Tworkowski, Wei Liu, Piotr Miłoś, Yuxiang Wu, Pasquale Minervini | N/A | N/A |
| NACL: A General and Effective KV Cache Eviction Framework for LLM at Inference Time | Yilong Chen, Guoxia Wang, Junyuan Shang, Shiyao Cui, Zhenyu Zhang, Tingwen Liu, Shuohuan Wang, Yu Sun, Dianhai Yu, Hua Wu | N/A | N/A |
| SpikeVoice: High-Quality Text-to-Speech Via Efficient Spiking Neural Network | Kexin Wang, Jiahong Zhang, Yong Ren, Man Yao, Di Shang, Bo Xu, Guoqi Li | N/A | N/A |
| Context-aware Difference Distilling for Multi-change Captioning | Yunbin Tu, Liang Li, Li Su, Zheng-Jun Zha, Chenggang Yan, Qingming Huang | N/A | N/A |
| Dataflow-Guided Retrieval Augmentation for Repository-Level Code Completion | Wei Cheng, Yuhan Wu, Wei Hu | N/A | N/A |
| Chain-of-Exemplar: Enhancing Distractor Generation for Multimodal Educational Question Generation | Haohao Luo, Yang Deng, Ying Shen, See-Kiong Ng, Tat-Seng Chua | N/A | N/A |
| LLMEmbed: Rethinking Lightweight LLM’s Genuine Function in Text Classification | Chun Liu, Hongguang Zhang, Kainan Zhao, Xinghai Ju, Lin Yang | N/A | N/A |
| LEMON: Reviving Stronger and Smaller LMs from Larger LMs with Linear Parameter Fusion | Yilong Chen, Junyuan Shang, Zhenyu Zhang, Shiyao Cui, Tingwen Liu, Shuohuan Wang, Yu Sun, Hua Wu | N/A | N/A |
| Speech Sense Disambiguation: Tackling Homophone Ambiguity in End-to-End Speech Translation | Tengfei Yu, Xuebo Liu, Liang Ding, Kehai Chen, Dacheng Tao, Min Zhang | N/A | N/A |
| To be Continuous, or to be Discrete, Those are Bits of Questions | Yiran Wang, Masao Utiyama | N/A | N/A |
| Moûsai: Efficient Text-to-Music Diffusion Models | Flavio Schneider, Ojasv Kamal, Zhijing Jin, Bernhard Schölkopf | N/A | N/A |
| PokeMQA: Programmable knowledge editing for Multi-hop Question Answering | Hengrui Gu, Kaixiong Zhou, Xiaotian Han, Ninghao Liu, Ruobing Wang, Xin Wang | N/A | N/A |
| MemeGuard: An LLM and VLM-based Framework for Advancing Content Moderation via Meme Intervention | Prince Jha, Raghav Jain, Konika Mandal, Aman Chadha, Sriparna Saha, Pushpak Bhattacharyya | N/A | N/A |
| Efficient OCR for Building a Diverse Digital History | Jacob Carlson, Tom Bryan, Melissa Dell | N/A | N/A |
| Acquiring Clean Language Models from Backdoor Poisoned Datasets by Downscaling Frequency Space | Zongru Wu, Zhuosheng Zhang, Pengzhou Cheng, Gongshen Liu | N/A | N/A |
| ANAH: Analytical Annotation of Hallucinations in Large Language Models | Ziwei Ji, Yuzhe Gu, Wenwei Zhang, Chengqi Lyu, Dahua Lin, Kai Chen | N/A | N/A |
| Aligning Large Language Models for Controllable Recommendations | Wensheng Lu, Jianxun Lian, Wei Zhang, Guanghua Li, Mingyang Zhou, Hao Liao, Xing Xie | N/A | N/A |
| Revealing the Parametric Knowledge of Language Models: A Unified Framework for Attribution Methods | Haeun Yu, Pepa Atanasova, Isabelle Augenstein | N/A | N/A |
| Pride and Prejudice: LLM Amplifies Self-Bias in Self-Refinement | Wenda Xu, Guanglei Zhu, Xuandong Zhao, Liangming Pan, Lei Li, William Yang Wang | N/A | N/A |
| Full Parameter Fine-tuning for Large Language Models with Limited Resources | Kai Lv, Yuqing Yang, Tengxiao Liu, Qipeng Guo, Xipeng Qiu | N/A | N/A |
| M$^3$CoT: A Novel Benchmark for Multi-Domain Multi-step Multi-modal Chain-of-Thought | Qiguang Chen, Libo Qin, Jin Zhang, Zhi Chen, Xiao Xu, Wanxiang Che | N/A | N/A |
| Long Context is Not Long at All: A Prospector of Long-Dependency Data for Large Language Models | Longze Chen, Ziqiang Liu, Wanwei He, Yinhe Zheng, Hao Sun, Yunshui Li, Run Luo, Min Yang | N/A | N/A |
| Label-Synchronous Neural Transducer for E2E Simultaneous Speech Translation | Keqi Deng, Phil Woodland | N/A | N/A |
| Hard Prompts Made Interpretable: Sparse Entropy Regularization for Prompt Tuning with RL | Yunseon Choi, Sangmin Bae, Seonghyun Ban, Minchan Jeong, Chuheng Zhang, Lei Song, Li Zhao, Jiang Bian, Kee-Eung Kim | N/A | N/A |
| A Modular Approach for Multimodal Summarization of TV Shows | Louis Mahon, Mirella Lapata | N/A | N/A |
| Think Twice: Perspective-Taking Improves Large Language Models’ Theory-of-Mind Capabilities | Alex Wilf, Sihyun Shawn Lee, Paul Pu Liang, Louis-Philippe Morency | N/A | N/A |
| BizBench: A Quantitative Reasoning Benchmark for Business and Finance | Michael Krumdick, Rik Koncel-Kedziorski, Viet Dac Lai, Varshini Reddy, Charles Lovering, Chris Tanner | N/A | N/A |
| Direct Metric Optimization for Image Captioning through Reward-Weighted Augmented Data Utilization | Takumi Takada, Yuma Suzuki, Hiroki Takushima, Hayato Tanoue, Haruki Sato, Aiswariya Manoj Kumar, Hiroki Nishihara, Takayuki Hori, Kazuya Ueki | N/A | N/A |
| Deciphering Hate: Identifying Hateful Memes and Their Targets | Eftekhar Hossain, Omar Sharif, Mohammed Moshiul Hoque, Sarah Masud Preum | N/A | N/A |
| Inducing Systematicity in Transformers by Attending to Structurally Quantized Embeddings | Yichen Jiang, Xiang Zhou, Mohit Bansal | N/A | N/A |
| Label-Efficient Model Selection for Text Generation | Shir Ashury Tahan, Ariel Gera, Benjamin Sznajder, Leshem Choshen, Liat Ein-Dor, Eyal Shnarch | N/A | N/A |
| Machine Unlearning of Pre-trained Large Language Models | Jin Yao, Eli Chien, Minxin Du, Xinyao Niu, Tianhao Wang, Zezhou Cheng, Xiang Yue | N/A | N/A |
| Competition of Mechanisms: Tracing How Language Models Handle Facts and Counterfactuals | Francesco Ortu, Zhijing Jin, Diego Doimo, Mrinmaya Sachan, Alberto Cazzaniga, Bernhard Schölkopf | N/A | N/A |
| FactPICO: Factuality Evaluation for Plain Language Summarization of Medical Evidence | Sebastian Antony Joseph, Lily Chen, Jan Trienes, Hannah Louisa Göke, Monika Coers, Wei Xu, Byron C Wallace, Junyi Jessy Li | N/A | N/A |
| BvSP: Broad-view Soft Prompting for Few-Shot Aspect Sentiment Quad Prediction | Yinhao Bai, Yalan Xie, Xiaoyi Liu, Yuhua Zhao, Zhixin Han, Mengting Hu, Hang Gao, Renhong Cheng | N/A | N/A |
| Safety Alignment in NLP Tasks: Weakly Aligned Summarization as an In-Context Attack | Yu Fu, Yufei Li, Wen Xiao, Cong Liu, Yue Dong | N/A | N/A |
| Language Complexity and Speech Recognition Accuracy: Orthographic Complexity Hurts, Phonological Complexity Doesn’t | Chihiro Taguchi, David Chiang | N/A | N/A |
| Speech language models lack important brain-relevant semantics | Subba Reddy Oota, Emin Çelik, Fatma Deniz, Mariya Toneva | N/A | N/A |
| DocLLM: A Layout-Aware Generative Language Model for Multimodal Document Understanding | Dongsheng Wang, Natraj Raman, Mathieu Sibue, Zhiqiang Ma, Petr Babkin, Simerjot Kaur, Yulong Pei, Armineh Nourbakhsh, Xiaomo Liu | N/A | N/A |
| Bypassing LLM Watermarks with Color-Aware Substitutions | Qilong Wu, Varun Chandrasekaran | N/A | N/A |
| Parallel Structures in Pre-training Data Yield In-Context Learning | Yanda Chen, Chen Zhao, Zhou Yu, Kathleen McKeown, He He | N/A | N/A |
| OpenToM: A Comprehensive Benchmark for Evaluating Theory-of-Mind Reasoning Capabilities of Large Language Models | Hainiu Xu, Runcong Zhao, Lixing Zhu, Jinhua Du, Yulan He | N/A | N/A |
| Towards Privacy-Aware Sign Language Translation at Scale | Phillip Rust, Bowen Shi, Skyler Wang, Necati Cihan Camgoz, Jean Maillard | N/A | N/A |
| Arithmetic Control of LLMs for Diverse User Preferences: Directional Preference Alignment with Multi-Objective Rewards | Haoxiang Wang, Yong Lin, Wei Xiong, Rui Yang, Shizhe Diao, Shuang Qiu, Han Zhao, Tong Zhang | N/A | N/A |
| Towards Real-World Writing Assistance: A Chinese Character Checking Benchmark with Faked and Misspelled Characters | Yinghui Li, Zishan Xu, Shaoshen Chen, Haojing Huang, Yangning Li, Shirong Ma, Yong Jiang, Zhongli Li, Qingyu Zhou, Hai-Tao Zheng, Ying Shen | N/A | N/A |
| Steering Llama 2 via Contrastive Activation Addition | Nina Rimsky, Nick Gabrieli, Julian Schulz, Meg Tong, Evan J Hubinger, Alexander Matt Turner | N/A | N/A |
| RAVEL: Evaluating Interpretability Methods on Disentangling Language Model Representations | Jing Huang, Zhengxuan Wu, Christopher Potts, Mor Geva, Atticus Geiger | N/A | N/A |
| Large Language Models as Zero-shot Dialogue State Tracker through Function Calling | Zekun Li, Zhiyu Chen, Mike Ross, Patrick Huber, Seungwhan Moon, Zhaojiang Lin, Xin Luna Dong, Adithya Sagar, Xifeng Yan, Paul A. Crook | N/A | N/A |
| Faithful Chart Summarization with ChaTS-Pi | Syrine Krichene, Francesco Piccinno, Fangyu Liu, Julian Martin Eisenschlos | N/A | N/A |
| Enhancing Dialogue State Tracking Models through LLM-backed User-Agents Simulation | Cheng Niu, Xingguang Wang, Xuxin Cheng, Juntong Song, Tong Zhang | N/A | N/A |
| MetaSumPerceiver: Multimodal Multi-Document Evidence Summarization for Fact-Checking | Ting-Chih Chen, Chia-Wei Tang, Chris Thomas | N/A | N/A |
| KnowCoder: Coding Structured Knowledge into LLMs for Universal Information Extraction | Zixuan Li, Yutao Zeng, Yuxin Zuo, Weicheng Ren, Wenxuan Liu, Miao Su, Yucan Guo, Yantao Liu, Xiang Li, Zhilei Hu, Long Bai, Wei Li, Yidan Liu, Pan Yang, Xiaolong Jin, Jiafeng Guo, Xueqi Cheng | N/A | N/A |
| ERA-CoT: Improving Chain-of-Thought through Entity Relationship Analysis | Yanming Liu, Xinyue Peng, Tianyu Du, Jianwei Yin, Weihao Liu, Xuhong Zhang | N/A | N/A |
| EconAgent: Large Language Model-Empowered Agents for Simulating Macroeconomic Activities | Nian Li, Chen Gao, Mingyu Li, Yong Li, Qingmin Liao | N/A | N/A |
| On the Multi-turn Instruction Following for Conversational Web Agents | Yang Deng, Xuan Zhang, Wenxuan Zhang, Yifei Yuan, See-Kiong Ng, Tat-Seng Chua | N/A | N/A |
| Mobile-Bench: An Evaluation Benchmark for LLM-based Mobile Agents | Shihan Deng, Weikai Xu, Hongda Sun, Wei Liu, Tao Tan, Jianfeng Liu, Ang Li, Jian Luan, Bin Wang, Rui Yan, Shuo Shang | N/A | N/A |
| MC$^2$: Towards Transparent and Culturally-Aware NLP for Minority Languages in China | Chen Zhang, Mingxu Tao, Quzhe Huang, Jiuheng Lin, Zhibin Chen, Yansong Feng | N/A | N/A |
| Decoder-only Streaming Transformer for Simultaneous Translation | Shoutao Guo, Shaolei Zhang, Yang Feng | N/A | N/A |
| Defending Large Language Models Against Jailbreaking Attacks Through Goal Prioritization | Zhexin Zhang, Junxiao Yang, Pei Ke, Fei Mi, Hongning Wang, Minlie Huang | N/A | N/A |
| I am a Strange Dataset: Metalinguistic Tests for Language Models | Tristan Thrush, Jared Moore, Miguel Monares, Christopher Potts, Douwe Kiela | N/A | N/A |
| SafetyBench: Evaluating the Safety of Large Language Models | Zhexin Zhang, Leqi Lei, Lindong Wu, Rui Sun, Yongkang Huang, Chong Long, Xiao Liu, Xuanyu Lei, Jie Tang, Minlie Huang | N/A | N/A |
| Deciphering Oracle Bone Language with Diffusion Models | Haisu Guan, Huanxin Yang, Xinyu Wang, Shengwei Han, Yongge Liu, Lianwen Jin, Xiang Bai, Yuliang Liu | N/A | N/A |
| TruthX: Alleviating Hallucinations by Editing Large Language Models in Truthful Space | Shaolei Zhang, Tian Yu, Yang Feng | N/A | N/A |
| ProtLLM: An Interleaved Protein-Language LLM with Protein-as-Word Pre-Training | Le Zhuo, Zewen Chi, Minghao Xu, Heyan Huang, Jianan Zhao, Heqi Zheng, Conghui He, Xian-Ling Mao, Wentao Zhang | N/A | N/A |
| StreamSpeech: Simultaneous Speech-to-Speech Translation with Multi-task Learning | Shaolei Zhang, Qingkai Fang, Shoutao Guo, Zhengrui Ma, Min Zhang, Yang Feng | N/A | N/A |
| Investigating Multi-Hop Factual Shortcuts in Knowledge Editing of Large Language Models | Tianjie Ju, Yijin Chen, Xinwei Yuan, Zhuosheng Zhang, Wei Du, Yubin Zheng, Gongshen Liu | N/A | N/A |
| Why Don’t Prompt-Based Fairness Metrics Correlate? | Abdelrahman Zayed, Goncalo Mordido, Ioana Baldini, Sarath Chandar | N/A | N/A |
| NaijaHate: Evaluating Hate Speech Detection on Nigerian Twitter Using Representative Data | Manuel Tonneau, Pedro Vitor Quinta de Castro, Karim Lasri, Ibrahim Sambo Farouq, Lakshmi Subramanian, Victor Orozco-Olvera, Samuel Fraiberger | N/A | N/A |
| M$^3$AV: A Multimodal, Multigenre, and Multipurpose Audio-Visual Academic Lecture Dataset | Zhe Chen, Heyang Liu, Wenyi Yu, Guangzhi Sun, Hongcheng Liu, Ji Wu, Chao Zhang, Yu Wang, Yanfeng Wang | N/A | N/A |
| Mitigating Biases for Instruction-following Language Models via Bias Neurons Elimination | Nakyeong Yang, Taegwan Kang, Stanley Jungkyu Choi, Honglak Lee, Kyomin Jung | N/A | N/A |
| Domain Adaptation for Subjective Induction Questions Answering on Products by Adversarial Disentangled Learning | Yufeng Zhang, Jianxing Yu, Yanghui Rao, Libin Zheng, Qinliang Su, Huaijie Zhu, Jian Yin | N/A | N/A |
| Revisiting Demonstration Selection Strategies in In-Context Learning | Keqin Peng, Liang Ding, Yancheng Yuan, Xuebo Liu, Min Zhang, Yuanxin Ouyang, Dacheng Tao | N/A | N/A |
| Multimodal Table Understanding | Mingyu Zheng, Xinwei Feng, Qingyi Si, Qiaoqiao She, Zheng Lin, Wenbin Jiang, Weiping Wang | N/A | N/A |
| Ex$^3$: Automatic Novel Writing by Extracting, Excelsior and Expanding | Huang Lei, Jiaming Guo, Guanhua He, Xishan Zhang, Rui Zhang, Shaohui Peng, Shaoli Liu, Tianshi Chen | N/A | N/A |
| Few-shot Transfer Learning for Knowledge Base Question Answering: Fusing Supervised Models with In-Context Learning | Mayur Patidar, Riya Sawhney, Avinash Kumar Singh, Biswajit Chatterjee, Mausam, Indrajit Bhattacharya | N/A | N/A |
| WatME: Towards Lossless Watermarking Through Lexical Redundancy | Liang Chen, Yatao Bian, Yang Deng, Deng Cai, Shuaiyi Li, Peilin Zhao, Kam-Fai Wong | N/A | N/A |
| Text-like Encoding of Collaborative Information in Large Language Models for Recommendation | Yang Zhang, Keqin Bao, Ming Yan, Wenjie Wang, Fuli Feng, Xiangnan He | N/A | N/A |
| MM-SAP: A Comprehensive Benchmark for Assessing Self-Awareness of Multimodal Large Language Models in Perception | Yuhao Wang, Yusheng Liao, Heyang Liu, Hongcheng Liu, Yanfeng Wang, Yu Wang | N/A | N/A |
| Focus on Your Question! Interpreting and Mitigating Toxic CoT Problems in Commonsense Reasoning | Jiachun Li, Pengfei Cao, Chenhao Wang, Zhuoran Jin, Yubo Chen, Daojian Zeng, Kang Liu, Jun Zhao | N/A | N/A |
| Multi-Aspect Controllable Text Generation with Disentangled Counterfactual Augmentation | Yi Liu, Xiangyu Liu, Xiangrong Zhu, Wei Hu | N/A | N/A |
| M4LE: A Multi-Ability Multi-Range Multi-Task Multi-Domain Long-Context Evaluation Benchmark for Large Language Models | Wai-Chung Kwan, Xingshan Zeng, Yufei Wang, Yusen Sun, Liangyou Li, Yuxin Jiang, Lifeng Shang, Qun Liu, Kam-Fai Wong | N/A | N/A |
| Reward-based Input Construction for Cross-document Relation Extraction | Byeonghu Na, Suhyeon Jo, Yeongmin Kim, Il-chul Moon | N/A | N/A |
| Hyperspherical Multi-Prototype with Optimal Transport for Event Argument Extraction | Guangjun Zhang, Hu Zhang, Yujie Wang, Ru Li, Hongye Tan, Jiye Liang | N/A | N/A |
| Understanding Retrieval Robustness for Retrieval-augmented Image Captioning | Wenyan Li, Jiaang Li, Rita Ramos, Raphael Tang, Desmond Elliott | N/A | N/A |
| Semi-Supervised Spoken Language Glossification | Huijie Yao, Wengang Zhou, Hao Zhou, Houqiang Li | N/A | N/A |
| SeeClick: Harnessing GUI Grounding for Advanced Visual GUI Agents | Kanzhi Cheng, Qiushi Sun, Yougang Chu, Fangzhi Xu, Yantao Li, Jianbing Zhang, Zhiyong Wu | N/A | N/A |
| InterrogateLLM: Zero-Resource Hallucination Detection in LLM-Generated Answers | Yakir Yehuda, Itzik Malkiel, Oren Barkan, Jonathan Weill, Royi Ronen, Noam Koenigstein | N/A | N/A |
| F-Eval: Assessing Fundamental Abilities with Refined Evaluation Methods | Yu Sun, Keyu Chen, Shujie Wang, Peiji Li, Qipeng Guo, Hang Yan, Xipeng Qiu, Xuanjing Huang, Dahua Lin | N/A | N/A |
| Comparing Inferential Strategies of Humans and Large Language Models in Deductive Reasoning | Philipp Mondorf, Barbara Plank | N/A | N/A |
| Whose Preferences? Differences in Fairness Preferences and Their Impact on the Fairness of AI Utilizing Human Feedback | Maria Emilia Agis Lerner, Florian E. Dorner, Elliott Ash, Naman Goel | N/A | N/A |
| Math-Shepherd: Verify and Reinforce LLMs Step-by-step without Human Annotations | Peiyi Wang, Lei Li, Zhihong Shao, Runxin Xu, Damai Dai, Yifei Li, Deli Chen, Yu Wu, Zhifang Sui | N/A | N/A |
| Large Language Models are not Fair Evaluators | Peiyi Wang, Lei Li, Liang Chen, Zefan Cai, Dawei Zhu, Binghuai Lin, Yunbo Cao, Lingpeng Kong, Qi Liu, Tianyu Liu, Zhifang Sui | N/A | N/A |
| Improving Large Language Models in Event Relation Logical Prediction | Meiqi Chen, Yubo Ma, Kaitao Song, Yixin Cao, Yan Zhang, Dongsheng Li | N/A | N/A |
| Synchronized Video Storytelling: Generating Video Narrations with Structured Storyline | Dingyi Yang, Chunru Zhan, Ziheng Wang, Biao Wang, Tiezheng Ge, Bo Zheng, Qin Jin | N/A | N/A |
| Fine-Grained Image-Text Alignment in Medical Imaging Enables Explainable Cyclic Image-Report Generation | Wenting Chen, Linlin Shen, Jingyang Lin, Jiebo Luo, Xiang Li, Yixuan Yuan | N/A | N/A |
| T-Eval: Evaluating the Tool Utilization Capability of Large Language Models Step by Step | Zehui Chen, Weihua Du, Wenwei Zhang, Kuikun Liu, Jiangning Liu, Miao Zheng, Jingming Zhuo, Songyang Zhang, Dahua Lin, Kai Chen, Feng Zhao | N/A | N/A |
| Are LLM-based Evaluators Confusing NLG Quality Criteria? | Xinyu Hu, Mingqi Gao, Sen Hu, Yang Zhang, Yicheng Chen, Teng Xu, Xiaojun Wan | N/A | N/A |
| Synergistic Interplay between Search and Large Language Models for Information Retrieval | Jiazhan Feng, Chongyang Tao, Xiubo Geng, Tao Shen, Can Xu, Guodong Long, Dongyan Zhao, Daxin Jiang | N/A | N/A |
| Linear Transformers with Learnable Kernel Functions are Better In-Context Models | Yaroslav Aksenov, Nikita Balagansky, Sofia Maria Lo Cicero Vaina, Boris Shaposhnikov, Alexey Gorbatovski, Daniil Gavrilov | N/A | N/A |
| Temperature-scaling surprisal estimates improve fit to human reading times – but does it do so for the “right reasons”? | Tong Liu, Iza Škrjanec, Vera Demberg | N/A | N/A |
| Beyond Recognising Entailment: Formalising Natural Language Inference from an Argumentative Perspective | Ameer Saadat-Yazdi, Nadin Kökciyan | N/A | N/A |
| RomanSetu: Efficiently unlocking multilingual capabilities of Large Language Models via Romanization | Jaavid Aktar Husain J, Raj Dabre, Aswanth Kumar M, Jay Gala, Thanmay Jayakumar, Ratish Puduppully, Anoop Kunchukuttan | N/A | N/A |
| AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling | Jun Zhan, Junqi Dai, Jiasheng Ye, Yunhua Zhou, Dong Zhang, Zhigeng Liu, Xin Zhang, Ruibin Yuan, Ge Zhang, Linyang Li, Hang Yan, Jie Fu, Tao Gui, Tianxiang Sun, Yu-Gang Jiang, Xipeng Qiu | N/A | N/A |
| CofiPara: A Coarse-to-fine Paradigm for Multimodal Sarcasm Target Identification with Large Multimodal Models | Zixin Chen, Hongzhan Lin, Ziyang Luo, Mingfei Cheng, Jing Ma, Guang Chen | N/A | N/A |
| Direct Large Language Model Alignment Through Self-Rewarding Contrastive Prompt Distillation | Aiwei Liu, Haoping Bai, Zhiyun Lu, Xiang Kong, Xiaoming Simon Wang, Jiulong Shan, Meng Cao, Lijie Wen | N/A | N/A |
| Diffusion Lens: Interpreting Text Encoders in Text-to-Image Pipelines | Michael Toker, Hadas Orgad, Mor Ventura, Dana Arad, Yonatan Belinkov | N/A | N/A |
| Parrot: Enhancing Multi-Turn Instruction Following for Large Language Models | Yuchong Sun, Che Liu, Kun Zhou, Jinwen Huang, Ruihua Song, Xin Zhao, Fuzheng Zhang, Di Zhang, Kun Gai | N/A | N/A |
| Robust Singing Voice Transcription Serves Synthesis | Ruiqi Li, Yu Zhang, Yongqi Wang, Zhiqing Hong, Rongjie Huang, Zhou Zhao | N/A | N/A |
| VulLibGen: Generating Names of Vulnerability-Affected Packages via a Large Language Model | Tianyu Chen, Lin Li, Zhu Liuchuan, Zongyang Li, Xueqing Liu, Guangtai Liang, Qianxiang Wang, Tao Xie | N/A | N/A |
| Self-Modifying State Modeling for Simultaneous Machine Translation | Donglei Yu, Xiaomian Kang, Yuchen Liu, Yu Zhou, Chengqing Zong | N/A | N/A |
| MapGPT: Map-Guided Prompting with Adaptive Path Planning for Vision-and-Language Navigation | Jiaqi Chen, Bingqian Lin, Ran Xu, Zhenhua Chai, Xiaodan Liang, Kwan-Yee K. Wong | N/A | N/A |
| BadAgent: Inserting and Activating Backdoor Attacks in LLM Agents | Yifei Wang, Dizhan Xue, Shengjie Zhang, Shengsheng Qian | N/A | N/A |
| DetermLR: Augmenting LLM-based Logical Reasoning from Indeterminacy to Determinacy | Hongda Sun, Weikai Xu, Wei Liu, Jian Luan, Bin Wang, Shuo Shang, Ji-Rong Wen, Rui Yan | N/A | N/A |
| LePaRD: A Large-Scale Dataset of Judicial Citations to Precedent | Robert Mahari, Dominik Stammbach, Elliott Ash, Alex Pentland | N/A | N/A |
| To Generate or to Retrieve? On the Effectiveness of Artificial Contexts for Medical Open-Domain Question Answering | Giacomo Frisoni, Alessio Cocchieri, Alex Presepi, Gianluca Moro, Zaiqiao Meng | N/A | N/A |
| MERA: A Comprehensive LLM Evaluation in Russian | Alena Fenogenova, Artem Chervyakov, Nikita Martynov, Anastasia Kozlova, Maria Tikhonova, Albina Akhmetgareeva, Anton Emelyanov, Denis Shevelev, Pavel Lebedev, Leonid S Sinev, Ulyana Isaeva, Katerina Kolomeytseva, Daniil Moskovskiy, Elizaveta Goncharova, Nikita Savushkin, Polina Mikhailova, Anastasia Minaeva, Denis Dimitrov, Alexander Panchenko, Sergey Markov | N/A | N/A |
| SC2: Towards Enhancing Content Preservation and Style Consistency in Long Text Style Transfer | Jie Zhao, Ziyu Guan, Cai Xu, Wei Zhao, Yue Jiang | N/A | N/A |
| Causal Estimation of Memorisation Profiles | Pietro Lesci, Clara Meister, Thomas Hofmann, Andreas Vlachos, Tiago Pimentel | N/A | N/A |
| CHECKWHY: Causal Fact Verification via Argument Structure | Jiasheng Si, Yibo Zhao, Yingjie Zhu, Haiyang Zhu, Wenpeng Lu, Deyu Zhou | N/A | N/A |
| Dodo: Dynamic Contextual Compression for Decoder-only LMs | Guanghui Qin, Corby Rosset, Ethan C. Chau, Nikhil Rao, Benjamin Van Durme | N/A | N/A |
| POMP: Probability-driven Meta-graph Prompter for LLMs in Low-resource Unsupervised Neural Machine Translation | Shilong Pan, Zhiliang Tian, Liang Ding, Haoqi Zheng, Zhen Huang, Zhihua Wen, Dongsheng Li | N/A | N/A |
| NewsBench: A Systematic Evaluation Framework for Assessing Editorial Capabilities of Large Language Models in Chinese Journalism | Miao Li, Ming-Bin Chen, Bo Tang, Shengbin Hou, Pengyu Wang, Haiying Deng, Zhiyu Li, Feiyu Xiong, Keming Mao, Cheng Peng, Yi Luo | N/A | N/A |
| MAPO: Advancing Multilingual Reasoning through Multilingual-Alignment-as-Preference Optimization | Shuaijie She, Wei Zou, Shujian Huang, Wenhao Zhu, Xiang Liu, Xiang Geng, Jiajun Chen | N/A | N/A |
| Enhancing Noise Robustness of Retrieval-Augmented Language Models with Adaptive Adversarial Training | Feiteng Fang, Yuelin Bai, Shiwen Ni, Min Yang, Xiaojun Chen, Ruifeng Xu | N/A | N/A |
| Predicting Text Preference Via Structured Comparative Reasoning | Jing Nathan Yan, Tianqi Liu, Justin T Chiu, Jiaming Shen, Zhen Qin, Yue Yu, Charumathi Lakshmanan, Yair Kurzion, Alexander M Rush, Jialu Liu, Michael Bendersky | N/A | N/A |
| CoELM: Construction-Enhanced Language Modeling | Lvxiaowei Xu, Zhilin Gong, Jianhua Dai, Tianxiang Wang, Ming Cai, Jiawei Peng | N/A | N/A |
| Quality-Aware Translation Models: Efficient Generation and Quality Estimation in a Single Model | Christian Tomani, David Vilar, Markus Freitag, Colin Cherry, Subhajit Naskar, Mara Finkelstein, Xavier Garcia, Daniel Cremers | N/A | N/A |
| Uni-Dubbing: Zero-Shot Speech Synthesis from Visual Articulation | Songju Lei, Xize Cheng, Mengjiao Lyu, Jianqiao Hu, Jintao Tan, Runlin Liu, Lingyu Xiong, Tao Jin, Xiandong Li, Zhou Zhao | N/A | N/A |
| On the Impact of Calibration Data in Post-training Quantization and Pruning | Miles Williams, Nikolaos Aletras | N/A | N/A |
| SymKGQA: Few-Shot Knowledge Graph Question Answering via Symbolic Program Generation and Execution | Prerna Agarwal, Nishant Kumar, Srikanta J. Bedathur | N/A | N/A |
| Meta-Task Prompting Elicits Embeddings from Large Language Models | Yibin Lei, Di Wu, Tianyi Zhou, Tao Shen, Yu Cao, Chongyang Tao, Andrew Yates | N/A | N/A |
| A Sentiment Consolidation Framework for Meta-Review Generation | Miao Li, Jey Han Lau, Eduard Hovy | N/A | N/A |
| Revisiting Structured Sentiment Analysis as Latent Dependency Graph Parsing | Chengjie Zhou, Bobo Li, Hao Fei, Fei Li, Chong Teng, Donghong Ji | N/A | N/A |
| OWSM-CTC: An Open Encoder-Only Speech Foundation Model for Speech Recognition, Translation, and Language Identification | Yifan Peng, Yui Sudo, Muhammad Shakeel, Shinji Watanabe | N/A | N/A |
| Do Large Language Models Latently Perform Multi-Hop Reasoning? | Sohee Yang, Elena Gribovskaya, Nora Kassner, Mor Geva, Sebastian Riedel | N/A | N/A |
| MuggleMath: Assessing the Impact of Query and Response Augmentation on Math Reasoning | Chengpeng Li, Zheng Yuan, Hongyi Yuan, Guanting Dong, Keming Lu, Jiancan Wu, Chuanqi Tan, Xiang Wang, Chang Zhou | N/A | N/A |
| Harnessing Toulmin’s theory for zero-shot argument explication | Ankita Gupta, Ethan Zuckerman, Brendan O’Connor | N/A | N/A |
| BinaryAlign: Word Alignment as Binary Sequence Labeling | Gaetan Lopez Latouche, Marc-André Carbonneau, Benjamin Swanson | N/A | N/A |
| Quantifying the Persona Effect in LLM Simulations | Tiancheng Hu, Nigel Collier | N/A | N/A |
| On Efficient and Statistical Quality Estimation for Data Annotation | Jan-Christoph Klie, Juan Haladjian, Marc Kirchner, Rahul Nair | N/A | N/A |
| EZ-STANCE: A Large Dataset for English Zero-Shot Stance Detection | Chenye Zhao, Cornelia Caragea | N/A | N/A |
| Artifacts or Abduction: How Do LLMs Answer Multiple-Choice Questions Without the Question? | Nishant Balepur, Abhilasha Ravichander, Rachel Rudinger | N/A | N/A |
| Retrieval Augmented Fact Verification by Synthesizing Contrastive Arguments | Zhenrui Yue, Huimin Zeng, Lanyu Shang, Yifan Liu, Yang Zhang, Dong Wang | N/A | N/A |
| SyllabusQA: A Course Logistics Question Answering Dataset | Nigel Fernandez, Alexander Scarlatos, Andrew Lan | N/A | N/A |
| American Sign Language Handshapes Reflect Pressures for Communicative Efficiency | Kayo Yin, Terry Regier, Dan Klein | N/A | N/A |
| MindMap: Knowledge Graph Prompting Sparks Graph of Thoughts in Large Language Models | Yilin Wen, Zifeng Wang, Jimeng Sun | N/A | N/A |
| AGB-DE: A Corpus for the Automated Legal Assessment of Clauses in German Consumer Contracts | Daniel Braun, Florian Matthes | N/A | N/A |
| Examining the robustness of LLM evaluation to the distributional assumptions of benchmarks | Charlotte Siska, Katerina Marazopoulou, Melissa Ailem, James Bono | N/A | N/A |
| Re-Tuning: Overcoming the Compositionality Limits of Large Language Models with Recursive Tuning | Eric Pasewark, Kyle Montgomery, Kefei Duan, Dawn Song, Chenguang Wang | N/A | N/A |
| Bridging the Preference Gap between Retrievers and LLMs | Zixuan Ke, Weize Kong, Cheng Li, Mingyang Zhang, Qiaozhu Mei, Michael Bendersky | N/A | N/A |
| Large Language Models Can Learn Temporal Reasoning | Siheng Xiong, Ali Payani, Ramana Rao Kompella, Faramarz Fekri | N/A | N/A |
| Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research | Luca Soldaini, Rodney Kinney, Akshita Bhagia, Dustin Schwenk, David Atkinson, Russell Authur, Ben Bogin, Khyathi Chandu, Jennifer Dumas, Yanai Elazar, Valentin Hofmann, Ananya Harsh Jha, Sachin Kumar, Li Lucy, Xinxi Lyu, Nathan Lambert, Ian Magnusson, Jacob Morrison, Niklas Muennighoff, Aakanksha Naik, Crystal Nam, Matthew E Peters, Abhilasha Ravichander, Kyle Richardson, Zejiang Shen, Emma Strubell, Nishant Subramani, Oyvind Tafjord, Evan Pete Walsh, Luke Zettlemoyer, Noah A. Smith, Hannaneh Hajishirzi, Iz Beltagy, Dirk Groeneveld, Jesse Dodge, Kyle Lo | N/A | N/A |
| Learning Relational Decomposition of Queries for Question Answering from Tables | Raphaël Mouravieff, Benjamin Piwowarski, Sylvain Lamprier | N/A | N/A |
| Characterizing Similarities and Divergences in Conversational Tones in Humans and LLMs by Sampling with People | Dun-Ming Huang, Pol Van Rijn, Ilia Sucholutsky, Raja Marjieh, Nori Jacoby | N/A | N/A |
| Pareto Optimal Learning for Estimating Large Language Model Errors | Theodore Zhao, Mu Wei, J. Samuel Preston, Hoifung Poon | N/A | N/A |
| Simul-LLM: A Framework for Exploring High-Quality Simultaneous Translation with Large Language Models | Victor Agostinelli III, Max Wild, Matthew Raffel, Kazi Ahmed Asif Fuad, Lizhong Chen | N/A | N/A |
| Defending Against Alignment-Breaking Attacks via Robustly Aligned LLM | Bochuan Cao, Yuanpu Cao, Lu Lin, Jinghui Chen | N/A | N/A |
| Interactive-KBQA: Multi-Turn Interactions for Knowledge Base Question Answering with Large Language Models | Guanming Xiong, Junwei Bao, Wen Zhao | N/A | N/A |
| LLMs in the Imaginarium: Tool Learning through Simulated Trial and Error | Boshi Wang, Hao Fang, Jason Eisner, Benjamin Van Durme, Yu Su | N/A | N/A |
| HyperMoE: Towards Better Mixture of Experts via Transferring Among Experts | Hao Zhao, Zihan Qiu, Huijia Wu, Zili Wang, Zhaofeng He, Jie Fu | N/A | N/A |
| Aligning Large Language Models with Human Preferences through Representation Engineering | Wenhao Liu, Xiaohua Wang, Muling Wu, Tianlong Li, Changze Lv, Zixuan Ling, Zhu JianHao, Cenyuan Zhang, Xiaoqing Zheng, Xuanjing Huang | N/A | N/A |
| CODIS: Benchmarking Context-dependent Visual Comprehension for Multimodal Large Language Models | Fuwen Luo, Chi Chen, Zihao Wan, Zhaolu Kang, Qidong Yan, Yingjie Li, Xiaolong Wang, Siyu Wang, Ziyue Wang, Xiaoyue Mi, Peng Li, Ning Ma, Maosong Sun, Yang Liu | N/A | N/A |
| ARAIDA: Analogical Reasoning-Augmented Interactive Data Annotation | Chen Huang, Yiping Jin, Ilija Ilievski, Wenqiang Lei, Jiancheng Lv | N/A | N/A |
| PolCLIP: A Unified Image-Text Word Sense Disambiguation Model via Generating Multimodal Complementary Representations | Qihao Yang, Yong Li, Xuelin Wang, Fu Lee Wang, Tianyong Hao | N/A | N/A |
| Prompted Aspect Key Point Analysis for Quantitative Review Summarization | An Quang Tang, Xiuzhen Zhang, Minh Ngoc Dinh, Erik Cambria | N/A | N/A |
| Ask Again, Then Fail: Large Language Models’ Vacillations in Judgment | Qiming Xie, Zengzhi Wang, Yi Feng, Rui Xia | N/A | N/A |
| CLAMBER: A Benchmark of Identifying and Clarifying Ambiguous Information Needs in Large Language Models | Tong Zhang, Peixin Qin, Yang Deng, Chen Huang, Wenqiang Lei, Junhong Liu, Dingnan Jin, Hongru Liang, Tat-Seng Chua | N/A | N/A |
| Multimodal Reasoning with Multimodal Knowledge Graph | Junlin Lee, Yequan Wang, Jing Li, Min Zhang | N/A | N/A |
| Confidence is not Timeless: Modeling Temporal Validity for Rule-based Temporal Knowledge Graph Forecasting | Rikui Huang, Wei Wei, Xiaoye Qu, Shengzhe Zhang, Dangyang Chen, Yu Cheng | N/A | N/A |
| CARE: A Clue-guided Assistant for CSRs to Read User Manuals | Weihong Du, Jia Liu, Zujie Wen, Dingnan Jin, Hongru Liang, Wenqiang Lei | N/A | N/A |
| Enhancing Numerical Reasoning with the Guidance of Reliable Reasoning Processes | Dingzirui Wang, Longxu Dou, Xuanliang Zhang, Qingfu Zhu, Wanxiang Che | N/A | N/A |
| PAGED: A Benchmark for Procedural Graphs Extraction from Documents | Weihong Du, Wenrui Liao, Hongru Liang, Wenqiang Lei | N/A | N/A |
| Navigating the Shadows: Unveiling Effective Disturbances for Modern AI Content Detectors | Ying Zhou, Ben He, Le Sun | N/A | N/A |
| RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models | Cheng Niu, Yuanhao Wu, Juno Zhu, Siliang Xu, KaShun Shum, Randy Zhong, Juntong Song, Tong Zhang | N/A | N/A |
| The Dawn After the Dark: An Empirical Study on Factuality Hallucination in Large Language Models | Junyi Li, Jie Chen, Ruiyang Ren, Xiaoxue Cheng, Xin Zhao, Jian-Yun Nie, Ji-Rong Wen | N/A | N/A |
| Revisiting Knowledge Distillation for Autoregressive Language Models | Qihuang Zhong, Liang Ding, Li Shen, Juhua Liu, Bo Du, Dacheng Tao | N/A | N/A |
| OLMo: Accelerating the Science of Language Models | Dirk Groeneveld, Iz Beltagy, Evan Pete Walsh, Akshita Bhagia, Rodney Kinney, Oyvind Tafjord, Ananya Harsh Jha, Hamish Ivison, Ian Magnusson, Yizhong Wang, Shane Arora, David Atkinson, Russell Authur, Khyathi Chandu, Arman Cohan, Jennifer Dumas, Yanai Elazar, Yuling Gu, Jack Hessel, Tushar Khot, William Merrill, Jacob Morrison, Niklas Muennighoff, Aakanksha Naik, Crystal Nam, Matthew E Peters, Valentina Pyatkin, Abhilasha Ravichander, Dustin Schwenk, Saurabh Shah, William H. Smith, Emma Strubell, Nishant Subramani, Mitchell Wortsman, Pradeep Dasigi, Nathan Lambert, Kyle Richardson, Luke Zettlemoyer, Jesse Dodge, Kyle Lo, Luca Soldaini, Noah A. Smith, Hannaneh Hajishirzi | N/A | N/A |
| Continual Learning with Semi-supervised Contrastive Distillation for Incremental Neural Machine Translation | Yunlong Liang, Fandong Meng, Jiaan Wang, Jinan Xu, Yufeng Chen, Jie Zhou | N/A | N/A |
| Make-A-Voice: Revisiting Voice Large Language Models as Scalable Multilingual and Multitask Learners | Rongjie Huang, Chunlei Zhang, Yongqi Wang, Dongchao Yang, Jinchuan Tian, Zhenhui Ye, Luping Liu, Zehan Wang, Ziyue Jiang, Xuankai Chang, Jiatong Shi, Chao Weng, Zhou Zhao, Dong Yu | N/A | N/A |
| Chat Vector: A Simple Approach to Equip LLMs with Instruction Following and Model Alignment in New Languages | Shih-Cheng Huang, Pin-Zu Li, Yu-Chi Hsu, Kuang-Ming Chen, Yu Tung Lin, Shih-Kai Hsiao, Richard Tzong-Han Tsai, Hung-yi Lee | N/A | N/A |
| Emulated Disalignment: Safety Alignment for Large Language Models May Backfire! | Zhanhui Zhou, Jie Liu, Zhichen Dong, Jiaheng Liu, Chao Yang, Wanli Ouyang, Yu Qiao | N/A | N/A |
| PRP: Propagating Universal Perturbations to Attack Large Language Model Guard-Rails | Neal Mangaokar, Ashish Hooda, Jihye Choi, Shreyas Chandrashekaran, Kassem Fawaz, Somesh Jha, Atul Prakash | N/A | N/A |
| Hide and Seek in Noise Labels: Noise-Robust Collaborative Active Learning with LLMs-Powered Assistance | Bo Yuan, Yulin Chen, Yin Zhang, Wei Jiang | N/A | N/A |
| CLOMO: Counterfactual Logical Modification with Large Language Models | Yinya Huang, Ruixin Hong, Hongming Zhang, Wei Shao, Zhicheng Yang, Dong Yu, Changshui Zhang, Xiaodan Liang, Linqi Song | N/A | N/A |
| Exploring Hybrid Question Answering via Program-based Prompting | Qi Shi, Han Cui, Haofeng Wang, Qingfu Zhu, Wanxiang Che, Ting Liu | N/A | N/A |
| IndicGenBench: A Multilingual Benchmark to Evaluate Generation Capabilities of LLMs on Indic Languages | Harman Singh, Nitish Gupta, Shikhar Bharadwaj, Dinesh Tewari, Partha Talukdar | N/A | N/A |
| Simple but Effective Compound Geometric Operations for Temporal Knowledge Graph Completion | Rui Ying, Mengting Hu, Jianfeng Wu, Yalan Xie, Xiaoyi Liu, Zhunheng Wang, Ming Jiang, Hang Gao, Linlin Zhang, Renhong Cheng | N/A | N/A |
| Uncertainty Aware Learning for Language Model Alignment | Yikun Wang, Rui Zheng, Liang Ding, Qi Zhang, Dahua Lin, Dacheng Tao | N/A | N/A |
| Interpretable User Satisfaction Estimation for Conversational Systems with Large Language Models | Ying-Chun Lin, Jennifer Neville, Jack W Stokes, Longqi Yang, Tara Safavi, Mengting Wan, Scott Counts, Siddharth Suri, Reid Andersen, Xiaofeng Xu, Deepak Gupta, Sujay Kumar Jauhar, Xia Song, Georg Buscher, Saurabh Tiwary, Brent Hecht, Jaime Teevan | N/A | N/A |
| Fundamental Capabilities of Large Language Models and their Applications in Domain Scenarios: A Survey | Jiawei Li, Yizhe Yang, Yu Bai, Xiaofeng Zhou, Yinghao Li, Huashan Sun, Yuhang Liu, Xingpeng Si, Yuhao Ye, Yixiao Wu, 林一冠, Bin Xu, Ren Bowen, Chong Feng, Yang Gao, Heyan Huang | N/A | N/A |
| IndicLLMSuite: A Blueprint for Creating Pre-training and Fine-Tuning Datasets for Indian Languages | Mohammed Safi Ur Rahman Khan, Priyam Mehta, Ananth Sankar, Umashankar Kumaravelan, Sumanth Doddapaneni, Suriyaprasaad B, Varun Balan G, Sparsh Jain, Anoop Kunchukuttan, Pratyush Kumar, Raj Dabre, Mitesh M Khapra | N/A | N/A |
| Measuring Political Bias in Large Language Models: What Is Said and How It Is Said | Yejin Bang, Delong Chen, Nayeon Lee, Pascale Fung | N/A | N/A |
| Fortify the Shortest Stave in Attention: Enhancing Context Awareness of Large Language Models for Effective Tool Use | Yuhan Chen, Ang Lv, Ting-En Lin, Changyu Chen, Yuchuan Wu, Fei Huang, Yongbin Li, Rui Yan | N/A | N/A |
| Layer-Condensed KV Cache for Efficient Inference of Large Language Models | Haoyi Wu, Kewei Tu | N/A | N/A |
| Reasoning in Conversation: Solving Subjective Tasks through Dialogue Simulation for Large Language Models | Xiaolong Wang, Yile Wang, Yuanchi Zhang, Fuwen Luo, Peng Li, Maosong Sun, Yang Liu | N/A | N/A |
| Enhancing Multilingual Capabilities of Large Language Models through Self-Distillation from Resource-Rich Languages | Yuanchi Zhang, Yile Wang, Zijun Liu, Shuo Wang, Xiaolong Wang, Peng Li, Maosong Sun, Yang Liu | N/A | N/A |
| Benchmarking Chinese Commonsense Reasoning of LLMs: From Chinese-Specifics to Reasoning-Memorization Correlations | Jiaxing Sun, Weiquan Huang, Jiang Wu, Chenya Gu, Wei Li, Songyang Zhang, Hang Yan, Conghui He | N/A | N/A |
| Browse and Concentrate: Comprehending Multimodal Content via Prior-LLM Context Fusion | Ziyue Wang, Chi Chen, Yiqi Zhu, Fuwen Luo, Peng Li, Ming Yan, Ji Zhang, Fei Huang, Maosong Sun, Yang Liu | N/A | N/A |
| Model Composition for Multimodal Large Language Models | Chi Chen, Yiyang Du, Zheng Fang, Ziyue Wang, Fuwen Luo, Peng Li, Ming Yan, Ji Zhang, Fei Huang, Maosong Sun, Yang Liu | N/A | N/A |
| Draft & Verify: Lossless Large Language Model Acceleration via Self-Speculative Decoding | Jun Zhang, Jue Wang, Huan Li, Lidan Shou, Ke Chen, Gang Chen, Sharad Mehrotra | N/A | N/A |
| Soul-Mix: Enhancing Multimodal Machine Translation with Manifold Mixup | Xuxin Cheng, Ziyu Yao, Yifei Xin, Hao An, Hongxiang Li, Yaowei Li, Yuexian Zou | N/A | N/A |
| Measuring Meaning Composition in the Human Brain with Composition Scores from Large Language Models | Changjiang Gao, Jixing Li, Jiajun Chen, Shujian Huang | N/A | N/A |
| MIST: Mutual Information Maximization for Short Text Clustering | Krissanee Kamthawee, Can Udomcharoenchaikit, Sarana Nutanong | N/A | N/A |
| Self-chats from Large Language Models Make Small Emotional Support Chatbot Better | Zhonghua Zheng, Lizi Liao, Yang Deng, Libo Qin, Liqiang Nie | N/A | N/A |
| Improving Conversational Abilities of Quantized Large Language Models via Direct Preference Alignment | Janghwan Lee, Seongmin Park, Sukjin Hong, Minsoo Kim, Du-Seong Chang, Jungwook Choi | N/A | N/A |
| Complex Reasoning over Logical Queries on Commonsense Knowledge Graphs | Tianqing Fang, Zeming Chen, Yangqiu Song, Antoine Bosselut | N/A | N/A |
| An Expert is Worth One Token: Synergizing Multiple Expert LLMs as Generalist via Expert Token Routing | Ziwei Chai, Guoyin Wang, Jing Su, Tianjie Zhang, Xuanwen Huang, Xuwu Wang, Jingjing Xu, Jianbo Yuan, Hongxia Yang, Fei Wu, Yang Yang | N/A | N/A |
| Learning to Plan and Generate Text with Citations | Constanza Fierro, Reinald Kim Amplayo, Fantine Huot, Nicola De Cao, Joshua Maynez, Shashi Narayan, Mirella Lapata | N/A | N/A |
| Exploring Precision and Recall to assess the quality and diversity of LLMs | Florian Le Bronnec, Alexandre Verine, Benjamin Negrevergne, Yann Chevaleyre, Alexandre Allauzen | N/A | N/A |
| Aligning Large Language Models by On-Policy Self-Judgment | Sangkyu Lee, Sungdong Kim, Ashkan Yousefpour, Minjoon Seo, Kang Min Yoo, Youngjae Yu | N/A | N/A |
| IL-TUR: Benchmark for Indian Legal Text Understanding and Reasoning | Abhinav Joshi, Shounak Paul, Akshat Sharma, Pawan Goyal, Saptarshi Ghosh, Ashutosh Modi | N/A | N/A |
| JumpCoder: Go Beyond Autoregressive Coder via Online Modification | Mouxiang Chen, Hao Tian, Zhongxin Liu, Xiaoxue Ren, Jianling Sun | N/A | N/A |
| Aya Dataset: An Open-Access Collection for Multilingual Instruction Tuning | Shivalika Singh, Freddie Vargus, Daniel D’souza, Börje F. Karlsson, Abinaya Mahendiran, Wei-Yin Ko, Herumb Shandilya, Jay Patel, Deividas Mataciunas, Laura O’Mahony, Mike Zhang, Ramith Hettiarachchi, Joseph Wilson, Marina Machado, Luisa Souza Moura, Dominik Krzemiński, Hakimeh Fadaei, Irem Ergun, Ifeoma Okoh, Aisha Alaagib, Oshan Ivantha Mudannayake, Zaid Alyafeai, Vu Minh Chien, Sebastian Ruder, Surya Guthikonda, Emad A. Alghamdi, Sebastian Gehrmann, Niklas Muennighoff, Max Bartolo, Julia Kreutzer, Ahmet Üstün, Marzieh Fadaee, Sara Hooker | N/A | N/A |
| Language Models can Exploit Cross-Task In-context Learning for Data-Scarce Novel Tasks | Anwoy Chatterjee, Eshaan Tanwar, Subhabrata Dutta, Tanmoy Chakraborty | N/A | N/A |
| Split and Rephrase with Large Language Models | Antonio David Ponce Martínez, Thierry Etchegoyhen, Jesus Javier Calleja Perez, Harritxu Gete | N/A | N/A |
| ChunkAttention: Efficient Self-Attention with Prefix-Aware KV Cache and Two-Phase Partition | Lu Ye, Ze Tao, Yong Huang, Yang Li | N/A | N/A |
| AlignBench: Benchmarking Chinese Alignment of Large Language Models | Xiao Liu, Xuanyu Lei, Shengyuan Wang, Yue Huang, Andrew Zhuoer Feng, Bosi Wen, Jiale Cheng, Pei Ke, Yifan Xu, Weng Lam Tam, Xiaohan Zhang, Lichao Sun, Xiaotao Gu, Hongning Wang, Jing Zhang, Minlie Huang, Yuxiao Dong, Jie Tang | N/A | N/A |
| SAPT: A Shared Attention Framework for Parameter-Efficient Continual Learning of Large Language Models | Weixiang Zhao, Shilong Wang, Yulin Hu, Yanyan Zhao, Bing Qin, Xuanyu Zhang, Qing Yang, Dongliang Xu, Wanxiang Che | N/A | N/A |
| DoRA: Enhancing Parameter-Efficient Fine-Tuning with Dynamic Rank Distribution | Yulong Mao, Kaiyu Huang, Changhao Guan, Ganglin Bao, Fengran Mo, Jinan Xu | N/A | N/A |
| Cross-Lingual Knowledge Editing in Large Language Models | Jiaan Wang, Yunlong Liang, Zengkui Sun, Yuxuan Cao, Jiarong Xu, Fandong Meng | N/A | N/A |
| Aya Model: An Instruction Finetuned Open-Access Multilingual Language Model | Ahmet Üstün, Viraat Aryabumi, Zheng Xin Yong, Wei-Yin Ko, Daniel D’souza, Gbemileke Onilude, Neel Bhandari, Shivalika Singh, Hui-Lee Ooi, Amr Kayid, Freddie Vargus, Phil Blunsom, Shayne Longpre, Niklas Muennighoff, Marzieh Fadaee, Julia Kreutzer, Sara Hooker | N/A | N/A |
| Argument Mining in Data Scarce Settings: Cross-lingual Transfer and Few-shot Techniques | Anar Yeginbergen, Maite Oronoz, Rodrigo Agerri | N/A | N/A |
| Learning Task Decomposition to Assist Humans in Competitive Programming | Jiaxin Wen, Ruiqi Zhong, Pei Ke, Zhihong Shao, Hongning Wang, Minlie Huang | N/A | N/A |
| An Entropy-based Text Watermarking Detection Method | Yijian Lu, Aiwei Liu, Dianzhi Yu, Jingjing Li, Irwin King | N/A | N/A |
| Enhancing Explainable Rating Prediction through Annotated Macro Concepts | Huachi Zhou, Shuang Zhou, Hao Chen, Ninghao Liu, Fan Yang, Xiao Huang | N/A | N/A |
| How to Engage your Readers? Generating Guiding Questions to Promote Active Reading | Peng Cui, Vilém Zouhar, Xiaoyu Zhang, Mrinmaya Sachan | N/A | N/A |
| Less is More: Mitigating Multimodal Hallucination from an EOS Decision Perspective | Zihao Yue, Liang Zhang, Qin Jin | N/A | N/A |
| Integrate the Essence and Eliminate the Dross: Fine-Grained Self-Consistency for Free-Form Language Generation | Xinglin Wang, Yiwei Li, Shaoxiong Feng, Peiwen Yuan, Boyuan Pan, Heda Wang, Yao Hu, Kan Li | N/A | N/A |
| More frequent verbs are associated with more diverse valency frames: Efficient principles at the lexicon-grammar interface | Siyu Tao, Lucia Donatelli, Michael Hahn | N/A | N/A |
| BatchEval: Towards Human-like Text Evaluation | Peiwen Yuan, Shaoxiong Feng, Yiwei Li, Xinglin Wang, Boyuan Pan, Heda Wang, Yao Hu, Kan Li | N/A | N/A |
| Quantifying Generalizations: Exploring the Divide Between Human and LLMs’ Sensitivity to Quantification | Claudia Collacciani, Giulia Rambelli, Marianna Bolognesi | N/A | N/A |
| Can Large Language Models Interpret Noun-Noun Compounds? A Linguistically-Motivated Study on Lexicalized and Novel Compounds | Giulia Rambelli, Emmanuele Chersoni, Claudia Collacciani, Marianna Bolognesi | N/A | N/A |
| CharacterEval: A Chinese Benchmark for Role-Playing Conversational Agent Evaluation | Quan Tu, Shilong Fan, Zihang Tian, Tianhao Shen, Shuo Shang, Xin Gao, Rui Yan | N/A | N/A |
| Generative Cross-Modal Retrieval: Memorizing Images in Multimodal Language Models for Retrieval and Beyond | Yongqi Li, Wenjie Wang, Leigang Qu, Liqiang Nie, Wenjie Li, Tat-Seng Chua | N/A | N/A |
| Self-Training with Pseudo-Label Scorer for Aspect Sentiment Quad Prediction | Yice Zhang, Jie Zeng, Weiming Hu, Ziyi Wang, Shiwei Chen, Ruifeng Xu | N/A | N/A |
| ToMBench: Benchmarking Theory of Mind in Large Language Models | Zhuang Chen, Jincenzi Wu, Jinfeng Zhou, Bosi Wen, Guanqun Bi, Gongyao Jiang, Yaru Cao, Mengting Hu, Yunghwei Lai, Zexuan Xiong, Minlie Huang | N/A | N/A |
| Learning to Generate Answers with Citations via Factual Consistency Models | Rami Aly, Zhiqiang Tang, Samson Tan, George Karypis | N/A | N/A |
| Improving Text Embeddings with Large Language Models | Liang Wang, Nan Yang, Xiaolong Huang, Linjun Yang, Rangan Majumder, Furu Wei | N/A | N/A |
| Self-Training with Direct Preference Optimization Improves Chain-of-Thought Reasoning | Tianduo Wang, Shichen Li, Wei Lu | N/A | N/A |
| UltraLink: An Open-Source Knowledge-Enhanced Multilingual Supervised Fine-tuning Dataset | Haoyu Wang, Shuo Wang, Yukun Yan, Xujia Wang, Zhiyu Yang, Yuzhuang Xu, Zhenghao Liu, Liner Yang, Ning Ding, Xu Han, Zhiyuan Liu, Maosong Sun | N/A | N/A |
| Document-level Claim Extraction and Decontextualisation for Fact-Checking | Zhenyun Deng, Michael Sejr Schlichtkrull, Andreas Vlachos | N/A | N/A |
| PairCFR: Enhancing Model Training on Paired Counterfactually Augmented Data through Contrastive Learning | Xiaoqi Qiu, Yongjie Wang, Xu Guo, Zhiwei Zeng, Yu Yue, Yuhong Feng, Chunyan Miao | N/A | N/A |
| LLMs Learn Task Heuristics from Demonstrations: A Heuristic-Driven Prompting Strategy for Document-Level Event Argument Extraction | Hanzhang Zhou, Junlang Qian, Zijian Feng, Hui Lu, Zixiao Zhu, Kezhi Mao | N/A | N/A |
| Investigating and Mitigating the Multimodal Hallucination Snowballing in Large Vision-Language Models | Weihong Zhong, Xiaocheng Feng, Liang Zhao, Qiming Li, Lei Huang, Yuxuan Gu, Weitao Ma, Yuan Xu, Bing Qin | N/A | N/A |
| COKE: A Cognitive Knowledge Graph for Machine Theory of Mind | Jincenzi Wu, Zhuang Chen, Jiawen Deng, Sahand Sabour, Helen M. Meng, Minlie Huang | N/A | N/A |
| mCoT: Multilingual Instruction Tuning for Reasoning Consistency in Language Models | Huiyuan Lai, Malvina Nissim | N/A | N/A |
| GunStance: Stance Detection for Gun Control and Gun Regulation | Nikesh Gyawali, Iustin Sirbu, Tiberiu Sosea, Sarthak Khanal, Doina Caragea, Traian Rebedea, Cornelia Caragea | N/A | N/A |
| Beyond Traditional Benchmarks: Analyzing Behaviors of Open LLMs on Data-to-Text Generation | Zdeněk Kasner, Ondrej Dusek | N/A | N/A |
| Don’t Go To Extremes: Revealing the Excessive Sensitivity and Calibration Limitations of LLMs in Implicit Hate Speech Detection | Min Zhang, Jianfeng He, Taoran Ji, Chang-Tien Lu | N/A | N/A |
| Don’t Rank, Combine! Combining Machine Translation Hypotheses Using Quality Estimation | Giorgos Vernikos, Andrei Popescu-Belis | N/A | N/A |
| Generating and Evaluating Plausible Explanations for Knowledge Graph Completion | Antonio Di Mauro, Zhao Xu, Wiem Ben Rim, Timo Sztyler, Carolin Lawrence | N/A | N/A |
| One Prompt To Rule Them All: LLMs for Opinion Summary Evaluation | Tejpalsingh Siledar, Swaroop Nath, Sankara Sri Raghava Ravindra Muddu, Rupasai Rangaraju, Swaprava Nath, Pushpak Bhattacharyya, Suman Banerjee, Amey Patil, Sudhanshu Shekhar Singh, Muthusamy Chelliah, Nikesh Garera | N/A | N/A |
| MultiPICo: Multilingual Perspectivist Irony Corpus | Silvia Casola, Simona Frenda, Soda Marem Lo, Erhan Sezerer, Antonio Uva, Valerio Basile, Cristina Bosco, Alessandro Pedrani, Chiara Rubagotti, Viviana Patti, Davide Bernardi | N/A | N/A |
| LANDeRMT: Detecting and Routing Language-Aware Neurons for Selectively Finetuning LLMs to Machine Translation | Shaolin Zhu, Leiyu Pan, Bo Li, Deyi Xiong | N/A | N/A |
| A Joint Coreference-Aware Approach to Document-Level Target Sentiment Analysis | Hongjie Cai, Heqing Ma, Jianfei Yu, Rui Xia | N/A | N/A |
| VisDiaHalBench: A Visual Dialogue Benchmark For Diagnosing Hallucination in Large Vision-Language Models | Qingxing Cao, Junhao Cheng, Xiaodan Liang, Liang Lin | N/A | N/A |
| AutoDSL: Automated domain-specific language design for structural representation of procedures with constraints | Yu-Zhe Shi, Haofei Hou, Zhangqian Bi, Fanxu Meng, Xiang Wei, Lecheng Ruan, Qining Wang | N/A | N/A |
| Multipath parsing in the brain | Berta Franzluebbers, Donald Dunagan, Miloš Stanojević, Jan Buys, John T. Hale | N/A | N/A |
| Search-Adaptor: Embedding Customization for Information Retrieval | Jinsung Yoon, Yanfei Chen, Sercan O Arik, Tomas Pfister | N/A | N/A |
| Back to Basics: Revisiting REINFORCE-Style Optimization for Learning from Human Feedback in LLMs | Arash Ahmadian, Chris Cremer, Matthias Gallé, Marzieh Fadaee, Julia Kreutzer, Olivier Pietquin, Ahmet Üstün, Sara Hooker | N/A | N/A |
| VIEScore: Towards Explainable Metrics for Conditional Image Synthesis Evaluation | Max Ku, Dongfu Jiang, Cong Wei, Xiang Yue, Wenhu Chen | N/A | N/A |
| Tree Transformer’s Disambiguation Ability of Prepositional Phrase Attachment and Garden Path Effects | Lingling Zhou, Suzan Verberne, Gijs Wijnholds | N/A | N/A |
| Tree-of-Traversals: A Zero-Shot Reasoning Algorithm for Augmenting Black-box Language Models with Knowledge Graphs | Elan Sopher Markowitz, Anil Ramakrishna, Jwala Dhamala, Ninareh Mehrabi, Charith Peris, Rahul Gupta, Kai-Wei Chang, Aram Galstyan | N/A | N/A |
| Structured Tree Alignment for Evaluation of (Speech) Constituency Parsing | Freda Shi, Kevin Gimpel, Karen Livescu | N/A | N/A |
| ViSAGe: A Global-Scale Analysis of Visual Stereotypes in Text-to-Image Generation | Akshita Jha, Vinodkumar Prabhakaran, Remi Denton, Sarah Laszlo, Shachi Dave, Rida Qadri, Chandan K. Reddy, Sunipa Dev | N/A | N/A |
| AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agents | Harsh Trivedi, Tushar Khot, Mareike Hartmann, Ruskin Manku, Vinty Dong, Edward Li, Shashank Gupta, Ashish Sabharwal, Niranjan Balasubramanian | N/A | N/A |
| Transferable and Efficient Non-Factual Content Detection via Probe Training with Offline Consistency Checking | Xiaokang Zhang, Zijun Yao, Jing Zhang, Kaifeng Yun, Jifan Yu, Juanzi Li, Jie Tang | N/A | N/A |
| What Do Language Models Learn in Context? The Structured Task Hypothesis. | Jiaoda Li, Yifan Hou, Mrinmaya Sachan, Ryan Cotterell | N/A | N/A |
| Agent Lumos: Unified and Modular Training for Open-Source Language Agents | Da Yin, Faeze Brahman, Abhilasha Ravichander, Khyathi Chandu, Kai-Wei Chang, Yejin Choi, Bill Yuchen Lin | N/A | N/A |
| Investigating Cultural Alignment of Large Language Models | Badr AlKhamissi, Muhammad ElNokrashy, Mai Alkhamissi, Mona T. Diab | N/A | N/A |
| More Victories, Less Cooperation: Assessing Cicero’s Diplomacy Play | Wichayaporn Wongkamjan, Feng Gu, Yanze Wang, Ulf Hermjakob, Jonathan May, Brandon M. Stewart, Jonathan K. Kummerfeld, Denis Peskoff, Jordan Lee Boyd-Graber | N/A | N/A |
| VoiceCraft: Zero-Shot Speech Editing and Text-to-Speech in the Wild | Puyuan Peng, Po-Yao Huang, Shang-Wen Li, Abdelrahman Mohamed, David Harwath | N/A | N/A |
| RAID: A Shared Benchmark for Robust Evaluation of Machine-Generated Text Detectors | Liam Dugan, Alyssa Hwang, Filip Trhlík, Andrew Zhu, Josh Magnus Ludan, Hainiu Xu, Daphne Ippolito, Chris Callison-Burch | N/A | N/A |
| Silent Signals, Loud Impact: LLMs for Word-Sense Disambiguation of Coded Dog Whistles | Julia Kruk, Michela Marchini, Rijul Magu, Caleb Ziems, David Muchlinski, Diyi Yang | N/A | N/A |
| On the Representational Capacity of Neural Language Models with Chain-of-Thought Reasoning | Franz Nowak, Anej Svete, Alexandra Butoi, Ryan Cotterell | N/A | N/A |
| Analyzing LLM Behavior in Dialogue Summarization: Unveiling Circumstantial Hallucination Trends | Sanjana Ramprasad, Elisa Ferracane, Zachary Chase Lipton | N/A | N/A |
| MMToM-QA: Multimodal Theory of Mind Question Answering | Chuanyang Jin, Yutong Wu, Jing Cao, Jiannan Xiang, Yen-Ling Kuo, Zhiting Hu, Tomer Ullman, Antonio Torralba, Joshua B. Tenenbaum, Tianmin Shu | N/A | N/A |
| LLM in a flash: Efficient Large Language Model Inference with Limited Memory | Keivan Alizadeh, Seyed Iman Mirzadeh, Dmitry Belenko, S. Karen Khatamifard, Minsik Cho, Carlo C del Mundo, Mohammad Rastegari, Mehrdad Farajtabar | N/A | N/A |
| Video-ChatGPT: Towards Detailed Video Understanding via Large Vision and Language Models | Muhammad Maaz, Hanoona Abdul Rasheed, Salman Khan, Fahad Shahbaz Khan | N/A | N/A |
| To Distill or Not to Distill? On the Robustness of Robust Knowledge Distillation | Abdul Waheed, Karima Kadaoui, Muhammad Abdul-Mageed | N/A | N/A |
| DocMath-Eval: Evaluating Math Reasoning Capabilities of LLMs in Understanding Financial Documents | Yilun Zhao, Yitao Long, Hongjun Liu, Ryo Kamoi, Linyong Nan, Lyuhao Chen, Yixin Liu, Xiangru Tang, Rui Zhang, Arman Cohan | N/A | N/A |
| LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding | Mostafa Elhoushi, Akshat Shrivastava, Diana Liskovich, Basil Hosmer, Bram Wasti, Liangzhen Lai, Anas Mahmoud, Bilge Acun, Saurabh Agarwal, Ahmed Roman, Ahmed A Aly, Beidi Chen, Carole-Jean Wu | N/A | N/A |
| Unintended Impacts of LLM Alignment on Global Representation | Michael J Ryan, William Barr Held, Diyi Yang | N/A | N/A |
| Classist Tools: Social Class Correlates with Performance in NLP | Amanda Cercas Curry, Giuseppe Attanasio, Zeerak Talat, Dirk Hovy | N/A | N/A |
| ActionIE: Action Extraction from Scientific Literature with Programming Languages | Xianrui Zhong, Yufeng Du, Siru Ouyang, Ming Zhong, Tingfeng Luo, Qirong Ho, Hao Peng, Heng Ji, Jiawei Han | N/A | N/A |
| A Community-Centric Perspective for Characterizing and Detecting Anti-Asian Violence-Provoking Speech | Gaurav Verma, Rynaa Grover, Jiawei Zhou, Binny Mathew, Jordan Kraemer, Munmun De Choudhury, Srijan Kumar | N/A | N/A |
| Retaining Key Information under High Compression Ratios: Query-Guided Compressor for LLMs | Zhiwei Cao, Qian Cao, Yu Lu, Ningxin Peng, Luyang Huang, Shanbo Cheng, Jinsong Su | N/A | N/A |
| COSMIC: Mutual Information for Task-Agnostic Summarization Evaluation | Maxime Darrin, Philippe Formont, Jackie CK Cheung, Pablo Piantanida | N/A | N/A |
| ICLEF: In-Context Learning with Expert Feedback for Explainable Style Transfer | Arkadiy Saakyan, Smaranda Muresan | N/A | N/A |
| EUROPA: A Legal Multilingual Keyphrase Generation Dataset | Olivier Salaün, Frédéric Piedboeuf, Guillaume Le Berre, David Alfonso-Hermelo, Philippe Langlais | N/A | N/A |
| GLIMPSE: Pragmatically Informative Multi-Document Summarization for Scholarly Reviews | Maxime Darrin, Ines Arous, Pablo Piantanida, Jackie CK Cheung | N/A | N/A |
| MAP’s not dead yet: Uncovering true language model modes by conditioning away degeneracy | Davis Yoshida, Kartik Goyal, Kevin Gimpel | N/A | N/A |
| Peacock: A Family of Arabic Multimodal Large Language Models and Benchmarks | Fakhraddin Alwajih, El Moatez Billah Nagoudi, Gagan Bhatia, Abdelrahman Mohamed, Muhammad Abdul-Mageed | N/A | N/A |
| Generating Coherent Sequences of Visual Illustrations for Real-World Manual Tasks | João Bordalo, Vasco Ramos, Rodrigo Valério, Diogo Glória-Silva, Yonatan Bitton, Michal Yarom, Idan Szpektor, Joao Magalhaes | N/A | N/A |
| Cheetah: Natural Language Generation for 517 African Languages | Ife Adebara, AbdelRahim A. Elmadany, Muhammad Abdul-Mageed | N/A | N/A |
| TaPERA: Enhancing Faithfulness and Interpretability in Long-Form Table QA by Content Planning and Execution-based Reasoning | Yilun Zhao, Lyuhao Chen, Arman Cohan, Chen Zhao | N/A | N/A |
| KnowledgeFMath: A Knowledge-Intensive Math Reasoning Dataset in Finance Domains | Yilun Zhao, Hongjun Liu, Yitao Long, Rui Zhang, Chen Zhao, Arman Cohan | N/A | N/A |
| API-BLEND: A Comprehensive Corpora for Training and Benchmarking API LLMs | Kinjal Basu, Ibrahim Abdelaziz, Subhajit Chaudhury, Soham Dan, Maxwell Crouse, Asim Munawar, Vernon Austel, Sadhana Kumaravel, Vinod Muthusamy, Pavan Kapanipathi, Luis A. Lastras | N/A | N/A |
| LoRA-Flow: Dynamic LoRA Fusion for Large Language Models in Generative Tasks | Hanqing Wang, Bowen Ping, Shuo Wang, Xu Han, Yun Chen, Zhiyuan Liu, Maosong Sun | N/A | N/A |
| Harder Task Needs More Experts: Dynamic Routing in MoE Models | Quzhe Huang, Zhenwei An, Nan Zhuang, Mingxu Tao, Chen Zhang, Yang Jin, Kun Xu, Kun Xu, Liwei Chen, Songfang Huang, Yansong Feng | N/A | N/A |
| XLAVS-R: Cross-Lingual Audio-Visual Speech Representation Learning for Noise-Robust Speech Perception | HyoJung Han, Mohamed Anwar, Juan Pino, Wei-Ning Hsu, Marine Carpuat, Bowen Shi, Changhan Wang | N/A | N/A |
| SOTOPIA-π: Interactive Learning of Socially Intelligent Language Agents | Ruiyi Wang, Haofei Yu, Wenxin Sharon Zhang, Zhengyang Qi, Maarten Sap, Yonatan Bisk, Graham Neubig, Hao Zhu | N/A | N/A |
| ${\mathcal X}$FT: Unlocking the Power of Code Instruction Tuning by Simply Merging Upcycled Mixture-of-Experts | Yifeng Ding, Jiawei Liu, Yuxiang Wei, Lingming Zhang | N/A | N/A |
| Generalizability of Mixture of Domain-Specific Adapters from the Lens of Signed Weight Directions and its Application to Effective Model Pruning | Tuc Van Nguyen, Thai Le | N/A | N/A |
| Learning to Decode Collaboratively with Multiple Language Models | Zejiang Shen, Hunter Lang, Bailin Wang, Yoon Kim, David Sontag | N/A | N/A |
| DRAGIN: Dynamic Retrieval Augmented Generation based on the Real-time Information Needs of Large Language Models | Weihang Su, Yichen Tang, Qingyao Ai, Zhijing Wu, Yiqun Liu | N/A | N/A |
| Living in the Moment: Can Large Language Models Grasp Co-Temporal Reasoning? | Zhaochen Su, Juntao Li, Jun Zhang, Tong Zhu, Xiaoye Qu, Pan Zhou, Yan Bowen, Yu Cheng, Min Zhang | N/A | N/A |
| CritiqueLLM: Towards an Informative Critique Generation Model for Evaluation of Large Language Model Generation | Pei Ke, Bosi Wen, Andrew Zhuoer Feng, Xiao Liu, Xuanyu Lei, Jiale Cheng, Shengyuan Wang, Aohan Zeng, Yuxiao Dong, Hongning Wang, Jie Tang, Minlie Huang | N/A | N/A |
| LLMArena: Assessing Capabilities of Large Language Models in Dynamic Multi-Agent Environments | Junzhe Chen, Xuming Hu, Shuodi Liu, Shiyu Huang, Wei-Wei Tu, Zhaofeng He, Lijie Wen | N/A | N/A |
| Small But Funny: A Feedback-Driven Approach to Humor Distillation | Sahithya Ravi, Patrick Huber, Akshat Shrivastava, Vered Shwartz, Arash Einolghozati | N/A | N/A |
| Symbol-LLM: Towards Foundational Symbol-centric Interface For Large Language Models | Fangzhi Xu, Zhiyong Wu, Qiushi Sun, Siyu Ren, Fei Yuan, Shuai Yuan, Qika Lin, Yu Qiao, Jun Liu | N/A | N/A |
| From Sights to Insights: Towards Summarization of Multimodal Clinical Documents | Akash Ghosh, Mohit Singh Tomar, Abhisek Tiwari, Sriparna Saha, Jatin Avinash Salve, Setu Sinha | N/A | N/A |
| When Phrases Meet Probabilities: Enabling Open Relation Extraction with Cooperating Large Language Models | Jiaxin Wang, Lingling Zhang, Wee Sun Lee, Yujie Zhong, Liwei Kang, Jun Liu | N/A | N/A |
| Effects of diversity incentives on sample diversity and downstream model performance in LLM-based text augmentation | Jan Cegin, Branislav Pecher, Jakub Simko, Ivan Srba, Maria Bielikova, Peter Brusilovsky | N/A | N/A |
| Beyond Orthography: Automatic Recovery of Short Vowels and Dialectal Sounds in Arabic | Yassine El Kheir, Hamdy Mubarak, Ahmed Ali, Shammur Absar Chowdhury | N/A | N/A |
| Document-Level Machine Translation with Large-Scale Public Parallel Corpora | Proyag Pal, Alexandra Birch, Kenneth Heafield | N/A | N/A |
| Guardians of the Machine Translation Meta-Evaluation: Sentinel Metrics Fall In! | Stefano Perrella, Lorenzo Proietti, Alessandro Scirè, Edoardo Barba, Roberto Navigli | N/A | N/A |
| NounAtlas: Filling the Gap in Nominal Semantic Role Labeling | Roberto Navigli, Marco Lo Pinto, Pasquale Silvestri, Dennis Rotondi, Simone Ciciliano, Alessandro Scirè | N/A | N/A |
| Bridging the Empirical-Theoretical Gap in Neural Network Formal Language Learning Using Minimum Description Length | Nur Lan, Emmanuel Chemla, Roni Katzir | N/A | N/A |
| Context versus Prior Knowledge in Language Models | Kevin Du, Vésteinn Snæbjarnarson, Niklas Stoehr, Jennifer C. White, Aaron Schein, Ryan Cotterell | N/A | N/A |
| Word Matters: What Influences Domain Adaptation in Summarization? | Yinghao Li, Siyu Miao, Heyan Huang, Yang Gao | N/A | N/A |
| Visualization Recommendation with Prompt-based Reprogramming of Large Language Models | Xinhang Li, Jingbo Zhou, Wei Chen, Derong Xu, Tong Xu, Enhong Chen | N/A | N/A |
| HOLMES: Hyper-Relational Knowledge Graphs for Multi-hop Question Answering using LLMs | Pranoy Panda, Ankush Agarwal, Chaitanya Devaguptapu, Manohar Kaul, Prathosh AP | N/A | N/A |
| Toward In-Context Teaching: Adapting Examples to Students’ Misconceptions | Alexis Ross, Jacob Andreas | N/A | N/A |
| Bridging Word-Pair and Token-Level Metaphor Detection with Explainable Domain Mining | Yuan Tian, Ruike Zhang, Nan Xu, Wenji Mao | N/A | N/A |
| Faithful Logical Reasoning via Symbolic Chain-of-Thought | Jundong Xu, Hao Fei, Liangming Pan, Qian Liu, Mong-Li Lee, Wynne Hsu | N/A | N/A |
| S$^2$GSL: Incorporating Segment to Syntactic Enhanced Graph Structure Learning for Aspect-based Sentiment Analysis | Bingfeng Chen, Qihan Ouyang, Yongqi Luo, Boyan Xu, Ruichu Cai, Zhifeng Hao | N/A | N/A |
| Maverick: Efficient and Accurate Coreference Resolution Defying Recent Trends | Giuliano Martinelli, Edoardo Barba, Roberto Navigli | N/A | N/A |
| ESCoT: Towards Interpretable Emotional Support Dialogue Systems | Tenggan Zhang, Xinjie Zhang, Jinming Zhao, Li Zhou, Qin Jin | N/A | N/A |
| PathReasoner: Modeling Reasoning Path with Equivalent Extension for Logical Question Answering | Fangzhi Xu, Qika Lin, Tianzhe Zhao, Jiawei Han, Jun Liu | N/A | N/A |
| WARDEN: Multi-Directional Backdoor Watermarks for Embedding-as-a-Service Copyright Protection | Anudeex Shetty, Yue Teng, Ke He, Qiongkai Xu | N/A | N/A |
| Advancing Parameter Efficiency in Fine-tuning via Representation Editing | Muling Wu, Wenhao Liu, Xiaohua Wang, Tianlong Li, Changze Lv, Zixuan Ling, Zhu JianHao, Cenyuan Zhang, Xiaoqing Zheng, Xuanjing Huang | N/A | N/A |
| Context Consistency between Training and Inference in Simultaneous Machine Translation | Meizhi Zhong, Lemao Liu, Kehai Chen, Mingming Yang, Min Zhang | N/A | N/A |
| Using Natural Language Explanations to Improve Robustness of In-context Learning | Xuanli He, Yuxiang Wu, Oana-Maria Camburu, Pasquale Minervini, Pontus Stenetorp | N/A | N/A |
| The Earth is Flat because…: Investigating LLMs’ Belief towards Misinformation via Persuasive Conversation | Rongwu Xu, Brian S. Lin, Shujian Yang, Tianqi Zhang, Weiyan Shi, Tianwei Zhang, Zhixuan Fang, Wei Xu, Han Qiu | N/A | N/A |
| Chunk, Align, Select: A Simple Long-sequence Processing Method for Transformers | Jiawen Xie, Pengyu Cheng, Xiao Liang, Yong Dai, Nan Du | N/A | N/A |
| LooGLE: Can Long-Context Language Models Understand Long Contexts? | Jiaqi Li, Mengmeng Wang, Zilong Zheng, Muhan Zhang | N/A | N/A |
| ArchCode: Incorporating Software Requirements in Code Generation with Large Language Models | Hojae Han, Jaejin Kim, Jaeseok Yoo, Youngwon Lee, Seung-won Hwang | N/A | N/A |
| Let’s Go Real Talk: Spoken Dialogue Model for Face-to-Face Conversation | Se Jin Park, Chae Won Kim, Hyeongseop Rha, Minsu Kim, Joanna Hong, Jeonghun Yeo, Yong Man Ro | N/A | N/A |
| Combining Supervised Learning and Reinforcement Learning for Multi-Label Classification Tasks with Partial Labels | Zixia Jia, Junpeng Li, Shichuan Zhang, Anji Liu, Zilong Zheng | N/A | N/A |
| MULFE: A Multi-Level Benchmark for Free Text Model Editing | Chenhao Wang, Pengfei Cao, Zhuoran Jin, Yubo Chen, Daojian Zeng, Kang Liu, Jun Zhao | N/A | N/A |
| MobileSpeech: A Fast and High-Fidelity Framework for Mobile Zero-Shot Text-to-Speech | Shengpeng Ji, Ziyue Jiang, Wang Hanting, Jialung Zuo, Zhou Zhao | N/A | N/A |
| Spatially-Aware Speaker for Vision-and-Language Navigation Instruction Generation | Muraleekrishna Gopinathan, Martin Masek, Jumana Abu-Khalaf, David Suter | N/A | N/A |
| HiRoPE: Length Extrapolation for Code Models Using Hierarchical Position | Kechi Zhang, Ge Li, Huangzhao Zhang, Zhi Jin | N/A | N/A |
| Never Lost in the Middle: Mastering Long-Context Question Answering with Position-Agnostic Decompositional Training | Junqing He, Kunhao Pan, Xiaoqun Dong, Zhuoyang Song, Liu YiBo, Qianguo Sun, Yuxin Liang, Hao Wang, Enming Zhang, Jiaxing Zhang | N/A | N/A |
| CodeAgent: Enhancing Code Generation with Tool-Integrated Agent Systems for Real-World Repo-level Coding Challenges | Kechi Zhang, Jia Li, Ge Li, Xianjie Shi, Zhi Jin | N/A | N/A |
| When is Tree Search Useful for LLM Planning? It Depends on the Discriminator | Ziru Chen, Michael White, Ray Mooney, Ali Payani, Yu Su, Huan Sun | N/A | N/A |
| LogicBench: Towards Systematic Evaluation of Logical Reasoning Ability of Large Language Models | Mihir Parmar, Nisarg Patel, Neeraj Varshney, Mutsumi Nakamura, Man Luo, Santosh Mashetty, Arindam Mitra, Chitta Baral | N/A | N/A |
| ECBD: Evidence-Centered Benchmark Design for NLP | Yu Lu Liu, Su Lin Blodgett, Jackie CK Cheung, Vera Liao, Alexandra Olteanu, Ziang Xiao | N/A | N/A |
| Meta-Tuning LLMs to Leverage Lexical Knowledge for Generalizable Language Style Understanding | Ruohao Guo, Wei Xu, Alan Ritter | N/A | N/A |
| Reducing Privacy Risks in Online Self-Disclosures with Language Models | Yao Dou, Isadora Krsek, Tarek Naous, Anubha Kabra, Sauvik Das, Alan Ritter, Wei Xu | N/A | N/A |
| Navigating the Dual Facets: A Comprehensive Evaluation of Sequential Memory Editing in Large Language Models | Zihao Lin, Mohammad Beigi, Hongxuan Li, Yufan Zhou, Yuxiang Zhang, Qifan Wang, Wenpeng Yin, Lifu Huang | N/A | N/A |
| REFINESUMM: Self-Refining MLLM for Generating a Multimodal Summarization Dataset | Vaidehi Patil, Leonardo F. R. Ribeiro, Mengwen Liu, Mohit Bansal, Markus Dreyer | N/A | N/A |
| When Benchmarks are Targets: Revealing the Sensitivity of Large Language Model Leaderboards | Norah A. Alzahrani, Hisham Abdullah Alyahya, Yazeed Alnumay, Sultan AlRashed, Shaykhah Z. Alsubaie, Yousef Almushayqih, Faisal Abdulrahman Mirza, Nouf M. Alotaibi, Nora Al-Twairesh, Areeb Alowisheq, M Saiful Bari, Haidar Khan | N/A | N/A |
| LLM-Rubric: A Multidimensional, Calibrated Approach to Automated Evaluation of Natural Language Texts | Helia Hashemi, Jason Eisner, Corby Rosset, Benjamin Van Durme, Chris Kedzie | N/A | N/A |
| LIEDER: Linguistically-Informed Evaluation for Discourse Entity Recognition | Xiaomeng Zhu, Robert Frank | N/A | N/A |
| Evaluating Very Long-Term Conversational Memory of LLM Agents | Adyasha Maharana, Dong-Ho Lee, Sergey Tulyakov, Mohit Bansal, Francesco Barbieri, Yuwei Fang | N/A | N/A |
| Prototypical Reward Network for Data-Efficient Model Alignment | Jinghan Zhang, Xiting Wang, Yiqiao Jin, Changyu Chen, Xinhao Zhang, Kunpeng Liu | N/A | N/A |
| NEO-BENCH: Evaluating Robustness of Large Language Models with Neologisms | Jonathan Zheng, Alan Ritter, Wei Xu | N/A | N/A |
| Impacts of Misspelled Queries on Translation and Product Search | Greg Hanneman, Natawut Monaikul, Taichi Nakatani | N/A | N/A |
| Having Beer after Prayer? Measuring Cultural Bias in Large Language Models | Tarek Naous, Michael J Ryan, Alan Ritter, Wei Xu | N/A | N/A |
| Skin-in-the-Game: Decision Making via Multi-Stakeholder Alignment in LLMs | Bilgehan Sel, Priya Shanmugasundaram, Mohammad Kachuee, Kun Zhou, Ruoxi Jia, Ming Jin | N/A | N/A |
| The MERSA Dataset and a Transformer-Based Approach for Speech Emotion Recognition | Enshi Zhang, Rafael Trujillo, Christian Poellabauer | N/A | N/A |
| Transparent and Scrutable Recommendations Using Natural Language User Profiles | Jerome Ramos, Hossein A. Rahmani, Xi Wang, Xiao Fu, Aldo Lipani | N/A | N/A |
| Fora: A corpus and framework for the study of facilitated dialogue | Hope Schroeder, Deb Roy, Jad Kabbara | N/A | N/A |
| Explanation-aware Soft Ensemble Empowers Large Language Model In-context Learning | Yue Yu, Jiaming Shen, Tianqi Liu, Zhen Qin, Jing Nathan Yan, Jialu Liu, Chao Zhang, Michael Bendersky | N/A | N/A |
| What is the Best Way for ChatGPT to Translate Poetry? | Shanshan Wang, Derek F. Wong, Jingming Yao, Lidia S. Chao | N/A | N/A |
| Rephrasing the Web: A Recipe for Compute and Data-Efficient Language Modeling | Pratyush Maini, Skyler Seto, Richard He Bai, David Grangier, Yizhe Zhang, Navdeep Jaitly | N/A | N/A |
| DeCoT: Debiasing Chain-of-Thought for Knowledge-Intensive Tasks in Large Language Models via Causal Intervention | Junda Wu, Tong Yu, Xiang Chen, Haoliang Wang, Ryan A. Rossi, Sungchul Kim, Anup Rao, Julian McAuley | N/A | N/A |
| Representation Learning with Conditional Information Flow Maximization | Dou Hu, Lingwei Wei, Wei Zhou, Songlin Hu | N/A | N/A |
| GPT is Not an Annotator: The Necessity of Human Annotation in Fairness Benchmark Construction | Virginia K. Felkner, Jennifer A. Thompson, Jonathan May | N/A | N/A |
| Quantifying Contamination in Evaluating Code Generation Capabilities of Language Models | Martin Riddell, Ansong Ni, Arman Cohan | N/A | N/A |
| Language Models are Homer Simpson! Safety Re-Alignment of Fine-tuned Language Models through Task Arithmetic | Rishabh Bhardwaj, Duc Anh Do, Soujanya Poria | N/A | N/A |
| Tracking the Newsworthiness of Public Documents | Alexander Spangher, Serdar Tumgoren, Ben Welsh, Nanyun Peng, Emilio Ferrara, Jonathan May | N/A | N/A |
| EWEK-QA: Enhanced Web and Efficient Knowledge Graph Retrieval for Citation-based Question Answering Systems | Mohammad Dehghan, Mohammad Ali Alomrani, Sunyam Bagga, David Alfonso-Hermelo, Khalil Bibi, Abbas Ghaddar, Yingxue Zhang, Xiaoguang Li, Jianye Hao, Qun Liu, Jimmy Lin, Boxing Chen, Prasanna Parthasarathi, Mahdi Biparva, Mehdi Rezagholizadeh | N/A | N/A |
| Explicating the Implicit: Argument Detection Beyond Sentence Boundaries | Paul Roit, Aviv Slobodkin, Eran Hirsch, Arie Cattan, Ayal Klein, Valentina Pyatkin, Ido Dagan | N/A | N/A |
| Multi-modal Preference Alignment Remedies Degradation of Visual Instruction Tuning on Language Models | Shengzhi Li, Rongyu Lin, Shichao Pei | N/A | N/A |
| Word Embeddings Are Steers for Language Models | Chi Han, Jialiang Xu, Manling Li, Yi Fung, Chenkai Sun, Nan Jiang, Tarek F. Abdelzaher, Heng Ji | N/A | N/A |
| Multistage Collaborative Knowledge Distillation from a Large Language Model for Semi-Supervised Sequence Generation | Jiachen Zhao, Wenlong Zhao, Andrew Drozdov, Benjamin Rozonoyer, Md Arafat Sultan, Jay-Yoon Lee, Mohit Iyyer, Andrew McCallum | N/A | N/A |
| Controlled Text Generation for Black-box Language Models via Score-based Progressive Editor | Sangwon Yu, Changmin Lee, Hojin Lee, Sungroh Yoon | N/A | N/A |
| LogogramNLP: Comparing Visual and Textual Representations of Ancient Logographic Writing Systems for NLP | Danlu Chen, Freda Shi, Aditi Agarwal, Jacobo Myerston, Taylor Berg-Kirkpatrick | N/A | N/A |
| Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning | Ming Li, Yong Zhang, Shwai He, Zhitao Li, Hongyu Zhao, Jianzong Wang, Ning Cheng, Tianyi Zhou | N/A | N/A |
| Confabulation: The Surprising Value of Large Language Model Hallucinations | Peiqi Sui, Eamon Duede, Sophie Wu, Richard Jean So | N/A | N/A |
| IAPT: Instance-Aware Prompt Tuning for Large Language Models | Wei Zhu, Aaron Xuxiang Tian, Congrui Yin, Yuan Ni, Xiaoling Wang, Guotong Xie | N/A | N/A |
| Can Language Models Serve as Text-Based World Simulators? | Ruoyao Wang, Graham Todd, Ziang Xiao, Xingdi Yuan, Marc-Alexandre Côté, Peter Clark, Peter Jansen | N/A | N/A |
| FanOutQA: A Multi-Hop, Multi-Document Question Answering Benchmark for Large Language Models | Andrew Zhu, Alyssa Hwang, Liam Dugan, Chris Callison-Burch | N/A | N/A |
| Revisiting Code Similarity Evaluation with Abstract Syntax Tree Edit Distance | Yewei Song, Cedric Lothritz, Daniel Tang, Tegawendé F. Bissyandé, Jacques Klein | N/A | N/A |
| Resisting the Lure of the Skyline: Grounding Practices in Active Learning for Morphological Inflection | Saliha Muradoglu, Michael Ginn, Miikka Silfverberg, Mans Hulden | N/A | N/A |
| Speculative Contrastive Decoding | Hongyi Yuan, Keming Lu, Fei Huang, Zheng Yuan, Chang Zhou | N/A | N/A |
| RDRec: Rationale Distillation for LLM-based Recommendation | Xinfeng Wang, Jin Cui, Yoshimi Suzuki, Fumiyo Fukumoto | N/A | N/A |
| Isotropy, Clusters, and Classifiers | Timothee Mickus, Stig-Arne Grönroos, Joseph Attieh | N/A | N/A |
| Language Models Do Hard Arithmetic Tasks Easily and Hardly Do Easy Arithmetic Tasks | Andrew Gambardella, Yusuke Iwasawa, Yutaka Matsuo | N/A | N/A |
| Cleaner Pretraining Corpus Curation with Neural Web Scraping | Zhipeng Xu, Zhenghao Liu, Yukun Yan, Zhiyuan Liu, Ge Yu, Chenyan Xiong | N/A | N/A |
| Simpson’s Paradox and the Accuracy-Fluency Tradeoff in Translation | Zheng Wei Lim, Ekaterina Vylomova, Trevor Cohn, Charles Kemp | N/A | N/A |
| UltraSparseBERT: 99% Conditionally Sparse Language Modelling | Peter Belcak, Roger Wattenhofer | N/A | N/A |
| SceMQA: A Scientific College Entrance Level Multimodal Question Answering Benchmark | Zhenwen Liang, Kehan Guo, Gang Liu, Taicheng Guo, Yujun Zhou, Tianyu Yang, Jiajun Jiao, Renjie Pi, Jipeng Zhang, Xiangliang Zhang | N/A | N/A |
| On the Role of Long-tail Knowledge in Retrieval Augmented Large Language Models | Dongyang Li, Junbing Yan, Taolin Zhang, Chengyu Wang, Xiaofeng He, Longtao Huang, Hui Xue, Jun Huang | N/A | N/A |
| IEPile: Unearthing Large Scale Schema-Conditioned Information Extraction Corpus | Honghao Gui, Lin Yuan, Hongbin Ye, Ningyu Zhang, Mengshu Sun, Lei Liang, Huajun Chen | N/A | N/A |
| Bi-Directional Multi-Granularity Generation Framework for Knowledge Graph-to-Text with Large Language Model | Haowei Du, Chen Li, Dinghao Zhang, Dongyan Zhao | N/A | N/A |
| Code-Switching Can be Better Aligners: Advancing Cross-Lingual SLU through Representation-Level and Prediction-Level Alignment | Zhihong Zhu, Xuxin Cheng, Zhanpeng Chen, Xianwei Zhuang, Zhiqi Huang, Yuexian Zou | N/A | N/A |
| AFLoRA: Adaptive Freezing of Low Rank Adaptation in Parameter Efficient Fine-Tuning of Large Models | Zeyu Liu, Souvik Kundu, Anni Li, Junrui Wan, Lianghao Jiang, Peter Anthony Beerel | N/A | N/A |
| DDPrompt: Differential Diversity Prompting in Large Language Models | Lin Mu, Wenhao Zhang, Yiwen Zhang, Peiquan Jin | N/A | N/A |
| Monotonic Representation of Numeric Attributes in Language Models | Benjamin Heinzerling, Kentaro Inui | N/A | N/A |
| Two Issues with Chinese Spelling Correction and A Refinement Solution | Changxuan Sun, Linlin She, Xuesong Lu | N/A | N/A |
| Linear-time Minimum Bayes Risk Decoding with Reference Aggregation | Jannis Vamvas, Rico Sennrich | N/A | N/A |
| DynaSemble: Dynamic Ensembling of Textual and Structure-Based Models for Knowledge Graph Completion | Ananjan Nandi, Navdeep Kaur, Parag Singla, Mausam | N/A | N/A |
| Fine-Tuning Pre-Trained Language Models with Gaze Supervision | Shuwen Deng, Paul Prasse, David Robert Reich, Tobias Scheffer, Lena Ann Jäger | N/A | N/A |
| Growing Trees on Sounds: Assessing Strategies for End-to-End Dependency Parsing of Speech | Adrien Pupier, Maximin Coavoux, Jérôme Goulian, Benjamin Lecouteux | N/A | N/A |
| Sketch-Guided Constrained Decoding for Boosting Blackbox Large Language Models without Logit Access | Saibo Geng, Berkay Döner, Chris Wendler, Martin Josifoski, Robert West | N/A | N/A |
| On the Semantic Latent Space of Diffusion-Based Text-To-Speech Models | Miri Varshavsky, Roy Hirsch, Regev Cohen, Tomer Golany, Daniel Freedman, Ehud Rivlin | N/A | N/A |
| Learnable Privacy Neurons Localization in Language Models | Ruizhe Chen, Tianxiang Hu, Yang Feng, Zuozhu Liu | N/A | N/A |
| Is the Pope Catholic? Yes, the Pope is Catholic. Generative Evaluation of Non-Literal Intent Resolution in LLMs | Akhila Yerukola, Saujas Vaduguru, Daniel Fried, Maarten Sap | N/A | N/A |
| Generating Harder Cross-document Event Coreference Resolution Datasets using Metaphoric Paraphrasing | Shafiuddin Rehan Ahmed, Zhiyong Wang, George Arthur Baker, Kevin Stowe, James H. Martin | N/A | N/A |
| Soft Self-Consistency Improves Language Model Agents | Han Wang, Archiki Prasad, Elias Stengel-Eskin, Mohit Bansal | N/A | N/A |
| RecGPT: Generative Pre-training for Text-based Recommendation | Hoang Ngo, Dat Quoc Nguyen | N/A | N/A |
| MTP: A Dataset for Multi-Modal Turning Points in Casual Conversations | Gia-Bao Dinh Ho, Chang Wei Tan, Zahra Zamanzadeh Darban, Mahsa Salehi, Reza Haf, Wray Buntine | N/A | N/A |
| What Do Dialect Speakers Want? A Survey of Attitudes Towards Language Technology for German Dialects | Verena Blaschke, Christoph Purschke, Hinrich Schuetze, Barbara Plank | N/A | N/A |
| What Does Parameter-free Probing Really Uncover? | Tommi Buder-Gröndahl | N/A | N/A |
| ATLAS: Improving Lay Summarisation with Attribute-based Control | Zhihao Zhang, Tomas Goldsack, Carolina Scarton, Chenghua Lin | N/A | N/A |
| EmbSpatial-Bench: Benchmarking Spatial Understanding for Embodied Tasks with Large Vision-Language Models | Mengfei Du, Binhao Wu, Zejun Li, Xuanjing Huang, Zhongyu Wei | N/A | N/A |
| Understanding the Effects of Noise in Text-to-SQL: An Examination of the BIRD-Bench Benchmark | Niklas Wretblad, Fredrik Gordh Riseby, Rahul Biswas, Amin Ahmadi, Oskar Holmström | N/A | N/A |
| Dwell in the Beginning: How Language Models Embed Long Documents for Dense Retrieval | João Coelho, Bruno Martins, Joao Magalhaes, Jamie Callan, Chenyan Xiong | N/A | N/A |
| That’s Optional: A Contemporary Exploration of “that” Omission in English Subordinate Clauses | Ella Rabinovich | N/A | N/A |
| Do Large Language Models Discriminate in Hiring Decisions on the Basis of Race, Ethnicity, and Gender? | Haozhe An, Christabel Acquaye, Colin Wang, Zongxia Li, Rachel Rudinger | N/A | N/A |
| Explainability and Hate Speech: Structured Explanations Make Social Media Moderators Faster | Agostina Calabrese, Leonardo Neves, Neil Shah, Maarten W. Bos, Björn Ross, Mirella Lapata, Francesco Barbieri | N/A | N/A |
| Getting Serious about Humor: Crafting Humor Datasets with Unfunny Large Language Models | Zachary Horvitz, Jingru Chen, Rahul Aditya, Harshvardhan Srivastava, Robert West, Zhou Yu, Kathleen McKeown | N/A | N/A |
| Estimating the Level of Dialectness Predicts Inter-annotator Agreement in Multi-dialect Arabic Datasets | Amr Keleg, Walid Magdy, Sharon Goldwater | N/A | N/A |
| Born Differently Makes a Difference: Counterfactual Study of Bias in Biography Generation from a Data-to-Text Perspective | Biaoyan Fang, Ritvik Dinesh, Xiang Dai, Sarvnaz Karimi | N/A | N/A |
| Greed is All You Need: An Evaluation of Tokenizer Inference Methods | Omri Uzan, Craig W Schmidt, Chris Tanner, Yuval Pinter | N/A | N/A |
| Sign Language Translation with Sentence Embedding Supervision | Yasser Hamidullah, Josef van Genabith, Cristina España-Bonet | N/A | N/A |
| STREAM: Simplified Topic Retrieval, Exploration, and Analysis Module | Anton Frederik Thielmann, Arik Reuter, Christoph Weisser, Gillian Kant, Manish Kumar, Benjamin Säfken | N/A | N/A |
| DocFinQA: A Long-Context Financial Reasoning Dataset | Varshini Reddy, Rik Koncel-Kedziorski, Viet Dac Lai, Michael Krumdick, Charles Lovering, Chris Tanner | N/A | N/A |
| MaskLID: Code-Switching Language Identification through Iterative Masking | Amir Hossein Kargaran, François Yvon, Hinrich Schuetze | N/A | N/A |
| An Empirical Analysis on Large Language Models in Debate Evaluation | Xinyi Liu, Pinxin Liu, Hangfeng He | N/A | N/A |
| Fine-Tuned Machine Translation Metrics Struggle in Unseen Domains | Vilém Zouhar, Shuoyang Ding, Anna Currey, Tatyana Badeka, Jenyuan Wang, Brian Thompson | N/A | N/A |
| IndicIRSuite: Multilingual Dataset and Neural Information Models for Indian Languages | Saiful Haq, Ashutosh Sharma, Omar Khattab, Niyati Chhaya, Pushpak Bhattacharyya | N/A | N/A |
| AGR: Reinforced Causal Agent-Guided Self-explaining Rationalization | Yunxiao Zhao, Zhiqiang Wang, Xiaoli Li, Jiye Liang, Ru Li | N/A | N/A |
| Shoulders of Giants: A Look at the Degree and Utility of Openness in NLP Research | Surangika Ranathunga, Nisansa de Silva, Dilith Jayakody, Aloka Fernando | N/A | N/A |
| The Probabilities Also Matter: A More Faithful Metric for Faithfulness of Free-Text Explanations in Large Language Models | Noah Yamamoto Siegel, Oana-Maria Camburu, Nicolas Heess, Maria Perez-Ortiz | N/A | N/A |
| Don’t Buy it! Reassessing the Ad Understanding Abilities of Contrastive Multimodal Models | Anna Bavaresco, Alberto Testoni, Raquel Fernández | N/A | N/A |
| Naming, Describing, and Quantifying Visual Objects in Humans and LLMs | Alberto Testoni, Juell Sprott, Sandro Pezzelle | N/A | N/A |
| Are LLMs classical or nonmonotonic reasoners? Lessons from generics | Alina Leidinger, Robert Van Rooij, Ekaterina Shutova | N/A | N/A |
| ConstitutionalExperts: Training a Mixture of Principle-based Prompts | Savvas Petridis, Ben Wedin, Ann Yuan, James Wexler, Nithum Thain | N/A | N/A |
| Time Sensitive Knowledge Editing through Efficient Finetuning | Xiou Ge, Ali Mousavi, Edouard Grave, Armand Joulin, Kun Qian, Benjamin Han, Mostafa Arefiyan, Yunyao Li | N/A | N/A |
| PRewrite: Prompt Rewriting with Reinforcement Learning | Weize Kong, Spurthi Amba Hombaiah, Mingyang Zhang, Qiaozhu Mei, Michael Bendersky | N/A | N/A |
| SeeGULL Multilingual: a Dataset of Geo-Culturally Situated Stereotypes | Mukul Bhutani, Kevin Robinson, Vinodkumar Prabhakaran, Shachi Dave, Sunipa Dev | N/A | N/A |
| Paraphrasing in Affirmative Terms Improves Negation Understanding | MohammadHossein Rezaei, Eduardo Blanco | N/A | N/A |
| Exploring Conditional Variational Mechanism to Pinyin Input Method for Addressing One-to-Many Mappings in Low-Resource Scenarios | Bin Sun, Jianfeng Li, Hao Zhou, Fandong Meng, Kan Li, Jie Zhou | N/A | N/A |
| Consistency Training by Synthetic Question Generation for Conversational Question Answering | Hamed Hematian Hemati, Hamid Beigy | N/A | N/A |
| How Good is Zero-Shot MT Evaluation for Low Resource Indian Languages? | Anushka Singh, Ananya B. Sai, Raj Dabre, Ratish Puduppully, Anoop Kunchukuttan, Mitesh M Khapra | N/A | N/A |
| Zero-Shot Cross-Lingual Reranking with Large Language Models for Low-Resource Languages | Mofetoluwa Adeyemi, Akintunde Oladipo, Ronak Pradeep, Jimmy Lin | N/A | N/A |
| Cross-Modal Projection in Multimodal LLMs Doesn’t Really Project Visual Attributes to Textual Space | Gaurav Verma, Minje Choi, Kartik Sharma, Jamelle Watson-Daniels, Sejoon Oh, Srijan Kumar | N/A | N/A |
| Guidance-Based Prompt Data Augmentation in Specialized Domains for Named Entity Recognition | Hyeonseok Kang, Hyein Seo, Jeesu Jung, Sangkeun Jung, Du-Seong Chang, Riwoo Chung | N/A | N/A |
| Aligning Large Language Models via Fine-grained Supervision | Dehong Xu, Liang Qiu, Minseok Kim, Faisal Ladhak, Jaeyoung Do | N/A | N/A |
| Annotating FrameNet via Structure-Conditioned Language Generation | Xinyue Cui, Swabha Swayamdipta | N/A | N/A |
| DUAL-REFLECT: Enhancing Large Language Models for Reflective Translation through Dual Learning Feedback Mechanisms | Andong Chen, Lianzhang Lou, Kehai Chen, Xuefeng Bai, Yang Xiang, Muyun Yang, Tiejun Zhao, Min Zhang | N/A | N/A |
| Towards Artwork Explanation in Large-scale Vision Language Models | Kazuki Hayashi, Yusuke Sakai, Hidetaka Kamigaito, Katsuhiko Hayashi, Taro Watanabe | N/A | N/A |
| On the Hallucination in Simultaneous Machine Translation | Meizhi Zhong, Kehai Chen, Zhengshan Xue, Lemao Liu, Mingming Yang, Min Zhang | N/A | N/A |
| Self-Augmented In-Context Learning for Unsupervised Word Translation | Yaoyiran Li, Anna Korhonen, Ivan Vulić | N/A | N/A |
| RAM-EHR: Retrieval Augmentation Meets Clinical Predictions on Electronic Health Records | Ran Xu, Wenqi Shi, Yue Yu, Yuchen Zhuang, Bowen Jin, May Dongmei Wang, Joyce C. Ho, Carl Yang | N/A | N/A |
| Quantized Side Tuning: Fast and Memory-Efficient Tuning of Quantized Large Language Models | Zhengxin Zhang, Dan Zhao, Xupeng Miao, Gabriele Oliaro, Zhihao Zhang, Qing Li, Yong Jiang, Zhihao Jia | N/A | N/A |
| Unsupervised Multimodal Clustering for Semantics Discovery in Multimodal Utterances | Hanlei Zhang, Hua Xu, Fei Long, Xin Wang, Kai Gao | N/A | N/A |
| MAGE: Machine-generated Text Detection in the Wild | Yafu Li, Qintong Li, Leyang Cui, Wei Bi, Zhilin Wang, Longyue Wang, Linyi Yang, Shuming Shi, Yue Zhang | N/A | N/A |
| PrivLM-Bench: A Multi-level Privacy Evaluation Benchmark for Language Models | Haoran Li, Dadi Guo, Donghao Li, Wei Fan, Qi Hu, Xin Liu, Chunkit Chan, Duanyi YAO, Yuan Yao, Yangqiu Song | N/A | N/A |
| GenTranslate: Large Language Models are Generative Multilingual Speech and Machine Translators | Yuchen Hu, Chen Chen, Chao-Han Huck Yang, Ruizhe Li, Dong Zhang, Zhehuai Chen, EngSiong Chng | N/A | N/A |
| Exploring Chain-of-Thought for Multi-modal Metaphor Detection | Yanzhi Xu, Yueying Hua, Shichen Li, Zhongqing Wang | N/A | N/A |
| BitDistiller: Unleashing the Potential of Sub-4-Bit LLMs via Self-Distillation | DaYou Du, Yijia Zhang, Shijie Cao, Jiaqi Guo, Ting Cao, Xiaowen Chu, Ningyi Xu | N/A | N/A |
| A Unified Temporal Knowledge Graph Reasoning Model Towards Interpolation and Extrapolation | Kai Chen, Ye Wang, Yitong Li, Aiping Li, Han Yu, Xin Song | N/A | N/A |
| Unsupervised Information Refinement Training of Large Language Models for Retrieval-Augmented Generation | Shicheng Xu, Liang Pang, Mo Yu, Fandong Meng, Huawei Shen, Xueqi Cheng, Jie Zhou | N/A | N/A |
| CSCD-NS: a Chinese Spelling Check Dataset for Native Speakers | Yong Hu, Fandong Meng, Jie Zhou | N/A | N/A |
| Evaluating Dynamic Topic Models | Charu Karakkaparambil James, Mayank Nagda, Nooshin Haji Ghassemi, Marius Kloft, Sophie Fellenz | N/A | N/A |
| How Abilities in Large Language Models are Affected by Supervised Fine-tuning Data Composition | Guanting Dong, Hongyi Yuan, Keming Lu, Chengpeng Li, Mingfeng Xue, Dayiheng Liu, Wei Wang, Zheng Yuan, Chang Zhou, Jingren Zhou | N/A | N/A |
| Through the Lens of Split Vote: Exploring Disagreement, Difficulty and Calibration in Legal Case Outcome Classification | Shanshan Xu, Santosh T.Y.S.S, Oana Ichim, Barbara Plank, Matthias Grabmair | N/A | N/A |
| Inference to the Best Explanation in Large Language Models | Dhairya Dalal, Marco Valentino, Andre Freitas, Paul Buitelaar | N/A | N/A |
| A Novel Cartography-Based Curriculum Learning Method Applied on RoNLI: The First Romanian Natural Language Inference Corpus | Eduard Poesina, Cornelia Caragea, Radu Tudor Ionescu | N/A | N/A |
| DeVAn: Dense Video Annotation for Video-Language Models | Tingkai Liu, Yunzhe Tao, Haogeng Liu, Qihang Fang, Ding Zhou, Huaibo Huang, Ran He, Hongxia Yang | N/A | N/A |
| MinPrompt: Graph-based Minimal Prompt Data Augmentation for Few-shot Question Answering | Xiusi Chen, Jyun-Yu Jiang, Wei-Cheng Chang, Cho-Jui Hsieh, Hsiang-Fu Yu, Wei Wang | N/A | N/A |
| SportsMetrics: Blending Text and Numerical Data to Understand Information Fusion in LLMs | Yebowen Hu, Kaiqiang Song, Sangwoo Cho, Xiaoyang Wang, Hassan Foroosh, Dong Yu, Fei Liu | N/A | N/A |
| SciMON: Scientific Inspiration Machines Optimized for Novelty | Qingyun Wang, Doug Downey, Heng Ji, Tom Hope | N/A | N/A |
| Expedited Training of Visual Conditioned Language Generation via Redundancy Reduction | Yiren Jian, Tingkai Liu, Yunzhe Tao, Chunhui Zhang, Soroush Vosoughi, Hongxia Yang | N/A | N/A |
| Confidence Under the Hood: An Investigation into the Confidence-Probability Alignment in Large Language Models | Abhishek Kumar, Robert Morabito, Sanzhar Umbet, Jad Kabbara, Ali Emami | N/A | N/A |
| Retrieval-Augmented Multilingual Knowledge Editing | Weixuan Wang, Barry Haddow, Alexandra Birch | N/A | N/A |
| Picturing Ambiguity: A Visual Twist on the Winograd Schema Challenge | Brendan Park, Madeline Janecek, Naser Ezzati-Jivan, Yifeng Li, Ali Emami | N/A | N/A |
| Subtle Biases Need Subtler Measures: Dual Metrics for Evaluating Representative and Affinity Bias in Large Language Models | Abhishek Kumar, Sarfaroz Yunusov, Ali Emami | N/A | N/A |
| Framing in the Presence of Supporting Data: A Case Study in U.S. Economic News | Alexandria Leto, Elliot E. Pickens, Coen D. Needell, David Rothschild, Maria Leonor Pacheco | N/A | N/A |
| Mementos: A Comprehensive Benchmark for Multimodal Large Language Model Reasoning over Image Sequences | Xiyao Wang, Yuhang Zhou, Xiaoyu Liu, Hongjin Lu, Yuancheng Xu, Feihong He, Jaehong Yoon, Taixi Lu, Fuxiao Liu, Gedas Bertasius, Mohit Bansal, Huaxiu Yao, Furong Huang | N/A | N/A |
| TTM-RE: Memory-Augmented Document-Level Relation Extraction | Chufan Gao, Xuan Wang, Jimeng Sun | N/A | N/A |
| Answer is All You Need: Instruction-following Text Embedding via Answering the Question | Letian Peng, Yuwei Zhang, Zilong Wang, Jayanth Srinivasa, Gaowen Liu, Zihan Wang, Jingbo Shang | N/A | N/A |
| Explore Spurious Correlations at the Concept Level in Language Models for Text Classification | Yuhang Zhou, Paiheng Xu, Xiaoyu Liu, Bang An, Wei Ai, Furong Huang | N/A | N/A |
| Every Answer Matters: Evaluating Commonsense with Probabilistic Measures | Qi Cheng, Michael Boratko, Pranay Kumar Yelugam, Tim O’Gorman, Nalini Singh, Andrew McCallum, Xiang Lorraine Li | N/A | N/A |
| GradSafe: Detecting Jailbreak Prompts for LLMs via Safety-Critical Gradient Analysis | Yueqi XIE, Minghong Fang, Renjie Pi, Neil Zhenqiang Gong | N/A | N/A |
| How Johnny Can Persuade LLMs to Jailbreak Them: Rethinking Persuasion to Challenge AI Safety by Humanizing LLMs | Yi Zeng, Hongpeng Lin, Jingwen Zhang, Diyi Yang, Ruoxi Jia, Weiyan Shi | N/A | N/A |
| Pouring Your Heart Out: Investigating the Role of Figurative Language in Online Expressions of Empathy | Gyeongeun Lee, Christina Wong, Meghan Guo, Natalie Parde | N/A | N/A |
| An Information-Theoretic Approach to Analyze NLP Classification Tasks | Luran Wang, Mark Gales, Vatsal Raina | N/A | N/A |
| Can Your Model Tell a Negation from an Implicature? Unravelling Challenges With Intent Encoders | Yuwei Zhang, Siffi Singh, Sailik Sengupta, Igor Shalyminov, Hang Su, Hwanjun Song, Saab Mansour | N/A | N/A |
| Wav2Gloss: Generating Interlinear Glossed Text from Speech | Taiqi He, Kwanghee Choi, Lindia Tjuatja, Nathaniel Romney Robinson, Jiatong Shi, Shinji Watanabe, Graham Neubig, David R Mortensen, Lori Levin | N/A | N/A |
| Leveraging Codebook Knowledge with NLI and ChatGPT for Zero-Shot Political Relation Classification | Yibo Hu, Erick Skorupa Parolin, Latifur Khan, Patrick Brandt, Javier Osorio, Vito D’Orazio | N/A | N/A |
| SPOR: A Comprehensive and Practical Evaluation Method for Compositional Generalization in Data-to-Text Generation | Ziyao Xu, Houfeng Wang | N/A | N/A |
| OPEx: A Component-Wise Analysis of LLM-Centric Agents in Embodied Instruction Following | Haochen Shi, Zhiyuan Sun, Xingdi Yuan, Marc-Alexandre Côté, Bang Liu | N/A | N/A |
| Multimodal Instruction Tuning with Conditional Mixture of LoRA | Ying Shen, Zhiyang Xu, Qifan Wang, Yu Cheng, Wenpeng Yin, Lifu Huang | N/A | N/A |
| DocLens: Multi-aspect Fine-grained Medical Text Evaluation | Yiqing Xie, Sheng Zhang, Hao Cheng, Pengfei Liu, Zelalem Gero, Cliff Wong, Tristan Naumann, Hoifung Poon, Carolyn Rose | N/A | N/A |
| FOFO: A Benchmark to Evaluate LLMs’ Format-Following Capability | Congying Xia, Chen Xing, Jiangshu Du, Xinyi Yang, Yihao Feng, Ran Xu, Wenpeng Yin, Caiming Xiong | N/A | N/A |
| Hyper-CL: Conditioning Sentence Representations with Hypernetworks | Young Hyun Yoo, Jii Cha, Changhyeon Kim, Taeuk Kim | N/A | N/A |
| Analysis of Multi-Source Language Training in Cross-Lingual Transfer | Seonghoon Lim, Taejun Yun, Jinhyeon Kim, Jihun Choi, Taeuk Kim | N/A | N/A |
| ABEX: Data Augmentation for Low-Resource NLU via Expanding Abstract Descriptions | Sreyan Ghosh, Utkarsh Tyagi, Sonal Kumar, Chandra Kiran Reddy Evuru, Ramaneswaran S, S Sakshi, Dinesh Manocha | N/A | N/A |
| The Belebele Benchmark: a Parallel Reading Comprehension Dataset in 122 Language Variants | Lucas Bandarkar, Davis Liang, Benjamin Muller, Mikel Artetxe, Satya Narayan Shukla, Donald Husa, Naman Goyal, Abhinandan Krishnan, Luke Zettlemoyer, Madian Khabsa | N/A | N/A |
| Learn from Failure: Fine-tuning LLMs with Trial-and-Error Data for Intuitionistic Propositional Logic Proving | Chenyang An, Zhibo Chen, Qihao Ye, Emily First, Letian Peng, Jiayun Zhang, Zihan Wang, Sorin Lerner, Jingbo Shang | N/A | N/A |
| Interactive Text-to-Image Retrieval with Large Language Models: A Plug-and-Play Approach | Saehyung Lee, Sangwon Yu, Junsung Park, Jihun Yi, Sungroh Yoon | N/A | N/A |
| IMBUE: Improving Interpersonal Effectiveness through Simulation and Just-in-time Feedback with Human-Language Model Interaction | Inna Wanyin Lin, Ashish Sharma, Christopher Michael Rytting, Adam S Miner, Jina Suh, Tim Althoff | N/A | N/A |
| Token-wise Influential Training Data Retrieval for Large Language Models | Huawei Lin, Jikai Long, Zhaozhuo Xu, Weijie Zhao | N/A | N/A |
| Tree-of-Counterfactual Prompting for Zero-Shot Stance Detection | Maxwell Weinzierl, Sanda Harabagiu | N/A | N/A |
| VisualWebArena: Evaluating Multimodal Agents on Realistic Visual Web Tasks | Jing Yu Koh, Robert Lo, Lawrence Jang, Vikram Duvvur, Ming Chong Lim, Po-Yu Huang, Graham Neubig, Shuyan Zhou, Ruslan Salakhutdinov, Daniel Fried | N/A | N/A |
| FineSurE: Fine-grained Summarization Evaluation using LLMs | Hwanjun Song, Hang Su, Igor Shalyminov, Jason Cai, Saab Mansour | N/A | N/A |
| Tuning Large Multimodal Models for Videos using Reinforcement Learning from AI Feedback | Daechul Ahn, Yura Choi, Youngjae Yu, Dongyeop Kang, Jonghyun Choi | N/A | N/A |
| Prompt Refinement with Image Pivot for Text-to-Image Generation | Jingtao Zhan, Qingyao Ai, Yiqun LIU, Yingwei Pan, Ting Yao, Jiaxin Mao, Shaoping Ma, Tao Mei | N/A | N/A |
| The Heuristic Core: Understanding Subnetwork Generalization in Pretrained Language Models | Adithya Bhaskar, Dan Friedman, Danqi Chen | N/A | N/A |
| Striking Gold in Advertising: Standardization and Exploration of Ad Text Generation | Masato Mita, Soichiro Murakami, Akihiko Kato, Peinan Zhang | N/A | N/A |
| AbsInstruct: Eliciting Abstraction Ability from LLMs through Explanation Tuning with Plausibility Estimation | Zhaowei Wang, Wei Fan, Qing Zong, Hongming Zhang, Sehyun Choi, Tianqing Fang, Xin Liu, Yangqiu Song, Ginny Wong, Simon See | N/A | N/A |
| Reflect-RL: Two-Player Online RL Fine-Tuning for LMs | Runlong Zhou, Simon Shaolei Du, Beibin Li | N/A | N/A |
| Can ChatGPT’s Performance be Improved on Verb Metaphor Detection Tasks? Bootstrapping and Combining Tacit Knowledge | Cheng Yang, Puli Chen, Qingbao Huang | N/A | N/A |
| Self-Distillation Bridges Distribution Gap in Language Model Fine-Tuning | Zhaorui Yang, Tianyu Pang, Haozhe Feng, Han Wang, Wei Chen, Minfeng Zhu, Qian Liu | N/A | N/A |
| An Information Bottleneck Perspective for Effective Noise Filtering on Retrieval-Augmented Generation | Kun Zhu, Xiaocheng Feng, Xiyuan Du, Yuxuan Gu, Weijiang Yu, Haotian Wang, Qianglong Chen, Zheng Chu, Jingchang Chen, Bing Qin | N/A | N/A |
| RORA: Robust Free-Text Rationale Evaluation | Zhengping Jiang, Yining Lu, Hanjie Chen, Daniel Khashabi, Benjamin Van Durme, Anqi Liu | N/A | N/A |
| Tell Me More! Towards Implicit User Intention Understanding of Language Model Driven Agents | Cheng Qian, Bingxiang He, Zhong Zhuang, Jia Deng, Yujia Qin, Xin Cong, Zhong Zhang, Jie Zhou, Yankai Lin, Zhiyuan Liu, Maosong Sun | N/A | N/A |
| Multimodal ArXiv: A Dataset for Improving Scientific Comprehension of Large Vision-Language Models | Lei Li, Yuqi Wang, Runxin Xu, Peiyi Wang, Xiachong Feng, Lingpeng Kong, Qi Liu | N/A | N/A |
| L-Eval: Instituting Standardized Evaluation for Long Context Language Models | Chenxin An, Shansan Gong, Ming Zhong, Xingjian Zhao, Mukai Li, Jun Zhang, Lingpeng Kong, Xipeng Qiu | N/A | N/A |
| DIALECTBENCH: An NLP Benchmark for Dialects, Varieties, and Closely-Related Languages | Fahim Faisal, Orevaoghene Ahia, Aarohi Srivastava, Kabir Ahuja, David Chiang, Yulia Tsvetkov, Antonios Anastasopoulos | N/A | N/A |
| InstructProtein: Aligning Human and Protein Language via Knowledge Instruction | Zeyuan Wang, Qiang Zhang, Keyan Ding, Ming Qin, Xiang Zhuang, Xiaotong Li, Huajun Chen | N/A | N/A |
| Causal-Guided Active Learning for Debiasing Large Language Models | Zhouhao Sun, Li Du, Xiao Ding, Yixuan Ma, Yang Zhao, Kaitao Qiu, Ting Liu, Bing Qin | N/A | N/A |
| ConSiDERS-The-Human Evaluation Framework: Rethinking Human Evaluation for Generative Large Language Models | Aparna Elangovan, Ling Liu, Lei Xu, Sravan Babu Bodapati, Dan Roth | N/A | N/A |
| Linguistically Conditioned Semantic Textual Similarity | Jingxuan Tu, Keer Xu, Liulu Yue, Bingyang Ye, Kyeongmin Rim, James Pustejovsky | N/A | N/A |
| Navigate through Enigmatic Labyrinth A Survey of Chain of Thought Reasoning: Advances, Frontiers and Future | Zheng Chu, Jingchang Chen, Qianglong Chen, Weijiang Yu, Tao He, Haotian Wang, Weihua Peng, Ming Liu, Bing Qin, Ting Liu | N/A | N/A |
| TimeBench: A Comprehensive Evaluation of Temporal Reasoning Abilities in Large Language Models | Zheng Chu, Jingchang Chen, Qianglong Chen, Weijiang Yu, Haotian Wang, Ming Liu, Bing Qin | N/A | N/A |
| BeamAggR: Beam Aggregation Reasoning over Multi-source Knowledge for Multi-hop Question Answering | Zheng Chu, Jingchang Chen, Qianglong Chen, Haotian Wang, Kun Zhu, Xiyuan Du, Weijiang Yu, Ming Liu, Bing Qin | N/A | N/A |
| ANALOGYKB: Unlocking Analogical Reasoning of Language Models with A Million-scale Knowledge Base | Siyu Yuan, Jiangjie Chen, Changzhi Sun, Jiaqing Liang, Yanghua Xiao, Deqing Yang | N/A | N/A |
| TaSL: Continual Dialog State Tracking via Task Skill Localization and Consolidation | Yujie Feng, Xu Chu, Yongxin Xu, Guangyuan SHI, Bo LIU, Xiao-Ming Wu | N/A | N/A |
| DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models | Damai Dai, Chengqi Deng, Chenggang Zhao, R.X. Xu, Huazuo Gao, Deli Chen, Jiashi Li, Wangding Zeng, Xingkai Yu, Y. Wu, Zhenda Xie, Y.K. Li, Panpan Huang, Fuli Luo, Chong Ruan, Zhifang Sui, Wenfeng Liang | N/A | N/A |
| Grounding Language Model with Chunking-Free In-Context Retrieval | Hongjin Qian, Zheng Liu, Kelong Mao, Yujia Zhou, Zhicheng Dou | N/A | N/A |
| Advancing Abductive Reasoning in Knowledge Graphs through Complex Logical Hypothesis Generation | Jiaxin Bai, Yicheng Wang, Tianshi Zheng, Yue Guo, Xin Liu, Yangqiu Song | N/A | N/A |
| Active Prompting with Chain-of-Thought for Large Language Models | Shizhe Diao, Pengcheng Wang, Yong Lin, Rui Pan, Xiang Liu, Tong Zhang | N/A | N/A |
| EasyGen: Easing Multimodal Generation with BiDiffuser and LLMs | Xiangyu Zhao, Bo LIU, Qijiong Liu, Guangyuan SHI, Xiao-Ming Wu | N/A | N/A |
| Rewriting the Code: A Simple Method for Large Language Model Augmented Code Search | Haochen Li, Xin Zhou, Zhiqi Shen | N/A | N/A |
| A Multidimensional Framework for Evaluating Lexical Semantic Change with Social Science Applications | Naomi Baes, Nick Haslam, Ekaterina Vylomova | N/A | N/A |
| Mitigating Catastrophic Forgetting in Large Language Models with Self-Synthesized Rehearsal | Jianheng Huang, Leyang Cui, Ante Wang, chengyiyang, Xinting Liao, Linfeng Song, Junfeng Yao, Jinsong Su | N/A | N/A |
| Enhancing Large Language Models in Coding Through Multi-Perspective Self-Consistency | Baizhou Huang, Shuai Lu, Xiaojun Wan, Nan Duan | N/A | N/A |
| Citation-Enhanced Generation for LLM-based Chatbots | Weitao Li, Junkai Li, Weizhi Ma, Yang Liu | N/A | N/A |
| Transitive Consistency Constrained Learning for Entity-to-Entity Stance Detection | Haoyang Wen, Eduard Hovy, Alexander G Hauptmann | N/A | N/A |
| Feature-Adaptive and Data-Scalable In-Context Learning | Jiahao Li, Quan Wang, Licheng Zhang, Guoqing Jin, Zhendong Mao | N/A | N/A |
| Probing the Multi-turn Planning Capabilities of LLMs via 20 Question Games | Yizhe Zhang, Jiarui Lu, Navdeep Jaitly | N/A | N/A |
| WaterBench: Towards Holistic Evaluation of Watermarks for Large Language Models | Shangqing Tu, Yuliang Sun, Yushi Bai, Jifan Yu, Lei Hou, Juanzi Li | N/A | N/A |
| Dependency Transformer Grammars: Integrating Dependency Structures into Transformer Language Models | Yida Zhao, Chao Lou, Kewei Tu | N/A | N/A |
| A Non-autoregressive Generation Framework for End-to-End Simultaneous Speech-to-Any Translation | Zhengrui Ma, Qingkai Fang, Shaolei Zhang, Shoutao Guo, Yang Feng, Min Zhang | N/A | N/A |
| PsychoGAT: A Novel Psychological Measurement Paradigm through Interactive Fiction Games with LLM Agents | Qisen Yang, Zekun Wang, Honghui Chen, Shenzhi Wang, Yifan Pu, Xin Gao, Wenhao Huang, Shiji Song, Gao Huang | N/A | N/A |
| Probing Language Models for Pre-training Data Detection | Zhenhua Liu, Tong Zhu, Chuanyuan Tan, Bing Liu, Haonan Lu, Wenliang Chen | N/A | N/A |
| Analyzing Temporal Complex Events with Large Language Models? A Benchmark towards Temporal, Long Context Understanding | Zhihan Zhang, Yixin Cao, Chenchen Ye, Yunshan Ma, Lizi Liao, Tat-Seng Chua | N/A | N/A |
| IBSEN: Director-Actor Agent Collaboration for Controllable and Interactive Drama Script Generation | Senyu Han, Lu Chen, Li-Min Lin, Zhengshan Xu, Kai Yu | N/A | N/A |
| Language Model Adaption for Reinforcement Learning with Natural Language Action Space | Jiangxing Wang, Jiachen Li, Xiao Han, Deheng Ye, Zongqing Lu | N/A | N/A |
| Evaluating Intention Detection Capability of Large Language Models in Persuasive Dialogues | Hiromasa Sakurai, Yusuke Miyao | N/A | N/A |
| LongLLMLingua: Accelerating and Enhancing LLMs in Long Context Scenarios via Prompt Compression | Huiqiang Jiang, Qianhui Wu, Xufang Luo, Dongsheng Li, Chin-Yew Lin, Yuqing Yang, Lili Qiu | N/A | N/A |
| Persuading across Diverse Domains: a Dataset and Persuasion Large Language Model | Chuhao Jin, Kening Ren, Lingzhen Kong, Xiting Wang, Ruihua Song, Huan Chen | N/A | N/A |
| HealMe: Harnessing Cognitive Reframing in Large Language Models for Psychotherapy | Mengxi Xiao, Qianqian Xie, Ziyan Kuang, Zhicheng Liu, Kailai Yang, Min Peng, Weiguang Han, Jimin Huang | N/A | N/A |
| Multimodal Prompt Learning with Missing Modalities for Sentiment Analysis and Emotion Recognition | Zirun Guo, Tao Jin, Zhou Zhao | N/A | N/A |
| An Effective Pronunciation Assessment Approach Leveraging Hierarchical Transformers and Pre-training Strategies | Bi-Cheng Yan, Jiun-Ting Li, Yi-Cheng Wang, Hsin Wei Wang, Tien-Hong Lo, Yung-Chang Hsu, Wei-Cheng Chao, Berlin Chen | N/A | N/A |
| Detection-Correction Structure via General Language Model for Grammatical Error Correction | Wei Li, Houfeng Wang | N/A | N/A |
| Generative Pre-trained Speech Language Model with Efficient Hierarchical Transformer | Yongxin Zhu, Dan Su, Liqiang He, Linli Xu, Dong Yu | N/A | N/A |
| Selene: Pioneering Automated Proof in Software Verification | Lichen Zhang, Shuai Lu, Nan Duan | N/A | N/A |
| Dissecting Human and LLM Preferences | Junlong Li, Fan Zhou, Shichao Sun, Yikai Zhang, Hai Zhao, Pengfei Liu | N/A | N/A |
| UniCoder: Scaling Code Large Language Model via Universal Code | Tao Sun, Linzheng Chai, Jian Yang, Yuwei Yin, Hongcheng Guo, Jiaheng Liu, Bing Wang, Liqun Yang, Zhoujun Li | N/A | N/A |
| AoE: Angle-optimized Embeddings for Semantic Textual Similarity | Xianming LI, Jing Li | N/A | N/A |
| InCharacter: Evaluating Personality Fidelity in Role-Playing Agents through Psychological Interviews | Xintao Wang, Yunze Xiao, Jen-tse Huang, Siyu Yuan, Rui Xu, Haoran Guo, Quan Tu, Yaying Fei, Ziang Leng, Wei Wang, Jiangjie Chen, Cheng Li, Yanghua Xiao | N/A | N/A |
| Does DetectGPT Fully Utilize Perturbation? Bridging Selective Perturbation to Fine-tuned Contrastive Learning Detector would be Better | Shengchao Liu, Xiaoming Liu, Yichen Wang, Zehua Cheng, Chengzhengxu Li, Zhaohan Zhang, Yu Lan, Chao Shen | N/A | N/A |
| AFaCTA: Assisting the Annotation of Factual Claim Detection with Reliable LLM Annotators | Jingwei Ni, Minjing Shi, Dominik Stammbach, Mrinmaya Sachan, Elliott Ash, Markus Leippold | N/A | N/A |
| Towards Faithful and Robust LLM Specialists for Evidence-Based Question-Answering | Tobias Schimanski, Jingwei Ni, Mathias Kraus, Elliott Ash, Markus Leippold | N/A | N/A |
| LoRAMoE: Alleviating World Knowledge Forgetting in Large Language Models via MoE-Style Plugin | Shihan Dou, Enyu Zhou, Yan Liu, Songyang Gao, Wei Shen, Limao Xiong, Yuhao Zhou, Xiao Wang, Zhiheng Xi, Xiaoran Fan, Shiliang Pu, Zhu Jiang, Rui Zheng, Tao Gui, Qi Zhang, Xuanjing Huang | N/A | N/A |
| Self-Alignment for Factuality: Mitigating Hallucinations in LLMs via Self-Evaluation | Xiaoying Zhang, Baolin Peng, Ye Tian, Jingyan Zhou, Lifeng Jin, Linfeng Song, Haitao Mi, Helen M. Meng | N/A | N/A |
| M-RAG: Reinforcing Large Language Model Performance through Retrieval-Augmented Generation with Multiple Partitions | Zheng Wang, Shu Xian Teo, Jieer Ouyang, Yongjun Xu, Wei Shi | N/A | N/A |
| AIR-Bench: Benchmarking Large Audio-Language Models via Generative Comprehension | Qian Yang, Jin Xu, Wenrui Liu, Yunfei Chu, Ziyue Jiang, Xiaohuan Zhou, Yichong Leng, Yuanjun Lv, Zhou Zhao, Chang Zhou, Jingren Zhou | N/A | N/A |
| Navigating the Metrics Maze: Reconciling Score Magnitudes and Accuracies | Tom Kocmi, Vilém Zouhar, Christian Federmann, Matt Post | N/A | N/A |
| ValueBench: Towards Comprehensively Evaluating Value Orientations and Understanding of Large Language Models | Yuanyi Ren, Haoran Ye, Hanjun Fang, Xin Zhang, Guojie Song | N/A | N/A |
| DM-BLI: Dynamic Multiple Subspaces Alignment for Unsupervised Bilingual Lexicon Induction | Ling Hu, Yuemei Xu | N/A | N/A |
| SparseFit: Few-shot Prompting with Sparse Fine-tuning for Jointly Generating Predictions and Natural Language Explanations | Jesus Solano, Mardhiyah Sanni, Oana-Maria Camburu, Pasquale Minervini | N/A | N/A |
| Handling Ambiguity in Emotion: From Out-of-Domain Detection to Distribution Estimation | Wen Wu, Bo Li, Chao Zhang, Chung-Cheng Chiu, Qiujia Li, Junwen Bai, Tara N Sainath, Phil Woodland | N/A | N/A |
| REANO: Optimising Retrieval-Augmented Reader Models through Knowledge Graph Generation | Jinyuan Fang, Zaiqiao Meng, Craig MacDonald | N/A | N/A |
| Learning Disentangled Semantic Spaces of Explanations via Invertible Neural Networks | Yingji Zhang, Danilo Carvalho, Andre Freitas | N/A | N/A |
| MoPS: Modular Story Premise Synthesis for Open-Ended Automatic Story Generation | Yan Ma, Yu Qiao, Pengfei Liu | N/A | N/A |
| Open-Set Semi-Supervised Text Classification via Adversarial Disagreement Maximization | Junfan Chen, Richong Zhang, Junchi Chen, Chunming Hu | N/A | N/A |
| ToolSword: Unveiling Safety Issues of Large Language Models in Tool Learning Across Three Stages | Junjie Ye, Sixian Li, Guanyu Li, Huangcaishuang, Songyang Gao, Yilong Wu, Qi Zhang, Tao Gui, Xuanjing Huang | N/A | N/A |
| A synthetic data approach for domain generalization of NLI models | Mohammad Javad Hosseini, Andrey Petrov, Alex Fabrikant, Annie Louis | N/A | N/A |
| Enhancing Contrastive Learning with Noise-Guided Attack: Towards Continual Relation Extraction in the Wild | Ting Wu, Jingyi Liu, Rui Zheng, Tao Gui, Qi Zhang, Xuanjing Huang | N/A | N/A |
| LRQuant: Learnable and Robust Post-Training Quantization for Large Language Models | Jiaqi Zhao, Miao Zhang, Chao Zeng, Ming Wang, Xuebo Liu, Liqiang Nie | N/A | N/A |
| VariErr NLI: Separating Annotation Error from Human Label Variation | Leon Weber-Genzel, Siyao Peng, Marie-Catherine de Marneffe, Barbara Plank | N/A | N/A |
| Towards Better Understanding of Contrastive Sentence Representation Learning: A Unified Paradigm for Gradient | Mingxin Li, Richong Zhang, Zhijie Nie | N/A | N/A |
| Benchmarking Knowledge Boundary for Large Language Models: A Different Perspective on Model Evaluation | Xunjian Yin, Xu Zhang, Jie Ruan, Xiaojun Wan | N/A | N/A |
| ListT5: Listwise Reranking with Fusion-in-Decoder Improves Zero-shot Retrieval | Soyoung Yoon, Eunbi Choi, Jiyeon Kim, Hyeongu Yun, Yireun Kim, seung-won hwang | N/A | N/A |
| Exploring the Potential of Large Language Models in Computational Argumentation | Guizhen Chen, Liying Cheng, Anh Tuan Luu, Lidong Bing | N/A | N/A |
| TaxoLLaMA: WordNet-based Model for Solving Multiple Lexical Semantic Tasks | Viktor Moskvoretskii, Ekaterina Neminova, Alina Lobanova, Alexander Panchenko, Irina Nikishina | N/A | N/A |
| CANDLE: Iterative Conceptualization and Instantiation Distillation from Large Language Models for Commonsense Reasoning | Weiqi Wang, Tianqing Fang, Chunyang Li, Haochen Shi, Wenxuan Ding, Baixuan Xu, Zhaowei Wang, Jiaxin Bai, Xin Liu, Cheng Jiayang, Chunkit Chan, Yangqiu Song | N/A | N/A |
| MEFT: Memory-Efficient Fine-Tuning through Sparse Adapter | Jitai Hao, Weiwei Sun, Xin Xin, Qi Meng, Zhumin Chen, Pengjie Ren, Zhaochun Ren | N/A | N/A |
| Surgical Feature-Space Decomposition of LLMs: Why, When and How? | Arnav Chavan, Nahush Lele, Deepak Gupta | N/A | N/A |
| Reasoning in Flux: Enhancing Large Language Models Reasoning through Uncertainty-aware Adaptive Guidance | Zhangyue Yin, Qiushi Sun, Qipeng Guo, Zhiyuan Zeng, Xiaonan Li, Junqi Dai, Qinyuan Cheng, Xuanjing Huang, Xipeng Qiu | N/A | N/A |
| Modality-Aware Integration with Large Language Models for Knowledge-Based Visual Question Answering | Junnan Dong, Qinggang Zhang, Huachi Zhou, Daochen Zha, Pai Zheng, Xiao Huang | N/A | N/A |
| Unlocking Data-free Low-bit Quantization with Matrix Decomposition for KV Cache Compression | Peiyu Liu, Ze-Feng Gao, Xin Zhao, Yipeng Ma, Tao Wang, Ji-Rong Wen | N/A | N/A |
| Emergent Word Order Universals from Cognitively-Motivated Language Models | Tatsuki Kuribayashi, Ryo Ueda, Ryo Yoshida, Yohei Oseki, Ted Briscoe, Timothy Baldwin | N/A | N/A |
| VerifiNER: Verification-augmented NER via Knowledge-grounded Reasoning with Large Language Models | Seoyeon Kim, Kwangwook Seo, Hyungjoo Chae, Jinyoung Yeo, Dongha Lee | N/A | N/A |
| Making Long-Context Language Models Better Multi-Hop Reasoners | Yanyang Li, Shuo Liang, Michael Lyu, Liwei Wang | N/A | N/A |
| TransliCo: A Contrastive Learning Framework to Address the Script Barrier in Multilingual Pretrained Language Models | Yihong Liu, Chunlan Ma, Haotian Ye, Hinrich Schuetze | N/A | N/A |
| Extreme Miscalibration and the Illusion of Adversarial Robustness | Vyas Raina, Samson Tan, Volkan Cevher, Aditya Rawal, Sheng Zha, George Karypis | N/A | N/A |
| HyCoRec: Hypergraph-Enhanced Multi-Preference Learning for Alleviating Matthew Effect in Conversational Recommendation | Yongsen Zheng, Ruilin Xu, Ziliang Chen, Guohua Wang, Mingjie Qian, Jinghui Qin, Liang Lin | N/A | N/A |
| Co-training for Low Resource Scientific Natural Language Inference | Mobashir Sadat, Cornelia Caragea | N/A | N/A |
| RLHFPoison: Reward Poisoning Attack for Reinforcement Learning with Human Feedback in Large Language Models | Jiongxiao Wang, Junlin Wu, Muhao Chen, Yevgeniy Vorobeychik, Chaowei Xiao | N/A | N/A |
| Time is Encoded in the Weights of Finetuned Language Models | Kai Nylund, Suchin Gururangan, Noah A. Smith | N/A | N/A |
| Long-Context Language Modeling with Parallel Context Encoding | Howard Yen, Tianyu Gao, Danqi Chen | N/A | N/A |
| SirLLM: Streaming Infinite Retentive LLM | Yao Yao, Zuchao Li, Hai Zhao | N/A | N/A |
| IMO: Greedy Layer-Wise Sparse Representation Learning for Out-of-Distribution Text Classification with Pre-trained Models | Tao Feng, Lizhen Qu, Zhuang Li, Haolan Zhan, YUNCHENG HUA, Reza Haf | N/A | N/A |
| Generative Pretrained Structured Transformers: Unsupervised Syntactic Language Models at Scale | Xiang Hu, Pengyu Ji, Qingyang Zhu, Wei Wu, Kewei Tu | N/A | N/A |
| MELA: Multilingual Evaluation of Linguistic Acceptability | Ziyin Zhang, Yikang Liu, Weifang Huang, Junyu Mao, Rui Wang, Hai Hu | N/A | N/A |
| Exploring Collaboration Mechanisms for LLM Agents: A Social Psychology View | Jintian Zhang, Xin Xu, Ningyu Zhang, Ruibo Liu, Bryan Hooi, Shumin Deng | N/A | N/A |
| CopyNE: Better Contextual ASR by Copying Named Entities | Shilin Zhou, Zhenghua Li, Yu Hong, Min Zhang, Zhefeng Wang, Baoxing Huai | N/A | N/A |
| Is Table Retrieval a Solved Problem? Exploring Join-Aware Multi-Table Retrieval | Peter Baile Chen, Yi Zhang, Dan Roth | N/A | N/A |
| Generalizing Conversational Dense Retrieval via LLM-Cognition Data Augmentation | Haonan Chen, Zhicheng Dou, Kelong Mao, Jiongnan Liu, Ziliang Zhao | N/A | N/A |
| ItD: Large Language Models Can Teach Themselves Induction through Deduction | Wangtao Sun, Haotian Xu, Xuanqing Yu, Pei Chen, Shizhu He, Jun Zhao, Kang Liu | N/A | N/A |
| MathGenie: Generating Synthetic Data with Question Back-translation for Enhancing Mathematical Reasoning of LLMs | Zimu Lu, Aojun Zhou, Houxing Ren, Ke Wang, Weikang Shi, Junting Pan, Mingjie Zhan, Hongsheng Li | N/A | N/A |
| MARVEL: Unlocking the Multi-Modal Capability of Dense Retrieval via Visual Module Plugin | Tianshuo Zhou, Sen Mei, Xinze Li, Zhenghao Liu, Chenyan Xiong, Zhiyuan Liu, Yu Gu, Ge Yu | N/A | N/A |
| Rethinking Task-Oriented Dialogue Systems: From Complex Modularity to Zero-Shot Autonomous Agent | Heng-Da Xu, Xian-Ling Mao, Puhai Yang, Fanshu Sun, Heyan Huang | N/A | N/A |
| On Context Utilization in Summarization with Large Language Models | Mathieu Ravaut, Aixin Sun, Nancy F. Chen, Shafiq Joty | N/A | N/A |
| INTERS: Unlocking the Power of Large Language Models in Search with Instruction Tuning | Yutao Zhu, Peitian Zhang, Chenghao Zhang, Yifei Chen, Binyu Xie, Zheng Liu, Ji-Rong Wen, Zhicheng Dou | N/A | N/A |
| Enhancing In-Context Learning via Implicit Demonstration Augmentation | Xiaoling Zhou, Wei Ye, Yidong Wang, Chaoya Jiang, Zhemg Lee, Rui Xie, Shikun Zhang | N/A | N/A |
| PRoLoRA: Partial Rotation Empowers More Parameter-Efficient LoRA | Sheng Wang, Boyang XUE, Jiacheng Ye, Jiyue Jiang, Liheng Chen, Lingpeng Kong, Chuan Wu | N/A | N/A |
| Distributional Inclusion Hypothesis and Quantifications: Probing for Hypernymy in Functional Distributional Semantics | Chun Hei Lo, Wai Lam, Hong Cheng, Guy Emerson | N/A | N/A |
| Improving Event Definition Following For Zero-Shot Event Detection | Zefan Cai, Po-Nien Kung, Ashima Suvarna, Mingyu Derek Ma, Hritik Bansal, Baobao Chang, P. Jeffrey Brantingham, Wei Wang, Nanyun Peng | N/A | N/A |
| Through the MUD: A Multi-Defendant Charge Prediction Benchmark with Linked Crime Elements | Xiao Wei, Xu Qi, Hang Yu, Qian Liu, Erik Cambria | N/A | N/A |
| Interpreting Conversational Dense Retrieval by Rewriting-Enhanced Inversion of Session Embedding | Yiruo Cheng, Kelong Mao, Zhicheng Dou | N/A | N/A |
| Stumbling Blocks: Stress Testing the Robustness of Machine-Generated Text Detectors Under Attacks | Yichen Wang, Shangbin Feng, Abe Bohan Hou, Xiao Pu, Chao Shen, Xiaoming Liu, Yulia Tsvetkov, Tianxing He | N/A | N/A |
| CausalGym: Benchmarking causal interpretability methods on linguistic tasks | Aryaman Arora, Dan Jurafsky, Christopher Potts | N/A | N/A |
| Training Language Models to Generate Text with Citations via Fine-grained Rewards | Chengyu Huang, Zeqiu Wu, Yushi Hu, Wenya Wang | N/A | N/A |
| Hypergraph based Understanding for Document Semantic Entity Recognition | Qiwei Li, Zuchao Li, Ping Wang, Haojun Ai, Hai Zhao | N/A | N/A |
| GSM-Plus: A Comprehensive Benchmark for Evaluating the Robustness of LLMs as Mathematical Problem Solvers | Qintong Li, Leyang Cui, Xueliang Zhao, Lingpeng Kong, Wei Bi | N/A | N/A |
| Synergetic Event Understanding: A Collaborative Approach to Cross-Document Event Coreference Resolution with Large Language Models | Qingkai Min, Qipeng Guo, Xiangkun Hu, Songfang Huang, Zheng Zhang, Yue Zhang | N/A | N/A |
| AutoAct: Automatic Agent Learning from Scratch for QA via Self-Planning | Shuofei Qiao, Ningyu Zhang, Runnan Fang, Yujie Luo, Wangchunshu Zhou, Yuchen Eleanor Jiang, Chengfei Lv, Huajun Chen | N/A | N/A |
| ChronosLex: Time-aware Incremental Training for Temporal Generalization of Legal Classification Tasks | Santosh T.Y.S.S, Tuan-Quang Vuong, Matthias Grabmair | N/A | N/A |
| Virtual Compiler Is All You Need For Assembly Code Search | Zeyu Gao, Hao Wang, Yuanda Wang, Chao Zhang | N/A | N/A |
| MELoRA: Mini-Ensemble Low-Rank Adapters for Parameter-Efficient Fine-Tuning | Pengjie Ren, Chengshun Shi, Shiguang Wu, Mengqi Zhang, Zhaochun Ren, Maarten de Rijke, Zhumin Chen, Jiahuan Pei | N/A | N/A |
| Can LLMs Learn from Previous Mistakes? Investigating LLMs’ Errors to Boost for Reasoning | Yongqi Tong, Dawei Li, Sizhe Wang, Yujia Wang, Fei Teng, Jingbo Shang | N/A | N/A |
| An Iterative Associative Memory Model for Empathetic Response Generation | Zhou Yang, Zhaochun Ren, Wang Yufeng, Haizhou Sun, Chao Chen, Xiaofei Zhu, Xiangwen Liao | N/A | N/A |
| Detoxifying Large Language Models via Knowledge Editing | Mengru Wang, Ningyu Zhang, Ziwen Xu, Zekun Xi, Shumin Deng, Yunzhi Yao, Qishen Zhang, Linyi Yang, Jindong Wang, Huajun Chen | N/A | N/A |
| LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding | Yushi Bai, Xin Lv, Jiajie Zhang, Hongchang Lyu, Jiankai Tang, Zhidian Huang, Zhengxiao Du, Xiao Liu, Aohan Zeng, Lei Hou, Yuxiao Dong, Jie Tang, Juanzi Li | N/A | N/A |
| Dr.Academy: A Benchmark for Evaluating Questioning Capability in Education for Large Language Models | Yuyan Chen, Songzhou Yan, Panjun Liu, Yanghua Xiao | N/A | N/A |
| UniBridge: A Unified Approach to Cross-Lingual Transfer Learning for Low-Resource Languages | Trinh Pham, Khoi M. Le, Anh Tuan Luu | N/A | N/A |
| VISTA: Visualized Text Embedding For Universal Multi-Modal Retrieval | Junjie Zhou, Zheng Liu, Shitao Xiao, Bo Zhao, Yongping Xiong | N/A | N/A |
| Black-Box Prompt Optimization: Aligning Large Language Models without Model Training | Jiale Cheng, Xiao Liu, Kehan Zheng, Pei Ke, Hongning Wang, Yuxiao Dong, Jie Tang, Minlie Huang | N/A | N/A |
| Open Ko-LLM Leaderboard: Evaluating Large Language Models in Korean with Ko-H5 Benchmark | Chanjun Park, Hyeonwoo Kim, Dahyun Kim, SeongHwan Cho, Sanghoon Kim, Sukyung Lee, Yungi Kim, Hwalsuk Lee | N/A | N/A |
| Unified Hallucination Detection for Multimodal Large Language Models | Xiang Chen, Chenxi Wang, Yida Xue, Ningyu Zhang, Xiaoyan Yang, Qiang Li, Yue Shen, Lei Liang, Jinjie Gu, Huajun Chen | N/A | N/A |
| Empowering Character-level Text Infilling by Eliminating Sub-Tokens | Houxing Ren, Mingjie Zhan, Zhongyuan Wu, Hongsheng Li | N/A | N/A |
| Landmark Embedding: A Chunking-Free Embedding Method For Retrieval Augmented Long-Context Large Language Models | Kun Luo, Zheng Liu, Shitao Xiao, Tong Zhou, Yubo Chen, Jun Zhao, Kang Liu | N/A | N/A |
| GrowOVER: How Can LLMs Adapt to Growing Real-World Knowledge? | Dayoon Ko, Jinyoung Kim, Hahyeon Choi, Gunhee Kim | N/A | N/A |
| Attribute First, then Generate: Locally-attributable Grounded Text Generation | Aviv Slobodkin, Eran Hirsch, Arie Cattan, Tal Schuster, Ido Dagan | N/A | N/A |
| T2S-GPT: Dynamic Vector Quantization for Autoregressive Sign Language Production from Text | Aoxiong Yin, Haoyuan Li, Kai Shen, Siliang Tang, Yueting Zhuang | N/A | N/A |
| OceanGPT: A Large Language Model for Ocean Science Tasks | Zhen Bi, Ningyu Zhang, Yida Xue, Yixin Ou, Daxiong Ji, Guozhou Zheng, Huajun Chen | N/A | N/A |
| Beyond Memorization: The Challenge of Random Memory Access in Language Models | Tongyao Zhu, Qian Liu, Liang Pang, Zhengbao Jiang, Min-Yen Kan, Min Lin | N/A | N/A |
| BIPED: Pedagogically Informed Tutoring System for ESL Education | Soonwoo Kwon, Sojung Kim, Minju Park, Seunghyun Lee, Kyuseok Kim | N/A | N/A |
| Timeline-based Sentence Decomposition with In Context Learning for Temporal Fact Extraction | Jianhao Chen, Haoyuan Ouyang, Junyang Ren, Wentao Ding, Wei Hu, Yuzhong Qu | N/A | N/A |
| Collaboration or Corporate Capture? Quantifying NLP’s Reliance on Industry Artifacts and Contributions | Will Aitken, Mohamed Abdalla, Karen Rudie, Catherine Stinson | N/A | N/A |
| Prompt Expansion for Adaptive Text-to-Image Generation | Siddhartha Datta, Alexander Ku, Deepak Ramachandran, Peter Anderson | N/A | N/A |
| Progressively Modality Freezing for Multi-Modal Entity Alignment | Yani Huang, Xuefeng Zhang, Richong Zhang, Junfan Chen, Jaein Kim | N/A | N/A |
| Llama2Vec: Unsupervised Adaptation of Large Language Models for Dense Retrieval | Chaofan Li, Zheng Liu, Shitao Xiao, Yingxia Shao, Defu Lian | N/A | N/A |
| Democratizing LLMs for Low-Resource Languages by Leveraging their English Dominant Abilities with Linguistically-Diverse Prompts | Xuan-Phi Nguyen, Mahani Aljunied, Shafiq Joty, Lidong Bing | N/A | N/A |
| Metaphor Understanding Challenge Dataset for LLMs | Xiaoyu Tong, Rochelle Choenni, Martha Lewis, Ekaterina Shutova | N/A | N/A |
| A Multi-Task Embedder For Retrieval Augmented LLMs | Peitian Zhang, Zheng Liu, Shitao Xiao, Zhicheng Dou, Jian-Yun Nie | N/A | N/A |
| Language Models Don’t Learn the Physical Manifestation of Language | Bruce W Lee, Jaehyuk Lim | N/A | N/A |
| Don’t Hallucinate, Abstain: Identifying LLM Knowledge Gaps via Multi-LLM Collaboration | Shangbin Feng, Weijia Shi, Yike Wang, Wenxuan Ding, Vidhisha Balachandran, Yulia Tsvetkov | N/A | N/A |
| What Does the Bot Say? Opportunities and Risks of Large Language Models in Social Media Bot Detection | Shangbin Feng, Herun Wan, Ningnan Wang, Zhaoxuan Tan, Minnan Luo, Yulia Tsvetkov | N/A | N/A |
| Self-Contrast: Better Reflection Through Inconsistent Solving Perspectives | Wenqi Zhang, Yongliang Shen, Linjuan Wu, Qiuying Peng, Jun Wang, Yueting Zhuang, Weiming Lu | N/A | N/A |
| Relying on the Unreliable: The Impact of Language Models’ Reluctance to Express Uncertainty | Kaitlyn Zhou, Jena D. Hwang, Xiang Ren, Maarten Sap | N/A | N/A |
| Mission: Impossible Language Models | Julie Kallini, Isabel Papadimitriou, Richard Futrell, Kyle Mahowald, Christopher Potts | N/A | N/A |
| Unity in Diversity: Collaborative Pre-training Across Multimodal Medical Sources | Xiaochen Wang, Junyu Luo, Jiaqi Wang, Yuan Zhong, Xiaokun Zhang, Yaqing Wang, Parminder Bhatia, Cao Xiao, Fenglong Ma | N/A | N/A |
| Semisupervised Neural Proto-Language Reconstruction | Liang Lu, Peirong Xie, David R Mortensen | N/A | N/A |
| When Good and Reproducible Results are a Giant with Feet of Clay: The Importance of Software Quality in NLP | Sara Papi, Marco Gaido, Andrea Pilzer, Matteo Negri | N/A | N/A |
| SBAAM! Eliminating Transcript Dependency in Automatic Subtitling | Marco Gaido, Sara Papi, Matteo Negri, Mauro Cettolo, Luisa Bentivogli | N/A | N/A |
| Speech Translation with Speech Foundation Models and Large Language Models: What is There and What is Missing? | Marco Gaido, Sara Papi, Matteo Negri, Luisa Bentivogli | N/A | N/A |
| StreamAtt: Direct Streaming Speech-to-Text Translation with Attention-based Audio History Selection | Sara Papi, Marco Gaido, Matteo Negri, Luisa Bentivogli | N/A | N/A |
| ARL2: Aligning Retrievers with Black-box Large Language Models via Self-guided Adaptive Relevance Labeling | LingXi Zhang, Yue Yu, Kuan Wang, Chao Zhang | N/A | N/A |
| Crayon: Customized On-Device LLM via Instant Adapter Blending and Edge-Server Hybrid Inference | Jihwan Bang, Juntae Lee, Kyuhong Shim, Seunghan Yang, Simyung Chang | N/A | N/A |
| FLEUR: An Explainable Reference-Free Evaluation Metric for Image Captioning Using a Large Multimodal Model | Yebin Lee, Imseong Park, Myungjoo Kang | N/A | N/A |
| MentalManip: A Dataset For Fine-grained Analysis of Mental Manipulation in Conversations | Yuxin Wang, Ivory Yang, Saeed Hassanpour, Soroush Vosoughi | N/A | N/A |
| MPCoder: Multi-user Personalized Code Generator with Explicit and Implicit Style Representation Learning | Zhenlong Dai, Chang Yao, WenKang Han, Yuanying, Zhipeng Gao, Jingyuan Chen | N/A | N/A |
| DataDreamer: A Tool for Synthetic Data Generation and Reproducible LLM Workflows | Ajay Patel, Colin Raffel, Chris Callison-Burch | N/A | N/A |
| Understanding and Addressing the Under-Translation Problem from the Perspective of Decoding Objective | Chenze Shao, Fandong Meng, Jiali Zeng, Jie Zhou | N/A | N/A |
| Identifying while Learning for Document Event Causality Identification | Cheng Liu, Wei Xiang, Bang Wang | N/A | N/A |
| OlympiadBench: A Challenging Benchmark for Promoting AGI with Olympiad-Level Bilingual Multimodal Scientific Problems | Chaoqun He, Renjie Luo, Yuzhuo Bai, Shengding Hu, Zhen Leng Thai, Junhao Shen, Jinyi Hu, Xu Han, Yujie Huang, Yuxiang Zhang, Jie Liu, Lei Qi, Zhiyuan Liu, Maosong Sun | N/A | N/A |
| Insert or Attach: Taxonomy Completion via Box Embedding | Wei Xue, Yongliang Shen, Wenqi Ren, Jietian Guo, Shiliang Pu, Weiming Lu | N/A | N/A |
| Semiparametric Token-Sequence Co-Supervision | Hyunji Lee, Doyoung Kim, Jihoon Jun, Se June Joo, Joel Jang, Kyoung-Woon On, Minjoon Seo | N/A | N/A |
| Instruction Fusion: Advancing Prompt Evolution through Hybridization | Weidong Guo, Jiuding Yang, Kaitong Yang, Xiangyang Li, Zhuwei Rao, Yu Xu, Di Niu | N/A | N/A |
| TimeArena: Shaping Efficient Multitasking Language Agents in a Time-Aware Simulation | Yikai Zhang, Siyu Yuan, Caiyu Hu, Kyle Richardson, Yanghua Xiao, Jiangjie Chen | N/A | N/A |
| Exploring Memorization in Fine-tuned Language Models | Shenglai Zeng, Yaxin Li, Jie Ren, Yiding Liu, Han Xu, Pengfei He, Yue Xing, Shuaiqiang Wang, Jiliang Tang, Dawei Yin | N/A | N/A |
| Towards Real-world Scenario: Imbalanced New Intent Discovery | Shun Zhang, Yan Chaoran, Jian Yang, Jiaheng Liu, Ying Mo, Jiaqi Bai, Tongliang Li, Zhoujun Li | N/A | N/A |
| M4GT-Bench: Evaluation Benchmark for Black-Box Machine-Generated Text Detection | Yuxia Wang, Jonibek Mansurov, Petar Ivanov, Jinyan Su, Artem Shelmanov, Akim Tsvigun, Osama Mohammed Afzal, Tarek Mahmoud, Giovanni Puccetti, Thomas Arnold, Alham Fikri Aji, Nizar Habash, Iryna Gurevych, Preslav Nakov | N/A | N/A |
| Instruct Once, Chat Consistently in Multiple Rounds: An Efficient Tuning Framework for Dialogue | Jian Wang, Chak Tou Leong, Jiashuo Wang, Dongding Lin, Wenjie Li, Xiaoyong Wei | N/A | N/A |
| SoftDedup: an Efficient Data Reweighting Method for Speeding Up Language Model Pre-training | Nan He, Weichen Xiong, Hanwen Liu, Yi Liao, Lei Ding, Kai Zhang, Guohua Tang, Xiao Han, Yang Wei | N/A | N/A |
| Rule or Story, Which is a Better Commonsense Expression for Talking with Large Language Models? | Ning Bian, Xianpei Han, Hongyu Lin, Yaojie Lu, Ben He, Le Sun | N/A | N/A |
| Learning Global Controller in Latent Space for Parameter-Efficient Fine-Tuning | Zeqi Tan, Yongliang Shen, Xiaoxia Cheng, Chang Zong, Wenqi Zhang, Jian Shao, Weiming Lu, Yueting Zhuang | N/A | N/A |
| CaMML: Context-Aware Multimodal Learner for Large Models | Yixin Chen, Shuai Zhang, Boran Han, Tong He, Bo Li | N/A | N/A |
| MAVEN-ARG: Completing the Puzzle of All-in-One Event Understanding Dataset with Event Argument Annotation | Xiaozhi Wang, Hao Peng, Yong Guan, Kaisheng Zeng, Jianhui Chen, Lei Hou, Xu Han, Yankai Lin, Zhiyuan Liu, Ruobing Xie, Jie Zhou, Juanzi Li | N/A | N/A |
| NPHardEval: Dynamic Benchmark on Reasoning Ability of Large Language Models via Complexity Classes | Lizhou Fan, Wenyue Hua, Lingyao Li, Haoyang Ling, Yongfeng Zhang | N/A | N/A |
| Can Watermarks Survive Translation? On the Cross-lingual Consistency of Text Watermark for Large Language Models | Zhiwei He, Binglin Zhou, Hongkun Hao, Aiwei Liu, Xing Wang, Zhaopeng Tu, Zhuosheng Zhang, Rui Wang | N/A | N/A |
| Speech vs. Transcript: Does It Matter for Human Annotators in Speech Summarization? | Roshan Sharma, Suwon Shon, Mark Lindsey, Hira Dhamyal, Bhiksha Raj | N/A | N/A |
| Multi-Level Feedback Generation with Large Language Models for Empowering Novice Peer Counselors | Alicja Chaszczewicz, Raj Sanjay Shah, Ryan Louie, Bruce A Arnow, Robert Kraut, Diyi Yang | N/A | N/A |
| D2LLM: Decomposed and Distilled Large Language Models for Semantic Search | Zihan Liao, Hang Yu, Jianguo Li, Jun Wang, Wei Zhang | N/A | N/A |
| In-context Mixing (ICM): Code-mixed Prompts for Multilingual LLMs | Bhavani Shankar, Preethi Jyothi, Pushpak Bhattacharyya | N/A | N/A |
| Respond in my Language: Mitigating Language Inconsistency in Response Generation based on Large Language Models | Liang Zhang, Qin Jin, Haoyang Huang, Dongdong Zhang, Furu Wei | N/A | N/A |
| Transferable Embedding Inversion Attack: Uncovering Privacy Risks in Text Embeddings without Model Queries | Yu-Hsiang Huang, Yuche Tsai, Hsiang Hsiao, Hong-Yi Lin, Shou-De Lin | N/A | N/A |
| Enhancing Reinforcement Learning with Label-Sensitive Reward for Natural Language Understanding | Kuo Liao, Shuang Li, Meng Zhao, Liqun Liu, Mengge Xue, Zhenyu Hu, Honglin Han, Chengguo Yin | N/A | N/A |
| Intuitive or Dependent? Investigating LLMs’ Behavior Style to Conflicting Prompts | Jiahao Ying, Yixin Cao, Kai Xiong, Long Cui, Yidong He, Yongbin Liu | N/A | N/A |
| CoCA: Fusing Position Embedding with Collinear Constrained Attention in Transformers for Long Context Window Extending | Shiyi Zhu, Jing Ye, Wei Jiang, Siqiao Xue, Qi Zhang, Yifan Wu, Jianguo Li | N/A | N/A |
| Arabic Diacritics in the Wild: Exploiting Opportunities for Improved Diacritization | Salman Elgamal, Ossama Obeid, MHD Tameem Kabbani, Go Inoue, Nizar Habash | N/A | N/A |
| InfoLossQA: Characterizing and Recovering Information Loss in Text Simplification | Jan Trienes, Sebastian Antony Joseph, Jörg Schlötterer, Christin Seifert, Kyle Lo, Wei Xu, Byron C Wallace, Junyi Jessy Li | N/A | N/A |
| Disinformation Capabilities of Large Language Models | Ivan Vykopal, Matúš Pikuliak, Ivan Srba, Robert Moro, Dominik Macko, Maria Bielikova | N/A | N/A |
| Learn or Recall? Revisiting Incremental Learning with Pre-trained Language Models | Junhao Zheng, Shengjie Qiu, Qianli Ma | N/A | N/A |
| CoGenesis: A Framework Collaborating Large and Small Language Models for Secure Context-Aware Instruction Following | Kaiyan Zhang, Jianyu Wang, Ermo Hua, Biqing Qi, Ning Ding, Bowen Zhou | N/A | N/A |
| DAPR: A Benchmark on Document-Aware Passage Retrieval | Kexin Wang, Nils Reimers, Iryna Gurevych | N/A | N/A |
| How to Handle Different Types of Out-of-Distribution Scenarios in Computational Argumentation? A Comprehensive and Fine-Grained Field Study | Andreas Waldis, Yufang Hou, Iryna Gurevych | N/A | N/A |
| Strengthened Symbol Binding Makes Large Language Models Reliable Multiple-Choice Selectors | Mengge Xue, Zhenyu Hu, Liqun Liu, Kuo Liao, Shuang Li, Honglin Han, Meng Zhao, Chengguo Yin | N/A | N/A |
| SAC-KG: Exploiting Large Language Models as Skilled Automatic Constructors for Domain Knowledge Graph | Hanzhu Chen, Xu Shen, Qitan Lv, Jie Wang, Xiaoqi Ni, Jieping Ye | N/A | N/A |
| Cendol: Open Instruction-tuned Generative Large Language Models for Indonesian Languages | Samuel Cahyawijaya, Holy Lovenia, Fajri Koto, Rifki Afina Putri, Wawan Cenggoro, Jhonson Lee, Salsabil Maulana Akbar, Emmanuel Dave, Nuurshadieq, Muhammad Ihza Mahendra, Rr Dea Annisayanti Putri, Bryan Wilie, Genta Indra Winata, Alham Fikri Aji, Ayu Purwarianti, Pascale Fung | N/A | N/A |
| Uncertainty-Guided Modal Rebalance for Hateful Memes Detection | Chuanpeng Yang, Yaxin Liu, Fuqing Zhu, Jizhong Han, Songlin Hu | N/A | N/A |
| Must NLP be Extractive? | Steven Bird | N/A | N/A |
| Spiral of Silence: How is Large Language Model Killing Information Retrieval?—A Case Study on Open Domain Question Answering | Xiaoyang Chen, Ben He, Hongyu Lin, Xianpei Han, Tianshu Wang, Boxi Cao, Le Sun, Yingfei Sun | N/A | N/A |
| Missci: Reconstructing Fallacies in Misrepresented Science | Max Glockner, Yufang Hou, Preslav Nakov, Iryna Gurevych | N/A | N/A |
| Uncovering the Full Potential of Visual Grounding Methods in VQA | Daniel Reich, Tanja Schultz | N/A | N/A |
| Small Models, Big Insights: Leveraging Slim Proxy Models To Decide When and What to Retrieve for LLMs | Jiejun Tan, Zhicheng Dou, Yutao Zhu, Peidong Guo, Kun Fang, Ji-Rong Wen | N/A | N/A |
| Favi-Score: A Measure for Favoritism in Automated Preference Ratings for Generative AI Evaluation | Pius von Däniken, Jan Milan Deriu, Don Tuggener, Mark Cieliebak | N/A | N/A |
| LLM-based Rewriting of Inappropriate Argumentation using Reinforcement Learning from Machine Feedback | Timon Ziegenbein, Gabriella Skitalinskaya, Alireza Bayat Makou, Henning Wachsmuth | N/A | N/A |
| Graph Language Models | Moritz Plenz, Anette Frank | N/A | N/A |
| Analyzing Semantic Change through Lexical Replacements | Francesco Periti, Pierluigi Cassotti, Haim Dubossarsky, Nina Tahmasebi | N/A | N/A |
| Exploiting Intrinsic Multilateral Logical Rules for Weakly Supervised Natural Language Video Localization | Zhe Xu, Kun Wei, Xu Yang, Cheng Deng | N/A | N/A |
| Latxa: An Open Language Model and Evaluation Suite for Basque | Julen Etxaniz, Oscar Sainz, Naiara Perez Miguel, Itziar Aldabe, German Rigau, Eneko Agirre, Aitor Ormazabal, Mikel Artetxe, Aitor Soroa | N/A | N/A |
| Interpretability of Language Models via Task Spaces | Lucas Weber, Jaap Jumelet, Elia Bruni, Dieuwke Hupkes | N/A | N/A |
| Using Synchronic Definitions and Semantic Relations to Classify Semantic Change Types | Pierluigi Cassotti, Stefano De Pascale, Nina Tahmasebi | N/A | N/A |
| Factual Confidence of LLMs: on Reliability and Robustness of Current Estimators | Matéo Mahaut, Laura Aina, Paula Czarnowska, Momchil Hardalov, Thomas Müller, Lluis Marquez | N/A | N/A |
| StepCoder: Improving Code Generation with Reinforcement Learning from Compiler Feedback | Shihan Dou, Yan Liu, Haoxiang Jia, Enyu Zhou, Limao Xiong, Junjie Shan, Huangcaishuang, Xiao Wang, Xiaoran Fan, Zhiheng Xi, Yuhao Zhou, Tao Ji, Rui Zheng, Qi Zhang, Tao Gui, Xuanjing Huang | N/A | N/A |
| One-Shot Learning as Instruction Data Prospector for Large Language Models | Yunshui Li, Binyuan Hui, Xiaobo Xia, Jiaxi Yang, Min Yang, Lei Zhang, Shuzheng Si, Ling-Hao Chen, Junhao Liu, Tongliang Liu, Fei Huang, Yongbin Li | N/A | N/A |
| Navigating the OverKill in Large Language Models | Chenyu Shi, Xiao Wang, Qiming Ge, Songyang Gao, Xianjun Yang, Tao Gui, Qi Zhang, Xuanjing Huang, Xun Zhao, Dahua Lin | N/A | N/A |
| Why are Sensitive Functions Hard for Transformers? | Michael Hahn, Mark Rofin | N/A | N/A |
| A Chain-of-Thought Is as Strong as Its Weakest Link: A Benchmark for Verifiers of Reasoning Chains | Alon Jacovi, Yonatan Bitton, Bernd Bohnet, Jonathan Herzig, Or Honovich, Michael Tseng, Michael Collins, Roee Aharoni, Mor Geva | N/A | N/A |
| Re3: A Holistic Framework and Dataset for Modeling Collaborative Document Revision | Qian Ruan, Ilia Kuznetsov, Iryna Gurevych | N/A | N/A |
| NextLevelBERT: Masked Language Modeling with Higher-Level Representations for Long Documents | Tamara Czinczoll, Christoph Hönes, Maximilian Schall, Gerard de Melo | N/A | N/A |
| FollowBench: A Multi-level Fine-grained Constraints Following Benchmark for Large Language Models | Yuxin Jiang, Yufei Wang, Xingshan Zeng, Wanjun Zhong, Liangyou Li, Fei Mi, Lifeng Shang, Xin Jiang, Qun Liu, Wei Wang | N/A | N/A |
| Talk With Human-like Agents: Empathetic Dialogue Through Perceptible Acoustic Reception and Reaction | Haoqiu Yan, Yongxin Zhu, Kai Zheng, Bing Liu, Haoyu Cao, Deqiang Jiang, Linli Xu | N/A | N/A |
| Learning to Edit: Aligning LLMs with Knowledge Editing | Yuxin Jiang, Yufei Wang, Chuhan Wu, Wanjun Zhong, Xingshan Zeng, Jiahui Gao, Liangyou Li, Xin Jiang, Lifeng Shang, Ruiming Tang, Qun Liu, Wei Wang | N/A | N/A |
| DolphCoder: Echo-Locating Code Large Language Models with Diverse and Multi-Objective Instruction Tuning | Yejie Wang, Keqing He, Guanting Dong, Pei Wang, Weihao Zeng, Muxi Diao, Weiran Xu, Jingang Wang, Mengdi Zhang, Xunliang Cai | N/A | N/A |
| IRCoder: Intermediate Representations Make Language Models Robust Multilingual Code Generators | Indraneil Paul, Goran Glavaš, Iryna Gurevych | N/A | N/A |
| When Only Time Will Tell: Interpreting How Transformers Process Local Ambiguities Through the Lens of Restart-Incrementality | Brielen Madureira, Patrick Kahardipraja, David Schlangen | N/A | N/A |
| SpaRC and SpaRP: Spatial Reasoning Characterization and Path Generation for Understanding Spatial Reasoning Capability of Large Language Models | Md Imbesat Hassan Rizvi, Xiaodan Zhu, Iryna Gurevych | N/A | N/A |
| Planning Like Human: A Dual-process Framework for Dialogue Planning | Tao He, Lizi Liao, Yixin Cao, Yuanxing Liu, Ming Liu, Zerui Chen, Bing Qin | N/A | N/A |
| Spectral Filters, Dark Signals, and Attention Sinks | Nicola Cancedda | N/A | N/A |
| DiffuCOMET: Contextual Commonsense Knowledge Diffusion | Silin Gao, Mete Ismayilzada, Mengjie Zhao, Hiromi Wakaki, Yuki Mitsufuji, Antoine Bosselut | N/A | N/A |
| Systematic Task Exploration with LLMs: A Study in Citation Text Generation | Furkan Şahinuç, Ilia Kuznetsov, Yufang Hou, Iryna Gurevych | N/A | N/A |
| The Echoes of Multilinguality: Tracing Cultural Value Shifts during Language Model Fine-tuning | Rochelle Choenni, Anne Lauscher, Ekaterina Shutova | N/A | N/A |
| Limits of Theory of Mind Modelling in Dialogue-Based Collaborative Plan Acquisition | Matteo Bortoletto, Constantin Ruhdorfer, Adnen Abdessaied, Lei Shi, Andreas Bulling | N/A | N/A |
| MYTE: Morphology-Driven Byte Encoding for Better and Fairer Multilingual Language Modeling | Tomasz Limisiewicz, Terra Blevins, Hila Gonen, Orevaoghene Ahia, Luke Zettlemoyer | N/A | N/A |
| Temporal Knowledge Question Answering via Abstract Reasoning Induction | Ziyang Chen, Dongfang Li, Xiang Zhao, Baotian Hu, Min Zhang | N/A | N/A |
| MultiLegalPile: A 689GB Multilingual Legal Corpus | Joel Niklaus, Veton Matoshi, Matthias Stürmer, Ilias Chalkidis, Daniel E. Ho | N/A | N/A |
| Who Wrote this Code? Watermarking for Code Generation | Taehyun Lee, Seokhee Hong, Jaewoo Ahn, Ilgee Hong, Hwaran Lee, Sangdoo Yun, Jamin Shin, Gunhee Kim | N/A | N/A |
| MapCoder: Multi-Agent Code Generation for Competitive Problem Solving | Md. Ashraful Islam, Mohammed Eunus Ali, Md Rizwan Parvez | N/A | N/A |
| RelayAttention for Efficient Large Language Model Serving with Long System Prompts | Lei Zhu, Xinjiang Wang, Wayne Zhang, Rynson W. H. Lau | N/A | N/A |
| Boosting Language Models Reasoning with Chain-of-Knowledge Prompting | Jianing Wang, Qiushi Sun, Xiang Li, Ming Gao | N/A | N/A |
| Open Grounded Planning: Challenges and Benchmark Construction | Shiguang Guo, Ziliang Deng, Hongyu Lin, Yaojie Lu, Xianpei Han, Le Sun | N/A | N/A |
| WebCiteS: Attributed Query-Focused Summarization on Chinese Web Search Results with Citations | Haolin Deng, Chang Wang, Li Xin, Dezhang Yuan, Junlang Zhan, Tian Hua Zhou, Jin Ma, Jun Gao, Ruifeng Xu | N/A | N/A |
| LLM Knows Body Language, Too: Translating Speech Voices into Human Gestures | Chenghao Xu, Guangtao Lyu, Jiexi Yan, Muli Yang, Cheng Deng | N/A | N/A |
| QueryAgent: A Reliable and Efficient Reasoning Framework with Environmental Feedback based Self-Correction | Xiang Huang, Sitao Cheng, Shanshan Huang, Jiayu Shen, Yong Xu, Chaoyun Zhang, Yuzhong Qu | N/A | N/A |
| PITA: Prompting Task Interaction for Argumentation Mining | Yang Sun, Muyi Wang, Jianzhu Bao, Bin Liang, Xiaoyan Zhao, Caihua Yang, Min Yang, Ruifeng Xu | N/A | N/A |
| Shifting Attention to Relevance: Towards the Predictive Uncertainty Quantification of Free-Form Large Language Models | Jinhao Duan, Hao Cheng, Shiqi Wang, Alex Zavalny, Chenan Wang, Renjing Xu, Bhavya Kailkhura, Kaidi Xu | N/A | N/A |
| Babel-ImageNet: Massively Multilingual Evaluation of Vision-and-Language Representations | Gregor Geigle, Radu Timofte, Goran Glavaš | N/A | N/A |
| Estimating Agreement by Chance for Sequence Annotation | Diya Li, Carolyn Rose, Ao Yuan, Chunxiao Zhou | N/A | N/A |
| What Languages are Easy to Language-Model? A Perspective from Learning Probabilistic Regular Languages | Nadav Borenstein, Anej Svete, Robin Chan, Josef Valvoda, Franz Nowak, Isabelle Augenstein, Eleanor Chodroff, Ryan Cotterell | N/A | N/A |
| Are Emergent Abilities in Large Language Models just In-Context Learning? | Sheng Lu, Irina Bigoulaeva, Rachneet Singh Sachdeva, Harish Tayyar Madabushi, Iryna Gurevych | N/A | N/A |
| WaveCoder: Widespread And Versatile Enhancement For Code Large Language Models By Instruction Tuning | Zhaojian Yu, Xin Zhang, Ning Shang, Yangyu Huang, Can Xu, Yishujie Zhao, Wenxiang Hu, Qiufeng Yin | N/A | N/A |
| Eliciting Better Multilingual Structured Reasoning from LLMs through Code | Bryan Li, Tamer Alkhouli, Daniele Bonadiman, Nikolaos Pappas, Saab Mansour | N/A | N/A |
| OLIVE: Object Level In-Context Visual Embeddings | Timothy Ossowski, Junjie Hu | N/A | N/A |
| Quantifying Uncertainty in Answers from any Language Model and Enhancing their Trustworthiness | Jiuhai Chen, Jonas Mueller | N/A | N/A |
| Marathon: A Race Through the Realm of Long Context with Large Language Models | Lei Zhang, Yunshui Li, Ziqiang Liu, Jiaxi Yang, Junhao Liu, Longze Chen, Run Luo, Min Yang | N/A | N/A |
| Beyond Scaling: Predicting Patent Approval with Domain-specific Fine-grained Claim Dependency Graph | Xiaochen Kev Gao, Feng Yao, Kewen Zhao, Beilei He, Animesh Kumar, Vish Krishnan, Jingbo Shang | N/A | N/A |
| PCAD: Towards ASR-Robust Spoken Language Understanding via Prototype Calibration and Asymmetric Decoupling | Xianwei Zhuang, Xuxin Cheng, Liming Liang, Yuxin Xie, Zhichang Wang, Zhiqi Huang, Yuexian Zou | N/A | N/A |
| Rethinking the Multimodal Correlation of Multimodal Sequential Learning via Generalizable Attentional Results Alignment | Tao Jin, Wang Lin, Ye Wang, Linjun Li, Xize Cheng, Zhou Zhao | N/A | N/A |
| UHGEval: Benchmarking the Hallucination of Chinese Large Language Models via Unconstrained Generation | Xun Liang, Shichao Song, Simin Niu, Zhiyu Li, Feiyu Xiong, Bo Tang, Yezhaohui Wang, Dawei He, Cheng Peng, Zhonghao Wang, Haiying Deng | N/A | N/A |
| PreFLMR: Scaling Up Fine-Grained Late-Interaction Multi-modal Retrievers | Weizhe Lin, Jingbiao Mei, Jinghong Chen, Bill Byrne | N/A | N/A |
| Triple-Encoders: Representations That Fire Together, Wire Together | Justus-Jonas Erker, Florian Mai, Nils Reimers, Gerasimos Spanakis, Iryna Gurevych | N/A | N/A |
| Improving Hateful Meme Detection through Retrieval-Guided Contrastive Learning | Jingbiao Mei, Jinghong Chen, Weizhe Lin, Bill Byrne, Marcus Tomalin | N/A | N/A |
| Agent-Pro: Learning to Evolve via Policy-Level Reflection and Optimization | Wenqi Zhang, Ke Tang, Hai Wu, Mengna Wang, Yongliang Shen, Guiyang Hou, Zeqi Tan, Peng Li, Yueting Zhuang, Weiming Lu | N/A | N/A |
| Tree-Averaging Algorithms for Ensemble-Based Unsupervised Discontinuous Constituency Parsing | Behzad Shayegh, Yuqiao Wen, Lili Mou | N/A | N/A |
| Your Transformer is Secretly Linear | Anton Razzhigaev, Matvey Mikhalchuk, Elizaveta Goncharova, Nikolai Gerasimenko, Ivan Oseledets, Denis Dimitrov, Andrey Kuznetsov | N/A | N/A |
| Noise Correction on Subjective Datasets | Uthman Jinadu, Yi Ding | N/A | N/A |
| Generative Explore-Exploit: Training-free Optimization of Generative Recommender Systems using LLM Optimizers | Lütfi Kerem Senel, Besnik Fetahu, Davis Yoshida, Zhiyu Chen, Giuseppe Castellucci, Nikhita Vedula, Jason Ingyu Choi, Shervin Malmasi | N/A | N/A |
| Instruction-tuned Language Models are Better Knowledge Learners | Zhengbao Jiang, Zhiqing Sun, Weijia Shi, Pedro Rodriguez, Chunting Zhou, Graham Neubig, Xi Victoria Lin, Wen-tau Yih, Srini Iyer | N/A | N/A |
| What Do Language Models Hear? Probing for Auditory Representations in Language Models | Jerry Ngo, Yoon Kim | N/A | N/A |
| Threads of Subtlety: Detecting Machine-Generated Texts Through Discourse Motifs | Zae Myung Kim, Kwang Hee Lee, Preston Zhu, Vipul Raheja, Dongyeop Kang | N/A | N/A |
| Jailbreak Open-Sourced Large Language Models via Enforced Decoding | Hangfan Zhang, Zhimeng Guo, Huaisheng Zhu, Bochuan Cao, Lu Lin, Jinyuan Jia, Jinghui Chen, Dinghao Wu | N/A | N/A |
| NICE: To Optimize In-Context Examples or Not? | Pragya Srivastava, Satvik Golechha, Amit Deshpande, Amit Sharma | N/A | N/A |
| CodeScope: An Execution-based Multilingual Multitask Multidimensional Benchmark for Evaluating LLMs on Code Understanding and Generation | Weixiang Yan, Haitian Liu, Yunkun Wang, Yunzhe Li, Qian Chen, Wen Wang, Tingyu Lin, Weishan Zhao, Li Zhu, Hari Sundaram, Shuiguang Deng | N/A | N/A |
| Digital Socrates: Evaluating LLMs through Explanation Critiques | Yuling Gu, Oyvind Tafjord, Peter Clark | N/A | N/A |
| SafeDecoding: Defending against Jailbreak Attacks via Safety-Aware Decoding | Zhangchen Xu, Fengqing Jiang, Luyao Niu, Jinyuan Jia, Bill Yuchen Lin, Radha Poovendran | N/A | N/A |
| ArtPrompt: ASCII Art-based Jailbreak Attacks against Aligned LLMs | Fengqing Jiang, Zhangchen Xu, Luyao Niu, Zhen Xiang, Bhaskar Ramasubramanian, Bo Li, Radha Poovendran | N/A | N/A |
| Multi-Task Inference: Can Large Language Models Follow Multiple Instructions at Once? | Guijin Son, SangWon Baek, Sangdae Nam, Ilgyun Jeong, Seungone Kim | N/A | N/A |
| ChatDev: Communicative Agents for Software Development | Chen Qian, Wei Liu, Hongzhang Liu, Nuo Chen, Yufan Dang, Jiahao Li, Cheng Yang, Weize Chen, Yusheng Su, Xin Cong, Juyuan Xu, Dahai Li, Zhiyuan Liu, Maosong Sun | N/A | N/A |
| Experiential Co-Learning of Software-Developing Agents | Chen Qian, Yufan Dang, Jiahao Li, Wei Liu, Zihao Xie, YiFei Wang, Weize Chen, Cheng Yang, Xin Cong, Xiaoyin Che, Zhiyuan Liu, Maosong Sun | N/A | N/A |
| Learning Geometry-Aware Representations for New Intent Discovery | Kai Tang, Junbo Zhao, Xiao Ding, Runze Wu, Lei Feng, Gang Chen, Haobo Wang | N/A | N/A |
| Speaker Verification in Agent-generated Conversations | Yizhe Yang, Palakorn Achananuparp, Heyan Huang, Jing Jiang, Ee-Peng Lim | N/A | N/A |
| Benchmarking Data Science Agents | Yuge Zhang, Qiyang Jiang, Xingyu Han, Nan Chen, Yuqing Yang, Kan Ren | N/A | N/A |
| Language-Specific Neurons: The Key to Multilingual Capabilities in Large Language Models | Tianyi Tang, Wenyang Luo, Haoyang Huang, Dongdong Zhang, Xiaolei Wang, Xin Zhao, Furu Wei, Ji-Rong Wen | N/A | N/A |
| Forgetting before Learning: Utilizing Parametric Arithmetic for Knowledge Updating in Large Language Models | Shiwen Ni, Dingwei Chen, Chengming Li, Xiping Hu, Ruifeng Xu, Min Yang | N/A | N/A |
| A Deep Dive into the Trade-Offs of Parameter-Efficient Preference Alignment Techniques | Megh Thakkar, Quentin Fournier, Matthew D Riemer, Pin-Yu Chen, Amal Zouaq, Payel Das, Sarath Chandar | N/A | N/A |
| Zero-Shot Cross-Domain Dialogue State Tracking via Dual Low-Rank Adaptation | Xiang Luo, Zhiwen Tang, Jin Wang, Xuejie Zhang | N/A | N/A |
| PRP-Graph: Pairwise Ranking Prompting to LLMs with Graph Aggregation for Effective Text Re-ranking | Jian Luo, Xuanang Chen, Ben He, Le Sun | N/A | N/A |
| RepCodec: A Speech Representation Codec for Speech Tokenization | Zhichao Huang, Chutong Meng, Tom Ko | N/A | N/A |
| Disentangled Learning with Synthetic Parallel Data for Text Style Transfer | Jingxuan Han, Quan Wang, Zikang Guo, Benfeng Xu, Licheng Zhang, Zhendong Mao | N/A | N/A |
| GumbelSoft: Diversified Language Model Watermarking via the GumbelMax-trick | Jiayi Fu, Xuandong Zhao, Ruihan Yang, Yuansen Zhang, Jiangjie Chen, Yanghua Xiao | N/A | N/A |
| PsySafe: A Comprehensive Framework for Psychological-based Attack, Defense, and Evaluation of Multi-agent System Safety | Zaibin Zhang, Yongting Zhang, Lijun Li, Jing Shao, Hongzhi Gao, Yu Qiao, Lijun Wang, Huchuan Lu, Feng Zhao | N/A | N/A |
| Event-Radar: Event-driven Multi-View Learning for Multimodal Fake News Detection | Zihan Ma, Minnan Luo, Hao Guo, Zhi Zeng, Yiran Hao, Xiang Zhao | N/A | N/A |
| Fine-Grained Modeling of Narrative Context: A Coherence Perspective via Retrospective Questions | Liyan Xu, Jiangnan Li, Mo Yu, Jie Zhou | N/A | N/A |
| Stealthy Attack on Large Language Model based Recommendation | Jinghao Zhang, Yuting Liu, Qiang Liu, Shu Wu, Guibing Guo, Liang Wang | N/A | N/A |
| Multi-Dimensional Optimization for Text Summarization via Reinforcement Learning | Sangwon Ryu, Heejin Do, Yunsu Kim, Gary Lee, Jungseul Ok | N/A | N/A |
| Masked Thought: Simply Masking Partial Reasoning Steps Can Improve Mathematical Reasoning Learning of Language Models | Changyu Chen, Xiting Wang, Ting-En Lin, Ang Lv, Yuchuan Wu, Xin Gao, Ji-Rong Wen, Rui Yan, Yongbin Li | N/A | N/A |
| SEER: Facilitating Structured Reasoning and Explanation via Reinforcement Learning | Guoxin Chen, Kexin Tang, Chao Yang, Fuying Ye, Yu Qiao, Yiming Qian | N/A | N/A |
| Towards Robust and Generalized Parameter-Efficient Fine-Tuning for Noisy Label Learning | Yeachan Kim, Junho Kim, SangKeun Lee | N/A | N/A |
| SparseFlow: Accelerating Transformers by Sparsifying Information Flows | Yeachan Kim, SangKeun Lee | N/A | N/A |
| ProtT3: Protein-to-Text Generation for Text-based Protein Understanding | Zhiyuan Liu, An Zhang, Hao Fei, Enzhi Zhang, Xiang Wang, Kenji Kawaguchi, Tat-Seng Chua | N/A | N/A |
| KIEval: A Knowledge-grounded Interactive Evaluation Framework for Large Language Models | Zhuohao Yu, Chang Gao, Wenjin Yao, Yidong Wang, Wei Ye, Jindong Wang, Xing Xie, Yue Zhang, Shikun Zhang | N/A | N/A |
| EmoBench: Evaluating the Emotional Intelligence of Large Language Models | Sahand Sabour, Siyang Liu, Zheyuan Zhang, June M. Liu, Jinfeng Zhou, Alvionna Shiergetya Sunaryo, Tatia M.C. Lee, Rada Mihalcea, Minlie Huang | N/A | N/A |
| Can Large Language Models be Good Emotional Supporter? Mitigating Preference Bias on Emotional Support Conversation | Dongjin Kang, Sunghwan Kim, Taeyoon Kwon, Seungjun Moon, Hyunsouk Cho, Youngjae Yu, Dongha Lee, Jinyoung Yeo | N/A | N/A |
| Are AI-Generated Text Detectors Robust to Adversarial Perturbations? | Guanhua Huang, Yuchen Zhang, Zhe Li, Yongjian You, Mingze Wang, Zhouwang Yang | N/A | N/A |
| FinTextQA: A Dataset for Long-form Financial Question Answering | Jian Chen, Peilin Zhou, Yining Hua, Loh Ying Xin, Kehui Chen, Ziyuan Li, Bing Zhu, Junwei Liang | N/A | N/A |
| On Measuring Faithfulness or Self-consistency of Natural Language Explanations | Letitia Parcalabescu, Anette Frank | N/A | N/A |
| $\infty$Bench: Extending Long Context Evaluation Beyond 100K Tokens | Xinrong Zhang, Yingfa Chen, Shengding Hu, Zihang Xu, Junhao Chen, Moo Khai Hao, Xu Han, Zhen Leng Thai, Shuo Wang, Zhiyuan Liu, Maosong Sun | N/A | N/A |
| Learning or Self-aligning? Rethinking Instruction Fine-tuning | Mengjie Ren, Boxi Cao, Hongyu Lin, Cao Liu, Xianpei Han, Ke Zeng, Wan Guanglu, Xunliang Cai, Le Sun | N/A | N/A |
| Rethinking the Bounds of LLM Reasoning: Are Multi-Agent Discussions the Key? | Qineng Wang, Zihao Wang, Ying Su, Hanghang Tong, Yangqiu Song | N/A | N/A |
| Soft Knowledge Prompt: Help External Knowledge Become a Better Teacher to Instruct LLM in Knowledge-based VQA | Qunbo Wang, Ruyi Ji, Tianhao Peng, Wenjun Wu, Zechao Li, Jing Liu | N/A | N/A |
| TasTe: Teaching Large Language Models to Translate through Self-Reflection | Yutong Wang, Jiali Zeng, Xuebo Liu, Fandong Meng, Jie Zhou, Min Zhang | N/A | N/A |
| Not All Experts are Equal: Efficient Expert Pruning and Skipping for Mixture-of-Experts Large Language Models | Xudong Lu, Qi Liu, Yuhui Xu, Aojun Zhou, Siyuan Huang, Bo Zhang, Junchi Yan, Hongsheng Li | N/A | N/A |
| Natural Language Satisfiability: Exploring the Problem Distribution and Evaluating Transformer-based Language Models | Tharindu Madusanka, Ian Pratt-Hartmann, Riza Batista-Navarro | N/A | N/A |
| UNIMO-G: Unified Image Generation through Multimodal Conditional Diffusion | Wei Li, Xue Xu, Jiachen Liu, Xinyan Xiao | N/A | N/A |
| The Fine-Tuning Paradox: Boosting Translation Quality Without Sacrificing LLM Abilities | David Stap, Eva Hasler, Bill Byrne, Christof Monz, Ke Tran | N/A | N/A |
| Political Compass or Spinning Arrow? Towards More Meaningful Evaluations for Values and Opinions in Large Language Models | Paul Röttger, Valentin Hofmann, Valentina Pyatkin, Musashi Hinck, Hannah Rose Kirk, Hinrich Schuetze, Dirk Hovy | N/A | N/A |
| AI ‘News’ Content Farms Are Easy to Make and Hard to Detect: A Case Study in Italian | Giovanni Puccetti, Anna Rogers, Chiara Alzetta, Felice Dell’Orletta, Andrea Esuli | N/A | N/A |
| Blinded by Generated Contexts: How Language Models Merge Generated and Retrieved Contexts When Knowledge Conflicts? | Hexiang Tan, Fei Sun, Wanli Yang, Yuanzhuo Wang, Qi Cao, Xueqi Cheng | N/A | N/A |
| Unveiling Linguistic Regions in Large Language Models | Zhihao Zhang, Jun Zhao, Qi Zhang, Tao Gui, Xuanjing Huang | N/A | N/A |
| Text-to-Song: Towards Controllable Music Generation Incorporating Vocal and Accompaniment | Zhiqing Hong, Rongjie Huang, Xize Cheng, Yongqi Wang, Ruiqi Li, Fuming You, Zhou Zhao, Zhimeng Zhang | N/A | N/A |
| FastFiD: Improve Inference Efficiency of Open Domain Question Answering via Sentence Selection | Yufei Huang, Xu Han, Maosong Sun | N/A | N/A |
| Discursive Socratic Questioning: Evaluating the Faithfulness of Language Models’ Understanding of Discourse Relations | Yisong Miao, Hongfu Liu, Wenqiang Lei, Nancy F. Chen, Min-Yen Kan | N/A | N/A |
| An Open Multilingual System for Scoring Readability of Wikipedia | Mykola Trokhymovych, Indira Sen, Martin Gerlach | N/A | N/A |
| Unlearning Traces the Influential Training Data of Language Models | Masaru Isonuma, Ivan Titov | N/A | N/A |
| Exploring Alignment in Shared Cross-lingual Spaces | Basel Mousi, Nadir Durrani, Fahim Dalvi, Majd Hawasly, Ahmed Abdelali | N/A | N/A |
| Not All Countries Celebrate Thanksgiving: On the Cultural Dominance in Large Language Models | Wenxuan Wang, Wenxiang Jiao, Jingyuan Huang, Ruyi Dai, Jen-tse Huang, Zhaopeng Tu, Michael Lyu | N/A | N/A |
| Self-Evolving GPT: A Lifelong Autonomous Experiential Learner | Jinglong Gao, Xiao Ding, Yiming Cui, Jianbai Zhao, Hepeng Wang, Ting Liu, Bing Qin | N/A | N/A |
| WRP: Weight Recover Prune for Structured Sparsity | Zhendong Tan, Xingjun Zhang, Zheng Wei | N/A | N/A |
| Error-preserving Automatic Speech Recognition of Young English Learners’ Language | Janick Michot, Manuela Hürlimann, Jan Milan Deriu, Luzia Sauer, Katsiaryna Mlynchyk, Mark Cieliebak | N/A | N/A |
| DiFiNet: Boundary-Aware Semantic Differentiation and Filtration Network for Nested Named Entity Recognition | Yuxiang Cai, Qiao Liu, Yanglei Gan, Run Lin, Changlin Li, Xueyi Liu, Da Luo, Jiaye Yang | N/A | N/A |
| Legal Case Retrieval: A Survey of the State of the Art | Yi Feng, Chuanyi Li, Vincent Ng | N/A | N/A |
| Same Task, More Tokens: the Impact of Input Length on the Reasoning Performance of Large Language Models | Mosh Levy, Alon Jacoby, Yoav Goldberg | N/A | N/A |
| Benchmarking and Improving Compositional Generalization of Multi-aspect Controllable Text Generation | Tianqi Zhong, Zhaoyi Li, Quan Wang, Linqi Song, Ying Wei, Defu Lian, Zhendong Mao | N/A | N/A |
| LLaMA Pro: Progressive LLaMA with Block Expansion | Chengyue Wu, Yukang Gan, Yixiao Ge, Zeyu Lu, Jiahao Wang, Ye Feng, Ying Shan, Ping Luo | N/A | N/A |
| Generating Contrastive Narratives Using the Brownian Bridge Process for Narrative Coherence Learning | Feiteng Mu, Wenjie Li | N/A | N/A |
| A Causal Approach for Counterfactual Reasoning in Narratives | Feiteng Mu, Wenjie Li | N/A | N/A |
| SIP: Injecting a Structural Inductive Bias into a Seq2Seq Model by Simulation | Matthias Lindemann, Alexander Koller, Ivan Titov | N/A | N/A |
| The Hidden Space of Transformer Language Adapters | Jesujoba Oluwadara Alabi, Marius Mosbach, Matan Eyal, Dietrich Klakow, Mor Geva | N/A | N/A |
| A Ship of Theseus: Curious Cases of Paraphrasing in LLM-Generated Texts | Nafis Irtiza Tripto, Saranya Venkatraman, Dominik Macko, Robert Moro, Ivan Srba, Adaku Uchendu, Thai Le, Dongwon Lee | N/A | N/A |
| Advancing Large Language Models to Capture Varied Speaking Styles and Respond Properly in Spoken Conversations | Guan-Ting Lin, Cheng-Han Chiang, Hung-yi Lee | N/A | N/A |
| RetinaQA: A Robust Knowledge Base Question Answering Model for both Answerable and Unanswerable Questions | Prayushi Faldu, Indrajit Bhattacharya, Mausam | N/A | N/A |
| GroundingGPT: Language Enhanced Multi-modal Grounding Model | Zhaowei Li, Xu Qi, Dong Zhang, Hang Song, YiQing Cai, Qi Qi, Ran Zhou, Junting Pan, Zefeng Li, Vu Van Tu, Zhida Huang, Tao Wang | N/A | N/A |
| Automated Justification Production for Claim Veracity in Fact Checking: A Survey on Architectures and Approaches | Islam Eldifrawi, Shengrui Wang, Amine Trabelsi | N/A | N/A |
| Decoupled Vocabulary Learning Enables Zero-Shot Translation from Unseen Languages | Carlos Mullov, Quan Pham, Alexander Waibel | N/A | N/A |
| SwapMoE: Serving Off-the-shelf MoE-based Large Language Models with Tunable Memory Budget | Rui Kong, Yuanchun Li, Qingtian Feng, Weijun Wang, Xiaozhou Ye, Ye Ouyang, Linghe Kong, Yunxin Liu | N/A | N/A |
| PixT3: Pixel-based Table-To-Text Generation | Iñigo Alonso, Eneko Agirre, Mirella Lapata | N/A | N/A |
| Narrowing the Knowledge Evaluation Gap: Open-Domain Question Answering with Multi-Granularity Answers | Gal Yona, Roee Aharoni, Mor Geva | N/A | N/A |
| TAMS: Translation-Assisted Morphological Segmentation | Enora Rice, Ali Marashian, Luke Gessler, Alexis Palmer, Katharina von der Wense | N/A | N/A |
| Disambiguate Words like Composing Them: A Morphology-Informed Approach to Enhance Chinese Word Sense Disambiguation | Yue Wang, Qiliang Liang, Yaqi Yin, Hansi Wang, Yang Liu | N/A | N/A |
| XCodeEval: An Execution-based Large Scale Multilingual Multitask Benchmark for Code Understanding, Generation, Translation and Retrieval | Mohammad Abdullah Matin Khan, M Saiful Bari, Do Xuan Long, Weishi Wang, Md Rizwan Parvez, Shafiq Joty | N/A | N/A |
| ProxyQA: An Alternative Framework for Evaluating Long-Form Text Generation with Large Language Models | Haochen Tan, Zhijiang Guo, Zhan Shi, Lu Xu, Zhili Liu, Yunlong Feng, Xiaoguang Li, Yasheng Wang, Lifeng Shang, Qun Liu, Linqi Song | N/A | N/A |
| A Glitch in the Matrix? Locating and Detecting Language Model Grounding with Fakepedia | Giovanni Monea, Maxime Peyrard, Martin Josifoski, Vishrav Chaudhary, Jason Eisner, Emre Kiciman, Hamid Palangi, Barun Patra, Robert West | N/A | N/A |
| Muffin or Chihuahua? Challenging Multimodal Large Language Models with Multipanel VQA | Yue Fan, Jing Gu, Kaiwen Zhou, Qianqi Yan, Shan Jiang, Ching-Chen Kuo, Yang Zhao, Xinze Guan, Xin Eric Wang | N/A | N/A |
| WebVoyager: Building an End-to-End Web Agent with Large Multimodal Models | Hongliang He, Wenlin Yao, Kaixin Ma, Wenhao Yu, Yong Dai, Hongming Zhang, Zhenzhong Lan, Dong Yu | N/A | N/A |
| Translation-based Lexicalization Generation and Lexical Gap Detection: Application to Kinship Terms | Senyu Li, Bradley Hauer, Ning Shi, Grzegorz Kondrak | N/A | N/A |
| Leveraging Machine-Generated Rationales to Facilitate Social Meaning Detection in Conversations | Ritam Dutt, Zhen Wu, Jiaxin Shi, Divyanshu Sheth, Prakhar Gupta, Carolyn Rose | N/A | N/A |
| Robust Frame-Semantic Models with Lexical Unit Trees and Negative Samples | Jacob Devasier, Yogesh Gurjar, Chengkai Li | N/A | N/A |
| Do Llamas Work in English? On the Latent Language of Multilingual Transformers | Chris Wendler, Veniamin Veselovsky, Giovanni Monea, Robert West | N/A | N/A |
| Harnessing the Power of Large Language Models for Natural Language to First-Order Logic Translation | Yuan Yang, Siheng Xiong, Ali Payani, Ehsan Shareghi, Faramarz Fekri | N/A | N/A |
| Lightweight reranking for language model generations | Siddhartha Jain, Xiaofei Ma, Anoop Deoras, Bing Xiang | N/A | N/A |
| ARIES: A Corpus of Scientific Paper Edits Made in Response to Peer Reviews | Mike D’Arcy, Alexis Ross, Erin Bransom, Bailey Kuehl, Jonathan Bragg, Tom Hope, Doug Downey | N/A | N/A |
| The Unreasonable Effectiveness of Easy Training Data for Hard Tasks | Peter Hase, Mohit Bansal, Peter Clark, Sarah Wiegreffe | N/A | N/A |
| PLUG: Leveraging Pivot Language in Cross-Lingual Instruction Tuning | Zhihan Zhang, Dong-Ho Lee, Yuwei Fang, Wenhao Yu, Mengzhao Jia, Meng Jiang, Francesco Barbieri | N/A | N/A |
| MIDGARD: Self-Consistency Using Minimum Description Length for Structured Commonsense Reasoning | Inderjeet Jayakumar Nair, Lu Wang | N/A | N/A |
| ReConcile: Round-Table Conference Improves Reasoning via Consensus among Diverse LLMs | Justin Chen, Swarnadeep Saha, Mohit Bansal | N/A | N/A |
| Mirror: Multiple-perspective Self-Reflection Method for Knowledge-rich Reasoning | Hanqi Yan, Qinglin Zhu, Xinyu Wang, Lin Gui, Yulan He | N/A | N/A |
| Where Do People Tell Stories Online? Story Detection Across Online Communities | Maria Antoniak, Joel Mire, Maarten Sap, Elliott Ash, Andrew Piper | N/A | N/A |
| Large Language Models Are No Longer Shallow Parsers | Yuanhe Tian, Fei Xia, Yan Song | N/A | N/A |
| Dialogue Summarization with Mixture of Experts based on Large Language Models | Yuanhe Tian, Fei Xia, Yan Song | N/A | N/A |
| ChiMed-GPT: A Chinese Medical Large Language Model with Full Training Regime and Better Alignment to Human Preferences | Yuanhe Tian, Ruyi Gan, Yan Song, Jiaxing Zhang, Yongdong Zhang | N/A | N/A |
| An Investigation of Neuron Activation as a Unified Lens to Explain Chain-of-Thought Eliciting Arithmetic Reasoning of LLMs | Daking Rai, Ziyu Yao | N/A | N/A |
| Leveraging Large Language Models for Learning Complex Legal Concepts through Storytelling | Hang Jiang, Xiajie Zhang, Robert Mahari, Daniel Kessler, Eric Ma, Tal August, Irene Li, Alex Pentland, Yoon Kim, Deb Roy, Jad Kabbara | N/A | N/A |
| Intrinsic Task-based Evaluation for Referring Expression Generation | Guanyi Chen, Fahime Same, Kees Van Deemter | N/A | N/A |
| From Moments to Milestones: Incremental Timeline Summarization Leveraging Large Language Models | Qisheng Hu, Geonsik Moon, Hwee Tou Ng | N/A | N/A |
| End-to-end Learning of Logical Rules for Enhancing Document-level Relation Extraction | Kunxun Qi, Jianfeng Du, Hai Wan | N/A | N/A |
| Can We Achieve High-quality Direct Speech-to-Speech Translation without Parallel Speech Data? | Qingkai Fang, Shaolei Zhang, Zhengrui Ma, Min Zhang, Yang Feng | N/A | N/A |
| Enhancing EEG-to-Text Decoding through Transferable Representations from Pre-trained Contrastive EEG-Text Masked Autoencoder | Jiaqi Wang, Zhenxi Song, Zhengyu Ma, Xipeng Qiu, Min Zhang, Zhiguo Zhang | N/A | N/A |
| G-DIG: Towards Gradient-based DIverse and hiGh-quality Instruction Data Selection for Machine Translation | Xingyuan Pan, Luyang Huang, Liyan Kang, Zhicheng Liu, Yu Lu, Shanbo Cheng | N/A | N/A |
| CQIL: Inference Latency Optimization with Concurrent Computation of Quasi-Independent Layers | Longwei Zou, Qingyang Wang, Han Zhao, Jiangang Kong, Yi Yang, Yangdong Deng | N/A | N/A |
| Prompt Optimization via Adversarial In-Context Learning | Do Xuan Long, Yiran Zhao, Hannah Brown, Yuxi Xie, James Xu Zhao, Nancy F. Chen, Kenji Kawaguchi, Michael Shieh, Junxian He | N/A | N/A |
| StreamVoice: Streamable Context-Aware Language Modeling for Real-time Zero-Shot Voice Conversion | Zhichao Wang, Yuanzhe Chen, Xinsheng Wang, Lei Xie, Yuping Wang | N/A | N/A |
| Generate-then-Ground in Retrieval-Augmented Generation for Multi-hop Question Answering | Zhengliang Shi, Shuo Zhang, Weiwei Sun, Shen Gao, Pengjie Ren, Zhumin Chen, Zhaochun Ren | N/A | N/A |
| Multimodal Contextualized Semantic Parsing from Speech | Jordan Voas, David Harwath, Ray Mooney | N/A | N/A |
| LaMP: When Large Language Models Meet Personalization | Alireza Salemi, Sheshera Mysore, Michael Bendersky, Hamed Zamani | N/A | N/A |
| AboutMe: Using Self-Descriptions in Webpages to Document the Effects of English Pretraining Data Filters | Li Lucy, Suchin Gururangan, Luca Soldaini, Emma Strubell, David Bamman, Lauren Klein, Jesse Dodge | N/A | N/A |
| MT-Bench-101: A Fine-Grained Benchmark for Evaluating Large Language Models in Multi-Turn Dialogues | Ge Bai, Jie Liu, Xingyuan Bu, Yancheng He, Jiaheng Liu, Zhanhui Zhou, Zhuoran Lin, Wenbo Su, Tiezheng Ge, Bo Zheng, Wanli Ouyang | N/A | N/A |
| EFSA: Towards Event-Level Financial Sentiment Analysis | Tianyu Chen, Yiming Zhang, Guoxin Yu, Dapeng Zhang, Li Zeng, Qing He, Xiang Ao | N/A | N/A |
| Media Framing: A typology and Survey of Computational Approaches Across Disciplines | Yulia Otmakhova, Shima Khanehzar, Lea Frermann | N/A | N/A |
| What Evidence Do Language Models Find Convincing? | Alexander Wan, Eric Wallace, Dan Klein | N/A | N/A |
| Advancement in Graph Understanding: A Multimodal Benchmark and Fine-Tuning of Vision-Language Models | Qihang Ai, Jiafan Li, Jincheng Dai, Jianwu Zhou, Lemao Liu, Haiyun Jiang, Shuming Shi | N/A | N/A |
| LangBridge: Multilingual Reasoning Without Multilingual Supervision | Dongkeun Yoon, Joel Jang, Sungdong Kim, Seungone Kim, Sheikh Shafayat, Minjoon Seo | N/A | N/A |
| Can LLMs Reason with Rules? Logic Scaffolding for Stress-Testing and Improving LLMs | Siyuan Wang, Zhongyu Wei, Yejin Choi, Xiang Ren | N/A | N/A |
| SEGO: Sequential Subgoal Optimization for Mathematical Problem-Solving | Xueliang Zhao, Xinting Huang, Wei Bi, Lingpeng Kong | N/A | N/A |
| Unlocking the Power of Large Language Models for Entity Alignment | Xuhui Jiang, Yinghan Shen, Zhichao Shi, Chengjin Xu, Wei Li, Zixuan Li, Jian Guo, Huawei Shen, Yuanzhuo Wang | N/A | N/A |
| SPZ: A Semantic Perturbation-based Data Augmentation Method with Zonal-Mixing for Alzheimer’s Disease Detection | FangFang Li, Cheng Huang, PuZhen Su, Jie Yin | N/A | N/A |
| Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents | Yifan Song, Da Yin, Xiang Yue, Jie Huang, Sujian Li, Bill Yuchen Lin | N/A | N/A |
| ReFT: Reasoning with Reinforced Fine-Tuning | Luong Quoc Trung, Xinbo Zhang, Zhanming Jie, Peng Sun, Xiaoran Jin, Hang Li | N/A | N/A |
| Cognitive Visual-Language Mapper: Advancing Multimodal Comprehension with Enhanced Visual Knowledge Alignment | Yunxin Li, Xinyu Chen, Baotian Hu, Haoyuan Shi, Min Zhang | N/A | N/A |
| FreeCtrl: Constructing Control Centers with Feedforward Layers for Learning-Free Controllable Text Generation | Zijian Feng, Hanzhang Zhou, Kezhi Mao, Zixiao Zhu | N/A | N/A |
| HD-Eval: Aligning Large Language Model Evaluators Through Hierarchical Criteria Decomposition | Yuxuan Liu, Tianchi Yang, Shaohan Huang, Zihan Zhang, Haizhen Huang, Furu Wei, Weiwei Deng, Feng Sun, Qi Zhang | N/A | N/A |
| Conundrums in Cross-Prompt Automated Essay Scoring: Making Sense of the State of the Art | Shengjie Li, Vincent Ng | N/A | N/A |
| Angry Men, Sad Women: Large Language Models Reflect Gendered Stereotypes in Emotion Attribution | Flor Miriam Plaza-del-Arco, Amanda Cercas Curry, Alba Cercas Curry, Gavin Abercrombie, Dirk Hovy | N/A | N/A |
| Label Augmentation for Zero-Shot Hierarchical Text Classification | Lorenzo Paletto, Valerio Basile, Roberto Esposito | N/A | N/A |
| STICKERCONV: Generating Multimodal Empathetic Responses from Scratch | Yiqun Zhang, Fanheng Kong, Peidong Wang, Shuang Sun, SWangLing, Shi Feng, Daling Wang, Yifei Zhang, Kaisong Song | N/A | N/A |
| EIT: Enhanced Interactive Transformer | Tong Zheng, Bei Li, Huiwen Bao, Tong Xiao, JingBo Zhu | N/A | N/A |
| MARS: Meaning-Aware Response Scoring for Uncertainty Estimation in Generative LLMs | Yavuz Faruk Bakman, Duygu Nur Yaldiz, Baturalp Buyukates, Chenyang Tao, Dimitrios Dimitriadis, Salman Avestimehr | N/A | N/A |
| EXAMS-V: A Multi-Discipline Multilingual Multimodal Exam Benchmark for Evaluating Vision Language Models | Rocktim Jyoti Das, Simeon Emilov Hristov, Haonan Li, Dimitar Iliyanov Dimitrov, Ivan Koychev, Preslav Nakov | N/A | N/A |
| Order-Agnostic Data Augmentation for Few-Shot Named Entity Recognition | Huiming Wang, Liying Cheng, Wenxuan Zhang, De Wen Soh, Lidong Bing | N/A | N/A |
| Text Embedding Inversion Security for Multilingual Language Models | Yiyi Chen, Heather Lent, Johannes Bjerva | N/A | N/A |
| Large Language Models are Superpositions of All Characters: Attaining Arbitrary Role-play via Self-Alignment | Keming Lu, Bowen Yu, Chang Zhou, Jingren Zhou | N/A | N/A |
| Calibrating Large Language Models Using Their Generations Only | Dennis Thomas Ulmer, Martin Gubri, Hwaran Lee, Sangdoo Yun, Seong Joon Oh | N/A | N/A |
| PlatoLM: Teaching LLMs in Multi-Round Dialogue via a User Simulator | Chuyi Kong, Yaxin Fan, Xiang Wan, Feng Jiang, Benyou Wang | N/A | N/A |
| Synthesizing Text-to-SQL Data from Weak and Strong LLMs | Jiaxi Yang, Binyuan Hui, Min Yang, Jian Yang, Junyang Lin, Chang Zhou | N/A | N/A |
| Iterative Forward Tuning Boosts In-Context Learning in Language Models | Jiaxi Yang, Binyuan Hui, Min Yang, Bailin Wang, Bowen Li, Binhua Li, Fei Huang, Yongbin Li | N/A | N/A |
| STRUCTSUM Generation for Faster Text Comprehension | Parag Jain, Andreea Marzoca, Francesco Piccinno | N/A | N/A |
| Analysing The Impact of Sequence Composition on Language Model Pre-Training | Yu Zhao, Yuanbin Qu, Konrad Staniszewski, Szymon Tworkowski, Wei Liu, Piotr Miłoś, Yuxiang Wu, Pasquale Minervini | N/A | N/A |
| NACL: A General and Effective KV Cache Eviction Framework for LLM at Inference Time | Yilong Chen, Guoxia Wang, Junyuan Shang, Shiyao Cui, Zhenyu Zhang, Tingwen Liu, Shuohuan Wang, Yu Sun, Dianhai Yu, Hua Wu | N/A | N/A |
| SpikeVoice: High-Quality Text-to-Speech Via Efficient Spiking Neural Network | Kexin Wang, Jiahong Zhang, Yong Ren, Man Yao, Di Shang, Bo Xu, Guoqi Li | N/A | N/A |
| Context-aware Difference Distilling for Multi-change Captioning | Yunbin Tu, Liang Li, Li Su, Zheng-Jun Zha, Chenggang Yan, Qingming Huang | N/A | N/A |
| Dataflow-Guided Retrieval Augmentation for Repository-Level Code Completion | Wei Cheng, Yuhan Wu, Wei Hu | N/A | N/A |
| Chain-of-Exemplar: Enhancing Distractor Generation for Multimodal Educational Question Generation | Haohao Luo, Yang Deng, Ying Shen, See-Kiong Ng, Tat-Seng Chua | N/A | N/A |
| LLMEmbed: Rethinking Lightweight LLM’s Genuine Function in Text Classification | Chun Liu, Hongguang Zhang, Kainan Zhao, Xinghai Ju, Lin Yang | N/A | N/A |
| LEMON: Reviving Stronger and Smaller LMs from Larger LMs with Linear Parameter Fusion | Yilong Chen, Junyuan Shang, Zhenyu Zhang, Shiyao Cui, Tingwen Liu, Shuohuan Wang, Yu Sun, Hua Wu | N/A | N/A |
| Speech Sense Disambiguation: Tackling Homophone Ambiguity in End-to-End Speech Translation | Tengfei Yu, Xuebo Liu, Liang Ding, Kehai Chen, Dacheng Tao, Min Zhang | N/A | N/A |
| To be Continuous, or to be Discrete, Those are Bits of Questions | Yiran Wang, Masao Utiyama | N/A | N/A |
| Moûsai: Efficient Text-to-Music Diffusion Models | Flavio Schneider, Ojasv Kamal, Zhijing Jin, Bernhard Schölkopf | N/A | N/A |
| PokeMQA: Programmable knowledge editing for Multi-hop Question Answering | Hengrui Gu, Kaixiong Zhou, Xiaotian Han, Ninghao Liu, Ruobing Wang, Xin Wang | N/A | N/A |
| MemeGuard: An LLM and VLM-based Framework for Advancing Content Moderation via Meme Intervention | Prince Jha, Raghav Jain, Konika Mandal, Aman Chadha, Sriparna Saha, Pushpak Bhattacharyya | N/A | N/A |
| Efficient OCR for Building a Diverse Digital History | Jacob Carlson, Tom Bryan, Melissa Dell | N/A | N/A |
| Acquiring Clean Language Models from Backdoor Poisoned Datasets by Downscaling Frequency Space | Zongru Wu, Zhuosheng Zhang, Pengzhou Cheng, Gongshen Liu | N/A | N/A |
| ANAH: Analytical Annotation of Hallucinations in Large Language Models | Ziwei Ji, Yuzhe Gu, Wenwei Zhang, Chengqi Lyu, Dahua Lin, Kai Chen | N/A | N/A |
| Aligning Large Language Models for Controllable Recommendations | Wensheng Lu, Jianxun Lian, Wei Zhang, Guanghua Li, Mingyang Zhou, Hao Liao, Xing Xie | N/A | N/A |
| Revealing the Parametric Knowledge of Language Models: A Unified Framework for Attribution Methods | Haeun Yu, Pepa Atanasova, Isabelle Augenstein | N/A | N/A |
| Pride and Prejudice: LLM Amplifies Self-Bias in Self-Refinement | Wenda Xu, Guanglei Zhu, Xuandong Zhao, Liangming Pan, Lei Li, William Yang Wang | N/A | N/A |
| Full Parameter Fine-tuning for Large Language Models with Limited Resources | Kai Lv, Yuqing Yang, Tengxiao Liu, Qipeng Guo, Xipeng Qiu | N/A | N/A |
| M$^3$CoT: A Novel Benchmark for Multi-Domain Multi-step Multi-modal Chain-of-Thought | Qiguang Chen, Libo Qin, Jin Zhang, Zhi Chen, Xiao Xu, Wanxiang Che | N/A | N/A |
| Long Context is Not Long at All: A Prospector of Long-Dependency Data for Large Language Models | Longze Chen, Ziqiang Liu, Wanwei He, Yinhe Zheng, Hao Sun, Yunshui Li, Run Luo, Min Yang | N/A | N/A |
| Label-Synchronous Neural Transducer for E2E Simultaneous Speech Translation | Keqi Deng, Phil Woodland | N/A | N/A |
| Hard Prompts Made Interpretable: Sparse Entropy Regularization for Prompt Tuning with RL | Yunseon Choi, Sangmin Bae, Seonghyun Ban, Minchan Jeong, Chuheng Zhang, Lei Song, Li Zhao, Jiang Bian, Kee-Eung Kim | N/A | N/A |
| A Modular Approach for Multimodal Summarization of TV Shows | Louis Mahon, Mirella Lapata | N/A | N/A |
| Think Twice: Perspective-Taking Improves Large Language Models’ Theory-of-Mind Capabilities | Alex Wilf, Sihyun Shawn Lee, Paul Pu Liang, Louis-Philippe Morency | N/A | N/A |
| BizBench: A Quantitative Reasoning Benchmark for Business and Finance | Michael Krumdick, Rik Koncel-Kedziorski, Viet Dac Lai, Varshini Reddy, Charles Lovering, Chris Tanner | N/A | N/A |
| Direct Metric Optimization for Image Captioning through Reward-Weighted Augmented Data Utilization | Takumi Takada, Yuma Suzuki, Hiroki Takushima, Hayato Tanoue, Haruki Sato, Aiswariya Manoj Kumar, Hiroki Nishihara, Takayuki Hori, Kazuya Ueki | N/A | N/A |
| Deciphering Hate: Identifying Hateful Memes and Their Targets | Eftekhar Hossain, Omar Sharif, Mohammed Moshiul Hoque, Sarah Masud Preum | N/A | N/A |
| Inducing Systematicity in Transformers by Attending to Structurally Quantized Embeddings | Yichen Jiang, Xiang Zhou, Mohit Bansal | N/A | N/A |
| Label-Efficient Model Selection for Text Generation | Shir Ashury Tahan, Ariel Gera, Benjamin Sznajder, Leshem Choshen, Liat Ein-Dor, Eyal Shnarch | N/A | N/A |
| Machine Unlearning of Pre-trained Large Language Models | Jin Yao, Eli Chien, Minxin Du, Xinyao Niu, Tianhao Wang, Zezhou Cheng, Xiang Yue | N/A | N/A |
| Competition of Mechanisms: Tracing How Language Models Handle Facts and Counterfactuals | Francesco Ortu, Zhijing Jin, Diego Doimo, Mrinmaya Sachan, Alberto Cazzaniga, Bernhard Schölkopf | N/A | N/A |
| FactPICO: Factuality Evaluation for Plain Language Summarization of Medical Evidence | Sebastian Antony Joseph, Lily Chen, Jan Trienes, Hannah Louisa Göke, Monika Coers, Wei Xu, Byron C Wallace, Junyi Jessy Li | N/A | N/A |
| BvSP: Broad-view Soft Prompting for Few-Shot Aspect Sentiment Quad Prediction | Yinhao Bai, Yalan Xie, Xiaoyi Liu, Yuhua Zhao, Zhixin Han, Mengting Hu, Hang Gao, Renhong Cheng | N/A | N/A |
| Safety Alignment in NLP Tasks: Weakly Aligned Summarization as an In-Context Attack | Yu Fu, Yufei Li, Wen Xiao, Cong Liu, Yue Dong | N/A | N/A |
| Language Complexity and Speech Recognition Accuracy: Orthographic Complexity Hurts, Phonological Complexity Doesn’t | Chihiro Taguchi, David Chiang | N/A | N/A |
| Speech language models lack important brain-relevant semantics | Subba Reddy Oota, Emin Çelik, Fatma Deniz, Mariya Toneva | N/A | N/A |
| DocLLM: A Layout-Aware Generative Language Model for Multimodal Document Understanding | Dongsheng Wang, Natraj Raman, Mathieu Sibue, Zhiqiang Ma, Petr Babkin, Simerjot Kaur, Yulong Pei, Armineh Nourbakhsh, Xiaomo Liu | N/A | N/A |
| Bypassing LLM Watermarks with Color-Aware Substitutions | Qilong Wu, Varun Chandrasekaran | N/A | N/A |
| Parallel Structures in Pre-training Data Yield In-Context Learning | Yanda Chen, Chen Zhao, Zhou Yu, Kathleen McKeown, He He | N/A | N/A |
| OpenToM: A Comprehensive Benchmark for Evaluating Theory-of-Mind Reasoning Capabilities of Large Language Models | Hainiu Xu, Runcong Zhao, Lixing Zhu, Jinhua Du, Yulan He | N/A | N/A |
| Towards Privacy-Aware Sign Language Translation at Scale | Phillip Rust, Bowen Shi, Skyler Wang, Necati Cihan Camgoz, Jean Maillard | N/A | N/A |
| Arithmetic Control of LLMs for Diverse User Preferences: Directional Preference Alignment with Multi-Objective Rewards | Haoxiang Wang, Yong Lin, Wei Xiong, Rui Yang, Shizhe Diao, Shuang Qiu, Han Zhao, Tong Zhang | N/A | N/A |
| Towards Real-World Writing Assistance: A Chinese Character Checking Benchmark with Faked and Misspelled Characters | Yinghui Li, Zishan Xu, Shaoshen Chen, Haojing Huang, Yangning Li, Shirong Ma, Yong Jiang, Zhongli Li, Qingyu Zhou, Hai-Tao Zheng, Ying Shen | N/A | N/A |
| Steering Llama 2 via Contrastive Activation Addition | Nina Rimsky, Nick Gabrieli, Julian Schulz, Meg Tong, Evan J Hubinger, Alexander Matt Turner | N/A | N/A |
| RAVEL: Evaluating Interpretability Methods on Disentangling Language Model Representations | Jing Huang, Zhengxuan Wu, Christopher Potts, Mor Geva, Atticus Geiger | N/A | N/A |
| Large Language Models as Zero-shot Dialogue State Tracker through Function Calling | Zekun Li, Zhiyu Chen, Mike Ross, Patrick Huber, Seungwhan Moon, Zhaojiang Lin, Xin Luna Dong, Adithya Sagar, Xifeng Yan, Paul A. Crook | N/A | N/A |
| Faithful Chart Summarization with ChaTS-Pi | Syrine Krichene, Francesco Piccinno, Fangyu Liu, Julian Martin Eisenschlos | N/A | N/A |
| Enhancing Dialogue State Tracking Models through LLM-backed User-Agents Simulation | Cheng Niu, Xingguang Wang, Xuxin Cheng, Juntong Song, Tong Zhang | N/A | N/A |
| MetaSumPerceiver: Multimodal Multi-Document Evidence Summarization for Fact-Checking | Ting-Chih Chen, Chia-Wei Tang, Chris Thomas | N/A | N/A |
| KnowCoder: Coding Structured Knowledge into LLMs for Universal Information Extraction | Zixuan Li, Yutao Zeng, Yuxin Zuo, Weicheng Ren, Wenxuan Liu, Miao Su, Yucan Guo, Yantao Liu, Xiang Li, Zhilei Hu, Long Bai, Wei Li, Yidan Liu, Pan Yang, Xiaolong Jin, Jiafeng Guo, Xueqi Cheng | N/A | N/A |
| ERA-CoT: Improving Chain-of-Thought through Entity Relationship Analysis | Yanming Liu, Xinyue Peng, Tianyu Du, Jianwei Yin, Weihao Liu, Xuhong Zhang | N/A | N/A |
| EconAgent: Large Language Model-Empowered Agents for Simulating Macroeconomic Activities | Nian Li, Chen Gao, Mingyu Li, Yong Li, Qingmin Liao | N/A | N/A |
| On the Multi-turn Instruction Following for Conversational Web Agents | Yang Deng, Xuan Zhang, Wenxuan Zhang, Yifei Yuan, See-Kiong Ng, Tat-Seng Chua | N/A | N/A |
| Mobile-Bench: An Evaluation Benchmark for LLM-based Mobile Agents | Shihan Deng, Weikai Xu, Hongda Sun, Wei Liu, Tao Tan, Jianfeng Liu, Ang Li, Jian Luan, Bin Wang, Rui Yan, Shuo Shang | N/A | N/A |
| MC$^2$: Towards Transparent and Culturally-Aware NLP for Minority Languages in China | Chen Zhang, Mingxu Tao, Quzhe Huang, Jiuheng Lin, Zhibin Chen, Yansong Feng | N/A | N/A |
| Decoder-only Streaming Transformer for Simultaneous Translation | Shoutao Guo, Shaolei Zhang, Yang Feng | N/A | N/A |
| Defending Large Language Models Against Jailbreaking Attacks Through Goal Prioritization | Zhexin Zhang, Junxiao Yang, Pei Ke, Fei Mi, Hongning Wang, Minlie Huang | N/A | N/A |
| I am a Strange Dataset: Metalinguistic Tests for Language Models | Tristan Thrush, Jared Moore, Miguel Monares, Christopher Potts, Douwe Kiela | N/A | N/A |
| SafetyBench: Evaluating the Safety of Large Language Models | Zhexin Zhang, Leqi Lei, Lindong Wu, Rui Sun, Yongkang Huang, Chong Long, Xiao Liu, Xuanyu Lei, Jie Tang, Minlie Huang | N/A | N/A |
| Deciphering Oracle Bone Language with Diffusion Models | Haisu Guan, Huanxin Yang, Xinyu Wang, Shengwei Han, Yongge Liu, Lianwen Jin, Xiang Bai, Yuliang Liu | N/A | N/A |
| TruthX: Alleviating Hallucinations by Editing Large Language Models in Truthful Space | Shaolei Zhang, Tian Yu, Yang Feng | N/A | N/A |
| ProtLLM: An Interleaved Protein-Language LLM with Protein-as-Word Pre-Training | Le Zhuo, Zewen Chi, Minghao Xu, Heyan Huang, Jianan Zhao, Heqi Zheng, Conghui He, Xian-Ling Mao, Wentao Zhang | N/A | N/A |
| StreamSpeech: Simultaneous Speech-to-Speech Translation with Multi-task Learning | Shaolei Zhang, Qingkai Fang, Shoutao Guo, Zhengrui Ma, Min Zhang, Yang Feng | N/A | N/A |
| Investigating Multi-Hop Factual Shortcuts in Knowledge Editing of Large Language Models | Tianjie Ju, Yijin Chen, Xinwei Yuan, Zhuosheng Zhang, Wei Du, Yubin Zheng, Gongshen Liu | N/A | N/A |
| Why Don’t Prompt-Based Fairness Metrics Correlate? | Abdelrahman Zayed, Goncalo Mordido, Ioana Baldini, Sarath Chandar | N/A | N/A |
| NaijaHate: Evaluating Hate Speech Detection on Nigerian Twitter Using Representative Data | Manuel Tonneau, Pedro Vitor Quinta de Castro, Karim Lasri, Ibrahim Sambo Farouq, Lakshmi Subramanian, Victor Orozco-Olvera, Samuel Fraiberger | N/A | N/A |
| M³AV: A Multimodal, Multigenre, and Multipurpose Audio-Visual Academic Lecture Dataset | Zhe Chen, Heyang Liu, Wenyi Yu, Guangzhi Sun, Hongcheng Liu, Ji Wu, Chao Zhang, Yu Wang, Yanfeng Wang | N/A | N/A |
| Mitigating Biases for Instruction-following Language Models via Bias Neurons Elimination | Nakyeong Yang, Taegwan Kang, Stanley Jungkyu Choi, Honglak Lee, Kyomin Jung | N/A | N/A |
| Domain Adaptation for Subjective Induction Questions Answering on Products by Adversarial Disentangled Learning | Yufeng Zhang, Jianxing Yu, Yanghui Rao, Libin Zheng, Qinliang Su, Huaijie Zhu, Jian Yin | N/A | N/A |
| Revisiting Demonstration Selection Strategies in In-Context Learning | Keqin Peng, Liang Ding, Yancheng Yuan, Xuebo Liu, Min Zhang, Yuanxin Ouyang, Dacheng Tao | N/A | N/A |
| Multimodal Table Understanding | Mingyu Zheng, Xinwei Feng, Qingyi Si, Qiaoqiao She, Zheng Lin, Wenbin Jiang, Weiping Wang | N/A | N/A |
| Ex³: Automatic Novel Writing by Extracting, Excelsior and Expanding | Huang Lei, Jiaming Guo, Guanhua He, Xishan Zhang, Rui Zhang, Shaohui Peng, Shaoli Liu, Tianshi Chen | N/A | N/A |
| Few-shot Transfer Learning for Knowledge Base Question Answering: Fusing Supervised Models with In-Context Learning | Mayur Patidar, Riya Sawhney, Avinash Kumar Singh, Biswajit Chatterjee, Mausam, Indrajit Bhattacharya | N/A | N/A |
| WatME: Towards Lossless Watermarking Through Lexical Redundancy | Liang Chen, Yatao Bian, Yang Deng, Deng Cai, Shuaiyi Li, Peilin Zhao, Kam-Fai Wong | N/A | N/A |
| Text-like Encoding of Collaborative Information in Large Language Models for Recommendation | Yang Zhang, Keqin Bao, Ming Yan, Wenjie Wang, Fuli Feng, Xiangnan He | N/A | N/A |
| MM-SAP: A Comprehensive Benchmark for Assessing Self-Awareness of Multimodal Large Language Models in Perception | Yuhao Wang, Yusheng Liao, Heyang Liu, Hongcheng Liu, Yanfeng Wang, Yu Wang | N/A | N/A |
| Focus on Your Question! Interpreting and Mitigating Toxic CoT Problems in Commonsense Reasoning | Jiachun Li, Pengfei Cao, Chenhao Wang, Zhuoran Jin, Yubo Chen, Daojian Zeng, Kang Liu, Jun Zhao | N/A | N/A |
| Multi-Aspect Controllable Text Generation with Disentangled Counterfactual Augmentation | Yi Liu, Xiangyu Liu, Xiangrong Zhu, Wei Hu | N/A | N/A |
| M4LE: A Multi-Ability Multi-Range Multi-Task Multi-Domain Long-Context Evaluation Benchmark for Large Language Models | Wai-Chung Kwan, Xingshan Zeng, Yufei Wang, Yusen Sun, Liangyou Li, Yuxin Jiang, Lifeng Shang, Qun Liu, Kam-Fai Wong | N/A | N/A |
| Reward-based Input Construction for Cross-document Relation Extraction | Byeonghu Na, Suhyeon Jo, Yeongmin Kim, Il-chul Moon | N/A | N/A |
| Hyperspherical Multi-Prototype with Optimal Transport for Event Argument Extraction | Guangjun Zhang, Hu Zhang, YuJie Wang, Ru Li, Hongye Tan, Jiye Liang | N/A | N/A |
| Understanding Retrieval Robustness for Retrieval-augmented Image Captioning | Wenyan Li, Jiaang Li, Rita Ramos, Raphael Tang, Desmond Elliott | N/A | N/A |
| Semi-Supervised Spoken Language Glossification | Huijie Yao, Wengang Zhou, Hao Zhou, Houqiang Li | N/A | N/A |
| SeeClick: Harnessing GUI Grounding for Advanced Visual GUI Agents | Kanzhi Cheng, Qiushi Sun, Yougang Chu, Fangzhi Xu, Li YanTao, Jianbing Zhang, Zhiyong Wu | N/A | N/A |
| InterrogateLLM: Zero-Resource Hallucination Detection in LLM-Generated Answers | Yakir Yehuda, Itzik Malkiel, Oren Barkan, Jonathan Weill, Royi Ronen, Noam Koenigstein | N/A | N/A |
| F-Eval: Assessing Fundamental Abilities with Refined Evaluation Methods | Yu Sun, keyuchen, Shujie Wang, Peiji Li, Qipeng Guo, Hang Yan, Xipeng Qiu, Xuanjing Huang, Dahua Lin | N/A | N/A |
| Comparing Inferential Strategies of Humans and Large Language Models in Deductive Reasoning | Philipp Mondorf, Barbara Plank | N/A | N/A |
| Whose Preferences? Differences in Fairness Preferences and Their Impact on the Fairness of AI Utilizing Human Feedback | Maria Emilia Agis Lerner, Florian E. Dorner, Elliott Ash, Naman Goel | N/A | N/A |
| Math-Shepherd: Verify and Reinforce LLMs Step-by-step without Human Annotations | Peiyi Wang, Lei Li, Zhihong Shao, Runxin Xu, Damai Dai, Yifei Li, Deli Chen, Yu Wu, Zhifang Sui | N/A | N/A |
| Large Language Models are not Fair Evaluators | Peiyi Wang, Lei Li, Liang Chen, Zefan Cai, Dawei Zhu, Binghuai Lin, Yunbo Cao, Lingpeng Kong, Qi Liu, Tianyu Liu, Zhifang Sui | N/A | N/A |
| Improving Large Language Models in Event Relation Logical Prediction | Meiqi Chen, Yubo Ma, Kaitao Song, Yixin Cao, Yan Zhang, Dongsheng Li | N/A | N/A |
| Synchronized Video Storytelling: Generating Video Narrations with Structured Storyline | Dingyi Yang, Chunru Zhan, Ziheng Wang, Biao Wang, Tiezheng Ge, Bo Zheng, Qin Jin | N/A | N/A |
| Fine-Grained Image-Text Alignment in Medical Imaging Enables Explainable Cyclic Image-Report Generation | Wenting Chen, Linlin Shen, Jingyang Lin, Jiebo Luo, Xiang Li, Yixuan Yuan | N/A | N/A |
| T-Eval: Evaluating the Tool Utilization Capability of Large Language Models Step by Step | Zehui Chen, Weihua Du, Wenwei Zhang, Kuikun Liu, Jiangning Liu, Miao Zheng, Jingming Zhuo, Songyang Zhang, Dahua Lin, Kai Chen, Feng Zhao | N/A | N/A |
| Are LLM-based Evaluators Confusing NLG Quality Criteria? | Xinyu Hu, Mingqi Gao, Sen Hu, Yang Zhang, Yicheng Chen, Teng Xu, Xiaojun Wan | N/A | N/A |
| Synergistic Interplay between Search and Large Language Models for Information Retrieval | Jiazhan Feng, Chongyang Tao, Xiubo Geng, Tao Shen, Can Xu, Guodong Long, Dongyan Zhao, Daxin Jiang | N/A | N/A |
| Linear Transformers with Learnable Kernel Functions are Better In-Context Models | Yaroslav Aksenov, Nikita Balagansky, Sofia Maria Lo Cicero Vaina, Boris Shaposhnikov, Alexey Gorbatovski, Daniil Gavrilov | N/A | N/A |
| Temperature-scaling surprisal estimates improve fit to human reading times – but does it do so for the “right reasons”? | Tong Liu, Iza Škrjanec, Vera Demberg | N/A | N/A |
| Beyond Recognising Entailment: Formalising Natural Language Inference from an Argumentative Perspective | Ameer Saadat-Yazdi, Nadin Kökciyan | N/A | N/A |
| RomanSetu: Efficiently unlocking multilingual capabilities of Large Language Models via Romanization | Jaavid Aktar Husain J, Raj Dabre, Aswanth Kumar M, Jay Gala, Thanmay Jayakumar, Ratish Puduppully, Anoop Kunchukuttan | N/A | N/A |
| AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling | Jun Zhan, Junqi Dai, Jiasheng Ye, Yunhua Zhou, Dong Zhang, Zhigeng Liu, Xin Zhang, Ruibin Yuan, Ge Zhang, Linyang Li, Hang Yan, Jie Fu, Tao Gui, Tianxiang Sun, Yu-Gang Jiang, Xipeng Qiu | N/A | N/A |
| CofiPara: A Coarse-to-fine Paradigm for Multimodal Sarcasm Target Identification with Large Multimodal Models | Zixin Chen, Hongzhan Lin, Ziyang Luo, Mingfei Cheng, Jing Ma, Guang Chen | N/A | N/A |
| Direct Large Language Model Alignment Through Self-Rewarding Contrastive Prompt Distillation | Aiwei Liu, Haoping Bai, Zhiyun Lu, Xiang Kong, Xiaoming Simon Wang, Jiulong Shan, Meng Cao, Lijie Wen | N/A | N/A |
| Diffusion Lens: Interpreting Text Encoders in Text-to-Image Pipelines | Michael Toker, Hadas Orgad, Mor Ventura, Dana Arad, Yonatan Belinkov | N/A | N/A |
| Parrot: Enhancing Multi-Turn Instruction Following for Large Language Models | Yuchong Sun, Che Liu, Kun Zhou, Jinwen Huang, Ruihua Song, Xin Zhao, Fuzheng Zhang, Di Zhang, Kun Gai | N/A | N/A |
| Robust Singing Voice Transcription Serves Synthesis | Ruiqi Li, Yu Zhang, Yongqi Wang, Zhiqing Hong, Rongjie Huang, Zhou Zhao | N/A | N/A |
| VulLibGen: Generating Names of Vulnerability-Affected Packages via a Large Language Model | Tianyu Chen, Lin Li, ZhuLiuchuan, Zongyang Li, Xueqing Liu, Guangtai Liang, Qianxiang Wang, Tao Xie | N/A | N/A |
| Self-Modifying State Modeling for Simultaneous Machine Translation | Donglei Yu, Xiaomian Kang, Yuchen Liu, Yu Zhou, Chengqing Zong | N/A | N/A |
| MapGPT: Map-Guided Prompting with Adaptive Path Planning for Vision-and-Language Navigation | Jiaqi Chen, Bingqian Lin, Ran Xu, Zhenhua Chai, Xiaodan Liang, Kwan-Yee K. Wong | N/A | N/A |
| BadAgent: Inserting and Activating Backdoor Attacks in LLM Agents | Yifei Wang, Dizhan Xue, Shengjie Zhang, Shengsheng Qian | N/A | N/A |
| DetermLR: Augmenting LLM-based Logical Reasoning from Indeterminacy to Determinacy | Hongda Sun, Weikai Xu, Wei Liu, Jian Luan, Bin Wang, Shuo Shang, Ji-Rong Wen, Rui Yan | N/A | N/A |
| LePaRD: A Large-Scale Dataset of Judicial Citations to Precedent | Robert Mahari, Dominik Stammbach, Elliott Ash, Alex Pentland | N/A | N/A |
| To Generate or to Retrieve? On the Effectiveness of Artificial Contexts for Medical Open-Domain Question Answering | Giacomo Frisoni, Alessio Cocchieri, Alex Presepi, Gianluca Moro, Zaiqiao Meng | N/A | N/A |
| MERA: A Comprehensive LLM Evaluation in Russian | Alena Fenogenova, Artem Chervyakov, Nikita Martynov, Anastasia Kozlova, Maria Tikhonova, Albina Akhmetgareeva, Anton Emelyanov, Denis Shevelev, Pavel Lebedev, Leonid S Sinev, Ulyana Isaeva, Katerina Kolomeytseva, Daniil Moskovskiy, Elizaveta Goncharova, Nikita Savushkin, Polina Mikhailova, Anastasia Minaeva, Denis Dimitrov, Alexander Panchenko, Sergey Markov | N/A | N/A |
| SC2: Towards Enhancing Content Preservation and Style Consistency in Long Text Style Transfer | Jie Zhao, Ziyu Guan, Cai Xu, Wei Zhao, Yue Jiang | N/A | N/A |
| Causal Estimation of Memorisation Profiles | Pietro Lesci, Clara Meister, Thomas Hofmann, Andreas Vlachos, Tiago Pimentel | N/A | N/A |
| CHECKWHY: Causal Fact Verification via Argument Structure | Jiasheng Si, Yibo Zhao, Yingjie Zhu, Haiyang Zhu, Wenpeng Lu, Deyu Zhou | N/A | N/A |
| Dodo: Dynamic Contextual Compression for Decoder-only LMs | Guanghui Qin, Corby Rosset, Ethan C. Chau, Nikhil Rao, Benjamin Van Durme | N/A | N/A |
| POMP: Probability-driven Meta-graph Prompter for LLMs in Low-resource Unsupervised Neural Machine Translation | Shilong Pan, Zhiliang Tian, Liang Ding, Haoqi Zheng, Zhen Huang, Zhihua Wen, Dongsheng Li | N/A | N/A |
| NewsBench: A Systematic Evaluation Framework for Assessing Editorial Capabilities of Large Language Models in Chinese Journalism | Miao Li, Ming-Bin Chen, Bo Tang, Shengbin Hou, Pengyu Wang, Haiying Deng, Zhiyu Li, Feiyu Xiong, Keming Mao, Cheng Peng, Yi Luo | N/A | N/A |
| MAPO: Advancing Multilingual Reasoning through Multilingual-Alignment-as-Preference Optimization | Shuaijie She, Wei Zou, Shujian Huang, Wenhao Zhu, Xiang Liu, Xiang Geng, Jiajun Chen | N/A | N/A |
| Enhancing Noise Robustness of Retrieval-Augmented Language Models with Adaptive Adversarial Training | Feiteng Fang, Yuelin Bai, Shiwen Ni, Min Yang, Xiaojun Chen, Ruifeng Xu | N/A | N/A |
| Predicting Text Preference Via Structured Comparative Reasoning | Jing Nathan Yan, Tianqi Liu, Justin T Chiu, Jiaming Shen, Zhen Qin, Yue Yu, Charumathi Lakshmanan, Yair Kurzion, Alexander M Rush, Jialu Liu, Michael Bendersky | N/A | N/A |
| CoELM: Construction-Enhanced Language Modeling | Lvxiaowei Xu, Zhilin Gong, Jianhua Dai, Tianxiang Wang, Ming Cai, Jiawei Peng | N/A | N/A |
| Quality-Aware Translation Models: Efficient Generation and Quality Estimation in a Single Model | Christian Tomani, David Vilar, Markus Freitag, Colin Cherry, Subhajit Naskar, Mara Finkelstein, Xavier Garcia, Daniel Cremers | N/A | N/A |
| Uni-Dubbing: Zero-Shot Speech Synthesis from Visual Articulation | Songju Lei, Xize Cheng, Mengjiao Lyu, Jianqiao Hu, Jintao Tan, Runlin Liu, Lingyu Xiong, Tao Jin, Xiandong Li, Zhou Zhao | N/A | N/A |
| On the Impact of Calibration Data in Post-training Quantization and Pruning | Miles Williams, Nikolaos Aletras | N/A | N/A |
| SymKGQA: Few-Shot Knowledge Graph Question Answering via Symbolic Program Generation and Execution | Prerna Agarwal, Nishant Kumar, Srikanta J. Bedathur | N/A | N/A |
| Meta-Task Prompting Elicits Embeddings from Large Language Models | Yibin Lei, Di Wu, Tianyi Zhou, Tao Shen, Yu Cao, Chongyang Tao, Andrew Yates | N/A | N/A |
| A Sentiment Consolidation Framework for Meta-Review Generation | Miao Li, Jey Han Lau, Eduard Hovy | N/A | N/A |
| Revisiting Structured Sentiment Analysis as Latent Dependency Graph Parsing | Chengjie Zhou, Bobo Li, Hao Fei, Fei Li, Chong Teng, Donghong Ji | N/A | N/A |
| OWSM-CTC: An Open Encoder-Only Speech Foundation Model for Speech Recognition, Translation, and Language Identification | Yifan Peng, Yui Sudo, Muhammad Shakeel, Shinji Watanabe | N/A | N/A |
| Do Large Language Models Latently Perform Multi-Hop Reasoning? | Sohee Yang, Elena Gribovskaya, Nora Kassner, Mor Geva, Sebastian Riedel | N/A | N/A |
| MuggleMath: Assessing the Impact of Query and Response Augmentation on Math Reasoning | Chengpeng Li, Zheng Yuan, Hongyi Yuan, Guanting Dong, Keming Lu, Jiancan Wu, Chuanqi Tan, Xiang Wang, Chang Zhou | N/A | N/A |
| Harnessing Toulmin’s theory for zero-shot argument explication | Ankita Gupta, Ethan Zuckerman, Brendan O’Connor | N/A | N/A |
| BinaryAlign: Word Alignment as Binary Sequence Labeling | Gaetan Lopez Latouche, Marc-André Carbonneau, Benjamin Swanson | N/A | N/A |
| Quantifying the Persona Effect in LLM Simulations | Tiancheng Hu, Nigel Collier | N/A | N/A |
| On Efficient and Statistical Quality Estimation for Data Annotation | Jan-Christoph Klie, Juan Haladjian, Marc Kirchner, Rahul Nair | N/A | N/A |
| EZ-STANCE: A Large Dataset for English Zero-Shot Stance Detection | Chenye Zhao, Cornelia Caragea | N/A | N/A |
| Artifacts or Abduction: How Do LLMs Answer Multiple-Choice Questions Without the Question? | Nishant Balepur, Abhilasha Ravichander, Rachel Rudinger | N/A | N/A |
| Retrieval Augmented Fact Verification by Synthesizing Contrastive Arguments | Zhenrui Yue, Huimin Zeng, Lanyu Shang, Yifan Liu, Yang Zhang, Dong Wang | N/A | N/A |
| SyllabusQA: A Course Logistics Question Answering Dataset | Nigel Fernandez, Alexander Scarlatos, Andrew Lan | N/A | N/A |
| American Sign Language Handshapes Reflect Pressures for Communicative Efficiency | Kayo Yin, Terry Regier, Dan Klein | N/A | N/A |
| MindMap: Knowledge Graph Prompting Sparks Graph of Thoughts in Large Language Models | Yilin Wen, Zifeng Wang, Jimeng Sun | N/A | N/A |
| AGB-DE: A Corpus for the Automated Legal Assessment of Clauses in German Consumer Contracts | Daniel Braun, Florian Matthes | N/A | N/A |
| Examining the robustness of LLM evaluation to the distributional assumptions of benchmarks | Charlotte Siska, Katerina Marazopoulou, Melissa Ailem, James Bono | N/A | N/A |
| Re-Tuning: Overcoming the Compositionality Limits of Large Language Models with Recursive Tuning | Eric Pasewark, Kyle Montgomery, Kefei Duan, Dawn Song, Chenguang Wang | N/A | N/A |
| Bridging the Preference Gap between Retrievers and LLMs | Zixuan Ke, Weize Kong, Cheng Li, Mingyang Zhang, Qiaozhu Mei, Michael Bendersky | N/A | N/A |
| Large Language Models Can Learn Temporal Reasoning | Siheng Xiong, Ali Payani, Ramana Rao Kompella, Faramarz Fekri | N/A | N/A |
| Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research | Luca Soldaini, Rodney Kinney, Akshita Bhagia, Dustin Schwenk, David Atkinson, Russell Authur, Ben Bogin, Khyathi Chandu, Jennifer Dumas, Yanai Elazar, Valentin Hofmann, Ananya Harsh Jha, Sachin Kumar, Li Lucy, Xinxi Lyu, Nathan Lambert, Ian Magnusson, Jacob Morrison, Niklas Muennighoff, Aakanksha Naik, Crystal Nam, Matthew E Peters, Abhilasha Ravichander, Kyle Richardson, Zejiang Shen, Emma Strubell, Nishant Subramani, Oyvind Tafjord, Evan Pete Walsh, Luke Zettlemoyer, Noah A. Smith, Hannaneh Hajishirzi, Iz Beltagy, Dirk Groeneveld, Jesse Dodge, Kyle Lo | N/A | N/A |
| Learning Relational Decomposition of Queries for Question Answering from Tables | Raphaël Mouravieff, Benjamin Piwowarski, Sylvain Lamprier | N/A | N/A |
| Characterizing Similarities and Divergences in Conversational Tones in Humans and LLMs by Sampling with People | Dun-Ming Huang, Pol Van Rijn, Ilia Sucholutsky, Raja Marjieh, Nori Jacoby | N/A | N/A |
| Pareto Optimal Learning for Estimating Large Language Model Errors | Theodore Zhao, Mu Wei, J. Samuel Preston, Hoifung Poon | N/A | N/A |
| Simul-LLM: A Framework for Exploring High-Quality Simultaneous Translation with Large Language Models | Victor Agostinelli III, Max Wild, Matthew Raffel, Kazi Ahmed Asif Fuad, Lizhong Chen | N/A | N/A |
| Defending Against Alignment-Breaking Attacks via Robustly Aligned LLM | Bochuan Cao, Yuanpu Cao, Lu Lin, Jinghui Chen | N/A | N/A |
| Interactive-KBQA: Multi-Turn Interactions for Knowledge Base Question Answering with Large Language Models | Guanming Xiong, Junwei Bao, Wen Zhao | N/A | N/A |
| LLMs in the Imaginarium: Tool Learning through Simulated Trial and Error | Boshi Wang, Hao Fang, Jason Eisner, Benjamin Van Durme, Yu Su | N/A | N/A |
| HyperMoE: Towards Better Mixture of Experts via Transferring Among Experts | Hao Zhao, Zihan Qiu, Huijia Wu, Zili Wang, Zhaofeng He, Jie Fu | N/A | N/A |
| Aligning Large Language Models with Human Preferences through Representation Engineering | Wenhao Liu, Xiaohua Wang, Muling Wu, Tianlong Li, Changze Lv, Zixuan Ling, Zhu JianHao, Cenyuan Zhang, Xiaoqing Zheng, Xuanjing Huang | N/A | N/A |
| CODIS: Benchmarking Context-dependent Visual Comprehension for Multimodal Large Language Models | Fuwen Luo, Chi Chen, Zihao Wan, Zhaolu Kang, Qidong Yan, Yingjie Li, Xiaolong Wang, Siyu Wang, Ziyue Wang, Xiaoyue Mi, Peng Li, Ning Ma, Maosong Sun, Yang Liu | N/A | N/A |
| ARAIDA: Analogical Reasoning-Augmented Interactive Data Annotation | Chen Huang, Yiping Jin, Ilija Ilievski, Wenqiang Lei, Jiancheng Lv | N/A | N/A |
| PolCLIP: A Unified Image-Text Word Sense Disambiguation Model via Generating Multimodal Complementary Representations | Qihao Yang, Yong Li, Xuelin Wang, Fu Lee Wang, Tianyong Hao | N/A | N/A |
| Prompted Aspect Key Point Analysis for Quantitative Review Summarization | An Quang Tang, Xiuzhen Zhang, Minh Ngoc Dinh, Erik Cambria | N/A | N/A |
| Ask Again, Then Fail: Large Language Models’ Vacillations in Judgment | Qiming Xie, Zengzhi Wang, Yi Feng, Rui Xia | N/A | N/A |
| CLAMBER: A Benchmark of Identifying and Clarifying Ambiguous Information Needs in Large Language Models | Tong Zhang, Peixin Qin, Yang Deng, Chen Huang, Wenqiang Lei, Junhong Liu, Dingnan Jin, Hongru Liang, Tat-Seng Chua | N/A | N/A |
| Multimodal Reasoning with Multimodal Knowledge Graph | Junlin Lee, Yequan Wang, Jing Li, Min Zhang | N/A | N/A |
| Confidence is not Timeless: Modeling Temporal Validity for Rule-based Temporal Knowledge Graph Forecasting | Rikui Huang, Wei Wei, Xiaoye Qu, Shengzhe Zhang, Dangyang Chen, Yu Cheng | N/A | N/A |
| CARE: A Clue-guided Assistant for CSRs to Read User Manuals | Weihong Du, Jia Liu, Zujie Wen, Dingnan Jin, Hongru Liang, Wenqiang Lei | N/A | N/A |
| Enhancing Numerical Reasoning with the Guidance of Reliable Reasoning Processes | Dingzirui Wang, Longxu Dou, Xuanliang Zhang, Qingfu Zhu, Wanxiang Che | N/A | N/A |
| PAGED: A Benchmark for Procedural Graphs Extraction from Documents | Weihong Du, Wenrui Liao, Hongru Liang, Wenqiang Lei | N/A | N/A |
| Navigating the Shadows: Unveiling Effective Disturbances for Modern AI Content Detectors | Ying Zhou, Ben He, Le Sun | N/A | N/A |
| RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models | Cheng Niu, Yuanhao Wu, Juno Zhu, Siliang Xu, KaShun Shum, Randy Zhong, Juntong Song, Tong Zhang | N/A | N/A |
| The Dawn After the Dark: An Empirical Study on Factuality Hallucination in Large Language Models | Junyi Li, Jie Chen, Ruiyang Ren, Xiaoxue Cheng, Xin Zhao, Jian-Yun Nie, Ji-Rong Wen | N/A | N/A |
| Revisiting Knowledge Distillation for Autoregressive Language Models | Qihuang Zhong, Liang Ding, Li Shen, Juhua Liu, Bo Du, Dacheng Tao | N/A | N/A |
| OLMo: Accelerating the Science of Language Models | Dirk Groeneveld, Iz Beltagy, Evan Pete Walsh, Akshita Bhagia, Rodney Kinney, Oyvind Tafjord, Ananya Harsh Jha, Hamish Ivison, Ian Magnusson, Yizhong Wang, Shane Arora, David Atkinson, Russell Authur, Khyathi Chandu, Arman Cohan, Jennifer Dumas, Yanai Elazar, Yuling Gu, Jack Hessel, Tushar Khot, William Merrill, Jacob Morrison, Niklas Muennighoff, Aakanksha Naik, Crystal Nam, Matthew E Peters, Valentina Pyatkin, Abhilasha Ravichander, Dustin Schwenk, Saurabh Shah, William H. Smith, Emma Strubell, Nishant Subramani, Mitchell Wortsman, Pradeep Dasigi, Nathan Lambert, Kyle Richardson, Luke Zettlemoyer, Jesse Dodge, Kyle Lo, Luca Soldaini, Noah A. Smith, Hannaneh Hajishirzi | N/A | N/A |
| Continual Learning with Semi-supervised Contrastive Distillation for Incremental Neural Machine Translation | Yunlong Liang, Fandong Meng, Jiaan Wang, Jinan Xu, Yufeng Chen, Jie Zhou | N/A | N/A |
| Make-A-Voice: Revisiting Voice Large Language Models as Scalable Multilingual and Multitask Learners | Rongjie Huang, Chunlei Zhang, Yongqi Wang, Dongchao Yang, Jinchuan Tian, Zhenhui Ye, Luping Liu, Zehan Wang, Ziyue Jiang, Xuankai Chang, Jiatong Shi, Chao Weng, Zhou Zhao, Dong Yu | N/A | N/A |
| Chat Vector: A Simple Approach to Equip LLMs with Instruction Following and Model Alignment in New Languages | Shih-Cheng Huang, Pin-Zu Li, Yu-Chi Hsu, Kuang-Ming Chen, Yu Tung Lin, Shih-Kai Hsiao, Richard Tzong-Han Tsai, Hung-yi Lee | N/A | N/A |
| Emulated Disalignment: Safety Alignment for Large Language Models May Backfire! | Zhanhui Zhou, Jie Liu, Zhichen Dong, Jiaheng Liu, Chao Yang, Wanli Ouyang, Yu Qiao | N/A | N/A |
| PRP: Propagating Universal Perturbations to Attack Large Language Model Guard-Rails | Neal Mangaokar, Ashish Hooda, Jihye Choi, Shreyas Chandrashekaran, Kassem Fawaz, Somesh Jha, Atul Prakash | N/A | N/A |
| Hide and Seek in Noise Labels: Noise-Robust Collaborative Active Learning with LLMs-Powered Assistance | Bo Yuan, Yulin Chen, Yin Zhang, Wei Jiang | N/A | N/A |
| CLOMO: Counterfactual Logical Modification with Large Language Models | Yinya Huang, Ruixin Hong, Hongming Zhang, Wei Shao, Zhicheng Yang, Dong Yu, Changshui Zhang, Xiaodan Liang, Linqi Song | N/A | N/A |
| Exploring Hybrid Question Answering via Program-based Prompting | Qi Shi, Han Cui, Haofeng Wang, Qingfu Zhu, Wanxiang Che, Ting Liu | N/A | N/A |
| IndicGenBench: A Multilingual Benchmark to Evaluate Generation Capabilities of LLMs on Indic Languages | Harman Singh, Nitish Gupta, Shikhar Bharadwaj, Dinesh Tewari, Partha Talukdar | N/A | N/A |
| Simple but Effective Compound Geometric Operations for Temporal Knowledge Graph Completion | Rui Ying, Mengting Hu, Jianfeng Wu, Yalan Xie, Xiaoyi Liu, Zhunheng Wang, Ming Jiang, Hang Gao, Linlin Zhang, Renhong Cheng | N/A | N/A |
| Uncertainty Aware Learning for Language Model Alignment | Yikun Wang, Rui Zheng, Liang Ding, Qi Zhang, Dahua Lin, Dacheng Tao | N/A | N/A |
| Interpretable User Satisfaction Estimation for Conversational Systems with Large Language Models | Ying-Chun Lin, Jennifer Neville, Jack W Stokes, Longqi Yang, Tara Safavi, Mengting Wan, Scott Counts, Siddharth Suri, Reid Andersen, Xiaofeng Xu, Deepak Gupta, Sujay Kumar Jauhar, Xia Song, Georg Buscher, Saurabh Tiwary, Brent Hecht, Jaime Teevan | N/A | N/A |
| Fundamental Capabilities of Large Language Models and their Applications in Domain Scenarios: A Survey | Jiawei Li, Yizhe Yang, Yu Bai, Xiaofeng Zhou, Yinghao Li, Huashan Sun, Yuhang Liu, Xingpeng Si, Yuhao Ye, Yixiao Wu, 林一冠, Bin Xu, Ren Bowen, Chong Feng, Yang Gao, Heyan Huang | N/A | N/A |
| IndicLLMSuite: A Blueprint for Creating Pre-training and Fine-Tuning Datasets for Indian Languages | Mohammed Safi Ur Rahman Khan, Priyam Mehta, Ananth Sankar, Umashankar Kumaravelan, Sumanth Doddapaneni, Suriyaprasaad B, Varun Balan G, Sparsh Jain, Anoop Kunchukuttan, Pratyush Kumar, Raj Dabre, Mitesh M Khapra | N/A | N/A |
| Measuring Political Bias in Large Language Models: What Is Said and How It Is Said | Yejin Bang, Delong Chen, Nayeon Lee, Pascale Fung | N/A | N/A |
| Fortify the Shortest Stave in Attention: Enhancing Context Awareness of Large Language Models for Effective Tool Use | Yuhan Chen, Ang Lv, Ting-En Lin, Changyu Chen, Yuchuan Wu, Fei Huang, Yongbin Li, Rui Yan | N/A | N/A |
| Layer-Condensed KV Cache for Efficient Inference of Large Language Models | Haoyi Wu, Kewei Tu | N/A | N/A |
| Reasoning in Conversation: Solving Subjective Tasks through Dialogue Simulation for Large Language Models | Xiaolong Wang, Yile Wang, Yuanchi Zhang, Fuwen Luo, Peng Li, Maosong Sun, Yang Liu | N/A | N/A |
| Enhancing Multilingual Capabilities of Large Language Models through Self-Distillation from Resource-Rich Languages | Yuanchi Zhang, Yile Wang, Zijun Liu, Shuo Wang, Xiaolong Wang, Peng Li, Maosong Sun, Yang Liu | N/A | N/A |
| Benchmarking Chinese Commonsense Reasoning of LLMs: From Chinese-Specifics to Reasoning-Memorization Correlations | Jiaxing Sun, Weiquan Huang, Jiang Wu, Chenya Gu, Wei Li, Songyang Zhang, Hang Yan, Conghui He | N/A | N/A |
| Browse and Concentrate: Comprehending Multimodal Content via Prior-LLM Context Fusion | Ziyue Wang, Chi Chen, Yiqi Zhu, Fuwen Luo, Peng Li, Ming Yan, Ji Zhang, Fei Huang, Maosong Sun, Yang Liu | N/A | N/A |
| Model Composition for Multimodal Large Language Models | Chi Chen, Yiyang Du, Zheng Fang, Ziyue Wang, Fuwen Luo, Peng Li, Ming Yan, Ji Zhang, Fei Huang, Maosong Sun, Yang Liu | N/A | N/A |
| Draft & Verify: Lossless Large Language Model Acceleration via Self-Speculative Decoding | Jun Zhang, Jue Wang, Huan Li, Lidan Shou, Ke Chen, Gang Chen, Sharad Mehrotra | N/A | N/A |
| Soul-Mix: Enhancing Multimodal Machine Translation with Manifold Mixup | Xuxin Cheng, Ziyu Yao, Yifei Xin, Hao An, Hongxiang Li, Yaowei Li, Yuexian Zou | N/A | N/A |
| Measuring Meaning Composition in the Human Brain with Composition Scores from Large Language Models | Changjiang Gao, Jixing Li, Jiajun Chen, Shujian Huang | N/A | N/A |
| MIST: Mutual Information Maximization for Short Text Clustering | Krissanee Kamthawee, Can Udomcharoenchaikit, Sarana Nutanong | N/A | N/A |
| Self-chats from Large Language Models Make Small Emotional Support Chatbot Better | Zhonghua Zheng, Lizi Liao, Yang Deng, Libo Qin, Liqiang Nie | N/A | N/A |
| Improving Conversational Abilities of Quantized Large Language Models via Direct Preference Alignment | Janghwan Lee, Seongmin Park, Sukjin Hong, Minsoo Kim, Du-Seong Chang, Jungwook Choi | N/A | N/A |
| Complex Reasoning over Logical Queries on Commonsense Knowledge Graphs | Tianqing Fang, Zeming Chen, Yangqiu Song, Antoine Bosselut | N/A | N/A |
| An Expert is Worth One Token: Synergizing Multiple Expert LLMs as Generalist via Expert Token Routing | Ziwei Chai, Guoyin Wang, Jing Su, Tianjie Zhang, Xuanwen Huang, Xuwu Wang, Jingjing Xu, Jianbo Yuan, Hongxia Yang, Fei Wu, Yang Yang | N/A | N/A |
| Learning to Plan and Generate Text with Citations | Constanza Fierro, Reinald Kim Amplayo, Fantine Huot, Nicola De Cao, Joshua Maynez, Shashi Narayan, Mirella Lapata | N/A | N/A |
| Exploring Precision and Recall to assess the quality and diversity of LLMs | Florian Le Bronnec, Alexandre Verine, Benjamin Negrevergne, Yann Chevaleyre, Alexandre Allauzen | N/A | N/A |
| Aligning Large Language Models by On-Policy Self-Judgment | Sangkyu Lee, Sungdong Kim, Ashkan Yousefpour, Minjoon Seo, Kang Min Yoo, Youngjae Yu | N/A | N/A |
| IL-TUR: Benchmark for Indian Legal Text Understanding and Reasoning | Abhinav Joshi, Shounak Paul, Akshat Sharma, Pawan Goyal, Saptarshi Ghosh, Ashutosh Modi | N/A | N/A |
| JumpCoder: Go Beyond Autoregressive Coder via Online Modification | Mouxiang Chen, Hao Tian, Zhongxin Liu, Xiaoxue Ren, Jianling Sun | N/A | N/A |
| Aya Dataset: An Open-Access Collection for Multilingual Instruction Tuning | Shivalika Singh, Freddie Vargus, Daniel D’souza, Börje F. Karlsson, Abinaya Mahendiran, Wei-Yin Ko, Herumb Shandilya, Jay Patel, Deividas Mataciunas, Laura O’Mahony, Mike Zhang, Ramith Hettiarachchi, Joseph Wilson, Marina Machado, Luisa Souza Moura, Dominik Krzemiński, Hakimeh Fadaei, Irem Ergun, Ifeoma Okoh, Aisha Alaagib, Oshan Ivantha Mudannayake, Zaid Alyafeai, Vu Minh Chien, Sebastian Ruder, Surya Guthikonda, Emad A. Alghamdi, Sebastian Gehrmann, Niklas Muennighoff, Max Bartolo, Julia Kreutzer, Ahmet Üstün, Marzieh Fadaee, Sara Hooker | N/A | N/A |
| Language Models can Exploit Cross-Task In-context Learning for Data-Scarce Novel Tasks | Anwoy Chatterjee, Eshaan Tanwar, Subhabrata Dutta, Tanmoy Chakraborty | N/A | N/A |
| Split and Rephrase with Large Language Models | Antonio David Ponce Martínez, Thierry Etchegoyhen, Jesus Javier Calleja Perez, Harritxu Gete | N/A | N/A |
| ChunkAttention: Efficient Self-Attention with Prefix-Aware KV Cache and Two-Phase Partition | Lu Ye, Ze Tao, Yong Huang, Yang Li | N/A | N/A |
| AlignBench: Benchmarking Chinese Alignment of Large Language Models | Xiao Liu, Xuanyu Lei, Shengyuan Wang, Yue Huang, Andrew Zhuoer Feng, Bosi Wen, Jiale Cheng, Pei Ke, Yifan Xu, Weng Lam Tam, Xiaohan Zhang, Lichao Sun, Xiaotao Gu, Hongning Wang, Jing Zhang, Minlie Huang, Yuxiao Dong, Jie Tang | N/A | N/A |
| SAPT: A Shared Attention Framework for Parameter-Efficient Continual Learning of Large Language Models | Weixiang Zhao, Shilong Wang, Yulin Hu, Yanyan Zhao, Bing Qin, Xuanyu Zhang, Qing Yang, Dongliang Xu, Wanxiang Che | N/A | N/A |
| DoRA: Enhancing Parameter-Efficient Fine-Tuning with Dynamic Rank Distribution | Yulong Mao, Kaiyu Huang, Changhao Guan, Ganglin Bao, Fengran Mo, Jinan Xu | N/A | N/A |
| Cross-Lingual Knowledge Editing in Large Language Models | Jiaan Wang, Yunlong Liang, Zengkui Sun, Yuxuan Cao, Jiarong Xu, Fandong Meng | N/A | N/A |
| Aya Model: An Instruction Finetuned Open-Access Multilingual Language Model | Ahmet Üstün, Viraat Aryabumi, Zheng Xin Yong, Wei-Yin Ko, Daniel D’souza, Gbemileke Onilude, Neel Bhandari, Shivalika Singh, Hui-Lee Ooi, Amr Kayid, Freddie Vargus, Phil Blunsom, Shayne Longpre, Niklas Muennighoff, Marzieh Fadaee, Julia Kreutzer, Sara Hooker | N/A | N/A |
| Argument Mining in Data Scarce Settings: Cross-lingual Transfer and Few-shot Techniques | Anar Yeginbergen, Maite Oronoz, Rodrigo Agerri | N/A | N/A |
| Learning Task Decomposition to Assist Humans in Competitive Programming | Jiaxin Wen, Ruiqi Zhong, Pei Ke, Zhihong Shao, Hongning Wang, Minlie Huang | N/A | N/A |
| An Entropy-based Text Watermarking Detection Method | Yijian Lu, Aiwei Liu, Dianzhi Yu, Jingjing Li, Irwin King | N/A | N/A |
| Enhancing Explainable Rating Prediction through Annotated Macro Concepts | Huachi Zhou, Shuang Zhou, Hao Chen, Ninghao Liu, Fan Yang, Xiao Huang | N/A | N/A |
| How to Engage your Readers? Generating Guiding Questions to Promote Active Reading | Peng Cui, Vilém Zouhar, Xiaoyu Zhang, Mrinmaya Sachan | N/A | N/A |
| Less is More: Mitigating Multimodal Hallucination from an EOS Decision Perspective | Zihao Yue, Liang Zhang, Qin Jin | N/A | N/A |
| Integrate the Essence and Eliminate the Dross: Fine-Grained Self-Consistency for Free-Form Language Generation | Xinglin Wang, Yiwei Li, Shaoxiong Feng, Peiwen Yuan, Boyuan Pan, Heda Wang, Yao Hu, Kan Li | N/A | N/A |
| More frequent verbs are associated with more diverse valency frames: Efficient principles at the lexicon-grammar interface | Siyu Tao, Lucia Donatelli, Michael Hahn | N/A | N/A |
| BatchEval: Towards Human-like Text Evaluation | Peiwen Yuan, Shaoxiong Feng, Yiwei Li, Xinglin Wang, Boyuan Pan, Heda Wang, Yao Hu, Kan Li | N/A | N/A |
| Quantifying Generalizations: Exploring the Divide Between Human and LLMs’ Sensitivity to Quantification | Claudia Collacciani, Giulia Rambelli, Marianna Bolognesi | N/A | N/A |
| Can Large Language Models Interpret Noun-Noun Compounds? A Linguistically-Motivated Study on Lexicalized and Novel Compounds | Giulia Rambelli, Emmanuele Chersoni, Claudia Collacciani, Marianna Bolognesi | N/A | N/A |
| CharacterEval: A Chinese Benchmark for Role-Playing Conversational Agent Evaluation | Quan Tu, Shilong Fan, Zihang Tian, Tianhao Shen, Shuo Shang, Xin Gao, Rui Yan | N/A | N/A |
| Generative Cross-Modal Retrieval: Memorizing Images in Multimodal Language Models for Retrieval and Beyond | Yongqi Li, Wenjie Wang, Leigang Qu, Liqiang Nie, Wenjie Li, Tat-Seng Chua | N/A | N/A |
| Self-Training with Pseudo-Label Scorer for Aspect Sentiment Quad Prediction | Yice Zhang, Jie Zeng, Weiming Hu, Ziyi Wang, Shiwei Chen, Ruifeng Xu | N/A | N/A |
| ToMBench: Benchmarking Theory of Mind in Large Language Models | Zhuang Chen, Jincenzi Wu, Jinfeng Zhou, Bosi Wen, Guanqun Bi, Gongyao Jiang, Yaru Cao, Mengting Hu, Yunghwei Lai, Zexuan Xiong, Minlie Huang | N/A | N/A |
| Learning to Generate Answers with Citations via Factual Consistency Models | Rami Aly, Zhiqiang Tang, Samson Tan, George Karypis | N/A | N/A |
| Improving Text Embeddings with Large Language Models | Liang Wang, Nan Yang, Xiaolong Huang, Linjun Yang, Rangan Majumder, Furu Wei | N/A | N/A |
| Self-Training with Direct Preference Optimization Improves Chain-of-Thought Reasoning | Tianduo Wang, Shichen Li, Wei Lu | N/A | N/A |
| UltraLink: An Open-Source Knowledge-Enhanced Multilingual Supervised Fine-tuning Dataset | Haoyu Wang, Shuo Wang, Yukun Yan, Xujia Wang, Zhiyu Yang, Yuzhuang Xu, Zhenghao Liu, Liner Yang, Ning Ding, Xu Han, Zhiyuan Liu, Maosong Sun | N/A | N/A |
| Document-level Claim Extraction and Decontextualisation for Fact-Checking | Zhenyun Deng, Michael Sejr Schlichtkrull, Andreas Vlachos | N/A | N/A |
| PairCFR: Enhancing Model Training on Paired Counterfactually Augmented Data through Contrastive Learning | Xiaoqi Qiu, Yongjie Wang, Xu Guo, Zhiwei Zeng, Yu Yue, Yuhong Feng, Chunyan Miao | N/A | N/A |
| LLMs Learn Task Heuristics from Demonstrations: A Heuristic-Driven Prompting Strategy for Document-Level Event Argument Extraction | Hanzhang Zhou, Junlang Qian, Zijian Feng, Hui Lu, Zixiao Zhu, Kezhi Mao | N/A | N/A |
| Investigating and Mitigating the Multimodal Hallucination Snowballing in Large Vision-Language Models | Weihong Zhong, Xiaocheng Feng, Liang Zhao, Qiming Li, Lei Huang, Yuxuan Gu, Weitao Ma, Yuan Xu, Bing Qin | N/A | N/A |
| COKE: A Cognitive Knowledge Graph for Machine Theory of Mind | Jincenzi Wu, Zhuang Chen, Jiawen Deng, Sahand Sabour, Helen M. Meng, Minlie Huang | N/A | N/A |
| mCoT: Multilingual Instruction Tuning for Reasoning Consistency in Language Models | Huiyuan Lai, Malvina Nissim | N/A | N/A |
| GunStance: Stance Detection for Gun Control and Gun Regulation | Nikesh Gyawali, Iustin Sirbu, Tiberiu Sosea, Sarthak Khanal, Doina Caragea, Traian Rebedea, Cornelia Caragea | N/A | N/A |
| Beyond Traditional Benchmarks: Analyzing Behaviors of Open LLMs on Data-to-Text Generation | Zdeněk Kasner, Ondrej Dusek | N/A | N/A |
| Don’t Go To Extremes: Revealing the Excessive Sensitivity and Calibration Limitations of LLMs in Implicit Hate Speech Detection | Min Zhang, Jianfeng He, Taoran Ji, Chang-Tien Lu | N/A | N/A |
| Don’t Rank, Combine! Combining Machine Translation Hypotheses Using Quality Estimation | Giorgos Vernikos, Andrei Popescu-Belis | N/A | N/A |
| Generating and Evaluating Plausible Explanations for Knowledge Graph Completion | Antonio Di Mauro, Zhao Xu, Wiem Ben Rim, Timo Sztyler, Carolin Lawrence | N/A | N/A |
| One Prompt To Rule Them All: LLMs for Opinion Summary Evaluation | Tejpalsingh Siledar, Swaroop Nath, Sankara Sri Raghava Ravindra Muddu, Rupasai Rangaraju, Swaprava Nath, Pushpak Bhattacharyya, Suman Banerjee, Amey Patil, Sudhanshu Shekhar Singh, Muthusamy Chelliah, Nikesh Garera | N/A | N/A |
| MultiPICo: Multilingual Perspectivist Irony Corpus | Silvia Casola, Simona Frenda, Soda Marem Lo, Erhan Sezerer, Antonio Uva, Valerio Basile, Cristina Bosco, Alessandro Pedrani, Chiara Rubagotti, Viviana Patti, Davide Bernardi | N/A | N/A |
| LANDeRMT: Detecting and Routing Language-Aware Neurons for Selectively Finetuning LLMs to Machine Translation | Shaolin Zhu, Leiyu Pan, Bo Li, Deyi Xiong | N/A | N/A |
| A Joint Coreference-Aware Approach to Document-Level Target Sentiment Analysis | Hongjie Cai, Heqing Ma, Jianfei Yu, Rui Xia | N/A | N/A |
| VisDiaHalBench: A Visual Dialogue Benchmark For Diagnosing Hallucination in Large Vision-Language Models | Qingxing Cao, Junhao Cheng, Xiaodan Liang, Liang Lin | N/A | N/A |
| AutoDSL: Automated domain-specific language design for structural representation of procedures with constraints | Yu-Zhe Shi, Haofei Hou, Zhangqian Bi, Fanxu Meng, Xiang Wei, Lecheng Ruan, Qining Wang | N/A | N/A |
| Multipath parsing in the brain | Berta Franzluebbers, Donald Dunagan, Miloš Stanojević, Jan Buys, John T. Hale | N/A | N/A |
| Search-Adaptor: Embedding Customization for Information Retrieval | Jinsung Yoon, Yanfei Chen, Sercan O Arik, Tomas Pfister | N/A | N/A |
| Back to Basics: Revisiting REINFORCE-Style Optimization for Learning from Human Feedback in LLMs | Arash Ahmadian, Chris Cremer, Matthias Gallé, Marzieh Fadaee, Julia Kreutzer, Olivier Pietquin, Ahmet Üstün, Sara Hooker | N/A | N/A |
| VIEScore: Towards Explainable Metrics for Conditional Image Synthesis Evaluation | Max Ku, Dongfu Jiang, Cong Wei, Xiang Yue, Wenhu Chen | N/A | N/A |
| Tree Transformer’s Disambiguation Ability of Prepositional Phrase Attachment and Garden Path Effects | Lingling Zhou, Suzan Verberne, Gijs Wijnholds | N/A | N/A |
| Tree-of-Traversals: A Zero-Shot Reasoning Algorithm for Augmenting Black-box Language Models with Knowledge Graphs | Elan Sopher Markowitz, Anil Ramakrishna, Jwala Dhamala, Ninareh Mehrabi, Charith Peris, Rahul Gupta, Kai-Wei Chang, Aram Galstyan | N/A | N/A |
| Structured Tree Alignment for Evaluation of (Speech) Constituency Parsing | Freda Shi, Kevin Gimpel, Karen Livescu | N/A | N/A |
| ViSAGe: A Global-Scale Analysis of Visual Stereotypes in Text-to-Image Generation | Akshita Jha, Vinodkumar Prabhakaran, Remi Denton, Sarah Laszlo, Shachi Dave, Rida Qadri, Chandan K. Reddy, Sunipa Dev | N/A | N/A |
| AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agents | Harsh Trivedi, Tushar Khot, Mareike Hartmann, Ruskin Manku, Vinty Dong, Edward Li, Shashank Gupta, Ashish Sabharwal, Niranjan Balasubramanian | N/A | N/A |
| Transferable and Efficient Non-Factual Content Detection via Probe Training with Offline Consistency Checking | Xiaokang Zhang, Zijun Yao, Jing Zhang, Kaifeng Yun, Jifan Yu, Juanzi Li, Jie Tang | N/A | N/A |
| What Do Language Models Learn in Context? The Structured Task Hypothesis. | Jiaoda Li, Yifan Hou, Mrinmaya Sachan, Ryan Cotterell | N/A | N/A |
| Agent Lumos: Unified and Modular Training for Open-Source Language Agents | Da Yin, Faeze Brahman, Abhilasha Ravichander, Khyathi Chandu, Kai-Wei Chang, Yejin Choi, Bill Yuchen Lin | N/A | N/A |
| Investigating Cultural Alignment of Large Language Models | Badr AlKhamissi, Muhammad ElNokrashy, Mai Alkhamissi, Mona T. Diab | N/A | N/A |
| More Victories, Less Cooperation: Assessing Cicero’s Diplomacy Play | Wichayaporn Wongkamjan, Feng Gu, Yanze Wang, Ulf Hermjakob, Jonathan May, Brandon M. Stewart, Jonathan K. Kummerfeld, Denis Peskoff, Jordan Lee Boyd-Graber | N/A | N/A |
| VoiceCraft: Zero-Shot Speech Editing and Text-to-Speech in the Wild | Puyuan Peng, Po-Yao Huang, Shang-Wen Li, Abdelrahman Mohamed, David Harwath | N/A | N/A |
| RAID: A Shared Benchmark for Robust Evaluation of Machine-Generated Text Detectors | Liam Dugan, Alyssa Hwang, Filip Trhlík, Andrew Zhu, Josh Magnus Ludan, Hainiu Xu, Daphne Ippolito, Chris Callison-Burch | N/A | N/A |
| Silent Signals, Loud Impact: LLMs for Word-Sense Disambiguation of Coded Dog Whistles | Julia Kruk, Michela Marchini, Rijul Magu, Caleb Ziems, David Muchlinski, Diyi Yang | N/A | N/A |
| On the Representational Capacity of Neural Language Models with Chain-of-Thought Reasoning | Franz Nowak, Anej Svete, Alexandra Butoi, Ryan Cotterell | N/A | N/A |
| Analyzing LLM Behavior in Dialogue Summarization: Unveiling Circumstantial Hallucination Trends | Sanjana Ramprasad, Elisa Ferracane, Zachary Chase Lipton | N/A | N/A |
| MMToM-QA: Multimodal Theory of Mind Question Answering | Chuanyang Jin, Yutong Wu, Jing Cao, Jiannan Xiang, Yen-Ling Kuo, Zhiting Hu, Tomer Ullman, Antonio Torralba, Joshua B. Tenenbaum, Tianmin Shu | N/A | N/A |
| LLM in a flash: Efficient Large Language Model Inference with Limited Memory | Keivan Alizadeh, Seyed Iman Mirzadeh, Dmitry Belenko, S. Karen Khatamifard, Minsik Cho, Carlo C del Mundo, Mohammad Rastegari, Mehrdad Farajtabar | N/A | N/A |
| Video-ChatGPT: Towards Detailed Video Understanding via Large Vision and Language Models | Muhammad Maaz, Hanoona Abdul Rasheed, Salman Khan, Fahad Shahbaz Khan | N/A | N/A |
| To Distill or Not to Distill? On the Robustness of Robust Knowledge Distillation | Abdul Waheed, Karima Kadaoui, Muhammad Abdul-Mageed | N/A | N/A |
| DocMath-Eval: Evaluating Math Reasoning Capabilities of LLMs in Understanding Financial Documents | Yilun Zhao, Yitao Long, Hongjun Liu, Ryo Kamoi, Linyong Nan, Lyuhao Chen, Yixin Liu, Xiangru Tang, Rui Zhang, Arman Cohan | N/A | N/A |
| LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding | Mostafa Elhoushi, Akshat Shrivastava, Diana Liskovich, Basil Hosmer, Bram Wasti, Liangzhen Lai, Anas Mahmoud, Bilge Acun, Saurabh Agarwal, Ahmed Roman, Ahmed A Aly, Beidi Chen, Carole-Jean Wu | N/A | N/A |
| Unintended Impacts of LLM Alignment on Global Representation | Michael J Ryan, William Barr Held, Diyi Yang | N/A | N/A |
| Classist Tools: Social Class Correlates with Performance in NLP | Amanda Cercas Curry, Giuseppe Attanasio, Zeerak Talat, Dirk Hovy | N/A | N/A |
| ActionIE: Action Extraction from Scientific Literature with Programming Languages | Xianrui Zhong, Yufeng Du, Siru Ouyang, Ming Zhong, Tingfeng Luo, Qirong Ho, Hao Peng, Heng Ji, Jiawei Han | N/A | N/A |
| A Community-Centric Perspective for Characterizing and Detecting Anti-Asian Violence-Provoking Speech | Gaurav Verma, Rynaa Grover, Jiawei Zhou, Binny Mathew, Jordan Kraemer, Munmun De Choudhury, Srijan Kumar | N/A | N/A |
| Retaining Key Information under High Compression Ratios: Query-Guided Compressor for LLMs | Zhiwei Cao, Qian Cao, Yu Lu, Ningxin Peng, Luyang Huang, Shanbo Cheng, Jinsong Su | N/A | N/A |
| COSMIC: Mutual Information for Task-Agnostic Summarization Evaluation | Maxime Darrin, Philippe Formont, Jackie CK Cheung, Pablo Piantanida | N/A | N/A |
| ICLEF: In-Context Learning with Expert Feedback for Explainable Style Transfer | Arkadiy Saakyan, Smaranda Muresan | N/A | N/A |
| EUROPA: A Legal Multilingual Keyphrase Generation Dataset | Olivier Salaün, Frédéric Piedboeuf, Guillaume Le Berre, David Alfonso-Hermelo, Philippe Langlais | N/A | N/A |
| GLIMPSE: Pragmatically Informative Multi-Document Summarization for Scholarly Reviews | Maxime Darrin, Ines Arous, Pablo Piantanida, Jackie CK Cheung | N/A | N/A |
| MAP’s not dead yet: Uncovering true language model modes by conditioning away degeneracy | Davis Yoshida, Kartik Goyal, Kevin Gimpel | N/A | N/A |
| Peacock: A Family of Arabic Multimodal Large Language Models and Benchmarks | Fakhraddin Alwajih, El Moatez Billah Nagoudi, Gagan Bhatia, Abdelrahman Mohamed, Muhammad Abdul-Mageed | N/A | N/A |
| Generating Coherent Sequences of Visual Illustrations for Real-World Manual Tasks | João Bordalo, Vasco Ramos, Rodrigo Valério, Diogo Glória-Silva, Yonatan Bitton, Michal Yarom, Idan Szpektor, Joao Magalhaes | N/A | N/A |
| Cheetah: Natural Language Generation for 517 African Languages | Ife Adebara, AbdelRahim A. Elmadany, Muhammad Abdul-Mageed | N/A | N/A |
| TaPERA: Enhancing Faithfulness and Interpretability in Long-Form Table QA by Content Planning and Execution-based Reasoning | Yilun Zhao, Lyuhao Chen, Arman Cohan, Chen Zhao | N/A | N/A |
| KnowledgeFMath: A Knowledge-Intensive Math Reasoning Dataset in Finance Domains | Yilun Zhao, Hongjun Liu, Yitao Long, Rui Zhang, Chen Zhao, Arman Cohan | N/A | N/A |
| API-BLEND: A Comprehensive Corpora for Training and Benchmarking API LLMs | Kinjal Basu, Ibrahim Abdelaziz, Subhajit Chaudhury, Soham Dan, Maxwell Crouse, Asim Munawar, Vernon Austel, Sadhana Kumaravel, Vinod Muthusamy, Pavan Kapanipathi, Luis A. Lastras | N/A | N/A |
| LoRA-Flow: Dynamic LoRA Fusion for Large Language Models in Generative Tasks | Hanqing Wang, Bowen Ping, Shuo Wang, Xu Han, Yun Chen, Zhiyuan Liu, Maosong Sun | N/A | N/A |
| Harder Task Needs More Experts: Dynamic Routing in MoE Models | Quzhe Huang, Zhenwei An, Nan Zhuang, Mingxu Tao, Chen Zhang, Yang Jin, Kun Xu, Kun Xu, Liwei Chen, Songfang Huang, Yansong Feng | N/A | N/A |
| XLAVS-R: Cross-Lingual Audio-Visual Speech Representation Learning for Noise-Robust Speech Perception | HyoJung Han, Mohamed Anwar, Juan Pino, Wei-Ning Hsu, Marine Carpuat, Bowen Shi, Changhan Wang | N/A | N/A |
| SOTOPIA-π: Interactive Learning of Socially Intelligent Language Agents | Ruiyi Wang, Haofei Yu, Wenxin Sharon Zhang, Zhengyang Qi, Maarten Sap, Yonatan Bisk, Graham Neubig, Hao Zhu | N/A | N/A |
| ${\mathcal X}$FT: Unlocking the Power of Code Instruction Tuning by Simply Merging Upcycled Mixture-of-Experts | Yifeng Ding, Jiawei Liu, Yuxiang Wei, Lingming Zhang | N/A | N/A |
| Generalizability of Mixture of Domain-Specific Adapters from the Lens of Signed Weight Directions and its Application to Effective Model Pruning | Tuc Van Nguyen, Thai Le | N/A | N/A |
| Learning to Decode Collaboratively with Multiple Language Models | Zejiang Shen, Hunter Lang, Bailin Wang, Yoon Kim, David Sontag | N/A | N/A |
| DRAGIN: Dynamic Retrieval Augmented Generation based on the Real-time Information Needs of Large Language Models | Weihang Su, Yichen Tang, Qingyao Ai, Zhijing Wu, Yiqun Liu | N/A | N/A |
| Living in the Moment: Can Large Language Models Grasp Co-Temporal Reasoning? | Zhaochen Su, Juntao Li, Jun Zhang, Tong Zhu, Xiaoye Qu, Pan Zhou, Yan Bowen, Yu Cheng, Min Zhang | N/A | N/A |
| CritiqueLLM: Towards an Informative Critique Generation Model for Evaluation of Large Language Model Generation | Pei Ke, Bosi Wen, Andrew Zhuoer Feng, Xiao Liu, Xuanyu Lei, Jiale Cheng, Shengyuan Wang, Aohan Zeng, Yuxiao Dong, Hongning Wang, Jie Tang, Minlie Huang | N/A | N/A |
| LLMArena: Assessing Capabilities of Large Language Models in Dynamic Multi-Agent Environments | Junzhe Chen, Xuming Hu, Shuodi Liu, Shiyu Huang, Wei-Wei Tu, Zhaofeng He, Lijie Wen | N/A | N/A |
| Small But Funny: A Feedback-Driven Approach to Humor Distillation | Sahithya Ravi, Patrick Huber, Akshat Shrivastava, Vered Shwartz, Arash Einolghozati | N/A | N/A |
| Symbol-LLM: Towards Foundational Symbol-centric Interface For Large Language Models | Fangzhi Xu, Zhiyong Wu, Qiushi Sun, Siyu Ren, Fei Yuan, Shuai Yuan, Qika Lin, Yu Qiao, Jun Liu | N/A | N/A |
| From Sights to Insights: Towards Summarization of Multimodal Clinical Documents | Akash Ghosh, Mohit Singh Tomar, Abhisek Tiwari, Sriparna Saha, Jatin Avinash Salve, Setu Sinha | N/A | N/A |
| When Phrases Meet Probabilities: Enabling Open Relation Extraction with Cooperating Large Language Models | Jiaxin Wang, Lingling Zhang, Wee Sun Lee, Yujie Zhong, Liwei Kang, Jun Liu | N/A | N/A |
| Effects of diversity incentives on sample diversity and downstream model performance in LLM-based text augmentation | Jan Cegin, Branislav Pecher, Jakub Simko, Ivan Srba, Maria Bielikova, Peter Brusilovsky | N/A | N/A |
| Beyond Orthography: Automatic Recovery of Short Vowels and Dialectal Sounds in Arabic | Yassine El Kheir, Hamdy Mubarak, Ahmed Ali, Shammur Absar Chowdhury | N/A | N/A |
| Document-Level Machine Translation with Large-Scale Public Parallel Corpora | Proyag Pal, Alexandra Birch, Kenneth Heafield | N/A | N/A |
| Guardians of the Machine Translation Meta-Evaluation: Sentinel Metrics Fall In! | Stefano Perrella, Lorenzo Proietti, Alessandro Scirè, Edoardo Barba, Roberto Navigli | N/A | N/A |
| NounAtlas: Filling the Gap in Nominal Semantic Role Labeling | Roberto Navigli, Marco Lo Pinto, Pasquale Silvestri, Dennis Rotondi, Simone Ciciliano, Alessandro Scirè | N/A | N/A |
| Bridging the Empirical-Theoretical Gap in Neural Network Formal Language Learning Using Minimum Description Length | Nur Lan, Emmanuel Chemla, Roni Katzir | N/A | N/A |
| Context versus Prior Knowledge in Language Models | Kevin Du, Vésteinn Snæbjarnarson, Niklas Stoehr, Jennifer C. White, Aaron Schein, Ryan Cotterell | N/A | N/A |
| Word Matters: What Influences Domain Adaptation in Summarization? | Yinghao Li, Siyu Miao, Heyan Huang, Yang Gao | N/A | N/A |
| Visualization Recommendation with Prompt-based Reprogramming of Large Language Models | Xinhang Li, Jingbo Zhou, Wei Chen, Derong Xu, Tong Xu, Enhong Chen | N/A | N/A |
| HOLMES: Hyper-Relational Knowledge Graphs for Multi-hop Question Answering using LLMs | Pranoy Panda, Ankush Agarwal, Chaitanya Devaguptapu, Manohar Kaul, Prathosh AP | N/A | N/A |
| Toward In-Context Teaching: Adapting Examples to Students’ Misconceptions | Alexis Ross, Jacob Andreas | N/A | N/A |
| Bridging Word-Pair and Token-Level Metaphor Detection with Explainable Domain Mining | Yuan Tian, Ruike Zhang, Nan Xu, Wenji Mao | N/A | N/A |
| Faithful Logical Reasoning via Symbolic Chain-of-Thought | Jundong Xu, Hao Fei, Liangming Pan, Qian Liu, Mong-Li Lee, Wynne Hsu | N/A | N/A |
| S$^2$GSL: Incorporating Segment to Syntactic Enhanced Graph Structure Learning for Aspect-based Sentiment Analysis | Bingfeng Chen, Qihan Ouyang, Yongqi Luo, Boyan Xu, Ruichu Cai, Zhifeng Hao | N/A | N/A |
| Maverick: Efficient and Accurate Coreference Resolution Defying Recent Trends | Giuliano Martinelli, Edoardo Barba, Roberto Navigli | N/A | N/A |
| ESCoT: Towards Interpretable Emotional Support Dialogue Systems | Tenggan Zhang, Xinjie Zhang, Jinming Zhao, Li Zhou, Qin Jin | N/A | N/A |
| PathReasoner: Modeling Reasoning Path with Equivalent Extension for Logical Question Answering | Fangzhi Xu, Qika Lin, Tianzhe Zhao, Jiawei Han, Jun Liu | N/A | N/A |
| WARDEN: Multi-Directional Backdoor Watermarks for Embedding-as-a-Service Copyright Protection | Anudeex Shetty, Yue Teng, Ke He, Qiongkai Xu | N/A | N/A |
| Advancing Parameter Efficiency in Fine-tuning via Representation Editing | Muling Wu, Wenhao Liu, Xiaohua Wang, Tianlong Li, Changze Lv, Zixuan Ling, Zhu JianHao, Cenyuan Zhang, Xiaoqing Zheng, Xuanjing Huang | N/A | N/A |
| Context Consistency between Training and Inference in Simultaneous Machine Translation | Meizhi Zhong, Lemao Liu, Kehai Chen, Mingming Yang, Min Zhang | N/A | N/A |
| Using Natural Language Explanations to Improve Robustness of In-context Learning | Xuanli He, Yuxiang Wu, Oana-Maria Camburu, Pasquale Minervini, Pontus Stenetorp | N/A | N/A |
| The Earth is Flat because…: Investigating LLMs’ Belief towards Misinformation via Persuasive Conversation | Rongwu Xu, Brian S. Lin, Shujian Yang, Tianqi Zhang, Weiyan Shi, Tianwei Zhang, Zhixuan Fang, Wei Xu, Han Qiu | N/A | N/A |
| Chunk, Align, Select: A Simple Long-sequence Processing Method for Transformers | Jiawen Xie, Pengyu Cheng, Xiao Liang, Yong Dai, Nan Du | N/A | N/A |
| LooGLE: Can Long-Context Language Models Understand Long Contexts? | Jiaqi Li, Mengmeng Wang, Zilong Zheng, Muhan Zhang | N/A | N/A |
| ArchCode: Incorporating Software Requirements in Code Generation with Large Language Models | Hojae Han, Jaejin Kim, Jaeseok Yoo, Youngwon Lee, seung-won hwang | N/A | N/A |
| Let’s Go Real Talk: Spoken Dialogue Model for Face-to-Face Conversation | Se Jin Park, Chae Won Kim, Hyeongseop Rha, Minsu Kim, Joanna Hong, Jeonghun Yeo, Yong Man Ro | N/A | N/A |
| Combining Supervised Learning and Reinforcement Learning for Multi-Label Classification Tasks with Partial Labels | Zixia Jia, Junpeng Li, Shichuan Zhang, Anji Liu, Zilong Zheng | N/A | N/A |
| MULFE: A Multi-Level Benchmark for Free Text Model Editing | Chenhao Wang, Pengfei Cao, Zhuoran Jin, Yubo Chen, Daojian Zeng, Kang Liu, Jun Zhao | N/A | N/A |
| MobileSpeech: A Fast and High-Fidelity Framework for Mobile Zero-Shot Text-to-Speech | Shengpeng Ji, Ziyue Jiang, Wang Hanting, Jialung Zuo, Zhou Zhao | N/A | N/A |
| Spatially-Aware Speaker for Vision-and-Language Navigation Instruction Generation | Muraleekrishna Gopinathan, Martin Masek, Jumana Abu-Khalaf, David Suter | N/A | N/A |
| HiRoPE: Length Extrapolation for Code Models Using Hierarchical Position | Kechi Zhang, Ge Li, Huangzhao Zhang, Zhi Jin | N/A | N/A |
| Never Lost in the Middle: Mastering Long-Context Question Answering with Position-Agnostic Decompositional Training | Junqing He, Kunhao Pan, Xiaoqun Dong, Zhuoyang Song, Yibo Liu, Qianguo Sun, Yuxin Liang, Hao Wang, Enming Zhang, Jiaxing Zhang | N/A | N/A |
| CodeAgent: Enhancing Code Generation with Tool-Integrated Agent Systems for Real-World Repo-level Coding Challenges | Kechi Zhang, Jia Li, Ge Li, Xianjie Shi, Zhi Jin | N/A | N/A |
| When is Tree Search Useful for LLM Planning? It Depends on the Discriminator | Ziru Chen, Michael White, Ray Mooney, Ali Payani, Yu Su, Huan Sun | N/A | N/A |
| LogicBench: Towards Systematic Evaluation of Logical Reasoning Ability of Large Language Models | Mihir Parmar, Nisarg Patel, Neeraj Varshney, Mutsumi Nakamura, Man Luo, Santosh Mashetty, Arindam Mitra, Chitta Baral | N/A | N/A |
| ECBD: Evidence-Centered Benchmark Design for NLP | Yu Lu Liu, Su Lin Blodgett, Jackie CK Cheung, Vera Liao, Alexandra Olteanu, Ziang Xiao | N/A | N/A |
| Meta-Tuning LLMs to Leverage Lexical Knowledge for Generalizable Language Style Understanding | Ruohao Guo, Wei Xu, Alan Ritter | N/A | N/A |
| Reducing Privacy Risks in Online Self-Disclosures with Language Models | Yao Dou, Isadora Krsek, Tarek Naous, Anubha Kabra, Sauvik Das, Alan Ritter, Wei Xu | N/A | N/A |
| Navigating the Dual Facets: A Comprehensive Evaluation of Sequential Memory Editing in Large Language Models | Zihao Lin, Mohammad Beigi, Hongxuan Li, Yufan Zhou, Yuxiang Zhang, Qifan Wang, Wenpeng Yin, Lifu Huang | N/A | N/A |
| REFINESUMM: Self-Refining MLLM for Generating a Multimodal Summarization Dataset | Vaidehi Patil, Leonardo F. R. Ribeiro, Mengwen Liu, Mohit Bansal, Markus Dreyer | N/A | N/A |
| When Benchmarks are Targets: Revealing the Sensitivity of Large Language Model Leaderboards | Norah A. Alzahrani, Hisham Abdullah Alyahya, Yazeed Alnumay, Sultan AlRashed, Shaykhah Z. Alsubaie, Yousef Almushayqih, Faisal Abdulrahman Mirza, Nouf M. Alotaibi, Nora Al-Twairesh, Areeb Alowisheq, M Saiful Bari, Haidar Khan | N/A | N/A |
| LLM-Rubric: A Multidimensional, Calibrated Approach to Automated Evaluation of Natural Language Texts | Helia Hashemi, Jason Eisner, Corby Rosset, Benjamin Van Durme, Chris Kedzie | N/A | N/A |
| LIEDER: Linguistically-Informed Evaluation for Discourse Entity Recognition | Xiaomeng Zhu, Robert Frank | N/A | N/A |
| Evaluating Very Long-Term Conversational Memory of LLM Agents | Adyasha Maharana, Dong-Ho Lee, Sergey Tulyakov, Mohit Bansal, Francesco Barbieri, Yuwei Fang | N/A | N/A |
| Prototypical Reward Network for Data-Efficient Model Alignment | Jinghan Zhang, Xiting Wang, Yiqiao Jin, Changyu Chen, Xinhao Zhang, Kunpeng Liu | N/A | N/A |
| NEO-BENCH: Evaluating Robustness of Large Language Models with Neologisms | Jonathan Zheng, Alan Ritter, Wei Xu | N/A | N/A |
| Impacts of Misspelled Queries on Translation and Product Search | Greg Hanneman, Natawut Monaikul, Taichi Nakatani | N/A | N/A |
| Having Beer after Prayer? Measuring Cultural Bias in Large Language Models | Tarek Naous, Michael J Ryan, Alan Ritter, Wei Xu | N/A | N/A |
| Skin-in-the-Game: Decision Making via Multi-Stakeholder Alignment in LLMs | Bilgehan Sel, Priya Shanmugasundaram, Mohammad Kachuee, Kun Zhou, Ruoxi Jia, Ming Jin | N/A | N/A |
| The MERSA Dataset and a Transformer-Based Approach for Speech Emotion Recognition | Enshi Zhang, Rafael Trujillo, Christian Poellabauer | N/A | N/A |
| Transparent and Scrutable Recommendations Using Natural Language User Profiles | Jerome Ramos, Hossein A. Rahmani, Xi Wang, Xiao Fu, Aldo Lipani | N/A | N/A |
| Fora: A corpus and framework for the study of facilitated dialogue | Hope Schroeder, Deb Roy, Jad Kabbara | N/A | N/A |
| Explanation-aware Soft Ensemble Empowers Large Language Model In-context Learning | Yue Yu, Jiaming Shen, Tianqi Liu, Zhen Qin, Jing Nathan Yan, Jialu Liu, Chao Zhang, Michael Bendersky | N/A | N/A |
| What is the Best Way for ChatGPT to Translate Poetry? | Shanshan Wang, Derek F. Wong, Jingming Yao, Lidia S. Chao | N/A | N/A |
| Rephrasing the Web: A Recipe for Compute and Data-Efficient Language Modeling | Pratyush Maini, Skyler Seto, Richard He Bai, David Grangier, Yizhe Zhang, Navdeep Jaitly | N/A | N/A |
| DeCoT: Debiasing Chain-of-Thought for Knowledge-Intensive Tasks in Large Language Models via Causal Intervention | Junda Wu, Tong Yu, Xiang Chen, Haoliang Wang, Ryan A. Rossi, Sungchul Kim, Anup Rao, Julian McAuley | N/A | N/A |
| Representation Learning with Conditional Information Flow Maximization | Dou Hu, Lingwei Wei, Wei Zhou, Songlin Hu | N/A | N/A |
| GPT is Not an Annotator: The Necessity of Human Annotation in Fairness Benchmark Construction | Virginia K. Felkner, Jennifer A. Thompson, Jonathan May | N/A | N/A |
| Quantifying Contamination in Evaluating Code Generation Capabilities of Language Models | Martin Riddell, Ansong Ni, Arman Cohan | N/A | N/A |
| Language Models are Homer Simpson! Safety Re-Alignment of Fine-tuned Language Models through Task Arithmetic | Rishabh Bhardwaj, Duc Anh Do, Soujanya Poria | N/A | N/A |
| Tracking the Newsworthiness of Public Documents | Alexander Spangher, Serdar Tumgoren, Ben Welsh, Nanyun Peng, Emilio Ferrara, Jonathan May | N/A | N/A |
| EWEK-QA: Enhanced Web and Efficient Knowledge Graph Retrieval for Citation-based Question Answering Systems | Mohammad Dehghan, Mohammad Ali Alomrani, Sunyam Bagga, David Alfonso-Hermelo, Khalil Bibi, Abbas Ghaddar, Yingxue Zhang, Xiaoguang Li, Jianye Hao, Qun Liu, Jimmy Lin, Boxing Chen, Prasanna Parthasarathi, Mahdi Biparva, Mehdi Rezagholizadeh | N/A | N/A |
| Explicating the Implicit: Argument Detection Beyond Sentence Boundaries | Paul Roit, Aviv Slobodkin, Eran Hirsch, Arie Cattan, Ayal Klein, Valentina Pyatkin, Ido Dagan | N/A | N/A |
| Multi-modal Preference Alignment Remedies Degradation of Visual Instruction Tuning on Language Models | Shengzhi Li, Rongyu Lin, Shichao Pei | N/A | N/A |
| Word Embeddings Are Steers for Language Models | Chi Han, Jialiang Xu, Manling Li, Yi Fung, Chenkai Sun, Nan Jiang, Tarek F. Abdelzaher, Heng Ji | N/A | N/A |
| Multistage Collaborative Knowledge Distillation from a Large Language Model for Semi-Supervised Sequence Generation | Jiachen Zhao, Wenlong Zhao, Andrew Drozdov, Benjamin Rozonoyer, Md Arafat Sultan, Jay-Yoon Lee, Mohit Iyyer, Andrew McCallum | N/A | N/A |
| Controlled Text Generation for Black-box Language Models via Score-based Progressive Editor | Sangwon Yu, Changmin Lee, Hojin Lee, Sungroh Yoon | N/A | N/A |
| LogogramNLP: Comparing Visual and Textual Representations of Ancient Logographic Writing Systems for NLP | Danlu Chen, Freda Shi, Aditi Agarwal, Jacobo Myerston, Taylor Berg-Kirkpatrick | N/A | N/A |
| Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning | Ming Li, Yong Zhang, Shwai He, Zhitao Li, Hongyu Zhao, Jianzong Wang, Ning Cheng, Tianyi Zhou | N/A | N/A |
| Confabulation: The Surprising Value of Large Language Model Hallucinations | Peiqi Sui, Eamon Duede, Sophie Wu, Richard Jean So | N/A | N/A |
| IAPT: Instance-Aware Prompt Tuning for Large Language Models | Wei Zhu, Aaron Xuxiang Tian, Congrui Yin, Yuan Ni, Xiaoling Wang, Guotong Xie | N/A | N/A |
| Can Language Models Serve as Text-Based World Simulators? | Ruoyao Wang, Graham Todd, Ziang Xiao, Xingdi Yuan, Marc-Alexandre Côté, Peter Clark, Peter Jansen | N/A | N/A |
| FanOutQA: A Multi-Hop, Multi-Document Question Answering Benchmark for Large Language Models | Andrew Zhu, Alyssa Hwang, Liam Dugan, Chris Callison-Burch | N/A | N/A |
| Revisiting Code Similarity Evaluation with Abstract Syntax Tree Edit Distance | Yewei Song, Cedric Lothritz, Daniel Tang, Tegawendé F. Bissyandé, Jacques Klein | N/A | N/A |
| Resisting the Lure of the Skyline: Grounding Practices in Active Learning for Morphological Inflection | Saliha Muradoglu, Michael Ginn, Miikka Silfverberg, Mans Hulden | N/A | N/A |
| Speculative Contrastive Decoding | Hongyi Yuan, Keming Lu, Fei Huang, Zheng Yuan, Chang Zhou | N/A | N/A |
| RDRec: Rationale Distillation for LLM-based Recommendation | Xinfeng Wang, Jin Cui, Yoshimi Suzuki, Fumiyo Fukumoto | N/A | N/A |
| Isotropy, Clusters, and Classifiers | Timothee Mickus, Stig-Arne Grönroos, Joseph Attieh | N/A | N/A |
| Language Models Do Hard Arithmetic Tasks Easily and Hardly Do Easy Arithmetic Tasks | Andrew Gambardella, Yusuke Iwasawa, Yutaka Matsuo | N/A | N/A |
| Cleaner Pretraining Corpus Curation with Neural Web Scraping | Zhipeng Xu, Zhenghao Liu, Yukun Yan, Zhiyuan Liu, Ge Yu, Chenyan Xiong | N/A | N/A |
| Simpson’s Paradox and the Accuracy-Fluency Tradeoff in Translation | Zheng Wei Lim, Ekaterina Vylomova, Trevor Cohn, Charles Kemp | N/A | N/A |
| UltraSparseBERT: 99% Conditionally Sparse Language Modelling | Peter Belcak, Roger Wattenhofer | N/A | N/A |
| SceMQA: A Scientific College Entrance Level Multimodal Question Answering Benchmark | Zhenwen Liang, Kehan Guo, Gang Liu, Taicheng Guo, Yujun Zhou, Tianyu Yang, Jiajun Jiao, Renjie Pi, Jipeng Zhang, Xiangliang Zhang | N/A | N/A |
| On the Role of Long-tail Knowledge in Retrieval Augmented Large Language Models | Dongyang Li, Junbing Yan, Taolin Zhang, Chengyu Wang, Xiaofeng He, Longtao Huang, Hui Xue’, Jun Huang | N/A | N/A |
| IEPile: Unearthing Large Scale Schema-Conditioned Information Extraction Corpus | Honghao Gui, Lin Yuan, Hongbin Ye, Ningyu Zhang, Mengshu Sun, Lei Liang, Huajun Chen | N/A | N/A |
| Bi-Directional Multi-Granularity Generation Framework for Knowledge Graph-to-Text with Large Language Model | Haowei Du, Chen Li, Dinghao Zhang, Dongyan Zhao | N/A | N/A |
| Code-Switching Can be Better Aligners: Advancing Cross-Lingual SLU through Representation-Level and Prediction-Level Alignment | Zhihong Zhu, Xuxin Cheng, Zhanpeng Chen, Xianwei Zhuang, Zhiqi Huang, Yuexian Zou | N/A | N/A |
| AFLoRA: Adaptive Freezing of Low Rank Adaptation in Parameter Efficient Fine-Tuning of Large Models | Zeyu Liu, Souvik Kundu, Anni Li, Junrui Wan, Lianghao Jiang, Peter Anthony Beerel | N/A | N/A |
| DDPrompt: Differential Diversity Prompting in Large Language Models | Lin Mu, Wenhao Zhang, Yiwen Zhang, Peiquan Jin | N/A | N/A |
| Monotonic Representation of Numeric Attributes in Language Models | Benjamin Heinzerling, Kentaro Inui | N/A | N/A |
| Two Issues with Chinese Spelling Correction and A Refinement Solution | Changxuan Sun, Linlin She, Xuesong Lu | N/A | N/A |
| Linear-time Minimum Bayes Risk Decoding with Reference Aggregation | Jannis Vamvas, Rico Sennrich | N/A | N/A |
| DynaSemble: Dynamic Ensembling of Textual and Structure-Based Models for Knowledge Graph Completion | Ananjan Nandi, Navdeep Kaur, Parag Singla, Mausam | N/A | N/A |
| Fine-Tuning Pre-Trained Language Models with Gaze Supervision | Shuwen Deng, Paul Prasse, David Robert Reich, Tobias Scheffer, Lena Ann Jäger | N/A | N/A |
| Growing Trees on Sounds: Assessing Strategies for End-to-End Dependency Parsing of Speech | Adrien Pupier, Maximin Coavoux, Jérôme Goulian, Benjamin Lecouteux | N/A | N/A |
| Sketch-Guided Constrained Decoding for Boosting Blackbox Large Language Models without Logit Access | Saibo Geng, Berkay Döner, Chris Wendler, Martin Josifoski, Robert West | N/A | N/A |
| On the Semantic Latent Space of Diffusion-Based Text-To-Speech Models | Miri Varshavsky, Roy Hirsch, Regev Cohen, Tomer Golany, Daniel Freedman, Ehud Rivlin | N/A | N/A |
| Learnable Privacy Neurons Localization in Language Models | Ruizhe Chen, Tianxiang Hu, Yang Feng, Zuozhu Liu | N/A | N/A |
| Is the Pope Catholic? Yes, the Pope is Catholic. Generative Evaluation of Non-Literal Intent Resolution in LLMs | Akhila Yerukola, Saujas Vaduguru, Daniel Fried, Maarten Sap | N/A | N/A |
| Generating Harder Cross-document Event Coreference Resolution Datasets using Metaphoric Paraphrasing | Shafiuddin Rehan Ahmed, Zhiyong Wang, George Arthur Baker, Kevin Stowe, James H. Martin | N/A | N/A |
| Soft Self-Consistency Improves Language Model Agents | Han Wang, Archiki Prasad, Elias Stengel-Eskin, Mohit Bansal | N/A | N/A |
| RecGPT: Generative Pre-training for Text-based Recommendation | Hoang Ngo, Dat Quoc Nguyen | N/A | N/A |
| MTP: A Dataset for Multi-Modal Turning Points in Casual Conversations | Gia-Bao Dinh Ho, Chang Wei Tan, Zahra Zamanzadeh Darban, Mahsa Salehi, Reza Haf, Wray Buntine | N/A | N/A |
| What Do Dialect Speakers Want? A Survey of Attitudes Towards Language Technology for German Dialects | Verena Blaschke, Christoph Purschke, Hinrich Schuetze, Barbara Plank | N/A | N/A |
| What Does Parameter-free Probing Really Uncover? | Tommi Buder-Gröndahl | N/A | N/A |
| ATLAS: Improving Lay Summarisation with Attribute-based Control | Zhihao Zhang, Tomas Goldsack, Carolina Scarton, Chenghua Lin | N/A | N/A |
| EmbSpatial-Bench: Benchmarking Spatial Understanding for Embodied Tasks with Large Vision-Language Models | Mengfei Du, Binhao Wu, Zejun Li, Xuanjing Huang, Zhongyu Wei | N/A | N/A |
| Understanding the Effects of Noise in Text-to-SQL: An Examination of the BIRD-Bench Benchmark | Niklas Wretblad, Fredrik Gordh Riseby, Rahul Biswas, Amin Ahmadi, Oskar Holmström | N/A | N/A |
| Dwell in the Beginning: How Language Models Embed Long Documents for Dense Retrieval | João Coelho, Bruno Martins, Joao Magalhaes, Jamie Callan, Chenyan Xiong | N/A | N/A |
| That’s Optional: A Contemporary Exploration of “that” Omission in English Subordinate Clauses | Ella Rabinovich | N/A | N/A |
| Do Large Language Models Discriminate in Hiring Decisions on the Basis of Race, Ethnicity, and Gender? | Haozhe An, Christabel Acquaye, Colin Wang, Zongxia Li, Rachel Rudinger | N/A | N/A |
| Explainability and Hate Speech: Structured Explanations Make Social Media Moderators Faster | Agostina Calabrese, Leonardo Neves, Neil Shah, Maarten W. Bos, Björn Ross, Mirella Lapata, Francesco Barbieri | N/A | N/A |
| Getting Serious about Humor: Crafting Humor Datasets with Unfunny Large Language Models | Zachary Horvitz, Jingru Chen, Rahul Aditya, Harshvardhan Srivastava, Robert West, Zhou Yu, Kathleen McKeown | N/A | N/A |
| Estimating the Level of Dialectness Predicts Inter-annotator Agreement in Multi-dialect Arabic Datasets | Amr Keleg, Walid Magdy, Sharon Goldwater | N/A | N/A |
| Born Differently Makes a Difference: Counterfactual Study of Bias in Biography Generation from a Data-to-Text Perspective | Biaoyan Fang, Ritvik Dinesh, Xiang Dai, Sarvnaz Karimi | N/A | N/A |
| Greed is All You Need: An Evaluation of Tokenizer Inference Methods | Omri Uzan, Craig W Schmidt, Chris Tanner, Yuval Pinter | N/A | N/A |
| Sign Language Translation with Sentence Embedding Supervision | Yasser Hamidullah, Josef van Genabith, Cristina España-Bonet | N/A | N/A |
| STREAM: Simplified Topic Retrieval, Exploration, and Analysis Module | Anton Frederik Thielmann, Arik Reuter, Christoph Weisser, Gillian Kant, Manish Kumar, Benjamin Säfken | N/A | N/A |
| DocFinQA: A Long-Context Financial Reasoning Dataset | Varshini Reddy, Rik Koncel-Kedziorski, Viet Dac Lai, Michael Krumdick, Charles Lovering, Chris Tanner | N/A | N/A |
| MaskLID: Code-Switching Language Identification through Iterative Masking | Amir Hossein Kargaran, François Yvon, Hinrich Schuetze | N/A | N/A |
| An Empirical Analysis on Large Language Models in Debate Evaluation | Xinyi Liu, Pinxin Liu, Hangfeng He | N/A | N/A |
| Fine-Tuned Machine Translation Metrics Struggle in Unseen Domains | Vilém Zouhar, Shuoyang Ding, Anna Currey, Tatyana Badeka, Jenyuan Wang, Brian Thompson | N/A | N/A |
| IndicIRSuite: Multilingual Dataset and Neural Information Models for Indian Languages | Saiful Haq, Ashutosh Sharma, Omar Khattab, Niyati Chhaya, Pushpak Bhattacharyya | N/A | N/A |
| AGR: Reinforced Causal Agent-Guided Self-explaining Rationalization | Yunxiao Zhao, Zhiqiang Wang, Xiaoli Li, Jiye Liang, Ru Li | N/A | N/A |
| Shoulders of Giants: A Look at the Degree and Utility of Openness in NLP Research | Surangika Ranathunga, Nisansa de Silva, Dilith Jayakody, Aloka Fernando | N/A | N/A |
| The Probabilities Also Matter: A More Faithful Metric for Faithfulness of Free-Text Explanations in Large Language Models | Noah Yamamoto Siegel, Oana-Maria Camburu, Nicolas Heess, Maria Perez-Ortiz | N/A | N/A |
| Don’t Buy it! Reassessing the Ad Understanding Abilities of Contrastive Multimodal Models | Anna Bavaresco, Alberto Testoni, Raquel Fernández | N/A | N/A |
| Naming, Describing, and Quantifying Visual Objects in Humans and LLMs | Alberto Testoni, Juell Sprott, Sandro Pezzelle | N/A | N/A |
| Are LLMs classical or nonmonotonic reasoners? Lessons from generics | Alina Leidinger, Robert Van Rooij, Ekaterina Shutova | N/A | N/A |
| ConstitutionalExperts: Training a Mixture of Principle-based Prompts | Savvas Petridis, Ben Wedin, Ann Yuan, James Wexler, Nithum Thain | N/A | N/A |
| Time Sensitive Knowledge Editing through Efficient Finetuning | Xiou Ge, Ali Mousavi, Edouard Grave, Armand Joulin, Kun Qian, Benjamin Han, Mostafa Arefiyan, Yunyao Li | N/A | N/A |
| PRewrite: Prompt Rewriting with Reinforcement Learning | Weize Kong, Spurthi Amba Hombaiah, Mingyang Zhang, Qiaozhu Mei, Michael Bendersky | N/A | N/A |
| SeeGULL Multilingual: a Dataset of Geo-Culturally Situated Stereotypes | Mukul Bhutani, Kevin Robinson, Vinodkumar Prabhakaran, Shachi Dave, Sunipa Dev | N/A | N/A |
| Paraphrasing in Affirmative Terms Improves Negation Understanding | MohammadHossein Rezaei, Eduardo Blanco | N/A | N/A |
| Exploring Conditional Variational Mechanism to Pinyin Input Method for Addressing One-to-Many Mappings in Low-Resource Scenarios | Bin Sun, Jianfeng Li, Hao Zhou, Fandong Meng, Kan Li, Jie Zhou | N/A | N/A |
| Consistency Training by Synthetic Question Generation for Conversational Question Answering | Hamed Hematian Hemati, Hamid Beigy | N/A | N/A |
| How Good is Zero-Shot MT Evaluation for Low Resource Indian Languages? | Anushka Singh, Ananya B. Sai, Raj Dabre, Ratish Puduppully, Anoop Kunchukuttan, Mitesh M Khapra | N/A | N/A |
| Zero-Shot Cross-Lingual Reranking with Large Language Models for Low-Resource Languages | Mofetoluwa Adeyemi, Akintunde Oladipo, Ronak Pradeep, Jimmy Lin | N/A | N/A |
| Cross-Modal Projection in Multimodal LLMs Doesn’t Really Project Visual Attributes to Textual Space | Gaurav Verma, Minje Choi, Kartik Sharma, Jamelle Watson-Daniels, Sejoon Oh, Srijan Kumar | N/A | N/A |
| Guidance-Based Prompt Data Augmentation in Specialized Domains for Named Entity Recognition | Hyeonseok Kang, Hyein Seo, Jeesu Jung, Sangkeun Jung, Du-Seong Chang, Riwoo Chung | N/A | N/A |
| Aligning Large Language Models via Fine-grained Supervision | Dehong Xu, Liang Qiu, Minseok Kim, Faisal Ladhak, Jaeyoung Do | N/A | N/A |
| Annotating FrameNet via Structure-Conditioned Language Generation | Xinyue Cui, Swabha Swayamdipta | N/A | N/A |
| DUAL-REFLECT: Enhancing Large Language Models for Reflective Translation through Dual Learning Feedback Mechanisms | Andong Chen, Lianzhang Lou, Kehai Chen, Xuefeng Bai, Yang Xiang, Muyun Yang, Tiejun Zhao, Min Zhang | N/A | N/A |
| Towards Artwork Explanation in Large-scale Vision Language Models | Kazuki Hayashi, Yusuke Sakai, Hidetaka Kamigaito, Katsuhiko Hayashi, Taro Watanabe | N/A | N/A |
| On the Hallucination in Simultaneous Machine Translation | Meizhi Zhong, Kehai Chen, Zhengshan Xue, Lemao Liu, Mingming Yang, Min Zhang | N/A | N/A |
| Self-Augmented In-Context Learning for Unsupervised Word Translation | Yaoyiran Li, Anna Korhonen, Ivan Vulić | N/A | N/A |
| RAM-EHR: Retrieval Augmentation Meets Clinical Predictions on Electronic Health Records | Ran Xu, Wenqi Shi, Yue Yu, Yuchen Zhuang, Bowen Jin, May Dongmei Wang, Joyce C. Ho, Carl Yang | N/A | N/A |
CVPR 2020
| Title | Author | PDF_Link | Code_URL |
|---|---|---|---|
| SDFusion: Multimodal 3D Shape Completion, Reconstruction, and Generation | Yen-Chi Cheng · Hsin-Ying Lee · Sergey Tulyakov · Alexander G. Schwing · Liang-Yan Gui | N/A | Code |
| Revisiting Temporal Modeling for CLIP-Based Image-to-Video Knowledge Transferring | Ruyang Liu · Jingjia Huang · Ge Li · Jiashi Feng · Xinglong Wu · Thomas H. Li | N/A | Code |
| Post-Processing Temporal Action Detection | Sauradip Nag · Xiatian Zhu · Yi-Zhe Song · Tao Xiang | N/A | Code |
| Learning Analytical Posterior Probability for Human Mesh Recovery | Qi Fang · Kang Chen · Yinghui Fan · Qing Shuai · Jiefeng Li · Weidong Zhang | N/A | Code |
| Accidental Light Probes | Hong-Xing Yu · Samir Agarwala · Charles Herrmann · Richard Szeliski · Noah Snavely · Jiajun Wu · Deqing Sun | N/A | Code |
| Multi-Object Manipulation via Object-Centric Neural Scattering Functions | Stephen Tian · Yancheng Cai · Hong-Xing Yu · Sergey Zakharov · Katherine Liu · Adrien Gaidon · Yunzhu Li · Jiajun Wu | N/A | Code |
| CFA: Class-Wise Calibrated Fair Adversarial Training | Zeming Wei · Yifei Wang · Yiwen Guo · Yisen Wang | N/A | Code |
| AutoAD: Movie Description in Context | Tengda Han · Max Bain · Arsha Nagrani · Gül Varol · Weidi Xie · Andrew Zisserman | N/A | Code |
| Relational Context Learning for Human-Object Interaction Detection | Sanghyun Kim · Deunsol Jung · Minsu Cho | N/A | Code |
| Alias-Free Convnets: Fractional Shift Invariance via Polynomial Activations | Hagay Michaeli · Tomer Michaeli · Daniel Soudry | N/A | Code |
| Learning Distortion Invariant Representation for Image Restoration From a Causality Perspective | Xin Li · Bingchen Li · Xin Jin · Cuiling Lan · Zhibo Chen | N/A | Code |
| Iterative Vision-and-Language Navigation | Jacob Krantz · Shurjo Banerjee · Wang Zhu · Jason Corso · Peter Anderson · Stefan Lee · Jesse Thomason | N/A | Code |
| FlatFormer: Flattened Window Attention for Efficient Point Cloud Transformer | Zhijian Liu · Xinyu Yang · Haotian Tang · Shang Yang · Song Han | N/A | Code |
| BUFFER: Balancing Accuracy, Efficiency, and Generalizability in Point Cloud Registration | Sheng Ao · Qingyong Hu · Hanyun Wang · Kai Xu · Yulan Guo | N/A | Code |
| Learning Event Guided High Dynamic Range Video Reconstruction | Yixin Yang · Jin Han · Jinxiu Liang · Imari Sato · Boxin Shi | N/A | Code |
| 3D Line Mapping Revisited | Shaohui Liu · Yifan Yu · Rémi Pautrat · Marc Pollefeys · Viktor Larsson | N/A | Code |
| High-Fidelity Event-Radiance Recovery via Transient Event Frequency | Jin Han · Yuta Asano · Boxin Shi · Yinqiang Zheng · Imari Sato | N/A | Code |
| OCELOT: Overlapped Cell on Tissue Dataset for Histopathology | Jeongun Ryu · Aaron Valero Puche · JaeWoong Shin · Seonwook Park · Biagio Brattoli · Jinhee Lee · Wonkyung Jung · Soo Ick Cho · Kyunghyun Paeng · Chan-Young Ock · Donggeun Yoo · Sérgio Pereira | N/A | Code |
| Blur Interpolation Transformer for Real-World Motion From Blur | Zhihang Zhong · Mingdeng Cao · Xiang Ji · Yinqiang Zheng · Imari Sato | N/A | Code |
| Continuous Intermediate Token Learning With Implicit Motion Manifold for Keyframe Based Motion Interpolation | Clinton A. Mo · Kun Hu · Chengjiang Long · Zhiyong Wang | N/A | Code |
| Instant-NVR: Instant Neural Volumetric Rendering for Human-Object Interactions From Monocular RGBD Stream | Yuheng Jiang · Kaixin Yao · Zhuo Su · Zhehao Shen · Haimin Luo · Lan Xu | N/A | Code |
| HexPlane: A Fast Representation for Dynamic Scenes | Ang Cao · Justin Johnson | N/A | Code |
| Finetune Like You Pretrain: Improved Finetuning of Zero-Shot Vision Models | Sachin Goyal · Ananya Kumar · Sankalp Garg · Zico Kolter · Aditi Raghunathan | N/A | Code |
| A Whac-a-Mole Dilemma: Shortcuts Come in Multiples Where Mitigating One Amplifies Others | Zhiheng Li · Ivan Evtimov · Albert Gordo · Caner Hazirbas · Tal Hassner · Cristian Canton Ferrer · Chenliang Xu · Mark Ibrahim | N/A | Code |
| GIVL: Improving Geographical Inclusivity of Vision-Language Models With Pre-Training Methods | Da Yin · Feng Gao · Govind Thattai · Michael Johnston · Kai-Wei Chang | N/A | Code |
| Devil’s on the Edges: Selective Quad Attention for Scene Graph Generation | Deunsol Jung · Sanghyun Kim · Won Hwa Kim · Minsu Cho | N/A | Code |
| GeoMVSNet: Learning Multi-View Stereo With Geometry Perception | Zhe Zhang · Rui Peng · Yuxi Hu · Ronggang Wang | N/A | Code |
| CR-FIQA: Face Image Quality Assessment by Learning Sample Relative Classifiability | Fadi Boutros · Meiling Fang · Marcel Klemt · Biying Fu · Naser Damer | N/A | Code |
| NeuFace: Realistic 3D Neural Face Rendering From Multi-View Images | Mingwu Zheng · Haiyu Zhang · Hongyu Yang · Di Huang | N/A | Code |
| MethaneMapper: Spectral Absorption Aware Hyperspectral Transformer for Methane Detection | Satish Kumar · Ivan Arevalo · ASM Iftekhar · B S Manjunath | N/A | Code |
| Re-Thinking Model Inversion Attacks Against Deep Neural Networks | Ngoc-Bao Nguyen · Keshigeyan Chandrasegaran · Milad Abdollahzadeh · Ngai-Man Cheung | N/A | Code |
| SAP-DETR: Bridging the Gap Between Salient Points and Queries-Based Transformer Detector for Fast Model Convergency | Yang Liu · Yao Zhang · Yixin Wang · Yang Zhang · Jiang Tian · Zhongchao Shi · Jianping Fan · Zhiqiang He | N/A | Code |
| VectorFloorSeg: Two-Stream Graph Attention Network for Vectorized Roughcast Floorplan Segmentation | Bingchen Yang · Haiyong Jiang · Hao Pan · Jun Xiao | N/A | Code |
| MARLIN: Masked Autoencoder for Facial Video Representation LearnINg | Zhixi Cai · Shreya Ghosh · Kalin Stefanov · Abhinav Dhall · Jianfei Cai · Hamid Rezatofighi · Reza Haffari · Munawar Hayat | N/A | Code |
| KD-DLGAN: Data Limited Image Generation via Knowledge Distillation | Kaiwen Cui · Yingchen Yu · Fangneng Zhan · Shengcai Liao · Shijian Lu · Eric P. Xing | N/A | Code |
| Hierarchical Neural Memory Network for Low Latency Event Processing | Ryuhei Hamaguchi · Yasutaka Furukawa · Masaki Onishi · Ken Sakurada | N/A | Code |
| Optimal Transport Minimization: Crowd Localization on Density Maps for Semi-Supervised Counting | Wei Lin · Antoni B. Chan | N/A | Code |
| Towards All-in-One Pre-Training via Maximizing Multi-Modal Mutual Information | Weijie Su · Xizhou Zhu · Chenxin Tao · Lewei Lu · Bin Li · Gao Huang · Yu Qiao · Xiaogang Wang · Jie Zhou · Jifeng Dai | N/A | Code |
| Revisiting Reverse Distillation for Anomaly Detection | Tran Dinh Tien · Anh Tuan Nguyen · Nguyen Hoang Tran · Ta Duc Huy · Soan T.M. Duong · Chanh D. Tr. Nguyen · Steven Q. H. Truong | N/A | Code |
| Conditional Generation of Audio From Video via Foley Analogies | Yuexi Du · Ziyang Chen · Justin Salamon · Bryan Russell · Andrew Owens | N/A | Code |
| Parameter Efficient Local Implicit Image Function Network for Face Segmentation | Mausoom Sarkar · Nikitha SR · Mayur Hemani · Rishabh Jain · Balaji Krishnamurthy | N/A | Code |
| Learning Decorrelated Representations Efficiently Using Fast Fourier Transform | Yutaro Shigeto · Masashi Shimbo · Yuya Yoshikawa · Akikazu Takeuchi | N/A | Code |
| FaceLit: Neural 3D Relightable Faces | Anurag Ranjan · Kwang Moo Yi · Jen-Hao Rick Chang · Oncel Tuzel | N/A | Code |
| Pointersect: Neural Rendering With Cloud-Ray Intersection | Jen-Hao Rick Chang · Wei-Yu Chen · Anurag Ranjan · Kwang Moo Yi · Oncel Tuzel | N/A | Code |
| High-Fidelity Clothed Avatar Reconstruction From a Single Image | Tingting Liao · Xiaomei Zhang · Yuliang Xiu · Hongwei Yi · Xudong Liu · Guo-Jun Qi · Yong Zhang · Xuan Wang · Xiangyu Zhu · Zhen Lei | N/A | Code |
| BAD-NeRF: Bundle Adjusted Deblur Neural Radiance Fields | Peng Wang · Lingzhe Zhao · Ruijie Ma · Peidong Liu | N/A | Code |
| Meta-Tuning Loss Functions and Data Augmentation for Few-Shot Object Detection | Berkan Demirel · Orhun Buğra Baran · Ramazan Gokberk Cinbis | N/A | Code |
| StyleRF: Zero-Shot 3D Style Transfer of Neural Radiance Fields | Kunhao Liu · Fangneng Zhan · Yiwen Chen · Jiahui Zhang · Yingchen Yu · Abdulmotaleb El Saddik · Shijian Lu · Eric P. Xing | N/A | Code |
| DeepSolo: Let Transformer Decoder With Explicit Points Solo for Text Spotting | Maoyuan Ye · Jing Zhang · Shanshan Zhao · Juhua Liu · Tongliang Liu · Bo Du · Dacheng Tao | N/A | Code |
| Local Implicit Normalizing Flow for Arbitrary-Scale Image Super-Resolution | Jie-En Yao · Li-Yuan Tsao · Yi-Chen Lo · Roy Tseng · Chia-Che Chang · Chun-Yi Lee | N/A | Code |
| LAVENDER: Unifying Video-Language Understanding As Masked Language Modeling | Linjie Li · Zhe Gan · Kevin Lin · Chung-Ching Lin · Zicheng Liu · Ce Liu · Lijuan Wang | N/A | Code |
| Cascaded Local Implicit Transformer for Arbitrary-Scale Super-Resolution | Hao-Wei Chen · Yu-Syuan Xu · Min-Fong Hong · Yi-Min Tsai · Hsien-Kai Kuo · Chun-Yi Lee | N/A | Code |
| Fair Federated Medical Image Segmentation via Client Contribution Estimation | Meirui Jiang · Holger R. Roth · Wenqi Li · Dong Yang · Can Zhao · Vishwesh Nath · Daguang Xu · Qi Dou · Ziyue Xu | N/A | Code |
| An Empirical Study of End-to-End Video-Language Transformers With Masked Visual Modeling | Tsu-Jui Fu · Linjie Li · Zhe Gan · Kevin Lin · William Yang Wang · Lijuan Wang · Zicheng Liu | N/A | Code |
| ReCo: Region-Controlled Text-to-Image Generation | Zhengyuan Yang · Jianfeng Wang · Zhe Gan · Linjie Li · Kevin Lin · Chenfei Wu · Nan Duan · Zicheng Liu · Ce Liu · Michael Zeng · Lijuan Wang | N/A | Code |
| Uncertainty-Aware Vision-Based Metric Cross-View Geolocalization | Florian Fervers · Sebastian Bullinger · Christoph Bodensteiner · Michael Arens · Rainer Stiefelhagen | N/A | Code |
| LayoutDiffusion: Controllable Diffusion Model for Layout-to-Image Generation | Guangcong Zheng · Xianpan Zhou · Xuewei Li · Zhongang Qi · Ying Shan · Xi Li | N/A | Code |
| Efficient Loss Function by Minimizing the Detrimental Effect of Floating-Point Errors on Gradient-Based Attacks | Yunrui Yu · Cheng-Zhong Xu | N/A | Code |
| NIKI: Neural Inverse Kinematics With Invertible Neural Networks for 3D Human Pose and Shape Estimation | Jiefeng Li · Siyuan Bian · Qi Liu · Jiasheng Tang · Fan Wang · Cewu Lu | N/A | Code |
| 3D Spatial Multimodal Knowledge Accumulation for Scene Graph Prediction in Point Cloud | Mingtao Feng · Haoran Hou · Liang Zhang · Zijie Wu · Yulan Guo · Ajmal Mian | N/A | Code |
| Egocentric Auditory Attention Localization in Conversations | Fiona Ryan · Hao Jiang · Abhinav Shukla · James M. Rehg · Vamsi Krishna Ithapu | N/A | Code |
| EFEM: Equivariant Neural Field Expectation Maximization for 3D Object Segmentation Without Scene Supervision | Jiahui Lei · Congyue Deng · Karl Schmeckpeper · Leonidas Guibas · Kostas Daniilidis | N/A | Code |
| Divide and Conquer: Answering Questions With Object Factorization and Compositional Reasoning | Shi Chen · Qi Zhao | N/A | Code |
| Text-Visual Prompting for Efficient 2D Temporal Video Grounding | Yimeng Zhang · Xin Chen · Jinghan Jia · Sijia Liu · Ke Ding | N/A | Code |
| Fusing Pre-Trained Language Models With Multimodal Prompts Through Reinforcement Learning | Youngjae Yu · Jiwan Chung · Heeseung Yun · Jack Hessel · Jae Sung Park · Ximing Lu · Rowan Zellers · Prithviraj Ammanabrolu · Ronan Le Bras · Gunhee Kim · Yejin Choi | N/A | Code |
| UniHCP: A Unified Model for Human-Centric Perceptions | Yuanzheng Ci · Yizhou Wang · Meilin Chen · Shixiang Tang · Lei Bai · Feng Zhu · Rui Zhao · Fengwei Yu · Donglian Qi · Wanli Ouyang | N/A | Code |
| VoP: Text-Video Co-Operative Prompt Tuning for Cross-Modal Retrieval | Siteng Huang · Biao Gong · Yulin Pan · Jianwen Jiang · Yiliang Lv · Yuyuan Li · Donglin Wang | N/A | Code |
| PointConvFormer: Revenge of the Point-Based Convolution | Wenxuan Wu · Li Fuxin · Qi Shan | N/A | Code |
| BAAM: Monocular 3D Pose and Shape Reconstruction With Bi-Contextual Attention Module and Attention-Guided Modeling | Hyo-Jun Lee · Hanul Kim · Su-Min Choi · Seong-Gyun Jeong · Yeong Jun Koh | N/A | Code |
| HumanBench: Towards General Human-Centric Perception With Projector Assisted Pretraining | Shixiang Tang · Cheng Chen · Qingsong Xie · Meilin Chen · Yizhou Wang · Yuanzheng Ci · Lei Bai · Feng Zhu · Haiyang Yang · Li Yi · Rui Zhao · Wanli Ouyang | N/A | Code |
| Local Connectivity-Based Density Estimation for Face Clustering | Junho Shin · Hyo-Jun Lee · Hyunseop Kim · Jong-Hyeon Baek · Daehyun Kim · Yeong Jun Koh | N/A | Code |
| DistilPose: Tokenized Pose Regression With Heatmap Distillation | Suhang Ye · Yingyi Zhang · Jie Hu · Liujuan Cao · Shengchuan Zhang · Lei Shen · Jun Wang · Shouhong Ding · Rongrong Ji | N/A | Code |
| Beyond Appearance: A Semantic Controllable Self-Supervised Learning Framework for Human-Centric Visual Tasks | Weihua Chen · Xianzhe Xu · Jian Jia · Hao Luo · Yaohua Wang · Fan Wang · Rong Jin · Xiuyu Sun | N/A | Code |
| ViPLO: Vision Transformer Based Pose-Conditioned Self-Loop Graph for Human-Object Interaction Detection | Jeeseung Park · Jin-Woo Park · Jong-Seok Lee | N/A | Code |
| EVA: Exploring the Limits of Masked Visual Representation Learning at Scale | Yuxin Fang · Wen Wang · Binhui Xie · Quan Sun · Ledell Wu · Xinggang Wang · Tiejun Huang · Xinlong Wang · Yue Cao | N/A | Code |
| I2-SDF: Intrinsic Indoor Scene Reconstruction and Editing via Raytracing in Neural SDFs | Jingsen Zhu · Yuchi Huo · Qi Ye · Fujun Luan · Jifan Li · Dianbing Xi · Lisha Wang · Rui Tang · Wei Hua · Hujun Bao · Rui Wang | N/A | Code |
| DrapeNet: Garment Generation and Self-Supervised Draping | Luca De Luigi · Ren Li · Benoît Guillard · Mathieu Salzmann · Pascal Fua | N/A | Code |
| STMixer: A One-Stage Sparse Action Detector | Tao Wu · Mengqi Cao · Ziteng Gao · Gangshan Wu · Limin Wang | N/A | Code |
| Inverse Rendering of Translucent Objects Using Physical and Neural Renderers | Chenhao Li · Trung Thanh Ngo · Hajime Nagahara | N/A | Code |
| Humans As Light Bulbs: 3D Human Reconstruction From Thermal Reflection | Ruoshi Liu · Carl Vondrick | N/A | Code |
| CF-Font: Content Fusion for Few-Shot Font Generation | Chi Wang · Min Zhou · Tiezheng Ge · Yuning Jiang · Hujun Bao · Weiwei Xu | N/A | Code |
| GLeaD: Improving GANs With a Generator-Leading Task | Qingyan Bai · Ceyuan Yang · Yinghao Xu · Xihui Liu · Yujiu Yang · Yujun Shen | N/A | Code |
| StarCraftImage: A Dataset for Prototyping Spatial Reasoning Methods for Multi-Agent Environments | Sean Kulinski · Nicholas R. Waytowich · James Z. Hare · David I. Inouye | N/A | Code |
| WIRE: Wavelet Implicit Neural Representations | Vishwanath Saragadam · Daniel LeJeune · Jasper Tan · Guha Balakrishnan · Ashok Veeraraghavan · Richard G. Baraniuk | N/A | Code |
| Thermal Spread Functions (TSF): Physics-Guided Material Classification | Aniket Dashpute · Vishwanath Saragadam · Emma Alexander · Florian Willomitzer · Aggelos Katsaggelos · Ashok Veeraraghavan · Oliver Cossairt | N/A | Code |
| Improving Zero-Shot Generalization and Robustness of Multi-Modal Models | Yunhao Ge · Jie Ren · Andrew Gallagher · Yuxiao Wang · Ming-Hsuan Yang · Hartwig Adam · Laurent Itti · Balaji Lakshminarayanan · Jiaping Zhao | N/A | Code |
| The Differentiable Lens: Compound Lens Search Over Glass Surfaces and Materials for Object Detection | Geoffroi Côté · Fahim Mannan · Simon Thibault · Jean-François Lalonde · Felix Heide | N/A | Code |
| Federated Domain Generalization With Generalization Adjustment | Ruipeng Zhang · Qinwei Xu · Jiangchao Yao · Ya Zhang · Qi Tian · Yanfeng Wang | N/A | Code |
| Propagate and Calibrate: Real-Time Passive Non-Line-of-Sight Tracking | Yihao Wang · Zhigang Wang · Bin Zhao · Dong Wang · Mulin Chen · Xuelong Li | N/A | Code |
| Fine-Grained Image-Text Matching by Cross-Modal Hard Aligning Network | Zhengxin Pan · Fangyu Wu · Bailing Zhang | N/A | Code |
| On the Benefits of 3D Pose and Tracking for Human Action Recognition | Jathushan Rajasegaran · Georgios Pavlakos · Angjoo Kanazawa · Christoph Feichtenhofer · Jitendra Malik | N/A | Code |
| Visual DNA: Representing and Comparing Images Using Distributions of Neuron Activations | Benjamin Ramtoula · Matthew Gadd · Paul Newman · Daniele De Martini | N/A | Code |
| Fine-Tuned CLIP Models Are Efficient Video Learners | Hanoona Rasheed · Muhammad Uzair Khattak · Muhammad Maaz · Salman Khan · Fahad Shahbaz Khan | N/A | Code |
| Connecting Vision and Language With Video Localized Narratives | Paul Voigtlaender · Soravit Changpinyo · Jordi Pont-Tuset · Radu Soricut · Vittorio Ferrari | N/A | Code |
| K-Planes: Explicit Radiance Fields in Space, Time, and Appearance | Sara Fridovich-Keil · Giacomo Meanti · Frederik Rahbæk Warburg · Benjamin Recht · Angjoo Kanazawa | N/A | Code |
| Virtual Occlusions Through Implicit Depth | Jamie Watson · Mohamed Sayed · Zawar Qureshi · Gabriel J. Brostow · Sara Vicente · Oisin Mac Aodha · Michael Firman | N/A | Code |
| Common Pets in 3D: Dynamic New-View Synthesis of Real-Life Deformable Categories | Samarth Sinha · Roman Shapovalov · Jeremy Reizenstein · Ignacio Rocco · Natalia Neverova · Andrea Vedaldi · David Novotny | N/A | Code |
| LG-BPN: Local and Global Blind-Patch Network for Self-Supervised Real-World Denoising | Zichun Wang · Ying Fu · Ji Liu · Yulun Zhang | N/A | Code |
| One-Shot High-Fidelity Talking-Head Synthesis With Deformable Neural Radiance Field | Weichuang Li · Longhao Zhang · Dong Wang · Bin Zhao · Zhigang Wang · Mulin Chen · Bang Zhang · Zhongjian Wang · Liefeng Bo · Xuelong Li | N/A | Code |
| Collaborative Diffusion for Multi-Modal Face Generation and Editing | Ziqi Huang · Kelvin C.K. Chan · Yuming Jiang · Ziwei Liu | N/A | Code |
| Blind Video Deflickering by Neural Filtering With a Flawed Atlas | Chenyang Lei · Xuanchi Ren · Zhaoxiang Zhang · Qifeng Chen | N/A | Code |
| RefTeacher: A Strong Baseline for Semi-Supervised Referring Expression Comprehension | Jiamu Sun · Gen Luo · Yiyi Zhou · Xiaoshuai Sun · Guannan Jiang · Zhiyu Wang · Rongrong Ji | N/A | Code |
| HNeRV: A Hybrid Neural Representation for Videos | Hao Chen · Matthew Gwilliam · Ser-Nam Lim · Abhinav Shrivastava | N/A | Code |
| Learning 3D-Aware Image Synthesis With Unknown Pose Distribution | Zifan Shi · Yujun Shen · Yinghao Xu · Sida Peng · Yiyi Liao · Sheng Guo · Qifeng Chen · Dit-Yan Yeung | N/A | Code |
| DynaFed: Tackling Client Data Heterogeneity With Global Dynamics | Renjie Pi · Weizhong Zhang · Yueqi Xie · Jiahui Gao · Xiaoyu Wang · Sunghun Kim · Qifeng Chen | N/A | Code |
| Enlarging Instance-Specific and Class-Specific Information for Open-Set Action Recognition | Jun Cen · Shiwei Zhang · Xiang Wang · Yixuan Pei · Zhiwu Qing · Yingya Zhang · Qifeng Chen | N/A | Code |
| RODIN: A Generative Model for Sculpting 3D Digital Avatars Using Diffusion | Tengfei Wang · Bo Zhang · Ting Zhang · Shuyang Gu · Jianmin Bao · Tadas Baltrusaitis · Jingjing Shen · Dong Chen · Fang Wen · Qifeng Chen · Baining Guo | N/A | Code |
| IFSeg: Image-Free Semantic Segmentation via Vision-Language Model | Sukmin Yun · Seong Hyeon Park · Paul Hongsuck Seo · Jinwoo Shin | N/A | Code |
| Detecting Everything in the Open World: Towards Universal Object Detection | Zhenyu Wang · Yali Li · Xi Chen · Ser-Nam Lim · Antonio Torralba · Hengshuang Zhao · Shengjin Wang | N/A | Code |
| Improving Visual Grounding by Encouraging Consistent Gradient-Based Explanations | Ziyan Yang · Kushal Kafle · Franck Dernoncourt · Vicente Ordonez | N/A | Code |
| Temporally Consistent Online Depth Estimation Using Point-Based Fusion | Numair Khan · Eric Penner · Douglas Lanman · Lei Xiao | N/A | Code |
| NeuralDome: A Neural Modeling Pipeline on Multi-View Human-Object Interactions | Juze Zhang · Haimin Luo · Hongdi Yang · Xinru Xu · Qianyang Wu · Ye Shi · Jingyi Yu · Lan Xu · Jingya Wang | N/A | Code |
| Token Turing Machines | Michael S. Ryoo · Keerthana Gopalakrishnan · Kumara Kahatapitiya · Ted Xiao · Kanishka Rao · Austin Stone · Yao Lu · Julian Ibarz · Anurag Arnab | N/A | Code |
| Computationally Budgeted Continual Learning: What Does Matter? | Ameya Prabhu · Hasan Abed Al Kader Hammoud · Puneet K. Dokania · Philip H.S. Torr · Ser-Nam Lim · Bernard Ghanem · Adel Bibi | N/A | Code |
| CLIP2Protect: Protecting Facial Privacy Using Text-Guided Makeup via Adversarial Latent Search | Fahad Shamshad · Muzammal Naseer · Karthik Nandakumar | N/A | Code |
| Robot Structure Prior Guided Temporal Attention for Camera-to-Robot Pose Estimation From Image Sequence | Yang Tian · Jiyao Zhang · Zekai Yin · Hao Dong | N/A | Code |
| Affordances From Human Videos as a Versatile Representation for Robotics | Shikhar Bahl · Russell Mendonca · Lili Chen · Unnat Jain · Deepak Pathak | N/A | Code |
| MIANet: Aggregating Unbiased Instance and General Information for Few-Shot Semantic Segmentation | Yong Yang · Qiong Chen · Yuan Feng · Tianlin Huang | N/A | Code |
| Learning To Generate Image Embeddings With User-Level Differential Privacy | Zheng Xu · Maxwell Collins · Yuxiao Wang · Liviu Panait · Sewoong Oh · Sean Augenstein · Ting Liu · Florian Schroff · H. Brendan McMahan | N/A | Code |
| Genie: Show Me the Data for Quantization | Yongkweon Jeon · Chungman Lee · Ho-young Kim | N/A | Code |
| DSVT: Dynamic Sparse Voxel Transformer With Rotated Sets | Haiyang Wang · Chen Shi · Shaoshuai Shi · Meng Lei · Sen Wang · Di He · Bernt Schiele · Liwei Wang | N/A | Code |
| Transformer-Based Learned Optimization | Erik Gärtner · Luke Metz · Mykhaylo Andriluka · C. Daniel Freeman · Cristian Sminchisescu | N/A | Code |
| Zero-Shot Noise2Noise: Efficient Image Denoising Without Any Data | Youssef Mansour · Reinhard Heckel | N/A | Code |
| Super-Resolution Neural Operator | Min Wei · Xuesong Zhang | N/A | Code |
| StyleIPSB: Identity-Preserving Semantic Basis of StyleGAN for High Fidelity Face Swapping | Diqiong Jiang · Dan Song · Ruofeng Tong · Min Tang | N/A | Code |
| Self-Supervised Blind Motion Deblurring With Deep Expectation Maximization | Ji Li · Weixi Wang · Yuesong Nan · Hui Ji | N/A | Code |
| Confidence-Aware Personalized Federated Learning via Variational Expectation Maximization | Junyi Zhu · Xingchen Ma · Matthew B. Blaschko | N/A | Code |
| Human Pose As Compositional Tokens | Zigang Geng · Chunyu Wang · Yixuan Wei · Ze Liu · Houqiang Li · Han Hu | N/A | Code |
| GeoMAE: Masked Geometric Target Prediction for Self-Supervised Point Cloud Pre-Training | Xiaoyu Tian · Haoxi Ran · Yue Wang · Hang Zhao | N/A | Code |
| RUST: Latent Neural Scene Representations From Unposed Imagery | Mehdi S. M. Sajjadi · Aravindh Mahendran · Thomas Kipf · Etienne Pot · Daniel Duckworth · Mario Lučić · Klaus Greff | N/A | Code |
| Bias Mimicking: A Simple Sampling Approach for Bias Mitigation | Maan Qraitem · Kate Saenko · Bryan A. Plummer | N/A | Code |
| V2X-Seq: A Large-Scale Sequential Dataset for Vehicle-Infrastructure Cooperative Perception and Forecasting | Haibao Yu · Wenxian Yang · Hongzhi Ruan · Zhenwei Yang · Yingjuan Tang · Xu Gao · Xin Hao · Yifeng Shi · Yifeng Pan · Ning Sun · Juan Song · Jirui Yuan · Ping Luo · Zaiqing Nie | N/A | Code |
| Conditional Image-to-Video Generation With Latent Flow Diffusion Models | Haomiao Ni · Changhao Shi · Kai Li · Sharon X. Huang · Martin Renqiang Min | N/A | Code |
| Anchor3DLane: Learning To Regress 3D Anchors for Monocular 3D Lane Detection | Shaofei Huang · Zhenwei Shen · Zehao Huang · Zi-han Ding · Jiao Dai · Jizhong Han · Naiyan Wang · Si Liu | N/A | Code |
| 3D Semantic Segmentation in the Wild: Learning Generalized Models for Adverse-Condition Point Clouds | Aoran Xiao · Jiaxing Huang · Weihao Xuan · Ruijie Ren · Kangcheng Liu · Dayan Guan · Abdulmotaleb El Saddik · Shijian Lu · Eric P. Xing | N/A | Code |
| NeMo: Learning 3D Neural Motion Fields From Multiple Video Instances of the Same Action | Kuan-Chieh Wang · Zhenzhen Weng · Maria Xenochristou · João Pedro Araújo · Jeffrey Gu · Karen Liu · Serena Yeung | N/A | Code |
| Decomposed Soft Prompt Guided Fusion Enhancing for Compositional Zero-Shot Learning | Xiaocheng Lu · Song Guo · Ziming Liu · Jingcai Guo | N/A | Code |
| iDisc: Internal Discretization for Monocular Depth Estimation | Luigi Piccinelli · Christos Sakaridis · Fisher Yu | N/A | Code |
| UniDexGrasp: Universal Robotic Dexterous Grasping via Learning Diverse Proposal Generation and Goal-Conditioned Policy | Yinzhen Xu · Weikang Wan · Jialiang Zhang · Haoran Liu · Zikang Shan · Hao Shen · Ruicheng Wang · Haoran Geng · Yijia Weng · Jiayi Chen · Tengyu Liu · Li Yi · He Wang | N/A | Code |
| PolyFormer: Referring Image Segmentation As Sequential Polygon Generation | Jiang Liu · Hui Ding · Zhaowei Cai · Yuting Zhang · Ravi Kumar Satzoda · Vijay Mahadevan · R. Manmatha | N/A | Code |
| Interactive Segmentation of Radiance Fields | Rahul Goel · Dhawal Sirikonda · Saurabh Saini · P. J. Narayanan | N/A | Code |
| PointCert: Point Cloud Classification With Deterministic Certified Robustness Guarantees | Jinghuai Zhang · Jinyuan Jia · Hongbin Liu · Neil Zhenqiang Gong | N/A | Code |
| Indiscernible Object Counting in Underwater Scenes | Guolei Sun · Zhaochong An · Yun Liu · Ce Liu · Christos Sakaridis · Deng-Ping Fan · Luc Van Gool | N/A | Code |
| Improving Robustness of Vision Transformers by Reducing Sensitivity To Patch Corruptions | Yong Guo · David Stutz · Bernt Schiele | N/A | Code |
| Real-Time Multi-Person Eyeblink Detection in the Wild for Untrimmed Video | Wenzheng Zeng · Yang Xiao · Sicheng Wei · Jinfang Gan · Xintao Zhang · Zhiguo Cao · Zhiwen Fang · Joey Tianyi Zhou | N/A | Code |
| BEV-LaneDet: An Efficient 3D Lane Detection Based on Virtual Camera via Key-Points | Ruihao Wang · Jian Qin · Kaiying Li · Yaochen Li · Dong Cao · Jintao Xu | N/A | Code |
| Infinite Photorealistic Worlds Using Procedural Generation | Alexander Raistrick · Lahav Lipson · Zeyu Ma · Lingjie Mei · Mingzhe Wang · Yiming Zuo · Karhan Kayan · Hongyu Wen · Beining Han · Yihan Wang · Alejandro Newell · Hei Law · Ankit Goyal · Kaiyu Yang · Jia Deng | N/A | Code |
| High-Fidelity 3D Human Digitization From Single 2K Resolution Images | Sang-Hun Han · Min-Gyu Park · Ju Hong Yoon · Ju-Mi Kang · Young-Jae Park · Hae-Gon Jeon | N/A | Code |
| GALIP: Generative Adversarial CLIPs for Text-to-Image Synthesis | Ming Tao · Bing-Kun Bao · Hao Tang · Changsheng Xu | N/A | Code |
| Language-Guided Audio-Visual Source Separation via Trimodal Consistency | Reuben Tan · Arijit Ray · Andrea Burns · Bryan A. Plummer · Justin Salamon · Oriol Nieto · Bryan Russell · Kate Saenko | N/A | Code |
| Probabilistic Debiasing of Scene Graphs | Bashirul Azam Biswas · Qiang Ji | N/A | Code |
| PVO: Panoptic Visual Odometry | Weicai Ye · Xinyue Lan · Shuo Chen · Yuhang Ming · Xingyuan Yu · Hujun Bao · Zhaopeng Cui · Guofeng Zhang | N/A | Code |
| Superclass Learning With Representation Enhancement | Zeyu Gan · Suyun Zhao · Jinlong Kang · Liyuan Shang · Hong Chen · Cuiping Li | N/A | Code |
| GAPartNet: Cross-Category Domain-Generalizable Object Perception and Manipulation via Generalizable and Actionable Parts | Haoran Geng · Helin Xu · Chengyang Zhao · Chao Xu · Li Yi · Siyuan Huang · He Wang | N/A | Code |
| Learning the Distribution of Errors in Stereo Matching for Joint Disparity and Uncertainty Estimation | Liyan Chen · Weihan Wang · Philippos Mordohai | N/A | Code |
| Efficient View Synthesis and 3D-Based Multi-Frame Denoising With Multiplane Feature Representations | Thomas Tanay · Aleš Leonardis · Matteo Maggioni | N/A | Code |
| Large-Capacity and Flexible Video Steganography via Invertible Neural Network | Chong Mou · Youmin Xu · Jiechong Song · Chen Zhao · Bernard Ghanem · Jian Zhang | N/A | Code |
| Generating Part-Aware Editable 3D Shapes Without 3D Supervision | Konstantinos Tertikas · Despoina Paschalidou · Boxiao Pan · Jeong Joon Park · Mikaela Angelina Uy · Ioannis Emiris · Yannis Avrithis · Leonidas Guibas | N/A | Code |
| Vision Transformer With Super Token Sampling | Huaibo Huang · Xiaoqiang Zhou · Jie Cao · Ran He · Tieniu Tan | N/A | Code |
| Renderable Neural Radiance Map for Visual Navigation | Obin Kwon · Jeongho Park · Songhwai Oh | N/A | Code |
| Learning Compact Representations for LiDAR Completion and Generation | Yuwen Xiong · Wei-Chiu Ma · Jingkang Wang · Raquel Urtasun | N/A | Code |
| CoMFormer: Continual Learning in Semantic and Panoptic Segmentation | Fabio Cermelli · Matthieu Cord · Arthur Douillard | N/A | Code |
| A Bag-of-Prototypes Representation for Dataset-Level Applications | Weijie Tu · Weijian Deng · Tom Gedeon · Liang Zheng | N/A | Code |
| Geometric Visual Similarity Learning in 3D Medical Image Self-Supervised Pre-Training | Yuting He · Guanyu Yang · Rongjun Ge · Yang Chen · Jean-Louis Coatrieux · Boyu Wang · Shuo Li | N/A | Code |
| Weakly Supervised Video Emotion Detection and Prediction via Cross-Modal Temporal Erasing Network | Zhicheng Zhang · Lijuan Wang · Jufeng Yang | N/A | Code |
| CODA-Prompt: COntinual Decomposed Attention-Based Prompting for Rehearsal-Free Continual Learning | James Seale Smith · Leonid Karlinsky · Vyshnavi Gutta · Paola Cascante-Bonilla · Donghyun Kim · Assaf Arbelle · Rameswar Panda · Rogerio Feris · Zsolt Kira | N/A | Code |
| CodeTalker: Speech-Driven 3D Facial Animation With Discrete Motion Prior | Jinbo Xing · Menghan Xia · Yuechen Zhang · Xiaodong Cun · Jue Wang · Tien-Tsin Wong | N/A | Code |
| VolRecon: Volume Rendering of Signed Ray Distance Functions for Generalizable Multi-View Reconstruction | Yufan Ren · Fangjinhua Wang · Tong Zhang · Marc Pollefeys · Sabine Süsstrunk | N/A | Code |
| NewsNet: A Novel Dataset for Hierarchical Temporal Segmentation | Haoqian Wu · Keyu Chen · Haozhe Liu · Mingchen Zhuge · Bing Li · Ruizhi Qiao · Xiujun Shu · Bei Gan · Liangsheng Xu · Bo Ren · Mengmeng Xu · Wentian Zhang · Raghavendra Ramachandra · Chia-Wen Lin · Bernard Ghanem | N/A | Code |
| Ref-NPR: Reference-Based Non-Photorealistic Radiance Fields for Controllable Scene Stylization | Yuechen Zhang · Zexin He · Jinbo Xing · Xufeng Yao · Jiaya Jia | N/A | Code |
| GANmouflage: 3D Object Nondetection With Texture Fields | Rui Guo · Jasmine Collins · Oscar de Lima · Andrew Owens | N/A | Code |
| GP-VTON: Towards General Purpose Virtual Try-On via Collaborative Local-Flow Global-Parsing Learning | Zhenyu Xie · Zaiyu Huang · Xin Dong · Fuwei Zhao · Haoye Dong · Xijin Zhang · Feida Zhu · Xiaodan Liang | N/A | Code |
| DeSTSeg: Segmentation Guided Denoising Student-Teacher for Anomaly Detection | Xuan Zhang · Shiyu Li · Xi Li · Ping Huang · Jiulong Shan · Ting Chen | N/A | Code |
| Pix2map: Cross-Modal Retrieval for Inferring Street Maps From Images | Xindi Wu · KwunFung Lau · Francesco Ferroni · Aljoša Ošep · Deva Ramanan | N/A | Code |
| Beyond mAP: Towards Better Evaluation of Instance Segmentation | Rohit Jena · Lukas Zhornyak · Nehal Doiphode · Pratik Chaudhari · Vivek Buch · James Gee · Jianbo Shi | N/A | Code |
| Federated Learning With Data-Agnostic Distribution Fusion | Jian-hui Duan · Wenzhong Li · Derun Zou · Ruichen Li · Sanglu Lu | N/A | Code |
| Make-a-Story: Visual Memory Conditioned Consistent Story Generation | Tanzila Rahman · Hsin-Ying Lee · Jian Ren · Sergey Tulyakov · Shweta Mahajan · Leonid Sigal | N/A | Code |
| Scalable, Detailed and Mask-Free Universal Photometric Stereo | Satoshi Ikehata | N/A | Code |
| ToThePoint: Efficient Contrastive Learning of 3D Point Clouds via Recycling | Xinglin Li · Jiajing Chen · Jinhui Ouyang · Hanhui Deng · Senem Velipasalar · Di Wu | N/A | Code |
| Local-to-Global Registration for Bundle-Adjusting Neural Radiance Fields | Yue Chen · Xingyu Chen · Xuan Wang · Qi Zhang · Yu Guo · Ying Shan · Fei Wang | N/A | Code |
| UV Volumes for Real-Time Rendering of Editable Free-View Human Performance | Yue Chen · Xuan Wang · Xingyu Chen · Qi Zhang · Xiaoyu Li · Yu Guo · Jue Wang · Fei Wang | N/A | Code |
| SplineCam: Exact Visualization and Characterization of Deep Network Geometry and Decision Boundaries | Ahmed Imtiaz Humayun · Randall Balestriero · Guha Balakrishnan · Richard G. Baraniuk | N/A | Code |
| Hi-LASSIE: High-Fidelity Articulated Shape and Skeleton Discovery From Sparse Image Ensemble | Chun-Han Yao · Wei-Chih Hung · Yuanzhen Li · Michael Rubinstein · Ming-Hsuan Yang · Varun Jampani | N/A | Code |
| VisFusion: Visibility-Aware Online 3D Scene Reconstruction From Videos | Huiyu Gao · Wei Mao · Miaomiao Liu | N/A | Code |
| Unsupervised Volumetric Animation | Aliaksandr Siarohin · Willi Menapace · Ivan Skorokhodov · Kyle Olszewski · Jian Ren · Hsin-Ying Lee · Menglei Chai · Sergey Tulyakov | N/A | Code |
| DKM: Dense Kernelized Feature Matching for Geometry Estimation | Johan Edstedt · Ioannis Athanasiadis · Mårten Wadenbäck · Michael Felsberg | N/A | Code |
| All in One: Exploring Unified Video-Language Pre-Training | Jinpeng Wang · Yixiao Ge · Rui Yan · Yuying Ge · Kevin Qinghong Lin · Satoshi Tsutsui · Xudong Lin · Guanyu Cai · Jianping Wu · Ying Shan · Xiaohu Qie · Mike Zheng Shou | N/A | Code |
| Spatiotemporal Self-Supervised Learning for Point Clouds in the Wild | Yanhao Wu · Tong Zhang · Wei Ke · Sabine Süsstrunk · Mathieu Salzmann | N/A | Code |
| DynIBaR: Neural Dynamic Image-Based Rendering | Zhengqi Li · Qianqian Wang · Forrester Cole · Richard Tucker · Noah Snavely | N/A | Code |
| Seeing Through the Glass: Neural 3D Reconstruction of Object Inside a Transparent Container | Jinguang Tong · Sundaram Muthu · Fahira Afzal Maken · Chuong Nguyen · Hongdong Li | N/A | Code |
| JAWS: Just a Wild Shot for Cinematic Transfer in Neural Radiance Fields | Xi Wang · Robin Courant · Jinglei Shi · Eric Marchand · Marc Christie | N/A | Code |
| CCuantuMM: Cycle-Consistent Quantum-Hybrid Matching of Multiple Shapes | Harshil Bhatia · Edith Tretschk · Zorah Lähner · Marcel Seelbach Benkner · Michael Moeller · Christian Theobalt · Vladislav Golyanik | N/A | Code |
| NS3D: Neuro-Symbolic Grounding of 3D Objects and Relations | Joy Hsu · Jiayuan Mao · Jiajun Wu | N/A | Code |
| TempSAL – Uncovering Temporal Information for Deep Saliency Prediction | Bahar Aydemir · Ludo Hoffstetter · Tong Zhang · Mathieu Salzmann · Sabine Süsstrunk | N/A | Code |
| BiasBed – Rigorous Texture Bias Evaluation | Nikolai Kalischek · Rodrigo Caye Daudt · Torben Peters · Reinhard Furrer · Jan D. Wegner · Konrad Schindler | N/A | Code |
| Real-Time Neural Light Field on Mobile Devices | Junli Cao · Huan Wang · Pavlo Chemerys · Vladislav Shakhrai · Ju Hu · Yun Fu · Denys Makoviichuk · Sergey Tulyakov · Jian Ren | N/A | Code |
| Where Is My Wallet? Modeling Object Proposal Sets for Egocentric Visual Query Localization | Mengmeng Xu · Yanghao Li · Cheng-Yang Fu · Bernard Ghanem · Tao Xiang · Juan-Manuel Pérez-Rúa | N/A | Code |
| DiffusionRig: Learning Personalized Priors for Facial Appearance Editing | Zheng Ding · Xuaner Zhang · Zhihao Xia · Lars Jebe · Zhuowen Tu · Xiuming Zhang | N/A | Code |
| Neural Scene Chronology | Haotong Lin · Qianqian Wang · Ruojin Cai · Sida Peng · Hadar Averbuch-Elor · Xiaowei Zhou · Noah Snavely | N/A | Code |
| Diversity-Aware Meta Visual Prompting | Qidong Huang · Xiaoyi Dong · Dongdong Chen · Weiming Zhang · Feifei Wang · Gang Hua · Nenghai Yu | N/A | Code |
| Privacy-Preserving Representations Are Not Enough: Recovering Scene Content From Camera Poses | Kunal Chelani · Torsten Sattler · Fredrik Kahl · Zuzana Kukelova | N/A | Code |
| Masked Jigsaw Puzzle: A Versatile Position Embedding for Vision Transformers | Bin Ren · Yahui Liu · Yue Song · Wei Bi · Rita Cucchiara · Nicu Sebe · Wei Wang | N/A | Code |
| Box-Level Active Detection | Mengyao Lyu · Jundong Zhou · Hui Chen · Yijie Huang · Dongdong Yu · Yaqian Li · Yandong Guo · Yuchen Guo · Liuyu Xiang · Guiguang Ding | N/A | Code |
| Unlearnable Clusters: Towards Label-Agnostic Unlearnable Examples | Jiaming Zhang · Xingjun Ma · Qi Yi · Jitao Sang · Yu-Gang Jiang · Yaowei Wang · Changsheng Xu | N/A | Code |
| Generalized Relation Modeling for Transformer Tracking | Shenyuan Gao · Chunluan Zhou · Jun Zhang | N/A | Code |
| MoFusion: A Framework for Denoising-Diffusion-Based Motion Synthesis | Rishabh Dabral · Muhammad Hamza Mughal · Vladislav Golyanik · Christian Theobalt | N/A | Code |
| Patch-Mix Transformer for Unsupervised Domain Adaptation: A Game Perspective | Jinjing Zhu · Haotian Bai · Lin Wang | N/A | Code |
| Distilling Neural Fields for Real-Time Articulated Shape Reconstruction | Jeff Tan · Gengshan Yang · Deva Ramanan | N/A | Code |
| Sampling Is Matter: Point-Guided 3D Human Mesh Reconstruction | Jeonghwan Kim · Mi-Gyeong Gwon · Hyunwoo Park · Hyukmin Kwon · Gi-Mun Um · Wonjun Kim | N/A | Code |
| Image Quality-Aware Diagnosis via Meta-Knowledge Co-Embedding | Haoxuan Che · Siyu Chen · Hao Chen | N/A | Code |
| Towards Practical Plug-and-Play Diffusion Models | Hyojun Go · Yunsung Lee · Jin-Young Kim · Seunghyun Lee · Myeongho Jeong · Hyun Seung Lee · Seungtaek Choi | N/A | Code |
| HRDFuse: Monocular 360° Depth Estimation by Collaboratively Learning Holistic-With-Regional Depth Distributions | Hao Ai · Zidong Cao · Yan-Pei Cao · Ying Shan · Lin Wang | N/A | Code |
| KERM: Knowledge Enhanced Reasoning for Vision-and-Language Navigation | Xiangyang Li · Zihan Wang · Jiahao Yang · Yaowei Wang · Shuqiang Jiang | N/A | Code |
| Tri-Perspective View for Vision-Based 3D Semantic Occupancy Prediction | Yuanhui Huang · Wenzhao Zheng · Yunpeng Zhang · Jie Zhou · Jiwen Lu | N/A | Code |
| EventNeRF: Neural Radiance Fields From a Single Colour Event Camera | Viktor Rudnev · Mohamed Elgharib · Christian Theobalt · Vladislav Golyanik | N/A | Code |
| Physically Realizable Natural-Looking Clothing Textures Evade Person Detectors via 3D Modeling | Zhanhao Hu · Wenda Chu · Xiaopei Zhu · Hui Zhang · Bo Zhang · Xiaolin Hu | N/A | Code |
| Global Vision Transformer Pruning With Hessian-Aware Saliency | Huanrui Yang · Hongxu Yin · Maying Shen · Pavlo Molchanov · Hai Li · Jan Kautz | N/A | Code |
| 3D Human Pose Estimation With Spatio-Temporal Criss-Cross Attention | Zhenhua Tang · Zhaofan Qiu · Yanbin Hao · Richang Hong · Ting Yao | N/A | Code |
| Learning Spatial-Temporal Implicit Neural Representations for Event-Guided Video Super-Resolution | Yunfan Lu · Zipeng Wang · Minjie Liu · Hongjian Wang · Lin Wang | N/A | Code |
| StyleGAN Salon: Multi-View Latent Optimization for Pose-Invariant Hairstyle Transfer | Sasikarn Khwanmuang · Pakkapon Phongthawee · Patsorn Sangkloy · Supasorn Suwajanakorn | N/A | Code |
| ShapeClipper: Scalable 3D Shape Learning From Single-View Images via Geometric and CLIP-Based Consistency | Zixuan Huang · Varun Jampani · Anh Thai · Yuanzhen Li · Stefan Stojanov · James M. Rehg | N/A | Code |
| Efficient Scale-Invariant Generator With Column-Row Entangled Pixel Synthesis | Thuan Hoang Nguyen · Thanh Van Le · Anh Tran | N/A | Code |
| Paired-Point Lifting for Enhanced Privacy-Preserving Visual Localization | Chunghwan Lee · Jaihoon Kim · Chanhyuk Yun · Je Hyeong Hong | N/A | Code |
| Both Style and Distortion Matter: Dual-Path Unsupervised Domain Adaptation for Panoramic Semantic Segmentation | Xu Zheng · Jinjing Zhu · Yexin Liu · Zidong Cao · Chong Fu · Lin Wang | N/A | Code |
| Adaptive Human Matting for Dynamic Videos | Chung-Ching Lin · Jiang Wang · Kun Luo · Kevin Lin · Linjie Li · Lijuan Wang · Zicheng Liu | N/A | Code |
| High-Fidelity Facial Avatar Reconstruction From Monocular Video With Generative Priors | Yunpeng Bai · Yanbo Fan · Xuan Wang · Yong Zhang · Jingxiang Sun · Chun Yuan · Ying Shan | N/A | Code |
| Data-Free Knowledge Distillation via Feature Exchange and Activation Region Constraint | Shikang Yu · Jiachen Chen · Hu Han · Shuqiang Jiang | N/A | Code |
| Im2Hands: Learning Attentive Implicit Representation of Interacting Two-Hand Shapes | Jihyun Lee · Minhyuk Sung · Honggyu Choi · Tae-Kyun Kim | N/A | Code |
| MD-VQA: Multi-Dimensional Quality Assessment for UGC Live Videos | Zicheng Zhang · Wei Wu · Wei Sun · Danyang Tu · Wei Lu · Xiongkuo Min · Ying Chen · Guangtao Zhai | N/A | Code |
| Make Landscape Flatter in Differentially Private Federated Learning | Yifan Shi · Yingqi Liu · Kang Wei · Li Shen · Xueqian Wang · Dacheng Tao | N/A | Code |
| A Large-Scale Robustness Analysis of Video Action Recognition Models | Madeline Chantry Schiappa · Naman Biyani · Prudvi Kamtam · Shruti Vyas · Hamid Palangi · Vibhav Vineet · Yogesh S. Rawat | N/A | Code |
| Multi-Concept Customization of Text-to-Image Diffusion | Nupur Kumari · Bingliang Zhang · Richard Zhang · Eli Shechtman · Jun-Yan Zhu | N/A | Code |
| GANHead: Towards Generative Animatable Neural Head Avatars | Sijing Wu · Yichao Yan · Yunhao Li · Yuhao Cheng · Wenhan Zhu · Ke Gao · Xiaobo Li · Guangtao Zhai | N/A | Code |
| Neural Koopman Pooling: Control-Inspired Temporal Dynamics Encoding for Skeleton-Based Action Recognition | Xinghan Wang · Xin Xu · Yadong Mu | N/A | Code |
| Hierarchical B-Frame Video Coding Using Two-Layer CANF Without Motion Coding | David Alexandre · Hsueh-Ming Hang · Wen-Hsiao Peng | N/A | Code |
| FeatER: An Efficient Network for Human Reconstruction via Feature Map-Based TransformER | Ce Zheng · Matias Mendieta · Taojiannan Yang · Guo-Jun Qi · Chen Chen | N/A | Code |
| Delivering Arbitrary-Modal Semantic Segmentation | Jiaming Zhang · Ruiping Liu · Hao Shi · Kailun Yang · Simon Reiß · Kunyu Peng · Haodong Fu · Kaiwei Wang · Rainer Stiefelhagen | N/A | Code |
| Deep Graph-Based Spatial Consistency for Robust Non-Rigid Point Cloud Registration | Zheng Qin · Hao Yu · Changjian Wang · Yuxing Peng · Kai Xu | N/A | Code |
| HumanGen: Generating Human Radiance Fields With Explicit Priors | Suyi Jiang · Haoran Jiang · Ziyu Wang · Haimin Luo · Wenzheng Chen · Lan Xu | N/A | Code |
| Boosting Accuracy and Robustness of Student Models via Adaptive Adversarial Distillation | Bo Huang · Mingyang Chen · Yi Wang · Junda Lu · Minhao Cheng · Wei Wang | N/A | Code |
| Plug-and-Play Diffusion Features for Text-Driven Image-to-Image Translation | Narek Tumanyan · Michal Geyer · Shai Bagon · Tali Dekel | N/A | Code |
| Rotation-Invariant Transformer for Point Cloud Matching | Hao Yu · Zheng Qin · Ji Hou · Mahdi Saleh · Dongsheng Li · Benjamin Busam · Slobodan Ilic | N/A | Code |
| CLIP2Scene: Towards Label-Efficient 3D Scene Understanding by CLIP | Runnan Chen · Youquan Liu · Lingdong Kong · Xinge Zhu · Yuexin Ma · Yikang Li · Yuenan Hou · Yu Qiao · Wenping Wang | N/A | Code |
| Real-Time 6K Image Rescaling With Rate-Distortion Optimization | Chenyang Qi · Xin Yang · Ka Leong Cheng · Ying-Cong Chen · Qifeng Chen | N/A | Code |
| Focused and Collaborative Feedback Integration for Interactive Image Segmentation | Qiaoqiao Wei · Hui Zhang · Jun-Hai Yong | N/A | Code |
| Language-Guided Music Recommendation for Video via Prompt Analogies | Daniel McKee · Justin Salamon · Josef Sivic · Bryan Russell | N/A | Code |
| TarViS: A Unified Approach for Target-Based Video Segmentation | Ali Athar · Alexander Hermans · Jonathon Luiten · Deva Ramanan · Bastian Leibe | N/A | Code |
| Meta-Personalizing Vision-Language Models To Find Named Instances in Video | Chun-Hsiao Yeh · Bryan Russell · Josef Sivic · Fabian Caba Heilbron · Simon Jenni | N/A | Code |
| ARKitTrack: A New Diverse Dataset for Tracking Using Mobile RGB-D Data | Haojie Zhao · Junsong Chen · Lijun Wang · Huchuan Lu | N/A | Code |
| Scaling Language-Image Pre-Training via Masking | Yanghao Li · Haoqi Fan · Ronghang Hu · Christoph Feichtenhofer · Kaiming He | N/A | Code |
| SeqTrack: Sequence to Sequence Learning for Visual Object Tracking | Xin Chen · Houwen Peng · Dong Wang · Huchuan Lu · Han Hu | N/A | Code |
| Learning Neural Parametric Head Models | Simon Giebenhain · Tobias Kirschstein · Markos Georgopoulos · Martin Rünz · Lourdes Agapito · Matthias Nießner | N/A | Code |
| L-CoIns: Language-Based Colorization With Instance Awareness | Zheng Chang · Shuchen Weng · Peixuan Zhang · Yu Li · Si Li · Boxin Shi | N/A | Code |
| Vid2Seq: Large-Scale Pretraining of a Visual Language Model for Dense Video Captioning | Antoine Yang · Arsha Nagrani · Paul Hongsuck Seo · Antoine Miech · Jordi Pont-Tuset · Ivan Laptev · Josef Sivic · Cordelia Schmid | N/A | Code |
| ULIP: Learning a Unified Representation of Language, Images, and Point Clouds for 3D Understanding | Le Xue · Mingfei Gao · Chen Xing · Roberto Martín-Martín · Jiajun Wu · Caiming Xiong · Ran Xu · Juan Carlos Niebles · Silvio Savarese | N/A | Code |
| GM-NeRF: Learning Generalizable Model-Based Neural Radiance Fields From Multi-View Images | Jianchuan Chen · Wentao Yi · Liqian Ma · Xu Jia · Huchuan Lu | N/A | Code |
| MIC: Masked Image Consistency for Context-Enhanced Domain Adaptation | Lukas Hoyer · Dengxin Dai · Haoran Wang · Luc Van Gool | N/A | Code |
| MED-VT: Multiscale Encoder-Decoder Video Transformer With Application To Object Segmentation | Rezaul Karim · He Zhao · Richard P. Wildes · Mennatullah Siam | N/A | Code |
| Hierarchical Dense Correlation Distillation for Few-Shot Segmentation | Bohao Peng · Zhuotao Tian · Xiaoyang Wu · Chengyao Wang · Shu Liu · Jingyong Su · Jiaya Jia | N/A | Code |
| Universal Instance Perception As Object Discovery and Retrieval | Bin Yan · Yi Jiang · Jiannan Wu · Dong Wang · Ping Luo · Zehuan Yuan · Huchuan Lu | N/A | Code |
| Bi-Directional Distribution Alignment for Transductive Zero-Shot Learning | Zhicai Wang · Yanbin Hao · Tingting Mu · Ouxiang Li · Shuo Wang · Xiangnan He | N/A | Code |
| Open-Vocabulary Semantic Segmentation With Mask-Adapted CLIP | Feng Liang · Bichen Wu · Xiaoliang Dai · Kunpeng Li · Yinan Zhao · Hang Zhang · Peizhao Zhang · Peter Vajda · Diana Marculescu | N/A | Code |
| ImageBind: One Embedding Space To Bind Them All | Rohit Girdhar · Alaaeldin El-Nouby · Zhuang Liu · Mannat Singh · Kalyan Vasudev Alwala · Armand Joulin · Ishan Misra | N/A | Code |
| Learning and Aggregating Lane Graphs for Urban Automated Driving | Martin Büchner · Jannik Zürn · Ion-George Todoran · Abhinav Valada · Wolfram Burgard | N/A | Code |
| High-Resolution Image Reconstruction With Latent Diffusion Models From Human Brain Activity | Yu Takagi · Shinji Nishimoto | N/A | Code |
| 3D Cinemagraphy From a Single Image | Xingyi Li · Zhiguo Cao · Huiqiang Sun · Jianming Zhang · Ke Xian · Guosheng Lin | N/A | Code |
| Understanding and Improving Visual Prompting: A Label-Mapping Perspective | Aochuan Chen · Yuguang Yao · Pin-Yu Chen · Yihua Zhang · Sijia Liu | N/A | Code |
| Cut and Learn for Unsupervised Object Detection and Instance Segmentation | Xudong Wang · Rohit Girdhar · Stella X. Yu · Ishan Misra | N/A | Code |
| DF-Platter: Multi-Face Heterogeneous Deepfake Dataset | Kartik Narayan · Harsh Agarwal · Kartik Thakral · Surbhi Mittal · Mayank Vatsa · Richa Singh | N/A | Code |
| BASiS: Batch Aligned Spectral Embedding Space | Or Streicher · Ido Cohen · Guy Gilboa | N/A | Code |
| Annealing-Based Label-Transfer Learning for Open World Object Detection | Yuqing Ma · Hainan Li · Zhange Zhang · Jinyang Guo · Shanghang Zhang · Ruihao Gong · Xianglong Liu | N/A | Code |
| Behind the Scenes: Density Fields for Single View Reconstruction | Felix Wimbauer · Nan Yang · Christian Rupprecht · Daniel Cremers | N/A | Code |
| Learning Video Representations From Large Language Models | Yue Zhao · Ishan Misra · Philipp Krähenbühl · Rohit Girdhar | N/A | Code |
| Quantum Multi-Model Fitting | Matteo Farina · Luca Magri · Willi Menapace · Elisa Ricci · Vladislav Golyanik · Federica Arrigoni | N/A | Code |
| Power Bundle Adjustment for Large-Scale 3D Reconstruction | Simon Weber · Nikolaus Demmel · Tin Chon Chan · Daniel Cremers | N/A | Code |
| Optimization-Inspired Cross-Attention Transformer for Compressive Sensing | Jiechong Song · Chong Mou · Shiqi Wang · Siwei Ma · Jian Zhang | N/A | Code |
| NeuMap: Neural Coordinate Mapping by Auto-Transdecoder for Camera Localization | Shitao Tang · Sicong Tang · Andrea Tagliasacchi · Ping Tan · Yasutaka Furukawa | N/A | Code |
| Back to the Source: Diffusion-Driven Adaptation To Test-Time Corruption | Jin Gao · Jialing Zhang · Xihui Liu · Trevor Darrell · Evan Shelhamer · Dequan Wang | N/A | Code |
| Learning Neural Duplex Radiance Fields for Real-Time View Synthesis | Ziyu Wan · Christian Richardt · Aljaž Božič · Chao Li · Vijay Rengarajan · Seonghyeon Nam · Xiaoyu Xiang · Tuotuo Li · Bo Zhu · Rakesh Ranjan · Jing Liao | N/A | Code |
| Object Pop-Up: Can We Infer 3D Objects and Their Poses From Human Interactions Alone? | Ilya A. Petrov · Riccardo Marin · Julian Chibane · Gerard Pons-Moll | N/A | Code |
| G-MSM: Unsupervised Multi-Shape Matching With Graph-Based Affinity Priors | Marvin Eisenberger · Aysim Toker · Laura Leal-Taixé · Daniel Cremers | N/A | Code |
| Data-Efficient Large Scale Place Recognition With Graded Similarity Supervision | María Leyva-Vallina · Nicola Strisciuglio · Nicolai Petkov | N/A | Code |
| Mapping Degeneration Meets Label Evolution: Learning Infrared Small Target Detection With Single Point Supervision | Xinyi Ying · Li Liu · Yingqian Wang · Ruojing Li · Nuo Chen · Zaiping Lin · Weidong Sheng · Shilin Zhou | N/A | Code |
| Instant Domain Augmentation for LiDAR Semantic Segmentation | Kwonyoung Ryu · Soonmin Hwang · Jaesik Park | N/A | Code |
| R2Former: Unified Retrieval and Reranking Transformer for Place Recognition | Sijie Zhu · Linjie Yang · Chen Chen · Mubarak Shah · Xiaohui Shen · Heng Wang | N/A | Code |
| Detecting and Grounding Multi-Modal Media Manipulation | Rui Shao · Tianxing Wu · Ziwei Liu | N/A | Code |
| Detecting Backdoors in Pre-Trained Encoders | Shiwei Feng · Guanhong Tao · Siyuan Cheng · Guangyu Shen · Xiangzhe Xu · Yingqi Liu · Kaiyuan Zhang · Shiqing Ma · Xiangyu Zhang | N/A | Code |
| Scaling Up GANs for Text-to-Image Synthesis | Minguk Kang · Jun-Yan Zhu · Richard Zhang · Jaesik Park · Eli Shechtman · Sylvain Paris · Taesung Park | N/A | Code |
| Dense-Localizing Audio-Visual Events in Untrimmed Videos: A Large-Scale Benchmark and Baseline | Tiantian Geng · Teng Wang · Jinming Duan · Runmin Cong · Feng Zheng | N/A | Code |
| PanoHead: Geometry-Aware 3D Full-Head Synthesis in 360° | Sizhe An · Hongyi Xu · Yichun Shi · Guoxian Song · Umit Y. Ogras · Linjie Luo | N/A | Code |
| Modality-Invariant Visual Odometry for Embodied Vision | Marius Memmel · Roman Bachmann · Amir Zamir | N/A | Code |
| 3D Video Loops From Asynchronous Input | Li Ma · Xiaoyu Li · Jing Liao · Pedro V. Sander | N/A | Code |
| Human-Art: A Versatile Human-Centric Dataset Bridging Natural and Artificial Scenes | Xuan Ju · Ailing Zeng · Jianan Wang · Qiang Xu · Lei Zhang | N/A | Code |
| PosterLayout: A New Benchmark and Approach for Content-Aware Visual-Textual Presentation Layout | Hsiao Yuan Hsu · Xiangteng He · Yuxin Peng · Hao Kong · Qing Zhang | N/A | Code |
| A Soma Segmentation Benchmark in Full Adult Fly Brain | Xiaoyu Liu · Bo Hu · Mingxing Li · Wei Huang · Yueyi Zhang · Zhiwei Xiong | N/A | Code |
| One-Stage 3D Whole-Body Mesh Recovery With Component Aware Transformer | Jing Lin · Ailing Zeng · Haoqian Wang · Lei Zhang · Yu Li | N/A | Code |
| Listening Human Behavior: 3D Human Pose Estimation With Acoustic Signals | Yuto Shibata · Yutaka Kawashima · Mariko Isogawa · Go Irie · Akisato Kimura · Yoshimitsu Aoki | N/A | Code |
| Hand Avatar: Free-Pose Hand Animation and Rendering From Monocular Video | Xingyu Chen · Baoyuan Wang · Heung-Yeung Shum | N/A | Code |
| M6Doc: A Large-Scale Multi-Format, Multi-Type, Multi-Layout, Multi-Language, Multi-Annotation Category Dataset for Modern Document Layout Analysis | Hiuyi Cheng · Peirong Zhang · Sihang Wu · Jiaxin Zhang · Qiyuan Zhu · Zecheng Xie · Jing Li · Kai Ding · Lianwen Jin | N/A | Code |
| Neural Congealing: Aligning Images to a Joint Semantic Atlas | Dolev Ofri-Amar · Michal Geyer · Yoni Kasten · Tali Dekel | N/A | Code |
| BoxTeacher: Exploring High-Quality Pseudo Labels for Weakly Supervised Instance Segmentation | Tianheng Cheng · Xinggang Wang · Shaoyu Chen · Qian Zhang · Wenyu Liu | N/A | Code |
| BEDLAM: A Synthetic Dataset of Bodies Exhibiting Detailed Lifelike Animated Motion | Michael J. Black · Priyanka Patel · Joachim Tesch · Jinlong Yang | N/A | Code |
| Mask DINO: Towards a Unified Transformer-Based Framework for Object Detection and Segmentation | Feng Li · Hao Zhang · Huaizhe Xu · Shilong Liu · Lei Zhang · Lionel M. Ni · Heung-Yeung Shum | N/A | Code |
| Learning Detailed Radiance Manifolds for High-Fidelity and 3D-Consistent Portrait Synthesis From Monocular Image | Yu Deng · Baoyuan Wang · Heung-Yeung Shum | N/A | Code |
| 3DAvatarGAN: Bridging Domains for Personalized Editable Avatars | Rameen Abdal · Hsin-Ying Lee · Peihao Zhu · Menglei Chai · Aliaksandr Siarohin · Peter Wonka · Sergey Tulyakov | N/A | Code |
| FLEX: Full-Body Grasping Without Full-Body Grasps | Purva Tendulkar · Dídac Surís · Carl Vondrick | N/A | Code |
| UDE: A Unified Driving Engine for Human Motion Generation | Zixiang Zhou · Baoyuan Wang | N/A | Code |
| Video Test-Time Adaptation for Action Recognition | Wei Lin · Muhammad Jehanzeb Mirza · Mateusz Kozinski · Horst Possegger · Hilde Kuehne · Horst Bischof | N/A | Code |
| Progressive Disentangled Representation Learning for Fine-Grained Controllable Talking Head Synthesis | Duomin Wang · Yu Deng · Zixin Yin · Heung-Yeung Shum · Baoyuan Wang | N/A | Code |
| MIME: Human-Aware 3D Scene Generation | Hongwei Yi · Chun-Hao P. Huang · Shashank Tripathi · Lea Hering · Justus Thies · Michael J. Black | N/A | Code |
| AstroNet: When Astrocyte Meets Artificial Neural Network | Mengqiao Han · Liyuan Pan · Xiabi Liu | N/A | Code |
| Stimulus Verification Is a Universal and Effective Sampler in Multi-Modal Human Trajectory Prediction | Jianhua Sun · Yuxuan Li · Liang Chai · Cewu Lu | N/A | Code |
| ActMAD: Activation Matching To Align Distributions for Test-Time-Training | Muhammad Jehanzeb Mirza · Pol Jané Soneira · Wei Lin · Mateusz Kozinski · Horst Possegger · Horst Bischof | N/A | Code |
| Visual Prompt Multi-Modal Tracking | Jiawen Zhu · Simiao Lai · Xin Chen · Dong Wang · Huchuan Lu | N/A | Code |
| Reconstructing Signing Avatars From Video Using Linguistic Priors | Maria-Paola Forte · Peter Kulits · Chun-Hao P. Huang · Vasileios Choutas · Dimitrios Tzionas · Katherine J. Kuchenbecker · Michael J. Black | N/A | Code |
| Patch-Based 3D Natural Scene Generation From a Single Example | Weiyu Li · Xuelin Chen · Jue Wang · Baoquan Chen | N/A | Code |
| Re-Basin via Implicit Sinkhorn Differentiation | Fidel A. Guerrero Peña · Heitor Rapela Medeiros · Thomas Dubail · Masih Aminbeidokhti · Eric Granger · Marco Pedersoli | N/A | Code |
| Slide-Transformer: Hierarchical Vision Transformer With Local Self-Attention | Xuran Pan · Tianzhu Ye · Zhuofan Xia · Shiji Song · Gao Huang | N/A | Code |
| Planning-Oriented Autonomous Driving | Yihan Hu · Jiazhi Yang · Li Chen · Keyu Li · Chonghao Sima · Xizhou Zhu · Siqi Chai · Senyao Du · Tianwei Lin · Wenhai Wang · Lewei Lu · Xiaosong Jia · Qiang Liu · Jifeng Dai · Yu Qiao · Hongyang Li | N/A | Code |
| Enhancing Deformable Local Features by Jointly Learning To Detect and Describe Keypoints | Guilherme Potje · Felipe Cadar · André Araujo · Renato Martins · Erickson R. Nascimento | N/A | Code |
| 3D Human Pose Estimation via Intuitive Physics | Shashank Tripathi · Lea Müller · Chun-Hao P. Huang · Omid Taheri · Michael J. Black · Dimitrios Tzionas | N/A | Code |
| Defending Against Patch-Based Backdoor Attacks on Self-Supervised Learning | Ajinkya Tejankar · Maziar Sanjabi · Qifan Wang · Sinong Wang · Hamed Firooz · Hamed Pirsiavash · Liang Tan | N/A | Code |
| PointCMP: Contrastive Mask Prediction for Self-Supervised Learning on Point Cloud Videos | Zhiqiang Shen · Xiaoxiao Sheng · Longguang Wang · Yulan Guo · Qiong Liu · Xi Zhou | N/A | Code |
| Blowing in the Wind: CycleNet for Human Cinemagraphs From Still Images | Hugo Bertiche · Niloy J. Mitra · Kuldeep Kulkarni · Chun-Hao P. Huang · Tuanfeng Y. Wang · Meysam Madadi · Sergio Escalera · Duygu Ceylan | N/A | Code |
| Multiple Instance Learning via Iterative Self-Paced Supervised Contrastive Learning | Kangning Liu · Weicheng Zhu · Yiqiu Shen · Sheng Liu · Narges Razavian · Krzysztof J. Geras · Carlos Fernandez-Granda | N/A | Code |
| Learning Steerable Function for Efficient Image Resampling | Jiacheng Li · Chang Chen · Wei Huang · Zhiqiang Lang · Fenglong Song · Youliang Yan · Zhiwei Xiong | N/A | Code |
| Deep Deterministic Uncertainty: A New Simple Baseline | Jishnu Mukhoti · Andreas Kirsch · Joost van Amersfoort · Philip H.S. Torr · Yarin Gal | N/A | Code |
| Removing Objects From Neural Radiance Fields | Silvan Weder · Guillermo Garcia-Hernando · Áron Monszpart · Marc Pollefeys · Gabriel J. Brostow · Michael Firman · Sara Vicente | N/A | Code |
| PartManip: Learning Cross-Category Generalizable Part Manipulation Policy From Point Cloud Observations | Haoran Geng · Ziming Li · Yiran Geng · Jiayi Chen · Hao Dong · He Wang | N/A | Code |
| T-SEA: Transfer-Based Self-Ensemble Attack on Object Detection | Hao Huang · Ziyan Chen · Huanran Chen · Yongtao Wang · Kevin Zhang | N/A | Code |
| DINN360: Deformable Invertible Neural Network for Latitude-Aware 360° Image Rescaling | Yichen Guo · Mai Xu · Lai Jiang · Leonid Sigal · Yunjin Chen | N/A | Code |
| Learning Human-to-Robot Handovers From Point Clouds | Sammy Christen · Wei Yang · Claudia Pérez-D’Arpino · Otmar Hilliges · Dieter Fox · Yu-Wei Chao | N/A | Code |
| Multi-View Azimuth Stereo via Tangent Space Consistency | Xu Cao · Hiroaki Santo · Fumio Okura · Yasuyuki Matsushita | N/A | Code |
| Mod-Squad: Designing Mixtures of Experts As Modular Multi-Task Learners | Zitian Chen · Yikang Shen · Mingyu Ding · Zhenfang Chen · Hengshuang Zhao · Erik G. Learned-Miller · Chuang Gan | N/A | Code |
| gSDF: Geometry-Driven Signed Distance Functions for 3D Hand-Object Reconstruction | Zerui Chen · Shizhe Chen · Cordelia Schmid · Ivan Laptev | N/A | Code |
| Delving StyleGAN Inversion for Image Editing: A Foundation Latent Space Viewpoint | Hongyu Liu · Yibing Song · Qifeng Chen | N/A | Code |
| Generative Bias for Robust Visual Question Answering | Jae Won Cho · Dong-Jin Kim · Hyeonggon Ryu · In So Kweon | N/A | Code |
| Backdoor Defense via Deconfounded Representation Learning | Zaixi Zhang · Qi Liu · Zhicai Wang · Zepu Lu · Qingyong Hu | N/A | Code |
| High-Fidelity 3D GAN Inversion by Pseudo-Multi-View Optimization | Jiaxin Xie · Hao Ouyang · Jingtan Piao · Chenyang Lei · Qifeng Chen | N/A | Code |
| Affordance Diffusion: Synthesizing Hand-Object Interactions | Yufei Ye · Xueting Li · Abhinav Gupta · Shalini De Mello · Stan Birchfield · Jiaming Song · Shubham Tulsiani · Sifei Liu | N/A | Code |
| Zero-Shot Pose Transfer for Unrigged Stylized 3D Characters | Jiashun Wang · Xueting Li · Sifei Liu · Shalini De Mello · Orazio Gallo · Xiaolong Wang · Jan Kautz | N/A | Code |
| Point Cloud Forecasting as a Proxy for 4D Occupancy Forecasting | Tarasha Khurana · Peiyun Hu · David Held · Deva Ramanan | N/A | Code |
| Are Data-Driven Explanations Robust Against Out-of-Distribution Data? | Tang Li · Fengchun Qiao · Mengmeng Ma · Xi Peng | N/A | Code |
| Multiscale Tensor Decomposition and Rendering Equation Encoding for View Synthesis | Kang Han · Wei Xiang | N/A | Code |
| Boosting Video Object Segmentation via Space-Time Correspondence Learning | Yurong Zhang · Liulei Li · Wenguan Wang · Rong Xie · Li Song · Wenjun Zhang | N/A | Code |
| X-Pruner: eXplainable Pruning for Vision Transformers | Lu Yu · Wei Xiang | N/A | Code |
| GazeNeRF: 3D-Aware Gaze Redirection With Neural Radiance Fields | Alessandro Ruzzi · Xiangwei Shi · Xi Wang · Gengyan Li · Shalini De Mello · Hyung Jin Chang · Xucong Zhang · Otmar Hilliges | N/A | Code |
| Real-Time Evaluation in Online Continual Learning: A New Hope | Yasir Ghunaim · Adel Bibi · Kumail Alhamoud · Motasem Alfarra · Hasan Abed Al Kader Hammoud · Ameya Prabhu · Philip H.S. Torr · Bernard Ghanem | N/A | Code |
| Contrastive Semi-Supervised Learning for Underwater Image Restoration via Reliable Bank | Shirui Huang · Keyan Wang · Huan Liu · Jun Chen · Yunsong Li | N/A | Code |
| A New Dataset Based on Images Taken by Blind People for Testing the Robustness of Image Classification Models Trained for ImageNet Categories | Reza Akbarian Bafghi · Danna Gurari | N/A | Code |
| Open-Vocabulary Panoptic Segmentation With Text-to-Image Diffusion Models | Jiarui Xu · Sifei Liu · Arash Vahdat · Wonmin Byeon · Xiaolong Wang · Shalini De Mello | N/A | Code |
| Reconstructing Animatable Categories From Videos | Gengshan Yang · Chaoyang Wang · N. Dinesh Reddy · Deva Ramanan | N/A | Code |
| Learning Visual Representations via Language-Guided Sampling | Mohamed El Banani · Karan Desai · Justin Johnson | N/A | Code |
| Four-View Geometry With Unknown Radial Distortion | Petr Hruby · Viktor Korotynskiy · Timothy Duff · Luke Oeding · Marc Pollefeys · Tomas Pajdla · Viktor Larsson | N/A | Code |
| DATID-3D: Diversity-Preserved Domain Adaptation Using Text-to-Image Diffusion for 3D Generative Model | Gwanghyun Kim · Se Young Chun | N/A | Code |
| ConZIC: Controllable Zero-Shot Image Captioning by Sampling-Based Polishing | Zequn Zeng · Hao Zhang · Ruiying Lu · Dongsheng Wang · Bo Chen · Zhengjue Wang | N/A | Code |
| Feature Separation and Recalibration for Adversarial Robustness | Woo Jae Kim · Yoonki Cho · Junsik Jung · Sung-Eui Yoon | N/A | Code |
| Event-Based Blurry Frame Interpolation Under Blind Exposure | Wenming Weng · Yueyi Zhang · Zhiwei Xiong | N/A | Code |
| MobileNeRF: Exploiting the Polygon Rasterization Pipeline for Efficient Neural Field Rendering on Mobile Architectures | Zhiqin Chen · Thomas Funkhouser · Peter Hedman · Andrea Tagliasacchi | N/A | Code |
| HandsOff: Labeled Dataset Generation With No Additional Human Annotations | Austin Xu · Mariya I. Vasileva · Achal Dave · Arjun Seshadri | N/A | Code |
| Analyzing and Diagnosing Pose Estimation With Attributions | Qiyuan He · Linlin Yang · Kerui Gu · Qiuxia Lin · Angela Yao | N/A | Code |
| Overcoming the Trade-Off Between Accuracy and Plausibility in 3D Hand Shape Reconstruction | Ziwei Yu · Chen Li · Linlin Yang · Xiaoxu Zheng · Michael Bi Mi · Gim Hee Lee · Angela Yao | N/A | Code |
| VIVE3D: Viewpoint-Independent Video Editing Using 3D-Aware GANs | Anna Frühstück · Nikolaos Sarafianos · Yuanlu Xu · Peter Wonka · Tony Tung | N/A | Code |
| Pruning Parameterization With Bi-Level Optimization for Efficient Semantic Segmentation on the Edge | Changdi Yang · Pu Zhao · Yanyu Li · Wei Niu · Jiexiong Guan · Hao Tang · Minghai Qin · Bin Ren · Xue Lin · Yanzhi Wang | N/A | Code |
| Nerflets: Local Radiance Fields for Efficient Structure-Aware 3D Scene Representation From 2D Supervision | Xiaoshuai Zhang · Abhijit Kundu · Thomas Funkhouser · Leonidas Guibas · Hao Su · Kyle Genova | N/A | Code |
| VDN-NeRF: Resolving Shape-Radiance Ambiguity via View-Dependence Normalization | Bingfan Zhu · Yanchao Yang · Xulong Wang · Youyi Zheng · Leonidas Guibas | N/A | Code |
| OpenScene: 3D Scene Understanding With Open Vocabularies | Songyou Peng · Kyle Genova · Chiyu “Max” Jiang · Andrea Tagliasacchi · Marc Pollefeys · Thomas Funkhouser | N/A | Code |
| A New Benchmark: On the Utility of Synthetic Data With Blender for Bare Supervised Learning and Downstream Domain Adaptation | Hui Tang · Kui Jia | N/A | Code |
| Implicit View-Time Interpolation of Stereo Videos Using Multi-Plane Disparities and Non-Uniform Coordinates | Avinash Paliwal · Andrii Tsarov · Nima Khademi Kalantari | N/A | Code |
| A Large-Scale Homography Benchmark | Daniel Barath · Dmytro Mishkin · Michal Polic · Wolfgang Förstner · Jiri Matas | N/A | Code |
| Glocal Energy-Based Learning for Few-Shot Open-Set Recognition | Haoyu Wang · Guansong Pang · Peng Wang · Lei Zhang · Wei Wei · Yanning Zhang | N/A | Code |
| MEDIC: Remove Model Backdoors via Importance Driven Cloning | Qiuling Xu · Guanhong Tao · Jean Honorio · Yingqi Liu · Shengwei An · Guangyu Shen · Siyuan Cheng · Xiangyu Zhang | N/A | Code |
| Finding Geometric Models by Clustering in the Consensus Space | Daniel Barath · Denys Rozumnyi · Ivan Eichhardt · Levente Hajder · Jiri Matas | N/A | Code |
| Imagic: Text-Based Real Image Editing With Diffusion Models | Bahjat Kawar · Shiran Zada · Oran Lang · Omer Tov · Huiwen Chang · Tali Dekel · Inbar Mosseri · Michal Irani | N/A | Code |
| DeepLSD: Line Segment Detection and Refinement With Deep Image Gradients | Rémi Pautrat · Daniel Barath · Viktor Larsson · Martin R. Oswald · Marc Pollefeys | N/A | Code |
| H2ONet: Hand-Occlusion-and-Orientation-Aware Network for Real-Time 3D Hand Mesh Reconstruction | Hao Xu · Tianyu Wang · Xiao Tang · Chi-Wing Fu | N/A | Code |
| Learning Weather-General and Weather-Specific Features for Image Restoration Under Multiple Adverse Weather Conditions | Yurui Zhu · Tianyu Wang · Xueyang Fu · Xuanyu Yang · Xin Guo · Jifeng Dai · Yu Qiao · Xiaowei Hu | N/A | Code |
| MoDi: Unconditional Motion Synthesis From Diverse Data | Sigal Raab · Inbal Leibovitch · Peizhuo Li · Kfir Aberman · Olga Sorkine-Hornung · Daniel Cohen-Or | N/A | Code |
| PC2: Projection-Conditioned Point Cloud Diffusion for Single-Image 3D Reconstruction | Luke Melas-Kyriazi · Christian Rupprecht · Andrea Vedaldi | N/A | Code |
| SliceMatch: Geometry-Guided Aggregation for Cross-View Pose Estimation | Zimin Xia · Ted Lentsch · Julian F. P. Kooij | N/A | Code |
| RealFusion: 360° Reconstruction of Any Object From a Single Image | Luke Melas-Kyriazi · Iro Laina · Christian Rupprecht · Andrea Vedaldi | N/A | Code |
| Masked and Adaptive Transformer for Exemplar Based Image Translation | Chang Jiang · Fei Gao · Biao Ma · Yuhao Lin · Nannan Wang · Gang Xu | N/A | Code |
| DynamicStereo: Consistent Dynamic Depth From Stereo Videos | Nikita Karaev · Ignacio Rocco · Benjamin Graham · Natalia Neverova · Andrea Vedaldi · Christian Rupprecht | N/A | Code |
| Masked Representation Learning for Domain Generalized Stereo Matching | Zhibo Rao · Bangshu Xiong · Mingyi He · Mochu Xiang · Renjie He · Zhelun Shen · Xing Li | N/A | Code |
| MV-JAR: Masked Voxel Jigsaw and Reconstruction for LiDAR-Based Self-Supervised Pre-Training | Runsen Xu · Tai Wang · Wenwei Zhang · Runjian Chen · Jinkun Cao · Jiangmiao Pang · Dahua Lin | N/A | Code |
| Fresnel Microfacet BRDF: Unification of Polari-Radiometric Surface-Body Reflection | Tomoki Ichikawa · Yoshiki Fukao · Shohei Nobuhara · Ko Nishino | N/A | Code |
| Instant Multi-View Head Capture Through Learnable Registration | Timo Bolkart · Tianye Li · Michael J. Black | N/A | Code |
| POEM: Reconstructing Hand in a Point Embedded Multi-View Stereo | Lixin Yang · Jian Xu · Licheng Zhong · Xinyu Zhan · Zhicheng Wang · Kejian Wu · Cewu Lu | N/A | Code |
| Diffusion-Based Generation, Optimization, and Planning in 3D Scenes | Siyuan Huang · Zan Wang · Puhao Li · Baoxiong Jia · Tengyu Liu · Yixin Zhu · Wei Liang · Song-Chun Zhu | N/A | Code |
| Visibility Constrained Wide-Band Illumination Spectrum Design for Seeing-in-the-Dark | Muyao Niu · Zhuoxiao Li · Zhihang Zhong · Yinqiang Zheng | N/A | Code |
| SketchXAI: A First Look at Explainability for Human Sketches | Zhiyu Qu · Yulia Gryaditskaya · Ke Li · Kaiyue Pang · Tao Xiang · Yi-Zhe Song | N/A | Code |
| TTA-COPE: Test-Time Adaptation for Category-Level Object Pose Estimation | Taeyeop Lee · Jonathan Tremblay · Valts Blukis · Bowen Wen · Byeong-Uk Lee · Inkyu Shin · Stan Birchfield · In So Kweon · Kuk-Jin Yoon | N/A | Code |
| Teleidoscopic Imaging System for Microscale 3D Shape Reconstruction | Ryo Kawahara · Meng-Yu Jennifer Kuo · Shohei Nobuhara | N/A | Code |
| Reliability in Semantic Segmentation: Are We on the Right Track? | Pau de Jorge · Riccardo Volpi · Philip H.S. Torr · Grégory Rogez | N/A | Code |
| SMPConv: Self-Moving Point Representations for Continuous Convolution | Sanghyeon Kim · Eunbyung Park | N/A | Code |
| Few-Shot Geometry-Aware Keypoint Localization | Xingzhe He · Gaurav Bharaj · David Ferman · Helge Rhodin · Pablo Garrido | N/A | Code |
| STMT: A Spatial-Temporal Mesh Transformer for MoCap-Based Action Recognition | Xiaoyu Zhu · Po-Yao Huang · Junwei Liang · Celso M. de Melo · Alexander G. Hauptmann | N/A | Code |
| Knowledge Combination To Learn Rotated Detection Without Rotated Annotation | Tianyu Zhu · Bryce Ferenczi · Pulak Purkait · Tom Drummond · Hamid Rezatofighi · Anton van den Hengel | N/A | Code |
| OTAvatar: One-Shot Talking Face Avatar With Controllable Tri-Plane Rendering | Zhiyuan Ma · Xiangyu Zhu · Guo-Jun Qi · Zhen Lei · Lei Zhang | N/A | Code |
| Supervised Masked Knowledge Distillation for Few-Shot Transformers | Han Lin · Guangxing Han · Jiawei Ma · Shiyuan Huang · Xudong Lin · Shih-Fu Chang | N/A | Code |
| Learning Open-Vocabulary Semantic Segmentation Models From Natural Language Supervision | Jilan Xu · Junlin Hou · Yuejie Zhang · Rui Feng · Yi Wang · Yu Qiao · Weidi Xie | N/A | Code |
| ImageNet-E: Benchmarking Neural Network Robustness via Attribute Editing | Xiaodan Li · Yuefeng Chen · Yao Zhu · Shuhui Wang · Rong Zhang · Hui Xue | N/A | Code |
| Neural Residual Radiance Fields for Streamably Free-Viewpoint Videos | Liao Wang · Qiang Hu · Qihan He · Ziyu Wang · Jingyi Yu · Tinne Tuytelaars · Lan Xu · Minye Wu | N/A | Code |
| Active Finetuning: Exploiting Annotation Budget in the Pretraining-Finetuning Paradigm | Yichen Xie · Han Lu · Junchi Yan · Xiaokang Yang · Masayoshi Tomizuka · Wei Zhan | N/A | Code |
| Tensor4D: Efficient Neural 4D Decomposition for High-Fidelity Dynamic Reconstruction and Rendering | Ruizhi Shao · Zerong Zheng · Hanzhang Tu · Boning Liu · Hongwen Zhang · Yebin Liu | N/A | Code |
| RiDDLE: Reversible and Diversified De-Identification With Latent Encryptor | Dongze Li · Wei Wang · Kang Zhao · Jing Dong · Tieniu Tan | N/A | Code |
| RobustNeRF: Ignoring Distractors With Robust Losses | Sara Sabour · Suhani Vora · Daniel Duckworth · Ivan Krasin · David J. Fleet · Andrea Tagliasacchi | N/A | Code |
| Bitstream-Corrupted JPEG Images Are Restorable: Two-Stage Compensation and Alignment Framework for Image Restoration | Wenyang Liu · Yi Wang · Kim-Hui Yap · Lap-Pui Chau | N/A | Code |
| HierVL: Learning Hierarchical Video-Language Embeddings | Kumar Ashutosh · Rohit Girdhar · Lorenzo Torresani · Kristen Grauman | N/A | Code |
| Phone2Proc: Bringing Robust Robots Into Our Chaotic World | Matt Deitke · Rose Hendrix · Ali Farhadi · Kiana Ehsani · Aniruddha Kembhavi | N/A | Code |
| A Light Touch Approach to Teaching Transformers Multi-View Geometry | Yash Bhalgat · João F. Henriques · Andrew Zisserman | N/A | Code |
| Clothed Human Performance Capture With a Double-Layer Neural Radiance Fields | Kangkan Wang · Guofeng Zhang · Suxu Cong · Jian Yang | N/A | Code |
| AutoFocusFormer: Image Segmentation off the Grid | Chen Ziwen · Kaushik Patnaik · Shuangfei Zhai · Alvin Wan · Zhile Ren · Alexander G. Schwing · Alex Colburn · Li Fuxin | N/A | Code |
| Trace and Pace: Controllable Pedestrian Animation via Guided Trajectory Diffusion | Davis Rempe · Zhengyi Luo · Xue Bin Peng · Ye Yuan · Kris Kitani · Karsten Kreis · Sanja Fidler · Or Litany | N/A | Code |
| Observation-Centric SORT: Rethinking SORT for Robust Multi-Object Tracking | Jinkun Cao · Jiangmiao Pang · Xinshuo Weng · Rawal Khirodkar · Kris Kitani | N/A | Code |
| Spider GAN: Leveraging Friendly Neighbors To Accelerate GAN Training | Siddarth Asokan · Chandra Sekhar Seelamantula | N/A | Code |
| Minimizing the Accumulated Trajectory Error To Improve Dataset Distillation | Jiawei Du · Yidi Jiang · Vincent Y. F. Tan · Joey Tianyi Zhou · Haizhou Li | N/A | Code |
| Adaptive Patch Deformation for Textureless-Resilient Multi-View Stereo | Yuesong Wang · Zhaojie Zeng · Tao Guan · Wei Yang · Zhuo Chen · Wenkai Liu · Luoyuan Xu · Yawei Luo | N/A | Code |
| Learning Correspondence Uncertainty via Differentiable Nonlinear Least Squares | Dominik Muhle · Lukas Koestler · Krishna Murthy Jatavallabhula · Daniel Cremers | N/A | Code |
| Learning Anchor Transformations for 3D Garment Animation | Fang Zhao · Zekun Li · Shaoli Huang · Junwu Weng · Tianfei Zhou · Guo-Sen Xie · Jue Wang · Ying Shan | N/A | Code |
| PyPose: A Library for Robot Learning With Physics-Based Optimization | Chen Wang · Dasong Gao · Kuan Xu · Junyi Geng · Yaoyu Hu · Yuheng Qiu · Bowen Li · Fan Yang · Brady Moon · Abhinav Pandey · Aryan · Jiahe Xu · Tianhao Wu · Haonan He · Daning Huang · Zhongqiang Ren · Shibo Zhao · Taimeng Fu · Pranay Reddy · Xiao Lin · Wenshan Wang · Jingnan Shi · Rajat Talak · Kun Cao · Yi Du · Han Wang · Huai Yu · Shanzhao Wang · Siyu Chen · Ananth Kashyap · Rohan Bandaru · Karthik Dantu · Jiajun Wu · Lihua Xie · Luca Carlone · Marco Hutter · Sebastian Scherer | N/A | Code |
| Unicode Analogies: An Anti-Objectivist Visual Reasoning Challenge | Steven Spratley · Krista A. Ehinger · Tim Miller | N/A | Code |
| DyNCA: Real-Time Dynamic Texture Synthesis Using Neural Cellular Automata | Ehsan Pajouheshgar · Yitao Xu · Tong Zhang · Sabine Süsstrunk | N/A | Code |
| Learning Generative Structure Prior for Blind Text Image Super-Resolution | Xiaoming Li · Wangmeng Zuo · Chen Change Loy | N/A | Code |
| CAMS: CAnonicalized Manipulation Spaces for Category-Level Functional Hand-Object Manipulation Synthesis | Juntian Zheng · Qingyuan Zheng · Lixing Fang · Yun Liu · Li Yi | N/A | Code |
| SCPNet: Semantic Scene Completion on Point Cloud | Zhaoyang Xia · Youquan Liu · Xin Li · Xinge Zhu · Yuexin Ma · Yikang Li · Yuenan Hou · Yu Qiao | N/A | Code |
| AMT: All-Pairs Multi-Field Transforms for Efficient Frame Interpolation | Zhen Li · Zuo-Liang Zhu · Ling-Hao Han · Qibin Hou · Chun-Le Guo · Ming-Ming Cheng | N/A | Code |
| Behavioral Analysis of Vision-and-Language Navigation Agents | Zijiao Yang · Arjun Majumdar · Stefan Lee | N/A | Code |
| Geometry and Uncertainty-Aware 3D Point Cloud Class-Incremental Semantic Segmentation | Yuwei Yang · Munawar Hayat · Zhao Jin · Chao Ren · Yinjie Lei | N/A | Code |
| Directional Connectivity-Based Segmentation of Medical Images | Ziyun Yang · Sina Farsiu | N/A | Code |
| ScanDMM: A Deep Markov Model of Scanpath Prediction for 360° Images | Xiangjie Sui · Yuming Fang · Hanwei Zhu · Shiqi Wang · Zhou Wang | N/A | Code |
| 3D Shape Reconstruction of Semi-Transparent Worms | Thomas P. Ilett · Omer Yuval · Thomas Ranner · Netta Cohen · David C. Hogg | N/A | Code |
| Patch-Craft Self-Supervised Training for Correlated Image Denoising | Gregory Vaksman · Michael Elad | N/A | Code |
| NeAT: Learning Neural Implicit Surfaces With Arbitrary Topologies From Multi-View Images | Xiaoxu Meng · Weikai Chen · Bo Yang | N/A | Code |
| DANI-Net: Uncalibrated Photometric Stereo by Differentiable Shadow Handling, Anisotropic Reflectance Modeling, and Neural Inverse Rendering | Zongrui Li · Qian Zheng · Boxin Shi · Gang Pan · Xudong Jiang | N/A | Code |
| Context-Aware Alignment and Mutual Masking for 3D-Language Pre-Training | Zhao Jin · Munawar Hayat · Yuwei Yang · Yulan Guo · Yinjie Lei | N/A | Code |
| Unsupervised Object Localization: Observing the Background To Discover Objects | Oriane Siméoni · Chloé Sekkat · Gilles Puy · Antonín Vobecký · Éloi Zablocki · Patrick Pérez | N/A | Code |
| Bootstrap Your Own Prior: Towards Distribution-Agnostic Novel Class Discovery | Muli Yang · Liancheng Wang · Cheng Deng · Hanwang Zhang | N/A | Code |
| Self-Supervised Geometry-Aware Encoder for Style-Based 3D GAN Inversion | Yushi Lan · Xuyi Meng · Shuai Yang · Chen Change Loy · Bo Dai | N/A | Code |
| NeuralField-LDM: Scene Generation With Hierarchical Latent Diffusion Models | Seung Wook Kim · Bradley Brown · Kangxue Yin · Karsten Kreis · Katja Schwarz · Daiqing Li · Robin Rombach · Antonio Torralba · Sanja Fidler | N/A | Code |
| ALSO: Automotive Lidar Self-Supervision by Occupancy Estimation | Alexandre Boulch · Corentin Sautier · Björn Michele · Gilles Puy · Renaud Marlet | N/A | Code |
| RepMode: Learning to Re-Parameterize Diverse Experts for Subcellular Structure Prediction | Donghao Zhou · Chunbin Gu · Junde Xu · Furui Liu · Qiong Wang · Guangyong Chen · Pheng-Ann Heng | N/A | Code |
| Aligning Bag of Regions for Open-Vocabulary Object Detection | Size Wu · Wenwei Zhang · Sheng Jin · Wentao Liu · Chen Change Loy | N/A | Code |
| Neuralangelo: High-Fidelity Neural Surface Reconstruction | Zhaoshuo Li · Thomas Müller · Alex Evans · Russell H. Taylor · Mathias Unberath · Ming-Yu Liu · Chen-Hsuan Lin | N/A | Code |
| PCT-Net: Full Resolution Image Harmonization Using Pixel-Wise Color Transformations | Julian Jorge Andrade Guerreiro · Mitsuru Nakazawa · Björn Stenger | N/A | Code |
| PaCa-ViT: Learning Patch-to-Cluster Attention in Vision Transformers | Ryan Grainger · Thomas Paniagua · Xi Song · Naresh Cuntoor · Mun Wai Lee · Tianfu Wu | N/A | Code |
| Towards Better Stability and Adaptability: Improve Online Self-Training for Model Adaptation in Semantic Segmentation | Dong Zhao · Shuang Wang · Qi Zang · Dou Quan · Xiutiao Ye · Licheng Jiao | N/A | Code |
| MEGANE: Morphable Eyeglass and Avatar Network | Junxuan Li · Shunsuke Saito · Tomas Simon · Stephen Lombardi · Hongdong Li · Jason Saragih | N/A | Code |
| Generalizable Implicit Neural Representations via Instance Pattern Composers | Chiheon Kim · Doyup Lee · Saehoon Kim · Minsu Cho · Wook-Shin Han | N/A | Code |
| Revisiting Rolling Shutter Bundle Adjustment: Toward Accurate and Fast Solution | Bangyan Liao · Delin Qu · Yifei Xue · Huiqing Zhang · Yizhen Lao | N/A | Code |
| Distribution Shift Inversion for Out-of-Distribution Prediction | Runpeng Yu · Songhua Liu · Xingyi Yang · Xinchao Wang | N/A | Code |
| Wide-Angle Rectification via Content-Aware Conformal Mapping | Qi Zhang · Hongdong Li · Qing Wang | N/A | Code |
| WildLight: In-the-Wild Inverse Rendering With a Flashlight | Ziang Cheng · Junxuan Li · Hongdong Li | N/A | Code |
| Physics-Driven Diffusion Models for Impact Sound Synthesis From Videos | Kun Su · Kaizhi Qian · Eli Shlizerman · Antonio Torralba · Chuang Gan | N/A | Code |
| Probing Neural Representations of Scene Perception in a Hippocampally Dependent Task Using Artificial Neural Networks | Markus Frey · Christian F. Doeller · Caswell Barry | N/A | Code |
| Inverting the Imaging Process by Learning an Implicit Camera Model | Xin Huang · Qi Zhang · Ying Feng · Hongdong Li · Qing Wang | N/A | Code |
| EC2: Emergent Communication for Embodied Control | Yao Mu · Shunyu Yao · Mingyu Ding · Ping Luo · Chuang Gan | N/A | Code |
| Light Source Separation and Intrinsic Image Decomposition Under AC Illumination | Yusaku Yoshida · Ryo Kawahara · Takahiro Okabe | N/A | Code |
| FREDOM: Fairness Domain Adaptation Approach to Semantic Scene Understanding | Thanh-Dat Truong · Ngan Le · Bhiksha Raj · Jackson Cothren · Khoa Luu | N/A | Code |
| Learning Locally Editable Virtual Humans | Hsuan-I Ho · Lixin Xue · Jie Song · Otmar Hilliges | N/A | Code |
| Open-Vocabulary Point-Cloud Object Detection Without 3D Annotation | Yuheng Lu · Chenfeng Xu · Xiaobao Wei · Xiaodong Xie · Masayoshi Tomizuka · Kurt Keutzer · Shanghang Zhang | N/A | Code |
| Frustratingly Easy Regularization on Representation Can Boost Deep Reinforcement Learning | Qiang He · Huangyuan Su · Jieyu Zhang · Xinwen Hou | N/A | Code |
| PiMAE: Point Cloud and Image Interactive Masked Autoencoders for 3D Object Detection | Anthony Chen · Kevin Zhang · Renrui Zhang · Zihan Wang · Yuheng Lu · Yandong Guo · Shanghang Zhang | N/A | Code |
| OrienterNet: Visual Localization in 2D Public Maps With Neural Matching | Paul-Edouard Sarlin · Daniel DeTone · Tsun-Yi Yang · Armen Avetisyan · Julian Straub · Tomasz Malisiewicz · Samuel Rota Bulò · Richard Newcombe · Peter Kontschieder · Vasileios Balntas | N/A | Code |
| Class Relationship Embedded Learning for Source-Free Unsupervised Domain Adaptation | Yixin Zhang · Zilei Wang · Weinan He | N/A | Code |
| Efficient Movie Scene Detection Using State-Space Transformers | Md Mohaiminul Islam · Mahmudul Hasan · Kishan Shamsundar Athrey · Tony Braskich · Gedas Bertasius | N/A | Code |
| Structural Multiplane Image: Bridging Neural View Synthesis and 3D Reconstruction | Mingfang Zhang · Jinglu Wang · Xiao Li · Yifei Huang · Yoichi Sato · Yan Lu | N/A | Code |
| FAME-ViL: Multi-Tasking Vision-Language Model for Heterogeneous Fashion Tasks | Xiao Han · Xiatian Zhu · Licheng Yu · Li Zhang · Yi-Zhe Song · Tao Xiang | N/A | Code |
| Understanding and Constructing Latent Modality Structures in Multi-Modal Representation Learning | Qian Jiang · Changyou Chen · Han Zhao · Liqun Chen · Qing Ping · Son Dinh Tran · Yi Xu · Belinda Zeng · Trishul Chilimbi | N/A | Code |
| Level-S$^2$fM: Structure From Motion on Neural Level Set of Implicit Surfaces | Yuxi Xiao · Nan Xue · Tianfu Wu · Gui-Song Xia | N/A | Code |
| Consistent-Teacher: Towards Reducing Inconsistent Pseudo-Targets in Semi-Supervised Object Detection | Xinjiang Wang · Xingyi Yang · Shilong Zhang · Yijiang Li · Litong Feng · Shijie Fang · Chengqi Lyu · Kai Chen · Wayne Zhang | N/A | Code |
| Dense Distinct Query for End-to-End Object Detection | Shilong Zhang · Xinjiang Wang · Jiaqi Wang · Jiangmiao Pang · Chengqi Lyu · Wenwei Zhang · Ping Luo · Kai Chen | N/A | Code |
| ARCTIC: A Dataset for Dexterous Bimanual Hand-Object Manipulation | Zicong Fan · Omid Taheri · Dimitrios Tzionas · Muhammed Kocabas · Manuel Kaufmann · Michael J. Black · Otmar Hilliges | N/A | Code |
| BiFormer: Vision Transformer With Bi-Level Routing Attention | Lei Zhu · Xinjiang Wang · Zhanghan Ke · Wayne Zhang · Rynson W.H. Lau | N/A | Code |
| Hierarchical Video-Moment Retrieval and Step-Captioning | Abhay Zala · Jaemin Cho · Satwik Kottur · Xilun Chen · Barlas Oguz · Yashar Mehdad · Mohit Bansal | N/A | Code |
| Progressive Open Space Expansion for Open-Set Model Attribution | Tianyun Yang · Danding Wang · Fan Tang · Xinying Zhao · Juan Cao · Sheng Tang | N/A | Code |
| Deep Depth Estimation From Thermal Image | Ukcheol Shin · Jinsun Park · In So Kweon | N/A | Code |
| Incremental 3D Semantic Scene Graph Prediction From RGB Sequences | Shun-Cheng Wu · Keisuke Tateno · Nassir Navab · Federico Tombari | N/A | Code |
| Visual Programming: Compositional Visual Reasoning Without Training | Tanmay Gupta · Aniruddha Kembhavi | N/A | Code |
| Change-Aware Sampling and Contrastive Learning for Satellite Images | Utkarsh Mall · Bharath Hariharan · Kavita Bala | N/A | Code |
| NULL-Text Inversion for Editing Real Images Using Guided Diffusion Models | Ron Mokady · Amir Hertz · Kfir Aberman · Yael Pritch · Daniel Cohen-Or | N/A | Code |
| RIDCP: Revitalizing Real Image Dehazing via High-Quality Codebook Priors | Rui-Qi Wu · Zheng-Peng Duan · Chun-Le Guo · Zhi Chai · Chongyi Li | N/A | Code |
| Neural Part Priors: Learning To Optimize Part-Based Object Completion in RGB-D Scans | Aleksei Bokhovkin · Angela Dai | N/A | Code |
| Hierarchical Discriminative Learning Improves Visual Representations of Biomedical Microscopy | Cheng Jiang · Xinhai Hou · Akhil Kondepudi · Asadur Chowdury · Christian W. Freudiger · Daniel A. Orringer · Honglak Lee · Todd C. Hollon | N/A | Code |
| Domain Expansion of Image Generators | Yotam Nitzan · Michaël Gharbi · Richard Zhang · Taesung Park · Jun-Yan Zhu · Daniel Cohen-Or · Eli Shechtman | N/A | Code |
| “Seeing” Electric Network Frequency From Events | Lexuan Xu · Guang Hua · Haijian Zhang · Lei Yu · Ning Qiao | N/A | Code |
| MetaFusion: Infrared and Visible Image Fusion via Meta-Feature Embedding From Object Detection | Wenda Zhao · Shigeng Xie · Fan Zhao · You He · Huchuan Lu | N/A | Code |
| Adaptive Spot-Guided Transformer for Consistent Local Feature Matching | Jiahuan Yu · Jiahao Chang · Jianfeng He · Tianzhu Zhang · Jiyang Yu · Feng Wu | N/A | Code |
| SE-ORNet: Self-Ensembling Orientation-Aware Network for Unsupervised Point Cloud Shape Correspondence | Jiacheng Deng · Chuxin Wang · Jiahao Lu · Jianfeng He · Tianzhu Zhang · Jiyang Yu · Zhe Zhang | N/A | Code |
| Dynamic Coarse-To-Fine Learning for Oriented Tiny Object Detection | Chang Xu · Jian Ding · Jinwang Wang · Wen Yang · Huai Yu · Lei Yu · Gui-Song Xia | N/A | Code |
| Out-of-Distributed Semantic Pruning for Robust Semi-Supervised Learning | Yu Wang · Pengchong Qiao · Chang Liu · Guoli Song · Xiawu Zheng · Jie Chen | N/A | Code |
| Seeing With Sound: Long-range Acoustic Beamforming for Multimodal Scene Understanding | Praneeth Chakravarthula · Jim Aldon D’Souza · Ethan Tseng · Joe Bartusek · Felix Heide | N/A | Code |
| DNF: Decouple and Feedback Network for Seeing in the Dark | Xin Jin · Ling-Hao Han · Zhen Li · Chun-Le Guo · Zhi Chai · Chongyi Li | N/A | Code |
| CoWs on Pasture: Baselines and Benchmarks for Language-Driven Zero-Shot Object Navigation | Samir Yitzhak Gadre · Mitchell Wortsman · Gabriel Ilharco · Ludwig Schmidt · Shuran Song | N/A | Code |
| NVTC: Nonlinear Vector Transform Coding | Runsen Feng · Zongyu Guo · Weiping Li · Zhibo Chen | N/A | Code |
| Towards Unified Scene Text Spotting Based on Sequence Generation | Taeho Kil · Seonghyeon Kim · Sukmin Seo · Yoonsik Kim · Daehee Kim | N/A | Code |
| Tell Me What Happened: Unifying Text-Guided Video Completion via Multimodal Masked Video Generation | Tsu-Jui Fu · Licheng Yu · Ning Zhang · Cheng-Yang Fu · Jong-Chyi Su · William Yang Wang · Sean Bell | N/A | Code |
| Fuzzy Positive Learning for Semi-Supervised Semantic Segmentation | Pengchong Qiao · Zhidan Wei · Yu Wang · Zhennan Wang · Guoli Song · Fan Xu · Xiangyang Ji · Chang Liu · Jie Chen | N/A | Code |
| Progressively Optimized Local Radiance Fields for Robust View Synthesis | Andréas Meuleman · Yu-Lun Liu · Chen Gao · Jia-Bin Huang · Changil Kim · Min H. Kim · Johannes Kopf | N/A | Code |
| Neural Map Prior for Autonomous Driving | Xuan Xiong · Yicheng Liu · Tianyuan Yuan · Yue Wang · Yilun Wang · Hang Zhao | N/A | Code |
| Efficient and Explicit Modelling of Image Hierarchies for Image Restoration | Yawei Li · Yuchen Fan · Xiaoyu Xiang · Denis Demandolx · Rakesh Ranjan · Radu Timofte · Luc Van Gool | N/A | Code |
| F2-NeRF: Fast Neural Radiance Field Training With Free Camera Trajectories | Peng Wang · Yuan Liu · Zhaoxi Chen · Lingjie Liu · Ziwei Liu · Taku Komura · Christian Theobalt · Wenping Wang | N/A | Code |
| Procedure-Aware Pretraining for Instructional Video Understanding | Honglu Zhou · Roberto Martín-Martín · Mubbasir Kapadia · Silvio Savarese · Juan Carlos Niebles | N/A | Code |
| High-Fidelity Guided Image Synthesis With Latent Diffusion Models | Jaskirat Singh · Stephen Gould · Liang Zheng | N/A | Code |
| Progressive Random Convolutions for Single Domain Generalization | Seokeon Choi · Debasmit Das · Sungha Choi · Seunghan Yang · Hyunsin Park · Sungrack Yun | N/A | Code |
| EcoTTA: Memory-Efficient Continual Test-Time Adaptation via Self-Distilled Regularization | Junha Song · Jungsoo Lee · In So Kweon · Sungha Choi | N/A | Code |
| NoPe-NeRF: Optimising Neural Radiance Field With No Pose Prior | Wenjing Bian · Zirui Wang · Kejie Li · Jia-Wang Bian · Victor Adrian Prisacariu | N/A | Code |
| GrowSP: Unsupervised Semantic Segmentation of 3D Point Clouds | Zihui Zhang · Bo Yang · Bing Wang · Bo Li | N/A | Code |
| Multi-Mode Online Knowledge Distillation for Self-Supervised Visual Representation Learning | Kaiyou Song · Jin Xie · Shan Zhang · Zimeng Luo | N/A | Code |
| Aligning Step-by-Step Instructional Diagrams to Video Demonstrations | Jiahao Zhang · Anoop Cherian · Yanbin Liu · Yizhak Ben-Shabat · Cristian Rodriguez · Stephen Gould | N/A | Code |
| ESLAM: Efficient Dense SLAM System Based on Hybrid Representation of Signed Distance Fields | Mohammad Mahdi Johari · Camilla Carta · François Fleuret | N/A | Code |
| AutoRecon: Automated 3D Object Discovery and Reconstruction | Yuang Wang · Xingyi He · Sida Peng · Haotong Lin · Hujun Bao · Xiaowei Zhou | N/A | Code |
| Ultra-High Resolution Segmentation With Ultra-Rich Context: A Novel Benchmark | Deyi Ji · Feng Zhao · Hongtao Lu · Mingyuan Tao · Jieping Ye | N/A | Code |
| NeUDF: Leaning Neural Unsigned Distance Fields With Volume Rendering | Yu-Tao Liu · Li Wang · Jie Yang · Weikai Chen · Xiaoxu Meng · Bo Yang · Lin Gao | N/A | Code |
| Improving Cross-Modal Retrieval With Set of Diverse Embeddings | Dongwon Kim · Namyup Kim · Suha Kwak | N/A | Code |
| An Image Quality Assessment Dataset for Portraits | Nicolas Chahine · Stefania Calarasanu · Davide Garcia-Civiero · Théo Cayla · Sira Ferradans · Jean Ponce | N/A | Code |
| Weakly Supervised Semantic Segmentation via Adversarial Learning of Classifier and Reconstructor | Hyeokjun Kweon · Sung-Hoon Yoon · Kuk-Jin Yoon | N/A | Code |
| NeRFLix: High-Quality Neural View Synthesis by Learning a Degradation-Driven Inter-Viewpoint MiXer | Kun Zhou · Wenbo Li · Yi Wang · Tao Hu · Nianjuan Jiang · Xiaoguang Han · Jiangbo Lu | N/A | Code |
| ShapeTalk: A Language Dataset and Framework for 3D Shape Edits and Deformations | Panos Achlioptas · Ian Huang · Minhyuk Sung · Sergey Tulyakov · Leonidas Guibas | N/A | Code |
| RelightableHands: Efficient Neural Relighting of Articulated Hand Models | Shun Iwase · Shunsuke Saito · Tomas Simon · Stephen Lombardi · Timur Bagautdinov · Rohan Joshi · Fabian Prada · Takaaki Shiratori · Yaser Sheikh · Jason Saragih | N/A | Code |
| VL-SAT: Visual-Linguistic Semantics Assisted Training for 3D Semantic Scene Graph Prediction in Point Cloud | Ziqin Wang · Bowen Cheng · Lichen Zhao · Dong Xu · Yang Tang · Lu Sheng | N/A | Code |
| MVImgNet: A Large-Scale Dataset of Multi-View Images | Xianggang Yu · Mutian Xu · Yidan Zhang · Haolin Liu · Chongjie Ye · Yushuang Wu · Zizheng Yan · Chenming Zhu · Zhangyang Xiong · Tianyou Liang · Guanying Chen · Shuguang Cui · Xiaoguang Han | N/A | Code |
| MM-3DScene: 3D Scene Understanding by Customizing Masked Modeling With Informative-Preserved Reconstruction and Self-Distilled Consistency | Mingye Xu · Mutian Xu · Tong He · Wanli Ouyang · Yali Wang · Xiaoguang Han · Yu Qiao | N/A | Code |
| Self-Guided Diffusion Models | Vincent Tao Hu · David W. Zhang · Yuki M. Asano · Gertjan J. Burghouts · Cees G. M. Snoek | N/A | Code |
| REC-MV: REconstructing 3D Dynamic Cloth From Monocular Videos | Lingteng Qiu · Guanying Chen · Jiapeng Zhou · Mutian Xu · Junle Wang · Xiaoguang Han | N/A | Code |
| OneFormer: One Transformer To Rule Universal Image Segmentation | Jitesh Jain · Jiachen Li · Mang Tik Chiu · Ali Hassani · Nikita Orlov · Humphrey Shi | N/A | Code |
| Mask-Free OVIS: Open-Vocabulary Instance Segmentation Without Manual Mask Annotations | Vibashan VS · Ning Yu · Chen Xing · Can Qin · Mingfei Gao · Juan Carlos Niebles · Vishal M. Patel · Ran Xu | N/A | Code |
| Multiclass Confidence and Localization Calibration for Object Detection | Bimsara Pathiraja · Malitha Gunawardhana · Muhammad Haris Khan | N/A | Code |
| Structured Kernel Estimation for Photon-Limited Deconvolution | Yash Sanghvi · Zhiyuan Mao · Stanley H. Chan | N/A | Code |
| CLIPPO: Image-and-Language Understanding From Pixels Only | Michael Tschannen · Basil Mustafa · Neil Houlsby | N/A | Code |
| Actionlet-Dependent Contrastive Learning for Unsupervised Skeleton-Based Action Recognition | Lilang Lin · Jiahang Zhang · Jiaying Liu | N/A | Code |
| Role of Transients in Two-Bounce Non-Line-of-Sight Imaging | Siddharth Somasundaram · Akshat Dave · Connor Henley · Ashok Veeraraghavan · Ramesh Raskar | N/A | Code |
| Shape-Aware Text-Driven Layered Video Editing | Yao-Chih Lee · Ji-Ze Genevieve Jang · Yi-Ting Chen · Elizabeth Qiu · Jia-Bin Huang | N/A | Code |
| FlexiViT: One Model for All Patch Sizes | Lucas Beyer · Pavel Izmailov · Alexander Kolesnikov · Mathilde Caron · Simon Kornblith · Xiaohua Zhai · Matthias Minderer · Michael Tschannen · Ibrahim Alabdulmohsin · Filip Pavetic | N/A | Code |
| Turning Strengths Into Weaknesses: A Certified Robustness Inspired Attack Framework Against Graph Neural Networks | Binghui Wang · Meng Pang · Yun Dong | N/A | Code |
| HairStep: Transfer Synthetic to Real Using Strand and Depth Maps for Single-View 3D Hair Modeling | Yujian Zheng · Zirong Jin · Moran Li · Haibin Huang · Chongyang Ma · Shuguang Cui · Xiaoguang Han | N/A | Code |
| RONO: Robust Discriminative Learning With Noisy Labels for 2D-3D Cross-Modal Retrieval | Yanglin Feng · Hongyuan Zhu · Dezhong Peng · Xi Peng · Peng Hu | N/A | Code |
| Learning Federated Visual Prompt in Null Space for MRI Reconstruction | Chun-Mei Feng · Bangjun Li · Xinxing Xu · Yong Liu · Huazhu Fu · Wangmeng Zuo | N/A | Code |
| VGFlow: Visibility Guided Flow Network for Human Reposing | Rishabh Jain · Krishna Kumar Singh · Mayur Hemani · Jingwan Lu · Mausoom Sarkar · Duygu Ceylan · Balaji Krishnamurthy | N/A | Code |
| Learning Attention As Disentangler for Compositional Zero-Shot Learning | Shaozhe Hao · Kai Han · Kwan-Yee K. Wong | N/A | Code |
| PET-NeuS: Positional Encoding Tri-Planes for Neural Surfaces | Yiqun Wang · Ivan Skorokhodov · Peter Wonka | N/A | Code |
| Perception-Oriented Single Image Super-Resolution Using Optimal Objective Estimation | Seung Ho Park · Young Su Moon · Nam Ik Cho | N/A | Code |
| Learning To Exploit Temporal Structure for Biomedical Vision–Language Processing | Shruthi Bannur · Stephanie Hyland · Qianchu Liu · Fernando Pérez-García · Maximilian Ilse · Daniel C. Castro · Benedikt Boecking · Harshita Sharma · Kenza Bouzid · Anja Thieme · Anton Schwaighofer · Maria Wetscherek · Matthew P. Lungren · Aditya Nori · Javier Alvarez-Valle · Ozan Oktay | N/A | Code |
| TRACE: 5D Temporal Regression of Avatars With Dynamic Cameras in 3D Environments | Yu Sun · Qian Bao · Wu Liu · Tao Mei · Michael J. Black | N/A | Code |
| Neumann Network With Recursive Kernels for Single Image Defocus Deblurring | Yuhui Quan · Zicong Wu · Hui Ji | N/A | Code |
| Guiding Pseudo-Labels With Uncertainty Estimation for Source-Free Unsupervised Domain Adaptation | Mattia Litrico · Alessio Del Bue · Pietro Morerio | N/A | Code |
| PlaneDepth: Self-Supervised Depth Estimation via Orthogonal Planes | Ruoyu Wang · Zehao Yu · Shenghua Gao | N/A | Code |
| Castling-ViT: Compressing Self-Attention via Switching Towards Linear-Angular Attention at Vision Transformer Inference | Haoran You · Yunyang Xiong · Xiaoliang Dai · Bichen Wu · Peizhao Zhang · Haoqi Fan · Peter Vajda · Yingyan (Celine) Lin | N/A | Code |
| Attention-Based Point Cloud Edge Sampling | Chengzhi Wu · Junwei Zheng · Julius Pfrommer · Jürgen Beyerer | N/A | Code |
| Structured 3D Features for Reconstructing Controllable Avatars | Enric Corona · Mihai Zanfir · Thiemo Alldieck · Eduard Gabriel Bazavan · Andrei Zanfir · Cristian Sminchisescu | N/A | Code |
| Zero-Shot Referring Image Segmentation With Global-Local Context Features | Seonghoon Yu · Paul Hongsuck Seo · Jeany Son | N/A | Code |
| CASP-Net: Rethinking Video Saliency Prediction From an Audio-Visual Consistency Perceptual Perspective | Junwen Xiong · Ganglai Wang · Peng Zhang · Wei Huang · Yufei Zha · Guangtao Zhai | N/A | Code |
| Context-Aware Relative Object Queries To Unify Video Instance and Panoptic Segmentation | Anwesa Choudhuri · Girish Chowdhary · Alexander G. Schwing | N/A | Code |
| Canonical Fields: Self-Supervised Learning of Pose-Canonicalized Neural Fields | Rohith Agaram · Shaurya Dewan · Rahul Sajnani · Adrien Poulenard · Madhava Krishna · Srinath Sridhar | N/A | Code |
| Decoupled Multimodal Distilling for Emotion Recognition | Yong Li · Yuanzhi Wang · Zhen Cui | N/A | Code |
| TensoIR: Tensorial Inverse Rendering | Haian Jin · Isabella Liu · Peijia Xu · Xiaoshuai Zhang · Songfang Han · Sai Bi · Xiaowei Zhou · Zexiang Xu · Hao Su | N/A | Code |
| Zero-Shot Generative Model Adaptation via Image-Specific Prompt Learning | Jiayi Guo · Chaofei Wang · You Wu · Eric Zhang · Kai Wang · Xingqian Xu · Shiji Song · Humphrey Shi · Gao Huang | N/A | Code |
| DiGeo: Discriminative Geometry-Aware Learning for Generalized Few-Shot Object Detection | Jiawei Ma · Yulei Niu · Jincheng Xu · Shiyuan Huang · Guangxing Han · Shih-Fu Chang | N/A | Code |
| Unbalanced Optimal Transport: A Unified Framework for Object Detection | Henri De Plaen · Pierre-François De Plaen · Johan A. K. Suykens · Marc Proesmans · Tinne Tuytelaars · Luc Van Gool | N/A | Code |
| NeRFInvertor: High Fidelity NeRF-GAN Inversion for Single-Shot Real Image Animation | Yu Yin · Kamran Ghasedi · HsiangTao Wu · Jiaolong Yang · Xin Tong · Yun Fu | N/A | Code |
| Masked Image Training for Generalizable Deep Image Denoising | Haoyu Chen · Jinjin Gu · Yihao Liu · Salma Abdel Magid · Chao Dong · Qiong Wang · Hanspeter Pfister · Lei Zhu | N/A | Code |
| Toward Verifiable and Reproducible Human Evaluation for Text-to-Image Generation | Mayu Otani · Riku Togashi · Yu Sawai · Ryosuke Ishigami · Yuta Nakashima · Esa Rahtu · Janne Heikkilä · Shin’ichi Satoh | N/A | Code |
| Towards Flexible Multi-Modal Document Models | Naoto Inoue · Kotaro Kikuchi · Edgar Simo-Serra · Mayu Otani · Kota Yamaguchi | N/A | Code |
| Zero-Shot Everything Sketch-Based Image Retrieval, and in Explainable Style | Fengyin Lin · Mingkang Li · Da Li · Timothy Hospedales · Yi-Zhe Song · Yonggang Qi | N/A | Code |
| LidarGait: Benchmarking 3D Gait Recognition With Point Clouds | Chuanfu Shen · Chao Fan · Wei Wu · Rui Wang · George Q. Huang · Shiqi Yu | N/A | Code |
| OpenGait: Revisiting Gait Recognition Towards Better Practicality | Chao Fan · Junhao Liang · Chuanfu Shen · Saihui Hou · Yongzhen Huang · Shiqi Yu | N/A | Code |
| Towards Unsupervised Object Detection From LiDAR Point Clouds | Lunjun Zhang · Anqi Joyce Yang · Yuwen Xiong · Sergio Casas · Bin Yang · Mengye Ren · Raquel Urtasun | N/A | Code |
| Visual Language Pretrained Multiple Instance Zero-Shot Transfer for Histopathology Images | Ming Y. Lu · Bowen Chen · Andrew Zhang · Drew F. K. Williamson · Richard J. Chen · Tong Ding · Long Phi Le · Yung-Sung Chuang · Faisal Mahmood | N/A | Code |
| DivClust: Controlling Diversity in Deep Clustering | Ioannis Maniadis Metaxas · Georgios Tzimiropoulos · Ioannis Patras | N/A | Code |
| AttriCLIP: A Non-Incremental Learner for Incremental Knowledge Learning | Runqi Wang · Xiaoyue Duan · Guoliang Kang · Jianzhuang Liu · Shaohui Lin · Songcen Xu · Jinhu Lü · Baochang Zhang | N/A | Code |
| Unsupervised Continual Semantic Adaptation Through Neural Rendering | Zhizheng Liu · Francesco Milano · Jonas Frey · Roland Siegwart · Hermann Blum · Cesar Cadena | N/A | Code |
| Semi-Supervised Parametric Real-World Image Harmonization | Ke Wang · Michaël Gharbi · He Zhang · Zhihao Xia · Eli Shechtman | N/A | Code |
| EqMotion: Equivariant Multi-Agent Motion Prediction With Invariant Interaction Reasoning | Chenxin Xu · Robby T. Tan · Yuhong Tan · Siheng Chen · Yu Guang Wang · Xinchao Wang · Yanfeng Wang | N/A | Code |
| BUOL: A Bottom-Up Framework With Occupancy-Aware Lifting for Panoptic 3D Scene Reconstruction From a Single Image | Tao Chu · Pan Zhang · Qiong Liu · Jiaqi Wang | N/A | Code |
| Lite-Mono: A Lightweight CNN and Transformer Architecture for Self-Supervised Monocular Depth Estimation | Ning Zhang · Francesco Nex · George Vosselman · Norman Kerle | N/A | Code |
| Novel-View Acoustic Synthesis | Changan Chen · Alexander Richard · Roman Shapovalov · Vamsi Krishna Ithapu · Natalia Neverova · Kristen Grauman · Andrea Vedaldi | N/A | Code |
| Audio-Visual Grouping Network for Sound Localization From Mixtures | Shentong Mo · Yapeng Tian | N/A | Code |
| Chat2Map: Efficient Scene Mapping From Multi-Ego Conversations | Sagnik Majumder · Hao Jiang · Pierre Moulon · Ethan Henderson · Paul Calamia · Kristen Grauman · Vamsi Krishna Ithapu | N/A | Code |
| ConvNeXt V2: Co-Designing and Scaling ConvNets With Masked Autoencoders | Sanghyun Woo · Shoubhik Debnath · Ronghang Hu · Xinlei Chen · Zhuang Liu · In So Kweon · Saining Xie | N/A | Code |
| Collaboration Helps Camera Overtake LiDAR in 3D Detection | Yue Hu · Yifan Lu · Runsheng Xu · Weidi Xie · Siheng Chen · Yanfeng Wang | N/A | Code |
| Few-Shot Learning With Visual Distribution Calibration and Cross-Modal Distribution Alignment | Runqi Wang · Hao Zheng · Xiaoyue Duan · Jianzhuang Liu · Yuning Lu · Tian Wang · Songcen Xu · Baochang Zhang | N/A | Code |
| MetaCLUE: Towards Comprehensive Visual Metaphors Research | Arjun R. Akula · Brendan Driscoll · Pradyumna Narayana · Soravit Changpinyo · Zhiwei Jia · Suyash Damle · Garima Pruthi · Sugato Basu · Leonidas Guibas · William Freeman · Yuanzhen Li · Varun Jampani | N/A | Code |
| Deep Fair Clustering via Maximizing and Minimizing Mutual Information: Theory, Algorithm and Metric | Pengxin Zeng · Yunfan Li · Peng Hu · Dezhong Peng · Jiancheng Lv · Xi Peng | N/A | Code |
| Visual Atoms: Pre-Training Vision Transformers With Sinusoidal Waves | Sora Takashima · Ryo Hayamizu · Nakamasa Inoue · Hirokatsu Kataoka · Rio Yokota | N/A | Code |
| 3D-Aware Multi-Class Image-to-Image Translation With NeRFs | Senmao Li · Joost van de Weijer · Yaxing Wang · Fahad Shahbaz Khan · Meiqin Liu · Jian Yang | N/A | Code |
| E2PN: Efficient SE(3)-Equivariant Point Network | Minghan Zhu · Maani Ghaffari · William A. Clark · Huei Peng | N/A | Code |
| PixHt-Lab: Pixel Height Based Light Effect Generation for Image Compositing | Yichen Sheng · Jianming Zhang · Julien Philip · Yannick Hold-Geoffroy · Xin Sun · He Zhang · Lu Ling · Bedrich Benes | N/A | Code |
| UniSim: A Neural Closed-Loop Sensor Simulator | Ze Yang · Yun Chen · Jingkang Wang · Sivabalan Manivasagam · Wei-Chiu Ma · Anqi Joyce Yang · Raquel Urtasun | N/A | Code |
| Occlusion-Free Scene Recovery via Neural Radiance Fields | Chengxuan Zhu · Renjie Wan · Yunkai Tang · Boxin Shi | N/A | Code |
| SPIn-NeRF: Multiview Segmentation and Perceptual Inpainting With Neural Radiance Fields | Ashkan Mirzaei · Tristan Aumentado-Armstrong · Kosta Derpanis · Jonathan Kelly · Marcus A. Brubaker · Igor Gilitschenski · Alex Levinshtein | N/A | Code |
| Class-Incremental Exemplar Compression for Class-Incremental Learning | Zilin Luo · Yaoyao Liu · Bernt Schiele · Qianru Sun | N/A | Code |
| DETRs With Hybrid Matching | Ding Jia · Yuhui Yuan · Haodi He · Xiaopei Wu · Haojun Yu · Weihong Lin · Lei Sun · Chao Zhang · Han Hu | N/A | Code |
| 3D Human Mesh Estimation From Virtual Markers | Xiaoxuan Ma · Jiajun Su · Chunyu Wang · Wentao Zhu · Yizhou Wang | N/A | Code |
| Objaverse: A Universe of Annotated 3D Objects | Matt Deitke · Dustin Schwenk · Jordi Salvador · Luca Weihs · Oscar Michel · Eli VanderBilt · Ludwig Schmidt · Kiana Ehsani · Aniruddha Kembhavi · Ali Farhadi | N/A | Code |
| Adjustment and Alignment for Unbiased Open Set Domain Adaptation | Wuyang Li · Jie Liu · Bo Han · Yixuan Yuan | N/A | Code |
| TimeBalance: Temporally-Invariant and Temporally-Distinctive Video Representations for Semi-Supervised Action Recognition | Ishan Rajendrakumar Dave · Mamshad Nayeem Rizve · Chen Chen · Mubarak Shah | N/A | Code |
| EfficientSCI: Densely Connected Network With Space-Time Factorization for Large-Scale Video Snapshot Compressive Imaging | Lishun Wang · Miao Cao · Xin Yuan | N/A | Code |
| Continual Detection Transformer for Incremental Object Detection | Yaoyao Liu · Bernt Schiele · Andrea Vedaldi · Christian Rupprecht | N/A | Code |
| Hierarchical Prompt Learning for Multi-Task Learning | Yajing Liu · Yuning Lu · Hao Liu · Yaozu An · Zhuoran Xu · Zhuokun Yao · Baofeng Zhang · Zhiwei Xiong · Chenguang Gui | N/A | Code |
| Boost Vision Transformer With GPU-Friendly Sparsity and Quantization | Chong Yu · Tao Chen · Zhongxue Gan · Jiayuan Fan | N/A | Code |
| Demystifying Causal Features on Adversarial Examples and Causal Inoculation for Robust Network by Adversarial Instrumental Variable Regression | Junho Kim · Byung-Kwan Lee · Yong Man Ro | N/A | Code |
| Regularizing Second-Order Influences for Continual Learning | Zhicheng Sun · Yadong Mu · Gang Hua | N/A | Code |
| Heterogeneous Continual Learning | Divyam Madaan · Hongxu Yin · Wonmin Byeon · Jan Kautz · Pavlo Molchanov | N/A | Code |
| DP-NeRF: Deblurred Neural Radiance Field With Physical Scene Priors | Dogyoon Lee · Minhyeok Lee · Chajin Shin · Sangyoun Lee | N/A | Code |
| 3D-POP – An Automated Annotation Approach to Facilitate Markerless 2D-3D Tracking of Freely Moving Birds With Marker-Based Motion Capture | Hemal Naik · Alex Hoi Hang Chan · Junran Yang · Mathilde Delacoux · Iain D. Couzin · Fumihiro Kano · Máté Nagy | N/A | Code |
| Recognizing Rigid Patterns of Unlabeled Point Clouds by Complete and Continuous Isometry Invariants With No False Negatives and No False Positives | Daniel Widdowson · Vitaliy Kurlin | N/A | Code |
| Robust Model-Based Face Reconstruction Through Weakly-Supervised Outlier Segmentation | Chunlu Li · Andreas Morel-Forster · Thomas Vetter · Bernhard Egger · Adam Kortylewski | N/A | Code |
| Implicit Identity Leakage: The Stumbling Block to Improving Deepfake Detection Generalization | Shichao Dong · Jin Wang · Renhe Ji · Jiajun Liang · Haoqiang Fan · Zheng Ge | N/A | Code |
| PoseExaminer: Automated Testing of Out-of-Distribution Robustness in Human Pose and Shape Estimation | Qihao Liu · Adam Kortylewski · Alan L. Yuille | N/A | Code |
| VoxelNeXt: Fully Sparse VoxelNet for 3D Object Detection and Tracking | Yukang Chen · Jianhui Liu · Xiangyu Zhang · Xiaojuan Qi · Jiaya Jia | N/A | Code |
| 1000 FPS HDR Video With a Spike-RGB Hybrid Camera | Yakun Chang · Chu Zhou · Yuchen Hong · Liwen Hu · Chao Xu · Tiejun Huang · Boxin Shi | N/A | Code |
| How to Backdoor Diffusion Models? | Sheng-Yen Chou · Pin-Yu Chen · Tsung-Yi Ho | N/A | Code |
| PIP-Net: Patch-Based Intuitive Prototypes for Interpretable Image Classification | Meike Nauta · Jörg Schlötterer · Maurice van Keulen · Christin Seifert | N/A | Code |
| Joint Token Pruning and Squeezing Towards More Aggressive Compression of Vision Transformers | Siyuan Wei · Tianzhu Ye · Shen Zhang · Yao Tang · Jiajun Liang | N/A | Code |
| Energy-Efficient Adaptive 3D Sensing | Brevin Tilmon · Zhanghao Sun · Sanjeev J. Koppal · Yicheng Wu · Georgios Evangelidis · Ramzi Zahreddine · Gurunandan Krishnan · Sizhuo Ma · Jian Wang | N/A | Code |
| Boosting Semi-Supervised Learning by Exploiting All Unlabeled Data | Yuhao Chen · Xin Tan · Borui Zhao · Zhaowei Chen · Renjie Song · Jiajun Liang · Xuequan Lu | N/A | Code |
| Fix the Noise: Disentangling Source Feature for Controllable Domain Translation | Dongyeun Lee · Jae Young Lee · Doyeon Kim · Jaehyun Choi · Jaejun Yoo · Junmo Kim | N/A | Code |
| Learning Transferable Spatiotemporal Representations From Natural Script Knowledge | Ziyun Zeng · Yuying Ge · Xihui Liu · Bin Chen · Ping Luo · Shu-Tao Xia · Yixiao Ge | N/A | Code |
| Side Adapter Network for Open-Vocabulary Semantic Segmentation | Mengde Xu · Zheng Zhang · Fangyun Wei · Han Hu · Xiang Bai | N/A | Code |
| A Strong Baseline for Generalized Few-Shot Semantic Segmentation | Sina Hajimiri · Malik Boudiaf · Ismail Ben Ayed · Jose Dolz | N/A | Code |
| Towards Compositional Adversarial Robustness: Generalizing Adversarial Training to Composite Semantic Perturbations | Lei Hsiung · Yun-Yun Tsai · Pin-Yu Chen · Tsung-Yi Ho | N/A | Code |
| Normalizing Flow Based Feature Synthesis for Outlier-Aware Object Detection | Nishant Kumar · Siniša Šegvić · Abouzar Eslami · Stefan Gumhold | N/A | Code |
| AVFace: Towards Detailed Audio-Visual 4D Face Reconstruction | Aggelina Chatziagapi · Dimitris Samaras | N/A | Code |
| Learning Semantic Relationship Among Instances for Image-Text Matching | Zheren Fu · Zhendong Mao · Yan Song · Yongdong Zhang | N/A | Code |
| Understanding Imbalanced Semantic Segmentation Through Neural Collapse | Zhisheng Zhong · Jiequan Cui · Yibo Yang · Xiaoyang Wu · Xiaojuan Qi · Xiangyu Zhang · Jiaya Jia | N/A | Code |
| SCADE: NeRFs from Space Carving With Ambiguity-Aware Depth Estimates | Mikaela Angelina Uy · Ricardo Martin-Brualla · Leonidas Guibas · Ke Li | N/A | Code |
| MonoHuman: Animatable Human Neural Field From Monocular Video | Zhengming Yu · Wei Cheng · Xian Liu · Wayne Wu · Kwan-Yee Lin | N/A | Code |
| Affection: Learning Affective Explanations for Real-World Visual Data | Panos Achlioptas · Maks Ovsjanikov · Leonidas Guibas · Sergey Tulyakov | N/A | Code |
| Sharpness-Aware Gradient Matching for Domain Generalization | Pengfei Wang · Zhaoxiang Zhang · Zhen Lei · Lei Zhang | N/A | Code |
| Generalized Decoding for Pixel, Image, and Language | Xueyan Zou · Zi-Yi Dou · Jianwei Yang · Zhe Gan · Linjie Li · Chunyuan Li · Xiyang Dai · Harkirat Behl · Jianfeng Wang · Lu Yuan · Nanyun Peng · Lijuan Wang · Yong Jae Lee · Jianfeng Gao | N/A | Code |
| How You Feelin’? Learning Emotions and Mental States in Movie Scenes | Dhruv Srivastava · Aditya Kumar Singh · Makarand Tapaswi | N/A | Code |
| Improving Visual Representation Learning Through Perceptual Understanding | Samyakh Tukra · Frederick Hoffman · Ken Chatfield | N/A | Code |
| PlenVDB: Memory Efficient VDB-Based Radiance Fields for Fast Training and Rendering | Han Yan · Celong Liu · Chao Ma · Xing Mei | N/A | Code |
| HaLP: Hallucinating Latent Positives for Skeleton-Based Self-Supervised Learning of Actions | Anshul Shah · Aniket Roy · Ketul Shah · Shlok Mishra · David Jacobs · Anoop Cherian · Rama Chellappa | N/A | Code |
| FeatureBooster: Boosting Feature Descriptors With a Lightweight Neural Network | Xinjiang Wang · Zeyu Liu · Yu Hu · Wei Xi · Wenxian Yu · Danping Zou | N/A | Code |
| ACL-SPC: Adaptive Closed-Loop System for Self-Supervised Point Cloud Completion | Sangmin Hong · Mohsen Yavartanoo · Reyhaneh Neshatavar · Kyoung Mu Lee | N/A | Code |
| NeRF in the Palm of Your Hand: Corrective Augmentation for Robotics via Novel-View Synthesis | Allan Zhou · Moo Jin Kim · Lirui Wang · Pete Florence · Chelsea Finn | N/A | Code |
| Query-Centric Trajectory Prediction | Zikang Zhou · Jianping Wang · Yung-Hui Li · Yu-Kai Huang | N/A | Code |
| EDA: Explicit Text-Decoupling and Dense Alignment for 3D Visual Grounding | Yanmin Wu · Xinhua Cheng · Renrui Zhang · Zesen Cheng · Jian Zhang | N/A | Code |
| Sliced Optimal Partial Transport | Yikun Bai · Bernhard Schmitzer · Matthew Thorpe · Soheil Kolouri | N/A | Code |
| PersonNeRF: Personalized Reconstruction From Photo Collections | Chung-Yi Weng · Pratul P. Srinivasan · Brian Curless · Ira Kemelmacher-Shlizerman | N/A | Code |
| Feature Shrinkage Pyramid for Camouflaged Object Detection With Transformers | Zhou Huang · Hang Dai · Tian-Zhu Xiang · Shuo Wang · Huai-Xin Chen · Jie Qin · Huan Xiong | N/A | Code |
| HOLODIFFUSION: Training a 3D Diffusion Model Using 2D Images | Animesh Karnewar · Andrea Vedaldi · David Novotny · Niloy J. Mitra | N/A | Code |
| Towards Efficient Use of Multi-Scale Features in Transformer-Based Object Detectors | Gongjie Zhang · Zhipeng Luo · Zichen Tian · Jingyi Zhang · Xiaoqin Zhang · Shijian Lu | N/A | Code |
| Interventional Bag Multi-Instance Learning on Whole-Slide Pathological Images | Tiancheng Lin · Zhimiao Yu · Hongyu Hu · Yi Xu · Chang-Wen Chen | N/A | Code |
| Meta-Explore: Exploratory Hierarchical Vision-and-Language Navigation Using Scene Object Spectrum Grounding | Minyoung Hwang · Jaeyeon Jeong · Minsoo Kim · Yoonseon Oh · Songhwai Oh | N/A | Code |
| Sketch2Saliency: Learning To Detect Salient Objects From Human Drawings | Ayan Kumar Bhunia · Subhadeep Koley · Amandeep Kumar · Aneeshan Sain · Pinaki Nath Chowdhury · Tao Xiang · Yi-Zhe Song | N/A | Code |
| Picture That Sketch: Photorealistic Image Generation From Abstract Sketches | Subhadeep Koley · Ayan Kumar Bhunia · Aneeshan Sain · Pinaki Nath Chowdhury · Tao Xiang · Yi-Zhe Song | N/A | Code |
| CLIP for All Things Zero-Shot Sketch-Based Image Retrieval, Fine-Grained or Not | Aneeshan Sain · Ayan Kumar Bhunia · Pinaki Nath Chowdhury · Subhadeep Koley · Tao Xiang · Yi-Zhe Song | N/A | Code |
| LANIT: Language-Driven Image-to-Image Translation for Unlabeled Data | Jihye Park · Sunwoo Kim · Soohyun Kim · Seokju Cho · Jaejun Yoo · Youngjung Uh · Seungryong Kim | N/A | Code |
| Depth Estimation From Indoor Panoramas With Neural Scene Representation | Wenjie Chang · Yueyi Zhang · Zhiwei Xiong | N/A | Code |
| What Can Human Sketches Do for Object Detection? | Pinaki Nath Chowdhury · Ayan Kumar Bhunia · Aneeshan Sain · Subhadeep Koley · Tao Xiang · Yi-Zhe Song | N/A | Code |
| SceneTrilogy: On Human Scene-Sketch and Its Complementarity With Photo and Text | Pinaki Nath Chowdhury · Ayan Kumar Bhunia · Aneeshan Sain · Subhadeep Koley · Tao Xiang · Yi-Zhe Song | N/A | Code |
| Markerless Camera-to-Robot Pose Estimation via Self-Supervised Sim-to-Real Transfer | Jingpei Lu · Florian Richter · Michael C. Yip | N/A | Code |
| Fine-Grained Audible Video Description | Xuyang Shen · Dong Li · Jinxing Zhou · Zhen Qin · Bowen He · Xiaodong Han · Aixuan Li · Mochu Xiang · Lingpeng Kong · Meng Wang · Yu Qiao · Yiran Zhong | N/A | Code |
| EfficientViT: Memory Efficient Vision Transformer With Cascaded Group Attention | Xinyu Liu · Houwen Peng · Ningxin Zheng · Yuqing Yang · Han Hu · Yixuan Yuan | N/A | Code |
| Relightable Neural Human Assets From Multi-View Gradient Illuminations | Taotao Zhou · Kai He · Di Wu · Teng Xu · Qixuan Zhang · Kuixiang Shao · Wenzheng Chen · Lan Xu · Jingyi Yu | N/A | Code |
| Music-Driven Group Choreography | Nhat Le · Thang Pham · Tuong Do · Erman Tjiputra · Quang D. Tran · Anh Nguyen | N/A | Code |
| DIP: Dual Incongruity Perceiving Network for Sarcasm Detection | Changsong Wen · Guoli Jia · Jufeng Yang | N/A | Code |
| MagicPony: Learning Articulated 3D Animals in the Wild | Shangzhe Wu · Ruining Li · Tomas Jakab · Christian Rupprecht · Andrea Vedaldi | N/A | Code |
| Preserving Linear Separability in Continual Learning by Backward Feature Projection | Qiao Gu · Dongsub Shim · Florian Shkurti | N/A | Code |
| Improving Fairness in Facial Albedo Estimation via Visual-Textual Cues | Xingyu Ren · Jiankang Deng · Chao Ma · Yichao Yan · Xiaokang Yang | N/A | Code |
| HOICLIP: Efficient Knowledge Transfer for HOI Detection With Vision-Language Models | Shan Ning · Longtian Qiu · Yongfei Liu · Xuming He | N/A | Code |
| Regularization of Polynomial Networks for Image Recognition | Grigorios G. Chrysos · Bohan Wang · Jiankang Deng · Volkan Cevher | N/A | Code |
| Exploiting Unlabelled Photos for Stronger Fine-Grained SBIR | Aneeshan Sain · Ayan Kumar Bhunia · Subhadeep Koley · Pinaki Nath Chowdhury · Soumitri Chattopadhyay · Tao Xiang · Yi-Zhe Song | N/A | Code |
| Learning Semantic-Aware Knowledge Guidance for Low-Light Image Enhancement | Yuhui Wu · Chen Pan · Guoqing Wang · Yang Yang · Jiwei Wei · Chongyi Li · Heng Tao Shen | N/A | Code |
| Block Selection Method for Using Feature Norm in Out-of-Distribution Detection | Yeonguk Yu · Sungho Shin · Seongju Lee · Changhyun Jun · Kyoobin Lee | N/A | Code |
| HouseDiffusion: Vector Floorplan Generation via a Diffusion Model With Discrete and Continuous Denoising | Mohammad Amin Shabani · Sepidehsadat Hosseini · Yasutaka Furukawa | N/A | Code |
| Integral Neural Networks | Kirill Solodskikh · Azim Kurbanov · Ruslan Aydarkhanov · Irina Zhelavskaya · Yury Parfenov · Dehua Song · Stamatios Lefkimmiatis | N/A | Code |
| FitMe: Deep Photorealistic 3D Morphable Model Avatars | Alexandros Lattas · Stylianos Moschoglou · Stylianos Ploumpis · Baris Gecer · Jiankang Deng · Stefanos Zafeiriou | N/A | Code |
| Sound to Visual Scene Generation by Audio-to-Visual Latent Alignment | Kim Sung-Bin · Arda Senocak · Hyunwoo Ha · Andrew Owens · Tae-Hyun Oh | N/A | Code |
| Introducing Competition To Boost the Transferability of Targeted Adversarial Examples Through Clean Feature Mixup | Junyoung Byun · Myung-Joon Kwon · Seungju Cho · Yoonji Kim · Changick Kim | N/A | Code |
| Initialization Noise in Image Gradients and Saliency Maps | Ann-Christin Woerl · Jan Disselhoff · Michael Wand | N/A | Code |
| Two-Shot Video Object Segmentation | Kun Yan · Xiao Li · Fangyun Wei · Jinglu Wang · Chenbin Zhang · Ping Wang · Yan Lu | N/A | Code |
| SCOOP: Self-Supervised Correspondence and Optimization-Based Scene Flow | Itai Lang · Dror Aiger · Forrester Cole · Shai Avidan · Michael Rubinstein | N/A | Code |
| Co-SLAM: Joint Coordinate and Sparse Parametric Encodings for Neural Real-Time SLAM | Hengyi Wang · Jingwen Wang · Lourdes Agapito | N/A | Code |
| Semantic-Promoted Debiasing and Background Disambiguation for Zero-Shot Instance Segmentation | Shuting He · Henghui Ding · Wei Jiang | N/A | Code |
| Rawgment: Noise-Accounted RAW Augmentation Enables Recognition in a Wide Variety of Environments | Masakazu Yoshimura · Junji Otsuka · Atsushi Irie · Takeshi Ohashi | N/A | Code |
| Diffusion-Based Signed Distance Fields for 3D Shape Generation | Jaehyeok Shim · Changwoo Kang · Kyungdon Joo | N/A | Code |
| Handwritten Text Generation From Visual Archetypes | Vittorio Pippi · Silvia Cascianelli · Rita Cucchiara | N/A | Code |
| Novel Class Discovery for 3D Point Cloud Semantic Segmentation | Luigi Riz · Cristiano Saltori · Elisa Ricci · Fabio Poiesi | N/A | Code |
| DeltaEdit: Exploring Text-Free Training for Text-Driven Image Manipulation | Yueming Lyu · Tianwei Lin · Fu Li · Dongliang He · Jing Dong · Tieniu Tan | N/A | Code |
| SkyEye: Self-Supervised Bird’s-Eye-View Semantic Mapping Using Monocular Frontal View Images | Nikhil Gosala · Kürsat Petek · Paulo L. J. Drews-Jr · Wolfram Burgard · Abhinav Valada | N/A | Code |
| Towards Open-World Segmentation of Parts | Tai-Yu Pan · Qing Liu · Wei-Lun Chao · Brian Price | N/A | Code |
| DeepMapping2: Self-Supervised Large-Scale LiDAR Map Optimization | Chao Chen · Xinhao Liu · Yiming Li · Li Ding · Chen Feng | N/A | Code |
| SINE: SINgle Image Editing With Text-to-Image Diffusion Models | Zhixing Zhang · Ligong Han · Arnab Ghosh · Dimitris N. Metaxas · Jian Ren | N/A | Code |
| Discriminative Co-Saliency and Background Mining Transformer for Co-Salient Object Detection | Long Li · Junwei Han · Ni Zhang · Nian Liu · Salman Khan · Hisham Cholakkal · Rao Muhammad Anwer · Fahad Shahbaz Khan | N/A | Code |
| TruFor: Leveraging All-Round Clues for Trustworthy Image Forgery Detection and Localization | Fabrizio Guillaro · Davide Cozzolino · Avneesh Sud · Nicholas Dufour · Luisa Verdoliva | N/A | Code |
| SeSDF: Self-Evolved Signed Distance Field for Implicit 3D Clothed Human Reconstruction | Yukang Cao · Kai Han · Kwan-Yee K. Wong | N/A | Code |
| Hubs and Hyperspheres: Reducing Hubness and Improving Transductive Few-Shot Learning With Hyperspherical Embeddings | Daniel J. Trosten · Rwiddhi Chakraborty · Sigurd Løkse · Kristoffer Knutsen Wickstrøm · Robert Jenssen · Michael C. Kampffmeyer | N/A | Code |
| MAGE: MAsked Generative Encoder To Unify Representation Learning and Image Synthesis | Tianhong Li · Huiwen Chang · Shlok Mishra · Han Zhang · Dina Katabi · Dilip Krishnan | N/A | Code |
| Model Barrier: A Compact Un-Transferable Isolation Domain for Model Intellectual Property Protection | Lianyu Wang · Meng Wang · Daoqiang Zhang · Huazhu Fu | N/A | Code |
| OvarNet: Towards Open-Vocabulary Object Attribute Recognition | Keyan Chen · Xiaolong Jiang · Yao Hu · Xu Tang · Yan Gao · Jianqi Chen · Weidi Xie | N/A | Code |
| GINA-3D: Learning To Generate Implicit Neural Assets in the Wild | Bokui Shen · Xinchen Yan · Charles R. Qi · Mahyar Najibi · Boyang Deng · Leonidas Guibas · Yin Zhou · Dragomir Anguelov | N/A | Code |
| PoseFormerV2: Exploring Frequency Domain for Efficient and Robust 3D Human Pose Estimation | Qitao Zhao · Ce Zheng · Mengyuan Liu · Pichao Wang · Chen Chen | N/A | Code |
| Proposal-Based Multiple Instance Learning for Weakly-Supervised Temporal Action Localization | Huan Ren · Wenfei Yang · Tianzhu Zhang · Yongdong Zhang | N/A | Code |
| Learning Partial Correlation Based Deep Visual Representation for Image Classification | Saimunur Rahman · Piotr Koniusz · Lei Wang · Luping Zhou · Peyman Moghadam · Changming Sun | N/A | Code |
| Multi-Granularity Archaeological Dating of Chinese Bronze Dings Based on a Knowledge-Guided Relation Graph | Rixin Zhou · Jiafu Wei · Qian Zhang · Ruihua Qi · Xi Yang · Chuntao Li | N/A | Code |
| DexArt: Benchmarking Generalizable Dexterous Manipulation With Articulated Objects | Chen Bao · Helin Xu · Yuzhe Qin · Xiaolong Wang | N/A | Code |
| Modeling the Distributional Uncertainty for Salient Object Detection Models | Xinyu Tian · Jing Zhang · Mochu Xiang · Yuchao Dai | N/A | Code |
| Evading Forensic Classifiers With Attribute-Conditioned Adversarial Faces | Fahad Shamshad · Koushik Srivatsan · Karthik Nandakumar | N/A | Code |
| Scene-Aware Egocentric 3D Human Pose Estimation | Jian Wang · Diogo Luvizon · Weipeng Xu · Lingjie Liu · Kripasindhu Sarkar · Christian Theobalt | N/A | Code |
| Camouflaged Instance Segmentation via Explicit De-Camouflaging | Naisong Luo · Yuwen Pan · Rui Sun · Tianzhu Zhang · Zhiwei Xiong · Feng Wu | N/A | Code |
| N-Gram in Swin Transformers for Efficient Lightweight Image Super-Resolution | Haram Choi · Jeongmin Lee · Jihoon Yang | N/A | Code |
| Diffusion Video Autoencoders: Toward Temporally Consistent Face Video Editing via Disentangled Video Encoding | Gyeongman Kim · Hajin Shim · Hyunsu Kim · Yunjey Choi · Junho Kim · Eunho Yang | N/A | Code |
| GLIGEN: Open-Set Grounded Text-to-Image Generation | Yuheng Li · Haotian Liu · Qingyang Wu · Fangzhou Mu · Jianwei Yang · Jianfeng Gao · Chunyuan Li · Yong Jae Lee | N/A | Code |
| Balanced Spherical Grid for Egocentric View Synthesis | Changwoon Choi · Sang Min Kim · Young Min Kim | N/A | Code |
| V2V4Real: A Real-World Large-Scale Dataset for Vehicle-to-Vehicle Cooperative Perception | Runsheng Xu · Xin Xia · Jinlong Li · Hanzhao Li · Shuo Zhang · Zhengzhong Tu · Zonglin Meng · Hao Xiang · Xiaoyu Dong · Rui Song · Hongkai Yu · Bolei Zhou · Jiaqi Ma | N/A | Code |
| VindLU: A Recipe for Effective Video-and-Language Pretraining | Feng Cheng · Xizi Wang · Jie Lei · David Crandall · Mohit Bansal · Gedas Bertasius | N/A | Code |
| FreeSeg: Unified, Universal and Open-Vocabulary Image Segmentation | Jie Qin · Jie Wu · Pengxiang Yan · Ming Li · Ren Yuxi · Xuefeng Xiao · Yitong Wang · Rui Wang · Shilei Wen · Xin Pan · Xingang Wang | N/A | Code |
| NeurOCS: Neural NOCS Supervision for Monocular 3D Object Localization | Zhixiang Min · Bingbing Zhuang · Samuel Schulter · Buyu Liu · Enrique Dunn · Manmohan Chandraker | N/A | Code |
| ABCD: Arbitrary Bitwise Coefficient for De-Quantization | Woo Kyoung Han · Byeonghun Lee · Sang Hyun Park · Kyong Hwan Jin | N/A | Code |
| PromptCAL: Contrastive Affinity Learning via Auxiliary Prompts for Generalized Novel Category Discovery | Sheng Zhang · Salman Khan · Zhiqiang Shen · Muzammal Naseer · Guangyi Chen · Fahad Shahbaz Khan | N/A | Code |
| Mitigating Task Interference in Multi-Task Learning via Explicit Task Routing With Non-Learnable Primitives | Chuntao Ding · Zhichao Lu · Shangguang Wang · Ran Cheng · Vishnu Naresh Boddeti | N/A | Code |
| MaPLe: Multi-Modal Prompt Learning | Muhammad Uzair Khattak · Hanoona Rasheed · Muhammad Maaz · Salman Khan · Fahad Shahbaz Khan | N/A | Code |
| Revisiting Residual Networks for Adversarial Robustness | Shihua Huang · Zhichao Lu · Kalyanmoy Deb · Vishnu Naresh Boddeti | N/A | Code |
| Bridging the Gap Between Model Explanations in Partially Annotated Multi-Label Classification | Youngwook Kim · Jae Myung Kim · Jieun Jeong · Cordelia Schmid · Zeynep Akata · Jungwoo Lee | N/A | Code |
| Human Pose Estimation in Extremely Low-Light Conditions | Sohyun Lee · Jaesung Rim · Boseung Jeong · Geonu Kim · Byungju Woo · Haechan Lee · Sunghyun Cho · Suha Kwak | N/A | Code |
| Towards Robust Tampered Text Detection in Document Image: New Dataset and New Solution | Chenfan Qu · Chongyu Liu · Yuliang Liu · Xinhong Chen · Dezhi Peng · Fengjun Guo · Lianwen Jin | N/A | Code |
| SinGRAF: Learning a 3D Generative Radiance Field for a Single Scene | Minjung Son · Jeong Joon Park · Leonidas Guibas · Gordon Wetzstein | N/A | Code |
| LEGO-Net: Learning Regular Rearrangements of Objects in Rooms | Qiuhong Anna Wei · Sijie Ding · Jeong Joon Park · Rahul Sajnani · Adrien Poulenard · Srinath Sridhar · Leonidas Guibas | N/A | Code |
| MACARONS: Mapping and Coverage Anticipation With RGB Online Self-Supervision | Antoine Guédon · Tom Monnier · Pascal Monasse · Vincent Lepetit | N/A | Code |
| ALTO: Alternating Latent Topologies for Implicit 3D Reconstruction | Zhen Wang · Shijie Zhou · Jeong Joon Park · Despoina Paschalidou · Suya You · Gordon Wetzstein · Leonidas Guibas · Achuta Kadambi | N/A | Code |
| Class Prototypes Based Contrastive Learning for Classifying Multi-Label and Fine-Grained Educational Videos | Rohit Gupta · Anirban Roy · Claire Christensen · Sujeong Kim · Sarah Gerard · Madeline Cincebeaux · Ajay Divakaran · Todd Grindal · Mubarak Shah | N/A | Code |
| DeepMAD: Mathematical Architecture Design for Deep Convolutional Neural Network | Xuan Shen · Yaohua Wang · Ming Lin · Yilun Huang · Hao Tang · Xiuyu Sun · Yanzhi Wang | N/A | Code |
| ReLight My NeRF: A Dataset for Novel View Synthesis and Relighting of Real World Objects | Marco Toschi · Riccardo De Matteo · Riccardo Spezialetti · Daniele De Gregorio · Luigi Di Stefano · Samuele Salti | N/A | Code |
| Exact-NeRF: An Exploration of a Precise Volumetric Parameterization for Neural Radiance Fields | Brian K. S. Isaac-Medina · Chris G. Willcocks · Toby P. Breckon | N/A | Code |
| A Generalized Framework for Video Instance Segmentation | Miran Heo · Sukjun Hwang · Jeongseok Hyun · Hanjung Kim · Seoung Wug Oh · Joon-Young Lee · Seon Joo Kim | N/A | Code |
| Video Probabilistic Diffusion Models in Projected Latent Space | Sihyun Yu · Kihyuk Sohn · Subin Kim · Jinwoo Shin | N/A | Code |
| X-Avatar: Expressive Human Avatars | Kaiyue Shen · Chen Guo · Manuel Kaufmann · Juan Jose Zarate · Julien Valentin · Jie Song · Otmar Hilliges | N/A | Code |
| Hi4D: 4D Instance Segmentation of Close Human Interaction | Yifei Yin · Chen Guo · Manuel Kaufmann · Juan Jose Zarate · Jie Song · Otmar Hilliges | N/A | Code |
| Joint Appearance and Motion Learning for Efficient Rolling Shutter Correction | Bin Fan · Yuxin Mao · Mochu Xiang · Zhexiong Wan · Qi Liu | N/A | Code |
| Vita-CLIP: Video and Text Adaptive CLIP via Multimodal Prompting | Syed Talal Wasim · Muzammal Naseer · Salman Khan · Fahad Shahbaz Khan · Mubarak Shah | N/A | Code |
| MaskSketch: Unpaired Structure-Guided Masked Image Generation | Dina Bashkirova · José Lezama · Kihyuk Sohn · Kate Saenko · Irfan Essa | N/A | Code |
| Super-CLEVR: A Virtual Benchmark To Diagnose Domain Robustness in Visual Reasoning | Zhuowan Li · Xingrui Wang · Elias Stengel-Eskin · Adam Kortylewski · Wufei Ma · Benjamin Van Durme · Alan L. Yuille | N/A | Code |
| CREPE: Can Vision-Language Foundation Models Reason Compositionally? | Zixian Ma · Jerry Hong · Mustafa Omer Gul · Mona Gandhi · Irena Gao · Ranjay Krishna | N/A | Code |
| ORCa: Glossy Objects As Radiance-Field Cameras | Kushagra Tiwary · Akshat Dave · Nikhil Behari · Tzofi Klinghoffer · Ashok Veeraraghavan · Ramesh Raskar | N/A | Code |
| Learning Common Rationale To Improve Self-Supervised Representation for Fine-Grained Visual Recognition Problems | Yangyang Shu · Anton van den Hengel · Lingqiao Liu | N/A | Code |
| Implicit Occupancy Flow Fields for Perception and Prediction in Self-Driving | Ben Agro · Quinlan Sykora · Sergio Casas · Raquel Urtasun | N/A | Code |
| Improved Test-Time Adaptation for Domain Generalization | Liang Chen · Yong Zhang · Yibing Song · Ying Shan · Lingqiao Liu | N/A | Code |
| Wavelet Diffusion Models Are Fast and Scalable Image Generators | Hao Phung · Quan Dao · Anh Tran | N/A | Code |
| Robust Dynamic Radiance Fields | Yu-Lun Liu · Chen Gao · Andréas Meuleman · Hung-Yu Tseng · Ayush Saraf · Changil Kim · Yung-Yu Chuang · Johannes Kopf · Jia-Bin Huang | N/A | Code |
| MixSim: A Hierarchical Framework for Mixed Reality Traffic Simulation | Simon Suo · Kelvin Wong · Justin Xu · James Tu · Alexander Cui · Sergio Casas · Raquel Urtasun | N/A | Code |
| Gated Multi-Resolution Transfer Network for Burst Restoration and Enhancement | Nancy Mehta · Akshay Dudhane · Subrahmanyam Murala · Syed Waqas Zamir · Salman Khan · Fahad Shahbaz Khan | N/A | Code |
| Class Adaptive Network Calibration | Bingyuan Liu · Jérôme Rony · Adrian Galdran · Jose Dolz · Ismail Ben Ayed | N/A | Code |
| PROB: Probabilistic Objectness for Open World Object Detection | Orr Zohar · Kuan-Chieh Wang · Serena Yeung | N/A | Code |
| Matching Is Not Enough: A Two-Stage Framework for Category-Agnostic Pose Estimation | Min Shi · Zihao Huang · Xianzheng Ma · Xiaowei Hu · Zhiguo Cao | N/A | Code |
| HyperCUT: Video Sequence From a Single Blurry Image Using Unsupervised Ordering | Bang-Dang Pham · Phong Tran · Anh Tran · Cuong Pham · Rang Nguyen · Minh Hoai | N/A | Code |
| On the Effects of Self-Supervision and Contrastive Alignment in Deep Multi-View Clustering | Daniel J. Trosten · Sigurd Løkse · Robert Jenssen · Michael C. Kampffmeyer | N/A | Code |
| Visual Prompt Tuning for Generative Transfer Learning | Kihyuk Sohn · Huiwen Chang · José Lezama · Luisa Polania · Han Zhang · Yuan Hao · Irfan Essa · Lu Jiang | N/A | Code |
| Towards End-to-End Generative Modeling of Long Videos With Memory-Efficient Bidirectional Transformers | Jaehoon Yoo · Semin Kim · Doyup Lee · Chiheon Kim · Seunghoon Hong | N/A | Code |
| MAGVIT: Masked Generative Video Transformer | Lijun Yu · Yong Cheng · Kihyuk Sohn · José Lezama · Han Zhang · Huiwen Chang · Alexander G. Hauptmann · Ming-Hsuan Yang · Yuan Hao · Irfan Essa · Lu Jiang | N/A | Code |
| NICO++: Towards Better Benchmarking for Domain Generalization | Xingxuan Zhang · Yue He · Renzhe Xu · Han Yu · Zheyan Shen · Peng Cui | N/A | Code |
| Gradient Norm Aware Minimization Seeks First-Order Flatness and Improves Generalization | Xingxuan Zhang · Renzhe Xu · Han Yu · Hao Zou · Peng Cui | N/A | Code |
| All-in-Focus Imaging From Event Focal Stack | Hanyue Lou · Minggui Teng · Yixin Yang · Boxin Shi | N/A | Code |
| Clover: Towards a Unified Video-Language Alignment and Fusion Model | Jingjia Huang · Yinan Li · Jiashi Feng · Xinglong Wu · Xiaoshuai Sun · Rongrong Ji | N/A | Code |
| UMat: Uncertainty-Aware Single Image High Resolution Material Capture | Carlos Rodriguez-Pardo · Henar Domínguez-Elvira · David Pascual-Hernández · Elena Garces | N/A | Code |
| Polarimetric iToF: Measuring High-Fidelity Depth Through Scattering Media | Daniel S. Jeon · Andréas Meuleman · Seung-Hwan Baek · Min H. Kim | N/A | Code |
| Freestyle Layout-to-Image Synthesis | Han Xue · Zhiwu Huang · Qianru Sun · Li Song · Wenjun Zhang | N/A | Code |
| Nighttime Smartphone Reflective Flare Removal Using Optical Center Symmetry Prior | Yuekun Dai · Yihang Luo · Shangchen Zhou · Chongyi Li · Chen Change Loy | N/A | Code |
| Meta Omnium: A Benchmark for General-Purpose Learning-To-Learn | Ondrej Bohdal · Yinbing Tian · Yongshuo Zong · Ruchika Chavhan · Da Li · Henry Gouk · Li Guo · Timothy Hospedales | N/A | Code |
| EXCALIBUR: Encouraging and Evaluating Embodied Exploration | Hao Zhu · Raghav Kapoor · So Yeon Min · Winson Han · Jiatai Li · Kaiwen Geng · Graham Neubig · Yonatan Bisk · Aniruddha Kembhavi · Luca Weihs | N/A | Code |
| Detection of Out-of-Distribution Samples Using Binary Neuron Activation Patterns | Bartłomiej Olber · Krystian Radlak · Adam Popowicz · Michal Szczepankiewicz · Krystian Chachuła | N/A | Code |
| Shakes on a Plane: Unsupervised Depth Estimation From Unstabilized Photography | Ilya Chugunov · Yuxuan Zhang · Felix Heide | N/A | Code |
| JacobiNeRF: NeRF Shaping With Mutual Information Gradients | Xiaomeng Xu · Yanchao Yang · Kaichun Mo · Boxiao Pan · Li Yi · Leonidas Guibas | N/A | Code |
| MixMAE: Mixed and Masked Autoencoder for Efficient Pretraining of Hierarchical Vision Transformers | Jihao Liu · Xin Huang · Jinliang Zheng · Yu Liu · Hongsheng Li | N/A | Code |
| Synthesizing Photorealistic Virtual Humans Through Cross-Modal Disentanglement | Siddarth Ravichandran · Ondřej Texler · Dimitar Dinev · Hyun Jae Kang | N/A | Code |
| CIMI4D: A Large Multimodal Climbing Motion Dataset Under Human-Scene Interactions | Ming Yan · Xin Wang · Yudi Dai · Siqi Shen · Chenglu Wen · Lan Xu · Yuexin Ma · Cheng Wang | N/A | Code |
| SLOPER4D: A Scene-Aware Dataset for Global 4D Human Pose Estimation in Urban Environments | Yudi Dai · Yitai Lin · Xiping Lin · Chenglu Wen · Lan Xu · Hongwei Yi · Siqi Shen · Yuexin Ma · Cheng Wang | N/A | Code |
| Viewpoint Equivariance for Multi-View 3D Object Detection | Dian Chen · Jie Li · Vitor Guizilini · Rares Andrei Ambrus · Adrien Gaidon | N/A | Code |
| Balanced Product of Calibrated Experts for Long-Tailed Recognition | Emanuel Sanchez Aimar · Arvi Jonnarth · Michael Felsberg · Marco Kuhlmann | N/A | Code |
| Robust Mean Teacher for Continual and Gradual Test-Time Adaptation | Mario Döbler · Robert A. Marsden · Bin Yang | N/A | Code |
| Positive-Augmented Contrastive Learning for Image and Video Captioning Evaluation | Sara Sarto · Manuele Barraco · Marcella Cornia · Lorenzo Baraldi · Rita Cucchiara | N/A | Code |
| BITE: Beyond Priors for Improved Three-D Dog Pose Estimation | Nadine Rüegg · Shashank Tripathi · Konrad Schindler · Michael J. Black · Silvia Zuffi | N/A | Code |
| SparseFusion: Distilling View-Conditioned Diffusion for 3D Reconstruction | Zhizhuo Zhou · Shubham Tulsiani | N/A | Code |
| PeakConv: Learning Peak Receptive Field for Radar Semantic Segmentation | Liwen Zhang · Xinyan Zhang · Youcheng Zhang · Yufei Guo · Yuanpei Chen · Xuhui Huang · Zhe Ma | N/A | Code |
| Masked Wavelet Representation for Compact Neural Radiance Fields | Daniel Rho · Byeonghyeon Lee · Seungtae Nam · Joo Chan Lee · Jong Hwan Ko · Eunbyung Park | N/A | Code |
| Guided Depth Super-Resolution by Deep Anisotropic Diffusion | Nando Metzger · Rodrigo Caye Daudt · Konrad Schindler | N/A | Code |
| Masked Images Are Counterfactual Samples for Robust Fine-Tuning | Yao Xiao · Ziyi Tang · Pengxu Wei · Cong Liu · Liang Lin | N/A | Code |
| Unsupervised Deep Probabilistic Approach for Partial Point Cloud Registration | Guofeng Mei · Hao Tang · Xiaoshui Huang · Weijie Wang · Juan Liu · Jian Zhang · Luc Van Gool · Qiang Wu | N/A | Code |
| ECON: Explicit Clothed Humans Optimized via Normal Integration | Yuliang Xiu · Jinlong Yang · Xu Cao · Dimitrios Tzionas · Michael J. Black | N/A | Code |
| GEN: Pushing the Limits of Softmax-Based Out-of-Distribution Detection | Xixi Liu · Yaroslava Lochman · Christopher Zach | N/A | Code |
| OCTET: Object-Aware Counterfactual Explanations | Mehdi Zemni · Mickaël Chen · Éloi Zablocki · Hédi Ben-Younes · Patrick Pérez · Matthieu Cord | N/A | Code |
| Consistent View Synthesis With Pose-Guided Diffusion Models | Hung-Yu Tseng · Qinbo Li · Changil Kim · Suhib Alsisan · Jia-Bin Huang · Johannes Kopf | N/A | Code |
| GFPose: Learning 3D Human Pose Prior With Gradient Fields | Hai Ci · Mingdong Wu · Wentao Zhu · Xiaoxuan Ma · Hao Dong · Fangwei Zhong · Yizhou Wang | N/A | Code |
| Bayesian Posterior Approximation With Stochastic Ensembles | Oleksandr Balabanov · Bernhard Mehlig · Hampus Linander | N/A | Code |
| Spatio-Focal Bidirectional Disparity Estimation From a Dual-Pixel Image | Donggun Kim · Hyeonjoong Jang · Inchul Kim · Min H. Kim | N/A | Code |
| Octree Guided Unoriented Surface Reconstruction | Chamin Hewa Koneputugodage · Yizhak Ben-Shabat · Stephen Gould | N/A | Code |
| HAAV: Hierarchical Aggregation of Augmented Views for Image Captioning | Chia-Wen Kuo · Zsolt Kira | N/A | Code |
| SUDS: Scalable Urban Dynamic Scenes | Haithem Turki · Jason Y. Zhang · Francesco Ferroni · Deva Ramanan | N/A | Code |
| Harmonious Feature Learning for Interactive Hand-Object Pose Estimation | Zhifeng Lin · Changxing Ding · Huan Yao · Zengsheng Kuang · Shaoli Huang | N/A | Code |
| Modernizing Old Photos Using Multiple References via Photorealistic Style Transfer | Agus Gunawan · Soo Ye Kim · Hyeonjun Sim · Jae-Ho Lee · Munchurl Kim | N/A | Code |
| Trainable Projected Gradient Method for Robust Fine-Tuning | Junjiao Tian · Zecheng He · Xiaoliang Dai · Chih-Yao Ma · Yen-Cheng Liu · Zsolt Kira | N/A | Code |
| OReX: Object Reconstruction From Planar Cross-Sections Using Neural Fields | Haim Sawdayee · Amir Vaxman · Amit H. Bermano | N/A | Code |
| CARTO: Category and Joint Agnostic Reconstruction of ARTiculated Objects | Nick Heppert · Zubair Irshad · Sergey Zakharov · Katherine Liu · Rares Andrei Ambrus · Jeannette Bohg · Abhinav Valada · Thomas Kollar | N/A | Code |
| ACR: Attention Collaboration-Based Regressor for Arbitrary Two-Hand Reconstruction | Zhengdi Yu · Shaoli Huang · Chen Fang · Toby P. Breckon · Jue Wang | N/A | Code |
| Perception and Semantic Aware Regularization for Sequential Confidence Calibration | Zhenghua Peng · Yu Luo · Tianshui Chen · Keke Xu · Shuangping Huang | N/A | Code |
| Crowd3D: Towards Hundreds of People Reconstruction From a Single Image | Hao Wen · Jing Huang · Huili Cui · Haozhe Lin · Yu-Kun Lai · Lu Fang · Kun Li | N/A | Code |
| ZegCLIP: Towards Adapting CLIP for Zero-Shot Semantic Segmentation | Ziqin Zhou · Yinjie Lei · Bowen Zhang · Lingqiao Liu · Yifan Liu | N/A | Code |
| Learning Semantic-Aware Disentangled Representation for Flexible 3D Human Body Editing | Xiaokun Sun · Qiao Feng · Xiongzheng Li · Jinsong Zhang · Yu-Kun Lai · Jingyu Yang · Kun Li | N/A | Code |
| Skinned Motion Retargeting With Residual Perception of Motion Semantics & Geometry | Jiaxu Zhang · Junwu Weng · Di Kang · Fang Zhao · Shaoli Huang · Xuefei Zhe · Linchao Bao · Ying Shan · Jue Wang · Zhigang Tu | N/A | Code |
| Unknown Sniffer for Object Detection: Don’t Turn a Blind Eye to Unknown Objects | Wenteng Liang · Feng Xue · Yihao Liu · Guofeng Zhong · Anlong Ming | N/A | Code |
| RangeViT: Towards Vision Transformers for 3D Semantic Segmentation in Autonomous Driving | Angelika Ando · Spyros Gidaris · Andrei Bursuc · Gilles Puy · Alexandre Boulch · Renaud Marlet | N/A | Code |
| 3D-Aware Facial Landmark Detection via Multi-View Consistent Training on Synthetic Data | Libing Zeng · Lele Chen · Wentao Bao · Zhong Li · Yi Xu · Junsong Yuan · Nima Khademi Kalantari | N/A | Code |
| Best of Both Worlds: Multimodal Contrastive Learning With Tabular and Imaging Data | Paul Hager · Martin J. Menten · Daniel Rueckert | N/A | Code |
| JRDB-Pose: A Large-Scale Dataset for Multi-Person Pose Estimation and Tracking | Edward Vendrow · Tho Le · Jianfei Cai · Hamid Rezatofighi | N/A | Code |
| Consistent Direct Time-of-Flight Video Depth Super-Resolution | Zhanghao Sun · Wei Ye · Jinhui Xiong · Gyeongmin Choe · Jialiang Wang · Shuochen Su · Rakesh Ranjan | N/A | Code |
| Correlational Image Modeling for Self-Supervised Visual Pre-Training | Wei Li · Jiahao Xie · Chen Change Loy | N/A | Code |
| CelebV-Text: A Large-Scale Facial Text-Video Dataset | Jianhui Yu · Hao Zhu · Liming Jiang · Chen Change Loy · Weidong Cai · Wayne Wu | N/A | Code |
| Are Binary Annotations Sufficient? Video Moment Retrieval via Hierarchical Uncertainty-Based Active Learning | Wei Ji · Renjie Liang · Zhedong Zheng · Wenqiao Zhang · Shengyu Zhang · Juncheng Li · Mengze Li · Tat-Seng Chua | N/A | Code |
| Learning 3D Scene Priors With 2D Supervision | Yinyu Nie · Angela Dai · Xiaoguang Han · Matthias Nießner | N/A | Code |
| Generating Aligned Pseudo-Supervision From Non-Aligned Data for Image Restoration in Under-Display Camera | Ruicheng Feng · Chongyi Li · Huaijin Chen · Shuai Li · Jinwei Gu · Chen Change Loy | N/A | Code |
| Siamese DETR | Zeren Chen · Gengshi Huang · Wei Li · Jianing Teng · Kun Wang · Jing Shao · Chen Change Loy · Lu Sheng | N/A | Code |
| Panoptic Video Scene Graph Generation | Jingkang Yang · Wenxuan Peng · Xiangtai Li · Zujin Guo · Liangyu Chen · Bo Li · Zheng Ma · Kaiyang Zhou · Wayne Zhang · Chen Change Loy · Ziwei Liu | N/A | Code |
| Randomized Adversarial Training via Taylor Expansion | Gaojie Jin · Xinping Yi · Dengyu Wu · Ronghui Mu · Xiaowei Huang | N/A | Code |
| Task Residual for Tuning Vision-Language Models | Tao Yu · Zhihe Lu · Xin Jin · Zhibo Chen · Xinchao Wang | N/A | Code |
| PACO: Parts and Attributes of Common Objects | Vignesh Ramanathan · Anmol Kalia · Vladan Petrovic · Yi Wen · Baixue Zheng · Baishan Guo · Rui Wang · Aaron Marquez · Rama Kovvuri · Abhishek Kadian · Amir Mousavi · Yiwen Song · Abhimanyu Dubey · Dhruv Mahajan | N/A | Code |
| CloSET: Modeling Clothed Humans on Continuous Surface With Explicit Template Decomposition | Hongwen Zhang · Siyou Lin · Ruizhi Shao · Yuxiang Zhang · Zerong Zheng · Han Huang · Yandong Guo · Yebin Liu | N/A | Code |
| Transductive Few-Shot Learning With Prototype-Based Label Propagation by Iterative Graph Refinement | Hao Zhu · Piotr Koniusz | N/A | Code |
| DualVector: Unsupervised Vector Font Synthesis With Dual-Part Representation | Ying-Tian Liu · Zhifei Zhang · Yuan-Chen Guo · Matthew Fisher · Zhaowen Wang · Song-Hai Zhang | N/A | Code |
| Invertible Neural Skinning | Yash Kant · Aliaksandr Siarohin · Riza Alp Guler · Menglei Chai · Jian Ren · Sergey Tulyakov · Igor Gilitschenski | N/A | Code |
| Next3D: Generative Neural Texture Rasterization for 3D-Aware Head Avatars | Jingxiang Sun · Xuan Wang · Lizhen Wang · Xiaoyu Li · Yong Zhang · Hongwen Zhang · Yebin Liu | N/A | Code |
| Is BERT Blind? Exploring the Effect of Vision-and-Language Pretraining on Visual Language Understanding | Morris Alper · Michael Fiman · Hadar Averbuch-Elor | N/A | Code |
| ConStruct-VL: Data-Free Continual Structured VL Concepts Learning | James Seale Smith · Paola Cascante-Bonilla · Assaf Arbelle · Donghyun Kim · Rameswar Panda · David Cox · Diyi Yang · Zsolt Kira · Rogerio Feris · Leonid Karlinsky | N/A | Code |
| LINe: Out-of-Distribution Detection by Leveraging Important Neurons | Yong Hyun Ahn · Gyeong-Moon Park · Seong Tae Kim | N/A | Code |
| Multimodality Helps Unimodality: Cross-Modal Few-Shot Learning With Multimodal Models | Zhiqiu Lin · Samuel Yu · Zhiyi Kuang · Deepak Pathak · Deva Ramanan | N/A | Code |
| Panoptic Lifting for 3D Scene Understanding With Neural Fields | Yawar Siddiqui · Lorenzo Porzi · Samuel Rota Bulò · Norman Müller · Matthias Nießner · Angela Dai · Peter Kontschieder | N/A | Code |
| GamutMLP: A Lightweight MLP for Color Loss Recovery | Hoang M. Le · Brian Price · Scott Cohen · Michael S. Brown | N/A | Code |
| DIFu: Depth-Guided Implicit Function for Clothed Human Reconstruction | Dae-Young Song · HeeKyung Lee · Jeongil Seo · Donghyeon Cho | N/A | Code |
| NLOST: Non-Line-of-Sight Imaging With Transformer | Yue Li · Jiayong Peng · Juntian Ye · Yueyi Zhang · Feihu Xu · Zhiwei Xiong | N/A | Code |
| SeaThru-NeRF: Neural Radiance Fields in Scattering Media | Deborah Levy · Amit Peleg · Naama Pearl · Dan Rosenbaum · Derya Akkaynak · Simon Korman · Tali Treibitz | N/A | Code |
| Omni3D: A Large Benchmark and Model for 3D Object Detection in the Wild | Garrick Brazil · Abhinav Kumar · Julian Straub · Nikhila Ravi · Justin Johnson · Georgia Gkioxari | N/A | Code |
| Learning on Gradients: Generalized Artifacts Representation for GAN-Generated Images Detection | Chuangchuang Tan · Yao Zhao · Shikui Wei · Guanghua Gu · Yunchao Wei | N/A | Code |
| LayoutDM: Discrete Diffusion Model for Controllable Layout Generation | Naoto Inoue · Kotaro Kikuchi · Edgar Simo-Serra · Mayu Otani · Kota Yamaguchi | N/A | Code |
| Learning Customized Visual Models With Retrieval-Augmented Knowledge | Haotian Liu · Kilho Son · Jianwei Yang · Ce Liu · Jianfeng Gao · Yong Jae Lee · Chunyuan Li | N/A | Code |
| MAIR: Multi-View Attention Inverse Rendering With 3D Spatially-Varying Lighting Estimation | JunYong Choi · SeokYeong Lee · Haesol Park · Seung-Won Jung · Ig-Jae Kim · Junghyun Cho | N/A | Code |
| Generalizing Dataset Distillation via Deep Generative Prior | George Cazenavette · Tongzhou Wang · Antonio Torralba · Alexei A. Efros · Jun-Yan Zhu | N/A | Code |
| Polarized Color Image Denoising | Zhuoxiao Li · Haiyang Jiang · Mingdeng Cao · Yinqiang Zheng | N/A | Code |
| Score Jacobian Chaining: Lifting Pretrained 2D Diffusion Models for 3D Generation | Haochen Wang · Xiaodan Du · Jiahao Li · Raymond A. Yeh · Greg Shakhnarovich | N/A | Code |
| FJMP: Factorized Joint Multi-Agent Motion Prediction Over Learned Directed Acyclic Interaction Graphs | Luke Rowe · Martin Ethier · Eli-Henry Dykhne · Krzysztof Czarnecki | N/A | Code |
| Mask-Free Video Instance Segmentation | Lei Ke · Martin Danelljan · Henghui Ding · Yu-Wing Tai · Chi-Keung Tang · Fisher Yu | N/A | Code |
| OVTrack: Open-Vocabulary Multiple Object Tracking | Siyuan Li · Tobias Fischer · Lei Ke · Henghui Ding · Martin Danelljan · Fisher Yu | N/A | Code |
| LightPainter: Interactive Portrait Relighting With Freehand Scribble | Yiqun Mei · He Zhang · Xuaner Zhang · Jianming Zhang · Zhixin Shu · Yilin Wang · Zijun Wei · Shi Yan · HyunJoon Jung · Vishal M. Patel | N/A | Code |
| Towards Scalable Neural Representation for Diverse Videos | Bo He · Xitong Yang · Hanyu Wang · Zuxuan Wu · Hao Chen · Shuaiyi Huang · Yixuan Ren · Ser-Nam Lim · Abhinav Shrivastava | N/A | Code |
| Teaching Matters: Investigating the Role of Supervision in Vision Transformers | Matthew Walmer · Saksham Suri · Kamal Gupta · Abhinav Shrivastava | N/A | Code |
| FlexNeRF: Photorealistic Free-Viewpoint Rendering of Moving Humans From Sparse Views | Vinoj Jayasundara · Amit Agrawal · Nicolas Heron · Abhinav Shrivastava · Larry S. Davis | N/A | Code |
| Leveraging Temporal Context in Low Representational Power Regimes | Camilo L. Fosco · SouYoung Jin · Emilie Josephs · Aude Oliva | N/A | Code |
| Painting 3D Nature in 2D: View Synthesis of Natural Scenes From a Single Semantic Mask | Shangzhan Zhang · Sida Peng · Tianrun Chen · Linzhan Mou · Haotong Lin · Kaicheng Yu · Yiyi Liao · Xiaowei Zhou | N/A | Code |
| Align and Attend: Multimodal Summarization With Dual Contrastive Losses | Bo He · Jun Wang · Jielin Qiu · Trung Bui · Abhinav Shrivastava · Zhaowen Wang | N/A | Code |
| SimpSON: Simplifying Photo Cleanup With Single-Click Distracting Object Segmentation Network | Chuong Huynh · Yuqian Zhou · Zhe Lin · Connelly Barnes · Eli Shechtman · Sohrab Amirghodsi · Abhinav Shrivastava | N/A | Code |
| NIRVANA: Neural Implicit Representations of Videos With Adaptive Networks and Autoregressive Patch-Wise Modeling | Shishira R Maiya · Sharath Girish · Max Ehrlich · Hanyu Wang · Kwot Sin Lee · Patrick Poirson · Pengxiang Wu · Chen Wang · Abhinav Shrivastava | N/A | Code |
| Seeing Beyond the Brain: Conditional Diffusion Model With Sparse Masked Modeling for Vision Decoding | Zijiao Chen · Jiaxin Qing · Tiange Xiang · Wan Lin Yue · Juan Helen Zhou | N/A | Code |
| Position-Guided Text Prompt for Vision-Language Pre-Training | Jinpeng Wang · Pan Zhou · Mike Zheng Shou · Shuicheng Yan | N/A | Code |
| Progressive Spatio-Temporal Alignment for Efficient Event-Based Motion Estimation | Xueyan Huang · Yueyi Zhang · Zhiwei Xiong | N/A | Code |
| MIST: Multi-Modal Iterative Spatial-Temporal Transformer for Long-Form Video Question Answering | Difei Gao · Luowei Zhou · Lei Ji · Linchao Zhu · Yi Yang · Mike Zheng Shou | N/A | Code |
| Histopathology Whole Slide Image Analysis With Heterogeneous Graph Representation Learning | Tsai Hor Chan · Fernando Julio Cendra · Lan Ma · Guosheng Yin · Lequan Yu | N/A | Code |
| Making Vision Transformers Efficient From a Token Sparsification View | Shuning Chang · Pichao Wang · Ming Lin · Fan Wang · David Junhao Zhang · Rong Jin · Mike Zheng Shou | N/A | Code |
| Leverage Interactive Affinity for Affordance Learning | Hongchen Luo · Wei Zhai · Jing Zhang · Yang Cao · Dacheng Tao | N/A | Code |
| Uncertainty-Aware Optimal Transport for Semantically Coherent Out-of-Distribution Detection | Fan Lu · Kai Zhu · Wei Zhai · Kecheng Zheng · Yang Cao | N/A | Code |
| HARP: Personalized Hand Reconstruction From a Monocular RGB Video | Korrawe Karunratanakul · Sergey Prokudin · Otmar Hilliges · Siyu Tang | N/A | Code |
| Towards Effective Visual Representations for Partial-Label Learning | Shiyu Xia · Jiaqi Lv · Ning Xu · Gang Niu · Xin Geng | N/A | Code |
| SFD2: Semantic-Guided Feature Detection and Description | Fei Xue · Ignas Budvytis · Roberto Cipolla | N/A | Code |
| MetaPortrait: Identity-Preserving Talking Head Generation With Fast Personalized Adaptation | Bowen Zhang · Chenyang Qi · Pan Zhang · Bo Zhang · HsiangTao Wu · Dong Chen · Qifeng Chen · Yong Wang · Fang Wen | N/A | Code |
| The Dialog Must Go On: Improving Visual Dialog via Generative Self-Training | Gi-Cheon Kang · Sungdong Kim · Jin-Hwa Kim · Donghyun Kwak · Byoung-Tak Zhang | N/A | Code |
| Temporal Interpolation Is All You Need for Dynamic Neural Radiance Fields | Sungheon Park · Minjung Son · Seokhwan Jang · Young Chun Ahn · Ji-Yeon Kim · Nahyup Kang | N/A | Code |
| DiGA: Distil To Generalize and Then Adapt for Domain Adaptive Semantic Segmentation | Fengyi Shen · Akhil Gurram · Ziyuan Liu · He Wang · Alois Knoll | N/A | Code |
| Multimodal Prompting With Missing Modalities for Visual Recognition | Yi-Lun Lee · Yi-Hsuan Tsai · Wei-Chen Chiu · Chen-Yu Lee | N/A | Code |
| On Calibrating Semantic Segmentation Models: Analyses and an Algorithm | Dongdong Wang · Boqing Gong · Liqiang Wang | N/A | Code |
| IMP: Iterative Matching and Pose Estimation With Adaptive Pooling | Fei Xue · Ignas Budvytis · Roberto Cipolla | N/A | Code |
| Grid-Guided Neural Radiance Fields for Large Urban Scenes | Linning Xu · Yuanbo Xiangli · Sida Peng · Xingang Pan · Nanxuan Zhao · Christian Theobalt · Bo Dai · Dahua Lin | N/A | Code |
| Neural Voting Field for Camera-Space 3D Hand Pose Estimation | Lin Huang · Chung-Ching Lin · Kevin Lin · Lin Liang · Lijuan Wang · Junsong Yuan · Zicheng Liu | N/A | Code |
| Dense Network Expansion for Class Incremental Learning | Zhiyuan Hu · Yunsheng Li · Jiancheng Lyu · Dashan Gao · Nuno Vasconcelos | N/A | Code |
| FashionSAP: Symbols and Attributes Prompt for Fine-Grained Fashion Vision-Language Pre-Training | Yunpeng Han · Lisai Zhang · Qingcai Chen · Zhijian Chen · Zhonghua Li · Jianxin Yang · Zhao Cao | N/A | Code |
| Batch Model Consolidation: A Multi-Task Model Consolidation Framework | Iordanis Fostiropoulos · Jiaye Zhu · Laurent Itti | N/A | Code |
| Open-Vocabulary Attribute Detection | María A. Bravo · Sudhanshu Mittal · Simon Ging · Thomas Brox | N/A | Code |
| Unite and Conquer: Plug & Play Multi-Modal Synthesis Using Diffusion Models | Nithin Gopalakrishnan Nair · Wele Gedara Chaminda Bandara · Vishal M. Patel | N/A | Code |
| BEVHeight: A Robust Framework for Vision-Based Roadside 3D Object Detection | Lei Yang · Kaicheng Yu · Tao Tang · Jun Li · Kun Yuan · Li Wang · Xinyu Zhang · Peng Chen | N/A | Code |
| Re2TAL: Rewiring Pretrained Video Backbones for Reversible Temporal Action Localization | Chen Zhao · Shuming Liu · Karttikeya Mangalam · Bernard Ghanem | N/A | Code |
| C-SFDA: A Curriculum Learning Aided Self-Training Framework for Efficient Source Free Domain Adaptation | Nazmul Karim · Niluthpol Chowdhury Mithun · Abhinav Rajvanshi · Han-pang Chiu · Supun Samarasekera · Nazanin Rahnavard | N/A | Code |
| Are Deep Neural Networks SMARTer Than Second Graders? | Anoop Cherian · Kuan-Chuan Peng · Suhas Lohit · Kevin A. Smith · Joshua B. Tenenbaum | N/A | Code |
| Persistent Nature: A Generative Model of Unbounded 3D Worlds | Lucy Chai · Richard Tucker · Zhengqi Li · Phillip Isola · Noah Snavely | N/A | Code |
| InternImage: Exploring Large-Scale Vision Foundation Models With Deformable Convolutions | Wenhai Wang · Jifeng Dai · Zhe Chen · Zhenhang Huang · Zhiqi Li · Xizhou Zhu · Xiaowei Hu · Tong Lu · Lewei Lu · Hongsheng Li · Xiaogang Wang · Yu Qiao | N/A | Code |
| Learning To Fuse Monocular and Multi-View Cues for Multi-Frame Depth Estimation in Dynamic Scenes | Rui Li · Dong Gong · Wei Yin · Hao Chen · Yu Zhu · Kaixuan Wang · Xiaozhi Chen · Jinqiu Sun · Yanning Zhang | N/A | Code |
| Benchmarking Self-Supervised Learning on Diverse Pathology Datasets | Mingu Kang · Heon Song · Seonwook Park · Donggeun Yoo · Sérgio Pereira | N/A | Code |
| Knowledge Distillation for 6D Pose Estimation by Aligning Distributions of Local Predictions | Shuxuan Guo · Yinlin Hu · Jose M. Alvarez · Mathieu Salzmann | N/A | Code |
| Self-Supervised Representation Learning for CAD | Benjamin T. Jones · Michael Hu · Milin Kodnongbua · Vladimir G. Kim · Adriana Schulz | N/A | Code |
| SparseViT: Revisiting Activation Sparsity for Efficient High-Resolution Vision Transformer | Xuanyao Chen · Zhijian Liu · Haotian Tang · Li Yi · Hang Zhao · Song Han | N/A | Code |
| Neural Pixel Composition for 3D-4D View Synthesis From Multi-Views | Aayush Bansal · Michael Zollhöfer | N/A | Code |
| ViP3D: End-to-End Visual Trajectory Prediction via 3D Agent Queries | Junru Gu · Chenxu Hu · Tianyuan Zhang · Xuanyao Chen · Yilun Wang · Yue Wang · Hang Zhao | N/A | Code |
| AdaMAE: Adaptive Masking for Efficient Spatiotemporal Learning With Masked Autoencoders | Wele Gedara Chaminda Bandara · Naman Patel · Ali Gholami · Mehdi Nikkhah · Motilal Agrawal · Vishal M. Patel | N/A | Code |
| Masked Scene Contrast: A Scalable Framework for Unsupervised 3D Representation Learning | Xiaoyang Wu · Xin Wen · Xihui Liu · Hengshuang Zhao | N/A | Code |
| RIFormer: Keep Your Vision Backbone Effective but Removing Token Mixer | Jiahao Wang · Songyang Zhang · Yong Liu · Taiqiang Wu · Yujiu Yang · Xihui Liu · Kai Chen · Ping Luo · Dahua Lin | N/A | Code |
| TeSLA: Test-Time Self-Learning With Automatic Adversarial Augmentation | Devavrat Tomar · Guillaume Vray · Behzad Bozorgtabar · Jean-Philippe Thiran | N/A | Code |
| ObjectMatch: Robust Registration Using Canonical Object Correspondences | Can Gümeli · Angela Dai · Matthias Nießner | N/A | Code |
| Dream3D: Zero-Shot Text-to-3D Synthesis Using 3D Shape Prior and Text-to-Image Diffusion Models | Jiale Xu · Xintao Wang · Weihao Cheng · Yan-Pei Cao · Ying Shan · Xiaohu Qie · Shenghua Gao | N/A | Code |
| SurfelNeRF: Neural Surfel Radiance Fields for Online Photorealistic Reconstruction of Indoor Scenes | Yiming Gao · Yan-Pei Cao · Ying Shan | N/A | Code |
| Object Detection With Self-Supervised Scene Adaptation | Zekun Zhang · Minh Hoai | N/A | Code |
| Megahertz Light Steering Without Moving Parts | Adithya Pediredla · Srinivasa G. Narasimhan · Maysamreza Chamanzar · Ioannis Gkioulekas | N/A | Code |
| ISBNet: A 3D Point Cloud Instance Segmentation Network With Instance-Aware Sampling and Box-Aware Dynamic Convolution | Tuan Duc Ngo · Binh-Son Hua · Khoi Nguyen | N/A | Code |
| Rate Gradient Approximation Attack Threats Deep Spiking Neural Networks | Tong Bu · Jianhao Ding · Zecheng Hao · Zhaofei Yu | N/A | Code |
| PIVOT: Prompting for Video Continual Learning | Andrés Villa · Juan León Alcázar · Motasem Alfarra · Kumail Alhamoud · Julio Hurtado · Fabian Caba Heilbron · Alvaro Soto · Bernard Ghanem | N/A | Code |
| ARO-Net: Learning Implicit Fields From Anchored Radial Observations | Yizhi Wang · Zeyu Huang · Ariel Shamir · Hui Huang · Hao Zhang · Ruizhen Hu | N/A | Code |
| Parallel Diffusion Models of Operator and Image for Blind Inverse Problems | Hyungjin Chung · Jeongsol Kim · Sehui Kim · Jong Chul Ye | N/A | Code |
| Solving 3D Inverse Problems Using Pre-Trained 2D Diffusion Models | Hyungjin Chung · Dohoon Ryu · Michael T. McCann · Marc L. Klasky · Jong Chul Ye | N/A | Code |
| Affordance Grounding From Demonstration Video To Target Image | Joya Chen · Difei Gao · Kevin Qinghong Lin · Mike Zheng Shou | N/A | Code |
| Learning Procedure-Aware Video Representation From Instructional Videos and Their Narrations | Yiwu Zhong · Licheng Yu · Yang Bai · Shangwen Li · Xueting Yan · Yin Li | N/A | Code |
| YOLOv7: Trainable Bag-of-Freebies Sets New State-of-the-Art for Real-Time Object Detectors | Chien-Yao Wang · Alexey Bochkovskiy · Hong-Yuan Mark Liao | N/A | Code |
| OmniCity: Omnipotent City Understanding With Multi-Level and Multi-View Images | Weijia Li · Yawen Lai · Linning Xu · Yuanbo Xiangli · Jinhua Yu · Conghui He · Gui-Song Xia · Dahua Lin | N/A | Code |
| Object Discovery From Motion-Guided Tokens | Zhipeng Bao · Pavel Tokmakov · Yu-Xiong Wang · Adrien Gaidon · Martial Hebert | N/A | Code |
| MP-Former: Mask-Piloted Transformer for Image Segmentation | Hao Zhang · Feng Li · Huaizhe Xu · Shijia Huang · Shilong Liu · Lionel M. Ni · Lei Zhang | N/A | Code |
| Disentangling Writer and Character Styles for Handwriting Generation | Gang Dai · Yifan Zhang · Qingfeng Wang · Qing Du · Zhuliang Yu · Zhuoman Liu · Shuangping Huang | N/A | Code |
| Building Rearticulable Models for Arbitrary 3D Objects From 4D Point Clouds | Shaowei Liu · Saurabh Gupta · Shenlong Wang | N/A | Code |
| Gated Stereo: Joint Depth Estimation From Gated and Wide-Baseline Active Stereo Cues | Stefanie Walz · Mario Bijelic · Andrea Ramazzina · Amanpreet Walia · Fahim Mannan · Felix Heide | N/A | Code |
| Dynamically Instance-Guided Adaptation: A Backward-Free Approach for Test-Time Domain Adaptive Semantic Segmentation | Wei Wang · Zhun Zhong · Weijie Wang · Xi Chen · Charles Ling · Boyu Wang · Nicu Sebe | N/A | Code |
| Perspective Fields for Single Image Camera Calibration | Linyi Jin · Jianming Zhang · Yannick Hold-Geoffroy · Oliver Wang · Kevin Blackburn-Matzen · Matthew Sticha · David F. Fouhey | N/A | Code |
| Vision Transformers Are Parameter-Efficient Audio-Visual Learners | Yan-Bo Lin · Yi-Lin Sung · Jie Lei · Mohit Bansal · Gedas Bertasius | N/A | Code |
| Efficient Semantic Segmentation by Altering Resolutions for Compressed Videos | Yubin Hu · Yuze He · Yanghao Li · Jisheng Li · Yuxing Han · Jiangtao Wen · Yong-Jin Liu | N/A | Code |
| DisWOT: Student Architecture Search for Distillation WithOut Training | Peijie Dong · Lujun Li · Zimian Wei | N/A | Code |
| Activating More Pixels in Image Super-Resolution Transformer | Xiangyu Chen · Xintao Wang · Jiantao Zhou · Yu Qiao · Chao Dong | N/A | Code |
| You Only Segment Once: Towards Real-Time Panoptic Segmentation | Jie Hu · Linyan Huang · Tianhe Ren · Shengchuan Zhang · Rongrong Ji · Liujuan Cao | N/A | Code |
| PA&DA: Jointly Sampling Path and Data for Consistent NAS | Shun Lu · Yu Hu · Longxing Yang · Zihao Sun · Jilin Mei · Jianchao Tan · Chengru Song | N/A | Code |
| NeuralUDF: Learning Unsigned Distance Fields for Multi-View Reconstruction of Surfaces With Arbitrary Topologies | Xiaoxiao Long · Cheng Lin · Lingjie Liu · Yuan Liu · Peng Wang · Christian Theobalt · Taku Komura · Wenping Wang | N/A | Code |
| Towards Universal Fake Image Detectors That Generalize Across Generative Models | Utkarsh Ojha · Yuheng Li · Yong Jae Lee | N/A | Code |
| FLAG3D: A 3D Fitness Activity Dataset With Language Instruction | Yansong Tang · Jinpeng Liu · Aoyang Liu · Bin Yang · Wenxun Dai · Yongming Rao · Jiwen Lu · Jie Zhou · Xiu Li | N/A | Code |
| NEF: Neural Edge Fields for 3D Parametric Curve Reconstruction From Multi-View Images | Yunfan Ye · Renjiao Yi · Zhirui Gao · Chenyang Zhu · Zhiping Cai · Kai Xu | N/A | Code |
| Executing Your Commands via Motion Diffusion in Latent Space | Xin Chen · Biao Jiang · Wen Liu · Zilong Huang · Bin Fu · Tao Chen · Gang Yu | N/A | Code |
| MSINet: Twins Contrastive Search of Multi-Scale Interaction for Object ReID | Jianyang Gu · Kai Wang · Hao Luo · Chen Chen · Wei Jiang · Yuqiang Fang · Shanghang Zhang · Yang You · Jian Zhao | N/A | Code |
| SunStage: Portrait Reconstruction and Relighting Using the Sun as a Light Stage | Yifan Wang · Aleksander Holynski · Xiuming Zhang · Xuaner Zhang | N/A | Code |
| IS-GGT: Iterative Scene Graph Generation With Generative Transformers | Sanjoy Kundu · Sathyanarayanan N. Aakur | N/A | Code |
| DisCoScene: Spatially Disentangled Generative Radiance Fields for Controllable 3D-Aware Scene Synthesis | Yinghao Xu · Menglei Chai · Zifan Shi · Sida Peng · Ivan Skorokhodov · Aliaksandr Siarohin · Ceyuan Yang · Yujun Shen · Hsin-Ying Lee · Bolei Zhou · Sergey Tulyakov | N/A | Code |
| Breaking the “Object” in Video Object Segmentation | Pavel Tokmakov · Jie Li · Adrien Gaidon | N/A | Code |
| SimpleNet: A Simple Network for Image Anomaly Detection and Localization | Zhikang Liu · Yiming Zhou · Yuansheng Xu · Zilei Wang | N/A | Code |
| Taming Diffusion Models for Audio-Driven Co-Speech Gesture Generation | Lingting Zhu · Xian Liu · Xuanyu Liu · Rui Qian · Ziwei Liu · Lequan Yu | N/A | Code |
| Top-Down Visual Attention From Analysis by Synthesis | Baifeng Shi · Trevor Darrell · Xin Wang | N/A | Code |
| Disentangling Orthogonal Planes for Indoor Panoramic Room Layout Estimation With Cross-Scale Distortion Awareness | Zhijie Shen · Zishuo Zheng · Chunyu Lin · Lang Nie · Kang Liao · Shuai Zheng · Yao Zhao | N/A | Code |
| Global-to-Local Modeling for Video-Based 3D Human Pose and Shape Estimation | Xiaolong Shen · Zongxin Yang · Xiaohan Wang · Jianxin Ma · Chang Zhou · Yi Yang | N/A | Code |
| RaBit: Parametric Modeling of 3D Biped Cartoon Characters With a Topological-Consistent Dataset | Zhongjin Luo · Shengcai Cai · Jinguo Dong · Ruibo Ming · Liangdong Qiu · Xiaohang Zhan · Xiaoguang Han | N/A | Code |
| Masked Image Modeling With Local Multi-Scale Reconstruction | Haoqing Wang · Yehui Tang · Yunhe Wang · Jianyuan Guo · Zhi-Hong Deng · Kai Han | N/A | Code |
| Omni Aggregation Networks for Lightweight Image Super-Resolution | Hang Wang · Xuanhong Chen · Bingbing Ni · Yutian Liu · Jinfan Liu | N/A | Code |
| TryOnDiffusion: A Tale of Two UNets | Luyang Zhu · Dawei Yang · Tyler Zhu · Fitsum Reda · William Chan · Chitwan Saharia · Mohammad Norouzi · Ira Kemelmacher-Shlizerman | N/A | Code |
| MoLo: Motion-Augmented Long-Short Contrastive Learning for Few-Shot Action Recognition | Xiang Wang · Shiwei Zhang · Zhiwu Qing · Changxin Gao · Yingya Zhang · Deli Zhao · Nong Sang | N/A | Code |
| Dynamic Aggregated Network for Gait Recognition | Kang Ma · Ying Fu · Dezhi Zheng · Chunshui Cao · Xuecai Hu · Yongzhen Huang | N/A | Code |
| Equivalent Transformation and Dual Stream Network Construction for Mobile Image Super-Resolution | Jiahao Chao · Zhou Zhou · Hongfan Gao · Jiali Gong · Zhengfeng Yang · Zhenbing Zeng · Lydia Dehbi | N/A | Code |
| Semi-Supervised Hand Appearance Recovery via Structure Disentanglement and Dual Adversarial Discrimination | Zimeng Zhao · Binghui Zuo · Zhiyu Long · Yangang Wang | N/A | Code |
| DBARF: Deep Bundle-Adjusting Generalizable Neural Radiance Fields | Yu Chen · Gim Hee Lee | N/A | Code |
| Deep Arbitrary-Scale Image Super-Resolution via Scale-Equivariance Pursuit | Xiaohang Wang · Xuanhong Chen · Bingbing Ni · Hang Wang · Zhengyan Tong · Yutian Liu | N/A | Code |
| Looking Through the Glass: Neural Surface Reconstruction Against High Specular Reflections | Jiaxiong Qiu · Peng-Tao Jiang · Yifan Zhu · Ze-Xin Yin · Ming-Ming Cheng · Bo Ren | N/A | Code |
| The Wisdom of Crowds: Temporal Progressive Attention for Early Action Prediction | Alexandros Stergiou · Dima Damen | N/A | Code |
| Use Your Head: Improving Long-Tail Video Recognition | Toby Perrett · Saptarshi Sinha · Tilo Burghardt · Majid Mirmehdi · Dima Damen | N/A | Code |
| Large-Scale Training Data Search for Object Re-Identification | Yue Yao · Tom Gedeon · Liang Zheng | N/A | Code |
| Unsupervised Sampling Promoting for Stochastic Human Trajectory Prediction | Guangyi Chen · Zhenhao Chen · Shunxing Fan · Kun Zhang | N/A | Code |
| Seeing a Rose in Five Thousand Ways | Yunzhi Zhang · Shangzhe Wu · Noah Snavely · Jiajun Wu | N/A | Code |
| EditableNeRF: Editing Topologically Varying Neural Radiance Fields by Key Points | Chengwei Zheng · Wenbin Lin · Feng Xu | N/A | Code |
| Uncertainty-Aware Unsupervised Image Deblurring With Deep Residual Prior | Xiaole Tang · Xile Zhao · Jun Liu · Jianli Wang · Yuchun Miao · Tieyong Zeng | N/A | Code |
| Primitive Generation and Semantic-Related Alignment for Universal Zero-Shot Segmentation | Shuting He · Henghui Ding · Wei Jiang | N/A | Code |
| Long-Tailed Visual Recognition via Self-Heterogeneous Integration With Knowledge Excavation | Yan Jin · Mengke Li · Yang Lu · Yiu-ming Cheung · Hanzi Wang | N/A | Code |
| Neuron Structure Modeling for Generalizable Remote Physiological Measurement | Hao Lu · Zitong Yu · Xuesong Niu · Ying-Cong Chen | N/A | Code |
| Decoupled Semantic Prototypes Enable Learning From Diverse Annotation Types for Semi-Weakly Segmentation in Expert-Driven Domains | Simon Reiß · Constantin Seibold · Alexander Freytag · Erik Rodner · Rainer Stiefelhagen | N/A | Code |
| Learning a Sparse Transformer Network for Effective Image Deraining | Xiang Chen · Hao Li · Mingqiang Li · Jinshan Pan | N/A | Code |
| Camouflaged Object Detection With Feature Decomposition and Edge Reconstruction | Chunming He · Kai Li · Yachao Zhang · Longxiang Tang · Yulun Zhang · Zhenhua Guo · Xiu Li | N/A | Code |
| LOCATE: Localize and Transfer Object Parts for Weakly Supervised Affordance Grounding | Gen Li · Varun Jampani · Deqing Sun · Laura Sevilla-Lara | N/A | Code |
| DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation | Nataniel Ruiz · Yuanzhen Li · Varun Jampani · Yael Pritch · Michael Rubinstein · Kfir Aberman | N/A | Code |
| GeneCIS: A Benchmark for General Conditional Image Similarity | Sagar Vaze · Nicolas Carion · Ishan Misra | N/A | Code |
| Neighborhood Attention Transformer | Ali Hassani · Steven Walton · Jiachen Li · Shen Li · Humphrey Shi | N/A | Code |
| 3D-Aware Conditional Image Synthesis | Kangle Deng · Gengshan Yang · Deva Ramanan · Jun-Yan Zhu | N/A | Code |
| Magic3D: High-Resolution Text-to-3D Content Creation | Chen-Hsuan Lin · Jun Gao · Luming Tang · Towaki Takikawa · Xiaohui Zeng · Xun Huang · Karsten Kreis · Sanja Fidler · Ming-Yu Liu · Tsung-Yi Lin | N/A | Code |
| QuantArt: Quantizing Image Style Transfer Towards High Visual Fidelity | Siyu Huang · Jie An · Donglai Wei · Jiebo Luo · Hanspeter Pfister | N/A | Code |
| SceneComposer: Any-Level Semantic Image Synthesis | Yu Zeng · Zhe Lin · Jianming Zhang · Qing Liu · John Collomosse · Jason Kuen · Vishal M. Patel | N/A | Code |
| Specialist Diffusion: Plug-and-Play Sample-Efficient Fine-Tuning of Text-to-Image Diffusion Models To Learn Any Unseen Style | Haoming Lu · Hazarapet Tunanyan · Kai Wang · Shant Navasardyan · Zhangyang Wang · Humphrey Shi | N/A | Code |
| In-Hand 3D Object Scanning From an RGB Sequence | Shreyas Hampali · Tomas Hodan · Luan Tran · Lingni Ma · Cem Keskin · Vincent Lepetit | N/A | Code |
| SHS-Net: Learning Signed Hyper Surfaces for Oriented Normal Estimation of Point Clouds | Qing Li · Huifang Feng · Kanle Shi · Yue Gao · Yi Fang · Yu-Shen Liu · Zhizhong Han | N/A | Code |
| Advancing Visual Grounding With Scene Knowledge: Benchmark and Method | Zhihong Chen · Ruifei Zhang · Yibing Song · Xiang Wan · Guanbin Li | N/A | Code |
| Putting People in Their Place: Affordance-Aware Human Insertion Into Scenes | Sumith Kulal · Tim Brooks · Alex Aiken · Jiajun Wu · Jimei Yang · Jingwan Lu · Alexei A. Efros · Krishna Kumar Singh | N/A | Code |
| Identity-Preserving Talking Face Generation With Landmark and Appearance Priors | Weizhi Zhong · Chaowei Fang · Yinqi Cai · Pengxu Wei · Gangming Zhao · Liang Lin · Guanbin Li | N/A | Code |
| Less Is More: Reducing Task and Model Complexity for 3D Point Cloud Semantic Segmentation | Li Li · Hubert P. H. Shum · Toby P. Breckon | N/A | Code |
| FAC: 3D Representation Learning via Foreground Aware Feature Contrast | Kangcheng Liu · Aoran Xiao · Xiaoqin Zhang · Shijian Lu · Ling Shao | N/A | Code |
| InstMove: Instance Motion for Object-Centric Video Segmentation | Qihao Liu · Junfeng Wu · Yi Jiang · Xiang Bai · Alan L. Yuille · Song Bai | N/A | Code |
| Are We Ready for Vision-Centric Driving Streaming Perception? The ASAP Benchmark | Xiaofeng Wang · Zheng Zhu · Yunpeng Zhang · Guan Huang · Yun Ye · Wenbo Xu · Ziwei Chen · Xingang Wang | N/A | Code |
| Self-Supervised Non-Uniform Kernel Estimation With Flow-Based Motion Prior for Blind Image Deblurring | Zhenxuan Fang · Fangfang Wu · Weisheng Dong · Xin Li · Jinjian Wu · Guangming Shi | N/A | Code |
| Neural Kernel Surface Reconstruction | Jiahui Huang · Zan Gojcic · Matan Atzmon · Or Litany · Sanja Fidler · Francis Williams | N/A | Code |
| Binary Latent Diffusion | Ze Wang · Jiang Wang · Zicheng Liu · Qiang Qiu | N/A | Code |
| Learning To Dub Movies via Hierarchical Prosody Models | Gaoxiang Cong · Liang Li · Yuankai Qi · Zheng-Jun Zha · Qi Wu · Wenyu Wang · Bin Jiang · Ming-Hsuan Yang · Qingming Huang | N/A | Code |
| Learning Geometric-Aware Properties in 2D Representation Using Lightweight CAD Models, or Zero Real 3D Pairs | Pattaramanee Arsomngern · Sarana Nutanong · Supasorn Suwajanakorn | N/A | Code |
| FrustumFormer: Adaptive Instance-Aware Resampling for Multi-View 3D Detection | Yuqi Wang · Yuntao Chen · Zhaoxiang Zhang | N/A | Code |
| Q: How To Specialize Large Vision-Language Models to Data-Scarce VQA Tasks? A: Self-Train on Unlabeled Images! | Zaid Khan · Vijay Kumar BG · Samuel Schulter · Xiang Yu · Yun Fu · Manmohan Chandraker | N/A | Code |
| StyleRes: Transforming the Residuals for Real Image Editing With StyleGAN | Hamza Pehlivan · Yusuf Dalva · Aysegul Dundar | N/A | Code |
| PVT-SSD: Single-Stage 3D Object Detector With Point-Voxel Transformer | Honghui Yang · Wenxiao Wang · Minghao Chen · Binbin Lin · Tong He · Hua Chen · Xiaofei He · Wanli Ouyang | N/A | Code |
| Boosting Verified Training for Robust Image Classifications via Abstraction | Zhaodi Zhang · Zhiyi Xue · Yang Chen · Si Liu · Yueling Zhang · Jing Liu · Min Zhang | N/A | Code |
| Interactive Segmentation As Gaussian Process Classification | Minghao Zhou · Hong Wang · Qian Zhao · Yuexiang Li · Yawen Huang · Deyu Meng · Yefeng Zheng | N/A | Code |
| OSRT: Omnidirectional Image Super-Resolution With Distortion-Aware Transformer | Fanghua Yu · Xintao Wang · Mingdeng Cao · Gen Li · Ying Shan · Chao Dong | N/A | Code |
| Accelerating Vision-Language Pretraining With Free Language Modeling | Teng Wang · Yixiao Ge · Feng Zheng · Ran Cheng · Ying Shan · Xiaohu Qie · Ping Luo | N/A | Code |
| TexPose: Neural Texture Learning for Self-Supervised 6D Object Pose Estimation | Hanzhi Chen · Fabian Manhardt · Nassir Navab · Benjamin Busam | N/A | Code |
| Learning With Noisy Labels via Self-Supervised Adversarial Noisy Masking | Yuanpeng Tu · Boshen Zhang · Yuxi Li · Liang Liu · Jian Li · Jiangning Zhang · Yabiao Wang · Chengjie Wang · Cai Rong Zhao | N/A | Code |
| HelixSurf: A Robust and Efficient Neural Implicit Surface Learning of Indoor Scenes With Iterative Intertwined Regularization | Zhihao Liang · Zhangjin Huang · Changxing Ding · Kui Jia | N/A | Code |
| Multi-Space Neural Radiance Fields | Ze-Xin Yin · Jiaxiong Qiu · Ming-Ming Cheng · Bo Ren | N/A | Code |
| MSF: Motion-Guided Sequential Fusion for Efficient 3D Object Detection From Point Cloud Sequences | Chenhang He · Ruihuang Li · Yabin Zhang · Shuai Li · Lei Zhang | N/A | Code |
| DLBD: A Self-Supervised Direct-Learned Binary Descriptor | Bin Xiao · Yang Hu · Bo Liu · Xiuli Bi · Weisheng Li · Xinbo Gao | N/A | Code |
| Connecting the Dots: Floorplan Reconstruction Using Two-Level Queries | Yuanwen Yue · Theodora Kontogianni · Konrad Schindler · Francis Engelmann | N/A | Code |
| PointAvatar: Deformable Point-Based Head Avatars From Videos | Yufeng Zheng · Wang Yifan · Gordon Wetzstein · Michael J. Black · Otmar Hilliges | N/A | Code |
| Diffusion-SDF: Text-To-Shape via Voxelized Diffusion | Muheng Li · Yueqi Duan · Jie Zhou · Jiwen Lu | N/A | Code |
| NeRF-RPN: A General Framework for Object Detection in NeRFs | Benran Hu · Junkai Huang · Yichen Liu · Yu-Wing Tai · Chi-Keung Tang | N/A | Code |
| CNVid-3.5M: Build, Filter, and Pre-Train the Large-Scale Public Chinese Video-Text Dataset | Tian Gan · Qing Wang · Xingning Dong · Xiangyuan Ren · Liqiang Nie · Qingpei Guo | N/A | Code |
| Vid2Avatar: 3D Avatar Reconstruction From Videos in the Wild via Self-Supervised Scene Decomposition | Chen Guo · Tianjian Jiang · Xu Chen · Jie Song · Otmar Hilliges | N/A | Code |
| Neural Preset for Color Style Transfer | Zhanghan Ke · Yuhao Liu · Lei Zhu · Nanxuan Zhao · Rynson W.H. Lau | N/A | Code |
| GRES: Generalized Referring Expression Segmentation | Chang Liu · Henghui Ding · Xudong Jiang | N/A | Code |
| Tracking Through Containers and Occluders in the Wild | Basile Van Hoorick · Pavel Tokmakov · Simon Stent · Jie Li · Carl Vondrick | N/A | Code |
| DepGraph: Towards Any Structural Pruning | Gongfan Fang · Xinyin Ma · Mingli Song · Michael Bi Mi · Xinchao Wang | N/A | Code |
| Exploring Incompatible Knowledge Transfer in Few-Shot Image Generation | Yunqing Zhao · Chao Du · Milad Abdollahzadeh · Tianyu Pang · Min Lin · Shuicheng Yan · Ngai-Man Cheung | N/A | Code |
| RGB No More: Minimally-Decoded JPEG Vision Transformers | Jeongsoo Park · Justin Johnson | N/A | Code |
| iQuery: Instruments As Queries for Audio-Visual Sound Separation | Jiaben Chen · Renrui Zhang · Dongze Lian · Jiaqi Yang · Ziyao Zeng · Jianbo Shi | N/A | Code |
| Towards Professional Level Crowd Annotation of Expert Domain Data | Pei Wang · Nuno Vasconcelos | N/A | Code |
| VideoTrack: Learning To Track Objects via Video Transformer | Fei Xie · Lei Chu · Jiahao Li · Yan Lu · Chao Ma | N/A | Code |
| SCoDA: Domain Adaptive Shape Completion for Real Scans | Yushuang Wu · Zizheng Yan · Ce Chen · Lai Wei · Xiao Li · Guanbin Li · Yihao Li · Shuguang Cui · Xiaoguang Han | N/A | Code |
| Enhanced Training of Query-Based Object Detection via Selective Query Recollection | Fangyi Chen · Han Zhang · Kai Hu · Yu-Kai Huang · Chenchen Zhu · Marios Savvides | N/A | Code |
| LaserMix for Semi-Supervised LiDAR Semantic Segmentation | Lingdong Kong · Jiawei Ren · Liang Pan · Ziwei Liu | N/A | Code |
| MSMDFusion: Fusing LiDAR and Camera at Multiple Scales With Multi-Depth Seeds for 3D Object Detection | Yang Jiao · Zequn Jie · Shaoxiang Chen · Jingjing Chen · Lin Ma · Yu-Gang Jiang | N/A | Code |
| Learning With Fantasy: Semantic-Aware Virtual Contrastive Constraint for Few-Shot Class-Incremental Learning | Zeyin Song · Yifan Zhao · Yujun Shi · Peixi Peng · Li Yuan · Yonghong Tian | N/A | Code |
| PLA: Language-Driven Open-Vocabulary 3D Scene Understanding | Runyu Ding · Jihan Yang · Chuhui Xue · Wenqing Zhang · Song Bai · Xiaojuan Qi | N/A | Code |
| Being Comes From Not-Being: Open-Vocabulary Text-to-Motion Generation With Wordless Training | Junfan Lin · Jianlong Chang · Lingbo Liu · Guanbin Li · Liang Lin · Qi Tian · Chang-Wen Chen | N/A | Code |
| A Dynamic Multi-Scale Voxel Flow Network for Video Prediction | Xiaotao Hu · Zhewei Huang · Ailin Huang · Jun Xu · Shuchang Zhou | N/A | Code |
| Neural Dependencies Emerging From Learning Massive Categories | Ruili Feng · Kecheng Zheng · Kai Zhu · Yujun Shen · Jian Zhao · Yukun Huang · Deli Zhao · Jingren Zhou · Michael Jordan · Zheng-Jun Zha | N/A | Code |
| Diverse Embedding Expansion Network and Low-Light Cross-Modality Benchmark for Visible-Infrared Person Re-Identification | Yukang Zhang · Hanzi Wang | N/A | Code |
| Neural Kaleidoscopic Space Sculpting | Byeongjoo Ahn · Michael De Zeeuw · Ioannis Gkioulekas · Aswin C. Sankaranarayanan | N/A | Code |
| PyramidFlow: High-Resolution Defect Contrastive Localization Using Pyramid Normalizing Flow | Jiarui Lei · Xiaobo Hu · Yue Wang · Dong Liu | N/A | Code |
| Masked Motion Encoding for Self-Supervised Video Representation Learning | Xinyu Sun · Peihao Chen · Liangwei Chen · Changhao Li · Thomas H. Li · Mingkui Tan · Chuang Gan | N/A | Code |
| StyleSync: High-Fidelity Generalized and Personalized Lip Sync in Style-Based Generator | Jiazhi Guan · Zhanwang Zhang · Hang Zhou · Tianshu Hu · Kaisiyuan Wang · Dongliang He · Haocheng Feng · Jingtuo Liu · Errui Ding · Ziwei Liu · Jingdong Wang | N/A | Code |
| LiDAR2Map: In Defense of LiDAR-Based Semantic Map Construction Using Online Camera Distillation | Song Wang · Wentong Li · Wenyu Liu · Xiaolu Liu · Jianke Zhu | N/A | Code |
| Symmetric Shape-Preserving Autoencoder for Unsupervised Real Scene Point Cloud Completion | Changfeng Ma · Yinuo Chen · Pengxiao Guo · Jie Guo · Chongjun Wang · Yanwen Guo | N/A | Code |
| Boosting Detection in Crowd Analysis via Underutilized Output Features | Shaokai Wu · Fengyu Yang | N/A | Code |
| Representation Learning for Visual Object Tracking by Masked Appearance Transfer | Haojie Zhao · Dong Wang · Huchuan Lu | N/A | Code |
| NeuralLift-360: Lifting an In-the-Wild 2D Photo to a 3D Object With 360° Views | Dejia Xu · Yifan Jiang · Peihao Wang · Zhiwen Fan · Yi Wang · Zhangyang Wang | N/A | Code |
| DoNet: Deep De-Overlapping Network for Cytology Instance Segmentation | Hao Jiang · Rushan Zhang · Yanning Zhou · Yumeng Wang · Hao Chen | N/A | Code |
| Think Twice Before Driving: Towards Scalable Decoders for End-to-End Autonomous Driving | Xiaosong Jia · Penghao Wu · Li Chen · Jiangwei Xie · Conghui He · Junchi Yan · Hongyang Li | N/A | Code |
| Adversarial Counterfactual Visual Explanations | Guillaume Jeanneret · Loïc Simon · Frédéric Jurie | N/A | Code |
| ALOFT: A Lightweight MLP-Like Architecture With Dynamic Low-Frequency Transform for Domain Generalization | Jintao Guo · Na Wang · Lei Qi · Yinghuan Shi | N/A | Code |
| ShadowNeuS: Neural SDF Reconstruction by Shadow Ray Supervision | Jingwang Ling · Zhibo Wang · Feng Xu | N/A | Code |
| Coaching a Teachable Student | Jimuyang Zhang · Zanming Huang · Eshed Ohn-Bar | N/A | Code |
| POTTER: Pooling Attention Transformer for Efficient Human Mesh Recovery | Ce Zheng · Xianpeng Liu · Guo-Jun Qi · Chen Chen | N/A | Code |
| Layout-Based Causal Inference for Object Navigation | Sixian Zhang · Xinhang Song · Weijie Li · Yubing Bai · Xinyao Yu · Shuqiang Jiang | N/A | Code |
| Towards Bridging the Performance Gaps of Joint Energy-Based Models | Xiulong Yang · Qing Su · Shihao Ji | N/A | Code |
| Bringing Inputs to Shared Domains for 3D Interacting Hands Recovery in the Wild | Gyeongsik Moon | N/A | Code |
| Trajectory-Aware Body Interaction Transformer for Multi-Person Pose Forecasting | Xiaogang Peng · Siyuan Mao · Zizhao Wu | N/A | Code |
| Distilling Vision-Language Pre-Training To Collaborate With Weakly-Supervised Temporal Action Localization | Chen Ju · Kunhao Zheng · Jinxiang Liu · Peisen Zhao · Ya Zhang · Jianlong Chang · Qi Tian · Yanfeng Wang | N/A | Code |
| DiffPose: Toward More Reliable 3D Pose Estimation | Jia Gong · Lin Geng Foo · Zhipeng Fan · Qiuhong Ke · Hossein Rahmani · Jun Liu | N/A | Code |
| SQUID: Deep Feature In-Painting for Unsupervised Anomaly Detection | Tiange Xiang · Yixiao Zhang · Yongyi Lu · Alan L. Yuille · Chaoyi Zhang · Weidong Cai · Zongwei Zhou | N/A | Code |
| On the Difficulty of Unpaired Infrared-to-Visible Video Translation: Fine-Grained Content-Rich Patches Transfer | Zhenjie Yu · Shuang Li · Yirui Shen · Chi Harold Liu · Shuigen Wang | N/A | Code |
| CABM: Content-Aware Bit Mapping for Single Image Super-Resolution Network With Large Input | Senmao Tian · Ming Lu · Jiaming Liu · Yandong Guo · Yurong Chen · Shunli Zhang | N/A | Code |
| NeRF-DS: Neural Radiance Fields for Dynamic Specular Objects | Zhiwen Yan · Chen Li · Gim Hee Lee | N/A | Code |
| Spring: A High-Resolution High-Detail Dataset and Benchmark for Scene Flow, Optical Flow and Stereo | Lukas Mehl · Jenny Schmalfuss · Azin Jahedi · Yaroslava Nalivayko · Andrés Bruhn | N/A | Code |
| Unifying Short and Long-Term Tracking With Graph Hierarchies | Orcun Cetintas · Guillem Brasó · Laura Leal-Taixé | N/A | Code |
| MixTeacher: Mining Promising Labels With Mixed Scale Teacher for Semi-Supervised Object Detection | Liang Liu · Boshen Zhang · Jiangning Zhang · Wuhao Zhang · Zhenye Gan · Guanzhong Tian · Wenbing Zhu · Yabiao Wang · Chengjie Wang | N/A | Code |
| A2J-Transformer: Anchor-to-Joint Transformer Network for 3D Interacting Hand Pose Estimation From a Single RGB Image | Changlong Jiang · Yang Xiao · Cunlin Wu · Mingyang Zhang · Jinghong Zheng · Zhiguo Cao · Joey Tianyi Zhou | N/A | Code |
| Efficient Mask Correction for Click-Based Interactive Image Segmentation | Fei Du · Jianlong Yuan · Zhibin Wang · Fan Wang | N/A | Code |
| OmniObject3D: Large-Vocabulary 3D Object Dataset for Realistic Perception, Reconstruction and Generation | Tong Wu · Jiarui Zhang · Xiao Fu · Yuxin Wang · Jiawei Ren · Liang Pan · Wayne Wu · Lei Yang · Jiaqi Wang · Chen Qian · Dahua Lin · Ziwei Liu | N/A | Code |
| LoGoNet: Towards Accurate 3D Object Detection With Local-to-Global Cross-Modal Fusion | Xin Li · Tao Ma · Yuenan Hou · Botian Shi · Yuchen Yang · Youquan Liu · Xingjiao Wu · Qin Chen · Yikang Li · Yu Qiao · Liang He | N/A | Code |
| 3D Registration With Maximal Cliques | Xiyu Zhang · Jiaqi Yang · Shikun Zhang · Yanning Zhang | N/A | Code |
| Inferring and Leveraging Parts From Object Shape for Improving Semantic Image Synthesis | Yuxiang Wei · Zhilong Ji · Xiaohe Wu · Jinfeng Bai · Lei Zhang · Wangmeng Zuo | N/A | Code |
| Frame-Event Alignment and Fusion Network for High Frame Rate Tracking | Jiqing Zhang · Yuanchen Wang · Wenxi Liu · Meng Li · Jinpeng Bai · Baocai Yin · Xin Yang | N/A | Code |
| Human Guided Ground-Truth Generation for Realistic Image Super-Resolution | Du Chen · Jie Liang · Xindong Zhang · Ming Liu · Hui Zeng · Lei Zhang | N/A | Code |
| Towards Building Self-Aware Object Detectors via Reliable Uncertainty Quantification and Calibration | Kemal Oksuz · Tom Joy · Puneet K. Dokania | N/A | Code |
| Generating Human Motion From Textual Descriptions With Discrete Representations | Jianrong Zhang · Yangsong Zhang · Xiaodong Cun · Yong Zhang · Hongwei Zhao · Hongtao Lu · Xi Shen · Ying Shan | N/A | Code |
| Curricular Contrastive Regularization for Physics-Aware Single Image Dehazing | Yu Zheng · Jiahui Zhan · Shengfeng He · Junyu Dong · Yong Du | N/A | Code |
| Learning Human Mesh Recovery in 3D Scenes | Zehong Shen · Zhi Cen · Sida Peng · Qing Shuai · Hujun Bao · Xiaowei Zhou | N/A | Code |
| Object-Aware Distillation Pyramid for Open-Vocabulary Object Detection | Luting Wang · Yi Liu · Penghui Du · Zihan Ding · Yue Liao · Qiaosong Qi · Biaolong Chen · Si Liu | N/A | Code |
| Towards Artistic Image Aesthetics Assessment: A Large-Scale Dataset and a New Method | Ran Yi · Haoyuan Tian · Zhihao Gu · Yu-Kun Lai · Paul L. Rosin | N/A | Code |
| SOOD: Towards Semi-Supervised Oriented Object Detection | Wei Hua · Dingkang Liang · Jingyu Li · Xiaolong Liu · Zhikang Zou · Xiaoqing Ye · Xiang Bai | N/A | Code |
| Spherical Transformer for LiDAR-Based 3D Recognition | Xin Lai · Yukang Chen · Fanbin Lu · Jianhui Liu · Jiaya Jia | N/A | Code |
| Data-Free Sketch-Based Image Retrieval | Abhra Chaudhuri · Ayan Kumar Bhunia · Yi-Zhe Song · Anjan Dutta | N/A | Code |
| CDDFuse: Correlation-Driven Dual-Branch Feature Decomposition for Multi-Modality Image Fusion | Zixiang Zhao · Haowen Bai · Jiangshe Zhang · Yulun Zhang · Shuang Xu · Zudi Lin · Radu Timofte · Luc Van Gool | N/A | Code |
| Proximal Splitting Adversarial Attack for Semantic Segmentation | Jérôme Rony · Jean-Christophe Pesquet · Ismail Ben Ayed | N/A | Code |
| NeuWigs: A Neural Dynamic Model for Volumetric Hair Capture and Animation | Ziyan Wang · Giljoo Nam · Tuur Stuyck · Stephen Lombardi · Chen Cao · Jason Saragih · Michael Zollhöfer · Jessica Hodgins · Christoph Lassner | N/A | Code |
| Exploring the Effect of Primitives for Compositional Generalization in Vision-and-Language | Chuanhao Li · Zhen Li · Chenchen Jing · Yunde Jia · Yuwei Wu | N/A | Code |
| 3D-Aware Face Swapping | Yixuan Li · Chao Ma · Yichao Yan · Wenhan Zhu · Xiaokang Yang | N/A | Code |
| Representing Volumetric Videos As Dynamic MLP Maps | Sida Peng · Yunzhi Yan · Qing Shuai · Hujun Bao · Xiaowei Zhou | N/A | Code |
| Rethinking the Approximation Error in 3D Surface Fitting for Point Cloud Normal Estimation | Hang Du · Xuejun Yan · Jingjing Wang · Di Xie · Shiliang Pu | N/A | Code |
| Rethinking Out-of-Distribution (OOD) Detection: Masked Image Modeling Is All You Need | Jingyao Li · Pengguang Chen · Zexin He · Shaozuo Yu · Shu Liu · Jiaya Jia | N/A | Code |
| Paint by Example: Exemplar-Based Image Editing With Diffusion Models | Binxin Yang · Shuyang Gu · Bo Zhang · Ting Zhang · Xuejin Chen · Xiaoyan Sun · Dong Chen · Fang Wen | N/A | Code |
| Referring Multi-Object Tracking | Dongming Wu · Wencheng Han · Tiancai Wang · Xingping Dong · Xiangyu Zhang · Jianbing Shen | N/A | Code |
| NerVE: Neural Volumetric Edges for Parametric Curve Extraction From Point Cloud | Xiangyu Zhu · Dong Du · Weikai Chen · Zhiyou Zhao · Yinyu Nie · Xiaoguang Han | N/A | Code |
| AsyFOD: An Asymmetric Adaptation Paradigm for Few-Shot Domain Adaptive Object Detection | Yipeng Gao · Kun-Yu Lin · Junkai Yan · Yaowei Wang · Wei-Shi Zheng | N/A | Code |
| CUF: Continuous Upsampling Filters | Cristina N. Vasconcelos · Cengiz Oztireli · Mark Matthews · Milad Hashemi · Kevin Swersky · Andrea Tagliasacchi | N/A | Code |
| MOTRv2: Bootstrapping End-to-End Multi-Object Tracking by Pretrained Object Detectors | Yuang Zhang · Tiancai Wang · Xiangyu Zhang | N/A | Code |
| CXTrack: Improving 3D Point Cloud Tracking With Contextual Information | Tian-Xing Xu · Yuan-Chen Guo · Yu-Kun Lai · Song-Hai Zhang | N/A | Code |
| Explicit Boundary Guided Semi-Push-Pull Contrastive Learning for Supervised Anomaly Detection | Xincheng Yao · Ruoqi Li · Jing Zhang · Jun Sun · Chongyang Zhang | N/A | Code |
| Learning Bottleneck Concepts in Image Classification | Bowen Wang · Liangzhi Li · Yuta Nakashima · Hajime Nagahara | N/A | Code |
| Zero-Shot Model Diagnosis | Jinqi Luo · Zhaoning Wang · Chen Henry Wu · Dong Huang · Fernando De la Torre | N/A | Code |
| DiffTalk: Crafting Diffusion Models for Generalized Audio-Driven Portraits Animation | Shuai Shen · Wenliang Zhao · Zibin Meng · Wanhua Li · Zheng Zhu · Jie Zhou · Jiwen Lu | N/A | Code |
| DAA: A Delta Age AdaIN Operation for Age Estimation via Binary Code Transformer | Ping Chen · Xingpeng Zhang · Ye Li · Ju Tao · Bin Xiao · Bing Wang · Zongjie Jiang | N/A | Code |
| TranSG: Transformer-Based Skeleton Graph Prototype Contrastive Learning With Structure-Trajectory Prompted Reconstruction for Person Re-Identification | Haocong Rao · Chunyan Miao | N/A | Code |
| Joint Visual Grounding and Tracking With Natural Language Specification | Li Zhou · Zikun Zhou · Kaige Mao · Zhenyu He | N/A | Code |
| Compressing Volumetric Radiance Fields to 1 MB | Lingzhi Li · Zhen Shen · Zhongshu Wang · Li Shen · Liefeng Bo | N/A | Code |
| HyperReel: High-Fidelity 6-DoF Video With Ray-Conditioned Sampling | Benjamin Attal · Jia-Bin Huang · Christian Richardt · Michael Zollhöfer · Johannes Kopf · Matthew O’Toole · Changil Kim | N/A | Code |
| Iterative Next Boundary Detection for Instance Segmentation of Tree Rings in Microscopy Images of Shrub Cross Sections | Alexander Gillert · Giulia Resente · Alba Anadon-Rosell · Martin Wilmking · Uwe Freiherr von Lukas | N/A | Code |
| Ego-Body Pose Estimation via Ego-Head Pose Estimation | Jiaman Li · Karen Liu · Jiajun Wu | N/A | Code |
| Learned Two-Plane Perspective Prior Based Image Resampling for Efficient Object Detection | Anurag Ghosh · N. Dinesh Reddy · Christoph Mertz · Srinivasa G. Narasimhan | N/A | Code |
| PaletteNeRF: Palette-Based Appearance Editing of Neural Radiance Fields | Zhengfei Kuang · Fujun Luan · Sai Bi · Zhixin Shu · Gordon Wetzstein · Kalyan Sunkavalli | N/A | Code |
| Long Range Pooling for 3D Large-Scale Scene Understanding | Xiang-Li Li · Meng-Hao Guo · Tai-Jiang Mu · Ralph R. Martin · Shi-Min Hu | N/A | Code |
| Dynamic Graph Enhanced Contrastive Learning for Chest X-Ray Report Generation | Mingjie Li · Bingqian Lin · Zicong Chen · Haokun Lin · Xiaodan Liang · Xiaojun Chang | N/A | Code |
| Event-Guided Person Re-Identification via Sparse-Dense Complementary Learning | Chengzhi Cao · Xueyang Fu · Hongjian Liu · Yukun Huang · Kunyu Wang · Jiebo Luo · Zheng-Jun Zha | N/A | Code |
| Contrastive Grouping With Transformer for Referring Image Segmentation | Jiajin Tang · Ge Zheng · Cheng Shi · Sibei Yang | N/A | Code |
| Structure Aggregation for Cross-Spectral Stereo Image Guided Denoising | Zehua Sheng · Zhu Yu · Xiongwei Liu · Si-Yuan Cao · Yuqi Liu · Hui-Liang Shen · Huaqi Zhang | N/A | Code |
| Where Is My Spot? Few-Shot Image Generation via Latent Subspace Optimization | Chenxi Zheng · Bangzhen Liu · Huaidong Zhang · Xuemiao Xu · Shengfeng He | N/A | Code |
| EDGE: Editable Dance Generation From Music | Jonathan Tseng · Rodrigo Castellon · Karen Liu | N/A | Code |
| PartSLIP: Low-Shot Part Segmentation for 3D Point Clouds via Pretrained Image-Language Models | Minghua Liu · Yinhao Zhu · Hong Cai · Shizhong Han · Zhan Ling · Fatih Porikli · Hao Su | N/A | Code |
| EDICT: Exact Diffusion Inversion via Coupled Transformations | Bram Wallace · Akash Gokul · Nikhil Naik | N/A | Code |
| Complete 3D Human Reconstruction From a Single Incomplete Image | Junying Wang · Jae Shin Yoon · Tuanfeng Y. Wang · Krishna Kumar Singh · Ulrich Neumann | N/A | Code |
| PartDistillation: Learning Parts From Instance Segmentation | Jang Hyun Cho · Philipp Krähenbühl · Vignesh Ramanathan | N/A | Code |
| Neural Vector Fields: Implicit Representation by Explicit Learning | Xianghui Yang · Guosheng Lin · Zhenghao Chen · Luping Zhou | N/A | Code |
| Unsupervised Inference of Signed Distance Functions From Single Sparse Point Clouds Without Learning Priors | Chao Chen · Yu-Shen Liu · Zhizhong Han | N/A | Code |
| Texts as Images in Prompt Tuning for Multi-Label Image Recognition | Zixian Guo · Bowen Dong · Zhilong Ji · Jinfeng Bai · Yiwen Guo · Wangmeng Zuo | N/A | Code |
| Grad-PU: Arbitrary-Scale Point Cloud Upsampling via Gradient Descent With Learned Distance Functions | Yun He · Danhang Tang · Yinda Zhang · Xiangyang Xue · Yanwei Fu | N/A | Code |
| MMANet: Margin-Aware Distillation and Modality-Aware Regularization for Incomplete Multimodal Learning | Shicai Wei · Chunbo Luo · Yang Luo | N/A | Code |
| Rethinking Optical Flow From Geometric Matching Consistent Perspective | Qiaole Dong · Chenjie Cao · Yanwei Fu | N/A | Code |
| FastInst: A Simple Query-Based Model for Real-Time Instance Segmentation | Junjie He · Pengyu Li · Yifeng Geng · Xuansong Xie | N/A | Code |
| How Can Objects Help Action Recognition? | Xingyi Zhou · Anurag Arnab · Chen Sun · Cordelia Schmid | N/A | Code |
| Images Speak in Images: A Generalist Painter for In-Context Visual Learning | Xinlong Wang · Wen Wang · Yue Cao · Chunhua Shen · Tiejun Huang | N/A | Code |
| SemiCVT: Semi-Supervised Convolutional Vision Transformer for Semantic Segmentation | Huimin Huang · Shiao Xie · Lanfen Lin · Ruofeng Tong · Yen-Wei Chen · Yuexiang Li · Hong Wang · Yawen Huang · Yefeng Zheng | N/A | Code |
| A Unified Pyramid Recurrent Network for Video Frame Interpolation | Xin Jin · Longhai Wu · Jie Chen · Youxin Chen · Jayoon Koo · Cheul-hee Hahm | N/A | Code |
| Enhancing the Self-Universality for Transferable Targeted Attacks | Zhipeng Wei · Jingjing Chen · Zuxuan Wu · Yu-Gang Jiang | N/A | Code |
| Multi-View Inverse Rendering for Large-Scale Real-World Indoor Scenes | Zhen Li · Lingli Wang · Mofang Cheng · Cihui Pan · Jiaqi Yang | N/A | Code |
| TAPS3D: Text-Guided 3D Textured Shape Generation From Pseudo Supervision | Jiacheng Wei · Hao Wang · Jiashi Feng · Guosheng Lin · Kim-Hui Yap | N/A | Code |
| Frequency-Modulated Point Cloud Rendering With Easy Editing | Yi Zhang · Xiaoyang Huang · Bingbing Ni · Teng Li · Wenjun Zhang | N/A | Code |
| Vector Quantization With Self-Attention for Quality-Independent Representation Learning | Zhou Yang · Weisheng Dong · Xin Li · Mengluan Huang · Yulin Sun · Guangming Shi | N/A | Code |
| Fine-Grained Face Swapping via Regional GAN Inversion | Zhian Liu · Maomao Li · Yong Zhang · Cairong Wang · Qi Zhang · Jue Wang · Yongwei Nie | N/A | Code |
| Backdoor Defense via Adaptively Splitting Poisoned Dataset | Kuofeng Gao · Yang Bai · Jindong Gu · Yong Yang · Shu-Tao Xia | N/A | Code |
| RGBD2: Generative Scene Synthesis via Incremental View Inpainting Using RGBD Diffusion Models | Jiabao Lei · Jiapeng Tang · Kui Jia | N/A | Code |
| CLAMP: Prompt-Based Contrastive Learning for Connecting Language and Animal Pose | Xu Zhang · Wen Wang · Zhe Chen · Yufei Xu · Jing Zhang · Dacheng Tao | N/A | Code |
| Fake It Till You Make It: Learning Transferable Representations From Synthetic ImageNet Clones | Mert Bülent Sarıyıldız · Karteek Alahari · Diane Larlus · Yannis Kalantidis | N/A | Code |
| Efficient Frequency Domain-Based Transformers for High-Quality Image Deblurring | Lingshun Kong · Jiangxin Dong · Jianjun Ge · Mingqiang Li · Jinshan Pan | N/A | Code |
| DartBlur: Privacy Preservation With Detection Artifact Suppression | Baowei Jiang · Bing Bai · Haozhe Lin · Yu Wang · Yuchen Guo · Lu Fang | N/A | Code |
| FCC: Feature Clusters Compression for Long-Tailed Visual Recognition | Jian Li · Ziyao Meng · Daqian Shi · Rui Song · Xiaolei Diao · Jingwen Wang · Hao Xu | N/A | Code |
| CLOTH4D: A Dataset for Clothed Human Reconstruction | Xingxing Zou · Xintong Han · Waikeung Wong | N/A | Code |
| LinK: Linear Kernel for LiDAR-Based 3D Perception | Tao Lu · Xiang Ding · Haisong Liu · Gangshan Wu · Limin Wang | N/A | Code |
| Hunting Sparsity: Density-Guided Contrastive Learning for Semi-Supervised Semantic Segmentation | Xiaoyang Wang · Bingfeng Zhang · Limin Yu · Jimin Xiao | N/A | Code |
| Collecting Cross-Modal Presence-Absence Evidence for Weakly-Supervised Audio-Visual Event Perception | Junyu Gao · Mengyuan Chen · Changsheng Xu | N/A | Code |
| LargeKernel3D: Scaling Up Kernels in 3D Sparse CNNs | Yukang Chen · Jianhui Liu · Xiangyu Zhang · Xiaojuan Qi · Jiaya Jia | N/A | Code |
| Deep Learning of Partial Graph Matching via Differentiable Top-K | Runzhong Wang · Ziao Guo · Shaofei Jiang · Xiaokang Yang · Junchi Yan | N/A | Code |
| Analyzing Physical Impacts Using Transient Surface Wave Imaging | Tianyuan Zhang · Mark Sheinin · Dorian Chan · Mark Rau · Matthew O’Toole · Srinivasa G. Narasimhan | N/A | Code |
| Rethinking Domain Generalization for Face Anti-Spoofing: Separability and Alignment | Yiyou Sun · Yaojie Liu · Xiaoming Liu · Yixuan Li · Wen-Sheng Chu | N/A | Code |
| A Simple Baseline for Video Restoration With Grouped Spatial-Temporal Shift | Dasong Li · Xiaoyu Shi · Yi Zhang · Ka Chun Cheung · Simon See · Xiaogang Wang · Hongwei Qin · Hongsheng Li | N/A | Code |
| The ObjectFolder Benchmark: Multisensory Learning With Neural and Real Objects | Ruohan Gao · Yiming Dou · Hao Li · Tanmay Agarwal · Jeannette Bohg · Yunzhu Li · Li Fei-Fei · Jiajun Wu | N/A | Code |
| PIRLNav: Pretraining With Imitation and RL Finetuning for ObjectNav | Ram Ramrakhya · Dhruv Batra · Erik Wijmans · Abhishek Das | N/A | Code |
| DC2: Dual-Camera Defocus Control by Learning To Refocus | Hadi Alzayer · Abdullah Abuolaim · Leung Chun Chan · Yang Yang · Ying Chen Lou · Jia-Bin Huang · Abhishek Kar | N/A | Code |
| Habitat-Matterport 3D Semantics Dataset | Karmesh Yadav · Ram Ramrakhya · Santhosh Kumar Ramakrishnan · Theo Gervet · John Turner · Aaron Gokaslan · Noah Maestre · Angel Xuan Chang · Dhruv Batra · Manolis Savva · Alexander William Clegg · Devendra Singh Chaplot | N/A | Code |
| Prompting Large Language Models With Answer Heuristics for Knowledge-Based Visual Question Answering | Zhenwei Shao · Zhou Yu · Meng Wang · Jun Yu | N/A | Code |
| Similarity Metric Learning for RGB-Infrared Group Re-Identification | Jianghao Xiong · Jianhuang Lai | N/A | Code |
| DPF: Learning Dense Prediction Fields With Weak Supervision | Xiaoxue Chen · Yuhang Zheng · Yupeng Zheng · Qiang Zhou · Hao Zhao · Guyue Zhou · Ya-Qin Zhang | N/A | Code |
| Mixed Autoencoder for Self-Supervised Visual Representation Learning | Kai Chen · Zhili Liu · Lanqing Hong · Hang Xu · Zhenguo Li · Dit-Yan Yeung | N/A | Code |
| Content-Aware Token Sharing for Efficient Semantic Segmentation With Vision Transformers | Chenyang Lu · Daan de Geus · Gijs Dubbelman | N/A | Code |
| NeuralEditor: Editing Neural Radiance Fields via Manipulating Point Clouds | Jun-Kun Chen · Jipeng Lyu · Yu-Xiong Wang | N/A | Code |
| Multiview Compressive Coding for 3D Reconstruction | Chao-Yuan Wu · Justin Johnson · Jitendra Malik · Christoph Feichtenhofer · Georgia Gkioxari | N/A | Code |
| Revisiting Weak-to-Strong Consistency in Semi-Supervised Semantic Segmentation | Lihe Yang · Lei Qi · Litong Feng · Wayne Zhang · Yinghuan Shi | N/A | Code |
| Delving Into Shape-Aware Zero-Shot Semantic Segmentation | Xinyu Liu · Beiwen Tian · Zhen Wang · Rui Wang · Kehua Sheng · Bo Zhang · Hao Zhao · Guyue Zhou | N/A | Code |
| Towards a Smaller Student: Capacity Dynamic Distillation for Efficient Image Retrieval | Yi Xie · Huaidong Zhang · Xuemiao Xu · Jianqing Zhu · Shengfeng He | N/A | Code |
| Bootstrapping Objectness From Videos by Relaxed Common Fate and Visual Grouping | Long Lian · Zhirong Wu · Stella X. Yu | N/A | Code |
| NeuralPCI: Spatio-Temporal Neural Field for 3D Point Cloud Multi-Frame Non-Linear Interpolation | Zehan Zheng · Danni Wu · Ruisi Lu · Fan Lu · Guang Chen · Changjun Jiang | N/A | Code |
| Complete-to-Partial 4D Distillation for Self-Supervised Point Cloud Sequence Representation Learning | Zhuoyang Zhang · Yuhao Dong · Yunze Liu · Li Yi | N/A | Code |
| GFIE: A Dataset and Baseline for Gaze-Following From 2D to 3D in Indoor Environments | Zhengxi Hu · Yuxue Yang · Xiaolin Zhai · Dingye Yang · Bohan Zhou · Jingtai Liu | N/A | Code |
| Cap4Video: What Can Auxiliary Captions Do for Text-Video Retrieval? | Wenhao Wu · Haipeng Luo · Bo Fang · Jingdong Wang · Wanli Ouyang | N/A | Code |
| Hierarchical Temporal Transformer for 3D Hand Pose Estimation and Action Recognition From Egocentric RGB Videos | Yilin Wen · Hao Pan · Lei Yang · Jia Pan · Taku Komura · Wenping Wang | N/A | Code |
| CAP-VSTNet: Content Affinity Preserved Versatile Style Transfer | Linfeng Wen · Chengying Gao · Changqing Zou | N/A | Code |
| Uncurated Image-Text Datasets: Shedding Light on Demographic Bias | Noa Garcia · Yusuke Hirota · Yankun Wu · Yuta Nakashima | N/A | Code |
| AltFreezing for More General Video Face Forgery Detection | Zhendong Wang · Jianmin Bao · Wengang Zhou · Weilun Wang · Houqiang Li | N/A | Code |
| Two-View Geometry Scoring Without Correspondences | Axel Barroso-Laguna · Eric Brachmann · Victor Adrian Prisacariu · Gabriel J. Brostow · Daniyar Turmukhambetov | N/A | Code |
| Task Difficulty Aware Parameter Allocation & Regularization for Lifelong Learning | Wenjin Wang · Yunqing Hu · Qianglong Chen · Yin Zhang | N/A | Code |
| Revisiting Prototypical Network for Cross Domain Few-Shot Learning | Fei Zhou · Peng Wang · Lei Zhang · Wei Wei · Yanning Zhang | N/A | Code |
| Federated Incremental Semantic Segmentation | Jiahua Dong · Duzhen Zhang · Yang Cong · Wei Cong · Henghui Ding · Dengxin Dai | N/A | Code |
| Self-Supervised Learning for Multimodal Non-Rigid 3D Shape Matching | Dongliang Cao · Florian Bernard | N/A | Code |
| Exploring the Relationship Between Architectural Design and Adversarially Robust Generalization | Aishan Liu · Shiyu Tang · Siyuan Liang · Ruihao Gong · Boxi Wu · Xianglong Liu · Dacheng Tao | N/A | Code |
| Video-Text As Game Players: Hierarchical Banzhaf Interaction for Cross-Modal Representation Learning | Peng Jin · Jinfa Huang · Pengfei Xiong · Shangxuan Tian · Chang Liu · Xiangyang Ji · Li Yuan · Jie Chen | N/A | Code |
| pCON: Polarimetric Coordinate Networks for Neural Scene Representations | Henry Peters · Yunhao Ba · Achuta Kadambi | N/A | Code |
| RIAV-MVS: Recurrent-Indexing an Asymmetric Volume for Multi-View Stereo | Changjiang Cai · Pan Ji · Qingan Yan · Yi Xu | N/A | Code |
| Depth Estimation From Camera Image and mmWave Radar Point Cloud | Akash Deep Singh · Yunhao Ba · Ankur Sarker · Howard Zhang · Achuta Kadambi · Stefano Soatto · Mani Srivastava · Alex Wong | N/A | Code |
| Normal-Guided Garment UV Prediction for Human Re-Texturing | Yasamin Jafarian · Tuanfeng Y. Wang · Duygu Ceylan · Jimei Yang · Nathan Carr · Yi Zhou · Hyun Soo Park | N/A | Code |
| WeatherStream: Light Transport Automation of Single Image Deweathering | Howard Zhang · Yunhao Ba · Ethan Yang · Varan Mehra · Blake Gella · Akira Suzuki · Arnold Pfahnl · Chethan Chinder Chandrappa · Alex Wong · Achuta Kadambi | N/A | Code |
| MobileBrick: Building LEGO for 3D Reconstruction on Mobile Devices | Kejie Li · Jia-Wang Bian · Robert Castle · Philip H.S. Torr · Victor Adrian Prisacariu | N/A | Code |
| Ultrahigh Resolution Image/Video Matting With Spatio-Temporal Sparsity | Yanan Sun · Chi-Keung Tang · Yu-Wing Tai | N/A | Code |
| Hierarchical Supervision and Shuffle Data Augmentation for 3D Semi-Supervised Object Detection | Chuandong Liu · Chenqiang Gao · Fangcen Liu · Pengcheng Li · Deyu Meng · Xinbo Gao | N/A | Code |
| PATS: Patch Area Transportation With Subdivision for Local Feature Matching | Junjie Ni · Yijin Li · Zhaoyang Huang · Hongsheng Li · Hujun Bao · Zhaopeng Cui · Guofeng Zhang | N/A | Code |
| SINE: Semantic-Driven Image-Based NeRF Editing With Prior-Guided Editing Field | Chong Bao · Yinda Zhang · Bangbang Yang · Tianxing Fan · Zesong Yang · Hujun Bao · Guofeng Zhang · Zhaopeng Cui | N/A | Code |
| GeoNet: Benchmarking Unsupervised Adaptation Across Geographies | Tarun Kalluri · Wangdong Xu · Manmohan Chandraker | N/A | Code |
| Joint HDR Denoising and Fusion: A Real-World Mobile HDR Image Dataset | Shuaizheng Liu · Xindong Zhang · Lingchen Sun · Zhetong Liang · Hui Zeng · Lei Zhang | N/A | Code |
| 3D-Aware Object Goal Navigation via Simultaneous Exploration and Identification | Jiazhao Zhang · Liu Dai · Fanpeng Meng · Qingnan Fan · Xuelin Chen · Kai Xu · He Wang | N/A | Code |
| Delving Into Discrete Normalizing Flows on SO(3) Manifold for Probabilistic Rotation Modeling | Yulin Liu · Haoran Liu · Yingda Yin · Yang Wang · Baoquan Chen · He Wang | N/A | Code |
| RILS: Masked Visual Reconstruction in Language Semantic Space | Shusheng Yang · Yixiao Ge · Kun Yi · Dian Li · Ying Shan · Xiaohu Qie · Xinggang Wang | N/A | Code |
| ConQueR: Query Contrast Voxel-DETR for 3D Object Detection | Benjin Zhu · Zhe Wang · Shaoshuai Shi · Hang Xu · Lanqing Hong · Hongsheng Li | N/A | Code |
| PREIM3D: 3D Consistent Precise Image Attribute Editing From a Single Image | Jianhui Li · Jianmin Li · Haoji Zhang · Shilong Liu · Zhengyi Wang · Zihao Xiao · Kaiwen Zheng · Jun Zhu | N/A | Code |
| Bridging Search Region Interaction With Template for RGB-T Tracking | Tianrui Hui · Zizheng Xun · Fengguang Peng · Junshi Huang · Xiaoming Wei · Xiaolin Wei · Jiao Dai · Jizhong Han · Si Liu | N/A | Code |
| Improving Weakly Supervised Temporal Action Localization by Bridging Train-Test Gap in Pseudo Labels | Jingqiu Zhou · Linjiang Huang · Liang Wang · Si Liu · Hongsheng Li | N/A | Code |
| Learning To Zoom and Unzoom | Chittesh Thavamani · Mengtian Li · Francesco Ferroni · Deva Ramanan | N/A | Code |
| MaLP: Manipulation Localization Using a Proactive Scheme | Vishal Asnani · Xi Yin · Tal Hassner · Xiaoming Liu | N/A | Code |
| Logical Consistency and Greater Descriptive Power for Facial Hair Attribute Learning | Haiyu Wu · Grace Bezold · Aman Bhatta · Kevin W. Bowyer | N/A | Code |
| Visual-Tactile Sensing for In-Hand Object Reconstruction | Wenqiang Xu · Zhenjun Yu · Han Xue · Ruolin Ye · Siqiong Yao · Cewu Lu | N/A | Code |
| Filtering, Distillation, and Hard Negatives for Vision-Language Pre-Training | Filip Radenovic · Abhimanyu Dubey · Abhishek Kadian · Todor Mihaylov · Simon Vandenhende · Yash Patel · Yi Wen · Vignesh Ramanathan · Dhruv Mahajan | N/A | Code |
| Semi-Supervised Domain Adaptation With Source Label Adaptation | Yu-Chu Yu · Hsuan-Tien Lin | N/A | Code |
| Self-Supervised Video Forensics by Audio-Visual Anomaly Detection | Chao Feng · Ziyang Chen · Andrew Owens | N/A | Code |
| IterativePFN: True Iterative Point Cloud Filtering | Dasith de Silva Edirimuni · Xuequan Lu · Zhiwen Shao · Gang Li · Antonio Robles-Kelly · Ying He | N/A | Code |
| Joint Video Multi-Frame Interpolation and Deblurring Under Unknown Exposure Time | Wei Shang · Dongwei Ren · Yi Yang · Hongzhi Zhang · Kede Ma · Wangmeng Zuo | N/A | Code |
| Three Guidelines You Should Know for Universally Slimmable Self-Supervised Learning | Yun-Hao Cao · Peiqin Sun · Shuchang Zhou | N/A | Code |
| Standing Between Past and Future: Spatio-Temporal Modeling for Multi-Camera 3D Multi-Object Tracking | Ziqi Pang · Jie Li · Pavel Tokmakov · Dian Chen · Sergey Zagoruyko · Yu-Xiong Wang | N/A | Code |
| VecFontSDF: Learning To Reconstruct and Synthesize High-Quality Vector Fonts via Signed Distance Functions | Zeqing Xia · Bojun Xiong · Zhouhui Lian | N/A | Code |
| Towards Better Gradient Consistency for Neural Signed Distance Functions via Level Set Alignment | Baorui Ma · Junsheng Zhou · Yu-Shen Liu · Zhizhong Han | N/A | Code |
| Visual-Language Prompt Tuning With Knowledge-Guided Context Optimization | Hantao Yao · Rui Zhang · Changsheng Xu | N/A | Code |
| Compositor: Bottom-Up Clustering and Compositing for Robust Part and Object Segmentation | Ju He · Jieneng Chen · Ming-Xian Lin · Qihang Yu · Alan L. Yuille | N/A | Code |
| Physics-Guided ISO-Dependent Sensor Noise Modeling for Extreme Low-Light Photography | Yue Cao · Ming Liu · Shuai Liu · Xiaotao Wang · Lei Lei · Wangmeng Zuo | N/A | Code |
| Dynamic Focus-Aware Positional Queries for Semantic Segmentation | Haoyu He · Jianfei Cai · Zizheng Pan · Jing Liu · Jing Zhang · Dacheng Tao · Bohan Zhuang | N/A | Code |
| Generic-to-Specific Distillation of Masked Autoencoders | Wei Huang · Zhiliang Peng · Li Dong · Furu Wei · Jianbin Jiao · Qixiang Ye | N/A | Code |
| Benchmarking Robustness of 3D Object Detection to Common Corruptions | Yinpeng Dong · Caixin Kang · Jinlai Zhang · Zijian Zhu · Yikai Wang · Xiao Yang · Hang Su · Xingxing Wei · Jun Zhu | N/A | Code |
| GarmentTracking: Category-Level Garment Pose Tracking | Han Xue · Wenqiang Xu · Jieyi Zhang · Tutian Tang · Yutong Li · Wenxin Du · Ruolin Ye · Cewu Lu | N/A | Code |
| TrojDiff: Trojan Attacks on Diffusion Models With Diverse Targets | Weixin Chen · Dawn Song · Bo Li | N/A | Code |
| Weakly Supervised Video Representation Learning With Unaligned Text for Sequential Videos | Sixun Dong · Huazhang Hu · Dongze Lian · Weixin Luo · Yicheng Qian · Shenghua Gao | N/A | Code |
| Generalized Deep 3D Shape Prior via Part-Discretized Diffusion Process | Yuhan Li · Yishun Dou · Xuanhong Chen · Bingbing Ni · Yilin Sun · Yutian Liu · Fuzhen Wang | N/A | Code |
| SpaText: Spatio-Textual Representation for Controllable Image Generation | Omri Avrahami · Thomas Hayes · Oran Gafni · Sonal Gupta · Yaniv Taigman · Devi Parikh · Dani Lischinski · Ohad Fried · Xi Yin | N/A | Code |
| Watch or Listen: Robust Audio-Visual Speech Recognition With Visual Corruption Modeling and Reliability Scoring | Joanna Hong · Minsu Kim · Jeongsoo Choi · Yong Man Ro | N/A | Code |
| RenderDiffusion: Image Diffusion for 3D Reconstruction, Inpainting and Generation | Titas Anciukevičius · Zexiang Xu · Matthew Fisher · Paul Henderson · Hakan Bilen · Niloy J. Mitra · Paul Guerrero | N/A | Code |
| Self-Supervised 3D Scene Flow Estimation Guided by Superpoints | Yaqi Shen · Le Hui · Jin Xie · Jian Yang | N/A | Code |
| Adaptive Annealing for Robust Geometric Estimation | Chitturi Sidhartha · Lalit Manam · Venu Madhav Govindu | N/A | Code |
| Spectral Enhanced Rectangle Transformer for Hyperspectral Image Denoising | Miaoyu Li · Ji Liu · Ying Fu · Yulun Zhang · Dejing Dou | N/A | Code |
| Partial Network Cloning | Jingwen Ye · Songhua Liu · Xinchao Wang | N/A | Code |
| Twin Contrastive Learning With Noisy Labels | Zhizhong Huang · Junping Zhang · Hongming Shan | N/A | Code |
| Ambiguous Medical Image Segmentation Using Diffusion Models | Aimon Rahman · Jeya Maria Jose Valanarasu · Ilker Hacihaliloglu · Vishal M. Patel | N/A | Code |
| High-Res Facial Appearance Capture From Polarized Smartphone Images | Dejan Azinović · Olivier Maury · Christophe Hery · Matthias Nießner · Justus Thies | N/A | Code |
| AssemblyHands: Towards Egocentric Activity Understanding via 3D Hand Pose Estimation | Takehiko Ohkawa · Kun He · Fadime Sener · Tomas Hodan · Luan Tran · Cem Keskin | N/A | Code |
| EXIF As Language: Learning Cross-Modal Associations Between Images and Camera Metadata | Chenhao Zheng · Ayush Shrivastava · Andrew Owens | N/A | Code |
| Latent-NeRF for Shape-Guided Generation of 3D Shapes and Textures | Gal Metzer · Elad Richardson · Or Patashnik · Raja Giryes · Daniel Cohen-Or | N/A | Code |
| Rebalancing Batch Normalization for Exemplar-Based Class-Incremental Learning | Sungmin Cha · Sungjun Cho · Dasol Hwang · Sunwon Hong · Moontae Lee · Taesup Moon | N/A | Code |
| Progressive Neighbor Consistency Mining for Correspondence Pruning | Xin Liu · Jufeng Yang | N/A | Code |
| Post-Training Quantization on Diffusion Models | Yuzhang Shang · Zhihang Yuan · Bin Xie · Bingzhe Wu · Yan Yan | N/A | Code |
| Fully Self-Supervised Depth Estimation From Defocus Clue | Haozhe Si · Bin Zhao · Dong Wang · Yunpeng Gao · Mulin Chen · Zhigang Wang · Xuelong Li | N/A | Code |
| Curricular Object Manipulation in LiDAR-Based Object Detection | Ziyue Zhu · Qiang Meng · Xiao Wang · Ke Wang · Liujiang Yan · Jian Yang | N/A | Code |
| Adaptive Assignment for Geometry Aware Local Feature Matching | Dihe Huang · Ying Chen · Yong Liu · Jianlin Liu · Shang Xu · Wenlong Wu · Yikang Ding · Fan Tang · Chengjie Wang | N/A | Code |
| RefCLIP: A Universal Teacher for Weakly Supervised Referring Expression Comprehension | Lei Jin · Gen Luo · Yiyi Zhou · Xiaoshuai Sun · Guannan Jiang · Annan Shu · Rongrong Ji | N/A | Code |
| ANetQA: A Large-Scale Benchmark for Fine-Grained Compositional Reasoning Over Untrimmed Videos | Zhou Yu · Lixiang Zheng · Zhou Zhao · Fei Wu · Jianping Fan · Kui Ren · Jun Yu | N/A | Code |
| GD-MAE: Generative Decoder for MAE Pre-Training on LiDAR Point Clouds | Honghui Yang · Tong He · Jiaheng Liu · Hua Chen · Boxi Wu · Binbin Lin · Xiaofei He · Wanli Ouyang | N/A | Code |
| Multimodal Industrial Anomaly Detection via Hybrid Fusion | Yue Wang · Jinlong Peng · Jiangning Zhang · Ran Yi · Yabiao Wang · Chengjie Wang | N/A | Code |
| B-Spline Texture Coefficients Estimator for Screen Content Image Super-Resolution | Byeonghyun Pak · Jaewon Lee · Kyong Hwan Jin | N/A | Code |
| CLIP Is Also an Efficient Segmenter: A Text-Driven Approach for Weakly Supervised Semantic Segmentation | Yuqi Lin · Minghao Chen · Wenxiao Wang · Boxi Wu · Ke Li · Binbin Lin · Haifeng Liu · Xiaofei He | N/A | Code |
| MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Generation | Ludan Ruan · Yiyang Ma · Huan Yang · Huiguo He · Bei Liu · Jianlong Fu · Nicholas Jing Yuan · Qin Jin · Baining Guo | N/A | Code |
| FreeNeRF: Improving Few-Shot Neural Rendering With Free Frequency Regularization | Jiawei Yang · Marco Pavone · Yue Wang | N/A | Code |
| SteerNeRF: Accelerating NeRF Rendering via Smooth Viewpoint Trajectory | Sicheng Li · Hao Li · Yue Wang · Yiyi Liao · Lu Yu | N/A | Code |
| Run, Don’t Walk: Chasing Higher FLOPS for Faster Neural Networks | Jierun Chen · Shiu-hong Kao · Hao He · Weipeng Zhuo · Song Wen · Chul-Ho Lee · S.-H. Gary Chan | N/A | Code |
| Temporal Attention Unit: Towards Efficient Spatiotemporal Predictive Learning | Cheng Tan · Zhangyang Gao · Lirong Wu · Yongjie Xu · Jun Xia · Siyuan Li · Stan Z. Li | N/A | Code |
| Semi-Supervised 2D Human Pose Estimation Driven by Position Inconsistency Pseudo Label Correction Module | Linzhi Huang · Yulong Li · Hongbo Tian · Yue Yang · Xiangang Li · Weihong Deng · Jieping Ye | N/A | Code |
| Good Is Bad: Causality Inspired Cloth-Debiasing for Cloth-Changing Person Re-Identification | Zhengwei Yang · Meng Lin · Xian Zhong · Yu Wu · Zheng Wang | N/A | Code |
| Feature Alignment and Uniformity for Test Time Adaptation | Shuai Wang · Daoan Zhang · Zipei Yan · Jianguo Zhang · Rui Li | N/A | Code |
| AeDet: Azimuth-Invariant Multi-View 3D Object Detection | Chengjian Feng · Zequn Jie · Yujie Zhong · Xiangxiang Chu · Lin Ma | N/A | Code |
| Towards Realistic Long-Tailed Semi-Supervised Learning: Consistency Is All You Need | Tong Wei · Kai Gan | N/A | Code |
| OmniAL: A Unified CNN Framework for Unsupervised Anomaly Localization | Ying Zhao | N/A | Code |
| HIER: Metric Learning Beyond Class Labels via Hierarchical Regularization | Sungyeon Kim · Boseung Jeong · Suha Kwak | N/A | Code |
| Generative Diffusion Prior for Unified Image Restoration and Enhancement | Ben Fei · Zhaoyang Lyu · Liang Pan · Junzhe Zhang · Weidong Yang · Tianyue Luo · Bo Zhang · Bo Dai | N/A | Code |
| Discriminating Known From Unknown Objects via Structure-Enhanced Recurrent Variational AutoEncoder | Aming Wu · Cheng Deng | N/A | Code |
| 2PCNet: Two-Phase Consistency Training for Day-to-Night Unsupervised Domain Adaptive Object Detection | Mikhail Kennerley · Jian-Gang Wang · Bharadwaj Veeravalli · Robby T. Tan | N/A | Code |
| Linking Garment With Person via Semantically Associated Landmarks for Virtual Try-On | Keyu Yan · Tingwei Gao · Hui Zhang · Chengjun Xie | N/A | Code |
| A New Comprehensive Benchmark for Semi-Supervised Video Anomaly Detection and Anticipation | Congqi Cao · Yue Lu · Peng Wang · Yanning Zhang | N/A | Code |
| DINER: Depth-Aware Image-Based NEural Radiance Fields | Malte Prinzler · Otmar Hilliges · Justus Thies | N/A | Code |
| Learning Personalized High Quality Volumetric Head Avatars From Monocular RGB Videos | Ziqian Bai · Feitong Tan · Zeng Huang · Kripasindhu Sarkar · Danhang Tang · Di Qiu · Abhimitra Meka · Ruofei Du · Mingsong Dou · Sergio Orts-Escolano · Rohit Pandey · Ping Tan · Thabo Beeler · Sean Fanello · Yinda Zhang | N/A | Code |
| HOOD: Hierarchical Graphs for Generalized Modelling of Clothing Dynamics | Artur Grigorev · Michael J. Black · Otmar Hilliges | N/A | Code |
| Boundary-Enhanced Co-Training for Weakly Supervised Semantic Segmentation | Shenghai Rong · Bohai Tu · Zilei Wang · Junjie Li | N/A | Code |
| Instant Volumetric Head Avatars | Wojciech Zielonka · Timo Bolkart · Justus Thies | N/A | Code |
| From Node Interaction To Hop Interaction: New Effective and Scalable Graph Learning Paradigm | Jie Chen · Zilong Li · Yin Zhu · Junping Zhang · Jian Pu | N/A | Code |
| Transfer4D: A Framework for Frugal Motion Capture and Deformation Transfer | Shubh Maheshwari · Rahul Narain · Ramya Hebbalaguppe | N/A | Code |
| An In-Depth Exploration of Person Re-Identification and Gait Recognition in Cloth-Changing Conditions | Weijia Li · Saihui Hou · Chunjie Zhang · Chunshui Cao · Xu Liu · Yongzhen Huang · Yao Zhao | N/A | Code |
| Event-Based Shape From Polarization | Manasi Muglikar · Leonard Bauersfeld · Diederik Paul Moeys · Davide Scaramuzza | N/A | Code |
| Plateau-Reduced Differentiable Path Tracing | Michael Fischer · Tobias Ritschel | N/A | Code |
| End-to-End Video Matting With Trimap Propagation | Wei-Lun Huang · Ming-Sui Lee | N/A | Code |
| Weakly-Supervised Single-View Image Relighting | Renjiao Yi · Chenyang Zhu · Kai Xu | N/A | Code |
| Learning Audio-Visual Source Localization via False Negative Aware Contrastive Learning | Weixuan Sun · Jiayi Zhang · Jianyuan Wang · Zheyuan Liu · Yiran Zhong · Tianpeng Feng · Yandong Guo · Yanhao Zhang · Nick Barnes | N/A | Code |
| Non-Contrastive Unsupervised Learning of Physiological Signals From Video | Jeremy Speth · Nathan Vance · Patrick Flynn · Adam Czajka | N/A | Code |
| Structured Sparsity Learning for Efficient Video Super-Resolution | Bin Xia · Jingwen He · Yulun Zhang · Yitong Wang · Yapeng Tian · Wenming Yang · Luc Van Gool | N/A | Code |
| ReVISE: Self-Supervised Speech Resynthesis With Visual Input for Universal and Generalized Speech Regeneration | Wei-Ning Hsu · Tal Remez · Bowen Shi · Jacob Donley · Yossi Adi | N/A | Code |
| Shape, Pose, and Appearance From a Single Image via Bootstrapped Radiance Field Inversion | Dario Pavllo · David Joseph Tan · Marie-Julie Rakotosaona · Federico Tombari | N/A | Code |
| Towards Better Decision Forests: Forest Alternating Optimization | Miguel Á. Carreira-Perpiñán · Magzhan Gabidolla · Arman Zharmagambetov | N/A | Code |
| CrOC: Cross-View Online Clustering for Dense Visual Representation Learning | Thomas Stegmüller · Tim Lebailly · Behzad Bozorgtabar · Tinne Tuytelaars · Jean-Philippe Thiran | N/A | Code |
| Polynomial Implicit Neural Representations for Large Diverse Datasets | Rajhans Singh · Ankita Shukla · Pavan Turaga | N/A | Code |
| GradICON: Approximate Diffeomorphisms via Gradient Inverse Consistency | Lin Tian · Hastings Greer · François-Xavier Vialard · Roland Kwitt · Raúl San José Estépar · Richard Jarrett Rushmore · Nikolaos Makris · Sylvain Bouix · Marc Niethammer | N/A | Code |
| Exploring Discontinuity for Video Frame Interpolation | Sangjin Lee · Hyeongmin Lee · Chajin Shin · Hanbin Son · Sangyoun Lee | N/A | Code |
| Local 3D Editing via 3D Distillation of CLIP Knowledge | Junha Hyung · Sungwon Hwang · Daejin Kim · Hyunji Lee · Jaegul Choo | N/A | Code |
| Dynamic Conceptional Contrastive Learning for Generalized Category Discovery | Nan Pu · Zhun Zhong · Nicu Sebe | N/A | Code |
| Look, Radiate, and Learn: Self-Supervised Localisation via Radio-Visual Correspondence | Mohammed Alloulah · Maximilian Arnold | N/A | Code |
| Instance Relation Graph Guided Source-Free Domain Adaptive Object Detection | Vibashan VS · Poojan Oza · Vishal M. Patel | N/A | Code |
| High Fidelity 3D Hand Shape Reconstruction via Scalable Graph Frequency Decomposition | Tianyu Luan · Yuanhao Zhai · Jingjing Meng · Zhong Li · Zhang Chen · Yi Xu · Junsong Yuan | N/A | Code |
| 3D Highlighter: Localizing Regions on 3D Shapes via Text Descriptions | Dale Decatur · Itai Lang · Rana Hanocka | N/A | Code |
| Egocentric Video Task Translation | Zihui Xue · Yale Song · Kristen Grauman · Lorenzo Torresani | N/A | Code |
| Pixels, Regions, and Objects: Multiple Enhancement for Salient Object Detection | Yi Wang · Ruili Wang · Xin Fan · Tianzhu Wang · Xiangjian He | N/A | Code |
| Balanced Energy Regularization Loss for Out-of-Distribution Detection | Hyunjun Choi · Hawook Jeong · Jin Young Choi | N/A | Code |
| Private Image Generation With Dual-Purpose Auxiliary Classifier | Chen Chen · Daochang Liu · Siqi Ma · Surya Nepal · Chang Xu | N/A | Code |
| Controllable Mesh Generation Through Sparse Latent Point Diffusion Models | Zhaoyang Lyu · Jinyi Wang · Yuwei An · Ya Zhang · Dahua Lin · Bo Dai | N/A | Code |
| Neural Video Compression With Diverse Contexts | Jiahao Li · Bin Li · Yan Lu | N/A | Code |
| Uni3D: A Unified Baseline for Multi-Dataset 3D Object Detection | Bo Zhang · Jiakang Yuan · Botian Shi · Tao Chen · Yikang Li · Yu Qiao | N/A | Code |
| ScarceNet: Animal Pose Estimation With Scarce Annotations | Chen Li · Gim Hee Lee | N/A | Code |
| Fast Contextual Scene Graph Generation With Unbiased Context Augmentation | Tianlei Jin · Fangtai Guo · Qiwei Meng · Shiqiang Zhu · Xiangming Xi · Wen Wang · Zonghao Mu · Wei Song | N/A | Code |
| TriDet: Temporal Action Detection With Relative Boundary Modeling | Dingfeng Shi · Yujie Zhong · Qiong Cao · Lin Ma · Jia Li · Dacheng Tao | N/A | Code |
| Multi-Level Logit Distillation | Ying Jin · Jiaqi Wang · Dahua Lin | N/A | Code |
| StyleAdv: Meta Style Adversarial Training for Cross-Domain Few-Shot Learning | Yuqian Fu · Yu Xie · Yanwei Fu · Yu-Gang Jiang | N/A | Code |
| Text With Knowledge Graph Augmented Transformer for Video Captioning | Xin Gu · Guang Chen · Yufei Wang · Libo Zhang · Tiejian Luo · Longyin Wen | N/A | Code |
| Semantic Ray: Learning a Generalizable Semantic Field With Cross-Reprojection Attention | Fangfu Liu · Chubin Zhang · Yu Zheng · Yueqi Duan | N/A | Code |
| MELTR: Meta Loss Transformer for Learning To Fine-Tune Video Foundation Models | Dohwan Ko · Joonmyung Choi · Hyeong Kyu Choi · Kyoung-Woon On · Byungseok Roh · Hyunwoo J. Kim | N/A | Code |
| Self-Supervised AutoFlow | Hsin-Ping Huang · Charles Herrmann · Junhwa Hur · Erika Lu · Kyle Sargent · Austin Stone · Ming-Hsuan Yang · Deqing Sun | N/A | Code |
| Adaptive Sparse Convolutional Networks With Global Context Enhancement for Faster Object Detection on Drone Images | Bowei Du · Yecheng Huang · Jiaxin Chen · Di Huang | N/A | Code |
| Context-Based Trit-Plane Coding for Progressive Image Compression | Seungmin Jeon · Kwang Pyo Choi · Youngo Park · Chang-Su Kim | N/A | Code |
| Unsupervised Contour Tracking of Live Cells by Mechanical and Cycle Consistency Losses | Junbong Jang · Kwonmoo Lee · Tae-Kyun Kim | N/A | Code |
| VQACL: A Novel Visual Question Answering Continual Learning Setting | Xi Zhang · Feifei Zhang · Changsheng Xu | N/A | Code |
| Explicit Visual Prompting for Low-Level Structure Segmentations | Weihuang Liu · Xi Shen · Chi-Man Pun · Xiaodong Cun | N/A | Code |
| Practical Network Acceleration With Tiny Sets | Guo-Hua Wang · Jianxin Wu | N/A | Code |
| Sphere-Guided Training of Neural Implicit Surfaces | Andreea Dogaru · Andrei-Timotei Ardelean · Savva Ignatyev · Egor Zakharov · Evgeny Burnaev | N/A | Code |
| Texture-Guided Saliency Distilling for Unsupervised Salient Object Detection | Huajun Zhou · Bo Qiao · Lingxiao Yang · Jianhuang Lai · Xiaohua Xie | N/A | Code |
| FFHQ-UV: Normalized Facial UV-Texture Dataset for 3D Face Reconstruction | Haoran Bai · Di Kang · Haoxian Zhang · Jinshan Pan · Linchao Bao | N/A | Code |
| Differentiable Shadow Mapping for Efficient Inverse Graphics | Markus Worchel · Marc Alexa | N/A | Code |
| SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation | Wenxuan Zhang · Xiaodong Cun · Xuan Wang · Yong Zhang · Xi Shen · Yu Guo · Ying Shan · Fei Wang | N/A | Code |
| High-Fidelity and Freely Controllable Talking Head Video Generation | Yue Gao · Yuan Zhou · Jinglu Wang · Xiao Li · Xiang Ming · Yan Lu | N/A | Code |
| BiFormer: Learning Bilateral Motion Estimation via Bilateral Transformer for 4K Video Frame Interpolation | Junheum Park · Jintae Kim · Chang-Su Kim | N/A | Code |
| Noisy Correspondence Learning With Meta Similarity Correction | Haochen Han · Kaiyao Miao · Qinghua Zheng · Minnan Luo | N/A | Code |
| EVAL: Explainable Video Anomaly Localization | Ashish Singh · Michael J. Jones · Erik G. Learned-Miller | N/A | Code |
| Adaptive Plasticity Improvement for Continual Learning | Yan-Shuo Liang · Wu-Jun Li | N/A | Code |
| Edges to Shapes to Concepts: Adversarial Augmentation for Robust Vision | Aditay Tripathi · Rishubh Singh · Anirban Chakraborty · Pradeep Shenoy | N/A | Code |
| MOSO: Decomposing MOtion, Scene and Object for Video Prediction | Mingzhen Sun · Weining Wang · Xinxin Zhu · Jing Liu | N/A | Code |
| Accelerated Coordinate Encoding: Learning to Relocalize in Minutes Using RGB and Poses | Eric Brachmann · Tommaso Cavallari · Victor Adrian Prisacariu | N/A | Code |
| A Probabilistic Attention Model With Occlusion-Aware Texture Regression for 3D Hand Reconstruction From a Single RGB Image | Zheheng Jiang · Hossein Rahmani · Sue Black · Bryan M. Williams | N/A | Code |
| Revisiting Rotation Averaging: Uncertainties and Robust Losses | Ganlin Zhang · Viktor Larsson · Daniel Barath | N/A | Code |
| LiDAR-in-the-Loop Hyperparameter Optimization | Félix Goudreault · Dominik Scheuble · Mario Bijelic · Nicolas Robidoux · Felix Heide | N/A | Code |
| Query-Dependent Video Representation for Moment Retrieval and Highlight Detection | WonJun Moon · Sangeek Hyun · SangUk Park · Dongchan Park · Jae-Pil Heo | N/A | Code |
| High-Fidelity 3D Face Generation From Natural Language Descriptions | Menghua Wu · Hao Zhu · Linjia Huang · Yiyu Zhuang · Yuanxun Lu · Xun Cao | N/A | Code |
| NeRF-Supervised Deep Stereo | Fabio Tosi · Alessio Tonioni · Daniele De Gregorio · Matteo Poggi | N/A | Code |
| vMAP: Vectorised Object Mapping for Neural Field SLAM | Xin Kong · Shikun Liu · Marwan Taher · Andrew J. Davison | N/A | Code |
| DiffRF: Rendering-Guided 3D Radiance Field Diffusion | Norman Müller · Yawar Siddiqui · Lorenzo Porzi · Samuel Rota Bulò · Peter Kontschieder · Matthias Nießner | N/A | Code |
| TokenHPE: Learning Orientation Tokens for Efficient Head Pose Estimation via Transformers | Cheng Zhang · Hai Liu · Yongjian Deng · Bochen Xie · Youfu Li | N/A | Code |
| Learning a Depth Covariance Function | Eric Dexheimer · Andrew J. Davison | N/A | Code |
| Handy: Towards a High Fidelity 3D Hand Shape and Appearance Model | Rolandos Alexandros Potamias · Stylianos Ploumpis · Stylianos Moschoglou · Vasileios Triantafyllou · Stefanos Zafeiriou | N/A | Code |
| The Best Defense Is a Good Offense: Adversarial Augmentation Against Adversarial Attacks | Iuri Frosio · Jan Kautz | N/A | Code |
| Test of Time: Instilling Video-Language Models With a Sense of Time | Piyush Bagad · Makarand Tapaswi · Cees G. M. Snoek | N/A | Code |
| BundleSDF: Neural 6-DoF Tracking and 3D Reconstruction of Unknown Objects | Bowen Wen · Jonathan Tremblay · Valts Blukis · Stephen Tyree · Thomas Müller · Alex Evans · Dieter Fox · Jan Kautz · Stan Birchfield | N/A | Code |
| Leveraging Hidden Positives for Unsupervised Semantic Segmentation | Hyun Seok Seong · WonJun Moon · SuBeen Lee · Jae-Pil Heo | N/A | Code |
| BlendFields: Few-Shot Example-Driven Facial Modeling | Kacper Kania · Stephan J. Garbin · Andrea Tagliasacchi · Virginia Estellers · Kwang Moo Yi · Julien Valentin · Tomasz Trzciński · Marek Kowalski | N/A | Code |
| CIRCLE: Capture in Rich Contextual Environments | João Pedro Araújo · Jiaman Li · Karthik Vetrivel · Rishi Agarwal · Jiajun Wu · Deepak Gopinath · Alexander William Clegg · Karen Liu | N/A | Code |
| Realistic Saliency Guided Image Enhancement | S. Mahdi H. Miangoleh · Zoya Bylinskii · Eric Kee · Eli Shechtman · Yağız Aksoy | N/A | Code |
| Implicit Neural Head Synthesis via Controllable Local Deformation Fields | Chuhan Chen · Matthew O’Toole · Gaurav Bharaj · Pablo Garrido | N/A | Code |
| Ensemble-Based Blackbox Attacks on Dense Prediction | Zikui Cai · Yaoteng Tan · M. Salman Asif | N/A | Code |
| NaQ: Leveraging Narrations As Queries To Supervise Episodic Memory | Santhosh Kumar Ramakrishnan · Ziad Al-Halah · Kristen Grauman | N/A | Code |
| Rethinking Federated Learning With Domain Shift: A Prototype View | Wenke Huang · Mang Ye · Zekun Shi · He Li · Bo Du | N/A | Code |
| Spatio-Temporal Pixel-Level Contrastive Learning-Based Source-Free Domain Adaptation for Video Semantic Segmentation | Shao-Yuan Lo · Poojan Oza · Sumanth Chennupati · Alejandro Galindo · Vishal M. Patel | N/A | Code |
| Bi3D: Bi-Domain Active Learning for Cross-Domain 3D Object Detection | Jiakang Yuan · Bo Zhang · Xiangchao Yan · Tao Chen · Botian Shi · Yikang Li · Yu Qiao | N/A | Code |
| STAR Loss: Reducing Semantic Ambiguity in Facial Landmark Detection | Zhenglin Zhou · Huaxia Li · Hong Liu · Nanyang Wang · Gang Yu · Rongrong Ji | N/A | Code |
| Diverse 3D Hand Gesture Prediction From Body Dynamics by Bilateral Hand Disentanglement | Xingqun Qi · Chen Liu · Muyi Sun · Lincheng Li · Changjie Fan · Xin Yu | N/A | Code |
| Sparsely Annotated Semantic Segmentation With Adaptive Gaussian Mixtures | Linshan Wu · Zhun Zhong · Leyuan Fang · Xingxin He · Qiang Liu · Jiayi Ma · Hao Chen | N/A | Code |
| Visual Dependency Transformers: Dependency Tree Emerges From Reversed Attention | Mingyu Ding · Yikang Shen · Lijie Fan · Zhenfang Chen · Zitian Chen · Ping Luo · Joshua B. Tenenbaum · Chuang Gan | N/A | Code |
| Frame Flexible Network | Yitian Zhang · Yue Bai · Chang Liu · Huan Wang · Sheng Li · Yun Fu | N/A | Code |
| Uni-Perceiver v2: A Generalist Model for Large-Scale Vision and Vision-Language Tasks | Hao Li · Jinguo Zhu · Xiaohu Jiang · Xizhou Zhu · Hongsheng Li · Chun Yuan · Xiaohua Wang · Yu Qiao · Xiaogang Wang · Wenhai Wang · Jifeng Dai | N/A | Code |
| DCFace: Synthetic Face Generation With Dual Condition Diffusion Model | Minchul Kim · Feng Liu · Anil Jain · Xiaoming Liu | N/A | Code |
| Referring Image Matting | Jizhizi Li · Jing Zhang · Dacheng Tao | N/A | Code |
| Fast Monocular Scene Reconstruction With Global-Sparse Local-Dense Grids | Wei Dong · Christopher Choy · Charles Loop · Or Litany · Yuke Zhu · Anima Anandkumar | N/A | Code |
| DPE: Disentanglement of Pose and Expression for General Video Portrait Editing | Youxin Pang · Yong Zhang · Weize Quan · Yanbo Fan · Xiaodong Cun · Ying Shan · Dong-Ming Yan | N/A | Code |
| IDGI: A Framework To Eliminate Explanation Noise From Integrated Gradients | Ruo Yang · Binghui Wang · Mustafa Bilgic | N/A | Code |
| DynamicDet: A Unified Dynamic Architecture for Object Detection | Zhihao Lin · Yongtao Wang · Jinhe Zhang · Xiaojie Chu | N/A | Code |
| Task-Specific Fine-Tuning via Variational Information Bottleneck for Weakly-Supervised Pathology Whole Slide Image Classification | Honglin Li · Chenglu Zhu · Yunlong Zhang · Yuxuan Sun · Zhongyi Shui · Wenwei Kuang · Sunyi Zheng · Lin Yang | N/A | Code |
| VNE: An Effective Method for Improving Deep Representation by Manipulating Eigenvalue Distribution | Jaeill Kim · Suhyun Kang · Duhun Hwang · Jungwook Shin · Wonjong Rhee | N/A | Code |
| Semi-Weakly Supervised Object Kinematic Motion Prediction | Gengxin Liu · Qian Sun · Haibin Huang · Chongyang Ma · Yulan Guo · Li Yi · Hui Huang · Ruizhen Hu | N/A | Code |
| Computational Flash Photography Through Intrinsics | Sepideh Sarajian Maralan · Chris Careaga · Yağız Aksoy | N/A | Code |
| Inversion-Based Style Transfer With Diffusion Models | Yuxin Zhang · Nisha Huang · Fan Tang · Haibin Huang · Chongyang Ma · Weiming Dong · Changsheng Xu | N/A | Code |
| Data-Driven Feature Tracking for Event Cameras | Nico Messikommer · Carter Fang · Mathias Gehrig · Davide Scaramuzza | N/A | Code |
| QPGesture: Quantization-Based and Phase-Guided Motion Matching for Natural Speech-Driven Gesture Generation | Sicheng Yang · Zhiyong Wu · Minglei Li · Zhensong Zhang · Lei Hao · Weihong Bao · Haolin Zhuang | N/A | Code |
| Neural Fourier Filter Bank | Zhijie Wu · Yuhe Jin · Kwang Moo Yi | N/A | Code |
| Solving Oscillation Problem in Post-Training Quantization Through a Theoretical Perspective | Yuexiao Ma · Huixia Li · Xiawu Zheng · Xuefeng Xiao · Rui Wang · Shilei Wen · Xin Pan · Fei Chao · Rongrong Ji | N/A | Code |
| Full or Weak Annotations? An Adaptive Strategy for Budget-Constrained Annotation Campaigns | Javier Gamazo Tejero · Martin S. Zinkernagel · Sebastian Wolf · Raphael Sznitman · Pablo Márquez-Neila | N/A | Code |
| Trap Attention: Monocular Depth Estimation With Manual Traps | Chao Ning · Hongping Gan | N/A | Code |
| Physical-World Optical Adversarial Attacks on 3D Face Recognition | Yanjie Li · Yiquan Li · Xuelong Dai · Songtao Guo · Bin Xiao | N/A | Code |
| Re-Thinking Federated Active Learning Based on Inter-Class Diversity | SangMook Kim · Sangmin Bae · Hwanjun Song · Se-Young Yun | N/A | Code |
| EMT-NAS: Transferring Architectural Knowledge Between Tasks From Different Datasets | Peng Liao · Yaochu Jin · Wenli Du | N/A | Code |
| Temporal Consistent 3D LiDAR Representation Learning for Semantic Perception in Autonomous Driving | Lucas Nunes · Louis Wiesmann · Rodrigo Marcuzzi · Xieyuanli Chen · Jens Behley · Cyrill Stachniss | N/A | Code |
| Document Image Shadow Removal Guided by Color-Aware Background | Ling Zhang · Yinghao He · Qing Zhang · Zheng Liu · Xiaolong Zhang · Chunxia Xiao | N/A | Code |
| Pose-Disentangled Contrastive Learning for Self-Supervised Facial Representation | Yuanyuan Liu · Wenbin Wang · Yibing Zhan · Shaoze Feng · Kejun Liu · Zhe Chen | N/A | Code |
| Ham2Pose: Animating Sign Language Notation Into Pose Sequences | Rotem Shalev Arkushin · Amit Moryossef · Ohad Fried | N/A | Code |
| Resource-Efficient RGBD Aerial Tracking | Jinyu Yang · Shang Gao · Zhe Li · Feng Zheng · Aleš Leonardis | N/A | Code |
| Neural Transformation Fields for Arbitrary-Styled Font Generation | Bin Fu · Junjun He · Jianjun Wang · Yu Qiao | N/A | Code |
| Density-Insensitive Unsupervised Domain Adaption on 3D Object Detection | Qianjiang Hu · Daizong Liu · Wei Hu | N/A | Code |
| PAniC-3D: Stylized Single-View 3D Reconstruction From Portraits of Anime Characters | Shuhong Chen · Kevin Zhang · Yichun Shi · Heng Wang · Yiheng Zhu · Guoxian Song · Sizhe An · Janus Kristjansson · Xiao Yang · Matthias Zwicker | N/A | Code |
| HS-Pose: Hybrid Scope Feature Extraction for Category-Level Object Pose Estimation | Linfang Zheng · Chen Wang · Yinghan Sun · Esha Dasgupta · Hua Chen · Aleš Leonardis · Wei Zhang · Hyung Jin Chang | N/A | Code |
| A Hierarchical Representation Network for Accurate and Detailed Face Reconstruction From In-the-Wild Images | Biwen Lei · Jianqiang Ren · Mengyang Feng · Miaomiao Cui · Xuansong Xie | N/A | Code |
| Language in a Bottle: Language Model Guided Concept Bottlenecks for Interpretable Image Classification | Yue Yang · Artemis Panagopoulou · Shenghao Zhou · Daniel Jin · Chris Callison-Burch · Mark Yatskar | N/A | Code |
| SfM-TTR: Using Structure From Motion for Test-Time Refinement of Single-View Depth Networks | Sergio Izquierdo · Javier Civera | N/A | Code |
| TINC: Tree-Structured Implicit Neural Compression | Runzhao Yang | N/A | Code |
| Cross-Domain Image Captioning With Discriminative Finetuning | Roberto Dessì · Michele Bevilacqua · Eleonora Gualdoni · Nathanaël Carraz Rakotonirina · Francesca Franzon · Marco Baroni | N/A | Code |
| Learning To Detect Mirrors From Videos via Dual Correspondences | Jiaying Lin · Xin Tan · Rynson W.H. Lau | N/A | Code |
| Conflict-Based Cross-View Consistency for Semi-Supervised Semantic Segmentation | Zicheng Wang · Zhen Zhao · Xiaoxia Xing · Dong Xu · Xiangyu Kong · Luping Zhou | N/A | Code |
| Robust Unsupervised StyleGAN Image Restoration | Yohan Poirier-Ginter · Jean-François Lalonde | N/A | Code |
| Masked Video Distillation: Rethinking Masked Feature Modeling for Self-Supervised Video Representation Learning | Rui Wang · Dongdong Chen · Zuxuan Wu · Yinpeng Chen · Xiyang Dai · Mengchen Liu · Lu Yuan · Yu-Gang Jiang | N/A | Code |
| Neural Fields Meet Explicit Geometric Representations for Inverse Rendering of Urban Scenes | Zian Wang · Tianchang Shen · Jun Gao · Shengyu Huang · Jacob Munkberg · Jon Hasselgren · Zan Gojcic · Wenzheng Chen · Sanja Fidler | N/A | Code |
| Augmentation Matters: A Simple-Yet-Effective Approach to Semi-Supervised Semantic Segmentation | Zhen Zhao · Lihe Yang · Sifan Long · Jimin Pi · Luping Zhou · Jingdong Wang | N/A | Code |
| Policy Adaptation From Foundation Model Feedback | Yuying Ge · Annabella Macaluso · Li Erran Li · Ping Luo · Xiaolong Wang | N/A | Code |
| Person Image Synthesis via Denoising Diffusion Model | Ankan Kumar Bhunia · Salman Khan · Hisham Cholakkal · Rao Muhammad Anwer · Jorma Laaksonen · Mubarak Shah · Fahad Shahbaz Khan | N/A | Code |
| Bidirectional Cross-Modal Knowledge Exploration for Video Recognition With Pre-Trained Vision-Language Models | Wenhao Wu · Xiaohan Wang · Haipeng Luo · Jingdong Wang · Yi Yang · Wanli Ouyang | N/A | Code |
| CiaoSR: Continuous Implicit Attention-in-Attention Network for Arbitrary-Scale Image Super-Resolution | Jiezhang Cao · Qin Wang · Yongqin Xian · Yawei Li · Bingbing Ni · Zhiming Pi · Kai Zhang · Yulun Zhang · Radu Timofte · Luc Van Gool | N/A | Code |
| Black-Box Sparse Adversarial Attack via Multi-Objective Optimisation | Phoenix Neale Williams · Ke Li | N/A | Code |
| AdaptiveMix: Improving GAN Training via Feature Space Shrinkage | Haozhe Liu · Wentian Zhang · Bing Li · Haoqian Wu · Nanjun He · Yawen Huang · Yuexiang Li · Bernard Ghanem · Yefeng Zheng | N/A | Code |
| ViTs for SITS: Vision Transformers for Satellite Image Time Series | Michail Tarasiou · Erik Chavez · Stefanos Zafeiriou | N/A | Code |
| Latency Matters: Real-Time Action Forecasting Transformer | Harshayu Girase · Nakul Agarwal · Chiho Choi · Karttikeya Mangalam | N/A | Code |
| Multi-Sensor Large-Scale Dataset for Multi-View 3D Reconstruction | Oleg Voynov · Gleb Bobrovskikh · Pavel Karpyshev · Saveliy Galochkin · Andrei-Timotei Ardelean · Arseniy Bozhenko · Ekaterina Karmanova · Pavel Kopanev · Yaroslav Labutin-Rymsho · Ruslan Rakhimov · Aleksandr Safin · Valerii Serpiva · Alexey Artemov · Evgeny Burnaev · Dzmitry Tsetserukou · Denis Zorin | N/A | Code |
| Learning From Noisy Labels With Decoupled Meta Label Purifier | Yuanpeng Tu · Boshen Zhang · Yuxi Li · Liang Liu · Jian Li · Yabiao Wang · Chengjie Wang · Cai Rong Zhao | N/A | Code |
| Flow Supervision for Deformable NeRF | Chaoyang Wang · Lachlan Ewen MacDonald · László A. Jeni · Simon Lucey | N/A | Code |
| Unifying Vision, Text, and Layout for Universal Document Processing | Zineng Tang · Ziyi Yang · Guoxin Wang · Yuwei Fang · Yang Liu · Chenguang Zhu · Michael Zeng · Cha Zhang · Mohit Bansal | N/A | Code |
| BKinD-3D: Self-Supervised 3D Keypoint Discovery From Multi-View Videos | Jennifer J. Sun · Lili Karashchuk · Amil Dravid · Serim Ryou · Sonia Fereidooni · John C. Tuthill · Aggelos Katsaggelos · Bingni W. Brunton · Georgia Gkioxari · Ann Kennedy · Yisong Yue · Pietro Perona | N/A | Code |
| Architecture, Dataset and Model-Scale Agnostic Data-Free Meta-Learning | Zixuan Hu · Li Shen · Zhenyi Wang · Tongliang Liu · Chun Yuan · Dacheng Tao | N/A | Code |
| RWSC-Fusion: Region-Wise Style-Controlled Fusion Network for the Prohibited X-Ray Security Image Synthesis | Luwen Duan · Min Wu · Lijian Mao · Jun Yin · Jianping Xiong · Xi Li | N/A | Code |
| Meta Architecture for Point Cloud Analysis | Haojia Lin · Xiawu Zheng · Lijiang Li · Fei Chao · Shanshan Wang · Yan Wang · Yonghong Tian · Rongrong Ji | N/A | Code |
| DyLiN: Making Light Field Networks Dynamic | Heng Yu · Joel Julin · Zoltán Á. Milacski · Koichiro Niinuma · László A. Jeni | N/A | Code |
| OpenMix: Exploring Outlier Samples for Misclassification Detection | Fei Zhu · Zhen Cheng · Xu-Yao Zhang · Cheng-Lin Liu | N/A | Code |
| Adaptive Graph Convolutional Subspace Clustering | Lai Wei · Zhengwei Chen · Jun Yin · Changming Zhu · Rigui Zhou · Jin Liu | N/A | Code |
| Extracting Motion and Appearance via Inter-Frame Attention for Efficient Video Frame Interpolation | Guozhen Zhang · Yuhan Zhu · Haonan Wang · Youxin Chen · Gangshan Wu · Limin Wang | N/A | Code |
| Hybrid Active Learning via Deep Clustering for Video Action Detection | Aayush J. Rana · Yogesh S. Rawat | N/A | Code |
| Equiangular Basis Vectors | Yang Shen · Xuhao Sun · Xiu-Shen Wei | N/A | Code |
| CAT: LoCalization and IdentificAtion Cascade Detection Transformer for Open-World Object Detection | Shuailei Ma · Yuefeng Wang · Ying Wei · Jiaqi Fan · Thomas H. Li · Hongli Liu · Fanbing Lv | N/A | Code |
| An Actor-Centric Causality Graph for Asynchronous Temporal Inference in Group Activity | Zhao Xie · Tian Gao · Kewei Wu · Jiao Chang | N/A | Code |
| GCFAgg: Global and Cross-View Feature Aggregation for Multi-View Clustering | Weiqing Yan · Yuanyang Zhang · Chenlei Lv · Chang Tang · Guanghui Yue · Liang Liao · Weisi Lin | N/A | Code |
| Unsupervised Visible-Infrared Person Re-Identification via Progressive Graph Matching and Alternate Learning | Zesen Wu · Mang Ye | N/A | Code |
| Similarity Maps for Self-Training Weakly-Supervised Phrase Grounding | Tal Shaharabany · Lior Wolf | N/A | Code |
| DA Wand: Distortion-Aware Selection Using Neural Mesh Parameterization | Richard Liu · Noam Aigerman · Vladimir G. Kim · Rana Hanocka | N/A | Code |
| BiCro: Noisy Correspondence Rectification for Multi-Modality Data via Bi-Directional Cross-Modal Similarity Consistency | Shuo Yang · Zhaopan Xu · Kai Wang · Yang You · Hongxun Yao · Tongliang Liu · Min Xu | N/A | Code |
| DaFKD: Domain-Aware Federated Knowledge Distillation | Haozhao Wang · Yichen Li · Wenchao Xu · Ruixuan Li · Yufeng Zhan · Zhigang Zeng | N/A | Code |
| Single Image Depth Prediction Made Better: A Multivariate Gaussian Take | Ce Liu · Suryansh Kumar · Shuhang Gu · Radu Timofte · Luc Van Gool | N/A | Code |
| Align Your Latents: High-Resolution Video Synthesis With Latent Diffusion Models | Andreas Blattmann · Robin Rombach · Huan Ling · Tim Dockhorn · Seung Wook Kim · Sanja Fidler · Karsten Kreis | N/A | Code |
| GeoLayoutLM: Geometric Pre-Training for Visual Information Extraction | Chuwei Luo · Changxu Cheng · Qi Zheng · Cong Yao | N/A | Code |
| VideoMAE V2: Scaling Video Masked Autoencoders With Dual Masking | Limin Wang · Bingkun Huang · Zhiyu Zhao · Zhan Tong · Yinan He · Yi Wang · Yali Wang · Yu Qiao | N/A | Code |
| CVT-SLR: Contrastive Visual-Textual Transformation for Sign Language Recognition With Variational Alignment | Jiangbin Zheng · Yile Wang · Cheng Tan · Siyuan Li · Ge Wang · Jun Xia · Yidong Chen · Stan Z. Li | N/A | Code |
| All Are Worth Words: A ViT Backbone for Diffusion Models | Fan Bao · Shen Nie · Kaiwen Xue · Yue Cao · Chongxuan Li · Hang Su · Jun Zhu | N/A | Code |
| PanoSwin: A Pano-Style Swin Transformer for Panorama Understanding | Zhixin Ling · Zhen Xing · Xiangdong Zhou · Manliang Cao · Guichun Zhou | N/A | Code |
| sRGB Real Noise Synthesizing With Neighboring Correlation-Aware Noise Model | Zixuan Fu · Lanqing Guo · Bihan Wen | N/A | Code |
| Extracting Class Activation Maps From Non-Discriminative Features As Well | Zhaozheng Chen · Qianru Sun | N/A | Code |
| GKEAL: Gaussian Kernel Embedded Analytic Learning for Few-Shot Class Incremental Task | Huiping Zhuang · Zhenyu Weng · Run He · Zhiping Lin · Ziqian Zeng | N/A | Code |
| ERM-KTP: Knowledge-Level Machine Unlearning via Knowledge Transfer | Shen Lin · Xiaoyu Zhang · Chenyang Chen · Xiaofeng Chen · Willy Susilo | N/A | Code |
| PDPP: Projected Diffusion for Procedure Planning in Instructional Videos | Hanlin Wang · Yilu Wu · Sheng Guo · Limin Wang | N/A | Code |
| NeuDA: Neural Deformable Anchor for High-Fidelity Implicit Surface Reconstruction | Bowen Cai · Jinchi Huang · Rongfei Jia · Chengfei Lv · Huan Fu | N/A | Code |
| Deep Polarization Reconstruction With PDAVIS Events | Haiyang Mei · Zuowen Wang · Xin Yang · Xiaopeng Wei · Tobi Delbruck | N/A | Code |
| Beyond Attentive Tokens: Incorporating Token Importance and Diversity for Efficient Vision Transformers | Sifan Long · Zhen Zhao · Jimin Pi · Shengsheng Wang · Jingdong Wang | N/A | Code |
| PointClustering: Unsupervised Point Cloud Pre-Training Using Transformation Invariance in Clustering | Fuchen Long · Ting Yao · Zhaofan Qiu · Lusong Li · Tao Mei | N/A | Code |
| PCR: Proxy-Based Contrastive Replay for Online Class-Incremental Continual Learning | Huiwei Lin · Baoquan Zhang · Shanshan Feng · Xutao Li · Yunming Ye | N/A | Code |
| Boundary-Aware Backward-Compatible Representation via Adversarial Learning in Image Retrieval | Tan Pan · Furong Xu · Xudong Yang · Sifeng He · Chen Jiang · Qingpei Guo · Feng Qian · Xiaobo Zhang · Yuan Cheng · Lei Yang · Wei Chu | N/A | Code |
| PermutoSDF: Fast Multi-View Reconstruction With Implicit Surfaces Using Permutohedral Lattices | Radu Alexandru Rosu · Sven Behnke | N/A | Code |
| StyleGene: Crossover and Mutation of Region-Level Facial Genes for Kinship Face Synthesis | Hao Li · Xianxu Hou · Zepeng Huang · Linlin Shen | N/A | Code |
| MixNeRF: Modeling a Ray With Mixture Density for Novel View Synthesis From Sparse Inputs | Seunghyeon Seo · Donghoon Han · Yeonjin Chang · Nojun Kwak | N/A | Code |
| Upcycling Models Under Domain and Category Shift | Sanqing Qu · Tianpei Zou · Florian Röhrbein · Cewu Lu · Guang Chen · Dacheng Tao · Changjun Jiang | N/A | Code |
| Towards Unbiased Volume Rendering of Neural Implicit Surfaces With Geometry Priors | Yongqiang Zhang · Zhipeng Hu · Haoqian Wu · Minda Zhao · Lincheng Li · Zhengxia Zou · Changjie Fan | N/A | Code |
| Avatars Grow Legs: Generating Smooth Human Motion From Sparse Tracking Inputs With Diffusion Model | Yuming Du · Robin Kips · Albert Pumarola · Sebastian Starke · Ali Thabet · Artsiom Sanakoyeu | N/A | Code |
| MoStGAN-V: Video Generation With Temporal Motion Styles | Xiaoqian Shen · Xiang Li · Mohamed Elhoseiny | N/A | Code |
| On the Importance of Accurate Geometry Data for Dense 3D Vision Tasks | HyunJun Jung · Patrick Ruhkamp · Guangyao Zhai · Nikolas Brasch · Yitong Li · Yannick Verdie · Jifei Song · Yiren Zhou · Anil Armagan · Slobodan Ilic · Aleš Leonardis · Nassir Navab · Benjamin Busam | N/A | Code |
| DeGPR: Deep Guided Posterior Regularization for Multi-Class Cell Detection and Counting | Aayush Kumar Tyagi · Chirag Mohapatra · Prasenjit Das · Govind Makharia · Lalita Mehra · Prathosh AP · Mausam | N/A | Code |
| Learning Action Changes by Measuring Verb-Adverb Textual Relationships | Davide Moltisanti · Frank Keller · Hakan Bilen · Laura Sevilla-Lara | N/A | Code |
| Interactive and Explainable Region-Guided Radiology Report Generation | Tim Tanida · Philip Müller · Georgios Kaissis · Daniel Rueckert | N/A | Code |
| Learning Neural Volumetric Representations of Dynamic Humans in Minutes | Chen Geng · Sida Peng · Zhen Xu · Hujun Bao · Xiaowei Zhou | N/A | Code |
| Boosting Low-Data Instance Segmentation by Unsupervised Pre-Training With Saliency Prompt | Hao Li · Dingwen Zhang · Nian Liu · Lechao Cheng · Yalun Dai · Chao Zhang · Xinggang Wang · Junwei Han | N/A | Code |
| Learning Rotation-Equivariant Features for Visual Correspondence | Jongmin Lee · Byungjin Kim · Seungwook Kim · Minsu Cho | N/A | Code |
| Co-Training 2L Submodels for Visual Recognition | Hugo Touvron · Matthieu Cord · Maxime Oquab · Piotr Bojanowski · Jakob Verbeek · Hervé Jégou | N/A | Code |
| HOTNAS: Hierarchical Optimal Transport for Neural Architecture Search | Jiechao Yang · Yong Liu · Hongteng Xu | N/A | Code |
| LANA: A Language-Capable Navigator for Instruction Following and Generation | Xiaohan Wang · Wenguan Wang · Jiayi Shao · Yi Yang | N/A | Code |
| Visual Localization Using Imperfect 3D Models From the Internet | Vojtech Panek · Zuzana Kukelova · Torsten Sattler | N/A | Code |
| Diversity-Measurable Anomaly Detection | Wenrui Liu · Hong Chang · Bingpeng Ma · Shiguang Shan · Xilin Chen | N/A | Code |
| SLACK: Stable Learning of Augmentations With Cold-Start and KL Regularization | Juliette Marrie · Michael Arbel · Diane Larlus · Julien Mairal | N/A | Code |
| Recurrent Vision Transformers for Object Detection With Event Cameras | Mathias Gehrig · Davide Scaramuzza | N/A | Code |
| Efficient Verification of Neural Networks Against LVM-Based Specifications | Harleen Hanspal · Alessio Lomuscio | N/A | Code |
| Neuralizer: General Neuroimage Analysis Without Re-Training | Steffen Czolbe · Adrian V. Dalca | N/A | Code |
| MobileVOS: Real-Time Video Object Segmentation Contrastive Learning Meets Knowledge Distillation | Roy Miles · Mehmet Kerim Yucel · Bruno Manganelli · Albert Saà-Garriga | N/A | Code |
| SCOTCH and SODA: A Transformer Video Shadow Detection Framework | Lihao Liu · Jean Prost · Lei Zhu · Nicolas Papadakis · Pietro Liò · Carola-Bibiane Schönlieb · Angelica I. Aviles-Rivero | N/A | Code |
| A Unified Spatial-Angular Structured Light for Single-View Acquisition of Shape and Reflectance | Xianmin Xu · Yuxin Lin · Haoyang Zhou · Chong Zeng · Yaxin Yu · Kun Zhou · Hongzhi Wu | N/A | Code |
| Bias in Pruned Vision Models: In-Depth Analysis and Countermeasures | Eugenia Iofinova · Alexandra Peste · Dan Alistarh | N/A | Code |
| InstructPix2Pix: Learning To Follow Image Editing Instructions | Tim Brooks · Aleksander Holynski · Alexei A. Efros | N/A | Code |
| AnchorFormer: Point Cloud Completion From Discriminative Nodes | Zhikai Chen · Fuchen Long · Zhaofan Qiu · Ting Yao · Wengang Zhou · Jiebo Luo · Tao Mei | N/A | Code |
| Robust Test-Time Adaptation in Dynamic Scenarios | Longhui Yuan · Binhui Xie · Shuang Li | N/A | Code |
| AShapeFormer: Semantics-Guided Object-Level Active Shape Encoding for 3D Object Detection via Transformers | Zechuan Li · Hongshan Yu · Zhengeng Yang · Tongjia Chen · Naveed Akhtar | N/A | Code |
| Neural Texture Synthesis With Guided Correspondence | Yang Zhou · Kaijian Chen · Rongjun Xiao · Hui Huang | N/A | Code |
| Learning To Render Novel Views From Wide-Baseline Stereo Pairs | Yilun Du · Cameron Smith · Ayush Tewari · Vincent Sitzmann | N/A | Code |
| Hidden Gems: 4D Radar Scene Flow Learning Using Cross-Modal Supervision | Fangqiang Ding · Andras Palffy · Dariu M. Gavrila · Chris Xiaoxuan Lu | N/A | Code |
| SmallCap: Lightweight Image Captioning Prompted With Retrieval Augmentation | Rita Ramos · Bruno Martins · Desmond Elliott · Yova Kementchedjhieva | N/A | Code |
| PIDNet: A Real-Time Semantic Segmentation Network Inspired by PID Controllers | Jiacong Xu · Zixiang Xiong · Shankar P. Bhattacharyya | N/A | Code |
| NeRFLight: Fast and Light Neural Radiance Fields Using a Shared Feature Grid | Fernando Rivas-Manzaneque · Jorge Sierra-Acosta · Adrian Penate-Sanchez · Francesc Moreno-Noguer · Angela Ribeiro | N/A | Code |
| Fantastic Breaks: A Dataset of Paired 3D Scans of Real-World Broken Objects and Their Complete Counterparts | Nikolas Lamb · Cameron Palmer · Benjamin Molloy · Sean Banerjee · Natasha Kholgade Banerjee | N/A | Code |
| PEAL: Prior-Embedded Explicit Attention Learning for Low-Overlap Point Cloud Registration | Junle Yu · Luwei Ren · Yu Zhang · Wenhui Zhou · Lili Lin · Guojun Dai | N/A | Code |
| Neural Volumetric Memory for Visual Locomotion Control | Ruihan Yang · Ge Yang · Xiaolong Wang | N/A | Code |
| InstantAvatar: Learning Avatars From Monocular Video in 60 Seconds | Tianjian Jiang · Xu Chen · Jie Song · Otmar Hilliges | N/A | Code |
| TMO: Textured Mesh Acquisition of Objects With a Mobile Device by Using Differentiable Rendering | Jaehoon Choi · Dongki Jung · Taejae Lee · Sangwook Kim · Youngdong Jung · Dinesh Manocha · Donghwan Lee | N/A | Code |
| MammalNet: A Large-Scale Video Benchmark for Mammal Recognition and Behavior Understanding | Jun Chen · Ming Hu · Darren J. Coker · Michael L. Berumen · Blair Costelloe · Sara Beery · Anna Rohrbach · Mohamed Elhoseiny | N/A | Code |
| Towards Fast Adaptation of Pretrained Contrastive Models for Multi-Channel Video-Language Retrieval | Xudong Lin · Simran Tiwari · Shiyuan Huang · Manling Li · Mike Zheng Shou · Heng Ji · Shih-Fu Chang | N/A | Code |
| Hierarchical Fine-Grained Image Forgery Detection and Localization | Xiao Guo · Xiaohong Liu · Zhiyuan Ren · Steven Grosz · Iacopo Masi · Xiaoming Liu | N/A | Code |
| SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision | Xubo Liu · Egor Lakomkin · Konstantinos Vougioukas · Pingchuan Ma · Honglie Chen · Ruiming Xie · Morrie Doulaty · Niko Moritz · Jachym Kolar · Stavros Petridis · Maja Pantic · Christian Fuegen | N/A | Code |
| RIATIG: Reliable and Imperceptible Adversarial Text-to-Image Generation With Natural Prompts | Han Liu · Yuhao Wu · Shixuan Zhai · Bo Yuan · Ning Zhang | N/A | Code |
| Unsupervised Intrinsic Image Decomposition With LiDAR Intensity | Shogo Sato · Yasuhiro Yao · Taiga Yoshida · Takuhiro Kaneko · Shingo Ando · Jun Shimamura | N/A | Code |
| SlowLiDAR: Increasing the Latency of LiDAR-Based Detection Using Adversarial Examples | Han Liu · Yuhao Wu · Zhiyuan Yu · Yevgeniy Vorobeychik · Ning Zhang | N/A | Code |
| NeFII: Inverse Rendering for Reflectance Decomposition With Near-Field Indirect Illumination | Haoqian Wu · Zhipeng Hu · Lincheng Li · Yongqiang Zhang · Changjie Fan · Xin Yu | N/A | Code |
| BEV-Guided Multi-Modality Fusion for Driving Perception | Yunze Man · Liang-Yan Gui · Yu-Xiong Wang | N/A | Code |
| MAGVLT: Masked Generative Vision-and-Language Transformer | Sungwoong Kim · Daejin Jo · Donghoon Lee · Jongmin Kim | N/A | Code |
| PEFAT: Boosting Semi-Supervised Medical Image Classification via Pseudo-Loss Estimation and Feature Adversarial Training | Qingjie Zeng · Yutong Xie · Zilin Lu · Yong Xia | N/A | Code |
| Visual Query Tuning: Towards Effective Usage of Intermediate Representations for Parameter and Memory Efficient Transfer Learning | Cheng-Hao Tu · Zheda Mai · Wei-Lun Chao | N/A | Code |
| Decoupling Learning and Remembering: A Bilevel Memory Framework With Knowledge Projection for Task-Incremental Learning | Wenju Sun · Qingyong Li · Jing Zhang · Wen Wang · Yangli-ao Geng | N/A | Code |
| PMR: Prototypical Modal Rebalance for Multimodal Learning | Yunfeng Fan · Wenchao Xu · Haozhao Wang · Junxiao Wang · Song Guo | N/A | Code |
| DART: Diversify-Aggregate-Repeat Training Improves Generalization of Neural Networks | Samyak Jain · Sravanti Addepalli · Pawan Kumar Sahu · Priyam Dey · R. Venkatesh Babu | N/A | Code |
| Abstract Visual Reasoning: An Algebraic Approach for Solving Raven’s Progressive Matrices | Jingyi Xu · Tushar Vaidya · Yufei Wu · Saket Chandra · Zhangsheng Lai · Kai Fong Ernest Chong | N/A | Code |
| Swept-Angle Synthetic Wavelength Interferometry | Alankar Kotwal · Anat Levin · Ioannis Gkioulekas | N/A | Code |
| Passive Micron-Scale Time-of-Flight With Sunlight Interferometry | Alankar Kotwal · Anat Levin · Ioannis Gkioulekas | N/A | Code |
| Meta-Learning With a Geometry-Adaptive Preconditioner | Suhyun Kang · Duhun Hwang · Moonjung Eo · Taesup Kim · Wonjong Rhee | N/A | Code |
| 3D GAN Inversion With Facial Symmetry Prior | Fei Yin · Yong Zhang · Xuan Wang · Tengfei Wang · Xiaoyu Li · Yuan Gong · Yanbo Fan · Xiaodong Cun · Ying Shan · Cengiz Oztireli · Yujiu Yang | N/A | Code |
| ZBS: Zero-Shot Background Subtraction via Instance-Level Background Modeling and Foreground Selection | Yongqi An · Xu Zhao · Tao Yu · Haiyun Guo · Chaoyang Zhao · Ming Tang · Jinqiao Wang | N/A | Code |
| Neural Lens Modeling | Wenqi Xian · Aljaž Božič · Noah Snavely · Christoph Lassner | N/A | Code |
| A Probabilistic Framework for Lifelong Test-Time Adaptation | Dhanajit Brahma · Piyush Rai | N/A | Code |
| Few-Shot Class-Incremental Learning via Class-Aware Bilateral Distillation | Linglan Zhao · Jing Lu · Yunlu Xu · Zhanzhan Cheng · Dashan Guo · Yi Niu · Xiangzhong Fang | N/A | Code |
| GradMA: A Gradient-Memory-Based Accelerated Federated Learning With Alleviated Catastrophic Forgetting | Kangyang Luo · Xiang Li · Yunshi Lan · Ming Gao | N/A | Code |
| Hyperspherical Embedding for Point Cloud Completion | Junming Zhang · Haomeng Zhang · Ram Vasudevan · Matthew Johnson-Roberson | N/A | Code |
| Local-Guided Global: Paired Similarity Representation for Visual Reinforcement Learning | Hyesong Choi · Hunsang Lee · Wonil Song · Sangryul Jeon · Kwanghoon Sohn · Dongbo Min | N/A | Code |
| Learning Orthogonal Prototypes for Generalized Few-Shot Semantic Segmentation | Sun-Ao Liu · Yiheng Zhang · Zhaofan Qiu · Hongtao Xie · Yongdong Zhang · Ting Yao | N/A | Code |
| DSFNet: Dual Space Fusion Network for Occlusion-Robust 3D Dense Face Alignment | Heyuan Li · Bo Wang · Yu Cheng · Mohan Kankanhalli · Robby T. Tan | N/A | Code |
| SViTT: Temporal Learning of Sparse Video-Text Transformers | Yi Li · Kyle Min · Subarna Tripathi · Nuno Vasconcelos | N/A | Code |
| Independent Component Alignment for Multi-Task Learning | Dmitry Senushkin · Nikolay Patakin · Arseny Kuznetsov · Anton Konushin | N/A | Code |
| Logical Implications for Visual Question Answering Consistency | Sergio Tascon-Morales · Pablo Márquez-Neila · Raphael Sznitman | N/A | Code |
| MaskCon: Masked Contrastive Learning for Coarse-Labelled Dataset | Chen Feng · Ioannis Patras | N/A | Code |
| Image as a Foreign Language: BEiT Pretraining for Vision and Vision-Language Tasks | Wenhui Wang · Hangbo Bao · Li Dong · Johan Bjorck · Zhiliang Peng · Qiang Liu · Kriti Aggarwal · Owais Khan Mohammed · Saksham Singhal · Subhojit Som · Furu Wei | N/A | Code |
| Manipulating Transfer Learning for Property Inference | Yulong Tian · Fnu Suya · Anshuman Suri · Fengyuan Xu · David Evans | N/A | Code |
| DualRefine: Self-Supervised Depth and Pose Estimation Through Iterative Epipolar Sampling and Refinement Toward Equilibrium | Antyanta Bangunharcana · Ahmed Magd · Kyung-Soo Kim | N/A | Code |
| Learning a 3D Morphable Face Reflectance Model From Low-Cost Data | Yuxuan Han · Zhibo Wang · Feng Xu | N/A | Code |
| Principles of Forgetting in Domain-Incremental Semantic Segmentation in Adverse Weather Conditions | Tobias Kalb · Jürgen Beyerer | N/A | Code |
| Diffusion Art or Digital Forgery? Investigating Data Replication in Diffusion Models | Gowthami Somepalli · Vasu Singla · Micah Goldblum · Jonas Geiping · Tom Goldstein | N/A | Code |
| Adaptive Data-Free Quantization | Biao Qian · Yang Wang · Richang Hong · Meng Wang | N/A | Code |
| Coreset Sampling From Open-Set for Fine-Grained Self-Supervised Learning | Sungnyun Kim · Sangmin Bae · Se-Young Yun | N/A | Code |
| Jedi: Entropy-Based Localization and Removal of Adversarial Patches | Bilel Tarchoun · Anouar Ben Khalifa · Mohamed Ali Mahjoub · Nael Abu-Ghazaleh · Ihsen Alouani | N/A | Code |
| Uncovering the Disentanglement Capability in Text-to-Image Diffusion Models | Qiucheng Wu · Yujian Liu · Handong Zhao · Ajinkya Kale · Trung Bui · Tong Yu · Zhe Lin · Yang Zhang · Shiyu Chang | N/A | Code |
| Semantic-Conditional Diffusion Networks for Image Captioning | Jianjie Luo · Yehao Li · Yingwei Pan · Ting Yao · Jianlin Feng · Hongyang Chao · Tao Mei | N/A | Code |
| Instance-Specific and Model-Adaptive Supervision for Semi-Supervised Semantic Segmentation | Zhen Zhao · Sifan Long · Jimin Pi · Jingdong Wang · Luping Zhou | N/A | Code |
| Improving Robustness of Semantic Segmentation to Motion-Blur Using Class-Centric Augmentation | Aakanksha Aakanksha · A. N. Rajagopalan | N/A | Code |
| MetaViewer: Towards a Unified Multi-View Representation | Ren Wang · Haoliang Sun · Yuling Ma · Xiaoming Xi · Yilong Yin | N/A | Code |
| Attribute-Preserving Face Dataset Anonymization via Latent Code Optimization | Simone Barattin · Christos Tzelepis · Ioannis Patras · Nicu Sebe | N/A | Code |
| A Light Weight Model for Active Speaker Detection | Junhua Liao · Haihan Duan · Kanghui Feng · Wanbing Zhao · Yanbing Yang · Liangyin Chen | N/A | Code |
| Shifted Diffusion for Text-to-Image Generation | Yufan Zhou · Bingchen Liu · Yizhe Zhu · Xiao Yang · Changyou Chen · Jinhui Xu | N/A | Code |
| Modular Memorability: Tiered Representations for Video Memorability Prediction | Théo Dumont · Juan Segundo Hevia · Camilo L. Fosco | N/A | Code |
| Learning Articulated Shape With Keypoint Pseudo-Labels From Web Images | Anastasis Stathopoulos · Georgios Pavlakos · Ligong Han · Dimitris N. Metaxas | N/A | Code |
| RMLVQA: A Margin Loss Approach for Visual Question Answering With Language Biases | Abhipsa Basu · Sravanti Addepalli · R. Venkatesh Babu | N/A | Code |
| RealImpact: A Dataset of Impact Sound Fields for Real Objects | Samuel Clarke · Ruohan Gao · Mason Wang · Mark Rau · Julia Xu · Jui-Hsien Wang · Doug L. James · Jiajun Wu | N/A | Code |
| Neural Rate Estimator and Unsupervised Learning for Efficient Distributed Image Analytics in Split-DNN Models | Nilesh Ahuja · Parual Datta · Bhavya Kanzariya · V. Srinivasa Somayazulu · Omesh Tickoo | N/A | Code |
| Improving Vision-and-Language Navigation by Generating Future-View Image Semantics | Jialu Li · Mohit Bansal | N/A | Code |
| Simulated Annealing in Early Layers Leads to Better Generalization | Amir M. Sarfi · Zahra Karimpour · Muawiz Chaudhary · Nasir M. Khalid · Mirco Ravanelli · Sudhir Mudur · Eugene Belilovsky | N/A | Code |
| From Images to Textual Prompts: Zero-Shot Visual Question Answering With Frozen Large Language Models | Jiaxian Guo · Junnan Li · Dongxu Li · Anthony Meng Huat Tiong · Boyang Li · Dacheng Tao · Steven Hoi | N/A | Code |
| Where We Are and What We’re Looking At: Query Based Worldwide Image Geo-Localization Using Hierarchies and Scenes | Brandon Clark · Alec Kerrigan · Parth Parag Kulkarni · Vicente Vivanco Cepeda · Mubarak Shah | N/A | Code |
| CLIP-Sculptor: Zero-Shot Generation of High-Fidelity and Diverse Shapes From Natural Language | Aditya Sanghi · Rao Fu · Vivian Liu · Karl D.D. Willis · Hooman Shayani · Amir H. Khasahmadi · Srinath Sridhar · Daniel Ritchie | N/A | Code |
| Learning To Generate Text-Grounded Mask for Open-World Semantic Segmentation From Only Image-Text Pairs | Junbum Cha · Jonghwan Mun · Byungseok Roh | N/A | Code |
| Imitation Learning As State Matching via Differentiable Physics | Siwei Chen · Xiao Ma · Zhongwen Xu | N/A | Code |
| BBDM: Image-to-Image Translation With Brownian Bridge Diffusion Models | Bo Li · Kaitao Xue · Bin Liu · Yu-Kun Lai | N/A | Code |
| CHMATCH: Contrastive Hierarchical Matching and Robust Adaptive Threshold Boosted Semi-Supervised Learning | Jianlong Wu · Haozhe Yang · Tian Gan · Ning Ding · Feijun Jiang · Liqiang Nie | N/A | Code |
| Re-GAN: Data-Efficient GANs Training via Architectural Reconfiguration | Divya Saxena · Jiannong Cao · Jiahao Xu · Tarun Kulshrestha | N/A | Code |
| Learning Debiased Representations via Conditional Attribute Interpolation | Yi-Kai Zhang · Qi-Wei Wang · De-Chuan Zhan · Han-Jia Ye | N/A | Code |
| Weakly Supervised Posture Mining for Fine-Grained Classification | Zhenchao Tang · Hualin Yang · Calvin Yu-Chian Chen | N/A | Code |
| Learning a Practical SDR-to-HDRTV Up-Conversion Using New Dataset and Degradation Models | Cheng Guo · Leidong Fan · Ziyu Xue · Xiuhua Jiang | N/A | Code |
| VectorFusion: Text-to-SVG by Abstracting Pixel-Based Diffusion Models | Ajay Jain · Amber Xie · Pieter Abbeel | N/A | Code |
| Adversarial Robustness via Random Projection Filters | Minjing Dong · Chang Xu | N/A | Code |
CVPR 2021
| Title | Author | PDF_Link | Code_URL |
|---|---|---|---|
| SDFusion: Multimodal 3D Shape Completion, Reconstruction, and Generation | Yen-Chi Cheng · Hsin-Ying Lee · Sergey Tulyakov · Alexander G. Schwing · Liang-Yan Gui | N/A | Code |
| Revisiting Temporal Modeling for CLIP-Based Image-to-Video Knowledge Transferring | Ruyang Liu · Jingjia Huang · Ge Li · Jiashi Feng · Xinglong Wu · Thomas H. Li | N/A | Code |
| Post-Processing Temporal Action Detection | Sauradip Nag · Xiatian Zhu · Yi-Zhe Song · Tao Xiang | N/A | Code |
| Learning Analytical Posterior Probability for Human Mesh Recovery | Qi Fang · Kang Chen · Yinghui Fan · Qing Shuai · Jiefeng Li · Weidong Zhang | N/A | Code |
| Accidental Light Probes | Hong-Xing Yu · Samir Agarwala · Charles Herrmann · Richard Szeliski · Noah Snavely · Jiajun Wu · Deqing Sun | N/A | Code |
| Multi-Object Manipulation via Object-Centric Neural Scattering Functions | Stephen Tian · Yancheng Cai · Hong-Xing Yu · Sergey Zakharov · Katherine Liu · Adrien Gaidon · Yunzhu Li · Jiajun Wu | N/A | Code |
| CFA: Class-Wise Calibrated Fair Adversarial Training | Zeming Wei · Yifei Wang · Yiwen Guo · Yisen Wang | N/A | Code |
| AutoAD: Movie Description in Context | Tengda Han · Max Bain · Arsha Nagrani · Gül Varol · Weidi Xie · Andrew Zisserman | N/A | Code |
| Relational Context Learning for Human-Object Interaction Detection | Sanghyun Kim · Deunsol Jung · Minsu Cho | N/A | Code |
| Alias-Free Convnets: Fractional Shift Invariance via Polynomial Activations | Hagay Michaeli · Tomer Michaeli · Daniel Soudry | N/A | Code |
| Learning Distortion Invariant Representation for Image Restoration From a Causality Perspective | Xin Li · Bingchen Li · Xin Jin · Cuiling Lan · Zhibo Chen | N/A | Code |
| Iterative Vision-and-Language Navigation | Jacob Krantz · Shurjo Banerjee · Wang Zhu · Jason Corso · Peter Anderson · Stefan Lee · Jesse Thomason | N/A | Code |
| FlatFormer: Flattened Window Attention for Efficient Point Cloud Transformer | Zhijian Liu · Xinyu Yang · Haotian Tang · Shang Yang · Song Han | N/A | Code |
| BUFFER: Balancing Accuracy, Efficiency, and Generalizability in Point Cloud Registration | Sheng Ao · Qingyong Hu · Hanyun Wang · Kai Xu · Yulan Guo | N/A | Code |
| Learning Event Guided High Dynamic Range Video Reconstruction | Yixin Yang · Jin Han · Jinxiu Liang · Imari Sato · Boxin Shi | N/A | Code |
| 3D Line Mapping Revisited | Shaohui Liu · Yifan Yu · Rémi Pautrat · Marc Pollefeys · Viktor Larsson | N/A | Code |
| High-Fidelity Event-Radiance Recovery via Transient Event Frequency | Jin Han · Yuta Asano · Boxin Shi · Yinqiang Zheng · Imari Sato | N/A | Code |
| OCELOT: Overlapped Cell on Tissue Dataset for Histopathology | Jeongun Ryu · Aaron Valero Puche · JaeWoong Shin · Seonwook Park · Biagio Brattoli · Jinhee Lee · Wonkyung Jung · Soo Ick Cho · Kyunghyun Paeng · Chan-Young Ock · Donggeun Yoo · Sérgio Pereira | N/A | Code |
| Blur Interpolation Transformer for Real-World Motion From Blur | Zhihang Zhong · Mingdeng Cao · Xiang Ji · Yinqiang Zheng · Imari Sato | N/A | Code |
| Continuous Intermediate Token Learning With Implicit Motion Manifold for Keyframe Based Motion Interpolation | Clinton A. Mo · Kun Hu · Chengjiang Long · Zhiyong Wang | N/A | Code |
| Instant-NVR: Instant Neural Volumetric Rendering for Human-Object Interactions From Monocular RGBD Stream | Yuheng Jiang · Kaixin Yao · Zhuo Su · Zhehao Shen · Haimin Luo · Lan Xu | N/A | Code |
| HexPlane: A Fast Representation for Dynamic Scenes | Ang Cao · Justin Johnson | N/A | Code |
| Finetune Like You Pretrain: Improved Finetuning of Zero-Shot Vision Models | Sachin Goyal · Ananya Kumar · Sankalp Garg · Zico Kolter · Aditi Raghunathan | N/A | Code |
| A Whac-a-Mole Dilemma: Shortcuts Come in Multiples Where Mitigating One Amplifies Others | Zhiheng Li · Ivan Evtimov · Albert Gordo · Caner Hazirbas · Tal Hassner · Cristian Canton Ferrer · Chenliang Xu · Mark Ibrahim | N/A | Code |
| GIVL: Improving Geographical Inclusivity of Vision-Language Models With Pre-Training Methods | Da Yin · Feng Gao · Govind Thattai · Michael Johnston · Kai-Wei Chang | N/A | Code |
| Devil’s on the Edges: Selective Quad Attention for Scene Graph Generation | Deunsol Jung · Sanghyun Kim · Won Hwa Kim · Minsu Cho | N/A | Code |
| GeoMVSNet: Learning Multi-View Stereo With Geometry Perception | Zhe Zhang · Rui Peng · Yuxi Hu · Ronggang Wang | N/A | Code |
| CR-FIQA: Face Image Quality Assessment by Learning Sample Relative Classifiability | Fadi Boutros · Meiling Fang · Marcel Klemt · Biying Fu · Naser Damer | N/A | Code |
| NeuFace: Realistic 3D Neural Face Rendering From Multi-View Images | Mingwu Zheng · Haiyu Zhang · Hongyu Yang · Di Huang | N/A | Code |
| MethaneMapper: Spectral Absorption Aware Hyperspectral Transformer for Methane Detection | Satish Kumar · Ivan Arevalo · ASM Iftekhar · B S Manjunath | N/A | Code |
| Re-Thinking Model Inversion Attacks Against Deep Neural Networks | Ngoc-Bao Nguyen · Keshigeyan Chandrasegaran · Milad Abdollahzadeh · Ngai-Man Cheung | N/A | Code |
| SAP-DETR: Bridging the Gap Between Salient Points and Queries-Based Transformer Detector for Fast Model Convergency | Yang Liu · Yao Zhang · Yixin Wang · Yang Zhang · Jiang Tian · Zhongchao Shi · Jianping Fan · Zhiqiang He | N/A | Code |
| VectorFloorSeg: Two-Stream Graph Attention Network for Vectorized Roughcast Floorplan Segmentation | Bingchen Yang · Haiyong Jiang · Hao Pan · Jun Xiao | N/A | Code |
| MARLIN: Masked Autoencoder for Facial Video Representation LearnINg | Zhixi Cai · Shreya Ghosh · Kalin Stefanov · Abhinav Dhall · Jianfei Cai · Hamid Rezatofighi · Reza Haffari · Munawar Hayat | N/A | Code |
| KD-DLGAN: Data Limited Image Generation via Knowledge Distillation | Kaiwen Cui · Yingchen Yu · Fangneng Zhan · Shengcai Liao · Shijian Lu · Eric P. Xing | N/A | Code |
| Hierarchical Neural Memory Network for Low Latency Event Processing | Ryuhei Hamaguchi · Yasutaka Furukawa · Masaki Onishi · Ken Sakurada | N/A | Code |
| Optimal Transport Minimization: Crowd Localization on Density Maps for Semi-Supervised Counting | Wei Lin · Antoni B. Chan | N/A | Code |
| Towards All-in-One Pre-Training via Maximizing Multi-Modal Mutual Information | Weijie Su · Xizhou Zhu · Chenxin Tao · Lewei Lu · Bin Li · Gao Huang · Yu Qiao · Xiaogang Wang · Jie Zhou · Jifeng Dai | N/A | Code |
| Revisiting Reverse Distillation for Anomaly Detection | Tran Dinh Tien · Anh Tuan Nguyen · Nguyen Hoang Tran · Ta Duc Huy · Soan T.M. Duong · Chanh D. Tr. Nguyen · Steven Q. H. Truong | N/A | Code |
| Conditional Generation of Audio From Video via Foley Analogies | Yuexi Du · Ziyang Chen · Justin Salamon · Bryan Russell · Andrew Owens | N/A | Code |
| Parameter Efficient Local Implicit Image Function Network for Face Segmentation | Mausoom Sarkar · Nikitha SR · Mayur Hemani · Rishabh Jain · Balaji Krishnamurthy | N/A | Code |
| Learning Decorrelated Representations Efficiently Using Fast Fourier Transform | Yutaro Shigeto · Masashi Shimbo · Yuya Yoshikawa · Akikazu Takeuchi | N/A | Code |
| FaceLit: Neural 3D Relightable Faces | Anurag Ranjan · Kwang Moo Yi · Jen-Hao Rick Chang · Oncel Tuzel | N/A | Code |
| Pointersect: Neural Rendering With Cloud-Ray Intersection | Jen-Hao Rick Chang · Wei-Yu Chen · Anurag Ranjan · Kwang Moo Yi · Oncel Tuzel | N/A | Code |
| High-Fidelity Clothed Avatar Reconstruction From a Single Image | Tingting Liao · Xiaomei Zhang · Yuliang Xiu · Hongwei Yi · Xudong Liu · Guo-Jun Qi · Yong Zhang · Xuan Wang · Xiangyu Zhu · Zhen Lei | N/A | Code |
| BAD-NeRF: Bundle Adjusted Deblur Neural Radiance Fields | Peng Wang · Lingzhe Zhao · Ruijie Ma · Peidong Liu | N/A | Code |
| Meta-Tuning Loss Functions and Data Augmentation for Few-Shot Object Detection | Berkan Demirel · Orhun Buğra Baran · Ramazan Gokberk Cinbis | N/A | Code |
| StyleRF: Zero-Shot 3D Style Transfer of Neural Radiance Fields | Kunhao Liu · Fangneng Zhan · Yiwen Chen · Jiahui Zhang · Yingchen Yu · Abdulmotaleb El Saddik · Shijian Lu · Eric P. Xing | N/A | Code |
| DeepSolo: Let Transformer Decoder With Explicit Points Solo for Text Spotting | Maoyuan Ye · Jing Zhang · Shanshan Zhao · Juhua Liu · Tongliang Liu · Bo Du · Dacheng Tao | N/A | Code |
| Local Implicit Normalizing Flow for Arbitrary-Scale Image Super-Resolution | Jie-En Yao · Li-Yuan Tsao · Yi-Chen Lo · Roy Tseng · Chia-Che Chang · Chun-Yi Lee | N/A | Code |
| LAVENDER: Unifying Video-Language Understanding As Masked Language Modeling | Linjie Li · Zhe Gan · Kevin Lin · Chung-Ching Lin · Zicheng Liu · Ce Liu · Lijuan Wang | N/A | Code |
| Cascaded Local Implicit Transformer for Arbitrary-Scale Super-Resolution | Hao-Wei Chen · Yu-Syuan Xu · Min-Fong Hong · Yi-Min Tsai · Hsien-Kai Kuo · Chun-Yi Lee | N/A | Code |
| Fair Federated Medical Image Segmentation via Client Contribution Estimation | Meirui Jiang · Holger R. Roth · Wenqi Li · Dong Yang · Can Zhao · Vishwesh Nath · Daguang Xu · Qi Dou · Ziyue Xu | N/A | Code |
| An Empirical Study of End-to-End Video-Language Transformers With Masked Visual Modeling | Tsu-Jui Fu · Linjie Li · Zhe Gan · Kevin Lin · William Yang Wang · Lijuan Wang · Zicheng Liu | N/A | Code |
| ReCo: Region-Controlled Text-to-Image Generation | Zhengyuan Yang · Jianfeng Wang · Zhe Gan · Linjie Li · Kevin Lin · Chenfei Wu · Nan Duan · Zicheng Liu · Ce Liu · Michael Zeng · Lijuan Wang | N/A | Code |
| Uncertainty-Aware Vision-Based Metric Cross-View Geolocalization | Florian Fervers · Sebastian Bullinger · Christoph Bodensteiner · Michael Arens · Rainer Stiefelhagen | N/A | Code |
| LayoutDiffusion: Controllable Diffusion Model for Layout-to-Image Generation | Guangcong Zheng · Xianpan Zhou · Xuewei Li · Zhongang Qi · Ying Shan · Xi Li | N/A | Code |
| Efficient Loss Function by Minimizing the Detrimental Effect of Floating-Point Errors on Gradient-Based Attacks | Yunrui Yu · Cheng-Zhong Xu | N/A | Code |
| NIKI: Neural Inverse Kinematics With Invertible Neural Networks for 3D Human Pose and Shape Estimation | Jiefeng Li · Siyuan Bian · Qi Liu · Jiasheng Tang · Fan Wang · Cewu Lu | N/A | Code |
| 3D Spatial Multimodal Knowledge Accumulation for Scene Graph Prediction in Point Cloud | Mingtao Feng · Haoran Hou · Liang Zhang · Zijie Wu · Yulan Guo · Ajmal Mian | N/A | Code |
| Egocentric Auditory Attention Localization in Conversations | Fiona Ryan · Hao Jiang · Abhinav Shukla · James M. Rehg · Vamsi Krishna Ithapu | N/A | Code |
| EFEM: Equivariant Neural Field Expectation Maximization for 3D Object Segmentation Without Scene Supervision | Jiahui Lei · Congyue Deng · Karl Schmeckpeper · Leonidas Guibas · Kostas Daniilidis | N/A | Code |
| Divide and Conquer: Answering Questions With Object Factorization and Compositional Reasoning | Shi Chen · Qi Zhao | N/A | Code |
| Text-Visual Prompting for Efficient 2D Temporal Video Grounding | Yimeng Zhang · Xin Chen · Jinghan Jia · Sijia Liu · Ke Ding | N/A | Code |
| Fusing Pre-Trained Language Models With Multimodal Prompts Through Reinforcement Learning | Youngjae Yu · Jiwan Chung · Heeseung Yun · Jack Hessel · Jae Sung Park · Ximing Lu · Rowan Zellers · Prithviraj Ammanabrolu · Ronan Le Bras · Gunhee Kim · Yejin Choi | N/A | Code |
| UniHCP: A Unified Model for Human-Centric Perceptions | Yuanzheng Ci · Yizhou Wang · Meilin Chen · Shixiang Tang · Lei Bai · Feng Zhu · Rui Zhao · Fengwei Yu · Donglian Qi · Wanli Ouyang | N/A | Code |
| VoP: Text-Video Co-Operative Prompt Tuning for Cross-Modal Retrieval | Siteng Huang · Biao Gong · Yulin Pan · Jianwen Jiang · Yiliang Lv · Yuyuan Li · Donglin Wang | N/A | Code |
| PointConvFormer: Revenge of the Point-Based Convolution | Wenxuan Wu · Li Fuxin · Qi Shan | N/A | Code |
| BAAM: Monocular 3D Pose and Shape Reconstruction With Bi-Contextual Attention Module and Attention-Guided Modeling | Hyo-Jun Lee · Hanul Kim · Su-Min Choi · Seong-Gyun Jeong · Yeong Jun Koh | N/A | Code |
| HumanBench: Towards General Human-Centric Perception With Projector Assisted Pretraining | Shixiang Tang · Cheng Chen · Qingsong Xie · Meilin Chen · Yizhou Wang · Yuanzheng Ci · Lei Bai · Feng Zhu · Haiyang Yang · Li Yi · Rui Zhao · Wanli Ouyang | N/A | Code |
| Local Connectivity-Based Density Estimation for Face Clustering | Junho Shin · Hyo-Jun Lee · Hyunseop Kim · Jong-Hyeon Baek · Daehyun Kim · Yeong Jun Koh | N/A | Code |
| DistilPose: Tokenized Pose Regression With Heatmap Distillation | Suhang Ye · Yingyi Zhang · Jie Hu · Liujuan Cao · Shengchuan Zhang · Lei Shen · Jun Wang · Shouhong Ding · Rongrong Ji | N/A | Code |
| Beyond Appearance: A Semantic Controllable Self-Supervised Learning Framework for Human-Centric Visual Tasks | Weihua Chen · Xianzhe Xu · Jian Jia · Hao Luo · Yaohua Wang · Fan Wang · Rong Jin · Xiuyu Sun | N/A | Code |
| ViPLO: Vision Transformer Based Pose-Conditioned Self-Loop Graph for Human-Object Interaction Detection | Jeeseung Park · Jin-Woo Park · Jong-Seok Lee | N/A | Code |
| EVA: Exploring the Limits of Masked Visual Representation Learning at Scale | Yuxin Fang · Wen Wang · Binhui Xie · Quan Sun · Ledell Wu · Xinggang Wang · Tiejun Huang · Xinlong Wang · Yue Cao | N/A | Code |
| I2-SDF: Intrinsic Indoor Scene Reconstruction and Editing via Raytracing in Neural SDFs | Jingsen Zhu · Yuchi Huo · Qi Ye · Fujun Luan · Jifan Li · Dianbing Xi · Lisha Wang · Rui Tang · Wei Hua · Hujun Bao · Rui Wang | N/A | Code |
| DrapeNet: Garment Generation and Self-Supervised Draping | Luca De Luigi · Ren Li · Benoît Guillard · Mathieu Salzmann · Pascal Fua | N/A | Code |
| STMixer: A One-Stage Sparse Action Detector | Tao Wu · Mengqi Cao · Ziteng Gao · Gangshan Wu · Limin Wang | N/A | Code |
| Inverse Rendering of Translucent Objects Using Physical and Neural Renderers | Chenhao Li · Trung Thanh Ngo · Hajime Nagahara | N/A | Code |
| Humans As Light Bulbs: 3D Human Reconstruction From Thermal Reflection | Ruoshi Liu · Carl Vondrick | N/A | Code |
| CF-Font: Content Fusion for Few-Shot Font Generation | Chi Wang · Min Zhou · Tiezheng Ge · Yuning Jiang · Hujun Bao · Weiwei Xu | N/A | Code |
| GLeaD: Improving GANs With a Generator-Leading Task | Qingyan Bai · Ceyuan Yang · Yinghao Xu · Xihui Liu · Yujiu Yang · Yujun Shen | N/A | Code |
| StarCraftImage: A Dataset for Prototyping Spatial Reasoning Methods for Multi-Agent Environments | Sean Kulinski · Nicholas R. Waytowich · James Z. Hare · David I. Inouye | N/A | Code |
| WIRE: Wavelet Implicit Neural Representations | Vishwanath Saragadam · Daniel LeJeune · Jasper Tan · Guha Balakrishnan · Ashok Veeraraghavan · Richard G. Baraniuk | N/A | Code |
| Thermal Spread Functions (TSF): Physics-Guided Material Classification | Aniket Dashpute · Vishwanath Saragadam · Emma Alexander · Florian Willomitzer · Aggelos Katsaggelos · Ashok Veeraraghavan · Oliver Cossairt | N/A | Code |
| Improving Zero-Shot Generalization and Robustness of Multi-Modal Models | Yunhao Ge · Jie Ren · Andrew Gallagher · Yuxiao Wang · Ming-Hsuan Yang · Hartwig Adam · Laurent Itti · Balaji Lakshminarayanan · Jiaping Zhao | N/A | Code |
| The Differentiable Lens: Compound Lens Search Over Glass Surfaces and Materials for Object Detection | Geoffroi Côté · Fahim Mannan · Simon Thibault · Jean-François Lalonde · Felix Heide | N/A | Code |
| Federated Domain Generalization With Generalization Adjustment | Ruipeng Zhang · Qinwei Xu · Jiangchao Yao · Ya Zhang · Qi Tian · Yanfeng Wang | N/A | Code |
| Propagate and Calibrate: Real-Time Passive Non-Line-of-Sight Tracking | Yihao Wang · Zhigang Wang · Bin Zhao · Dong Wang · Mulin Chen · Xuelong Li | N/A | Code |
| Fine-Grained Image-Text Matching by Cross-Modal Hard Aligning Network | Zhengxin Pan · Fangyu Wu · Bailing Zhang | N/A | Code |
| On the Benefits of 3D Pose and Tracking for Human Action Recognition | Jathushan Rajasegaran · Georgios Pavlakos · Angjoo Kanazawa · Christoph Feichtenhofer · Jitendra Malik | N/A | Code |
| Visual DNA: Representing and Comparing Images Using Distributions of Neuron Activations | Benjamin Ramtoula · Matthew Gadd · Paul Newman · Daniele De Martini | N/A | Code |
| Fine-Tuned CLIP Models Are Efficient Video Learners | Hanoona Rasheed · Muhammad Uzair Khattak · Muhammad Maaz · Salman Khan · Fahad Shahbaz Khan | N/A | Code |
| Connecting Vision and Language With Video Localized Narratives | Paul Voigtlaender · Soravit Changpinyo · Jordi Pont-Tuset · Radu Soricut · Vittorio Ferrari | N/A | Code |
| K-Planes: Explicit Radiance Fields in Space, Time, and Appearance | Sara Fridovich-Keil · Giacomo Meanti · Frederik Rahbæk Warburg · Benjamin Recht · Angjoo Kanazawa | N/A | Code |
| Virtual Occlusions Through Implicit Depth | Jamie Watson · Mohamed Sayed · Zawar Qureshi · Gabriel J. Brostow · Sara Vicente · Oisin Mac Aodha · Michael Firman | N/A | Code |
| Common Pets in 3D: Dynamic New-View Synthesis of Real-Life Deformable Categories | Samarth Sinha · Roman Shapovalov · Jeremy Reizenstein · Ignacio Rocco · Natalia Neverova · Andrea Vedaldi · David Novotny | N/A | Code |
| LG-BPN: Local and Global Blind-Patch Network for Self-Supervised Real-World Denoising | Zichun Wang · Ying Fu · Ji Liu · Yulun Zhang | N/A | Code |
| One-Shot High-Fidelity Talking-Head Synthesis With Deformable Neural Radiance Field | Weichuang Li · Longhao Zhang · Dong Wang · Bin Zhao · Zhigang Wang · Mulin Chen · Bang Zhang · Zhongjian Wang · Liefeng Bo · Xuelong Li | N/A | Code |
| Collaborative Diffusion for Multi-Modal Face Generation and Editing | Ziqi Huang · Kelvin C.K. Chan · Yuming Jiang · Ziwei Liu | N/A | Code |
| Blind Video Deflickering by Neural Filtering With a Flawed Atlas | Chenyang Lei · Xuanchi Ren · Zhaoxiang Zhang · Qifeng Chen | N/A | Code |
| RefTeacher: A Strong Baseline for Semi-Supervised Referring Expression Comprehension | Jiamu Sun · Gen Luo · Yiyi Zhou · Xiaoshuai Sun · Guannan Jiang · Zhiyu Wang · Rongrong Ji | N/A | Code |
| HNeRV: A Hybrid Neural Representation for Videos | Hao Chen · Matthew Gwilliam · Ser-Nam Lim · Abhinav Shrivastava | N/A | Code |
| Learning 3D-Aware Image Synthesis With Unknown Pose Distribution | Zifan Shi · Yujun Shen · Yinghao Xu · Sida Peng · Yiyi Liao · Sheng Guo · Qifeng Chen · Dit-Yan Yeung | N/A | Code |
| DynaFed: Tackling Client Data Heterogeneity With Global Dynamics | Renjie Pi · Weizhong Zhang · Yueqi Xie · Jiahui Gao · Xiaoyu Wang · Sunghun Kim · Qifeng Chen | N/A | Code |
| Enlarging Instance-Specific and Class-Specific Information for Open-Set Action Recognition | Jun Cen · Shiwei Zhang · Xiang Wang · Yixuan Pei · Zhiwu Qing · Yingya Zhang · Qifeng Chen | N/A | Code |
| RODIN: A Generative Model for Sculpting 3D Digital Avatars Using Diffusion | Tengfei Wang · Bo Zhang · Ting Zhang · Shuyang Gu · Jianmin Bao · Tadas Baltrusaitis · Jingjing Shen · Dong Chen · Fang Wen · Qifeng Chen · Baining Guo | N/A | Code |
| IFSeg: Image-Free Semantic Segmentation via Vision-Language Model | Sukmin Yun · Seong Hyeon Park · Paul Hongsuck Seo · Jinwoo Shin | N/A | Code |
| Detecting Everything in the Open World: Towards Universal Object Detection | Zhenyu Wang · Yali Li · Xi Chen · Ser-Nam Lim · Antonio Torralba · Hengshuang Zhao · Shengjin Wang | N/A | Code |
| Improving Visual Grounding by Encouraging Consistent Gradient-Based Explanations | Ziyan Yang · Kushal Kafle · Franck Dernoncourt · Vicente Ordonez | N/A | Code |
| Temporally Consistent Online Depth Estimation Using Point-Based Fusion | Numair Khan · Eric Penner · Douglas Lanman · Lei Xiao | N/A | Code |
| NeuralDome: A Neural Modeling Pipeline on Multi-View Human-Object Interactions | Juze Zhang · Haimin Luo · Hongdi Yang · Xinru Xu · Qianyang Wu · Ye Shi · Jingyi Yu · Lan Xu · Jingya Wang | N/A | Code |
| Token Turing Machines | Michael S. Ryoo · Keerthana Gopalakrishnan · Kumara Kahatapitiya · Ted Xiao · Kanishka Rao · Austin Stone · Yao Lu · Julian Ibarz · Anurag Arnab | N/A | Code |
| Computationally Budgeted Continual Learning: What Does Matter? | Ameya Prabhu · Hasan Abed Al Kader Hammoud · Puneet K. Dokania · Philip H.S. Torr · Ser-Nam Lim · Bernard Ghanem · Adel Bibi | N/A | Code |
| CLIP2Protect: Protecting Facial Privacy Using Text-Guided Makeup via Adversarial Latent Search | Fahad Shamshad · Muzammal Naseer · Karthik Nandakumar | N/A | Code |
| Robot Structure Prior Guided Temporal Attention for Camera-to-Robot Pose Estimation From Image Sequence | Yang Tian · Jiyao Zhang · Zekai Yin · Hao Dong | N/A | Code |
| Affordances From Human Videos as a Versatile Representation for Robotics | Shikhar Bahl · Russell Mendonca · Lili Chen · Unnat Jain · Deepak Pathak | N/A | Code |
| MIANet: Aggregating Unbiased Instance and General Information for Few-Shot Semantic Segmentation | Yong Yang · Qiong Chen · Yuan Feng · Tianlin Huang | N/A | Code |
| Learning To Generate Image Embeddings With User-Level Differential Privacy | Zheng Xu · Maxwell Collins · Yuxiao Wang · Liviu Panait · Sewoong Oh · Sean Augenstein · Ting Liu · Florian Schroff · H. Brendan McMahan | N/A | Code |
| Genie: Show Me the Data for Quantization | Yongkweon Jeon · Chungman Lee · Ho-young Kim | N/A | Code |
| DSVT: Dynamic Sparse Voxel Transformer With Rotated Sets | Haiyang Wang · Chen Shi · Shaoshuai Shi · Meng Lei · Sen Wang · Di He · Bernt Schiele · Liwei Wang | N/A | Code |
| Transformer-Based Learned Optimization | Erik Gärtner · Luke Metz · Mykhaylo Andriluka · C. Daniel Freeman · Cristian Sminchisescu | N/A | Code |
| Zero-Shot Noise2Noise: Efficient Image Denoising Without Any Data | Youssef Mansour · Reinhard Heckel | N/A | Code |
| Super-Resolution Neural Operator | Min Wei · Xuesong Zhang | N/A | Code |
| StyleIPSB: Identity-Preserving Semantic Basis of StyleGAN for High Fidelity Face Swapping | Diqiong Jiang · Dan Song · Ruofeng Tong · Min Tang | N/A | Code |
| Self-Supervised Blind Motion Deblurring With Deep Expectation Maximization | Ji Li · Weixi Wang · Yuesong Nan · Hui Ji | N/A | Code |
| Confidence-Aware Personalized Federated Learning via Variational Expectation Maximization | Junyi Zhu · Xingchen Ma · Matthew B. Blaschko | N/A | Code |
| Human Pose As Compositional Tokens | Zigang Geng · Chunyu Wang · Yixuan Wei · Ze Liu · Houqiang Li · Han Hu | N/A | Code |
| GeoMAE: Masked Geometric Target Prediction for Self-Supervised Point Cloud Pre-Training | Xiaoyu Tian · Haoxi Ran · Yue Wang · Hang Zhao | N/A | Code |
| RUST: Latent Neural Scene Representations From Unposed Imagery | Mehdi S. M. Sajjadi · Aravindh Mahendran · Thomas Kipf · Etienne Pot · Daniel Duckworth · Mario Lučić · Klaus Greff | N/A | Code |
| Bias Mimicking: A Simple Sampling Approach for Bias Mitigation | Maan Qraitem · Kate Saenko · Bryan A. Plummer | N/A | Code |
| V2X-Seq: A Large-Scale Sequential Dataset for Vehicle-Infrastructure Cooperative Perception and Forecasting | Haibao Yu · Wenxian Yang · Hongzhi Ruan · Zhenwei Yang · Yingjuan Tang · Xu Gao · Xin Hao · Yifeng Shi · Yifeng Pan · Ning Sun · Juan Song · Jirui Yuan · Ping Luo · Zaiqing Nie | N/A | Code |
| Conditional Image-to-Video Generation With Latent Flow Diffusion Models | Haomiao Ni · Changhao Shi · Kai Li · Sharon X. Huang · Martin Renqiang Min | N/A | Code |
| Anchor3DLane: Learning To Regress 3D Anchors for Monocular 3D Lane Detection | Shaofei Huang · Zhenwei Shen · Zehao Huang · Zi-han Ding · Jiao Dai · Jizhong Han · Naiyan Wang · Si Liu | N/A | Code |
| 3D Semantic Segmentation in the Wild: Learning Generalized Models for Adverse-Condition Point Clouds | Aoran Xiao · Jiaxing Huang · Weihao Xuan · Ruijie Ren · Kangcheng Liu · Dayan Guan · Abdulmotaleb El Saddik · Shijian Lu · Eric P. Xing | N/A | Code |
| NeMo: Learning 3D Neural Motion Fields From Multiple Video Instances of the Same Action | Kuan-Chieh Wang · Zhenzhen Weng · Maria Xenochristou · João Pedro Araújo · Jeffrey Gu · Karen Liu · Serena Yeung | N/A | Code |
| Decomposed Soft Prompt Guided Fusion Enhancing for Compositional Zero-Shot Learning | Xiaocheng Lu · Song Guo · Ziming Liu · Jingcai Guo | N/A | Code |
| iDisc: Internal Discretization for Monocular Depth Estimation | Luigi Piccinelli · Christos Sakaridis · Fisher Yu | N/A | Code |
| UniDexGrasp: Universal Robotic Dexterous Grasping via Learning Diverse Proposal Generation and Goal-Conditioned Policy | Yinzhen Xu · Weikang Wan · Jialiang Zhang · Haoran Liu · Zikang Shan · Hao Shen · Ruicheng Wang · Haoran Geng · Yijia Weng · Jiayi Chen · Tengyu Liu · Li Yi · He Wang | N/A | Code |
| PolyFormer: Referring Image Segmentation As Sequential Polygon Generation | Jiang Liu · Hui Ding · Zhaowei Cai · Yuting Zhang · Ravi Kumar Satzoda · Vijay Mahadevan · R. Manmatha | N/A | Code |
| Interactive Segmentation of Radiance Fields | Rahul Goel · Dhawal Sirikonda · Saurabh Saini · P. J. Narayanan | N/A | Code |
| PointCert: Point Cloud Classification With Deterministic Certified Robustness Guarantees | Jinghuai Zhang · Jinyuan Jia · Hongbin Liu · Neil Zhenqiang Gong | N/A | Code |
| Indiscernible Object Counting in Underwater Scenes | Guolei Sun · Zhaochong An · Yun Liu · Ce Liu · Christos Sakaridis · Deng-Ping Fan · Luc Van Gool | N/A | Code |
| Improving Robustness of Vision Transformers by Reducing Sensitivity To Patch Corruptions | Yong Guo · David Stutz · Bernt Schiele | N/A | Code |
| Real-Time Multi-Person Eyeblink Detection in the Wild for Untrimmed Video | Wenzheng Zeng · Yang Xiao · Sicheng Wei · Jinfang Gan · Xintao Zhang · Zhiguo Cao · Zhiwen Fang · Joey Tianyi Zhou | N/A | Code |
| BEV-LaneDet: An Efficient 3D Lane Detection Based on Virtual Camera via Key-Points | Ruihao Wang · Jian Qin · Kaiying Li · Yaochen Li · Dong Cao · Jintao Xu | N/A | Code |
| Infinite Photorealistic Worlds Using Procedural Generation | Alexander Raistrick · Lahav Lipson · Zeyu Ma · Lingjie Mei · Mingzhe Wang · Yiming Zuo · Karhan Kayan · Hongyu Wen · Beining Han · Yihan Wang · Alejandro Newell · Hei Law · Ankit Goyal · Kaiyu Yang · Jia Deng | N/A | Code |
| High-Fidelity 3D Human Digitization From Single 2K Resolution Images | Sang-Hun Han · Min-Gyu Park · Ju Hong Yoon · Ju-Mi Kang · Young-Jae Park · Hae-Gon Jeon | N/A | Code |
| GALIP: Generative Adversarial CLIPs for Text-to-Image Synthesis | Ming Tao · Bing-Kun Bao · Hao Tang · Changsheng Xu | N/A | Code |
| Language-Guided Audio-Visual Source Separation via Trimodal Consistency | Reuben Tan · Arijit Ray · Andrea Burns · Bryan A. Plummer · Justin Salamon · Oriol Nieto · Bryan Russell · Kate Saenko | N/A | Code |
| Probabilistic Debiasing of Scene Graphs | Bashirul Azam Biswas · Qiang Ji | N/A | Code |
| PVO: Panoptic Visual Odometry | Weicai Ye · Xinyue Lan · Shuo Chen · Yuhang Ming · Xingyuan Yu · Hujun Bao · Zhaopeng Cui · Guofeng Zhang | N/A | Code |
| Superclass Learning With Representation Enhancement | Zeyu Gan · Suyun Zhao · Jinlong Kang · Liyuan Shang · Hong Chen · Cuiping Li | N/A | Code |
| GAPartNet: Cross-Category Domain-Generalizable Object Perception and Manipulation via Generalizable and Actionable Parts | Haoran Geng · Helin Xu · Chengyang Zhao · Chao Xu · Li Yi · Siyuan Huang · He Wang | N/A | Code |
| Learning the Distribution of Errors in Stereo Matching for Joint Disparity and Uncertainty Estimation | Liyan Chen · Weihan Wang · Philippos Mordohai | N/A | Code |
| Efficient View Synthesis and 3D-Based Multi-Frame Denoising With Multiplane Feature Representations | Thomas Tanay · Aleš Leonardis · Matteo Maggioni | N/A | Code |
| Large-Capacity and Flexible Video Steganography via Invertible Neural Network | Chong Mou · Youmin Xu · Jiechong Song · Chen Zhao · Bernard Ghanem · Jian Zhang | N/A | Code |
| Generating Part-Aware Editable 3D Shapes Without 3D Supervision | Konstantinos Tertikas · Despoina Paschalidou · Boxiao Pan · Jeong Joon Park · Mikaela Angelina Uy · Ioannis Emiris · Yannis Avrithis · Leonidas Guibas | N/A | Code |
| Vision Transformer With Super Token Sampling | Huaibo Huang · Xiaoqiang Zhou · Jie Cao · Ran He · Tieniu Tan | N/A | Code |
| Renderable Neural Radiance Map for Visual Navigation | Obin Kwon · Jeongho Park · Songhwai Oh | N/A | Code |
| Learning Compact Representations for LiDAR Completion and Generation | Yuwen Xiong · Wei-Chiu Ma · Jingkang Wang · Raquel Urtasun | N/A | Code |
| CoMFormer: Continual Learning in Semantic and Panoptic Segmentation | Fabio Cermelli · Matthieu Cord · Arthur Douillard | N/A | Code |
| A Bag-of-Prototypes Representation for Dataset-Level Applications | Weijie Tu · Weijian Deng · Tom Gedeon · Liang Zheng | N/A | Code |
| Geometric Visual Similarity Learning in 3D Medical Image Self-Supervised Pre-Training | Yuting He · Guanyu Yang · Rongjun Ge · Yang Chen · Jean-Louis Coatrieux · Boyu Wang · Shuo Li | N/A | Code |
| Weakly Supervised Video Emotion Detection and Prediction via Cross-Modal Temporal Erasing Network | Zhicheng Zhang · Lijuan Wang · Jufeng Yang | N/A | Code |
| CODA-Prompt: COntinual Decomposed Attention-Based Prompting for Rehearsal-Free Continual Learning | James Seale Smith · Leonid Karlinsky · Vyshnavi Gutta · Paola Cascante-Bonilla · Donghyun Kim · Assaf Arbelle · Rameswar Panda · Rogerio Feris · Zsolt Kira | N/A | Code |
| CodeTalker: Speech-Driven 3D Facial Animation With Discrete Motion Prior | Jinbo Xing · Menghan Xia · Yuechen Zhang · Xiaodong Cun · Jue Wang · Tien-Tsin Wong | N/A | Code |
| VolRecon: Volume Rendering of Signed Ray Distance Functions for Generalizable Multi-View Reconstruction | Yufan Ren · Fangjinhua Wang · Tong Zhang · Marc Pollefeys · Sabine Süsstrunk | N/A | Code |
| NewsNet: A Novel Dataset for Hierarchical Temporal Segmentation | Haoqian Wu · Keyu Chen · Haozhe Liu · Mingchen Zhuge · Bing Li · Ruizhi Qiao · Xiujun Shu · Bei Gan · Liangsheng Xu · Bo Ren · Mengmeng Xu · Wentian Zhang · Raghavendra Ramachandra · Chia-Wen Lin · Bernard Ghanem | N/A | Code |
| Ref-NPR: Reference-Based Non-Photorealistic Radiance Fields for Controllable Scene Stylization | Yuechen Zhang · Zexin He · Jinbo Xing · Xufeng Yao · Jiaya Jia | N/A | Code |
| GANmouflage: 3D Object Nondetection With Texture Fields | Rui Guo · Jasmine Collins · Oscar de Lima · Andrew Owens | N/A | Code |
| GP-VTON: Towards General Purpose Virtual Try-On via Collaborative Local-Flow Global-Parsing Learning | Zhenyu Xie · Zaiyu Huang · Xin Dong · Fuwei Zhao · Haoye Dong · Xijin Zhang · Feida Zhu · Xiaodan Liang | N/A | Code |
| DeSTSeg: Segmentation Guided Denoising Student-Teacher for Anomaly Detection | Xuan Zhang · Shiyu Li · Xi Li · Ping Huang · Jiulong Shan · Ting Chen | N/A | Code |
| Pix2map: Cross-Modal Retrieval for Inferring Street Maps From Images | Xindi Wu · KwunFung Lau · Francesco Ferroni · Aljoša Ošep · Deva Ramanan | N/A | Code |
| Beyond mAP: Towards Better Evaluation of Instance Segmentation | Rohit Jena · Lukas Zhornyak · Nehal Doiphode · Pratik Chaudhari · Vivek Buch · James Gee · Jianbo Shi | N/A | Code |
| Federated Learning With Data-Agnostic Distribution Fusion | Jian-hui Duan · Wenzhong Li · Derun Zou · Ruichen Li · Sanglu Lu | N/A | Code |
| Make-a-Story: Visual Memory Conditioned Consistent Story Generation | Tanzila Rahman · Hsin-Ying Lee · Jian Ren · Sergey Tulyakov · Shweta Mahajan · Leonid Sigal | N/A | Code |
| Scalable, Detailed and Mask-Free Universal Photometric Stereo | Satoshi Ikehata | N/A | Code |
| ToThePoint: Efficient Contrastive Learning of 3D Point Clouds via Recycling | Xinglin Li · Jiajing Chen · Jinhui Ouyang · Hanhui Deng · Senem Velipasalar · Di Wu | N/A | Code |
| Local-to-Global Registration for Bundle-Adjusting Neural Radiance Fields | Yue Chen · Xingyu Chen · Xuan Wang · Qi Zhang · Yu Guo · Ying Shan · Fei Wang | N/A | Code |
| UV Volumes for Real-Time Rendering of Editable Free-View Human Performance | Yue Chen · Xuan Wang · Xingyu Chen · Qi Zhang · Xiaoyu Li · Yu Guo · Jue Wang · Fei Wang | N/A | Code |
| SplineCam: Exact Visualization and Characterization of Deep Network Geometry and Decision Boundaries | Ahmed Imtiaz Humayun · Randall Balestriero · Guha Balakrishnan · Richard G. Baraniuk | N/A | Code |
| Hi-LASSIE: High-Fidelity Articulated Shape and Skeleton Discovery From Sparse Image Ensemble | Chun-Han Yao · Wei-Chih Hung · Yuanzhen Li · Michael Rubinstein · Ming-Hsuan Yang · Varun Jampani | N/A | Code |
| VisFusion: Visibility-Aware Online 3D Scene Reconstruction From Videos | Huiyu Gao · Wei Mao · Miaomiao Liu | N/A | Code |
| Unsupervised Volumetric Animation | Aliaksandr Siarohin · Willi Menapace · Ivan Skorokhodov · Kyle Olszewski · Jian Ren · Hsin-Ying Lee · Menglei Chai · Sergey Tulyakov | N/A | Code |
| DKM: Dense Kernelized Feature Matching for Geometry Estimation | Johan Edstedt · Ioannis Athanasiadis · Mårten Wadenbäck · Michael Felsberg | N/A | Code |
| All in One: Exploring Unified Video-Language Pre-Training | Jinpeng Wang · Yixiao Ge · Rui Yan · Yuying Ge · Kevin Qinghong Lin · Satoshi Tsutsui · Xudong Lin · Guanyu Cai · Jianping Wu · Ying Shan · Xiaohu Qie · Mike Zheng Shou | N/A | Code |
| Spatiotemporal Self-Supervised Learning for Point Clouds in the Wild | Yanhao Wu · Tong Zhang · Wei Ke · Sabine Süsstrunk · Mathieu Salzmann | N/A | Code |
| DynIBaR: Neural Dynamic Image-Based Rendering | Zhengqi Li · Qianqian Wang · Forrester Cole · Richard Tucker · Noah Snavely | N/A | Code |
| Seeing Through the Glass: Neural 3D Reconstruction of Object Inside a Transparent Container | Jinguang Tong · Sundaram Muthu · Fahira Afzal Maken · Chuong Nguyen · Hongdong Li | N/A | Code |
| JAWS: Just a Wild Shot for Cinematic Transfer in Neural Radiance Fields | Xi Wang · Robin Courant · Jinglei Shi · Eric Marchand · Marc Christie | N/A | Code |
| CCuantuMM: Cycle-Consistent Quantum-Hybrid Matching of Multiple Shapes | Harshil Bhatia · Edith Tretschk · Zorah Lähner · Marcel Seelbach Benkner · Michael Moeller · Christian Theobalt · Vladislav Golyanik | N/A | Code |
| NS3D: Neuro-Symbolic Grounding of 3D Objects and Relations | Joy Hsu · Jiayuan Mao · Jiajun Wu | N/A | Code |
| TempSAL – Uncovering Temporal Information for Deep Saliency Prediction | Bahar Aydemir · Ludo Hoffstetter · Tong Zhang · Mathieu Salzmann · Sabine Süsstrunk | N/A | Code |
| BiasBed – Rigorous Texture Bias Evaluation | Nikolai Kalischek · Rodrigo Caye Daudt · Torben Peters · Reinhard Furrer · Jan D. Wegner · Konrad Schindler | N/A | Code |
| Real-Time Neural Light Field on Mobile Devices | Junli Cao · Huan Wang · Pavlo Chemerys · Vladislav Shakhrai · Ju Hu · Yun Fu · Denys Makoviichuk · Sergey Tulyakov · Jian Ren | N/A | Code |
| Where Is My Wallet? Modeling Object Proposal Sets for Egocentric Visual Query Localization | Mengmeng Xu · Yanghao Li · Cheng-Yang Fu · Bernard Ghanem · Tao Xiang · Juan-Manuel Pérez-Rúa | N/A | Code |
| DiffusionRig: Learning Personalized Priors for Facial Appearance Editing | Zheng Ding · Xuaner Zhang · Zhihao Xia · Lars Jebe · Zhuowen Tu · Xiuming Zhang | N/A | Code |
| Neural Scene Chronology | Haotong Lin · Qianqian Wang · Ruojin Cai · Sida Peng · Hadar Averbuch-Elor · Xiaowei Zhou · Noah Snavely | N/A | Code |
| Diversity-Aware Meta Visual Prompting | Qidong Huang · Xiaoyi Dong · Dongdong Chen · Weiming Zhang · Feifei Wang · Gang Hua · Nenghai Yu | N/A | Code |
| Privacy-Preserving Representations Are Not Enough: Recovering Scene Content From Camera Poses | Kunal Chelani · Torsten Sattler · Fredrik Kahl · Zuzana Kukelova | N/A | Code |
| Masked Jigsaw Puzzle: A Versatile Position Embedding for Vision Transformers | Bin Ren · Yahui Liu · Yue Song · Wei Bi · Rita Cucchiara · Nicu Sebe · Wei Wang | N/A | Code |
| Box-Level Active Detection | Mengyao Lyu · Jundong Zhou · Hui Chen · Yijie Huang · Dongdong Yu · Yaqian Li · Yandong Guo · Yuchen Guo · Liuyu Xiang · Guiguang Ding | N/A | Code |
| Unlearnable Clusters: Towards Label-Agnostic Unlearnable Examples | Jiaming Zhang · Xingjun Ma · Qi Yi · Jitao Sang · Yu-Gang Jiang · Yaowei Wang · Changsheng Xu | N/A | Code |
| Generalized Relation Modeling for Transformer Tracking | Shenyuan Gao · Chunluan Zhou · Jun Zhang | N/A | Code |
| MoFusion: A Framework for Denoising-Diffusion-Based Motion Synthesis | Rishabh Dabral · Muhammad Hamza Mughal · Vladislav Golyanik · Christian Theobalt | N/A | Code |
| Patch-Mix Transformer for Unsupervised Domain Adaptation: A Game Perspective | Jinjing Zhu · Haotian Bai · Lin Wang | N/A | Code |
| Distilling Neural Fields for Real-Time Articulated Shape Reconstruction | Jeff Tan · Gengshan Yang · Deva Ramanan | N/A | Code |
| Sampling Is Matter: Point-Guided 3D Human Mesh Reconstruction | Jeonghwan Kim · Mi-Gyeong Gwon · Hyunwoo Park · Hyukmin Kwon · Gi-Mun Um · Wonjun Kim | N/A | Code |
| Image Quality-Aware Diagnosis via Meta-Knowledge Co-Embedding | Haoxuan Che · Siyu Chen · Hao Chen | N/A | Code |
| Towards Practical Plug-and-Play Diffusion Models | Hyojun Go · Yunsung Lee · Jin-Young Kim · Seunghyun Lee · Myeongho Jeong · Hyun Seung Lee · Seungtaek Choi | N/A | Code |
| HRDFuse: Monocular 360° Depth Estimation by Collaboratively Learning Holistic-With-Regional Depth Distributions | Hao Ai · Zidong Cao · Yan-Pei Cao · Ying Shan · Lin Wang | N/A | Code |
| KERM: Knowledge Enhanced Reasoning for Vision-and-Language Navigation | Xiangyang Li · Zihan Wang · Jiahao Yang · Yaowei Wang · Shuqiang Jiang | N/A | Code |
| Tri-Perspective View for Vision-Based 3D Semantic Occupancy Prediction | Yuanhui Huang · Wenzhao Zheng · Yunpeng Zhang · Jie Zhou · Jiwen Lu | N/A | Code |
| EventNeRF: Neural Radiance Fields From a Single Colour Event Camera | Viktor Rudnev · Mohamed Elgharib · Christian Theobalt · Vladislav Golyanik | N/A | Code |
| Physically Realizable Natural-Looking Clothing Textures Evade Person Detectors via 3D Modeling | Zhanhao Hu · Wenda Chu · Xiaopei Zhu · Hui Zhang · Bo Zhang · Xiaolin Hu | N/A | Code |
| Global Vision Transformer Pruning With Hessian-Aware Saliency | Huanrui Yang · Hongxu Yin · Maying Shen · Pavlo Molchanov · Hai Li · Jan Kautz | N/A | Code |
| 3D Human Pose Estimation With Spatio-Temporal Criss-Cross Attention | Zhenhua Tang · Zhaofan Qiu · Yanbin Hao · Richang Hong · Ting Yao | N/A | Code |
| Learning Spatial-Temporal Implicit Neural Representations for Event-Guided Video Super-Resolution | Yunfan Lu · Zipeng Wang · Minjie Liu · Hongjian Wang · Lin Wang | N/A | Code |
| StyleGAN Salon: Multi-View Latent Optimization for Pose-Invariant Hairstyle Transfer | Sasikarn Khwanmuang · Pakkapon Phongthawee · Patsorn Sangkloy · Supasorn Suwajanakorn | N/A | Code |
| ShapeClipper: Scalable 3D Shape Learning From Single-View Images via Geometric and CLIP-Based Consistency | Zixuan Huang · Varun Jampani · Anh Thai · Yuanzhen Li · Stefan Stojanov · James M. Rehg | N/A | Code |
| Efficient Scale-Invariant Generator With Column-Row Entangled Pixel Synthesis | Thuan Hoang Nguyen · Thanh Van Le · Anh Tran | N/A | Code |
| Paired-Point Lifting for Enhanced Privacy-Preserving Visual Localization | Chunghwan Lee · Jaihoon Kim · Chanhyuk Yun · Je Hyeong Hong | N/A | Code |
| Both Style and Distortion Matter: Dual-Path Unsupervised Domain Adaptation for Panoramic Semantic Segmentation | Xu Zheng · Jinjing Zhu · Yexin Liu · Zidong Cao · Chong Fu · Lin Wang | N/A | Code |
| Adaptive Human Matting for Dynamic Videos | Chung-Ching Lin · Jiang Wang · Kun Luo · Kevin Lin · Linjie Li · Lijuan Wang · Zicheng Liu | N/A | Code |
| High-Fidelity Facial Avatar Reconstruction From Monocular Video With Generative Priors | Yunpeng Bai · Yanbo Fan · Xuan Wang · Yong Zhang · Jingxiang Sun · Chun Yuan · Ying Shan | N/A | Code |
| Data-Free Knowledge Distillation via Feature Exchange and Activation Region Constraint | Shikang Yu · Jiachen Chen · Hu Han · Shuqiang Jiang | N/A | Code |
| Im2Hands: Learning Attentive Implicit Representation of Interacting Two-Hand Shapes | Jihyun Lee · Minhyuk Sung · Honggyu Choi · Tae-Kyun Kim | N/A | Code |
| MD-VQA: Multi-Dimensional Quality Assessment for UGC Live Videos | Zicheng Zhang · Wei Wu · Wei Sun · Danyang Tu · Wei Lu · Xiongkuo Min · Ying Chen · Guangtao Zhai | N/A | Code |
| Make Landscape Flatter in Differentially Private Federated Learning | Yifan Shi · Yingqi Liu · Kang Wei · Li Shen · Xueqian Wang · Dacheng Tao | N/A | Code |
| A Large-Scale Robustness Analysis of Video Action Recognition Models | Madeline Chantry Schiappa · Naman Biyani · Prudvi Kamtam · Shruti Vyas · Hamid Palangi · Vibhav Vineet · Yogesh S. Rawat | N/A | Code |
| Multi-Concept Customization of Text-to-Image Diffusion | Nupur Kumari · Bingliang Zhang · Richard Zhang · Eli Shechtman · Jun-Yan Zhu | N/A | Code |
| GANHead: Towards Generative Animatable Neural Head Avatars | Sijing Wu · Yichao Yan · Yunhao Li · Yuhao Cheng · Wenhan Zhu · Ke Gao · Xiaobo Li · Guangtao Zhai | N/A | Code |
| Neural Koopman Pooling: Control-Inspired Temporal Dynamics Encoding for Skeleton-Based Action Recognition | Xinghan Wang · Xin Xu · Yadong Mu | N/A | Code |
| Hierarchical B-Frame Video Coding Using Two-Layer CANF Without Motion Coding | David Alexandre · Hsueh-Ming Hang · Wen-Hsiao Peng | N/A | Code |
| FeatER: An Efficient Network for Human Reconstruction via Feature Map-Based TransformER | Ce Zheng · Matias Mendieta · Taojiannan Yang · Guo-Jun Qi · Chen Chen | N/A | Code |
| Delivering Arbitrary-Modal Semantic Segmentation | Jiaming Zhang · Ruiping Liu · Hao Shi · Kailun Yang · Simon Reiß · Kunyu Peng · Haodong Fu · Kaiwei Wang · Rainer Stiefelhagen | N/A | Code |
| Deep Graph-Based Spatial Consistency for Robust Non-Rigid Point Cloud Registration | Zheng Qin · Hao Yu · Changjian Wang · Yuxing Peng · Kai Xu | N/A | Code |
| HumanGen: Generating Human Radiance Fields With Explicit Priors | Suyi Jiang · Haoran Jiang · Ziyu Wang · Haimin Luo · Wenzheng Chen · Lan Xu | N/A | Code |
| Boosting Accuracy and Robustness of Student Models via Adaptive Adversarial Distillation | Bo Huang · Mingyang Chen · Yi Wang · Junda Lu · Minhao Cheng · Wei Wang | N/A | Code |
| Plug-and-Play Diffusion Features for Text-Driven Image-to-Image Translation | Narek Tumanyan · Michal Geyer · Shai Bagon · Tali Dekel | N/A | Code |
| Rotation-Invariant Transformer for Point Cloud Matching | Hao Yu · Zheng Qin · Ji Hou · Mahdi Saleh · Dongsheng Li · Benjamin Busam · Slobodan Ilic | N/A | Code |
| CLIP2Scene: Towards Label-Efficient 3D Scene Understanding by CLIP | Runnan Chen · Youquan Liu · Lingdong Kong · Xinge Zhu · Yuexin Ma · Yikang Li · Yuenan Hou · Yu Qiao · Wenping Wang | N/A | Code |
| Real-Time 6K Image Rescaling With Rate-Distortion Optimization | Chenyang Qi · Xin Yang · Ka Leong Cheng · Ying-Cong Chen · Qifeng Chen | N/A | Code |
| Focused and Collaborative Feedback Integration for Interactive Image Segmentation | Qiaoqiao Wei · Hui Zhang · Jun-Hai Yong | N/A | Code |
| Language-Guided Music Recommendation for Video via Prompt Analogies | Daniel McKee · Justin Salamon · Josef Sivic · Bryan Russell | N/A | Code |
| TarViS: A Unified Approach for Target-Based Video Segmentation | Ali Athar · Alexander Hermans · Jonathon Luiten · Deva Ramanan · Bastian Leibe | N/A | Code |
| Meta-Personalizing Vision-Language Models To Find Named Instances in Video | Chun-Hsiao Yeh · Bryan Russell · Josef Sivic · Fabian Caba Heilbron · Simon Jenni | N/A | Code |
| ARKitTrack: A New Diverse Dataset for Tracking Using Mobile RGB-D Data | Haojie Zhao · Junsong Chen · Lijun Wang · Huchuan Lu | N/A | Code |
| Scaling Language-Image Pre-Training via Masking | Yanghao Li · Haoqi Fan · Ronghang Hu · Christoph Feichtenhofer · Kaiming He | N/A | Code |
| SeqTrack: Sequence to Sequence Learning for Visual Object Tracking | Xin Chen · Houwen Peng · Dong Wang · Huchuan Lu · Han Hu | N/A | Code |
| Learning Neural Parametric Head Models | Simon Giebenhain · Tobias Kirschstein · Markos Georgopoulos · Martin Rünz · Lourdes Agapito · Matthias Nießner | N/A | Code |
| L-CoIns: Language-Based Colorization With Instance Awareness | Zheng Chang · Shuchen Weng · Peixuan Zhang · Yu Li · Si Li · Boxin Shi | N/A | Code |
| Vid2Seq: Large-Scale Pretraining of a Visual Language Model for Dense Video Captioning | Antoine Yang · Arsha Nagrani · Paul Hongsuck Seo · Antoine Miech · Jordi Pont-Tuset · Ivan Laptev · Josef Sivic · Cordelia Schmid | N/A | Code |
| ULIP: Learning a Unified Representation of Language, Images, and Point Clouds for 3D Understanding | Le Xue · Mingfei Gao · Chen Xing · Roberto Martín-Martín · Jiajun Wu · Caiming Xiong · Ran Xu · Juan Carlos Niebles · Silvio Savarese | N/A | Code |
| GM-NeRF: Learning Generalizable Model-Based Neural Radiance Fields From Multi-View Images | Jianchuan Chen · Wentao Yi · Liqian Ma · Xu Jia · Huchuan Lu | N/A | Code |
| MIC: Masked Image Consistency for Context-Enhanced Domain Adaptation | Lukas Hoyer · Dengxin Dai · Haoran Wang · Luc Van Gool | N/A | Code |
| MED-VT: Multiscale Encoder-Decoder Video Transformer With Application To Object Segmentation | Rezaul Karim · He Zhao · Richard P. Wildes · Mennatullah Siam | N/A | Code |
| Hierarchical Dense Correlation Distillation for Few-Shot Segmentation | Bohao Peng · Zhuotao Tian · Xiaoyang Wu · Chengyao Wang · Shu Liu · Jingyong Su · Jiaya Jia | N/A | Code |
| Universal Instance Perception As Object Discovery and Retrieval | Bin Yan · Yi Jiang · Jiannan Wu · Dong Wang · Ping Luo · Zehuan Yuan · Huchuan Lu | N/A | Code |
| Bi-Directional Distribution Alignment for Transductive Zero-Shot Learning | Zhicai Wang · Yanbin Hao · Tingting Mu · Ouxiang Li · Shuo Wang · Xiangnan He | N/A | Code |
| Open-Vocabulary Semantic Segmentation With Mask-Adapted CLIP | Feng Liang · Bichen Wu · Xiaoliang Dai · Kunpeng Li · Yinan Zhao · Hang Zhang · Peizhao Zhang · Peter Vajda · Diana Marculescu | N/A | Code |
| ImageBind: One Embedding Space To Bind Them All | Rohit Girdhar · Alaaeldin El-Nouby · Zhuang Liu · Mannat Singh · Kalyan Vasudev Alwala · Armand Joulin · Ishan Misra | N/A | Code |
| Learning and Aggregating Lane Graphs for Urban Automated Driving | Martin Büchner · Jannik Zürn · Ion-George Todoran · Abhinav Valada · Wolfram Burgard | N/A | Code |
| High-Resolution Image Reconstruction With Latent Diffusion Models From Human Brain Activity | Yu Takagi · Shinji Nishimoto | N/A | Code |
| 3D Cinemagraphy From a Single Image | Xingyi Li · Zhiguo Cao · Huiqiang Sun · Jianming Zhang · Ke Xian · Guosheng Lin | N/A | Code |
| Understanding and Improving Visual Prompting: A Label-Mapping Perspective | Aochuan Chen · Yuguang Yao · Pin-Yu Chen · Yihua Zhang · Sijia Liu | N/A | Code |
| Cut and Learn for Unsupervised Object Detection and Instance Segmentation | Xudong Wang · Rohit Girdhar · Stella X. Yu · Ishan Misra | N/A | Code |
| DF-Platter: Multi-Face Heterogeneous Deepfake Dataset | Kartik Narayan · Harsh Agarwal · Kartik Thakral · Surbhi Mittal · Mayank Vatsa · Richa Singh | N/A | Code |
| BASiS: Batch Aligned Spectral Embedding Space | Or Streicher · Ido Cohen · Guy Gilboa | N/A | Code |
| Annealing-Based Label-Transfer Learning for Open World Object Detection | Yuqing Ma · Hainan Li · Zhange Zhang · Jinyang Guo · Shanghang Zhang · Ruihao Gong · Xianglong Liu | N/A | Code |
| Behind the Scenes: Density Fields for Single View Reconstruction | Felix Wimbauer · Nan Yang · Christian Rupprecht · Daniel Cremers | N/A | Code |
| Learning Video Representations From Large Language Models | Yue Zhao · Ishan Misra · Philipp Krähenbühl · Rohit Girdhar | N/A | Code |
| Quantum Multi-Model Fitting | Matteo Farina · Luca Magri · Willi Menapace · Elisa Ricci · Vladislav Golyanik · Federica Arrigoni | N/A | Code |
| Power Bundle Adjustment for Large-Scale 3D Reconstruction | Simon Weber · Nikolaus Demmel · Tin Chon Chan · Daniel Cremers | N/A | Code |
| Optimization-Inspired Cross-Attention Transformer for Compressive Sensing | Jiechong Song · Chong Mou · Shiqi Wang · Siwei Ma · Jian Zhang | N/A | Code |
| NeuMap: Neural Coordinate Mapping by Auto-Transdecoder for Camera Localization | Shitao Tang · Sicong Tang · Andrea Tagliasacchi · Ping Tan · Yasutaka Furukawa | N/A | Code |
| Back to the Source: Diffusion-Driven Adaptation To Test-Time Corruption | Jin Gao · Jialing Zhang · Xihui Liu · Trevor Darrell · Evan Shelhamer · Dequan Wang | N/A | Code |
| Learning Neural Duplex Radiance Fields for Real-Time View Synthesis | Ziyu Wan · Christian Richardt · Aljaž Božič · Chao Li · Vijay Rengarajan · Seonghyeon Nam · Xiaoyu Xiang · Tuotuo Li · Bo Zhu · Rakesh Ranjan · Jing Liao | N/A | Code |
| Object Pop-Up: Can We Infer 3D Objects and Their Poses From Human Interactions Alone? | Ilya A. Petrov · Riccardo Marin · Julian Chibane · Gerard Pons-Moll | N/A | Code |
| G-MSM: Unsupervised Multi-Shape Matching With Graph-Based Affinity Priors | Marvin Eisenberger · Aysim Toker · Laura Leal-Taixé · Daniel Cremers | N/A | Code |
| Data-Efficient Large Scale Place Recognition With Graded Similarity Supervision | María Leyva-Vallina · Nicola Strisciuglio · Nicolai Petkov | N/A | Code |
| Mapping Degeneration Meets Label Evolution: Learning Infrared Small Target Detection With Single Point Supervision | Xinyi Ying · Li Liu · Yingqian Wang · Ruojing Li · Nuo Chen · Zaiping Lin · Weidong Sheng · Shilin Zhou | N/A | Code |
| Instant Domain Augmentation for LiDAR Semantic Segmentation | Kwonyoung Ryu · Soonmin Hwang · Jaesik Park | N/A | Code |
| R2Former: Unified Retrieval and Reranking Transformer for Place Recognition | Sijie Zhu · Linjie Yang · Chen Chen · Mubarak Shah · Xiaohui Shen · Heng Wang | N/A | Code |
| Detecting and Grounding Multi-Modal Media Manipulation | Rui Shao · Tianxing Wu · Ziwei Liu | N/A | Code |
| Detecting Backdoors in Pre-Trained Encoders | Shiwei Feng · Guanhong Tao · Siyuan Cheng · Guangyu Shen · Xiangzhe Xu · Yingqi Liu · Kaiyuan Zhang · Shiqing Ma · Xiangyu Zhang | N/A | Code |
| Scaling Up GANs for Text-to-Image Synthesis | Minguk Kang · Jun-Yan Zhu · Richard Zhang · Jaesik Park · Eli Shechtman · Sylvain Paris · Taesung Park | N/A | Code |
| Dense-Localizing Audio-Visual Events in Untrimmed Videos: A Large-Scale Benchmark and Baseline | Tiantian Geng · Teng Wang · Jinming Duan · Runmin Cong · Feng Zheng | N/A | Code |
| PanoHead: Geometry-Aware 3D Full-Head Synthesis in 360° | Sizhe An · Hongyi Xu · Yichun Shi · Guoxian Song · Umit Y. Ogras · Linjie Luo | N/A | Code |
| Modality-Invariant Visual Odometry for Embodied Vision | Marius Memmel · Roman Bachmann · Amir Zamir | N/A | Code |
| 3D Video Loops From Asynchronous Input | Li Ma · Xiaoyu Li · Jing Liao · Pedro V. Sander | N/A | Code |
| Human-Art: A Versatile Human-Centric Dataset Bridging Natural and Artificial Scenes | Xuan Ju · Ailing Zeng · Jianan Wang · Qiang Xu · Lei Zhang | N/A | Code |
| PosterLayout: A New Benchmark and Approach for Content-Aware Visual-Textual Presentation Layout | Hsiao Yuan Hsu · Xiangteng He · Yuxin Peng · Hao Kong · Qing Zhang | N/A | Code |
| A Soma Segmentation Benchmark in Full Adult Fly Brain | Xiaoyu Liu · Bo Hu · Mingxing Li · Wei Huang · Yueyi Zhang · Zhiwei Xiong | N/A | Code |
| One-Stage 3D Whole-Body Mesh Recovery With Component Aware Transformer | Jing Lin · Ailing Zeng · Haoqian Wang · Lei Zhang · Yu Li | N/A | Code |
| Listening Human Behavior: 3D Human Pose Estimation With Acoustic Signals | Yuto Shibata · Yutaka Kawashima · Mariko Isogawa · Go Irie · Akisato Kimura · Yoshimitsu Aoki | N/A | Code |
| Hand Avatar: Free-Pose Hand Animation and Rendering From Monocular Video | Xingyu Chen · Baoyuan Wang · Heung-Yeung Shum | N/A | Code |
| M6Doc: A Large-Scale Multi-Format, Multi-Type, Multi-Layout, Multi-Language, Multi-Annotation Category Dataset for Modern Document Layout Analysis | Hiuyi Cheng · Peirong Zhang · Sihang Wu · Jiaxin Zhang · Qiyuan Zhu · Zecheng Xie · Jing Li · Kai Ding · Lianwen Jin | N/A | Code |
| Neural Congealing: Aligning Images to a Joint Semantic Atlas | Dolev Ofri-Amar · Michal Geyer · Yoni Kasten · Tali Dekel | N/A | Code |
| BoxTeacher: Exploring High-Quality Pseudo Labels for Weakly Supervised Instance Segmentation | Tianheng Cheng · Xinggang Wang · Shaoyu Chen · Qian Zhang · Wenyu Liu | N/A | Code |
| BEDLAM: A Synthetic Dataset of Bodies Exhibiting Detailed Lifelike Animated Motion | Michael J. Black · Priyanka Patel · Joachim Tesch · Jinlong Yang | N/A | Code |
| Mask DINO: Towards a Unified Transformer-Based Framework for Object Detection and Segmentation | Feng Li · Hao Zhang · Huaizhe Xu · Shilong Liu · Lei Zhang · Lionel M. Ni · Heung-Yeung Shum | N/A | Code |
| Learning Detailed Radiance Manifolds for High-Fidelity and 3D-Consistent Portrait Synthesis From Monocular Image | Yu Deng · Baoyuan Wang · Heung-Yeung Shum | N/A | Code |
| 3DAvatarGAN: Bridging Domains for Personalized Editable Avatars | Rameen Abdal · Hsin-Ying Lee · Peihao Zhu · Menglei Chai · Aliaksandr Siarohin · Peter Wonka · Sergey Tulyakov | N/A | Code |
| FLEX: Full-Body Grasping Without Full-Body Grasps | Purva Tendulkar · Dídac Surís · Carl Vondrick | N/A | Code |
| UDE: A Unified Driving Engine for Human Motion Generation | Zixiang Zhou · Baoyuan Wang | N/A | Code |
| Video Test-Time Adaptation for Action Recognition | Wei Lin · Muhammad Jehanzeb Mirza · Mateusz Kozinski · Horst Possegger · Hilde Kuehne · Horst Bischof | N/A | Code |
| Progressive Disentangled Representation Learning for Fine-Grained Controllable Talking Head Synthesis | Duomin Wang · Yu Deng · Zixin Yin · Heung-Yeung Shum · Baoyuan Wang | N/A | Code |
| MIME: Human-Aware 3D Scene Generation | Hongwei Yi · Chun-Hao P. Huang · Shashank Tripathi · Lea Hering · Justus Thies · Michael J. Black | N/A | Code |
| AstroNet: When Astrocyte Meets Artificial Neural Network | Mengqiao Han · Liyuan Pan · Xiabi Liu | N/A | Code |
| Stimulus Verification Is a Universal and Effective Sampler in Multi-Modal Human Trajectory Prediction | Jianhua Sun · Yuxuan Li · Liang Chai · Cewu Lu | N/A | Code |
| ActMAD: Activation Matching To Align Distributions for Test-Time-Training | Muhammad Jehanzeb Mirza · Pol Jané Soneira · Wei Lin · Mateusz Kozinski · Horst Possegger · Horst Bischof | N/A | Code |
| Visual Prompt Multi-Modal Tracking | Jiawen Zhu · Simiao Lai · Xin Chen · Dong Wang · Huchuan Lu | N/A | Code |
| Reconstructing Signing Avatars From Video Using Linguistic Priors | Maria-Paola Forte · Peter Kulits · Chun-Hao P. Huang · Vasileios Choutas · Dimitrios Tzionas · Katherine J. Kuchenbecker · Michael J. Black | N/A | Code |
| Patch-Based 3D Natural Scene Generation From a Single Example | Weiyu Li · Xuelin Chen · Jue Wang · Baoquan Chen | N/A | Code |
| Re-Basin via Implicit Sinkhorn Differentiation | Fidel A. Guerrero Peña · Heitor Rapela Medeiros · Thomas Dubail · Masih Aminbeidokhti · Eric Granger · Marco Pedersoli | N/A | Code |
| Slide-Transformer: Hierarchical Vision Transformer With Local Self-Attention | Xuran Pan · Tianzhu Ye · Zhuofan Xia · Shiji Song · Gao Huang | N/A | Code |
| Planning-Oriented Autonomous Driving | Yihan Hu · Jiazhi Yang · Li Chen · Keyu Li · Chonghao Sima · Xizhou Zhu · Siqi Chai · Senyao Du · Tianwei Lin · Wenhai Wang · Lewei Lu · Xiaosong Jia · Qiang Liu · Jifeng Dai · Yu Qiao · Hongyang Li | N/A | Code |
| Enhancing Deformable Local Features by Jointly Learning To Detect and Describe Keypoints | Guilherme Potje · Felipe Cadar · André Araujo · Renato Martins · Erickson R. Nascimento | N/A | Code |
| 3D Human Pose Estimation via Intuitive Physics | Shashank Tripathi · Lea Müller · Chun-Hao P. Huang · Omid Taheri · Michael J. Black · Dimitrios Tzionas | N/A | Code |
| Defending Against Patch-Based Backdoor Attacks on Self-Supervised Learning | Ajinkya Tejankar · Maziar Sanjabi · Qifan Wang · Sinong Wang · Hamed Firooz · Hamed Pirsiavash · Liang Tan | N/A | Code |
| PointCMP: Contrastive Mask Prediction for Self-Supervised Learning on Point Cloud Videos | Zhiqiang Shen · Xiaoxiao Sheng · Longguang Wang · Yulan Guo · Qiong Liu · Xi Zhou | N/A | Code |
| Blowing in the Wind: CycleNet for Human Cinemagraphs From Still Images | Hugo Bertiche · Niloy J. Mitra · Kuldeep Kulkarni · Chun-Hao P. Huang · Tuanfeng Y. Wang · Meysam Madadi · Sergio Escalera · Duygu Ceylan | N/A | Code |
| Multiple Instance Learning via Iterative Self-Paced Supervised Contrastive Learning | Kangning Liu · Weicheng Zhu · Yiqiu Shen · Sheng Liu · Narges Razavian · Krzysztof J. Geras · Carlos Fernandez-Granda | N/A | Code |
| Learning Steerable Function for Efficient Image Resampling | Jiacheng Li · Chang Chen · Wei Huang · Zhiqiang Lang · Fenglong Song · Youliang Yan · Zhiwei Xiong | N/A | Code |
| Deep Deterministic Uncertainty: A New Simple Baseline | Jishnu Mukhoti · Andreas Kirsch · Joost van Amersfoort · Philip H.S. Torr · Yarin Gal | N/A | Code |
| Removing Objects From Neural Radiance Fields | Silvan Weder · Guillermo Garcia-Hernando · Áron Monszpart · Marc Pollefeys · Gabriel J. Brostow · Michael Firman · Sara Vicente | N/A | Code |
| PartManip: Learning Cross-Category Generalizable Part Manipulation Policy From Point Cloud Observations | Haoran Geng · Ziming Li · Yiran Geng · Jiayi Chen · Hao Dong · He Wang | N/A | Code |
| T-SEA: Transfer-Based Self-Ensemble Attack on Object Detection | Hao Huang · Ziyan Chen · Huanran Chen · Yongtao Wang · Kevin Zhang | N/A | Code |
| DINN360: Deformable Invertible Neural Network for Latitude-Aware 360° Image Rescaling | Yichen Guo · Mai Xu · Lai Jiang · Leonid Sigal · Yunjin Chen | N/A | Code |
| Learning Human-to-Robot Handovers From Point Clouds | Sammy Christen · Wei Yang · Claudia Pérez-D’Arpino · Otmar Hilliges · Dieter Fox · Yu-Wei Chao | N/A | Code |
| Multi-View Azimuth Stereo via Tangent Space Consistency | Xu Cao · Hiroaki Santo · Fumio Okura · Yasuyuki Matsushita | N/A | Code |
| Mod-Squad: Designing Mixtures of Experts As Modular Multi-Task Learners | Zitian Chen · Yikang Shen · Mingyu Ding · Zhenfang Chen · Hengshuang Zhao · Erik G. Learned-Miller · Chuang Gan | N/A | Code |
| gSDF: Geometry-Driven Signed Distance Functions for 3D Hand-Object Reconstruction | Zerui Chen · Shizhe Chen · Cordelia Schmid · Ivan Laptev | N/A | Code |
| Delving StyleGAN Inversion for Image Editing: A Foundation Latent Space Viewpoint | Hongyu Liu · Yibing Song · Qifeng Chen | N/A | Code |
| Generative Bias for Robust Visual Question Answering | Jae Won Cho · Dong-Jin Kim · Hyeonggon Ryu · In So Kweon | N/A | Code |
| Backdoor Defense via Deconfounded Representation Learning | Zaixi Zhang · Qi Liu · Zhicai Wang · Zepu Lu · Qingyong Hu | N/A | Code |
| High-Fidelity 3D GAN Inversion by Pseudo-Multi-View Optimization | Jiaxin Xie · Hao Ouyang · Jingtan Piao · Chenyang Lei · Qifeng Chen | N/A | Code |
| Affordance Diffusion: Synthesizing Hand-Object Interactions | Yufei Ye · Xueting Li · Abhinav Gupta · Shalini De Mello · Stan Birchfield · Jiaming Song · Shubham Tulsiani · Sifei Liu | N/A | Code |
| Zero-Shot Pose Transfer for Unrigged Stylized 3D Characters | Jiashun Wang · Xueting Li · Sifei Liu · Shalini De Mello · Orazio Gallo · Xiaolong Wang · Jan Kautz | N/A | Code |
| Point Cloud Forecasting as a Proxy for 4D Occupancy Forecasting | Tarasha Khurana · Peiyun Hu · David Held · Deva Ramanan | N/A | Code |
| Are Data-Driven Explanations Robust Against Out-of-Distribution Data? | Tang Li · Fengchun Qiao · Mengmeng Ma · Xi Peng | N/A | Code |
| Multiscale Tensor Decomposition and Rendering Equation Encoding for View Synthesis | Kang Han · Wei Xiang | N/A | Code |
| Boosting Video Object Segmentation via Space-Time Correspondence Learning | Yurong Zhang · Liulei Li · Wenguan Wang · Rong Xie · Li Song · Wenjun Zhang | N/A | Code |
| X-Pruner: eXplainable Pruning for Vision Transformers | Lu Yu · Wei Xiang | N/A | Code |
| GazeNeRF: 3D-Aware Gaze Redirection With Neural Radiance Fields | Alessandro Ruzzi · Xiangwei Shi · Xi Wang · Gengyan Li · Shalini De Mello · Hyung Jin Chang · Xucong Zhang · Otmar Hilliges | N/A | Code |
| Real-Time Evaluation in Online Continual Learning: A New Hope | Yasir Ghunaim · Adel Bibi · Kumail Alhamoud · Motasem Alfarra · Hasan Abed Al Kader Hammoud · Ameya Prabhu · Philip H.S. Torr · Bernard Ghanem | N/A | Code |
| Contrastive Semi-Supervised Learning for Underwater Image Restoration via Reliable Bank | Shirui Huang · Keyan Wang · Huan Liu · Jun Chen · Yunsong Li | N/A | Code |
| A New Dataset Based on Images Taken by Blind People for Testing the Robustness of Image Classification Models Trained for ImageNet Categories | Reza Akbarian Bafghi · Danna Gurari | N/A | Code |
| Open-Vocabulary Panoptic Segmentation With Text-to-Image Diffusion Models | Jiarui Xu · Sifei Liu · Arash Vahdat · Wonmin Byeon · Xiaolong Wang · Shalini De Mello | N/A | Code |
| Reconstructing Animatable Categories From Videos | Gengshan Yang · Chaoyang Wang · N. Dinesh Reddy · Deva Ramanan | N/A | Code |
| Learning Visual Representations via Language-Guided Sampling | Mohamed El Banani · Karan Desai · Justin Johnson | N/A | Code |
| Four-View Geometry With Unknown Radial Distortion | Petr Hruby · Viktor Korotynskiy · Timothy Duff · Luke Oeding · Marc Pollefeys · Tomas Pajdla · Viktor Larsson | N/A | Code |
| DATID-3D: Diversity-Preserved Domain Adaptation Using Text-to-Image Diffusion for 3D Generative Model | Gwanghyun Kim · Se Young Chun | N/A | Code |
| ConZIC: Controllable Zero-Shot Image Captioning by Sampling-Based Polishing | Zequn Zeng · Hao Zhang · Ruiying Lu · Dongsheng Wang · Bo Chen · Zhengjue Wang | N/A | Code |
| Feature Separation and Recalibration for Adversarial Robustness | Woo Jae Kim · Yoonki Cho · Junsik Jung · Sung-Eui Yoon | N/A | Code |
| Event-Based Blurry Frame Interpolation Under Blind Exposure | Wenming Weng · Yueyi Zhang · Zhiwei Xiong | N/A | Code |
| MobileNeRF: Exploiting the Polygon Rasterization Pipeline for Efficient Neural Field Rendering on Mobile Architectures | Zhiqin Chen · Thomas Funkhouser · Peter Hedman · Andrea Tagliasacchi | N/A | Code |
| HandsOff: Labeled Dataset Generation With No Additional Human Annotations | Austin Xu · Mariya I. Vasileva · Achal Dave · Arjun Seshadri | N/A | Code |
| Analyzing and Diagnosing Pose Estimation With Attributions | Qiyuan He · Linlin Yang · Kerui Gu · Qiuxia Lin · Angela Yao | N/A | Code |
| Overcoming the Trade-Off Between Accuracy and Plausibility in 3D Hand Shape Reconstruction | Ziwei Yu · Chen Li · Linlin Yang · Xiaoxu Zheng · Michael Bi Mi · Gim Hee Lee · Angela Yao | N/A | Code |
| VIVE3D: Viewpoint-Independent Video Editing Using 3D-Aware GANs | Anna Frühstück · Nikolaos Sarafianos · Yuanlu Xu · Peter Wonka · Tony Tung | N/A | Code |
| Pruning Parameterization With Bi-Level Optimization for Efficient Semantic Segmentation on the Edge | Changdi Yang · Pu Zhao · Yanyu Li · Wei Niu · Jiexiong Guan · Hao Tang · Minghai Qin · Bin Ren · Xue Lin · Yanzhi Wang | N/A | Code |
| Nerflets: Local Radiance Fields for Efficient Structure-Aware 3D Scene Representation From 2D Supervision | Xiaoshuai Zhang · Abhijit Kundu · Thomas Funkhouser · Leonidas Guibas · Hao Su · Kyle Genova | N/A | Code |
| VDN-NeRF: Resolving Shape-Radiance Ambiguity via View-Dependence Normalization | Bingfan Zhu · Yanchao Yang · Xulong Wang · Youyi Zheng · Leonidas Guibas | N/A | Code |
| OpenScene: 3D Scene Understanding With Open Vocabularies | Songyou Peng · Kyle Genova · Chiyu “Max” Jiang · Andrea Tagliasacchi · Marc Pollefeys · Thomas Funkhouser | N/A | Code |
| A New Benchmark: On the Utility of Synthetic Data With Blender for Bare Supervised Learning and Downstream Domain Adaptation | Hui Tang · Kui Jia | N/A | Code |
| Implicit View-Time Interpolation of Stereo Videos Using Multi-Plane Disparities and Non-Uniform Coordinates | Avinash Paliwal · Andrii Tsarov · Nima Khademi Kalantari | N/A | Code |
| A Large-Scale Homography Benchmark | Daniel Barath · Dmytro Mishkin · Michal Polic · Wolfgang Förstner · Jiri Matas | N/A | Code |
| Glocal Energy-Based Learning for Few-Shot Open-Set Recognition | Haoyu Wang · Guansong Pang · Peng Wang · Lei Zhang · Wei Wei · Yanning Zhang | N/A | Code |
| MEDIC: Remove Model Backdoors via Importance Driven Cloning | Qiuling Xu · Guanhong Tao · Jean Honorio · Yingqi Liu · Shengwei An · Guangyu Shen · Siyuan Cheng · Xiangyu Zhang | N/A | Code |
| Finding Geometric Models by Clustering in the Consensus Space | Daniel Barath · Denys Rozumnyi · Ivan Eichhardt · Levente Hajder · Jiri Matas | N/A | Code |
| Imagic: Text-Based Real Image Editing With Diffusion Models | Bahjat Kawar · Shiran Zada · Oran Lang · Omer Tov · Huiwen Chang · Tali Dekel · Inbar Mosseri · Michal Irani | N/A | Code |
| DeepLSD: Line Segment Detection and Refinement With Deep Image Gradients | Rémi Pautrat · Daniel Barath · Viktor Larsson · Martin R. Oswald · Marc Pollefeys | N/A | Code |
| H2ONet: Hand-Occlusion-and-Orientation-Aware Network for Real-Time 3D Hand Mesh Reconstruction | Hao Xu · Tianyu Wang · Xiao Tang · Chi-Wing Fu | N/A | Code |
| Learning Weather-General and Weather-Specific Features for Image Restoration Under Multiple Adverse Weather Conditions | Yurui Zhu · Tianyu Wang · Xueyang Fu · Xuanyu Yang · Xin Guo · Jifeng Dai · Yu Qiao · Xiaowei Hu | N/A | Code |
| MoDi: Unconditional Motion Synthesis From Diverse Data | Sigal Raab · Inbal Leibovitch · Peizhuo Li · Kfir Aberman · Olga Sorkine-Hornung · Daniel Cohen-Or | N/A | Code |
| PC2: Projection-Conditioned Point Cloud Diffusion for Single-Image 3D Reconstruction | Luke Melas-Kyriazi · Christian Rupprecht · Andrea Vedaldi | N/A | Code |
| SliceMatch: Geometry-Guided Aggregation for Cross-View Pose Estimation | Zimin Xia · Ted Lentsch · Julian F. P. Kooij | N/A | Code |
| RealFusion: 360° Reconstruction of Any Object From a Single Image | Luke Melas-Kyriazi · Iro Laina · Christian Rupprecht · Andrea Vedaldi | N/A | Code |
| Masked and Adaptive Transformer for Exemplar Based Image Translation | Chang Jiang · Fei Gao · Biao Ma · Yuhao Lin · Nannan Wang · Gang Xu | N/A | Code |
| DynamicStereo: Consistent Dynamic Depth From Stereo Videos | Nikita Karaev · Ignacio Rocco · Benjamin Graham · Natalia Neverova · Andrea Vedaldi · Christian Rupprecht | N/A | Code |
| Masked Representation Learning for Domain Generalized Stereo Matching | Zhibo Rao · Bangshu Xiong · Mingyi He · Mochu Xiang · Renjie He · Zhelun Shen · Xing Li | N/A | Code |
| MV-JAR: Masked Voxel Jigsaw and Reconstruction for LiDAR-Based Self-Supervised Pre-Training | Runsen Xu · Tai Wang · Wenwei Zhang · Runjian Chen · Jinkun Cao · Jiangmiao Pang · Dahua Lin | N/A | Code |
| Fresnel Microfacet BRDF: Unification of Polari-Radiometric Surface-Body Reflection | Tomoki Ichikawa · Yoshiki Fukao · Shohei Nobuhara · Ko Nishino | N/A | Code |
| Instant Multi-View Head Capture Through Learnable Registration | Timo Bolkart · Tianye Li · Michael J. Black | N/A | Code |
| POEM: Reconstructing Hand in a Point Embedded Multi-View Stereo | Lixin Yang · Jian Xu · Licheng Zhong · Xinyu Zhan · Zhicheng Wang · Kejian Wu · Cewu Lu | N/A | Code |
| Diffusion-Based Generation, Optimization, and Planning in 3D Scenes | Siyuan Huang · Zan Wang · Puhao Li · Baoxiong Jia · Tengyu Liu · Yixin Zhu · Wei Liang · Song-Chun Zhu | N/A | Code |
| Visibility Constrained Wide-Band Illumination Spectrum Design for Seeing-in-the-Dark | Muyao Niu · Zhuoxiao Li · Zhihang Zhong · Yinqiang Zheng | N/A | Code |
| SketchXAI: A First Look at Explainability for Human Sketches | Zhiyu Qu · Yulia Gryaditskaya · Ke Li · Kaiyue Pang · Tao Xiang · Yi-Zhe Song | N/A | Code |
| TTA-COPE: Test-Time Adaptation for Category-Level Object Pose Estimation | Taeyeop Lee · Jonathan Tremblay · Valts Blukis · Bowen Wen · Byeong-Uk Lee · Inkyu Shin · Stan Birchfield · In So Kweon · Kuk-Jin Yoon | N/A | Code |
| Teleidoscopic Imaging System for Microscale 3D Shape Reconstruction | Ryo Kawahara · Meng-Yu Jennifer Kuo · Shohei Nobuhara | N/A | Code |
| Reliability in Semantic Segmentation: Are We on the Right Track? | Pau de Jorge · Riccardo Volpi · Philip H.S. Torr · Grégory Rogez | N/A | Code |
| SMPConv: Self-Moving Point Representations for Continuous Convolution | Sanghyeon Kim · Eunbyung Park | N/A | Code |
| Few-Shot Geometry-Aware Keypoint Localization | Xingzhe He · Gaurav Bharaj · David Ferman · Helge Rhodin · Pablo Garrido | N/A | Code |
| STMT: A Spatial-Temporal Mesh Transformer for MoCap-Based Action Recognition | Xiaoyu Zhu · Po-Yao Huang · Junwei Liang · Celso M. de Melo · Alexander G. Hauptmann | N/A | Code |
| Knowledge Combination To Learn Rotated Detection Without Rotated Annotation | Tianyu Zhu · Bryce Ferenczi · Pulak Purkait · Tom Drummond · Hamid Rezatofighi · Anton van den Hengel | N/A | Code |
| OTAvatar: One-Shot Talking Face Avatar With Controllable Tri-Plane Rendering | Zhiyuan Ma · Xiangyu Zhu · Guo-Jun Qi · Zhen Lei · Lei Zhang | N/A | Code |
| Supervised Masked Knowledge Distillation for Few-Shot Transformers | Han Lin · Guangxing Han · Jiawei Ma · Shiyuan Huang · Xudong Lin · Shih-Fu Chang | N/A | Code |
| Learning Open-Vocabulary Semantic Segmentation Models From Natural Language Supervision | Jilan Xu · Junlin Hou · Yuejie Zhang · Rui Feng · Yi Wang · Yu Qiao · Weidi Xie | N/A | Code |
| ImageNet-E: Benchmarking Neural Network Robustness via Attribute Editing | Xiaodan Li · Yuefeng Chen · Yao Zhu · Shuhui Wang · Rong Zhang · Hui Xue | N/A | Code |
| Neural Residual Radiance Fields for Streamably Free-Viewpoint Videos | Liao Wang · Qiang Hu · Qihan He · Ziyu Wang · Jingyi Yu · Tinne Tuytelaars · Lan Xu · Minye Wu | N/A | Code |
| Active Finetuning: Exploiting Annotation Budget in the Pretraining-Finetuning Paradigm | Yichen Xie · Han Lu · Junchi Yan · Xiaokang Yang · Masayoshi Tomizuka · Wei Zhan | N/A | Code |
| Tensor4D: Efficient Neural 4D Decomposition for High-Fidelity Dynamic Reconstruction and Rendering | Ruizhi Shao · Zerong Zheng · Hanzhang Tu · Boning Liu · Hongwen Zhang · Yebin Liu | N/A | Code |
| RiDDLE: Reversible and Diversified De-Identification With Latent Encryptor | Dongze Li · Wei Wang · Kang Zhao · Jing Dong · Tieniu Tan | N/A | Code |
| RobustNeRF: Ignoring Distractors With Robust Losses | Sara Sabour · Suhani Vora · Daniel Duckworth · Ivan Krasin · David J. Fleet · Andrea Tagliasacchi | N/A | Code |
| Bitstream-Corrupted JPEG Images Are Restorable: Two-Stage Compensation and Alignment Framework for Image Restoration | Wenyang Liu · Yi Wang · Kim-Hui Yap · Lap-Pui Chau | N/A | Code |
| HierVL: Learning Hierarchical Video-Language Embeddings | Kumar Ashutosh · Rohit Girdhar · Lorenzo Torresani · Kristen Grauman | N/A | Code |
| Phone2Proc: Bringing Robust Robots Into Our Chaotic World | Matt Deitke · Rose Hendrix · Ali Farhadi · Kiana Ehsani · Aniruddha Kembhavi | N/A | Code |
| A Light Touch Approach to Teaching Transformers Multi-View Geometry | Yash Bhalgat · João F. Henriques · Andrew Zisserman | N/A | Code |
| Clothed Human Performance Capture With a Double-Layer Neural Radiance Fields | Kangkan Wang · Guofeng Zhang · Suxu Cong · Jian Yang | N/A | Code |
| AutoFocusFormer: Image Segmentation off the Grid | Chen Ziwen · Kaushik Patnaik · Shuangfei Zhai · Alvin Wan · Zhile Ren · Alexander G. Schwing · Alex Colburn · Li Fuxin | N/A | Code |
| Trace and Pace: Controllable Pedestrian Animation via Guided Trajectory Diffusion | Davis Rempe · Zhengyi Luo · Xue Bin Peng · Ye Yuan · Kris Kitani · Karsten Kreis · Sanja Fidler · Or Litany | N/A | Code |
| Observation-Centric SORT: Rethinking SORT for Robust Multi-Object Tracking | Jinkun Cao · Jiangmiao Pang · Xinshuo Weng · Rawal Khirodkar · Kris Kitani | N/A | Code |
| Spider GAN: Leveraging Friendly Neighbors To Accelerate GAN Training | Siddarth Asokan · Chandra Sekhar Seelamantula | N/A | Code |
| Minimizing the Accumulated Trajectory Error To Improve Dataset Distillation | Jiawei Du · Yidi Jiang · Vincent Y. F. Tan · Joey Tianyi Zhou · Haizhou Li | N/A | Code |
| Adaptive Patch Deformation for Textureless-Resilient Multi-View Stereo | Yuesong Wang · Zhaojie Zeng · Tao Guan · Wei Yang · Zhuo Chen · Wenkai Liu · Luoyuan Xu · Yawei Luo | N/A | Code |
| Learning Correspondence Uncertainty via Differentiable Nonlinear Least Squares | Dominik Muhle · Lukas Koestler · Krishna Murthy Jatavallabhula · Daniel Cremers | N/A | Code |
| Learning Anchor Transformations for 3D Garment Animation | Fang Zhao · Zekun Li · Shaoli Huang · Junwu Weng · Tianfei Zhou · Guo-Sen Xie · Jue Wang · Ying Shan | N/A | Code |
| PyPose: A Library for Robot Learning With Physics-Based Optimization | Chen Wang · Dasong Gao · Kuan Xu · Junyi Geng · Yaoyu Hu · Yuheng Qiu · Bowen Li · Fan Yang · Brady Moon · Abhinav Pandey · Aryan · Jiahe Xu · Tianhao Wu · Haonan He · Daning Huang · Zhongqiang Ren · Shibo Zhao · Taimeng Fu · Pranay Reddy · Xiao Lin · Wenshan Wang · Jingnan Shi · Rajat Talak · Kun Cao · Yi Du · Han Wang · Huai Yu · Shanzhao Wang · Siyu Chen · Ananth Kashyap · Rohan Bandaru · Karthik Dantu · Jiajun Wu · Lihua Xie · Luca Carlone · Marco Hutter · Sebastian Scherer | N/A | Code |
| Unicode Analogies: An Anti-Objectivist Visual Reasoning Challenge | Steven Spratley · Krista A. Ehinger · Tim Miller | N/A | Code |
| DyNCA: Real-Time Dynamic Texture Synthesis Using Neural Cellular Automata | Ehsan Pajouheshgar · Yitao Xu · Tong Zhang · Sabine Süsstrunk | N/A | Code |
| Learning Generative Structure Prior for Blind Text Image Super-Resolution | Xiaoming Li · Wangmeng Zuo · Chen Change Loy | N/A | Code |
| CAMS: CAnonicalized Manipulation Spaces for Category-Level Functional Hand-Object Manipulation Synthesis | Juntian Zheng · Qingyuan Zheng · Lixing Fang · Yun Liu · Li Yi | N/A | Code |
| SCPNet: Semantic Scene Completion on Point Cloud | Zhaoyang Xia · Youquan Liu · Xin Li · Xinge Zhu · Yuexin Ma · Yikang Li · Yuenan Hou · Yu Qiao | N/A | Code |
| AMT: All-Pairs Multi-Field Transforms for Efficient Frame Interpolation | Zhen Li · Zuo-Liang Zhu · Ling-Hao Han · Qibin Hou · Chun-Le Guo · Ming-Ming Cheng | N/A | Code |
| Behavioral Analysis of Vision-and-Language Navigation Agents | Zijiao Yang · Arjun Majumdar · Stefan Lee | N/A | Code |
| Geometry and Uncertainty-Aware 3D Point Cloud Class-Incremental Semantic Segmentation | Yuwei Yang · Munawar Hayat · Zhao Jin · Chao Ren · Yinjie Lei | N/A | Code |
| Directional Connectivity-Based Segmentation of Medical Images | Ziyun Yang · Sina Farsiu | N/A | Code |
| ScanDMM: A Deep Markov Model of Scanpath Prediction for 360° Images | Xiangjie Sui · Yuming Fang · Hanwei Zhu · Shiqi Wang · Zhou Wang | N/A | Code |
| 3D Shape Reconstruction of Semi-Transparent Worms | Thomas P. Ilett · Omer Yuval · Thomas Ranner · Netta Cohen · David C. Hogg | N/A | Code |
| Patch-Craft Self-Supervised Training for Correlated Image Denoising | Gregory Vaksman · Michael Elad | N/A | Code |
| NeAT: Learning Neural Implicit Surfaces With Arbitrary Topologies From Multi-View Images | Xiaoxu Meng · Weikai Chen · Bo Yang | N/A | Code |
| DANI-Net: Uncalibrated Photometric Stereo by Differentiable Shadow Handling, Anisotropic Reflectance Modeling, and Neural Inverse Rendering | Zongrui Li · Qian Zheng · Boxin Shi · Gang Pan · Xudong Jiang | N/A | Code |
| Context-Aware Alignment and Mutual Masking for 3D-Language Pre-Training | Zhao Jin · Munawar Hayat · Yuwei Yang · Yulan Guo · Yinjie Lei | N/A | Code |
| Unsupervised Object Localization: Observing the Background To Discover Objects | Oriane Siméoni · Chloé Sekkat · Gilles Puy · Antonín Vobecký · Éloi Zablocki · Patrick Pérez | N/A | Code |
| Bootstrap Your Own Prior: Towards Distribution-Agnostic Novel Class Discovery | Muli Yang · Liancheng Wang · Cheng Deng · Hanwang Zhang | N/A | Code |
| Self-Supervised Geometry-Aware Encoder for Style-Based 3D GAN Inversion | Yushi Lan · Xuyi Meng · Shuai Yang · Chen Change Loy · Bo Dai | N/A | Code |
| NeuralField-LDM: Scene Generation With Hierarchical Latent Diffusion Models | Seung Wook Kim · Bradley Brown · Kangxue Yin · Karsten Kreis · Katja Schwarz · Daiqing Li · Robin Rombach · Antonio Torralba · Sanja Fidler | N/A | Code |
| ALSO: Automotive Lidar Self-Supervision by Occupancy Estimation | Alexandre Boulch · Corentin Sautier · Björn Michele · Gilles Puy · Renaud Marlet | N/A | Code |
| RepMode: Learning to Re-Parameterize Diverse Experts for Subcellular Structure Prediction | Donghao Zhou · Chunbin Gu · Junde Xu · Furui Liu · Qiong Wang · Guangyong Chen · Pheng-Ann Heng | N/A | Code |
| Aligning Bag of Regions for Open-Vocabulary Object Detection | Size Wu · Wenwei Zhang · Sheng Jin · Wentao Liu · Chen Change Loy | N/A | Code |
| Neuralangelo: High-Fidelity Neural Surface Reconstruction | Zhaoshuo Li · Thomas Müller · Alex Evans · Russell H. Taylor · Mathias Unberath · Ming-Yu Liu · Chen-Hsuan Lin | N/A | Code |
| PCT-Net: Full Resolution Image Harmonization Using Pixel-Wise Color Transformations | Julian Jorge Andrade Guerreiro · Mitsuru Nakazawa · Björn Stenger | N/A | Code |
| PaCa-ViT: Learning Patch-to-Cluster Attention in Vision Transformers | Ryan Grainger · Thomas Paniagua · Xi Song · Naresh Cuntoor · Mun Wai Lee · Tianfu Wu | N/A | Code |
| Towards Better Stability and Adaptability: Improve Online Self-Training for Model Adaptation in Semantic Segmentation | Dong Zhao · Shuang Wang · Qi Zang · Dou Quan · Xiutiao Ye · Licheng Jiao | N/A | Code |
| MEGANE: Morphable Eyeglass and Avatar Network | Junxuan Li · Shunsuke Saito · Tomas Simon · Stephen Lombardi · Hongdong Li · Jason Saragih | N/A | Code |
| Generalizable Implicit Neural Representations via Instance Pattern Composers | Chiheon Kim · Doyup Lee · Saehoon Kim · Minsu Cho · Wook-Shin Han | N/A | Code |
| Revisiting Rolling Shutter Bundle Adjustment: Toward Accurate and Fast Solution | Bangyan Liao · Delin Qu · Yifei Xue · Huiqing Zhang · Yizhen Lao | N/A | Code |
| Distribution Shift Inversion for Out-of-Distribution Prediction | Runpeng Yu · Songhua Liu · Xingyi Yang · Xinchao Wang | N/A | Code |
| Wide-Angle Rectification via Content-Aware Conformal Mapping | Qi Zhang · Hongdong Li · Qing Wang | N/A | Code |
| WildLight: In-the-Wild Inverse Rendering With a Flashlight | Ziang Cheng · Junxuan Li · Hongdong Li | N/A | Code |
| Physics-Driven Diffusion Models for Impact Sound Synthesis From Videos | Kun Su · Kaizhi Qian · Eli Shlizerman · Antonio Torralba · Chuang Gan | N/A | Code |
| Probing Neural Representations of Scene Perception in a Hippocampally Dependent Task Using Artificial Neural Networks | Markus Frey · Christian F. Doeller · Caswell Barry | N/A | Code |
| Inverting the Imaging Process by Learning an Implicit Camera Model | Xin Huang · Qi Zhang · Ying Feng · Hongdong Li · Qing Wang | N/A | Code |
| EC2: Emergent Communication for Embodied Control | Yao Mu · Shunyu Yao · Mingyu Ding · Ping Luo · Chuang Gan | N/A | Code |
| Light Source Separation and Intrinsic Image Decomposition Under AC Illumination | Yusaku Yoshida · Ryo Kawahara · Takahiro Okabe | N/A | Code |
| FREDOM: Fairness Domain Adaptation Approach to Semantic Scene Understanding | Thanh-Dat Truong · Ngan Le · Bhiksha Raj · Jackson Cothren · Khoa Luu | N/A | Code |
| Learning Locally Editable Virtual Humans | Hsuan-I Ho · Lixin Xue · Jie Song · Otmar Hilliges | N/A | Code |
| Open-Vocabulary Point-Cloud Object Detection Without 3D Annotation | Yuheng Lu · Chenfeng Xu · Xiaobao Wei · Xiaodong Xie · Masayoshi Tomizuka · Kurt Keutzer · Shanghang Zhang | N/A | Code |
| Frustratingly Easy Regularization on Representation Can Boost Deep Reinforcement Learning | Qiang He · Huangyuan Su · Jieyu Zhang · Xinwen Hou | N/A | Code |
| PiMAE: Point Cloud and Image Interactive Masked Autoencoders for 3D Object Detection | Anthony Chen · Kevin Zhang · Renrui Zhang · Zihan Wang · Yuheng Lu · Yandong Guo · Shanghang Zhang | N/A | Code |
| OrienterNet: Visual Localization in 2D Public Maps With Neural Matching | Paul-Edouard Sarlin · Daniel DeTone · Tsun-Yi Yang · Armen Avetisyan · Julian Straub · Tomasz Malisiewicz · Samuel Rota Bulò · Richard Newcombe · Peter Kontschieder · Vasileios Balntas | N/A | Code |
| Class Relationship Embedded Learning for Source-Free Unsupervised Domain Adaptation | Yixin Zhang · Zilei Wang · Weinan He | N/A | Code |
| Efficient Movie Scene Detection Using State-Space Transformers | Md Mohaiminul Islam · Mahmudul Hasan · Kishan Shamsundar Athrey · Tony Braskich · Gedas Bertasius | N/A | Code |
| Structural Multiplane Image: Bridging Neural View Synthesis and 3D Reconstruction | Mingfang Zhang · Jinglu Wang · Xiao Li · Yifei Huang · Yoichi Sato · Yan Lu | N/A | Code |
| FAME-ViL: Multi-Tasking Vision-Language Model for Heterogeneous Fashion Tasks | Xiao Han · Xiatian Zhu · Licheng Yu · Li Zhang · Yi-Zhe Song · Tao Xiang | N/A | Code |
| Understanding and Constructing Latent Modality Structures in Multi-Modal Representation Learning | Qian Jiang · Changyou Chen · Han Zhao · Liqun Chen · Qing Ping · Son Dinh Tran · Yi Xu · Belinda Zeng · Trishul Chilimbi | N/A | Code |
| Level-S$^2$fM: Structure From Motion on Neural Level Set of Implicit Surfaces | Yuxi Xiao · Nan Xue · Tianfu Wu · Gui-Song Xia | N/A | Code |
| Consistent-Teacher: Towards Reducing Inconsistent Pseudo-Targets in Semi-Supervised Object Detection | Xinjiang Wang · Xingyi Yang · Shilong Zhang · Yijiang Li · Litong Feng · Shijie Fang · Chengqi Lyu · Kai Chen · Wayne Zhang | N/A | Code |
| Dense Distinct Query for End-to-End Object Detection | Shilong Zhang · Xinjiang Wang · Jiaqi Wang · Jiangmiao Pang · Chengqi Lyu · Wenwei Zhang · Ping Luo · Kai Chen | N/A | Code |
| ARCTIC: A Dataset for Dexterous Bimanual Hand-Object Manipulation | Zicong Fan · Omid Taheri · Dimitrios Tzionas · Muhammed Kocabas · Manuel Kaufmann · Michael J. Black · Otmar Hilliges | N/A | Code |
| BiFormer: Vision Transformer With Bi-Level Routing Attention | Lei Zhu · Xinjiang Wang · Zhanghan Ke · Wayne Zhang · Rynson W.H. Lau | N/A | Code |
| Hierarchical Video-Moment Retrieval and Step-Captioning | Abhay Zala · Jaemin Cho · Satwik Kottur · Xilun Chen · Barlas Oguz · Yashar Mehdad · Mohit Bansal | N/A | Code |
| Progressive Open Space Expansion for Open-Set Model Attribution | Tianyun Yang · Danding Wang · Fan Tang · Xinying Zhao · Juan Cao · Sheng Tang | N/A | Code |
| Deep Depth Estimation From Thermal Image | Ukcheol Shin · Jinsun Park · In So Kweon | N/A | Code |
| Incremental 3D Semantic Scene Graph Prediction From RGB Sequences | Shun-Cheng Wu · Keisuke Tateno · Nassir Navab · Federico Tombari | N/A | Code |
| Visual Programming: Compositional Visual Reasoning Without Training | Tanmay Gupta · Aniruddha Kembhavi | N/A | Code |
| Change-Aware Sampling and Contrastive Learning for Satellite Images | Utkarsh Mall · Bharath Hariharan · Kavita Bala | N/A | Code |
| NULL-Text Inversion for Editing Real Images Using Guided Diffusion Models | Ron Mokady · Amir Hertz · Kfir Aberman · Yael Pritch · Daniel Cohen-Or | N/A | Code |
| RIDCP: Revitalizing Real Image Dehazing via High-Quality Codebook Priors | Rui-Qi Wu · Zheng-Peng Duan · Chun-Le Guo · Zhi Chai · Chongyi Li | N/A | Code |
| Neural Part Priors: Learning To Optimize Part-Based Object Completion in RGB-D Scans | Aleksei Bokhovkin · Angela Dai | N/A | Code |
| Hierarchical Discriminative Learning Improves Visual Representations of Biomedical Microscopy | Cheng Jiang · Xinhai Hou · Akhil Kondepudi · Asadur Chowdury · Christian W. Freudiger · Daniel A. Orringer · Honglak Lee · Todd C. Hollon | N/A | Code |
| Domain Expansion of Image Generators | Yotam Nitzan · Michaël Gharbi · Richard Zhang · Taesung Park · Jun-Yan Zhu · Daniel Cohen-Or · Eli Shechtman | N/A | Code |
| “Seeing” Electric Network Frequency From Events | Lexuan Xu · Guang Hua · Haijian Zhang · Lei Yu · Ning Qiao | N/A | Code |
| MetaFusion: Infrared and Visible Image Fusion via Meta-Feature Embedding From Object Detection | Wenda Zhao · Shigeng Xie · Fan Zhao · You He · Huchuan Lu | N/A | Code |
| Adaptive Spot-Guided Transformer for Consistent Local Feature Matching | Jiahuan Yu · Jiahao Chang · Jianfeng He · Tianzhu Zhang · Jiyang Yu · Feng Wu | N/A | Code |
| SE-ORNet: Self-Ensembling Orientation-Aware Network for Unsupervised Point Cloud Shape Correspondence | Jiacheng Deng · Chuxin Wang · Jiahao Lu · Jianfeng He · Tianzhu Zhang · Jiyang Yu · Zhe Zhang | N/A | Code |
| Dynamic Coarse-To-Fine Learning for Oriented Tiny Object Detection | Chang Xu · Jian Ding · Jinwang Wang · Wen Yang · Huai Yu · Lei Yu · Gui-Song Xia | N/A | Code |
| Out-of-Distributed Semantic Pruning for Robust Semi-Supervised Learning | Yu Wang · Pengchong Qiao · Chang Liu · Guoli Song · Xiawu Zheng · Jie Chen | N/A | Code |
| Seeing With Sound: Long-range Acoustic Beamforming for Multimodal Scene Understanding | Praneeth Chakravarthula · Jim Aldon D’Souza · Ethan Tseng · Joe Bartusek · Felix Heide | N/A | Code |
| DNF: Decouple and Feedback Network for Seeing in the Dark | Xin Jin · Ling-Hao Han · Zhen Li · Chun-Le Guo · Zhi Chai · Chongyi Li | N/A | Code |
| CoWs on Pasture: Baselines and Benchmarks for Language-Driven Zero-Shot Object Navigation | Samir Yitzhak Gadre · Mitchell Wortsman · Gabriel Ilharco · Ludwig Schmidt · Shuran Song | N/A | Code |
| NVTC: Nonlinear Vector Transform Coding | Runsen Feng · Zongyu Guo · Weiping Li · Zhibo Chen | N/A | Code |
| Towards Unified Scene Text Spotting Based on Sequence Generation | Taeho Kil · Seonghyeon Kim · Sukmin Seo · Yoonsik Kim · Daehee Kim | N/A | Code |
| Tell Me What Happened: Unifying Text-Guided Video Completion via Multimodal Masked Video Generation | Tsu-Jui Fu · Licheng Yu · Ning Zhang · Cheng-Yang Fu · Jong-Chyi Su · William Yang Wang · Sean Bell | N/A | Code |
| Fuzzy Positive Learning for Semi-Supervised Semantic Segmentation | Pengchong Qiao · Zhidan Wei · Yu Wang · Zhennan Wang · Guoli Song · Fan Xu · Xiangyang Ji · Chang Liu · Jie Chen | N/A | Code |
| Progressively Optimized Local Radiance Fields for Robust View Synthesis | Andréas Meuleman · Yu-Lun Liu · Chen Gao · Jia-Bin Huang · Changil Kim · Min H. Kim · Johannes Kopf | N/A | Code |
| Neural Map Prior for Autonomous Driving | Xuan Xiong · Yicheng Liu · Tianyuan Yuan · Yue Wang · Yilun Wang · Hang Zhao | N/A | Code |
| Efficient and Explicit Modelling of Image Hierarchies for Image Restoration | Yawei Li · Yuchen Fan · Xiaoyu Xiang · Denis Demandolx · Rakesh Ranjan · Radu Timofte · Luc Van Gool | N/A | Code |
| F2-NeRF: Fast Neural Radiance Field Training With Free Camera Trajectories | Peng Wang · Yuan Liu · Zhaoxi Chen · Lingjie Liu · Ziwei Liu · Taku Komura · Christian Theobalt · Wenping Wang | N/A | Code |
| Procedure-Aware Pretraining for Instructional Video Understanding | Honglu Zhou · Roberto Martín-Martín · Mubbasir Kapadia · Silvio Savarese · Juan Carlos Niebles | N/A | Code |
| High-Fidelity Guided Image Synthesis With Latent Diffusion Models | Jaskirat Singh · Stephen Gould · Liang Zheng | N/A | Code |
| Progressive Random Convolutions for Single Domain Generalization | Seokeon Choi · Debasmit Das · Sungha Choi · Seunghan Yang · Hyunsin Park · Sungrack Yun | N/A | Code |
| EcoTTA: Memory-Efficient Continual Test-Time Adaptation via Self-Distilled Regularization | Junha Song · Jungsoo Lee · In So Kweon · Sungha Choi | N/A | Code |
| NoPe-NeRF: Optimising Neural Radiance Field With No Pose Prior | Wenjing Bian · Zirui Wang · Kejie Li · Jia-Wang Bian · Victor Adrian Prisacariu | N/A | Code |
| GrowSP: Unsupervised Semantic Segmentation of 3D Point Clouds | Zihui Zhang · Bo Yang · Bing Wang · Bo Li | N/A | Code |
| Multi-Mode Online Knowledge Distillation for Self-Supervised Visual Representation Learning | Kaiyou Song · Jin Xie · Shan Zhang · Zimeng Luo | N/A | Code |
| Aligning Step-by-Step Instructional Diagrams to Video Demonstrations | Jiahao Zhang · Anoop Cherian · Yanbin Liu · Yizhak Ben-Shabat · Cristian Rodriguez · Stephen Gould | N/A | Code |
| ESLAM: Efficient Dense SLAM System Based on Hybrid Representation of Signed Distance Fields | Mohammad Mahdi Johari · Camilla Carta · François Fleuret | N/A | Code |
| AutoRecon: Automated 3D Object Discovery and Reconstruction | Yuang Wang · Xingyi He · Sida Peng · Haotong Lin · Hujun Bao · Xiaowei Zhou | N/A | Code |
| Ultra-High Resolution Segmentation With Ultra-Rich Context: A Novel Benchmark | Deyi Ji · Feng Zhao · Hongtao Lu · Mingyuan Tao · Jieping Ye | N/A | Code |
| NeUDF: Leaning Neural Unsigned Distance Fields With Volume Rendering | Yu-Tao Liu · Li Wang · Jie Yang · Weikai Chen · Xiaoxu Meng · Bo Yang · Lin Gao | N/A | Code |
| Improving Cross-Modal Retrieval With Set of Diverse Embeddings | Dongwon Kim · Namyup Kim · Suha Kwak | N/A | Code |
| An Image Quality Assessment Dataset for Portraits | Nicolas Chahine · Stefania Calarasanu · Davide Garcia-Civiero · Théo Cayla · Sira Ferradans · Jean Ponce | N/A | Code |
| Weakly Supervised Semantic Segmentation via Adversarial Learning of Classifier and Reconstructor | Hyeokjun Kweon · Sung-Hoon Yoon · Kuk-Jin Yoon | N/A | Code |
| NeRFLix: High-Quality Neural View Synthesis by Learning a Degradation-Driven Inter-Viewpoint MiXer | Kun Zhou · Wenbo Li · Yi Wang · Tao Hu · Nianjuan Jiang · Xiaoguang Han · Jiangbo Lu | N/A | Code |
| ShapeTalk: A Language Dataset and Framework for 3D Shape Edits and Deformations | Panos Achlioptas · Ian Huang · Minhyuk Sung · Sergey Tulyakov · Leonidas Guibas | N/A | Code |
| RelightableHands: Efficient Neural Relighting of Articulated Hand Models | Shun Iwase · Shunsuke Saito · Tomas Simon · Stephen Lombardi · Timur Bagautdinov · Rohan Joshi · Fabian Prada · Takaaki Shiratori · Yaser Sheikh · Jason Saragih | N/A | Code |
| VL-SAT: Visual-Linguistic Semantics Assisted Training for 3D Semantic Scene Graph Prediction in Point Cloud | Ziqin Wang · Bowen Cheng · Lichen Zhao · Dong Xu · Yang Tang · Lu Sheng | N/A | Code |
| MVImgNet: A Large-Scale Dataset of Multi-View Images | Xianggang Yu · Mutian Xu · Yidan Zhang · Haolin Liu · Chongjie Ye · Yushuang Wu · Zizheng Yan · Chenming Zhu · Zhangyang Xiong · Tianyou Liang · Guanying Chen · Shuguang Cui · Xiaoguang Han | N/A | Code |
| MM-3DScene: 3D Scene Understanding by Customizing Masked Modeling With Informative-Preserved Reconstruction and Self-Distilled Consistency | Mingye Xu · Mutian Xu · Tong He · Wanli Ouyang · Yali Wang · Xiaoguang Han · Yu Qiao | N/A | Code |
| Self-Guided Diffusion Models | Vincent Tao Hu · David W. Zhang · Yuki M. Asano · Gertjan J. Burghouts · Cees G. M. Snoek | N/A | Code |
| REC-MV: REconstructing 3D Dynamic Cloth From Monocular Videos | Lingteng Qiu · Guanying Chen · Jiapeng Zhou · Mutian Xu · Junle Wang · Xiaoguang Han | N/A | Code |
| OneFormer: One Transformer To Rule Universal Image Segmentation | Jitesh Jain · Jiachen Li · Mang Tik Chiu · Ali Hassani · Nikita Orlov · Humphrey Shi | N/A | Code |
| Mask-Free OVIS: Open-Vocabulary Instance Segmentation Without Manual Mask Annotations | Vibashan VS · Ning Yu · Chen Xing · Can Qin · Mingfei Gao · Juan Carlos Niebles · Vishal M. Patel · Ran Xu | N/A | Code |
| Multiclass Confidence and Localization Calibration for Object Detection | Bimsara Pathiraja · Malitha Gunawardhana · Muhammad Haris Khan | N/A | Code |
| Structured Kernel Estimation for Photon-Limited Deconvolution | Yash Sanghvi · Zhiyuan Mao · Stanley H. Chan | N/A | Code |
| CLIPPO: Image-and-Language Understanding From Pixels Only | Michael Tschannen · Basil Mustafa · Neil Houlsby | N/A | Code |
| Actionlet-Dependent Contrastive Learning for Unsupervised Skeleton-Based Action Recognition | Lilang Lin · Jiahang Zhang · Jiaying Liu | N/A | Code |
| Role of Transients in Two-Bounce Non-Line-of-Sight Imaging | Siddharth Somasundaram · Akshat Dave · Connor Henley · Ashok Veeraraghavan · Ramesh Raskar | N/A | Code |
| Shape-Aware Text-Driven Layered Video Editing | Yao-Chih Lee · Ji-Ze Genevieve Jang · Yi-Ting Chen · Elizabeth Qiu · Jia-Bin Huang | N/A | Code |
| FlexiViT: One Model for All Patch Sizes | Lucas Beyer · Pavel Izmailov · Alexander Kolesnikov · Mathilde Caron · Simon Kornblith · Xiaohua Zhai · Matthias Minderer · Michael Tschannen · Ibrahim Alabdulmohsin · Filip Pavetic | N/A | Code |
| Turning Strengths Into Weaknesses: A Certified Robustness Inspired Attack Framework Against Graph Neural Networks | Binghui Wang · Meng Pang · Yun Dong | N/A | Code |
| HairStep: Transfer Synthetic to Real Using Strand and Depth Maps for Single-View 3D Hair Modeling | Yujian Zheng · Zirong Jin · Moran Li · Haibin Huang · Chongyang Ma · Shuguang Cui · Xiaoguang Han | N/A | Code |
| RONO: Robust Discriminative Learning With Noisy Labels for 2D-3D Cross-Modal Retrieval | Yanglin Feng · Hongyuan Zhu · Dezhong Peng · Xi Peng · Peng Hu | N/A | Code |
| Learning Federated Visual Prompt in Null Space for MRI Reconstruction | Chun-Mei Feng · Bangjun Li · Xinxing Xu · Yong Liu · Huazhu Fu · Wangmeng Zuo | N/A | Code |
| VGFlow: Visibility Guided Flow Network for Human Reposing | Rishabh Jain · Krishna Kumar Singh · Mayur Hemani · Jingwan Lu · Mausoom Sarkar · Duygu Ceylan · Balaji Krishnamurthy | N/A | Code |
| Learning Attention As Disentangler for Compositional Zero-Shot Learning | Shaozhe Hao · Kai Han · Kwan-Yee K. Wong | N/A | Code |
| PET-NeuS: Positional Encoding Tri-Planes for Neural Surfaces | Yiqun Wang · Ivan Skorokhodov · Peter Wonka | N/A | Code |
| Perception-Oriented Single Image Super-Resolution Using Optimal Objective Estimation | Seung Ho Park · Young Su Moon · Nam Ik Cho | N/A | Code |
| Learning To Exploit Temporal Structure for Biomedical Vision–Language Processing | Shruthi Bannur · Stephanie Hyland · Qianchu Liu · Fernando Pérez-García · Maximilian Ilse · Daniel C. Castro · Benedikt Boecking · Harshita Sharma · Kenza Bouzid · Anja Thieme · Anton Schwaighofer · Maria Wetscherek · Matthew P. Lungren · Aditya Nori · Javier Alvarez-Valle · Ozan Oktay | N/A | Code |
| TRACE: 5D Temporal Regression of Avatars With Dynamic Cameras in 3D Environments | Yu Sun · Qian Bao · Wu Liu · Tao Mei · Michael J. Black | N/A | Code |
| Neumann Network With Recursive Kernels for Single Image Defocus Deblurring | Yuhui Quan · Zicong Wu · Hui Ji | N/A | Code |
| Guiding Pseudo-Labels With Uncertainty Estimation for Source-Free Unsupervised Domain Adaptation | Mattia Litrico · Alessio Del Bue · Pietro Morerio | N/A | Code |
| PlaneDepth: Self-Supervised Depth Estimation via Orthogonal Planes | Ruoyu Wang · Zehao Yu · Shenghua Gao | N/A | Code |
| Castling-ViT: Compressing Self-Attention via Switching Towards Linear-Angular Attention at Vision Transformer Inference | Haoran You · Yunyang Xiong · Xiaoliang Dai · Bichen Wu · Peizhao Zhang · Haoqi Fan · Peter Vajda · Yingyan (Celine) Lin | N/A | Code |
| Attention-Based Point Cloud Edge Sampling | Chengzhi Wu · Junwei Zheng · Julius Pfrommer · Jürgen Beyerer | N/A | Code |
| Structured 3D Features for Reconstructing Controllable Avatars | Enric Corona · Mihai Zanfir · Thiemo Alldieck · Eduard Gabriel Bazavan · Andrei Zanfir · Cristian Sminchisescu | N/A | Code |
| Zero-Shot Referring Image Segmentation With Global-Local Context Features | Seonghoon Yu · Paul Hongsuck Seo · Jeany Son | N/A | Code |
| CASP-Net: Rethinking Video Saliency Prediction From an Audio-Visual Consistency Perceptual Perspective | Junwen Xiong · Ganglai Wang · Peng Zhang · Wei Huang · Yufei Zha · Guangtao Zhai | N/A | Code |
| Context-Aware Relative Object Queries To Unify Video Instance and Panoptic Segmentation | Anwesa Choudhuri · Girish Chowdhary · Alexander G. Schwing | N/A | Code |
| Canonical Fields: Self-Supervised Learning of Pose-Canonicalized Neural Fields | Rohith Agaram · Shaurya Dewan · Rahul Sajnani · Adrien Poulenard · Madhava Krishna · Srinath Sridhar | N/A | Code |
| Decoupled Multimodal Distilling for Emotion Recognition | Yong Li · Yuanzhi Wang · Zhen Cui | N/A | Code |
| TensoIR: Tensorial Inverse Rendering | Haian Jin · Isabella Liu · Peijia Xu · Xiaoshuai Zhang · Songfang Han · Sai Bi · Xiaowei Zhou · Zexiang Xu · Hao Su | N/A | Code |
| Zero-Shot Generative Model Adaptation via Image-Specific Prompt Learning | Jiayi Guo · Chaofei Wang · You Wu · Eric Zhang · Kai Wang · Xingqian Xu · Shiji Song · Humphrey Shi · Gao Huang | N/A | Code |
| DiGeo: Discriminative Geometry-Aware Learning for Generalized Few-Shot Object Detection | Jiawei Ma · Yulei Niu · Jincheng Xu · Shiyuan Huang · Guangxing Han · Shih-Fu Chang | N/A | Code |
| Unbalanced Optimal Transport: A Unified Framework for Object Detection | Henri De Plaen · Pierre-François De Plaen · Johan A. K. Suykens · Marc Proesmans · Tinne Tuytelaars · Luc Van Gool | N/A | Code |
| NeRFInvertor: High Fidelity NeRF-GAN Inversion for Single-Shot Real Image Animation | Yu Yin · Kamran Ghasedi · HsiangTao Wu · Jiaolong Yang · Xin Tong · Yun Fu | N/A | Code |
| Masked Image Training for Generalizable Deep Image Denoising | Haoyu Chen · Jinjin Gu · Yihao Liu · Salma Abdel Magid · Chao Dong · Qiong Wang · Hanspeter Pfister · Lei Zhu | N/A | Code |
| Toward Verifiable and Reproducible Human Evaluation for Text-to-Image Generation | Mayu Otani · Riku Togashi · Yu Sawai · Ryosuke Ishigami · Yuta Nakashima · Esa Rahtu · Janne Heikkilä · Shin’ichi Satoh | N/A | Code |
| Towards Flexible Multi-Modal Document Models | Naoto Inoue · Kotaro Kikuchi · Edgar Simo-Serra · Mayu Otani · Kota Yamaguchi | N/A | Code |
| Zero-Shot Everything Sketch-Based Image Retrieval, and in Explainable Style | Fengyin Lin · Mingkang Li · Da Li · Timothy Hospedales · Yi-Zhe Song · Yonggang Qi | N/A | Code |
| LidarGait: Benchmarking 3D Gait Recognition With Point Clouds | Chuanfu Shen · Chao Fan · Wei Wu · Rui Wang · George Q. Huang · Shiqi Yu | N/A | Code |
| OpenGait: Revisiting Gait Recognition Towards Better Practicality | Chao Fan · Junhao Liang · Chuanfu Shen · Saihui Hou · Yongzhen Huang · Shiqi Yu | N/A | Code |
| Towards Unsupervised Object Detection From LiDAR Point Clouds | Lunjun Zhang · Anqi Joyce Yang · Yuwen Xiong · Sergio Casas · Bin Yang · Mengye Ren · Raquel Urtasun | N/A | Code |
| Visual Language Pretrained Multiple Instance Zero-Shot Transfer for Histopathology Images | Ming Y. Lu · Bowen Chen · Andrew Zhang · Drew F. K. Williamson · Richard J. Chen · Tong Ding · Long Phi Le · Yung-Sung Chuang · Faisal Mahmood | N/A | Code |
| DivClust: Controlling Diversity in Deep Clustering | Ioannis Maniadis Metaxas · Georgios Tzimiropoulos · Ioannis Patras | N/A | Code |
| AttriCLIP: A Non-Incremental Learner for Incremental Knowledge Learning | Runqi Wang · Xiaoyue Duan · Guoliang Kang · Jianzhuang Liu · Shaohui Lin · Songcen Xu · Jinhu Lü · Baochang Zhang | N/A | Code |
| Unsupervised Continual Semantic Adaptation Through Neural Rendering | Zhizheng Liu · Francesco Milano · Jonas Frey · Roland Siegwart · Hermann Blum · Cesar Cadena | N/A | Code |
| Semi-Supervised Parametric Real-World Image Harmonization | Ke Wang · Michaël Gharbi · He Zhang · Zhihao Xia · Eli Shechtman | N/A | Code |
| EqMotion: Equivariant Multi-Agent Motion Prediction With Invariant Interaction Reasoning | Chenxin Xu · Robby T. Tan · Yuhong Tan · Siheng Chen · Yu Guang Wang · Xinchao Wang · Yanfeng Wang | N/A | Code |
| BUOL: A Bottom-Up Framework With Occupancy-Aware Lifting for Panoptic 3D Scene Reconstruction From a Single Image | Tao Chu · Pan Zhang · Qiong Liu · Jiaqi Wang | N/A | Code |
| Lite-Mono: A Lightweight CNN and Transformer Architecture for Self-Supervised Monocular Depth Estimation | Ning Zhang · Francesco Nex · George Vosselman · Norman Kerle | N/A | Code |
| Novel-View Acoustic Synthesis | Changan Chen · Alexander Richard · Roman Shapovalov · Vamsi Krishna Ithapu · Natalia Neverova · Kristen Grauman · Andrea Vedaldi | N/A | Code |
| Audio-Visual Grouping Network for Sound Localization From Mixtures | Shentong Mo · Yapeng Tian | N/A | Code |
| Chat2Map: Efficient Scene Mapping From Multi-Ego Conversations | Sagnik Majumder · Hao Jiang · Pierre Moulon · Ethan Henderson · Paul Calamia · Kristen Grauman · Vamsi Krishna Ithapu | N/A | Code |
| ConvNeXt V2: Co-Designing and Scaling ConvNets With Masked Autoencoders | Sanghyun Woo · Shoubhik Debnath · Ronghang Hu · Xinlei Chen · Zhuang Liu · In So Kweon · Saining Xie | N/A | Code |
| Collaboration Helps Camera Overtake LiDAR in 3D Detection | Yue Hu · Yifan Lu · Runsheng Xu · Weidi Xie · Siheng Chen · Yanfeng Wang | N/A | Code |
| Few-Shot Learning With Visual Distribution Calibration and Cross-Modal Distribution Alignment | Runqi Wang · Hao Zheng · Xiaoyue Duan · Jianzhuang Liu · Yuning Lu · Tian Wang · Songcen Xu · Baochang Zhang | N/A | Code |
| MetaCLUE: Towards Comprehensive Visual Metaphors Research | Arjun R. Akula · Brendan Driscoll · Pradyumna Narayana · Soravit Changpinyo · Zhiwei Jia · Suyash Damle · Garima Pruthi · Sugato Basu · Leonidas Guibas · William Freeman · Yuanzhen Li · Varun Jampani | N/A | Code |
| Deep Fair Clustering via Maximizing and Minimizing Mutual Information: Theory, Algorithm and Metric | Pengxin Zeng · Yunfan Li · Peng Hu · Dezhong Peng · Jiancheng Lv · Xi Peng | N/A | Code |
| Visual Atoms: Pre-Training Vision Transformers With Sinusoidal Waves | Sora Takashima · Ryo Hayamizu · Nakamasa Inoue · Hirokatsu Kataoka · Rio Yokota | N/A | Code |
| 3D-Aware Multi-Class Image-to-Image Translation With NeRFs | Senmao Li · Joost van de Weijer · Yaxing Wang · Fahad Shahbaz Khan · Meiqin Liu · Jian Yang | N/A | Code |
| E2PN: Efficient SE(3)-Equivariant Point Network | Minghan Zhu · Maani Ghaffari · William A. Clark · Huei Peng | N/A | Code |
| PixHt-Lab: Pixel Height Based Light Effect Generation for Image Compositing | Yichen Sheng · Jianming Zhang · Julien Philip · Yannick Hold-Geoffroy · Xin Sun · He Zhang · Lu Ling · Bedrich Benes | N/A | Code |
| UniSim: A Neural Closed-Loop Sensor Simulator | Ze Yang · Yun Chen · Jingkang Wang · Sivabalan Manivasagam · Wei-Chiu Ma · Anqi Joyce Yang · Raquel Urtasun | N/A | Code |
| Occlusion-Free Scene Recovery via Neural Radiance Fields | Chengxuan Zhu · Renjie Wan · Yunkai Tang · Boxin Shi | N/A | Code |
| SPIn-NeRF: Multiview Segmentation and Perceptual Inpainting With Neural Radiance Fields | Ashkan Mirzaei · Tristan Aumentado-Armstrong · Kosta Derpanis · Jonathan Kelly · Marcus A. Brubaker · Igor Gilitschenski · Alex Levinshtein | N/A | Code |
| Class-Incremental Exemplar Compression for Class-Incremental Learning | Zilin Luo · Yaoyao Liu · Bernt Schiele · Qianru Sun | N/A | Code |
| DETRs With Hybrid Matching | Ding Jia · Yuhui Yuan · Haodi He · Xiaopei Wu · Haojun Yu · Weihong Lin · Lei Sun · Chao Zhang · Han Hu | N/A | Code |
| 3D Human Mesh Estimation From Virtual Markers | Xiaoxuan Ma · Jiajun Su · Chunyu Wang · Wentao Zhu · Yizhou Wang | N/A | Code |
| Objaverse: A Universe of Annotated 3D Objects | Matt Deitke · Dustin Schwenk · Jordi Salvador · Luca Weihs · Oscar Michel · Eli VanderBilt · Ludwig Schmidt · Kiana Ehsani · Aniruddha Kembhavi · Ali Farhadi | N/A | Code |
| Adjustment and Alignment for Unbiased Open Set Domain Adaptation | Wuyang Li · Jie Liu · Bo Han · Yixuan Yuan | N/A | Code |
| TimeBalance: Temporally-Invariant and Temporally-Distinctive Video Representations for Semi-Supervised Action Recognition | Ishan Rajendrakumar Dave · Mamshad Nayeem Rizve · Chen Chen · Mubarak Shah | N/A | Code |
| EfficientSCI: Densely Connected Network With Space-Time Factorization for Large-Scale Video Snapshot Compressive Imaging | Lishun Wang · Miao Cao · Xin Yuan | N/A | Code |
| Continual Detection Transformer for Incremental Object Detection | Yaoyao Liu · Bernt Schiele · Andrea Vedaldi · Christian Rupprecht | N/A | Code |
| Hierarchical Prompt Learning for Multi-Task Learning | Yajing Liu · Yuning Lu · Hao Liu · Yaozu An · Zhuoran Xu · Zhuokun Yao · Baofeng Zhang · Zhiwei Xiong · Chenguang Gui | N/A | Code |
| Boost Vision Transformer With GPU-Friendly Sparsity and Quantization | Chong Yu · Tao Chen · Zhongxue Gan · Jiayuan Fan | N/A | Code |
| Demystifying Causal Features on Adversarial Examples and Causal Inoculation for Robust Network by Adversarial Instrumental Variable Regression | Junho Kim · Byung-Kwan Lee · Yong Man Ro | N/A | Code |
| Regularizing Second-Order Influences for Continual Learning | Zhicheng Sun · Yadong Mu · Gang Hua | N/A | Code |
| Heterogeneous Continual Learning | Divyam Madaan · Hongxu Yin · Wonmin Byeon · Jan Kautz · Pavlo Molchanov | N/A | Code |
| DP-NeRF: Deblurred Neural Radiance Field With Physical Scene Priors | Dogyoon Lee · Minhyeok Lee · Chajin Shin · Sangyoun Lee | N/A | Code |
| 3D-POP – An Automated Annotation Approach to Facilitate Markerless 2D-3D Tracking of Freely Moving Birds With Marker-Based Motion Capture | Hemal Naik · Alex Hoi Hang Chan · Junran Yang · Mathilde Delacoux · Iain D. Couzin · Fumihiro Kano · Máté Nagy | N/A | Code |
| Recognizing Rigid Patterns of Unlabeled Point Clouds by Complete and Continuous Isometry Invariants With No False Negatives and No False Positives | Daniel Widdowson · Vitaliy Kurlin | N/A | Code |
| Robust Model-Based Face Reconstruction Through Weakly-Supervised Outlier Segmentation | Chunlu Li · Andreas Morel-Forster · Thomas Vetter · Bernhard Egger · Adam Kortylewski | N/A | Code |
| Implicit Identity Leakage: The Stumbling Block to Improving Deepfake Detection Generalization | Shichao Dong · Jin Wang · Renhe Ji · Jiajun Liang · Haoqiang Fan · Zheng Ge | N/A | Code |
| PoseExaminer: Automated Testing of Out-of-Distribution Robustness in Human Pose and Shape Estimation | Qihao Liu · Adam Kortylewski · Alan L. Yuille | N/A | Code |
| VoxelNeXt: Fully Sparse VoxelNet for 3D Object Detection and Tracking | Yukang Chen · Jianhui Liu · Xiangyu Zhang · Xiaojuan Qi · Jiaya Jia | N/A | Code |
| 1000 FPS HDR Video With a Spike-RGB Hybrid Camera | Yakun Chang · Chu Zhou · Yuchen Hong · Liwen Hu · Chao Xu · Tiejun Huang · Boxin Shi | N/A | Code |
| How to Backdoor Diffusion Models? | Sheng-Yen Chou · Pin-Yu Chen · Tsung-Yi Ho | N/A | Code |
| PIP-Net: Patch-Based Intuitive Prototypes for Interpretable Image Classification | Meike Nauta · Jörg Schlötterer · Maurice van Keulen · Christin Seifert | N/A | Code |
| Joint Token Pruning and Squeezing Towards More Aggressive Compression of Vision Transformers | Siyuan Wei · Tianzhu Ye · Shen Zhang · Yao Tang · Jiajun Liang | N/A | Code |
| Energy-Efficient Adaptive 3D Sensing | Brevin Tilmon · Zhanghao Sun · Sanjeev J. Koppal · Yicheng Wu · Georgios Evangelidis · Ramzi Zahreddine · Gurunandan Krishnan · Sizhuo Ma · Jian Wang | N/A | Code |
| Boosting Semi-Supervised Learning by Exploiting All Unlabeled Data | Yuhao Chen · Xin Tan · Borui Zhao · Zhaowei Chen · Renjie Song · Jiajun Liang · Xuequan Lu | N/A | Code |
| Fix the Noise: Disentangling Source Feature for Controllable Domain Translation | Dongyeun Lee · Jae Young Lee · Doyeon Kim · Jaehyun Choi · Jaejun Yoo · Junmo Kim | N/A | Code |
| Learning Transferable Spatiotemporal Representations From Natural Script Knowledge | Ziyun Zeng · Yuying Ge · Xihui Liu · Bin Chen · Ping Luo · Shu-Tao Xia · Yixiao Ge | N/A | Code |
| Side Adapter Network for Open-Vocabulary Semantic Segmentation | Mengde Xu · Zheng Zhang · Fangyun Wei · Han Hu · Xiang Bai | N/A | Code |
| A Strong Baseline for Generalized Few-Shot Semantic Segmentation | Sina Hajimiri · Malik Boudiaf · Ismail Ben Ayed · Jose Dolz | N/A | Code |
| Towards Compositional Adversarial Robustness: Generalizing Adversarial Training to Composite Semantic Perturbations | Lei Hsiung · Yun-Yun Tsai · Pin-Yu Chen · Tsung-Yi Ho | N/A | Code |
| Normalizing Flow Based Feature Synthesis for Outlier-Aware Object Detection | Nishant Kumar · Siniša Šegvić · Abouzar Eslami · Stefan Gumhold | N/A | Code |
| AVFace: Towards Detailed Audio-Visual 4D Face Reconstruction | Aggelina Chatziagapi · Dimitris Samaras | N/A | Code |
| Learning Semantic Relationship Among Instances for Image-Text Matching | Zheren Fu · Zhendong Mao · Yan Song · Yongdong Zhang | N/A | Code |
| Understanding Imbalanced Semantic Segmentation Through Neural Collapse | Zhisheng Zhong · Jiequan Cui · Yibo Yang · Xiaoyang Wu · Xiaojuan Qi · Xiangyu Zhang · Jiaya Jia | N/A | Code |
| SCADE: NeRFs from Space Carving With Ambiguity-Aware Depth Estimates | Mikaela Angelina Uy · Ricardo Martin-Brualla · Leonidas Guibas · Ke Li | N/A | Code |
| MonoHuman: Animatable Human Neural Field From Monocular Video | Zhengming Yu · Wei Cheng · Xian Liu · Wayne Wu · Kwan-Yee Lin | N/A | Code |
| Affection: Learning Affective Explanations for Real-World Visual Data | Panos Achlioptas · Maks Ovsjanikov · Leonidas Guibas · Sergey Tulyakov | N/A | Code |
| Sharpness-Aware Gradient Matching for Domain Generalization | Pengfei Wang · Zhaoxiang Zhang · Zhen Lei · Lei Zhang | N/A | Code |
| Generalized Decoding for Pixel, Image, and Language | Xueyan Zou · Zi-Yi Dou · Jianwei Yang · Zhe Gan · Linjie Li · Chunyuan Li · Xiyang Dai · Harkirat Behl · Jianfeng Wang · Lu Yuan · Nanyun Peng · Lijuan Wang · Yong Jae Lee · Jianfeng Gao | N/A | Code |
| How You Feelin’? Learning Emotions and Mental States in Movie Scenes | Dhruv Srivastava · Aditya Kumar Singh · Makarand Tapaswi | N/A | Code |
| Improving Visual Representation Learning Through Perceptual Understanding | Samyakh Tukra · Frederick Hoffman · Ken Chatfield | N/A | Code |
| PlenVDB: Memory Efficient VDB-Based Radiance Fields for Fast Training and Rendering | Han Yan · Celong Liu · Chao Ma · Xing Mei | N/A | Code |
| HaLP: Hallucinating Latent Positives for Skeleton-Based Self-Supervised Learning of Actions | Anshul Shah · Aniket Roy · Ketul Shah · Shlok Mishra · David Jacobs · Anoop Cherian · Rama Chellappa | N/A | Code |
| FeatureBooster: Boosting Feature Descriptors With a Lightweight Neural Network | Xinjiang Wang · Zeyu Liu · Yu Hu · Wei Xi · Wenxian Yu · Danping Zou | N/A | Code |
| ACL-SPC: Adaptive Closed-Loop System for Self-Supervised Point Cloud Completion | Sangmin Hong · Mohsen Yavartanoo · Reyhaneh Neshatavar · Kyoung Mu Lee | N/A | Code |
| NeRF in the Palm of Your Hand: Corrective Augmentation for Robotics via Novel-View Synthesis | Allan Zhou · Moo Jin Kim · Lirui Wang · Pete Florence · Chelsea Finn | N/A | Code |
| Query-Centric Trajectory Prediction | Zikang Zhou · Jianping Wang · Yung-Hui Li · Yu-Kai Huang | N/A | Code |
| EDA: Explicit Text-Decoupling and Dense Alignment for 3D Visual Grounding | Yanmin Wu · Xinhua Cheng · Renrui Zhang · Zesen Cheng · Jian Zhang | N/A | Code |
| Sliced Optimal Partial Transport | Yikun Bai · Bernhard Schmitzer · Matthew Thorpe · Soheil Kolouri | N/A | Code |
| PersonNeRF: Personalized Reconstruction From Photo Collections | Chung-Yi Weng · Pratul P. Srinivasan · Brian Curless · Ira Kemelmacher-Shlizerman | N/A | Code |
| Feature Shrinkage Pyramid for Camouflaged Object Detection With Transformers | Zhou Huang · Hang Dai · Tian-Zhu Xiang · Shuo Wang · Huai-Xin Chen · Jie Qin · Huan Xiong | N/A | Code |
| HOLODIFFUSION: Training a 3D Diffusion Model Using 2D Images | Animesh Karnewar · Andrea Vedaldi · David Novotny · Niloy J. Mitra | N/A | Code |
| Towards Efficient Use of Multi-Scale Features in Transformer-Based Object Detectors | Gongjie Zhang · Zhipeng Luo · Zichen Tian · Jingyi Zhang · Xiaoqin Zhang · Shijian Lu | N/A | Code |
| Interventional Bag Multi-Instance Learning on Whole-Slide Pathological Images | Tiancheng Lin · Zhimiao Yu · Hongyu Hu · Yi Xu · Chang-Wen Chen | N/A | Code |
| Meta-Explore: Exploratory Hierarchical Vision-and-Language Navigation Using Scene Object Spectrum Grounding | Minyoung Hwang · Jaeyeon Jeong · Minsoo Kim · Yoonseon Oh · Songhwai Oh | N/A | Code |
| Sketch2Saliency: Learning To Detect Salient Objects From Human Drawings | Ayan Kumar Bhunia · Subhadeep Koley · Amandeep Kumar · Aneeshan Sain · Pinaki Nath Chowdhury · Tao Xiang · Yi-Zhe Song | N/A | Code |
| Picture That Sketch: Photorealistic Image Generation From Abstract Sketches | Subhadeep Koley · Ayan Kumar Bhunia · Aneeshan Sain · Pinaki Nath Chowdhury · Tao Xiang · Yi-Zhe Song | N/A | Code |
| CLIP for All Things Zero-Shot Sketch-Based Image Retrieval, Fine-Grained or Not | Aneeshan Sain · Ayan Kumar Bhunia · Pinaki Nath Chowdhury · Subhadeep Koley · Tao Xiang · Yi-Zhe Song | N/A | Code |
| LANIT: Language-Driven Image-to-Image Translation for Unlabeled Data | Jihye Park · Sunwoo Kim · Soohyun Kim · Seokju Cho · Jaejun Yoo · Youngjung Uh · Seungryong Kim | N/A | Code |
| Depth Estimation From Indoor Panoramas With Neural Scene Representation | Wenjie Chang · Yueyi Zhang · Zhiwei Xiong | N/A | Code |
| What Can Human Sketches Do for Object Detection? | Pinaki Nath Chowdhury · Ayan Kumar Bhunia · Aneeshan Sain · Subhadeep Koley · Tao Xiang · Yi-Zhe Song | N/A | Code |
| SceneTrilogy: On Human Scene-Sketch and Its Complementarity With Photo and Text | Pinaki Nath Chowdhury · Ayan Kumar Bhunia · Aneeshan Sain · Subhadeep Koley · Tao Xiang · Yi-Zhe Song | N/A | Code |
| Markerless Camera-to-Robot Pose Estimation via Self-Supervised Sim-to-Real Transfer | Jingpei Lu · Florian Richter · Michael C. Yip | N/A | Code |
| Fine-Grained Audible Video Description | Xuyang Shen · Dong Li · Jinxing Zhou · Zhen Qin · Bowen He · Xiaodong Han · Aixuan Li · Mochu Xiang · Lingpeng Kong · Meng Wang · Yu Qiao · Yiran Zhong | N/A | Code |
| EfficientViT: Memory Efficient Vision Transformer With Cascaded Group Attention | Xinyu Liu · Houwen Peng · Ningxin Zheng · Yuqing Yang · Han Hu · Yixuan Yuan | N/A | Code |
| Relightable Neural Human Assets From Multi-View Gradient Illuminations | Taotao Zhou · Kai He · Di Wu · Teng Xu · Qixuan Zhang · Kuixiang Shao · Wenzheng Chen · Lan Xu · Jingyi Yu | N/A | Code |
| Music-Driven Group Choreography | Nhat Le · Thang Pham · Tuong Do · Erman Tjiputra · Quang D. Tran · Anh Nguyen | N/A | Code |
| DIP: Dual Incongruity Perceiving Network for Sarcasm Detection | Changsong Wen · Guoli Jia · Jufeng Yang | N/A | Code |
| MagicPony: Learning Articulated 3D Animals in the Wild | Shangzhe Wu · Ruining Li · Tomas Jakab · Christian Rupprecht · Andrea Vedaldi | N/A | Code |
| Preserving Linear Separability in Continual Learning by Backward Feature Projection | Qiao Gu · Dongsub Shim · Florian Shkurti | N/A | Code |
| Improving Fairness in Facial Albedo Estimation via Visual-Textual Cues | Xingyu Ren · Jiankang Deng · Chao Ma · Yichao Yan · Xiaokang Yang | N/A | Code |
| HOICLIP: Efficient Knowledge Transfer for HOI Detection With Vision-Language Models | Shan Ning · Longtian Qiu · Yongfei Liu · Xuming He | N/A | Code |
| Regularization of Polynomial Networks for Image Recognition | Grigorios G. Chrysos · Bohan Wang · Jiankang Deng · Volkan Cevher | N/A | Code |
| Exploiting Unlabelled Photos for Stronger Fine-Grained SBIR | Aneeshan Sain · Ayan Kumar Bhunia · Subhadeep Koley · Pinaki Nath Chowdhury · Soumitri Chattopadhyay · Tao Xiang · Yi-Zhe Song | N/A | Code |
| Learning Semantic-Aware Knowledge Guidance for Low-Light Image Enhancement | Yuhui Wu · Chen Pan · Guoqing Wang · Yang Yang · Jiwei Wei · Chongyi Li · Heng Tao Shen | N/A | Code |
| Block Selection Method for Using Feature Norm in Out-of-Distribution Detection | Yeonguk Yu · Sungho Shin · Seongju Lee · Changhyun Jun · Kyoobin Lee | N/A | Code |
| HouseDiffusion: Vector Floorplan Generation via a Diffusion Model With Discrete and Continuous Denoising | Mohammad Amin Shabani · Sepidehsadat Hosseini · Yasutaka Furukawa | N/A | Code |
| Integral Neural Networks | Kirill Solodskikh · Azim Kurbanov · Ruslan Aydarkhanov · Irina Zhelavskaya · Yury Parfenov · Dehua Song · Stamatios Lefkimmiatis | N/A | Code |
| FitMe: Deep Photorealistic 3D Morphable Model Avatars | Alexandros Lattas · Stylianos Moschoglou · Stylianos Ploumpis · Baris Gecer · Jiankang Deng · Stefanos Zafeiriou | N/A | Code |
| Sound to Visual Scene Generation by Audio-to-Visual Latent Alignment | Kim Sung-Bin · Arda Senocak · Hyunwoo Ha · Andrew Owens · Tae-Hyun Oh | N/A | Code |
| Introducing Competition To Boost the Transferability of Targeted Adversarial Examples Through Clean Feature Mixup | Junyoung Byun · Myung-Joon Kwon · Seungju Cho · Yoonji Kim · Changick Kim | N/A | Code |
| Initialization Noise in Image Gradients and Saliency Maps | Ann-Christin Woerl · Jan Disselhoff · Michael Wand | N/A | Code |
| Two-Shot Video Object Segmentation | Kun Yan · Xiao Li · Fangyun Wei · Jinglu Wang · Chenbin Zhang · Ping Wang · Yan Lu | N/A | Code |
| SCOOP: Self-Supervised Correspondence and Optimization-Based Scene Flow | Itai Lang · Dror Aiger · Forrester Cole · Shai Avidan · Michael Rubinstein | N/A | Code |
| Co-SLAM: Joint Coordinate and Sparse Parametric Encodings for Neural Real-Time SLAM | Hengyi Wang · Jingwen Wang · Lourdes Agapito | N/A | Code |
| Semantic-Promoted Debiasing and Background Disambiguation for Zero-Shot Instance Segmentation | Shuting He · Henghui Ding · Wei Jiang | N/A | Code |
| Rawgment: Noise-Accounted RAW Augmentation Enables Recognition in a Wide Variety of Environments | Masakazu Yoshimura · Junji Otsuka · Atsushi Irie · Takeshi Ohashi | N/A | Code |
| Diffusion-Based Signed Distance Fields for 3D Shape Generation | Jaehyeok Shim · Changwoo Kang · Kyungdon Joo | N/A | Code |
| Handwritten Text Generation From Visual Archetypes | Vittorio Pippi · Silvia Cascianelli · Rita Cucchiara | N/A | Code |
| Novel Class Discovery for 3D Point Cloud Semantic Segmentation | Luigi Riz · Cristiano Saltori · Elisa Ricci · Fabio Poiesi | N/A | Code |
| DeltaEdit: Exploring Text-Free Training for Text-Driven Image Manipulation | Yueming Lyu · Tianwei Lin · Fu Li · Dongliang He · Jing Dong · Tieniu Tan | N/A | Code |
| SkyEye: Self-Supervised Bird’s-Eye-View Semantic Mapping Using Monocular Frontal View Images | Nikhil Gosala · Kürsat Petek · Paulo L. J. Drews-Jr · Wolfram Burgard · Abhinav Valada | N/A | Code |
| Towards Open-World Segmentation of Parts | Tai-Yu Pan · Qing Liu · Wei-Lun Chao · Brian Price | N/A | Code |
| DeepMapping2: Self-Supervised Large-Scale LiDAR Map Optimization | Chao Chen · Xinhao Liu · Yiming Li · Li Ding · Chen Feng | N/A | Code |
| SINE: SINgle Image Editing With Text-to-Image Diffusion Models | Zhixing Zhang · Ligong Han · Arnab Ghosh · Dimitris N. Metaxas · Jian Ren | N/A | Code |
| Discriminative Co-Saliency and Background Mining Transformer for Co-Salient Object Detection | Long Li · Junwei Han · Ni Zhang · Nian Liu · Salman Khan · Hisham Cholakkal · Rao Muhammad Anwer · Fahad Shahbaz Khan | N/A | Code |
| TruFor: Leveraging All-Round Clues for Trustworthy Image Forgery Detection and Localization | Fabrizio Guillaro · Davide Cozzolino · Avneesh Sud · Nicholas Dufour · Luisa Verdoliva | N/A | Code |
| SeSDF: Self-Evolved Signed Distance Field for Implicit 3D Clothed Human Reconstruction | Yukang Cao · Kai Han · Kwan-Yee K. Wong | N/A | Code |
| Hubs and Hyperspheres: Reducing Hubness and Improving Transductive Few-Shot Learning With Hyperspherical Embeddings | Daniel J. Trosten · Rwiddhi Chakraborty · Sigurd Løkse · Kristoffer Knutsen Wickstrøm · Robert Jenssen · Michael C. Kampffmeyer | N/A | Code |
| MAGE: MAsked Generative Encoder To Unify Representation Learning and Image Synthesis | Tianhong Li · Huiwen Chang · Shlok Mishra · Han Zhang · Dina Katabi · Dilip Krishnan | N/A | Code |
| Model Barrier: A Compact Un-Transferable Isolation Domain for Model Intellectual Property Protection | Lianyu Wang · Meng Wang · Daoqiang Zhang · Huazhu Fu | N/A | Code |
| OvarNet: Towards Open-Vocabulary Object Attribute Recognition | Keyan Chen · Xiaolong Jiang · Yao Hu · Xu Tang · Yan Gao · Jianqi Chen · Weidi Xie | N/A | Code |
| GINA-3D: Learning To Generate Implicit Neural Assets in the Wild | Bokui Shen · Xinchen Yan · Charles R. Qi · Mahyar Najibi · Boyang Deng · Leonidas Guibas · Yin Zhou · Dragomir Anguelov | N/A | Code |
| PoseFormerV2: Exploring Frequency Domain for Efficient and Robust 3D Human Pose Estimation | Qitao Zhao · Ce Zheng · Mengyuan Liu · Pichao Wang · Chen Chen | N/A | Code |
| Proposal-Based Multiple Instance Learning for Weakly-Supervised Temporal Action Localization | Huan Ren · Wenfei Yang · Tianzhu Zhang · Yongdong Zhang | N/A | Code |
| Learning Partial Correlation Based Deep Visual Representation for Image Classification | Saimunur Rahman · Piotr Koniusz · Lei Wang · Luping Zhou · Peyman Moghadam · Changming Sun | N/A | Code |
| Multi-Granularity Archaeological Dating of Chinese Bronze Dings Based on a Knowledge-Guided Relation Graph | Rixin Zhou · Jiafu Wei · Qian Zhang · Ruihua Qi · Xi Yang · Chuntao Li | N/A | Code |
| DexArt: Benchmarking Generalizable Dexterous Manipulation With Articulated Objects | Chen Bao · Helin Xu · Yuzhe Qin · Xiaolong Wang | N/A | Code |
| Modeling the Distributional Uncertainty for Salient Object Detection Models | Xinyu Tian · Jing Zhang · Mochu Xiang · Yuchao Dai | N/A | Code |
| Evading Forensic Classifiers With Attribute-Conditioned Adversarial Faces | Fahad Shamshad · Koushik Srivatsan · Karthik Nandakumar | N/A | Code |
| Scene-Aware Egocentric 3D Human Pose Estimation | Jian Wang · Diogo Luvizon · Weipeng Xu · Lingjie Liu · Kripasindhu Sarkar · Christian Theobalt | N/A | Code |
| Camouflaged Instance Segmentation via Explicit De-Camouflaging | Naisong Luo · Yuwen Pan · Rui Sun · Tianzhu Zhang · Zhiwei Xiong · Feng Wu | N/A | Code |
| N-Gram in Swin Transformers for Efficient Lightweight Image Super-Resolution | Haram Choi · Jeongmin Lee · Jihoon Yang | N/A | Code |
| Diffusion Video Autoencoders: Toward Temporally Consistent Face Video Editing via Disentangled Video Encoding | Gyeongman Kim · Hajin Shim · Hyunsu Kim · Yunjey Choi · Junho Kim · Eunho Yang | N/A | Code |
| GLIGEN: Open-Set Grounded Text-to-Image Generation | Yuheng Li · Haotian Liu · Qingyang Wu · Fangzhou Mu · Jianwei Yang · Jianfeng Gao · Chunyuan Li · Yong Jae Lee | N/A | Code |
| Balanced Spherical Grid for Egocentric View Synthesis | Changwoon Choi · Sang Min Kim · Young Min Kim | N/A | Code |
| V2V4Real: A Real-World Large-Scale Dataset for Vehicle-to-Vehicle Cooperative Perception | Runsheng Xu · Xin Xia · Jinlong Li · Hanzhao Li · Shuo Zhang · Zhengzhong Tu · Zonglin Meng · Hao Xiang · Xiaoyu Dong · Rui Song · Hongkai Yu · Bolei Zhou · Jiaqi Ma | N/A | Code |
| VindLU: A Recipe for Effective Video-and-Language Pretraining | Feng Cheng · Xizi Wang · Jie Lei · David Crandall · Mohit Bansal · Gedas Bertasius | N/A | Code |
| FreeSeg: Unified, Universal and Open-Vocabulary Image Segmentation | Jie Qin · Jie Wu · Pengxiang Yan · Ming Li · Ren Yuxi · Xuefeng Xiao · Yitong Wang · Rui Wang · Shilei Wen · Xin Pan · Xingang Wang | N/A | Code |
| NeurOCS: Neural NOCS Supervision for Monocular 3D Object Localization | Zhixiang Min · Bingbing Zhuang · Samuel Schulter · Buyu Liu · Enrique Dunn · Manmohan Chandraker | N/A | Code |
| ABCD: Arbitrary Bitwise Coefficient for De-Quantization | Woo Kyoung Han · Byeonghun Lee · Sang Hyun Park · Kyong Hwan Jin | N/A | Code |
| PromptCAL: Contrastive Affinity Learning via Auxiliary Prompts for Generalized Novel Category Discovery | Sheng Zhang · Salman Khan · Zhiqiang Shen · Muzammal Naseer · Guangyi Chen · Fahad Shahbaz Khan | N/A | Code |
| Mitigating Task Interference in Multi-Task Learning via Explicit Task Routing With Non-Learnable Primitives | Chuntao Ding · Zhichao Lu · Shangguang Wang · Ran Cheng · Vishnu Naresh Boddeti | N/A | Code |
| MaPLe: Multi-Modal Prompt Learning | Muhammad Uzair Khattak · Hanoona Rasheed · Muhammad Maaz · Salman Khan · Fahad Shahbaz Khan | N/A | Code |
| Revisiting Residual Networks for Adversarial Robustness | Shihua Huang · Zhichao Lu · Kalyanmoy Deb · Vishnu Naresh Boddeti | N/A | Code |
| Bridging the Gap Between Model Explanations in Partially Annotated Multi-Label Classification | Youngwook Kim · Jae Myung Kim · Jieun Jeong · Cordelia Schmid · Zeynep Akata · Jungwoo Lee | N/A | Code |
| Human Pose Estimation in Extremely Low-Light Conditions | Sohyun Lee · Jaesung Rim · Boseung Jeong · Geonu Kim · Byungju Woo · Haechan Lee · Sunghyun Cho · Suha Kwak | N/A | Code |
| Towards Robust Tampered Text Detection in Document Image: New Dataset and New Solution | Chenfan Qu · Chongyu Liu · Yuliang Liu · Xinhong Chen · Dezhi Peng · Fengjun Guo · Lianwen Jin | N/A | Code |
| SinGRAF: Learning a 3D Generative Radiance Field for a Single Scene | Minjung Son · Jeong Joon Park · Leonidas Guibas · Gordon Wetzstein | N/A | Code |
| LEGO-Net: Learning Regular Rearrangements of Objects in Rooms | Qiuhong Anna Wei · Sijie Ding · Jeong Joon Park · Rahul Sajnani · Adrien Poulenard · Srinath Sridhar · Leonidas Guibas | N/A | Code |
| MACARONS: Mapping and Coverage Anticipation With RGB Online Self-Supervision | Antoine Guédon · Tom Monnier · Pascal Monasse · Vincent Lepetit | N/A | Code |
| ALTO: Alternating Latent Topologies for Implicit 3D Reconstruction | Zhen Wang · Shijie Zhou · Jeong Joon Park · Despoina Paschalidou · Suya You · Gordon Wetzstein · Leonidas Guibas · Achuta Kadambi | N/A | Code |
| Class Prototypes Based Contrastive Learning for Classifying Multi-Label and Fine-Grained Educational Videos | Rohit Gupta · Anirban Roy · Claire Christensen · Sujeong Kim · Sarah Gerard · Madeline Cincebeaux · Ajay Divakaran · Todd Grindal · Mubarak Shah | N/A | Code |
| DeepMAD: Mathematical Architecture Design for Deep Convolutional Neural Network | Xuan Shen · Yaohua Wang · Ming Lin · Yilun Huang · Hao Tang · Xiuyu Sun · Yanzhi Wang | N/A | Code |
| ReLight My NeRF: A Dataset for Novel View Synthesis and Relighting of Real World Objects | Marco Toschi · Riccardo De Matteo · Riccardo Spezialetti · Daniele De Gregorio · Luigi Di Stefano · Samuele Salti | N/A | Code |
| Exact-NeRF: An Exploration of a Precise Volumetric Parameterization for Neural Radiance Fields | Brian K. S. Isaac-Medina · Chris G. Willcocks · Toby P. Breckon | N/A | Code |
| A Generalized Framework for Video Instance Segmentation | Miran Heo · Sukjun Hwang · Jeongseok Hyun · Hanjung Kim · Seoung Wug Oh · Joon-Young Lee · Seon Joo Kim | N/A | Code |
| Video Probabilistic Diffusion Models in Projected Latent Space | Sihyun Yu · Kihyuk Sohn · Subin Kim · Jinwoo Shin | N/A | Code |
| X-Avatar: Expressive Human Avatars | Kaiyue Shen · Chen Guo · Manuel Kaufmann · Juan Jose Zarate · Julien Valentin · Jie Song · Otmar Hilliges | N/A | Code |
| Hi4D: 4D Instance Segmentation of Close Human Interaction | Yifei Yin · Chen Guo · Manuel Kaufmann · Juan Jose Zarate · Jie Song · Otmar Hilliges | N/A | Code |
| Joint Appearance and Motion Learning for Efficient Rolling Shutter Correction | Bin Fan · Yuxin Mao · Mochu Xiang · Zhexiong Wan · Qi Liu | N/A | Code |
| Vita-CLIP: Video and Text Adaptive CLIP via Multimodal Prompting | Syed Talal Wasim · Muzammal Naseer · Salman Khan · Fahad Shahbaz Khan · Mubarak Shah | N/A | Code |
| MaskSketch: Unpaired Structure-Guided Masked Image Generation | Dina Bashkirova · José Lezama · Kihyuk Sohn · Kate Saenko · Irfan Essa | N/A | Code |
| Super-CLEVR: A Virtual Benchmark To Diagnose Domain Robustness in Visual Reasoning | Zhuowan Li · Xingrui Wang · Elias Stengel-Eskin · Adam Kortylewski · Wufei Ma · Benjamin Van Durme · Alan L. Yuille | N/A | Code |
| CREPE: Can Vision-Language Foundation Models Reason Compositionally? | Zixian Ma · Jerry Hong · Mustafa Omer Gul · Mona Gandhi · Irena Gao · Ranjay Krishna | N/A | Code |
| ORCa: Glossy Objects As Radiance-Field Cameras | Kushagra Tiwary · Akshat Dave · Nikhil Behari · Tzofi Klinghoffer · Ashok Veeraraghavan · Ramesh Raskar | N/A | Code |
| Learning Common Rationale To Improve Self-Supervised Representation for Fine-Grained Visual Recognition Problems | Yangyang Shu · Anton van den Hengel · Lingqiao Liu | N/A | Code |
| Implicit Occupancy Flow Fields for Perception and Prediction in Self-Driving | Ben Agro · Quinlan Sykora · Sergio Casas · Raquel Urtasun | N/A | Code |
| Improved Test-Time Adaptation for Domain Generalization | Liang Chen · Yong Zhang · Yibing Song · Ying Shan · Lingqiao Liu | N/A | Code |
| Wavelet Diffusion Models Are Fast and Scalable Image Generators | Hao Phung · Quan Dao · Anh Tran | N/A | Code |
| Robust Dynamic Radiance Fields | Yu-Lun Liu · Chen Gao · Andréas Meuleman · Hung-Yu Tseng · Ayush Saraf · Changil Kim · Yung-Yu Chuang · Johannes Kopf · Jia-Bin Huang | N/A | Code |
| MixSim: A Hierarchical Framework for Mixed Reality Traffic Simulation | Simon Suo · Kelvin Wong · Justin Xu · James Tu · Alexander Cui · Sergio Casas · Raquel Urtasun | N/A | Code |
| Gated Multi-Resolution Transfer Network for Burst Restoration and Enhancement | Nancy Mehta · Akshay Dudhane · Subrahmanyam Murala · Syed Waqas Zamir · Salman Khan · Fahad Shahbaz Khan | N/A | Code |
| Class Adaptive Network Calibration | Bingyuan Liu · Jérôme Rony · Adrian Galdran · Jose Dolz · Ismail Ben Ayed | N/A | Code |
| PROB: Probabilistic Objectness for Open World Object Detection | Orr Zohar · Kuan-Chieh Wang · Serena Yeung | N/A | Code |
| Matching Is Not Enough: A Two-Stage Framework for Category-Agnostic Pose Estimation | Min Shi · Zihao Huang · Xianzheng Ma · Xiaowei Hu · Zhiguo Cao | N/A | Code |
| HyperCUT: Video Sequence From a Single Blurry Image Using Unsupervised Ordering | Bang-Dang Pham · Phong Tran · Anh Tran · Cuong Pham · Rang Nguyen · Minh Hoai | N/A | Code |
| On the Effects of Self-Supervision and Contrastive Alignment in Deep Multi-View Clustering | Daniel J. Trosten · Sigurd Løkse · Robert Jenssen · Michael C. Kampffmeyer | N/A | Code |
| Visual Prompt Tuning for Generative Transfer Learning | Kihyuk Sohn · Huiwen Chang · José Lezama · Luisa Polania · Han Zhang · Yuan Hao · Irfan Essa · Lu Jiang | N/A | Code |
| Towards End-to-End Generative Modeling of Long Videos With Memory-Efficient Bidirectional Transformers | Jaehoon Yoo · Semin Kim · Doyup Lee · Chiheon Kim · Seunghoon Hong | N/A | Code |
| MAGVIT: Masked Generative Video Transformer | Lijun Yu · Yong Cheng · Kihyuk Sohn · José Lezama · Han Zhang · Huiwen Chang · Alexander G. Hauptmann · Ming-Hsuan Yang · Yuan Hao · Irfan Essa · Lu Jiang | N/A | Code |
| NICO++: Towards Better Benchmarking for Domain Generalization | Xingxuan Zhang · Yue He · Renzhe Xu · Han Yu · Zheyan Shen · Peng Cui | N/A | Code |
| Gradient Norm Aware Minimization Seeks First-Order Flatness and Improves Generalization | Xingxuan Zhang · Renzhe Xu · Han Yu · Hao Zou · Peng Cui | N/A | Code |
| All-in-Focus Imaging From Event Focal Stack | Hanyue Lou · Minggui Teng · Yixin Yang · Boxin Shi | N/A | Code |
| Clover: Towards a Unified Video-Language Alignment and Fusion Model | Jingjia Huang · Yinan Li · Jiashi Feng · Xinglong Wu · Xiaoshuai Sun · Rongrong Ji | N/A | Code |
| UMat: Uncertainty-Aware Single Image High Resolution Material Capture | Carlos Rodriguez-Pardo · Henar Domínguez-Elvira · David Pascual-Hernández · Elena Garces | N/A | Code |
| Polarimetric iToF: Measuring High-Fidelity Depth Through Scattering Media | Daniel S. Jeon · Andréas Meuleman · Seung-Hwan Baek · Min H. Kim | N/A | Code |
| Freestyle Layout-to-Image Synthesis | Han Xue · Zhiwu Huang · Qianru Sun · Li Song · Wenjun Zhang | N/A | Code |
| Nighttime Smartphone Reflective Flare Removal Using Optical Center Symmetry Prior | Yuekun Dai · Yihang Luo · Shangchen Zhou · Chongyi Li · Chen Change Loy | N/A | Code |
| Meta Omnium: A Benchmark for General-Purpose Learning-To-Learn | Ondrej Bohdal · Yinbing Tian · Yongshuo Zong · Ruchika Chavhan · Da Li · Henry Gouk · Li Guo · Timothy Hospedales | N/A | Code |
| EXCALIBUR: Encouraging and Evaluating Embodied Exploration | Hao Zhu · Raghav Kapoor · So Yeon Min · Winson Han · Jiatai Li · Kaiwen Geng · Graham Neubig · Yonatan Bisk · Aniruddha Kembhavi · Luca Weihs | N/A | Code |
| Detection of Out-of-Distribution Samples Using Binary Neuron Activation Patterns | Bartłomiej Olber · Krystian Radlak · Adam Popowicz · Michal Szczepankiewicz · Krystian Chachuła | N/A | Code |
| Shakes on a Plane: Unsupervised Depth Estimation From Unstabilized Photography | Ilya Chugunov · Yuxuan Zhang · Felix Heide | N/A | Code |
| JacobiNeRF: NeRF Shaping With Mutual Information Gradients | Xiaomeng Xu · Yanchao Yang · Kaichun Mo · Boxiao Pan · Li Yi · Leonidas Guibas | N/A | Code |
| MixMAE: Mixed and Masked Autoencoder for Efficient Pretraining of Hierarchical Vision Transformers | Jihao Liu · Xin Huang · Jinliang Zheng · Yu Liu · Hongsheng Li | N/A | Code |
| Synthesizing Photorealistic Virtual Humans Through Cross-Modal Disentanglement | Siddarth Ravichandran · Ondřej Texler · Dimitar Dinev · Hyun Jae Kang | N/A | Code |
| CIMI4D: A Large Multimodal Climbing Motion Dataset Under Human-Scene Interactions | Ming Yan · Xin Wang · Yudi Dai · Siqi Shen · Chenglu Wen · Lan Xu · Yuexin Ma · Cheng Wang | N/A | Code |
| SLOPER4D: A Scene-Aware Dataset for Global 4D Human Pose Estimation in Urban Environments | Yudi Dai · Yitai Lin · Xiping Lin · Chenglu Wen · Lan Xu · Hongwei Yi · Siqi Shen · Yuexin Ma · Cheng Wang | N/A | Code |
| Viewpoint Equivariance for Multi-View 3D Object Detection | Dian Chen · Jie Li · Vitor Guizilini · Rares Andrei Ambrus · Adrien Gaidon | N/A | Code |
| Balanced Product of Calibrated Experts for Long-Tailed Recognition | Emanuel Sanchez Aimar · Arvi Jonnarth · Michael Felsberg · Marco Kuhlmann | N/A | Code |
| Robust Mean Teacher for Continual and Gradual Test-Time Adaptation | Mario Döbler · Robert A. Marsden · Bin Yang | N/A | Code |
| Positive-Augmented Contrastive Learning for Image and Video Captioning Evaluation | Sara Sarto · Manuele Barraco · Marcella Cornia · Lorenzo Baraldi · Rita Cucchiara | N/A | Code |
| BITE: Beyond Priors for Improved Three-D Dog Pose Estimation | Nadine Rüegg · Shashank Tripathi · Konrad Schindler · Michael J. Black · Silvia Zuffi | N/A | Code |
| SparseFusion: Distilling View-Conditioned Diffusion for 3D Reconstruction | Zhizhuo Zhou · Shubham Tulsiani | N/A | Code |
| PeakConv: Learning Peak Receptive Field for Radar Semantic Segmentation | Liwen Zhang · Xinyan Zhang · Youcheng Zhang · Yufei Guo · Yuanpei Chen · Xuhui Huang · Zhe Ma | N/A | Code |
| Masked Wavelet Representation for Compact Neural Radiance Fields | Daniel Rho · Byeonghyeon Lee · Seungtae Nam · Joo Chan Lee · Jong Hwan Ko · Eunbyung Park | N/A | Code |
| Guided Depth Super-Resolution by Deep Anisotropic Diffusion | Nando Metzger · Rodrigo Caye Daudt · Konrad Schindler | N/A | Code |
| Masked Images Are Counterfactual Samples for Robust Fine-Tuning | Yao Xiao · Ziyi Tang · Pengxu Wei · Cong Liu · Liang Lin | N/A | Code |
| Unsupervised Deep Probabilistic Approach for Partial Point Cloud Registration | Guofeng Mei · Hao Tang · Xiaoshui Huang · Weijie Wang · Juan Liu · Jian Zhang · Luc Van Gool · Qiang Wu | N/A | Code |
| ECON: Explicit Clothed Humans Optimized via Normal Integration | Yuliang Xiu · Jinlong Yang · Xu Cao · Dimitrios Tzionas · Michael J. Black | N/A | Code |
| GEN: Pushing the Limits of Softmax-Based Out-of-Distribution Detection | Xixi Liu · Yaroslava Lochman · Christopher Zach | N/A | Code |
| OCTET: Object-Aware Counterfactual Explanations | Mehdi Zemni · Mickaël Chen · Éloi Zablocki · Hédi Ben-Younes · Patrick Pérez · Matthieu Cord | N/A | Code |
| Consistent View Synthesis With Pose-Guided Diffusion Models | Hung-Yu Tseng · Qinbo Li · Changil Kim · Suhib Alsisan · Jia-Bin Huang · Johannes Kopf | N/A | Code |
| GFPose: Learning 3D Human Pose Prior With Gradient Fields | Hai Ci · Mingdong Wu · Wentao Zhu · Xiaoxuan Ma · Hao Dong · Fangwei Zhong · Yizhou Wang | N/A | Code |
| Bayesian Posterior Approximation With Stochastic Ensembles | Oleksandr Balabanov · Bernhard Mehlig · Hampus Linander | N/A | Code |
| Spatio-Focal Bidirectional Disparity Estimation From a Dual-Pixel Image | Donggun Kim · Hyeonjoong Jang · Inchul Kim · Min H. Kim | N/A | Code |
| Octree Guided Unoriented Surface Reconstruction | Chamin Hewa Koneputugodage · Yizhak Ben-Shabat · Stephen Gould | N/A | Code |
| HAAV: Hierarchical Aggregation of Augmented Views for Image Captioning | Chia-Wen Kuo · Zsolt Kira | N/A | Code |
| SUDS: Scalable Urban Dynamic Scenes | Haithem Turki · Jason Y. Zhang · Francesco Ferroni · Deva Ramanan | N/A | Code |
| Harmonious Feature Learning for Interactive Hand-Object Pose Estimation | Zhifeng Lin · Changxing Ding · Huan Yao · Zengsheng Kuang · Shaoli Huang | N/A | Code |
| Modernizing Old Photos Using Multiple References via Photorealistic Style Transfer | Agus Gunawan · Soo Ye Kim · Hyeonjun Sim · Jae-Ho Lee · Munchurl Kim | N/A | Code |
| Trainable Projected Gradient Method for Robust Fine-Tuning | Junjiao Tian · Zecheng He · Xiaoliang Dai · Chih-Yao Ma · Yen-Cheng Liu · Zsolt Kira | N/A | Code |
| OReX: Object Reconstruction From Planar Cross-Sections Using Neural Fields | Haim Sawdayee · Amir Vaxman · Amit H. Bermano | N/A | Code |
| CARTO: Category and Joint Agnostic Reconstruction of ARTiculated Objects | Nick Heppert · Zubair Irshad · Sergey Zakharov · Katherine Liu · Rares Andrei Ambrus · Jeannette Bohg · Abhinav Valada · Thomas Kollar | N/A | Code |
| ACR: Attention Collaboration-Based Regressor for Arbitrary Two-Hand Reconstruction | Zhengdi Yu · Shaoli Huang · Chen Fang · Toby P. Breckon · Jue Wang | N/A | Code |
| Perception and Semantic Aware Regularization for Sequential Confidence Calibration | Zhenghua Peng · Yu Luo · Tianshui Chen · Keke Xu · Shuangping Huang | N/A | Code |
| Crowd3D: Towards Hundreds of People Reconstruction From a Single Image | Hao Wen · Jing Huang · Huili Cui · Haozhe Lin · Yu-Kun Lai · Lu Fang · Kun Li | N/A | Code |
| ZegCLIP: Towards Adapting CLIP for Zero-Shot Semantic Segmentation | Ziqin Zhou · Yinjie Lei · Bowen Zhang · Lingqiao Liu · Yifan Liu | N/A | Code |
| Learning Semantic-Aware Disentangled Representation for Flexible 3D Human Body Editing | Xiaokun Sun · Qiao Feng · Xiongzheng Li · Jinsong Zhang · Yu-Kun Lai · Jingyu Yang · Kun Li | N/A | Code |
| Skinned Motion Retargeting With Residual Perception of Motion Semantics & Geometry | Jiaxu Zhang · Junwu Weng · Di Kang · Fang Zhao · Shaoli Huang · Xuefei Zhe · Linchao Bao · Ying Shan · Jue Wang · Zhigang Tu | N/A | Code |
| Unknown Sniffer for Object Detection: Don’t Turn a Blind Eye to Unknown Objects | Wenteng Liang · Feng Xue · Yihao Liu · Guofeng Zhong · Anlong Ming | N/A | Code |
| RangeViT: Towards Vision Transformers for 3D Semantic Segmentation in Autonomous Driving | Angelika Ando · Spyros Gidaris · Andrei Bursuc · Gilles Puy · Alexandre Boulch · Renaud Marlet | N/A | Code |
| 3D-Aware Facial Landmark Detection via Multi-View Consistent Training on Synthetic Data | Libing Zeng · Lele Chen · Wentao Bao · Zhong Li · Yi Xu · Junsong Yuan · Nima Khademi Kalantari | N/A | Code |
| Best of Both Worlds: Multimodal Contrastive Learning With Tabular and Imaging Data | Paul Hager · Martin J. Menten · Daniel Rueckert | N/A | Code |
| JRDB-Pose: A Large-Scale Dataset for Multi-Person Pose Estimation and Tracking | Edward Vendrow · Tho Le · Jianfei Cai · Hamid Rezatofighi | N/A | Code |
| Consistent Direct Time-of-Flight Video Depth Super-Resolution | Zhanghao Sun · Wei Ye · Jinhui Xiong · Gyeongmin Choe · Jialiang Wang · Shuochen Su · Rakesh Ranjan | N/A | Code |
| Correlational Image Modeling for Self-Supervised Visual Pre-Training | Wei Li · Jiahao Xie · Chen Change Loy | N/A | Code |
| CelebV-Text: A Large-Scale Facial Text-Video Dataset | Jianhui Yu · Hao Zhu · Liming Jiang · Chen Change Loy · Weidong Cai · Wayne Wu | N/A | Code |
| Are Binary Annotations Sufficient? Video Moment Retrieval via Hierarchical Uncertainty-Based Active Learning | Wei Ji · Renjie Liang · Zhedong Zheng · Wenqiao Zhang · Shengyu Zhang · Juncheng Li · Mengze Li · Tat-Seng Chua | N/A | Code |
| Learning 3D Scene Priors With 2D Supervision | Yinyu Nie · Angela Dai · Xiaoguang Han · Matthias Nießner | N/A | Code |
| Generating Aligned Pseudo-Supervision From Non-Aligned Data for Image Restoration in Under-Display Camera | Ruicheng Feng · Chongyi Li · Huaijin Chen · Shuai Li · Jinwei Gu · Chen Change Loy | N/A | Code |
| Siamese DETR | Zeren Chen · Gengshi Huang · Wei Li · Jianing Teng · Kun Wang · Jing Shao · Chen Change Loy · Lu Sheng | N/A | Code |
| Panoptic Video Scene Graph Generation | Jingkang Yang · Wenxuan Peng · Xiangtai Li · Zujin Guo · Liangyu Chen · Bo Li · Zheng Ma · Kaiyang Zhou · Wayne Zhang · Chen Change Loy · Ziwei Liu | N/A | Code |
| Randomized Adversarial Training via Taylor Expansion | Gaojie Jin · Xinping Yi · Dengyu Wu · Ronghui Mu · Xiaowei Huang | N/A | Code |
| Task Residual for Tuning Vision-Language Models | Tao Yu · Zhihe Lu · Xin Jin · Zhibo Chen · Xinchao Wang | N/A | Code |
| PACO: Parts and Attributes of Common Objects | Vignesh Ramanathan · Anmol Kalia · Vladan Petrovic · Yi Wen · Baixue Zheng · Baishan Guo · Rui Wang · Aaron Marquez · Rama Kovvuri · Abhishek Kadian · Amir Mousavi · Yiwen Song · Abhimanyu Dubey · Dhruv Mahajan | N/A | Code |
| CloSET: Modeling Clothed Humans on Continuous Surface With Explicit Template Decomposition | Hongwen Zhang · Siyou Lin · Ruizhi Shao · Yuxiang Zhang · Zerong Zheng · Han Huang · Yandong Guo · Yebin Liu | N/A | Code |
| Transductive Few-Shot Learning With Prototype-Based Label Propagation by Iterative Graph Refinement | Hao Zhu · Piotr Koniusz | N/A | Code |
| DualVector: Unsupervised Vector Font Synthesis With Dual-Part Representation | Ying-Tian Liu · Zhifei Zhang · Yuan-Chen Guo · Matthew Fisher · Zhaowen Wang · Song-Hai Zhang | N/A | Code |
| Invertible Neural Skinning | Yash Kant · Aliaksandr Siarohin · Riza Alp Guler · Menglei Chai · Jian Ren · Sergey Tulyakov · Igor Gilitschenski | N/A | Code |
| Next3D: Generative Neural Texture Rasterization for 3D-Aware Head Avatars | Jingxiang Sun · Xuan Wang · Lizhen Wang · Xiaoyu Li · Yong Zhang · Hongwen Zhang · Yebin Liu | N/A | Code |
| Is BERT Blind? Exploring the Effect of Vision-and-Language Pretraining on Visual Language Understanding | Morris Alper · Michael Fiman · Hadar Averbuch-Elor | N/A | Code |
| ConStruct-VL: Data-Free Continual Structured VL Concepts Learning | James Seale Smith · Paola Cascante-Bonilla · Assaf Arbelle · Donghyun Kim · Rameswar Panda · David Cox · Diyi Yang · Zsolt Kira · Rogerio Feris · Leonid Karlinsky | N/A | Code |
| LINe: Out-of-Distribution Detection by Leveraging Important Neurons | Yong Hyun Ahn · Gyeong-Moon Park · Seong Tae Kim | N/A | Code |
| Multimodality Helps Unimodality: Cross-Modal Few-Shot Learning With Multimodal Models | Zhiqiu Lin · Samuel Yu · Zhiyi Kuang · Deepak Pathak · Deva Ramanan | N/A | Code |
| Panoptic Lifting for 3D Scene Understanding With Neural Fields | Yawar Siddiqui · Lorenzo Porzi · Samuel Rota Bulò · Norman Müller · Matthias Nießner · Angela Dai · Peter Kontschieder | N/A | Code |
| GamutMLP: A Lightweight MLP for Color Loss Recovery | Hoang M. Le · Brian Price · Scott Cohen · Michael S. Brown | N/A | Code |
| DIFu: Depth-Guided Implicit Function for Clothed Human Reconstruction | Dae-Young Song · HeeKyung Lee · Jeongil Seo · Donghyeon Cho | N/A | Code |
| NLOST: Non-Line-of-Sight Imaging With Transformer | Yue Li · Jiayong Peng · Juntian Ye · Yueyi Zhang · Feihu Xu · Zhiwei Xiong | N/A | Code |
| SeaThru-NeRF: Neural Radiance Fields in Scattering Media | Deborah Levy · Amit Peleg · Naama Pearl · Dan Rosenbaum · Derya Akkaynak · Simon Korman · Tali Treibitz | N/A | Code |
| Omni3D: A Large Benchmark and Model for 3D Object Detection in the Wild | Garrick Brazil · Abhinav Kumar · Julian Straub · Nikhila Ravi · Justin Johnson · Georgia Gkioxari | N/A | Code |
| Learning on Gradients: Generalized Artifacts Representation for GAN-Generated Images Detection | Chuangchuang Tan · Yao Zhao · Shikui Wei · Guanghua Gu · Yunchao Wei | N/A | Code |
| LayoutDM: Discrete Diffusion Model for Controllable Layout Generation | Naoto Inoue · Kotaro Kikuchi · Edgar Simo-Serra · Mayu Otani · Kota Yamaguchi | N/A | Code |
| Learning Customized Visual Models With Retrieval-Augmented Knowledge | Haotian Liu · Kilho Son · Jianwei Yang · Ce Liu · Jianfeng Gao · Yong Jae Lee · Chunyuan Li | N/A | Code |
| MAIR: Multi-View Attention Inverse Rendering With 3D Spatially-Varying Lighting Estimation | JunYong Choi · SeokYeong Lee · Haesol Park · Seung-Won Jung · Ig-Jae Kim · Junghyun Cho | N/A | Code |
| Generalizing Dataset Distillation via Deep Generative Prior | George Cazenavette · Tongzhou Wang · Antonio Torralba · Alexei A. Efros · Jun-Yan Zhu | N/A | Code |
| Polarized Color Image Denoising | Zhuoxiao Li · Haiyang Jiang · Mingdeng Cao · Yinqiang Zheng | N/A | Code |
| Score Jacobian Chaining: Lifting Pretrained 2D Diffusion Models for 3D Generation | Haochen Wang · Xiaodan Du · Jiahao Li · Raymond A. Yeh · Greg Shakhnarovich | N/A | Code |
| FJMP: Factorized Joint Multi-Agent Motion Prediction Over Learned Directed Acyclic Interaction Graphs | Luke Rowe · Martin Ethier · Eli-Henry Dykhne · Krzysztof Czarnecki | N/A | Code |
| Mask-Free Video Instance Segmentation | Lei Ke · Martin Danelljan · Henghui Ding · Yu-Wing Tai · Chi-Keung Tang · Fisher Yu | N/A | Code |
| OVTrack: Open-Vocabulary Multiple Object Tracking | Siyuan Li · Tobias Fischer · Lei Ke · Henghui Ding · Martin Danelljan · Fisher Yu | N/A | Code |
| LightPainter: Interactive Portrait Relighting With Freehand Scribble | Yiqun Mei · He Zhang · Xuaner Zhang · Jianming Zhang · Zhixin Shu · Yilin Wang · Zijun Wei · Shi Yan · HyunJoon Jung · Vishal M. Patel | N/A | Code |
| Towards Scalable Neural Representation for Diverse Videos | Bo He · Xitong Yang · Hanyu Wang · Zuxuan Wu · Hao Chen · Shuaiyi Huang · Yixuan Ren · Ser-Nam Lim · Abhinav Shrivastava | N/A | Code |
| Teaching Matters: Investigating the Role of Supervision in Vision Transformers | Matthew Walmer · Saksham Suri · Kamal Gupta · Abhinav Shrivastava | N/A | Code |
| FlexNeRF: Photorealistic Free-Viewpoint Rendering of Moving Humans From Sparse Views | Vinoj Jayasundara · Amit Agrawal · Nicolas Heron · Abhinav Shrivastava · Larry S. Davis | N/A | Code |
| Leveraging Temporal Context in Low Representational Power Regimes | Camilo L. Fosco · SouYoung Jin · Emilie Josephs · Aude Oliva | N/A | Code |
| Painting 3D Nature in 2D: View Synthesis of Natural Scenes From a Single Semantic Mask | Shangzhan Zhang · Sida Peng · Tianrun Chen · Linzhan Mou · Haotong Lin · Kaicheng Yu · Yiyi Liao · Xiaowei Zhou | N/A | Code |
| Align and Attend: Multimodal Summarization With Dual Contrastive Losses | Bo He · Jun Wang · Jielin Qiu · Trung Bui · Abhinav Shrivastava · Zhaowen Wang | N/A | Code |
| SimpSON: Simplifying Photo Cleanup With Single-Click Distracting Object Segmentation Network | Chuong Huynh · Yuqian Zhou · Zhe Lin · Connelly Barnes · Eli Shechtman · Sohrab Amirghodsi · Abhinav Shrivastava | N/A | Code |
| NIRVANA: Neural Implicit Representations of Videos With Adaptive Networks and Autoregressive Patch-Wise Modeling | Shishira R Maiya · Sharath Girish · Max Ehrlich · Hanyu Wang · Kwot Sin Lee · Patrick Poirson · Pengxiang Wu · Chen Wang · Abhinav Shrivastava | N/A | Code |
| Seeing Beyond the Brain: Conditional Diffusion Model With Sparse Masked Modeling for Vision Decoding | Zijiao Chen · Jiaxin Qing · Tiange Xiang · Wan Lin Yue · Juan Helen Zhou | N/A | Code |
| Position-Guided Text Prompt for Vision-Language Pre-Training | Jinpeng Wang · Pan Zhou · Mike Zheng Shou · Shuicheng Yan | N/A | Code |
| Progressive Spatio-Temporal Alignment for Efficient Event-Based Motion Estimation | Xueyan Huang · Yueyi Zhang · Zhiwei Xiong | N/A | Code |
| MIST: Multi-Modal Iterative Spatial-Temporal Transformer for Long-Form Video Question Answering | Difei Gao · Luowei Zhou · Lei Ji · Linchao Zhu · Yi Yang · Mike Zheng Shou | N/A | Code |
| Histopathology Whole Slide Image Analysis With Heterogeneous Graph Representation Learning | Tsai Hor Chan · Fernando Julio Cendra · Lan Ma · Guosheng Yin · Lequan Yu | N/A | Code |
| Making Vision Transformers Efficient From a Token Sparsification View | Shuning Chang · Pichao Wang · Ming Lin · Fan Wang · David Junhao Zhang · Rong Jin · Mike Zheng Shou | N/A | Code |
| Leverage Interactive Affinity for Affordance Learning | Hongchen Luo · Wei Zhai · Jing Zhang · Yang Cao · Dacheng Tao | N/A | Code |
| Uncertainty-Aware Optimal Transport for Semantically Coherent Out-of-Distribution Detection | Fan Lu · Kai Zhu · Wei Zhai · Kecheng Zheng · Yang Cao | N/A | Code |
| HARP: Personalized Hand Reconstruction From a Monocular RGB Video | Korrawe Karunratanakul · Sergey Prokudin · Otmar Hilliges · Siyu Tang | N/A | Code |
| Towards Effective Visual Representations for Partial-Label Learning | Shiyu Xia · Jiaqi Lv · Ning Xu · Gang Niu · Xin Geng | N/A | Code |
| SFD2: Semantic-Guided Feature Detection and Description | Fei Xue · Ignas Budvytis · Roberto Cipolla | N/A | Code |
| MetaPortrait: Identity-Preserving Talking Head Generation With Fast Personalized Adaptation | Bowen Zhang · Chenyang Qi · Pan Zhang · Bo Zhang · HsiangTao Wu · Dong Chen · Qifeng Chen · Yong Wang · Fang Wen | N/A | Code |
| The Dialog Must Go On: Improving Visual Dialog via Generative Self-Training | Gi-Cheon Kang · Sungdong Kim · Jin-Hwa Kim · Donghyun Kwak · Byoung-Tak Zhang | N/A | Code |
| Temporal Interpolation Is All You Need for Dynamic Neural Radiance Fields | Sungheon Park · Minjung Son · Seokhwan Jang · Young Chun Ahn · Ji-Yeon Kim · Nahyup Kang | N/A | Code |
| DiGA: Distil To Generalize and Then Adapt for Domain Adaptive Semantic Segmentation | Fengyi Shen · Akhil Gurram · Ziyuan Liu · He Wang · Alois Knoll | N/A | Code |
| Multimodal Prompting With Missing Modalities for Visual Recognition | Yi-Lun Lee · Yi-Hsuan Tsai · Wei-Chen Chiu · Chen-Yu Lee | N/A | Code |
| On Calibrating Semantic Segmentation Models: Analyses and an Algorithm | Dongdong Wang · Boqing Gong · Liqiang Wang | N/A | Code |
| IMP: Iterative Matching and Pose Estimation With Adaptive Pooling | Fei Xue · Ignas Budvytis · Roberto Cipolla | N/A | Code |
| Grid-Guided Neural Radiance Fields for Large Urban Scenes | Linning Xu · Yuanbo Xiangli · Sida Peng · Xingang Pan · Nanxuan Zhao · Christian Theobalt · Bo Dai · Dahua Lin | N/A | Code |
| Neural Voting Field for Camera-Space 3D Hand Pose Estimation | Lin Huang · Chung-Ching Lin · Kevin Lin · Lin Liang · Lijuan Wang · Junsong Yuan · Zicheng Liu | N/A | Code |
| Dense Network Expansion for Class Incremental Learning | Zhiyuan Hu · Yunsheng Li · Jiancheng Lyu · Dashan Gao · Nuno Vasconcelos | N/A | Code |
| FashionSAP: Symbols and Attributes Prompt for Fine-Grained Fashion Vision-Language Pre-Training | Yunpeng Han · Lisai Zhang · Qingcai Chen · Zhijian Chen · Zhonghua Li · Jianxin Yang · Zhao Cao | N/A | Code |
| Batch Model Consolidation: A Multi-Task Model Consolidation Framework | Iordanis Fostiropoulos · Jiaye Zhu · Laurent Itti | N/A | Code |
| Open-Vocabulary Attribute Detection | María A. Bravo · Sudhanshu Mittal · Simon Ging · Thomas Brox | N/A | Code |
| Unite and Conquer: Plug & Play Multi-Modal Synthesis Using Diffusion Models | Nithin Gopalakrishnan Nair · Wele Gedara Chaminda Bandara · Vishal M. Patel | N/A | Code |
| BEVHeight: A Robust Framework for Vision-Based Roadside 3D Object Detection | Lei Yang · Kaicheng Yu · Tao Tang · Jun Li · Kun Yuan · Li Wang · Xinyu Zhang · Peng Chen | N/A | Code |
| Re2TAL: Rewiring Pretrained Video Backbones for Reversible Temporal Action Localization | Chen Zhao · Shuming Liu · Karttikeya Mangalam · Bernard Ghanem | N/A | Code |
| C-SFDA: A Curriculum Learning Aided Self-Training Framework for Efficient Source Free Domain Adaptation | Nazmul Karim · Niluthpol Chowdhury Mithun · Abhinav Rajvanshi · Han-pang Chiu · Supun Samarasekera · Nazanin Rahnavard | N/A | Code |
| Are Deep Neural Networks SMARTer Than Second Graders? | Anoop Cherian · Kuan-Chuan Peng · Suhas Lohit · Kevin A. Smith · Joshua B. Tenenbaum | N/A | Code |
| Persistent Nature: A Generative Model of Unbounded 3D Worlds | Lucy Chai · Richard Tucker · Zhengqi Li · Phillip Isola · Noah Snavely | N/A | Code |
| InternImage: Exploring Large-Scale Vision Foundation Models With Deformable Convolutions | Wenhai Wang · Jifeng Dai · Zhe Chen · Zhenhang Huang · Zhiqi Li · Xizhou Zhu · Xiaowei Hu · Tong Lu · Lewei Lu · Hongsheng Li · Xiaogang Wang · Yu Qiao | N/A | Code |
| Learning To Fuse Monocular and Multi-View Cues for Multi-Frame Depth Estimation in Dynamic Scenes | Rui Li · Dong Gong · Wei Yin · Hao Chen · Yu Zhu · Kaixuan Wang · Xiaozhi Chen · Jinqiu Sun · Yanning Zhang | N/A | Code |
| Benchmarking Self-Supervised Learning on Diverse Pathology Datasets | Mingu Kang · Heon Song · Seonwook Park · Donggeun Yoo · Sérgio Pereira | N/A | Code |
| Knowledge Distillation for 6D Pose Estimation by Aligning Distributions of Local Predictions | Shuxuan Guo · Yinlin Hu · Jose M. Alvarez · Mathieu Salzmann | N/A | Code |
| Self-Supervised Representation Learning for CAD | Benjamin T. Jones · Michael Hu · Milin Kodnongbua · Vladimir G. Kim · Adriana Schulz | N/A | Code |
| SparseViT: Revisiting Activation Sparsity for Efficient High-Resolution Vision Transformer | Xuanyao Chen · Zhijian Liu · Haotian Tang · Li Yi · Hang Zhao · Song Han | N/A | Code |
| Neural Pixel Composition for 3D-4D View Synthesis From Multi-Views | Aayush Bansal · Michael Zollhöfer | N/A | Code |
| ViP3D: End-to-End Visual Trajectory Prediction via 3D Agent Queries | Junru Gu · Chenxu Hu · Tianyuan Zhang · Xuanyao Chen · Yilun Wang · Yue Wang · Hang Zhao | N/A | Code |
| AdaMAE: Adaptive Masking for Efficient Spatiotemporal Learning With Masked Autoencoders | Wele Gedara Chaminda Bandara · Naman Patel · Ali Gholami · Mehdi Nikkhah · Motilal Agrawal · Vishal M. Patel | N/A | Code |
| Masked Scene Contrast: A Scalable Framework for Unsupervised 3D Representation Learning | Xiaoyang Wu · Xin Wen · Xihui Liu · Hengshuang Zhao | N/A | Code |
| RIFormer: Keep Your Vision Backbone Effective but Removing Token Mixer | Jiahao Wang · Songyang Zhang · Yong Liu · Taiqiang Wu · Yujiu Yang · Xihui Liu · Kai Chen · Ping Luo · Dahua Lin | N/A | Code |
| TeSLA: Test-Time Self-Learning With Automatic Adversarial Augmentation | Devavrat Tomar · Guillaume Vray · Behzad Bozorgtabar · Jean-Philippe Thiran | N/A | Code |
| ObjectMatch: Robust Registration Using Canonical Object Correspondences | Can Gümeli · Angela Dai · Matthias Nießner | N/A | Code |
| Dream3D: Zero-Shot Text-to-3D Synthesis Using 3D Shape Prior and Text-to-Image Diffusion Models | Jiale Xu · Xintao Wang · Weihao Cheng · Yan-Pei Cao · Ying Shan · Xiaohu Qie · Shenghua Gao | N/A | Code |
| SurfelNeRF: Neural Surfel Radiance Fields for Online Photorealistic Reconstruction of Indoor Scenes | Yiming Gao · Yan-Pei Cao · Ying Shan | N/A | Code |
| Object Detection With Self-Supervised Scene Adaptation | Zekun Zhang · Minh Hoai | N/A | Code |
| Megahertz Light Steering Without Moving Parts | Adithya Pediredla · Srinivasa G. Narasimhan · Maysamreza Chamanzar · Ioannis Gkioulekas | N/A | Code |
| ISBNet: A 3D Point Cloud Instance Segmentation Network With Instance-Aware Sampling and Box-Aware Dynamic Convolution | Tuan Duc Ngo · Binh-Son Hua · Khoi Nguyen | N/A | Code |
| Rate Gradient Approximation Attack Threats Deep Spiking Neural Networks | Tong Bu · Jianhao Ding · Zecheng Hao · Zhaofei Yu | N/A | Code |
| PIVOT: Prompting for Video Continual Learning | Andrés Villa · Juan León Alcázar · Motasem Alfarra · Kumail Alhamoud · Julio Hurtado · Fabian Caba Heilbron · Alvaro Soto · Bernard Ghanem | N/A | Code |
| ARO-Net: Learning Implicit Fields From Anchored Radial Observations | Yizhi Wang · Zeyu Huang · Ariel Shamir · Hui Huang · Hao Zhang · Ruizhen Hu | N/A | Code |
| Parallel Diffusion Models of Operator and Image for Blind Inverse Problems | Hyungjin Chung · Jeongsol Kim · Sehui Kim · Jong Chul Ye | N/A | Code |
| Solving 3D Inverse Problems Using Pre-Trained 2D Diffusion Models | Hyungjin Chung · Dohoon Ryu · Michael T. McCann · Marc L. Klasky · Jong Chul Ye | N/A | Code |
| Affordance Grounding From Demonstration Video To Target Image | Joya Chen · Difei Gao · Kevin Qinghong Lin · Mike Zheng Shou | N/A | Code |
| Learning Procedure-Aware Video Representation From Instructional Videos and Their Narrations | Yiwu Zhong · Licheng Yu · Yang Bai · Shangwen Li · Xueting Yan · Yin Li | N/A | Code |
| YOLOv7: Trainable Bag-of-Freebies Sets New State-of-the-Art for Real-Time Object Detectors | Chien-Yao Wang · Alexey Bochkovskiy · Hong-Yuan Mark Liao | N/A | Code |
| OmniCity: Omnipotent City Understanding With Multi-Level and Multi-View Images | Weijia Li · Yawen Lai · Linning Xu · Yuanbo Xiangli · Jinhua Yu · Conghui He · Gui-Song Xia · Dahua Lin | N/A | Code |
| Object Discovery From Motion-Guided Tokens | Zhipeng Bao · Pavel Tokmakov · Yu-Xiong Wang · Adrien Gaidon · Martial Hebert | N/A | Code |
| MP-Former: Mask-Piloted Transformer for Image Segmentation | Hao Zhang · Feng Li · Huaizhe Xu · Shijia Huang · Shilong Liu · Lionel M. Ni · Lei Zhang | N/A | Code |
| Disentangling Writer and Character Styles for Handwriting Generation | Gang Dai · Yifan Zhang · Qingfeng Wang · Qing Du · Zhuliang Yu · Zhuoman Liu · Shuangping Huang | N/A | Code |
| Building Rearticulable Models for Arbitrary 3D Objects From 4D Point Clouds | Shaowei Liu · Saurabh Gupta · Shenlong Wang | N/A | Code |
| Gated Stereo: Joint Depth Estimation From Gated and Wide-Baseline Active Stereo Cues | Stefanie Walz · Mario Bijelic · Andrea Ramazzina · Amanpreet Walia · Fahim Mannan · Felix Heide | N/A | Code |
| Dynamically Instance-Guided Adaptation: A Backward-Free Approach for Test-Time Domain Adaptive Semantic Segmentation | Wei Wang · Zhun Zhong · Weijie Wang · Xi Chen · Charles Ling · Boyu Wang · Nicu Sebe | N/A | Code |
| Perspective Fields for Single Image Camera Calibration | Linyi Jin · Jianming Zhang · Yannick Hold-Geoffroy · Oliver Wang · Kevin Blackburn-Matzen · Matthew Sticha · David F. Fouhey | N/A | Code |
| Vision Transformers Are Parameter-Efficient Audio-Visual Learners | Yan-Bo Lin · Yi-Lin Sung · Jie Lei · Mohit Bansal · Gedas Bertasius | N/A | Code |
| Efficient Semantic Segmentation by Altering Resolutions for Compressed Videos | Yubin Hu · Yuze He · Yanghao Li · Jisheng Li · Yuxing Han · Jiangtao Wen · Yong-Jin Liu | N/A | Code |
| DisWOT: Student Architecture Search for Distillation WithOut Training | Peijie Dong · Lujun Li · Zimian Wei | N/A | Code |
| Activating More Pixels in Image Super-Resolution Transformer | Xiangyu Chen · Xintao Wang · Jiantao Zhou · Yu Qiao · Chao Dong | N/A | Code |
| You Only Segment Once: Towards Real-Time Panoptic Segmentation | Jie Hu · Linyan Huang · Tianhe Ren · Shengchuan Zhang · Rongrong Ji · Liujuan Cao | N/A | Code |
| PA&DA: Jointly Sampling Path and Data for Consistent NAS | Shun Lu · Yu Hu · Longxing Yang · Zihao Sun · Jilin Mei · Jianchao Tan · Chengru Song | N/A | Code |
| NeuralUDF: Learning Unsigned Distance Fields for Multi-View Reconstruction of Surfaces With Arbitrary Topologies | Xiaoxiao Long · Cheng Lin · Lingjie Liu · Yuan Liu · Peng Wang · Christian Theobalt · Taku Komura · Wenping Wang | N/A | Code |
| Towards Universal Fake Image Detectors That Generalize Across Generative Models | Utkarsh Ojha · Yuheng Li · Yong Jae Lee | N/A | Code |
| FLAG3D: A 3D Fitness Activity Dataset With Language Instruction | Yansong Tang · Jinpeng Liu · Aoyang Liu · Bin Yang · Wenxun Dai · Yongming Rao · Jiwen Lu · Jie Zhou · Xiu Li | N/A | Code |
| NEF: Neural Edge Fields for 3D Parametric Curve Reconstruction From Multi-View Images | Yunfan Ye · Renjiao Yi · Zhirui Gao · Chenyang Zhu · Zhiping Cai · Kai Xu | N/A | Code |
| Executing Your Commands via Motion Diffusion in Latent Space | Xin Chen · Biao Jiang · Wen Liu · Zilong Huang · Bin Fu · Tao Chen · Gang Yu | N/A | Code |
| MSINet: Twins Contrastive Search of Multi-Scale Interaction for Object ReID | Jianyang Gu · Kai Wang · Hao Luo · Chen Chen · Wei Jiang · Yuqiang Fang · Shanghang Zhang · Yang You · Jian Zhao | N/A | Code |
| SunStage: Portrait Reconstruction and Relighting Using the Sun as a Light Stage | Yifan Wang · Aleksander Holynski · Xiuming Zhang · Xuaner Zhang | N/A | Code |
| IS-GGT: Iterative Scene Graph Generation With Generative Transformers | Sanjoy Kundu · Sathyanarayanan N. Aakur | N/A | Code |
| DisCoScene: Spatially Disentangled Generative Radiance Fields for Controllable 3D-Aware Scene Synthesis | Yinghao Xu · Menglei Chai · Zifan Shi · Sida Peng · Ivan Skorokhodov · Aliaksandr Siarohin · Ceyuan Yang · Yujun Shen · Hsin-Ying Lee · Bolei Zhou · Sergey Tulyakov | N/A | Code |
| Breaking the “Object” in Video Object Segmentation | Pavel Tokmakov · Jie Li · Adrien Gaidon | N/A | Code |
| SimpleNet: A Simple Network for Image Anomaly Detection and Localization | Zhikang Liu · Yiming Zhou · Yuansheng Xu · Zilei Wang | N/A | Code |
| Taming Diffusion Models for Audio-Driven Co-Speech Gesture Generation | Lingting Zhu · Xian Liu · Xuanyu Liu · Rui Qian · Ziwei Liu · Lequan Yu | N/A | Code |
| Top-Down Visual Attention From Analysis by Synthesis | Baifeng Shi · Trevor Darrell · Xin Wang | N/A | Code |
| Disentangling Orthogonal Planes for Indoor Panoramic Room Layout Estimation With Cross-Scale Distortion Awareness | Zhijie Shen · Zishuo Zheng · Chunyu Lin · Lang Nie · Kang Liao · Shuai Zheng · Yao Zhao | N/A | Code |
| Global-to-Local Modeling for Video-Based 3D Human Pose and Shape Estimation | Xiaolong Shen · Zongxin Yang · Xiaohan Wang · Jianxin Ma · Chang Zhou · Yi Yang | N/A | Code |
| RaBit: Parametric Modeling of 3D Biped Cartoon Characters With a Topological-Consistent Dataset | Zhongjin Luo · Shengcai Cai · Jinguo Dong · Ruibo Ming · Liangdong Qiu · Xiaohang Zhan · Xiaoguang Han | N/A | Code |
| Masked Image Modeling With Local Multi-Scale Reconstruction | Haoqing Wang · Yehui Tang · Yunhe Wang · Jianyuan Guo · Zhi-Hong Deng · Kai Han | N/A | Code |
| Omni Aggregation Networks for Lightweight Image Super-Resolution | Hang Wang · Xuanhong Chen · Bingbing Ni · Yutian Liu · Jinfan Liu | N/A | Code |
| TryOnDiffusion: A Tale of Two UNets | Luyang Zhu · Dawei Yang · Tyler Zhu · Fitsum Reda · William Chan · Chitwan Saharia · Mohammad Norouzi · Ira Kemelmacher-Shlizerman | N/A | Code |
| MoLo: Motion-Augmented Long-Short Contrastive Learning for Few-Shot Action Recognition | Xiang Wang · Shiwei Zhang · Zhiwu Qing · Changxin Gao · Yingya Zhang · Deli Zhao · Nong Sang | N/A | Code |
| Dynamic Aggregated Network for Gait Recognition | Kang Ma · Ying Fu · Dezhi Zheng · Chunshui Cao · Xuecai Hu · Yongzhen Huang | N/A | Code |
| Equivalent Transformation and Dual Stream Network Construction for Mobile Image Super-Resolution | Jiahao Chao · Zhou Zhou · Hongfan Gao · Jiali Gong · Zhengfeng Yang · Zhenbing Zeng · Lydia Dehbi | N/A | Code |
| Semi-Supervised Hand Appearance Recovery via Structure Disentanglement and Dual Adversarial Discrimination | Zimeng Zhao · Binghui Zuo · Zhiyu Long · Yangang Wang | N/A | Code |
| DBARF: Deep Bundle-Adjusting Generalizable Neural Radiance Fields | Yu Chen · Gim Hee Lee | N/A | Code |
| Deep Arbitrary-Scale Image Super-Resolution via Scale-Equivariance Pursuit | Xiaohang Wang · Xuanhong Chen · Bingbing Ni · Hang Wang · Zhengyan Tong · Yutian Liu | N/A | Code |
| Looking Through the Glass: Neural Surface Reconstruction Against High Specular Reflections | Jiaxiong Qiu · Peng-Tao Jiang · Yifan Zhu · Ze-Xin Yin · Ming-Ming Cheng · Bo Ren | N/A | Code |
| The Wisdom of Crowds: Temporal Progressive Attention for Early Action Prediction | Alexandros Stergiou · Dima Damen | N/A | Code |
| Use Your Head: Improving Long-Tail Video Recognition | Toby Perrett · Saptarshi Sinha · Tilo Burghardt · Majid Mirmehdi · Dima Damen | N/A | Code |
| Large-Scale Training Data Search for Object Re-Identification | Yue Yao · Tom Gedeon · Liang Zheng | N/A | Code |
| Unsupervised Sampling Promoting for Stochastic Human Trajectory Prediction | Guangyi Chen · Zhenhao Chen · Shunxing Fan · Kun Zhang | N/A | Code |
| Seeing a Rose in Five Thousand Ways | Yunzhi Zhang · Shangzhe Wu · Noah Snavely · Jiajun Wu | N/A | Code |
| EditableNeRF: Editing Topologically Varying Neural Radiance Fields by Key Points | Chengwei Zheng · Wenbin Lin · Feng Xu | N/A | Code |
| Uncertainty-Aware Unsupervised Image Deblurring With Deep Residual Prior | Xiaole Tang · Xile Zhao · Jun Liu · Jianli Wang · Yuchun Miao · Tieyong Zeng | N/A | Code |
| Primitive Generation and Semantic-Related Alignment for Universal Zero-Shot Segmentation | Shuting He · Henghui Ding · Wei Jiang | N/A | Code |
| Long-Tailed Visual Recognition via Self-Heterogeneous Integration With Knowledge Excavation | Yan Jin · Mengke Li · Yang Lu · Yiu-ming Cheung · Hanzi Wang | N/A | Code |
| Neuron Structure Modeling for Generalizable Remote Physiological Measurement | Hao Lu · Zitong Yu · Xuesong Niu · Ying-Cong Chen | N/A | Code |
| Decoupled Semantic Prototypes Enable Learning From Diverse Annotation Types for Semi-Weakly Segmentation in Expert-Driven Domains | Simon Reiß · Constantin Seibold · Alexander Freytag · Erik Rodner · Rainer Stiefelhagen | N/A | Code |
| Learning a Sparse Transformer Network for Effective Image Deraining | Xiang Chen · Hao Li · Mingqiang Li · Jinshan Pan | N/A | Code |
| Camouflaged Object Detection With Feature Decomposition and Edge Reconstruction | Chunming He · Kai Li · Yachao Zhang · Longxiang Tang · Yulun Zhang · Zhenhua Guo · Xiu Li | N/A | Code |
| LOCATE: Localize and Transfer Object Parts for Weakly Supervised Affordance Grounding | Gen Li · Varun Jampani · Deqing Sun · Laura Sevilla-Lara | N/A | Code |
| DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation | Nataniel Ruiz · Yuanzhen Li · Varun Jampani · Yael Pritch · Michael Rubinstein · Kfir Aberman | N/A | Code |
| GeneCIS: A Benchmark for General Conditional Image Similarity | Sagar Vaze · Nicolas Carion · Ishan Misra | N/A | Code |
| Neighborhood Attention Transformer | Ali Hassani · Steven Walton · Jiachen Li · Shen Li · Humphrey Shi | N/A | Code |
| 3D-Aware Conditional Image Synthesis | Kangle Deng · Gengshan Yang · Deva Ramanan · Jun-Yan Zhu | N/A | Code |
| Magic3D: High-Resolution Text-to-3D Content Creation | Chen-Hsuan Lin · Jun Gao · Luming Tang · Towaki Takikawa · Xiaohui Zeng · Xun Huang · Karsten Kreis · Sanja Fidler · Ming-Yu Liu · Tsung-Yi Lin | N/A | Code |
| QuantArt: Quantizing Image Style Transfer Towards High Visual Fidelity | Siyu Huang · Jie An · Donglai Wei · Jiebo Luo · Hanspeter Pfister | N/A | Code |
| SceneComposer: Any-Level Semantic Image Synthesis | Yu Zeng · Zhe Lin · Jianming Zhang · Qing Liu · John Collomosse · Jason Kuen · Vishal M. Patel | N/A | Code |
| Specialist Diffusion: Plug-and-Play Sample-Efficient Fine-Tuning of Text-to-Image Diffusion Models To Learn Any Unseen Style | Haoming Lu · Hazarapet Tunanyan · Kai Wang · Shant Navasardyan · Zhangyang Wang · Humphrey Shi | N/A | Code |
| In-Hand 3D Object Scanning From an RGB Sequence | Shreyas Hampali · Tomas Hodan · Luan Tran · Lingni Ma · Cem Keskin · Vincent Lepetit | N/A | Code |
| SHS-Net: Learning Signed Hyper Surfaces for Oriented Normal Estimation of Point Clouds | Qing Li · Huifang Feng · Kanle Shi · Yue Gao · Yi Fang · Yu-Shen Liu · Zhizhong Han | N/A | Code |
| Advancing Visual Grounding With Scene Knowledge: Benchmark and Method | Zhihong Chen · Ruifei Zhang · Yibing Song · Xiang Wan · Guanbin Li | N/A | Code |
| Putting People in Their Place: Affordance-Aware Human Insertion Into Scenes | Sumith Kulal · Tim Brooks · Alex Aiken · Jiajun Wu · Jimei Yang · Jingwan Lu · Alexei A. Efros · Krishna Kumar Singh | N/A | Code |
| Identity-Preserving Talking Face Generation With Landmark and Appearance Priors | Weizhi Zhong · Chaowei Fang · Yinqi Cai · Pengxu Wei · Gangming Zhao · Liang Lin · Guanbin Li | N/A | Code |
| Less Is More: Reducing Task and Model Complexity for 3D Point Cloud Semantic Segmentation | Li Li · Hubert P. H. Shum · Toby P. Breckon | N/A | Code |
| FAC: 3D Representation Learning via Foreground Aware Feature Contrast | Kangcheng Liu · Aoran Xiao · Xiaoqin Zhang · Shijian Lu · Ling Shao | N/A | Code |
| InstMove: Instance Motion for Object-Centric Video Segmentation | Qihao Liu · Junfeng Wu · Yi Jiang · Xiang Bai · Alan L. Yuille · Song Bai | N/A | Code |
| Are We Ready for Vision-Centric Driving Streaming Perception? The ASAP Benchmark | Xiaofeng Wang · Zheng Zhu · Yunpeng Zhang · Guan Huang · Yun Ye · Wenbo Xu · Ziwei Chen · Xingang Wang | N/A | Code |
| Self-Supervised Non-Uniform Kernel Estimation With Flow-Based Motion Prior for Blind Image Deblurring | Zhenxuan Fang · Fangfang Wu · Weisheng Dong · Xin Li · Jinjian Wu · Guangming Shi | N/A | Code |
| Neural Kernel Surface Reconstruction | Jiahui Huang · Zan Gojcic · Matan Atzmon · Or Litany · Sanja Fidler · Francis Williams | N/A | Code |
| Binary Latent Diffusion | Ze Wang · Jiang Wang · Zicheng Liu · Qiang Qiu | N/A | Code |
| Learning To Dub Movies via Hierarchical Prosody Models | Gaoxiang Cong · Liang Li · Yuankai Qi · Zheng-Jun Zha · Qi Wu · Wenyu Wang · Bin Jiang · Ming-Hsuan Yang · Qingming Huang | N/A | Code |
| Learning Geometric-Aware Properties in 2D Representation Using Lightweight CAD Models, or Zero Real 3D Pairs | Pattaramanee Arsomngern · Sarana Nutanong · Supasorn Suwajanakorn | N/A | Code |
| FrustumFormer: Adaptive Instance-Aware Resampling for Multi-View 3D Detection | Yuqi Wang · Yuntao Chen · Zhaoxiang Zhang | N/A | Code |
| Q: How To Specialize Large Vision-Language Models to Data-Scarce VQA Tasks? A: Self-Train on Unlabeled Images! | Zaid Khan · Vijay Kumar BG · Samuel Schulter · Xiang Yu · Yun Fu · Manmohan Chandraker | N/A | Code |
| StyleRes: Transforming the Residuals for Real Image Editing With StyleGAN | Hamza Pehlivan · Yusuf Dalva · Aysegul Dundar | N/A | Code |
| PVT-SSD: Single-Stage 3D Object Detector With Point-Voxel Transformer | Honghui Yang · Wenxiao Wang · Minghao Chen · Binbin Lin · Tong He · Hua Chen · Xiaofei He · Wanli Ouyang | N/A | Code |
| Boosting Verified Training for Robust Image Classifications via Abstraction | Zhaodi Zhang · Zhiyi Xue · Yang Chen · Si Liu · Yueling Zhang · Jing Liu · Min Zhang | N/A | Code |
| Interactive Segmentation As Gaussian Process Classification | Minghao Zhou · Hong Wang · Qian Zhao · Yuexiang Li · Yawen Huang · Deyu Meng · Yefeng Zheng | N/A | Code |
| OSRT: Omnidirectional Image Super-Resolution With Distortion-Aware Transformer | Fanghua Yu · Xintao Wang · Mingdeng Cao · Gen Li · Ying Shan · Chao Dong | N/A | Code |
| Accelerating Vision-Language Pretraining With Free Language Modeling | Teng Wang · Yixiao Ge · Feng Zheng · Ran Cheng · Ying Shan · Xiaohu Qie · Ping Luo | N/A | Code |
| TexPose: Neural Texture Learning for Self-Supervised 6D Object Pose Estimation | Hanzhi Chen · Fabian Manhardt · Nassir Navab · Benjamin Busam | N/A | Code |
| Learning With Noisy Labels via Self-Supervised Adversarial Noisy Masking | Yuanpeng Tu · Boshen Zhang · Yuxi Li · Liang Liu · Jian Li · Jiangning Zhang · Yabiao Wang · Chengjie Wang · Cai Rong Zhao | N/A | Code |
| HelixSurf: A Robust and Efficient Neural Implicit Surface Learning of Indoor Scenes With Iterative Intertwined Regularization | Zhihao Liang · Zhangjin Huang · Changxing Ding · Kui Jia | N/A | Code |
| Multi-Space Neural Radiance Fields | Ze-Xin Yin · Jiaxiong Qiu · Ming-Ming Cheng · Bo Ren | N/A | Code |
| MSF: Motion-Guided Sequential Fusion for Efficient 3D Object Detection From Point Cloud Sequences | Chenhang He · Ruihuang Li · Yabin Zhang · Shuai Li · Lei Zhang | N/A | Code |
| DLBD: A Self-Supervised Direct-Learned Binary Descriptor | Bin Xiao · Yang Hu · Bo Liu · Xiuli Bi · Weisheng Li · Xinbo Gao | N/A | Code |
| Connecting the Dots: Floorplan Reconstruction Using Two-Level Queries | Yuanwen Yue · Theodora Kontogianni · Konrad Schindler · Francis Engelmann | N/A | Code |
| PointAvatar: Deformable Point-Based Head Avatars From Videos | Yufeng Zheng · Wang Yifan · Gordon Wetzstein · Michael J. Black · Otmar Hilliges | N/A | Code |
| Diffusion-SDF: Text-To-Shape via Voxelized Diffusion | Muheng Li · Yueqi Duan · Jie Zhou · Jiwen Lu | N/A | Code |
| NeRF-RPN: A General Framework for Object Detection in NeRFs | Benran Hu · Junkai Huang · Yichen Liu · Yu-Wing Tai · Chi-Keung Tang | N/A | Code |
| CNVid-3.5M: Build, Filter, and Pre-Train the Large-Scale Public Chinese Video-Text Dataset | Tian Gan · Qing Wang · Xingning Dong · Xiangyuan Ren · Liqiang Nie · Qingpei Guo | N/A | Code |
| Vid2Avatar: 3D Avatar Reconstruction From Videos in the Wild via Self-Supervised Scene Decomposition | Chen Guo · Tianjian Jiang · Xu Chen · Jie Song · Otmar Hilliges | N/A | Code |
| Neural Preset for Color Style Transfer | Zhanghan Ke · Yuhao Liu · Lei Zhu · Nanxuan Zhao · Rynson W.H. Lau | N/A | Code |
| GRES: Generalized Referring Expression Segmentation | Chang Liu · Henghui Ding · Xudong Jiang | N/A | Code |
| Tracking Through Containers and Occluders in the Wild | Basile Van Hoorick · Pavel Tokmakov · Simon Stent · Jie Li · Carl Vondrick | N/A | Code |
| DepGraph: Towards Any Structural Pruning | Gongfan Fang · Xinyin Ma · Mingli Song · Michael Bi Mi · Xinchao Wang | N/A | Code |
| Exploring Incompatible Knowledge Transfer in Few-Shot Image Generation | Yunqing Zhao · Chao Du · Milad Abdollahzadeh · Tianyu Pang · Min Lin · Shuicheng Yan · Ngai-Man Cheung | N/A | Code |
| RGB No More: Minimally-Decoded JPEG Vision Transformers | Jeongsoo Park · Justin Johnson | N/A | Code |
| iQuery: Instruments As Queries for Audio-Visual Sound Separation | Jiaben Chen · Renrui Zhang · Dongze Lian · Jiaqi Yang · Ziyao Zeng · Jianbo Shi | N/A | Code |
| Towards Professional Level Crowd Annotation of Expert Domain Data | Pei Wang · Nuno Vasconcelos | N/A | Code |
| VideoTrack: Learning To Track Objects via Video Transformer | Fei Xie · Lei Chu · Jiahao Li · Yan Lu · Chao Ma | N/A | Code |
| SCoDA: Domain Adaptive Shape Completion for Real Scans | Yushuang Wu · Zizheng Yan · Ce Chen · Lai Wei · Xiao Li · Guanbin Li · Yihao Li · Shuguang Cui · Xiaoguang Han | N/A | Code |
| Enhanced Training of Query-Based Object Detection via Selective Query Recollection | Fangyi Chen · Han Zhang · Kai Hu · Yu-Kai Huang · Chenchen Zhu · Marios Savvides | N/A | Code |
| LaserMix for Semi-Supervised LiDAR Semantic Segmentation | Lingdong Kong · Jiawei Ren · Liang Pan · Ziwei Liu | N/A | Code |
| MSMDFusion: Fusing LiDAR and Camera at Multiple Scales With Multi-Depth Seeds for 3D Object Detection | Yang Jiao · Zequn Jie · Shaoxiang Chen · Jingjing Chen · Lin Ma · Yu-Gang Jiang | N/A | Code |
| Learning With Fantasy: Semantic-Aware Virtual Contrastive Constraint for Few-Shot Class-Incremental Learning | Zeyin Song · Yifan Zhao · Yujun Shi · Peixi Peng · Li Yuan · Yonghong Tian | N/A | Code |
| PLA: Language-Driven Open-Vocabulary 3D Scene Understanding | Runyu Ding · Jihan Yang · Chuhui Xue · Wenqing Zhang · Song Bai · Xiaojuan Qi | N/A | Code |
| Being Comes From Not-Being: Open-Vocabulary Text-to-Motion Generation With Wordless Training | Junfan Lin · Jianlong Chang · Lingbo Liu · Guanbin Li · Liang Lin · Qi Tian · Chang-Wen Chen | N/A | Code |
| A Dynamic Multi-Scale Voxel Flow Network for Video Prediction | Xiaotao Hu · Zhewei Huang · Ailin Huang · Jun Xu · Shuchang Zhou | N/A | Code |
| Neural Dependencies Emerging From Learning Massive Categories | Ruili Feng · Kecheng Zheng · Kai Zhu · Yujun Shen · Jian Zhao · Yukun Huang · Deli Zhao · Jingren Zhou · Michael Jordan · Zheng-Jun Zha | N/A | Code |
| Diverse Embedding Expansion Network and Low-Light Cross-Modality Benchmark for Visible-Infrared Person Re-Identification | Yukang Zhang · Hanzi Wang | N/A | Code |
| Neural Kaleidoscopic Space Sculpting | Byeongjoo Ahn · Michael De Zeeuw · Ioannis Gkioulekas · Aswin C. Sankaranarayanan | N/A | Code |
| PyramidFlow: High-Resolution Defect Contrastive Localization Using Pyramid Normalizing Flow | Jiarui Lei · Xiaobo Hu · Yue Wang · Dong Liu | N/A | Code |
| Masked Motion Encoding for Self-Supervised Video Representation Learning | Xinyu Sun · Peihao Chen · Liangwei Chen · Changhao Li · Thomas H. Li · Mingkui Tan · Chuang Gan | N/A | Code |
| StyleSync: High-Fidelity Generalized and Personalized Lip Sync in Style-Based Generator | Jiazhi Guan · Zhanwang Zhang · Hang Zhou · Tianshu Hu · Kaisiyuan Wang · Dongliang He · Haocheng Feng · Jingtuo Liu · Errui Ding · Ziwei Liu · Jingdong Wang | N/A | Code |
| LiDAR2Map: In Defense of LiDAR-Based Semantic Map Construction Using Online Camera Distillation | Song Wang · Wentong Li · Wenyu Liu · Xiaolu Liu · Jianke Zhu | N/A | Code |
| Symmetric Shape-Preserving Autoencoder for Unsupervised Real Scene Point Cloud Completion | Changfeng Ma · Yinuo Chen · Pengxiao Guo · Jie Guo · Chongjun Wang · Yanwen Guo | N/A | Code |
| Boosting Detection in Crowd Analysis via Underutilized Output Features | Shaokai Wu · Fengyu Yang | N/A | Code |
| Representation Learning for Visual Object Tracking by Masked Appearance Transfer | Haojie Zhao · Dong Wang · Huchuan Lu | N/A | Code |
| NeuralLift-360: Lifting an In-the-Wild 2D Photo to a 3D Object With 360° Views | Dejia Xu · Yifan Jiang · Peihao Wang · Zhiwen Fan · Yi Wang · Zhangyang Wang | N/A | Code |
| DoNet: Deep De-Overlapping Network for Cytology Instance Segmentation | Hao Jiang · Rushan Zhang · Yanning Zhou · Yumeng Wang · Hao Chen | N/A | Code |
| Think Twice Before Driving: Towards Scalable Decoders for End-to-End Autonomous Driving | Xiaosong Jia · Penghao Wu · Li Chen · Jiangwei Xie · Conghui He · Junchi Yan · Hongyang Li | N/A | Code |
| Adversarial Counterfactual Visual Explanations | Guillaume Jeanneret · Loïc Simon · Frédéric Jurie | N/A | Code |
| ALOFT: A Lightweight MLP-Like Architecture With Dynamic Low-Frequency Transform for Domain Generalization | Jintao Guo · Na Wang · Lei Qi · Yinghuan Shi | N/A | Code |
| ShadowNeuS: Neural SDF Reconstruction by Shadow Ray Supervision | Jingwang Ling · Zhibo Wang · Feng Xu | N/A | Code |
| Coaching a Teachable Student | Jimuyang Zhang · Zanming Huang · Eshed Ohn-Bar | N/A | Code |
| POTTER: Pooling Attention Transformer for Efficient Human Mesh Recovery | Ce Zheng · Xianpeng Liu · Guo-Jun Qi · Chen Chen | N/A | Code |
| Layout-Based Causal Inference for Object Navigation | Sixian Zhang · Xinhang Song · Weijie Li · Yubing Bai · Xinyao Yu · Shuqiang Jiang | N/A | Code |
| Towards Bridging the Performance Gaps of Joint Energy-Based Models | Xiulong Yang · Qing Su · Shihao Ji | N/A | Code |
| Bringing Inputs to Shared Domains for 3D Interacting Hands Recovery in the Wild | Gyeongsik Moon | N/A | Code |
| Trajectory-Aware Body Interaction Transformer for Multi-Person Pose Forecasting | Xiaogang Peng · Siyuan Mao · Zizhao Wu | N/A | Code |
| Distilling Vision-Language Pre-Training To Collaborate With Weakly-Supervised Temporal Action Localization | Chen Ju · Kunhao Zheng · Jinxiang Liu · Peisen Zhao · Ya Zhang · Jianlong Chang · Qi Tian · Yanfeng Wang | N/A | Code |
| DiffPose: Toward More Reliable 3D Pose Estimation | Jia Gong · Lin Geng Foo · Zhipeng Fan · Qiuhong Ke · Hossein Rahmani · Jun Liu | N/A | Code |
| SQUID: Deep Feature In-Painting for Unsupervised Anomaly Detection | Tiange Xiang · Yixiao Zhang · Yongyi Lu · Alan L. Yuille · Chaoyi Zhang · Weidong Cai · Zongwei Zhou | N/A | Code |
| On the Difficulty of Unpaired Infrared-to-Visible Video Translation: Fine-Grained Content-Rich Patches Transfer | Zhenjie Yu · Shuang Li · Yirui Shen · Chi Harold Liu · Shuigen Wang | N/A | Code |
| CABM: Content-Aware Bit Mapping for Single Image Super-Resolution Network With Large Input | Senmao Tian · Ming Lu · Jiaming Liu · Yandong Guo · Yurong Chen · Shunli Zhang | N/A | Code |
| NeRF-DS: Neural Radiance Fields for Dynamic Specular Objects | Zhiwen Yan · Chen Li · Gim Hee Lee | N/A | Code |
| Spring: A High-Resolution High-Detail Dataset and Benchmark for Scene Flow, Optical Flow and Stereo | Lukas Mehl · Jenny Schmalfuss · Azin Jahedi · Yaroslava Nalivayko · Andrés Bruhn | N/A | Code |
| Unifying Short and Long-Term Tracking With Graph Hierarchies | Orcun Cetintas · Guillem Brasó · Laura Leal-Taixé | N/A | Code |
| MixTeacher: Mining Promising Labels With Mixed Scale Teacher for Semi-Supervised Object Detection | Liang Liu · Boshen Zhang · Jiangning Zhang · Wuhao Zhang · Zhenye Gan · Guanzhong Tian · Wenbing Zhu · Yabiao Wang · Chengjie Wang | N/A | Code |
| A2J-Transformer: Anchor-to-Joint Transformer Network for 3D Interacting Hand Pose Estimation From a Single RGB Image | Changlong Jiang · Yang Xiao · Cunlin Wu · Mingyang Zhang · Jinghong Zheng · Zhiguo Cao · Joey Tianyi Zhou | N/A | Code |
| Efficient Mask Correction for Click-Based Interactive Image Segmentation | Fei Du · Jianlong Yuan · Zhibin Wang · Fan Wang | N/A | Code |
| OmniObject3D: Large-Vocabulary 3D Object Dataset for Realistic Perception, Reconstruction and Generation | Tong Wu · Jiarui Zhang · Xiao Fu · Yuxin Wang · Jiawei Ren · Liang Pan · Wayne Wu · Lei Yang · Jiaqi Wang · Chen Qian · Dahua Lin · Ziwei Liu | N/A | Code |
| LoGoNet: Towards Accurate 3D Object Detection With Local-to-Global Cross-Modal Fusion | Xin Li · Tao Ma · Yuenan Hou · Botian Shi · Yuchen Yang · Youquan Liu · Xingjiao Wu · Qin Chen · Yikang Li · Yu Qiao · Liang He | N/A | Code |
| 3D Registration With Maximal Cliques | Xiyu Zhang · Jiaqi Yang · Shikun Zhang · Yanning Zhang | N/A | Code |
| Inferring and Leveraging Parts From Object Shape for Improving Semantic Image Synthesis | Yuxiang Wei · Zhilong Ji · Xiaohe Wu · Jinfeng Bai · Lei Zhang · Wangmeng Zuo | N/A | Code |
| Frame-Event Alignment and Fusion Network for High Frame Rate Tracking | Jiqing Zhang · Yuanchen Wang · Wenxi Liu · Meng Li · Jinpeng Bai · Baocai Yin · Xin Yang | N/A | Code |
| Human Guided Ground-Truth Generation for Realistic Image Super-Resolution | Du Chen · Jie Liang · Xindong Zhang · Ming Liu · Hui Zeng · Lei Zhang | N/A | Code |
| Towards Building Self-Aware Object Detectors via Reliable Uncertainty Quantification and Calibration | Kemal Oksuz · Tom Joy · Puneet K. Dokania | N/A | Code |
| Generating Human Motion From Textual Descriptions With Discrete Representations | Jianrong Zhang · Yangsong Zhang · Xiaodong Cun · Yong Zhang · Hongwei Zhao · Hongtao Lu · Xi Shen · Ying Shan | N/A | Code |
| Curricular Contrastive Regularization for Physics-Aware Single Image Dehazing | Yu Zheng · Jiahui Zhan · Shengfeng He · Junyu Dong · Yong Du | N/A | Code |
| Learning Human Mesh Recovery in 3D Scenes | Zehong Shen · Zhi Cen · Sida Peng · Qing Shuai · Hujun Bao · Xiaowei Zhou | N/A | Code |
| Object-Aware Distillation Pyramid for Open-Vocabulary Object Detection | Luting Wang · Yi Liu · Penghui Du · Zihan Ding · Yue Liao · Qiaosong Qi · Biaolong Chen · Si Liu | N/A | Code |
| Towards Artistic Image Aesthetics Assessment: A Large-Scale Dataset and a New Method | Ran Yi · Haoyuan Tian · Zhihao Gu · Yu-Kun Lai · Paul L. Rosin | N/A | Code |
| SOOD: Towards Semi-Supervised Oriented Object Detection | Wei Hua · Dingkang Liang · Jingyu Li · Xiaolong Liu · Zhikang Zou · Xiaoqing Ye · Xiang Bai | N/A | Code |
| Spherical Transformer for LiDAR-Based 3D Recognition | Xin Lai · Yukang Chen · Fanbin Lu · Jianhui Liu · Jiaya Jia | N/A | Code |
| Data-Free Sketch-Based Image Retrieval | Abhra Chaudhuri · Ayan Kumar Bhunia · Yi-Zhe Song · Anjan Dutta | N/A | Code |
| CDDFuse: Correlation-Driven Dual-Branch Feature Decomposition for Multi-Modality Image Fusion | Zixiang Zhao · Haowen Bai · Jiangshe Zhang · Yulun Zhang · Shuang Xu · Zudi Lin · Radu Timofte · Luc Van Gool | N/A | Code |
| Proximal Splitting Adversarial Attack for Semantic Segmentation | Jérôme Rony · Jean-Christophe Pesquet · Ismail Ben Ayed | N/A | Code |
| NeuWigs: A Neural Dynamic Model for Volumetric Hair Capture and Animation | Ziyan Wang · Giljoo Nam · Tuur Stuyck · Stephen Lombardi · Chen Cao · Jason Saragih · Michael Zollhöfer · Jessica Hodgins · Christoph Lassner | N/A | Code |
| Exploring the Effect of Primitives for Compositional Generalization in Vision-and-Language | Chuanhao Li · Zhen Li · Chenchen Jing · Yunde Jia · Yuwei Wu | N/A | Code |
| 3D-Aware Face Swapping | Yixuan Li · Chao Ma · Yichao Yan · Wenhan Zhu · Xiaokang Yang | N/A | Code |
| Representing Volumetric Videos As Dynamic MLP Maps | Sida Peng · Yunzhi Yan · Qing Shuai · Hujun Bao · Xiaowei Zhou | N/A | Code |
| Rethinking the Approximation Error in 3D Surface Fitting for Point Cloud Normal Estimation | Hang Du · Xuejun Yan · Jingjing Wang · Di Xie · Shiliang Pu | N/A | Code |
| Rethinking Out-of-Distribution (OOD) Detection: Masked Image Modeling Is All You Need | Jingyao Li · Pengguang Chen · Zexin He · Shaozuo Yu · Shu Liu · Jiaya Jia | N/A | Code |
| Paint by Example: Exemplar-Based Image Editing With Diffusion Models | Binxin Yang · Shuyang Gu · Bo Zhang · Ting Zhang · Xuejin Chen · Xiaoyan Sun · Dong Chen · Fang Wen | N/A | Code |
| Referring Multi-Object Tracking | Dongming Wu · Wencheng Han · Tiancai Wang · Xingping Dong · Xiangyu Zhang · Jianbing Shen | N/A | Code |
| NerVE: Neural Volumetric Edges for Parametric Curve Extraction From Point Cloud | Xiangyu Zhu · Dong Du · Weikai Chen · Zhiyou Zhao · Yinyu Nie · Xiaoguang Han | N/A | Code |
| AsyFOD: An Asymmetric Adaptation Paradigm for Few-Shot Domain Adaptive Object Detection | Yipeng Gao · Kun-Yu Lin · Junkai Yan · Yaowei Wang · Wei-Shi Zheng | N/A | Code |
| CUF: Continuous Upsampling Filters | Cristina N. Vasconcelos · Cengiz Oztireli · Mark Matthews · Milad Hashemi · Kevin Swersky · Andrea Tagliasacchi | N/A | Code |
| MOTRv2: Bootstrapping End-to-End Multi-Object Tracking by Pretrained Object Detectors | Yuang Zhang · Tiancai Wang · Xiangyu Zhang | N/A | Code |
| CXTrack: Improving 3D Point Cloud Tracking With Contextual Information | Tian-Xing Xu · Yuan-Chen Guo · Yu-Kun Lai · Song-Hai Zhang | N/A | Code |
| Explicit Boundary Guided Semi-Push-Pull Contrastive Learning for Supervised Anomaly Detection | Xincheng Yao · Ruoqi Li · Jing Zhang · Jun Sun · Chongyang Zhang | N/A | Code |
| Learning Bottleneck Concepts in Image Classification | Bowen Wang · Liangzhi Li · Yuta Nakashima · Hajime Nagahara | N/A | Code |
| Zero-Shot Model Diagnosis | Jinqi Luo · Zhaoning Wang · Chen Henry Wu · Dong Huang · Fernando De la Torre | N/A | Code |
| DiffTalk: Crafting Diffusion Models for Generalized Audio-Driven Portraits Animation | Shuai Shen · Wenliang Zhao · Zibin Meng · Wanhua Li · Zheng Zhu · Jie Zhou · Jiwen Lu | N/A | Code |
| DAA: A Delta Age AdaIN Operation for Age Estimation via Binary Code Transformer | Ping Chen · Xingpeng Zhang · Ye Li · Ju Tao · Bin Xiao · Bing Wang · Zongjie Jiang | N/A | Code |
| TranSG: Transformer-Based Skeleton Graph Prototype Contrastive Learning With Structure-Trajectory Prompted Reconstruction for Person Re-Identification | Haocong Rao · Chunyan Miao | N/A | Code |
| Joint Visual Grounding and Tracking With Natural Language Specification | Li Zhou · Zikun Zhou · Kaige Mao · Zhenyu He | N/A | Code |
| Compressing Volumetric Radiance Fields to 1 MB | Lingzhi Li · Zhen Shen · Zhongshu Wang · Li Shen · Liefeng Bo | N/A | Code |
| HyperReel: High-Fidelity 6-DoF Video With Ray-Conditioned Sampling | Benjamin Attal · Jia-Bin Huang · Christian Richardt · Michael Zollhöfer · Johannes Kopf · Matthew O’Toole · Changil Kim | N/A | Code |
| Iterative Next Boundary Detection for Instance Segmentation of Tree Rings in Microscopy Images of Shrub Cross Sections | Alexander Gillert · Giulia Resente · Alba Anadon-Rosell · Martin Wilmking · Uwe Freiherr von Lukas | N/A | Code |
| Ego-Body Pose Estimation via Ego-Head Pose Estimation | Jiaman Li · Karen Liu · Jiajun Wu | N/A | Code |
| Learned Two-Plane Perspective Prior Based Image Resampling for Efficient Object Detection | Anurag Ghosh · N. Dinesh Reddy · Christoph Mertz · Srinivasa G. Narasimhan | N/A | Code |
| PaletteNeRF: Palette-Based Appearance Editing of Neural Radiance Fields | Zhengfei Kuang · Fujun Luan · Sai Bi · Zhixin Shu · Gordon Wetzstein · Kalyan Sunkavalli | N/A | Code |
| Long Range Pooling for 3D Large-Scale Scene Understanding | Xiang-Li Li · Meng-Hao Guo · Tai-Jiang Mu · Ralph R. Martin · Shi-Min Hu | N/A | Code |
| Dynamic Graph Enhanced Contrastive Learning for Chest X-Ray Report Generation | Mingjie Li · Bingqian Lin · Zicong Chen · Haokun Lin · Xiaodan Liang · Xiaojun Chang | N/A | Code |
| Event-Guided Person Re-Identification via Sparse-Dense Complementary Learning | Chengzhi Cao · Xueyang Fu · Hongjian Liu · Yukun Huang · Kunyu Wang · Jiebo Luo · Zheng-Jun Zha | N/A | Code |
| Contrastive Grouping With Transformer for Referring Image Segmentation | Jiajin Tang · Ge Zheng · Cheng Shi · Sibei Yang | N/A | Code |
| Structure Aggregation for Cross-Spectral Stereo Image Guided Denoising | Zehua Sheng · Zhu Yu · Xiongwei Liu · Si-Yuan Cao · Yuqi Liu · Hui-Liang Shen · Huaqi Zhang | N/A | Code |
| Where Is My Spot? Few-Shot Image Generation via Latent Subspace Optimization | Chenxi Zheng · Bangzhen Liu · Huaidong Zhang · Xuemiao Xu · Shengfeng He | N/A | Code |
| EDGE: Editable Dance Generation From Music | Jonathan Tseng · Rodrigo Castellon · Karen Liu | N/A | Code |
| PartSLIP: Low-Shot Part Segmentation for 3D Point Clouds via Pretrained Image-Language Models | Minghua Liu · Yinhao Zhu · Hong Cai · Shizhong Han · Zhan Ling · Fatih Porikli · Hao Su | N/A | Code |
| EDICT: Exact Diffusion Inversion via Coupled Transformations | Bram Wallace · Akash Gokul · Nikhil Naik | N/A | Code |
| Complete 3D Human Reconstruction From a Single Incomplete Image | Junying Wang · Jae Shin Yoon · Tuanfeng Y. Wang · Krishna Kumar Singh · Ulrich Neumann | N/A | Code |
| PartDistillation: Learning Parts From Instance Segmentation | Jang Hyun Cho · Philipp Krähenbühl · Vignesh Ramanathan | N/A | Code |
| Neural Vector Fields: Implicit Representation by Explicit Learning | Xianghui Yang · Guosheng Lin · Zhenghao Chen · Luping Zhou | N/A | Code |
| Unsupervised Inference of Signed Distance Functions From Single Sparse Point Clouds Without Learning Priors | Chao Chen · Yu-Shen Liu · Zhizhong Han | N/A | Code |
| Texts as Images in Prompt Tuning for Multi-Label Image Recognition | Zixian Guo · Bowen Dong · Zhilong Ji · Jinfeng Bai · Yiwen Guo · Wangmeng Zuo | N/A | Code |
| Grad-PU: Arbitrary-Scale Point Cloud Upsampling via Gradient Descent With Learned Distance Functions | Yun He · Danhang Tang · Yinda Zhang · Xiangyang Xue · Yanwei Fu | N/A | Code |
| MMANet: Margin-Aware Distillation and Modality-Aware Regularization for Incomplete Multimodal Learning | Shicai Wei · Chunbo Luo · Yang Luo | N/A | Code |
| Rethinking Optical Flow From Geometric Matching Consistent Perspective | Qiaole Dong · Chenjie Cao · Yanwei Fu | N/A | Code |
| FastInst: A Simple Query-Based Model for Real-Time Instance Segmentation | Junjie He · Pengyu Li · Yifeng Geng · Xuansong Xie | N/A | Code |
| How Can Objects Help Action Recognition? | Xingyi Zhou · Anurag Arnab · Chen Sun · Cordelia Schmid | N/A | Code |
| Images Speak in Images: A Generalist Painter for In-Context Visual Learning | Xinlong Wang · Wen Wang · Yue Cao · Chunhua Shen · Tiejun Huang | N/A | Code |
| SemiCVT: Semi-Supervised Convolutional Vision Transformer for Semantic Segmentation | Huimin Huang · Shiao Xie · Lanfen Lin · Ruofeng Tong · Yen-Wei Chen · Yuexiang Li · Hong Wang · Yawen Huang · Yefeng Zheng | N/A | Code |
| A Unified Pyramid Recurrent Network for Video Frame Interpolation | Xin Jin · Longhai Wu · Jie Chen · Youxin Chen · Jayoon Koo · Cheul-hee Hahm | N/A | Code |
| Enhancing the Self-Universality for Transferable Targeted Attacks | Zhipeng Wei · Jingjing Chen · Zuxuan Wu · Yu-Gang Jiang | N/A | Code |
| Multi-View Inverse Rendering for Large-Scale Real-World Indoor Scenes | Zhen Li · Lingli Wang · Mofang Cheng · Cihui Pan · Jiaqi Yang | N/A | Code |
| TAPS3D: Text-Guided 3D Textured Shape Generation From Pseudo Supervision | Jiacheng Wei · Hao Wang · Jiashi Feng · Guosheng Lin · Kim-Hui Yap | N/A | Code |
| Frequency-Modulated Point Cloud Rendering With Easy Editing | Yi Zhang · Xiaoyang Huang · Bingbing Ni · Teng Li · Wenjun Zhang | N/A | Code |
| Vector Quantization With Self-Attention for Quality-Independent Representation Learning | Zhou Yang · Weisheng Dong · Xin Li · Mengluan Huang · Yulin Sun · Guangming Shi | N/A | Code |
| Fine-Grained Face Swapping via Regional GAN Inversion | Zhian Liu · Maomao Li · Yong Zhang · Cairong Wang · Qi Zhang · Jue Wang · Yongwei Nie | N/A | Code |
| Backdoor Defense via Adaptively Splitting Poisoned Dataset | Kuofeng Gao · Yang Bai · Jindong Gu · Yong Yang · Shu-Tao Xia | N/A | Code |
| RGBD2: Generative Scene Synthesis via Incremental View Inpainting Using RGBD Diffusion Models | Jiabao Lei · Jiapeng Tang · Kui Jia | N/A | Code |
| CLAMP: Prompt-Based Contrastive Learning for Connecting Language and Animal Pose | Xu Zhang · Wen Wang · Zhe Chen · Yufei Xu · Jing Zhang · Dacheng Tao | N/A | Code |
| Fake It Till You Make It: Learning Transferable Representations From Synthetic ImageNet Clones | Mert Bülent Sarıyıldız · Karteek Alahari · Diane Larlus · Yannis Kalantidis | N/A | Code |
| Efficient Frequency Domain-Based Transformers for High-Quality Image Deblurring | Lingshun Kong · Jiangxin Dong · Jianjun Ge · Mingqiang Li · Jinshan Pan | N/A | Code |
| DartBlur: Privacy Preservation With Detection Artifact Suppression | Baowei Jiang · Bing Bai · Haozhe Lin · Yu Wang · Yuchen Guo · Lu Fang | N/A | Code |
| FCC: Feature Clusters Compression for Long-Tailed Visual Recognition | Jian Li · Ziyao Meng · Daqian Shi · Rui Song · Xiaolei Diao · Jingwen Wang · Hao Xu | N/A | Code |
| CLOTH4D: A Dataset for Clothed Human Reconstruction | Xingxing Zou · Xintong Han · Waikeung Wong | N/A | Code |
| LinK: Linear Kernel for LiDAR-Based 3D Perception | Tao Lu · Xiang Ding · Haisong Liu · Gangshan Wu · Limin Wang | N/A | Code |
| Hunting Sparsity: Density-Guided Contrastive Learning for Semi-Supervised Semantic Segmentation | Xiaoyang Wang · Bingfeng Zhang · Limin Yu · Jimin Xiao | N/A | Code |
| Collecting Cross-Modal Presence-Absence Evidence for Weakly-Supervised Audio-Visual Event Perception | Junyu Gao · Mengyuan Chen · Changsheng Xu | N/A | Code |
| LargeKernel3D: Scaling Up Kernels in 3D Sparse CNNs | Yukang Chen · Jianhui Liu · Xiangyu Zhang · Xiaojuan Qi · Jiaya Jia | N/A | Code |
| Deep Learning of Partial Graph Matching via Differentiable Top-K | Runzhong Wang · Ziao Guo · Shaofei Jiang · Xiaokang Yang · Junchi Yan | N/A | Code |
| Analyzing Physical Impacts Using Transient Surface Wave Imaging | Tianyuan Zhang · Mark Sheinin · Dorian Chan · Mark Rau · Matthew O’Toole · Srinivasa G. Narasimhan | N/A | Code |
| Rethinking Domain Generalization for Face Anti-Spoofing: Separability and Alignment | Yiyou Sun · Yaojie Liu · Xiaoming Liu · Yixuan Li · Wen-Sheng Chu | N/A | Code |
| A Simple Baseline for Video Restoration With Grouped Spatial-Temporal Shift | Dasong Li · Xiaoyu Shi · Yi Zhang · Ka Chun Cheung · Simon See · Xiaogang Wang · Hongwei Qin · Hongsheng Li | N/A | Code |
| The ObjectFolder Benchmark: Multisensory Learning With Neural and Real Objects | Ruohan Gao · Yiming Dou · Hao Li · Tanmay Agarwal · Jeannette Bohg · Yunzhu Li · Li Fei-Fei · Jiajun Wu | N/A | Code |
| PIRLNav: Pretraining With Imitation and RL Finetuning for ObjectNav | Ram Ramrakhya · Dhruv Batra · Erik Wijmans · Abhishek Das | N/A | Code |
| DC2: Dual-Camera Defocus Control by Learning To Refocus | Hadi Alzayer · Abdullah Abuolaim · Leung Chun Chan · Yang Yang · Ying Chen Lou · Jia-Bin Huang · Abhishek Kar | N/A | Code |
| Habitat-Matterport 3D Semantics Dataset | Karmesh Yadav · Ram Ramrakhya · Santhosh Kumar Ramakrishnan · Theo Gervet · John Turner · Aaron Gokaslan · Noah Maestre · Angel Xuan Chang · Dhruv Batra · Manolis Savva · Alexander William Clegg · Devendra Singh Chaplot | N/A | Code |
| Prompting Large Language Models With Answer Heuristics for Knowledge-Based Visual Question Answering | Zhenwei Shao · Zhou Yu · Meng Wang · Jun Yu | N/A | Code |
| Similarity Metric Learning for RGB-Infrared Group Re-Identification | Jianghao Xiong · Jianhuang Lai | N/A | Code |
| DPF: Learning Dense Prediction Fields With Weak Supervision | Xiaoxue Chen · Yuhang Zheng · Yupeng Zheng · Qiang Zhou · Hao Zhao · Guyue Zhou · Ya-Qin Zhang | N/A | Code |
| Mixed Autoencoder for Self-Supervised Visual Representation Learning | Kai Chen · Zhili Liu · Lanqing Hong · Hang Xu · Zhenguo Li · Dit-Yan Yeung | N/A | Code |
| Content-Aware Token Sharing for Efficient Semantic Segmentation With Vision Transformers | Chenyang Lu · Daan de Geus · Gijs Dubbelman | N/A | Code |
| NeuralEditor: Editing Neural Radiance Fields via Manipulating Point Clouds | Jun-Kun Chen · Jipeng Lyu · Yu-Xiong Wang | N/A | Code |
| Multiview Compressive Coding for 3D Reconstruction | Chao-Yuan Wu · Justin Johnson · Jitendra Malik · Christoph Feichtenhofer · Georgia Gkioxari | N/A | Code |
| Revisiting Weak-to-Strong Consistency in Semi-Supervised Semantic Segmentation | Lihe Yang · Lei Qi · Litong Feng · Wayne Zhang · Yinghuan Shi | N/A | Code |
| Delving Into Shape-Aware Zero-Shot Semantic Segmentation | Xinyu Liu · Beiwen Tian · Zhen Wang · Rui Wang · Kehua Sheng · Bo Zhang · Hao Zhao · Guyue Zhou | N/A | Code |
| Towards a Smaller Student: Capacity Dynamic Distillation for Efficient Image Retrieval | Yi Xie · Huaidong Zhang · Xuemiao Xu · Jianqing Zhu · Shengfeng He | N/A | Code |
| Bootstrapping Objectness From Videos by Relaxed Common Fate and Visual Grouping | Long Lian · Zhirong Wu · Stella X. Yu | N/A | Code |
| NeuralPCI: Spatio-Temporal Neural Field for 3D Point Cloud Multi-Frame Non-Linear Interpolation | Zehan Zheng · Danni Wu · Ruisi Lu · Fan Lu · Guang Chen · Changjun Jiang | N/A | Code |
| Complete-to-Partial 4D Distillation for Self-Supervised Point Cloud Sequence Representation Learning | Zhuoyang Zhang · Yuhao Dong · Yunze Liu · Li Yi | N/A | Code |
| GFIE: A Dataset and Baseline for Gaze-Following From 2D to 3D in Indoor Environments | Zhengxi Hu · Yuxue Yang · Xiaolin Zhai · Dingye Yang · Bohan Zhou · Jingtai Liu | N/A | Code |
| Cap4Video: What Can Auxiliary Captions Do for Text-Video Retrieval? | Wenhao Wu · Haipeng Luo · Bo Fang · Jingdong Wang · Wanli Ouyang | N/A | Code |
| Hierarchical Temporal Transformer for 3D Hand Pose Estimation and Action Recognition From Egocentric RGB Videos | Yilin Wen · Hao Pan · Lei Yang · Jia Pan · Taku Komura · Wenping Wang | N/A | Code |
| CAP-VSTNet: Content Affinity Preserved Versatile Style Transfer | Linfeng Wen · Chengying Gao · Changqing Zou | N/A | Code |
| Uncurated Image-Text Datasets: Shedding Light on Demographic Bias | Noa Garcia · Yusuke Hirota · Yankun Wu · Yuta Nakashima | N/A | Code |
| AltFreezing for More General Video Face Forgery Detection | Zhendong Wang · Jianmin Bao · Wengang Zhou · Weilun Wang · Houqiang Li | N/A | Code |
| Two-View Geometry Scoring Without Correspondences | Axel Barroso-Laguna · Eric Brachmann · Victor Adrian Prisacariu · Gabriel J. Brostow · Daniyar Turmukhambetov | N/A | Code |
| Task Difficulty Aware Parameter Allocation & Regularization for Lifelong Learning | Wenjin Wang · Yunqing Hu · Qianglong Chen · Yin Zhang | N/A | Code |
| Revisiting Prototypical Network for Cross Domain Few-Shot Learning | Fei Zhou · Peng Wang · Lei Zhang · Wei Wei · Yanning Zhang | N/A | Code |
| Federated Incremental Semantic Segmentation | Jiahua Dong · Duzhen Zhang · Yang Cong · Wei Cong · Henghui Ding · Dengxin Dai | N/A | Code |
| Self-Supervised Learning for Multimodal Non-Rigid 3D Shape Matching | Dongliang Cao · Florian Bernard | N/A | Code |
| Exploring the Relationship Between Architectural Design and Adversarially Robust Generalization | Aishan Liu · Shiyu Tang · Siyuan Liang · Ruihao Gong · Boxi Wu · Xianglong Liu · Dacheng Tao | N/A | Code |
| Video-Text As Game Players: Hierarchical Banzhaf Interaction for Cross-Modal Representation Learning | Peng Jin · Jinfa Huang · Pengfei Xiong · Shangxuan Tian · Chang Liu · Xiangyang Ji · Li Yuan · Jie Chen | N/A | Code |
| pCON: Polarimetric Coordinate Networks for Neural Scene Representations | Henry Peters · Yunhao Ba · Achuta Kadambi | N/A | Code |
| RIAV-MVS: Recurrent-Indexing an Asymmetric Volume for Multi-View Stereo | Changjiang Cai · Pan Ji · Qingan Yan · Yi Xu | N/A | Code |
| Depth Estimation From Camera Image and mmWave Radar Point Cloud | Akash Deep Singh · Yunhao Ba · Ankur Sarker · Howard Zhang · Achuta Kadambi · Stefano Soatto · Mani Srivastava · Alex Wong | N/A | Code |
| Normal-Guided Garment UV Prediction for Human Re-Texturing | Yasamin Jafarian · Tuanfeng Y. Wang · Duygu Ceylan · Jimei Yang · Nathan Carr · Yi Zhou · Hyun Soo Park | N/A | Code |
| WeatherStream: Light Transport Automation of Single Image Deweathering | Howard Zhang · Yunhao Ba · Ethan Yang · Varan Mehra · Blake Gella · Akira Suzuki · Arnold Pfahnl · Chethan Chinder Chandrappa · Alex Wong · Achuta Kadambi | N/A | Code |
| MobileBrick: Building LEGO for 3D Reconstruction on Mobile Devices | Kejie Li · Jia-Wang Bian · Robert Castle · Philip H.S. Torr · Victor Adrian Prisacariu | N/A | Code |
| Ultrahigh Resolution Image/Video Matting With Spatio-Temporal Sparsity | Yanan Sun · Chi-Keung Tang · Yu-Wing Tai | N/A | Code |
| Hierarchical Supervision and Shuffle Data Augmentation for 3D Semi-Supervised Object Detection | Chuandong Liu · Chenqiang Gao · Fangcen Liu · Pengcheng Li · Deyu Meng · Xinbo Gao | N/A | Code |
| PATS: Patch Area Transportation With Subdivision for Local Feature Matching | Junjie Ni · Yijin Li · Zhaoyang Huang · Hongsheng Li · Hujun Bao · Zhaopeng Cui · Guofeng Zhang | N/A | Code |
| SINE: Semantic-Driven Image-Based NeRF Editing With Prior-Guided Editing Field | Chong Bao · Yinda Zhang · Bangbang Yang · Tianxing Fan · Zesong Yang · Hujun Bao · Guofeng Zhang · Zhaopeng Cui | N/A | Code |
| GeoNet: Benchmarking Unsupervised Adaptation Across Geographies | Tarun Kalluri · Wangdong Xu · Manmohan Chandraker | N/A | Code |
| Joint HDR Denoising and Fusion: A Real-World Mobile HDR Image Dataset | Shuaizheng Liu · Xindong Zhang · Lingchen Sun · Zhetong Liang · Hui Zeng · Lei Zhang | N/A | Code |
| 3D-Aware Object Goal Navigation via Simultaneous Exploration and Identification | Jiazhao Zhang · Liu Dai · Fanpeng Meng · Qingnan Fan · Xuelin Chen · Kai Xu · He Wang | N/A | Code |
| Delving Into Discrete Normalizing Flows on SO(3) Manifold for Probabilistic Rotation Modeling | Yulin Liu · Haoran Liu · Yingda Yin · Yang Wang · Baoquan Chen · He Wang | N/A | Code |
| RILS: Masked Visual Reconstruction in Language Semantic Space | Shusheng Yang · Yixiao Ge · Kun Yi · Dian Li · Ying Shan · Xiaohu Qie · Xinggang Wang | N/A | Code |
| ConQueR: Query Contrast Voxel-DETR for 3D Object Detection | Benjin Zhu · Zhe Wang · Shaoshuai Shi · Hang Xu · Lanqing Hong · Hongsheng Li | N/A | Code |
| PREIM3D: 3D Consistent Precise Image Attribute Editing From a Single Image | Jianhui Li · Jianmin Li · Haoji Zhang · Shilong Liu · Zhengyi Wang · Zihao Xiao · Kaiwen Zheng · Jun Zhu | N/A | Code |
| Bridging Search Region Interaction With Template for RGB-T Tracking | Tianrui Hui · Zizheng Xun · Fengguang Peng · Junshi Huang · Xiaoming Wei · Xiaolin Wei · Jiao Dai · Jizhong Han · Si Liu | N/A | Code |
| Improving Weakly Supervised Temporal Action Localization by Bridging Train-Test Gap in Pseudo Labels | Jingqiu Zhou · Linjiang Huang · Liang Wang · Si Liu · Hongsheng Li | N/A | Code |
| Learning To Zoom and Unzoom | Chittesh Thavamani · Mengtian Li · Francesco Ferroni · Deva Ramanan | N/A | Code |
| MaLP: Manipulation Localization Using a Proactive Scheme | Vishal Asnani · Xi Yin · Tal Hassner · Xiaoming Liu | N/A | Code |
| Logical Consistency and Greater Descriptive Power for Facial Hair Attribute Learning | Haiyu Wu · Grace Bezold · Aman Bhatta · Kevin W. Bowyer | N/A | Code |
| Visual-Tactile Sensing for In-Hand Object Reconstruction | Wenqiang Xu · Zhenjun Yu · Han Xue · Ruolin Ye · Siqiong Yao · Cewu Lu | N/A | Code |
| Filtering, Distillation, and Hard Negatives for Vision-Language Pre-Training | Filip Radenovic · Abhimanyu Dubey · Abhishek Kadian · Todor Mihaylov · Simon Vandenhende · Yash Patel · Yi Wen · Vignesh Ramanathan · Dhruv Mahajan | N/A | Code |
| Semi-Supervised Domain Adaptation With Source Label Adaptation | Yu-Chu Yu · Hsuan-Tien Lin | N/A | Code |
| Self-Supervised Video Forensics by Audio-Visual Anomaly Detection | Chao Feng · Ziyang Chen · Andrew Owens | N/A | Code |
| IterativePFN: True Iterative Point Cloud Filtering | Dasith de Silva Edirimuni · Xuequan Lu · Zhiwen Shao · Gang Li · Antonio Robles-Kelly · Ying He | N/A | Code |
| Joint Video Multi-Frame Interpolation and Deblurring Under Unknown Exposure Time | Wei Shang · Dongwei Ren · Yi Yang · Hongzhi Zhang · Kede Ma · Wangmeng Zuo | N/A | Code |
| Three Guidelines You Should Know for Universally Slimmable Self-Supervised Learning | Yun-Hao Cao · Peiqin Sun · Shuchang Zhou | N/A | Code |
| Standing Between Past and Future: Spatio-Temporal Modeling for Multi-Camera 3D Multi-Object Tracking | Ziqi Pang · Jie Li · Pavel Tokmakov · Dian Chen · Sergey Zagoruyko · Yu-Xiong Wang | N/A | Code |
| VecFontSDF: Learning To Reconstruct and Synthesize High-Quality Vector Fonts via Signed Distance Functions | Zeqing Xia · Bojun Xiong · Zhouhui Lian | N/A | Code |
| Towards Better Gradient Consistency for Neural Signed Distance Functions via Level Set Alignment | Baorui Ma · Junsheng Zhou · Yu-Shen Liu · Zhizhong Han | N/A | Code |
| Visual-Language Prompt Tuning With Knowledge-Guided Context Optimization | Hantao Yao · Rui Zhang · Changsheng Xu | N/A | Code |
| Compositor: Bottom-Up Clustering and Compositing for Robust Part and Object Segmentation | Ju He · Jieneng Chen · Ming-Xian Lin · Qihang Yu · Alan L. Yuille | N/A | Code |
| Physics-Guided ISO-Dependent Sensor Noise Modeling for Extreme Low-Light Photography | Yue Cao · Ming Liu · Shuai Liu · Xiaotao Wang · Lei Lei · Wangmeng Zuo | N/A | Code |
| Dynamic Focus-Aware Positional Queries for Semantic Segmentation | Haoyu He · Jianfei Cai · Zizheng Pan · Jing Liu · Jing Zhang · Dacheng Tao · Bohan Zhuang | N/A | Code |
| Generic-to-Specific Distillation of Masked Autoencoders | Wei Huang · Zhiliang Peng · Li Dong · Furu Wei · Jianbin Jiao · Qixiang Ye | N/A | Code |
| Benchmarking Robustness of 3D Object Detection to Common Corruptions | Yinpeng Dong · Caixin Kang · Jinlai Zhang · Zijian Zhu · Yikai Wang · Xiao Yang · Hang Su · Xingxing Wei · Jun Zhu | N/A | Code |
| GarmentTracking: Category-Level Garment Pose Tracking | Han Xue · Wenqiang Xu · Jieyi Zhang · Tutian Tang · Yutong Li · Wenxin Du · Ruolin Ye · Cewu Lu | N/A | Code |
| TrojDiff: Trojan Attacks on Diffusion Models With Diverse Targets | Weixin Chen · Dawn Song · Bo Li | N/A | Code |
| Weakly Supervised Video Representation Learning With Unaligned Text for Sequential Videos | Sixun Dong · Huazhang Hu · Dongze Lian · Weixin Luo · Yicheng Qian · Shenghua Gao | N/A | Code |
| Generalized Deep 3D Shape Prior via Part-Discretized Diffusion Process | Yuhan Li · Yishun Dou · Xuanhong Chen · Bingbing Ni · Yilin Sun · Yutian Liu · Fuzhen Wang | N/A | Code |
| SpaText: Spatio-Textual Representation for Controllable Image Generation | Omri Avrahami · Thomas Hayes · Oran Gafni · Sonal Gupta · Yaniv Taigman · Devi Parikh · Dani Lischinski · Ohad Fried · Xi Yin | N/A | Code |
| Watch or Listen: Robust Audio-Visual Speech Recognition With Visual Corruption Modeling and Reliability Scoring | Joanna Hong · Minsu Kim · Jeongsoo Choi · Yong Man Ro | N/A | Code |
| RenderDiffusion: Image Diffusion for 3D Reconstruction, Inpainting and Generation | Titas Anciukevičius · Zexiang Xu · Matthew Fisher · Paul Henderson · Hakan Bilen · Niloy J. Mitra · Paul Guerrero | N/A | Code |
| Self-Supervised 3D Scene Flow Estimation Guided by Superpoints | Yaqi Shen · Le Hui · Jin Xie · Jian Yang | N/A | Code |
| Adaptive Annealing for Robust Geometric Estimation | Chitturi Sidhartha · Lalit Manam · Venu Madhav Govindu | N/A | Code |
| Spectral Enhanced Rectangle Transformer for Hyperspectral Image Denoising | Miaoyu Li · Ji Liu · Ying Fu · Yulun Zhang · Dejing Dou | N/A | Code |
| Partial Network Cloning | Jingwen Ye · Songhua Liu · Xinchao Wang | N/A | Code |
| Twin Contrastive Learning With Noisy Labels | Zhizhong Huang · Junping Zhang · Hongming Shan | N/A | Code |
| Ambiguous Medical Image Segmentation Using Diffusion Models | Aimon Rahman · Jeya Maria Jose Valanarasu · Ilker Hacihaliloglu · Vishal M. Patel | N/A | Code |
| High-Res Facial Appearance Capture From Polarized Smartphone Images | Dejan Azinović · Olivier Maury · Christophe Hery · Matthias Nießner · Justus Thies | N/A | Code |
| AssemblyHands: Towards Egocentric Activity Understanding via 3D Hand Pose Estimation | Takehiko Ohkawa · Kun He · Fadime Sener · Tomas Hodan · Luan Tran · Cem Keskin | N/A | Code |
| EXIF As Language: Learning Cross-Modal Associations Between Images and Camera Metadata | Chenhao Zheng · Ayush Shrivastava · Andrew Owens | N/A | Code |
| Latent-NeRF for Shape-Guided Generation of 3D Shapes and Textures | Gal Metzer · Elad Richardson · Or Patashnik · Raja Giryes · Daniel Cohen-Or | N/A | Code |
| Rebalancing Batch Normalization for Exemplar-Based Class-Incremental Learning | Sungmin Cha · Sungjun Cho · Dasol Hwang · Sunwon Hong · Moontae Lee · Taesup Moon | N/A | Code |
| Progressive Neighbor Consistency Mining for Correspondence Pruning | Xin Liu · Jufeng Yang | N/A | Code |
| Post-Training Quantization on Diffusion Models | Yuzhang Shang · Zhihang Yuan · Bin Xie · Bingzhe Wu · Yan Yan | N/A | Code |
| Fully Self-Supervised Depth Estimation From Defocus Clue | Haozhe Si · Bin Zhao · Dong Wang · Yunpeng Gao · Mulin Chen · Zhigang Wang · Xuelong Li | N/A | Code |
| Curricular Object Manipulation in LiDAR-Based Object Detection | Ziyue Zhu · Qiang Meng · Xiao Wang · Ke Wang · Liujiang Yan · Jian Yang | N/A | Code |
| Adaptive Assignment for Geometry Aware Local Feature Matching | Dihe Huang · Ying Chen · Yong Liu · Jianlin Liu · Shang Xu · Wenlong Wu · Yikang Ding · Fan Tang · Chengjie Wang | N/A | Code |
| RefCLIP: A Universal Teacher for Weakly Supervised Referring Expression Comprehension | Lei Jin · Gen Luo · Yiyi Zhou · Xiaoshuai Sun · Guannan Jiang · Annan Shu · Rongrong Ji | N/A | Code |
| ANetQA: A Large-Scale Benchmark for Fine-Grained Compositional Reasoning Over Untrimmed Videos | Zhou Yu · Lixiang Zheng · Zhou Zhao · Fei Wu · Jianping Fan · Kui Ren · Jun Yu | N/A | Code |
| GD-MAE: Generative Decoder for MAE Pre-Training on LiDAR Point Clouds | Honghui Yang · Tong He · Jiaheng Liu · Hua Chen · Boxi Wu · Binbin Lin · Xiaofei He · Wanli Ouyang | N/A | Code |
| Multimodal Industrial Anomaly Detection via Hybrid Fusion | Yue Wang · Jinlong Peng · Jiangning Zhang · Ran Yi · Yabiao Wang · Chengjie Wang | N/A | Code |
| B-Spline Texture Coefficients Estimator for Screen Content Image Super-Resolution | Byeonghyun Pak · Jaewon Lee · Kyong Hwan Jin | N/A | Code |
| CLIP Is Also an Efficient Segmenter: A Text-Driven Approach for Weakly Supervised Semantic Segmentation | Yuqi Lin · Minghao Chen · Wenxiao Wang · Boxi Wu · Ke Li · Binbin Lin · Haifeng Liu · Xiaofei He | N/A | Code |
| MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Generation | Ludan Ruan · Yiyang Ma · Huan Yang · Huiguo He · Bei Liu · Jianlong Fu · Nicholas Jing Yuan · Qin Jin · Baining Guo | N/A | Code |
| FreeNeRF: Improving Few-Shot Neural Rendering With Free Frequency Regularization | Jiawei Yang · Marco Pavone · Yue Wang | N/A | Code |
| SteerNeRF: Accelerating NeRF Rendering via Smooth Viewpoint Trajectory | Sicheng Li · Hao Li · Yue Wang · Yiyi Liao · Lu Yu | N/A | Code |
| Run, Don’t Walk: Chasing Higher FLOPS for Faster Neural Networks | Jierun Chen · Shiu-hong Kao · Hao He · Weipeng Zhuo · Song Wen · Chul-Ho Lee · S.-H. Gary Chan | N/A | Code |
| Temporal Attention Unit: Towards Efficient Spatiotemporal Predictive Learning | Cheng Tan · Zhangyang Gao · Lirong Wu · Yongjie Xu · Jun Xia · Siyuan Li · Stan Z. Li | N/A | Code |
| Semi-Supervised 2D Human Pose Estimation Driven by Position Inconsistency Pseudo Label Correction Module | Linzhi Huang · Yulong Li · Hongbo Tian · Yue Yang · Xiangang Li · Weihong Deng · Jieping Ye | N/A | Code |
| Good Is Bad: Causality Inspired Cloth-Debiasing for Cloth-Changing Person Re-Identification | Zhengwei Yang · Meng Lin · Xian Zhong · Yu Wu · Zheng Wang | N/A | Code |
| Feature Alignment and Uniformity for Test Time Adaptation | Shuai Wang · Daoan Zhang · Zipei Yan · Jianguo Zhang · Rui Li | N/A | Code |
| AeDet: Azimuth-Invariant Multi-View 3D Object Detection | Chengjian Feng · Zequn Jie · Yujie Zhong · Xiangxiang Chu · Lin Ma | N/A | Code |
| Towards Realistic Long-Tailed Semi-Supervised Learning: Consistency Is All You Need | Tong Wei · Kai Gan | N/A | Code |
| OmniAL: A Unified CNN Framework for Unsupervised Anomaly Localization | Ying Zhao | N/A | Code |
| HIER: Metric Learning Beyond Class Labels via Hierarchical Regularization | Sungyeon Kim · Boseung Jeong · Suha Kwak | N/A | Code |
| Generative Diffusion Prior for Unified Image Restoration and Enhancement | Ben Fei · Zhaoyang Lyu · Liang Pan · Junzhe Zhang · Weidong Yang · Tianyue Luo · Bo Zhang · Bo Dai | N/A | Code |
| Discriminating Known From Unknown Objects via Structure-Enhanced Recurrent Variational AutoEncoder | Aming Wu · Cheng Deng | N/A | Code |
| 2PCNet: Two-Phase Consistency Training for Day-to-Night Unsupervised Domain Adaptive Object Detection | Mikhail Kennerley · Jian-Gang Wang · Bharadwaj Veeravalli · Robby T. Tan | N/A | Code |
| Linking Garment With Person via Semantically Associated Landmarks for Virtual Try-On | Keyu Yan · Tingwei Gao · Hui Zhang · Chengjun Xie | N/A | Code |
| A New Comprehensive Benchmark for Semi-Supervised Video Anomaly Detection and Anticipation | Congqi Cao · Yue Lu · Peng Wang · Yanning Zhang | N/A | Code |
| DINER: Depth-Aware Image-Based NEural Radiance Fields | Malte Prinzler · Otmar Hilliges · Justus Thies | N/A | Code |
| Learning Personalized High Quality Volumetric Head Avatars From Monocular RGB Videos | Ziqian Bai · Feitong Tan · Zeng Huang · Kripasindhu Sarkar · Danhang Tang · Di Qiu · Abhimitra Meka · Ruofei Du · Mingsong Dou · Sergio Orts-Escolano · Rohit Pandey · Ping Tan · Thabo Beeler · Sean Fanello · Yinda Zhang | N/A | Code |
| HOOD: Hierarchical Graphs for Generalized Modelling of Clothing Dynamics | Artur Grigorev · Michael J. Black · Otmar Hilliges | N/A | Code |
| Boundary-Enhanced Co-Training for Weakly Supervised Semantic Segmentation | Shenghai Rong · Bohai Tu · Zilei Wang · Junjie Li | N/A | Code |
| Instant Volumetric Head Avatars | Wojciech Zielonka · Timo Bolkart · Justus Thies | N/A | Code |
| From Node Interaction To Hop Interaction: New Effective and Scalable Graph Learning Paradigm | Jie Chen · Zilong Li · Yin Zhu · Junping Zhang · Jian Pu | N/A | Code |
| Transfer4D: A Framework for Frugal Motion Capture and Deformation Transfer | Shubh Maheshwari · Rahul Narain · Ramya Hebbalaguppe | N/A | Code |
| An In-Depth Exploration of Person Re-Identification and Gait Recognition in Cloth-Changing Conditions | Weijia Li · Saihui Hou · Chunjie Zhang · Chunshui Cao · Xu Liu · Yongzhen Huang · Yao Zhao | N/A | Code |
| Event-Based Shape From Polarization | Manasi Muglikar · Leonard Bauersfeld · Diederik Paul Moeys · Davide Scaramuzza | N/A | Code |
| Plateau-Reduced Differentiable Path Tracing | Michael Fischer · Tobias Ritschel | N/A | Code |
| End-to-End Video Matting With Trimap Propagation | Wei-Lun Huang · Ming-Sui Lee | N/A | Code |
| Weakly-Supervised Single-View Image Relighting | Renjiao Yi · Chenyang Zhu · Kai Xu | N/A | Code |
| Learning Audio-Visual Source Localization via False Negative Aware Contrastive Learning | Weixuan Sun · Jiayi Zhang · Jianyuan Wang · Zheyuan Liu · Yiran Zhong · Tianpeng Feng · Yandong Guo · Yanhao Zhang · Nick Barnes | N/A | Code |
| Non-Contrastive Unsupervised Learning of Physiological Signals From Video | Jeremy Speth · Nathan Vance · Patrick Flynn · Adam Czajka | N/A | Code |
| Structured Sparsity Learning for Efficient Video Super-Resolution | Bin Xia · Jingwen He · Yulun Zhang · Yitong Wang · Yapeng Tian · Wenming Yang · Luc Van Gool | N/A | Code |
| ReVISE: Self-Supervised Speech Resynthesis With Visual Input for Universal and Generalized Speech Regeneration | Wei-Ning Hsu · Tal Remez · Bowen Shi · Jacob Donley · Yossi Adi | N/A | Code |
| Shape, Pose, and Appearance From a Single Image via Bootstrapped Radiance Field Inversion | Dario Pavllo · David Joseph Tan · Marie-Julie Rakotosaona · Federico Tombari | N/A | Code |
| Towards Better Decision Forests: Forest Alternating Optimization | Miguel Á. Carreira-Perpiñán · Magzhan Gabidolla · Arman Zharmagambetov | N/A | Code |
| CrOC: Cross-View Online Clustering for Dense Visual Representation Learning | Thomas Stegmüller · Tim Lebailly · Behzad Bozorgtabar · Tinne Tuytelaars · Jean-Philippe Thiran | N/A | Code |
| Polynomial Implicit Neural Representations for Large Diverse Datasets | Rajhans Singh · Ankita Shukla · Pavan Turaga | N/A | Code |
| GradICON: Approximate Diffeomorphisms via Gradient Inverse Consistency | Lin Tian · Hastings Greer · François-Xavier Vialard · Roland Kwitt · Raúl San José Estépar · Richard Jarrett Rushmore · Nikolaos Makris · Sylvain Bouix · Marc Niethammer | N/A | Code |
| Exploring Discontinuity for Video Frame Interpolation | Sangjin Lee · Hyeongmin Lee · Chajin Shin · Hanbin Son · Sangyoun Lee | N/A | Code |
| Local 3D Editing via 3D Distillation of CLIP Knowledge | Junha Hyung · Sungwon Hwang · Daejin Kim · Hyunji Lee · Jaegul Choo | N/A | Code |
| Dynamic Conceptional Contrastive Learning for Generalized Category Discovery | Nan Pu · Zhun Zhong · Nicu Sebe | N/A | Code |
| Look, Radiate, and Learn: Self-Supervised Localisation via Radio-Visual Correspondence | Mohammed Alloulah · Maximilian Arnold | N/A | Code |
| Instance Relation Graph Guided Source-Free Domain Adaptive Object Detection | Vibashan VS · Poojan Oza · Vishal M. Patel | N/A | Code |
| High Fidelity 3D Hand Shape Reconstruction via Scalable Graph Frequency Decomposition | Tianyu Luan · Yuanhao Zhai · Jingjing Meng · Zhong Li · Zhang Chen · Yi Xu · Junsong Yuan | N/A | Code |
| 3D Highlighter: Localizing Regions on 3D Shapes via Text Descriptions | Dale Decatur · Itai Lang · Rana Hanocka | N/A | Code |
| Egocentric Video Task Translation | Zihui Xue · Yale Song · Kristen Grauman · Lorenzo Torresani | N/A | Code |
| Pixels, Regions, and Objects: Multiple Enhancement for Salient Object Detection | Yi Wang · Ruili Wang · Xin Fan · Tianzhu Wang · Xiangjian He | N/A | Code |
| Balanced Energy Regularization Loss for Out-of-Distribution Detection | Hyunjun Choi · Hawook Jeong · Jin Young Choi | N/A | Code |
| Private Image Generation With Dual-Purpose Auxiliary Classifier | Chen Chen · Daochang Liu · Siqi Ma · Surya Nepal · Chang Xu | N/A | Code |
| Controllable Mesh Generation Through Sparse Latent Point Diffusion Models | Zhaoyang Lyu · Jinyi Wang · Yuwei An · Ya Zhang · Dahua Lin · Bo Dai | N/A | Code |
| Neural Video Compression With Diverse Contexts | Jiahao Li · Bin Li · Yan Lu | N/A | Code |
| Uni3D: A Unified Baseline for Multi-Dataset 3D Object Detection | Bo Zhang · Jiakang Yuan · Botian Shi · Tao Chen · Yikang Li · Yu Qiao | N/A | Code |
| ScarceNet: Animal Pose Estimation With Scarce Annotations | Chen Li · Gim Hee Lee | N/A | Code |
| Fast Contextual Scene Graph Generation With Unbiased Context Augmentation | Tianlei Jin · Fangtai Guo · Qiwei Meng · Shiqiang Zhu · Xiangming Xi · Wen Wang · Zonghao Mu · Wei Song | N/A | Code |
| TriDet: Temporal Action Detection With Relative Boundary Modeling | Dingfeng Shi · Yujie Zhong · Qiong Cao · Lin Ma · Jia Li · Dacheng Tao | N/A | Code |
| Multi-Level Logit Distillation | Ying Jin · Jiaqi Wang · Dahua Lin | N/A | Code |
| StyleAdv: Meta Style Adversarial Training for Cross-Domain Few-Shot Learning | Yuqian Fu · Yu Xie · Yanwei Fu · Yu-Gang Jiang | N/A | Code |
| Text With Knowledge Graph Augmented Transformer for Video Captioning | Xin Gu · Guang Chen · Yufei Wang · Libo Zhang · Tiejian Luo · Longyin Wen | N/A | Code |
| Semantic Ray: Learning a Generalizable Semantic Field With Cross-Reprojection Attention | Fangfu Liu · Chubin Zhang · Yu Zheng · Yueqi Duan | N/A | Code |
| MELTR: Meta Loss Transformer for Learning To Fine-Tune Video Foundation Models | Dohwan Ko · Joonmyung Choi · Hyeong Kyu Choi · Kyoung-Woon On · Byungseok Roh · Hyunwoo J. Kim | N/A | Code |
| Self-Supervised AutoFlow | Hsin-Ping Huang · Charles Herrmann · Junhwa Hur · Erika Lu · Kyle Sargent · Austin Stone · Ming-Hsuan Yang · Deqing Sun | N/A | Code |
| Adaptive Sparse Convolutional Networks With Global Context Enhancement for Faster Object Detection on Drone Images | Bowei Du · Yecheng Huang · Jiaxin Chen · Di Huang | N/A | Code |
| Context-Based Trit-Plane Coding for Progressive Image Compression | Seungmin Jeon · Kwang Pyo Choi · Youngo Park · Chang-Su Kim | N/A | Code |
| Unsupervised Contour Tracking of Live Cells by Mechanical and Cycle Consistency Losses | Junbong Jang · Kwonmoo Lee · Tae-Kyun Kim | N/A | Code |
| VQACL: A Novel Visual Question Answering Continual Learning Setting | Xi Zhang · Feifei Zhang · Changsheng Xu | N/A | Code |
| Explicit Visual Prompting for Low-Level Structure Segmentations | Weihuang Liu · Xi Shen · Chi-Man Pun · Xiaodong Cun | N/A | Code |
| Practical Network Acceleration With Tiny Sets | Guo-Hua Wang · Jianxin Wu | N/A | Code |
| Sphere-Guided Training of Neural Implicit Surfaces | Andreea Dogaru · Andrei-Timotei Ardelean · Savva Ignatyev · Egor Zakharov · Evgeny Burnaev | N/A | Code |
| Texture-Guided Saliency Distilling for Unsupervised Salient Object Detection | Huajun Zhou · Bo Qiao · Lingxiao Yang · Jianhuang Lai · Xiaohua Xie | N/A | Code |
| FFHQ-UV: Normalized Facial UV-Texture Dataset for 3D Face Reconstruction | Haoran Bai · Di Kang · Haoxian Zhang · Jinshan Pan · Linchao Bao | N/A | Code |
| Differentiable Shadow Mapping for Efficient Inverse Graphics | Markus Worchel · Marc Alexa | N/A | Code |
| SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation | Wenxuan Zhang · Xiaodong Cun · Xuan Wang · Yong Zhang · Xi Shen · Yu Guo · Ying Shan · Fei Wang | N/A | Code |
| High-Fidelity and Freely Controllable Talking Head Video Generation | Yue Gao · Yuan Zhou · Jinglu Wang · Xiao Li · Xiang Ming · Yan Lu | N/A | Code |
| BiFormer: Learning Bilateral Motion Estimation via Bilateral Transformer for 4K Video Frame Interpolation | Junheum Park · Jintae Kim · Chang-Su Kim | N/A | Code |
| Noisy Correspondence Learning With Meta Similarity Correction | Haochen Han · Kaiyao Miao · Qinghua Zheng · Minnan Luo | N/A | Code |
| EVAL: Explainable Video Anomaly Localization | Ashish Singh · Michael J. Jones · Erik G. Learned-Miller | N/A | Code |
| Adaptive Plasticity Improvement for Continual Learning | Yan-Shuo Liang · Wu-Jun Li | N/A | Code |
| Edges to Shapes to Concepts: Adversarial Augmentation for Robust Vision | Aditay Tripathi · Rishubh Singh · Anirban Chakraborty · Pradeep Shenoy | N/A | Code |
| MOSO: Decomposing MOtion, Scene and Object for Video Prediction | Mingzhen Sun · Weining Wang · Xinxin Zhu · Jing Liu | N/A | Code |
| Accelerated Coordinate Encoding: Learning to Relocalize in Minutes Using RGB and Poses | Eric Brachmann · Tommaso Cavallari · Victor Adrian Prisacariu | N/A | Code |
| A Probabilistic Attention Model With Occlusion-Aware Texture Regression for 3D Hand Reconstruction From a Single RGB Image | Zheheng Jiang · Hossein Rahmani · Sue Black · Bryan M. Williams | N/A | Code |
| Revisiting Rotation Averaging: Uncertainties and Robust Losses | Ganlin Zhang · Viktor Larsson · Daniel Barath | N/A | Code |
| LiDAR-in-the-Loop Hyperparameter Optimization | Félix Goudreault · Dominik Scheuble · Mario Bijelic · Nicolas Robidoux · Felix Heide | N/A | Code |
| Query-Dependent Video Representation for Moment Retrieval and Highlight Detection | WonJun Moon · Sangeek Hyun · SangUk Park · Dongchan Park · Jae-Pil Heo | N/A | Code |
| High-Fidelity 3D Face Generation From Natural Language Descriptions | Menghua Wu · Hao Zhu · Linjia Huang · Yiyu Zhuang · Yuanxun Lu · Xun Cao | N/A | Code |
| NeRF-Supervised Deep Stereo | Fabio Tosi · Alessio Tonioni · Daniele De Gregorio · Matteo Poggi | N/A | Code |
| vMAP: Vectorised Object Mapping for Neural Field SLAM | Xin Kong · Shikun Liu · Marwan Taher · Andrew J. Davison | N/A | Code |
| DiffRF: Rendering-Guided 3D Radiance Field Diffusion | Norman Müller · Yawar Siddiqui · Lorenzo Porzi · Samuel Rota Bulò · Peter Kontschieder · Matthias Nießner | N/A | Code |
| TokenHPE: Learning Orientation Tokens for Efficient Head Pose Estimation via Transformers | Cheng Zhang · Hai Liu · Yongjian Deng · Bochen Xie · Youfu Li | N/A | Code |
| Learning a Depth Covariance Function | Eric Dexheimer · Andrew J. Davison | N/A | Code |
| Handy: Towards a High Fidelity 3D Hand Shape and Appearance Model | Rolandos Alexandros Potamias · Stylianos Ploumpis · Stylianos Moschoglou · Vasileios Triantafyllou · Stefanos Zafeiriou | N/A | Code |
| The Best Defense Is a Good Offense: Adversarial Augmentation Against Adversarial Attacks | Iuri Frosio · Jan Kautz | N/A | Code |
| Test of Time: Instilling Video-Language Models With a Sense of Time | Piyush Bagad · Makarand Tapaswi · Cees G. M. Snoek | N/A | Code |
| BundleSDF: Neural 6-DoF Tracking and 3D Reconstruction of Unknown Objects | Bowen Wen · Jonathan Tremblay · Valts Blukis · Stephen Tyree · Thomas Müller · Alex Evans · Dieter Fox · Jan Kautz · Stan Birchfield | N/A | Code |
| Leveraging Hidden Positives for Unsupervised Semantic Segmentation | Hyun Seok Seong · WonJun Moon · SuBeen Lee · Jae-Pil Heo | N/A | Code |
| BlendFields: Few-Shot Example-Driven Facial Modeling | Kacper Kania · Stephan J. Garbin · Andrea Tagliasacchi · Virginia Estellers · Kwang Moo Yi · Julien Valentin · Tomasz Trzciński · Marek Kowalski | N/A | Code |
| CIRCLE: Capture in Rich Contextual Environments | João Pedro Araújo · Jiaman Li · Karthik Vetrivel · Rishi Agarwal · Jiajun Wu · Deepak Gopinath · Alexander William Clegg · Karen Liu | N/A | Code |
| Realistic Saliency Guided Image Enhancement | S. Mahdi H. Miangoleh · Zoya Bylinskii · Eric Kee · Eli Shechtman · Yağiz Aksoy | N/A | Code |
| Implicit Neural Head Synthesis via Controllable Local Deformation Fields | Chuhan Chen · Matthew O’Toole · Gaurav Bharaj · Pablo Garrido | N/A | Code |
| Ensemble-Based Blackbox Attacks on Dense Prediction | Zikui Cai · Yaoteng Tan · M. Salman Asif | N/A | Code |
| NaQ: Leveraging Narrations As Queries To Supervise Episodic Memory | Santhosh Kumar Ramakrishnan · Ziad Al-Halah · Kristen Grauman | N/A | Code |
| Rethinking Federated Learning With Domain Shift: A Prototype View | Wenke Huang · Mang Ye · Zekun Shi · He Li · Bo Du | N/A | Code |
| Spatio-Temporal Pixel-Level Contrastive Learning-Based Source-Free Domain Adaptation for Video Semantic Segmentation | Shao-Yuan Lo · Poojan Oza · Sumanth Chennupati · Alejandro Galindo · Vishal M. Patel | N/A | Code |
| Bi3D: Bi-Domain Active Learning for Cross-Domain 3D Object Detection | Jiakang Yuan · Bo Zhang · Xiangchao Yan · Tao Chen · Botian Shi · Yikang Li · Yu Qiao | N/A | Code |
| STAR Loss: Reducing Semantic Ambiguity in Facial Landmark Detection | Zhenglin Zhou · Huaxia Li · Hong Liu · Nanyang Wang · Gang Yu · Rongrong Ji | N/A | Code |
| Diverse 3D Hand Gesture Prediction From Body Dynamics by Bilateral Hand Disentanglement | Xingqun Qi · Chen Liu · Muyi Sun · Lincheng Li · Changjie Fan · Xin Yu | N/A | Code |
| Sparsely Annotated Semantic Segmentation With Adaptive Gaussian Mixtures | Linshan Wu · Zhun Zhong · Leyuan Fang · Xingxin He · Qiang Liu · Jiayi Ma · Hao Chen | N/A | Code |
| Visual Dependency Transformers: Dependency Tree Emerges From Reversed Attention | Mingyu Ding · Yikang Shen · Lijie Fan · Zhenfang Chen · Zitian Chen · Ping Luo · Joshua B. Tenenbaum · Chuang Gan | N/A | Code |
| Frame Flexible Network | Yitian Zhang · Yue Bai · Chang Liu · Huan Wang · Sheng Li · Yun Fu | N/A | Code |
| Uni-Perceiver v2: A Generalist Model for Large-Scale Vision and Vision-Language Tasks | Hao Li · Jinguo Zhu · Xiaohu Jiang · Xizhou Zhu · Hongsheng Li · Chun Yuan · Xiaohua Wang · Yu Qiao · Xiaogang Wang · Wenhai Wang · Jifeng Dai | N/A | Code |
| DCFace: Synthetic Face Generation With Dual Condition Diffusion Model | Minchul Kim · Feng Liu · Anil Jain · Xiaoming Liu | N/A | Code |
| Referring Image Matting | Jizhizi Li · Jing Zhang · Dacheng Tao | N/A | Code |
| Fast Monocular Scene Reconstruction With Global-Sparse Local-Dense Grids | Wei Dong · Christopher Choy · Charles Loop · Or Litany · Yuke Zhu · Anima Anandkumar | N/A | Code |
| DPE: Disentanglement of Pose and Expression for General Video Portrait Editing | Youxin Pang · Yong Zhang · Weize Quan · Yanbo Fan · Xiaodong Cun · Ying Shan · Dong-Ming Yan | N/A | Code |
| IDGI: A Framework To Eliminate Explanation Noise From Integrated Gradients | Ruo Yang · Binghui Wang · Mustafa Bilgic | N/A | Code |
| DynamicDet: A Unified Dynamic Architecture for Object Detection | Zhihao Lin · Yongtao Wang · Jinhe Zhang · Xiaojie Chu | N/A | Code |
| Task-Specific Fine-Tuning via Variational Information Bottleneck for Weakly-Supervised Pathology Whole Slide Image Classification | Honglin Li · Chenglu Zhu · Yunlong Zhang · Yuxuan Sun · Zhongyi Shui · Wenwei Kuang · Sunyi Zheng · Lin Yang | N/A | Code |
| VNE: An Effective Method for Improving Deep Representation by Manipulating Eigenvalue Distribution | Jaeill Kim · Suhyun Kang · Duhun Hwang · Jungwook Shin · Wonjong Rhee | N/A | Code |
| Semi-Weakly Supervised Object Kinematic Motion Prediction | Gengxin Liu · Qian Sun · Haibin Huang · Chongyang Ma · Yulan Guo · Li Yi · Hui Huang · Ruizhen Hu | N/A | Code |
| Computational Flash Photography Through Intrinsics | Sepideh Sarajian Maralan · Chris Careaga · Yağiz Aksoy | N/A | Code |
| Inversion-Based Style Transfer With Diffusion Models | Yuxin Zhang · Nisha Huang · Fan Tang · Haibin Huang · Chongyang Ma · Weiming Dong · Changsheng Xu | N/A | Code |
| Data-Driven Feature Tracking for Event Cameras | Nico Messikommer · Carter Fang · Mathias Gehrig · Davide Scaramuzza | N/A | Code |
| QPGesture: Quantization-Based and Phase-Guided Motion Matching for Natural Speech-Driven Gesture Generation | Sicheng Yang · Zhiyong Wu · Minglei Li · Zhensong Zhang · Lei Hao · Weihong Bao · Haolin Zhuang | N/A | Code |
| Neural Fourier Filter Bank | Zhijie Wu · Yuhe Jin · Kwang Moo Yi | N/A | Code |
| Solving Oscillation Problem in Post-Training Quantization Through a Theoretical Perspective | Yuexiao Ma · Huixia Li · Xiawu Zheng · Xuefeng Xiao · Rui Wang · Shilei Wen · Xin Pan · Fei Chao · Rongrong Ji | N/A | Code |
| Full or Weak Annotations? An Adaptive Strategy for Budget-Constrained Annotation Campaigns | Javier Gamazo Tejero · Martin S. Zinkernagel · Sebastian Wolf · Raphael Sznitman · Pablo Márquez-Neila | N/A | Code |
| Trap Attention: Monocular Depth Estimation With Manual Traps | Chao Ning · Hongping Gan | N/A | Code |
| Physical-World Optical Adversarial Attacks on 3D Face Recognition | Yanjie Li · Yiquan Li · Xuelong Dai · Songtao Guo · Bin Xiao | N/A | Code |
| Re-Thinking Federated Active Learning Based on Inter-Class Diversity | SangMook Kim · Sangmin Bae · Hwanjun Song · Se-Young Yun | N/A | Code |
| EMT-NAS: Transferring Architectural Knowledge Between Tasks From Different Datasets | Peng Liao · Yaochu Jin · Wenli Du | N/A | Code |
| Temporal Consistent 3D LiDAR Representation Learning for Semantic Perception in Autonomous Driving | Lucas Nunes · Louis Wiesmann · Rodrigo Marcuzzi · Xieyuanli Chen · Jens Behley · Cyrill Stachniss | N/A | Code |
| Document Image Shadow Removal Guided by Color-Aware Background | Ling Zhang · Yinghao He · Qing Zhang · Zheng Liu · Xiaolong Zhang · Chunxia Xiao | N/A | Code |
| Pose-Disentangled Contrastive Learning for Self-Supervised Facial Representation | Yuanyuan Liu · Wenbin Wang · Yibing Zhan · Shaoze Feng · Kejun Liu · Zhe Chen | N/A | Code |
| Ham2Pose: Animating Sign Language Notation Into Pose Sequences | Rotem Shalev Arkushin · Amit Moryossef · Ohad Fried | N/A | Code |
| Resource-Efficient RGBD Aerial Tracking | Jinyu Yang · Shang Gao · Zhe Li · Feng Zheng · Aleš Leonardis | N/A | Code |
| Neural Transformation Fields for Arbitrary-Styled Font Generation | Bin Fu · Junjun He · Jianjun Wang · Yu Qiao | N/A | Code |
| Density-Insensitive Unsupervised Domain Adaption on 3D Object Detection | Qianjiang Hu · Daizong Liu · Wei Hu | N/A | Code |
| PAniC-3D: Stylized Single-View 3D Reconstruction From Portraits of Anime Characters | Shuhong Chen · Kevin Zhang · Yichun Shi · Heng Wang · Yiheng Zhu · Guoxian Song · Sizhe An · Janus Kristjansson · Xiao Yang · Matthias Zwicker | N/A | Code |
| HS-Pose: Hybrid Scope Feature Extraction for Category-Level Object Pose Estimation | Linfang Zheng · Chen Wang · Yinghan Sun · Esha Dasgupta · Hua Chen · Aleš Leonardis · Wei Zhang · Hyung Jin Chang | N/A | Code |
| A Hierarchical Representation Network for Accurate and Detailed Face Reconstruction From In-the-Wild Images | Biwen Lei · Jianqiang Ren · Mengyang Feng · Miaomiao Cui · Xuansong Xie | N/A | Code |
| Language in a Bottle: Language Model Guided Concept Bottlenecks for Interpretable Image Classification | Yue Yang · Artemis Panagopoulou · Shenghao Zhou · Daniel Jin · Chris Callison-Burch · Mark Yatskar | N/A | Code |
| SfM-TTR: Using Structure From Motion for Test-Time Refinement of Single-View Depth Networks | Sergio Izquierdo · Javier Civera | N/A | Code |
| TINC: Tree-Structured Implicit Neural Compression | Runzhao Yang | N/A | Code |
| Cross-Domain Image Captioning With Discriminative Finetuning | Roberto Dessì · Michele Bevilacqua · Eleonora Gualdoni · Nathanaël Carraz Rakotonirina · Francesca Franzon · Marco Baroni | N/A | Code |
| Learning To Detect Mirrors From Videos via Dual Correspondences | Jiaying Lin · Xin Tan · Rynson W.H. Lau | N/A | Code |
| Conflict-Based Cross-View Consistency for Semi-Supervised Semantic Segmentation | Zicheng Wang · Zhen Zhao · Xiaoxia Xing · Dong Xu · Xiangyu Kong · Luping Zhou | N/A | Code |
| Robust Unsupervised StyleGAN Image Restoration | Yohan Poirier-Ginter · Jean-François Lalonde | N/A | Code |
| Masked Video Distillation: Rethinking Masked Feature Modeling for Self-Supervised Video Representation Learning | Rui Wang · Dongdong Chen · Zuxuan Wu · Yinpeng Chen · Xiyang Dai · Mengchen Liu · Lu Yuan · Yu-Gang Jiang | N/A | Code |
| Neural Fields Meet Explicit Geometric Representations for Inverse Rendering of Urban Scenes | Zian Wang · Tianchang Shen · Jun Gao · Shengyu Huang · Jacob Munkberg · Jon Hasselgren · Zan Gojcic · Wenzheng Chen · Sanja Fidler | N/A | Code |
| Augmentation Matters: A Simple-Yet-Effective Approach to Semi-Supervised Semantic Segmentation | Zhen Zhao · Lihe Yang · Sifan Long · Jimin Pi · Luping Zhou · Jingdong Wang | N/A | Code |
| Policy Adaptation From Foundation Model Feedback | Yuying Ge · Annabella Macaluso · Li Erran Li · Ping Luo · Xiaolong Wang | N/A | Code |
| Person Image Synthesis via Denoising Diffusion Model | Ankan Kumar Bhunia · Salman Khan · Hisham Cholakkal · Rao Muhammad Anwer · Jorma Laaksonen · Mubarak Shah · Fahad Shahbaz Khan | N/A | Code |
| Bidirectional Cross-Modal Knowledge Exploration for Video Recognition With Pre-Trained Vision-Language Models | Wenhao Wu · Xiaohan Wang · Haipeng Luo · Jingdong Wang · Yi Yang · Wanli Ouyang | N/A | Code |
| CiaoSR: Continuous Implicit Attention-in-Attention Network for Arbitrary-Scale Image Super-Resolution | Jiezhang Cao · Qin Wang · Yongqin Xian · Yawei Li · Bingbing Ni · Zhiming Pi · Kai Zhang · Yulun Zhang · Radu Timofte · Luc Van Gool | N/A | Code |
| Black-Box Sparse Adversarial Attack via Multi-Objective Optimisation | Phoenix Neale Williams · Ke Li | N/A | Code |
| AdaptiveMix: Improving GAN Training via Feature Space Shrinkage | Haozhe Liu · Wentian Zhang · Bing Li · Haoqian Wu · Nanjun He · Yawen Huang · Yuexiang Li · Bernard Ghanem · Yefeng Zheng | N/A | Code |
| ViTs for SITS: Vision Transformers for Satellite Image Time Series | Michail Tarasiou · Erik Chavez · Stefanos Zafeiriou | N/A | Code |
| Latency Matters: Real-Time Action Forecasting Transformer | Harshayu Girase · Nakul Agarwal · Chiho Choi · Karttikeya Mangalam | N/A | Code |
| Multi-Sensor Large-Scale Dataset for Multi-View 3D Reconstruction | Oleg Voynov · Gleb Bobrovskikh · Pavel Karpyshev · Saveliy Galochkin · Andrei-Timotei Ardelean · Arseniy Bozhenko · Ekaterina Karmanova · Pavel Kopanev · Yaroslav Labutin-Rymsho · Ruslan Rakhimov · Aleksandr Safin · Valerii Serpiva · Alexey Artemov · Evgeny Burnaev · Dzmitry Tsetserukou · Denis Zorin | N/A | Code |
| Learning From Noisy Labels With Decoupled Meta Label Purifier | Yuanpeng Tu · Boshen Zhang · Yuxi Li · Liang Liu · Jian Li · Yabiao Wang · Chengjie Wang · Cai Rong Zhao | N/A | Code |
| Flow Supervision for Deformable NeRF | Chaoyang Wang · Lachlan Ewen MacDonald · László A. Jeni · Simon Lucey | N/A | Code |
| Unifying Vision, Text, and Layout for Universal Document Processing | Zineng Tang · Ziyi Yang · Guoxin Wang · Yuwei Fang · Yang Liu · Chenguang Zhu · Michael Zeng · Cha Zhang · Mohit Bansal | N/A | Code |
| BKinD-3D: Self-Supervised 3D Keypoint Discovery From Multi-View Videos | Jennifer J. Sun · Lili Karashchuk · Amil Dravid · Serim Ryou · Sonia Fereidooni · John C. Tuthill · Aggelos Katsaggelos · Bingni W. Brunton · Georgia Gkioxari · Ann Kennedy · Yisong Yue · Pietro Perona | N/A | Code |
| Architecture, Dataset and Model-Scale Agnostic Data-Free Meta-Learning | Zixuan Hu · Li Shen · Zhenyi Wang · Tongliang Liu · Chun Yuan · Dacheng Tao | N/A | Code |
| RWSC-Fusion: Region-Wise Style-Controlled Fusion Network for the Prohibited X-Ray Security Image Synthesis | Luwen Duan · Min Wu · Lijian Mao · Jun Yin · Jianping Xiong · Xi Li | N/A | Code |
| Meta Architecture for Point Cloud Analysis | Haojia Lin · Xiawu Zheng · Lijiang Li · Fei Chao · Shanshan Wang · Yan Wang · Yonghong Tian · Rongrong Ji | N/A | Code |
| DyLiN: Making Light Field Networks Dynamic | Heng Yu · Joel Julin · Zoltán Á. Milacski · Koichiro Niinuma · László A. Jeni | N/A | Code |
| OpenMix: Exploring Outlier Samples for Misclassification Detection | Fei Zhu · Zhen Cheng · Xu-Yao Zhang · Cheng-Lin Liu | N/A | Code |
| Adaptive Graph Convolutional Subspace Clustering | Lai Wei · Zhengwei Chen · Jun Yin · Changming Zhu · Rigui Zhou · Jin Liu | N/A | Code |
| Extracting Motion and Appearance via Inter-Frame Attention for Efficient Video Frame Interpolation | Guozhen Zhang · Yuhan Zhu · Haonan Wang · Youxin Chen · Gangshan Wu · Limin Wang | N/A | Code |
| Hybrid Active Learning via Deep Clustering for Video Action Detection | Aayush J. Rana · Yogesh S. Rawat | N/A | Code |
| Equiangular Basis Vectors | Yang Shen · Xuhao Sun · Xiu-Shen Wei | N/A | Code |
| CAT: LoCalization and IdentificAtion Cascade Detection Transformer for Open-World Object Detection | Shuailei Ma · Yuefeng Wang · Ying Wei · Jiaqi Fan · Thomas H. Li · Hongli Liu · Fanbing Lv | N/A | Code |
| An Actor-Centric Causality Graph for Asynchronous Temporal Inference in Group Activity | Zhao Xie · Tian Gao · Kewei Wu · Jiao Chang | N/A | Code |
| GCFAgg: Global and Cross-View Feature Aggregation for Multi-View Clustering | Weiqing Yan · Yuanyang Zhang · Chenlei Lv · Chang Tang · Guanghui Yue · Liang Liao · Weisi Lin | N/A | Code |
| Unsupervised Visible-Infrared Person Re-Identification via Progressive Graph Matching and Alternate Learning | Zesen Wu · Mang Ye | N/A | Code |
| Similarity Maps for Self-Training Weakly-Supervised Phrase Grounding | Tal Shaharabany · Lior Wolf | N/A | Code |
| DA Wand: Distortion-Aware Selection Using Neural Mesh Parameterization | Richard Liu · Noam Aigerman · Vladimir G. Kim · Rana Hanocka | N/A | Code |
| BiCro: Noisy Correspondence Rectification for Multi-Modality Data via Bi-Directional Cross-Modal Similarity Consistency | Shuo Yang · Zhaopan Xu · Kai Wang · Yang You · Hongxun Yao · Tongliang Liu · Min Xu | N/A | Code |
| DaFKD: Domain-Aware Federated Knowledge Distillation | Haozhao Wang · Yichen Li · Wenchao Xu · Ruixuan Li · Yufeng Zhan · Zhigang Zeng | N/A | Code |
| Single Image Depth Prediction Made Better: A Multivariate Gaussian Take | Ce Liu · Suryansh Kumar · Shuhang Gu · Radu Timofte · Luc Van Gool | N/A | Code |
| Align Your Latents: High-Resolution Video Synthesis With Latent Diffusion Models | Andreas Blattmann · Robin Rombach · Huan Ling · Tim Dockhorn · Seung Wook Kim · Sanja Fidler · Karsten Kreis | N/A | Code |
| GeoLayoutLM: Geometric Pre-Training for Visual Information Extraction | Chuwei Luo · Changxu Cheng · Qi Zheng · Cong Yao | N/A | Code |
| VideoMAE V2: Scaling Video Masked Autoencoders With Dual Masking | Limin Wang · Bingkun Huang · Zhiyu Zhao · Zhan Tong · Yinan He · Yi Wang · Yali Wang · Yu Qiao | N/A | Code |
| CVT-SLR: Contrastive Visual-Textual Transformation for Sign Language Recognition With Variational Alignment | Jiangbin Zheng · Yile Wang · Cheng Tan · Siyuan Li · Ge Wang · Jun Xia · Yidong Chen · Stan Z. Li | N/A | Code |
| All Are Worth Words: A ViT Backbone for Diffusion Models | Fan Bao · Shen Nie · Kaiwen Xue · Yue Cao · Chongxuan Li · Hang Su · Jun Zhu | N/A | Code |
| PanoSwin: A Pano-Style Swin Transformer for Panorama Understanding | Zhixin Ling · Zhen Xing · Xiangdong Zhou · Manliang Cao · Guichun Zhou | N/A | Code |
| sRGB Real Noise Synthesizing With Neighboring Correlation-Aware Noise Model | Zixuan Fu · Lanqing Guo · Bihan Wen | N/A | Code |
| Extracting Class Activation Maps From Non-Discriminative Features As Well | Zhaozheng Chen · Qianru Sun | N/A | Code |
| GKEAL: Gaussian Kernel Embedded Analytic Learning for Few-Shot Class Incremental Task | Huiping Zhuang · Zhenyu Weng · Run He · Zhiping Lin · Ziqian Zeng | N/A | Code |
| ERM-KTP: Knowledge-Level Machine Unlearning via Knowledge Transfer | Shen Lin · Xiaoyu Zhang · Chenyang Chen · Xiaofeng Chen · Willy Susilo | N/A | Code |
| PDPP: Projected Diffusion for Procedure Planning in Instructional Videos | Hanlin Wang · Yilu Wu · Sheng Guo · Limin Wang | N/A | Code |
| NeuDA: Neural Deformable Anchor for High-Fidelity Implicit Surface Reconstruction | Bowen Cai · Jinchi Huang · Rongfei Jia · Chengfei Lv · Huan Fu | N/A | Code |
| Deep Polarization Reconstruction With PDAVIS Events | Haiyang Mei · Zuowen Wang · Xin Yang · Xiaopeng Wei · Tobi Delbruck | N/A | Code |
| Beyond Attentive Tokens: Incorporating Token Importance and Diversity for Efficient Vision Transformers | Sifan Long · Zhen Zhao · Jimin Pi · Shengsheng Wang · Jingdong Wang | N/A | Code |
| PointClustering: Unsupervised Point Cloud Pre-Training Using Transformation Invariance in Clustering | Fuchen Long · Ting Yao · Zhaofan Qiu · Lusong Li · Tao Mei | N/A | Code |
| PCR: Proxy-Based Contrastive Replay for Online Class-Incremental Continual Learning | Huiwei Lin · Baoquan Zhang · Shanshan Feng · Xutao Li · Yunming Ye | N/A | Code |
| Boundary-Aware Backward-Compatible Representation via Adversarial Learning in Image Retrieval | Tan Pan · Furong Xu · Xudong Yang · Sifeng He · Chen Jiang · Qingpei Guo · Feng Qian · Xiaobo Zhang · Yuan Cheng · Lei Yang · Wei Chu | N/A | Code |
| PermutoSDF: Fast Multi-View Reconstruction With Implicit Surfaces Using Permutohedral Lattices | Radu Alexandru Rosu · Sven Behnke | N/A | Code |
| StyleGene: Crossover and Mutation of Region-Level Facial Genes for Kinship Face Synthesis | Hao Li · Xianxu Hou · Zepeng Huang · Linlin Shen | N/A | Code |
| MixNeRF: Modeling a Ray With Mixture Density for Novel View Synthesis From Sparse Inputs | Seunghyeon Seo · Donghoon Han · Yeonjin Chang · Nojun Kwak | N/A | Code |
| Upcycling Models Under Domain and Category Shift | Sanqing Qu · Tianpei Zou · Florian Röhrbein · Cewu Lu · Guang Chen · Dacheng Tao · Changjun Jiang | N/A | Code |
| Towards Unbiased Volume Rendering of Neural Implicit Surfaces With Geometry Priors | Yongqiang Zhang · Zhipeng Hu · Haoqian Wu · Minda Zhao · Lincheng Li · Zhengxia Zou · Changjie Fan | N/A | Code |
| Avatars Grow Legs: Generating Smooth Human Motion From Sparse Tracking Inputs With Diffusion Model | Yuming Du · Robin Kips · Albert Pumarola · Sebastian Starke · Ali Thabet · Artsiom Sanakoyeu | N/A | Code |
| MoStGAN-V: Video Generation With Temporal Motion Styles | Xiaoqian Shen · Xiang Li · Mohamed Elhoseiny | N/A | Code |
| On the Importance of Accurate Geometry Data for Dense 3D Vision Tasks | HyunJun Jung · Patrick Ruhkamp · Guangyao Zhai · Nikolas Brasch · Yitong Li · Yannick Verdie · Jifei Song · Yiren Zhou · Anil Armagan · Slobodan Ilic · Aleš Leonardis · Nassir Navab · Benjamin Busam | N/A | Code |
| DeGPR: Deep Guided Posterior Regularization for Multi-Class Cell Detection and Counting | Aayush Kumar Tyagi · Chirag Mohapatra · Prasenjit Das · Govind Makharia · Lalita Mehra · Prathosh AP · Mausam | N/A | Code |
| Learning Action Changes by Measuring Verb-Adverb Textual Relationships | Davide Moltisanti · Frank Keller · Hakan Bilen · Laura Sevilla-Lara | N/A | Code |
| Interactive and Explainable Region-Guided Radiology Report Generation | Tim Tanida · Philip Müller · Georgios Kaissis · Daniel Rueckert | N/A | Code |
| Learning Neural Volumetric Representations of Dynamic Humans in Minutes | Chen Geng · Sida Peng · Zhen Xu · Hujun Bao · Xiaowei Zhou | N/A | Code |
| Boosting Low-Data Instance Segmentation by Unsupervised Pre-Training With Saliency Prompt | Hao Li · Dingwen Zhang · Nian Liu · Lechao Cheng · Yalun Dai · Chao Zhang · Xinggang Wang · Junwei Han | N/A | Code |
| Learning Rotation-Equivariant Features for Visual Correspondence | Jongmin Lee · Byungjin Kim · Seungwook Kim · Minsu Cho | N/A | Code |
| Co-Training 2^L Submodels for Visual Recognition | Hugo Touvron · Matthieu Cord · Maxime Oquab · Piotr Bojanowski · Jakob Verbeek · Hervé Jégou | N/A | Code |
| HOTNAS: Hierarchical Optimal Transport for Neural Architecture Search | Jiechao Yang · Yong Liu · Hongteng Xu | N/A | Code |
| LANA: A Language-Capable Navigator for Instruction Following and Generation | Xiaohan Wang · Wenguan Wang · Jiayi Shao · Yi Yang | N/A | Code |
| Visual Localization Using Imperfect 3D Models From the Internet | Vojtech Panek · Zuzana Kukelova · Torsten Sattler | N/A | Code |
| Diversity-Measurable Anomaly Detection | Wenrui Liu · Hong Chang · Bingpeng Ma · Shiguang Shan · Xilin Chen | N/A | Code |
| SLACK: Stable Learning of Augmentations With Cold-Start and KL Regularization | Juliette Marrie · Michael Arbel · Diane Larlus · Julien Mairal | N/A | Code |
| Recurrent Vision Transformers for Object Detection With Event Cameras | Mathias Gehrig · Davide Scaramuzza | N/A | Code |
| Efficient Verification of Neural Networks Against LVM-Based Specifications | Harleen Hanspal · Alessio Lomuscio | N/A | Code |
| Neuralizer: General Neuroimage Analysis Without Re-Training | Steffen Czolbe · Adrian V. Dalca | N/A | Code |
| MobileVOS: Real-Time Video Object Segmentation Contrastive Learning Meets Knowledge Distillation | Roy Miles · Mehmet Kerim Yucel · Bruno Manganelli · Albert Saà-Garriga | N/A | Code |
| SCOTCH and SODA: A Transformer Video Shadow Detection Framework | Lihao Liu · Jean Prost · Lei Zhu · Nicolas Papadakis · Pietro Liò · Carola-Bibiane Schönlieb · Angelica I. Aviles-Rivero | N/A | Code |
| A Unified Spatial-Angular Structured Light for Single-View Acquisition of Shape and Reflectance | Xianmin Xu · Yuxin Lin · Haoyang Zhou · Chong Zeng · Yaxin Yu · Kun Zhou · Hongzhi Wu | N/A | Code |
| Bias in Pruned Vision Models: In-Depth Analysis and Countermeasures | Eugenia Iofinova · Alexandra Peste · Dan Alistarh | N/A | Code |
| InstructPix2Pix: Learning To Follow Image Editing Instructions | Tim Brooks · Aleksander Holynski · Alexei A. Efros | N/A | Code |
| AnchorFormer: Point Cloud Completion From Discriminative Nodes | Zhikai Chen · Fuchen Long · Zhaofan Qiu · Ting Yao · Wengang Zhou · Jiebo Luo · Tao Mei | N/A | Code |
| Robust Test-Time Adaptation in Dynamic Scenarios | Longhui Yuan · Binhui Xie · Shuang Li | N/A | Code |
| AShapeFormer: Semantics-Guided Object-Level Active Shape Encoding for 3D Object Detection via Transformers | Zechuan Li · Hongshan Yu · Zhengeng Yang · Tongjia Chen · Naveed Akhtar | N/A | Code |
| Neural Texture Synthesis With Guided Correspondence | Yang Zhou · Kaijian Chen · Rongjun Xiao · Hui Huang | N/A | Code |
| Learning To Render Novel Views From Wide-Baseline Stereo Pairs | Yilun Du · Cameron Smith · Ayush Tewari · Vincent Sitzmann | N/A | Code |
| Hidden Gems: 4D Radar Scene Flow Learning Using Cross-Modal Supervision | Fangqiang Ding · Andras Palffy · Dariu M. Gavrila · Chris Xiaoxuan Lu | N/A | Code |
| SmallCap: Lightweight Image Captioning Prompted With Retrieval Augmentation | Rita Ramos · Bruno Martins · Desmond Elliott · Yova Kementchedjhieva | N/A | Code |
| PIDNet: A Real-Time Semantic Segmentation Network Inspired by PID Controllers | Jiacong Xu · Zixiang Xiong · Shankar P. Bhattacharyya | N/A | Code |
| NeRFLight: Fast and Light Neural Radiance Fields Using a Shared Feature Grid | Fernando Rivas-Manzaneque · Jorge Sierra-Acosta · Adrian Penate-Sanchez · Francesc Moreno-Noguer · Angela Ribeiro | N/A | Code |
| Fantastic Breaks: A Dataset of Paired 3D Scans of Real-World Broken Objects and Their Complete Counterparts | Nikolas Lamb · Cameron Palmer · Benjamin Molloy · Sean Banerjee · Natasha Kholgade Banerjee | N/A | Code |
| PEAL: Prior-Embedded Explicit Attention Learning for Low-Overlap Point Cloud Registration | Junle Yu · Luwei Ren · Yu Zhang · Wenhui Zhou · Lili Lin · Guojun Dai | N/A | Code |
| Neural Volumetric Memory for Visual Locomotion Control | Ruihan Yang · Ge Yang · Xiaolong Wang | N/A | Code |
| InstantAvatar: Learning Avatars From Monocular Video in 60 Seconds | Tianjian Jiang · Xu Chen · Jie Song · Otmar Hilliges | N/A | Code |
| TMO: Textured Mesh Acquisition of Objects With a Mobile Device by Using Differentiable Rendering | Jaehoon Choi · Dongki Jung · Taejae Lee · Sangwook Kim · Youngdong Jung · Dinesh Manocha · Donghwan Lee | N/A | Code |
| MammalNet: A Large-Scale Video Benchmark for Mammal Recognition and Behavior Understanding | Jun Chen · Ming Hu · Darren J. Coker · Michael L. Berumen · Blair Costelloe · Sara Beery · Anna Rohrbach · Mohamed Elhoseiny | N/A | Code |
| Towards Fast Adaptation of Pretrained Contrastive Models for Multi-Channel Video-Language Retrieval | Xudong Lin · Simran Tiwari · Shiyuan Huang · Manling Li · Mike Zheng Shou · Heng Ji · Shih-Fu Chang | N/A | Code |
| Hierarchical Fine-Grained Image Forgery Detection and Localization | Xiao Guo · Xiaohong Liu · Zhiyuan Ren · Steven Grosz · Iacopo Masi · Xiaoming Liu | N/A | Code |
| SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision | Xubo Liu · Egor Lakomkin · Konstantinos Vougioukas · Pingchuan Ma · Honglie Chen · Ruiming Xie · Morrie Doulaty · Niko Moritz · Jachym Kolar · Stavros Petridis · Maja Pantic · Christian Fuegen | N/A | Code |
| RIATIG: Reliable and Imperceptible Adversarial Text-to-Image Generation With Natural Prompts | Han Liu · Yuhao Wu · Shixuan Zhai · Bo Yuan · Ning Zhang | N/A | Code |
| Unsupervised Intrinsic Image Decomposition With LiDAR Intensity | Shogo Sato · Yasuhiro Yao · Taiga Yoshida · Takuhiro Kaneko · Shingo Ando · Jun Shimamura | N/A | Code |
| SlowLiDAR: Increasing the Latency of LiDAR-Based Detection Using Adversarial Examples | Han Liu · Yuhao Wu · Zhiyuan Yu · Yevgeniy Vorobeychik · Ning Zhang | N/A | Code |
| NeFII: Inverse Rendering for Reflectance Decomposition With Near-Field Indirect Illumination | Haoqian Wu · Zhipeng Hu · Lincheng Li · Yongqiang Zhang · Changjie Fan · Xin Yu | N/A | Code |
| BEV-Guided Multi-Modality Fusion for Driving Perception | Yunze Man · Liang-Yan Gui · Yu-Xiong Wang | N/A | Code |
| MAGVLT: Masked Generative Vision-and-Language Transformer | Sungwoong Kim · Daejin Jo · Donghoon Lee · Jongmin Kim | N/A | Code |
| PEFAT: Boosting Semi-Supervised Medical Image Classification via Pseudo-Loss Estimation and Feature Adversarial Training | Qingjie Zeng · Yutong Xie · Zilin Lu · Yong Xia | N/A | Code |
| Visual Query Tuning: Towards Effective Usage of Intermediate Representations for Parameter and Memory Efficient Transfer Learning | Cheng-Hao Tu · Zheda Mai · Wei-Lun Chao | N/A | Code |
| Decoupling Learning and Remembering: A Bilevel Memory Framework With Knowledge Projection for Task-Incremental Learning | Wenju Sun · Qingyong Li · Jing Zhang · Wen Wang · Yangli-ao Geng | N/A | Code |
| PMR: Prototypical Modal Rebalance for Multimodal Learning | Yunfeng Fan · Wenchao Xu · Haozhao Wang · Junxiao Wang · Song Guo | N/A | Code |
| DART: Diversify-Aggregate-Repeat Training Improves Generalization of Neural Networks | Samyak Jain · Sravanti Addepalli · Pawan Kumar Sahu · Priyam Dey · R. Venkatesh Babu | N/A | Code |
| Abstract Visual Reasoning: An Algebraic Approach for Solving Raven’s Progressive Matrices | Jingyi Xu · Tushar Vaidya · Yufei Wu · Saket Chandra · Zhangsheng Lai · Kai Fong Ernest Chong | N/A | Code |
| Swept-Angle Synthetic Wavelength Interferometry | Alankar Kotwal · Anat Levin · Ioannis Gkioulekas | N/A | Code |
| Passive Micron-Scale Time-of-Flight With Sunlight Interferometry | Alankar Kotwal · Anat Levin · Ioannis Gkioulekas | N/A | Code |
| Meta-Learning With a Geometry-Adaptive Preconditioner | Suhyun Kang · Duhun Hwang · Moonjung Eo · Taesup Kim · Wonjong Rhee | N/A | Code |
| 3D GAN Inversion With Facial Symmetry Prior | Fei Yin · Yong Zhang · Xuan Wang · Tengfei Wang · Xiaoyu Li · Yuan Gong · Yanbo Fan · Xiaodong Cun · Ying Shan · Cengiz Oztireli · Yujiu Yang | N/A | Code |
| ZBS: Zero-Shot Background Subtraction via Instance-Level Background Modeling and Foreground Selection | Yongqi An · Xu Zhao · Tao Yu · Haiyun Guo · Chaoyang Zhao · Ming Tang · Jinqiao Wang | N/A | Code |
| Neural Lens Modeling | Wenqi Xian · Aljaž Božič · Noah Snavely · Christoph Lassner | N/A | Code |
| A Probabilistic Framework for Lifelong Test-Time Adaptation | Dhanajit Brahma · Piyush Rai | N/A | Code |
| Few-Shot Class-Incremental Learning via Class-Aware Bilateral Distillation | Linglan Zhao · Jing Lu · Yunlu Xu · Zhanzhan Cheng · Dashan Guo · Yi Niu · Xiangzhong Fang | N/A | Code |
| GradMA: A Gradient-Memory-Based Accelerated Federated Learning With Alleviated Catastrophic Forgetting | Kangyang Luo · Xiang Li · Yunshi Lan · Ming Gao | N/A | Code |
| Hyperspherical Embedding for Point Cloud Completion | Junming Zhang · Haomeng Zhang · Ram Vasudevan · Matthew Johnson-Roberson | N/A | Code |
| Local-Guided Global: Paired Similarity Representation for Visual Reinforcement Learning | Hyesong Choi · Hunsang Lee · Wonil Song · Sangryul Jeon · Kwanghoon Sohn · Dongbo Min | N/A | Code |
| Learning Orthogonal Prototypes for Generalized Few-Shot Semantic Segmentation | Sun-Ao Liu · Yiheng Zhang · Zhaofan Qiu · Hongtao Xie · Yongdong Zhang · Ting Yao | N/A | Code |
| DSFNet: Dual Space Fusion Network for Occlusion-Robust 3D Dense Face Alignment | Heyuan Li · Bo Wang · Yu Cheng · Mohan Kankanhalli · Robby T. Tan | N/A | Code |
| SViTT: Temporal Learning of Sparse Video-Text Transformers | Yi Li · Kyle Min · Subarna Tripathi · Nuno Vasconcelos | N/A | Code |
| Independent Component Alignment for Multi-Task Learning | Dmitry Senushkin · Nikolay Patakin · Arseny Kuznetsov · Anton Konushin | N/A | Code |
| Logical Implications for Visual Question Answering Consistency | Sergio Tascon-Morales · Pablo Márquez-Neila · Raphael Sznitman | N/A | Code |
| MaskCon: Masked Contrastive Learning for Coarse-Labelled Dataset | Chen Feng · Ioannis Patras | N/A | Code |
| Image as a Foreign Language: BEiT Pretraining for Vision and Vision-Language Tasks | Wenhui Wang · Hangbo Bao · Li Dong · Johan Bjorck · Zhiliang Peng · Qiang Liu · Kriti Aggarwal · Owais Khan Mohammed · Saksham Singhal · Subhojit Som · Furu Wei | N/A | Code |
| Manipulating Transfer Learning for Property Inference | Yulong Tian · Fnu Suya · Anshuman Suri · Fengyuan Xu · David Evans | N/A | Code |
| DualRefine: Self-Supervised Depth and Pose Estimation Through Iterative Epipolar Sampling and Refinement Toward Equilibrium | Antyanta Bangunharcana · Ahmed Magd · Kyung-Soo Kim | N/A | Code |
| Learning a 3D Morphable Face Reflectance Model From Low-Cost Data | Yuxuan Han · Zhibo Wang · Feng Xu | N/A | Code |
| Principles of Forgetting in Domain-Incremental Semantic Segmentation in Adverse Weather Conditions | Tobias Kalb · Jürgen Beyerer | N/A | Code |
| Diffusion Art or Digital Forgery? Investigating Data Replication in Diffusion Models | Gowthami Somepalli · Vasu Singla · Micah Goldblum · Jonas Geiping · Tom Goldstein | N/A | Code |
| Adaptive Data-Free Quantization | Biao Qian · Yang Wang · Richang Hong · Meng Wang | N/A | Code |
| Coreset Sampling From Open-Set for Fine-Grained Self-Supervised Learning | Sungnyun Kim · Sangmin Bae · Se-Young Yun | N/A | Code |
| Jedi: Entropy-Based Localization and Removal of Adversarial Patches | Bilel Tarchoun · Anouar Ben Khalifa · Mohamed Ali Mahjoub · Nael Abu-Ghazaleh · Ihsen Alouani | N/A | Code |
| Uncovering the Disentanglement Capability in Text-to-Image Diffusion Models | Qiucheng Wu · Yujian Liu · Handong Zhao · Ajinkya Kale · Trung Bui · Tong Yu · Zhe Lin · Yang Zhang · Shiyu Chang | N/A | Code |
| Semantic-Conditional Diffusion Networks for Image Captioning | Jianjie Luo · Yehao Li · Yingwei Pan · Ting Yao · Jianlin Feng · Hongyang Chao · Tao Mei | N/A | Code |
| Instance-Specific and Model-Adaptive Supervision for Semi-Supervised Semantic Segmentation | Zhen Zhao · Sifan Long · Jimin Pi · Jingdong Wang · Luping Zhou | N/A | Code |
| Improving Robustness of Semantic Segmentation to Motion-Blur Using Class-Centric Augmentation | Aakanksha Aakanksha · A. N. Rajagopalan | N/A | Code |
| MetaViewer: Towards a Unified Multi-View Representation | Ren Wang · Haoliang Sun · Yuling Ma · Xiaoming Xi · Yilong Yin | N/A | Code |
| Attribute-Preserving Face Dataset Anonymization via Latent Code Optimization | Simone Barattin · Christos Tzelepis · Ioannis Patras · Nicu Sebe | N/A | Code |
| A Light Weight Model for Active Speaker Detection | Junhua Liao · Haihan Duan · Kanghui Feng · Wanbing Zhao · Yanbing Yang · Liangyin Chen | N/A | Code |
| Shifted Diffusion for Text-to-Image Generation | Yufan Zhou · Bingchen Liu · Yizhe Zhu · Xiao Yang · Changyou Chen · Jinhui Xu | N/A | Code |
| Modular Memorability: Tiered Representations for Video Memorability Prediction | Théo Dumont · Juan Segundo Hevia · Camilo L. Fosco | N/A | Code |
| Learning Articulated Shape With Keypoint Pseudo-Labels From Web Images | Anastasis Stathopoulos · Georgios Pavlakos · Ligong Han · Dimitris N. Metaxas | N/A | Code |
| RMLVQA: A Margin Loss Approach for Visual Question Answering With Language Biases | Abhipsa Basu · Sravanti Addepalli · R. Venkatesh Babu | N/A | Code |
| RealImpact: A Dataset of Impact Sound Fields for Real Objects | Samuel Clarke · Ruohan Gao · Mason Wang · Mark Rau · Julia Xu · Jui-Hsien Wang · Doug L. James · Jiajun Wu | N/A | Code |
| Neural Rate Estimator and Unsupervised Learning for Efficient Distributed Image Analytics in Split-DNN Models | Nilesh Ahuja · Parual Datta · Bhavya Kanzariya · V. Srinivasa Somayazulu · Omesh Tickoo | N/A | Code |
| Improving Vision-and-Language Navigation by Generating Future-View Image Semantics | Jialu Li · Mohit Bansal | N/A | Code |
| Simulated Annealing in Early Layers Leads to Better Generalization | Amir M. Sarfi · Zahra Karimpour · Muawiz Chaudhary · Nasir M. Khalid · Mirco Ravanelli · Sudhir Mudur · Eugene Belilovsky | N/A | Code |
| From Images to Textual Prompts: Zero-Shot Visual Question Answering With Frozen Large Language Models | Jiaxian Guo · Junnan Li · Dongxu Li · Anthony Meng Huat Tiong · Boyang Li · Dacheng Tao · Steven Hoi | N/A | Code |
| Where We Are and What We’re Looking At: Query Based Worldwide Image Geo-Localization Using Hierarchies and Scenes | Brandon Clark · Alec Kerrigan · Parth Parag Kulkarni · Vicente Vivanco Cepeda · Mubarak Shah | N/A | Code |
| CLIP-Sculptor: Zero-Shot Generation of High-Fidelity and Diverse Shapes From Natural Language | Aditya Sanghi · Rao Fu · Vivian Liu · Karl D.D. Willis · Hooman Shayani · Amir H. Khasahmadi · Srinath Sridhar · Daniel Ritchie | N/A | Code |
| Learning To Generate Text-Grounded Mask for Open-World Semantic Segmentation From Only Image-Text Pairs | Junbum Cha · Jonghwan Mun · Byungseok Roh | N/A | Code |
| Imitation Learning As State Matching via Differentiable Physics | Siwei Chen · Xiao Ma · Zhongwen Xu | N/A | Code |
| BBDM: Image-to-Image Translation With Brownian Bridge Diffusion Models | Bo Li · Kaitao Xue · Bin Liu · Yu-Kun Lai | N/A | Code |
| CHMATCH: Contrastive Hierarchical Matching and Robust Adaptive Threshold Boosted Semi-Supervised Learning | Jianlong Wu · Haozhe Yang · Tian Gan · Ning Ding · Feijun Jiang · Liqiang Nie | N/A | Code |
| Re-GAN: Data-Efficient GANs Training via Architectural Reconfiguration | Divya Saxena · Jiannong Cao · Jiahao Xu · Tarun Kulshrestha | N/A | Code |
| Learning Debiased Representations via Conditional Attribute Interpolation | Yi-Kai Zhang · Qi-Wei Wang · De-Chuan Zhan · Han-Jia Ye | N/A | Code |
| Weakly Supervised Posture Mining for Fine-Grained Classification | Zhenchao Tang · Hualin Yang · Calvin Yu-Chian Chen | N/A | Code |
| Learning a Practical SDR-to-HDRTV Up-Conversion Using New Dataset and Degradation Models | Cheng Guo · Leidong Fan · Ziyu Xue · Xiuhua Jiang | N/A | Code |
| VectorFusion: Text-to-SVG by Abstracting Pixel-Based Diffusion Models | Ajay Jain · Amber Xie · Pieter Abbeel | N/A | Code |
| Adversarial Robustness via Random Projection Filters | Minjing Dong · Chang Xu | N/A | Code |
CVPR 2022
| Title | Author | PDF_Link | Code_URL |
|---|---|---|---|
| SDFusion: Multimodal 3D Shape Completion, Reconstruction, and Generation | Yen-Chi Cheng · Hsin-Ying Lee · Sergey Tulyakov · Alexander G. Schwing · Liang-Yan Gui | N/A | Code |
| Revisiting Temporal Modeling for CLIP-Based Image-to-Video Knowledge Transferring | Ruyang Liu · Jingjia Huang · Ge Li · Jiashi Feng · Xinglong Wu · Thomas H. Li | N/A | Code |
| Post-Processing Temporal Action Detection | Sauradip Nag · Xiatian Zhu · Yi-Zhe Song · Tao Xiang | N/A | Code |
| Learning Analytical Posterior Probability for Human Mesh Recovery | Qi Fang · Kang Chen · Yinghui Fan · Qing Shuai · Jiefeng Li · Weidong Zhang | N/A | Code |
| Accidental Light Probes | Hong-Xing Yu · Samir Agarwala · Charles Herrmann · Richard Szeliski · Noah Snavely · Jiajun Wu · Deqing Sun | N/A | Code |
| Multi-Object Manipulation via Object-Centric Neural Scattering Functions | Stephen Tian · Yancheng Cai · Hong-Xing Yu · Sergey Zakharov · Katherine Liu · Adrien Gaidon · Yunzhu Li · Jiajun Wu | N/A | Code |
| CFA: Class-Wise Calibrated Fair Adversarial Training | Zeming Wei · Yifei Wang · Yiwen Guo · Yisen Wang | N/A | Code |
| AutoAD: Movie Description in Context | Tengda Han · Max Bain · Arsha Nagrani · Gül Varol · Weidi Xie · Andrew Zisserman | N/A | Code |
| Relational Context Learning for Human-Object Interaction Detection | Sanghyun Kim · Deunsol Jung · Minsu Cho | N/A | Code |
| Alias-Free Convnets: Fractional Shift Invariance via Polynomial Activations | Hagay Michaeli · Tomer Michaeli · Daniel Soudry | N/A | Code |
| Learning Distortion Invariant Representation for Image Restoration From a Causality Perspective | Xin Li · Bingchen Li · Xin Jin · Cuiling Lan · Zhibo Chen | N/A | Code |
| Iterative Vision-and-Language Navigation | Jacob Krantz · Shurjo Banerjee · Wang Zhu · Jason Corso · Peter Anderson · Stefan Lee · Jesse Thomason | N/A | Code |
| FlatFormer: Flattened Window Attention for Efficient Point Cloud Transformer | Zhijian Liu · Xinyu Yang · Haotian Tang · Shang Yang · Song Han | N/A | Code |
| BUFFER: Balancing Accuracy, Efficiency, and Generalizability in Point Cloud Registration | Sheng Ao · Qingyong Hu · Hanyun Wang · Kai Xu · Yulan Guo | N/A | Code |
| Learning Event Guided High Dynamic Range Video Reconstruction | Yixin Yang · Jin Han · Jinxiu Liang · Imari Sato · Boxin Shi | N/A | Code |
| 3D Line Mapping Revisited | Shaohui Liu · Yifan Yu · Rémi Pautrat · Marc Pollefeys · Viktor Larsson | N/A | Code |
| High-Fidelity Event-Radiance Recovery via Transient Event Frequency | Jin Han · Yuta Asano · Boxin Shi · Yinqiang Zheng · Imari Sato | N/A | Code |
| OCELOT: Overlapped Cell on Tissue Dataset for Histopathology | Jeongun Ryu · Aaron Valero Puche · JaeWoong Shin · Seonwook Park · Biagio Brattoli · Jinhee Lee · Wonkyung Jung · Soo Ick Cho · Kyunghyun Paeng · Chan-Young Ock · Donggeun Yoo · Sérgio Pereira | N/A | Code |
| Blur Interpolation Transformer for Real-World Motion From Blur | Zhihang Zhong · Mingdeng Cao · Xiang Ji · Yinqiang Zheng · Imari Sato | N/A | Code |
| Continuous Intermediate Token Learning With Implicit Motion Manifold for Keyframe Based Motion Interpolation | Clinton A. Mo · Kun Hu · Chengjiang Long · Zhiyong Wang | N/A | Code |
| Instant-NVR: Instant Neural Volumetric Rendering for Human-Object Interactions From Monocular RGBD Stream | Yuheng Jiang · Kaixin Yao · Zhuo Su · Zhehao Shen · Haimin Luo · Lan Xu | N/A | Code |
| HexPlane: A Fast Representation for Dynamic Scenes | Ang Cao · Justin Johnson | N/A | Code |
| Finetune Like You Pretrain: Improved Finetuning of Zero-Shot Vision Models | Sachin Goyal · Ananya Kumar · Sankalp Garg · Zico Kolter · Aditi Raghunathan | N/A | Code |
| A Whac-a-Mole Dilemma: Shortcuts Come in Multiples Where Mitigating One Amplifies Others | Zhiheng Li · Ivan Evtimov · Albert Gordo · Caner Hazirbas · Tal Hassner · Cristian Canton Ferrer · Chenliang Xu · Mark Ibrahim | N/A | Code |
| GIVL: Improving Geographical Inclusivity of Vision-Language Models With Pre-Training Methods | Da Yin · Feng Gao · Govind Thattai · Michael Johnston · Kai-Wei Chang | N/A | Code |
| Devil’s on the Edges: Selective Quad Attention for Scene Graph Generation | Deunsol Jung · Sanghyun Kim · Won Hwa Kim · Minsu Cho | N/A | Code |
| GeoMVSNet: Learning Multi-View Stereo With Geometry Perception | Zhe Zhang · Rui Peng · Yuxi Hu · Ronggang Wang | N/A | Code |
| CR-FIQA: Face Image Quality Assessment by Learning Sample Relative Classifiability | Fadi Boutros · Meiling Fang · Marcel Klemt · Biying Fu · Naser Damer | N/A | Code |
| NeuFace: Realistic 3D Neural Face Rendering From Multi-View Images | Mingwu Zheng · Haiyu Zhang · Hongyu Yang · Di Huang | N/A | Code |
| MethaneMapper: Spectral Absorption Aware Hyperspectral Transformer for Methane Detection | Satish Kumar · Ivan Arevalo · ASM Iftekhar · B S Manjunath | N/A | Code |
| Re-Thinking Model Inversion Attacks Against Deep Neural Networks | Ngoc-Bao Nguyen · Keshigeyan Chandrasegaran · Milad Abdollahzadeh · Ngai-Man Cheung | N/A | Code |
| SAP-DETR: Bridging the Gap Between Salient Points and Queries-Based Transformer Detector for Fast Model Convergency | Yang Liu · Yao Zhang · Yixin Wang · Yang Zhang · Jiang Tian · Zhongchao Shi · Jianping Fan · Zhiqiang He | N/A | Code |
| VectorFloorSeg: Two-Stream Graph Attention Network for Vectorized Roughcast Floorplan Segmentation | Bingchen Yang · Haiyong Jiang · Hao Pan · Jun Xiao | N/A | Code |
| MARLIN: Masked Autoencoder for Facial Video Representation LearnINg | Zhixi Cai · Shreya Ghosh · Kalin Stefanov · Abhinav Dhall · Jianfei Cai · Hamid Rezatofighi · Reza Haffari · Munawar Hayat | N/A | Code |
| KD-DLGAN: Data Limited Image Generation via Knowledge Distillation | Kaiwen Cui · Yingchen Yu · Fangneng Zhan · Shengcai Liao · Shijian Lu · Eric P. Xing | N/A | Code |
| Hierarchical Neural Memory Network for Low Latency Event Processing | Ryuhei Hamaguchi · Yasutaka Furukawa · Masaki Onishi · Ken Sakurada | N/A | Code |
| Optimal Transport Minimization: Crowd Localization on Density Maps for Semi-Supervised Counting | Wei Lin · Antoni B. Chan | N/A | Code |
| Towards All-in-One Pre-Training via Maximizing Multi-Modal Mutual Information | Weijie Su · Xizhou Zhu · Chenxin Tao · Lewei Lu · Bin Li · Gao Huang · Yu Qiao · Xiaogang Wang · Jie Zhou · Jifeng Dai | N/A | Code |
| Revisiting Reverse Distillation for Anomaly Detection | Tran Dinh Tien · Anh Tuan Nguyen · Nguyen Hoang Tran · Ta Duc Huy · Soan T.M. Duong · Chanh D. Tr. Nguyen · Steven Q. H. Truong | N/A | Code |
| Conditional Generation of Audio From Video via Foley Analogies | Yuexi Du · Ziyang Chen · Justin Salamon · Bryan Russell · Andrew Owens | N/A | Code |
| Parameter Efficient Local Implicit Image Function Network for Face Segmentation | Mausoom Sarkar · Nikitha SR · Mayur Hemani · Rishabh Jain · Balaji Krishnamurthy | N/A | Code |
| Learning Decorrelated Representations Efficiently Using Fast Fourier Transform | Yutaro Shigeto · Masashi Shimbo · Yuya Yoshikawa · Akikazu Takeuchi | N/A | Code |
| FaceLit: Neural 3D Relightable Faces | Anurag Ranjan · Kwang Moo Yi · Jen-Hao Rick Chang · Oncel Tuzel | N/A | Code |
| Pointersect: Neural Rendering With Cloud-Ray Intersection | Jen-Hao Rick Chang · Wei-Yu Chen · Anurag Ranjan · Kwang Moo Yi · Oncel Tuzel | N/A | Code |
| High-Fidelity Clothed Avatar Reconstruction From a Single Image | Tingting Liao · Xiaomei Zhang · Yuliang Xiu · Hongwei Yi · Xudong Liu · Guo-Jun Qi · Yong Zhang · Xuan Wang · Xiangyu Zhu · Zhen Lei | N/A | Code |
| BAD-NeRF: Bundle Adjusted Deblur Neural Radiance Fields | Peng Wang · Lingzhe Zhao · Ruijie Ma · Peidong Liu | N/A | Code |
| Meta-Tuning Loss Functions and Data Augmentation for Few-Shot Object Detection | Berkan Demirel · Orhun Buğra Baran · Ramazan Gokberk Cinbis | N/A | Code |
| StyleRF: Zero-Shot 3D Style Transfer of Neural Radiance Fields | Kunhao Liu · Fangneng Zhan · Yiwen Chen · Jiahui Zhang · Yingchen Yu · Abdulmotaleb El Saddik · Shijian Lu · Eric P. Xing | N/A | Code |
| DeepSolo: Let Transformer Decoder With Explicit Points Solo for Text Spotting | Maoyuan Ye · Jing Zhang · Shanshan Zhao · Juhua Liu · Tongliang Liu · Bo Du · Dacheng Tao | N/A | Code |
| Local Implicit Normalizing Flow for Arbitrary-Scale Image Super-Resolution | Jie-En Yao · Li-Yuan Tsao · Yi-Chen Lo · Roy Tseng · Chia-Che Chang · Chun-Yi Lee | N/A | Code |
| LAVENDER: Unifying Video-Language Understanding As Masked Language Modeling | Linjie Li · Zhe Gan · Kevin Lin · Chung-Ching Lin · Zicheng Liu · Ce Liu · Lijuan Wang | N/A | Code |
| Cascaded Local Implicit Transformer for Arbitrary-Scale Super-Resolution | Hao-Wei Chen · Yu-Syuan Xu · Min-Fong Hong · Yi-Min Tsai · Hsien-Kai Kuo · Chun-Yi Lee | N/A | Code |
| Fair Federated Medical Image Segmentation via Client Contribution Estimation | Meirui Jiang · Holger R. Roth · Wenqi Li · Dong Yang · Can Zhao · Vishwesh Nath · Daguang Xu · Qi Dou · Ziyue Xu | N/A | Code |
| An Empirical Study of End-to-End Video-Language Transformers With Masked Visual Modeling | Tsu-Jui Fu · Linjie Li · Zhe Gan · Kevin Lin · William Yang Wang · Lijuan Wang · Zicheng Liu | N/A | Code |
| ReCo: Region-Controlled Text-to-Image Generation | Zhengyuan Yang · Jianfeng Wang · Zhe Gan · Linjie Li · Kevin Lin · Chenfei Wu · Nan Duan · Zicheng Liu · Ce Liu · Michael Zeng · Lijuan Wang | N/A | Code |
| Uncertainty-Aware Vision-Based Metric Cross-View Geolocalization | Florian Fervers · Sebastian Bullinger · Christoph Bodensteiner · Michael Arens · Rainer Stiefelhagen | N/A | Code |
| LayoutDiffusion: Controllable Diffusion Model for Layout-to-Image Generation | Guangcong Zheng · Xianpan Zhou · Xuewei Li · Zhongang Qi · Ying Shan · Xi Li | N/A | Code |
| Efficient Loss Function by Minimizing the Detrimental Effect of Floating-Point Errors on Gradient-Based Attacks | Yunrui Yu · Cheng-Zhong Xu | N/A | Code |
| NIKI: Neural Inverse Kinematics With Invertible Neural Networks for 3D Human Pose and Shape Estimation | Jiefeng Li · Siyuan Bian · Qi Liu · Jiasheng Tang · Fan Wang · Cewu Lu | N/A | Code |
| 3D Spatial Multimodal Knowledge Accumulation for Scene Graph Prediction in Point Cloud | Mingtao Feng · Haoran Hou · Liang Zhang · Zijie Wu · Yulan Guo · Ajmal Mian | N/A | Code |
| Egocentric Auditory Attention Localization in Conversations | Fiona Ryan · Hao Jiang · Abhinav Shukla · James M. Rehg · Vamsi Krishna Ithapu | N/A | Code |
| EFEM: Equivariant Neural Field Expectation Maximization for 3D Object Segmentation Without Scene Supervision | Jiahui Lei · Congyue Deng · Karl Schmeckpeper · Leonidas Guibas · Kostas Daniilidis | N/A | Code |
| Divide and Conquer: Answering Questions With Object Factorization and Compositional Reasoning | Shi Chen · Qi Zhao | N/A | Code |
| Text-Visual Prompting for Efficient 2D Temporal Video Grounding | Yimeng Zhang · Xin Chen · Jinghan Jia · Sijia Liu · Ke Ding | N/A | Code |
| Fusing Pre-Trained Language Models With Multimodal Prompts Through Reinforcement Learning | Youngjae Yu · Jiwan Chung · Heeseung Yun · Jack Hessel · Jae Sung Park · Ximing Lu · Rowan Zellers · Prithviraj Ammanabrolu · Ronan Le Bras · Gunhee Kim · Yejin Choi | N/A | Code |
| UniHCP: A Unified Model for Human-Centric Perceptions | Yuanzheng Ci · Yizhou Wang · Meilin Chen · Shixiang Tang · Lei Bai · Feng Zhu · Rui Zhao · Fengwei Yu · Donglian Qi · Wanli Ouyang | N/A | Code |
| VoP: Text-Video Co-Operative Prompt Tuning for Cross-Modal Retrieval | Siteng Huang · Biao Gong · Yulin Pan · Jianwen Jiang · Yiliang Lv · Yuyuan Li · Donglin Wang | N/A | Code |
| PointConvFormer: Revenge of the Point-Based Convolution | Wenxuan Wu · Li Fuxin · Qi Shan | N/A | Code |
| BAAM: Monocular 3D Pose and Shape Reconstruction With Bi-Contextual Attention Module and Attention-Guided Modeling | Hyo-Jun Lee · Hanul Kim · Su-Min Choi · Seong-Gyun Jeong · Yeong Jun Koh | N/A | Code |
| HumanBench: Towards General Human-Centric Perception With Projector Assisted Pretraining | Shixiang Tang · Cheng Chen · Qingsong Xie · Meilin Chen · Yizhou Wang · Yuanzheng Ci · Lei Bai · Feng Zhu · Haiyang Yang · Li Yi · Rui Zhao · Wanli Ouyang | N/A | Code |
| Local Connectivity-Based Density Estimation for Face Clustering | Junho Shin · Hyo-Jun Lee · Hyunseop Kim · Jong-Hyeon Baek · Daehyun Kim · Yeong Jun Koh | N/A | Code |
| DistilPose: Tokenized Pose Regression With Heatmap Distillation | Suhang Ye · Yingyi Zhang · Jie Hu · Liujuan Cao · Shengchuan Zhang · Lei Shen · Jun Wang · Shouhong Ding · Rongrong Ji | N/A | Code |
| Beyond Appearance: A Semantic Controllable Self-Supervised Learning Framework for Human-Centric Visual Tasks | Weihua Chen · Xianzhe Xu · Jian Jia · Hao Luo · Yaohua Wang · Fan Wang · Rong Jin · Xiuyu Sun | N/A | Code |
| ViPLO: Vision Transformer Based Pose-Conditioned Self-Loop Graph for Human-Object Interaction Detection | Jeeseung Park · Jin-Woo Park · Jong-Seok Lee | N/A | Code |
| EVA: Exploring the Limits of Masked Visual Representation Learning at Scale | Yuxin Fang · Wen Wang · Binhui Xie · Quan Sun · Ledell Wu · Xinggang Wang · Tiejun Huang · Xinlong Wang · Yue Cao | N/A | Code |
| I²-SDF: Intrinsic Indoor Scene Reconstruction and Editing via Raytracing in Neural SDFs | Jingsen Zhu · Yuchi Huo · Qi Ye · Fujun Luan · Jifan Li · Dianbing Xi · Lisha Wang · Rui Tang · Wei Hua · Hujun Bao · Rui Wang | N/A | Code |
| DrapeNet: Garment Generation and Self-Supervised Draping | Luca De Luigi · Ren Li · Benoît Guillard · Mathieu Salzmann · Pascal Fua | N/A | Code |
| STMixer: A One-Stage Sparse Action Detector | Tao Wu · Mengqi Cao · Ziteng Gao · Gangshan Wu · Limin Wang | N/A | Code |
| Inverse Rendering of Translucent Objects Using Physical and Neural Renderers | Chenhao Li · Trung Thanh Ngo · Hajime Nagahara | N/A | Code |
| Humans As Light Bulbs: 3D Human Reconstruction From Thermal Reflection | Ruoshi Liu · Carl Vondrick | N/A | Code |
| CF-Font: Content Fusion for Few-Shot Font Generation | Chi Wang · Min Zhou · Tiezheng Ge · Yuning Jiang · Hujun Bao · Weiwei Xu | N/A | Code |
| GLeaD: Improving GANs With a Generator-Leading Task | Qingyan Bai · Ceyuan Yang · Yinghao Xu · Xihui Liu · Yujiu Yang · Yujun Shen | N/A | Code |
| StarCraftImage: A Dataset for Prototyping Spatial Reasoning Methods for Multi-Agent Environments | Sean Kulinski · Nicholas R. Waytowich · James Z. Hare · David I. Inouye | N/A | Code |
| WIRE: Wavelet Implicit Neural Representations | Vishwanath Saragadam · Daniel LeJeune · Jasper Tan · Guha Balakrishnan · Ashok Veeraraghavan · Richard G. Baraniuk | N/A | Code |
| Thermal Spread Functions (TSF): Physics-Guided Material Classification | Aniket Dashpute · Vishwanath Saragadam · Emma Alexander · Florian Willomitzer · Aggelos Katsaggelos · Ashok Veeraraghavan · Oliver Cossairt | N/A | Code |
| Improving Zero-Shot Generalization and Robustness of Multi-Modal Models | Yunhao Ge · Jie Ren · Andrew Gallagher · Yuxiao Wang · Ming-Hsuan Yang · Hartwig Adam · Laurent Itti · Balaji Lakshminarayanan · Jiaping Zhao | N/A | Code |
| The Differentiable Lens: Compound Lens Search Over Glass Surfaces and Materials for Object Detection | Geoffroi Côté · Fahim Mannan · Simon Thibault · Jean-François Lalonde · Felix Heide | N/A | Code |
| Federated Domain Generalization With Generalization Adjustment | Ruipeng Zhang · Qinwei Xu · Jiangchao Yao · Ya Zhang · Qi Tian · Yanfeng Wang | N/A | Code |
| Propagate and Calibrate: Real-Time Passive Non-Line-of-Sight Tracking | Yihao Wang · Zhigang Wang · Bin Zhao · Dong Wang · Mulin Chen · Xuelong Li | N/A | Code |
| Fine-Grained Image-Text Matching by Cross-Modal Hard Aligning Network | Zhengxin Pan · Fangyu Wu · Bailing Zhang | N/A | Code |
| On the Benefits of 3D Pose and Tracking for Human Action Recognition | Jathushan Rajasegaran · Georgios Pavlakos · Angjoo Kanazawa · Christoph Feichtenhofer · Jitendra Malik | N/A | Code |
| Visual DNA: Representing and Comparing Images Using Distributions of Neuron Activations | Benjamin Ramtoula · Matthew Gadd · Paul Newman · Daniele De Martini | N/A | Code |
| Fine-Tuned CLIP Models Are Efficient Video Learners | Hanoona Rasheed · Muhammad Uzair Khattak · Muhammad Maaz · Salman Khan · Fahad Shahbaz Khan | N/A | Code |
| Connecting Vision and Language With Video Localized Narratives | Paul Voigtlaender · Soravit Changpinyo · Jordi Pont-Tuset · Radu Soricut · Vittorio Ferrari | N/A | Code |
| K-Planes: Explicit Radiance Fields in Space, Time, and Appearance | Sara Fridovich-Keil · Giacomo Meanti · Frederik Rahbæk Warburg · Benjamin Recht · Angjoo Kanazawa | N/A | Code |
| Virtual Occlusions Through Implicit Depth | Jamie Watson · Mohamed Sayed · Zawar Qureshi · Gabriel J. Brostow · Sara Vicente · Oisin Mac Aodha · Michael Firman | N/A | Code |
| Common Pets in 3D: Dynamic New-View Synthesis of Real-Life Deformable Categories | Samarth Sinha · Roman Shapovalov · Jeremy Reizenstein · Ignacio Rocco · Natalia Neverova · Andrea Vedaldi · David Novotny | N/A | Code |
| LG-BPN: Local and Global Blind-Patch Network for Self-Supervised Real-World Denoising | Zichun Wang · Ying Fu · Ji Liu · Yulun Zhang | N/A | Code |
| One-Shot High-Fidelity Talking-Head Synthesis With Deformable Neural Radiance Field | Weichuang Li · Longhao Zhang · Dong Wang · Bin Zhao · Zhigang Wang · Mulin Chen · Bang Zhang · Zhongjian Wang · Liefeng Bo · Xuelong Li | N/A | Code |
| Collaborative Diffusion for Multi-Modal Face Generation and Editing | Ziqi Huang · Kelvin C.K. Chan · Yuming Jiang · Ziwei Liu | N/A | Code |
| Blind Video Deflickering by Neural Filtering With a Flawed Atlas | Chenyang Lei · Xuanchi Ren · Zhaoxiang Zhang · Qifeng Chen | N/A | Code |
| RefTeacher: A Strong Baseline for Semi-Supervised Referring Expression Comprehension | Jiamu Sun · Gen Luo · Yiyi Zhou · Xiaoshuai Sun · Guannan Jiang · Zhiyu Wang · Rongrong Ji | N/A | Code |
| HNeRV: A Hybrid Neural Representation for Videos | Hao Chen · Matthew Gwilliam · Ser-Nam Lim · Abhinav Shrivastava | N/A | Code |
| Learning 3D-Aware Image Synthesis With Unknown Pose Distribution | Zifan Shi · Yujun Shen · Yinghao Xu · Sida Peng · Yiyi Liao · Sheng Guo · Qifeng Chen · Dit-Yan Yeung | N/A | Code |
| DynaFed: Tackling Client Data Heterogeneity With Global Dynamics | Renjie Pi · Weizhong Zhang · Yueqi Xie · Jiahui Gao · Xiaoyu Wang · Sunghun Kim · Qifeng Chen | N/A | Code |
| Enlarging Instance-Specific and Class-Specific Information for Open-Set Action Recognition | Jun Cen · Shiwei Zhang · Xiang Wang · Yixuan Pei · Zhiwu Qing · Yingya Zhang · Qifeng Chen | N/A | Code |
| RODIN: A Generative Model for Sculpting 3D Digital Avatars Using Diffusion | Tengfei Wang · Bo Zhang · Ting Zhang · Shuyang Gu · Jianmin Bao · Tadas Baltrusaitis · Jingjing Shen · Dong Chen · Fang Wen · Qifeng Chen · Baining Guo | N/A | Code |
| IFSeg: Image-Free Semantic Segmentation via Vision-Language Model | Sukmin Yun · Seong Hyeon Park · Paul Hongsuck Seo · Jinwoo Shin | N/A | Code |
| Detecting Everything in the Open World: Towards Universal Object Detection | Zhenyu Wang · Yali Li · Xi Chen · Ser-Nam Lim · Antonio Torralba · Hengshuang Zhao · Shengjin Wang | N/A | Code |
| Improving Visual Grounding by Encouraging Consistent Gradient-Based Explanations | Ziyan Yang · Kushal Kafle · Franck Dernoncourt · Vicente Ordonez | N/A | Code |
| Temporally Consistent Online Depth Estimation Using Point-Based Fusion | Numair Khan · Eric Penner · Douglas Lanman · Lei Xiao | N/A | Code |
| NeuralDome: A Neural Modeling Pipeline on Multi-View Human-Object Interactions | Juze Zhang · Haimin Luo · Hongdi Yang · Xinru Xu · Qianyang Wu · Ye Shi · Jingyi Yu · Lan Xu · Jingya Wang | N/A | Code |
| Token Turing Machines | Michael S. Ryoo · Keerthana Gopalakrishnan · Kumara Kahatapitiya · Ted Xiao · Kanishka Rao · Austin Stone · Yao Lu · Julian Ibarz · Anurag Arnab | N/A | Code |
| Computationally Budgeted Continual Learning: What Does Matter? | Ameya Prabhu · Hasan Abed Al Kader Hammoud · Puneet K. Dokania · Philip H.S. Torr · Ser-Nam Lim · Bernard Ghanem · Adel Bibi | N/A | Code |
| CLIP2Protect: Protecting Facial Privacy Using Text-Guided Makeup via Adversarial Latent Search | Fahad Shamshad · Muzammal Naseer · Karthik Nandakumar | N/A | Code |
| Robot Structure Prior Guided Temporal Attention for Camera-to-Robot Pose Estimation From Image Sequence | Yang Tian · Jiyao Zhang · Zekai Yin · Hao Dong | N/A | Code |
| Affordances From Human Videos as a Versatile Representation for Robotics | Shikhar Bahl · Russell Mendonca · Lili Chen · Unnat Jain · Deepak Pathak | N/A | Code |
| MIANet: Aggregating Unbiased Instance and General Information for Few-Shot Semantic Segmentation | Yong Yang · Qiong Chen · Yuan Feng · Tianlin Huang | N/A | Code |
| Learning To Generate Image Embeddings With User-Level Differential Privacy | Zheng Xu · Maxwell Collins · Yuxiao Wang · Liviu Panait · Sewoong Oh · Sean Augenstein · Ting Liu · Florian Schroff · H. Brendan McMahan | N/A | Code |
| Genie: Show Me the Data for Quantization | Yongkweon Jeon · Chungman Lee · Ho-young Kim | N/A | Code |
| DSVT: Dynamic Sparse Voxel Transformer With Rotated Sets | Haiyang Wang · Chen Shi · Shaoshuai Shi · Meng Lei · Sen Wang · Di He · Bernt Schiele · Liwei Wang | N/A | Code |
| Transformer-Based Learned Optimization | Erik Gärtner · Luke Metz · Mykhaylo Andriluka · C. Daniel Freeman · Cristian Sminchisescu | N/A | Code |
| Zero-Shot Noise2Noise: Efficient Image Denoising Without Any Data | Youssef Mansour · Reinhard Heckel | N/A | Code |
| Super-Resolution Neural Operator | Min Wei · Xuesong Zhang | N/A | Code |
| StyleIPSB: Identity-Preserving Semantic Basis of StyleGAN for High Fidelity Face Swapping | Diqiong Jiang · Dan Song · Ruofeng Tong · Min Tang | N/A | Code |
| Self-Supervised Blind Motion Deblurring With Deep Expectation Maximization | Ji Li · Weixi Wang · Yuesong Nan · Hui Ji | N/A | Code |
| Confidence-Aware Personalized Federated Learning via Variational Expectation Maximization | Junyi Zhu · Xingchen Ma · Matthew B. Blaschko | N/A | Code |
| Human Pose As Compositional Tokens | Zigang Geng · Chunyu Wang · Yixuan Wei · Ze Liu · Houqiang Li · Han Hu | N/A | Code |
| GeoMAE: Masked Geometric Target Prediction for Self-Supervised Point Cloud Pre-Training | Xiaoyu Tian · Haoxi Ran · Yue Wang · Hang Zhao | N/A | Code |
| RUST: Latent Neural Scene Representations From Unposed Imagery | Mehdi S. M. Sajjadi · Aravindh Mahendran · Thomas Kipf · Etienne Pot · Daniel Duckworth · Mario Lučić · Klaus Greff | N/A | Code |
| Bias Mimicking: A Simple Sampling Approach for Bias Mitigation | Maan Qraitem · Kate Saenko · Bryan A. Plummer | N/A | Code |
| V2X-Seq: A Large-Scale Sequential Dataset for Vehicle-Infrastructure Cooperative Perception and Forecasting | Haibao Yu · Wenxian Yang · Hongzhi Ruan · Zhenwei Yang · Yingjuan Tang · Xu Gao · Xin Hao · Yifeng Shi · Yifeng Pan · Ning Sun · Juan Song · Jirui Yuan · Ping Luo · Zaiqing Nie | N/A | Code |
| Conditional Image-to-Video Generation With Latent Flow Diffusion Models | Haomiao Ni · Changhao Shi · Kai Li · Sharon X. Huang · Martin Renqiang Min | N/A | Code |
| Anchor3DLane: Learning To Regress 3D Anchors for Monocular 3D Lane Detection | Shaofei Huang · Zhenwei Shen · Zehao Huang · Zi-han Ding · Jiao Dai · Jizhong Han · Naiyan Wang · Si Liu | N/A | Code |
| 3D Semantic Segmentation in the Wild: Learning Generalized Models for Adverse-Condition Point Clouds | Aoran Xiao · Jiaxing Huang · Weihao Xuan · Ruijie Ren · Kangcheng Liu · Dayan Guan · Abdulmotaleb El Saddik · Shijian Lu · Eric P. Xing | N/A | Code |
| NeMo: Learning 3D Neural Motion Fields From Multiple Video Instances of the Same Action | Kuan-Chieh Wang · Zhenzhen Weng · Maria Xenochristou · João Pedro Araújo · Jeffrey Gu · Karen Liu · Serena Yeung | N/A | Code |
| Decomposed Soft Prompt Guided Fusion Enhancing for Compositional Zero-Shot Learning | Xiaocheng Lu · Song Guo · Ziming Liu · Jingcai Guo | N/A | Code |
| iDisc: Internal Discretization for Monocular Depth Estimation | Luigi Piccinelli · Christos Sakaridis · Fisher Yu | N/A | Code |
| UniDexGrasp: Universal Robotic Dexterous Grasping via Learning Diverse Proposal Generation and Goal-Conditioned Policy | Yinzhen Xu · Weikang Wan · Jialiang Zhang · Haoran Liu · Zikang Shan · Hao Shen · Ruicheng Wang · Haoran Geng · Yijia Weng · Jiayi Chen · Tengyu Liu · Li Yi · He Wang | N/A | Code |
| PolyFormer: Referring Image Segmentation As Sequential Polygon Generation | Jiang Liu · Hui Ding · Zhaowei Cai · Yuting Zhang · Ravi Kumar Satzoda · Vijay Mahadevan · R. Manmatha | N/A | Code |
| Interactive Segmentation of Radiance Fields | Rahul Goel · Dhawal Sirikonda · Saurabh Saini · P. J. Narayanan | N/A | Code |
| PointCert: Point Cloud Classification With Deterministic Certified Robustness Guarantees | Jinghuai Zhang · Jinyuan Jia · Hongbin Liu · Neil Zhenqiang Gong | N/A | Code |
| Indiscernible Object Counting in Underwater Scenes | Guolei Sun · Zhaochong An · Yun Liu · Ce Liu · Christos Sakaridis · Deng-Ping Fan · Luc Van Gool | N/A | Code |
| Improving Robustness of Vision Transformers by Reducing Sensitivity To Patch Corruptions | Yong Guo · David Stutz · Bernt Schiele | N/A | Code |
| Real-Time Multi-Person Eyeblink Detection in the Wild for Untrimmed Video | Wenzheng Zeng · Yang Xiao · Sicheng Wei · Jinfang Gan · Xintao Zhang · Zhiguo Cao · Zhiwen Fang · Joey Tianyi Zhou | N/A | Code |
| BEV-LaneDet: An Efficient 3D Lane Detection Based on Virtual Camera via Key-Points | Ruihao Wang · Jian Qin · Kaiying Li · Yaochen Li · Dong Cao · Jintao Xu | N/A | Code |
| Infinite Photorealistic Worlds Using Procedural Generation | Alexander Raistrick · Lahav Lipson · Zeyu Ma · Lingjie Mei · Mingzhe Wang · Yiming Zuo · Karhan Kayan · Hongyu Wen · Beining Han · Yihan Wang · Alejandro Newell · Hei Law · Ankit Goyal · Kaiyu Yang · Jia Deng | N/A | Code |
| High-Fidelity 3D Human Digitization From Single 2K Resolution Images | Sang-Hun Han · Min-Gyu Park · Ju Hong Yoon · Ju-Mi Kang · Young-Jae Park · Hae-Gon Jeon | N/A | Code |
| GALIP: Generative Adversarial CLIPs for Text-to-Image Synthesis | Ming Tao · Bing-Kun Bao · Hao Tang · Changsheng Xu | N/A | Code |
| Language-Guided Audio-Visual Source Separation via Trimodal Consistency | Reuben Tan · Arijit Ray · Andrea Burns · Bryan A. Plummer · Justin Salamon · Oriol Nieto · Bryan Russell · Kate Saenko | N/A | Code |
| Probabilistic Debiasing of Scene Graphs | Bashirul Azam Biswas · Qiang Ji | N/A | Code |
| PVO: Panoptic Visual Odometry | Weicai Ye · Xinyue Lan · Shuo Chen · Yuhang Ming · Xingyuan Yu · Hujun Bao · Zhaopeng Cui · Guofeng Zhang | N/A | Code |
| Superclass Learning With Representation Enhancement | Zeyu Gan · Suyun Zhao · Jinlong Kang · Liyuan Shang · Hong Chen · Cuiping Li | N/A | Code |
| GAPartNet: Cross-Category Domain-Generalizable Object Perception and Manipulation via Generalizable and Actionable Parts | Haoran Geng · Helin Xu · Chengyang Zhao · Chao Xu · Li Yi · Siyuan Huang · He Wang | N/A | Code |
| Learning the Distribution of Errors in Stereo Matching for Joint Disparity and Uncertainty Estimation | Liyan Chen · Weihan Wang · Philippos Mordohai | N/A | Code |
| Efficient View Synthesis and 3D-Based Multi-Frame Denoising With Multiplane Feature Representations | Thomas Tanay · Aleš Leonardis · Matteo Maggioni | N/A | Code |
| Large-Capacity and Flexible Video Steganography via Invertible Neural Network | Chong Mou · Youmin Xu · Jiechong Song · Chen Zhao · Bernard Ghanem · Jian Zhang | N/A | Code |
| Generating Part-Aware Editable 3D Shapes Without 3D Supervision | Konstantinos Tertikas · Despoina Paschalidou · Boxiao Pan · Jeong Joon Park · Mikaela Angelina Uy · Ioannis Emiris · Yannis Avrithis · Leonidas Guibas | N/A | Code |
| Vision Transformer With Super Token Sampling | Huaibo Huang · Xiaoqiang Zhou · Jie Cao · Ran He · Tieniu Tan | N/A | Code |
| Renderable Neural Radiance Map for Visual Navigation | Obin Kwon · Jeongho Park · Songhwai Oh | N/A | Code |
| Learning Compact Representations for LiDAR Completion and Generation | Yuwen Xiong · Wei-Chiu Ma · Jingkang Wang · Raquel Urtasun | N/A | Code |
| CoMFormer: Continual Learning in Semantic and Panoptic Segmentation | Fabio Cermelli · Matthieu Cord · Arthur Douillard | N/A | Code |
| A Bag-of-Prototypes Representation for Dataset-Level Applications | Weijie Tu · Weijian Deng · Tom Gedeon · Liang Zheng | N/A | Code |
| Geometric Visual Similarity Learning in 3D Medical Image Self-Supervised Pre-Training | Yuting He · Guanyu Yang · Rongjun Ge · Yang Chen · Jean-Louis Coatrieux · Boyu Wang · Shuo Li | N/A | Code |
| Weakly Supervised Video Emotion Detection and Prediction via Cross-Modal Temporal Erasing Network | Zhicheng Zhang · Lijuan Wang · Jufeng Yang | N/A | Code |
| CODA-Prompt: COntinual Decomposed Attention-Based Prompting for Rehearsal-Free Continual Learning | James Seale Smith · Leonid Karlinsky · Vyshnavi Gutta · Paola Cascante-Bonilla · Donghyun Kim · Assaf Arbelle · Rameswar Panda · Rogerio Feris · Zsolt Kira | N/A | Code |
| CodeTalker: Speech-Driven 3D Facial Animation With Discrete Motion Prior | Jinbo Xing · Menghan Xia · Yuechen Zhang · Xiaodong Cun · Jue Wang · Tien-Tsin Wong | N/A | Code |
| VolRecon: Volume Rendering of Signed Ray Distance Functions for Generalizable Multi-View Reconstruction | Yufan Ren · Fangjinhua Wang · Tong Zhang · Marc Pollefeys · Sabine Süsstrunk | N/A | Code |
| NewsNet: A Novel Dataset for Hierarchical Temporal Segmentation | Haoqian Wu · Keyu Chen · Haozhe Liu · Mingchen Zhuge · Bing Li · Ruizhi Qiao · Xiujun Shu · Bei Gan · Liangsheng Xu · Bo Ren · Mengmeng Xu · Wentian Zhang · Raghavendra Ramachandra · Chia-Wen Lin · Bernard Ghanem | N/A | Code |
| Ref-NPR: Reference-Based Non-Photorealistic Radiance Fields for Controllable Scene Stylization | Yuechen Zhang · Zexin He · Jinbo Xing · Xufeng Yao · Jiaya Jia | N/A | Code |
| GANmouflage: 3D Object Nondetection With Texture Fields | Rui Guo · Jasmine Collins · Oscar de Lima · Andrew Owens | N/A | Code |
| GP-VTON: Towards General Purpose Virtual Try-On via Collaborative Local-Flow Global-Parsing Learning | Zhenyu Xie · Zaiyu Huang · Xin Dong · Fuwei Zhao · Haoye Dong · Xijin Zhang · Feida Zhu · Xiaodan Liang | N/A | Code |
| DeSTSeg: Segmentation Guided Denoising Student-Teacher for Anomaly Detection | Xuan Zhang · Shiyu Li · Xi Li · Ping Huang · Jiulong Shan · Ting Chen | N/A | Code |
| Pix2map: Cross-Modal Retrieval for Inferring Street Maps From Images | Xindi Wu · KwunFung Lau · Francesco Ferroni · Aljoša Ošep · Deva Ramanan | N/A | Code |
| Beyond mAP: Towards Better Evaluation of Instance Segmentation | Rohit Jena · Lukas Zhornyak · Nehal Doiphode · Pratik Chaudhari · Vivek Buch · James Gee · Jianbo Shi | N/A | Code |
| Federated Learning With Data-Agnostic Distribution Fusion | Jian-hui Duan · Wenzhong Li · Derun Zou · Ruichen Li · Sanglu Lu | N/A | Code |
| Make-a-Story: Visual Memory Conditioned Consistent Story Generation | Tanzila Rahman · Hsin-Ying Lee · Jian Ren · Sergey Tulyakov · Shweta Mahajan · Leonid Sigal | N/A | Code |
| Scalable, Detailed and Mask-Free Universal Photometric Stereo | Satoshi Ikehata | N/A | Code |
| ToThePoint: Efficient Contrastive Learning of 3D Point Clouds via Recycling | Xinglin Li · Jiajing Chen · Jinhui Ouyang · Hanhui Deng · Senem Velipasalar · Di Wu | N/A | Code |
| Local-to-Global Registration for Bundle-Adjusting Neural Radiance Fields | Yue Chen · Xingyu Chen · Xuan Wang · Qi Zhang · Yu Guo · Ying Shan · Fei Wang | N/A | Code |
| UV Volumes for Real-Time Rendering of Editable Free-View Human Performance | Yue Chen · Xuan Wang · Xingyu Chen · Qi Zhang · Xiaoyu Li · Yu Guo · Jue Wang · Fei Wang | N/A | Code |
| SplineCam: Exact Visualization and Characterization of Deep Network Geometry and Decision Boundaries | Ahmed Imtiaz Humayun · Randall Balestriero · Guha Balakrishnan · Richard G. Baraniuk | N/A | Code |
| Hi-LASSIE: High-Fidelity Articulated Shape and Skeleton Discovery From Sparse Image Ensemble | Chun-Han Yao · Wei-Chih Hung · Yuanzhen Li · Michael Rubinstein · Ming-Hsuan Yang · Varun Jampani | N/A | Code |
| VisFusion: Visibility-Aware Online 3D Scene Reconstruction From Videos | Huiyu Gao · Wei Mao · Miaomiao Liu | N/A | Code |
| Unsupervised Volumetric Animation | Aliaksandr Siarohin · Willi Menapace · Ivan Skorokhodov · Kyle Olszewski · Jian Ren · Hsin-Ying Lee · Menglei Chai · Sergey Tulyakov | N/A | Code |
| DKM: Dense Kernelized Feature Matching for Geometry Estimation | Johan Edstedt · Ioannis Athanasiadis · Mårten Wadenbäck · Michael Felsberg | N/A | Code |
| All in One: Exploring Unified Video-Language Pre-Training | Jinpeng Wang · Yixiao Ge · Rui Yan · Yuying Ge · Kevin Qinghong Lin · Satoshi Tsutsui · Xudong Lin · Guanyu Cai · Jianping Wu · Ying Shan · Xiaohu Qie · Mike Zheng Shou | N/A | Code |
| Spatiotemporal Self-Supervised Learning for Point Clouds in the Wild | Yanhao Wu · Tong Zhang · Wei Ke · Sabine Süsstrunk · Mathieu Salzmann | N/A | Code |
| DynIBaR: Neural Dynamic Image-Based Rendering | Zhengqi Li · Qianqian Wang · Forrester Cole · Richard Tucker · Noah Snavely | N/A | Code |
| Seeing Through the Glass: Neural 3D Reconstruction of Object Inside a Transparent Container | Jinguang Tong · Sundaram Muthu · Fahira Afzal Maken · Chuong Nguyen · Hongdong Li | N/A | Code |
| JAWS: Just a Wild Shot for Cinematic Transfer in Neural Radiance Fields | Xi Wang · Robin Courant · Jinglei Shi · Eric Marchand · Marc Christie | N/A | Code |
| CCuantuMM: Cycle-Consistent Quantum-Hybrid Matching of Multiple Shapes | Harshil Bhatia · Edith Tretschk · Zorah Lähner · Marcel Seelbach Benkner · Michael Moeller · Christian Theobalt · Vladislav Golyanik | N/A | Code |
| NS3D: Neuro-Symbolic Grounding of 3D Objects and Relations | Joy Hsu · Jiayuan Mao · Jiajun Wu | N/A | Code |
| TempSAL – Uncovering Temporal Information for Deep Saliency Prediction | Bahar Aydemir · Ludo Hoffstetter · Tong Zhang · Mathieu Salzmann · Sabine Süsstrunk | N/A | Code |
| BiasBed – Rigorous Texture Bias Evaluation | Nikolai Kalischek · Rodrigo Caye Daudt · Torben Peters · Reinhard Furrer · Jan D. Wegner · Konrad Schindler | N/A | Code |
| Real-Time Neural Light Field on Mobile Devices | Junli Cao · Huan Wang · Pavlo Chemerys · Vladislav Shakhrai · Ju Hu · Yun Fu · Denys Makoviichuk · Sergey Tulyakov · Jian Ren | N/A | Code |
| Where Is My Wallet? Modeling Object Proposal Sets for Egocentric Visual Query Localization | Mengmeng Xu · Yanghao Li · Cheng-Yang Fu · Bernard Ghanem · Tao Xiang · Juan-Manuel Pérez-Rúa | N/A | Code |
| DiffusionRig: Learning Personalized Priors for Facial Appearance Editing | Zheng Ding · Xuaner Zhang · Zhihao Xia · Lars Jebe · Zhuowen Tu · Xiuming Zhang | N/A | Code |
| Neural Scene Chronology | Haotong Lin · Qianqian Wang · Ruojin Cai · Sida Peng · Hadar Averbuch-Elor · Xiaowei Zhou · Noah Snavely | N/A | Code |
| Diversity-Aware Meta Visual Prompting | Qidong Huang · Xiaoyi Dong · Dongdong Chen · Weiming Zhang · Feifei Wang · Gang Hua · Nenghai Yu | N/A | Code |
| Privacy-Preserving Representations Are Not Enough: Recovering Scene Content From Camera Poses | Kunal Chelani · Torsten Sattler · Fredrik Kahl · Zuzana Kukelova | N/A | Code |
| Masked Jigsaw Puzzle: A Versatile Position Embedding for Vision Transformers | Bin Ren · Yahui Liu · Yue Song · Wei Bi · Rita Cucchiara · Nicu Sebe · Wei Wang | N/A | Code |
| Box-Level Active Detection | Mengyao Lyu · Jundong Zhou · Hui Chen · Yijie Huang · Dongdong Yu · Yaqian Li · Yandong Guo · Yuchen Guo · Liuyu Xiang · Guiguang Ding | N/A | Code |
| Unlearnable Clusters: Towards Label-Agnostic Unlearnable Examples | Jiaming Zhang · Xingjun Ma · Qi Yi · Jitao Sang · Yu-Gang Jiang · Yaowei Wang · Changsheng Xu | N/A | Code |
| Generalized Relation Modeling for Transformer Tracking | Shenyuan Gao · Chunluan Zhou · Jun Zhang | N/A | Code |
| MoFusion: A Framework for Denoising-Diffusion-Based Motion Synthesis | Rishabh Dabral · Muhammad Hamza Mughal · Vladislav Golyanik · Christian Theobalt | N/A | Code |
| Patch-Mix Transformer for Unsupervised Domain Adaptation: A Game Perspective | Jinjing Zhu · Haotian Bai · Lin Wang | N/A | Code |
| Distilling Neural Fields for Real-Time Articulated Shape Reconstruction | Jeff Tan · Gengshan Yang · Deva Ramanan | N/A | Code |
| Sampling Is Matter: Point-Guided 3D Human Mesh Reconstruction | Jeonghwan Kim · Mi-Gyeong Gwon · Hyunwoo Park · Hyukmin Kwon · Gi-Mun Um · Wonjun Kim | N/A | Code |
| Image Quality-Aware Diagnosis via Meta-Knowledge Co-Embedding | Haoxuan Che · Siyu Chen · Hao Chen | N/A | Code |
| Towards Practical Plug-and-Play Diffusion Models | Hyojun Go · Yunsung Lee · Jin-Young Kim · Seunghyun Lee · Myeongho Jeong · Hyun Seung Lee · Seungtaek Choi | N/A | Code |
| HRDFuse: Monocular 360° Depth Estimation by Collaboratively Learning Holistic-With-Regional Depth Distributions | Hao Ai · Zidong Cao · Yan-Pei Cao · Ying Shan · Lin Wang | N/A | Code |
| KERM: Knowledge Enhanced Reasoning for Vision-and-Language Navigation | Xiangyang Li · Zihan Wang · Jiahao Yang · Yaowei Wang · Shuqiang Jiang | N/A | Code |
| Tri-Perspective View for Vision-Based 3D Semantic Occupancy Prediction | Yuanhui Huang · Wenzhao Zheng · Yunpeng Zhang · Jie Zhou · Jiwen Lu | N/A | Code |
| EventNeRF: Neural Radiance Fields From a Single Colour Event Camera | Viktor Rudnev · Mohamed Elgharib · Christian Theobalt · Vladislav Golyanik | N/A | Code |
| Physically Realizable Natural-Looking Clothing Textures Evade Person Detectors via 3D Modeling | Zhanhao Hu · Wenda Chu · Xiaopei Zhu · Hui Zhang · Bo Zhang · Xiaolin Hu | N/A | Code |
| Global Vision Transformer Pruning With Hessian-Aware Saliency | Huanrui Yang · Hongxu Yin · Maying Shen · Pavlo Molchanov · Hai Li · Jan Kautz | N/A | Code |
| 3D Human Pose Estimation With Spatio-Temporal Criss-Cross Attention | Zhenhua Tang · Zhaofan Qiu · Yanbin Hao · Richang Hong · Ting Yao | N/A | Code |
| Learning Spatial-Temporal Implicit Neural Representations for Event-Guided Video Super-Resolution | Yunfan Lu · Zipeng Wang · Minjie Liu · Hongjian Wang · Lin Wang | N/A | Code |
| StyleGAN Salon: Multi-View Latent Optimization for Pose-Invariant Hairstyle Transfer | Sasikarn Khwanmuang · Pakkapon Phongthawee · Patsorn Sangkloy · Supasorn Suwajanakorn | N/A | Code |
| ShapeClipper: Scalable 3D Shape Learning From Single-View Images via Geometric and CLIP-Based Consistency | Zixuan Huang · Varun Jampani · Anh Thai · Yuanzhen Li · Stefan Stojanov · James M. Rehg | N/A | Code |
| Efficient Scale-Invariant Generator With Column-Row Entangled Pixel Synthesis | Thuan Hoang Nguyen · Thanh Van Le · Anh Tran | N/A | Code |
| Paired-Point Lifting for Enhanced Privacy-Preserving Visual Localization | Chunghwan Lee · Jaihoon Kim · Chanhyuk Yun · Je Hyeong Hong | N/A | Code |
| Both Style and Distortion Matter: Dual-Path Unsupervised Domain Adaptation for Panoramic Semantic Segmentation | Xu Zheng · Jinjing Zhu · Yexin Liu · Zidong Cao · Chong Fu · Lin Wang | N/A | Code |
| Adaptive Human Matting for Dynamic Videos | Chung-Ching Lin · Jiang Wang · Kun Luo · Kevin Lin · Linjie Li · Lijuan Wang · Zicheng Liu | N/A | Code |
| High-Fidelity Facial Avatar Reconstruction From Monocular Video With Generative Priors | Yunpeng Bai · Yanbo Fan · Xuan Wang · Yong Zhang · Jingxiang Sun · Chun Yuan · Ying Shan | N/A | Code |
| Data-Free Knowledge Distillation via Feature Exchange and Activation Region Constraint | Shikang Yu · Jiachen Chen · Hu Han · Shuqiang Jiang | N/A | Code |
| Im2Hands: Learning Attentive Implicit Representation of Interacting Two-Hand Shapes | Jihyun Lee · Minhyuk Sung · Honggyu Choi · Tae-Kyun Kim | N/A | Code |
| MD-VQA: Multi-Dimensional Quality Assessment for UGC Live Videos | Zicheng Zhang · Wei Wu · Wei Sun · Danyang Tu · Wei Lu · Xiongkuo Min · Ying Chen · Guangtao Zhai | N/A | Code |
| Make Landscape Flatter in Differentially Private Federated Learning | Yifan Shi · Yingqi Liu · Kang Wei · Li Shen · Xueqian Wang · Dacheng Tao | N/A | Code |
| A Large-Scale Robustness Analysis of Video Action Recognition Models | Madeline Chantry Schiappa · Naman Biyani · Prudvi Kamtam · Shruti Vyas · Hamid Palangi · Vibhav Vineet · Yogesh S. Rawat | N/A | Code |
| Multi-Concept Customization of Text-to-Image Diffusion | Nupur Kumari · Bingliang Zhang · Richard Zhang · Eli Shechtman · Jun-Yan Zhu | N/A | Code |
| GANHead: Towards Generative Animatable Neural Head Avatars | Sijing Wu · Yichao Yan · Yunhao Li · Yuhao Cheng · Wenhan Zhu · Ke Gao · Xiaobo Li · Guangtao Zhai | N/A | Code |
| Neural Koopman Pooling: Control-Inspired Temporal Dynamics Encoding for Skeleton-Based Action Recognition | Xinghan Wang · Xin Xu · Yadong Mu | N/A | Code |
| Hierarchical B-Frame Video Coding Using Two-Layer CANF Without Motion Coding | David Alexandre · Hsueh-Ming Hang · Wen-Hsiao Peng | N/A | Code |
| FeatER: An Efficient Network for Human Reconstruction via Feature Map-Based TransformER | Ce Zheng · Matias Mendieta · Taojiannan Yang · Guo-Jun Qi · Chen Chen | N/A | Code |
| Delivering Arbitrary-Modal Semantic Segmentation | Jiaming Zhang · Ruiping Liu · Hao Shi · Kailun Yang · Simon Reiß · Kunyu Peng · Haodong Fu · Kaiwei Wang · Rainer Stiefelhagen | N/A | Code |
| Deep Graph-Based Spatial Consistency for Robust Non-Rigid Point Cloud Registration | Zheng Qin · Hao Yu · Changjian Wang · Yuxing Peng · Kai Xu | N/A | Code |
| HumanGen: Generating Human Radiance Fields With Explicit Priors | Suyi Jiang · Haoran Jiang · Ziyu Wang · Haimin Luo · Wenzheng Chen · Lan Xu | N/A | Code |
| Boosting Accuracy and Robustness of Student Models via Adaptive Adversarial Distillation | Bo Huang · Mingyang Chen · Yi Wang · Junda Lu · Minhao Cheng · Wei Wang | N/A | Code |
| Plug-and-Play Diffusion Features for Text-Driven Image-to-Image Translation | Narek Tumanyan · Michal Geyer · Shai Bagon · Tali Dekel | N/A | Code |
| Rotation-Invariant Transformer for Point Cloud Matching | Hao Yu · Zheng Qin · Ji Hou · Mahdi Saleh · Dongsheng Li · Benjamin Busam · Slobodan Ilic | N/A | Code |
| CLIP2Scene: Towards Label-Efficient 3D Scene Understanding by CLIP | Runnan Chen · Youquan Liu · Lingdong Kong · Xinge Zhu · Yuexin Ma · Yikang Li · Yuenan Hou · Yu Qiao · Wenping Wang | N/A | Code |
| Real-Time 6K Image Rescaling With Rate-Distortion Optimization | Chenyang Qi · Xin Yang · Ka Leong Cheng · Ying-Cong Chen · Qifeng Chen | N/A | Code |
| Focused and Collaborative Feedback Integration for Interactive Image Segmentation | Qiaoqiao Wei · Hui Zhang · Jun-Hai Yong | N/A | Code |
| Language-Guided Music Recommendation for Video via Prompt Analogies | Daniel McKee · Justin Salamon · Josef Sivic · Bryan Russell | N/A | Code |
| TarViS: A Unified Approach for Target-Based Video Segmentation | Ali Athar · Alexander Hermans · Jonathon Luiten · Deva Ramanan · Bastian Leibe | N/A | Code |
| Meta-Personalizing Vision-Language Models To Find Named Instances in Video | Chun-Hsiao Yeh · Bryan Russell · Josef Sivic · Fabian Caba Heilbron · Simon Jenni | N/A | Code |
| ARKitTrack: A New Diverse Dataset for Tracking Using Mobile RGB-D Data | Haojie Zhao · Junsong Chen · Lijun Wang · Huchuan Lu | N/A | Code |
| Scaling Language-Image Pre-Training via Masking | Yanghao Li · Haoqi Fan · Ronghang Hu · Christoph Feichtenhofer · Kaiming He | N/A | Code |
| SeqTrack: Sequence to Sequence Learning for Visual Object Tracking | Xin Chen · Houwen Peng · Dong Wang · Huchuan Lu · Han Hu | N/A | Code |
| Learning Neural Parametric Head Models | Simon Giebenhain · Tobias Kirschstein · Markos Georgopoulos · Martin Rünz · Lourdes Agapito · Matthias Nießner | N/A | Code |
| L-CoIns: Language-Based Colorization With Instance Awareness | Zheng Chang · Shuchen Weng · Peixuan Zhang · Yu Li · Si Li · Boxin Shi | N/A | Code |
| Vid2Seq: Large-Scale Pretraining of a Visual Language Model for Dense Video Captioning | Antoine Yang · Arsha Nagrani · Paul Hongsuck Seo · Antoine Miech · Jordi Pont-Tuset · Ivan Laptev · Josef Sivic · Cordelia Schmid | N/A | Code |
| ULIP: Learning a Unified Representation of Language, Images, and Point Clouds for 3D Understanding | Le Xue · Mingfei Gao · Chen Xing · Roberto Martín-Martín · Jiajun Wu · Caiming Xiong · Ran Xu · Juan Carlos Niebles · Silvio Savarese | N/A | Code |
| GM-NeRF: Learning Generalizable Model-Based Neural Radiance Fields From Multi-View Images | Jianchuan Chen · Wentao Yi · Liqian Ma · Xu Jia · Huchuan Lu | N/A | Code |
| MIC: Masked Image Consistency for Context-Enhanced Domain Adaptation | Lukas Hoyer · Dengxin Dai · Haoran Wang · Luc Van Gool | N/A | Code |
| MED-VT: Multiscale Encoder-Decoder Video Transformer With Application To Object Segmentation | Rezaul Karim · He Zhao · Richard P. Wildes · Mennatullah Siam | N/A | Code |
| Hierarchical Dense Correlation Distillation for Few-Shot Segmentation | Bohao Peng · Zhuotao Tian · Xiaoyang Wu · Chengyao Wang · Shu Liu · Jingyong Su · Jiaya Jia | N/A | Code |
| Universal Instance Perception As Object Discovery and Retrieval | Bin Yan · Yi Jiang · Jiannan Wu · Dong Wang · Ping Luo · Zehuan Yuan · Huchuan Lu | N/A | Code |
| Bi-Directional Distribution Alignment for Transductive Zero-Shot Learning | Zhicai Wang · Yanbin Hao · Tingting Mu · Ouxiang Li · Shuo Wang · Xiangnan He | N/A | Code |
| Open-Vocabulary Semantic Segmentation With Mask-Adapted CLIP | Feng Liang · Bichen Wu · Xiaoliang Dai · Kunpeng Li · Yinan Zhao · Hang Zhang · Peizhao Zhang · Peter Vajda · Diana Marculescu | N/A | Code |
| ImageBind: One Embedding Space To Bind Them All | Rohit Girdhar · Alaaeldin El-Nouby · Zhuang Liu · Mannat Singh · Kalyan Vasudev Alwala · Armand Joulin · Ishan Misra | N/A | Code |
| Learning and Aggregating Lane Graphs for Urban Automated Driving | Martin Büchner · Jannik Zürn · Ion-George Todoran · Abhinav Valada · Wolfram Burgard | N/A | Code |
| High-Resolution Image Reconstruction With Latent Diffusion Models From Human Brain Activity | Yu Takagi · Shinji Nishimoto | N/A | Code |
| 3D Cinemagraphy From a Single Image | Xingyi Li · Zhiguo Cao · Huiqiang Sun · Jianming Zhang · Ke Xian · Guosheng Lin | N/A | Code |
| Understanding and Improving Visual Prompting: A Label-Mapping Perspective | Aochuan Chen · Yuguang Yao · Pin-Yu Chen · Yihua Zhang · Sijia Liu | N/A | Code |
| Cut and Learn for Unsupervised Object Detection and Instance Segmentation | Xudong Wang · Rohit Girdhar · Stella X. Yu · Ishan Misra | N/A | Code |
| DF-Platter: Multi-Face Heterogeneous Deepfake Dataset | Kartik Narayan · Harsh Agarwal · Kartik Thakral · Surbhi Mittal · Mayank Vatsa · Richa Singh | N/A | Code |
| BASiS: Batch Aligned Spectral Embedding Space | Or Streicher · Ido Cohen · Guy Gilboa | N/A | Code |
| Annealing-Based Label-Transfer Learning for Open World Object Detection | Yuqing Ma · Hainan Li · Zhange Zhang · Jinyang Guo · Shanghang Zhang · Ruihao Gong · Xianglong Liu | N/A | Code |
| Behind the Scenes: Density Fields for Single View Reconstruction | Felix Wimbauer · Nan Yang · Christian Rupprecht · Daniel Cremers | N/A | Code |
| Learning Video Representations From Large Language Models | Yue Zhao · Ishan Misra · Philipp Krähenbühl · Rohit Girdhar | N/A | Code |
| Quantum Multi-Model Fitting | Matteo Farina · Luca Magri · Willi Menapace · Elisa Ricci · Vladislav Golyanik · Federica Arrigoni | N/A | Code |
| Power Bundle Adjustment for Large-Scale 3D Reconstruction | Simon Weber · Nikolaus Demmel · Tin Chon Chan · Daniel Cremers | N/A | Code |
| Optimization-Inspired Cross-Attention Transformer for Compressive Sensing | Jiechong Song · Chong Mou · Shiqi Wang · Siwei Ma · Jian Zhang | N/A | Code |
| NeuMap: Neural Coordinate Mapping by Auto-Transdecoder for Camera Localization | Shitao Tang · Sicong Tang · Andrea Tagliasacchi · Ping Tan · Yasutaka Furukawa | N/A | Code |
| Back to the Source: Diffusion-Driven Adaptation To Test-Time Corruption | Jin Gao · Jialing Zhang · Xihui Liu · Trevor Darrell · Evan Shelhamer · Dequan Wang | N/A | Code |
| Learning Neural Duplex Radiance Fields for Real-Time View Synthesis | Ziyu Wan · Christian Richardt · Aljaž Božič · Chao Li · Vijay Rengarajan · Seonghyeon Nam · Xiaoyu Xiang · Tuotuo Li · Bo Zhu · Rakesh Ranjan · Jing Liao | N/A | Code |
| Object Pop-Up: Can We Infer 3D Objects and Their Poses From Human Interactions Alone? | Ilya A. Petrov · Riccardo Marin · Julian Chibane · Gerard Pons-Moll | N/A | Code |
| G-MSM: Unsupervised Multi-Shape Matching With Graph-Based Affinity Priors | Marvin Eisenberger · Aysim Toker · Laura Leal-Taixé · Daniel Cremers | N/A | Code |
| Data-Efficient Large Scale Place Recognition With Graded Similarity Supervision | María Leyva-Vallina · Nicola Strisciuglio · Nicolai Petkov | N/A | Code |
| Mapping Degeneration Meets Label Evolution: Learning Infrared Small Target Detection With Single Point Supervision | Xinyi Ying · Li Liu · Yingqian Wang · Ruojing Li · Nuo Chen · Zaiping Lin · Weidong Sheng · Shilin Zhou | N/A | Code |
| Instant Domain Augmentation for LiDAR Semantic Segmentation | Kwonyoung Ryu · Soonmin Hwang · Jaesik Park | N/A | Code |
| R2Former: Unified Retrieval and Reranking Transformer for Place Recognition | Sijie Zhu · Linjie Yang · Chen Chen · Mubarak Shah · Xiaohui Shen · Heng Wang | N/A | Code |
| Detecting and Grounding Multi-Modal Media Manipulation | Rui Shao · Tianxing Wu · Ziwei Liu | N/A | Code |
| Detecting Backdoors in Pre-Trained Encoders | Shiwei Feng · Guanhong Tao · Siyuan Cheng · Guangyu Shen · Xiangzhe Xu · Yingqi Liu · Kaiyuan Zhang · Shiqing Ma · Xiangyu Zhang | N/A | Code |
| Scaling Up GANs for Text-to-Image Synthesis | Minguk Kang · Jun-Yan Zhu · Richard Zhang · Jaesik Park · Eli Shechtman · Sylvain Paris · Taesung Park | N/A | Code |
| Dense-Localizing Audio-Visual Events in Untrimmed Videos: A Large-Scale Benchmark and Baseline | Tiantian Geng · Teng Wang · Jinming Duan · Runmin Cong · Feng Zheng | N/A | Code |
| PanoHead: Geometry-Aware 3D Full-Head Synthesis in 360° | Sizhe An · Hongyi Xu · Yichun Shi · Guoxian Song · Umit Y. Ogras · Linjie Luo | N/A | Code |
| Modality-Invariant Visual Odometry for Embodied Vision | Marius Memmel · Roman Bachmann · Amir Zamir | N/A | Code |
| 3D Video Loops From Asynchronous Input | Li Ma · Xiaoyu Li · Jing Liao · Pedro V. Sander | N/A | Code |
| Human-Art: A Versatile Human-Centric Dataset Bridging Natural and Artificial Scenes | Xuan Ju · Ailing Zeng · Jianan Wang · Qiang Xu · Lei Zhang | N/A | Code |
| PosterLayout: A New Benchmark and Approach for Content-Aware Visual-Textual Presentation Layout | Hsiao Yuan Hsu · Xiangteng He · Yuxin Peng · Hao Kong · Qing Zhang | N/A | Code |
| A Soma Segmentation Benchmark in Full Adult Fly Brain | Xiaoyu Liu · Bo Hu · Mingxing Li · Wei Huang · Yueyi Zhang · Zhiwei Xiong | N/A | Code |
| One-Stage 3D Whole-Body Mesh Recovery With Component Aware Transformer | Jing Lin · Ailing Zeng · Haoqian Wang · Lei Zhang · Yu Li | N/A | Code |
| Listening Human Behavior: 3D Human Pose Estimation With Acoustic Signals | Yuto Shibata · Yutaka Kawashima · Mariko Isogawa · Go Irie · Akisato Kimura · Yoshimitsu Aoki | N/A | Code |
| Hand Avatar: Free-Pose Hand Animation and Rendering From Monocular Video | Xingyu Chen · Baoyuan Wang · Heung-Yeung Shum | N/A | Code |
| M6Doc: A Large-Scale Multi-Format, Multi-Type, Multi-Layout, Multi-Language, Multi-Annotation Category Dataset for Modern Document Layout Analysis | Hiuyi Cheng · Peirong Zhang · Sihang Wu · Jiaxin Zhang · Qiyuan Zhu · Zecheng Xie · Jing Li · Kai Ding · Lianwen Jin | N/A | Code |
| Neural Congealing: Aligning Images to a Joint Semantic Atlas | Dolev Ofri-Amar · Michal Geyer · Yoni Kasten · Tali Dekel | N/A | Code |
| BoxTeacher: Exploring High-Quality Pseudo Labels for Weakly Supervised Instance Segmentation | Tianheng Cheng · Xinggang Wang · Shaoyu Chen · Qian Zhang · Wenyu Liu | N/A | Code |
| BEDLAM: A Synthetic Dataset of Bodies Exhibiting Detailed Lifelike Animated Motion | Michael J. Black · Priyanka Patel · Joachim Tesch · Jinlong Yang | N/A | Code |
| Mask DINO: Towards a Unified Transformer-Based Framework for Object Detection and Segmentation | Feng Li · Hao Zhang · Huaizhe Xu · Shilong Liu · Lei Zhang · Lionel M. Ni · Heung-Yeung Shum | N/A | Code |
| Learning Detailed Radiance Manifolds for High-Fidelity and 3D-Consistent Portrait Synthesis From Monocular Image | Yu Deng · Baoyuan Wang · Heung-Yeung Shum | N/A | Code |
| 3DAvatarGAN: Bridging Domains for Personalized Editable Avatars | Rameen Abdal · Hsin-Ying Lee · Peihao Zhu · Menglei Chai · Aliaksandr Siarohin · Peter Wonka · Sergey Tulyakov | N/A | Code |
| FLEX: Full-Body Grasping Without Full-Body Grasps | Purva Tendulkar · Dídac Surís · Carl Vondrick | N/A | Code |
| UDE: A Unified Driving Engine for Human Motion Generation | Zixiang Zhou · Baoyuan Wang | N/A | Code |
| Video Test-Time Adaptation for Action Recognition | Wei Lin · Muhammad Jehanzeb Mirza · Mateusz Kozinski · Horst Possegger · Hilde Kuehne · Horst Bischof | N/A | Code |
| Progressive Disentangled Representation Learning for Fine-Grained Controllable Talking Head Synthesis | Duomin Wang · Yu Deng · Zixin Yin · Heung-Yeung Shum · Baoyuan Wang | N/A | Code |
| MIME: Human-Aware 3D Scene Generation | Hongwei Yi · Chun-Hao P. Huang · Shashank Tripathi · Lea Hering · Justus Thies · Michael J. Black | N/A | Code |
| AstroNet: When Astrocyte Meets Artificial Neural Network | Mengqiao Han · Liyuan Pan · Xiabi Liu | N/A | Code |
| Stimulus Verification Is a Universal and Effective Sampler in Multi-Modal Human Trajectory Prediction | Jianhua Sun · Yuxuan Li · Liang Chai · Cewu Lu | N/A | Code |
| ActMAD: Activation Matching To Align Distributions for Test-Time-Training | Muhammad Jehanzeb Mirza · Pol Jané Soneira · Wei Lin · Mateusz Kozinski · Horst Possegger · Horst Bischof | N/A | Code |
| Visual Prompt Multi-Modal Tracking | Jiawen Zhu · Simiao Lai · Xin Chen · Dong Wang · Huchuan Lu | N/A | Code |
| Reconstructing Signing Avatars From Video Using Linguistic Priors | Maria-Paola Forte · Peter Kulits · Chun-Hao P. Huang · Vasileios Choutas · Dimitrios Tzionas · Katherine J. Kuchenbecker · Michael J. Black | N/A | Code |
| Patch-Based 3D Natural Scene Generation From a Single Example | Weiyu Li · Xuelin Chen · Jue Wang · Baoquan Chen | N/A | Code |
| Re-Basin via Implicit Sinkhorn Differentiation | Fidel A. Guerrero Peña · Heitor Rapela Medeiros · Thomas Dubail · Masih Aminbeidokhti · Eric Granger · Marco Pedersoli | N/A | Code |
| Slide-Transformer: Hierarchical Vision Transformer With Local Self-Attention | Xuran Pan · Tianzhu Ye · Zhuofan Xia · Shiji Song · Gao Huang | N/A | Code |
| Planning-Oriented Autonomous Driving | Yihan Hu · Jiazhi Yang · Li Chen · Keyu Li · Chonghao Sima · Xizhou Zhu · Siqi Chai · Senyao Du · Tianwei Lin · Wenhai Wang · Lewei Lu · Xiaosong Jia · Qiang Liu · Jifeng Dai · Yu Qiao · Hongyang Li | N/A | Code |
| Enhancing Deformable Local Features by Jointly Learning To Detect and Describe Keypoints | Guilherme Potje · Felipe Cadar · André Araujo · Renato Martins · Erickson R. Nascimento | N/A | Code |
| 3D Human Pose Estimation via Intuitive Physics | Shashank Tripathi · Lea Müller · Chun-Hao P. Huang · Omid Taheri · Michael J. Black · Dimitrios Tzionas | N/A | Code |
| Defending Against Patch-Based Backdoor Attacks on Self-Supervised Learning | Ajinkya Tejankar · Maziar Sanjabi · Qifan Wang · Sinong Wang · Hamed Firooz · Hamed Pirsiavash · Liang Tan | N/A | Code |
| PointCMP: Contrastive Mask Prediction for Self-Supervised Learning on Point Cloud Videos | Zhiqiang Shen · Xiaoxiao Sheng · Longguang Wang · Yulan Guo · Qiong Liu · Xi Zhou | N/A | Code |
| Blowing in the Wind: CycleNet for Human Cinemagraphs From Still Images | Hugo Bertiche · Niloy J. Mitra · Kuldeep Kulkarni · Chun-Hao P. Huang · Tuanfeng Y. Wang · Meysam Madadi · Sergio Escalera · Duygu Ceylan | N/A | Code |
| Multiple Instance Learning via Iterative Self-Paced Supervised Contrastive Learning | Kangning Liu · Weicheng Zhu · Yiqiu Shen · Sheng Liu · Narges Razavian · Krzysztof J. Geras · Carlos Fernandez-Granda | N/A | Code |
| Learning Steerable Function for Efficient Image Resampling | Jiacheng Li · Chang Chen · Wei Huang · Zhiqiang Lang · Fenglong Song · Youliang Yan · Zhiwei Xiong | N/A | Code |
| Deep Deterministic Uncertainty: A New Simple Baseline | Jishnu Mukhoti · Andreas Kirsch · Joost van Amersfoort · Philip H.S. Torr · Yarin Gal | N/A | Code |
| Removing Objects From Neural Radiance Fields | Silvan Weder · Guillermo Garcia-Hernando · Áron Monszpart · Marc Pollefeys · Gabriel J. Brostow · Michael Firman · Sara Vicente | N/A | Code |
| PartManip: Learning Cross-Category Generalizable Part Manipulation Policy From Point Cloud Observations | Haoran Geng · Ziming Li · Yiran Geng · Jiayi Chen · Hao Dong · He Wang | N/A | Code |
| T-SEA: Transfer-Based Self-Ensemble Attack on Object Detection | Hao Huang · Ziyan Chen · Huanran Chen · Yongtao Wang · Kevin Zhang | N/A | Code |
| DINN360: Deformable Invertible Neural Network for Latitude-Aware 360° Image Rescaling | Yichen Guo · Mai Xu · Lai Jiang · Leonid Sigal · Yunjin Chen | N/A | Code |
| Learning Human-to-Robot Handovers From Point Clouds | Sammy Christen · Wei Yang · Claudia Pérez-D’Arpino · Otmar Hilliges · Dieter Fox · Yu-Wei Chao | N/A | Code |
| Multi-View Azimuth Stereo via Tangent Space Consistency | Xu Cao · Hiroaki Santo · Fumio Okura · Yasuyuki Matsushita | N/A | Code |
| Mod-Squad: Designing Mixtures of Experts As Modular Multi-Task Learners | Zitian Chen · Yikang Shen · Mingyu Ding · Zhenfang Chen · Hengshuang Zhao · Erik G. Learned-Miller · Chuang Gan | N/A | Code |
| gSDF: Geometry-Driven Signed Distance Functions for 3D Hand-Object Reconstruction | Zerui Chen · Shizhe Chen · Cordelia Schmid · Ivan Laptev | N/A | Code |
| Delving StyleGAN Inversion for Image Editing: A Foundation Latent Space Viewpoint | Hongyu Liu · Yibing Song · Qifeng Chen | N/A | Code |
| Generative Bias for Robust Visual Question Answering | Jae Won Cho · Dong-Jin Kim · Hyeonggon Ryu · In So Kweon | N/A | Code |
| Backdoor Defense via Deconfounded Representation Learning | Zaixi Zhang · Qi Liu · Zhicai Wang · Zepu Lu · Qingyong Hu | N/A | Code |
| High-Fidelity 3D GAN Inversion by Pseudo-Multi-View Optimization | Jiaxin Xie · Hao Ouyang · Jingtan Piao · Chenyang Lei · Qifeng Chen | N/A | Code |
| Affordance Diffusion: Synthesizing Hand-Object Interactions | Yufei Ye · Xueting Li · Abhinav Gupta · Shalini De Mello · Stan Birchfield · Jiaming Song · Shubham Tulsiani · Sifei Liu | N/A | Code |
| Zero-Shot Pose Transfer for Unrigged Stylized 3D Characters | Jiashun Wang · Xueting Li · Sifei Liu · Shalini De Mello · Orazio Gallo · Xiaolong Wang · Jan Kautz | N/A | Code |
| Point Cloud Forecasting as a Proxy for 4D Occupancy Forecasting | Tarasha Khurana · Peiyun Hu · David Held · Deva Ramanan | N/A | Code |
| Are Data-Driven Explanations Robust Against Out-of-Distribution Data? | Tang Li · Fengchun Qiao · Mengmeng Ma · Xi Peng | N/A | Code |
| Multiscale Tensor Decomposition and Rendering Equation Encoding for View Synthesis | Kang Han · Wei Xiang | N/A | Code |
| Boosting Video Object Segmentation via Space-Time Correspondence Learning | Yurong Zhang · Liulei Li · Wenguan Wang · Rong Xie · Li Song · Wenjun Zhang | N/A | Code |
| X-Pruner: eXplainable Pruning for Vision Transformers | Lu Yu · Wei Xiang | N/A | Code |
| GazeNeRF: 3D-Aware Gaze Redirection With Neural Radiance Fields | Alessandro Ruzzi · Xiangwei Shi · Xi Wang · Gengyan Li · Shalini De Mello · Hyung Jin Chang · Xucong Zhang · Otmar Hilliges | N/A | Code |
| Real-Time Evaluation in Online Continual Learning: A New Hope | Yasir Ghunaim · Adel Bibi · Kumail Alhamoud · Motasem Alfarra · Hasan Abed Al Kader Hammoud · Ameya Prabhu · Philip H.S. Torr · Bernard Ghanem | N/A | Code |
| Contrastive Semi-Supervised Learning for Underwater Image Restoration via Reliable Bank | Shirui Huang · Keyan Wang · Huan Liu · Jun Chen · Yunsong Li | N/A | Code |
| A New Dataset Based on Images Taken by Blind People for Testing the Robustness of Image Classification Models Trained for ImageNet Categories | Reza Akbarian Bafghi · Danna Gurari | N/A | Code |
| Open-Vocabulary Panoptic Segmentation With Text-to-Image Diffusion Models | Jiarui Xu · Sifei Liu · Arash Vahdat · Wonmin Byeon · Xiaolong Wang · Shalini De Mello | N/A | Code |
| Reconstructing Animatable Categories From Videos | Gengshan Yang · Chaoyang Wang · N. Dinesh Reddy · Deva Ramanan | N/A | Code |
| Learning Visual Representations via Language-Guided Sampling | Mohamed El Banani · Karan Desai · Justin Johnson | N/A | Code |
| Four-View Geometry With Unknown Radial Distortion | Petr Hruby · Viktor Korotynskiy · Timothy Duff · Luke Oeding · Marc Pollefeys · Tomas Pajdla · Viktor Larsson | N/A | Code |
| DATID-3D: Diversity-Preserved Domain Adaptation Using Text-to-Image Diffusion for 3D Generative Model | Gwanghyun Kim · Se Young Chun | N/A | Code |
| ConZIC: Controllable Zero-Shot Image Captioning by Sampling-Based Polishing | Zequn Zeng · Hao Zhang · Ruiying Lu · Dongsheng Wang · Bo Chen · Zhengjue Wang | N/A | Code |
| Feature Separation and Recalibration for Adversarial Robustness | Woo Jae Kim · Yoonki Cho · Junsik Jung · Sung-Eui Yoon | N/A | Code |
| Event-Based Blurry Frame Interpolation Under Blind Exposure | Wenming Weng · Yueyi Zhang · Zhiwei Xiong | N/A | Code |
| MobileNeRF: Exploiting the Polygon Rasterization Pipeline for Efficient Neural Field Rendering on Mobile Architectures | Zhiqin Chen · Thomas Funkhouser · Peter Hedman · Andrea Tagliasacchi | N/A | Code |
| HandsOff: Labeled Dataset Generation With No Additional Human Annotations | Austin Xu · Mariya I. Vasileva · Achal Dave · Arjun Seshadri | N/A | Code |
| Analyzing and Diagnosing Pose Estimation With Attributions | Qiyuan He · Linlin Yang · Kerui Gu · Qiuxia Lin · Angela Yao | N/A | Code |
| Overcoming the Trade-Off Between Accuracy and Plausibility in 3D Hand Shape Reconstruction | Ziwei Yu · Chen Li · Linlin Yang · Xiaoxu Zheng · Michael Bi Mi · Gim Hee Lee · Angela Yao | N/A | Code |
| VIVE3D: Viewpoint-Independent Video Editing Using 3D-Aware GANs | Anna Frühstück · Nikolaos Sarafianos · Yuanlu Xu · Peter Wonka · Tony Tung | N/A | Code |
| Pruning Parameterization With Bi-Level Optimization for Efficient Semantic Segmentation on the Edge | Changdi Yang · Pu Zhao · Yanyu Li · Wei Niu · Jiexiong Guan · Hao Tang · Minghai Qin · Bin Ren · Xue Lin · Yanzhi Wang | N/A | Code |
| Nerflets: Local Radiance Fields for Efficient Structure-Aware 3D Scene Representation From 2D Supervision | Xiaoshuai Zhang · Abhijit Kundu · Thomas Funkhouser · Leonidas Guibas · Hao Su · Kyle Genova | N/A | Code |
| VDN-NeRF: Resolving Shape-Radiance Ambiguity via View-Dependence Normalization | Bingfan Zhu · Yanchao Yang · Xulong Wang · Youyi Zheng · Leonidas Guibas | N/A | Code |
| OpenScene: 3D Scene Understanding With Open Vocabularies | Songyou Peng · Kyle Genova · Chiyu “Max” Jiang · Andrea Tagliasacchi · Marc Pollefeys · Thomas Funkhouser | N/A | Code |
| A New Benchmark: On the Utility of Synthetic Data With Blender for Bare Supervised Learning and Downstream Domain Adaptation | Hui Tang · Kui Jia | N/A | Code |
| Implicit View-Time Interpolation of Stereo Videos Using Multi-Plane Disparities and Non-Uniform Coordinates | Avinash Paliwal · Andrii Tsarov · Nima Khademi Kalantari | N/A | Code |
| A Large-Scale Homography Benchmark | Daniel Barath · Dmytro Mishkin · Michal Polic · Wolfgang Förstner · Jiri Matas | N/A | Code |
| Glocal Energy-Based Learning for Few-Shot Open-Set Recognition | Haoyu Wang · Guansong Pang · Peng Wang · Lei Zhang · Wei Wei · Yanning Zhang | N/A | Code |
| MEDIC: Remove Model Backdoors via Importance Driven Cloning | Qiuling Xu · Guanhong Tao · Jean Honorio · Yingqi Liu · Shengwei An · Guangyu Shen · Siyuan Cheng · Xiangyu Zhang | N/A | Code |
| Finding Geometric Models by Clustering in the Consensus Space | Daniel Barath · Denys Rozumnyi · Ivan Eichhardt · Levente Hajder · Jiri Matas | N/A | Code |
| Imagic: Text-Based Real Image Editing With Diffusion Models | Bahjat Kawar · Shiran Zada · Oran Lang · Omer Tov · Huiwen Chang · Tali Dekel · Inbar Mosseri · Michal Irani | N/A | Code |
| DeepLSD: Line Segment Detection and Refinement With Deep Image Gradients | Rémi Pautrat · Daniel Barath · Viktor Larsson · Martin R. Oswald · Marc Pollefeys | N/A | Code |
| H2ONet: Hand-Occlusion-and-Orientation-Aware Network for Real-Time 3D Hand Mesh Reconstruction | Hao Xu · Tianyu Wang · Xiao Tang · Chi-Wing Fu | N/A | Code |
| Learning Weather-General and Weather-Specific Features for Image Restoration Under Multiple Adverse Weather Conditions | Yurui Zhu · Tianyu Wang · Xueyang Fu · Xuanyu Yang · Xin Guo · Jifeng Dai · Yu Qiao · Xiaowei Hu | N/A | Code |
| MoDi: Unconditional Motion Synthesis From Diverse Data | Sigal Raab · Inbal Leibovitch · Peizhuo Li · Kfir Aberman · Olga Sorkine-Hornung · Daniel Cohen-Or | N/A | Code |
| PC2: Projection-Conditioned Point Cloud Diffusion for Single-Image 3D Reconstruction | Luke Melas-Kyriazi · Christian Rupprecht · Andrea Vedaldi | N/A | Code |
| SliceMatch: Geometry-Guided Aggregation for Cross-View Pose Estimation | Zimin Xia · Ted Lentsch · Julian F. P. Kooij | N/A | Code |
| RealFusion: 360° Reconstruction of Any Object From a Single Image | Luke Melas-Kyriazi · Iro Laina · Christian Rupprecht · Andrea Vedaldi | N/A | Code |
| Masked and Adaptive Transformer for Exemplar Based Image Translation | Chang Jiang · Fei Gao · Biao Ma · Yuhao Lin · Nannan Wang · Gang Xu | N/A | Code |
| DynamicStereo: Consistent Dynamic Depth From Stereo Videos | Nikita Karaev · Ignacio Rocco · Benjamin Graham · Natalia Neverova · Andrea Vedaldi · Christian Rupprecht | N/A | Code |
| Masked Representation Learning for Domain Generalized Stereo Matching | Zhibo Rao · Bangshu Xiong · Mingyi He · Mochu Xiang · Renjie He · Zhelun Shen · Xing Li | N/A | Code |
| MV-JAR: Masked Voxel Jigsaw and Reconstruction for LiDAR-Based Self-Supervised Pre-Training | Runsen Xu · Tai Wang · Wenwei Zhang · Runjian Chen · Jinkun Cao · Jiangmiao Pang · Dahua Lin | N/A | Code |
| Fresnel Microfacet BRDF: Unification of Polari-Radiometric Surface-Body Reflection | Tomoki Ichikawa · Yoshiki Fukao · Shohei Nobuhara · Ko Nishino | N/A | Code |
| Instant Multi-View Head Capture Through Learnable Registration | Timo Bolkart · Tianye Li · Michael J. Black | N/A | Code |
| POEM: Reconstructing Hand in a Point Embedded Multi-View Stereo | Lixin Yang · Jian Xu · Licheng Zhong · Xinyu Zhan · Zhicheng Wang · Kejian Wu · Cewu Lu | N/A | Code |
| Diffusion-Based Generation, Optimization, and Planning in 3D Scenes | Siyuan Huang · Zan Wang · Puhao Li · Baoxiong Jia · Tengyu Liu · Yixin Zhu · Wei Liang · Song-Chun Zhu | N/A | Code |
| Visibility Constrained Wide-Band Illumination Spectrum Design for Seeing-in-the-Dark | Muyao Niu · Zhuoxiao Li · Zhihang Zhong · Yinqiang Zheng | N/A | Code |
| SketchXAI: A First Look at Explainability for Human Sketches | Zhiyu Qu · Yulia Gryaditskaya · Ke Li · Kaiyue Pang · Tao Xiang · Yi-Zhe Song | N/A | Code |
| TTA-COPE: Test-Time Adaptation for Category-Level Object Pose Estimation | Taeyeop Lee · Jonathan Tremblay · Valts Blukis · Bowen Wen · Byeong-Uk Lee · Inkyu Shin · Stan Birchfield · In So Kweon · Kuk-Jin Yoon | N/A | Code |
| Teleidoscopic Imaging System for Microscale 3D Shape Reconstruction | Ryo Kawahara · Meng-Yu Jennifer Kuo · Shohei Nobuhara | N/A | Code |
| Reliability in Semantic Segmentation: Are We on the Right Track? | Pau de Jorge · Riccardo Volpi · Philip H.S. Torr · Grégory Rogez | N/A | Code |
| SMPConv: Self-Moving Point Representations for Continuous Convolution | Sanghyeon Kim · Eunbyung Park | N/A | Code |
| Few-Shot Geometry-Aware Keypoint Localization | Xingzhe He · Gaurav Bharaj · David Ferman · Helge Rhodin · Pablo Garrido | N/A | Code |
| STMT: A Spatial-Temporal Mesh Transformer for MoCap-Based Action Recognition | Xiaoyu Zhu · Po-Yao Huang · Junwei Liang · Celso M. de Melo · Alexander G. Hauptmann | N/A | Code |
| Knowledge Combination To Learn Rotated Detection Without Rotated Annotation | Tianyu Zhu · Bryce Ferenczi · Pulak Purkait · Tom Drummond · Hamid Rezatofighi · Anton van den Hengel | N/A | Code |
| OTAvatar: One-Shot Talking Face Avatar With Controllable Tri-Plane Rendering | Zhiyuan Ma · Xiangyu Zhu · Guo-Jun Qi · Zhen Lei · Lei Zhang | N/A | Code |
| Supervised Masked Knowledge Distillation for Few-Shot Transformers | Han Lin · Guangxing Han · Jiawei Ma · Shiyuan Huang · Xudong Lin · Shih-Fu Chang | N/A | Code |
| Learning Open-Vocabulary Semantic Segmentation Models From Natural Language Supervision | Jilan Xu · Junlin Hou · Yuejie Zhang · Rui Feng · Yi Wang · Yu Qiao · Weidi Xie | N/A | Code |
| ImageNet-E: Benchmarking Neural Network Robustness via Attribute Editing | Xiaodan Li · Yuefeng Chen · Yao Zhu · Shuhui Wang · Rong Zhang · Hui Xue | N/A | Code |
| Neural Residual Radiance Fields for Streamably Free-Viewpoint Videos | Liao Wang · Qiang Hu · Qihan He · Ziyu Wang · Jingyi Yu · Tinne Tuytelaars · Lan Xu · Minye Wu | N/A | Code |
| Active Finetuning: Exploiting Annotation Budget in the Pretraining-Finetuning Paradigm | Yichen Xie · Han Lu · Junchi Yan · Xiaokang Yang · Masayoshi Tomizuka · Wei Zhan | N/A | Code |
| Tensor4D: Efficient Neural 4D Decomposition for High-Fidelity Dynamic Reconstruction and Rendering | Ruizhi Shao · Zerong Zheng · Hanzhang Tu · Boning Liu · Hongwen Zhang · Yebin Liu | N/A | Code |
| RiDDLE: Reversible and Diversified De-Identification With Latent Encryptor | Dongze Li · Wei Wang · Kang Zhao · Jing Dong · Tieniu Tan | N/A | Code |
| RobustNeRF: Ignoring Distractors With Robust Losses | Sara Sabour · Suhani Vora · Daniel Duckworth · Ivan Krasin · David J. Fleet · Andrea Tagliasacchi | N/A | Code |
| Bitstream-Corrupted JPEG Images Are Restorable: Two-Stage Compensation and Alignment Framework for Image Restoration | Wenyang Liu · Yi Wang · Kim-Hui Yap · Lap-Pui Chau | N/A | Code |
| HierVL: Learning Hierarchical Video-Language Embeddings | Kumar Ashutosh · Rohit Girdhar · Lorenzo Torresani · Kristen Grauman | N/A | Code |
| Phone2Proc: Bringing Robust Robots Into Our Chaotic World | Matt Deitke · Rose Hendrix · Ali Farhadi · Kiana Ehsani · Aniruddha Kembhavi | N/A | Code |
| A Light Touch Approach to Teaching Transformers Multi-View Geometry | Yash Bhalgat · João F. Henriques · Andrew Zisserman | N/A | Code |
| Clothed Human Performance Capture With a Double-Layer Neural Radiance Fields | Kangkan Wang · Guofeng Zhang · Suxu Cong · Jian Yang | N/A | Code |
| AutoFocusFormer: Image Segmentation off the Grid | Chen Ziwen · Kaushik Patnaik · Shuangfei Zhai · Alvin Wan · Zhile Ren · Alexander G. Schwing · Alex Colburn · Li Fuxin | N/A | Code |
| Trace and Pace: Controllable Pedestrian Animation via Guided Trajectory Diffusion | Davis Rempe · Zhengyi Luo · Xue Bin Peng · Ye Yuan · Kris Kitani · Karsten Kreis · Sanja Fidler · Or Litany | N/A | Code |
| Observation-Centric SORT: Rethinking SORT for Robust Multi-Object Tracking | Jinkun Cao · Jiangmiao Pang · Xinshuo Weng · Rawal Khirodkar · Kris Kitani | N/A | Code |
| Spider GAN: Leveraging Friendly Neighbors To Accelerate GAN Training | Siddarth Asokan · Chandra Sekhar Seelamantula | N/A | Code |
| Minimizing the Accumulated Trajectory Error To Improve Dataset Distillation | Jiawei Du · Yidi Jiang · Vincent Y. F. Tan · Joey Tianyi Zhou · Haizhou Li | N/A | Code |
| Adaptive Patch Deformation for Textureless-Resilient Multi-View Stereo | Yuesong Wang · Zhaojie Zeng · Tao Guan · Wei Yang · Zhuo Chen · Wenkai Liu · Luoyuan Xu · Yawei Luo | N/A | Code |
| Learning Correspondence Uncertainty via Differentiable Nonlinear Least Squares | Dominik Muhle · Lukas Koestler · Krishna Murthy Jatavallabhula · Daniel Cremers | N/A | Code |
| Learning Anchor Transformations for 3D Garment Animation | Fang Zhao · Zekun Li · Shaoli Huang · Junwu Weng · Tianfei Zhou · Guo-Sen Xie · Jue Wang · Ying Shan | N/A | Code |
| PyPose: A Library for Robot Learning With Physics-Based Optimization | Chen Wang · Dasong Gao · Kuan Xu · Junyi Geng · Yaoyu Hu · Yuheng Qiu · Bowen Li · Fan Yang · Brady Moon · Abhinav Pandey · Aryan · Jiahe Xu · Tianhao Wu · Haonan He · Daning Huang · Zhongqiang Ren · Shibo Zhao · Taimeng Fu · Pranay Reddy · Xiao Lin · Wenshan Wang · Jingnan Shi · Rajat Talak · Kun Cao · Yi Du · Han Wang · Huai Yu · Shanzhao Wang · Siyu Chen · Ananth Kashyap · Rohan Bandaru · Karthik Dantu · Jiajun Wu · Lihua Xie · Luca Carlone · Marco Hutter · Sebastian Scherer | N/A | Code |
| Unicode Analogies: An Anti-Objectivist Visual Reasoning Challenge | Steven Spratley · Krista A. Ehinger · Tim Miller | N/A | Code |
| DyNCA: Real-Time Dynamic Texture Synthesis Using Neural Cellular Automata | Ehsan Pajouheshgar · Yitao Xu · Tong Zhang · Sabine Süsstrunk | N/A | Code |
| Learning Generative Structure Prior for Blind Text Image Super-Resolution | Xiaoming Li · Wangmeng Zuo · Chen Change Loy | N/A | Code |
| CAMS: CAnonicalized Manipulation Spaces for Category-Level Functional Hand-Object Manipulation Synthesis | Juntian Zheng · Qingyuan Zheng · Lixing Fang · Yun Liu · Li Yi | N/A | Code |
| SCPNet: Semantic Scene Completion on Point Cloud | Zhaoyang Xia · Youquan Liu · Xin Li · Xinge Zhu · Yuexin Ma · Yikang Li · Yuenan Hou · Yu Qiao | N/A | Code |
| AMT: All-Pairs Multi-Field Transforms for Efficient Frame Interpolation | Zhen Li · Zuo-Liang Zhu · Ling-Hao Han · Qibin Hou · Chun-Le Guo · Ming-Ming Cheng | N/A | Code |
| Behavioral Analysis of Vision-and-Language Navigation Agents | Zijiao Yang · Arjun Majumdar · Stefan Lee | N/A | Code |
| Geometry and Uncertainty-Aware 3D Point Cloud Class-Incremental Semantic Segmentation | Yuwei Yang · Munawar Hayat · Zhao Jin · Chao Ren · Yinjie Lei | N/A | Code |
| Directional Connectivity-Based Segmentation of Medical Images | Ziyun Yang · Sina Farsiu | N/A | Code |
| ScanDMM: A Deep Markov Model of Scanpath Prediction for 360° Images | Xiangjie Sui · Yuming Fang · Hanwei Zhu · Shiqi Wang · Zhou Wang | N/A | Code |
| 3D Shape Reconstruction of Semi-Transparent Worms | Thomas P. Ilett · Omer Yuval · Thomas Ranner · Netta Cohen · David C. Hogg | N/A | Code |
| Patch-Craft Self-Supervised Training for Correlated Image Denoising | Gregory Vaksman · Michael Elad | N/A | Code |
| NeAT: Learning Neural Implicit Surfaces With Arbitrary Topologies From Multi-View Images | Xiaoxu Meng · Weikai Chen · Bo Yang | N/A | Code |
| DANI-Net: Uncalibrated Photometric Stereo by Differentiable Shadow Handling, Anisotropic Reflectance Modeling, and Neural Inverse Rendering | Zongrui Li · Qian Zheng · Boxin Shi · Gang Pan · Xudong Jiang | N/A | Code |
| Context-Aware Alignment and Mutual Masking for 3D-Language Pre-Training | Zhao Jin · Munawar Hayat · Yuwei Yang · Yulan Guo · Yinjie Lei | N/A | Code |
| Unsupervised Object Localization: Observing the Background To Discover Objects | Oriane Siméoni · Chloé Sekkat · Gilles Puy · Antonín Vobecký · Éloi Zablocki · Patrick Pérez | N/A | Code |
| Bootstrap Your Own Prior: Towards Distribution-Agnostic Novel Class Discovery | Muli Yang · Liancheng Wang · Cheng Deng · Hanwang Zhang | N/A | Code |
| Self-Supervised Geometry-Aware Encoder for Style-Based 3D GAN Inversion | Yushi Lan · Xuyi Meng · Shuai Yang · Chen Change Loy · Bo Dai | N/A | Code |
| NeuralField-LDM: Scene Generation With Hierarchical Latent Diffusion Models | Seung Wook Kim · Bradley Brown · Kangxue Yin · Karsten Kreis · Katja Schwarz · Daiqing Li · Robin Rombach · Antonio Torralba · Sanja Fidler | N/A | Code |
| ALSO: Automotive Lidar Self-Supervision by Occupancy Estimation | Alexandre Boulch · Corentin Sautier · Björn Michele · Gilles Puy · Renaud Marlet | N/A | Code |
| RepMode: Learning to Re-Parameterize Diverse Experts for Subcellular Structure Prediction | Donghao Zhou · Chunbin Gu · Junde Xu · Furui Liu · Qiong Wang · Guangyong Chen · Pheng-Ann Heng | N/A | Code |
| Aligning Bag of Regions for Open-Vocabulary Object Detection | Size Wu · Wenwei Zhang · Sheng Jin · Wentao Liu · Chen Change Loy | N/A | Code |
| Neuralangelo: High-Fidelity Neural Surface Reconstruction | Zhaoshuo Li · Thomas Müller · Alex Evans · Russell H. Taylor · Mathias Unberath · Ming-Yu Liu · Chen-Hsuan Lin | N/A | Code |
| PCT-Net: Full Resolution Image Harmonization Using Pixel-Wise Color Transformations | Julian Jorge Andrade Guerreiro · Mitsuru Nakazawa · Björn Stenger | N/A | Code |
| PaCa-ViT: Learning Patch-to-Cluster Attention in Vision Transformers | Ryan Grainger · Thomas Paniagua · Xi Song · Naresh Cuntoor · Mun Wai Lee · Tianfu Wu | N/A | Code |
| Towards Better Stability and Adaptability: Improve Online Self-Training for Model Adaptation in Semantic Segmentation | Dong Zhao · Shuang Wang · Qi Zang · Dou Quan · Xiutiao Ye · Licheng Jiao | N/A | Code |
| MEGANE: Morphable Eyeglass and Avatar Network | Junxuan Li · Shunsuke Saito · Tomas Simon · Stephen Lombardi · Hongdong Li · Jason Saragih | N/A | Code |
| Generalizable Implicit Neural Representations via Instance Pattern Composers | Chiheon Kim · Doyup Lee · Saehoon Kim · Minsu Cho · Wook-Shin Han | N/A | Code |
| Revisiting Rolling Shutter Bundle Adjustment: Toward Accurate and Fast Solution | Bangyan Liao · Delin Qu · Yifei Xue · Huiqing Zhang · Yizhen Lao | N/A | Code |
| Distribution Shift Inversion for Out-of-Distribution Prediction | Runpeng Yu · Songhua Liu · Xingyi Yang · Xinchao Wang | N/A | Code |
| Wide-Angle Rectification via Content-Aware Conformal Mapping | Qi Zhang · Hongdong Li · Qing Wang | N/A | Code |
| WildLight: In-the-Wild Inverse Rendering With a Flashlight | Ziang Cheng · Junxuan Li · Hongdong Li | N/A | Code |
| Physics-Driven Diffusion Models for Impact Sound Synthesis From Videos | Kun Su · Kaizhi Qian · Eli Shlizerman · Antonio Torralba · Chuang Gan | N/A | Code |
| Probing Neural Representations of Scene Perception in a Hippocampally Dependent Task Using Artificial Neural Networks | Markus Frey · Christian F. Doeller · Caswell Barry | N/A | Code |
| Inverting the Imaging Process by Learning an Implicit Camera Model | Xin Huang · Qi Zhang · Ying Feng · Hongdong Li · Qing Wang | N/A | Code |
| EC2: Emergent Communication for Embodied Control | Yao Mu · Shunyu Yao · Mingyu Ding · Ping Luo · Chuang Gan | N/A | Code |
| Light Source Separation and Intrinsic Image Decomposition Under AC Illumination | Yusaku Yoshida · Ryo Kawahara · Takahiro Okabe | N/A | Code |
| FREDOM: Fairness Domain Adaptation Approach to Semantic Scene Understanding | Thanh-Dat Truong · Ngan Le · Bhiksha Raj · Jackson Cothren · Khoa Luu | N/A | Code |
| Learning Locally Editable Virtual Humans | Hsuan-I Ho · Lixin Xue · Jie Song · Otmar Hilliges | N/A | Code |
| Open-Vocabulary Point-Cloud Object Detection Without 3D Annotation | Yuheng Lu · Chenfeng Xu · Xiaobao Wei · Xiaodong Xie · Masayoshi Tomizuka · Kurt Keutzer · Shanghang Zhang | N/A | Code |
| Frustratingly Easy Regularization on Representation Can Boost Deep Reinforcement Learning | Qiang He · Huangyuan Su · Jieyu Zhang · Xinwen Hou | N/A | Code |
| PiMAE: Point Cloud and Image Interactive Masked Autoencoders for 3D Object Detection | Anthony Chen · Kevin Zhang · Renrui Zhang · Zihan Wang · Yuheng Lu · Yandong Guo · Shanghang Zhang | N/A | Code |
| OrienterNet: Visual Localization in 2D Public Maps With Neural Matching | Paul-Edouard Sarlin · Daniel DeTone · Tsun-Yi Yang · Armen Avetisyan · Julian Straub · Tomasz Malisiewicz · Samuel Rota Bulò · Richard Newcombe · Peter Kontschieder · Vasileios Balntas | N/A | Code |
| Class Relationship Embedded Learning for Source-Free Unsupervised Domain Adaptation | Yixin Zhang · Zilei Wang · Weinan He | N/A | Code |
| Efficient Movie Scene Detection Using State-Space Transformers | Md Mohaiminul Islam · Mahmudul Hasan · Kishan Shamsundar Athrey · Tony Braskich · Gedas Bertasius | N/A | Code |
| Structural Multiplane Image: Bridging Neural View Synthesis and 3D Reconstruction | Mingfang Zhang · Jinglu Wang · Xiao Li · Yifei Huang · Yoichi Sato · Yan Lu | N/A | Code |
| FAME-ViL: Multi-Tasking Vision-Language Model for Heterogeneous Fashion Tasks | Xiao Han · Xiatian Zhu · Licheng Yu · Li Zhang · Yi-Zhe Song · Tao Xiang | N/A | Code |
| Understanding and Constructing Latent Modality Structures in Multi-Modal Representation Learning | Qian Jiang · Changyou Chen · Han Zhao · Liqun Chen · Qing Ping · Son Dinh Tran · Yi Xu · Belinda Zeng · Trishul Chilimbi | N/A | Code |
| Level-S$^2$fM: Structure From Motion on Neural Level Set of Implicit Surfaces | Yuxi Xiao · Nan Xue · Tianfu Wu · Gui-Song Xia | N/A | Code |
| Consistent-Teacher: Towards Reducing Inconsistent Pseudo-Targets in Semi-Supervised Object Detection | Xinjiang Wang · Xingyi Yang · Shilong Zhang · Yijiang Li · Litong Feng · Shijie Fang · Chengqi Lyu · Kai Chen · Wayne Zhang | N/A | Code |
| Dense Distinct Query for End-to-End Object Detection | Shilong Zhang · Xinjiang Wang · Jiaqi Wang · Jiangmiao Pang · Chengqi Lyu · Wenwei Zhang · Ping Luo · Kai Chen | N/A | Code |
| ARCTIC: A Dataset for Dexterous Bimanual Hand-Object Manipulation | Zicong Fan · Omid Taheri · Dimitrios Tzionas · Muhammed Kocabas · Manuel Kaufmann · Michael J. Black · Otmar Hilliges | N/A | Code |
| BiFormer: Vision Transformer With Bi-Level Routing Attention | Lei Zhu · Xinjiang Wang · Zhanghan Ke · Wayne Zhang · Rynson W.H. Lau | N/A | Code |
| Hierarchical Video-Moment Retrieval and Step-Captioning | Abhay Zala · Jaemin Cho · Satwik Kottur · Xilun Chen · Barlas Oguz · Yashar Mehdad · Mohit Bansal | N/A | Code |
| Progressive Open Space Expansion for Open-Set Model Attribution | Tianyun Yang · Danding Wang · Fan Tang · Xinying Zhao · Juan Cao · Sheng Tang | N/A | Code |
| Deep Depth Estimation From Thermal Image | Ukcheol Shin · Jinsun Park · In So Kweon | N/A | Code |
| Incremental 3D Semantic Scene Graph Prediction From RGB Sequences | Shun-Cheng Wu · Keisuke Tateno · Nassir Navab · Federico Tombari | N/A | Code |
| Visual Programming: Compositional Visual Reasoning Without Training | Tanmay Gupta · Aniruddha Kembhavi | N/A | Code |
| Change-Aware Sampling and Contrastive Learning for Satellite Images | Utkarsh Mall · Bharath Hariharan · Kavita Bala | N/A | Code |
| NULL-Text Inversion for Editing Real Images Using Guided Diffusion Models | Ron Mokady · Amir Hertz · Kfir Aberman · Yael Pritch · Daniel Cohen-Or | N/A | Code |
| RIDCP: Revitalizing Real Image Dehazing via High-Quality Codebook Priors | Rui-Qi Wu · Zheng-Peng Duan · Chun-Le Guo · Zhi Chai · Chongyi Li | N/A | Code |
| Neural Part Priors: Learning To Optimize Part-Based Object Completion in RGB-D Scans | Aleksei Bokhovkin · Angela Dai | N/A | Code |
| Hierarchical Discriminative Learning Improves Visual Representations of Biomedical Microscopy | Cheng Jiang · Xinhai Hou · Akhil Kondepudi · Asadur Chowdury · Christian W. Freudiger · Daniel A. Orringer · Honglak Lee · Todd C. Hollon | N/A | Code |
| Domain Expansion of Image Generators | Yotam Nitzan · Michaël Gharbi · Richard Zhang · Taesung Park · Jun-Yan Zhu · Daniel Cohen-Or · Eli Shechtman | N/A | Code |
| “Seeing” Electric Network Frequency From Events | Lexuan Xu · Guang Hua · Haijian Zhang · Lei Yu · Ning Qiao | N/A | Code |
| MetaFusion: Infrared and Visible Image Fusion via Meta-Feature Embedding From Object Detection | Wenda Zhao · Shigeng Xie · Fan Zhao · You He · Huchuan Lu | N/A | Code |
| Adaptive Spot-Guided Transformer for Consistent Local Feature Matching | Jiahuan Yu · Jiahao Chang · Jianfeng He · Tianzhu Zhang · Jiyang Yu · Feng Wu | N/A | Code |
| SE-ORNet: Self-Ensembling Orientation-Aware Network for Unsupervised Point Cloud Shape Correspondence | Jiacheng Deng · Chuxin Wang · Jiahao Lu · Jianfeng He · Tianzhu Zhang · Jiyang Yu · Zhe Zhang | N/A | Code |
| Dynamic Coarse-To-Fine Learning for Oriented Tiny Object Detection | Chang Xu · Jian Ding · Jinwang Wang · Wen Yang · Huai Yu · Lei Yu · Gui-Song Xia | N/A | Code |
| Out-of-Distributed Semantic Pruning for Robust Semi-Supervised Learning | Yu Wang · Pengchong Qiao · Chang Liu · Guoli Song · Xiawu Zheng · Jie Chen | N/A | Code |
| Seeing With Sound: Long-Range Acoustic Beamforming for Multimodal Scene Understanding | Praneeth Chakravarthula · Jim Aldon D’Souza · Ethan Tseng · Joe Bartusek · Felix Heide | N/A | Code |
| DNF: Decouple and Feedback Network for Seeing in the Dark | Xin Jin · Ling-Hao Han · Zhen Li · Chun-Le Guo · Zhi Chai · Chongyi Li | N/A | Code |
| CoWs on Pasture: Baselines and Benchmarks for Language-Driven Zero-Shot Object Navigation | Samir Yitzhak Gadre · Mitchell Wortsman · Gabriel Ilharco · Ludwig Schmidt · Shuran Song | N/A | Code |
| NVTC: Nonlinear Vector Transform Coding | Runsen Feng · Zongyu Guo · Weiping Li · Zhibo Chen | N/A | Code |
| Towards Unified Scene Text Spotting Based on Sequence Generation | Taeho Kil · Seonghyeon Kim · Sukmin Seo · Yoonsik Kim · Daehee Kim | N/A | Code |
| Tell Me What Happened: Unifying Text-Guided Video Completion via Multimodal Masked Video Generation | Tsu-Jui Fu · Licheng Yu · Ning Zhang · Cheng-Yang Fu · Jong-Chyi Su · William Yang Wang · Sean Bell | N/A | Code |
| Fuzzy Positive Learning for Semi-Supervised Semantic Segmentation | Pengchong Qiao · Zhidan Wei · Yu Wang · Zhennan Wang · Guoli Song · Fan Xu · Xiangyang Ji · Chang Liu · Jie Chen | N/A | Code |
| Progressively Optimized Local Radiance Fields for Robust View Synthesis | Andréas Meuleman · Yu-Lun Liu · Chen Gao · Jia-Bin Huang · Changil Kim · Min H. Kim · Johannes Kopf | N/A | Code |
| Neural Map Prior for Autonomous Driving | Xuan Xiong · Yicheng Liu · Tianyuan Yuan · Yue Wang · Yilun Wang · Hang Zhao | N/A | Code |
| Efficient and Explicit Modelling of Image Hierarchies for Image Restoration | Yawei Li · Yuchen Fan · Xiaoyu Xiang · Denis Demandolx · Rakesh Ranjan · Radu Timofte · Luc Van Gool | N/A | Code |
| F2-NeRF: Fast Neural Radiance Field Training With Free Camera Trajectories | Peng Wang · Yuan Liu · Zhaoxi Chen · Lingjie Liu · Ziwei Liu · Taku Komura · Christian Theobalt · Wenping Wang | N/A | Code |
| Procedure-Aware Pretraining for Instructional Video Understanding | Honglu Zhou · Roberto Martín-Martín · Mubbasir Kapadia · Silvio Savarese · Juan Carlos Niebles | N/A | Code |
| High-Fidelity Guided Image Synthesis With Latent Diffusion Models | Jaskirat Singh · Stephen Gould · Liang Zheng | N/A | Code |
| Progressive Random Convolutions for Single Domain Generalization | Seokeon Choi · Debasmit Das · Sungha Choi · Seunghan Yang · Hyunsin Park · Sungrack Yun | N/A | Code |
| EcoTTA: Memory-Efficient Continual Test-Time Adaptation via Self-Distilled Regularization | Junha Song · Jungsoo Lee · In So Kweon · Sungha Choi | N/A | Code |
| NoPe-NeRF: Optimising Neural Radiance Field With No Pose Prior | Wenjing Bian · Zirui Wang · Kejie Li · Jia-Wang Bian · Victor Adrian Prisacariu | N/A | Code |
| GrowSP: Unsupervised Semantic Segmentation of 3D Point Clouds | Zihui Zhang · Bo Yang · Bing Wang · Bo Li | N/A | Code |
| Multi-Mode Online Knowledge Distillation for Self-Supervised Visual Representation Learning | Kaiyou Song · Jin Xie · Shan Zhang · Zimeng Luo | N/A | Code |
| Aligning Step-by-Step Instructional Diagrams to Video Demonstrations | Jiahao Zhang · Anoop Cherian · Yanbin Liu · Yizhak Ben-Shabat · Cristian Rodriguez · Stephen Gould | N/A | Code |
| ESLAM: Efficient Dense SLAM System Based on Hybrid Representation of Signed Distance Fields | Mohammad Mahdi Johari · Camilla Carta · François Fleuret | N/A | Code |
| AutoRecon: Automated 3D Object Discovery and Reconstruction | Yuang Wang · Xingyi He · Sida Peng · Haotong Lin · Hujun Bao · Xiaowei Zhou | N/A | Code |
| Ultra-High Resolution Segmentation With Ultra-Rich Context: A Novel Benchmark | Deyi Ji · Feng Zhao · Hongtao Lu · Mingyuan Tao · Jieping Ye | N/A | Code |
| NeUDF: Leaning Neural Unsigned Distance Fields With Volume Rendering | Yu-Tao Liu · Li Wang · Jie Yang · Weikai Chen · Xiaoxu Meng · Bo Yang · Lin Gao | N/A | Code |
| Improving Cross-Modal Retrieval With Set of Diverse Embeddings | Dongwon Kim · Namyup Kim · Suha Kwak | N/A | Code |
| An Image Quality Assessment Dataset for Portraits | Nicolas Chahine · Stefania Calarasanu · Davide Garcia-Civiero · Théo Cayla · Sira Ferradans · Jean Ponce | N/A | Code |
| Weakly Supervised Semantic Segmentation via Adversarial Learning of Classifier and Reconstructor | Hyeokjun Kweon · Sung-Hoon Yoon · Kuk-Jin Yoon | N/A | Code |
| NeRFLix: High-Quality Neural View Synthesis by Learning a Degradation-Driven Inter-Viewpoint MiXer | Kun Zhou · Wenbo Li · Yi Wang · Tao Hu · Nianjuan Jiang · Xiaoguang Han · Jiangbo Lu | N/A | Code |
| ShapeTalk: A Language Dataset and Framework for 3D Shape Edits and Deformations | Panos Achlioptas · Ian Huang · Minhyuk Sung · Sergey Tulyakov · Leonidas Guibas | N/A | Code |
| RelightableHands: Efficient Neural Relighting of Articulated Hand Models | Shun Iwase · Shunsuke Saito · Tomas Simon · Stephen Lombardi · Timur Bagautdinov · Rohan Joshi · Fabian Prada · Takaaki Shiratori · Yaser Sheikh · Jason Saragih | N/A | Code |
| VL-SAT: Visual-Linguistic Semantics Assisted Training for 3D Semantic Scene Graph Prediction in Point Cloud | Ziqin Wang · Bowen Cheng · Lichen Zhao · Dong Xu · Yang Tang · Lu Sheng | N/A | Code |
| MVImgNet: A Large-Scale Dataset of Multi-View Images | Xianggang Yu · Mutian Xu · Yidan Zhang · Haolin Liu · Chongjie Ye · Yushuang Wu · Zizheng Yan · Chenming Zhu · Zhangyang Xiong · Tianyou Liang · Guanying Chen · Shuguang Cui · Xiaoguang Han | N/A | Code |
| MM-3DScene: 3D Scene Understanding by Customizing Masked Modeling With Informative-Preserved Reconstruction and Self-Distilled Consistency | Mingye Xu · Mutian Xu · Tong He · Wanli Ouyang · Yali Wang · Xiaoguang Han · Yu Qiao | N/A | Code |
| Self-Guided Diffusion Models | Vincent Tao Hu · David W. Zhang · Yuki M. Asano · Gertjan J. Burghouts · Cees G. M. Snoek | N/A | Code |
| REC-MV: REconstructing 3D Dynamic Cloth From Monocular Videos | Lingteng Qiu · Guanying Chen · Jiapeng Zhou · Mutian Xu · Junle Wang · Xiaoguang Han | N/A | Code |
| OneFormer: One Transformer To Rule Universal Image Segmentation | Jitesh Jain · Jiachen Li · Mang Tik Chiu · Ali Hassani · Nikita Orlov · Humphrey Shi | N/A | Code |
| Mask-Free OVIS: Open-Vocabulary Instance Segmentation Without Manual Mask Annotations | Vibashan VS · Ning Yu · Chen Xing · Can Qin · Mingfei Gao · Juan Carlos Niebles · Vishal M. Patel · Ran Xu | N/A | Code |
| Multiclass Confidence and Localization Calibration for Object Detection | Bimsara Pathiraja · Malitha Gunawardhana · Muhammad Haris Khan | N/A | Code |
| Structured Kernel Estimation for Photon-Limited Deconvolution | Yash Sanghvi · Zhiyuan Mao · Stanley H. Chan | N/A | Code |
| CLIPPO: Image-and-Language Understanding From Pixels Only | Michael Tschannen · Basil Mustafa · Neil Houlsby | N/A | Code |
| Actionlet-Dependent Contrastive Learning for Unsupervised Skeleton-Based Action Recognition | Lilang Lin · Jiahang Zhang · Jiaying Liu | N/A | Code |
| Role of Transients in Two-Bounce Non-Line-of-Sight Imaging | Siddharth Somasundaram · Akshat Dave · Connor Henley · Ashok Veeraraghavan · Ramesh Raskar | N/A | Code |
| Shape-Aware Text-Driven Layered Video Editing | Yao-Chih Lee · Ji-Ze Genevieve Jang · Yi-Ting Chen · Elizabeth Qiu · Jia-Bin Huang | N/A | Code |
| FlexiViT: One Model for All Patch Sizes | Lucas Beyer · Pavel Izmailov · Alexander Kolesnikov · Mathilde Caron · Simon Kornblith · Xiaohua Zhai · Matthias Minderer · Michael Tschannen · Ibrahim Alabdulmohsin · Filip Pavetic | N/A | Code |
| Turning Strengths Into Weaknesses: A Certified Robustness Inspired Attack Framework Against Graph Neural Networks | Binghui Wang · Meng Pang · Yun Dong | N/A | Code |
| HairStep: Transfer Synthetic to Real Using Strand and Depth Maps for Single-View 3D Hair Modeling | Yujian Zheng · Zirong Jin · Moran Li · Haibin Huang · Chongyang Ma · Shuguang Cui · Xiaoguang Han | N/A | Code |
| RONO: Robust Discriminative Learning With Noisy Labels for 2D-3D Cross-Modal Retrieval | Yanglin Feng · Hongyuan Zhu · Dezhong Peng · Xi Peng · Peng Hu | N/A | Code |
| Learning Federated Visual Prompt in Null Space for MRI Reconstruction | Chun-Mei Feng · Bangjun Li · Xinxing Xu · Yong Liu · Huazhu Fu · Wangmeng Zuo | N/A | Code |
| VGFlow: Visibility Guided Flow Network for Human Reposing | Rishabh Jain · Krishna Kumar Singh · Mayur Hemani · Jingwan Lu · Mausoom Sarkar · Duygu Ceylan · Balaji Krishnamurthy | N/A | Code |
| Learning Attention As Disentangler for Compositional Zero-Shot Learning | Shaozhe Hao · Kai Han · Kwan-Yee K. Wong | N/A | Code |
| PET-NeuS: Positional Encoding Tri-Planes for Neural Surfaces | Yiqun Wang · Ivan Skorokhodov · Peter Wonka | N/A | Code |
| Perception-Oriented Single Image Super-Resolution Using Optimal Objective Estimation | Seung Ho Park · Young Su Moon · Nam Ik Cho | N/A | Code |
| Learning To Exploit Temporal Structure for Biomedical Vision–Language Processing | Shruthi Bannur · Stephanie Hyland · Qianchu Liu · Fernando Pérez-García · Maximilian Ilse · Daniel C. Castro · Benedikt Boecking · Harshita Sharma · Kenza Bouzid · Anja Thieme · Anton Schwaighofer · Maria Wetscherek · Matthew P. Lungren · Aditya Nori · Javier Alvarez-Valle · Ozan Oktay | N/A | Code |
| TRACE: 5D Temporal Regression of Avatars With Dynamic Cameras in 3D Environments | Yu Sun · Qian Bao · Wu Liu · Tao Mei · Michael J. Black | N/A | Code |
| Neumann Network With Recursive Kernels for Single Image Defocus Deblurring | Yuhui Quan · Zicong Wu · Hui Ji | N/A | Code |
| Guiding Pseudo-Labels With Uncertainty Estimation for Source-Free Unsupervised Domain Adaptation | Mattia Litrico · Alessio Del Bue · Pietro Morerio | N/A | Code |
| PlaneDepth: Self-Supervised Depth Estimation via Orthogonal Planes | Ruoyu Wang · Zehao Yu · Shenghua Gao | N/A | Code |
| Castling-ViT: Compressing Self-Attention via Switching Towards Linear-Angular Attention at Vision Transformer Inference | Haoran You · Yunyang Xiong · Xiaoliang Dai · Bichen Wu · Peizhao Zhang · Haoqi Fan · Peter Vajda · Yingyan (Celine) Lin | N/A | Code |
| Attention-Based Point Cloud Edge Sampling | Chengzhi Wu · Junwei Zheng · Julius Pfrommer · Jürgen Beyerer | N/A | Code |
| Structured 3D Features for Reconstructing Controllable Avatars | Enric Corona · Mihai Zanfir · Thiemo Alldieck · Eduard Gabriel Bazavan · Andrei Zanfir · Cristian Sminchisescu | N/A | Code |
| Zero-Shot Referring Image Segmentation With Global-Local Context Features | Seonghoon Yu · Paul Hongsuck Seo · Jeany Son | N/A | Code |
| CASP-Net: Rethinking Video Saliency Prediction From an Audio-Visual Consistency Perceptual Perspective | Junwen Xiong · Ganglai Wang · Peng Zhang · Wei Huang · Yufei Zha · Guangtao Zhai | N/A | Code |
| Context-Aware Relative Object Queries To Unify Video Instance and Panoptic Segmentation | Anwesa Choudhuri · Girish Chowdhary · Alexander G. Schwing | N/A | Code |
| Canonical Fields: Self-Supervised Learning of Pose-Canonicalized Neural Fields | Rohith Agaram · Shaurya Dewan · Rahul Sajnani · Adrien Poulenard · Madhava Krishna · Srinath Sridhar | N/A | Code |
| Decoupled Multimodal Distilling for Emotion Recognition | Yong Li · Yuanzhi Wang · Zhen Cui | N/A | Code |
| TensoIR: Tensorial Inverse Rendering | Haian Jin · Isabella Liu · Peijia Xu · Xiaoshuai Zhang · Songfang Han · Sai Bi · Xiaowei Zhou · Zexiang Xu · Hao Su | N/A | Code |
| Zero-Shot Generative Model Adaptation via Image-Specific Prompt Learning | Jiayi Guo · Chaofei Wang · You Wu · Eric Zhang · Kai Wang · Xingqian Xu · Shiji Song · Humphrey Shi · Gao Huang | N/A | Code |
| DiGeo: Discriminative Geometry-Aware Learning for Generalized Few-Shot Object Detection | Jiawei Ma · Yulei Niu · Jincheng Xu · Shiyuan Huang · Guangxing Han · Shih-Fu Chang | N/A | Code |
| Unbalanced Optimal Transport: A Unified Framework for Object Detection | Henri De Plaen · Pierre-François De Plaen · Johan A. K. Suykens · Marc Proesmans · Tinne Tuytelaars · Luc Van Gool | N/A | Code |
| NeRFInvertor: High Fidelity NeRF-GAN Inversion for Single-Shot Real Image Animation | Yu Yin · Kamran Ghasedi · HsiangTao Wu · Jiaolong Yang · Xin Tong · Yun Fu | N/A | Code |
| Masked Image Training for Generalizable Deep Image Denoising | Haoyu Chen · Jinjin Gu · Yihao Liu · Salma Abdel Magid · Chao Dong · Qiong Wang · Hanspeter Pfister · Lei Zhu | N/A | Code |
| Toward Verifiable and Reproducible Human Evaluation for Text-to-Image Generation | Mayu Otani · Riku Togashi · Yu Sawai · Ryosuke Ishigami · Yuta Nakashima · Esa Rahtu · Janne Heikkilä · Shin’ichi Satoh | N/A | Code |
| Towards Flexible Multi-Modal Document Models | Naoto Inoue · Kotaro Kikuchi · Edgar Simo-Serra · Mayu Otani · Kota Yamaguchi | N/A | Code |
| Zero-Shot Everything Sketch-Based Image Retrieval, and in Explainable Style | Fengyin Lin · Mingkang Li · Da Li · Timothy Hospedales · Yi-Zhe Song · Yonggang Qi | N/A | Code |
| LidarGait: Benchmarking 3D Gait Recognition With Point Clouds | Chuanfu Shen · Chao Fan · Wei Wu · Rui Wang · George Q. Huang · Shiqi Yu | N/A | Code |
| OpenGait: Revisiting Gait Recognition Towards Better Practicality | Chao Fan · Junhao Liang · Chuanfu Shen · Saihui Hou · Yongzhen Huang · Shiqi Yu | N/A | Code |
| Towards Unsupervised Object Detection From LiDAR Point Clouds | Lunjun Zhang · Anqi Joyce Yang · Yuwen Xiong · Sergio Casas · Bin Yang · Mengye Ren · Raquel Urtasun | N/A | Code |
| Visual Language Pretrained Multiple Instance Zero-Shot Transfer for Histopathology Images | Ming Y. Lu · Bowen Chen · Andrew Zhang · Drew F. K. Williamson · Richard J. Chen · Tong Ding · Long Phi Le · Yung-Sung Chuang · Faisal Mahmood | N/A | Code |
| DivClust: Controlling Diversity in Deep Clustering | Ioannis Maniadis Metaxas · Georgios Tzimiropoulos · Ioannis Patras | N/A | Code |
| AttriCLIP: A Non-Incremental Learner for Incremental Knowledge Learning | Runqi Wang · Xiaoyue Duan · Guoliang Kang · Jianzhuang Liu · Shaohui Lin · Songcen Xu · Jinhu Lü · Baochang Zhang | N/A | Code |
| Unsupervised Continual Semantic Adaptation Through Neural Rendering | Zhizheng Liu · Francesco Milano · Jonas Frey · Roland Siegwart · Hermann Blum · Cesar Cadena | N/A | Code |
| Semi-Supervised Parametric Real-World Image Harmonization | Ke Wang · Michaël Gharbi · He Zhang · Zhihao Xia · Eli Shechtman | N/A | Code |
| EqMotion: Equivariant Multi-Agent Motion Prediction With Invariant Interaction Reasoning | Chenxin Xu · Robby T. Tan · Yuhong Tan · Siheng Chen · Yu Guang Wang · Xinchao Wang · Yanfeng Wang | N/A | Code |
| BUOL: A Bottom-Up Framework With Occupancy-Aware Lifting for Panoptic 3D Scene Reconstruction From a Single Image | Tao Chu · Pan Zhang · Qiong Liu · Jiaqi Wang | N/A | Code |
| Lite-Mono: A Lightweight CNN and Transformer Architecture for Self-Supervised Monocular Depth Estimation | Ning Zhang · Francesco Nex · George Vosselman · Norman Kerle | N/A | Code |
| Novel-View Acoustic Synthesis | Changan Chen · Alexander Richard · Roman Shapovalov · Vamsi Krishna Ithapu · Natalia Neverova · Kristen Grauman · Andrea Vedaldi | N/A | Code |
| Audio-Visual Grouping Network for Sound Localization From Mixtures | Shentong Mo · Yapeng Tian | N/A | Code |
| Chat2Map: Efficient Scene Mapping From Multi-Ego Conversations | Sagnik Majumder · Hao Jiang · Pierre Moulon · Ethan Henderson · Paul Calamia · Kristen Grauman · Vamsi Krishna Ithapu | N/A | Code |
| ConvNeXt V2: Co-Designing and Scaling ConvNets With Masked Autoencoders | Sanghyun Woo · Shoubhik Debnath · Ronghang Hu · Xinlei Chen · Zhuang Liu · In So Kweon · Saining Xie | N/A | Code |
| Collaboration Helps Camera Overtake LiDAR in 3D Detection | Yue Hu · Yifan Lu · Runsheng Xu · Weidi Xie · Siheng Chen · Yanfeng Wang | N/A | Code |
| Few-Shot Learning With Visual Distribution Calibration and Cross-Modal Distribution Alignment | Runqi Wang · Hao Zheng · Xiaoyue Duan · Jianzhuang Liu · Yuning Lu · Tian Wang · Songcen Xu · Baochang Zhang | N/A | Code |
| MetaCLUE: Towards Comprehensive Visual Metaphors Research | Arjun R. Akula · Brendan Driscoll · Pradyumna Narayana · Soravit Changpinyo · Zhiwei Jia · Suyash Damle · Garima Pruthi · Sugato Basu · Leonidas Guibas · William Freeman · Yuanzhen Li · Varun Jampani | N/A | Code |
| Deep Fair Clustering via Maximizing and Minimizing Mutual Information: Theory, Algorithm and Metric | Pengxin Zeng · Yunfan Li · Peng Hu · Dezhong Peng · Jiancheng Lv · Xi Peng | N/A | Code |
| Visual Atoms: Pre-Training Vision Transformers With Sinusoidal Waves | Sora Takashima · Ryo Hayamizu · Nakamasa Inoue · Hirokatsu Kataoka · Rio Yokota | N/A | Code |
| 3D-Aware Multi-Class Image-to-Image Translation With NeRFs | Senmao Li · Joost van de Weijer · Yaxing Wang · Fahad Shahbaz Khan · Meiqin Liu · Jian Yang | N/A | Code |
| E2PN: Efficient SE(3)-Equivariant Point Network | Minghan Zhu · Maani Ghaffari · William A. Clark · Huei Peng | N/A | Code |
| PixHt-Lab: Pixel Height Based Light Effect Generation for Image Compositing | Yichen Sheng · Jianming Zhang · Julien Philip · Yannick Hold-Geoffroy · Xin Sun · He Zhang · Lu Ling · Bedrich Benes | N/A | Code |
| UniSim: A Neural Closed-Loop Sensor Simulator | Ze Yang · Yun Chen · Jingkang Wang · Sivabalan Manivasagam · Wei-Chiu Ma · Anqi Joyce Yang · Raquel Urtasun | N/A | Code |
| Occlusion-Free Scene Recovery via Neural Radiance Fields | Chengxuan Zhu · Renjie Wan · Yunkai Tang · Boxin Shi | N/A | Code |
| SPIn-NeRF: Multiview Segmentation and Perceptual Inpainting With Neural Radiance Fields | Ashkan Mirzaei · Tristan Aumentado-Armstrong · Kosta Derpanis · Jonathan Kelly · Marcus A. Brubaker · Igor Gilitschenski · Alex Levinshtein | N/A | Code |
| Class-Incremental Exemplar Compression for Class-Incremental Learning | Zilin Luo · Yaoyao Liu · Bernt Schiele · Qianru Sun | N/A | Code |
| DETRs With Hybrid Matching | Ding Jia · Yuhui Yuan · Haodi He · Xiaopei Wu · Haojun Yu · Weihong Lin · Lei Sun · Chao Zhang · Han Hu | N/A | Code |
| 3D Human Mesh Estimation From Virtual Markers | Xiaoxuan Ma · Jiajun Su · Chunyu Wang · Wentao Zhu · Yizhou Wang | N/A | Code |
| Objaverse: A Universe of Annotated 3D Objects | Matt Deitke · Dustin Schwenk · Jordi Salvador · Luca Weihs · Oscar Michel · Eli VanderBilt · Ludwig Schmidt · Kiana Ehsani · Aniruddha Kembhavi · Ali Farhadi | N/A | Code |
| Adjustment and Alignment for Unbiased Open Set Domain Adaptation | Wuyang Li · Jie Liu · Bo Han · Yixuan Yuan | N/A | Code |
| TimeBalance: Temporally-Invariant and Temporally-Distinctive Video Representations for Semi-Supervised Action Recognition | Ishan Rajendrakumar Dave · Mamshad Nayeem Rizve · Chen Chen · Mubarak Shah | N/A | Code |
| EfficientSCI: Densely Connected Network With Space-Time Factorization for Large-Scale Video Snapshot Compressive Imaging | Lishun Wang · Miao Cao · Xin Yuan | N/A | Code |
| Continual Detection Transformer for Incremental Object Detection | Yaoyao Liu · Bernt Schiele · Andrea Vedaldi · Christian Rupprecht | N/A | Code |
| Hierarchical Prompt Learning for Multi-Task Learning | Yajing Liu · Yuning Lu · Hao Liu · Yaozu An · Zhuoran Xu · Zhuokun Yao · Baofeng Zhang · Zhiwei Xiong · Chenguang Gui | N/A | Code |
| Boost Vision Transformer With GPU-Friendly Sparsity and Quantization | Chong Yu · Tao Chen · Zhongxue Gan · Jiayuan Fan | N/A | Code |
| Demystifying Causal Features on Adversarial Examples and Causal Inoculation for Robust Network by Adversarial Instrumental Variable Regression | Junho Kim · Byung-Kwan Lee · Yong Man Ro | N/A | Code |
| Regularizing Second-Order Influences for Continual Learning | Zhicheng Sun · Yadong Mu · Gang Hua | N/A | Code |
| Heterogeneous Continual Learning | Divyam Madaan · Hongxu Yin · Wonmin Byeon · Jan Kautz · Pavlo Molchanov | N/A | Code |
| DP-NeRF: Deblurred Neural Radiance Field With Physical Scene Priors | Dogyoon Lee · Minhyeok Lee · Chajin Shin · Sangyoun Lee | N/A | Code |
| 3D-POP – An Automated Annotation Approach to Facilitate Markerless 2D-3D Tracking of Freely Moving Birds With Marker-Based Motion Capture | Hemal Naik · Alex Hoi Hang Chan · Junran Yang · Mathilde Delacoux · Iain D. Couzin · Fumihiro Kano · Máté Nagy | N/A | Code |
| Recognizing Rigid Patterns of Unlabeled Point Clouds by Complete and Continuous Isometry Invariants With No False Negatives and No False Positives | Daniel Widdowson · Vitaliy Kurlin | N/A | Code |
| Robust Model-Based Face Reconstruction Through Weakly-Supervised Outlier Segmentation | Chunlu Li · Andreas Morel-Forster · Thomas Vetter · Bernhard Egger · Adam Kortylewski | N/A | Code |
| Implicit Identity Leakage: The Stumbling Block to Improving Deepfake Detection Generalization | Shichao Dong · Jin Wang · Renhe Ji · Jiajun Liang · Haoqiang Fan · Zheng Ge | N/A | Code |
| PoseExaminer: Automated Testing of Out-of-Distribution Robustness in Human Pose and Shape Estimation | Qihao Liu · Adam Kortylewski · Alan L. Yuille | N/A | Code |
| VoxelNeXt: Fully Sparse VoxelNet for 3D Object Detection and Tracking | Yukang Chen · Jianhui Liu · Xiangyu Zhang · Xiaojuan Qi · Jiaya Jia | N/A | Code |
| 1000 FPS HDR Video With a Spike-RGB Hybrid Camera | Yakun Chang · Chu Zhou · Yuchen Hong · Liwen Hu · Chao Xu · Tiejun Huang · Boxin Shi | N/A | Code |
| How To Backdoor Diffusion Models? | Sheng-Yen Chou · Pin-Yu Chen · Tsung-Yi Ho | N/A | Code |
| PIP-Net: Patch-Based Intuitive Prototypes for Interpretable Image Classification | Meike Nauta · Jörg Schlötterer · Maurice van Keulen · Christin Seifert | N/A | Code |
| Joint Token Pruning and Squeezing Towards More Aggressive Compression of Vision Transformers | Siyuan Wei · Tianzhu Ye · Shen Zhang · Yao Tang · Jiajun Liang | N/A | Code |
| Energy-Efficient Adaptive 3D Sensing | Brevin Tilmon · Zhanghao Sun · Sanjeev J. Koppal · Yicheng Wu · Georgios Evangelidis · Ramzi Zahreddine · Gurunandan Krishnan · Sizhuo Ma · Jian Wang | N/A | Code |
| Boosting Semi-Supervised Learning by Exploiting All Unlabeled Data | Yuhao Chen · Xin Tan · Borui Zhao · Zhaowei Chen · Renjie Song · Jiajun Liang · Xuequan Lu | N/A | Code |
| Fix the Noise: Disentangling Source Feature for Controllable Domain Translation | Dongyeun Lee · Jae Young Lee · Doyeon Kim · Jaehyun Choi · Jaejun Yoo · Junmo Kim | N/A | Code |
| Learning Transferable Spatiotemporal Representations From Natural Script Knowledge | Ziyun Zeng · Yuying Ge · Xihui Liu · Bin Chen · Ping Luo · Shu-Tao Xia · Yixiao Ge | N/A | Code |
| Side Adapter Network for Open-Vocabulary Semantic Segmentation | Mengde Xu · Zheng Zhang · Fangyun Wei · Han Hu · Xiang Bai | N/A | Code |
| A Strong Baseline for Generalized Few-Shot Semantic Segmentation | Sina Hajimiri · Malik Boudiaf · Ismail Ben Ayed · Jose Dolz | N/A | Code |
| Towards Compositional Adversarial Robustness: Generalizing Adversarial Training to Composite Semantic Perturbations | Lei Hsiung · Yun-Yun Tsai · Pin-Yu Chen · Tsung-Yi Ho | N/A | Code |
| Normalizing Flow Based Feature Synthesis for Outlier-Aware Object Detection | Nishant Kumar · Siniša Šegvić · Abouzar Eslami · Stefan Gumhold | N/A | Code |
| AVFace: Towards Detailed Audio-Visual 4D Face Reconstruction | Aggelina Chatziagapi · Dimitris Samaras | N/A | Code |
| Learning Semantic Relationship Among Instances for Image-Text Matching | Zheren Fu · Zhendong Mao · Yan Song · Yongdong Zhang | N/A | Code |
| Understanding Imbalanced Semantic Segmentation Through Neural Collapse | Zhisheng Zhong · Jiequan Cui · Yibo Yang · Xiaoyang Wu · Xiaojuan Qi · Xiangyu Zhang · Jiaya Jia | N/A | Code |
| SCADE: NeRFs From Space Carving With Ambiguity-Aware Depth Estimates | Mikaela Angelina Uy · Ricardo Martin-Brualla · Leonidas Guibas · Ke Li | N/A | Code |
| MonoHuman: Animatable Human Neural Field From Monocular Video | Zhengming Yu · Wei Cheng · Xian Liu · Wayne Wu · Kwan-Yee Lin | N/A | Code |
| Affection: Learning Affective Explanations for Real-World Visual Data | Panos Achlioptas · Maks Ovsjanikov · Leonidas Guibas · Sergey Tulyakov | N/A | Code |
| Sharpness-Aware Gradient Matching for Domain Generalization | Pengfei Wang · Zhaoxiang Zhang · Zhen Lei · Lei Zhang | N/A | Code |
| Generalized Decoding for Pixel, Image, and Language | Xueyan Zou · Zi-Yi Dou · Jianwei Yang · Zhe Gan · Linjie Li · Chunyuan Li · Xiyang Dai · Harkirat Behl · Jianfeng Wang · Lu Yuan · Nanyun Peng · Lijuan Wang · Yong Jae Lee · Jianfeng Gao | N/A | Code |
| How You Feelin’? Learning Emotions and Mental States in Movie Scenes | Dhruv Srivastava · Aditya Kumar Singh · Makarand Tapaswi | N/A | Code |
| Improving Visual Representation Learning Through Perceptual Understanding | Samyakh Tukra · Frederick Hoffman · Ken Chatfield | N/A | Code |
| PlenVDB: Memory Efficient VDB-Based Radiance Fields for Fast Training and Rendering | Han Yan · Celong Liu · Chao Ma · Xing Mei | N/A | Code |
| HaLP: Hallucinating Latent Positives for Skeleton-Based Self-Supervised Learning of Actions | Anshul Shah · Aniket Roy · Ketul Shah · Shlok Mishra · David Jacobs · Anoop Cherian · Rama Chellappa | N/A | Code |
| FeatureBooster: Boosting Feature Descriptors With a Lightweight Neural Network | Xinjiang Wang · Zeyu Liu · Yu Hu · Wei Xi · Wenxian Yu · Danping Zou | N/A | Code |
| ACL-SPC: Adaptive Closed-Loop System for Self-Supervised Point Cloud Completion | Sangmin Hong · Mohsen Yavartanoo · Reyhaneh Neshatavar · Kyoung Mu Lee | N/A | Code |
| NeRF in the Palm of Your Hand: Corrective Augmentation for Robotics via Novel-View Synthesis | Allan Zhou · Moo Jin Kim · Lirui Wang · Pete Florence · Chelsea Finn | N/A | Code |
| Query-Centric Trajectory Prediction | Zikang Zhou · Jianping Wang · Yung-Hui Li · Yu-Kai Huang | N/A | Code |
| EDA: Explicit Text-Decoupling and Dense Alignment for 3D Visual Grounding | Yanmin Wu · Xinhua Cheng · Renrui Zhang · Zesen Cheng · Jian Zhang | N/A | Code |
| Sliced Optimal Partial Transport | Yikun Bai · Bernhard Schmitzer · Matthew Thorpe · Soheil Kolouri | N/A | Code |
| PersonNeRF: Personalized Reconstruction From Photo Collections | Chung-Yi Weng · Pratul P. Srinivasan · Brian Curless · Ira Kemelmacher-Shlizerman | N/A | Code |
| Feature Shrinkage Pyramid for Camouflaged Object Detection With Transformers | Zhou Huang · Hang Dai · Tian-Zhu Xiang · Shuo Wang · Huai-Xin Chen · Jie Qin · Huan Xiong | N/A | Code |
| HOLODIFFUSION: Training a 3D Diffusion Model Using 2D Images | Animesh Karnewar · Andrea Vedaldi · David Novotny · Niloy J. Mitra | N/A | Code |
| Towards Efficient Use of Multi-Scale Features in Transformer-Based Object Detectors | Gongjie Zhang · Zhipeng Luo · Zichen Tian · Jingyi Zhang · Xiaoqin Zhang · Shijian Lu | N/A | Code |
| Interventional Bag Multi-Instance Learning on Whole-Slide Pathological Images | Tiancheng Lin · Zhimiao Yu · Hongyu Hu · Yi Xu · Chang-Wen Chen | N/A | Code |
| Meta-Explore: Exploratory Hierarchical Vision-and-Language Navigation Using Scene Object Spectrum Grounding | Minyoung Hwang · Jaeyeon Jeong · Minsoo Kim · Yoonseon Oh · Songhwai Oh | N/A | Code |
| Sketch2Saliency: Learning To Detect Salient Objects From Human Drawings | Ayan Kumar Bhunia · Subhadeep Koley · Amandeep Kumar · Aneeshan Sain · Pinaki Nath Chowdhury · Tao Xiang · Yi-Zhe Song | N/A | Code |
| Picture That Sketch: Photorealistic Image Generation From Abstract Sketches | Subhadeep Koley · Ayan Kumar Bhunia · Aneeshan Sain · Pinaki Nath Chowdhury · Tao Xiang · Yi-Zhe Song | N/A | Code |
| CLIP for All Things Zero-Shot Sketch-Based Image Retrieval, Fine-Grained or Not | Aneeshan Sain · Ayan Kumar Bhunia · Pinaki Nath Chowdhury · Subhadeep Koley · Tao Xiang · Yi-Zhe Song | N/A | Code |
| LANIT: Language-Driven Image-to-Image Translation for Unlabeled Data | Jihye Park · Sunwoo Kim · Soohyun Kim · Seokju Cho · Jaejun Yoo · Youngjung Uh · Seungryong Kim | N/A | Code |
| Depth Estimation From Indoor Panoramas With Neural Scene Representation | Wenjie Chang · Yueyi Zhang · Zhiwei Xiong | N/A | Code |
| What Can Human Sketches Do for Object Detection? | Pinaki Nath Chowdhury · Ayan Kumar Bhunia · Aneeshan Sain · Subhadeep Koley · Tao Xiang · Yi-Zhe Song | N/A | Code |
| SceneTrilogy: On Human Scene-Sketch and Its Complementarity With Photo and Text | Pinaki Nath Chowdhury · Ayan Kumar Bhunia · Aneeshan Sain · Subhadeep Koley · Tao Xiang · Yi-Zhe Song | N/A | Code |
| Markerless Camera-to-Robot Pose Estimation via Self-Supervised Sim-to-Real Transfer | Jingpei Lu · Florian Richter · Michael C. Yip | N/A | Code |
| Fine-Grained Audible Video Description | Xuyang Shen · Dong Li · Jinxing Zhou · Zhen Qin · Bowen He · Xiaodong Han · Aixuan Li · Mochu Xiang · Lingpeng Kong · Meng Wang · Yu Qiao · Yiran Zhong | N/A | Code |
| EfficientViT: Memory Efficient Vision Transformer With Cascaded Group Attention | Xinyu Liu · Houwen Peng · Ningxin Zheng · Yuqing Yang · Han Hu · Yixuan Yuan | N/A | Code |
| Relightable Neural Human Assets From Multi-View Gradient Illuminations | Taotao Zhou · Kai He · Di Wu · Teng Xu · Qixuan Zhang · Kuixiang Shao · Wenzheng Chen · Lan Xu · Jingyi Yu | N/A | Code |
| Music-Driven Group Choreography | Nhat Le · Thang Pham · Tuong Do · Erman Tjiputra · Quang D. Tran · Anh Nguyen | N/A | Code |
| DIP: Dual Incongruity Perceiving Network for Sarcasm Detection | Changsong Wen · Guoli Jia · Jufeng Yang | N/A | Code |
| MagicPony: Learning Articulated 3D Animals in the Wild | Shangzhe Wu · Ruining Li · Tomas Jakab · Christian Rupprecht · Andrea Vedaldi | N/A | Code |
| Preserving Linear Separability in Continual Learning by Backward Feature Projection | Qiao Gu · Dongsub Shim · Florian Shkurti | N/A | Code |
| Improving Fairness in Facial Albedo Estimation via Visual-Textual Cues | Xingyu Ren · Jiankang Deng · Chao Ma · Yichao Yan · Xiaokang Yang | N/A | Code |
| HOICLIP: Efficient Knowledge Transfer for HOI Detection With Vision-Language Models | Shan Ning · Longtian Qiu · Yongfei Liu · Xuming He | N/A | Code |
| Regularization of Polynomial Networks for Image Recognition | Grigorios G. Chrysos · Bohan Wang · Jiankang Deng · Volkan Cevher | N/A | Code |
| Exploiting Unlabelled Photos for Stronger Fine-Grained SBIR | Aneeshan Sain · Ayan Kumar Bhunia · Subhadeep Koley · Pinaki Nath Chowdhury · Soumitri Chattopadhyay · Tao Xiang · Yi-Zhe Song | N/A | Code |
| Learning Semantic-Aware Knowledge Guidance for Low-Light Image Enhancement | Yuhui Wu · Chen Pan · Guoqing Wang · Yang Yang · Jiwei Wei · Chongyi Li · Heng Tao Shen | N/A | Code |
| Block Selection Method for Using Feature Norm in Out-of-Distribution Detection | Yeonguk Yu · Sungho Shin · Seongju Lee · Changhyun Jun · Kyoobin Lee | N/A | Code |
| HouseDiffusion: Vector Floorplan Generation via a Diffusion Model With Discrete and Continuous Denoising | Mohammad Amin Shabani · Sepidehsadat Hosseini · Yasutaka Furukawa | N/A | Code |
| Integral Neural Networks | Kirill Solodskikh · Azim Kurbanov · Ruslan Aydarkhanov · Irina Zhelavskaya · Yury Parfenov · Dehua Song · Stamatios Lefkimmiatis | N/A | Code |
| FitMe: Deep Photorealistic 3D Morphable Model Avatars | Alexandros Lattas · Stylianos Moschoglou · Stylianos Ploumpis · Baris Gecer · Jiankang Deng · Stefanos Zafeiriou | N/A | Code |
| Sound to Visual Scene Generation by Audio-to-Visual Latent Alignment | Kim Sung-Bin · Arda Senocak · Hyunwoo Ha · Andrew Owens · Tae-Hyun Oh | N/A | Code |
| Introducing Competition To Boost the Transferability of Targeted Adversarial Examples Through Clean Feature Mixup | Junyoung Byun · Myung-Joon Kwon · Seungju Cho · Yoonji Kim · Changick Kim | N/A | Code |
| Initialization Noise in Image Gradients and Saliency Maps | Ann-Christin Woerl · Jan Disselhoff · Michael Wand | N/A | Code |
| Two-Shot Video Object Segmentation | Kun Yan · Xiao Li · Fangyun Wei · Jinglu Wang · Chenbin Zhang · Ping Wang · Yan Lu | N/A | Code |
| SCOOP: Self-Supervised Correspondence and Optimization-Based Scene Flow | Itai Lang · Dror Aiger · Forrester Cole · Shai Avidan · Michael Rubinstein | N/A | Code |
| Co-SLAM: Joint Coordinate and Sparse Parametric Encodings for Neural Real-Time SLAM | Hengyi Wang · Jingwen Wang · Lourdes Agapito | N/A | Code |
| Semantic-Promoted Debiasing and Background Disambiguation for Zero-Shot Instance Segmentation | Shuting He · Henghui Ding · Wei Jiang | N/A | Code |
| Rawgment: Noise-Accounted RAW Augmentation Enables Recognition in a Wide Variety of Environments | Masakazu Yoshimura · Junji Otsuka · Atsushi Irie · Takeshi Ohashi | N/A | Code |
| Diffusion-Based Signed Distance Fields for 3D Shape Generation | Jaehyeok Shim · Changwoo Kang · Kyungdon Joo | N/A | Code |
| Handwritten Text Generation From Visual Archetypes | Vittorio Pippi · Silvia Cascianelli · Rita Cucchiara | N/A | Code |
| Novel Class Discovery for 3D Point Cloud Semantic Segmentation | Luigi Riz · Cristiano Saltori · Elisa Ricci · Fabio Poiesi | N/A | Code |
| DeltaEdit: Exploring Text-Free Training for Text-Driven Image Manipulation | Yueming Lyu · Tianwei Lin · Fu Li · Dongliang He · Jing Dong · Tieniu Tan | N/A | Code |
| SkyEye: Self-Supervised Bird’s-Eye-View Semantic Mapping Using Monocular Frontal View Images | Nikhil Gosala · Kürsat Petek · Paulo L. J. Drews-Jr · Wolfram Burgard · Abhinav Valada | N/A | Code |
| Towards Open-World Segmentation of Parts | Tai-Yu Pan · Qing Liu · Wei-Lun Chao · Brian Price | N/A | Code |
| DeepMapping2: Self-Supervised Large-Scale LiDAR Map Optimization | Chao Chen · Xinhao Liu · Yiming Li · Li Ding · Chen Feng | N/A | Code |
| SINE: SINgle Image Editing With Text-to-Image Diffusion Models | Zhixing Zhang · Ligong Han · Arnab Ghosh · Dimitris N. Metaxas · Jian Ren | N/A | Code |
| Discriminative Co-Saliency and Background Mining Transformer for Co-Salient Object Detection | Long Li · Junwei Han · Ni Zhang · Nian Liu · Salman Khan · Hisham Cholakkal · Rao Muhammad Anwer · Fahad Shahbaz Khan | N/A | Code |
| TruFor: Leveraging All-Round Clues for Trustworthy Image Forgery Detection and Localization | Fabrizio Guillaro · Davide Cozzolino · Avneesh Sud · Nicholas Dufour · Luisa Verdoliva | N/A | Code |
| SeSDF: Self-Evolved Signed Distance Field for Implicit 3D Clothed Human Reconstruction | Yukang Cao · Kai Han · Kwan-Yee K. Wong | N/A | Code |
| Hubs and Hyperspheres: Reducing Hubness and Improving Transductive Few-Shot Learning With Hyperspherical Embeddings | Daniel J. Trosten · Rwiddhi Chakraborty · Sigurd Løkse · Kristoffer Knutsen Wickstrøm · Robert Jenssen · Michael C. Kampffmeyer | N/A | Code |
| MAGE: MAsked Generative Encoder To Unify Representation Learning and Image Synthesis | Tianhong Li · Huiwen Chang · Shlok Mishra · Han Zhang · Dina Katabi · Dilip Krishnan | N/A | Code |
| Model Barrier: A Compact Un-Transferable Isolation Domain for Model Intellectual Property Protection | Lianyu Wang · Meng Wang · Daoqiang Zhang · Huazhu Fu | N/A | Code |
| OvarNet: Towards Open-Vocabulary Object Attribute Recognition | Keyan Chen · Xiaolong Jiang · Yao Hu · Xu Tang · Yan Gao · Jianqi Chen · Weidi Xie | N/A | Code |
| GINA-3D: Learning To Generate Implicit Neural Assets in the Wild | Bokui Shen · Xinchen Yan · Charles R. Qi · Mahyar Najibi · Boyang Deng · Leonidas Guibas · Yin Zhou · Dragomir Anguelov | N/A | Code |
| PoseFormerV2: Exploring Frequency Domain for Efficient and Robust 3D Human Pose Estimation | Qitao Zhao · Ce Zheng · Mengyuan Liu · Pichao Wang · Chen Chen | N/A | Code |
| Proposal-Based Multiple Instance Learning for Weakly-Supervised Temporal Action Localization | Huan Ren · Wenfei Yang · Tianzhu Zhang · Yongdong Zhang | N/A | Code |
| Learning Partial Correlation Based Deep Visual Representation for Image Classification | Saimunur Rahman · Piotr Koniusz · Lei Wang · Luping Zhou · Peyman Moghadam · Changming Sun | N/A | Code |
| Multi-Granularity Archaeological Dating of Chinese Bronze Dings Based on a Knowledge-Guided Relation Graph | Rixin Zhou · Jiafu Wei · Qian Zhang · Ruihua Qi · Xi Yang · Chuntao Li | N/A | Code |
| DexArt: Benchmarking Generalizable Dexterous Manipulation With Articulated Objects | Chen Bao · Helin Xu · Yuzhe Qin · Xiaolong Wang | N/A | Code |
| Modeling the Distributional Uncertainty for Salient Object Detection Models | Xinyu Tian · Jing Zhang · Mochu Xiang · Yuchao Dai | N/A | Code |
| Evading Forensic Classifiers With Attribute-Conditioned Adversarial Faces | Fahad Shamshad · Koushik Srivatsan · Karthik Nandakumar | N/A | Code |
| Scene-Aware Egocentric 3D Human Pose Estimation | Jian Wang · Diogo Luvizon · Weipeng Xu · Lingjie Liu · Kripasindhu Sarkar · Christian Theobalt | N/A | Code |
| Camouflaged Instance Segmentation via Explicit De-Camouflaging | Naisong Luo · Yuwen Pan · Rui Sun · Tianzhu Zhang · Zhiwei Xiong · Feng Wu | N/A | Code |
| N-Gram in Swin Transformers for Efficient Lightweight Image Super-Resolution | Haram Choi · Jeongmin Lee · Jihoon Yang | N/A | Code |
| Diffusion Video Autoencoders: Toward Temporally Consistent Face Video Editing via Disentangled Video Encoding | Gyeongman Kim · Hajin Shim · Hyunsu Kim · Yunjey Choi · Junho Kim · Eunho Yang | N/A | Code |
| GLIGEN: Open-Set Grounded Text-to-Image Generation | Yuheng Li · Haotian Liu · Qingyang Wu · Fangzhou Mu · Jianwei Yang · Jianfeng Gao · Chunyuan Li · Yong Jae Lee | N/A | Code |
| Balanced Spherical Grid for Egocentric View Synthesis | Changwoon Choi · Sang Min Kim · Young Min Kim | N/A | Code |
| V2V4Real: A Real-World Large-Scale Dataset for Vehicle-to-Vehicle Cooperative Perception | Runsheng Xu · Xin Xia · Jinlong Li · Hanzhao Li · Shuo Zhang · Zhengzhong Tu · Zonglin Meng · Hao Xiang · Xiaoyu Dong · Rui Song · Hongkai Yu · Bolei Zhou · Jiaqi Ma | N/A | Code |
| VindLU: A Recipe for Effective Video-and-Language Pretraining | Feng Cheng · Xizi Wang · Jie Lei · David Crandall · Mohit Bansal · Gedas Bertasius | N/A | Code |
| FreeSeg: Unified, Universal and Open-Vocabulary Image Segmentation | Jie Qin · Jie Wu · Pengxiang Yan · Ming Li · Ren Yuxi · Xuefeng Xiao · Yitong Wang · Rui Wang · Shilei Wen · Xin Pan · Xingang Wang | N/A | Code |
| NeurOCS: Neural NOCS Supervision for Monocular 3D Object Localization | Zhixiang Min · Bingbing Zhuang · Samuel Schulter · Buyu Liu · Enrique Dunn · Manmohan Chandraker | N/A | Code |
| ABCD: Arbitrary Bitwise Coefficient for De-Quantization | Woo Kyoung Han · Byeonghun Lee · Sang Hyun Park · Kyong Hwan Jin | N/A | Code |
| PromptCAL: Contrastive Affinity Learning via Auxiliary Prompts for Generalized Novel Category Discovery | Sheng Zhang · Salman Khan · Zhiqiang Shen · Muzammal Naseer · Guangyi Chen · Fahad Shahbaz Khan | N/A | Code |
| Mitigating Task Interference in Multi-Task Learning via Explicit Task Routing With Non-Learnable Primitives | Chuntao Ding · Zhichao Lu · Shangguang Wang · Ran Cheng · Vishnu Naresh Boddeti | N/A | Code |
| MaPLe: Multi-Modal Prompt Learning | Muhammad Uzair Khattak · Hanoona Rasheed · Muhammad Maaz · Salman Khan · Fahad Shahbaz Khan | N/A | Code |
| Revisiting Residual Networks for Adversarial Robustness | Shihua Huang · Zhichao Lu · Kalyanmoy Deb · Vishnu Naresh Boddeti | N/A | Code |
| Bridging the Gap Between Model Explanations in Partially Annotated Multi-Label Classification | Youngwook Kim · Jae Myung Kim · Jieun Jeong · Cordelia Schmid · Zeynep Akata · Jungwoo Lee | N/A | Code |
| Human Pose Estimation in Extremely Low-Light Conditions | Sohyun Lee · Jaesung Rim · Boseung Jeong · Geonu Kim · Byungju Woo · Haechan Lee · Sunghyun Cho · Suha Kwak | N/A | Code |
| Towards Robust Tampered Text Detection in Document Image: New Dataset and New Solution | Chenfan Qu · Chongyu Liu · Yuliang Liu · Xinhong Chen · Dezhi Peng · Fengjun Guo · Lianwen Jin | N/A | Code |
| SinGRAF: Learning a 3D Generative Radiance Field for a Single Scene | Minjung Son · Jeong Joon Park · Leonidas Guibas · Gordon Wetzstein | N/A | Code |
| LEGO-Net: Learning Regular Rearrangements of Objects in Rooms | Qiuhong Anna Wei · Sijie Ding · Jeong Joon Park · Rahul Sajnani · Adrien Poulenard · Srinath Sridhar · Leonidas Guibas | N/A | Code |
| MACARONS: Mapping and Coverage Anticipation With RGB Online Self-Supervision | Antoine Guédon · Tom Monnier · Pascal Monasse · Vincent Lepetit | N/A | Code |
| ALTO: Alternating Latent Topologies for Implicit 3D Reconstruction | Zhen Wang · Shijie Zhou · Jeong Joon Park · Despoina Paschalidou · Suya You · Gordon Wetzstein · Leonidas Guibas · Achuta Kadambi | N/A | Code |
| Class Prototypes Based Contrastive Learning for Classifying Multi-Label and Fine-Grained Educational Videos | Rohit Gupta · Anirban Roy · Claire Christensen · Sujeong Kim · Sarah Gerard · Madeline Cincebeaux · Ajay Divakaran · Todd Grindal · Mubarak Shah | N/A | Code |
| DeepMAD: Mathematical Architecture Design for Deep Convolutional Neural Network | Xuan Shen · Yaohua Wang · Ming Lin · Yilun Huang · Hao Tang · Xiuyu Sun · Yanzhi Wang | N/A | Code |
| ReLight My NeRF: A Dataset for Novel View Synthesis and Relighting of Real World Objects | Marco Toschi · Riccardo De Matteo · Riccardo Spezialetti · Daniele De Gregorio · Luigi Di Stefano · Samuele Salti | N/A | Code |
| Exact-NeRF: An Exploration of a Precise Volumetric Parameterization for Neural Radiance Fields | Brian K. S. Isaac-Medina · Chris G. Willcocks · Toby P. Breckon | N/A | Code |
| A Generalized Framework for Video Instance Segmentation | Miran Heo · Sukjun Hwang · Jeongseok Hyun · Hanjung Kim · Seoung Wug Oh · Joon-Young Lee · Seon Joo Kim | N/A | Code |
| Video Probabilistic Diffusion Models in Projected Latent Space | Sihyun Yu · Kihyuk Sohn · Subin Kim · Jinwoo Shin | N/A | Code |
| X-Avatar: Expressive Human Avatars | Kaiyue Shen · Chen Guo · Manuel Kaufmann · Juan Jose Zarate · Julien Valentin · Jie Song · Otmar Hilliges | N/A | Code |
| Hi4D: 4D Instance Segmentation of Close Human Interaction | Yifei Yin · Chen Guo · Manuel Kaufmann · Juan Jose Zarate · Jie Song · Otmar Hilliges | N/A | Code |
| Joint Appearance and Motion Learning for Efficient Rolling Shutter Correction | Bin Fan · Yuxin Mao · Mochu Xiang · Zhexiong Wan · Qi Liu | N/A | Code |
| Vita-CLIP: Video and Text Adaptive CLIP via Multimodal Prompting | Syed Talal Wasim · Muzammal Naseer · Salman Khan · Fahad Shahbaz Khan · Mubarak Shah | N/A | Code |
| MaskSketch: Unpaired Structure-Guided Masked Image Generation | Dina Bashkirova · José Lezama · Kihyuk Sohn · Kate Saenko · Irfan Essa | N/A | Code |
| Super-CLEVR: A Virtual Benchmark To Diagnose Domain Robustness in Visual Reasoning | Zhuowan Li · Xingrui Wang · Elias Stengel-Eskin · Adam Kortylewski · Wufei Ma · Benjamin Van Durme · Alan L. Yuille | N/A | Code |
| CREPE: Can Vision-Language Foundation Models Reason Compositionally? | Zixian Ma · Jerry Hong · Mustafa Omer Gul · Mona Gandhi · Irena Gao · Ranjay Krishna | N/A | Code |
| ORCa: Glossy Objects As Radiance-Field Cameras | Kushagra Tiwary · Akshat Dave · Nikhil Behari · Tzofi Klinghoffer · Ashok Veeraraghavan · Ramesh Raskar | N/A | Code |
| Learning Common Rationale To Improve Self-Supervised Representation for Fine-Grained Visual Recognition Problems | Yangyang Shu · Anton van den Hengel · Lingqiao Liu | N/A | Code |
| Implicit Occupancy Flow Fields for Perception and Prediction in Self-Driving | Ben Agro · Quinlan Sykora · Sergio Casas · Raquel Urtasun | N/A | Code |
| Improved Test-Time Adaptation for Domain Generalization | Liang Chen · Yong Zhang · Yibing Song · Ying Shan · Lingqiao Liu | N/A | Code |
| Wavelet Diffusion Models Are Fast and Scalable Image Generators | Hao Phung · Quan Dao · Anh Tran | N/A | Code |
| Robust Dynamic Radiance Fields | Yu-Lun Liu · Chen Gao · Andréas Meuleman · Hung-Yu Tseng · Ayush Saraf · Changil Kim · Yung-Yu Chuang · Johannes Kopf · Jia-Bin Huang | N/A | Code |
| MixSim: A Hierarchical Framework for Mixed Reality Traffic Simulation | Simon Suo · Kelvin Wong · Justin Xu · James Tu · Alexander Cui · Sergio Casas · Raquel Urtasun | N/A | Code |
| Gated Multi-Resolution Transfer Network for Burst Restoration and Enhancement | Nancy Mehta · Akshay Dudhane · Subrahmanyam Murala · Syed Waqas Zamir · Salman Khan · Fahad Shahbaz Khan | N/A | Code |
| Class Adaptive Network Calibration | Bingyuan Liu · Jérôme Rony · Adrian Galdran · Jose Dolz · Ismail Ben Ayed | N/A | Code |
| PROB: Probabilistic Objectness for Open World Object Detection | Orr Zohar · Kuan-Chieh Wang · Serena Yeung | N/A | Code |
| Matching Is Not Enough: A Two-Stage Framework for Category-Agnostic Pose Estimation | Min Shi · Zihao Huang · Xianzheng Ma · Xiaowei Hu · Zhiguo Cao | N/A | Code |
| HyperCUT: Video Sequence From a Single Blurry Image Using Unsupervised Ordering | Bang-Dang Pham · Phong Tran · Anh Tran · Cuong Pham · Rang Nguyen · Minh Hoai | N/A | Code |
| On the Effects of Self-Supervision and Contrastive Alignment in Deep Multi-View Clustering | Daniel J. Trosten · Sigurd Løkse · Robert Jenssen · Michael C. Kampffmeyer | N/A | Code |
| Visual Prompt Tuning for Generative Transfer Learning | Kihyuk Sohn · Huiwen Chang · José Lezama · Luisa Polania · Han Zhang · Yuan Hao · Irfan Essa · Lu Jiang | N/A | Code |
| Towards End-to-End Generative Modeling of Long Videos With Memory-Efficient Bidirectional Transformers | Jaehoon Yoo · Semin Kim · Doyup Lee · Chiheon Kim · Seunghoon Hong | N/A | Code |
| MAGVIT: Masked Generative Video Transformer | Lijun Yu · Yong Cheng · Kihyuk Sohn · José Lezama · Han Zhang · Huiwen Chang · Alexander G. Hauptmann · Ming-Hsuan Yang · Yuan Hao · Irfan Essa · Lu Jiang | N/A | Code |
| NICO++: Towards Better Benchmarking for Domain Generalization | Xingxuan Zhang · Yue He · Renzhe Xu · Han Yu · Zheyan Shen · Peng Cui | N/A | Code |
| Gradient Norm Aware Minimization Seeks First-Order Flatness and Improves Generalization | Xingxuan Zhang · Renzhe Xu · Han Yu · Hao Zou · Peng Cui | N/A | Code |
| All-in-Focus Imaging From Event Focal Stack | Hanyue Lou · Minggui Teng · Yixin Yang · Boxin Shi | N/A | Code |
| Clover: Towards a Unified Video-Language Alignment and Fusion Model | Jingjia Huang · Yinan Li · Jiashi Feng · Xinglong Wu · Xiaoshuai Sun · Rongrong Ji | N/A | Code |
| UMat: Uncertainty-Aware Single Image High Resolution Material Capture | Carlos Rodriguez-Pardo · Henar Domínguez-Elvira · David Pascual-Hernández · Elena Garces | N/A | Code |
| Polarimetric iToF: Measuring High-Fidelity Depth Through Scattering Media | Daniel S. Jeon · Andréas Meuleman · Seung-Hwan Baek · Min H. Kim | N/A | Code |
| Freestyle Layout-to-Image Synthesis | Han Xue · Zhiwu Huang · Qianru Sun · Li Song · Wenjun Zhang | N/A | Code |
| Nighttime Smartphone Reflective Flare Removal Using Optical Center Symmetry Prior | Yuekun Dai · Yihang Luo · Shangchen Zhou · Chongyi Li · Chen Change Loy | N/A | Code |
| Meta Omnium: A Benchmark for General-Purpose Learning-To-Learn | Ondrej Bohdal · Yinbing Tian · Yongshuo Zong · Ruchika Chavhan · Da Li · Henry Gouk · Li Guo · Timothy Hospedales | N/A | Code |
| EXCALIBUR: Encouraging and Evaluating Embodied Exploration | Hao Zhu · Raghav Kapoor · So Yeon Min · Winson Han · Jiatai Li · Kaiwen Geng · Graham Neubig · Yonatan Bisk · Aniruddha Kembhavi · Luca Weihs | N/A | Code |
| Detection of Out-of-Distribution Samples Using Binary Neuron Activation Patterns | Bartłomiej Olber · Krystian Radlak · Adam Popowicz · Michal Szczepankiewicz · Krystian Chachuła | N/A | Code |
| Shakes on a Plane: Unsupervised Depth Estimation From Unstabilized Photography | Ilya Chugunov · Yuxuan Zhang · Felix Heide | N/A | Code |
| JacobiNeRF: NeRF Shaping With Mutual Information Gradients | Xiaomeng Xu · Yanchao Yang · Kaichun Mo · Boxiao Pan · Li Yi · Leonidas Guibas | N/A | Code |
| MixMAE: Mixed and Masked Autoencoder for Efficient Pretraining of Hierarchical Vision Transformers | Jihao Liu · Xin Huang · Jinliang Zheng · Yu Liu · Hongsheng Li | N/A | Code |
| Synthesizing Photorealistic Virtual Humans Through Cross-Modal Disentanglement | Siddarth Ravichandran · Ondřej Texler · Dimitar Dinev · Hyun Jae Kang | N/A | Code |
| CIMI4D: A Large Multimodal Climbing Motion Dataset Under Human-Scene Interactions | Ming Yan · Xin Wang · Yudi Dai · Siqi Shen · Chenglu Wen · Lan Xu · Yuexin Ma · Cheng Wang | N/A | Code |
| SLOPER4D: A Scene-Aware Dataset for Global 4D Human Pose Estimation in Urban Environments | Yudi Dai · Yitai Lin · Xiping Lin · Chenglu Wen · Lan Xu · Hongwei Yi · Siqi Shen · Yuexin Ma · Cheng Wang | N/A | Code |
| Viewpoint Equivariance for Multi-View 3D Object Detection | Dian Chen · Jie Li · Vitor Guizilini · Rares Andrei Ambrus · Adrien Gaidon | N/A | Code |
| Balanced Product of Calibrated Experts for Long-Tailed Recognition | Emanuel Sanchez Aimar · Arvi Jonnarth · Michael Felsberg · Marco Kuhlmann | N/A | Code |
| Robust Mean Teacher for Continual and Gradual Test-Time Adaptation | Mario Döbler · Robert A. Marsden · Bin Yang | N/A | Code |
| Positive-Augmented Contrastive Learning for Image and Video Captioning Evaluation | Sara Sarto · Manuele Barraco · Marcella Cornia · Lorenzo Baraldi · Rita Cucchiara | N/A | Code |
| BITE: Beyond Priors for Improved Three-D Dog Pose Estimation | Nadine Rüegg · Shashank Tripathi · Konrad Schindler · Michael J. Black · Silvia Zuffi | N/A | Code |
| SparseFusion: Distilling View-Conditioned Diffusion for 3D Reconstruction | Zhizhuo Zhou · Shubham Tulsiani | N/A | Code |
| PeakConv: Learning Peak Receptive Field for Radar Semantic Segmentation | Liwen Zhang · Xinyan Zhang · Youcheng Zhang · Yufei Guo · Yuanpei Chen · Xuhui Huang · Zhe Ma | N/A | Code |
| Masked Wavelet Representation for Compact Neural Radiance Fields | Daniel Rho · Byeonghyeon Lee · Seungtae Nam · Joo Chan Lee · Jong Hwan Ko · Eunbyung Park | N/A | Code |
| Guided Depth Super-Resolution by Deep Anisotropic Diffusion | Nando Metzger · Rodrigo Caye Daudt · Konrad Schindler | N/A | Code |
| Masked Images Are Counterfactual Samples for Robust Fine-Tuning | Yao Xiao · Ziyi Tang · Pengxu Wei · Cong Liu · Liang Lin | N/A | Code |
| Unsupervised Deep Probabilistic Approach for Partial Point Cloud Registration | Guofeng Mei · Hao Tang · Xiaoshui Huang · Weijie Wang · Juan Liu · Jian Zhang · Luc Van Gool · Qiang Wu | N/A | Code |
| ECON: Explicit Clothed Humans Optimized via Normal Integration | Yuliang Xiu · Jinlong Yang · Xu Cao · Dimitrios Tzionas · Michael J. Black | N/A | Code |
| GEN: Pushing the Limits of Softmax-Based Out-of-Distribution Detection | Xixi Liu · Yaroslava Lochman · Christopher Zach | N/A | Code |
| OCTET: Object-Aware Counterfactual Explanations | Mehdi Zemni · Mickaël Chen · Éloi Zablocki · Hédi Ben-Younes · Patrick Pérez · Matthieu Cord | N/A | Code |
| Consistent View Synthesis With Pose-Guided Diffusion Models | Hung-Yu Tseng · Qinbo Li · Changil Kim · Suhib Alsisan · Jia-Bin Huang · Johannes Kopf | N/A | Code |
| GFPose: Learning 3D Human Pose Prior With Gradient Fields | Hai Ci · Mingdong Wu · Wentao Zhu · Xiaoxuan Ma · Hao Dong · Fangwei Zhong · Yizhou Wang | N/A | Code |
| Bayesian Posterior Approximation With Stochastic Ensembles | Oleksandr Balabanov · Bernhard Mehlig · Hampus Linander | N/A | Code |
| Spatio-Focal Bidirectional Disparity Estimation From a Dual-Pixel Image | Donggun Kim · Hyeonjoong Jang · Inchul Kim · Min H. Kim | N/A | Code |
| Octree Guided Unoriented Surface Reconstruction | Chamin Hewa Koneputugodage · Yizhak Ben-Shabat · Stephen Gould | N/A | Code |
| HAAV: Hierarchical Aggregation of Augmented Views for Image Captioning | Chia-Wen Kuo · Zsolt Kira | N/A | Code |
| SUDS: Scalable Urban Dynamic Scenes | Haithem Turki · Jason Y. Zhang · Francesco Ferroni · Deva Ramanan | N/A | Code |
| Harmonious Feature Learning for Interactive Hand-Object Pose Estimation | Zhifeng Lin · Changxing Ding · Huan Yao · Zengsheng Kuang · Shaoli Huang | N/A | Code |
| Modernizing Old Photos Using Multiple References via Photorealistic Style Transfer | Agus Gunawan · Soo Ye Kim · Hyeonjun Sim · Jae-Ho Lee · Munchurl Kim | N/A | Code |
| Trainable Projected Gradient Method for Robust Fine-Tuning | Junjiao Tian · Zecheng He · Xiaoliang Dai · Chih-Yao Ma · Yen-Cheng Liu · Zsolt Kira | N/A | Code |
| OReX: Object Reconstruction From Planar Cross-Sections Using Neural Fields | Haim Sawdayee · Amir Vaxman · Amit H. Bermano | N/A | Code |
| CARTO: Category and Joint Agnostic Reconstruction of ARTiculated Objects | Nick Heppert · Zubair Irshad · Sergey Zakharov · Katherine Liu · Rares Andrei Ambrus · Jeannette Bohg · Abhinav Valada · Thomas Kollar | N/A | Code |
| ACR: Attention Collaboration-Based Regressor for Arbitrary Two-Hand Reconstruction | Zhengdi Yu · Shaoli Huang · Chen Fang · Toby P. Breckon · Jue Wang | N/A | Code |
| Perception and Semantic Aware Regularization for Sequential Confidence Calibration | Zhenghua Peng · Yu Luo · Tianshui Chen · Keke Xu · Shuangping Huang | N/A | Code |
| Crowd3D: Towards Hundreds of People Reconstruction From a Single Image | Hao Wen · Jing Huang · Huili Cui · Haozhe Lin · Yu-Kun Lai · Lu Fang · Kun Li | N/A | Code |
| ZegCLIP: Towards Adapting CLIP for Zero-Shot Semantic Segmentation | Ziqin Zhou · Yinjie Lei · Bowen Zhang · Lingqiao Liu · Yifan Liu | N/A | Code |
| Learning Semantic-Aware Disentangled Representation for Flexible 3D Human Body Editing | Xiaokun Sun · Qiao Feng · Xiongzheng Li · Jinsong Zhang · Yu-Kun Lai · Jingyu Yang · Kun Li | N/A | Code |
| Skinned Motion Retargeting With Residual Perception of Motion Semantics & Geometry | Jiaxu Zhang · Junwu Weng · Di Kang · Fang Zhao · Shaoli Huang · Xuefei Zhe · Linchao Bao · Ying Shan · Jue Wang · Zhigang Tu | N/A | Code |
| Unknown Sniffer for Object Detection: Don’t Turn a Blind Eye to Unknown Objects | Wenteng Liang · Feng Xue · Yihao Liu · Guofeng Zhong · Anlong Ming | N/A | Code |
| RangeViT: Towards Vision Transformers for 3D Semantic Segmentation in Autonomous Driving | Angelika Ando · Spyros Gidaris · Andrei Bursuc · Gilles Puy · Alexandre Boulch · Renaud Marlet | N/A | Code |
| 3D-Aware Facial Landmark Detection via Multi-View Consistent Training on Synthetic Data | Libing Zeng · Lele Chen · Wentao Bao · Zhong Li · Yi Xu · Junsong Yuan · Nima Khademi Kalantari | N/A | Code |
| Best of Both Worlds: Multimodal Contrastive Learning With Tabular and Imaging Data | Paul Hager · Martin J. Menten · Daniel Rueckert | N/A | Code |
| JRDB-Pose: A Large-Scale Dataset for Multi-Person Pose Estimation and Tracking | Edward Vendrow · Tho Le · Jianfei Cai · Hamid Rezatofighi | N/A | Code |
| Consistent Direct Time-of-Flight Video Depth Super-Resolution | Zhanghao Sun · Wei Ye · Jinhui Xiong · Gyeongmin Choe · Jialiang Wang · Shuochen Su · Rakesh Ranjan | N/A | Code |
| Correlational Image Modeling for Self-Supervised Visual Pre-Training | Wei Li · Jiahao Xie · Chen Change Loy | N/A | Code |
| CelebV-Text: A Large-Scale Facial Text-Video Dataset | Jianhui Yu · Hao Zhu · Liming Jiang · Chen Change Loy · Weidong Cai · Wayne Wu | N/A | Code |
| Are Binary Annotations Sufficient? Video Moment Retrieval via Hierarchical Uncertainty-Based Active Learning | Wei Ji · Renjie Liang · Zhedong Zheng · Wenqiao Zhang · Shengyu Zhang · Juncheng Li · Mengze Li · Tat-Seng Chua | N/A | Code |
| Learning 3D Scene Priors With 2D Supervision | Yinyu Nie · Angela Dai · Xiaoguang Han · Matthias Nießner | N/A | Code |
| Generating Aligned Pseudo-Supervision From Non-Aligned Data for Image Restoration in Under-Display Camera | Ruicheng Feng · Chongyi Li · Huaijin Chen · Shuai Li · Jinwei Gu · Chen Change Loy | N/A | Code |
| Siamese DETR | Zeren Chen · Gengshi Huang · Wei Li · Jianing Teng · Kun Wang · Jing Shao · Chen Change Loy · Lu Sheng | N/A | Code |
| Panoptic Video Scene Graph Generation | Jingkang Yang · Wenxuan Peng · Xiangtai Li · Zujin Guo · Liangyu Chen · Bo Li · Zheng Ma · Kaiyang Zhou · Wayne Zhang · Chen Change Loy · Ziwei Liu | N/A | Code |
| Randomized Adversarial Training via Taylor Expansion | Gaojie Jin · Xinping Yi · Dengyu Wu · Ronghui Mu · Xiaowei Huang | N/A | Code |
| Task Residual for Tuning Vision-Language Models | Tao Yu · Zhihe Lu · Xin Jin · Zhibo Chen · Xinchao Wang | N/A | Code |
| PACO: Parts and Attributes of Common Objects | Vignesh Ramanathan · Anmol Kalia · Vladan Petrovic · Yi Wen · Baixue Zheng · Baishan Guo · Rui Wang · Aaron Marquez · Rama Kovvuri · Abhishek Kadian · Amir Mousavi · Yiwen Song · Abhimanyu Dubey · Dhruv Mahajan | N/A | Code |
| CloSET: Modeling Clothed Humans on Continuous Surface With Explicit Template Decomposition | Hongwen Zhang · Siyou Lin · Ruizhi Shao · Yuxiang Zhang · Zerong Zheng · Han Huang · Yandong Guo · Yebin Liu | N/A | Code |
| Transductive Few-Shot Learning With Prototype-Based Label Propagation by Iterative Graph Refinement | Hao Zhu · Piotr Koniusz | N/A | Code |
| DualVector: Unsupervised Vector Font Synthesis With Dual-Part Representation | Ying-Tian Liu · Zhifei Zhang · Yuan-Chen Guo · Matthew Fisher · Zhaowen Wang · Song-Hai Zhang | N/A | Code |
| Invertible Neural Skinning | Yash Kant · Aliaksandr Siarohin · Riza Alp Guler · Menglei Chai · Jian Ren · Sergey Tulyakov · Igor Gilitschenski | N/A | Code |
| Next3D: Generative Neural Texture Rasterization for 3D-Aware Head Avatars | Jingxiang Sun · Xuan Wang · Lizhen Wang · Xiaoyu Li · Yong Zhang · Hongwen Zhang · Yebin Liu | N/A | Code |
| Is BERT Blind? Exploring the Effect of Vision-and-Language Pretraining on Visual Language Understanding | Morris Alper · Michael Fiman · Hadar Averbuch-Elor | N/A | Code |
| ConStruct-VL: Data-Free Continual Structured VL Concepts Learning | James Seale Smith · Paola Cascante-Bonilla · Assaf Arbelle · Donghyun Kim · Rameswar Panda · David Cox · Diyi Yang · Zsolt Kira · Rogerio Feris · Leonid Karlinsky | N/A | Code |
| LINe: Out-of-Distribution Detection by Leveraging Important Neurons | Yong Hyun Ahn · Gyeong-Moon Park · Seong Tae Kim | N/A | Code |
| Multimodality Helps Unimodality: Cross-Modal Few-Shot Learning With Multimodal Models | Zhiqiu Lin · Samuel Yu · Zhiyi Kuang · Deepak Pathak · Deva Ramanan | N/A | Code |
| Panoptic Lifting for 3D Scene Understanding With Neural Fields | Yawar Siddiqui · Lorenzo Porzi · Samuel Rota Bulò · Norman Müller · Matthias Nießner · Angela Dai · Peter Kontschieder | N/A | Code |
| GamutMLP: A Lightweight MLP for Color Loss Recovery | Hoang M. Le · Brian Price · Scott Cohen · Michael S. Brown | N/A | Code |
| DIFu: Depth-Guided Implicit Function for Clothed Human Reconstruction | Dae-Young Song · HeeKyung Lee · Jeongil Seo · Donghyeon Cho | N/A | Code |
| NLOST: Non-Line-of-Sight Imaging With Transformer | Yue Li · Jiayong Peng · Juntian Ye · Yueyi Zhang · Feihu Xu · Zhiwei Xiong | N/A | Code |
| SeaThru-NeRF: Neural Radiance Fields in Scattering Media | Deborah Levy · Amit Peleg · Naama Pearl · Dan Rosenbaum · Derya Akkaynak · Simon Korman · Tali Treibitz | N/A | Code |
| Omni3D: A Large Benchmark and Model for 3D Object Detection in the Wild | Garrick Brazil · Abhinav Kumar · Julian Straub · Nikhila Ravi · Justin Johnson · Georgia Gkioxari | N/A | Code |
| Learning on Gradients: Generalized Artifacts Representation for GAN-Generated Images Detection | Chuangchuang Tan · Yao Zhao · Shikui Wei · Guanghua Gu · Yunchao Wei | N/A | Code |
| LayoutDM: Discrete Diffusion Model for Controllable Layout Generation | Naoto Inoue · Kotaro Kikuchi · Edgar Simo-Serra · Mayu Otani · Kota Yamaguchi | N/A | Code |
| Learning Customized Visual Models With Retrieval-Augmented Knowledge | Haotian Liu · Kilho Son · Jianwei Yang · Ce Liu · Jianfeng Gao · Yong Jae Lee · Chunyuan Li | N/A | Code |
| MAIR: Multi-View Attention Inverse Rendering With 3D Spatially-Varying Lighting Estimation | JunYong Choi · SeokYeong Lee · Haesol Park · Seung-Won Jung · Ig-Jae Kim · Junghyun Cho | N/A | Code |
| Generalizing Dataset Distillation via Deep Generative Prior | George Cazenavette · Tongzhou Wang · Antonio Torralba · Alexei A. Efros · Jun-Yan Zhu | N/A | Code |
| Polarized Color Image Denoising | Zhuoxiao Li · Haiyang Jiang · Mingdeng Cao · Yinqiang Zheng | N/A | Code |
| Score Jacobian Chaining: Lifting Pretrained 2D Diffusion Models for 3D Generation | Haochen Wang · Xiaodan Du · Jiahao Li · Raymond A. Yeh · Greg Shakhnarovich | N/A | Code |
| FJMP: Factorized Joint Multi-Agent Motion Prediction Over Learned Directed Acyclic Interaction Graphs | Luke Rowe · Martin Ethier · Eli-Henry Dykhne · Krzysztof Czarnecki | N/A | Code |
| Mask-Free Video Instance Segmentation | Lei Ke · Martin Danelljan · Henghui Ding · Yu-Wing Tai · Chi-Keung Tang · Fisher Yu | N/A | Code |
| OVTrack: Open-Vocabulary Multiple Object Tracking | Siyuan Li · Tobias Fischer · Lei Ke · Henghui Ding · Martin Danelljan · Fisher Yu | N/A | Code |
| LightPainter: Interactive Portrait Relighting With Freehand Scribble | Yiqun Mei · He Zhang · Xuaner Zhang · Jianming Zhang · Zhixin Shu · Yilin Wang · Zijun Wei · Shi Yan · HyunJoon Jung · Vishal M. Patel | N/A | Code |
| Towards Scalable Neural Representation for Diverse Videos | Bo He · Xitong Yang · Hanyu Wang · Zuxuan Wu · Hao Chen · Shuaiyi Huang · Yixuan Ren · Ser-Nam Lim · Abhinav Shrivastava | N/A | Code |
| Teaching Matters: Investigating the Role of Supervision in Vision Transformers | Matthew Walmer · Saksham Suri · Kamal Gupta · Abhinav Shrivastava | N/A | Code |
| FlexNeRF: Photorealistic Free-Viewpoint Rendering of Moving Humans From Sparse Views | Vinoj Jayasundara · Amit Agrawal · Nicolas Heron · Abhinav Shrivastava · Larry S. Davis | N/A | Code |
| Leveraging Temporal Context in Low Representational Power Regimes | Camilo L. Fosco · SouYoung Jin · Emilie Josephs · Aude Oliva | N/A | Code |
| Painting 3D Nature in 2D: View Synthesis of Natural Scenes From a Single Semantic Mask | Shangzhan Zhang · Sida Peng · Tianrun Chen · Linzhan Mou · Haotong Lin · Kaicheng Yu · Yiyi Liao · Xiaowei Zhou | N/A | Code |
| Align and Attend: Multimodal Summarization With Dual Contrastive Losses | Bo He · Jun Wang · Jielin Qiu · Trung Bui · Abhinav Shrivastava · Zhaowen Wang | N/A | Code |
| SimpSON: Simplifying Photo Cleanup With Single-Click Distracting Object Segmentation Network | Chuong Huynh · Yuqian Zhou · Zhe Lin · Connelly Barnes · Eli Shechtman · Sohrab Amirghodsi · Abhinav Shrivastava | N/A | Code |
| NIRVANA: Neural Implicit Representations of Videos With Adaptive Networks and Autoregressive Patch-Wise Modeling | Shishira R Maiya · Sharath Girish · Max Ehrlich · Hanyu Wang · Kwot Sin Lee · Patrick Poirson · Pengxiang Wu · Chen Wang · Abhinav Shrivastava | N/A | Code |
| Seeing Beyond the Brain: Conditional Diffusion Model With Sparse Masked Modeling for Vision Decoding | Zijiao Chen · Jiaxin Qing · Tiange Xiang · Wan Lin Yue · Juan Helen Zhou | N/A | Code |
| Position-Guided Text Prompt for Vision-Language Pre-Training | Jinpeng Wang · Pan Zhou · Mike Zheng Shou · Shuicheng Yan | N/A | Code |
| Progressive Spatio-Temporal Alignment for Efficient Event-Based Motion Estimation | Xueyan Huang · Yueyi Zhang · Zhiwei Xiong | N/A | Code |
| MIST: Multi-Modal Iterative Spatial-Temporal Transformer for Long-Form Video Question Answering | Difei Gao · Luowei Zhou · Lei Ji · Linchao Zhu · Yi Yang · Mike Zheng Shou | N/A | Code |
| Histopathology Whole Slide Image Analysis With Heterogeneous Graph Representation Learning | Tsai Hor Chan · Fernando Julio Cendra · Lan Ma · Guosheng Yin · Lequan Yu | N/A | Code |
| Making Vision Transformers Efficient From a Token Sparsification View | Shuning Chang · Pichao Wang · Ming Lin · Fan Wang · David Junhao Zhang · Rong Jin · Mike Zheng Shou | N/A | Code |
| Leverage Interactive Affinity for Affordance Learning | Hongchen Luo · Wei Zhai · Jing Zhang · Yang Cao · Dacheng Tao | N/A | Code |
| Uncertainty-Aware Optimal Transport for Semantically Coherent Out-of-Distribution Detection | Fan Lu · Kai Zhu · Wei Zhai · Kecheng Zheng · Yang Cao | N/A | Code |
| HARP: Personalized Hand Reconstruction From a Monocular RGB Video | Korrawe Karunratanakul · Sergey Prokudin · Otmar Hilliges · Siyu Tang | N/A | Code |
| Towards Effective Visual Representations for Partial-Label Learning | Shiyu Xia · Jiaqi Lv · Ning Xu · Gang Niu · Xin Geng | N/A | Code |
| SFD2: Semantic-Guided Feature Detection and Description | Fei Xue · Ignas Budvytis · Roberto Cipolla | N/A | Code |
| MetaPortrait: Identity-Preserving Talking Head Generation With Fast Personalized Adaptation | Bowen Zhang · Chenyang Qi · Pan Zhang · Bo Zhang · HsiangTao Wu · Dong Chen · Qifeng Chen · Yong Wang · Fang Wen | N/A | Code |
| The Dialog Must Go On: Improving Visual Dialog via Generative Self-Training | Gi-Cheon Kang · Sungdong Kim · Jin-Hwa Kim · Donghyun Kwak · Byoung-Tak Zhang | N/A | Code |
| Temporal Interpolation Is All You Need for Dynamic Neural Radiance Fields | Sungheon Park · Minjung Son · Seokhwan Jang · Young Chun Ahn · Ji-Yeon Kim · Nahyup Kang | N/A | Code |
| DiGA: Distil To Generalize and Then Adapt for Domain Adaptive Semantic Segmentation | Fengyi Shen · Akhil Gurram · Ziyuan Liu · He Wang · Alois Knoll | N/A | Code |
| Multimodal Prompting With Missing Modalities for Visual Recognition | Yi-Lun Lee · Yi-Hsuan Tsai · Wei-Chen Chiu · Chen-Yu Lee | N/A | Code |
| On Calibrating Semantic Segmentation Models: Analyses and an Algorithm | Dongdong Wang · Boqing Gong · Liqiang Wang | N/A | Code |
| IMP: Iterative Matching and Pose Estimation With Adaptive Pooling | Fei Xue · Ignas Budvytis · Roberto Cipolla | N/A | Code |
| Grid-Guided Neural Radiance Fields for Large Urban Scenes | Linning Xu · Yuanbo Xiangli · Sida Peng · Xingang Pan · Nanxuan Zhao · Christian Theobalt · Bo Dai · Dahua Lin | N/A | Code |
| Neural Voting Field for Camera-Space 3D Hand Pose Estimation | Lin Huang · Chung-Ching Lin · Kevin Lin · Lin Liang · Lijuan Wang · Junsong Yuan · Zicheng Liu | N/A | Code |
| Dense Network Expansion for Class Incremental Learning | Zhiyuan Hu · Yunsheng Li · Jiancheng Lyu · Dashan Gao · Nuno Vasconcelos | N/A | Code |
| FashionSAP: Symbols and Attributes Prompt for Fine-Grained Fashion Vision-Language Pre-Training | Yunpeng Han · Lisai Zhang · Qingcai Chen · Zhijian Chen · Zhonghua Li · Jianxin Yang · Zhao Cao | N/A | Code |
| Batch Model Consolidation: A Multi-Task Model Consolidation Framework | Iordanis Fostiropoulos · Jiaye Zhu · Laurent Itti | N/A | Code |
| Open-Vocabulary Attribute Detection | María A. Bravo · Sudhanshu Mittal · Simon Ging · Thomas Brox | N/A | Code |
| Unite and Conquer: Plug & Play Multi-Modal Synthesis Using Diffusion Models | Nithin Gopalakrishnan Nair · Wele Gedara Chaminda Bandara · Vishal M. Patel | N/A | Code |
| BEVHeight: A Robust Framework for Vision-Based Roadside 3D Object Detection | Lei Yang · Kaicheng Yu · Tao Tang · Jun Li · Kun Yuan · Li Wang · Xinyu Zhang · Peng Chen | N/A | Code |
| Re2TAL: Rewiring Pretrained Video Backbones for Reversible Temporal Action Localization | Chen Zhao · Shuming Liu · Karttikeya Mangalam · Bernard Ghanem | N/A | Code |
| C-SFDA: A Curriculum Learning Aided Self-Training Framework for Efficient Source Free Domain Adaptation | Nazmul Karim · Niluthpol Chowdhury Mithun · Abhinav Rajvanshi · Han-pang Chiu · Supun Samarasekera · Nazanin Rahnavard | N/A | Code |
| Are Deep Neural Networks SMARTer Than Second Graders? | Anoop Cherian · Kuan-Chuan Peng · Suhas Lohit · Kevin A. Smith · Joshua B. Tenenbaum | N/A | Code |
| Persistent Nature: A Generative Model of Unbounded 3D Worlds | Lucy Chai · Richard Tucker · Zhengqi Li · Phillip Isola · Noah Snavely | N/A | Code |
| InternImage: Exploring Large-Scale Vision Foundation Models With Deformable Convolutions | Wenhai Wang · Jifeng Dai · Zhe Chen · Zhenhang Huang · Zhiqi Li · Xizhou Zhu · Xiaowei Hu · Tong Lu · Lewei Lu · Hongsheng Li · Xiaogang Wang · Yu Qiao | N/A | Code |
| Learning To Fuse Monocular and Multi-View Cues for Multi-Frame Depth Estimation in Dynamic Scenes | Rui Li · Dong Gong · Wei Yin · Hao Chen · Yu Zhu · Kaixuan Wang · Xiaozhi Chen · Jinqiu Sun · Yanning Zhang | N/A | Code |
| Benchmarking Self-Supervised Learning on Diverse Pathology Datasets | Mingu Kang · Heon Song · Seonwook Park · Donggeun Yoo · Sérgio Pereira | N/A | Code |
| Knowledge Distillation for 6D Pose Estimation by Aligning Distributions of Local Predictions | Shuxuan Guo · Yinlin Hu · Jose M. Alvarez · Mathieu Salzmann | N/A | Code |
| Self-Supervised Representation Learning for CAD | Benjamin T. Jones · Michael Hu · Milin Kodnongbua · Vladimir G. Kim · Adriana Schulz | N/A | Code |
| SparseViT: Revisiting Activation Sparsity for Efficient High-Resolution Vision Transformer | Xuanyao Chen · Zhijian Liu · Haotian Tang · Li Yi · Hang Zhao · Song Han | N/A | Code |
| Neural Pixel Composition for 3D-4D View Synthesis From Multi-Views | Aayush Bansal · Michael Zollhöfer | N/A | Code |
| ViP3D: End-to-End Visual Trajectory Prediction via 3D Agent Queries | Junru Gu · Chenxu Hu · Tianyuan Zhang · Xuanyao Chen · Yilun Wang · Yue Wang · Hang Zhao | N/A | Code |
| AdaMAE: Adaptive Masking for Efficient Spatiotemporal Learning With Masked Autoencoders | Wele Gedara Chaminda Bandara · Naman Patel · Ali Gholami · Mehdi Nikkhah · Motilal Agrawal · Vishal M. Patel | N/A | Code |
| Masked Scene Contrast: A Scalable Framework for Unsupervised 3D Representation Learning | Xiaoyang Wu · Xin Wen · Xihui Liu · Hengshuang Zhao | N/A | Code |
| RIFormer: Keep Your Vision Backbone Effective but Removing Token Mixer | Jiahao Wang · Songyang Zhang · Yong Liu · Taiqiang Wu · Yujiu Yang · Xihui Liu · Kai Chen · Ping Luo · Dahua Lin | N/A | Code |
| TeSLA: Test-Time Self-Learning With Automatic Adversarial Augmentation | Devavrat Tomar · Guillaume Vray · Behzad Bozorgtabar · Jean-Philippe Thiran | N/A | Code |
| ObjectMatch: Robust Registration Using Canonical Object Correspondences | Can Gümeli · Angela Dai · Matthias Nießner | N/A | Code |
| Dream3D: Zero-Shot Text-to-3D Synthesis Using 3D Shape Prior and Text-to-Image Diffusion Models | Jiale Xu · Xintao Wang · Weihao Cheng · Yan-Pei Cao · Ying Shan · Xiaohu Qie · Shenghua Gao | N/A | Code |
| SurfelNeRF: Neural Surfel Radiance Fields for Online Photorealistic Reconstruction of Indoor Scenes | Yiming Gao · Yan-Pei Cao · Ying Shan | N/A | Code |
| Object Detection With Self-Supervised Scene Adaptation | Zekun Zhang · Minh Hoai | N/A | Code |
| Megahertz Light Steering Without Moving Parts | Adithya Pediredla · Srinivasa G. Narasimhan · Maysamreza Chamanzar · Ioannis Gkioulekas | N/A | Code |
| ISBNet: A 3D Point Cloud Instance Segmentation Network With Instance-Aware Sampling and Box-Aware Dynamic Convolution | Tuan Duc Ngo · Binh-Son Hua · Khoi Nguyen | N/A | Code |
| Rate Gradient Approximation Attack Threats Deep Spiking Neural Networks | Tong Bu · Jianhao Ding · Zecheng Hao · Zhaofei Yu | N/A | Code |
| PIVOT: Prompting for Video Continual Learning | Andrés Villa · Juan León Alcázar · Motasem Alfarra · Kumail Alhamoud · Julio Hurtado · Fabian Caba Heilbron · Alvaro Soto · Bernard Ghanem | N/A | Code |
| ARO-Net: Learning Implicit Fields From Anchored Radial Observations | Yizhi Wang · Zeyu Huang · Ariel Shamir · Hui Huang · Hao Zhang · Ruizhen Hu | N/A | Code |
| Parallel Diffusion Models of Operator and Image for Blind Inverse Problems | Hyungjin Chung · Jeongsol Kim · Sehui Kim · Jong Chul Ye | N/A | Code |
| Solving 3D Inverse Problems Using Pre-Trained 2D Diffusion Models | Hyungjin Chung · Dohoon Ryu · Michael T. McCann · Marc L. Klasky · Jong Chul Ye | N/A | Code |
| Affordance Grounding From Demonstration Video To Target Image | Joya Chen · Difei Gao · Kevin Qinghong Lin · Mike Zheng Shou | N/A | Code |
| Learning Procedure-Aware Video Representation From Instructional Videos and Their Narrations | Yiwu Zhong · Licheng Yu · Yang Bai · Shangwen Li · Xueting Yan · Yin Li | N/A | Code |
| YOLOv7: Trainable Bag-of-Freebies Sets New State-of-the-Art for Real-Time Object Detectors | Chien-Yao Wang · Alexey Bochkovskiy · Hong-Yuan Mark Liao | N/A | Code |
| OmniCity: Omnipotent City Understanding With Multi-Level and Multi-View Images | Weijia Li · Yawen Lai · Linning Xu · Yuanbo Xiangli · Jinhua Yu · Conghui He · Gui-Song Xia · Dahua Lin | N/A | Code |
| Object Discovery From Motion-Guided Tokens | Zhipeng Bao · Pavel Tokmakov · Yu-Xiong Wang · Adrien Gaidon · Martial Hebert | N/A | Code |
| MP-Former: Mask-Piloted Transformer for Image Segmentation | Hao Zhang · Feng Li · Huaizhe Xu · Shijia Huang · Shilong Liu · Lionel M. Ni · Lei Zhang | N/A | Code |
| Disentangling Writer and Character Styles for Handwriting Generation | Gang Dai · Yifan Zhang · Qingfeng Wang · Qing Du · Zhuliang Yu · Zhuoman Liu · Shuangping Huang | N/A | Code |
| Building Rearticulable Models for Arbitrary 3D Objects From 4D Point Clouds | Shaowei Liu · Saurabh Gupta · Shenlong Wang | N/A | Code |
| Gated Stereo: Joint Depth Estimation From Gated and Wide-Baseline Active Stereo Cues | Stefanie Walz · Mario Bijelic · Andrea Ramazzina · Amanpreet Walia · Fahim Mannan · Felix Heide | N/A | Code |
| Dynamically Instance-Guided Adaptation: A Backward-Free Approach for Test-Time Domain Adaptive Semantic Segmentation | Wei Wang · Zhun Zhong · Weijie Wang · Xi Chen · Charles Ling · Boyu Wang · Nicu Sebe | N/A | Code |
| Perspective Fields for Single Image Camera Calibration | Linyi Jin · Jianming Zhang · Yannick Hold-Geoffroy · Oliver Wang · Kevin Blackburn-Matzen · Matthew Sticha · David F. Fouhey | N/A | Code |
| Vision Transformers Are Parameter-Efficient Audio-Visual Learners | Yan-Bo Lin · Yi-Lin Sung · Jie Lei · Mohit Bansal · Gedas Bertasius | N/A | Code |
| Efficient Semantic Segmentation by Altering Resolutions for Compressed Videos | Yubin Hu · Yuze He · Yanghao Li · Jisheng Li · Yuxing Han · Jiangtao Wen · Yong-Jin Liu | N/A | Code |
| DisWOT: Student Architecture Search for Distillation WithOut Training | Peijie Dong · Lujun Li · Zimian Wei | N/A | Code |
| Activating More Pixels in Image Super-Resolution Transformer | Xiangyu Chen · Xintao Wang · Jiantao Zhou · Yu Qiao · Chao Dong | N/A | Code |
| You Only Segment Once: Towards Real-Time Panoptic Segmentation | Jie Hu · Linyan Huang · Tianhe Ren · Shengchuan Zhang · Rongrong Ji · Liujuan Cao | N/A | Code |
| PA&DA: Jointly Sampling Path and Data for Consistent NAS | Shun Lu · Yu Hu · Longxing Yang · Zihao Sun · Jilin Mei · Jianchao Tan · Chengru Song | N/A | Code |
| NeuralUDF: Learning Unsigned Distance Fields for Multi-View Reconstruction of Surfaces With Arbitrary Topologies | Xiaoxiao Long · Cheng Lin · Lingjie Liu · Yuan Liu · Peng Wang · Christian Theobalt · Taku Komura · Wenping Wang | N/A | Code |
| Towards Universal Fake Image Detectors That Generalize Across Generative Models | Utkarsh Ojha · Yuheng Li · Yong Jae Lee | N/A | Code |
| FLAG3D: A 3D Fitness Activity Dataset With Language Instruction | Yansong Tang · Jinpeng Liu · Aoyang Liu · Bin Yang · Wenxun Dai · Yongming Rao · Jiwen Lu · Jie Zhou · Xiu Li | N/A | Code |
| NEF: Neural Edge Fields for 3D Parametric Curve Reconstruction From Multi-View Images | Yunfan Ye · Renjiao Yi · Zhirui Gao · Chenyang Zhu · Zhiping Cai · Kai Xu | N/A | Code |
| Executing Your Commands via Motion Diffusion in Latent Space | Xin Chen · Biao Jiang · Wen Liu · Zilong Huang · Bin Fu · Tao Chen · Gang Yu | N/A | Code |
| MSINet: Twins Contrastive Search of Multi-Scale Interaction for Object ReID | Jianyang Gu · Kai Wang · Hao Luo · Chen Chen · Wei Jiang · Yuqiang Fang · Shanghang Zhang · Yang You · Jian Zhao | N/A | Code |
| SunStage: Portrait Reconstruction and Relighting Using the Sun as a Light Stage | Yifan Wang · Aleksander Holynski · Xiuming Zhang · Xuaner Zhang | N/A | Code |
| IS-GGT: Iterative Scene Graph Generation With Generative Transformers | Sanjoy Kundu · Sathyanarayanan N. Aakur | N/A | Code |
| DisCoScene: Spatially Disentangled Generative Radiance Fields for Controllable 3D-Aware Scene Synthesis | Yinghao Xu · Menglei Chai · Zifan Shi · Sida Peng · Ivan Skorokhodov · Aliaksandr Siarohin · Ceyuan Yang · Yujun Shen · Hsin-Ying Lee · Bolei Zhou · Sergey Tulyakov | N/A | Code |
| Breaking the “Object” in Video Object Segmentation | Pavel Tokmakov · Jie Li · Adrien Gaidon | N/A | Code |
| SimpleNet: A Simple Network for Image Anomaly Detection and Localization | Zhikang Liu · Yiming Zhou · Yuansheng Xu · Zilei Wang | N/A | Code |
| Taming Diffusion Models for Audio-Driven Co-Speech Gesture Generation | Lingting Zhu · Xian Liu · Xuanyu Liu · Rui Qian · Ziwei Liu · Lequan Yu | N/A | Code |
| Top-Down Visual Attention From Analysis by Synthesis | Baifeng Shi · Trevor Darrell · Xin Wang | N/A | Code |
| Disentangling Orthogonal Planes for Indoor Panoramic Room Layout Estimation With Cross-Scale Distortion Awareness | Zhijie Shen · Zishuo Zheng · Chunyu Lin · Lang Nie · Kang Liao · Shuai Zheng · Yao Zhao | N/A | Code |
| Global-to-Local Modeling for Video-Based 3D Human Pose and Shape Estimation | Xiaolong Shen · Zongxin Yang · Xiaohan Wang · Jianxin Ma · Chang Zhou · Yi Yang | N/A | Code |
| RaBit: Parametric Modeling of 3D Biped Cartoon Characters With a Topological-Consistent Dataset | Zhongjin Luo · Shengcai Cai · Jinguo Dong · Ruibo Ming · Liangdong Qiu · Xiaohang Zhan · Xiaoguang Han | N/A | Code |
| Masked Image Modeling With Local Multi-Scale Reconstruction | Haoqing Wang · Yehui Tang · Yunhe Wang · Jianyuan Guo · Zhi-Hong Deng · Kai Han | N/A | Code |
| Omni Aggregation Networks for Lightweight Image Super-Resolution | Hang Wang · Xuanhong Chen · Bingbing Ni · Yutian Liu · Jinfan Liu | N/A | Code |
| TryOnDiffusion: A Tale of Two UNets | Luyang Zhu · Dawei Yang · Tyler Zhu · Fitsum Reda · William Chan · Chitwan Saharia · Mohammad Norouzi · Ira Kemelmacher-Shlizerman | N/A | Code |
| MoLo: Motion-Augmented Long-Short Contrastive Learning for Few-Shot Action Recognition | Xiang Wang · Shiwei Zhang · Zhiwu Qing · Changxin Gao · Yingya Zhang · Deli Zhao · Nong Sang | N/A | Code |
| Dynamic Aggregated Network for Gait Recognition | Kang Ma · Ying Fu · Dezhi Zheng · Chunshui Cao · Xuecai Hu · Yongzhen Huang | N/A | Code |
| Equivalent Transformation and Dual Stream Network Construction for Mobile Image Super-Resolution | Jiahao Chao · Zhou Zhou · Hongfan Gao · Jiali Gong · Zhengfeng Yang · Zhenbing Zeng · Lydia Dehbi | N/A | Code |
| Semi-Supervised Hand Appearance Recovery via Structure Disentanglement and Dual Adversarial Discrimination | Zimeng Zhao · Binghui Zuo · Zhiyu Long · Yangang Wang | N/A | Code |
| DBARF: Deep Bundle-Adjusting Generalizable Neural Radiance Fields | Yu Chen · Gim Hee Lee | N/A | Code |
| Deep Arbitrary-Scale Image Super-Resolution via Scale-Equivariance Pursuit | Xiaohang Wang · Xuanhong Chen · Bingbing Ni · Hang Wang · Zhengyan Tong · Yutian Liu | N/A | Code |
| Looking Through the Glass: Neural Surface Reconstruction Against High Specular Reflections | Jiaxiong Qiu · Peng-Tao Jiang · Yifan Zhu · Ze-Xin Yin · Ming-Ming Cheng · Bo Ren | N/A | Code |
| The Wisdom of Crowds: Temporal Progressive Attention for Early Action Prediction | Alexandros Stergiou · Dima Damen | N/A | Code |
| Use Your Head: Improving Long-Tail Video Recognition | Toby Perrett · Saptarshi Sinha · Tilo Burghardt · Majid Mirmehdi · Dima Damen | N/A | Code |
| Large-Scale Training Data Search for Object Re-Identification | Yue Yao · Tom Gedeon · Liang Zheng | N/A | Code |
| Unsupervised Sampling Promoting for Stochastic Human Trajectory Prediction | Guangyi Chen · Zhenhao Chen · Shunxing Fan · Kun Zhang | N/A | Code |
| Seeing a Rose in Five Thousand Ways | Yunzhi Zhang · Shangzhe Wu · Noah Snavely · Jiajun Wu | N/A | Code |
| EditableNeRF: Editing Topologically Varying Neural Radiance Fields by Key Points | Chengwei Zheng · Wenbin Lin · Feng Xu | N/A | Code |
| Uncertainty-Aware Unsupervised Image Deblurring With Deep Residual Prior | Xiaole Tang · Xile Zhao · Jun Liu · Jianli Wang · Yuchun Miao · Tieyong Zeng | N/A | Code |
| Primitive Generation and Semantic-Related Alignment for Universal Zero-Shot Segmentation | Shuting He · Henghui Ding · Wei Jiang | N/A | Code |
| Long-Tailed Visual Recognition via Self-Heterogeneous Integration With Knowledge Excavation | Yan Jin · Mengke Li · Yang Lu · Yiu-ming Cheung · Hanzi Wang | N/A | Code |
| Neuron Structure Modeling for Generalizable Remote Physiological Measurement | Hao Lu · Zitong Yu · Xuesong Niu · Ying-Cong Chen | N/A | Code |
| Decoupled Semantic Prototypes Enable Learning From Diverse Annotation Types for Semi-Weakly Segmentation in Expert-Driven Domains | Simon Reiß · Constantin Seibold · Alexander Freytag · Erik Rodner · Rainer Stiefelhagen | N/A | Code |
| Learning a Sparse Transformer Network for Effective Image Deraining | Xiang Chen · Hao Li · Mingqiang Li · Jinshan Pan | N/A | Code |
| Camouflaged Object Detection With Feature Decomposition and Edge Reconstruction | Chunming He · Kai Li · Yachao Zhang · Longxiang Tang · Yulun Zhang · Zhenhua Guo · Xiu Li | N/A | Code |
| LOCATE: Localize and Transfer Object Parts for Weakly Supervised Affordance Grounding | Gen Li · Varun Jampani · Deqing Sun · Laura Sevilla-Lara | N/A | Code |
| DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation | Nataniel Ruiz · Yuanzhen Li · Varun Jampani · Yael Pritch · Michael Rubinstein · Kfir Aberman | N/A | Code |
| GeneCIS: A Benchmark for General Conditional Image Similarity | Sagar Vaze · Nicolas Carion · Ishan Misra | N/A | Code |
| Neighborhood Attention Transformer | Ali Hassani · Steven Walton · Jiachen Li · Shen Li · Humphrey Shi | N/A | Code |
| 3D-Aware Conditional Image Synthesis | Kangle Deng · Gengshan Yang · Deva Ramanan · Jun-Yan Zhu | N/A | Code |
| Magic3D: High-Resolution Text-to-3D Content Creation | Chen-Hsuan Lin · Jun Gao · Luming Tang · Towaki Takikawa · Xiaohui Zeng · Xun Huang · Karsten Kreis · Sanja Fidler · Ming-Yu Liu · Tsung-Yi Lin | N/A | Code |
| QuantArt: Quantizing Image Style Transfer Towards High Visual Fidelity | Siyu Huang · Jie An · Donglai Wei · Jiebo Luo · Hanspeter Pfister | N/A | Code |
| SceneComposer: Any-Level Semantic Image Synthesis | Yu Zeng · Zhe Lin · Jianming Zhang · Qing Liu · John Collomosse · Jason Kuen · Vishal M. Patel | N/A | Code |
| Specialist Diffusion: Plug-and-Play Sample-Efficient Fine-Tuning of Text-to-Image Diffusion Models To Learn Any Unseen Style | Haoming Lu · Hazarapet Tunanyan · Kai Wang · Shant Navasardyan · Zhangyang Wang · Humphrey Shi | N/A | Code |
| In-Hand 3D Object Scanning From an RGB Sequence | Shreyas Hampali · Tomas Hodan · Luan Tran · Lingni Ma · Cem Keskin · Vincent Lepetit | N/A | Code |
| SHS-Net: Learning Signed Hyper Surfaces for Oriented Normal Estimation of Point Clouds | Qing Li · Huifang Feng · Kanle Shi · Yue Gao · Yi Fang · Yu-Shen Liu · Zhizhong Han | N/A | Code |
| Advancing Visual Grounding With Scene Knowledge: Benchmark and Method | Zhihong Chen · Ruifei Zhang · Yibing Song · Xiang Wan · Guanbin Li | N/A | Code |
| Putting People in Their Place: Affordance-Aware Human Insertion Into Scenes | Sumith Kulal · Tim Brooks · Alex Aiken · Jiajun Wu · Jimei Yang · Jingwan Lu · Alexei A. Efros · Krishna Kumar Singh | N/A | Code |
| Identity-Preserving Talking Face Generation With Landmark and Appearance Priors | Weizhi Zhong · Chaowei Fang · Yinqi Cai · Pengxu Wei · Gangming Zhao · Liang Lin · Guanbin Li | N/A | Code |
| Less Is More: Reducing Task and Model Complexity for 3D Point Cloud Semantic Segmentation | Li Li · Hubert P. H. Shum · Toby P. Breckon | N/A | Code |
| FAC: 3D Representation Learning via Foreground Aware Feature Contrast | Kangcheng Liu · Aoran Xiao · Xiaoqin Zhang · Shijian Lu · Ling Shao | N/A | Code |
| InstMove: Instance Motion for Object-Centric Video Segmentation | Qihao Liu · Junfeng Wu · Yi Jiang · Xiang Bai · Alan L. Yuille · Song Bai | N/A | Code |
| Are We Ready for Vision-Centric Driving Streaming Perception? The ASAP Benchmark | Xiaofeng Wang · Zheng Zhu · Yunpeng Zhang · Guan Huang · Yun Ye · Wenbo Xu · Ziwei Chen · Xingang Wang | N/A | Code |
| Self-Supervised Non-Uniform Kernel Estimation With Flow-Based Motion Prior for Blind Image Deblurring | Zhenxuan Fang · Fangfang Wu · Weisheng Dong · Xin Li · Jinjian Wu · Guangming Shi | N/A | Code |
| Neural Kernel Surface Reconstruction | Jiahui Huang · Zan Gojcic · Matan Atzmon · Or Litany · Sanja Fidler · Francis Williams | N/A | Code |
| Binary Latent Diffusion | Ze Wang · Jiang Wang · Zicheng Liu · Qiang Qiu | N/A | Code |
| Learning To Dub Movies via Hierarchical Prosody Models | Gaoxiang Cong · Liang Li · Yuankai Qi · Zheng-Jun Zha · Qi Wu · Wenyu Wang · Bin Jiang · Ming-Hsuan Yang · Qingming Huang | N/A | Code |
| Learning Geometric-Aware Properties in 2D Representation Using Lightweight CAD Models, or Zero Real 3D Pairs | Pattaramanee Arsomngern · Sarana Nutanong · Supasorn Suwajanakorn | N/A | Code |
| FrustumFormer: Adaptive Instance-Aware Resampling for Multi-View 3D Detection | Yuqi Wang · Yuntao Chen · Zhaoxiang Zhang | N/A | Code |
| Q: How To Specialize Large Vision-Language Models to Data-Scarce VQA Tasks? A: Self-Train on Unlabeled Images! | Zaid Khan · Vijay Kumar BG · Samuel Schulter · Xiang Yu · Yun Fu · Manmohan Chandraker | N/A | Code |
| StyleRes: Transforming the Residuals for Real Image Editing With StyleGAN | Hamza Pehlivan · Yusuf Dalva · Aysegul Dundar | N/A | Code |
| PVT-SSD: Single-Stage 3D Object Detector With Point-Voxel Transformer | Honghui Yang · Wenxiao Wang · Minghao Chen · Binbin Lin · Tong He · Hua Chen · Xiaofei He · Wanli Ouyang | N/A | Code |
| Boosting Verified Training for Robust Image Classifications via Abstraction | Zhaodi Zhang · Zhiyi Xue · Yang Chen · Si Liu · Yueling Zhang · Jing Liu · Min Zhang | N/A | Code |
| Interactive Segmentation As Gaussian Process Classification | Minghao Zhou · Hong Wang · Qian Zhao · Yuexiang Li · Yawen Huang · Deyu Meng · Yefeng Zheng | N/A | Code |
| OSRT: Omnidirectional Image Super-Resolution With Distortion-Aware Transformer | Fanghua Yu · Xintao Wang · Mingdeng Cao · Gen Li · Ying Shan · Chao Dong | N/A | Code |
| Accelerating Vision-Language Pretraining With Free Language Modeling | Teng Wang · Yixiao Ge · Feng Zheng · Ran Cheng · Ying Shan · Xiaohu Qie · Ping Luo | N/A | Code |
| TexPose: Neural Texture Learning for Self-Supervised 6D Object Pose Estimation | Hanzhi Chen · Fabian Manhardt · Nassir Navab · Benjamin Busam | N/A | Code |
| Learning With Noisy Labels via Self-Supervised Adversarial Noisy Masking | Yuanpeng Tu · Boshen Zhang · Yuxi Li · Liang Liu · Jian Li · Jiangning Zhang · Yabiao Wang · Chengjie Wang · Cai Rong Zhao | N/A | Code |
| HelixSurf: A Robust and Efficient Neural Implicit Surface Learning of Indoor Scenes With Iterative Intertwined Regularization | Zhihao Liang · Zhangjin Huang · Changxing Ding · Kui Jia | N/A | Code |
| Multi-Space Neural Radiance Fields | Ze-Xin Yin · Jiaxiong Qiu · Ming-Ming Cheng · Bo Ren | N/A | Code |
| MSF: Motion-Guided Sequential Fusion for Efficient 3D Object Detection From Point Cloud Sequences | Chenhang He · Ruihuang Li · Yabin Zhang · Shuai Li · Lei Zhang | N/A | Code |
| DLBD: A Self-Supervised Direct-Learned Binary Descriptor | Bin Xiao · Yang Hu · Bo Liu · Xiuli Bi · Weisheng Li · Xinbo Gao | N/A | Code |
| Connecting the Dots: Floorplan Reconstruction Using Two-Level Queries | Yuanwen Yue · Theodora Kontogianni · Konrad Schindler · Francis Engelmann | N/A | Code |
| PointAvatar: Deformable Point-Based Head Avatars From Videos | Yufeng Zheng · Wang Yifan · Gordon Wetzstein · Michael J. Black · Otmar Hilliges | N/A | Code |
| Diffusion-SDF: Text-To-Shape via Voxelized Diffusion | Muheng Li · Yueqi Duan · Jie Zhou · Jiwen Lu | N/A | Code |
| NeRF-RPN: A General Framework for Object Detection in NeRFs | Benran Hu · Junkai Huang · Yichen Liu · Yu-Wing Tai · Chi-Keung Tang | N/A | Code |
| CNVid-3.5M: Build, Filter, and Pre-Train the Large-Scale Public Chinese Video-Text Dataset | Tian Gan · Qing Wang · Xingning Dong · Xiangyuan Ren · Liqiang Nie · Qingpei Guo | N/A | Code |
| Vid2Avatar: 3D Avatar Reconstruction From Videos in the Wild via Self-Supervised Scene Decomposition | Chen Guo · Tianjian Jiang · Xu Chen · Jie Song · Otmar Hilliges | N/A | Code |
| Neural Preset for Color Style Transfer | Zhanghan Ke · Yuhao Liu · Lei Zhu · Nanxuan Zhao · Rynson W.H. Lau | N/A | Code |
| GRES: Generalized Referring Expression Segmentation | Chang Liu · Henghui Ding · Xudong Jiang | N/A | Code |
| Tracking Through Containers and Occluders in the Wild | Basile Van Hoorick · Pavel Tokmakov · Simon Stent · Jie Li · Carl Vondrick | N/A | Code |
| DepGraph: Towards Any Structural Pruning | Gongfan Fang · Xinyin Ma · Mingli Song · Michael Bi Mi · Xinchao Wang | N/A | Code |
| Exploring Incompatible Knowledge Transfer in Few-Shot Image Generation | Yunqing Zhao · Chao Du · Milad Abdollahzadeh · Tianyu Pang · Min Lin · Shuicheng Yan · Ngai-Man Cheung | N/A | Code |
| RGB No More: Minimally-Decoded JPEG Vision Transformers | Jeongsoo Park · Justin Johnson | N/A | Code |
| iQuery: Instruments As Queries for Audio-Visual Sound Separation | Jiaben Chen · Renrui Zhang · Dongze Lian · Jiaqi Yang · Ziyao Zeng · Jianbo Shi | N/A | Code |
| Towards Professional Level Crowd Annotation of Expert Domain Data | Pei Wang · Nuno Vasconcelos | N/A | Code |
| VideoTrack: Learning To Track Objects via Video Transformer | Fei Xie · Lei Chu · Jiahao Li · Yan Lu · Chao Ma | N/A | Code |
| SCoDA: Domain Adaptive Shape Completion for Real Scans | Yushuang Wu · Zizheng Yan · Ce Chen · Lai Wei · Xiao Li · Guanbin Li · Yihao Li · Shuguang Cui · Xiaoguang Han | N/A | Code |
| Enhanced Training of Query-Based Object Detection via Selective Query Recollection | Fangyi Chen · Han Zhang · Kai Hu · Yu-Kai Huang · Chenchen Zhu · Marios Savvides | N/A | Code |
| LaserMix for Semi-Supervised LiDAR Semantic Segmentation | Lingdong Kong · Jiawei Ren · Liang Pan · Ziwei Liu | N/A | Code |
| MSMDFusion: Fusing LiDAR and Camera at Multiple Scales With Multi-Depth Seeds for 3D Object Detection | Yang Jiao · Zequn Jie · Shaoxiang Chen · Jingjing Chen · Lin Ma · Yu-Gang Jiang | N/A | Code |
| Learning With Fantasy: Semantic-Aware Virtual Contrastive Constraint for Few-Shot Class-Incremental Learning | Zeyin Song · Yifan Zhao · Yujun Shi · Peixi Peng · Li Yuan · Yonghong Tian | N/A | Code |
| PLA: Language-Driven Open-Vocabulary 3D Scene Understanding | Runyu Ding · Jihan Yang · Chuhui Xue · Wenqing Zhang · Song Bai · Xiaojuan Qi | N/A | Code |
| Being Comes From Not-Being: Open-Vocabulary Text-to-Motion Generation With Wordless Training | Junfan Lin · Jianlong Chang · Lingbo Liu · Guanbin Li · Liang Lin · Qi Tian · Chang-Wen Chen | N/A | Code |
| A Dynamic Multi-Scale Voxel Flow Network for Video Prediction | Xiaotao Hu · Zhewei Huang · Ailin Huang · Jun Xu · Shuchang Zhou | N/A | Code |
| Neural Dependencies Emerging From Learning Massive Categories | Ruili Feng · Kecheng Zheng · Kai Zhu · Yujun Shen · Jian Zhao · Yukun Huang · Deli Zhao · Jingren Zhou · Michael Jordan · Zheng-Jun Zha | N/A | Code |
| Diverse Embedding Expansion Network and Low-Light Cross-Modality Benchmark for Visible-Infrared Person Re-Identification | Yukang Zhang · Hanzi Wang | N/A | Code |
| Neural Kaleidoscopic Space Sculpting | Byeongjoo Ahn · Michael De Zeeuw · Ioannis Gkioulekas · Aswin C. Sankaranarayanan | N/A | Code |
| PyramidFlow: High-Resolution Defect Contrastive Localization Using Pyramid Normalizing Flow | Jiarui Lei · Xiaobo Hu · Yue Wang · Dong Liu | N/A | Code |
| Masked Motion Encoding for Self-Supervised Video Representation Learning | Xinyu Sun · Peihao Chen · Liangwei Chen · Changhao Li · Thomas H. Li · Mingkui Tan · Chuang Gan | N/A | Code |
| StyleSync: High-Fidelity Generalized and Personalized Lip Sync in Style-Based Generator | Jiazhi Guan · Zhanwang Zhang · Hang Zhou · Tianshu Hu · Kaisiyuan Wang · Dongliang He · Haocheng Feng · Jingtuo Liu · Errui Ding · Ziwei Liu · Jingdong Wang | N/A | Code |
| LiDAR2Map: In Defense of LiDAR-Based Semantic Map Construction Using Online Camera Distillation | Song Wang · Wentong Li · Wenyu Liu · Xiaolu Liu · Jianke Zhu | N/A | Code |
| Symmetric Shape-Preserving Autoencoder for Unsupervised Real Scene Point Cloud Completion | Changfeng Ma · Yinuo Chen · Pengxiao Guo · Jie Guo · Chongjun Wang · Yanwen Guo | N/A | Code |
| Boosting Detection in Crowd Analysis via Underutilized Output Features | Shaokai Wu · Fengyu Yang | N/A | Code |
| Representation Learning for Visual Object Tracking by Masked Appearance Transfer | Haojie Zhao · Dong Wang · Huchuan Lu | N/A | Code |
| NeuralLift-360: Lifting an In-the-Wild 2D Photo to a 3D Object With 360° Views | Dejia Xu · Yifan Jiang · Peihao Wang · Zhiwen Fan · Yi Wang · Zhangyang Wang | N/A | Code |
| DoNet: Deep De-Overlapping Network for Cytology Instance Segmentation | Hao Jiang · Rushan Zhang · Yanning Zhou · Yumeng Wang · Hao Chen | N/A | Code |
| Think Twice Before Driving: Towards Scalable Decoders for End-to-End Autonomous Driving | Xiaosong Jia · Penghao Wu · Li Chen · Jiangwei Xie · Conghui He · Junchi Yan · Hongyang Li | N/A | Code |
| Adversarial Counterfactual Visual Explanations | Guillaume Jeanneret · Loïc Simon · Frédéric Jurie | N/A | Code |
| ALOFT: A Lightweight MLP-Like Architecture With Dynamic Low-Frequency Transform for Domain Generalization | Jintao Guo · Na Wang · Lei Qi · Yinghuan Shi | N/A | Code |
| ShadowNeuS: Neural SDF Reconstruction by Shadow Ray Supervision | Jingwang Ling · Zhibo Wang · Feng Xu | N/A | Code |
| Coaching a Teachable Student | Jimuyang Zhang · Zanming Huang · Eshed Ohn-Bar | N/A | Code |
| POTTER: Pooling Attention Transformer for Efficient Human Mesh Recovery | Ce Zheng · Xianpeng Liu · Guo-Jun Qi · Chen Chen | N/A | Code |
| Layout-Based Causal Inference for Object Navigation | Sixian Zhang · Xinhang Song · Weijie Li · Yubing Bai · Xinyao Yu · Shuqiang Jiang | N/A | Code |
| Towards Bridging the Performance Gaps of Joint Energy-Based Models | Xiulong Yang · Qing Su · Shihao Ji | N/A | Code |
| Bringing Inputs to Shared Domains for 3D Interacting Hands Recovery in the Wild | Gyeongsik Moon | N/A | Code |
| Trajectory-Aware Body Interaction Transformer for Multi-Person Pose Forecasting | Xiaogang Peng · Siyuan Mao · Zizhao Wu | N/A | Code |
| Distilling Vision-Language Pre-Training To Collaborate With Weakly-Supervised Temporal Action Localization | Chen Ju · Kunhao Zheng · Jinxiang Liu · Peisen Zhao · Ya Zhang · Jianlong Chang · Qi Tian · Yanfeng Wang | N/A | Code |
| DiffPose: Toward More Reliable 3D Pose Estimation | Jia Gong · Lin Geng Foo · Zhipeng Fan · Qiuhong Ke · Hossein Rahmani · Jun Liu | N/A | Code |
| SQUID: Deep Feature In-Painting for Unsupervised Anomaly Detection | Tiange Xiang · Yixiao Zhang · Yongyi Lu · Alan L. Yuille · Chaoyi Zhang · Weidong Cai · Zongwei Zhou | N/A | Code |
| On the Difficulty of Unpaired Infrared-to-Visible Video Translation: Fine-Grained Content-Rich Patches Transfer | Zhenjie Yu · Shuang Li · Yirui Shen · Chi Harold Liu · Shuigen Wang | N/A | Code |
| CABM: Content-Aware Bit Mapping for Single Image Super-Resolution Network With Large Input | Senmao Tian · Ming Lu · Jiaming Liu · Yandong Guo · Yurong Chen · Shunli Zhang | N/A | Code |
| NeRF-DS: Neural Radiance Fields for Dynamic Specular Objects | Zhiwen Yan · Chen Li · Gim Hee Lee | N/A | Code |
| Spring: A High-Resolution High-Detail Dataset and Benchmark for Scene Flow, Optical Flow and Stereo | Lukas Mehl · Jenny Schmalfuss · Azin Jahedi · Yaroslava Nalivayko · Andrés Bruhn | N/A | Code |
| Unifying Short and Long-Term Tracking With Graph Hierarchies | Orcun Cetintas · Guillem Brasó · Laura Leal-Taixé | N/A | Code |
| MixTeacher: Mining Promising Labels With Mixed Scale Teacher for Semi-Supervised Object Detection | Liang Liu · Boshen Zhang · Jiangning Zhang · Wuhao Zhang · Zhenye Gan · Guanzhong Tian · Wenbing Zhu · Yabiao Wang · Chengjie Wang | N/A | Code |
| A2J-Transformer: Anchor-to-Joint Transformer Network for 3D Interacting Hand Pose Estimation From a Single RGB Image | Changlong Jiang · Yang Xiao · Cunlin Wu · Mingyang Zhang · Jinghong Zheng · Zhiguo Cao · Joey Tianyi Zhou | N/A | Code |
| Efficient Mask Correction for Click-Based Interactive Image Segmentation | Fei Du · Jianlong Yuan · Zhibin Wang · Fan Wang | N/A | Code |
| OmniObject3D: Large-Vocabulary 3D Object Dataset for Realistic Perception, Reconstruction and Generation | Tong Wu · Jiarui Zhang · Xiao Fu · Yuxin Wang · Jiawei Ren · Liang Pan · Wayne Wu · Lei Yang · Jiaqi Wang · Chen Qian · Dahua Lin · Ziwei Liu | N/A | Code |
| LoGoNet: Towards Accurate 3D Object Detection With Local-to-Global Cross-Modal Fusion | Xin Li · Tao Ma · Yuenan Hou · Botian Shi · Yuchen Yang · Youquan Liu · Xingjiao Wu · Qin Chen · Yikang Li · Yu Qiao · Liang He | N/A | Code |
| 3D Registration With Maximal Cliques | Xiyu Zhang · Jiaqi Yang · Shikun Zhang · Yanning Zhang | N/A | Code |
| Inferring and Leveraging Parts From Object Shape for Improving Semantic Image Synthesis | Yuxiang Wei · Zhilong Ji · Xiaohe Wu · Jinfeng Bai · Lei Zhang · Wangmeng Zuo | N/A | Code |
| Frame-Event Alignment and Fusion Network for High Frame Rate Tracking | Jiqing Zhang · Yuanchen Wang · Wenxi Liu · Meng Li · Jinpeng Bai · Baocai Yin · Xin Yang | N/A | Code |
| Human Guided Ground-Truth Generation for Realistic Image Super-Resolution | Du Chen · Jie Liang · Xindong Zhang · Ming Liu · Hui Zeng · Lei Zhang | N/A | Code |
| Towards Building Self-Aware Object Detectors via Reliable Uncertainty Quantification and Calibration | Kemal Oksuz · Tom Joy · Puneet K. Dokania | N/A | Code |
| Generating Human Motion From Textual Descriptions With Discrete Representations | Jianrong Zhang · Yangsong Zhang · Xiaodong Cun · Yong Zhang · Hongwei Zhao · Hongtao Lu · Xi Shen · Ying Shan | N/A | Code |
| Curricular Contrastive Regularization for Physics-Aware Single Image Dehazing | Yu Zheng · Jiahui Zhan · Shengfeng He · Junyu Dong · Yong Du | N/A | Code |
| Learning Human Mesh Recovery in 3D Scenes | Zehong Shen · Zhi Cen · Sida Peng · Qing Shuai · Hujun Bao · Xiaowei Zhou | N/A | Code |
| Object-Aware Distillation Pyramid for Open-Vocabulary Object Detection | Luting Wang · Yi Liu · Penghui Du · Zihan Ding · Yue Liao · Qiaosong Qi · Biaolong Chen · Si Liu | N/A | Code |
| Towards Artistic Image Aesthetics Assessment: A Large-Scale Dataset and a New Method | Ran Yi · Haoyuan Tian · Zhihao Gu · Yu-Kun Lai · Paul L. Rosin | N/A | Code |
| SOOD: Towards Semi-Supervised Oriented Object Detection | Wei Hua · Dingkang Liang · Jingyu Li · Xiaolong Liu · Zhikang Zou · Xiaoqing Ye · Xiang Bai | N/A | Code |
| Spherical Transformer for LiDAR-Based 3D Recognition | Xin Lai · Yukang Chen · Fanbin Lu · Jianhui Liu · Jiaya Jia | N/A | Code |
| Data-Free Sketch-Based Image Retrieval | Abhra Chaudhuri · Ayan Kumar Bhunia · Yi-Zhe Song · Anjan Dutta | N/A | Code |
| CDDFuse: Correlation-Driven Dual-Branch Feature Decomposition for Multi-Modality Image Fusion | Zixiang Zhao · Haowen Bai · Jiangshe Zhang · Yulun Zhang · Shuang Xu · Zudi Lin · Radu Timofte · Luc Van Gool | N/A | Code |
| Proximal Splitting Adversarial Attack for Semantic Segmentation | Jérôme Rony · Jean-Christophe Pesquet · Ismail Ben Ayed | N/A | Code |
| NeuWigs: A Neural Dynamic Model for Volumetric Hair Capture and Animation | Ziyan Wang · Giljoo Nam · Tuur Stuyck · Stephen Lombardi · Chen Cao · Jason Saragih · Michael Zollhöfer · Jessica Hodgins · Christoph Lassner | N/A | Code |
| Exploring the Effect of Primitives for Compositional Generalization in Vision-and-Language | Chuanhao Li · Zhen Li · Chenchen Jing · Yunde Jia · Yuwei Wu | N/A | Code |
| 3D-Aware Face Swapping | Yixuan Li · Chao Ma · Yichao Yan · Wenhan Zhu · Xiaokang Yang | N/A | Code |
| Representing Volumetric Videos As Dynamic MLP Maps | Sida Peng · Yunzhi Yan · Qing Shuai · Hujun Bao · Xiaowei Zhou | N/A | Code |
| Rethinking the Approximation Error in 3D Surface Fitting for Point Cloud Normal Estimation | Hang Du · Xuejun Yan · Jingjing Wang · Di Xie · Shiliang Pu | N/A | Code |
| Rethinking Out-of-Distribution (OOD) Detection: Masked Image Modeling Is All You Need | Jingyao Li · Pengguang Chen · Zexin He · Shaozuo Yu · Shu Liu · Jiaya Jia | N/A | Code |
| Paint by Example: Exemplar-Based Image Editing With Diffusion Models | Binxin Yang · Shuyang Gu · Bo Zhang · Ting Zhang · Xuejin Chen · Xiaoyan Sun · Dong Chen · Fang Wen | N/A | Code |
| Referring Multi-Object Tracking | Dongming Wu · Wencheng Han · Tiancai Wang · Xingping Dong · Xiangyu Zhang · Jianbing Shen | N/A | Code |
| NerVE: Neural Volumetric Edges for Parametric Curve Extraction From Point Cloud | Xiangyu Zhu · Dong Du · Weikai Chen · Zhiyou Zhao · Yinyu Nie · Xiaoguang Han | N/A | Code |
| AsyFOD: An Asymmetric Adaptation Paradigm for Few-Shot Domain Adaptive Object Detection | Yipeng Gao · Kun-Yu Lin · Junkai Yan · Yaowei Wang · Wei-Shi Zheng | N/A | Code |
| CUF: Continuous Upsampling Filters | Cristina N. Vasconcelos · Cengiz Oztireli · Mark Matthews · Milad Hashemi · Kevin Swersky · Andrea Tagliasacchi | N/A | Code |
| MOTRv2: Bootstrapping End-to-End Multi-Object Tracking by Pretrained Object Detectors | Yuang Zhang · Tiancai Wang · Xiangyu Zhang | N/A | Code |
| CXTrack: Improving 3D Point Cloud Tracking With Contextual Information | Tian-Xing Xu · Yuan-Chen Guo · Yu-Kun Lai · Song-Hai Zhang | N/A | Code |
| Explicit Boundary Guided Semi-Push-Pull Contrastive Learning for Supervised Anomaly Detection | Xincheng Yao · Ruoqi Li · Jing Zhang · Jun Sun · Chongyang Zhang | N/A | Code |
| Learning Bottleneck Concepts in Image Classification | Bowen Wang · Liangzhi Li · Yuta Nakashima · Hajime Nagahara | N/A | Code |
| Zero-Shot Model Diagnosis | Jinqi Luo · Zhaoning Wang · Chen Henry Wu · Dong Huang · Fernando De la Torre | N/A | Code |
| DiffTalk: Crafting Diffusion Models for Generalized Audio-Driven Portraits Animation | Shuai Shen · Wenliang Zhao · Zibin Meng · Wanhua Li · Zheng Zhu · Jie Zhou · Jiwen Lu | N/A | Code |
| DAA: A Delta Age AdaIN Operation for Age Estimation via Binary Code Transformer | Ping Chen · Xingpeng Zhang · Ye Li · Ju Tao · Bin Xiao · Bing Wang · Zongjie Jiang | N/A | Code |
| TranSG: Transformer-Based Skeleton Graph Prototype Contrastive Learning With Structure-Trajectory Prompted Reconstruction for Person Re-Identification | Haocong Rao · Chunyan Miao | N/A | Code |
| Joint Visual Grounding and Tracking With Natural Language Specification | Li Zhou · Zikun Zhou · Kaige Mao · Zhenyu He | N/A | Code |
| Compressing Volumetric Radiance Fields to 1 MB | Lingzhi Li · Zhen Shen · Zhongshu Wang · Li Shen · Liefeng Bo | N/A | Code |
| HyperReel: High-Fidelity 6-DoF Video With Ray-Conditioned Sampling | Benjamin Attal · Jia-Bin Huang · Christian Richardt · Michael Zollhöfer · Johannes Kopf · Matthew O’Toole · Changil Kim | N/A | Code |
| Iterative Next Boundary Detection for Instance Segmentation of Tree Rings in Microscopy Images of Shrub Cross Sections | Alexander Gillert · Giulia Resente · Alba Anadon-Rosell · Martin Wilmking · Uwe Freiherr von Lukas | N/A | Code |
| Ego-Body Pose Estimation via Ego-Head Pose Estimation | Jiaman Li · Karen Liu · Jiajun Wu | N/A | Code |
| Learned Two-Plane Perspective Prior Based Image Resampling for Efficient Object Detection | Anurag Ghosh · N. Dinesh Reddy · Christoph Mertz · Srinivasa G. Narasimhan | N/A | Code |
| PaletteNeRF: Palette-Based Appearance Editing of Neural Radiance Fields | Zhengfei Kuang · Fujun Luan · Sai Bi · Zhixin Shu · Gordon Wetzstein · Kalyan Sunkavalli | N/A | Code |
| Long Range Pooling for 3D Large-Scale Scene Understanding | Xiang-Li Li · Meng-Hao Guo · Tai-Jiang Mu · Ralph R. Martin · Shi-Min Hu | N/A | Code |
| Dynamic Graph Enhanced Contrastive Learning for Chest X-Ray Report Generation | Mingjie Li · Bingqian Lin · Zicong Chen · Haokun Lin · Xiaodan Liang · Xiaojun Chang | N/A | Code |
| Event-Guided Person Re-Identification via Sparse-Dense Complementary Learning | Chengzhi Cao · Xueyang Fu · Hongjian Liu · Yukun Huang · Kunyu Wang · Jiebo Luo · Zheng-Jun Zha | N/A | Code |
| Contrastive Grouping With Transformer for Referring Image Segmentation | Jiajin Tang · Ge Zheng · Cheng Shi · Sibei Yang | N/A | Code |
| Structure Aggregation for Cross-Spectral Stereo Image Guided Denoising | Zehua Sheng · Zhu Yu · Xiongwei Liu · Si-Yuan Cao · Yuqi Liu · Hui-Liang Shen · Huaqi Zhang | N/A | Code |
| Where Is My Spot? Few-Shot Image Generation via Latent Subspace Optimization | Chenxi Zheng · Bangzhen Liu · Huaidong Zhang · Xuemiao Xu · Shengfeng He | N/A | Code |
| EDGE: Editable Dance Generation From Music | Jonathan Tseng · Rodrigo Castellon · Karen Liu | N/A | Code |
| PartSLIP: Low-Shot Part Segmentation for 3D Point Clouds via Pretrained Image-Language Models | Minghua Liu · Yinhao Zhu · Hong Cai · Shizhong Han · Zhan Ling · Fatih Porikli · Hao Su | N/A | Code |
| EDICT: Exact Diffusion Inversion via Coupled Transformations | Bram Wallace · Akash Gokul · Nikhil Naik | N/A | Code |
| Complete 3D Human Reconstruction From a Single Incomplete Image | Junying Wang · Jae Shin Yoon · Tuanfeng Y. Wang · Krishna Kumar Singh · Ulrich Neumann | N/A | Code |
| PartDistillation: Learning Parts From Instance Segmentation | Jang Hyun Cho · Philipp Krähenbühl · Vignesh Ramanathan | N/A | Code |
| Neural Vector Fields: Implicit Representation by Explicit Learning | Xianghui Yang · Guosheng Lin · Zhenghao Chen · Luping Zhou | N/A | Code |
| Unsupervised Inference of Signed Distance Functions From Single Sparse Point Clouds Without Learning Priors | Chao Chen · Yu-Shen Liu · Zhizhong Han | N/A | Code |
| Texts as Images in Prompt Tuning for Multi-Label Image Recognition | Zixian Guo · Bowen Dong · Zhilong Ji · Jinfeng Bai · Yiwen Guo · Wangmeng Zuo | N/A | Code |
| Grad-PU: Arbitrary-Scale Point Cloud Upsampling via Gradient Descent With Learned Distance Functions | Yun He · Danhang Tang · Yinda Zhang · Xiangyang Xue · Yanwei Fu | N/A | Code |
| MMANet: Margin-Aware Distillation and Modality-Aware Regularization for Incomplete Multimodal Learning | Shicai Wei · Chunbo Luo · Yang Luo | N/A | Code |
| Rethinking Optical Flow From Geometric Matching Consistent Perspective | Qiaole Dong · Chenjie Cao · Yanwei Fu | N/A | Code |
| FastInst: A Simple Query-Based Model for Real-Time Instance Segmentation | Junjie He · Pengyu Li · Yifeng Geng · Xuansong Xie | N/A | Code |
| How Can Objects Help Action Recognition? | Xingyi Zhou · Anurag Arnab · Chen Sun · Cordelia Schmid | N/A | Code |
| Images Speak in Images: A Generalist Painter for In-Context Visual Learning | Xinlong Wang · Wen Wang · Yue Cao · Chunhua Shen · Tiejun Huang | N/A | Code |
| SemiCVT: Semi-Supervised Convolutional Vision Transformer for Semantic Segmentation | Huimin Huang · Shiao Xie · Lanfen Lin · Ruofeng Tong · Yen-Wei Chen · Yuexiang Li · Hong Wang · Yawen Huang · Yefeng Zheng | N/A | Code |
| A Unified Pyramid Recurrent Network for Video Frame Interpolation | Xin Jin · Longhai Wu · Jie Chen · Youxin Chen · Jayoon Koo · Cheul-hee Hahm | N/A | Code |
| Enhancing the Self-Universality for Transferable Targeted Attacks | Zhipeng Wei · Jingjing Chen · Zuxuan Wu · Yu-Gang Jiang | N/A | Code |
| Multi-View Inverse Rendering for Large-Scale Real-World Indoor Scenes | Zhen Li · Lingli Wang · Mofang Cheng · Cihui Pan · Jiaqi Yang | N/A | Code |
| TAPS3D: Text-Guided 3D Textured Shape Generation From Pseudo Supervision | Jiacheng Wei · Hao Wang · Jiashi Feng · Guosheng Lin · Kim-Hui Yap | N/A | Code |
| Frequency-Modulated Point Cloud Rendering With Easy Editing | Yi Zhang · Xiaoyang Huang · Bingbing Ni · Teng Li · Wenjun Zhang | N/A | Code |
| Vector Quantization With Self-Attention for Quality-Independent Representation Learning | Zhou Yang · Weisheng Dong · Xin Li · Mengluan Huang · Yulin Sun · Guangming Shi | N/A | Code |
| Fine-Grained Face Swapping via Regional GAN Inversion | Zhian Liu · Maomao Li · Yong Zhang · Cairong Wang · Qi Zhang · Jue Wang · Yongwei Nie | N/A | Code |
| Backdoor Defense via Adaptively Splitting Poisoned Dataset | Kuofeng Gao · Yang Bai · Jindong Gu · Yong Yang · Shu-Tao Xia | N/A | Code |
| RGBD2: Generative Scene Synthesis via Incremental View Inpainting Using RGBD Diffusion Models | Jiabao Lei · Jiapeng Tang · Kui Jia | N/A | Code |
| CLAMP: Prompt-Based Contrastive Learning for Connecting Language and Animal Pose | Xu Zhang · Wen Wang · Zhe Chen · Yufei Xu · Jing Zhang · Dacheng Tao | N/A | Code |
| Fake It Till You Make It: Learning Transferable Representations From Synthetic ImageNet Clones | Mert Bülent Sarıyıldız · Karteek Alahari · Diane Larlus · Yannis Kalantidis | N/A | Code |
| Efficient Frequency Domain-Based Transformers for High-Quality Image Deblurring | Lingshun Kong · Jiangxin Dong · Jianjun Ge · Mingqiang Li · Jinshan Pan | N/A | Code |
| DartBlur: Privacy Preservation With Detection Artifact Suppression | Baowei Jiang · Bing Bai · Haozhe Lin · Yu Wang · Yuchen Guo · Lu Fang | N/A | Code |
| FCC: Feature Clusters Compression for Long-Tailed Visual Recognition | Jian Li · Ziyao Meng · Daqian Shi · Rui Song · Xiaolei Diao · Jingwen Wang · Hao Xu | N/A | Code |
| CLOTH4D: A Dataset for Clothed Human Reconstruction | Xingxing Zou · Xintong Han · Waikeung Wong | N/A | Code |
| LinK: Linear Kernel for LiDAR-Based 3D Perception | Tao Lu · Xiang Ding · Haisong Liu · Gangshan Wu · Limin Wang | N/A | Code |
| Hunting Sparsity: Density-Guided Contrastive Learning for Semi-Supervised Semantic Segmentation | Xiaoyang Wang · Bingfeng Zhang · Limin Yu · Jimin Xiao | N/A | Code |
| Collecting Cross-Modal Presence-Absence Evidence for Weakly-Supervised Audio-Visual Event Perception | Junyu Gao · Mengyuan Chen · Changsheng Xu | N/A | Code |
| LargeKernel3D: Scaling Up Kernels in 3D Sparse CNNs | Yukang Chen · Jianhui Liu · Xiangyu Zhang · Xiaojuan Qi · Jiaya Jia | N/A | Code |
| Deep Learning of Partial Graph Matching via Differentiable Top-K | Runzhong Wang · Ziao Guo · Shaofei Jiang · Xiaokang Yang · Junchi Yan | N/A | Code |
| Analyzing Physical Impacts Using Transient Surface Wave Imaging | Tianyuan Zhang · Mark Sheinin · Dorian Chan · Mark Rau · Matthew O’Toole · Srinivasa G. Narasimhan | N/A | Code |
| Rethinking Domain Generalization for Face Anti-Spoofing: Separability and Alignment | Yiyou Sun · Yaojie Liu · Xiaoming Liu · Yixuan Li · Wen-Sheng Chu | N/A | Code |
| A Simple Baseline for Video Restoration With Grouped Spatial-Temporal Shift | Dasong Li · Xiaoyu Shi · Yi Zhang · Ka Chun Cheung · Simon See · Xiaogang Wang · Hongwei Qin · Hongsheng Li | N/A | Code |
| The ObjectFolder Benchmark: Multisensory Learning With Neural and Real Objects | Ruohan Gao · Yiming Dou · Hao Li · Tanmay Agarwal · Jeannette Bohg · Yunzhu Li · Li Fei-Fei · Jiajun Wu | N/A | Code |
| PIRLNav: Pretraining With Imitation and RL Finetuning for ObjectNav | Ram Ramrakhya · Dhruv Batra · Erik Wijmans · Abhishek Das | N/A | Code |
| DC2: Dual-Camera Defocus Control by Learning To Refocus | Hadi Alzayer · Abdullah Abuolaim · Leung Chun Chan · Yang Yang · Ying Chen Lou · Jia-Bin Huang · Abhishek Kar | N/A | Code |
| Habitat-Matterport 3D Semantics Dataset | Karmesh Yadav · Ram Ramrakhya · Santhosh Kumar Ramakrishnan · Theo Gervet · John Turner · Aaron Gokaslan · Noah Maestre · Angel Xuan Chang · Dhruv Batra · Manolis Savva · Alexander William Clegg · Devendra Singh Chaplot | N/A | Code |
| Prompting Large Language Models With Answer Heuristics for Knowledge-Based Visual Question Answering | Zhenwei Shao · Zhou Yu · Meng Wang · Jun Yu | N/A | Code |
| Similarity Metric Learning for RGB-Infrared Group Re-Identification | Jianghao Xiong · Jianhuang Lai | N/A | Code |
| DPF: Learning Dense Prediction Fields With Weak Supervision | Xiaoxue Chen · Yuhang Zheng · Yupeng Zheng · Qiang Zhou · Hao Zhao · Guyue Zhou · Ya-Qin Zhang | N/A | Code |
| Mixed Autoencoder for Self-Supervised Visual Representation Learning | Kai Chen · Zhili Liu · Lanqing Hong · Hang Xu · Zhenguo Li · Dit-Yan Yeung | N/A | Code |
| Content-Aware Token Sharing for Efficient Semantic Segmentation With Vision Transformers | Chenyang Lu · Daan de Geus · Gijs Dubbelman | N/A | Code |
| NeuralEditor: Editing Neural Radiance Fields via Manipulating Point Clouds | Jun-Kun Chen · Jipeng Lyu · Yu-Xiong Wang | N/A | Code |
| Multiview Compressive Coding for 3D Reconstruction | Chao-Yuan Wu · Justin Johnson · Jitendra Malik · Christoph Feichtenhofer · Georgia Gkioxari | N/A | Code |
| Revisiting Weak-to-Strong Consistency in Semi-Supervised Semantic Segmentation | Lihe Yang · Lei Qi · Litong Feng · Wayne Zhang · Yinghuan Shi | N/A | Code |
| Delving Into Shape-Aware Zero-Shot Semantic Segmentation | Xinyu Liu · Beiwen Tian · Zhen Wang · Rui Wang · Kehua Sheng · Bo Zhang · Hao Zhao · Guyue Zhou | N/A | Code |
| Towards a Smaller Student: Capacity Dynamic Distillation for Efficient Image Retrieval | Yi Xie · Huaidong Zhang · Xuemiao Xu · Jianqing Zhu · Shengfeng He | N/A | Code |
| Bootstrapping Objectness From Videos by Relaxed Common Fate and Visual Grouping | Long Lian · Zhirong Wu · Stella X. Yu | N/A | Code |
| NeuralPCI: Spatio-Temporal Neural Field for 3D Point Cloud Multi-Frame Non-Linear Interpolation | Zehan Zheng · Danni Wu · Ruisi Lu · Fan Lu · Guang Chen · Changjun Jiang | N/A | Code |
| Complete-to-Partial 4D Distillation for Self-Supervised Point Cloud Sequence Representation Learning | Zhuoyang Zhang · Yuhao Dong · Yunze Liu · Li Yi | N/A | Code |
| GFIE: A Dataset and Baseline for Gaze-Following From 2D to 3D in Indoor Environments | Zhengxi Hu · Yuxue Yang · Xiaolin Zhai · Dingye Yang · Bohan Zhou · Jingtai Liu | N/A | Code |
| Cap4Video: What Can Auxiliary Captions Do for Text-Video Retrieval? | Wenhao Wu · Haipeng Luo · Bo Fang · Jingdong Wang · Wanli Ouyang | N/A | Code |
| Hierarchical Temporal Transformer for 3D Hand Pose Estimation and Action Recognition From Egocentric RGB Videos | Yilin Wen · Hao Pan · Lei Yang · Jia Pan · Taku Komura · Wenping Wang | N/A | Code |
| CAP-VSTNet: Content Affinity Preserved Versatile Style Transfer | Linfeng Wen · Chengying Gao · Changqing Zou | N/A | Code |
| Uncurated Image-Text Datasets: Shedding Light on Demographic Bias | Noa Garcia · Yusuke Hirota · Yankun Wu · Yuta Nakashima | N/A | Code |
| AltFreezing for More General Video Face Forgery Detection | Zhendong Wang · Jianmin Bao · Wengang Zhou · Weilun Wang · Houqiang Li | N/A | Code |
| Two-View Geometry Scoring Without Correspondences | Axel Barroso-Laguna · Eric Brachmann · Victor Adrian Prisacariu · Gabriel J. Brostow · Daniyar Turmukhambetov | N/A | Code |
| Task Difficulty Aware Parameter Allocation & Regularization for Lifelong Learning | Wenjin Wang · Yunqing Hu · Qianglong Chen · Yin Zhang | N/A | Code |
| Revisiting Prototypical Network for Cross Domain Few-Shot Learning | Fei Zhou · Peng Wang · Lei Zhang · Wei Wei · Yanning Zhang | N/A | Code |
| Federated Incremental Semantic Segmentation | Jiahua Dong · Duzhen Zhang · Yang Cong · Wei Cong · Henghui Ding · Dengxin Dai | N/A | Code |
| Self-Supervised Learning for Multimodal Non-Rigid 3D Shape Matching | Dongliang Cao · Florian Bernard | N/A | Code |
| Exploring the Relationship Between Architectural Design and Adversarially Robust Generalization | Aishan Liu · Shiyu Tang · Siyuan Liang · Ruihao Gong · Boxi Wu · Xianglong Liu · Dacheng Tao | N/A | Code |
| Video-Text As Game Players: Hierarchical Banzhaf Interaction for Cross-Modal Representation Learning | Peng Jin · Jinfa Huang · Pengfei Xiong · Shangxuan Tian · Chang Liu · Xiangyang Ji · Li Yuan · Jie Chen | N/A | Code |
| pCON: Polarimetric Coordinate Networks for Neural Scene Representations | Henry Peters · Yunhao Ba · Achuta Kadambi | N/A | Code |
| RIAV-MVS: Recurrent-Indexing an Asymmetric Volume for Multi-View Stereo | Changjiang Cai · Pan Ji · Qingan Yan · Yi Xu | N/A | Code |
| Depth Estimation From Camera Image and mmWave Radar Point Cloud | Akash Deep Singh · Yunhao Ba · Ankur Sarker · Howard Zhang · Achuta Kadambi · Stefano Soatto · Mani Srivastava · Alex Wong | N/A | Code |
| Normal-Guided Garment UV Prediction for Human Re-Texturing | Yasamin Jafarian · Tuanfeng Y. Wang · Duygu Ceylan · Jimei Yang · Nathan Carr · Yi Zhou · Hyun Soo Park | N/A | Code |
| WeatherStream: Light Transport Automation of Single Image Deweathering | Howard Zhang · Yunhao Ba · Ethan Yang · Varan Mehra · Blake Gella · Akira Suzuki · Arnold Pfahnl · Chethan Chinder Chandrappa · Alex Wong · Achuta Kadambi | N/A | Code |
| MobileBrick: Building LEGO for 3D Reconstruction on Mobile Devices | Kejie Li · Jia-Wang Bian · Robert Castle · Philip H.S. Torr · Victor Adrian Prisacariu | N/A | Code |
| Ultrahigh Resolution Image/Video Matting With Spatio-Temporal Sparsity | Yanan Sun · Chi-Keung Tang · Yu-Wing Tai | N/A | Code |
| Hierarchical Supervision and Shuffle Data Augmentation for 3D Semi-Supervised Object Detection | Chuandong Liu · Chenqiang Gao · Fangcen Liu · Pengcheng Li · Deyu Meng · Xinbo Gao | N/A | Code |
| PATS: Patch Area Transportation With Subdivision for Local Feature Matching | Junjie Ni · Yijin Li · Zhaoyang Huang · Hongsheng Li · Hujun Bao · Zhaopeng Cui · Guofeng Zhang | N/A | Code |
| SINE: Semantic-Driven Image-Based NeRF Editing With Prior-Guided Editing Field | Chong Bao · Yinda Zhang · Bangbang Yang · Tianxing Fan · Zesong Yang · Hujun Bao · Guofeng Zhang · Zhaopeng Cui | N/A | Code |
| GeoNet: Benchmarking Unsupervised Adaptation Across Geographies | Tarun Kalluri · Wangdong Xu · Manmohan Chandraker | N/A | Code |
| Joint HDR Denoising and Fusion: A Real-World Mobile HDR Image Dataset | Shuaizheng Liu · Xindong Zhang · Lingchen Sun · Zhetong Liang · Hui Zeng · Lei Zhang | N/A | Code |
| 3D-Aware Object Goal Navigation via Simultaneous Exploration and Identification | Jiazhao Zhang · Liu Dai · Fanpeng Meng · Qingnan Fan · Xuelin Chen · Kai Xu · He Wang | N/A | Code |
| Delving Into Discrete Normalizing Flows on SO(3) Manifold for Probabilistic Rotation Modeling | Yulin Liu · Haoran Liu · Yingda Yin · Yang Wang · Baoquan Chen · He Wang | N/A | Code |
| RILS: Masked Visual Reconstruction in Language Semantic Space | Shusheng Yang · Yixiao Ge · Kun Yi · Dian Li · Ying Shan · Xiaohu Qie · Xinggang Wang | N/A | Code |
| ConQueR: Query Contrast Voxel-DETR for 3D Object Detection | Benjin Zhu · Zhe Wang · Shaoshuai Shi · Hang Xu · Lanqing Hong · Hongsheng Li | N/A | Code |
| PREIM3D: 3D Consistent Precise Image Attribute Editing From a Single Image | Jianhui Li · Jianmin Li · Haoji Zhang · Shilong Liu · Zhengyi Wang · Zihao Xiao · Kaiwen Zheng · Jun Zhu | N/A | Code |
| Bridging Search Region Interaction With Template for RGB-T Tracking | Tianrui Hui · Zizheng Xun · Fengguang Peng · Junshi Huang · Xiaoming Wei · Xiaolin Wei · Jiao Dai · Jizhong Han · Si Liu | N/A | Code |
| Improving Weakly Supervised Temporal Action Localization by Bridging Train-Test Gap in Pseudo Labels | Jingqiu Zhou · Linjiang Huang · Liang Wang · Si Liu · Hongsheng Li | N/A | Code |
| Learning To Zoom and Unzoom | Chittesh Thavamani · Mengtian Li · Francesco Ferroni · Deva Ramanan | N/A | Code |
| MaLP: Manipulation Localization Using a Proactive Scheme | Vishal Asnani · Xi Yin · Tal Hassner · Xiaoming Liu | N/A | Code |
| Logical Consistency and Greater Descriptive Power for Facial Hair Attribute Learning | Haiyu Wu · Grace Bezold · Aman Bhatta · Kevin W. Bowyer | N/A | Code |
| Visual-Tactile Sensing for In-Hand Object Reconstruction | Wenqiang Xu · Zhenjun Yu · Han Xue · Ruolin Ye · Siqiong Yao · Cewu Lu | N/A | Code |
| Filtering, Distillation, and Hard Negatives for Vision-Language Pre-Training | Filip Radenovic · Abhimanyu Dubey · Abhishek Kadian · Todor Mihaylov · Simon Vandenhende · Yash Patel · Yi Wen · Vignesh Ramanathan · Dhruv Mahajan | N/A | Code |
| Semi-Supervised Domain Adaptation With Source Label Adaptation | Yu-Chu Yu · Hsuan-Tien Lin | N/A | Code |
| Self-Supervised Video Forensics by Audio-Visual Anomaly Detection | Chao Feng · Ziyang Chen · Andrew Owens | N/A | Code |
| IterativePFN: True Iterative Point Cloud Filtering | Dasith de Silva Edirimuni · Xuequan Lu · Zhiwen Shao · Gang Li · Antonio Robles-Kelly · Ying He | N/A | Code |
| Joint Video Multi-Frame Interpolation and Deblurring Under Unknown Exposure Time | Wei Shang · Dongwei Ren · Yi Yang · Hongzhi Zhang · Kede Ma · Wangmeng Zuo | N/A | Code |
| Three Guidelines You Should Know for Universally Slimmable Self-Supervised Learning | Yun-Hao Cao · Peiqin Sun · Shuchang Zhou | N/A | Code |
| Standing Between Past and Future: Spatio-Temporal Modeling for Multi-Camera 3D Multi-Object Tracking | Ziqi Pang · Jie Li · Pavel Tokmakov · Dian Chen · Sergey Zagoruyko · Yu-Xiong Wang | N/A | Code |
| VecFontSDF: Learning To Reconstruct and Synthesize High-Quality Vector Fonts via Signed Distance Functions | Zeqing Xia · Bojun Xiong · Zhouhui Lian | N/A | Code |
| Towards Better Gradient Consistency for Neural Signed Distance Functions via Level Set Alignment | Baorui Ma · Junsheng Zhou · Yu-Shen Liu · Zhizhong Han | N/A | Code |
| Visual-Language Prompt Tuning With Knowledge-Guided Context Optimization | Hantao Yao · Rui Zhang · Changsheng Xu | N/A | Code |
| Compositor: Bottom-Up Clustering and Compositing for Robust Part and Object Segmentation | Ju He · Jieneng Chen · Ming-Xian Lin · Qihang Yu · Alan L. Yuille | N/A | Code |
| Physics-Guided ISO-Dependent Sensor Noise Modeling for Extreme Low-Light Photography | Yue Cao · Ming Liu · Shuai Liu · Xiaotao Wang · Lei Lei · Wangmeng Zuo | N/A | Code |
| Dynamic Focus-Aware Positional Queries for Semantic Segmentation | Haoyu He · Jianfei Cai · Zizheng Pan · Jing Liu · Jing Zhang · Dacheng Tao · Bohan Zhuang | N/A | Code |
| Generic-to-Specific Distillation of Masked Autoencoders | Wei Huang · Zhiliang Peng · Li Dong · Furu Wei · Jianbin Jiao · Qixiang Ye | N/A | Code |
| Benchmarking Robustness of 3D Object Detection to Common Corruptions | Yinpeng Dong · Caixin Kang · Jinlai Zhang · Zijian Zhu · Yikai Wang · Xiao Yang · Hang Su · Xingxing Wei · Jun Zhu | N/A | Code |
| GarmentTracking: Category-Level Garment Pose Tracking | Han Xue · Wenqiang Xu · Jieyi Zhang · Tutian Tang · Yutong Li · Wenxin Du · Ruolin Ye · Cewu Lu | N/A | Code |
| TrojDiff: Trojan Attacks on Diffusion Models With Diverse Targets | Weixin Chen · Dawn Song · Bo Li | N/A | Code |
| Weakly Supervised Video Representation Learning With Unaligned Text for Sequential Videos | Sixun Dong · Huazhang Hu · Dongze Lian · Weixin Luo · Yicheng Qian · Shenghua Gao | N/A | Code |
| Generalized Deep 3D Shape Prior via Part-Discretized Diffusion Process | Yuhan Li · Yishun Dou · Xuanhong Chen · Bingbing Ni · Yilin Sun · Yutian Liu · Fuzhen Wang | N/A | Code |
| SpaText: Spatio-Textual Representation for Controllable Image Generation | Omri Avrahami · Thomas Hayes · Oran Gafni · Sonal Gupta · Yaniv Taigman · Devi Parikh · Dani Lischinski · Ohad Fried · Xi Yin | N/A | Code |
| Watch or Listen: Robust Audio-Visual Speech Recognition With Visual Corruption Modeling and Reliability Scoring | Joanna Hong · Minsu Kim · Jeongsoo Choi · Yong Man Ro | N/A | Code |
| RenderDiffusion: Image Diffusion for 3D Reconstruction, Inpainting and Generation | Titas Anciukevičius · Zexiang Xu · Matthew Fisher · Paul Henderson · Hakan Bilen · Niloy J. Mitra · Paul Guerrero | N/A | Code |
| Self-Supervised 3D Scene Flow Estimation Guided by Superpoints | Yaqi Shen · Le Hui · Jin Xie · Jian Yang | N/A | Code |
| Adaptive Annealing for Robust Geometric Estimation | Chitturi Sidhartha · Lalit Manam · Venu Madhav Govindu | N/A | Code |
| Spectral Enhanced Rectangle Transformer for Hyperspectral Image Denoising | Miaoyu Li · Ji Liu · Ying Fu · Yulun Zhang · Dejing Dou | N/A | Code |
| Partial Network Cloning | Jingwen Ye · Songhua Liu · Xinchao Wang | N/A | Code |
| Twin Contrastive Learning With Noisy Labels | Zhizhong Huang · Junping Zhang · Hongming Shan | N/A | Code |
| Ambiguous Medical Image Segmentation Using Diffusion Models | Aimon Rahman · Jeya Maria Jose Valanarasu · Ilker Hacihaliloglu · Vishal M. Patel | N/A | Code |
| High-Res Facial Appearance Capture From Polarized Smartphone Images | Dejan Azinović · Olivier Maury · Christophe Hery · Matthias Nießner · Justus Thies | N/A | Code |
| AssemblyHands: Towards Egocentric Activity Understanding via 3D Hand Pose Estimation | Takehiko Ohkawa · Kun He · Fadime Sener · Tomas Hodan · Luan Tran · Cem Keskin | N/A | Code |
| EXIF As Language: Learning Cross-Modal Associations Between Images and Camera Metadata | Chenhao Zheng · Ayush Shrivastava · Andrew Owens | N/A | Code |
| Latent-NeRF for Shape-Guided Generation of 3D Shapes and Textures | Gal Metzer · Elad Richardson · Or Patashnik · Raja Giryes · Daniel Cohen-Or | N/A | Code |
| Rebalancing Batch Normalization for Exemplar-Based Class-Incremental Learning | Sungmin Cha · Sungjun Cho · Dasol Hwang · Sunwon Hong · Moontae Lee · Taesup Moon | N/A | Code |
| Progressive Neighbor Consistency Mining for Correspondence Pruning | Xin Liu · Jufeng Yang | N/A | Code |
| Post-Training Quantization on Diffusion Models | Yuzhang Shang · Zhihang Yuan · Bin Xie · Bingzhe Wu · Yan Yan | N/A | Code |
| Fully Self-Supervised Depth Estimation From Defocus Clue | Haozhe Si · Bin Zhao · Dong Wang · Yunpeng Gao · Mulin Chen · Zhigang Wang · Xuelong Li | N/A | Code |
| Curricular Object Manipulation in LiDAR-Based Object Detection | Ziyue Zhu · Qiang Meng · Xiao Wang · Ke Wang · Liujiang Yan · Jian Yang | N/A | Code |
| Adaptive Assignment for Geometry Aware Local Feature Matching | Dihe Huang · Ying Chen · Yong Liu · Jianlin Liu · Shang Xu · Wenlong Wu · Yikang Ding · Fan Tang · Chengjie Wang | N/A | Code |
| RefCLIP: A Universal Teacher for Weakly Supervised Referring Expression Comprehension | Lei Jin · Gen Luo · Yiyi Zhou · Xiaoshuai Sun · Guannan Jiang · Annan Shu · Rongrong Ji | N/A | Code |
| ANetQA: A Large-Scale Benchmark for Fine-Grained Compositional Reasoning Over Untrimmed Videos | Zhou Yu · Lixiang Zheng · Zhou Zhao · Fei Wu · Jianping Fan · Kui Ren · Jun Yu | N/A | Code |
| GD-MAE: Generative Decoder for MAE Pre-Training on LiDAR Point Clouds | Honghui Yang · Tong He · Jiaheng Liu · Hua Chen · Boxi Wu · Binbin Lin · Xiaofei He · Wanli Ouyang | N/A | Code |
| Multimodal Industrial Anomaly Detection via Hybrid Fusion | Yue Wang · Jinlong Peng · Jiangning Zhang · Ran Yi · Yabiao Wang · Chengjie Wang | N/A | Code |
| B-Spline Texture Coefficients Estimator for Screen Content Image Super-Resolution | Byeonghyun Pak · Jaewon Lee · Kyong Hwan Jin | N/A | Code |
| CLIP Is Also an Efficient Segmenter: A Text-Driven Approach for Weakly Supervised Semantic Segmentation | Yuqi Lin · Minghao Chen · Wenxiao Wang · Boxi Wu · Ke Li · Binbin Lin · Haifeng Liu · Xiaofei He | N/A | Code |
| MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Generation | Ludan Ruan · Yiyang Ma · Huan Yang · Huiguo He · Bei Liu · Jianlong Fu · Nicholas Jing Yuan · Qin Jin · Baining Guo | N/A | Code |
| FreeNeRF: Improving Few-Shot Neural Rendering With Free Frequency Regularization | Jiawei Yang · Marco Pavone · Yue Wang | N/A | Code |
| SteerNeRF: Accelerating NeRF Rendering via Smooth Viewpoint Trajectory | Sicheng Li · Hao Li · Yue Wang · Yiyi Liao · Lu Yu | N/A | Code |
| Run, Don’t Walk: Chasing Higher FLOPS for Faster Neural Networks | Jierun Chen · Shiu-hong Kao · Hao He · Weipeng Zhuo · Song Wen · Chul-Ho Lee · S.-H. Gary Chan | N/A | Code |
| Temporal Attention Unit: Towards Efficient Spatiotemporal Predictive Learning | Cheng Tan · Zhangyang Gao · Lirong Wu · Yongjie Xu · Jun Xia · Siyuan Li · Stan Z. Li | N/A | Code |
| Semi-Supervised 2D Human Pose Estimation Driven by Position Inconsistency Pseudo Label Correction Module | Linzhi Huang · Yulong Li · Hongbo Tian · Yue Yang · Xiangang Li · Weihong Deng · Jieping Ye | N/A | Code |
| Good Is Bad: Causality Inspired Cloth-Debiasing for Cloth-Changing Person Re-Identification | Zhengwei Yang · Meng Lin · Xian Zhong · Yu Wu · Zheng Wang | N/A | Code |
| Feature Alignment and Uniformity for Test Time Adaptation | Shuai Wang · Daoan Zhang · Zipei Yan · Jianguo Zhang · Rui Li | N/A | Code |
| AeDet: Azimuth-Invariant Multi-View 3D Object Detection | Chengjian Feng · Zequn Jie · Yujie Zhong · Xiangxiang Chu · Lin Ma | N/A | Code |
| Towards Realistic Long-Tailed Semi-Supervised Learning: Consistency Is All You Need | Tong Wei · Kai Gan | N/A | Code |
| OmniAL: A Unified CNN Framework for Unsupervised Anomaly Localization | Ying Zhao | N/A | Code |
| HIER: Metric Learning Beyond Class Labels via Hierarchical Regularization | Sungyeon Kim · Boseung Jeong · Suha Kwak | N/A | Code |
| Generative Diffusion Prior for Unified Image Restoration and Enhancement | Ben Fei · Zhaoyang Lyu · Liang Pan · Junzhe Zhang · Weidong Yang · Tianyue Luo · Bo Zhang · Bo Dai | N/A | Code |
| Discriminating Known From Unknown Objects via Structure-Enhanced Recurrent Variational AutoEncoder | Aming Wu · Cheng Deng | N/A | Code |
| 2PCNet: Two-Phase Consistency Training for Day-to-Night Unsupervised Domain Adaptive Object Detection | Mikhail Kennerley · Jian-Gang Wang · Bharadwaj Veeravalli · Robby T. Tan | N/A | Code |
| Linking Garment With Person via Semantically Associated Landmarks for Virtual Try-On | Keyu Yan · Tingwei Gao · Hui Zhang · Chengjun Xie | N/A | Code |
| A New Comprehensive Benchmark for Semi-Supervised Video Anomaly Detection and Anticipation | Congqi Cao · Yue Lu · Peng Wang · Yanning Zhang | N/A | Code |
| DINER: Depth-Aware Image-Based NEural Radiance Fields | Malte Prinzler · Otmar Hilliges · Justus Thies | N/A | Code |
| Learning Personalized High Quality Volumetric Head Avatars From Monocular RGB Videos | Ziqian Bai · Feitong Tan · Zeng Huang · Kripasindhu Sarkar · Danhang Tang · Di Qiu · Abhimitra Meka · Ruofei Du · Mingsong Dou · Sergio Orts-Escolano · Rohit Pandey · Ping Tan · Thabo Beeler · Sean Fanello · Yinda Zhang | N/A | Code |
| HOOD: Hierarchical Graphs for Generalized Modelling of Clothing Dynamics | Artur Grigorev · Michael J. Black · Otmar Hilliges | N/A | Code |
| Boundary-Enhanced Co-Training for Weakly Supervised Semantic Segmentation | Shenghai Rong · Bohai Tu · Zilei Wang · Junjie Li | N/A | Code |
| Instant Volumetric Head Avatars | Wojciech Zielonka · Timo Bolkart · Justus Thies | N/A | Code |
| From Node Interaction To Hop Interaction: New Effective and Scalable Graph Learning Paradigm | Jie Chen · Zilong Li · Yin Zhu · Junping Zhang · Jian Pu | N/A | Code |
| Transfer4D: A Framework for Frugal Motion Capture and Deformation Transfer | Shubh Maheshwari · Rahul Narain · Ramya Hebbalaguppe | N/A | Code |
| An In-Depth Exploration of Person Re-Identification and Gait Recognition in Cloth-Changing Conditions | Weijia Li · Saihui Hou · Chunjie Zhang · Chunshui Cao · Xu Liu · Yongzhen Huang · Yao Zhao | N/A | Code |
| Event-Based Shape From Polarization | Manasi Muglikar · Leonard Bauersfeld · Diederik Paul Moeys · Davide Scaramuzza | N/A | Code |
| Plateau-Reduced Differentiable Path Tracing | Michael Fischer · Tobias Ritschel | N/A | Code |
| End-to-End Video Matting With Trimap Propagation | Wei-Lun Huang · Ming-Sui Lee | N/A | Code |
| Weakly-Supervised Single-View Image Relighting | Renjiao Yi · Chenyang Zhu · Kai Xu | N/A | Code |
| Learning Audio-Visual Source Localization via False Negative Aware Contrastive Learning | Weixuan Sun · Jiayi Zhang · Jianyuan Wang · Zheyuan Liu · Yiran Zhong · Tianpeng Feng · Yandong Guo · Yanhao Zhang · Nick Barnes | N/A | Code |
| Non-Contrastive Unsupervised Learning of Physiological Signals From Video | Jeremy Speth · Nathan Vance · Patrick Flynn · Adam Czajka | N/A | Code |
| Structured Sparsity Learning for Efficient Video Super-Resolution | Bin Xia · Jingwen He · Yulun Zhang · Yitong Wang · Yapeng Tian · Wenming Yang · Luc Van Gool | N/A | Code |
| ReVISE: Self-Supervised Speech Resynthesis With Visual Input for Universal and Generalized Speech Regeneration | Wei-Ning Hsu · Tal Remez · Bowen Shi · Jacob Donley · Yossi Adi | N/A | Code |
| Shape, Pose, and Appearance From a Single Image via Bootstrapped Radiance Field Inversion | Dario Pavllo · David Joseph Tan · Marie-Julie Rakotosaona · Federico Tombari | N/A | Code |
| Towards Better Decision Forests: Forest Alternating Optimization | Miguel Á. Carreira-Perpiñán · Magzhan Gabidolla · Arman Zharmagambetov | N/A | Code |
| CrOC: Cross-View Online Clustering for Dense Visual Representation Learning | Thomas Stegmüller · Tim Lebailly · Behzad Bozorgtabar · Tinne Tuytelaars · Jean-Philippe Thiran | N/A | Code |
| Polynomial Implicit Neural Representations for Large Diverse Datasets | Rajhans Singh · Ankita Shukla · Pavan Turaga | N/A | Code |
| GradICON: Approximate Diffeomorphisms via Gradient Inverse Consistency | Lin Tian · Hastings Greer · François-Xavier Vialard · Roland Kwitt · Raúl San José Estépar · Richard Jarrett Rushmore · Nikolaos Makris · Sylvain Bouix · Marc Niethammer | N/A | Code |
| Exploring Discontinuity for Video Frame Interpolation | Sangjin Lee · Hyeongmin Lee · Chajin Shin · Hanbin Son · Sangyoun Lee | N/A | Code |
| Local 3D Editing via 3D Distillation of CLIP Knowledge | Junha Hyung · Sungwon Hwang · Daejin Kim · Hyunji Lee · Jaegul Choo | N/A | Code |
| Dynamic Conceptional Contrastive Learning for Generalized Category Discovery | Nan Pu · Zhun Zhong · Nicu Sebe | N/A | Code |
| Look, Radiate, and Learn: Self-Supervised Localisation via Radio-Visual Correspondence | Mohammed Alloulah · Maximilian Arnold | N/A | Code |
| Instance Relation Graph Guided Source-Free Domain Adaptive Object Detection | Vibashan VS · Poojan Oza · Vishal M. Patel | N/A | Code |
| High Fidelity 3D Hand Shape Reconstruction via Scalable Graph Frequency Decomposition | Tianyu Luan · Yuanhao Zhai · Jingjing Meng · Zhong Li · Zhang Chen · Yi Xu · Junsong Yuan | N/A | Code |
| 3D Highlighter: Localizing Regions on 3D Shapes via Text Descriptions | Dale Decatur · Itai Lang · Rana Hanocka | N/A | Code |
| Egocentric Video Task Translation | Zihui Xue · Yale Song · Kristen Grauman · Lorenzo Torresani | N/A | Code |
| Pixels, Regions, and Objects: Multiple Enhancement for Salient Object Detection | Yi Wang · Ruili Wang · Xin Fan · Tianzhu Wang · Xiangjian He | N/A | Code |
| Balanced Energy Regularization Loss for Out-of-Distribution Detection | Hyunjun Choi · Hawook Jeong · Jin Young Choi | N/A | Code |
| Private Image Generation With Dual-Purpose Auxiliary Classifier | Chen Chen · Daochang Liu · Siqi Ma · Surya Nepal · Chang Xu | N/A | Code |
| Controllable Mesh Generation Through Sparse Latent Point Diffusion Models | Zhaoyang Lyu · Jinyi Wang · Yuwei An · Ya Zhang · Dahua Lin · Bo Dai | N/A | Code |
| Neural Video Compression With Diverse Contexts | Jiahao Li · Bin Li · Yan Lu | N/A | Code |
| Uni3D: A Unified Baseline for Multi-Dataset 3D Object Detection | Bo Zhang · Jiakang Yuan · Botian Shi · Tao Chen · Yikang Li · Yu Qiao | N/A | Code |
| ScarceNet: Animal Pose Estimation With Scarce Annotations | Chen Li · Gim Hee Lee | N/A | Code |
| Fast Contextual Scene Graph Generation With Unbiased Context Augmentation | Tianlei Jin · Fangtai Guo · Qiwei Meng · Shiqiang Zhu · Xiangming Xi · Wen Wang · Zonghao Mu · Wei Song | N/A | Code |
| TriDet: Temporal Action Detection With Relative Boundary Modeling | Dingfeng Shi · Yujie Zhong · Qiong Cao · Lin Ma · Jia Li · Dacheng Tao | N/A | Code |
| Multi-Level Logit Distillation | Ying Jin · Jiaqi Wang · Dahua Lin | N/A | Code |
| StyleAdv: Meta Style Adversarial Training for Cross-Domain Few-Shot Learning | Yuqian Fu · Yu Xie · Yanwei Fu · Yu-Gang Jiang | N/A | Code |
| Text With Knowledge Graph Augmented Transformer for Video Captioning | Xin Gu · Guang Chen · Yufei Wang · Libo Zhang · Tiejian Luo · Longyin Wen | N/A | Code |
| Semantic Ray: Learning a Generalizable Semantic Field With Cross-Reprojection Attention | Fangfu Liu · Chubin Zhang · Yu Zheng · Yueqi Duan | N/A | Code |
| MELTR: Meta Loss Transformer for Learning To Fine-Tune Video Foundation Models | Dohwan Ko · Joonmyung Choi · Hyeong Kyu Choi · Kyoung-Woon On · Byungseok Roh · Hyunwoo J. Kim | N/A | Code |
| Self-Supervised AutoFlow | Hsin-Ping Huang · Charles Herrmann · Junhwa Hur · Erika Lu · Kyle Sargent · Austin Stone · Ming-Hsuan Yang · Deqing Sun | N/A | Code |
| Adaptive Sparse Convolutional Networks With Global Context Enhancement for Faster Object Detection on Drone Images | Bowei Du · Yecheng Huang · Jiaxin Chen · Di Huang | N/A | Code |
| Context-Based Trit-Plane Coding for Progressive Image Compression | Seungmin Jeon · Kwang Pyo Choi · Youngo Park · Chang-Su Kim | N/A | Code |
| Unsupervised Contour Tracking of Live Cells by Mechanical and Cycle Consistency Losses | Junbong Jang · Kwonmoo Lee · Tae-Kyun Kim | N/A | Code |
| VQACL: A Novel Visual Question Answering Continual Learning Setting | Xi Zhang · Feifei Zhang · Changsheng Xu | N/A | Code |
| Explicit Visual Prompting for Low-Level Structure Segmentations | Weihuang Liu · Xi Shen · Chi-Man Pun · Xiaodong Cun | N/A | Code |
| Practical Network Acceleration With Tiny Sets | Guo-Hua Wang · Jianxin Wu | N/A | Code |
| Sphere-Guided Training of Neural Implicit Surfaces | Andreea Dogaru · Andrei-Timotei Ardelean · Savva Ignatyev · Egor Zakharov · Evgeny Burnaev | N/A | Code |
| Texture-Guided Saliency Distilling for Unsupervised Salient Object Detection | Huajun Zhou · Bo Qiao · Lingxiao Yang · Jianhuang Lai · Xiaohua Xie | N/A | Code |
| FFHQ-UV: Normalized Facial UV-Texture Dataset for 3D Face Reconstruction | Haoran Bai · Di Kang · Haoxian Zhang · Jinshan Pan · Linchao Bao | N/A | Code |
| Differentiable Shadow Mapping for Efficient Inverse Graphics | Markus Worchel · Marc Alexa | N/A | Code |
| SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation | Wenxuan Zhang · Xiaodong Cun · Xuan Wang · Yong Zhang · Xi Shen · Yu Guo · Ying Shan · Fei Wang | N/A | Code |
| High-Fidelity and Freely Controllable Talking Head Video Generation | Yue Gao · Yuan Zhou · Jinglu Wang · Xiao Li · Xiang Ming · Yan Lu | N/A | Code |
| BiFormer: Learning Bilateral Motion Estimation via Bilateral Transformer for 4K Video Frame Interpolation | Junheum Park · Jintae Kim · Chang-Su Kim | N/A | Code |
| Noisy Correspondence Learning With Meta Similarity Correction | Haochen Han · Kaiyao Miao · Qinghua Zheng · Minnan Luo | N/A | Code |
| EVAL: Explainable Video Anomaly Localization | Ashish Singh · Michael J. Jones · Erik G. Learned-Miller | N/A | Code |
| Adaptive Plasticity Improvement for Continual Learning | Yan-Shuo Liang · Wu-Jun Li | N/A | Code |
| Edges to Shapes to Concepts: Adversarial Augmentation for Robust Vision | Aditay Tripathi · Rishubh Singh · Anirban Chakraborty · Pradeep Shenoy | N/A | Code |
| MOSO: Decomposing MOtion, Scene and Object for Video Prediction | Mingzhen Sun · Weining Wang · Xinxin Zhu · Jing Liu | N/A | Code |
| Accelerated Coordinate Encoding: Learning to Relocalize in Minutes Using RGB and Poses | Eric Brachmann · Tommaso Cavallari · Victor Adrian Prisacariu | N/A | Code |
| A Probabilistic Attention Model With Occlusion-Aware Texture Regression for 3D Hand Reconstruction From a Single RGB Image | Zheheng Jiang · Hossein Rahmani · Sue Black · Bryan M. Williams | N/A | Code |
| Revisiting Rotation Averaging: Uncertainties and Robust Losses | Ganlin Zhang · Viktor Larsson · Daniel Barath | N/A | Code |
| LiDAR-in-the-Loop Hyperparameter Optimization | Félix Goudreault · Dominik Scheuble · Mario Bijelic · Nicolas Robidoux · Felix Heide | N/A | Code |
| Query-Dependent Video Representation for Moment Retrieval and Highlight Detection | WonJun Moon · Sangeek Hyun · SangUk Park · Dongchan Park · Jae-Pil Heo | N/A | Code |
| High-Fidelity 3D Face Generation From Natural Language Descriptions | Menghua Wu · Hao Zhu · Linjia Huang · Yiyu Zhuang · Yuanxun Lu · Xun Cao | N/A | Code |
| NeRF-Supervised Deep Stereo | Fabio Tosi · Alessio Tonioni · Daniele De Gregorio · Matteo Poggi | N/A | Code |
| vMAP: Vectorised Object Mapping for Neural Field SLAM | Xin Kong · Shikun Liu · Marwan Taher · Andrew J. Davison | N/A | Code |
| DiffRF: Rendering-Guided 3D Radiance Field Diffusion | Norman Müller · Yawar Siddiqui · Lorenzo Porzi · Samuel Rota Bulò · Peter Kontschieder · Matthias Nießner | N/A | Code |
| TokenHPE: Learning Orientation Tokens for Efficient Head Pose Estimation via Transformers | Cheng Zhang · Hai Liu · Yongjian Deng · Bochen Xie · Youfu Li | N/A | Code |
| Learning a Depth Covariance Function | Eric Dexheimer · Andrew J. Davison | N/A | Code |
| Handy: Towards a High Fidelity 3D Hand Shape and Appearance Model | Rolandos Alexandros Potamias · Stylianos Ploumpis · Stylianos Moschoglou · Vasileios Triantafyllou · Stefanos Zafeiriou | N/A | Code |
| The Best Defense Is a Good Offense: Adversarial Augmentation Against Adversarial Attacks | Iuri Frosio · Jan Kautz | N/A | Code |
| Test of Time: Instilling Video-Language Models With a Sense of Time | Piyush Bagad · Makarand Tapaswi · Cees G. M. Snoek | N/A | Code |
| BundleSDF: Neural 6-DoF Tracking and 3D Reconstruction of Unknown Objects | Bowen Wen · Jonathan Tremblay · Valts Blukis · Stephen Tyree · Thomas Müller · Alex Evans · Dieter Fox · Jan Kautz · Stan Birchfield | N/A | Code |
| Leveraging Hidden Positives for Unsupervised Semantic Segmentation | Hyun Seok Seong · WonJun Moon · SuBeen Lee · Jae-Pil Heo | N/A | Code |
| BlendFields: Few-Shot Example-Driven Facial Modeling | Kacper Kania · Stephan J. Garbin · Andrea Tagliasacchi · Virginia Estellers · Kwang Moo Yi · Julien Valentin · Tomasz Trzciński · Marek Kowalski | N/A | Code |
| CIRCLE: Capture in Rich Contextual Environments | João Pedro Araújo · Jiaman Li · Karthik Vetrivel · Rishi Agarwal · Jiajun Wu · Deepak Gopinath · Alexander William Clegg · Karen Liu | N/A | Code |
| Realistic Saliency Guided Image Enhancement | S. Mahdi H. Miangoleh · Zoya Bylinskii · Eric Kee · Eli Shechtman · Yağiz Aksoy | N/A | Code |
| Implicit Neural Head Synthesis via Controllable Local Deformation Fields | Chuhan Chen · Matthew O’Toole · Gaurav Bharaj · Pablo Garrido | N/A | Code |
| Ensemble-Based Blackbox Attacks on Dense Prediction | Zikui Cai · Yaoteng Tan · M. Salman Asif | N/A | Code |
| NaQ: Leveraging Narrations As Queries To Supervise Episodic Memory | Santhosh Kumar Ramakrishnan · Ziad Al-Halah · Kristen Grauman | N/A | Code |
| Rethinking Federated Learning With Domain Shift: A Prototype View | Wenke Huang · Mang Ye · Zekun Shi · He Li · Bo Du | N/A | Code |
| Spatio-Temporal Pixel-Level Contrastive Learning-Based Source-Free Domain Adaptation for Video Semantic Segmentation | Shao-Yuan Lo · Poojan Oza · Sumanth Chennupati · Alejandro Galindo · Vishal M. Patel | N/A | Code |
| Bi3D: Bi-Domain Active Learning for Cross-Domain 3D Object Detection | Jiakang Yuan · Bo Zhang · Xiangchao Yan · Tao Chen · Botian Shi · Yikang Li · Yu Qiao | N/A | Code |
| STAR Loss: Reducing Semantic Ambiguity in Facial Landmark Detection | Zhenglin Zhou · Huaxia Li · Hong Liu · Nanyang Wang · Gang Yu · Rongrong Ji | N/A | Code |
| Diverse 3D Hand Gesture Prediction From Body Dynamics by Bilateral Hand Disentanglement | Xingqun Qi · Chen Liu · Muyi Sun · Lincheng Li · Changjie Fan · Xin Yu | N/A | Code |
| Sparsely Annotated Semantic Segmentation With Adaptive Gaussian Mixtures | Linshan Wu · Zhun Zhong · Leyuan Fang · Xingxin He · Qiang Liu · Jiayi Ma · Hao Chen | N/A | Code |
| Visual Dependency Transformers: Dependency Tree Emerges From Reversed Attention | Mingyu Ding · Yikang Shen · Lijie Fan · Zhenfang Chen · Zitian Chen · Ping Luo · Joshua B. Tenenbaum · Chuang Gan | N/A | Code |
| Frame Flexible Network | Yitian Zhang · Yue Bai · Chang Liu · Huan Wang · Sheng Li · Yun Fu | N/A | Code |
| Uni-Perceiver v2: A Generalist Model for Large-Scale Vision and Vision-Language Tasks | Hao Li · Jinguo Zhu · Xiaohu Jiang · Xizhou Zhu · Hongsheng Li · Chun Yuan · Xiaohua Wang · Yu Qiao · Xiaogang Wang · Wenhai Wang · Jifeng Dai | N/A | Code |
| DCFace: Synthetic Face Generation With Dual Condition Diffusion Model | Minchul Kim · Feng Liu · Anil Jain · Xiaoming Liu | N/A | Code |
| Referring Image Matting | Jizhizi Li · Jing Zhang · Dacheng Tao | N/A | Code |
| Fast Monocular Scene Reconstruction With Global-Sparse Local-Dense Grids | Wei Dong · Christopher Choy · Charles Loop · Or Litany · Yuke Zhu · Anima Anandkumar | N/A | Code |
| DPE: Disentanglement of Pose and Expression for General Video Portrait Editing | Youxin Pang · Yong Zhang · Weize Quan · Yanbo Fan · Xiaodong Cun · Ying Shan · Dong-Ming Yan | N/A | Code |
| IDGI: A Framework To Eliminate Explanation Noise From Integrated Gradients | Ruo Yang · Binghui Wang · Mustafa Bilgic | N/A | Code |
| DynamicDet: A Unified Dynamic Architecture for Object Detection | Zhihao Lin · Yongtao Wang · Jinhe Zhang · Xiaojie Chu | N/A | Code |
| Task-Specific Fine-Tuning via Variational Information Bottleneck for Weakly-Supervised Pathology Whole Slide Image Classification | Honglin Li · Chenglu Zhu · Yunlong Zhang · Yuxuan Sun · Zhongyi Shui · Wenwei Kuang · Sunyi Zheng · Lin Yang | N/A | Code |
| VNE: An Effective Method for Improving Deep Representation by Manipulating Eigenvalue Distribution | Jaeill Kim · Suhyun Kang · Duhun Hwang · Jungwook Shin · Wonjong Rhee | N/A | Code |
| Semi-Weakly Supervised Object Kinematic Motion Prediction | Gengxin Liu · Qian Sun · Haibin Huang · Chongyang Ma · Yulan Guo · Li Yi · Hui Huang · Ruizhen Hu | N/A | Code |
| Computational Flash Photography Through Intrinsics | Sepideh Sarajian Maralan · Chris Careaga · Yağiz Aksoy | N/A | Code |
| Inversion-Based Style Transfer With Diffusion Models | Yuxin Zhang · Nisha Huang · Fan Tang · Haibin Huang · Chongyang Ma · Weiming Dong · Changsheng Xu | N/A | Code |
| Data-Driven Feature Tracking for Event Cameras | Nico Messikommer · Carter Fang · Mathias Gehrig · Davide Scaramuzza | N/A | Code |
| QPGesture: Quantization-Based and Phase-Guided Motion Matching for Natural Speech-Driven Gesture Generation | Sicheng Yang · Zhiyong Wu · Minglei Li · Zhensong Zhang · Lei Hao · Weihong Bao · Haolin Zhuang | N/A | Code |
| Neural Fourier Filter Bank | Zhijie Wu · Yuhe Jin · Kwang Moo Yi | N/A | Code |
| Solving Oscillation Problem in Post-Training Quantization Through a Theoretical Perspective | Yuexiao Ma · Huixia Li · Xiawu Zheng · Xuefeng Xiao · Rui Wang · Shilei Wen · Xin Pan · Fei Chao · Rongrong Ji | N/A | Code |
| Full or Weak Annotations? An Adaptive Strategy for Budget-Constrained Annotation Campaigns | Javier Gamazo Tejero · Martin S. Zinkernagel · Sebastian Wolf · Raphael Sznitman · Pablo Márquez-Neila | N/A | Code |
| Trap Attention: Monocular Depth Estimation With Manual Traps | Chao Ning · Hongping Gan | N/A | Code |
| Physical-World Optical Adversarial Attacks on 3D Face Recognition | Yanjie Li · Yiquan Li · Xuelong Dai · Songtao Guo · Bin Xiao | N/A | Code |
| Re-Thinking Federated Active Learning Based on Inter-Class Diversity | SangMook Kim · Sangmin Bae · Hwanjun Song · Se-Young Yun | N/A | Code |
| EMT-NAS: Transferring Architectural Knowledge Between Tasks From Different Datasets | Peng Liao · Yaochu Jin · Wenli Du | N/A | Code |
| Temporal Consistent 3D LiDAR Representation Learning for Semantic Perception in Autonomous Driving | Lucas Nunes · Louis Wiesmann · Rodrigo Marcuzzi · Xieyuanli Chen · Jens Behley · Cyrill Stachniss | N/A | Code |
| Document Image Shadow Removal Guided by Color-Aware Background | Ling Zhang · Yinghao He · Qing Zhang · Zheng Liu · Xiaolong Zhang · Chunxia Xiao | N/A | Code |
| Pose-Disentangled Contrastive Learning for Self-Supervised Facial Representation | Yuanyuan Liu · Wenbin Wang · Yibing Zhan · Shaoze Feng · Kejun Liu · Zhe Chen | N/A | Code |
| Ham2Pose: Animating Sign Language Notation Into Pose Sequences | Rotem Shalev Arkushin · Amit Moryossef · Ohad Fried | N/A | Code |
| Resource-Efficient RGBD Aerial Tracking | Jinyu Yang · Shang Gao · Zhe Li · Feng Zheng · Aleš Leonardis | N/A | Code |
| Neural Transformation Fields for Arbitrary-Styled Font Generation | Bin Fu · Junjun He · Jianjun Wang · Yu Qiao | N/A | Code |
| Density-Insensitive Unsupervised Domain Adaption on 3D Object Detection | Qianjiang Hu · Daizong Liu · Wei Hu | N/A | Code |
| PAniC-3D: Stylized Single-View 3D Reconstruction From Portraits of Anime Characters | Shuhong Chen · Kevin Zhang · Yichun Shi · Heng Wang · Yiheng Zhu · Guoxian Song · Sizhe An · Janus Kristjansson · Xiao Yang · Matthias Zwicker | N/A | Code |
| HS-Pose: Hybrid Scope Feature Extraction for Category-Level Object Pose Estimation | Linfang Zheng · Chen Wang · Yinghan Sun · Esha Dasgupta · Hua Chen · Aleš Leonardis · Wei Zhang · Hyung Jin Chang | N/A | Code |
| A Hierarchical Representation Network for Accurate and Detailed Face Reconstruction From In-the-Wild Images | Biwen Lei · Jianqiang Ren · Mengyang Feng · Miaomiao Cui · Xuansong Xie | N/A | Code |
| Language in a Bottle: Language Model Guided Concept Bottlenecks for Interpretable Image Classification | Yue Yang · Artemis Panagopoulou · Shenghao Zhou · Daniel Jin · Chris Callison-Burch · Mark Yatskar | N/A | Code |
| SfM-TTR: Using Structure From Motion for Test-Time Refinement of Single-View Depth Networks | Sergio Izquierdo · Javier Civera | N/A | Code |
| TINC: Tree-Structured Implicit Neural Compression | Runzhao Yang | N/A | Code |
| Cross-Domain Image Captioning With Discriminative Finetuning | Roberto Dessì · Michele Bevilacqua · Eleonora Gualdoni · Nathanaël Carraz Rakotonirina · Francesca Franzon · Marco Baroni | N/A | Code |
| Learning To Detect Mirrors From Videos via Dual Correspondences | Jiaying Lin · Xin Tan · Rynson W.H. Lau | N/A | Code |
| Conflict-Based Cross-View Consistency for Semi-Supervised Semantic Segmentation | Zicheng Wang · Zhen Zhao · Xiaoxia Xing · Dong Xu · Xiangyu Kong · Luping Zhou | N/A | Code |
| Robust Unsupervised StyleGAN Image Restoration | Yohan Poirier-Ginter · Jean-François Lalonde | N/A | Code |
| Masked Video Distillation: Rethinking Masked Feature Modeling for Self-Supervised Video Representation Learning | Rui Wang · Dongdong Chen · Zuxuan Wu · Yinpeng Chen · Xiyang Dai · Mengchen Liu · Lu Yuan · Yu-Gang Jiang | N/A | Code |
| Neural Fields Meet Explicit Geometric Representations for Inverse Rendering of Urban Scenes | Zian Wang · Tianchang Shen · Jun Gao · Shengyu Huang · Jacob Munkberg · Jon Hasselgren · Zan Gojcic · Wenzheng Chen · Sanja Fidler | N/A | Code |
| Augmentation Matters: A Simple-Yet-Effective Approach to Semi-Supervised Semantic Segmentation | Zhen Zhao · Lihe Yang · Sifan Long · Jimin Pi · Luping Zhou · Jingdong Wang | N/A | Code |
| Policy Adaptation From Foundation Model Feedback | Yuying Ge · Annabella Macaluso · Li Erran Li · Ping Luo · Xiaolong Wang | N/A | Code |
| Person Image Synthesis via Denoising Diffusion Model | Ankan Kumar Bhunia · Salman Khan · Hisham Cholakkal · Rao Muhammad Anwer · Jorma Laaksonen · Mubarak Shah · Fahad Shahbaz Khan | N/A | Code |
| Bidirectional Cross-Modal Knowledge Exploration for Video Recognition With Pre-Trained Vision-Language Models | Wenhao Wu · Xiaohan Wang · Haipeng Luo · Jingdong Wang · Yi Yang · Wanli Ouyang | N/A | Code |
| CiaoSR: Continuous Implicit Attention-in-Attention Network for Arbitrary-Scale Image Super-Resolution | Jiezhang Cao · Qin Wang · Yongqin Xian · Yawei Li · Bingbing Ni · Zhiming Pi · Kai Zhang · Yulun Zhang · Radu Timofte · Luc Van Gool | N/A | Code |
| Black-Box Sparse Adversarial Attack via Multi-Objective Optimisation | Phoenix Neale Williams · Ke Li | N/A | Code |
| AdaptiveMix: Improving GAN Training via Feature Space Shrinkage | Haozhe Liu · Wentian Zhang · Bing Li · Haoqian Wu · Nanjun He · Yawen Huang · Yuexiang Li · Bernard Ghanem · Yefeng Zheng | N/A | Code |
| ViTs for SITS: Vision Transformers for Satellite Image Time Series | Michail Tarasiou · Erik Chavez · Stefanos Zafeiriou | N/A | Code |
| Latency Matters: Real-Time Action Forecasting Transformer | Harshayu Girase · Nakul Agarwal · Chiho Choi · Karttikeya Mangalam | N/A | Code |
| Multi-Sensor Large-Scale Dataset for Multi-View 3D Reconstruction | Oleg Voynov · Gleb Bobrovskikh · Pavel Karpyshev · Saveliy Galochkin · Andrei-Timotei Ardelean · Arseniy Bozhenko · Ekaterina Karmanova · Pavel Kopanev · Yaroslav Labutin-Rymsho · Ruslan Rakhimov · Aleksandr Safin · Valerii Serpiva · Alexey Artemov · Evgeny Burnaev · Dzmitry Tsetserukou · Denis Zorin | N/A | Code |
| Learning From Noisy Labels With Decoupled Meta Label Purifier | Yuanpeng Tu · Boshen Zhang · Yuxi Li · Liang Liu · Jian Li · Yabiao Wang · Chengjie Wang · Cai Rong Zhao | N/A | Code |
| Flow Supervision for Deformable NeRF | Chaoyang Wang · Lachlan Ewen MacDonald · László A. Jeni · Simon Lucey | N/A | Code |
| Unifying Vision, Text, and Layout for Universal Document Processing | Zineng Tang · Ziyi Yang · Guoxin Wang · Yuwei Fang · Yang Liu · Chenguang Zhu · Michael Zeng · Cha Zhang · Mohit Bansal | N/A | Code |
| BKinD-3D: Self-Supervised 3D Keypoint Discovery From Multi-View Videos | Jennifer J. Sun · Lili Karashchuk · Amil Dravid · Serim Ryou · Sonia Fereidooni · John C. Tuthill · Aggelos Katsaggelos · Bingni W. Brunton · Georgia Gkioxari · Ann Kennedy · Yisong Yue · Pietro Perona | N/A | Code |
| Architecture, Dataset and Model-Scale Agnostic Data-Free Meta-Learning | Zixuan Hu · Li Shen · Zhenyi Wang · Tongliang Liu · Chun Yuan · Dacheng Tao | N/A | Code |
| RWSC-Fusion: Region-Wise Style-Controlled Fusion Network for the Prohibited X-Ray Security Image Synthesis | Luwen Duan · Min Wu · Lijian Mao · Jun Yin · Jianping Xiong · Xi Li | N/A | Code |
| Meta Architecture for Point Cloud Analysis | Haojia Lin · Xiawu Zheng · Lijiang Li · Fei Chao · Shanshan Wang · Yan Wang · Yonghong Tian · Rongrong Ji | N/A | Code |
| DyLiN: Making Light Field Networks Dynamic | Heng Yu · Joel Julin · Zoltán Á. Milacski · Koichiro Niinuma · László A. Jeni | N/A | Code |
| OpenMix: Exploring Outlier Samples for Misclassification Detection | Fei Zhu · Zhen Cheng · Xu-Yao Zhang · Cheng-Lin Liu | N/A | Code |
| Adaptive Graph Convolutional Subspace Clustering | Lai Wei · Zhengwei Chen · Jun Yin · Changming Zhu · Rigui Zhou · Jin Liu | N/A | Code |
| Extracting Motion and Appearance via Inter-Frame Attention for Efficient Video Frame Interpolation | Guozhen Zhang · Yuhan Zhu · Haonan Wang · Youxin Chen · Gangshan Wu · Limin Wang | N/A | Code |
| Hybrid Active Learning via Deep Clustering for Video Action Detection | Aayush J. Rana · Yogesh S. Rawat | N/A | Code |
| Equiangular Basis Vectors | Yang Shen · Xuhao Sun · Xiu-Shen Wei | N/A | Code |
| CAT: LoCalization and IdentificAtion Cascade Detection Transformer for Open-World Object Detection | Shuailei Ma · Yuefeng Wang · Ying Wei · Jiaqi Fan · Thomas H. Li · Hongli Liu · Fanbing Lv | N/A | Code |
| An Actor-Centric Causality Graph for Asynchronous Temporal Inference in Group Activity | Zhao Xie · Tian Gao · Kewei Wu · Jiao Chang | N/A | Code |
| GCFAgg: Global and Cross-View Feature Aggregation for Multi-View Clustering | Weiqing Yan · Yuanyang Zhang · Chenlei Lv · Chang Tang · Guanghui Yue · Liang Liao · Weisi Lin | N/A | Code |
| Unsupervised Visible-Infrared Person Re-Identification via Progressive Graph Matching and Alternate Learning | Zesen Wu · Mang Ye | N/A | Code |
| Similarity Maps for Self-Training Weakly-Supervised Phrase Grounding | Tal Shaharabany · Lior Wolf | N/A | Code |
| DA Wand: Distortion-Aware Selection Using Neural Mesh Parameterization | Richard Liu · Noam Aigerman · Vladimir G. Kim · Rana Hanocka | N/A | Code |
| BiCro: Noisy Correspondence Rectification for Multi-Modality Data via Bi-Directional Cross-Modal Similarity Consistency | Shuo Yang · Zhaopan Xu · Kai Wang · Yang You · Hongxun Yao · Tongliang Liu · Min Xu | N/A | Code |
| DaFKD: Domain-Aware Federated Knowledge Distillation | Haozhao Wang · Yichen Li · Wenchao Xu · Ruixuan Li · Yufeng Zhan · Zhigang Zeng | N/A | Code |
| Single Image Depth Prediction Made Better: A Multivariate Gaussian Take | Ce Liu · Suryansh Kumar · Shuhang Gu · Radu Timofte · Luc Van Gool | N/A | Code |
| Align Your Latents: High-Resolution Video Synthesis With Latent Diffusion Models | Andreas Blattmann · Robin Rombach · Huan Ling · Tim Dockhorn · Seung Wook Kim · Sanja Fidler · Karsten Kreis | N/A | Code |
| GeoLayoutLM: Geometric Pre-Training for Visual Information Extraction | Chuwei Luo · Changxu Cheng · Qi Zheng · Cong Yao | N/A | Code |
| VideoMAE V2: Scaling Video Masked Autoencoders With Dual Masking | Limin Wang · Bingkun Huang · Zhiyu Zhao · Zhan Tong · Yinan He · Yi Wang · Yali Wang · Yu Qiao | N/A | Code |
| CVT-SLR: Contrastive Visual-Textual Transformation for Sign Language Recognition With Variational Alignment | Jiangbin Zheng · Yile Wang · Cheng Tan · Siyuan Li · Ge Wang · Jun Xia · Yidong Chen · Stan Z. Li | N/A | Code |
| All Are Worth Words: A ViT Backbone for Diffusion Models | Fan Bao · Shen Nie · Kaiwen Xue · Yue Cao · Chongxuan Li · Hang Su · Jun Zhu | N/A | Code |
| PanoSwin: A Pano-Style Swin Transformer for Panorama Understanding | Zhixin Ling · Zhen Xing · Xiangdong Zhou · Manliang Cao · Guichun Zhou | N/A | Code |
| sRGB Real Noise Synthesizing With Neighboring Correlation-Aware Noise Model | Zixuan Fu · Lanqing Guo · Bihan Wen | N/A | Code |
| Extracting Class Activation Maps From Non-Discriminative Features As Well | Zhaozheng Chen · Qianru Sun | N/A | Code |
| GKEAL: Gaussian Kernel Embedded Analytic Learning for Few-Shot Class Incremental Task | Huiping Zhuang · Zhenyu Weng · Run He · Zhiping Lin · Ziqian Zeng | N/A | Code |
| ERM-KTP: Knowledge-Level Machine Unlearning via Knowledge Transfer | Shen Lin · Xiaoyu Zhang · Chenyang Chen · Xiaofeng Chen · Willy Susilo | N/A | Code |
| PDPP: Projected Diffusion for Procedure Planning in Instructional Videos | Hanlin Wang · Yilu Wu · Sheng Guo · Limin Wang | N/A | Code |
| NeuDA: Neural Deformable Anchor for High-Fidelity Implicit Surface Reconstruction | Bowen Cai · Jinchi Huang · Rongfei Jia · Chengfei Lv · Huan Fu | N/A | Code |
| Deep Polarization Reconstruction With PDAVIS Events | Haiyang Mei · Zuowen Wang · Xin Yang · Xiaopeng Wei · Tobi Delbruck | N/A | Code |
| Beyond Attentive Tokens: Incorporating Token Importance and Diversity for Efficient Vision Transformers | Sifan Long · Zhen Zhao · Jimin Pi · Shengsheng Wang · Jingdong Wang | N/A | Code |
| PointClustering: Unsupervised Point Cloud Pre-Training Using Transformation Invariance in Clustering | Fuchen Long · Ting Yao · Zhaofan Qiu · Lusong Li · Tao Mei | N/A | Code |
| PCR: Proxy-Based Contrastive Replay for Online Class-Incremental Continual Learning | Huiwei Lin · Baoquan Zhang · Shanshan Feng · Xutao Li · Yunming Ye | N/A | Code |
| Boundary-Aware Backward-Compatible Representation via Adversarial Learning in Image Retrieval | Tan Pan · Furong Xu · Xudong Yang · Sifeng He · Chen Jiang · Qingpei Guo · Feng Qian · Xiaobo Zhang · Yuan Cheng · Lei Yang · Wei Chu | N/A | Code |
| PermutoSDF: Fast Multi-View Reconstruction With Implicit Surfaces Using Permutohedral Lattices | Radu Alexandru Rosu · Sven Behnke | N/A | Code |
| StyleGene: Crossover and Mutation of Region-Level Facial Genes for Kinship Face Synthesis | Hao Li · Xianxu Hou · Zepeng Huang · Linlin Shen | N/A | Code |
| MixNeRF: Modeling a Ray With Mixture Density for Novel View Synthesis From Sparse Inputs | Seunghyeon Seo · Donghoon Han · Yeonjin Chang · Nojun Kwak | N/A | Code |
| Upcycling Models Under Domain and Category Shift | Sanqing Qu · Tianpei Zou · Florian Röhrbein · Cewu Lu · Guang Chen · Dacheng Tao · Changjun Jiang | N/A | Code |
| Towards Unbiased Volume Rendering of Neural Implicit Surfaces With Geometry Priors | Yongqiang Zhang · Zhipeng Hu · Haoqian Wu · Minda Zhao · Lincheng Li · Zhengxia Zou · Changjie Fan | N/A | Code |
| Avatars Grow Legs: Generating Smooth Human Motion From Sparse Tracking Inputs With Diffusion Model | Yuming Du · Robin Kips · Albert Pumarola · Sebastian Starke · Ali Thabet · Artsiom Sanakoyeu | N/A | Code |
| MoStGAN-V: Video Generation With Temporal Motion Styles | Xiaoqian Shen · Xiang Li · Mohamed Elhoseiny | N/A | Code |
| On the Importance of Accurate Geometry Data for Dense 3D Vision Tasks | HyunJun Jung · Patrick Ruhkamp · Guangyao Zhai · Nikolas Brasch · Yitong Li · Yannick Verdie · Jifei Song · Yiren Zhou · Anil Armagan · Slobodan Ilic · Aleš Leonardis · Nassir Navab · Benjamin Busam | N/A | Code |
| DeGPR: Deep Guided Posterior Regularization for Multi-Class Cell Detection and Counting | Aayush Kumar Tyagi · Chirag Mohapatra · Prasenjit Das · Govind Makharia · Lalita Mehra · Prathosh AP · Mausam | N/A | Code |
| Learning Action Changes by Measuring Verb-Adverb Textual Relationships | Davide Moltisanti · Frank Keller · Hakan Bilen · Laura Sevilla-Lara | N/A | Code |
| Interactive and Explainable Region-Guided Radiology Report Generation | Tim Tanida · Philip Müller · Georgios Kaissis · Daniel Rueckert | N/A | Code |
| Learning Neural Volumetric Representations of Dynamic Humans in Minutes | Chen Geng · Sida Peng · Zhen Xu · Hujun Bao · Xiaowei Zhou | N/A | Code |
| Boosting Low-Data Instance Segmentation by Unsupervised Pre-Training With Saliency Prompt | Hao Li · Dingwen Zhang · Nian Liu · Lechao Cheng · Yalun Dai · Chao Zhang · Xinggang Wang · Junwei Han | N/A | Code |
| Learning Rotation-Equivariant Features for Visual Correspondence | Jongmin Lee · Byungjin Kim · Seungwook Kim · Minsu Cho | N/A | Code |
| Co-Training 2L Submodels for Visual Recognition | Hugo Touvron · Matthieu Cord · Maxime Oquab · Piotr Bojanowski · Jakob Verbeek · Hervé Jégou | N/A | Code |
| HOTNAS: Hierarchical Optimal Transport for Neural Architecture Search | Jiechao Yang · Yong Liu · Hongteng Xu | N/A | Code |
| LANA: A Language-Capable Navigator for Instruction Following and Generation | Xiaohan Wang · Wenguan Wang · Jiayi Shao · Yi Yang | N/A | Code |
| Visual Localization Using Imperfect 3D Models From the Internet | Vojtech Panek · Zuzana Kukelova · Torsten Sattler | N/A | Code |
| Diversity-Measurable Anomaly Detection | Wenrui Liu · Hong Chang · Bingpeng Ma · Shiguang Shan · Xilin Chen | N/A | Code |
| SLACK: Stable Learning of Augmentations With Cold-Start and KL Regularization | Juliette Marrie · Michael Arbel · Diane Larlus · Julien Mairal | N/A | Code |
| Recurrent Vision Transformers for Object Detection With Event Cameras | Mathias Gehrig · Davide Scaramuzza | N/A | Code |
| Efficient Verification of Neural Networks Against LVM-Based Specifications | Harleen Hanspal · Alessio Lomuscio | N/A | Code |
| Neuralizer: General Neuroimage Analysis Without Re-Training | Steffen Czolbe · Adrian V. Dalca | N/A | Code |
| MobileVOS: Real-Time Video Object Segmentation Contrastive Learning Meets Knowledge Distillation | Roy Miles · Mehmet Kerim Yucel · Bruno Manganelli · Albert Saà-Garriga | N/A | Code |
| SCOTCH and SODA: A Transformer Video Shadow Detection Framework | Lihao Liu · Jean Prost · Lei Zhu · Nicolas Papadakis · Pietro Liò · Carola-Bibiane Schönlieb · Angelica I. Aviles-Rivero | N/A | Code |
| A Unified Spatial-Angular Structured Light for Single-View Acquisition of Shape and Reflectance | Xianmin Xu · Yuxin Lin · Haoyang Zhou · Chong Zeng · Yaxin Yu · Kun Zhou · Hongzhi Wu | N/A | Code |
| Bias in Pruned Vision Models: In-Depth Analysis and Countermeasures | Eugenia Iofinova · Alexandra Peste · Dan Alistarh | N/A | Code |
| InstructPix2Pix: Learning To Follow Image Editing Instructions | Tim Brooks · Aleksander Holynski · Alexei A. Efros | N/A | Code |
| AnchorFormer: Point Cloud Completion From Discriminative Nodes | Zhikai Chen · Fuchen Long · Zhaofan Qiu · Ting Yao · Wengang Zhou · Jiebo Luo · Tao Mei | N/A | Code |
| Robust Test-Time Adaptation in Dynamic Scenarios | Longhui Yuan · Binhui Xie · Shuang Li | N/A | Code |
| AShapeFormer: Semantics-Guided Object-Level Active Shape Encoding for 3D Object Detection via Transformers | Zechuan Li · Hongshan Yu · Zhengeng Yang · Tongjia Chen · Naveed Akhtar | N/A | Code |
| Neural Texture Synthesis With Guided Correspondence | Yang Zhou · Kaijian Chen · Rongjun Xiao · Hui Huang | N/A | Code |
| Learning To Render Novel Views From Wide-Baseline Stereo Pairs | Yilun Du · Cameron Smith · Ayush Tewari · Vincent Sitzmann | N/A | Code |
| Hidden Gems: 4D Radar Scene Flow Learning Using Cross-Modal Supervision | Fangqiang Ding · Andras Palffy · Dariu M. Gavrila · Chris Xiaoxuan Lu | N/A | Code |
| SmallCap: Lightweight Image Captioning Prompted With Retrieval Augmentation | Rita Ramos · Bruno Martins · Desmond Elliott · Yova Kementchedjhieva | N/A | Code |
| PIDNet: A Real-Time Semantic Segmentation Network Inspired by PID Controllers | Jiacong Xu · Zixiang Xiong · Shankar P. Bhattacharyya | N/A | Code |
| NeRFLight: Fast and Light Neural Radiance Fields Using a Shared Feature Grid | Fernando Rivas-Manzaneque · Jorge Sierra-Acosta · Adrian Penate-Sanchez · Francesc Moreno-Noguer · Angela Ribeiro | N/A | Code |
| Fantastic Breaks: A Dataset of Paired 3D Scans of Real-World Broken Objects and Their Complete Counterparts | Nikolas Lamb · Cameron Palmer · Benjamin Molloy · Sean Banerjee · Natasha Kholgade Banerjee | N/A | Code |
| PEAL: Prior-Embedded Explicit Attention Learning for Low-Overlap Point Cloud Registration | Junle Yu · Luwei Ren · Yu Zhang · Wenhui Zhou · Lili Lin · Guojun Dai | N/A | Code |
| Neural Volumetric Memory for Visual Locomotion Control | Ruihan Yang · Ge Yang · Xiaolong Wang | N/A | Code |
| InstantAvatar: Learning Avatars From Monocular Video in 60 Seconds | Tianjian Jiang · Xu Chen · Jie Song · Otmar Hilliges | N/A | Code |
| TMO: Textured Mesh Acquisition of Objects With a Mobile Device by Using Differentiable Rendering | Jaehoon Choi · Dongki Jung · Taejae Lee · Sangwook Kim · Youngdong Jung · Dinesh Manocha · Donghwan Lee | N/A | Code |
| MammalNet: A Large-Scale Video Benchmark for Mammal Recognition and Behavior Understanding | Jun Chen · Ming Hu · Darren J. Coker · Michael L. Berumen · Blair Costelloe · Sara Beery · Anna Rohrbach · Mohamed Elhoseiny | N/A | Code |
| Towards Fast Adaptation of Pretrained Contrastive Models for Multi-Channel Video-Language Retrieval | Xudong Lin · Simran Tiwari · Shiyuan Huang · Manling Li · Mike Zheng Shou · Heng Ji · Shih-Fu Chang | N/A | Code |
| Hierarchical Fine-Grained Image Forgery Detection and Localization | Xiao Guo · Xiaohong Liu · Zhiyuan Ren · Steven Grosz · Iacopo Masi · Xiaoming Liu | N/A | Code |
| SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision | Xubo Liu · Egor Lakomkin · Konstantinos Vougioukas · Pingchuan Ma · Honglie Chen · Ruiming Xie · Morrie Doulaty · Niko Moritz · Jachym Kolar · Stavros Petridis · Maja Pantic · Christian Fuegen | N/A | Code |
| RIATIG: Reliable and Imperceptible Adversarial Text-to-Image Generation With Natural Prompts | Han Liu · Yuhao Wu · Shixuan Zhai · Bo Yuan · Ning Zhang | N/A | Code |
| Unsupervised Intrinsic Image Decomposition With LiDAR Intensity | Shogo Sato · Yasuhiro Yao · Taiga Yoshida · Takuhiro Kaneko · Shingo Ando · Jun Shimamura | N/A | Code |
| SlowLiDAR: Increasing the Latency of LiDAR-Based Detection Using Adversarial Examples | Han Liu · Yuhao Wu · Zhiyuan Yu · Yevgeniy Vorobeychik · Ning Zhang | N/A | Code |
| NeFII: Inverse Rendering for Reflectance Decomposition With Near-Field Indirect Illumination | Haoqian Wu · Zhipeng Hu · Lincheng Li · Yongqiang Zhang · Changjie Fan · Xin Yu | N/A | Code |
| BEV-Guided Multi-Modality Fusion for Driving Perception | Yunze Man · Liang-Yan Gui · Yu-Xiong Wang | N/A | Code |
| MAGVLT: Masked Generative Vision-and-Language Transformer | Sungwoong Kim · Daejin Jo · Donghoon Lee · Jongmin Kim | N/A | Code |
| PEFAT: Boosting Semi-Supervised Medical Image Classification via Pseudo-Loss Estimation and Feature Adversarial Training | Qingjie Zeng · Yutong Xie · Zilin Lu · Yong Xia | N/A | Code |
| Visual Query Tuning: Towards Effective Usage of Intermediate Representations for Parameter and Memory Efficient Transfer Learning | Cheng-Hao Tu · Zheda Mai · Wei-Lun Chao | N/A | Code |
| Decoupling Learning and Remembering: A Bilevel Memory Framework With Knowledge Projection for Task-Incremental Learning | Wenju Sun · Qingyong Li · Jing Zhang · Wen Wang · Yangli-ao Geng | N/A | Code |
| PMR: Prototypical Modal Rebalance for Multimodal Learning | Yunfeng Fan · Wenchao Xu · Haozhao Wang · Junxiao Wang · Song Guo | N/A | Code |
| DART: Diversify-Aggregate-Repeat Training Improves Generalization of Neural Networks | Samyak Jain · Sravanti Addepalli · Pawan Kumar Sahu · Priyam Dey · R. Venkatesh Babu | N/A | Code |
| Abstract Visual Reasoning: An Algebraic Approach for Solving Raven’s Progressive Matrices | Jingyi Xu · Tushar Vaidya · Yufei Wu · Saket Chandra · Zhangsheng Lai · Kai Fong Ernest Chong | N/A | Code |
| Swept-Angle Synthetic Wavelength Interferometry | Alankar Kotwal · Anat Levin · Ioannis Gkioulekas | N/A | Code |
| Passive Micron-Scale Time-of-Flight With Sunlight Interferometry | Alankar Kotwal · Anat Levin · Ioannis Gkioulekas | N/A | Code |
| Meta-Learning With a Geometry-Adaptive Preconditioner | Suhyun Kang · Duhun Hwang · Moonjung Eo · Taesup Kim · Wonjong Rhee | N/A | Code |
| 3D GAN Inversion With Facial Symmetry Prior | Fei Yin · Yong Zhang · Xuan Wang · Tengfei Wang · Xiaoyu Li · Yuan Gong · Yanbo Fan · Xiaodong Cun · Ying Shan · Cengiz Oztireli · Yujiu Yang | N/A | Code |
| ZBS: Zero-Shot Background Subtraction via Instance-Level Background Modeling and Foreground Selection | Yongqi An · Xu Zhao · Tao Yu · Haiyun Guo · Chaoyang Zhao · Ming Tang · Jinqiao Wang | N/A | Code |
| Neural Lens Modeling | Wenqi Xian · Aljaž Božič · Noah Snavely · Christoph Lassner | N/A | Code |
| A Probabilistic Framework for Lifelong Test-Time Adaptation | Dhanajit Brahma · Piyush Rai | N/A | Code |
| Few-Shot Class-Incremental Learning via Class-Aware Bilateral Distillation | Linglan Zhao · Jing Lu · Yunlu Xu · Zhanzhan Cheng · Dashan Guo · Yi Niu · Xiangzhong Fang | N/A | Code |
| GradMA: A Gradient-Memory-Based Accelerated Federated Learning With Alleviated Catastrophic Forgetting | Kangyang Luo · Xiang Li · Yunshi Lan · Ming Gao | N/A | Code |
| Hyperspherical Embedding for Point Cloud Completion | Junming Zhang · Haomeng Zhang · Ram Vasudevan · Matthew Johnson-Roberson | N/A | Code |
| Local-Guided Global: Paired Similarity Representation for Visual Reinforcement Learning | Hyesong Choi · Hunsang Lee · Wonil Song · Sangryul Jeon · Kwanghoon Sohn · Dongbo Min | N/A | Code |
| Learning Orthogonal Prototypes for Generalized Few-Shot Semantic Segmentation | Sun-Ao Liu · Yiheng Zhang · Zhaofan Qiu · Hongtao Xie · Yongdong Zhang · Ting Yao | N/A | Code |
| DSFNet: Dual Space Fusion Network for Occlusion-Robust 3D Dense Face Alignment | Heyuan Li · Bo Wang · Yu Cheng · Mohan Kankanhalli · Robby T. Tan | N/A | Code |
| SViTT: Temporal Learning of Sparse Video-Text Transformers | Yi Li · Kyle Min · Subarna Tripathi · Nuno Vasconcelos | N/A | Code |
| Independent Component Alignment for Multi-Task Learning | Dmitry Senushkin · Nikolay Patakin · Arseny Kuznetsov · Anton Konushin | N/A | Code |
| Logical Implications for Visual Question Answering Consistency | Sergio Tascon-Morales · Pablo Márquez-Neila · Raphael Sznitman | N/A | Code |
| MaskCon: Masked Contrastive Learning for Coarse-Labelled Dataset | Chen Feng · Ioannis Patras | N/A | Code |
| Image as a Foreign Language: BEiT Pretraining for Vision and Vision-Language Tasks | Wenhui Wang · Hangbo Bao · Li Dong · Johan Bjorck · Zhiliang Peng · Qiang Liu · Kriti Aggarwal · Owais Khan Mohammed · Saksham Singhal · Subhojit Som · Furu Wei | N/A | Code |
| Manipulating Transfer Learning for Property Inference | Yulong Tian · Fnu Suya · Anshuman Suri · Fengyuan Xu · David Evans | N/A | Code |
| DualRefine: Self-Supervised Depth and Pose Estimation Through Iterative Epipolar Sampling and Refinement Toward Equilibrium | Antyanta Bangunharcana · Ahmed Magd · Kyung-Soo Kim | N/A | Code |
| Learning a 3D Morphable Face Reflectance Model From Low-Cost Data | Yuxuan Han · Zhibo Wang · Feng Xu | N/A | Code |
| Principles of Forgetting in Domain-Incremental Semantic Segmentation in Adverse Weather Conditions | Tobias Kalb · Jürgen Beyerer | N/A | Code |
| Diffusion Art or Digital Forgery? Investigating Data Replication in Diffusion Models | Gowthami Somepalli · Vasu Singla · Micah Goldblum · Jonas Geiping · Tom Goldstein | N/A | Code |
| Adaptive Data-Free Quantization | Biao Qian · Yang Wang · Richang Hong · Meng Wang | N/A | Code |
| Coreset Sampling From Open-Set for Fine-Grained Self-Supervised Learning | Sungnyun Kim · Sangmin Bae · Se-Young Yun | N/A | Code |
| Jedi: Entropy-Based Localization and Removal of Adversarial Patches | Bilel Tarchoun · Anouar Ben Khalifa · Mohamed Ali Mahjoub · Nael Abu-Ghazaleh · Ihsen Alouani | N/A | Code |
| Uncovering the Disentanglement Capability in Text-to-Image Diffusion Models | Qiucheng Wu · Yujian Liu · Handong Zhao · Ajinkya Kale · Trung Bui · Tong Yu · Zhe Lin · Yang Zhang · Shiyu Chang | N/A | Code |
| Semantic-Conditional Diffusion Networks for Image Captioning | Jianjie Luo · Yehao Li · Yingwei Pan · Ting Yao · Jianlin Feng · Hongyang Chao · Tao Mei | N/A | Code |
| Instance-Specific and Model-Adaptive Supervision for Semi-Supervised Semantic Segmentation | Zhen Zhao · Sifan Long · Jimin Pi · Jingdong Wang · Luping Zhou | N/A | Code |
| Improving Robustness of Semantic Segmentation to Motion-Blur Using Class-Centric Augmentation | Aakanksha Aakanksha · A. N. Rajagopalan | N/A | Code |
| MetaViewer: Towards a Unified Multi-View Representation | Ren Wang · Haoliang Sun · Yuling Ma · Xiaoming Xi · Yilong Yin | N/A | Code |
| Attribute-Preserving Face Dataset Anonymization via Latent Code Optimization | Simone Barattin · Christos Tzelepis · Ioannis Patras · Nicu Sebe | N/A | Code |
| A Light Weight Model for Active Speaker Detection | Junhua Liao · Haihan Duan · Kanghui Feng · Wanbing Zhao · Yanbing Yang · Liangyin Chen | N/A | Code |
| Shifted Diffusion for Text-to-Image Generation | Yufan Zhou · Bingchen Liu · Yizhe Zhu · Xiao Yang · Changyou Chen · Jinhui Xu | N/A | Code |
| Modular Memorability: Tiered Representations for Video Memorability Prediction | Théo Dumont · Juan Segundo Hevia · Camilo L. Fosco | N/A | Code |
| Learning Articulated Shape With Keypoint Pseudo-Labels From Web Images | Anastasis Stathopoulos · Georgios Pavlakos · Ligong Han · Dimitris N. Metaxas | N/A | Code |
| RMLVQA: A Margin Loss Approach for Visual Question Answering With Language Biases | Abhipsa Basu · Sravanti Addepalli · R. Venkatesh Babu | N/A | Code |
| RealImpact: A Dataset of Impact Sound Fields for Real Objects | Samuel Clarke · Ruohan Gao · Mason Wang · Mark Rau · Julia Xu · Jui-Hsien Wang · Doug L. James · Jiajun Wu | N/A | Code |
| Neural Rate Estimator and Unsupervised Learning for Efficient Distributed Image Analytics in Split-DNN Models | Nilesh Ahuja · Parual Datta · Bhavya Kanzariya · V. Srinivasa Somayazulu · Omesh Tickoo | N/A | Code |
| Improving Vision-and-Language Navigation by Generating Future-View Image Semantics | Jialu Li · Mohit Bansal | N/A | Code |
| Simulated Annealing in Early Layers Leads to Better Generalization | Amir M. Sarfi · Zahra Karimpour · Muawiz Chaudhary · Nasir M. Khalid · Mirco Ravanelli · Sudhir Mudur · Eugene Belilovsky | N/A | Code |
| From Images to Textual Prompts: Zero-Shot Visual Question Answering With Frozen Large Language Models | Jiaxian Guo · Junnan Li · Dongxu Li · Anthony Meng Huat Tiong · Boyang Li · Dacheng Tao · Steven Hoi | N/A | Code |
| Where We Are and What We’re Looking At: Query Based Worldwide Image Geo-Localization Using Hierarchies and Scenes | Brandon Clark · Alec Kerrigan · Parth Parag Kulkarni · Vicente Vivanco Cepeda · Mubarak Shah | N/A | Code |
| CLIP-Sculptor: Zero-Shot Generation of High-Fidelity and Diverse Shapes From Natural Language | Aditya Sanghi · Rao Fu · Vivian Liu · Karl D.D. Willis · Hooman Shayani · Amir H. Khasahmadi · Srinath Sridhar · Daniel Ritchie | N/A | Code |
| Learning To Generate Text-Grounded Mask for Open-World Semantic Segmentation From Only Image-Text Pairs | Junbum Cha · Jonghwan Mun · Byungseok Roh | N/A | Code |
| Imitation Learning As State Matching via Differentiable Physics | Siwei Chen · Xiao Ma · Zhongwen Xu | N/A | Code |
| BBDM: Image-to-Image Translation With Brownian Bridge Diffusion Models | Bo Li · Kaitao Xue · Bin Liu · Yu-Kun Lai | N/A | Code |
| CHMATCH: Contrastive Hierarchical Matching and Robust Adaptive Threshold Boosted Semi-Supervised Learning | Jianlong Wu · Haozhe Yang · Tian Gan · Ning Ding · Feijun Jiang · Liqiang Nie | N/A | Code |
| Re-GAN: Data-Efficient GANs Training via Architectural Reconfiguration | Divya Saxena · Jiannong Cao · Jiahao Xu · Tarun Kulshrestha | N/A | Code |
| Learning Debiased Representations via Conditional Attribute Interpolation | Yi-Kai Zhang · Qi-Wei Wang · De-Chuan Zhan · Han-Jia Ye | N/A | Code |
| Weakly Supervised Posture Mining for Fine-Grained Classification | Zhenchao Tang · Hualin Yang · Calvin Yu-Chian Chen | N/A | Code |
| Learning a Practical SDR-to-HDRTV Up-Conversion Using New Dataset and Degradation Models | Cheng Guo · Leidong Fan · Ziyu Xue · Xiuhua Jiang | N/A | Code |
| VectorFusion: Text-to-SVG by Abstracting Pixel-Based Diffusion Models | Ajay Jain · Amber Xie · Pieter Abbeel | N/A | Code |
| Adversarial Robustness via Random Projection Filters | Minjing Dong · Chang Xu | N/A | Code |
CVPR 2023
| Title | Author | PDF_Link | Code_URL |
|---|---|---|---|
| SDFusion: Multimodal 3D Shape Completion, Reconstruction, and Generation | Yen-Chi Cheng · Hsin-Ying Lee · Sergey Tulyakov · Alexander G. Schwing · Liang-Yan Gui | N/A | Code |
| Revisiting Temporal Modeling for CLIP-Based Image-to-Video Knowledge Transferring | Ruyang Liu · Jingjia Huang · Ge Li · Jiashi Feng · Xinglong Wu · Thomas H. Li | N/A | Code |
| Post-Processing Temporal Action Detection | Sauradip Nag · Xiatian Zhu · Yi-Zhe Song · Tao Xiang | N/A | Code |
| Learning Analytical Posterior Probability for Human Mesh Recovery | Qi Fang · Kang Chen · Yinghui Fan · Qing Shuai · Jiefeng Li · Weidong Zhang | N/A | Code |
| Accidental Light Probes | Hong-Xing Yu · Samir Agarwala · Charles Herrmann · Richard Szeliski · Noah Snavely · Jiajun Wu · Deqing Sun | N/A | Code |
| Multi-Object Manipulation via Object-Centric Neural Scattering Functions | Stephen Tian · Yancheng Cai · Hong-Xing Yu · Sergey Zakharov · Katherine Liu · Adrien Gaidon · Yunzhu Li · Jiajun Wu | N/A | Code |
| CFA: Class-Wise Calibrated Fair Adversarial Training | Zeming Wei · Yifei Wang · Yiwen Guo · Yisen Wang | N/A | Code |
| AutoAD: Movie Description in Context | Tengda Han · Max Bain · Arsha Nagrani · Gül Varol · Weidi Xie · Andrew Zisserman | N/A | Code |
| Relational Context Learning for Human-Object Interaction Detection | Sanghyun Kim · Deunsol Jung · Minsu Cho | N/A | Code |
| Alias-Free Convnets: Fractional Shift Invariance via Polynomial Activations | Hagay Michaeli · Tomer Michaeli · Daniel Soudry | N/A | Code |
| Learning Distortion Invariant Representation for Image Restoration From a Causality Perspective | Xin Li · Bingchen Li · Xin Jin · Cuiling Lan · Zhibo Chen | N/A | Code |
| Iterative Vision-and-Language Navigation | Jacob Krantz · Shurjo Banerjee · Wang Zhu · Jason Corso · Peter Anderson · Stefan Lee · Jesse Thomason | N/A | Code |
| FlatFormer: Flattened Window Attention for Efficient Point Cloud Transformer | Zhijian Liu · Xinyu Yang · Haotian Tang · Shang Yang · Song Han | N/A | Code |
| BUFFER: Balancing Accuracy, Efficiency, and Generalizability in Point Cloud Registration | Sheng Ao · Qingyong Hu · Hanyun Wang · Kai Xu · Yulan Guo | N/A | Code |
| Learning Event Guided High Dynamic Range Video Reconstruction | Yixin Yang · Jin Han · Jinxiu Liang · Imari Sato · Boxin Shi | N/A | Code |
| 3D Line Mapping Revisited | Shaohui Liu · Yifan Yu · Rémi Pautrat · Marc Pollefeys · Viktor Larsson | N/A | Code |
| High-Fidelity Event-Radiance Recovery via Transient Event Frequency | Jin Han · Yuta Asano · Boxin Shi · Yinqiang Zheng · Imari Sato | N/A | Code |
| OCELOT: Overlapped Cell on Tissue Dataset for Histopathology | Jeongun Ryu · Aaron Valero Puche · JaeWoong Shin · Seonwook Park · Biagio Brattoli · Jinhee Lee · Wonkyung Jung · Soo Ick Cho · Kyunghyun Paeng · Chan-Young Ock · Donggeun Yoo · Sérgio Pereira | N/A | Code |
| Blur Interpolation Transformer for Real-World Motion From Blur | Zhihang Zhong · Mingdeng Cao · Xiang Ji · Yinqiang Zheng · Imari Sato | N/A | Code |
| Continuous Intermediate Token Learning With Implicit Motion Manifold for Keyframe Based Motion Interpolation | Clinton A. Mo · Kun Hu · Chengjiang Long · Zhiyong Wang | N/A | Code |
| Instant-NVR: Instant Neural Volumetric Rendering for Human-Object Interactions From Monocular RGBD Stream | Yuheng Jiang · Kaixin Yao · Zhuo Su · Zhehao Shen · Haimin Luo · Lan Xu | N/A | Code |
| HexPlane: A Fast Representation for Dynamic Scenes | Ang Cao · Justin Johnson | N/A | Code |
| Finetune Like You Pretrain: Improved Finetuning of Zero-Shot Vision Models | Sachin Goyal · Ananya Kumar · Sankalp Garg · Zico Kolter · Aditi Raghunathan | N/A | Code |
| A Whac-a-Mole Dilemma: Shortcuts Come in Multiples Where Mitigating One Amplifies Others | Zhiheng Li · Ivan Evtimov · Albert Gordo · Caner Hazirbas · Tal Hassner · Cristian Canton Ferrer · Chenliang Xu · Mark Ibrahim | N/A | Code |
| GIVL: Improving Geographical Inclusivity of Vision-Language Models With Pre-Training Methods | Da Yin · Feng Gao · Govind Thattai · Michael Johnston · Kai-Wei Chang | N/A | Code |
| Devil’s on the Edges: Selective Quad Attention for Scene Graph Generation | Deunsol Jung · Sanghyun Kim · Won Hwa Kim · Minsu Cho | N/A | Code |
| GeoMVSNet: Learning Multi-View Stereo With Geometry Perception | Zhe Zhang · Rui Peng · Yuxi Hu · Ronggang Wang | N/A | Code |
| CR-FIQA: Face Image Quality Assessment by Learning Sample Relative Classifiability | Fadi Boutros · Meiling Fang · Marcel Klemt · Biying Fu · Naser Damer | N/A | Code |
| NeuFace: Realistic 3D Neural Face Rendering From Multi-View Images | Mingwu Zheng · Haiyu Zhang · Hongyu Yang · Di Huang | N/A | Code |
| MethaneMapper: Spectral Absorption Aware Hyperspectral Transformer for Methane Detection | Satish Kumar · Ivan Arevalo · ASM Iftekhar · B S Manjunath | N/A | Code |
| Re-Thinking Model Inversion Attacks Against Deep Neural Networks | Ngoc-Bao Nguyen · Keshigeyan Chandrasegaran · Milad Abdollahzadeh · Ngai-Man Cheung | N/A | Code |
| SAP-DETR: Bridging the Gap Between Salient Points and Queries-Based Transformer Detector for Fast Model Convergency | Yang Liu · Yao Zhang · Yixin Wang · Yang Zhang · Jiang Tian · Zhongchao Shi · Jianping Fan · Zhiqiang He | N/A | Code |
| VectorFloorSeg: Two-Stream Graph Attention Network for Vectorized Roughcast Floorplan Segmentation | Bingchen Yang · Haiyong Jiang · Hao Pan · Jun Xiao | N/A | Code |
| MARLIN: Masked Autoencoder for Facial Video Representation LearnINg | Zhixi Cai · Shreya Ghosh · Kalin Stefanov · Abhinav Dhall · Jianfei Cai · Hamid Rezatofighi · Reza Haffari · Munawar Hayat | N/A | Code |
| KD-DLGAN: Data Limited Image Generation via Knowledge Distillation | Kaiwen Cui · Yingchen Yu · Fangneng Zhan · Shengcai Liao · Shijian Lu · Eric P. Xing | N/A | Code |
| Hierarchical Neural Memory Network for Low Latency Event Processing | Ryuhei Hamaguchi · Yasutaka Furukawa · Masaki Onishi · Ken Sakurada | N/A | Code |
| Optimal Transport Minimization: Crowd Localization on Density Maps for Semi-Supervised Counting | Wei Lin · Antoni B. Chan | N/A | Code |
| Towards All-in-One Pre-Training via Maximizing Multi-Modal Mutual Information | Weijie Su · Xizhou Zhu · Chenxin Tao · Lewei Lu · Bin Li · Gao Huang · Yu Qiao · Xiaogang Wang · Jie Zhou · Jifeng Dai | N/A | Code |
| Revisiting Reverse Distillation for Anomaly Detection | Tran Dinh Tien · Anh Tuan Nguyen · Nguyen Hoang Tran · Ta Duc Huy · Soan T.M. Duong · Chanh D. Tr. Nguyen · Steven Q. H. Truong | N/A | Code |
| Conditional Generation of Audio From Video via Foley Analogies | Yuexi Du · Ziyang Chen · Justin Salamon · Bryan Russell · Andrew Owens | N/A | Code |
| Parameter Efficient Local Implicit Image Function Network for Face Segmentation | Mausoom Sarkar · Nikitha SR · Mayur Hemani · Rishabh Jain · Balaji Krishnamurthy | N/A | Code |
| Learning Decorrelated Representations Efficiently Using Fast Fourier Transform | Yutaro Shigeto · Masashi Shimbo · Yuya Yoshikawa · Akikazu Takeuchi | N/A | Code |
| FaceLit: Neural 3D Relightable Faces | Anurag Ranjan · Kwang Moo Yi · Jen-Hao Rick Chang · Oncel Tuzel | N/A | Code |
| Pointersect: Neural Rendering With Cloud-Ray Intersection | Jen-Hao Rick Chang · Wei-Yu Chen · Anurag Ranjan · Kwang Moo Yi · Oncel Tuzel | N/A | Code |
| High-Fidelity Clothed Avatar Reconstruction From a Single Image | Tingting Liao · Xiaomei Zhang · Yuliang Xiu · Hongwei Yi · Xudong Liu · Guo-Jun Qi · Yong Zhang · Xuan Wang · Xiangyu Zhu · Zhen Lei | N/A | Code |
| BAD-NeRF: Bundle Adjusted Deblur Neural Radiance Fields | Peng Wang · Lingzhe Zhao · Ruijie Ma · Peidong Liu | N/A | Code |
| Meta-Tuning Loss Functions and Data Augmentation for Few-Shot Object Detection | Berkan Demirel · Orhun Buğra Baran · Ramazan Gokberk Cinbis | N/A | Code |
| StyleRF: Zero-Shot 3D Style Transfer of Neural Radiance Fields | Kunhao Liu · Fangneng Zhan · Yiwen Chen · Jiahui Zhang · Yingchen Yu · Abdulmotaleb El Saddik · Shijian Lu · Eric P. Xing | N/A | Code |
| DeepSolo: Let Transformer Decoder With Explicit Points Solo for Text Spotting | Maoyuan Ye · Jing Zhang · Shanshan Zhao · Juhua Liu · Tongliang Liu · Bo Du · Dacheng Tao | N/A | Code |
| Local Implicit Normalizing Flow for Arbitrary-Scale Image Super-Resolution | Jie-En Yao · Li-Yuan Tsao · Yi-Chen Lo · Roy Tseng · Chia-Che Chang · Chun-Yi Lee | N/A | Code |
| LAVENDER: Unifying Video-Language Understanding As Masked Language Modeling | Linjie Li · Zhe Gan · Kevin Lin · Chung-Ching Lin · Zicheng Liu · Ce Liu · Lijuan Wang | N/A | Code |
| Cascaded Local Implicit Transformer for Arbitrary-Scale Super-Resolution | Hao-Wei Chen · Yu-Syuan Xu · Min-Fong Hong · Yi-Min Tsai · Hsien-Kai Kuo · Chun-Yi Lee | N/A | Code |
| Fair Federated Medical Image Segmentation via Client Contribution Estimation | Meirui Jiang · Holger R. Roth · Wenqi Li · Dong Yang · Can Zhao · Vishwesh Nath · Daguang Xu · Qi Dou · Ziyue Xu | N/A | Code |
| An Empirical Study of End-to-End Video-Language Transformers With Masked Visual Modeling | Tsu-Jui Fu · Linjie Li · Zhe Gan · Kevin Lin · William Yang Wang · Lijuan Wang · Zicheng Liu | N/A | Code |
| ReCo: Region-Controlled Text-to-Image Generation | Zhengyuan Yang · Jianfeng Wang · Zhe Gan · Linjie Li · Kevin Lin · Chenfei Wu · Nan Duan · Zicheng Liu · Ce Liu · Michael Zeng · Lijuan Wang | N/A | Code |
| Uncertainty-Aware Vision-Based Metric Cross-View Geolocalization | Florian Fervers · Sebastian Bullinger · Christoph Bodensteiner · Michael Arens · Rainer Stiefelhagen | N/A | Code |
| LayoutDiffusion: Controllable Diffusion Model for Layout-to-Image Generation | Guangcong Zheng · Xianpan Zhou · Xuewei Li · Zhongang Qi · Ying Shan · Xi Li | N/A | Code |
| Efficient Loss Function by Minimizing the Detrimental Effect of Floating-Point Errors on Gradient-Based Attacks | Yunrui Yu · Cheng-Zhong Xu | N/A | Code |
| NIKI: Neural Inverse Kinematics With Invertible Neural Networks for 3D Human Pose and Shape Estimation | Jiefeng Li · Siyuan Bian · Qi Liu · Jiasheng Tang · Fan Wang · Cewu Lu | N/A | Code |
| 3D Spatial Multimodal Knowledge Accumulation for Scene Graph Prediction in Point Cloud | Mingtao Feng · Haoran Hou · Liang Zhang · Zijie Wu · Yulan Guo · Ajmal Mian | N/A | Code |
| Egocentric Auditory Attention Localization in Conversations | Fiona Ryan · Hao Jiang · Abhinav Shukla · James M. Rehg · Vamsi Krishna Ithapu | N/A | Code |
| EFEM: Equivariant Neural Field Expectation Maximization for 3D Object Segmentation Without Scene Supervision | Jiahui Lei · Congyue Deng · Karl Schmeckpeper · Leonidas Guibas · Kostas Daniilidis | N/A | Code |
| Divide and Conquer: Answering Questions With Object Factorization and Compositional Reasoning | Shi Chen · Qi Zhao | N/A | Code |
| Text-Visual Prompting for Efficient 2D Temporal Video Grounding | Yimeng Zhang · Xin Chen · Jinghan Jia · Sijia Liu · Ke Ding | N/A | Code |
| Fusing Pre-Trained Language Models With Multimodal Prompts Through Reinforcement Learning | Youngjae Yu · Jiwan Chung · Heeseung Yun · Jack Hessel · Jae Sung Park · Ximing Lu · Rowan Zellers · Prithviraj Ammanabrolu · Ronan Le Bras · Gunhee Kim · Yejin Choi | N/A | Code |
| UniHCP: A Unified Model for Human-Centric Perceptions | Yuanzheng Ci · Yizhou Wang · Meilin Chen · Shixiang Tang · Lei Bai · Feng Zhu · Rui Zhao · Fengwei Yu · Donglian Qi · Wanli Ouyang | N/A | Code |
| VoP: Text-Video Co-Operative Prompt Tuning for Cross-Modal Retrieval | Siteng Huang · Biao Gong · Yulin Pan · Jianwen Jiang · Yiliang Lv · Yuyuan Li · Donglin Wang | N/A | Code |
| PointConvFormer: Revenge of the Point-Based Convolution | Wenxuan Wu · Li Fuxin · Qi Shan | N/A | Code |
| BAAM: Monocular 3D Pose and Shape Reconstruction With Bi-Contextual Attention Module and Attention-Guided Modeling | Hyo-Jun Lee · Hanul Kim · Su-Min Choi · Seong-Gyun Jeong · Yeong Jun Koh | N/A | Code |
| HumanBench: Towards General Human-Centric Perception With Projector Assisted Pretraining | Shixiang Tang · Cheng Chen · Qingsong Xie · Meilin Chen · Yizhou Wang · Yuanzheng Ci · Lei Bai · Feng Zhu · Haiyang Yang · Li Yi · Rui Zhao · Wanli Ouyang | N/A | Code |
| Local Connectivity-Based Density Estimation for Face Clustering | Junho Shin · Hyo-Jun Lee · Hyunseop Kim · Jong-Hyeon Baek · Daehyun Kim · Yeong Jun Koh | N/A | Code |
| DistilPose: Tokenized Pose Regression With Heatmap Distillation | Suhang Ye · Yingyi Zhang · Jie Hu · Liujuan Cao · Shengchuan Zhang · Lei Shen · Jun Wang · Shouhong Ding · Rongrong Ji | N/A | Code |
| Beyond Appearance: A Semantic Controllable Self-Supervised Learning Framework for Human-Centric Visual Tasks | Weihua Chen · Xianzhe Xu · Jian Jia · Hao Luo · Yaohua Wang · Fan Wang · Rong Jin · Xiuyu Sun | N/A | Code |
| ViPLO: Vision Transformer Based Pose-Conditioned Self-Loop Graph for Human-Object Interaction Detection | Jeeseung Park · Jin-Woo Park · Jong-Seok Lee | N/A | Code |
| EVA: Exploring the Limits of Masked Visual Representation Learning at Scale | Yuxin Fang · Wen Wang · Binhui Xie · Quan Sun · Ledell Wu · Xinggang Wang · Tiejun Huang · Xinlong Wang · Yue Cao | N/A | Code |
| I2-SDF: Intrinsic Indoor Scene Reconstruction and Editing via Raytracing in Neural SDFs | Jingsen Zhu · Yuchi Huo · Qi Ye · Fujun Luan · Jifan Li · Dianbing Xi · Lisha Wang · Rui Tang · Wei Hua · Hujun Bao · Rui Wang | N/A | Code |
| DrapeNet: Garment Generation and Self-Supervised Draping | Luca De Luigi · Ren Li · Benoît Guillard · Mathieu Salzmann · Pascal Fua | N/A | Code |
| STMixer: A One-Stage Sparse Action Detector | Tao Wu · Mengqi Cao · Ziteng Gao · Gangshan Wu · Limin Wang | N/A | Code |
| Inverse Rendering of Translucent Objects Using Physical and Neural Renderers | Chenhao Li · Trung Thanh Ngo · Hajime Nagahara | N/A | Code |
| Humans As Light Bulbs: 3D Human Reconstruction From Thermal Reflection | Ruoshi Liu · Carl Vondrick | N/A | Code |
| CF-Font: Content Fusion for Few-Shot Font Generation | Chi Wang · Min Zhou · Tiezheng Ge · Yuning Jiang · Hujun Bao · Weiwei Xu | N/A | Code |
| GLeaD: Improving GANs With a Generator-Leading Task | Qingyan Bai · Ceyuan Yang · Yinghao Xu · Xihui Liu · Yujiu Yang · Yujun Shen | N/A | Code |
| StarCraftImage: A Dataset for Prototyping Spatial Reasoning Methods for Multi-Agent Environments | Sean Kulinski · Nicholas R. Waytowich · James Z. Hare · David I. Inouye | N/A | Code |
| WIRE: Wavelet Implicit Neural Representations | Vishwanath Saragadam · Daniel LeJeune · Jasper Tan · Guha Balakrishnan · Ashok Veeraraghavan · Richard G. Baraniuk | N/A | Code |
| Thermal Spread Functions (TSF): Physics-Guided Material Classification | Aniket Dashpute · Vishwanath Saragadam · Emma Alexander · Florian Willomitzer · Aggelos Katsaggelos · Ashok Veeraraghavan · Oliver Cossairt | N/A | Code |
| Improving Zero-Shot Generalization and Robustness of Multi-Modal Models | Yunhao Ge · Jie Ren · Andrew Gallagher · Yuxiao Wang · Ming-Hsuan Yang · Hartwig Adam · Laurent Itti · Balaji Lakshminarayanan · Jiaping Zhao | N/A | Code |
| The Differentiable Lens: Compound Lens Search Over Glass Surfaces and Materials for Object Detection | Geoffroi Côté · Fahim Mannan · Simon Thibault · Jean-François Lalonde · Felix Heide | N/A | Code |
| Federated Domain Generalization With Generalization Adjustment | Ruipeng Zhang · Qinwei Xu · Jiangchao Yao · Ya Zhang · Qi Tian · Yanfeng Wang | N/A | Code |
| Propagate and Calibrate: Real-Time Passive Non-Line-of-Sight Tracking | Yihao Wang · Zhigang Wang · Bin Zhao · Dong Wang · Mulin Chen · Xuelong Li | N/A | Code |
| Fine-Grained Image-Text Matching by Cross-Modal Hard Aligning Network | Zhengxin Pan · Fangyu Wu · Bailing Zhang | N/A | Code |
| On the Benefits of 3D Pose and Tracking for Human Action Recognition | Jathushan Rajasegaran · Georgios Pavlakos · Angjoo Kanazawa · Christoph Feichtenhofer · Jitendra Malik | N/A | Code |
| Visual DNA: Representing and Comparing Images Using Distributions of Neuron Activations | Benjamin Ramtoula · Matthew Gadd · Paul Newman · Daniele De Martini | N/A | Code |
| Fine-Tuned CLIP Models Are Efficient Video Learners | Hanoona Rasheed · Muhammad Uzair Khattak · Muhammad Maaz · Salman Khan · Fahad Shahbaz Khan | N/A | Code |
| Connecting Vision and Language With Video Localized Narratives | Paul Voigtlaender · Soravit Changpinyo · Jordi Pont-Tuset · Radu Soricut · Vittorio Ferrari | N/A | Code |
| K-Planes: Explicit Radiance Fields in Space, Time, and Appearance | Sara Fridovich-Keil · Giacomo Meanti · Frederik Rahbæk Warburg · Benjamin Recht · Angjoo Kanazawa | N/A | Code |
| Virtual Occlusions Through Implicit Depth | Jamie Watson · Mohamed Sayed · Zawar Qureshi · Gabriel J. Brostow · Sara Vicente · Oisin Mac Aodha · Michael Firman | N/A | Code |
| Common Pets in 3D: Dynamic New-View Synthesis of Real-Life Deformable Categories | Samarth Sinha · Roman Shapovalov · Jeremy Reizenstein · Ignacio Rocco · Natalia Neverova · Andrea Vedaldi · David Novotny | N/A | Code |
| LG-BPN: Local and Global Blind-Patch Network for Self-Supervised Real-World Denoising | Zichun Wang · Ying Fu · Ji Liu · Yulun Zhang | N/A | Code |
| One-Shot High-Fidelity Talking-Head Synthesis With Deformable Neural Radiance Field | Weichuang Li · Longhao Zhang · Dong Wang · Bin Zhao · Zhigang Wang · Mulin Chen · Bang Zhang · Zhongjian Wang · Liefeng Bo · Xuelong Li | N/A | Code |
| Collaborative Diffusion for Multi-Modal Face Generation and Editing | Ziqi Huang · Kelvin C.K. Chan · Yuming Jiang · Ziwei Liu | N/A | Code |
| Blind Video Deflickering by Neural Filtering With a Flawed Atlas | Chenyang Lei · Xuanchi Ren · Zhaoxiang Zhang · Qifeng Chen | N/A | Code |
| RefTeacher: A Strong Baseline for Semi-Supervised Referring Expression Comprehension | Jiamu Sun · Gen Luo · Yiyi Zhou · Xiaoshuai Sun · Guannan Jiang · Zhiyu Wang · Rongrong Ji | N/A | Code |
| HNeRV: A Hybrid Neural Representation for Videos | Hao Chen · Matthew Gwilliam · Ser-Nam Lim · Abhinav Shrivastava | N/A | Code |
| Learning 3D-Aware Image Synthesis With Unknown Pose Distribution | Zifan Shi · Yujun Shen · Yinghao Xu · Sida Peng · Yiyi Liao · Sheng Guo · Qifeng Chen · Dit-Yan Yeung | N/A | Code |
| DynaFed: Tackling Client Data Heterogeneity With Global Dynamics | Renjie Pi · Weizhong Zhang · Yueqi Xie · Jiahui Gao · Xiaoyu Wang · Sunghun Kim · Qifeng Chen | N/A | Code |
| Enlarging Instance-Specific and Class-Specific Information for Open-Set Action Recognition | Jun Cen · Shiwei Zhang · Xiang Wang · Yixuan Pei · Zhiwu Qing · Yingya Zhang · Qifeng Chen | N/A | Code |
| RODIN: A Generative Model for Sculpting 3D Digital Avatars Using Diffusion | Tengfei Wang · Bo Zhang · Ting Zhang · Shuyang Gu · Jianmin Bao · Tadas Baltrusaitis · Jingjing Shen · Dong Chen · Fang Wen · Qifeng Chen · Baining Guo | N/A | Code |
| IFSeg: Image-Free Semantic Segmentation via Vision-Language Model | Sukmin Yun · Seong Hyeon Park · Paul Hongsuck Seo · Jinwoo Shin | N/A | Code |
| Detecting Everything in the Open World: Towards Universal Object Detection | Zhenyu Wang · Yali Li · Xi Chen · Ser-Nam Lim · Antonio Torralba · Hengshuang Zhao · Shengjin Wang | N/A | Code |
| Improving Visual Grounding by Encouraging Consistent Gradient-Based Explanations | Ziyan Yang · Kushal Kafle · Franck Dernoncourt · Vicente Ordonez | N/A | Code |
| Temporally Consistent Online Depth Estimation Using Point-Based Fusion | Numair Khan · Eric Penner · Douglas Lanman · Lei Xiao | N/A | Code |
| NeuralDome: A Neural Modeling Pipeline on Multi-View Human-Object Interactions | Juze Zhang · Haimin Luo · Hongdi Yang · Xinru Xu · Qianyang Wu · Ye Shi · Jingyi Yu · Lan Xu · Jingya Wang | N/A | Code |
| Token Turing Machines | Michael S. Ryoo · Keerthana Gopalakrishnan · Kumara Kahatapitiya · Ted Xiao · Kanishka Rao · Austin Stone · Yao Lu · Julian Ibarz · Anurag Arnab | N/A | Code |
| Computationally Budgeted Continual Learning: What Does Matter? | Ameya Prabhu · Hasan Abed Al Kader Hammoud · Puneet K. Dokania · Philip H.S. Torr · Ser-Nam Lim · Bernard Ghanem · Adel Bibi | N/A | Code |
| CLIP2Protect: Protecting Facial Privacy Using Text-Guided Makeup via Adversarial Latent Search | Fahad Shamshad · Muzammal Naseer · Karthik Nandakumar | N/A | Code |
| Robot Structure Prior Guided Temporal Attention for Camera-to-Robot Pose Estimation From Image Sequence | Yang Tian · Jiyao Zhang · Zekai Yin · Hao Dong | N/A | Code |
| Affordances From Human Videos as a Versatile Representation for Robotics | Shikhar Bahl · Russell Mendonca · Lili Chen · Unnat Jain · Deepak Pathak | N/A | Code |
| MIANet: Aggregating Unbiased Instance and General Information for Few-Shot Semantic Segmentation | Yong Yang · Qiong Chen · Yuan Feng · Tianlin Huang | N/A | Code |
| Learning To Generate Image Embeddings With User-Level Differential Privacy | Zheng Xu · Maxwell Collins · Yuxiao Wang · Liviu Panait · Sewoong Oh · Sean Augenstein · Ting Liu · Florian Schroff · H. Brendan McMahan | N/A | Code |
| Genie: Show Me the Data for Quantization | Yongkweon Jeon · Chungman Lee · Ho-young Kim | N/A | Code |
| DSVT: Dynamic Sparse Voxel Transformer With Rotated Sets | Haiyang Wang · Chen Shi · Shaoshuai Shi · Meng Lei · Sen Wang · Di He · Bernt Schiele · Liwei Wang | N/A | Code |
| Transformer-Based Learned Optimization | Erik Gärtner · Luke Metz · Mykhaylo Andriluka · C. Daniel Freeman · Cristian Sminchisescu | N/A | Code |
| Zero-Shot Noise2Noise: Efficient Image Denoising Without Any Data | Youssef Mansour · Reinhard Heckel | N/A | Code |
| Super-Resolution Neural Operator | Min Wei · Xuesong Zhang | N/A | Code |
| StyleIPSB: Identity-Preserving Semantic Basis of StyleGAN for High Fidelity Face Swapping | Diqiong Jiang · Dan Song · Ruofeng Tong · Min Tang | N/A | Code |
| Self-Supervised Blind Motion Deblurring With Deep Expectation Maximization | Ji Li · Weixi Wang · Yuesong Nan · Hui Ji | N/A | Code |
| Confidence-Aware Personalized Federated Learning via Variational Expectation Maximization | Junyi Zhu · Xingchen Ma · Matthew B. Blaschko | N/A | Code |
| Human Pose As Compositional Tokens | Zigang Geng · Chunyu Wang · Yixuan Wei · Ze Liu · Houqiang Li · Han Hu | N/A | Code |
| GeoMAE: Masked Geometric Target Prediction for Self-Supervised Point Cloud Pre-Training | Xiaoyu Tian · Haoxi Ran · Yue Wang · Hang Zhao | N/A | Code |
| RUST: Latent Neural Scene Representations From Unposed Imagery | Mehdi S. M. Sajjadi · Aravindh Mahendran · Thomas Kipf · Etienne Pot · Daniel Duckworth · Mario Lučić · Klaus Greff | N/A | Code |
| Bias Mimicking: A Simple Sampling Approach for Bias Mitigation | Maan Qraitem · Kate Saenko · Bryan A. Plummer | N/A | Code |
| V2X-Seq: A Large-Scale Sequential Dataset for Vehicle-Infrastructure Cooperative Perception and Forecasting | Haibao Yu · Wenxian Yang · Hongzhi Ruan · Zhenwei Yang · Yingjuan Tang · Xu Gao · Xin Hao · Yifeng Shi · Yifeng Pan · Ning Sun · Juan Song · Jirui Yuan · Ping Luo · Zaiqing Nie | N/A | Code |
| Conditional Image-to-Video Generation With Latent Flow Diffusion Models | Haomiao Ni · Changhao Shi · Kai Li · Sharon X. Huang · Martin Renqiang Min | N/A | Code |
| Anchor3DLane: Learning To Regress 3D Anchors for Monocular 3D Lane Detection | Shaofei Huang · Zhenwei Shen · Zehao Huang · Zi-han Ding · Jiao Dai · Jizhong Han · Naiyan Wang · Si Liu | N/A | Code |
| 3D Semantic Segmentation in the Wild: Learning Generalized Models for Adverse-Condition Point Clouds | Aoran Xiao · Jiaxing Huang · Weihao Xuan · Ruijie Ren · Kangcheng Liu · Dayan Guan · Abdulmotaleb El Saddik · Shijian Lu · Eric P. Xing | N/A | Code |
| NeMo: Learning 3D Neural Motion Fields From Multiple Video Instances of the Same Action | Kuan-Chieh Wang · Zhenzhen Weng · Maria Xenochristou · João Pedro Araújo · Jeffrey Gu · Karen Liu · Serena Yeung | N/A | Code |
| Decomposed Soft Prompt Guided Fusion Enhancing for Compositional Zero-Shot Learning | Xiaocheng Lu · Song Guo · Ziming Liu · Jingcai Guo | N/A | Code |
| iDisc: Internal Discretization for Monocular Depth Estimation | Luigi Piccinelli · Christos Sakaridis · Fisher Yu | N/A | Code |
| UniDexGrasp: Universal Robotic Dexterous Grasping via Learning Diverse Proposal Generation and Goal-Conditioned Policy | Yinzhen Xu · Weikang Wan · Jialiang Zhang · Haoran Liu · Zikang Shan · Hao Shen · Ruicheng Wang · Haoran Geng · Yijia Weng · Jiayi Chen · Tengyu Liu · Li Yi · He Wang | N/A | Code |
| PolyFormer: Referring Image Segmentation As Sequential Polygon Generation | Jiang Liu · Hui Ding · Zhaowei Cai · Yuting Zhang · Ravi Kumar Satzoda · Vijay Mahadevan · R. Manmatha | N/A | Code |
| Interactive Segmentation of Radiance Fields | Rahul Goel · Dhawal Sirikonda · Saurabh Saini · P. J. Narayanan | N/A | Code |
| PointCert: Point Cloud Classification With Deterministic Certified Robustness Guarantees | Jinghuai Zhang · Jinyuan Jia · Hongbin Liu · Neil Zhenqiang Gong | N/A | Code |
| Indiscernible Object Counting in Underwater Scenes | Guolei Sun · Zhaochong An · Yun Liu · Ce Liu · Christos Sakaridis · Deng-Ping Fan · Luc Van Gool | N/A | Code |
| Improving Robustness of Vision Transformers by Reducing Sensitivity To Patch Corruptions | Yong Guo · David Stutz · Bernt Schiele | N/A | Code |
| Real-Time Multi-Person Eyeblink Detection in the Wild for Untrimmed Video | Wenzheng Zeng · Yang Xiao · Sicheng Wei · Jinfang Gan · Xintao Zhang · Zhiguo Cao · Zhiwen Fang · Joey Tianyi Zhou | N/A | Code |
| BEV-LaneDet: An Efficient 3D Lane Detection Based on Virtual Camera via Key-Points | Ruihao Wang · Jian Qin · Kaiying Li · Yaochen Li · Dong Cao · Jintao Xu | N/A | Code |
| Infinite Photorealistic Worlds Using Procedural Generation | Alexander Raistrick · Lahav Lipson · Zeyu Ma · Lingjie Mei · Mingzhe Wang · Yiming Zuo · Karhan Kayan · Hongyu Wen · Beining Han · Yihan Wang · Alejandro Newell · Hei Law · Ankit Goyal · Kaiyu Yang · Jia Deng | N/A | Code |
| High-Fidelity 3D Human Digitization From Single 2K Resolution Images | Sang-Hun Han · Min-Gyu Park · Ju Hong Yoon · Ju-Mi Kang · Young-Jae Park · Hae-Gon Jeon | N/A | Code |
| GALIP: Generative Adversarial CLIPs for Text-to-Image Synthesis | Ming Tao · Bing-Kun Bao · Hao Tang · Changsheng Xu | N/A | Code |
| Language-Guided Audio-Visual Source Separation via Trimodal Consistency | Reuben Tan · Arijit Ray · Andrea Burns · Bryan A. Plummer · Justin Salamon · Oriol Nieto · Bryan Russell · Kate Saenko | N/A | Code |
| Probabilistic Debiasing of Scene Graphs | Bashirul Azam Biswas · Qiang Ji | N/A | Code |
| PVO: Panoptic Visual Odometry | Weicai Ye · Xinyue Lan · Shuo Chen · Yuhang Ming · Xingyuan Yu · Hujun Bao · Zhaopeng Cui · Guofeng Zhang | N/A | Code |
| Superclass Learning With Representation Enhancement | Zeyu Gan · Suyun Zhao · Jinlong Kang · Liyuan Shang · Hong Chen · Cuiping Li | N/A | Code |
| GAPartNet: Cross-Category Domain-Generalizable Object Perception and Manipulation via Generalizable and Actionable Parts | Haoran Geng · Helin Xu · Chengyang Zhao · Chao Xu · Li Yi · Siyuan Huang · He Wang | N/A | Code |
| Learning the Distribution of Errors in Stereo Matching for Joint Disparity and Uncertainty Estimation | Liyan Chen · Weihan Wang · Philippos Mordohai | N/A | Code |
| Efficient View Synthesis and 3D-Based Multi-Frame Denoising With Multiplane Feature Representations | Thomas Tanay · Aleš Leonardis · Matteo Maggioni | N/A | Code |
| Large-Capacity and Flexible Video Steganography via Invertible Neural Network | Chong Mou · Youmin Xu · Jiechong Song · Chen Zhao · Bernard Ghanem · Jian Zhang | N/A | Code |
| Generating Part-Aware Editable 3D Shapes Without 3D Supervision | Konstantinos Tertikas · Despoina Paschalidou · Boxiao Pan · Jeong Joon Park · Mikaela Angelina Uy · Ioannis Emiris · Yannis Avrithis · Leonidas Guibas | N/A | Code |
| Vision Transformer With Super Token Sampling | Huaibo Huang · Xiaoqiang Zhou · Jie Cao · Ran He · Tieniu Tan | N/A | Code |
| Renderable Neural Radiance Map for Visual Navigation | Obin Kwon · Jeongho Park · Songhwai Oh | N/A | Code |
| Learning Compact Representations for LiDAR Completion and Generation | Yuwen Xiong · Wei-Chiu Ma · Jingkang Wang · Raquel Urtasun | N/A | Code |
| CoMFormer: Continual Learning in Semantic and Panoptic Segmentation | Fabio Cermelli · Matthieu Cord · Arthur Douillard | N/A | Code |
| A Bag-of-Prototypes Representation for Dataset-Level Applications | Weijie Tu · Weijian Deng · Tom Gedeon · Liang Zheng | N/A | Code |
| Geometric Visual Similarity Learning in 3D Medical Image Self-Supervised Pre-Training | Yuting He · Guanyu Yang · Rongjun Ge · Yang Chen · Jean-Louis Coatrieux · Boyu Wang · Shuo Li | N/A | Code |
| Weakly Supervised Video Emotion Detection and Prediction via Cross-Modal Temporal Erasing Network | Zhicheng Zhang · Lijuan Wang · Jufeng Yang | N/A | Code |
| CODA-Prompt: COntinual Decomposed Attention-Based Prompting for Rehearsal-Free Continual Learning | James Seale Smith · Leonid Karlinsky · Vyshnavi Gutta · Paola Cascante-Bonilla · Donghyun Kim · Assaf Arbelle · Rameswar Panda · Rogerio Feris · Zsolt Kira | N/A | Code |
| CodeTalker: Speech-Driven 3D Facial Animation With Discrete Motion Prior | Jinbo Xing · Menghan Xia · Yuechen Zhang · Xiaodong Cun · Jue Wang · Tien-Tsin Wong | N/A | Code |
| VolRecon: Volume Rendering of Signed Ray Distance Functions for Generalizable Multi-View Reconstruction | Yufan Ren · Fangjinhua Wang · Tong Zhang · Marc Pollefeys · Sabine Süsstrunk | N/A | Code |
| NewsNet: A Novel Dataset for Hierarchical Temporal Segmentation | Haoqian Wu · Keyu Chen · Haozhe Liu · Mingchen Zhuge · Bing Li · Ruizhi Qiao · Xiujun Shu · Bei Gan · Liangsheng Xu · Bo Ren · Mengmeng Xu · Wentian Zhang · Raghavendra Ramachandra · Chia-Wen Lin · Bernard Ghanem | N/A | Code |
| Ref-NPR: Reference-Based Non-Photorealistic Radiance Fields for Controllable Scene Stylization | Yuechen Zhang · Zexin He · Jinbo Xing · Xufeng Yao · Jiaya Jia | N/A | Code |
| GANmouflage: 3D Object Nondetection With Texture Fields | Rui Guo · Jasmine Collins · Oscar de Lima · Andrew Owens | N/A | Code |
| GP-VTON: Towards General Purpose Virtual Try-On via Collaborative Local-Flow Global-Parsing Learning | Zhenyu Xie · Zaiyu Huang · Xin Dong · Fuwei Zhao · Haoye Dong · Xijin Zhang · Feida Zhu · Xiaodan Liang | N/A | Code |
| DeSTSeg: Segmentation Guided Denoising Student-Teacher for Anomaly Detection | Xuan Zhang · Shiyu Li · Xi Li · Ping Huang · Jiulong Shan · Ting Chen | N/A | Code |
| Pix2map: Cross-Modal Retrieval for Inferring Street Maps From Images | Xindi Wu · KwunFung Lau · Francesco Ferroni · Aljoša Ošep · Deva Ramanan | N/A | Code |
| Beyond mAP: Towards Better Evaluation of Instance Segmentation | Rohit Jena · Lukas Zhornyak · Nehal Doiphode · Pratik Chaudhari · Vivek Buch · James Gee · Jianbo Shi | N/A | Code |
| Federated Learning With Data-Agnostic Distribution Fusion | Jian-hui Duan · Wenzhong Li · Derun Zou · Ruichen Li · Sanglu Lu | N/A | Code |
| Make-a-Story: Visual Memory Conditioned Consistent Story Generation | Tanzila Rahman · Hsin-Ying Lee · Jian Ren · Sergey Tulyakov · Shweta Mahajan · Leonid Sigal | N/A | Code |
| Scalable, Detailed and Mask-Free Universal Photometric Stereo | Satoshi Ikehata | N/A | Code |
| ToThePoint: Efficient Contrastive Learning of 3D Point Clouds via Recycling | Xinglin Li · Jiajing Chen · Jinhui Ouyang · Hanhui Deng · Senem Velipasalar · Di Wu | N/A | Code |
| Local-to-Global Registration for Bundle-Adjusting Neural Radiance Fields | Yue Chen · Xingyu Chen · Xuan Wang · Qi Zhang · Yu Guo · Ying Shan · Fei Wang | N/A | Code |
| UV Volumes for Real-Time Rendering of Editable Free-View Human Performance | Yue Chen · Xuan Wang · Xingyu Chen · Qi Zhang · Xiaoyu Li · Yu Guo · Jue Wang · Fei Wang | N/A | Code |
| SplineCam: Exact Visualization and Characterization of Deep Network Geometry and Decision Boundaries | Ahmed Imtiaz Humayun · Randall Balestriero · Guha Balakrishnan · Richard G. Baraniuk | N/A | Code |
| Hi-LASSIE: High-Fidelity Articulated Shape and Skeleton Discovery From Sparse Image Ensemble | Chun-Han Yao · Wei-Chih Hung · Yuanzhen Li · Michael Rubinstein · Ming-Hsuan Yang · Varun Jampani | N/A | Code |
| VisFusion: Visibility-Aware Online 3D Scene Reconstruction From Videos | Huiyu Gao · Wei Mao · Miaomiao Liu | N/A | Code |
| Unsupervised Volumetric Animation | Aliaksandr Siarohin · Willi Menapace · Ivan Skorokhodov · Kyle Olszewski · Jian Ren · Hsin-Ying Lee · Menglei Chai · Sergey Tulyakov | N/A | Code |
| DKM: Dense Kernelized Feature Matching for Geometry Estimation | Johan Edstedt · Ioannis Athanasiadis · Mårten Wadenbäck · Michael Felsberg | N/A | Code |
| All in One: Exploring Unified Video-Language Pre-Training | Jinpeng Wang · Yixiao Ge · Rui Yan · Yuying Ge · Kevin Qinghong Lin · Satoshi Tsutsui · Xudong Lin · Guanyu Cai · Jianping Wu · Ying Shan · Xiaohu Qie · Mike Zheng Shou | N/A | Code |
| Spatiotemporal Self-Supervised Learning for Point Clouds in the Wild | Yanhao Wu · Tong Zhang · Wei Ke · Sabine Süsstrunk · Mathieu Salzmann | N/A | Code |
| DynIBaR: Neural Dynamic Image-Based Rendering | Zhengqi Li · Qianqian Wang · Forrester Cole · Richard Tucker · Noah Snavely | N/A | Code |
| Seeing Through the Glass: Neural 3D Reconstruction of Object Inside a Transparent Container | Jinguang Tong · Sundaram Muthu · Fahira Afzal Maken · Chuong Nguyen · Hongdong Li | N/A | Code |
| JAWS: Just a Wild Shot for Cinematic Transfer in Neural Radiance Fields | Xi Wang · Robin Courant · Jinglei Shi · Eric Marchand · Marc Christie | N/A | Code |
| CCuantuMM: Cycle-Consistent Quantum-Hybrid Matching of Multiple Shapes | Harshil Bhatia · Edith Tretschk · Zorah Lähner · Marcel Seelbach Benkner · Michael Moeller · Christian Theobalt · Vladislav Golyanik | N/A | Code |
| NS3D: Neuro-Symbolic Grounding of 3D Objects and Relations | Joy Hsu · Jiayuan Mao · Jiajun Wu | N/A | Code |
| TempSAL – Uncovering Temporal Information for Deep Saliency Prediction | Bahar Aydemir · Ludo Hoffstetter · Tong Zhang · Mathieu Salzmann · Sabine Süsstrunk | N/A | Code |
| BiasBed – Rigorous Texture Bias Evaluation | Nikolai Kalischek · Rodrigo Caye Daudt · Torben Peters · Reinhard Furrer · Jan D. Wegner · Konrad Schindler | N/A | Code |
| Real-Time Neural Light Field on Mobile Devices | Junli Cao · Huan Wang · Pavlo Chemerys · Vladislav Shakhrai · Ju Hu · Yun Fu · Denys Makoviichuk · Sergey Tulyakov · Jian Ren | N/A | Code |
| Where Is My Wallet? Modeling Object Proposal Sets for Egocentric Visual Query Localization | Mengmeng Xu · Yanghao Li · Cheng-Yang Fu · Bernard Ghanem · Tao Xiang · Juan-Manuel Pérez-Rúa | N/A | Code |
| DiffusionRig: Learning Personalized Priors for Facial Appearance Editing | Zheng Ding · Xuaner Zhang · Zhihao Xia · Lars Jebe · Zhuowen Tu · Xiuming Zhang | N/A | Code |
| Neural Scene Chronology | Haotong Lin · Qianqian Wang · Ruojin Cai · Sida Peng · Hadar Averbuch-Elor · Xiaowei Zhou · Noah Snavely | N/A | Code |
| Diversity-Aware Meta Visual Prompting | Qidong Huang · Xiaoyi Dong · Dongdong Chen · Weiming Zhang · Feifei Wang · Gang Hua · Nenghai Yu | N/A | Code |
| Privacy-Preserving Representations Are Not Enough: Recovering Scene Content From Camera Poses | Kunal Chelani · Torsten Sattler · Fredrik Kahl · Zuzana Kukelova | N/A | Code |
| Masked Jigsaw Puzzle: A Versatile Position Embedding for Vision Transformers | Bin Ren · Yahui Liu · Yue Song · Wei Bi · Rita Cucchiara · Nicu Sebe · Wei Wang | N/A | Code |
| Box-Level Active Detection | Mengyao Lyu · Jundong Zhou · Hui Chen · Yijie Huang · Dongdong Yu · Yaqian Li · Yandong Guo · Yuchen Guo · Liuyu Xiang · Guiguang Ding | N/A | Code |
| Unlearnable Clusters: Towards Label-Agnostic Unlearnable Examples | Jiaming Zhang · Xingjun Ma · Qi Yi · Jitao Sang · Yu-Gang Jiang · Yaowei Wang · Changsheng Xu | N/A | Code |
| Generalized Relation Modeling for Transformer Tracking | Shenyuan Gao · Chunluan Zhou · Jun Zhang | N/A | Code |
| MoFusion: A Framework for Denoising-Diffusion-Based Motion Synthesis | Rishabh Dabral · Muhammad Hamza Mughal · Vladislav Golyanik · Christian Theobalt | N/A | Code |
| Patch-Mix Transformer for Unsupervised Domain Adaptation: A Game Perspective | Jinjing Zhu · Haotian Bai · Lin Wang | N/A | Code |
| Distilling Neural Fields for Real-Time Articulated Shape Reconstruction | Jeff Tan · Gengshan Yang · Deva Ramanan | N/A | Code |
| Sampling Is Matter: Point-Guided 3D Human Mesh Reconstruction | Jeonghwan Kim · Mi-Gyeong Gwon · Hyunwoo Park · Hyukmin Kwon · Gi-Mun Um · Wonjun Kim | N/A | Code |
| Image Quality-Aware Diagnosis via Meta-Knowledge Co-Embedding | Haoxuan Che · Siyu Chen · Hao Chen | N/A | Code |
| Towards Practical Plug-and-Play Diffusion Models | Hyojun Go · Yunsung Lee · Jin-Young Kim · Seunghyun Lee · Myeongho Jeong · Hyun Seung Lee · Seungtaek Choi | N/A | Code |
| HRDFuse: Monocular 360° Depth Estimation by Collaboratively Learning Holistic-With-Regional Depth Distributions | Hao Ai · Zidong Cao · Yan-Pei Cao · Ying Shan · Lin Wang | N/A | Code |
| KERM: Knowledge Enhanced Reasoning for Vision-and-Language Navigation | Xiangyang Li · Zihan Wang · Jiahao Yang · Yaowei Wang · Shuqiang Jiang | N/A | Code |
| Tri-Perspective View for Vision-Based 3D Semantic Occupancy Prediction | Yuanhui Huang · Wenzhao Zheng · Yunpeng Zhang · Jie Zhou · Jiwen Lu | N/A | Code |
| EventNeRF: Neural Radiance Fields From a Single Colour Event Camera | Viktor Rudnev · Mohamed Elgharib · Christian Theobalt · Vladislav Golyanik | N/A | Code |
| Physically Realizable Natural-Looking Clothing Textures Evade Person Detectors via 3D Modeling | Zhanhao Hu · Wenda Chu · Xiaopei Zhu · Hui Zhang · Bo Zhang · Xiaolin Hu | N/A | Code |
| Global Vision Transformer Pruning With Hessian-Aware Saliency | Huanrui Yang · Hongxu Yin · Maying Shen · Pavlo Molchanov · Hai Li · Jan Kautz | N/A | Code |
| 3D Human Pose Estimation With Spatio-Temporal Criss-Cross Attention | Zhenhua Tang · Zhaofan Qiu · Yanbin Hao · Richang Hong · Ting Yao | N/A | Code |
| Learning Spatial-Temporal Implicit Neural Representations for Event-Guided Video Super-Resolution | Yunfan Lu · Zipeng Wang · Minjie Liu · Hongjian Wang · Lin Wang | N/A | Code |
| StyleGAN Salon: Multi-View Latent Optimization for Pose-Invariant Hairstyle Transfer | Sasikarn Khwanmuang · Pakkapon Phongthawee · Patsorn Sangkloy · Supasorn Suwajanakorn | N/A | Code |
| ShapeClipper: Scalable 3D Shape Learning From Single-View Images via Geometric and CLIP-Based Consistency | Zixuan Huang · Varun Jampani · Anh Thai · Yuanzhen Li · Stefan Stojanov · James M. Rehg | N/A | Code |
| Efficient Scale-Invariant Generator With Column-Row Entangled Pixel Synthesis | Thuan Hoang Nguyen · Thanh Van Le · Anh Tran | N/A | Code |
| Paired-Point Lifting for Enhanced Privacy-Preserving Visual Localization | Chunghwan Lee · Jaihoon Kim · Chanhyuk Yun · Je Hyeong Hong | N/A | Code |
| Both Style and Distortion Matter: Dual-Path Unsupervised Domain Adaptation for Panoramic Semantic Segmentation | Xu Zheng · Jinjing Zhu · Yexin Liu · Zidong Cao · Chong Fu · Lin Wang | N/A | Code |
| Adaptive Human Matting for Dynamic Videos | Chung-Ching Lin · Jiang Wang · Kun Luo · Kevin Lin · Linjie Li · Lijuan Wang · Zicheng Liu | N/A | Code |
| High-Fidelity Facial Avatar Reconstruction From Monocular Video With Generative Priors | Yunpeng Bai · Yanbo Fan · Xuan Wang · Yong Zhang · Jingxiang Sun · Chun Yuan · Ying Shan | N/A | Code |
| Data-Free Knowledge Distillation via Feature Exchange and Activation Region Constraint | Shikang Yu · Jiachen Chen · Hu Han · Shuqiang Jiang | N/A | Code |
| Im2Hands: Learning Attentive Implicit Representation of Interacting Two-Hand Shapes | Jihyun Lee · Minhyuk Sung · Honggyu Choi · Tae-Kyun Kim | N/A | Code |
| MD-VQA: Multi-Dimensional Quality Assessment for UGC Live Videos | Zicheng Zhang · Wei Wu · Wei Sun · Danyang Tu · Wei Lu · Xiongkuo Min · Ying Chen · Guangtao Zhai | N/A | Code |
| Make Landscape Flatter in Differentially Private Federated Learning | Yifan Shi · Yingqi Liu · Kang Wei · Li Shen · Xueqian Wang · Dacheng Tao | N/A | Code |
| A Large-Scale Robustness Analysis of Video Action Recognition Models | Madeline Chantry Schiappa · Naman Biyani · Prudvi Kamtam · Shruti Vyas · Hamid Palangi · Vibhav Vineet · Yogesh S. Rawat | N/A | Code |
| Multi-Concept Customization of Text-to-Image Diffusion | Nupur Kumari · Bingliang Zhang · Richard Zhang · Eli Shechtman · Jun-Yan Zhu | N/A | Code |
| GANHead: Towards Generative Animatable Neural Head Avatars | Sijing Wu · Yichao Yan · Yunhao Li · Yuhao Cheng · Wenhan Zhu · Ke Gao · Xiaobo Li · Guangtao Zhai | N/A | Code |
| Neural Koopman Pooling: Control-Inspired Temporal Dynamics Encoding for Skeleton-Based Action Recognition | Xinghan Wang · Xin Xu · Yadong Mu | N/A | Code |
| Hierarchical B-Frame Video Coding Using Two-Layer CANF Without Motion Coding | David Alexandre · Hsueh-Ming Hang · Wen-Hsiao Peng | N/A | Code |
| FeatER: An Efficient Network for Human Reconstruction via Feature Map-Based TransformER | Ce Zheng · Matias Mendieta · Taojiannan Yang · Guo-Jun Qi · Chen Chen | N/A | Code |
| Delivering Arbitrary-Modal Semantic Segmentation | Jiaming Zhang · Ruiping Liu · Hao Shi · Kailun Yang · Simon Reiß · Kunyu Peng · Haodong Fu · Kaiwei Wang · Rainer Stiefelhagen | N/A | Code |
| Deep Graph-Based Spatial Consistency for Robust Non-Rigid Point Cloud Registration | Zheng Qin · Hao Yu · Changjian Wang · Yuxing Peng · Kai Xu | N/A | Code |
| HumanGen: Generating Human Radiance Fields With Explicit Priors | Suyi Jiang · Haoran Jiang · Ziyu Wang · Haimin Luo · Wenzheng Chen · Lan Xu | N/A | Code |
| Boosting Accuracy and Robustness of Student Models via Adaptive Adversarial Distillation | Bo Huang · Mingyang Chen · Yi Wang · Junda Lu · Minhao Cheng · Wei Wang | N/A | Code |
| Plug-and-Play Diffusion Features for Text-Driven Image-to-Image Translation | Narek Tumanyan · Michal Geyer · Shai Bagon · Tali Dekel | N/A | Code |
| Rotation-Invariant Transformer for Point Cloud Matching | Hao Yu · Zheng Qin · Ji Hou · Mahdi Saleh · Dongsheng Li · Benjamin Busam · Slobodan Ilic | N/A | Code |
| CLIP2Scene: Towards Label-Efficient 3D Scene Understanding by CLIP | Runnan Chen · Youquan Liu · Lingdong Kong · Xinge Zhu · Yuexin Ma · Yikang Li · Yuenan Hou · Yu Qiao · Wenping Wang | N/A | Code |
| Real-Time 6K Image Rescaling With Rate-Distortion Optimization | Chenyang Qi · Xin Yang · Ka Leong Cheng · Ying-Cong Chen · Qifeng Chen | N/A | Code |
| Focused and Collaborative Feedback Integration for Interactive Image Segmentation | Qiaoqiao Wei · Hui Zhang · Jun-Hai Yong | N/A | Code |
| Language-Guided Music Recommendation for Video via Prompt Analogies | Daniel McKee · Justin Salamon · Josef Sivic · Bryan Russell | N/A | Code |
| TarViS: A Unified Approach for Target-Based Video Segmentation | Ali Athar · Alexander Hermans · Jonathon Luiten · Deva Ramanan · Bastian Leibe | N/A | Code |
| Meta-Personalizing Vision-Language Models To Find Named Instances in Video | Chun-Hsiao Yeh · Bryan Russell · Josef Sivic · Fabian Caba Heilbron · Simon Jenni | N/A | Code |
| ARKitTrack: A New Diverse Dataset for Tracking Using Mobile RGB-D Data | Haojie Zhao · Junsong Chen · Lijun Wang · Huchuan Lu | N/A | Code |
| Scaling Language-Image Pre-Training via Masking | Yanghao Li · Haoqi Fan · Ronghang Hu · Christoph Feichtenhofer · Kaiming He | N/A | Code |
| SeqTrack: Sequence to Sequence Learning for Visual Object Tracking | Xin Chen · Houwen Peng · Dong Wang · Huchuan Lu · Han Hu | N/A | Code |
| Learning Neural Parametric Head Models | Simon Giebenhain · Tobias Kirschstein · Markos Georgopoulos · Martin Rünz · Lourdes Agapito · Matthias Nießner | N/A | Code |
| L-CoIns: Language-Based Colorization With Instance Awareness | Zheng Chang · Shuchen Weng · Peixuan Zhang · Yu Li · Si Li · Boxin Shi | N/A | Code |
| Vid2Seq: Large-Scale Pretraining of a Visual Language Model for Dense Video Captioning | Antoine Yang · Arsha Nagrani · Paul Hongsuck Seo · Antoine Miech · Jordi Pont-Tuset · Ivan Laptev · Josef Sivic · Cordelia Schmid | N/A | Code |
| ULIP: Learning a Unified Representation of Language, Images, and Point Clouds for 3D Understanding | Le Xue · Mingfei Gao · Chen Xing · Roberto Martín-Martín · Jiajun Wu · Caiming Xiong · Ran Xu · Juan Carlos Niebles · Silvio Savarese | N/A | Code |
| GM-NeRF: Learning Generalizable Model-Based Neural Radiance Fields From Multi-View Images | Jianchuan Chen · Wentao Yi · Liqian Ma · Xu Jia · Huchuan Lu | N/A | Code |
| MIC: Masked Image Consistency for Context-Enhanced Domain Adaptation | Lukas Hoyer · Dengxin Dai · Haoran Wang · Luc Van Gool | N/A | Code |
| MED-VT: Multiscale Encoder-Decoder Video Transformer With Application To Object Segmentation | Rezaul Karim · He Zhao · Richard P. Wildes · Mennatullah Siam | N/A | Code |
| Hierarchical Dense Correlation Distillation for Few-Shot Segmentation | Bohao Peng · Zhuotao Tian · Xiaoyang Wu · Chengyao Wang · Shu Liu · Jingyong Su · Jiaya Jia | N/A | Code |
| Universal Instance Perception As Object Discovery and Retrieval | Bin Yan · Yi Jiang · Jiannan Wu · Dong Wang · Ping Luo · Zehuan Yuan · Huchuan Lu | N/A | Code |
| Bi-Directional Distribution Alignment for Transductive Zero-Shot Learning | Zhicai Wang · Yanbin Hao · Tingting Mu · Ouxiang Li · Shuo Wang · Xiangnan He | N/A | Code |
| Open-Vocabulary Semantic Segmentation With Mask-Adapted CLIP | Feng Liang · Bichen Wu · Xiaoliang Dai · Kunpeng Li · Yinan Zhao · Hang Zhang · Peizhao Zhang · Peter Vajda · Diana Marculescu | N/A | Code |
| ImageBind: One Embedding Space To Bind Them All | Rohit Girdhar · Alaaeldin El-Nouby · Zhuang Liu · Mannat Singh · Kalyan Vasudev Alwala · Armand Joulin · Ishan Misra | N/A | Code |
| Learning and Aggregating Lane Graphs for Urban Automated Driving | Martin Büchner · Jannik Zürn · Ion-George Todoran · Abhinav Valada · Wolfram Burgard | N/A | Code |
| High-Resolution Image Reconstruction With Latent Diffusion Models From Human Brain Activity | Yu Takagi · Shinji Nishimoto | N/A | Code |
| 3D Cinemagraphy From a Single Image | Xingyi Li · Zhiguo Cao · Huiqiang Sun · Jianming Zhang · Ke Xian · Guosheng Lin | N/A | Code |
| Understanding and Improving Visual Prompting: A Label-Mapping Perspective | Aochuan Chen · Yuguang Yao · Pin-Yu Chen · Yihua Zhang · Sijia Liu | N/A | Code |
| Cut and Learn for Unsupervised Object Detection and Instance Segmentation | Xudong Wang · Rohit Girdhar · Stella X. Yu · Ishan Misra | N/A | Code |
| DF-Platter: Multi-Face Heterogeneous Deepfake Dataset | Kartik Narayan · Harsh Agarwal · Kartik Thakral · Surbhi Mittal · Mayank Vatsa · Richa Singh | N/A | Code |
| BASiS: Batch Aligned Spectral Embedding Space | Or Streicher · Ido Cohen · Guy Gilboa | N/A | Code |
| Annealing-Based Label-Transfer Learning for Open World Object Detection | Yuqing Ma · Hainan Li · Zhange Zhang · Jinyang Guo · Shanghang Zhang · Ruihao Gong · Xianglong Liu | N/A | Code |
| Behind the Scenes: Density Fields for Single View Reconstruction | Felix Wimbauer · Nan Yang · Christian Rupprecht · Daniel Cremers | N/A | Code |
| Learning Video Representations From Large Language Models | Yue Zhao · Ishan Misra · Philipp Krähenbühl · Rohit Girdhar | N/A | Code |
| Quantum Multi-Model Fitting | Matteo Farina · Luca Magri · Willi Menapace · Elisa Ricci · Vladislav Golyanik · Federica Arrigoni | N/A | Code |
| Power Bundle Adjustment for Large-Scale 3D Reconstruction | Simon Weber · Nikolaus Demmel · Tin Chon Chan · Daniel Cremers | N/A | Code |
| Optimization-Inspired Cross-Attention Transformer for Compressive Sensing | Jiechong Song · Chong Mou · Shiqi Wang · Siwei Ma · Jian Zhang | N/A | Code |
| NeuMap: Neural Coordinate Mapping by Auto-Transdecoder for Camera Localization | Shitao Tang · Sicong Tang · Andrea Tagliasacchi · Ping Tan · Yasutaka Furukawa | N/A | Code |
| Back to the Source: Diffusion-Driven Adaptation To Test-Time Corruption | Jin Gao · Jialing Zhang · Xihui Liu · Trevor Darrell · Evan Shelhamer · Dequan Wang | N/A | Code |
| Learning Neural Duplex Radiance Fields for Real-Time View Synthesis | Ziyu Wan · Christian Richardt · Aljaž Božič · Chao Li · Vijay Rengarajan · Seonghyeon Nam · Xiaoyu Xiang · Tuotuo Li · Bo Zhu · Rakesh Ranjan · Jing Liao | N/A | Code |
| Object Pop-Up: Can We Infer 3D Objects and Their Poses From Human Interactions Alone? | Ilya A. Petrov · Riccardo Marin · Julian Chibane · Gerard Pons-Moll | N/A | Code |
| G-MSM: Unsupervised Multi-Shape Matching With Graph-Based Affinity Priors | Marvin Eisenberger · Aysim Toker · Laura Leal-Taixé · Daniel Cremers | N/A | Code |
| Data-Efficient Large Scale Place Recognition With Graded Similarity Supervision | María Leyva-Vallina · Nicola Strisciuglio · Nicolai Petkov | N/A | Code |
| Mapping Degeneration Meets Label Evolution: Learning Infrared Small Target Detection With Single Point Supervision | Xinyi Ying · Li Liu · Yingqian Wang · Ruojing Li · Nuo Chen · Zaiping Lin · Weidong Sheng · Shilin Zhou | N/A | Code |
| Instant Domain Augmentation for LiDAR Semantic Segmentation | Kwonyoung Ryu · Soonmin Hwang · Jaesik Park | N/A | Code |
| R2Former: Unified Retrieval and Reranking Transformer for Place Recognition | Sijie Zhu · Linjie Yang · Chen Chen · Mubarak Shah · Xiaohui Shen · Heng Wang | N/A | Code |
| Detecting and Grounding Multi-Modal Media Manipulation | Rui Shao · Tianxing Wu · Ziwei Liu | N/A | Code |
| Detecting Backdoors in Pre-Trained Encoders | Shiwei Feng · Guanhong Tao · Siyuan Cheng · Guangyu Shen · Xiangzhe Xu · Yingqi Liu · Kaiyuan Zhang · Shiqing Ma · Xiangyu Zhang | N/A | Code |
| Scaling Up GANs for Text-to-Image Synthesis | Minguk Kang · Jun-Yan Zhu · Richard Zhang · Jaesik Park · Eli Shechtman · Sylvain Paris · Taesung Park | N/A | Code |
| Dense-Localizing Audio-Visual Events in Untrimmed Videos: A Large-Scale Benchmark and Baseline | Tiantian Geng · Teng Wang · Jinming Duan · Runmin Cong · Feng Zheng | N/A | Code |
| PanoHead: Geometry-Aware 3D Full-Head Synthesis in 360° | Sizhe An · Hongyi Xu · Yichun Shi · Guoxian Song · Umit Y. Ogras · Linjie Luo | N/A | Code |
| Modality-Invariant Visual Odometry for Embodied Vision | Marius Memmel · Roman Bachmann · Amir Zamir | N/A | Code |
| 3D Video Loops From Asynchronous Input | Li Ma · Xiaoyu Li · Jing Liao · Pedro V. Sander | N/A | Code |
| Human-Art: A Versatile Human-Centric Dataset Bridging Natural and Artificial Scenes | Xuan Ju · Ailing Zeng · Jianan Wang · Qiang Xu · Lei Zhang | N/A | Code |
| PosterLayout: A New Benchmark and Approach for Content-Aware Visual-Textual Presentation Layout | Hsiao Yuan Hsu · Xiangteng He · Yuxin Peng · Hao Kong · Qing Zhang | N/A | Code |
| A Soma Segmentation Benchmark in Full Adult Fly Brain | Xiaoyu Liu · Bo Hu · Mingxing Li · Wei Huang · Yueyi Zhang · Zhiwei Xiong | N/A | Code |
| One-Stage 3D Whole-Body Mesh Recovery With Component Aware Transformer | Jing Lin · Ailing Zeng · Haoqian Wang · Lei Zhang · Yu Li | N/A | Code |
| Listening Human Behavior: 3D Human Pose Estimation With Acoustic Signals | Yuto Shibata · Yutaka Kawashima · Mariko Isogawa · Go Irie · Akisato Kimura · Yoshimitsu Aoki | N/A | Code |
| Hand Avatar: Free-Pose Hand Animation and Rendering From Monocular Video | Xingyu Chen · Baoyuan Wang · Heung-Yeung Shum | N/A | Code |
| M6Doc: A Large-Scale Multi-Format, Multi-Type, Multi-Layout, Multi-Language, Multi-Annotation Category Dataset for Modern Document Layout Analysis | Hiuyi Cheng · Peirong Zhang · Sihang Wu · Jiaxin Zhang · Qiyuan Zhu · Zecheng Xie · Jing Li · Kai Ding · Lianwen Jin | N/A | Code |
| Neural Congealing: Aligning Images to a Joint Semantic Atlas | Dolev Ofri-Amar · Michal Geyer · Yoni Kasten · Tali Dekel | N/A | Code |
| BoxTeacher: Exploring High-Quality Pseudo Labels for Weakly Supervised Instance Segmentation | Tianheng Cheng · Xinggang Wang · Shaoyu Chen · Qian Zhang · Wenyu Liu | N/A | Code |
| BEDLAM: A Synthetic Dataset of Bodies Exhibiting Detailed Lifelike Animated Motion | Michael J. Black · Priyanka Patel · Joachim Tesch · Jinlong Yang | N/A | Code |
| Mask DINO: Towards a Unified Transformer-Based Framework for Object Detection and Segmentation | Feng Li · Hao Zhang · Huaizhe Xu · Shilong Liu · Lei Zhang · Lionel M. Ni · Heung-Yeung Shum | N/A | Code |
| Learning Detailed Radiance Manifolds for High-Fidelity and 3D-Consistent Portrait Synthesis From Monocular Image | Yu Deng · Baoyuan Wang · Heung-Yeung Shum | N/A | Code |
| 3DAvatarGAN: Bridging Domains for Personalized Editable Avatars | Rameen Abdal · Hsin-Ying Lee · Peihao Zhu · Menglei Chai · Aliaksandr Siarohin · Peter Wonka · Sergey Tulyakov | N/A | Code |
| FLEX: Full-Body Grasping Without Full-Body Grasps | Purva Tendulkar · Dídac Surís · Carl Vondrick | N/A | Code |
| UDE: A Unified Driving Engine for Human Motion Generation | Zixiang Zhou · Baoyuan Wang | N/A | Code |
| Video Test-Time Adaptation for Action Recognition | Wei Lin · Muhammad Jehanzeb Mirza · Mateusz Kozinski · Horst Possegger · Hilde Kuehne · Horst Bischof | N/A | Code |
| Progressive Disentangled Representation Learning for Fine-Grained Controllable Talking Head Synthesis | Duomin Wang · Yu Deng · Zixin Yin · Heung-Yeung Shum · Baoyuan Wang | N/A | Code |
| MIME: Human-Aware 3D Scene Generation | Hongwei Yi · Chun-Hao P. Huang · Shashank Tripathi · Lea Hering · Justus Thies · Michael J. Black | N/A | Code |
| AstroNet: When Astrocyte Meets Artificial Neural Network | Mengqiao Han · Liyuan Pan · Xiabi Liu | N/A | Code |
| Stimulus Verification Is a Universal and Effective Sampler in Multi-Modal Human Trajectory Prediction | Jianhua Sun · Yuxuan Li · Liang Chai · Cewu Lu | N/A | Code |
| ActMAD: Activation Matching To Align Distributions for Test-Time-Training | Muhammad Jehanzeb Mirza · Pol Jané Soneira · Wei Lin · Mateusz Kozinski · Horst Possegger · Horst Bischof | N/A | Code |
| Visual Prompt Multi-Modal Tracking | Jiawen Zhu · Simiao Lai · Xin Chen · Dong Wang · Huchuan Lu | N/A | Code |
| Reconstructing Signing Avatars From Video Using Linguistic Priors | Maria-Paola Forte · Peter Kulits · Chun-Hao P. Huang · Vasileios Choutas · Dimitrios Tzionas · Katherine J. Kuchenbecker · Michael J. Black | N/A | Code |
| Patch-Based 3D Natural Scene Generation From a Single Example | Weiyu Li · Xuelin Chen · Jue Wang · Baoquan Chen | N/A | Code |
| Re-Basin via Implicit Sinkhorn Differentiation | Fidel A. Guerrero Peña · Heitor Rapela Medeiros · Thomas Dubail · Masih Aminbeidokhti · Eric Granger · Marco Pedersoli | N/A | Code |
| Slide-Transformer: Hierarchical Vision Transformer With Local Self-Attention | Xuran Pan · Tianzhu Ye · Zhuofan Xia · Shiji Song · Gao Huang | N/A | Code |
| Planning-Oriented Autonomous Driving | Yihan Hu · Jiazhi Yang · Li Chen · Keyu Li · Chonghao Sima · Xizhou Zhu · Siqi Chai · Senyao Du · Tianwei Lin · Wenhai Wang · Lewei Lu · Xiaosong Jia · Qiang Liu · Jifeng Dai · Yu Qiao · Hongyang Li | N/A | Code |
| Enhancing Deformable Local Features by Jointly Learning To Detect and Describe Keypoints | Guilherme Potje · Felipe Cadar · André Araujo · Renato Martins · Erickson R. Nascimento | N/A | Code |
| 3D Human Pose Estimation via Intuitive Physics | Shashank Tripathi · Lea Müller · Chun-Hao P. Huang · Omid Taheri · Michael J. Black · Dimitrios Tzionas | N/A | Code |
| Defending Against Patch-Based Backdoor Attacks on Self-Supervised Learning | Ajinkya Tejankar · Maziar Sanjabi · Qifan Wang · Sinong Wang · Hamed Firooz · Hamed Pirsiavash · Liang Tan | N/A | Code |
| PointCMP: Contrastive Mask Prediction for Self-Supervised Learning on Point Cloud Videos | Zhiqiang Shen · Xiaoxiao Sheng · Longguang Wang · Yulan Guo · Qiong Liu · Xi Zhou | N/A | Code |
| Blowing in the Wind: CycleNet for Human Cinemagraphs From Still Images | Hugo Bertiche · Niloy J. Mitra · Kuldeep Kulkarni · Chun-Hao P. Huang · Tuanfeng Y. Wang · Meysam Madadi · Sergio Escalera · Duygu Ceylan | N/A | Code |
| Multiple Instance Learning via Iterative Self-Paced Supervised Contrastive Learning | Kangning Liu · Weicheng Zhu · Yiqiu Shen · Sheng Liu · Narges Razavian · Krzysztof J. Geras · Carlos Fernandez-Granda | N/A | Code |
| Learning Steerable Function for Efficient Image Resampling | Jiacheng Li · Chang Chen · Wei Huang · Zhiqiang Lang · Fenglong Song · Youliang Yan · Zhiwei Xiong | N/A | Code |
| Deep Deterministic Uncertainty: A New Simple Baseline | Jishnu Mukhoti · Andreas Kirsch · Joost van Amersfoort · Philip H.S. Torr · Yarin Gal | N/A | Code |
| Removing Objects From Neural Radiance Fields | Silvan Weder · Guillermo Garcia-Hernando · Áron Monszpart · Marc Pollefeys · Gabriel J. Brostow · Michael Firman · Sara Vicente | N/A | Code |
| PartManip: Learning Cross-Category Generalizable Part Manipulation Policy From Point Cloud Observations | Haoran Geng · Ziming Li · Yiran Geng · Jiayi Chen · Hao Dong · He Wang | N/A | Code |
| T-SEA: Transfer-Based Self-Ensemble Attack on Object Detection | Hao Huang · Ziyan Chen · Huanran Chen · Yongtao Wang · Kevin Zhang | N/A | Code |
| DINN360: Deformable Invertible Neural Network for Latitude-Aware 360° Image Rescaling | Yichen Guo · Mai Xu · Lai Jiang · Leonid Sigal · Yunjin Chen | N/A | Code |
| Learning Human-to-Robot Handovers From Point Clouds | Sammy Christen · Wei Yang · Claudia Pérez-D’Arpino · Otmar Hilliges · Dieter Fox · Yu-Wei Chao | N/A | Code |
| Multi-View Azimuth Stereo via Tangent Space Consistency | Xu Cao · Hiroaki Santo · Fumio Okura · Yasuyuki Matsushita | N/A | Code |
| Mod-Squad: Designing Mixtures of Experts As Modular Multi-Task Learners | Zitian Chen · Yikang Shen · Mingyu Ding · Zhenfang Chen · Hengshuang Zhao · Erik G. Learned-Miller · Chuang Gan | N/A | Code |
| gSDF: Geometry-Driven Signed Distance Functions for 3D Hand-Object Reconstruction | Zerui Chen · Shizhe Chen · Cordelia Schmid · Ivan Laptev | N/A | Code |
| Delving StyleGAN Inversion for Image Editing: A Foundation Latent Space Viewpoint | Hongyu Liu · Yibing Song · Qifeng Chen | N/A | Code |
| Generative Bias for Robust Visual Question Answering | Jae Won Cho · Dong-Jin Kim · Hyeonggon Ryu · In So Kweon | N/A | Code |
| Backdoor Defense via Deconfounded Representation Learning | Zaixi Zhang · Qi Liu · Zhicai Wang · Zepu Lu · Qingyong Hu | N/A | Code |
| High-Fidelity 3D GAN Inversion by Pseudo-Multi-View Optimization | Jiaxin Xie · Hao Ouyang · Jingtan Piao · Chenyang Lei · Qifeng Chen | N/A | Code |
| Affordance Diffusion: Synthesizing Hand-Object Interactions | Yufei Ye · Xueting Li · Abhinav Gupta · Shalini De Mello · Stan Birchfield · Jiaming Song · Shubham Tulsiani · Sifei Liu | N/A | Code |
| Zero-Shot Pose Transfer for Unrigged Stylized 3D Characters | Jiashun Wang · Xueting Li · Sifei Liu · Shalini De Mello · Orazio Gallo · Xiaolong Wang · Jan Kautz | N/A | Code |
| Point Cloud Forecasting as a Proxy for 4D Occupancy Forecasting | Tarasha Khurana · Peiyun Hu · David Held · Deva Ramanan | N/A | Code |
| Are Data-Driven Explanations Robust Against Out-of-Distribution Data? | Tang Li · Fengchun Qiao · Mengmeng Ma · Xi Peng | N/A | Code |
| Multiscale Tensor Decomposition and Rendering Equation Encoding for View Synthesis | Kang Han · Wei Xiang | N/A | Code |
| Boosting Video Object Segmentation via Space-Time Correspondence Learning | Yurong Zhang · Liulei Li · Wenguan Wang · Rong Xie · Li Song · Wenjun Zhang | N/A | Code |
| X-Pruner: eXplainable Pruning for Vision Transformers | Lu Yu · Wei Xiang | N/A | Code |
| GazeNeRF: 3D-Aware Gaze Redirection With Neural Radiance Fields | Alessandro Ruzzi · Xiangwei Shi · Xi Wang · Gengyan Li · Shalini De Mello · Hyung Jin Chang · Xucong Zhang · Otmar Hilliges | N/A | Code |
| Real-Time Evaluation in Online Continual Learning: A New Hope | Yasir Ghunaim · Adel Bibi · Kumail Alhamoud · Motasem Alfarra · Hasan Abed Al Kader Hammoud · Ameya Prabhu · Philip H.S. Torr · Bernard Ghanem | N/A | Code |
| Contrastive Semi-Supervised Learning for Underwater Image Restoration via Reliable Bank | Shirui Huang · Keyan Wang · Huan Liu · Jun Chen · Yunsong Li | N/A | Code |
| A New Dataset Based on Images Taken by Blind People for Testing the Robustness of Image Classification Models Trained for ImageNet Categories | Reza Akbarian Bafghi · Danna Gurari | N/A | Code |
| Open-Vocabulary Panoptic Segmentation With Text-to-Image Diffusion Models | Jiarui Xu · Sifei Liu · Arash Vahdat · Wonmin Byeon · Xiaolong Wang · Shalini De Mello | N/A | Code |
| Reconstructing Animatable Categories From Videos | Gengshan Yang · Chaoyang Wang · N. Dinesh Reddy · Deva Ramanan | N/A | Code |
| Learning Visual Representations via Language-Guided Sampling | Mohamed El Banani · Karan Desai · Justin Johnson | N/A | Code |
| Four-View Geometry With Unknown Radial Distortion | Petr Hruby · Viktor Korotynskiy · Timothy Duff · Luke Oeding · Marc Pollefeys · Tomas Pajdla · Viktor Larsson | N/A | Code |
| DATID-3D: Diversity-Preserved Domain Adaptation Using Text-to-Image Diffusion for 3D Generative Model | Gwanghyun Kim · Se Young Chun | N/A | Code |
| ConZIC: Controllable Zero-Shot Image Captioning by Sampling-Based Polishing | Zequn Zeng · Hao Zhang · Ruiying Lu · Dongsheng Wang · Bo Chen · Zhengjue Wang | N/A | Code |
| Feature Separation and Recalibration for Adversarial Robustness | Woo Jae Kim · Yoonki Cho · Junsik Jung · Sung-Eui Yoon | N/A | Code |
| Event-Based Blurry Frame Interpolation Under Blind Exposure | Wenming Weng · Yueyi Zhang · Zhiwei Xiong | N/A | Code |
| MobileNeRF: Exploiting the Polygon Rasterization Pipeline for Efficient Neural Field Rendering on Mobile Architectures | Zhiqin Chen · Thomas Funkhouser · Peter Hedman · Andrea Tagliasacchi | N/A | Code |
| HandsOff: Labeled Dataset Generation With No Additional Human Annotations | Austin Xu · Mariya I. Vasileva · Achal Dave · Arjun Seshadri | N/A | Code |
| Analyzing and Diagnosing Pose Estimation With Attributions | Qiyuan He · Linlin Yang · Kerui Gu · Qiuxia Lin · Angela Yao | N/A | Code |
| Overcoming the Trade-Off Between Accuracy and Plausibility in 3D Hand Shape Reconstruction | Ziwei Yu · Chen Li · Linlin Yang · Xiaoxu Zheng · Michael Bi Mi · Gim Hee Lee · Angela Yao | N/A | Code |
| VIVE3D: Viewpoint-Independent Video Editing Using 3D-Aware GANs | Anna Frühstück · Nikolaos Sarafianos · Yuanlu Xu · Peter Wonka · Tony Tung | N/A | Code |
| Pruning Parameterization With Bi-Level Optimization for Efficient Semantic Segmentation on the Edge | Changdi Yang · Pu Zhao · Yanyu Li · Wei Niu · Jiexiong Guan · Hao Tang · Minghai Qin · Bin Ren · Xue Lin · Yanzhi Wang | N/A | Code |
| Nerflets: Local Radiance Fields for Efficient Structure-Aware 3D Scene Representation From 2D Supervision | Xiaoshuai Zhang · Abhijit Kundu · Thomas Funkhouser · Leonidas Guibas · Hao Su · Kyle Genova | N/A | Code |
| VDN-NeRF: Resolving Shape-Radiance Ambiguity via View-Dependence Normalization | Bingfan Zhu · Yanchao Yang · Xulong Wang · Youyi Zheng · Leonidas Guibas | N/A | Code |
| OpenScene: 3D Scene Understanding With Open Vocabularies | Songyou Peng · Kyle Genova · Chiyu “Max” Jiang · Andrea Tagliasacchi · Marc Pollefeys · Thomas Funkhouser | N/A | Code |
| A New Benchmark: On the Utility of Synthetic Data With Blender for Bare Supervised Learning and Downstream Domain Adaptation | Hui Tang · Kui Jia | N/A | Code |
| Implicit View-Time Interpolation of Stereo Videos Using Multi-Plane Disparities and Non-Uniform Coordinates | Avinash Paliwal · Andrii Tsarov · Nima Khademi Kalantari | N/A | Code |
| A Large-Scale Homography Benchmark | Daniel Barath · Dmytro Mishkin · Michal Polic · Wolfgang Förstner · Jiri Matas | N/A | Code |
| Glocal Energy-Based Learning for Few-Shot Open-Set Recognition | Haoyu Wang · Guansong Pang · Peng Wang · Lei Zhang · Wei Wei · Yanning Zhang | N/A | Code |
| MEDIC: Remove Model Backdoors via Importance Driven Cloning | Qiuling Xu · Guanhong Tao · Jean Honorio · Yingqi Liu · Shengwei An · Guangyu Shen · Siyuan Cheng · Xiangyu Zhang | N/A | Code |
| Finding Geometric Models by Clustering in the Consensus Space | Daniel Barath · Denys Rozumnyi · Ivan Eichhardt · Levente Hajder · Jiri Matas | N/A | Code |
| Imagic: Text-Based Real Image Editing With Diffusion Models | Bahjat Kawar · Shiran Zada · Oran Lang · Omer Tov · Huiwen Chang · Tali Dekel · Inbar Mosseri · Michal Irani | N/A | Code |
| DeepLSD: Line Segment Detection and Refinement With Deep Image Gradients | Rémi Pautrat · Daniel Barath · Viktor Larsson · Martin R. Oswald · Marc Pollefeys | N/A | Code |
| H2ONet: Hand-Occlusion-and-Orientation-Aware Network for Real-Time 3D Hand Mesh Reconstruction | Hao Xu · Tianyu Wang · Xiao Tang · Chi-Wing Fu | N/A | Code |
| Learning Weather-General and Weather-Specific Features for Image Restoration Under Multiple Adverse Weather Conditions | Yurui Zhu · Tianyu Wang · Xueyang Fu · Xuanyu Yang · Xin Guo · Jifeng Dai · Yu Qiao · Xiaowei Hu | N/A | Code |
| MoDi: Unconditional Motion Synthesis From Diverse Data | Sigal Raab · Inbal Leibovitch · Peizhuo Li · Kfir Aberman · Olga Sorkine-Hornung · Daniel Cohen-Or | N/A | Code |
| PC2: Projection-Conditioned Point Cloud Diffusion for Single-Image 3D Reconstruction | Luke Melas-Kyriazi · Christian Rupprecht · Andrea Vedaldi | N/A | Code |
| SliceMatch: Geometry-Guided Aggregation for Cross-View Pose Estimation | Zimin Xia · Ted Lentsch · Julian F. P. Kooij | N/A | Code |
| RealFusion: 360° Reconstruction of Any Object From a Single Image | Luke Melas-Kyriazi · Iro Laina · Christian Rupprecht · Andrea Vedaldi | N/A | Code |
| Masked and Adaptive Transformer for Exemplar Based Image Translation | Chang Jiang · Fei Gao · Biao Ma · Yuhao Lin · Nannan Wang · Gang Xu | N/A | Code |
| DynamicStereo: Consistent Dynamic Depth From Stereo Videos | Nikita Karaev · Ignacio Rocco · Benjamin Graham · Natalia Neverova · Andrea Vedaldi · Christian Rupprecht | N/A | Code |
| Masked Representation Learning for Domain Generalized Stereo Matching | Zhibo Rao · Bangshu Xiong · Mingyi He · Mochu Xiang · Renjie He · Zhelun Shen · Xing Li | N/A | Code |
| MV-JAR: Masked Voxel Jigsaw and Reconstruction for LiDAR-Based Self-Supervised Pre-Training | Runsen Xu · Tai Wang · Wenwei Zhang · Runjian Chen · Jinkun Cao · Jiangmiao Pang · Dahua Lin | N/A | Code |
| Fresnel Microfacet BRDF: Unification of Polari-Radiometric Surface-Body Reflection | Tomoki Ichikawa · Yoshiki Fukao · Shohei Nobuhara · Ko Nishino | N/A | Code |
| Instant Multi-View Head Capture Through Learnable Registration | Timo Bolkart · Tianye Li · Michael J. Black | N/A | Code |
| POEM: Reconstructing Hand in a Point Embedded Multi-View Stereo | Lixin Yang · Jian Xu · Licheng Zhong · Xinyu Zhan · Zhicheng Wang · Kejian Wu · Cewu Lu | N/A | Code |
| Diffusion-Based Generation, Optimization, and Planning in 3D Scenes | Siyuan Huang · Zan Wang · Puhao Li · Baoxiong Jia · Tengyu Liu · Yixin Zhu · Wei Liang · Song-Chun Zhu | N/A | Code |
| Visibility Constrained Wide-Band Illumination Spectrum Design for Seeing-in-the-Dark | Muyao Niu · Zhuoxiao Li · Zhihang Zhong · Yinqiang Zheng | N/A | Code |
| SketchXAI: A First Look at Explainability for Human Sketches | Zhiyu Qu · Yulia Gryaditskaya · Ke Li · Kaiyue Pang · Tao Xiang · Yi-Zhe Song | N/A | Code |
| TTA-COPE: Test-Time Adaptation for Category-Level Object Pose Estimation | Taeyeop Lee · Jonathan Tremblay · Valts Blukis · Bowen Wen · Byeong-Uk Lee · Inkyu Shin · Stan Birchfield · In So Kweon · Kuk-Jin Yoon | N/A | Code |
| Teleidoscopic Imaging System for Microscale 3D Shape Reconstruction | Ryo Kawahara · Meng-Yu Jennifer Kuo · Shohei Nobuhara | N/A | Code |
| Reliability in Semantic Segmentation: Are We on the Right Track? | Pau de Jorge · Riccardo Volpi · Philip H.S. Torr · Grégory Rogez | N/A | Code |
| SMPConv: Self-Moving Point Representations for Continuous Convolution | Sanghyeon Kim · Eunbyung Park | N/A | Code |
| Few-Shot Geometry-Aware Keypoint Localization | Xingzhe He · Gaurav Bharaj · David Ferman · Helge Rhodin · Pablo Garrido | N/A | Code |
| STMT: A Spatial-Temporal Mesh Transformer for MoCap-Based Action Recognition | Xiaoyu Zhu · Po-Yao Huang · Junwei Liang · Celso M. de Melo · Alexander G. Hauptmann | N/A | Code |
| Knowledge Combination To Learn Rotated Detection Without Rotated Annotation | Tianyu Zhu · Bryce Ferenczi · Pulak Purkait · Tom Drummond · Hamid Rezatofighi · Anton van den Hengel | N/A | Code |
| OTAvatar: One-Shot Talking Face Avatar With Controllable Tri-Plane Rendering | Zhiyuan Ma · Xiangyu Zhu · Guo-Jun Qi · Zhen Lei · Lei Zhang | N/A | Code |
| Supervised Masked Knowledge Distillation for Few-Shot Transformers | Han Lin · Guangxing Han · Jiawei Ma · Shiyuan Huang · Xudong Lin · Shih-Fu Chang | N/A | Code |
| Learning Open-Vocabulary Semantic Segmentation Models From Natural Language Supervision | Jilan Xu · Junlin Hou · Yuejie Zhang · Rui Feng · Yi Wang · Yu Qiao · Weidi Xie | N/A | Code |
| ImageNet-E: Benchmarking Neural Network Robustness via Attribute Editing | Xiaodan Li · Yuefeng Chen · Yao Zhu · Shuhui Wang · Rong Zhang · Hui Xue | N/A | Code |
| Neural Residual Radiance Fields for Streamably Free-Viewpoint Videos | Liao Wang · Qiang Hu · Qihan He · Ziyu Wang · Jingyi Yu · Tinne Tuytelaars · Lan Xu · Minye Wu | N/A | Code |
| Active Finetuning: Exploiting Annotation Budget in the Pretraining-Finetuning Paradigm | Yichen Xie · Han Lu · Junchi Yan · Xiaokang Yang · Masayoshi Tomizuka · Wei Zhan | N/A | Code |
| Tensor4D: Efficient Neural 4D Decomposition for High-Fidelity Dynamic Reconstruction and Rendering | Ruizhi Shao · Zerong Zheng · Hanzhang Tu · Boning Liu · Hongwen Zhang · Yebin Liu | N/A | Code |
| RiDDLE: Reversible and Diversified De-Identification With Latent Encryptor | Dongze Li · Wei Wang · Kang Zhao · Jing Dong · Tieniu Tan | N/A | Code |
| RobustNeRF: Ignoring Distractors With Robust Losses | Sara Sabour · Suhani Vora · Daniel Duckworth · Ivan Krasin · David J. Fleet · Andrea Tagliasacchi | N/A | Code |
| Bitstream-Corrupted JPEG Images Are Restorable: Two-Stage Compensation and Alignment Framework for Image Restoration | Wenyang Liu · Yi Wang · Kim-Hui Yap · Lap-Pui Chau | N/A | Code |
| HierVL: Learning Hierarchical Video-Language Embeddings | Kumar Ashutosh · Rohit Girdhar · Lorenzo Torresani · Kristen Grauman | N/A | Code |
| Phone2Proc: Bringing Robust Robots Into Our Chaotic World | Matt Deitke · Rose Hendrix · Ali Farhadi · Kiana Ehsani · Aniruddha Kembhavi | N/A | Code |
| A Light Touch Approach to Teaching Transformers Multi-View Geometry | Yash Bhalgat · João F. Henriques · Andrew Zisserman | N/A | Code |
| Clothed Human Performance Capture With a Double-Layer Neural Radiance Fields | Kangkan Wang · Guofeng Zhang · Suxu Cong · Jian Yang | N/A | Code |
| AutoFocusFormer: Image Segmentation off the Grid | Chen Ziwen · Kaushik Patnaik · Shuangfei Zhai · Alvin Wan · Zhile Ren · Alexander G. Schwing · Alex Colburn · Li Fuxin | N/A | Code |
| Trace and Pace: Controllable Pedestrian Animation via Guided Trajectory Diffusion | Davis Rempe · Zhengyi Luo · Xue Bin Peng · Ye Yuan · Kris Kitani · Karsten Kreis · Sanja Fidler · Or Litany | N/A | Code |
| Observation-Centric SORT: Rethinking SORT for Robust Multi-Object Tracking | Jinkun Cao · Jiangmiao Pang · Xinshuo Weng · Rawal Khirodkar · Kris Kitani | N/A | Code |
| Spider GAN: Leveraging Friendly Neighbors To Accelerate GAN Training | Siddarth Asokan · Chandra Sekhar Seelamantula | N/A | Code |
| Minimizing the Accumulated Trajectory Error To Improve Dataset Distillation | Jiawei Du · Yidi Jiang · Vincent Y. F. Tan · Joey Tianyi Zhou · Haizhou Li | N/A | Code |
| Adaptive Patch Deformation for Textureless-Resilient Multi-View Stereo | Yuesong Wang · Zhaojie Zeng · Tao Guan · Wei Yang · Zhuo Chen · Wenkai Liu · Luoyuan Xu · Yawei Luo | N/A | Code |
| Learning Correspondence Uncertainty via Differentiable Nonlinear Least Squares | Dominik Muhle · Lukas Koestler · Krishna Murthy Jatavallabhula · Daniel Cremers | N/A | Code |
| Learning Anchor Transformations for 3D Garment Animation | Fang Zhao · Zekun Li · Shaoli Huang · Junwu Weng · Tianfei Zhou · Guo-Sen Xie · Jue Wang · Ying Shan | N/A | Code |
| PyPose: A Library for Robot Learning With Physics-Based Optimization | Chen Wang · Dasong Gao · Kuan Xu · Junyi Geng · Yaoyu Hu · Yuheng Qiu · Bowen Li · Fan Yang · Brady Moon · Abhinav Pandey · Aryan · Jiahe Xu · Tianhao Wu · Haonan He · Daning Huang · Zhongqiang Ren · Shibo Zhao · Taimeng Fu · Pranay Reddy · Xiao Lin · Wenshan Wang · Jingnan Shi · Rajat Talak · Kun Cao · Yi Du · Han Wang · Huai Yu · Shanzhao Wang · Siyu Chen · Ananth Kashyap · Rohan Bandaru · Karthik Dantu · Jiajun Wu · Lihua Xie · Luca Carlone · Marco Hutter · Sebastian Scherer | N/A | Code |
| Unicode Analogies: An Anti-Objectivist Visual Reasoning Challenge | Steven Spratley · Krista A. Ehinger · Tim Miller | N/A | Code |
| DyNCA: Real-Time Dynamic Texture Synthesis Using Neural Cellular Automata | Ehsan Pajouheshgar · Yitao Xu · Tong Zhang · Sabine Süsstrunk | N/A | Code |
| Learning Generative Structure Prior for Blind Text Image Super-Resolution | Xiaoming Li · Wangmeng Zuo · Chen Change Loy | N/A | Code |
| CAMS: CAnonicalized Manipulation Spaces for Category-Level Functional Hand-Object Manipulation Synthesis | Juntian Zheng · Qingyuan Zheng · Lixing Fang · Yun Liu · Li Yi | N/A | Code |
| SCPNet: Semantic Scene Completion on Point Cloud | Zhaoyang Xia · Youquan Liu · Xin Li · Xinge Zhu · Yuexin Ma · Yikang Li · Yuenan Hou · Yu Qiao | N/A | Code |
| AMT: All-Pairs Multi-Field Transforms for Efficient Frame Interpolation | Zhen Li · Zuo-Liang Zhu · Ling-Hao Han · Qibin Hou · Chun-Le Guo · Ming-Ming Cheng | N/A | Code |
| Behavioral Analysis of Vision-and-Language Navigation Agents | Zijiao Yang · Arjun Majumdar · Stefan Lee | N/A | Code |
| Geometry and Uncertainty-Aware 3D Point Cloud Class-Incremental Semantic Segmentation | Yuwei Yang · Munawar Hayat · Zhao Jin · Chao Ren · Yinjie Lei | N/A | Code |
| Directional Connectivity-Based Segmentation of Medical Images | Ziyun Yang · Sina Farsiu | N/A | Code |
| ScanDMM: A Deep Markov Model of Scanpath Prediction for 360° Images | Xiangjie Sui · Yuming Fang · Hanwei Zhu · Shiqi Wang · Zhou Wang | N/A | Code |
| 3D Shape Reconstruction of Semi-Transparent Worms | Thomas P. Ilett · Omer Yuval · Thomas Ranner · Netta Cohen · David C. Hogg | N/A | Code |
| Patch-Craft Self-Supervised Training for Correlated Image Denoising | Gregory Vaksman · Michael Elad | N/A | Code |
| NeAT: Learning Neural Implicit Surfaces With Arbitrary Topologies From Multi-View Images | Xiaoxu Meng · Weikai Chen · Bo Yang | N/A | Code |
| DANI-Net: Uncalibrated Photometric Stereo by Differentiable Shadow Handling, Anisotropic Reflectance Modeling, and Neural Inverse Rendering | Zongrui Li · Qian Zheng · Boxin Shi · Gang Pan · Xudong Jiang | N/A | Code |
| Context-Aware Alignment and Mutual Masking for 3D-Language Pre-Training | Zhao Jin · Munawar Hayat · Yuwei Yang · Yulan Guo · Yinjie Lei | N/A | Code |
| Unsupervised Object Localization: Observing the Background To Discover Objects | Oriane Siméoni · Chloé Sekkat · Gilles Puy · Antonín Vobecký · Éloi Zablocki · Patrick Pérez | N/A | Code |
| Bootstrap Your Own Prior: Towards Distribution-Agnostic Novel Class Discovery | Muli Yang · Liancheng Wang · Cheng Deng · Hanwang Zhang | N/A | Code |
| Self-Supervised Geometry-Aware Encoder for Style-Based 3D GAN Inversion | Yushi Lan · Xuyi Meng · Shuai Yang · Chen Change Loy · Bo Dai | N/A | Code |
| NeuralField-LDM: Scene Generation With Hierarchical Latent Diffusion Models | Seung Wook Kim · Bradley Brown · Kangxue Yin · Karsten Kreis · Katja Schwarz · Daiqing Li · Robin Rombach · Antonio Torralba · Sanja Fidler | N/A | Code |
| ALSO: Automotive Lidar Self-Supervision by Occupancy Estimation | Alexandre Boulch · Corentin Sautier · Björn Michele · Gilles Puy · Renaud Marlet | N/A | Code |
| RepMode: Learning to Re-Parameterize Diverse Experts for Subcellular Structure Prediction | Donghao Zhou · Chunbin Gu · Junde Xu · Furui Liu · Qiong Wang · Guangyong Chen · Pheng-Ann Heng | N/A | Code |
| Aligning Bag of Regions for Open-Vocabulary Object Detection | Size Wu · Wenwei Zhang · Sheng Jin · Wentao Liu · Chen Change Loy | N/A | Code |
| Neuralangelo: High-Fidelity Neural Surface Reconstruction | Zhaoshuo Li · Thomas Müller · Alex Evans · Russell H. Taylor · Mathias Unberath · Ming-Yu Liu · Chen-Hsuan Lin | N/A | Code |
| PCT-Net: Full Resolution Image Harmonization Using Pixel-Wise Color Transformations | Julian Jorge Andrade Guerreiro · Mitsuru Nakazawa · Björn Stenger | N/A | Code |
| PaCa-ViT: Learning Patch-to-Cluster Attention in Vision Transformers | Ryan Grainger · Thomas Paniagua · Xi Song · Naresh Cuntoor · Mun Wai Lee · Tianfu Wu | N/A | Code |
| Towards Better Stability and Adaptability: Improve Online Self-Training for Model Adaptation in Semantic Segmentation | Dong Zhao · Shuang Wang · Qi Zang · Dou Quan · Xiutiao Ye · Licheng Jiao | N/A | Code |
| MEGANE: Morphable Eyeglass and Avatar Network | Junxuan Li · Shunsuke Saito · Tomas Simon · Stephen Lombardi · Hongdong Li · Jason Saragih | N/A | Code |
| Generalizable Implicit Neural Representations via Instance Pattern Composers | Chiheon Kim · Doyup Lee · Saehoon Kim · Minsu Cho · Wook-Shin Han | N/A | Code |
| Revisiting Rolling Shutter Bundle Adjustment: Toward Accurate and Fast Solution | Bangyan Liao · Delin Qu · Yifei Xue · Huiqing Zhang · Yizhen Lao | N/A | Code |
| Distribution Shift Inversion for Out-of-Distribution Prediction | Runpeng Yu · Songhua Liu · Xingyi Yang · Xinchao Wang | N/A | Code |
| Wide-Angle Rectification via Content-Aware Conformal Mapping | Qi Zhang · Hongdong Li · Qing Wang | N/A | Code |
| WildLight: In-the-Wild Inverse Rendering With a Flashlight | Ziang Cheng · Junxuan Li · Hongdong Li | N/A | Code |
| Physics-Driven Diffusion Models for Impact Sound Synthesis From Videos | Kun Su · Kaizhi Qian · Eli Shlizerman · Antonio Torralba · Chuang Gan | N/A | Code |
| Probing Neural Representations of Scene Perception in a Hippocampally Dependent Task Using Artificial Neural Networks | Markus Frey · Christian F. Doeller · Caswell Barry | N/A | Code |
| Inverting the Imaging Process by Learning an Implicit Camera Model | Xin Huang · Qi Zhang · Ying Feng · Hongdong Li · Qing Wang | N/A | Code |
| EC2: Emergent Communication for Embodied Control | Yao Mu · Shunyu Yao · Mingyu Ding · Ping Luo · Chuang Gan | N/A | Code |
| Light Source Separation and Intrinsic Image Decomposition Under AC Illumination | Yusaku Yoshida · Ryo Kawahara · Takahiro Okabe | N/A | Code |
| FREDOM: Fairness Domain Adaptation Approach to Semantic Scene Understanding | Thanh-Dat Truong · Ngan Le · Bhiksha Raj · Jackson Cothren · Khoa Luu | N/A | Code |
| Learning Locally Editable Virtual Humans | Hsuan-I Ho · Lixin Xue · Jie Song · Otmar Hilliges | N/A | Code |
| Open-Vocabulary Point-Cloud Object Detection Without 3D Annotation | Yuheng Lu · Chenfeng Xu · Xiaobao Wei · Xiaodong Xie · Masayoshi Tomizuka · Kurt Keutzer · Shanghang Zhang | N/A | Code |
| Frustratingly Easy Regularization on Representation Can Boost Deep Reinforcement Learning | Qiang He · Huangyuan Su · Jieyu Zhang · Xinwen Hou | N/A | Code |
| PiMAE: Point Cloud and Image Interactive Masked Autoencoders for 3D Object Detection | Anthony Chen · Kevin Zhang · Renrui Zhang · Zihan Wang · Yuheng Lu · Yandong Guo · Shanghang Zhang | N/A | Code |
| OrienterNet: Visual Localization in 2D Public Maps With Neural Matching | Paul-Edouard Sarlin · Daniel DeTone · Tsun-Yi Yang · Armen Avetisyan · Julian Straub · Tomasz Malisiewicz · Samuel Rota Bulò · Richard Newcombe · Peter Kontschieder · Vasileios Balntas | N/A | Code |
| Class Relationship Embedded Learning for Source-Free Unsupervised Domain Adaptation | Yixin Zhang · Zilei Wang · Weinan He | N/A | Code |
| Efficient Movie Scene Detection Using State-Space Transformers | Md Mohaiminul Islam · Mahmudul Hasan · Kishan Shamsundar Athrey · Tony Braskich · Gedas Bertasius | N/A | Code |
| Structural Multiplane Image: Bridging Neural View Synthesis and 3D Reconstruction | Mingfang Zhang · Jinglu Wang · Xiao Li · Yifei Huang · Yoichi Sato · Yan Lu | N/A | Code |
| FAME-ViL: Multi-Tasking Vision-Language Model for Heterogeneous Fashion Tasks | Xiao Han · Xiatian Zhu · Licheng Yu · Li Zhang · Yi-Zhe Song · Tao Xiang | N/A | Code |
| Understanding and Constructing Latent Modality Structures in Multi-Modal Representation Learning | Qian Jiang · Changyou Chen · Han Zhao · Liqun Chen · Qing Ping · Son Dinh Tran · Yi Xu · Belinda Zeng · Trishul Chilimbi | N/A | Code |
| Level-S$^2$fM: Structure From Motion on Neural Level Set of Implicit Surfaces | Yuxi Xiao · Nan Xue · Tianfu Wu · Gui-Song Xia | N/A | Code |
| Consistent-Teacher: Towards Reducing Inconsistent Pseudo-Targets in Semi-Supervised Object Detection | Xinjiang Wang · Xingyi Yang · Shilong Zhang · Yijiang Li · Litong Feng · Shijie Fang · Chengqi Lyu · Kai Chen · Wayne Zhang | N/A | Code |
| Dense Distinct Query for End-to-End Object Detection | Shilong Zhang · Xinjiang Wang · Jiaqi Wang · Jiangmiao Pang · Chengqi Lyu · Wenwei Zhang · Ping Luo · Kai Chen | N/A | Code |
| ARCTIC: A Dataset for Dexterous Bimanual Hand-Object Manipulation | Zicong Fan · Omid Taheri · Dimitrios Tzionas · Muhammed Kocabas · Manuel Kaufmann · Michael J. Black · Otmar Hilliges | N/A | Code |
| BiFormer: Vision Transformer With Bi-Level Routing Attention | Lei Zhu · Xinjiang Wang · Zhanghan Ke · Wayne Zhang · Rynson W.H. Lau | N/A | Code |
| Hierarchical Video-Moment Retrieval and Step-Captioning | Abhay Zala · Jaemin Cho · Satwik Kottur · Xilun Chen · Barlas Oguz · Yashar Mehdad · Mohit Bansal | N/A | Code |
| Progressive Open Space Expansion for Open-Set Model Attribution | Tianyun Yang · Danding Wang · Fan Tang · Xinying Zhao · Juan Cao · Sheng Tang | N/A | Code |
| Deep Depth Estimation From Thermal Image | Ukcheol Shin · Jinsun Park · In So Kweon | N/A | Code |
| Incremental 3D Semantic Scene Graph Prediction From RGB Sequences | Shun-Cheng Wu · Keisuke Tateno · Nassir Navab · Federico Tombari | N/A | Code |
| Visual Programming: Compositional Visual Reasoning Without Training | Tanmay Gupta · Aniruddha Kembhavi | N/A | Code |
| Change-Aware Sampling and Contrastive Learning for Satellite Images | Utkarsh Mall · Bharath Hariharan · Kavita Bala | N/A | Code |
| NULL-Text Inversion for Editing Real Images Using Guided Diffusion Models | Ron Mokady · Amir Hertz · Kfir Aberman · Yael Pritch · Daniel Cohen-Or | N/A | Code |
| RIDCP: Revitalizing Real Image Dehazing via High-Quality Codebook Priors | Rui-Qi Wu · Zheng-Peng Duan · Chun-Le Guo · Zhi Chai · Chongyi Li | N/A | Code |
| Neural Part Priors: Learning To Optimize Part-Based Object Completion in RGB-D Scans | Aleksei Bokhovkin · Angela Dai | N/A | Code |
| Hierarchical Discriminative Learning Improves Visual Representations of Biomedical Microscopy | Cheng Jiang · Xinhai Hou · Akhil Kondepudi · Asadur Chowdury · Christian W. Freudiger · Daniel A. Orringer · Honglak Lee · Todd C. Hollon | N/A | Code |
| Domain Expansion of Image Generators | Yotam Nitzan · Michaël Gharbi · Richard Zhang · Taesung Park · Jun-Yan Zhu · Daniel Cohen-Or · Eli Shechtman | N/A | Code |
| “Seeing” Electric Network Frequency From Events | Lexuan Xu · Guang Hua · Haijian Zhang · Lei Yu · Ning Qiao | N/A | Code |
| MetaFusion: Infrared and Visible Image Fusion via Meta-Feature Embedding From Object Detection | Wenda Zhao · Shigeng Xie · Fan Zhao · You He · Huchuan Lu | N/A | Code |
| Adaptive Spot-Guided Transformer for Consistent Local Feature Matching | Jiahuan Yu · Jiahao Chang · Jianfeng He · Tianzhu Zhang · Jiyang Yu · Feng Wu | N/A | Code |
| SE-ORNet: Self-Ensembling Orientation-Aware Network for Unsupervised Point Cloud Shape Correspondence | Jiacheng Deng · Chuxin Wang · Jiahao Lu · Jianfeng He · Tianzhu Zhang · Jiyang Yu · Zhe Zhang | N/A | Code |
| Dynamic Coarse-To-Fine Learning for Oriented Tiny Object Detection | Chang Xu · Jian Ding · Jinwang Wang · Wen Yang · Huai Yu · Lei Yu · Gui-Song Xia | N/A | Code |
| Out-of-Distributed Semantic Pruning for Robust Semi-Supervised Learning | Yu Wang · Pengchong Qiao · Chang Liu · Guoli Song · Xiawu Zheng · Jie Chen | N/A | Code |
| Seeing With Sound: Long-Range Acoustic Beamforming for Multimodal Scene Understanding | Praneeth Chakravarthula · Jim Aldon D’Souza · Ethan Tseng · Joe Bartusek · Felix Heide | N/A | Code |
| DNF: Decouple and Feedback Network for Seeing in the Dark | Xin Jin · Ling-Hao Han · Zhen Li · Chun-Le Guo · Zhi Chai · Chongyi Li | N/A | Code |
| CoWs on Pasture: Baselines and Benchmarks for Language-Driven Zero-Shot Object Navigation | Samir Yitzhak Gadre · Mitchell Wortsman · Gabriel Ilharco · Ludwig Schmidt · Shuran Song | N/A | Code |
| NVTC: Nonlinear Vector Transform Coding | Runsen Feng · Zongyu Guo · Weiping Li · Zhibo Chen | N/A | Code |
| Towards Unified Scene Text Spotting Based on Sequence Generation | Taeho Kil · Seonghyeon Kim · Sukmin Seo · Yoonsik Kim · Daehee Kim | N/A | Code |
| Tell Me What Happened: Unifying Text-Guided Video Completion via Multimodal Masked Video Generation | Tsu-Jui Fu · Licheng Yu · Ning Zhang · Cheng-Yang Fu · Jong-Chyi Su · William Yang Wang · Sean Bell | N/A | Code |
| Fuzzy Positive Learning for Semi-Supervised Semantic Segmentation | Pengchong Qiao · Zhidan Wei · Yu Wang · Zhennan Wang · Guoli Song · Fan Xu · Xiangyang Ji · Chang Liu · Jie Chen | N/A | Code |
| Progressively Optimized Local Radiance Fields for Robust View Synthesis | Andréas Meuleman · Yu-Lun Liu · Chen Gao · Jia-Bin Huang · Changil Kim · Min H. Kim · Johannes Kopf | N/A | Code |
| Neural Map Prior for Autonomous Driving | Xuan Xiong · Yicheng Liu · Tianyuan Yuan · Yue Wang · Yilun Wang · Hang Zhao | N/A | Code |
| Efficient and Explicit Modelling of Image Hierarchies for Image Restoration | Yawei Li · Yuchen Fan · Xiaoyu Xiang · Denis Demandolx · Rakesh Ranjan · Radu Timofte · Luc Van Gool | N/A | Code |
| F2-NeRF: Fast Neural Radiance Field Training With Free Camera Trajectories | Peng Wang · Yuan Liu · Zhaoxi Chen · Lingjie Liu · Ziwei Liu · Taku Komura · Christian Theobalt · Wenping Wang | N/A | Code |
| Procedure-Aware Pretraining for Instructional Video Understanding | Honglu Zhou · Roberto Martín-Martín · Mubbasir Kapadia · Silvio Savarese · Juan Carlos Niebles | N/A | Code |
| High-Fidelity Guided Image Synthesis With Latent Diffusion Models | Jaskirat Singh · Stephen Gould · Liang Zheng | N/A | Code |
| Progressive Random Convolutions for Single Domain Generalization | Seokeon Choi · Debasmit Das · Sungha Choi · Seunghan Yang · Hyunsin Park · Sungrack Yun | N/A | Code |
| EcoTTA: Memory-Efficient Continual Test-Time Adaptation via Self-Distilled Regularization | Junha Song · Jungsoo Lee · In So Kweon · Sungha Choi | N/A | Code |
| NoPe-NeRF: Optimising Neural Radiance Field With No Pose Prior | Wenjing Bian · Zirui Wang · Kejie Li · Jia-Wang Bian · Victor Adrian Prisacariu | N/A | Code |
| GrowSP: Unsupervised Semantic Segmentation of 3D Point Clouds | Zihui Zhang · Bo Yang · Bing Wang · Bo Li | N/A | Code |
| Multi-Mode Online Knowledge Distillation for Self-Supervised Visual Representation Learning | Kaiyou Song · Jin Xie · Shan Zhang · Zimeng Luo | N/A | Code |
| Aligning Step-by-Step Instructional Diagrams to Video Demonstrations | Jiahao Zhang · Anoop Cherian · Yanbin Liu · Yizhak Ben-Shabat · Cristian Rodriguez · Stephen Gould | N/A | Code |
| ESLAM: Efficient Dense SLAM System Based on Hybrid Representation of Signed Distance Fields | Mohammad Mahdi Johari · Camilla Carta · François Fleuret | N/A | Code |
| AutoRecon: Automated 3D Object Discovery and Reconstruction | Yuang Wang · Xingyi He · Sida Peng · Haotong Lin · Hujun Bao · Xiaowei Zhou | N/A | Code |
| Ultra-High Resolution Segmentation With Ultra-Rich Context: A Novel Benchmark | Deyi Ji · Feng Zhao · Hongtao Lu · Mingyuan Tao · Jieping Ye | N/A | Code |
| NeUDF: Leaning Neural Unsigned Distance Fields With Volume Rendering | Yu-Tao Liu · Li Wang · Jie Yang · Weikai Chen · Xiaoxu Meng · Bo Yang · Lin Gao | N/A | Code |
| Improving Cross-Modal Retrieval With Set of Diverse Embeddings | Dongwon Kim · Namyup Kim · Suha Kwak | N/A | Code |
| An Image Quality Assessment Dataset for Portraits | Nicolas Chahine · Stefania Calarasanu · Davide Garcia-Civiero · Théo Cayla · Sira Ferradans · Jean Ponce | N/A | Code |
| Weakly Supervised Semantic Segmentation via Adversarial Learning of Classifier and Reconstructor | Hyeokjun Kweon · Sung-Hoon Yoon · Kuk-Jin Yoon | N/A | Code |
| NeRFLix: High-Quality Neural View Synthesis by Learning a Degradation-Driven Inter-Viewpoint MiXer | Kun Zhou · Wenbo Li · Yi Wang · Tao Hu · Nianjuan Jiang · Xiaoguang Han · Jiangbo Lu | N/A | Code |
| ShapeTalk: A Language Dataset and Framework for 3D Shape Edits and Deformations | Panos Achlioptas · Ian Huang · Minhyuk Sung · Sergey Tulyakov · Leonidas Guibas | N/A | Code |
| RelightableHands: Efficient Neural Relighting of Articulated Hand Models | Shun Iwase · Shunsuke Saito · Tomas Simon · Stephen Lombardi · Timur Bagautdinov · Rohan Joshi · Fabian Prada · Takaaki Shiratori · Yaser Sheikh · Jason Saragih | N/A | Code |
| VL-SAT: Visual-Linguistic Semantics Assisted Training for 3D Semantic Scene Graph Prediction in Point Cloud | Ziqin Wang · Bowen Cheng · Lichen Zhao · Dong Xu · Yang Tang · Lu Sheng | N/A | Code |
| MVImgNet: A Large-Scale Dataset of Multi-View Images | Xianggang Yu · Mutian Xu · Yidan Zhang · Haolin Liu · Chongjie Ye · Yushuang Wu · Zizheng Yan · Chenming Zhu · Zhangyang Xiong · Tianyou Liang · Guanying Chen · Shuguang Cui · Xiaoguang Han | N/A | Code |
| MM-3DScene: 3D Scene Understanding by Customizing Masked Modeling With Informative-Preserved Reconstruction and Self-Distilled Consistency | Mingye Xu · Mutian Xu · Tong He · Wanli Ouyang · Yali Wang · Xiaoguang Han · Yu Qiao | N/A | Code |
| Self-Guided Diffusion Models | Vincent Tao Hu · David W. Zhang · Yuki M. Asano · Gertjan J. Burghouts · Cees G. M. Snoek | N/A | Code |
| REC-MV: REconstructing 3D Dynamic Cloth From Monocular Videos | Lingteng Qiu · Guanying Chen · Jiapeng Zhou · Mutian Xu · Junle Wang · Xiaoguang Han | N/A | Code |
| OneFormer: One Transformer To Rule Universal Image Segmentation | Jitesh Jain · Jiachen Li · Mang Tik Chiu · Ali Hassani · Nikita Orlov · Humphrey Shi | N/A | Code |
| Mask-Free OVIS: Open-Vocabulary Instance Segmentation Without Manual Mask Annotations | Vibashan VS · Ning Yu · Chen Xing · Can Qin · Mingfei Gao · Juan Carlos Niebles · Vishal M. Patel · Ran Xu | N/A | Code |
| Multiclass Confidence and Localization Calibration for Object Detection | Bimsara Pathiraja · Malitha Gunawardhana · Muhammad Haris Khan | N/A | Code |
| Structured Kernel Estimation for Photon-Limited Deconvolution | Yash Sanghvi · Zhiyuan Mao · Stanley H. Chan | N/A | Code |
| CLIPPO: Image-and-Language Understanding From Pixels Only | Michael Tschannen · Basil Mustafa · Neil Houlsby | N/A | Code |
| Actionlet-Dependent Contrastive Learning for Unsupervised Skeleton-Based Action Recognition | Lilang Lin · Jiahang Zhang · Jiaying Liu | N/A | Code |
| Role of Transients in Two-Bounce Non-Line-of-Sight Imaging | Siddharth Somasundaram · Akshat Dave · Connor Henley · Ashok Veeraraghavan · Ramesh Raskar | N/A | Code |
| Shape-Aware Text-Driven Layered Video Editing | Yao-Chih Lee · Ji-Ze Genevieve Jang · Yi-Ting Chen · Elizabeth Qiu · Jia-Bin Huang | N/A | Code |
| FlexiViT: One Model for All Patch Sizes | Lucas Beyer · Pavel Izmailov · Alexander Kolesnikov · Mathilde Caron · Simon Kornblith · Xiaohua Zhai · Matthias Minderer · Michael Tschannen · Ibrahim Alabdulmohsin · Filip Pavetic | N/A | Code |
| Turning Strengths Into Weaknesses: A Certified Robustness Inspired Attack Framework Against Graph Neural Networks | Binghui Wang · Meng Pang · Yun Dong | N/A | Code |
| HairStep: Transfer Synthetic to Real Using Strand and Depth Maps for Single-View 3D Hair Modeling | Yujian Zheng · Zirong Jin · Moran Li · Haibin Huang · Chongyang Ma · Shuguang Cui · Xiaoguang Han | N/A | Code |
| RONO: Robust Discriminative Learning With Noisy Labels for 2D-3D Cross-Modal Retrieval | Yanglin Feng · Hongyuan Zhu · Dezhong Peng · Xi Peng · Peng Hu | N/A | Code |
| Learning Federated Visual Prompt in Null Space for MRI Reconstruction | Chun-Mei Feng · Bangjun Li · Xinxing Xu · Yong Liu · Huazhu Fu · Wangmeng Zuo | N/A | Code |
| VGFlow: Visibility Guided Flow Network for Human Reposing | Rishabh Jain · Krishna Kumar Singh · Mayur Hemani · Jingwan Lu · Mausoom Sarkar · Duygu Ceylan · Balaji Krishnamurthy | N/A | Code |
| Learning Attention As Disentangler for Compositional Zero-Shot Learning | Shaozhe Hao · Kai Han · Kwan-Yee K. Wong | N/A | Code |
| PET-NeuS: Positional Encoding Tri-Planes for Neural Surfaces | Yiqun Wang · Ivan Skorokhodov · Peter Wonka | N/A | Code |
| Perception-Oriented Single Image Super-Resolution Using Optimal Objective Estimation | Seung Ho Park · Young Su Moon · Nam Ik Cho | N/A | Code |
| Learning To Exploit Temporal Structure for Biomedical Vision–Language Processing | Shruthi Bannur · Stephanie Hyland · Qianchu Liu · Fernando Pérez-García · Maximilian Ilse · Daniel C. Castro · Benedikt Boecking · Harshita Sharma · Kenza Bouzid · Anja Thieme · Anton Schwaighofer · Maria Wetscherek · Matthew P. Lungren · Aditya Nori · Javier Alvarez-Valle · Ozan Oktay | N/A | Code |
| TRACE: 5D Temporal Regression of Avatars With Dynamic Cameras in 3D Environments | Yu Sun · Qian Bao · Wu Liu · Tao Mei · Michael J. Black | N/A | Code |
| Neumann Network With Recursive Kernels for Single Image Defocus Deblurring | Yuhui Quan · Zicong Wu · Hui Ji | N/A | Code |
| Guiding Pseudo-Labels With Uncertainty Estimation for Source-Free Unsupervised Domain Adaptation | Mattia Litrico · Alessio Del Bue · Pietro Morerio | N/A | Code |
| PlaneDepth: Self-Supervised Depth Estimation via Orthogonal Planes | Ruoyu Wang · Zehao Yu · Shenghua Gao | N/A | Code |
| Castling-ViT: Compressing Self-Attention via Switching Towards Linear-Angular Attention at Vision Transformer Inference | Haoran You · Yunyang Xiong · Xiaoliang Dai · Bichen Wu · Peizhao Zhang · Haoqi Fan · Peter Vajda · Yingyan (Celine) Lin | N/A | Code |
| Attention-Based Point Cloud Edge Sampling | Chengzhi Wu · Junwei Zheng · Julius Pfrommer · Jürgen Beyerer | N/A | Code |
| Structured 3D Features for Reconstructing Controllable Avatars | Enric Corona · Mihai Zanfir · Thiemo Alldieck · Eduard Gabriel Bazavan · Andrei Zanfir · Cristian Sminchisescu | N/A | Code |
| Zero-Shot Referring Image Segmentation With Global-Local Context Features | Seonghoon Yu · Paul Hongsuck Seo · Jeany Son | N/A | Code |
| CASP-Net: Rethinking Video Saliency Prediction From an Audio-Visual Consistency Perceptual Perspective | Junwen Xiong · Ganglai Wang · Peng Zhang · Wei Huang · Yufei Zha · Guangtao Zhai | N/A | Code |
| Context-Aware Relative Object Queries To Unify Video Instance and Panoptic Segmentation | Anwesa Choudhuri · Girish Chowdhary · Alexander G. Schwing | N/A | Code |
| Canonical Fields: Self-Supervised Learning of Pose-Canonicalized Neural Fields | Rohith Agaram · Shaurya Dewan · Rahul Sajnani · Adrien Poulenard · Madhava Krishna · Srinath Sridhar | N/A | Code |
| Decoupled Multimodal Distilling for Emotion Recognition | Yong Li · Yuanzhi Wang · Zhen Cui | N/A | Code |
| TensoIR: Tensorial Inverse Rendering | Haian Jin · Isabella Liu · Peijia Xu · Xiaoshuai Zhang · Songfang Han · Sai Bi · Xiaowei Zhou · Zexiang Xu · Hao Su | N/A | Code |
| Zero-Shot Generative Model Adaptation via Image-Specific Prompt Learning | Jiayi Guo · Chaofei Wang · You Wu · Eric Zhang · Kai Wang · Xingqian Xu · Shiji Song · Humphrey Shi · Gao Huang | N/A | Code |
| DiGeo: Discriminative Geometry-Aware Learning for Generalized Few-Shot Object Detection | Jiawei Ma · Yulei Niu · Jincheng Xu · Shiyuan Huang · Guangxing Han · Shih-Fu Chang | N/A | Code |
| Unbalanced Optimal Transport: A Unified Framework for Object Detection | Henri De Plaen · Pierre-François De Plaen · Johan A. K. Suykens · Marc Proesmans · Tinne Tuytelaars · Luc Van Gool | N/A | Code |
| NeRFInvertor: High Fidelity NeRF-GAN Inversion for Single-Shot Real Image Animation | Yu Yin · Kamran Ghasedi · HsiangTao Wu · Jiaolong Yang · Xin Tong · Yun Fu | N/A | Code |
| Masked Image Training for Generalizable Deep Image Denoising | Haoyu Chen · Jinjin Gu · Yihao Liu · Salma Abdel Magid · Chao Dong · Qiong Wang · Hanspeter Pfister · Lei Zhu | N/A | Code |
| Toward Verifiable and Reproducible Human Evaluation for Text-to-Image Generation | Mayu Otani · Riku Togashi · Yu Sawai · Ryosuke Ishigami · Yuta Nakashima · Esa Rahtu · Janne Heikkilä · Shin’ichi Satoh | N/A | Code |
| Towards Flexible Multi-Modal Document Models | Naoto Inoue · Kotaro Kikuchi · Edgar Simo-Serra · Mayu Otani · Kota Yamaguchi | N/A | Code |
| Zero-Shot Everything Sketch-Based Image Retrieval, and in Explainable Style | Fengyin Lin · Mingkang Li · Da Li · Timothy Hospedales · Yi-Zhe Song · Yonggang Qi | N/A | Code |
| LidarGait: Benchmarking 3D Gait Recognition With Point Clouds | Chuanfu Shen · Chao Fan · Wei Wu · Rui Wang · George Q. Huang · Shiqi Yu | N/A | Code |
| OpenGait: Revisiting Gait Recognition Towards Better Practicality | Chao Fan · Junhao Liang · Chuanfu Shen · Saihui Hou · Yongzhen Huang · Shiqi Yu | N/A | Code |
| Towards Unsupervised Object Detection From LiDAR Point Clouds | Lunjun Zhang · Anqi Joyce Yang · Yuwen Xiong · Sergio Casas · Bin Yang · Mengye Ren · Raquel Urtasun | N/A | Code |
| Visual Language Pretrained Multiple Instance Zero-Shot Transfer for Histopathology Images | Ming Y. Lu · Bowen Chen · Andrew Zhang · Drew F. K. Williamson · Richard J. Chen · Tong Ding · Long Phi Le · Yung-Sung Chuang · Faisal Mahmood | N/A | Code |
| DivClust: Controlling Diversity in Deep Clustering | Ioannis Maniadis Metaxas · Georgios Tzimiropoulos · Ioannis Patras | N/A | Code |
| AttriCLIP: A Non-Incremental Learner for Incremental Knowledge Learning | Runqi Wang · Xiaoyue Duan · Guoliang Kang · Jianzhuang Liu · Shaohui Lin · Songcen Xu · Jinhu Lü · Baochang Zhang | N/A | Code |
| Unsupervised Continual Semantic Adaptation Through Neural Rendering | Zhizheng Liu · Francesco Milano · Jonas Frey · Roland Siegwart · Hermann Blum · Cesar Cadena | N/A | Code |
| Semi-Supervised Parametric Real-World Image Harmonization | Ke Wang · Michaël Gharbi · He Zhang · Zhihao Xia · Eli Shechtman | N/A | Code |
| EqMotion: Equivariant Multi-Agent Motion Prediction With Invariant Interaction Reasoning | Chenxin Xu · Robby T. Tan · Yuhong Tan · Siheng Chen · Yu Guang Wang · Xinchao Wang · Yanfeng Wang | N/A | Code |
| BUOL: A Bottom-Up Framework With Occupancy-Aware Lifting for Panoptic 3D Scene Reconstruction From a Single Image | Tao Chu · Pan Zhang · Qiong Liu · Jiaqi Wang | N/A | Code |
| Lite-Mono: A Lightweight CNN and Transformer Architecture for Self-Supervised Monocular Depth Estimation | Ning Zhang · Francesco Nex · George Vosselman · Norman Kerle | N/A | Code |
| Novel-View Acoustic Synthesis | Changan Chen · Alexander Richard · Roman Shapovalov · Vamsi Krishna Ithapu · Natalia Neverova · Kristen Grauman · Andrea Vedaldi | N/A | Code |
| Audio-Visual Grouping Network for Sound Localization From Mixtures | Shentong Mo · Yapeng Tian | N/A | Code |
| Chat2Map: Efficient Scene Mapping From Multi-Ego Conversations | Sagnik Majumder · Hao Jiang · Pierre Moulon · Ethan Henderson · Paul Calamia · Kristen Grauman · Vamsi Krishna Ithapu | N/A | Code |
| ConvNeXt V2: Co-Designing and Scaling ConvNets With Masked Autoencoders | Sanghyun Woo · Shoubhik Debnath · Ronghang Hu · Xinlei Chen · Zhuang Liu · In So Kweon · Saining Xie | N/A | Code |
| Collaboration Helps Camera Overtake LiDAR in 3D Detection | Yue Hu · Yifan Lu · Runsheng Xu · Weidi Xie · Siheng Chen · Yanfeng Wang | N/A | Code |
| Few-Shot Learning With Visual Distribution Calibration and Cross-Modal Distribution Alignment | Runqi Wang · Hao Zheng · Xiaoyue Duan · Jianzhuang Liu · Yuning Lu · Tian Wang · Songcen Xu · Baochang Zhang | N/A | Code |
| MetaCLUE: Towards Comprehensive Visual Metaphors Research | Arjun R. Akula · Brendan Driscoll · Pradyumna Narayana · Soravit Changpinyo · Zhiwei Jia · Suyash Damle · Garima Pruthi · Sugato Basu · Leonidas Guibas · William Freeman · Yuanzhen Li · Varun Jampani | N/A | Code |
| Deep Fair Clustering via Maximizing and Minimizing Mutual Information: Theory, Algorithm and Metric | Pengxin Zeng · Yunfan Li · Peng Hu · Dezhong Peng · Jiancheng Lv · Xi Peng | N/A | Code |
| Visual Atoms: Pre-Training Vision Transformers With Sinusoidal Waves | Sora Takashima · Ryo Hayamizu · Nakamasa Inoue · Hirokatsu Kataoka · Rio Yokota | N/A | Code |
| 3D-Aware Multi-Class Image-to-Image Translation With NeRFs | Senmao Li · Joost van de Weijer · Yaxing Wang · Fahad Shahbaz Khan · Meiqin Liu · Jian Yang | N/A | Code |
| E2PN: Efficient SE(3)-Equivariant Point Network | Minghan Zhu · Maani Ghaffari · William A. Clark · Huei Peng | N/A | Code |
| PixHt-Lab: Pixel Height Based Light Effect Generation for Image Compositing | Yichen Sheng · Jianming Zhang · Julien Philip · Yannick Hold-Geoffroy · Xin Sun · He Zhang · Lu Ling · Bedrich Benes | N/A | Code |
| UniSim: A Neural Closed-Loop Sensor Simulator | Ze Yang · Yun Chen · Jingkang Wang · Sivabalan Manivasagam · Wei-Chiu Ma · Anqi Joyce Yang · Raquel Urtasun | N/A | Code |
| Occlusion-Free Scene Recovery via Neural Radiance Fields | Chengxuan Zhu · Renjie Wan · Yunkai Tang · Boxin Shi | N/A | Code |
| SPIn-NeRF: Multiview Segmentation and Perceptual Inpainting With Neural Radiance Fields | Ashkan Mirzaei · Tristan Aumentado-Armstrong · Kosta Derpanis · Jonathan Kelly · Marcus A. Brubaker · Igor Gilitschenski · Alex Levinshtein | N/A | Code |
| Class-Incremental Exemplar Compression for Class-Incremental Learning | Zilin Luo · Yaoyao Liu · Bernt Schiele · Qianru Sun | N/A | Code |
| DETRs With Hybrid Matching | Ding Jia · Yuhui Yuan · Haodi He · Xiaopei Wu · Haojun Yu · Weihong Lin · Lei Sun · Chao Zhang · Han Hu | N/A | Code |
| 3D Human Mesh Estimation From Virtual Markers | Xiaoxuan Ma · Jiajun Su · Chunyu Wang · Wentao Zhu · Yizhou Wang | N/A | Code |
| Objaverse: A Universe of Annotated 3D Objects | Matt Deitke · Dustin Schwenk · Jordi Salvador · Luca Weihs · Oscar Michel · Eli VanderBilt · Ludwig Schmidt · Kiana Ehsani · Aniruddha Kembhavi · Ali Farhadi | N/A | Code |
| Adjustment and Alignment for Unbiased Open Set Domain Adaptation | Wuyang Li · Jie Liu · Bo Han · Yixuan Yuan | N/A | Code |
| TimeBalance: Temporally-Invariant and Temporally-Distinctive Video Representations for Semi-Supervised Action Recognition | Ishan Rajendrakumar Dave · Mamshad Nayeem Rizve · Chen Chen · Mubarak Shah | N/A | Code |
| EfficientSCI: Densely Connected Network With Space-Time Factorization for Large-Scale Video Snapshot Compressive Imaging | Lishun Wang · Miao Cao · Xin Yuan | N/A | Code |
| Continual Detection Transformer for Incremental Object Detection | Yaoyao Liu · Bernt Schiele · Andrea Vedaldi · Christian Rupprecht | N/A | Code |
| Hierarchical Prompt Learning for Multi-Task Learning | Yajing Liu · Yuning Lu · Hao Liu · Yaozu An · Zhuoran Xu · Zhuokun Yao · Baofeng Zhang · Zhiwei Xiong · Chenguang Gui | N/A | Code |
| Boost Vision Transformer With GPU-Friendly Sparsity and Quantization | Chong Yu · Tao Chen · Zhongxue Gan · Jiayuan Fan | N/A | Code |
| Demystifying Causal Features on Adversarial Examples and Causal Inoculation for Robust Network by Adversarial Instrumental Variable Regression | Junho Kim · Byung-Kwan Lee · Yong Man Ro | N/A | Code |
| Regularizing Second-Order Influences for Continual Learning | Zhicheng Sun · Yadong Mu · Gang Hua | N/A | Code |
| Heterogeneous Continual Learning | Divyam Madaan · Hongxu Yin · Wonmin Byeon · Jan Kautz · Pavlo Molchanov | N/A | Code |
| DP-NeRF: Deblurred Neural Radiance Field With Physical Scene Priors | Dogyoon Lee · Minhyeok Lee · Chajin Shin · Sangyoun Lee | N/A | Code |
| 3D-POP – An Automated Annotation Approach to Facilitate Markerless 2D-3D Tracking of Freely Moving Birds With Marker-Based Motion Capture | Hemal Naik · Alex Hoi Hang Chan · Junran Yang · Mathilde Delacoux · Iain D. Couzin · Fumihiro Kano · Máté Nagy | N/A | Code |
| Recognizing Rigid Patterns of Unlabeled Point Clouds by Complete and Continuous Isometry Invariants With No False Negatives and No False Positives | Daniel Widdowson · Vitaliy Kurlin | N/A | Code |
| Robust Model-Based Face Reconstruction Through Weakly-Supervised Outlier Segmentation | Chunlu Li · Andreas Morel-Forster · Thomas Vetter · Bernhard Egger · Adam Kortylewski | N/A | Code |
| Implicit Identity Leakage: The Stumbling Block to Improving Deepfake Detection Generalization | Shichao Dong · Jin Wang · Renhe Ji · Jiajun Liang · Haoqiang Fan · Zheng Ge | N/A | Code |
| PoseExaminer: Automated Testing of Out-of-Distribution Robustness in Human Pose and Shape Estimation | Qihao Liu · Adam Kortylewski · Alan L. Yuille | N/A | Code |
| VoxelNeXt: Fully Sparse VoxelNet for 3D Object Detection and Tracking | Yukang Chen · Jianhui Liu · Xiangyu Zhang · Xiaojuan Qi · Jiaya Jia | N/A | Code |
| 1000 FPS HDR Video With a Spike-RGB Hybrid Camera | Yakun Chang · Chu Zhou · Yuchen Hong · Liwen Hu · Chao Xu · Tiejun Huang · Boxin Shi | N/A | Code |
| How To Backdoor Diffusion Models? | Sheng-Yen Chou · Pin-Yu Chen · Tsung-Yi Ho | N/A | Code |
| PIP-Net: Patch-Based Intuitive Prototypes for Interpretable Image Classification | Meike Nauta · Jörg Schlötterer · Maurice van Keulen · Christin Seifert | N/A | Code |
| Joint Token Pruning and Squeezing Towards More Aggressive Compression of Vision Transformers | Siyuan Wei · Tianzhu Ye · Shen Zhang · Yao Tang · Jiajun Liang | N/A | Code |
| Energy-Efficient Adaptive 3D Sensing | Brevin Tilmon · Zhanghao Sun · Sanjeev J. Koppal · Yicheng Wu · Georgios Evangelidis · Ramzi Zahreddine · Gurunandan Krishnan · Sizhuo Ma · Jian Wang | N/A | Code |
| Boosting Semi-Supervised Learning by Exploiting All Unlabeled Data | Yuhao Chen · Xin Tan · Borui Zhao · Zhaowei Chen · Renjie Song · Jiajun Liang · Xuequan Lu | N/A | Code |
| Fix the Noise: Disentangling Source Feature for Controllable Domain Translation | Dongyeun Lee · Jae Young Lee · Doyeon Kim · Jaehyun Choi · Jaejun Yoo · Junmo Kim | N/A | Code |
| Learning Transferable Spatiotemporal Representations From Natural Script Knowledge | Ziyun Zeng · Yuying Ge · Xihui Liu · Bin Chen · Ping Luo · Shu-Tao Xia · Yixiao Ge | N/A | Code |
| Side Adapter Network for Open-Vocabulary Semantic Segmentation | Mengde Xu · Zheng Zhang · Fangyun Wei · Han Hu · Xiang Bai | N/A | Code |
| A Strong Baseline for Generalized Few-Shot Semantic Segmentation | Sina Hajimiri · Malik Boudiaf · Ismail Ben Ayed · Jose Dolz | N/A | Code |
| Towards Compositional Adversarial Robustness: Generalizing Adversarial Training to Composite Semantic Perturbations | Lei Hsiung · Yun-Yun Tsai · Pin-Yu Chen · Tsung-Yi Ho | N/A | Code |
| Normalizing Flow Based Feature Synthesis for Outlier-Aware Object Detection | Nishant Kumar · Siniša Šegvić · Abouzar Eslami · Stefan Gumhold | N/A | Code |
| AVFace: Towards Detailed Audio-Visual 4D Face Reconstruction | Aggelina Chatziagapi · Dimitris Samaras | N/A | Code |
| Learning Semantic Relationship Among Instances for Image-Text Matching | Zheren Fu · Zhendong Mao · Yan Song · Yongdong Zhang | N/A | Code |
| Understanding Imbalanced Semantic Segmentation Through Neural Collapse | Zhisheng Zhong · Jiequan Cui · Yibo Yang · Xiaoyang Wu · Xiaojuan Qi · Xiangyu Zhang · Jiaya Jia | N/A | Code |
| SCADE: NeRFs from Space Carving With Ambiguity-Aware Depth Estimates | Mikaela Angelina Uy · Ricardo Martin-Brualla · Leonidas Guibas · Ke Li | N/A | Code |
| MonoHuman: Animatable Human Neural Field From Monocular Video | Zhengming Yu · Wei Cheng · Xian Liu · Wayne Wu · Kwan-Yee Lin | N/A | Code |
| Affection: Learning Affective Explanations for Real-World Visual Data | Panos Achlioptas · Maks Ovsjanikov · Leonidas Guibas · Sergey Tulyakov | N/A | Code |
| Sharpness-Aware Gradient Matching for Domain Generalization | Pengfei Wang · Zhaoxiang Zhang · Zhen Lei · Lei Zhang | N/A | Code |
| Generalized Decoding for Pixel, Image, and Language | Xueyan Zou · Zi-Yi Dou · Jianwei Yang · Zhe Gan · Linjie Li · Chunyuan Li · Xiyang Dai · Harkirat Behl · Jianfeng Wang · Lu Yuan · Nanyun Peng · Lijuan Wang · Yong Jae Lee · Jianfeng Gao | N/A | Code |
| How You Feelin’? Learning Emotions and Mental States in Movie Scenes | Dhruv Srivastava · Aditya Kumar Singh · Makarand Tapaswi | N/A | Code |
| Improving Visual Representation Learning Through Perceptual Understanding | Samyakh Tukra · Frederick Hoffman · Ken Chatfield | N/A | Code |
| PlenVDB: Memory Efficient VDB-Based Radiance Fields for Fast Training and Rendering | Han Yan · Celong Liu · Chao Ma · Xing Mei | N/A | Code |
| HaLP: Hallucinating Latent Positives for Skeleton-Based Self-Supervised Learning of Actions | Anshul Shah · Aniket Roy · Ketul Shah · Shlok Mishra · David Jacobs · Anoop Cherian · Rama Chellappa | N/A | Code |
| FeatureBooster: Boosting Feature Descriptors With a Lightweight Neural Network | Xinjiang Wang · Zeyu Liu · Yu Hu · Wei Xi · Wenxian Yu · Danping Zou | N/A | Code |
| ACL-SPC: Adaptive Closed-Loop System for Self-Supervised Point Cloud Completion | Sangmin Hong · Mohsen Yavartanoo · Reyhaneh Neshatavar · Kyoung Mu Lee | N/A | Code |
| NeRF in the Palm of Your Hand: Corrective Augmentation for Robotics via Novel-View Synthesis | Allan Zhou · Moo Jin Kim · Lirui Wang · Pete Florence · Chelsea Finn | N/A | Code |
| Query-Centric Trajectory Prediction | Zikang Zhou · Jianping Wang · Yung-Hui Li · Yu-Kai Huang | N/A | Code |
| EDA: Explicit Text-Decoupling and Dense Alignment for 3D Visual Grounding | Yanmin Wu · Xinhua Cheng · Renrui Zhang · Zesen Cheng · Jian Zhang | N/A | Code |
| Sliced Optimal Partial Transport | Yikun Bai · Bernhard Schmitzer · Matthew Thorpe · Soheil Kolouri | N/A | Code |
| PersonNeRF: Personalized Reconstruction From Photo Collections | Chung-Yi Weng · Pratul P. Srinivasan · Brian Curless · Ira Kemelmacher-Shlizerman | N/A | Code |
| Feature Shrinkage Pyramid for Camouflaged Object Detection With Transformers | Zhou Huang · Hang Dai · Tian-Zhu Xiang · Shuo Wang · Huai-Xin Chen · Jie Qin · Huan Xiong | N/A | Code |
| HOLODIFFUSION: Training a 3D Diffusion Model Using 2D Images | Animesh Karnewar · Andrea Vedaldi · David Novotny · Niloy J. Mitra | N/A | Code |
| Towards Efficient Use of Multi-Scale Features in Transformer-Based Object Detectors | Gongjie Zhang · Zhipeng Luo · Zichen Tian · Jingyi Zhang · Xiaoqin Zhang · Shijian Lu | N/A | Code |
| Interventional Bag Multi-Instance Learning on Whole-Slide Pathological Images | Tiancheng Lin · Zhimiao Yu · Hongyu Hu · Yi Xu · Chang-Wen Chen | N/A | Code |
| Meta-Explore: Exploratory Hierarchical Vision-and-Language Navigation Using Scene Object Spectrum Grounding | Minyoung Hwang · Jaeyeon Jeong · Minsoo Kim · Yoonseon Oh · Songhwai Oh | N/A | Code |
| Sketch2Saliency: Learning To Detect Salient Objects From Human Drawings | Ayan Kumar Bhunia · Subhadeep Koley · Amandeep Kumar · Aneeshan Sain · Pinaki Nath Chowdhury · Tao Xiang · Yi-Zhe Song | N/A | Code |
| Picture That Sketch: Photorealistic Image Generation From Abstract Sketches | Subhadeep Koley · Ayan Kumar Bhunia · Aneeshan Sain · Pinaki Nath Chowdhury · Tao Xiang · Yi-Zhe Song | N/A | Code |
| CLIP for All Things Zero-Shot Sketch-Based Image Retrieval, Fine-Grained or Not | Aneeshan Sain · Ayan Kumar Bhunia · Pinaki Nath Chowdhury · Subhadeep Koley · Tao Xiang · Yi-Zhe Song | N/A | Code |
| LANIT: Language-Driven Image-to-Image Translation for Unlabeled Data | Jihye Park · Sunwoo Kim · Soohyun Kim · Seokju Cho · Jaejun Yoo · Youngjung Uh · Seungryong Kim | N/A | Code |
| Depth Estimation From Indoor Panoramas With Neural Scene Representation | Wenjie Chang · Yueyi Zhang · Zhiwei Xiong | N/A | Code |
| What Can Human Sketches Do for Object Detection? | Pinaki Nath Chowdhury · Ayan Kumar Bhunia · Aneeshan Sain · Subhadeep Koley · Tao Xiang · Yi-Zhe Song | N/A | Code |
| SceneTrilogy: On Human Scene-Sketch and Its Complementarity With Photo and Text | Pinaki Nath Chowdhury · Ayan Kumar Bhunia · Aneeshan Sain · Subhadeep Koley · Tao Xiang · Yi-Zhe Song | N/A | Code |
| Markerless Camera-to-Robot Pose Estimation via Self-Supervised Sim-to-Real Transfer | Jingpei Lu · Florian Richter · Michael C. Yip | N/A | Code |
| Fine-Grained Audible Video Description | Xuyang Shen · Dong Li · Jinxing Zhou · Zhen Qin · Bowen He · Xiaodong Han · Aixuan Li · Mochu Xiang · Lingpeng Kong · Meng Wang · Yu Qiao · Yiran Zhong | N/A | Code |
| EfficientViT: Memory Efficient Vision Transformer With Cascaded Group Attention | Xinyu Liu · Houwen Peng · Ningxin Zheng · Yuqing Yang · Han Hu · Yixuan Yuan | N/A | Code |
| Relightable Neural Human Assets From Multi-View Gradient Illuminations | Taotao Zhou · Kai He · Di Wu · Teng Xu · Qixuan Zhang · Kuixiang Shao · Wenzheng Chen · Lan Xu · Jingyi Yu | N/A | Code |
| Music-Driven Group Choreography | Nhat Le · Thang Pham · Tuong Do · Erman Tjiputra · Quang D. Tran · Anh Nguyen | N/A | Code |
| DIP: Dual Incongruity Perceiving Network for Sarcasm Detection | Changsong Wen · Guoli Jia · Jufeng Yang | N/A | Code |
| MagicPony: Learning Articulated 3D Animals in the Wild | Shangzhe Wu · Ruining Li · Tomas Jakab · Christian Rupprecht · Andrea Vedaldi | N/A | Code |
| Preserving Linear Separability in Continual Learning by Backward Feature Projection | Qiao Gu · Dongsub Shim · Florian Shkurti | N/A | Code |
| Improving Fairness in Facial Albedo Estimation via Visual-Textual Cues | Xingyu Ren · Jiankang Deng · Chao Ma · Yichao Yan · Xiaokang Yang | N/A | Code |
| HOICLIP: Efficient Knowledge Transfer for HOI Detection With Vision-Language Models | Shan Ning · Longtian Qiu · Yongfei Liu · Xuming He | N/A | Code |
| Regularization of Polynomial Networks for Image Recognition | Grigorios G. Chrysos · Bohan Wang · Jiankang Deng · Volkan Cevher | N/A | Code |
| Exploiting Unlabelled Photos for Stronger Fine-Grained SBIR | Aneeshan Sain · Ayan Kumar Bhunia · Subhadeep Koley · Pinaki Nath Chowdhury · Soumitri Chattopadhyay · Tao Xiang · Yi-Zhe Song | N/A | Code |
| Learning Semantic-Aware Knowledge Guidance for Low-Light Image Enhancement | Yuhui Wu · Chen Pan · Guoqing Wang · Yang Yang · Jiwei Wei · Chongyi Li · Heng Tao Shen | N/A | Code |
| Block Selection Method for Using Feature Norm in Out-of-Distribution Detection | Yeonguk Yu · Sungho Shin · Seongju Lee · Changhyun Jun · Kyoobin Lee | N/A | Code |
| HouseDiffusion: Vector Floorplan Generation via a Diffusion Model With Discrete and Continuous Denoising | Mohammad Amin Shabani · Sepidehsadat Hosseini · Yasutaka Furukawa | N/A | Code |
| Integral Neural Networks | Kirill Solodskikh · Azim Kurbanov · Ruslan Aydarkhanov · Irina Zhelavskaya · Yury Parfenov · Dehua Song · Stamatios Lefkimmiatis | N/A | Code |
| FitMe: Deep Photorealistic 3D Morphable Model Avatars | Alexandros Lattas · Stylianos Moschoglou · Stylianos Ploumpis · Baris Gecer · Jiankang Deng · Stefanos Zafeiriou | N/A | Code |
| Sound to Visual Scene Generation by Audio-to-Visual Latent Alignment | Kim Sung-Bin · Arda Senocak · Hyunwoo Ha · Andrew Owens · Tae-Hyun Oh | N/A | Code |
| Introducing Competition To Boost the Transferability of Targeted Adversarial Examples Through Clean Feature Mixup | Junyoung Byun · Myung-Joon Kwon · Seungju Cho · Yoonji Kim · Changick Kim | N/A | Code |
| Initialization Noise in Image Gradients and Saliency Maps | Ann-Christin Woerl · Jan Disselhoff · Michael Wand | N/A | Code |
| Two-Shot Video Object Segmentation | Kun Yan · Xiao Li · Fangyun Wei · Jinglu Wang · Chenbin Zhang · Ping Wang · Yan Lu | N/A | Code |
| SCOOP: Self-Supervised Correspondence and Optimization-Based Scene Flow | Itai Lang · Dror Aiger · Forrester Cole · Shai Avidan · Michael Rubinstein | N/A | Code |
| Co-SLAM: Joint Coordinate and Sparse Parametric Encodings for Neural Real-Time SLAM | Hengyi Wang · Jingwen Wang · Lourdes Agapito | N/A | Code |
| Semantic-Promoted Debiasing and Background Disambiguation for Zero-Shot Instance Segmentation | Shuting He · Henghui Ding · Wei Jiang | N/A | Code |
| Rawgment: Noise-Accounted RAW Augmentation Enables Recognition in a Wide Variety of Environments | Masakazu Yoshimura · Junji Otsuka · Atsushi Irie · Takeshi Ohashi | N/A | Code |
| Diffusion-Based Signed Distance Fields for 3D Shape Generation | Jaehyeok Shim · Changwoo Kang · Kyungdon Joo | N/A | Code |
| Handwritten Text Generation From Visual Archetypes | Vittorio Pippi · Silvia Cascianelli · Rita Cucchiara | N/A | Code |
| Novel Class Discovery for 3D Point Cloud Semantic Segmentation | Luigi Riz · Cristiano Saltori · Elisa Ricci · Fabio Poiesi | N/A | Code |
| DeltaEdit: Exploring Text-Free Training for Text-Driven Image Manipulation | Yueming Lyu · Tianwei Lin · Fu Li · Dongliang He · Jing Dong · Tieniu Tan | N/A | Code |
| SkyEye: Self-Supervised Bird’s-Eye-View Semantic Mapping Using Monocular Frontal View Images | Nikhil Gosala · Kürsat Petek · Paulo L. J. Drews-Jr · Wolfram Burgard · Abhinav Valada | N/A | Code |
| Towards Open-World Segmentation of Parts | Tai-Yu Pan · Qing Liu · Wei-Lun Chao · Brian Price | N/A | Code |
| DeepMapping2: Self-Supervised Large-Scale LiDAR Map Optimization | Chao Chen · Xinhao Liu · Yiming Li · Li Ding · Chen Feng | N/A | Code |
| SINE: SINgle Image Editing With Text-to-Image Diffusion Models | Zhixing Zhang · Ligong Han · Arnab Ghosh · Dimitris N. Metaxas · Jian Ren | N/A | Code |
| Discriminative Co-Saliency and Background Mining Transformer for Co-Salient Object Detection | Long Li · Junwei Han · Ni Zhang · Nian Liu · Salman Khan · Hisham Cholakkal · Rao Muhammad Anwer · Fahad Shahbaz Khan | N/A | Code |
| TruFor: Leveraging All-Round Clues for Trustworthy Image Forgery Detection and Localization | Fabrizio Guillaro · Davide Cozzolino · Avneesh Sud · Nicholas Dufour · Luisa Verdoliva | N/A | Code |
| SeSDF: Self-Evolved Signed Distance Field for Implicit 3D Clothed Human Reconstruction | Yukang Cao · Kai Han · Kwan-Yee K. Wong | N/A | Code |
| Hubs and Hyperspheres: Reducing Hubness and Improving Transductive Few-Shot Learning With Hyperspherical Embeddings | Daniel J. Trosten · Rwiddhi Chakraborty · Sigurd Løkse · Kristoffer Knutsen Wickstrøm · Robert Jenssen · Michael C. Kampffmeyer | N/A | Code |
| MAGE: MAsked Generative Encoder To Unify Representation Learning and Image Synthesis | Tianhong Li · Huiwen Chang · Shlok Mishra · Han Zhang · Dina Katabi · Dilip Krishnan | N/A | Code |
| Model Barrier: A Compact Un-Transferable Isolation Domain for Model Intellectual Property Protection | Lianyu Wang · Meng Wang · Daoqiang Zhang · Huazhu Fu | N/A | Code |
| OvarNet: Towards Open-Vocabulary Object Attribute Recognition | Keyan Chen · Xiaolong Jiang · Yao Hu · Xu Tang · Yan Gao · Jianqi Chen · Weidi Xie | N/A | Code |
| GINA-3D: Learning To Generate Implicit Neural Assets in the Wild | Bokui Shen · Xinchen Yan · Charles R. Qi · Mahyar Najibi · Boyang Deng · Leonidas Guibas · Yin Zhou · Dragomir Anguelov | N/A | Code |
| PoseFormerV2: Exploring Frequency Domain for Efficient and Robust 3D Human Pose Estimation | Qitao Zhao · Ce Zheng · Mengyuan Liu · Pichao Wang · Chen Chen | N/A | Code |
| Proposal-Based Multiple Instance Learning for Weakly-Supervised Temporal Action Localization | Huan Ren · Wenfei Yang · Tianzhu Zhang · Yongdong Zhang | N/A | Code |
| Learning Partial Correlation Based Deep Visual Representation for Image Classification | Saimunur Rahman · Piotr Koniusz · Lei Wang · Luping Zhou · Peyman Moghadam · Changming Sun | N/A | Code |
| Multi-Granularity Archaeological Dating of Chinese Bronze Dings Based on a Knowledge-Guided Relation Graph | Rixin Zhou · Jiafu Wei · Qian Zhang · Ruihua Qi · Xi Yang · Chuntao Li | N/A | Code |
| DexArt: Benchmarking Generalizable Dexterous Manipulation With Articulated Objects | Chen Bao · Helin Xu · Yuzhe Qin · Xiaolong Wang | N/A | Code |
| Modeling the Distributional Uncertainty for Salient Object Detection Models | Xinyu Tian · Jing Zhang · Mochu Xiang · Yuchao Dai | N/A | Code |
| Evading Forensic Classifiers With Attribute-Conditioned Adversarial Faces | Fahad Shamshad · Koushik Srivatsan · Karthik Nandakumar | N/A | Code |
| Scene-Aware Egocentric 3D Human Pose Estimation | Jian Wang · Diogo Luvizon · Weipeng Xu · Lingjie Liu · Kripasindhu Sarkar · Christian Theobalt | N/A | Code |
| Camouflaged Instance Segmentation via Explicit De-Camouflaging | Naisong Luo · Yuwen Pan · Rui Sun · Tianzhu Zhang · Zhiwei Xiong · Feng Wu | N/A | Code |
| N-Gram in Swin Transformers for Efficient Lightweight Image Super-Resolution | Haram Choi · Jeongmin Lee · Jihoon Yang | N/A | Code |
| Diffusion Video Autoencoders: Toward Temporally Consistent Face Video Editing via Disentangled Video Encoding | Gyeongman Kim · Hajin Shim · Hyunsu Kim · Yunjey Choi · Junho Kim · Eunho Yang | N/A | Code |
| GLIGEN: Open-Set Grounded Text-to-Image Generation | Yuheng Li · Haotian Liu · Qingyang Wu · Fangzhou Mu · Jianwei Yang · Jianfeng Gao · Chunyuan Li · Yong Jae Lee | N/A | Code |
| Balanced Spherical Grid for Egocentric View Synthesis | Changwoon Choi · Sang Min Kim · Young Min Kim | N/A | Code |
| V2V4Real: A Real-World Large-Scale Dataset for Vehicle-to-Vehicle Cooperative Perception | Runsheng Xu · Xin Xia · Jinlong Li · Hanzhao Li · Shuo Zhang · Zhengzhong Tu · Zonglin Meng · Hao Xiang · Xiaoyu Dong · Rui Song · Hongkai Yu · Bolei Zhou · Jiaqi Ma | N/A | Code |
| VindLU: A Recipe for Effective Video-and-Language Pretraining | Feng Cheng · Xizi Wang · Jie Lei · David Crandall · Mohit Bansal · Gedas Bertasius | N/A | Code |
| FreeSeg: Unified, Universal and Open-Vocabulary Image Segmentation | Jie Qin · Jie Wu · Pengxiang Yan · Ming Li · Ren Yuxi · Xuefeng Xiao · Yitong Wang · Rui Wang · Shilei Wen · Xin Pan · Xingang Wang | N/A | Code |
| NeurOCS: Neural NOCS Supervision for Monocular 3D Object Localization | Zhixiang Min · Bingbing Zhuang · Samuel Schulter · Buyu Liu · Enrique Dunn · Manmohan Chandraker | N/A | Code |
| ABCD: Arbitrary Bitwise Coefficient for De-Quantization | Woo Kyoung Han · Byeonghun Lee · Sang Hyun Park · Kyong Hwan Jin | N/A | Code |
| PromptCAL: Contrastive Affinity Learning via Auxiliary Prompts for Generalized Novel Category Discovery | Sheng Zhang · Salman Khan · Zhiqiang Shen · Muzammal Naseer · Guangyi Chen · Fahad Shahbaz Khan | N/A | Code |
| Mitigating Task Interference in Multi-Task Learning via Explicit Task Routing With Non-Learnable Primitives | Chuntao Ding · Zhichao Lu · Shangguang Wang · Ran Cheng · Vishnu Naresh Boddeti | N/A | Code |
| MaPLe: Multi-Modal Prompt Learning | Muhammad Uzair Khattak · Hanoona Rasheed · Muhammad Maaz · Salman Khan · Fahad Shahbaz Khan | N/A | Code |
| Revisiting Residual Networks for Adversarial Robustness | Shihua Huang · Zhichao Lu · Kalyanmoy Deb · Vishnu Naresh Boddeti | N/A | Code |
| Bridging the Gap Between Model Explanations in Partially Annotated Multi-Label Classification | Youngwook Kim · Jae Myung Kim · Jieun Jeong · Cordelia Schmid · Zeynep Akata · Jungwoo Lee | N/A | Code |
| Human Pose Estimation in Extremely Low-Light Conditions | Sohyun Lee · Jaesung Rim · Boseung Jeong · Geonu Kim · Byungju Woo · Haechan Lee · Sunghyun Cho · Suha Kwak | N/A | Code |
| Towards Robust Tampered Text Detection in Document Image: New Dataset and New Solution | Chenfan Qu · Chongyu Liu · Yuliang Liu · Xinhong Chen · Dezhi Peng · Fengjun Guo · Lianwen Jin | N/A | Code |
| SinGRAF: Learning a 3D Generative Radiance Field for a Single Scene | Minjung Son · Jeong Joon Park · Leonidas Guibas · Gordon Wetzstein | N/A | Code |
| LEGO-Net: Learning Regular Rearrangements of Objects in Rooms | Qiuhong Anna Wei · Sijie Ding · Jeong Joon Park · Rahul Sajnani · Adrien Poulenard · Srinath Sridhar · Leonidas Guibas | N/A | Code |
| MACARONS: Mapping and Coverage Anticipation With RGB Online Self-Supervision | Antoine Guédon · Tom Monnier · Pascal Monasse · Vincent Lepetit | N/A | Code |
| ALTO: Alternating Latent Topologies for Implicit 3D Reconstruction | Zhen Wang · Shijie Zhou · Jeong Joon Park · Despoina Paschalidou · Suya You · Gordon Wetzstein · Leonidas Guibas · Achuta Kadambi | N/A | Code |
| Class Prototypes Based Contrastive Learning for Classifying Multi-Label and Fine-Grained Educational Videos | Rohit Gupta · Anirban Roy · Claire Christensen · Sujeong Kim · Sarah Gerard · Madeline Cincebeaux · Ajay Divakaran · Todd Grindal · Mubarak Shah | N/A | Code |
| DeepMAD: Mathematical Architecture Design for Deep Convolutional Neural Network | Xuan Shen · Yaohua Wang · Ming Lin · Yilun Huang · Hao Tang · Xiuyu Sun · Yanzhi Wang | N/A | Code |
| ReLight My NeRF: A Dataset for Novel View Synthesis and Relighting of Real World Objects | Marco Toschi · Riccardo De Matteo · Riccardo Spezialetti · Daniele De Gregorio · Luigi Di Stefano · Samuele Salti | N/A | Code |
| Exact-NeRF: An Exploration of a Precise Volumetric Parameterization for Neural Radiance Fields | Brian K. S. Isaac-Medina · Chris G. Willcocks · Toby P. Breckon | N/A | Code |
| A Generalized Framework for Video Instance Segmentation | Miran Heo · Sukjun Hwang · Jeongseok Hyun · Hanjung Kim · Seoung Wug Oh · Joon-Young Lee · Seon Joo Kim | N/A | Code |
| Video Probabilistic Diffusion Models in Projected Latent Space | Sihyun Yu · Kihyuk Sohn · Subin Kim · Jinwoo Shin | N/A | Code |
| X-Avatar: Expressive Human Avatars | Kaiyue Shen · Chen Guo · Manuel Kaufmann · Juan Jose Zarate · Julien Valentin · Jie Song · Otmar Hilliges | N/A | Code |
| Hi4D: 4D Instance Segmentation of Close Human Interaction | Yifei Yin · Chen Guo · Manuel Kaufmann · Juan Jose Zarate · Jie Song · Otmar Hilliges | N/A | Code |
| Joint Appearance and Motion Learning for Efficient Rolling Shutter Correction | Bin Fan · Yuxin Mao · Mochu Xiang · Zhexiong Wan · Qi Liu | N/A | Code |
| Vita-CLIP: Video and Text Adaptive CLIP via Multimodal Prompting | Syed Talal Wasim · Muzammal Naseer · Salman Khan · Fahad Shahbaz Khan · Mubarak Shah | N/A | Code |
| MaskSketch: Unpaired Structure-Guided Masked Image Generation | Dina Bashkirova · José Lezama · Kihyuk Sohn · Kate Saenko · Irfan Essa | N/A | Code |
| Super-CLEVR: A Virtual Benchmark To Diagnose Domain Robustness in Visual Reasoning | Zhuowan Li · Xingrui Wang · Elias Stengel-Eskin · Adam Kortylewski · Wufei Ma · Benjamin Van Durme · Alan L. Yuille | N/A | Code |
| CREPE: Can Vision-Language Foundation Models Reason Compositionally? | Zixian Ma · Jerry Hong · Mustafa Omer Gul · Mona Gandhi · Irena Gao · Ranjay Krishna | N/A | Code |
| ORCa: Glossy Objects As Radiance-Field Cameras | Kushagra Tiwary · Akshat Dave · Nikhil Behari · Tzofi Klinghoffer · Ashok Veeraraghavan · Ramesh Raskar | N/A | Code |
| Learning Common Rationale To Improve Self-Supervised Representation for Fine-Grained Visual Recognition Problems | Yangyang Shu · Anton van den Hengel · Lingqiao Liu | N/A | Code |
| Implicit Occupancy Flow Fields for Perception and Prediction in Self-Driving | Ben Agro · Quinlan Sykora · Sergio Casas · Raquel Urtasun | N/A | Code |
| Improved Test-Time Adaptation for Domain Generalization | Liang Chen · Yong Zhang · Yibing Song · Ying Shan · Lingqiao Liu | N/A | Code |
| Wavelet Diffusion Models Are Fast and Scalable Image Generators | Hao Phung · Quan Dao · Anh Tran | N/A | Code |
| Robust Dynamic Radiance Fields | Yu-Lun Liu · Chen Gao · Andréas Meuleman · Hung-Yu Tseng · Ayush Saraf · Changil Kim · Yung-Yu Chuang · Johannes Kopf · Jia-Bin Huang | N/A | Code |
| MixSim: A Hierarchical Framework for Mixed Reality Traffic Simulation | Simon Suo · Kelvin Wong · Justin Xu · James Tu · Alexander Cui · Sergio Casas · Raquel Urtasun | N/A | Code |
| Gated Multi-Resolution Transfer Network for Burst Restoration and Enhancement | Nancy Mehta · Akshay Dudhane · Subrahmanyam Murala · Syed Waqas Zamir · Salman Khan · Fahad Shahbaz Khan | N/A | Code |
| Class Adaptive Network Calibration | Bingyuan Liu · Jérôme Rony · Adrian Galdran · Jose Dolz · Ismail Ben Ayed | N/A | Code |
| PROB: Probabilistic Objectness for Open World Object Detection | Orr Zohar · Kuan-Chieh Wang · Serena Yeung | N/A | Code |
| Matching Is Not Enough: A Two-Stage Framework for Category-Agnostic Pose Estimation | Min Shi · Zihao Huang · Xianzheng Ma · Xiaowei Hu · Zhiguo Cao | N/A | Code |
| HyperCUT: Video Sequence From a Single Blurry Image Using Unsupervised Ordering | Bang-Dang Pham · Phong Tran · Anh Tran · Cuong Pham · Rang Nguyen · Minh Hoai | N/A | Code |
| On the Effects of Self-Supervision and Contrastive Alignment in Deep Multi-View Clustering | Daniel J. Trosten · Sigurd Løkse · Robert Jenssen · Michael C. Kampffmeyer | N/A | Code |
| Visual Prompt Tuning for Generative Transfer Learning | Kihyuk Sohn · Huiwen Chang · José Lezama · Luisa Polania · Han Zhang · Yuan Hao · Irfan Essa · Lu Jiang | N/A | Code |
| Towards End-to-End Generative Modeling of Long Videos With Memory-Efficient Bidirectional Transformers | Jaehoon Yoo · Semin Kim · Doyup Lee · Chiheon Kim · Seunghoon Hong | N/A | Code |
| MAGVIT: Masked Generative Video Transformer | Lijun Yu · Yong Cheng · Kihyuk Sohn · José Lezama · Han Zhang · Huiwen Chang · Alexander G. Hauptmann · Ming-Hsuan Yang · Yuan Hao · Irfan Essa · Lu Jiang | N/A | Code |
| NICO++: Towards Better Benchmarking for Domain Generalization | Xingxuan Zhang · Yue He · Renzhe Xu · Han Yu · Zheyan Shen · Peng Cui | N/A | Code |
| Gradient Norm Aware Minimization Seeks First-Order Flatness and Improves Generalization | Xingxuan Zhang · Renzhe Xu · Han Yu · Hao Zou · Peng Cui | N/A | Code |
| All-in-Focus Imaging From Event Focal Stack | Hanyue Lou · Minggui Teng · Yixin Yang · Boxin Shi | N/A | Code |
| Clover: Towards a Unified Video-Language Alignment and Fusion Model | Jingjia Huang · Yinan Li · Jiashi Feng · Xinglong Wu · Xiaoshuai Sun · Rongrong Ji | N/A | Code |
| UMat: Uncertainty-Aware Single Image High Resolution Material Capture | Carlos Rodriguez-Pardo · Henar Domínguez-Elvira · David Pascual-Hernández · Elena Garces | N/A | Code |
| Polarimetric iToF: Measuring High-Fidelity Depth Through Scattering Media | Daniel S. Jeon · Andréas Meuleman · Seung-Hwan Baek · Min H. Kim | N/A | Code |
| Freestyle Layout-to-Image Synthesis | Han Xue · Zhiwu Huang · Qianru Sun · Li Song · Wenjun Zhang | N/A | Code |
| Nighttime Smartphone Reflective Flare Removal Using Optical Center Symmetry Prior | Yuekun Dai · Yihang Luo · Shangchen Zhou · Chongyi Li · Chen Change Loy | N/A | Code |
| Meta Omnium: A Benchmark for General-Purpose Learning-To-Learn | Ondrej Bohdal · Yinbing Tian · Yongshuo Zong · Ruchika Chavhan · Da Li · Henry Gouk · Li Guo · Timothy Hospedales | N/A | Code |
| EXCALIBUR: Encouraging and Evaluating Embodied Exploration | Hao Zhu · Raghav Kapoor · So Yeon Min · Winson Han · Jiatai Li · Kaiwen Geng · Graham Neubig · Yonatan Bisk · Aniruddha Kembhavi · Luca Weihs | N/A | Code |
| Detection of Out-of-Distribution Samples Using Binary Neuron Activation Patterns | Bartłomiej Olber · Krystian Radlak · Adam Popowicz · Michal Szczepankiewicz · Krystian Chachuła | N/A | Code |
| Shakes on a Plane: Unsupervised Depth Estimation From Unstabilized Photography | Ilya Chugunov · Yuxuan Zhang · Felix Heide | N/A | Code |
| JacobiNeRF: NeRF Shaping With Mutual Information Gradients | Xiaomeng Xu · Yanchao Yang · Kaichun Mo · Boxiao Pan · Li Yi · Leonidas Guibas | N/A | Code |
| MixMAE: Mixed and Masked Autoencoder for Efficient Pretraining of Hierarchical Vision Transformers | Jihao Liu · Xin Huang · Jinliang Zheng · Yu Liu · Hongsheng Li | N/A | Code |
| Synthesizing Photorealistic Virtual Humans Through Cross-Modal Disentanglement | Siddarth Ravichandran · Ondřej Texler · Dimitar Dinev · Hyun Jae Kang | N/A | Code |
| CIMI4D: A Large Multimodal Climbing Motion Dataset Under Human-Scene Interactions | Ming Yan · Xin Wang · Yudi Dai · Siqi Shen · Chenglu Wen · Lan Xu · Yuexin Ma · Cheng Wang | N/A | Code |
| SLOPER4D: A Scene-Aware Dataset for Global 4D Human Pose Estimation in Urban Environments | Yudi Dai · Yitai Lin · Xiping Lin · Chenglu Wen · Lan Xu · Hongwei Yi · Siqi Shen · Yuexin Ma · Cheng Wang | N/A | Code |
| Viewpoint Equivariance for Multi-View 3D Object Detection | Dian Chen · Jie Li · Vitor Guizilini · Rares Andrei Ambrus · Adrien Gaidon | N/A | Code |
| Balanced Product of Calibrated Experts for Long-Tailed Recognition | Emanuel Sanchez Aimar · Arvi Jonnarth · Michael Felsberg · Marco Kuhlmann | N/A | Code |
| Robust Mean Teacher for Continual and Gradual Test-Time Adaptation | Mario Döbler · Robert A. Marsden · Bin Yang | N/A | Code |
| Positive-Augmented Contrastive Learning for Image and Video Captioning Evaluation | Sara Sarto · Manuele Barraco · Marcella Cornia · Lorenzo Baraldi · Rita Cucchiara | N/A | Code |
| BITE: Beyond Priors for Improved Three-D Dog Pose Estimation | Nadine Rüegg · Shashank Tripathi · Konrad Schindler · Michael J. Black · Silvia Zuffi | N/A | Code |
| SparseFusion: Distilling View-Conditioned Diffusion for 3D Reconstruction | Zhizhuo Zhou · Shubham Tulsiani | N/A | Code |
| PeakConv: Learning Peak Receptive Field for Radar Semantic Segmentation | Liwen Zhang · Xinyan Zhang · Youcheng Zhang · Yufei Guo · Yuanpei Chen · Xuhui Huang · Zhe Ma | N/A | Code |
| Masked Wavelet Representation for Compact Neural Radiance Fields | Daniel Rho · Byeonghyeon Lee · Seungtae Nam · Joo Chan Lee · Jong Hwan Ko · Eunbyung Park | N/A | Code |
| Guided Depth Super-Resolution by Deep Anisotropic Diffusion | Nando Metzger · Rodrigo Caye Daudt · Konrad Schindler | N/A | Code |
| Masked Images Are Counterfactual Samples for Robust Fine-Tuning | Yao Xiao · Ziyi Tang · Pengxu Wei · Cong Liu · Liang Lin | N/A | Code |
| Unsupervised Deep Probabilistic Approach for Partial Point Cloud Registration | Guofeng Mei · Hao Tang · Xiaoshui Huang · Weijie Wang · Juan Liu · Jian Zhang · Luc Van Gool · Qiang Wu | N/A | Code |
| ECON: Explicit Clothed Humans Optimized via Normal Integration | Yuliang Xiu · Jinlong Yang · Xu Cao · Dimitrios Tzionas · Michael J. Black | N/A | Code |
| GEN: Pushing the Limits of Softmax-Based Out-of-Distribution Detection | Xixi Liu · Yaroslava Lochman · Christopher Zach | N/A | Code |
| OCTET: Object-Aware Counterfactual Explanations | Mehdi Zemni · Mickaël Chen · Éloi Zablocki · Hédi Ben-Younes · Patrick Pérez · Matthieu Cord | N/A | Code |
| Consistent View Synthesis With Pose-Guided Diffusion Models | Hung-Yu Tseng · Qinbo Li · Changil Kim · Suhib Alsisan · Jia-Bin Huang · Johannes Kopf | N/A | Code |
| GFPose: Learning 3D Human Pose Prior With Gradient Fields | Hai Ci · Mingdong Wu · Wentao Zhu · Xiaoxuan Ma · Hao Dong · Fangwei Zhong · Yizhou Wang | N/A | Code |
| Bayesian Posterior Approximation With Stochastic Ensembles | Oleksandr Balabanov · Bernhard Mehlig · Hampus Linander | N/A | Code |
| Spatio-Focal Bidirectional Disparity Estimation From a Dual-Pixel Image | Donggun Kim · Hyeonjoong Jang · Inchul Kim · Min H. Kim | N/A | Code |
| Octree Guided Unoriented Surface Reconstruction | Chamin Hewa Koneputugodage · Yizhak Ben-Shabat · Stephen Gould | N/A | Code |
| HAAV: Hierarchical Aggregation of Augmented Views for Image Captioning | Chia-Wen Kuo · Zsolt Kira | N/A | Code |
| SUDS: Scalable Urban Dynamic Scenes | Haithem Turki · Jason Y. Zhang · Francesco Ferroni · Deva Ramanan | N/A | Code |
| Harmonious Feature Learning for Interactive Hand-Object Pose Estimation | Zhifeng Lin · Changxing Ding · Huan Yao · Zengsheng Kuang · Shaoli Huang | N/A | Code |
| Modernizing Old Photos Using Multiple References via Photorealistic Style Transfer | Agus Gunawan · Soo Ye Kim · Hyeonjun Sim · Jae-Ho Lee · Munchurl Kim | N/A | Code |
| Trainable Projected Gradient Method for Robust Fine-Tuning | Junjiao Tian · Zecheng He · Xiaoliang Dai · Chih-Yao Ma · Yen-Cheng Liu · Zsolt Kira | N/A | Code |
| OReX: Object Reconstruction From Planar Cross-Sections Using Neural Fields | Haim Sawdayee · Amir Vaxman · Amit H. Bermano | N/A | Code |
| CARTO: Category and Joint Agnostic Reconstruction of ARTiculated Objects | Nick Heppert · Zubair Irshad · Sergey Zakharov · Katherine Liu · Rares Andrei Ambrus · Jeannette Bohg · Abhinav Valada · Thomas Kollar | N/A | Code |
| ACR: Attention Collaboration-Based Regressor for Arbitrary Two-Hand Reconstruction | Zhengdi Yu · Shaoli Huang · Chen Fang · Toby P. Breckon · Jue Wang | N/A | Code |
| Perception and Semantic Aware Regularization for Sequential Confidence Calibration | Zhenghua Peng · Yu Luo · Tianshui Chen · Keke Xu · Shuangping Huang | N/A | Code |
| Crowd3D: Towards Hundreds of People Reconstruction From a Single Image | Hao Wen · Jing Huang · Huili Cui · Haozhe Lin · Yu-Kun Lai · Lu Fang · Kun Li | N/A | Code |
| ZegCLIP: Towards Adapting CLIP for Zero-Shot Semantic Segmentation | Ziqin Zhou · Yinjie Lei · Bowen Zhang · Lingqiao Liu · Yifan Liu | N/A | Code |
| Learning Semantic-Aware Disentangled Representation for Flexible 3D Human Body Editing | Xiaokun Sun · Qiao Feng · Xiongzheng Li · Jinsong Zhang · Yu-Kun Lai · Jingyu Yang · Kun Li | N/A | Code |
| Skinned Motion Retargeting With Residual Perception of Motion Semantics & Geometry | Jiaxu Zhang · Junwu Weng · Di Kang · Fang Zhao · Shaoli Huang · Xuefei Zhe · Linchao Bao · Ying Shan · Jue Wang · Zhigang Tu | N/A | Code |
| Unknown Sniffer for Object Detection: Don’t Turn a Blind Eye to Unknown Objects | Wenteng Liang · Feng Xue · Yihao Liu · Guofeng Zhong · Anlong Ming | N/A | Code |
| RangeViT: Towards Vision Transformers for 3D Semantic Segmentation in Autonomous Driving | Angelika Ando · Spyros Gidaris · Andrei Bursuc · Gilles Puy · Alexandre Boulch · Renaud Marlet | N/A | Code |
| 3D-Aware Facial Landmark Detection via Multi-View Consistent Training on Synthetic Data | Libing Zeng · Lele Chen · Wentao Bao · Zhong Li · Yi Xu · Junsong Yuan · Nima Khademi Kalantari | N/A | Code |
| Best of Both Worlds: Multimodal Contrastive Learning With Tabular and Imaging Data | Paul Hager · Martin J. Menten · Daniel Rueckert | N/A | Code |
| JRDB-Pose: A Large-Scale Dataset for Multi-Person Pose Estimation and Tracking | Edward Vendrow · Tho Le · Jianfei Cai · Hamid Rezatofighi | N/A | Code |
| Consistent Direct Time-of-Flight Video Depth Super-Resolution | Zhanghao Sun · Wei Ye · Jinhui Xiong · Gyeongmin Choe · Jialiang Wang · Shuochen Su · Rakesh Ranjan | N/A | Code |
| Correlational Image Modeling for Self-Supervised Visual Pre-Training | Wei Li · Jiahao Xie · Chen Change Loy | N/A | Code |
| CelebV-Text: A Large-Scale Facial Text-Video Dataset | Jianhui Yu · Hao Zhu · Liming Jiang · Chen Change Loy · Weidong Cai · Wayne Wu | N/A | Code |
| Are Binary Annotations Sufficient? Video Moment Retrieval via Hierarchical Uncertainty-Based Active Learning | Wei Ji · Renjie Liang · Zhedong Zheng · Wenqiao Zhang · Shengyu Zhang · Juncheng Li · Mengze Li · Tat-seng Chua | N/A | Code |
| Learning 3D Scene Priors With 2D Supervision | Yinyu Nie · Angela Dai · Xiaoguang Han · Matthias Nießner | N/A | Code |
| Generating Aligned Pseudo-Supervision From Non-Aligned Data for Image Restoration in Under-Display Camera | Ruicheng Feng · Chongyi Li · Huaijin Chen · Shuai Li · Jinwei Gu · Chen Change Loy | N/A | Code |
| Siamese DETR | Zeren Chen · Gengshi Huang · Wei Li · Jianing Teng · Kun Wang · Jing Shao · Chen Change Loy · Lu Sheng | N/A | Code |
| Panoptic Video Scene Graph Generation | Jingkang Yang · Wenxuan Peng · Xiangtai Li · Zujin Guo · Liangyu Chen · Bo Li · Zheng Ma · Kaiyang Zhou · Wayne Zhang · Chen Change Loy · Ziwei Liu | N/A | Code |
| Randomized Adversarial Training via Taylor Expansion | Gaojie Jin · Xinping Yi · Dengyu Wu · Ronghui Mu · Xiaowei Huang | N/A | Code |
| Task Residual for Tuning Vision-Language Models | Tao Yu · Zhihe Lu · Xin Jin · Zhibo Chen · Xinchao Wang | N/A | Code |
| PACO: Parts and Attributes of Common Objects | Vignesh Ramanathan · Anmol Kalia · Vladan Petrovic · Yi Wen · Baixue Zheng · Baishan Guo · Rui Wang · Aaron Marquez · Rama Kovvuri · Abhishek Kadian · Amir Mousavi · Yiwen Song · Abhimanyu Dubey · Dhruv Mahajan | N/A | Code |
| CloSET: Modeling Clothed Humans on Continuous Surface With Explicit Template Decomposition | Hongwen Zhang · Siyou Lin · Ruizhi Shao · Yuxiang Zhang · Zerong Zheng · Han Huang · Yandong Guo · Yebin Liu | N/A | Code |
| Transductive Few-Shot Learning With Prototype-Based Label Propagation by Iterative Graph Refinement | Hao Zhu · Piotr Koniusz | N/A | Code |
| DualVector: Unsupervised Vector Font Synthesis With Dual-Part Representation | Ying-Tian Liu · Zhifei Zhang · Yuan-Chen Guo · Matthew Fisher · Zhaowen Wang · Song-Hai Zhang | N/A | Code |
| Invertible Neural Skinning | Yash Kant · Aliaksandr Siarohin · Riza Alp Guler · Menglei Chai · Jian Ren · Sergey Tulyakov · Igor Gilitschenski | N/A | Code |
| Next3D: Generative Neural Texture Rasterization for 3D-Aware Head Avatars | Jingxiang Sun · Xuan Wang · Lizhen Wang · Xiaoyu Li · Yong Zhang · Hongwen Zhang · Yebin Liu | N/A | Code |
| Is BERT Blind? Exploring the Effect of Vision-and-Language Pretraining on Visual Language Understanding | Morris Alper · Michael Fiman · Hadar Averbuch-Elor | N/A | Code |
| ConStruct-VL: Data-Free Continual Structured VL Concepts Learning | James Seale Smith · Paola Cascante-Bonilla · Assaf Arbelle · Donghyun Kim · Rameswar Panda · David Cox · Diyi Yang · Zsolt Kira · Rogerio Feris · Leonid Karlinsky | N/A | Code |
| LINe: Out-of-Distribution Detection by Leveraging Important Neurons | Yong Hyun Ahn · Gyeong-Moon Park · Seong Tae Kim | N/A | Code |
| Multimodality Helps Unimodality: Cross-Modal Few-Shot Learning With Multimodal Models | Zhiqiu Lin · Samuel Yu · Zhiyi Kuang · Deepak Pathak · Deva Ramanan | N/A | Code |
| Panoptic Lifting for 3D Scene Understanding With Neural Fields | Yawar Siddiqui · Lorenzo Porzi · Samuel Rota Bulò · Norman Müller · Matthias Nießner · Angela Dai · Peter Kontschieder | N/A | Code |
| GamutMLP: A Lightweight MLP for Color Loss Recovery | Hoang M. Le · Brian Price · Scott Cohen · Michael S. Brown | N/A | Code |
| DIFu: Depth-Guided Implicit Function for Clothed Human Reconstruction | Dae-Young Song · HeeKyung Lee · Jeongil Seo · Donghyeon Cho | N/A | Code |
| NLOST: Non-Line-of-Sight Imaging With Transformer | Yue Li · Jiayong Peng · Juntian Ye · Yueyi Zhang · Feihu Xu · Zhiwei Xiong | N/A | Code |
| SeaThru-NeRF: Neural Radiance Fields in Scattering Media | Deborah Levy · Amit Peleg · Naama Pearl · Dan Rosenbaum · Derya Akkaynak · Simon Korman · Tali Treibitz | N/A | Code |
| Omni3D: A Large Benchmark and Model for 3D Object Detection in the Wild | Garrick Brazil · Abhinav Kumar · Julian Straub · Nikhila Ravi · Justin Johnson · Georgia Gkioxari | N/A | Code |
| Learning on Gradients: Generalized Artifacts Representation for GAN-Generated Images Detection | Chuangchuang Tan · Yao Zhao · Shikui Wei · Guanghua Gu · Yunchao Wei | N/A | Code |
| LayoutDM: Discrete Diffusion Model for Controllable Layout Generation | Naoto Inoue · Kotaro Kikuchi · Edgar Simo-Serra · Mayu Otani · Kota Yamaguchi | N/A | Code |
| Learning Customized Visual Models With Retrieval-Augmented Knowledge | Haotian Liu · Kilho Son · Jianwei Yang · Ce Liu · Jianfeng Gao · Yong Jae Lee · Chunyuan Li | N/A | Code |
| MAIR: Multi-View Attention Inverse Rendering With 3D Spatially-Varying Lighting Estimation | JunYong Choi · SeokYeong Lee · Haesol Park · Seung-Won Jung · Ig-Jae Kim · Junghyun Cho | N/A | Code |
| Generalizing Dataset Distillation via Deep Generative Prior | George Cazenavette · Tongzhou Wang · Antonio Torralba · Alexei A. Efros · Jun-Yan Zhu | N/A | Code |
| Polarized Color Image Denoising | Zhuoxiao Li · Haiyang Jiang · Mingdeng Cao · Yinqiang Zheng | N/A | Code |
| Score Jacobian Chaining: Lifting Pretrained 2D Diffusion Models for 3D Generation | Haochen Wang · Xiaodan Du · Jiahao Li · Raymond A. Yeh · Greg Shakhnarovich | N/A | Code |
| FJMP: Factorized Joint Multi-Agent Motion Prediction Over Learned Directed Acyclic Interaction Graphs | Luke Rowe · Martin Ethier · Eli-Henry Dykhne · Krzysztof Czarnecki | N/A | Code |
| Mask-Free Video Instance Segmentation | Lei Ke · Martin Danelljan · Henghui Ding · Yu-Wing Tai · Chi-Keung Tang · Fisher Yu | N/A | Code |
| OVTrack: Open-Vocabulary Multiple Object Tracking | Siyuan Li · Tobias Fischer · Lei Ke · Henghui Ding · Martin Danelljan · Fisher Yu | N/A | Code |
| LightPainter: Interactive Portrait Relighting With Freehand Scribble | Yiqun Mei · He Zhang · Xuaner Zhang · Jianming Zhang · Zhixin Shu · Yilin Wang · Zijun Wei · Shi Yan · HyunJoon Jung · Vishal M. Patel | N/A | Code |
| Towards Scalable Neural Representation for Diverse Videos | Bo He · Xitong Yang · Hanyu Wang · Zuxuan Wu · Hao Chen · Shuaiyi Huang · Yixuan Ren · Ser-Nam Lim · Abhinav Shrivastava | N/A | Code |
| Teaching Matters: Investigating the Role of Supervision in Vision Transformers | Matthew Walmer · Saksham Suri · Kamal Gupta · Abhinav Shrivastava | N/A | Code |
| FlexNeRF: Photorealistic Free-Viewpoint Rendering of Moving Humans From Sparse Views | Vinoj Jayasundara · Amit Agrawal · Nicolas Heron · Abhinav Shrivastava · Larry S. Davis | N/A | Code |
| Leveraging Temporal Context in Low Representational Power Regimes | Camilo L. Fosco · SouYoung Jin · Emilie Josephs · Aude Oliva | N/A | Code |
| Painting 3D Nature in 2D: View Synthesis of Natural Scenes From a Single Semantic Mask | Shangzhan Zhang · Sida Peng · Tianrun Chen · Linzhan Mou · Haotong Lin · Kaicheng Yu · Yiyi Liao · Xiaowei Zhou | N/A | Code |
| Align and Attend: Multimodal Summarization With Dual Contrastive Losses | Bo He · Jun Wang · Jielin Qiu · Trung Bui · Abhinav Shrivastava · Zhaowen Wang | N/A | Code |
| SimpSON: Simplifying Photo Cleanup With Single-Click Distracting Object Segmentation Network | Chuong Huynh · Yuqian Zhou · Zhe Lin · Connelly Barnes · Eli Shechtman · Sohrab Amirghodsi · Abhinav Shrivastava | N/A | Code |
| NIRVANA: Neural Implicit Representations of Videos With Adaptive Networks and Autoregressive Patch-Wise Modeling | Shishira R Maiya · Sharath Girish · Max Ehrlich · Hanyu Wang · Kwot Sin Lee · Patrick Poirson · Pengxiang Wu · Chen Wang · Abhinav Shrivastava | N/A | Code |
| Seeing Beyond the Brain: Conditional Diffusion Model With Sparse Masked Modeling for Vision Decoding | Zijiao Chen · Jiaxin Qing · Tiange Xiang · Wan Lin Yue · Juan Helen Zhou | N/A | Code |
| Position-Guided Text Prompt for Vision-Language Pre-Training | Jinpeng Wang · Pan Zhou · Mike Zheng Shou · Shuicheng Yan | N/A | Code |
| Progressive Spatio-Temporal Alignment for Efficient Event-Based Motion Estimation | Xueyan Huang · Yueyi Zhang · Zhiwei Xiong | N/A | Code |
| MIST: Multi-Modal Iterative Spatial-Temporal Transformer for Long-Form Video Question Answering | Difei Gao · Luowei Zhou · Lei Ji · Linchao Zhu · Yi Yang · Mike Zheng Shou | N/A | Code |
| Histopathology Whole Slide Image Analysis With Heterogeneous Graph Representation Learning | Tsai Hor Chan · Fernando Julio Cendra · Lan Ma · Guosheng Yin · Lequan Yu | N/A | Code |
| Making Vision Transformers Efficient From a Token Sparsification View | Shuning Chang · Pichao Wang · Ming Lin · Fan Wang · David Junhao Zhang · Rong Jin · Mike Zheng Shou | N/A | Code |
| Leverage Interactive Affinity for Affordance Learning | Hongchen Luo · Wei Zhai · Jing Zhang · Yang Cao · Dacheng Tao | N/A | Code |
| Uncertainty-Aware Optimal Transport for Semantically Coherent Out-of-Distribution Detection | Fan Lu · Kai Zhu · Wei Zhai · Kecheng Zheng · Yang Cao | N/A | Code |
| HARP: Personalized Hand Reconstruction From a Monocular RGB Video | Korrawe Karunratanakul · Sergey Prokudin · Otmar Hilliges · Siyu Tang | N/A | Code |
| Towards Effective Visual Representations for Partial-Label Learning | Shiyu Xia · Jiaqi Lv · Ning Xu · Gang Niu · Xin Geng | N/A | Code |
| SFD2: Semantic-Guided Feature Detection and Description | Fei Xue · Ignas Budvytis · Roberto Cipolla | N/A | Code |
| MetaPortrait: Identity-Preserving Talking Head Generation With Fast Personalized Adaptation | Bowen Zhang · Chenyang Qi · Pan Zhang · Bo Zhang · HsiangTao Wu · Dong Chen · Qifeng Chen · Yong Wang · Fang Wen | N/A | Code |
| The Dialog Must Go On: Improving Visual Dialog via Generative Self-Training | Gi-Cheon Kang · Sungdong Kim · Jin-Hwa Kim · Donghyun Kwak · Byoung-Tak Zhang | N/A | Code |
| Temporal Interpolation Is All You Need for Dynamic Neural Radiance Fields | Sungheon Park · Minjung Son · Seokhwan Jang · Young Chun Ahn · Ji-Yeon Kim · Nahyup Kang | N/A | Code |
| DiGA: Distil To Generalize and Then Adapt for Domain Adaptive Semantic Segmentation | Fengyi Shen · Akhil Gurram · Ziyuan Liu · He Wang · Alois Knoll | N/A | Code |
| Multimodal Prompting With Missing Modalities for Visual Recognition | Yi-Lun Lee · Yi-Hsuan Tsai · Wei-Chen Chiu · Chen-Yu Lee | N/A | Code |
| On Calibrating Semantic Segmentation Models: Analyses and an Algorithm | Dongdong Wang · Boqing Gong · Liqiang Wang | N/A | Code |
| IMP: Iterative Matching and Pose Estimation With Adaptive Pooling | Fei Xue · Ignas Budvytis · Roberto Cipolla | N/A | Code |
| Grid-Guided Neural Radiance Fields for Large Urban Scenes | Linning Xu · Yuanbo Xiangli · Sida Peng · Xingang Pan · Nanxuan Zhao · Christian Theobalt · Bo Dai · Dahua Lin | N/A | Code |
| Neural Voting Field for Camera-Space 3D Hand Pose Estimation | Lin Huang · Chung-Ching Lin · Kevin Lin · Lin Liang · Lijuan Wang · Junsong Yuan · Zicheng Liu | N/A | Code |
| Dense Network Expansion for Class Incremental Learning | Zhiyuan Hu · Yunsheng Li · Jiancheng Lyu · Dashan Gao · Nuno Vasconcelos | N/A | Code |
| FashionSAP: Symbols and Attributes Prompt for Fine-Grained Fashion Vision-Language Pre-Training | Yunpeng Han · Lisai Zhang · Qingcai Chen · Zhijian Chen · Zhonghua Li · Jianxin Yang · Zhao Cao | N/A | Code |
| Batch Model Consolidation: A Multi-Task Model Consolidation Framework | Iordanis Fostiropoulos · Jiaye Zhu · Laurent Itti | N/A | Code |
| Open-Vocabulary Attribute Detection | María A. Bravo · Sudhanshu Mittal · Simon Ging · Thomas Brox | N/A | Code |
| Unite and Conquer: Plug & Play Multi-Modal Synthesis Using Diffusion Models | Nithin Gopalakrishnan Nair · Wele Gedara Chaminda Bandara · Vishal M. Patel | N/A | Code |
| BEVHeight: A Robust Framework for Vision-Based Roadside 3D Object Detection | Lei Yang · Kaicheng Yu · Tao Tang · Jun Li · Kun Yuan · Li Wang · Xinyu Zhang · Peng Chen | N/A | Code |
| Re2TAL: Rewiring Pretrained Video Backbones for Reversible Temporal Action Localization | Chen Zhao · Shuming Liu · Karttikeya Mangalam · Bernard Ghanem | N/A | Code |
| C-SFDA: A Curriculum Learning Aided Self-Training Framework for Efficient Source Free Domain Adaptation | Nazmul Karim · Niluthpol Chowdhury Mithun · Abhinav Rajvanshi · Han-pang Chiu · Supun Samarasekera · Nazanin Rahnavard | N/A | Code |
| Are Deep Neural Networks SMARTer Than Second Graders? | Anoop Cherian · Kuan-Chuan Peng · Suhas Lohit · Kevin A. Smith · Joshua B. Tenenbaum | N/A | Code |
| Persistent Nature: A Generative Model of Unbounded 3D Worlds | Lucy Chai · Richard Tucker · Zhengqi Li · Phillip Isola · Noah Snavely | N/A | Code |
| InternImage: Exploring Large-Scale Vision Foundation Models With Deformable Convolutions | Wenhai Wang · Jifeng Dai · Zhe Chen · Zhenhang Huang · Zhiqi Li · Xizhou Zhu · Xiaowei Hu · Tong Lu · Lewei Lu · Hongsheng Li · Xiaogang Wang · Yu Qiao | N/A | Code |
| Learning To Fuse Monocular and Multi-View Cues for Multi-Frame Depth Estimation in Dynamic Scenes | Rui Li · Dong Gong · Wei Yin · Hao Chen · Yu Zhu · Kaixuan Wang · Xiaozhi Chen · Jinqiu Sun · Yanning Zhang | N/A | Code |
| Benchmarking Self-Supervised Learning on Diverse Pathology Datasets | Mingu Kang · Heon Song · Seonwook Park · Donggeun Yoo · Sérgio Pereira | N/A | Code |
| Knowledge Distillation for 6D Pose Estimation by Aligning Distributions of Local Predictions | Shuxuan Guo · Yinlin Hu · Jose M. Alvarez · Mathieu Salzmann | N/A | Code |
| Self-Supervised Representation Learning for CAD | Benjamin T. Jones · Michael Hu · Milin Kodnongbua · Vladimir G. Kim · Adriana Schulz | N/A | Code |
| SparseViT: Revisiting Activation Sparsity for Efficient High-Resolution Vision Transformer | Xuanyao Chen · Zhijian Liu · Haotian Tang · Li Yi · Hang Zhao · Song Han | N/A | Code |
| Neural Pixel Composition for 3D-4D View Synthesis From Multi-Views | Aayush Bansal · Michael Zollhöfer | N/A | Code |
| ViP3D: End-to-End Visual Trajectory Prediction via 3D Agent Queries | Junru Gu · Chenxu Hu · Tianyuan Zhang · Xuanyao Chen · Yilun Wang · Yue Wang · Hang Zhao | N/A | Code |
| AdaMAE: Adaptive Masking for Efficient Spatiotemporal Learning With Masked Autoencoders | Wele Gedara Chaminda Bandara · Naman Patel · Ali Gholami · Mehdi Nikkhah · Motilal Agrawal · Vishal M. Patel | N/A | Code |
| Masked Scene Contrast: A Scalable Framework for Unsupervised 3D Representation Learning | Xiaoyang Wu · Xin Wen · Xihui Liu · Hengshuang Zhao | N/A | Code |
| RIFormer: Keep Your Vision Backbone Effective but Removing Token Mixer | Jiahao Wang · Songyang Zhang · Yong Liu · Taiqiang Wu · Yujiu Yang · Xihui Liu · Kai Chen · Ping Luo · Dahua Lin | N/A | Code |
| TeSLA: Test-Time Self-Learning With Automatic Adversarial Augmentation | Devavrat Tomar · Guillaume Vray · Behzad Bozorgtabar · Jean-Philippe Thiran | N/A | Code |
| ObjectMatch: Robust Registration Using Canonical Object Correspondences | Can Gümeli · Angela Dai · Matthias Nießner | N/A | Code |
| Dream3D: Zero-Shot Text-to-3D Synthesis Using 3D Shape Prior and Text-to-Image Diffusion Models | Jiale Xu · Xintao Wang · Weihao Cheng · Yan-Pei Cao · Ying Shan · Xiaohu Qie · Shenghua Gao | N/A | Code |
| SurfelNeRF: Neural Surfel Radiance Fields for Online Photorealistic Reconstruction of Indoor Scenes | Yiming Gao · Yan-Pei Cao · Ying Shan | N/A | Code |
| Object Detection With Self-Supervised Scene Adaptation | Zekun Zhang · Minh Hoai | N/A | Code |
| Megahertz Light Steering Without Moving Parts | Adithya Pediredla · Srinivasa G. Narasimhan · Maysamreza Chamanzar · Ioannis Gkioulekas | N/A | Code |
| ISBNet: A 3D Point Cloud Instance Segmentation Network With Instance-Aware Sampling and Box-Aware Dynamic Convolution | Tuan Duc Ngo · Binh-Son Hua · Khoi Nguyen | N/A | Code |
| Rate Gradient Approximation Attack Threats Deep Spiking Neural Networks | Tong Bu · Jianhao Ding · Zecheng Hao · Zhaofei Yu | N/A | Code |
| PIVOT: Prompting for Video Continual Learning | Andrés Villa · Juan León Alcázar · Motasem Alfarra · Kumail Alhamoud · Julio Hurtado · Fabian Caba Heilbron · Alvaro Soto · Bernard Ghanem | N/A | Code |
| ARO-Net: Learning Implicit Fields From Anchored Radial Observations | Yizhi Wang · Zeyu Huang · Ariel Shamir · Hui Huang · Hao Zhang · Ruizhen Hu | N/A | Code |
| Parallel Diffusion Models of Operator and Image for Blind Inverse Problems | Hyungjin Chung · Jeongsol Kim · Sehui Kim · Jong Chul Ye | N/A | Code |
| Solving 3D Inverse Problems Using Pre-Trained 2D Diffusion Models | Hyungjin Chung · Dohoon Ryu · Michael T. McCann · Marc L. Klasky · Jong Chul Ye | N/A | Code |
| Affordance Grounding From Demonstration Video To Target Image | Joya Chen · Difei Gao · Kevin Qinghong Lin · Mike Zheng Shou | N/A | Code |
| Learning Procedure-Aware Video Representation From Instructional Videos and Their Narrations | Yiwu Zhong · Licheng Yu · Yang Bai · Shangwen Li · Xueting Yan · Yin Li | N/A | Code |
| YOLOv7: Trainable Bag-of-Freebies Sets New State-of-the-Art for Real-Time Object Detectors | Chien-Yao Wang · Alexey Bochkovskiy · Hong-Yuan Mark Liao | N/A | Code |
| OmniCity: Omnipotent City Understanding With Multi-Level and Multi-View Images | Weijia Li · Yawen Lai · Linning Xu · Yuanbo Xiangli · Jinhua Yu · Conghui He · Gui-Song Xia · Dahua Lin | N/A | Code |
| Object Discovery From Motion-Guided Tokens | Zhipeng Bao · Pavel Tokmakov · Yu-Xiong Wang · Adrien Gaidon · Martial Hebert | N/A | Code |
| MP-Former: Mask-Piloted Transformer for Image Segmentation | Hao Zhang · Feng Li · Huaizhe Xu · Shijia Huang · Shilong Liu · Lionel M. Ni · Lei Zhang | N/A | Code |
| Disentangling Writer and Character Styles for Handwriting Generation | Gang Dai · Yifan Zhang · Qingfeng Wang · Qing Du · Zhuliang Yu · Zhuoman Liu · Shuangping Huang | N/A | Code |
| Building Rearticulable Models for Arbitrary 3D Objects From 4D Point Clouds | Shaowei Liu · Saurabh Gupta · Shenlong Wang | N/A | Code |
| Gated Stereo: Joint Depth Estimation From Gated and Wide-Baseline Active Stereo Cues | Stefanie Walz · Mario Bijelic · Andrea Ramazzina · Amanpreet Walia · Fahim Mannan · Felix Heide | N/A | Code |
| Dynamically Instance-Guided Adaptation: A Backward-Free Approach for Test-Time Domain Adaptive Semantic Segmentation | Wei Wang · Zhun Zhong · Weijie Wang · Xi Chen · Charles Ling · Boyu Wang · Nicu Sebe | N/A | Code |
| Perspective Fields for Single Image Camera Calibration | Linyi Jin · Jianming Zhang · Yannick Hold-Geoffroy · Oliver Wang · Kevin Blackburn-Matzen · Matthew Sticha · David F. Fouhey | N/A | Code |
| Vision Transformers Are Parameter-Efficient Audio-Visual Learners | Yan-Bo Lin · Yi-Lin Sung · Jie Lei · Mohit Bansal · Gedas Bertasius | N/A | Code |
| Efficient Semantic Segmentation by Altering Resolutions for Compressed Videos | Yubin Hu · Yuze He · Yanghao Li · Jisheng Li · Yuxing Han · Jiangtao Wen · Yong-Jin Liu | N/A | Code |
| DisWOT: Student Architecture Search for Distillation WithOut Training | Peijie Dong · Lujun Li · Zimian Wei | N/A | Code |
| Activating More Pixels in Image Super-Resolution Transformer | Xiangyu Chen · Xintao Wang · Jiantao Zhou · Yu Qiao · Chao Dong | N/A | Code |
| You Only Segment Once: Towards Real-Time Panoptic Segmentation | Jie Hu · Linyan Huang · Tianhe Ren · Shengchuan Zhang · Rongrong Ji · Liujuan Cao | N/A | Code |
| PA&DA: Jointly Sampling Path and Data for Consistent NAS | Shun Lu · Yu Hu · Longxing Yang · Zihao Sun · Jilin Mei · Jianchao Tan · Chengru Song | N/A | Code |
| NeuralUDF: Learning Unsigned Distance Fields for Multi-View Reconstruction of Surfaces With Arbitrary Topologies | Xiaoxiao Long · Cheng Lin · Lingjie Liu · Yuan Liu · Peng Wang · Christian Theobalt · Taku Komura · Wenping Wang | N/A | Code |
| Towards Universal Fake Image Detectors That Generalize Across Generative Models | Utkarsh Ojha · Yuheng Li · Yong Jae Lee | N/A | Code |
| FLAG3D: A 3D Fitness Activity Dataset With Language Instruction | Yansong Tang · Jinpeng Liu · Aoyang Liu · Bin Yang · Wenxun Dai · Yongming Rao · Jiwen Lu · Jie Zhou · Xiu Li | N/A | Code |
| NEF: Neural Edge Fields for 3D Parametric Curve Reconstruction From Multi-View Images | Yunfan Ye · Renjiao Yi · Zhirui Gao · Chenyang Zhu · Zhiping Cai · Kai Xu | N/A | Code |
| Executing Your Commands via Motion Diffusion in Latent Space | Xin Chen · Biao Jiang · Wen Liu · Zilong Huang · Bin Fu · Tao Chen · Gang Yu | N/A | Code |
| MSINet: Twins Contrastive Search of Multi-Scale Interaction for Object ReID | Jianyang Gu · Kai Wang · Hao Luo · Chen Chen · Wei Jiang · Yuqiang Fang · Shanghang Zhang · Yang You · Jian Zhao | N/A | Code |
| SunStage: Portrait Reconstruction and Relighting Using the Sun as a Light Stage | Yifan Wang · Aleksander Holynski · Xiuming Zhang · Xuaner Zhang | N/A | Code |
| IS-GGT: Iterative Scene Graph Generation With Generative Transformers | Sanjoy Kundu · Sathyanarayanan N. Aakur | N/A | Code |
| DisCoScene: Spatially Disentangled Generative Radiance Fields for Controllable 3D-Aware Scene Synthesis | Yinghao Xu · Menglei Chai · Zifan Shi · Sida Peng · Ivan Skorokhodov · Aliaksandr Siarohin · Ceyuan Yang · Yujun Shen · Hsin-Ying Lee · Bolei Zhou · Sergey Tulyakov | N/A | Code |
| Breaking the “Object” in Video Object Segmentation | Pavel Tokmakov · Jie Li · Adrien Gaidon | N/A | Code |
| SimpleNet: A Simple Network for Image Anomaly Detection and Localization | Zhikang Liu · Yiming Zhou · Yuansheng Xu · Zilei Wang | N/A | Code |
| Taming Diffusion Models for Audio-Driven Co-Speech Gesture Generation | Lingting Zhu · Xian Liu · Xuanyu Liu · Rui Qian · Ziwei Liu · Lequan Yu | N/A | Code |
| Top-Down Visual Attention From Analysis by Synthesis | Baifeng Shi · Trevor Darrell · Xin Wang | N/A | Code |
| Disentangling Orthogonal Planes for Indoor Panoramic Room Layout Estimation With Cross-Scale Distortion Awareness | Zhijie Shen · Zishuo Zheng · Chunyu Lin · Lang Nie · Kang Liao · Shuai Zheng · Yao Zhao | N/A | Code |
| Global-to-Local Modeling for Video-Based 3D Human Pose and Shape Estimation | Xiaolong Shen · Zongxin Yang · Xiaohan Wang · Jianxin Ma · Chang Zhou · Yi Yang | N/A | Code |
| RaBit: Parametric Modeling of 3D Biped Cartoon Characters With a Topological-Consistent Dataset | Zhongjin Luo · Shengcai Cai · Jinguo Dong · Ruibo Ming · Liangdong Qiu · Xiaohang Zhan · Xiaoguang Han | N/A | Code |
| Masked Image Modeling With Local Multi-Scale Reconstruction | Haoqing Wang · Yehui Tang · Yunhe Wang · Jianyuan Guo · Zhi-Hong Deng · Kai Han | N/A | Code |
| Omni Aggregation Networks for Lightweight Image Super-Resolution | Hang Wang · Xuanhong Chen · Bingbing Ni · Yutian Liu · Jinfan Liu | N/A | Code |
| TryOnDiffusion: A Tale of Two UNets | Luyang Zhu · Dawei Yang · Tyler Zhu · Fitsum Reda · William Chan · Chitwan Saharia · Mohammad Norouzi · Ira Kemelmacher-Shlizerman | N/A | Code |
| MoLo: Motion-Augmented Long-Short Contrastive Learning for Few-Shot Action Recognition | Xiang Wang · Shiwei Zhang · Zhiwu Qing · Changxin Gao · Yingya Zhang · Deli Zhao · Nong Sang | N/A | Code |
| Dynamic Aggregated Network for Gait Recognition | Kang Ma · Ying Fu · Dezhi Zheng · Chunshui Cao · Xuecai Hu · Yongzhen Huang | N/A | Code |
| Equivalent Transformation and Dual Stream Network Construction for Mobile Image Super-Resolution | Jiahao Chao · Zhou Zhou · Hongfan Gao · Jiali Gong · Zhengfeng Yang · Zhenbing Zeng · Lydia Dehbi | N/A | Code |
| Semi-Supervised Hand Appearance Recovery via Structure Disentanglement and Dual Adversarial Discrimination | Zimeng Zhao · Binghui Zuo · Zhiyu Long · Yangang Wang | N/A | Code |
| DBARF: Deep Bundle-Adjusting Generalizable Neural Radiance Fields | Yu Chen · Gim Hee Lee | N/A | Code |
| Deep Arbitrary-Scale Image Super-Resolution via Scale-Equivariance Pursuit | Xiaohang Wang · Xuanhong Chen · Bingbing Ni · Hang Wang · Zhengyan Tong · Yutian Liu | N/A | Code |
| Looking Through the Glass: Neural Surface Reconstruction Against High Specular Reflections | Jiaxiong Qiu · Peng-Tao Jiang · Yifan Zhu · Ze-Xin Yin · Ming-Ming Cheng · Bo Ren | N/A | Code |
| The Wisdom of Crowds: Temporal Progressive Attention for Early Action Prediction | Alexandros Stergiou · Dima Damen | N/A | Code |
| Use Your Head: Improving Long-Tail Video Recognition | Toby Perrett · Saptarshi Sinha · Tilo Burghardt · Majid Mirmehdi · Dima Damen | N/A | Code |
| Large-Scale Training Data Search for Object Re-Identification | Yue Yao · Tom Gedeon · Liang Zheng | N/A | Code |
| Unsupervised Sampling Promoting for Stochastic Human Trajectory Prediction | Guangyi Chen · Zhenhao Chen · Shunxing Fan · Kun Zhang | N/A | Code |
| Seeing a Rose in Five Thousand Ways | Yunzhi Zhang · Shangzhe Wu · Noah Snavely · Jiajun Wu | N/A | Code |
| EditableNeRF: Editing Topologically Varying Neural Radiance Fields by Key Points | Chengwei Zheng · Wenbin Lin · Feng Xu | N/A | Code |
| Uncertainty-Aware Unsupervised Image Deblurring With Deep Residual Prior | Xiaole Tang · Xile Zhao · Jun Liu · Jianli Wang · Yuchun Miao · Tieyong Zeng | N/A | Code |
| Primitive Generation and Semantic-Related Alignment for Universal Zero-Shot Segmentation | Shuting He · Henghui Ding · Wei Jiang | N/A | Code |
| Long-Tailed Visual Recognition via Self-Heterogeneous Integration With Knowledge Excavation | Yan Jin · Mengke Li · Yang Lu · Yiu-ming Cheung · Hanzi Wang | N/A | Code |
| Neuron Structure Modeling for Generalizable Remote Physiological Measurement | Hao Lu · Zitong Yu · Xuesong Niu · Ying-Cong Chen | N/A | Code |
| Decoupled Semantic Prototypes Enable Learning From Diverse Annotation Types for Semi-Weakly Segmentation in Expert-Driven Domains | Simon Reiß · Constantin Seibold · Alexander Freytag · Erik Rodner · Rainer Stiefelhagen | N/A | Code |
| Learning a Sparse Transformer Network for Effective Image Deraining | Xiang Chen · Hao Li · Mingqiang Li · Jinshan Pan | N/A | Code |
| Camouflaged Object Detection With Feature Decomposition and Edge Reconstruction | Chunming He · Kai Li · Yachao Zhang · Longxiang Tang · Yulun Zhang · Zhenhua Guo · Xiu Li | N/A | Code |
| LOCATE: Localize and Transfer Object Parts for Weakly Supervised Affordance Grounding | Gen Li · Varun Jampani · Deqing Sun · Laura Sevilla-Lara | N/A | Code |
| DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation | Nataniel Ruiz · Yuanzhen Li · Varun Jampani · Yael Pritch · Michael Rubinstein · Kfir Aberman | N/A | Code |
| GeneCIS: A Benchmark for General Conditional Image Similarity | Sagar Vaze · Nicolas Carion · Ishan Misra | N/A | Code |
| Neighborhood Attention Transformer | Ali Hassani · Steven Walton · Jiachen Li · Shen Li · Humphrey Shi | N/A | Code |
| 3D-Aware Conditional Image Synthesis | Kangle Deng · Gengshan Yang · Deva Ramanan · Jun-Yan Zhu | N/A | Code |
| Magic3D: High-Resolution Text-to-3D Content Creation | Chen-Hsuan Lin · Jun Gao · Luming Tang · Towaki Takikawa · Xiaohui Zeng · Xun Huang · Karsten Kreis · Sanja Fidler · Ming-Yu Liu · Tsung-Yi Lin | N/A | Code |
| QuantArt: Quantizing Image Style Transfer Towards High Visual Fidelity | Siyu Huang · Jie An · Donglai Wei · Jiebo Luo · Hanspeter Pfister | N/A | Code |
| SceneComposer: Any-Level Semantic Image Synthesis | Yu Zeng · Zhe Lin · Jianming Zhang · Qing Liu · John Collomosse · Jason Kuen · Vishal M. Patel | N/A | Code |
| Specialist Diffusion: Plug-and-Play Sample-Efficient Fine-Tuning of Text-to-Image Diffusion Models To Learn Any Unseen Style | Haoming Lu · Hazarapet Tunanyan · Kai Wang · Shant Navasardyan · Zhangyang Wang · Humphrey Shi | N/A | Code |
| In-Hand 3D Object Scanning From an RGB Sequence | Shreyas Hampali · Tomas Hodan · Luan Tran · Lingni Ma · Cem Keskin · Vincent Lepetit | N/A | Code |
| SHS-Net: Learning Signed Hyper Surfaces for Oriented Normal Estimation of Point Clouds | Qing Li · Huifang Feng · Kanle Shi · Yue Gao · Yi Fang · Yu-Shen Liu · Zhizhong Han | N/A | Code |
| Advancing Visual Grounding With Scene Knowledge: Benchmark and Method | Zhihong Chen · Ruifei Zhang · Yibing Song · Xiang Wan · Guanbin Li | N/A | Code |
| Putting People in Their Place: Affordance-Aware Human Insertion Into Scenes | Sumith Kulal · Tim Brooks · Alex Aiken · Jiajun Wu · Jimei Yang · Jingwan Lu · Alexei A. Efros · Krishna Kumar Singh | N/A | Code |
| Identity-Preserving Talking Face Generation With Landmark and Appearance Priors | Weizhi Zhong · Chaowei Fang · Yinqi Cai · Pengxu Wei · Gangming Zhao · Liang Lin · Guanbin Li | N/A | Code |
| Less Is More: Reducing Task and Model Complexity for 3D Point Cloud Semantic Segmentation | Li Li · Hubert P. H. Shum · Toby P. Breckon | N/A | Code |
| FAC: 3D Representation Learning via Foreground Aware Feature Contrast | Kangcheng Liu · Aoran Xiao · Xiaoqin Zhang · Shijian Lu · Ling Shao | N/A | Code |
| InstMove: Instance Motion for Object-Centric Video Segmentation | Qihao Liu · Junfeng Wu · Yi Jiang · Xiang Bai · Alan L. Yuille · Song Bai | N/A | Code |
| Are We Ready for Vision-Centric Driving Streaming Perception? The ASAP Benchmark | Xiaofeng Wang · Zheng Zhu · Yunpeng Zhang · Guan Huang · Yun Ye · Wenbo Xu · Ziwei Chen · Xingang Wang | N/A | Code |
| Self-Supervised Non-Uniform Kernel Estimation With Flow-Based Motion Prior for Blind Image Deblurring | Zhenxuan Fang · Fangfang Wu · Weisheng Dong · Xin Li · Jinjian Wu · Guangming Shi | N/A | Code |
| Neural Kernel Surface Reconstruction | Jiahui Huang · Zan Gojcic · Matan Atzmon · Or Litany · Sanja Fidler · Francis Williams | N/A | Code |
| Binary Latent Diffusion | Ze Wang · Jiang Wang · Zicheng Liu · Qiang Qiu | N/A | Code |
| Learning To Dub Movies via Hierarchical Prosody Models | Gaoxiang Cong · Liang Li · Yuankai Qi · Zheng-Jun Zha · Qi Wu · Wenyu Wang · Bin Jiang · Ming-Hsuan Yang · Qingming Huang | N/A | Code |
| Learning Geometric-Aware Properties in 2D Representation Using Lightweight CAD Models, or Zero Real 3D Pairs | Pattaramanee Arsomngern · Sarana Nutanong · Supasorn Suwajanakorn | N/A | Code |
| FrustumFormer: Adaptive Instance-Aware Resampling for Multi-View 3D Detection | Yuqi Wang · Yuntao Chen · Zhaoxiang Zhang | N/A | Code |
| Q: How To Specialize Large Vision-Language Models to Data-Scarce VQA Tasks? A: Self-Train on Unlabeled Images! | Zaid Khan · Vijay Kumar BG · Samuel Schulter · Xiang Yu · Yun Fu · Manmohan Chandraker | N/A | Code |
| StyleRes: Transforming the Residuals for Real Image Editing With StyleGAN | Hamza Pehlivan · Yusuf Dalva · Aysegul Dundar | N/A | Code |
| PVT-SSD: Single-Stage 3D Object Detector With Point-Voxel Transformer | Honghui Yang · Wenxiao Wang · Minghao Chen · Binbin Lin · Tong He · Hua Chen · Xiaofei He · Wanli Ouyang | N/A | Code |
| Boosting Verified Training for Robust Image Classifications via Abstraction | Zhaodi Zhang · Zhiyi Xue · Yang Chen · Si Liu · Yueling Zhang · Jing Liu · Min Zhang | N/A | Code |
| Interactive Segmentation As Gaussian Process Classification | Minghao Zhou · Hong Wang · Qian Zhao · Yuexiang Li · Yawen Huang · Deyu Meng · Yefeng Zheng | N/A | Code |
| OSRT: Omnidirectional Image Super-Resolution With Distortion-Aware Transformer | Fanghua Yu · Xintao Wang · Mingdeng Cao · Gen Li · Ying Shan · Chao Dong | N/A | Code |
| Accelerating Vision-Language Pretraining With Free Language Modeling | Teng Wang · Yixiao Ge · Feng Zheng · Ran Cheng · Ying Shan · Xiaohu Qie · Ping Luo | N/A | Code |
| TexPose: Neural Texture Learning for Self-Supervised 6D Object Pose Estimation | Hanzhi Chen · Fabian Manhardt · Nassir Navab · Benjamin Busam | N/A | Code |
| Learning With Noisy Labels via Self-Supervised Adversarial Noisy Masking | Yuanpeng Tu · Boshen Zhang · Yuxi Li · Liang Liu · Jian Li · Jiangning Zhang · Yabiao Wang · Chengjie Wang · Cai Rong Zhao | N/A | Code |
| HelixSurf: A Robust and Efficient Neural Implicit Surface Learning of Indoor Scenes With Iterative Intertwined Regularization | Zhihao Liang · Zhangjin Huang · Changxing Ding · Kui Jia | N/A | Code |
| Multi-Space Neural Radiance Fields | Ze-Xin Yin · Jiaxiong Qiu · Ming-Ming Cheng · Bo Ren | N/A | Code |
| MSF: Motion-Guided Sequential Fusion for Efficient 3D Object Detection From Point Cloud Sequences | Chenhang He · Ruihuang Li · Yabin Zhang · Shuai Li · Lei Zhang | N/A | Code |
| DLBD: A Self-Supervised Direct-Learned Binary Descriptor | Bin Xiao · Yang Hu · Bo Liu · Xiuli Bi · Weisheng Li · Xinbo Gao | N/A | Code |
| Connecting the Dots: Floorplan Reconstruction Using Two-Level Queries | Yuanwen Yue · Theodora Kontogianni · Konrad Schindler · Francis Engelmann | N/A | Code |
| PointAvatar: Deformable Point-Based Head Avatars From Videos | Yufeng Zheng · Wang Yifan · Gordon Wetzstein · Michael J. Black · Otmar Hilliges | N/A | Code |
| Diffusion-SDF: Text-To-Shape via Voxelized Diffusion | Muheng Li · Yueqi Duan · Jie Zhou · Jiwen Lu | N/A | Code |
| NeRF-RPN: A General Framework for Object Detection in NeRFs | Benran Hu · Junkai Huang · Yichen Liu · Yu-Wing Tai · Chi-Keung Tang | N/A | Code |
| CNVid-3.5M: Build, Filter, and Pre-Train the Large-Scale Public Chinese Video-Text Dataset | Tian Gan · Qing Wang · Xingning Dong · Xiangyuan Ren · Liqiang Nie · Qingpei Guo | N/A | Code |
| Vid2Avatar: 3D Avatar Reconstruction From Videos in the Wild via Self-Supervised Scene Decomposition | Chen Guo · Tianjian Jiang · Xu Chen · Jie Song · Otmar Hilliges | N/A | Code |
| Neural Preset for Color Style Transfer | Zhanghan Ke · Yuhao Liu · Lei Zhu · Nanxuan Zhao · Rynson W.H. Lau | N/A | Code |
| GRES: Generalized Referring Expression Segmentation | Chang Liu · Henghui Ding · Xudong Jiang | N/A | Code |
| Tracking Through Containers and Occluders in the Wild | Basile Van Hoorick · Pavel Tokmakov · Simon Stent · Jie Li · Carl Vondrick | N/A | Code |
| DepGraph: Towards Any Structural Pruning | Gongfan Fang · Xinyin Ma · Mingli Song · Michael Bi Mi · Xinchao Wang | N/A | Code |
| Exploring Incompatible Knowledge Transfer in Few-Shot Image Generation | Yunqing Zhao · Chao Du · Milad Abdollahzadeh · Tianyu Pang · Min Lin · Shuicheng Yan · Ngai-Man Cheung | N/A | Code |
| RGB No More: Minimally-Decoded JPEG Vision Transformers | Jeongsoo Park · Justin Johnson | N/A | Code |
| iQuery: Instruments As Queries for Audio-Visual Sound Separation | Jiaben Chen · Renrui Zhang · Dongze Lian · Jiaqi Yang · Ziyao Zeng · Jianbo Shi | N/A | Code |
| Towards Professional Level Crowd Annotation of Expert Domain Data | Pei Wang · Nuno Vasconcelos | N/A | Code |
| VideoTrack: Learning To Track Objects via Video Transformer | Fei Xie · Lei Chu · Jiahao Li · Yan Lu · Chao Ma | N/A | Code |
| SCoDA: Domain Adaptive Shape Completion for Real Scans | Yushuang Wu · Zizheng Yan · Ce Chen · Lai Wei · Xiao Li · Guanbin Li · Yihao Li · Shuguang Cui · Xiaoguang Han | N/A | Code |
| Enhanced Training of Query-Based Object Detection via Selective Query Recollection | Fangyi Chen · Han Zhang · Kai Hu · Yu-Kai Huang · Chenchen Zhu · Marios Savvides | N/A | Code |
| LaserMix for Semi-Supervised LiDAR Semantic Segmentation | Lingdong Kong · Jiawei Ren · Liang Pan · Ziwei Liu | N/A | Code |
| MSMDFusion: Fusing LiDAR and Camera at Multiple Scales With Multi-Depth Seeds for 3D Object Detection | Yang Jiao · Zequn Jie · Shaoxiang Chen · Jingjing Chen · Lin Ma · Yu-Gang Jiang | N/A | Code |
| Learning With Fantasy: Semantic-Aware Virtual Contrastive Constraint for Few-Shot Class-Incremental Learning | Zeyin Song · Yifan Zhao · Yujun Shi · Peixi Peng · Li Yuan · Yonghong Tian | N/A | Code |
| PLA: Language-Driven Open-Vocabulary 3D Scene Understanding | Runyu Ding · Jihan Yang · Chuhui Xue · Wenqing Zhang · Song Bai · Xiaojuan Qi | N/A | Code |
| Being Comes From Not-Being: Open-Vocabulary Text-to-Motion Generation With Wordless Training | Junfan Lin · Jianlong Chang · Lingbo Liu · Guanbin Li · Liang Lin · Qi Tian · Chang-Wen Chen | N/A | Code |
| A Dynamic Multi-Scale Voxel Flow Network for Video Prediction | Xiaotao Hu · Zhewei Huang · Ailin Huang · Jun Xu · Shuchang Zhou | N/A | Code |
| Neural Dependencies Emerging From Learning Massive Categories | Ruili Feng · Kecheng Zheng · Kai Zhu · Yujun Shen · Jian Zhao · Yukun Huang · Deli Zhao · Jingren Zhou · Michael Jordan · Zheng-Jun Zha | N/A | Code |
| Diverse Embedding Expansion Network and Low-Light Cross-Modality Benchmark for Visible-Infrared Person Re-Identification | Yukang Zhang · Hanzi Wang | N/A | Code |
| Neural Kaleidoscopic Space Sculpting | Byeongjoo Ahn · Michael De Zeeuw · Ioannis Gkioulekas · Aswin C. Sankaranarayanan | N/A | Code |
| PyramidFlow: High-Resolution Defect Contrastive Localization Using Pyramid Normalizing Flow | Jiarui Lei · Xiaobo Hu · Yue Wang · Dong Liu | N/A | Code |
| Masked Motion Encoding for Self-Supervised Video Representation Learning | Xinyu Sun · Peihao Chen · Liangwei Chen · Changhao Li · Thomas H. Li · Mingkui Tan · Chuang Gan | N/A | Code |
| StyleSync: High-Fidelity Generalized and Personalized Lip Sync in Style-Based Generator | Jiazhi Guan · Zhanwang Zhang · Hang Zhou · Tianshu Hu · Kaisiyuan Wang · Dongliang He · Haocheng Feng · Jingtuo Liu · Errui Ding · Ziwei Liu · Jingdong Wang | N/A | Code |
| LiDAR2Map: In Defense of LiDAR-Based Semantic Map Construction Using Online Camera Distillation | Song Wang · Wentong Li · Wenyu Liu · Xiaolu Liu · Jianke Zhu | N/A | Code |
| Symmetric Shape-Preserving Autoencoder for Unsupervised Real Scene Point Cloud Completion | Changfeng Ma · Yinuo Chen · Pengxiao Guo · Jie Guo · Chongjun Wang · Yanwen Guo | N/A | Code |
| Boosting Detection in Crowd Analysis via Underutilized Output Features | Shaokai Wu · Fengyu Yang | N/A | Code |
| Representation Learning for Visual Object Tracking by Masked Appearance Transfer | Haojie Zhao · Dong Wang · Huchuan Lu | N/A | Code |
| NeuralLift-360: Lifting an In-the-Wild 2D Photo to a 3D Object With 360° Views | Dejia Xu · Yifan Jiang · Peihao Wang · Zhiwen Fan · Yi Wang · Zhangyang Wang | N/A | Code |
| DoNet: Deep De-Overlapping Network for Cytology Instance Segmentation | Hao Jiang · Rushan Zhang · Yanning Zhou · Yumeng Wang · Hao Chen | N/A | Code |
| Think Twice Before Driving: Towards Scalable Decoders for End-to-End Autonomous Driving | Xiaosong Jia · Penghao Wu · Li Chen · Jiangwei Xie · Conghui He · Junchi Yan · Hongyang Li | N/A | Code |
| Adversarial Counterfactual Visual Explanations | Guillaume Jeanneret · Loïc Simon · Frédéric Jurie | N/A | Code |
| ALOFT: A Lightweight MLP-Like Architecture With Dynamic Low-Frequency Transform for Domain Generalization | Jintao Guo · Na Wang · Lei Qi · Yinghuan Shi | N/A | Code |
| ShadowNeuS: Neural SDF Reconstruction by Shadow Ray Supervision | Jingwang Ling · Zhibo Wang · Feng Xu | N/A | Code |
| Coaching a Teachable Student | Jimuyang Zhang · Zanming Huang · Eshed Ohn-Bar | N/A | Code |
| POTTER: Pooling Attention Transformer for Efficient Human Mesh Recovery | Ce Zheng · Xianpeng Liu · Guo-Jun Qi · Chen Chen | N/A | Code |
| Layout-Based Causal Inference for Object Navigation | Sixian Zhang · Xinhang Song · Weijie Li · Yubing Bai · Xinyao Yu · Shuqiang Jiang | N/A | Code |
| Towards Bridging the Performance Gaps of Joint Energy-Based Models | Xiulong Yang · Qing Su · Shihao Ji | N/A | Code |
| Bringing Inputs to Shared Domains for 3D Interacting Hands Recovery in the Wild | Gyeongsik Moon | N/A | Code |
| Trajectory-Aware Body Interaction Transformer for Multi-Person Pose Forecasting | Xiaogang Peng · Siyuan Mao · Zizhao Wu | N/A | Code |
| Distilling Vision-Language Pre-Training To Collaborate With Weakly-Supervised Temporal Action Localization | Chen Ju · Kunhao Zheng · Jinxiang Liu · Peisen Zhao · Ya Zhang · Jianlong Chang · Qi Tian · Yanfeng Wang | N/A | Code |
| DiffPose: Toward More Reliable 3D Pose Estimation | Jia Gong · Lin Geng Foo · Zhipeng Fan · Qiuhong Ke · Hossein Rahmani · Jun Liu | N/A | Code |
| SQUID: Deep Feature In-Painting for Unsupervised Anomaly Detection | Tiange Xiang · Yixiao Zhang · Yongyi Lu · Alan L. Yuille · Chaoyi Zhang · Weidong Cai · Zongwei Zhou | N/A | Code |
| On the Difficulty of Unpaired Infrared-to-Visible Video Translation: Fine-Grained Content-Rich Patches Transfer | Zhenjie Yu · Shuang Li · Yirui Shen · Chi Harold Liu · Shuigen Wang | N/A | Code |
| CABM: Content-Aware Bit Mapping for Single Image Super-Resolution Network With Large Input | Senmao Tian · Ming Lu · Jiaming Liu · Yandong Guo · Yurong Chen · Shunli Zhang | N/A | Code |
| NeRF-DS: Neural Radiance Fields for Dynamic Specular Objects | Zhiwen Yan · Chen Li · Gim Hee Lee | N/A | Code |
| Spring: A High-Resolution High-Detail Dataset and Benchmark for Scene Flow, Optical Flow and Stereo | Lukas Mehl · Jenny Schmalfuss · Azin Jahedi · Yaroslava Nalivayko · Andrés Bruhn | N/A | Code |
| Unifying Short and Long-Term Tracking With Graph Hierarchies | Orcun Cetintas · Guillem Brasó · Laura Leal-Taixé | N/A | Code |
| MixTeacher: Mining Promising Labels With Mixed Scale Teacher for Semi-Supervised Object Detection | Liang Liu · Boshen Zhang · Jiangning Zhang · Wuhao Zhang · Zhenye Gan · Guanzhong Tian · Wenbing Zhu · Yabiao Wang · Chengjie Wang | N/A | Code |
| A2J-Transformer: Anchor-to-Joint Transformer Network for 3D Interacting Hand Pose Estimation From a Single RGB Image | Changlong Jiang · Yang Xiao · Cunlin Wu · Mingyang Zhang · Jinghong Zheng · Zhiguo Cao · Joey Tianyi Zhou | N/A | Code |
| Efficient Mask Correction for Click-Based Interactive Image Segmentation | Fei Du · Jianlong Yuan · Zhibin Wang · Fan Wang | N/A | Code |
| OmniObject3D: Large-Vocabulary 3D Object Dataset for Realistic Perception, Reconstruction and Generation | Tong Wu · Jiarui Zhang · Xiao Fu · Yuxin Wang · Jiawei Ren · Liang Pan · Wayne Wu · Lei Yang · Jiaqi Wang · Chen Qian · Dahua Lin · Ziwei Liu | N/A | Code |
| LoGoNet: Towards Accurate 3D Object Detection With Local-to-Global Cross-Modal Fusion | Xin Li · Tao Ma · Yuenan Hou · Botian Shi · Yuchen Yang · Youquan Liu · Xingjiao Wu · Qin Chen · Yikang Li · Yu Qiao · Liang He | N/A | Code |
| 3D Registration With Maximal Cliques | Xiyu Zhang · Jiaqi Yang · Shikun Zhang · Yanning Zhang | N/A | Code |
| Inferring and Leveraging Parts From Object Shape for Improving Semantic Image Synthesis | Yuxiang Wei · Zhilong Ji · Xiaohe Wu · Jinfeng Bai · Lei Zhang · Wangmeng Zuo | N/A | Code |
| Frame-Event Alignment and Fusion Network for High Frame Rate Tracking | Jiqing Zhang · Yuanchen Wang · Wenxi Liu · Meng Li · Jinpeng Bai · Baocai Yin · Xin Yang | N/A | Code |
| Human Guided Ground-Truth Generation for Realistic Image Super-Resolution | Du Chen · Jie Liang · Xindong Zhang · Ming Liu · Hui Zeng · Lei Zhang | N/A | Code |
| Towards Building Self-Aware Object Detectors via Reliable Uncertainty Quantification and Calibration | Kemal Oksuz · Tom Joy · Puneet K. Dokania | N/A | Code |
| Generating Human Motion From Textual Descriptions With Discrete Representations | Jianrong Zhang · Yangsong Zhang · Xiaodong Cun · Yong Zhang · Hongwei Zhao · Hongtao Lu · Xi Shen · Ying Shan | N/A | Code |
| Curricular Contrastive Regularization for Physics-Aware Single Image Dehazing | Yu Zheng · Jiahui Zhan · Shengfeng He · Junyu Dong · Yong Du | N/A | Code |
| Learning Human Mesh Recovery in 3D Scenes | Zehong Shen · Zhi Cen · Sida Peng · Qing Shuai · Hujun Bao · Xiaowei Zhou | N/A | Code |
| Object-Aware Distillation Pyramid for Open-Vocabulary Object Detection | Luting Wang · Yi Liu · Penghui Du · Zihan Ding · Yue Liao · Qiaosong Qi · Biaolong Chen · Si Liu | N/A | Code |
| Towards Artistic Image Aesthetics Assessment: A Large-Scale Dataset and a New Method | Ran Yi · Haoyuan Tian · Zhihao Gu · Yu-Kun Lai · Paul L. Rosin | N/A | Code |
| SOOD: Towards Semi-Supervised Oriented Object Detection | Wei Hua · Dingkang Liang · Jingyu Li · Xiaolong Liu · Zhikang Zou · Xiaoqing Ye · Xiang Bai | N/A | Code |
| Spherical Transformer for LiDAR-Based 3D Recognition | Xin Lai · Yukang Chen · Fanbin Lu · Jianhui Liu · Jiaya Jia | N/A | Code |
| Data-Free Sketch-Based Image Retrieval | Abhra Chaudhuri · Ayan Kumar Bhunia · Yi-Zhe Song · Anjan Dutta | N/A | Code |
| CDDFuse: Correlation-Driven Dual-Branch Feature Decomposition for Multi-Modality Image Fusion | Zixiang Zhao · Haowen Bai · Jiangshe Zhang · Yulun Zhang · Shuang Xu · Zudi Lin · Radu Timofte · Luc Van Gool | N/A | Code |
| Proximal Splitting Adversarial Attack for Semantic Segmentation | Jérôme Rony · Jean-Christophe Pesquet · Ismail Ben Ayed | N/A | Code |
| NeuWigs: A Neural Dynamic Model for Volumetric Hair Capture and Animation | Ziyan Wang · Giljoo Nam · Tuur Stuyck · Stephen Lombardi · Chen Cao · Jason Saragih · Michael Zollhöfer · Jessica Hodgins · Christoph Lassner | N/A | Code |
| Exploring the Effect of Primitives for Compositional Generalization in Vision-and-Language | Chuanhao Li · Zhen Li · Chenchen Jing · Yunde Jia · Yuwei Wu | N/A | Code |
| 3D-Aware Face Swapping | Yixuan Li · Chao Ma · Yichao Yan · Wenhan Zhu · Xiaokang Yang | N/A | Code |
| Representing Volumetric Videos As Dynamic MLP Maps | Sida Peng · Yunzhi Yan · Qing Shuai · Hujun Bao · Xiaowei Zhou | N/A | Code |
| Rethinking the Approximation Error in 3D Surface Fitting for Point Cloud Normal Estimation | Hang Du · Xuejun Yan · Jingjing Wang · Di Xie · Shiliang Pu | N/A | Code |
| Rethinking Out-of-Distribution (OOD) Detection: Masked Image Modeling Is All You Need | Jingyao Li · Pengguang Chen · Zexin He · Shaozuo Yu · Shu Liu · Jiaya Jia | N/A | Code |
| Paint by Example: Exemplar-Based Image Editing With Diffusion Models | Binxin Yang · Shuyang Gu · Bo Zhang · Ting Zhang · Xuejin Chen · Xiaoyan Sun · Dong Chen · Fang Wen | N/A | Code |
| Referring Multi-Object Tracking | Dongming Wu · Wencheng Han · Tiancai Wang · Xingping Dong · Xiangyu Zhang · Jianbing Shen | N/A | Code |
| NerVE: Neural Volumetric Edges for Parametric Curve Extraction From Point Cloud | Xiangyu Zhu · Dong Du · Weikai Chen · Zhiyou Zhao · Yinyu Nie · Xiaoguang Han | N/A | Code |
| AsyFOD: An Asymmetric Adaptation Paradigm for Few-Shot Domain Adaptive Object Detection | Yipeng Gao · Kun-Yu Lin · Junkai Yan · Yaowei Wang · Wei-Shi Zheng | N/A | Code |
| CUF: Continuous Upsampling Filters | Cristina N. Vasconcelos · Cengiz Oztireli · Mark Matthews · Milad Hashemi · Kevin Swersky · Andrea Tagliasacchi | N/A | Code |
| MOTRv2: Bootstrapping End-to-End Multi-Object Tracking by Pretrained Object Detectors | Yuang Zhang · Tiancai Wang · Xiangyu Zhang | N/A | Code |
| CXTrack: Improving 3D Point Cloud Tracking With Contextual Information | Tian-Xing Xu · Yuan-Chen Guo · Yu-Kun Lai · Song-Hai Zhang | N/A | Code |
| Explicit Boundary Guided Semi-Push-Pull Contrastive Learning for Supervised Anomaly Detection | Xincheng Yao · Ruoqi Li · Jing Zhang · Jun Sun · Chongyang Zhang | N/A | Code |
| Learning Bottleneck Concepts in Image Classification | Bowen Wang · Liangzhi Li · Yuta Nakashima · Hajime Nagahara | N/A | Code |
| Zero-Shot Model Diagnosis | Jinqi Luo · Zhaoning Wang · Chen Henry Wu · Dong Huang · Fernando De la Torre | N/A | Code |
| DiffTalk: Crafting Diffusion Models for Generalized Audio-Driven Portraits Animation | Shuai Shen · Wenliang Zhao · Zibin Meng · Wanhua Li · Zheng Zhu · Jie Zhou · Jiwen Lu | N/A | Code |
| DAA: A Delta Age AdaIN Operation for Age Estimation via Binary Code Transformer | Ping Chen · Xingpeng Zhang · Ye Li · Ju Tao · Bin Xiao · Bing Wang · Zongjie Jiang | N/A | Code |
| TranSG: Transformer-Based Skeleton Graph Prototype Contrastive Learning With Structure-Trajectory Prompted Reconstruction for Person Re-Identification | Haocong Rao · Chunyan Miao | N/A | Code |
| Joint Visual Grounding and Tracking With Natural Language Specification | Li Zhou · Zikun Zhou · Kaige Mao · Zhenyu He | N/A | Code |
| Compressing Volumetric Radiance Fields to 1 MB | Lingzhi Li · Zhen Shen · Zhongshu Wang · Li Shen · Liefeng Bo | N/A | Code |
| HyperReel: High-Fidelity 6-DoF Video With Ray-Conditioned Sampling | Benjamin Attal · Jia-Bin Huang · Christian Richardt · Michael Zollhöfer · Johannes Kopf · Matthew O’Toole · Changil Kim | N/A | Code |
| Iterative Next Boundary Detection for Instance Segmentation of Tree Rings in Microscopy Images of Shrub Cross Sections | Alexander Gillert · Giulia Resente · Alba Anadon-Rosell · Martin Wilmking · Uwe Freiherr von Lukas | N/A | Code |
| Ego-Body Pose Estimation via Ego-Head Pose Estimation | Jiaman Li · Karen Liu · Jiajun Wu | N/A | Code |
| Learned Two-Plane Perspective Prior Based Image Resampling for Efficient Object Detection | Anurag Ghosh · N. Dinesh Reddy · Christoph Mertz · Srinivasa G. Narasimhan | N/A | Code |
| PaletteNeRF: Palette-Based Appearance Editing of Neural Radiance Fields | Zhengfei Kuang · Fujun Luan · Sai Bi · Zhixin Shu · Gordon Wetzstein · Kalyan Sunkavalli | N/A | Code |
| Long Range Pooling for 3D Large-Scale Scene Understanding | Xiang-Li Li · Meng-Hao Guo · Tai-Jiang Mu · Ralph R. Martin · Shi-Min Hu | N/A | Code |
| Dynamic Graph Enhanced Contrastive Learning for Chest X-Ray Report Generation | Mingjie Li · Bingqian Lin · Zicong Chen · Haokun Lin · Xiaodan Liang · Xiaojun Chang | N/A | Code |
| Event-Guided Person Re-Identification via Sparse-Dense Complementary Learning | Chengzhi Cao · Xueyang Fu · Hongjian Liu · Yukun Huang · Kunyu Wang · Jiebo Luo · Zheng-Jun Zha | N/A | Code |
| Contrastive Grouping With Transformer for Referring Image Segmentation | Jiajin Tang · Ge Zheng · Cheng Shi · Sibei Yang | N/A | Code |
| Structure Aggregation for Cross-Spectral Stereo Image Guided Denoising | Zehua Sheng · Zhu Yu · Xiongwei Liu · Si-Yuan Cao · Yuqi Liu · Hui-Liang Shen · Huaqi Zhang | N/A | Code |
| Where Is My Spot? Few-Shot Image Generation via Latent Subspace Optimization | Chenxi Zheng · Bangzhen Liu · Huaidong Zhang · Xuemiao Xu · Shengfeng He | N/A | Code |
| EDGE: Editable Dance Generation From Music | Jonathan Tseng · Rodrigo Castellon · Karen Liu | N/A | Code |
| PartSLIP: Low-Shot Part Segmentation for 3D Point Clouds via Pretrained Image-Language Models | Minghua Liu · Yinhao Zhu · Hong Cai · Shizhong Han · Zhan Ling · Fatih Porikli · Hao Su | N/A | Code |
| EDICT: Exact Diffusion Inversion via Coupled Transformations | Bram Wallace · Akash Gokul · Nikhil Naik | N/A | Code |
| Complete 3D Human Reconstruction From a Single Incomplete Image | Junying Wang · Jae Shin Yoon · Tuanfeng Y. Wang · Krishna Kumar Singh · Ulrich Neumann | N/A | Code |
| PartDistillation: Learning Parts From Instance Segmentation | Jang Hyun Cho · Philipp Krähenbühl · Vignesh Ramanathan | N/A | Code |
| Neural Vector Fields: Implicit Representation by Explicit Learning | Xianghui Yang · Guosheng Lin · Zhenghao Chen · Luping Zhou | N/A | Code |
| Unsupervised Inference of Signed Distance Functions From Single Sparse Point Clouds Without Learning Priors | Chao Chen · Yu-Shen Liu · Zhizhong Han | N/A | Code |
| Texts as Images in Prompt Tuning for Multi-Label Image Recognition | Zixian Guo · Bowen Dong · Zhilong Ji · Jinfeng Bai · Yiwen Guo · Wangmeng Zuo | N/A | Code |
| Grad-PU: Arbitrary-Scale Point Cloud Upsampling via Gradient Descent With Learned Distance Functions | Yun He · Danhang Tang · Yinda Zhang · Xiangyang Xue · Yanwei Fu | N/A | Code |
| MMANet: Margin-Aware Distillation and Modality-Aware Regularization for Incomplete Multimodal Learning | Shicai Wei · Chunbo Luo · Yang Luo | N/A | Code |
| Rethinking Optical Flow From Geometric Matching Consistent Perspective | Qiaole Dong · Chenjie Cao · Yanwei Fu | N/A | Code |
| FastInst: A Simple Query-Based Model for Real-Time Instance Segmentation | Junjie He · Pengyu Li · Yifeng Geng · Xuansong Xie | N/A | Code |
| How Can Objects Help Action Recognition? | Xingyi Zhou · Anurag Arnab · Chen Sun · Cordelia Schmid | N/A | Code |
| Images Speak in Images: A Generalist Painter for In-Context Visual Learning | Xinlong Wang · Wen Wang · Yue Cao · Chunhua Shen · Tiejun Huang | N/A | Code |
| SemiCVT: Semi-Supervised Convolutional Vision Transformer for Semantic Segmentation | Huimin Huang · Shiao Xie · Lanfen Lin · Ruofeng Tong · Yen-Wei Chen · Yuexiang Li · Hong Wang · Yawen Huang · Yefeng Zheng | N/A | Code |
| A Unified Pyramid Recurrent Network for Video Frame Interpolation | Xin Jin · Longhai Wu · Jie Chen · Youxin Chen · Jayoon Koo · Cheul-hee Hahm | N/A | Code |
| Enhancing the Self-Universality for Transferable Targeted Attacks | Zhipeng Wei · Jingjing Chen · Zuxuan Wu · Yu-Gang Jiang | N/A | Code |
| Multi-View Inverse Rendering for Large-Scale Real-World Indoor Scenes | Zhen Li · Lingli Wang · Mofang Cheng · Cihui Pan · Jiaqi Yang | N/A | Code |
| TAPS3D: Text-Guided 3D Textured Shape Generation From Pseudo Supervision | Jiacheng Wei · Hao Wang · Jiashi Feng · Guosheng Lin · Kim-Hui Yap | N/A | Code |
| Frequency-Modulated Point Cloud Rendering With Easy Editing | Yi Zhang · Xiaoyang Huang · Bingbing Ni · Teng Li · Wenjun Zhang | N/A | Code |
| Vector Quantization With Self-Attention for Quality-Independent Representation Learning | Zhou Yang · Weisheng Dong · Xin Li · Mengluan Huang · Yulin Sun · Guangming Shi | N/A | Code |
| Fine-Grained Face Swapping via Regional GAN Inversion | Zhian Liu · Maomao Li · Yong Zhang · Cairong Wang · Qi Zhang · Jue Wang · Yongwei Nie | N/A | Code |
| Backdoor Defense via Adaptively Splitting Poisoned Dataset | Kuofeng Gao · Yang Bai · Jindong Gu · Yong Yang · Shu-Tao Xia | N/A | Code |
| RGBD2: Generative Scene Synthesis via Incremental View Inpainting Using RGBD Diffusion Models | Jiabao Lei · Jiapeng Tang · Kui Jia | N/A | Code |
| CLAMP: Prompt-Based Contrastive Learning for Connecting Language and Animal Pose | Xu Zhang · Wen Wang · Zhe Chen · Yufei Xu · Jing Zhang · Dacheng Tao | N/A | Code |
| Fake It Till You Make It: Learning Transferable Representations From Synthetic ImageNet Clones | Mert Bülent Sarıyıldız · Karteek Alahari · Diane Larlus · Yannis Kalantidis | N/A | Code |
| Efficient Frequency Domain-Based Transformers for High-Quality Image Deblurring | Lingshun Kong · Jiangxin Dong · Jianjun Ge · Mingqiang Li · Jinshan Pan | N/A | Code |
| DartBlur: Privacy Preservation With Detection Artifact Suppression | Baowei Jiang · Bing Bai · Haozhe Lin · Yu Wang · Yuchen Guo · Lu Fang | N/A | Code |
| FCC: Feature Clusters Compression for Long-Tailed Visual Recognition | Jian Li · Ziyao Meng · Daqian Shi · Rui Song · Xiaolei Diao · Jingwen Wang · Hao Xu | N/A | Code |
| CLOTH4D: A Dataset for Clothed Human Reconstruction | Xingxing Zou · Xintong Han · Waikeung Wong | N/A | Code |
| LinK: Linear Kernel for LiDAR-Based 3D Perception | Tao Lu · Xiang Ding · Haisong Liu · Gangshan Wu · Limin Wang | N/A | Code |
| Hunting Sparsity: Density-Guided Contrastive Learning for Semi-Supervised Semantic Segmentation | Xiaoyang Wang · Bingfeng Zhang · Limin Yu · Jimin Xiao | N/A | Code |
| Collecting Cross-Modal Presence-Absence Evidence for Weakly-Supervised Audio-Visual Event Perception | Junyu Gao · Mengyuan Chen · Changsheng Xu | N/A | Code |
| LargeKernel3D: Scaling Up Kernels in 3D Sparse CNNs | Yukang Chen · Jianhui Liu · Xiangyu Zhang · Xiaojuan Qi · Jiaya Jia | N/A | Code |
| Deep Learning of Partial Graph Matching via Differentiable Top-K | Runzhong Wang · Ziao Guo · Shaofei Jiang · Xiaokang Yang · Junchi Yan | N/A | Code |
| Analyzing Physical Impacts Using Transient Surface Wave Imaging | Tianyuan Zhang · Mark Sheinin · Dorian Chan · Mark Rau · Matthew O’Toole · Srinivasa G. Narasimhan | N/A | Code |
| Rethinking Domain Generalization for Face Anti-Spoofing: Separability and Alignment | Yiyou Sun · Yaojie Liu · Xiaoming Liu · Yixuan Li · Wen-Sheng Chu | N/A | Code |
| A Simple Baseline for Video Restoration With Grouped Spatial-Temporal Shift | Dasong Li · Xiaoyu Shi · Yi Zhang · Ka Chun Cheung · Simon See · Xiaogang Wang · Hongwei Qin · Hongsheng Li | N/A | Code |
| The ObjectFolder Benchmark: Multisensory Learning With Neural and Real Objects | Ruohan Gao · Yiming Dou · Hao Li · Tanmay Agarwal · Jeannette Bohg · Yunzhu Li · Li Fei-Fei · Jiajun Wu | N/A | Code |
| PIRLNav: Pretraining With Imitation and RL Finetuning for ObjectNav | Ram Ramrakhya · Dhruv Batra · Erik Wijmans · Abhishek Das | N/A | Code |
| DC2: Dual-Camera Defocus Control by Learning To Refocus | Hadi Alzayer · Abdullah Abuolaim · Leung Chun Chan · Yang Yang · Ying Chen Lou · Jia-Bin Huang · Abhishek Kar | N/A | Code |
| Habitat-Matterport 3D Semantics Dataset | Karmesh Yadav · Ram Ramrakhya · Santhosh Kumar Ramakrishnan · Theo Gervet · John Turner · Aaron Gokaslan · Noah Maestre · Angel Xuan Chang · Dhruv Batra · Manolis Savva · Alexander William Clegg · Devendra Singh Chaplot | N/A | Code |
| Prompting Large Language Models With Answer Heuristics for Knowledge-Based Visual Question Answering | Zhenwei Shao · Zhou Yu · Meng Wang · Jun Yu | N/A | Code |
| Similarity Metric Learning for RGB-Infrared Group Re-Identification | Jianghao Xiong · Jianhuang Lai | N/A | Code |
| DPF: Learning Dense Prediction Fields With Weak Supervision | Xiaoxue Chen · Yuhang Zheng · Yupeng Zheng · Qiang Zhou · Hao Zhao · Guyue Zhou · Ya-Qin Zhang | N/A | Code |
| Mixed Autoencoder for Self-Supervised Visual Representation Learning | Kai Chen · Zhili Liu · Lanqing Hong · Hang Xu · Zhenguo Li · Dit-Yan Yeung | N/A | Code |
| Content-Aware Token Sharing for Efficient Semantic Segmentation With Vision Transformers | Chenyang Lu · Daan de Geus · Gijs Dubbelman | N/A | Code |
| NeuralEditor: Editing Neural Radiance Fields via Manipulating Point Clouds | Jun-Kun Chen · Jipeng Lyu · Yu-Xiong Wang | N/A | Code |
| Multiview Compressive Coding for 3D Reconstruction | Chao-Yuan Wu · Justin Johnson · Jitendra Malik · Christoph Feichtenhofer · Georgia Gkioxari | N/A | Code |
| Revisiting Weak-to-Strong Consistency in Semi-Supervised Semantic Segmentation | Lihe Yang · Lei Qi · Litong Feng · Wayne Zhang · Yinghuan Shi | N/A | Code |
| Delving Into Shape-Aware Zero-Shot Semantic Segmentation | Xinyu Liu · Beiwen Tian · Zhen Wang · Rui Wang · Kehua Sheng · Bo Zhang · Hao Zhao · Guyue Zhou | N/A | Code |
| Towards a Smaller Student: Capacity Dynamic Distillation for Efficient Image Retrieval | Yi Xie · Huaidong Zhang · Xuemiao Xu · Jianqing Zhu · Shengfeng He | N/A | Code |
| Bootstrapping Objectness From Videos by Relaxed Common Fate and Visual Grouping | Long Lian · Zhirong Wu · Stella X. Yu | N/A | Code |
| NeuralPCI: Spatio-Temporal Neural Field for 3D Point Cloud Multi-Frame Non-Linear Interpolation | Zehan Zheng · Danni Wu · Ruisi Lu · Fan Lu · Guang Chen · Changjun Jiang | N/A | Code |
| Complete-to-Partial 4D Distillation for Self-Supervised Point Cloud Sequence Representation Learning | Zhuoyang Zhang · Yuhao Dong · Yunze Liu · Li Yi | N/A | Code |
| GFIE: A Dataset and Baseline for Gaze-Following From 2D to 3D in Indoor Environments | Zhengxi Hu · Yuxue Yang · Xiaolin Zhai · Dingye Yang · Bohan Zhou · Jingtai Liu | N/A | Code |
| Cap4Video: What Can Auxiliary Captions Do for Text-Video Retrieval? | Wenhao Wu · Haipeng Luo · Bo Fang · Jingdong Wang · Wanli Ouyang | N/A | Code |
| Hierarchical Temporal Transformer for 3D Hand Pose Estimation and Action Recognition From Egocentric RGB Videos | Yilin Wen · Hao Pan · Lei Yang · Jia Pan · Taku Komura · Wenping Wang | N/A | Code |
| CAP-VSTNet: Content Affinity Preserved Versatile Style Transfer | Linfeng Wen · Chengying Gao · Changqing Zou | N/A | Code |
| Uncurated Image-Text Datasets: Shedding Light on Demographic Bias | Noa Garcia · Yusuke Hirota · Yankun Wu · Yuta Nakashima | N/A | Code |
| AltFreezing for More General Video Face Forgery Detection | Zhendong Wang · Jianmin Bao · Wengang Zhou · Weilun Wang · Houqiang Li | N/A | Code |
| Two-View Geometry Scoring Without Correspondences | Axel Barroso-Laguna · Eric Brachmann · Victor Adrian Prisacariu · Gabriel J. Brostow · Daniyar Turmukhambetov | N/A | Code |
| Task Difficulty Aware Parameter Allocation & Regularization for Lifelong Learning | Wenjin Wang · Yunqing Hu · Qianglong Chen · Yin Zhang | N/A | Code |
| Revisiting Prototypical Network for Cross Domain Few-Shot Learning | Fei Zhou · Peng Wang · Lei Zhang · Wei Wei · Yanning Zhang | N/A | Code |
| Federated Incremental Semantic Segmentation | Jiahua Dong · Duzhen Zhang · Yang Cong · Wei Cong · Henghui Ding · Dengxin Dai | N/A | Code |
| Self-Supervised Learning for Multimodal Non-Rigid 3D Shape Matching | Dongliang Cao · Florian Bernard | N/A | Code |
| Exploring the Relationship Between Architectural Design and Adversarially Robust Generalization | Aishan Liu · Shiyu Tang · Siyuan Liang · Ruihao Gong · Boxi Wu · Xianglong Liu · Dacheng Tao | N/A | Code |
| Video-Text As Game Players: Hierarchical Banzhaf Interaction for Cross-Modal Representation Learning | Peng Jin · Jinfa Huang · Pengfei Xiong · Shangxuan Tian · Chang Liu · Xiangyang Ji · Li Yuan · Jie Chen | N/A | Code |
| pCON: Polarimetric Coordinate Networks for Neural Scene Representations | Henry Peters · Yunhao Ba · Achuta Kadambi | N/A | Code |
| RIAV-MVS: Recurrent-Indexing an Asymmetric Volume for Multi-View Stereo | Changjiang Cai · Pan Ji · Qingan Yan · Yi Xu | N/A | Code |
| Depth Estimation From Camera Image and mmWave Radar Point Cloud | Akash Deep Singh · Yunhao Ba · Ankur Sarker · Howard Zhang · Achuta Kadambi · Stefano Soatto · Mani Srivastava · Alex Wong | N/A | Code |
| Normal-Guided Garment UV Prediction for Human Re-Texturing | Yasamin Jafarian · Tuanfeng Y. Wang · Duygu Ceylan · Jimei Yang · Nathan Carr · Yi Zhou · Hyun Soo Park | N/A | Code |
| WeatherStream: Light Transport Automation of Single Image Deweathering | Howard Zhang · Yunhao Ba · Ethan Yang · Varan Mehra · Blake Gella · Akira Suzuki · Arnold Pfahnl · Chethan Chinder Chandrappa · Alex Wong · Achuta Kadambi | N/A | Code |
| MobileBrick: Building LEGO for 3D Reconstruction on Mobile Devices | Kejie Li · Jia-Wang Bian · Robert Castle · Philip H.S. Torr · Victor Adrian Prisacariu | N/A | Code |
| Ultrahigh Resolution Image/Video Matting With Spatio-Temporal Sparsity | Yanan Sun · Chi-Keung Tang · Yu-Wing Tai | N/A | Code |
| Hierarchical Supervision and Shuffle Data Augmentation for 3D Semi-Supervised Object Detection | Chuandong Liu · Chenqiang Gao · Fangcen Liu · Pengcheng Li · Deyu Meng · Xinbo Gao | N/A | Code |
| PATS: Patch Area Transportation With Subdivision for Local Feature Matching | Junjie Ni · Yijin Li · Zhaoyang Huang · Hongsheng Li · Hujun Bao · Zhaopeng Cui · Guofeng Zhang | N/A | Code |
| SINE: Semantic-Driven Image-Based NeRF Editing With Prior-Guided Editing Field | Chong Bao · Yinda Zhang · Bangbang Yang · Tianxing Fan · Zesong Yang · Hujun Bao · Guofeng Zhang · Zhaopeng Cui | N/A | Code |
| GeoNet: Benchmarking Unsupervised Adaptation Across Geographies | Tarun Kalluri · Wangdong Xu · Manmohan Chandraker | N/A | Code |
| Joint HDR Denoising and Fusion: A Real-World Mobile HDR Image Dataset | Shuaizheng Liu · Xindong Zhang · Lingchen Sun · Zhetong Liang · Hui Zeng · Lei Zhang | N/A | Code |
| 3D-Aware Object Goal Navigation via Simultaneous Exploration and Identification | Jiazhao Zhang · Liu Dai · Fanpeng Meng · Qingnan Fan · Xuelin Chen · Kai Xu · He Wang | N/A | Code |
| Delving Into Discrete Normalizing Flows on SO(3) Manifold for Probabilistic Rotation Modeling | Yulin Liu · Haoran Liu · Yingda Yin · Yang Wang · Baoquan Chen · He Wang | N/A | Code |
| RILS: Masked Visual Reconstruction in Language Semantic Space | Shusheng Yang · Yixiao Ge · Kun Yi · Dian Li · Ying Shan · Xiaohu Qie · Xinggang Wang | N/A | Code |
| ConQueR: Query Contrast Voxel-DETR for 3D Object Detection | Benjin Zhu · Zhe Wang · Shaoshuai Shi · Hang Xu · Lanqing Hong · Hongsheng Li | N/A | Code |
| PREIM3D: 3D Consistent Precise Image Attribute Editing From a Single Image | Jianhui Li · Jianmin Li · Haoji Zhang · Shilong Liu · Zhengyi Wang · Zihao Xiao · Kaiwen Zheng · Jun Zhu | N/A | Code |
| Bridging Search Region Interaction With Template for RGB-T Tracking | Tianrui Hui · Zizheng Xun · Fengguang Peng · Junshi Huang · Xiaoming Wei · Xiaolin Wei · Jiao Dai · Jizhong Han · Si Liu | N/A | Code |
| Improving Weakly Supervised Temporal Action Localization by Bridging Train-Test Gap in Pseudo Labels | Jingqiu Zhou · Linjiang Huang · Liang Wang · Si Liu · Hongsheng Li | N/A | Code |
| Learning To Zoom and Unzoom | Chittesh Thavamani · Mengtian Li · Francesco Ferroni · Deva Ramanan | N/A | Code |
| MaLP: Manipulation Localization Using a Proactive Scheme | Vishal Asnani · Xi Yin · Tal Hassner · Xiaoming Liu | N/A | Code |
| Logical Consistency and Greater Descriptive Power for Facial Hair Attribute Learning | Haiyu Wu · Grace Bezold · Aman Bhatta · Kevin W. Bowyer | N/A | Code |
| Visual-Tactile Sensing for In-Hand Object Reconstruction | Wenqiang Xu · Zhenjun Yu · Han Xue · Ruolin Ye · Siqiong Yao · Cewu Lu | N/A | Code |
| Filtering, Distillation, and Hard Negatives for Vision-Language Pre-Training | Filip Radenovic · Abhimanyu Dubey · Abhishek Kadian · Todor Mihaylov · Simon Vandenhende · Yash Patel · Yi Wen · Vignesh Ramanathan · Dhruv Mahajan | N/A | Code |
| Semi-Supervised Domain Adaptation With Source Label Adaptation | Yu-Chu Yu · Hsuan-Tien Lin | N/A | Code |
| Self-Supervised Video Forensics by Audio-Visual Anomaly Detection | Chao Feng · Ziyang Chen · Andrew Owens | N/A | Code |
| IterativePFN: True Iterative Point Cloud Filtering | Dasith de Silva Edirimuni · Xuequan Lu · Zhiwen Shao · Gang Li · Antonio Robles-Kelly · Ying He | N/A | Code |
| Joint Video Multi-Frame Interpolation and Deblurring Under Unknown Exposure Time | Wei Shang · Dongwei Ren · Yi Yang · Hongzhi Zhang · Kede Ma · Wangmeng Zuo | N/A | Code |
| Three Guidelines You Should Know for Universally Slimmable Self-Supervised Learning | Yun-Hao Cao · Peiqin Sun · Shuchang Zhou | N/A | Code |
| Standing Between Past and Future: Spatio-Temporal Modeling for Multi-Camera 3D Multi-Object Tracking | Ziqi Pang · Jie Li · Pavel Tokmakov · Dian Chen · Sergey Zagoruyko · Yu-Xiong Wang | N/A | Code |
| VecFontSDF: Learning To Reconstruct and Synthesize High-Quality Vector Fonts via Signed Distance Functions | Zeqing Xia · Bojun Xiong · Zhouhui Lian | N/A | Code |
| Towards Better Gradient Consistency for Neural Signed Distance Functions via Level Set Alignment | Baorui Ma · Junsheng Zhou · Yu-Shen Liu · Zhizhong Han | N/A | Code |
| Visual-Language Prompt Tuning With Knowledge-Guided Context Optimization | Hantao Yao · Rui Zhang · Changsheng Xu | N/A | Code |
| Compositor: Bottom-Up Clustering and Compositing for Robust Part and Object Segmentation | Ju He · Jieneng Chen · Ming-Xian Lin · Qihang Yu · Alan L. Yuille | N/A | Code |
| Physics-Guided ISO-Dependent Sensor Noise Modeling for Extreme Low-Light Photography | Yue Cao · Ming Liu · Shuai Liu · Xiaotao Wang · Lei Lei · Wangmeng Zuo | N/A | Code |
| Dynamic Focus-Aware Positional Queries for Semantic Segmentation | Haoyu He · Jianfei Cai · Zizheng Pan · Jing Liu · Jing Zhang · Dacheng Tao · Bohan Zhuang | N/A | Code |
| Generic-to-Specific Distillation of Masked Autoencoders | Wei Huang · Zhiliang Peng · Li Dong · Furu Wei · Jianbin Jiao · Qixiang Ye | N/A | Code |
| Benchmarking Robustness of 3D Object Detection to Common Corruptions | Yinpeng Dong · Caixin Kang · Jinlai Zhang · Zijian Zhu · Yikai Wang · Xiao Yang · Hang Su · Xingxing Wei · Jun Zhu | N/A | Code |
| GarmentTracking: Category-Level Garment Pose Tracking | Han Xue · Wenqiang Xu · Jieyi Zhang · Tutian Tang · Yutong Li · Wenxin Du · Ruolin Ye · Cewu Lu | N/A | Code |
| TrojDiff: Trojan Attacks on Diffusion Models With Diverse Targets | Weixin Chen · Dawn Song · Bo Li | N/A | Code |
| Weakly Supervised Video Representation Learning With Unaligned Text for Sequential Videos | Sixun Dong · Huazhang Hu · Dongze Lian · Weixin Luo · Yicheng Qian · Shenghua Gao | N/A | Code |
| Generalized Deep 3D Shape Prior via Part-Discretized Diffusion Process | Yuhan Li · Yishun Dou · Xuanhong Chen · Bingbing Ni · Yilin Sun · Yutian Liu · Fuzhen Wang | N/A | Code |
| SpaText: Spatio-Textual Representation for Controllable Image Generation | Omri Avrahami · Thomas Hayes · Oran Gafni · Sonal Gupta · Yaniv Taigman · Devi Parikh · Dani Lischinski · Ohad Fried · Xi Yin | N/A | Code |
| Watch or Listen: Robust Audio-Visual Speech Recognition With Visual Corruption Modeling and Reliability Scoring | Joanna Hong · Minsu Kim · Jeongsoo Choi · Yong Man Ro | N/A | Code |
| RenderDiffusion: Image Diffusion for 3D Reconstruction, Inpainting and Generation | Titas Anciukevičius · Zexiang Xu · Matthew Fisher · Paul Henderson · Hakan Bilen · Niloy J. Mitra · Paul Guerrero | N/A | Code |
| Self-Supervised 3D Scene Flow Estimation Guided by Superpoints | Yaqi Shen · Le Hui · Jin Xie · Jian Yang | N/A | Code |
| Adaptive Annealing for Robust Geometric Estimation | Chitturi Sidhartha · Lalit Manam · Venu Madhav Govindu | N/A | Code |
| Spectral Enhanced Rectangle Transformer for Hyperspectral Image Denoising | Miaoyu Li · Ji Liu · Ying Fu · Yulun Zhang · Dejing Dou | N/A | Code |
| Partial Network Cloning | Jingwen Ye · Songhua Liu · Xinchao Wang | N/A | Code |
| Twin Contrastive Learning With Noisy Labels | Zhizhong Huang · Junping Zhang · Hongming Shan | N/A | Code |
| Ambiguous Medical Image Segmentation Using Diffusion Models | Aimon Rahman · Jeya Maria Jose Valanarasu · Ilker Hacihaliloglu · Vishal M. Patel | N/A | Code |
| High-Res Facial Appearance Capture From Polarized Smartphone Images | Dejan Azinović · Olivier Maury · Christophe Hery · Matthias Nießner · Justus Thies | N/A | Code |
| AssemblyHands: Towards Egocentric Activity Understanding via 3D Hand Pose Estimation | Takehiko Ohkawa · Kun He · Fadime Sener · Tomas Hodan · Luan Tran · Cem Keskin | N/A | Code |
| EXIF As Language: Learning Cross-Modal Associations Between Images and Camera Metadata | Chenhao Zheng · Ayush Shrivastava · Andrew Owens | N/A | Code |
| Latent-NeRF for Shape-Guided Generation of 3D Shapes and Textures | Gal Metzer · Elad Richardson · Or Patashnik · Raja Giryes · Daniel Cohen-Or | N/A | Code |
| Rebalancing Batch Normalization for Exemplar-Based Class-Incremental Learning | Sungmin Cha · Sungjun Cho · Dasol Hwang · Sunwon Hong · Moontae Lee · Taesup Moon | N/A | Code |
| Progressive Neighbor Consistency Mining for Correspondence Pruning | Xin Liu · Jufeng Yang | N/A | Code |
| Post-Training Quantization on Diffusion Models | Yuzhang Shang · Zhihang Yuan · Bin Xie · Bingzhe Wu · Yan Yan | N/A | Code |
| Fully Self-Supervised Depth Estimation From Defocus Clue | Haozhe Si · Bin Zhao · Dong Wang · Yunpeng Gao · Mulin Chen · Zhigang Wang · Xuelong Li | N/A | Code |
| Curricular Object Manipulation in LiDAR-Based Object Detection | Ziyue Zhu · Qiang Meng · Xiao Wang · Ke Wang · Liujiang Yan · Jian Yang | N/A | Code |
| Adaptive Assignment for Geometry Aware Local Feature Matching | Dihe Huang · Ying Chen · Yong Liu · Jianlin Liu · Shang Xu · Wenlong Wu · Yikang Ding · Fan Tang · Chengjie Wang | N/A | Code |
| RefCLIP: A Universal Teacher for Weakly Supervised Referring Expression Comprehension | Lei Jin · Gen Luo · Yiyi Zhou · Xiaoshuai Sun · Guannan Jiang · Annan Shu · Rongrong Ji | N/A | Code |
| ANetQA: A Large-Scale Benchmark for Fine-Grained Compositional Reasoning Over Untrimmed Videos | Zhou Yu · Lixiang Zheng · Zhou Zhao · Fei Wu · Jianping Fan · Kui Ren · Jun Yu | N/A | Code |
| GD-MAE: Generative Decoder for MAE Pre-Training on LiDAR Point Clouds | Honghui Yang · Tong He · Jiaheng Liu · Hua Chen · Boxi Wu · Binbin Lin · Xiaofei He · Wanli Ouyang | N/A | Code |
| Multimodal Industrial Anomaly Detection via Hybrid Fusion | Yue Wang · Jinlong Peng · Jiangning Zhang · Ran Yi · Yabiao Wang · Chengjie Wang | N/A | Code |
| B-Spline Texture Coefficients Estimator for Screen Content Image Super-Resolution | Byeonghyun Pak · Jaewon Lee · Kyong Hwan Jin | N/A | Code |
| CLIP Is Also an Efficient Segmenter: A Text-Driven Approach for Weakly Supervised Semantic Segmentation | Yuqi Lin · Minghao Chen · Wenxiao Wang · Boxi Wu · Ke Li · Binbin Lin · Haifeng Liu · Xiaofei He | N/A | Code |
| MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Generation | Ludan Ruan · Yiyang Ma · Huan Yang · Huiguo He · Bei Liu · Jianlong Fu · Nicholas Jing Yuan · Qin Jin · Baining Guo | N/A | Code |
| FreeNeRF: Improving Few-Shot Neural Rendering With Free Frequency Regularization | Jiawei Yang · Marco Pavone · Yue Wang | N/A | Code |
| SteerNeRF: Accelerating NeRF Rendering via Smooth Viewpoint Trajectory | Sicheng Li · Hao Li · Yue Wang · Yiyi Liao · Lu Yu | N/A | Code |
| Run, Don’t Walk: Chasing Higher FLOPS for Faster Neural Networks | Jierun Chen · Shiu-hong Kao · Hao He · Weipeng Zhuo · Song Wen · Chul-Ho Lee · S.-H. Gary Chan | N/A | Code |
| Temporal Attention Unit: Towards Efficient Spatiotemporal Predictive Learning | Cheng Tan · Zhangyang Gao · Lirong Wu · Yongjie Xu · Jun Xia · Siyuan Li · Stan Z. Li | N/A | Code |
| Semi-Supervised 2D Human Pose Estimation Driven by Position Inconsistency Pseudo Label Correction Module | Linzhi Huang · Yulong Li · Hongbo Tian · Yue Yang · Xiangang Li · Weihong Deng · Jieping Ye | N/A | Code |
| Good Is Bad: Causality Inspired Cloth-Debiasing for Cloth-Changing Person Re-Identification | Zhengwei Yang · Meng Lin · Xian Zhong · Yu Wu · Zheng Wang | N/A | Code |
| Feature Alignment and Uniformity for Test Time Adaptation | Shuai Wang · Daoan Zhang · Zipei Yan · Jianguo Zhang · Rui Li | N/A | Code |
| AeDet: Azimuth-Invariant Multi-View 3D Object Detection | Chengjian Feng · Zequn Jie · Yujie Zhong · Xiangxiang Chu · Lin Ma | N/A | Code |
| Towards Realistic Long-Tailed Semi-Supervised Learning: Consistency Is All You Need | Tong Wei · Kai Gan | N/A | Code |
| OmniAL: A Unified CNN Framework for Unsupervised Anomaly Localization | Ying Zhao | N/A | Code |
| HIER: Metric Learning Beyond Class Labels via Hierarchical Regularization | Sungyeon Kim · Boseung Jeong · Suha Kwak | N/A | Code |
| Generative Diffusion Prior for Unified Image Restoration and Enhancement | Ben Fei · Zhaoyang Lyu · Liang Pan · Junzhe Zhang · Weidong Yang · Tianyue Luo · Bo Zhang · Bo Dai | N/A | Code |
| Discriminating Known From Unknown Objects via Structure-Enhanced Recurrent Variational AutoEncoder | Aming Wu · Cheng Deng | N/A | Code |
| 2PCNet: Two-Phase Consistency Training for Day-to-Night Unsupervised Domain Adaptive Object Detection | Mikhail Kennerley · Jian-Gang Wang · Bharadwaj Veeravalli · Robby T. Tan | N/A | Code |
| Linking Garment With Person via Semantically Associated Landmarks for Virtual Try-On | Keyu Yan · Tingwei Gao · Hui Zhang · Chengjun Xie | N/A | Code |
| A New Comprehensive Benchmark for Semi-Supervised Video Anomaly Detection and Anticipation | Congqi Cao · Yue Lu · Peng Wang · Yanning Zhang | N/A | Code |
| DINER: Depth-Aware Image-Based NEural Radiance Fields | Malte Prinzler · Otmar Hilliges · Justus Thies | N/A | Code |
| Learning Personalized High Quality Volumetric Head Avatars From Monocular RGB Videos | Ziqian Bai · Feitong Tan · Zeng Huang · Kripasindhu Sarkar · Danhang Tang · Di Qiu · Abhimitra Meka · Ruofei Du · Mingsong Dou · Sergio Orts-Escolano · Rohit Pandey · Ping Tan · Thabo Beeler · Sean Fanello · Yinda Zhang | N/A | Code |
| HOOD: Hierarchical Graphs for Generalized Modelling of Clothing Dynamics | Artur Grigorev · Michael J. Black · Otmar Hilliges | N/A | Code |
| Boundary-Enhanced Co-Training for Weakly Supervised Semantic Segmentation | Shenghai Rong · Bohai Tu · Zilei Wang · Junjie Li | N/A | Code |
| Instant Volumetric Head Avatars | Wojciech Zielonka · Timo Bolkart · Justus Thies | N/A | Code |
| From Node Interaction To Hop Interaction: New Effective and Scalable Graph Learning Paradigm | Jie Chen · Zilong Li · Yin Zhu · Junping Zhang · Jian Pu | N/A | Code |
| Transfer4D: A Framework for Frugal Motion Capture and Deformation Transfer | Shubh Maheshwari · Rahul Narain · Ramya Hebbalaguppe | N/A | Code |
| An In-Depth Exploration of Person Re-Identification and Gait Recognition in Cloth-Changing Conditions | Weijia Li · Saihui Hou · Chunjie Zhang · Chunshui Cao · Xu Liu · Yongzhen Huang · Yao Zhao | N/A | Code |
| Event-Based Shape From Polarization | Manasi Muglikar · Leonard Bauersfeld · Diederik Paul Moeys · Davide Scaramuzza | N/A | Code |
| Plateau-Reduced Differentiable Path Tracing | Michael Fischer · Tobias Ritschel | N/A | Code |
| End-to-End Video Matting With Trimap Propagation | Wei-Lun Huang · Ming-Sui Lee | N/A | Code |
| Weakly-Supervised Single-View Image Relighting | Renjiao Yi · Chenyang Zhu · Kai Xu | N/A | Code |
| Learning Audio-Visual Source Localization via False Negative Aware Contrastive Learning | Weixuan Sun · Jiayi Zhang · Jianyuan Wang · Zheyuan Liu · Yiran Zhong · Tianpeng Feng · Yandong Guo · Yanhao Zhang · Nick Barnes | N/A | Code |
| Non-Contrastive Unsupervised Learning of Physiological Signals From Video | Jeremy Speth · Nathan Vance · Patrick Flynn · Adam Czajka | N/A | Code |
| Structured Sparsity Learning for Efficient Video Super-Resolution | Bin Xia · Jingwen He · Yulun Zhang · Yitong Wang · Yapeng Tian · Wenming Yang · Luc Van Gool | N/A | Code |
| ReVISE: Self-Supervised Speech Resynthesis With Visual Input for Universal and Generalized Speech Regeneration | Wei-Ning Hsu · Tal Remez · Bowen Shi · Jacob Donley · Yossi Adi | N/A | Code |
| Shape, Pose, and Appearance From a Single Image via Bootstrapped Radiance Field Inversion | Dario Pavllo · David Joseph Tan · Marie-Julie Rakotosaona · Federico Tombari | N/A | Code |
| Towards Better Decision Forests: Forest Alternating Optimization | Miguel Á. Carreira-Perpiñán · Magzhan Gabidolla · Arman Zharmagambetov | N/A | Code |
| CrOC: Cross-View Online Clustering for Dense Visual Representation Learning | Thomas Stegmüller · Tim Lebailly · Behzad Bozorgtabar · Tinne Tuytelaars · Jean-Philippe Thiran | N/A | Code |
| Polynomial Implicit Neural Representations for Large Diverse Datasets | Rajhans Singh · Ankita Shukla · Pavan Turaga | N/A | Code |
| GradICON: Approximate Diffeomorphisms via Gradient Inverse Consistency | Lin Tian · Hastings Greer · François-Xavier Vialard · Roland Kwitt · Raúl San José Estépar · Richard Jarrett Rushmore · Nikolaos Makris · Sylvain Bouix · Marc Niethammer | N/A | Code |
| Exploring Discontinuity for Video Frame Interpolation | Sangjin Lee · Hyeongmin Lee · Chajin Shin · Hanbin Son · Sangyoun Lee | N/A | Code |
| Local 3D Editing via 3D Distillation of CLIP Knowledge | Junha Hyung · Sungwon Hwang · Daejin Kim · Hyunji Lee · Jaegul Choo | N/A | Code |
| Dynamic Conceptional Contrastive Learning for Generalized Category Discovery | Nan Pu · Zhun Zhong · Nicu Sebe | N/A | Code |
| Look, Radiate, and Learn: Self-Supervised Localisation via Radio-Visual Correspondence | Mohammed Alloulah · Maximilian Arnold | N/A | Code |
| Instance Relation Graph Guided Source-Free Domain Adaptive Object Detection | Vibashan VS · Poojan Oza · Vishal M. Patel | N/A | Code |
| High Fidelity 3D Hand Shape Reconstruction via Scalable Graph Frequency Decomposition | Tianyu Luan · Yuanhao Zhai · Jingjing Meng · Zhong Li · Zhang Chen · Yi Xu · Junsong Yuan | N/A | Code |
| 3D Highlighter: Localizing Regions on 3D Shapes via Text Descriptions | Dale Decatur · Itai Lang · Rana Hanocka | N/A | Code |
| Egocentric Video Task Translation | Zihui Xue · Yale Song · Kristen Grauman · Lorenzo Torresani | N/A | Code |
| Pixels, Regions, and Objects: Multiple Enhancement for Salient Object Detection | Yi Wang · Ruili Wang · Xin Fan · Tianzhu Wang · Xiangjian He | N/A | Code |
| Balanced Energy Regularization Loss for Out-of-Distribution Detection | Hyunjun Choi · Hawook Jeong · Jin Young Choi | N/A | Code |
| Private Image Generation With Dual-Purpose Auxiliary Classifier | Chen Chen · Daochang Liu · Siqi Ma · Surya Nepal · Chang Xu | N/A | Code |
| Controllable Mesh Generation Through Sparse Latent Point Diffusion Models | Zhaoyang Lyu · Jinyi Wang · Yuwei An · Ya Zhang · Dahua Lin · Bo Dai | N/A | Code |
| Neural Video Compression With Diverse Contexts | Jiahao Li · Bin Li · Yan Lu | N/A | Code |
| Uni3D: A Unified Baseline for Multi-Dataset 3D Object Detection | Bo Zhang · Jiakang Yuan · Botian Shi · Tao Chen · Yikang Li · Yu Qiao | N/A | Code |
| ScarceNet: Animal Pose Estimation With Scarce Annotations | Chen Li · Gim Hee Lee | N/A | Code |
| Fast Contextual Scene Graph Generation With Unbiased Context Augmentation | Tianlei Jin · Fangtai Guo · Qiwei Meng · Shiqiang Zhu · Xiangming Xi · Wen Wang · Zonghao Mu · Wei Song | N/A | Code |
| TriDet: Temporal Action Detection With Relative Boundary Modeling | Dingfeng Shi · Yujie Zhong · Qiong Cao · Lin Ma · Jia Li · Dacheng Tao | N/A | Code |
| Multi-Level Logit Distillation | Ying Jin · Jiaqi Wang · Dahua Lin | N/A | Code |
| StyleAdv: Meta Style Adversarial Training for Cross-Domain Few-Shot Learning | Yuqian Fu · Yu Xie · Yanwei Fu · Yu-Gang Jiang | N/A | Code |
| Text With Knowledge Graph Augmented Transformer for Video Captioning | Xin Gu · Guang Chen · Yufei Wang · Libo Zhang · Tiejian Luo · Longyin Wen | N/A | Code |
| Semantic Ray: Learning a Generalizable Semantic Field With Cross-Reprojection Attention | Fangfu Liu · Chubin Zhang · Yu Zheng · Yueqi Duan | N/A | Code |
| MELTR: Meta Loss Transformer for Learning To Fine-Tune Video Foundation Models | Dohwan Ko · Joonmyung Choi · Hyeong Kyu Choi · Kyoung-Woon On · Byungseok Roh · Hyunwoo J. Kim | N/A | Code |
| Self-Supervised AutoFlow | Hsin-Ping Huang · Charles Herrmann · Junhwa Hur · Erika Lu · Kyle Sargent · Austin Stone · Ming-Hsuan Yang · Deqing Sun | N/A | Code |
| Adaptive Sparse Convolutional Networks With Global Context Enhancement for Faster Object Detection on Drone Images | Bowei Du · Yecheng Huang · Jiaxin Chen · Di Huang | N/A | Code |
| Context-Based Trit-Plane Coding for Progressive Image Compression | Seungmin Jeon · Kwang Pyo Choi · Youngo Park · Chang-Su Kim | N/A | Code |
| Unsupervised Contour Tracking of Live Cells by Mechanical and Cycle Consistency Losses | Junbong Jang · Kwonmoo Lee · Tae-Kyun Kim | N/A | Code |
| VQACL: A Novel Visual Question Answering Continual Learning Setting | Xi Zhang · Feifei Zhang · Changsheng Xu | N/A | Code |
| Explicit Visual Prompting for Low-Level Structure Segmentations | Weihuang Liu · Xi Shen · Chi-Man Pun · Xiaodong Cun | N/A | Code |
| Practical Network Acceleration With Tiny Sets | Guo-Hua Wang · Jianxin Wu | N/A | Code |
| Sphere-Guided Training of Neural Implicit Surfaces | Andreea Dogaru · Andrei-Timotei Ardelean · Savva Ignatyev · Egor Zakharov · Evgeny Burnaev | N/A | Code |
| Texture-Guided Saliency Distilling for Unsupervised Salient Object Detection | Huajun Zhou · Bo Qiao · Lingxiao Yang · Jianhuang Lai · Xiaohua Xie | N/A | Code |
| FFHQ-UV: Normalized Facial UV-Texture Dataset for 3D Face Reconstruction | Haoran Bai · Di Kang · Haoxian Zhang · Jinshan Pan · Linchao Bao | N/A | Code |
| Differentiable Shadow Mapping for Efficient Inverse Graphics | Markus Worchel · Marc Alexa | N/A | Code |
| SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation | Wenxuan Zhang · Xiaodong Cun · Xuan Wang · Yong Zhang · Xi Shen · Yu Guo · Ying Shan · Fei Wang | N/A | Code |
| High-Fidelity and Freely Controllable Talking Head Video Generation | Yue Gao · Yuan Zhou · Jinglu Wang · Xiao Li · Xiang Ming · Yan Lu | N/A | Code |
| BiFormer: Learning Bilateral Motion Estimation via Bilateral Transformer for 4K Video Frame Interpolation | Junheum Park · Jintae Kim · Chang-Su Kim | N/A | Code |
| Noisy Correspondence Learning With Meta Similarity Correction | Haochen Han · Kaiyao Miao · Qinghua Zheng · Minnan Luo | N/A | Code |
| EVAL: Explainable Video Anomaly Localization | Ashish Singh · Michael J. Jones · Erik G. Learned-Miller | N/A | Code |
| Adaptive Plasticity Improvement for Continual Learning | Yan-Shuo Liang · Wu-Jun Li | N/A | Code |
| Edges to Shapes to Concepts: Adversarial Augmentation for Robust Vision | Aditay Tripathi · Rishubh Singh · Anirban Chakraborty · Pradeep Shenoy | N/A | Code |
| MOSO: Decomposing MOtion, Scene and Object for Video Prediction | Mingzhen Sun · Weining Wang · Xinxin Zhu · Jing Liu | N/A | Code |
| Accelerated Coordinate Encoding: Learning to Relocalize in Minutes Using RGB and Poses | Eric Brachmann · Tommaso Cavallari · Victor Adrian Prisacariu | N/A | Code |
| A Probabilistic Attention Model With Occlusion-Aware Texture Regression for 3D Hand Reconstruction From a Single RGB Image | Zheheng Jiang · Hossein Rahmani · Sue Black · Bryan M. Williams | N/A | Code |
| Revisiting Rotation Averaging: Uncertainties and Robust Losses | Ganlin Zhang · Viktor Larsson · Daniel Barath | N/A | Code |
| LiDAR-in-the-Loop Hyperparameter Optimization | Félix Goudreault · Dominik Scheuble · Mario Bijelic · Nicolas Robidoux · Felix Heide | N/A | Code |
| Query-Dependent Video Representation for Moment Retrieval and Highlight Detection | WonJun Moon · Sangeek Hyun · SangUk Park · Dongchan Park · Jae-Pil Heo | N/A | Code |
| High-Fidelity 3D Face Generation From Natural Language Descriptions | Menghua Wu · Hao Zhu · Linjia Huang · Yiyu Zhuang · Yuanxun Lu · Xun Cao | N/A | Code |
| NeRF-Supervised Deep Stereo | Fabio Tosi · Alessio Tonioni · Daniele De Gregorio · Matteo Poggi | N/A | Code |
| vMAP: Vectorised Object Mapping for Neural Field SLAM | Xin Kong · Shikun Liu · Marwan Taher · Andrew J. Davison | N/A | Code |
| DiffRF: Rendering-Guided 3D Radiance Field Diffusion | Norman Müller · Yawar Siddiqui · Lorenzo Porzi · Samuel Rota Bulò · Peter Kontschieder · Matthias Nießner | N/A | Code |
| TokenHPE: Learning Orientation Tokens for Efficient Head Pose Estimation via Transformers | Cheng Zhang · Hai Liu · Yongjian Deng · Bochen Xie · Youfu Li | N/A | Code |
| Learning a Depth Covariance Function | Eric Dexheimer · Andrew J. Davison | N/A | Code |
| Handy: Towards a High Fidelity 3D Hand Shape and Appearance Model | Rolandos Alexandros Potamias · Stylianos Ploumpis · Stylianos Moschoglou · Vasileios Triantafyllou · Stefanos Zafeiriou | N/A | Code |
| The Best Defense Is a Good Offense: Adversarial Augmentation Against Adversarial Attacks | Iuri Frosio · Jan Kautz | N/A | Code |
| Test of Time: Instilling Video-Language Models With a Sense of Time | Piyush Bagad · Makarand Tapaswi · Cees G. M. Snoek | N/A | Code |
| BundleSDF: Neural 6-DoF Tracking and 3D Reconstruction of Unknown Objects | Bowen Wen · Jonathan Tremblay · Valts Blukis · Stephen Tyree · Thomas Müller · Alex Evans · Dieter Fox · Jan Kautz · Stan Birchfield | N/A | Code |
| Leveraging Hidden Positives for Unsupervised Semantic Segmentation | Hyun Seok Seong · WonJun Moon · SuBeen Lee · Jae-Pil Heo | N/A | Code |
| BlendFields: Few-Shot Example-Driven Facial Modeling | Kacper Kania · Stephan J. Garbin · Andrea Tagliasacchi · Virginia Estellers · Kwang Moo Yi · Julien Valentin · Tomasz Trzciński · Marek Kowalski | N/A | Code |
| CIRCLE: Capture in Rich Contextual Environments | João Pedro Araújo · Jiaman Li · Karthik Vetrivel · Rishi Agarwal · Jiajun Wu · Deepak Gopinath · Alexander William Clegg · Karen Liu | N/A | Code |
| Realistic Saliency Guided Image Enhancement | S. Mahdi H. Miangoleh · Zoya Bylinskii · Eric Kee · Eli Shechtman · Yağiz Aksoy | N/A | Code |
| Implicit Neural Head Synthesis via Controllable Local Deformation Fields | Chuhan Chen · Matthew O’Toole · Gaurav Bharaj · Pablo Garrido | N/A | Code |
| Ensemble-Based Blackbox Attacks on Dense Prediction | Zikui Cai · Yaoteng Tan · M. Salman Asif | N/A | Code |
| NaQ: Leveraging Narrations As Queries To Supervise Episodic Memory | Santhosh Kumar Ramakrishnan · Ziad Al-Halah · Kristen Grauman | N/A | Code |
| Rethinking Federated Learning With Domain Shift: A Prototype View | Wenke Huang · Mang Ye · Zekun Shi · He Li · Bo Du | N/A | Code |
| Spatio-Temporal Pixel-Level Contrastive Learning-Based Source-Free Domain Adaptation for Video Semantic Segmentation | Shao-Yuan Lo · Poojan Oza · Sumanth Chennupati · Alejandro Galindo · Vishal M. Patel | N/A | Code |
| Bi3D: Bi-Domain Active Learning for Cross-Domain 3D Object Detection | Jiakang Yuan · Bo Zhang · Xiangchao Yan · Tao Chen · Botian Shi · Yikang Li · Yu Qiao | N/A | Code |
| STAR Loss: Reducing Semantic Ambiguity in Facial Landmark Detection | Zhenglin Zhou · Huaxia Li · Hong Liu · Nanyang Wang · Gang Yu · Rongrong Ji | N/A | Code |
| Diverse 3D Hand Gesture Prediction From Body Dynamics by Bilateral Hand Disentanglement | Xingqun Qi · Chen Liu · Muyi Sun · Lincheng Li · Changjie Fan · Xin Yu | N/A | Code |
| Sparsely Annotated Semantic Segmentation With Adaptive Gaussian Mixtures | Linshan Wu · Zhun Zhong · Leyuan Fang · Xingxin He · Qiang Liu · Jiayi Ma · Hao Chen | N/A | Code |
| Visual Dependency Transformers: Dependency Tree Emerges From Reversed Attention | Mingyu Ding · Yikang Shen · Lijie Fan · Zhenfang Chen · Zitian Chen · Ping Luo · Joshua B. Tenenbaum · Chuang Gan | N/A | Code |
| Frame Flexible Network | Yitian Zhang · Yue Bai · Chang Liu · Huan Wang · Sheng Li · Yun Fu | N/A | Code |
| Uni-Perceiver v2: A Generalist Model for Large-Scale Vision and Vision-Language Tasks | Hao Li · Jinguo Zhu · Xiaohu Jiang · Xizhou Zhu · Hongsheng Li · Chun Yuan · Xiaohua Wang · Yu Qiao · Xiaogang Wang · Wenhai Wang · Jifeng Dai | N/A | Code |
| DCFace: Synthetic Face Generation With Dual Condition Diffusion Model | Minchul Kim · Feng Liu · Anil Jain · Xiaoming Liu | N/A | Code |
| Referring Image Matting | Jizhizi Li · Jing Zhang · Dacheng Tao | N/A | Code |
| Fast Monocular Scene Reconstruction With Global-Sparse Local-Dense Grids | Wei Dong · Christopher Choy · Charles Loop · Or Litany · Yuke Zhu · Anima Anandkumar | N/A | Code |
| DPE: Disentanglement of Pose and Expression for General Video Portrait Editing | Youxin Pang · Yong Zhang · Weize Quan · Yanbo Fan · Xiaodong Cun · Ying Shan · Dong-Ming Yan | N/A | Code |
| IDGI: A Framework To Eliminate Explanation Noise From Integrated Gradients | Ruo Yang · Binghui Wang · Mustafa Bilgic | N/A | Code |
| DynamicDet: A Unified Dynamic Architecture for Object Detection | Zhihao Lin · Yongtao Wang · Jinhe Zhang · Xiaojie Chu | N/A | Code |
| Task-Specific Fine-Tuning via Variational Information Bottleneck for Weakly-Supervised Pathology Whole Slide Image Classification | Honglin Li · Chenglu Zhu · Yunlong Zhang · Yuxuan Sun · Zhongyi Shui · Wenwei Kuang · Sunyi Zheng · Lin Yang | N/A | Code |
| VNE: An Effective Method for Improving Deep Representation by Manipulating Eigenvalue Distribution | Jaeill Kim · Suhyun Kang · Duhun Hwang · Jungwook Shin · Wonjong Rhee | N/A | Code |
| Semi-Weakly Supervised Object Kinematic Motion Prediction | Gengxin Liu · Qian Sun · Haibin Huang · Chongyang Ma · Yulan Guo · Li Yi · Hui Huang · Ruizhen Hu | N/A | Code |
| Computational Flash Photography Through Intrinsics | Sepideh Sarajian Maralan · Chris Careaga · Yağiz Aksoy | N/A | Code |
| Inversion-Based Style Transfer With Diffusion Models | Yuxin Zhang · Nisha Huang · Fan Tang · Haibin Huang · Chongyang Ma · Weiming Dong · Changsheng Xu | N/A | Code |
| Data-Driven Feature Tracking for Event Cameras | Nico Messikommer · Carter Fang · Mathias Gehrig · Davide Scaramuzza | N/A | Code |
| QPGesture: Quantization-Based and Phase-Guided Motion Matching for Natural Speech-Driven Gesture Generation | Sicheng Yang · Zhiyong Wu · Minglei Li · Zhensong Zhang · Lei Hao · Weihong Bao · Haolin Zhuang | N/A | Code |
| Neural Fourier Filter Bank | Zhijie Wu · Yuhe Jin · Kwang Moo Yi | N/A | Code |
| Solving Oscillation Problem in Post-Training Quantization Through a Theoretical Perspective | Yuexiao Ma · Huixia Li · Xiawu Zheng · Xuefeng Xiao · Rui Wang · Shilei Wen · Xin Pan · Fei Chao · Rongrong Ji | N/A | Code |
| Full or Weak Annotations? An Adaptive Strategy for Budget-Constrained Annotation Campaigns | Javier Gamazo Tejero · Martin S. Zinkernagel · Sebastian Wolf · Raphael Sznitman · Pablo Márquez-Neila | N/A | Code |
| Trap Attention: Monocular Depth Estimation With Manual Traps | Chao Ning · Hongping Gan | N/A | Code |
| Physical-World Optical Adversarial Attacks on 3D Face Recognition | Yanjie Li · Yiquan Li · Xuelong Dai · Songtao Guo · Bin Xiao | N/A | Code |
| Re-Thinking Federated Active Learning Based on Inter-Class Diversity | SangMook Kim · Sangmin Bae · Hwanjun Song · Se-Young Yun | N/A | Code |
| EMT-NAS: Transferring Architectural Knowledge Between Tasks From Different Datasets | Peng Liao · Yaochu Jin · Wenli Du | N/A | Code |
| Temporal Consistent 3D LiDAR Representation Learning for Semantic Perception in Autonomous Driving | Lucas Nunes · Louis Wiesmann · Rodrigo Marcuzzi · Xieyuanli Chen · Jens Behley · Cyrill Stachniss | N/A | Code |
| Document Image Shadow Removal Guided by Color-Aware Background | Ling Zhang · Yinghao He · Qing Zhang · Zheng Liu · Xiaolong Zhang · Chunxia Xiao | N/A | Code |
| Pose-Disentangled Contrastive Learning for Self-Supervised Facial Representation | Yuanyuan Liu · Wenbin Wang · Yibing Zhan · Shaoze Feng · Kejun Liu · Zhe Chen | N/A | Code |
| Ham2Pose: Animating Sign Language Notation Into Pose Sequences | Rotem Shalev Arkushin · Amit Moryossef · Ohad Fried | N/A | Code |
| Resource-Efficient RGBD Aerial Tracking | Jinyu Yang · Shang Gao · Zhe Li · Feng Zheng · Aleš Leonardis | N/A | Code |
| Neural Transformation Fields for Arbitrary-Styled Font Generation | Bin Fu · Junjun He · Jianjun Wang · Yu Qiao | N/A | Code |
| Density-Insensitive Unsupervised Domain Adaption on 3D Object Detection | Qianjiang Hu · Daizong Liu · Wei Hu | N/A | Code |
| PAniC-3D: Stylized Single-View 3D Reconstruction From Portraits of Anime Characters | Shuhong Chen · Kevin Zhang · Yichun Shi · Heng Wang · Yiheng Zhu · Guoxian Song · Sizhe An · Janus Kristjansson · Xiao Yang · Matthias Zwicker | N/A | Code |
| HS-Pose: Hybrid Scope Feature Extraction for Category-Level Object Pose Estimation | Linfang Zheng · Chen Wang · Yinghan Sun · Esha Dasgupta · Hua Chen · Aleš Leonardis · Wei Zhang · Hyung Jin Chang | N/A | Code |
| A Hierarchical Representation Network for Accurate and Detailed Face Reconstruction From In-the-Wild Images | Biwen Lei · Jianqiang Ren · Mengyang Feng · Miaomiao Cui · Xuansong Xie | N/A | Code |
| Language in a Bottle: Language Model Guided Concept Bottlenecks for Interpretable Image Classification | Yue Yang · Artemis Panagopoulou · Shenghao Zhou · Daniel Jin · Chris Callison-Burch · Mark Yatskar | N/A | Code |
| SfM-TTR: Using Structure From Motion for Test-Time Refinement of Single-View Depth Networks | Sergio Izquierdo · Javier Civera | N/A | Code |
| TINC: Tree-Structured Implicit Neural Compression | Runzhao Yang | N/A | Code |
| Cross-Domain Image Captioning With Discriminative Finetuning | Roberto Dessì · Michele Bevilacqua · Eleonora Gualdoni · Nathanaël Carraz Rakotonirina · Francesca Franzon · Marco Baroni | N/A | Code |
| Learning To Detect Mirrors From Videos via Dual Correspondences | Jiaying Lin · Xin Tan · Rynson W.H. Lau | N/A | Code |
| Conflict-Based Cross-View Consistency for Semi-Supervised Semantic Segmentation | Zicheng Wang · Zhen Zhao · Xiaoxia Xing · Dong Xu · Xiangyu Kong · Luping Zhou | N/A | Code |
| Robust Unsupervised StyleGAN Image Restoration | Yohan Poirier-Ginter · Jean-François Lalonde | N/A | Code |
| Masked Video Distillation: Rethinking Masked Feature Modeling for Self-Supervised Video Representation Learning | Rui Wang · Dongdong Chen · Zuxuan Wu · Yinpeng Chen · Xiyang Dai · Mengchen Liu · Lu Yuan · Yu-Gang Jiang | N/A | Code |
| Neural Fields Meet Explicit Geometric Representations for Inverse Rendering of Urban Scenes | Zian Wang · Tianchang Shen · Jun Gao · Shengyu Huang · Jacob Munkberg · Jon Hasselgren · Zan Gojcic · Wenzheng Chen · Sanja Fidler | N/A | Code |
| Augmentation Matters: A Simple-Yet-Effective Approach to Semi-Supervised Semantic Segmentation | Zhen Zhao · Lihe Yang · Sifan Long · Jimin Pi · Luping Zhou · Jingdong Wang | N/A | Code |
| Policy Adaptation From Foundation Model Feedback | Yuying Ge · Annabella Macaluso · Li Erran Li · Ping Luo · Xiaolong Wang | N/A | Code |
| Person Image Synthesis via Denoising Diffusion Model | Ankan Kumar Bhunia · Salman Khan · Hisham Cholakkal · Rao Muhammad Anwer · Jorma Laaksonen · Mubarak Shah · Fahad Shahbaz Khan | N/A | Code |
| Bidirectional Cross-Modal Knowledge Exploration for Video Recognition With Pre-Trained Vision-Language Models | Wenhao Wu · Xiaohan Wang · Haipeng Luo · Jingdong Wang · Yi Yang · Wanli Ouyang | N/A | Code |
| CiaoSR: Continuous Implicit Attention-in-Attention Network for Arbitrary-Scale Image Super-Resolution | Jiezhang Cao · Qin Wang · Yongqin Xian · Yawei Li · Bingbing Ni · Zhiming Pi · Kai Zhang · Yulun Zhang · Radu Timofte · Luc Van Gool | N/A | Code |
| Black-Box Sparse Adversarial Attack via Multi-Objective Optimisation | Phoenix Neale Williams · Ke Li | N/A | Code |
| AdaptiveMix: Improving GAN Training via Feature Space Shrinkage | Haozhe Liu · Wentian Zhang · Bing Li · Haoqian Wu · Nanjun He · Yawen Huang · Yuexiang Li · Bernard Ghanem · Yefeng Zheng | N/A | Code |
| ViTs for SITS: Vision Transformers for Satellite Image Time Series | Michail Tarasiou · Erik Chavez · Stefanos Zafeiriou | N/A | Code |
| Latency Matters: Real-Time Action Forecasting Transformer | Harshayu Girase · Nakul Agarwal · Chiho Choi · Karttikeya Mangalam | N/A | Code |
| Multi-Sensor Large-Scale Dataset for Multi-View 3D Reconstruction | Oleg Voynov · Gleb Bobrovskikh · Pavel Karpyshev · Saveliy Galochkin · Andrei-Timotei Ardelean · Arseniy Bozhenko · Ekaterina Karmanova · Pavel Kopanev · Yaroslav Labutin-Rymsho · Ruslan Rakhimov · Aleksandr Safin · Valerii Serpiva · Alexey Artemov · Evgeny Burnaev · Dzmitry Tsetserukou · Denis Zorin | N/A | Code |
| Learning From Noisy Labels With Decoupled Meta Label Purifier | Yuanpeng Tu · Boshen Zhang · Yuxi Li · Liang Liu · Jian Li · Yabiao Wang · Chengjie Wang · Cai Rong Zhao | N/A | Code |
| Flow Supervision for Deformable NeRF | Chaoyang Wang · Lachlan Ewen MacDonald · László A. Jeni · Simon Lucey | N/A | Code |
| Unifying Vision, Text, and Layout for Universal Document Processing | Zineng Tang · Ziyi Yang · Guoxin Wang · Yuwei Fang · Yang Liu · Chenguang Zhu · Michael Zeng · Cha Zhang · Mohit Bansal | N/A | Code |
| BKinD-3D: Self-Supervised 3D Keypoint Discovery From Multi-View Videos | Jennifer J. Sun · Lili Karashchuk · Amil Dravid · Serim Ryou · Sonia Fereidooni · John C. Tuthill · Aggelos Katsaggelos · Bingni W. Brunton · Georgia Gkioxari · Ann Kennedy · Yisong Yue · Pietro Perona | N/A | Code |
| Architecture, Dataset and Model-Scale Agnostic Data-Free Meta-Learning | Zixuan Hu · Li Shen · Zhenyi Wang · Tongliang Liu · Chun Yuan · Dacheng Tao | N/A | Code |
| RWSC-Fusion: Region-Wise Style-Controlled Fusion Network for the Prohibited X-Ray Security Image Synthesis | Luwen Duan · Min Wu · Lijian Mao · Jun Yin · Jianping Xiong · Xi Li | N/A | Code |
| Meta Architecture for Point Cloud Analysis | Haojia Lin · Xiawu Zheng · Lijiang Li · Fei Chao · Shanshan Wang · Yan Wang · Yonghong Tian · Rongrong Ji | N/A | Code |
| DyLiN: Making Light Field Networks Dynamic | Heng Yu · Joel Julin · Zoltán Á. Milacski · Koichiro Niinuma · László A. Jeni | N/A | Code |
| OpenMix: Exploring Outlier Samples for Misclassification Detection | Fei Zhu · Zhen Cheng · Xu-Yao Zhang · Cheng-Lin Liu | N/A | Code |
| Adaptive Graph Convolutional Subspace Clustering | Lai Wei · Zhengwei Chen · Jun Yin · Changming Zhu · Rigui Zhou · Jin Liu | N/A | Code |
| Extracting Motion and Appearance via Inter-Frame Attention for Efficient Video Frame Interpolation | Guozhen Zhang · Yuhan Zhu · Haonan Wang · Youxin Chen · Gangshan Wu · Limin Wang | N/A | Code |
| Hybrid Active Learning via Deep Clustering for Video Action Detection | Aayush J. Rana · Yogesh S. Rawat | N/A | Code |
| Equiangular Basis Vectors | Yang Shen · Xuhao Sun · Xiu-Shen Wei | N/A | Code |
| CAT: LoCalization and IdentificAtion Cascade Detection Transformer for Open-World Object Detection | Shuailei Ma · Yuefeng Wang · Ying Wei · Jiaqi Fan · Thomas H. Li · Hongli Liu · Fanbing Lv | N/A | Code |
| An Actor-Centric Causality Graph for Asynchronous Temporal Inference in Group Activity | Zhao Xie · Tian Gao · Kewei Wu · Jiao Chang | N/A | Code |
| GCFAgg: Global and Cross-View Feature Aggregation for Multi-View Clustering | Weiqing Yan · Yuanyang Zhang · Chenlei Lv · Chang Tang · Guanghui Yue · Liang Liao · Weisi Lin | N/A | Code |
| Unsupervised Visible-Infrared Person Re-Identification via Progressive Graph Matching and Alternate Learning | Zesen Wu · Mang Ye | N/A | Code |
| Similarity Maps for Self-Training Weakly-Supervised Phrase Grounding | Tal Shaharabany · Lior Wolf | N/A | Code |
| DA Wand: Distortion-Aware Selection Using Neural Mesh Parameterization | Richard Liu · Noam Aigerman · Vladimir G. Kim · Rana Hanocka | N/A | Code |
| BiCro: Noisy Correspondence Rectification for Multi-Modality Data via Bi-Directional Cross-Modal Similarity Consistency | Shuo Yang · Zhaopan Xu · Kai Wang · Yang You · Hongxun Yao · Tongliang Liu · Min Xu | N/A | Code |
| DaFKD: Domain-Aware Federated Knowledge Distillation | Haozhao Wang · Yichen Li · Wenchao Xu · Ruixuan Li · Yufeng Zhan · Zhigang Zeng | N/A | Code |
| Single Image Depth Prediction Made Better: A Multivariate Gaussian Take | Ce Liu · Suryansh Kumar · Shuhang Gu · Radu Timofte · Luc Van Gool | N/A | Code |
| Align Your Latents: High-Resolution Video Synthesis With Latent Diffusion Models | Andreas Blattmann · Robin Rombach · Huan Ling · Tim Dockhorn · Seung Wook Kim · Sanja Fidler · Karsten Kreis | N/A | Code |
| GeoLayoutLM: Geometric Pre-Training for Visual Information Extraction | Chuwei Luo · Changxu Cheng · Qi Zheng · Cong Yao | N/A | Code |
| VideoMAE V2: Scaling Video Masked Autoencoders With Dual Masking | Limin Wang · Bingkun Huang · Zhiyu Zhao · Zhan Tong · Yinan He · Yi Wang · Yali Wang · Yu Qiao | N/A | Code |
| CVT-SLR: Contrastive Visual-Textual Transformation for Sign Language Recognition With Variational Alignment | Jiangbin Zheng · Yile Wang · Cheng Tan · Siyuan Li · Ge Wang · Jun Xia · Yidong Chen · Stan Z. Li | N/A | Code |
| All Are Worth Words: A ViT Backbone for Diffusion Models | Fan Bao · Shen Nie · Kaiwen Xue · Yue Cao · Chongxuan Li · Hang Su · Jun Zhu | N/A | Code |
| PanoSwin: A Pano-Style Swin Transformer for Panorama Understanding | Zhixin Ling · Zhen Xing · Xiangdong Zhou · Manliang Cao · Guichun Zhou | N/A | Code |
| sRGB Real Noise Synthesizing With Neighboring Correlation-Aware Noise Model | Zixuan Fu · Lanqing Guo · Bihan Wen | N/A | Code |
| Extracting Class Activation Maps From Non-Discriminative Features As Well | Zhaozheng Chen · Qianru Sun | N/A | Code |
| GKEAL: Gaussian Kernel Embedded Analytic Learning for Few-Shot Class Incremental Task | Huiping Zhuang · Zhenyu Weng · Run He · Zhiping Lin · Ziqian Zeng | N/A | Code |
| ERM-KTP: Knowledge-Level Machine Unlearning via Knowledge Transfer | Shen Lin · Xiaoyu Zhang · Chenyang Chen · Xiaofeng Chen · Willy Susilo | N/A | Code |
| PDPP: Projected Diffusion for Procedure Planning in Instructional Videos | Hanlin Wang · Yilu Wu · Sheng Guo · Limin Wang | N/A | Code |
| NeuDA: Neural Deformable Anchor for High-Fidelity Implicit Surface Reconstruction | Bowen Cai · Jinchi Huang · Rongfei Jia · Chengfei Lv · Huan Fu | N/A | Code |
| Deep Polarization Reconstruction With PDAVIS Events | Haiyang Mei · Zuowen Wang · Xin Yang · Xiaopeng Wei · Tobi Delbruck | N/A | Code |
| Beyond Attentive Tokens: Incorporating Token Importance and Diversity for Efficient Vision Transformers | Sifan Long · Zhen Zhao · Jimin Pi · Shengsheng Wang · Jingdong Wang | N/A | Code |
| PointClustering: Unsupervised Point Cloud Pre-Training Using Transformation Invariance in Clustering | Fuchen Long · Ting Yao · Zhaofan Qiu · Lusong Li · Tao Mei | N/A | Code |
| PCR: Proxy-Based Contrastive Replay for Online Class-Incremental Continual Learning | Huiwei Lin · Baoquan Zhang · Shanshan Feng · Xutao Li · Yunming Ye | N/A | Code |
| Boundary-Aware Backward-Compatible Representation via Adversarial Learning in Image Retrieval | Tan Pan · Furong Xu · Xudong Yang · Sifeng He · Chen Jiang · Qingpei Guo · Feng Qian · Xiaobo Zhang · Yuan Cheng · Lei Yang · Wei Chu | N/A | Code |
| PermutoSDF: Fast Multi-View Reconstruction With Implicit Surfaces Using Permutohedral Lattices | Radu Alexandru Rosu · Sven Behnke | N/A | Code |
| StyleGene: Crossover and Mutation of Region-Level Facial Genes for Kinship Face Synthesis | Hao Li · Xianxu Hou · Zepeng Huang · Linlin Shen | N/A | Code |
| MixNeRF: Modeling a Ray With Mixture Density for Novel View Synthesis From Sparse Inputs | Seunghyeon Seo · Donghoon Han · Yeonjin Chang · Nojun Kwak | N/A | Code |
| Upcycling Models Under Domain and Category Shift | Sanqing Qu · Tianpei Zou · Florian Röhrbein · Cewu Lu · Guang Chen · Dacheng Tao · Changjun Jiang | N/A | Code |
| Towards Unbiased Volume Rendering of Neural Implicit Surfaces With Geometry Priors | Yongqiang Zhang · Zhipeng Hu · Haoqian Wu · Minda Zhao · Lincheng Li · Zhengxia Zou · Changjie Fan | N/A | Code |
| Avatars Grow Legs: Generating Smooth Human Motion From Sparse Tracking Inputs With Diffusion Model | Yuming Du · Robin Kips · Albert Pumarola · Sebastian Starke · Ali Thabet · Artsiom Sanakoyeu | N/A | Code |
| MoStGAN-V: Video Generation With Temporal Motion Styles | Xiaoqian Shen · Xiang Li · Mohamed Elhoseiny | N/A | Code |
| On the Importance of Accurate Geometry Data for Dense 3D Vision Tasks | HyunJun Jung · Patrick Ruhkamp · Guangyao Zhai · Nikolas Brasch · Yitong Li · Yannick Verdie · Jifei Song · Yiren Zhou · Anil Armagan · Slobodan Ilic · Aleš Leonardis · Nassir Navab · Benjamin Busam | N/A | Code |
| DeGPR: Deep Guided Posterior Regularization for Multi-Class Cell Detection and Counting | Aayush Kumar Tyagi · Chirag Mohapatra · Prasenjit Das · Govind Makharia · Lalita Mehra · Prathosh AP · Mausam | N/A | Code |
| Learning Action Changes by Measuring Verb-Adverb Textual Relationships | Davide Moltisanti · Frank Keller · Hakan Bilen · Laura Sevilla-Lara | N/A | Code |
| Interactive and Explainable Region-Guided Radiology Report Generation | Tim Tanida · Philip Müller · Georgios Kaissis · Daniel Rueckert | N/A | Code |
| Learning Neural Volumetric Representations of Dynamic Humans in Minutes | Chen Geng · Sida Peng · Zhen Xu · Hujun Bao · Xiaowei Zhou | N/A | Code |
| Boosting Low-Data Instance Segmentation by Unsupervised Pre-Training With Saliency Prompt | Hao Li · Dingwen Zhang · Nian Liu · Lechao Cheng · Yalun Dai · Chao Zhang · Xinggang Wang · Junwei Han | N/A | Code |
| Learning Rotation-Equivariant Features for Visual Correspondence | Jongmin Lee · Byungjin Kim · Seungwook Kim · Minsu Cho | N/A | Code |
| Co-Training 2L Submodels for Visual Recognition | Hugo Touvron · Matthieu Cord · Maxime Oquab · Piotr Bojanowski · Jakob Verbeek · Hervé Jégou | N/A | Code |
| HOTNAS: Hierarchical Optimal Transport for Neural Architecture Search | Jiechao Yang · Yong Liu · Hongteng Xu | N/A | Code |
| LANA: A Language-Capable Navigator for Instruction Following and Generation | Xiaohan Wang · Wenguan Wang · Jiayi Shao · Yi Yang | N/A | Code |
| Visual Localization Using Imperfect 3D Models From the Internet | Vojtech Panek · Zuzana Kukelova · Torsten Sattler | N/A | Code |
| Diversity-Measurable Anomaly Detection | Wenrui Liu · Hong Chang · Bingpeng Ma · Shiguang Shan · Xilin Chen | N/A | Code |
| SLACK: Stable Learning of Augmentations With Cold-Start and KL Regularization | Juliette Marrie · Michael Arbel · Diane Larlus · Julien Mairal | N/A | Code |
| Recurrent Vision Transformers for Object Detection With Event Cameras | Mathias Gehrig · Davide Scaramuzza | N/A | Code |
| Efficient Verification of Neural Networks Against LVM-Based Specifications | Harleen Hanspal · Alessio Lomuscio | N/A | Code |
| Neuralizer: General Neuroimage Analysis Without Re-Training | Steffen Czolbe · Adrian V. Dalca | N/A | Code |
| MobileVOS: Real-Time Video Object Segmentation Contrastive Learning Meets Knowledge Distillation | Roy Miles · Mehmet Kerim Yucel · Bruno Manganelli · Albert Saà-Garriga | N/A | Code |
| SCOTCH and SODA: A Transformer Video Shadow Detection Framework | Lihao Liu · Jean Prost · Lei Zhu · Nicolas Papadakis · Pietro Liò · Carola-Bibiane Schönlieb · Angelica I. Aviles-Rivero | N/A | Code |
| A Unified Spatial-Angular Structured Light for Single-View Acquisition of Shape and Reflectance | Xianmin Xu · Yuxin Lin · Haoyang Zhou · Chong Zeng · Yaxin Yu · Kun Zhou · Hongzhi Wu | N/A | Code |
| Bias in Pruned Vision Models: In-Depth Analysis and Countermeasures | Eugenia Iofinova · Alexandra Peste · Dan Alistarh | N/A | Code |
| InstructPix2Pix: Learning To Follow Image Editing Instructions | Tim Brooks · Aleksander Holynski · Alexei A. Efros | N/A | Code |
| AnchorFormer: Point Cloud Completion From Discriminative Nodes | Zhikai Chen · Fuchen Long · Zhaofan Qiu · Ting Yao · Wengang Zhou · Jiebo Luo · Tao Mei | N/A | Code |
| Robust Test-Time Adaptation in Dynamic Scenarios | Longhui Yuan · Binhui Xie · Shuang Li | N/A | Code |
| AShapeFormer: Semantics-Guided Object-Level Active Shape Encoding for 3D Object Detection via Transformers | Zechuan Li · Hongshan Yu · Zhengeng Yang · Tongjia Chen · Naveed Akhtar | N/A | Code |
| Neural Texture Synthesis With Guided Correspondence | Yang Zhou · Kaijian Chen · Rongjun Xiao · Hui Huang | N/A | Code |
| Learning To Render Novel Views From Wide-Baseline Stereo Pairs | Yilun Du · Cameron Smith · Ayush Tewari · Vincent Sitzmann | N/A | Code |
| Hidden Gems: 4D Radar Scene Flow Learning Using Cross-Modal Supervision | Fangqiang Ding · Andras Palffy · Dariu M. Gavrila · Chris Xiaoxuan Lu | N/A | Code |
| SmallCap: Lightweight Image Captioning Prompted With Retrieval Augmentation | Rita Ramos · Bruno Martins · Desmond Elliott · Yova Kementchedjhieva | N/A | Code |
| PIDNet: A Real-Time Semantic Segmentation Network Inspired by PID Controllers | Jiacong Xu · Zixiang Xiong · Shankar P. Bhattacharyya | N/A | Code |
| NeRFLight: Fast and Light Neural Radiance Fields Using a Shared Feature Grid | Fernando Rivas-Manzaneque · Jorge Sierra-Acosta · Adrian Penate-Sanchez · Francesc Moreno-Noguer · Angela Ribeiro | N/A | Code |
| Fantastic Breaks: A Dataset of Paired 3D Scans of Real-World Broken Objects and Their Complete Counterparts | Nikolas Lamb · Cameron Palmer · Benjamin Molloy · Sean Banerjee · Natasha Kholgade Banerjee | N/A | Code |
| PEAL: Prior-Embedded Explicit Attention Learning for Low-Overlap Point Cloud Registration | Junle Yu · Luwei Ren · Yu Zhang · Wenhui Zhou · Lili Lin · Guojun Dai | N/A | Code |
| Neural Volumetric Memory for Visual Locomotion Control | Ruihan Yang · Ge Yang · Xiaolong Wang | N/A | Code |
| InstantAvatar: Learning Avatars From Monocular Video in 60 Seconds | Tianjian Jiang · Xu Chen · Jie Song · Otmar Hilliges | N/A | Code |
| TMO: Textured Mesh Acquisition of Objects With a Mobile Device by Using Differentiable Rendering | Jaehoon Choi · Dongki Jung · Taejae Lee · Sangwook Kim · Youngdong Jung · Dinesh Manocha · Donghwan Lee | N/A | Code |
| MammalNet: A Large-Scale Video Benchmark for Mammal Recognition and Behavior Understanding | Jun Chen · Ming Hu · Darren J. Coker · Michael L. Berumen · Blair Costelloe · Sara Beery · Anna Rohrbach · Mohamed Elhoseiny | N/A | Code |
| Towards Fast Adaptation of Pretrained Contrastive Models for Multi-Channel Video-Language Retrieval | Xudong Lin · Simran Tiwari · Shiyuan Huang · Manling Li · Mike Zheng Shou · Heng Ji · Shih-Fu Chang | N/A | Code |
| Hierarchical Fine-Grained Image Forgery Detection and Localization | Xiao Guo · Xiaohong Liu · Zhiyuan Ren · Steven Grosz · Iacopo Masi · Xiaoming Liu | N/A | Code |
| SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision | Xubo Liu · Egor Lakomkin · Konstantinos Vougioukas · Pingchuan Ma · Honglie Chen · Ruiming Xie · Morrie Doulaty · Niko Moritz · Jachym Kolar · Stavros Petridis · Maja Pantic · Christian Fuegen | N/A | Code |
| RIATIG: Reliable and Imperceptible Adversarial Text-to-Image Generation With Natural Prompts | Han Liu · Yuhao Wu · Shixuan Zhai · Bo Yuan · Ning Zhang | N/A | Code |
| Unsupervised Intrinsic Image Decomposition With LiDAR Intensity | Shogo Sato · Yasuhiro Yao · Taiga Yoshida · Takuhiro Kaneko · Shingo Ando · Jun Shimamura | N/A | Code |
| SlowLiDAR: Increasing the Latency of LiDAR-Based Detection Using Adversarial Examples | Han Liu · Yuhao Wu · Zhiyuan Yu · Yevgeniy Vorobeychik · Ning Zhang | N/A | Code |
| NeFII: Inverse Rendering for Reflectance Decomposition With Near-Field Indirect Illumination | Haoqian Wu · Zhipeng Hu · Lincheng Li · Yongqiang Zhang · Changjie Fan · Xin Yu | N/A | Code |
| BEV-Guided Multi-Modality Fusion for Driving Perception | Yunze Man · Liang-Yan Gui · Yu-Xiong Wang | N/A | Code |
| MAGVLT: Masked Generative Vision-and-Language Transformer | Sungwoong Kim · Daejin Jo · Donghoon Lee · Jongmin Kim | N/A | Code |
| PEFAT: Boosting Semi-Supervised Medical Image Classification via Pseudo-Loss Estimation and Feature Adversarial Training | Qingjie Zeng · Yutong Xie · Zilin Lu · Yong Xia | N/A | Code |
| Visual Query Tuning: Towards Effective Usage of Intermediate Representations for Parameter and Memory Efficient Transfer Learning | Cheng-Hao Tu · Zheda Mai · Wei-Lun Chao | N/A | Code |
| Decoupling Learning and Remembering: A Bilevel Memory Framework With Knowledge Projection for Task-Incremental Learning | Wenju Sun · Qingyong Li · Jing Zhang · Wen Wang · Yangli-ao Geng | N/A | Code |
| PMR: Prototypical Modal Rebalance for Multimodal Learning | Yunfeng Fan · Wenchao Xu · Haozhao Wang · Junxiao Wang · Song Guo | N/A | Code |
| DART: Diversify-Aggregate-Repeat Training Improves Generalization of Neural Networks | Samyak Jain · Sravanti Addepalli · Pawan Kumar Sahu · Priyam Dey · R. Venkatesh Babu | N/A | Code |
| Abstract Visual Reasoning: An Algebraic Approach for Solving Raven’s Progressive Matrices | Jingyi Xu · Tushar Vaidya · Yufei Wu · Saket Chandra · Zhangsheng Lai · Kai Fong Ernest Chong | N/A | Code |
| Swept-Angle Synthetic Wavelength Interferometry | Alankar Kotwal · Anat Levin · Ioannis Gkioulekas | N/A | Code |
| Passive Micron-Scale Time-of-Flight With Sunlight Interferometry | Alankar Kotwal · Anat Levin · Ioannis Gkioulekas | N/A | Code |
| Meta-Learning With a Geometry-Adaptive Preconditioner | Suhyun Kang · Duhun Hwang · Moonjung Eo · Taesup Kim · Wonjong Rhee | N/A | Code |
| 3D GAN Inversion With Facial Symmetry Prior | Fei Yin · Yong Zhang · Xuan Wang · Tengfei Wang · Xiaoyu Li · Yuan Gong · Yanbo Fan · Xiaodong Cun · Ying Shan · Cengiz Oztireli · Yujiu Yang | N/A | Code |
| ZBS: Zero-Shot Background Subtraction via Instance-Level Background Modeling and Foreground Selection | Yongqi An · Xu Zhao · Tao Yu · Haiyun Guo · Chaoyang Zhao · Ming Tang · Jinqiao Wang | N/A | Code |
| Neural Lens Modeling | Wenqi Xian · Aljaž Božič · Noah Snavely · Christoph Lassner | N/A | Code |
| A Probabilistic Framework for Lifelong Test-Time Adaptation | Dhanajit Brahma · Piyush Rai | N/A | Code |
| Few-Shot Class-Incremental Learning via Class-Aware Bilateral Distillation | Linglan Zhao · Jing Lu · Yunlu Xu · Zhanzhan Cheng · Dashan Guo · Yi Niu · Xiangzhong Fang | N/A | Code |
| GradMA: A Gradient-Memory-Based Accelerated Federated Learning With Alleviated Catastrophic Forgetting | Kangyang Luo · Xiang Li · Yunshi Lan · Ming Gao | N/A | Code |
| Hyperspherical Embedding for Point Cloud Completion | Junming Zhang · Haomeng Zhang · Ram Vasudevan · Matthew Johnson-Roberson | N/A | Code |
| Local-Guided Global: Paired Similarity Representation for Visual Reinforcement Learning | Hyesong Choi · Hunsang Lee · Wonil Song · Sangryul Jeon · Kwanghoon Sohn · Dongbo Min | N/A | Code |
| Learning Orthogonal Prototypes for Generalized Few-Shot Semantic Segmentation | Sun-Ao Liu · Yiheng Zhang · Zhaofan Qiu · Hongtao Xie · Yongdong Zhang · Ting Yao | N/A | Code |
| DSFNet: Dual Space Fusion Network for Occlusion-Robust 3D Dense Face Alignment | Heyuan Li · Bo Wang · Yu Cheng · Mohan Kankanhalli · Robby T. Tan | N/A | Code |
| SViTT: Temporal Learning of Sparse Video-Text Transformers | Yi Li · Kyle Min · Subarna Tripathi · Nuno Vasconcelos | N/A | Code |
| Independent Component Alignment for Multi-Task Learning | Dmitry Senushkin · Nikolay Patakin · Arseny Kuznetsov · Anton Konushin | N/A | Code |
| Logical Implications for Visual Question Answering Consistency | Sergio Tascon-Morales · Pablo Márquez-Neila · Raphael Sznitman | N/A | Code |
| MaskCon: Masked Contrastive Learning for Coarse-Labelled Dataset | Chen Feng · Ioannis Patras | N/A | Code |
| Image as a Foreign Language: BEiT Pretraining for Vision and Vision-Language Tasks | Wenhui Wang · Hangbo Bao · Li Dong · Johan Bjorck · Zhiliang Peng · Qiang Liu · Kriti Aggarwal · Owais Khan Mohammed · Saksham Singhal · Subhojit Som · Furu Wei | N/A | Code |
| Manipulating Transfer Learning for Property Inference | Yulong Tian · Fnu Suya · Anshuman Suri · Fengyuan Xu · David Evans | N/A | Code |
| DualRefine: Self-Supervised Depth and Pose Estimation Through Iterative Epipolar Sampling and Refinement Toward Equilibrium | Antyanta Bangunharcana · Ahmed Magd · Kyung-Soo Kim | N/A | Code |
| Learning a 3D Morphable Face Reflectance Model From Low-Cost Data | Yuxuan Han · Zhibo Wang · Feng Xu | N/A | Code |
| Principles of Forgetting in Domain-Incremental Semantic Segmentation in Adverse Weather Conditions | Tobias Kalb · Jürgen Beyerer | N/A | Code |
| Diffusion Art or Digital Forgery? Investigating Data Replication in Diffusion Models | Gowthami Somepalli · Vasu Singla · Micah Goldblum · Jonas Geiping · Tom Goldstein | N/A | Code |
| Adaptive Data-Free Quantization | Biao Qian · Yang Wang · Richang Hong · Meng Wang | N/A | Code |
| Coreset Sampling From Open-Set for Fine-Grained Self-Supervised Learning | Sungnyun Kim · Sangmin Bae · Se-Young Yun | N/A | Code |
| Jedi: Entropy-Based Localization and Removal of Adversarial Patches | Bilel Tarchoun · Anouar Ben Khalifa · Mohamed Ali Mahjoub · Nael Abu-Ghazaleh · Ihsen Alouani | N/A | Code |
| Uncovering the Disentanglement Capability in Text-to-Image Diffusion Models | Qiucheng Wu · Yujian Liu · Handong Zhao · Ajinkya Kale · Trung Bui · Tong Yu · Zhe Lin · Yang Zhang · Shiyu Chang | N/A | Code |
| Semantic-Conditional Diffusion Networks for Image Captioning | Jianjie Luo · Yehao Li · Yingwei Pan · Ting Yao · Jianlin Feng · Hongyang Chao · Tao Mei | N/A | Code |
| Instance-Specific and Model-Adaptive Supervision for Semi-Supervised Semantic Segmentation | Zhen Zhao · Sifan Long · Jimin Pi · Jingdong Wang · Luping Zhou | N/A | Code |
| Improving Robustness of Semantic Segmentation to Motion-Blur Using Class-Centric Augmentation | Aakanksha Aakanksha · A. N. Rajagopalan | N/A | Code |
| MetaViewer: Towards a Unified Multi-View Representation | Ren Wang · Haoliang Sun · Yuling Ma · Xiaoming Xi · Yilong Yin | N/A | Code |
| Attribute-Preserving Face Dataset Anonymization via Latent Code Optimization | Simone Barattin · Christos Tzelepis · Ioannis Patras · Nicu Sebe | N/A | Code |
| A Light Weight Model for Active Speaker Detection | Junhua Liao · Haihan Duan · Kanghui Feng · Wanbing Zhao · Yanbing Yang · Liangyin Chen | N/A | Code |
| Shifted Diffusion for Text-to-Image Generation | Yufan Zhou · Bingchen Liu · Yizhe Zhu · Xiao Yang · Changyou Chen · Jinhui Xu | N/A | Code |
| Modular Memorability: Tiered Representations for Video Memorability Prediction | Théo Dumont · Juan Segundo Hevia · Camilo L. Fosco | N/A | Code |
| Learning Articulated Shape With Keypoint Pseudo-Labels From Web Images | Anastasis Stathopoulos · Georgios Pavlakos · Ligong Han · Dimitris N. Metaxas | N/A | Code |
| RMLVQA: A Margin Loss Approach for Visual Question Answering With Language Biases | Abhipsa Basu · Sravanti Addepalli · R. Venkatesh Babu | N/A | Code |
| RealImpact: A Dataset of Impact Sound Fields for Real Objects | Samuel Clarke · Ruohan Gao · Mason Wang · Mark Rau · Julia Xu · Jui-Hsien Wang · Doug L. James · Jiajun Wu | N/A | Code |
| Neural Rate Estimator and Unsupervised Learning for Efficient Distributed Image Analytics in Split-DNN Models | Nilesh Ahuja · Parual Datta · Bhavya Kanzariya · V. Srinivasa Somayazulu · Omesh Tickoo | N/A | Code |
| Improving Vision-and-Language Navigation by Generating Future-View Image Semantics | Jialu Li · Mohit Bansal | N/A | Code |
| Simulated Annealing in Early Layers Leads to Better Generalization | Amir M. Sarfi · Zahra Karimpour · Muawiz Chaudhary · Nasir M. Khalid · Mirco Ravanelli · Sudhir Mudur · Eugene Belilovsky | N/A | Code |
| From Images to Textual Prompts: Zero-Shot Visual Question Answering With Frozen Large Language Models | Jiaxian Guo · Junnan Li · Dongxu Li · Anthony Meng Huat Tiong · Boyang Li · Dacheng Tao · Steven Hoi | N/A | Code |
| Where We Are and What We’re Looking At: Query Based Worldwide Image Geo-Localization Using Hierarchies and Scenes | Brandon Clark · Alec Kerrigan · Parth Parag Kulkarni · Vicente Vivanco Cepeda · Mubarak Shah | N/A | Code |
| CLIP-Sculptor: Zero-Shot Generation of High-Fidelity and Diverse Shapes From Natural Language | Aditya Sanghi · Rao Fu · Vivian Liu · Karl D.D. Willis · Hooman Shayani · Amir H. Khasahmadi · Srinath Sridhar · Daniel Ritchie | N/A | Code |
| Learning To Generate Text-Grounded Mask for Open-World Semantic Segmentation From Only Image-Text Pairs | Junbum Cha · Jonghwan Mun · Byungseok Roh | N/A | Code |
| Imitation Learning As State Matching via Differentiable Physics | Siwei Chen · Xiao Ma · Zhongwen Xu | N/A | Code |
| BBDM: Image-to-Image Translation With Brownian Bridge Diffusion Models | Bo Li · Kaitao Xue · Bin Liu · Yu-Kun Lai | N/A | Code |
| CHMATCH: Contrastive Hierarchical Matching and Robust Adaptive Threshold Boosted Semi-Supervised Learning | Jianlong Wu · Haozhe Yang · Tian Gan · Ning Ding · Feijun Jiang · Liqiang Nie | N/A | Code |
| Re-GAN: Data-Efficient GANs Training via Architectural Reconfiguration | Divya Saxena · Jiannong Cao · Jiahao Xu · Tarun Kulshrestha | N/A | Code |
| Learning Debiased Representations via Conditional Attribute Interpolation | Yi-Kai Zhang · Qi-Wei Wang · De-Chuan Zhan · Han-Jia Ye | N/A | Code |
| Weakly Supervised Posture Mining for Fine-Grained Classification | Zhenchao Tang · Hualin Yang · Calvin Yu-Chian Chen | N/A | Code |
| Learning a Practical SDR-to-HDRTV Up-Conversion Using New Dataset and Degradation Models | Cheng Guo · Leidong Fan · Ziyu Xue · Xiuhua Jiang | N/A | Code |
| VectorFusion: Text-to-SVG by Abstracting Pixel-Based Diffusion Models | Ajay Jain · Amber Xie · Pieter Abbeel | N/A | Code |
| Adversarial Robustness via Random Projection Filters | Minjing Dong · Chang Xu | N/A | Code |
CVPR 2024
| Title | Author | PDF_Link | Code_URL |
|---|---|---|---|
| X-3D: Explicit 3D Structure Modeling for Point Cloud Recognition | Shuofeng Sun · Yongming Rao · Jiwen Lu · Haibin Yan | N/A | Code |
| BiPer: Binary Neural Networks using a Periodic Function | Edwin Vargas · Claudia Correa · Carlos Hinojosa · Henry Arguello | N/A | Code |
| Understanding and Improving Source-free Domain Adaptation from a Theoretical Perspective | Yu Mitsuzumi · Akisato Kimura · Hisashi Kashima | N/A | Code |
| Putting the Object Back into Video Object Segmentation | Ho Kei Cheng · Seoung Wug Oh · Brian Price · Joon-Young Lee · Alexander G. Schwing | N/A | Code |
| CoDeF: Content Deformation Fields for Temporally Consistent Video Processing | Hao Ouyang · Qiuyu Wang · Yuxi Xiao · Qingyan Bai · Juntao Zhang · Kecheng Zheng · Xiaowei Zhou · Qifeng Chen · Yujun Shen | N/A | Code |
| Lane2Seq: Towards Unified Lane Detection via Sequence Generation | Kunyang Zhou | N/A | Code |
| CorrMatch: Label Propagation via Correlation Matching for Semi-Supervised Semantic Segmentation | Bo-Yuan Sun · Yuqi Yang · Le Zhang · Ming-Ming Cheng · Qibin Hou | N/A | Code |
| Rethinking Boundary Discontinuity Problem for Oriented Object Detection | Hang Xu · Xinyuan Liu · Haonan Xu · Yike Ma · Zunjie Zhu · Chenggang Yan · Feng Dai | N/A | Code |
| SPIDeRS: Structured Polarization for Invisible Depth and Reflectance Sensing | Tomoki Ichikawa · Shohei Nobuhara · Ko Nishino | N/A | Code |
| Dual Prior Unfolding for Snapshot Compressive Imaging | Jiancheng Zhang · Haijin Zeng · Jiezhang Cao · Yongyong Chen · Dengxiu Yu · Yinping Zhao | N/A | Code |
| MCNet: Rethinking the Core Ingredients for Accurate and Efficient Homography Estimation | Haokai Zhu · Si-Yuan Cao · Jianxin Hu · Sitong Zuo · Beinan Yu · Jiacheng Ying · Junwei Li · Hui-Liang Shen | N/A | Code |
| CuVLER: Enhanced Unsupervised Object Discoveries through Exhaustive Self-Supervised Transformers | Shahaf Arica · Or Rubin · Sapir Gershov · Shlomi Laufer | N/A | Code |
| UniDepth: Universal Monocular Metric Depth Estimation | Luigi Piccinelli · Yung-Hsu Yang · Christos Sakaridis · Mattia Segu · Siyuan Li · Luc Van Gool · Fisher Yu | N/A | Code |
| Masked Autoencoders for Microscopy are Scalable Learners of Cellular Biology | Oren Kraus · Kian Kenyon-Dean · Saber Saberian · Maryam Fallah · Peter McLean · Jess Leung · Vasudev Sharma · Ayla Khan · Jia Balakrishnan · Safiye Celik · Dominique Beaini · Maciej Sypetkowski · Chi Cheng · Kristen Morse · Maureen Makes · Ben Mabey · Berton Earnshaw | N/A | Code |
| Adapt or Perish: Adaptive Sparse Transformer with Attentive Feature Refinement for Image Restoration | Shihao Zhou · Duosheng Chen · Jinshan Pan · Jinglei Shi · Jufeng Yang | N/A | Code |
| 3DGStream: On-the-Fly Training of 3D Gaussians for Efficient Streaming of Photo-Realistic Free-Viewpoint Videos | Jiakai Sun · Han Jiao · Guangyuan Li · Zhanjie Zhang · Lei Zhao · Wei Xing | N/A | Code |
| LTM: Lightweight Textured Mesh Extraction and Refinement of Large Unbounded Scenes for Efficient Storage and Real-time Rendering | Jaehoon Choi · Rajvi Shah · Qinbo Li · Yipeng Wang · Ayush Saraf · Changil Kim · Jia-Bin Huang · Dinesh Manocha · Suhib Alsisan · Johannes Kopf | N/A | Code |
| VCoder: Versatile Vision Encoders for Multimodal Large Language Models | Jitesh Jain · Jianwei Yang · Humphrey Shi | N/A | Code |
| Geometry Transfer for Stylizing Radiance Fields | Hyunyoung Jung · Seonghyeon Nam · Nikolaos Sarafianos · Sungjoo Yoo · Alexander Sorkine-Hornung · Rakesh Ranjan | N/A | Code |
| Prompt3D: Random Prompt Assisted Weakly-Supervised 3D Object Detection | Xiaohong Zhang · Huisheng Ye · Jingwen Li · Qinyu Tang · Yuanqi Li · Yanwen Guo · Jie Guo | N/A | Code |
| Can Language Beat Numerical Regression? Language-Based Multimodal Trajectory Prediction | Inhwan Bae · Junoh Lee · Hae-Gon Jeon | N/A | Code |
| Efficient Meshflow and Optical Flow Estimation from Event Cameras | Xinglong Luo · Ao Luo · Zhengning Wang · Chunyu Lin · Bing Zeng · Shuaicheng Liu | N/A | Code |
| Boosting Self-Supervision for Single-View Scene Completion via Knowledge Distillation | Keonhee Han · Dominik Muhle · Felix Wimbauer · Daniel Cremers | N/A | Code |
| CrossKD: Cross-Head Knowledge Distillation for Object Detection | JiaBao Wang · Yuming Chen · Zhaohui Zheng · Xiang Li · Ming-Ming Cheng · Qibin Hou | N/A | Code |
| Continual-MAE: Adaptive Distribution Masked Autoencoders for Continual Test-Time Adaptation | Jiaming Liu · Ran Xu · Senqiao Yang · Renrui Zhang · Qizhe Zhang · Zehui Chen · Yandong Guo · Shanghang Zhang | N/A | Code |
| TiNO-Edit: Timestep and Noise Optimization for Robust Diffusion-Based Image Editing | Sherry X. Chen · Yaron Vaxman · Elad Ben Baruch · David Asulin · Aviad Moreshet · Kuo-Chin Lien · Misha Sra · Pradeep Sen | N/A | Code |
| Leveraging Camera Triplets for Efficient and Accurate Structure-from-Motion | Lalit Manam · Venu Madhav Govindu | N/A | Code |
| LEAD: Learning Decomposition for Source-free Universal Domain Adaptation | Sanqing Qu · Tianpei Zou · Lianghua He · Florian Röhrbein · Alois Knoll · Guang Chen · Changjun Jiang | N/A | Code |
| CG-HOI: Contact-Guided 3D Human-Object Interaction Generation | Christian Diller · Angela Dai | N/A | Code |
| Seeing Motion at Nighttime with an Event Camera | Haoyue Liu · Shihan Peng · Lin Zhu · Yi Chang · Hanyu Zhou · Luxin Yan | N/A | Code |
| Resurrecting Old Classes with New Data for Exemplar-Free Continual Learning | Dipam Goswami · Albin Soutif · Yuyang Liu · Sandesh Kamath · Bartłomiej Twardowski · Joost van de Weijer | N/A | Code |
| ProS: Prompting-to-simulate Generalized knowledge for Universal Cross-Domain Retrieval | Fang Kaipeng · Jingkuan Song · Lianli Gao · Pengpeng Zeng · Zhi-Qi Cheng · Xiyao Li · Heng Tao Shen | N/A | Code |
| NeISF: Neural Incident Stokes Field for Geometry and Material Estimation | Chenhao Li · Taishi Ono · Takeshi Uemori · Hajime Mihara · Alexander Gatto · Hajime Nagahara · Yusuke Moriuchi | N/A | Code |
| PromptKD: Unsupervised Prompt Distillation for Vision-Language Models | Zheng Li · Xiang Li · Xinyi Fu · Xin Zhang · Weiqiang Wang · Shuo Chen · Jian Yang | N/A | Code |
| Domain Gap Embeddings for Generative Dataset Augmentation | Yinong Oliver Wang · Younjoon Chung · Chen Henry Wu · Fernando De la Torre | N/A | Code |
| Domain-Agnostic Mutual Prompting for Unsupervised Domain Adaptation | Zhekai Du · Xinyao Li · Fengling Li · Ke Lu · Lei Zhu · Jingjing Li | N/A | Code |
| Leveraging Vision-Language Models for Improving Domain Generalization in Image Classification | Sravanti Addepalli · Ashish Asokan · Lakshay Sharma · R. Venkatesh Babu | N/A | Code |
| IMPRINT: Generative Object Compositing by Learning Identity-Preserving Representation | Yizhi Song · Zhifei Zhang · Zhe Lin · Scott Cohen · Brian Price · Jianming Zhang · Soo Ye Kim · He Zhang · Wei Xiong · Daniel Aliaga | N/A | Code |
| Absolute Pose from One or Two Scaled and Oriented Features | Jonathan Ventura · Zuzana Kukelova · Torsten Sattler · Daniel Barath | N/A | Code |
| DSGG: Dense Relation Transformer for an End-to-end Scene Graph Generation | Zeeshan Hayder · Xuming He | N/A | Code |
| RealCustom: Narrowing Real Text Word for Real-Time Open-Domain Text-to-Image Customization | Mengqi Huang · Zhendong Mao · Mingcong Liu · Qian He · Yongdong Zhang | N/A | Code |
| LiDAR4D: Dynamic Neural Fields for Novel Space-time View LiDAR Synthesis | Zehan Zheng · Fan Lu · Weiyi Xue · Guang Chen · Changjun Jiang | N/A | Code |
| ECLIPSE: Efficient Continual Learning in Panoptic Segmentation with Visual Prompt Tuning | Beomyoung Kim · Joonsang Yu · Sung Ju Hwang | N/A | Code |
| Design2Cloth: 3D Cloth Generation from 2D Masks | Jiali Zheng · Rolandos Alexandros Potamias · Stefanos Zafeiriou | N/A | Code |
| DS-NeRV: Implicit Neural Video Representation with Decomposed Static and Dynamic Codes | Hao Yan · Zhihui Ke · Xiaobo Zhou · Tie Qiu · Xidong Shi · DaDong Jiang | N/A | Code |
| Rolling Shutter Correction with Intermediate Distortion Flow Estimation | Mingdeng Cao · Sidi Yang · Yujiu Yang · Yinqiang Zheng | N/A | Code |
| Hybrid Functional Maps for Crease-Aware Non-Isometric Shape Matching | Lennart Bastian · Yizheng Xie · Nassir Navab · Zorah Lähner | N/A | Code |
| SFOD: Spiking Fusion Object Detector | Yimeng Fan · Wei Zhang · Changsong Liu · Mingyang Li · Wenrui Lu | N/A | Code |
| GraphDreamer: Compositional 3D Scene Synthesis from Scene Graphs | Gege Gao · Weiyang Liu · Anpei Chen · Andreas Geiger · Bernhard Schölkopf | N/A | Code |
| Not All Voxels Are Equal: Hardness-Aware Semantic Scene Completion with Self-Distillation | Song Wang · Jiawei Yu · Wentong Li · Wenyu Liu · Xiaolu Liu · Junbo Chen · Jianke Zhu | N/A | Code |
| MuRF: Multi-Baseline Radiance Fields | Haofei Xu · Anpei Chen · Yuedong Chen · Christos Sakaridis · Yulun Zhang · Marc Pollefeys · Andreas Geiger · Fisher Yu | N/A | Code |
| Learnable Earth Parser: Discovering 3D Prototypes in Aerial Scans | Romain Loiseau · Elliot Vincent · Mathieu Aubry · Loic Landrieu | N/A | Code |
| TeTriRF: Temporal Tri-Plane Radiance Fields for Efficient Free-Viewpoint Video | Minye Wu · Zehao Wang · Georgios Kouros · Tinne Tuytelaars | N/A | Code |
| PIGEON: Predicting Image Geolocations | Lukas Haas · Michal Skreta · Silas Alberti · Chelsea Finn | N/A | Code |
| FMA-Net: Flow-Guided Dynamic Filtering and Iterative Feature Refinement with Multi-Attention for Joint Video Super-Resolution and Deblurring | Geunhyuk Youk · Jihyong Oh · Munchurl Kim | N/A | Code |
| GPLD3D: Latent Diffusion of 3D Shape Generative Models by Enforcing Geometric and Physical Priors | Yuan Dong · Qi Zuo · Xiaodong Gu · Weihao Yuan · Zhengyi Zhao · Zilong Dong · Liefeng Bo · Qixing Huang | N/A | Code |
| Understanding Video Transformers via Universal Concept Discovery | Matthew Kowal · Achal Dave · Rares Andrei Ambrus · Adrien Gaidon · Kosta Derpanis · Pavel Tokmakov | N/A | Code |
| HashPoint: Accelerated Point Searching and Sampling for Neural Rendering | Jiahao Ma · Miaomiao Liu · David Ahmedt-Aristizabal · Chuong Nguyen | N/A | Code |
| Kandinsky Conformal Prediction: Efficient Calibration of Image Segmentation Algorithms | Joren Brunekreef · Eric Marcus · Ray Sheombarsing · Jan-Jakob Sonke · Jonas Teuwen | N/A | Code |
| Synthesize, Diagnose, and Optimize: Towards Fine-Grained Vision-Language Understanding | Wujian Peng · Sicheng Xie · Zuyao You · Shiyi Lan · Zuxuan Wu | N/A | Code |
| FedAS: Bridging Inconsistency in Personalized Federated Learning | Xiyuan Yang · Wenke Huang · Mang Ye | N/A | Code |
| COLMAP-Free 3D Gaussian Splatting | Yang Fu · Sifei Liu · Amey Kulkarni · Jan Kautz · Alexei A. Efros · Xiaolong Wang | N/A | Code |
| Amodal Completion via Progressive Mixed Context Diffusion | Katherine Xu · Lingzhi Zhang · Jianbo Shi | N/A | Code |
| LayoutLLM: Layout Instruction Tuning with Large Language Models for Document Understanding | Chuwei Luo · Yufan Shen · Zhaoqing Zhu · Qi Zheng · Zhi Yu · Cong Yao | N/A | Code |
| Attention-Driven Training-Free Efficiency Enhancement of Diffusion Models | Hongjie Wang · Difan Liu · Yan Kang · Yijun Li · Zhe Lin · Niraj Jha · Yuchen Liu | N/A | Code |
| Naturally Supervised 3D Visual Grounding with Language-Regularized Concept Learners | Chun Feng · Joy Hsu · Weiyu Liu · Jiajun Wu | N/A | Code |
| PIE-NeRF: Physics-based Interactive Elastodynamics with NeRF | Yutao Feng · Yintong Shang · Xuan Li · Tianjia Shao · Chenfanfu Jiang · Yin Yang | N/A | Code |
| SpatialVLM: Endowing Vision-Language Models with Spatial Reasoning Capabilities | Boyuan Chen · Zhuo Xu · Sean Kirmani · Brian Ichter · Dorsa Sadigh · Leonidas Guibas · Fei Xia | N/A | Code |
| 3D Paintbrush: Local Stylization of 3D Shapes with Cascaded Score Distillation | Dale Decatur · Itai Lang · Kfir Aberman · Rana Hanocka | N/A | Code |
| Blur2Blur: Blur Conversion for Unsupervised Image Deblurring on Unknown Domains | Bang-Dang Pham · Phong Tran · Anh Tran · Cuong Pham · Rang Nguyen · Minh Hoai | N/A | Code |
| RAVE: Randomized Noise Shuffling for Fast and Consistent Video Editing with Diffusion Models | Ozgur Kara · Bariscan Kurtkaya · Hidir Yesiltepe · James Rehg · Pinar Yanardag | N/A | Code |
| Generalizable Novel-View Synthesis using a Stereo Camera | Haechan Lee · Wonjoon Jin · Seung-Hwan Baek · Sunghyun Cho | N/A | Code |
| Learning Structure-from-Motion with Graph Attention Networks | Lucas Brynte · José Pedro Iglesias · Carl Olsson · Fredrik Kahl | N/A | Code |
| In-distribution Public Data Synthesis with Diffusion Models for Differentially Private Image Classification | Jinseong Park · Yujin Choi · Jaewook Lee | N/A | Code |
| Don’t Drop Your Samples! Coherence-Aware Training Benefits Conditional Diffusion | Nicolas Dufour · Victor Besnier · Vicky Kalogeiton · David Picard | N/A | Code |
| SNIFFER: Multimodal Large Language Model for Explainable Out-of-Context Misinformation Detection | Peng Qi · Zehong Yan · Wynne Hsu · Mong Li Lee | N/A | Code |
| Visual Delta Generator with Large Multi-modal Models for Semi-supervised Composed Image Retrieval | Young Kyun Jang · Donghyun Kim · Zihang Meng · Dat Huynh · Ser-Nam Lim | N/A | Code |
| Action-slot: Visual Action-centric Representations for Multi-label Atomic Activity Recognition in Traffic Scenes | Chi-Hsi Kung · 書緯 呂 · Yi-Hsuan Tsai · Yi-Ting Chen | N/A | Code |
| Diff-BGM: A Diffusion Model for Video Background Music Generation | Sizhe Li · Yiming Qin · Minghang Zheng · Xin Jin · Yang Liu | N/A | Code |
| ArtAdapter: Text-to-Image Style Transfer using Multi-Level Style Encoder and Explicit Adaptation | Dar-Yen Chen · Hamish Tennent · Ching-Wen Hsu | N/A | Code |
| Specularity Factorization for Low-Light Enhancement | Saurabh Saini · P. J. Narayanan | N/A | Code |
| Latent Modulated Function for Computational Optimal Continuous Image Representation | Zongyao He · Zhi Jin | N/A | Code |
| Makeup Prior Models for 3D Facial Makeup Estimation and Applications | Xingchao Yang · Takafumi Taketomi · Yuki Endo · Yoshihiro Kanamori | N/A | Code |
| Spherical Mask: Coarse-to-Fine 3D Point Cloud Instance Segmentation with Spherical Representation | Sangyun Shin · Kaichen Zhou · Madhu Vankadari · Andrew Markham · Niki Trigoni | N/A | Code |
| OED: Towards One-stage End-to-End Dynamic Scene Graph Generation | Guan Wang · Zhimin Li · Qingchao Chen · Yang Liu | N/A | Code |
| SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis | Ziqiao Peng · Wentao Hu · Yue Shi · Xiangyu Zhu · Xiaomei Zhang · Hao Zhao · Jun He · Hongyan Liu · Zhaoxin Fan | N/A | Code |
| MTLoRA: Low-Rank Adaptation Approach for Efficient Multi-Task Learning | Ahmed Agiza · Marina Neseem · Sherief Reda | N/A | Code |
| Inverse Rendering of Glossy Objects via the Neural Plenoptic Function and Radiance Fields | Haoyuan Wang · Wenbo Hu · Lei Zhu · Rynson W.H. Lau | N/A | Code |
| PREGO: Online Mistake Detection in PRocedural EGOcentric Videos | Alessandro Flaborea · Guido M. D'Amely di Melendugno · Leonardo Plini · Luca Scofano · Edoardo De Matteis · Antonino Furnari · Giovanni Maria Farinella · Fabio Galasso | N/A | Code |
| 3D LiDAR Mapping in Dynamic Environments using a 4D Implicit Neural Representation | Xingguang Zhong · Yue Pan · Cyrill Stachniss · Jens Behley | N/A | Code |
| SelfOcc: Self-Supervised Vision-Based 3D Occupancy Prediction | Yuanhui Huang · Wenzhao Zheng · Borui Zhang · Jie Zhou · Jiwen Lu | N/A | Code |
| DiffusionGAN3D: Boosting Text-guided 3D Generation and Domain Adaptation by Combining 3D GANs and Diffusion Priors | Biwen Lei · Kai Yu · Mengyang Feng · Miaomiao Cui · Xuansong Xie | N/A | Code |
| See, Say, and Segment: Teaching LMMs to Overcome False Premises | Tsung-Han Wu · Giscard Biamby · David Chan · Lisa Dunlap · Ritwik Gupta · XuDong Wang · Trevor Darrell · Joseph Gonzalez | N/A | Code |
| PRDP: Proximal Reward Difference Prediction for Large-Scale Reward Finetuning of Diffusion Models | Fei Deng · Qifei Wang · Wei Wei · Tingbo Hou · Matthias Grundmann | N/A | Code |
| Groupwise Query Specialization and Quality-Aware Multi-Assignment for Transformer-based Visual Relationship Detection | Jongha Kim · Jihwan Park · Jinyoung Park · Jinyoung Kim · Sehyung Kim · Hyunwoo J. Kim | N/A | Code |
| MMVP: A Multimodal MoCap Dataset with Vision and Pressure Sensors | He Zhang · Shenghao Ren · Haolei Yuan · Jianhui Zhao · Fan Li · Shuangpeng Sun · Zhenghao Liang · Tao Yu · Qiu Shen · Xun Cao | N/A | Code |
| Cam4DOcc: Benchmark for Camera-Only 4D Occupancy Forecasting in Autonomous Driving Applications | Junyi Ma · Xieyuanli Chen · Jiawei Huang · Jingyi Xu · Zhen Luo · Jintao Xu · Weihao Gu · Rui Ai · Hesheng Wang | N/A | Code |
| Relightable and Animatable Neural Avatar from Sparse-View Video | Zhen Xu · Sida Peng · Chen Geng · Linzhan Mou · Zihan Yan · Jiaming Sun · Hujun Bao · Xiaowei Zhou | N/A | Code |
| Objects as Volumes: A Stochastic Geometry View of Opaque Solids | Bailey Miller · Hanyu Chen · Alice Lai · Ioannis Gkioulekas | N/A | Code |
| Feature 3DGS: Supercharging 3D Gaussian Splatting to Enable Distilled Feature Fields | Shijie Zhou · Haoran Chang · Sicheng Jiang · Zhiwen Fan · Zehao Zhu · Dejia Xu · Pradyumna Chari · Suya You · Zhangyang Wang · Achuta Kadambi | N/A | Code |
| ControlRoom3D: Room Generation using Semantic Proxy Rooms | Jonas Schult · Sam Tsai · Lukas Höllein · Bichen Wu · Jialiang Wang · Chih-Yao Ma · Kunpeng Li · Xiaofang Wang · Felix Wimbauer · Zijian He · Peizhao Zhang · Bastian Leibe · Peter Vajda · Ji Hou | N/A | Code |
| WANDR: Intention-guided Human Motion Generation | Markos Diomataris · Nikos Athanasiou · Omid Taheri · Xi Wang · Otmar Hilliges · Michael J. Black | N/A | Code |
| Do You Remember? Dense Video Captioning with Cross-Modal Memory Retrieval | Minkuk Kim · Hyeon Bae Kim · Jinyoung Moon · Jinwoo Choi · Seong Tae Kim | N/A | Code |
| Rich Human Feedback for Text-to-Image Generation | Youwei Liang · Junfeng He · Gang Li · Peizhao Li · Arseniy Klimovskiy · Nicholas Carolan · Jiao Sun · Jordi Pont-Tuset · Sarah Young · Feng Yang · Junjie Ke · Krishnamurthy Dvijotham · Katherine Collins · Yiwen Luo · Yang Li · Kai Kohlhoff · Deepak Ramachandran · Vidhya Navalpakkam | N/A | Code |
| SSR-Encoder: Encoding Selective Subject Representation for Subject-Driven Generation | Yuxuan Zhang · Yiren Song · Jiaming Liu · Rui Wang · Jinpeng Yu · Hao Tang · Huaxia Li · Xu Tang · Yao Hu · Han Pan · Zhongliang Jing | N/A | Code |
| Learning from Observer Gaze: Zero-Shot Attention Prediction Oriented by Human-Object Interaction Recognition | Yuchen Zhou · Linkai Liu · Chao Gou | N/A | Code |
| Super-Resolution Reconstruction from Bayer-Pattern Spike Streams | Yanchen Dong · Ruiqin Xiong · Jian Zhang · Zhaofei Yu · Xiaopeng Fan · Shuyuan Zhu · Tiejun Huang | N/A | Code |
| Image Neural Field Diffusion Models | Yinbo Chen · Oliver Wang · Richard Zhang · Eli Shechtman · Xiaolong Wang · Michaël Gharbi | N/A | Code |
| Artist-Friendly Relightable and Animatable Neural Heads | Yingyan Xu · Prashanth Chandran · Sebastian Weiss · Markus Gross · Gaspard Zoss · Derek Bradley | N/A | Code |
| Anchor-based Robust Finetuning of Vision-Language Models | Jinwei Han · Zhiwen Lin · Zhongyisun Sun · Yingguo Gao · Ke Yan · Shouhong Ding · Yuan Gao · Gui-Song Xia | N/A | Code |
| SwiftBrush: One-Step Text-to-Image Diffusion Model with Variational Score Distillation | Thuan Nguyen · Anh Tran | N/A | Code |
| Looking 3D: Anomaly Detection with 2D-3D Alignment | Ankan Kumar Bhunia · Changjian Li · Hakan Bilen | N/A | Code |
| EventPS: Real-Time Photometric Stereo Using an Event Camera | Bohan Yu · Jieji Ren · Jin Han · Feishi Wang · Jinxiu Liang · Boxin Shi | N/A | Code |
| Diffusion Handles: Enabling 3D Edits for Diffusion Models by Lifting Activations to 3D | Karran Pandey · Paul Guerrero · Matheus Gadelha · Yannick Hold-Geoffroy · Karan Singh · Niloy J. Mitra | N/A | Code |
| Circuit Design and Efficient Simulation of Quantum Inner Product and Empirical Studies of Its Effect on Near-Term Hybrid Quantum-Classic Machine Learning | Hao Xiong · Yehui Tang · Xinyu Ye · Junchi Yan | N/A | Code |
| OpenBias: Open-set Bias Detection in Text-to-Image Generative Models | Moreno D'Incà · Elia Peruzzo · Massimiliano Mancini · Dejia Xu · Vidit Goel · Xingqian Xu · Zhangyang Wang · Humphrey Shi · Nicu Sebe | N/A | Code |
| COTR: Compact Occupancy TRansformer for Vision-based 3D Occupancy Prediction | Qihang Ma · Xin Tan · Yanyun Qu · Lizhuang Ma · Zhizhong Zhang · Yuan Xie | N/A | Code |
| Towards 3D Vision with Low-Cost Single-Photon Cameras | Fangzhou Mu · Carter Sifferman · Sacha Jungerman · Yiquan Li · Zhiyue Han · Michael Gleicher · Mohit Gupta · Yin Li | N/A | Code |
| Shadows Don't Lie and Lines Can't Bend! Generative Models don't know Projective Geometry...for now | Ayush Sarkar · Hanlin Mai · Amitabh Mahapatra · David Forsyth · Svetlana Lazebnik · Anand Bhattad | N/A | Code |
| Aligning Logits Generatively for Principled Black-Box Knowledge Distillation | Jing Ma · Xiang Xiang · Ke Wang · Yuchuan Wu · Yongbin Li | N/A | Code |
| Semantically-Shifted Incremental Adapter-Tuning is A Continual ViTransformer | Yuwen Tan · Qinhao Zhou · Xiang Xiang · Ke Wang · Yuchuan Wu · Yongbin Li | N/A | Code |
| EgoGen: An Egocentric Synthetic Data Generator | Gen Li · Kaifeng Zhao · Siwei Zhang · Xiaozhong Lyu · Mihai Dusmanu · Yan Zhang · Marc Pollefeys · Siyu Tang | N/A | Code |
| Physics-guided Shape-from-Template: Monocular Video Perception through Neural Surrogate Models | David Stotko · Nils Wandel · Reinhard Klein | N/A | Code |
| M&M VTO: Multi-Garment Virtual Try-On and Editing | Luyang Zhu · Yingwei Li · Nan Liu · Hao Peng · Dawei Yang · Ira Kemelmacher-Shlizerman | N/A | Code |
| VSRD: Instance-Aware Volumetric Silhouette Rendering for Weakly Supervised 3D Object Detection | Zihua Liu · Hiroki Sakuma · Masatoshi Okutomi | N/A | Code |
| Permutation Equivariance of Transformers and Its Applications | Hengyuan Xu · Liyao Xiang · Hangyu Ye · Dixi Yao · Pengzhi Chu · Baochun Li | N/A | Code |
| Unified Language-driven Zero-shot Domain Adaptation | Senqiao Yang · Zhuotao Tian · Li Jiang · Jiaya Jia | N/A | Code |
| SubT-MRS Dataset: Pushing SLAM Towards All-weather Environments | Shibo Zhao · Yuanjun Gao · Tianhao Wu · Damanpreet Singh · Rushan Jiang · Haoxiang Sun · Mansi Sarawata · Warren Whittaker · Ian Higgins · Shaoshu Su · Yi Du · Can Xu · John Keller · Jay Karhade · Lucas Nogueira · Sourojit Saha · Yuheng Qiu · Ji Zhang · Wenshan Wang · Chen Wang · Sebastian Scherer | N/A | Code |
| Parameter Efficient Self-Supervised Geospatial Domain Adaptation | Linus Scheibenreif · Michael Mommert · Damian Borth | N/A | Code |
| SpecNeRF: Gaussian Directional Encoding for Specular Reflections | Li Ma · Vasu Agrawal · Haithem Turki · Changil Kim · Chen Gao · Pedro V. Sander · Michael Zollhoefer · Christian Richardt | N/A | Code |
| ALGM: Adaptive Local-then-Global Token Merging for Efficient Semantic Segmentation with Plain Vision Transformers | Narges Norouzi · Svetlana Orlova · Daan de Geus · Gijs Dubbelman | N/A | Code |
| Scaling Laws of Synthetic Images for Model Training ... for Now | Lijie Fan · Kaifeng Chen · Dilip Krishnan · Dina Katabi · Phillip Isola · Yonglong Tian | N/A | Code |
| SHViT: Single-Head Vision Transformer with Memory Efficient Macro Design | Seokju Yun · Youngmin Ro | N/A | Code |
| DemoCaricature: Democratising Caricature Generation with a Rough Sketch | Dar-Yen Chen · Ayan Kumar Bhunia · Subhadeep Koley · Aneeshan Sain · Pinaki Nath Chowdhury · Yi-Zhe Song | N/A | Code |
| GenZI: Zero-Shot 3D Human-Scene Interaction Generation | Lei Li · Angela Dai | N/A | Code |
| 360+x: A Panoptic Multi-modal Scene Understanding Dataset | Hao Chen · Yuqi Hou · Chenyuan Qu · Irene Testini · Xiaohan Hong · Jianbo Jiao | N/A | Code |
| EFHQ: Multi-purpose ExtremePose-Face-HQ dataset | Trung Dao · Duc H Vu · Cuong Pham · Anh Tran | N/A | Code |
| Producing and Leveraging Online Map Uncertainty in Trajectory Prediction | Xunjiang Gu · Guanyu Song · Igor Gilitschenski · Marco Pavone · Boris Ivanovic | N/A | Code |
| FaceLift: Semi-supervised 3D Facial Landmark Localization | David Ferman · Pablo Garrido · Gaurav Bharaj | N/A | Code |
| TokenCompose: Text-to-Image Diffusion with Token-level Supervision | Zirui Wang · Zhizhou Sha · Zheng Ding · Yilin Wang · Zhuowen Tu | N/A | Code |
| MICap: A Unified Model for Identity-Aware Movie Descriptions | Haran Raajesh · Naveen Reddy Desanur · Zeeshan Khan · Makarand Tapaswi | N/A | Code |
| Language-driven Object Fusion into Neural Radiance Fields with Pose-Conditioned Dataset Updates | Ka Chun Shum · Jaeyeon Kim · Binh-Son Hua · Thanh Nguyen · Sai-Kit Yeung | N/A | Code |
| FreeU: Free Lunch in Diffusion U-Net | Chenyang Si · Ziqi Huang · Yuming Jiang · Ziwei Liu | N/A | Code |
| Image Restoration by Denoising Diffusion Models with Iteratively Preconditioned Guidance | Tomer Garber · Tom Tirer | N/A | Code |
| Instance-aware Exploration-Verification-Exploitation for Instance ImageGoal Navigation | Xiaohan Lei · Min Wang · Wengang Zhou · Li Li · Houqiang Li | N/A | Code |
| Learning Vision from Models Rivals Learning Vision from Data | Yonglong Tian · Lijie Fan · Kaifeng Chen · Dina Katabi · Dilip Krishnan · Phillip Isola | N/A | Code |
| Learning SO(3)-Invariant Semantic Correspondence via Local Shape Transform | Chunghyun Park · Seungwook Kim · Jaesik Park · Minsu Cho | N/A | Code |
| RegionGPT: Towards Region Understanding Vision Language Model | Qiushan Guo · Shalini De Mello · Danny Yin · Wonmin Byeon · Ka Chun Cheung · Yizhou Yu · Ping Luo · Sifei Liu | N/A | Code |
| In-N-Out: Faithful 3D GAN Inversion with Volumetric Decomposition for Face Editing | Yiran Xu · Zhixin Shu · Cameron Smith · Seoung Wug Oh · Jia-Bin Huang | N/A | Code |
| Holo-Relighting: Controllable Volumetric Portrait Relighting from a Single Image | Yiqun Mei · Yu Zeng · He Zhang · Zhixin Shu · Xuaner Zhang · Sai Bi · Jianming Zhang · HyunJoon Jung · Vishal M. Patel | N/A | Code |
| Relightful Harmonization: Lighting-aware Portrait Background Replacement | Mengwei Ren · Wei Xiong · Jae Shin Yoon · Zhixin Shu · Jianming Zhang · HyunJoon Jung · Guido Gerig · He Zhang | N/A | Code |
| NICE: Neurogenesis Inspired Contextual Encoding for Replay-free Class Incremental Learning | Mustafa B Gurbuz · Jean Moorman · Constantine Dovrolis | N/A | Code |
| Taming the Tail in Class-Conditional GANs: Knowledge Sharing via Unconditional Training at Lower Resolutions | Saeed Khorram · Mingqi Jiang · Mohamad Shahbazi · Mohamad Hosein Danesh · Li Fuxin | N/A | Code |
| Matching 2D Images in 3D: Metric Relative Pose from Metric Correspondences | Axel Barroso-Laguna · Sowmya Munukutla · Victor Adrian Prisacariu · Eric Brachmann | N/A | Code |
| MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI | Xiang Yue · Yuansheng Ni · Kai Zhang · Tianyu Zheng · Ruoqi Liu · Ge Zhang · Samuel Stevens · Dongfu Jiang · Weiming Ren · Yuxuan Sun · Cong Wei · Botao Yu · Ruibin Yuan · Renliang Sun · Ming Yin · Boyuan Zheng · Zhenzhu Yang · Yibo Liu · Wenhao Huang · Huan Sun · Yu Su · Wenhu Chen | N/A | Code |
| DeiT-LT: Distillation Strikes Back for Vision Transformer Training on Long-Tailed Datasets | Harsh Rangwani · Pradipto Mondal · Mayank Mishra · Ashish Asokan · R. Venkatesh Babu | N/A | Code |
| MorpheuS: Neural Dynamic 360° Surface Reconstruction from Monocular RGB-D Video | Hengyi Wang · Jingwen Wang · Lourdes Agapito | N/A | Code |
| FocusMAE: Gallbladder Cancer Detection from Ultrasound Videos with Focused Masked Autoencoders | Soumen Basu · Mayuna Gupta · Chetan Madan · Pankaj Gupta · Chetan Arora | N/A | Code |
| CSTA: CNN-based Spatiotemporal Attention for Video Summarization | Jaewon Son · Jaehun Park · Kwangsu Kim | N/A | Code |
| Atom-Level Optical Chemical Structure Recognition with Limited Supervision | Martijn Oldenhof · Edward De Brouwer · Adam Arany · Yves Moreau | N/A | Code |
| Separating the "Chirp" from the "Chat": Self-supervised Visual Grounding of Sound and Language | Mark Hamilton · Andrew Zisserman · John Hershey · William Freeman | N/A | Code |
| WateRF: Robust Watermarks in Radiance Fields for Protection of Copyrights | Youngdong Jang · Dong In Lee · MinHyuk Jang · Jong Wook Kim · Feng Yang · Sangpil Kim | N/A | Code |
| GaussianDreamer: Fast Generation from Text to 3D Gaussians by Bridging 2D and 3D Diffusion Models | Taoran Yi · Jiemin Fang · Junjie Wang · Guanjun Wu · Lingxi Xie · Xiaopeng Zhang · Wenyu Liu · Qi Tian · Xinggang Wang | N/A | Code |
| Active Generalized Category Discovery | Shijie Ma · Fei Zhu · Zhun Zhong · Xu-Yao Zhang · Cheng-Lin Liu | N/A | Code |
| Commonsense Prototype for Outdoor Unsupervised 3D Object Detection | Hai Wu · Shijia Zhao · Xun Huang · Chenglu Wen · Xin Li · Cheng Wang | N/A | Code |
| LPSNet: End-to-End Human Pose and Shape Estimation with Lensless Imaging | Haoyang Ge · Qiao Feng · Hailong Jia · Xiongzheng Li · Xiangjun Yin · You Zhou · Jingyu Yang · Kun Li | N/A | Code |
| ManiFPT: Defining and Analyzing Fingerprints of Generative Models | Hae Jin Song · Mahyar Khayatkhoei · Wael AbdAlmageed | N/A | Code |
| StyleCineGAN: Landscape Cinemagraph Generation using a Pre-trained StyleGAN | Jongwoo Choi · Kwanggyoon Seo · Amirsaman Ashtari · Junyong Noh | N/A | Code |
| MS-MANO: Enabling Hand Pose Tracking with Biomechanical Constraints | Pengfei Xie · Wenqiang Xu · Tutian Tang · Zhenjun Yu · Cewu Lu | N/A | Code |
| CGI-DM: Digital Copyright Authentication for Diffusion Models via Contrasting Gradient Inversion | Xiaoyu Wu · Yang Hua · Chumeng Liang · Jiaru Zhang · Hao Wang · Tao Song · Haibing Guan | N/A | Code |
| Temporally Consistent Unbalanced Optimal Transport for Unsupervised Action Segmentation | Ming Xu · Stephen Gould | N/A | Code |
| Robust Depth Enhancement via Polarization Prompt Fusion Tuning | Kei Ikemura · Yiming Huang · Felix Heide · Zhaoxiang Zhang · Qifeng Chen · Chenyang Lei | N/A | Code |
| Learning Large-Factor EM Image Super-Resolution with Generative Priors | Jiateng Shou · Zeyu Xiao · Shiyu Deng · Wei Huang · Shi Peiyao · Ruobing Zhang · Zhiwei Xiong · Feng Wu | N/A | Code |
| GAvatar: Animatable 3D Gaussian Avatars with Implicit Mesh Learning | Ye Yuan · Xueting Li · Yangyi Huang · Shalini De Mello · Koki Nagano · Jan Kautz · Umar Iqbal | N/A | Code |
| Finding Lottery Tickets in Vision Models via Data-driven Spectral Foresight Pruning | Leonardo Iurada · Marco Ciccone · Tatiana Tommasi | N/A | Code |
| ScoreHypo: Probabilistic Human Mesh Estimation with Hypothesis Scoring | Yuan Xu · Xiaoxuan Ma · Jiajun Su · Wentao Zhu · Yu Qiao · Yizhou Wang | N/A | Code |
| Paint-it: Text-to-Texture Synthesis via Deep Convolutional Texture Map Optimization and Physically-Based Rendering | Kim Youwang · Tae-Hyun Oh · Gerard Pons-Moll | N/A | Code |
| A Theory of Joint Light and Heat Transport for Lambertian Scenes | Mani Ramanagopal · Sriram Narayanan · Aswin C. Sankaranarayanan · Srinivasa G. Narasimhan | N/A | Code |
| Separate and Conquer: Decoupling Co-occurrence via Decomposition and Representation for Weakly Supervised Semantic Segmentation | Zhiwei Yang · Kexue Fu · Minghong Duan · Linhao Qu · Shuo Wang · Zhijian Song | N/A | Code |
| Telling Left from Right: Identifying Geometry-Aware Semantic Correspondence | Junyi Zhang · Charles Herrmann · Junhwa Hur · Eric Chen · Varun Jampani · Deqing Sun · Ming-Hsuan Yang | N/A | Code |
| Amodal Ground Truth and Completion in the Wild | Guanqi Zhan · Chuanxia Zheng · Weidi Xie · Andrew Zisserman | N/A | Code |
| Novel Class Discovery for Ultra-Fine-Grained Visual Categorization | Qi Jia · Yaqi Cai · Qi Jia · Binglin Qiu · Weimin Wang · Nan Pu | N/A | Code |
| Intelligent Grimm - Open-ended Visual Storytelling via Latent Diffusion Models | Chang Liu · Haoning Wu · Yujie Zhong · Xiaoyun Zhang · Yanfeng Wang · Weidi Xie | N/A | Code |
| Real-Time Exposure Correction via Collaborative Transformations and Adaptive Sampling | Ziwen Li · Feng Zhang · Meng Cao · Jinpu Zhang · Yuanjie Shao · Yuehuan Wang · Nong Sang | N/A | Code |
| Gaussian Splatting SLAM | Hidenobu Matsuki · Riku Murai · Paul Kelly · Andrew J. Davison | N/A | Code |
| NeLF-Pro: Neural Light Field Probes for Multi-Scale Novel View Synthesis | Zinuo You · Andreas Geiger · Anpei Chen | N/A | Code |
| A Simple Baseline for Efficient Hand Mesh Reconstruction | Zhishan Zhou · Shihao Zhou · Zhi Lv · Minqiang Zou · Yao Tang · Jiajun Liang | N/A | Code |
| OpenEQA: Embodied Question Answering in the Era of Foundation Models | Arjun Majumdar · Anurag Ajay · Xiaohan Zhang · Sriram Yenamandra · Mikael Henaff · Alexander Sax · Sneha Silwal · Paul McVay · Oleksandr Maksymets · Sergio Arnaud · Pranav Putta · Karmesh Yadav · Qiyang Li · Benjamin Newman · Mohit Sharma · Vincent-Pierre Berges · Shiqi Zhang · Pulkit Agrawal · Dhruv Batra · Yonatan Bisk · Mrinal Kalakrishnan · Franziska Meier · Chris Paxton · Aravind Rajeswaran | N/A | Code |
| Dynamic Policy-Driven Adaptive Multi-Instance Learning for Whole Slide Image Classification | Tingting Zheng · Kui Jiang · Hongxun Yao | N/A | Code |
| Privacy-Preserving Optics for Enhancing Protection in Face De-Identification | Jhon Lopez · Carlos Hinojosa · Henry Arguello · Bernard Ghanem | N/A | Code |
| Smooth Diffusion: Crafting Smooth Latent Spaces in Diffusion Models | Jiayi Guo · Xingqian Xu · Yifan Pu · Zanlin Ni · Chaofei Wang · Manushree Vasu · Shiji Song · Gao Huang · Humphrey Shi | N/A | Code |
| Editable Scene Simulation for Autonomous Driving via Collaborative LLM-Agents | Yuxi Wei · Zi Wang · Yifan Lu · Chenxin Xu · Changxing Liu · Hao Zhao · Siheng Chen · Yanfeng Wang | N/A | Code |
| Dual Memory Networks: A Versatile Adaptation Approach for Vision-Language Models | Yabin Zhang · Wenjie Zhu · Hui Tang · Zhiyuan Ma · Kaiyang Zhou · Lei Zhang | N/A | Code |
| PAIR Diffusion: A Comprehensive Multimodal Object-Level Image Editor | Vidit Goel · Elia Peruzzo · Yifan Jiang · Dejia Xu · Xingqian Xu · Nicu Sebe · Trevor Darrell · Zhangyang Wang · Humphrey Shi | N/A | Code |
| Learning Continuous 3D Words for Text-to-Image Generation | Ta-Ying Cheng · Matheus Gadelha · Thibault Groueix · Matthew Fisher · Radomir Mech · Andrew Markham · Niki Trigoni | N/A | Code |
| Content-Adaptive Non-Local Convolution for Remote Sensing Pansharpening | Yule Duan · Xiao Wu · Haoyu Deng · Liang-Jian Deng | N/A | Code |
| NeuRAD: Neural Rendering for Autonomous Driving | Adam Tonderski · Carl Lindström · Georg Hess · William Ljungbergh · Lennart Svensson · Christoffer Petersson | N/A | Code |
| Efficient Solution of Point-Line Absolute Pose | Petr Hruby · Timothy Duff · Marc Pollefeys | N/A | Code |
| V*: Guided Visual Search as a Core Mechanism in Multimodal LLMs | Penghao Wu · Saining Xie | N/A | Code |
| CAMixerSR: Only Details Need More "Attention" | Yan Wang · Yi Liu · Shijie Zhao · Junlin Li · Li Zhang | N/A | Code |
| MM-Narrator: Narrating Long-form Videos with Multimodal In-Context Learning | Chaoyi Zhang · Kevin Lin · Zhengyuan Yang · Jianfeng Wang · Linjie Li · Chung-Ching Lin · Zicheng Liu · Lijuan Wang | N/A | Code |
| Gradient Reweighting: Towards Imbalanced Class-Incremental Learning | Jiangpeng He | N/A | Code |
| Visual Prompting for Generalized Few-shot Segmentation: A Multi-scale Approach | Mir Rayat Imtiaz Hossain · Mennatullah Siam · Leonid Sigal · Jim Little | N/A | Code |
| Face2Diffusion for Fast and Editable Face Personalization | Kaede Shiohara · Toshihiko Yamasaki | N/A | Code |
| HIG: Hierarchical Interlacement Graph Approach to Scene Graph Generation in Video Understanding | Trong-Thuan Nguyen · Pha Nguyen · Khoa Luu | N/A | Code |
| GS-SLAM: Dense Visual SLAM with 3D Gaussian Splatting | Chi Yan · Delin Qu · Dong Wang · Dan Xu · Zhigang Wang · Bin Zhao · Xuelong Li | N/A | Code |
| Active Object Detection with Knowledge Aggregation and Distillation from Large Models | Dejie Yang · Yang Liu | N/A | Code |
| ShapeWalk: Compositional Shape Editing Through Language-Guided Chains | Habib Slim · Mohamed Elhoseiny | N/A | Code |
| Score-Guided Diffusion for 3D Human Recovery | Anastasis Stathopoulos · Ligong Han · Dimitris N. Metaxas | N/A | Code |
| MRC-Net: 6-DoF Pose Estimation with MultiScale Residual Correlation | Yuelong Li · Yafei Mao · Raja Bala · Sunil Hadap | N/A | Code |
| RTMO: Towards High-Performance One-Stage Real-Time Multi-Person Pose Estimation | Peng Lu · Tao Jiang · Yining Li · Xiangtai Li · Kai Chen · Wenming Yang | N/A | Code |
| SCULPT: Shape-Conditioned Unpaired Learning of Pose-dependent Clothed and Textured Human Meshes | Soubhik Sanyal · Partha Ghosh · Jinlong Yang · Michael J. Black · Justus Thies · Timo Bolkart | N/A | Code |
| Training-Free Pretrained Model Merging | Zhengqi Xu · Ke Yuan · Huiqiong Wang · Yong Wang · Mingli Song · Jie Song | N/A | Code |
| Open3DIS: Open-Vocabulary 3D Instance Segmentation with 2D Mask Guidance | Phuc Nguyen · Tuan Duc Ngo · Evangelos Kalogerakis · Chuang Gan · Anh Tran · Cuong Pham · Khoi Nguyen | N/A | Code |
| Faces that Speak: Jointly Synthesising Talking Face and Speech from Text | Youngjoon Jang · Jihoon Kim · Junseok Ahn · Doyeop Kwak · Hongsun Yang · Yooncheol Ju · Ilhwan Kim · Byeong-Yeol Kim · Joon Chung | N/A | Code |
| EGTR: Extracting Graph from Transformer for Scene Graph Generation | Jinbae Im · JeongYeon Nam · Nokyung Park · Hyungmin Lee · Seunghyun Park | N/A | Code |
| Continual Forgetting for Pre-trained Vision Models | Hongbo Zhao · Bolin Ni · Junsong Fan · Yuxi Wang · Yuntao Chen · Gaofeng Meng · Zhaoxiang Zhang | N/A | Code |
| Distributionally Generative Augmentation for Fair Facial Attribute Classification | Fengda Zhang · Qianpei He · Kun Kuang · Jiashuo Liu · Long Chen · Chao Wu · Jun Xiao · Hanwang Zhang | N/A | Code |
| Boosting Neural Representations for Videos with a Conditional Decoder | Xinjie Zhang · Ren Yang · Dailan He · Xingtong Ge · Tongda Xu · Yan Wang · Hongwei Qin · Jun Zhang | N/A | Code |
| IReNe: Instant Recoloring of Neural Radiance Fields | Alessio Mazzucchelli · Adrian Garcia-Garcia · Elena Garces · Fernando Rivas-Manzaneque · Francesc Moreno-Noguer · Adrian Penate-Sanchez | N/A | Code |
| Fair Federated Learning under Domain Skew with Local Consistency and Domain Diversity | Yuhang Chen · Wenke Huang · Mang Ye | N/A | Code |
| Improving Physics-Augmented Continuum Neural Radiance Field-Based Geometry-Agnostic System Identification with Lagrangian Particle Optimization | Takuhiro Kaneko | N/A | Code |
| FastMAC: Stochastic Spectral Sampling of Correspondence Graph | Yifei Zhang · Hao Zhao · Hongyang Li · Siheng Chen | N/A | Code |
| Learning the 3D Fauna of the Web | Zizhang Li · Dor Litvak · Ruining Li · Yunzhi Zhang · Tomas Jakab · Christian Rupprecht · Shangzhe Wu · Andrea Vedaldi · Jiajun Wu | N/A | Code |
| Zero-TPrune: Zero-Shot Token Pruning through Leveraging of the Attention Graph in Pre-Trained Transformers | Hongjie Wang · Bhishma Dedhia · Niraj Jha | N/A | Code |
| URHand: Universal Relightable Hands | Zhaoxi Chen · Gyeongsik Moon · Kaiwen Guo · Chen Cao · Stanislav Pidhorskyi · Tomas Simon · Rohan Joshi · Yuan Dong · Yichen Xu · Bernardo Pires · He Wen · Lucas Evans · Bo Peng · Julia Buffalini · Autumn Trimble · Kevyn McPhail · Melissa Schoeller · Shoou-I Yu · Javier Romero · Michael Zollhoefer · Yaser Sheikh · Ziwei Liu · Shunsuke Saito | N/A | Code |
| Improved Zero-Shot Classification by Adapting VLMs with Text Descriptions | Oindrila Saha · Grant Horn · Subhransu Maji | N/A | Code |
| SIGNeRF: Scene Integrated Generation for Neural Radiance Fields | Jan-Niklas Dihlmann · Andreas Engelhardt · Hendrik Lensch | N/A | Code |
| Synergistic Global-space Camera and Human Reconstruction from Videos | Yizhou Zhao · Tuanfeng Y. Wang · Bhiksha Raj · Min Xu · Jimei Yang · Chun-Hao P. Huang | N/A | Code |
| Neural Implicit Morphing of Face Images | Guilherme Schardong · Tiago Novello · Hallison Paz · Iurii Medvedev · Vinícius Silva · Luiz Velho · Nuno Gonçalves | N/A | Code |
| Towards Generalizing to Unseen Domains with Few Labels | Chamuditha Jayanga Galappaththige · Sanoojan Baliah · Malitha Gunawardhana · Muhammad Haris Khan | N/A | Code |
| Improving Semantic Correspondence with Viewpoint-Guided Spherical Maps | Octave Mariotti · Oisin Mac Aodha · Hakan Bilen | N/A | Code |
| GLID: Pre-training a Generalist Encoder-Decoder Vision Model | Jihao Liu · Jinliang Zheng · Yu Liu · Hongsheng Li | N/A | Code |
| Control4D: Efficient 4D Portrait Editing with Text | Ruizhi Shao · Jingxiang Sun · Cheng Peng · Zerong Zheng · Boyao Zhou · Hongwen Zhang · Yebin Liu | N/A | Code |
| Osprey: Pixel Understanding with Visual Instruction Tuning | Yuqian Yuan · Wentong Li · Jian Liu · Dongqi Tang · Xinjie Luo · Chi Qin · Lei Zhang · Jianke Zhu | N/A | Code |
| NeRFCodec: Neural Feature Compression Meets Neural Radiance Fields for Memory-Efficient Scene Representation | Sicheng Li · Hao Li · Yiyi Liao · Lu Yu | N/A | Code |
| Contrastive Denoising Score for Text-guided Latent Diffusion Image Editing | Hyelin Nam · Gihyun Kwon · Geon Yeong Park · Jong Chul Ye | N/A | Code |
| Spacetime Gaussian Feature Splatting for Real-Time Dynamic View Synthesis | Zhan Li · Zhang Chen · Zhong Li · Yi Xu | N/A | Code |
| LOTUS: Evasive and Resilient Backdoor Attacks through Sub-Partitioning | Siyuan Cheng · Guanhong Tao · Yingqi Liu · Guangyu Shen · Shengwei An · Shiwei Feng · Xiangzhe Xu · Kaiyuan Zhang · Shiqing Ma · Xiangyu Zhang | N/A | Code |
| Long-Tailed Anomaly Detection with Learnable Class Names | Chih-Hui Ho · Kuan-Chuan Peng · Nuno Vasconcelos | N/A | Code |
| PlatoNeRF: 3D Reconstruction in Plato's Cave via Single-View Two-Bounce Lidar | Tzofi Klinghoffer · Xiaoyu Xiang · Siddharth Somasundaram · Yuchen Fan · Christian Richardt · Ramesh Raskar · Rakesh Ranjan | N/A | Code |
| Lodge: A Coarse to Fine Diffusion Network for Long Dance Generation Guided by the Characteristic Dance Primitives | Ronghui Li · Yuxiang Zhang · Yachao Zhang · Hongwen Zhang · Jie Guo · Yan Zhang · Yebin Liu · Xiu Li | N/A | Code |
| Estimating Noisy Class Posterior with Part-level Labels for Noisy Label Learning | Rui Zhao · Bin Shi · Jianfei Ruan · Tianze Pan · Bo Dong | N/A | Code |
| DPHMs: Diffusion Parametric Head Models for Depth-based Tracking | Jiapeng Tang · Angela Dai · Yinyu Nie · Lev Markhasin · Justus Thies · Matthias Nießner | N/A | Code |
| CNC-Net: Self-Supervised Learning for CNC Machining Operations | Mohsen Yavartanoo · Sangmin Hong · Reyhaneh Neshatavar · Kyoung Mu Lee | N/A | Code |
| MaxQ: Multi-Axis Query for N:M Sparsity Network | Jingyang Xiang · Siqi Li · Junhao Chen · Zhuangzhi Chen · Tianxin Huang · Linpeng Peng · Yong Liu | N/A | Code |
| High-Quality Facial Geometry and Appearance Capture at Home | Yuxuan Han · Junfeng Lyu · Feng Xu | N/A | Code |
| Text-Guided Variational Image Generation for Industrial Anomaly Detection and Segmentation | Mingyu Lee · Jongwon Choi | N/A | Code |
| Tackling the Singularities at the Endpoints of Time Intervals in Diffusion Models | Pengze Zhang · Hubery Yin · Chen Li · Xiaohua Xie | N/A | Code |
| Q-Instruct: Improving Low-level Visual Abilities for Multi-modality Foundation Models | Haoning Wu · Zicheng Zhang · Erli Zhang · Chaofeng Chen · Liang Liao · Annan Wang · Kaixin Xu · Chunyi Li · Jingwen Hou · Guangtao Zhai · Xue Geng · Wenxiu Sun · Qiong Yan · Weisi Lin | N/A | Code |
| Portrait4D: Learning One-Shot 4D Head Avatar Synthesis using Synthetic Data | Yu Deng · Duomin Wang · Xiaohang Ren · Xingyu Chen · Baoyuan Wang | N/A | Code |
| Insect-Foundation: A Foundation Model and Large-scale 1M Dataset for Visual Insect Understanding | Hoang-Quan Nguyen · Thanh-Dat Truong · Xuan-Bac Nguyen · Ashley Dowling · Xin Li · Khoa Luu | N/A | Code |
| Forgery-aware Adaptive Transformer for Generalizable Synthetic Image Detection | Huan Liu · Zichang Tan · Chuangchuang Tan · Yunchao Wei · Jingdong Wang · Yao Zhao | N/A | Code |
| MeaCap: Memory-Augmented Zero-shot Image Captioning | Zequn Zeng · Yan Xie · Hao Zhang · Chiyu Chen · Zhengjue Wang · Bo Chen | N/A | Code |
| Retrieval-Augmented Layout Transformer for Content-Aware Layout Generation | Daichi Horita · Naoto Inoue · Kotaro Kikuchi · Kota Yamaguchi · Kiyoharu Aizawa | N/A | Code |
| Novel View Synthesis with View-Dependent Effects from a Single Image | Juan Luis Gonzalez Bello · Munchurl Kim | N/A | Code |
| Wired Perspectives: Multi-View Wire Art Embraces Generative AI | Zhiyu Qu · Lan Yang · Honggang Zhang · Tao Xiang · Kaiyue Pang · Yi-Zhe Song | N/A | Code |
| UltrAvatar: A Realistic Animatable 3D Avatar Diffusion Model with Authenticity Guided Textures | Mingyuan Zhou · Rakib Hyder · Ziwei Xuan · Guo-Jun Qi | N/A | Code |
| Transfer CLIP for Generalizable Image Denoising | Jun Cheng · Dong Liang · Shan Tan | N/A | Code |
| Alchemist: Parametric Control of Material Properties with Diffusion Models | Prafull Sharma · Varun Jampani · Yuanzhen Li · Xuhui Jia · Dmitry Lagun · Fredo Durand · William Freeman · Mark Matthews | N/A | Code |
| FLHetBench: Benchmarking Device and State Heterogeneity in Federated Learning | Junyuan Zhang · Shuang Zeng · Miao Zhang · Runxi Wang · Feifei Wang · Yuyin Zhou · Paul Pu Liang · Liangqiong Qu | N/A | Code |
| Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding | Peng Jin · Ryuichi Takanobu · Cai Zhang · Xiaochun Cao · Li Yuan | N/A | Code |
| Named Entity Driven Zero-Shot Image Manipulation | Zhida Feng · Li Chen · Jing Tian · Jiaxiang Liu · Shikun Feng | N/A | Code |
| Non-Rigid Structure-from-Motion: Temporally-Smooth Procrustean Alignment and Spatially-Variant Deformation Modeling | Jiawei Shi · Hui Deng · Yuchao Dai | N/A | Code |
| LAMP: Learn A Motion Pattern for Few-Shot Video Generation | Rui-Qi Wu · Liangyu Chen · Tong Yang · Chun-Le Guo · Chongyi Li · Xiangyu Zhang | N/A | Code |
| iKUN: Speak to Trackers without Retraining | Yunhao Du · Cheng Lei · Zhicheng Zhao · Fei Su | N/A | Code |
| SocialCircle: Learning the Angle-based Social Interaction Representation for Pedestrian Trajectory Prediction | Conghao Wong · Beihao Xia · Ziqian Zou · Yulong Wang · Xinge You | N/A | Code |
| Lookahead Exploration with Neural Radiance Representation for Continuous Vision-Language Navigation | Zihan Wang · Xiangyang Li · Jiahao Yang · Yeqi Liu · Junjie Hu · Ming Jiang · Shuqiang Jiang | N/A | Code |
| Compressed 3D Gaussian Splatting for Accelerated Novel View Synthesis | Simon Niedermayr · Josef Stumpfegger · Rüdiger Westermann | N/A | Code |
| SED: A Simple Encoder-Decoder for Open-Vocabulary Semantic Segmentation | Bin Xie · Jiale Cao · Jin Xie · Fahad Shahbaz Khan · Yanwei Pang | N/A | Code |
| Motion Blur Decomposition with Cross-shutter Guidance | Xiang Ji · Haiyang Jiang · Yinqiang Zheng | N/A | Code |
| TRIP: Temporal Residual Learning with Image Noise Prior for Image-to-Video Diffusion Models | Zhongwei Zhang · Fuchen Long · Yingwei Pan · Zhaofan Qiu · Ting Yao · Yang Cao · Tao Mei | N/A | Code |
| Time-Efficient Light-Field Acquisition Using Coded Aperture and Events | Shuji Habuchi · Keita Takahashi · Chihiro Tsutake · Toshiaki Fujii · Hajime Nagahara | N/A | Code |
| Pixel-Aligned Language Model | Jiarui Xu · Xingyi Zhou · Shen Yan · Xiuye Gu · Anurag Arnab · Chen Sun · Xiaolong Wang · Cordelia Schmid | N/A | Code |
| Efficient Deformable ConvNets: Rethinking Dynamic and Sparse Operator for Vision Applications | Yuwen Xiong · Zhiqi Li · Yuntao Chen · Feng Wang · Xizhou Zhu · Jiapeng Luo · Wenhai Wang · Tong Lu · Hongsheng Li · Yu Qiao · Lewei Lu · Jie Zhou · Jifeng Dai | N/A | Code |
| ASAM: Boosting Segment Anything Model with Adversarial Tuning | Bo Li · Haoke Xiao · Lv Tang | N/A | Code |
| ConvoFusion: Multi-Modal Conversational Diffusion for Co-Speech Gesture Synthesis | Muhammad Hamza Mughal · Rishabh Dabral · Ikhsanul Habibie · Lucia Donatelli · Marc Habermann · Christian Theobalt | N/A | Code |
| ViewDiff: 3D-Consistent Image Generation with Text-to-Image Models | Lukas Höllein · Aljaž Božič · Norman Müller · David Novotny · Hung-Yu Tseng · Christian Richardt · Michael Zollhoefer · Matthias Nießner | N/A | Code |
| Exploiting Diffusion Prior for Generalizable Dense Prediction | Hsin-Ying Lee · Hung-Yu Tseng · Hsin-Ying Lee · Ming-Hsuan Yang | N/A | Code |
| GSVA: Generalized Segmentation via Multimodal Large Language Models | Zhuofan Xia · Dongchen Han · Yizeng Han · Xuran Pan · Shiji Song · Gao Huang | N/A | Code |
| Coherent Temporal Synthesis for Incremental Action Segmentation | Guodong Ding · Hans Golong · Angela Yao | N/A | Code |
| Person in Place: Generating Associative Skeleton-Guidance Maps for Human-Object Interaction Image Editing | ChangHee Yang · ChanHee Kang · Kyeongbo Kong · Hanni Oh · Suk-Ju Kang | N/A | Code |
| Depth-aware Test-Time Training for Zero-shot Video Object Segmentation | Weihuang Liu · Xi Shen · Haolun Li · Xiuli Bi · Bo Liu · Chi-Man Pun · Xiaodong Cun | N/A | Code |
| Real-time 3D-aware Portrait Video Relighting | Ziqi Cai · Kaiwen Jiang · Shu-Yu Chen · Yu-Kun Lai · Hongbo Fu · Boxin Shi · Lin Gao | N/A | Code |
| Towards Real-World HDR Video Reconstruction: A Large-Scale Benchmark Dataset and A Two-Stage Alignment Network | Yong Shu · Liquan Shen · Xiangyu Hu · Mengyao Li · Zihao Zhou | N/A | Code |
| DIRECT-3D: Learning Direct Text-to-3D Generation on Massive Noisy 3D Data | Qihao Liu · Yi Zhang · Song Bai · Adam Kortylewski · Alan L. Yuille | N/A | Code |
| VOODOO 3D: Volumetric Portrait Disentanglement For One-Shot 3D Head Reenactment | Phong Tran · Egor Zakharov · Long Nhat Ho · Anh Tran · Liwen Hu · Hao Li | N/A | Code |
| NAPGuard: Towards Detecting Naturalistic Adversarial Patches | Siyang Wu · Jiakai Wang · Jiejie Zhao · Yazhe Wang · Xianglong Liu | N/A | Code |
| Descriptor and Word Soups: Overcoming the Parameter Efficiency Accuracy Tradeoff for Out-of-Distribution Few-shot Learning | Christopher Liao · Theodoros Tsiligkaridis · Brian Kulis | N/A | Code |
| Bootstrapping SparseFormers from Vision Foundation Models | Ziteng Gao · Zhan Tong · Kevin Qinghong Lin · Joya Chen · Mike Zheng Shou | N/A | Code |
| NARUTO: Neural Active Reconstruction from Uncertain Target Observations | Ziyue Feng · Huangying Zhan · Zheng Chen · Qingan Yan · Xiangyu Xu · Changjiang Cai · Bing Li · Qilun Zhu · Yi Xu | N/A | Code |
| Exploring Vision Transformers for 3D Human Motion-Language Models with Motion Patches | Qing Yu · Mikihiro Tanaka · Kent Fujiwara | N/A | Code |
| Text-Enhanced Data-free Approach for Federated Class-Incremental Learning | Minh-Tuan Tran · Trung Le · Xuan-May Le · Mehrtash Harandi · Dinh Phung | N/A | Code |
| G-NeRF: Geometry-enhanced Novel View Synthesis from Single-View Images | Zixiong Huang · Qi Chen · Libo Sun · Yifan Yang · Naizhou Wang · Qi Wu · Mingkui Tan | N/A | Code |
| MoCha-Stereo: Motif Channel Attention Network for Stereo Matching | Ziyang Chen · Wei Long · He Yao · Yongjun Zhang · Bingshu Wang · Yongbin Qin · Jia Wu | N/A | Code |
| GLACE: Global Local Accelerated Coordinate Encoding | Fangjinhua Wang · Xudong Jiang · Silvano Galliani · Christoph Vogel · Marc Pollefeys | N/A | Code |
| HouseCat6D - A Large-Scale Multi-Modal Category Level 6D Object Perception Dataset with Household Objects in Realistic Scenarios | HyunJun Jung · Shun-Cheng Wu · Patrick Ruhkamp · Guangyao Zhai · Hannah Schieber · Giulia Rizzoli · Pengyuan Wang · Hongcheng Zhao · Lorenzo Garattoni · Sven Meier · Daniel Roth · Nassir Navab · Benjamin Busam | N/A | Code |
| MULAN: A Multi Layer Annotated Dataset for Controllable Text-to-Image Generation | Petru-Daniel Tudosiu · Yongxin Yang · Shifeng Zhang · Fei Chen · Steven McDonagh · Gerasimos Lampouras · Ignacio Iacobacci · Sarah Parisot | N/A | Code |
| VidToMe: Video Token Merging for Zero-Shot Video Editing | Xirui Li · Chao Ma · Xiaokang Yang · Ming-Hsuan Yang | N/A | Code |
| Text-Image Alignment for Diffusion-Based Perception | Neehar Kondapaneni · Markus Marks · Manuel Knott · Rogério Guimarães · Pietro Perona | N/A | Code |
| RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback | Tianyu Yu · Yuan Yao · Haoye Zhang · Taiwen He · Yifeng Han · Ganqu Cui · Jinyi Hu · Zhiyuan Liu · Hai-Tao Zheng · Maosong Sun | N/A | Code |
| Neural 3D Strokes: Creating Stylized 3D Scenes with Vectorized 3D Strokes | Haobin Duan · Miao Wang · Yanxun Li · Yong-Liang Yang | N/A | Code |
| Generating Handwritten Mathematical Expressions From Symbol Graphs: An End-to-End Pipeline | Yu Chen · Fei Gao · Yanguang Zhang · Maoying Qiao · Nannan Wang | N/A | Code |
| On the Robustness of Language Guidance for Low-Level Vision Tasks: Findings from Depth Estimation | Agneet Chatterjee · Tejas Gokhale · Chitta Baral · 'YZ' Yezhou Yang | N/A | Code |
| SingularTrajectory: Universal Trajectory Predictor Using Diffusion Model | Inhwan Bae · Young-Jae Park · Hae-Gon Jeon | N/A | Code |
| Mitigating Motion Blur in Neural Radiance Fields with Events and Frames | Marco Cannici · Davide Scaramuzza | N/A | Code |
| Back to 3D: Few-Shot 3D Keypoint Detection with Back-Projected 2D Features | Thomas Wimmer · Peter Wonka · Maks Ovsjanikov | N/A | Code |
| Alpha-CLIP: A CLIP Model Focusing on Wherever You Want | Zeyi Sun · Ye Fang · Tong Wu · Pan Zhang · Yuhang Zang · Shu Kong · Yuanjun Xiong · Dahua Lin · Jiaqi Wang | N/A | Code |
| DemoFusion: Democratising High-Resolution Image Generation With No $$$ | Ruoyi Du · Dongliang Chang · Timothy Hospedales · Yi-Zhe Song · Zhanyu Ma | N/A | Code |
| Dr.Hair: Reconstructing Scalp-Connected Hair Strands without Pre-Training via Differentiable Rendering of Line Segments | Yusuke Takimoto · Hikari Takehara · Hiroyuki Sato · Zihao Zhu · Bo Zheng | N/A | Code |
| SketchINR: A First Look into Sketches as Implicit Neural Representations | Hmrishav Bandyopadhyay · Ayan Kumar Bhunia · Pinaki Nath Chowdhury · Aneeshan Sain · Tao Xiang · Timothy Hospedales · Yi-Zhe Song | N/A | Code |
| Driving into the Future: Multiview Visual Forecasting and Planning with World Model for Autonomous Driving | Yuqi Wang · Jiawei He · Lue Fan · Hongxin Li · Yuntao Chen · Zhaoxiang Zhang | N/A | Code |
| MULDE: Multiscale Log-Density Estimation via Denoising Score Matching for Video Anomaly Detection | Jakub Micorek · Horst Possegger · Dominik Narnhofer · Horst Bischof · Mateusz Kozinski | N/A | Code |
| Instance-level Expert Knowledge and Aggregate Discriminative Attention for Radiology Report Generation | Shenshen Bu · Taiji Li · Zhiming Dai · Yuedong Yang | N/A | Code |
| The Manga Whisperer: Automatically Generating Transcriptions for Comics | Ragav Sachdeva · Andrew Zisserman | N/A | Code |
| Unsigned Orthogonal Distance Fields: An Accurate Neural Implicit Representation for Diverse 3D Shapes | YuJie Lu · Long Wan · Nayu Ding · Yulong Wang · Shuhan Shen · Shen Cai · Lin Gao | N/A | Code |
| From-Ground-To-Objects: Coarse-to-Fine Self-supervised Monocular Depth Estimation of Dynamic Objects with Ground Contact Prior | Jaeho Moon · Juan Luis Gonzalez Bello · Byeongjun Kwon · Munchurl Kim | N/A | Code |
| Deep-TROJ: An Inference Stage Trojan Insertion Algorithm through Efficient Weight Replacement Attack | Sabbir Ahmed · Ranyang Zhou · Shaahin Angizi · Adnan Rakin Rakin | N/A | Code |
| Mask Grounding for Referring Image Segmentation | Yong Xien Chng · Henry Zheng · Yizeng Han · Xuchong Qiu · Gao Huang | N/A | Code |
| SignGraph: A Sign Sequence is Worth Graphs of Nodes | Shiwei Gan · Yafeng Yin · Zhiwei Jiang · Hongkai Wen · Lei Xie · Sanglu Lu | N/A | Code |
| Just Add π! Pose Induced Video Transformers for Understanding Activities of Daily Living | Dominick Reilly · Srijan Das | N/A | Code |
| StyLitGAN: Image-Based Relighting via Latent Control | Anand Bhattad · James Soole · David Forsyth | N/A | Code |
| DGC-GNN: Leveraging Geometry and Color Cues for Visual Descriptor-Free 2D-3D Matching | Shuzhe Wang · Juho Kannala · Daniel Barath | N/A | Code |
| Decompose-and-Compose: A Compositional Approach to Mitigating Spurious Correlation | Fahimeh Hosseini Noohdani · Parsa Hosseini · Aryan Yazdan Parast · Hamidreza Araghi · Mahdieh Baghshah | N/A | Code |
| TI2V-Zero: Zero-Shot Image Conditioning for Text-to-Video Diffusion Models | Haomiao Ni · Bernhard Egger · Suhas Lohit · Anoop Cherian · Ye Wang · Toshiaki Koike-Akino · Sharon X. Huang · Tim Marks | N/A | Code |
| PhysGaussian: Physics-Integrated 3D Gaussians for Generative Dynamics | Tianyi Xie · Zeshun Zong · Yuxing Qiu · Xuan Li · Yutao Feng · Yin Yang · Chenfanfu Jiang | N/A | Code |
| A Backpack Full of Skills: Egocentric Video Understanding with Diverse Task Perspectives | Simone Alberto Peirone · Francesca Pistilli · Antonio Alliegro · Giuseppe Averta | N/A | Code |
| VTQA: Visual Text Question Answering via Entity Alignment and Cross-Media Reasoning | Kang Chen · Xiangqian Wu | N/A | Code |
| Compact 3D Gaussian Representation for Radiance Field | Joo Chan Lee · Daniel Rho · Xiangyu Sun · Jong Hwan Ko · Eunbyung Park | N/A | Code |
| Space-Time Diffusion Features for Zero-Shot Text-Driven Motion Transfer | Rafail Fridman · Danah Yatim · Omer Bar-Tal · Yoni Kasten · Tali Dekel | N/A | Code |
| FutureHuman3D: Forecasting Complex Long-Term 3D Human Behavior from Video Observations | Christian Diller · Thomas Funkhouser · Angela Dai | N/A | Code |
| TFMQ-DM: Temporal Feature Maintenance Quantization for Diffusion Models | Yushi Huang · Ruihao Gong · Jing Liu · Tianlong Chen · Xianglong Liu | N/A | Code |
| GaussianAvatars: Photorealistic Head Avatars with Rigged 3D Gaussians | Shenhan Qian · Tobias Kirschstein · Liam Schoneveld · Davide Davoli · Simon Giebenhain · Matthias Nießner | N/A | Code |
| Animatable Gaussians: Learning Pose-dependent Gaussian Maps for High-fidelity Human Avatar Modeling | Zhe Li · Zerong Zheng · Lizhen Wang · Yebin Liu | N/A | Code |
| Generative Rendering: Controllable 4D-Guided Video Generation with 2D Diffusion Models | Shengqu Cai · Duygu Ceylan · Matheus Gadelha · Chun-Hao P. Huang · Tuanfeng Y. Wang · Gordon Wetzstein | N/A | Code |
| Improving Out-of-Distribution Generalization in Graphs via Hierarchical Semantic Environments | Yinhua Piao · Sangseon Lee · Yijingxiu Lu · Sun Kim | N/A | Code |
| Make Me a BNN: A Simple Strategy for Estimating Bayesian Uncertainty from Pre-trained Models | Gianni Franchi · Olivier Laurent · Maxence Leguéry · Andrei Bursuc · Andrea Pilzer · Angela Yao | N/A | Code |
| ZePT: Zero-Shot Pan-Tumor Segmentation via Query-Disentangling and Self-Prompting | Yankai Jiang · Zhongzhen Huang · Rongzhao Zhang · Xiaofan Zhang · Shaoting Zhang | N/A | Code |
| CADTalk: An Algorithm and Benchmark for Semantic Commenting of CAD Programs | Haocheng Yuan · Jing Xu · Hao Pan · Adrien Bousseau · Niloy J. Mitra · Changjian Li | N/A | Code |
| Structure-Aware Sparse-View X-ray 3D Reconstruction | Yuanhao Cai · Jiahao Wang · Alan L. Yuille · Zongwei Zhou · Angtian Wang | N/A | Code |
| LangSplat: 3D Language Gaussian Splatting | Minghan Qin · Wanhua Li · Jiawei Zhou · Haoqian Wang · Hanspeter Pfister | N/A | Code |
| Bidirectional Multi-Scale Implicit Neural Representations for Image Deraining | Xiang Chen · Jinshan Pan · Jiangxin Dong | N/A | Code |
| SceneTex: High-Quality Texture Synthesis for Indoor Scenes via Diffusion Priors | Dave Zhenyu Chen · Haoxuan Li · Hsin-Ying Lee · Sergey Tulyakov · Matthias Nießner | N/A | Code |
| MonoCD: Monocular 3D Object Detection with Complementary Depths | Longfei Yan · Pei Yan · Shengzhou Xiong · Xuanyu Xiang · Yihua Tan | N/A | Code |
| JeDi: Joint-Image Diffusion Models for Finetuning-Free Personalized Text-to-Image Generation | Yu Zeng · Vishal M. Patel · Haochen Wang · Xun Huang · Ting-Chun Wang · Ming-Yu Liu · Yogesh Balaji | N/A | Code |
| G3DR: Generative 3D Reconstruction in ImageNet | Pradyumna Reddy · Ismail Elezi · Jiankang Deng | N/A | Code |
| SuGaR: Surface-Aligned Gaussian Splatting for Efficient 3D Mesh Reconstruction and High-Quality Mesh Rendering | Antoine Guédon · Vincent Lepetit | N/A | Code |
| Zero-Reference Low-Light Enhancement via Physical Quadruple Priors | Wenjing Wang · Huan Yang · Jianlong Fu · Jiaying Liu | N/A | Code |
| DiffusionPoser: Real-time Human Motion Reconstruction From Arbitrary Sparse Sensors Using Autoregressive Diffusion | Tom Van Wouwe · Seunghwan Lee · Antoine Falisse · Scott Delp · Karen Liu | N/A | Code |
| MULTIFLOW: Shifting Towards Task-Agnostic Vision-Language Pruning | Matteo Farina · Massimiliano Mancini · Elia Cunegatti · Gaowen Liu · Giovanni Iacca · Elisa Ricci | N/A | Code |
| Physical 3D Adversarial Attacks against Monocular Depth Estimation in Autonomous Driving | Junhao Zheng · Chenhao Lin · Jiahao Sun · Zhengyu Zhao · Qian Li · Chao Shen | N/A | Code |
| SeaBird: Segmentation in Bird’s View with Dice Loss Improves Monocular 3D Detection of Large Objects | Abhinav Kumar · Yuliang Guo · Xinyu Huang · Liu Ren · Xiaoming Liu | N/A | Code |
| Move as You Say, Interact as You Can: Language-guided Human Motion Generation with Scene Affordance | Zan Wang · Yixin Chen · Baoxiong Jia · Puhao Li · Jinlu Zhang · Jingze Zhang · Tengyu Liu · Yixin Zhu · Wei Liang · Siyuan Huang | N/A | Code |
| Exploring Regional Clues in CLIP for Zero-Shot Semantic Segmentation | Yi Zhang · Meng-Hao Guo · Miao Wang · Shi-Min Hu | N/A | Code |
| Single Mesh Diffusion Models with Field Latents for Texture Generation | Thomas W. Mitchel · Carlos Esteves · Ameesh Makadia | N/A | Code |
| PEM: Prototype-based Efficient MaskFormer for Image Segmentation | Niccolò Cavagnero · Gabriele Rosi · Claudia Cuttano · Francesca Pistilli · Marco Ciccone · Giuseppe Averta · Fabio Cermelli | N/A | Code |
| VILA: On Pre-training for Visual Language Models | Ji Lin · Danny Yin · Wei Ping · Pavlo Molchanov · Mohammad Shoeybi · Song Han | N/A | Code |
| Hearing Anything Anywhere | Mason Wang · Ryosuke Sawata · Samuel Clarke · Ruohan Gao · Shangzhe Wu · Jiajun Wu | N/A | Code |
| Diffusion-EDFs: Bi-equivariant Denoising Generative Modeling on SE(3) for Visual Robotic Manipulation | Hyunwoo Ryu · Jiwoo Kim · Hyunseok An · Junwoo Chang · Joohwan Seo · Taehan Kim · Yubin Kim · Chaewon Hwang · Jongeun Choi · Roberto Horowitz | N/A | Code |
| BOTH2Hands: Inferring 3D Hands from Both Text Prompts and Body Dynamics | Wenqian Zhang · Molin Huang · Yuxuan Zhou · Juze Zhang · Jingyi Yu · Jingya Wang · Lan Xu | N/A | Code |
| A Closer Look at the Few-Shot Adaptation of Large Vision-Language Models | Julio Silva-Rodríguez · Sina Hajimiri · Ismail Ben Ayed · Jose Dolz | N/A | Code |
| A Noisy Elephant in the Room: Is Your Out-of-Distribution Detector Robust to Label Noise? | Galadrielle Humblot-Renaux · Sergio Escalera · Thomas B. Moeslund | N/A | Code |
| DriveTrack: A Benchmark for Long-Range Point Tracking in Real-World Videos | Arjun Balasingam · Joseph Chandler · Chenning Li · Zhoutong Zhang · Hari Balakrishnan | N/A | Code |
| Arbitrary-Scale Image Generation and Upsampling using Latent Diffusion Model and Implicit Neural Decoder | Jinseok Kim · Tae-Kyun Kim | N/A | Code |
| Implicit Event-RGBD Neural SLAM | Delin Qu · Chi Yan · Dong Wang · Jie Yin · Qizhi Chen · Dan Xu · Yiting Zhang · Bin Zhao · Xuelong Li | N/A | Code |
| Three Pillars Improving Vision Foundation Model Distillation for Lidar | Gilles Puy · Spyros Gidaris · Alexandre Boulch · Oriane Siméoni · Corentin Sautier · Patrick Pérez · Andrei Bursuc · Renaud Marlet | N/A | Code |
| DreamControl: Control-Based Text-to-3D Generation with 3D Self-Prior | Tianyu Huang · Yihan Zeng · Zhilu Zhang · Wan Xu · Hang Xu · Songcen Xu · Rynson W.H. Lau · Wangmeng Zuo | N/A | Code |
| Seamless Human Motion Composition with Blended Positional Encodings | German Barquero · Sergio Escalera · Cristina Palmero | N/A | Code |
| Single Domain Generalization for Crowd Counting | Zhuoxuan Peng · S.-H. Gary Chan | N/A | Code |
| Collaborative Learning of Anomalies with Privacy (CLAP) for Unsupervised Video Anomaly Detection: A New Baseline | Anas Al-lahham · Muhammad Zaigham Zaheer · Nurbek Tastan · Karthik Nandakumar | N/A | Code |
| RepKPU: Point Cloud Upsampling with Kernel Point Representation and Deformation | Yi Rong · Haoran Zhou · Kang Xia · Cheng Mei · Jiahao Wang · Tong Lu | N/A | Code |
| InteractDiffusion: Interaction Control in Text-to-Image Diffusion Models | Jiun Tian Hoe · Xudong Jiang · Chee Seng Chan · Yap-peng Tan · Weipeng Hu | N/A | Code |
| MAP: MAsk-Pruning for Source-Free Model Intellectual Property Protection | Boyang Peng · Sanqing Qu · Yong Wu · Tianpei Zou · Lianghua He · Alois Knoll · Guang Chen · Changjun Jiang | N/A | Code |
| 4K4D: Real-Time 4D View Synthesis at 4K Resolution | Zhen Xu · Sida Peng · Haotong Lin · Guangzhao He · Jiaming Sun · Yujun Shen · Hujun Bao · Xiaowei Zhou | N/A | Code |
| Dynamic Adapter Meets Prompt Tuning: Parameter-Efficient Transfer Learning for Point Cloud Analysis | Xin Zhou · Dingkang Liang · Wei Xu · Xingkui Zhu · Yihan Xu · Zhikang Zou · Xiang Bai | N/A | Code |
| DiffSHEG: A Diffusion-Based Approach for Real-Time Speech-driven Holistic 3D Expression and Gesture Generation | Junming Chen · Yunfei Liu · Jianan Wang · Ailing Zeng · Yu Li · Qifeng Chen | N/A | Code |
| EMCAD: Efficient Multi-scale Convolutional Attention Decoding for Medical Image Segmentation | Md Mostafijur Rahman · Mustafa Munir · Radu Marculescu | N/A | Code |
| TokenHMR: Advancing Human Mesh Recovery with a Tokenized Pose Representation | Sai Kumar Dwivedi · Yu Sun · Priyanka Patel · Yao Feng · Michael J. Black | N/A | Code |
| On Exact Inversion of DPM-Solvers | Seongmin Hong · Kyeonghyun Lee · Suh Yoon Jeon · Hyewon Bae · Se Young Chun | N/A | Code |
| PhotoMaker: Customizing Realistic Human Photos via Stacked ID Embedding | Zhen Li · Mingdeng Cao · Xintao Wang · Zhongang Qi · Ming-Ming Cheng · Ying Shan | N/A | Code |
| Privacy-Preserving Face Recognition Using Trainable Feature Subtraction | Yuxi Mi · Zhizhou Zhong · Yuge Huang · Jiazhen Ji · Jianqing Xu · Jun Wang · ShaoMing Wang · Shouhong Ding · Shuigeng Zhou | N/A | Code |
| GROUNDHOG: Grounding Large Language Models to Holistic Segmentation | Yichi Zhang · Ziqiao Ma · Xiaofeng Gao · Suhaila Shakiah · Qiaozi Gao · Joyce Chai | N/A | Code |
| Improving Spectral Snapshot Reconstruction with Spectral-Spatial Rectification | Jiancheng Zhang · Haijin Zeng · Yongyong Chen · Dengxiu Yu · Yinping Zhao | N/A | Code |
| D3T: Distinctive Dual-Domain Teacher Zigzagging Across RGB-Thermal Gap for Domain-Adaptive Object Detection | Dinh Phat Do · Taehoon Kim · Jaemin Na · Jiwon Kim · Keonho Lee · Kyunghwan Cho · Wonjun Hwang | N/A | Code |
| Logit Standardization in Knowledge Distillation | Shangquan Sun · Wenqi Ren · Jingzhi Li · Rui Wang · Xiaochun Cao | N/A | Code |
| Progressive Divide-and-Conquer via Subsampling Decomposition for Accelerated MRI | Chong Wang · Lanqing Guo · Yufei Wang · Hao Cheng · Yi Yu · Bihan Wen | N/A | Code |
| MAGICK: A Large-scale Captioned Dataset from Matting Generated Images using Chroma Keying | Ryan Burgert · Brian Price · Jason Kuen · Yijun Li · Michael Ryoo | N/A | Code |
| Intrinsic Image Diffusion for Indoor Single-view Material Estimation | Peter Kocsis · Vincent Sitzmann · Matthias Nießner | N/A | Code |
| RealNet: A Feature Selection Network with Realistic Synthetic Anomaly for Anomaly Detection | Ximiao Zhang · Min Xu · Xiuzhuang Zhou | N/A | Code |
| Prompt Highlighter: Interactive Control for Multi-Modal LLMs | Yuechen Zhang · Shengju Qian · Bohao Peng · Shu Liu · Jiaya Jia | N/A | Code |
| One Prompt Word is Enough to Boost Adversarial Robustness for Pre-trained Vision-Language Models | Lin Li · Haoyan Guan · Jianing Qiu · Michael Spratling | N/A | Code |
| Arbitrary Motion Style Transfer with Multi-condition Motion Latent Diffusion Model | Wenfeng Song · Xingliang Jin · Shuai Li · Chenglizhao Chen · Aimin Hao · Xia Hou · Ning Li · Hong Qin | N/A | Code |
| 3DSFLabelling: Boosting 3D Scene Flow Estimation by Pseudo Auto-labelling | Chaokang Jiang · Guangming Wang · Jiuming Liu · Hesheng Wang · Zhuang Ma · Zhenqiang Liu · LIANG · Yi Shan · Dalong Du | N/A | Code |
| Video-Based Human Pose Regression via Decoupled Space-Time Aggregation | Jijie He · Wenwu Yang | N/A | Code |
| UVEB: A Large-scale Benchmark and Baseline Towards Real-World Underwater Video Enhancement | Yaofeng Xie · Lingwei Kong · Kai Chen · Zheng Ziqiang · Xiao Yu · Zhibin Yu · Bing Zheng | N/A | Code |
| A Generative Approach for Wikipedia-Scale Visual Entity Recognition | Mathilde Caron · Ahmet Iscen · Alireza Fathi · Cordelia Schmid | N/A | Code |
| PKU-DyMVHumans: A Multi-View Video Benchmark for High-Fidelity Dynamic Human Modeling | Xiaoyun Zheng · Liwei Liao · Xufeng Li · Jianbo Jiao · Rongjie Wang · Feng Gao · Shiqi Wang · Ronggang Wang | N/A | Code |
| DiffCast: A Unified Framework via Residual Diffusion for Precipitation Nowcasting | Demin Yu · Xutao Li · Yunming Ye · Baoquan Zhang · Luo Chuyao · Kuai Dai · wangrui · Chenxunlai | N/A | Code |
| Deep Single Image Camera Calibration by Heatmap Regression to Recover Fisheye Images Under Manhattan World Assumption | Nobuhiko Wakai · Satoshi Sato · Yasunori Ishii · Takayoshi Yamashita | N/A | Code |
| MeshGPT: Generating Triangle Meshes with Decoder-Only Transformers | Yawar Siddiqui · Antonio Alliegro · Alexey Artemov · Tatiana Tommasi · Daniele Sirigatti · Vladislav Rosov · Angela Dai · Matthias Nießner | N/A | Code |
| MatFuse: Controllable Material Generation with Diffusion Models | Giuseppe Vecchio · Renato Sortino · Simone Palazzo · Concetto Spampinato | N/A | Code |
| Restoration by Generation with Constrained Priors | Zheng Ding · Xuaner Zhang · Zhuowen Tu · Zhihao Xia | N/A | Code |
| Task-Conditioned Adaptation of Visual Features in Multi-Task Policy Learning | Pierre Marza · Laetitia Matignon · Olivier Simonin · Christian Wolf | N/A | Code |
| OmniSeg3D: Omniversal 3D Segmentation via Hierarchical Contrastive Learning | Haiyang Ying · Yixuan Yin · Jinzhi Zhang · Fan Wang · Tao Yu · Ruqi Huang · Lu Fang | N/A | Code |
| EditGuard: Versatile Image Watermarking for Tamper Localization and Copyright Protection | Xuanyu Zhang · Runyi Li · Jiwen Yu · Youmin Xu · Weiqi Li · Jian Zhang | N/A | Code |
| 3D-LFM: Lifting Foundation Model | Mosam Dabhi · László A. Jeni · Simon Lucey | N/A | Code |
| Localization Is All You Evaluate: Data Leakage in Online Mapping Datasets and How to Fix It | Adam Lilja · Junsheng Fu · Erik Stenborg · Lars Hammarstrand | N/A | Code |
| CLOVA: A Closed-LOop Visual Assistant with Tool Usage and Update | Zhi Gao · Yuntao Du · Xintong Zhang · Xiaojian Ma · Wenjuan Han · Song-Chun Zhu · Qing Li | N/A | Code |
| LASIL: Learner-Aware Supervised Imitation Learning For Long-term Microscopic Traffic Simulation | Ke Guo · Zhenwei Miao · Wei Jing · Weiwei Liu · Weizi Li · Dayang Hao · Jia Pan | N/A | Code |
| Unified Entropy Optimization for Open-Set Test-Time Adaptation | Zhengqing Gao · Xu-Yao Zhang · Cheng-Lin Liu | N/A | Code |
| One-dimensional Adapter to Rule Them All: Concepts, Diffusion Models and Erasing Applications | Mengyao Lyu · Yuhong Yang · Haiwen Hong · Hui Chen · Xuan Jin · Yuan He · Hui Xue · Jungong Han · Guiguang Ding | N/A | Code |
| Turb-Seg-Res: A Segment-then-Restore Pipeline for Dynamic Videos with Atmospheric Turbulence | Ripon Saha · Dehao Qin · Nianyi Li · Jinwei Ye · Suren Jayasuriya | N/A | Code |
| HybridNeRF: Efficient Neural Rendering via Adaptive Volumetric Surfaces | Haithem Turki · Vasu Agrawal · Samuel Rota Bulò · Lorenzo Porzi · Peter Kontschieder · Deva Ramanan · Michael Zollhoefer · Christian Richardt | N/A | Code |
| Single-Model and Any-Modality for Video Object Tracking | Zongwei Wu · Jilai Zheng · Xiangxuan Ren · Florin-Alexandru Vasluianu · Chao Ma · Danda Paudel · Luc Van Gool · Radu Timofte | N/A | Code |
| Frequency-Adaptive Dilated Convolution for Semantic Segmentation | Linwei Chen · Lin Gu · Dezhi Zheng · Ying Fu | N/A | Code |
| LSK3DNet: Towards Effective and Efficient 3D Perception with Large Sparse Kernels | Tuo Feng · Wenguan Wang · Fan Ma · Yi Yang | N/A | Code |
| MGMap: Mask-Guided Learning for Online Vectorized HD Map Construction | Xiaolu Liu · Song Wang · Wentong Li · Ruizi Yang · Junbo Chen · Jianke Zhu | N/A | Code |
| Style Aligned Image Generation via Shared Attention | Amir Hertz · Andrey Voynov · Shlomi Fruchter · Daniel Cohen-Or | N/A | Code |
| Steganographic Passport: An Owner and User Verifiable Credential for Deep Model IP Protection Without Retraining | Qi Cui · Ruohan Meng · Chaohui Xu · Chip Hong Chang | N/A | Code |
| TexTile: A Differentiable Metric for Texture Tileability | Carlos Rodriguez-Pardo · Dan Casas · Elena Garces · Jorge Lopez-Moreno | N/A | Code |
| MatSynth: A Modern PBR Materials Dataset | Giuseppe Vecchio · Valentin Deschaintre | N/A | Code |
| BIVDiff: A Training-Free Framework for General-Purpose Video Synthesis via Bridging Image and Video Diffusion Models | Fengyuan Shi · Jiaxi Gu · Hang Xu · Songcen Xu · Wei Zhang · Limin Wang | N/A | Code |
| ECoDepth: Effective Conditioning of Diffusion Models for Monocular Depth Estimation | Suraj Patni · Aradhye Agarwal · Chetan Arora | N/A | Code |
| Blur-aware Spatio-temporal Sparse Transformer for Video Deblurring | Huicong Zhang · Haozhe Xie · Hongxun Yao | N/A | Code |
| MTMMC: A Large-Scale Real-World Multi-Modal Camera Tracking Benchmark | Sanghyun Woo · Kwanyong Park · Inkyu Shin · Myungchul Kim · In So Kweon | N/A | Code |
| LTGC: Long-tail Recognition via Leveraging LLMs-driven Generated Content | Qihao Zhao · Yalun Dai · Hao Li · Wei Hu · Fan Zhang · Jun Liu | N/A | Code |
| How to Train Neural Field Representations: A Comprehensive Study and Benchmark | Samuele Papa · Riccardo Valperga · David Knigge · Miltiadis Kofinas · Phillip Lippe · Jan-Jakob Sonke · Efstratios Gavves | N/A | Code |
| Language-conditioned Detection Transformer | Jang Hyun Cho · Philipp Krähenbühl | N/A | Code |
| Riemannian Multinomial Logistics Regression for SPD Neural Networks | Ziheng Chen · Yue Song · Gaowen Liu · Ramana Kompella · Xiaojun Wu · Nicu Sebe | N/A | Code |
| Joint Reconstruction of 3D Human and Object via Contact-Based Refinement Transformer | Hyeongjin Nam · Daniel Jung · Gyeongsik Moon · Kyoung Mu Lee | N/A | Code |
| Digital Life Project: Autonomous 3D Characters with Social Intelligence | Zhongang Cai · Jianping Jiang · Zhongfei Qing · Xinying Guo · Mingyuan Zhang · Zhengyu Lin · Haiy Mei · Chen Wei · Wang Ruisi · Wanqi Yin · Liang Pan · Xiangyu Fan · Han Du · Peng Gao · Zhitao Yang · Yang Gao · Jiaqi Li · Tianxiang Ren · YuKun Wei · Xiaogang Wang · Chen Change Loy · Lei Yang · Ziwei Liu | N/A | Code |
| MonoHair: High-Fidelity Hair Modeling from a Monocular Video | Keyu Wu · Lingchen Yang · Zhiyi Kuang · Yao Feng · Xutao Han · Yuefan Shen · Hongbo Fu · Kun Zhou · Youyi Zheng | N/A | Code |
| Learned Scanpaths Aid Blind Panoramic Video Quality Assessment | Kanglong Fan · Wen Wen · Mu Li · Yifan Peng · Kede Ma | N/A | Code |
| Visual Programming for Zero-shot Open-Vocabulary 3D Visual Grounding | Zhihao Yuan · Jinke Ren · Chun-Mei Feng · Hengshuang Zhao · Shuguang Cui · Zhen Li | N/A | Code |
| SCoFT: Self-Contrastive Fine-Tuning for Equitable Image Generation | Zhixuan Liu · Peter Schaldenbrand · Beverley-Claire Okogwu · Wenxuan Peng · Youngsik Yun · Andrew Hundt · Jihie Kim · Jean Oh | N/A | Code |
| Smart Help: Strategic Opponent Modeling for Proactive and Adaptive Robot Assistance in Households | Zhihao Cao · ZiDong Wang · Siwen Xie · Anji Liu · Lifeng Fan | N/A | Code |
| CMA: A Chromaticity Map Adapter for Robust Detection of Screen-Recapture Document Images | Changsheng Chen · Liangwei Lin · Yongqi Chen · Bin Li · Jishen Zeng · Jiwu Huang | N/A | Code |
| MAS: Multi-view Ancestral Sampling for 3D Motion Generation Using 2D Diffusion | Roy Kapon · Guy Tevet · Daniel Cohen-Or · Amit H. Bermano | N/A | Code |
| PointInfinity: Resolution-Invariant Point Diffusion Models | Zixuan Huang · Justin Johnson · Shoubhik Debnath · James Rehg · Chao-Yuan Wu | N/A | Code |
| CoralSCOP: Segment any COral Image on this Planet | Zheng Ziqiang · Liang Haixin · Binh-Son Hua · Tim, Yue Him Wong · Put ANG · Apple CHUI · Sai-Kit Yeung | N/A | Code |
| GroupContrast: Semantic-aware Self-supervised Representation Learning for 3D Understanding | Chengyao Wang · Li Jiang · Xiaoyang Wu · Zhuotao Tian · Bohao Peng · Hengshuang Zhao · Jiaya Jia | N/A | Code |
| SplaTAM: Splat Track & Map 3D Gaussians for Dense RGB-D SLAM | Nikhil Keetha · Jay Karhade · Krishna Murthy Jatavallabhula · Gengshan Yang · Sebastian Scherer · Deva Ramanan · Jonathon Luiten | N/A | Code |
| Passive Snapshot Coded Aperture Dual-Pixel RGB-D Imaging | Bhargav Ghanekar · Salman Siddique Khan · Pranav Sharma · Shreyas Singh · Vivek Boominathan · Kaushik Mitra · Ashok Veeraraghavan | N/A | Code |
| ADA-Track: End-to-End Multi-Camera 3D Multi-Object Tracking with Alternating Detection and Association | Shuxiao Ding · Lukas Schneider · Marius Cordts · Jürgen Gall | N/A | Code |
| Open Vocabulary Semantic Scene Sketch Understanding | Ahmed Bourouis · Judith Fan · Yulia Gryaditskaya | N/A | Code |
| Self-Distilled Masked Auto-Encoders are Efficient Video Anomaly Detectors | Nicolae Ristea · Florinel Croitoru · Radu Tudor Ionescu · Marius Popescu · Fahad Shahbaz Khan · Mubarak Shah | N/A | Code |
| UniPAD: A Universal Pre-training Paradigm for Autonomous Driving | Honghui Yang · Sha Zhang · Di Huang · Xiaoyang Wu · Haoyi Zhu · Tong He · Shixiang Tang · Hengshuang Zhao · Qibo Qiu · Binbin Lin · Xiaofei He · Wanli Ouyang | N/A | Code |
| You Only Need Less Attention at Each Stage in Vision Transformers | Shuoxi Zhang · Hanpeng Liu · Stephen Lin · Kun He | N/A | Code |
| Rethinking the Up-Sampling Operations in CNN-based Generative Network for Generalizable Deepfake Detection | Chuangchuang Tan · Huan Liu · Yao Zhao · Shikui Wei · Guanghua Gu · Ping Liu · Yunchao Wei | N/A | Code |
| MonoNPHM: Dynamic Head Reconstruction from Monocular Videos | Simon Giebenhain · Tobias Kirschstein · Markos Georgopoulos · Martin Rünz · Lourdes Agapito · Matthias Nießner | N/A | Code |
| Discriminative Probing and Tuning for Text-to-Image Generation | Leigang Qu · Wenjie Wang · Yongqi Li · Hanwang Zhang · Liqiang Nie · Tat-seng Chua | N/A | Code |
| Comparing the Decision-Making Mechanisms by Transformers and CNNs via Explanation Methods | Mingqi Jiang · Saeed Khorram · Li Fuxin | N/A | Code |
| HiFi4G: High-Fidelity Human Performance Rendering via Compact Gaussian Splatting | Yuheng Jiang · Zhehao Shen · Penghao Wang · Zhuo Su · Yu Hong · Yingliang Zhang · Jingyi Yu · Lan Xu | N/A | Code |
| Image Sculpting: Precise Object Editing with 3D Geometry Control | Jiraphon Yenphraphai · Xichen Pan · Sainan Liu · Daniele Panozzo · Saining Xie | N/A | Code |
| Doubly Abductive Counterfactual Inference for Text-based Image Editing | Xue Song · Jiequan Cui · Hanwang Zhang · Jingjing Chen · Richang Hong · Yu-Gang Jiang | N/A | Code |
| Scaling Diffusion Models to Real-World 3D LiDAR Scene Completion | Lucas Nunes · Rodrigo Marcuzzi · Benedikt Mersch · Jens Behley · Cyrill Stachniss | N/A | Code |
| UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory | Haiwen Diao · Bo Wan · Ying Zhang · Xu Jia · Huchuan Lu · Long Chen | N/A | Code |
| EscherNet: A Generative Model for Scalable View Synthesis | Xin Kong · Shikun Liu · Xiaoyang Lyu · Marwan Taher · Xiaojuan Qi · Andrew J. Davison | N/A | Code |
| MVCPS-NeuS: Multi-view Constrained Photometric Stereo for Neural Surface Reconstruction | Hiroaki Santo · Fumio Okura · Yasuyuki Matsushita | N/A | Code |
| Towards Memorization-Free Diffusion Models | Chen Chen · Daochang Liu · Chang Xu | N/A | Code |
| Semantics Distortion and Style Matter: Towards Source-free UDA for Panoramic Segmentation | Xu Zheng · Pengyuan Zhou · ATHANASIOS · Addison, Lin Wang | N/A | Code |
| PaSCo: Urban 3D Panoptic Scene Completion with Uncertainty Awareness | Anh-Quan Cao · Angela Dai · Raoul de Charette | N/A | Code |
| ParameterNet: Parameters Are All You Need for Large-scale Visual Pretraining of Mobile Networks | Kai Han · Yunhe Wang · Jianyuan Guo · Enhua Wu | N/A | Code |
| AV-RIR: Audio-Visual Room Impulse Response Estimation | Anton Ratnarajah · Sreyan Ghosh · Sonal Kumar · Purva Chiniya · Dinesh Manocha | N/A | Code |
| RELI11D: A Comprehensive Multimodal Human Motion Dataset and Method | Ming Yan · Yan Zhang · Shuqiang Cai · Shuqi Fan · Xincheng Lin · Yudi Dai · Siqi Shen · Chenglu Wen · Lan Xu · Yuexin Ma · Cheng Wang | N/A | Code |
| Entangled View-Epipolar Information Aggregation for Generalizable Neural Radiance Fields | Zhiyuan Min · Yawei Luo · Wei Yang · Yuesong Wang · Yi Yang | N/A | Code |
| OHTA: One-shot Hand Avatar via Data-driven Implicit Priors | Xiaozheng Zheng · Chao Wen · Zhuo Su · Zeran Xu · Zhaohu Li · Yang Zhao · Zhou Xue | N/A | Code |
| Instance Tracking in 3D Scenes from Egocentric Videos | Yunhan Zhao · Haoyu Ma · Shu Kong · Charless Fowlkes | N/A | Code |
| ID-Blau: Image Deblurring by Implicit Diffusion-based reBLurring AUgmentation | Jia-Hao Wu · Fu-Jen Tsai · Yan-Tsung Peng · Charles Tsai · Chia-Wen Lin · Yen-Yu Lin | N/A | Code |
| Adversarial Score Distillation: When score distillation meets GAN | Min Wei · Jingkai Zhou · Junyao Sun · Xuesong Zhang | N/A | Code |
| Magic Tokens: Select Diverse Tokens for Multi-modal Object Re-Identification | Pingping Zhang · Yuhao Wang · Yang Liu · Zhengzheng Tu · Huchuan Lu | N/A | Code |
| DMR: Decomposed Multi-Modality Representations for Frames and Events Fusion in Visual Reinforcement Learning | Haoran Xu · Peixi Peng · Guang Tan · Yuan Li · Xinhai Xu · Yonghong Tian | N/A | Code |
| QUADify: Extracting Meshes with Pixel-level Details and Materials from Images | Maximilian Frühauf · Hayko Riemenschneider · Markus Gross · Christopher Schroers | N/A | Code |
| FedHCA2: Towards Hetero-Client Federated Multi-Task Learning | Yuxiang Lu · Suizhi Huang · Yuwen Yang · Shalayiding Sirejiding · Yue Ding · Hongtao Lu | N/A | Code |
| DiverGen: Improving Instance Segmentation by Learning Wider Data Distribution with More Diverse Generative Data | Chengxiang Fan · Muzhi Zhu · Hao Chen · Yang Liu · Weijia Wu · Huaqi Zhang · Chunhua Shen | N/A | Code |
| SurMo: Surface-based 4D Motion Modeling for Dynamic Human Rendering | Tao Hu · Fangzhou Hong · Ziwei Liu | N/A | Code |
| LeftRefill: Filling Right Canvas based on Left Reference through Generalized Text-to-Image Diffusion Model | Chenjie Cao · Yunuo Cai · Qiaole Dong · Yikai Wang · Yanwei Fu | N/A | Code |
| Data Poisoning based Backdoor Attacks to Contrastive Learning | Jinghuai Zhang · Hongbin Liu · Jinyuan Jia · Neil Zhenqiang Gong | N/A | Code |
| DART: Implicit Doppler Tomography for Radar Novel View Synthesis | Tianshu Huang · John Miller · Akarsh Prabhakara · Tao Jin · Tarana Laroia · Zico Kolter · Anthony Rowe | N/A | Code |
| RoHM: Robust Human Motion Reconstruction via Diffusion | Siwei Zhang · Bharat Lal Bhatnagar · Yuanlu Xu · Alexander Winkler · Petr Kadlecek · Siyu Tang · Federica Bogo | N/A | Code |
| ManipLLM: Embodied Multimodal Large Language Model for Object-Centric Robotic Manipulation | Xiaoqi Li · Mingxu Zhang · Yiran Geng · Haoran Geng · Yuxing Long · Yan Shen · Renrui Zhang · Jiaming Liu · Hao Dong | N/A | Code |
| Video Interpolation with Diffusion Models | Siddhant Jain · Daniel Watson · Aleksander Holynski · Eric Tabellion · Ben Poole · Janne Kontkanen | N/A | Code |
| DNGaussian: Optimizing Sparse-View 3D Gaussian Radiance Fields with Global-Local Depth Normalization | Jiahe Li · Jiawei Zhang · Xiao Bai · Jin Zheng · Xin Ning · Jun Zhou · Lin Gu | N/A | Code |
| DiffHuman: Probabilistic Photorealistic 3D Reconstruction of Humans | Akash Sengupta · Thiemo Alldieck · Nikos Kolotouros · Enric Corona · Andrei Zanfir · Cristian Sminchisescu | N/A | Code |
| SportsHHI: A Dataset for Human-Human Interaction Detection in Sports Videos | Tao Wu · Runyu He · Gangshan Wu · Limin Wang | N/A | Code |
| Global and Local Prompts Cooperation via Optimal Transport for Federated Learning | Hongxia Li · Wei Huang · Jingya Wang · Ye Shi | N/A | Code |
| Dense Optical Tracking: Connecting the Dots | Guillaume Le Moing · Jean Ponce · Cordelia Schmid | N/A | Code |
| ImageNet-D: Benchmarking Neural Network Robustness on Diffusion Synthetic Object | Chenshuang Zhang · Fei Pan · Junmo Kim · In So Kweon · Chengzhi Mao | N/A | Code |
| Neural Markov Random Field for Stereo Matching | Tongfan Guan · Chen Wang · Yun-Hui Liu | N/A | Code |
| BlockGCN: Redefine Topology Awareness for Skeleton-Based Action Recognition | Yuxuan Zhou · Xudong Yan · Zhi-Qi Cheng · Yan Yan · Qi Dai · Xian-Sheng Hua | N/A | Code |
| Language-only Training of Zero-shot Composed Image Retrieval | Geonmo Gu · Sanghyuk Chun · Wonjae Kim · Yoohoon Kang · Sangdoo Yun | N/A | Code |
| Unifying Correspondence, Pose and NeRF for Generalized Pose-Free Novel View Synthesis | Sunghwan Hong · Jaewoo Jung · Heeseong Shin · Jiaolong Yang · Chong Luo · Seungryong Kim | N/A | Code |
| Person-in-WiFi 3D: End-to-End Multi-Person 3D Pose Estimation with Wi-Fi | Kangwei Yan · Fei Wang · Bo Qian · Han Ding · Jinsong Han · Xing Wei | N/A | Code |
| MemFlow: Optical Flow Estimation and Prediction with Memory | Qiaole Dong · Yanwei Fu | N/A | Code |
| A Unified and Interpretable Emotion Representation and Expression Generation | Reni Paskaleva · Mykyta Holubakha · Andela Ilic · Saman Motamed · Luc Van Gool · Danda Paudel | N/A | Code |
| SPAD: Spatially Aware Multi-View Diffusers | Yash Kant · Aliaksandr Siarohin · Ziyi Wu · Michael Vasilkovsky · Guocheng Qian · Jian Ren · Riza Alp Guler · Bernard Ghanem · Sergey Tulyakov · Igor Gilitschenski | N/A | Code |
| Instruct-Imagen: Image Generation with Multi-modal Instruction | Hexiang Hu · Kelvin C.K. Chan · Yu-Chuan Su · Wenhu Chen · Yandong Li · Kihyuk Sohn · Yang Zhao · Xue Ben · William Cohen · Ming-Wei Chang · Xuhui Jia | N/A | Code |
| DiffSal: Joint Audio and Video Learning for Diffusion Saliency Prediction | Junwen Xiong · Peng Zhang · Tao You · Chuanyue Li · Wei Huang · Yufei Zha | N/A | Code |
| VP3D: Unleashing 2D Visual Prompt for Text-to-3D Generation | Yang Chen · Yingwei Pan · Haibo Yang · Ting Yao · Tao Mei | N/A | Code |
| Style Blind Domain Generalized Semantic Segmentation via Covariance Alignment and Semantic Consistence Contrastive Learning | Woo-Jin Ahn · Geun-Yeong Yang · Hyunduck Choi · Myo-Taeg Lim | N/A | Code |
| In2SET: Intra-Inter Similarity Exploiting Transformer for Dual-Camera Compressive Hyperspectral Imaging | Xin Wang · Lizhi Wang · Xiangtian Ma · Maoqing Zhang · Lin Zhu · Hua Huang | N/A | Code |
| Perception-Oriented Video Frame Interpolation via Asymmetric Blending | Guangyang Wu · Xin Tao · Changlin Li · Wenyi Wang · Xiaohong Liu · Qingqing Zheng | N/A | Code |
| Cooperation Does Matter: Exploring Multi-Order Bilateral Relations for Audio-Visual Segmentation | Qi Yang · Xing Nie · Tong Li · Gaopengfei · Ying Guo · Cheng Zhen · Pengfei Yan · Shiming Xiang | N/A | Code |
| Monocular Identity-Conditioned Facial Reflectance Reconstruction | Xingyu Ren · Jiankang Deng · Yuhao Cheng · Jia Guo · Chao Ma · Yichao Yan · Wenhan Zhu · Xiaokang Yang | N/A | Code |
| Holodeck: Language Guided Generation of 3D Embodied AI Environments | Yue Yang · Fan-Yun Sun · Luca Weihs · Eli VanderBilt · Alvaro Herrasti · Winson Han · Jiajun Wu · Nick Haber · Ranjay Krishna · Lingjie Liu · Chris Callison-Burch · Mark Yatskar · Aniruddha Kembhavi · Christopher Clark | N/A | Code |
| Unleashing Network Potentials for Semantic Scene Completion | Fengyun Wang · Qianru Sun · Dong Zhang · Jinhui Tang | N/A | Code |
| Drag Your Noise: Interactive Point-based Editing via Diffusion Semantic Propagation | Haofeng Liu · Chenshu Xu · Yifei Yang · Lihua Zeng · Shengfeng He | N/A | Code |
| AdaRevD: Adaptive Patch Exiting Reversible Decoder Pushes the Limit of Image Deblurring | Xintian Mao · Xiwen Gao · Yan Wang | N/A | Code |
| Fully Geometric Panoramic Localization | Junho Kim · Jiwon Jeong · Young Min Kim | N/A | Code |
| BiTT: Bi-directional Texture Reconstruction of Interacting Two Hands from a Single Image | Minje Kim · Tae-Kyun Kim | N/A | Code |
| DiffAssemble: A Unified Graph-Diffusion Model for 2D and 3D Reassembly | Gianluca Scarpellini · Stefano Fiorini · Francesco Giuliari · Pietro Morerio · Alessio Del Bue | N/A | Code |
| MS-DETR: Efficient DETR Training with Mixed Supervision | Chuyang Zhao · Yifan Sun · Wenhao Wang · Qiang Chen · Errui Ding · Yi Yang · Jingdong Wang | N/A | Code |
| Material Palette: Extraction of Materials from a Single Image | Ivan Lopes · Fabio Pizzati · Raoul de Charette | N/A | Code |
| Embodied Multi-Modal Agent trained by an LLM from a Parallel TextWorld | Yijun Yang · Tianyi Zhou · Kanxue Li · Dapeng Tao · Lusong Li · Li Shen · Xiaodong He · Jing Jiang · Yuhui Shi | N/A | Code |
| Differentiable Point-based Inverse Rendering | Hoon-Gyu Chung · Seokjun Choi · Seung-Hwan Baek | N/A | Code |
| VMINer: Versatile Multi-view Inverse Rendering with Near- and Far-field Light Sources | Fan Fei · Jiajun Tang · Ping Tan · Boxin Shi | N/A | Code |
| FairDeDup: Detecting and Mitigating Vision-Language Fairness Disparities in Semantic Dataset Deduplication | Eric Slyman · Stefan Lee · Scott Cohen · Kushal Kafle | N/A | Code |
| On the Test-Time Zero-Shot Generalization of Vision-Language Models: Do We Really Need Prompt Learning? | Maxime Zanella · Ismail Ben Ayed | N/A | Code |
| C^2RV: Cross-Regional and Cross-View Learning for Sparse-View CBCT Reconstruction | Yiqun Lin · Jiewen Yang · Hualiang Wang · Xinpeng Ding · Wei Zhao · Xiaomeng Li | N/A | Code |
| OmniSDF: Scene Reconstruction using Omnidirectional Signed Distance Functions and Adaptive Binoctrees | Hakyeong Kim · Andreas Meuleman · Hyeonjoong Jang · James Tompkin · Min H. Kim | N/A | Code |
| Consistent Prompting for Rehearsal-Free Continual Learning | Zhanxin Gao · Jun Cen · Xiaobin Chang | N/A | Code |
| MedBN: Robust Test-Time Adaptation against Malicious Test Samples | Hyejin Park · Jeongyeon Hwang · Sunung Mun · Sangdon Park · Jungseul Ok | N/A | Code |
| Beyond First-Order Tweedie: Solving Inverse Problems using Latent Diffusion | Litu Rout · Yujia Chen · Abhishek Kumar · Constantine Caramanis · Sanjay Shakkottai · Wen-Sheng Chu | N/A | Code |
| KVQ: Kwai Video Quality Assessment for Short-form Videos | Yiting Lu · Xin Li · Yajing Pei · Kun Yuan · Qizhi Xie · Yunpeng Qu · Ming Sun · Chao Zhou · Zhibo Chen | N/A | Code |
| Purified and Unified Steganographic Network | GuoBiao Li · Sheng Li · Zicong Luo · Zhenxing Qian · Xinpeng Zhang | N/A | Code |
| Deformable One-shot Face Stylization via DINO Semantic Guidance | Yang Zhou · Zichong Chen · Hui Huang | N/A | Code |
| PartDistill: 3D Shape Part Segmentation by Vision-Language Model Distillation | Ardian Umam · Cheng-Kun Yang · Min-Hung Chen · Jen-Hui Chuang · Yen-Yu Lin | N/A | Code |
| InfLoRA: Interference-Free Low-Rank Adaptation for Continual Learning | Yan-Shuo Liang · Wu-Jun Li | N/A | Code |
| Style Injection in Diffusion: A Training-free Approach for Adapting Large-scale Diffusion Models for Style Transfer | Jiwoo Chung · Sangeek Hyun · Jae-Pil Heo | N/A | Code |
| Skeleton-in-Context: Unified Skeleton Sequence Modeling with In-Context Learning | Xinshun Wang · Zhongbin Fang · Xia Li · Xiangtai Li · Chen Chen · Mengyuan Liu | N/A | Code |
| Previously on ... From Recaps to Story Summarization | Aditya Kumar Singh · Dhruv Srivastava · Makarand Tapaswi | N/A | Code |
| VecFusion: Vector Font Generation with Diffusion | Vikas Thamizharasan · Difan Liu · Shantanu Agarwal · Matthew Fisher · Michaël Gharbi · Oliver Wang · Alec Jacobson · Evangelos Kalogerakis | N/A | Code |
| Generating Non-Stationary Textures using Self-Rectification | Yang Zhou · Rongjun Xiao · Dani Lischinski · Daniel Cohen-Or · Hui Huang | N/A | Code |
| OOSTraj: Out-of-Sight Trajectory Prediction With Vision-Positioning Denoising | Haichao Zhang · Yi Xu · Hongsheng Lu · Takayuki Shimizu · Yun Fu | N/A | Code |
| Frozen Feature Augmentation for Few-Shot Image Classification | Andreas Bär · Neil Houlsby · Mostafa Dehghani · Manoj Kumar | N/A | Code |
| 1-Lipschitz Layers Compared: Memory, Speed, and Certifiable Robustness | Bernd Prach · Fabio Brau · Giorgio Buttazzo · Christoph Lampert | N/A | Code |
| VMC: Video Motion Customization using Temporal Attention Adaption for Text-to-Video Diffusion Models | Hyeonho Jeong · Geon Yeong Park · Jong Chul Ye | N/A | Code |
| Building Optimal Neural Architectures using Interpretable Knowledge | Keith Mills · Fred Han · Mohammad Salameh · Shengyao Lu · Chunhua Zhou · Jiao He · Fengyu Sun · Di Niu | N/A | Code |
| MAPSeg: Unified Unsupervised Domain Adaptation for Heterogeneous Medical Image Segmentation Based on 3D Masked Autoencoding and Pseudo-Labeling | Xuzhe Zhang · Yuhao Wu · Elsa Angelini · Ang Li · Jia Guo · Jerod Rasmussen · Thomas O'Connor · Pathik Wadhwa · Andrea Jackowski · Hai Li · Jonathan Posner · Andrew Laine · Yun Wang | N/A | Code |
| Multimodal Industrial Anomaly Detection by Crossmodal Feature Mapping | Alex Costanzino · Pierluigi Zama Ramirez · Giuseppe Lisanti · Luigi Di Stefano | N/A | Code |
| PoNQ: a Neural QEM-based Mesh Representation | Nissim Maruani · Maks Ovsjanikov · Pierre Alliez · Mathieu Desbrun | N/A | Code |
| Gaussian Head Avatar: Ultra High-fidelity Head Avatar via Dynamic Gaussians | Yuelang Xu · Benwang Chen · Zhe Li · Hongwen Zhang · Lizhen Wang · Zerong Zheng · Yebin Liu | N/A | Code |
| Video Harmonization with Triplet Spatio-Temporal Variation Patterns | Zonghui Guo · XinYu Han · Jie Zhang · Shiguang Shan · Haiyong Zheng | N/A | Code |
| Segment Any Event Streams via Weighted Adaptation of Pivotal Tokens | Zhiwen Chen · Zhiyu Zhu · Yifan Zhang · Junhui Hou · Guangming Shi · Jinjian Wu | N/A | Code |
| Expandable Subspace Ensemble for Pre-Trained Model-Based Class-Incremental Learning | Da-Wei Zhou · Hai-Long Sun · Han-Jia Ye · De-Chuan Zhan | N/A | Code |
| Learning without Exact Guidance: Updating Large-scale High-resolution Land Cover Maps from Low-resolution Historical Labels | Zhuohong Li · Wei He · Jiepan Li · Fangxiao Lu · Hongyan Zhang | N/A | Code |
| Towards Realistic Scene Generation with LiDAR Diffusion Models | Haoxi Ran · Vitor Guizilini · Yue Wang | N/A | Code |
| Exploring Orthogonality in Open World Object Detection | Zhicheng Sun · Jinghan Li · Yadong Mu | N/A | Code |
| Compositional Chain-of-Thought Prompting for Large Multimodal Models | Chancharik Mitra · Brandon Huang · Trevor Darrell · Roei Herzig | N/A | Code |
| As-Plausible-As-Possible: Plausibility-Aware Mesh Deformation Using 2D Diffusion Priors | Seungwoo Yoo · Kunho Kim · Vladimir G. Kim · Minhyuk Sung | N/A | Code |
| 3D Face Tracking from 2D Video through Iterative Dense UV to Image Flow | Felix Taubner · Prashant Raina · Mathieu Tuli · Eu Wern Teh · Chul Lee · Jinmiao Huang | N/A | Code |
| HIT: Estimating Internal Human Implicit Tissues from the Body Surface | Marilyn Keller · Vaibhav Arora · Abdelmouttaleb Dakri · Shivam Chandhok · Jürgen Machann · Andreas Fritsche · Michael J. Black · Sergi Pujades | N/A | Code |
| HMD-Poser: On-Device Real-time Human Motion Tracking from Scalable Sparse Observations | Peng Dai · Yang Zhang · Tao Liu · ZhenFan · Tianyuan Du · Zhuo Su · Xiaozheng Zheng · Zeming Li | N/A | Code |
| Investigating and Mitigating the Side Effects of Noisy Views for Self-Supervised Clustering Algorithms in Practical Multi-View Scenarios | Jie Xu · Yazhou Ren · Xiaolong Wang · Lei Feng · Zheng Zhang · Gang Niu · Xiaofeng Zhu | N/A | Code |
| Rethinking Interactive Image Segmentation with Low Latency, High Quality, and Diverse Prompts | Qin Liu · Jaemin Cho · Mohit Bansal · Marc Niethammer | N/A | Code |
| Multiscale Vision Transformers Meet Bipartite Matching for Efficient Single-stage Action Localization | Ioanna Ntinou · Enrique Sanchez · Georgios Tzimiropoulos | N/A | Code |
| SeMoLi: What Moves Together Belongs Together | Jenny Seidenschwarz · Aljoša Ošep · Francesco Ferroni · Simon Lucey · Laura Leal-Taixe | N/A | Code |
| An N-Point Linear Solver for Line and Motion Estimation with Event Cameras | Ling Gao · Daniel Gehrig · Hang Su · Davide Scaramuzza · Laurent Kneip | N/A | Code |
| Instance-Aware Group Quantization for Vision Transformers | Jaehyeon Moon · Dohyung Kim · Jun Yong Cheon · Bumsub Ham | N/A | Code |
| Text-Driven Image Editing via Learnable Regions | Yuanze Lin · Yi-Wen Chen · Yi-Hsuan Tsai · Lu Jiang · Ming-Hsuan Yang | N/A | Code |
| ZERO-IG: Zero-Shot Illumination-Guided Joint Denoising and Adaptive Enhancement for Low-Light Images | Yiqi Shi · Duo Liu · Liguo Zhang · Ye Tian · Xuezhi Xia · fuxiaojing | N/A | Code |
| HINTED: Hard Instance Enhanced Detector with Mixed-Density Feature Fusion for Sparsely-Supervised 3D Object Detection | Qiming Xia · Wei Ye · Hai Wu · Shijia Zhao · Leyuan Xing · Xun Huang · Jinhao Deng · Xin Li · Chenglu Wen · Cheng Wang | N/A | Code |
| NAYER: Noisy Layer Data Generation for Efficient and Effective Data-free Knowledge Distillation | Minh-Tuan Tran · Trung Le · Xuan-May Le · Mehrtash Harandi · Quan Tran · Dinh Phung | N/A | Code |
| Domain-Specific Block Selection and Paired-View Pseudo-Labeling for Online Test-Time Adaptation | Yeonguk Yu · Sungho Shin · Seunghyeok Back · Minhwan Ko · Sangjun Noh · Kyoobin Lee | N/A | Code |
| Collaborating Foundation Models for Domain Generalized Semantic Segmentation | Yasser Benigmim · Subhankar Roy · Slim Essid · Vicky Kalogeiton · Stéphane Lathuilière | N/A | Code |
| Make-Your-Anchor: A Diffusion-based 2D Avatar Generation Framework | Ziyao Huang · Fan Tang · Yong Zhang · Xiaodong Cun · Juan Cao · Jintao Li · Tong-yee Lee | N/A | Code |
| NEAT: Distilling 3D Wireframes from Neural Attraction Fields | Nan Xue · Bin Tan · Yuxi Xiao · Liang Dong · Gui-Song Xia · Tianfu Wu · Yujun Shen | N/A | Code |
| LEDITS++: Limitless Image Editing using Text-to-Image Models | Manuel Brack · Felix Friedrich · Katharina Kornmeier · Linoy Tsaban · Patrick Schramowski · Kristian Kersting · Apolinário Passos | N/A | Code |
| Doodle Your 3D: From Abstract Freehand Sketches to Precise 3D Shapes | Hmrishav Bandyopadhyay · Subhadeep Koley · Ayan Das · Ayan Kumar Bhunia · Aneeshan Sain · Pinaki Nath Chowdhury · Tao Xiang · Yi-Zhe Song | N/A | Code |
| Emu Edit: Precise Image Editing via Recognition and Generation Tasks | Shelly Sheynin · Adam Polyak · Uriel Singer · Yuval Kirstain · Amit Zohar · Oron Ashual · Devi Parikh · Yaniv Taigman | N/A | Code |
| Revamping Federated Learning Security from a Defender's Perspective: A Unified Defense with Homomorphic Encrypted Data Space | Naveen Kumar Kummari · Reshmi Mitra · Krishna Mohan Chalavadi | N/A | Code |
| PeerAiD: Improving Adversarial Distillation from a Specialized Peer Tutor | Jaewon Jung · Hongsun Jang · Jaeyong Song · Jinho Lee | N/A | Code |
| HIPTrack: Visual Tracking with Historical Prompts | Wenrui Cai · Qingjie Liu · Yunhong Wang | N/A | Code |
| FSRT: Facial Scene Representation Transformer for Face Reenactment from Factorized Appearance, Head-pose, and Facial Expression Features | Andre Rochow · Max Schwarz · Sven Behnke | N/A | Code |
| Situational Awareness Matters in 3D Vision Language Reasoning | Yunze Man · Liang-Yan Gui · Yu-Xiong Wang | N/A | Code |
| Generative Proxemics: A Prior for 3D Social Interaction from Images | Vickie Ye · Georgios Pavlakos · Michael J. Black · Angjoo Kanazawa | N/A | Code |
| The Neglected Tails in Vision-Language Models | Shubham Parashar · Tian Liu · Zhiqiu Lin · Xiangjue Dong · Yanan Li · James Caverlee · Deva Ramanan · Shu Kong | N/A | Code |
| Multi-View Attentive Contextualization for Multi-View 3D Object Detection | Xianpeng Liu · Ce Zheng · Ming Qian · Nan Xue · Chen Chen · Zhebin Zhang · Chen Li · Tianfu Wu | N/A | Code |
| Task-Driven Wavelets using Constrained Empirical Risk Minimization | Eric Marcus · Ray Sheombarsing · Jan-Jakob Sonke · Jonas Teuwen | N/A | Code |
| SIFU: Side-view Conditioned Implicit Function for Real-world Usable Clothed Human Reconstruction | Zechuan Zhang · Zongxin Yang · Yi Yang | N/A | Code |
| Text-to-3D using Gaussian Splatting | Zilong Chen · Feng Wang · Yikai Wang · Huaping Liu | N/A | Code |
| InterHandGen: Two-Hand Interaction Generation via Cascaded Reverse Diffusion | Jihyun Lee · Shunsuke Saito · Giljoo Nam · Minhyuk Sung · Tae-Kyun Kim | N/A | Code |
| Dysen-VDM: Empowering Dynamics-aware Text-to-Video Diffusion with LLMs | Hao Fei · Shengqiong Wu · Wei Ji · Hanwang Zhang · Tat-seng Chua | N/A | Code |
| SceneFun3D: Fine-Grained Functionality and Affordance Understanding in 3D Scenes | Alexandros Delitzas · Ayça Takmaz · Federico Tombari · Robert Sumner · Marc Pollefeys · Francis Engelmann | N/A | Code |
| Source-Free Domain Adaptation with Frozen Multimodal Foundation Model | Song Tang · Wenxin Su · Mao Ye · Xiatian Zhu | N/A | Code |
| Utility-Fairness Trade-Offs and How to Find Them | Sepehr Dehdashtian · Bashir Sadeghi · Vishnu Naresh Boddeti | N/A | Code |
| MMA-Diffusion: MultiModal Attack on Diffusion Models | Yijun Yang · Ruiyuan Gao · Xiaosen Wang · Tsung-Yi Ho · Xu Nan · Qiang Xu | N/A | Code |
| Linguistic-Aware Patch Slimming Framework for Fine-grained Cross-Modal Alignment | Zheren Fu · Lei Zhang · Hou Xia · Zhendong Mao | N/A | Code |
| Grounding and Enhancing Grid-based Models for Neural Fields | Zelin Zhao · Fenglei Fan · Wenlong Liao · Junchi Yan | N/A | Code |
| Fitting Flats to Flats | Gabriel Dogadov · Ugo Finnendahl · Marc Alexa | N/A | Code |
| Scaffold-GS: Structured 3D Gaussians for View-Adaptive Rendering | Tao Lu · Mulin Yu · Linning Xu · Yuanbo Xiangli · Limin Wang · Dahua Lin · Bo Dai | N/A | Code |
| Can Biases in ImageNet Models Explain Generalization? | Paul Gavrikov · Janis Keuper | N/A | Code |
| UnScene3D: Unsupervised 3D Instance Segmentation for Indoor Scenes | David Rozenberszki · Or Litany · Angela Dai | N/A | Code |
| Boosting Flow-based Generative Super-Resolution Models via Learned Prior | Li-Yuan Tsao · Yi-Chen Lo · Chia-Che Chang · Hao-Wei Chen · Roy Tseng · Chien Feng · Chun-Yi Lee | N/A | Code |
| REACTO: Reconstructing Articulated Objects from a Single Video | Chaoyue Song · Jiacheng Wei · Chuan-Sheng Foo · Guosheng Lin · Fayao Liu | N/A | Code |
| Towards Robust Learning to Optimize with Theoretical Guarantees | Qingyu Song · Wei Lin · Juncheng Wang · Hong Xu | N/A | Code |
| Efficient Test-Time Adaptation of Vision-Language Models | Adilbek Karmanov · Dayan Guan · Shijian Lu · Abdulmotaleb El Saddik · Eric P. Xing | N/A | Code |
| Fourier-basis Functions to Bridge Augmentation Gap: Rethinking Frequency Augmentation in Image Classification | Mei Vaish · Shunxin Wang · Nicola Strisciuglio | N/A | Code |
| Modeling Multimodal Social Interactions: New Challenges and Baselines with Densely Aligned Representations | Sangmin Lee · Bolin Lai · Fiona Ryan · Bikram Boote · James Rehg | N/A | Code |
| FreeControl: Training-Free Spatial Control of Any Text-to-Image Diffusion Model with Any Condition | Sicheng Mo · Fangzhou Mu · Kuan Heng Lin · Yanli Liu · Bochen Guan · Yin Li · Bolei Zhou | N/A | Code |
| Cloud-Device Collaborative Learning for Multimodal Large Language Models | Guanqun Wang · Jiaming Liu · Chenxuan Li · Yuan Zhang · Ma Junpeng · Xinyu Wei · Kevin Zhang · Maurice Chong · Renrui Zhang · Yijiang Liu · Shanghang Zhang | N/A | Code |
| Discriminative Sample-Guided and Parameter-Efficient Feature Space Adaptation for Cross-Domain Few-Shot Learning | Rashindrie Perera · Saman Halgamuge | N/A | Code |
| EventEgo3D: 3D Human Motion Capture from Egocentric Event Streams | Christen Millerdurai · Hiroyasu Akada · Jian Wang · Diogo Luvizon · Christian Theobalt · Vladislav Golyanik | N/A | Code |
| LAKE-RED: Camouflaged Images Generation by Latent Background Knowledge Retrieval-Augmented Diffusion | Pancheng Zhao · Peng Xu · Pengda Qin · Deng-Ping Fan · Zhicheng Zhang · Guoli Jia · Bowen Zhou · Jufeng Yang | N/A | Code |
| What Do You See in Vehicle? Comprehensive Vision Solution for In-Vehicle Gaze Estimation | Yihua Cheng · Yaning Zhu · Zongji Wang · Hongquan Hao · Liu Wei · Shiqing Cheng · Xi Wang · Hyung Jin Chang | N/A | Code |
| HUGS: Human Gaussian Splats | Muhammed Kocabas · Jen-Hao Rick Chang · James Gabriel · Oncel Tuzel · Anurag Ranjan | N/A | Code |
| GPS-Gaussian: Generalizable Pixel-wise 3D Gaussian Splatting for Real-time Human Novel View Synthesis | Shunyuan Zheng · Boyao Zhou · Ruizhi Shao · Boning Liu · Shengping Zhang · Liqiang Nie · Yebin Liu | N/A | Code |
| WaveFace: Authentic Face Restoration with Efficient Frequency Recovery | Yunqi Miao · Jiankang Deng · Jungong Han | N/A | Code |
| MAPLM: A Real-World Large-Scale Vision-Language Benchmark for Map and Traffic Scene Understanding | Xu Cao · Tong Zhou · Yunsheng Ma · Wenqian Ye · Can Cui · Kun Tang · Zhipeng Cao · Kaizhao Liang · Ziran Wang · James Rehg · Chao Zheng | N/A | Code |
| Hierarchical Histogram Threshold Segmentation – Auto-terminating High-detail Oversegmentation | Thomas Chang · Simon Seibt · Bartosz von Rymon Lipinski | N/A | Code |
| G-FARS: Gradient-Field-based Auto-Regressive Sampling for 3D Part Grouping | Junfeng Cheng · Tania Stathaki | N/A | Code |
| Triplane Meets Gaussian Splatting: Fast and Generalizable Single-View 3D Reconstruction with Transformers | Zi-Xin Zou · Zhipeng Yu · Yuan-Chen Guo · Yangguang Li · Yan-Pei Cao · Ding Liang · Song-Hai Zhang | N/A | Code |
| LiDAR-based Person Re-identification | Wenxuan Guo · Zhiyu Pan · Yingping Liang · Ziheng Xi · Zhi Chen Zhong · Jianjiang Feng · Jie Zhou | N/A | Code |
| Do Vision and Language Encoders Represent the World Similarly? | Mayug Maniparambil · Raiymbek Akshulakov · Yasser Abdelaziz Dahou Djilali · Mohamed El Amine Seddik · Sanath Narayan · Karttikeya Mangalam · Noel O'Connor | N/A | Code |
| MVBench: A Comprehensive Multi-modal Video Understanding Benchmark | Kunchang Li · Yali Wang · Yinan He · Yizhuo Li · Yi Wang · Yi Liu · Zun Wang · Jilan Xu · Guo Chen · Ping Luo · Limin Wang · Yu Qiao | N/A | Code |
| Mocap Everyone Everywhere: Lightweight Motion Capture With Smartwatches and a Head-Mounted Camera | Jiye Lee · Hanbyul Joo | N/A | Code |
| Residual Denoising Diffusion Models | Jiawei Liu · Qiang Wang · Huijie Fan · Yinong Wang · Yandong Tang · Liangqiong Qu | N/A | Code |
| Towards Robust Event-guided Low-Light Image Enhancement: A Large-Scale Real-World Event-Image Dataset and Novel Approach | Guoqiang Liang · Kanghao Chen · Hangyu Li · Yunfan Lu · Addison, Lin Wang | N/A | Code |
| A Semi-supervised Nighttime Dehazing Baseline with Spatial-Frequency Aware and Realistic Brightness Constraint | Xiaofeng Cong · Jie Gui · Jing Zhang · Junming Hou · Hao Shen | N/A | Code |
| AVID: Any-Length Video Inpainting with Diffusion Model | Zhixing Zhang · Bichen Wu · Xiaoyan Wang · Yaqiao Luo · Luxin Zhang · Yinan Zhao · Peter Vajda · Dimitris N. Metaxas · Licheng Yu | N/A | Code |
| Multi-criteria Token Fusion with One-step-ahead Attention for Efficient Vision Transformers | Sanghyeok Lee · Joonmyung Choi · Hyunwoo J. Kim | N/A | Code |
| Emotional Speech-driven 3D Body Animation via Disentangled Latent Diffusion | Kiran Chhatre · Radek Danecek · Nikos Athanasiou · Giorgio Becherini · Christopher Peters · Michael J. Black · Timo Bolkart | N/A | Code |
| Deciphering ‘What’ and ‘Where’ Visual Pathways from Spectral Clustering of Layer-Distributed Neural Representations | Xiao Zhang · David Yunis · Michael Maire | N/A | Code |
| WaveMo: Learning Wavefront Modulations to See Through Scattering | Mingyang Xie · Haiyun Guo · Brandon Y. Feng · Lingbo Jin · Ashok Veeraraghavan · Christopher Metzler | N/A | Code |
| Optimal Transport Aggregation for Visual Place Recognition | Sergio Izquierdo · Javier Civera | N/A | Code |
| BANF: Band-Limited Neural Fields for Levels of Detail Reconstruction | Ahan Shabanov · Shrisudhan Govindarajan · Cody Reading · Leili Goli · Daniel Rebain · Kwang Moo Yi · Andrea Tagliasacchi | N/A | Code |
| StraightPCF: Straight Point Cloud Filtering | Dasith de Silva Edirimuni · Xuequan Lu · Gang Li · Lei Wei · Antonio Robles-Kelly · Hongdong Li | N/A | Code |
| NeRFiller: Completing Scenes via Generative 3D Inpainting | Ethan Weber · Aleksander Holynski · Varun Jampani · Saurabh Saxena · Noah Snavely · Abhishek Kar · Angjoo Kanazawa | N/A | Code |
| Selective-Stereo: Adaptive Frequency Information Selection for Stereo Matching | Xianqi Wang · Gangwei Xu · Hao Jia · Xin Yang | N/A | Code |
| Customize your NeRF: Adaptive Source Driven 3D Scene Editing via Local-Global Iterative Training | Runze He · Shaofei Huang · Xuecheng Nie · Tianrui Hui · Luoqi Liu · Jiao Dai · Jizhong Han · Guanbin Li · Si Liu | N/A | Code |
| MPOD123: One Image to 3D Content Generation Using Mask-enhanced Progressive Outline-to-Detail Optimization | Jimin Xu · Tianbao Wang · Tao Jin · Shengyu Zhang · Dongjie Fu · Zhe Wang · Jiangjing Lyu · Chengfei Lv · Chaoyue Niu · Zhou Yu · Zhou Zhao · Fei Wu | N/A | Code |
| Slice3D: Multi-Slice Occlusion-Revealing Single View 3D Reconstruction | Yizhi Wang · Wallace Lira · Wenqi Wang · Ali Mahdavi Amiri · Hao Zhang | N/A | Code |
| EgoThink: Evaluating First-Person Perspective Thinking Capability of Vision-Language Models | Sijie Cheng · Zhicheng Guo · Jingwen Wu · Kechen Fang · Peng Li · Huaping Liu · Yang Liu | N/A | Code |
| Learning to Produce Semi-dense Correspondences for Visual Localization | Khang Truong Giang · Soohwan Song · Sungho Jo | N/A | Code |
| When StyleGAN Meets Stable Diffusion: a W+ Adapter for Personalized Image Generation | Xiaoming Li · Xinyu Hou · Chen Change Loy | N/A | Code |
| Differentiable Neural Surface Refinement for Modeling Transparent Objects | Weijian Deng · Dylan Campbell · Chunyi Sun · Shubham Kanitkar · Matthew Shaffer · Stephen Gould | N/A | Code |
| Hourglass Tokenizer for Efficient Transformer-Based 3D Human Pose Estimation | Wenhao Li · Mengyuan Liu · Hong Liu · Pichao Wang · Jialun Cai · Nicu Sebe | N/A | Code |
| Low-power Continuous Remote Behavioral Localization with Event Cameras | Friedhelm Hamann · Suman Ghosh · Ignacio Juarez Martinez · Tom Hart · Alex Kacelnik · Guillermo Gallego | N/A | Code |
| 4D-DRESS: A 4D Dataset of Real-World Human Clothing With Semantic Annotations | Wenbo Wang · Hsuan-I Ho · Chen Guo · Boxiang Rong · Artur Grigorev · Jie Song · Juan Jose Zarate · Otmar Hilliges | N/A | Code |
| Attention-Propagation Network for Egocentric Heatmap to 3D Pose Lifting | Taeho Kang · Youngki Lee | N/A | Code |
| AM-RADIO: Agglomerative Vision Foundation Model Reduce All Domains Into One | Mike Ranzinger · Greg Heinrich · Jan Kautz · Pavlo Molchanov | N/A | Code |
| Towards Co-Evaluation of Cameras HDR and Algorithms for Industrial-Grade 6DoF Pose Estimation | Agastya Kalra · Guy Stoppi · Dmitrii Marin · Vage Taamazyan · Aarrushi Shandilya · Rishav Agarwal · Anton Boykov · Aaron Chong · Michael Stark | N/A | Code |
| An Upload-Efficient Scheme for Transferring Knowledge From a Server-Side Pre-trained Generator to Clients in Heterogeneous Federated Learning | Jianqing Zhang · Yang Liu · Yang Hua · Jian Cao | N/A | Code |
| Partial-to-Partial Shape Matching with Geometric Consistency | Viktoria Ehm · Maolin Gao · Paul Roetzer · Marvin Eisenberger · Daniel Cremers · Florian Bernard | N/A | Code |
| DiffMorpher: Unleashing the Capability of Diffusion Models for Image Morphing | Kaiwen Zhang · Yifan Zhou · Xudong Xu · Bo Dai · Xingang Pan | N/A | Code |
| PIN: Positional Insert Unlocks Object Localisation Abilities in VLMs | Michael Dorkenwald · Nimrod Barazani · Cees G. M. Snoek · Yuki Asano | N/A | Code |
| RMT: Retentive Networks Meet Vision Transformers | Qihang Fan · Huaibo Huang · Mingrui Chen · Hongmin Liu · Ran He | N/A | Code |
| LAENeRF: Local Appearance Editing for Neural Radiance Fields | Lukas Radl · Michael Steiner · Andreas Kurz · Markus Steinberger | N/A | Code |
| Learning Multi-Dimensional Human Preference for Text-to-Image Generation | Sixian Zhang · Bohan Wang · Junqiang Wu · Yan Li · Tingting Gao · Di Zhang · Zhongyuan Wang | N/A | Code |
| Neural Point Cloud Diffusion for Disentangled 3D Shape and Appearance Generation | Philipp Schröppel · Christopher Wewer · Jan Lenssen · Eddy Ilg · Thomas Brox | N/A | Code |
| EarthLoc: Astronaut Photography Localization by Indexing Earth from Space | Gabriele Berton · Alex Stoken · Barbara Caputo · Carlo Masone | N/A | Code |
| Language Models as Black-Box Optimizers for Vision-Language Models | Shihong Liu · Samuel Yu · Zhiqiu Lin · Deepak Pathak · Deva Ramanan | N/A | Code |
| Improved Visual Grounding through Self-Consistent Explanations | Ruozhen He · Paola Cascante-Bonilla · Ziyan Yang · Alex Berg · Vicente Ordonez | N/A | Code |
| Relation Rectification in Diffusion Model | Yinwei Wu · Xingyi Yang · Xinchao Wang | N/A | Code |
| Auto MC-Reward: Automated Dense Reward Design with Large Language Models for Minecraft | Hao Li · Xue Yang · Zhaokai Wang · Xizhou Zhu · Jie Zhou · Yu Qiao · Xiaogang Wang · Hongsheng Li · Lewei Lu · Jifeng Dai | N/A | Code |
| Close Imitation of Expert Retouching for Black-and-White Photography | Seunghyun Shin · Jisu Shin · Jihwan Bae · Inwook Shim · Hae-Gon Jeon | N/A | Code |
| OmniLocalRF: Omnidirectional Local Radiance Fields from Dynamic Videos | Dongyoung Choi · Hyeonjoong Jang · Min H. Kim | N/A | Code |
| Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model | Kai Yang · Jian Tao · Jiafei Lyu · Chunjiang Ge · Jiaxin Chen · Weihan Shen · Xiaolong Zhu · Xiu Li | N/A | Code |
| Split to Merge: Unifying Separated Modalities for Unsupervised Domain Adaptation | Xinyao Li · Yuke Li · Zhekai Du · Fengling Li · Ke Lu · Jingjing Li | N/A | Code |
| Wavelet-based Fourier Information Interaction with Frequency Diffusion Adjustment for Underwater Image Restoration | Chen Zhao · Weiling Cai · Chenyu Dong · Chengwei Hu | N/A | Code |
| Transferable Structural Sparse Adversarial Attack Via Exact Group Sparsity Training | Di Ming · Peng Ren · Yunlong Wang · Xin Feng | N/A | Code |
| XFeat: Accelerated Features for Lightweight Image Matching | Guilherme Potje · Felipe Cadar · André Araujo · Renato Martins · Erickson R. Nascimento | N/A | Code |
| OneFormer3D: One Transformer for Unified Point Cloud Segmentation | Maksim Kolodiazhnyi · Anna Vorontsova · Anton Konushin · Danila Rukhovich | N/A | Code |
| DiffuScene: Denoising Diffusion Models for Generative Indoor Scene Synthesis | Jiapeng Tang · Yinyu Nie · Lev Markhasin · Angela Dai · Justus Thies · Matthias Nießner | N/A | Code |
| IS-Fusion: Instance-Scene Collaborative Fusion for Multimodal 3D Object Detection | Junbo Yin · Wenguan Wang · Runnan Chen · Wei Li · Ruigang Yang · Pascal Frossard · Jianbing Shen | N/A | Code |
| One-2-3-45++: Fast Single Image to 3D Objects with Consistent Multi-View Generation and 3D Diffusion | Minghua Liu · Ruoxi Shi · Linghao Chen · Zhuoyang Zhang · Chao Xu · Xinyue Wei · Hansheng Chen · Chong Zeng · Jiayuan Gu · Hao Su | N/A | Code |
| One-Shot Structure-Aware Stylized Image Synthesis | Hansam Cho · Jonghyun Lee · Seunggyu Chang · Yonghyun Jeong | N/A | Code |
| ASH: Animatable Gaussian Splats for Efficient and Photoreal Human Rendering | Haokai Pang · Heming Zhu · Adam Kortylewski · Christian Theobalt · Marc Habermann | N/A | Code |
| StableVITON: Learning Semantic Correspondence with Latent Diffusion Model for Virtual Try-On | Jeongho Kim · Gyojung Gu · Minho Park · Sunghyun Park · Jaegul Choo | N/A | Code |
| Looking Similar Sounding Different: Leveraging Counterfactual Cross-Modal Pairs for Audiovisual Representation Learning | Nikhil Singh · Chih-Wei Wu · Iroro Orife · Kalayeh | N/A | Code |
| MobileCLIP: Fast Image-Text Models through Multi-Modal Reinforced Training | Pavan Kumar Anasosalu Vasu · Hadi Pouransari · Fartash Faghri · Raviteja Vemulapalli · Oncel Tuzel | N/A | Code |
| TetraSphere: A Neural Descriptor for O(3)-Invariant Point Cloud Analysis | Pavlo Melnyk · Andreas Robinson · Michael Felsberg · Mårten Wadenbäck | N/A | Code |
| WHAM: Reconstructing World-grounded Humans with Accurate 3D Motion | Soyong Shin · Juyong Kim · Eni Halilaj · Michael J. Black | N/A | Code |
| TimeChat: A Time-sensitive Multimodal Large Language Model for Long Video Understanding | Shuhuai Ren · Linli Yao · Shicheng Li · Xu Sun · Lu Hou | N/A | Code |
| 3D Neural Edge Reconstruction | Lei Li · Songyou Peng · Zehao Yu · Shaohui Liu · Rémi Pautrat · Xiaochuan Yin · Marc Pollefeys | N/A | Code |
| YOLO-World: Real-Time Open-Vocabulary Object Detection | Tianheng Cheng · Lin Song · Yixiao Ge · Wenyu Liu · Xinggang Wang · Ying Shan | N/A | Code |
| Direct2.5: Diverse Text-to-3D Generation via Multi-view 2.5D Diffusion | Yuanxun Lu · Jingyang Zhang · Shiwei Li · Tian Fang · David McKinnon · Yanghai Tsin · Long Quan · Xun Cao · Yao Yao | N/A | Code |
| Unexplored Faces of Robustness and Out-of-Distribution: Covariate Shifts in Environment and Sensor Domains | Eunsu Baek · Keondo Park · Ji-yoon Kim · Hyung-Sin Kim | N/A | Code |
| CLOAF: CoLlisiOn-Aware Human Flow | Andrey Davydov · Martin Engilberge · Mathieu Salzmann · Pascal Fua | N/A | Code |
| FedUV: Uniformity and Variance for Heterogeneous Federated Learning | Ha Min Son · Moon-Hyun Kim · Tai-Myoung Chung · Chao Huang · Xin Liu | N/A | Code |
| Efficient 3D Implicit Head Avatar with Mesh-anchored Hash Table Blendshapes | Ziqian Bai · Feitong Tan · Sean Fanello · Rohit Pandey · Mingsong Dou · Shichen Liu · Ping Tan · Yinda Zhang | N/A | Code |
| Explaining the Implicit Neural Canvas: Connecting Pixels to Neurons by Tracing their Contributions | Namitha Padmanabhan · Matthew A Gwilliam · Pulkit Kumar · Shishira R Maiya · Max Ehrlich · Abhinav Shrivastava | N/A | Code |
| Learning Correlation Structures for Vision Transformers | Manjin Kim · Paul Hongsuck Seo · Cordelia Schmid · Minsu Cho | N/A | Code |
| Vanishing-Point-Guided Video Semantic Segmentation of Driving Scenes | Diandian Guo · Deng-Ping Fan · Tongyu Lu · Christos Sakaridis · Luc Van Gool | N/A | Code |
| DiffusionAvatars: Deferred Diffusion for High-fidelity 3D Head Avatars | Tobias Kirschstein · Simon Giebenhain · Matthias Nießner | N/A | Code |
| Let's Think Outside the Box: Exploring Leap-of-Thought in Large Language Models with Creative Humor Generation | Shanshan Zhong · Zhongzhan Huang · Shanghua Gao · Wushao Wen · Liang Lin · Marinka Zitnik · Pan Zhou | N/A | Code |
| LightIt: Illumination Modeling and Control for Diffusion Models | Peter Kocsis · Kalyan Sunkavalli · Julien Philip · Matthias Nießner · Yannick Hold-Geoffroy | N/A | Code |
| Bring Event into RGB and LiDAR: Hierarchical Visual-Motion Fusion for Scene Flow | Hanyu Zhou · Yi Chang · Zhiwei Shi | N/A | Code |
| Language-driven Grasp Detection | An Dinh Vuong · Minh Nhat VU · Baoru Huang · Nghia Nguyen · Hieu Le · Thieu Vo · Anh Nguyen | N/A | Code |
| Each Test Image Deserves A Specific Prompt: Continual Test-Time Adaptation for 2D Medical Image Segmentation | Ziyang Chen · Yongsheng Pan · Yiwen Ye · Mengkang Lu · Yong Xia | N/A | Code |
| SplattingAvatar: Realistic Real-Time Human Avatars with Mesh-Embedded Gaussian Splatting | Zhijing Shao · Wang Zhaolong · Zhuang Li · Duotun Wang · Xiangru Lin · Yu Zhang · Mingming Fan · Zeyu Wang | N/A | Code |
| AirPlanes: Accurate Plane Estimation via 3D-Consistent Embeddings | Jamie Watson · Filippo Aleotti · Mohamed Sayed · Zawar Qureshi · Oisin Mac Aodha · Gabriel J. Brostow · Michael Firman · Sara Vicente | N/A | Code |
| Instance-based Max-margin for Practical Few-shot Recognition | Minghao Fu · Ke Zhu | N/A | Code |
| GaussianAvatar: Towards Realistic Human Avatar Modeling from a Single Video via Animatable 3D Gaussians | Liangxiao Hu · Hongwen Zhang · Yuxiang Zhang · Boyao Zhou · Boning Liu · Shengping Zhang · Liqiang Nie | N/A | Code |
| SVDTree: Semantic Voxel Diffusion for Single Image Tree Reconstruction | Yuan Li · Zhihao Liu · Bedrich Benes · Xiaopeng Zhang · Jianwei Guo | N/A | Code |
| Abductive Ego-View Accident Video Understanding for Safe Driving Perception | Jianwu Fang · Lei-lei Li · Junfei Zhou · Junbin Xiao · Hongkai Yu · Chen Lv · Jianru Xue · Tat-seng Chua | N/A | Code |
| Unmixing Before Fusion: A Generalized Paradigm for Multi-Source-based Hyperspectral Image Synthesis | Yang Yu · Erting Pan · Xinya Wang · Yuheng Wu · Xiaoguang Mei · Jiayi Ma | N/A | Code |
| KITRO: Refining Human Mesh by 2D Clues and Kinematic-tree Rotation | Fengyuan Yang · Kerui Gu · Angela Yao | N/A | Code |
| Optimizing Diffusion Noise Can Serve As Universal Motion Priors | Korrawe Karunratanakul · Konpat Preechakul · Emre Aksan · Thabo Beeler · Supasorn Suwajanakorn · Siyu Tang | N/A | Code |
| Robust Self-calibration of Focal Lengths from the Fundamental Matrix | Viktor Kocur · Daniel Kyselica · Zuzana Kukelova | N/A | Code |
| FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Composition | Ganggui Ding · Canyu Zhao · Wen Wang · Zhen Yang · Zide Liu · Hao Chen · Chunhua Shen | N/A | Code |
| Generative Unlearning for Any Identity | Juwon Seo · Sung-Hoon Lee · Tae-Young Lee · SeungJun Moon · Gyeong-Moon Park | N/A | Code |
| Learning to Control Camera Exposure via Reinforcement Learning | Kyunghyun Lee · Ukcheol Shin · Byeong-Uk Lee | N/A | Code |
| Efficient Privacy-Preserving Visual Localization Using 3D Ray Clouds | Heejoon Moon · Chunghwan Lee · Je Hyeong Hong | N/A | Code |
| Multi-scale Dynamic and Hierarchical Relationship Modeling for Facial Action Units Recognition | Zihan Wang · Siyang Song · Cheng Luo · Songhe Deng · Weicheng Xie · Linlin Shen | N/A | Code |
| Tailored Visions: Enhancing Text-to-Image Generation with Personalized Prompt Rewriting | Zijie Chen · Lichao Zhang · Fangsheng Weng · Lili Pan · Zhenzhong Lan | N/A | Code |
| Text-Conditioned Generative Model of 3D Strand-based Human Hairstyles | Vanessa Sklyarova · Egor Zakharov · Otmar Hilliges · Michael J. Black · Justus Thies | N/A | Code |
| SVDinsTN: A Tensor Network Paradigm for Efficient Structure Search from Regularized Modeling Perspective | Yu-Bang Zheng · Xile Zhao · Junhua Zeng · Chao Li · Qibin Zhao · Heng-Chao Li · Ting-Zhu Huang | N/A | Code |
| Exploring the Potential of Large Foundation Models for Open-Vocabulary HOI Detection | Ting Lei · Shaofeng Yin · Yang Liu | N/A | Code |
| CustomListener: Text-guided Responsive Interaction for User-friendly Listening Head Generation | Xi Liu · Ying Guo · Cheng Zhen · Tong Li · Yingying Ao · Pengfei Yan | N/A | Code |
| Brain Decodes Deep Nets | Huzheng Yang · James Gee · Jianbo Shi | N/A | Code |
| Extend Your Own Correspondences: Unsupervised Distant Point Cloud Registration by Progressive Distance Extension | Quan Liu · Hongzi Zhu · Zhenxi Wang · Yunsong Zhou · Shan Chang · Minyi Guo | N/A | Code |
| Fully Convolutional Slice-to-Volume Reconstruction for Single-Stack MRI | Sean I. Young · Yaël Balbastre · Bruce Fischl · Polina Golland · Juan Iglesias | N/A | Code |
| DiffAvatar: Simulation-Ready Garment Optimization with Differentiable Simulation | Yifei Li · Hsiaoyu Chen · Egor Larionov · Nikolaos Sarafianos · Wojciech Matusik · Tuur Stuyck | N/A | Code |
| LidaRF: Delving into Lidar for Neural Radiance Field on Street Scenes | Shanlin Sun · Bingbing Zhuang · Ziyu Jiang · Buyu Liu · Xiaohui Xie · Manmohan Chandraker | N/A | Code |
| 3DFIRES: Few Image 3D REconstruction for Scenes with Hidden Surfaces | Linyi Jin · Nilesh Kulkarni · David Fouhey | N/A | Code |
| Structure Matters: Tackling the Semantic Discrepancy in Diffusion Models for Image Inpainting | Haipeng Liu · Yang Wang · Biao Qian · Meng Wang · Yong Rui | N/A | Code |
| Task2Box: Box Embeddings for Modeling Asymmetric Task Relationships | Rangel Daroya · Aaron Sun · Subhransu Maji | N/A | Code |
| DUDF: Differentiable Unsigned Distance Fields with Hyperbolic Scaling | Miguel Fainstein · Viviana Siless · Emmanuel Iarussi | N/A | Code |
| RadSimReal: Bridging the Gap Between Synthetic and Real Data in Radar Object Detection With Simulation | Oded Bialer · Yuval Haitman | N/A | Code |
| Mip-Splatting: Alias-free 3D Gaussian Splatting | Zehao Yu · Anpei Chen · Binbin Huang · Torsten Sattler · Andreas Geiger | N/A | Code |
| Not All Prompts Are Secure: A Switchable Backdoor Attack Against Pre-trained Vision Transformers | Sheng Yang · Jiawang Bai · Kuofeng Gao · Yong Yang · Yiming Li · Shu-Tao Xia | N/A | Code |
| Text2QR: Harmonizing Aesthetic Customization and Scanning Robustness for Text-Guided QR Code Generation | Guangyang Wu · Xiaohong Liu · Jun Jia · Xuehao Cui · Guangtao Zhai | N/A | Code |
| DiffSCI: Zero-Shot Snapshot Compressive Imaging via Iterative Spectral Diffusion Model | Zhenghao Pan · Haijin Zeng · Jiezhang Cao · Kai Zhang · Yongyong Chen | N/A | Code |
| SoundingActions: Learning How Actions Sound from Narrated Egocentric Videos | Changan Chen · Kumar Ashutosh · Rohit Girdhar · David Harwath · Kristen Grauman | N/A | Code |
| SelfPose3d: Self-Supervised Multi-Person Multi-View 3d Pose Estimation | Keqi Chen · Vinkle Srivastav · Nicolas Padoy | N/A | Code |
| Context-Aware Integration of Language and Visual References for Natural Language Tracking | Yanyan Shao · Shuting He · Qi Ye · Yuchao Feng · Wenhan Luo · Jiming Chen | N/A | Code |
| Neural Directional Encoding for Efficient and Accurate View-Dependent Appearance Modeling | Liwen Wu · Sai Bi · Zexiang Xu · Fujun Luan · Kai Zhang · Iliyan Georgiev · Kalyan Sunkavalli · Ravi Ramamoorthi | N/A | Code |
| FinePOSE: Fine-Grained Prompt-Driven 3D Human Pose Estimation via Diffusion Models | Jinglin Xu · Yijie Guo · Yuxin Peng | N/A | Code |
| Training Generative Image Super-Resolution Models by Wavelet-Domain Losses Enables Better Control of Artifacts | Cansu Korkmaz · Ahmet Murat Tekalp · Zafer Dogan | N/A | Code |
| MaskClustering: View Consensus based Mask Graph Clustering for Open-Vocabulary 3D Instance Segmentation | Mi Yan · Jiazhao Zhang · Yan Zhu · He Wang | N/A | Code |
| SAFDNet: A Simple and Effective Network for Fully Sparse 3D Object Detection | Gang Zhang · Chen Junnan · Guohuan Gao · Jianmin Li · Si Liu · Xiaolin Hu | N/A | Code |
| MemoNav: Working Memory Model for Visual Navigation | Hongxin Li · Zeyu Wang · Xu Yang · Yuran Yang · Shuqi Mei · Zhaoxiang Zhang | N/A | Code |
| RCooper: A Real-world Large-scale Dataset for Roadside Cooperative Perception | Ruiyang Hao · Siqi Fan · Yingru Dai · Zhenlin Zhang · Chenxi Li · Yuntian Wang · Haibao Yu · Wenxian Yang · Jirui Yuan · Zaiqing Nie | N/A | Code |
| ULIP-2: Towards Scalable Multimodal Pre-training for 3D Understanding | Le Xue · Ning Yu · Shu Zhang · Artemis Panagopoulou · Junnan Li · Roberto Martín-Martín · Jiajun Wu · Caiming Xiong · Ran Xu · Juan Carlos Niebles · Silvio Savarese | N/A | Code |
| SDSTrack: Self-Distillation Symmetric Adapter Learning for Multi-Modal Visual Object Tracking | Xiaojun Hou · Jiazheng Xing · Yijie Qian · Yaowei Guo · Shuo Xin · Junhao Chen · Kai Tang · Mengmeng Wang · Zhengkai Jiang · Liang Liu · Yong Liu | N/A | Code |
| Towards Language-Driven Video Inpainting via Multimodal Large Language Models | Jianzong Wu · Xiangtai Li · Chenyang Si · Shangchen Zhou · Jingkang Yang · Jiangning Zhang · Yining Li · Kai Chen · Yunhai Tong · Ziwei Liu · Chen Change Loy | N/A | Code |
| Convolutional Prompting meets Language Models for Continual Learning | Anurag Roy · Riddhiman Moulick · Vinay Verma · Saptarshi Ghosh · Abir Das | N/A | Code |
| Multiview Aerial Visual RECognition (MAVREC): Can Multi-view Improve Aerial Visual Perception? | Aritra Dutta · Srijan Das · Jacob Nielsen · Rajatsubhra Chakraborty · Mubarak Shah | N/A | Code |
| SchurVINS: Schur Complement-Based Lightweight Visual Inertial Navigation System | Yunfei Fan · Tianyu Zhao · Guidong Wang | N/A | Code |
| MACE: Mass Concept Erasure in Diffusion Models | Shilin Lu · Zilan Wang · Leyang Li · Yanzhu Liu · Adams Wai-Kin Kong | N/A | Code |
| JDEC: JPEG Decoding via Enhanced Continuous Cosine Coefficients | Woo Kyoung Han · Sunghoon Im · Jaedeok Kim · Kyong Hwan Jin | N/A | Code |
| Revisiting Adversarial Training Under Long-Tailed Distributions | Xinli Yue · Ningping Mou · Qian Wang · Lingchen Zhao | N/A | Code |
| Plug-and-Play Diffusion Distillation | Yi-Ting Hsiao · Siavash Khodadadeh · Kevin Duarte · Wei-An Lin · Hui Qu · Mingi Kwon · Ratheesh Kalarot | N/A | Code |
| Polos: Multimodal Metric Learning from Human Feedback for Image Captioning | Yuiga Wada · Kanta Kaneda · Daichi Saito · Komei Sugiura | N/A | Code |
| LEAP-VO: Long-term Effective Any Point Tracking for Visual Odometry | Weirong Chen · Le Chen · Rui Wang · Marc Pollefeys | N/A | Code |
| DEADiff: An Efficient Stylization Diffusion Model with Disentangled Representations | Tianhao Qi · Shancheng Fang · Yanze Wu · Hongtao Xie · Jiawei Liu · Lang Chen · Qian He · Yongdong Zhang | N/A | Code |
| AvatarGPT: All-in-One Framework for Motion Understanding Planning Generation and Beyond | Zixiang Zhou · Yu Wan · Baoyuan Wang | N/A | Code |
| Aerial Lifting: Neural Urban Semantic and Building Instance Lifting from Aerial Imagery | Yuqi Zhang · Guanying Chen · Jiaxing Chen · Shuguang Cui | N/A | Code |
| Beyond Textual Constraints: Learning Novel Diffusion Conditions with Fewer Examples | Yuyang Yu · Bangzhen Liu · Chenxi Zheng · Xuemiao Xu · Huaidong Zhang · Shengfeng He | N/A | Code |
| ProxyCap: Real-time Monocular Full-body Capture in World Space via Human-Centric Proxy-to-Motion Learning | Yuxiang Zhang · Hongwen Zhang · Liangxiao Hu · Jiajun Zhang · Hongwei Yi · Shengping Zhang · Yebin Liu | N/A | Code |
| Learning from Synthetic Human Group Activities | Che-Jui Chang · Danrui Li · Deep Patel · Parth Goel · Seonghyeon Moon · Samuel Sohn · Honglu Zhou · Sejong Yoon · Vladimir Pavlovic · Mubbasir Kapadia | N/A | Code |
| Task-aligned Part-aware Panoptic Segmentation through Joint Object-Part Representations | Daan de Geus · Gijs Dubbelman | N/A | Code |
| Unsupervised 3D Structure Inference from Category-Specific Image Collections | Weikang Wang · Dongliang Cao · Florian Bernard | N/A | Code |
| Alpha Invariance: On Inverse Scaling Between Distance and Volume Density in Neural Radiance Fields | Joshua Ahn · Haochen Wang · Raymond A. Yeh · Greg Shakhnarovich | N/A | Code |
| ContextSeg: Sketch Semantic Segmentation by Querying the Context with Attention | Jiawei Wang · Changjian Li | N/A | Code |
| Multi-Modal Proxy Learning Towards Personalized Visual Multiple Clustering | Jiawei Yao · Qi Qian · Juhua Hu | N/A | Code |
| Contrasting Intra-Modal and Ranking Cross-Modal Hard Negatives to Enhance Visio-Linguistic Compositional Understanding | Le Zhang · Rabiul Awal · Aishwarya Agrawal | N/A | Code |
| Edit One for All: Interactive Batch Image Editing | Thao Nguyen · Utkarsh Ojha · Yuheng Li · Haotian Liu · Yong Jae Lee | N/A | Code |
| DreamAvatar: Text-and-Shape Guided 3D Human Avatar Generation via Diffusion Models | Yukang Cao · Yan-Pei Cao · Kai Han · Ying Shan · Kwan-Yee K. Wong | N/A | Code |
| Driving-Video Dehazing with Non-Aligned Regularization for Safety Assistance | Junkai Fan · Jiangwei Weng · Kun Wang · Yijun Yang · Jianjun Qian · Jun Li · Jian Yang | N/A | Code |
| Unifying Top-down and Bottom-up Scanpath Prediction Using Transformers | Zhibo Yang · Sounak Mondal · Seoyoung Ahn · Ruoyu Xue · Gregory Zelinsky · Minh Hoai · Dimitris Samaras | N/A | Code |
| SAI3D: Segment Any Instance in 3D Scenes | Yingda Yin · Yuzheng Liu · Yang Xiao · Daniel Cohen-Or · Jingwei Huang · Baoquan Chen | N/A | Code |
| Diffusion 3D Features (Diff3F): Decorating Untextured Shapes with Distilled Semantic Features | Niladri Shekhar Dutt · Sanjeev Muralikrishnan · Niloy J. Mitra | N/A | Code |
| Coupled Laplacian Eigenmaps for Locally-Aware 3D Rigid Point Cloud Matching | Matteo Bastico · Etienne Decencière · Laurent Corté · Yannick Tillier · David Ryckelynck | N/A | Code |
| Pre-trained Model Guided Fine-Tuning for Zero-Shot Adversarial Robustness | Sibo Wang · Jie Zhang · Zheng Yuan · Shiguang Shan | N/A | Code |
| MART: Masked Affective RepresenTation Learning via Masked Temporal Distribution Distillation | Zhicheng Zhang · Pancheng Zhao · Eunil Park · Jufeng Yang | N/A | Code |
| Coarse-to-Fine Latent Diffusion for Pose-Guided Person Image Synthesis | Yanzuo Lu · Manlin Zhang · Jinhua Ma · Xiaohua Xie · Jianhuang Lai | N/A | Code |
| ICP-Flow: LiDAR Scene Flow Estimation with ICP | Yancong Lin · Holger Caesar | N/A | Code |
| Benchmarking Implicit Neural Representation and Geometric Rendering in Real-Time RGB-D SLAM | Tongyan Hua · Addison, Lin Wang | N/A | Code |
| DeepCache: Accelerating Diffusion Models for Free | Xinyin Ma · Gongfan Fang · Xinchao Wang | N/A | Code |
| The Devil is in the Fine-Grained Details: Evaluating Open-Vocabulary Object Detectors for Fine-Grained Understanding | Lorenzo Bianchi · Fabio Carrara · Nicola Messina · Claudio Gennaro · Fabrizio Falchi | N/A | Code |
| CAT: Exploiting Inter-Class Dynamics for Domain Adaptive Object Detection | Mikhail Kennerley · Jian-Gang Wang · Bharadwaj Veeravalli · Robby T. Tan | N/A | Code |
| Semantic Human Mesh Reconstruction with Textures | Xiaoyu Zhan · Jianxin Yang · Yuanqi Li · Jie Guo · Yanwen Guo · Wenping Wang | N/A | Code |
| CN-RMA: Combined Network with Ray Marching Aggregation for 3D Indoor Object Detection from Multi-view Images | Guanlin Shen · Jingwei Huang · Zhihua Hu · Bin Wang | N/A | Code |
| Neural Clustering based Visual Representation Learning | Guikun Chen · Xia Li · Yi Yang · Wenguan Wang | N/A | Code |
| ViVid-1-to-3: Novel View Synthesis with Video Diffusion Models | Jeong-gi Kwak · Erqun Dong · Yuhe Jin · Hanseok Ko · Shweta Mahajan · Kwang Moo Yi | N/A | Code |
| BEHAVIOR Vision Suite: Customizable Dataset Generation via Simulation | Yunhao Ge · Yihe Tang · Jiashu Xu · Cem Gokmen · Chengshu Li · Wensi Ai · Benjamin Martinez · Arman Aydin · Mona Anvari · Ayush Chakravarthy · Hong-Xing Yu · Josiah Wong · Sanjana Srivastava · Sharon Lee · Shengxin Zha · Laurent Itti · Yunzhu Li · Roberto Martín-Martín · Miao Liu · Pengchuan Zhang · Ruohan Zhang · Li Fei-Fei · Jiajun Wu | N/A | Code |
| CapHuman: Capture Your Moments in Parallel Universes | Chao Liang · Fan Ma · Linchao Zhu · Yingying Deng · Yi Yang | N/A | Code |
| NeRF Director: Revisiting View Selection in Neural Volume Rendering | Wenhui Xiao · Rodrigo Santa Cruz · David Ahmedt-Aristizabal · Olivier Salvado · Clinton Fookes · Leo Lebrat | N/A | Code |
| MANUS: Markerless Grasp Capture using Articulated 3D Gaussians | Chandradeep Pokhariya · Ishaan Shah · Angela Xing · Zekun Li · Kefan Chen · Avinash Sharma · Srinath Sridhar | N/A | Code |
| 4D-fy: Text-to-4D Generation Using Hybrid Score Distillation Sampling | Sherwin Bahmani · Ivan Skorokhodov · Victor Rong · Gordon Wetzstein · Leonidas Guibas · Peter Wonka · Sergey Tulyakov · Jeong Joon Park · Andrea Tagliasacchi · David B. Lindell | N/A | Code |
| VS: Reconstructing Clothed 3D Human from Single Image via Vertex Shift | Leyuan Liu · Yuhan Li · Yunqi Gao · Changxin Gao · Yuanyuan Liu · Jingying Chen | N/A | Code |
| En3D: An Enhanced Generative Model for Sculpting 3D Humans from 2D Synthetic Data | Yifang Men · Biwen Lei · Yuan Yao · Miaomiao Cui · Zhouhui Lian · Xuansong Xie | N/A | Code |
| Diversified and Personalized Multi-rater Medical Image Segmentation | Yicheng Wu · Xiangde Luo · Zhe Xu · Xiaoqing Guo · Lie Ju · Zongyuan Ge · Wenjun Liao · Jianfei Cai | N/A | Code |
| vid-TLDR: Training Free Token Merging for Light-weight Video Transformer | Joonmyung Choi · Sanghyeok Lee · Jaewon Chu · Minhyuk Choi · Hyunwoo J. Kim | N/A | Code |
| Point Transformer V3: Simpler Faster Stronger | Xiaoyang Wu · Li Jiang · Peng-Shuai Wang · Zhijian Liu · Xihui Liu · Yu Qiao · Wanli Ouyang · Tong He · Hengshuang Zhao | N/A | Code |
| Depth-Aware Concealed Crop Detection in Dense Agricultural Scenes | Liqiong Wang · Jinyu Yang · Yanfu Zhang · Fangyi Wang · Feng Zheng | N/A | Code |
| DiffuseMix: Label-Preserving Data Augmentation with Diffusion Models | Khawar Islam · Muhammad Zaigham Zaheer · Arif Mahmood · Karthik Nandakumar | N/A | Code |
| MoMask: Generative Masked Modeling of 3D Human Motions | Chuan Guo · Yuxuan Mu · Muhammad Gohar Javed · Sen Wang · Li Cheng | N/A | Code |
| MP5: A Multi-modal Open-ended Embodied System in Minecraft via Active Perception | Yiran Qin · Enshen Zhou · Qichang Liu · Zhenfei Yin · Lu Sheng · Ruimao Zhang · Yu Qiao · Jing Shao | N/A | Code |
| Initialization Matters for Adversarial Transfer Learning | Andong Hua · Jindong Gu · Zhiyu Xue · Nicholas Carlini · Eric Wong · Yao Qin | N/A | Code |
| Stationary Representations: Optimally Approximating Compatibility and Implications for Improved Model Replacements | Niccolò Biondi · Federico Pernici · Simone Ricci · Alberto Del Bimbo | N/A | Code |
| Gaussian Shadow Casting for Neural Characters | Luis Bolanos · Shih-Yang Su · Helge Rhodin | N/A | Code |
| Enhancing 3D Fidelity of Text-to-3D using Cross-View Correspondences | Seungwook Kim · Kejie Li · Xueqing Deng · Yichun Shi · Minsu Cho · Peng Wang | N/A | Code |
| BigGait: Learning Gait Representation You Want by Large Vision Models | Dingqiang Ye · Chao Fan · Jingzhe Ma · Xiaoming Liu · Shiqi Yu | N/A | Code |
| Learning to Rematch Mismatched Pairs for Robust Cross-Modal Retrieval | Haochen Han · Qinghua Zheng · Guang Dai · Minnan Luo · Jingdong Wang | N/A | Code |
| Language-Driven Anchors for Zero-Shot Adversarial Robustness | Xiao Li · Wei Zhang · Yining Liu · Zhanhao Hu · Bo Zhang · Xiaolin Hu | N/A | Code |
| Gaussian Shell Maps for Efficient 3D Human Generation | Rameen Abdal · Wang Yifan · Zifan Shi · Yinghao Xu · Ryan Po · Zhengfei Kuang · Qifeng Chen · Dit-Yan Yeung · Gordon Wetzstein | N/A | Code |
| Living Scenes: Multi-object Relocalization and Reconstruction in Changing 3D Environments | Liyuan Zhu · Shengyu Huang · Konrad Schindler · Iro Armeni | N/A | Code |
| Incorporating Geo-Diverse Knowledge into Prompting for Increased Geographical Robustness in Object Recognition | Kyle Buettner · Sina Malakouti · Xiang Li · Adriana Kovashka | N/A | Code |
| MindBridge: A Cross-Subject Brain Decoding Framework | Shizun Wang · Songhua Liu · Zhenxiong Tan · Xinchao Wang | N/A | Code |
| State Space Models for Event Cameras | Nikola Zubic · Mathias Gehrig · Davide Scaramuzza | N/A | Code |
| TIM: A Time Interval Machine for Audio-Visual Action Recognition | Jacob Chalk · Jaesung Huh · Evangelos Kazakos · Andrew Zisserman · Dima Damen | N/A | Code |
| READ: Retrieval-Enhanced Asymmetric Diffusion for Motion Planning | Takeru Oba · Matthew Walter · Norimichi Ukita | N/A | Code |
| MeshPose: Unifying DensePose and 3D Body Mesh Reconstruction | Eric-Tuan Le · Antonios Kakolyris · Petros Koutras · Himmy Tam · Efstratios Skordos · George Papandreou · Riza Alp Guler · Iasonas Kokkinos | N/A | Code |
| AV2AV: Direct Audio-Visual Speech to Audio-Visual Speech Translation with Unified Audio-Visual Speech Representation | Jeongsoo Choi · Se Jin Park · Minsu Kim · Yong Man Ro | N/A | Code |
| Matching Anything by Segmenting Anything | Siyuan Li · Lei Ke · Martin Danelljan · Luigi Piccinelli · Mattia Segu · Luc Van Gool · Fisher Yu | N/A | Code |
| MeLFusion: Synthesizing Music from Image and Language Cues using Diffusion Models | Sanjoy Chowdhury · Sayan Nag · Joseph K J · Balaji Vasan Srinivasan · Dinesh Manocha | N/A | Code |
| Narrative Action Evaluation with Prompt-Guided Multimodal Interaction | Shiyi Zhang · Sule Bai · Guangyi Chen · Lei Chen · Jiwen Lu · Junle Wang · Yansong Tang | N/A | Code |
| MemSAM: Taming Segment Anything Model for Echocardiography Video Segmentation | Xiaolong Deng · Huisi Wu · Runhao Zeng · Jing Qin | N/A | Code |
| SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editing | Zeyinzi Jiang · Chaojie Mao · Yulin Pan · Zhen Han · Jingfeng Zhang | N/A | Code |
| ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts | Mu Cai · Haotian Liu · Siva Mustikovela · Gregory P. Meyer · Yuning Chai · Dennis Park · Yong Jae Lee | N/A | Code |
| LeGO: Leveraging a Surface Deformation Network for Animatable Stylized Face Generation with One Example | Soyeon Yoon · Kwan Yun · Kwanggyoon Seo · Sihun Cha · Jung Eun Yoo · Junyong Noh | N/A | Code |
| Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers | Tsai-Shien Chen · Aliaksandr Siarohin · Willi Menapace · Ekaterina Deyneka · Hsiang-wei Chao · Byung Jeon · Yuwei Fang · Hsin-Ying Lee · Jian Ren · Ming-Hsuan Yang · Sergey Tulyakov | N/A | Code |
| Unified-IO 2: Scaling Autoregressive Multimodal Models with Vision Language Audio and Action | Jiasen Lu · Christopher Clark · Sangho Lee · Zichen Zhang · Savya Khosla · Ryan Marten · Derek Hoiem · Aniruddha Kembhavi | N/A | Code |
| Unsupervised Semantic Segmentation Through Depth-Guided Feature Correlation and Sampling | Leon Sick · Dominik Engel · Pedro Hermosilla · Timo Ropinski | N/A | Code |
| HHMR: Holistic Hand Mesh Recovery by Enhancing the Multimodal Controllability of Graph Diffusion Models | Mengcheng Li · Hongwen Zhang · Yuxiang Zhang · Ruizhi Shao · Tao Yu · Yebin Liu | N/A | Code |
| Codebook Transfer with Part-of-Speech for Vector-Quantized Image Modeling | Baoquan Zhang · Huaibin Wang · Luo Chuyao · Xutao Li · Guotao Liang · Yunming Ye · joeq · Yao He | N/A | Code |
| Open3DSG: Open-Vocabulary 3D Scene Graphs from Point Clouds with Queryable Objects and Open-Set Relationships | Sebastian Koch · Narunas Vaskevicius · Mirco Colosi · Pedro Hermosilla · Timo Ropinski | N/A | Code |
| ChatPose: Chatting about 3D Human Pose | Yao Feng · Jing Lin · Sai Kumar Dwivedi · Yu Sun · Priyanka Patel · Michael J. Black | N/A | Code |
| Modality-agnostic Domain Generalizable Medical Image Segmentation by Multi-Frequency in Multi-Scale Attention | Ju-Hyeon Nam · Nur Suriza Syazwany · Su Jung Kim · Sang-Chul Lee | N/A | Code |
| Improved Baselines with Visual Instruction Tuning | Haotian Liu · Chunyuan Li · Yuheng Li · Yong Jae Lee | N/A | Code |
| DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasks | Jiaxin Zhang · Dezhi Peng · Chongyu Liu · Peirong Zhang · Lianwen Jin | N/A | Code |
| DiffPortrait3D: Controllable Diffusion for Zero-Shot Portrait View Synthesis | Yuming Gu · Hongyi Xu · You Xie · Guoxian Song · Yichun Shi · Di Chang · Jing Yang · Linjie Luo | N/A | Code |
| Video Prediction by Modeling Videos as Continuous Multi-Dimensional Processes | Gaurav Shrivastava · Abhinav Shrivastava | N/A | Code |
| MoST: Motion Style Transformer Between Diverse Action Contents | Boeun Kim · Jungho Kim · Hyung Jin Chang · Jin Young Choi | N/A | Code |
| Rotation-Agnostic Image Representation Learning for Digital Pathology | Saghir Alfasly · Abubakr Shafique · Peyman Nejat · Jibran Khan · Areej Alsaafin · Ghazal Alabtah · Hamid Tizhoosh | N/A | Code |
| Mind The Edge: Refining Depth Edges in Sparsely-Supervised Monocular Depth Estimation | Lior Talker · Aviad Cohen · Erez Yosef · Alexandra Dana · Michael Dinerstein | N/A | Code |
| Generating Human Motion in 3D Scenes from Text Descriptions | Zhi Cen · Huaijin Pi · Sida Peng · Zehong Shen · Minghui Yang · Shuai Zhu · Hujun Bao · Xiaowei Zhou | N/A | Code |
| Visual Point Cloud Forecasting enables Scalable Autonomous Driving | Zetong Yang · Li Chen · Yanan Sun · Hongyang Li | N/A | Code |
| MaGGIe: Masked Guided Gradual Human Instance Matting | Chuong Huynh · Seoung Wug Oh · Abhinav Shrivastava · Joon-Young Lee | N/A | Code |
| Grid Diffusion Models for Text-to-Video Generation | Taegyeong Lee · Soyeong Kwon · Taehwan Kim | N/A | Code |
| Neural Parametric Gaussians for Monocular Non-Rigid Object Reconstruction | Devikalyan Das · Christopher Wewer · Raza Yunus · Eddy Ilg · Jan Lenssen | N/A | Code |
| Making Vision Transformers Truly Shift-Equivariant | Renan A. Rojas-Gomez · Teck-Yian Lim · Minh Do · Raymond A. Yeh | N/A | Code |
| DreamVideo: Composing Your Dream Videos with Customized Subject and Motion | Yujie Wei · Shiwei Zhang · Zhiwu Qing · Hangjie Yuan · Zhiheng Liu · Yu Liu · Yingya Zhang · Jingren Zhou · Hongming Shan | N/A | Code |
| Posterior Distillation Sampling | Juil Koo · Chanho Park · Minhyuk Sung | N/A | Code |
| RankED: Addressing Imbalance and Uncertainty in Edge Detection Using Ranking-based Losses | Bedrettin Cetinkaya · Sinan Kalkan · Emre Akbas | N/A | Code |
| Free3D: Consistent Novel View Synthesis without 3D Representation | Chuanxia Zheng · Andrea Vedaldi | N/A | Code |
| Sherpa3D: Boosting High-Fidelity Text-to-3D Generation via Coarse 3D Prior | Fangfu Liu · Diankun Wu · Yi Wei · Yongming Rao · Yueqi Duan | N/A | Code |
| HumanNorm: Learning Normal Diffusion Model for High-quality and Realistic 3D Human Generation | Xin Huang · Ruizhi Shao · Qi Zhang · Hongwen Zhang · Ying Feng · Yebin Liu · Qing Wang | N/A | Code |
| Generalized Event Cameras | Varun Sundar · Matthew Dutson · Andrei Ardelean · Claudio Bruschini · Edoardo Charbon · Mohit Gupta | N/A | Code |
| BEVSpread: Spread Voxel Pooling for Bird’s-Eye-View Representation in Vision-based Roadside 3D Object Detection | Wenjie Wang · Yehao Lu · Guangcong Zheng · Shuigen Zhan · Xiaoqing Ye · Zichang Tan · Jingdong Wang · Gaoang Wang · Xi Li | N/A | Code |
| GeoReF: Geometric Alignment Across Shape Variation for Category-level Object Pose Refinement | Linfang Zheng · Tze Ho Elden Tse · Chen Wang · Yinghan Sun · Hua Chen · Aleš Leonardis · Wei Zhang · Hyung Jin Chang | N/A | Code |
| Unbiased Estimator for Distorted Conics in Camera Calibration | Chaehyeon Song · Jaeho Shin · Myung-Hwan Jeon · Jongwoo Lim · Ayoung Kim | N/A | Code |
| Multi-modal In-Context Learning Makes an Ego-evolving Scene Text Recognizer | Zhen Zhao · Jingqun Tang · Chunhui Lin · Binghong Wu · Can Huang · Hao Liu · Xin Tan · Zhizhong Zhang · Yuan Xie | N/A | Code |
| NIVeL: Neural Implicit Vector Layers for Text-to-Vector Generation | Vikas Thamizharasan · Difan Liu · Matthew Fisher · Nanxuan Zhao · Evangelos Kalogerakis · Michal Lukáč | N/A | Code |
| FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation | Shuai Yang · Yifan Zhou · Ziwei Liu · Chen Change Loy | N/A | Code |
| HOLD: Category-agnostic 3D Reconstruction of Interacting Hands and Objects from Video | Zicong Fan · Maria Parelli · Maria Kadoglou · Xu Chen · Muhammed Kocabas · Michael J. Black · Otmar Hilliges | N/A | Code |
| SkillDiffuser: Interpretable Hierarchical Planning via Skill Abstractions in Diffusion-Based Task Execution | Zhixuan Liang · Yao Mu · Hengbo Ma · Masayoshi Tomizuka · Mingyu Ding · Ping Luo | N/A | Code |
| Learning to Count without Annotations | Lukas Knobel · Tengda Han · Yuki Asano | N/A | Code |
| RoDLA: Benchmarking the Robustness of Document Layout Analysis Models | Yufan Chen · Jiaming Zhang · Kunyu Peng · Junwei Zheng · Ruiping Liu · Philip H.S. Torr · Rainer Stiefelhagen | N/A | Code |
| Quilt-LLaVA: Visual Instruction Tuning by Extracting Localized Narratives from Open-Source Histopathology Videos | Mehmet Saygin Seyfioglu · Wisdom Ikezogwo · Fatemeh Ghezloo · Ranjay Krishna · Linda Shapiro | N/A | Code |
| Diffusion-FOF: Single-View Clothed Human Reconstruction via Diffusion-Based Fourier Occupancy Field | Yuanzhen Li · Fei Luo · Chunxia Xiao | N/A | Code |
| SURE: SUrvey REcipes for building reliable and robust deep networks | Yuting Li · Yingyi Chen · Xuanlong Yu · Dexiong Chen · Xi Shen | N/A | Code |
| Learning CNN on ViT: A Hybrid Model to Explicitly Class-specific Boundaries for Domain Adaptation | Ba Hung Ngo · Nhat-Tuong Do-Tran · Tuan-Ngoc Nguyen · Hae-Gon Jeon · Tae Jong Choi | N/A | Code |
| From Correspondences to Pose: Non-minimal Certifiably Optimal Relative Pose without Disambiguation | Javier Tirado-Garín · Javier Civera | N/A | Code |
| Open-Vocabulary Attention Maps with Token Optimization for Semantic Segmentation in Diffusion Models | Pablo Marcos-Manchón · Roberto Alcover-Couso · Juan SanMiguel · Jose M. Martinez | N/A | Code |
| Attentive Illumination Decomposition Model for Multi-Illuminant White Balancing | Dongyoung Kim · Jinwoo Kim · Junsang Yu · Seon Joo Kim | N/A | Code |
| Label Propagation for Zero-shot Classification with Vision-Language Models | Vladan Stojnić · Yannis Kalantidis · Giorgos Tolias | N/A | Code |
| Boosting Object Detection with Zero-Shot Day-Night Domain Adaptation | Zhipeng Du · Miaojing Shi · Jiankang Deng | N/A | Code |
| A&B BNN: Add&Bit-Operation-Only Hardware-Friendly Binary Neural Network | Ruichen Ma · Guanchao Qiao · Yian Liu · Liwei Meng · Ning Ning · Yang Liu · Shaogang Hu | N/A | Code |
| LEOD: Label-Efficient Object Detection for Event Cameras | Ziyi Wu · Mathias Gehrig · Qing Lyu · Xudong Liu · Igor Gilitschenski | N/A | Code |
| OpenESS: Event-based Semantic Scene Understanding with Open Vocabularies | Lingdong Kong · Youquan Liu · Lai Xing Ng · Benoit Cottereau · Wei Tsang Ooi | N/A | Code |
| VAREN: Very Accurate and Realistic Equine Network | Silvia Zuffi · Ylva Mellbin · Ci Li · Markus Höschle · Hedvig Kjellström · Senya Polikovsky · Elin Hernlund · Michael J. Black | N/A | Code |
| Atlantis: Enabling Underwater Depth Estimation with Stable Diffusion | Fan Zhang · Shaodi You · Yu Li · Ying Fu | N/A | Code |
| Cross-spectral Gated-RGB Stereo Depth Estimation | Samuel Brucker · Stefanie Walz · Mario Bijelic · Felix Heide | N/A | Code |
| 3D Face Reconstruction with the Geometric Guidance of Facial Part Segmentation | Zidu Wang · Xiangyu Zhu · Tianshuo Zhang · Baiqin Wang · Zhen Lei | N/A | Code |
| HallusionBench: An Advanced Diagnostic Suite for Entangled Language Hallucination and Visual Illusion in Large Vision-Language Models | Tianrui Guan · Fuxiao Liu · Xiyang Wu · Ruiqi Xian · Zongxia Li · Xiaoyu Liu · Xijun Wang · Lichang Chen · Furong Huang · Yaser Yacoob · Dinesh Manocha · Tianyi Zhou | N/A | Code |
| Object Recognition as Next Token Prediction | Kaiyu Yue · Bor-Chun Chen · Jonas Geiping · Hengduo Li · Tom Goldstein · Ser-Nam Lim | N/A | Code |
| Flexible Biometrics Recognition: Bridging the Multimodality Gap through Attention Alignment and Prompt Tuning | Leslie Ching Ow Tiong · Dick Sigmund · Chen-Hui Chan · Andrew Beng Jin Teoh | N/A | Code |
| EAGLE: Eigen Aggregation Learning for Object-Centric Unsupervised Semantic Segmentation | Chanyoung Kim · Woojung Han · Dayun Ju · Seong Jae Hwang | N/A | Code |
| Disentangled Pre-training for Human-Object Interaction Detection | Zhuolong Li · Xingao Li · Changxing Ding · Xiangmin Xu | N/A | Code |
| GSNeRF: Generalizable Semantic Neural Radiance Fields with Enhanced 3D Scene Understanding | Zi-Ting Chou · Sheng-Yu Huang · I-Jieh Liu · Yu-Chiang Frank Wang | N/A | Code |
| CAT-DM: Controllable Accelerated Virtual Try-on with Diffusion Model | Jianhao Zeng · Dan Song · Weizhi Nie · Hongshuo Tian · Tongtong Wang · An-An Liu | N/A | Code |
| CoGS: Controllable Gaussian Splatting | Heng Yu · Joel Julin · Zoltán Á. Milacski · Koichiro Niinuma · László A. Jeni | N/A | Code |
| A Bayesian Approach to OOD Robustness in Image Classification | Prakhar Kaushik · Adam Kortylewski · Alan L. Yuille | N/A | Code |
| ConCon-Chi: Concept-Context Chimera Benchmark for Personalized Vision-Language Tasks | Andrea Rosasco · Stefano Berti · Giulia Pasquale · Damiano Malafronte · Shogo Sato · Hiroyuki Segawa · Tetsugo Inada · Lorenzo Natale | N/A | Code |
| Readout Guidance: Learning Control from Diffusion Features | Grace Luo · Trevor Darrell · Oliver Wang · Dan B Goldman · Aleksander Holynski | N/A | Code |
| Transcriptomics-guided Slide Representation Learning in Computational Pathology | Guillaume Jaume · Lukas Oldenburg · Anurag Vaidya · Richard J. Chen · Drew F. K. Williamson · Thomas Peeters · Andrew Song · Faisal Mahmood | N/A | Code |
| Diffusion-ES: Gradient-free Planning with Diffusion for Autonomous and Instruction-guided Driving | Brian Yang · Huangyuan Su · Nikolaos Gkanatsios · Tsung-Wei Ke · Ayush Jain · Jeff Schneider · Katerina Fragkiadaki | N/A | Code |
| CRKD: Enhanced Camera-Radar Object Detection with Cross-modality Knowledge Distillation | Lingjun Zhao · Jingyu Song · Katherine Skinner | N/A | Code |
| Instance-aware Contrastive Learning for Occluded Human Mesh Reconstruction | Mi-Gyeong Gwon · Gi-Mun Um · Won-Sik Cheong · Wonjun Kim | N/A | Code |
| BrainWash: A Poisoning Attack to Forget in Continual Learning | Ali Abbasi · Parsa Nooralinejad · Hamed Pirsiavash · Soheil Kolouri | N/A | Code |
| WALT3D: Generating Realistic Training Data from Time-Lapse Imagery for Reconstructing Dynamic Objects Under Occlusion | Khiem Vuong · N. Dinesh Reddy · Robert Tamburo · Srinivasa G. Narasimhan | N/A | Code |
| Data Valuation and Detections in Federated Learning | Wenqian Li · Shuran Fu · Fengrui Zhang · Yan Pang | N/A | Code |
| ProTeCt: Prompt Tuning for Taxonomic Open Set Classification | Tz-Ying Wu · Chih-Hui Ho · Nuno Vasconcelos | N/A | Code |
| Mosaic-SDF for 3D Generative Models | Lior Yariv · Omri Puny · Oran Gafni · Yaron Lipman | N/A | Code |
| FreeMan: Towards Benchmarking 3D Human Pose Estimation under Real-World Conditions | Jiong Wang · Fengyu Yang · Bingliang Li · Wenbo Gou · Danqi Yan · Ailing Zeng · Yijun Gao · Junle Wang · Yanqing Jing · Ruimao Zhang | N/A | Code |
| TUMTraf V2X Cooperative Perception Dataset | Walter Zimmer · Gerhard Arya Wardana · Suren Sritharan · Xingcheng Zhou · Rui Song · Alois Knoll | N/A | Code |
| Visual Concept Connectome (VCC): Open World Concept Discovery and their Interlayer Connections in Deep Models | Matthew Kowal · Richard P. Wildes · Kosta Derpanis | N/A | Code |
| A Recipe for Scaling up Text-to-Video Generation with Text-free Videos | Xiang Wang · Shiwei Zhang · Hangjie Yuan · Zhiwu Qing · Biao Gong · Yingya Zhang · Yujun Shen · Changxin Gao · Nong Sang | N/A | Code |
| GenHowTo: Learning to Generate Actions and State Transformations from Instructional Videos | Tomas Soucek · Dima Damen · Michael Wray · Ivan Laptev · Josef Sivic | N/A | Code |
| TULIP: Transformer for Upsampling of LiDAR Point Clouds | Bin Yang · Patrick Pfreundschuh · Roland Siegwart · Marco Hutter · Peyman Moghadam · Vaishakh Patil | N/A | Code |
| Neural Implicit Representation for Building Digital Twins of Unknown Articulated Objects | Yijia Weng · Bowen Wen · Jonathan Tremblay · Valts Blukis · Dieter Fox · Leonidas Guibas · Stan Birchfield | N/A | Code |
| Human Gaussian Splatting: Real-time Rendering of Animatable Avatars | Arthur Moreau · Jifei Song · Helisa Dhamo · Richard Shaw · Yiren Zhou · Eduardo Pérez-Pellitero | N/A | Code |
| Sculpting Holistic 3D Representation in Contrastive Language-Image-3D Pre-training | Yipeng Gao · Zeyu Wang · Wei-Shi Zheng · Cihang Xie · Yuyin Zhou | N/A | Code |
| NECA: Neural Customizable Human Avatar | Junjin Xiao · Qing Zhang · Zhan Xu · Wei-Shi Zheng | N/A | Code |
| EgoExoLearn: A Dataset for Bridging Asynchronous Ego- and Exo-centric View of Procedural Activities in Real World | Yifei Huang · Guo Chen · Jilan Xu · Mingfang Zhang · Lijin Yang · Baoqi Pei · Hongjie Zhang · Lu Dong · Yali Wang · Limin Wang · Yu Qiao | N/A | Code |
| From a Bird's Eye View to See: Joint Camera and Subject Registration without the Camera Calibration | Zekun Qian · Ruize Han · Wei Feng · Song Wang | N/A | Code |
| Self-correcting LLM-controlled Diffusion Models | Tsung-Han Wu · Long Lian · Joseph Gonzalez · Boyi Li · Trevor Darrell | N/A | Code |
| Efficient Dataset Distillation via Minimax Diffusion | Jianyang Gu · Saeed Vahidian · Vyacheslav Kungurtsev · Haonan Wang · Wei Jiang · Yang You · Yiran Chen | N/A | Code |
| DUSt3R: Geometric 3D Vision Made Easy | Shuzhe Wang · Vincent Leroy · Yohann Cabon · Boris Chidlovskii · Jerome Revaud | N/A | Code |
| Enhancing Video Super-Resolution via Implicit Resampling-based Alignment | Kai Xu · Ziwei Yu · Xin Wang · Michael Bi Mi · Angela Yao | N/A | Code |
| InceptionNeXt: When Inception Meets ConvNeXt | Weihao Yu · Pan Zhou · Shuicheng Yan · Xinchao Wang | N/A | Code |
| Inversion-Free Image Editing with Language-Guided Diffusion Models | Sihan Xu · Yidong Huang · Jiayi Pan · Ziqiao Ma · Joyce Chai | N/A | Code |
| Generalizable Face Landmarking Guided by Conditional Face Warping | Jiayi Liang · Haotian Liu · Hongteng Xu · Dixin Luo | N/A | Code |
| eTraM: Event-based Traffic Monitoring Dataset | Aayush Atul Verma · Bharatesh Chakravarthi · Arpitsinh Vaghela · Hua Wei · 'YZ' Yezhou Yang | N/A | Code |
| SC-GS: Sparse-Controlled Gaussian Splatting for Editable Dynamic Scenes | Yihua Huang · Yangtian Sun · Ziyi Yang · Xiaoyang Lyu · Yan-Pei Cao · Xiaojuan Qi | N/A | Code |
| SiTH: Single-view Textured Human Reconstruction with Image-Conditioned Diffusion | Hsuan-I Ho · Jie Song · Otmar Hilliges | N/A | Code |
| CAD: Photorealistic 3D Generation via Adversarial Distillation | Ziyu Wan · Despoina Paschalidou · Ian Huang · Hongyu Liu · Bokui Shen · Xiaoyu Xiang · Jing Liao · Leonidas Guibas | N/A | Code |
| Lift3D: Zero-Shot Lifting of Any 2D Vision Model to 3D | Mukund Varma T · Peihao Wang · Zhiwen Fan · Zhangyang Wang · Hao Su · Ravi Ramamoorthi | N/A | Code |
| SpiderMatch: 3D Shape Matching with Global Optimality and Geometric Consistency | Paul Roetzer · Florian Bernard | N/A | Code |
| Realigning Confidence with Temporal Saliency Information for Point-Level Weakly-Supervised Temporal Action Localization | Ziying Xia · Jian Cheng · Siyu Liu · Yongxiang Hu · Shiguang Wang · Zhang Yijie · Wanli Dang | N/A | Code |
| 3D Facial Expressions through Analysis-by-Neural-Synthesis | George Retsinas · Panagiotis Filntisis · Radek Danecek · Victoria Abrevaya · Anastasios Roussos · Timo Bolkart · Petros Maragos | N/A | Code |
| Selective Interpretable and Motion Consistent Privacy Attribute Obfuscation for Action Recognition | Filip Ilic · He Zhao · Thomas Pock · Richard P. Wildes | N/A | Code |
| Segment and Caption Anything | Xiaoke Huang · Jianfeng Wang · Yansong Tang · Zheng Zhang · Han Hu · Jiwen Lu · Lijuan Wang · Zicheng Liu | N/A | Code |
| ZeroNVS: Zero-Shot 360-Degree View Synthesis from a Single Image | Kyle Sargent · Zizhang Li · Tanmay Shah · Charles Herrmann · Hong-Xing Yu · Yunzhi Zhang · Eric Ryan Chan · Dmitry Lagun · Li Fei-Fei · Deqing Sun · Jiajun Wu | N/A | Code |
| A2XP: Towards Private Domain Generalization | Geunhyeok Yu · Hyoseok Hwang | N/A | Code |
| CityDreamer: Compositional Generative Model of Unbounded 3D Cities | Haozhe Xie · Zhaoxi Chen · Fangzhou Hong · Ziwei Liu | N/A | Code |
| Noisy-Correspondence Learning for Text-to-Image Person Re-identification | Yang Qin · Yingke Chen · Dezhong Peng · Xi Peng · Joey Tianyi Zhou · Peng Hu | N/A | Code |
| ParamISP: Learned Forward and Inverse ISPs using Camera Parameters | Woohyeok Kim · Geonu Kim · Junyong Lee · Seungyong Lee · Seung-Hwan Baek · Sunghyun Cho | N/A | Code |
| Diversity-aware Channel Pruning for StyleGAN Compression | Jiwoo Chung · Sangeek Hyun · Sang-Heon Shim · Jae-Pil Heo | N/A | Code |
| LEMON: Learning 3D Human-Object Interaction Relation from 2D Images | Yuhang Yang · Wei Zhai · Hongchen Luo · Yang Cao · Zheng-Jun Zha | N/A | Code |
| NeRF On-the-go: Exploiting Uncertainty for Distractor-free NeRFs in the Wild | Weining Ren · Zihan Zhu · Boyang Sun · Jiaqi Chen · Marc Pollefeys · Songyou Peng | N/A | Code |
| OST: Refining Text Knowledge with Optimal Spatio-Temporal Descriptor for General Video Recognition | Tongjia Chen · Hongshan Yu · Zhengeng Yang · Zechuan Li · Wei Sun · Chen Chen | N/A | Code |
| DYSON: Dynamic Feature Space Self-Organization for Online Task-Free Class Incremental Learning | Yuhang He · YingJie Chen · Yuhan Jin · Songlin Dong · Xing Wei · Yihong Gong | N/A | Code |
| Promptable Behaviors: Personalizing Multi-Objective Rewards from Human Preferences | Minyoung Hwang · Luca Weihs · Chanwoo Park · Kimin Lee · Aniruddha Kembhavi · Kiana Ehsani | N/A | Code |
| PanoContext-Former: Panoramic Total Scene Understanding with a Transformer | Yuan Dong · Chuan Fang · Liefeng Bo · Zilong Dong · Ping Tan | N/A | Code |
| Continuous Pose for Monocular Cameras in Neural Implicit Representation | Qi Ma · Danda Paudel · Ajad Chhatkuli · Luc Van Gool | N/A | Code |
| Learned Trajectory Embedding for Subspace Clustering | Yaroslava Lochman · Christopher Zach · Carl Olsson | N/A | Code |
| ExtDM: Distribution Extrapolation Diffusion Model for Video Prediction | Zhicheng Zhang · Junyao Hu · Wentao Cheng · Danda Paudel · Jufeng Yang | N/A | Code |
| HumanGaussian: Text-Driven 3D Human Generation with Gaussian Splatting | Xian Liu · Xiaohang Zhan · Jiaxiang Tang · Ying Shan · Gang Zeng · Dahua Lin · Xihui Liu · Ziwei Liu | N/A | Code |
| Depth Prompting for Sensor-Agnostic Depth Estimation | Jin-Hwi Park · Chanhwi Jeong · Junoh Lee · Hae-Gon Jeon | N/A | Code |
| DRESS: Instructing Large Vision-Language Models to Align and Interact with Humans via Natural Language Feedback | Yangyi Chen · Karan Sikka · Michael Cogswell · Heng Ji · Ajay Divakaran | N/A | Code |
| SHINOBI: Shape and Illumination using Neural Object Decomposition via BRDF Optimization In-the-wild | Andreas Engelhardt · Amit Raj · Mark Boss · Yunzhi Zhang · Abhishek Kar · Yuanzhen Li · Ricardo Martin-Brualla · Jonathan T. Barron · Deqing Sun · Hendrik Lensch · Varun Jampani | N/A | Code |
| Grounded Question-Answering in Long Egocentric Videos | Shangzhe Di · Weidi Xie | N/A | Code |
| MicroDiffusion: Implicit Representation-Guided Diffusion for 3D Reconstruction from Limited 2D Microscopy Projections | Mude Hui · Zihao Wei · Hongru Zhu · Fei Xia · Yuyin Zhou | N/A | Code |
| Learning Inclusion Matching for Animation Paint Bucket Colorization | Yuekun Dai · Shangchen Zhou · Blake Li · Chongyi Li · Chen Change Loy | N/A | Code |
| Visual Layout Composer: Image-Vector Dual Diffusion Model for Design Layout Generation | Mohammad Amin Shabani · Zhaowen Wang · Difan Liu · Nanxuan Zhao · Jimei Yang · Yasutaka Furukawa | N/A | Code |
| DiffMOT: A Real-time Diffusion-based Multiple Object Tracker with Non-linear Prediction | Weiyi Lv · Yuhang Huang · Ning Zhang · Ruei-Sung Lin · Mei Han · Dan Zeng | N/A | Code |
| RepViT: Revisiting Mobile CNN From ViT Perspective | Ao Wang · Hui Chen · Zijia Lin · Jungong Han · Guiguang Ding | N/A | Code |
| Simple Semantic-Aided Few-Shot Learning | Hai Zhang · Junzhe Xu · Shanlin Jiang · Zhenan He | N/A | Code |
| AETTA: Label-Free Accuracy Estimation for Test-Time Adaptation | Taeckyung Lee · Sorn Chottananurak · Taesik Gong · Sung-Ju Lee | N/A | Code |
| A Simple Recipe for Language-guided Domain Generalized Segmentation | Mohammad Fahes · Tuan-Hung Vu · Andrei Bursuc · Patrick Pérez · Raoul de Charette | N/A | Code |
| AdaShift: Learning Discriminative Self-Gated Neural Feature Activation With an Adaptive Shift Factor | Sudong Cai | N/A | Code |
| Improved Implicit Neural Representation with Fourier Reparameterized Training | Kexuan Shi · Xingyu Zhou · Shuhang Gu | N/A | Code |
| Gradient Alignment for Cross-Domain Face Anti-Spoofing | Minh Binh Le · Simon Woo | N/A | Code |
| RoMa: Robust Dense Feature Matching | Johan Edstedt · Qiyu Sun · Georg Bökman · Mårten Wadenbäck · Michael Felsberg | N/A | Code |
| Federated Online Adaptation for Deep Stereo | Matteo Poggi · Fabio Tosi | N/A | Code |
| MADTP: Multimodal Alignment-Guided Dynamic Token Pruning for Accelerating Vision-Language Transformer | Jianjian Cao · Peng Ye · Shengze Li · Chong Yu · Yansong Tang · Jiwen Lu · Tao Chen | N/A | Code |
| ReconFusion: 3D Reconstruction with Diffusion Priors | Rundi Wu · Ben Mildenhall · Philipp Henzler · Ruiqi Gao · Keunhong Park · Daniel Watson · Pratul P. Srinivasan · Dor Verbin · Jonathan T. Barron · Ben Poole · Aleksander Holynski | N/A | Code |
| ReCoRe: Regularized Contrastive Representation Learning of World Model | Rudra P.K. Poudel · Harit Pandya · Stephan Liwicki · Roberto Cipolla | N/A | Code |
| Learning Object State Changes in Videos: An Open-World Perspective | Zihui Xue · Kumar Ashutosh · Kristen Grauman | N/A | Code |
| Holistic Features are almost Sufficient for Text-to-Video Retrieval | Kaibin Tian · Ruixiang Zhao · Zijie Xin · Bangxiang Lan · Xirong Li | N/A | Code |
| Diffuse Attend and Segment: Unsupervised Zero-Shot Segmentation using Stable Diffusion | Junjiao Tian · Lavisha Aggarwal · Andrea Colaco · Zsolt Kira · Mar Gonzalez-Franco | N/A | Code |
| How to Handle Sketch-Abstraction in Sketch-Based Image Retrieval? | Subhadeep Koley · Ayan Kumar Bhunia · Aneeshan Sain · Pinaki Nath Chowdhury · Tao Xiang · Yi-Zhe Song | N/A | Code |
| You'll Never Walk Alone: A Sketch and Text Duet for Fine-Grained Image Retrieval | Subhadeep Koley · Ayan Kumar Bhunia · Aneeshan Sain · Pinaki Nath Chowdhury · Tao Xiang · Yi-Zhe Song | N/A | Code |
| It's All About Your Sketch: Democratising Sketch Control in Diffusion Models | Subhadeep Koley · Ayan Kumar Bhunia · Deeptanshu Sekhri · Aneeshan Sain · Pinaki Nath Chowdhury · Tao Xiang · Yi-Zhe Song | N/A | Code |
| Text-to-Image Diffusion Models are Great Sketch-Photo Matchmakers | Subhadeep Koley · Ayan Kumar Bhunia · Aneeshan Sain · Pinaki Nath Chowdhury · Tao Xiang · Yi-Zhe Song | N/A | Code |
| Insights from the Use of Previously Unseen Neural Architecture Search Datasets | Rob Geada · David Towers · Matthew Forshaw · Amir Atapour-Abarghouei · Stephen McGough | N/A | Code |
| General Object Foundation Model for Images and Videos at Scale | Junfeng Wu · Yi Jiang · Qihao Liu · Zehuan Yuan · Xiang Bai · Song Bai | N/A | Code |
| WonderJourney: Going from Anywhere to Everywhere | Hong-Xing Yu · Haoyi Duan · Junhwa Hur · Kyle Sargent · Michael Rubinstein · William Freeman · Forrester Cole · Deqing Sun · Noah Snavely · Jiajun Wu · Charles Herrmann | N/A | Code |
| MoDE: CLIP Data Experts via Clustering | Jiawei Ma · Po-Yao Huang · Saining Xie · Shang-Wen Li · Luke Zettlemoyer · Shih-Fu Chang · Wen-tau Yih · Hu Xu | N/A | Code |
| Flow-Guided Online Stereo Rectification for Wide Baseline Stereo | Anush Kumar · Fahim Mannan · Omid Hosseini Jafari · Shile Li · Felix Heide | N/A | Code |
| 4D Gaussian Splatting for Real-Time Dynamic Scene Rendering | Guanjun Wu · Taoran Yi · Jiemin Fang · Lingxi Xie · Xiaopeng Zhang · Wei Wei · Wenyu Liu · Qi Tian · Xinggang Wang | N/A | Code |
| Towards Large-scale 3D Representation Learning with Multi-dataset Point Prompt Training | Xiaoyang Wu · Zhuotao Tian · Xin Wen · Bohao Peng · Xihui Liu · Kaicheng Yu · Hengshuang Zhao | N/A | Code |
| Carve3D: Improving Multi-view Reconstruction Consistency for Diffusion Models with RL Finetuning | Desai Xie · Jiahao Li · Hao Tan · Xin Sun · Zhixin Shu · Yi Zhou · Sai Bi · Soren Pirk · Arie Kaufman | N/A | Code |
| Collaborative Semantic Occupancy Prediction with Hybrid Feature Fusion in Connected Automated Vehicles | Rui Song · Chenwei Liang · Hu Cao · Zhiran Yan · Walter Zimmer · Markus Gross · Andreas Festag · Alois Knoll | N/A | Code |
| CoDi: Conditional Diffusion Distillation for Higher-Fidelity and Faster Image Generation | Kangfu Mei · Mauricio Delbracio · Hossein Talebi · Zhengzhong Tu · Vishal M. Patel · Peyman Milanfar | N/A | Code |
| Towards General Robustness Verification of MaxPool-based Convolutional Neural Networks via Tightening Linear Approximation | Yuan Xiao · Shiqing Ma · Juan Zhai · Chunrong Fang · Jinyuan Jia · Zhenyu Chen | N/A | Code |
| Density-guided Translator Boosts Synthetic-to-Real Unsupervised Domain Adaptive Segmentation of 3D Point Clouds | Zhimin Yuan · Wankang Zeng · Yanfei Su · Weiquan Liu · Ming Cheng · Yulan Guo · Cheng Wang | N/A | Code |
| EMAGE: Towards Unified Holistic Co-Speech Gesture Generation via Expressive Masked Audio Gesture Modeling | Haiyang Liu · Zihao Zhu · Giorgio Becherini · Yichen Peng · Mingyang Su · You Zhou · Xuefei Zhe · Naoya Iwamoto · Bo Zheng · Michael J. Black | N/A | Code |
| Training-Free Open-Vocabulary Segmentation with Offline Diffusion-Augmented Prototype Generation | Luca Barsellotti · Roberto Amoroso · Marcella Cornia · Lorenzo Baraldi · Rita Cucchiara | N/A | Code |
| Real Acoustic Fields: An Audio-Visual Room Acoustics Dataset and Benchmark | Ziyang Chen · Israel D. Gebru · Christian Richardt · Anurag Kumar · William Laney · Andrew Owens · Alexander Richard | N/A | Code |
| Language Embedded 3D Gaussians for Open-Vocabulary Scene Understanding | Jin-Chuan Shi · Miao Wang · Haobin Duan · Shaohua Guan | N/A | Code |
| Hybrid Proposal Refiner: Revisiting DETR Series from the Faster R-CNN Perspective | Jinjing Zhao · Fangyun Wei · Chang Xu | N/A | Code |
| SinSR: Diffusion-Based Image Super-Resolution in a Single Step | Yufei Wang · Wenhan Yang · Xinyuan Chen · Yaohui Wang · Lanqing Guo · Lap-Pui Chau · Ziwei Liu · Yu Qiao · Alex C. Kot · Bihan Wen | N/A | Code |
| VBench: Comprehensive Benchmark Suite for Video Generative Models | Ziqi Huang · Yinan He · Jiashuo Yu · Fan Zhang · Chenyang Si · Yuming Jiang · Yuanhan Zhang · Tianxing Wu · Jin Qingyang · Nattapol Chanpaisit · Yaohui Wang · Xinyuan Chen · Limin Wang · Dahua Lin · Yu Qiao · Ziwei Liu | N/A | Code |
| A Study of Dropout-Induced Modality Bias on Robustness to Missing Video Frames for Audio-Visual Speech Recognition | Yusheng Dai · Hang Chen · Jun Du · Ruoyu Wang · Shihao Chen · Haotian Wang · Chin-Hui Lee | N/A | Code |
| Vlogger: Make Your Dream A Vlog | Shaobin Zhuang · Kunchang Li · Xinyuan Chen · Yaohui Wang · Ziwei Liu · Yu Qiao · Yali Wang | N/A | Code |
| EpiDiff: Enhancing Multi-View Synthesis via Localized Epipolar-Constrained Diffusion | Zehuan Huang · Hao Wen · Junting Dong · Yaohui Wang · Yangguang Li · Xinyuan Chen · Yan-Pei Cao · Ding Liang · Yu Qiao · Bo Dai · Lu Sheng | N/A | Code |
| Unsupervised Learning of Category-Level 3D Pose from Object-Centric Videos | Leonhard Sommer · Artur Jesslen · Eddy Ilg · Adam Kortylewski | N/A | Code |
| MCPNet: An Interpretable Classifier via Multi-Level Concept Prototypes | Bor Shiun Wang · Chien-Yi Wang · Wei-Chen Chiu | N/A | Code |
| Correlation-aware Coarse-to-fine MLPs for Deformable Medical Image Registration | Mingyuan Meng · Dagan Feng · Lei Bi · Jinman Kim | N/A | Code |
| Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives | Kristen Grauman · Andrew Westbury · Lorenzo Torresani · Kris Kitani · Jitendra Malik · Triantafyllos Afouras · Kumar Ashutosh · Vijay Baiyya · Siddhant Bansal · Bikram Boote · Eugene Byrne · Zachary Chavis · Joya Chen · Feng Cheng · Fu-Jen Chu · Sean Crane · Avijit Dasgupta · Jing Dong · Maria Escobar · Cristhian David Forigua Diaz · Abrham Gebreselasie · Sanjay Haresh · Jing Huang · Md Mohaiminul Islam · Suyog Jain · Rawal Khirodkar · Devansh Kukreja · Kevin Liang · Jia-Wei Liu · Sagnik Majumder · Yongsen Mao · Miguel Martin · Effrosyni Mavroudi · Tushar Nagarajan · Francesco Ragusa · Santhosh Kumar Ramakrishnan · Luigi Seminara · Arjun Somayazulu · Yale Song · Shan Su · Zihui Xue · Edward Zhang · Jinxu Zhang · Angela Castillo · Changan Chen · Fu Xinzhu · Ryosuke Furuta · Cristina González · Gupta · Jiabo Hu · Yifei Huang · Yiming Huang · Weslie Khoo · Anush Kumar · Robert Kuo · Sach Lakhavani · Miao Liu · Mi Luo · Zhengyi Luo · Brighid Meredith · Austin Miller · Oluwatumininu Oguntola · Xiaqing Pan · Penny Peng · Shraman Pramanick · Merey Ramazanova · Fiona Ryan · Wei Shan · Kiran Somasundaram · Chenan Song · Audrey Southerland · Masatoshi Tateno · Huiyu Wang · Yuchen Wang · Takuma Yagi · Mingfei Yan · Xitong Yang · Zecheng Yu · Shengxin Zha · Chen Zhao · Ziwei Zhao · Zhifan Zhu · Jeff Zhuo · Pablo ARBELAEZ · Gedas Bertasius · Dima Damen · Jakob Engel · Giovanni Maria Farinella · Antonino Furnari · Bernard Ghanem · Judy Hoffman · C.V. Jawahar · Richard Newcombe · Hyun Soo Park · James Rehg · Yoichi Sato · Manolis Savva · Jianbo Shi · Mike Zheng Shou · Michael Wray | N/A | Code |
| Generalized Predictive Model for Autonomous Driving | Jiazhi Yang · Shenyuan Gao · Yihang Qiu · Li Chen · Tianyu Li · Bo Dai · Kashyap Chitta · Penghao Wu · Jia Zeng · Ping Luo · Jun Zhang · Andreas Geiger · Yu Qiao · Hongyang Li | N/A | Code |
| RichDreamer: A Generalizable Normal-Depth Diffusion Model for Detail Richness in Text-to-3D | Lingteng Qiu · Guanying Chen · Xiaodong Gu · Qi Zuo · Mutian Xu · Yushuang Wu · Weihao Yuan · Zilong Dong · Liefeng Bo · Xiaoguang Han | N/A | Code |
| GenesisTex: Adapting Image Denoising Diffusion to Texture Space | Chenjian Gao · Boyan Jiang · Xinghui Li · YingPeng Zhang · Qian Yu | N/A | Code |
| Adaptive Random Feature Regularization on Fine-tuning Deep Neural Networks | Shin'ya Yamaguchi · Sekitoshi Kanai · Kazuki Adachi · Daiki Chijiwa | N/A | Code |
| Local-consistent Transformation Learning for Rotation-invariant Point Cloud Analysis | Yiyang Chen · Lunhao Duan · Shanshan Zhao · Changxing Ding · Dacheng Tao | N/A | Code |
| DiffusionLight: Light Probes for Free by Painting a Chrome Ball | Pakkapon Phongthawee · Worameth Chinchuthakun · Nontaphat Sinsunthithet · Varun Jampani · Amit Raj · Pramook Khungurn · Supasorn Suwajanakorn | N/A | Code |
| Describing Differences in Image Sets with Natural Language | Lisa Dunlap · Yuhui Zhang · Xiaohan Wang · Ruiqi Zhong · Trevor Darrell · Jacob Steinhardt · Joseph Gonzalez · Serena Yeung | N/A | Code |
| Object Pose Estimation via the Aggregation of Diffusion Features | Tianfu Wang · Guosheng Hu · Hongguang Wang | N/A | Code |
| Salience DETR: Enhancing Detection Transformer with Hierarchical Salience Filtering Refinement | Xiuquan Hou · Meiqin Liu · Senlin Zhang · Ping Wei · Badong Chen | N/A | Code |
| Stronger, Fewer, & Superior: Harnessing Vision Foundation Models for Domain Generalized Semantic Segmentation | Zhixiang Wei · Lin Chen · Xiaoxiao Ma · Huaian Chen · Tianle Liu · Pengyang Ling · Jinjin Zheng · Ben Wang · Yi Jin | N/A | Code |
| OpenStreetView-5M: The Many Roads to Global Visual Geolocation | Guillaume Astruc · Nicolas Dufour · Ioannis Siglidis · Constantin Aronssohn · Nacim Bouia · Stephanie Fu · Romain Loiseau · Van Nguyen Nguyen · Charles Raude · Elliot Vincent · Lintao Xu · Hongyu Zhou · Loic Landrieu | N/A | Code |
| Action Scene Graphs for Long-Form Understanding of Egocentric Videos | Ivan Rodin · Antonino Furnari · Kyle Min · Subarna Tripathi · Giovanni Maria Farinella | N/A | Code |
| Multi-Session SLAM with Differentiable Wide-Baseline Pose Optimization | Lahav Lipson · Jia Deng | N/A | Code |
| RNb-NeuS: Reflectance and Normal-based Multi-View 3D Reconstruction | Baptiste Brument · Robin Bruneau · Yvain Queau · Jean Mélou · Francois Lauze · Jean-Denis Durou · Lilian Calvet | N/A | Code |
| Hierarchical Diffusion Policy for Kinematics-Aware Multi-Task Robotic Manipulation | Xiao Ma · Sumit Patidar · Iain Haughton · Stephen James | N/A | Code |
| Tyche: Stochastic In-Context Learning for Medical Image Segmentation | Marianne Rakic · Hallee Wong · Jose Javier Gonzalez Ortiz · Beth Cimini · John Guttag · Adrian V. Dalca | N/A | Code |
| How Far Can We Compress Instant-NGP-Based NeRF? | Yihang Chen · Qianyi Wu · Mehrtash Harandi · Jianfei Cai | N/A | Code |
| CAD-SIGNet: CAD Language Inference from Point Clouds using Layer-wise Sketch Instance Guided Attention | Mohammad Sadil Khan · Elona Dupont · Sk Aziz Ali · Kseniya Cherenkova · Anis Kacem · Djamila Aouada | N/A | Code |
| LAA-Net: Localized Artifact Attention Network for Quality-Agnostic and Generalizable Deepfake Detection | Dat Nguyen · Nesryne Mejri · Inder Pal Singh · Polina Kuleshova · Marcella Astrid · Anis Kacem · Enjie Ghorbel · Djamila Aouada | N/A | Code |
| SHiNe: Semantic Hierarchy Nexus for Open-vocabulary Object Detection | Mingxuan Liu · Tyler Hayes · Elisa Ricci · Gabriela Csurka · Riccardo Volpi | N/A | Code |
| IPoD: Implicit Field Learning with Point Diffusion for Generalizable 3D Object Reconstruction from Single RGB-D Images | Yushuang Wu · Luyue Shi · Junhao Cai · Weihao Yuan · Lingteng Qiu · Zilong Dong · Liefeng Bo · Shuguang Cui · Xiaoguang Han | N/A | Code |
| Normalizing Flows on the Product Space of SO(3) Manifolds for Probabilistic Human Pose Modeling | Olaf Dünkel · Tim Salzmann · Florian Pfaff | N/A | Code |
| The Devil is in the Details: StyleFeatureEditor for Detail-Rich StyleGAN Inversion and High Quality Image Editing | Denis Bobkov · Vadim Titov · Aibek Alanov · Dmitry Vetrov | N/A | Code |
| SVGDreamer: Text Guided SVG Generation with Diffusion Model | XiMing Xing · Chuang Wang · Haitao Zhou · Jing Zhang · Dong Xu · Qian Yu | N/A | Code |
| Learning to Remove Wrinkled Transparent Film with Polarized Prior | Jiaqi Tang · Ruizheng Wu · Xiaogang Xu · Sixing Hu · Ying-Cong Chen | N/A | Code |
| GeneAvatar: Generic Expression-Aware Volumetric Head Avatar Editing from a Single Image | Chong Bao · Yinda Zhang · Yuan Li · Xiyu Zhang · Bangbang Yang · Hujun Bao · Marc Pollefeys · Guofeng Zhang · Zhaopeng Cui | N/A | Code |
| Joint-Task Regularization for Partially Labeled Multi-Task Learning | Kento Nishi · Junsik Kim · Wanhua Li · Hanspeter Pfister | N/A | Code |
| Why Not Use Your Textbook? Knowledge-Enhanced Procedure Planning of Instructional Videos | Kumaranage Ravindu Nagasinghe · Honglu Zhou · Malitha Gunawardhana · Martin Renqiang Min · Daniel Harari · Muhammad Haris Khan | N/A | Code |
| SI-MIL: Taming Deep MIL for Self-Interpretability in Gigapixel Histopathology | Saarthak Kapse · Pushpak Pati · Srijan Das · Jingwei Zhang · Chao Chen · Maria Vakalopoulou · Joel Saltz · Dimitris Samaras · Rajarsi Gupta · Prateek Prasanna | N/A | Code |
| Learned Representation-Guided Diffusion Models for Large-Image Generation | Alexandros Graikos · Srikar Yellapragada · Minh-Quan Le · Saarthak Kapse · Prateek Prasanna · Joel Saltz · Dimitris Samaras | N/A | Code |
| MRFP: Learning Generalizable Semantic Segmentation from Sim-2-Real with Multi-Resolution Feature Perturbation | Sumanth Udupa · Prajwal Gurunath · Aniruddh Sikdar · Suresh Sundaram | N/A | Code |
| HiKER-SGG: Hierarchical Knowledge Enhanced Robust Scene Graph Generation | Ce Zhang · Simon Stepputtis · Joseph Campbell · Katia Sycara · Yaqi Xie | N/A | Code |
ECCV 2024
| Title | Author | PDF_Link | Code_URL |
|---|---|---|---|
| Is Retain Set All You Need in Machine Unlearning? Restoring Performance of Unlearned Models with Out-Of-Distribution Images | Jacopo Bonato, Marco Cotogni, Luigi Sabetta | N/A | |
| Octopus: Embodied Vision-Language Programmer from Environmental Feedback | Jingkang Yang, Yuhao Dong, Shuai Liu, Bo Li, Ziyue Wang, ChenCheng Jiang, Haoran Tan, Jiamu Kang, Yuanhan Zhang, Kaiyang Zhou, Ziwei Liu* | N/A | |
| FunQA: Towards Surprising Video Comprehension | Binzhu Xie, Sicheng Zhang, Zitang Zhou, Bo Li, Yuanhan Zhang, Jack Hessel, Jingkang Yang, Ziwei Liu* | N/A | |
| 4D Contrastive Superflows are Dense 3D Representation Learners | Xiang Xu, Lingdong Kong, Hui Shuai, Wenwei Zhang, Liang Pan, Kai Chen, Ziwei Liu, Qingshan Liu | N/A | |
| ItTakesTwo: Leveraging Peer Representations for Semi-supervised LiDAR Semantic Segmentation | Yuyuan Liu*, Yuanhong Chen, Hu Wang, Vasileios Belagiannis, Ian Reid, Gustavo Carneiro | N/A | |
| Ponymation: Learning Articulated 3D Animal Motions from Unlabeled Online Videos | Keqiang Sun, Dor Litvak, Yunzhi Zhang, Hongsheng Li, Jiajun Wu, Shangzhe Wu | N/A | |
| Robust Fitting on a Gate Quantum Computer | Frances F Yang*, Michele Sasdelli, Tat-Jun Chin | N/A | |
| H-V2X: A Large Scale Highway Dataset for BEV Perception | Chang Liu*, Mingxu Zhu, Cong Ma | N/A | |
| Learning Camouflaged Object Detection from Noisy Pseudo Label | Jin Zhang, Ruiheng Zhang, Yanjiao Shi, Zhe Cao, Nian Liu, Fahad Shahbaz Khan | N/A | |
| Weakly Supervised 3D Object Detection via Multi-Level Visual Guidance | Kuan-Chih Huang*, Yi-Hsuan Tsai, Ming-Hsuan Yang | N/A | |
| Deblur e-NeRF: NeRF from Motion-Blurred Events under High-speed or Low-light Conditions | Weng Fei Low*, Gim Hee Lee | N/A | |
| CLR-GAN: Improving GANs Stability and Quality via Consistent Latent Representation and Reconstruction | Shengke Sun, Ziqian Luan, Zhanshan Zhao, Shijie Luo, Shuzhen Han | N/A | |
| Learn from the Learnt: Source-Free Active Domain Adaptation via Contrastive Sampling and Visual Persistence | Mengyao Lyu, Tianxiang Hao, Xinhao Xu, Hui Chen, Zijia Lin, Jungong Han, Guiguang Ding | N/A | |
| PromptIQA: Boosting the Performance and Generalization for No-Reference Image Quality Assessment via Prompts | Zewen Chen, Haina Qin, Juan Wang, Chunfeng Yuan, Bing Li*, Weiming Hu, Leon Wang | N/A | |
| Motion Mamba: Efficient and Long Sequence Motion Generation | Zeyu Zhang, Akide Liu, Ian Reid, Richard Hartley, Bohan Zhuang, Hao Tang* | N/A | |
| Radiative Gaussian Splatting for Efficient X-ray Novel View Synthesis | Yuanhao Cai*, Yixun Liang, Jiahao Wang, Angtian Wang, Yulun Zhang, Xiaokang Yang, Zongwei Zhou, Alan Yuille | N/A | |
| Tracking Meets LoRA: Faster Training, Larger Model, Stronger Performance | Liting Lin, Heng Fan, Zhipeng Zhang, Yaowei Wang, Yong Xu, Haibin Ling | N/A | |
| A Direct Approach to Viewing Graph Solvability | Federica Arrigoni*, Andrea Fusiello, Tomas Pajdla | N/A | |
| CoR-GS: Sparse-View 3D Gaussian Splatting via Co-Regularization | Jiawei Zhang, Jiahe Li, Xiaohan Yu, Lei Huang, Lin Gu, Jin Zheng, Xiao Bai | N/A | |
| SeFlow: A Self-Supervised Scene Flow Method in Autonomous Driving | Qingwen Zhang*, Yi Yang, Peizheng Li, Olov Andersson, Patric Jensfelt | N/A | |
| ZeST: Zero-Shot Material Transfer from a Single Image | Ta-Ying Cheng, Prafull Sharma, Andrew Markham, Niki Trigoni, Varun Jampani* | N/A | |
| 3D Congealing: 3D-Aware Image Alignment in the Wild | Yunzhi Zhang*, Zizhang Li, Amit Raj, Andreas Engelhardt, Yuanzhen Li, Tingbo Hou, Jiajun Wu, Varun Jampani | N/A | |
| SMooDi: Stylized Motion Diffusion Model | Lei Zhong, Yiming Xie, Varun Jampani, Deqing Sun, Huaizu Jiang* | N/A | |
| ZipLoRA: Any Subject in Any Style by Effectively Merging LoRAs | Viraj Shah, Nataniel Ruiz, Forrester Cole, Erika Lu, Svetlana Lazebnik, Yuanzhen Li, Varun Jampani* | N/A | |
| SV3D: Novel Multi-view Synthesis and 3D Generation from a Single Image using Latent Video Diffusion | Vikram Voleti, Chun-Han Yao, Mark Boss, Adam Letts, David Pankratz, Dmitrii Tochilkin, Christian Laforte, Robin Rombach, Varun Jampani | N/A | |
| WordRobe: Text-Guided Generation of Textured 3D Garments | Astitva Srivastava*, Pranav Manu, Amit Raj, Varun Jampani, Avinash Sharma | N/A | |
| Learning to Generate Conditional Tri-plane for 3D-aware Expression Controllable Portrait Animation | Taekyung Ki, Dongchan Min, Gyeongsu Chae | N/A | |
| SimPB: A Single Model for 2D and 3D Object Detection from Multiple Cameras | Yingqi Tang, Zhaotie Meng, Guoliang Chen, Erkang Cheng* | N/A | |
| EMDM: Efficient Motion Diffusion Model for Fast, High-Quality Human Motion Generation | Wenyang Zhou, Zhiyang Dou*, Zeyu Cao, Zhouyingcheng Liao, Jingbo Wang, Wenjia Wang, Yuan Liu, Taku Komura, Wenping Wang, Lingjie Liu | N/A | |
| Editable Image Elements for Controllable Synthesis | Jiteng Mu, Michaël Gharbi, Richard Zhang, Eli Shechtman, Nuno Vasconcelos, Xiaolong Wang, Taesung Park | N/A | |
| Improving 2D Feature Representations by 3D-Aware Fine-Tuning | Yuanwen Yue*, Anurag Das, Francis Engelmann, Siyu Tang, Jan Eric Lenssen | N/A | |
| Self-supervised Feature Adaptation for 3D Industrial Anomaly Detection | Yuanpeng Tu, Boshen Zhang, Liang Liu, Yuxi Li, Jiangning Zhang, Yabiao Wang, Chengjie Wang, Cairong Zhao | N/A | |
| PCF-Lift: Panoptic Lifting by Probabilistic Contrastive Fusion | Runsong Zhu, Shi Qiu, Qianyi Wu, Ka-Hei Hui, Pheng-Ann Heng, Chi-Wing Fu | N/A | |
| SemGrasp: Semantic Grasp Generation via Language Aligned Discretization | Kailin Li, Jingbo Wang, Lixin Yang, Cewu Lu, Bo Dai | N/A | |
| MANIKIN: Biomechanically Accurate Neural Inverse Kinematics for Human Motion Estimation | Jiaxi Jiang*, Paul Streli, Xuejing Luo, Christoph Gebhardt, Christian Holz | N/A | |
| Simple Unsupervised Knowledge Distillation With Space Similarity | Aditya Singh*, Haohan Wang | N/A | |
| DragAPart: Learning a Part-Level Motion Prior for Articulated Objects | Ruining Li*, Chuanxia Zheng, Christian Rupprecht, Andrea Vedaldi | N/A | |
| Diffusion Bridges for 3D Point Cloud Denoising | Mathias Vogel Hüni, Keisuke Tateno, Marc Pollefeys, Federico Tombari, Marie-Julie Rakotosaona, Francis Engelmann* | N/A | |
| Optimizing Illuminant Estimation in Dual-Exposure HDR Imaging | Mahmoud Afifi*, Zhenhua Hu, Liang Liang | N/A | |
| BAM-DETR: Boundary-Aligned Moment Detection Transformer for Temporal Sentence Grounding in Videos | Pilhyeon Lee*, Hyeran Byun | N/A | |
| MarineInst: A Foundation Model for Marine Image Analysis with Instance Visual Description | Ziqiang Zheng*, Yiwei Chen, Huimin Zeng, Tuan-Anh Vu, Binh-Son Hua, Sai-Kit Yeung | N/A | |
| Superpixel-informed Implicit Neural Representation for Multi-Dimensional Data | Jia-Yi Li, Xi-Le Zhao*, Jian-Li Wang, Chao Wang, Min Wang | N/A | |
| EgoPoser: Robust Real-Time Egocentric Pose Estimation from Sparse and Intermittent Observations Everywhere | Jiaxi Jiang*, Paul Streli, Manuel Meier, Christian Holz | N/A | |
| Physics-Free Spectrally Multiplexed Photometric Stereo under Unknown Spectral Composition | Satoshi Ikehata*, Yuta Asano | N/A | |
| SplatFields: Neural Gaussian Splats for Sparse 3D and 4D Reconstruction | Marko Mihajlovic*, Sergey Prokudin, Siyu Tang, Robert Maier, Federica Bogo, Tony Tung, Edmond Boyer | N/A | |
| VFusion3D: Learning Scalable 3D Generative Models from Video Diffusion Models | Junlin Han*, Filippos Kokkinos, Philip Torr | N/A | |
| Alignist: CAD-Informed Orientation Distribution Estimation by Fusing Shape and Correspondences | Shishir Reddy Vutukur*, Junwen Huang, Rasmus Laurvig Haugaard, Benjamin Busam, Tolga Birdal | N/A | |
| Meta-Prompting for Automating Zero-shot Visual Recognition with LLMs | Muhammad Jehanzeb Mirza*, Leonid Karlinsky, Wei Lin, Sivan Doveh, Jakub Micorek, Mateusz Kozinski, Hilde Kuehne, Horst Possegger | N/A | |
| Physics-Based Interaction with 3D Objects via Video Generation | Tianyuan Zhang*, Hong-Xing Yu, Rundi Wu, Brandon Y Feng, Changxi Zheng, Noah Snavely, Jiajun Wu, William T. Freeman | N/A | |
| Reconstruction and Simulation of Elastic Objects with Spring-Mass 3D Gaussians | Licheng Zhong, Hong-Xing Yu, Jiajun Wu, Yunzhu Li* | N/A | |
| Deep Patch Visual SLAM | Lahav Lipson*, Zachary Teed, Jia Deng | N/A | |
| Surface Reconstruction for 3D Gaussian Splatting via Local Structural Hints | Qianyi Wu*, Jianmin Zheng, Jianfei Cai | N/A | |
| HeadGaS: Real-Time Animatable Head Avatars via 3D Gaussian Splatting | Helisa Dhamo, Yinyu Nie, Arthur Moreau, Jifei Song, Richard Shaw, Yiren Zhou, Eduardo Pérez-Pellitero | N/A | |
| LayeredFlow: A Real-World Benchmark for Non-Lambertian Multi-Layer Optical Flow | Hongyu Wen*, Erich Liang, Jia Deng | N/A | |
| Learning 3D Geometry and Feature Consistent Gaussian Splatting for Object Removal | Yuxin Wang, Qianyi Wu, Guofeng Zhang, Dan Xu* | N/A | |
| Motion-prior Contrast Maximization for Dense Continuous-Time Motion Estimation | Friedhelm Hamann*, Ziyun Wang, Ioannis Asmanis, Kenneth Chaney, Guillermo Gallego, Kostas Daniilidis | N/A | |
| Efficient Few-Shot Action Recognition via Multi-Level Post-Reasoning | Cong Wu, Xiao-Jun Wu*, Linze Li, Tianyang Xu, Zhenhua Feng, Josef Kittler | N/A | |
| Text2Place: Affordance-aware Text Guided Human Placement | Rishubh Parihar*, Harsh Gupta, Sachidanand VS, Venkatesh Babu Radhakrishnan | N/A | |
| OGNI-DC: Robust Depth Completion with Optimization-Guided Neural Iterations | Yiming Zuo*, Jia Deng | N/A | |
| Zero-Shot Multi-Object Scene Completion | Shun Iwase*, Katherine Liu, Vitor Guizilini, Adrien Gaidon, Kris Kitani, Rareș A Ambruș, Sergey Zakharov | N/A | |
| Beta-Tuned Timestep Diffusion Model | Tianyi Zheng, Peng-Tao Jiang, Ben Wan, Hao Zhang, Jinwei Chen, Jia Wang, Bo Li* | N/A | |
| POA: Pre-training Once for Models of All Sizes | Yingying Zhang, Xin Guo, Jiangwei Lao, Lei Yu, Lixiang Ru, Jian Wang, Guo Ye, Huimei He, Jingdong Chen, Ming Yang | N/A | |
| Taming Latent Diffusion Model for Neural Radiance Field Inpainting | Chieh Hubert Lin*, Changil Kim, Jia-Bin Huang, Qinbo Li, Chih-Yao Ma, Johannes Kopf, Ming-Hsuan Yang, Hung-Yu Tseng | N/A | |
| MapDistill: Boosting Efficient Camera-based HD Map Construction via Camera-LiDAR Fusion Model Distillation | Xiaoshuai Hao*, Ruikai Li, Hui Zhang, Rong Yin, Dingzhe Li, Sangil Jung, Seung-In Park, ByungIn Yoo, Haimei Zhao, Jing Zhang | N/A | |
| ByteEdit: Boost, Comply and Accelerate Generative Image Editing | Yuxi Ren, Jie Wu*, Yanzuo Lu, Huafeng Kuang, Xin Xia, Xionghui Wang, Qianqian Wang, Yixing Zhu, Pan Xie, Shiyin Wang, Xuefeng Xiao, Yitong Wang, Min Zheng, Lean Fu | N/A | |
| ProDepth: Boosting Self-Supervised Multi-Frame Monocular Depth with Probabilistic Fusion | Sungmin Woo, Wonjoon Lee, Woo Jin Kim, Dogyoon Lee, Sangyoun Lee | N/A | |
| High-Resolution and Few-shot View Synthesis from Asymmetric Dual-lens Inputs | Ruikang Xu, Mingde Yao, Yue Li, Yueyi Zhang, Zhiwei Xiong* | N/A | |
| Accelerating Image Super-Resolution Networks with Pixel-Level Classification | Jinho Jeong, Jinwoo Kim, Younghyun Jo, Seon Joo Kim* | N/A | |
| LASS3D: Language-Assisted Semi-Supervised 3D Semantic Segmentation with Progressive Unreliable Data Exploitation | Jianan Li, Qiulei Dong | N/A | |
| Contourlet Residual for Prompt Learning Enhanced Infrared Image Super-Resolution | Xingyuan Li, Jinyuan Liu*, Zhixin Chen, Yang Zou, Long Ma, Xin Fan, Risheng Liu | N/A | |
| Click-Gaussian: Interactive Segmentation to Any 3D Gaussians | Seokhun Choi, Hyeonseop Song, Jaechul Kim, Taehyeong Kim, Hoseok Do | N/A | |
| Random Walk on Pixel Manifolds for Anomaly Segmentation of Complex Driving Scenes | Zelong Zeng*, Kaname Tomite | N/A | |
| DySeT: a Dynamic Masked Self-distillation Approach for Robust Trajectory Prediction | Mozghan Pourkeshavarz*, Arielle Zhang, Amir Rasouli | N/A | |
| Track Everything Everywhere Fast and Robustly | Yunzhou Song, Jiahui Lei*, Ziyun Wang, Lingjie Liu, Kostas Daniilidis | N/A | |
| Towards Open-ended Visual Quality Comparison | Haoning Wu, Hanwei Zhu, Zicheng Zhang, Erli Zhang, Chaofeng Chen, Liang Liao, Chunyi Li, Annan Wang, Wenxiu Sun, Qiong Yan, Xiaohong Liu, Guangtao Zhai, Shiqi Wang, Weisi Lin* | N/A | |
| FreeInit: Bridging Initialization Gap in Video Diffusion Models | Tianxing Wu*, Chenyang Si, Yuming Jiang, Ziqi Huang, Ziwei Liu | N/A | |
| DenseNets Reloaded: Paradigm Shift Beyond ResNets and ViTs | DongHyun Kim, Byeongho Heo, Dongyoon Han* | N/A | |
| Eliminating Feature Ambiguity for Few-Shot Segmentation | Qianxiong Xu*, Guosheng Lin, Chen Change Loy, Cheng Long, Ziyue Li, Rui Zhao | N/A | |
| Soft Prompt Generation for Domain Generalization | Shuanghao Bai, Yuedi Zhang, Wanqi Zhou, Zhirong Luan, Badong Chen | N/A | |
| Shedding More Light on Robust Classifiers under the lens of Energy-based Models | Mujtaba Hussain Mirza, Maria Rosaria Briglia, Senad Beadini, Iacopo Masi | N/A | |
| LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation | Jiaxiang Tang*, Zhaoxi Chen, Xiaokang Chen, Tengfei Wang, Gang Zeng, Ziwei Liu | N/A | |
| Mahalanobis Distance-based Multi-view Optimal Transport for Multi-view Crowd Localization | Qi Zhang, Kaiyi Zhang, Antoni B. Chan, Hui Huang* | N/A | |
| RAW-Adapter: Adapting Pretrained Visual Model to Camera RAW Images | Ziteng Cui*, Tatsuya Harada | N/A | |
| SLEDGE: Synthesizing Driving Environments with Generative Models and Rule-Based Traffic | Kashyap Chitta*, Daniel Dauner, Andreas Geiger | N/A | |
| AFreeCA: Annotation-Free Counting for All | Adriano D'Alessandro*, Ali Mahdavi-Amiri, Ghassan Hamarneh | N/A | |
| Adversarially Robust Distillation by Reducing the Student-Teacher Variance Gap | Junhao Dong, Piotr Koniusz, Junxi Chen, Yew-Soon Ong | N/A | |
| LN3Diff: Scalable Latent Neural Fields Diffusion for Speedy 3D Generation | Yushi Lan, Fangzhou Hong, Shuai Yang, Shangchen Zhou, Xuyi Meng, Bo Dai, Xingang Pan, Chen Change Loy* | N/A | |
| Hierarchical Temporal Context Learning for Camera-based Semantic Scene Completion | Bohan Li*, Jiajun Deng, Wenyao Zhang, Zhujin Liang, Dalong Du, Xin Jin, Wenjun Zeng | N/A | |
| Equi-GSPR: Equivariant SE(3) Graph Network Model for Sparse Point Cloud Registration | Xueyang Kang, Zhaoliang Luan, Kourosh Khoshelham, Bing Wang | N/A | |
| GTP-4o: Modality-prompted Heterogeneous Graph Learning for Omni-modal Biomedical Representation | Chenxin Li*, Xinyu Liu, Cheng Wang, Yifan Liu, Weihao Yu, Jing Shao, Yixuan Yuan | N/A | |
| PromptCCD: Learning Gaussian Mixture Prompt Pool for Continual Category Discovery | Fernando Julio Cendra, Bingchen Zhao, Kai Han* | N/A | |
| Sapiens: Foundation for Human Vision Models | Rawal Khirodkar*, Timur Bagautdinov, Julieta Martinez, Zhaoen Su, Austin T James, Peter Selednik, Stuart Anderson, Shunsuke Saito | N/A | |
| Linearly Controllable GAN: Unsupervised Feature Categorization and Decomposition for Image Generation and Manipulation | Sehyung Lee*, Mijung Kim, Yeongnam Chae, Bjorn Stenger | N/A | |
| Generating Human Interaction Motions in Scenes with Text Control | Hongwei Yi, Justus Thies, Michael J. Black, Xue Bin Peng, Davis Rempe | N/A | |
| NOVUM: Neural Object Volumes for Robust Object Classification | Artur Jesslen*, Guofeng Zhang, Angtian Wang, Wufei Ma, Alan Yuille, Adam Kortylewski | N/A | |
| Align before Collaborate: Mitigating Feature Misalignment for Robust Multi-Agent Perception | Dingkang Yang, Ke Li, Dongling Xiao, Zedian Shao, Peng Sun, Liang Song* | N/A | |
| HIMO: A New Benchmark for Full-Body Human Interacting with Multiple Objects | Xintao Lv, Liang Xu, Yichao Yan*, Xin Jin, Congsheng Xu, Wu Shuwen, Yifan Liu, Lincheng Li, Mengxiao Bi, Wenjun Zeng, Xiaokang Yang | N/A | |
| SAIR: Learning Semantic-aware Implicit Representation | Canyu Zhang, Xiaoguang Li, Qing Guo, Song Wang | N/A | |
| ColorMNet: A Memory-based Deep Spatial-Temporal Feature Propagation Network for Video Colorization | Yixin Yang, Jiangxin Dong, Jinhui Tang, Jinshan Pan* | N/A | |
| UNIC: Universal Classification Models via Multi-teacher Distillation | Yannis Kalantidis, Diane Larlus, Mert Bulent Sariyildiz*, Philippe Weinzaepfel, Thomas Lucas | N/A | |
| Instance-dependent Noisy-label Learning with Graphical Model Based Noise-rate Estimation | Arpit Garg*, Cuong Cao Nguyen, Rafael Felix, Thanh-Toan Do, Gustavo Carneiro | N/A | |
| Eliminating Warping Shakes for Unsupervised Online Video Stitching | Lang Nie, Chunyu Lin*, Kang Liao, Yun Zhang, Shuaicheng Liu, Rui Ai, Yao Zhao | N/A | |
| Vary: Scaling up the Vision Vocabulary for Large Vision-Language Models | Haoran Wei*, Lingyu Kong, Jinyue Chen, Liang Zhao, Zheng Ge, Jinrong Yang, Jianjian Sun, Chunrui Han, Xiangyu Zhang | N/A | |
| Merlin: Empowering Multimodal LLMs with Foresight Minds | En Yu, Liang Zhao, Yana Wei, Jinrong Yang, Dongming Wu, Lingyu Kong, Haoran Wei, Tiancai Wang, Zheng Ge, Xiangyu Zhang, Wenbing Tao* | N/A | |
| ViC-MAE: Self-Supervised Representation Learning from Images and Video with Contrastive Masked Autoencoders | Jefferson Hernandez*, Ruben Villegas, Vicente Ordonez | N/A | |
| E.T. the Exceptional Trajectory: Text-to-camera-trajectory generation with character awareness | Robin Courant*, Nicolas Dufour, Xi Wang, Marc Christie, Vicky Kalogeiton | N/A | |
| OphNet: A Large-Scale Video Benchmark for Ophthalmic Surgical Workflow Understanding | Ming Hu, Peng Xia, Lin Wang, Siyuan Yan, Feilong Tang, Zhongxing Xu, Yimin Luo, Kaimin Song, Jurgen Leitner, Xuelian Cheng, Jun Cheng, Chi Liu, Kaijing Zhou, Zongyuan Ge* | N/A | |
| SignAvatars: A Large-scale 3D Sign Language Holistic Motion Dataset and Benchmark | Zhengdi Yu, Shaoli Huang*, Yongkang Cheng, Tolga Birdal | N/A | |
| AttnZero: Efficient Attention Discovery for Vision Transformers | Lujun Li, Zimian Wei, Peijie Dong, Wenhan Luo, Wei Xue, Qifeng Liu, Yike Guo* | N/A | |
| Auto-GAS: Automated Proxy Discovery for Training-free Generative Architecture Search | Lujun Li, Haosen Sun, Shiwen Li, Peijie Dong, Wenhan Luo, Wei Xue, Qifeng Liu, Yike Guo | N/A | |
| Auto-DAS: Automated Proxy Discovery for Training-free Distillation-aware Architecture Search | Haosen Sun, Lujun Li*, Peijie Dong, Zimian Wei, Shitong Shao | N/A | |
| UniDream: Unifying Diffusion Priors for Relightable Text-to-3D Generation | Zexiang Liu, Yangguang Li, Youtian Lin, Xin Yu, Sida Peng, Yan-Pei Cao, Xiaojuan Qi, Xiaoshui Huang, Ding Liang*, Wanli Ouyang | N/A | |
| TimeCraft: Navigate Weakly-Supervised Temporal Grounded Video Question Answering via Bi-directional Reasoning | Huabin Liu, Xiao Ma, Cheng Zhong, Yang Zhang, Weiyao Lin* | N/A | |
| Spectral Subsurface Scattering for Material Classification | Haejoon Lee*, Aswin Sankaranarayanan | N/A | |
| nuCraft: Crafting High Resolution 3D Semantic Occupancy for Unified 3D Scene Understanding | Benjin Zhu, Zhe Wang, Hongsheng Li | N/A | |
| Dynamic Neural Radiance Field From Defocused Monocular Video | Xianrui Luo, Huiqiang Sun, Juewen Peng, Zhiguo Cao* | N/A | |
| PiTe: Pixel-Temporal Alignment for Large Video-Language Model | Yang Liu*, Pengxiang Ding, Siteng Huang, Min Zhang, Han Zhao, Donglin Wang | N/A | |
| CarFormer: Self-Driving with Learned Object-Centric Representations | Shadi Hamdan*, Fatma Guney | N/A | |
| FreeDiff: Progressive Frequency Truncation for Image Editing with Diffusion Models | Wei Wu, Qingnan Fan, Shuai Qin, Hong Gu, Ruoyu Zhao, Antoni Chan | N/A | |
| Plain-Det: A Plain Multi-Dataset Object Detector | Cheng Shi, Yuchen Zhu, Sibei Yang* | N/A | |
| Alternate Diverse Teaching for Semi-supervised Medical Image Segmentation | Zhen Zhao, Zicheng Wang, Dian Yu, Longyue Wang, Yixuan Yuan, Luping Zhou | N/A | |
| Cs2K: Class-specific and Class-shared Knowledge Guidance for Incremental Semantic Segmentation | Wei Cong*, Yang Cong, Yuyang Liu, Gan Sun | N/A | |
| Synchronous Diffusion for Unsupervised Smooth Non-Rigid 3D Shape Matching | Dongliang Cao*, Zorah Laehner, Florian Bernard | N/A | |
| Text-Guided Video Masked Autoencoder | David Fan*, Jue Wang, Shuai Liao, Zhikang Zhang, Vimal Bhat, Xinyu Li | N/A | |
| Diffusion Models for Open-Vocabulary Segmentation | Laurynas Karazija*, Iro Laina, Andrea Vedaldi, Christian Rupprecht | N/A | |
| Textual-Visual Logic Challenge: Understanding and Reasoning in Text-to-Image Generation | Peixi Xiong*, Michael A Kozuch, Nilesh Jain | N/A | |
| EvSign: Sign Language Recognition and Translation with Streaming Events | Pengyu Zhang*, Hao Yin, Zeren Wang, Wenyue Chen, Sheng Ming Li, Dong Wang, Huchuan Lu, Xu Jia | N/A | |
| QUAR-VLA: Vision-Language-Action Model for Quadruped Robots | Pengxiang Ding, Han Zhao, Wenjie Zhang, Wenxuan Song, Min Zhang, Siteng Huang, Ningxi Yang, Donglin Wang* | N/A | |
| Zero-shot Object Counting with Good Exemplars | Huilin Zhu, Jingling Yuan, Zhengwei Yang, Yu Guo, Xian Zhong, Zheng Wang, Shengfeng He | N/A | |
| TextDiffuser-2: Unleashing the Power of Language Models for Text Rendering | Jingye Chen*, Yupan Huang, Tengchao Lv, Lei Cui, Qifeng Chen, Furu Wei | N/A | |
| SFPNet: Sparse Focal Point Network for Semantic Segmentation on General LiDAR Point Clouds | Yanbo Wang, Wentao Zhao, Cao Chuan, Tianchen Deng, Jingchuan Wang, Weidong Chen | N/A | |
| PartSTAD: 2D-to-3D Part Segmentation Task Adaptation | Hyunjin Kim, Minhyuk Sung* | N/A | |
| FutureDepth: Learning to Predict the Future Improves Video Depth Estimation | Rajeev Yasarla*, Manish Kumar Singh, Hong Cai, Yunxiao Shi, Jisoo Jeong, Yinhao Zhu, Shizhong Han, Risheek Garrepalli, Fatih Porikli | N/A | |
| LLM as Copilot for Coarse-grained Vision-and-Language Navigation | Yanyuan Qiao*, Qianyi Liu, Jiajun Liu, Jing Liu, Qi Wu | N/A | |
| Raindrop Clarity: A Dual-Focused Dataset for Day and Night Raindrop Removal | Yeying Jin, Xin Li, Jiadong Wang, Yan Zhan, Malu Zhang | N/A | |
| Unsupervised Moving Object Segmentation with Atmospheric Turbulence | Dehao Qin*, Ripon k Saha, Woojeh Chung, Suren Jayasuriya, Jinwei Ye, Nianyi Li | N/A | |
| AccDiffusion: An Accurate Method for Higher-Resolution Image Generation | Zhihang Lin, Mingbao Lin, Meng Zhao, Rongrong Ji* | N/A | |
| Uncertainty-Driven Spectral Compressive Imaging with Spatial-Frequency Transformer | Lintao Peng, Siyu Xie, Liheng Bian* | N/A | |
| CaesarNeRF: Calibrated Semantic Representation for Few-Shot Generalizable Neural Rendering | Haidong Zhu, Tianyu Ding*, Tianyi Chen, Ilya Zharkov, Ram Nevatia, Luming Liang | N/A | |
| MapTracker: Tracking with Strided Memory Fusion for Consistent Vector HD Mapping | Jiacheng Chen, Yuefan Wu, Jiaqi Tan, Hang Ma, Yasutaka Furukawa | N/A | |
| Image Demoireing in RAW and sRGB Domains | Shuning Xu, Binbin Song, Xiangyu Chen, Xina Liu, Jiantao Zhou* | N/A | |
| LiDAR-Event Stereo Fusion with Hallucinations | Luca Bartolomei, Matteo Poggi, Andrea Conti, Stefano Mattoccia | N/A | |
| X-Former: Unifying Contrastive and Reconstruction Learning for MLLMs | Sirnam Swetha*, Jinyu Yang, Tal Neiman, Mamshad Nayeem Rizve, Son Tran, Benjamin Yao, Trishul A Chilimbi, Mubarak Shah | N/A | |
| Learning Anomalies with Normality Prior for Unsupervised Video Anomaly Detection | Haoyue Shi, Le Wang*, Sanping Zhou, Gang Hua, Wei Tang | N/A | |
| Revisiting Supervision for Continual Representation Learning | Daniel Marczak, Sebastian Cygert, Tomasz Trzcinski, Bartlomiej Twardowski | N/A | |
| FLAT: Flux-aware Imperceptible Adversarial Attacks on 3D Point Clouds | Keke Tang, Lujie Huang, Weilong Peng*, Daizong Liu, Xiaofei Wang, Yang Ma, Ligang Liu, Zhihong Tian | N/A | |
| MMBENCH: Is Your Multi-Modal Model an All-around Player? | Yuan Liu, Haodong Duan, Yuanhan Zhang, Bo Li, Songyang Zhang, Wangbo Zhao, Yike Yuan, Jiaqi Wang, Conghui He, Ziwei Liu, Kai Chen, Dahua Lin | N/A | |
| Implicit Filtering for Learning Neural Signed Distance Functions from 3D Point Clouds | Shengtao Li*, Ge Gao, Yudong Liu, Ming Gu, Yu-Shen Liu | N/A | |
| Unsupervised Exposure Correction | Ruodai Cui*, Li Niu, Guosheng Hu | N/A | |
| Anytime Continual Learning for Open Vocabulary Classification | Zhen Zhu, Yiming Gong, Derek Hoiem | N/A | |
| External Knowledge Enhanced 3D Scene Generation from Sketch | Zijie Wu, Mingtao Feng*, Yaonan Wang, He Xie, Weisheng Dong, Bo Miao, Ajmal Mian | N/A | |
| G3R: Gradient Guided Generalizable Reconstruction | Yun Chen, Jingkang Wang, Ze Yang, Sivabalan Manivasagam, Raquel Urtasun* | N/A | |
| DreamScene360: Unconstrained Text-to-3D Scene Generation with Panoramic Gaussian Splatting | Shijie Zhou*, Zhiwen Fan, Dejia Xu, Haoran Chang, Pradyumna Chari, Tejas K Bharadwaj, Suya You, Zhangyang Wang, Achuta Kadambi | N/A | |
| Frequency-Spatial Entanglement Learning for Camouflaged Object Detection | Yanguang Sun, Chunyan Xu, Jian Yang, Hanyu Xuan, Lei Luo | N/A | |
| VisionTrap: Vision-Augmented Trajectory Prediction Guided by Textual Descriptions | Seokha Moon, Hyun Woo, Hongbeen Park, Haeji Jung, Reza Mahjourian, Hyung-gun Chi, Hyerin Lim, Sangpil Kim, Jinkyu Kim* | N/A | |
| Occluded Gait Recognition with Mixture of Experts: An Action Detection Perspective | Panjian Huang, Yunjie Peng, Saihui Hou, Chunshui Cao, Xu Liu, Zhiqiang He, Yongzhen Huang | N/A | |
| EDTalk: Efficient Disentanglement for Emotional Talking Head Synthesis | Shuai Tan, Bin Ji, Mengxiao Bi, Ye Pan | N/A | |
| Groma: Localized Visual Tokenization for Grounding Multimodal Large Language Models | Chuofan Ma, Yi Jiang, Jiannan Wu, Zehuan Yuan, Xiaojuan Qi* | N/A | |
| On the Utility of 3D Hand Poses for Action Recognition | Md Salman Shamil, Dibyadip Chatterjee, Fadime Sener, Shugao Ma, Angela Yao | N/A | |
| DG-PIC: Domain Generalized Point-In-Context Learning for Point Cloud Understanding | Jincen Jiang, Qianyu Zhou, Yuhang Li, Xuequan Lu, Meili Wang, Lizhuang Ma, Jian Chang, Jian Jun Zhang | N/A | |
| Operational Open-Set Recognition and PostMax Refinement | Steve Cruz*, Ryan Rabinowitz, Manuel Günther, Terrance E. Boult | N/A | |
| ScaleDreamer: Scalable Text-to-3D Synthesis with Asynchronous Score Distillation | Zhiyuan Ma*, Yuxiang Wei, Yabin Zhang, Xiangyu Zhu, Zhen Lei, Lei Zhang | N/A | |
| SINDER: Repairing the Singular Defects of DINOv2 | Haoqi Wang, Tong Zhang, Mathieu Salzmann* | N/A | |
| SEA-RAFT: Simple, Efficient, Accurate RAFT for Optical Flow | Yihan Wang*, Lahav O Lipson, Jia Deng | N/A | |
| Learning Differentially Private Diffusion Models via Stochastic Adversarial Distillation | Bochao Liu, Pengju Wang, Shiming Ge* | N/A | |
| General and Task-Oriented Video Segmentation | Mu Chen, Liulei Li, Wenguan Wang, Ruijie Quan, Yi Yang* | N/A | |
| VISAGE: Video Instance Segmentation with Appearance-Guided Enhancement | Hanjung Kim, Jaehyun Kang, Miran Heo, Sukjun Hwang, Seoung Wug Oh, Seon Joo Kim* | N/A | |
| LiFT: A Surprisingly Simple Lightweight Feature Transform for Dense ViT Descriptors | Saksham Suri*, Matthew Walmer, Kamal Gupta, Abhinav Shrivastava | N/A | |
| ControlNet++: Improving Conditional Controls with Efficient Consistency Feedback | Ming Li*, Taojiannan Yang, Huafeng Kuang, Jie Wu, Zhaoning Wang, Xuefeng Xiao, Chen Chen | N/A | |
| TF-FAS: Twofold-Element Fine-Grained Semantic Guidance for Generalizable Face Anti-Spoofing | Xudong Wang, Ke-Yue Zhang, Taiping Yao, Qianyu Zhou, Shouhong Ding, Pingyang Dai, Rongrong Ji | N/A | |
| Prompting Future Driven Diffusion Model for Hand Motion Prediction | Bowen Tang, Kaihao Zhang, Wenhan Luo*, Wei Liu, HONGDONG LI | N/A | |
| Defect Spectrum: A Granular Look of Large-scale Defect Datasets with Rich Semantics | Shuai Yang, ZhiFei Chen, Pengguang Chen, Xi Fang, Yixun Liang, Shu Liu, Yingcong Chen | N/A | |
| Unveiling Advanced Frequency Disentanglement Paradigm for Low-Light Image Enhancement | Kun Zhou*, Xinyu Lin, Wenbo Li, Xiaogang Xu, Yuanhao Cai, Zhonghang Liu, Xiaoguang Han, Jiangbo Lu | N/A | |
| RAPiD-Seg: Range-Aware Pointwise Distance Distribution Networks for 3D LiDAR Segmentation | Li Li*, Hubert P. H. Shum, Toby P Breckon | N/A | |
| UMBRAE: Unified Multimodal Brain Decoding | Weihao Xia*, Raoul de Charette, A. Cengiz Oztireli, Jing-Hao Xue | N/A | |
| NavGPT-2: Unleashing Navigational Reasoning Capability for Large Vision-Language Models | Gengze Zhou*, Yicong Hong, Zun Wang, Xin Eric Wang, Qi Wu | N/A | |
| 3D Single-object Tracking in Point Clouds with High Temporal Variation | Qiao Wu, Kun Sun, Pei An, Mathieu Salzmann, Yanning Zhang, Jiaqi Yang* | N/A | |
| Adaptive Multi-task Learning for Few-shot Object Detection | Yan Ren*, Yanling Li, Adams Wai-Kin Kong | N/A | |
| Event Trojan: Asynchronous Event-based Backdoor Attacks | Ruofei Wang, Qing Guo, Haoliang Li, Renjie Wan | N/A | |
| Stepwise Multi-grained Boundary Detector for Point-supervised Temporal Action Localization | Mengnan Liu, Le Wang*, Sanping Zhou, Kun Xia, Qi Wu, Qilin Zhang, Gang Hua | N/A | |
| Imaging Interiors: An Implicit Solution to Electromagnetic Inverse Scattering Problems | Ziyuan Luo, Boxin Shi, Haoliang Li, Renjie Wan* | N/A | |
| Dropout Mixture Low-Rank Adaptation for Visual Parameters-Efficient Fine-Tuning | Zhengyi Fang, Yue Wang, Ran Yi*, Lizhuang Ma | N/A | |
| OneTrack: Demystifying the Conflict Between Detection and Tracking in End-to-End 3D Trackers | Qitai Wang, Jiawei He, Yuntao Chen, Zhaoxiang Zhang* | N/A | |
| LoA-Trans: Enhancing Visual Grounding by Location-Aware Transformers | Ziling Huang*, Shin'ichi Satoh | N/A | |
| HAC: Hash-grid Assisted Context for 3D Gaussian Splatting Compression | Yihang Chen, Qianyi Wu, Weiyao Lin, Mehrtash Harandi, Jianfei Cai | N/A | |
| Energy-induced Explicit quantification for Multi-modality MRI fusion | Xiaoming Qi, Yuan Zhang, Tong Wang, Guanyu Yang, Yueming Jin*, Shuo Li | N/A | |
| ColorPeel: Color Prompt Learning with Diffusion Models via Color and Shape Disentanglement | Muhammad Atif Butt*, Kai Wang, Javier Vazquez-Corral, Joost van de Weijer | N/A | |
| Exemplar-free Continual Representation Learning via Learnable Drift Compensation | Alex Gomez-Villa*, Dipam Goswami, Kai Wang, Andy Bagdanov, Bartlomiej Twardowski, Joost van de Weijer | N/A | |
| Walker: Self-supervised Multiple Object Tracking by Walking on Temporal Object Appearance Graphs | Mattia Segù*, Luigi Piccinelli, Siyuan Li, Luc Van Gool, Fisher Yu, Bernt Schiele | N/A | |
| Spatio-Temporal Proximity-Aware Dual-Path Model for Panoramic Activity Recognition | Sumin Lee*, Yooseung Wang, Sangmin Woo, Changick Kim | N/A | |
| DiffiT: Diffusion Vision Transformers for Image Generation | Ali Hatamizadeh*, Jiaming Song, Guilin Liu, Jan Kautz, Arash Vahdat | N/A | |
| WebRPG: Automatic Web Rendering Parameters Generation for Visual Presentation | Zirui Shao, Feiyu Gao, Hangdi Xing, Zepeng Zhu, Zhi Yu*, Jiajun Bu, Qi Zheng, Cong Yao | N/A | |
| GPSFormer: A Global Perception and Local Structure Fitting-based Transformer for Point Cloud Understanding | Changshuo Wang*, Meiqing Wu, Siew-Kei Lam, Xin Ning, Shangshu Yu, Ruiping Wang, Weijun Li, Thambipillai Srikanthan | N/A | |
| FreeMotion: A Unified Framework for Number-free Text-to-Motion Synthesis | Ke Fan, Junshu Tang, Weijian Cao, Ran Yi, Moran Li, Jingyu Gong, Jiangning Zhang, Yabiao Wang, Chengjie Wang, Lizhuang Ma | N/A | |
| FSD-BEV: Foreground Self-Distillation for Multi-view 3D Object Detection | Zheng Jiang, Jinqing Zhang, Yanan Zhang, Qingjie Liu, Zhenghui HU, Baohui Wang, Yunhong Wang | N/A | |
| SceneGraphLoc: Cross-Modal Coarse Visual Localization on 3D Scene Graphs | Yang Miao, Francis Engelmann, Olga Vysotska, Federico Tombari, Marc Pollefeys, Daniel Barath* | N/A | |
| ScanReason: Empowering 3D Visual Grounding with Reasoning Capabilities | Chenming Zhu, Tai Wang, Wenwei Zhang, Kai Chen, Xihui Liu* | N/A | |
| MathVerse: Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems? | Renrui Zhang, Dongzhi Jiang, Yichi Zhang, Haokun Lin, Ziyu Guo, Pengshuo Qiu, Aojun Zhou, Pan Lu, Kai-Wei Chang, Peng Gao, Hongsheng Li* | N/A | |
| See and Think: Embodied Agent in Virtual Environment | Zhonghan Zhao, Xuan Wang, Wenhao Chai, Boyi Li, Shengyu Hao, Shidong Cao, Tian Ye, Gaoang Wang* | N/A | |
| PISR: Polarimetric Neural Implicit Surface Reconstruction for Textureless and Specular Objects | Guangcheng Chen*, Yicheng He, Li He, Hong Zhang | N/A | |
| Bridging the Gap Between Human Motion and Action Semantics via Kinematics Phrases | Xinpeng Liu, Yong-Lu Li, Ailing Zeng, Zizheng Zhou, Yang You, Cewu Lu | N/A | |
| VisFocus: Prompt-Guided Vision Encoders for OCR-Free Dense Document Understanding | Ofir Abramovich, Niv Nayman, Sharon Fogel, Inbal Lavi, Ron Litman, Shahar Tsiper, Royee Tichauer, Srikar Appalaraju, Shai Mazor, R. Manmatha | N/A | |
| Masked Angle-Aware Autoencoder for Remote Sensing Images | Zhihao Li*, Biao Hou, Siteng Ma, zitong wu, Xianpeng Guo, bo ren, Licheng Jiao | N/A | |
| Infinite-ID: Identity-preserved Personalization via ID-semantics Decoupling Paradigm | Yi Wu, Ziqiang Li, Heliang Zheng, Chaoyue Wang, Bin Li | N/A | |
| MultiGen: Zero-shot Image Generation from Multi-modal Prompts | Zhi-Fan Wu*, Lianghua Huang, Wei Wang, Yanheng Wei, Yu Liu | N/A | |
| GazeXplain: Learning to Predict Natural Language Explanations of Visual Scanpaths | Xianyu Chen, Ming Jiang, Qi Zhao | N/A | |
| Learning Chain of Counterfactual Thought for Bias-Robust Vision-Language Reasoning | Yifeng Zhang, Ming Jiang, Qi Zhao* | N/A | |
| SegGen: Supercharging Segmentation Models with Text2Mask and Mask2Img Synthesis | Hanrong Ye, Jason Kuen, Qing Liu, Zhe Lin, Brian Price, Dan Xu | N/A | |
| Sync from the Sea: Retrieving Alignable Videos from Large-Scale Datasets | Ishan Rajendrakumar Dave, Fabian Caba, Mubarak Shah, Simon Jenni | N/A | |
| FinePseudo: Improving Pseudo-Labelling through Temporal-Alignablity for Semi-Supervised Fine-Grained Action Recognition | Ishan Rajendrakumar Dave, Mamshad Nayeem Rizve, Mubarak Shah | N/A | |
| Elegantly Written: Disentangling Writer and Character Styles for Enhancing Online Chinese Handwriting | Yu Liu, Fatimah binti Khalid, Lei Wang, Youxi Zhang, Cunrui Wang* | N/A | |
| UniCode: Learning a Unified Codebook for Multimodal Large Language Models | Sipeng Zheng, Bohan Zhou, Yicheng Feng, Ye Wang, Zongqing Lu | N/A | |
| When Do We Not Need Larger Vision Models? | Baifeng Shi*, Ziyang Wu, Maolin Mao, Xin Wang, Trevor Darrell | N/A | |
| GVGEN: Text-to-3D Generation with Volumetric Representation | Xianglong He, Junyi Chen, Sida Peng, Di Huang, Yangguang Li, Xiaoshui Huang, Chun Yuan, Wanli Ouyang, Tong He | N/A | |
| Bidirectional Stereo Image Compression with Cross-Dimensional Entropy Model | Zhening Liu, Xinjie Zhang, Jiawei Shao, Zehong Lin*, Jun Zhang | N/A | |
| UniINR: Event-guided Unified Rolling Shutter Correction, Deblurring, and Interpolation | Yunfan Lu, Guoqiang Liang, Yusheng Wang, Lin Wang, Hui Xiong | N/A | |
| ReLoo: Reconstructing Humans Dressed in Loose Garments from Monocular Video in the Wild | Chen Guo, Tianjian Jiang, Manuel Kaufmann, Chengwei Zheng, Julien Valentin, Jie Song, Otmar Hilliges | N/A | |
| Weakly-supervised Camera Localization by Ground-to-satellite Image Registration | Yujiao Shi*, HONGDONG LI, Akhil Perincherry, Ankit Vora | N/A | |
| Dataset Growth | Ziheng Qin, zhaopan xu, YuKun Zhou, Kai Wang, Zangwei Zheng, Zebang Cheng, Hao Tang, Lei Shang, Baigui Sun, Radu Timofte, Xiaojiang Peng, Hongxun Yao, Yang You | N/A | |
| MaRINeR: Enhancing Novel Views by Matching Rendered Images with Nearby References | Lukas Bösiger*, Mihai Dusmanu, Marc Pollefeys, Zuria Bauer | N/A | |
| Teaching Tailored to Talent: Adverse Weather Restoration via Prompt Pool and Depth-Anything Constraint | Sixiang Chen, Tian Ye, Kai Zhang, Zhaohu Xing, Yunlong Lin, Lei Zhu* | N/A | |
| MoE-DiffIR: Task-customized Diffusion Priors for Universal Compressed Image Restoration | Yulin Ren, Xin Li, Bingchen Li, Xingrui Wang, Mengxi China Guo, Shijie Zhao, Li Zhang, Zhibo Chen | N/A | |
| LEGO: Learning EGOcentric Action Frame Generation via Visual Instruction Tuning | Bolin Lai*, Xiaoliang Dai, Lawrence Chen, Guan Pang, James M Rehg, Miao Liu | N/A | |
| SQ-LLaVA: Self-Questioning for Large Vision-Language Assistant | Guohao Sun*, Can Qin, JIAMINAN WANG, Zeyuan Chen, Ran Xu, Zhiqiang Tao | N/A | |
| Mesh2NeRF: Direct Mesh Supervision for Neural Radiance Field Representation and Generation | Yujin Chen*, Yinyu Nie, Benjamin Ummenhofer, Reiner Birkl, Michael Paulitsch, Matthias Müller, Matthias Niessner | N/A | |
| Listen to Look into the Future: Audio-Visual Egocentric Gaze Anticipation | Bolin Lai*, Fiona Ryan, Wenqi Jia, Miao Liu, James M Rehg | N/A | |
| R^2-Bench: Benchmarking the Robustness of Referring Perception Models under Perturbations | Xiang Li*, Kai Qiu, Jinglu Wang, Xiaohao Xu, Kashu Yamazaki, Hao Chen, Rita Singh, Xiaonan Huang, Bhiksha Raj | N/A | |
| Self-supervised co-salient object detection via feature correspondences at multiple scales | Souradeep Chakraborty*, Dimitris Samaras | N/A | |
| Differentiable Convex Polyhedra Optimization from Multi-view Images | Daxuan Ren*, Haiyi Mei, Hezi Shi, Jianmin Zheng, Jianfei Cai, Lei Yang | N/A | |
| SlotLifter: Slot-guided Feature Lifting for Learning Object-Centric Radiance Fields | Yu Liu, Baoxiong Jia*, Yixin Chen, Siyuan Huang | N/A | |
| SceneVerse: Scaling 3D Vision-Language Learning for Grounded Scene Understanding | Baoxiong Jia*, Yixin Chen, Huangyue Yu, Yan Wang, Xuesong Niu, Tengyu Liu, Qing Li, Siyuan Huang | N/A | |
| ADMap: Anti-disturbance Framework for Vectorized HD Map Construction | Haotian Hu, Fanyi Wang, Yaonong Wang, Laifeng Hu, Jingwei Xu, Zhiwang Zhang | N/A | |
| GaussianImage: 1000 FPS Image Representation and Compression by 2D Gaussian Splatting | Xinjie Zhang, Xingtong Ge, Tongda Xu, Dailan He, Yan Wang, Hongwei Qin, Guo Lu, Jing Geng, Jun Zhang | N/A | |
| PanoVOS: Bridging Non-panoramic and Panoramic Views with Transformer for Video Segmentation | Shilin Yan, Xiaohao Xu, Renrui Zhang, Lingyi Hong, wenchao chen, Wenqiang Zhang, Wei Zhang | N/A | |
| Evaluating Text-to-Visual Generation with Image-to-Text Generation | Zhiqiu Lin*, Deepak Pathak, Baiqi Li, Jiayao Li, Xide Xia, Graham Neubig, Pengchuan Zhang, Deva Ramanan | N/A | |
| SENC: Handling Self-collision in Neural Cloth Simulation | Zhouyingcheng Liao*, Sinan Wang, Taku Komura | N/A | |
| HybridBooth: Hybrid Prompt Inversion for Efficient Subject-Driven Generation | Shanyan Guan, Yanhao Ge, Ying Tai, Jian Yang, Wei Li, Mingyu You | N/A | |
| PartCraft: Crafting Creative Objects by Parts | Kam Woh Ng*, Xiatian Zhu, Yi-Zhe Song, Tao Xiang | N/A | |
| GeometrySticker: Enabling Ownership Claim of Recolorized Neural Radiance Fields | Xiufeng HUANG, Ka Chun Cheung, Simon See, Renjie Wan | N/A | |
| PYRA: Parallel Yielding Re-Activation for Training-Inference Efficient Task Adaptation | Yizhe Xiong, Hui Chen*, Tianxiang Hao, Zijia Lin, Jungong Han, Yuesong Zhang, Guoxin Wang, Yongjun Bao, Guiguang Ding | N/A | |
| FineMatch: Aspect-based Fine-grained Image and Text Mismatch Detection and Correction | Hang Hua*, Jing Shi, Kushal Kafle, Simon Jenni, Daoan Zhang, John Collomosse, Scott Cohen, Jiebo Luo | N/A | |
| CrossScore: A Multi-View Approach to Image Evaluation and Scoring | Zirui Wang*, Wenjing Bian, Victor Adrian Prisacariu | N/A | |
| Modeling and Driving Human Body Soundfields through Acoustic Primitives | Chao Huang, Dejan Markovic, Chenliang Xu, Alexander Richard | N/A | |
| m&m’s: A Benchmark to Evaluate Tool-Use for multi-step multi-modal Tasks | Zixian Ma*, Weikai Huang, Jieyu Zhang, Tanmay Gupta, Ranjay Krishna | N/A | |
| Label-anticipated Event Disentanglement for Audio-Visual Video Parsing | Jinxing Zhou, Dan Guo, Yuxin Mao, Yiran Zhong, Xiaojun Chang, Meng Wang* | N/A | |
| High-Fidelity 3D Textured Shapes Generation by Sparse Encoding and Adversarial Decoding | Qi Zuo*, Xiaodong Gu, Yuan Dong, Zhengyi Zhao, Weihao Yuan, Qiu Lingteng, Liefeng Bo, Zilong Dong | N/A | |
| Semi-Supervised Video Desnowing Network via Temporal Decoupling Experts and Distribution-Driven Contrastive Regularization | Hongtao Wu, Angelica I Aviles-Rivero, Yijun Yang, Jingjing Ren, Sixiang Chen, Haoyu Chen, Lei Zhu* | N/A | |
| I-MedSAM: Implicit Medical Image Segmentation with Segment Anything | Xiaobao Wei, Jiajun Cao, Yizhu Jin, Ming Lu, Guangyu Wang, Shanghang Zhang* | N/A | |
| ReMamber: Referring Image Segmentation with Mamba Twister | Yuhuan Yang, Chaofan Ma, Jiangchao Yao, Zhun Zhong, Ya Zhang, Yanfeng Wang | N/A | |
| TalkingGaussian: Structure-Persistent 3D Talking Head Synthesis via Gaussian Splatting | Jiahe Li, Jiawei Zhang, Xiao Bai, Jin Zheng, Xin Ning, Jun Zhou, Lin Gu | N/A | |
| CAT: Enhancing Multimodal Large Language Model to Answer Questions in Dynamic Audio-Visual Scenarios | Qilang Ye, Zitong Yu*, Rui Shao, Xinyu Xie, Philip Torr, Xiaochun Cao | N/A | |
| Segmentation-guided Layer-wise Image Vectorization with Gradient Fills | Hengyu Zhou, Hui Zhang, Bin Wang | N/A | |
| Implicit Style-Content Separation using B-LoRA | Yarden Frenkel*, Yael Vinker, Ariel Shamir, Danny Cohen-Or | N/A | |
| OpenPSG: Open-set Panoptic Scene Graph Generation via Large Multimodal Models | Zijian Zhou, Zheng Zhu, Holger Caesar, Miaojing Shi | N/A | |
| ActionVOS: Actions as Prompts for Video Object Segmentation | Liangyang Ouyang, Ruicong Liu, Yifei Huang, Ryosuke Furuta, Yoichi Sato* | N/A | |
| FALIP: Visual Prompt as Foveal Attention Boosts CLIP Zero-Shot Performance | Jiedong Zhuang, Jiaqi Hu, Lianrui Mu, Rui Hu, Xiaoyu Liang, Jiangnan Ye, Haoji Hu* | N/A | |
| U-COPE: Taking a Further Step to Universal 9D Category-level Object Pose Estimation | li zhang*, Weiqing Meng, Yan Zhong, Bin Kong, Mingliang Xu, Jianming Du, Xue Wang, Rujing Wang, Liu Liu | N/A | |
| Integrating Markov Blanket Discovery into Causal Representation Learning for Domain Generalization | Naiyu Yin*, Hanjing Wang, Yue Yu, Tian Gao, Amit Dhurandhar, Qiang Ji | N/A | |
| Rotary Position Embedding for Vision Transformer | Byeongho Heo*, Song Park, Dongyoon Han, Sangdoo Yun | N/A | |
| Local All-Pair Correspondence for Point Tracking | Seokju Cho, Jiahui Huang, Jisu Nam, Honggyu An, Seungryong Kim, Joon-Young Lee | N/A | |
| MonoWAD: Weather-Adaptive Diffusion Model for Robust Monocular 3D Object Detection | Youngmin Oh, Hyung-Il Kim, Seong Tae Kim, Jung Uk Kim | N/A | |
| ReALFRED: An Embodied Instruction Following Benchmark in Photo-Realistic Environments | Taewoong Kim, Cheolhong Min, Byeonghwi Kim, Jinyeon Kim, Wonje Jeung, Jonghyun Choi* | N/A | |
| S^3D-NeRF: Single-Shot Speech-Driven Neural Radiance Field for High Fidelity Talking Head Synthesis | Dongze Li, Kang Zhao, Wei Wang*, Yifeng Ma, Bo Peng, Yingya Zhang, Jing Dong | N/A | |
| ActionSwitch: Class-agnostic Detection of Simultaneous Actions in Streaming Videos | Hyolim Kang, Jeongseok Hyun, Joungbin An, Youngjae Yu, Seon Joo Kim* | N/A | |
| Hierarchically Structured Neural Bones for Reconstructing Animatable Objects from Casual Videos | Subin Jeon, In Cho, Minsu Kim, Woong Oh Cho, Seon Joo Kim* | N/A | |
| PQ-SAM: Post-training Quantization for Segment Anything Model | Xiaoyu Liu, Xin Ding, Lei Yu, Yuanyuan Xi, Wei Li, Zhijun Tu, jie hu, Hanting Chen, Baoqun YIN, Zhiwei Xiong | N/A | |
| CPM: Class-conditional Prompting Machine for Audio-visual Segmentation | Yuanhong Chen*, Chong Wang, Yuyuan Liu, Hu Wang, Gustavo Carneiro | N/A | |
| Optimizing Factorized Encoder Models: Time and Memory Reduction for Scalable and Efficient Action Recognition | Shreyank N Gowda*, Anurag Arnab, Jonathan Huang | N/A | |
| DVLO: Deep Visual-LiDAR Odometry with Local-to-Global Feature Fusion and Bi-Directional Structure Alignment | Jiuming Liu, Dong Zhuo, Zhiheng Feng, Siting Zhu, Chensheng Peng, Zhe Liu, Hesheng Wang* | N/A | |
| CoLeaF: A Contrastive-Collaborative Learning Framework for Weakly Supervised Audio-Visual Video Parsing | Faegheh Sardari*, Armin Mustafa, Philip JB Jackson, Adrian Hilton | N/A | |
| Noise-assisted Prompt Learning for Image Forgery Detection and Localization | Dong Li, Jiaying Zhu, Xueyang Fu*, Xun Guo, Yidi Liu, Gang Yang, Jiawei Liu, Zheng-Jun Zha | N/A | |
| Data Collection-free Masked Video Modeling | Yuchi Ishikawa*, Masayoshi Kondo, Yoshimitsu Aoki | N/A | |
| Protecting NeRFs' Copyright via Plug-And-Play Watermarking Base Model | Qi Song*, Ziyuan Luo, Ka Chun Cheung, Simon See, Renjie Wan | N/A | |
| Pixel-Aware Stable Diffusion for Realistic Image Super-Resolution and Personalized Stylization | Tao Yang*, Rongyuan Wu, Peiran Ren, Xuansong Xie, Lei Zhang | N/A | |
| AnyControl: Create Your Artwork with Versatile Control on Text-to-Image Generation | Yanan Sun*, Yanchen Liu, Yinhao Tang, Wenjie Pei, Kai Chen | N/A | |
| SEED: A Simple and Effective 3D DETR in Point Clouds | Zhe Liu, Jinghua Hou, Xiaoqing Ye, Tong Wang, Jingdong Wang, Xiang Bai* | N/A | |
| AEDNet: Adaptive Embedding and Multiview-Aware Disentanglement for Point Cloud Completion | Zhiheng Fu, Longguang Wang, Lian Xu, Zhiyong Wang, Hamid Laga, Yulan Guo*, Farid Boussaid, Mohammed Bennamoun | N/A | |
| Synergy of Sight and Semantics: Visual Intention Understanding with CLIP | Qu Yang, Mang Ye*, Dacheng Tao | N/A | |
| Intrinsic Single-Image HDR Reconstruction | Sebastian Dille, Chris Careaga, Yagiz Aksoy | N/A | |
| T-MAE: Temporal Masked Autoencoders for Point Cloud Representation Learning | Weijie Wei, Fatemeh Karimi Nejadasl, Theo Gevers, Martin R. Oswald | N/A | |
| Pathology-knowledge Enhanced Multi-instance Prompt Learning for Few-shot Whole Slide Image Classification | Linhao Qu, Dingkang Yang, Dan Huang, Qinhao Guo, rongkui luo, Shaoting Zhang, Xiaosong Wang | N/A | |
| Towards Natural Language-Guided Drones: GeoText-1652 Benchmark with Spatial Relation Matching | Meng Chu, Zhedong Zheng*, Wei Ji, Tingyu Wang, Tat-Seng Chua | N/A | |
| BEAF: Observing BEfore-AFter Changes to Evaluate Hallucination in Vision-language Models | Moon Ye-Bin, Nam Hyeon-Woo, Wonseok Choi, Tae-Hyun Oh* | N/A | |
| Approaching Outside: Scaling Unsupervised 3D Object Detection from 2D Scene | Ruiyang Zhang, Hu Zhang, Hang Yu, Zhedong Zheng | N/A | |
| DATENeRF: Depth-Aware Text-based Editing of NeRFs | Sara Rojas Martinez*, Julien Philip, Kai Zhang, Sai Bi, Fujun Luan, Bernard Ghanem, Kalyan Sunkavalli | N/A | |
| XPSR: Cross-modal Priors for Diffusion-based Image Super-Resolution | Qu Yunpeng*, Kun Yuan, Kai Zhao, Qizhi Xie, Jinhua Hao, Ming Sun, Chao Zhou | N/A | |
| ABC Easy as 123: A Blind Counter for Exemplar-Free Multi-Class Class-agnostic Counting | Michael A Hobley*, Victor Adrian Prisacariu | N/A | |
| Category Adaptation Meets Projected Distillation in Generalized Continual Category Discovery | Grzegorz Rypeść*, Daniel Marczak, Sebastian Cygert, Tomasz Trzcinski, Bartlomiej Twardowski | N/A | |
| LaRa: Efficient Large-Baseline Radiance Fields | Anpei Chen*, Haofei Xu, Stefano Esposito, Siyu Tang, Andreas Geiger | N/A | |
| Bi-TTA: Bidirectional Test-Time Adapter for Remote Physiological Measurement | Haodong LI, Hao LU, Yingcong Chen | N/A | |
| MAGR: Manifold-Aligned Graph Regularization for Continual Action Quality Assessment | Kanglei Zhou, Liyuan Wang, Xingxing Zhang, Hubert P. H. Shum, Frederick W. B. Li, Jianguo Li, Xiaohui Liang* | N/A | |
| Grounding Language Models for Visual Entity Recognition | Zilin Xiao, Ming Gong, Paola Cascante-Bonilla, Xingyao Zhang, Jie Wu, Vicente Ordonez | N/A | |
| ELSE: Efficient Deep Neural Network Inference through Line-based Sparsity Exploration | Zeqi Zhu*, Alberto Garcia-Ortiz, Luc Waeijen, Egor Bondarev, Arash Pourtaherian, Orlando Moreira | N/A | |
| DiffusionDepth: Diffusion Denoising Approach for Monocular Depth Estimation | Yiqun Duan, Xianda Guo, Zheng Zhu | N/A | |
| DC-Solver: Improving Predictor-Corrector Diffusion Sampler via Dynamic Compensation | Wenliang Zhao, Haolin Wang, Jie Zhou, Jiwen Lu* | N/A | |
| TRAM: Global Trajectory and Motion of 3D Humans from in-the-wild Videos | Yufu Wang*, Ziyun Wang, Lingjie Liu, Kostas Daniilidis | N/A | |
| MutDet: Mutually Optimizing Pre-training for Remote Sensing Object Detection | Ziyue Huang, Yongchao Feng, Qingjie Liu*, Yunhong Wang | N/A | |
| Self-Supervised Video Copy Localization with Regional Token Representation | Minlong Lu*, Yichen Lu, Siwei Nie, Xudong Yang, Xiaobo Zhang | N/A | |
| Enhancing Perceptual Quality in Video Super-Resolution through Temporally-Consistent Detail Synthesis using Diffusion Models | Claudio Rota*, Marco Buzzelli, Joost van de Weijer | N/A | |
| RoGUENeRF: A Robust Geometry-Consistent Universal Enhancer for NeRF | Sibi Catley-Chandar*, Richard Shaw, Gregory Slabaugh, Eduardo Pérez Pellitero | N/A | |
| Bridging the Gap: Studio-like Avatar Creation from a Monocular Phone Capture | ShahRukh Athar*, Shunsuke Saito, Stanislav Pidhorskyi, Zhengyu Yang, Chen Cao | N/A | |
| ControlLLM: Augment Language Models with Tools by Searching on Graphs | Zhaoyang Liu, Zeqiang Lai, Zhangwei Gao, erfei cui, Ziheng Li, Xizhou Zhu, Lewei Lu, Qifeng Chen, Yu Qiao, Jifeng Dai, Wenhai Wang | N/A | |
| UniTraj: A Unified Framework for Scalable Vehicle Trajectory Prediction | Lan Feng, Mohammadhossein Bahari*, Kaouther Messaoud, Eloi Zablocki, Matthieu Cord, Alexandre Alahi | N/A | |
| DreamDissector: Learning Disentangled Text-to-3D Generation from 2D Diffusion Priors | Zizheng Yan, Jiapeng Zhou, Fanpeng Meng, Yushuang Wu, Lingteng Qiu, Zisheng Ye, Shuguang Cui, Guanying CHEN, Xiaoguang Han | N/A | |
| Vamos: Versatile Action Models for Video Understanding | Shijie Wang*, Qi Zhao, Minh Quan Do, Nakul Agarwal, Kwonjoon Lee, Chen Sun | N/A | |
| Prioritized Semantic Learning for Zero-shot Instance Navigation | xinyu sun, Lizhao Liu, Hongyan Zhi, Ronghe Qiu, Junwei Liang | N/A | |
| RoadPainter: Points Are Ideal Navigators for Topology transformER | Zhongxing Ma, Liang Shuang, Yongkun Wen, Weixin Lu, Guowei Wan* | N/A | |
| FouriScale: A Frequency Perspective on Training-Free High-Resolution Image Synthesis | Linjiang Huang, Rongyao Fang, Aiping Zhang, Guanglu Song, Si Liu, Yu Liu, Hongsheng Li | N/A | |
| Can OOD Object Detectors Learn from Foundation Models? | Jiahui Liu, Xin Wen, Shizhen Zhao, Yingxian Chen, Xiaojuan Qi | N/A | |
| Videoshop: Localized Semantic Video Editing with Noise-Extrapolated Diffusion Inversion | Xiang Fan*, Anand Bhattad, Ranjay Krishna | N/A | |
| MERLiN: Single-Shot Material Estimation and Relighting for Photometric Stereo | Ashish Tiwari*, Satoshi Ikehata, Shanmuganathan Raman | N/A | |
| Boosting 3D Single Object Tracking with 2D Matching Distillation and 3D Pre-training | Qiangqiang Wu, Yan Xia*, Jia Wan, Antoni Chan | N/A | |
| Diffusion-Based Image-to-Image Translation by Noise Correction via Prompt Interpolation | Junsung Lee, Minsoo Kang, Bohyung Han* | N/A | |
| Real-data-driven 2000 FPS Color Video from Mosaicked Chromatic Spikes | Siqi Yang*, Zhaojun Huang, Yakun Chang, Bin Fan, Zhaofei Yu, Boxin Shi | N/A | |
| Brain-ID: Learning Contrast-agnostic Anatomical Representations for Brain Imaging | Peirong Liu*, Oula Puonti, Xiaoling Hu, Daniel C. Alexander, Juan E. Iglesias | N/A | |
| TTT-MIM: Test-Time Training with Masked Image Modeling for Denoising Distribution Shifts | Youssef Mansour*, Xuyang Zhong, Serdar Caglar, Reinhard Heckel | N/A | |
| RadEdit: stress-testing biomedical vision models via diffusion image editing | Fernando Pérez-García, Sam Bond-Taylor, Pedro Sanchez, Boris van Breugel, Daniel Coelho de Castro, Harshita Sharma, Valentina Salvatelli, Maria Teodora A Wetscherek, Hannah CM Richardson, Lungren Matthew, Aditya Nori, Javier Alvarez-Valle, Ozan Oktay, Maximilian Ilse* | N/A | |
| SPAMming Labels: Efficient Annotations for the Trackers of Tomorrow | Orcun Cetintas*, Tim Meinhardt, Guillem Brasó, Laura Leal-Taixé | N/A | |
| AdaDiffSR: Adaptive Region-aware Dynamic acceleration Diffusion Model for Real-World Image Super-Resolution | Yuanting Fan, Chengxu Liu, Nengzhong Yin, Changlong Gao, Xueming Qian* | N/A | |
| Explicitly Guided Information Interaction Network for Cross-modal Point Cloud Completion | Xu Hang, Chen Long, Wenxiao Zhang*, Yuan Liu, Zhen Cao, Zhen Dong, Bisheng Yang | N/A | |
| Towards Real-world Event-guided Low-light Video Enhancement and Deblurring | Taewoo Kim, Jaeseok Jeong, Hoonhee Cho, Yuhwan Jeong, Kuk-Jin Yoon* | N/A | |
| Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation | Xuelu Feng, Dongdong Chen, Junsong Yuan, Chunming Qiao, Gang Hua, Zixin Zhu* | N/A | |
| TrackNeRF: Bundle Adjusting NeRF from Sparse and Noisy Views via Feature Tracks | Jinjie Mai*, Wenxuan Zhu, Sara Rojas, Jesus Zarzar, Abdullah Hamdi, Guocheng Qian, Bing Li, Silvio Giancola, Bernard Ghanem | N/A | |
| COHO: Context-Sensitive City-Scale Hierarchical Urban Layout Generation | Liu He*, Daniel Aliaga | N/A | |
| Joint RGB-Spectral Decomposition Model Guided Image Enhancement in Mobile Photography | Kailai Zhou, Lijing Cai, Yibo Wang, Mengya Zhang, Bihan Wen, Qiu Shen, Xun Cao | N/A | |
| SpatialFormer: Towards Generalizable Vision Transformers with Explicit Spatial Understanding | Han Xiao, Wenzhao Zheng, Sicheng Zuo, Peng Gao, Jie Zhou, Jiwen Lu* | N/A | |
| OccWorld: Learning a 3D Occupancy World Model for Autonomous Driving | Wenzhao Zheng, Weiliang Chen, Yuanhui Huang, Borui Zhang, Yueqi Duan, Jiwen Lu* | N/A | |
| MyVLM: Personalizing VLMs for User-Specific Queries | Yuval Alaluf*, Elad Richardson, Sergey Tulyakov, Kfir Aberman, Danny Cohen-Or | N/A | |
| AMEGO: Active Memory from long EGOcentric videos | Gabriele Goletto*, Tushar Nagarajan, Giuseppe Averta, Dima Damen | N/A | |
| Power Variable Projection for Initialization-Free Large-Scale Bundle Adjustment | Simon Weber*, Je Hyeong Hong, Daniel Cremers | N/A | |
| Collaborative Control for Geometry-Conditioned PBR Image Generation | Shimon Vainer, Mark Boss, Mathias Parger, Konstantin Kutsy, Dante De Nigris, Ciara Rowles, Nicolas Perony, Simon Donné* | N/A | |
| Co-synthesis of Histopathology Nuclei Image-Label Pairs using a Context-Conditioned Joint Diffusion Model | Seonghui Min, Hyun-Jic Oh, Won-Ki Jeong* | N/A | |
| One-stage Prompt-based Continual Learning | Youngeun Kim*, Yuhang Li, Priyadarshini Panda | N/A | |
| SpaceJAM: a Lightweight and Regularization-free Method for Fast Joint Alignment of Images | Nir Barel, Ron A Shapira Weber, Nir Mualem, Shahaf E Finder, Oren Freifeld* | N/A | |
| APL: Anchor-based Prompt Learning for One-stage Weakly Supervised Referring Expression Comprehension | Yaxin Luo, Jiayi Ji, Xiaofu Chen, Yuxin Zhang, Tianhe Ren, Gen Luo* | N/A | |
| GenQ: Quantization in Low Data Regimes with Generative Synthetic Data | Yuhang Li*, Youngeun Kim, Donghyun Lee, Souvik Kundu, Priyadarshini Panda | N/A | |
| MVDD: Multi-View Depth Diffusion Models | Zhen Wang*, Qiangeng Xu, Feitong Tan, Menglei Chai, Shichen Liu, Rohit Pandey, Sean Fanello, Achuta Kadambi, Yinda Zhang | N/A | |
| Rethinking Video-Text Understanding: Retrieval from Counterfactually Augmented Data | Wufei Ma*, Kai Li, Zhongshi Jiang, Moustafa Meshry, Qihao Liu, Huiyu Wang, Christian Haene, Alan Yuille | N/A | |
| Risk-Aware Self-Consistent Imitation Learning for Trajectory Planning in Autonomous Driving | Yixuan Fan, Ya-Li Li, Shengjin Wang | N/A | |
| Dual-level Adaptive Self-Labeling for Novel Class Discovery in Point Cloud Segmentation | Ruijie Xu*, Chuyu Zhang, Hui Ren, Xuming He | N/A | |
| EBDM: Exemplar-guided Image Translation with Brownian-bridge Diffusion Models | Eungbean Lee, Somi Jeong, Kwanghoon Sohn* | N/A | |
| DreamDrone: Text-to-Image Diffusion Models are Zero-shot Perpetual View Generators | Hanyang Kong, Dongze Lian, Michael Bi Mi, Xinchao Wang | N/A | |
| Harnessing Text-to-Image Diffusion Models for Category-Agnostic Pose Estimation | Duo Peng, Zhengbo Zhang, Ping Hu, Qiuhong Ke, David Yau, Jun Liu* | N/A | |
| SC4D: Sparse-Controlled Video-to-4D Generation and Motion Transfer | Zijie Wu, Chaohui Yu, Yanqin Jiang, Chenjie Cao, Fan Wang, Xiang Bai | N/A | |
| Overcoming Distribution Mismatch in Quantizing Image Super-Resolution Networks | Cheeun Hong, Kyoung Mu Lee* | N/A | |
| Large Motion Model for Unified Multi-Modal Motion Generation | Mingyuan Zhang, Daisheng Jin, Chenyang Gu, Fangzhou Hong, Zhongang Cai, Jingfang Huang, Chongzhi Zhang, Xinying Guo, Lei Yang, Ying He, Ziwei Liu | N/A | |
| FisherRF: Active View Selection and Mapping with Radiance Fields using Fisher Information | Wen Jiang, Boshu Lei, Kostas Daniilidis | N/A | |
| Occlusion Handling in 3D Human Pose Estimation with Perturbed Positional Encoding | Niloofar Azizi*, Mohsen Fayyaz, Horst Bischof | N/A | |
| Gradient-based Out-of-Distribution Detection | Taha Entesari, Sina Sharifi, Bardia Safaei*, Vishal Patel, Mahyar Fazlyab | N/A | |
| Event-based Mosaicing Bundle Adjustment | Shuang Guo*, Guillermo Gallego | N/A | |
| ProMerge: Prompt and Merge for Unsupervised Instance Segmentation | Dylan J Li, Gyungin Shin* | N/A | |
| M2D2M: Multi-Motion Generation from Text with Discrete Diffusion Models | Seunggeun Chi, Hyung-gun Chi, Hengbo Ma, Nakul Agarwal, Faizan Siddiqui, Karthik Ramani, Kwonjoon Lee* | N/A | |
| The Hard Positive Truth about Vision-Language Compositionality | Amita Kamath*, Cheng-Yu Hsieh, Kai-Wei Chang, Ranjay Krishna | N/A | |
| GaussCtrl: Multi-View Consistent Text-Driven 3D Gaussian Splatting Editing | Jing Wu, Jia-Wang Bian, Xinghui Li, Guangrun Wang, Ian Reid, Philip Torr, Victor Adrian Prisacariu | N/A | |
| Shapefusion: 3D localized human diffusion models | Rolandos Alexandros Potamias*, Michael Tarasiou, Stylianos Ploumpis, Stefanos Zafeiriou | N/A | |
| Eta Inversion: Designing an Optimal Eta Function for Diffusion-based Real Image Editing | Wonjun Kang, Kevin Galim, Hyung Il Koo* | N/A | |
| Prompting Language-Informed Distribution for Compositional Zero-Shot Learning | Wentao Bao*, Lichang Chen, Heng Huang, Yu Kong | N/A | |
| Wear-Any-Way: Manipulable Virtual Try-on via Sparse Correspondence Alignment | Mengting Chen*, Xi Chen, Zhonghua Zhai, Chen Ju, Xuewen Hong, Jinsong Lan, Shuai Xiao | N/A | |
| 3iGS: Factorised Tensorial Illumination for 3D Gaussian Splatting | Zhe Jun Tang*, Tat-Jen Cham | N/A | |
| Distribution-Aware Robust Learning from Long-Tailed Data with Noisy Labels | Jae Soon Baik, In Young Yoon, Kun Hoon Kim, Jun Won Choi | N/A | |
| Free-Viewpoint Video of Outdoor Sports Using a Drone | Zhengdong Hong* | N/A | |
| Wavelength-Embedding-guided Filter-Array Transformer for Spectral Demosaicing | Haijin Zeng*, Hiep Luong, Wilfried Philips | N/A | |
| ConGeo: Robust Cross-view Geo-localization across Ground View Variations | Li Mi, Chang Xu*, Javiera Castillo Navarro, Syrielle Montariol, Wen Yang, Antoine Bosselut, Devis Tuia | N/A | |
| Generalizable Facial Expression Recognition | Yuhang Zhang, Xiuqi Zheng, Chenyi Liang, Jiani Hu*, Weihong Deng | N/A | |
| GAURA: Generalizable Approach for Unified Restoration and Rendering of Arbitrary Views | Vinayak Gupta*, Rongali Simhachala Venkata Girish, Mukund Varma T, Ayush Tewari, Kaushik Mitra | N/A | |
| Self-Supervised Any-Point Tracking by Contrastive Random Walks | Ayush Shrivastava*, Andrew Owens | N/A | |
| MixDQ: Memory-Efficient Few-Step Text-to-Image Diffusion Models with Metric-Decoupled Mixed Precision Quantization | Tianchen Zhao*, Xuefei Ning, Tongcheng Fang, Enshu Liu, Guyue Huang, Zinan Lin, Shengen Yan, Guohao Dai, Yu Wang | N/A | |
| Siamese Vision Transformers are Scalable Audio-visual Learners | Yan-Bo Lin*, Gedas Bertasius | N/A | |
| LCM-Lookahead for Encoder-based Text-to-Image Personalization | Rinon Gal*, Or Lichter, Elad Richardson, Or Patashnik, Amit Bermano, Gal Chechik, Danny Cohen-Or | N/A | |
| Towards Architecture-Agnostic Untrained Networks Priors for Image Reconstruction with Frequency Regularization | Yilin Liu, Yunkui Pang, Jiang Li, Yong Chen, Pew-Thian Yap* | N/A | |
| Towards Open-Ended Visual Recognition with Large Language Models | Qihang Yu*, Xiaohui Shen, Liang-Chieh Chen | N/A | |
| Ray-Distance Volume Rendering for Neural Scene Reconstruction | Ruihong Yin*, Yunlu Chen, Sezer Karaoglu, Theo Gevers | N/A | |
| ReNoise: Real Image Inversion Through Iterative Noising | Daniel Garibi*, Or Patashnik, Andrey Voynov, Hadar Averbuch-Elor, Danny Cohen-Or | N/A | |
| Attention Decomposition for Cross-Domain Semantic Segmentation | Liqiang He*, Sinisa Todorovic | N/A | |
| Be Yourself: Bounded Attention for Multi-Subject Text-to-Image Generation | Omer Dahary*, Or Patashnik, Kfir Aberman, Danny Cohen-Or | N/A | |
| Handling The Non-Smooth Challenge in Tensor SVD: A Multi-Objective Tensor Recovery Framework | Jingjing Zheng, Wanglong Lu, Wenzhe Wang, Yankai Cao*, Xiaoqin Zhang, Xianta Jiang | N/A | |
| RodinHD: High-Fidelity 3D Avatar Generation with Diffusion Models | Bowen Zhang, Yiji Cheng, Chunyu Wang*, Ting Zhang, Jiaolong Yang, Yansong Tang, Feng Zhao, Dong Chen, Baining Guo | N/A | |
| GRM: Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation | Yinghao Xu*, Zifan Shi, Wang Yifan, Hansheng Chen, Ceyuan Yang, Sida Peng, Yujun Shen, Gordon Wetzstein | N/A | |
| IRGen: Generative Modeling for Image Retrieval | Yidan Zhang, Ting Zhang, Dong Chen, Yujing Wang, Qi Chen, Xing Xie, Hao Sun, Weiwei Deng, Qi Zhang, Fan Yang, Mao Yang, Qingmin Liao, Jingdong Wang, Baining Guo | N/A | |
| Learning Trimodal Relation for Audio-Visual Question Answering with Missing Modality | Kyu Ri Park, Hong Joo Lee, Jung Uk Kim | N/A | |
| FastCAD: Real-Time CAD Retrieval and Alignment from Scans and Videos | Florian Maximilian Langer*, Jihong Ju, Georgi Dikov, Gerhard Reitmayr, Mohsen Ghafoorian | N/A | |
| A Simple Latent Diffusion Approach for Panoptic Segmentation and Mask Inpainting | Wouter Van Gansbeke*, Bert De Brabandere | N/A | |
| VISA: Reasoning Video Object Segmentation via Large Language Model | Cilin Yan, Haochen Wang, Shilin Yan, Xiaolong Jiang, Yao Hu, Guoliang Kang*, Weidi Xie, Efstratios Gavves | N/A | |
| Lego: Learning to Disentangle and Invert Personalized Concepts Beyond Object Appearance in Text-to-Image Diffusion Models | Saman Motamed*, Danda Pani Paudel, Luc Van Gool | N/A | |
| IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation | Yuanhao Zhai*, Kevin Lin, Linjie Li, Chung-Ching Lin, Jianfeng Wang, Zhengyuan Yang, David Doermann, Junsong Yuan, Zicheng Liu, Lijuan Wang | N/A | |
| Scaling Backwards: Minimal Synthetic Pre-training? | Ryo Nakamura, Ryu Tadokoro, Ryosuke Yamada, Yuki M Asano, Iro Laina, Christian Rupprecht, Nakamasa Inoue, Rio Yokota, Hirokatsu Kataoka* | N/A | |
| BAMM: Bidirectional Autoregressive Motion Model | Ekkasit Pinyoanuntapong*, Muhammad Usama Saleem, Pu Wang, Minwoo Lee, Srijan Das, Chen Chen | N/A | |
| Event-based Head Pose Estimation: Benchmark and Method | Jiahui Yuan, Hebei Li, Yansong Peng, Jin Wang, Yuheng Jiang, Yueyi Zhang, Xiaoyan Sun | N/A | |
| Avatar Fingerprinting for Authorized Use of Synthetic Talking-Head Videos | Ekta Prashnani*, Koki Nagano, Shalini De Mello, David P Luebke, Orazio Gallo | N/A | |
| Towards Multi-modal Transformers in Federated Learning | Guangyu Sun*, Matias Mendieta, Aritra Dutta, Xin Li, Chen Chen | N/A | |
| Fisher Calibration for Backdoor-Robust Heterogeneous Federated Learning | Wenke Huang, Mang Ye, Zekun Shi, Bo Du, Dacheng Tao | N/A | |
| QueryCDR: Query-based Controllable Distortion Rectification Network for Fisheye Images | Pengbo Guo, Chengxu Liu, Xingsong Hou*, Xueming Qian | N/A | |
| Latent-INR: A Flexible Framework for Implicit Representations of Videos with Discriminative Semantics | Shishira R Maiya*, Anubhav Gupta, Matthew A Gwilliam, Max Ehrlich, Abhinav Shrivastava | N/A | |
| DCDM: Diffusion-Conditioned-Diffusion Model for Scene Text Image Super-Resolution | Shrey Singh, Prateek Keserwani, Masakazu Iwamura, Partha Pratim Roy | N/A | |
| Per-Gaussian Embedding-Based Deformation for Deformable 3D Gaussian Splatting | Jeongmin Bae, Seoha Kim, Youngsik Yun, Hahyun Lee, Gun Bang, Youngjung Uh* | N/A | |
| DreamMover: Leveraging the Prior of Diffusion Models for Image Interpolation with Large Motion | Liao Shen, Tianqi Liu, Huiqiang Sun, Xinyi Ye, Baopu Li, Jianming Zhang, Zhiguo Cao* | N/A | |
| CoLA: Conditional Dropout and Language-driven Robust Dual-modal Salient Object Detection | Shuang Hao, Chunlin Zhong, He Tang* | N/A | |
| Image-Feature Weak-to-Strong Consistency: An Enhanced Paradigm for Semi-Supervised Learning | Zhiyu Wu, Jinshi Cui | N/A | |
| RPBG: Towards Robust Neural Point-based Graphics in the Wild | Qingtian Zhu, Zizhuang Wei, Zhongtian Zheng, Yifan Zhan, Zhuyu Yao, Jiawang Zhang, Kejian Wu, Yinqiang Zheng* | N/A | |
| GaussReg: Fast 3D Registration with Gaussian Splatting | Jiahao Chang*, Yinglin Xu, Yihao Li, Yuantao Chen, Wensen Feng, Xiaoguang Han | N/A | |
| Efficient Diffusion Transformer with Step-wise Dynamic Attention Mediators | Yifan Pu, Zhuofan Xia, Jiayi Guo, Dongchen Han, Qixiu Li, Duo Li, Yuhui Yuan, Ji Li, Yizeng Han, Shiji Song, Gao Huang, Xiu Li* | N/A | |
| Open Vocabulary 3D Scene Understanding via Geometry Guided Self-Distillation | Pengfei Wang*, Yuxi Wang, Shuai Li, Zhaoxiang Zhang, Zhen Lei, Lei Zhang | N/A | |
| IAM-VFI: Interpolate Any Motion for Video Frame Interpolation with motion complexity map | Kihwan Yoon, Yong Han Kim, Sungjei Kim, Jinwoo Jeong* | N/A | |
| TIP: Tabular-Image Pre-training for Multimodal Classification with Incomplete Data | Siyi Du, Shaoming Zheng, Yinsong Wang, Wenjia Bai, Declan P. O'Regan, Chen Qin | N/A | |
| Diffusion Model is a Good Pose Estimator from 3D RF-Vision | Junqiao Fan, Jianfei Yang*, Yuecong Xu, Lihua Xie | N/A | |
| UPose3D: Uncertainty-Aware 3D Human Pose Estimation with Cross-View and Temporal Cues | Vandad Davoodnia*, Saeed Ghorbani, Marc-André Carbonneau, Alexandre Messier, Ali Etemad | N/A | |
| Learning 3D-aware GANs from Unposed Images with Template Feature Field | Xinya Chen, Hanlei Guo, Yanrui Bin, Shangzhan Zhang, Yuanbo Yang, Yujun Shen, Yue Wang, Yiyi Liao* | N/A | |
| TAPTR: Tracking Any Point with Transformers as Detection | Hongyang Li, Hao Zhang, Shilong Liu, Zhaoyang Zeng, Tianhe Ren, Feng Li, Lei Zhang | N/A | |
| Token Compensator: Altering Inference Cost of Vision Transformer without Re-Tuning | Shibo Jie, Yehui Tang, Jianyuan Guo, Zhi-Hong Deng, Kai Han, Yunhe Wang* | N/A | |
| Point-supervised Panoptic Segmentation via Estimating Pseudo Labels from Learnable Distance | Jing Li, Junsong Fan, Zhaoxiang Zhang | N/A | |
| BRAVE: Broadening the visual encoding of vision-language models | Oğuzhan Fatih Kar, Alessio Tonioni, Petra Poklukar, Achin Kulshrestha, Amir Zamir, Federico Tombari | N/A | |
| HUMOS: Human Motion Model Conditioned on Body Shape | Shashank Tripathi, Omid Taheri, Christoph Lassner, Michael J. Black, Daniel Holden, Carsten Stoll* | N/A | |
| Omni-Recon: Harnessing Image-based Rendering for General-Purpose Neural Radiance Fields | Yonggan Fu, Huaizhi Qu, Zhifan Ye, Chaojian Li, Kevin Zhao, Yingyan (Celine) Lin* | N/A | |
| MVDiffHD: A Dense High-resolution Multi-view Diffusion Model for Single or Sparse-view 3D Object Reconstruction | Shitao Tang*, Jiacheng Chen, Dilin Wang, Chengzhou Tang, Fuyang Zhang, Yuchen Fan, Vikas Chandra, Yasutaka Furukawa, Rakesh Ranjan | N/A | |
| FlowCon: Out-of-Distribution Detection using Flow-based Contrastive Learning | Saandeep Aathreya, Shaun Canavan | N/A | |
| LEIA: Latent View-invariant Embeddings for Implicit 3D Articulation | Archana Swaminathan*, Anubhav Gupta, Kamal Gupta, Shishira R Maiya, Vatsal Agarwal, Abhinav Shrivastava | N/A | |
| Un-EVIMO: Unsupervised Event-based Independent Motion Segmentation | Ziyun Wang*, Jinyuan Guo, Kostas Daniilidis | N/A | |
| Seeing the Unseen: A Frequency Prompt Guided Transformer for Image Restoration | Shihao Zhou, Jinshan Pan, Jinglei Shi*, Duosheng Chen, Lishen Qu, Jufeng Yang | N/A | |
| CityGaussian: Real-time High-quality Large-Scale Scene Rendering with Gaussians | Yang Liu, Chuanchen Luo, Lue Fan, Naiyan Wang, Junran Peng, Zhaoxiang Zhang | N/A | |
| Bayesian Evidential Deep Learning for Online Action Detection | Hongji Guo, Hanjing Wang, Qiang Ji* | N/A | |
| AdaNAT: Exploring Adaptive Policy for Token-Based Image Generation | Zanlin Ni, Yulin Wang, Renping Zhou, Rui Lu, Jiayi Guo, Jinyi Hu, Zhiyuan Liu, Yuan Yao, Gao Huang | N/A | |
| Rethinking Data Augmentation for Robust LiDAR Semantic Segmentation in Adverse Weather | Junsung Park, Kyungmin Kim, Hyunjung Shim* | N/A | |
| Diffusion-Generated Pseudo-Observations for High-Quality Sparse-View Reconstruction | Xinhang Liu*, Jiaben Chen, Shiu-Hong Kao, Yu-Wing Tai, Chi-Keung Tang | N/A | |
| Memory-Efficient Fine-Tuning for Quantized Diffusion Model | Hyogon Ryu, Seohyun Lim, Hyunjung Shim* | N/A | |
| VCD-Texture: Variance Alignment based 3D-2D Co-Denoising for Text-Guided Texturing | Shang Liu, Chaohui Yu, Chenjie Cao, Wen Qian, Fan Wang | N/A | |
| MotionLCM: Real-time Controllable Motion Generation via Latent Consistency Model | Wenxun Dai, Ling-Hao Chen, Jingbo Wang, Jinpeng Liu, Bo Dai, Yansong Tang | N/A | |
| Human Hair Reconstruction with Strand-Aligned 3D Gaussians | Egor Zakharov*, Vanessa Sklyarova, Michael J. Black, Giljoo Nam, Justus Thies, Otmar Hilliges | N/A | |
| COIN: Control-Inpainting Diffusion Prior for Human and Camera Motion Estimation | Jiefeng Li, Ye Yuan, Davis Rempe, Haotian Zhang, Pavlo Molchanov, Cewu Lu, Jan Kautz, Umar Iqbal | N/A | |
| SA-DVAE: Improving Zero-Shot Skeleton-Based Action Recognition by Disentangled Variational Autoencoders | Sheng-Wei Li, Zi-Xiang Wei, Wei-Jie Chen, Yi-Hsin Yu, Chih-Yuan Yang, Jane Yung-jen Hsu | N/A | |
| Bridge Past and Future: Overcoming Information Asymmetry in Incremental Object Detection | Qijie Mo, Yipeng Gao, Shenghao Fu, Junkai Yan, Ancong Wu, Wei-Shi Zheng | N/A | |
| Global-to-Pixel Regression for Human Mesh Recovery | Yabo Xiao, Mingshu He*, Dongdong Yu | N/A | |
| Visible and Clear: Finding Tiny Objects in Difference Map | Bing Cao, Haiyu Yao, Pengfei Zhu*, Qinghua Hu | N/A | |
| Rethinking Image Super Resolution from Training Data Perspectives | Go Ohtani*, Ryu Tadokoro, Ryosuke Yamada, Yuki M Asano, Iro Laina, Christian Rupprecht, Nakamasa Inoue, Rio Yokota, Hirokatsu Kataoka, Yoshimitsu Aoki | N/A | |
| BlazeBVD: Make Scale-Time Equalization Great Again for Blind Video Deflickering | Xinmin Qiu, Congying Han, Zicheng Zhang, Bonan Li*, Tiande Guo, Pingyu Wang, Xuecheng Nie | N/A | |
| Efficient Inference of Vision Instruction-Following Models with Elastic Cache | Zuyan Liu, Benlin Liu, Jiahui Wang, Yuhao Dong, Guangyi Chen, Yongming Rao, Ranjay Krishna, Jiwen Lu* | N/A | |
| FreeCompose: Generic Zero-Shot Image Composition with Diffusion Prior | Zhekai Chen, Wen Wang, Zhen Yang, Zeqing Yuan, Hao Chen, Chunhua Shen | N/A | |
| Learning to Robustly Reconstruct Dynamic Scenes from Low-light Spike Streams | Liwen Hu, Ziluo Ding, Mianzhi Liu, Lei Ma, Tiejun Huang | N/A | |
| MarvelOVD: Marrying Object Recognition and Vision-Language Models for Robust Open-Vocabulary Object Detection | Kuo Wang, Lechao Cheng, Weikai Chen, Pingping Zhang, Liang Lin, Fan Zhou, Guanbin Li | N/A | |
| WildVidFit: Video Virtual Try-On in the Wild via Image-Based Controlled Diffusion Models | Zijian He, Peixin Chen, Guangrun Wang, Guanbin Li*, Philip Torr, Liang Lin | N/A | |
| Interactive 3D Object Detection with Prompts | Ruifei Zhang, Xiangru Lin, Wei Zhang, Jincheng Lu, Xuekuan Wang, Xiao Tan, Yingying Li, Errui Ding, Jingdong Wang, Guanbin Li* | N/A | |
| How Video Meetings Change Your Expression | Sumit Sarin*, Utkarsh Mall, Purva Tendulkar, Carl Vondrick | N/A | |
| GRACE: Graph-Based Contextual Debiasing for Fair Visual Question Answering | Yifeng Zhang, Ming Jiang, Qi Zhao* | N/A | |
| Neural Volumetric World Models for Autonomous Driving | Zanming Huang, Jimuyang Zhang, Eshed Ohn-Bar* | N/A | |
| IVTP: Instruction-guided Visual Token Pruning for Large Vision-Language Models | Kai Huang*, Hao Zou, Ye Xi, Bochen Wang, Zhen Xie, Liang Yu | N/A | |
| RegionDrag: Fast Region-Based Image Editing with Diffusion Models | Jingyi Lu, Xinghui Li, Kai Han* | N/A | |
| On the Error Analysis of 3D Gaussian Splatting and an Optimal Projection Strategy | Letian Huang, Jiayang Bai, Jie Guo*, Yuanqi Li, Yanwen Guo | N/A | |
| Bad Students Make Great Teachers: Active Learning Accelerates Large-Scale Visual Understanding | Talfan Evans, Shreya Pathak, Hamza Merzic, Jonathan Richard Schwarz, Ryutaro Tanno, Olivier Henaff | N/A | |
| Analytic-Splatting: Anti-Aliased 3D Gaussian Splatting via Analytic Integration | Zhihao Liang, Qi Zhang, Wenbo Hu, Ying Feng, Lei Zhu, Kui Jia* | N/A | |
| GRA: Detecting Oriented Objects through Group-wise Rotating and Attention | Jiangshan Wang, Yifan Pu, Yizeng Han, Jiayi Guo, Yiru Wang, Xiu Li, Gao Huang* | N/A | |
| Portrait4D-v2: Pseudo Multi-View Data Creates Better 4D Head Synthesizer | Yu Deng*, Duomin Wang, Baoyuan Wang | N/A | |
| CSOT: Cross-Scan Object Transfer for Semi-Supervised LiDAR Object Detection | Jinglin Zhan, Tiejun Liu, Rengang Li, Zhaoxiang Zhang, Yuntao Chen* | N/A | |
| Learning from the Web: Language Drives Weakly-Supervised Incremental Learning for Semantic Segmentation | Chang Liu, Giulia Rizzoli, Pietro Zanuttigh, Fu Li, Yi Niu* | N/A | |
| ShareGPT4V: Improving Large Multi-Modal Models with Better Captions | Lin Chen, Jinsong Li, Xiaoyi Dong, Pan Zhang, Conghui He, Jiaqi Wang, Feng Zhao, Dahua Lin* | N/A | |
| Eyes Closed, Safety On: Protecting Multimodal LLMs via Image-to-Text Transformation | Yunhao Gou, Kai Chen, Zhili Liu, Lanqing Hong, Hang Xu, Zhenguo Li, Dit-Yan Yeung, James Kwok, Yu Zhang | N/A | |
| Invertible Neural Warp for NeRF | Shin-Fang Chng*, Ravi Garg, Hemanth Saratchandran, Simon Lucey | N/A | |
| Enhancing Vectorized Map Perception with Historical Rasterized Maps | Xiaoyu Zhang, Guangwei Liu, Zihao Liu, Ningyi Xu, Yunhui Liu*, Ji Zhao | N/A | |
| Efficient and Versatile Robust Fine-Tuning of Zero-shot Models | Sungyeon Kim, Boseung Jeong, Donghyun Kim, Suha Kwak | N/A | |
| Part2Object: Hierarchical Unsupervised 3D Instance Segmentation | Cheng Shi, Yulin Zhang, Bin Yang, Jiajin Tang, Yuexin Ma, Sibei Yang* | N/A | |
| PetFace: A Large-Scale Dataset and Benchmark for Animal Identification | Risa Shinoda*, Kaede Shiohara | N/A | |
| MVSGaussian: Fast Generalizable Gaussian Splatting Reconstruction from Multi-View Stereo | Tianqi Liu, Guangcong Wang, Shoukang Hu, Liao Shen, Xinyi Ye, Yuhang Zang, Zhiguo Cao*, Wei Li, Ziwei Liu | N/A | |
| Zero-Shot Detection of AI-Generated Images | Davide Cozzolino, Giovanni Poggi, Matthias Niessner, Luisa Verdoliva* | N/A | |
| Language-Image Pre-training with Long Captions | Kecheng Zheng*, Yifei Zhang, Wei Wu, Fan Lu, Shuailei Ma, Xin Jin, Wei Chen, Yujun Shen | N/A | |
| GKGNet: Group K-Nearest Neighbor based Graph Convolutional Network for Multi-Label Image Recognition | Ruijie Yao, Sheng Jin, Lumin Xu, Wang Zeng, Wentao Liu, Chen Qian, Ping Luo, Ji Wu | N/A | |
| DISCO: Embodied Navigation and Interaction via Differentiable Scene Semantics and Dual-level Control | Xinyu Xu, Shengcheng Luo, Yanchao Yang, Yong-Lu Li, Cewu Lu* | N/A | |
| You Only Learn One Query: Learning Unified Human Query for Single-Stage Multi-Person Multi-Task Human-Centric Perception | Sheng Jin, Shuhuai Li, Tong Li, Wentao Liu, Chen Qian, Ping Luo | N/A | |
| Towards Real-World Adverse Weather Image Restoration: Enhancing Clearness and Semantics with Vision-Language Models | Jiaqi Xu, Mengyang Wu, Xiaowei Hu, Chi-Wing Fu, Qi Dou, Pheng-Ann Heng | N/A | |
| Facial Affective Behavior Analysis with Instruction Tuning | Yifan Li*, Anh Dao, Wentao Bao, Zhen Tan, Tianlong Chen, Huan Liu, Yu Kong | N/A | |
| CoReS: Orchestrating the Dance of Reasoning and Segmentation | Xiaoyi Bao, Siyang Sun, Shuailei Ma, Kecheng Zheng, Yuxin Guo, Guosheng Zhao, Yun Zheng, Xingang Wang* | N/A | |
| MagDiff: Multi-Alignment Diffusion for High-Fidelity Video Generation and Editing | Haoyu Zhao, Tianyi Lu, Jiaxi Gu, Xing Zhang, Qingping Zheng, Zuxuan Wu*, Hang Xu, Yu-Gang Jiang | N/A | |
| MambaIR: A Simple Baseline for Image Restoration with State-Space Model | Hang Guo, Jinmin Li, Tao Dai, Zhihao Ouyang, Xudong Ren, Shu-Tao Xia | N/A | |
| I Can't Believe It's Not Scene Flow! | Ishan Khatri, Kyle Vedder, Neehar Peri, Deva Ramanan, James Hays | N/A | |
| Rethinking Unsupervised Outlier Detection via Multiple Thresholding | Zhonghang Liu*, Panzhong Lu, Guoyang Xie, Zhichao Lu, Wen-Yan Lin | N/A | |
| Compress3D: a Compressed Latent Space for 3D Generation from a Single Image | Bowen Zhang, Tianyu Yang, Yu Li, Lei Zhang, Xi Zhao* | N/A | |
| Scalable Group Choreography via Variational Phase Manifold Learning | Nhat Le, Khoa Do, Xuan Bui, Tuong Do, Erman Tjiputra, Quang D. Tran, Anh Nguyen* | N/A | |
| Masked Video and Body-worn IMU Autoencoder for Egocentric Action Recognition | Mingfang Zhang, Yifei Huang*, Ruicong Liu, Yoichi Sato | N/A | |
| Mutual Learning for Acoustic Matching and Dereverberation via Visual Scene-driven Diffusion | Jian Ma, Wenguan Wang*, Yi Yang, Feng Zheng | N/A | |
| PoseSOR: Human Pose Can Guide Our Attention | Huankang Guan, Rynson W.H. Lau* | N/A | |
| TOD3Cap: Towards 3D Dense Captioning in Outdoor Scenes | Bu Jin, Yupeng Zheng*, Pengfei Li, Weize Li, Yuhang Zheng, Sujie Hu, Xinyu Liu, Jinwei Zhu, Zhijie Yan, Haiyang Sun, Kun Zhan, Peng Jia, Xiaoxiao Long, Yilun Chen, Hao Zhao | N/A | |
| Bi-directional Contextual Attention for 3D Dense Captioning | Minjung Kim, Hyung Suk Lim, Soonyoung Lee, Bumsoo Kim, Gunhee Kim* | N/A | |
| Multi-Person Pose Forecasting with Individual Interaction Perceptron and Prior Learning | Peng Xiao, Yi Xie, Xuemiao Xu, Weihong Chen, Huaidong Zhang | N/A | |
| InfMAE: A Foundation Model in The Infrared Modality | Fangcen Liu, Chenqiang Gao*, Yaming Zhang, Junjie Guo, Jinghao Wang, Deyu Meng | N/A | |
| TPA3D: Triplane Attention for Fast Text-to-3D Generation | Bin-Shih Wu, Hong-En Chen, Sheng-Yu Huang, Yu-Chiang Frank Wang | N/A | |
| Multi-Memory Matching for Unsupervised Visible-Infrared Person Re-Identification | Jiangming Shi, Xiangbo Yin, Yeyun Chen, Yachao Zhang, Zhizhong Zhang, Yuan Xie, Yanyun Qu | N/A | |
| LivePhoto: Real Image Animation with Text-guided Motion Control | Xi Chen, Zhiheng Liu, Mengting Chen, Yutong Feng, Yu Liu, Yujun Shen, Hengshuang Zhao* | N/A | |
| NeuSDFusion: A Spatial-Aware Generative Model for 3D Shape Completion, Reconstruction, and Generation | Ruikai Cui, Weizhe Liu*, Weixuan Sun, Senbo Wang, Taizhang Shang, Yang Li, Xibin Song, Han Yan, Zhennan Wu, Shenzhou Chen, Hongdong Li, Pan Ji | N/A | |
| AID-AppEAL: Automatic Image Dataset and Algorithm for Content Appeal Enhancement and Assessment Labeling | Sherry X. Chen*, Yaron Vaxman, Elad Ben Baruch, David Asulin, Aviad Moreshet, Misha Sra, Pradeep Sen | N/A | |
| SEDiff: Structure Extraction for Domain Adaptive Depth Estimation via Denoising Diffusion Models | Dongseok Shim, Hyoun Jin Kim | N/A | |
| Quantized Prompt for Efficient Generalization of Vision-Language Models | Tianxiang Hao, Xiaohan Ding, Juexiao Feng, Yuhong Yang, Hui Chen, Guiguang Ding | N/A | |
| Online Temporal Action Localization with Memory-Augmented Transformer | Youngkil Song, Dongkeun Kim, Minsu Cho, Suha Kwak* | N/A | |
| Efficient Cascaded Multiscale Adaptive Network for Image Restoration | Yichen Zhou, Pan Zhou, Teck Khim Ng | N/A | |
| MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model | Muyao Niu, Xiaodong Cun, Xintao Wang, Yong Zhang, Ying Shan, Yinqiang Zheng | N/A | |
| Occlusion-Aware Seamless Segmentation | Yihong Cao, Jiaming Zhang, Hao Shi, Kunyu Peng, Yuhongxuan Zhang, Hui Zhang, Rainer Stiefelhagen, Kailun Yang | N/A | |
| OpenKD: Opening Prompt Diversity for Zero- and Few-shot Keypoint Detection | Changsheng Lu, Zheyuan Liu, Piotr Koniusz | N/A | |
| Referring Atomic Video Action Recognition | Kunyu Peng*, Jia Fu, Kailun Yang, Di Wen, Yufan Chen, Ruiping Liu, Junwei Zheng, Jiaming Zhang, Saquib Sarfraz, Rainer Stiefelhagen, Alina Roitberg | N/A | |
| Agent3D-Zero: An Agent for Zero-shot 3D Understanding | Sha Zhang, Di Huang, Jiajun Deng, Shixiang Tang, Wanli Ouyang, Tong He, Yanyong Zhang* | N/A | |
| Stream Query Denoising for Vectorized HD-Map Construction | Shuo Wang, Fan Jia, Weixin Mao, Yingfei Liu, Yucheng Zhao, Zehui Chen, Tiancai Wang, Chi Zhang, Xiangyu Zhang, Feng Zhao | N/A | |
| SAGS: Structure-Aware 3D Gaussian Splatting | Evangelos Ververas, Rolandos Alexandros Potamias*, Jifei Song, Jiankang Deng, Stefanos Zafeiriou | N/A | |
| Spherical Linear Interpolation and Text-Anchoring for Zero-shot Composed Image Retrieval | Young Kyun Jang, Dat B Huynh, Ashish Shah, Wen-Kai Chen, Ser-Nam Lim | N/A | |
| OneRestore: A Universal Restoration Framework for Composite Degradation | Yu Guo*, Yuan Gao, Yuxu Lu, Huilin Zhu, Wen Liu, Shengfeng He | N/A | |
| Beat-It: Beat-Synchronized Multi-Condition 3D Dance Generation | Zikai Huang, Xuemiao Xu, Cheng Xu, Huaidong Zhang, Chenxi Zheng, Jing Qin, Shengfeng He | N/A | |
| SkyMask: Attack-agnostic Robust Federated Learning with Fine-grained Learnable Masks | Peishen Yan, Hao Wang, Tao Song*, Yang Hua, Ruhui Ma, Ningxin Hu, Mohammad Reza Haghighat, Haibing Guan | N/A | |
| RePOSE: 3D Human Pose Estimation via Spatio-Temporal Depth Relational Consistency | Ziming Sun, Yuan Liang, Zejun Ma, Tianle Zhang, Linchao Bao, Guiqing Li, Shengfeng He* | N/A | |
| Pixel-GS: Density Control with Pixel-aware Gradient for 3D Gaussian Splatting | Zheng Zhang, Wenbo Hu, Yixing Lao, Tong He, Hengshuang Zhao | N/A | |
| WorldPose: A World Cup Dataset for Global 3D Human Pose Estimation | Tianjian Jiang*, Johsan Billingham, Sebastian Müksch, Juan J Zarate, Nicolas Evans, Martin R. Oswald, Marc Pollefeys, Otmar Hilliges, Manuel Kaufmann, Jie Song | N/A | |
| Language-Driven 6-DoF Grasp Detection Using Negative Prompt Guidance | Toan Nguyen, Minh Nhat Vu, Baoru Huang, An Dinh Vuong, Quan Vuong, Ngan Le, Thieu Vo, Anh Nguyen* | N/A | |
| COIN-Matting: Confounder Intervention for Image Matting | Zhaohe Liao, Jiangtong Li, Jun Lan, Huijia Zhu, Weiqiang Wang, Li Niu, Liqing Zhang | N/A | |
| SHINE: Saliency-aware HIerarchical NEgative Ranking for Compositional Temporal Grounding | Zixu Cheng, Yujiang Pu, Shaogang Gong, Parisa Kordjamshidi, Yu Kong | N/A | |
| Audio-driven Talking Face Generation with Stabilized Synchronization Loss | Dogucan Yaman*, Fevziye Irem Eyiokur, Leonard Bärmann, Hazim Kemal Ekenel, Alexander Waibel | N/A | |
| Propose, Assess, Search: Harnessing LLMs for Goal-Oriented Planning in Instructional Videos | Md Mohaiminul Islam*, Tushar Nagarajan, Huiyu Wang, Fu-Jen Chu, Kris Kitani, Gedas Bertasius, Xitong Yang | N/A | |
| Train Till You Drop: Towards Stable and Robust Source-free Unsupervised 3D Domain Adaptation | Björn Michele*, Alexandre Boulch, Tuan-Hung Vu, Gilles Puy, Renaud Marlet, Nicolas Courty | N/A | |
| Learning to Obstruct Few-Shot Image Classification over Restricted Classes | Amber Yijia Zheng, Chiao-An Yang, Raymond A. Yeh | N/A | |
| RoofDiffusion: Constructing Roofs from Severely Corrupted Point Data via Diffusion | Kyle Shih-Huang Lo*, Jorg Peters, Eric Spellman | N/A | |
| L-DiffER: Single Image Reflection Removal with Language-based Diffusion Model | Yuchen Hong, Haofeng Zhong, Shuchen Weng, Jinxiu S Liang, Boxin Shi | N/A | |
| AdaShield: Safeguarding Multimodal Large Language Models from Structure-based Attack via Adaptive Shield Prompting | Yu Wang, Xiaogeng Liu, Yu Li, Muhao Chen, Chaowei Xiao | N/A | |
| OccGen: Generative Multi-modal 3D Occupancy Prediction for Autonomous Driving | Guoqing Wang, Zhongdao Wang, Pin Tang, Jilai Zheng, Xiangxuan Ren, Bailan Feng, Chao Ma* | N/A | |
| CrossGLG: LLM Guides One-shot Skeleton-based 3D Action Recognition in a Cross-level Manner | Tingbing Yan, Wenzheng Zeng, Yang Xiao, Xingyu Tong, Bo Tan, Zhiwen Fang, Zhiguo Cao, Joey Tianyi Zhou | N/A | |
| HYDRA: A Hyper Agent for Dynamic Compositional Visual Reasoning | Fucai Ke*, Zhixi Cai, Simindokht Jahangard, Weiqing Wang, Pari Delir Haghighi, Hamid Rezatofighi | N/A | |
| BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion | Xuan Ju, Xian Liu, Xintao Wang, Yuxuan Bian, Ying Shan, Qiang Xu* | N/A | |
| LayoutDETR: Detection Transformer Is a Good Multimodal Layout Designer | Ning Yu*, Chia-chih Chen, Zeyuan Chen, Rui Meng, Gang Wu, Paul W Josel, Juan Carlos Niebles, Caiming Xiong, Ran Xu | N/A | |
| Blind image deblurring with noise-robust kernel estimation | Chanseok Lee, Jeongsol Kim, Seungmin Lee, Jaehwang Jung, Yunje Cho, Taejoong Kim, Taeyong Jo, Myungjun Lee, Mooseok Jang | N/A | |
| Binomial Self-compensation for Motion Error in Dynamic 3D Scanning | Geyou Zhang, Ce Zhu*, Kai Liu | N/A | |
| AddMe: Zero-shot Group-photo Synthesis by Inserting People into Scenes | Dongxu Yue, Maomao Li, Yunfei Liu, Ailing Zeng, Tianyu Yang, Qin Guo, Yu Li* | N/A | |
| Distill Gold from Massive Ores: Bi-level Data Pruning towards Efficient Dataset Distillation | Yue Xu, Yong-Lu Li*, Kaitong Cui, Ziyu Wang, Cewu Lu, Yu-Wing Tai, Chi-Keung Tang | N/A | |
| VersatileGaussian: Real-time Neural Rendering for Versatile Tasks using Gaussian Splatting | Renjie Li, Zhiwen Fan*, Bohua Wang, Peihao Wang, Zhangyang Wang, Xi Wu | N/A | |
| Momentum Auxiliary Network for Supervised Local Learning | Junhao Su, Changpeng Cai, Feiyu Zhu, Chenghao He, Xiaojie Xu, Dongzhi Guan, Chenyang Si | N/A | |
| HPFF: Hierarchical Locally Supervised Learning with Patch Feature Fusion | Junhao Su, Chenghao He, Feiyu Zhu, Xiaojie Xu, Dongzhi Guan, Chenyang Si* | N/A | |
| Rethinking LiDAR Domain Generalization: Single Source as Multiple Density Domains | Jaeyeul Kim, Jungwan Woo, Jeonghoon Kim, Sunghoon Im* | N/A | |
| Improving Zero-Shot Generalization for CLIP with Variational Adapter | Ziqian Lu, Fengli Shen, Mushui Liu, Yunlong Yu*, Xi Li | N/A | |
| Realistic Human Motion Generation with Cross-Diffusion Models | Zeping Ren, Shaoli Huang, Xiu Li | N/A | |
| EgoExo-Fitness: Towards Egocentric and Exocentric Full-Body Action Understanding | Yuan-Ming Li, Wei-Jin Huang, An-Lan Wang, Ling-An Zeng, Jing-Ke Meng, Wei-Shi Zheng | N/A | |
| Any Target Can be Offense: Adversarial Example Generation via Generalized Latent Infection | Youheng Sun, Shengming Yuan, Xuanhan Wang*, Lianli Gao, Jingkuan Song | N/A | |
| Towards Reliable Advertising Image Generation Using Human Feedback | Zhenbang Du*, Wei Feng, Haohan Wang, Yaoyu Li, Jingsen Wang, Jian Li, Zheng Zhang, Jingjing Lv, Xin Zhu, Junsheng Jin, Junjie Shen, Zhangang Lin, Jingping Shao | N/A | |
| Topology-Preserving Downsampling of Binary Images | Chia-Chia Chen, Chi-Han Peng | N/A | |
| ColorMAE: Exploring data-independent masking strategies in Masked AutoEncoders | Carlos Hinojosa*, Shuming Liu, Bernard Ghanem | N/A | |
| Classification Matters: Improving Video Action Detection with Class-Specific Attention | Jinsung Lee, Taeoh Kim, Inwoong Lee, Minho Shim, Dongyoon Wee, Minsu Cho, Suha Kwak* | N/A | |
| Improving Medical Multi-modal Contrastive Learning with Expert Annotations | Yogesh Kumar*, Pekka Marttinen | N/A | |
| Rethinking Data Bias: Dataset Copyright Protection via Embedding Class-wise Hidden Bias | Jinhyeok Jang*, ByungOk Han, Jaehong Kim, Chan-Hyun Youn | N/A | |
| Pose-Aware Self-Supervised Learning with Viewpoint Trajectory Regularization | Jiayun Wang*, Yubei Chen, Stella X. Yu | N/A | |
| SILC: Improving Vision Language Pretraining with Self-Distillation | Muhammad Ferjad Naeem*, Yongqin Xian, Xiaohua Zhai, Lukas Hoyer, Luc Van Gool, Federico Tombari | N/A | |
| Learning Semantic Latent Directions for Accurate and Controllable Human Motion Prediction | Guowei Xu, Jiale Tao, Wen Li*, Lixin Duan | N/A | |
| Leveraging temporal contextualization for video action recognition | Minji Kim, Dongyoon Han, Taekyung Kim, Bohyung Han | N/A | |
| ChEX: Interactive Localization and Region Description in Chest X-rays | Philip Müller*, Georgios Kaissis, Daniel Rueckert | N/A | |
| AdaGlimpse: Active Visual Exploration with Arbitrary Glimpse Position and Scale | Adam Pardyl, Michał Wronka, Maciej Wołczyk, Kamil Adamczewski, Tomasz Trzcinski, Bartosz Zieliński | N/A | |
| CLAP: Isolating Content from Style through Contrastive Learning with Augmented Prompts | Yichao Cai*, Yuhang Liu, Zhen Zhang, Javen Qinfeng Shi | N/A | |
| ZigMa: A DiT-style Zigzag Mamba Diffusion Model | Vincent Tao Hu*, Stefan A Baumann, Ming Gui, Olga Grebenkova, Pingchuan Ma, Johannes S Fischer, Bjorn Ommer | N/A | |
| EchoScene: Indoor Scene Generation via Information Echo over Scene Graph Diffusion | Guangyao Zhai*, Evin Pınar Örnek, Dave Zhenyu Chen, Ruotong Liao, Yan Di, Nassir Navab, Federico Tombari, Benjamin Busam | N/A | |
| On Calibration of Object Detectors: Pitfalls, Evaluation and Baselines | Selim Kuzucu, Kemal Oksuz, Jonathan Sadeghi, Puneet Dokania | N/A | |
| HAT: History-Augmented Anchor Transformer for Online Temporal Action Localization | Sakib Reza, Yuexi Zhang, Mohsen Moghaddam, Octavia Camps* | N/A | |
| Deep Nets with Subsampling Layers Unwittingly Discard Useful Activations at Test-Time | Chiao-An Yang*, Ziwei Liu, Raymond Yeh | N/A | |
| Safe-Sim: Safety-Critical Closed-Loop Traffic Simulation with Diffusion-Controllable Adversaries | Wei-Jer Chang*, Francesco Pittaluga, Masayoshi Tomizuka, Wei Zhan, Manmohan Chandraker | N/A | |
| Analysis-by-Synthesis Transformer for Single-View 3D Reconstruction | Dian Jia, Xiaoqian Ruan, Kun Xia, Zhiming Zou, Le Wang, Wei Tang* | N/A | |
| Challenging Forgets: Unveiling the Worst-Case Forget Sets in Machine Unlearning | Chongyu Fan, Jiancheng Liu*, Alfred Hero, Sijia Liu | N/A | |
| WaSt-3D: Wasserstein-2 Distance for Scene-to-Scene Stylization on 3D Gaussians | Dmytro Kotovenko, Olga Grebenkova, Nikolaos Sarafianos, Avinash Paliwal, Pingchuan Ma, Omid Poursaeed, Sreyas Mohan, Yuchen Fan, Yilei Li, Rakesh Ranjan, Bjorn Ommer | N/A | |
| SCLIP: Rethinking Self-Attention for Dense Vision-Language Inference | Feng Wang*, Jieru Mei, Alan Yuille | N/A | |
| Flying with Photons: Rendering Novel Views of Propagating Light | Anagh Malik*, Noah Juravsky, Ryan Po, Gordon Wetzstein, Kiriakos N. Kutulakos, David B. Lindell | N/A | |
| RGNet: A Unified Clip Retrieval and Grounding Network for Long Videos | Tanveer Hannan*, Md Mohaiminul Islam, Thomas Seidl, Gedas Bertasius | N/A | |
| MVSplat: Efficient 3D Gaussian Splatting from Sparse Multi-View Images | Yuedong Chen*, Haofei Xu, Chuanxia Zheng, Bohan Zhuang, Marc Pollefeys, Andreas Geiger, Tat-Jen Cham, Jianfei Cai | N/A | |
| 3DGazeNet: Generalizing Gaze Estimation with Weak Supervision from Synthetic Views | Evangelos Ververas*, Polydefkis Gkagkos, Jiankang Deng, Michail C Doukas, Jia Guo, Stefanos Zafeiriou | N/A | |
| Removing Distributional Discrepancies in Captions Improves Image-Text Alignment | Mu Cai, Haotian Liu, Yuheng Li*, Yijun Li, Eli Shechtman, Zhe Lin, Yong Jae Lee, Krishna Kumar Singh | N/A | |
| Resilience of Entropy Model in Distributed Neural Networks | Milin Zhang*, Mohammad Abdi, Shahriar Rifat, Francesco Restuccia | N/A | |
| Rejection Sampling IMLE: Designing Priors for Better Few-Shot Image Synthesis | Chirag Vashist*, Shichong Peng, Ke Li | N/A | |
| Implicit Concept Removal of Diffusion Models | Zhili Liu*, Kai Chen, Yifan Zhang, Jianhua Han, Lanqing Hong, Hang Xu, Zhenguo Li, Dit-Yan Yeung, James Kwok | N/A | |
| PLOT: Text-based Person Search with Part Slot Attention for Corresponding Part Discovery | Jicheol Park, Dongwon Kim, Boseung Jeong, Suha Kwak* | N/A | |
| GS-LRM: Large Reconstruction Model for 3D Gaussian Splatting | Kai Zhang*, Sai Bi, Hao Tan, Yuanbo Xiangli, Nanxuan Zhao, Kalyan Sunkavalli, Zexiang Xu | N/A | |
| Robust-Wide: Robust Watermarking against Instruction-driven Image Editing | Runyi Hu, Jie Zhang*, Ting Xu, Jiwei Li, Tianwei Zhang | N/A | |
| OAPT: Offset-Aware Partition Transformer for Double JPEG Artifacts Removal | Qiao Mo, Yukang Ding, Jinhua Hao, Qiang Zhu, Ming Sun, Chao Zhou, Feiyu Chen, Shuyuan Zhu | N/A | |
| Formula-Supervised Visual-Geometric Pre-training | Ryosuke Yamada, Kensho Hara, Hirokatsu Kataoka, Koshi Makihara, Nakamasa Inoue, Rio Yokota, Yutaka Satoh | N/A | |
| VideoAgent: A Memory-augmented Multimodal Agent for Video Understanding | Yue Fan, Xiaojian Ma, Rujie Wu, Yuntao Du, Jiaqi Li, Zhi Gao, Qing Li | N/A | |
| Towards Unified Representation of Invariant-Specific Features in Missing Modality Face Anti-Spoofing | Guanghao Zheng, Yuchen Liu, Wenrui Dai*, Chenglin Li, Junni Zou, Hongkai Xiong | N/A | |
| Restoring Images in Adverse Weather Conditions via Histogram Transformer | Shangquan Sun, Wenqi Ren*, Xinwei Gao, Rui Wang, Xiaochun Cao | N/A | |
| PosFormer: Recognizing Complex Handwritten Mathematical Expression with Position Forest Transformer | Tongkun Guan, Chengyu Lin, Wei Shen*, Xiaokang Yang | N/A | |
| NGP-RT: Fusing Multi-Level Hash Features with Lightweight Attention for Real-Time Novel View Synthesis | Yubin Hu, Xiaoyang Guo, Yang Xiao, Jingwei Huang, Yong-Jin Liu* | N/A | |
| Elysium: Exploring Object-level Perception in Videos through Semantic Integration Using MLLMs | Han Wang*, Yanjie Wang, Ye Yongjie, Yuxiang Nie, Can Huang | N/A | |
| G2fR: Frequency Regularization in Grid-based Feature Encoding Neural Radiance Fields | Shuxiang Xie*, Shuyi Zhou, Ken Sakurada, Ryoichi Ishikawa, Masaki Onishi, Takeshi Oishi | N/A | |
| Getting it Right: Improving Spatial Consistency in Text-to-Image Models | Agneet Chatterjee*, Gabriela Ben Melech Stan, Estelle Guez Aflalo, Sayak Paul, Dhruba Ghosh, Tejas Gokhale, Ludwig Schmidt, Hanna Hajishirzi, Vasudev Lal, Chitta R Baral, Yezhou Yang | N/A | |
| Generating 3D House Wireframes with Semantics | Xueqi Ma, Yilin Liu, Wenjun Zhou, Ruowei Wang, Hui Huang* | N/A | |
| GeoWizard: Unleashing the Diffusion Priors for 3D Geometry Estimation from a Single Image | Xiao Fu*, Wei Yin, Mu Hu, Kaixuan Wang, Yuexin Ma, Ping Tan, Shaojie Shen, Dahua Lin, Xiaoxiao Long | N/A | |
| Shape-guided Configuration-aware Learning for Endoscopic-image-based Pose Estimation of Flexible Robotic Instruments | Yiyao Ma, Kai Chen, Hon-Sing Tong, Ruofeng Wei, Yui-Lun Ng, Ka-Wai Kwok, Qi Dou | N/A | |
| Nonverbal Interaction Detection | Jianan Wei, Tianfei Zhou, Yi Yang, Wenguan Wang* | N/A | |
| UniM2AE: Multi-modal Masked Autoencoders with Unified 3D Representation for 3D Perception in Autonomous Driving | Jian Zou, Tianyu Huang, Guanglei Yang, Zhenhua Guo, Tao Luo, Chun-Mei Feng, Wangmeng Zuo | N/A | |
| Responsible Visual Editing | Minheng Ni, Yeli Shen, Lei Zhang, Wangmeng Zuo | N/A | |
| Drag Anything: Motion Control for Anything using Entity Representation | Weijia Wu, Zhuang Li, Yuchao Gu, Rui Zhao, Yefei He, David Junhao Zhang, Mike Zheng Shou*, Yan Li, Tingting Gao, Zhang Di | N/A | |
| SegPoint: Segment Any Point Cloud via Large Language Model | Shuting He, Henghui Ding, Xudong Jiang, Bihan Wen* | N/A | |
| Navigation Instruction Generation with BEV Perception and Large Language Models | Sheng Fan, Rui Liu, Wenguan Wang*, Yi Yang | N/A | |
| Rebalancing Using Estimated Class Distribution for Imbalanced Semi-Supervised Learning under Class Distribution Mismatch | Taemin Park, Hyuck Lee, Heeyoung Kim* | N/A | |
| Vista3D: Unravel the 3D Darkside of a Single Image | Qiuhong Shen, Xingyi Yang, Michael Bi Mi, Xinchao Wang* | N/A | |
| The Fabrication of Reality and Fantasy: Scene Generation with LLM-Assisted Prompt Interpretation | Yi Yao, Chan-Feng Hsu, Jhe-Hao Lin, Hongxia Xie, Terence Lin, Yi-Ning Huang, Hong-Han Shuai, Wen-Huang Cheng* | N/A | |
| Detecting As Labeling: Rethinking LiDAR-camera Fusion in 3D Object Detection | Junjie Huang*, Yun Ye, Zhujin Liang, Yi Shan, Dalong Du | N/A | |
| FlashSplat: 2D to 3D Gaussian Splatting Segmentation Solved Optimally | Qiuhong Shen, Xingyi Yang, Xinchao Wang* | N/A | |
| Exploiting Dual-Correlation for Multi-frame Time-of-Flight Denoising | Guanting Dong, Yueyi Zhang, Xiaoyan Sun, Zhiwei Xiong | N/A | |
| Weak-to-Strong Compositional Learning from Generative Models for Language-based Object Detection | Kwanyong Park, Kuniaki Saito, Donghyun Kim* | N/A | |
| Domesticating SAM for Breast Ultrasound Image Segmentation via Spatial-frequency Fusion and Uncertainty Correction | Wanting Zhang, Huisi Wu*, Jing Qin | N/A | |
| CanonicalFusion: Generating Drivable 3D Human Avatars from Multiple Images | Jisu Shin, Junmyeong Lee, Seongmin Lee, Min-Gyu Park, Jumi Kang, Ju Hong Yoon, Hae-Gon Jeon* | N/A | |
| Camera Height Doesn't Change: Unsupervised Training for Metric Monocular Road-Scene Depth Estimation | Genki Kinoshita*, Ko Nishino | N/A | |
| Uni3DL: A Unified Model for 3D Vision-Language Understanding | Xiang Li*, Jian Ding, Zhaoyang Chen, Mohamed Elhoseiny | N/A | |
| Object-Aware NIR-to-Visible Translation | Yunyi Gao, Lin Gu, Qiankun Liu, Ying Fu* | N/A | |
| PaPr: Training-Free One-Step Patch Pruning with Lightweight ConvNets for Faster Inference | Tanvir Mahmud*, Burhaneddin Yaman, Chun-Hao Liu, Diana Marculescu | N/A | |
| GENIXER: Empowering Multimodal Large Language Models as a Powerful Data Generator | Henry Hengyuan Zhao, Pan Zhou, Mike Zheng Shou* | N/A | |
| BLINK: Multimodal Large Language Models Can See but Not Perceive | Xingyu Fu, Yushi Hu, Bangzheng Li, Yu Feng, Haoyu Wang, Xudong Lin, Dan Roth, Noah A Smith, Wei-Chiu Ma, Ranjay Krishna | N/A | |
| AFF-ttention! Affordances and Attention models for Short-Term Object Interaction Anticipation | Lorenzo Mur-Labadia*, Ruben Martinez-Cantin, Jose J Guerrero, Giovanni Maria Farinella, Antonino Furnari | N/A | |
| PreLAR: World Model Pre-training with Learnable Action Representation | Lixuan Zhang, Meina Kan, Shiguang Shan, Xilin Chen* | N/A | |
| Multi-HMR: Multi-Person Whole-Body Human Mesh Recovery in a Single Shot | Fabien Baradel*, Thomas Lucas, Matthieu Armando, Salma Galaaoui, Romain Brégier, Philippe Weinzaepfel, Gregory Rogez | N/A | |
| De-confounded Gaze Estimation | Ziyang Liang, Yiwei Bao, Feng Lu* | N/A | |
| Diffusion Models for Monocular Depth Estimation: Overcoming Challenging Conditions | Fabio Tosi, Pierluigi Zama Ramirez, Matteo Poggi* | N/A | |
| FreestyleRet: Retrieving Images from Style-Diversified Queries | Hao Li, Yanhao Jia, Peng Jin, Zesen Cheng, Kehan Li, Jialu Sui, Chang Liu, Li Yuan | N/A | |
| ReGround: Improving Textual and Spatial Grounding at No Cost | Phillip Y. Lee, Minhyuk Sung* | N/A | |
| CardiacNet: Learning to Reconstruct Abnormalities for Cardiac Disease Assessment from Echocardiogram Videos | Jiewen Yang, Yiqun Lin, Bin Pu, Jiarong Guo, Xiaowei Xu, Xiaomeng Li* | N/A | |
| LaMI-DETR: Open-Vocabulary Detection with Language Model Instruction | Penghui Du, Yu Wang, Yifan Sun, Luting Wang, Yue Liao, Gang Zhang, Errui Ding, Yan Wang, Jingdong Wang, Si Liu | N/A | |
| Unrolled Decomposed Unpaired Learning for Controllable Low-Light Video Enhancement | Lingyu Zhu, Wenhan Yang, Baoliang Chen, Hanwei Zhu, Zhangkai Ni, Qi Mao, Shiqi Wang* | N/A | |
| Efficient Image Pre-Training with Siamese Cropped Masked Autoencoders | Alexandre Eymaël, Renaud Vandeghen*, Anthony Cioppa, Silvio Giancola, Bernard Ghanem, Marc Van Droogenbroeck | N/A | |
| VP-SAM: Taming Segment Anything Model for Video Polyp Segmentation via Disentanglement and Spatio-temporal Side Network | Zhixue Fang, Yuzhi Liu, Huisi Wu*, Jing Qin | N/A | |
| Dataset Enhancement with Instance-Level Augmentations | Orest Kupyn*, Christian Rupprecht | N/A | |
| FreeMotion: MoCap-Free Human Motion Synthesis with Multimodal Large Language Models | Zhikai Zhang, Yitang Li, Haofeng Huang, Mingxian Lin, Li Yi* | N/A | |
| Chameleon: A Data-Efficient Generalist for Dense Visual Prediction in the Wild | Donggyun Kim, Seongwoong Cho, Semin Kim, Chong Luo, Seunghoon Hong* | N/A | |
| Reliability in Semantic Segmentation: Can We Use Synthetic Data? | Thibaut Loiseau, Tuan-Hung Vu*, Mickael Chen, Patrick Pérez, Matthieu Cord | N/A | |
| SCPNet: Unsupervised Cross-modal Homography Estimation via Intra-modal Self-supervised Learning | Runmin Zhang, Jun Ma, Lun Luo, Beinan Yu, Shu-Jie Chen, Junwei Li, Hui-Liang Shen, Si-Yuan Cao | N/A | |
| SCAPE: A Simple and Strong Category-Agnostic Pose Estimator | Yujia Liang, Zixuan Ye, Wenze Liu, Hao Lu* | N/A | |
| Elevating All Zero-Shot Sketch-Based Image Retrieval Through Multimodal Prompt Learning | Mainak Singha*, Ankit Jha, Divyam Gupta, Pranav Singla, Biplab Banerjee | N/A | |
| Improving Knowledge Distillation via Regularizing Feature Direction and Norm | Yuzhu Wang, Lechao Cheng*, Manni Duan, Yongheng Wang, Zunlei Feng, Shu Kong | N/A | |
| 3DFG-PIFu: 3D Feature Grids for Human Digitization from Sparse Views | Kennard Yanting Chan*, Fayao Liu, Guosheng Lin, Chuan Sheng Foo, Weisi Lin | N/A | |
| Lazy Diffusion Transformer for Interactive Image Editing | Yotam Nitzan*, Zongze Wu, Richard Zhang, Eli Shechtman, Danny Cohen-Or, Taesung Park, Michaël Gharbi | N/A | |
| Non-parametric Sensor Noise Modeling and Synthesis | Ali Mosleh*, Luxi Zhao, Atin Vikram Singh, Jaeduk Han, Abhijith Punnappurath, Marcus A Brubaker, Jihwan Choe, Michael S Brown | N/A | |
| Stripe Observation Guided Inference Cost-free Attention Mechanism | Zhongzhan Huang, Shanshan Zhong, Wushao Wen, Jinghui Qin, Liang Lin | N/A | |
| The Nerfect Match: Exploring NeRF Features for Visual Localization | Qunjie Zhou*, Maxim Maximov, Or Litany, Laura Leal-Taixé | N/A | |
| ComboVerse: Compositional 3D Assets Creation Using Spatially-Aware Diffusion Guidance | Yongwei Chen, Tengfei Wang, Tong Wu, Xingang Pan, Kui Jia*, Ziwei Liu | N/A | |
| Robust Calibration of Large Vision-Language Adapters | Balamurali Murugesan*, Julio Silva-Rodríguez, Ismail Ben Ayed, Jose Dolz | N/A | |
| Leveraging Hierarchical Feature Sharing for Efficient Dataset Condensation | Haizhong Zheng, Jiachen Sun, Shutong Wu, Bhavya Kailkhura, Zhuoqing Morley Mao, Chaowei Xiao, Atul Prakash* | N/A | |
| Improving Domain Generalization in Self-Supervised Monocular Depth Estimation via Stabilized Adversarial Training | Yuanqi Yao, Gang Wu, Kui Jiang, Siao Liu, Jian Kuai, Xianming Liu, Junjun Jiang | N/A | |
| milliFlow: Scene Flow Estimation on mmWave Radar Point Cloud for Human Motion Sensing | Fangqiang Ding*, Zhen Luo, Peijun Zhao, Chris Xiaoxuan Lu | N/A | |
| denoiSplit: a method for joint microscopy image splitting and unsupervised denoising | Ashesh Ashesh, Florian Jug | N/A | |
| AugDETR: Improving Multi-scale Learning for Detection Transformer | Jinpeng Dong, Yutong Lin, Chen Li, Sanping Zhou, Nanning Zheng* | N/A | |
| Spherical World-Locking for Audio-Visual Localization in Egocentric Videos | Heeseung Yun, Ruohan Gao, Ishwarya Ananthabhotla, Anurag Kumar, Jacob Donley, Chao Li, Gunhee Kim, Vamsi Krishna Ithapu, Calvin Murdock | N/A | |
| SPIN: Hierarchical Segmentation with Subpart Granularity in Natural Images | Josh David Myers-Dean*, Jarek T Reynolds, Brian Price, Yifei Fan, Danna Gurari | N/A | |
| SIGMA: Sinkhorn-Guided Masked Video Modeling | Mohammadreza Salehi, Michael Dorkenwald, Fida Mohammad Thoker, Efstratios Gavves, Cees Snoek, Yuki M Asano | N/A | |
| Generative Camera Dolly: Extreme Monocular Dynamic Novel View Synthesis | Basile Van Hoorick*, Rundi Wu, Ege Ozguroglu, Kyle Sargent, Ruoshi Liu, Pavel Tokmakov, Achal Dave, Changxi Zheng, Carl Vondrick | N/A | |
| Distribution Alignment for Fully Test-Time Adaptation with Dynamic Online Data Streams | Ziqiang Wang, Zhixiang Chi, Yanan Wu, Li Gu, Zhi Liu, Konstantinos N Plataniotis, Yang Wang | N/A | |
| Divide and Fuse: Body Part Mesh Recovery from Partially Visible Human Images | Tianyu Luan, Zhongpai Gao, Luyuan Xie, Abhishek Sharma, Hao Ding, Benjamin Planche, Meng Zheng, Ange Lou, Terrence Chen, Junsong Yuan, Ziyan Wu* | N/A | |
| Understanding Physical Dynamics with Counterfactual World Modeling | Rahul Venkatesh, Honglin Chen, Kevin Feigelis, Daniel M Bear, Khaled Jedoui, Klemen Kotar, Felix J Binder, Wanhee Lee, Sherry Liu, Kevin Smith, Judith E. Fan, Daniel Yamins | N/A | |
| MIGS: Multi-Identity Gaussian Splatting via Tensor Decomposition | Aggelina Chatziagapi*, Grigorios Chrysos, Dimitris Samaras | N/A | |
| 4Diff: 3D-Aware Diffusion Model for Third-to-First Viewpoint Translation | Feng Cheng, Mi Luo, Huiyu Wang, Alex Dimakis, Lorenzo Torresani, Gedas Bertasius, Kristen Grauman | N/A | |
| Improving Point-based Crowd Counting and Localization Based on Auxiliary Point Guidance | I-Hsiang Chen, Wei-Ting Chen, Yu-Wei Liu, Ming-Hsuan Yang, Sy-Yen Kuo* | N/A | |
| Nymeria: A Massive Collection of Egocentric Multi-modal Human Motion in the Wild | Lingni Ma*, Yuting Ye, Rowan Postyeni, Alexander J Gamino, Vijay Baiyya, Luis Pesqueira, Kevin M Bailey, David Soriano Fosas, Fangzhou Hong, Vladimir Guzov, Yifeng Jiang, Hyo Jin Kim, Jakob Engel, Karen Liu, Ziwei Liu, Renzo De Nardi, Richard Newcombe | N/A | |
| DreamStruct: Understanding Slides and User Interfaces via Synthetic Data Generation | Yi-Hao Peng*, Faria Huq, Yue Jiang, Jason Wu, Xin Yue Li, Jeffrey Bigham, Amy Pavel | N/A | |
| SemTrack: A Large-scale Dataset for Semantic Tracking in the Wild | Pengfei Wang, Xiaofei Hui, Jing Wu, Zile Yang, Kian Eng Ong, Xinge Zhao, Beijia Lu, Dezhao Huang, Evan Ling, Weiling Chen, Keng Teck Ma, Minhoe Hur, Jun Liu* | N/A | |
| VideoMamba: Spatio-Temporal Selective State Space Model | Jinyoung Park*, Hee-Seon Kim, Kangwook Ko, Minbeom Kim, Changick Kim | N/A | |
| Text to Layer-wise 3D Clothed Human Generation | Junting Dong*, Qi Fang, Zehuan Huang, Xudong Xu, Jingbo Wang, Sida Peng, Bo Dai | N/A | |
| Texture-GS: Disentangle the Geometry and Texture for 3D Gaussian Splatting Editing | Tianxing Xu*, Wenbo Hu, Yu-Kun Lai, Ying Shan, Song-Hai Zhang | N/A | |
| Fully Sparse 3D Occupancy Prediction | Haisong Liu, Yang Chen, Haiguang Wang, Zetong Yang, Tianyu Li, Jia Zeng, Li Chen, Hongyang Li, Limin Wang* | N/A | |
| Is user feedback always informative? Retrieval Latent Defending for Semi-Supervised Domain Adaptation without Source Data | Junha Song, Tae Soo Kim, Junha Kim, Gunhee Nam, Thijs Kooi, Jaegul Choo | N/A | |
| CG-SLAM: Efficient Dense RGB-D SLAM in a Consistent Uncertainty-aware 3D Gaussian Field | Jiarui Hu, Xianhao Chen, Boyin Feng, Guanglin Li, Liangjing Yang, Hujun Bao, Guofeng Zhang, Zhaopeng Cui* | N/A | |
| Shifted Autoencoders for Point Annotation Restoration in Object Counting | Yuda Zou, Xin Xiao, Peilin Zhou, Zhichao Sun, Bo Du, Yongchao Xu* | N/A | |
| PointLLM: Empowering Large Language Models to Understand Point Clouds | Runsen Xu, Xiaolong Wang, Tai Wang, Yilun Chen, Jiangmiao Pang*, Dahua Lin | N/A | |
| GarmentAligner: Text-to-Garment Generation via Retrieval-augmented Multi-level Corrections | Shiyue Zhang, Zheng Chong, Xujie Zhang, Hanhui Li, Yuhao Cheng, Yiqiang Yan, Xiaodan Liang* | N/A | |
| Improving Agent Behaviors with RL Fine-tuning for Autonomous Driving | Zhenghao Peng, Wenjie Luo, Yiren Lu*, Tianyi Shen, Cole Gulino, Ari Seff, Justin Fu | N/A | |
| Enhancing Diffusion Models with Text-Encoder Reinforcement Learning | Chaofeng Chen, Annan Wang, Haoning Wu, Liang Liao, Wenxiu Sun, Qiong Yan, Weisi Lin | N/A | |
| Asymmetric Mask Scheme for Self-Supervised Real Image Denoising | Xiangyu Liao, Tianheng Zheng, Jiayu Zhong, Pingping Zhang, Chao Ren | N/A | |
| Omni6D: Large-Vocabulary 3D Object Dataset for Category-Level 6D Object Pose Estimation | Mengchen Zhang, Tong Wu, Tai Wang, Tengfei Wang, Ziwei Liu, Dahua Lin | N/A | |
| BAD-Gaussians: Bundle Adjusted Deblur Gaussian Splatting | Lingzhe Zhao, Peng Wang, Peidong Liu* | N/A | |
| Forest2Seq: Revitalizing Order Prior for Sequential Indoor Scene Synthesis | Qi Sun*, Hang Zhou, Wengang Zhou, Li Li, Houqiang Li | N/A | |
| BaSIC: BayesNet Structure Learning for Computational Scalable Neural Image Compression | Yufeng Zhang, Hang Yu, Shizhan Liu, Wenrui Dai, Weiyao Lin* | N/A | |
| FlexAttention for Efficient High-Resolution Vision-Language Models | Junyan Li*, Delin Chen, Tianle Cai, Peihao Chen, Yining Hong, Zhenfang Chen, Yikang Shen, Chuang Gan | N/A | |
| Repaint123: Fast and High-quality One Image to 3D Generation with Progressive Controllable Repainting | Junwu Zhang, Zhenyu Tang, Yatian Pang, Xinhua Cheng, Peng Jin, Yida Wei, Xing Zhou, Munan Ning, Li Yuan | N/A | |
| AnimatableDreamer: Text-Guided Non-rigid 3D Model Generation and Reconstruction with Canonical Score Distillation | Xinzhou Wang, Yikai Wang, Junliang Ye, Fuchun Sun, Zhengyi Wang, Ling Wang, Pengkun Liu, Kai Sun, Xintong Wang, Xie Wende, Fangfu Liu, Bin He | N/A | |
| Spatially-Variant Degradation Model for Dataset-free Super-resolution | Shaojie Guo, Haofei Song, Qingli Li, Yan Wang* | N/A | |
| DreamView: Injecting View-specific Text Guidance into Text-to-3D Generation | Junkai Yan, Yipeng Gao, Qize Yang, Xihan Wei, Xuansong Xie, Ancong Wu, Wei-Shi Zheng | N/A | |
| Learning Exhaustive Correlation for Spectral Super-Resolution: Where Spatial-Spectral Attention Meets Linear Dependence | Hongyuan Wang, Lizhi Wang*, Jiang Xu, Chang Chen, Xue Hu, Fenglong Song, Youliang Yan | N/A | |
| Local Action-Guided Motion Diffusion Model for Text-to-Motion Generation | Peng Jin, Hao Li, Zesen Cheng, Kehan Li, Runyi Yu, Chang Liu, Xiangyang Ji, Li Yuan*, Jie Chen | N/A | |
| EAFormer: Scene Text Segmentation with Edge-Aware Transformers | Haiyang Yu, Teng Fu, Bin Li*, Xiangyang Xue | N/A | |
| Benchmarks and Challenges in Pose Estimation for Egocentric Hand Interactions with Objects | Zicong Fan, Takehiko Ohkawa*, Linlin Yang, Nie Lin, Zhishan Zhou, Shihao Zhou, Jiajun Liang, Zhong Gao, Xuanyang Zhang, Xue Zhang, Fei Li, Liu Zheng, Feng Lu, Karim Abou Zeid, Bastian Leibe, Jeongwan On, Seungryul Baek, Aditya Prakash, Saurabh Gupta, Kun He, Yoichi Sato, Otmar Hilliges, Hyung Jin Chang, Angela Yao | N/A | |
| DetailSemNet: Elevating Signature Verification through Detail-Semantic Integration | Meng-Cheng Shih, Tsai-Ling Huang, Yu-Heng Shih, Hong-Han Shuai, Hsuan-Tung Liu, Yi-Ren Yeh, Ching-Chun Huang | N/A | |
| LaPose: Laplacian Mixture Shape Modeling for RGB-Based Category-Level Object Pose Estimation | Ruida Zhang, Ziqin Huang, Gu Wang, Chenyangguang Zhang, Yan Di, Xingxing Zuo, Jiwen Tang, Xiangyang Ji* | N/A | |
| Upper-body Hierarchical Graph for Skeleton Based Emotion Recognition in Assistive Driving | Jiehui Wu, Jiansheng Chen*, Qifeng Luo, Siqi Liu, Youze Xue, Huimin Ma | N/A | |
| Fine-Grained Scene Graph Generation via Sample-Level Bias Prediction | Yansheng Li, Tingzhu Wang*, Kang Wu, Linlin Wang, Xin Guo, Wenbin Wang | N/A | |
| Exploring Guided Sampling of Conditional GANs | Yifei Zhang, Mengfei Xia, Yujun Shen, Jiapeng Zhu, Ceyuan Yang, Kecheng Zheng, Lianghua Huang, Yu Liu, Fan Cheng | N/A | |
| MotionChain: Conversational Motion Controllers via Multimodal Prompts | Biao Jiang, Xin Chen, Chi Zhang, Fukun Yin, Zhuoyuan Li, Gang Yu, Jiayuan Fan* | N/A | |
| Idempotent Unsupervised Representation Learning for Skeleton-Based Action Recognition | Lilang Lin, Lehong Wu, Jiahang Zhang, Jiaying Liu* | N/A | |
| Latent Guard: a Safety Framework for Text-to-image Generation | Runtao Liu, Ashkan Khakzar, Jindong Gu, Qifeng Chen, Philip Torr, Fabio Pizzati* | N/A | |
| MacDiff: Unified Skeleton Modeling with Masked Conditional Diffusion | Lehong Wu, Lilang Lin, Jiahang Zhang, Yiyang Ma, Jiaying Liu | N/A | |
| TCC-Det: Temporarily consistent cues for weakly-supervised 3D detection | Jan Skvrna*, Lukáš Neumann | N/A | |
| OPEN: Object-wise Position Embedding for Multi-view 3D Object Detection | Jinghua Hou, Tong Wang, Xiaoqing Ye, Zhe Liu, Shi Gong, Xiao Tan, Errui Ding, Jingdong Wang, Xiang Bai* | N/A | |
| FoundPose: Unseen Object Pose Estimation with Foundation Features | Evin Pınar Örnek*, Yann Labbé, Bugra Tekin, Lingni Ma, Cem Keskin, Christian Forster, Tomas Hodan | N/A | |
| Early Preparation Pays Off: New Classifier Pre-tuning for Class Incremental Semantic Segmentation | Zhengyuan Xie, Haiquan Lu, Jia-wen Xiao, Enguang Wang, Le Zhang, Xialei Liu* | N/A | |
| Kalman-Inspired Feature Propagation for Video Face Super-Resolution | Ruicheng Feng, Chongyi Li, Chen Change Loy* | N/A | |
| Select and Distill: Selective Dual-Teacher Knowledge Transfer for Continual Learning on Vision-Language Models | Yu-Chu Yu*, Chi-Pin Huang, Jr-Jen Chen, Kai-Po Chang, Yung-Hsuan Lai, Fu-En Yang, Yu-Chiang Frank Wang | N/A | |
| VideoMamba: State Space Model for Efficient Video Understanding | Kunchang Li, Xinhao Li, Yi Wang, Yinan He, Yali Wang, Limin Wang, Yu Qiao* | N/A | |
| SAFNet: Selective Alignment Fusion Network for Efficient HDR Imaging | Lingtong Kong*, Bo Li, Yike Xiong, Hao Zhang, Hong Gu, Jinwei Chen | N/A | |
| Heterogeneous Graph Learning for Scene Graph Prediction in 3D Point Clouds | Yanni Ma, Hao Liu, Yun Pei, Yulan Guo* | N/A | |
| Reason2Drive: Towards Interpretable and Chain-based Reasoning for Autonomous Driving | Ming Nie, Renyuan Peng, Chunwei Wang, Xinyue Cai, Jianhua Han, Hang Xu, Li Zhang | N/A | |
| Omniview-Tuning: Boosting Viewpoint Invariance of Vision-Language Pre-training Models | Shouwei Ruan, Yinpeng Dong, Liu Hanqing, Yao Huang, Hang Su, Xingxing Wei | N/A | |
| Deep Cost Ray Fusion for Sparse Depth Video Completion | Jungeon Kim, Soongjin Kim, Jaesik Park, Seungyong Lee* | N/A | |
| GraphBEV: Towards Robust BEV Feature Alignment for Multi-Modal 3D Object Detection | Ziying Song, Lei Yang, Shaoqing Xu, Lin Liu, Dongyang Xu, Caiyan Jia*, Feiyang Jia, Li Wang | N/A | |
| DINO-Tracker: Taming DINO for Self-Supervised Point Tracking in a Single Video | Narek Tumanyan*, Assaf Singer, Shai Bagon, Tali Dekel | N/A | |
| GraspXL: Generating Grasping Motions for Diverse Objects at Scale | Hui Zhang*, Sammy Christen, Zicong Fan, Otmar Hilliges, Jie Song | N/A | |
| Source Prompt Disentangled Inversion for Boosting Image Editability with Diffusion Models | Ruibin Li*, Ruihuang Li, Song Guo, Lei Zhang | N/A | |
| Improving Intervention Efficacy via Concept Realignment in Concept Bottleneck Models | Nishad Singhi*, Jae Myung Kim, Karsten Roth, Zeynep Akata | N/A | |
| JointDreamer: Ensuring Geometry Consistency and Text Congruence in Text-to-3D Generation via Joint Score Distillation | ChenHan Jiang*, Yihan Zeng, Tianyang Hu, Songcen Xu, Wei Zhang, Hang Xu, Dit-Yan Yeung | N/A | |
| Brain Netflix: Scaling Data to Reconstruct Videos from Brain Signals | Camilo L Fosco*, Benjamin Lahner, Bowen Pan, Alex Andonian, Emilie L Josephs, Alex Lascelles, Aude Oliva | N/A | |
| Equivariant Spatio-Temporal Self-Supervision for LiDAR Object Detection | Deepti Hegde, Suhas Lohit, Kuan-Chuan Peng, Michael J. Jones, Vishal M. Patel | N/A | |
| SLAck: Semantic, Location, and Appearance Aware Open-Vocabulary Tracking | Siyuan Li*, Lei Ke, Yung-Hsu Yang, Luigi Piccinelli, Mattia Segù, Martin Danelljan, Luc Van Gool | N/A | |
| Tensorial template matching for fast cross-correlation with rotations and its application for tomography | Antonio Martinez-Sanchez*, Ulrike Homberg, J. M. Almira, Harold Phelippeau | N/A | |
| FreeAugment: Data Augmentation Search Across All Degrees of Freedom | Tom Bekor*, Niv Nayman, Lihi Zelnik-Manor | N/A | |
| Learning Representations of Satellite Images From Metadata Supervision | Jules Bourcier*, Gohar Dashyan, Karteek Alahari, Jocelyn Chanussot | N/A | |
| I2-SLAM: Inverting Imaging Process for Robust Photorealistic Dense SLAM | Gwangtak Bae, Changwoon Choi, Hyeongjun Heo, Sang Min Kim, Young Min Kim* | N/A | |
| FlashTex: Fast Relightable Mesh Texturing with LightControlNet | Kangle Deng*, Timothy Omernick, Alexander B Weiss, Deva Ramanan, Jun-Yan Zhu, Tinghui Zhou, Maneesh Agrawala | N/A | |
| GS-Pose: Category-Level Object Pose Estimation via Geometric and Semantic Correspondence | Pengyuan Wang*, Takuya Ikeda, Robert Lee, Koichi Nishiwaki | N/A | |
| ArtVLM: Attribute Recognition Through Vision-Based Prefix Language Modeling | William Yicheng Zhu, Keren Ye, Junjie Ke, Jiahui Yu, Leonidas Guibas, Peyman Milanfar, Feng Yang* | N/A | |
| PanoFree: Tuning-Free Holistic Multi-view Image Generation with Cross-view Self-Guidance | Aoming Liu, Zhong Li, Zhang Chen*, Nannan Li, Yi Xu, Bryan Plummer | N/A | |
| SOS: Segment Object System for Open-World Instance Segmentation With Object Priors | Christian Wilms*, Tim Rolff, Maris N Hillemann, Robert Johanson, Simone Frintrop | N/A | |
| Lagrangian Hashing for Compressed Neural Field Representations | Shrisudhan Govindarajan*, Zeno Sambugaro, Akhmedkhan Shabanov, Towaki Takikawa, Weiwei Sun, Daniel Rebain, Nicola Conci, Kwang Moo Yi, Andrea Tagliasacchi | N/A | |
| EDformer: Transformer-Based Event Denoising Across Varied Noise Levels | Bin Jiang, Bo Xiong, Bohan Qu, M. Salman Asif, You Zhou, Zhan Ma | N/A | |
| Foster Adaptivity and Balance in Learning with Noisy Labels | Mengmeng Sheng, Zeren Sun, Tao Chen, Shuchao Pang, yucheng wang, Yazhou Yao | N/A | |
| MetaAug: Meta-Data Augmentation for Post-Training Quantization | Cuong Van Pham*, Hoang Anh Dung, Cuong Cao Nguyen, Trung Le, Dinh Phung, Gustavo Carneiro, Thanh-Toan Do | N/A | |
| Thermal3D-GS: Physics-induced 3D Gaussians for Thermal Infrared Novel-view Synthesis | Qian Chen, Shihao Shu, Xiangzhi Bai* | N/A | |
| Cross-Platform Video Person ReID: A New Benchmark Dataset and Adaptation Approach | Shizhou Zhang, Wenlong Luo, De Cheng*, Qingchun Yang, Lingyan Ran, Yinghui Xing, Yanning Zhang | N/A | |
| Unleashing the Power of Prompt-driven Nucleus Instance Segmentation | Zhongyi Shui, Yunlong Zhang, Kai Yao, Chenglu Zhu, Sunyi Zheng, Jingxiong Li, Honglin Li, YUXUAN SUN, Ruizhe Guo, Lin Yang | N/A | |
| Gaze Target Detection Based on Head-Local-Global Coordination | Yaokun Yang, Feng Lu* | N/A | |
| 3DSA: Multi-View 3D Human Pose Estimation With 3D Space Attention Mechanisms | Po Han Chen, Chia-Chi Tsai* | N/A | |
| Toward Tiny and High-quality Facial Makeup with Data Amplify Learning | Qiaoqiao Jin, Xuanhong Chen, Meiguang Jin, Ying Chen, Rui Shi, Yucheng Zheng, Yupeng Zhu, Bingbing Ni* | N/A | |
| An Economic Framework for 6-DoF Grasp Detection | Xiao-Ming Wu, Jia-Feng Cai, Jian-Jian Jiang, Dian Zheng, Yi-Lin Wei, Wei-Shi Zheng | N/A | |
| GaussianFormer: Scene as Gaussians for Vision-Based 3D Semantic Occupancy Prediction | Yuanhui Huang, Wenzhao Zheng, Yunpeng Zhang, Jie Zhou, Jiwen Lu* | N/A | |
| Powerful and Flexible: Personalized Text-to-Image Generation via Reinforcement Learning | Fanyue Wei, Wei Zeng, Zhenyang Li, Dawei Yin, Lixin Duan, Wen Li* | N/A | |
| AdaLog: Post-Training Quantization for Vision Transformers with Adaptive Logarithm Quantizer | Zhuguanyu Wu, Jiaxin Chen*, Hanwen Zhong, Di Huang, Yunhong Wang | N/A | |
| Multi-Label Cluster Discrimination for Visual Representation Learning | Xiang An, Kaicheng Yang, Xiangzi Dai, Ziyong Feng, Jiankang Deng* | N/A | |
| Plan, Posture and Go: Towards Open-vocabulary Text-to-Motion Generation | Jinpeng Liu, Wenxun Dai, Chunyu Wang, Yiji Cheng, Yansong Tang*, Xin Tong | N/A | |
| DAMSDet: Dynamic Adaptive Multispectral Detection Transformer with Competitive Query Selection and Adaptive Feature Fusion | Junjie Guo, Chenqiang Gao, Fangcen Liu, Deyu Meng, Xinbo Gao | N/A | |
| CLIP-Guided Generative Networks for Transferable Targeted Adversarial Attacks | Hao Fang, Jiawei Kong, Bin Chen*, Tao Dai, Hao Wu, Shu-Tao Xia | N/A | |
| Flash Cache: Reducing Bias in Radiance Cache Based Inverse Rendering | Benjamin Attal*, Dor Verbin, Ben Mildenhall, Peter Hedman, Jonathan T Barron, Matthew O'Toole, Pratul Srinivasan | N/A | |
| Progressive Classifier and Feature Extractor Adaptation for Unsupervised Domain Adaptation on Point Clouds | Zicheng Wang, Zhen Zhao, Yiming Wu, Luping Zhou, Dong Xu | N/A | |
| A New Dataset and Framework for Real-World Blurred Images Super-Resolution | Rui Qin, Ming Sun, Chao Zhou, Bin Wang* | N/A | |
| AddressCLIP: Empowering Vision-Language Models for City-wide Image Address Localization | Shixiong Xu, Chenghao Zhang, Lubin Fan, Gaofeng Meng, SHIMING XIANG, Jieping Ye | N/A | |
| RISurConv: Rotation Invariant Surface Attention-Augmented Convolutions for 3D Point Cloud Classification and Segmentation | Zhiyuan Zhang*, Licheng Yang, Zhiyu Xiang | N/A | |
| StyleTokenizer: Defining Image Style by a Single Instance for Controlling Diffusion Models | Wen Li*, Muyuan Fang, Cheng Zou, Biao Gong, Ruobing Zheng, Meng Wang, Jingdong Chen, Ming Yang | N/A | |
| Bidirectional Uncertainty-Based Active Learning for Open-Set Annotation | Chen-Chen Zong, Ye-Wen Wang, Kun-Peng Ning, Hai-Bo Ye, Sheng-Jun Huang* | N/A | |
| Preventing Catastrophic Overfitting in Fast Adversarial Training: A Bi-level Optimization Perspective | Zhaoxin Wang, Handing Wang, Cong Tian, Yaochu Jin | N/A | |
| Projecting Points to Axes: Oriented Object Detection via Point-Axis Representation | Zeyang Zhao, Qilong Xue, Yifan Bai, Yuhang He, Xing Wei*, Yihong Gong | N/A | |
| SeiT++: Masked Token Modeling Improves Storage-efficient Training | Minhyun Lee, Song Park, Byeongho Heo, Dongyoon Han, Hyunjung Shim* | N/A | |
| Rectify the Regression Bias in Long-Tailed Object Detection | Ke Zhu, Minghao Fu, Jie Shao, Tianyu Liu, Jianxin Wu* | N/A | |
| MagicEraser: Erasing Any Objects via Semantics-Aware Control | Fan Li*, Zixiao Zhang, Yi Huang, Jianzhuang Liu, Renjing Pei, Bin Shao, Songcen Xu | N/A | |
| Reliable Spatial-Temporal Voxels For Multi-Modal Test-Time Adaptation | Haozhi Cao, Yuecong Xu, Jianfei Yang*, Pengyu Yin, Xingyu Ji, Shenghai Yuan, Lihua Xie | N/A | |
| Stable Preference: Redefining training paradigm of human preference model for Text-to-Image Synthesis | Hanting Li, Hongjing Niu, Feng Zhao* | N/A | |
| SparseSSP: 3D Subcellular Structure Prediction from Sparse-View Transmitted Light Images | Jintu Zheng, Yi Ding, Qizhe Liu, Yuehui Chen, Yi Cao, Ying Hu, Zenan Wang* | N/A | |
| NL2Contact: Natural Language Guided 3D Hand-Object Contact Modeling with Diffusion Model | Zhongqun Zhang, Hengfei Wang, Ziwei Yu, Yihua Cheng, Angela Yao, Hyung Jin Chang | N/A | |
| Self-Adapting Large Visual-Language Models to Edge Devices across Visual Modalities | Kaiwen Cai, ZheKai Duan, Gaowen Liu, Charles Fleming, Chris Xiaoxuan Lu* | N/A | |
| Diff-Tracker: Text-to-Image Diffusion Models are Unsupervised Trackers | Zhengbo Zhang, Li Xu, Duo Peng, Hossein Rahmani, Jun Liu | N/A | |
| Rethinking Tree-Ring Watermarking for Enhanced Multi-Key Identification | Hai Ci, Pei Yang, Yiren Song, Mike Zheng Shou | N/A | |
| 3D Small Object Detection with Dynamic Spatial Pruning | Zhihao Sun, Ziwei Wang, Hongmin Liu, Jie Zhou, Jiwen Lu, Xiuwei Xu | N/A | |
| STSP: Spatial-Temporal Subspace Projection for Video Class-incremental Learning | Hao Cheng, SIYUAN YANG, Chong Wang, Joey Tianyi Zhou, Alex Kot, Bihan Wen* | N/A | |
| Transferable 3D Adversarial Shape Completion using Diffusion Models | Xuelong Dai*, Bin Xiao | N/A | |
| OmniSat: Self-Supervised Modality Fusion for Earth Observation | Guillaume Astruc*, Nicolas Gonthier, Clement Mallet, Loic Landrieu | N/A | |
| Distilling Diffusion Models into Conditional GANs | MinGuk Kang, Richard Zhang, Connelly Barnes, Sylvain Paris, Suha Kwak, Jaesik Park, Eli Shechtman, Jun-Yan Zhu, Taesung Park | N/A | |
| Semantically Guided Representation Learning For Action Anticipation | Anxhelo Diko*, Danilo Avola, Bardh Prenkaj, Federico Fontana, Luigi Cinque | N/A | |
| MemBN: Robust Test-Time Adaptation via Batch Norm with Statistics Memory | Juwon Kang, Nayeong Kim, Jungseul Ok, Suha Kwak | N/A | |
| FREST: Feature RESToration for Semantic Segmentation under Multiple Adverse Conditions | Sohyun Lee, Namyup Kim, Sungyeon Kim, Suha Kwak* | N/A | |
| ScanTalk: 3D Talking Heads from Unregistered Scans | Federico Nocentini*, Thomas Besnier, Claudio Ferrari, Sylvain Arguillere, Stefano Berretti, Mohamed Daoudi | N/A | |
| Controllable Navigation Instruction Generation with Chain of Thought Prompting | Xianghao Kong, Jinyu Chen, Wenguan Wang, Hang Su, Xiaolin Hu, Yi Yang, Si Liu | N/A | |
| GiT: Towards Generalist Vision Transformer through Universal Language Interface | Haiyang Wang*, Hao Tang, Li Jiang, Shaoshuai Shi, Muhammad Ferjad Naeem, Hongsheng Li, Bernt Schiele, Liwei Wang | N/A | |
| ScatterFormer: Efficient Voxel Transformer with Scattered Linear Attention | Chenhang He*, Ruihuang Li, Guowen Zhang, Lei Zhang | N/A | |
| A Cephalometric Landmark Regression Method based on Dual-encoder for High-resolution X-ray Image | Chao Dai, yang wang*, Chaolin Huang, zhou jiakai, Qilin Xu, Minpeng Xu | N/A | |
| Exploring the Feature Extraction and Relation Modeling For Light-Weight Transformer Tracking | Jikai Zheng, Mingjiang Liang, Shaoli Huang, Jifeng Ning* | N/A | |
| LiveHPS++: Robust and Coherent Motion Capture in Dynamic Free Environment | Yiming Ren, Xiao Han, Yichen Yao, Xiaoxiao Long, Yujing Sun, Yuexin Ma | N/A | |
| You Only Need One Step: Fast Super-Resolution with Stable Diffusion via Scale Distillation | Mehdi Noroozi, Isma Hadji, Brais Martinez, Adrian Bulat, Georgios Tzimiropoulos* | N/A | |
| Gaussian Grouping: Segment and Edit Anything in 3D Scenes | Mingqiao Ye, Martin Danelljan, Fisher Yu, Lei Ke* | N/A | |
| CoMo: Controllable Motion Generation through Language Guided Pose Code Editing | Yiming Huang*, Weilin Wan, Yue Yang, Chris Callison-Burch, Mark Yatskar, Lingjie Liu | N/A | |
| MegaScenes: Scene-Level View Synthesis at Scale | Joseph Tung, Gene Chou*, Ruojin Cai, Guandao Yang, Kai Zhang, Gordon Wetzstein, Bharath Hariharan, Noah Snavely | N/A | |
| SuperGaussian: Repurposing Video Models for 3D Super Resolution | Yuan Shen, Duygu Ceylan, Paul Guerrero, Zexiang Xu, Niloy J. Mitra, Shenlong Wang, Anna Fruehstueck* | N/A | |
| Towards Model-Agnostic Dataset Condensation by Heterogeneous Models | Jun-Yeong Moon, Jung Uk Kim, Gyeong-Moon Park | N/A | |
| Goldfish: Vision-Language Understanding of Arbitrarily Long Videos | Kirolos Ataallah, Xiaoqian shen, Eslam mohamed abdelrahman, Essam Sleiman, Mingchen Zhuge, Jian Ding, Deyao Zhu, Jürgen Schmidhuber, Mohamed Elhoseiny | N/A | |
| MeshFeat: Multi-Resolution Features for Neural Fields on Meshes | Mihir Mahajan, Florian Hofherr, Daniel Cremers | N/A | |
| Decoupling Common and Unique Representations for Multimodal Self-supervised Learning | Yi Wang*, Conrad M Albrecht, Nassim Ait Ali Braham, Chenying Liu, Zhitong Xiong, Xiao Xiang Zhu | N/A | |
| MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training | Brandon McKinzie, Zhe Gan, Jean-Philippe Fauconnier, Samuel Dodge, Bowen Zhang, Philipp Dufter, Dhruti Shah, Futang Peng, Anton Belyi, Max A Schwarzer, Hongyu Hè, Xianzhi Du, Haotian Zhang, Karanjeet Singh, Doug Kang, Tom Gunter, Xiang Kong, Aonan Zhang, Jianyu Wang, Chong Wang, Nan Du, Tao Lei, Sam Wiseman, Mark Lee, Zirui Wang, Ruoming Pang, Peter Grasch, Alexander Toshev*, Yinfei Yang | N/A | |
| Optimizing Diffusion Models for Joint Trajectory Prediction and Controllable Generation | Yixiao Wang*, Chen Tang, Lingfeng Sun, Simone Rossi, Yichen Xie, Chensheng Peng, Thomas Hannagan, Stefano Sabatini, Nicola Poerio, Masayoshi TOMIZUKA, Wei Zhan | N/A | |
| 2S-ODIS: Two-Stage Omni-Directional Image Synthesis by Geometric Distortion Correction | Atsuya Nakata, Takao Yamanaka | N/A | |
| Open-Vocabulary 3D Semantic Segmentation with Text-to-Image Diffusion Models | Xiaoyu Zhu*, Hao Zhou, Pengfei Xing, Long Zhao, Hao Xu, Junwei Liang, Alexander G. Hauptmann, Ting Liu, Andrew Gallagher | N/A | |
| D-SCo: Dual-Stream Conditional Diffusion for Monocular Hand-Held Object Reconstruction | Bowen Fu, Gu Wang, Chenyangguang Zhang, Yan Di, Ziqin Huang, Zhiying Leng, Fabian Manhardt, Xiangyang Ji, Federico Tombari | N/A | |
| Combining Generative and Geometry Priors for Wide-Angle Portrait Correction | Lan Yao, Chaofeng Chen, Xiaoming Li*, Zifei Yan, Wangmeng Zuo | N/A | |
| RealViformer: Investigating Attention for Real-World Video Super-Resolution | Yuehan Zhang*, Angela Yao | N/A | |
| Pairwise Distance Distillation for Unsupervised Real-World Image Super-Resolution | Yuehan Zhang*, Seungjun Lee, Angela Yao | N/A | |
| Decomposed Vector-Quantized Variational Autoencoder for Human Grasp Generation | zhao zhe*, Mengshi Qi, Huadong Ma | N/A | |
| UniFS: Universal Few-shot Instance Perception with Point Representations | Sheng Jin, Ruijie Yao, Lumin Xu, Wentao Liu, Chen Qian, Ji Wu, Ping Luo* | N/A | |
| SemanticHuman-HD: High Resolution Semantic disentangled 3D Human Generation | Peng Zheng, Tao Liu, Zili Yi, Rui Ma* | N/A | |
| CoherentGS: Sparse Novel View Synthesis with Coherent 3D Gaussians | Avinash Paliwal*, Wei Ye, Jinhui Xiong, Dmytro Kotovenko, Rakesh Ranjan, Vikas Chandra, Nima Khademi Kalantari | N/A | |
| Monocular Occupancy Prediction for Scalable Indoor Scenes | Hongxiao Yu, Yuqi Wang, Yuntao Chen, Zhaoxiang Zhang* | N/A | |
| Visual Grounding for Object-Level Generalization in Reinforcement Learning | Haobin Jiang, Zongqing Lu* | N/A | |
| 3DEgo: 3D Editing on the Go! | Umar Khalid, Hasan Iqbal, Azib Farooq, Jing Hua, Chen Chen* | N/A | |
| Efficient Depth-Guided Urban View Synthesis | sheng miao*, Jiaxin Huang, Dongfeng Bai, Weichao Qiu, Liu Bingbing, Andreas Geiger, Yiyi Liao | N/A | |
| Probabilistic Weather Forecasting with Deterministic Guidance-based Diffusion Model | Donggeun Yoon, Minseok Seo, Doyi Kim, Yeji Choi, Donghyeon Cho* | N/A | |
| Domain-adaptive Video Deblurring via Test-time Blurring | Jin-Ting He*, Fu-Jen Tsai, Jia-Hao Wu, Yan-Tsung Peng, Chung-Chi Tsai, Chia-Wen Lin, Yen-Yu Lin | N/A | |
| Representing Topological Self-Similarity Using Fractal Feature Maps for Accurate Segmentation of Tubular Structures | Jiaxing Huang, Yanfeng Zhou, Yaoru Luo, Guole Liu, Heng Guo, Ge Yang* | N/A | |
| NeuroNCAP: Photorealistic Closed-loop Safety Testing for Autonomous Driving | William Ljungbergh*, Adam Tonderski, Joakim Johnander, Holger Caesar, Kalle Åström, Michael Felsberg, Christoffer Petersson | N/A | |
| OLAF: A Plug-and-Play Framework for Enhanced Multi-object Multi-part Scene Parsing | Pranav Gupta, Rishubh Singh, Pradeep Shenoy, Ravi Kiran Sarvadevabhatla | N/A | |
| Progressive Pretext Task Learning for Human Trajectory Prediction | Xiaotong Lin, Tianming Liang, Jianhuang Lai, Jian-Fang Hu* | N/A | |
| Hyperion – A fast, versatile symbolic Gaussian Belief Propagation framework for Continuous-Time SLAM | David Hug*, Ignacio Alzugaray, Margarita Chli | N/A | |
| Isomorphic Pruning for Vision Models | Gongfan Fang, Xinyin Ma, Michael Bi Mi, Xinchao Wang | N/A | |
| Attention Prompting on Image for Large Vision-Language Models | Runpeng Yu, Weihao Yu, Xinchao Wang* | N/A | |
| Learning Cross-hand Policies of High-DOF Reaching and Grasping | Qijin She, Shishun Zhang, Yunfan Ye, Ruizhen Hu, Kai Xu* | N/A | |
| Reprojection Errors as Prompts for Efficient Scene Coordinate Regression | Ting-Ru Liu*, Hsuan-Kung Yang, Jou-Min Liu, Chun-Wei Huang, Tsung-Chih Chiang, Quan Kong, Norimasa Kobori, Chun-Yi Lee | N/A | |
| Diffusion-Driven Data Replay: A Novel Approach to Combat Forgetting in Federated Class Continual Learning | Jinglin Liang, Jin Zhong, Hanlin Gu, Zhongqi Lu, Xingxing Tang, Gang Dai, Shuangping Huang*, Lixin Fan, Qiang Yang | N/A | |
| Long-Tail Temporal Action Segmentation with Group-wise Temporal Logit Adjustment | Zhanzhong Pang*, Fadime Sener, Shrinivas Ramasubramanian, Angela Yao | N/A | |
| REVISION: Rendering Tools Enable Spatial Fidelity in Vision-Language Models | Agneet Chatterjee*, Yiran Luo, Tejas Gokhale, Yezhou Yang, Chitta R Baral | N/A | |
| DreamMotion: Space-Time Self-Similar Score Distillation for Zero-Shot Video Editing | Hyeonho Jeong, Jinho Chang, Geon Yeong Park, Jong Chul Ye* | N/A | |
| VideoClusterNet: Self-Supervised and Adaptive Face Clustering for Videos | Devesh Walawalkar*, Pablo Garrido | N/A | |
| Unveiling Privacy Risks in Stochastic Neural Networks Training: Effective Image Reconstruction from Gradients | Yiming Chen*, Xiangyu Yang, Nikos Deligiannis | N/A | |
| Controlling the World by Sleight of Hand | Sruthi Sudhakar*, Ruoshi Liu, Basile Van Hoorick, Carl Vondrick, Richard Zemel | N/A | |
| Hiding Imperceptible Noise in Curvature-Aware Patches for 3D Point Cloud Attack | Mingyu Yang*, Daizong Liu, Keke Tang, Pan Zhou, Lixing Chen, Junyang Chen | N/A | |
| Interleaving One-Class and Weakly-Supervised Models with Adaptive Thresholding for Unsupervised Video Anomaly Detection | Yongwei Nie, Hao Huang, Chengjiang Long, Qing Zhang, Pradipta Maji, Hongmin Cai* | N/A | |
| Cross-Domain Learning for Video Anomaly Detection with Limited Supervision | Yashika Jain, Ali Dabouei, Min Xu | N/A | |
| YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information | Chien-Yao Wang*, I-Hau Yeh, Hong-Yuan Mark Liao | N/A | |
| Unsupervised Multi-modal Medical Image Registration via Invertible Translation | Mengjie Guo* | N/A | |
| Functional Transform-Based Low-Rank Tensor Factorization for Multi-Dimensional Data Recovery | Jian-Li Wang, Xi-Le Zhao* | N/A | |
| CRM: Single Image to 3D Textured Mesh with Convolutional Reconstruction Model | Zhengyi Wang*, Yikai Wang, Yifei Chen, Chendong Xiang, Shuo Chen, Dajiang Yu, Chongxuan Li, Hang Su, Jun Zhu | N/A | |
| Domain Reduction Strategy for Non-Line-of-Sight Imaging | Hyunbo Shim, In Cho, Daekyu Kwon, Seon Joo Kim* | N/A | |
| HPE-Li: WiFi-enabled Lightweight Dual Selective Kernel Convolution for Human Pose Estimation | Toan D. Gian, Tien Dac Lai, Thien Van Luong, Kok-Seng Wong, Van-Dinh Nguyen* | N/A | |
| Cut out the Middleman: Revisiting Pose-based Gait Recognition | Yang Fu, Saihui Hou, Shibei Meng, Xuecai Hu, Chunshui Cao, Xu Liu, Yongzhen Huang | N/A | |
| HiEI: A Universal Framework for Generating High-quality Emerging Images from Natural Images | Jingmeng Li, Lukang Fu, Surun Yang, Hui Wei* | N/A | |
| High-Precision Self-Supervised Monocular Depth Estimation with Rich-Resource Prior | Jianbing Shen*, Wencheng Han | N/A | |
| SGS-SLAM: Semantic Gaussian Splatting For Neural Dense SLAM | Mingrui Li, Shuhong Liu, Heng Zhou, Guohao Zhu, Na Cheng, Tianchen Deng, Hongyu Wang* | N/A | |
| View Selection for 3D Captioning via Diffusion Ranking | Tiange Luo*, Justin Johnson, Honglak Lee | N/A | |
| OmniSSR: Zero-shot Omnidirectional Image Super-Resolution using Stable Diffusion Model | Runyi Li, Xuhan Sheng, Weiqi Li, Jian Zhang | N/A | |
| UDiffText: A Unified Framework for High-quality Text Synthesis in Arbitrary Images via Character-aware Diffusion Models | Yiming Zhao, Zhouhui Lian | N/A | |
| Confidence Self-Calibration for Multi-Label Class-Incremental Learning | Kaile Du, Yifan Zhou, Fan Lyu, Yuyang Li, Chen Lu, Guangcan Liu | N/A | |
| OMG: Occlusion-friendly Personalized Multi-concept Generation in Diffusion Models | Zhe Kong, Yong Zhang, Tianyu Yang, Tao Wang, Kaihao Zhang, Bizhu Wu, Guanying Chen, Wei Liu, Wenhan Luo* | N/A | |
| Versatile Incremental Learning: Towards Class and Domain-Agnostic Incremental Learning | Min-Yeong Park, Jae-Ho Lee, Gyeong-Moon Park* | N/A | |
| WeCromCL: Weakly Supervised Cross-Modality Contrastive Learning for Transcription-only Supervised Text Spotting | Jingjing Wu, Zhengyao Fang, Pengyuan Lyu, Chengquan Zhang, Fanglin Chen, Guangming Lu, Wenjie Pei* | N/A | |
| An Incremental Unified Framework for Small Defect Inspection | Jiaqi Tang, Hao Lu, Xiaogang Xu, Ruizheng Wu, Sixing Hu, Tong Zhang, Tsz Wa Cheng, Ming Ge, Ying-Cong Chen*, Fugee Tsung | N/A | |
| Enhancing Optimization Robustness in 1-bit Neural Networks through Stochastic Sign Descent | NianHui Guo*, Hong Guo, Christoph Meinel, Haojin Yang | N/A | |
| Temporally Consistent Stereo Matching | Jiaxi Zeng, Chengtang Yao, Yuwei Wu, Yunde Jia | N/A | |
| A Rotation-invariant Texture ViT for Fine-Grained Recognition of Esophageal Cancer Endoscopic Ultrasound Images | Tianyi Liu, Shuaishuai S Zhuang, Jiacheng Nie, Geng Chen, Yusheng Guo, Guangquan Zhou, Jean-Louis Coatrieux, Yang Chen | N/A | |
| BI-MDRG: Bridging Image History in Multimodal Dialogue Response Generation | Hee Suk Yoon, Eunseop Yoon, Joshua Tian Jin Tee, Kang Zhang, Yu-Jung Heo, Du-Seong Chang, Chang D. Yoo* | N/A | |
| Adapting Fine-Grained Cross-View Localization to Areas without Fine Ground Truth | Zimin Xia*, Yujiao Shi, Hongdong Li, Julian F. P. Kooij | N/A | |
| BeNeRF: Neural Radiance Fields from a Single Blurry Image and Event Stream | Wenpu Li, Pian Wan, Peng Wang, Jinghang Li, Yi Zhou, Peidong Liu* | N/A | |
| Human Motion Forecasting in Dynamic Domain Shifts: A Homeostatic Continual Test-time Adaptation Framework | Qiongjie Cui*, Huaijiang Sun, Bin Li, Jianfeng Lu, Weiqing Li | N/A | |
| CloudFixer: Test-Time Adaptation for 3D Point Clouds via Diffusion-Guided Geometric Transformation | Hajin Shim, Changhun Kim, Eunho Yang* | N/A | |
| DreamDiffusion: High-Quality EEG-to-Image Generation with Temporal Masked Signal Modeling and CLIP Alignment | Yunpeng Bai*, Xintao Wang, Yan-Pei Cao, Yixiao Ge, Chun Yuan, Ying Shan | N/A | |
| FRI-Net: Floorplan Reconstruction via Room-wise Implicit Representation | Honghao Xu, Juzhan Xu, Zeyu Huang, Pengfei Xu, Hui Huang, Ruizhen Hu* | N/A | |
| BugNIST - a Large Volumetric Dataset for Detection under Domain Shift | Patrick M Jensen, Vedrana A Dahl, Rebecca Engberg, Carsten Gundlach, Hans Martin Kjer, Anders B Dahl* | N/A | |
| SCP-Diff: Spatial-Categorical Joint Prior for Diffusion Based Semantic Image Synthesis | Huan-ang Gao, Mingju Gao, Jiaju Li, Wenyi Li, Rong Zhi, Hao Tang, Hao Zhao* | N/A | |
| PoseAugment: Generative Human Pose Data Augmentation with Physical Plausibility for IMU-based Motion Capture | Zhuojun Li, Chun Yu, Chen Liang, Yuanchun Shi | N/A | |
| PixArt-Sigma: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation | Junsong Chen, Chongjian GE, Enze Xie*, Yue Wu, Lewei Yao, Xiaozhe Ren, Zhongdao Wang, Ping Luo, Huchuan Lu, Zhenguo Li | N/A | |
| Hierarchical Gaussian Mixture Normalizing Flow Modeling for Unified Anomaly Detection | Xincheng Yao, Ruoqi Li, Zefeng Qian, lu wang, Chongyang Zhang | N/A | |
| A Closer Look at GAN Priors: Exploiting Intermediate Features for Enhanced Model Inversion Attacks | Yixiang Qiu, Hao Fang, Hongyao Yu, Bin Chen, Meikang Qiu, Shu-Tao Xia | N/A | |
| Improving Unsupervised Domain Adaptation: A Pseudo-Candidate Set Approach | Aveen Dayal*, Rishabh Lalla, Linga Reddy Cenkeramaddi, C. Krishna Mohan, Abhinav Kumar, Vineeth N Balasubramanian | N/A | |
| HeadStudio: Text to Animatable Head Avatars with 3D Gaussian Splatting | Zhenglin Zhou*, Fan Ma, Hehe Fan, Zongxin Yang, Yi Yang | N/A | |
| DetToolChain: A New Prompting Paradigm to Unleash Detection Ability of MLLM | Yixuan Wu*, Yizhou Wang, Shixiang Tang, Wenhao Wu, Tong He, Wanli Ouyang, Philip Torr, Jian Wu | N/A | |
| Surface-Centric Modeling for High-Fidelity Generalizable Neural Surface Reconstruction | Rui Peng, Shihe Shen, Kaiqiang Xiong, Huachen Gao, Jianbo Jiao, Xiaodong Gu, Ronggang Wang* | N/A | |
| HumanRefiner: Benchmarking Abnormal Human Generation and Refining with Coarse-to-fine Pose-Reversible Guidance | Guian Fang*, Wenbiao Yan, Yuanfan Guo, Jianhua Han, Zutao Jiang, Hang Xu, Shengcai Liao, Xiaodan Liang | N/A | |
| Multiscale Graph Texture Network | Ravishankar Evani*, Deepu Rajan, Shangbo Mao | N/A | |
| HyTAS: A Hyperspectral Image Transformer Architecture Search Benchmark and Analysis | Fangqin Zhou*, Mert Kilickaya, Joaquin Vanschoren, Ran Piao | N/A | |
| Integer-Valued Training and Spike-driven Inference Spiking Neural Network for High-performance and Energy-efficient Object Detection | Xinhao Luo, Man Yao, Yuhong Chou, Bo Xu, Guoqi Li* | N/A | |
| RepVF: A Unified Vector Fields Representation for Multi-task 3D Perception | Jianbing Shen, Chunliang Li, Wencheng Han, Junbo Yin, Sanyuan Zhao* | N/A | |
| Phase Concentration and Shortcut Suppression for Weakly Supervised Semantic Segmentation | Hoyong Kwon, Jaeseok Jeong, Sung-Hoon Yoon, Kuk-Jin Yoon* | N/A | |
| Group Testing for Accurate and Efficient Range-Based Near Neighbor Search for Plagiarism Detection | Harsh Shah, Kashish Mittal, Ajit Rajwade | N/A | |
| CompGS: Smaller and Faster Gaussian Splatting with Vector Quantization | K L Navaneet*, Kossar Pourahmadi Meibodi, Soroush Abbasi Koohpayegani, Hamed Pirsiavash | N/A | |
| SMILe: Leveraging Submodular Mutual Information For Robust Few-Shot Object Detection | Anay Majee, Ryan X Sharp, Rishabh Iyer | N/A | |
| Customize-A-Video: One-Shot Motion Customization of Text-to-Video Diffusion Models | Yixuan Ren*, Yang Zhou, Jimei Yang, Jing Shi, Difan Liu, Feng Liu, Mingi Kwon, Abhinav Shrivastava | N/A | |
| S-JEPA: A Joint Embedding Predictive Architecture for Skeletal Action Recognition | Mohamed Abdelfattah*, Alexandre Alahi | N/A | |
| ∞-Brush: Controllable Large Image Synthesis with Diffusion Models in Infinite Dimensions | Minh-Quan Le*, Alexandros Graikos, Srikar Yellapragada, Rajarsi Gupta, Joel Saltz, Dimitris Samaras | N/A | |
| SwapAnything: Enabling Arbitrary Object Swapping in Personalized Image Editing | Jing Gu, Nanxuan Zhao, Wei Xiong, Qing Liu, Zhifei Zhang, He Zhang, Jianming Zhang, HyunJoon Jung, Yilin Wang, Xin Eric Wang* | N/A | |
| Interaction-centric Spatio-Temporal Context Reasoning for Multi-Person Video HOI Recognition | Yisong Wang, Nan Xi*, Jingjing Meng, Junsong Yuan | N/A | |
| Efficient Unsupervised Visual Representation Learning with Explicit Cluster Balancing | Ioannis Maniadis Metaxas*, Georgios Tzimiropoulos, Ioannis Patras | N/A | |
| ProTIP: Probabilistic Robustness Verification on Text-to-Image Diffusion Models against Stochastic Perturbation | Yi Zhang, Yun Tang, Wenjie Ruan, Xiaowei Huang, Siddartha Khastgir, Paul A Jennings, Xingyu Zhao* | N/A | |
| Leveraging Near-Field Lighting for Monocular Depth Estimation from Endoscopy Videos | Akshay Paruchuri*, Samuel Ehrenstein, Shuxian Wang, Inbar Fried, Stephen Pizer, Marc Niethammer, Roni Sengupta | N/A | |
| OvSW: Overcoming Silent Weights for Accurate Binary Neural Networks | jingyang xiang*, Zuohui Chen, Siqi Li, Qing Wu, Yong Liu | N/A | |
| Multistain Pretraining for Slide Representation Learning in Pathology | Guillaume Jaume, Anurag J Vaidya, Andrew Zhang, Andrew Song, Richard J Chen, Sharifa Sahai, Dandan Mo, Emilio Madrigal, Long P Le, Faisal Mahmood* | N/A | |
| T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy | Qing Jiang, Feng Li, Zhaoyang Zeng, Shilong Liu, Tianhe Ren, Lei Zhang | N/A | |
| Harmonizing knowledge Transfer in Neural Network with Unified Distillation | yaomin huang, Faming Fang, Zaoming Yan, Chaomin Shen, Guixu Zhang* | N/A | |
| Mamba-ND: Selective State Space Modeling for Multi-Dimensional Data | Shufan Li*, Aditya Grover, Harkanwar Singh | N/A | |
| Click Prompt Learning with Optimal Transport for Interactive Segmentation | Jie Liu*, Haochen wang, Wenzhe Yin, Jan-Jakob Sonke, Efstratios Gavves | N/A | |
| 3D Human Pose Estimation via Non-Causal Retentive Networks | Kaili Zheng, Feixiang Lu, Yihao Lv, Liangjun Zhang, Chenyi Guo, Ji Wu | N/A | |
| OMR: Occlusion-Aware Memory-Based Refinement for Video Lane Detection | Dongkwon Jin, Chang-Su Kim* | N/A | |
| 6DoF Head Pose Estimation through Explicit Bidirectional Interaction with Face Geometry | Sungho Chun, Ju Yong Chang* | N/A | |
| Latent Diffusion Prior Enhanced Deep Unfolding for Snapshot Spectral Compressive Imaging | Zongliang Wu*, Ruiying Lu, Ying Fu, Xin Yuan | N/A | |
| Multimodal Cross-Domain Few-Shot Learning for Egocentric Action Recognition | Masashi Hatano*, Ryo Hachiuma, Ryo Fujii, Hideo Saito | N/A | |
| Enhancing Tampered Text Detection through Frequency Feature Fusion and Decomposition | Zhongxi Chen, Shen Chen, Taiping Yao, Ke Sun, Shouhong Ding, Xianming Lin, Liujuan Cao, Rongrong Ji | N/A | |
| Modeling Label Correlations with Latent Context for Multi-Label Recognition | Zhaomin Chen, Quan Cui, Ruoxi Deng, Jie Hu, Guodao Zhang | N/A | |
| LLM as Dataset Analyst: Subpopulation Structure Discovery with Large Language Model | Yulin Luo, Ruichuan An, Bocheng Zou, Yiming Tang, Jiaming Liu, Shanghang Zhang* | N/A | |
| Finding a needle in a haystack: A Black-Box Approach to Invisible Watermark Detection | Minzhou Pan*, Zhenting Wang, Xin Dong, Vikash Sehwag, Lingjuan Lyu, Xue Lin | N/A | |
| DynoSurf: Neural Deformation-based Temporally Consistent Dynamic Surface Reconstruction | Yuxin Yao, Siyu Ren, Junhui Hou*, Zhi Deng, Juyong Zhang, Wenping Wang | N/A | |
| MOD-UV: Learning Mobile Object Detectors from Unlabeled Videos | Yihong Sun*, Bharath Hariharan | N/A | |
| ARoFace: Alignment Robustness to Improve Low-quality Face Recognition | Mohammad Saeed Ebrahimi Saadabadi*, Sahar Rahimi Malakshan, Ali Dabouei, Nasser Nasrabadi | N/A | |
| Learning Diffusion Models for Multi-View Anomaly Detection | Chieh Liu, Yu-Min Chu, Ting-I Hsieh, Hwann-Tzong Chen, Tyng-Luh Liu* | N/A | |
| Clearer Frames, Anytime: Resolving Velocity Ambiguity in Video Frame Interpolation | Zhihang Zhong, Gurunandan Krishnan, Xiao Sun, Yu Qiao, Sizhuo Ma, Jian Wang | N/A | |
| Multi-modal Relation Distillation for Unified 3D Representation Learning | Huiqun Wang, Yiping Bao, Panwang Pan, Zeming Li, Xiao Liu, Ruijie Yang, Di Huang* | N/A | |
| Strengthening Multimodal Large Language Model with Bootstrapped Preference Optimization | Renjie Pi*, Tianyang Han, Wei Xiong, Jipeng ZHANG, Runtao Liu, Rui Pan, Tong Zhang | N/A | |
| Collaborative Vision-Text Representation Optimizing for Open-Vocabulary Segmentation | Siyu Jiao, hongguang Zhu, Yunchao Wei, Yao Zhao, Jiannan Huang, Humphrey Shi | N/A | |
| Distributionally Robust Loss for Long-Tailed Multi-Label Image Classification | Dekun Lin*, Zhe Cui, Rui Chen, Tailai Peng, xinran xie, Xiaolin Qin | N/A | |
| MesonGS: Post-training Compression of 3D Gaussians via Efficient Attribute Transformation | Shuzhao Xie*, Weixiang Zhang, Chen Tang, Yunpeng Bai, Rongwei Lu, Shijia Ge, Zhi Wang | N/A | |
| LongVLM: Efficient Long Video Understanding via Large Language Models | Yuetian Weng, Mingfei Han, Haoyu He, Xiaojun Chang, Bohan Zhuang* | N/A | |
| The All-Seeing Project V2: Towards General Relation Comprehension of the Open World | Weiyun Wang, yiming ren, Haowen Luo, Tiantong Li, Chenxiang Yan, Zhe Chen, Wenhai Wang, Qingyun Li, Lewei Lu, Xizhou Zhu, Yu Qiao, Jifeng Dai* | N/A | |
| Neural Metamorphosis | Xingyi Yang, Xinchao Wang | N/A | |
| WHAC: World-grounded Humans and Cameras | Wanqi Yin, Zhongang Cai, Chen Wei, Fanzhou Wang, Ruisi Wang, Haiyi Mei, Weiye Xiao, Zhitao Yang, Qingping Sun, Atsushi Yamashita, Ziwei Liu, Lei Yang* | N/A | |
| Federated Learning with Local Openset Noisy Labels | Zonglin Di, Zhaowei Zhu, Xiaoxiao Li, Yang Liu | N/A | |
| Diff3DETR: Agent-based Diffusion Model for Semi-supervised 3D Object Detection | Jiacheng Deng*, Jiahao Lu, Tianzhu Zhang | N/A | |
| PSALM: Pixelwise Segmentation with Large Multi-modal Model | Zheng Zhang, yeyao ma, Enming Zhang, Xiang Bai* | N/A | |
| Layout-Corrector: Alleviating Layout Sticking Phenomenon in Discrete Diffusion Model | Shoma Iwai*, Atsuki Osanai, Shunsuke Kitada, Shinichiro Omachi | N/A | |
| Active Coarse-to-Fine Segmentation of Moveable Parts from Real Images | Ruiqi Wang*, Akshay Gadi Patil, Fenggen Yu, Hao Zhang | N/A | |
| Topo4D: Topology-Preserving Gaussian Splatting for High-Fidelity 4D Head Capture | Xuanchen Li, Yuhao Cheng, Xingyu Ren, Haozhe Jia, Di Xu, Wenhan Zhu, Yichao Yan* | N/A | |
| Learning Modality-agnostic Representation for Semantic Segmentation from Any Modalities | Xu Zheng, Yuanhuiyi Lyu, Lin Wang | N/A | |
| Kinetic Typography Diffusion Model | Seonmi Park, Inhwan Bae, Seunghyun Shin, Hae-Gon Jeon* | N/A | |
| Refine, Discriminate and Align: Stealing Encoders via Sample-Wise Prototypes and Multi-Relational Extraction | Shuchi Wu, Chuan Ma, Kang Wei*, Xiaogang XU, Ming Ding, Yuwen Qian, Di Xiao, Tao Xiang | N/A | |
| Light-in-Flight for a World-in-Motion | Jongho Lee*, Ryan J Suess, Mohit Gupta | N/A | |
| GroupDiff: Diffusion-based Group Portrait Editing | Yuming Jiang, Nanxuan Zhao*, Qing Liu, Krishna Kumar Singh, Shuai Yang, Chen Change Loy, Ziwei Liu | N/A | |
| Faceptor: A Generalist Model for Face Perception | Lixiong Qin, Mei Wang, Xuannan Liu, Yuhang Zhang, Wei Deng, Xiaoshuai Song, Weiran Xu, Weihong Deng | N/A | |
| Inter-Class Topology Alignment for Efficient Black-Box Substitute Attacks | Lingzhuang Meng, Mingwen Shao*, Yuanjian Qiao, Wenjie Liu | N/A | |
| Segment3D: Learning Fine-Grained Class-Agnostic 3D Segmentation without Manual Labels | Rui Huang, Songyou Peng, Ayca Takmaz, Federico Tombari, Marc Pollefeys, Shiji Song, Gao Huang*, Francis Engelmann | N/A | |
| InsMapper: Exploring Inner-instance Information for Vectorized HD Mapping | zhenhua xu*, Kwan-Yee K. Wong, Hengshuang Zhao | N/A | |
| KDProR: A Knowledge-Decoupling Probabilistic Framework for Video-Text Retrieval | Xianwei Zhuang*, Hongxiang Li, Xuxin Cheng, Zhihong Zhu, Yuxin Xie, Yuexian Zou | N/A | |
| Category-level Object Detection, Pose Estimation and Reconstruction from Stereo Images | Chuanrui Zhang, Yonggen Ling, Minglei Lu, Minghan Qin, Haoqian Wang* | N/A | |
| Learning with Unmasked Tokens Drives Stronger Vision Learners | Taekyung Kim, Sanghyuk Chun, Byeongho Heo, Dongyoon Han | N/A | |
| Dual-stage Hyperspectral Image Classification Model with Spectral Supertoken | Peifu Liu, Tingfa Xu, Jie Wang, Huan Chen, Huiyan Bai, Jianan Li | N/A | |
| Multi-Task Domain Adaptation for Language Grounding with 3D Objects | Penglei Sun, Yaoxian Song, Xinglin Pan, Peijie Dong, Xiaofei Yang, Qiang Wang, Zhixu Li, Tiefeng Li, Xiaowen Chu | N/A | |
| Efficient Active Domain Adaptation for Semantic Segmentation by Selecting Information-rich Superpixels | Yuan Gao, Zilei Wang*, Yixin Zhang, Bohai Tu | N/A | |
| Efficient Training of Spiking Neural Networks with Multi-Parallel Implicit Stream Architecture | Zhigao Cao, Meng Li, Xiashuang Wang, Haoyu Wang, Fan Wang, Youjun Li, Zigang Huang* | N/A | |
| Camera-LiDAR Cross-modality Gait Recognition | Wenxuan Guo*, Yingping Liang, Zhiyu Pan, Ziheng Xi, Jianjiang Feng, Jie Zhou | N/A | |
| LiteSAM is Actually what you Need for segment Everything | Jianhai Fu, Yuanjie Yu, Ningchuan Li, Yi Zhang, Qichao Chen, Jianping Xiong, Jun Yin, Zhiyu Xiang | N/A | |
| IGNORE: Information Gap-based False Negative Loss Rejection for Single Positive Multi-Label Learning | Gyeong Ryeol Song, Noo-ri Kim, Jin-Seop Lee, Jee-Hyong Lee* | N/A | |
| Visual Prompting via Partial Optimal Transport | Mengyu Zheng, Zhiwei Hao, Yehui Tang, Chang Xu | N/A | |
| Modelling Competitive Behaviors in Autonomous Driving Under Generative World Model | Guanren Qiao, Guiliang Liu*, Guorui Quan, Rongxiao Qu | N/A | |
| Tendency-driven Mutual Exclusivity for Weakly Supervised Incremental Semantic Segmentation | Chongjie Si, Xuehui Wang, Xiaokang Yang, Wei Shen* | N/A | |
| AdaCLIP: Adapting CLIP with Hybrid Learnable Prompts for Zero-Shot Anomaly Detection | Yunkang Cao, Jiangning Zhang, Luca Frittoli, Yuqi Cheng, Weiming Shen, Giacomo Boracchi | N/A | |
| Pathformer3D: A 3D Scanpath Transformer for 360° Images | Rong Quan, yantao Lai, Mengyu Qiu, Dong Liang* | N/A | |
| TransFusion -- A Transparency-Based Diffusion Model for Anomaly Detection | Matic Fučka*, Vitjan Zavrtanik, Danijel Skočaj | N/A | |
| SparseLIF: High-Performance Sparse LiDAR-Camera Fusion for 3D Object Detection | Hongcheng Zhang, Liu Liang, Pengxin Zeng*, Xiao Song, Zhe Wang | N/A | |
| 3D Gaussian Parametric Head Model | Yuelang Xu, Lizhen Wang, Zerong Zheng, Zhaoqi Su, Yebin Liu* | N/A | |
| RING-NeRF: Rethinking Inductive Biases for Versatile and Efficient Neural Fields | Doriand Petit*, Steve Bourgeois, Dumitru Pavel, Vincent Gay-Bellile, Florian Chabot, Loïc Barthe | N/A | |
| Platypus: A Generalized Specialist Model for Reading Text in Various Forms | Peng Wang, Zhaohai Li, Jun Tang, Humen Zhong, Fei Huang, Zhibo Yang, Cong Yao | N/A | |
| Structured-NeRF: Hierarchical Scene Graph with Neural Representation | Zhide Zhong, Jiakai Cao, songen gu, Sirui Xie, Liyi Luo, Hao Zhao, Guyue Zhou, Haoang Li, Zike Yan* | N/A | |
| EGIC: Enhanced Low-Bit-Rate Generative Image Compression Guided by Semantic Segmentation | Nikolai Körber*, Eduard Kromer, Andreas Siebert, Sascha Hauke, Daniel Mueller-Gritschneder, Björn Schuller | N/A | |
| Plug-and-Play Learned Proximal Trajectory for 3D Sparse-View X-Ray Computed Tomography | Romain Vo*, Julie Escoda, Caroline Vienne, Etienne Decenciere | N/A | |
| PPAD: Iterative Interactions of Prediction and Planning for End-to-end Autonomous Driving | Zhili Chen, Maosheng Ye, Shuangjie Xu, Tongyi Cao, Qifeng Chen* | N/A | |
| Test-Time Stain Adaptation with Diffusion Models for Histopathology Image Classification | Cheng-Chang Tsai, Yuan-Chih Chen, Chun-Shien Lu | N/A | |
| Beyond MOT: Semantic Multi-Object Tracking | Yunhao Li, Qin Li, Hao Wang, Xue Ma, Jiali Yao, Shaohua Dong, Heng Fan, Libo Zhang* | N/A | |
| Temporal Event Stereo via Joint Learning with Stereoscopic Flow | Hoonhee Cho, Jae-Young Kang, Kuk-Jin Yoon* | N/A | |
| SAM-COD: SAM-guided Unified Framework for Weakly-Supervised Camouflaged Object Detection | Huafeng Chen, Pengxu Wei, Guangqian Guo, Shan Gao* | N/A | |
| Just a Hint: Point-Supervised Camouflaged Object Detection | Huafeng Chen, Dian SHAO, Guangqian Guo, Shan Gao | N/A | |
| ManiGaussian: Dynamic Gaussian Splatting for Multi-task Robotic Manipulation | Guanxing Lu, Shiyi Zhang, Ziwei Wang*, Changliu Liu, Jiwen Lu, Yansong Tang | N/A | |
| Global-Local Collaborative Inference with LLM for Lidar-Based Open-Vocabulary Detection | Xingyu Peng, Yan Bai, Chen Gao, Lirong Yang, Fei Xia, Beipeng Mu, Xiaofei Wang, Si Liu* | N/A | |
| Learning High-resolution Vector Representation from Multi-Camera Images for 3D Object Detection | Zhili Chen, Shuangjie Xu, Maosheng Ye, Zian Qian, Xiaoyi Zou, Dit-Yan Yeung, Qifeng Chen* | N/A | |
| View-Consistent 3D Editing with Gaussian Splatting | Yuxuan Wang*, Xuanyu Yi, Zike Wu, Na Zhao, Long Chen, Hanwang Zhang | N/A | |
| E3V-K5: An Authentic Benchmark for Redefining Video-Based Energy Expenditure Estimation | Shengxuming Zhang, Lei Jin, Yifan Wang, Xinyu Wang, Xu Wen, Zunlei Feng*, Mingli Song | N/A | |
| GeoGaussian: Geometry-aware Gaussian Splatting for Scene Rendering | Yanyan Li*, Chenyu Lyu, Yan Di, Guangyao Zhai, Gim Hee Lee, Federico Tombari | N/A | |
| URS-NeRF: Unordered Rolling Shutter Bundle Adjustment for Neural Radiance Fields | Bo Xu*, Liu Ziao, Mengqi Guo, jiancheng Li, Gim Hee Lee | N/A | |
| InstructIR: High-Quality Image Restoration Following Human Instructions | Marcos V. Conde*, Gregor Geigle, Radu Timofte | N/A | |
| Asynchronous Large Language Model Enhanced Planner for Autonomous Driving | Yuan Chen, Zi-han Ding, Ziqin Wang, Yan Wang, Lijun Zhang, Si Liu | N/A | |
| Make a Cheap Scaling: A Self-Cascade Diffusion Model for Higher-Resolution Adaptation | Lanqing Guo, Yingqing HE, Haoxin Chen, Menghan Xia, Xiaodong Cun, Yufei Wang, Siyu Huang, Yong Zhang, Xintao Wang, Qifeng Chen, Ying Shan, Bihan Wen* | N/A | |
| LayoutFlow: Flow Matching for Layout Generation | Julian Jorge Andrade Guerreiro, Naoto Inoue, Kento Masui, Mayu Otani, Hideki Nakayama | N/A | |
| Making Large Language Models Better Planners with Reasoning-Decision Alignment | Zhijian Huang, Tao Tang, Shaoxiang Chen, Sihao Lin, Zequn Jie, Lin Ma, Guangrun Wang, Xiaodan Liang* | N/A | |
| R3D-AD: Reconstruction via Diffusion for 3D Anomaly Detection | Zheyuan Zhou, Le Wang, Naiyu Fang, Zili Wang, Lemiao Qiu*, Shuyou Zhang | N/A | |
| Representation Enhancement-Stabilization: Reducing Bias-Variance of Domain Generalization | Wei Huang*, Yilei Shi, Zhitong Xiong, Xiao Xiang Zhu | N/A | |
| Continual Learning for Remote Physiological Measurement: Minimize Forgetting and Simplify Inference | Qian Liang, Yan Chen, Yang Hu* | N/A | |
| An Optimization Framework to Enforce Multi-View Consistency for Texturing 3D Meshes | Zhengyi Zhao, Chen Song, Xiaodong Gu, Yuan Dong, Qi Zuo, Weihao Yuan, Zilong Dong, Liefeng Bo, Qixing Huang | N/A | |
| STAG4D: Spatial-Temporal Anchored Generative 4D Gaussians | Yifei Zeng, Yanqin Jiang, Siyu Zhu, Yuanxun Lu, Youtian Lin, Hao Zhu, Weiming Hu, Xun Cao, Yao Yao* | N/A | |
| RGBD GS-ICP SLAM | Seongbo Ha, Jiung Yeon, Hyeonwoo Yu* | N/A | |
| Efficient NeRF Optimization - Not All Samples Remain Equally Hard | Juuso Korhonen*, Goutham Rangu, Hamed Rezazadegan Tavakoli, Juho Kannala | N/A | |
| Revisiting Calibration of Wide-Angle Radially Symmetric Cameras | Andrea Porfiri Dal Cin, Francesco Azzoni, Giacomo Boracchi, Luca Magri | N/A | |
| Rawformer: Unpaired Raw-to-Raw Translation for Learnable Camera ISPs | Georgy Perevozchikov, Nancy Mehta, Mahmoud Afifi, Radu Timofte | N/A | |
| Robust Incremental Structure-from-Motion with Hybrid Features | Shaohui Liu*, Yidan Gao, Tianyi Zhang, Rémi Pautrat, Johannes L Schönberger, Viktor Larsson, Marc Pollefeys | N/A | |
| Revisiting Domain-Adaptive Object Detection in Adverse Weather by the Generation and Composition of High-Quality Pseudo-Labels | Rui Zhao, Huibin Yan, Shuoyao Wang* | N/A | |
| Prediction Exposes Your Face: Black-box Model Inversion via Prediction Alignment | Yufan Liu*, Wanqian Zhang, Dayan Wu, Zheng Lin, jingzi Gu, Weiping Wang | N/A | |
| Noise Calibration: Plug-and-play Content-Preserving Video Enhancement using Pre-trained Video Diffusion Models | Qinyu Yang, Haoxin Chen, Yong Zhang, Menghan Xia, Xiaodong Cun, Zhixun Su, Ying Shan | N/A | |
| UniCal: Unified Neural Sensor Calibration | Ze Yang, George G Chen, Haowei Zhang, Kevin Ta, Ioan Andrei Bârsan, Daniel Murphy, Sivabalan Manivasagam, Raquel Urtasun* | N/A | |
| Mind the Interference: Retaining Pre-trained Knowledge in Parameter Efficient Continual Learning of Vision-Language Models | Longxiang Tang*, Zhuotao Tian, Kai Li, Chunming He, Hantao Zhou, Hengshuang Zhao, Xiu Li, Jiaya Jia | N/A | |
| Urban Waterlogging Detection: A Challenging Benchmark and Large-Small Model Co-Adapter | Suqi Song, Chenxu Zhang, Peng Zhang, Pengkun Li, Fenglong Song, Lei Zhang* | N/A | |
| Pseudo-Embedding for Generalized Few-Shot Point Cloud Segmentation | Chih-Jung Tsai, Hwann-Tzong Chen*, Tyng-Luh Liu | N/A | |
| WSI-VQA: Interpreting Whole Slide Images by Generative Visual Question Answering | Pingyi Chen, Chenglu Zhu, Sunyi Zheng, Honglin Li, Lin Yang | N/A | |
| ReMoS: 3D Motion-Conditioned Reaction Synthesis for Two-Person Interactions | Anindita Ghosh*, Rishabh Dabral, Vladislav Golyanik, Christian Theobalt, Philipp Slusallek | N/A | |
| Statewide Visual Geolocalization in the Wild | Florian Fervers*, Sebastian Bullinger, Christoph Bodensteiner, Michael Arens, Rainer Stiefelhagen | N/A | |
| Any2Point: Empowering Any-modality Transformers for Efficient 3D Understanding | Yiwen Tang, Ray Zhang, Jiaming Liu, Zoey Guo, Bin Zhao, Zhigang Wang, Dong Wang, Peng Gao, Hongsheng Li, Xuelong Li | N/A | |
| Trajectory-aligned Space-time Tokens for Few-shot Action Recognition | Pulkit Kumar*, Namitha Padmanabhan, Luke Luo, Sai Saketh Rambhatla, Abhinav Shrivastava | N/A | |
| EgoCVR: An Egocentric Benchmark for Fine-Grained Composed Video Retrieval | Thomas Hummel*, Shyamgopal Karthik, Mariana-Iuliana Georgescu, Zeynep Akata | N/A | |
| Synchronization of Projective Transformations | Rakshith Madhavan*, Andrea Fusiello, Federica Arrigoni | N/A | |
| TLControl: Trajectory and Language Control for Human Motion Synthesis | Weilin Wan*, Zhiyang Dou, Taku Komura, Wenping Wang, Dinesh Jayaraman, Lingjie Liu | N/A | |
| Insect Identification in the Wild: The AMI Dataset | Aditya Jain*, Fagner Cunha, Michael J Bunsen, Juan Sebastián Cañas, Léonard Pasi, Nathan Pinoy, Flemming Helsing, JoAnne Russo, Marc S Botham, Michael Sabourin, Jonathan Fréchette, Alexandre Anctil, Yacksecari Lopez, Eduardo Navarro, Filonila Pérez, Ana C Zamora, Jose Alejandro Ramirez-Silva, Jonathan Gagnon, Tom A August, Kim Bjerge, Alba Gomez Segura, Marc Belisle, Yves Basset, Kent P McFarland, David B Roy, Toke T Høye, Maxim Larrivee, David Rolnick | N/A | |
| Cross-view image geo-localization with Panorama-BEV Co-Retrieval Network | Junyan Ye, Zhutao Lv, Weijia Li, Jinhua Yu, Haote Yang, Huaping Zhong, Conghui He | N/A | |
| F-HOI: Toward Fine-grained Semantic-Aligned 3D Human-Object Interactions | Jie Yang, Xuesong Niu, Nan Jiang, Ruimao Zhang, Siyuan Huang | N/A | |
| Test-time Model Adaptation for Image Reconstruction Using Self-supervised Adaptive Layers | Yutian Zhao, Tianjing Zhang, Hui Ji* | N/A | |
| SHIC: Shape-Image Correspondences with no Keypoint Supervision | Aleksandar Shtedritski*, Christian Rupprecht, Andrea Vedaldi | N/A | |
| GenRC: Generative 3D Room Completion from Sparse Image Collections | Ming-Feng Li*, Yueh-Feng Ku, Hong-Xuan Yen, Chi Liu, Yu-Lun Liu, Albert Y Chen, Cheng-Hao Kuo, Min Sun | N/A | |
| A Probability-guided Sampler for Neural Implicit Surface Rendering | Gonçalo José Dias Pais, Valter André Piedade, Moitreya Chatterjee, Marcus Greiff, Pedro Miraldo* | N/A | |
| ReMatching: Low-Resolution Representations for Scalable Shape Correspondence | Filippo Maggioli*, Daniele Baieri, Emanuele Rodola, Simone Melzi | N/A | |
| Where am I? Scene Retrieval with Language | Jiaqi Chen*, Daniel Barath, Iro Armeni, Marc Pollefeys, Hermann Blum | N/A | |
| This Probably Looks Exactly Like That: An Invertible Prototypical Network | Zachariah Carmichael*, Timothy P Redgrave, Daniel Gonzalez Cedre, Walter Scheirer | N/A | |
| Arc2Face: A Foundation Model for ID-Consistent Human Faces | Foivos Paraperas Papantoniou*, Alexandros Lattas, Stylianos Moschoglou, Jiankang Deng, Bernhard Kainz, Stefanos Zafeiriou | N/A | |
| PhysAvatar: Learning the Physics of Dressed 3D Avatars from Visual Observations | Yang Zheng*, Qingqing Zhao, Guandao Yang, Wang Yifan, Donglai Xiang, Florian Dubost, Dmitry Lagun, Thabo Beeler, Federico Tombari, Leonidas Guibas, Gordon Wetzstein | N/A | |
| Revisiting Feature Disentanglement Strategy in Diffusion Training and Breaking Conditional Independence Assumption in Sampling | Wonwoong Cho, Hareesh Ravi, Midhun Harikumar, Vinh Khuc, Krishna Kumar Singh, Jingwan Lu, David Iseri Inouye, Ajinkya Kale | N/A | |
| SweepNet: Unsupervised Learning Shape Abstraction via Neural Sweepers | Mingrui Zhao*, Yizhi Wang, Fenggen Yu, Changqing Zou, Ali Mahdavi-Amiri | N/A | |
| Leveraging Thermal Modality to Enhance Reconstruction in Low-Light Conditions | Jiacong Xu*, Mingqian Liao, Ram Prabhakar Kathirvel, Vishal Patel | N/A | |
| On the Viability of Monocular Depth Pre-training for Semantic Segmentation | Dong Lao*, Fengyu Yang, Daniel Wang, Hyoungseob Park, Samuel Lu, Alex Wong, Stefano Soatto | N/A | |
| Fairness-aware Vision Transformer via Debiased Self-Attention | Yao Qiang, Chengyin Li, Prashant Khanduri, Dongxiao Zhu* | N/A | |
| EgoPet: Egomotion and Interaction Data from an Animal's Perspective | Amir Bar*, Arya Bakhtiar, Danny L Tran, Antonio Loquercio, Jathushan Rajasegaran, Yann LeCun, Amir Globerson, Trevor Darrell | N/A | |
| Deep Companion Learning: Enhancing Generalization Through Historical Consistency | Ruizhao Zhu, Venkatesh Saligrama | N/A | |
| Neural graphics texture compression supporting random access | Farzad Farhadzadeh*, Qiqi Hou, Hoang Le, Amir Said, Randall R Rauwendaal, Alex Bourd, Fatih Porikli | N/A | |
| Contrastive Learning with Synthetic Positives | Dewen Zeng*, Xinrong Hu, Yawen Wu, Xiaowei Xu, Yiyu Shi | N/A | |
| GeneralAD: Anomaly Detection Across Domains by Attending to Distorted Features | Luc P.J. Sträter*, Mohammadreza Salehi, Efstratios Gavves, Cees G.M. Snoek, Yuki M. Asano | N/A | |
| Interpretability-Guided Test-Time Adversarial Defense | Akshay Kulkarni*, Tsui-Wei Weng | N/A | |
| DIM: Dyadic Interaction Modeling for Social Behavior Generation | Minh Tran*, Di Chang, Maksim Siniukov, Mohammad Soleymani | N/A | |
| Tri^{2}-plane: Thinking Head Avatar via Feature Pyramid | Luchuan Song*, Pinxin Liu, Lele Chen, Guojun Yin, Chenliang Xu | N/A | |
| ControlCap: Controllable Region-level Captioning | Yuzhong Zhao, Liu Yue, Zonghao Guo, weijia wu, Chen Gong, Qixiang Ye, Fang Wan* | N/A | |
| Free Lunch for Gait Recognition: A Novel Relation Descriptor | Jilong Wang, Saihui Hou, Yan Huang, Chunshui Cao, Xu Liu, Yongzhen Huang, Tianzhu Zhang, Liang Wang | N/A | |
| SegVG: Transferring Object Bounding Box to Segmentation for Visual Grounding | Weitai Kang*, Gaowen Liu, Mubarak Shah, Yan Yan | N/A | |
| Adaptive Correspondence Scoring for Unsupervised Medical Image Registration | Xiaoran Zhang*, John C. Stendahl, Lawrence H. Staib, Albert J. Sinusas, Alex Wong, James S. Duncan | N/A | |
| MaxFusion: Plug&Play Multi-Modal Generation in Text-to-Image Diffusion Models | Nithin Gopalakrishnan Nair*, Jeya Maria Jose Valanarasu, Vishal Patel | N/A | |
| Watch Your Steps: Local Image and Scene Editing by Text Instructions | Ashkan Mirzaei*, Tristan T Aumentado-Armstrong, Marcus A Brubaker, Jonathan Kelly, Alex Levinshtein, Konstantinos G Derpanis, Igor Gilitschenski | N/A | |
| Forget More to Learn More: Domain-specific Feature Unlearning for Semi-supervised and Unsupervised Domain Adaptation | Hritam Basak*, Zhaozheng Yin | N/A | |
| 3x2: 3D Object Part Segmentation by 2D Semantic Correspondences | Anh Thai*, Weiyao Wang, Hao Tang, Stefan Stojanov, James M Rehg, Matt Feiszli | N/A | |
| Idea2Img: Iterative Self-Refinement with GPT-4V for Automatic Image Design and Generation | Zhengyuan Yang*, Jianfeng Wang, Linjie Li, Kevin Lin, Chung-Ching Lin, Zicheng Liu, Lijuan Wang | N/A | |
| Human-in-the-Loop Visual Re-ID for Population Size Estimation | Gustavo Perez*, Daniel Sheldon, Grant Van Horn, Subhransu Maji | N/A | |
| SEGIC: Unleashing the Emergent Correspondence for In-Context Segmentation | Lingchen Meng, Shiyi Lan, Hengduo Li, Jose M Alvarez, Zuxuan Wu*, Yu-Gang Jiang | N/A | |
| PointNeRF++: A multi-scale, point-based Neural Radiance Field | Weiwei Sun, Eduard Trulls, Yang-Che Tseng, Sneha Sambandam, Gopal Sharma, Andrea Tagliasacchi, Kwang Moo Yi* | N/A | |
| A Semantic Space is Worth 256 Language Descriptions: Make Stronger Segmentation Models with Descriptive Properties | Junfei Xiao, Ziqi Zhou, Wenxuan Li, Shiyi Lan, Jieru Mei, Zhiding Yu, Bingchen Zhao, Alan Yuille, Yuyin Zhou, Cihang Xie* | N/A | |
| UMG-CLIP: A Unified Multi-Granularity Vision Generalist for Open-World Understanding | Bowen Shi, Peisen Zhao, Zichen Wang, Yuhang Zhang, Yaoming Wang, Jin Li, Wenrui Dai, Junni Zou, Hongkai Xiong, Qi Tian, Xiaopeng Zhang* | N/A | |
| Fast View Synthesis of Casual Videos with Soup-of-Planes | Yao-Chih Lee, Zhoutong Zhang, Kevin Blackburn-Matzen, Simon Niklaus, Jianming Zhang, Jia-Bin Huang, Feng Liu | N/A | |
| Adaptive Human Trajectory Prediction via Latent Corridors | Neerja Thakkar*, Karttikeya Mangalam, Andrea Bajcsy, Jitendra Malik | N/A | |
| Video Question Answering with Procedural Programs | Rohan Choudhury*, Koichiro Niinuma, Kris Kitani, Laszlo A Jeni | N/A | |
| DGR-MIL: Exploring Diverse Global Representation in Multiple Instance Learning for Whole Slide Image Classification | Wenhui Zhu*, Xiwen Chen, Peijie Qiu, Aristeidis Sotiras, Abolfazl Razi, Yalin Wang | N/A | |
| TexGen: Text-Guided 3D Texture Generation with Multi-view Sampling and Resampling | Dong Huo*, Zixin Guo, Xinxin Zuo, Zhihao Shi, Juwei Lu, Peng Dai, Songcen Xu, Li Cheng, Yee-Hong Yang | N/A | |
| C2C: Component-to-Composition Learning for Zero-Shot Compositional Action Recognition | Rongchang Li, Zhenhua Feng, Tianyang Xu, Linze Li, Xiao-Jun Wu*, Muhammad Awais, Sara Atito, Josef Kittler | N/A | |
| LLMGA: Multimodal Large Language Model based Generation Assistant | bin xia*, Shiyin Wang, Yingfan Tao, Yitong Wang, Jiaya Jia | N/A | |
| Put Myself in Your Shoes: Lifting the Egocentric Perspective from Exocentric Videos | Mi Luo*, Zihui Xue, Alex Dimakis, Kristen Grauman | N/A | |
| Shape from Heat Conduction | Sriram Narayanan*, Mani Ramanagopal, Mark Sheinin, Aswin C. Sankaranarayanan, Srinivasa G. Narasimhan | N/A | |
| An Adaptive Screen-Space Meshing Approach for Normal Integration | Moritz Heep*, Eduard Zell | N/A | |
| Parrot: Pareto-optimal Multi-Reward Reinforcement Learning Framework for Text-to-Image Generation | Seung Hyun Lee, Yinxiao Li, Junjie Ke, Innfarn Yoo, Han Zhang, Jiahui Yu, Qifei Wang, Fei Deng, Glenn Entis, Junfeng He, Gang Li, Sangpil Kim, Irfan Essa, Feng Yang | N/A | |
| HandDGP: Camera-Space Hand Mesh Prediction with Differentiable Global Positioning | Eugene Valassakis, Guillermo Garcia-Hernando* | N/A | |
| Towards Latent Masked Image Modeling for Self-Supervised Visual Representation Learning | Yibing Wei, Abhinav Gupta, Pedro Morgado | N/A | |
| Nuvo: Neural UV Mapping for Unruly 3D Representations | Pratul Srinivasan*, Stephan J Garbin, Dor Verbin, Jonathan T Barron, Ben Mildenhall | N/A | |
| Towards High-Quality 3D Motion Transfer with Realistic Apparel Animation | Rong Wang*, Wei Mao, Changsheng Lu, HONGDONG LI | N/A | |
| AnyHome: Open-Vocabulary Large-Scale Indoor Scene Generation with First-Person View Exploration | Rao Fu*, Zehao Wen, Zichen Liu, Srinath Sridhar | N/A | |
| Better Call SAL: Towards Learning to Segment Anything in Lidar | Aljosa Osep*, Tim Meinhardt, Francesco Ferroni, Neehar Peri, Deva Ramanan, Laura Leal-Taixé | N/A | |
| DGInStyle: Domain-Generalizable Semantic Segmentation with Image Diffusion Models and Stylized Semantic Control | Yuru Jia, Lukas Hoyer, Shengyu Huang, Tianfu Wang, Luc Van Gool, Konrad Schindler, Anton Obukhov* | N/A | |
| DECOLLAGE: 3D Detailization by Controllable, Localized, and Learned Geometry Enhancement | Qimin Chen*, Zhiqin Chen, Vladimir G. Kim, Noam Aigerman, Hao Zhang, Siddhartha Chaudhuri | N/A | |
| Scene-aware Human Motion Forecasting via Mutual Distance Prediction | Chaoyue Xing*, Wei Mao, Miaomiao Liu | N/A | |
| FSGS: Real-Time Few-shot View Synthesis using Gaussian Splatting | Zehao Zhu, Zhiwen Fan, Yifan Jiang, Zhangyang Wang | N/A | |
| Open Panoramic Segmentation | Junwei Zheng, Ruiping Liu, Yufan Chen, Kunyu Peng, Chengzhi Wu, Kailun Yang, Jiaming Zhang*, Rainer Stiefelhagen | N/A | |
| iMatching: Imperative Correspondence Learning | Zitong Zhan, Dasong Gao, Yun-Jou Lin, Youjie Xia, Chen Wang | N/A | |
| COSMU: Complete 3D human shape from monocular unconstrained images | Marco Pesavento*, Marco Volino, Adrian Hilton | N/A | |
| MAP-ADAPT: Real-Time Quality-Adaptive Semantic 3D Maps | Jianhao Zheng, Daniel Barath, Marc Pollefeys, Iro Armeni | N/A | |
| Appearance-based Refinement for Object-Centric Motion Segmentation | Junyu Xie*, Weidi Xie, Andrew Zisserman | N/A | |
| SemiVL: Semi-Supervised Semantic Segmentation with Vision-Language Guidance | Lukas Hoyer*, David Joseph Tan, Muhammad Ferjad Naeem, Luc Van Gool, Federico Tombari | N/A | |
| Open Vocabulary Multi-Label Video Classification | Rohit Gupta*, Mamshad Nayeem Rizve, Jayakrishnan Unnikrishnan, Ashish Tawari, Son Tran, Mubarak Shah, Benjamin Yao, Trishul A Chilimbi | N/A | |
| Optimal Transport of Diverse Unsupervised Tasks for Robust Learning from Noisy Few-Shot Data | Xiaofan Que, Qi Yu* | N/A | |
| Regularizing Dynamic Radiance Fields with Kinematic Fields | Woobin Im, Geonho Cha, Sebin Lee, Jumin Lee, Juhyeong Seon, Dongyoon Wee, Sungeui Yoon* | N/A | |
| MICDrop: Masking Image and Depth Features via Complementary Dropout for Domain-Adaptive Semantic Segmentation | Linyan Yang, Lukas Hoyer, Mark Weber, Tobias Fischer, Dengxin Dai, Laura Leal-Taixé, Daniel Cremers, Marc Pollefeys, Luc Van Gool | N/A | |
| Efficient Pre-training for Localized Instruction Generation of Procedural Videos | Anil Batra*, Davide Moltisanti, Laura Sevilla-Lara, Marcus Rohrbach, Frank Keller | N/A | |
| MTKD: Multi-Teacher Knowledge Distillation for Image Super-Resolution | Yuxuan Jiang*, Chen Feng, Fan Zhang, David Bull | N/A | |
| DEAL: Disentangle and Localize Concept-level Explanations for VLMs | Tang Li*, Mengmeng Ma, Xi Peng | N/A | |
| Fast Encoding and Decoding for Implicit Video Representation | Hao Chen*, Saining Xie, Ser-Nam Lim, Abhinav Shrivastava | N/A | |
| Surf-D: Generating High-Quality Surfaces of Arbitrary Topologies Using Diffusion Models | Zhengming Yu, Zhiyang Dou, Xiaoxiao Long, Cheng Lin, Zekun Li, Yuan Liu, Norman Müller, Taku Komura, Marc Habermann, Christian Theobalt, Xin Li, Wenping Wang | N/A | |
| Diffusion-Refined VQA Annotations for Semi-Supervised Gaze Following | Qiaomu Miao*, Alexandros Graikos, Jingwei Zhang, Sounak Mondal, Minh Hoai, Dimitris Samaras | N/A | |
| IMMA: Immunizing text-to-image Models against Malicious Adaptation | Amber Yijia Zheng*, Raymond A. Yeh | N/A | |
| Motion-Oriented Compositional Neural Radiance Fields for Monocular Dynamic Human Modeling | Jaehyeok Kim, Dongyoon Wee, Dan Xu* | N/A | |
| GeoCalib: Learning Single-image Calibration with Geometric Optimization | Alexander Veicht, Paul-Edouard Sarlin, Philipp Lindenberger, Marc Pollefeys | N/A | |
| 3D Open-Vocabulary Panoptic Segmentation with 2D-3D Vision-Language Distillation | Zihao Xiao*, Longlong Jing, Shangxuan Wu, Alex Zihao Zhu, Jingwei Ji, Chiyu Max Jiang, Wei-Chih Hung, Thomas Funkhouser, Weicheng Kuo, Anelia Angelova, Yin Zhou, Shiwei Sheng | N/A | |
| Semicalibrated Relative Pose from an Affine Correspondence and Monodepth | Petr Hruby*, Marc Pollefeys, Daniel Barath | N/A | |
| Global Structure-from-Motion Revisited | Linfei Pan*, Daniel Barath, Marc Pollefeys, Johannes L Schönberger | N/A | |
| MobileNetV4: Universal Models for the Mobile Ecosystem | Danfeng Qin*, Chas H Leichner, Manolis Delakis, Marco Fornoni, Shixin Luo, Fan Yang, Weijun Wang, Colby Banbury, Chengxi Ye, Berkin Akin, Vaibhav Aggarwal, Tenghui Zhu, Daniele Moro, Andrew Howard | N/A | |
| Gravity-aligned Rotation Averaging with Circular Regression | Linfei Pan*, Marc Pollefeys, Daniel Barath | N/A | |
| MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation | Kunpeng Song, Yizhe Zhu, Bingchen Liu, Qing Yan, Ahmed Elgammal, Xiao Yang | N/A | |
| Find n' Propagate: Open-Vocabulary 3D Object Detection in Urban Environments | Djamahl Etchegaray*, Zi Helen Huang, Tatsuya Harada, Yadan Luo | N/A | |
| Quanta Video Restoration | Prateek Chennuri, Yiheng Chi, Enze Jiang, GM Dilshan Godaliyadda, Abhiram Gnanasambandam, Hamid R Sheikh, Istvan Gyongy, Stanley H Chan | N/A | |
| Concept Sliders: LoRA Adaptors for Precise Control in Diffusion Models | Rohit Gandikota*, Joanna Materzynska, Tingrui Zhou, Antonio Torralba, David Bau | N/A | |
| CAT-SAM: Conditional Tuning for Few-Shot Adaptation of Segment Anything Model | Aoran Xiao, Weihao Xuan, Heli Qi, Yun Xing, Ruijie Ren, Xiaoqin Zhang, Ling Shao, Shijian Lu* | N/A | |
| ScribblePrompt: Fast and Flexible Interactive Segmentation for Any Biomedical Image | Hallee E. Wong*, Marianne Rakic, John Guttag, Adrian V. Dalca | N/A | |
| POCA: Post-training Quantization with Temporal Alignment for Codec Avatars | Jian Meng, Yuecheng Li, Leo (Chenghui) Li, Syed Shakib Sarwar, Dilin Wang, Jae-sun Seo* | N/A | |
| HYPE: Hyperbolic Entailment Filtering for Underspecified Images and Texts | Wonjae Kim*, Sanghyuk Chun, Taekyung Kim, Dongyoon Han, Sangdoo Yun | N/A | |
| Finding Meaning in Points: Weakly Supervised Semantic Segmentation for Event Cameras | Hoonhee Cho, Sung-Hoon Yoon, Hyeokjun Kweon, Kuk-Jin Yoon* | N/A | |
| Unsupervised Dense Prediction using Differentiable Normalized Cuts | Yanbin Liu*, Stephen Gould | N/A | |
| Boosting the Power of Small Multimodal Reasoning Models to Match Larger Models with Self-Consistency Training | Cheng Tan, Jingxuan Wei, Zhangyang Gao, Linzhuang Sun, Siyuan Li, Ruifeng Guo, BiHui Yu, Stan Z. Li* | N/A | |
| Scaling Up Personalized Image Aesthetic Assessment via Task Vector Customization | Jooyeol Yun*, Jaegul Choo | N/A | |
| AutoDIR: Automatic All-in-One Image Restoration with Latent Diffusion | Yitong Jiang, Zhaoyang Zhang, Tianfan Xue, Jinwei Gu | N/A | |
| Receler: Reliable Concept Erasing of Text-to-Image Diffusion Models via Lightweight Erasers | Chi-Pin Huang*, Kai-Po Chang, Chung-Ting Tsai, Yung-Hsuan Lai, Fu-En Yang, Yu-Chiang Frank Wang | N/A | |
| EINet: Point Cloud Completion via Extrapolation and Interpolation | Pingping Cai*, Canyu Zhang, LINGJIA SHI, Lili Wang, Nasrin Imanpour, Song Wang | N/A | |
| Personalized Video Relighting With an At-Home Light Stage | Jun Myeong Choi*, Max Christman, Roni Sengupta | N/A | |
| Temporal Residual Guided Diffusion Framework for Event-Driven Video Reconstruction | Lin Zhu*, Yunlong Zheng, Yijun Zhang, Xiao Wang, Lizhi Wang, Hua Huang | N/A | |
| A Secure Image Watermarking Framework with Statistical Guarantees via Adversarial Attacks on Secret Key Networks | Feiyu CHEN*, Wei Lin, Ziquan Liu, Antoni Chan | N/A | |
| SPIRE: Semantic Prompt-Driven Image Restoration | Chenyang QI*, Zhengzhong Tu, Keren Ye, Mauricio Delbracio, Peyman Milanfar, Qifeng Chen, Hossein Talebi | N/A | |
| Free-ATM: Harnessing Free Attention Masks for Representation Learning on Diffusion-Generated Images | David Junhao Zhang, Mutian Xu, Jay Zhangjie Wu, Chuhui Xue, Wenqing Zhang, Xiaoguang Han, Song Bai, Mike Zheng Shou | N/A | |
| HiT-SR: Hierarchical Transformer for Efficient Image Super-Resolution | XIANG ZHANG*, Yulun Zhang, Fisher Yu | N/A | |
| Audio-Synchronized Visual Animation | Lin Zhang, Shentong Mo, Yijing Zhang, Pedro Morgado* | N/A | |
| Expressive Whole-Body 3D Gaussian Avatar | Gyeongsik Moon*, Takaaki Shiratori, Shunsuke Saito | N/A | |
| Canonical Shape Projection is All You Need for 3D Few-shot Class Incremental Learning | Ali Cheraghian*, Zeeshan Hayder, Sameera Ramasinghe, Shafin Rahman, Javad Jafaryahya, Lars Petersson, Mehrtash Harandi | N/A | |
| Controllable Human-Object Interaction Synthesis | Jiaman Li*, Alexander Clegg, Roozbeh Mottaghi, Jiajun Wu, Xavier Puig, C. Karen Liu | N/A | |
| High-Fidelity and Transferable NeRF Editing by Frequency Decomposition | Yisheng He, Weihao Yuan, Siyu Zhu, Zilong Dong, Liefeng Bo, Qixing Huang | N/A | |
| DoughNet: A Visual Predictive Model for Topological Manipulation of Deformable Objects | Dominik Bauer*, Zhenjia Xu, Shuran Song | N/A | |
| PAV: Personalized Head Avatar from Unstructured Video Collection | Akin Caliskan*, Berkay Kicanaoglu, Hyeongwoo Kim | N/A | |
| Strike a Balance in Continual Panoptic Segmentation | Jinpeng Chen, Runmin Cong, Yuxuan Luo, Horace Ho Shing Ip, Sam Kwong | N/A | |
| In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation | Dahyun Kang, Minsu Cho* | N/A | |
| MultiDelete for Multimodal Machine Unlearning | Jiali Cheng*, Hadi Amiri | N/A | |
| Unified Local-Cloud Decision-Making via Reinforcement Learning | Kathakoli Sengupta, Zhongkai Shangguan, Sandesh Bharadwaj, Sanjay Arora, Eshed Ohn-Bar*, Renato Mancuso | N/A | |
| UniTalker: Scaling up Audio-Driven 3D Facial Animation through A Unified Model | Xiangyu Fan, Jiaqi Li, Zhiqian Lin, Weiye Xiao, Lei Yang | N/A | |
| Robo-ABC: Affordance Generalization Beyond Categories via Semantic Correspondence for Robot Manipulation | Yuanchen Ju, Kaizhe Hu, Guowei Zhang, Gu Zhang, Mingrun Jiang, Huazhe Xu* | N/A | |
| Efficient Frequency-Domain Image Deraining with Contrastive Regularization | Ning Gao, Xingyu Jiang, Xiuhui Zhang, Yue Deng* | N/A | |
| Stitched ViTs are Flexible Vision Backbones | Zizheng Pan, Jing Liu, Haoyu He, Jianfei Cai, Bohan Zhuang | N/A | |
| TrajPrompt: Aligning Color Trajectory with Vision-Language Representations | Li-Wu Tsao, Hao-Tang Tsui, Yu-Rou Tuan, Pei-Chi Chen, Kuan-Lin Wang, Jhih-Ciang Wu, Hong-Han Shuai, Wen-Huang Cheng | N/A | |
| SemReg: Semantics Constrained Point Cloud Registration | Sheldon Fung, Xuequan Lu*, Dasith de Silva Edirimuni, Wei Pan, Xiao Liu, HONGDONG LI | N/A | |
| Cascade-Zero123: One Image to Highly Consistent 3D with Self-Prompted Nearby Views | Yabo Chen, Jiemin Fang, Yuyang Huang, Taoran Yi, Xiaopeng Zhang, Lingxi Xie, Xinggang Wang, Wenrui Dai, Hongkai Xiong, Qi Tian | N/A | |
| RoScenes: A Large-scale Multi-view 3D Dataset for Roadside Perception | Xiaosu Zhu, Hualian Sheng, Sijia Cai, Bing Deng, Shaopeng Yang, Qiao Liang, Ken Chen, Lianli Gao, Jingkuan Song, Jieping Ye | N/A | |
| ReSyncer: Rewiring Style-based Generator for Unified Audio-Visually Synced Facial Performer | Jiazhi Guan*, Zhiliang Xu, Hang Zhou, Kaisiyuan Wang, Shengyi He, Zhanwang Zhang, Borong Liang, Haocheng Feng, Errui Ding, Jingtuo Liu, Jingdong Wang, Youjian Zhao, Ziwei Liu | N/A | |
| Language-Driven Physics-Based Scene Synthesis and Editing via Feature Splatting | Ri-Zhao Qiu*, Ge Yang, Weijia Zeng, Xiaolong Wang | N/A | |
| AlignDiff: Aligning Diffusion Models for General Few-Shot Segmentation | Ri-Zhao Qiu*, Yu-Xiong Wang, Kris Hauser | N/A | |
| SkateFormer: Skeletal-Temporal Transformer for Human Action Recognition | Jeonghyeok Do, Munchurl Kim* | N/A | |
| R^2-Tuning: Efficient Image-to-Video Transfer Learning for Video Temporal Grounding | Ye Liu, Jixuan He, Wanhua Li, Junsik Kim, Donglai Wei, Hanspeter Pfister, Chang Wen Chen | N/A | |
| Tree-D Fusion: Simulation-Ready Tree Dataset from Single Images with Diffusion Priors | Jae Joong Lee, Bosheng Li, Sara M Beery, Jonathan Huang, Songlin Fei, Raymond A. Yeh, Bedrich Benes* | N/A | |
| Parameterization-driven Neural Surface Reconstruction for Object-oriented Editing in Neural Rendering | Baixin Xu, Jiangbei Hu, Fei Hou, Kwan-Yee Lin, Wayne Wu, Chen Qian, Ying He* | N/A | |
| DomainFusion: Generalizing To Unseen Domains with Latent Diffusion Models | Yuyang Huang, Yabo Chen, Yuchen Liu, Xiaopeng Zhang, Wenrui Dai, Hongkai Xiong, Qi Tian | N/A | |
| Open-Set Recognition in the Age of Vision-Language Models | Dimity Miller*, Niko Suenderhauf, Alex Kenna, Keita Mason | N/A | |
| Unsqueeze [CLS] Bottleneck to Learn Rich Representations | Qing Su*, Shihao Ji | N/A | |
| Robust Multimodal Learning via Representation Decoupling | Shicai Wei, Yang Luo, Yuji Wang, Chunbo Luo* | N/A | |
| Object-Conditioned Energy-Based Attention Map Alignment in Text-to-Image Diffusion Models | Yasi Zhang*, Peiyu Yu, Ying Nian Wu | N/A | |
| WiMANS: A Benchmark Dataset for WiFi-based Multi-user Activity Sensing | Shuokang Huang, Kaihan Li, Di You, Yichong Chen, Arvin Lin, Siying Liu, Xiaohui Li, Julie A. McCann | N/A | |
| Embedding-Free Transformer with Inference Spatial Reduction for Efficient Semantic Segmentation | Hyunwoo Yu, Yubin Cho, Beoungwoo Kang, Seunghun Moon, Kyeongbo Kong, Suk-Ju Kang* | N/A | |
| VeCLIP: Improving CLIP Training via Visual-enriched Captions | Zhengfeng Lai*, Haotian Zhang, Bowen Zhang, Wentao Wu, Haoping Bai, Aleksei Timofeev, Xianzhi Du, Zhe Gan, Jiulong Shan, Chen-Nee Chuah, Yinfei Yang, Meng Cao | N/A | |
| Three Things We Need to Know About Transferring Stable Diffusion to Visual Dense Prediction Tasks | Manyuan Zhang*, Guanglu Song, Xiaoyu Shi, Yu Liu, Hongsheng Li | N/A | |
| Learning Representations from Foundation Models for Domain Generalized Stereo Matching | Yongjian Zhang, Longguang Wang, Kunhong Li, WANG Yun, Yulan Guo* | N/A | |
| Spike-Temporal Latent Representation for Energy-Efficient Event-to-Video Reconstruction | Jianxiong Tang, Jian-Huang Lai, Lingxiao Yang, Xiaohua Xie | N/A | |
| Effective Lymph Nodes Detection in CT Scans Using Location Debiased Query Selection and Contrastive Query Representation in Transformer | Qinji Yu, Yirui Wang, Ke Yan, Haoshen Li, Dazhou Guo, Li Zhang, Na Shen, Qifeng Wang, Xiaowei Ding, Le Lu, Xianghua Ye, Dakai Jin | N/A | |
| Chat-Edit-3D: Interactive 3D Scene Editing via Text Prompts | Shuangkang Fang, Yufeng Wang, Yi-Hsuan Tsai, Yi Yang, Wenrui Ding, Shuchang Zhou, Ming-Hsuan Yang | N/A | |
| Event-Adapted Video Super-Resolution | Zeyu Xiao, Dachun Kai, Yueyi Zhang, Zheng-Jun Zha, Xiaoyan Sun, Zhiwei Xiong* | N/A | |
| Look Hear: Gaze Prediction for Speech-directed Human Attention | Sounak Mondal*, Seoyoung Ahn, Zhibo Yang, Niranjan Balasubramanian, Dimitris Samaras, Gregory Zelinsky, Minh Hoai | N/A | |
| Raising the Ceiling: Conflict-Free Local Feature Matching with Dynamic View Switching | Xiaoyong Lu, Songlin Du | N/A | |
| Q&A Prompts: Discovering Rich Visual Clues through Mining Question-Answer Prompts for VQA requiring Diverse World Knowledge | Haibo Wang, Weifeng Ge | N/A | |
| Catastrophic Overfitting: A Potential Blessing in Disguise | MN Zhao, Lihe Zhang*, Yuqiu Kong, Baocai Yin | N/A | |
| Long-range Turbulence Mitigation: A Large-scale Dataset and A Coarse-to-fine Framework | Shengqi Xu, Run Sun, Yi Chang*, Shuning Cao, Xueyao Xiao, Luxin Yan | N/A | |
| SparseCtrl: Adding Sparse Controls to Text-to-Video Diffusion Models | Yuwei Guo, Ceyuan Yang, Anyi Rao, Maneesh Agrawala, Dahua Lin, Bo Dai* | N/A | |
| Visual Alignment Pre-training for Sign Language Translation | Peiqi Jiao, Yuecong Min, Xilin Chen* | N/A | |
| Parrot Captions Teach CLIP to Spot Text | Yiqi Lin, Conghui He*, Alex Jinpeng Wang, Bin Wang, Weijia Li, Mike Zheng Shou | N/A | |
| Solving Motion Planning Tasks with a Scalable Generative Model | Yihan Hu, Siqi Chai, Zhening Yang, Jingyu Qian, Kun Li, Wenxin Shao, Haichao Zhang, Wei Xu, Qiang Liu | N/A | |
| Griffon: Spelling out All Object Locations at Any Granularity with Large Language Models | Yufei Zhan, Yousong Zhu*, Zhiyang Chen, Fan Yang, Ming Tang, Jinqiao Wang | N/A | |
| Vision-Language Action Knowledge Learning for Semantic-Aware Action Quality Assessment | Huangbiao Xu, Xiao Ke*, Yuezhou Li, Rui Xu, Huanqi Wu, Xiaofeng Lin, Wenzhong Guo | N/A | |
| Knowledge Transfer with Simulated Inter-Image Erasing for Weakly Supervised Semantic Segmentation | Tao Chen*, Xiruo Jiang, Gensheng Pei, Zeren Sun, Yucheng Wang, Yazhou Yao | N/A | |
| BurstM: Deep Burst Multi-scale SR using Fourier Space with Optical Flow | EungGu Kang*, Byeonghun Lee, Sunghoon Im, Kyong Hwan Jin | N/A | |
| Diffusion Reward: Learning Rewards via Conditional Video Diffusion | Tao Huang, Guangqi Jiang, Yanjie Ze, Huazhe Xu | N/A | |
| Recursive Visual Programming | Jiaxin Ge*, Sanjay Subramanian, Baifeng Shi, Roei Herzig, Trevor Darrell | N/A | |
| LLaVA-Grounding: Grounded Visual Chat with Large Multimodal Models | Hao Zhang*, Hongyang Li, Feng Li, Tianhe Ren, Xueyan Zou, Shilong Liu, Shijia Huang, Jianfeng Gao, Lei Zhang, Chunyuan Li, Jianwei Yang | N/A | |
| Prompt-Driven Contrastive Learning for Transferable Adversarial Attacks | Hunmin Yang, Jongoh Jeong, Kuk-Jin Yoon* | N/A | |
| Learning to Adapt SAM for Segmenting Cross-domain Point Clouds | Xidong Peng, Runnan Chen, Feng Qiao, Lingdong Kong, Youquan Liu, Yujing Sun, Tai Wang, Xinge Zhu, Yuexin Ma | N/A | |
| Learning to Enhance Aperture Phasor Field for Non-Line-of-Sight Imaging | In Cho, Hyunbo Shim, Seon Joo Kim* | N/A | |
| ViewFormer: Exploring Spatiotemporal Modeling for Multi-View 3D Occupancy Perception via View-Guided Transformers | Jinke Li, Xiao He, Chonghua Zhou, Xiaoqiang Cheng, Yang Wen, Dan Zhang* | N/A | |
| Fine-grained Dynamic Network for Generic Event Boundary Detection | Ziwei Zheng, Lijun He, Le Yang, Fan Li* | N/A | |
| Take A Step Back: Rethinking the Two Stages in Visual Reasoning | Mingyu Zhang, Jiting Cai, Mingyu Liu, Yue Xu, Cewu Lu, Yong-Lu Li* | N/A | |
| AlignZeg: Mitigating Objective Misalignment for Zero-shot Semantic Segmentation | Jiannan Ge*, Lingxi Xie, Hongtao Xie, Pandeng Li, Xiaopeng Zhang, Yongdong Zhang, Qi Tian | N/A | |
| Learning with Counterfactual Explanations for Radiology Report Generation | Mingjie Li, Haokun Lin, Liang Qiu, Xiaodan Liang, Ling Chen, Abdulmotaleb Elsaddik, Xiaojun Chang | N/A | |
| SpeedUpNet: A Plug-and-Play Adapter Network for Accelerating Text-to-Image Diffusion Models | Weilong Chai*, Dandan Zheng, Jiajiong Cao, Zhiquan Chen, Changbao Wang, Chenguang Ma | N/A | |
| Better Regression Makes Better Test-time Adaptive 3D Object Detection | Jiakang Yuan, Bo Zhang, Kaixiong Gong, Xiangyu Yue, Botian Shi, Yu Qiao, Tao Chen* | N/A | |
| ShapeLLM: Universal 3D Object Understanding for Embodied Interaction | Zekun Qi, Runpei Dong, Shaochen Zhang, Haoran Geng, Chunrui Han, Zheng Ge, Li Yi, Kaisheng Ma | N/A | |
| Content-Aware Radiance Fields: Aligning Model Complexity with Scene Intricacy Through Learned Bitwidth Quantization | Weihang Liu, Xue Xian Zheng, Jingyi Yu, Xin Lou* | N/A | |
| Finding Visual Task Vectors | Alberto Hojel, Yutong Bai, Trevor Darrell, Amir Globerson, Amir Bar | N/A | |
| Connecting Consistency Distillation to Score Distillation for Text-to-3D Generation | Zongrui Li, Minghui Hu, Qian Zheng, Xudong Jiang | N/A | |
| Event Camera Data Dense Pre-training | Yan Yang, Liyuan Pan*, Liu Liu | N/A | |
| Distractors-Immune Representation Learning with Cross-modal Contrastive Regularization for Change Captioning | Yunbin Tu*, Liang Li, Li Su, Chenggang Yan, Qingming Huang | N/A | |
| Rethinking Image-to-Video Adaptation: An Object-centric Perspective | Rui Qian*, Shuangrui Ding, Dahua Lin | N/A | |
| Layer-Wise Relevance Propagation with Conservation Property for ResNet | Seitaro Otsuki, Tsumugi Iida, Félix Doublet, Tsubasa Hirakawa, Takayoshi Yamashita, Hironobu Fujiyoshi, Komei Sugiura* | N/A | |
| DECap: Towards Generalized Explicit Caption Editing via Diffusion Mechanism | Zhen Wang, Xinyun Jiang, Jun Xiao, Tao Chen, Long Chen* | N/A | |
| EgoLifter: Open-world 3D Segmentation for Egocentric Perception | Qiao Gu, Zhaoyang Lv, Duncan Frost, Simon Green, Julian Straub, Chris Sweeney* | N/A | |
| MEVG: Multi-event Video Generation with Text-to-Video Models | Gyeongrok Oh, Jaehwan Jeong, Sieun Kim, Wonmin Byeon, Jinkyu Kim, Sungwoong Kim, Sangpil Kim | N/A | |
| Open-Vocabulary SAM: Segment and Recognize Twenty-thousand Classes Interactively | Haobo Yuan, Xiangtai Li*, Chong Zhou, Yining Li, Kai Chen, Chen Change Loy | N/A | |
| Data-to-Model Distillation: Data-Efficient Learning Framework | Ahmad Sajedi*, Samir Khaki, Lucy Z. Liu, Ehsan Amjadian, Yuri A. Lawryshyn, Konstantinos N. Plataniotis | N/A | |
| DiffuX2CT: Diffusion Learning to Reconstruct CT Images from Biplanar X-Rays | Xuhui Liu, Zhi Qiao, Runkun Liu, Hong Li, Xiantong Zhen, Zhen Qian, Juan Zhang, Baochang Zhang | N/A | |
| AdaIFL: Adaptive Image Forgery Localization via a Dynamic and Importance-aware Transformer Network | Yuxi Li, Fuyuan Cheng, Wangbo Yu, Guangshuo Wang, Guibo Luo, Yuesheng Zhu* | N/A | |
| ComFusion: Enhancing Personalized Generation by Instance-Scene Compositing and Fusion | Yan Hong, Yuxuan Duan, Bo Zhang, Haoxing Chen, Jun Lan, Huijia Zhu, Weiqiang Wang, Jianfu Zhang | N/A | |
| ML-SemReg: Boosting Point Cloud Registration with Multi-level Semantic Consistency | Shaocheng Yan, Pengcheng Shi, Jiayuan Li* | N/A | |
| Mask as Supervision: Leveraging Unified Mask Information for Unsupervised 3D Pose Estimation | Yuchen Yang, Yu Qiao, Xiao Sun* | N/A | |
| MoVideo: Motion-Aware Video Generation with Diffusion Models | Jingyun Liang, Yuchen Fan, Kai Zhang, Radu Timofte, Luc Van Gool, Rakesh Ranjan | N/A | |
| SHERL: Synthesizing High Accuracy and Efficient Memory for Resource-Limited Transfer Learning | Haiwen Diao, Bo Wan, Xu Jia, Yunzhi Zhuge, Ying Zhang, Huchuan Lu, Long Chen | N/A | |
| MonoTTA: Fully Test-Time Adaptation for Monocular 3D Object Detection | Hongbin Lin, Yifan Zhang, Shuaicheng Niu, Shuguang Cui, Zhen Li* | N/A | |
| RangeLDM: Fast Realistic LiDAR Point Cloud Generation | Qianjiang Hu, Zhimin Zhang, Wei Hu* | N/A | |
| Learn to Optimize Denoising Scores: A Unified and Improved Diffusion Prior for 3D Generation | Xiaofeng Yang*, Yiwen Chen, Cheng Chen, Chi Zhang, Yi Xu, Xulei Yang, Fayao Liu, Guosheng Lin | N/A | |
| Be-Your-Outpainter: Mastering Video Outpainting through Input-Specific Adaptation | Fu-Yun Wang, Xiaoshi Wu, Zhaoyang Huang, Xiaoyu Shi, Dazhong Shen, Guanglu Song, Yu Liu, Hongsheng Li | N/A | |
| Physically Plausible Color Correction for Neural Radiance Fields | Qi Zhang, Ying Feng, Hongdong Li | N/A | |
| Unifying 3D Vision-Language Understanding via Promptable Queries | Ziyu Zhu, Zhuofan Zhang, Xiaojian Ma, Xuesong Niu, Yixin Chen, Baoxiong Jia, Zhidong Deng, Siyuan Huang, Qing Li | N/A | |
| Model Stock: All we need is just a few fine-tuned models | Dong-Hwan Jang, Sangdoo Yun, Dongyoon Han* | N/A | |
| Motion-Guided Latent Diffusion for Temporally Consistent Real-world Video Super-resolution | Xi Yang*, Chenhang He, Jianqi Ma, Lei Zhang | N/A | |
| PoseCrafter: One-Shot Personalized Video Synthesis Following Flexible Pose Control | Yong Zhong, Min Zhao, Zebin You, Xiaofeng Yu, Changwang Zhang, Chongxuan Li* | N/A | |
| MAD-DR: Map Compression for Visual Localization with Matchness Aware Descriptor Dimension Reduction | Qiang Wang* | N/A | |
| Benchmarking Object Detectors with COCO: A New Path Forward | Shweta Singh, Aayan Yadav, Jitesh Jain, Humphrey Shi, Justin Johnson, Karan Desai* | N/A | |
| Adaptive High-Frequency Transformer for Diverse Wildlife Re-Identification | Chenyue Li, Shuoyi Chen, Mang Ye* | N/A | |
| WPS-SAM: Towards Weakly-Supervised Part Segmentation with Foundation Models | Xin-Jian Wu, Ruisong Zhang, Jie Qin, Shijie Ma, Cheng-Lin Liu | N/A | |
| Lane Graph as Path: Continuity-preserving Path-wise Modeling for Online Lane Graph Construction | Bencheng Liao, Shaoyu Chen, Bo Jiang, Tianheng Cheng, Qian Zhang, Wenyu Liu, Chang Huang, Xinggang Wang* | N/A | |
| DeCo: Decoupled Human-Centered Diffusion Video Editing with Motion Consistency | Xiaojing Zhong, Xinyi Huang, Xiaofeng Yang, Guosheng Lin, Qingyao Wu | N/A | |
| Unleashing the Potential of the Semantic Latent Space in Diffusion Models for Image Dehazing | Zizheng Yang, Hu Yu, Bing Li, Jinghao Zhang, Jie Huang, Feng Zhao* | N/A | |
| Uncertainty-aware sign language video retrieval with probability distribution modeling | Xuan Wu, Hongxiang Li, Yuanjiang Luo, Xuxin Cheng, Xianwei Zhuang, Meng Cao, Keren Fu | N/A | |
| NeRMo: Learning Implicit Neural Representations for 3D Human Motion Prediction | Dong Wei, Huaijiang Sun, Xiaoning Sun*, Shengxiang Hu | N/A | |
| Bridging Synthetic and Real Worlds for Pre-training Scene Text Detectors | Tongkun Guan, Wei Shen*, Xue Yang, Xuehui Wang, Xiaokang Yang | N/A | |
| VLAD-BuFF: Burst-aware Fast Feature Aggregation for Visual Place Recognition | Ahmad Khaliq, Ming Xu, Stephen Hausler, Michael J Milford, Sourav Garg* | N/A | |
| DSA: Discriminative Scatter Analysis for Early Smoke Segmentation | Lujian Yao, Haitao Zhao, Jingchao Peng, Zhongze Wang, Kaijie Zhao | N/A | |
| SAFARI: Adaptive Sequence Transformer for Weakly Supervised Referring Expression Segmentation | Sayan Nag*, Koustava Goswami, Srikrishna Karanam | N/A | |
| KFD-NeRF: Rethinking Dynamic NeRF with Kalman Filter | Yifan Zhan, Zhuoxiao Li, Muyao Niu, Zhihang Zhong, Shohei Nobuhara, Ko Nishino, Yinqiang Zheng* | N/A | |
| Physical-Based Event Camera Simulator | Haiqian Han, Jiacheng Lyu, Jianing Li, Henglu Wei, Cheng Li, Yajing Wei, Shu Chen, Xiangyang Ji | N/A | |
| V-IRL: Grounding Virtual Intelligence in Real Life | Jihan Yang*, Runyu Ding, Ellis L Brown, Xiaojuan Qi, Saining Xie | N/A | |
| Adversarial Prompt Tuning for Vision-Language Models | Jiaming Zhang, Xingjun Ma, Xin Wang, Lingyu Qiu, Jiaqi Wang, Yu-Gang Jiang, Jitao Sang | N/A | |
| Relightable 3D Gaussians: Realistic Point Cloud Relighting with BRDF Decomposition and Ray Tracing | Jian Gao, Chun Gu, Youtian Lin, Zhihao Li, Hao Zhu, Xun Cao, Li Zhang, Yao Yao | N/A | |
| Mono-ViFI: A Unified Learning Framework for Self-supervised Single- and Multi-frame Monocular Depth Estimation | Jinfeng Liu*, Lingtong Kong, Bo Li, Zerong Wang, Hong Gu, Jinwei Chen | N/A | |
| CC-SAM: Enhancing SAM with Cross-feature Attention and Context for Ultrasound Image Segmentation | Shreyank N Gowda*, David A Clifton | N/A | |
| An Efficient and Effective Transformer Decoder-Based Framework for Multi-Task Visual Grounding | Wei Chen, Long Chen, Yu Wu* | N/A | |
| Think2Drive: Efficient Reinforcement Learning by Thinking with Latent World Model for Autonomous Driving (in CARLA-v2) | Qifeng Li*, Xiaosong Jia, Shaobo Wang, Junchi Yan | N/A | |
| PanGu-Draw: Advancing Resource-Efficient Text-to-Image Synthesis with Time-Decoupled Training and Reusable Coop-Diffusion | Guansong Lu*, Yuanfan Guo, Jianhua Han, Minzhe Niu, Yihan Zeng, Songcen Xu, Zeyi Huang, Zhao Zhong, Wei Zhang, Hang Xu | N/A | |
| X-InstructBLIP: A Framework for Aligning Image, 3D, Audio, Video to LLMs and its Emergent Cross-modal Reasoning | Artemis Panagopoulou*, Le Xue, Ning Yu, Junnan Li, Dongxu Li, Shafiq Joty, Ran Xu, Silvio Savarese, Caiming Xiong, Juan Carlos Niebles | N/A | |
| Learning Neural Volumetric Pose Features for Camera Localization | Jingyu Lin, Jiaqi Gu, Bojian Wu, Lubin Fan, Renjie Chen, Ligang Liu, Jieping Ye | N/A | |
| Betrayed by Attention: A Simple yet Effective Approach for Self-supervised Video Object Segmentation | Shuangrui Ding*, Rui Qian, Haohang Xu, Dahua Lin, Hongkai Xiong | N/A | |
| REFRAME: Reflective Surface Real-Time Rendering for Mobile Devices | Chaojie Ji*, Yufeng Li, Yiyi Liao | N/A | |
| Self-Training Room Layout via Geometry-aware Ray-casting | Bolivar Solarte, Chin-Hsuan Wu, Jin-Cheng Jhang, Jonathan Lee, Yi-Hsuan Tsai, Min Sun | N/A | |
| Closed-Loop Unsupervised Representation Disentanglement with $\beta$-VAE Distillation and Diffusion Probabilistic Feedback | Xin Jin, Bohan Li, Baao Xie, Wenyao Zhang, Jinming Liu, Ziqiang Li, Tao Yang, Wenjun Zeng | N/A | |
| Rethinking Weakly-supervised Video Temporal Grounding From a Game Perspective | Xiang Fang, Zeyu Xiong, Wanlong Fang, Xiaoye Qu, Chen Chen, Jianfeng Dong, Keke Tang, Pan Zhou, Yu Cheng, Daizong Liu | N/A | |
| Every Pixel Has its Moments: Ultra-High-Resolution Unpaired Image-to-Image Translation via Dense Normalization | Ming-Yang Ho, Che-Ming Wu, Min-Sheng Wu, Yufeng Jane Tseng* | N/A | |
| ZoLA: Zero-Shot Creative Long Animation Generation with Short Video Model | Fu-Yun Wang, Zhaoyang Huang, Qiang Ma, Guanglu Song, Xudong Lu, Weikang Bian, Yijin Li, Yu Liu, Hongsheng Li* | N/A | |
| Parameter-Efficient and Memory-Efficient Tuning for Vision Transformer: A Disentangled Approach | Taolin Zhang, Jiawang Bai, Zhihe Lu, Dongze Lian, Genping Wang, Xinchao Wang, Shu-Tao Xia | N/A | |
| Restore Anything with Masks: Leveraging Mask Image Modeling for Blind All-in-One Image Restoration | Chujie Qin, Ruiqi Wu, Zikun Liu, Xin Lin, Chun-Le Guo, Hyun Hee Park, Chongyi Li* | N/A | |
| When Fast Fourier Transform Meets Transformer for Image Restoration | Xingyu Jiang, Xiuhui Zhang, Ning Gao, Yue Deng* | N/A | |
| Dolphins: Multimodal Language Model for Driving | Yingzi Ma, Yulong Cao, Jiachen Sun, Marco Pavone, Chaowei Xiao* | N/A | |
| Rethinking Video Deblurring with Wavelet-Aware Dynamic Transformer and Diffusion Model | Chen Rao, Guangyuan Li, Zehua Lan, Jiakai Sun, Junsheng Luan, Wei Xing, Lei Zhao, Huaizhong Lin*, Jianfeng Dong, Dalong Zhang | N/A | |
| CamoTeacher: Dual-Rotation Consistency Learning for Semi-Supervised Camouflaged Object Detection | Xunfa Lai, Zhiyu Yang, Jie Hu, ShengChuan Zhang*, Liujuan Cao, Guannan Jiang, Songan Zhang, Zhiyu Wang, Rongrong Ji | N/A | |
| Placing Objects in Context via Inpainting for Out-of-distribution Segmentation | Pau de Jorge Aranda*, Riccardo Volpi, Puneet Dokania, Philip Torr, Gregory Rogez | N/A | |
| Textual Grounding for Open-vocabulary Visual Information Extraction in Layout-diversified Documents | Mengjun Cheng, Chengquan Zhang, Chang Liu*, Yuke Li, Bohan Li, Kun Yao, Xiawu Zheng, Rongrong Ji, Jie Chen | N/A | |
| Teddy: Efficient Large-Scale Dataset Distillation via Taylor-Approximated Matching | Ruonan Yu, Songhua Liu, Jingwen Ye, Xinchao Wang* | N/A | |
| Rethinking and Improving Visual Prompt Selection for In-Context Learning Segmentation Framework | Wei Suo, Lanqing Lai, Mengyang Sun, Hanwang Zhang, Peng Wang*, Yanning Zhang | N/A | |
| D4-VTON: Dynamic Semantics Disentangling for Differential Diffusion based Virtual Try-On | Zhaotong Yang, Zicheng Jiang, Xinzhe Li, Huiyu Zhou, Junyu Dong, Huaidong Zhang, Yong Du* | N/A | |
| TC4D: Trajectory-Conditioned Text-to-4D Generation | Sherwin Bahmani*, Xian Liu, Wang Yifan, Ivan Skorokhodov, Victor Rong, Ziwei Liu, Xihui Liu, Jeong Joon Park, Sergey Tulyakov, Gordon Wetzstein, Andrea Tagliasacchi, David B Lindell | N/A | |
| Blind Image Deconvolution by Generative-based Kernel Prior and Initializer via Latent Encoding | Jiangtao Zhang, Zongsheng Yue, Hui Wang, Qian Zhao, Deyu Meng | N/A | |
| AdvDiff: Generating Unrestricted Adversarial Examples using Diffusion Models | Xuelong Dai*, Kaisheng Liang, Bin Xiao | N/A | |
| Improving Text-guided Object Inpainting with Semantic Pre-inpainting | Yifu Chen, Jingwen Chen, Yingwei Pan*, Yehao Li, Ting Yao, Zhineng Chen, Tao Mei | N/A | |
| Personalized Federated Domain-Incremental Learning based on Adaptive Knowledge Matching | Yichen Li, Wenchao Xu, Haozhao Wang, Yining Qi, Jingcai Guo, Ruixuan Li* | N/A | |
| ST-LDM: A Universal Framework for Text-Grounded Object Generation in Real Images | Xiangtian Xue, Jiasong Wu*, Youyong Kong, Lotfi Senhadji, Huazhong Shu | N/A | |
| RS-NeRF: Neural Radiance Fields from Rolling Shutter Images | Muyao Niu, Tong Chen, Yifan Zhan, Zhuoxiao Li, Xiang Ji, Yinqiang Zheng* | N/A | |
| Region-Adaptive Transform with Segmentation Prior for Image Compression | Yuxi Liu*, Wenhan Yang, Huihui Bai, Yunchao Wei, Yao Zhao | N/A | |
| Enhancing Tracking Robustness with Auxiliary Adversarial Defense Networks | Zhewei Wu, Ruilong Yu, Qihe Liu*, Shuying Cheng, Shilin Qiu, Shijie Zhou | N/A | |
| SLIM: Spuriousness Mitigation with Minimal Human Annotations | Xiwei Xuan*, Ziquan Deng, Hsuan-Tien Lin, Kwan-Liu Ma | N/A | |
| Uncertainty Calibration with Energy Based Instance-wise Scaling in the Wild Dataset | Mijoo Kim, Junseok Kwon* | N/A | |
| X-Pose: Detecting Any Keypoints | Jie Yang, Ailing Zeng, Ruimao Zhang, Lei Zhang | N/A | |
| M^2Depth: Self-supervised Two-Frame Multi-camera Metric Depth Estimation | Yingshuang Zou, Yikang Ding, Xi Qiu, Haoqian Wang, Haotian Zhang* | N/A | |
| UniMD: Towards Unifying Moment Retrieval and Temporal Action Detection | Yingsen Zeng, Yujie Zhong*, Chengjian Feng, Lin Ma | N/A | |
| DyFADet: Dynamic Feature Aggregation for Temporal Action Detection | Le Yang*, Ziwei Zheng, Yizeng Han, Hao Cheng, Shiji Song, Gao Huang, Fan Li | N/A | |
| LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models | Yanwei Li*, Chengyao Wang, Jiaya Jia | N/A | |
| MetaCap: Meta-learning Priors from Multi-View Imagery for Sparse-view Human Performance Capture and Rendering | Guoxing Sun*, Rishabh Dabral, Pascal Fua, Christian Theobalt, Marc Habermann | N/A | |
| DiffPMAE: Diffusion Masked Autoencoders for Point Cloud Reconstruction | Yanlong Li*, Chamara Madarasingha, Kanchana Thilakarathna | N/A | |
| Multi-branch Collaborative Learning Network for 3D Visual Grounding | Zhipeng Qian, Yiwei Ma, Zhekai Lin, Jiayi Ji, Xiawu Zheng, Xiaoshuai Sun*, Rongrong Ji | N/A | |
| DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors | Jinbo Xing*, Menghan Xia, Yong Zhang, Haoxin Chen, Wangbo Yu, Hanyuan Liu, Gongye Liu, Xintao Wang, Ying Shan, Tien-Tsin Wong | N/A | |
| Motion Aware Event Representation-driven Image Deblurring | Zhijing Sun, Xueyang Fu, Longzhuo Huang, Aiping Liu, Zheng-Jun Zha* | N/A | |
| Turbo: Informativity-Driven Acceleration Plug-In for Vision-Language Large Models | Chen Ju, Haicheng Wang, Haozhe Cheng, Xu Chen, Zhonghua Zhai, Weilin Huang, Jinsong Lan, Shuai Xiao, Bo Zheng | N/A | |
| WildRefer: 3D Object Localization in Large-scale Dynamic Scenes with Multi-modal Visual Data and Natural Language | Zhenxiang Lin, Xidong Peng, Peishan Cong, Ge Zheng, Yujing Sun, Yuenan Hou, Xinge Zhu, Sibei Yang, Yuexin Ma* | N/A | |
| RCS-Prompt: Learning Prompt to Rearrange Class Space for Prompt-based Continual Learning | Longrong Yang, Hanbin Zhao, Yunlong Yu, Xiaodong Zeng, Xi Li | N/A | |
| Text-Anchored Score Composition: Tackling Condition Misalignment in Text-to-Image Diffusion Models | Luozhou Wang, Guibao Shen, Wenhang Ge, Guangyong Chen, Yijun Li, Yingcong Chen | N/A | |
| Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection | Shilong Liu, Zhaoyang Zeng, Tianhe Ren, Feng Li, Hao Zhang, Jie Yang, Qing Jiang, Chunyuan Li, Jianwei Yang, Hang Su, Jun Zhu, Lei Zhang | N/A | |
| Make Your ViT-based Multi-view 3D Detectors Faster via Token Compression | Dingyuan Zhang, Dingkang Liang, Zichang Tan, Xiaoqing Ye, Cheng Zhang, Jingdong Wang, Xiang Bai | N/A | |
| OV-Uni3DETR: Towards Unified Open-Vocabulary 3D Object Detection via Cycle-Modality Propagation | Zhenyu Wang*, Ya-Li Li, Taichi Liu, Hengshuang Zhao, Shengjin Wang | N/A | |
| CatchBackdoor: Backdoor Detection via Critical Trojan Neural Path Fuzzing | Haibo Jin, Ruoxi Chen, Jinyin Chen, Haibin Zheng, Yang Zhang, Haohan Wang* | N/A | |
| UCIP: A Universal Framework for Compressed Image Super-Resolution using Dynamic Prompt | Xin Li*, Bingchen Li, Yeying Jin, Cuiling Lan, Hanxin Zhu, Yulin Ren, Zhibo Chen | N/A | |
| LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents | Shilong Liu, Hao Cheng, Haotian Liu, Hao Zhang, Feng Li, Tianhe Ren, Xueyan Zou, Jianwei Yang, Hang Su, Jun Zhu, Lei Zhang, Jianfeng Gao, Chunyuan Li | N/A | |
| ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference | Mengcheng Lan, Chaofeng Chen, Yiping Ke, Xinjiang Wang, Litong Feng*, Wayne Zhang | N/A | |
| Two-Stage Active Learning for Efficient Temporal Action Segmentation | Yuhao Su, Ehsan Elhamifar* | N/A | |
| TexDreamer: Towards Zero-Shot High-Fidelity 3D Human Texture Generation | Yufei Liu, Junwei Zhu, Junshu Tang, Shijie Zhang, Jiangning Zhang, Weijian Cao, Chengjie Wang, Yunsheng Wu, Dongjin Huang* | N/A | |
| MVPGS: Excavating Multi-view Priors for Gaussian Splatting from Sparse Input Views | Wangze Xu, Huachen Gao, Shihe Shen, Rui Peng, Jianbo Jiao, Ronggang Wang* | N/A | |
| Domain-Adaptive 2D Human Pose Estimation via Dual Teachers in Extremely Low-Light Conditions | Yihao Ai*, Yifei Qi, Bo Wang, Yu Cheng, Xinchao Wang, Robby T. Tan | N/A | |
| Towards More Practical Group Activity Detection: A New Benchmark and Model | Dongkeun Kim, Youngkil Song, Minsu Cho, Suha Kwak* | N/A | |
| Depicting Beyond Scores: Advancing Image Quality Assessment through Multi-modal Language Models | Zhiyuan You, Zheyuan Li, Jinjin Gu, Zhenfei Yin, Tianfan Xue, Chao Dong | N/A | |
| Zero-Shot Image Feature Consensus with Deep Functional Maps | Xinle Cheng, Congyue Deng, Adam Harley, Yixin Zhu, Leonidas Guibas* | N/A | |
| WindPoly: Polygonal Mesh Reconstruction via Winding Numbers | Xin He, Chenlei Lv, Pengdi Huang, Hui Huang* | N/A | |
| MinD-3D: Reconstruct High-quality 3D objects in Human Brain | Jianxiong Gao, Yuqian Fu, Yun Wang, Xuelin Qian, Jianfeng Feng, Yanwei Fu* | N/A | |
| Tokenize Anything via Prompting | Ting Pan, Lulu Tang, Xinlong Wang, Shiguang Shan | N/A | |
| Geospecific View Generation - Geometry-Context Aware High-resolution Ground View Inference from Satellite Views | Ningli Xu, Rongjun Qin* | N/A | |
| Scissorhands: Scrub Data Influence via Connection Sensitivity in Networks | Jing Wu*, Mehrtash Harandi | N/A | |
| City-on-Web: Real-time Neural Rendering of Large-scale Scenes on the Web | Kaiwen Song, Xiaoyi Zeng, Chenqu Ren, Juyong Zhang* | N/A | |
| GRAPE: Generalizable and Robust Multi-view Facial Capture | Jing Li, Di Kang, Zhenyu He* | N/A | |
| Training-Free Model Merging for Multi-target Domain Adaptation | Wenyi Li, Huan-ang Gao, Mingju Gao, Beiwen Tian, Rong Zhi, Hao Zhao* | N/A | |
| Multi-RoI Human Mesh Recovery with Camera Consistency and Contrastive Losses | Yongwei Nie, Changzhen Liu, Chengjiang Long, Qing Zhang, Guiqing Li, Hongmin Cai* | N/A | |
| Co-Student: Collaborating Strong and Weak Students for Sparsely Annotated Object Detection | Lianjun Wu, Jiangxiao Han, Zengqiang Zheng, Xinggang Wang* | N/A | |
| Open-Vocabulary Camouflaged Object Segmentation | Youwei Pang, Xiaoqi Zhao, JiaMing Zuo, Lihe Zhang*, Huchuan Lu | N/A | |
| SmartControl: Enhancing ControlNet for Handling Rough Visual Conditions | Xiaoyu Liu, Yuxiang Wei, Ming Liu*, Xianhui Lin, Peiran Ren, Xuansong Xie, Wangmeng Zuo | N/A | |
| InterFusion: Text-Driven Generation of 3D Human-Object Interaction | Sisi Dai, Wenhao Li, Haowen Sun, Haibin Huang, Chongyang Ma, Hui Huang, Kai Xu, Ruizhen Hu | N/A | |
| GLARE: Low Light Image Enhancement via Generative Latent Feature based Codebook Retrieval | Han Zhou, Wei Dong, Xiaohong Liu, Shuaicheng Liu, Xiongkuo Min, Guangtao Zhai, Jun Chen | N/A | |
| DriveDreamer: Towards Real-world-driven World Models for Autonomous Driving | Xiaofeng Wang*, Zheng Zhu, Guan Huang, Chen Xinze, Jiagang Zhu, Jiwen Lu | N/A | |
| Flow-Assisted Motion Learning Network for Weakly-Supervised Group Activity Recognition | Muhammad Adi Nugroho*, Sangmin Woo, Sumin Lee, Jinyoung Park, Yooseung Wang, Donguk Kim, Changick Kim | N/A | |
| NeRF-XL: NeRF at Any Scale with Multi-GPU | Ruilong Li*, Sanja Fidler, Angjoo Kanazawa, Francis Williams | N/A | |
| CoSIGN: Few-Step Guidance of ConSIstency Model to Solve General INverse Problems | Jiankun Zhao, Bowen Song, Liyue Shen* | N/A | |
| The First to Know: How Token Distributions Reveal Hidden Knowledge in Large Vision-Language Models? | Qinyu Zhao*, Ming Xu, Kartik Gupta, Akshay Asthana, Liang Zheng, Stephen Gould | N/A | |
| Compositional Substitutivity of Visual Reasoning for Visual Question Answering | Chuanhao Li, Zhen Li, Chenchen Jing, Yuwei Wu, Mingliang Zhai, Yunde Jia | N/A | |
| LightenDiffusion: Unsupervised Low-Light Image Enhancement with Latent-Retinex Diffusion Models | Hai Jiang, Ao Luo, Xiaohong Liu, Songchen Han, Shuaicheng Liu* | N/A | |
| DNI: Dilutional Noise Initialization for Diffusion Video Editing | Sunjae Yoon, Gwanhyeong Koo, Ji Woo Hong, Chang D. Yoo* | N/A | |
| Two-Stage Video Shadow Detection via Temporal-Spatial Adaption | Xin Duan, Yu Cao, Lei Zhu, Gang Fu, Xin Wang, Renjie Zhang, Ping Li* | N/A | |
| Towards Physical World Backdoor Attacks against Skeleton Action Recognition | Qichen Zheng, Yi Yu, Siyuan Yang*, Jun Liu, Kwok-Yan Lam, Alex Kot | N/A | |
| SAM-guided Graph Cut for 3D Instance Segmentation | Haoyu Guo, He Zhu, Sida Peng, Yuang Wang, Yujun Shen, Ruizhen Hu, Xiaowei Zhou* | N/A | |
| Fully Authentic Visual Question Answering Dataset from Online Communities | Chongyan Chen*, Mengchen Liu, Noel C Codella, Yunsheng Li, Lu Yuan, Danna Gurari | N/A | |
| Active Generation for Image Classification | Tao Huang, Jiaqi Liu, Shan You*, Chang Xu | N/A | |
| FuseTeacher: Modality-fused Encoders are Strong Vision Supervisors | Chen-Wei Xie*, Siyang Sun, Liming Zhao, Pandeng Li, Shuailei Ma, Yun Zheng | N/A | |
| Learning Local Pattern Modularization for Point Cloud Reconstruction from Unseen Classes | Chao Chen, Yu-Shen Liu*, Zhizhong Han | N/A | |
| Understanding Multi-compositional learning in Vision and Language models via Category Theory | Sotirios Panagiotis Chytas*, Hyunwoo J Kim, Vikas Singh | N/A | |
| FedRA: A Random Allocation Strategy for Federated Tuning to Unleash the Power of Heterogeneous Clients | Shangchao Su, Bin Li*, Xiangyang Xue | N/A | |
| Panel-Specific Degradation Representation for Raw Under-Display Camera Image Restoration | Youngjin Oh*, Keuntek Lee, Jooyoung Lee, Dae-Hyun Lee, Nam Ik Cho | N/A | |
| Unlocking Textual and Visual Wisdom: Open-Vocabulary 3D Object Detection Enhanced by Comprehensive Guidance from Text and Image | Pengkun Jiao, Na Zhao, Jingjing Chen, Yu-Gang Jiang | N/A | |
| Diffusion-Guided Weakly Supervised Semantic Segmentation | Sung-Hoon Yoon, Hoyong Kwon, Jaeseok Jeong, Daehee Park, Kuk-Jin Yoon* | N/A | |
| Weakly-Supervised Spatio-Temporal Video Grounding with Variational Cross-Modal Alignment | Yang Jin, Yadong Mu | N/A | |
| When Pedestrian Detection Meets Multi-Modal Learning: Generalist Model and Benchmark Dataset | Yi Zhang, Wang Zeng, Sheng Jin, Chen Qian*, Ping Luo, Wentao Liu | N/A | |
| NVS-Adapter: Plug-and-Play Novel View Synthesis from a Single Image | Yoonwoo Jeong, Jinwoo Lee, Chiheon Kim, Minsu Cho, Doyup Lee | N/A | |
| Segment and Recognize Anything at Any Granularity | Feng Li, Hao Zhang, Peize Sun, Xueyan Zou, Shilong Liu, Chunyuan Li, Jianwei Yang, Lei Zhang, Jianfeng Gao* | N/A | |
| Real-time Holistic Robot Pose Estimation with Unknown States | Shikun Ban, Juling Fan, Xiaoxuan Ma, Wentao Zhu, Yu Qiao, Yizhou Wang | N/A | |
| CLOSER: Towards Better Representation Learning for Few-Shot Class-Incremental Learning | Junghun Oh, Sungyong Baik, Kyoung Mu Lee* | N/A | |
| A Simple Baseline for Spoken Language to Sign Language Translation with 3D Avatars | Ronglai Zuo, Fangyun Wei*, Zenggui Chen, Brian Mak, Jiaolong Yang, Xin Tong | N/A | |
| An accurate detection is not all you need to combat label noise in web-noisy datasets | Paul Albert*, Kevin McGuinness, Eric Arazo, Tarun Krishna, Noel O Connor, Jack Valmadre | N/A | |
| Online Vectorized HD Map Construction using Geometry | Zhixin Zhang, Yiyuan Zhang, Xiaohan Ding, Fusheng Jin*, Xiangyu Yue | N/A | |
| Image-adaptive 3D Lookup Tables for Real-time Image Enhancement with Bilateral Grids | Wontae Kim, Nam Ik Cho | N/A | |
| Learned HDR Image Compression for Perceptually Optimal Storage and Display | Peibei Cao, Haoyu Chen, Jingzhe Ma, Yu-Chieh Yuan, Zhiyong Xie, Xin Xie, Haiqing Bai, Kede Ma* | N/A | |
| Sparse Beats Dense: Rethinking Supervision in Radar-Camera Depth Completion | Huadong Li, Minhao Jing, Jin Wang, Shichao Dong, Jiajun Liang, Haoqiang Fan, Renhe Ji* | N/A | |
| Non-Exemplar Domain Incremental Learning via Cross-Domain Concept Integration | Qiang Wang*, Yuhang He, Songlin Dong, Xinyuan Gao, Shaokun Wang, Yihong Gong | N/A | |
| Free-VSC: Free Semantics from Visual Foundation Models for Unsupervised Video Semantic Compression | Yuan Tian, Guo Lu, Guangtao Zhai* | N/A | |
| Improving Virtual Try-On with Garment-focused Diffusion Models | Siqi Wan, Yehao Li, Jingwen Chen, Yingwei Pan*, Ting Yao, Yang Cao, Tao Mei | N/A | |
| Ray Denoising: Depth-aware Hard Negative Sampling for Multi-view 3D Object Detection | Feng Liu, Tengteng Huang, Qianjing Zhang, Haotian Yao, Chi Zhang, Fang Wan, Qixiang Ye, Yanzhao Zhou | N/A | |
| Disentangled Generation and Aggregation for Robust Radiance Fields | Shihe Shen, Huachen Gao, Wangze Xu, Rui Peng, Luyang Tang, Kaiqiang Xiong, Jianbo Jiao, Ronggang Wang* | N/A | |
| UNIKD: UNcertainty-Filtered Incremental Knowledge Distillation for Neural Implicit Representation | Mengqi Guo*, Chen Li, Hanlin Chen, Gim Hee Lee | N/A | |
| Subspace Prototype Guidance for Mitigating Class Imbalance in Point Cloud Semantic Segmentation | Jiawei Han, Kaiqi Liu*, Wei Li, Guangzhi Chen | N/A | |
| MoAI: Mixture of All Intelligence for Large Language and Vision Models | Byung-Kwan Lee, Beomchan Park, Chae Won Kim, Yong Man Ro* | N/A | |
| Semantic-guided Robustness Tuning for Few-Shot Transfer Across Extreme Domain Shift | Kangyu Xiao*, Zilei Wang, Junjie Li | N/A | |
| Revisit Event Generation Model: Self-Supervised Learning of Event-to-Video Reconstruction with Implicit Neural Representations | Zipeng Wang, Yunfan Lu, Lin Wang | N/A | |
| SDPT: Synchronous Dual Prompt Tuning for Fusion-based Visual-Language Pre-trained Models | Yang Zhou, Yongjian Wu, Jiya Saiyin, Bingzheng Wei, Maode Lai, Eric I Chang, Yan Xu | N/A | |
| Open-World Dynamic Prompt and Continual Visual Representation Learning | Youngeun Kim, Jun Fang*, Qin Zhang, Zhaowei Cai, Yantao Shen, Rahul Duggal, Dripta S. Raychaudhuri, Zhuowen Tu, Yifan Xing, Onkar Dabeer | N/A | |
| Learning Video Context as Interleaved Multimodal Sequences | Kevin Qinghong Lin, Pengchuan Zhang, Difei Gao, Xide Xia, Joya Chen, Ziteng Gao, Jinheng Xie, Xuhong Xiao, Mike Zheng Shou* | N/A | |
| Learning Unsigned Distance Functions from Multi-view Images with Volume Rendering Priors | Wenyuan Zhang, Kanle Shi, Yu-Shen Liu*, Zhizhong Han | N/A | |
| Dense Multimodal Alignment for Open-Vocabulary 3D Scene Understanding | Ruihuang Li*, Zhengqiang Zhang, Chenhang He, Zhiyuan Ma, Vishal Patel, Lei Zhang | N/A | |
| Deep Feature Surgery: Towards Accurate and Efficient Multi-Exit Networks | Cheng Gong, Yao Chen, Qiuyang Luo, Ye Lu, Tao Li, Yuzhi Zhang, Yufei Sun, Le Zhang | N/A | |
| Multi-scale Cross Distillation for Object Detection in Aerial Images | Kun Wang, Zi Wang, Zhang Li*, Xichao Teng, Yang Li | N/A | |
| Progressive Proxy Anchor Propagation for Unsupervised Semantic Segmentation | Hyun Seok Seong, WonJun Moon, SuBeen Lee, Jae-Pil Heo* | N/A | |
| Within the Dynamic Context: Inertia-aware 3D Human Modeling with Pose Sequence | Yutong Chen, Yifan Zhan, Zhihang Zhong, Wei Wang, Xiao Sun, Yu Qiao, Yinqiang Zheng | N/A | |
| Revisit Human-Scene Interaction via Space Occupancy | Xinpeng Liu, Haowen Hou, Yanchao Yang, Yong-Lu Li*, Cewu Lu | N/A | |
| Face Adapter for Pre-Trained Diffusion Models with Fine-Grained ID and Attribute Control | Yue Han*, Junwei Zhu, Keke He, Xu Chen, Yanhao Ge, Wei Li, Xiangtai Li, Jiangning Zhang, Chengjie Wang, Yong Liu | N/A | |
| WeConvene: Learned Image Compression with Wavelet-Domain Convolution and Entropy Model | Haisheng Fu*, Jie Liang, Zhenman Fang, Jingning Han, Feng Liang, Guohe Zhang | N/A | |
| Grid-Attention: Enhancing Computational Efficiency of Large Vision Models without Fine-Tuning | Pengyu Li*, Biao Wang, Tianchu Guo, Xian-Sheng Hua | N/A | |
| Mitigating Background Shift in Class-Incremental Semantic Segmentation | Gilhan Park, WonJun Moon, SuBeen Lee, Tae-Young Kim, Jae-Pil Heo* | N/A | |
| Relation DETR: Exploring Explicit Position Relation Prior for Object Detection | Xiuquan Hou, Meiqin Liu*, Senlin Zhang, Ping Wei, Badong Chen, Xuguang Lan | N/A | |
| BKDSNN: Enhancing the Performance of Learning-based Spiking Neural Networks Training with Blurred Knowledge Distillation | Zekai Xu, Kang You, Qinghai Guo, Xiang Wang, Zhezhi He* | N/A | |
| Agent Attention: On the Integration of Softmax and Linear Attention | Dongchen Han, Tianzhu Ye, Yizeng Han, Zhuofan Xia, Siyuan Pan, Pengfei Wan, Shiji Song, Gao Huang* | N/A | |
| Learning by Aligning 2D Skeleton Sequences and Multi-Modality Fusion | Quoc-Huy Tran*, Muhammad Ahmed, Murad Popattia, Muhammad Hassan Ahmed, Andrey Konin, Zeeshan Zia | N/A | |
| Resolving Scale Ambiguity in Multi-view 3D Reconstruction using Dual-Pixel Sensors | Kohei Ashida*, Hiroaki Santo, Fumio Okura, Yasuyuki Matsushita | N/A | |
| Object-Oriented Anchoring and Modal Alignment in Multimodal Learning | Shibin Mei, Bingbing Ni*, Hang Wang, Chenglong Zhao, Fengfa Hu, Zhiming Pi, BiLian Ke | N/A | |
| Towards Stable 3D Object Detection | Jiabao Wang, Qiang Meng, Guochao Liu, Liujiang Yan, Ke Wang, Ming-Ming Cheng, Qibin Hou* | N/A | |
| FYI: Flip Your Images for Dataset Distillation | Byunggwan Son, Youngmin Oh, Donghyeon Baek, Bumsub Ham | N/A | |
| On-the-fly Category Discovery for LiDAR Semantic Segmentation | Hyeonseong Kim, Sung-Hoon Yoon, Minseok Kim, Kuk-Jin Yoon* | N/A | |
| Dual-Camera Smooth Zoom on Mobile Phones | Renlong Wu, Zhilu Zhang*, Yu Yang, Wangmeng Zuo | N/A | |
| ProtoComp: Diverse Point Cloud Completion with Controllable Prototype | Xumin Yu, Yanbo Wang, Jie Zhou, Jiwen Lu* | N/A | |
| CONDA: Condensed Deep Association Learning for Co-Salient Object Detection | Long Li, Nian Liu, Dingwen Zhang, Zhongyu Li, Salman Khan, Rao Anwer, Hisham Cholakkal, Junwei Han, Fahad Shahbaz Khan | N/A | |
| Cascade Prompt Learning for Visual-Language Model Adaptation | Ge Wu, Xin Zhang, Zheng Li, Zhaowei Chen, Jiajun Liang, Jian Yang, Xiang Li* | N/A | |
| PolyRoom: Room-aware Transformer for Floorplan Reconstruction | Yuzhou Liu, Lingjie Zhu, Xiaodong Ma, Hanqiao Ye, Xiang Gao, Xianwei Zheng, Shuhan Shen* | N/A | |
| BenchLMM: Benchmarking Cross-style Visual Capability of Large Multimodal Models | Rizhao Cai, Zirui Song, Dayan Guan, Zhenhao Chen, Yaohang Li, Xing Luo, Chenyu Yi, Alex Kot | N/A | |
| SMFANet: A Lightweight Self-Modulation Feature Aggregation Network for Efficient Image Super-Resolution | Mingjun Zheng, Long Sun, Jiangxin Dong, Jinshan Pan* | N/A | |
| HENet: Hybrid Encoding for End-to-end Multi-task 3D Perception from Multi-view Cameras | Zhongyu Xia, ZhiWei Lin, Xinhao Wang, Yongtao Wang*, Yun Xing, Shengxiang Qi, Nan Dong, Ming-Hsuan Yang | N/A | |
| Hierarchical Unsupervised Relation Distillation for Source Free Domain Adaptation | Bowei Xing*, Xianghua Ying, Ruibin Wang, Ruohao Guo, Ji Shi, Wenzhen Yue | N/A | |
| Customized Generation Reimagined: Fidelity and Editability Harmonized | Jian Jin, Yang Shen, Zhenyong Fu, Jian Yang | N/A | |
| AUFormer: Vision Transformers are Parameter-Efficient Facial Action Unit Detectors | Kaishen Yuan, Zitong Yu, Xin Liu, Weicheng Xie, Huanjing Yue, Jingyu Yang | N/A | |
| Improving Video Segmentation via Dynamic Anchor Queries | Yikang Zhou, Tao Zhang, Xiangtai Li, Shunping Ji*, Shuicheng Yan | N/A | |
| Controllable Contextualized Image Captioning: Directing the Visual Narrative through User-Defined Highlights | Shunqi Mao*, Chaoyi Zhang, Hang Su, Hwanjun Song, Igor Shalyminov, Weidong Cai | N/A | |
| Diffusion Models as Optimizers for Efficient Planning in Offline RL | Renming Huang, Yunqiang Pei, Guoqing Wang*, Yangming Zhang, Yang Yang, Peng Wang, Heng Tao Shen | N/A | |
| Enhanced Sparsification via Stimulative Training | Shengji Tang, Weihao Lin, Hancheng Ye, Peng Ye, Chong Yu, Baopu Li, Tao Chen* | N/A | |
| How Many Unicorns Are in This Image? A Safety Evaluation Benchmark for Vision LLMs | Haoqin Tu, Chenhang Cui, Zijun Wang, Yiyang Zhou, Bingchen Zhao, Junlin Han, Wangchunshu Zhou, Huaxiu Yao, Cihang Xie | N/A | |
| NeuroPictor: Refining fMRI-to-Image Reconstruction via Multi-individual Pretraining and Multi-level Modulation | Jingyang Huo, Yikai Wang, Yanwei Fu*, Xuelin Qian, Chong Li, Yun Wang, Jianfeng Feng | N/A | |
| Coarse-to-Fine Implicit Representation Learning for 3D Hand-Object Reconstruction from a Single RGB-D Image | Xingyu Liu, Pengfei Ren, Jingyu Wang, Qi Qi, Haifeng Sun, Zirui Zhuang, Jianxin Liao | N/A | |
| Efficient Snapshot Spectral Imaging: Calibration-Free Parallel Structure with Aperture Diffraction Fusion | Tao Lv*, Lihao Hu, Shiqiao Li, Chenglong Huang, Xun Cao | N/A | |
| Enhancing Recipe Retrieval with Foundation Models: A Data Augmentation Perspective | Fangzhou Song, Bin Zhu, Yanbin Hao*, Shuo Wang | N/A | |
| PapMOT: Exploring Adversarial Patch Attack against Multiple Object Tracking | Jiahuan Long, Tingsong Jiang, Wen Yao, Shuai Jia, Weijia Zhang, Weien Zhou, Chao Ma, Xiaoqian Chen | N/A | |
| HiDiffusion: Unlocking Higher-Resolution Creativity and Efficiency in Pretrained Diffusion Models | Shen Zhang, Zhaowei Chen, Zhenyu Zhao, Yuhao Chen, Yao Tang, Jiajun Liang* | N/A | |
| On the Approximation Risk of Few-Shot Class-Incremental Learning | Xuan Wang, Zhong Ji*, Xiyao Liu, Yanwei Pang, Jungong Han | N/A | |
| Syn-to-Real Domain Adaptation for Point Cloud Completion via Part-based Approach | Yunseo Yang, Jihun Kim, Kuk-Jin Yoon* | N/A | |
| Learn to Preserve and Diversify: Parameter-Efficient Group with Orthogonal Regularization for Domain Generalization | Jiajun Hu, Jian Zhang, Lei Qi, Yinghuan Shi, Yang Gao | N/A | |
| SCOMatch: Alleviating Overtrusting in Open-set Semi-supervised Learning | Zerun Wang*, Liuyu Xiang, Lang Huang, Jiafeng Mao, Ling Xiao, Toshihiko Yamasaki | N/A | |
| Region-aware Distribution Contrast: A Novel Approach to Multi-Task Partially Supervised Learning | Meixuan Li, Tianyu Li, Guoqing Wang*, Peng Wang, Yang Yang, Jie Zou | N/A | |
| MasterWeaver: Taming Editability and Face Identity for Personalized Text-to-Image Generation | Yuxiang Wei, Zhilong Ji, Jinfeng Bai, Hongzhi Zhang, Lei Zhang, Wangmeng Zuo | N/A | |
| PointRegGPT: Boosting 3D Point Cloud Registration using Generative Point-Cloud Pairs for Training | Suyi Chen, Hao Xu, Haipeng Li, Kunming Luo, Guanghui Liu, Chi-Wing Fu, Ping Tan, Shuaicheng Liu* | N/A | |
| General Geometry-aware Weakly Supervised 3D Object Detection | Guowen Zhang*, Junsong Fan, Liyi Chen, Zhaoxiang Zhang, Zhen Lei, Lei Zhang | N/A | |
| Long-CLIP: Unlocking the Long-Text Capability of CLIP | Beichen Zhang, Pan Zhang, Xiaoyi Dong, Yuhang Zang, Jiaqi Wang* | N/A | |
| Dolfin: Diffusion Layout Transformers without Autoencoder | Yilin Wang, Zeyuan Chen, Liangjun Zhong, Zheng Ding, Zhuowen Tu* | N/A | |
| Real-time 3D-aware Portrait Editing from a Single Image | Qingyan Bai, Zifan Shi, Yinghao Xu, Hao Ouyang, Qiuyu Wang, Ceyuan Yang, Xuan Wang, Gordon Wetzstein, Yujun Shen, Qifeng Chen* | N/A | |
| StructLDM: Structured Latent Diffusion for 3D Human Generation | Tao Hu, Fangzhou Hong, Ziwei Liu* | N/A | |
| Image Compression for Machine and Human Vision With Spatial-Frequency Adaptation | Han Li, Shaohui Li, Shuangrui Ding, Wenrui Dai*, Maida Cao, Chenglin Li, Junni Zou, Hongkai Xiong | N/A | |
| Beyond the Contact: Discovering Comprehensive Affordance for 3D Objects from Pre-trained 2D Diffusion Models | Hyeonwoo Kim, Sookwan Han, Patrick Kwon, Hanbyul Joo* | N/A | |
| Norma: A Noise Robust Memory-Augmented Framework for Whole Slide Image Classification | Yu Bai, Bo Zhang*, Zheng Zhang, Shuo Yan, Zibo Ma, Wu Liu, Xiuzhuang Zhou, Xiangyang Gong, Wendong Wang | N/A | |
| Continuous Memory Representation for Anomaly Detection | Joo Chan Lee, Taejune Kim, Eunbyung Park, Simon S Woo, Jong Hwan Ko | N/A | |
| InstaStyle: Inversion Noise of a Stylized Image is Secretly a Style Adviser | Xing Cui, Zekun Li, Peipei Li*, Huaibo Huang, Xuannan Liu, Zhaofeng He | N/A | |
| PACE: Pose Annotations in Cluttered Environments | Yang You, Kai Xiong, Zhening Yang, Zhengxiang Huang, Junwei Zhou, Ruoxi Shi, Zhou Fang, Adam Harley, Leonidas Guibas, Cewu Lu | N/A | |
| CMTA: Cross-Modal Temporal Alignment for Event-guided Video Deblurring | Taewoo Kim, Hoonhee Cho, Kuk-Jin Yoon* | N/A | |
| CountFormer: Multi-View Crowd Counting Transformer | Hong Mo, Xiong Zhang, Jianchao Tan, Cheng Yang, Qiong Gu, Bo Hang, Wenqi Ren | N/A | |
| Textual Knowledge Matters: Cross-Modality Co-Teaching for Generalized Visual Class Discovery | Haiyang Zheng, Nan Pu, Wenjing Li, Nicu Sebe, Zhun Zhong | N/A | |
| Continuous SO(3) Equivariant Convolution for 3D Point Cloud Analysis | Jaein Kim, Hee Bin Yoo, Dong-Sig Han, Yeon-Ji Song, Byoung-Tak Zhang* | N/A | |
| EA-VTR: Event-Aware Video-Text Retrieval | Zongyang Ma*, Ziqi Zhang, Yuxin Chen, Zhongang Qi, Chunfeng Yuan, Bing Li, Yingmin Luo, Xu Li, Xiaojuan Qi, Ying Shan, Weiming Hu | N/A | |
| Privacy-Preserving Adaptive Re-Identification without Image Transfer | Hamza Rami*, Jhony H. Giraldo, Nicolas Winckler, Stéphane Lathuilière | N/A | |
| A Simple Low-bit Quantization Framework for Video Snapshot Compressive Imaging | Miao Cao*, Lishun Wang, Huan Wang, Xin Yuan | N/A | |
| DIFFender: Diffusion-Based Adversarial Defense against Patch Attacks | Caixin Kang, Yinpeng Dong, Zhengyi Wang, Shouwei Ruan, Yubo Chen, Hang Su, Xingxing Wei* | N/A | |
| Hybrid Video Diffusion Models with 2D Triplane and 3D Wavelet Representation | Kihong Kim, Haneol Lee, Jihye Park, Seyeon Kim, Kwang Hee Lee, Seungryong Kim, Jaejun Yoo | N/A | |
| Background Adaptation with Residual Modeling for Exemplar-Free Class-Incremental Semantic Segmentation | Anqi Zhang, Guangyu Gao* | N/A | |
| Efficient Diffusion-Driven Corruption Editor for Test-Time Adaptation | Yeongtak Oh, Jonghyun Lee, Jooyoung Choi, Dahuin Jung, Uiwon Hwang, Sungroh Yoon | N/A | |
| Learning to Unlearn for Robust Machine Unlearning | Mark He Huang, Lin Geng Foo, Jun Liu | N/A | |
| Emergent Visual-Semantic Hierarchies in Image-Text Representations | Morris Alper*, Hadar Averbuch-Elor | N/A | |
| Context-Guided Spatial Feature Reconstruction for Efficient Semantic Segmentation | Zhenliang Ni, Xinghao Chen, Yingjie Zhai, Yehui Tang, Yunhe Wang | N/A | |
| DriveLM: Driving with Graph Visual Question Answering | Chonghao Sima*, Katrin Renz, Kashyap Chitta, Li Chen, Zhang Hanxue, Chengen Xie, Jens Beißwenger, Ping Luo, Andreas Geiger, Hongyang Li | N/A | |
| Neural Spectral Decomposition for Dataset Distillation | Shaolei Yang, Shen Cheng, Mingbo Hong, Haoqiang Fan, Xing Wei, Shuaicheng Liu* | N/A | |
| Beyond Viewpoint: Robust 3D Object Recognition under Arbitrary Views through Joint Multi-Part Representation | Linlong Fan, Ye Huang*, Yanqi Ge, Wen Li, Lixin Duan | N/A | |
| Learning Non-Linear Invariants for Unsupervised Out-of-Distribution Detection | Lars Doorenbos*, Raphael Sznitman, Pablo Márquez Neila | N/A | |
| Dynamic Retraining-Updating Mean Teacher for Source-Free Object Detection | Trinh Le Ba Khanh, Huy-Hung Nguyen, Long Hoang Pham, Duong Nguyen-Ngoc Tran, Jae Wook Jeon | N/A | |
| Knowledge-enhanced Visual-Language Pretraining for Computational Pathology | Xiao Zhou, Xiaoman Zhang, Chaoyi Wu, Ya Zhang, Weidi Xie, Yan-Feng Wang* | N/A | |
| Adaptive Multi-modal Fusion of Spatially Variant Kernel Refinement with Diffusion Model for Blind Image Super-Resolution | Junxiong Lin*, Yan Wang, Zeng Tao, Boyang Wang, Qing Zhao, Haoran Wang, Xuan Tong, Xinji Mai, Yuxuan Lin, Wei Song, Jiawen Yu, Shaoqi Yan, Wenqiang Zhang | N/A | |
| Disentangled Clothed Avatar Generation from Text Descriptions | Jionghao Wang, Yuan Liu, Zhiyang Dou, Zhengming Yu, Yongqing Liang, Cheng Lin, Rong Xie, Li Song, Xin Li, Wenping Wang* | N/A | |
| Real Appearance Modeling for More General Deepfake Detection | Jiahe Tian, Cai Yu, Xi Wang, Peng Chen, Zihao Xiao, Jiao Dai, Yesheng Chai*, Jizhong Han | N/A | |
| 6DGS: 6D Pose Estimation from a Single Image and a 3D Gaussian Splatting Model | Matteo Bortolon*, Theodore Tsesmelis, Stuart James, Fabio Poiesi, Alessio Del Bue | N/A | |
| Dual-Decoupling Learning and Metric-Adaptive Thresholding for Semi-Supervised Multi-Label Learning | Jia-Hao Xiao, Ming-Kun Xie, Heng-Bo Fan, Gang Niu, Masashi Sugiyama, Sheng-Jun Huang* | N/A | |
| V2X-Real: a Large-Scale Dataset for Vehicle-to-Everything Cooperative Perception | Hao Xiang, Xin Xia, Zhaoliang Zheng, Runsheng Xu, Letian Gao, Zewei Zhou, Xu Han, Xinkai Ji, Mingxi Li, Zonglin Meng, Li Jin, Mingyue Lei, Zhaoyang Ma, Zihang He, Haoxuan Ma, Yunshuang Yuan, Yingqian Zhao, Jiaqi Ma* | N/A | |
| VQ-HPS: Human Pose and Shape Estimation in a Vector-Quantized Latent Space | Guénolé Fiche*, Simon Leglaive, Xavier Alameda-Pineda, Antonio Agudo, Francesc Moreno | N/A | |
| Attention Beats Linear for Fast Implicit Neural Representation Generation | Shuyi Zhang, Ke Liu, Jingjun Gu, Xiaoxu Cai, Zhihua Wang, Jiajun Bu, Haishuai Wang* | N/A | |
| HARIVO: Harnessing Text-to-Image Models for Video Generation | Mingi Kwon, Seoung Wug Oh, Yang Zhou, Joon-Young Lee, Difan Liu, Haoran Cai, Baqiao Liu, Feng Liu, Youngjung Uh* | N/A | |
| Deep Online Probability Aggregation Clustering | Yuxuan Yan, Na Lu*, Ruofan Yan | N/A | |
| WRIM-Net: Wide-Ranging Information Mining Network for Visible-Infrared Person Re-Identification | Yonggan Wu, Ling-Chao Meng, Yuan Zichao, Sixian Chan, Hong-Qiang Wang | N/A | |
| Reliable and Efficient Concept Erasure of Text-to-Image Diffusion Models | Chao Gong, Kai Chen, Zhipeng Wei, Jingjing Chen, Yu-Gang Jiang | N/A | |
| Visual Text Generation in the Wild | Yuanzhi Zhu, Jiawei Liu, Feiyu Gao, Wenyu Liu, Xinggang Wang, Peng Wang, Fei Huang, Cong Yao, Zhibo Yang | N/A | |
| Length-Aware Motion Synthesis via Latent Diffusion | Alessio Sampieri*, Alessio Palma, Indro Spinelli, Fabio Galasso | N/A | |
| Attention-Challenging Multiple Instance Learning for Whole Slide Image Classification | Yunlong Zhang, Honglin Li, Yuxuan Sun, Chenglu Zhu, Sunyi Zheng, Lin Yang | N/A | |
| An Optimal Control View of LoRA and Binary Controller Design for Vision Transformers | Chi Zhang*, Jingpu Cheng, Qianxiao Li | N/A | |
| Exploring Phrase-Level Grounding with Text-to-Image Diffusion Model | Danni Yang, Ruohan Dong, Jiayi Ji, Yiwei Ma, Haowei Wang, Xiaoshuai Sun*, Rongrong Ji | N/A | |
| FocusDiffuser: Perceiving Local Disparities for Camouflaged Object Detection | Jianwei Zhao, Xin Li, Fan Yang, Qiang Zhai, Ao Luo, Zhicheng Jiao, Hong Cheng | N/A | |
| Improving image synthesis with diffusion-negative sampling | Alakh Desai*, Nuno Vasconcelos | N/A | |
| AvatarPose: Avatar-guided 3D Pose Estimation of Close Human Interaction from Sparse Multi-view Videos | Feichi Lu, Zijian Dong, Jie Song, Otmar Hilliges | N/A | |
| FedVAD: Enhancing Federated Video Anomaly Detection with GPT-Driven Semantic Distillation | Fan Qi, Ruijie Pan, Huaiwen Zhang, Changsheng Xu | N/A | |
| SignGen: End-to-End Sign Language Video Generation with Latent Diffusion | Fan Qi, Yu Duan, Changsheng Xu, Huaiwen Zhang | N/A | |
| Idling Neurons, Appropriately Lenient Workload During Fine-tuning Leads to Better Generalization | Hongjing Niu, Hanting Li, Bin Li, Feng Zhao | N/A | |
| Diffusion Prior-Based Amortized Variational Inference for Noisy Inverse Problems | Sojin Lee, Dogyun Park, Inho Kong, Hyunwoo J. Kim* | N/A | |
| The Gaussian Discriminant Variational Autoencoder (GdVAE): A Self-Explainable Model with Counterfactual Explanations | Anselm Haselhoff*, Kevin Trelenberg, Fabian Küppers, Jonas Schneider | N/A | |
| Accelerating Image Generation with Sub-path Linear Approximation Model | Chen Xu, Tianhui Song, Weixin Feng, Xubin Li, Tiezheng Ge, Bo Zheng, Limin Wang* | N/A | |
| Safe-CLIP: Removing NSFW Concepts from Vision-and-Language Models | Samuele Poppi, Tobia Poppi, Federico Cocchi, Marcella Cornia, Lorenzo Baraldi, Rita Cucchiara | N/A | |
| TetraDiffusion: Tetrahedral Diffusion Models for 3D Shape Generation | Nikolai Kalischek*, Torben Peters, Jan Dirk Wegner, Konrad Schindler | N/A | |
| Camera Calibration using a Collimator System | Shunkun Liang, Banglei Guan*, Zhenbao Yu, Pengju Sun, Yang Shang | N/A | |
| Label-free Neural Semantic Image Synthesis | Jiayi Wang*, Kevin A Laube, Yumeng Li, Jan Hendrik Metzen, Shin-I Cheng, Julio Borges, Anna Khoreva | N/A | |
| Exploring Reliable Matching with Phase Enhancement for Night-time Semantic Segmentation | Yuwen Pan*, Rui Sun, Naisong Luo, Tianzhu Zhang, Yongdong Zhang | N/A | |
| Multiscale Sliced Wasserstein Distances as Perceptual Color Difference Measures | Jiaqi He, Zhihua Wang, Leon Wang, Tsein-I Liu, Yuming Fang, Qilin Sun*, Kede Ma | N/A | |
| DiscoMatch: Fast Discrete Optimisation for Geometrically Consistent 3D Shape Matching | Paul Roetzer, Ahmed Abbas, Dongliang Cao, Florian Bernard, Paul Swoboda | N/A | |
| Switch Diffusion Transformer: Synergizing Denoising Tasks with Sparse Mixture-of-Experts | Byeongjun Park, Hyojun Go, Jin-Young Kim, Sangmin Woo, Seokil Ham, Changick Kim* | N/A | |
| FARSE-CNN: Fully Asynchronous, Recurrent and Sparse Event-Based CNN | Riccardo Santambrogio*, Marco Cannici, Matteo Matteucci | N/A | |
| ConDense: Consistent 2D-3D Pre-training for Dense and Sparse Features from Multi-View Images | Xiaoshuai Zhang*, Zhicheng Wang, Howard Zhou, Soham Ghosh, Danushen L Gnanapragasam, Varun Jampani, Hao Su, Leonidas Guibas | N/A | |
| MTA-CLIP: Language-Guided Semantic Segmentation with Mask-Text Alignment | Anurag Das*, Xinting Hu, Li Jiang, Bernt Schiele | N/A | |
| Event-Aided Time-To-Collision Estimation for Autonomous Driving | Jinghang Li, Bangyan Liao, Xiuyuan Lu, Peidong Liu, Shaojie Shen, Yi Zhou* | N/A | |
| The Devil is in the Statistics: Mitigating and Exploiting Statistics Difference for Generalizable Semi-supervised Medical Image Segmentation | Muyang Qiu, Jian Zhang, Lei Qi, Qian Yu, Yinghuan Shi*, Yang Gao | N/A | |
| VEON: Vocabulary-Enhanced Occupancy Prediction | Jilai Zheng, Pin Tang, Zhongdao Wang, Guoqing Wang, Xiangxuan Ren, Bailan Feng, Chao Ma* | N/A | |
| Adapt without Forgetting: Distill Proximity from Dual Teachers in Vision-Language Models | Mengyu Zheng, Yehui Tang, Zhiwei Hao, Kai Han, Yunhe Wang, Chang Xu | N/A | |
| The Sky's the Limit: Relightable Outdoor Scenes via a Sky-pixel Constrained Illumination Prior and Outside-In Visibility | James A D Gardner*, Evgenii Kashin, Bernhard Egger, William Smith | N/A | |
| DiffFAS: Face Anti-Spoofing via Generative Diffusion Models | Xinxu Ge, Xin Liu, Zitong Yu, Jingang Shi, Chun Qi, Jie Li, Heikki Kälviäinen | N/A | |
| Hetecooper: Feature Collaboration Graph for Heterogeneous Collaborative Perception | Congzhang Shao, Guiyang Luo, Quan Yuan, Yifu Chen, Yilin Liu, Gong Kexin, Jinglin Li | N/A | |
| Learning-based Axial Video Motion Magnification | Kwon Byung-Ki, Oh Hyun-Bin, Kim Jun-Seong, Hyunwoo Ha, Tae-Hyun Oh* | N/A | |
| Simplifying Source-Free Domain Adaptation for Object Detection: Effective Self-Training Strategies and Performance Insights | Yan Hao, Florent Forest*, Olga Fink | N/A | |
| Class-Incremental Learning with CLIP: Adaptive Representation Adjustment and Parameter Fusion | Linlan Huang, Xusheng Cao, Haori Lu, Xialei Liu* | N/A | |
| cDP-MIL: Robust Multiple Instance Learning via Cascaded Dirichlet Process | Yihang Chen, Tsai Hor Chan, Guosheng Yin, Yuming Jiang, Lequan Yu* | N/A | |
| Causality-inspired Discriminative Feature Learning in Triple Domains for Gait Recognition | Haijun Xiong, Bin Feng*, Xinggang Wang, Wenyu Liu | N/A | |
| Retargeting Visual Data with Deformation Fields | Tim Elsner*, Julia Berger, Tong Wu, Victor Czech, Lin Gao, Leif Kobbelt | N/A | |
| Delving Deep into Engagement Prediction of Short Videos | Dasong Li, Wenjie Li, Baili Lu, Hongsheng Li, Sizhuo Ma, Gurunandan Krishnan, Jian Wang* | N/A | |
| Flexible Distribution Alignment: Towards Long-tailed Semi-supervised Learning with Proper Calibration | Emanuel Sanchez Aimar*, Nathaniel D Helgesen, Yonghao Xu, Marco Kuhlmann, Michael Felsberg | N/A | |
| CLEO: Continual Learning of Evolving Ontologies | Shishir Muralidhara*, Saqib Bukhari, Georg Dr. Schneider, Didier Stricker, René Schuster | N/A | |
| SpecFormer: Guarding Vision Transformer Robustness via Maximum Singular Value Penalization | Xixu Hu, Runkai Zheng, Jindong Wang, Cheuk Hang Leung, Qi Wu, Xing Xie | N/A | |
| Wavelet Convolutions for Large Receptive Fields | Shahaf E Finder, Roy Amoyal, Eran Treister, Oren Freifeld | N/A | |
| BK-SDM: A Lightweight, Fast, and Cheap Version of Stable Diffusion | Bo-Kyeong Kim*, Hyoung-Kyu Song, Thibault Castells, Shinkook Choi | N/A | |
| Language-Assisted Skeleton Action Understanding for Skeleton-Based Temporal Action Segmentation | Haoyu Ji, Bowen Chen, Xinglong Xu, Weihong Ren, Zhiyong Wang*, Honghai Liu | N/A | |
| Leveraging scale- and orientation-covariant features for planar motion estimation | Marcus Valtonen Örnhag*, Alberto Jaenal | N/A | |
| Understanding and Mitigating Human-Labelling Errors in Supervised Contrastive Learning | Zijun Long*, Lipeng Zhuang, George W Killick, Richard Mccreadie, Gerardo Aragon-Camarasa, Paul Henderson | N/A | |
| Adaptive Parametric Activation | Konstantinos P Alexandridis*, Jiankang Deng, Anh Nguyen, Shan Luo | N/A | |
| Distractor-Free Novel View Synthesis via Exploiting Memorization Effect in Optimization | Yukun Wang, Kunhong Li, Minglin Chen, Longguang Wang, Shunbo Zhou, Kaiwen Xue, Yulan Guo | N/A | |
| VEGS: View Extrapolation of Urban Scenes in 3D Gaussian Splatting using Learned Priors | Sungwon Hwang, Min-Jung Kim, Taewoong Kang, Jayeon Kang, Jaegul Choo* | N/A | |
| HGL: Hierarchical Geometry Learning for Test-time Adaptation in 3D Point Cloud Segmentation | Tianpei Zou, Sanqing Qu, Zhijun Li, Alois C. Knoll, 何 良华, Guang Chen*, Changjun Jiang | N/A | |
| SWinGS: Sliding Windows for Dynamic 3D Gaussian Splatting | Richard Shaw*, Michal Nazarczuk, Jifei Song, Arthur Moreau, Sibi Catley-Chandar, Helisa Dhamo, Eduardo Pérez Pellitero | N/A | |
| Temporal-Mapping Photography for Event Cameras | Yuhan Bao, Lei Sun, Yuqin Ma, Kaiwei Wang | N/A | |
| Shape2Scene: 3D Scene Representation Learning Through Pre-training on Shape Data | Tuo Feng, Wenguan Wang, Ruijie Quan, Yi Yang* | N/A | |
| LineFit: A Geometric Approach for Fitting Line Segments in Images | Marion Boyer, David Youssefi, Florent Lafarge* | N/A | |
| Six-Point Method for Multi-Camera Systems with Reduced Solution Space | Banglei Guan, Ji Zhao*, Laurent Kneip | N/A | |
| Mew: Multiplexed Immunofluorescence Image Analysis through an Efficient Multiplex Network | Sukwon Yun, Jie Peng, Alexandro E Trevino, Chanyoung Park, Tianlong Chen* | N/A | |
| Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance | Shenhao Zhu, Junming Leo Chen, Zuozhuo Dai, Zilong Dong, Yinghui Xu, Xun Cao, Yao Yao, Hao Zhu, Siyu Zhu | N/A | |
| AdaDistill: Adaptive Knowledge Distillation for Deep Face Recognition | Fadi Boutros*, Vitomir Struc, Naser Damer | N/A | |
| HERGen: Elevating Radiology Report Generation with Longitudinal Data | Fuying Wang, Shenghui Du, Lequan Yu* | N/A | |
| Labeled Data Selection for Category Discovery | Bingchen Zhao, Nico Lang, Serge Belongie, Oisin Mac Aodha | N/A | |
| Dependency-aware Differentiable Neural Architecture Search | Buang Zhang*, Xinle Wu, Hao Miao, Bin Yang, Chenjuan Guo | N/A | |
| WAS: Dataset and Methods for Artistic Text Segmentation | Xudong Xie, Yuzhe Li, Yang Liu, Zhifei Zhang, Zhaowen Wang, Wei Xiong, Xiang Bai* | N/A | |
| CLIFF: Continual Latent Diffusion for Open-Vocabulary Object Detection | Wuyang Li, Xinyu Liu, Jiayi Ma, Yixuan Yuan* | N/A | |
| GMT: Enhancing Generalizable Neural Rendering via Geometry-Driven Multi-Reference Texture Transfer | Youngho Yoon, Hyun-Kurl Jang, Kuk-Jin Yoon* | N/A | |
| Norface: Improving Facial Expression Analysis by Identity Normalization | Hanwei Liu, Rudong An, Zhimeng Zhang, Bowen Ma, Wei Zhang, Yan Song, Yujing Hu, Chen Wei, Yu Ding | N/A | |
| Unlocking Attributes' Contribution to Successful Camouflage: A Combined Textual and Visual Analysis Strategy | Hong Zhang, Yixuan Lyu, Qian Yu, Hanyang Liu, Huimin Ma, Yuan Ding, Yifan Yang* | N/A | |
| SNeRV: Spectra-preserving Neural Representation for Video | Jina Kim, Jihoo Lee, Jewon Kang* | N/A | |
| COMO: Compact Mapping and Odometry | Eric Dexheimer*, Andrew Davison | N/A | |
| OAT: Object-Level Attention Transformer for Gaze Scanpath Prediction | Yini Fang*, Jingling Yu, Haozheng Zhang, Ralf van der Lans, Bertram E Shi | N/A | |
| SelfSwapper: Self-Supervised Face Swapping via Shape Agnostic Masked AutoEncoder | Jaeseong Lee, Junha Hyung, Sohyun Jeong, Jaegul Choo* | N/A | |
| EgoPoseFormer: A Simple Baseline for Stereo Egocentric 3D Human Pose Estimation | Chenhongyi Yang*, Anastasia Tkach, Shreyas Hampali, Linguang Zhang, Elliot J Crowley, Cem Keskin | N/A | |
| An Information Theoretical View for Out-Of-Distribution Detection | Hu Jinjing, Wenrui Liu, Hong Chang*, Bingpeng Ma, Shiguang Shan, Xilin Chen | N/A | |
| DMiT: Deformable Mipmapped Tri-Plane Representation for Dynamic Scenes | Jing-Wen Yang, Jia-Mu Sun, Yong-Liang Yang, Jie Yang, Ying Shan, Yan-Pei Cao, Lin Gao* | N/A | |
| Gated Temporal Diffusion for Stochastic Long-term Dense Anticipation | Olga Zatsarynna, Emad Bahrami, Yazan Abu Farha, Gianpiero Francesca, Jürgen Gall* | N/A | |
| Gradient-Aware for Class-Imbalanced Semi-supervised Medical Image Segmentation | Wenbo Qi, Jiafei Wu, S. C. Chan | N/A | |
| HowToCaption: Prompting LLMs to Transform Video Annotations at Scale | Nina Shvetsova*, Anna Kukleva, Xudong Hong, Christian Rupprecht, Bernt Schiele, Hilde Kuehne | N/A | |
| LabelDistill: Label-guided Cross-modal Knowledge Distillation for Camera-based 3D Object Detection | Sanmin Kim, Youngseok Kim, Sihwan Hwang, Hyeonjun Jeong, Dongsuk Kum* | N/A | |
| Beyond the Data Imbalance: Employing the Heterogeneous Datasets for Vehicle Maneuver Prediction | Hyeongseok Jeon, Sanmin Kim, Abi Rahman Syamil, Junsoo Kim, Dongsuk Kum* | N/A | |
| On Pretraining Data Diversity for Self-Supervised Learning | Hasan Abed Al Kader Hammoud, Tuhin Das, Fabio Pizzati, Philip Torr, Adel Bibi, Bernard Ghanem | N/A | |
| Look Around and Learn: Self-Training Object Detection by Exploration | Gianluca Scarpellini, Stefano Rosa, Pietro Morerio, Lorenzo Natale, Alessio Del Bue | N/A | |
| Bayesian Self-Training for Semi-Supervised 3D Segmentation | Ozan Unal*, Christos Sakaridis, Luc Van Gool | N/A | |
| Motion and Structure from Event-based Normal Flow | Zhongyang Ren, Bangyan Liao, Delei Kong, Jinghang Li, Peidong Liu, Laurent Kneip, Guillermo Gallego, Yi Zhou* | N/A | |
| ParCo: Part-Coordinating Text-to-Motion Synthesis | Qiran Zou, Shangyuan Yuan, Shian Du, Yu Wang, Chang Liu, Yi Xu, Jie Chen, Xiangyang Ji* | N/A | |
| Learning to Complement and to Defer to Multiple Users | Zheng Zhang, Wenjie Ai, Kevin Wells, David M Rosewarne, Thanh-Toan Do, Gustavo Carneiro* | N/A | |
| Tiny Models are the Computational Saver for Large Models | Qingyuan Wang, Barry Cardiff, Antoine Frappé, Benoit Larras, Deepu John | N/A | |
| DragVideo: Interactive Drag-style Video Editing | Yufan Deng, Ruida Wang, Yuhao Zhang, Yu-Wing Tai, Chi-Keung Tang | N/A | |
| Multi-Sentence Grounding for Long-term Instructional Video | Zeqian Li, Qirui Chen, Tengda Han, Ya Zhang, Yan-Feng Wang, Weidi Xie* | N/A | |
| Do Generalised Classifiers really work on Human Drawn Sketches? | Hmrishav Bandyopadhyay*, Pinaki Nath Chowdhury, Aneeshan Sain, Subhadeep Koley, Tao Xiang, Ayan Kumar Bhunia, Yi-Zhe Song | N/A | |
| KMTalk: Speech-Driven 3D Facial Animation with Key Motion Embedding | Zhihao Xu, Shengjie Gong, Jiapeng Tang, Lingyu Liang, Yining Huang, Haojie Li, Shuangping Huang* | N/A | |
| Head360: Learning a Parametric 3D Full-Head for Free-View Synthesis in 360° | Yuxiao He, Yiyu Zhuang, Yanwen Wang, Yao Yao, Siyu Zhu, Xiaoyu Li, Qi Zhang, Xun Cao, Hao Zhu* | N/A | |
| MotionDirector: Motion Customization of Text-to-Video Diffusion Models | Rui Zhao, Yuchao Gu, Jay Zhangjie Wu, David Junhao Zhang, Jia-Wei Liu, Weijia Wu, Jussi Keppo, Mike Zheng Shou* | N/A | |
| Text2LiDAR: Text-guided LiDAR Point Clouds Generation via Equirectangular Transformer | Yang Wu, Kaihua Zhang, Jianjun Qian, Jin Xie, Jian Yang | N/A | |
| Enhanced Motion Forecasting with Visual Relation Reasoning | Sungjune Kim, Hadam Baek, Seunggwan Lee, Hyung-gun Chi, Hyerin Lim, Jinkyu Kim, Sangpil Kim | N/A | |
| Rate-Distortion-Cognition Controllable Versatile Neural Image Compression | Jinming Liu*, Ruoyu Feng, Yunpeng Qi, Qiuyu Chen, Zhibo Chen, Wenjun Zeng, Xin Jin | N/A | |
| Temporal As a Plugin: Unsupervised Video Denoising with Pre-Trained Image Denoisers | Zixuan Fu*, Lanqing Guo, Chong Wang, Yufei Wang, Zhihao Li, Bihan Wen | N/A | |
| LiDAR-based All-weather 3D Object Detection via Prompting and Distilling 4D Radar | Yujeong Chae, Hyeonseong Kim, Changgyoon Oh, Minseok Kim, Kuk-Jin Yoon* | N/A | |
| MM-SafetyBench: A Benchmark for Safety Evaluation of Multimodal Large Language Models | Xin Liu*, Yichen Zhu, Jindong Gu, Yunshi Lan, Chao Yang, Yu Qiao | N/A | |
| Post-training Quantization with Progressive Calibration and Activation Relaxing for Text-to-Image Diffusion Models | Siao Tang, Xin Wang, Hong Chen, Chaoyu Guan, Zewen Wu, Yansong Tang, Wenwu Zhu | N/A | |
| Scene Coordinate Reconstruction: Posing of Image Collections via Incremental Learning of a Relocalizer | Eric Brachmann*, Jamie Wynn, Shuai Chen, Tommaso Cavallari, Aron Monszpart, Daniyar Turmukhambetov, Victor Adrian Prisacariu | N/A | |
| Diffusion Models are Geometry Critics: Single Image 3D Editing Using Pre-Trained Diffusion Priors | Ruicheng Wang*, Jianfeng Xiang, Jiaolong Yang, Xin Tong | N/A | |
| Weakly Supervised Co-training with Swapping Assignments for Semantic Segmentation | Xinyu Yang*, Hossein Rahmani, Dame S Black, Bryan M Williams | N/A | |
| StoryImager: A Unified and Efficient Framework for Coherent Story Visualization and Completion | Ming Tao, Bingkun Bao, Hao Tang, Yaowei Wang, Changsheng Xu | N/A | |
| ST-LLM: Large Language Models Are Effective Temporal Learners | Ruyang Liu, Chen Li, Haoran Tang, Yixiao Ge, Ying Shan, Ge Li* | N/A | |
| Exact Diffusion Inversion via Bidirectional Integration Approximation | Guoqiang Zhang*, J.P. Lewis, W. Bastiaan Kleijn | N/A | |
| Textual Query-Driven Mask Transformer for Domain Generalized Segmentation | Byeonghyun Pak, Byeongju Woo, Sunghwan Kim, Dae-hwan Kim, Hoseong Kim* | N/A | |
| EmoTalk3D: High-Fidelity Free-View Synthesis of Emotional 3D Talking Head | Qianyun He, Xinya Ji, Yicheng Gong, Yuanxun Lu, Zhengyu Diao, Linjia Huang, Yao Yao, Siyu Zhu, Zhan Ma, Songcen Xu, Xiaofei Wu, Zixiao Zhang, Xun Cao, Hao Zhu* | N/A | |
| Arbitrary-Scale Video Super-Resolution with Structural and Textural Priors | Wei Shang, Dongwei Ren, Wanying Zhang, Yuming Fang, Wangmeng Zuo, Kede Ma | N/A | |
| Object-Centric Diffusion for Efficient Video Editing | Kumara Kahatapitiya, Adil Karjauv, Davide Abati, Fatih Porikli, Yuki M Asano, Amirhossein Habibian | N/A | |
| Single-Mask Inpainting for Voxel-based Neural Radiance Fields | Jiafu Chen*, Tianyi Chu, Jiakai Sun, Wei Xing, Lei Zhao | N/A | |
| McGrids: Monte Carlo-Driven Adaptive Grids for Iso-Surface Extraction | Daxuan Ren*, Hezi Shi, Jianmin Zheng, Jianfei Cai | N/A | |
| Freeview Sketching: View-Aware Fine-Grained Sketch-Based Image Retrieval | Aneeshan Sain*, Pinaki Nath Chowdhury, Subhadeep Koley, Ayan Kumar Bhunia, Yi-Zhe Song | N/A | |
| Adapt2Reward: Adapting Video-Language Models to Generalizable Robotic Rewards via Failure Prompts | Yanting Yang, Minghao Chen*, Qibo Qiu, Jiahao Wu, Wenxiao Wang, Binbin Lin, Ziyu Guan, Xiaofei He | N/A | |
| Diffusion for Natural Image Matting | Yihan Hu, Yiheng Lin, Wei Wang, Yao Zhao, Yunchao Wei, Humphrey Shi | N/A | |
| Agglomerative Token Clustering | Joakim Bruslund Haurum, Sergio Escalera, Graham W. Taylor, Thomas B. Moeslund | N/A | |
| CMD: A Cross Mechanism Domain Adaptation Dataset for 3D Object Detection | Jinhao Deng, Wei Ye, Hai Wu, Qiming Xia, Xun Huang, Xin Li, Jin Fang, Wei Li, Chenglu Wen, Cheng Wang | N/A | |
| Unleashing Text-to-Image Diffusion Prior for Zero-Shot Image Captioning | Jianjie Luo, Jingwen Chen, Yehao Li, Yingwei Pan*, Jianlin Feng, Hongyang Chao, Ting Yao | N/A | |
| ClusteringSDF: Self-Organized Neural Implicit Surfaces for 3D Decomposition | Tianhao Wu*, Chuanxia Zheng, Qianyi Wu, Tat-Jen Cham | N/A | |
| NAMER: Non-Autoregressive Modeling for Handwritten Mathematical Expression Recognition | Chenyu Liu, Jia Pan, Jinshui Hu, Baocai Yin, Bing Yin, Mingjun Chen, Cong Liu, Jun Du*, Qingfeng Liu | N/A | |
| GIVT: Generative Infinite-Vocabulary Transformers | Michael Tschannen*, Cian Eastwood, Fabian Mentzer | N/A | |
| Mismatch Quest: Visual and Textual Feedback for Image-Text Misalignment | Brian Gordon, Yonatan Bitton, Yonatan Shafir, Roopal Garg, Xi Chen, Dani Lischinski, Daniel Cohen-Or, Idan Szpektor | N/A | |
| Regulating Model Reliance on Non-Robust Features by Smoothing Input Marginal Density | Peiyu Yang*, Naveed Akhtar, Mubarak Shah, Ajmal Mian | N/A | |
| Multi-Modal Video Dialog State Tracking in the Wild | Adnen Abdessaied*, Lei Shi, Andreas Bulling | N/A | |
| Factorized Diffusion: Perceptual Illusions by Noise Decomposition | Daniel Geng*, Inbum Park, Andrew Owens | N/A | |
| To Generate or Not? Safety-Driven Unlearned Diffusion Models Are Still Easy To Generate Unsafe Images ... For Now | Yimeng Zhang*, Jinghan Jia, Xin Chen, Aochuan Chen, Yihua Zhang, Jiancheng Liu, Ke Ding, Sijia Liu | N/A | |
| Dissecting Dissonance: Benchmarking Large Multimodal Models Against Self-Contradictory Instructions | Jin Gao, Lei Gan, Yuankai Li, Yixin Ye, Dequan Wang* | N/A | |
| StereoGlue: Joint Feature Matching and Robust Estimation | Daniel Barath*, Dmytro Mishkin, Luca Cavalli, Paul-Edouard Sarlin, Petr Hruby, Marc Pollefeys | N/A | |
| Boosting Transferability in Vision-Language Attacks via Diversification along the Intersection Region of Adversarial Trajectory | Sensen Gao, Xiaojun Jia, Xuhong Ren, Ivor Tsang, Qing Guo | N/A | |
| Leveraging Enhanced Queries of Point Sets for Vectorized Map Construction | Zihao Liu, Xiaoyu Zhang, Guangwei Liu, Ji Zhao, Ningyi Xu | N/A | |
| Robust Zero-Shot Crowd Counting and Localization with Adaptive Resolution SAM | Jia Wan*, Qiangqiang Wu, Wei Lin, Antoni Chan | N/A | |
| AWOL: Analysis WithOut synthesis using Language | Silvia Zuffi*, Michael J. Black | N/A | |
| OneVOS: Unifying Video Object Segmentation with All-in-One Transformer Framework | Wanyun Li, Pinxue Guo, Xinyu Zhou, Lingyi Hong, Yangji He, Xiangyu Zheng, Wei Zhang, Wenqiang Zhang | N/A | |
| M3DBench: Towards Omni 3D Assistant with Interleaved Multi-modal Instructions | Mingsheng Li, Xin Chen, Chi Zhang, Sijin Chen, Hongyuan Zhu, Fukun Yin, Zhuoyuan Li, Gang Yu, Tao Chen* | N/A | |
| MSD: A Benchmark Dataset for Floor Plan Generation of Building Complexes | Casper van Engelenburg*, Fatemeh Mostafavi, Emanuel Kuhn, Yuntae Jeon, Michael Franzen, Matthias Standfest, Jan van Gemert, Seyran Khademi | N/A | |
| End-to-End Rate-Distortion Optimized 3D Gaussian Representation | Henan Wang*, Hanxin Zhu, Tianyu He, Runsen Feng, Jiajun Deng, Jiang Bian, Zhibo Chen | N/A | |
| Temporal Residual Jacobians for Rig-free Motion Transfer | Sanjeev Muralikrishnan*, Niladri Shekhar Dutt, Siddhartha Chaudhuri, Noam Aigerman, Vladimir Kim, Matthew Fisher, Niloy Mitra | N/A | |
| LetsMap: Unsupervised Representation Learning for Label-Efficient Semantic BEV Mapping | Nikhil Gosala*, Kürsat Petek, B Ravi Kiran, Senthil Yogamani, Paulo L. J. Drews-Jr, Wolfram Burgard, Abhinav Valada | N/A | |
| Deblurring 3D Gaussian Splatting | Byeonghyeon Lee, Howoong Lee, Xiangyu Sun, Usman Ali, Eunbyung Park | N/A | |
| Taming Lookup Tables for Efficient Image Retouching | Sidi Yang, Binxiao Huang, Mingdeng Cao, Yatai Ji, Hanzhong Guo, Ngai Wong, Yujiu Yang* | N/A | |
| DualDn: Dual-domain Denoising via Differentiable ISP | Ruikang Li, Yujin Wang*, Shiqi Chen, Fan Zhang, Jinwei Gu, Tianfan Xue | N/A | |
| Quantization-Friendly Winograd Transformations for Convolutional Neural Networks | Vladimir Protsenko*, Vladimir Kryzhanovskiy, Alexander Filippov | N/A | |
| A Task is Worth One Word: Learning with Task Prompts for High-Quality Versatile Image Inpainting | Junhao Zhuang, Yanhong Zeng, Wenran Liu, Chun Yuan, Kai Chen | N/A | |
| Self-supervised Shape Completion via Involution and Implicit Correspondences | Mengya Liu*, Ajad Chhatkuli, Janis Postels, Luc Van Gool, Federico Tombari | N/A | |
| From Fake to Real: Pretraining on Balanced Synthetic Images to Prevent Spurious Correlations in Image Recognition | Maan Qraitem*, Kate Saenko, Bryan A. Plummer | N/A | |
| Cross-Domain Few-Shot Object Detection via Enhanced Open-Set Object Detector | Yuqian Fu*, Yu Wang, Yixuan Pan, Xingyu Qiu, Lian Huai, Zeyu Shangguan, Tong Liu, Yanwei Fu, Luc Van Gool, Xingqun Jiang | N/A | |
| NICP: Neural ICP for 3D Human Registration at Scale | Riccardo Marin*, Enric Corona, Gerard Pons-Moll | N/A | |
| PredBench: Benchmarking Spatio-Temporal Prediction across Diverse Disciplines | ZiDong Wang, Zeyu Lu, Di Huang, Tong He, Xihui Liu, Wanli Ouyang, Lei Bai | N/A | |
| FontStudio: Shape-Adaptive Diffusion Model for Coherent and Consistent Font Effect Generation | Xinzhi Mu*, Li Chen, Bohan Chen, Shuyang Gu, Jianmin Bao, Dong Chen, Ji Li, Yuhui Yuan | N/A | |
| Chronologically Accurate Retrieval for Temporal Grounding of Motion-Language Models | Kent Fujiwara*, Mikihiro Tanaka, Qing Yu | N/A | |
| StableDrag: Stable Dragging for Point-based Image Editing | Yutao Cui, Xiaotong Zhao, Guozhen Zhang, Shengming Cao, Kai Ma, Limin Wang* | N/A | |
| Improving Feature Stability during Upsampling -- Spectral Artifacts and the Importance of Spatial Context | Shashank Agnihotri*, Julia Grabinski, Margret Keuper | N/A | |
| Dynamic Data Selection for Efficient SSL via Coarse-to-Fine Refinement | Aditay Tripathi*, Pradeep Shenoy, Anirban Chakraborty | N/A | |
| Neural Surface Detection for Unsigned Distance Fields | Federico Stella*, Nicolas Talabot, Hieu Le, Pascal Fua | N/A | |
| One-Shot Diffusion Mimicker for Handwritten Text Generation | Gang Dai, Yifan Zhang, Quhui Ke, Qiangya Guo, Shuangping Huang* | N/A | |
| Event-Based Motion Magnification | Yutian Chen, Shi Guo*, Yu Fangzheng, Feng Zhang, Jinwei Gu, Tianfan Xue | N/A | |
| Improving Neural Surface Reconstruction with Feature Priors from Multi-View Images | Xinlin Ren, Chenjie Cao, Yanwei Fu, Xiangyang Xue | N/A | |
| Towards Multimodal Sentiment Analysis Debiasing via Bias Purification | Dingkang Yang, Mingcheng Li, Dongling Xiao, Yang Liu, Kun Yang, Zhaoyu Chen, Yuzheng Wang, Peng Zhai, Ke Li, Lihua Zhang | N/A | |
| Kernel Diffusion: An Alternate Approach to Blind Deconvolution | Yash Sanghvi*, Yiheng Chi, Stanley Chan | N/A | |
| MUSES: The Multi-Sensor Semantic Perception Dataset for Driving under Uncertainty | Tim Broedermann*, David Brüggemann, Christos Sakaridis, Kevin Ta, Odysseas Liagouris, Jason Corkill, Luc Van Gool | N/A | |
| Discovering Novel Actions from Open World Egocentric Videos with Object-Grounded Visual Commonsense Reasoning | Sanjoy Kundu, Shubham Trehan, Sathyanarayanan N Aakur* | N/A | |
| Bidirectional Progressive Transformer for Interaction Intention Anticipation | Zichen Zhang, Hongchen Luo, Wei Zhai, Yu Kang, Yang Cao | N/A | |
| Reinforcement Learning Meets Visual Odometry | Nico Messikommer*, Giovanni Cioffi, Mathias Gehrig, Davide Scaramuzza | N/A | |
| Bucketed Ranking-based Losses for Efficient Training of Object Detectors | Feyza Yavuz*, Baris Can Cam, Adnan Harun Dogan, Kemal Oksuz, Emre Akbas, Sinan Kalkan | N/A | |
| Robustness Tokens: Towards Adversarial Robustness of Transformers | Brian Pulfer*, Yury Belousov, Slava Voloshynovskiy | N/A | |
| RSL-BA: Rolling Shutter Line Bundle Adjustment | Yongcong Zhang, Bangyan Liao, Yifei Xue, Lu Chen, Peidong Liu, Yizhen Lao* | N/A | |
| DecentNeRFs: Decentralized Neural Radiance Fields from Crowdsourced Images | Zaid Tasneem*, Akshat Dave, Abhishek Singh, Kushagra Tiwary, Praneeth Vepakomma, Ashok Veeraraghavan, Ramesh Raskar | N/A | |
| DreamMesh: Jointly Manipulating and Texturing Triangle Meshes for Text-to-3D Generation | Haibo Yang, Yang Chen, Yingwei Pan*, Ting Yao, Zhineng Chen, Zuxuan Wu, Yu-Gang Jiang, Tao Mei | N/A | |
| Unveiling Typographic Deceptions: Insights of the Typographic Vulnerability in Large Vision-Language Models | Hao Cheng, Erjia Xiao, Jindong Gu, Le Yang, Jinhao Duan, Jize Zhang, Jiahang Cao, Kaidi Xu, Renjing Xu* | N/A | |
| N2F2: Hierarchical Scene Understanding with Nested Neural Feature Fields | Yash Bhalgat*, Iro Laina, Joao F Henriques, Andrew Zisserman, Andrea Vedaldi | N/A | |
| ConceptExpress: Harnessing Diffusion Models for Single-image Unsupervised Concept Extraction | Shaozhe Hao, Kai Han, Zhengyao Lv, Shihao Zhao, Kwan-Yee K. Wong* | N/A | |
| PairingNet: A Learning-based Pair-searching and -matching Network for Image Fragments | Rixin Zhou*, Ding Xia, Yi Zhang, Honglin Pang, Xi Yang, Chuntao Li | N/A | |
| Skeleton-based Group Activity Recognition via Spatial-Temporal Panoramic Graph | Zhengcen Li, Xinle Chang, Yueran Li, Jingyong Su* | N/A | |
| Towards Multimodal Open-Set Domain Generalization and Adaptation through Self-supervision | Hao Dong, Eleni Chatzi, Olga Fink* | N/A | |
| ReCON: Training-Free Acceleration for Text-to-Image Synthesis with Retrieval of Concept Prompt Trajectories | Chen-Yi Lu*, Shubham Agarwal, Md Mehrab Tanjim, Kanak Mahadik, Anup Rao, Subrata Mitra, Shiv K Saini, Saurabh Bagchi, Somali Chaterji | N/A | |
| AMES: Asymmetric and Memory-Efficient Similarity Estimation for Instance-level Retrieval | Pavel Suma*, Giorgos Kordopatis-Zilos, Ahmet Iscen, Giorgos Tolias | N/A | |
| TCAN: Animating Human Images with Temporally Consistent Pose Guidance using Diffusion Models | Jeongho Kim, Min-Jung Kim, Junsoo Lee, Jaegul Choo* | N/A | |
| 3D Hand Sequence Recovery from Real Blurry Images and Event Stream | JoonKyu Park, Gyeongsik Moon, Weipeng Xu, Evan Kaseman, Takaaki Shiratori, Kyoung Mu Lee* | N/A | |
| GlobalPointer: Large-Scale Plane Adjustment with Bi-Convex Relaxation | Bangyan Liao, Zhenjun Zhao, Lu Chen, Haoang Li, Daniel Cremers, Peidong Liu* | N/A | |
| Dissolving Is Amplifying: Towards Fine-Grained Anomaly Detection | Jian Shi*, Pengyi Zhang, Ni Zhang, Hakim Ghazzai, Peter Wonka | N/A | |
| StyleCity: Large-Scale 3D Urban Scenes Stylization | Yingshu Chen, Huajian Huang*, Tuan-Anh Vu, Ka Chun Shum, Sai-Kit Yeung | N/A | |
| ViG-Bias: Visually Grounded Bias Discovery and Mitigation | Badr-Eddine Marani*, Mohamed Hanini, Nihitha Malayarukil, Stergios Christodoulidis, Maria Vakalopoulou, Enzo Ferrante | N/A | |
| DiffBIR: Toward Blind Image Restoration with Generative Diffusion Prior | Xinqi Lin, Jingwen He, Ziyan Chen, Zhaoyang Lyu, Bo Dai, Fanghua Yu, Yu Qiao, Wanli Ouyang, Chao Dong | N/A | |
| Assessing Sample Quality via the Latent Space of Generative Models | Jingyi Xu*, Hieu Le, Dimitris Samaras | N/A | |
| Relightable Neural Actor with Intrinsic Decomposition and Pose Control | Diogo Carbonera Luvizon*, Vladislav Golyanik, Adam Kortylewski, Marc Habermann, Christian Theobalt | N/A | |
| Sur^2f: A Hybrid Representation for High-Quality and Efficient Surface Reconstruction from Multi-view Images | Zhangjin Huang, Zhihao Liang, Kui Jia | N/A | |
| HO-Gaussian: Hybrid Optimization of 3D Gaussian Splatting for Urban Scenes | Zhuopeng Li, Yilin Zhang, Chenming Wu, Jianke Zhu, Liangjun Zhang | N/A | |
| Pseudo-keypoint RKHS Learning for Self-supervised 6DoF Pose Estimation | Yangzheng Wu*, Michael Alan Greenspan | N/A | |
| Consistent 3D Line Mapping | Xulong Bai, Hainan Cui, Shuhan Shen | N/A | |
| Distributed Active Client Selection With Noisy Clients Using Model Association Scores | Kwang In Kim* | N/A | |
| PixOOD: Pixel-Level Out-of-Distribution Detection | Tomas Vojir*, Jan Sochman, Jiri Matas | N/A | |
| GarmentCodeData: A Dataset of 3D Made-to-Measure Garments With Sewing Patterns | Maria Korosteleva*, Timur Levent Kesdogan, Fabian Kemper, Stephan Wenninger, Jasmin Koller, Yuhan Zhang, Mario Botsch, Olga Sorkine-Hornung | N/A | |
| Towards a Density Preserving Objective Function for Learning on Point Sets | Haritha Jayasinghe*, Ioannis Brilakis | N/A | |
| AnatoMask: Enhancing Medical Image Segmentation with Reconstruction-guided Self-masking | Yuheng Li, Tianyu Luan, Yizhou Wu, Shaoyan Pan, Yenho Chen, Xiaofeng Yang* | N/A | |
| VF-NeRF: Viewshed Fields for Rigid NeRF Registration | Leo Segre*, Shai Avidan | N/A | |
| Task-Driven Uncertainty Quantification in Inverse Problems via Conformal Prediction | Jeffrey Wen*, Rizwan Ahmad, Phillip Schniter | N/A | |
| Trainable Highly-expressive Activation Functions | Irit Chelly*, Shahaf E. Finder, Shira Ifergane, Oren Freifeld | N/A | |
| Region-Aware Sequence-to-Sequence Learning for Hyperspectral Denoising | JiaHua Xiao, Yang Liu, Xing Wei* | N/A | |
| Self-Supervised Representation Learning for Adversarial Attack Detection | Yi Li*, Plamen Angelov, Neeraj Suri | N/A | |
| Do text-free diffusion models learn discriminative visual representations? | Soumik Mukhopadhyay, Matthew A Gwilliam, Yosuke Yamaguchi, Vatsal Agarwal, Namitha Padmanabhan, Archana Swaminathan, Tianyi Zhou, Jun Ohya, Abhinav Shrivastava | N/A | |
| Clean & Compact: Efficient Data-Free Backdoor Defense with Model Compactness | Huy Phan*, Jinqi Xiao, Yang Sui, Tianfang Zhang, Zijie Tang, Cong Shi, Yan Wang, Yingying Chen, Bo Yuan | N/A | |
| DOCCI: Descriptions of Connected and Contrasting Images | Yasumasa Onoe*, Sunayana Rane, Zachary E Berger, Yonatan Bitton, Jaemin Cho, Roopal Garg, Alexander Ku, Zarana Parekh, Jordi Pont-Tuset, Garrett Tanzer, Su Wang, Jason M Baldridge | N/A | |
| EAS-SNN: End-to-End Adaptive Sampling and Representation for Event-based Detection with Recurrent Spiking Neural Networks | Ziming Wang, Ziling Wang, Huaning Li, Lang Qin, Runhao Jiang, De Ma, Huajin Tang | N/A | |
| AttentionHand: Text-driven Controllable Hand Image Generation for 3D Hand Reconstruction in the Wild | Junho Park, Kyeongbo Kong, Suk-Ju Kang* | N/A | |
| Dataset Quantization with Active Learning based Adaptive Sampling | Zhenghao Zhao*, Yuzhang Shang, Junyi Wu, Yan Yan | N/A | |
| LogoSticker: Inserting Logos into Diffusion Models for Customized Generation | Mingkang Zhu, Xi Chen, Zhongdao Wang, Hengshuang Zhao, Jiaya Jia | N/A | |
| LEROjD: Lidar Extended Radar-Only Object Detection | Patrick Palmer*, Martin Krüger, Stefan Schütte, Richard Altendorfer, Ganesh Adam, Torsten Bertram | N/A | |
| ProCreate, Don't Reproduce! Propulsive Energy Diffusion for Creative Generation | Jack Lu, Ryan Teehan, Mengye Ren* | N/A | |
| Match-Stereo-Videos: Bidirectional Alignment for Consistent Dynamic Stereo Matching | Junpeng Jing, Ye Mao, Krystian Mikolajczyk | N/A | |
| Probabilistic Image-Driven Traffic Modeling via Remote Sensing | Scott Workman*, Armin Hadzic | N/A | |
| IntrinsicAnything: Learning Diffusion Priors for Inverse Rendering Under Unknown Illumination | Xi Chen, Sida Peng, Dongchen Yang, Yuan Liu, Bowen Pan, Chengfei Lyu, Xiaowei Zhou | N/A | |
| VideoStudio: Generating Consistent-Content and Multi-Scene Videos | Fuchen Long, Zhaofan Qiu*, Ting Yao, Tao Mei | N/A | |
| Semantic Residual Prompts for Continual Learning | Martin Menabue, Emanuele Frascaroli, Matteo Boschini, Enver Sangineto, Lorenzo Bonicelli, Angelo Porrello, Simone Calderara | N/A | |
| TransCAD: A Hierarchical Transformer for CAD Sequence Inference from Point Clouds | Elona Dupont*, Kseniya Cherenkova, Dimitrios Mallis, Gleb A Gusev, Anis Kacem, Djamila Aouada | N/A | |
| ViGoR: Improving Visual Grounding of Large Vision Language Models with Fine-Grained Reward Modeling | Siming Yan*, Min Bai, Weifeng Chen, Xiong Zhou, Qixing Huang, Li Erran Li | N/A | |
| Mixture of Efficient Diffusion Experts Through Automatic Interval and Sub-Network Selection | Alireza Ganjdanesh*, Yan Kang, Yuchen Liu, Richard Zhang, Zhe Lin, Heng Huang | N/A | |
| Occupancy as Set of Points | Yiang Shi, Tianheng Cheng, Qian Zhang, Wenyu Liu, Xinggang Wang* | N/A | |
| UAV First-Person Viewers Are Radiance Field Learners | Liqi Yan, Qifan Wang, Junhan Zhao, Qiang Guan, Zheng Tang, Jianhui Zhang, Dongfang Liu | N/A | |
| Rethinking Few-shot Class-incremental Learning: Learning from Yourself | Yu-Ming Tang, Yi-Xing Peng, Jingke Meng*, Wei-Shi Zheng | N/A | |
| ProSub: Probabilistic Open-Set Semi-Supervised Learning with Subspace-Based Out-of-Distribution Detection | Erik Wallin*, Lennart Svensson, Fredrik Kahl, Lars Hammarstrand | N/A | |
| A Fair Ranking and New Model for Panoptic Scene Graph Generation | Julian Lorenz*, Alexander Pest, Daniel Kienzle, Katja Ludwig, Rainer Lienhart | N/A | |
| Pick-a-back: Selective Device-to-Device Knowledge Transfer in Federated Continual Learning | HyungJune Lee*, JinYi Yoon | N/A | |
| Compensation Sampling for Improved Convergence in Diffusion Models | Hui Lu*, Albert Ali Salah, Ronald Poppe | N/A | |
| Situated Instruction Following | So Yeon Min*, Xavier Puig, Devendra Singh Chaplot, Tsung-Yen Yang, Priyam Parashar, Akshara Rai, Ruslan Salakhutdinov, Yonatan Bisk, Roozbeh Mottaghi | N/A | |
| Holodepth: Programmable Depth-Varying Projection via Computer-Generated Holography | Dorian Chan, Matthew O'Toole, Sizhuo Ma, Jian Wang | N/A | |
| SceneScript: Reconstructing Scenes With An Autoregressive Structured Language Model | Armen Avetisyan*, Christopher Xie, Henry Howard-Jenkins, Tsun-Yi Yang, Samir Aroudj, Suvam Patra, Fuyang Zhang, Luke Holland, Duncan Frost, Campbell Orme, Jakob Engel, Edward Miller, Richard Newcombe, Vasileios Balntas | N/A | |
| GalLop: Learning global and local prompts for vision-language models | Marc Lafon, Elias Ramzi, Clément Rambour, Nicolas Audebert, Nicolas Thome | N/A | |
| Depth on Demand: Streaming Dense Depth from a Low Frame Rate Active Sensor | Andrea Conti*, Matteo Poggi, Valerio Cambareri, Stefano Mattoccia | N/A | |
| Lossy Image Compression with Foundation Diffusion Models | Lucas Relic, Roberto Azevedo, Markus Gross, Christopher Schroers | N/A | |
| CLIP-DINOiser: Teaching CLIP a few DINO tricks for open-vocabulary semantic segmentation | Monika Wysoczańska*, Oriane Siméoni, Michaël Ramamonjisoa, Andrei Bursuc, Tomasz Trzciński, Patrick Pérez | N/A | |
| FMBoost: Boosting Latent Diffusion with Flow Matching | Johannes S Fischer*, Ming Gui, Pingchuan Ma, Nick Stracke, Stefan Andreas Baumann, Vincent Tao Hu, Björn Ommer | N/A | |
| COMPOSE: Comprehensive Portrait Shadow Editing | Andrew Z Hou*, Zhixin Shu, Xuaner Zhang, He Zhang, Yannick Hold-Geoffroy, Jae Shin Yoon, Xiaoming Liu | N/A | |
| LNL+K: Enhancing Learning with Noisy Labels Through Noise Source Knowledge Integration | Siqi Wang*, Bryan Plummer | N/A | |
| Diffusion Models as Data Mining Tools | Ioannis Siglidis*, Aleksander Holynski, Alexei A. Efros, Mathieu Aubry, Shiry Ginosar | N/A | |
| Graph Neural Network Causal Explanation via Neural Causal Models | Arman Behnam*, Binghui Wang | N/A | |
| Unsupervised, Online and On-The-Fly Anomaly Detection For Non-Stationary Image Distributions | Declan GD McIntosh*, Alexandra Branzan Albu | N/A | |
| Photorealistic Object Insertion with Diffusion-Guided Inverse Rendering | Ruofan Liang, Zan Gojcic, Merlin Nimier-David, David Acuna, Nandita Vijaykumar, Sanja Fidler, Zian Wang* | N/A | |
| GAReT: Cross-view Video Geolocalization with Adapters and Auto-Regressive Transformers | Manu S Pillai*, Mamshad Nayeem Rizve, Mubarak Shah | N/A | |
| SAMFusion: Sensor-Adaptive Multimodal Fusion for 3D Object Detection in Adverse Weather | Edoardo Palladin, Roland Dietze, Praveen Narayanan, Mario Bijelic, Felix Heide | N/A | |
| Generating Physically Realistic and Directable Human Motions from Multi-Modal Inputs | Aayam Shrestha, Pan Liu, German Ros, Kai Yuan, Alan Fern | N/A | |
| CoTracker: It is Better to Track Together | Nikita Karaev*, Ignacio Rocco, Ben Graham, Natalia Neverova, Andrea Vedaldi, Christian Rupprecht | N/A | |
| SPHINX: A Mixer of Weights, Visual Embeddings and Image Scales for Multi-modal Large Language Models | Ziyi Lin, Dongyang Liu, Renrui Zhang, Peng Gao, Longtian Qiu, Han Xiao, Han Qiu, Wenqi Shao, Keqin Chen, Jiaming Han, Siyuan Huang, Yichi Zhang, Xuming He, Yu Qiao, Hongsheng Li* | N/A | |
| PathMMU: A Massive Multimodal Expert-Level Benchmark for Understanding and Reasoning in Pathology | Yuxuan Sun, Hao Wu, Chenglu Zhu, Sunyi Zheng, Qizi Chen, Kai Zhang, Yunlong Zhang, Dan Wan, Xiaoxiao Lan, Mengyue Zheng, Jingxiong Li, Xinheng Lyu, Tao Lin, Lin Yang* | N/A | |
| Improving Adversarial Transferability via Model Alignment | Avery Ma*, Amir-massoud Farahmand, Yangchen Pan, Philip Torr, Jindong Gu | N/A | |
| RealGen: Retrieval Augmented Generation for Controllable Traffic Scenarios | Wenhao Ding*, Yulong Cao, Ding Zhao, Chaowei Xiao, Marco Pavone | N/A | |
| ADen: Adaptive Density Representations for Sparse-view Camera Pose Estimation | Hao Tang, Weiyao Wang, Pierre Gleize, Matt Feiszli* | N/A | |
| Embodied Understanding of Driving Scenarios | Yunsong Zhou*, Linyan Huang, Qingwen Bu, Jia Zeng, Tianyu Li, Hang Qiu, Hongzi Zhu, Minyi Guo, Yu Qiao, Hongyang Li | N/A | |
| Learning to Drive via Asymmetric Self-Play | Chris Zhang*, Sourav Biswas, Kelvin Wong, Kion Fallah, Lunjun Zhang, Dian Chen, Sergio Casas, Raquel Urtasun | N/A | |
| OpenIns3D: Snap and Lookup for 3D Open-vocabulary Instance Segmentation | Zhening Huang, Xiaoyang Wu, Xi Chen, Hengshuang Zhao, Lei Zhu, Joan Lasenby | N/A | |
| ViLA: Efficient Video-Language Alignment for Video Question Answering | Xijun Wang*, Junbang Liang, Chun-Kai Wang, Kenan Deng, Yu Lou, Ming C Lin, Shan Yang | N/A | |
| Factorizing Text-to-Video Generation by Explicit Image Conditioning | Rohit Girdhar*, Mannat Singh, Andrew Brown, Quentin Duval, Samaneh Azadi, Sai Saketh Rambhatla, Mian Akbar Shah, Xi Yin, Devi Parikh, Ishan Misra | N/A | |
| MobileDiffusion: Instant Text-to-Image Generation on Mobile Devices | Yang Zhao, Zhisheng Xiao, Yanwu Xu, Haolin Jia, Tingbo Hou | N/A | |
| Open-Set Biometrics: Beyond Good Closed-Set Models | Yiyang Su, Minchul Kim, Feng Liu, Anil Jain, Xiaoming Liu* | N/A | |
| UNIT: Backdoor Mitigation via Automated Neural Distribution Tightening | Siyuan Cheng*, Guangyu Shen, Kaiyuan Zhang, Guanhong Tao, Shengwei An, Hanxi Guo, Shiqing Ma, Xiangyu Zhang | N/A | |
| Which Model Generated This Image? A Model-Agnostic Approach for Origin Attribution | Fengyuan Liu, Haochen Luo, Yiming Li, Philip Torr, Jindong Gu* | N/A | |
| Osmosis: RGBD Diffusion Prior for Underwater Image Restoration | Opher Bar Nathan*, Deborah Levy, Tali Treibitz, Dan Rosenbaum | N/A | |
| Towards Adaptive Pseudo-label Learning for Semi-Supervised Temporal Action Localization | Feixiang Zhou, Bryan Williams, Hossein Rahmani* | N/A | |
| Computing the Lipschitz constant needed for fast scene recovery from CASSI measurements | Niels Chr Overgaard*, Anders Holst | N/A | |
| DatasetNeRF: Efficient 3D-aware Data Factory with Generative Radiance Fields | Yu Chi*, Fangneng Zhan, Sibo Wu, Christian Theobalt, Adam Kortylewski | N/A | |
| Flowed Time of Flight Radiance Fields | Mikhail Okunev*, Marc Mapeke, Benjamin Attal, Christian Richardt, Matthew O'Toole, James Tompkin | N/A | |
| 3D-GOI: 3D GAN Omni-Inversion for Multifaceted and Multi-object Editing | Haoran Li, Long Ma, Haolin Shi, Yanbin Hao, Yong Liao, Lechao Cheng, Peng Yuan Zhou | N/A | |
| Fast Registration of Photorealistic Avatars for VR Facial Animation | Chaitanya Patel*, Shaojie Bai, Te-Li Wang, Jason Saragih, Shih-En Wei | N/A | |
| CoPT: Unsupervised Domain Adaptive Segmentation using Domain-Agnostic Text Embeddings | Cristina Mata*, Kanchana N Ranasinghe, Michael S Ryoo | N/A | |
| HiFi-Score: Fine-grained Image Description Evaluation with Hierarchical Parsing Graphs | Ziwei Yao, Ruiping Wang*, Xilin Chen | N/A | |
| Image-to-Lidar Relational Distillation for Autonomous Driving Data | Anas Mahmoud*, Ali Harakeh, Steven Waslander | N/A | |
| Thinking Outside the BBox: Unconstrained Generative Object Compositing | Gemma Canet Tarrés*, Zhe Lin, Zhifei Zhang, Jianming Zhang, Yizhi Song, Dan Ruta, Andrew Gilbert, John Collomosse, Soo Ye Kim | N/A | |
| Large-scale Reinforcement Learning for Diffusion Models | Yinan Zhang, Eric Tzeng, Yilun Du, Dmitry Kislyuk | N/A | |
| CoMusion: Towards Consistent Stochastic Human Motion Prediction via Motion Diffusion | Jiarui Sun, Girish Chowdhary | N/A | |
| FedHARM: Harmonizing Model Architectural Diversity in Federated Learning | Anestis Kastellos*, Athanasios Psaltis, Charalampos Z Patrikakis, Petros Daras | N/A | |
| EAGLES: Efficient Accelerated 3D Gaussians with Lightweight EncodingS | Sharath Girish*, Kamal Gupta, Abhinav Shrivastava | N/A | |
| Global Counterfactual Directions | Bartłomiej Sobieski, Przemyslaw Biecek | N/A | |
| TCLC-GS: Tightly Coupled LiDAR-Camera Gaussian Splatting for Autonomous Driving | Cheng Zhao*, Su Sun, Ruoyu Wang, Yuliang Guo, Jun-Jun Wan, Zhou Huang, Xinyu Huang, Yingjie Victor Chen, Liu Ren | N/A | |
| RT-Pose: A 4D Radar-Tensor based 3D Human Pose Estimation and Localization Benchmark | Yuan-Hao Ho, Jen-Hao Cheng, Sheng Yao Kuan, Zhongyu Jiang, Wenhao Chai, Hsiang-Wei Huang, Chih-Lung Lin, Jenq-Neng Hwang* | N/A | |
| EditShield: Protecting Unauthorized Image Editing by Instruction-guided Diffusion Models | Ruoxi Chen, Haibo Jin, Yixin Liu, Jinyin Chen*, Haohan Wang, Lichao Sun | N/A | |
| RICA^2: Rubric-Informed, Calibrated Assessment of Actions | Abrar Majeedi, Viswanatha Reddy Gajjala, Satya Sai Srinath Namburi GNVV, Yin Li* | N/A | |
| Region-centric Image-Language Pretraining for Open-Vocabulary Detection | Dahun Kim*, Anelia Angelova, Weicheng Kuo | N/A | |
| Commonly Interesting Images | Fitim Abdullahu, Helmut Grabner | N/A | |
| Contrasting Deepfakes Diffusion via Contrastive Learning and Global-Local Similarities | Lorenzo Baraldi*, Federico Cocchi, Marcella Cornia, Lorenzo Baraldi, Alessandro Nicolosi, Rita Cucchiara | N/A | |
| CriSp: Leveraging Tread Depth Maps for Enhanced Crime-Scene Shoeprint Matching | Samia Shafique*, Shu Kong, Charless Fowlkes | N/A | |
| Caltech Aerial RGB-Thermal Dataset in the Wild | Connor Lee*, Matthew Anderson, Nikhil Ranganathan, Xingxing Zuo, Kevin T Do, Georgia Gkioxari, Soon-Jo Chung | N/A | |
| Diffusion Soup: Model Merging for Text-to-Image Diffusion Models | Benjamin J Biggs*, Arjun Seshadri, Yang Zou, Achin Jain, Aditya Golatkar, Yusheng Xie, Alessandro Achille, Ashwin Swaminathan, Stefano Soatto | N/A | |
| Volumetric Rendering with Baked Quadrature Fields | Gopal Sharma*, Daniel Rebain, Kwang Moo Yi, Andrea Tagliasacchi | N/A | |
| CityGuessr: City-Level Video Geo-Localization on a Global Scale | Parth Parag Kulkarni*, Gaurav Kumar Nayak, Mubarak Shah | N/A | |
| Pseudo-Labelling Should Be Aware of Disguising Channel Activations | Changrui Chen, Kurt Debattista, Jungong Han* | N/A | |
| Bayesian Detector Combination for Object Detection with Crowdsourced Annotations | Zhi Qin Tan*, Olga Isupova, Gustavo Carneiro, Xiatian Zhu, Yunpeng Li | N/A | |
| Revising Densification in Gaussian Splatting | Samuel Rota Bulò*, Lorenzo Porzi, Peter Kontschieder | N/A | |
| FlexiEdit: Frequency-Aware Latent Refinement for Enhanced Non-Rigid Editing | Gwanhyeong Koo, Sunjae Yoon, Ji Woo Hong, Chang D. Yoo* | N/A | |
| Smoothness, Synthesis, and Sampling: Re-thinking Unsupervised Multi-View Stereo with DIV Loss | Alex Rich*, Noah Stier, Pradeep Sen, Tobias Hollerer | N/A | |
| Text Motion Translator: A Bi-Directional Model for Enhanced 3D Human Motion Generation from Open-Vocabulary Descriptions | Yijun Qian*, Jack Urbanek, Alexander Hauptmann, Jungdam Won | N/A | |
| UL-VIO: Ultra-lightweight Visual-Inertial Odometry with Noise Robust Test-time Adaptation | Jinho Park*, Se Young Chun, Mingoo Seok | N/A | |
| PolyOculus: Simultaneous Multi-view Image-based Novel View Synthesis | Jason J. Yu*, Tristan Aumentado-Armstrong, Fereshteh Forghani, Konstantinos G. Derpanis, Marcus A. Brubaker | N/A | |
| R3DS: Reality-linked 3D Scenes for Panoramic Scene Understanding | Qirui Wu*, Sonia Raychaudhuri, Daniel Ritchie, Manolis Savva, Angel X Chang | N/A | |
| A Graph-Based Approach for Category-Agnostic Pose Estimation | Or Hirschorn*, Shai Avidan | N/A | |
| Depth-guided NeRF Training via Earth Mover’s Distance | Anita Rau*, Josiah Aklilu, Floyd C Holsinger, Serena Yeung-Levy | N/A | |
| INTRA: Interaction Relationship-aware Weakly Supervised Affordance Grounding | Ji Ha Jang, Hoigi Seo, Se Young Chun* | N/A | |
| DEPICT: Diffusion-Enabled Permutation Importance for Image Classification Tasks | Sarah Jabbour*, Gregory Kondas, Ella Kazerooni, Michael Sjoding, David Fouhey, Jenna Wiens | N/A | |
| Meerkat: Audio-Visual Large Language Model for Grounding in Space and Time | Sanjoy Chowdhury*, Sayan Nag, Subhrajyoti Dasgupta, Jun Chen, Mohamed Elhoseiny, Ruohan Gao, Dinesh Manocha | N/A | |
| Diagnosing and Re-learning for Balanced Multimodal Learning | Yake Wei, Siwei Li, Ruoxuan Feng, Di Hu* | N/A | |
| Contribution-based Low-Rank Adaptation with Pre-training Model for Real Image Restoration | Dongwon Park, Hayeon Kim, Se Young Chun* | N/A | |
| Elucidating the Hierarchical Nature of Behavior with Masked Autoencoders | Lucas Stoffl, Andy Bonnetto, Stéphane D'Ascoli, Alexander Mathis* | N/A | |
| BeyondScene: Higher-Resolution Human-Centric Scene Generation With Pretrained Diffusion | Gwanghyun Kim, Hayeon Kim, Hoigi Seo, Dong Un Kang, Se Young Chun* | N/A | |
| SpaRP: Fast 3D Object Reconstruction and Pose Estimation from Sparse Views | Chao Xu, Ang Li, Linghao Chen, Yulin Liu, Ruoxi Shi, Hao Su, Minghua Liu | N/A | |
| MMEarth: Exploring Multi-Modal Pretext Tasks For Geospatial Representation Learning | Vishal Nedungadi, Ankit Kariryaa, Stefan Oehmcke, Serge Belongie, Christian Igel, Nico Lang* | N/A | |
| Discovering Unwritten Visual Classifiers with Large Language Models | Mia Chiquier*, Utkarsh Mall, Carl Vondrick | N/A | |
| LITA: Language Instructed Temporal-Localization Assistant | De-An Huang*, Shijia Liao, Subhashree Radhakrishnan, Hongxu Yin, Pavlo Molchanov, Zhiding Yu, Jan Kautz | N/A | |
| MARs: Multi-view Attention Regularizations for Patch-based Feature Recognition of Space Terrain | Timothy Chase Jr*, Karthik Dantu | N/A | |
| Ferret-UI: Grounded Mobile UI Understanding with Multimodal LLMs | Keen You*, Haotian Zhang, Eldon Schoop, Floris Weers, Amanda Swearngin, Jeff Nichols, Yinfei Yang, Zhe Gan | N/A | |
| Bridging the Pathology Domain Gap: Efficiently Adapting CLIP for Pathology Image Analysis with Limited Labeled Data | Zhengfeng Lai*, Joohi Chauhan, Brittany N. Dugger, Chen-Nee Chuah | N/A | |
| AugUndo: Scaling Up Augmentations for Monocular Depth Completion and Estimation | Yangchao Wu*, Tian Yu Liu, Hyoungseob Park, Stefano Soatto, Dong Lao, Alex Wong | N/A | |
| CARB-Net: Camera-Assisted Radar-Based Network for Vulnerable Road User Detection | Wei-Yu Lee*, Martin Dimitrievski, David Van Hamme, Jan Aelterman, Ljubomir Jovanov, Wilfried Philips | N/A | |
| SAH-SCI: Self-Supervised Adapter for Efficient Hyperspectral Snapshot Compressive Imaging | Haijin Zeng, Yuxi Liu, Yongyong Chen*, Youfa Liu, Chong Peng, Jingyong Su | N/A | |
| Minimalist Vision with Freeform Pixels | Jeremy Klotz*, Shree Nayar | N/A | |
| All You Need is Your Voice: Emotional Face Representation with Audio Perspective for Emotional Talking Face Generation | Seongho Kim, Byung Cheol Song* | N/A | |
| LatentEditor: Text Driven Local Editing of 3D Scenes | Umar Khalid*, Hasan Iqbal, Muhammad Tayyab, Md Nazmul Karim, Jing Hua, Chen Chen | N/A | |
| Single-Photon 3D Imaging with Equi-Depth Photon Histograms | Kaustubh Sadekar*, David Maier, Atul Ingle | N/A | |
| Asynchronous Bioplausible Neuron for Spiking Neural Networks for Event-Based Vision | Hussain Sajwani, Dimitrios Makris, Yahya Zweiri, Fariborz Baghaei Naeini, Sanket Kachole* | N/A | |
| Viewpoint textual inversion: discovering scene representations and 3D view control in 2D diffusion models | James Burgess*, Kuan-Chieh Wang, Serena Yeung-Levy | N/A | |
| POET: Prompt Offset Tuning for Continual Human Action Adaptation | Prachi Garg*, Joseph K J, Vineeth N Balasubramanian, Necati Cihan Camgoz, Chengde Wan, Kenrick Kin, Weiguang Si, Shugao Ma, Fernando de la Torre | N/A | |
| Domain Generalization of 3D Object Detection by Density-Resampling | Shuangzhi Li, Lei Ma, Xingyu Li* | N/A | |
| IG Captioner: Information Gain Captioners are Strong Zero-shot Classifiers | Chenglin Yang*, Siyuan Qiao, Yuan Cao, Yu Zhang, Tao Zhu, Alan Yuille, Jiahui Yu | N/A | |
| MRSP: Learn Multi-Representations of Single Primitive for Compositional Zero-Shot Learning | Dongyao Jiang, Hui Chen, Haodong Jing, Yongqiang Ma, Nanning Zheng* | N/A | |
| Cross-Domain Semantic Segmentation on Inconsistent Taxonomy using VLMs | Jeongkee Lim, Yusung Kim* | N/A | |
| TrafficNight: An Aerial Multimodal Benchmark For Nighttime Vehicle Surveillance | Guoxing Zhang, Yiming Liu, xiaoyu yang, Chao Huang*, HUANG Hailong | N/A | |
| Loc3Diff: Local Diffusion for 3D Human Head Synthesis and Editing | Yushi Lan, Feitong Tan, Qiangeng Xu, Di Qiu, Kyle Genova, Zeng Huang, Rohit Pandey, Sean Fanello, Thomas Funkhouser, Chen Change Loy, Yinda Zhang | N/A | |
| Towards Open Domain Text-Driven Synthesis of Multi-Person Motions | Mengyi Shan, Lu Dong, Yutao Han, Yuan Yao, Tao Liu, Ifeoma Nwogu, Guo-Jun Qi, Mitchell K Hill* | N/A | |
| Generative End-to-End Autonomous Driving | Wenzhao Zheng, Ruiqi Song, Xianda Guo*, Chenming Zhang, Long Chen | N/A | |
| Learning to Distinguish Samples for Generalized Category Discovery | Fengxiang Yang, Nan Pu, Wenjing Li, Zhiming Luo, Shaozi Li, Nicu Sebe, Zhun Zhong | N/A | |
| COM Kitchens: An Unedited Overhead-view Procedural Videos Dataset as a Vision-Language Benchmark | Atsushi Hashimoto*, Koki Maeda, Tosho Hirasawa, Jun Harashima, Leszek Rybicki, Yusuke Fukasawa, Yoshitaka Ushiku | N/A | |
| PILoRA: Prototype Guided Incremental LoRA for Federated Class-Incremental Learning | Haiyang Guo, Fei Zhu, Wenzhuo Liu, Xu-Yao Zhang, Cheng-Lin Liu | N/A | |
| Diff-Reg: Diffusion Model in Doubly Stochastic Matrix Space for Registration Problem | Qianliang Wu, Haobo Jiang, Lei Luo, Jun Li, Yaqing Ding, Jin Xie, Jian Yang* | N/A | |
| WBP: Training-time Backdoor Attacks through Hardware-based Weight Bit Poisoning | Kunbei Cai, Zhenkai Zhang, Qian Lou, Fan Yao | N/A | |
| Towards Dual Transparent Liquid Level Estimation in Biomedical Lab: Dataset, Methods and Practice | Xiayu Wang, Ke Ma, Ruiyun Zhong, Xinggang Wang, Yi Fang, Yang Xiao, Tian Xia* | N/A | |
| Encapsulating Knowledge in One Prompt | Qi Li, Runpeng Yu, Xinchao Wang* | N/A | |
| Cross-Input Certified Training for Universal Perturbations | Changming Xu*, Gagandeep Singh | N/A | |
| Visual Relationship Transformation | Xiaoyu Xu*, Jiayan Qiu, Baosheng Yu, Zhou Wang | N/A | |
| Not Just Change the Labels, Learn the Features: Watermarking Deep Neural Networks with Multi-View Data | Yuxuan Li, Sarthak Kumar Maharana, Yunhui Guo* | N/A | |
| Delving into Adversarial Robustness on Document Tampering Localization | Huiru Shao, Zhuang Qian, Kaizhu Huang, Wei Wang, Xiaowei Huang, Qiufeng Wang* | N/A | |
| Adaptive Selection of Sampling-Reconstruction in Fourier Compressed Sensing | Seongmin Hong, Jaehyeok Bae, Jongho Lee, Se Young Chun | N/A | |
| Confidence-Based Iterative Generation for Real-World Image Super-Resolution | Jialun Peng, Xin Luo, Jingjing Fu, Dong Liu | N/A | |
| Learning Scalable Model Soup on a Single GPU: An Efficient Subspace Training Strategy | Tao Li*, Weisen Jiang, Fanghui Liu, Xiaolin Huang, James Kwok | N/A | |
| Correspondences of the Third Kind: Camera Pose Estimation from Object Reflection | Kohei Yamashita*, Vincent Lepetit, Ko Nishino | N/A | |
| Seeing Faces in Things: A Model and Dataset for Pareidolia | Mark T Hamilton*, Simon Stent, Vasha G DuTell, Anne Harrington, Jennifer E Corbett, Ruth Rosenholtz, William T. Freeman | N/A | |
| Cocktail Universal Adversarial Attack on Deep Neural Networks | Shaoxin Li, Xiaofeng Liao, Xin Che, Xintong Li, Yong Zhang, Lingyang Chu | N/A | |
| Gaussian Frosting: Editable Complex Radiance Fields with Real-Time Rendering | Antoine Guédon*, Vincent Lepetit | N/A | |
| AMD: Automatic Multi-step Distillation of Large-scale Vision Models | Cheng Han, Qifan Wang, Sohail A Dianat, Majid Rabbani, Raghuveer Rao, Yi Fang, Qiang Guan, Lifu Huang, Dongfang Liu* | N/A | |
| FairViT: Fair Vision Transformer via Adaptive Masking | Bowei Tian, Ruijie Du, Yanning Shen* | N/A | |
| TrojVLM: Backdoor Attack Against Vision Language Models | Weimin Lyu*, Lu Pang, Tengfei Ma, Haibin Ling, Chao Chen | N/A | |
| VisionLLaMA: A Unified LLaMA Backbone for Vision Tasks | Xiangxiang Chu, Jianlin Su, Bo Zhang, Chunhua Shen | N/A | |
| Frugal 3D Point Cloud Model Training via Progressive Near Point Filtering and Fused Aggregation | Donghyun Lee, Yejin Lee, Jae W. Lee, Hongil Yoon | N/A | |
| HVCLIP: High-dimensional Vector in CLIP for Unsupervised Domain Adaptation | Noranart Vesdapunt*, Kah Kuen Fu, Yue Wu, Xu Zhang, Pradeep Natarajan | N/A | |
| Improving 3D Semi-supervised Learning by Effectively Utilizing All Unlabelled Data | Sneha Paul*, Zachary Patterson, Nizar Bouguila | N/A | |
| PRET: Planning with Directed Fidelity Trajectory for Vision and Language Navigation | Renjie Lu, Jingke Meng*, WEI-SHI ZHENG | N/A | |
| MART: MultiscAle Relational Transformer Networks for Multi-agent Trajectory Prediction | Seongju Lee, Junseok Lee, Yeonguk Yu, Taeri Kim, Kyoobin Lee* | N/A | |
| Expanding Scene Graph Boundaries: Fully Open-vocabulary Scene Graph Generation via Visual-Concept Alignment and Retention | Zuyao Chen, Jinlin Wu, Zhen Lei, Zhaoxiang Zhang, Chang Wen Chen* | N/A | |
| Few-shot NeRF by Adaptive Rendering Loss Regularization | Qingshan Xu*, Xuanyu Yi, Jianyao Xu, Wenbing Tao, Yew Soon Ong, Hanwang Zhang | N/A | |
| Investigating Style Similarity in Diffusion Models | Gowthami Somepalli*, Anubhav Gupta, Kamal Gupta, Shramay Palta, Micah Goldblum, Jonas A. Geiping, Abhinav Shrivastava, Tom Goldstein | N/A | |
| JDT3D: Addressing the Gaps in LiDAR-Based Tracking-by-Attention | Brian Cheong, Jiachen Zhou, Steven L Waslander* | N/A | |
| MagicMirror: Fast and High-Quality Avatar Generation with Constrained Search Space | Armand Comas, Di Qiu*, Menglei Chai, Marcel C. Bühler, Amit Raj, Ruiqi Gao, Qiangeng Xu, Mark J Matthews, Paulo Gotardo, Sergio Orts-Escolano, Thabo Beeler | N/A | |
| EntAugment: Entropy-Driven Adaptive Data Augmentation Framework for Image Classification | Suorong Yang, Furao Shen, Jian Zhao | N/A | |
| Timestep-Aware Correction for Quantized Diffusion Models | Yuzhe Yao, Feng Tian, Jun Chen*, Haonan Lin, Guang Dai, Yong Liu, Jingdong Wang | N/A | |
| SPARO: Selective Attention for Robust and Compositional Transformer Encodings for Vision | Ankit Vani*, Bac Nguyen, Samuel Lavoie, Ranjay Krishna, Aaron Courville | N/A | |
| Towards compact reversible image representations for neural style transfer | Xiyao Liu, Siyu Yang, Jian Zhang, Gerald Schaefer, Jiya Li, Xunli FAN, Songtao Wu, Hui Fang | N/A | |
| Out-of-Bounding-Box Triggers: A Stealthy Approach to Cheat Object Detectors | Tao Lin, lijia Yu, Gaojie Jin, Renjue Li, Peng Wu, Lijun Zhang | N/A | |
| GTMS: A Gradient-driven Tree-guided Mask-free Referring Image Segmentation Method | Haoxin Lv, Tianxiong Zhong, Sanyuan Zhao* | N/A | |
| Long-term Temporal Context Gathering for Neural Video Compression | Linfeng Qi, Zhaoyang Jia, Jiahao Li, Bin Li, Houqiang Li, Yan Lu* | N/A | |
| VQA-Diff: Exploiting VQA and Diffusion for Zero-Shot Image-to-3D Vehicle Asset Generation in Autonomous Driving | YIBO LIU*, Zheyuan Yang, Guile Wu, Yuan Ren, Kejian Lin, Liu Bingbing, Yang Liu, JINJUN SHAN | N/A | |
| From Pixels to Objects: A Hierarchical Approach for Part and Object Segmentation Using Local and Global Aggregation | Yunfei Xie*, Cihang Xie, Alan Yuille, Jieru Mei | N/A | |
| Leveraging Text Localization for Scene Text Removal via Text-aware Masked Image Modeling | Zixiao Wang*, Hongtao Xie, YuXin Wang, Yadong Qu, Fengjun Guo, Pengwei Liu | N/A | |
| Unmasking Bias in Diffusion Model Training | Hu Yu, Li Shen, Jie Huang, Hongsheng Li, Feng Zhao* | N/A | |
| Multimodal Label Relevance Ranking via Reinforcement Learning | Taian Guo, Taolin Zhang, Haoqian Wu, Hanjun Li, Ruizhi Qiao*, Xing Sun | N/A | |
| Animate Your Motion: Turning Still Images into Dynamic Videos | Mingxiao Li, Bo Wan, Sien Moens, Tinne Tuytelaars | N/A | |
| Layered Rendering Diffusion Model for Controllable Zero-Shot Image Synthesis | Zipeng Qi, Guoxi Huang*, Chenyang Liu, Fei Ye | N/A | |
| CIC-BART-SSA: Controllable Image Captioning with Structured Semantic Augmentation | Kalliopi Basioti, Mohamed A Abdelsalam, Federico Fancellu, Vladimir Pavlovic, Afsaneh Fazly* | N/A | |
| A Simple Background Augmentation Method for Object Detection with Diffusion Model | Yuhang Li, Xin Dong, Chen Chen, Weiming Zhuang, Lingjuan Lyu* | N/A | |
| Echoes of the Past: Boosting Long-tail Recognition via Reflective Learning | Qihao Zhao, Yalun Dai, Shen Lin, Wei Hu, Fan Zhang*, Jun Liu | N/A | |
| BlinkVision: A Benchmark for Optical Flow, Scene Flow and Point Tracking Estimation using RGB Frames and Events | Yijin Li, Yichen Shen, Zhaoyang Huang, Shuo Chen, Weikang Bian, Xiaoyu Shi, Fu-Yun Wang, Keqiang Sun, Hujun Bao, Zhaopeng Cui, Guofeng Zhang, Hongsheng Li | N/A | |
| A Unified Anomaly Synthesis Strategy with Gradient Ascent for Industrial Anomaly Detection and Localization | Qiyu Chen, Huiyuan Luo, Chengkan Lv*, Zhengtao Zhang | N/A | |
| Deep Polarization Cues for Single-shot Shape and Subsurface Scattering Estimation | Chenhao Li*, Trung Thanh Ngo, Hajime Nagahara | N/A | |
| Rethinking Features-Fused-Pyramid-Neck for Object Detection | Hulin Li* | N/A | |
| Spatial-Temporal Multi-level Association for Video Object Segmentation | Deshui Miao, Xin Li, Zhenyu He*, Huchuan Lu, Ming-Hsuan Yang | N/A | |
| Sparse Refinement for Efficient High-Resolution Semantic Segmentation | Zhijian Liu, Zhuoyang Zhang, Samir Khaki, Shang Yang, Haotian Tang, Chenfeng Xu, Kurt Keutzer, Song Han* | N/A | |
| Safeguard Text-to-Image Diffusion Models with Human Feedback Inversion | Sanghyun Kim, Seohyeon Jung, Balhae Kim, Moonseok Choi, Jinwoo Shin, Juho Lee | N/A | |
| An Explainable Vision Question Answer Model via Diffusion Chain-of-Thought | Chunhao LU, Qiang Lu*, Jake Luo | N/A | |
| RaFE: Generative Radiance Fields Restoration | Zhongkai Wu, Ziyu Wan, Jing Zhang*, Jing Liao, Dong Xu | N/A | |
| UniProcessor: A Text-induced Unified Low-level Image Processor | Huiyu Duan*, Xiongkuo Min, Sijing Wu, Wei Shen, Guangtao Zhai | N/A | |
| Fast Sprite Decomposition from Animated Graphics | Tomoyuki Suzuki*, Kotaro Kikuchi, Kota Yamaguchi | N/A | |
| Learning Unified Reference Representation for Unsupervised Multi-class Anomaly Detection | Liren He, Zhengkai Jiang, Jinlong Peng, Wenbing Zhu, Liang Liu, Qiangang Du, Xiaobin Hu, Mingmin Chi, Yabiao Wang, Chengjie Wang* | N/A | |
| IRSAM: Advancing Segment Anything Model for Infrared Small Target Detection | Mingjin Zhang, Yuchun Wang, Jie Guo, Yunsong Li, Xinbo Gao, Jing Zhang | N/A | |
| PatchRefiner: Leveraging Synthetic Data for Real-Domain High-Resolution Monocular Metric Depth Estimation | Zhenyu Li*, Shariq Farooq Bhat, Peter Wonka | N/A | |
| A Geometric Distortion Immunized Deep Watermarking Framework with Robustness Generalizability | Linfeng Ma, Han Fang, Tianyi Wei, Zijin Yang, Zehua Ma, Weiming Zhang, Nenghai Yu | N/A | |
| Towards Robust Event-based Networks for Nighttime via Unpaired Day-to-Night Event Translation | Yuhwan Jeong, Hoonhee Cho, Kuk-Jin Yoon* | N/A | |
| CLAMP-ViT: Contrastive Data-Free Learning for Adaptive Post-Training Quantization of ViTs | Akshat Ramachandran, Souvik Kundu, Tushar Krishna* | N/A | |
| A Riemannian Approach for Spatiotemporal Analysis and Generation of 4D Tree-shaped Structures | Tahmina Khanam, Mohammed Bennamoun, Guan Wang, Guanjin Wang, Ferdous Sohel, Farid Boussaid, Anuj Srivastava, Hamid Laga* | N/A | |
| Dual-Path Adversarial Lifting for Domain Shift Correction in Online Test-time Adaptation | Yushun Tang, Shuoshuo Chen, Zhihe Lu, Xinchao Wang, Zhihai He* | N/A | |
| Data Overfitting for On-Device Super-Resolution with Dynamic Algorithm and Compiler Co-Design | Gen Li, zhihao shu, Jie Ji, Minghai Qin, Fatemeh Afghah, Wei Niu, Xiaolong Ma | N/A | |
| The Role of Masking for Efficient Supervised Knowledge Distillation of Vision Transformers | Seungwoo Son, Jegwang Ryu, Namhoon Lee, Jaeho Lee | N/A | |
| Training A Small Emotional Vision Language Model for Visual Art Comprehension | Jing Zhang, Liang Zheng, Meng Wang, Dan Guo | N/A | |
| UGG: Unified Generative Grasping | Jiaxin Lu, Hao Kang, Haoxiang Li, Bo Liu, Yiding Yang, Qixing Huang, Gang Hua* | N/A | |
| FrePolad: Frequency-Rectified Point Latent Diffusion for Point Cloud Generation | Chenliang Zhou*, Fangcheng Zhong, Param Hanji, Zhilin Guo, Kyle Thomas Fogarty, Alejandro Sztrajman, Hongyun Gao, A. Cengiz Oztireli | N/A | |
| Learning to Detect Multi-class Anomalies with Just One Normal Image Prompt | Bin-Bin Gao* | N/A | |
| GAMMA-FACE: GAussian Mixture Models Amend Diffusion Models for Bias Mitigation in Face Images | Basudha Pal, Arunkumar Kannan, Ram Prabhakar Kathirvel, Alice O'Toole, Rama Chellappa | N/A | |
| Reinforcement Learning Friendly Vision-Language Model for Minecraft | Haobin Jiang, Junpeng Yue, Hao Luo, Ziluo Ding, Zongqing Lu* | N/A | |
| Pseudo-RIS: Distinctive Pseudo-supervision Generation for Referring Image Segmentation | Seonghoon Yu, Paul Hongsuck Seo, Jeany Son | N/A | |
| Training-free Composite Scene Generation for Layout-to-Image Synthesis | Jiaqi Liu*, Tao Huang, Chang Xu | N/A | |
| Robustness Preserving Fine-tuning using Neuron Importance | Guangrui Li, Rahul Duggal*, Aaditya Singh, Kaustav Kundu, Bing Shuai, Jonathan Wu | N/A | |
| ProxyCLIP: Proxy Attention Improves CLIP for Open-Vocabulary Segmentation | Mengcheng Lan, Chaofeng Chen, Yiping Ke, Xinjiang Wang, Litong Feng, Wayne Zhang* | N/A | |
| PEA-Diffusion: Parameter-Efficient Adapter with Knowledge Distillation in non-English Text-to-Image Generation | jian ma, Chen Chen, Qingsong Xie, Haonan Lu | N/A | |
| Similarity of Neural Architectures using Adversarial Attack Transferability | Jaehui Hwang, Dongyoon Han, Byeongho Heo, Song Park, Sanghyuk Chun*, Jong-Seok Lee | N/A | |
| Dual-Rain: Video Rain Removal using Assertive and Gentle Teachers | Tingting Chen*, Beibei Lin, Yeying Jin, Wending Yan, WEI YE, Yuan Yuan, Robby T. Tan | N/A | |
| PMT: Progressive Mean Teacher via Exploring Temporal Consistency for Semi-Supervised Medical Image Segmentation | Ning Gao, Sanping Zhou*, Le Wang, Nanning Zheng | N/A | |
| OmniACT: A Dataset and Benchmark for Enabling Multimodal Generalist Autonomous Agents for Desktop and Web | Raghav Kapoor, Yash Parag Butala, Melisa A Russak, Jing Yu Koh, Kiran Kamble, Waseem AlShikh, Ruslan Salakhutdinov | N/A | |
| AutoEval-Video: An Automatic Benchmark for Assessing Large Vision Language Models in Open-Ended Video Question Answering | Xiuyuan Chen, Yuan Lin, Yuchen Zhang, Weiran Huang* | N/A | |
| Reflective Instruction Tuning: Mitigating Hallucinations in Large Vision-Language Models | Jinrui Zhang, Teng Wang, Haigang Zhang, Ping Lu, Feng Zheng* | N/A | |
| Unsupervised Variational Translator for Bridging Image Restoration and High-Level Vision Tasks | Jiawei Wu, Zhi Jin* | N/A | |
| Diffusion Model for Robust Multi-Sensor Fusion in 3D Object Detection and BEV Segmentation | Duy Tho Le, Hengcan Shi, Jianfei Cai, Hamid Rezatofighi | N/A | |
| MeshAvatar: Learning High-quality Triangular Human Avatars from Multi-view Videos | Yushuo Chen*, Zerong Zheng, Zhe Li, Chao Xu, Yebin Liu | N/A | |
| Fast Point Cloud Geometry Compression with Context-based Residual Coding and INR-based Refinement | Hao Xu, Xi Zhang, Xiaolin Wu | N/A | |
| Scene-Conditional 3D Object Stylization and Composition | Jinghao Zhou*, Tomas Jakab, Philip Torr, Christian Rupprecht | N/A | |
| GenView: Enhancing View Quality with Pretrained Generative Model for Self-Supervised Learning | Xiaojie Li, Yibo Yang, Xiangtai Li, Jianlong Wu, Yue Yu, Bernard Ghanem, Min Zhang | N/A | |
| Revisit Anything: Visual Place Recognition via Image Segment Retrieval | Kartik Garg, Sai Shubodh, Shishir N Y Kolathaya, Madhava Krishna, Sourav Garg* | N/A | |
| EcoMatcher: Efficient Clustering Oriented Matcher for Detector-free Image Matching | Peiqi Chen, Lei Yu, Yi Wan, Yongjun Zhang*, Jian Wang, Liheng Zhong, Jingdong Chen, Ming Yang | N/A | |
| DGD: Dynamic 3D Gaussians Distillation | Isaac Labe, Noam Issachar, Itai Lang, Sagie Benaim* | N/A | |
| Semantic Diversity-aware Prototype-based Learning for Unbiased Scene Graph Generation | Jaehyeong Jeon*, Kibum Kim, Kanghoon Yoon, Chanyoung Park | N/A | |
| DiffuMatting: Synthesizing Arbitrary Objects with Matting-level Annotation | Xiaobin Hu, Xu Peng, Donghao Luo, Xiaozhong Ji, Jinlong Peng, ZhengKai Jiang, Jiangning Zhang, Taisong Jin, Chengjie Wang, Rongrong Ji | N/A | |
| Self-Guided Generation of Minority Samples Using Diffusion Models | Soobin Um, Jong Chul Ye* | N/A | |
| DEVIAS: Learning Disentangled Video Representations of Action and Scene | Kyungho Bae, Youngrae Kim, Geo Ahn, Jinwoo Choi* | N/A | |
| AD3: Introducing a score for Anomaly Detection Dataset Difficulty assessment using VIADUCT dataset | Jan D Lehr*, Jan H Philipps, Alik Sargsyan, Martin Pape, Jörg Krüger | N/A | |
| RoomTex: Texturing Compositional Indoor Scenes via Iterative Inpainting | Qi WANG*, Ruijie Lu, Xudong XU, Jingbo Wang, Michael Yu Wang, Bo Dai, Gang Zeng, Dan Xu | N/A | |
| Class-Agnostic Object Counting with Text-to-Image Diffusion Model | Xiaofei Hui, Qian Wu, Hossein Rahmani, Jun Liu* | N/A | |
| Mask2Map: Vectorized HD Map Construction Using Bird's Eye View Segmentation Masks | Sehwan Choi*, Jun Won Choi, Jungho Kim, Hongjae Shin | N/A | |
| SUP-NeRF: A Streamlined Unification of Pose Estimation and NeRF for Monocular 3D Object Reconstruction | Yuliang Guo*, Abhinav Kumar, Cheng Zhao, Ruoyu Wang, Xinyu Huang, Liu Ren | N/A | |
| Forbes: Face Obfuscation Rendering via Backpropagation Refinement Scheme | Jintae Kim, Seungwon Yang, Seong-Gyun Jeong, Chang-Su Kim* | N/A | |
| Pyramid Diffusion for Fine 3D Large Scene Generation | Yuheng Liu, Xinke Li, Xueting Li, Lu Qi, Chongshou Li, Ming-Hsuan Yang | N/A | |
| ShoeModel: Learning to Wear on the User-specified Shoes via Diffusion Model | Wenyu Li*, Binghui Chen, Yifeng Geng, Xuansong Xie, Wangmeng Zuo | N/A | |
| A Watermark-Conditioned Diffusion Model for IP Protection | Rui Min, Sen Li, Hongyang Chen, Minhao Cheng | N/A | |
| Finding NeMo: Negative-mined Mosaic Augmentation for Referring Image Segmentation | Seongsu Ha, Chaeyun Kim, Donghwa Kim, Junho Lee, Sangho Lee, Joonseok Lee* | N/A | |
| SAFT: Towards Out-of-Distribution Generalization in Fine-Tuning | Bac Nguyen*, Stefan Uhlich, Fabien Cardinaux, Lukas Mauch, Marzieh Edraki, Aaron Courville | N/A | |
| FTBC: Forward Temporal Bias Correction for Optimizing ANN-SNN Conversion | Xiaofeng Wu, Velibor Bojkovic, Bin Gu, Kun Suo, Kai Zou | N/A | |
| Improving Vision and Language Concepts Understanding with Multimodal Counterfactual Samples | Chengen Lai, Shengli Song*, Sitong Yan, Guangneng Hu | N/A | |
| Centering the Value of Every Modality: Towards Efficient and Resilient Modality-agnostic Semantic Segmentation | Xu Zheng, Yuanhuiyi Lyu, jiazhou zhou, Lin Wang | N/A | |
| GTPT: Group-based Token Pruning Transformer for Efficient Human Pose Estimation | Haonan Wang, Jie Liu*, Jie Tang, Gangshan Wu, Bo Xu, Yanbing Chou, Yong Wang | N/A | |
| Lost in Translation: Modern Neural Networks Still Struggle With Small Realistic Image Transformations | Ofir Shifman*, Yair Weiss | N/A | |
| DIAL: Dense Image-text ALignment for Weakly Supervised Semantic Segmentation | Soojin Jang, JungMin Yun, JuneHyoung Kwon, Eunju Lee, YoungBin Kim* | N/A | |
| Rethinking Normalization Layers for Domain Generalizable Person Re-identification | Ren Nie, Jin Ding, Xue Zhou*, Xi Li | N/A | |
| Generalizing to Unseen Domains via Text-guided Augmentation | Daiqing Qi*, Handong Zhao, Aidong Zhang, Sheng Li | N/A | |
| VCP-CLIP: A visual context prompting model for zero-shot anomaly segmentation | Zhen Qu, Xian Tao*, Mukesh Prasad, Fei Shen, Zhengtao Zhang, Xinyi Gong, Guiguang Ding | N/A | |
| Lost in Translation: Latent Concept Misalignment in Text-to-Image Diffusion Models | Juntu Zhao, Junyu Deng, Yixin Ye, Chongxuan Li, Zhijie Deng, Dequan Wang | N/A | |
| Crowd-SAM: SAM as a smart annotator for object detection in crowded scenes | Zhi Cai, Yingjie Gao, Yaoyan Zheng, Nan Zhou, Di Huang* | N/A | |
| Zero-shot Text-guided Infinite Image Synthesis with LLM guidance | Soyeong Kwon, Taegyeong Lee, Taehwan Kim* | N/A | |
| Learning Dual-Level Deformable Implicit Representation for Real-World Scale Arbitrary Super-Resolution | Zhiheng Li, Muheng Li, Jixuan Fan, Lei Chen*, Yansong Tang, Jiwen Lu, Jie Zhou | N/A | |
| Boosting Gaze Object Prediction via Pixel-level Supervision from Vision Foundation Model | Yang Jin, Lei Zhang, Shi Yan, Bin Fan, Binglu Wang* | N/A | |
| Pro2SAM: Mask Prompt to SAM with Grid Points for Weakly Supervised Object Localization | Xi Yang, Songsong Duan*, Nannan Wang, Xinbo Gao | N/A | |
| Adaptive Multi-head Contrastive Learning | Lei Wang*, Piotr Koniusz, Tom Gedeon, Liang Zheng | N/A | |
| Rotated Orthographic Projection for Self-Supervised 3D Human Pose Estimation | YAO YAO, Yixuan Pan, Wenjun Shi, Dongchen Zhu, Lei Wang, Jiamao Li* | N/A | |
| Easing 3D Pattern Reasoning with Side-view Features for Semantic Scene Completion | Linxi Huan, Mingyue Dong, Linwei Yue, Shuhan Shen, Xianwei Zheng* | N/A | |
| DSMix: Distortion-Induced Saliency Map Based Pre-training for No-Reference Image Quality Assessment | Jinsong Shi, Pan Gao*, Xiaojiang Peng, Jie Qin | N/A | |
| MO-EMT-NAS: Multi-Objective Continuous Transfer of Architectural Knowledge Between Tasks from Different Datasets | PENG LIAO, Xilu Wang, Yaochu Jin, Wenli Du | N/A | |
| Text-to-Sticker: Style Tailoring Latent Diffusion Models for Human Expression | Animesh Sinha*, Bo Sun, Anmol Kalia, Arantxa Casanova, Elliot Blanchard, David Yan, Winnie Zhang, Tony Nelli, Jiahui Chen, Hardik Shah, Licheng Yu, Mitesh Kumar Singh, Ankit Ramchandani, Maziar Sanjabi, Sonal Gupta, Amy L Bearman, Dhruv Mahajan | N/A | |
| Adaptive Annealing for Robust Averaging | Sidhartha Chitturi*, Venu Madhav Govindu | N/A | |
| GRIDS: Grouped Multiple-Degradation Restoration with Image Degradation Similarity | Shuo Cao, Yihao Liu, Wenlong Zhang, Yu Qiao, Chao Dong* | N/A | |
| MaxMI: A Maximal Mutual Information Criterion for Manipulation Concept Discovery | Pei Zhou, Yanchao Yang* | N/A | |
| High-Quality Mesh Blendshape Generation from Face Videos via Neural Inverse Rendering | Xin Ming, Jiawei Li, Jingwang Ling, Libo Zhang, Feng Xu* | N/A | |
| Disentangling Masked Autoencoders for Unsupervised Domain Generalization | An Zhang*, Han Wang, Xiang Wang, Tat-Seng Chua | N/A | |
| Early Anticipation of Driving Maneuvers | Abdul Wasi Lone, Shankar Gangisetty*, Shyam Nandan Rai, C. V. Jawahar | N/A | |
| Bottom-Up Domain Prompt Tuning for Generalized Face Anti-Spoofing | Siqi Liu*, Qirui Wang, Pong C. Yuen | N/A | |
| SG-NeRF: Neural Surface Reconstruction with Scene Graph Optimization | Yiyang Chen, Siyan Dong, Xulong Wang, Lulu Cai, Youyi Zheng, Yanchao Yang | N/A | |
| On the Evaluation Consistency of Attribution-based Explanations | Jiarui Duan, Haoling Li, Haofei Zhang, Hao Jiang, Mengqi Xue, Li Sun, Mingli Song, Jie Song* | N/A | |
| Unified Embedding Alignment for Open-Vocabulary Video Instance Segmentation | Hao Fang, Peng Wu, Yawei Li, Xinxin Zhang, Xiankai Lu* | N/A | |
| InfoNorm: Mutual Information Shaping of Normals for Sparse-View Reconstruction | Xulong Wang, Siyan Dong, Youyi Zheng, Yanchao Yang | N/A | |
| DreamReward: Aligning Human Preference in Text-to-3D Generation | Junliang Ye, Fangfu Liu, Qixiu Li, Zhengyi Wang, Yikai Wang, Xinzhou Wang, Yueqi Duan, Jun Zhu | N/A | |
| Action2Sound: Ambient-Aware Generation of Action Sounds from Egocentric Videos | Changan Chen*, Puyuan Peng, Ami Baid, Zihui Xue, Wei-Ning Hsu, David Harwath, Kristen Grauman | N/A | |
| Frontier-enhanced Topological Memory with Improved Exploration Awareness for Embodied Visual Navigation | Xinru Cui, Qiming Liu, Zhe Liu, Hesheng Wang* | N/A | |
| MTMamba: Enhancing Multi-Task Dense Scene Understanding by Mamba-Based Decoders | Baijiong Lin*, Weisen Jiang, Pengguang Chen, Yu Zhang, Shu Liu, Yingcong Chen | N/A | |
| VITATECS: A Diagnostic Dataset for Temporal Concept Understanding of Video-Language Models | Shicheng Li, Lei Li, Yi Liu, Shuhuai Ren, Yuanxin Liu, Rundong Gao, Xu Sun*, Lu Hou | N/A | |
| Learning a Dynamic Privacy-preserving Camera Robust to Inversion Attacks | Jiacheng Cheng*, Xiang Dai, Jia Wan, Nick Antipa, Nuno Vasconcelos | N/A | |
| CadVLM: Bridging Language and Vision in the Generation of Parametric CAD Sketches | Sifan Wu, Amir Hosein Khasahmadi, Mor Katz, Pradeep Kumar Jayaraman, Yewen Pu, Karl D.D. Willis, Bang Liu | N/A | |
| Towards Image Ambient Lighting Normalization | Florin-Alexandru Vasluianu, Tim Seizinger, Zongwei WU, Rakesh Ranjan, Radu Timofte | N/A | |
| FedHide: Federated Learning by Hiding in the Neighbors | Hyunsin Park*, Sungrack Yun | N/A | |
| Toward INT4 Fixed-Point Training via Exploring Quantization Error for Gradients | Dohyung Kim, Junghyup Lee, Jeimin Jeon, JAEHYEON MOON, Bumsub Ham* | N/A | |
| SelEx: Self-Expertise in Fine-Grained Generalized Category Discovery | Sarah Rastegar*, Mohammadreza Salehi, Yuki M Asano, Hazel Doughty, Cees Snoek | N/A | |
| Self-Cooperation Knowledge Distillation for Novel Class Discovery | Yuzheng Wang, Zhaoyu Chen, Dingkang Yang, Yunquan Sun, Lizhe Qi | N/A | |
| EventBind: Learning a Unified Representation to Bind Them All for Event-based Open-world Understanding | jiazhou zhou*, Xu Zheng, Yuanhuiyi Lyu, Lin Wang | N/A | |
| GLAD: Towards Better Reconstruction with Global and Local Adaptive Diffusion Models for Unsupervised Anomaly Detection | Hang Yao, Ming Liu*, Zhicun Yin, Zifei Yan, Xiaopeng Hong, Wangmeng Zuo | N/A | |
| MedRAT: Unpaired Medical Report Generation via Auxiliary Tasks | Elad Hirsch*, Gefen Dawidowicz, Ayellet Tal | N/A | |
| Are Synthetic Data Useful for Egocentric Hand-Object Interaction Detection? | Rosario Leonardi*, Antonino Furnari, Francesco Ragusa, Giovanni Maria Farinella | N/A | |
| PoseEmbroider: Towards a 3D, Visual, Semantic-aware Human Pose Representation | Ginger Delmas*, Philippe Weinzaepfel, Francesc Moreno-Noguer, Gregory Rogez | N/A | |
| A Comparative Study of Image Restoration Networks for General Backbone Network Design | Xiangyu Chen, Zheyuan Li, Yuandong Pu, Yihao Liu, Jiantao Zhou, Yu Qiao, Chao Dong* | N/A | |
| Learned Image Enhancement via Color Naming | David Serrano-Lozano*, Luis Herranz, Michael S Brown, Javier Vazquez-Corral | N/A | |
| Synthesizing Time-varying BRDFs via Latent Space | Takuto Narumoto*, Hiroaki Santo, Fumio Okura | N/A | |
| HoloADMM: High-Quality Holographic Complex Field Recovery | Mazen Mel*, Paul Springer, Pietro Zanuttigh, Haitao Zhou, Alexander Gatto | N/A | |
| Fundamental Matrix Estimation Using Relative Depths | Yaqing Ding*, Václav Vávra, Snehal Bhayani, Qianliang Wu, Jian Yang, Zuzana Kukelova | N/A | |
| Gaussian Splatting on the Move: Blur and Rolling Shutter Compensation for Natural Camera Motion | Otto Seiskari*, Jerry Ylilammi, Valtteri Kaatrasalo, Pekka Rantalankila, Matias Turkulainen, Juho Kannala, Esa Rahtu, Arno Solin | N/A | |
| MTaDCS: Moving Trace and Feature Density-based Confidence Sample Selection under Label Noise | Qingzheng Huang, Xilin He, Xiaole Xian, Qinliang Lin, Weicheng Xie*, Siyang Song, Linlin Shen, Zitong Yu | N/A | |
| Towards Open-World Object-based Anomaly Detection via Self-Supervised Outlier Synthesis | Brian Kostadinov Shalon Isaac-Medina, Yona Falinie Abdul Gaus, Neelanjan Bhowmik, Toby P Breckon | N/A | |
| GroundUp: Rapid Sketch-Based 3D City Massing | Gizem Esra Unlu*, Mohamed Sayed, Yulia Gryaditskaya, Gabriel Brostow | N/A | |
| Guide-and-Rescale: Self-Guidance Mechanism for Effective Tuning-Free Real Image Editing | Vadim Titov, Madina Khalmatova, Alexandra Ivanova, Dmitry P Vetrov, Aibek Alanov | N/A | |
| DataDream: Few-shot Guided Dataset Generation | Jae Myung Kim*, Jessica Bader, Stephan Alaniz, Cordelia Schmid, Zeynep Akata | N/A | |
| LPViT: Low-Power Semi-structured Pruning for Vision Transformers | Kaixin Xu, Zhe Wang, Chunyun Chen, Xue Geng, Jie Lin, Xulei Yang, Min Wu, Xiaoli Li, Weisi Lin | N/A | |
| CipherDM: Secure Three-Party Inference for Diffusion Model Sampling | Xin Zhao, Xiaojun Chen*, Xudong Chen, He Li, Tingyu Fan, Zhendong Zhao | N/A | |
| Weighted Ensemble Models Are Strong Continual Learners | Imad Eddine Marouf*, Subhankar Roy, Enzo Tartaglione, Stéphane Lathuilière | N/A | |
| GGRt: Towards Generalizable 3D Gaussians without Pose Priors in Real-Time | Hao Li, Yuanyuan Gao, Dingwen Zhang*, Chenming Wu, Yalun Dai, Chen Zhao, Haocheng Feng, Errui Ding, Jingdong Wang, Junwei Han | N/A | |
| A Unified Image Compression Method for Human Perception and Multiple Vision Tasks | Sha Guo, Lin Sui, Chen-Lin Zhang, Zhuo Chen, Wenhan Yang, Lingyu Duan* | N/A | |
| UniVoxel: Fast Inverse Rendering by Unified Voxelization of Scene Representation | Shuang Wu, Songlin Tang, Guangming Lu, Jianzhuang Liu, Wenjie Pei* | N/A | |
| Audio-visual Generalized Zero-shot Learning the Easy Way | Shentong Mo*, Pedro Morgado | N/A | |
| PartImageNet++ Dataset: Scaling up Part-based Models for Robust Recognition | Xiao Li*, Yining Liu, Na Dong, Sitian Qin, Xiaolin Hu | N/A | |
| Learning Equilibrium Transformation for Gamut Expansion and Color Restoration | Jun Xiao*, Changjian Shui, Zhi-Song Liu, Qian Ye, Kin-Man Lam | N/A | |
| Dyn-Adapter: Towards Disentangled Representation for Efficient Visual Recognition | Yurong Zhang*, Honghao Chen, Zhang Xinyu, Xiangxiang Chu, Li Song | N/A | |
| Physics-informed Knowledge Transfer for Underwater Monocular Depth Estimation | Jinghe Yang*, Mingming Gong, Ye Pu | N/A | |
| Robust Nearest Neighbors for Source-Free Domain Adaptation under Class Distribution Shift | Antonio Tejero-de-Pablos*, Riku Togashi, Mayu Otani, Shin'ichi Satoh | N/A | |
| Chains of Diffusion Models | Yanheng Wei, Lianghua Huang, Zhi-Fan Wu, Wei Wang, Yu Liu, Mingda Jia, Shuailei Ma | N/A | |
| Time-Efficient and Identity-Consistent Virtual Try-On Using A Variant of Altered Diffusion Models | Phuong Hoang Dam, Jihoon Jeong, Anh T Tran, Daeyoung Kim | N/A | |
| Feature Diversification and Adaptation for Federated Domain Generalization | Seunghan Yang*, Seokeon Choi, Hyunsin Park, Sungha Choi, Simyung Chang, Sungrack Yun | N/A | |
| Grounding Image Matching in 3D with MASt3R | Vincent Leroy*, Yohann Cabon, Jerome Revaud | N/A | |
| TP2O: Creative Text Pair-to-Object Generation using Balance Swap-Sampling | Jun Li*, Zedong Zhang, Jian Yang | N/A | |
| RoDUS: Robust Decomposition of Static and Dynamic Elements in Urban Scenes | Thang-Anh-Quan Nguyen, Luis G Roldao Jimenez, Nathan Piasco, Moussab Bennehar, Dzmitry Tsishkou* | N/A | |
| RecurrentBEV: A Long-term Temporal Fusion Framework for Multi-view 3D Detection | Ming Chang, Xishan Zhang*, Rui Zhang, Zhipeng Zhao, Guanhua He, Shaoli Liu | N/A | |
| Efficient Bias Mitigation Without Privileged Information | Mateo Espinosa Zarlenga*, Swami Sankaranarayanan, Jerone T. A. Andrews, Zohreh Shams, Mateja Jamnik, Alice Xiang | N/A | |
| MC-PanDA: Mask Confidence for Panoptic Domain Adaptation | Ivan Martinović*, Josip Šarić, Siniša Šegvić | N/A | |
| Learning Neural Deformation Representation for 4D Dynamic Shape Generation | Gyojin Han, Jiwan Hur, Jaehyun Choi, Junmo Kim | N/A | |
| Dynamic Guidance Adversarial Distillation with Enhanced Teacher Knowledge | Hyejin Park, Dongbo Min* | N/A | |
| Decomposition Betters Tracking Everything Everywhere | Rui Li, Dong Liu* | N/A | |
| Straightforward Layer-wise Pruning for More Efficient Visual Adaptation | Ruizi Han, Jinglei Tang | N/A | |
| Synchronization is All You Need: Exocentric-to-Egocentric Transfer for Temporal Action Segmentation with Unlabeled Synchronized Video Pairs | Camillo Quattrocchi*, Antonino Furnari, Daniele Di Mauro, Mario Valerio Giuffrida, Giovanni Maria Farinella | N/A | |
| LAPT: Label-driven Automated Prompt Tuning for OOD Detection with Vision-Language Models | Yabin Zhang, Wenjie Zhu, Chenhang He, Lei Zhang | N/A | |
| Domain Shifting: A Generalized Solution for Heterogeneous Cross-Modality Person Re-Identification | Yan Jiang, Xu Cheng*, Hao Yu, Xingyu Liu, Haoyu Chen, Guoying Zhao | N/A | |
| Self-Supervised Video Desmoking for Laparoscopic Surgery | Renlong Wu, Zhilu Zhang, Shuohao Zhang, Longfei Gou, Haobin Chen, Lei Zhang, Hao Chen, Wangmeng Zuo | N/A | |
| Removing Rows and Columns of Tokens in Vision Transformer enables Faster Dense Prediction without Retraining | Diwei Su, Cheng Fei, Jianxu Luo* | N/A | |
| Continuity Preserving Online CenterLine Graph Learning | Yunhui Han, Kun Yu, Zhiwei Li* | N/A | |
| Decomposition of Neural Discrete Representations for Large-Scale 3D Mapping | Minseong Park, Suhan Woo, Euntai Kim* | N/A | |
| MirrorGaussian: Reflecting 3D Gaussians for Reconstructing Mirror Reflections | Jiayue Liu, Xiao Tang, Freeman Cheng, Zihao Yang, Zhihao Li, Jianzhuang Liu, Yi Huang, Jiaqi Lin, Shiyong Liu, Xiaofei Wu, Songcen Xu, Chun Yuan | N/A | |
| Leveraging Representations from Intermediate Encoder-blocks for Synthetic Image Detection | Christos Koutlis*, Symeon Papadopoulos | N/A | |
| Exploring Vulnerabilities in Spiking Neural Networks: Direct Adversarial Attacks on Raw Event Data | Yanmeng Yao, Xiaohan Zhao, Bin Gu* | N/A | |
| HSR: Holistic 3D Human-Scene Reconstruction from Monocular Videos | Lixin Xue*, Chen Guo, Chengwei Zheng, Fangjinhua Wang, Tianjian Jiang, Hsuan-I Ho, Manuel Kaufmann, Jie Song, Otmar Hilliges | N/A | |
| Online Video Quality Enhancement with Spatial-Temporal Look-up Tables | Zefan Qu, Xinyang Jiang, Yifan Yang, Dongsheng Li, Cairong Zhao | N/A | |
| PARIS3D: Reasoning-based 3D Part Segmentation Using Large Multimodal Model | Amrin Kareem, Jean Lahoud, Hisham Cholakkal | N/A | |
| Self-Rectifying Diffusion Sampling with Perturbed-Attention Guidance | Donghoon Ahn, Hyoungwon Cho, Jaewon Min, Jungwoo Kim, Wooseok Jang, SeonHwa Kim, Hyun Hee Park, Kyong Hwan Jin, Seungryong Kim | N/A | |
| Localization and Expansion: A Decoupled Framework for Point Cloud Few-shot Semantic Segmentation | Zhaoyang Li*, Yuan Wang, Wangkai Li, Rui Sun, Tianzhu Zhang | N/A | |
| Think before Placement: Common Sense Enhanced Transformer for Object Placement | Yaxuan Qin, Jiayu Xu, Ruiping Wang*, Xilin Chen | N/A | |
| Oulu Remote-photoplethysmography Physical Domain Attacks Database (ORPDAD) | Marko Savic, Guoying Zhao* | N/A | |
| Leveraging Imperfect Restoration for Data Availability Attack | Yi Huang, Jeremy Styborski, Mingzhi Lyu, Fan Wang, Wai-Kin Adams Kong* | N/A | |
| 3D Weakly Supervised Semantic Segmentation with 2D Vision-Language Guidance | Xiaoxu Xu, Yitian Yuan, Jinlong Li, Qiudan Zhang, Zequn Jie, Lin Ma, Hao Tang, Nicu Sebe, Xu Wang* | N/A | |
| Open-set Domain Adaptation via Joint Error based Multi-class Positive and Unlabeled Learning | Dexuan Zhang*, Thomas Westfechtel, Tatsuya Harada | N/A | |
| DoubleTake: Geometry Guided Depth Estimation | Mohamed Sayed*, Filippo Aleotti, Jamie Watson, Zawar Qureshi, Guillermo Garcia-Hernando, Gabriel Brostow, Sara Vicente, Michael Firman | N/A | |
| Empowering Embodied Visual Tracking with Visual Foundation Models and Offline RL | Fangwei Zhong*, Kui Wu, Hai Ci, Chu-ran Wang, Hao Chen | N/A | |
| Street Gaussians: Modeling Dynamic Urban Scenes with Gaussian Splatting | Yunzhi Yan, Haotong Lin, Chenxu Zhou, Weijie Wang, Haiyang Sun, Kun Zhan, Xianpeng Lang, Xiaowei Zhou, Sida Peng | N/A | |
| Images are Achilles' Heel of Alignment: Exploiting Visual Vulnerabilities for Jailbreaking Multimodal Large Language Models | Yifan Li*, Hangyu Guo, Kun Zhou, Wayne Xin Zhao, Ji-Rong Wen | N/A | |
| Edge-Guided Fusion and Motion Augmentation for Event-Image Stereo | Fengan Zhao, Qianang Zhou, Junlin Xiong | N/A | |
| MetaWeather: Few-Shot Weather-Degraded Image Restoration | Youngrae Kim, Younggeol Cho, Thanh-Tung Nguyen, Seunghoon Hong, Dongman Lee | N/A | |
| CPT-VR: Improving Surface Rendering via Closest Point Transform with View-Reflection Appearance | Zhipeng Hu, Yongqiang Zhang, Chen Liu, Lincheng Li, Sida Peng, Xiaowei Zhou, Changjie Fan, Xin Yu | N/A | |
| Close, But Not There: Boosting Geographic Distance Sensitivity in Visual Place Recognition | Sergio Izquierdo, Javier Civera | N/A | |
| HiFi-123: Towards High-fidelity One Image to 3D Content Generation | Wangbo Yu*, Li Yuan, Yan-Pei Cao, Xiangjun Gao, Xiaoyu Li, Wenbo Hu, Long Quan, Ying Shan, Yonghong Tian | N/A | |
| Revisiting Adaptive Cellular Recognition Under Domain Shifts: A Contextual Correspondence View | Jianan Fan, Dongnan Liu, Canran Li, Hang Chang, Heng Huang, Filip Braet, Mei Chen, Weidong Cai | N/A | |
| Good Teachers Explain: Explanation-Enhanced Knowledge Distillation | Amin Parchami-Araghi*, Moritz Böhle, Sukrut Rao, Bernt Schiele | N/A | |
| Stepping Stones: A Progressive Training Strategy for Audio-Visual Semantic Segmentation | Juncheng Ma, Peiwen Sun, Yaoting Wang, Di Hu* | N/A | |
| FRDiff: Feature Reuse for Universal Training-free Acceleration of Diffusion Models | Junhyuk So, Jungwon Lee, Eunhyeok Park* | N/A | |
| Möbius Transform for Mitigating Perspective Distortions in Representation Learning | Prakash Chandra Chhipa*, Meenakshi Subhash Chippa, Kanjar De, Rajkumar Saini, Marcus Liwicki, Mubarak Shah | N/A | |
| TAG: Text Prompt Augmentation for Zero-Shot Out-of-Distribution Detection | Xixi Liu*, Christopher Zach | N/A | |
| CVT-Occ: Cost Volume Temporal Fusion for 3D Occupancy Prediction | Zhangchen Ye, Tao Jiang, Chenfeng Xu, Yiming Li, Hang Zhao* | N/A | |
| SPVLoc: Semantic Panoramic Viewport Matching for 6D Camera Localization in Unseen Environments | Niklas Gard*, Anna Hilsmann, Peter Eisert | N/A | |
| Continual Learning and Unknown Object Discovery in 3D Scenes via Self-Distillation | Mohamed El Amine Boudjoghra*, Jean Lahoud, Salman Khan, Hisham Cholakkal, Rao M Anwer, Fahad Shahbaz Khan | N/A | |
| DiffCD: A Symmetric Differentiable Chamfer Distance for Neural Implicit Surface Fitting | Linus Härenstam-Nielsen, Lu Sang, Abhishek Saroha, Nikita Araslanov, Daniel Cremers* | N/A | |
| Lost and Found: Overcoming Detector Failures in Online Multi-Object Tracking | Lorenzo Vaquero*, Yihong Xu, Xavier Alameda-Pineda, Victor M. Brea, Manuel Mucientes | N/A | |
| Local Occupancy-Enhanced Object Grasping with Multiple Triplanar Projection | Kangqi Ma*, Hao Dong, Yadong Mu | N/A | |
| Region-Native Visual Tokenization | Mengyu Wang*, Yuyao Huang, Henghui Ding, Xinlong Wang, Tiejun Huang, Yao Zhao, Yunchao Wei, Shuicheng Yan | N/A | |
| SparseCraft: Few-Shot Neural Reconstruction through Stereopsis Guided Geometric Linearization | Mae Younes*, Amine Ouasfi, Adnane Boukhayma | N/A | |
| Sketch2Vox: Learning 3D Reconstruction from a Single Monocular Sketch Image | Fei Wang* | N/A | |
| DGE: Direct Gaussian 3D Editing by Consistent Multi-view Editing | Minghao Chen*, Iro Laina, Andrea Vedaldi | N/A | |
| The Lottery Ticket Hypothesis in Denoising: Towards Semantic-Driven Initialization | Jiafeng Mao*, Xueting Wang, Kiyoharu Aizawa | N/A | |
| Diffusion for Out-of-Distribution Detection on Road Scenes and Beyond | Silvio Galesso, Philipp Schröppel, Hssan Driss, Thomas Brox | N/A | |
| Rethinking Directional Parameterization in Neural Implicit Surface Reconstruction | Zijie Jiang, Tianhan Xu, Hiroharu Kato | N/A | |
| A Comprehensive Study of Multimodal Large Language Models for Image Quality Assessment | Tianhe Wu, Kede Ma, Jie Liang, Yujiu Yang, Lei Zhang | N/A | |
| Semi-Supervised Teacher-Reference-Student Architecture for Action Quality Assessment | Wulian Yun, Mengshi Qi, Fei Peng, Huadong Ma* | N/A | |
| Efficient Neural Video Representation with Temporally Coherent Modulation | Seungjun Shin, Suji Kim, Dokwan Oh | N/A | |
| Ref-AVS: Refer and Segment Objects in Audio-Visual Scenes | Yaoting Wang, Peiwen Sun, Dongzhan Zhou, Guangyao Li, Honggang Zhang, Di Hu* | N/A | |
| DreamScene: 3D Gaussian-based Text-to-3D Scene Generation via Formation Pattern Sampling | Haoran Li, Haolin Shi, Wenli Zhang, Wenjun Wu, Yong Liao, Lin Wang, Lik-Hang Lee, Peng Yuan Zhou | N/A | |
| Multi-modal Crowd Counting via a Broker Modality | Haoliang Meng, Xiaopeng Hong*, Chenhao Wang, Miao Shang, Wangmeng Zuo | N/A | |
| FastPCI: Motion-Structure Guided Fast Point Cloud Frame Interpolation | Tianyu Zhang, Guocheng Qian, Jin Xie*, Jian Yang | N/A | |
| Made to Order: Discovering monotonic temporal changes via self-supervised video ordering | Charig Yang*, Weidi Xie, Andrew Zisserman | N/A | |
| PARE-Net: Position-Aware Rotation-Equivariant Networks for Robust Point Cloud Registration | Runzhao Yao, Shaoyi Du*, Wenting Cui, Canhui Tang, Chengwu Yang | N/A | |
| Open-Vocabulary RGB-Thermal Semantic Segmentation | GuoQiang Zhao, JunJie Huang, Xiaoyun Yan*, Zhaojing Wang, Junwei Tang, Yangjun Ou, Xinrong Hu, Tao Peng | N/A | |
| MeshVPR: Citywide Visual Place Recognition Using 3D Meshes | Gabriele Berton*, Lorenz Junglas, Riccardo Zaccone, Thomas Pollok, Barbara Caputo, Carlo Masone | N/A | |
| Can Textual Semantics Mitigate Sounding Object Segmentation Preference? | Yaoting Wang, Peiwen Sun, Yuanchao Li, Honggang Zhang, Di Hu* | N/A | |
| Concise Plane Arrangements for Low-Poly Surface and Volume Modelling | Raphael Sulzer, Florent Lafarge* | N/A | |
| KeypointDETR: An End-to-End 3D Keypoint Detector | Hairong Jin, Yuefan Shen, Jianwen Lou, Kun Zhou, Youyi Zheng* | N/A | |
| ViPer: Visual Personalization of Generative Models via Individual Preference Learning | Sogand Salehi*, Mahdi Shafiei, Roman Bachmann, Teresa Yeo, Amir Zamir | N/A | |
| MLPHand: Real Time Multi-View 3D Hand Reconstruction via MLP Modeling | Jian Yang, Jiakun Li, Guoming Li, Huaiyu Wu, Zhen Shen, Zhaoxin Fan* | N/A | |
| uCAP: An Unsupervised Prompting Method for Vision-Language Models | A. Tuan Nguyen*, Kai Sheng Tai, Bor-Chun Chen, Satya Narayan Shukla, Hanchao Yu, Philip Torr, Tai-Peng Tian, Ser-Nam Lim | N/A | |
| LHRS-Bot: Empowering Remote Sensing with VGI-Enhanced Large Multimodal Language Model | Dilxat Muhtar, Zhenshi Li, Feng Gu, Xueliang Zhang*, Pengfeng Xiao | N/A | |
| How Far Can a 1-Pixel Camera Go? Solving Vision Tasks using Photoreceptors and Computationally Designed Visual Morphology | Andrei Atanov*, Rishubh Singh, Jiawei Fu, Isabella Yu, Andrew Spielberg, Amir Zamir | N/A | |
| MONTAGE: Monitoring Training for Attribution of Generative Diffusion Models | Jonathan Brokman*, Omer Hofman, Roman Vainshtein, Amit Giloni, Toshiya Shimizu, Inderjeet Singh, Oren Rachmil, Alon Zolfi, Asaf Shabtai, Yuki Unno, Hisashi Kojima | N/A | |
| Affective Visual Dialog: A Large-Scale Benchmark for Emotional Reasoning Based on Visually Grounded Conversations | Kilichbek Haydarov*, Xiaoqian Shen, Avinash Madasu, Mahmoud Salem, Li-Jia Li, Gamaleldin F Elsayed, Mohamed Elhoseiny | N/A | |
| Watching it in Dark: A Target-aware Representation Learning Framework for High-Level Vision Tasks in Low Illumination | Yunan Li, Yihao Zhang, Shoude Li, Long Tian, Dou Quan, Chaoneng Li, Qiguang Miao | N/A | |
| Self-supervised visual learning from interactions with objects | Arthur Aubret*, Céline Teulière, Jochen Triesch | N/A | |
| OP-Align: Object-level and Part-level Alignment for Self-supervised Category-level Articulated Object Pose Estimation | Yuchen Che*, Ryo Furukawa, Asako Kanezaki | N/A | |
| BAFFLE: A Baseline of Backpropagation-Free Federated Learning | Haozhe Feng, Tianyu Pang, Chao Du, Wei Chen*, Shuicheng Yan, Min Lin | N/A | |
| Sequential Representation Learning via Static-Dynamic Conditional Disentanglement | Mathieu Cyrille Simon*, Pascal Frossard, Christophe De Vleeschouwer | N/A | |
| OmniNOCS: A unified NOCS dataset and model for 3D lifting of 2D objects | Akshay Krishnan, Abhijit Kundu, Kevis-Kokitsi Maninis, James Hays, Matthew Brown | N/A | |
| 3R-INN: How to be climate friendly while consuming/delivering videos? | Zoubida Ameur*, Claire-Helene Demarty, Olivier Le Meur, Daniel Menard | N/A | |
| Rethinking Deep Unrolled Model for Accelerated MRI Reconstruction | Bingyu Xin*, Meng Ye, Leon Axel, Dimitris N. Metaxas | N/A | |
| Towards Robust Full Low-bit Quantization of Super Resolution Networks | Denis S. Makhov*, Irina Zhelavskaya, Ruslan Ostapets, Dehua Song, Kirill Solodskikh | N/A | |
| Omni6DPose: A Benchmark and Model for Universal 6D Object Pose Estimation and Tracking | Jiyao Zhang, Weiyao Huang, Bo Peng, Mingdong Wu, Fei Hu, Zijian Chen, Bo Zhao, Hao Dong* | N/A | |
| Diverse Text-to-3D Synthesis with Augmented Text Embedding | Uy Dieu Tran, Minh N. Hoang Luu, Phong Ha Nguyen, Khoi Nguyen, Binh-Son Hua* | N/A | |
| Style-Extracting Diffusion Models for Semi-Supervised Histopathology Segmentation | Mathias Öttl*, Frauke Wilm, Jana Steenpass, Jingna Qiu, Matthias Rübner, Arndt Hartmann, Matthias W. Beckmann, Peter Fasching, Andreas K Maier, Ramona Erber, Bernhard Kainz, Katharina Breininger | N/A | |
| LLMCO4MR: LLMs-aided Neural Combinatorial Optimization for Ancient Manuscript Restoration from Fragments with Case Studies on Dunhuang | Yuqing Zhang, Hangqi Li, Shengyu Zhang, Runzhong Wang, Baoyi He, Huaiyong Dou, Junchi Yan, Yongquan Zhang, Fei Wu | N/A | |
| Model Breadcrumbs: Scaling Multi-Task Model Merging with Sparse Masks | MohammadReza Davari*, Eugene Belilovsky | N/A | |
| AdversariaLeak: External Information Leakage Attack Using Adversarial Samples on Face Recognition Systems | Roye Katzav, Amit Giloni, Edita Grolman, Hiroo Saito, Tomoyuki Shibata, Tsukasa Omino, Misaki Komatsu, Yoshikazu Hanatani, Yuval Elovici, Asaf Shabtai | N/A | |
| iHuman: Instant Animatable Digital Humans From Monocular Videos | Pramish Paudel*, Anubhav Khanal, Danda Pani Paudel, Jyoti Tandukar, Ajad Chhatkuli | N/A | |
| SphereHead: Stable 3D Full-head Synthesis with Spherical Tri-plane Representation | Heyuan Li, Ce Chen, Tianhao Shi, Yuda Qiu, Sizhe An, Guanying Chen, Xiaoguang Han | N/A | |
| Beyond Pixels: Semi-Supervised Semantic Segmentation with a Multi-scale Patch-based Multi-Label Classifier | Prantik Howlader*, Srijan Das, Hieu Le, Dimitris Samaras | N/A | |
| Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering | Zeyu Liu, Weicong Liang, Zhanhao Liang, Chong Luo, Ji Li, Gao Huang, Yuhui Yuan* | N/A | |
| Solving the inverse problem of microscopy deconvolution with a residual Beylkin-Coifman-Rokhlin neural network | Rui Li, Mikhail Kudryashev, Artur Yakimovich* | N/A | |
| Face Reconstruction Transfer Attack as Out-of-Distribution Generalization | Yoon Gyo Jung, Jaewoo Park, Xingbo Dong, Hojin Park, Andrew Beng Jin Teoh, Octavia Camps | N/A | |
| FreeZe: Training-free zero-shot 6D pose estimation with geometric and vision foundation models | Andrea Caraffa*, Davide Boscaini, Amir Hamza, Fabio Poiesi | N/A | |
| Deep Diffusion Image Prior for Efficient OOD Adaptation in 3D Inverse Problems | Hyungjin Chung, Jong Chul Ye* | N/A | |
| Weighting Pseudo-Labels via High-Activation Feature Index Similarity and Object Detection for Semi-Supervised Segmentation | Prantik Howlader*, Hieu Le, Dimitris Samaras | N/A | |
| PartGLEE: A Foundation Model for Recognizing and Parsing Any Objects | Junyi Li, Junfeng Wu, Weizhi Zhao, Song Bai, Xiang Bai* | N/A | |
| WTS: A Pedestrian-Centric Traffic Video Dataset for Fine-grained Spatial-Temporal Understanding | Quan Kong*, Yuki Kawana, Rajat Saini, Ashutosh Kumar, Jingjing Pan, Ta Gu, Yohei Ozao, Balazs Opra, Yoichi Sato, Norimasa Kobori | N/A | |
| Spiking Wavelet Transformer | Yuetong Fang, Ziqing Wang, Lingfeng Zhang, Jiahang Cao, Honglei Chen, Renjing Xu* | N/A | |
| WAVE: Warping DDIM Inversion Features for Zero-shot Text-to-Video Editing | Yutang Feng, Sicheng Gao, Yuxiang Bao, Xiaodi Wang, Shumin Han, Juan Zhang*, Baochang Zhang, Angela Yao | N/A | |
| PDT Uav Target Detection Dataset for Pests and Diseases Tree | Mingle Zhou, Rui Xing, Delong Han, Zhiyong Qi, Gang Li* | N/A | |
| Hypernetworks for Generalizable BRDF Representation | Fazilet Gokbudak*, Alejandro Sztrajman, Chenliang Zhou, Fangcheng Zhong, Rafal Mantiuk, A. Cengiz Oztireli | N/A | |
| Photon Inhibition for Energy-Efficient Single-Photon Imaging | Lucas J Koerner*, Shantanu Gupta, Atul N Ingle, Mohit Gupta | N/A | |
| COD: Learning Conditional Invariant Representation for Domain Adaptation Regression | Hao-Ran Yang, Chuan-Xian Ren*, You-Wei Luo | N/A | |
| RANRAC: Robust Neural Scene Representations via Random Ray Consensus | Benno Buschmann*, Andreea Dogaru, Elmar Eisemann, Michael Weinmann, Bernhard Egger | N/A | |
| LayerDiff: Exploring Text-guided Multi-layered Composable Image Synthesis via Layer-Collaborative Diffusion Model | Runhui Huang, Kaixin Cai, Jianhua Han, Xiaodan Liang*, Renjing Pei, Guansong Lu, Songcen Xu, Wei Zhang, Hang Xu | N/A | |
| Characterizing Model Robustness via Natural Input Gradients | Adrian Rodriguez-Munoz*, Tongzhou Wang, Antonio Torralba | N/A | |
| UpFusion: Novel View Diffusion from Unposed Sparse View Observations | Bharath Raj Nagoor Kani*, Hsin-Ying Lee, Sergey Tulyakov, Shubham Tulsiani | N/A | |
| Four Ways to Improve Verbo-visual Fusion for Dense 3D Visual Grounding | Ozan Unal*, Christos Sakaridis, Suman Saha, Luc Van Gool | N/A | |
| SIMBA: Split Inference - Mechanisms, Benchmarks and Attacks | Abhishek Singh*, Vivek Sharma, Rohan Sukumaran, John J Mose, Jeffrey K Chiu, Justin Yu, Ramesh Raskar | N/A | |
| Tuning-Free Image Customization with Image and Text Guidance | Pengzhi Li, Qiang Nie, Ying Chen, Xi Jiang, Kai Wu, Yuhuan Lin, Yong Liu, Jinlong Peng, Chengjie Wang, Feng Zheng* | N/A | |
| FairDomain: Achieving Fairness in Cross-Domain Medical Image Segmentation and Classification | Yu Tian*, Congcong Wen, Min Shi, Muhammad Muneeb Afzal, Hao Huang, Muhammad Osama Khan, Yan Luo, Yi Fang, Mengyu Wang | N/A | |
| Emerging Property of Masked Token for Effective Pre-training | Hyesong Choi, Hunsang Lee, Seyoung Joung, Hyejin Park, Jiyeong Kim, Dongbo Min* | N/A | |
| DQ-DETR: DETR with Dynamic Query for Tiny Object Detection | Yi-Xin Huang*, Hou-I Liu, Hong-Han Shuai, Wen-Huang Cheng | N/A | |
| Track2Act: Predicting Point Tracks from Internet Videos enables Generalizable Robot Manipulation | Homanga Bharadhwaj*, Roozbeh Mottaghi, Abhinav Gupta, Shubham Tulsiani | N/A | |
| SWAG: Splatting in the Wild images with Appearance-conditioned Gaussians | Hiba Dahmani*, Moussab Bennehar, Nathan Piasco, Luis G Roldao Jimenez, Dzmitry Tsishkou | N/A | |
| Gaussian in the wild: 3D Gaussian Splatting for Unconstrained Image Collections | Dongbin Zhang, Chuming Wang, Weitao Wang, Peihao Li, Minghan Qin, Haoqian Wang | N/A | |
| Few-shot Defect Image Generation based on Consistency Modeling | Qingfeng Shi, Jing Wei, Fei Shen*, Zhengtao Zhang | N/A | |
| Taming CLIP for Fine-grained and Structured Visual Understanding of Museum Exhibits | Ada-Astrid Balauca*, Danda Pani Paudel, Kristina Toutanova, Luc Van Gool | N/A | |
| CLIP-DPO: Vision-Language Models as a Source of Preference for Fixing Hallucinations in LVLMs | Yassine Ouali, Adrian Bulat, Brais Martinez, Georgios Tzimiropoulos | N/A | |
| Masked Motion Prediction with Semantic Contrast for Point Cloud Sequence Learning | Yuehui Han*, Can Xu, Rui Xu, Jianjun Qian, Jin Xie | N/A | |
| Prompt-Based Test-Time Real Image Dehazing: A Novel Pipeline | Zixuan Chen, Zewei He*, Ziqian Lu, Xuecheng Sun, Zheming Lu | N/A | |
| Video Editing via Factorized Diffusion Distillation | Uriel Singer, Amit Zohar, Yuval Kirstain, Shelly Sheynin, Adam Polyak, Devi Parikh, Yaniv Taigman | N/A | |
| Trackastra: Transformer-based cell tracking for live-cell microscopy | Benjamin Gallusser, Martin Weigert* | N/A | |
| CogView3: Finer and Faster Text-to-Image Generation via Relay Diffusion | Wendi Zheng, Jiayan Teng, Zhuoyi Yang, Weihan Wang, Jidong Chen, Xiaotao Gu, Yuxiao Dong, Ming Ding, Jie Tang | N/A | |
| SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers | Nanye Ma, Mark Goldstein, Michael Albergo, Nicholas M Boffi, Eric Vanden-Eijnden, Saining Xie* | N/A | |
| Learn to Memorize and to Forget: A Continual Learning Perspective of Dynamic SLAM | Baicheng Li, Zike Yan, Dong Wu, Hanqing Jiang, Hongbin Zha* | N/A | |
| Forecasting Future Videos from Novel Views via Disentangled 3D Scene Representation | Sudhir Yarram*, Junsong Yuan | N/A | |
| GMM-IKRS: Gaussian Mixture Models for Interpretable Keypoint Refinement and Scoring | Emanuele Santellani*, Martin Zach, Christian Sormann, Mattia Rossi, Andreas Kuhn, Friedrich Fraundorfer | N/A | |
| Get Your Embedding Space in Order: Domain-Adaptive Regression for Forest Monitoring | Sizhuo Li, Dimitri Gominski*, Martin Brandt, Xiaoye Tong, Philippe Ciais | N/A | |
| ObjectDrop: Bootstrapping Counterfactuals for Photorealistic Object Removal and Insertion | Daniel Winter, Matan Cohen, Shlomi Fruchter, Yael Pritch, Alex Rav-Acha, Yedid Hoshen | N/A | |
| CoDA: Instructive Chain-of-Domain Adaptation with Severity-Aware Visual Prompt Tuning | ZiYang Gong, FuHao Li, Yupeng Deng, Deblina Bhattacharjee, Xianzheng Ma, Xiangwei Zhu, Zhenming Ji* | N/A | |
| Curved Diffusion: A Generative Model With Optical Geometry Control | Andrey Voynov*, Amir Hertz, Moab Arar, Shlomi Fruchter, Daniel Cohen-Or | N/A | |
| Mini-Splatting: Representing Scenes with a Constrained Number of Gaussians | Guangchi Fang, Bing Wang* | N/A | |
| MeshSegmenter: Zero-Shot Mesh Segmentation via Texture Synthesis | Ziming Zhong*, Yanyu Xu, Jing Li, Jiale Xu, Zhengxin Li, Chaohui Yu, Shenghua Gao | N/A | |
| OTSeg: Multi-prompt Sinkhorn Attention for Zero-Shot Semantic Segmentation | Kwanyoung Kim, Yujin Oh, Jong Chul Ye* | N/A | |
| Skeleton Recall Loss for Connectivity Conserving and Resource Efficient Segmentation of Thin Tubular Structures | Yannick Kirchhoff, Maximilian R Rokuss, Saikat Roy*, Balint Kovacs, Constantin Ulrich, Tassilo Wald, Maximilian Zenk, Philipp Vollmuth, Jens Kleesiek, Fabian Isensee, Klaus H. Maier-Hein | N/A | |
| Conceptual Codebook Learning for Vision-Language Models | Yi Zhang, Ke Yu, Siqi Wu, Zhihai He | N/A | |
| LingoQA: Video Question Answering for Autonomous Driving | Ana-Maria Marcu*, Long Chen, Jan Hünermann, Alice Karnsund, Benoit Hanotte, Prajwal Chidananda, Saurabh Nair, Vijay Badrinarayanan, Alex Kendall, Jamie Shotton, Elahe Arani, Oleg Sinavski | N/A | |
| AnimateMe: 4D Facial Expressions via Diffusion Models | Dimitrios Gerogiannis*, Foivos Paraperas Papantoniou, Rolandos Alexandros Potamias, Alexandros Lattas, Stylianos Moschoglou, Stylianos Ploumpis, Stefanos Zafeiriou | N/A | |
| HaloQuest: A Visual Hallucination Dataset for Advancing Multimodal Reasoning | Zhecan Wang, Garrett Bingham*, Adams Wei Yu, Quoc V. Le, Thang Luong, Golnaz Ghiasi | N/A | |
| LATTE3D: Large-scale Amortized Text-To-Enhanced3D Synthesis | Kevin Xie*, Tianshi Cao, Jonathan P Lorraine, Jun Gao, James R Lucas, Antonio Torralba, Sanja Fidler, Xiaohui Zeng | N/A | |
| PreSight: Enhancing Autonomous Vehicle Perception with City-Scale NeRF Priors | Tianyuan Yuan, Yucheng Mao, Jiawei Yang, Yicheng Liu, Yue Wang, Hang Zhao | N/A | |
| Unveiling and Mitigating Memorization in Text-to-image Diffusion Models through Cross Attention | Jie Ren*, Yaxin Li, Shenglai Zeng, Han Xu, Lingjuan Lyu, Yue Xing, Jiliang Tang | N/A | |
| iNeMo: Incremental Neural Mesh Models for Robust Class-Incremental Learning | Tom Fischer*, Yaoyao Liu, Artur Jesslen, Noor Ahmed, Prakhar Kaushik, Angtian Wang, Alan Yuille, Adam Kortylewski, Eddy Ilg | N/A | |
| Context Diffusion: In-Context Aware Image Generation | Ivona Najdenkoska*, Animesh Sinha, Abhimanyu Dubey, Dhruv Mahajan, Vignesh Ramanathan, Filip Radenovic | N/A | |
| Pose Guided Fine-Grained Sign Language Video Generation | Tongkai Shi, Lianyu Hu, Fanhua Shang, Jichao Feng, Liu Peidong, Wei Feng* | N/A | |
| RAP: Retrieval-Augmented Planner for Adaptive Procedure Planning in Instructional Videos | Ali Zare*, Yulei Niu, Hammad Ayyubi, Shih-Fu Chang | N/A | |
| Certifiably Robust Image Watermark | Zhengyuan Jiang*, Moyang Guo, Yuepeng Hu, Jinyuan Jia, Neil Zhenqiang Gong | N/A | |
| Discover-then-Name: Task-Agnostic Concept Bottlenecks via Automated Concept Discovery | Sukrut Rao, Sweta Mahajan, Moritz Böhle, Bernt Schiele | N/A | |
| Online Zero-Shot Classification with CLIP | Qi Qian*, Juhua Hu | N/A | |
| SeA: Semantic Adversarial Augmentation for Last Layer Features from Unsupervised Representation Learning | Qi Qian*, Yuanhong Xu, Juhua Hu | N/A | |
| Unlocking the Potential of Federated Learning: The Symphony of Dataset Distillation via Deep Generative Latents | Yuqi Jia, Saeed Vahidian*, Jingwei Sun, Jianyi Zhang, Vyacheslav Kungurtsev, Neil Zhenqiang Gong, Yiran Chen | N/A | |
| Rethinking Fast Adversarial Training: A Splitting Technique To Overcome Catastrophic Overfitting | Masoumeh Zareapoor, Pourya Shamsolmoali* | N/A | |
| Quality Assured: Rethinking Annotation Strategies in Imaging AI | Tim Rädsch, Annika Reinke, Vivienn Weru, Minu D. Tizabi, Nicholas Heller, Fabian Isensee, Annette Kopp-Schneider, Lena Maier-Hein | N/A | |
| BRIDGE: Bridging Gaps in Image Captioning Evaluation with Stronger Visual Cues | Sara Sarto*, Marcella Cornia, Lorenzo Baraldi, Rita Cucchiara | N/A | |
| Enhancing Plausibility Evaluation for Generated Designs with Denoising Autoencoder | Jiajie Fan, Amal Trigui, Thomas Bäck, Hao Wang | N/A | |
| Weakly-Supervised 3D Hand Reconstruction with Knowledge Prior and Uncertainty Guidance | Yufei Zhang, Jeffrey Kephart, Qiang Ji | N/A | |
| 3D Reconstruction of Objects in Hands without Real World 3D Supervision | Aditya Prakash*, Matthew Chang, Matthew Jin, Ruisen Tu, Saurabh Gupta | N/A | |
| To Supervise or Not to Supervise: Understanding and Addressing the Key Challenges of Point Cloud Transfer Learning | Souhail Hadgi*, Lei Li, Maks Ovsjanikov | N/A | |
| Parameterized Quasi-Physical Simulators for Dexterous Manipulations Transfer | Xueyi Liu, Kangbo Lyu, Jieqiong Zhang, Tao Du, Li Yi | N/A | |
| 3D Hand Pose Estimation in Everyday Egocentric Images | Aditya Prakash*, Ruisen Tu, Matthew Chang, Saurabh Gupta | N/A | |
| Mitigating Perspective Distortion-induced Shape Ambiguity in Image Crops | Aditya Prakash*, Arjun Gupta, Saurabh Gupta | N/A | |
| Towards Neuro-Symbolic Video Understanding | Minkyu Choi*, Harsh Goel, Mohammad Omama, Yunhao Yang, Sahil Shah, Sandeep Chinchali | N/A | |
| Optimization-based Uncertainty Attribution Via Learning Informative Perturbations | Hanjing Wang*, Bashirul Azam Biswas, Qiang Ji | N/A | |
| Context-Aware Action Recognition: Introducing a Comprehensive Dataset for Behavior Contrast | Tatsuya Sasaki*, Yoshiki Ito, Satoshi Kondo | N/A | |
| Semi-supervised Segmentation of Histopathology Images with Noise-Aware Topological Consistency | Meilong Xu*, Xiaoling Hu, Saumya Gupta, Shahira Abousamra, Chao Chen | N/A | |
| Adaptive Compressed Sensing with Diffusion-Based Posterior Sampling | Noam Elata*, Tomer Michaeli, Michael Elad | N/A | |
| Instant Uncertainty Calibration of NeRFs Using a Meta-Calibrator | Niki Amini-Naieni*, Tomas Jakab, Andrea Vedaldi, Ronald Clark | N/A | |
| MetaAT: Active Testing for Label-Efficient Evaluation of Dense Recognition Tasks | Sanbao Su, Xin Li*, Thang Doan, Sima Behpour, Wenbin He, Liang Gou, Fei Miao, Liu Ren | N/A | |
| Salience-Based Adaptive Masking: Revisiting Token Dynamics for Enhanced Pre-training | Hyesong Choi, Hyejin Park, Kwang Moo Yi, Sungmin Cha, Dongbo Min* | N/A | |
| Data Augmentation via Latent Diffusion for Saliency Prediction | Bahar Aydemir*, Deblina Bhattacharjee, Tong Zhang, Mathieu Salzmann, Sabine Süsstrunk | N/A | |
| Explorative Inbetweening of Time and Space | Haiwen Feng*, Zheng Ding, Zhihao Xia, Simon Niklaus, Victoria Fernandez Abrevaya, Michael J. Black, Xuaner Zhang | N/A | |
| A Diffusion Model for Simulation Ready Coronary Anatomy with Morpho-skeletal Control | Karim Kadry*, Shreya Gupta, Jonas Sogbadji, Michiel Schaap, Kersten Petersen, Takuya Mizukami, Carlos Collet, Farhad R. Nezami, Elazer R Edelman | N/A | |
| Learning to Make Keypoints Sub-Pixel Accurate | Shinjeong Kim*, Marc Pollefeys, Daniel Barath | N/A | |
| Imaging with Confidence: Uncertainty Quantification for High-dimensional Undersampled MR Images | Frederik Hoppe*, Claudio Mayrink Verdun, Hannah Sophie Laus, Sebastian Endt, Marion Irene Menzel, Felix Krahmer, Holger Rauhut | N/A | |
| Generalizable Human Gaussians for Sparse View Synthesis | YoungJoong Kwon*, Baole Fang, Yixing Lu, Haoye Dong, Cheng Zhang, Francisco Vicente Carrasco, Albert Mosella-Montoro, Jianjin Xu, Shingo J Takagi, Daeil Kim, Aayush Prakash, Fernando de la Torre | N/A | |
| DrivingDiffusion: Layout-Guided Multi-View Driving Scenarios Video Generation with Latent Diffusion Model | Li Xiaofan, Zhang Yifu, Ye Xiaoqing* | N/A | |
| Evaluating the Adversarial Robustness of Semantic Segmentation: Trying Harder Pays Off | Levente Halmosi, Bálint Mohos, Márk Jelasity* | N/A | |
| SkyScenes: A Synthetic Dataset for Aerial Scene Understanding | Sahil S Khose*, Anisha Pal, Aayushi Agarwal, Deepanshi, Judy Hoffman, Prithvijit Chattopadhyay | N/A | |
| Large-Scale Multi-Hypotheses Cell Tracking Using Ultrametric Contours Maps | Jordão Bragantini*, Merlin Lange, Loïc A Royer | N/A | |
| GSD: View-Guided Gaussian Splatting Diffusion for 3D Reconstruction | Yuxuan Mu*, Xinxin Zuo, Chuan Guo, Yilin Wang, Juwei Lu, Xiaofei Wu, Songcen Xu, Peng Dai, Youliang Yan, Li Cheng | N/A | |
| AdaDiff: Accelerating Diffusion Models through Step-Wise Adaptive Computation | Shengkun Tang*, Yaqing Wang, Caiwen Ding, Yi Liang, Yao Li, Dongkuan Xu | N/A | |
| PFedEdit: Personalized Federated Learning via Automated Model Editing | Haolin Yuan, William Paul, John Aucott, Philippe Burlina, Yinzhi Cao | N/A | |
| De-Confusing Pseudo-Labels in Source-Free Domain Adaptation | Idit Diamant*, Amir Rosenfeld, Idan Achituve, Jacob Goldberger, Arnon Netzer | N/A | |
| GenerateCT: Text-Conditional Generation of 3D Chest CT Volumes | Ibrahim Ethem Hamamci*, Sezgin Er, Anjany Sekuboyina, Enis Simsar, Alperen Tezcan, Ayse Gulnihan Simsek, Sevval Nil Esirgun, Furkan Almas, Irem Dogan, Muhammed Furkan Dasdelen, Chinmay Prabhakar, Hadrien Reynaud, Sarthak Pati, Christian Bluethgen, Mehmet Kemal Ozdemir, Bjoern Menze | N/A | |
| EraseDraw: Learning to Insert Objects by Erasing Them from Images | Alper Canberk*, Maksym Bondarenko, Ege Ozguroglu, Ruoshi Liu, Carl Vondrick | N/A | |
| SuperFedNAS: Cost-Efficient Federated Neural Architecture Search for On-Device Inference | Alind Khare*, Animesh Agrawal, Aditya Annavajjala, Payman Behnam, Myungjin Lee, Hugo M Latapie, Alexey Tumanov | N/A | |
| Towards Reliable Evaluation and Fast Training of Robust Semantic Segmentation Models | Francesco Croce, Naman D. Singh, Matthias Hein | N/A | |
| Contrastive Region Guidance: Improving Grounding in Vision-Language Models without Training | David Wan*, Jaemin Cho, Elias Stengel-Eskin, Mohit Bansal | N/A | |
| Keypoint Promptable Re-Identification | Vladimir Somers*, Alexandre Alahi, Christophe De Vleeschouwer | N/A | |
| Merging and Splitting Diffusion Paths for Semantically Coherent Panoramas | Fabio Quattrini, Vittorio Pippi, Silvia Cascianelli, Rita Cucchiara | N/A | |
| DynMF: Neural Motion Factorization for Real-time Dynamic View Synthesis with 3D Gaussian Splatting | Angelos Kratimenos*, Jiahui Lei, Kostas Daniilidis | N/A | |
| Animal Avatars: Reconstructing Animatable 3D Animals from Casual Videos | Remy Sabathier*, David Novotny, Niloy Mitra | N/A | |
| Perceptual Evaluation of Audio-Visual Synchrony Grounded in Viewers’ Opinion Scores | Lucas Goncalves, Prashant Mathur*, Chandrashekhar Lavania, Metehan Cekic, Marcello Federico, Kyu Han | N/A | |
| MMVR: Millimeter-wave Multi-View Radar Dataset and Benchmark for Indoor Perception | Mohammad Mahbubur Rahman, Ryoma Yataka, Sorachi Kato, Pu Wang*, Peizhao Li, Adriano Cardace, Petros Boufounos | N/A | |
| Training A Secure Model against Data-Free Model Extraction | Zhenyi Wang, Li Shen, Junfeng Guo, Tiehang Duan, Siyu Luan, Tongliang Liu, Mingchen Gao | N/A | |
| EpipolarGAN: Omnidirectional Image Synthesis with Explicit Camera Control | Christopher May*, Daniel Aliaga | N/A | |
| TriNeRFLet: A Wavelet Based Triplane NeRF Representation | Rajaei Khatib, Raja Giryes | N/A | |
| EgoBody3M: Egocentric Body Tracking on a VR Headset using a Diverse Dataset | Amy Zhao, Chengcheng Tang, Lezi Wang, Yijing Li, Mihika Dave, Lingling Tao*, Christopher D. Twigg, Robert Y. Wang | N/A | |
| Photorealistic Video Generation with Diffusion Models | Agrim Gupta*, Lijun Yu, Kihyuk Sohn, Xiuye Gu, Meera Hahn, Li Fei-Fei, Irfan Essa, Lu Jiang, Jose Lezama | N/A | |
| RAVE: Residual Vector Embedding for CLIP-Guided Backlit Image Enhancement | Tatiana Gaintseva, Martin Benning, Gregory Slabaugh | N/A | |
| TIBET: Identifying and Evaluating Biases in Text-to-Image Generative Models | Aditya Chinchure, Pushkar Shukla, Gaurav Bhatt, Kiri Salij, Kartik Hosanagar, Leonid Sigal, Matthew Turk | N/A | |
| Object-Aware Query Perturbation for Cross-Modal Image-Text Retrieval | Naoya Sogi, Takashi Shibata, Makoto Terao* | N/A | |
| DECIDER: Leveraging Foundation Model Priors for Improved Model Failure Detection and Explanation | Rakshith Subramanyam, Kowshik Thopalli, Vivek Sivaraman Narayanaswamy, Jayaraman J. Thiagarajan | N/A | |
| Ex2Eg-MAE: A Framework for Adaptation of Exocentric Video Masked Autoencoders for Egocentric Social Role Understanding | Minh Tran*, Yelin Kim, Che-Chun Su, Min Sun, Cheng-Hao Kuo, Mohammad Soleymani | N/A | |
| Self-Supervised Audio-Visual Soundscape Stylization | Tingle Li*, Renhao Wang, Po-Yao Huang, Andrew Owens, Gopala Krishna Anumanchipalli | N/A | |
| SAVE: Protagonist Diversification with Structure Agnostic Video Editing | Yeji Song, Wonsik Shin, Junsoo Lee, Jeesoo Kim, Nojun Kwak | N/A | |
| VideoAgent: Long-form Video Understanding with Large Language Model as Agent | Xiaohan Wang*, Yuhui Zhang, Orr Zohar, Serena Yeung-Levy | N/A | |
| Meta-optimized Angular Margin Contrastive Framework for Video-Language Representation Learning | Thong Thanh Nguyen*, Yi Bin, Xiaobao Wu, Xinshuai Dong, Zhiyuan Hu, Khoi M Le, Cong-Duy Nguyen, See Kiong Ng, Anh Tuan Luu | N/A | |
| Source-Free Domain-Invariant Performance Prediction | Ekaterina Khramtsova*, Mahsa Baktashmotlagh, Guido Zuccon, Xi Wang, Mathieu Salzmann | N/A | |
| Improving Robustness to Model Inversion Attacks via Sparse Coding Architectures | Sayanton V. Dibbo*, Adam Breuer, Juston Moore, Michael Teti | N/A | |
| Constructing Concept-based Models to Mitigate Spurious Correlations with Minimal Human Effort | Jeeyung Kim*, Ze Wang, Qiang Qiu | N/A | |
| Direct Distillation between Different Domains | Jialiang Tang, Shuo Chen, Gang Niu, Hongyuan Zhu, Joey Tianyi Zhou, Chen Gong, Masashi Sugiyama | N/A | |
| Contrastive ground-level image and remote sensing pre-training improves representation learning for natural world imagery | Andy V Huynh*, Lauren Gillespie, Jael Lopez-Saucedo, Claire Tang, Rohan Sikand, Moisés Expósito-Alonso | N/A | |
| V-Trans4Style: Visual Transition Recommendation for Video Production Style Adaptation | Pooja Guhan*, Tsung-Wei Huang, Guan-Ming Su, Subhadra Gopalakrishnan, Dinesh Manocha | N/A | |
| GRiT: A Generative Region-to-text Transformer for Object Understanding | Jialian Wu*, Jianfeng Wang, Zhengyuan Yang, Zhe Gan, Zicheng Liu, Junsong Yuan, Lijuan Wang | N/A | |
| LRSLAM: Low-rank Representation of Signed Distance Fields in Dense Visual SLAM System | Hongbeen Park, Minjeong Park, Giljoo Nam, Jinkyu Kim* | N/A | |
| Learning Representation for Multitask Learning through Self-Supervised Auxiliary Learning | Seokwon Shin, Hyungrok Do, Youngdoo Son* | N/A | |
| Neural Poisson Solver: A Universal and Continuous Framework for Natural Signal Blending | Delong Wu, Hao Zhu, Qi Zhang, You Li, Xun Cao, Zhan Ma | N/A | |
| Geometry Fidelity for Spherical Images | Anders Christensen, Nooshin Mojab, Khushman Patel, Karan Ahuja, Zeynep Akata, Ole Winther, Mar Gonzalez Franco, Andrea Colaco | N/A | |
| BAGS: Blur Agnostic Gaussian Splatting through Multi-Scale Kernel Modeling | Cheng Peng*, Yutao Tang, Yifan Zhou, Nengyu Wang, Xijun Liu, Deming Li, Rama Chellappa | N/A | |
| CroMo-Mixup: Augmenting Cross-Model Representations for Continual Self-Supervised Learning | Erum Mushtaq*, Duygu Nur Yaldiz, Yavuz Faruk Bakman, Jie Ding, Chenyang Tao, Dimitrios Dimitriadis, Salman Avestimehr | N/A | |
| WoVoGen: World Volume-aware Diffusion for Controllable Multi-camera Driving Scene Generation | Jiachen Lu, Ze Huang, Zeyu Yang, Zhang Jiahui, Li Zhang* | N/A | |
| Benchmarking Spurious Bias in Few-Shot Image Classifiers | Guangtao Zheng*, Wenqian Ye, Aidong Zhang | N/A | |
| TurboEdit: Real-time text-based disentangled real image editing | Zongze Wu*, Nicholas I Kolkin, Jonathan Brandt, Richard Zhang, Eli Shechtman | N/A | |
| Soft Shadow Diffusion (SSD): Physics-inspired Learning for 3D Computational Periscopy | Fadlullah A Raji, John Murray-Bruce | N/A | |
| Augmented Neural Fine-tuning for Efficient Backdoor Purification | Nazmul Karim*, Abdullah Al Arafat, Umar Khalid, Zhishan Guo, Nazanin Rahnavard | N/A | |
| REDIR: Refocus-free Event-based De-occlusion Image Reconstruction | Qi Guo, Hailong Shi, Huan Li, Jinsheng Xiao, Xingyu Gao | N/A | |
| Free-Editor: Zero-shot Text-driven 3D Scene Editing | Nazmul Karim*, Hasan Iqbal, Umar Khalid, Chen Chen, Jing Hua | N/A | |
| DPA-Net: Structured 3D Abstraction from Sparse Views via Differentiable Primitive Assembly | Fenggen Yu*, Yiming Qian, Xu Zhang, Francisca Gil-Ureta, Brian Jackson, Eric Bennett, Hao Zhang | N/A | |
| An Empirical Study and Analysis of Text-to-Image Generation Using Large Language Model-Powered Textual Representation | Zhiyu Tan, Mengping Yang, Luozheng Qin, Hao Yang, Ye Qian, Qiang Zhou, Cheng Zhang, Hao Li* | N/A | |
| Few-shot Class Incremental Learning with Attention-Aware Self-Adaptive Prompt | Chenxi Liu, Zhenyi Wang, Tianyi Xiong, Ruibo Chen, Yihan Wu, Junfeng Guo, Heng Huang | N/A | |
| An Image is Worth 1/2 Tokens After Layer 2: Plug-and-Play Inference Acceleration for Large Vision-Language Models | Liang Chen, Haozhe Zhao, Tianyu Liu, Shuai Bai, Junyang Lin, Chang Zhou, Baobao Chang* | N/A | |
| Generalizable Symbolic Optimizer Learning | Xiaotian Song, Peng Zeng, Yanan Sun*, Andy Song | N/A | |
| Online Continuous Generalized Category Discovery | Keon-Hee Park, Hakyung Lee, Kyungwoo Song, Gyeong-Moon Park | N/A | |
| Bridging Different Language Models and Generative Vision Models for Text-to-Image Generation | Shihao Zhao, Shaozhe Hao, Bojia Zi, Huaizhe Xu, Kwan-Yee K. Wong | N/A | |
| Tackling Structural Hallucination in Image Translation with Local Diffusion | Seunghoi Kim*, Chen Jin, Tom Diethe, Matteo Figini, Henry FJ Tregidgo, Asher Mullokandov, Philip A Teare, Daniel Alexander | N/A | |
| Hierarchical Separable Video Transformer for Snapshot Compressive Imaging | Ping Wang, Yulun Zhang, Lishun Wang, Xin Yuan | N/A | |
| Unified Medical Image Pre-training in Language-Guided Common Semantic Space | Xiaoxuan He, Yifan Yang, Xinyang Jiang, Xufang Luo*, Haoji Hu, Siyun Zhao, Dongsheng Li, Yuqing Yang, Lili Qiu | N/A | |
| On the Vulnerability of Skip Connections to Model Inversion Attacks | Jun Hao Koh*, Sy-Tuyen Ho, Ngoc-Bao Nguyen, Ngai-Man Cheung | N/A | |
| Adversarial Robustification via Text-to-Image Diffusion Models | Daewon Choi, Jongheon Jeong, Huiwon Jang, Jinwoo Shin* | N/A | |
| Overcome Modal Bias in Multi-modal Federated Learning via Balanced Modality Selection | Yunfeng FAN, Wenchao Xu, Haozhao Wang, Fushuo Huo, Jinyu Chen, Song Guo | N/A | |
| Comprehensive Attribution: Inherently Explainable Vision Model with Feature Detector | Xianren Zhang, Dongwon Lee, Suhang Wang* | N/A | |
| Reinforcement Learning via Auxiliary Task Distillation | Abhinav N Harish*, Larry Heck, Josiah P Hanna, Zsolt Kira, Andrew Szot | N/A | |
| DHR: Dual Features-Driven Hierarchical Rebalancing in Inter- and Intra-Class Regions for Weakly-Supervised Semantic Segmentation | Sanghyun Jo, Fei Pan, In-Jae Yu, Kyungsu Kim* | N/A | |
| Pre-trained Visual Dynamics Representations for Efficient Policy Learning | Hao Luo, Bohan Zhou, Zongqing Lu | N/A | |
| View-Consistent Hierarchical 3D Segmentation Using Ultrametric Feature Fields | Haodi He, Colton Stearns, Adam Harley, Leonidas Guibas* | N/A | |
| Plug and Play: A Representation Enhanced Domain Adapter for Collaborative Perception | Tianyou Luo, Quan Yuan, Yuchen Xia, Guiyang Luo, Yujia Yang, Jinglin Li | N/A | |
| Follow the Rules: Reasoning for Video Anomaly Detection with Large Language Models | Yuchen Yang, Kwonjoon Lee, Behzad Dariush, Yinzhi Cao, Shao-Yuan Lo* | N/A | |
| SAM4MLLM: Enhance Multi-Modal Large Language Model for Referring Expression Segmentation | Yi-Chia Chen, Wei-Hua Li, Cheng Sun, Yu-Chiang Frank Wang, Chu-Song Chen* | N/A | |
| TTD: Text-Tag Self-Distillation Enhancing Image-Text Alignment in CLIP to Alleviate Single Tag Bias | Sanghyun Jo, Soohyun Ryu, Sungyub Kim, Eunho Yang, Kyungsu Kim* | N/A | |
| Learning Quantized Adaptive Conditions for Diffusion Models | Yuchen Liang, Yuchuan Tian, Lei Yu, Huaao Tang, Jie Hu, Xiangzhong Fang, Hanting Chen | N/A | |
| STAMP: Outlier-Aware Test-Time Adaptation with Stable Memory Replay | Yu Yongcan, Lijun Sheng, Ran He, Jian Liang* | N/A | |
| Remove Projective LiDAR Depthmap Artifacts via Exploiting Epipolar Geometry | Shengjie Zhu*, Girish Chandar Ganesan, Abhinav Kumar, Xiaoming Liu | N/A | |
| Accelerating Online Mapping and Behavior Prediction via Direct BEV Feature Attention | Xunjiang Gu, Guanyu Song, Igor Gilitschenski, Marco Pavone, Boris Ivanovic* | N/A | |
| High-Fidelity Modeling of Generalizable Wrinkle Deformation | Jingfan Guo, Jae Shin Yoon, Shunsuke Saito, Takaaki Shiratori, Hyun Soo Park* | N/A | |
| Instruction Tuning-free Visual Token Complement for Multimodal LLMs | Dongsheng Wang*, Jiequan Cui, Miaoge Li, Wang Lin, Bo Chen, Hanwang Zhang | N/A | |
| Exploring Conditional Multi-Modal Prompts for Zero-shot HOI Detection | Ting Lei, Shaofeng Yin, Yuxin Peng, Yang Liu* | N/A | |
| Training-free Video Temporal Grounding using Large-scale Pre-trained Models | Minghang Zheng, Xinhao Cai, Qingchao Chen, Yuxin Peng, Yang Liu* | N/A | |
| Revisit Self-supervision with Local Structure-from-Motion | Shengjie Zhu*, Xiaoming Liu | N/A | |
| FAMOUS: High-Fidelity Monocular 3D Human Digitization Using View Synthesis | Vishnu Mani Hema*, Shubhra Aich, Christian Haene, Jean-Charles Bazin, Fernando de la Torre | N/A | |
| Efficient Learning of Event-based Dense Representation using Hierarchical Memories with Adaptive Update | Uday Kamal*, Saibal Mukhopadhyay | N/A | |
| SNP: Structured Neuron-level Pruning to Preserve Attention Scores | KyungHwan Shim, Jaewoong Yun, Shinkook Choi* | N/A | |
| Multi-Granularity Sparse Relationship Matrix Prediction Network for End-to-End Scene Graph Generation | Lei Wang, Zejian Yuan, Badong Chen* | N/A | |
| Flash-Splat: 3D Reflection Removal with Flash Cues and Gaussian Splats | Mingyang Xie*, Haoming Cai, Sachin Shah, Yiran Xu, Brandon Y. Feng, Jia-Bin Huang, Christopher A. Metzler | N/A | |
| PALM: Predicting Actions through Language Models | Sanghwan Kim*, Daoji Huang, Yongqin Xian, Otmar Hilliges, Luc Van Gool, Xi Wang | N/A | |
| Motion Keyframe Interpolation for Any Human Skeleton using Point Cloud-based Human Motion Data Homogenisation | Clinton A Mo, Kun Hu*, Chengjiang Long, Dong Yuan, Zhiyong Wang | N/A | |
| SwiftBrush v2: Make Your One-step Diffusion Model Better Than Its Teacher | Trung Tuan Dao, Thuan Hoang Nguyen, Thanh Van Le, Duc H Vu, Khoi Nguyen, Cuong Pham, Anh T Tran | N/A | |
| Learning to Localize Actions in Instructional Videos with LLM-Based Multi-Pathway Text-Video Alignment | Yuxiao Chen, Kai Li, Wentao Bao, Deep Patel, Yu Kong, Martin Renqiang Min, Dimitris N. Metaxas | N/A | |
| Improving Hyperbolic Representations via Gromov-Wasserstein Regularization | Yifei Yang, Wonjun Lee, Dongmian Zou*, Gilad Lerman | N/A | |
| VSViG: Real-time Video-based Seizure Detection via Skeleton-based Spatiotemporal ViG | Yankun Xu, Junzhe Wang, Yun-Hsuan Chen, Jie Yang, Wenjie Ming, Shuang Wang, Mohamad Sawan | N/A | |
| DiffSurf: A Transformer-based Diffusion Model for Generating and Reconstructing 3D Surfaces in Pose | Yusuke Yoshiyasu*, Leyuan Sun | N/A | |
| Exploiting Supervised Poison Vulnerability to Strengthen Self-Supervised Defense | Jeremy Styborski, Mingzhi Lyu, Yi Huang, Adams Kong | N/A | |
| Dense Hand-Object (HO) GraspNet with Full Grasping Taxonomy and Dynamics | Woojin Cho, Jihyun Lee, Minjae Yi, Minje Kim, Taeyun Woo, Donghwan Kim, Taewook Ha, Hyokeun Lee, Je-Hwan Ryu, Woontack Woo, Tae-Kyun (T-K) Kim* | N/A | |
| Human Pose Recognition via Occlusion-Preserving Abstract Images | Saad Manzur, Wayne B Hayes | N/A | |
| DA-BEV: Unsupervised Domain Adaptation for Bird's Eye View Perception | Kai Jiang*, Jiaxing Huang, Weiying Xie, Jie Lei, Yunsong Li, Ling Shao, Shijian Lu | N/A | |
| SlimFlow: Training Smaller One-Step Diffusion Models with Rectified Flow | Yuanzhi Zhu, Xingchao Liu, Qiang Liu | N/A | |
| PhysGen: Rigid-Body Physics-Grounded Image-to-Video Generation | Shaowei Liu, Zhongzheng Ren, Saurabh Gupta, Shenlong Wang* | N/A | |
| Depth-Aware Blind Image Decomposition for Real-World Adverse Weather Recovery | Chao Wang*, Zhedong Zheng, Ruijie Quan, Yi Yang | N/A | |
| DreamSampler: Unifying Diffusion Sampling and Score Distillation for Image Manipulation | Jeongsol Kim, Geon Yeong Park, Jong Chul Ye* | N/A | |
| Reshaping the Online Data Buffering and Organizing Mechanism for Continual Test-Time Adaptation | Zhilin Zhu, Xiaopeng Hong, Zhiheng Ma, Weijun Zhuang, YaoHui Ma, Yong Dai, Yaowei Wang | N/A | |
| Personalized Privacy Protection Mask Against Unauthorized Facial Recognition | Ka-Ho Chow*, Sihao Hu, Tiansheng Huang, Ling Liu | N/A | |
| PosterLlama: Bridging Design Ability of Language Model to Content-Aware Layout Generation | Jaejung Seol, SeoJun Kim, Jaejun Yoo* | N/A | |
| PreciseControl: Enhancing Text-To-Image Diffusion Models with Fine-Grained Attribute Control | Rishubh Parihar*, Sachidanand VS, Sabariswaran Mani, Tejan Karmali, Venkatesh Babu RADHAKRISHNAN | N/A | |
| LG-Gaze: Learning Geometry-aware Continuous Prompts for Language-Guided Gaze Estimation | Pengwei Yin*, Jingjing Wang, Guanzhong Zeng, Di Xie, Jiang Zhu | N/A | |
| Efficient Training with Denoised Neural Weights | Yifan Gong*, Zheng Zhan, Yanyu Li, Yerlan Idelbayev, Andrey Zharkov, Kfir Aberman, Sergey Tulyakov, Yanzhi Wang, Jian Ren | N/A | |
| Learning the Unlearned: Mitigating Feature Suppression in Contrastive Learning | Jihai Zhang, Xiang Lan, Xiaoye Qu, Yu Cheng, Mengling Feng, Bryan Hooi | N/A | |
| Integration of Global and Local Representations for Fine-grained Cross-modal Alignment | Seungwan Jin, Hoyoung Choi, Taehyung Noh, Kyungsik Han* | N/A | |
| Local and Global Flatness for Federated Domain Generalization | Hao Yan, Yuhong Guo* | N/A | |
| SRPose: Two-view Relative Pose Estimation with Sparse Keypoints | Rui Yin, Yulun Zhang, Zherong Pan, Jianjun Zhu, Cheng Wang, Biao Jia* | N/A | |
| Deep Reward Supervisions for Tuning Text-to-Image Diffusion Models | Xiaoshi Wu, Yiming Hao, Manyuan Zhang, Keqiang Sun, Zhaoyang Huang, Guanglu Song, Yu Liu, Hongsheng Li | N/A | |
| Paying More Attention to Images: A Training-Free Method for Alleviating Hallucination in LVLMs | Shi Liu, Kecheng Zheng, Wei Chen* | N/A | |
| Inf-DiT: Upsampling any-resolution image with memory-efficient diffusion transformer | Zhuoyi Yang*, Heyang Jiang, Wenyi Hong, Jiayan Teng, Wendi Zheng, Yuxiao Dong, Ming Ding, Jie Tang | N/A | |
| Implicit Neural Models to Extract Heart Rate from Video | Pradyumna Chari*, Anirudh Bindiganavale Harish, Adnan Armouti, Alexander Vilesov, Sanjit Sarda, Laleh Jalilian, Achuta Kadambi | N/A | |
| Boost Your NeRF: A Model-Agnostic Mixture of Experts Framework for High Quality and Efficient Rendering | Francesco Di Sario*, Riccardo Renzulli, Marco Grangetto, Enzo Tartaglione | N/A | |
| PFGS: High Fidelity Point Cloud Rendering via Feature Splatting | Jiaxu Wang, Zhang Ziyi, Junhao He, Renjing Xu* | N/A | |
| Few-Shot Anomaly-Driven Generation for Anomaly Classification and Segmentation | Guan Gui, Bin-Bin Gao*, Jun Liu, Chengjie Wang, Yunsheng Wu | N/A | |
| E3M: Zero-Shot Spatio-Temporal Video Grounding with Expectation-Maximization Multimodal Modulation | Peijun Bao*, Zihao Shao, Wenhan Yang, Boon Poh Ng, Alex Kot | N/A | |
| EMO: Emote Portrait Alive - Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions | Linrui Tian, Qi Wang, Bang Zhang, Liefeng Bo | N/A | |
| LMT-GP: Combined Latent Mean-Teacher and Gaussian Process for Semi-supervised Low-light Image Enhancement | Ye Yu, Fengxin Chen, Jun Yu*, Zhen Kan | N/A | |
| Veil Privacy on Visual Data: Concealing Privacy for Humans, Unveiling for DNNs | Shuchao Pang, Ruhao Ma, Bing Li, Yongbin Zhou, Yazhou Yao | N/A | |
| Efficient Vision Transformers with Partial Attention | Xuan-Thuy Vo, Duy-Linh Nguyen, Adri Priadana, Kang-Hyun Jo | N/A | |
| Generalized Coverage for More Robust Low-Budget Active Learning | Wonho Bae, Junhyug Noh, Danica J. Sutherland* | N/A | |
| Rasterized Edge Gradients: Handling Discontinuities Differentially | Stanislav Pidhorskyi*, Tomas Simon, Gabriel Schwartz, He Wen, Yaser Sheikh, Jason Saragih | N/A | |
| Enhancing Cross-Subject fMRI-to-Video Decoding with Global-Local Functional Alignment | Chong Li, Xuelin Qian, Yun Wang, Jingyang Huo, Xiangyang Xue, Yanwei Fu*, Jianfeng Feng | N/A | |
| FedTSA: A Cluster-based Two-Stage Aggregation Method for Model-heterogeneous Federated Learning | Boyu Fan*, Chenrui Wu, Xiang Su, Pan HUI | N/A | |
| LLaVA-UHD: an LMM Perceiving any Aspect Ratio and High-Resolution Images | Zonghao Guo, Ruyi Xu, Yuan Yao, Junbo Cui, Zanlin Ni, Chunjiang Ge, Tat-Seng Chua, Zhiyuan Liu, Gao Huang | N/A | |
| Learning Natural Consistency Representation for Face Forgery Video Detection | Daichi Zhang, Zihao Xiao, Shikun Li, Fanzhao Lin, Jianmin Li, Shiming Ge | N/A | |
| ZeroI2V: Zero-Cost Adaptation of Pre-Trained Transformers from Image to Video | Xinhao Li, Yuhan Zhu, Limin Wang* | N/A | |
| Zero-Shot Adaptation for Approximate Posterior Sampling of Diffusion Models in Inverse Problems | Yasar U Alcalar*, Mehmet Akcakaya | N/A | |
| R.A.C.E.: Robust Adversarial Concept Erasure for Secure Text-to-Image Diffusion Model | Changhoon Kim, Kyle Min, Yezhou Yang | N/A | |
| OpenSight: A Simple Open-Vocabulary Framework for LiDAR-Based Object Detection | Hu Zhang, Xu Jianhua, Tao Tang, Haiyang Sun, Xin Yu, Zi Helen Huang, Kaicheng Yu | N/A | |
| Few-Shot Image Generation by Conditional Relaxing Diffusion Inversion | Yu Cao*, Shaogang Gong | N/A | |
| Data Poisoning Quantization Backdoor Attack | Tran Huynh*, Anh Tran, Khoa Doan, Tung Pham | N/A | |
| DailyDVS-200: A Comprehensive Benchmark Dataset for Event-Based Action Recognition | Qi Wang, Zhou Xu, Yuming Lin, Jingtao Ye, Hongsheng Li, Guangming Zhu, Syed Afaq Ali Shah, Mohammed Bennamoun, Liang Zhang* | N/A | |
| On the Topology Awareness and Generalization Performance of Graph Neural Networks | Junwei Su*, Chuan Wu | N/A | |
| T-CorresNet: Template Guided 3D Point Cloud Completion with Correspondence Pooling Query Generation Strategy | Fan Duan, Jiahao Yu, Li Chen* | N/A | |
| A high-quality robust diffusion framework for corrupted dataset | Quan Dao*, Binh Ta, Tung Pham, Anh Tran | N/A | |
| Efficient 3D-Aware Facial Image Editing via Attribute-Specific Prompt Learning | Amandeep Kumar*, Muhammad Awais, Sanath Narayan, Hisham Cholakkal, Salman Khan, Rao Muhammad Anwer | N/A | |
| Distilling Knowledge from Large-Scale Image Models for Object Detection | Gang Li, Wenhai Wang, Xiang Li, Ziheng Li, Jian Yang, Jifeng Dai, Yu Qiao, Shanshan Zhang | N/A | |
| Embracing Events and Frames with Hierarchical Feature Refinement Network for Object Detection | Hu Cao, Zehua Zhang, Yan Xia, Xinyi Li, Jiahao Xia, Guang Chen*, Alois C. Knoll | N/A | |
| TimeLens-XL: Real-time Event-based Video Frame Interpolation with Large Motion | Shi Guo, Yutian Chen, Tianfan Xue, Jinwei Gu, Yongrui Ma* | N/A | |
| Scene-Graph ViT: End-to-End Open-Vocabulary Visual Relationship Detection | Tim Salzmann, Markus Ryll, Alex Bewley, Matthias Minderer* | N/A | |
| Self-Supervised Underwater Caustics Removal and Descattering via Deep Monocular SLAM | Jonathan Sauder*, Devis Tuia | N/A | |
| Enriching Information and Preserving Semantic Consistency in Expanding Curvilinear Object Segmentation Datasets | Qin Lei*, Jiang Zhong, Qizhu Dai | N/A | |
| Retrieval Robust to Object Motion Blur | Rong Zou, Marc Pollefeys, Denys Rozumnyi* | N/A | |
| Unsupervised Representation Learning by Balanced Self Attention Matching | Daniel Shalam, Simon Korman | N/A | |
| DualBEV: Unifying Dual View Transformation with Probabilistic Correspondences | Peidong Li, Wancheng Shen, Qihao Huang, Dixiao Cui | N/A | |
| Identity-Consistent Diffusion Network for Grading Knee Osteoarthritis Progression in Radiographic Imaging | Wenhua Wu, Kun Hu*, Wenxi Yue, Wei Li, Milena Simic, Changyang Li, Wei Xiang, Zhiyong Wang | N/A | |
| Learned Neural Physics Simulation for Articulated 3D Human Pose Reconstruction | Misha Andriluka*, Baruch Tabanpour, Daniel Freeman, Cristian Sminchisescu | N/A | |
| Enhancing Source-Free Domain Adaptive Object Detection with Low-confidence Pseudo Label Distillation | Ilhoon Yoon, Hyeongjun Kwon, Jin Kim, Junyoung Park, Hyunsung Jang, Kwanghoon Sohn* | N/A | |
| Fast Training of Diffusion Transformer with Extreme Masking for 3D Point Clouds Generation | Shentong Mo, Enze Xie*, Yue Wu, Junsong Chen, Matthias Niessner, Zhenguo Li | N/A | |
| Make a Strong Teacher with Label Assistance: A Novel Knowledge Distillation Approach for Semantic Segmentation | Shoumeng Qiu, Jie Chen, Xinrun Li, Ru Wan, Xiangyang Xue, Jian Pu* | N/A | |
| Make-Your-3D: Fast and Consistent Subject-Driven 3D Content Generation | Fangfu Liu, Hanyang Wang, Weiliang Chen, Haowen Sun, Yueqi Duan* | N/A | |
| Segment, Lift and Fit: Automatic 3D Shape Labeling from 2D Prompts | Jianhao Li, Tianyu Sun, Zhongdao Wang, Enze Xie, Bailan Feng, Hongbo Zhang, Ze Yuan, Ke Xu, Jiaheng Liu, Ping Luo | N/A | |
| SCOD: From Heuristics to Theory | Vojtech Franc, Jakub Paplham, Daniel Prusa* | N/A | |
| Preventing Catastrophic Forgetting through Memory Networks in Continuous Detection | Gaurav Bhatt*, Leonid Sigal, James Ross | N/A | |
| Improving Zero-shot Generalization of Learned Prompts via Unsupervised Knowledge Distillation | Marco Mistretta*, Alberto Baldrati, Marco Bertini, Andrew D. Bagdanov | N/A | |
| Teach CLIP to Develop a Number Sense for Ordinal Regression | Yao DU, Qiang Zhai, Weihang Dai, Xiaomeng Li | N/A | |
| Compact 3D Scene Representation via Self-Organizing Gaussian Grids | Wieland Morgenstern*, Florian Barthel, Anna Hilsmann, Peter Eisert | N/A | |
| Pix2Gif: Motion-Guided Diffusion for GIF Generation | Hitesh Kandala*, Jianfeng Gao, Jianwei Yang | N/A | |
| VETRA: A Dataset for Vehicle Tracking in Aerial Imagery - New Challenges for Multi-Object Tracking | Jens Hellekes*, Manuel Mühlhaus, Reza Bahmanyar, Seyed Majid Azimi, Franz Kurz | N/A | |
| SelfGeo: Self-supervised and Geodesic-consistent Estimation of Keypoints on Deformable Shapes | Mohammad Zohaib*, Luca Cosmo, Alessio Del Bue | N/A | |
| Beyond Prompt Learning: Continual Adapter for Efficient Rehearsal-Free Continual Learning | Xinyuan Gao, Songlin Dong, Yuhang He*, Qiang Wang, Yihong Gong | N/A | |
| T2IShield: Defending Against Backdoors on Text-to-Image Diffusion Models | Zhongqi Wang, Jie Zhang*, Shiguang Shan, Xilin Chen | N/A | |
| ExMatch: Self-guided Exploitation for Semi-Supervised Learning with Scarce Labeled Samples | Noo-ri Kim, Jin-Seop Lee, Jee-Hyong Lee* | N/A | |
| Towards Certifiably Robust Face Recognition | Seunghun Paik, Dongsoo Kim, Chanwoo Hwang, Sunpill Kim, Jae Hong Seo* | N/A | |
| Linking in Style: Understanding learned features in deep learning models | Maren Wehrheim*, Pamela Osuna Vargas, Matthias Kaschube | N/A | |
| Stable Video Portraits | Mirela Ostrek*, Justus Thies | N/A | |
| UDA-Bench: Revisiting Common Assumptions in Unsupervised Domain Adaptation Using a Standardized Framework | Tarun Kalluri*, Sreyas Ravichandran, Manmohan Chandraker | N/A | |
| CliffPhys: Camera-based Respiratory Measurement using Clifford Neural Networks | Omar Ghezzi*, Giuseppe Boccignone, Giuliano Grossi, Raffaella Lanzarotti, Alessandro D'Amelio | N/A | |
| Learned Rate Control for Frame-Level Adaptive Neural Video Compression via Dynamic Neural Network | Chenhao Zhang, Wei Gao* | N/A | |
| PDiscoFormer: Relaxing Part Discovery Constraints with Vision Transformers | Ananthu Aniraj*, Cassio F. Dantas, Dino Ienco, Diego Marcos | N/A | |
| Vision-Language Dual-Pattern Matching for Out-of-Distribution Detection | Zihan Zhang, Zhuo Xu, Xiang Xiang* | N/A | |
| Synthesizing Environment-Specific People in Photographs | Mirela Ostrek*, Carol O'Sullivan, Michael J. Black, Justus Thies | N/A | |
| Weight Conditioning for Smooth Optimization of Neural Networks | Hemanth Saratchandran*, Thomas X Wang, Simon Lucey | N/A | |
| Energy-Calibrated VAE with Test Time Free Lunch | Yihong Luo, Siya Qiu, Xingjian Tao, Yujun Cai, Jing Tang* | N/A | |
| MoEAD: A Parameter-efficient Model for Multi-class Anomaly Detection | Shiyuan Meng, Wenchao Meng*, Qihang Zhou, Shizhong Li, Weiye Hou, Shibo He | N/A | |
| SceneTeller: Language-to-3D Scene Generation | Basak Melis Ocal*, Maxim Tatarchenko, Sezer Karaoglu, Theo Gevers | N/A | |
| MagMax: Leveraging Model Merging for Seamless Continual Learning | Daniel Marczak, Bartlomiej Twardowski, Tomasz Trzcinski, Sebastian Cygert | N/A | |
| InternVideo2: Scaling Foundation Models for Multimodal Video Understanding | Yi Wang, Kunchang Li, Xinhao Li, Jiashuo Yu, Yinan He, Guo Chen, Baoqi Pei, Rongkun Zheng, Jilan Xu, Zun Wang, Yansong Shi, Tianxiang Jiang, SongZe Li, Hongjie Zhang, Yifei Huang, Yu Qiao, Yali Wang, Limin Wang | N/A | |
| DiffusionPen: Towards Controlling the Style of Handwritten Text Generation | Konstantina Nikolaidou*, George Retsinas, Giorgos Sfikas, Marcus Liwicki | N/A | |
| Debiasing surgeon: fantastic weights and how to find them | Remi Nahon, Ivan Luiz De Moura Matos, Van-Tam Nguyen, Enzo Tartaglione* | N/A | |
| Denoising Vision Transformers | Jiawei Yang*, Katie Z Luo, Jiefeng Li, Congyue Deng, Leonidas Guibas, Dilip Krishnan, Kilian Weinberger, Yonglong Tian, Yue Wang | N/A | |
| Differentiable Product Quantization for Memory Efficient Camera Relocalization | Zakaria Laskar*, Iaroslav Melekhov, Assia Benbihi, Shuzhe Wang, Juho Kannala | N/A | |
| Spline-based Transformers | Prashanth Chandran, Agon Serifi, Markus Gross, Moritz Bächer | N/A | |
| Learning Pseudo 3D Guidance for View-consistent Texturing with 2D Diffusion | Kehan Li, Yanbo Fan, Yang Wu, Zhongqian Sun, Wei Yang, Xiangyang Ji, Li Yuan, Jie Chen | N/A | |
| TreeSBA: Tree-Transformer for Self-Supervised Sequential Brick Assembly | Mengqi Guo*, Chen Li, Yuyang Zhao, Gim Hee Lee | N/A | |
| SparseRadNet: Sparse Perception Neural Network on Subsampled Radar Data | Jialong Wu*, Mirko Meuter, Markus Schoeler, Matthias Rottmann | N/A | |
| Enhancing Semantic Fidelity in Text-to-Image Synthesis: Attention Regulation in Diffusion Models | Yang Zhang*, Tze Tzun Teoh, Wei Hern Lim, Kenji Kawaguchi | N/A | |
| Adversarial Diffusion Distillation | Axel Sauer*, Dominik Lorenz, Andreas Blattmann, Robin Rombach | N/A | |
| Fake It till You Make It: Curricular Dynamic Forgery Augmentations towards General Deepfake Detection | Yuzhen Lin, Wentang Song, Bin Li, Yuezun Li, Jiangqun Ni, Han Chen, Qiushi Li | N/A | |
| Explain via Any Concept: Concept Bottleneck Model with Open Vocabulary Concepts | Andong Tan, Fengtao Zhou, Hao Chen* | N/A | |
| Explore the Potential of CLIP for Training-Free Open Vocabulary Semantic Segmentation | Tong Shao, Zhuotao Tian, Hang Zhao, Jingyong Su | N/A | |
| A Multimodal Benchmark Dataset and Model for Crop Disease Diagnosis | Xiang Liu, Zhaoxiang Liu, Huan Hu, Zezhou Chen, Kohou Wang, Kai Wang, Shiguo Lian | N/A | |
| Missing Modality Prediction for Unpaired Multimodal Learning via Joint Embedding of Unimodal Models | Taesup Kim*, Donggeun Kim | N/A | |
| Learning Where to Look: Self-supervised Viewpoint Selection for Active Localization using Geometrical Information | Luca Di Giammarino*, Boyang Sun, Giorgio Grisetti, Marc Pollefeys, Hermann Blum, Daniel Barath | N/A | |
| Improving Diffusion Models for Authentic Virtual Try-on in the Wild | Yisol Choi, Sangkyung Kwak, Kyungmin Lee, Hyungwon Choi, Jinwoo Shin | N/A | |
| Exploiting Semantic Reconstruction to Mitigate Hallucinations in Vision-Language Models | Minchan Kim, Minyeong Kim, Junik Bae, Suhwan Choi, Sungkyung Kim, Buru Chang* | N/A | |
| LISO: Lidar-only Self-Supervised 3D Object Detection | Stefan Andreas Baur*, Frank Moosmann, Andreas Geiger | N/A | |
| Text-Conditioned Resampler For Long Form Video Understanding | Bruno Korbar*, Yongqin Xian, Alessio Tonioni, Andrew Zisserman, Federico Tombari | N/A | |
| Implicit Steganography Beyond the Constraints of Modality | Sojeong Song, Seoyun Yang, Chang D. Yoo, Junmo Kim | N/A | |
| Using My Artistic Style? You Must Obtain My Authorization | Xiuli Bi, Haowei Liu, Weisheng Li, Bo Liu*, Bin Xiao | N/A | |
| LookupViT: Compressing visual information to a limited number of tokens | Rajat Koner, Gagan Jain, Sujoy Paul*, Volker Tresp, Prateek Jain | N/A | |
| Fast Diffusion-Based Counterfactuals for Shortcut Removal and Generation | Nina Weng*, Paraskevas Pegios, Eike Petersen, Aasa Feragen, Siavash Arjomand Bigdeli | N/A | |
| UMERegRobust – Universal Manifold Embedding Compatible Features for Robust Point Cloud Registration | Yuval Haitman*, Amit Efraim, Joseph M Francos | N/A | |
| Non-transferable Pruning | Ruyi Ding*, Lili Su, A. Adam Ding, Yunsi Fei | N/A | |
| A Compact Dynamic 3D Gaussian Representation for Real-Time Dynamic View Synthesis | Kai Katsumata*, Duc Minh Vo, Hideki Nakayama | N/A | |
| Fast Context-Based Low-Light Image Enhancement via Neural Implicit Representations | Tomáš Chobola, Yu Liu, Hanyi Zhang, Julia A Schnabel, Tingying Peng | N/A | |
| Toward Open Vocabulary Aerial Object Detection with CLIP-Activated Student-Teacher Learning | Yan Li, Weiwei Guo, Xue Yang, Ning Liao, Dunyun He, Jiaqi Zhou, Wenxian Yu | N/A | |
| Affine steerers for structured keypoint description | Georg Bökman*, Johan Edstedt, Michael Felsberg, Fredrik Kahl | N/A | |
| Score Distillation Sampling with Learned Manifold Corrective | Thiemo Alldieck*, Nikos Kolotouros, Cristian Sminchisescu | N/A | |
| FipTR: A Simple yet Effective Transformer Framework for Future Instance Prediction in Autonomous Driving | Xingtai Gui*, Tengteng Huang, Haonan Shao, Haotian Yao, Chi Zhang | N/A | |
| Benchmarking the Robustness of Cross-view Geo-localization Models | Qingwang Zhang, Yingying Zhu* | N/A | |
| GroCo: Ground Constraint for Metric Self-Supervised Monocular Depth | Aurélien Cecille*, Stefan Duffner, Franck Davoine, Thibault Neveu, Rémi Agier | N/A | |
| SUMix: Mixup with Semantic and Uncertain Information | Huafeng Qin, Xin Jin*, Hongyu Zhu, Hongchao Liao, Mounim A. El Yacoubi, Xinbo Gao | N/A | |
| Flatness-aware Sequential Learning Generates Resilient Backdoors | Hoang Pham*, The-Anh Ta, Anh T Tran, Khoa D Doan | N/A | |
| Iterative Ensemble Training with Anti-Gradient Control for Mitigating Memorization in Diffusion Models | Xiao Liu, Xiaoliu Guan, Yu Wu, Jiaxu Miao | N/A | |
| IFTR: An Instance-Level Fusion Transformer for Visual Collaborative Perception | Shaohong Wang, Lu Bin, Xinyu Xiao, Zhiyu Xiang, Hangguan Shan, Eryun Liu* | N/A | |
| DiffClass: Diffusion-Based Class Incremental Learning | Zichong Meng, Jie Zhang, Changdi Yang, Zheng Zhan, Pu Zhao, Yanzhi Wang | N/A | |
| Convex Relaxations for Manifold-Valued Markov Random Fields with Approximation Guarantees | Robin Kenis*, Emanuel Laude, Panagiotis Patrinos | N/A | |
| Instant 3D Human Avatar Generation using Image Diffusion Models | Nikos Kolotouros*, Thiemo Alldieck, Enric Corona, Eduard Gabriel Bazavan, Cristian Sminchisescu | N/A | |
| PromptFusion: Decoupling Stability and Plasticity for Continual Learning | Haoran Chen, Zuxuan Wu*, Xintong Han, Menglin Jia, Yu-Gang Jiang | N/A | |
| Improving Geo-diversity of Generated Images with Contextualized Vendi Score Guidance | Reyhane Askari Hemmat, Melissa Hall, Alicia Yi Sun, Candace Ross, Michal Drozdzal, Adriana Romero-Soriano | N/A | |
| Adapting to Shifting Correlations with Unlabeled Data Calibration | Minh Nguyen*, Alan Q Wang, Heejong Kim, Mert Sabuncu | N/A | |
| Masked Generative Video-to-Audio Transformers with Enhanced Synchronicity | Santiago Pascual, Chunghsin Yeh*, Ioannis Tsiamas, Joan Serrà | N/A | |
| Information Bottleneck Based Data Correction in Continual Learning | Shuai Chen, Mingyi Zhang, Junge Zhang, Kaiqi Huang | N/A | |
| On Spectral Properties of Gradient-based Explanation Methods | Amir Mehrpanah*, Erik Englesson, Hossein Azizpour | N/A | |
| Contextual Correspondence Matters: Bidirectional Graph Matching for Video Summarization | Yunzuo Zhang*, Yameng Liu | N/A | |
| O2V-Mapping: Online Open-Vocabulary Mapping with Neural Implicit Representation | Muer Tie, Julong Wei, Zhengjun Wang, Ke Wu, Shanshuai Yuan, Kaizhao Zhang, Jie Jia, Jieru Zhao, Zhongxue Gan, Wenchao Ding | N/A | |
| Dataset Distillation by Automatic Training Trajectories | Dai Liu, Jindong Gu, Hu Cao, Carsten Trinitis, Martin Schulz* | N/A | |
| FAFA: Frequency-Aware Flow-Aided Self-Supervision for Underwater Object Pose Estimation | Jingyi Tang, Gu Wang, Zeyu Chen, Shengquan Li, Xiu Li, Xiangyang Ji | N/A | |
| EMIE-MAP: Large-Scale Road Surface Reconstruction Based on Explicit Mesh and Implicit Encoding | Wenhua Wu, Qi Wang, Guangming Wang, Junping Wang, Tiankun Zhao, Yang Liu, Dongchao Gao, Zhe Liu, Hesheng Wang | N/A | |
| UniIR: Training and Benchmarking Universal Multimodal Information Retrievers | Cong Wei*, Yang Chen, Haonan Chen, Hexiang Hu, Ge Zhang, Jie Fu, Alan Ritter, Wenhu Chen | N/A | |
| SSL-Cleanse: Trojan Detection and Mitigation in Self-Supervised Learning | Mengxin Zheng*, Jiaqi Xue, Zihao Wang, Xun Chen, Qian Lou, Lei Jiang, Xiaofeng Wang | N/A | |
| Skews in the Phenomenon Space Hinder Generalization in Text-to-Image Generation | Yingshan Chang*, Yasi Zhang, Zhiyuan Fang, Ying Nian Wu, Yonatan Bisk, Feng Gao | N/A | |
| Bones Can't Be Triangles: Accurate and Efficient Vertebrae Keypoint Estimation through Collaborative Error Revision | Jinhee Kim, Taesung Kim, Jaegul Choo* | N/A | |
| latentSplat: Autoencoding Variational Gaussians for Fast Generalizable 3D Reconstruction | Christopher Wewer, Kevin Raj, Eddy Ilg, Bernt Schiele, Jan E. Lenssen | N/A | |
| HyperSpaceX: Radial and Angular Exploration of HyperSpherical Dimensions | Chiranjeev Chiranjeev, Muskan Dosi, Kartik Thakral, Mayank Vatsa*, Richa Singh | N/A | |
| InstructGIE: Towards Generalizable Image Editing | Zichong Meng, Changdi Yang, Jun Liu, Hao Tang, Pu Zhao, Yanzhi Wang* | N/A | |
| HandDAGT: A Denoising Adaptive Graph Transformer for 3D Hand Pose Estimation | Wencan Cheng, Eunji Kim, Jong Hwan Ko* | N/A | |
| Navigating Text-to-Image Generative Bias across Indic Languages | Surbhi Mittal, Arnav Sudan, Mayank Vatsa, Richa Singh, Tamar Glaser, Tal Hassner | N/A | |
| Correspondence-Free SE(3) Point Cloud Registration in RKHS via Unsupervised Equivariant Learning | Ray Zhang*, Zheming Zhou, Min Sun, Omid Ghasemalizadeh, Cheng-Hao Kuo, Ryan M. Eustice, Maani Ghaffari Jadidi, Arnie Sen | N/A | |
| CTRLorALTer: Conditional LoRAdapter for Efficient 0-Shot Control & Altering of T2I Models | Nick Stracke*, Stefan Andreas Baumann, Joshua Susskind, Miguel Angel Bautista, Bjorn Ommer | N/A | |
| Nickel and Diming Your GAN: A Dual-Method Approach to Enhancing GAN Efficiency via Knowledge Distillation | Sangyeop Yeo, Yoojin Jang, Jaejun Yoo* | N/A | |
| VividDreamer: Invariant Score Distillation for Hyper-Realistic Text-to-3D Generation | Wenjie Zhuo*, Fan Ma, Hehe Fan, Yi Yang | N/A | |
| A Framework for Efficient Model Evaluation through Stratification, Sampling, and Estimation | Riccardo Fogliato*, Pratik Patil, Mathew Monfort, Pietro Perona | N/A | |
| Towards Scene Graph Anticipation | Rohith Peddi*, Saksham Singh, Saurabh ., Parag Singla, Vibhav Gogate | N/A | |
| Non-Line-of-Sight Estimation of Fast Human Motion with Slow Scanning Imagers | Javier Grau Chopite, Patrick Hähn, Matthias B Hullin | N/A | |
| Distributed Semantic Segmentation with Efficient Joint Source and Task Decoding | Danish Nazir*, Timo Bartels, Jan Piewek, Thorsten Bagdonat, Tim Fingscheidt | N/A | |
| NePhi: Neural Deformation Fields for Approximately Diffeomorphic Medical Image Registration | Lin Tian*, Thomas H Greer, Raul San Jose Estepar, Roni Sengupta, Marc Niethammer | N/A | |
| Aligning Neuronal Coding of Dynamic Visual Scenes with Foundation Vision Models | Rining Wu, Feixiang Zhou, Ziwei Yin, Jian Liu | N/A | |
| Image Manipulation Detection With Implicit Neural Representation and Limited Supervision | Zhenfei Zhang*, Mingyang Li, Xin Li, Ming-Ching Chang, Jun-Wei Hsieh | N/A | |
| Scalar Function Topology Divergence: Comparing Topology of 3D Objects | Ilya Trofimov*, Daria Voronkova, Eduard Tulchinskii, Evgeny Burnaev, Serguei Barannikov | N/A | |
| Introducing Routing Functions to Vision-Language Parameter-Efficient Fine-Tuning with Low-Rank Bottlenecks | Tingyu Qu*, Tinne Tuytelaars, Marie-Francine Moens | N/A | |
| Concept Arithmetics for Circumventing Concept Inhibition in Diffusion Models | Vitali Petsiuk*, Kate Saenko | N/A | |
| DeTra: A Unified Model for Object Detection and Trajectory Forecasting | Sergio Casas*, Ben T Agro, Jiageng Mao, Thomas Gilles, Alexander Y Cui, Enxu Li, Raquel Urtasun | N/A | |
| ControlNet-XS: Rethinking the Control of Text-to-Image Diffusion Models as Feedback-Control Systems | Denis Zavadski*, Johann-Friedrich Feiden, Carsten Rother | N/A | |
| Adaptive Bounding Box Uncertainties via Two-Step Conformal Prediction | Alexander Timans*, Christoph-Nikolas Straehle, Kaspar Sakmann, Eric Nalisnick | N/A | |
| Common Sense Reasoning for Deep Fake Detection | Yue Zhang, Ben Colman, Xiao Guo, Ali Shahriyari, Gaurav Bharaj | N/A | |
| Let the Avatar Talk using Texts without Paired Training Data | Xiuzhe Wu, Yang-Tian Sun, Handi Chen, Hang Zhou, Jingdong Wang, Zhengzhe Liu, Xiaojuan Qi* | N/A | |
| NeRF-MAE: Masked AutoEncoders for Self-Supervised 3D Representation Learning for Neural Radiance Fields | Muhammad Zubair Irshad*, Sergey Zakharov, Vitor Guizilini, Adrien Gaidon, Zsolt Kira, Rares Ambrus | N/A | |
| GOEmbed: Gradient Origin Embeddings for Representation Agnostic 3D Feature Learning | Animesh Karnewar, Roman Shapovalov, Tom Monnier, Andrea Vedaldi, Niloy J. Mitra, David Novotny* | N/A | |
| Causal Subgraphs and Information Bottlenecks: Redefining OOD Robustness in Graph Neural Networks | Weizhi An, Wenliang Zhong, Feng Jiang, Hehuan Ma, Junzhou Huang* | N/A | |
| AddBiomechanics Dataset: Capturing the Physics of Human Motion at Scale | Keenon Werling*, Janelle M Kaneda, Tian Tan, Rishi Agarwal, Six Skov, Tom Van Wouwe, Scott Uhlrich, Scott Delp, Karen Liu, Nicholas A Bianco, Carmichael Ong, Antoine Falisse, Shardul Sapkota, Aidan Jai Chandra, Joshua A Carter, Ezio Preatoni, Benjamin J Fregly, Jennifer Hicks | N/A | |
| How to Train the Teacher Model for Effective Knowledge Distillation | Shayan Mohajer Hamidi*, Xizhen Deng, Renhao Tan, Linfeng Ye, Ahmed Hussein Salamah | N/A | |
| Tight and Efficient Upper Bound on Spectral Norm of Convolutional Layers | Ekaterina Grishina*, Mikhail Gorbunov, Maxim Rakhuba | N/A | |
| Deciphering the Role of Representation Disentanglement: Investigating Compositional Generalization in CLIP Models | Reza Abbasi, Mohammad Rohban, Mahdieh Soleymani Baghshah* | N/A | |
| Modality Translation for Object Detection Adaptation without forgetting prior knowledge | Heitor Rapela Medeiros*, Masih Aminbeidokhti, Fidel A Guerrero Pena, David Latortue, Eric Granger, Marco Pedersoli | N/A | |
| FroSSL: Frobenius Norm Minimization for Efficient Multiview Self-Supervised Learning | Oscar Skean*, Aayush Dhakal, Nathan Jacobs, Luis G Sanchez Giraldo | N/A | |
| Learning Multimodal Latent Generative Models with Energy-Based Prior | Shiyu Yuan*, Jiali Cui, Hanao Li, Tian Han | N/A | |
| On Learning Discriminative Features from Synthesized Data for Self-Supervised Fine-Grained Visual Recognition | Zihu Wang*, Lingqiao Liu, Scott Ricardo Figueroa Weston, Samuel Tian, Peng Li | N/A | |
| LaWa: Using Latent Space for In-Generation Image Watermarking | Ahmad Rezaei, Mohammad Akbari, Saeed Ranjbar Alvar, Arezou Fatemi, Yong Zhang* | N/A | |
| Hierarchical Conditioning of Diffusion Models Using Tree-of-Life for Studying Species Evolution | Mridul Khurana, Arka Daw, M. Maruf, Josef C. Uyeda, Wasila Dahdul, Caleb Charpentier, Yasin Bakış, Henry L. Bart Jr., Paula M. Mabee, Hilmar Lapp, James P. Balhoff, Wei-Lun Chao, Charles Stewart, Tanya Berger-Wolf, Anuj Karpatne | N/A | |
| Markov Knowledge Distillation: Make Nasty Teachers trained by Self-undermining Knowledge Distillation Fully Distillable | En-hui Yang, Linfeng Ye* | N/A | |
| Co-speech Gesture Video Generation with 3D Human Meshes | Aniruddha Mahapatra, Richa Mishra, Ziyi Chen, Boyang Ding, Renda Li, Shoulei Wang, Jun-Yan Zhu, Peng Chang, Mei Han, Jing Xiao | N/A | |
| When and How do negative prompts take effect? | Yuanhao Ban, Ruochen Wang, Tianyi Zhou, Minhao Cheng, Boqing Gong, Cho-Jui Hsieh* | N/A | |
| GS2Mesh: Surface Reconstruction from Gaussian Splatting via Novel Stereo Views | Yaniv Wolf*, Amit Bracha, Ron Kimmel | N/A | |
| CARFF: Conditional Auto-encoded Radiance Field for 3D Scene Forecasting | Jiezhi Yang, Khushi P Desai, Charles Packer, Harshil Bhatia, Nicholas Rhinehart, Rowan McAllister, Joseph E Gonzalez | N/A | |
| Snuffy: Efficient Whole Slide Image Classifier | Hossein Jafarinia, Alireza Alipanah, Saeed Razavi, Nahal Mirzaie, Mohammad Hossein Rohban | N/A | |
| Learning to Build by Building Your Own Instructions | Aaron T Walsman*, Muru Zhang, Adam Fishman, Ali Farhadi, Dieter Fox | N/A | |
| Exploring Active Learning in Meta-Learning: Enhancing Context Set Labeling | Wonho Bae, Jing Wang, Danica J. Sutherland* | N/A | |
| BlenderAlchemy: Editing 3D Graphics with Vision-Language Models | Ian Huang*, Guandao Yang, Leonidas Guibas | N/A | |
| DεpS: Delayed ε-Shrinking for Faster Once-For-All Training | Aditya Annavajjala, Alind Khare, Animesh Agrawal, Igor Fedorov, Hugo M Latapie, Myungjin Lee, Alexey Tumanov | N/A | |
| Learning Depth from Focus in the Wild | Changyeon Won, Hae-Gon Jeon | N/A | |
| Learning-Based Point Cloud Registration for 6D Object Pose Estimation in the Real World | Zheng Dang, Lizhou Wang, Yu Guo, Mathieu Salzmann | N/A | |
| An End-to-End Transformer Model for Crowd Localization | Dingkang Liang, Wei Xu, Xiang Bai | N/A | |
| Few-Shot Single-View 3D Reconstruction with Memory Prior Contrastive Network | Zhen Xing, Yijiang Chen, Zhixin Ling, Xiangdong Zhou, Yu Xiang | N/A | |
| DID-M3D: Decoupling Instance Depth for Monocular 3D Object Detection | Liang Peng, Xiaopei Wu, Zheng Yang, Haifeng Liu, Deng Cai | N/A | |
| Adaptive Co-Teaching for Unsupervised Monocular Depth Estimation | Weisong Ren, Lijun Wang, Yongri Piao, Miao Zhang, Huchuan Lu, Ting Liu | N/A | |
| Fusing Local Similarities for Retrieval-Based 3D Orientation Estimation of Unseen Objects | Chen Zhao, Yinlin Hu, Mathieu Salzmann | N/A | |
| Lidar Point Cloud Guided Monocular 3D Object Detection | Liang Peng, Fei Liu, Zhengxu Yu, Senbo Yan, Dan Deng, Zheng Yang, Haifeng Liu, Deng Cai | N/A | |
| Structural Causal 3D Reconstruction | Weiyang Liu, Zhen Liu, Liam Paull, Adrian Weller, Bernhard Schölkopf | N/A | |
| 3D Human Pose Estimation Using Möbius Graph Convolutional Networks | Niloofar Azizi, Horst Possegger, Emanuele Rodolà, Horst Bischof | N/A | |
| Learning to Train a Point Cloud Reconstruction Network without Matching | Tianxin Huang, Xuemeng Yang, Jiangning Zhang, Jinhao Cui, Hao Zou, Jun Chen, Xiangrui Zhao, Yong Liu | N/A | |
| PanoFormer: Panorama Transformer for Indoor 360° Depth Estimation | Zhijie Shen, Chunyu Lin, Kang Liao, Lang Nie, Zishuo Zheng, Yao Zhao | N/A | |
| Self-supervised Human Mesh Recovery with Cross-Representation Alignment | Xuan Gong, Meng Zheng, Benjamin Planche, Srikrishna Karanam, Terrence Chen, David Doermann, Ziyan Wu | N/A | |
| AlignSDF: Pose-Aligned Signed Distance Fields for Hand-Object Reconstruction | Zerui Chen, Yana Hasson, Cordelia Schmid, Ivan Laptev | N/A | |
| A Reliable Online Method for Joint Estimation of Focal Length and Camera Rotation | Yiming Qian, James H. Elder | N/A | |
| PS-NeRF: Neural Inverse Rendering for Multi-View Photometric Stereo | Wenqi Yang, Guanying Chen, Chaofeng Chen, Zhenfang Chen, Kwan-Yee K. Wong | N/A | |
| Share with Thy Neighbors: Single-View Reconstruction by Cross-Instance Consistency | Tom Monnier, Matthew Fisher, Alexei A. Efros, Mathieu Aubry | N/A | |
| Towards Comprehensive Representation Enhancement in Semantics-Guided Self-Supervised Monocular Depth Estimation | Jingyuan Ma, Xiangyu Lei, Nan Liu, Xian Zhao, Shiliang Pu | N/A | |
| AvatarCap: Animatable Avatar Conditioned Monocular Human Volumetric Capture | Zhe Li, Zerong Zheng, Hongwen Zhang, Chaonan Ji, Yebin Liu | N/A | |
| Cross-Attention of Disentangled Modalities for 3D Human Mesh Recovery with Transformers | Junhyeong Cho, Kim Youwang, Tae-Hyun Oh | N/A | |
| GeoRefine: Self-Supervised Online Depth Refinement for Accurate Dense Mapping | Pan Ji, Qingan Yan, Yuxin Ma, Yi Xu | N/A | |
| Multi-modal Masked Pre-training for Monocular Panoramic Depth Completion | Zhiqiang Yan, Xiang Li, Kun Wang, Zhenyu Zhang, Jun Li, Jian Yang | N/A | |
| GitNet: Geometric Prior-Based Transformation for Birds-Eye-View Segmentation | Shi Gong, Xiaoqing Ye, Xiao Tan, Jingdong Wang, Errui Ding, Yu Zhou, Xiang Bai | N/A | |
| Learning Visibility for Robust Dense Human Body Estimation | Chun-Han Yao, Jimei Yang, Duygu Ceylan, Yi Zhou, Yang Zhou, Ming-Hsuan Yang | N/A | |
| Towards High-Fidelity Single-View Holistic Reconstruction of Indoor Scenes | Haolin Liu, Yujian Zheng, Guanying Chen, Shuguang Cui, Xiaoguang Han | N/A | |
| CompNVS: Novel View Synthesis with Scene Completion | Zuoyue Li, Tianxing Fan, Zhenqiang Li, Zhaopeng Cui, Yoichi Sato, Marc Pollefeys, Martin R. Oswald | N/A | |
| SketchSampler: Sketch-Based 3D Reconstruction via View-Dependent Depth Sampling | Chenjian Gao, Qian Yu, Lu Sheng, Yi-Zhe Song, Dong Xu | N/A | |
| LocalBins: Improving Depth Estimation by Learning Local Distributions | Shariq Farooq Bhat, Ibraheem Alhashim, Peter Wonka | N/A | |
| 2D GANs Meet Unsupervised Single-View 3D Reconstruction | Feng Liu, Xiaoming Liu | N/A | |
| InfiniteNature-Zero: Learning Perpetual View Generation of Natural Scenes from Single Images | Zhengqi Li, Qianqian Wang, Noah Snavely, Angjoo Kanazawa | N/A | |
| Semi-Supervised Single-View 3D Reconstruction via Prototype Shape Priors | Zhen Xing, Hengduo Li, Zuxuan Wu, Yu-Gang Jiang | N/A | |
| Bilateral Normal Integration | Xu Cao, Hiroaki Santo, Boxin Shi, Fumio Okura, Yasuyuki Matsushita | N/A | |
| S2Contact: Graph-Based Network for 3D Hand-Object Contact Estimation with Semi-Supervised Learning | Tze Ho Elden Tse, Zhongqun Zhang, Kwang In Kim, Aleš Leonardis, Feng Zheng, Hyung Jin Chang | N/A | |
| SC-wLS: Towards Interpretable Feed-Forward Camera Re-localization | Xin Wu, Hao Zhao, Shunkai Li, Yingdian Cao, Hongbin Zha | N/A | |
| FloatingFusion: Depth from ToF and Image-Stabilized Stereo Cameras | Andreas Meuleman, Hakyeong Kim, James Tompkin, Min H. Kim | N/A | |
| DELTAR: Depth Estimation from a Light-Weight ToF Sensor and RGB Image | Yijin Li, Xinyang Liu, Wenqi Dong, Han Zhou, Hujun Bao, Guofeng Zhang, Yinda Zhang, Zhaopeng Cui | N/A | |
| 3D Room Layout Estimation from a Cubemap of Panorama Image via Deep Manhattan Hough Transform | Yining Zhao, Chao Wen, Zhou Xue, Yue Gao | N/A | |
| RBP-Pose: Residual Bounding Box Projection for Category-Level Pose Estimation | Ruida Zhang, Yan Di, Zhiqiang Lou, Fabian Manhardt, Federico Tombari, Xiangyang Ji | N/A | |
| Monocular 3D Object Reconstruction with GAN Inversion | Junzhe Zhang, Daxuan Ren, Zhongang Cai, Chai Kiat Yeo, Bo Dai, Chen Change Loy | N/A | |
| Map-Free Visual Relocalization: Metric Pose Relative to a Single Image | Eduardo Arnold, Jamie Wynn, Sara Vicente, Guillermo Garcia-Hernando, Aron Monszpart, Victor Prisacariu, Daniyar Turmukhambetov, Eric Brachmann | N/A | |
| Self-Distilled Feature Aggregation for Self-Supervised Monocular Depth Estimation | Zhengming Zhou, Qiulei Dong | N/A | |
| Planes vs. Chairs: Category-Guided 3D Shape Learning without Any 3D Cues | Zixuan Huang, Stefan Stojanov, Anh Thai, Varun Jampani, James M. Rehg | N/A | |
| MHR-Net: Multiple-Hypothesis Reconstruction of Non-rigid Shapes from 2D Views | Haitian Zeng, Xin Yu, Jiaxu Miao, Yi Yang | N/A | |
| Depth Map Decomposition for Monocular Depth Estimation | Jinyoung Jun, Jae-Han Lee, Chul Lee, Chang-Su Kim | N/A | |
| Monitored Distillation for Positive Congruent Depth Completion | Tian Yu Liu, Parth Agrawal, Allison Chen, Byung-Woo Hong, Alex Wong | N/A | |
| Resolution-Free Point Cloud Sampling Network with Data Distillation | Tianxin Huang, Jiangning Zhang, Jun Chen, Yuang Liu, Yong Liu | N/A | |
| Organic Priors in Non-rigid Structure from Motion | Suryansh Kumar, Luc Van Gool | N/A | |
| Perspective Flow Aggregation for Data-Limited 6D Object Pose Estimation | Yinlin Hu, Pascal Fua, Mathieu Salzmann | N/A | |
| DANBO: Disentangled Articulated Neural Body Representations via Graph Neural Networks | Shih-Yang Su, Timur Bagautdinov, Helge Rhodin | N/A | |
| CHORE: Contact, Human and Object REconstruction from a Single RGB Image | Xianghui Xie, Bharat Lal Bhatnagar, Gerard Pons-Moll | N/A | |
| Learned Vertex Descent: A New Direction for 3D Human Model Fitting | Enric Corona, Gerard Pons-Moll, Guillem Alenyà, Francesc Moreno-Noguer | N/A | |
| Self-Calibrating Photometric Stereo by Neural Inverse Rendering | Junxuan Li, Hongdong Li | N/A | |
| 3D Clothed Human Reconstruction in the Wild | Gyeongsik Moon, Hyeongjin Nam, Takaaki Shiratori, Kyoung Mu Lee | N/A | |
| Directed Ray Distance Functions for 3D Scene Reconstruction | Nilesh Kulkarni, Justin Johnson, David F. Fouhey | N/A | |
| Object Level Depth Reconstruction for Category Level 6D Object Pose Estimation from Monocular RGB Image | Zhaoxin Fan, Zhenbo Song, Jian Xu, Zhicheng Wang, Kejian Wu, Hongyan Liu, Jun He | N/A | |
| Uncertainty Quantification in Depth Estimation via Constrained Ordinal Regression | Dongting Hu, Liuhua Peng, Tingjin Chu, Xiaoxing Zhang, Yinian Mao, Howard Bondell, Mingming Gong | N/A | |
| CostDCNet: Cost Volume Based Depth Completion for a Single RGB-D Image | Jaewon Kam, Jungeon Kim, Soongjin Kim, Jaesik Park, Seungyong Lee | N/A | |
| ShAPO: Implicit Representations for Multi-Object Shape, Appearance, and Pose Optimization | Muhammad Zubair Irshad, Sergey Zakharov, Rareș Ambruș, Thomas Kollar, Zsolt Kira, Adrien Gaidon | N/A | |
| 3D Siamese Transformer Network for Single Object Tracking on Point Clouds | Le Hui, Lingpeng Wang, Linghua Tang, Kaihao Lan, Jin Xie, Jian Yang | N/A | |
| Object Wake-Up: 3D Object Rigging from a Single Image | Ji Yang, Xinxin Zuo, Sen Wang, Zhenbo Yu, Xingyu Li, Bingbing Ni, Minglun Gong, Li Cheng | N/A | |
| IntegratedPIFu: Integrated Pixel Aligned Implicit Function for Single-View Human Reconstruction | Kennard Yanting Chan, Guosheng Lin, Haiyu Zhao, Weisi Lin | N/A | |
| Realistic One-Shot Mesh-Based Head Avatars | Taras Khakhulin, Vanessa Sklyarova, Victor Lempitsky, Egor Zakharov | N/A | |
| A Kendall Shape Space Approach to 3D Shape Estimation from 2D Landmarks | Martha Paskin, Daniel Baum, Mason N. Dean, Christoph von Tycowicz | N/A | |
| Neural Light Field Estimation for Street Scenes with Differentiable Virtual Object Insertion | Zian Wang, Wenzheng Chen, David Acuna, Jan Kautz, Sanja Fidler | N/A | |
| Perspective Phase Angle Model for Polarimetric 3D Reconstruction | Guangcheng Chen, Li He, Yisheng Guan, Hong Zhang | N/A | |
| DeepShadow: Neural Shape from Shadow | Asaf Karnieli, Ohad Fried, Yacov Hel-Or | N/A | |
| Camera Auto-Calibration from the Steiner Conic of the Fundamental Matrix | Yu Liu, Hui Zhang | N/A | |
| Super-Resolution 3D Human Shape from a Single Low-Resolution Image | Marco Pesavento, Marco Volino, Adrian Hilton | N/A | |
| Minimal Neural Atlas: Parameterizing Complex Surfaces with Minimal Charts and Distortion | Weng Fei Low, Gim Hee Lee | N/A | |
| ExtrudeNet: Unsupervised Inverse Sketch-and-Extrude for Shape Parsing | Daxuan Ren, Jianmin Zheng, Jianfei Cai, Jiatong Li, Junzhe Zhang | N/A | |
| CATRE: Iterative Point Clouds Alignment for Category-Level Object Pose Refinement | Xingyu Liu, Gu Wang, Yi Li, Xiangyang Ji | N/A | |
| Optimization over Disentangled Encoding: Unsupervised Cross-Domain Point Cloud Completion via Occlusion Factor Manipulation | Jingyu Gong, Fengqi Liu, Jiachen Xu, Min Wang, Xin Tan, Zhizhong Zhang, Ran Yi, Haichuan Song, Yuan Xie, Lizhuang Ma | N/A | |
| Unsupervised Learning of 3D Semantic Keypoints with Mutual Reconstruction | Haocheng Yuan, Chen Zhao, Shichao Fan, Jiaxi Jiang, Jiaqi Yang | N/A | |
| MvDeCor: Multi-View Dense Correspondence Learning for Fine-Grained 3D Segmentation | Gopal Sharma, Kangxue Yin, Subhransu Maji, Evangelos Kalogerakis, Or Litany, Sanja Fidler | N/A | |
| SUPR: A Sparse Unified Part-Based Human Representation | Ahmed A. A. Osman, Timo Bolkart, Dimitrios Tzionas, Michael J. Black | N/A | |
| Revisiting Point Cloud Simplification: A Learnable Feature Preserving Approach | Rolandos Alexandros Potamias, Giorgos Bouritsas, Stefanos Zafeiriou | N/A | |
| Masked Autoencoders for Point Cloud Self-Supervised Learning | Yatian Pang, Wenxiao Wang, Francis E.H. Tay, Wei Liu, Yonghong Tian, Li Yuan | N/A | |
| Intrinsic Neural Fields: Learning Functions on Manifolds | Lukas Koestler, Daniel Grittner, Michael Moeller, Daniel Cremers, Zorah Lähner | N/A | |
| Skeleton-Free Pose Transfer for Stylized 3D Characters | Zhouyingcheng Liao, Jimei Yang, Jun Saito, Gerard Pons-Moll, Yang Zhou | N/A | |
| Masked Discrimination for Self-Supervised Learning on Point Clouds | Haotian Liu, Mu Cai, Yong Jae Lee | N/A | |
| FBNet: Feedback Network for Point Cloud Completion | Xuejun Yan, Hongyu Yan, Jingjing Wang, Hang Du, Zhihong Wu, Di Xie, Shiliang Pu, Li Lu | N/A | |
| Meta-Sampler: Almost-Universal yet Task-Oriented Sampling for Point Clouds | Ta-Ying Cheng, Qingyong Hu, Qian Xie, Niki Trigoni, Andrew Markham | N/A | |
| A Level Set Theory for Neural Implicit Evolution under Explicit Flows | Ishit Mehta, Manmohan Chandraker, Ravi Ramamoorthi | N/A | |
| Efficient Point Cloud Analysis Using Hilbert Curve | Wanli Chen, Xinge Zhu, Guojin Chen, Bei Yu | N/A | |
| TOCH: Spatio-Temporal Object-to-Hand Correspondence for Motion Refinement | Keyang Zhou, Bharat Lal Bhatnagar, Jan Eric Lenssen, Gerard Pons-Moll | N/A | |
| LaTeRF: Label and Text Driven Object Radiance Fields | Ashkan Mirzaei, Yash Kant, Jonathan Kelly, Igor Gilitschenski | N/A | |
| MeshMAE: Masked Autoencoders for 3D Mesh Data Analysis | Yaqian Liang, Shanshan Zhao, Baosheng Yu, Jing Zhang, Fazhi He | N/A | |
| Unsupervised Deep Multi-Shape Matching | Dongliang Cao, Florian Bernard | N/A | |
| Texturify: Generating Textures on 3D Shape Surfaces | Yawar Siddiqui, Justus Thies, Fangchang Ma, Qi Shan, Matthias Nießner, Angela Dai | N/A | |
| Autoregressive 3D Shape Generation via Canonical Mapping | An-Chieh Cheng, Xueting Li, Sifei Liu, Min Sun, Ming-Hsuan Yang | N/A | |
| PointTree: Transformation-Robust Point Cloud Encoder with Relaxed K-D Trees | Jun-Kun Chen, Yu-Xiong Wang | N/A | |
| UNIF: United Neural Implicit Functions for Clothed Human Reconstruction and Animation | Shenhan Qian, Jiale Xu, Ziwei Liu, Liqian Ma, Shenghua Gao | N/A | |
| PRIF: Primary Ray-Based Implicit Function | Brandon Y. Feng, Yinda Zhang, Danhang Tang, Ruofei Du, Amitabh Varshney | N/A | |
| Point Cloud Domain Adaptation via Masked Local 3D Structure Prediction | Hanxue Liang, Hehe Fan, Zhiwen Fan, Yi Wang, Tianlong Chen, Yu Cheng, Zhangyang Wang | N/A | |
| CLIP-Actor: Text-Driven Recommendation and Stylization for Animating Human Meshes | Kim Youwang, Kim Ji-Yeon, Tae-Hyun Oh | N/A | |
| PlaneFormers: From Sparse View Planes to 3D Reconstruction | Samir Agarwala, Linyi Jin, Chris Rockwell, David F. Fouhey | N/A | |
| Learning Implicit Templates for Point-Based Clothed Human Modeling | Siyou Lin, Hongwen Zhang, Zerong Zheng, Ruizhi Shao, Yebin Liu | N/A | |
| Exploring the Devil in Graph Spectral Domain for 3D Point Cloud Attacks | Qianjiang Hu, Daizong Liu, Wei Hu | N/A | |
| Structure-Aware Editable Morphable Model for 3D Facial Detail Animation and Manipulation | Jingwang Ling, Zhibo Wang, Ming Lu, Quan Wang, Chen Qian, Feng Xu | N/A | |
| MoFaNeRF: Morphable Facial Neural Radiance Field | Yiyu Zhuang, Hao Zhu, Xusen Sun, Xun Cao | N/A | |
| PointInst3D: Segmenting 3D Instances by Points | Tong He, Wei Yin, Chunhua Shen, Anton van den Hengel | N/A | |
| Cross-Modal 3D Shape Generation and Manipulation | Zezhou Cheng, Menglei Chai, Jian Ren, Hsin-Ying Lee, Kyle Olszewski, Zeng Huang, Subhransu Maji, Sergey Tulyakov | N/A | |
| Latent Partition Implicit with Surface Codes for 3D Representation | Chao Chen, Yu-Shen Liu, Zhizhong Han | N/A | |
| Implicit Field Supervision for Robust Non-rigid Shape Matching | Ramana Sundararaman, Gautam Pai, Maks Ovsjanikov | N/A | |
| Learning Self-Prior for Mesh Denoising Using Dual Graph Convolutional Networks | Shota Hattori, Tatsuya Yatagawa, Yutaka Ohtake, Hiromasa Suzuki | N/A | |
| diffConv: Analyzing Irregular Point Clouds with an Irregular View | Manxi Lin, Aasa Feragen | N/A | |
| PD-Flow: A Point Cloud Denoising Framework with Normalizing Flows | Aihua Mao, Zihui Du, Yu-Hui Wen, Jun Xuan, Yong-Jin Liu | N/A | |
| SeedFormer: Patch Seeds Based Point Cloud Completion with Upsample Transformer | Haoran Zhou, Yun Cao, Wenqing Chu, Junwei Zhu, Tong Lu, Ying Tai, Chengjie Wang | N/A | |
| DeepMend: Learning Occupancy Functions to Represent Shape for Repair | Nikolas Lamb, Sean Banerjee, Natasha Kholgade Banerjee | N/A | |
| A Repulsive Force Unit for Garment Collision Handling in Neural Networks | Qingyang Tan, Yi Zhou, Tuanfeng Wang, Duygu Ceylan, Xin Sun, Dinesh Manocha | N/A | |
| Shape-Pose Disentanglement Using SE(3)-Equivariant Vector Neurons | Oren Katzir, Dani Lischinski, Daniel Cohen-Or | N/A | |
| 3D Equivariant Graph Implicit Functions | Yunlu Chen, Basura Fernando, Hakan Bilen, Matthias Nießner, Efstratios Gavves | N/A | |
| PatchRD: Detail-Preserving Shape Completion by Learning Patch Retrieval and Deformation | Bo Sun, Vladimir G. Kim, Noam Aigerman, Qixing Huang, Siddhartha Chaudhuri | N/A | |
| 3D Shape Sequence of Human Comparison and Classification Using Current and Varifolds | Emery Pierson, Mohamed Daoudi, Sylvain Arguillere | N/A | |
| Conditional-Flow NeRF: Accurate 3D Modelling with Reliable Uncertainty Quantification | Jianxiong Shen, Antonio Agudo, Francesc Moreno-Noguer, Adria Ruiz | N/A | |
| Unsupervised Pose-Aware Part Decomposition for Man-Made Articulated Objects | Yuki Kawana, Yusuke Mukuta, Tatsuya Harada | N/A | |
| MeshUDF: Fast and Differentiable Meshing of Unsigned Distance Field Networks | Benoît Guillard, Federico Stella, Pascal Fua | N/A | |
| SPE-Net: Boosting Point Cloud Analysis via Rotation Robustness Enhancement | Zhaofan Qiu, Yehao Li, Yu Wang, Yingwei Pan, Ting Yao, Tao Mei | N/A | |
| The Shape Part Slot Machine: Contact-Based Reasoning for Generating 3D Shapes from Parts | Kai Wang, Paul Guerrero, Vladimir G. Kim, Siddhartha Chaudhuri, Minhyuk Sung, Daniel Ritchie | N/A | |
| Spatiotemporal Self-Attention Modeling with Temporal Patch Shift for Action Recognition | Wangmeng Xiang, Chao Li, Biao Wang, Xihan Wei, Xian-Sheng Hua, Lei Zhang | N/A | |
| Proposal-Free Temporal Action Detection via Global Segmentation Mask Learning | Sauradip Nag, Xiatian Zhu, Yi-Zhe Song, Tao Xiang | N/A | |
| Semi-Supervised Temporal Action Detection with Proposal-Free Masking | Sauradip Nag, Xiatian Zhu, Yi-Zhe Song, Tao Xiang | N/A | |
| Zero-Shot Temporal Action Detection via Vision-Language Prompting | Sauradip Nag, Xiatian Zhu, Yi-Zhe Song, Tao Xiang | N/A | |
| CycDA: Unsupervised Cycle Domain Adaptation to Learn from Image to Video | Wei Lin, Anna Kukleva, Kunyang Sun, Horst Possegger, Hilde Kuehne, Horst Bischof | N/A | |
| S2N: Suppression-Strengthen Network for Event-Based Recognition under Variant Illuminations | Zengyu Wan, Yang Wang, Ganchao Tan, Yang Cao, Zheng-Jun Zha | N/A | |
| CMD: Self-Supervised 3D Action Representation Learning with Cross-Modal Mutual Distillation | Yunyao Mao, Wengang Zhou, Zhenbo Lu, Jiajun Deng, Houqiang Li | N/A | |
| Expanding Language-Image Pretrained Models for General Video Recognition | Bolin Ni, Houwen Peng, Minghao Chen, Songyang Zhang, Gaofeng Meng, Jianlong Fu, Shiming Xiang, Haibin Ling | N/A | |
| Hunting Group Clues with Transformers for Social Group Activity Recognition | Masato Tamura, Rahul Vishwakarma, Ravigopal Vennelakanti | N/A | |
| Contrastive Positive Mining for Unsupervised 3D Action Representation Learning | Haoyuan Zhang, Yonghong Hou, Wenjing Zhang, Wanqing Li | N/A | |
| Target-Absent Human Attention | Zhibo Yang, Sounak Mondal, Seoyoung Ahn, Gregory Zelinsky, Minh Hoai, Dimitris Samaras | N/A | |
| Uncertainty-Based Spatial-Temporal Attention for Online Action Detection | Hongji Guo, Zhou Ren, Yi Wu, Gang Hua, Qiang Ji | N/A | |
| Iwin: Human-Object Interaction Detection via Transformer with Irregular Windows | Danyang Tu, Xiongkuo Min, Huiyu Duan, Guodong Guo, Guangtao Zhai, Wei Shen | N/A | |
| Rethinking Zero-Shot Action Recognition: Learning from Latent Atomic Actions | Yijun Qian, Lijun Yu, Wenhe Liu, Alexander G. Hauptmann | N/A | |
| Mining Cross-Person Cues for Body-Part Interactiveness Learning in HOI Detection | Xiaoqian Wu, Yong-Lu Li, Xinpeng Liu, Junyi Zhang, Yuzhe Wu, Cewu Lu | N/A | |
| Collaborating Domain-Shared and Target-Specific Feature Clustering for Cross-Domain 3D Action Recognition | Qinying Liu, Zilei Wang | N/A | |
| Is Appearance Free Action Recognition Possible? | Filip Ilic, Thomas Pock, Richard P. Wildes | N/A | |
| Learning Spatial-Preserved Skeleton Representations for Few-Shot Action Recognition | Ning Ma, Hongyi Zhang, Xuhui Li, Sheng Zhou, Zhen Zhang, Jun Wen, Haifeng Li, Jingjun Gu, Jiajun Bu | N/A | |
| Dual-Evidential Learning for Weakly-Supervised Temporal Action Localization | Mengyuan Chen, Junyu Gao, Shicai Yang, Changsheng Xu | N/A | |
| Global-Local Motion Transformer for Unsupervised Skeleton-Based Action Learning | Boeun Kim, Hyung Jin Chang, Jungho Kim, Jin Young Choi | N/A | |
| AdaFocusV3: On Unified Spatial-Temporal Dynamic Video Recognition | Yulin Wang, Yang Yue, Xinhong Xu, Ali Hassani, Victor Kulikov, Nikita Orlov, Shiji Song, Humphrey Shi, Gao Huang | N/A | |
| Panoramic Human Activity Recognition | Ruize Han, Haomin Yan, Jiacheng Li, Songmiao Wang, Wei Feng, Song Wang | N/A | |
| Delving into Details: Synopsis-to-Detail Networks for Video Recognition | Shuxian Liang, Xu Shen, Jianqiang Huang, Xian-Sheng Hua | N/A | |
| A Generalized & Robust Framework for Timestamp Supervision in Temporal Action Segmentation | Rahul Rahaman, Dipika Singhania, Alexandre Thiery, Angela Yao | N/A | |
| Few-Shot Action Recognition with Hierarchical Matching and Contrastive Learning | Sipeng Zheng, Shizhe Chen, Qin Jin | N/A | |
| PrivHAR: Recognizing Human Actions from Privacy-Preserving Lens | Carlos Hinojosa, Miguel Marquez, Henry Arguello, Ehsan Adeli, Li Fei-Fei, Juan Carlos Niebles | N/A | |
| Scale-Aware Spatio-Temporal Relation Learning for Video Anomaly Detection | Guoqiu Li, Guanxiong Cai, Xingyu Zeng, Rui Zhao | N/A | |
| Compound Prototype Matching for Few-Shot Action Recognition | Yifei Huang, Lijin Yang, Yoichi Sato | N/A | |
| Continual 3D Convolutional Neural Networks for Real-Time Processing of Videos | Lukas Hedegaard, Alexandros Iosifidis | N/A | |
| Dynamic Spatio-Temporal Specialization Learning for Fine-Grained Action Recognition | Tianjiao Li, Lin Geng Foo, Qiuhong Ke, Hossein Rahmani, Anran Wang, Jinghua Wang, Jun Liu | N/A | |
| Dynamic Local Aggregation Network with Adaptive Clusterer for Anomaly Detection | Zhiwei Yang, Peng Wu, Jing Liu, Xiaotao Liu | N/A | |
| Action Quality Assessment with Temporal Parsing Transformer | Yang Bai, Desen Zhou, Songyang Zhang, Jian Wang, Errui Ding, Yu Guan, Yang Long, Jingdong Wang | N/A | |
| Entry-Flipped Transformer for Inference and Prediction of Participant Behavior | Bo Hu, Tat-Jen Cham | N/A | |
| Pairwise Contrastive Learning Network for Action Quality Assessment | Mingzhe Li, Hong-Bo Zhang, Qing Lei, Zongwen Fan, Jinghua Liu, Ji-Xiang Du | N/A | |
| Geometric Features Informed Multi-Person Human-Object Interaction Recognition in Videos | Tanqiu Qiao, Qianhui Men, Frederick W. B. Li, Yoshiki Kubotani, Shigeo Morishima, Hubert P. H. Shum | N/A | |
| ActionFormer: Localizing Moments of Actions with Transformers | Chen-Lin Zhang, Jianxin Wu, Yin Li | N/A | |
| SocialVAE: Human Trajectory Prediction Using Timewise Latents | Pei Xu, Jean-Bernard Hayet, Ioannis Karamouzas | N/A | |
| Shape Matters: Deformable Patch Attack | Zhaoyu Chen, Bo Li, Shuang Wu, Jianghe Xu, Shouhong Ding, Wenqiang Zhang | N/A | |
| Frequency Domain Model Augmentation for Adversarial Attack | Yuyang Long, Qilong Zhang, Boheng Zeng, Lianli Gao, Xianglong Liu, Jian Zhang, Jingkuan Song | N/A | |
| Prior-Guided Adversarial Initialization for Fast Adversarial Training | Xiaojun Jia, Yong Zhang, Xingxing Wei, Baoyuan Wu, Ke Ma, Jue Wang, Xiaochun Cao | N/A | |
| Enhanced Accuracy and Robustness via Multi-Teacher Adversarial Distillation | Shiji Zhao, Jie Yu, Zhenlong Sun, Bo Zhang, Xingxing Wei | N/A | |
| LGV: Boosting Adversarial Example Transferability from Large Geometric Vicinity | Martin Gubri, Maxime Cordy, Mike Papadakis, Yves Le Traon, Koushik Sen | N/A | |
| A Large-Scale Multiple-Objective Method for Black-Box Attack against Object Detection | Siyuan Liang, Longkang Li, Yanbo Fan, Xiaojun Jia, Jingzhi Li, Baoyuan Wu, Xiaochun Cao | N/A | |
| GradAuto: Energy-Oriented Attack on Dynamic Neural Networks | Jianhong Pan, Qichen Zheng, Zhipeng Fan, Hossein Rahmani, Qiuhong Ke, Jun Liu | N/A | |
| A Spectral View of Randomized Smoothing under Common Corruptions: Benchmarking and Improving Certified Robustness | Jiachen Sun, Akshay Mehra, Bhavya Kailkhura, Pin-Yu Chen, Dan Hendrycks, Jihun Hamm, Z. Morley Mao | N/A | |
| Improving Adversarial Robustness of 3D Point Cloud Classification Models | Guanlin Li, Guowen Xu, Han Qiu, Ruan He, Jiwei Li, Tianwei Zhang | N/A | |
| Learning Extremely Lightweight and Robust Model with Differentiable Constraints on Sparsity and Condition Number | Xian Wei, Yangyu Xu, Yanhui Huang, Hairong Lv, Hai Lan, Mingsong Chen, Xuan Tang | N/A | |
| RIBAC: Towards Robust and Imperceptible Backdoor Attack against Compact DNN | Huy Phan, Cong Shi, Yi Xie, Tianfang Zhang, Zhuohang Li, Tianming Zhao, Jian Liu, Yan Wang, Yingying Chen, Bo Yuan | N/A | |
| Boosting Transferability of Targeted Adversarial Examples via Hierarchical Generative Networks | Xiao Yang, Yinpeng Dong, Tianyu Pang, Hang Su, Jun Zhu | N/A | |
| Adaptive Image Transformations for Transfer-Based Adversarial Attack | Zheng Yuan, Jie Zhang, Shiguang Shan | N/A | |
| Generative Multiplane Images: Making a 2D GAN 3D-Aware | Xiaoming Zhao, Fangchang Ma, David Güera, Zhile Ren, Alexander G. Schwing, Alex Colburn | N/A | |
| AdvDO: Realistic Adversarial Attacks for Trajectory Prediction | Yulong Cao, Chaowei Xiao, Anima Anandkumar, Danfei Xu, Marco Pavone | N/A | |
| Adversarial Contrastive Learning via Asymmetric InfoNCE | Qiying Yu, Jieming Lou, Xianyuan Zhan, Qizhang Li, Wangmeng Zuo, Yang Liu, Jingjing Liu | N/A | |
| One Size Does NOT Fit All: Data-Adaptive Adversarial Training | Shuo Yang, Chang Xu | N/A | |
| UniCR: Universally Approximated Certified Robustness via Randomized Smoothing | Hanbin Hong, Binghui Wang, Yuan Hong | N/A | |
| Hardly Perceptible Trojan Attack against Neural Networks with Bit Flips | Jiawang Bai, Kuofeng Gao, Dihong Gong, Shu-Tao Xia, Zhifeng Li, Wei Liu | N/A | |
| Robust Network Architecture Search via Feature Distortion Restraining | Yaguan Qian, Shenghui Huang, Bin Wang, Xiang Ling, Xiaohui Guan, Zhaoquan Gu, Shaoning Zeng, Wujie Zhou, Haijiang Wang | N/A | |
| SecretGen: Privacy Recovery on Pre-trained Models via Distribution Discrimination | Zhuowen Yuan, Fan Wu, Yunhui Long, Chaowei Xiao, Bo Li | N/A | |
| Triangle Attack: A Query-Efficient Decision-Based Adversarial Attack | Xiaosen Wang, Zeliang Zhang, Kangheng Tong, Dihong Gong, Kun He, Zhifeng Li, Wei Liu | N/A | |
| Data-Free Backdoor Removal Based on Channel Lipschitzness | Runkai Zheng, Rongjun Tang, Jianze Li, Li Liu | N/A | |
| Black-Box Dissector: Towards Erasing-Based Hard-Label Model Stealing Attack | Yixu Wang, Jie Li, Hong Liu, Yan Wang, Yongjian Wu, Feiyue Huang, Rongrong Ji | N/A | |
| Learning Energy-Based Models with Adversarial Training | Xuwang Yin, Shiying Li, Gustavo K. Rohde | N/A | |
| Adversarial Label Poisoning Attack on Graph Neural Networks via Label Propagation | Ganlin Liu, Xiaowei Huang, Xinping Yi | N/A | |
| Revisiting Outer Optimization in Adversarial Training | Ali Dabouei, Fariborz Taherkhani, Sobhan Soleymani, Nasser M. Nasrabadi | N/A | |
| Zero-Shot Attribute Attacks on Fine-Grained Recognition Models | Nasim Shafiee, Ehsan Elhamifar | N/A | |
| Towards Effective and Robust Neural Trojan Defenses via Input Filtering | Kien Do, Haripriya Harikumar, Hung Le, Dung Nguyen, Truyen Tran, Santu Rana, Dang Nguyen, Willy Susilo, Svetha Venkatesh | N/A | |
| Scaling Adversarial Training to Large Perturbation Bounds | Sravanti Addepalli, Samyak Jain, Gaurang Sriramanan, R. Venkatesh Babu | N/A | |
| Exploiting the Local Parabolic Landscapes of Adversarial Losses to Accelerate Black-Box Adversarial Attack | Hoang Tran, Dan Lu, Guannan Zhang | N/A | |
| Generative Domain Adaptation for Face Anti-Spoofing | Qianyu Zhou, Ke-Yue Zhang, Taiping Yao, Ran Yi, Kekai Sheng, Shouhong Ding, Lizhuang Ma | N/A | |
| MetaGait: Learning to Learn an Omni Sample Adaptive Representation for Gait Recognition | Huanzhang Dou, Pengyi Zhang, Wei Su, Yunlong Yu, Xi Li | N/A | |
| GaitEdge: Beyond Plain End-to-End Gait Recognition for Better Practicality | Junhao Liang, Chao Fan, Saihui Hou, Chuanfu Shen, Yongzhen Huang, Shiqi Yu | N/A | |
| UIA-ViT: Unsupervised Inconsistency-Aware Method Based on Vision Transformer for Face Forgery Detection | Wanyi Zhuang, Qi Chu, Zhentao Tan, Qiankun Liu, Haojie Yuan, Changtao Miao, Zixiang Luo, Nenghai Yu | N/A | |
| Effective Presentation Attack Detection Driven by Face Related Task | Wentian Zhang, Haozhe Liu, Feng Liu, Raghavendra Ramachandra, Christoph Busch | N/A | |
| PPT: Token-Pruned Pose Transformer for Monocular and Multi-View Human Pose Estimation | Haoyu Ma, Zhe Wang, Yifei Chen, Deying Kong, Liangjian Chen, Xingwei Liu, Xiangyi Yan, Hao Tang, Xiaohui Xie | N/A | |
| AvatarPoser: Articulated Full-Body Pose Tracking from Sparse Motion Sensing | Jiaxi Jiang, Paul Streli, Huajian Qiu, Andreas Fender, Larissa Laich, Patrick Snape, Christian Holz | N/A | |
| P-STMO: Pre-trained Spatial Temporal Many-to-One Model for 3D Human Pose Estimation | Wenkang Shan, Zhenhua Liu, Xinfeng Zhang, Shanshe Wang, Siwei Ma, Wen Gao | N/A | |
| D&D: Learning Human Dynamics from Dynamic Camera | Jiefeng Li, Siyuan Bian, Chao Xu, Gang Liu, Gang Yu, Cewu Lu | N/A | |
| Explicit Occlusion Reasoning for Multi-Person 3D Human Pose Estimation | Qihao Liu, Yi Zhang, Song Bai, Alan Yuille | N/A | |
| COUCH: Towards Controllable Human-Chair Interactions | Xiaohan Zhang, Bharat Lal Bhatnagar, Sebastian Starke, Vladimir Guzov, Gerard Pons-Moll | N/A | |
| Identity-Aware Hand Mesh Estimation and Personalization from RGB Images | Deying Kong, Linguang Zhang, Liangjian Chen, Haoyu Ma, Xiangyi Yan, Shanlin Sun, Xingwei Liu, Kun Han, Xiaohui Xie | N/A | |
| C3P: Cross-Domain Pose Prior Propagation for Weakly Supervised 3D Human Pose Estimation | Cunlin Wu, Yang Xiao, Boshen Zhang, Mingyang Zhang, Zhiguo Cao, Joey Tianyi Zhou | N/A | |
| Pose-NDF: Modeling Human Pose Manifolds with Neural Distance Fields | Garvita Tiwari, Dimitrije Antić, Jan Eric Lenssen, Nikolaos Sarafianos, Tony Tung, Gerard Pons-Moll | N/A | |
| CLIFF: Carrying Location Information in Full Frames into Human Pose and Shape Estimation | Zhihao Li, Jianzhuang Liu, Zhensong Zhang, Songcen Xu, Youliang Yan | N/A | |
| DeciWatch: A Simple Baseline for 10× Efficient 2D and 3D Pose Estimation | Ailing Zeng, Xuan Ju, Lei Yang, Ruiyuan Gao, Xizhou Zhu, Bo Dai, Qiang Xu | N/A | |
| SmoothNet: A Plug-and-Play Network for Refining Human Poses in Videos | Ailing Zeng, Lei Yang, Xuan Ju, Jiefeng Li, Jianyi Wang, Qiang Xu | N/A | |
| PoseTrans: A Simple yet Effective Pose Transformation Augmentation for Human Pose Estimation | Wentao Jiang, Sheng Jin, Wentao Liu, Chen Qian, Ping Luo, Si Liu | N/A | |
| Multi-Person 3D Pose and Shape Estimation via Inverse Kinematics and Refinement | Junuk Cha, Muhammad Saqlain, GeonU Kim, Mingyu Shin, Seungryul Baek | N/A | |
| Overlooked Poses Actually Make Sense: Distilling Privileged Knowledge for Human Motion Prediction | Xiaoning Sun, Qiongjie Cui, Huaijiang Sun, Bin Li, Weiqing Li, Jianfeng Lu | N/A | |
| Structural Triangulation: A Closed-Form Solution to Constrained 3D Human Pose Estimation | Zhuo Chen, Xu Zhao, Xiaoyue Wan | N/A | |
| Audio-Driven Stylized Gesture Generation with Flow-Based Model | Sheng Ye, Yu-Hui Wen, Yanan Sun, Ying He, Ziyang Zhang, Yaoyuan Wang, Weihua He, Yong-Jin Liu | N/A | |
| Self-Constrained Inference Optimization on Structural Groups for Human Pose Estimation | Zhehan Kan, Shuoshuo Chen, Zeng Li, Zhihai He | N/A | |
| UnrealEgo: A New Dataset for Robust Egocentric 3D Human Motion Capture | Hiroyasu Akada, Jian Wang, Soshi Shimada, Masaki Takahashi, Christian Theobalt, Vladislav Golyanik | N/A | |
| Skeleton-Parted Graph Scattering Networks for 3D Human Motion Prediction | Maosen Li, Siheng Chen, Zijing Zhang, Lingxi Xie, Qi Tian, Ya Zhang | N/A | |
| Rethinking Keypoint Representations: Modeling Keypoints and Poses as Objects for Multi-Person Human Pose Estimation | William McNally, Kanav Vats, Alexander Wong, John McPhee | N/A | |
| VirtualPose: Learning Generalizable 3D Human Pose Models from Virtual Data | Jiajun Su, Chunyu Wang, Xiaoxuan Ma, Wenjun Zeng, Yizhou Wang | N/A | |
| Poseur: Direct Human Pose Regression with Transformers | Weian Mao, Yongtao Ge, Chunhua Shen, Zhi Tian, Xinlong Wang, Zhibin Wang, Anton van den Hengel | N/A | |
| SimCC: A Simple Coordinate Classification Perspective for Human Pose Estimation | Yanjie Li, Sen Yang, Peidong Liu, Shoukui Zhang, Yunxiao Wang, Zhicheng Wang, Wankou Yang, Shu-Tao Xia | N/A | |
| Regularizing Vector Embedding in Bottom-Up Human Pose Estimation | Haixin Wang, Lu Zhou, Yingying Chen, Ming Tang, Jinqiao Wang | N/A | |
| A Visual Navigation Perspective for Category-Level Object Pose Estimation | Jiaxin Guo, Fangxun Zhong, Rong Xiong, Yun-Hui Liu, Yue Wang, Yiyi Liao | N/A | |
| Faster VoxelPose: Real-Time 3D Human Pose Estimation by Orthographic Projection | Hang Ye, Wentao Zhu, Chunyu Wang, Rujie Wu, Yizhou Wang | N/A | |
| Learning to Fit Morphable Models | Vasileios Choutas, Federica Bogo, Jingjing Shen, Julien Valentin | N/A | |
| EgoBody: Human Body Shape and Motion of Interacting People from Head-Mounted Devices | Siwei Zhang, Qianli Ma, Yan Zhang, Zhiyin Qian, Taein Kwon, Marc Pollefeys, Federica Bogo, Siyu Tang | N/A | |
| Grasp’D: Differentiable Contact-Rich Grasp Synthesis for Multi-Fingered Hands | Dylan Turpin, Liquan Wang, Eric Heiden, Yun-Chun Chen, Miles Macklin, Stavros Tsogkas, Sven Dickinson, Animesh Garg | N/A | |
| AutoAvatar: Autoregressive Neural Fields for Dynamic Avatar Modeling | Ziqian Bai, Timur Bagautdinov, Javier Romero, Michael Zollhöfer, Ping Tan, Shunsuke Saito | N/A | |
| Deep Radial Embedding for Visual Sequence Learning | Yuecong Min, Peiqi Jiao, Yanan Li, Xiaotao Wang, Lei Lei, Xiujuan Chai, Xilin Chen | N/A | |
| SAGA: Stochastic Whole-Body Grasping with Contact | Yan Wu, Jiahao Wang, Yan Zhang, Siwei Zhang, Otmar Hilliges, Fisher Yu, Siyu Tang | N/A | |
| Neural Capture of Animatable 3D Human from Monocular Video | Gusi Te, Xiu Li, Xiao Li, Jinglu Wang, Wei Hu, Yan Lu | N/A | |
| General Object Pose Transformation Network from Unpaired Data | Yukun Su, Guosheng Lin, Ruizhou Sun, Qingyao Wu | N/A | |
| Compositional Human-Scene Interaction Synthesis with Semantic Control | Kaifeng Zhao, Shaofei Wang, Yan Zhang, Thabo Beeler, Siyu Tang | N/A | |
| PressureVision: Estimating Hand Pressure from a Single RGB Image | Patrick Grady, Chengcheng Tang, Samarth Brahmbhatt, Christopher D. Twigg, Chengde Wan, James Hays, Charles C. Kemp | N/A | |
| PoseScript: 3D Human Poses from Natural Language | Ginger Delmas, Philippe Weinzaepfel, Thomas Lucas, Francesc Moreno-Noguer, Grégory Rogez | N/A | |
| DProST: Dynamic Projective Spatial Transformer Network for 6D Pose Estimation | Jaewoo Park, Nam Ik Cho | N/A | |
| 3D Interacting Hand Pose Estimation by Hand De-Occlusion and Removal | Hao Meng, Sheng Jin, Wentao Liu, Chen Qian, Mengxiang Lin, Wanli Ouyang, Ping Luo | N/A | |
| Pose for Everything: Towards Category-Agnostic Pose Estimation | Lumin Xu, Sheng Jin, Wang Zeng, Wentao Liu, Chen Qian, Wanli Ouyang, Ping Luo, Xiaogang Wang | N/A | |
| PoseGPT: Quantization-Based 3D Human Motion Generation and Forecasting | Thomas Lucas, Fabien Baradel, Philippe Weinzaepfel, Grégory Rogez | N/A | |
| DH-AUG: DH Forward Kinematics Model Driven Augmentation for 3D Human Pose Estimation | Linzhi Huang, Jiahao Liang, Weihong Deng | N/A | |
| Estimating Spatially-Varying Lighting in Urban Scenes with Disentangled Representation | Jiajun Tang, Yongjie Zhu, Haoyu Wang, Jun Hoong Chan, Si Li, Boxin Shi | N/A | |
| Boosting Event Stream Super-Resolution with a Recurrent Neural Network | Wenming Weng, Yueyi Zhang, Zhiwei Xiong | N/A | |
| Projective Parallel Single-Pixel Imaging to Overcome Global Illumination in 3D Structure Light Scanning | Yuxi Li, Huijie Zhao, Hongzhi Jiang, Xudong Li | N/A | |
| Semantic-Sparse Colorization Network for Deep Exemplar-Based Colorization | Yunpeng Bai, Chao Dong, Zenghao Chai, Andong Wang, Zhengzhuo Xu, Chun Yuan | N/A | |
| Practical and Scalable Desktop-Based High-Quality Facial Capture | Alexandros Lattas, Yiming Lin, Jayanth Kannan, Ekin Ozturk, Luca Filipi, Giuseppe Claudio Guarnera, Gaurav Chawla, Abhijeet Ghosh | N/A | |
| FAST-VQA: Efficient End-to-End Video Quality Assessment with Fragment Sampling | Haoning Wu, Chaofeng Chen, Jingwen Hou, Liang Liao, Annan Wang, Wenxiu Sun, Qiong Yan, Weisi Lin | N/A | |
| Physically-Based Editing of Indoor Scene Lighting from a Single Image | Zhengqin Li, Jia Shi, Sai Bi, Rui Zhu, Kalyan Sunkavalli, Miloš Hašan, Zexiang Xu, Ravi Ramamoorthi, Manmohan Chandraker | N/A | |
| LEDNet: Joint Low-Light Enhancement and Deblurring in the Dark | Shangchen Zhou, Chongyi Li, Chen Change Loy | N/A | |
| MPIB: An MPI-Based Bokeh Rendering Framework for Realistic Partial Occlusion Effects | Juewen Peng, Jianming Zhang, Xianrui Luo, Hao Lu, Ke Xian, Zhiguo Cao | N/A | |
| Real-RawVSR: Real-World Raw Video Super-Resolution with a Benchmark Dataset | Huanjing Yue, Zhiming Zhang, Jingyu Yang | N/A | |
| Transform Your Smartphone into a DSLR Camera: Learning the ISP in the Wild | Ardhendu Shekhar Tripathi, Martin Danelljan, Samarth Shukla, Radu Timofte, Luc Van Gool | N/A | |
| Learning Deep Non-Blind Image Deconvolution without Ground Truths | Yuhui Quan, Zhuojie Chen, Huan Zheng, Hui Ji | N/A | |
| NEST: Neural Event Stack for Event-Based Image Enhancement | Minggui Teng, Chu Zhou, Hanyue Lou, Boxin Shi | N/A | |
| Editable Indoor Lighting Estimation | Henrique Weber, Mathieu Garon, Jean-François Lalonde | N/A | |
| Fast Two-Step Blind Optical Aberration Correction | Thomas Eboli, Jean-Michel Morel, Gabriele Facciolo | N/A | |
| Seeing Far in the Dark with Patterned Flash | Zhanghao Sun, Jian Wang, Yicheng Wu, Shree Nayar | N/A | |
| PseudoClick: Interactive Image Segmentation with Click Imitation | Qin Liu, Meng Zheng, Benjamin Planche, Srikrishna Karanam, Terrence Chen, Marc Niethammer, Ziyan Wu | N/A | |
| CT2: Colorization Transformer via Color Tokens | Shuchen Weng, Jimeng Sun, Yu Li, Si Li, Boxin Shi | N/A | |
| Simple Baselines for Image Restoration | Liangyu Chen, Xiaojie Chu, Xiangyu Zhang, Jian Sun | N/A | |
| Spike Transformer: Monocular Depth Estimation for Spiking Camera | Jiyuan Zhang, Lulu Tang, Zhaofei Yu, Jiwen Lu, Tiejun Huang | N/A | |
| Improving Image Restoration by Revisiting Global Information Aggregation | Xiaojie Chu, Liangyu Chen, Chengpeng Chen, Xin Lu | N/A | |
| Data Association between Event Streams and Intensity Frames under Diverse Baselines | Dehao Zhang, Qiankun Ding, Peiqi Duan, Chu Zhou, Boxin Shi | N/A | |
| D2HNet: Joint Denoising and Deblurring with Hierarchical Network for Robust Night Image Restoration | Yuzhi Zhao, Yongzhe Xu, Qiong Yan, Dingdong Yang, Xuehui Wang, Lai-Man Po | N/A | |
| Learning Graph Neural Networks for Image Style Transfer | Yongcheng Jing, Yining Mao, Yiding Yang, Yibing Zhan, Mingli Song, Xinchao Wang, Dacheng Tao | N/A | |
| DeepPS2: Revisiting Photometric Stereo Using Two Differently Illuminated Images | Ashish Tiwari, Shanmuganathan Raman | N/A | |
| Instance Contour Adjustment via Structure-Driven CNN | Shuchen Weng, Yi Wei, Ming-Ching Chang, Boxin Shi | N/A | |
| Synthesizing Light Field Video from Monocular Video | Shrisudhan Govindarajan, Prasan Shedligeri, Sarah, Kaushik Mitra | N/A | |
| Human-Centric Image Cropping with Partition-Aware and Content-Preserving Features | Bo Zhang, Li Niu, Xing Zhao, Liqing Zhang | N/A | |
| DeMFI: Deep Joint Deblurring and Multi-Frame Interpolation with Flow-Guided Attentive Correlation and Recursive Boosting | Jihyong Oh, Munchurl Kim | N/A | |
| Neural Image Representations for Multi-Image Fusion and Layer Separation | Seonghyeon Nam, Marcus A. Brubaker, Michael S. Brown | N/A | |
| Bringing Rolling Shutter Images Alive with Dual Reversed Distortion | Zhihang Zhong, Mingdeng Cao, Xiao Sun, Zhirong Wu, Zhongyi Zhou, Yinqiang Zheng, Stephen Lin, Imari Sato | N/A | |
| FILM: Frame Interpolation for Large Motion | Fitsum Reda, Janne Kontkanen, Eric Tabellion, Deqing Sun, Caroline Pantofaru, Brian Curless | N/A | |
| Video Interpolation by Event-Driven Anisotropic Adjustment of Optical Flow | Song Wu, Kaichao You, Weihua He, Chen Yang, Yang Tian, Yaoyuan Wang, Ziyang Zhang, Jianxing Liao | N/A | |
| EvAC3D: From Event-Based Apparent Contours to 3D Models via Continuous Visual Hulls | Ziyun Wang, Kenneth Chaney, Kostas Daniilidis | N/A | |
| DCCF: Deep Comprehensible Color Filter Learning Framework for High-Resolution Image Harmonization | Ben Xue, Shenghui Ran, Quan Chen, Rongfei Jia, Binqiang Zhao, Xing Tang | N/A | |
| SelectionConv: Convolutional Neural Networks for Non-Rectilinear Image Data | David Hart, Michael Whitney, Bryan Morse | N/A | |
| Spatial-Separated Curve Rendering Network for Efficient and High-Resolution Image Harmonization | Jingtang Liang, Xiaodong Cun, Chi-Man Pun, Jue Wang | N/A | |
| BigColor: Colorization Using a Generative Color Prior for Natural Images | Geonung Kim, Kyoungkook Kang, Seongtae Kim, Hwayoon Lee, Sehoon Kim, Jonghyun Kim, Seung-Hwan Baek, Sunghyun Cho | N/A | |
| CADyQ: Content-Aware Dynamic Quantization for Image Super-Resolution | Cheeun Hong, Sungyong Baik, Heewon Kim, Seungjun Nah, Kyoung Mu Lee | N/A | |
| Deep Semantic Statistics Matching (D2SM) Denoising Network | Kangfu Mei, Vishal M. Patel, Rui Huang | N/A | |
| 3D Scene Inference from Transient Histograms | Sacha Jungerman, Atul Ingle, Yin Li, Mohit Gupta | N/A | |
| Neural Space-Filling Curves | Hanyu Wang, Kamal Gupta, Larry Davis, Abhinav Shrivastava | N/A | |
| Exposure-Aware Dynamic Weighted Learning for Single-Shot HDR Imaging | An Gia Vien, Chul Lee | N/A | |
| Seeing through a Black Box: Toward High-Quality Terahertz Imaging via Subspace-and-Attention Guided Restoration | Weng-Tai Su, Yi-Chun Hung, Po-Jen Yu, Shang-Hua Yang, Chia-Wen Lin | N/A | |
| Tomography of Turbulence Strength Based on Scintillation Imaging | Nir Shaul, Yoav Y. Schechner | N/A | |
| Realistic Blur Synthesis for Learning Image Deblurring | Jaesung Rim, Geonung Kim, Jungeon Kim, Junyong Lee, Seungyong Lee, Sunghyun Cho | N/A | |
| Learning Phase Mask for Privacy-Preserving Passive Depth Estimation | Zaid Tasneem, Giovanni Milione, Yi-Hsuan Tsai, Xiang Yu, Ashok Veeraraghavan, Manmohan Chandraker, Francesco Pittaluga | N/A | |
| LWGNet – Learned Wirtinger Gradients for Fourier Ptychographic Phase Retrieval | Atreyee Saha, Salman S. Khan, Sagar Sehrawat, Sanjana S. Prabhu, Shanti Bhattacharya, Kaushik Mitra | N/A | |
| PANDORA: Polarization-Aided Neural Decomposition of Radiance | Akshat Dave, Yongyi Zhao, Ashok Veeraraghavan | N/A | |
| HuMMan: Multi-modal 4D Human Dataset for Versatile Sensing and Modeling | Zhongang Cai, Daxuan Ren, Ailing Zeng, Zhengyu Lin, Tao Yu, Wenjia Wang, Xiangyu Fan, Yang Gao, Yifan Yu, Liang Pan, Fangzhou Hong, Mingyuan Zhang, Chen Change Loy, Lei Yang, Ziwei Liu | N/A | |
| DVS-Voltmeter: Stochastic Process-Based Event Simulator for Dynamic Vision Sensors | Songnan Lin, Ye Ma, Zhenhua Guo, Bihan Wen | N/A | |
| Benchmarking Omni-Vision Representation through the Lens of Visual Realms | Yuanhan Zhang, Zhenfei Yin, Jing Shao, Ziwei Liu | N/A | |
| BEAT: A Large-Scale Semantic and Emotional Multi-modal Dataset for Conversational Gestures Synthesis | Haiyang Liu, Zihao Zhu, Naoya Iwamoto, Yichen Peng, Zhengqing Li, You Zhou, Elif Bozkurt, Bo Zheng | N/A | |
| Neuromorphic Data Augmentation for Training Spiking Neural Networks | Yuhang Li, Youngeun Kim, Hyoungseob Park, Tamar Geller, Priyadarshini Panda | N/A | |
| CelebV-HQ: A Large-Scale Video Facial Attributes Dataset | Hao Zhu, Wayne Wu, Wentao Zhu, Liming Jiang, Siwei Tang, Li Zhang, Ziwei Liu, Chen Change Loy | N/A | |
| MovieCuts: A New Dataset and Benchmark for Cut Type Recognition | Alejandro Pardo, Fabian Caba, Juan León Alcázar, Ali Thabet, Bernard Ghanem | N/A | |
| LaMAR: Benchmarking Localization and Mapping for Augmented Reality | Paul-Edouard Sarlin, Mihai Dusmanu, Johannes L. Schönberger, Pablo Speciale, Lukas Gruber, Viktor Larsson, Ondrej Miksik, Marc Pollefeys | N/A | |
| Unitail: Detecting, Reading, and Matching in Retail Scene | Fangyi Chen, Han Zhang, Zaiwang Li, Jiachen Dou, Shentong Mo, Hao Chen, Yongxin Zhang, Uzair Ahmed, Chenchen Zhu, Marios Savvides | N/A | |
| Not Just Streaks: Towards Ground Truth for Single Image Deraining | Yunhao Ba, Howard Zhang, Ethan Yang, Akira Suzuki, Arnold Pfahnl, Chethan Chinder Chandrappa, Celso M. de Melo, Suya You, Stefano Soatto, Alex Wong, Achuta Kadambi | N/A | |
| ECCV Caption: Correcting False Negatives by Collecting Machine-and-Human-Verified Image-Caption Associations for MS-COCO | Sanghyuk Chun, Wonjae Kim, Song Park, Minsuk Chang, Seong Joon Oh | N/A | |
| MOTCOM: The Multi-Object Tracking Dataset Complexity Metric | Malte Pedersen, Joakim Bruslund Haurum, Patrick Dendorfer, Thomas B. Moeslund | N/A | |
| How to Synthesize a Large-Scale and Trainable Micro-Expression Dataset? | Yuchi Liu, Zhongdao Wang, Tom Gedeon, Liang Zheng | N/A | |
| A Real World Dataset for Multi-View 3D Reconstruction | Rakesh Shrestha, Siqi Hu, Minghao Gou, Ziyuan Liu, Ping Tan | N/A | |
| REALY: Rethinking the Evaluation of 3D Face Reconstruction | Zenghao Chai, Haoxian Zhang, Jing Ren, Di Kang, Zhengzhuo Xu, Xuefei Zhe, Chun Yuan, Linchao Bao | N/A | |
| Capturing, Reconstructing, and Simulating: The UrbanScene3D Dataset | Liqiang Lin, Yilin Liu, Yue Hu, Xingguang Yan, Ke Xie, Hui Huang | N/A | |
| 3D CoMPaT: Composition of Materials on Parts of 3D Things | Yuchen Li, Ujjwal Upadhyay, Habib Slim, Tezuesh Varshney, Ahmed Abdelreheem, Arpit Prajapati, Suhail Pothigara, Peter Wonka, Mohamed Elhoseiny | N/A | |
| PartImageNet: A Large, High-Quality Dataset of Parts | Ju He, Shuo Yang, Shaokang Yang, Adam Kortylewski, Xiaoding Yuan, Jie-Neng Chen, Shuai Liu, Cheng Yang, Qihang Yu, Alan Yuille | N/A | |
| A-OKVQA: A Benchmark for Visual Question Answering Using World Knowledge | Dustin Schwenk, Apoorv Khandelwal, Christopher Clark, Kenneth Marino, Roozbeh Mottaghi | N/A | |
| OOD-CV: A Benchmark for Robustness to Out-of-Distribution Shifts of Individual Nuisances in Natural Images | Bingchen Zhao, Shaozuo Yu, Wufei Ma, Mingxin Yu, Shenxiao Mei, Angtian Wang, Ju He, Alan Yuille, Adam Kortylewski | N/A | |
| Facial Depth and Normal Estimation Using Single Dual-Pixel Camera | Minjun Kang, Jaesung Choe, Hyowon Ha, Hae-Gon Jeon, Sunghoon Im, In So Kweon, Kuk-Jin Yoon | N/A | |
| The Anatomy of Video Editing: A Dataset and Benchmark Suite for AI-Assisted Video Editing | Dawit Mureja Argaw, Fabian Caba, Joon-Young Lee, Markus Woodson, In So Kweon | N/A | |
| StyleBabel: Artistic Style Tagging and Captioning | Dan Ruta, Andrew Gilbert, Pranav Aggarwal, Naveen Marri, Ajinkya Kale, Jo Briggs, Chris Speed, Hailin Jin, Baldo Faieta, Alex Filipkowski, Zhe Lin, John Collomosse | N/A | |
| PANDORA: A Panoramic Detection Dataset for Object with Orientation | Hang Xu, Qiang Zhao, Yike Ma, Xiaodong Li, Peng Yuan, Bailan Feng, Chenggang Yan, Feng Dai | N/A | |
| FS-COCO: Towards Understanding of Freehand Sketches of Common Objects in Context | Pinaki Nath Chowdhury, Aneeshan Sain, Ayan Kumar Bhunia, Tao Xiang, Yulia Gryaditskaya, Yi-Zhe Song | N/A | |
| Exploring Fine-Grained Audiovisual Categorization with the SSW60 Dataset | Grant Van Horn, Rui Qian, Kimberly Wilber, Hartwig Adam, Oisin Mac Aodha, Serge Belongie | N/A | |
| The Caltech Fish Counting Dataset: A Benchmark for Multiple-Object Tracking and Counting | Justin Kay, Peter Kulits, Suzanne Stathatos, Siqi Deng, Erik Young, Sara Beery, Grant Van Horn, Pietro Perona | N/A | |
| A Dataset for Interactive Vision-Language Navigation with Unknown Command Feasibility | Andrea Burns, Deniz Arsan, Sanjna Agrawal, Ranjitha Kumar, Kate Saenko, Bryan A. Plummer | N/A | |
| BRACE: The Breakdancing Competition Dataset for Dance Motion Synthesis | Davide Moltisanti, Jinyi Wu, Bo Dai, Chen Change Loy | N/A | |
| Dress Code: High-Resolution Multi-Category Virtual Try-On | Davide Morelli, Matteo Fincato, Marcella Cornia, Federico Landi, Fabio Cesari, Rita Cucchiara | N/A | |
| A Data-Centric Approach for Improving Ambiguous Labels with Combined Semi-Supervised Classification and Clustering | Lars Schmarje, Monty Santarossa, Simon-Martin Schröder, Claudius Zelenka, Rainer Kiko, Jenny Stracke, Nina Volkmann, Reinhard Koch | N/A | |
| ClearPose: Large-Scale Transparent Object Dataset and Benchmark | Xiaotong Chen, Huijie Zhang, Zeren Yu, Anthony Opipari, Odest Chadwicke Jenkins | N/A | |
| When Deep Classifiers Agree: Analyzing Correlations between Learning Order and Image Statistics | Iuliia Pliushch, Martin Mundt, Nicolas Lupp, Visvanathan Ramesh | N/A | |
| AnimeCeleb: Large-Scale Animation CelebHeads Dataset for Head Reenactment | Kangyeol Kim, Sunghyun Park, Jaeseong Lee, Sunghyo Chung, Junsoo Lee, Jaegul Choo | N/A | |
| MUGEN: A Playground for Video-Audio-Text Multimodal Understanding and GENeration | Thomas Hayes, Songyang Zhang, Xi Yin, Guan Pang, Sasha Sheng, Harry Yang, Songwei Ge, Qiyuan Hu, Devi Parikh | N/A | |
| A Dense Material Segmentation Dataset for Indoor and Outdoor Scene Parsing | Paul Upchurch, Ransen Niu | N/A | |
| MimicME: A Large Scale Diverse 4D Database for Facial Expression Analysis | Athanasios Papaioannou, Baris Gecer, Shiyang Cheng, Grigorios G. Chrysos, Jiankang Deng, Eftychia Fotiadou, Christos Kampouris, Dimitrios Kollias, Stylianos Moschoglou, Kritaphat Songsri-In, Stylianos Ploumpis, George Trigeorgis, Panagiotis Tzirakis, Evangelos Ververas, Yuxiang Zhou, Allan Ponniah, Anastasios Roussos, Stefanos Zafeiriou | N/A | |
| Delving into Universal Lesion Segmentation: Method, Dataset, and Benchmark | Yu Qiu, Jing Xu | N/A | |
| Large Scale Real-World Multi-person Tracking | Bing Shuai, Alessandro Bergamo, Uta Büchler, Andrew Berneshawi, Alyssa Boden, Joseph Tighe | N/A | |
| D2-TPred: Discontinuous Dependency for Trajectory Prediction under Traffic Lights | Yuzhen Zhang, Wentong Wang, Weizhi Guo, Pei Lv, Mingliang Xu, Wei Chen, Dinesh Manocha | N/A | |
| The Missing Link: Finding Label Relations across Datasets | Jasper Uijlings, Thomas Mensink, Vittorio Ferrari | N/A | |
| Learning Omnidirectional Flow in 360° Video via Siamese Representation | Keshav Bhandari, Bin Duan, Gaowen Liu, Hugo Latapie, Ziliang Zong, Yan Yan | N/A | |
| VizWiz-FewShot: Locating Objects in Images Taken by People with Visual Impairments | Yu-Yun Tseng, Alexander Bell, Danna Gurari | N/A | |
| TRoVE: Transforming Road Scene Datasets into Photorealistic Virtual Environments | Shubham Dokania, Anbumani Subramanian, Manmohan Chandraker, C.V. Jawahar | N/A | |
| Trapped in Texture Bias? A Large Scale Comparison of Deep Instance Segmentation | Johannes Theodoridis, Jessica Hofmann, Johannes Maucher, Andreas Schilling | N/A | |
| Deformable Feature Aggregation for Dynamic Multi-modal 3D Object Detection | Zehui Chen, Zhenyu Li, Shiquan Zhang, Liangji Fang, Qinhong Jiang, Feng Zhao | N/A | |
| WeLSA: Learning to Predict 6D Pose from Weakly Labeled Data Using Shape Alignment | Shishir Reddy Vutukur, Ivan Shugurov, Benjamin Busam, Andreas Hutter, Slobodan Ilic | N/A | |
| Graph R-CNN: Towards Accurate 3D Object Detection with Semantic-Decorated Local Graph | Honghui Yang, Zili Liu, Xiaopei Wu, Wenxiao Wang, Wei Qian, Xiaofei He, Deng Cai | N/A | |
| MPPNet: Multi-Frame Feature Intertwining with Proxy Points for 3D Temporal Object Detection | Xuesong Chen, Shaoshuai Shi, Benjin Zhu, Ka Chun Cheung, Hang Xu, Hongsheng Li | N/A | |
| Long-Tail Detection with Effective Class-Margins | Jang Hyun Cho, Philipp Krähenbühl | N/A | |
| Semi-Supervised Monocular 3D Object Detection by Multi-View Consistency | Qing Lian, Yanbo Xu, Weilong Yao, Yingcong Chen, Tong Zhang | N/A | |
| PTSEFormer: Progressive Temporal-Spatial Enhanced TransFormer towards Video Object Detection | Han Wang, Jun Tang, Xiaodong Liu, Shanyan Guan, Rong Xie, Li Song | N/A | |
| BEVFormer: Learning Bird’s-Eye-View Representation from Multi-Camera Images via Spatiotemporal Transformers | Zhiqi Li, Wenhai Wang, Hongyang Li, Enze Xie, Chonghao Sima, Tong Lu, Yu Qiao, Jifeng Dai | N/A | |
| Category-Level 6D Object Pose and Size Estimation Using Self-Supervised Deep Prior Deformation Networks | Jiehong Lin, Zewei Wei, Changxing Ding, Kui Jia | N/A | |
| Dense Teacher: Dense Pseudo-Labels for Semi-Supervised Object Detection | Hongyu Zhou, Zheng Ge, Songtao Liu, Weixin Mao, Zeming Li, Haiyan Yu, Jian Sun | N/A | |
| Point-to-Box Network for Accurate Object Detection via Single Point Supervision | Pengfei Chen, Xuehui Yu, Xumeng Han, Najmul Hassan, Kai Wang, Jiachen Li, Jian Zhao, Humphrey Shi, Zhenjun Han, Qixiang Ye | N/A | |
| Domain Adaptive Hand Keypoint and Pixel Localization in the Wild | Takehiko Ohkawa, Yu-Jhe Li, Qichen Fu, Ryosuke Furuta, Kris M. Kitani, Yoichi Sato | N/A | |
| Towards Data-Efficient Detection Transformers | Wen Wang, Jing Zhang, Yang Cao, Yongliang Shen, Dacheng Tao | N/A | |
| Open-Vocabulary DETR with Conditional Matching | Yuhang Zang, Wei Li, Kaiyang Zhou, Chen Huang, Chen Change Loy | N/A | |
| Prediction-Guided Distillation for Dense Object Detection | Chenhongyi Yang, Mateusz Ochal, Amos Storkey, Elliot J. Crowley | N/A | |
| Multimodal Object Detection via Probabilistic Ensembling | Yi-Ting Chen, Jinghao Shi, Zelin Ye, Christoph Mertz, Deva Ramanan, Shu Kong | N/A | |
| Exploiting Unlabeled Data with Vision and Language Models for Object Detection | Shiyu Zhao, Zhixing Zhang, Samuel Schulter, Long Zhao, Vijay Kumar B G, Anastasis Stathopoulos, Manmohan Chandraker, Dimitris N. Metaxas | N/A | |
| CPO: Change Robust Panorama to Point Cloud Localization | Junho Kim, Hojun Jang, Changwoon Choi, Young Min Kim | N/A | |
| INT: Towards Infinite-Frames 3D Detection with an Efficient Framework | Jianyun Xu, Zhenwei Miao, Da Zhang, Hongyu Pan, Kaixuan Liu, Peihan Hao, Jun Zhu, Zhengyang Sun, Hongmin Li, Xin Zhan | N/A | |
| End-to-End Weakly Supervised Object Detection with Sparse Proposal Evolution | Mingxiang Liao, Fang Wan, Yuan Yao, Zhenjun Han, Jialing Zou, Yuze Wang, Bailan Feng, Peng Yuan, Qixiang Ye | N/A | |
| Calibration-Free Multi-View Crowd Counting | Qi Zhang, Antoni B. Chan | N/A | |
| Unsupervised Domain Adaptation for Monocular 3D Object Detection via Self-Training | Zhenyu Li, Zehui Chen, Ang Li, Liangji Fang, Qinhong Jiang, Xianming Liu, Junjun Jiang | N/A | |
| SuperLine3D: Self-Supervised Line Segmentation and Description for LiDAR Point Cloud | Xiangrui Zhao, Sheng Yang, Tianxin Huang, Jun Chen, Teng Ma, Mingyang Li, Yong Liu | N/A | |
| Exploring Plain Vision Transformer Backbones for Object Detection | Yanghao Li, Hanzi Mao, Ross Girshick, Kaiming He | N/A | |
| Adversarially-Aware Robust Object Detector | Ziyi Dong, Pengxu Wei, Liang Lin | N/A | |
| HEAD: HEtero-Assists Distillation for Heterogeneous Object Detectors | Luting Wang, Xiaojie Li, Yue Liao, Zeren Jiang, Jianlong Wu, Fei Wang, Chen Qian, Si Liu | N/A | |
| You Should Look at All Objects | Zhenchao Jin, Dongdong Yu, Luchuan Song, Zehuan Yuan, Lequan Yu | N/A | |
| Detecting Twenty-Thousand Classes Using Image-Level Supervision | Xingyi Zhou, Rohit Girdhar, Armand Joulin, Philipp Krähenbühl, Ishan Misra | N/A | |
| DCL-Net: Deep Correspondence Learning Network for 6D Pose Estimation | Hongyang Li, Jiehong Lin, Kui Jia | N/A | |
| Monocular 3D Object Detection with Depth from Motion | Tai Wang, Jiangmiao Pang, Dahua Lin | N/A | |
| DISP6D: Disentangled Implicit Shape and Pose Learning for Scalable 6D Pose Estimation | Yilin Wen, Xiangyu Li, Hao Pan, Lei Yang, Zheng Wang, Taku Komura, Wenping Wang | N/A | |
| Distilling Object Detectors with Global Knowledge | Sanli Tang, Zhongyu Zhang, Zhanzhan Cheng, Jing Lu, Yunlu Xu, Yi Niu, Fan He | N/A | |
| Unifying Visual Perception by Dispersible Points Learning | Jianming Liang, Guanglu Song, Biao Leng, Yu Liu | N/A | |
| PseCo: Pseudo Labeling and Consistency Training for Semi-Supervised Object Detection | Gang Li, Xiang Li, Yujie Wang, Yichao Wu, Ding Liang, Shanshan Zhang | N/A | |
| Exploring Resolution and Degradation Clues As Self-Supervised Signal for Low Quality Object Detection | Ziteng Cui, Yingying Zhu, Lin Gu, Guo-Jun Qi, Xiaoxiao Li, Renrui Zhang, Zenghui Zhang, Tatsuya Harada | N/A | |
| Robust Category-Level 6D Pose Estimation with Coarse-to-Fine Rendering of Neural Features | Wufei Ma, Angtian Wang, Alan Yuille, Adam Kortylewski | N/A | |
| Translation, Scale and Rotation: Cross-Modal Alignment Meets RGB-Infrared Vehicle Detection | Maoxun Yuan, Yinyan Wang, Xingxing Wei | N/A | |
| RFLA: Gaussian Receptive Field Based Label Assignment for Tiny Object Detection | Chang Xu, Jinwang Wang, Wen Yang, Huai Yu, Lei Yu, Gui-Song Xia | N/A | |
| Rethinking IoU-Based Optimization for Single-Stage 3D Object Detection | Hualian Sheng, Sijia Cai, Na Zhao, Bing Deng, Jianqiang Huang, Xian-Sheng Hua, Min-Jian Zhao, Gim Hee Lee | N/A | |
| TD-Road: Top-Down Road Network Extraction with Holistic Graph Construction | Yang He, Ravi Garg, Amber Roy Chowdhury | N/A | |
| Multi-faceted Distillation of Base-Novel Commonality for Few-Shot Object Detection | Shuang Wu, Wenjie Pei, Dianwen Mei, Fanglin Chen, Jiandong Tian, Guangming Lu | N/A | |
| PointCLM: A Contrastive Learning-Based Framework for Multi-Instance Point Cloud Registration | Mingzhi Yuan, Zhihao Li, Qiuye Jin, Xinrong Chen, Manning Wang | N/A | |
| Weakly Supervised Object Localization via Transformer with Implicit Spatial Calibration | Haotian Bai, Ruimao Zhang, Jiong Wang, Xiang Wan | N/A | |
| MTTrans: Cross-Domain Object Detection with Mean Teacher Transformer | Jinze Yu, Jiaming Liu, Xiaobao Wei, Haoyi Zhou, Yohei Nakata, Denis Gudovskiy, Tomoyuki Okuno, Jianxin Li, Kurt Keutzer, Shanghang Zhang | N/A | |
| Multi-Domain Multi-Definition Landmark Localization for Small Datasets | David Ferman, Gaurav Bharaj | N/A | |
| DEVIANT: Depth EquiVarIAnt NeTwork for Monocular 3D Object Detection | Abhinav Kumar, Garrick Brazil, Enrique Corona, Armin Parchami, Xiaoming Liu | N/A | |
| Label-Guided Auxiliary Training Improves 3D Object Detector | Yaomin Huang, Xinmei Liu, Yichen Zhu, Zhiyuan Xu, Chaomin Shen, Zhengping Che, Guixu Zhang, Yaxin Peng, Feifei Feng, Jian Tang | N/A | |
| PromptDet: Towards Open-Vocabulary Detection Using Uncurated Images | Chengjian Feng, Yujie Zhong, Zequn Jie, Xiangxiang Chu, Haibing Ren, Xiaolin Wei, Weidi Xie, Lin Ma | N/A | |
| Densely Constrained Depth Estimator for Monocular 3D Object Detection | Yingyan Li, Yuntao Chen, Jiawei He, Zhaoxiang Zhang | N/A | |
| Polarimetric Pose Prediction | Daoyi Gao, Yitong Li, Patrick Ruhkamp, Iuliia Skobleva, Magdalena Wysocki, HyunJun Jung, Pengyuan Wang, Arturo Guridi, Benjamin Busam | N/A | |
| DFNet: Enhance Absolute Pose Regression with Direct Feature Matching | Shuai Chen, Xinghui Li, Zirui Wang, Victor Adrian Prisacariu | N/A | |
| Cornerformer: Purifying Instances for Corner-Based Detectors | Haoran Wei, Xin Chen, Lingxi Xie, Qi Tian | N/A | |
| PillarNet: Real-Time and High-Performance Pillar-Based 3D Object Detection | Guangsheng Shi, Ruifeng Li, Chao Ma | N/A | |
| Robust Object Detection with Inaccurate Bounding Boxes | Chengxin Liu, Kewei Wang, Hao Lu, Zhiguo Cao, Ziming Zhang | N/A | |
| Efficient Decoder-Free Object Detection with Transformers | Peixian Chen, Mengdan Zhang, Yunhang Shen, Kekai Sheng, Yuting Gao, Xing Sun, Ke Li, Chunhua Shen | N/A | |
| Cross-Modality Knowledge Distillation Network for Monocular 3D Object Detection | Yu Hong, Hang Dai, Yong Ding | N/A | |
| ReAct: Temporal Action Detection with Relational Queries | Dingfeng Shi, Yujie Zhong, Qiong Cao, Jing Zhang, Lin Ma, Jia Li, Dacheng Tao | N/A | |
| Towards Accurate Active Camera Localization | Qihang Fang, Yingda Yin, Qingnan Fan, Fei Xia, Siyan Dong, Sheng Wang, Jue Wang, Leonidas J. Guibas, Baoquan Chen | N/A | |
| Camera Pose Auto-Encoders for Improving Pose Regression | Yoli Shavit, Yosi Keller | N/A | |
| Improving the Intra-Class Long-Tail in 3D Detection via Rare Example Mining | Chiyu Max Jiang, Mahyar Najibi, Charles R. Qi, Yin Zhou, Dragomir Anguelov | N/A | |
| Bagging Regional Classification Activation Maps for Weakly Supervised Object Localization | Lei Zhu, Qian Chen, Lujia Jin, Yunfei You, Yanye Lu | N/A | |
| UC-OWOD: Unknown-Classified Open World Object Detection | Zhiheng Wu, Yue Lu, Xingyu Chen, Zhengxing Wu, Liwen Kang, Junzhi Yu | N/A | |
| RayTran: 3D Pose Estimation and Shape Reconstruction of Multiple Objects from Videos with Ray-Traced Transformers | Michał J. Tyszkiewicz, Kevis-Kokitsi Maninis, Stefan Popov, Vittorio Ferrari | N/A | |
| GTCaR: Graph Transformer for Camera Re-Localization | Xinyi Li, Haibin Ling | N/A | |
| 3D Object Detection with a Self-Supervised Lidar Scene Flow Backbone | Emeç Erçelik, Ekim Yurtsever, Mingyu Liu, Zhijie Yang, Hanzhen Zhang, Pınar Topçam, Maximilian Listl, Yılmaz Kaan Çaylı, Alois Knoll | N/A | |
| Open Vocabulary Object Detection with Pseudo Bounding-Box Labels | Mingfei Gao, Chen Xing, Juan Carlos Niebles, Junnan Li, Ran Xu, Wenhao Liu, Caiming Xiong | N/A | |
| Few-Shot Object Detection by Knowledge Distillation Using Bag-of-Visual-Words Representations | Wenjie Pei, Shuang Wu, Dianwen Mei, Fanglin Chen, Jiandong Tian, Guangming Lu | N/A | |
| SALISA: Saliency-Based Input Sampling for Efficient Video Object Detection | Babak Ehteshami Bejnordi, Amirhossein Habibian, Fatih Porikli, Amir Ghodrati | N/A | |
| ECO-TR: Efficient Correspondences Finding via Coarse-to-Fine Refinement | Dongli Tan, Jiang-Jiang Liu, Xingyu Chen, Chao Chen, Ruixin Zhang, Yunhang Shen, Shouhong Ding, Rongrong Ji | N/A | |
| Vote from the Center: 6 DoF Pose Estimation in RGB-D Images by Radial Keypoint Voting | Yangzheng Wu, Mohsen Zand, Ali Etemad, Michael Greenspan | N/A | |
| Long-Tailed Instance Segmentation Using Gumbel Optimized Loss | Konstantinos Panagiotis Alexandridis, Jiankang Deng, Anh Nguyen, Shan Luo | N/A | |
| DetMatch: Two Teachers Are Better than One for Joint 2D and 3D Semi-Supervised Object Detection | Jinhyung Park, Chenfeng Xu, Yiyang Zhou, Masayoshi Tomizuka, Wei Zhan | N/A | |
| ObjectBox: From Centers to Boxes for Anchor-Free Object Detection | Mohsen Zand, Ali Etemad, Michael Greenspan | N/A | |
| Is Geometry Enough for Matching in Visual Localization? | Qunjie Zhou, Sérgio Agostinho, Aljoša Ošep, Laura Leal-Taixé | N/A | |
| SWFormer: Sparse Window Transformer for 3D Object Detection in Point Clouds | Pei Sun, Mingxing Tan, Weiyue Wang, Chenxi Liu, Fei Xia, Zhaoqi Leng, Dragomir Anguelov | N/A | |
| PCR-CG: Point Cloud Registration via Deep Explicit Color and Geometry | Yu Zhang, Junle Yu, Xiaolin Huang, Wenhui Zhou, Ji Hou | N/A | |
| GLAMD: Global and Local Attention Mask Distillation for Object Detectors | Younho Jang, Wheemyung Shin, Jinbeom Kim, Simon Woo, Sung-Ho Bae | N/A | |
| FCAF3D: Fully Convolutional Anchor-Free 3D Object Detection | Danila Rukhovich, Anna Vorontsova, Anton Konushin | N/A | |
| Video Anomaly Detection by Solving Decoupled Spatio-Temporal Jigsaw Puzzles | Guodong Wang, Yunhong Wang, Jie Qin, Dongming Zhang, Xiuguo Bao, Di Huang | N/A | |
| Class-Agnostic Object Detection with Multi-modal Transformer | Muhammad Maaz, Hanoona Rasheed, Salman Khan, Fahad Shahbaz Khan, Rao Muhammad Anwer, Ming-Hsuan Yang | N/A | |
| Enhancing Multi-modal Features Using Local Self-Attention for 3D Object Detection | Hao Li, Zehan Zhang, Xian Zhao, Yulong Wang, Yuxi Shen, Shiliang Pu, Hui Mao | N/A | |
| Object Detection As Probabilistic Set Prediction | Georg Hess, Christoffer Petersson, Lennart Svensson | N/A | |
| Weakly-Supervised Temporal Action Detection for Fine-Grained Videos with Hierarchical Atomic Actions | Zhi Li, Lu He, Huijuan Xu | N/A | |
| Neural Correspondence Field for Object Pose Estimation | Lin Huang, Tomas Hodan, Lingni Ma, Linguang Zhang, Luan Tran, Christopher D. Twigg, Po-Chen Wu, Junsong Yuan, Cem Keskin, Robert Wang | N/A | |
| On Label Granularity and Object Localization | Elijah Cole, Kimberly Wilber, Grant Van Horn, Xuan Yang, Marco Fornoni, Pietro Perona, Serge Belongie, Andrew Howard, Oisin Mac Aodha | N/A | |
| OIMNet++: Prototypical Normalization and Localization-Aware Learning for Person Search | Sanghoon Lee, Youngmin Oh, Donghyeon Baek, Junghyup Lee, Bumsub Ham | N/A | |
| Out-of-Distribution Identification: Let Detector Tell Which I Am Not Sure | Ruoqi Li, Chongyang Zhang, Hao Zhou, Chao Shi, Yan Luo | N/A | |
| Learning with Free Object Segments for Long-Tailed Instance Segmentation | Cheng Zhang, Tai-Yu Pan, Tianle Chen, Jike Zhong, Wenjin Fu, Wei-Lun Chao | N/A | |
| Autoregressive Uncertainty Modeling for 3D Bounding Box Prediction | YuXuan Liu, Nikhil Mishra, Maximilian Sieb, Yide Shentu, Pieter Abbeel, Xi Chen | N/A | |
| 3D Random Occlusion and Multi-layer Projection for Deep Multi-Camera Pedestrian Localization | Rui Qiu, Ming Xu, Yuyao Yan, Jeremy S. Smith, Xi Yang | N/A | |
| A Simple Single-Scale Vision Transformer for Object Detection and Instance Segmentation | Wuyang Chen, Xianzhi Du, Fan Yang, Lucas Beyer, Xiaohua Zhai, Tsung-Yi Lin, Huizhong Chen, Jing Li, Xiaodan Song, Zhangyang Wang, Denny Zhou | N/A | |
| Simple Open-Vocabulary Object Detection with Vision Transformers | Matthias Minderer, Alexey Gritsenko, Austin Stone, Maxim Neumann, Dirk Weissenborn, Alexey Dosovitskiy, Aravindh Mahendran, Anurag Arnab, Mostafa Dehghani, Zhuoran Shen, Xiao Wang, Xiaohua Zhai, Thomas Kipf, Neil Houlsby | N/A | |
| A Simple Approach and Benchmark for 21,000-Category Object Detection | Yutong Lin, Chen Li, Yue Cao, Zheng Zhang, Jianfeng Wang, Lijuan Wang, Zicheng Liu, Han Hu | N/A | |
| Knowledge Condensation Distillation | Chenxin Li, Mingbao Lin, Zhiyuan Ding, Nie Lin, Yihong Zhuang, Yue Huang, Xinghao Ding, Liujuan Cao | N/A | |
| Reducing Information Loss for Spiking Neural Networks | Yufei Guo, Yuanpei Chen, Liwen Zhang, YingLei Wang, Xiaode Liu, Xinyi Tong, Yuanyuan Ou, Xuhui Huang, Zhe Ma | N/A | |
| Masked Generative Distillation | Zhendong Yang, Zhe Li, Mingqi Shao, Dachuan Shi, Zehuan Yuan, Chun Yuan | N/A | |
| Fine-Grained Data Distribution Alignment for Post-Training Quantization | Yunshan Zhong, Mingbao Lin, Mengzhao Chen, Ke Li, Yunhang Shen, Fei Chao, Yongjian Wu, Rongrong Ji | N/A | |
| Learning with Recoverable Forgetting | Jingwen Ye, Yifang Fu, Jie Song, Xingyi Yang, Songhua Liu, Xin Jin, Mingli Song, Xinchao Wang | N/A | |
| Efficient One Pass Self-Distillation with Zipf’s Label Smoothing | Jiajun Liang, Linze Li, Zhaodong Bing, Borui Zhao, Yao Tang, Bo Lin, Haoqiang Fan | N/A | |
| Prune Your Model before Distill It | Jinhyuk Park, Albert No | N/A | |
| Deep Partial Updating: Towards Communication Efficient Updating for On-Device Inference | Zhongnan Qu, Cong Liu, Lothar Thiele | N/A | |
| Patch Similarity Aware Data-Free Quantization for Vision Transformers | Zhikai Li, Liping Ma, Mengjuan Chen, Junrui Xiao, Qingyi Gu | N/A | |
| L3: Accelerator-Friendly Lossless Image Format for High-Resolution, High-Throughput DNN Training | Jonghyun Bae, Woohyeon Baek, Tae Jun Ham, Jae W. Lee | N/A | |
| Streaming Multiscale Deep Equilibrium Models | Can Ufuk Ertenli, Emre Akbas, Ramazan Gokberk Cinbis | N/A | |
| Symmetry Regularization and Saturating Nonlinearity for Robust Quantization | Sein Park, Yeongsang Jang, Eunhyeok Park | N/A | |
| SP-Net: Slowly Progressing Dynamic Inference Networks | Huanyu Wang, Wenhu Zhang, Shihao Su, Hui Wang, Zhenwei Miao, Xin Zhan, Xi Li | N/A | |
| Equivariance and Invariance Inductive Bias for Learning from Insufficient Data | Tan Wang, Qianru Sun, Sugiri Pranata, Karlekar Jayashree, Hanwang Zhang | N/A | |
| Mixed-Precision Neural Network Quantization via Learned Layer-Wise Importance | Chen Tang, Kai Ouyang, Zhi Wang, Yifei Zhu, Wen Ji, Yaowei Wang, Wenwu Zhu | N/A | |
| Event Neural Networks | Matthew Dutson, Yin Li, Mohit Gupta | N/A | |
| EdgeViTs: Competing Light-Weight CNNs on Mobile Devices with Vision Transformers | Junting Pan, Adrian Bulat, Fuwen Tan, Xiatian Zhu, Lukasz Dudziak, Hongsheng Li, Georgios Tzimiropoulos, Brais Martinez | N/A | |
| PalQuant: Accelerating High-Precision Networks on Low-Precision Accelerators | Qinghao Hu, Gang Li, Qiman Wu, Jian Cheng | N/A | |
| Disentangled Differentiable Network Pruning | Shangqian Gao, Feihu Huang, Yanfu Zhang, Heng Huang | N/A | |
| IDa-Det: An Information Discrepancy-Aware Distillation for 1-Bit Detectors | Sheng Xu, Yanjing Li, Bohan Zeng, Teli Ma, Baochang Zhang, Xianbin Cao, Peng Gao, Jinhu Lü | N/A | |
| Learning to Weight Samples for Dynamic Early-Exiting Networks | Yizeng Han, Yifan Pu, Zihang Lai, Chaofei Wang, Shiji Song, Junfeng Cao, Wenhui Huang, Chao Deng, Gao Huang | N/A | |
| AdaBin: Improving Binary Neural Networks with Adaptive Binary Sets | Zhijun Tu, Xinghao Chen, Pengju Ren, Yunhe Wang | N/A | |
| Adaptive Token Sampling for Efficient Vision Transformers | Mohsen Fayyaz, Soroush Abbasi Koohpayegani, Farnoush Rezaei Jafari, Sunando Sengupta, Hamid Reza Vaezi Joze, Eric Sommerlade, Hamed Pirsiavash, Jürgen Gall | N/A | |
| Weight Fixing Networks | Christopher Subia-Waud, Srinandan Dasmahapatra | N/A | |
| Self-Slimmed Vision Transformer | Zhuofan Zong, Kunchang Li, Guanglu Song, Yali Wang, Yu Qiao, Biao Leng, Yu Liu | N/A | |
| Switchable Online Knowledge Distillation | Biao Qian, Yang Wang, Hongzhi Yin, Richang Hong, Meng Wang | N/A | |
| ℓ∞-Robustness and Beyond: Unleashing Efficient Adversarial Training | Hadi M. Dolatabadi, Sarah Erfani, Christopher Leckie | N/A | |
| Multi-Granularity Pruning for Model Acceleration on Mobile Devices | Tianli Zhao, Xi Sheryl Zhang, Wentao Zhu, Jiaxing Wang, Sen Yang, Ji Liu, Jian Cheng | N/A | |
| Deep Ensemble Learning by Diverse Knowledge Distillation for Fine-Grained Object Classification | Naoki Okamoto, Tsubasa Hirakawa, Takayoshi Yamashita, Hironobu Fujiyoshi | N/A | |
| Helpful or Harmful: Inter-Task Association in Continual Learning | Hyundong Jin, Eunwoo Kim | N/A | |
| Towards Accurate Binary Neural Networks via Modeling Contextual Dependencies | Xingrun Xing, Yangguang Li, Wei Li, Wenrui Ding, Yalong Jiang, Yufeng Wang, Jing Shao, Chunlei Liu, Xianglong Liu | N/A | |
| SPIN: An Empirical Evaluation on Sharing Parameters of Isotropic Networks | Chien-Yu Lin, Anish Prabhu, Thomas Merth, Sachin Mehta, Anurag Ranjan, Maxwell Horton, Mohammad Rastegari | N/A | |
| Ensemble Knowledge Guided Sub-network Search and Fine-Tuning for Filter Pruning | Seunghyun Lee, Byung Cheol Song | N/A | |
| Network Binarization via Contrastive Learning | Yuzhang Shang, Dan Xu, Ziliang Zong, Liqiang Nie, Yan Yan | N/A | |
| Lipschitz Continuity Retained Binary Neural Network | Yuzhang Shang, Dan Xu, Bin Duan, Ziliang Zong, Liqiang Nie, Yan Yan | N/A | |
| SPViT: Enabling Faster Vision Transformers via Latency-Aware Soft Token Pruning | Zhenglun Kong, Peiyan Dong, Xiaolong Ma, Xin Meng, Wei Niu, Mengshu Sun, Xuan Shen, Geng Yuan, Bin Ren, Hao Tang, Minghai Qin, Yanzhi Wang | N/A | |
| Soft Masking for Cost-Constrained Channel Pruning | Ryan Humble, Maying Shen, Jorge Albericio Latorre, Eric Darve, Jose Alvarez | N/A | |
| Non-uniform Step Size Quantization for Accurate Post-Training Quantization | Sangyun Oh, Hyeonuk Sim, Jounghyun Kim, Jongeun Lee | N/A | |
| SuperTickets: Drawing Task-Agnostic Lottery Tickets from Supernets via Jointly Architecture Searching and Parameter Pruning | Haoran You, Baopu Li, Zhanyi Sun, Xu Ouyang, Yingyan Lin | N/A | |
| Meta-GF: Training Dynamic-Depth Neural Networks Harmoniously | Yi Sun, Jian Li, Xin Xu | N/A | |
| Towards Ultra Low Latency Spiking Neural Networks for Vision and Sequential Tasks Using Temporal Pruning | Sayeed Shafayet Chowdhury, Nitin Rathi, Kaushik Roy | N/A | |
| Towards Accurate Network Quantization with Equivalent Smooth Regularizer | Kirill Solodskikh, Vladimir Chikin, Ruslan Aydarkhanov, Dehua Song, Irina Zhelavskaya, Jiansheng Wei | N/A | |
| Explicit Model Size Control and Relaxation via Smooth Regularization for Mixed-Precision Quantization | Vladimir Chikin, Kirill Solodskikh, Irina Zhelavskaya | N/A | |
| BASQ: Branch-Wise Activation-Clipping Search Quantization for Sub-4-Bit Neural Networks | Han-Byul Kim, Eunhyeok Park, Sungjoo Yoo | N/A | |
| You Already Have It: A Generator-Free Low-Precision DNN Training Framework Using Stochastic Rounding | Geng Yuan, Sung-En Chang, Qing Jin, Alec Lu, Yanyu Li, Yushu Wu, Zhenglun Kong, Yanyue Xie, Peiyan Dong, Minghai Qin, Xiaolong Ma, Xulong Tang, Zhenman Fang, Yanzhi Wang | N/A | |
| Real Spike: Learning Real-Valued Spikes for Spiking Neural Networks | Yufei Guo, Liwen Zhang, Yuanpei Chen, Xinyi Tong, Xiaode Liu, YingLei Wang, Xuhui Huang, Zhe Ma | N/A | |
| FedLTN: Federated Learning for Sparse and Personalized Lottery Ticket Networks | Vaikkunth Mugunthan, Eric Lin, Vignesh Gokul, Christian Lau, Lalana Kagal, Steve Pieper | N/A | |
| Theoretical Understanding of the Information Flow on Continual Learning Performance | Joshua Andle, Salimeh Yasaei Sekeh | N/A | |
| Exploring Lottery Ticket Hypothesis in Spiking Neural Networks | Youngeun Kim, Yuhang Li, Hyoungseob Park, Yeshwanth Venkatesha, Ruokai Yin, Priyadarshini Panda | N/A | |
| On the Angular Update and Hyperparameter Tuning of a Scale-Invariant Network | Juseung Yun, Janghyeon Lee, Hyounguk Shon, Eojindl Yi, Seung Hwan Kim, Junmo Kim | N/A | |
| LANA: Latency Aware Network Acceleration | Pavlo Molchanov, Jimmy Hall, Hongxu Yin, Jan Kautz, Nicolo Fusi, Arash Vahdat | N/A | |
| RDO-Q: Extremely Fine-Grained Channel-Wise Quantization via Rate-Distortion Optimization | Zhe Wang, Jie Lin, Xue Geng, Mohamed M. Sabry Aly, Vijay Chandrasekhar | N/A | |
| U-Boost NAS: Utilization-Boosted Differentiable Neural Architecture Search | Ahmet Caner Yüzügüler, Nikolaos Dimitriadis, Pascal Frossard | N/A | |
| PTQ4ViT: Post-Training Quantization for Vision Transformers with Twin Uniform Quantization | Zhihang Yuan, Chenhao Xue, Yiqi Chen, Qiang Wu, Guangyu Sun | N/A | |
| Bitwidth-Adaptive Quantization-Aware Neural Network Training: A Meta-Learning Approach | Jiseok Youn, Jaehun Song, Hyung-Sin Kim, Saewoong Bahk | N/A | |
| Understanding the Dynamics of DNNs Using Graph Modularity | Yao Lu, Wen Yang, Yunzhe Zhang, Zuohui Chen, Jinyin Chen, Qi Xuan, Zhen Wang, Xiaoniu Yang | N/A | |
| Latent Discriminant Deterministic Uncertainty | Gianni Franchi, Xuanlong Yu, Andrei Bursuc, Emanuel Aldea, Severine Dubuisson, David Filliat | N/A | |
| Making Heads or Tails: Towards Semantically Consistent Visual Counterfactuals | Simon Vandenhende, Dhruv Mahajan, Filip Radenovic, Deepti Ghadiyaram | N/A | |
| HIVE: Evaluating the Human Interpretability of Visual Explanations | Sunnie S. Y. Kim, Nicole Meister, Vikram V. Ramaswamy, Ruth Fong, Olga Russakovsky | N/A | |
| BayesCap: Bayesian Identity Cap for Calibrated Uncertainty in Frozen Neural Networks | Uddeshya Upadhyay, Shyamgopal Karthik, Yanbei Chen, Massimiliano Mancini, Zeynep Akata | N/A | |
| SESS: Saliency Enhancing with Scaling and Sliding | Osman Tursun, Simon Denman, Sridha Sridharan, Clinton Fookes | N/A | |
| No Token Left Behind: Explainability-Aided Image Classification and Generation | Roni Paiss, Hila Chefer, Lior Wolf | N/A | |
| Interpretable Image Classification with Differentiable Prototypes Assignment | Dawid Rymarczyk, Łukasz Struski, Michał Górszczak, Koryna Lewandowska, Jacek Tabor, Bartosz Zieliński | N/A | |
| Contributions of Shape, Texture, and Color in Visual Recognition | Yunhao Ge, Yao Xiao, Zhi Xu, Xingrui Wang, Laurent Itti | N/A | |
| STEEX: Steering Counterfactual Explanations with Semantics | Paul Jacob, Éloi Zablocki, Hédi Ben-Younes, Mickaël Chen, Patrick Pérez, Matthieu Cord | N/A | |
| Are Vision Transformers Robust to Patch Perturbations? | Jindong Gu, Volker Tresp, Yao Qin | N/A | |
| A Dataset Generation Framework for Evaluating Megapixel Image Classifiers & Their Explanations | Gautam Machiraju, Sylvia Plevritis, Parag Mallick | N/A | |
| Cartoon Explanations of Image Classifiers | Stefan Kolek, Duc Anh Nguyen, Ron Levie, Joan Bruna, Gitta Kutyniok | N/A | |
| Shap-CAM: Visual Explanations for Convolutional Neural Networks Based on Shapley Value | Quan Zheng, Ziwei Wang, Jie Zhou, Jiwen Lu | N/A | |
| Privacy-Preserving Face Recognition with Learnable Privacy Budgets in Frequency Domain | Jiazhen Ji, Huan Wang, Yuge Huang, Jiaxiang Wu, Xingkun Xu, Shouhong Ding, ShengChuan Zhang, Liujuan Cao, Rongrong Ji | N/A | |
| Contrast-Phys: Unsupervised Video-Based Remote Physiological Measurement via Spatiotemporal Contrast | Zhaodong Sun, Xiaobai Li | N/A | |
| Source-Free Domain Adaptation with Contrastive Domain Alignment and Self-Supervised Exploration for Face Anti-Spoofing | Yuchen Liu, Yabo Chen, Wenrui Dai, Mengran Gou, Chun-Ting Huang, Hongkai Xiong | N/A | |
| On Mitigating Hard Clusters for Face Clustering | Yingjie Chen, Huasong Zhong, Chong Chen, Chen Shen, Jianqiang Huang, Tao Wang, Yun Liang, Qianru Sun | N/A | |
| OneFace: One Threshold for All | Jiaheng Liu, Zhipeng Yu, Haoyu Qin, Yichao Wu, Ding Liang, Gangming Zhao, Ke Xu | N/A | |
| Label2Label: A Language Modeling Framework for Multi-Attribute Learning | Wanhua Li, Zhexuan Cao, Jianjiang Feng, Jie Zhou, Jiwen Lu | N/A | |
| AgeTransGAN for Facial Age Transformation with Rectified Performance Metrics | Gee-Sern Hsu, Rui-Cang Xie, Zhi-Ting Chen, Yu-Hong Lin | N/A | |
| Hierarchical Contrastive Inconsistency Learning for Deepfake Video Detection | Zhihao Gu, Taiping Yao, Yang Chen, Shouhong Ding, Lizhuang Ma | N/A | |
| Rethinking Robust Representation Learning under Fine-Grained Noisy Faces | Bingqi Ma, Guanglu Song, Boxiao Liu, Yu Liu | N/A | |
| Teaching Where to Look: Attention Similarity Knowledge Distillation for Low Resolution Face Recognition | Sungho Shin, Joosoon Lee, Junseok Lee, Yeonguk Yu, Kyoobin Lee | N/A | |
| Teaching with Soft Label Smoothing for Mitigating Noisy Labels in Facial Expressions | Tohar Lukov, Na Zhao, Gim Hee Lee, Ser-Nam Lim | N/A | |
| Learning Dynamic Facial Radiance Fields for Few-Shot Talking Head Synthesis | Shuai Shen, Wanhua Li, Zheng Zhu, Yueqi Duan, Jie Zhou, Jiwen Lu | N/A | |
| CoupleFace: Relation Matters for Face Recognition Distillation | Jiaheng Liu, Haoyu Qin, Yichao Wu, Jinyang Guo, Ding Liang, Ke Xu | N/A | |
| Controllable and Guided Face Synthesis for Unconstrained Face Recognition | Feng Liu, Minchul Kim, Anil Jain, Xiaoming Liu | N/A | |
| Towards Robust Face Recognition with Comprehensive Search | Manyuan Zhang, Guanglu Song, Yu Liu, Hongsheng Li | N/A | |
| Towards Unbiased Label Distribution Learning for Facial Pose Estimation Using Anisotropic Spherical Gaussian | Zhiwen Cao, Dongfang Liu, Qifan Wang, Yingjie Chen | N/A | |
| AU-Aware 3D Face Reconstruction through Personalized AU-Specific Blendshape Learning | Chenyi Kuang, Zijun Cui, Jeffrey O. Kephart, Qiang Ji | N/A | |
| BézierPalm: A Free Lunch for Palmprint Recognition | Kai Zhao, Lei Shen, Yingyi Zhang, Chuhan Zhou, Tao Wang, Ruixin Zhang, Shouhong Ding, Wei Jia, Wei Shen | N/A | |
| Adaptive Transformers for Robust Few-Shot Cross-Domain Face Anti-Spoofing | Hsin-Ping Huang, Deqing Sun, Yaojie Liu, Wen-Sheng Chu, Taihong Xiao, Jinwei Yuan, Hartwig Adam, Ming-Hsuan Yang | N/A | |
| Face2Faceρ: Real-Time High-Resolution One-Shot Face Reenactment | Kewei Yang, Kang Chen, Daoliang Guo, Song-Hai Zhang, Yuan-Chen Guo, Weidong Zhang | N/A | |
| Towards Racially Unbiased Skin Tone Estimation via Scene Disambiguation | Haiwen Feng, Timo Bolkart, Joachim Tesch, Michael J. Black, Victoria Abrevaya | N/A | |
| BoundaryFace: A Mining Framework with Noise Label Self-Correction for Face Recognition | Shijie Wu, Xun Gong | N/A | |
| Pre-training Strategies and Datasets for Facial Representation Learning | Adrian Bulat, Shiyang Cheng, Jing Yang, Andrew Garbett, Enrique Sanchez, Georgios Tzimiropoulos | N/A | |
| Look Both Ways: Self-Supervising Driver Gaze Estimation and Road Scene Saliency | Isaac Kasahara, Simon Stent, Hyun Soo Park | N/A | |
| MFIM: Megapixel Facial Identity Manipulation | Sanghyeon Na | N/A | |
| 3D Face Reconstruction with Dense Landmarks | Erroll Wood, Tadas Baltrušaitis, Charlie Hewitt, Matthew Johnson, Jingjing Shen, Nikola Milosavljević, Daniel Wilde, Stephan Garbin, Toby Sharp, Ivan Stojiljković, Tom Cashman, Julien Valentin | N/A | |
| Emotion-Aware Multi-View Contrastive Learning for Facial Emotion Recognition | Daeha Kim, Byung Cheol Song | N/A | |
| Order Learning Using Partially Ordered Data via Chainization | Seon-Ho Lee, Chang-Su Kim | N/A | |
| Unsupervised High-Fidelity Facial Texture Generation and Reconstruction | Ron Slossberg, Ibrahim Jubran, Ron Kimmel | N/A | |
| Multi-Domain Learning for Updating Face Anti-Spoofing Models | Xiao Guo, Yaojie Liu, Anil Jain, Xiaoming Liu | N/A | |
| Towards Metrical Reconstruction of Human Faces | Wojciech Zielonka, Timo Bolkart, Justus Thies | N/A | |
| Discover and Mitigate Unknown Biases with Debiasing Alternate Networks | Zhiheng Li, Anthony Hoogs, Chenliang Xu | N/A | |
| Unsupervised and Semi-Supervised Bias Benchmarking in Face Recognition | Alexandra Chouldechova, Siqi Deng, Yongxin Wang, Wei Xia, Pietro Perona | N/A | |
| Towards Efficient Adversarial Training on Vision Transformers | Boxi Wu, Jindong Gu, Zhifeng Li, Deng Cai, Xiaofei He, Wei Liu | N/A | |
| MIME: Minority Inclusion for Majority Group Enhancement of AI Performance | Pradyumna Chari, Yunhao Ba, Shreeram Athreya, Achuta Kadambi | N/A | |
| Studying Bias in GANs through the Lens of Race | Vongani H. Maluleke, Neerja Thakkar, Tim Brooks, Ethan Weber, Trevor Darrell, Alexei A. Efros, Angjoo Kanazawa, Devin Guillory | N/A | |
| Trust, but Verify: Using Self-Supervised Probing to Improve Trustworthiness | Ailin Deng, Shen Li, Miao Xiong, Zhirui Chen, Bryan Hooi | N/A | |
| Learning to Censor by Noisy Sampling | Ayush Chopra, Abhinav Java, Abhishek Singh, Vivek Sharma, Ramesh Raskar | N/A | |
| An Invisible Black-Box Backdoor Attack through Frequency Domain | Tong Wang, Yuan Yao, Feng Xu, Shengwei An, Hanghang Tong, Ting Wang | N/A | |
| FairGRAPE: Fairness-Aware GRAdient Pruning mEthod for Face Attribute Classification | Xiaofeng Lin, Seungbae Kim, Jungseock Joo | N/A | |
| Attaining Class-Level Forgetting in Pretrained Model Using Few Samples | Pravendra Singh, Pratik Mazumder, Mohammed Asad Karim | N/A | |
| Anti-Neuron Watermarking: Protecting Personal Data against Unauthorized Neural Networks | Zihang Zou, Boqing Gong, Liqiang Wang | N/A | |
| An Impartial Take to the CNN vs Transformer Robustness Contest | Francesco Pinto, Philip H. S. Torr, Puneet K. Dokania | N/A | |
| Recover Fair Deep Classification Models via Altering Pre-trained Structure | Yanfu Zhang, Shangqian Gao, Heng Huang | N/A | |
| Decouple-and-Sample: Protecting Sensitive Information in Task Agnostic Data Release | Abhishek Singh, Ethan Garza, Ayush Chopra, Praneeth Vepakomma, Vivek Sharma, Ramesh Raskar | N/A | |
| Privacy-Preserving Action Recognition via Motion Difference Quantization | Sudhakar Kumawat, Hajime Nagahara | N/A | |
| Latent Space Smoothing for Individually Fair Representations | Momchil Peychev, Anian Ruoss, Mislav Balunović, Maximilian Baader, Martin Vechev | N/A | |
| Parameterized Temperature Scaling for Boosting the Expressive Power in Post-Hoc Uncertainty Calibration | Christian Tomani, Daniel Cremers, Florian Buettner | N/A | |
| FairStyle: Debiasing StyleGAN2 with Style Channel Manipulations | Cemre Efe Karakas, Alara Dirik, Eylül Yalçınkaya, Pinar Yanardag | N/A | |
| Distilling the Undistillable: Learning from a Nasty Teacher | Surgan Jandial, Yash Khasbage, Arghya Pal, Vineeth N Balasubramanian, Balaji Krishnamurthy | N/A | |
| SOS! Self-Supervised Learning over Sets of Handled Objects in Egocentric Action Recognition | Victor Escorcia, Ricardo Guerrero, Xiatian Zhu, Brais Martinez | N/A | |
| Egocentric Activity Recognition and Localization on a 3D Map | Miao Liu, Lingni Ma, Kiran Somasundaram, Yin Li, Kristen Grauman, James M. Rehg, Chao Li | N/A | |
| Generative Adversarial Network for Future Hand Segmentation from Egocentric Video | Wenqi Jia, Miao Liu, James M. Rehg | N/A | |
| My View Is the Best View: Procedure Learning from Egocentric Videos | Siddhant Bansal, Chetan Arora, C.V. Jawahar | N/A | |
| GIMO: Gaze-Informed Human Motion Prediction in Context | Yang Zheng, Yanchao Yang, Kaichun Mo, Jiaman Li, Tao Yu, Yebin Liu, Karen Liu, Leonidas J. Guibas | N/A | |
| Image-Based CLIP-Guided Essence Transfer | Hila Chefer, Sagie Benaim, Roni Paiss, Lior Wolf | N/A | |
| Detecting and Recovering Sequential DeepFake Manipulation | Rui Shao, Tianxing Wu, Ziwei Liu | N/A | |
| Self-Supervised Sparse Representation for Video Anomaly Detection | Jhih-Ciang Wu, He-Yen Hsieh, Ding-Jie Chen, Chiou-Shann Fuh, Tyng-Luh Liu | N/A | |
| Watermark Vaccine: Adversarial Attacks to Prevent Watermark Removal | Xinwei Liu, Jian Liu, Yang Bai, Jindong Gu, Tao Chen, Xiaojun Jia, Xiaochun Cao | N/A | |
| Explaining Deepfake Detection by Analysing Image Matching | Shichao Dong, Jin Wang, Jiajun Liang, Haoqiang Fan, Renhe Ji | N/A | |
| FrequencyLowCut Pooling – Plug & Play against Catastrophic Overfitting | Julia Grabinski, Steffen Jung, Janis Keuper, Margret Keuper | N/A | |
| TAFIM: Targeted Adversarial Attacks against Facial Image Manipulations | Shivangi Aneja, Lev Markhasin, Matthias Nießner | N/A | |
| FingerprintNet: Synthesized Fingerprints for Generated Image Detection | Yonghyun Jeong, Doyeon Kim, Youngmin Ro, Pyounggeon Kim, Jongwon Choi | N/A | |
| Detecting Generated Images by Real Images | Bo Liu, Fan Yang, Xiuli Bi, Bin Xiao, Weisheng Li, Xinbo Gao | N/A | |
| An Information Theoretic Approach for Attention-Driven Face Forgery Detection | Ke Sun, Hong Liu, Taiping Yao, Xiaoshuai Sun, Shen Chen, Shouhong Ding, Rongrong Ji | N/A | |
| Exploring Disentangled Content Information for Face Forgery Detection | Jiahao Liang, Huafeng Shi, Weihong Deng | N/A | |
| RepMix: Representation Mixing for Robust Attribution of Synthesized Images | Tu Bui, Ning Yu, John Collomosse | N/A | |
| Totems: Physical Objects for Verifying Visual Integrity | Jingwei Ma, Lucy Chai, Minyoung Huh, Tongzhou Wang, Ser-Nam Lim, Phillip Isola, Antonio Torralba | N/A | |
| Dual-Stream Knowledge-Preserving Hashing for Unsupervised Video Retrieval | Pandeng Li, Hongtao Xie, Jiannan Ge, Lei Zhang, Shaobo Min, Yongdong Zhang | N/A | |
| PASS: Part-Aware Self-Supervised Pre-training for Person Re-identification | Kuan Zhu, Haiyun Guo, Tianyi Yan, Yousong Zhu, Jinqiao Wang, Ming Tang | N/A | |
| Adaptive Cross-Domain Learning for Generalizable Person Re-identification | Pengyi Zhang, Huanzhang Dou, Yunlong Yu, Xi Li | N/A | |
| Multi-Query Video Retrieval | Zeyu Wang, Yu Wu, Karthik Narasimhan, Olga Russakovsky | N/A | |
| Hierarchical Average Precision Training for Pertinent Image Retrieval | Elias Ramzi, Nicolas Audebert, Nicolas Thome, Clément Rambour, Xavier Bitot | N/A | |
| Learning Semantic Correspondence with Sparse Annotations | Shuaiyi Huang, Luyu Yang, Bo He, Songyang Zhang, Xuming He, Abhinav Shrivastava | N/A | |
| Dynamically Transformed Instance Normalization Network for Generalizable Person Re-identification | Bingliang Jiao, Lingqiao Liu, Liying Gao, Guosheng Lin, Lu Yang, Shizhou Zhang, Peng Wang, Yanning Zhang | N/A | |
| Domain Adaptive Person Search | Junjie Li, Yichao Yan, Guanshuo Wang, Fufu Yu, Qiong Jia, Shouhong Ding | N/A | |
| TS2-Net: Token Shift and Selection Transformer for Text-Video Retrieval | Yuqi Liu, Pengfei Xiong, Luhui Xu, Shengming Cao, Qin Jin | N/A | |
| Unstructured Feature Decoupling for Vehicle Re-identification | Wen Qian, Hao Luo, Silong Peng, Fan Wang, Chen Chen, Hao Li | N/A | |
| Deep Hash Distillation for Image Retrieval | Young Kyun Jang, Geonmo Gu, Byungsoo Ko, Isaac Kang, Nam Ik Cho | N/A | |
| Mimic Embedding via Adaptive Aggregation: Learning Generalizable Person Re-identification | Boqiang Xu, Jian Liang, Lingxiao He, Zhenan Sun | N/A | |
| Granularity-Aware Adaptation for Image Retrieval over Multiple Tasks | Jon Almazán, Byungsoo Ko, Geonmo Gu, Diane Larlus, Yannis Kalantidis | N/A | |
| Learning Audio-Video Modalities from Image Captions | Arsha Nagrani, Paul Hongsuck Seo, Bryan Seybold, Anja Hauth, Santiago Manen, Chen Sun, Cordelia Schmid | N/A | |
| RVSL: Robust Vehicle Similarity Learning in Real Hazy Scenes Based on Semi-Supervised Learning | Wei-Ting Chen, I-Hsiang Chen, Chih-Yuan Yeh, Hao-Hsiang Yang, Hua-En Chang, Jian-Jiun Ding, Sy-Yen Kuo | N/A | |
| Lightweight Attentional Feature Fusion: A New Baseline for Text-to-Video Retrieval | Fan Hu, Aozhu Chen, Ziyue Wang, Fangming Zhou, Jianfeng Dong, Xirong Li | N/A | |
| Modality Synergy Complement Learning with Cascaded Aggregation for Visible-Infrared Person Re-identification | Yiyuan Zhang, Sanyuan Zhao, Yuhao Kang, Jianbing Shen | N/A | |
| Cross-Modality Transformer for Visible-Infrared Person Re-identification | Kongzhu Jiang, Tianzhu Zhang, Xiang Liu, Bingqiao Qian, Yongdong Zhang, Feng Wu | N/A | |
| Audio-Visual Mismatch-Aware Video Retrieval via Association and Adjustment | Sangmin Lee, Sungjune Park, Yong Man Ro | N/A | |
| Connecting Compression Spaces with Transformer for Approximate Nearest Neighbor Search | Haokui Zhang, Buzhou Tang, Wenze Hu, Xiaoyu Wang | N/A | |
| SEMICON: A Learning-to-Hash Solution for Large-Scale Fine-Grained Image Retrieval | Yang Shen, Xuhao Sun, Xiu-Shen Wei, Qing-Yuan Jiang, Jian Yang | N/A | |
| CAViT: Contextual Alignment Vision Transformer for Video Object Re-identification | Jinlin Wu, Lingxiao He, Wu Liu, Yang Yang, Zhen Lei, Tao Mei, Stan Z. Li | N/A | |
| Text-Based Temporal Localization of Novel Events | Sudipta Paul, Niluthpol Chowdhury Mithun, Amit K. Roy-Chowdhury | N/A | |
| Reliability-Aware Prediction via Uncertainty Learning for Person Image Retrieval | Zhaopeng Dou, Zhongdao Wang, Weihua Chen, Yali Li, Shengjin Wang | N/A | |
| Relighting4D: Neural Relightable Human from Videos | Zhaoxi Chen, Ziwei Liu | N/A | |
| Real-Time Intermediate Flow Estimation for Video Frame Interpolation | Zhewei Huang, Tianyuan Zhang, Wen Heng, Boxin Shi, Shuchang Zhou | N/A | |
| PixelFolder: An Efficient Progressive Pixel Synthesis Network for Image Generation | Jing He, Yiyi Zhou, Qi Zhang, Jun Peng, Yunhang Shen, Xiaoshuai Sun, Chao Chen, Rongrong Ji | N/A | |
| StyleSwap: Style-Based Generator Empowers Robust Face Swapping | Zhiliang Xu, Hang Zhou, Zhibin Hong, Ziwei Liu, Jiaming Liu, Zhizhi Guo, Junyu Han, Jingtuo Liu, Errui Ding, Jingdong Wang | N/A | |
| Paint2Pix: Interactive Painting Based Progressive Image Synthesis and Editing | Jaskirat Singh, Liang Zheng, Cameron Smith, Jose Echevarria | N/A | |
| FurryGAN: High Quality Foreground-Aware Image Synthesis | Jeongmin Bae, Mingi Kwon, Youngjung Uh | N/A | |
| SCAM! Transferring Humans between Images with Semantic Cross Attention Modulation | Nicolas Dufour, David Picard, Vicky Kalogeiton | N/A | |
| Sem2NeRF: Converting Single-View Semantic Masks to Neural Radiance Fields | Yuedong Chen, Qianyi Wu, Chuanxia Zheng, Tat-Jen Cham, Jianfei Cai | N/A | |
| WaveGAN: Frequency-Aware GAN for High-Fidelity Few-Shot Image Generation | Mengping Yang, Zhe Wang, Ziqiu Chi, Wenyi Feng | N/A | |
| End-to-End Visual Editing with a Generatively Pre-trained Artist | Andrew Brown, Cheng-Yang Fu, Omkar Parkhi, Tamara L. Berg, Andrea Vedaldi | N/A | |
| High-Fidelity GAN Inversion with Padding Space | Qingyan Bai, Yinghao Xu, Jiapeng Zhu, Weihao Xia, Yujiu Yang, Yujun Shen | N/A | |
| Designing One Unified Framework for High-Fidelity Face Reenactment and Swapping | Chao Xu, Jiangning Zhang, Yue Han, Guanzhong Tian, Xianfang Zeng, Ying Tai, Yabiao Wang, Chengjie Wang, Yong Liu | N/A | |
| Sobolev Training for Implicit Neural Representations with Approximated Image Derivatives | Wentao Yuan, Qingtian Zhu, Xiangyue Liu, Yikang Ding, Haotian Zhang, Chi Zhang | N/A | |
| Make-a-Scene: Scene-Based Text-to-Image Generation with Human Priors | Oran Gafni, Adam Polyak, Oron Ashual, Shelly Sheynin, Devi Parikh, Yaniv Taigman | N/A | |
| 3D-FM GAN: Towards 3D-Controllable Face Manipulation | Yuchen Liu, Zhixin Shu, Yijun Li, Zhe Lin, Richard Zhang, S.Y. Kung | N/A | |
| Multi-Curve Translator for High-Resolution Photorealistic Image Translation | Yuda Song, Hui Qian, Xin Du | N/A | |
| Deep Bayesian Video Frame Interpolation | Zhiyang Yu, Yu Zhang, Xujie Xiang, Dongqing Zou, Xijun Chen, Jimmy S. Ren | N/A | |
| Cross Attention Based Style Distribution for Controllable Person Image Synthesis | Xinyue Zhou, Mingyu Yin, Xinyuan Chen, Li Sun, Changxin Gao, Qingli Li | N/A | |
| KeypointNeRF: Generalizing Image-Based Volumetric Avatars Using Relative Spatial Encoding of Keypoints | Marko Mihajlovic, Aayush Bansal, Michael Zollhöfer, Siyu Tang, Shunsuke Saito | N/A | |
| ViewFormer: NeRF-Free Neural Rendering from Few Images Using Transformers | Jonáš Kulhánek, Erik Derner, Torsten Sattler, Robert Babuška | N/A | |
| L-Tracing: Fast Light Visibility Estimation on Neural Surfaces by Sphere Tracing | Ziyu Chen, Chenjing Ding, Jianfei Guo, Dongliang Wang, Yikang Li, Xuan Xiao, Wei Wu, Li Song | N/A | |
| A Perceptual Quality Metric for Video Frame Interpolation | Qiqi Hou, Abhijay Ghildyal, Feng Liu | N/A | |
| Adaptive Feature Interpolation for Low-Shot Image Generation | Mengyu Dai, Haibin Hang, Xiaoyang Guo | N/A | |
| PalGAN: Image Colorization with Palette Generative Adversarial Networks | Yi Wang, Menghan Xia, Lu Qi, Jing Shao, Yu Qiao | N/A | |
| Fast-Vid2Vid: Spatial-Temporal Compression for Video-to-Video Synthesis | Long Zhuo, Guangcong Wang, Shikai Li, Wayne Wu, Ziwei Liu | N/A | |
| Learning Prior Feature and Attention Enhanced Image Inpainting | Chenjie Cao, Qiaole Dong, Yanwei Fu | N/A | |
| Temporal-MPI: Enabling Multi-Plane Images for Dynamic Scene Modelling via Temporal Basis Learning | Wenpeng Xing, Jie Chen | N/A | |
| 3D-Aware Semantic-Guided Generative Model for Human Synthesis | Jichao Zhang, Enver Sangineto, Hao Tang, Aliaksandr Siarohin, Zhun Zhong, Nicu Sebe, Wei Wang | N/A | |
| Temporally Consistent Semantic Video Editing | Yiran Xu, Badour AlBahar, Jia-Bin Huang | N/A | |
| Error Compensation Framework for Flow-Guided Video Inpainting | Jaeyeon Kang, Seoung Wug Oh, Seon Joo Kim | N/A | |
| Scraping Textures from Natural Images for Synthesis and Editing | Xueting Li, Xiaolong Wang, Ming-Hsuan Yang, Alexei A. Efros, Sifei Liu | N/A | |
| Single Stage Virtual Try-On via Deformable Attention Flows | Shuai Bai, Huiling Zhou, Zhikang Li, Chang Zhou, Hongxia Yang | N/A | |
| Improving GANs for Long-Tailed Data through Group Spectral Regularization | Harsh Rangwani, Naman Jaswani, Tejan Karmali, Varun Jampani, R. Venkatesh Babu | N/A | |
| Hierarchical Semantic Regularization of Latent Spaces in StyleGANs | Tejan Karmali, Rishubh Parihar, Susmit Agrawal, Harsh Rangwani, Varun Jampani, Maneesh Singh, R. Venkatesh Babu | N/A | |
| IntereStyle: Encoding an Interest Region for Robust StyleGAN Inversion | Seung-Jun Moon, Gyeong-Moon Park | N/A | |
| StyleLight: HDR Panorama Generation for Lighting Estimation and Editing | Guangcong Wang, Yinuo Yang, Chen Change Loy, Ziwei Liu | N/A | |
| Contrastive Monotonic Pixel-Level Modulation | Kun Lu, Rongpeng Li, Honggang Zhang | N/A | |
| Learning Cross-Video Neural Representations for High-Quality Frame Interpolation | Wentao Shangguan, Yu Sun, Weijie Gan, Ulugbek S. Kamilov | N/A | |
| Learning Continuous Implicit Representation for Near-Periodic Patterns | Bowei Chen, Tiancheng Zhi, Martial Hebert, Srinivasa G. Narasimhan | N/A | |
| End-to-End Graph-Constrained Vectorized Floorplan Generation with Panoptic Refinement | Jiachen Liu, Yuan Xue, Jose Duarte, Krishnendra Shekhawat, Zihan Zhou, Xiaolei Huang | N/A | |
| Few-Shot Image Generation with Mixup-Based Distance Learning | Chaerin Kong, Jeesoo Kim, Donghoon Han, Nojun Kwak | N/A | |
| A Style-Based GAN Encoder for High Fidelity Reconstruction of Images and Videos | Xu Yao, Alasdair Newson, Yann Gousseau, Pierre Hellier | N/A | |
| FakeCLR: Exploring Contrastive Learning for Solving Latent Discontinuity in Data-Efficient GANs | Ziqiang Li, Chaoyue Wang, Heliang Zheng, Jing Zhang, Bin Li | N/A | |
| BlobGAN: Spatially Disentangled Scene Representations | Dave Epstein, Taesung Park, Richard Zhang, Eli Shechtman, Alexei A. Efros | N/A | |
| Unified Implicit Neural Stylization | Zhiwen Fan, Yifan Jiang, Peihao Wang, Xinyu Gong, Dejia Xu, Zhangyang Wang | N/A | |
| GAN with Multivariate Disentangling for Controllable Hair Editing | Xuyang Guo, Meina Kan, Tianle Chen, Shiguang Shan | N/A | |
| Discovering Transferable Forensic Features for CNN-Generated Images Detection | Keshigeyan Chandrasegaran, Ngoc-Trung Tran, Alexander Binder, Ngai-Man Cheung | N/A | |
| Harmonizer: Learning to Perform White-Box Image and Video Harmonization | Zhanghan Ke, Chunyi Sun, Lei Zhu, Ke Xu, Rynson W.H. Lau | N/A | |
| Text2LIVE: Text-Driven Layered Image and Video Editing | Omer Bar-Tal, Dolev Ofri-Amar, Rafail Fridman, Yoni Kasten, Tali Dekel | N/A | |
| Digging into Radiance Grid for Real-Time View Synthesis with Detail Preservation | Jian Zhang, Jinchi Huang, Bowen Cai, Huan Fu, Mingming Gong, Chaohui Wang, Jiaming Wang, Hongchen Luo, Rongfei Jia, Binqiang Zhao, Xing Tang | N/A | |
| StyleGAN-Human: A Data-Centric Odyssey of Human Generation | Jianglin Fu, Shikai Li, Yuming Jiang, Kwan-Yee Lin, Chen Qian, Chen Change Loy, Wayne Wu, Ziwei Liu | N/A | |
| ColorFormer: Image Colorization via Color Memory Assisted Hybrid-Attention Transformer | Xiaozhong Ji, Boyuan Jiang, Donghao Luo, Guangpin Tao, Wenqing Chu, Zhifeng Xie, Chengjie Wang, Ying Tai | N/A | |
| EAGAN: Efficient Two-Stage Evolutionary Architecture Search for GANs | Guohao Ying, Xin He, Bin Gao, Bo Han, Xiaowen Chu | N/A | |
| Weakly-Supervised Stitching Network for Real-World Panoramic Image Generation | Dae-Young Song, Geonsoo Lee, HeeKyung Lee, Gi-Mun Um, Donghyeon Cho | N/A | |
| DynaST: Dynamic Sparse Transformer for Exemplar-Guided Image Generation | Songhua Liu, Jingwen Ye, Sucheng Ren, Xinchao Wang | N/A | |
| Multimodal Conditional Image Synthesis with Product-of-Experts GANs | Xun Huang, Arun Mallya, Ting-Chun Wang, Ming-Yu Liu | N/A | |
| Auto-Regressive Image Synthesis with Integrated Quantization | Fangneng Zhan, Yingchen Yu, Rongliang Wu, Jiahui Zhang, Kaiwen Cui, Changgong Zhang, Shijian Lu | N/A | |
| JoJoGAN: One Shot Face Stylization | Min Jin Chong, David Forsyth | N/A | |
| VecGAN: Image-to-Image Translation with Interpretable Latent Directions | Yusuf Dalva, Said Fahri Altındiş, Aysegul Dundar | N/A | |
| Any-Resolution Training for High-Resolution Image Synthesis | Lucy Chai, Michaël Gharbi, Eli Shechtman, Phillip Isola, Richard Zhang | N/A | |
| CCPL: Contrastive Coherence Preserving Loss for Versatile Style Transfer | Zijie Wu, Zhen Zhu, Junping Du, Xiang Bai | N/A | |
| CANF-VC: Conditional Augmented Normalizing Flows for Video Compression | Yung-Han Ho, Chih-Peng Chang, Peng-Yu Chen, Alessandro Gnutti, Wen-Hsiao Peng | N/A | |
| Bi-Level Feature Alignment for Versatile Image Translation and Manipulation | Fangneng Zhan, Yingchen Yu, Rongliang Wu, Jiahui Zhang, Kaiwen Cui, Aoran Xiao, Shijian Lu, Chunyan Miao | N/A | |
| High-Fidelity Image Inpainting with GAN Inversion | Yongsheng Yu, Libo Zhang, Heng Fan, Tiejian Luo | N/A | |
| DeltaGAN: Towards Diverse Few-Shot Image Generation with Sample-Specific Delta | Yan Hong, Li Niu, Jianfu Zhang, Liqing Zhang | N/A | |
| Image Inpainting with Cascaded Modulation GAN and Object-Aware Training | Haitian Zheng, Zhe Lin, Jingwan Lu, Scott Cohen, Eli Shechtman, Connelly Barnes, Jianming Zhang, Ning Xu, Sohrab Amirghodsi, Jiebo Luo | N/A | |
| StyleFace: Towards Identity-Disentangled Face Generation on Megapixels | Yuchen Luo, Junwei Zhu, Keke He, Wenqing Chu, Ying Tai, Chengjie Wang, Junchi Yan | N/A | |
| Video Extrapolation in Space and Time | Yunzhi Zhang, Jiajun Wu | N/A | |
| Contrastive Learning for Diverse Disentangled Foreground Generation | Yuheng Li, Yijun Li, Jingwan Lu, Eli Shechtman, Yong Jae Lee, Krishna Kumar Singh | N/A | |
| BIPS: Bi-modal Indoor Panorama Synthesis via Residual Depth-Aided Adversarial Learning | Changgyoon Oh, Wonjune Cho, Yujeong Chae, Daehee Park, Lin Wang, Kuk-Jin Yoon | N/A | |
| Augmentation of rPPG Benchmark Datasets: Learning to Remove and Embed rPPG Signals via Double Cycle Consistent Learning from Unpaired Facial Videos | Cheng-Ju Hsieh, Wei-Hao Chung, Chiou-Ting Hsu | N/A | |
| Geometry-Aware Single-Image Full-Body Human Relighting | Chaonan Ji, Tao Yu, Kaiwen Guo, Jingxin Liu, Yebin Liu | N/A | |
| 3D-Aware Indoor Scene Synthesis with Depth Priors | Zifan Shi, Yujun Shen, Jiapeng Zhu, Dit-Yan Yeung, Qifeng Chen | N/A | |
| Deep Portrait Delighting | Joshua Weir, Junhong Zhao, Andrew Chalmers, Taehyun Rhee | N/A | |
| Vector Quantized Image-to-Image Translation | Yu-Jie Chen, Shin-I Cheng, Wei-Chen Chiu, Hung-Yu Tseng, Hsin-Ying Lee | N/A | |
| The Surprisingly Straightforward Scene Text Removal Method with Gated Attention and Region of Interest Generation: A Comprehensive Prominent Model Analysis | Hyeonsu Lee, Chankyu Choi | N/A | |
| Free-Viewpoint RGB-D Human Performance Capture and Rendering | Phong Nguyen-Ha, Nikolaos Sarafianos, Christoph Lassner, Janne Heikkilä, Tony Tung | N/A | |
| Multiview Regenerative Morphing with Dual Flows | Chih-Jung Tsai, Cheng Sun, Hwann-Tzong Chen | N/A | |
| Hallucinating Pose-Compatible Scenes | Tim Brooks, Alexei A. Efros | N/A | |
| Motion and Appearance Adaptation for Cross-Domain Motion Transfer | Borun Xu, Biao Wang, Jinhong Deng, Jiale Tao, Tiezheng Ge, Yuning Jiang, Wen Li, Lixin Duan | N/A | |
| Layered Controllable Video Generation | Jiahui Huang, Yuhe Jin, Kwang Moo Yi, Leonid Sigal | N/A | |
| Custom Structure Preservation in Face Aging | Guillermo Gomez-Trenado, Stéphane Lathuilière, Pablo Mesejo, Óscar Cordón | N/A | |
| Spatio-Temporal Deformable Attention Network for Video Deblurring | Huicong Zhang, Haozhe Xie, Hongxun Yao | N/A | |
| NeuMesh: Learning Disentangled Neural Mesh-Based Implicit Field for Geometry and Texture Editing | Bangbang Yang, Chong Bao, Junyi Zeng, Hujun Bao, Yinda Zhang, Zhaopeng Cui, Guofeng Zhang | N/A | |
| NeRF for Outdoor Scene Relighting | Viktor Rudnev, Mohamed Elgharib, William Smith, Lingjie Liu, Vladislav Golyanik, Christian Theobalt | N/A | |
| CoGS: Controllable Generation and Search from Sketch and Style | Cusuh Ham, Gemma Canet Tarrés, Tu Bui, James Hays, Zhe Lin, John Collomosse | N/A | |
| HairNet: Hairstyle Transfer with Pose Changes | Peihao Zhu, Rameen Abdal, John Femiani, Peter Wonka | N/A | |
| Unbiased Multi-Modality Guidance for Image Inpainting | Yongsheng Yu, Dawei Du, Libo Zhang, Tiejian Luo | N/A | |
| Intelli-Paint: Towards Developing More Human-Intelligible Painting Agents | Jaskirat Singh, Cameron Smith, Jose Echevarria, Liang Zheng | N/A | |
| Motion Transformer for Unsupervised Image Animation | Jiale Tao, Biao Wang, Tiezheng Ge, Yuning Jiang, Wen Li, Lixin Duan | N/A | |
| NÜWA: Visual Synthesis Pre-training for Neural visUal World creAtion | Chenfei Wu, Jian Liang, Lei Ji, Fan Yang, Yuejian Fang, Daxin Jiang, Nan Duan | N/A | |
| EleGANt: Exquisite and Locally Editable GAN for Makeup Transfer | Chenyu Yang, Wanrong He, Yingqing Xu, Yang Gao | N/A | |
| Editing Out-of-Domain GAN Inversion via Differential Activations | Haorui Song, Yong Du, Tianyi Xiang, Junyu Dong, Jing Qin, Shengfeng He | N/A | |
| On the Robustness of Quality Measures for GANs | Motasem Alfarra, Juan C. Pérez, Anna Frühstück, Philip H. S. Torr, Peter Wonka, Bernard Ghanem | N/A | |
| Sound-Guided Semantic Video Generation | Seung Hyun Lee, Gyeongrok Oh, Wonmin Byeon, Chanyoung Kim, Won Jeong Ryoo, Sang Ho Yoon, Hyunjun Cho, Jihyun Bae, Jinkyu Kim, Sangpil Kim | N/A | |
| Inpainting at Modern Camera Resolution by Guided PatchMatch with Auto-Curation | Lingzhi Zhang, Connelly Barnes, Kevin Wampler, Sohrab Amirghodsi, Eli Shechtman, Zhe Lin, Jianbo Shi | N/A | |
| Controllable Video Generation through Global and Local Motion Dynamics | Aram Davtyan, Paolo Favaro | N/A | |
| StyleHEAT: One-Shot High-Resolution Editable Talking Face Generation via Pre-trained StyleGAN | Fei Yin, Yong Zhang, Xiaodong Cun, Mingdeng Cao, Yanbo Fan, Xuan Wang, Qingyan Bai, Baoyuan Wu, Jue Wang, Yujiu Yang | N/A | |
| Long Video Generation with Time-Agnostic VQGAN and Time-Sensitive Transformer | Songwei Ge, Thomas Hayes, Harry Yang, Xi Yin, Guan Pang, David Jacobs, Jia-Bin Huang, Devi Parikh | N/A | |
| Combining Internal and External Constraints for Unrolling Shutter in Videos | Eyal Naor, Itai Antebi, Shai Bagon, Michal Irani | N/A | |
| WISE: Whitebox Image Stylization by Example-Based Learning | Winfried Lötzsch, Max Reimann, Martin Büssemeyer, Amir Semmo, Jürgen Döllner, Matthias Trapp | N/A | |
| Neural Radiance Transfer Fields for Relightable Novel-View Synthesis with Global Illumination | Linjie Lyu, Ayush Tewari, Thomas Leimkühler, Marc Habermann, Christian Theobalt | N/A | |
| Transformers As Meta-Learners for Implicit Neural Representations | Yinbo Chen, Xiaolong Wang | N/A | |
| Style Your Hair: Latent Optimization for Pose-Invariant Hairstyle Transfer via Local-Style-Aware Hair Alignment | Taewoo Kim, Chaeyeon Chung, Yoonseo Kim, Sunghyun Park, Kangyeol Kim, Jaegul Choo | N/A | |
| High-Resolution Virtual Try-On with Misalignment and Occlusion-Handled Conditions | Sangyun Lee, Gyojung Gu, Sunghyun Park, Seunghwan Choi, Jaegul Choo | N/A | |
| A Codec Information Assisted Framework for Efficient Compressed Video Super-Resolution | Hengsheng Zhang, Xueyi Zou, Jiaming Guo, Youliang Yan, Rong Xie, Li Song | N/A | |
| Injecting 3D Perception of Controllable NeRF-GAN into StyleGAN for Editable Portrait Image Synthesis | Jeong-gi Kwak, Yuanming Li, Dongsik Yoon, Donghyeon Kim, David Han, Hanseok Ko | N/A | |
| AdaNeRF: Adaptive Sampling for Real-Time Rendering of Neural Radiance Fields | Andreas Kurz, Thomas Neff, Zhaoyang Lv, Michael Zollhöfer, Markus Steinberger | N/A | |
| Improving the Perceptual Quality of 2D Animation Interpolation | Shuhong Chen, Matthias Zwicker | N/A | |
| Selective TransHDR: Transformer-Based Selective HDR Imaging Using Ghost Region Mask | Jou Won Song, Ye-In Park, Kyeongbo Kong, Jaeho Kwak, Suk-Ju Kang | N/A | |
| Learning Series-Parallel Lookup Tables for Efficient Image Super-Resolution | Cheng Ma, Jingyi Zhang, Jie Zhou, Jiwen Lu | N/A | |
| GeoAug: Data Augmentation for Few-Shot NeRF with Geometry Constraints | Di Chen, Yu Liu, Lianghua Huang, Bin Wang, Pan Pan | N/A | |
| DoodleFormer: Creative Sketch Drawing with Transformers | Ankan Kumar Bhunia, Salman Khan, Hisham Cholakkal, Rao Muhammad Anwer, Fahad Shahbaz Khan, Jorma Laaksonen, Michael Felsberg | N/A | |
| Implicit Neural Representations for Variable Length Human Motion Generation | Pablo Cervantes, Yusuke Sekikawa, Ikuro Sato, Koichi Shinoda | N/A | |
| Learning Object Placement via Dual-Path Graph Completion | Siyuan Zhou, Liu Liu, Li Niu, Liqing Zhang | N/A | |
| Expanded Adaptive Scaling Normalization for End to End Image Compression | Chajin Shin, Hyeongmin Lee, Hanbin Son, Sangjin Lee, Dogyoon Lee, Sangyoun Lee | N/A | |
| Generator Knows What Discriminator Should Learn in Unconditional GANs | Gayoung Lee, Hyunsu Kim, Junho Kim, Seonghyeon Kim, Jung-Woo Ha, Yunjey Choi | N/A | |
| Compositional Visual Generation with Composable Diffusion Models | Nan Liu, Shuang Li, Yilun Du, Antonio Torralba, Joshua B. Tenenbaum | N/A | |
| ManiFest: Manifold Deformation for Few-Shot Image Translation | Fabio Pizzati, Jean-François Lalonde, Raoul de Charette | N/A | |
| Supervised Attribute Information Removal and Reconstruction for Image Manipulation | Nannan Li, Bryan A. Plummer | N/A | |
| BLT: Bidirectional Layout Transformer for Controllable Layout Generation | Xiang Kong, Lu Jiang, Huiwen Chang, Han Zhang, Yuan Hao, Haifeng Gong, Irfan Essa | N/A | |
| Diverse Generation from a Single Video Made Possible | Niv Haim, Ben Feinstein, Niv Granot, Assaf Shocher, Shai Bagon, Tali Dekel, Michal Irani | N/A | |
| Rayleigh EigenDirections (REDs): Nonlinear GAN Latent Space Traversals for Multidimensional Features | Guha Balakrishnan, Raghudeep Gadde, Aleix Martinez, Pietro Perona | N/A | |
| Bridging the Domain Gap towards Generalization in Automatic Colorization | Hyejin Lee, Daehee Kim, Daeun Lee, Jinkyu Kim, Jaekoo Lee | N/A | |
| Generating Natural Images with Direct Patch Distributions Matching | Ariel Elnekave, Yair Weiss | N/A | |
| Context-Consistent Semantic Image Editing with Style-Preserved Modulation | Wuyang Luo, Su Yang, Hong Wang, Bo Long, Weishan Zhang | N/A | |
| Eliminating Gradient Conflict in Reference-Based Line-Art Colorization | Zekun Li, Zhengyang Geng, Zhao Kang, Wenyu Chen, Yibo Yang | N/A | |
| Unsupervised Learning of Efficient Geometry-Aware Neural Articulated Representations | Atsuhiro Noguchi, Xiao Sun, Stephen Lin, Tatsuya Harada | N/A | |
| JPEG Artifacts Removal via Contrastive Representation Learning | Xi Wang, Xueyang Fu, Yurui Zhu, Zheng-Jun Zha | N/A | |
| Unpaired Deep Image Dehazing Using Contrastive Disentanglement Learning | Xiang Chen, Zhentao Fan, Pengpeng Li, Longgang Dai, Caihua Kong, Zhuoran Zheng, Yufeng Huang, Yufeng Li | N/A | |
| Efficient Long-Range Attention Network for Image Super-Resolution | Xindong Zhang, Hui Zeng, Shi Guo, Lei Zhang | N/A | |
| FlowFormer: A Transformer Architecture for Optical Flow | Zhaoyang Huang, Xiaoyu Shi, Chao Zhang, Qiang Wang, Ka Chun Cheung, Hongwei Qin, Jifeng Dai, Hongsheng Li | N/A | |
| Coarse-to-Fine Sparse Transformer for Hyperspectral Image Reconstruction | Yuanhao Cai, Jing Lin, Xiaowan Hu, Haoqian Wang, Xin Yuan, Yulun Zhang, Radu Timofte, Luc Van Gool | N/A | |
| Learning Shadow Correspondence for Video Shadow Detection | Xinpeng Ding, Jingwen Yang, Xiaowei Hu, Xiaomeng Li | N/A | |
| Metric Learning Based Interactive Modulation for Real-World Super-Resolution | Chong Mou, Yanze Wu, Xintao Wang, Chao Dong, Jian Zhang, Ying Shan | N/A | |
| Dynamic Dual Trainable Bounds for Ultra-Low Precision Super-Resolution Networks | Yunshan Zhong, Mingbao Lin, Xunchao Li, Ke Li, Yunhang Shen, Fei Chao, Yongjian Wu, Rongrong Ji | N/A | |
| OSFormer: One-Stage Camouflaged Instance Segmentation with Transformers | Jialun Pei, Tianyang Cheng, Deng-Ping Fan, He Tang, Chuanbo Chen, Luc Van Gool | N/A | |
| Highly Accurate Dichotomous Image Segmentation | Xuebin Qin, Hang Dai, Xiaobin Hu, Deng-Ping Fan, Ling Shao, Luc Van Gool | N/A | |
| Boosting Supervised Dehazing Methods via Bi-Level Patch Reweighting | Xingyu Jiang, Hongkun Dou, Chengwei Fu, Bingquan Dai, Tianrun Xu, Yue Deng | N/A | |
| Flow-Guided Transformer for Video Inpainting | Kaidong Zhang, Jingjing Fu, Dong Liu | N/A | |
| Shift-tolerant Perceptual Similarity Metric | Abhijay Ghildyal, Feng Liu | N/A | |
| Perception-Distortion Balanced ADMM Optimization for Single-Image Super-Resolution | Yuehan Zhang, Bo Ji, Jia Hao, Angela Yao | N/A | |
| VQFR: Blind Face Restoration with Vector-Quantized Dictionary and Parallel Decoder | Yuchao Gu, Xintao Wang, Liangbin Xie, Chao Dong, Gen Li, Ying Shan, Ming-Ming Cheng | N/A | |
| Uncertainty Learning in Kernel Estimation for Multi-stage Blind Image Super-Resolution | Zhenxuan Fang, Weisheng Dong, Xin Li, Jinjian Wu, Leida Li, Guangming Shi | N/A | |
| Learning Spatio-Temporal Downsampling for Effective Video Upscaling | Xiaoyu Xiang, Yapeng Tian, Vijay Rengarajan, Lucas D. Young, Bo Zhu, Rakesh Ranjan | N/A | |
| Learning Local Implicit Fourier Representation for Image Warping | Jaewon Lee, Kwang Pyo Choi, Kyong Hwan Jin | N/A | |
| SepLUT: Separable Image-Adaptive Lookup Tables for Real-Time Image Enhancement | Canqian Yang, Meiguang Jin, Yi Xu, Rui Zhang, Ying Chen, Huaida Liu | N/A | |
| Blind Image Decomposition | Junlin Han, Weihao Li, Pengfei Fang, Chunyi Sun, Jie Hong, Mohammad Ali Armin, Lars Petersson, Hongdong Li | N/A | |
| MuLUT: Cooperating Multiple Look-Up Tables for Efficient Image Super-Resolution | Jiacheng Li, Chang Chen, Zhen Cheng, Zhiwei Xiong | N/A | |
| Learning Spatiotemporal Frequency-Transformer for Compressed Video Super-Resolution | Zhongwei Qiu, Huan Yang, Jianlong Fu, Dongmei Fu | N/A | |
| Spatial-Frequency Domain Information Integration for Pan-Sharpening | Man Zhou, Jie Huang, Keyu Yan, Hu Yu, Xueyang Fu, Aiping Liu, Xian Wei, Feng Zhao | N/A | |
| Adaptive Patch Exiting for Scalable Single Image Super-Resolution | Shizun Wang, Jiaming Liu, Kaixin Chen, Xiaoqi Li, Ming Lu, Yandong Guo | N/A | |
| Efficient Meta-Tuning for Content-Aware Neural Video Delivery | Xiaoqi Li, Jiaming Liu, Shizun Wang, Cheng Lyu, Ming Lu, Yurong Chen, Anbang Yao, Yandong Guo, Shanghang Zhang | N/A | |
| Reference-Based Image Super-Resolution with Deformable Attention Transformer | Jiezhang Cao, Jingyun Liang, Kai Zhang, Yawei Li, Yulun Zhang, Wenguan Wang, Luc Van Gool | N/A | |
| Local Color Distributions Prior for Image Enhancement | Haoyuan Wang, Ke Xu, Rynson W.H. Lau | N/A | |
| L-CoDer: Language-Based Colorization with Color-Object Decoupling Transformer | Zheng Chang, Shuchen Weng, Yu Li, Si Li, Boxin Shi | N/A | |
| From Face to Natural Image: Learning Real Degradation for Blind Image Super-Resolution | Xiaoming Li, Chaofeng Chen, Xianhui Lin, Wangmeng Zuo, Lei Zhang | N/A | |
| Towards Interpretable Video Super-Resolution via Alternating Optimization | Jiezhang Cao, Jingyun Liang, Kai Zhang, Wenguan Wang, Qin Wang, Yulun Zhang, Hao Tang, Luc Van Gool | N/A | |
| Event-Based Fusion for Motion Deblurring with Cross-Modal Attention | Lei Sun, Christos Sakaridis, Jingyun Liang, Qi Jiang, Kailun Yang, Peng Sun, Yaozu Ye, Kaiwei Wang, Luc Van Gool | N/A | |
| Fast and High Quality Image Denoising via Malleable Convolution | Yifan Jiang, Bartlomiej Wronski, Ben Mildenhall, Jonathan T. Barron, Zhangyang Wang, Tianfan Xue | N/A | |
| TAPE: Task-Agnostic Prior Embedding for Image Restoration | Lin Liu, Lingxi Xie, Xiaopeng Zhang, Shanxin Yuan, Xiangyu Chen, Wengang Zhou, Houqiang Li, Qi Tian | N/A | |
| Uncertainty Inspired Underwater Image Enhancement | Zhenqi Fu, Wu Wang, Yue Huang, Xinghao Ding, Kai-Kuang Ma | N/A | |
| Hourglass Attention Network for Image Inpainting | Ye Deng, Siqi Hui, Rongye Meng, Sanping Zhou, Jinjun Wang | N/A | |
| Unfolded Deep Kernel Estimation for Blind Image Super-Resolution | Hongyi Zheng, Hongwei Yong, Lei Zhang | N/A | |
| Event-Guided Deblurring of Unknown Exposure Time Videos | Taewoo Kim, Jeongmin Lee, Lin Wang, Kuk-Jin Yoon | N/A | |
| ReCoNet: Recurrent Correction Network for Fast and Efficient Multi-Modality Image Fusion | Zhanbo Huang, Jinyuan Liu, Xin Fan, Risheng Liu, Wei Zhong, Zhongxuan Luo | N/A | |
| Content Adaptive Latents and Decoder for Neural Image Compression | Guanbo Pan, Guo Lu, Zhihao Hu, Dong Xu | N/A | |
| Efficient and Degradation-Adaptive Network for Real-World Image Super-Resolution | Jie Liang, Hui Zeng, Lei Zhang | N/A | |
| Unidirectional Video Denoising by Mimicking Backward Recurrent Modules with Look-Ahead Forward Ones | Junyi Li, Xiaohe Wu, Zhenxing Niu, Wangmeng Zuo | N/A | |
| Self-Supervised Learning for Real-World Super-Resolution from Dual Zoomed Observations | Zhilu Zhang, Ruohao Wang, Hongzhi Zhang, Yunjin Chen, Wangmeng Zuo | N/A | |
| Secrets of Event-Based Optical Flow | Shintaro Shiba, Yoshimitsu Aoki, Guillermo Gallego | N/A | |
| Towards Efficient and Scale-Robust Ultra-High-Definition Image Demoiréing | Xin Yu, Peng Dai, Wenbo Li, Lan Ma, Jiajun Shen, Jia Li, Xiaojuan Qi | N/A | |
| ERDN: Equivalent Receptive Field Deformable Network for Video Deblurring | Bangrui Jiang, Zhihuai Xie, Zhen Xia, Songnan Li, Shan Liu | N/A | |
| Rethinking Generic Camera Models for Deep Single Image Camera Calibration to Recover Rotation and Fisheye Distortion | Nobuhiko Wakai, Satoshi Sato, Yasunori Ishii, Takayoshi Yamashita | N/A | |
| ART-SS: An Adaptive Rejection Technique for Semi-Supervised Restoration for Adverse Weather-Affected Images | Rajeev Yasarla, Carey E. Priebe, Vishal M. Patel | N/A | |
| Fusion from Decomposition: A Self-Supervised Decomposition Approach for Image Fusion | Pengwei Liang, Junjun Jiang, Xianming Liu, Jiayi Ma | N/A | |
| Learning Degradation Representations for Image Deblurring | Dasong Li, Yi Zhang, Ka Chun Cheung, Xiaogang Wang, Hongwei Qin, Hongsheng Li | N/A | |
| Learning Mutual Modulation for Self-Supervised Cross-Modal Super-Resolution | Xiaoyu Dong, Naoto Yokoya, Longguang Wang, Tatsumi Uezato | N/A | |
| Spectrum-Aware and Transferable Architecture Search for Hyperspectral Image Restoration | Wei He, Quanming Yao, Naoto Yokoya, Tatsumi Uezato, Hongyan Zhang, Liangpei Zhang | N/A | |
| Neural Color Operators for Sequential Image Retouching | Yili Wang, Xin Li, Kun Xu, Dongliang He, Qi Zhang, Fu Li, Errui Ding | N/A | |
| Optimizing Image Compression via Joint Learning with Denoising | Ka Leong Cheng, Yueqi Xie, Qifeng Chen | N/A | |
| Restore Globally, Refine Locally: A Mask-Guided Scheme to Accelerate Super-Resolution Networks | Xiaotao Hu, Jun Xu, Shuhang Gu, Ming-Ming Cheng, Li Liu | N/A | |
| Compiler-Aware Neural Architecture Search for On-Mobile Real-Time Super-Resolution | Yushu Wu, Yifan Gong, Pu Zhao, Yanyu Li, Zheng Zhan, Wei Niu, Hao Tang, Minghai Qin, Bin Ren, Yanzhi Wang | N/A | |
| Modeling Mask Uncertainty in Hyperspectral Image Reconstruction | Jiamian Wang, Yulun Zhang, Xin Yuan, Ziyi Meng, Zhiqiang Tao | N/A | |
| Perceiving and Modeling Density for Image Dehazing | Tian Ye, Yunchen Zhang, Mingchao Jiang, Liang Chen, Yun Liu, Sixiang Chen, Erkang Chen | N/A | |
| Stripformer: Strip Transformer for Fast Image Deblurring | Fu-Jen Tsai, Yan-Tsung Peng, Yen-Yu Lin, Chung-Chi Tsai, Chia-Wen Lin | N/A | |
| Deep Fourier-Based Exposure Correction Network with Spatial-Frequency Interaction | Jie Huang, Yajing Liu, Feng Zhao, Keyu Yan, Jinghao Zhang, Yukun Huang, Man Zhou, Zhiwei Xiong | N/A | |
| Frequency and Spatial Dual Guidance for Image Dehazing | Hu Yu, Naishan Zheng, Man Zhou, Jie Huang, Zeyu Xiao, Feng Zhao | N/A | |
| Towards Real-World HDRTV Reconstruction: A Data Synthesis-Based Approach | Zhen Cheng, Tao Wang, Yong Li, Fenglong Song, Chang Chen, Zhiwei Xiong | N/A | |
| Learning Discriminative Shrinkage Deep Networks for Image Deconvolution | Pin-Hung Kuo, Jinshan Pan, Shao-Yi Chien, Ming-Hsuan Yang | N/A | |
| KXNet: A Model-Driven Deep Neural Network for Blind Super-Resolution | Jiahong Fu, Hong Wang, Qi Xie, Qian Zhao, Deyu Meng, Zongben Xu | N/A | |
| ARM: Any-Time Super-Resolution Method | Bohong Chen, Mingbao Lin, Kekai Sheng, Mengdan Zhang, Peixian Chen, Ke Li, Liujuan Cao, Rongrong Ji | N/A | |
| Attention-Aware Learning for Hyperparameter Prediction in Image Processing Pipelines | Haina Qin, Longfei Han, Juan Wang, Congxuan Zhang, Yanwei Li, Bing Li, Weiming Hu | N/A | |
| RealFlow: EM-Based Realistic Optical Flow Dataset Generation from Videos | Yunhui Han, Kunming Luo, Ao Luo, Jiangyu Liu, Haoqiang Fan, Guiming Luo, Shuaicheng Liu | N/A | |
| Memory-Augmented Model-Driven Network for Pansharpening | Keyu Yan, Man Zhou, Li Zhang, Chengjun Xie | N/A | |
| All You Need Is RAW: Defending against Adversarial Attacks with Camera Image Pipelines | Yuxuan Zhang, Bo Dong, Felix Heide | N/A | |
| Ghost-Free High Dynamic Range Imaging with Context-Aware Transformer | Zhen Liu, Yinglong Wang, Bing Zeng, Shuaicheng Liu | N/A | |
| Style-Guided Shadow Removal | Jin Wan, Hui Yin, Zhenyao Wu, Xinyi Wu, Yanting Liu, Song Wang | N/A | |
| D2C-SR: A Divergence to Convergence Approach for Real-World Image Super-Resolution | Youwei Li, Haibin Huang, Lanpeng Jia, Haoqiang Fan, Shuaicheng Liu | N/A | |
| GRIT-VLP: Grouped Mini-Batch Sampling for Efficient Vision and Language Pre-training | Jaeseok Byun, Taebaek Hwang, Jianlong Fu, Taesup Moon | N/A | |
| Efficient Video Deblurring Guided by Motion Magnitude | Yusheng Wang, Yunfan Lu, Ye Gao, Lin Wang, Zhihang Zhong, Yinqiang Zheng, Atsushi Yamashita | N/A | |
| Single Frame Atmospheric Turbulence Mitigation: A Benchmark Study and a New Physics-Inspired Transformer Model | Zhiyuan Mao, Ajay Jaiswal, Zhangyang Wang, Stanley H. Chan | N/A | |
| Contextformer: A Transformer with Spatio-Channel Attention for Context Modeling in Learned Image Compression | A. Burakhan Koyuncu, Han Gao, Atanas Boev, Georgii Gaikov, Elena Alshina, Eckehard Steinbach | N/A | |
| Image Super-Resolution with Deep Dictionary | Shunta Maeda | N/A | |
| TempFormer: Temporally Consistent Transformer for Video Denoising | Mingyang Song, Yang Zhang, Tunç O. Aydın | N/A | |
| RAWtoBit: A Fully End-to-End Camera ISP Network | Wooseok Jeong, Seung-Won Jung | N/A | |
| DRCNet: Dynamic Image Restoration Contrastive Network | Fei Li, Lingfeng Shen, Yang Mi, Zhenbo Li | N/A | |
| Zero-Shot Learning for Reflection Removal of Single 360-Degree Image | Byeong-Ju Han, Jae-Young Sim | N/A | |
| Transformer with Implicit Edges for Particle-Based Physics Simulation | Yidi Shao, Chen Change Loy, Bo Dai | N/A | |
| Rethinking Video Rain Streak Removal: A New Synthesis Model and a Deraining Network with Video Rain Prior | Shuai Wang, Lei Zhu, Huazhu Fu, Jing Qin, Carola-Bibiane Schönlieb, Wei Feng, Song Wang | N/A | |
| Super-Resolution by Predicting Offsets: An Ultra-Efficient Super-Resolution Network for Rasterized Images | Jinjin Gu, Haoming Cai, Chenyu Dong, Ruofan Zhang, Yulun Zhang, Wenming Yang, Chun Yuan | N/A | |
| Animation from Blur: Multi-modal Blur Decomposition with Motion Guidance | Zhihang Zhong, Xiao Sun, Zhirong Wu, Yinqiang Zheng, Stephen Lin, Imari Sato | N/A | |
| AlphaVC: High-Performance and Efficient Learned Video Compression | Yibo Shi, Yunying Ge, Jing Wang, Jue Mao | N/A | |
| Content-Oriented Learned Image Compression | Meng Li, Shangyin Gao, Yihui Feng, Yibo Shi, Jing Wang | N/A | |
| RRSR: Reciprocal Reference-Based Image Super-Resolution with Progressive Feature Alignment and Selection | Lin Zhang, Xin Li, Dongliang He, Fu Li, Yili Wang, Zhaoxiang Zhang | N/A | |
| Contrastive Prototypical Network with Wasserstein Confidence Penalty | Haoqing Wang, Zhi-Hong Deng | N/A | |
| Learn-to-Decompose: Cascaded Decomposition Network for Cross-Domain Few-Shot Facial Expression Recognition | Xinyi Zou, Yan Yan, Jing-Hao Xue, Si Chen, Hanzi Wang | N/A | |
| Self-Support Few-Shot Semantic Segmentation | Qi Fan, Wenjie Pei, Yu-Wing Tai, Chi-Keung Tang | N/A | |
| Few-Shot Object Detection with Model Calibration | Qi Fan, Chi-Keung Tang, Yu-Wing Tai | N/A | |
| Self-Supervision Can Be a Good Few-Shot Learner | Yuning Lu, Liangjian Wen, Jianzhuang Liu, Yajing Liu, Xinmei Tian | N/A | |
| tSF: Transformer-Based Semantic Filter for Few-Shot Learning | Jinxiang Lai, Siqian Yang, Wenlong Liu, Yi Zeng, Zhongyi Huang, Wenlong Wu, Jun Liu, Bin-Bin Gao, Chengjie Wang | N/A | |
| Adversarial Feature Augmentation for Cross-Domain Few-Shot Classification | Yanxu Hu, Andy J. Ma | N/A | |
| Constructing Balance from Imbalance for Long-Tailed Image Recognition | Yue Xu, Yong-Lu Li, Jiefeng Li, Cewu Lu | N/A | |
| On Multi-Domain Long-Tailed Recognition, Imbalanced Domain Generalization and Beyond | Yuzhe Yang, Hao Wang, Dina Katabi | N/A | |
| Few-Shot Video Object Detection | Qi Fan, Chi-Keung Tang, Yu-Wing Tai | N/A | |
| Worst Case Matters for Few-Shot Recognition | Minghao Fu, Yun-Hao Cao, Jianxin Wu | N/A | |
| Exploring Hierarchical Graph Representation for Large-Scale Zero-Shot Image Classification | Kai Yi, Xiaoqian Shen, Yunhao Gou, Mohamed Elhoseiny | N/A | |
| Doubly Deformable Aggregation of Covariance Matrices for Few-Shot Segmentation | Zhitong Xiong, Haopeng Li, Xiao Xiang Zhu | N/A | |
| Dense Cross-Query-and-Support Attention Weighted Mask Aggregation for Few-Shot Segmentation | Xinyu Shi, Dong Wei, Yu Zhang, Donghuan Lu, Munan Ning, Jiashun Chen, Kai Ma, Yefeng Zheng | N/A | |
| Rethinking Clustering-Based Pseudo-Labeling for Unsupervised Meta-Learning | Xingping Dong, Jianbing Shen, Ling Shao | N/A | |
| CLASTER: Clustering with Reinforcement Learning for Zero-Shot Action Recognition | Shreyank N Gowda, Laura Sevilla-Lara, Frank Keller, Marcus Rohrbach | N/A | |
| Few-Shot Class-Incremental Learning for 3D Point Cloud Objects | Townim Chowdhury, Ali Cheraghian, Sameera Ramasinghe, Sahar Ahmadi, Morteza Saberi, Shafin Rahman | N/A | |
| Meta-Learning with Less Forgetting on Large-Scale Non-stationary Task Distributions | Zhenyi Wang, Li Shen, Le Fang, Qiuling Suo, Donglin Zhan, Tiehang Duan, Mingchen Gao | N/A | |
| DNA: Improving Few-Shot Transfer Learning with Low-Rank Decomposition and Alignment | Ziyu Jiang, Tianlong Chen, Xuxi Chen, Yu Cheng, Luowei Zhou, Lu Yuan, Ahmed Awadallah, Zhangyang Wang | N/A | |
| Learning Instance and Task-Aware Dynamic Kernels for Few-Shot Learning | Rongkai Ma, Pengfei Fang, Gil Avraham, Yan Zuo, Tianyu Zhu, Tom Drummond, Mehrtash Harandi | N/A | |
| Open-World Semantic Segmentation via Contrasting and Clustering Vision-Language Embedding | Quande Liu, Youpeng Wen, Jianhua Han, Chunjing Xu, Hang Xu, Xiaodan Liang | N/A | |
| Few-Shot Classification with Contrastive Learning | Zhanyuan Yang, Jinghua Wang, Yingying Zhu | N/A | |
| Time-rEversed diffusioN tEnsor Transformer: A New TENET of Few-Shot Object Detection | Shan Zhang, Naila Murray, Lei Wang, Piotr Koniusz | N/A | |
| Self-Promoted Supervision for Few-Shot Transformer | Bowen Dong, Pan Zhou, Shuicheng Yan, Wangmeng Zuo | N/A | |
| Few-Shot Object Counting and Detection | Thanh Nguyen, Chau Pham, Khoi Nguyen, Minh Hoai | N/A | |
| Rethinking Few-Shot Object Detection on a Multi-Domain Benchmark | Kibok Lee, Hao Yang, Satyaki Chakraborty, Zhaowei Cai, Gurumurthy Swaminathan, Avinash Ravichandran, Onkar Dabeer | N/A | |
| Cross-Domain Cross-Set Few-Shot Learning via Learning Compact and Aligned Representations | Wentao Chen, Zhang Zhang, Wei Wang, Liang Wang, Zilei Wang, Tieniu Tan | N/A | |
| Mutually Reinforcing Structure with Proposal Contrastive Consistency for Few-Shot Object Detection | Tianxue Ma, Mingwei Bi, Jian Zhang, Wang Yuan, Zhizhong Zhang, Yuan Xie, Shouhong Ding, Lizhuang Ma | N/A | |
| Dual Contrastive Learning with Anatomical Auxiliary Supervision for Few-Shot Medical Image Segmentation | Huisi Wu, Fangyan Xiao, Chongxin Liang | N/A | |
| Improving Few-Shot Learning through Multi-task Representation Learning Theory | Quentin Bouniot, Ievgen Redko, Romaric Audigier, Angélique Loesch, Amaury Habrard | N/A | |
| Tree Structure-Aware Few-Shot Image Classification via Hierarchical Aggregation | Min Zhang, Siteng Huang, Wenbin Li, Donglin Wang | N/A | |
| Inductive and Transductive Few-Shot Video Classification via Appearance and Temporal Alignments | Khoi D. Nguyen, Quoc-Huy Tran, Khoi Nguyen, Binh-Son Hua, Rang Nguyen | N/A | |
| Temporal and Cross-Modal Attention for Audio-Visual Zero-Shot Learning | Otniel-Bogdan Mercea, Thomas Hummel, A. Sophia Koepke, Zeynep Akata | N/A | |
| HM: Hybrid Masking for Few-Shot Segmentation | Seonghyeon Moon, Samuel S. Sohn, Honglu Zhou, Sejong Yoon, Vladimir Pavlovic, Muhammad Haris Khan, Mubbasir Kapadia | N/A | |
| TransVLAD: Focusing on Locally Aggregated Descriptors for Few-Shot Learning | Haoquan Li, Laoming Zhang, Daoan Zhang, Lang Fu, Peng Yang, Jianguo Zhang | N/A | |
| Kernel Relative-Prototype Spectral Filtering for Few-Shot Learning | Tao Zhang, Wu Huang | N/A | |
| “This Is My Unicorn, Fluffy”: Personalizing Frozen Vision-Language Representations | Niv Cohen, Rinon Gal, Eli A. Meirom, Gal Chechik, Yuval Atzmon | N/A | |
| CLOSE: Curriculum Learning on the Sharing Extent towards Better One-Shot NAS | Zixuan Zhou, Xuefei Ning, Yi Cai, Jiashu Han, Yiping Deng, Yuhan Dong, Huazhong Yang, Yu Wang | N/A | |
| Streamable Neural Fields | Junwoo Cho, Seungtae Nam, Daniel Rho, Jong Hwan Ko, Eunbyung Park | N/A | |
| Gradient-Based Uncertainty for Monocular Depth Estimation | Julia Hornauer, Vasileios Belagiannis | N/A | |
| Online Continual Learning with Contrastive Vision Transformer | Zhen Wang, Liu Liu, Yajing Kong, Jiaxian Guo, Dacheng Tao | N/A | |
| CPrune: Compiler-Informed Model Pruning for Efficient Target-Aware DNN Execution | Taeho Kim, Yongin Kwon, Jemin Lee, Taeho Kim, Sangtae Ha | N/A | |
| EAutoDet: Efficient Architecture Search for Object Detection | Xiaoxing Wang, Jiale Lin, Juanping Zhao, Xiaokang Yang, Junchi Yan | N/A | |
| A Max-Flow Based Approach for Neural Architecture Search | Chao Xue, Xiaoxing Wang, Junchi Yan, Chun-Guang Li | N/A | |
| OccamNets: Mitigating Dataset Bias by Favoring Simpler Hypotheses | Robik Shrestha, Kushal Kafle, Christopher Kanan | N/A | |
| ERA: Enhanced Rational Activations | Martin Trimmel, Mihai Zanfir, Richard Hartley, Cristian Sminchisescu | N/A | |
| Convolutional Embedding Makes Hierarchical Vision Transformer Stronger | Cong Wang, Hongmin Xu, Xiong Zhang, Li Wang, Zhitong Zheng, Haifeng Liu | N/A | |
| Active Label Correction Using Robust Parameter Update and Entropy Propagation | Kwang In Kim | N/A | |
| Unpaired Image Translation via Vector Symbolic Architectures | Justin Theiss, Jay Leverett, Daeil Kim, Aayush Prakash | N/A | |
| UniNet: Unified Architecture Search with Convolution, Transformer, and MLP | Jihao Liu, Xin Huang, Guanglu Song, Hongsheng Li, Yu Liu | N/A | |
| AMixer: Adaptive Weight Mixing for Self-Attention Free Vision Transformers | Yongming Rao, Wenliang Zhao, Jie Zhou, Jiwen Lu | N/A | |
| TinyViT: Fast Pretraining Distillation for Small Vision Transformers | Kan Wu, Jinnian Zhang, Houwen Peng, Mengchen Liu, Bin Xiao, Jianlong Fu, Lu Yuan | N/A | |
| Equivariant Hypergraph Neural Networks | Jinwoo Kim, Saeyoon Oh, Sungjun Cho, Seunghoon Hong | N/A | |
| ScaleNet: Searching for the Model to Scale | Jiyang Xie, Xiu Su, Shan You, Zhanyu Ma, Fei Wang, Chen Qian | N/A | |
| Complementing Brightness Constancy with Deep Networks for Optical Flow Prediction | Vincent Le Guen, Clément Rambour, Nicolas Thome | N/A | |
| ViTAS: Vision Transformer Architecture Search | Xiu Su, Shan You, Jiyang Xie, Mingkai Zheng, Fei Wang, Chen Qian, Changshui Zhang, Xiaogang Wang, Chang Xu | N/A | |
| LidarNAS: Unifying and Searching Neural Architectures for 3D Point Clouds | Chenxi Liu, Zhaoqi Leng, Pei Sun, Shuyang Cheng, Charles R. Qi, Yin Zhou, Mingxing Tan, Dragomir Anguelov | N/A | |
| Uncertainty-DTW for Time Series and Sequences | Lei Wang, Piotr Koniusz | N/A | |
| Black-Box Few-Shot Knowledge Distillation | Dang Nguyen, Sunil Gupta, Kien Do, Svetha Venkatesh | N/A | |
| Revisiting Batch Norm Initialization | Jim Davis, Logan Frank | N/A | |
| SSBNet: Improving Visual Recognition Efficiency by Adaptive Sampling | Ho Man Kwan, Shenghui Song | N/A | |
| Filter Pruning via Feature Discrimination in Deep Neural Networks | Zhiqiang He, Yaguan Qian, Yuqi Wang, Bin Wang, Xiaohui Guan, Zhaoquan Gu, Xiang Ling, Shaoning Zeng, Haijiang Wang, Wujie Zhou | N/A | |
| LA3: Efficient Label-Aware AutoAugment | Mingjun Zhao, Shan Lu, Zixuan Wang, Xiaoli Wang, Di Niu | N/A | |
| Interpretations Steered Network Pruning via Amortized Inferred Saliency Maps | Alireza Ganjdanesh, Shangqian Gao, Heng Huang | N/A | |
| BA-Net: Bridge Attention for Deep Convolutional Neural Networks | Yue Zhao, Junzhou Chen, Zirui Zhang, Ronghui Zhang | N/A | |
| SAU: Smooth Activation Function Using Convolution with Approximate Identities | Koushik Biswas, Sandeep Kumar, Shilpak Banerjee, Ashish Kumar Pandey | N/A | |
| Multi-Exit Semantic Segmentation Networks | Alexandros Kouris, Stylianos I. Venieris, Stefanos Laskaridis, Nicholas Lane | N/A | |
| Almost-Orthogonal Layers for Efficient General-Purpose Lipschitz Networks | Bernd Prach, Christoph H. Lampert | N/A | |
| PointScatter: Point Set Representation for Tubular Structure Extraction | Dong Wang, Zhao Zhang, Ziwei Zhao, Yuhang Liu, Yihong Chen, Liwei Wang | N/A | |
| Check and Link: Pairwise Lesion Correspondence Guides Mammogram Mass Detection | Ziwei Zhao, Dong Wang, Yihong Chen, Ziteng Wang, Liwei Wang | N/A | |
| Graph-Constrained Contrastive Regularization for Semi-Weakly Volumetric Segmentation | Simon Reiß, Constantin Seibold, Alexander Freytag, Erik Rodner, Rainer Stiefelhagen | N/A | |
| Generalizable Medical Image Segmentation via Random Amplitude Mixup and Domain-Specific Image Restoration | Ziqi Zhou, Lei Qi, Yinghuan Shi | N/A | |
| Auto-FedRL: Federated Hyperparameter Optimization for Multi-Institutional Medical Image Segmentation | Pengfei Guo, Dong Yang, Ali Hatamizadeh, An Xu, Ziyue Xu, Wenqi Li, Can Zhao, Daguang Xu, Stephanie Harmon, Evrim Turkbey, Baris Turkbey, Bradford Wood, Francesca Patella, Elvira Stellato, Gianpaolo Carrafiello, Vishal M. Patel, Holger R. Roth | N/A | |
| Personalizing Federated Medical Image Segmentation via Local Calibration | Jiacheng Wang, Yueming Jin, Liansheng Wang | N/A | |
| One-Shot Medical Landmark Localization by Edge-Guided Transform and Noisy Landmark Refinement | Zihao Yin, Ping Gong, Chunyu Wang, Yizhou Yu, Yizhou Wang | N/A | |
| Ultra-High-Resolution Unpaired Stain Transformation via Kernelized Instance Normalization | Ming-Yang Ho, Min-Sheng Wu, Che-Ming Wu | N/A | |
| Med-DANet: Dynamic Architecture Network for Efficient Medical Volumetric Segmentation | Wenxuan Wang, Chen Chen, Jing Wang, Sen Zha, Yan Zhang, Jiangyun Li | N/A | |
| ConCL: Concept Contrastive Learning for Dense Prediction Pre-training in Pathology Images | Jiawei Yang, Hanbo Chen, Yuan Liang, Junzhou Huang, Lei He, Jianhua Yao | N/A | |
| CryoAI: Amortized Inference of Poses for Ab Initio Reconstruction of 3D Molecular Volumes from Real Cryo-EM Images | Axel Levy, Frédéric Poitevin, Julien Martel, Youssef Nashed, Ariana Peck, Nina Miolane, Daniel Ratner, Mike Dunne, Gordon Wetzstein | N/A | |
| UniMiSS: Universal Medical Self-Supervised Learning via Breaking Dimensionality Barrier | Yutong Xie, Jianpeng Zhang, Yong Xia, Qi Wu | N/A | |
| DLME: Deep Local-Flatness Manifold Embedding | Zelin Zang, Siyuan Li, Di Wu, Ge Wang, Kai Wang, Lei Shang, Baigui Sun, Hao Li, Stan Z. Li | N/A | |
| Semi-Supervised Keypoint Detector and Descriptor for Retinal Image Matching | Jiazhen Liu, Xirong Li, Qijie Wei, Jie Xu, Dayong Ding | N/A | |
| Graph Neural Network for Cell Tracking in Microscopy Videos | Tal Ben-Haim, Tammy Riklin Raviv | N/A | |
| CXR Segmentation by AdaIN-Based Domain Adaptation and Knowledge Distillation | Yujin Oh, Jong Chul Ye | N/A | |
| Accurate Detection of Proteins in Cryo-Electron Tomograms from Sparse Labels | Qinwen Huang, Ye Zhou, Hsuan-Fu Liu, Alberto Bartesaghi | N/A | |
| K-SALSA: K-Anonymous Synthetic Averaging of Retinal Images via Local Style Alignment | Minkyu Jeon, Hyeonjin Park, Hyunwoo J. Kim, Michael Morley, Hyunghoon Cho | N/A | |
| RadioTransformer: A Cascaded Global-Focal Transformer for Visual Attention-Guided Disease Classification | Moinak Bhattacharya, Shubham Jain, Prateek Prasanna | N/A | |
| Differentiable Zooming for Multiple Instance Learning on Whole-Slide Images | Kevin Thandiackal, Boqi Chen, Pushpak Pati, Guillaume Jaume, Drew F. K. Williamson, Maria Gabrani, Orcun Goksel | N/A | |
| Learning Uncoupled-Modulation CVAE for 3D Action-Conditioned Human Motion Synthesis | Chongyang Zhong, Lei Hu, Zihao Zhang, Shihong Xia | N/A | |
| Towards Grand Unification of Object Tracking | Bin Yan, Yi Jiang, Peize Sun, Dong Wang, Zehuan Yuan, Ping Luo, Huchuan Lu | N/A | |
| ByteTrack: Multi-Object Tracking by Associating Every Detection Box | Yifu Zhang, Peize Sun, Yi Jiang, Dongdong Yu, Fucheng Weng, Zehuan Yuan, Ping Luo, Wenyu Liu, Xinggang Wang | N/A | |
| Robust Multi-Object Tracking by Marginal Inference | Yifu Zhang, Chunyu Wang, Xinggang Wang, Wenjun Zeng, Wenyu Liu | N/A | |
| PolarMOT: How Far Can Geometric Relations Take Us in 3D Multi-Object Tracking? | Aleksandr Kim, Guillem Brasó, Aljoša Ošep, Laura Leal-Taixé | N/A | |
| Particle Video Revisited: Tracking through Occlusions Using Point Trajectories | Adam W. Harley, Zhaoyuan Fang, Katerina Fragkiadaki | N/A | |
| Tracking Objects As Pixel-Wise Distributions | Zelin Zhao, Ze Wu, Yueqing Zhuang, Boxun Li, Jiaya Jia | N/A | |
| CMT: Context-Matching-Guided Transformer for 3D Tracking in Point Clouds | Zhiyang Guo, Yunyao Mao, Wengang Zhou, Min Wang, Houqiang Li | N/A | |
| Towards Generic 3D Tracking in RGBD Videos: Benchmark and Baseline | Jinyu Yang, Zhongqun Zhang, Zhe Li, Hyung Jin Chang, Aleš Leonardis, Feng Zheng | N/A | |
| Hierarchical Latent Structure for Multi-modal Vehicle Trajectory Forecasting | Dooseop Choi, KyoungWook Min | N/A | |
| AiATrack: Attention in Attention for Transformer Visual Tracking | Shenyuan Gao, Chunluan Zhou, Chao Ma, Xinggang Wang, Junsong Yuan | N/A | |
| Disentangling Architecture and Training for Optical Flow | Deqing Sun, Charles Herrmann, Fitsum Reda, Michael Rubinstein, David J. Fleet, William T. Freeman | N/A | |
| A Perturbation-Constrained Adversarial Attack for Evaluating the Robustness of Optical Flow | Jenny Schmalfuss, Philipp Scholze, Andrés Bruhn | N/A | |
| Robust Landmark-Based Stent Tracking in X-Ray Fluoroscopy | Luojie Huang, Yikang Liu, Li Chen, Eric Z. Chen, Xiao Chen, Shanhui Sun | N/A | |
| Social ODE: Multi-agent Trajectory Forecasting with Neural Ordinary Differential Equations | Song Wen, Hao Wang, Dimitris N. Metaxas | N/A | |
| Social-SSL: Self-Supervised Cross-Sequence Representation Learning Based on Transformers for Multi-agent Trajectory Prediction | Li-Wu Tsao, Yan-Kai Wang, Hao-Siang Lin, Hong-Han Shuai, Lai-Kuan Wong, Wen-Huang Cheng | N/A | |
| Diverse Human Motion Prediction Guided by Multi-level Spatial-Temporal Anchors | Sirui Xu, Yu-Xiong Wang, Liang-Yan Gui | N/A | |
| Learning Pedestrian Group Representations for Multi-modal Trajectory Prediction | Inhwan Bae, Jin-Hwi Park, Hae-Gon Jeon | N/A | |
| Sequential Multi-View Fusion Network for Fast LiDAR Point Motion Estimation | Gang Zhang, Xiaoyan Li, Zhenhua Wang | N/A | |
| E-Graph: Minimal Solution for Rigid Rotation with Extensibility Graphs | Yanyan Li, Federico Tombari | N/A | |
| Point Cloud Compression with Range Image-Based Entropy Model for Autonomous Driving | Sukai Wang, Ming Liu | N/A | |
| Joint Feature Learning and Relation Modeling for Tracking: A One-Stream Framework | Botao Ye, Hong Chang, Bingpeng Ma, Shiguang Shan, Xilin Chen | N/A | |
| MotionCLIP: Exposing Human Motion Generation to CLIP Space | Guy Tevet, Brian Gordon, Amir Hertz, Amit H. Bermano, Daniel Cohen-Or | N/A | |
| Backbone Is All Your Need: A Simplified Architecture for Visual Object Tracking | Boyu Chen, Peixia Li, Lei Bai, Lei Qiao, Qiuhong Shen, Bo Li, Weihao Gan, Wei Wu, Wanli Ouyang | N/A | |
| Aware of the History: Trajectory Forecasting with the Local Behavior Data | Yiqi Zhong, Zhenyang Ni, Siheng Chen, Ulrich Neumann | N/A | |
| Optical Flow Training under Limited Label Budget via Active Learning | Shuai Yuan, Xian Sun, Hannah Kim, Shuzhi Yu, Carlo Tomasi | N/A | |
| Hierarchical Feature Embedding for Visual Tracking | Zhixiong Pi, Weitao Wan, Chong Sun, Changxin Gao, Nong Sang, Chen Li | N/A | |
| Tackling Background Distraction in Video Object Segmentation | Suhwan Cho, Heansung Lee, Minhyeok Lee, Chaewon Park, Sungjun Jang, Minjung Kim, Sangyoun Lee | N/A | |
| Social-Implicit: Rethinking Trajectory Prediction Evaluation and the Effectiveness of Implicit Maximum Likelihood Estimation | Abduallah Mohamed, Deyao Zhu, Warren Vu, Mohamed Elhoseiny, Christian Claudel | N/A | |
| TEMOS: Generating Diverse Human Motions from Textual Descriptions | Mathis Petrovich, Michael J. Black, Gül Varol | N/A | |
| Tracking Every Thing in the Wild | Siyuan Li, Martin Danelljan, Henghui Ding, Thomas E. Huang, Fisher Yu | N/A | |
| HULC: 3D HUman Motion Capture with Pose Manifold SampLing and Dense Contact Guidance | Soshi Shimada, Vladislav Golyanik, Zhi Li, Patrick Pérez, Weipeng Xu, Christian Theobalt | N/A | |
| Towards Sequence-Level Training for Visual Tracking | Minji Kim, Seungkwan Lee, Jungseul Ok, Bohyung Han, Minsu Cho | N/A | |
| Learned Monocular Depth Priors in Visual-Inertial Initialization | Yunwen Zhou, Abhishek Kar, Eric Turner, Adarsh Kowdle, Chao X. Guo, Ryan C. DuToit, Konstantine Tsotsos | N/A | |
| Robust Visual Tracking by Segmentation | Matthieu Paul, Martin Danelljan, Christoph Mayer, Luc Van Gool | N/A | |
| MeshLoc: Mesh-Based Visual Localization | Vojtech Panek, Zuzana Kukelova, Torsten Sattler | N/A | |
| S2F2: Single-Stage Flow Forecasting for Future Multiple Trajectories Prediction | Yu-Wen Chen, Hsuan-Kung Yang, Chu-Chi Chiu, Chun-Yi Lee | N/A | |
| Large-Displacement 3D Object Tracking with Hybrid Non-local Optimization | Xuhui Tian, Xinran Lin, Fan Zhong, Xueying Qin | N/A | |
| FEAR: Fast, Efficient, Accurate and Robust Visual Tracker | Vasyl Borsuk, Roman Vei, Orest Kupyn, Tetiana Martyniuk, Igor Krashenyi, Jiři Matas | N/A | |
| PREF: Predictability Regularized Neural Motion Fields | Liangchen Song, Xuan Gong, Benjamin Planche, Meng Zheng, David Doermann, Junsong Yuan, Terrence Chen, Ziyan Wu | N/A | |
| View Vertically: A Hierarchical Network for Trajectory Prediction via Fourier Spectrums | Conghao Wong, Beihao Xia, Ziming Hong, Qinmu Peng, Wei Yuan, Qiong Cao, Yibo Yang, Xinge You | N/A | |
| HVC-Net: Unifying Homography, Visibility, and Confidence Learning for Planar Object Tracking | Haoxian Zhang, Yonggen Ling | N/A | |
| RamGAN: Region Attentive Morphing GAN for Region-Level Makeup Transfer | Jianfeng Xiang, Junliang Chen, Wenshuang Liu, Xianxu Hou, Linlin Shen | N/A | |
| SinNeRF: Training Neural Radiance Fields on Complex Scenes from a Single Image | Dejia Xu, Yifan Jiang, Peihao Wang, Zhiwen Fan, Humphrey Shi, Zhangyang Wang | N/A | |
| Entropy-Driven Sampling and Training Scheme for Conditional Diffusion Generation | Guangcong Zheng, Shengming Li, Hui Wang, Taiping Yao, Yang Chen, Shouhong Ding, Xi Li | N/A | |
| Accelerating Score-Based Generative Models with Preconditioned Diffusion Sampling | Hengyuan Ma, Li Zhang, Xiatian Zhu, Jianfeng Feng | N/A | |
| Learning to Generate Realistic LiDAR Point Clouds | Vlas Zyrianov, Xiyue Zhu, Shenlong Wang | N/A | |
| RFNet-4D: Joint Object Reconstruction and Flow Estimation from 4D Point Clouds | Tuan-Anh Vu, Thanh Nguyen, Binh-Son Hua, Quang-Hieu Pham, Sai-Kit Yeung | N/A | |
| Diverse Image Inpainting with Normalizing Flow | Cairong Wang, Yiming Zhu, Chun Yuan | N/A | |
| Improved Masked Image Generation with Token-Critic | José Lezama, Huiwen Chang, Lu Jiang, Irfan Essa | N/A | |
| TREND: Truncated Generalized Normal Density Estimation of Inception Embeddings for GAN Evaluation | Junghyuk Lee, Jong-Seok Lee | N/A | |
| Exploring Gradient-Based Multi-directional Controls in GANs | Zikun Chen, Ruowei Jiang, Brendan Duke, Han Zhao, Parham Aarabi | N/A | |
| Spatially Invariant Unsupervised 3D Object-Centric Learning and Scene Decomposition | Tianyu Wang, Miaomiao Liu, Kee Siong Ng | N/A | |
| Neural Scene Decoration from a Single Photograph | Hong-Wing Pang, Yingshu Chen, Phuoc-Hieu Le, Binh-Son Hua, Thanh Nguyen, Sai-Kit Yeung | N/A | |
| Outpainting by Queries | Kai Yao, Penglei Gao, Xi Yang, Jie Sun, Rui Zhang, Kaizhu Huang | N/A | |
| Unleashing Transformers: Parallel Token Prediction with Discrete Absorbing Diffusion for Fast High-Resolution Image Generation from Vector-Quantized Codes | Sam Bond-Taylor, Peter Hessey, Hiroshi Sasaki, Toby P. Breckon, Chris G. Willcocks | N/A | |
| ChunkyGAN: Real Image Inversion via Segments | Adéla Šubrtová, David Futschik, Jan Čech, Michal Lukáč, Eli Shechtman, Daniel Sýkora | N/A | |
| GAN Cocktail: Mixing GANs without Dataset Access | Omri Avrahami, Dani Lischinski, Ohad Fried | N/A | |
| Geometry-Guided Progressive NeRF for Generalizable and Efficient Neural Human Rendering | Mingfei Chen, Jianfeng Zhang, Xiangyu Xu, Lijuan Liu, Yujun Cai, Jiashi Feng, Shuicheng Yan | N/A | |
| Controllable Shadow Generation Using Pixel Height Maps | Yichen Sheng, Yifan Liu, Jianming Zhang, Wei Yin, A. Cengiz Oztireli, He Zhang, Zhe Lin, Eli Shechtman, Bedrich Benes | N/A | |
| Learning Where to Look – Generative NAS Is Surprisingly Efficient | Jovita Lukasik, Steffen Jung, Margret Keuper | N/A | |
| Subspace Diffusion Generative Models | Bowen Jing, Gabriele Corso, Renato Berlinghieri, Tommi Jaakkola | N/A | |
| DuelGAN: A Duel between Two Discriminators Stabilizes the GAN Training | Jiaheng Wei, Minghao Liu, Jiahao Luo, Andrew Zhu, James Davis, Yang Liu | N/A | |
| MINER: Multiscale Implicit Neural Representation | Vishwanath Saragadam, Jasper Tan, Guha Balakrishnan, Richard G. Baraniuk, Ashok Veeraraghavan | N/A | |
| An Embedded Feature Whitening Approach to Deep Neural Network Optimization | Hongwei Yong, Lei Zhang | N/A | |
| Q-FW: A Hybrid Classical-Quantum Frank-Wolfe for Quadratic Binary Optimization | Alp Yurtsever, Tolga Birdal, Vladislav Golyanik | N/A | |
| Self-Supervised Learning of Visual Graph Matching | Chang Liu, Shaofeng Zhang, Xiaokang Yang, Junchi Yan | N/A | |
| Scalable Learning to Optimize: A Learned Optimizer Can Train Big Models | Xuxi Chen, Tianlong Chen, Yu Cheng, Weizhu Chen, Ahmed Awadallah, Zhangyang Wang | N/A | |
| QISTA-ImageNet: A Deep Compressive Image Sensing Framework Solving lq-Norm Optimization Problem | Gang-Xuan Lin, Shih-Wei Hu, Chun-Shien Lu | N/A | |
| R-DFCIL: Relation-Guided Representation Learning for Data-Free Class Incremental Learning | Qiankun Gao, Chen Zhao, Bernard Ghanem, Jian Zhang | N/A | |
| Domain Generalization by Mutual-Information Regularization with Pre-trained Models | Junbum Cha, Kyungjae Lee, Sungrae Park, Sanghyuk Chun | N/A | |
| Predicting Is Not Understanding: Recognizing and Addressing Underspecification in Machine Learning | Damien Teney, Maxime Peyrard, Ehsan Abbasnejad | N/A | |
| Neural-Sim: Learning to Generate Training Data with NeRF | Yunhao Ge, Harkirat Behl, Jiashu Xu, Suriya Gunasekar, Neel Joshi, Yale Song, Xin Wang, Laurent Itti, Vibhav Vineet | N/A | |
| Bayesian Optimization with Clustering and Rollback for CNN Auto Pruning | Hanwei Fan, Jiandong Mu, Wei Zhang | N/A | |
| Learned Variational Video Color Propagation | Markus Hofinger, Erich Kobler, Alexander Effland, Thomas Pock | N/A | |
| Continual Variational Autoencoder Learning via Online Cooperative Memorization | Fei Ye, Adrian G. Bors | N/A | |
| Learning to Learn with Smooth Regularization | Yuanhao Xiong, Cho-Jui Hsieh | N/A | |
| Incremental Task Learning with Incremental Rank Updates | Rakib Hyder, Ken Shao, Boyu Hou, Panos Markopoulos, Ashley Prater-Bennette, M. Salman Asif | N/A | |
| Batch-Efficient EigenDecomposition for Small and Medium Matrices | Yue Song, Nicu Sebe, Wei Wang | N/A | |
| Ensemble Learning Priors Driven Deep Unfolding for Scalable Video Snapshot Compressive Imaging | Chengshuai Yang, Shiyu Zhang, Xin Yuan | N/A | |
| Approximate Discrete Optimal Transport Plan with Auxiliary Measure Method | Dongsheng An, Na Lei, Xianfeng Gu | N/A | |
| A Comparative Study of Graph Matching Algorithms in Computer Vision | Stefan Haller, Lorenz Feineis, Lisa Hutschenreiter, Florian Bernard, Carsten Rother, Dagmar Kainmüller, Paul Swoboda, Bogdan Savchynskyy | N/A | |
| Improving Generalization in Federated Learning by Seeking Flat Minima | Debora Caldarola, Barbara Caputo, Marco Ciccone | N/A | |
| Semidefinite Relaxations of Truncated Least-Squares in Robust Rotation Search: Tight or Not | Liangzu Peng, Mahyar Fazlyab, René Vidal | N/A | |
| Transfer without Forgetting | Matteo Boschini, Lorenzo Bonicelli, Angelo Porrello, Giovanni Bellitto, Matteo Pennisi, Simone Palazzo, Concetto Spampinato, Simone Calderara | N/A | |
| AdaBest: Minimizing Client Drift in Federated Learning via Adaptive Bias Estimation | Farshid Varno, Marzie Saghayi, Laya Rafiee Sevyeri, Sharut Gupta, Stan Matwin, Mohammad Havaei | N/A | |
| Tackling Long-Tailed Category Distribution under Domain Shifts | Xiao Gu, Yao Guo, Zeju Li, Jianing Qiu, Qi Dou, Yuxuan Liu, Benny Lo, Guang-Zhong Yang | N/A | |
| Doubly-Fused ViT: Fuse Information from Vision Transformer Doubly with Local Representation | Li Gao, Dong Nie, Bo Li, Xiaofeng Ren | N/A | |
| Improving Vision Transformers by Revisiting High-Frequency Components | Jiawang Bai, Li Yuan, Shu-Tao Xia, Shuicheng Yan, Zhifeng Li, Wei Liu | N/A | |
| Recurrent Bilinear Optimization for Binary Neural Networks | Sheng Xu, Yanjing Li, Tiancheng Wang, Teli Ma, Baochang Zhang, Peng Gao, Yu Qiao, Jinhu Lü, Guodong Guo | N/A | |
| Neural Architecture Search for Spiking Neural Networks | Youngeun Kim, Yuhang Li, Hyoungseob Park, Yeshwanth Venkatesha, Priyadarshini Panda | N/A | |
| Where to Focus: Investigating Hierarchical Attention Relationship for Fine-Grained Visual Classification | Yang Liu, Lei Zhou, Pengcheng Zhang, Xiao Bai, Lin Gu, Xiaohan Yu, Jun Zhou, Edwin R. Hancock | N/A | |
| DaViT: Dual Attention Vision Transformers | Mingyu Ding, Bin Xiao, Noel Codella, Ping Luo, Jingdong Wang, Lu Yuan | N/A | |
| Optimal Transport for Label-Efficient Visible-Infrared Person Re-identification | Jiangming Wang, Zhizhong Zhang, Mingang Chen, Yi Zhang, Cong Wang, Bin Sheng, Yanyun Qu, Yuan Xie | N/A | |
| Locality Guidance for Improving Vision Transformers on Tiny Datasets | Kehan Li, Runyi Yu, Zhennan Wang, Li Yuan, Guoli Song, Jie Chen | N/A | |
| Neighborhood Collective Estimation for Noisy Label Identification and Correction | Jichang Li, Guanbin Li, Feng Liu, Yizhou Yu | N/A | |
| Few-Shot Class-Incremental Learning via Entropy-Regularized Data-Free Replay | Huan Liu, Li Gu, Zhixiang Chi, Yang Wang, Yuanhao Yu, Jun Chen, Jin Tang | N/A | |
| Anti-Retroactive Interference for Lifelong Learning | Runqi Wang, Yuxiang Bao, Baochang Zhang, Jianzhuang Liu, Wentao Zhu, Guodong Guo | N/A | |
| Towards Calibrated Hyper-Sphere Representation via Distribution Overlap Coefficient for Long-Tailed Learning | Hualiang Wang, Siming Fu, Xiaoxuan He, Hangxiang Fang, Zuozhu Liu, Haoji Hu | N/A | |
| Dynamic Metric Learning with Cross-Level Concept Distillation | Wenzhao Zheng, Yuanhui Huang, Borui Zhang, Jie Zhou, Jiwen Lu | N/A | |
| MENet: A Memory-Based Network with Dual-Branch for Efficient Event Stream Processing | Linhui Sun, Yifan Zhang, Ke Cheng, Jian Cheng, Hanqing Lu | N/A | |
| Out-of-Distribution Detection with Boundary Aware Learning | Sen Pei, Xin Zhang, Bin Fan, Gaofeng Meng | N/A | |
| Learning Hierarchy Aware Features for Reducing Mistake Severity | Ashima Garg, Depanshu Sani, Saket Anand | N/A | |
| Learning to Detect Every Thing in an Open World | Kuniaki Saito, Ping Hu, Trevor Darrell, Kate Saenko | N/A | |
| KVT: k-NN Attention for Boosting Vision Transformers | Pichao Wang, Xue Wang, Fan Wang, Ming Lin, Shuning Chang, Hao Li, Rong Jin | N/A | |
| Registration Based Few-Shot Anomaly Detection | Chaoqin Huang, Haoyan Guan, Aofan Jiang, Ya Zhang, Michael Spratling, Yan-Feng Wang | N/A | |
| Improving Robustness by Enhancing Weak Subnets | Yong Guo, David Stutz, Bernt Schiele | N/A | |
| Learning Invariant Visual Representations for Compositional Zero-Shot Learning | Tian Zhang, Kongming Liang, Ruoyi Du, Xian Sun, Zhanyu Ma, Jun Guo | N/A | |
| Improving Covariance Conditioning of the SVD Meta-Layer by Orthogonality | Yue Song, Nicu Sebe, Wei Wang | N/A | |
| Out-of-Distribution Detection with Semantic Mismatch under Masking | Yijun Yang, Ruiyuan Gao, Qiang Xu | N/A | |
| Data-Free Neural Architecture Search via Recursive Label Calibration | Zechun Liu, Zhiqiang Shen, Yun Long, Eric Xing, Kwang-Ting Cheng, Chas Leichner | N/A | |
| Learning from Multiple Annotator Noisy Labels via Sample-Wise Label Fusion | Zhengqi Gao, Fan-Keng Sun, Mingran Yang, Sucheng Ren, Zikai Xiong, Marc Engeler, Antonio Burazer, Linda Wildling, Luca Daniel, Duane S. Boning | N/A | |
| Acknowledging the Unknown for Multi-Label Learning with Single Positive Labels | Donghao Zhou, Pengfei Chen, Qiong Wang, Guangyong Chen, Pheng-Ann Heng | N/A | |
| AutoMix: Unveiling the Power of Mixup for Stronger Classifiers | Zicheng Liu, Siyuan Li, Di Wu, Zihan Liu, Zhiyuan Chen, Lirong Wu, Stan Z. Li | N/A | |
| MaxViT: Multi-axis Vision Transformer | Zhengzhong Tu, Hossein Talebi, Han Zhang, Feng Yang, Peyman Milanfar, Alan Bovik, Yinxiao Li | N/A | |
| ScalableViT: Rethinking the Context-Oriented Generalization of Vision Transformer | Rui Yang, Hailong Ma, Jie Wu, Yansong Tang, Xuefeng Xiao, Min Zheng, Xiu Li | N/A | |
| Three Things Everyone Should Know about Vision Transformers | Hugo Touvron, Matthieu Cord, Alaaeldin El-Nouby, Jakob Verbeek, Hervé Jégou | N/A | |
| DeiT III: Revenge of the ViT | Hugo Touvron, Matthieu Cord, Hervé Jégou | N/A | |
| MixSKD: Self-Knowledge Distillation from Mixup for Image Recognition | Chuanguang Yang, Zhulin An, Helong Zhou, Linhang Cai, Xiang Zhi, Jiwen Wu, Yongjun Xu, Qian Zhang | N/A | |
| Self-Feature Distillation with Uncertainty Modeling for Degraded Image Recognition | Zhou Yang, Weisheng Dong, Xin Li, Jinjian Wu, Leida Li, Guangming Shi | N/A | |
| Novel Class Discovery without Forgetting | K J Joseph, Sujoy Paul, Gaurav Aggarwal, Soma Biswas, Piyush Rai, Kai Han, Vineeth N Balasubramanian | N/A | |
| SAFA: Sample-Adaptive Feature Augmentation for Long-Tailed Image Classification | Yan Hong, Jianfu Zhang, Zhongyi Sun, Ke Yan | N/A | |
| Negative Samples Are at Large: Leveraging Hard-Distance Elastic Loss for Re-identification | Hyungtae Lee, Sungmin Eum, Heesung Kwon | N/A | |
| Discrete-Constrained Regression for Local Counting Models | Haipeng Xiong, Angela Yao | N/A | |
| Breadcrumbs: Adversarial Class-Balanced Sampling for Long-Tailed Recognition | Bo Liu, Haoxiang Li, Hao Kang, Gang Hua, Nuno Vasconcelos | N/A | |
| Chairs Can Be Stood On: Overcoming Object Bias in Human-Object Interaction Detection | Guangzhi Wang, Yangyang Guo, Yongkang Wong, Mohan Kankanhalli | N/A | |
| A Fast Knowledge Distillation Framework for Visual Recognition | Zhiqiang Shen, Eric Xing | N/A | |
| DICE: Leveraging Sparsification for Out-of-Distribution Detection | Yiyou Sun, Yixuan Li | N/A | |
| Invariant Feature Learning for Generalized Long-Tailed Classification | Kaihua Tang, Mingyuan Tao, Jiaxin Qi, Zhenguang Liu, Hanwang Zhang | N/A | |
| Sliced Recursive Transformer | Zhiqiang Shen, Zechun Liu, Eric Xing | N/A | |
| Cross-Domain Ensemble Distillation for Domain Generalization | Kyungmoon Lee, Sungyeon Kim, Suha Kwak | N/A | |
| Centrality and Consistency: Two-Stage Clean Samples Identification for Learning with Instance-Dependent Noisy Labels | Ganlong Zhao, Guanbin Li, Yipeng Qin, Feng Liu, Yizhou Yu | N/A | |
| Hyperspherical Learning in Multi-Label Classification | Bo Ke, Yunquan Zhu, Mengtian Li, Xiujun Shu, Ruizhi Qiao, Bo Ren | N/A | |
| When Active Learning Meets Implicit Semantic Data Augmentation | Zhuangzhuang Chen, Jin Zhang, Pan Wang, Jie Chen, Jianqiang Li | N/A | |
| VL-LTR: Learning Class-Wise Visual-Linguistic Representation for Long-Tailed Visual Recognition | Changyao Tian, Wenhai Wang, Xizhou Zhu, Jifeng Dai, Yu Qiao | N/A | |
| Class Is Invariant to Context and Vice Versa: On Learning Invariance for Out-of-Distribution Generalization | Jiaxin Qi, Kaihua Tang, Qianru Sun, Xian-Sheng Hua, Hanwang Zhang | N/A | |
| Hierarchical Semi-Supervised Contrastive Learning for Contamination-Resistant Anomaly Detection | Gaoang Wang, Yibing Zhan, Xinchao Wang, Mingli Song, Klara Nahrstedt | N/A | |
| Tracking by Associating Clips | Sanghyun Woo, Kwanyong Park, Seoung Wug Oh, In So Kweon, Joon-Young Lee | N/A | |
| RealPatch: A Statistical Matching Framework for Model Patching with Real Samples | Sara Romiti, Christopher Inskip, Viktoriia Sharmanska, Novi Quadrianto | N/A | |
| Background-Insensitive Scene Text Recognition with Text Semantic Segmentation | Liang Zhao, Zhenyao Wu, Xinyi Wu, Greg Wilsbacher, Song Wang | N/A | |
| Semantic Novelty Detection via Relational Reasoning | Francesco Cappio Borlino, Silvia Bucci, Tatiana Tommasi | N/A | |
| Improving Closed and Open-Vocabulary Attribute Prediction Using Transformers | Khoi Pham, Kushal Kafle, Zhe Lin, Zhihong Ding, Scott Cohen, Quan Tran, Abhinav Shrivastava | N/A | |
| Training Vision Transformers with Only 2040 Images | Yun-Hao Cao, Hao Yu, Jianxin Wu | N/A | |
| Bridging Images and Videos: A Simple Learning Framework for Large Vocabulary Video Object Detection | Sanghyun Woo, Kwanyong Park, Seoung Wug Oh, In So Kweon, Joon-Young Lee | N/A | |
| TDAM: Top-Down Attention Module for Contextually Guided Feature Selection in CNNs | Shantanu Jaiswal, Basura Fernando, Cheston Tan | N/A | |
| Automatic Check-Out via Prototype-Based Classifier Learning from Single-Product Exemplars | Hao Chen, Xiu-Shen Wei, Faen Zhang, Yang Shen, Hui Xu, Liang Xiao | N/A | |
| Overcoming Shortcut Learning in a Target Domain by Generalizing Basic Visual Factors from a Source Domain | Piyapat Saranrittichai, Chaithanya Kumar Mummadi, Claudia Blaiotta, Mauricio Munoz, Volker Fischer | N/A | |
| Photo-Realistic Neural Domain Randomization | Sergey Zakharov, Rareș Ambruș, Vitor Guizilini, Wadim Kehl, Adrien Gaidon | N/A | |
| Wave-ViT: Unifying Wavelet and Transformers for Visual Representation Learning | Ting Yao, Yingwei Pan, Yehao Li, Chong-Wah Ngo, Tao Mei | N/A | |
| Tailoring Self-Supervision for Supervised Learning | WonJun Moon, Ji-Hwan Kim, Jae-Pil Heo | N/A | |
| Difficulty-Aware Simulator for Open Set Recognition | WonJun Moon, Junho Park, Hyun Seok Seong, Cheol-Ho Cho, Jae-Pil Heo | N/A | |
| Few-Shot Class-Incremental Learning from an Open-Set Perspective | Can Peng, Kun Zhao, Tianren Wang, Meng Li, Brian C. Lovell | N/A | |
| FOSTER: Feature Boosting and Compression for Class-Incremental Learning | Fu-Yun Wang, Da-Wei Zhou, Han-Jia Ye, De-Chuan Zhan | N/A | |
| Visual Knowledge Tracing | Neehar Kondapaneni, Pietro Perona, Oisin Mac Aodha | N/A | |
| S3C: Self-Supervised Stochastic Classifiers for Few-Shot Class-Incremental Learning | Jayateja Kalla, Soma Biswas | N/A | |
| Improving Fine-Grained Visual Recognition in Low Data Regimes via Self-Boosting Attention Mechanism | Yangyang Shu, Baosheng Yu, Haiming Xu, Lingqiao Liu | N/A | |
| VSA: Learning Varied-Size Window Attention in Vision Transformers | Qiming Zhang, Yufei Xu, Jing Zhang, Dacheng Tao | N/A | |
| Unbiased Manifold Augmentation for Coarse Class Subdivision | Baoming Yan, Ke Gao, Bo Gao, Lin Wang, Jiang Yang, Xiaobo Li | N/A | |
| DenseHybrid: Hybrid Anomaly Detection for Dense Open-Set Recognition | Matej Grcić, Petra Bevandić, Siniša Šegvić | N/A | |
| Rethinking Confidence Calibration for Failure Prediction | Fei Zhu, Zhen Cheng, Xu-Yao Zhang, Cheng-Lin Liu | N/A | |
| Uncertainty-Guided Source-Free Domain Adaptation | Subhankar Roy, Martin Trapp, Andrea Pilzer, Juho Kannala, Nicu Sebe, Elisa Ricci, Arno Solin | N/A | |
| Should All Proposals Be Treated Equally in Object Detection? | Yunsheng Li, Yinpeng Chen, Xiyang Dai, Dongdong Chen, Mengchen Liu, Pei Yu, Ying Jin, Lu Yuan, Zicheng Liu, Nuno Vasconcelos | N/A | |
| VIP: Unified Certified Detection and Recovery for Patch Attack with Vision Transformers | Junbo Li, Huan Zhang, Cihang Xie | N/A | |
| incDFM: Incremental Deep Feature Modeling for Continual Novelty Detection | Amanda Rios, Nilesh Ahuja, Ibrahima Ndiour, Utku Genc, Laurent Itti, Omesh Tickoo | N/A | |
| IGFormer: Interaction Graph Transformer for Skeleton-Based Human Interaction Recognition | Yunsheng Pang, Qiuhong Ke, Hossein Rahmani, James Bailey, Jun Liu | N/A | |
| PRIME: A Few Primitives Can Boost Robustness to Common Corruptions | Apostolos Modas, Rahul Rade, Guillermo Ortiz-Jiménez, Seyed-Mohsen Moosavi-Dezfooli, Pascal Frossard | N/A | |
| Rotation Regularization without Rotation | Takumi Kobayashi | N/A | |
| Towards Accurate Open-Set Recognition via Background-Class Regularization | Wonwoo Cho, Jaegul Choo | N/A | |
| In Defense of Image Pre-training for Spatiotemporal Recognition | Xianhang Li, Huiyu Wang, Chen Wei, Jieru Mei, Alan Yuille, Yuyin Zhou, Cihang Xie | N/A | |
| Augmenting Deep Classifiers with Polynomial Neural Networks | Grigorios G. Chrysos, Markos Georgopoulos, Jiankang Deng, Jean Kossaifi, Yannis Panagakis, Anima Anandkumar | N/A | |
| Learning with Noisy Labels by Efficient Transition Matrix Estimation to Combat Label Miscorrection | Seong Min Kye, Kwanghee Choi, Joonyoung Yi, Buru Chang | N/A | |
| Online Task-Free Continual Learning with Dynamic Sparse Distributed Memory | Julien Pourcel, Ngoc-Son Vu, Robert M. French | N/A | |
| Contrastive Deep Supervision | Linfeng Zhang, Xin Chen, Junbo Zhang, Runpei Dong, Kaisheng Ma | N/A | |
| Discriminability-Transferability Trade-Off: An Information-Theoretic Perspective | Quan Cui, Bingchen Zhao, Zhao-Min Chen, Borui Zhao, Renjie Song, Boyan Zhou, Jiajun Liang, Osamu Yoshie | N/A | |
| LocVTP: Video-Text Pre-training for Temporal Localization | Meng Cao, Tianyu Yang, Junwu Weng, Can Zhang, Jue Wang, Yuexian Zou | N/A | |
| Few-Shot End-to-End Object Detection via Constantly Concentrated Encoding across Heads | Jiawei Ma, Guangxing Han, Shiyuan Huang, Yuncong Yang, Shih-Fu Chang | N/A | |
| Implicit Neural Representations for Image Compression | Yannick Strümpler, Janis Postels, Ren Yang, Luc Van Gool, Federico Tombari | N/A | |
| LiP-Flow: Learning Inference-Time Priors for Codec Avatars via Normalizing Flows in Latent Space | Emre Aksan, Shugao Ma, Akin Caliskan, Stanislav Pidhorskyi, Alexander Richard, Shih-En Wei, Jason Saragih, Otmar Hilliges | N/A | |
| Learning to Drive by Watching YouTube Videos: Action-Conditioned Contrastive Policy Pretraining | Qihang Zhang, Zhenghao Peng, Bolei Zhou | N/A | |
| Learning Ego 3D Representation As Ray Tracing | Jiachen Lu, Zheyuan Zhou, Xiatian Zhu, Hang Xu, Li Zhang | N/A | |
| Static and Dynamic Concepts for Self-Supervised Video Representation Learning | Rui Qian, Shuangrui Ding, Xian Liu, Dahua Lin | N/A | |
| SphereFed: Hyperspherical Federated Learning | Xin Dong, Sai Qian Zhang, Ang Li, H.T. Kung | N/A | |
| Hierarchically Self-Supervised Transformer for Human Skeleton Representation Learning | Yuxiao Chen, Long Zhao, Jianbo Yuan, Yu Tian, Zhaoyang Xia, Shijie Geng, Ligong Han, Dimitris N. Metaxas | N/A | |
| Posterior Refinement on Metric Matrix Improves Generalization Bound in Metric Learning | Mingda Wang, Canqian Yang, Yi Xu | N/A | |
| Balancing Stability and Plasticity through Advanced Null Space in Continual Learning | Yajing Kong, Liu Liu, Zhen Wang, Dacheng Tao | N/A | |
| DisCo: Remedying Self-Supervised Learning on Lightweight Models with Distilled Contrastive Learning | Yuting Gao, Jia-Xin Zhuang, Shaohui Lin, Hao Cheng, Xing Sun, Ke Li, Chunhua Shen | N/A | |
| CoSCL: Cooperation of Small Continual Learners Is Stronger than a Big One | Liyuan Wang, Xingxing Zhang, Qian Li, Jun Zhu, Yi Zhong | N/A | |
| Manifold Adversarial Learning for Cross-Domain 3D Shape Representation | Hao Huang, Cheng Chen, Yi Fang | N/A | |
| Fast-MoCo: Boost Momentum-Based Contrastive Learning with Combinatorial Patches | Yuanzheng Ci, Chen Lin, Lei Bai, Wanli Ouyang | N/A | |
| LoRD: Local 4D Implicit Representation for High-Fidelity Dynamic Human Modeling | Boyan Jiang, Xinlin Ren, Mingsong Dou, Xiangyang Xue, Yanwei Fu, Yinda Zhang | N/A | |
| On the Versatile Uses of Partial Distance Correlation in Deep Learning | Xingjian Zhen, Zihang Meng, Rudrasis Chakraborty, Vikas Singh | N/A | |
| Self-Regulated Feature Learning via Teacher-Free Feature Distillation | Lujun Li | N/A | |
| Balancing between Forgetting and Acquisition in Incremental Subpopulation Learning | Mingfu Liang, Jiahuan Zhou, Wei Wei, Ying Wu | N/A | |
| Counterfactual Intervention Feature Transfer for Visible-Infrared Person Re-identification | Xulin Li, Yan Lu, Bin Liu, Yating Liu, Guojun Yin, Qi Chu, Jinyang Huang, Feng Zhu, Rui Zhao, Nenghai Yu | N/A | |
| DAS: Densely-Anchored Sampling for Deep Metric Learning | Lizhao Liu, Shangxin Huang, Zhuangwei Zhuang, Ran Yang, Mingkui Tan, Yaowei Wang | N/A | |
| Learn from All: Erasing Attention Consistency for Noisy Label Facial Expression Recognition | Yuhang Zhang, Chengrui Wang, Xu Ling, Weihong Deng | N/A | |
| A Non-Isotropic Probabilistic Take On Proxy-Based Deep Metric Learning | Michael Kirchhof, Karsten Roth, Zeynep Akata, Enkelejda Kasneci | N/A | |
| TokenMix: Rethinking Image Mixing for Data Augmentation in Vision Transformers | Jihao Liu, Boxiao Liu, Hang Zhou, Hongsheng Li, Yu Liu | N/A | |
| UFO: Unified Feature Optimization | Teng Xi, Yifan Sun, Deli Yu, Bi Li, Nan Peng, Gang Zhang, Xinyu Zhang, Zhigang Wang, Jinwen Chen, Jian Wang, Lufei Liu, Haocheng Feng, Junyu Han, Jingtuo Liu, Errui Ding, Jingdong Wang | N/A | |
| Sound Localization by Self-Supervised Time Delay Estimation | Ziyang Chen, David F. Fouhey, Andrew Owens | N/A | |
| X-Learner: Learning Cross Sources and Tasks for Universal Visual Representation | Yinan He, Gengshi Huang, Siyu Chen, Jianing Teng, Kun Wang, Zhenfei Yin, Lu Sheng, Ziwei Liu, Yu Qiao, Jing Shao | N/A | |
| SLIP: Self-Supervision Meets Language-Image Pre-training | Norman Mu, Alexander Kirillov, David Wagner, Saining Xie | N/A | |
| Discovering Deformable Keypoint Pyramids | Jianing Qian, Anastasios Panagopoulos, Dinesh Jayaraman | N/A | |
| Neural Video Compression Using GANs for Detail Synthesis and Propagation | Fabian Mentzer, Eirikur Agustsson, Johannes Ballé, David Minnen, Nick Johnston, George Toderici | N/A | |
| A Contrastive Objective for Learning Disentangled Representations | Jonathan Kahana, Yedid Hoshen | N/A | |
| PT4AL: Using Self-Supervised Pretext Tasks for Active Learning | John Seon Keun Yi, Minseok Seo, Jongchan Park, Dong-Geol Choi | N/A | |
| ParC-Net: Position Aware Circular Convolution with Merits from ConvNets and Transformer | Haokui Zhang, Wenze Hu, Xiaoyu Wang | N/A | |
| DualPrompt: Complementary Prompting for Rehearsal-Free Continual Learning | Zifeng Wang, Zizhao Zhang, Sayna Ebrahimi, Ruoxi Sun, Han Zhang, Chen-Yu Lee, Xiaoqi Ren, Guolong Su, Vincent Perot, Jennifer Dy, Tomas Pfister | N/A | |
| Unifying Visual Contrastive Learning for Object Recognition from a Graph Perspective | Shixiang Tang, Feng Zhu, Lei Bai, Rui Zhao, Chenyu Wang, Wanli Ouyang | N/A | |
| Decoupled Contrastive Learning | Chun-Hsiao Yeh, Cheng-Yao Hong, Yen-Chi Hsu, Tyng-Luh Liu, Yubei Chen, Yann LeCun | N/A | |
| Joint Learning of Localized Representations from Medical Images and Reports | Philip Müller, Georgios Kaissis, Congyu Zou, Daniel Rueckert | N/A | |
| The Challenges of Continuous Self-Supervised Learning | Senthil Purushwalkam, Pedro Morgado, Abhinav Gupta | N/A | |
| Conditional Stroke Recovery for Fine-Grained Sketch-Based Image Retrieval | Zhixin Ling, Zhen Xing, Jian Zhou, Xiangdong Zhou | N/A | |
| Identifying Hard Noise in Long-Tailed Sample Distribution | Xuanyu Yi, Kaihua Tang, Xian-Sheng Hua, Joo-Hwee Lim, Hanwang Zhang | N/A | |
| Relative Contrastive Loss for Unsupervised Representation Learning | Shixiang Tang, Feng Zhu, Lei Bai, Rui Zhao, Wanli Ouyang | N/A | |
| Fine-Grained Fashion Representation Learning by Online Deep Clustering | Yang Jiao, Ning Xie, Yan Gao, Chien-chih Wang, Yi Sun | N/A | |
| NashAE: Disentangling Representations through Adversarial Covariance Minimization | Eric Yeats, Frank Liu, David Womble, Hai Li | N/A | |
| A Gyrovector Space Approach for Symmetric Positive Semi-Definite Matrix Learning | Xuan Son Nguyen | N/A | |
| Learning Visual Representation from Modality-Shared Contrastive Language-Image Pre-training | Haoxuan You, Luowei Zhou, Bin Xiao, Noel Codella, Yu Cheng, Ruochen Xu, Shih-Fu Chang, Lu Yuan | N/A | |
| Contrasting Quadratic Assignments for Set-Based Representation Learning | Artem Moskalev, Ivan Sosnovik, Volker Fischer, Arnold Smeulders | N/A | |
| Class-Incremental Learning with Cross-Space Clustering and Controlled Transfer | Arjun Ashok, K J Joseph, Vineeth N Balasubramanian | N/A | |
| Object Discovery and Representation Networks | Olivier J. Hénaff, Skanda Koppula, Evan Shelhamer, Daniel Zoran, Andrew Jaegle, Andrew Zisserman, João Carreira, Relja Arandjelović | N/A | |
| Trading Positional Complexity vs Deepness in Coordinate Networks | Jianqiao Zheng, Sameera Ramasinghe, Xueqian Li, Simon Lucey | N/A | |
| MVDG: A Unified Multi-View Framework for Domain Generalization | Jian Zhang, Lei Qi, Yinghuan Shi, Yang Gao | N/A | |
| Panoptic Scene Graph Generation | Jingkang Yang, Yi Zhe Ang, Zujin Guo, Kaiyang Zhou, Wayne Zhang, Ziwei Liu | N/A | |
| Object-Compositional Neural Implicit Surfaces | Qianyi Wu, Xian Liu, Yuedong Chen, Kejie Li, Chuanxia Zheng, Jianfei Cai, Jianmin Zheng | N/A | |
| RigNet: Repetitive Image Guided Network for Depth Completion | Zhiqiang Yan, Kun Wang, Xiang Li, Zhenyu Zhang, Jun Li, Jian Yang | N/A | |
| FADE: Fusing the Assets of Decoder and Encoder for Task-Agnostic Upsampling | Hao Lu, Wenze Liu, Hongtao Fu, Zhiguo Cao | N/A | |
| LiDAL: Inter-Frame Uncertainty Based Active Learning for 3D LiDAR Semantic Segmentation | Zeyu Hu, Xuyang Bai, Runze Zhang, Xin Wang, Guangyuan Sun, Hongbo Fu, Chiew-Lan Tai | N/A | |
| Hierarchical Memory Learning for Fine-Grained Scene Graph Generation | Youming Deng, Yansheng Li, Yongjun Zhang, Xiang Xiang, Jian Wang, Jingdong Chen, Jiayi Ma | N/A | |
| DODA: Data-Oriented Sim-to-Real Domain Adaptation for 3D Semantic Segmentation | Runyu Ding, Jihan Yang, Li Jiang, Xiaojuan Qi | N/A | |
| MTFormer: Multi-task Learning via Transformer and Cross-Task Reasoning | Xiaogang Xu, Hengshuang Zhao, Vibhav Vineet, Ser-Nam Lim, Antonio Torralba | N/A | |
| MonoPLFlowNet: Permutohedral Lattice FlowNet for Real-Scale 3D Scene Flow Estimation with Monocular Images | Runfa Li, Truong Nguyen | N/A | |
| TO-Scene: A Large-Scale Dataset for Understanding 3D Tabletop Scenes | Mutian Xu, Pei Chen, Haolin Liu, Xiaoguang Han | N/A | |
| Is It Necessary to Transfer Temporal Knowledge for Domain Adaptive Video Semantic Segmentation? | Xinyi Wu, Zhenyao Wu, Jin Wan, Lili Ju, Song Wang | N/A | |
| Meta Spatio-Temporal Debiasing for Video Scene Graph Generation | Li Xu, Haoxuan Qu, Jason Kuen, Jiuxiang Gu, Jun Liu | N/A | |
| Improving the Reliability for Confidence Estimation | Haoxuan Qu, Yanchao Li, Lin Geng Foo, Jason Kuen, Jiuxiang Gu, Jun Liu | N/A | |
| Fine-Grained Scene Graph Generation with Data Transfer | Ao Zhang, Yuan Yao, Qianyu Chen, Wei Ji, Zhiyuan Liu, Maosong Sun, Tat-Seng Chua | N/A | |
| Pose2Room: Understanding 3D Scenes from Human Activities | Yinyu Nie, Angela Dai, Xiaoguang Han, Matthias Nießner | N/A | |
| Towards Hard-Positive Query Mining for DETR-Based Human-Object Interaction Detection | Xubin Zhong, Changxing Ding, Zijian Li, Shaoli Huang | N/A | |
| Discovering Human-Object Interaction Concepts via Self-Compositional Learning | Zhi Hou, Baosheng Yu, Dacheng Tao | N/A | |
| Primitive-Based Shape Abstraction via Nonparametric Bayesian Inference | Yuwei Wu, Weixiao Liu, Sipu Ruan, Gregory S. Chirikjian | N/A | |
| Stereo Depth Estimation with Echoes | Chenghao Zhang, Kun Tian, Bolin Ni, Gaofeng Meng, Bin Fan, Zhaoxiang Zhang, Chunhong Pan | N/A | |
| Inverted Pyramid Multi-task Transformer for Dense Scene Understanding | Hanrong Ye, Dan Xu | N/A | |
| PETR: Position Embedding Transformation for Multi-View 3D Object Detection | Yingfei Liu, Tiancai Wang, Xiangyu Zhang, Jian Sun | N/A | |
| S2Net: Stochastic Sequential Pointcloud Forecasting | Xinshuo Weng, Junyu Nan, Kuan-Hui Lee, Rowan McAllister, Adrien Gaidon, Nicholas Rhinehart, Kris M. Kitani | N/A | |
| RA-Depth: Resolution Adaptive Self-Supervised Monocular Depth Estimation | Mu He, Le Hui, Yikai Bian, Jian Ren, Jin Xie, Jian Yang | N/A | |
| PolyphonicFormer: Unified Query Learning for Depth-Aware Video Panoptic Segmentation | Haobo Yuan, Xiangtai Li, Yibo Yang, Guangliang Cheng, Jing Zhang, Yunhai Tong, Lefei Zhang, Dacheng Tao | N/A | |
| SQN: Weakly-Supervised Semantic Segmentation of Large-Scale 3D Point Clouds | Qingyong Hu, Bo Yang, Guangchi Fang, Yulan Guo, Aleš Leonardis, Niki Trigoni, Andrew Markham | N/A | |
| PointMixer: MLP-Mixer for Point Cloud Understanding | Jaesung Choe, Chunghyun Park, Francois Rameau, Jaesik Park, In So Kweon | N/A | |
| Initialization and Alignment for Adversarial Texture Optimization | Xiaoming Zhao, Zhizhen Zhao, Alexander G. Schwing | N/A | |
| MOTR: End-to-End Multiple-Object Tracking with TRansformer | Fangao Zeng, Bin Dong, Yuang Zhang, Tiancai Wang, Xiangyu Zhang, Yichen Wei | N/A | |
| GALA: Toward Geometry-and-Lighting-Aware Object Search for Compositing | Sijie Zhu, Zhe Lin, Scott Cohen, Jason Kuen, Zhifei Zhang, Chen Chen | N/A | |
| LaLaLoc++: Global Floor Plan Comprehension for Layout Localisation in Unvisited Environments | Henry Howard-Jenkins, Victor Adrian Prisacariu | N/A | |
| 3D-PL: Domain Adaptive Depth Estimation with 3D-Aware Pseudo-Labeling | Yu-Ting Yen, Chia-Ni Lu, Wei-Chen Chiu, Yi-Hsuan Tsai | N/A | |
| Panoptic-PartFormer: Learning a Unified Model for Panoptic Part Segmentation | Xiangtai Li, Shilin Xu, Yibo Yang, Guangliang Cheng, Yunhai Tong, Dacheng Tao | N/A | |
| Salient Object Detection for Point Clouds | Songlin Fan, Wei Gao, Ge Li | N/A | |
| Learning Semantic Segmentation from Multiple Datasets with Label Shifts | Dongwan Kim, Yi-Hsuan Tsai, Yumin Suh, Masoud Faraki, Sparsh Garg, Manmohan Chandraker, Bohyung Han | N/A | |
| Weakly Supervised 3D Scene Segmentation with Region-Level Boundary Awareness and Instance Discrimination | Kangcheng Liu, Yuzhi Zhao, Qiang Nie, Zhi Gao, Ben M. Chen | N/A | |
| Towards Open-Vocabulary Scene Graph Generation with Prompt-Based Finetuning | Tao He, Lianli Gao, Jingkuan Song, Yuan-Fang Li | N/A | |
| Variance-Aware Weight Initialization for Point Convolutional Neural Networks | Pedro Hermosilla, Michael Schelling, Tobias Ritschel, Timo Ropinski | N/A | |
| Break and Make: Interactive Structural Understanding Using LEGO Bricks | Aaron Walsman, Muru Zhang, Klemen Kotar, Karthik Desingh, Ali Farhadi, Dieter Fox | N/A | |
| Bi-PointFlowNet: Bidirectional Learning for Point Cloud Based Scene Flow Estimation | Wencan Cheng, Jong Hwan Ko | N/A | |
| 3DG-STFM: 3D Geometric Guided Student-Teacher Feature Matching | Runyu Mao, Chen Bai, Yatong An, Fengqing Zhu, Cheng Lu | N/A | |
| Video Restoration Framework and Its Meta-Adaptations to Data-Poor Conditions | Prashant W Patil, Sunil Gupta, Santu Rana, Svetha Venkatesh | N/A | |
| MonteBoxFinder: Detecting and Filtering Primitives to Fit a Noisy Point Cloud | Michaël Ramamonjisoa, Sinisa Stekovic, Vincent Lepetit | N/A | |
| Scene Text Recognition with Permuted Autoregressive Sequence Models | Darwin Bautista, Rowel Atienza | N/A | |
| When Counting Meets HMER: Counting-Aware Network for Handwritten Mathematical Expression Recognition | Bohan Li, Ye Yuan, Dingkang Liang, Xiao Liu, Zhilong Ji, Jinfeng Bai, Wenyu Liu, Xiang Bai | N/A | |
| Detecting Tampered Scene Text in the Wild | Yuxin Wang, Hongtao Xie, Mengting Xing, Jing Wang, Shenggao Zhu, Yongdong Zhang | N/A | |
| Optimal Boxes: Boosting End-to-End Scene Text Recognition by Adjusting Annotated Bounding Boxes via Reinforcement Learning | Jingqun Tang, Wenming Qian, Luchuan Song, Xiena Dong, Lan Li, Xiang Bai | N/A | |
| GLASS: Global to Local Attention for Scene-Text Spotting | Roi Ronen, Shahar Tsiper, Oron Anschel, Inbal Lavi, Amir Markovitz, R. Manmatha | N/A | |
| COO: Comic Onomatopoeia Dataset for Recognizing Arbitrary or Truncated Texts | Jeonghun Baek, Yusuke Matsui, Kiyoharu Aizawa | N/A | |
| Language Matters: A Weakly Supervised Vision-Language Pre-training Approach for Scene Text Detection and Spotting | Chuhui Xue, Wenqing Zhang, Yu Hao, Shijian Lu, Philip H. S. Torr, Song Bai | N/A | |
| Toward Understanding WordArt: Corner-Guided Transformer for Scene Text Recognition | Xudong Xie, Ling Fu, Zhifei Zhang, Zhaowen Wang, Xiang Bai | N/A | |
| Levenshtein OCR | Cheng Da, Peng Wang, Cong Yao | N/A | |
| Multi-Granularity Prediction for Scene Text Recognition | Peng Wang, Cheng Da, Cong Yao | N/A | |
| Dynamic Low-Resolution Distillation for Cost-Efficient End-to-End Text Spotting | Ying Chen, Liang Qiao, Zhanzhan Cheng, Shiliang Pu, Yi Niu, Xi Li | N/A | |
| Contextual Text Block Detection towards Scene Text Understanding | Chuhui Xue, Jiaxing Huang, Wenqing Zhang, Shijian Lu, Changhu Wang, Song Bai | N/A | |
| CoMER: Modeling Coverage for Transformer-Based Handwritten Mathematical Expression Recognition | Wenqi Zhao, Liangcai Gao | N/A | |
| Don’t Forget Me: Accurate Background Recovery for Text Removal via Modeling Local-Global Context | Chongyu Liu, Lianwen Jin, Yuliang Liu, Canjie Luo, Bangdong Chen, Fengjun Guo, Kai Ding | N/A | |
| TextAdaIN: Paying Attention to Shortcut Learning in Text Recognizers | Oren Nuriel, Sharon Fogel, Ron Litman | N/A | |
| Multi-modal Text Recognition Networks: Interactive Enhancements between Visual and Semantic Features | Byeonghu Na, Yoonsik Kim, Sungrae Park | N/A | |
| SGBANet: Semantic GAN and Balanced Attention Network for Arbitrarily Oriented Scene Text Recognition | Dajian Zhong, Shujing Lyu, Palaiahnakote Shivakumara, Bing Yin, Jiajia Wu, Umapada Pal, Yue Lu | N/A | |
| Pure Transformer with Integrated Experts for Scene Text Recognition | Yew Lee Tan, Adams Wai-Kin Kong, Jung-Jae Kim | N/A | |
| OCR-Free Document Understanding Transformer | Geewook Kim, Teakgyu Hong, Moonbin Yim, JeongYeon Nam, Jinyoung Park, Jinyeong Yim, Wonseok Hwang, Sangdoo Yun, Dongyoon Han, Seunghyun Park | N/A | |
| CAR: Class-Aware Regularizations for Semantic Segmentation | Ye Huang, Di Kang, Liang Chen, Xuefei Zhe, Wenjing Jia, Linchao Bao, Xiangjian He | N/A | |
| Style-Hallucinated Dual Consistency Learning for Domain Generalized Semantic Segmentation | Yuyang Zhao, Zhun Zhong, Na Zhao, Nicu Sebe, Gim Hee Lee | N/A | |
| SeqFormer: Sequential Transformer for Video Instance Segmentation | Junfeng Wu, Yi Jiang, Song Bai, Wenqing Zhang, Xiang Bai | N/A | |
| Saliency Hierarchy Modeling via Generative Kernels for Salient Object Detection | Wenhu Zhang, Liangli Zheng, Huanyu Wang, Xintian Wu, Xi Li | N/A | |
| In Defense of Online Models for Video Instance Segmentation | Junfeng Wu, Qihao Liu, Yi Jiang, Song Bai, Alan Yuille, Xiang Bai | N/A | |
| Active Pointly-Supervised Instance Segmentation | Chufeng Tang, Lingxi Xie, Gang Zhang, Xiaopeng Zhang, Qi Tian, Xiaolin Hu | N/A | |
| A Transformer-Based Decoder for Semantic Segmentation with Multi-level Context Mining | Bowen Shi, Dongsheng Jiang, Xiaopeng Zhang, Han Li, Wenrui Dai, Junni Zou, Hongkai Xiong, Qi Tian | N/A | |
| XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model | Ho Kei Cheng, Alexander G. Schwing | N/A | |
| Self-Distillation for Robust LiDAR Semantic Segmentation in Autonomous Driving | Jiale Li, Hang Dai, Yong Ding | N/A | |
| 2DPASS: 2D Priors Assisted Semantic Segmentation on LiDAR Point Clouds | Xu Yan, Jiantao Gao, Chaoda Zheng, Chao Zheng, Ruimao Zhang, Shuguang Cui, Zhen Li | N/A | |
| Extract Free Dense Labels from CLIP | Chong Zhou, Chen Change Loy, Bo Dai | N/A | |
| 3D Compositional Zero-Shot Learning with DeCompositional Consensus | Muhammad Ferjad Naeem, Evin Pınar Örnek, Yongqin Xian, Luc Van Gool, Federico Tombari | N/A | |
| Video Mask Transfiner for High-Quality Video Instance Segmentation | Lei Ke, Henghui Ding, Martin Danelljan, Yu-Wing Tai, Chi-Keung Tang, Fisher Yu | N/A | |
| Box-Supervised Instance Segmentation with Level Set Evolution | Wentong Li, Wenyu Liu, Jianke Zhu, Miaomiao Cui, Xian-Sheng Hua, Lei Zhang | N/A | |
| Point Primitive Transformer for Long-Term 4D Point Cloud Video Understanding | Hao Wen, Yunze Liu, Jingwei Huang, Bo Duan, Li Yi | N/A | |
| Adaptive Agent Transformer for Few-Shot Segmentation | Yuan Wang, Rui Sun, Zhe Zhang, Tianzhu Zhang | N/A | |
| Waymo Open Dataset: Panoramic Video Panoptic Segmentation | Jieru Mei, Alex Zihao Zhu, Xinchen Yan, Hang Yan, Siyuan Qiao, Yukun Zhu, Liang-Chieh Chen, Henrik Kretzschmar | N/A | |
| TransFGU: A Top-down Approach to Fine-Grained Unsupervised Semantic Segmentation | Zhaoyuan Yin, Pichao Wang, Fan Wang, Xianzhe Xu, Hanling Zhang, Hao Li, Rong Jin | N/A | |
| AdaAfford: Learning to Adapt Manipulation Affordance for 3D Articulated Objects via Few-Shot Interactions | Yian Wang, Ruihai Wu, Kaichun Mo, Jiaqi Ke, Qingnan Fan, Leonidas J. Guibas, Hao Dong | N/A | |
| Cost Aggregation with 4D Convolutional Swin Transformer for Few-Shot Segmentation | Sunghwan Hong, Seokju Cho, Jisu Nam, Stephen Lin, Seungryong Kim | N/A | |
| Fine-Grained Egocentric Hand-Object Segmentation: Dataset, Model, and Applications | Lingzhi Zhang, Shenghao Zhou, Simon Stent, Jianbo Shi | N/A | |
| Perceptual Artifacts Localization for Inpainting | Lingzhi Zhang, Yuqian Zhou, Connelly Barnes, Sohrab Amirghodsi, Zhe Lin, Eli Shechtman, Jianbo Shi | N/A | |
| 2D Amodal Instance Segmentation Guided by 3D Shape Prior | Zhixuan Li, Weining Ye, Tingting Jiang, Tiejun Huang | N/A | |
| Data Efficient 3D Learner via Knowledge Transferred from 2D Model | Ping-Chung Yu, Cheng Sun, Min Sun | N/A | |
| Adaptive Spatial-BCE Loss for Weakly Supervised Semantic Segmentation | Tong Wu, Guangyu Gao, Junshi Huang, Xiaolin Wei, Xiaoming Wei, Chi Harold Liu | N/A | |
| Dense Gaussian Processes for Few-Shot Segmentation | Joakim Johnander, Johan Edstedt, Michael Felsberg, Fahad Shahbaz Khan, Martin Danelljan | N/A | |
| 3D Instances as 1D Kernels | Yizheng Wu, Min Shi, Shuaiyuan Du, Hao Lu, Zhiguo Cao, Weicai Zhong | N/A | |
| TransMatting: Enhancing Transparent Objects Matting with Transformers | Huanqia Cai, Fanglei Xue, Lele Xu, Lili Guo | N/A | |
| MVSalNet: Multi-View Augmentation for RGB-D Salient Object Detection | Jiayuan Zhou, Lijun Wang, Huchuan Lu, Kaining Huang, Xinchu Shi, Bocong Liu | N/A | |
| k-Means Mask Transformer | Qihang Yu, Huiyu Wang, Siyuan Qiao, Maxwell Collins, Yukun Zhu, Hartwig Adam, Alan Yuille, Liang-Chieh Chen | N/A | |
| SegPGD: An Effective and Efficient Adversarial Attack for Evaluating and Boosting Segmentation Robustness | Jindong Gu, Hengshuang Zhao, Volker Tresp, Philip H. S. Torr | N/A | |
| Adversarial Erasing Framework via Triplet with Gated Pyramid Pooling Layer for Weakly Supervised Semantic Segmentation | Sung-Hoon Yoon, Hyeokjun Kweon, Jegyeong Cho, Shinjeong Kim, Kuk-Jin Yoon | N/A | |
| Continual Semantic Segmentation via Structure Preserving and Projected Feature Alignment | Zihan Lin, Zilei Wang, Yixin Zhang | N/A | |
| Interclass Prototype Relation for Few-Shot Segmentation | Atsuro Okazawa | N/A | |
| Slim Scissors: Segmenting Thin Object from Synthetic Background | Kunyang Han, Jun Hao Liew, Jiashi Feng, Huawei Tian, Yao Zhao, Yunchao Wei | N/A | |
| Abstracting Sketches through Simple Primitives | Stephan Alaniz, Massimiliano Mancini, Anjan Dutta, Diego Marcos, Zeynep Akata | N/A | |
| Multi-Scale and Cross-Scale Contrastive Learning for Semantic Segmentation | Theodoros Pissas, Claudio S. Ravasio, Lyndon Da Cruz, Christos Bergeles | N/A | |
| One-Trimap Video Matting | Hongje Seong, Seoung Wug Oh, Brian Price, Euntai Kim, Joon-Young Lee | N/A | |
| D2ADA: Dynamic Density-Aware Active Domain Adaptation for Semantic Segmentation | Tsung-Han Wu, Yi-Syuan Liou, Shao-Ji Yuan, Hsin-Ying Lee, Tung-I Chen, Kuan-Chih Huang, Winston H. Hsu | N/A | |
| Learning Quality-Aware Dynamic Memory for Video Object Segmentation | Yong Liu, Ran Yu, Fei Yin, Xinyuan Zhao, Wei Zhao, Weihao Xia, Yujiu Yang | N/A | |
| Learning Implicit Feature Alignment Function for Semantic Segmentation | Hanzhe Hu, Yinbo Chen, Jiarui Xu, Shubhankar Borse, Hong Cai, Fatih Porikli, Xiaolong Wang | N/A | |
| Quantum Motion Segmentation | Federica Arrigoni, Willi Menapace, Marcel Seelbach Benkner, Elisa Ricci, Vladislav Golyanik | N/A | |
| Instance As Identity: A Generic Online Paradigm for Video Instance Segmentation | Feng Zhu, Zongxin Yang, Xin Yu, Yi Yang, Yunchao Wei | N/A | |
| Laplacian Mesh Transformer: Dual Attention and Topology Aware Network for 3D Mesh Classification and Segmentation | Xiao-Juan Li, Jie Yang, Fang-Lue Zhang | N/A | |
| Geodesic-Former: A Geodesic-Guided Few-Shot 3D Point Cloud Instance Segmenter | Tuan Ngo, Khoi Nguyen | N/A | |
| Union-Set Multi-source Model Adaptation for Semantic Segmentation | Zongyao Li, Ren Togo, Takahiro Ogawa, Miki Haseyama | N/A | |
| Point MixSwap: Attentional Point Cloud Mixing via Swapping Matched Structural Divisions | Ardian Umam, Cheng-Kun Yang, Yung-Yu Chuang, Jen-Hui Chuang, Yen-Yu Lin | N/A | |
| BATMAN: Bilateral Attention Transformer in Motion-Appearance Neighboring Space for Video Object Segmentation | Ye Yu, Jialin Yuan, Gaurav Mittal, Li Fuxin, Mei Chen | N/A | |
| SPSN: Superpixel Prototype Sampling Network for RGB-D Salient Object Detection | Minhyeok Lee, Chaewon Park, Suhwan Cho, Sangyoun Lee | N/A | |
| Global Spectral Filter Memory Network for Video Object Segmentation | Yong Liu, Ran Yu, Jiahao Wang, Xinyuan Zhao, Yitong Wang, Yansong Tang, Yujiu Yang | N/A | |
| Video Instance Segmentation via Multi-Scale Spatio-Temporal Split Attention Transformer | Omkar Thawakar, Sanath Narayan, Jiale Cao, Hisham Cholakkal, Rao Muhammad Anwer, Muhammad Haris Khan, Salman Khan, Michael Felsberg, Fahad Shahbaz Khan | N/A | |
| RankSeg: Adaptive Pixel Classification with Image Category Ranking for Segmentation | Haodi He, Yuhui Yuan, Xiangyu Yue, Han Hu | N/A | |
| Learning Topological Interactions for Multi-Class Medical Image Segmentation | Saumya Gupta, Xiaoling Hu, James Kaan, Michael Jin, Mutshipay Mpoy, Katherine Chung, Gagandeep Singh, Mary Saltz, Tahsin Kurc, Joel Saltz, Apostolos Tassiopoulos, Prateek Prasanna, Chao Chen | N/A | |
| Unsupervised Segmentation in Real-World Images via Spelke Object Inference | Honglin Chen, Rahul Venkatesh, Yoni Friedman, Jiajun Wu, Joshua B. Tenenbaum, Daniel L. K. Yamins, Daniel M. Bear | N/A | |
| A Simple Baseline for Open-Vocabulary Semantic Segmentation with Pre-trained Vision-Language Model | Mengde Xu, Zheng Zhang, Fangyun Wei, Yutong Lin, Yue Cao, Han Hu, Xiang Bai | N/A | |
| Fast Two-View Motion Segmentation Using Christoffel Polynomials | Bengisu Ozbay, Octavia Camps, Mario Sznaier | N/A | |
| UCTNet: Uncertainty-Aware Cross-Modal Transformer Network for Indoor RGB-D Semantic Segmentation | Xiaowen Ying, Mooi Choo Chuah | N/A | |
| Bi-directional Contrastive Learning for Domain Adaptive Semantic Segmentation | Geon Lee, Chanho Eom, Wonkyung Lee, Hyekang Park, Bumsub Ham | N/A | |
| Learning Regional Purity for Instance Segmentation on 3D Point Clouds | Shichao Dong, Guosheng Lin, Tzu-Yi Hung | N/A | |
| Cross-Domain Few-Shot Semantic Segmentation | Shuo Lei, Xuchao Zhang, Jianfeng He, Fanglan Chen, Bowen Du, Chang-Tien Lu | N/A | |
| Generative Subgraph Contrast for Self-Supervised Graph Representation Learning | Yuehui Han, Le Hui, Haobo Jiang, Jianjun Qian, Jin Xie | N/A | |
| SdAE: Self-Distillated Masked Autoencoder | Yabo Chen, Yuchen Liu, Dongsheng Jiang, Xiaopeng Zhang, Wenrui Dai, Hongkai Xiong, Qi Tian | N/A | |
| Demystifying Unsupervised Semantic Correspondence Estimation | Mehmet Aygün, Oisin Mac Aodha | N/A | |
| Open-Set Semi-Supervised Object Detection | Yen-Cheng Liu, Chih-Yao Ma, Xiaoliang Dai, Junjiao Tian, Peter Vajda, Zijian He, Zsolt Kira | N/A | |
| Vibration-Based Uncertainty Estimation for Learning from Limited Supervision | Hengtong Hu, Lingxi Xie, Xinyue Huo, Richang Hong, Qi Tian | N/A | |
| Concurrent Subsidiary Supervision for Unsupervised Source-Free Domain Adaptation | Jogendra Nath Kundu, Suvaansh Bhambri, Akshay Kulkarni, Hiran Sarkar, Varun Jampani, R. Venkatesh Babu | N/A | |
| Weakly Supervised Object Localization through Inter-class Feature Similarity and Intra-Class Appearance Consistency | Jun Wei, Sheng Wang, S. Kevin Zhou, Shuguang Cui, Zhen Li | N/A | |
| Active Learning Strategies for Weakly-Supervised Object Detection | Huy V. Vo, Oriane Siméoni, Spyros Gidaris, Andrei Bursuc, Patrick Pérez, Jean Ponce | N/A | |
| Mc-BEiT: Multi-Choice Discretization for Image BERT Pre-training | Xiaotong Li, Yixiao Ge, Kun Yi, Zixuan Hu, Ying Shan, Ling-Yu Duan | N/A | |
| Bootstrapped Masked Autoencoders for Vision BERT Pretraining | Xiaoyi Dong, Jianmin Bao, Ting Zhang, Dongdong Chen, Weiming Zhang, Lu Yuan, Dong Chen, Fang Wen, Nenghai Yu | N/A | |
| Unsupervised Visual Representation Learning by Synchronous Momentum Grouping | Bo Pang, Yifan Zhang, Yaoyi Li, Jia Cai, Cewu Lu | N/A | |
| Improving Few-Shot Part Segmentation Using Coarse Supervision | Oindrila Saha, Zezhou Cheng, Subhransu Maji | N/A | |
| What to Hide from Your Students: Attention-Guided Masked Image Modeling | Ioannis Kakogeorgiou, Spyros Gidaris, Bill Psomas, Yannis Avrithis, Andrei Bursuc, Konstantinos Karantzalos, Nikos Komodakis | N/A | |
| Pointly-Supervised Panoptic Segmentation | Junsong Fan, Zhaoxiang Zhang, Tieniu Tan | N/A | |
| MVP: Multimodality-Guided Visual Pre-training | Longhui Wei, Lingxi Xie, Wengang Zhou, Houqiang Li, Qi Tian | N/A | |
| Locally Varying Distance Transform for Unsupervised Visual Anomaly Detection | Wen-Yan Lin, Zhonghang Liu, Siying Liu | N/A | |
| HRDA: Context-Aware High-Resolution Domain-Adaptive Semantic Segmentation | Lukas Hoyer, Dengxin Dai, Luc Van Gool | N/A | |
| SPot-the-Difference Self-Supervised Pre-training for Anomaly Detection and Segmentation | Yang Zou, Jongheon Jeong, Latha Pemula, Dongqing Zhang, Onkar Dabeer | N/A | |
| Dual-Domain Self-Supervised Learning and Model Adaption for Deep Compressive Imaging | Yuhui Quan, Xinran Qin, Tongyao Pang, Hui Ji | N/A | |
| Unsupervised Selective Labeling for More Effective Semi-Supervised Learning | Xudong Wang, Long Lian, Stella X. Yu | N/A | |
| Max Pooling with Vision Transformers Reconciles Class and Shape in Weakly Supervised Semantic Segmentation | Simone Rossetti, Damiano Zappia, Marta Sanzari, Marco Schaerf, Fiora Pirri | N/A | |
| Dense Siamese Network for Dense Unsupervised Learning | Wenwei Zhang, Jiangmiao Pang, Kai Chen, Chen Change Loy | N/A | |
| Multi-Granularity Distillation Scheme towards Lightweight Semi-Supervised Semantic Segmentation | Jie Qin, Jie Wu, Ming Li, Xuefeng Xiao, Min Zheng, Xingang Wang | N/A | |
| CP2: Copy-Paste Contrastive Pretraining for Semantic Segmentation | Feng Wang, Huiyu Wang, Chen Wei, Alan Yuille, Wei Shen | N/A | |
| Self-Filtering: A Noise-Aware Sample Selection for Label Noise with Confidence Penalization | Qi Wei, Haoliang Sun, Xiankai Lu, Yilong Yin | N/A | |
| RDA: Reciprocal Distribution Alignment for Robust Semi-Supervised Learning | Yue Duan, Lei Qi, Lei Wang, Luping Zhou, Yinghuan Shi | N/A | |
| MemSAC: Memory Augmented Sample Consistency for Large Scale Domain Adaptation | Tarun Kalluri, Astuti Sharma, Manmohan Chandraker | N/A | |
| United Defocus Blur Detection and Deblurring via Adversarial Promoting Learning | Wenda Zhao, Fei Wei, You He, Huchuan Lu | N/A | |
| Synergistic Self-Supervised and Quantization Learning | Yun-Hao Cao, Peiqin Sun, Yechang Huang, Jianxin Wu, Shuchang Zhou | N/A | |
| Semi-Supervised Vision Transformers | Zejia Weng, Xitong Yang, Ang Li, Zuxuan Wu, Yu-Gang Jiang | N/A | |
| Domain Adaptive Video Segmentation via Temporal Pseudo Supervision | Yun Xing, Dayan Guan, Jiaxing Huang, Shijian Lu | N/A | |
| Diverse Learner: Exploring Diverse Supervision for Semi-Supervised Object Detection | Linfeng Li, Minyue Jiang, Yue Yu, Wei Zhang, Xiangru Lin, Yingying Li, Xiao Tan, Jingdong Wang, Errui Ding | N/A | |
| A Closer Look at Invariances in Self-Supervised Pre-training for 3D Vision | Lanxiao Li, Michael Heizmann | N/A | |
| ConMatch: Semi-Supervised Learning with Confidence-Guided Consistency Regularization | Jiwon Kim, Youngjo Min, Daehwan Kim, Gyuseong Lee, Junyoung Seo, Kwangrok Ryoo, Seungryong Kim | N/A | |
| FedX: Unsupervised Federated Learning with Cross Knowledge Distillation | Sungwon Han, Sungwon Park, Fangzhao Wu, Sundong Kim, Chuhan Wu, Xing Xie, Meeyoung Cha | N/A | |
| W2N: Switching from Weak Supervision to Noisy Supervision for Object Detection | Zitong Huang, Yiping Bao, Bowen Dong, Erjin Zhou, Wangmeng Zuo | N/A | |
| Decoupled Adversarial Contrastive Learning for Self-Supervised Adversarial Robustness | Chaoning Zhang, Kang Zhang, Chenshuang Zhang, Axi Niu, Jiu Feng, Chang D. Yoo, In So Kweon | N/A | |
| GOCA: Guided Online Cluster Assignment for Self-Supervised Video Representation Learning | Huseyin Coskun, Alireza Zareian, Joshua L. Moore, Federico Tombari, Chen Wang | N/A | |
| Constrained Mean Shift Using Distant Yet Related Neighbors for Representation Learning | K L Navaneet, Soroush Abbasi Koohpayegani, Ajinkya Tejankar, Kossar Pourahmadi, Akshayvarun Subramanya, Hamed Pirsiavash | N/A | |
| Revisiting the Critical Factors of Augmentation-Invariant Representation Learning | Junqiang Huang, Xiangwen Kong, Xiangyu Zhang | N/A | |
| CA-SSL: Class-Agnostic Semi-Supervised Learning for Detection and Segmentation | Lu Qi, Jason Kuen, Zhe Lin, Jiuxiang Gu, Fengyun Rao, Dian Li, Weidong Guo, Zhen Wen, Ming-Hsuan Yang, Jiaya Jia | N/A | |
| Dual Adaptive Transformations for Weakly Supervised Point Cloud Segmentation | Zhonghua Wu, Yicheng Wu, Guosheng Lin, Jianfei Cai, Chen Qian | N/A | |
| Semantic-Aware Fine-Grained Correspondence | Yingdong Hu, Renhao Wang, Kaifeng Zhang, Yang Gao | N/A | |
| Self-Supervised Classification Network | Elad Amrani, Leonid Karlinsky, Alex Bronstein | N/A | |
| Data Invariants to Understand Unsupervised Out-of-Distribution Detection | Lars Doorenbos, Raphael Sznitman, Pablo Márquez-Neila | N/A | |
| Domain Invariant Masked Autoencoders for Self-Supervised Learning from Multi-Domains | Haiyang Yang, Shixiang Tang, Meilin Chen, Yizhou Wang, Feng Zhu, Lei Bai, Rui Zhao, Wanli Ouyang | N/A | |
| Semi-Supervised Object Detection via Virtual Category Learning | Changrui Chen, Kurt Debattista, Jungong Han | N/A | |
| Completely Self-Supervised Crowd Counting via Distribution Matching | Deepak Babu Sam, Abhinav Agarwalla, Jimmy Joseph, Vishwanath A. Sindagi, R. Venkatesh Babu, Vishal M. Patel | N/A | |
| Coarse-to-Fine Incremental Few-Shot Learning | Xiang Xiang, Yuwen Tan, Qian Wan, Jing Ma, Alan Yuille, Gregory D. Hager | N/A | |
| Learning Unbiased Transferability for Domain Adaptation by Uncertainty Modeling | Jian Hu, Haowen Zhong, Fei Yang, Shaogang Gong, Guile Wu, Junchi Yan | N/A | |
| Learn2Augment: Learning to Composite Videos for Data Augmentation in Action Recognition | Shreyank N Gowda, Marcus Rohrbach, Frank Keller, Laura Sevilla-Lara | N/A | |
| CYBORGS: Contrastively Bootstrapping Object Representations by Grounding in Segmentation | Renhao Wang, Hang Zhao, Yang Gao | N/A | |
| PSS: Progressive Sample Selection for Open-World Visual Representation Learning | Tianyue Cao, Yongxin Wang, Yifan Xing, Tianjun Xiao, Tong He, Zheng Zhang, Hao Zhou, Joseph Tighe | N/A | |
| Improving Self-Supervised Lightweight Model Learning via Hard-Aware Metric Distillation | Hao Liu, Mang Ye | N/A | |
| Object Discovery via Contrastive Learning for Weakly Supervised Object Detection | Jinhwan Seo, Wonho Bae, Danica J. Sutherland, Junhyug Noh, Daijin Kim | N/A | |
| Stochastic Consensus: Enhancing Semi-Supervised Learning with Consistency of Stochastic Classifiers | Hui Tang, Lin Sun, Kui Jia | N/A | |
| DiffuseMorph: Unsupervised Deformable Image Registration Using Diffusion Model | Boah Kim, Inhwa Han, Jong Chul Ye | N/A | |
| Semi-Leak: Membership Inference Attacks against Semi-Supervised Learning | Xinlei He, Hongbin Liu, Neil Zhenqiang Gong, Yang Zhang | N/A | |
| OpenLDN: Learning to Discover Novel Classes for Open-World Semi-Supervised Learning | Mamshad Nayeem Rizve, Navid Kardan, Salman Khan, Fahad Shahbaz Khan, Mubarak Shah | N/A | |
| Embedding Contrastive Unsupervised Features to Cluster in- and Out-of-Distribution Noise in Corrupted Image Datasets | Paul Albert, Eric Arazo, Noel E. O’Connor, Kevin McGuinness | N/A | |
| Unsupervised Few-Shot Image Classification by Learning Features into Clustering Space | Shuo Li, Fang Liu, Zehua Hao, Kaibo Zhao, Licheng Jiao | N/A | |
| Towards Realistic Semi-Supervised Learning | Mamshad Nayeem Rizve, Navid Kardan, Mubarak Shah | N/A | |
| Masked Siamese Networks for Label-Efficient Learning | Mahmoud Assran, Mathilde Caron, Ishan Misra, Piotr Bojanowski, Florian Bordes, Pascal Vincent, Armand Joulin, Michael Rabbat, Nicolas Ballas | N/A | |
| Natural Synthetic Anomalies for Self-Supervised Anomaly Detection and Localization | Hannah M. Schlüter, Jeremy Tan, Benjamin Hou, Bernhard Kainz | N/A | |
| Understanding Collapse in Non-Contrastive Siamese Representation Learning | Alexander C. Li, Alexei A. Efros, Deepak Pathak | N/A | |
| Federated Self-Supervised Learning for Video Understanding | Yasar Abbas Ur Rehman, Yan Gao, Jiajun Shen, Pedro Porto Buarque de Gusmão, Nicholas Lane | N/A | |
| Towards Efficient and Effective Self-Supervised Learning of Visual Representations | Sravanti Addepalli, Kaushal Bhogale, Priyam Dey, R. Venkatesh Babu | N/A | |
| DSR – A Dual Subspace Re-Projection Network for Surface Anomaly Detection | Vitjan Zavrtanik, Matej Kristan, Danijel Skočaj | N/A | |
| PseudoAugment: Learning to Use Unlabeled Data for Data Augmentation in Point Clouds | Zhaoqi Leng, Shuyang Cheng, Benjamin Caine, Weiyue Wang, Xiao Zhang, Jonathon Shlens, Mingxing Tan, Dragomir Anguelov | N/A | |
| MVSTER: Epipolar Transformer for Efficient Multi-View Stereo | Xiaofeng Wang, Zheng Zhu, Guan Huang, Fangbo Qin, Yun Ye, Yijia He, Xu Chi, Xingang Wang | N/A | |
| RelPose: Predicting Probabilistic Relative Rotation for Single Objects in the Wild | Jason Y. Zhang, Deva Ramanan, Shubham Tulsiani | N/A | |
| R2L: Distilling Neural Radiance Field to Neural Light Field for Efficient Novel View Synthesis | Huan Wang, Jian Ren, Zeng Huang, Kyle Olszewski, Menglei Chai, Yun Fu, Sergey Tulyakov | N/A | |
| KD-MVS: Knowledge Distillation Based Self-Supervised Learning for Multi-View Stereo | Yikang Ding, Qingtian Zhu, Xiangyue Liu, Wentao Yuan, Haotian Zhang, Chi Zhang | N/A | |
| SALVe: Semantic Alignment Verification for Floorplan Reconstruction from Sparse Panoramas | John Lambert, Yuguang Li, Ivaylo Boyadzhiev, Lambert Wixson, Manjunath Narayana, Will Hutchcroft, James Hays, Frank Dellaert, Sing Bing Kang | N/A | |
| RC-MVSNet: Unsupervised Multi-View Stereo with Neural Rendering | Di Chang, Aljaž Božič, Tong Zhang, Qingsong Yan, Yingcong Chen, Sabine Süsstrunk, Matthias Nießner | N/A | |
| Box2Mask: Weakly Supervised 3D Semantic Instance Segmentation Using Bounding Boxes | Julian Chibane, Francis Engelmann, Tuan Anh Tran, Gerard Pons-Moll | N/A | |
| NeILF: Neural Incident Light Field for Physically-Based Material Estimation | Yao Yao, Jingyang Zhang, Jingbo Liu, Yihang Qu, Tian Fang, David McKinnon, Yanghai Tsin, Long Quan | N/A | |
| ARF: Artistic Radiance Fields | Kai Zhang, Nick Kolkin, Sai Bi, Fujun Luan, Zexiang Xu, Eli Shechtman, Noah Snavely | N/A | |
| Multiview Stereo with Cascaded Epipolar RAFT | Zeyu Ma, Zachary Teed, Jia Deng | N/A | |
| ARAH: Animatable Volume Rendering of Articulated Human SDFs | Shaofei Wang, Katja Schwarz, Andreas Geiger, Siyu Tang | N/A | |
| ASpanFormer: Detector-Free Image Matching with Adaptive Span Transformer | Hongkai Chen, Zixin Luo, Lei Zhou, Yurun Tian, Mingmin Zhen, Tian Fang, David McKinnon, Yanghai Tsin, Long Quan | N/A | |
| NDF: Neural Deformable Fields for Dynamic Human Modelling | Ruiqi Zhang, Jie Chen | N/A | |
| Neural Density-Distance Fields | Itsuki Ueda, Yoshihiro Fukuhara, Hirokatsu Kataoka, Hiroaki Aizawa, Hidehiko Shishido, Itaru Kitahara | N/A | |
| NeXT: Towards High Quality Neural Radiance Fields via Multi-Skip Transformer | Yunxiao Wang, Yanjie Li, Peidong Liu, Tao Dai, Shu-Tao Xia | N/A | |
| Learning Online Multi-sensor Depth Fusion | Erik Sandström, Martin R. Oswald, Suryansh Kumar, Silvan Weder, Fisher Yu, Cristian Sminchisescu, Luc Van Gool | N/A | |
| BungeeNeRF: Progressive Neural Radiance Field for Extreme Multi-Scale Scene Rendering | Yuanbo Xiangli, Linning Xu, Xingang Pan, Nanxuan Zhao, Anyi Rao, Christian Theobalt, Bo Dai, Dahua Lin | N/A | |
| Decomposing the Tangent of Occluding Boundaries according to Curvatures and Torsions | Huizong Yang, Anthony Yezzi | N/A | |
| NeuRIS: Neural Reconstruction of Indoor Scenes Using Normal Priors | Jiepeng Wang, Peng Wang, Xiaoxiao Long, Christian Theobalt, Taku Komura, Lingjie Liu, Wenping Wang | N/A | |
| Generalizable Patch-Based Neural Rendering | Mohammed Suhail, Carlos Esteves, Leonid Sigal, Ameesh Makadia | N/A | |
| Improving RGB-D Point Cloud Registration by Learning Multi-Scale Local Linear Transformation | Ziming Wang, Xiaoliang Huo, Zhenghao Chen, Jing Zhang, Lu Sheng, Dong Xu | N/A | |
| Real-Time Neural Character Rendering with Pose-Guided Multiplane Images | Hao Ouyang, Bo Zhang, Pan Zhang, Hao Yang, Jiaolong Yang, Dong Chen, Qifeng Chen, Fang Wen | N/A | |
| SparseNeuS: Fast Generalizable Neural Surface Reconstruction from Sparse Views | Xiaoxiao Long, Cheng Lin, Peng Wang, Taku Komura, Wenping Wang | N/A | |
| Disentangling Object Motion and Occlusion for Unsupervised Multi-Frame Monocular Depth | Ziyue Feng, Liang Yang, Longlong Jing, Haiyan Wang, YingLi Tian, Bing Li | N/A | |
| Depth Field Networks for Generalizable Multi-View Scene Representation | Vitor Guizilini, Igor Vasiljevic, Jiading Fang, Rareș Ambruș, Greg Shakhnarovich, Matthew R. Walter, Adrien Gaidon | N/A | |
| Context-Enhanced Stereo Transformer | Weiyu Guo, Zhaoshuo Li, Yongkui Yang, Zheng Wang, Russell H. Taylor, Mathias Unberath, Alan Yuille, Yingwei Li | N/A | |
| PCW-Net: Pyramid Combination and Warping Cost Volume for Stereo Matching | Zhelun Shen, Yuchao Dai, Xibin Song, Zhibo Rao, Dingfu Zhou, Liangjun Zhang | N/A | |
| Gen6D: Generalizable Model-Free 6-DoF Object Pose Estimation from RGB Images | Yuan Liu, Yilin Wen, Sida Peng, Cheng Lin, Xiaoxiao Long, Taku Komura, Wenping Wang | N/A | |
| Latency-Aware Collaborative Perception | Zixing Lei, Shunli Ren, Yue Hu, Wenjun Zhang, Siheng Chen | N/A | |
| TensoRF: Tensorial Radiance Fields | Anpei Chen, Zexiang Xu, Andreas Geiger, Jingyi Yu, Hao Su | N/A | |
| NeFSAC: Neurally Filtered Minimal Samples | Luca Cavalli, Marc Pollefeys, Daniel Barath | N/A | |
| SNeS: Learning Probably Symmetric Neural Surfaces from Incomplete Data | Eldar Insafutdinov, Dylan Campbell, João F. Henriques, Andrea Vedaldi | N/A | |
| HDR-Plenoxels: Self-Calibrating High Dynamic Range Radiance Fields | Kim Jun-Seong, Kim Yu-Ji, Moon Ye-Bin, Tae-Hyun Oh | N/A | |
| NeuMan: Neural Human Radiance Field from a Single Video | Wei Jiang, Kwang Moo Yi, Golnoosh Samei, Oncel Tuzel, Anurag Ranjan | N/A | |
| TAVA: Template-Free Animatable Volumetric Actors | Ruilong Li, Julian Tanke, Minh Vo, Michael Zollhöfer, Jürgen Gall, Angjoo Kanazawa, Christoph Lassner | N/A | |
| EASNet: Searching Elastic and Accurate Network Architecture for Stereo Matching | Qiang Wang, Shaohuai Shi, Kaiyong Zhao, Xiaowen Chu | N/A | |
| Relative Pose from SIFT Features | Daniel Barath, Zuzana Kukelova | N/A | |
| Selection and Cross Similarity for Event-Image Deep Stereo | Hoonhee Cho, Kuk-Jin Yoon | N/A | |
| D3Net: A Unified Speaker-Listener Architecture for 3D Dense Captioning and Visual Grounding | Zhenyu Chen, Qirui Wu, Matthias Nießner, Angel X. Chang | N/A | |
| CIRCLE: Convolutional Implicit Reconstruction and Completion for Large-Scale Indoor Scene | Hao-Xiang Chen, Jiahui Huang, Tai-Jiang Mu, Shi-Min Hu | N/A | |
| ParticleSfM: Exploiting Dense Point Trajectories for Localizing Moving Cameras in the Wild | Wang Zhao, Shaohui Liu, Hengkai Guo, Wenping Wang, Yong-Jin Liu | N/A | |
| 4DContrast: Contrastive Learning with Dynamic Correspondences for 3D Scene Understanding | Yujin Chen, Matthias Nießner, Angela Dai | N/A | |
| Few ‘Zero Level Set’-Shot Learning of Shape Signed Distance Functions in Feature Space | Amine Ouasfi, Adnane Boukhayma | N/A | |
| Solution Space Analysis of Essential Matrix Based on Algebraic Error Minimization | Gaku Nakano | N/A | |
| Approximate Differentiable Rendering with Algebraic Surfaces | Leonid Keselman, Martial Hebert | N/A | |
| CoVisPose: Co-Visibility Pose Transformer for Wide-Baseline Relative Pose Estimation in 360° Indoor Panoramas | Will Hutchcroft, Yuguang Li, Ivaylo Boyadzhiev, Zhiqiang Wan, Haiyan Wang, Sing Bing Kang | N/A | |
| Affine Correspondences between Multi-Camera Systems for 6DOF Relative Pose Estimation | Banglei Guan, Ji Zhao | N/A | |
| GraphFit: Learning Multi-Scale Graph-Convolutional Representation for Point Cloud Normal Estimation | Keqiang Li, Mingyang Zhao, Huaiyu Wu, Dong-Ming Yan, Zhen Shen, Fei-Yue Wang, Gang Xiong | N/A | |
| IS-MVSNet: Importance Sampling-Based MVSNet | Likang Wang, Yue Gong, Xinjun Ma, Qirui Wang, Kaixuan Zhou, Lei Chen | N/A | |
| Point Scene Understanding via Disentangled Instance Mesh Reconstruction | Jiaxiang Tang, Xiaokang Chen, Jingbo Wang, Gang Zeng | N/A | |
| DiffuStereo: High Quality Human Reconstruction via Diffusion-Based Stereo Using Sparse Cameras | Ruizhi Shao, Zerong Zheng, Hongwen Zhang, Jingxiang Sun, Yebin Liu | N/A | |
| Space-Partitioning RANSAC | Daniel Barath, Gábor Valasek | N/A | |
| SimpleRecon: 3D Reconstruction without 3D Convolutions | Mohamed Sayed, John Gibson, Jamie Watson, Victor Prisacariu, Michael Firman, Clément Godard | N/A | |
| Structure and Motion from Casual Videos | Zhoutong Zhang, Forrester Cole, Zhengqi Li, Noah Snavely, Michael Rubinstein, William T. Freeman | N/A | |
| What Matters for 3D Scene Flow Network | Guangming Wang, Yunzhe Hu, Zhe Liu, Yiyang Zhou, Masayoshi Tomizuka, Wei Zhan, Hesheng Wang | N/A | |
| Correspondence Reweighted Translation Averaging | Lalit Manam, Venu Madhav Govindu | N/A | |
| Neural Strands: Learning Hair Geometry and Appearance from Multi-View Images | Radu Alexandru Rosu, Shunsuke Saito, Ziyan Wang, Chenglei Wu, Sven Behnke, Giljoo Nam | N/A | |
| GraphCSPN: Geometry-Aware Depth Completion via Dynamic GCNs | Xin Liu, Xiaofei Shao, Bo Wang, Yali Li, Shengjin Wang | N/A | |
| Objects Can Move: 3D Change Detection by Geometric Transformation Consistency | Aikaterini Adam, Torsten Sattler, Konstantinos Karantzalos, Tomas Pajdla | N/A | |
| Language-Grounded Indoor 3D Semantic Segmentation in the Wild | Dávid Rozenberszki, Or Litany, Angela Dai | N/A | |
| Beyond Periodicity: Towards a Unifying Framework for Activations in Coordinate-MLPs | Sameera Ramasinghe, Simon Lucey | N/A | |
| Deforming Radiance Fields with Cages | Tianhan Xu, Tatsuya Harada | N/A | |
| FLEX: Extrinsic Parameters-Free Multi-View 3D Human Motion Reconstruction | Brian Gordon, Sigal Raab, Guy Azov, Raja Giryes, Daniel Cohen-Or | N/A | |
| MODE: Multi-View Omnidirectional Depth Estimation with 360° Cameras | Ming Li, Xueqian Jin, Xuejiao Hu, Jingzhao Dai, Sidan Du, Yang Li | N/A | |
| GigaDepth: Learning Depth from Structured Light with Branching Neural Networks | Simon Schreiberhuber, Jean-Baptiste Weibel, Timothy Patten, Markus Vincze | N/A | |
| ActiveNeRF: Learning Where to See with Uncertainty Estimation | Xuran Pan, Zihang Lai, Shiji Song, Gao Huang | N/A | |
| PoserNet: Refining Relative Camera Poses Exploiting Object Detections | Matteo Taiana, Matteo Toso, Stuart James, Alessio Del Bue | N/A | |
| Gaussian Activated Neural Radiance Fields for High Fidelity Reconstruction & Pose Estimation | Shin-Fang Chng, Sameera Ramasinghe, Jamie Sherrah, Simon Lucey | N/A | |
| Unbiased Gradient Estimation for Differentiable Surface Splatting via Poisson Sampling | Jan U. Müller, Michael Weinmann, Reinhard Klein | N/A | |
| Towards Learning Neural Representations from Shadows | Kushagra Tiwary, Tzofi Klinghoffer, Ramesh Raskar | N/A | |
| Class-Incremental Novel Class Discovery | Subhankar Roy, Mingxuan Liu, Zhun Zhong, Nicu Sebe, Elisa Ricci | N/A | |
| Unknown-Oriented Learning for Open Set Domain Adaptation | Jie Liu, Xiaoqing Guo, Yixuan Yuan | N/A | |
| Prototype-Guided Continual Adaptation for Class-Incremental Unsupervised Domain Adaptation | Hongbin Lin, Yifan Zhang, Zhen Qiu, Shuaicheng Niu, Chuang Gan, Yanxia Liu, Mingkui Tan | N/A | |
| DecoupleNet: Decoupled Network for Domain Adaptive Semantic Segmentation | Xin Lai, Zhuotao Tian, Xiaogang Xu, Yingcong Chen, Shu Liu, Hengshuang Zhao, Liwei Wang, Jiaya Jia | N/A | |
| Class-Agnostic Object Counting Robust to Intraclass Diversity | Shenjian Gong, Shanshan Zhang, Jian Yang, Dengxin Dai, Bernt Schiele | N/A | |
| Burn after Reading: Online Adaptation for Cross-Domain Streaming Data | Luyu Yang, Mingfei Gao, Zeyuan Chen, Ran Xu, Abhinav Shrivastava, Chetan Ramaiah | N/A | |
| Mind the Gap in Distilling StyleGANs | Guodong Xu, Yuenan Hou, Ziwei Liu, Chen Change Loy | N/A | |
| Improving Test-Time Adaptation via Shift-Agnostic Weight Regularization and Nearest Source Prototypes | Sungha Choi, Seunghan Yang, Seokeon Choi, Sungrack Yun | N/A | |
| Learning Instance-Specific Adaptation for Cross-Domain Segmentation | Yuliang Zou, Zizhao Zhang, Chun-Liang Li, Han Zhang, Tomas Pfister, Jia-Bin Huang | N/A | |
| RegionCL: Exploring Contrastive Region Pairs for Self-Supervised Representation Learning | Yufei Xu, Qiming Zhang, Jing Zhang, Dacheng Tao | N/A | |
| Long-Tailed Class Incremental Learning | Xialei Liu, Yu-Song Hu, Xu-Sheng Cao, Andrew D. Bagdanov, Ke Li, Ming-Ming Cheng | N/A | |
| DLCFT: Deep Linear Continual Fine-Tuning for General Incremental Learning | Hyounguk Shon, Janghyeon Lee, Seung Hwan Kim, Junmo Kim | N/A | |
| Adversarial Partial Domain Adaptation by Cycle Inconsistency | Kun-Yu Lin, Jiaming Zhou, Yukun Qiu, Wei-Shi Zheng | N/A | |
| Combating Label Distribution Shift for Active Domain Adaptation | Sehyun Hwang, Sohyun Lee, Sungyeon Kim, Jungseul Ok, Suha Kwak | N/A | |
| GIPSO: Geometrically Informed Propagation for Online Adaptation in 3D LiDAR Segmentation | Cristiano Saltori, Evgeny Krivosheev, Stéphane Lathuilière, Nicu Sebe, Fabio Galasso, Giuseppe Fiameni, Elisa Ricci, Fabio Poiesi | N/A | |
| CoSMix: Compositional Semantic Mix for Domain Adaptation in 3D LiDAR Segmentation | Cristiano Saltori, Fabio Galasso, Giuseppe Fiameni, Nicu Sebe, Elisa Ricci, Fabio Poiesi | N/A | |
| A Unified Framework for Domain Adaptive Pose Estimation | Donghyun Kim, Kaihong Wang, Kate Saenko, Margrit Betke, Stan Sclaroff | N/A | |
| A Broad Study of Pre-training for Domain Generalization and Adaptation | Donghyun Kim, Kaihong Wang, Stan Sclaroff, Kate Saenko | N/A | |
| Prior Knowledge Guided Unsupervised Domain Adaptation | Tao Sun, Cheng Lu, Haibin Ling | N/A | |
| GCISG: Guided Causal Invariant Learning for Improved Syn-to-Real Generalization | Gilhyun Nam, Gyeongjae Choi, Kyungmin Lee | N/A | |
| AcroFOD: An Adaptive Method for Cross-Domain Few-Shot Object Detection | Yipeng Gao, Lingxiao Yang, Yunmu Huang, Song Xie, Shiyong Li, Wei-Shi Zheng | N/A | |
| Unsupervised Domain Adaptation for One-Stage Object Detector Using Offsets to Bounding Box | Jayeon Yoo, Inseop Chung, Nojun Kwak | N/A | |
| Visual Prompt Tuning | Menglin Jia, Luming Tang, Bor-Chun Chen, Claire Cardie, Serge Belongie, Bharath Hariharan, Ser-Nam Lim | N/A | |
| Quasi-Balanced Self-Training on Noise-Aware Synthesis of Object Point Clouds for Closing Domain Gap | Yongwei Chen, Zihao Wang, Longkun Zou, Ke Chen, Kui Jia | N/A | |
| Interpretable Open-Set Domain Adaptation via Angular Margin Separation | Xinhao Li, Jingjing Li, Zhekai Du, Lei Zhu, Wen Li | N/A | |
| TACS: Taxonomy Adaptive Cross-Domain Semantic Segmentation | Rui Gong, Martin Danelljan, Dengxin Dai, Danda Pani Paudel, Ajad Chhatkuli, Fisher Yu, Luc Van Gool | N/A | |
| Prototypical Contrast Adaptation for Domain Adaptive Semantic Segmentation | Zhengkai Jiang, Yuxi Li, Ceyuan Yang, Peng Gao, Yabiao Wang, Ying Tai, Chengjie Wang | N/A | |
| RBC: Rectifying the Biased Context in Continual Semantic Segmentation | Hanbin Zhao, Fengyu Yang, Xinghe Fu, Xi Li | N/A | |
| Factorizing Knowledge in Neural Networks | Xingyi Yang, Jingwen Ye, Xinchao Wang | N/A | |
| Contrastive Vicinal Space for Unsupervised Domain Adaptation | Jaemin Na, Dongyoon Han, Hyung Jin Chang, Wonjun Hwang | N/A | |
| Cross-Modal Knowledge Transfer without Task-Relevant Source Data | Sk Miraj Ahmed, Suhas Lohit, Kuan-Chuan Peng, Michael J. Jones, Amit K. Roy-Chowdhury | N/A | |
| Online Domain Adaptation for Semantic Segmentation in Ever-Changing Conditions | Theodoros Panagiotakopoulos, Pier Luigi Dovesi, Linus Härenstam-Nielsen, Matteo Poggi | N/A | |
| Source-Free Video Domain Adaptation by Learning Temporal Consistency for Action Recognition | Yuecong Xu, Jianfei Yang, Haozhi Cao, Keyu Wu, Min Wu, Zhenghua Chen | N/A | |
| BMD: A General Class-Balanced Multicentric Dynamic Prototype Strategy for Source-Free Domain Adaptation | Sanqing Qu, Guang Chen, Jing Zhang, Zhijun Li, Wei He, Dacheng Tao | N/A | |
| Generalized Brain Image Synthesis with Transferable Convolutional Sparse Coding Networks | Yawen Huang, Feng Zheng, Xu Sun, Yuexiang Li, Ling Shao, Yefeng Zheng | N/A | |
| Incomplete Multi-View Domain Adaptation via Channel Enhancement and Knowledge Transfer | Haifeng Xia, Pu Wang, Zhengming Ding | N/A | |
| DistPro: Searching a Fast Knowledge Distillation Process via Meta Optimization | Xueqing Deng, Dawei Sun, Shawn Newsam, Peng Wang | N/A | |
| ML-BPM: Multi-Teacher Learning with Bidirectional Photometric Mixing for Open Compound Domain Adaptation in Semantic Segmentation | Fei Pan, Sungsu Hur, Seokju Lee, Junsik Kim, In So Kweon | N/A | |
| PACTran: PAC-Bayesian Metrics for Estimating the Transferability of Pretrained Models to Classification Tasks | Nan Ding, Xi Chen, Tomer Levinboim, Soravit Changpinyo, Radu Soricut | N/A | |
| Personalized Education: Blind Knowledge Distillation | Xiang Deng, Jian Zheng, Zhongfei Zhang | N/A | |
| Not All Models Are Equal: Predicting Model Transferability in a Self-Challenging Fisher Space | Wenqi Shao, Xun Zhao, Yixiao Ge, Zhaoyang Zhang, Lei Yang, Xiaogang Wang, Ying Shan, Ping Luo | N/A | |
| How Stable Are Transferability Metrics Evaluations? | Andrea Agostinelli, Michal Pándy, Jasper Uijlings, Thomas Mensink, Vittorio Ferrari | N/A | |
| Attention Diversification for Domain Generalization | Rang Meng, Xianfeng Li, Weijie Chen, Shicai Yang, Jie Song, Xinchao Wang, Lei Zhang, Mingli Song, Di Xie, Shiliang Pu | N/A | |
| ESS: Learning Event-Based Semantic Segmentation from Still Images | Zhaoning Sun, Nico Messikommer, Daniel Gehrig, Davide Scaramuzza | N/A | |
| An Efficient Spatio-Temporal Pyramid Transformer for Action Detection | Yuetian Weng, Zizheng Pan, Mingfei Han, Xiaojun Chang, Bohan Zhuang | N/A | |
| Human Trajectory Prediction via Neural Social Physics | Jiangbei Yue, Dinesh Manocha, He Wang | N/A | |
| Towards Open Set Video Anomaly Detection | Yuansheng Zhu, Wentao Bao, Qi Yu | N/A | |
| ECLIPSE: Efficient Long-Range Video Retrieval Using Sight and Sound | Yan-Bo Lin, Jie Lei, Mohit Bansal, Gedas Bertasius | N/A | |
| Joint-Modal Label Denoising for Weakly-Supervised Audio-Visual Video Parsing | Haoyue Cheng, Zhaoyang Liu, Hang Zhou, Chen Qian, Wayne Wu, Limin Wang | N/A | |
| Less than Few: Self-Shot Video Instance Segmentation | Pengwan Yang, Yuki M. Asano, Pascal Mettes, Cees G. M. Snoek | N/A | |
| Adaptive Face Forgery Detection in Cross Domain | Luchuan Song, Zheng Fang, Xiaodan Li, Xiaoyi Dong, Zhenchao Jin, Yuefeng Chen, Siwei Lyu | N/A | |
| Real-Time Online Video Detection with Temporal Smoothing Transformers | Yue Zhao, Philipp Krähenbühl | N/A | |
| TALLFormer: Temporal Action Localization with a Long-Memory Transformer | Feng Cheng, Gedas Bertasius | N/A | |
| Mining Relations among Cross-Frame Affinities for Video Semantic Segmentation | Guolei Sun, Yun Liu, Hao Tang, Ajad Chhatkuli, Le Zhang, Luc Van Gool | N/A | |
| TL;DW? Summarizing Instructional Videos with Task Relevance & Cross-Modal Saliency | Medhini Narasimhan, Arsha Nagrani, Chen Sun, Michael Rubinstein, Trevor Darrell, Anna Rohrbach, Cordelia Schmid | N/A | |
| Rethinking Learning Approaches for Long-Term Action Anticipation | Megha Nawhal, Akash Abdu Jyothi, Greg Mori | N/A | |
| DualFormer: Local-Global Stratified Transformer for Efficient Video Recognition | Yuxuan Liang, Pan Zhou, Roger Zimmermann, Shuicheng Yan | N/A | |
| Hierarchical Feature Alignment Network for Unsupervised Video Object Segmentation | Gensheng Pei, Fumin Shen, Yazhou Yao, Guo-Sen Xie, Zhenmin Tang, Jinhui Tang | N/A | |
| PAC-Net: Highlight Your Video via History Preference Modeling | Hang Wang, Penghao Zhou, Chong Zhou, Zhao Zhang, Xing Sun | N/A | |
| How Severe Is Benchmark-Sensitivity in Video Self-Supervised Learning? | Fida Mohammad Thoker, Hazel Doughty, Piyush Bagad, Cees G. M. Snoek | N/A | |
| A Sliding Window Scheme for Online Temporal Action Localization | Young Hwi Kim, Hyolim Kang, Seon Joo Kim | N/A | |
| ERA: Expert Retrieval and Assembly for Early Action Prediction | Lin Geng Foo, Tianjiao Li, Hossein Rahmani, Qiuhong Ke, Jun Liu | N/A | |
| Dual Perspective Network for Audio-Visual Event Localization | Varshanth Rao, Md Ibrahim Khalil, Haoda Li, Peng Dai, Juwei Lu | N/A | |
| NSNet: Non-Saliency Suppression Sampler for Efficient Video Recognition | Boyang Xia, Wenhao Wu, Haoran Wang, Rui Su, Dongliang He, Haosen Yang, Xiaoran Fan, Wanli Ouyang | N/A | |
| Video Activity Localisation with Uncertainties in Temporal Boundary | Jiabo Huang, Hailin Jin, Shaogang Gong, Yang Liu | N/A | |
| Temporal Saliency Query Network for Efficient Video Recognition | Boyang Xia, Zhihao Wang, Wenhao Wu, Haoran Wang, Jungong Han | N/A | |
| Efficient One-Stage Video Object Detection by Exploiting Temporal Consistency | Guanxiong Sun, Yang Hua, Guosheng Hu, Neil Robertson | N/A | |
| Leveraging Action Affinity and Continuity for Semi-Supervised Temporal Action Segmentation | Guodong Ding, Angela Yao | N/A | |
| Spotting Temporally Precise, Fine-Grained Events in Video | James Hong, Haotian Zhang, Michaël Gharbi, Matthew Fisher, Kayvon Fatahalian | N/A | |
| Unified Fully and Timestamp Supervised Temporal Action Segmentation via Sequence to Sequence Translation | Nadine Behrmann, S. Alireza Golestaneh, Zico Kolter, Jürgen Gall, Mehdi Noroozi | N/A | |
| Efficient Video Transformers with Spatial-Temporal Token Selection | Junke Wang, Xitong Yang, Hengduo Li, Li Liu, Zuxuan Wu, Yu-Gang Jiang | N/A | |
| Long Movie Clip Classification with State-Space Video Models | Md Mohaiminul Islam, Gedas Bertasius | N/A | |
| Prompting Visual-Language Models for Efficient Video Understanding | Chen Ju, Tengda Han, Kunhao Zheng, Ya Zhang, Weidi Xie | N/A | |
| Asymmetric Relation Consistency Reasoning for Video Relation Grounding | Huan Li, Ping Wei, Jiapeng Li, Zeyu Ma, Jiahui Shang, Nanning Zheng | N/A | |
| Self-Supervised Social Relation Representation for Human Group Detection | Jiacheng Li, Ruize Han, Haomin Yan, Zekun Qian, Wei Feng, Song Wang | N/A | |
| K-Centered Patch Sampling for Efficient Video Recognition | Seong Hyeon Park, Jihoon Tack, Byeongho Heo, Jung-Woo Ha, Jinwoo Shin | N/A | |
| A Deep Moving-Camera Background Model | Guy Erez, Ron Shapira Weber, Oren Freifeld | N/A | |
| GraphVid: It Only Takes a Few Nodes to Understand a Video | Eitan Kosman, Dotan Di Castro | N/A | |
| Delta Distillation for Efficient Video Processing | Amirhossein Habibian, Haitam Ben Yahia, Davide Abati, Efstratios Gavves, Fatih Porikli | N/A | |
| MorphMLP: An Efficient MLP-Like Backbone for Spatial-Temporal Representation Learning | David Junhao Zhang, Kunchang Li, Yali Wang, Yunpeng Chen, Shashwat Chandra, Yu Qiao, Luoqi Liu, Mike Zheng Shou | N/A | |
| COMPOSER: Compositional Reasoning of Group Activity in Videos with Keypoint-Only Modality | Honglu Zhou, Asim Kadav, Aviv Shamsian, Shijie Geng, Farley Lai, Long Zhao, Ting Liu, Mubbasir Kapadia, Hans Peter Graf | N/A | |
| E-NeRV: Expedite Neural Video Representation with Disentangled Spatial-Temporal Context | Zizhang Li, Mengmeng Wang, Huaijin Pi, Kechun Xu, Jianbiao Mei, Yong Liu | N/A | |
| TDViT: Temporal Dilated Video Transformer for Dense Video Tasks | Guanxiong Sun, Yang Hua, Guosheng Hu, Neil Robertson | N/A | |
| Semi-Supervised Learning of Optical Flow by Flow Supervisor | Woobin Im, Sebin Lee, Sung-Eui Yoon | N/A | |
| Flow Graph to Video Grounding for Weakly-Supervised Multi-step Localization | Nikita Dvornik, Isma Hadji, Hai Pham, Dhaivat Bhatt, Brais Martinez, Afsaneh Fazly, Allan D. Jepson | N/A | |
| Deep 360° Optical Flow Estimation Based on Multi-Projection Fusion | Yiheng Li, Connelly Barnes, Kun Huang, Fang-Lue Zhang | N/A | |
| MaCLR: Motion-Aware Contrastive Learning of Representations for Videos | Fanyi Xiao, Joseph Tighe, Davide Modolo | N/A | |
| Learning Long-Term Spatial-Temporal Graphs for Active Speaker Detection | Kyle Min, Sourya Roy, Subarna Tripathi, Tanaya Guha, Somdeb Majumdar | N/A | |
| Frozen CLIP Models Are Efficient Video Learners | Ziyi Lin, Shijie Geng, Renrui Zhang, Peng Gao, Gerard de Melo, Xiaogang Wang, Jifeng Dai, Yu Qiao, Hongsheng Li | N/A | |
| PIP: Physical Interaction Prediction via Mental Simulation with Span Selection | Jiafei Duan, Samson Yu, Soujanya Poria, Bihan Wen, Cheston Tan | N/A | |
| Panoramic Vision Transformer for Saliency Detection in 360° Videos | Heeseung Yun, Sehun Lee, Gunhee Kim | N/A | |
| Bayesian Tracking of Video Graphs Using Joint Kalman Smoothing and Registration | Aditi Basu Bal, Ramy Mounir, Sathyanarayanan Aakur, Sudeep Sarkar, Anuj Srivastava | N/A | |
| Motion Sensitive Contrastive Learning for Self-Supervised Video Representation | Jingcheng Ni, Nan Zhou, Jie Qin, Qian Wu, Junqi Liu, Boxun Li, Di Huang | N/A | |
| Dynamic Temporal Filtering in Video Models | Fuchen Long, Zhaofan Qiu, Yingwei Pan, Ting Yao, Chong-Wah Ngo, Tao Mei | N/A | |
| Tip-Adapter: Training-Free Adaption of CLIP for Few-Shot Classification | Renrui Zhang, Wei Zhang, Rongyao Fang, Peng Gao, Kunchang Li, Jifeng Dai, Yu Qiao, Hongsheng Li | N/A | |
| Temporal Lift Pooling for Continuous Sign Language Recognition | Lianyu Hu, Liqing Gao, Zekang Liu, Wei Feng | N/A | |
| MORE: Multi-Order RElation Mining for Dense Captioning in 3D Scenes | Yang Jiao, Shaoxiang Chen, Zequn Jie, Jingjing Chen, Lin Ma, Yu-Gang Jiang | N/A | |
| SiRi: A Simple Selective Retraining Mechanism for Transformer-Based Visual Grounding | Mengxue Qu, Yu Wu, Wu Liu, Qiqi Gong, Xiaodan Liang, Olga Russakovsky, Yao Zhao, Yunchao Wei | N/A | |
| Cross-Modal Prototype Driven Network for Radiology Report Generation | Jun Wang, Abhir Bhalerao, Yulan He | N/A | |
| TM2T: Stochastic and Tokenized Modeling for the Reciprocal Generation of 3D Human Motions and Texts | Chuan Guo, Xinxin Zuo, Sen Wang, Li Cheng | N/A | |
| SeqTR: A Simple Yet Universal Network for Visual Grounding | Chaoyang Zhu, Yiyi Zhou, Yunhang Shen, Gen Luo, Xingjia Pan, Mingbao Lin, Chao Chen, Liujuan Cao, Xiaoshuai Sun, Rongrong Ji | N/A | |
| VTC: Improving Video-Text Retrieval with User Comments | Laura Hanu, James Thewlis, Yuki M. Asano, Christian Rupprecht | N/A | |
| FashionViL: Fashion-Focused Vision-and-Language Representation Learning | Xiao Han, Licheng Yu, Xiatian Zhu, Li Zhang, Yi-Zhe Song, Tao Xiang | N/A | |
| Weakly Supervised Grounding for VQA in Vision-Language Transformers | Aisha Urooj, Hilde Kuehne, Chuang Gan, Niels Da Vitoria Lobo, Mubarak Shah | N/A | |
| Automatic Dense Annotation of Large-Vocabulary Sign Language Videos | Liliane Momeni, Hannah Bull, K R Prajwal, Samuel Albanie, Gül Varol, Andrew Zisserman | N/A | |
| MILES: Visual BERT Pre-training with Injected Language Semantics for Video-Text Retrieval | Yuying Ge, Yixiao Ge, Xihui Liu, Jinpeng Wang, Jianping Wu, Ying Shan, Xiaohu Qie, Ping Luo | N/A | |
| GEB+: A Benchmark for Generic Event Boundary Captioning, Grounding and Retrieval | Yuxuan Wang, Difei Gao, Licheng Yu, Weixian Lei, Matt Feiszli, Mike Zheng Shou | N/A | |
| A Simple and Robust Correlation Filtering Method for Text-Based Person Search | Wei Suo, Mengyang Sun, Kai Niu, Yiqi Gao, Peng Wang, Yanning Zhang, Qi Wu | N/A | |
| Making the Most of Text Semantics to Improve Biomedical Vision-Language Processing | Benedikt Boecking, Naoto Usuyama, Shruthi Bannur, Daniel C. Castro, Anton Schwaighofer, Stephanie Hyland, Maria Wetscherek, Tristan Naumann, Aditya Nori, Javier Alvarez-Valle, Hoifung Poon, Ozan Oktay | N/A | |
| Generative Negative Text Replay for Continual Vision-Language Pretraining | Shipeng Yan, Lanqing Hong, Hang Xu, Jianhua Han, Tinne Tuytelaars, Zhenguo Li, Xuming He | N/A | |
| Video Graph Transformer for Video Question Answering | Junbin Xiao, Pan Zhou, Tat-Seng Chua, Shuicheng Yan | N/A | |
| Trace Controlled Text to Image Generation | Kun Yan, Lei Ji, Chenfei Wu, Jianmin Bao, Ming Zhou, Nan Duan, Shuai Ma | N/A | |
| Video Question Answering with Iterative Video-Text Co-Tokenization | AJ Piergiovanni, Kairo Morton, Weicheng Kuo, Michael S. Ryoo, Anelia Angelova | N/A | |
| Rethinking Data Augmentation for Robust Visual Question Answering | Long Chen, Yuhang Zheng, Jun Xiao | N/A | |
| Explicit Image Caption Editing | Zhen Wang, Long Chen, Wenbo Ma, Guangxing Han, Yulei Niu, Jian Shao, Jun Xiao | N/A | |
| Can Shuffling Video Benefit Temporal Bias Problem: A Novel Training Framework for Temporal Grounding | Jiachang Hao, Haifeng Sun, Pengfei Ren, Jingyu Wang, Qi Qi, Jianxin Liao | N/A | |
| Reliable Visual Question Answering: Abstain Rather Than Answer Incorrectly | Spencer Whitehead, Suzanne Petryk, Vedaad Shakib, Joseph Gonzalez, Trevor Darrell, Anna Rohrbach, Marcus Rohrbach | N/A | |
| GRIT: Faster and Better Image Captioning Transformer Using Dual Visual Features | Van-Quang Nguyen, Masanori Suganuma, Takayuki Okatani | N/A | |
| Selective Query-Guided Debiasing for Video Corpus Moment Retrieval | Sunjae Yoon, Ji Woo Hong, Eunseop Yoon, Dahyun Kim, Junyeong Kim, Hee Suk Yoon, Chang D. Yoo | N/A | |
| Spatial and Visual Perspective-Taking via View Rotation and Relation Reasoning for Embodied Reference Understanding | Cheng Shi, Sibei Yang | N/A | |
| Object-Centric Unsupervised Image Captioning | Zihang Meng, David Yang, Xuefei Cao, Ashish Shah, Ser-Nam Lim | N/A | |
| Contrastive Vision-Language Pre-training with Limited Resources | Quan Cui, Boyan Zhou, Yu Guo, Weidong Yin, Hao Wu, Osamu Yoshie, Yubo Chen | N/A | |
| Learning Linguistic Association towards Efficient Text-Video Retrieval | Sheng Fang, Shuhui Wang, Junbao Zhuo, Xinzhe Han, Qingming Huang | N/A | |
| ASSISTER: Assistive Navigation via Conditional Instruction Generation | Zanming Huang, Zhongkai Shangguan, Jimuyang Zhang, Gilad Bar, Matthew Boyd, Eshed Ohn-Bar | N/A | |
| X-DETR: A Versatile Architecture for Instance-Wise Vision-Language Tasks | Zhaowei Cai, Gukyeong Kwon, Avinash Ravichandran, Erhan Bas, Zhuowen Tu, Rahul Bhotika, Stefano Soatto | N/A | |
| Learning Disentanglement with Decoupled Labels for Vision-Language Navigation | Wenhao Cheng, Xingping Dong, Salman Khan, Jianbing Shen | N/A | |
| Switch-BERT: Learning to Model Multimodal Interactions by Switching Attention and Input | Qingpei Guo, Kaisheng Yao, Wei Chu | N/A | |
| Word-Level Fine-Grained Story Visualization | Bowen Li | N/A | |
| Unifying Event Detection and Captioning as Sequence Generation via Pre-training | Qi Zhang, Yuqing Song, Qin Jin | N/A | |
| Multimodal Transformer with Variable-Length Memory for Vision-and-Language Navigation | Chuang Lin, Yi Jiang, Jianfei Cai, Lizhen Qu, Gholamreza Haffari, Zehuan Yuan | N/A | |
| Fine-Grained Visual Entailment | Christopher Thomas, Yipeng Zhang, Shih-Fu Chang | N/A | |
| Bottom Up Top Down Detection Transformers for Language Grounding in Images and Point Clouds | Ayush Jain, Nikolaos Gkanatsios, Ishita Mediratta, Katerina Fragkiadaki | N/A | |
| New Datasets and Models for Contextual Reasoning in Visual Dialog | Yifeng Zhang, Ming Jiang, Qi Zhao | N/A | |
| VisageSynTalk: Unseen Speaker Video-to-Speech Synthesis via Speech-Visage Feature Selection | Joanna Hong, Minsu Kim, Yong Man Ro | N/A | |
| Classification-Regression for Chart Comprehension | Matan Levy, Rami Ben-Ari, Dani Lischinski | N/A | |
| AssistQ: Affordance-Centric Question-Driven Task Completion for Egocentric Assistant | Benita Wong, Joya Chen, You Wu, Stan Weixian Lei, Dongxing Mao, Difei Gao, Mike Zheng Shou | N/A | |
| FindIt: Generalized Localization with Natural Language Queries | Weicheng Kuo, Fred Bertsch, Wei Li, AJ Piergiovanni, Mohammad Saffar, Anelia Angelova | N/A | |
| UniTAB: Unifying Text and Box Outputs for Grounded Vision-Language Modeling | Zhengyuan Yang, Zhe Gan, Jianfeng Wang, Xiaowei Hu, Faisal Ahmed, Zicheng Liu, Yumao Lu, Lijuan Wang | N/A | |
| Scaling Open-Vocabulary Image Segmentation with Image-Level Labels | Golnaz Ghiasi, Xiuye Gu, Yin Cui, Tsung-Yi Lin | N/A | |
| The Abduction of Sherlock Holmes: A Dataset for Visual Abductive Reasoning | Jack Hessel, Jena D. Hwang, Jae Sung Park, Rowan Zellers, Chandra Bhagavatula, Anna Rohrbach, Kate Saenko, Yejin Choi | N/A | |
| Speaker-Adaptive Lip Reading with User-Dependent Padding | Minsu Kim, Hyunjun Kim, Yong Man Ro | N/A | |
| TISE: Bag of Metrics for Text-to-Image Synthesis Evaluation | Tan M. Dinh, Rang Nguyen, Binh-Son Hua | N/A | |
| SemAug: Semantically Meaningful Image Augmentations for Object Detection through Language Grounding | Morgan Heisler, Amin Banitalebi-Dehkordi, Yong Zhang | N/A | |
| Referring Object Manipulation of Natural Images with Conditional Classifier-Free Guidance | Myungsub Choi | N/A | |
| NewsStories: Illustrating Articles with Visual Summaries | Reuben Tan, Bryan A. Plummer, Kate Saenko, JP Lewis, Avneesh Sud, Thomas Leung | N/A | |
| Webly Supervised Concept Expansion for General Purpose Vision Models | Amita Kamath, Christopher Clark, Tanmay Gupta, Eric Kolve, Derek Hoiem, Aniruddha Kembhavi | N/A | |
| FedVLN: Privacy-Preserving Federated Vision-and-Language Navigation | Kaiwen Zhou, Xin Eric Wang | N/A | |
| CODER: Coupled Diversity-Sensitive Momentum Contrastive Learning for Image-Text Retrieval | Haoran Wang, Dongliang He, Wenhao Wu, Boyang Xia, Min Yang, Fu Li, Yunlong Yu, Zhong Ji, Errui Ding, Jingdong Wang | N/A | |
| Language-Driven Artistic Style Transfer | Tsu-Jui Fu, Xin Eric Wang, William Yang Wang | N/A | |
| Single-Stream Multi-level Alignment for Vision-Language Pretraining | Zaid Khan, Vijay Kumar B G, Xiang Yu, Samuel Schulter, Manmohan Chandraker, Yun Fu | N/A | |
| Most and Least Retrievable Images in Visual-Language Query Systems | Liuwan Zhu, Rui Ning, Jiang Li, Chunsheng Xin, Hongyi Wu | N/A | |
| Sports Video Analysis on Large-Scale Data | Dekun Wu, He Zhao, Xingce Bao, Richard P. Wildes | N/A | |
| Grounding Visual Representations with Texts for Domain Generalization | Seonwoo Min, Nokyung Park, Siwon Kim, Seunghyun Park, Jinkyu Kim | N/A | |
| Bridging the Visual Semantic Gap in VLN via Semantically Richer Instructions | Joaquín Ossandón, Benjamín Earle, Alvaro Soto | N/A | |
| StoryDALL-E: Adapting Pretrained Text-to-Image Transformers for Story Continuation | Adyasha Maharana, Darryl Hannan, Mohit Bansal | N/A | |
| VQGAN-CLIP: Open Domain Image Generation and Editing with Natural Language Guidance | Katherine Crowson, Stella Biderman, Daniel Kornis, Dashiell Stander, Eric Hallahan, Louis Castricato, Edward Raff | N/A | |
| Semantic-Aware Implicit Neural Audio-Driven Video Portrait Generation | Xian Liu, Yinghao Xu, Qianyi Wu, Hang Zhou, Wayne Wu, Bolei Zhou | N/A | |
| End-to-End Active Speaker Detection | Juan León Alcázar, Moritz Cordes, Chen Zhao, Bernard Ghanem | N/A | |
| Emotion Recognition for Multiple Context Awareness | Dingkang Yang, Shuai Huang, Shunli Wang, Yang Liu, Peng Zhai, Liuzhen Su, Mingcheng Li, Lihua Zhang | N/A | |
| Adaptive Fine-Grained Sketch-Based Image Retrieval | Ayan Kumar Bhunia, Aneeshan Sain, Parth Hiren Shah, Animesh Gupta, Pinaki Nath Chowdhury, Tao Xiang, Yi-Zhe Song | N/A | |
| Quantized GAN for Complex Music Generation from Dance Videos | Ye Zhu, Kyle Olszewski, Yu Wu, Panos Achlioptas, Menglei Chai, Yan Yan, Sergey Tulyakov | N/A | |
| Uncertainty-Aware Multi-modal Learning via Cross-Modal Random Network Prediction | Hu Wang, Jianpeng Zhang, Yuanhong Chen, Congbo Ma, Jodie Avery, Louise Hull, Gustavo Carneiro | N/A | |
| Localizing Visual Sounds the Easy Way | Shentong Mo, Pedro Morgado | N/A | |
| Learning Visual Styles from Audio-Visual Associations | Tingle Li, Yichen Liu, Andrew Owens, Hang Zhao | N/A | |
| Remote Respiration Monitoring of Moving Person Using Radio Signals | Jae-Ho Choi, Ki-Bong Kang, Kyung-Tae Kim | N/A | |
| Camera Pose Estimation and Localization with Active Audio Sensing | Karren Yang, Michael Firman, Eric Brachmann, Clément Godard | N/A | |
| PACS: A Dataset for Physical Audiovisual Commonsense Reasoning | Samuel Yu, Peter Wu, Paul Pu Liang, Ruslan Salakhutdinov, Louis-Philippe Morency | N/A | |
| VoViT: Low Latency Graph-Based Audio-Visual Voice Separation Transformer | Juan F. Montesinos, Venkatesh S. Kadandale, Gloria Haro | N/A | |
| Telepresence Video Quality Assessment | Zhenqiang Ying, Deepti Ghadiyaram, Alan Bovik | N/A | |
| MultiMAE: Multi-modal Multi-task Masked Autoencoders | Roman Bachmann, David Mizrahi, Andrei Atanov, Amir Zamir | N/A | |
| AudioScopeV2: Audio-Visual Attention Architectures for Calibrated Open-Domain On-Screen Sound Separation | Efthymios Tzinis, Scott Wisdom, Tal Remez, John R. Hershey | N/A | |
| Audio-Visual Segmentation | Jinxing Zhou, Jianyuan Wang, Jiayi Zhang, Weixuan Sun, Jing Zhang, Stan Birchfield, Dan Guo, Lingpeng Kong, Meng Wang, Yiran Zhong | N/A | |
| Unsupervised Night Image Enhancement: When Layer Decomposition Meets Light-Effects Suppression | Yeying Jin, Wenhan Yang, Robby T. Tan | N/A | |
| Relationformer: A Unified Framework for Image-to-Graph Generation | Suprosanna Shit, Rajat Koner, Bastian Wittmann, Johannes Paetzold, Ivan Ezhov, Hongwei Li, Jiazhen Pan, Sahand Sharifzadeh, Georgios Kaissis, Volker Tresp, Bjoern Menze | N/A | |
| GAMa: Cross-view Video Geo-localization | Shruti Vyas, Chen Chen, Mubarak Shah | N/A | |
| Revisiting a kNN-based Image Classification System with High-capacity Storage | Kengo Nakata, Youyang Ng, Daisuke Miyashita, Asuka Maki, Yu-Chieh Lin, Jun Deguchi | N/A | |
| Geometric Representation Learning for Document Image Rectification | Hao Feng, Wengang Zhou, Jiajun Deng, Yuechen Wang, Houqiang Li | N/A | |
| S2-VER: Semi-Supervised Visual Emotion Recognition | Guoli Jia, Jufeng Yang | N/A | |
| Image Coding for Machines with Omnipotent Feature Learning | Ruoyu Feng, Xin Jin, Zongyu Guo, Runsen Feng, Yixin Gao, Tianyu He, Zhizheng Zhang, Simeng Sun, Zhibo Chen | N/A | |
| Feature Representation Learning for Unsupervised Cross-Domain Image Retrieval | Conghui Hu, Gim Hee Lee | N/A | |
| Fashionformer: A Simple, Effective and Unified Baseline for Human Fashion Segmentation and Recognition | Shilin Xu, Xiangtai Li, Jingbo Wang, Guangliang Cheng, Yunhai Tong, Dacheng Tao | N/A | |
| Semantic-Guided Multi-Mask Image Harmonization | Xuqian Ren, Yifan Liu | N/A | |
| Learning an Isometric Surface Parameterization for Texture Unwrapping | Sagnik Das, Ke Ma, Zhixin Shu, Dimitris Samaras | N/A | |
| Towards Regression-Free Neural Networks for Diverse Compute Platforms | Rahul Duggal, Hao Zhou, Shuo Yang, Jun Fang, Yuanjun Xiong, Wei Xia | N/A | |
| Relationship Spatialization for Depth Estimation | Xiaoyu Xu, Jiayan Qiu, Xinchao Wang, Zhou Wang | N/A | |
| Image2Point: 3D Point-Cloud Understanding with 2D Image Pretrained Models | Chenfeng Xu, Shijia Yang, Tomer Galanti, Bichen Wu, Xiangyu Yue, Bohan Zhai, Wei Zhan, Peter Vajda, Kurt Keutzer, Masayoshi Tomizuka | N/A | |
| FAR: Fourier Aerial Video Recognition | Divya Kothandaraman, Tianrui Guan, Xijun Wang, Shuowen Hu, Ming Lin, Dinesh Manocha | N/A | |
| Translating a Visual LEGO Manual to a Machine-Executable Plan | Ruocheng Wang, Yunzhi Zhang, Jiayuan Mao, Chin-Yi Cheng, Jiajun Wu | N/A | |
| Fabric Material Recovery from Video Using Multi-Scale Geometric Auto-Encoder | Junbang Liang, Ming Lin | N/A | |
| MegBA: A GPU-Based Distributed Library for Large-Scale Bundle Adjustment | Jie Ren, Wenteng Liang, Ran Yan, Luo Mai, Shiwen Liu, Xiao Liu | N/A | |
| The One Where They Reconstructed 3D Humans and Environments in TV Shows | Georgios Pavlakos, Ethan Weber, Matthew Tancik, Angjoo Kanazawa | N/A | |
| TALISMAN: Targeted Active Learning for Object Detection with Rare Classes and Slices Using Submodular Mutual Information | Suraj Kothawade, Saikat Ghosh, Sumit Shekhar, Yu Xiang, Rishabh Iyer | N/A | |
| An Efficient Person Clustering Algorithm for Open Checkout-Free Groceries | Junde Wu, Yu Zhang, Rao Fu, Yuanpei Liu, Jing Gao | N/A | |
| POP: Mining POtential Performance of New Fashion Products via Webly Cross-Modal Query Expansion | Christian Joppi, Geri Skenderi, Marco Cristani | N/A | |
| Pose Forecasting in Industrial Human-Robot Collaboration | Alessio Sampieri, Guido Maria D’Amely di Melendugno, Andrea Avogaro, Federico Cunico, Francesco Setti, Geri Skenderi, Marco Cristani, Fabio Galasso | N/A | |
| Actor-Centered Representations for Action Localization in Streaming Videos | Sathyanarayanan Aakur, Sudeep Sarkar | N/A | |
| Bandwidth-Aware Adaptive Codec for DNN Inference Offloading in IoT | Xiufeng Xie, Ning Zhou, Wentao Zhu, Ji Liu | N/A | |
| Domain Knowledge-Informed Self-Supervised Representations for Workout Form Assessment | Paritosh Parmar, Amol Gharat, Helge Rhodin | N/A | |
| Responsive Listening Head Generation: A Benchmark Dataset and Baseline | Mohan Zhou, Yalong Bai, Wei Zhang, Ting Yao, Tiejun Zhao, Tao Mei | N/A | |
| Towards Scale-Aware, Robust, and Generalizable Unsupervised Monocular Depth Estimation by Integrating IMU Motion Dynamics | Sen Zhang, Jing Zhang, Dacheng Tao | N/A | |
| TIPS: Text-Induced Pose Synthesis | Prasun Roy, Subhankar Ghosh, Saumik Bhattacharya, Umapada Pal, Michael Blumenstein | N/A | |
| Addressing Heterogeneity in Federated Learning via Distributional Transformation | Haolin Yuan, Bo Hui, Yuchen Yang, Philippe Burlina, Neil Zhenqiang Gong, Yinzhi Cao | N/A | |
| Where in the World Is This Image? Transformer-Based Geo-Localization in the Wild | Shraman Pramanick, Ewa M. Nowara, Joshua Gleason, Carlos D. Castillo, Rama Chellappa | N/A | |
| Colorization for In Situ Marine Plankton Images | Guannan Guo, Qi Lin, Tao Chen, Zhenghui Feng, Zheng Wang, Jianping Li | N/A | |
| Efficient Deep Visual and Inertial Odometry with Adaptive Visual Modality Selection | Mingyu Yang, Yu Chen, Hun-Seok Kim | N/A | |
| A Sketch Is Worth a Thousand Words: Image Retrieval with Text and Sketch | Patsorn Sangkloy, Wittawat Jitkrittum, Diyi Yang, James Hays | N/A | |
| A Cloud 3D Dataset and Application-Specific Learned Image Compression in Cloud 3D | Tianyi Liu, Sen He, Vinodh Kumaran Jayakumar, Wei Wang | N/A | |
| AutoTransition: Learning to Recommend Video Transition Effects | Yaojie Shen, Libo Zhang, Kai Xu, Xiaojie Jin | N/A | |
| Online Segmentation of LiDAR Sequences: Dataset and Algorithm | Romain Loiseau, Mathieu Aubry, Loïc Landrieu | N/A | |
| Open-World Semantic Segmentation for LIDAR Point Clouds | Jun Cen, Peng Yun, Shiwei Zhang, Junhao Cai, Di Luan, Mingqian Tang, Ming Liu, Michael Yu Wang | N/A | |
| KING: Generating Safety-Critical Driving Scenarios for Robust Imitation via Kinematics Gradients | Niklas Hanselmann, Katrin Renz, Kashyap Chitta, Apratim Bhattacharyya, Andreas Geiger | N/A | |
| Differentiable Raycasting for Self-Supervised Occupancy Forecasting | Tarasha Khurana, Peiyun Hu, Achal Dave, Jason Ziglar, David Held, Deva Ramanan | N/A | |
| InAction: Interpretable Action Decision Making for Autonomous Driving | Taotao Jing, Haifeng Xia, Renran Tian, Haoran Ding, Xiao Luo, Joshua Domeyer, Rini Sherony, Zhengming Ding | N/A | |
| CramNet: Camera-Radar Fusion with Ray-Constrained Cross-Attention for Robust 3D Object Detection | Jyh-Jing Hwang, Henrik Kretzschmar, Joshua Manela, Sean Rafferty, Nicholas Armstrong-Crews, Tiffany Chen, Dragomir Anguelov | N/A | |
| CODA: A Real-World Road Corner Case Dataset for Object Detection in Autonomous Driving | Kaican Li, Kai Chen, Haoyu Wang, Lanqing Hong, Chaoqiang Ye, Jianhua Han, Yukuai Chen, Wei Zhang, Chunjing Xu, Dit-Yan Yeung, Xiaodan Liang, Zhenguo Li, Hang Xu | N/A | |
| Motion Inspired Unsupervised Perception and Prediction in Autonomous Driving | Mahyar Najibi, Jingwei Ji, Yin Zhou, Charles R. Qi, Xinchen Yan, Scott Ettinger, Dragomir Anguelov | N/A | |
| StretchBEV: Stretching Future Instance Prediction Spatially and Temporally | Adil Kaan Akan, Fatma Güney | N/A | |
| RCLane: Relay Chain Prediction for Lane Detection | Shenghua Xu, Xinyue Cai, Bin Zhao, Li Zhang, Hang Xu, Yanwei Fu, Xiangyang Xue | N/A | |
| Drive&Segment: Unsupervised Semantic Segmentation of Urban Scenes via Cross-Modal Distillation | Antonin Vobecky, David Hurych, Oriane Siméoni, Spyros Gidaris, Andrei Bursuc, Patrick Pérez, Josef Sivic | N/A | |
| CenterFormer: Center-based Transformer for 3D Object Detection | Zixiang Zhou, Xiangchen Zhao, Yu Wang, Panqu Wang, Hassan Foroosh | N/A | |
| Physical Attack on Monocular Depth Estimation with Optimal Adversarial Patches | Zhiyuan Cheng, James Liang, Hongjun Choi, Guanhong Tao, Zhiwen Cao, Dongfang Liu, Xiangyu Zhang | N/A | |
| ST-P3: End-to-End Vision-Based Autonomous Driving via Spatial-Temporal Feature Learning | Shengchao Hu, Li Chen, Penghao Wu, Hongyang Li, Junchi Yan, Dacheng Tao | N/A | |
| PersFormer: 3D Lane Detection via Perspective Transformer and the OpenLane Benchmark | Li Chen, Chonghao Sima, Yang Li, Zehan Zheng, Jiajie Xu, Xiangwei Geng, Hongyang Li, Conghui He, Jianping Shi, Yu Qiao, Junchi Yan | N/A | |
| PointFix: Learning to Fix Domain Bias for Robust Online Stereo Adaptation | Kwonyoung Kim, Jungin Park, Jiyoung Lee, Dongbo Min, Kwanghoon Sohn | N/A | |
| BRNet: Exploring Comprehensive Features for Monocular Depth Estimation | Wencheng Han, Junbo Yin, Xiaogang Jin, Xiangdong Dai, Jianbing Shen | N/A | |
| SiamDoGe: Domain Generalizable Semantic Segmentation Using Siamese Network | Zhenyao Wu, Xinyi Wu, Xiaoping Zhang, Lili Ju, Song Wang | N/A | |
| Context-Aware Streaming Perception in Dynamic Environments | Gur-Eyal Sela, Ionel Gog, Justin Wong, Kumar Krishna Agrawal, Xiangxi Mo, Sukrit Kalra, Peter Schafhalter, Eric Leong, Xin Wang, Bharathan Balaji, Joseph Gonzalez, Ion Stoica | N/A | |
| SpOT: Spatiotemporal Modeling for 3D Object Tracking | Colton Stearns, Davis Rempe, Jie Li, Rareș Ambruș, Sergey Zakharov, Vitor Guizilini, Yanchao Yang, Leonidas J. Guibas | N/A | |
| Multimodal Transformer for Automatic 3D Annotation and Object Detection | Chang Liu, Xiaoyan Qian, Binxiao Huang, Xiaojuan Qi, Edmund Lam, Siew-Chong Tan, Ngai Wong | N/A | |
| Dynamic 3D Scene Analysis by Point Cloud Accumulation | Shengyu Huang, Zan Gojcic, Jiahui Huang, Andreas Wieser, Konrad Schindler | N/A | |
| Homogeneous Multi-modal Feature Fusion and Interaction for 3D Object Detection | Xin Li, Botian Shi, Yuenan Hou, Xingjiao Wu, Tianlong Ma, Yikang Li, Liang He | N/A | |
| JPerceiver: Joint Perception Network for Depth, Pose and Layout Estimation in Driving Scenes | Haimei Zhao, Jing Zhang, Sen Zhang, Dacheng Tao | N/A | |
| Semi-Supervised 3D Object Detection with Proficient Teachers | Junbo Yin, Jin Fang, Dingfu Zhou, Liangjun Zhang, Cheng-Zhong Xu, Jianbing Shen, Wenguan Wang | N/A | |
| Point Cloud Compression with Sibling Context and Surface Priors | Zhili Chen, Zian Qian, Sukai Wang, Qifeng Chen | N/A | |
| Lane Detection Transformer Based on Multi-Frame Horizontal and Vertical Attention and Visual Transformer Module | Han Zhang, Yunchao Gu, Xinliang Wang, Junjun Pan, Minghui Wang | N/A | |
| ProposalContrast: Unsupervised Pre-training for LiDAR-Based 3D Object Detection | Junbo Yin, Dingfu Zhou, Liangjun Zhang, Jin Fang, Cheng-Zhong Xu, Jianbing Shen, Wenguan Wang | N/A | |
| PreTraM: Self-Supervised Pre-training via Connecting Trajectory and Map | Chenfeng Xu, Tian Li, Chen Tang, Lingfeng Sun, Kurt Keutzer, Masayoshi Tomizuka, Alireza Fathi, Wei Zhan | N/A | |
| Master of All: Simultaneous Generalization of Urban-Scene Segmentation to All Adverse Weather Conditions | Nikhil Reddy, Abhinav Singhal, Abhishek Kumar, Mahsa Baktashmotlagh, Chetan Arora | N/A | |
| LESS: Label-Efficient Semantic Segmentation for LiDAR Point Clouds | Minghua Liu, Yin Zhou, Charles R. Qi, Boqing Gong, Hao Su, Dragomir Anguelov | N/A | |
| Visual Cross-View Metric Localization with Dense Uncertainty Estimates | Zimin Xia, Olaf Booij, Marco Manfredi, Julian F. P. Kooij | N/A | |
| V2X-ViT: Vehicle-to-Everything Cooperative Perception with Vision Transformer | Runsheng Xu, Hao Xiang, Zhengzhong Tu, Xin Xia, Ming-Hsuan Yang, Jiaqi Ma | N/A | |
| DevNet: Self-Supervised Monocular Depth Learning via Density Volume Construction | Kaichen Zhou, Lanqing Hong, Changhao Chen, Hang Xu, Chaoqiang Ye, Qingyong Hu, Zhenguo Li | N/A | |
| Action-Based Contrastive Learning for Trajectory Prediction | Marah Halawa, Olaf Hellwich, Pia Bideau | N/A | |
| Radatron: Accurate Detection Using Multi-Resolution Cascaded MIMO Radar | Sohrab Madani, Jayden Guan, Waleed Ahmed, Saurabh Gupta, Haitham Hassanieh | N/A | |
| LiDAR Distillation: Bridging the Beam-Induced Domain Gap for 3D Object Detection | Yi Wei, Zibu Wei, Yongming Rao, Jiaxin Li, Jie Zhou, Jiwen Lu | N/A | |
| Efficient Point Cloud Segmentation with Geometry-Aware Sparse Networks | Maosheng Ye, Rui Wan, Shuangjie Xu, Tongyi Cao, Qifeng Chen | N/A | |
| FH-Net: A Fast Hierarchical Network for Scene Flow Estimation on Real-World Point Clouds | Lihe Ding, Shaocong Dong, Tingfa Xu, Xinli Xu, Jie Wang, Jianan Li | N/A | |
| SpatialDETR: Robust Scalable Transformer-Based 3D Object Detection from Multi-View Camera Images with Global Cross-Sensor Attention | Simon Doll, Richard Schulz, Lukas Schneider, Viviane Benzin, Markus Enzweiler, Hendrik P.A. Lensch | N/A | |
| Pixel-Wise Energy-Biased Abstention Learning for Anomaly Segmentation on Complex Urban Driving Scenes | Yu Tian, Yuyuan Liu, Guansong Pang, Fengbei Liu, Yuanhong Chen, Gustavo Carneiro | N/A | |
| Rethinking Closed-Loop Training for Autonomous Driving | Chris Zhang, Runsheng Guo, Wenyuan Zeng, Yuwen Xiong, Binbin Dai, Rui Hu, Mengye Ren, Raquel Urtasun | N/A | |
| SLiDE: Self-Supervised LiDAR De-Snowing through Reconstruction Difficulty | Gwangtak Bae, Byungjun Kim, Seongyong Ahn, Jihong Min, Inwook Shim | N/A | |
| Generative Meta-Adversarial Network for Unseen Object Navigation | Sixian Zhang, Weijie Li, Xinhang Song, Yubing Bai, Shuqiang Jiang | N/A | |
| Object Manipulation via Visual Target Localization | Kiana Ehsani, Ali Farhadi, Aniruddha Kembhavi, Roozbeh Mottaghi | N/A | |
| MoDA: Map Style Transfer for Self-Supervised Domain Adaptation of Embodied Agents | Eun Sun Lee, Junho Kim, SangWon Park, Young Min Kim | N/A | |
| Housekeep: Tidying Virtual Households Using Commonsense Reasoning | Yash Kant, Arun Ramachandran, Sriram Yenamandra, Igor Gilitschenski, Dhruv Batra, Andrew Szot, Harsh Agrawal | N/A | |
| Domain Randomization-Enhanced Depth Simulation and Restoration for Perceiving and Grasping Specular and Transparent Objects | Qiyu Dai, Jiyao Zhang, Qiwei Li, Tianhao Wu, Hao Dong, Ziyuan Liu, Ping Tan, He Wang | N/A | |
| Resolving Copycat Problems in Visual Imitation Learning via Residual Action Prediction | Chia-Chi Chuang, Donglin Yang, Chuan Wen, Yang Gao | N/A | |
| OPD: Single-View 3D Openable Part Detection | Hanxiao Jiang, Yongsen Mao, Manolis Savva, Angel X. Chang | N/A | |
| AirDet: Few-Shot Detection without Fine-Tuning for Autonomous Exploration | Bowen Li, Chen Wang, Pranay Reddy, Seungchan Kim, Sebastian Scherer | N/A | |
| TransGrasp: Grasp Pose Estimation of a Category of Objects by Transferring Grasps from Only One Labeled Instance | Hongtao Wen, Jianhang Yan, Wanli Peng, Yi Sun | N/A | |
| StARformer: Transformer with State-Action-Reward Representations for Visual Reinforcement Learning | Jinghuan Shang, Kumara Kahatapitiya, Xiang Li, Michael S. Ryoo | N/A | |
| TIDEE: Tidying Up Novel Rooms Using Visuo-Semantic Commonsense Priors | Gabriel Sarch, Zhaoyuan Fang, Adam W. Harley, Paul Schydlo, Michael J. Tarr, Saurabh Gupta, Katerina Fragkiadaki | N/A | |
| Learning Efficient Multi-agent Cooperative Visual Exploration | Chao Yu, Xinyi Yang, Jiaxuan Gao, Huazhong Yang, Yu Wang, Yi Wu | N/A | |
| Zero-Shot Category-Level Object Pose Estimation | Walter Goodwin, Sagar Vaze, Ioannis Havoutis, Ingmar Posner | N/A | |
| Sim-to-Real 6D Object Pose Estimation via Iterative Self-Training for Robotic Bin Picking | Kai Chen, Rui Cao, Stephen James, Yichuan Li, Yun-Hui Liu, Pieter Abbeel, Qi Dou | N/A | |
| Active Audio-Visual Separation of Dynamic Sound Sources | Sagnik Majumder, Kristen Grauman | N/A | |
| DexMV: Imitation Learning for Dexterous Manipulation from Human Videos | Yuzhe Qin, Yueh-Hua Wu, Shaowei Liu, Hanwen Jiang, Ruihan Yang, Yang Fu, Xiaolong Wang | N/A | |
| Sim-2-Sim Transfer for Vision-and-Language Navigation in Continuous Environments | Jacob Krantz, Stefan Lee | N/A | |
| Style-Agnostic Reinforcement Learning | Juyong Lee, Seokjun Ahn, Jaesik Park | N/A | |
| Self-Supervised Interactive Object Segmentation through a Singulation-and-Grasping Approach | Houjian Yu, Changhyun Choi | N/A | |
| Learning from Unlabeled 3D Environments for Vision-and-Language Navigation | Shizhe Chen, Pierre-Louis Guhur, Makarand Tapaswi, Cordelia Schmid, Ivan Laptev | N/A | |
| BodySLAM: Joint Camera Localisation, Mapping, and Human Motion Tracking | Dorian F. Henning, Tristan Laidlow, Stefan Leutenegger | N/A | |
| FusionVAE: A Deep Hierarchical Variational Autoencoder for RGB Image Fusion | Fabian Duffhauss, Ngo Anh Vien, Hanna Ziesche, Gerhard Neumann | N/A | |
| Learning Algebraic Representation for Systematic Generalization in Abstract Reasoning | Chi Zhang, Sirui Xie, Baoxiong Jia, Ying Nian Wu, Song-Chun Zhu, Yixin Zhu | N/A | |
| Video Dialog As Conversation about Objects Living in Space-Time | Hoang-Anh Pham, Thao Minh Le, Vuong Le, Tu Minh Phuong, Truyen Tran | N/A | |
| Quaternion Equivariant Capsule Networks for 3D Point Clouds | Yongheng Zhao, Tolga Birdal, Jan Eric Lenssen, Emanuele Menegatti, Leonidas Guibas, Federico Tombari | N/A | |
| DeepFit: 3D Surface Fitting via Neural Network Weighted Least Squares | Yizhak Ben-Shabat, Stephen Gould | N/A | |
| NSGANetV2: Evolutionary Multi-Objective Surrogate-Assisted Neural Architecture Search | Zhichao Lu, Kalyanmoy Deb, Erik Goodman, Wolfgang Banzhaf, Vishnu Naresh Boddeti | N/A | |
| Describing Textures using Natural Language | Chenyun Wu, Mikayla Timm, Subhransu Maji | N/A | |
| Empowering Relational Network by Self-Attention Augmented Conditional Random Fields for Group Activity Recognition | Rizard Renanda Adhi Pramono, Yie Tarng Chen, Wen Hsien Fang | N/A | |
| AiR: Attention with Reasoning Capability | Shi Chen, Ming Jiang, Jinhui Yang, Qi Zhao | N/A | |
| Self6D: Self-Supervised Monocular 6D Object Pose Estimation | Gu Wang, Fabian Manhardt, Jianzhun Shao, Xiangyang Ji, Nassir Navab, Federico Tombari | N/A | |
| Invertible Image Rescaling | Mingqing Xiao, Shuxin Zheng, Chang Liu, Yaolong Wang, Di He, Guolin Ke, Jiang Bian, Zhouchen Lin, Tie-Yan Liu | N/A | |
| Synthesize then Compare: Detecting Failures and Anomalies for Semantic Segmentation | Yingda Xia, Yi Zhang, Fengze Liu, Wei Shen, Alan L. Yuille | N/A | |
| House-GAN: Relational Generative Adversarial Networks for Graph-constrained House Layout Generation | Nelson Nauata, Kai-Hung Chang, Chin-Yi Cheng, Greg Mori, Yasutaka Furukawa | N/A | |
| Crowdsampling the Plenoptic Function | Zhengqi Li, Wenqi Xian, Abe Davis, Noah Snavely | N/A | |
| VoxelPose: Towards Multi-Camera 3D Human Pose Estimation in Wild Environment | Hanyue Tu, Chunyu Wang, Wenjun Zeng | N/A | |
| End-to-End Object Detection with Transformers | Nicolas Carion, Francisco Massa, Gabriel Synnaeve, Nicolas Usunier, Alexander Kirillov, Sergey Zagoruyko | N/A | |
| DeepSFM: Structure From Motion Via Deep Bundle Adjustment | Xingkui Wei, Yinda Zhang, Zhuwen Li, Yanwei Fu, Xiangyang Xue | N/A | |
| Ladybird: Quasi-Monte Carlo Sampling for Deep Implicit Field Based 3D Reconstruction with Symmetry | Yifan Xu, Tianqi Fan, Yi Yuan, Gurprit Singh | N/A | |
| Segment as Points for Efficient Online Multi-Object Tracking and Segmentation | Zhenbo Xu, Wei Zhang, Xiao Tan, Wei Yang, Huan Huang, Shilei Wen, Errui Ding, Liusheng Huang | N/A | |
| Conditional Convolutions for Instance Segmentation | Zhi Tian, Chunhua Shen, Hao Chen | N/A | |
| MutualNet: Adaptive ConvNet via Mutual Learning from Network Width and Resolution | Taojiannan Yang, Sijie Zhu, Chen Chen, Shen Yan, Mi Zhang, Andrew Willis | N/A | |
| Fashionpedia: Ontology, Segmentation, and an Attribute Localization Dataset | Menglin Jia, Mengyun Shi, Mikhail Sirotenko, Yin Cui, Claire Cardie, Bharath Hariharan, Hartwig Adam, Serge Belongie | N/A | |
| Privacy Preserving Structure-from-Motion | Marcel Geppert, Viktor Larsson, Pablo Speciale, Johannes L. Schönberger, Marc Pollefeys | N/A | |
| Rewriting a Deep Generative Model | David Bau, Steven Liu, Tongzhou Wang, Jun-Yan Zhu, Antonio Torralba | N/A | |
| Compare and Reweight: Distinctive Image Captioning Using Similar Images Sets | Jiuniu Wang, Wenjia Xu, Qingzhong Wang, Antoni B. Chan | N/A | |
| Long-term Human Motion Prediction with Scene Context | Zhe Cao, Hang Gao, Karttikeya Mangalam, Qi-Zhi Cai, Minh Vo, Jitendra Malik | N/A | |
| NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis | Ben Mildenhall, Pratul P. Srinivasan, Matthew Tancik, Jonathan T. Barron, Ravi Ramamoorthi, Ren Ng | N/A | |
| ReferIt3D: Neural Listeners for Fine-Grained 3D Object Identification in Real-World Scenes | Panos Achlioptas, Ahmed Abdelreheem, Fei Xia, Mohamed Elhoseiny, Leonidas Guibas | N/A | |
| MatryODShka: Real-time 6DoF Video View Synthesis using Multi-Sphere Images | Benjamin Attal, Selena Ling, Aaron Gokaslan, Christian Richardt, James Tompkin | N/A | |
| Learning and Aggregating Deep Local Descriptors for Instance-level Recognition | Giorgos Tolias, Tomas Jenicek, Ondřej Chum | N/A | |
| A Consistently Fast and Globally Optimal Solution to the Perspective-n-Point Problem | George Terzakis, Manolis Lourakis | N/A | |
| Learn to Recover Visible Color for Video Surveillance in a Day | Guangming Wu, Yinqiang Zheng, Zhiling Guo, Zekun Cai, Xiaodan Shi, Xin Ding, Yifei Huang, Yimin Guo, Ryosuke Shibasaki | N/A | |
| Deep Fashion3D: A Dataset and Benchmark for 3D Garment Reconstruction from Single Images | Heming Zhu, Yu Cao, Hang Jin, Weikai Chen, Dong Du, Zhangye Wang, Shuguang Cui, Xiaoguang Han | N/A | |
| Spatially Adaptive Inference with Stochastic Feature Sampling and Interpolation | Zhenda Xie, Zheng Zhang, Xizhou Zhu, Gao Huang, Stephen Lin | N/A | |
| BorderDet: Border Feature for Dense Object Detection | Han Qiu, Yuchen Ma, Zeming Li, Songtao Liu, Jian Sun | N/A | |
| Regularization with Latent Space Virtual Adversarial Training | Genki Osada, Budrul Ahsan, Revoti Prasad Bora, Takashi Nishide | N/A | |
| Du²Net: Learning Depth Estimation from Dual-Cameras and Dual-Pixels | Yinda Zhang, Neal Wadhwa, Sergio Orts-Escolano, Christian Häne, Sean Fanello, Rahul Garg | N/A | |
| Model-Agnostic Boundary-Adversarial Sampling for Test-Time Generalization in Few-Shot learning | Jaekyeom Kim, Hyoungseok Kim, Gunhee Kim | N/A | |
| Targeted Attack for Deep Hashing based Retrieval | Jiawang Bai, Bin Chen, Yiming Li, Dongxian Wu, Weiwei Guo, Shu-Tao Xia, En-Hui Yang | N/A | |
| Gradient Centralization: A New Optimization Technique for Deep Neural Networks | Hongwei Yong, Jianqiang Huang, Xiansheng Hua, Lei Zhang | N/A | |
| Content-Aware Unsupervised Deep Homography Estimation | Jirong Zhang, Chuan Wang, Shuaicheng Liu, Lanpeng Jia, Nianjin Ye, Jue Wang, Ji Zhou, Jian Sun | N/A | |
| Multi-View Optimization of Local Feature Geometry | Mihai Dusmanu, Johannes L. Schönberger, Marc Pollefeys | N/A | |
| The Phong Surface: Efficient 3D Model Fitting using Lifted Optimization | Jingjing Shen, Thomas J. Cashman, Qi Ye, Tim Hutton, Toby Sharp, Federica Bogo, Andrew Fitzgibbon, Jamie Shotton | N/A | |
| Forecasting Human-Object Interaction: Joint Prediction of Motor Attention and Actions in First Person Video | Miao Liu, Siyu Tang, Yin Li, James M. Rehg | N/A | |
| Learning Stereo from Single Images | Jamie Watson, Oisin Mac Aodha, Daniyar Turmukhambetov, Gabriel J. Brostow, Michael Firman | N/A | |
| Prototype Rectification for Few-Shot Learning | Jinlu Liu, Liang Song, Yongqiang Qin | N/A | |
| Learning Feature Descriptors using Camera Pose Supervision | Qianqian Wang, Xiaowei Zhou, Bharath Hariharan, Noah Snavely | N/A | |
| Semantic Flow for Fast and Accurate Scene Parsing | Xiangtai Li, Ansheng You, Zhen Zhu, Houlong Zhao, Maoke Yang, Kuiyuan Yang, Shaohua Tan, Yunhai Tong | N/A | |
| Appearance Consensus Driven Self-Supervised Human Mesh Recovery | Jogendra Nath Kundu, Mugalodi Rakesh, Varun Jampani, Rahul Mysore Venkatesh, R. Venkatesh Babu | N/A | |
| Diffraction Line Imaging | Mark Sheinin, Dinesh N. Reddy, Matthew O’Toole, Srinivasa G. Narasimhan | N/A | |
| Aligning and Projecting Images to Class-conditional Generative Networks | Minyoung Huh, Richard Zhang, Jun-Yan Zhu, Sylvain Paris, Aaron Hertzmann | N/A | |
| Suppress and Balance: A Simple Gated Network for Salient Object Detection | Xiaoqi Zhao, Youwei Pang, Lihe Zhang, Huchuan Lu, Lei Zhang | N/A | |
| Visual Memorability for Robotic Interestingness via Unsupervised Online Learning | Chen Wang, Wenshan Wang, Yuheng Qiu, Yafei Hu, Sebastian Scherer | N/A | |
| Post-Training Piecewise Linear Quantization for Deep Neural Networks | Jun Fang, Ali Shafiee, Hamzah Abdel-Aziz, David Thorsley, Georgios Georgiadis, Joseph H. Hassoun | N/A | |
| Joint Disentangling and Adaptation for Cross-Domain Person Re-Identification | Yang Zou, Xiaodong Yang, Zhiding Yu, B.V.K. Vijaya Kumar, Jan Kautz | N/A | |
| In-Home Daily-Life Captioning Using Radio Signals | Lijie Fan, Tianhong Li, Yuan Yuan, Dina Katabi | N/A | |
| Self-Challenging Improves Cross-Domain Generalization | Zeyi Huang, Haohan Wang, Eric P. Xing, Dong Huang | N/A | |
| A Competence-aware Curriculum for Visual Concepts Learning via Question Answering | Qing Li, Siyuan Huang, Yining Hong, Song-Chun Zhu | N/A | |
| Multitask Learning Strengthens Adversarial Robustness | Chengzhi Mao, Amogh Gupta, Vikram Nitin, Baishakhi Ray, Shuran Song, Junfeng Yang, Carl Vondrick | N/A | |
| S2DNAS: Transforming Static CNN Model for Dynamic Inference via Neural Architecture Search | Zhihang Yuan, Bingzhe Wu, Guangyu Sun, Zheng Liang, Shiwan Zhao, Weichen Bi | N/A | |
| Improving Deep Video Compression by Resolution-adaptive Flow Coding | Zhihao Hu, Zhenghao Chen, Dong Xu, Guo Lu, Wanli Ouyang, Shuhang Gu | N/A | |
| Motion Capture from Internet Videos | Junting Dong, Qing Shuai, Yuanqing Zhang, Xian Liu, Xiaowei Zhou, Hujun Bao | N/A | |
| Appearance-Preserving 3D Convolution for Video-based Person Re-identification | Xinqian Gu, Hong Chang, Bingpeng Ma, Hongkai Zhang, Xilin Chen | N/A | |
| Solving the Blind Perspective-n-Point Problem End-To-End With Robust Differentiable Geometric Optimization | Dylan Campbell, Liu Liu, Stephen Gould | N/A | |
| Exploiting Deep Generative Prior for Versatile Image Restoration and Manipulation | Xingang Pan, Xiaohang Zhan, Bo Dai, Dahua Lin, Chen Change Loy, Ping Luo | N/A | |
| Deep Spatial-angular Regularization for Compressive Light Field Reconstruction over Coded Apertures | Mantang Guo, Junhui Hou, Jing Jin, Jie Chen, Lap-Pui Chau | N/A | |
| Video-based Remote Physiological Measurement via Cross-verified Feature Disentangling | Xuesong Niu, Zitong Yu, Hu Han, Xiaobai Li, Shiguang Shan, Guoying Zhao | N/A | |
| Combining Implicit Function Learning and Parametric Models for 3D Human Reconstruction | Bharat Lal Bhatnagar, Cristian Sminchisescu, Christian Theobalt, Gerard Pons-Moll | N/A | |
| Orientation-aware Vehicle Re-identification with Semantics-guided Part Attention Network | Tsai-Shien Chen, Chih-Ting Liu, Chih-Wei Wu, Shao-Yi Chien | N/A | |
| Mining Cross-Image Semantics for Weakly Supervised Semantic Segmentation | Guolei Sun, Wenguan Wang, Jifeng Dai, Luc Van Gool | N/A | |
| CoReNet: Coherent 3D Scene Reconstruction from a Single RGB Image | Stefan Popov, Pablo Bauszat, Vittorio Ferrari | N/A | |
| Layer-wise Conditioning Analysis in Exploring the Learning Dynamics of DNNs | Lei Huang, Jie Qin, Li Liu, Fan Zhu, Ling Shao | N/A | |
| RAFT: Recurrent All-Pairs Field Transforms for Optical Flow | Zachary Teed, Jia Deng | N/A | |
| Domain-invariant Stereo Matching Networks | Feihu Zhang, Xiaojuan Qi, Ruigang Yang, Victor Prisacariu, Benjamin Wah, Philip Torr | N/A | |
| DeepHandMesh: A Weakly-supervised Deep Encoder-Decoder Framework for High-fidelity Hand Mesh Modeling | Gyeongsik Moon, Takaaki Shiratori, Kyoung Mu Lee | N/A | |
| Content Adaptive and Error Propagation Aware Deep Video Compression | Guo Lu, Chunlei Cai, Xiaoyun Zhang, Li Chen, Wanli Ouyang, Dong Xu, Zhiyong Gao | N/A | |
| Towards Streaming Perception | Mengtian Li, Yu-Xiong Wang, Deva Ramanan | N/A | |
| Towards Automated Testing and Robustification by Semantic Adversarial Data Generation | Rakshith Shetty, Mario Fritz, Bernt Schiele | N/A | |
| Adversarial Generative Grammars for Human Activity Prediction | AJ Piergiovanni, Anelia Angelova, Alexander Toshev, Michael S. Ryoo | N/A | |
| GDumb: A Simple Approach that Questions Our Progress in Continual Learning | Ameya Prabhu, Philip H. S. Torr, Puneet K. Dokania | N/A | |
| Learning Lane Graph Representations for Motion Forecasting | Ming Liang, Bin Yang, Rui Hu, Yun Chen, Renjie Liao, Song Feng, Raquel Urtasun | N/A | |
| What Matters in Unsupervised Optical Flow | Rico Jonschkowski, Austin Stone, Jonathan T. Barron, Ariel Gordon, Kurt Konolige, Anelia Angelova | N/A | |
| Synthesis and Completion of Facades from Satellite Imagery | Xiaowei Zhang, Christopher May, Daniel Aliaga | N/A | |
| Mapillary Planet-Scale Depth Dataset | Manuel López Antequera, Pau Gargallo, Markus Hofinger, Samuel Rota Bulò, Yubin Kuang, Peter Kontschieder | N/A | |
| V2VNet: Vehicle-to-Vehicle Communication for Joint Perception and Prediction | Tsun-Hsuan Wang, Sivabalan Manivasagam, Ming Liang, Bin Yang, Wenyuan Zeng, Raquel Urtasun | N/A | |
| Training Interpretable Convolutional Neural Networks by Differentiating Class-specific Filters | Haoyu Liang, Zhihao Ouyang, Yuyuan Zeng, Hang Su, Zihao He, Shu-Tao Xia, Jun Zhu, Bo Zhang | N/A | |
| EagleEye: Fast Sub-net Evaluation for Efficient Neural Network Pruning | Bailin Li, Bowen Wu, Jiang Su, Guangrun Wang | N/A | |
| Intrinsic Point Cloud Interpolation via Dual Latent Space Navigation | Marie-Julie Rakotosaona, Maks Ovsjanikov | N/A | |
| Cross-Domain Cascaded Deep Translation | Oren Katzir, Dani Lischinski, Daniel Cohen-Or | N/A | |
| “Look Ma, no landmarks!” – Unsupervised, Model-based Dense Face Alignment | Tatsuro Koizumi, William A. P. Smith | N/A | |
| Online Invariance Selection for Local Feature Descriptors | Rémi Pautrat, Viktor Larsson, Martin R. Oswald, Marc Pollefeys | N/A | |
| Rethinking Image Inpainting via a Mutual Encoder-Decoder with Feature Equalizations | Hongyu Liu, Bin Jiang, Yibing Song, Wei Huang, Chao Yang | N/A | |
| TextCaps: a Dataset for Image Captioning with Reading Comprehension | Oleksii Sidorov, Ronghang Hu, Marcus Rohrbach, Amanpreet Singh | N/A | |
| It is not the Journey but the Destination: Endpoint Conditioned Trajectory Prediction | Karttikeya Mangalam, Harshayu Girase, Shreyas Agarwal, Kuan-Hui Lee, Ehsan Adeli, Jitendra Malik, Adrien Gaidon | N/A | |
| Learning What to Learn for Video Object Segmentation | Goutam Bhat, Felix Järemo Lawin, Martin Danelljan, Andreas Robinson, Michael Felsberg, Luc Van Gool, Radu Timofte | N/A | |
| SIZER: A Dataset and Model for Parsing 3D Clothing and Learning Size Sensitive 3D Clothing | Garvita Tiwari, Bharat Lal Bhatnagar, Tony Tung, Gerard Pons-Moll | N/A | |
| LIMP: Learning Latent Shape Representations with Metric Preservation Priors | Luca Cosmo, Antonio Norelli, Oshri Halimi, Ron Kimmel, Emanuele Rodolà | N/A | |
| Unsupervised Sketch to Photo Synthesis | Runtao Liu, Qian Yu, Stella X. Yu | N/A | |
| A Simple Way to Make Neural Networks Robust Against Diverse Image Corruptions | Evgenia Rusak, Lukas Schott, Roland S. Zimmermann, Julian Bitterwolf, Oliver Bringmann, Matthias Bethge, Wieland Brendel | N/A | |
| SoftPoolNet: Shape Descriptor for Point Cloud Completion and Classification | Yida Wang, David Joseph Tan, Nassir Navab, Federico Tombari | N/A | |
| Hierarchical Face Aging through Disentangled Latent Characteristics | Peipei Li, Huaibo Huang, Yibo Hu, Xiang Wu, Ran He, Zhenan Sun | N/A | |
| Hybrid Models for Open Set Recognition | Hongjie Zhang, Ang Li, Jie Guo, Yanwen Guo | N/A | |
| TopoGAN: A Topology-Aware Generative Adversarial Network | Fan Wang, Huidong Liu, Dimitris Samaras, Chao Chen | N/A | |
| Learning to Localize Actions from Moments | Fuchen Long, Ting Yao, Zhaofan Qiu, Xinmei Tian, Jiebo Luo, Tao Mei | N/A | |
| ForkGAN: Seeing into the Rainy Night | Ziqiang Zheng, Yang Wu, Xinran Han, Jianbo Shi | N/A | |
| TCGM: An Information-Theoretic Framework for Semi-Supervised Multi-Modality Learning | Xinwei Sun, Yilun Xu, Peng Cao, Yuqing Kong, Lingjing Hu, Shanghang Zhang, Yizhou Wang | N/A | |
| ExchNet: A Unified Hashing Network for Large-Scale Fine-Grained Image Retrieval | Quan Cui, Qing-Yuan Jiang, Xiu-Shen Wei, Wu-Jun Li, Osamu Yoshie | N/A | |
| TSIT: A Simple and Versatile Framework for Image-to-Image Translation | Liming Jiang, Changxu Zhang, Mingyang Huang, Chunxiao Liu, Jianping Shi, Chen Change Loy | N/A | |
| ProxyBNN: Learning Binarized Neural Networks via Proxy Matrices | Xiangyu He, Zitao Mo, Ke Cheng, Weixiang Xu, Qinghao Hu, Peisong Wang, Qingshan Liu, Jian Cheng | N/A | |
| HMOR: Hierarchical Multi-Person Ordinal Relations for Monocular Multi-Person 3D Pose Estimation | Can Wang, Jiefeng Li, Wentao Liu, Chen Qian, Cewu Lu | N/A | |
| Mask2CAD: 3D Shape Prediction by Learning to Segment and Retrieve | Weicheng Kuo, Anelia Angelova, Tsung-Yi Lin, Angela Dai | N/A | |
| A Unified Framework of Surrogate Loss by Refactoring and Interpolation | Lanlan Liu, Mingzhe Wang, Jia Deng | N/A | |
| Deep Reflectance Volumes: Relightable Reconstructions from Multi-View Photometric Images | Sai Bi, Zexiang Xu, Kalyan Sunkavalli, Miloš Hašan, Yannick Hold-Geoffroy, David Kriegman, Ravi Ramamoorthi | N/A | |
| Memory-augmented Dense Predictive Coding for Video Representation Learning | Tengda Han, Weidi Xie, Andrew Zisserman | N/A | |
| PointMixup: Augmentation for Point Clouds | Yunlu Chen, Vincent Tao Hu, Efstratios Gavves, Thomas Mensink, Pascal Mettes, Pengwan Yang, Cees G. M. Snoek | N/A | |
| Identity-Guided Human Semantic Parsing for Person Re-Identification | Kuan Zhu, Haiyun Guo, Zhiwei Liu, Ming Tang, Jinqiao Wang | N/A | |
| Learning Gradient Fields for Shape Generation | Ruojin Cai, Guandao Yang, Hadar Averbuch-Elor, Zekun Hao, Serge Belongie, Noah Snavely, Bharath Hariharan | N/A | |
| COCO-FUNIT: Few-Shot Unsupervised Image Translation with a Content Conditioned Style Encoder | Kuniaki Saito, Kate Saenko, Ming-Yu Liu | N/A | |
| Corner Proposal Network for Anchor-free, Two-stage Object Detection | Kaiwen Duan, Lingxi Xie, Honggang Qi, Song Bai, Qingming Huang, Qi Tian | N/A | |
| PhraseClick: Toward Achieving Flexible Interactive Segmentation by Phrase and Click | Henghui Ding, Scott Cohen, Brian Price, Xudong Jiang | N/A | |
| Unified Multisensory Perception: Weakly-Supervised Audio-Visual Video Parsing | Yapeng Tian, Dingzeyu Li, Chenliang Xu | N/A | |
| Learning Delicate Local Representations for Multi-Person Pose Estimation | Yuanhao Cai, Zhicheng Wang, Zhengxiong Luo, Binyi Yin, Angang Du, Haoqian Wang, Xiangyu Zhang, Xinyu Zhou, Erjin Zhou, Jian Sun | N/A | |
| Learning to Plan with Uncertain Topological Maps | Edward Beeching, Jilles Dibangoye, Olivier Simonin, Christian Wolf | N/A | |
| Neural Design Network: Graphic Layout Generation with Constraints | Hsin-Ying Lee, Lu Jiang, Irfan Essa, Phuong B Le, Haifeng Gong, Ming-Hsuan Yang, Weilong Yang | N/A | |
| Learning Open Set Network with Discriminative Reciprocal Points | Guangyao Chen, Limeng Qiao, Yemin Shi, Peixi Peng, Jia Li, Tiejun Huang, Shiliang Pu, Yonghong Tian | N/A | |
| Convolutional Occupancy Networks | Songyou Peng, Michael Niemeyer, Lars Mescheder, Marc Pollefeys, Andreas Geiger | N/A | |
| Multi-person 3D Pose Estimation in Crowded Scenes Based on Multi-View Geometry | He Chen, Pengfei Guo, Pengfei Li, Gim Hee Lee, Gregory Chirikjian | N/A | |
| TIDE: A General Toolbox for Identifying Object Detection Errors | Daniel Bolya, Sean Foley, James Hays, Judy Hoffman | N/A | |
| PointContrast: Unsupervised Pre-training for 3D Point Cloud Understanding | Saining Xie, Jiatao Gu, Demi Guo, Charles R. Qi, Leonidas Guibas, Or Litany | N/A | |
| DSA: More Efficient Budgeted Pruning via Differentiable Sparsity Allocation | Xuefei Ning, Tianchen Zhao, Wenshuo Li, Peng Lei, Yu Wang, Huazhong Yang | N/A | |
| Circumventing Outliers of AutoAugment with Knowledge Distillation | Longhui Wei, An Xiao, Lingxi Xie, Xiaopeng Zhang, Xin Chen, Qi Tian | N/A | |
| S2DNet: Learning Image Features for Accurate Sparse-to-Dense Matching | Hugo Germain, Guillaume Bourmaud, Vincent Lepetit | N/A | |
| RTM3D: Real-time Monocular 3D Detection from Object Keypoints for Autonomous Driving | Peixuan Li, Huaici Zhao, Pengfei Liu, Feidao Cao | N/A | |
| Video Object Segmentation with Episodic Graph Memory Networks | Xiankai Lu, Wenguan Wang, Martin Danelljan, Tianfei Zhou, Jianbing Shen, Luc Van Gool | N/A | |
| Rethinking Bottleneck Structure for Efficient Mobile Network Design | Daquan Zhou, Qibin Hou, Yunpeng Chen, Jiashi Feng, Shuicheng Yan | N/A | |
| Side-Tuning: A Baseline for Network Adaptation via Additive Side Networks | Jeffrey O. Zhang, Alexander Sax, Amir Zamir, Leonidas Guibas, Jitendra Malik | N/A | |
| Towards Part-aware Monocular 3D Human Pose Estimation: An Architecture Search Approach | Zerui Chen, Yan Huang, Hongyuan Yu, Bin Xue, Ke Han, Yiru Guo, Liang Wang | N/A | |
| REVISE: A Tool for Measuring and Mitigating Bias in Visual Datasets | Angelina Wang, Arvind Narayanan, Olga Russakovsky | N/A | |
| Contrastive Learning for Weakly Supervised Phrase Grounding | Tanmay Gupta, Arash Vahdat, Gal Chechik, Xiaodong Yang, Jan Kautz, Derek Hoiem | N/A | |
| Collaborative Learning of Gesture Recognition and 3D Hand Pose Estimation with Multi-Order Feature Analysis | Siyuan Yang, Jun Liu, Shijian Lu, Meng Hwa Er, Alex C. Kot | N/A | |
| Making an Invisibility Cloak: Real World Adversarial Attacks on Object Detectors | Zuxuan Wu, Ser-Nam Lim, Larry S. Davis, Tom Goldstein | N/A | |
| TuiGAN: Learning Versatile Image-to-Image Translation with Two Unpaired Images | Jianxin Lin, Yingxue Pang, Yingce Xia, Zhibo Chen, Jiebo Luo | N/A | |
| Semi-Siamese Training for Shallow Face Learning | Hang Du, Hailin Shi, Yuchi Liu, Jun Wang, Zhen Lei, Dan Zeng, Tao Mei | N/A | |
| GAN Slimming: All-in-One GAN Compression by A Unified Optimization Framework | Haotao Wang, Shupeng Gui, Haichuan Yang, Ji Liu, Zhangyang Wang | N/A | |
| Human Interaction Learning on 3D Skeleton Point Clouds for Video Violence Recognition | Yukun Su, Guosheng Lin, Jinhui Zhu, Qingyao Wu | N/A | |
| Binarized Neural Network for Single Image Super Resolution | Jingwei Xin, Nannan Wang, Xinrui Jiang, Jie Li, Heng Huang, Xinbo Gao | N/A | |
| Axial-DeepLab: Stand-Alone Axial-Attention for Panoptic Segmentation | Huiyu Wang, Yukun Zhu, Bradley Green, Hartwig Adam, Alan Yuille, Liang-Chieh Chen | N/A | |
| Adaptive Computationally Efficient Network for Monocular 3D Hand Pose Estimation | Zhipeng Fan, Jun Liu, Yao Wang | N/A | |
| Chained-Tracker: Chaining Paired Attentive Regression Results for End-to-End Joint Multiple-Object Detection and Tracking | Jinlong Peng, Changan Wang, Fangbin Wan, Yang Wu, Yabiao Wang, Ying Tai, Chengjie Wang, Jilin Li, Feiyue Huang, Yanwei Fu | N/A | |
| Distribution-Balanced Loss for Multi-Label Classification in Long-Tailed Datasets | Tong Wu, Qingqiu Huang, Ziwei Liu, Yu Wang, Dahua Lin | N/A | |
| Hamiltonian Dynamics for Real-World Shape Interpolation | Marvin Eisenberger, Daniel Cremers | N/A | |
| Learning to Scale Multilingual Representations for Vision-Language Tasks | Andrea Burns, Donghyun Kim, Derry Wijaya, Kate Saenko, Bryan A. Plummer | N/A | |
| Multi-modal Transformer for Video Retrieval | Valentin Gabeur, Chen Sun, Karteek Alahari, Cordelia Schmid | N/A | |
| Feature Representation Matters: End-to-End Learning for Reference-based Image Super-resolution | Yanchun Xie, Jimin Xiao, Mingjie Sun, Chao Yao, Kaizhu Huang | N/A | |
| RobustFusion: Human Volumetric Capture with Data-driven Visual Cues using a RGBD Camera | Zhuo Su, Lan Xu, Zerong Zheng, Tao Yu, Yebin Liu, Lu Fang | N/A | |
| Surface Normal Estimation of Tilted Images via Spatial Rectifier | Tien Do, Khiem Vuong, Stergios I. Roumeliotis, Hyun Soo Park | N/A | |
| Multimodal Shape Completion via Conditional Generative Adversarial Networks | Rundi Wu, Xuelin Chen, Yixin Zhuang, Baoquan Chen | N/A | |
| Generative Sparse Detection Networks for 3D Single-shot Object Detection | JunYoung Gwak, Christopher Choy, Silvio Savarese | N/A | |
| Grounded Situation Recognition | Sarah Pratt, Mark Yatskar, Luca Weihs, Ali Farhadi, Aniruddha Kembhavi | N/A | |
| Learning Modality Interaction for Temporal Sentence Localization and Event Captioning in Videos | Shaoxiang Chen, Wenhao Jiang, Wei Liu, Yu-Gang Jiang | N/A | |
| Unpaired Learning of Deep Image Denoising | Xiaohe Wu, Ming Liu, Yue Cao, Dongwei Ren, Wangmeng Zuo | N/A | |
| Self-supervising Fine-grained Region Similarities for Large-scale Image Localization | Yixiao Ge, Haibo Wang, Feng Zhu, Rui Zhao, Hongsheng Li | N/A | |
| Rotationally-Temporally Consistent Novel View Synthesis of Human Performance Video | Youngjoong Kwon, Stefano Petrangeli, Dahun Kim, Haoliang Wang, Eunbyung Park, Viswanathan Swaminathan, Henry Fuchs | N/A | |
| Side-Aware Boundary Localization for More Precise Object Detection | Jiaqi Wang, Wenwei Zhang, Yuhang Cao, Kai Chen, Jiangmiao Pang, Tao Gong, Jianping Shi, Chen Change Loy, Dahua Lin | N/A | |
| SF-Net: Single-Frame Supervision for Temporal Action Localization | Fan Ma, Linchao Zhu, Yi Yang, Shengxin Zha, Gourab Kundu, Matt Feiszli, Zheng Shou | N/A | |
| Negative Margin Matters: Understanding Margin in Few-shot Classification | Bin Liu, Yue Cao, Yutong Lin, Qi Li, Zheng Zhang, Mingsheng Long, Han Hu | N/A | |
| Particularity beyond Commonality: Unpaired Identity Transfer with Multiple References | Ruizheng Wu, Xin Tao, Yingcong Chen, Xiaoyong Shen, Jiaya Jia | N/A | |
| Tracking Objects as Points | Xingyi Zhou, Vladlen Koltun, Philipp Krähenbühl | N/A | |
| CPGAN: Content-Parsing Generative Adversarial Networks for Text-to-Image Synthesis | Jiadong Liang, Wenjie Pei, Feng Lu | N/A | |
| Transporting Labels via Hierarchical Optimal Transport for Semi-Supervised Learning | Fariborz Taherkhani, Ali Dabouei, Sobhan Soleymani, Jeremy Dawson, Nasser M. Nasrabadi | N/A | |
| MTI-Net: Multi-Scale Task Interaction Networks for Multi-Task Learning | Simon Vandenhende, Stamatios Georgoulis, Luc Van Gool | N/A | |
| Learning to Factorize and Relight a City | Andrew Liu, Shiry Ginosar, Tinghui Zhou, Alexei A. Efros, Noah Snavely | N/A | |
| Region Graph Embedding Network for Zero-Shot Learning | Guo-Sen Xie, Li Liu, Fan Zhu, Fang Zhao, Zheng Zhang, Yazhou Yao, Jie Qin, Ling Shao | N/A | |
| GRAB: A Dataset of Whole-Body Human Grasping of Objects | Omid Taheri, Nima Ghorbani, Michael J. Black, Dimitrios Tzionas | N/A | |
| DEMEA: Deep Mesh Autoencoders for Non-Rigidly Deforming Objects | Edgar Tretschk, Ayush Tewari, Michael Zollhöfer, Vladislav Golyanik, Christian Theobalt | N/A | |
| RANSAC-Flow: Generic Two-stage Image Alignment | Xi Shen, François Darmon, Alexei A. Efros, Mathieu Aubry | N/A | |
| Semantic Object Prediction and Spatial Sound Super-Resolution with Binaural Sounds | Arun Balajee Vasudevan, Dengxin Dai, Luc Van Gool | N/A | |
| Neural Object Learning for 6D Pose Estimation Using a Few Cluttered Images | Kiru Park, Timothy Patten, Markus Vincze | N/A | |
| Dense Hybrid Recurrent Multi-view Stereo Net with Dynamic Consistency Checking | Jianfeng Yan, Zizhuang Wei, Hongwei Yi, Mingyu Ding, Runze Zhang, Yisong Chen, Guoping Wang, Yu-Wing Tai | N/A | |
| Pixel-Pair Occlusion Relationship Map (P2ORM): Formulation, Inference & Application | Xuchong Qiu, Yang Xiao, Chaohui Wang, Renaud Marlet | N/A | |
| MovieNet: A Holistic Dataset for Movie Understanding | Qingqiu Huang, Yu Xiong, Anyi Rao, Jiaze Wang, Dahua Lin | N/A | |
| Short-Term and Long-Term Context Aggregation Network for Video Inpainting | Ang Li, Shanshan Zhao, Xingjun Ma, Mingming Gong, Jianzhong Qi, Rui Zhang, Dacheng Tao, Ramamohanarao Kotagiri | N/A | |
| DH3D: Deep Hierarchical 3D Descriptors for Robust Large-Scale 6DoF Relocalization | Juan Du, Rui Wang, Daniel Cremers | N/A | |
| Face Super-Resolution Guided by 3D Facial Priors | Xiaobin Hu, Wenqi Ren, John LaMaster, Xiaochun Cao, Xiaoming Li, Zechao Li, Bjoern Menze, Wei Liu | N/A | |
| Label Propagation with Augmented Anchors: A Simple Semi-Supervised Learning baseline for Unsupervised Domain Adaptation | Yabin Zhang, Bin Deng, Kui Jia, Lei Zhang | N/A | |
| Are Labels Necessary for Neural Architecture Search? | Chenxi Liu, Piotr Dollár, Kaiming He, Ross Girshick, Alan Yuille, Saining Xie | N/A | |
| BLSM: A Bone-Level Skinned Model of the Human Mesh | Haoyang Wang, Riza Alp Güler, Iasonas Kokkinos, George Papandreou, Stefanos Zafeiriou | N/A | |
| Associative Alignment for Few-shot Image Classification | Arman Afrasiyabi, Jean-François Lalonde, Christian Gagné | N/A | |
| Cyclic Functional Mapping: Self-supervised Correspondence between Non-isometric Deformable Shapes | Dvir Ginzburg, Dan Raviv | N/A | |
| View-Invariant Probabilistic Embedding for Human Pose | Jennifer J. Sun, Jiaping Zhao, Liang-Chieh Chen, Florian Schroff, Hartwig Adam, Ting Liu | N/A | |
| Contact and Human Dynamics from Monocular Video | Davis Rempe, Leonidas J. Guibas, Aaron Hertzmann, Bryan Russell, Ruben Villegas, Jimei Yang | N/A | |
| PointPWC-Net: Cost Volume on Point Clouds for (Self-)Supervised Scene Flow Estimation | Wenxuan Wu, Zhi Yuan Wang, Zhuwen Li, Wei Liu, Li Fuxin | N/A | |
| Points2Surf: Learning Implicit Surfaces from Point Clouds | Philipp Erler, Paul Guerrero, Stefan Ohrhallinger, Niloy J. Mitra, Michael Wimmer | N/A | |
| Few-Shot Scene-Adaptive Anomaly Detection | Yiwei Lu, Frank Yu, Mahesh Kumar Krishna Reddy, Yang Wang | N/A | |
| Personalized Face Modeling for Improved Face Reconstruction and Motion Retargeting | Bindita Chaudhuri, Noranart Vesdapunt, Linda Shapiro, Baoyuan Wang | N/A | |
| Entropy Minimisation Framework for Event-based Vision Model Estimation | Urbano Miguel Nunes, Yiannis Demiris | N/A | |
| Reconstructing NBA Players | Luyang Zhu, Konstantinos Rematas, Brian Curless, Steven M. Seitz, Ira Kemelmacher-Shlizerman | N/A | |
| PIoU Loss: Towards Accurate Oriented Object Detection in Complex Environments | Zhiming Chen, Kean Chen, Weiyao Lin, John See, Hui Yu, Yan Ke, Cong Yang | N/A | |
| TENet: Triple Excitation Network for Video Salient Object Detection | Sucheng Ren, Chu Han, Xin Yang, Guoqiang Han, Shengfeng He | N/A | |
| Deep Feedback Inverse Problem Solver | Wei-Chiu Ma, Shenlong Wang, Jiayuan Gu, Sivabalan Manivasagam, Antonio Torralba, Raquel Urtasun | N/A | |
| Learning From Multiple Experts: Self-paced Knowledge Distillation for Long-tailed Classification | Liuyu Xiang, Guiguang Ding, Jungong Han | N/A | |
| Hallucinating Visual Instances in Total Absentia | Jiayan Qiu, Yiding Yang, Xinchao Wang, Dacheng Tao | N/A | |
| Weakly-supervised 3D Shape Completion in the Wild | Jiayuan Gu, Wei-Chiu Ma, Sivabalan Manivasagam, Wenyuan Zeng, Zihao Wang, Yuwen Xiong, Hao Su, Raquel Urtasun | N/A | |
| DTVNet: Dynamic Time-lapse Video Generation via Single Still Image | Jiangning Zhang, Chao Xu, Liang Liu, Mengmeng Wang, Xia Wu, Yong Liu, Yunliang Jiang | N/A | |
| CLIFFNet for Monocular Depth Estimation with Hierarchical Embedding Loss | Lijun Wang, Jianming Zhang, Yifan Wang, Huchuan Lu, Xiang Ruan | N/A | |
| Collaborative Video Object Segmentation by Foreground-Background Integration | Zongxin Yang, Yunchao Wei, Yi Yang | N/A | |
| Adaptive Margin Diversity Regularizer for handling Data Imbalance in Zero-Shot SBIR | Titir Dutta, Anurag Singh, Soma Biswas | N/A | |
| ETH-XGaze: A Large Scale Dataset for Gaze Estimation under Extreme Head Pose and Gaze Variation | Xucong Zhang, Seonwook Park, Thabo Beeler, Derek Bradley, Siyu Tang, Otmar Hilliges | N/A | |
| Calibration-free Structure-from-Motion with Calibrated Radial Trifocal Tensors | Viktor Larsson, Nicolas Zobernig, Kasim Taskin, Marc Pollefeys | N/A | |
| Occupancy Anticipation for Efficient Exploration and Navigation | Santhosh K. Ramakrishnan, Ziad Al-Halah, Kristen Grauman | N/A | |
| Unified Image and Video Saliency Modeling | Richard Droste, Jianbo Jiao, J. Alison Noble | N/A | |
| TAO: A Large-Scale Benchmark for Tracking Any Object | Achal Dave, Tarasha Khurana, Pavel Tokmakov, Cordelia Schmid, Deva Ramanan | N/A | |
| A Generalization of Otsu’s Method and Minimum Error Thresholding | Jonathan T. Barron | N/A | |
| A Cordial Sync: Going Beyond Marginal Policies for Multi-Agent Embodied Tasks | Unnat Jain, Luca Weihs, Eric Kolve, Ali Farhadi, Svetlana Lazebnik, Aniruddha Kembhavi, Alexander Schwing | N/A | |
| Big Transfer (BiT): General Visual Representation Learning | Alexander Kolesnikov, Lucas Beyer, Xiaohua Zhai, Joan Puigcerver, Jessica Yung, Sylvain Gelly, Neil Houlsby | N/A | |
| VisualCOMET: Reasoning about the Dynamic Context of a Still Image | Jae Sung Park, Chandra Bhagavatula, Roozbeh Mottaghi, Ali Farhadi, Yejin Choi | N/A | |
| Few-shot Action Recognition with Permutation-invariant Attention | Hongguang Zhang, Li Zhang, Xiaojuan Qi, Hongdong Li, Philip H. S. Torr, Piotr Koniusz | N/A | |
| Character Grounding and Re-Identification in Story of Videos and Text Descriptions | Youngjae Yu, Jongseok Kim, Heeseung Yun, Jiwan Chung, Gunhee Kim | N/A | |
| AABO: Adaptive Anchor Box Optimization for Object Detection via Bayesian Sub-sampling | Wenshuo Ma, Tingzhong Tian, Hang Xu, Yimin Huang, Zhenguo Li | N/A | |
| Learning Visual Context by Comparison | Minchul Kim, Jongchan Park, Seil Na, Chang Min Park, Donggeun Yoo | N/A | |
| Large Scale Holistic Video Understanding | Ali Diba, Mohsen Fayyaz, Vivek Sharma, Manohar Paluri, Jürgen Gall, Rainer Stiefelhagen, Luc Van Gool | N/A | |
| Indirect Local Attacks for Context-aware Semantic Segmentation Networks | Krishna Kanth Nakka, Mathieu Salzmann | N/A | |
| Predicting Visual Overlap of Images Through Interpretable Non-Metric Box Embeddings | Anita Rau, Guillermo Garcia-Hernando, Danail Stoyanov, Gabriel J. Brostow, Daniyar Turmukhambetov | N/A | |
| Connecting Vision and Language with Localized Narratives | Jordi Pont-Tuset, Jasper Uijlings, Soravit Changpinyo, Radu Soricut, Vittorio Ferrari | N/A | |
| Adversarial T-shirt! Evading Person Detectors in A Physical World | Kaidi Xu, Gaoyuan Zhang, Sijia Liu, Quanfu Fan, Mengshu Sun, Hongge Chen, Pin-Yu Chen, Yanzhi Wang, Xue Lin | N/A | |
| Bounding-box Channels for Visual Relationship Detection | Sho Inayoshi, Keita Otani, Antonio Tejero-de-Pablos, Tatsuya Harada | N/A | |
| Minimal Rolling Shutter Absolute Pose with Unknown Focal Length and Radial Distortion | Zuzana Kukelova, Cenek Albl, Akihiro Sugimoto, Konrad Schindler, Tomas Pajdla | N/A | |
| SRFlow: Learning the Super-Resolution Space with Normalizing Flow | Andreas Lugmayr, Martin Danelljan, Luc Van Gool, Radu Timofte | N/A | |
| DeepGMR: Learning Latent Gaussian Mixture Models for Registration | Wentao Yuan, Benjamin Eckart, Kihwan Kim, Varun Jampani, Dieter Fox, Jan Kautz | N/A | |
| Active Perception using Light Curtains for Autonomous Driving | Siddharth Ancha, Yaadhav Raaj, Peiyun Hu, Srinivasa G. Narasimhan, David Held | N/A | |
| Invertible Neural BRDF for Object Inverse Rendering | Zhe Chen, Shohei Nobuhara, Ko Nishino | N/A | |
| Semi-supervised Semantic Segmentation via Strong-weak Dual-branch Network | Wenfeng Luo, Meng Yang | N/A | |
| Practical Deep Raw Image Denoising on Mobile Devices | Yuzhi Wang, Haibin Huang, Qin Xu, Jiaming Liu, Yiqun Liu, Jue Wang | N/A | |
| SoundSpaces: Audio-Visual Navigation in 3D Environments | Changan Chen, Unnat Jain, Carl Schissler, Sebastia Vicenc Amengual Gari, Ziad Al-Halah, Vamsi Krishna Ithapu, Philip Robinson, Kristen Grauman | N/A | |
| Two-Stream Consensus Network for Weakly-Supervised Temporal Action Localization | Yuanhao Zhai, Le Wang, Wei Tang, Qilin Zhang, Junsong Yuan, Gang Hua | N/A | |
| Erasing Appearance Preservation in Optimization-based Smoothing | Lvmin Zhang, Chengze Li, Yi Ji, Chunping Liu, Tien-tsin Wong | N/A | |
| Counterfactual Vision-and-Language Navigation via Adversarial Path Sampler | Tsu-Jui Fu, Xin Eric Wang, Matthew F. Peterson, Scott T. Grafton, Miguel P. Eckstein, William Yang Wang | N/A | |
| Guided Deep Decoder: Unsupervised Image Pair Fusion | Tatsumi Uezato, Danfeng Hong, Naoto Yokoya, Wei He | N/A | |
| Filter Style Transfer between Photos | Jonghwa Yim, Jisung Yoo, Won-joon Do, Beomsu Kim, Jihwan Choe | N/A | |
| JGR-P2O: Joint Graph Reasoning based Pixel-to-Offset Prediction Network for 3D Hand Pose Estimation from a Single Depth Image | Linpu Fang, Xingyan Liu, Li Liu, Hang Xu, Wenxiong Kang | N/A | |
| Dynamic Group Convolution for Accelerating Convolutional Neural Networks | Zhuo Su, Linpu Fang, Wenxiong Kang, Dewen Hu, Matti Pietikäinen, Li Liu | N/A | |
| RD-GAN: Few/Zero-Shot Chinese Character Style Transfer via Radical Decomposition and Rendering | Yaoxiong Huang, Mengchao He, Lianwen Jin, Yongpan Wang | N/A | |
| Object-Contextual Representations for Semantic Segmentation | Yuhui Yuan, Xilin Chen, Jingdong Wang | N/A | |
| Efficient Spatio-Temporal Recurrent Neural Network for Video Deblurring | Zhihang Zhong, Ye Gao, Yinqiang Zheng, Bo Zheng | N/A | |
| Joint Semantic Instance Segmentation on Graphs with the Semantic Mutex Watershed | Steffen Wolf, Yuyan Li, Constantin Pape, Alberto Bailoni, Anna Kreshuk, Fred A. Hamprecht | N/A | |
| Photon-Efficient 3D Imaging with A Non-Local Neural Network | Jiayong Peng, Zhiwei Xiong, Xin Huang, Zheng-Ping Li, Dong Liu, Feihu Xu | N/A | |
| GeLaTO: Generative Latent Textured Objects | Ricardo Martin-Brualla, Rohit Pandey, Sofien Bouaziz, Matthew Brown, Dan B Goldman | N/A | |
| Improving Vision-and-Language Navigation with Image-Text Pairs from the Web | Arjun Majumdar, Ayush Shrivastava, Stefan Lee, Peter Anderson, Devi Parikh, Dhruv Batra | N/A | |
| Directional Temporal Modeling for Action Recognition | Xinyu Li, Bing Shuai, Joseph Tighe | N/A | |
| Shonan Rotation Averaging: Global Optimality by Surfing SO(p)^n | Frank Dellaert, David M. Rosen, Jing Wu, Robert Mahony, Luca Carlone | N/A | |
| Semantic Curiosity for Active Visual Learning | Devendra Singh Chaplot, Helen Jiang, Saurabh Gupta, Abhinav Gupta | N/A | |
| Multi-Temporal Recurrent Neural Networks For Progressive Non-Uniform Single Image Deblurring With Incremental Temporal Training | Dongwon Park, Dong Un Kang, Jisoo Kim, Se Young Chun | N/A | |
| ProgressFace: Scale-Aware Progressive Learning for Face Detection | Jiashu Zhu, Dong Li, Tiantian Han, Lu Tian, Yi Shan | N/A | |
| Learning Multi-layer Latent Variable Model via Variational Optimization of Short Run MCMC for Approximate Inference | Erik Nijkamp, Bo Pang, Tian Han, Linqi Zhou, Song-Chun Zhu, Ying Nian Wu | N/A | |
| CoTeRe-Net: Discovering Collaborative Ternary Relations in Videos | Zhensheng Shi, Cheng Guan, Liangjie Cao, Qianqian Li, Ju Liang, Zhaorui Gu, Haiyong Zheng, Bing Zheng | N/A | |
| Modeling the Effects of Windshield Refraction for Camera Calibration | Frank Verbiest, Marc Proesmans, Luc Van Gool | N/A | |
| Unsupervised Domain Adaptation for Semantic Segmentation of NIR Images through Generative Latent Search | Prashant Pandey, Aayush Kumar Tyagi, Sameer Ambekar, Prathosh AP | N/A | |
| PROFIT: A Novel Training Method for sub-4-bit MobileNet Models | Eunhyeok Park, Sungjoo Yoo | N/A | |
| Visual Relation Grounding in Videos | Junbin Xiao, Xindi Shang, Xun Yang, Sheng Tang, Tat-Seng Chua | N/A | |
| Weakly Supervised 3D Human Pose and Shape Reconstruction with Normalizing Flows | Andrei Zanfir, Eduard Gabriel Bazavan, Hongyi Xu, William T. Freeman, Rahul Sukthankar, Cristian Sminchisescu | N/A | |
| Controlling Style and Semantics in Weakly-Supervised Image Generation | Dario Pavllo, Aurelien Lucchi, Thomas Hofmann | N/A | |
| Jointly learning visual motion and confidence from local patches in event cameras | Daniel R. Kepple, Daewon Lee, Colin Prepsius, Volkan Isler, Il Memming Park, Daniel D. Lee | N/A | |
| SODA: Story Oriented Dense Video Captioning Evaluation Framework | Soichiro Fujita, Tsutomu Hirao, Hidetaka Kamigaito, Manabu Okumura, Masaaki Nagata | N/A | |
| Sketch-Guided Object Localization in Natural Images | Aditay Tripathi, Rajath R. Dani, Anand Mishra, Anirban Chakraborty | N/A | |
| A unifying mutual information view of metric learning: cross-entropy vs. pairwise losses | Malik Boudiaf, Jérôme Rony, Imtiaz Masud Ziko, Eric Granger, Marco Pedersoli, Pablo Piantanida, Ismail Ben Ayed | N/A | |
| Behind the Scene: Revealing the Secrets of Pre-trained Vision-and-Language Models | Jize Cao, Zhe Gan, Yu Cheng, Licheng Yu, Yen-Chun Chen, Jingjing Liu | N/A | |
| The Hessian Penalty: A Weak Prior for Unsupervised Disentanglement | William Peebles, John Peebles, Jun-Yan Zhu, Alexei Efros, Antonio Torralba | N/A | |
| STAR: Sparse Trained Articulated Human Body Regressor | Ahmed A. A. Osman, Timo Bolkart, Michael J. Black | N/A | |
| Optical Flow Distillation: Towards Efficient and Stable Video Style Transfer | Xinghao Chen, Yiman Zhang, Yunhe Wang, Han Shu, Chunjing Xu, Chang Xu | N/A | |
| Collaboration by Competition: Self-coordinated Knowledge Amalgamation for Multi-talent Student Learning | Sihui Luo, Wenwen Pan, Xinchao Wang, Dazhou Wang, Haihong Tang, Mingli Song | N/A | |
| Do Not Disturb Me: Person Re-identification Under the Interference of Other Pedestrians | Shizhen Zhao, Changxin Gao, Jun Zhang, Hao Cheng, Chuchu Han, Xinyang Jiang, Xiaowei Guo, Wei-Shi Zheng, Nong Sang, Xing Sun | N/A | |
| Learning 3D Part Assembly from a Single Image | Yichen Li, Kaichun Mo, Lin Shao, Minhyuk Sung, Leonidas Guibas | N/A | |
| PT2PC: Learning to Generate 3D Point Cloud Shapes from Part Tree Conditions | Kaichun Mo, He Wang, Xinchen Yan, Leonidas Guibas | N/A | |
| Highly Efficient Salient Object Detection with 100K Parameters | Shang-Hua Gao, Yong-Qiang Tan, Ming-Ming Cheng, Chengze Lu, Yunpeng Chen, Shuicheng Yan | N/A | |
| HardGAN: A Haze-Aware Representation Distillation GAN for Single Image Dehazing | Qili Deng, Ziling Huang, Chung-Chi Tsai, Chia-Wen Lin | N/A | |
| Lifespan Age Transformation Synthesis | Roy Or-El, Soumyadip Sengupta, Ohad Fried, Eli Shechtman, Ira Kemelmacher-Shlizerman | N/A | |
| Domain2Vec: Domain Embedding for Unsupervised Domain Adaptation | Xingchao Peng, Yichen Li, Kate Saenko | N/A | |
| Simulating Content Consistent Vehicle Datasets with Attribute Descent | Yue Yao, Liang Zheng, Xiaodong Yang, Milind Naphade, Tom Gedeon | N/A | |
| Multiview Detection with Feature Perspective Transformation | Yunzhong Hou, Liang Zheng, Stephen Gould | N/A | |
| Learning Object Relation Graph and Tentative Policy for Visual Navigation | Heming Du, Xin Yu, Liang Zheng | N/A | |
| Adversarial Self-Supervised Learning for Semi-Supervised 3D Action Recognition | Chenyang Si, Xuecheng Nie, Wei Wang, Liang Wang, Tieniu Tan, Jiashi Feng | N/A | |
| Across Scales & Across Dimensions: Temporal Super-Resolution using Deep Internal Learning | Liad Pollak Zuckerman, Eyal Naor, George Pisha, Shai Bagon, Michal Irani | N/A | |
| Inducing Optimal Attribute Representations for Conditional GANs | Binod Bhattarai, Tae-Kyun Kim | N/A | |
| AR-Net: Adaptive Frame Resolution for Efficient Action Recognition | Yue Meng, Chung-Ching Lin, Rameswar Panda, Prasanna Sattigeri, Leonid Karlinsky, Aude Oliva, Kate Saenko, Rogerio Feris | N/A | |
| Image-to-Voxel Model Translation for 3D Scene Reconstruction and Segmentation | Vladimir V. Kniaz, Vladimir A. Knyaz, Fabio Remondino, Artem Bordodymov, Petr Moshkantsev | N/A | |
| Consistency Guided Scene Flow Estimation | Yuhua Chen, Luc Van Gool, Cordelia Schmid, Cristian Sminchisescu | N/A | |
| Autoregressive Unsupervised Image Segmentation | Yassine Ouali, Céline Hudelot, Myriam Tami | N/A | |
| Controllable Image Synthesis via SegVAE | Yen-Chi Cheng, Hsin-Ying Lee, Min Sun, Ming-Hsuan Yang | N/A | |
| Off-Policy Reinforcement Learning for Efficient and Effective GAN Architecture Search | Yuan Tian, Qin Wang, Zhiwu Huang, Wen Li, Dengxin Dai, Minghao Yang, Jun Wang, Olga Fink | N/A | |
| Efficient Non-Line-of-Sight Imaging from Transient Sinograms | Mariko Isogawa, Dorian Chan, Ye Yuan, Kris Kitani, Matthew O’Toole | N/A | |
| Texture Hallucination for Large-Factor Painting Super-Resolution | Yulun Zhang, Zhifei Zhang, Stephen DiVerdi, Zhaowen Wang, Jose Echevarria, Yun Fu | N/A | |
| Learning Progressive Joint Propagation for Human Motion Prediction | Yujun Cai, Lin Huang, Yiwei Wang, Tat-Jen Cham, Jianfei Cai, Junsong Yuan, Jun Liu, Xu Yang, Yiheng Zhu, Xiaohui Shen, Ding Liu, Jing Liu, Nadia Magnenat Thalmann | N/A | |
| Image Stitching and Rectification for Hand-Held Cameras | Bingbing Zhuang, Quoc-Huy Tran | N/A | |
| ParSeNet: A Parametric Surface Fitting Network for 3D Point Clouds | Gopal Sharma, Difan Liu, Subhransu Maji, Evangelos Kalogerakis, Siddhartha Chaudhuri, Radomír Měch | N/A | |
| The Group Loss for Deep Metric Learning | Ismail Elezi, Sebastiano Vascon, Alessandro Torcinovich, Marcello Pelillo, Laura Leal-Taixé | N/A | |
| Learning Object Depth from Camera Motion and Video Object Segmentation | Brent A. Griffin, Jason J. Corso | N/A | |
| OnlineAugment: Online Data Augmentation with Less Domain Knowledge | Zhiqiang Tang, Yunhe Gao, Leonid Karlinsky, Prasanna Sattigeri, Rogerio Feris, Dimitris Metaxas | N/A | |
| Learning Pairwise Inter-Plane Relations for Piecewise Planar Reconstruction | Yiming Qian, Yasutaka Furukawa | N/A | |
| Intra-class Feature Variation Distillation for Semantic Segmentation | Yukang Wang, Wei Zhou, Tao Jiang, Xiang Bai, Yongchao Xu | N/A | |
| Temporal Distinct Representation Learning for Action Recognition | Junwu Weng, Donghao Luo, Yabiao Wang, Ying Tai, Chengjie Wang, Jilin Li, Feiyue Huang, Xudong Jiang, Junsong Yuan | N/A | |
| Representative Graph Neural Network | Changqian Yu, Yifan Liu, Changxin Gao, Chunhua Shen, Nong Sang | N/A | |
| Deformation-Aware 3D Model Embedding and Retrieval | Mikaela Angelina Uy, Jingwei Huang, Minhyuk Sung, Tolga Birdal, Leonidas Guibas | N/A | |
| Atlas: End-to-End 3D Scene Reconstruction from Posed Images | Zak Murez, Tarrence van As, James Bartolozzi, Ayan Sinha, Vijay Badrinarayanan, Andrew Rabinovich | N/A | |
| Multiple Class Novelty Detection Under Data Distribution Shift | Poojan Oza, Hien V. Nguyen, Vishal M. Patel | N/A | |
| Colorization of Depth Map via Disentanglement | Chung-Sheng Lai, Zunzhi You, Ching-Chun Huang, Yi-Hsuan Tsai, Wei-Chen Chiu | N/A | |
| Beyond Controlled Environments: 3D Camera Re-Localization in Changing Indoor Scenes | Johanna Wald, Torsten Sattler, Stuart Golodetz, Tommaso Cavallari, Federico Tombari | N/A | |
| GeoGraph: Graph-based multi-view object detection with geometric cues end-to-end | Ahmed Samy Nassar, Stefano D’Aronco, Sébastien Lefèvre, Jan D. Wegner | N/A | |
| Localizing the Common Action Among a Few Videos | Pengwan Yang, Vincent Tao Hu, Pascal Mettes, Cees G. M. Snoek | N/A | |
| TAFSSL: Task-Adaptive Feature Sub-Space Learning for few-shot classification | Moshe Lichtenstein, Prasanna Sattigeri, Rogerio Feris, Raja Giryes, Leonid Karlinsky | N/A | |
| Traffic Accident Benchmark for Causality Recognition | Tackgeun You, Bohyung Han | N/A | |
| Face Anti-Spoofing with Human Material Perception | Zitong Yu, Xiaobai Li, Xuesong Niu, Jingang Shi, Guoying Zhao | N/A | |
| How Can I See My Future? FvTraj: Using First-person View for Pedestrian Trajectory Prediction | Huikun Bi, Ruisi Zhang, Tianlu Mao, Zhigang Deng, Zhaoqi Wang | N/A | |
| Multiple Expert Brainstorming for Domain Adaptive Person Re-identification | Yunpeng Zhai, Qixiang Ye, Shijian Lu, Mengxi Jia, Rongrong Ji, Yonghong Tian | N/A | |
| NASA: Neural Articulated Shape Approximation | Boyang Deng, JP Lewis, Timothy Jeruzalski, Gerard Pons-Moll, Geoffrey Hinton, Mohammad Norouzi, Andrea Tagliasacchi | N/A | |
| Towards Unique and Informative Captioning of Images | Zeyu Wang, Berthy Feng, Karthik Narasimhan, Olga Russakovsky | N/A | |
| When Does Self-supervision Improve Few-shot Learning? | Jong-Chyi Su, Subhransu Maji, Bharath Hariharan | N/A | |
| Two-branch Recurrent Network for Isolating Deepfakes in Videos | Iacopo Masi, Aditya Killekar, Royston Marian Mascarenhas, Shenoy Pratik Gurudatt, Wael AbdAlmageed | N/A | |
| Incremental Few-Shot Meta-Learning via Indirect Discriminant Alignment | Qing Liu, Orchid Majumder, Alessandro Achille, Avinash Ravichandran, Rahul Bhotika, Stefano Soatto | N/A | |
| BigNAS: Scaling Up Neural Architecture Search with Big Single-Stage Models | Jiahui Yu, Pengchong Jin, Hanxiao Liu, Gabriel Bender, Pieter-Jan Kindermans, Mingxing Tan, Thomas Huang, Xiaodan Song, Ruoming Pang, Quoc Le | N/A | |
| Differentiable Hierarchical Graph Grouping for Multi-Person Pose Estimation | Sheng Jin, Wentao Liu, Enze Xie, Wenhai Wang, Chen Qian, Wanli Ouyang, Ping Luo | N/A | |
| Global Distance-distributions Separation for Unsupervised Person Re-identification | Xin Jin, Cuiling Lan, Wenjun Zeng, Zhibo Chen | N/A | |
| I2L-MeshNet: Image-to-Lixel Prediction Network for Accurate 3D Human Pose and Mesh Estimation from a Single RGB Image | Gyeongsik Moon, Kyoung Mu Lee | N/A | |
| Pose2Mesh: Graph Convolutional Network for 3D Human Pose and Mesh Recovery from a 2D Human Pose | Hongsuk Choi, Gyeongsik Moon, Kyoung Mu Lee | N/A | |
| ALRe: Outlier Detection for Guided Refinement | Mingzhu Zhu, Zhang Gao, Junzhi Yu, Bingwei He, Jiantao Liu | N/A | |
| Weakly-Supervised Crowd Counting Learns from Sorting rather than Locations | Yifan Yang, Guorong Li, Zhe Wu, Li Su, Qingming Huang, Nicu Sebe | N/A | |
| Unsupervised Domain Attention Adaptation Network for Caricature Attribute Recognition | Wen Ji, Kelei He, Jing Huo, Zheng Gu, Yang Gao | N/A | |
| Many-shot from Low-shot: Learning to Annotate using Mixed Supervision for Object Detection | Carlo Biffi, Steven McDonagh, Philip Torr, Aleš Leonardis, Sarah Parisot | N/A | |
| Curriculum DeepSDF | Yueqi Duan, Haidong Zhu, He Wang, Li Yi, Ram Nevatia, Leonidas J. Guibas | N/A | |
| Meshing Point Clouds with Predicted Intrinsic-Extrinsic Ratio Guidance | Minghua Liu, Xiaoshuai Zhang, Hao Su | N/A | |
| Improved Adversarial Training via Learned Optimizer | Yuanhao Xiong, Cho-Jui Hsieh | N/A | |
| Component Divide-and-Conquer for Real-World Image Super-Resolution | Pengxu Wei, Ziwei Xie, Hannan Lu, Zongyuan Zhan, Qixiang Ye, Wangmeng Zuo, Liang Lin | N/A | |
| Enabling Deep Residual Networks for Weakly Supervised Object Detection | Yunhang Shen, Rongrong Ji, Yan Wang, Zhiwei Chen, Feng Zheng, Feiyue Huang, Yunsheng Wu | N/A | |
| Deep near-light photometric stereo for spatially varying reflectances | Hiroaki Santo, Michael Waechter, Yasuyuki Matsushita | N/A | |
| Learning Visual Representations with Caption Annotations | Mert Bulent Sariyildiz, Julien Perez, Diane Larlus | N/A | |
| Solving Long-tailed Recognition with Deep Realistic Taxonomic Classifier | Tz-Ying Wu, Pedro Morgado, Pei Wang, Chih-Hui Ho, Nuno Vasconcelos | N/A | |
| Regression of Instance Boundary by Aggregated CNN and GCN | Yanda Meng, Wei Meng, Dongxu Gao, Yitian Zhao, Xiaoyun Yang, Xiaowei Huang, Yalin Zheng | N/A | |
| Social Adaptive Module for Weakly-supervised Group Activity Recognition | Rui Yan, Lingxi Xie, Jinhui Tang, Xiangbo Shu, Qi Tian | N/A | |
| RGB-D Salient Object Detection with Cross-Modality Modulation and Selection | Chongyi Li, Runmin Cong, Yongri Piao, Qianqian Xu, Chen Change Loy | N/A | |
| RetrieveGAN: Image Synthesis via Differentiable Patch Retrieval | Hung-Yu Tseng, Hsin-Ying Lee, Lu Jiang, Ming-Hsuan Yang, Weilong Yang | N/A | |
| Cheaper Pre-training Lunch: An Efficient Paradigm for Object Detection | Dongzhan Zhou, Xinchi Zhou, Hongwen Zhang, Shuai Yi, Wanli Ouyang | N/A | |
| Faster Person Re-Identification | Guan’an Wang, Shaogang Gong, Jian Cheng, Zengguang Hou | N/A | |
| Quantization Guided JPEG Artifact Correction | Max Ehrlich, Ser-Nam Lim, Larry Davis, Abhinav Shrivastava | N/A | |
| 3PointTM: Faster Measurement of High-Dimensional Transmission Matrices | Yujun Chen, Manoj Kumar Sharma, Ashutosh Sabharwal, Ashok Veeraraghavan, Aswin C. Sankaranarayanan | N/A | |
| Joint Bilateral Learning for Real-time Universal Photorealistic Style Transfer | Xide Xia, Meng Zhang, Tianfan Xue, Zheng Sun, Hui Fang, Brian Kulis, Jiawen Chen | N/A | |
| Beyond 3DMM Space: Towards Fine-grained 3D Face Reconstruction | Xiangyu Zhu, Fan Yang, Di Huang, Chang Yu, Hao Wang, Jianzhu Guo, Zhen Lei, Stan Z. Li | N/A | |
| World-Consistent Video-to-Video Synthesis | Arun Mallya, Ting-Chun Wang, Karan Sapra, Ming-Yu Liu | N/A | |
| Commonality-Parsing Network across Shape and Appearance for Partially Supervised Instance Segmentation | Qi Fan, Lei Ke, Wenjie Pei, Chi-Keung Tang, Yu-Wing Tai | N/A | |
| GMNet: Graph Matching Network for Large Scale Part Semantic Segmentation in the Wild | Umberto Michieli, Edoardo Borsato, Luca Rossi, Pietro Zanuttigh | N/A | |
| Event-based Asynchronous Sparse Convolutional Networks | Nico Messikommer, Daniel Gehrig, Antonio Loquercio, Davide Scaramuzza | N/A | |
| AtlantaNet: Inferring the 3D Indoor Layout from a Single 360° Image beyond the Manhattan World Assumption | Giovanni Pintore, Marco Agus, Enrico Gobbetti | N/A | |
| AttentionNAS: Spatiotemporal Attention Cell Search for Video Classification | Xiaofang Wang, Xuehan Xiong, Maxim Neumann, AJ Piergiovanni, Michael S. Ryoo, Anelia Angelova, Kris M. Kitani, Wei Hua | N/A | |
| REMIND Your Neural Network to Prevent Catastrophic Forgetting | Tyler L. Hayes, Kushal Kafle, Robik Shrestha, Manoj Acharya, Christopher Kanan | N/A | |
| Image Classification in the Dark using Quanta Image Sensors | Abhiram Gnanasambandam, Stanley H. Chan | N/A | |
| n-Reference Transfer Learning for Saliency Prediction | Yan Luo, Yongkang Wong, Mohan S. Kankanhalli, Qi Zhao | N/A | |
| Progressively Guided Alternate Refinement Network for RGB-D Salient Object Detection | Shuhan Chen, Yun Fu | N/A | |
| Bottom-Up Temporal Action Localization with Mutual Regularization | Peisen Zhao, Lingxi Xie, Chen Ju, Ya Zhang, Yanfeng Wang, Qi Tian | N/A | |
| On Modulating the Gradient for Meta-Learning | Christian Simon, Piotr Koniusz, Richard Nock, Mehrtash Harandi | N/A | |
| Domain-Specific Mappings for Generative Adversarial Style Transfer | Hsin-Yu Chang, Zhixiang Wang, Yung-Yu Chuang | N/A | |
| DiVA: Diverse Visual Feature Aggregation for Deep Metric Learning | Timo Milbich, Karsten Roth, Homanga Bharadhwaj, Samarth Sinha, Yoshua Bengio, Björn Ommer, Joseph Paul Cohen | N/A | |
| DHP: Differentiable Meta Pruning via HyperNetworks | Yawei Li, Shuhang Gu, Kai Zhang, Luc Van Gool, Radu Timofte | N/A | |
| Deep Transferring Quantization | Zheng Xie, Zhiquan Wen, Jing Liu, Zhiqiang Liu, Xixian Wu, Mingkui Tan | N/A | |
| Deep Credible Metric Learning for Unsupervised Domain Adaptation Person Re-identification | Guangyi Chen, Yuhao Lu, Jiwen Lu, Jie Zhou | N/A | |
| Temporal Coherence or Temporal Motion: Which is More Critical for Video-based Person Re-identification? | Guangyi Chen, Yongming Rao, Jiwen Lu, Jie Zhou | N/A | |
| Arbitrary-Oriented Object Detection with Circular Smooth Label | Xue Yang, Junchi Yan | N/A | |
| Learning Event-Driven Video Deblurring and Interpolation | Songnan Lin, Jiawei Zhang, Jinshan Pan, Zhe Jiang, Dongqing Zou, Yongtian Wang, Jing Chen, Jimmy Ren | N/A | |
| Vectorizing World Buildings: Planar Graph Reconstruction by Primitive Detection and Relationship Inference | Nelson Nauata, Yasutaka Furukawa | N/A | |
| Learning to Combine: Knowledge Aggregation for Multi-Source Domain Adaptation | Hang Wang, Minghao Xu, Bingbing Ni, Wenjun Zhang | N/A | |
| CSCL: Critical Semantic-Consistent Learning for Unsupervised Domain Adaptation | Jiahua Dong, Yang Cong, Gan Sun, Yuyang Liu, Xiaowei Xu | N/A | |
| Prototype Mixture Models for Few-shot Semantic Segmentation | Boyu Yang, Chang Liu, Bohao Li, Jianbin Jiao, Qixiang Ye | N/A | |
| Webly Supervised Image Classification with Self-Contained Confidence | Jingkang Yang, Litong Feng, Weirong Chen, Xiaopeng Yan, Huabin Zheng, Ping Luo, Wayne Zhang | N/A | |
| Search What You Want: Barrier Panelty NAS for Mixed Precision Quantization | Haibao Yu, Qi Han, Jianbo Li, Jianping Shi, Guangliang Cheng, Bin Fan | N/A | |
| Monocular 3D Object Detection via Feature Domain Adaptation | Xiaoqing Ye, Liang Du, Yifeng Shi, Yingying Li, Xiao Tan, Jianfeng Feng, Errui Ding, Shilei Wen | N/A | |
| Talking-head Generation with Rhythmic Head Motion | Lele Chen, Guofeng Cui, Celong Liu, Zhong Li, Ziyi Kou, Yi Xu, Chenliang Xu | N/A | |
| AUTO3D: Novel view synthesis through unsupervisely learned variational viewpoint and global 3D representation | Xiaofeng Liu, Tong Che, Yiqun Lu, Chao Yang, Site Li, Jane You | N/A | |
| VPN: Learning Video-Pose Embedding for Activities of Daily Living | Srijan Das, Saurav Sharma, Rui Dai, François Brémond, Monique Thonnat | N/A | |
| Soft Anchor-Point Object Detection | Chenchen Zhu, Fangyi Chen, Zhiqiang Shen, Marios Savvides | N/A | |
| Beyond Fixed Grid: Learning Geometric Image Representation with a Deformable Grid | Jun Gao, Zian Wang, Jinchen Xuan, Sanja Fidler | N/A | |
| Soft Expert Reward Learning for Vision-and-Language Navigation | Hu Wang, Qi Wu, Chunhua Shen | N/A | |
| Part-aware Prototype Network for Few-shot Semantic Segmentation | Yongfei Liu, Xiangyi Zhang, Songyang Zhang, Xuming He | N/A | |
| Learning from Extrinsic and Intrinsic Supervisions for Domain Generalization | Shujun Wang, Lequan Yu, Caizi Li, Chi-Wing Fu, Pheng-Ann Heng | N/A | |
| Joint Learning of Social Groups, Individuals Action and Sub-group Activities in Videos | Mahsa Ehsanpour, Alireza Abedin, Fatemeh Saleh, Javen Shi, Ian Reid, Hamid Rezatofighi | N/A | |
| Whole-Body Human Pose Estimation in the Wild | Sheng Jin, Lumin Xu, Jin Xu, Can Wang, Wentao Liu, Chen Qian, Wanli Ouyang, Ping Luo | N/A | |
| Relative Pose Estimation of Calibrated Cameras with Known SE(3) Invariants | Bo Li, Evgeniy Martyushev, Gim Hee Lee | N/A | |
| Sequential Convolution and Runge-Kutta Residual Architecture for Image Compressed Sensing | Runkai Zheng, Yinqi Zhang, Daolang Huang, Qingliang Chen | N/A | |
| Deep Hough Transform for Semantic Line Detection | Qi Han, Kai Zhao, Jun Xu, Ming-Ming Cheng | N/A | |
| Structured Landmark Detection via Topology-Adapting Deep Graph Learning | Weijian Li, Yuhang Lu, Kang Zheng, Haofu Liao, Chihung Lin, Jiebo Luo, Chi-Tung Cheng, Jing Xiao, Le Lu, Chang-Fu Kuo, Shun Miao | N/A | |
| 3D Human Shape and Pose from a Single Low-Resolution Image with Self-Supervised Learning | Xiangyu Xu, Hao Chen, Francesc Moreno-Noguer, László A. Jeni, Fernando De la Torre | N/A | |
| Learning to Balance Specificity and Invariance for In and Out of Domain Generalization | Prithvijit Chattopadhyay, Yogesh Balaji, Judy Hoffman | N/A | |
| Contrastive Learning for Unpaired Image-to-Image Translation | Taesung Park, Alexei A. Efros, Richard Zhang, Jun-Yan Zhu | N/A | |
| DLow: Diversifying Latent Flows for Diverse Human Motion Prediction | Ye Yuan, Kris Kitani | N/A | |
| GRNet: Gridding Residual Network for Dense Point Cloud Completion | Haozhe Xie, Hongxun Yao, Shangchen Zhou, Jiageng Mao, Shengping Zhang, Wenxiu Sun | N/A | |
| Gait Lateral Network: Learning Discriminative and Compact Representations for Gait Recognition | Saihui Hou, Chunshui Cao, Xu Liu, Yongzhen Huang | N/A | |
| Blind Face Restoration via Deep Multi-scale Component Dictionaries | Xiaoming Li, Chaofeng Chen, Shangchen Zhou, Xianhui Lin, Wangmeng Zuo, Lei Zhang | N/A | |
| Robust Neural Networks inspired by Strong Stability Preserving Runge-Kutta methods | Byungjoo Kim, Bryce Chudomelka, Jinyoung Park, Jaewoo Kang, Youngjoon Hong, Hyunwoo J. Kim | N/A | |
| Inequality-Constrained and Robust 3D Face Model Fitting | Evangelos Sariyanidi, Casey J. Zampella, Robert T. Schultz, Birkan Tunc | N/A | |
| Gabor Layers Enhance Network Robustness | Juan C. Pérez, Motasem Alfarra, Guillaume Jeanneret, Adel Bibi, Ali Thabet, Bernard Ghanem, Pablo Arbeláez | N/A | |
| Conditional Image Repainting via Semantic Bridge and Piecewise Value Function | Shuchen Weng, Wenbo Li, Dawei Li, Hongxia Jin, Boxin Shi | N/A | |
| Learnable Cost Volume Using the Cayley Representation | Taihong Xiao, Jinwei Yuan, Deqing Sun, Qifei Wang, Xin-Yu Zhang, Kehan Xu, Ming-Hsuan Yang | N/A | |
| HALO: Hardware-Aware Learning to Optimize | Chaojian Li, Tianlong Chen, Haoran You, Zhangyang Wang, Yingyan Lin | N/A | |
| Structured3D: A Large Photo-realistic Dataset for Structured 3D Modeling | Jia Zheng, Junfei Zhang, Jing Li, Rui Tang, Shenghua Gao, Zihan Zhou | N/A | |
| BroadFace: Looking at Tens of Thousands of People at Once for Face Recognition | Yonghyun Kim, Wonpyo Park, Jongju Shin | N/A | |
| Interpretable Visual Reasoning via Probabilistic Formulation under Natural Supervision | Xinzhe Han, Shuhui Wang, Chi Su, Weigang Zhang, Qingming Huang, Qi Tian | N/A | |
| Domain Adaptive Semantic Segmentation Using Weak Labels | Sujoy Paul, Yi-Hsuan Tsai, Samuel Schulter, Amit K. Roy-Chowdhury, Manmohan Chandraker | N/A | |
| Knowledge Distillation Meets Self-Supervision | Guodong Xu, Ziwei Liu, Xiaoxiao Li, Chen Change Loy | N/A | |
| Efficient Neighbourhood Consensus Networks via Submanifold Sparse Convolutions | Ignacio Rocco, Relja Arandjelović, Josef Sivic | N/A | |
| Reconstructing the Noise Variance Manifold for Image Denoising | Ioannis Marras, Grigorios G. Chrysos, Ioannis Alexiou, Gregory Slabaugh, Stefanos Zafeiriou | N/A | |
| Occlusion-Aware Depth Estimation with Adaptive Normal Constraints | Xiaoxiao Long, Lingjie Liu, Christian Theobalt, Wenping Wang | N/A | |
| VisualEchoes: Spatial Image Representation Learning through Echolocation | Ruohan Gao, Changan Chen, Ziad Al-Halah, Carl Schissler, Kristen Grauman | N/A | |
| Smooth-AP: Smoothing the Path Towards Large-Scale Image Retrieval | Andrew Brown, Weidi Xie, Vicky Kalogeiton, Andrew Zisserman | N/A | |
| Naive-Student: Leveraging Semi-Supervised Learning in Video Sequences for Urban Scene Segmentation | Liang-Chieh Chen, Raphael Gontijo Lopes, Bowen Cheng, Maxwell D. Collins, Ekin D. Cubuk, Barret Zoph, Hartwig Adam, Jonathon Shlens | N/A | |
| Spatially Aware Multimodal Transformers for TextVQA | Yash Kant, Dhruv Batra, Peter Anderson, Alexander Schwing, Devi Parikh, Jiasen Lu, Harsh Agrawal | N/A | |
| Every Pixel Matters: Center-aware Feature Alignment for Domain Adaptive Object Detector | Cheng-Chun Hsu, Yi-Hsuan Tsai, Yen-Yu Lin, Ming-Hsuan Yang | N/A | |
| URIE: Universal Image Enhancement for Visual Recognition in the Wild | Taeyoung Son, Juwon Kang, Namyup Kim, Sunghyun Cho, Suha Kwak | N/A | |
| Pyramid Multi-view Stereo Net with Self-adaptive View Aggregation | Hongwei Yi, Zizhuang Wei, Mingyu Ding, Runze Zhang, Yisong Chen, Guoping Wang, Yu-Wing Tai | N/A | |
| SPL-MLL: Selecting Predictable Landmarks for Multi-Label Learning | Junbing Li, Changqing Zhang, Pengfei Zhu, Baoyuan Wu, Lei Chen, Qinghua Hu | N/A | |
| Unpaired Image-to-Image Translation using Adversarial Consistency Loss | Yihao Zhao, Ruihai Wu, Hao Dong | N/A | |
| Discriminability Distillation in Group Representation Learning | Manyuan Zhang, Guanglu Song, Hang Zhou, Yu Liu | N/A | |
| Monocular Expressive Body Regression through Body-Driven Attention | Vasileios Choutas, Georgios Pavlakos, Timo Bolkart, Dimitrios Tzionas, Michael J. Black | N/A | |
| Dual Adversarial Network: Toward Real-world Noise Removal and Noise Generation | Zongsheng Yue, Qian Zhao, Lei Zhang, Deyu Meng | N/A | |
| Linguistic Structure Guided Context Modeling for Referring Image Segmentation | Tianrui Hui, Si Liu, Shaofei Huang, Guanbin Li, Sansi Yu, Faxi Zhang, Jizhong Han | N/A | |
| Federated Visual Classification with Real-World Data Distribution | Tzu-Ming Harry Hsu, Hang Qi, Matthew Brown | N/A | |
| Robust Re-Identification by Multiple Views Knowledge Distillation | Angelo Porrello, Luca Bergamini, Simone Calderara | N/A | |
| Defocus Deblurring Using Dual-Pixel Data | Abdullah Abuolaim, Michael S. Brown | N/A | |
| RhyRNN: Rhythmic RNN for Recognizing Events in Long and Complex Videos | Tianshu Yu, Yikang Li, Baoxin Li | N/A | |
| Take an Emotion Walk: Perceiving Emotions from Gaits Using Hierarchical Attention Pooling and Affective Mapping | Uttaran Bhattacharya, Christian Roncal, Trisha Mittal, Rohan Chandra, Kyra Kapsaskis, Kurt Gray, Aniket Bera, Dinesh Manocha | N/A | |
| Weighing Counts: Sequential Crowd Counting by Reinforcement Learning | Liang Liu, Hao Lu, Hongwei Zou, Haipeng Xiong, Zhiguo Cao, Chunhua Shen | N/A | |
| Reflection Backdoor: A Natural Backdoor Attack on Deep Neural Networks | Yunfei Liu, Xingjun Ma, James Bailey, Feng Lu | N/A | |
| Learning to Learn with Variational Information Bottleneck for Domain Generalization | Yingjun Du, Jun Xu, Huan Xiong, Qiang Qiu, Xiantong Zhen, Cees G. M. Snoek, Ling Shao | N/A | |
| Deep Positional and Relational Feature Learning for Rotation-Invariant Point Cloud Analysis | Ruixuan Yu, Xin Wei, Federico Tombari, Jian Sun | N/A | |
| Thanks for Nothing: Predicting Zero-Valued Activations with Lightweight Convolutional Neural Networks | Gil Shomron, Ron Banner, Moran Shkolnik, Uri Weiser | N/A | |
| Layered Neighborhood Expansion for Incremental Multiple Graph Matching | Zixuan Chen, Zhihui Xie, Junchi Yan, Yinqiang Zheng, Xiaokang Yang | N/A | |
| SCAN: Learning to Classify Images without Labels | Wouter Van Gansbeke, Simon Vandenhende, Stamatios Georgoulis, Marc Proesmans, Luc Van Gool | N/A | |
| Graph convolutional networks for learning with few clean and many noisy labels | Ahmet Iscen, Giorgos Tolias, Yannis Avrithis, Ondřej Chum, Cordelia Schmid | N/A | |
| Object-and-Action Aware Model for Visual Language Navigation | Yuankai Qi, Zizheng Pan, Shengping Zhang, Anton van den Hengel, Qi Wu | N/A | |
| A Comprehensive Study of Weight Sharing in Graph Networks for 3D Human Pose Estimation | Kenkun Liu, Rongqi Ding, Zhiming Zou, Le Wang, Wei Tang | N/A | |
| MuCAN: Multi-Correspondence Aggregation Network for Video Super-Resolution | Wenbo Li, Xin Tao, Taian Guo, Lu Qi, Jiangbo Lu, Jiaya Jia | N/A | |
| Efficient Semantic Video Segmentation with Per-frame Inference | Yifan Liu, Chunhua Shen, Changqian Yu, Jingdong Wang | N/A | |
| Increasing the Robustness of Semantic Segmentation Models with Painting-by-Numbers | Christoph Kamann, Carsten Rother | N/A | |
| Deep Spiking Neural Network: Energy Efficiency Through Time based Coding | Bing Han, Kaushik Roy | N/A | |
| InfoFocus: 3D Object Detection for Autonomous Driving with Dynamic Information Modeling | Jun Wang, Shiyi Lan, Mingfei Gao, Larry S. Davis | N/A | |
| Utilizing Patch-level Category Activation Patterns for Multiple Class Novelty Detection | Poojan Oza, Vishal M. Patel | N/A | |
| People as Scene Probes | Yifan Wang, Brian L. Curless, Steven M. Seitz | N/A | |
| Mapping in a Cycle: Sinkhorn Regularized Unsupervised Learning for Point Cloud Shapes | Lei Yang, Wenxi Liu, Zhiming Cui, Nenglun Chen, Wenping Wang | N/A | |
| Label-Efficient Learning on Point Clouds using Approximate Convex Decompositions | Matheus Gadelha, Aruni RoyChowdhury, Gopal Sharma, Evangelos Kalogerakis, Liangliang Cao, Erik Learned-Miller, Rui Wang, Subhransu Maji | N/A | |
| TexMesh: Reconstructing Detailed Human Texture and Geometry from RGB-D Video | Tiancheng Zhi, Christoph Lassner, Tony Tung, Carsten Stoll, Srinivasa G. Narasimhan, Minh Vo | N/A | |
| Consistency-based Semi-supervised Active Learning: Towards Minimizing Labeling Cost | Mingfei Gao, Zizhao Zhang, Guo Yu, Sercan Ö. Arık, Larry S. Davis, Tomas Pfister | N/A | |
| Point-Set Anchors for Object Detection, Instance Segmentation and Pose Estimation | Fangyun Wei, Xiao Sun, Hongyang Li, Jingdong Wang, Stephen Lin | N/A | |
| Modeling 3D Shapes by Reinforcement Learning | Cheng Lin, Tingxiang Fan, Wenping Wang, Matthias Nießner | N/A | |
| LST-Net: Learning a Convolutional Neural Network with a Learnable Sparse Transform | Lida Li, Kun Wang, Shuai Li, Xiangchu Feng, Lei Zhang | N/A | |
| Learning What Makes a Difference from Counterfactual Examples and Gradient Supervision | Damien Teney, Ehsan Abbasnedjad, Anton van den Hengel | N/A | |
| CN: Channel Normalization For Point Cloud Recognition | Zetong Yang, Yanan Sun, Shu Liu, Xiaojuan Qi, Jiaya Jia | N/A | |
| Rethinking the Defocus Blur Detection Problem and A Real-Time Deep DBD Model | Ning Zhang, Junchi Yan | N/A | |
| AutoMix: Mixup Networks for Sample Interpolation via Cooperative Barycenter Learning | Jianchao Zhu, Liangliang Shi, Junchi Yan, Hongyuan Zha | N/A | |
| Scene Text Image Super-resolution in the wild | Wenjia Wang, Enze Xie, Xuebo Liu, Wenhai Wang, Ding Liang, Chunhua Shen, Xiang Bai | N/A | |
| Coupling Explicit and Implicit Surface Representations for Generative 3D Modeling | Omid Poursaeed, Matthew Fisher, Noam Aigerman, Vladimir G. Kim | N/A | |
| Learning Disentangled Representations with Latent Variation Predictability | Xinqi Zhu, Chang Xu, Dacheng Tao | N/A | |
| Deep Space-Time Video Upsampling Networks | Jaeyeon Kang, Younghyun Jo, Seoung Wug Oh, Peter Vajda, Seon Joo Kim | N/A | |
| Large-Scale Few-Shot Learning via Multi-Modal Knowledge Discovery | Shuo Wang, Jun Yue, Jianzhuang Liu, Qi Tian, Meng Wang | N/A | |
| Fast Video Object Segmentation using the Global Context Module | Yu Li, Zhuoran Shen, Ying Shan | N/A | |
| Uncertainty-Aware Weakly Supervised Action Detection from Untrimmed Videos | Anurag Arnab, Chen Sun, Arsha Nagrani, Cordelia Schmid | N/A | |
| Selecting Relevant Features from a Multi-domain Representation for Few-shot Classification | Nikita Dvornik, Cordelia Schmid, Julien Mairal | N/A | |
| MessyTable: Instance Association in Multiple Camera Views | Zhongang Cai, Junzhe Zhang, Daxuan Ren, Cunjun Yu, Haiyu Zhao, Shuai Yi, Chai Kiat Yeo, Chen Change Loy | N/A | |
| A Unified Framework for Shot Type Classification Based on Subject Centric Lens | Anyi Rao, Jiaze Wang, Linning Xu, Xuekun Jiang, Qingqiu Huang, Bolei Zhou, Dahua Lin | N/A | |
| BSL-1K: Scaling up co-articulated sign language recognition using mouthing cues | Samuel Albanie, Gül Varol, Liliane Momeni, Triantafyllos Afouras, Joon Son Chung, Neil Fox, Andrew Zisserman | N/A | |
| HTML: A Parametric Hand Texture Model for 3D Hand Reconstruction and Personalization | Neng Qian, Jiayi Wang, Franziska Mueller, Florian Bernard, Vladislav Golyanik, Christian Theobalt | N/A | |
| CycAs: Self-supervised Cycle Association for Learning Re-identifiable Descriptions | Zhongdao Wang, Jingwei Zhang, Liang Zheng, Yixuan Liu, Yifan Sun, Yali Li, Shengjin Wang | N/A | |
| Open-Edit: Open-Domain Image Manipulation with Open-Vocabulary Instructions | Xihui Liu, Zhe Lin, Jianming Zhang, Handong Zhao, Quan Tran, Xiaogang Wang, Hongsheng Li | N/A | |
| Towards Real-Time Multi-Object Tracking | Zhongdao Wang, Liang Zheng, Yixuan Liu, Yali Li, Shengjin Wang | N/A | |
| A Balanced and Uncertainty-aware Approach for Partial Domain Adaptation | Jian Liang, Yunbo Wang, Dapeng Hu, Ran He, Jiashi Feng | N/A | |
| Unsupervised Deep Metric Learning with Transformed Attention Consistency and Contrastive Clustering Loss | Yang Li, Shichao Kan, Zhihai He | N/A | |
| STEm-Seg: Spatio-temporal Embeddings for Instance Segmentation in Videos | Ali Athar, Sabarinath Mahadevan, Aljosa Osep, Laura Leal-Taixé, Bastian Leibe | N/A | |
| Hierarchical Style-based Networks for Motion Synthesis | Jingwei Xu, Huazhe Xu, Bingbing Ni, Xiaokang Yang, Xiaolong Wang, Trevor Darrell | N/A | |
| Who Left the Dogs Out? 3D Animal Reconstruction with Expectation Maximization in the Loop | Benjamin Biggs, Oliver Boyne, James Charles, Andrew Fitzgibbon, Roberto Cipolla | N/A | |
| Learning to Count in the Crowd from Limited Labeled Data | Vishwanath A. Sindagi, Rajeev Yasarla, Deepak Sam Babu, R. Venkatesh Babu, Vishal M. Patel | N/A | |
| SPOT: Selective Point Cloud Voting for Better Proposal in Point Cloud Object Detection | Hongyuan Du, Linjun Li, Bo Liu, Nuno Vasconcelos | N/A | |
| Explainable Face Recognition | Jonathan R. Williford, Brandon B. May, Jeffrey Byrne | N/A | |
| From Shadow Segmentation to Shadow Removal | Hieu Le, Dimitris Samaras | N/A | |
| Diverse and Admissible Trajectory Prediction through Multimodal Context Understanding | Seong Hyeon Park, Gyubok Lee, Jimin Seo, Manoj Bhat, Minseok Kang, Jonathan Francis, Ashwin Jadhav, Paul Pu Liang, Louis-Philippe Morency | N/A | |
| CONFIG: Controllable Neural Face Image Generation | Marek Kowalski, Stephan J. Garbin, Virginia Estellers, Tadas Baltrušaitis, Matthew Johnson, Jamie Shotton | N/A | |
| Single View Metrology in the Wild | Rui Zhu, Xingyi Yang, Yannick Hold-Geoffroy, Federico Perazzi, Jonathan Eisenmann, Kalyan Sunkavalli, Manmohan Chandraker | N/A | |
| Procedure Planning in Instructional Videos | Chien-Yi Chang, De-An Huang, Danfei Xu, Ehsan Adeli, Li Fei-Fei, Juan Carlos Niebles | N/A | |
| Funnel Activation for Visual Recognition | Ningning Ma, Xiangyu Zhang, Jian Sun | N/A | |
| GIQA: Generated Image Quality Assessment | Shuyang Gu, Jianmin Bao, Dong Chen, Fang Wen | N/A | |
| Adversarial Continual Learning | Sayna Ebrahimi, Franziska Meier, Roberto Calandra, Trevor Darrell, Marcus Rohrbach | N/A | |
| Adapting Object Detectors with Conditional Domain Normalization | Peng Su, Kun Wang, Xingyu Zeng, Shixiang Tang, Dapeng Chen, Di Qiu, Xiaogang Wang | N/A | |
| HARD-Net: Hardness-AwaRe Discrimination Network for 3D Early Activity Prediction | Tianjiao Li, Jun Liu, Wei Zhang, Lingyu Duan | N/A | |
| Pseudo RGB-D for Self-Improving Monocular SLAM and Depth Prediction | Lokender Tiwari, Pan Ji, Quoc-Huy Tran, Bingbing Zhuang, Saket Anand, Manmohan Chandraker | N/A | |
| Interpretable and Generalizable Person Re-Identification with Query-Adaptive Convolution and Temporal Lifting | Shengcai Liao, Ling Shao | N/A | |
| Self-supervised Bayesian Deep Learning for Image Recovery with Applications to Compressive Sensing | Tongyao Pang, Yuhui Quan, Hui Ji | N/A | |
| Graph-PCNN: Two Stage Human Pose Estimation with Graph Pose Refinement | Jian Wang, Xiang Long, Yuan Gao, Errui Ding, Shilei Wen | N/A | |
| Semi-supervised Learning with a Teacher-student Network for Generalized Attribute Prediction | Minchul Shin | N/A | |
| Unsupervised Domain Adaptation with Noise Resistible Mutual-Training for Person Re-identification | Fang Zhao, Shengcai Liao, Guo-Sen Xie, Jian Zhao, Kaihao Zhang, Ling Shao | N/A | |
| DPDist: Comparing Point Clouds Using Deep Point Cloud Distance | Dahlia Urbach, Yizhak Ben-Shabat, Michael Lindenbaum | N/A | |
| Bi-directional Cross-Modality Feature Propagation with Separation-and-Aggregation Gate for RGB-D Semantic Segmentation | Xiaokang Chen, Kwan-Yee Lin, Jingbo Wang, Wayne Wu, Chen Qian, Hongsheng Li, Gang Zeng | N/A | |
| DataMix: Efficient Privacy-Preserving Edge-Cloud Inference | Zhijian Liu, Zhanghao Wu, Chuang Gan, Ligeng Zhu, Song Han | N/A | |
| Neural Re-Rendering of Humans from a Single Image | Kripasindhu Sarkar, Dushyant Mehta, Weipeng Xu, Vladislav Golyanik, Christian Theobalt | N/A | |
| Reversing the cycle: self-supervised deep stereo through enhanced monocular distillation | Filippo Aleotti, Fabio Tosi, Li Zhang, Matteo Poggi, Stefano Mattoccia | N/A | |
| PIPAL: a Large-Scale Image Quality Assessment Dataset for Perceptual Image Restoration | Jinjin Gu, Haoming Cai, Haoyu Chen, Xiaoxing Ye, Jimmy S. Ren, Chao Dong | N/A | |
| Why do These Match? Explaining the Behavior of Image Similarity Models | Bryan A. Plummer, Mariya I. Vasileva, Vitali Petsiuk, Kate Saenko, David Forsyth | N/A | |
| CooGAN: A Memory-Efficient Framework for High-Resolution Facial Attribute Editing | Xuanhong Chen, Bingbing Ni, Naiyuan Liu, Ziang Liu, Yiliu Jiang, Loc Truong, Qi Tian | N/A | |
| Progressive Transformers for End-to-End Sign Language Production | Ben Saunders, Necati Cihan Camgoz, Richard Bowden | N/A | |
| Mask TextSpotter v3: Segmentation Proposal Network for Robust Scene Text Spotting | Minghui Liao, Guan Pang, Jing Huang, Tal Hassner, Xiang Bai | N/A | |
| Making Affine Correspondences Work in Camera Geometry Computation | Daniel Barath, Michal Polic, Wolfgang Förstner, Torsten Sattler, Tomas Pajdla, Zuzana Kukelova | N/A | |
| Sub-center ArcFace: Boosting Face Recognition by Large-scale Noisy Web Faces | Jiankang Deng, Jia Guo, Tongliang Liu, Mingming Gong, Stefanos Zafeiriou | N/A | |
| Foley Music: Learning to Generate Music from Videos | Chuang Gan, Deng Huang, Peihao Chen, Joshua B. Tenenbaum, Antonio Torralba | N/A | |
| Contrastive Multiview Coding | Yonglong Tian, Dilip Krishnan, Phillip Isola | N/A | |
| Regional Homogeneity: Towards Learning Transferable Universal Adversarial Perturbations Against Defenses | Yingwei Li, Song Bai, Cihang Xie, Zhenyu Liao, Xiaohui Shen, Alan Yuille | N/A | |
| Generative Low-bitwidth Data Free Quantization | Shoukai Xu, Haokun Li, Bohan Zhuang, Jing Liu, Jiezhang Cao, Chuangrun Liang, Mingkui Tan | N/A | |
| Local Correlation Consistency for Knowledge Distillation | Xiaojie Li, Jianlong Wu, Hongyu Fang, Yue Liao, Fei Wang, Chen Qian | N/A | |
| Perceiving 3D Human-Object Spatial Arrangements from a Single Image in the Wild | Jason Y. Zhang, Sam Pepose, Hanbyul Joo, Deva Ramanan, Jitendra Malik, Angjoo Kanazawa | N/A | |
| Sep-Stereo: Visually Guided Stereophonic Audio Generation by Associating Source Separation | Hang Zhou, Xudong Xu, Dahua Lin, Xiaogang Wang, Ziwei Liu | N/A | |
| CelebA-Spoof: Large-Scale Face Anti-Spoofing Dataset with Rich Annotations | Yuanhan Zhang, ZhenFei Yin, Yidong Li, Guojun Yin, Junjie Yan, Jing Shao, Ziwei Liu | N/A | |
| Thinking in Frequency: Face Forgery Detection by Mining Frequency-aware Clues | Yuyang Qian, Guojun Yin, Lu Sheng, Zixuan Chen, Jing Shao | N/A | |
| Weakly-Supervised Cell Tracking via Backward-and-Forward Propagation | Kazuya Nishimura, Junya Hayashida, Chenyang Wang, Dai Fei Elmer Ker, Ryoma Bise | N/A | |
| SeqHAND: RGB-Sequence-Based 3D Hand Pose and Shape Estimation | John Yang, Hyung Jin Chang, Seungeui Lee, Nojun Kwak | N/A | |
| Rethinking the Distribution Gap of Person Re-identification with Camera-based Batch Normalization | Zijie Zhuang, Longhui Wei, Lingxi Xie, Tianyu Zhang, Hengheng Zhang, Haozhe Wu, Haizhou Ai, Qi Tian | N/A | |
| AMLN: Adversarial-based Mutual Learning Network for Online Knowledge Distillation | Xiaobing Zhang, Shijian Lu, Haigang Gong, Zhipeng Luo, Ming Liu | N/A | |
| Online Multi-modal Person Search in Videos | Jiangyue Xia, Anyi Rao, Qingqiu Huang, Linning Xu, Jiangtao Wen, Dahua Lin | N/A | |
| Single Image Super-Resolution via a Holistic Attention Network | Ben Niu, Weilei Wen, Wenqi Ren, Xiangde Zhang, Lianping Yang, Shuzhen Wang, Kaihao Zhang, Xiaochun Cao, Haifeng Shen | N/A | |
| Can You Read Me Now? Content Aware Rectification using Angle Supervision | Amir Markovitz, Inbal Lavi, Or Perel, Shai Mazor, Roee Litman | N/A | |
| Momentum Batch Normalization for Deep Learning with Small Batch Size | Hongwei Yong, Jianqiang Huang, Deyu Meng, Xiansheng Hua, Lei Zhang | N/A | |
| AdvPC: Transferable Adversarial Perturbations on 3D Point Clouds | Abdullah Hamdi, Sara Rojas, Ali Thabet, Bernard Ghanem | N/A | |
| Edge-aware Graph Representation Learning and Reasoning for Face Parsing | Gusi Te, Yinglu Liu, Wei Hu, Hailin Shi, Tao Mei | N/A | |
| BBS-Net: RGB-D Salient Object Detection with a Bifurcated Backbone Strategy Network | Deng-Ping Fan, Yingjie Zhai, Ali Borji, Jufeng Yang, Ling Shao | N/A | |
| G-LBM: Generative Low-dimensional Background Model Estimation from Video Sequences | Behnaz Rezaei, Amirreza Farnoosh, Sarah Ostadabbas | N/A | |
| H3DNet: 3D Object Detection Using Hybrid Geometric Primitives | Zaiwei Zhang, Bo Sun, Haitao Yang, Qixing Huang | N/A | |
| Expressive Telepresence via Modular Codec Avatars | Hang Chu, Shugao Ma, Fernando De la Torre, Sanja Fidler, Yaser Sheikh | N/A | |
| Cascade Graph Neural Networks for RGB-D Salient Object Detection | Ao Luo, Xin Li, Fan Yang, Zhicheng Jiao, Hong Cheng, Siwei Lyu | N/A | |
| FairALM: Augmented Lagrangian Method for Training Fair Models with Little Regret | Vishnu Suresh Lokhande, Aditya Kumar Akash, Sathya N. Ravi, Vikas Singh | N/A | |
| Generating Videos of Zero-Shot Compositions of Actions and Objects | Megha Nawhal, Mengyao Zhai, Andreas Lehrmann, Leonid Sigal, Greg Mori | N/A | |
| ViTAA: Visual-Textual Attributes Alignment in Person Search by Natural Language | Zhe Wang, Zhiyuan Fang, Jun Wang, Yezhou Yang | N/A | |
| Renovating Parsing R-CNN for Accurate Multiple Human Parsing | Lu Yang, Qing Song, Zhihui Wang, Mengjie Hu, Chun Liu, Xueshi Xin, Wenhe Jia, Songcen Xu | N/A | |
| Multi-Task Curriculum Framework for Open-Set Semi-Supervised Learning | Qing Yu, Daiki Ikami, Go Irie, Kiyoharu Aizawa | N/A | |
| Gradient-Induced Co-Saliency Detection | Zhao Zhang, Wenda Jin, Jun Xu, Ming-Ming Cheng | N/A | |
| Nighttime Defogging Using High-Low Frequency Decomposition and Grayscale-Color Networks | Wending Yan, Robby T. Tan, Dengxin Dai | N/A | |
| SegFix: Model-Agnostic Boundary Refinement for Segmentation | Yuhui Yuan, Jingyi Xie, Xilin Chen, Jingdong Wang | N/A | |
| Spatio-Temporal Graph Transformer Networks for Pedestrian Trajectory Prediction | Cunjun Yu, Xiao Ma, Jiawei Ren, Haiyu Zhao, Shuai Yi | N/A | |
| Fast Bi-layer Neural Synthesis of One-Shot Realistic Head Avatars | Egor Zakharov, Aleksei Ivakhnenko, Aliaksandra Shysheya, Victor Lempitsky | N/A | |
| Neural Geometric Parser for Single Image Camera Calibration | Jinwoo Lee, Minhyuk Sung, Hyunjoon Lee, Junho Kim | N/A | |
| Learning Flow-based Feature Warping for Face Frontalization with Illumination Inconsistent Supervision | Yuxiang Wei, Ming Liu, Haolin Wang, Ruifeng Zhu, Guosheng Hu, Wangmeng Zuo | N/A | |
| Learning Architectures for Binary Networks | Dahyun Kim, Kunal Pratap Singh, Jonghyun Choi | N/A | |
| Semantic View Synthesis | Hsin-Ping Huang, Hung-Yu Tseng, Hsin-Ying Lee, Jia-Bin Huang | N/A | |
| An Analysis of Sketched IRLS for Accelerated Sparse Residual Regression | Daichi Iwata, Michael Waechter, Wen-Yan Lin, Yasuyuki Matsushita | N/A | |
| Relative Pose from Deep Learned Depth and a Single Affine Correspondence | Ivan Eichhardt, Daniel Barath | N/A | |
| Video Super-Resolution with Recurrent Structure-Detail Network | Takashi Isobe, Xu Jia, Shuhang Gu, Songjiang Li, Shengjin Wang, Qi Tian | N/A | |
| Shape Adaptor: A Learnable Resizing Module | Shikun Liu, Zhe Lin, Yilin Wang, Jianming Zhang, Federico Perazzi, Edward Johns | N/A | |
| Shuffle and Attend: Video Domain Adaptation | Jinwoo Choi, Gaurav Sharma, Samuel Schulter, Jia-Bin Huang | N/A | |
| DRG: Dual Relation Graph for Human-Object Interaction Detection | Chen Gao, Jiarui Xu, Yuliang Zou, Jia-Bin Huang | N/A | |
| Flow-edge Guided Video Completion | Chen Gao, Ayush Saraf, Jia-Bin Huang, Johannes Kopf | N/A | |
| End-to-End Trainable Deep Active Contour Models for Automated Image Segmentation: Delineating Buildings in Aerial Imagery | Ali Hatamizadeh, Debleena Sengupta, Demetri Terzopoulos | N/A | |
| Towards End-to-end Video-based Eye-Tracking | Seonwook Park, Emre Aksan, Xucong Zhang, Otmar Hilliges | N/A | |
| Generating Handwriting via Decoupled Style Descriptors | Atsunobu Kotani, Stefanie Tellex, James Tompkin | N/A | |
| LEED: Label-Free Expression Editing via Disentanglement | Rongliang Wu, Shijian Lu | N/A | |
| Fashion Captioning: Towards Generating Accurate Descriptions with Semantic Rewards | Xuewen Yang, Heming Zhang, Di Jin, Yingru Liu, Chi-Hao Wu, Jianchao Tan, Dongliang Xie, Jue Wang, Xin Wang | N/A | |
| Reducing Language Biases in Visual Question Answering with Visually-Grounded Question Encoder | Gouthaman KV, Anurag Mittal | N/A | |
| Unsupervised Cross-Modal Alignment for Multi-Person 3D Pose Estimation | Jogendra Nath Kundu, Ambareesh Revanur, Govind Vitthal Waghmare, Rahul Mysore Venkatesh, R. Venkatesh Babu | N/A | |
| Class-Incremental Domain Adaptation | Jogendra Nath Kundu, Rahul Mysore Venkatesh, Naveen Venkat, Ambareesh Revanur, R. Venkatesh Babu | N/A | |
| Anti-Bandit Neural Architecture Search for Model Defense | Hanlin Chen, Baochang Zhang, Song Xue, Xuan Gong, Hong Liu, Rongrong Ji, David Doermann | N/A | |
| Wavelet-Based Dual-Branch Network for Image Demoiréing | Lin Liu, Jianzhuang Liu, Shanxin Yuan, Gregory Slabaugh, Aleš Leonardis, Wengang Zhou, Qi Tian | N/A | |
| Low Light Video Enhancement using Synthetic Data Produced with an Intermediate Domain Mapping | Danai Triantafyllidou, Sean Moran, Steven McDonagh, Sarah Parisot, Gregory Slabaugh | N/A | |
| Non-Local Spatial Propagation Network for Depth Completion | Jinsun Park, Kyungdon Joo, Zhe Hu, Chi-Kuei Liu, In So Kweon | N/A | |
| DanbooRegion: An Illustration Region Dataset | Lvmin Zhang, Yi Ji, Chunping Liu | N/A | |
| Event Enhanced High-Quality Image Recovery | Bishan Wang, Jingwei He, Lei Yu, Gui-Song Xia, Wen Yang | N/A | |
| PackDet: Packed Long-Head Object Detector | Kun Ding, Guojin He, Huxiang Gu, Zisha Zhong, Shiming Xiang, Chunhong Pan | N/A | |
| A Generic Graph-based Neural Architecture Encoding Scheme for Predictor-based NAS | Xuefei Ning, Yin Zheng, Tianchen Zhao, Yu Wang, Huazhong Yang | N/A | |
| Learning Semantic Neural Tree for Human Parsing | Ruyi Ji, Dawei Du, Libo Zhang, Longyin Wen, Yanjun Wu, Chen Zhao, Feiyue Huang, Siwei Lyu | N/A | |
| Sketching Image Gist: Human-Mimetic Hierarchical Scene Graph Generation | Wenbin Wang, Ruiping Wang, Shiguang Shan, Xilin Chen | N/A | |
| Burst Denoising via Temporally Shifted Wavelet Transforms | Xuejian Rong, Denis Demandolx, Kevin Matzen, Priyam Chatterjee, Yingli Tian | N/A | |
| JSSR: A Joint Synthesis, Segmentation, and Registration System for 3D Multi-Modal Image Alignment of Large-scale Pathological CT Scans | Fengze Liu, Jinzheng Cai, Yuankai Huo, Chi-Tung Cheng, Ashwin Raju, Dakai Jin, Jing Xiao, Alan Yuille, Le Lu, ChienHung Liao, Adam P. Harrison | N/A | |
| SimAug: Learning Robust Representations from Simulation for Trajectory Prediction | Junwei Liang, Lu Jiang, Alexander Hauptmann | N/A | |
| ScribbleBox: Interactive Annotation Framework for Video Object Segmentation | Bowen Chen, Huan Ling, Xiaohui Zeng, Jun Gao, Ziyue Xu, Sanja Fidler | N/A | |
| Rethinking Pseudo-LiDAR Representation | Xinzhu Ma, Shinan Liu, Zhiyi Xia, Hongwen Zhang, Xingyu Zeng, Wanli Ouyang | N/A | |
| Deep Multi Depth Panoramas for View Synthesis | Kai-En Lin, Zexiang Xu, Ben Mildenhall, Pratul P. Srinivasan, Yannick Hold-Geoffroy, Stephen DiVerdi, Qi Sun, Kalyan Sunkavalli, Ravi Ramamoorthi | N/A | |
| MINI-Net: Multiple Instance Ranking Network for Video Highlight Detection | Fa-Ting Hong, Xuanteng Huang, Wei-Hong Li, Wei-Shi Zheng | N/A | |
| ContactPose: A Dataset of Grasps with Object Contact and Hand Pose | Samarth Brahmbhatt, Chengcheng Tang, Christopher D. Twigg, Charles C. Kemp, James Hays | N/A | |
| API-Net: Robust Generative Classifier via a Single Discriminator | Xinshuai Dong, Hong Liu, Rongrong Ji, Liujuan Cao, Qixiang Ye, Jianzhuang Liu, Qi Tian | N/A | |
| Bias-based Universal Adversarial Patch Attack for Automatic Check-out | Aishan Liu, Jiakai Wang, Xianglong Liu, Bowen Cao, Chongzhi Zhang, Hang Yu | N/A | |
| Imbalanced Continual Learning with Partitioning Reservoir Sampling | Chris Dongjoo Kim, Jinseo Jeong, Gunhee Kim | N/A | |
| Guided Collaborative Training for Pixel-wise Semi-Supervised Learning | Zhanghan Ke, Di Qiu, Kaican Li, Qiong Yan, Rynson W.H. Lau | N/A | |
| Stacking Networks Dynamically for Image Restoration Based on the Plug-and-Play Framework | Haixin Wang, Tianhao Zhang, Muzhi Yu, Jinan Sun, Wei Ye, Chen Wang, Shikun Zhang | N/A | |
| Efficient Transfer Learning via Joint Adaptation of Network Architecture and Weight | Ming Sun, Haoxuan Dou, Junjie Yan | N/A | |
| Spatial Attention Pyramid Network for Unsupervised Domain Adaptation | Congcong Li, Dawei Du, Libo Zhang, Longyin Wen, Tiejian Luo, Yanjun Wu, Pengfei Zhu | N/A | |
| GSIR: Generalizable 3D Shape Interpretation and Reconstruction | Jianren Wang, Zhaoyuan Fang | N/A | |
| Weakly Supervised 3D Object Detection from Lidar Point Cloud | Qinghao Meng, Wenguan Wang, Tianfei Zhou, Jianbing Shen, Luc Van Gool, Dengxin Dai | N/A | |
| Two-phase Pseudo Label Densification for Self-training based Domain Adaptation | Inkyu Shin, Sanghyun Woo, Fei Pan, In So Kweon | N/A | |
| Adaptive Offline Quintuplet Loss for Image-Text Matching | Tianlang Chen, Jiajun Deng, Jiebo Luo | N/A | |
| Learning Object Placement by Inpainting for Compositional Data Augmentation | Lingzhi Zhang, Tarmily Wen, Jie Min, Jiancong Wang, David Han, Jianbo Shi | N/A | |
| Deep Vectorization of Technical Drawings | Vage Egiazarian, Oleg Voynov, Alexey Artemov, Denis Volkhonskiy, Aleksandr Safin, Maria Taktasheva, Denis Zorin, Evgeny Burnaev | N/A | |
| CAD-Deform: Deformable Fitting of CAD Models to 3D Scans | Vladislav Ishimtsev, Alexey Bokhovkin, Alexey Artemov, Savva Ignatyev, Matthias Niessner, Denis Zorin, Evgeny Burnaev | N/A | |
| An Image Enhancing Pattern-based Sparsity for Real-time Inference on Mobile Devices | Xiaolong Ma, Wei Niu, Tianyun Zhang, Sijia Liu, Sheng Lin, Hongjia Li, Wujie Wen, Xiang Chen, Jian Tang, Kaisheng Ma, Bin Ren, Yanzhi Wang | N/A | |
| AutoTrajectory: Label-free Trajectory Extraction and Prediction from Videos using Dynamic Points | Yuexin Ma, Xinge Zhu, Xinjing Cheng, Ruigang Yang, Jiming Liu, Dinesh Manocha | N/A | |
| Multi-Agent Embodied Question Answering in Interactive Environments | Sinan Tan, Weilai Xiang, Huaping Liu, Di Guo, Fuchun Sun | N/A | |
| Conditional Sequential Modulation for Efficient Global Image Retouching | Jingwen He, Yihao Liu, Yu Qiao, Chao Dong | N/A | |
| Segmenting Transparent Objects in the Wild | Enze Xie, Wenjia Wang, Wenhai Wang, Mingyu Ding, Chunhua Shen, Ping Luo | N/A | |
| Length-Controllable Image Captioning | Chaorui Deng, Ning Ding, Mingkui Tan, Qi Wu | N/A | |
| Few-Shot Semantic Segmentation with Democratic Attention Networks | Haochen Wang, Xudong Zhang, Yutao Hu, Yandan Yang, Xianbin Cao, Xiantong Zhen | N/A | |
| Defocus Blur Detection via Depth Distillation | Xiaodong Cun, Chi-Man Pun | N/A | |
| Motion Guided 3D Pose Estimation from Videos | Jingbo Wang, Sijie Yan, Yuanjun Xiong, Dahua Lin | N/A | |
| Reflection Separation via Multi-bounce Polarization State Tracing | Rui Li, Simeng Qiu, Guangming Zang, Wolfgang Heidrich | N/A | |
| SipMask: Spatial Information Preservation for Fast Image and Video Instance Segmentation | Jiale Cao, Rao Muhammad Anwer, Hisham Cholakkal, Fahad Shahbaz Khan, Yanwei Pang, Ling Shao | N/A | |
| SemanticAdv: Generating Adversarial Examples via Attribute-conditioned Image Editing | Haonan Qiu, Chaowei Xiao, Lei Yang, Xinchen Yan, Honglak Lee, Bo Li | N/A | |
| Learning with Noisy Class Labels for Instance Segmentation | Longrong Yang, Fanman Meng, Hongliang Li, Qingbo Wu, Qishang Cheng | N/A | |
| Deep Image Clustering with Category-Style Representation | Junjie Zhao, Donghuan Lu, Kai Ma, Yu Zhang, Yefeng Zheng | N/A | |
| Self-supervised Motion Representation via Scattering Local Motion Cues | Yuan Tian, Zhaohui Che, Wenbo Bao, Guangtao Zhai, Zhiyong Gao | N/A | |
| Improving Monocular Depth Estimation by Leveraging Structural Awareness and Complementary Datasets | Tian Chen, Shijie An, Yuan Zhang, Chongyang Ma, Huayan Wang, Xiaoyan Guo, Wen Zheng | N/A | |
| BMBC: Bilateral Motion Estimation with Bilateral Cost Volume for Video Interpolation | Junheum Park, Keunsoo Ko, Chul Lee, Chang-Su Kim | N/A | |
| Hard negative examples are hard, but useful | Hong Xuan, Abby Stylianou, Xiaotong Liu, Robert Pless | N/A | |
| ReActNet: Towards Precise Binary Neural Network with Generalized Activation Functions | Zechun Liu, Zhiqiang Shen, Marios Savvides, Kwang-Ting Cheng | N/A | |
| Video Object Detection via Object-level Temporal Aggregation | Chun-Han Yao, Chen Fang, Xiaohui Shen, Yangyue Wan, Ming-Hsuan Yang | N/A | |
| Object Detection with a Unified Label Space from Multiple Datasets | Xiangyun Zhao, Samuel Schulter, Gaurav Sharma, Yi-Hsuan Tsai, Manmohan Chandraker, Ying Wu | N/A | |
| Lift, Splat, Shoot: Encoding Images from Arbitrary Camera Rigs by Implicitly Unprojecting to 3D | Jonah Philion, Sanja Fidler | N/A | |
| Comprehensive Image Captioning via Scene Graph Decomposition | Yiwu Zhong, Liwei Wang, Jianshu Chen, Dong Yu, Yin Li | N/A | |
| Symbiotic Adversarial Learning for Attribute-based Person Search | Yu-Tong Cao, Jingya Wang, Dacheng Tao | N/A | |
| Amplifying Key Cues for Human-Object-Interaction Detection | Yang Liu, Qingchao Chen, Andrew Zisserman | N/A | |
| Rethinking Few-shot Image Classification: A Good Embedding is All You Need? | Yonglong Tian, Yue Wang, Dilip Krishnan, Joshua B. Tenenbaum, Phillip Isola | N/A | |
| Adversarial Background-Aware Loss for Weakly-supervised Temporal Activity Localization | Kyle Min, Jason J. Corso | N/A | |
| Action Localization through Continual Predictive Learning | Sathyanarayanan Aakur, Sudeep Sarkar | N/A | |
| Generative View-Correlation Adaptation for Semi-Supervised Multi-View Learning | Yunyu Liu, Lichen Wang, Yue Bai, Can Qin, Zhengming Ding, Yun Fu | N/A | |
| READ: Reciprocal Attention Discriminator for Image-to-Video Re-Identification | Minho Shim, Hsuan-I Ho, Jinhyung Kim, Dongyoon Wee | N/A | |
| 3D Human Shape Reconstruction from a Polarization Image | Shihao Zou, Xinxin Zuo, Yiming Qian, Sen Wang, Chi Xu, Minglun Gong, Li Cheng | N/A | |
| The Devil is in the Details: Self-Supervised Attention for Vehicle Re-Identification | Pirazh Khorramshahi, Neehar Peri, Jun-cheng Chen, Rama Chellappa | N/A | |
| Improving One-stage Visual Grounding by Recursive Sub-query Construction | Zhengyuan Yang, Tianlang Chen, Liwei Wang, Jiebo Luo | N/A | |
| Multi-level Wavelet-based Generative Adversarial Network for Perceptual Quality Enhancement of Compressed Video | Jianyi Wang, Xin Deng, Mai Xu, Congyong Chen, Yuhang Song | N/A | |
| Example-Guided Image Synthesis using Masked Spatial-Channel Attention and Self-Supervision | Haitian Zheng, Haofu Liao, Lele Chen, Wei Xiong, Tianlang Chen, Jiebo Luo | N/A | |
| Content-Consistent Matching for Domain Adaptive Semantic Segmentation | Guangrui Li, Guoliang Kang, Wu Liu, Yunchao Wei, Yi Yang | N/A | |
| AE TextSpotter: Learning Visual and Linguistic Representation for Ambiguous Text Spotting | Wenhai Wang, Xuebo Liu, Xiaozhong Ji, Enze Xie, Ding Liang, ZhiBo Yang, Tong Lu, Chunhua Shen, Ping Luo | N/A | |
| History Repeats Itself: Human Motion Prediction via Motion Attention | Wei Mao, Miaomiao Liu, Mathieu Salzmann | N/A | |
| Unsupervised Video Object Segmentation with Joint Hotspot Tracking | Lu Zhang, Jianming Zhang, Zhe Lin, Radomír Měch, Huchuan Lu, You He | N/A | |
| SRNet: Improving Generalization in 3D Human Pose Estimation with a Split-and-Recombine Approach | Ailing Zeng, Xiao Sun, Fuyang Huang, Minhao Liu, Qiang Xu, Stephen Lin | N/A | |
| CAFE-GAN: Arbitrary Face Attribute Editing with Complementary Attention Feature | Jeong gi Kwak, David K. Han, Hanseok Ko | N/A | |
| MimicDet: Bridging the Gap Between One-Stage and Two-Stage Object Detection | Xin Lu, Quanquan Li, Buyu Li, Junjie Yan | N/A | |
| Latent Topic-aware Multi-Label Classification | Jianghong Ma, Yang Liu | N/A | |
| Finding It at Another Side: A Viewpoint-Adapted Matching Encoder for Change Captioning | Xiangxi Shi, Xu Yang, Jiuxiang Gu, Shafiq Joty, Jianfei Cai | N/A | |
| Attract, Perturb, and Explore: Learning a Feature Alignment Network for Semi-supervised Domain Adaptation | Taekyung Kim, Changick Kim | N/A | |
| Curriculum Manager for Source Selection in Multi-Source Domain Adaptation | Luyu Yang, Yogesh Balaji, Ser-Nam Lim, Abhinav Shrivastava | N/A | |
| Powering One-shot Topological NAS with Stabilized Share-parameter Proxy | Ronghao Guo, Chen Lin, Chuming Li, Keyu Tian, Ming Sun, Lu Sheng, Junjie Yan | N/A | |
| Classes Matter: A Fine-grained Adversarial Approach to Cross-domain Semantic Segmentation | Haoran Wang, Tong Shen, Wei Zhang, Ling-Yu Duan, Tao Mei | N/A | |
| Boundary-preserving Mask R-CNN | Tianheng Cheng, Xinggang Wang, Lichao Huang, Wenyu Liu | N/A | |
| Self-supervised Single-view 3D Reconstruction via Semantic Consistency | Xueting Li, Sifei Liu, Kihwan Kim, Shalini De Mello, Varun Jampani, Ming-Hsuan Yang, Jan Kautz | N/A | |
| MetaDistiller: Network Self-Boosting via Meta-Learned Top-Down Distillation | Benlin Liu, Yongming Rao, Jiwen Lu, Jie Zhou, Cho-Jui Hsieh | N/A | |
| Learning Monocular Visual Odometry via Self-Supervised Long-Term Modeling | Yuliang Zou, Pan Ji, Quoc-Huy Tran, Jia-Bin Huang, Manmohan Chandraker | N/A | |
| The Devil is in Classification: A Simple Framework for Long-tail Instance Segmentation | Tao Wang, Yu Li, Bingyi Kang, Junnan Li, Junhao Liew, Sheng Tang, Steven Hoi, Jiashi Feng | N/A | |
| What is Learned in Deep Uncalibrated Photometric Stereo? | Guanying Chen, Michael Waechter, Boxin Shi, Kwan-Yee K. Wong, Yasuyuki Matsushita | N/A | |
| Prior-based Domain Adaptive Object Detection for Hazy and Rainy Conditions | Vishwanath A. Sindagi, Poojan Oza, Rajeev Yasarla, Vishal M. Patel | N/A | |
| Adversarial Ranking Attack and Defense | Mo Zhou, Zhenxing Niu, Le Wang, Qilin Zhang, Gang Hua | N/A | |
| ReDro: Efficiently Learning Large-sized SPD Visual Representation | Saimunur Rahman, Lei Wang, Changming Sun, Luping Zhou | N/A | |
| Graph-Based Social Relation Reasoning | Wanhua Li, Yueqi Duan, Jiwen Lu, Jianjiang Feng, Jie Zhou | N/A | |
| EPNet: Enhancing Point Features with Image Semantics for 3D Object Detection | Tengteng Huang, Zhe Liu, Xiwu Chen, Xiang Bai | N/A | |
| Self-Supervised Monocular 3D Face Reconstruction by Occlusion-Aware Multi-view Geometry Consistency | Jiaxiang Shang, Tianwei Shen, Shiwei Li, Lei Zhou, Mingmin Zhen, Tian Fang, Long Quan | N/A | |
| Asynchronous Interaction Aggregation for Action Detection | Jiajun Tang, Jin Xia, Xinzhi Mu, Bo Pang, Cewu Lu | N/A | |
| Shape and Viewpoint without Keypoints | Shubham Goel, Angjoo Kanazawa, Jitendra Malik | N/A | |
| Learning Attentive and Hierarchical Representations for 3D Shape Recognition | Jiaxin Chen, Jie Qin, Yuming Shen, Li Liu, Fan Zhu, Ling Shao | N/A | |
| TF-NAS: Rethinking Three Search Freedoms of Latency-Constrained Differentiable Neural Architecture Search | Yibo Hu, Xiang Wu, Ran He | N/A | |
| Associative3D: Volumetric Reconstruction from Sparse Views | Shengyi Qian, Linyi Jin, David F. Fouhey | N/A | |
| PlugNet: Degradation Aware Scene Text Recognition Supervised by a Pluggable Super-Resolution Unit | Yongqiang Mou, Lei Tan, Hui Yang, Jingying Chen, Leyuan Liu, Rui Yan, Yaohong Huang | N/A | |
| Memory Selection Network for Video Propagation | Ruizheng Wu, Huaijia Lin, Xiaojuan Qi, Jiaya Jia | N/A | |
| Disentangled Non-local Neural Networks | Minghao Yin, Zhuliang Yao, Yue Cao, Xiu Li, Zheng Zhang, Stephen Lin, Han Hu | N/A | |
| URVOS: Unified Referring Video Object Segmentation Network with a Large-Scale Benchmark | Seonguk Seo, Joon-Young Lee, Bohyung Han | N/A | |
| Generalizing Person Re-Identification by Camera-Aware Invariance Learning and Cross-Domain Mixup | Chuanchen Luo, Chunfeng Song, Zhaoxiang Zhang | N/A | |
| Semi-Supervised Crowd Counting via Self-Training on Surrogate Tasks | Yan Liu, Lingqiao Liu, Peng Wang, Pingping Zhang, Yinjie Lei | N/A | |
| Dynamic R-CNN: Towards High Quality Object Detection via Dynamic Training | Hongkai Zhang, Hong Chang, Bingpeng Ma, Naiyan Wang, Xilin Chen | N/A | |
| Boosting Decision-based Black-box Adversarial Attacks with Random Sign Flip | Weilun Chen, Zhaoxiang Zhang, Xiaolin Hu, Baoyuan Wu | N/A | |
| Knowledge Transfer via Dense Cross-Layer Mutual-Distillation | Anbang Yao, Dawei Sun | N/A | |
| Matching Guided Distillation | Kaiyu Yue, Jiangfan Deng, Feng Zhou | N/A | |
| Clustering Driven Deep Autoencoder for Video Anomaly Detection | Yunpeng Chang, Zhigang Tu, Wei Xie, Junsong Yuan | N/A | |
| Learning to Compose Hypercolumns for Visual Correspondence | Juhong Min, Jongmin Lee, Jean Ponce, Minsu Cho | N/A | |
| Stochastic Bundle Adjustment for Efficient and Scalable 3D Reconstruction | Lei Zhou, Zixin Luo, Mingmin Zhen, Tianwei Shen, Shiwei Li, Zhuofei Huang, Tian Fang, Long Quan | N/A | |
| Object-based Illumination Estimation with Rendering-aware Neural Networks | Xin Wei, Guojun Chen, Yue Dong, Stephen Lin, Xin Tong | N/A | |
| Progressive Point Cloud Deconvolution Generation Network | Le Hui, Rui Xu, Jin Xie, Jianjun Qian, Jian Yang | N/A | |
| SSCGAN: Facial Attribute Editing via Style Skip Connections | Wenqing Chu, Ying Tai, Chengjie Wang, Jilin Li, Feiyue Huang, Rongrong Ji | N/A | |
| Negative Pseudo Labeling using Class Proportion for Semantic Segmentation in Pathology | Hiroki Tokunaga, Brian Kenji Iwana, Yuki Teramoto, Akihiko Yoshizawa, Ryoma Bise | N/A | |
| Learn to Propagate Reliably on Noisy Affinity Graphs | Lei Yang, Qingqiu Huang, Huaiyi Huang, Linning Xu, Dahua Lin | N/A | |
| Fair DARTS: Eliminating Unfair Advantages in Differentiable Architecture Search | Xiangxiang Chu, Tianbao Zhou, Bo Zhang, Jixiang Li | N/A | |
| TANet: Towards Fully Automatic Tooth Arrangement | Guodong Wei, Zhiming Cui, Yumeng Liu, Nenglun Chen, Runnan Chen, Guiqing Li, Wenping Wang | N/A | |
| UnionDet: Union-Level Detector Towards Real-Time Human-Object Interaction Detection | Bumsoo Kim, Taeho Choi, Jaewoo Kang, Hyunwoo J. Kim | N/A | |
| GSNet: Joint Vehicle Pose and Shape Reconstruction with Geometrical and Scene-aware Supervision | Lei Ke, Shichao Li, Yanan Sun, Yu-Wing Tai, Chi-Keung Tang | N/A | |
| Resolution Switchable Networks for Runtime Efficient Image Recognition | Yikai Wang, Fuchun Sun, Duo Li, Anbang Yao | N/A | |
| SMAP: Single-Shot Multi-Person Absolute 3D Pose Estimation | Jianan Zhen, Qi Fang, Jiaming Sun, Wentao Liu, Wei Jiang, Hujun Bao, Xiaowei Zhou | N/A | |
| Learning to Detect Open Classes for Universal Domain Adaptation | Bo Fu, Zhangjie Cao, Mingsheng Long, Jianmin Wang | N/A | |
| Visual Compositional Learning for Human-Object Interaction Detection | Zhi Hou, Xiaojiang Peng, Yu Qiao, Dacheng Tao | N/A | |
| Deep Plastic Surgery: Robust and Controllable Image Editing with Human-Drawn Sketches | Shuai Yang, Zhangyang Wang, Jiaying Liu, Zongming Guo | N/A | |
| Rethinking Class Activation Mapping for Weakly Supervised Object Localization | Wonho Bae, Junhyug Noh, Gunhee Kim | N/A | |
| OS2D: One-Stage One-Shot Object Detection by Matching Anchor Features | Anton Osokin, Denis Sumin, Vasily Lomakin | N/A | |
| Interpretable Neural Network Decoupling | Yuchao Li, Rongrong Ji, Shaohui Lin, Baochang Zhang, Chenqian Yan, Yongjian Wu, Feiyue Huang, Ling Shao | N/A | |
| Omni-sourced Webly-supervised Learning for Video Recognition | Haodong Duan, Yue Zhao, Yuanjun Xiong, Wentao Liu, Dahua Lin | N/A | |
| CurveLane-NAS: Unifying Lane-Sensitive Architecture Search and Adaptive Point Blending | Hang Xu, Shaoju Wang, Xinyue Cai, Wei Zhang, Xiaodan Liang, Zhenguo Li | N/A | |
| Contextual-Relation Consistent Domain Adaptation for Semantic Segmentation | Jiaxing Huang, Shijian Lu, Dayan Guan, Xiaobing Zhang | N/A | |
| Estimating People Flows to Better Count Them in Crowded Scenes | Weizhe Liu, Mathieu Salzmann, Pascal Fua | N/A | |
| Generate to Adapt: Resolution Adaption Network for Surveillance Face Recognition | Han Fang, Weihong Deng, Yaoyao Zhong, Jiani Hu | N/A | |
| Learning Feature Embeddings for Discriminant Model based Tracking | Linyu Zheng, Ming Tang, Yingying Chen, Jinqiao Wang, Hanqing Lu | N/A | |
| WeightNet: Revisiting the Design Space of Weight Networks | Ningning Ma, Xiangyu Zhang, Jiawei Huang, Jian Sun | N/A | |
| Partially-Shared Variational Auto-encoders for Unsupervised Domain Adaptation with Target Shift | Ryuhei Takahashi, Atsushi Hashimoto, Motoharu Sonogashira, Masaaki Iiyama | N/A | |
| Learning Where to Focus for Efficient Video Object Detection | Zhengkai Jiang, Yu Liu, Ceyuan Yang, Jihao Liu, Peng Gao, Qian Zhang, Shiming Xiang, Chunhong Pan | N/A | |
| Learning Object Permanence from Video | Aviv Shamsian, Ofri Kleinfeld, Amir Globerson, Gal Chechik | N/A | |
| Adaptive Text Recognition through Visual Matching | Chuhan Zhang, Ankush Gupta, Andrew Zisserman | N/A | |
| Actions as Moving Points | Yixuan Li, Zixu Wang, Limin Wang, Gangshan Wu | N/A | |
| Learning to Exploit Multiple Vision Modalities by Using Grafted Networks | Yuhuang Hu, Tobi Delbruck, Shih-Chii Liu | N/A | |
| Geometric Correspondence Fields: Learned Differentiable Rendering for 3D Pose Refinement in the Wild | Alexander Grabner, Yaming Wang, Peizhao Zhang, Peihong Guo, Tong Xiao, Peter Vajda, Peter M. Roth, Vincent Lepetit | N/A | |
| 3D Fluid Flow Reconstruction Using Compact Light Field PIV | Zhong Li, Yu Ji, Jingyi Yu, Jinwei Ye | N/A | |
| Contextual Diversity for Active Learning | Sharat Agarwal, Himanshu Arora, Saket Anand, Chetan Arora | N/A | |
| Temporal Aggregate Representations for Long-Range Video Understanding | Fadime Sener, Dipika Singhania, Angela Yao | N/A | |
| Stochastic Fine-grained Labeling of Multi-state Sign Glosses for Continuous Sign Language Recognition | Zhe Niu, Brian Mak | N/A | |
| General 3D Room Layout from a Single View by Render-and-Compare | Sinisa Stekovic, Shreyas Hampali, Mahdi Rad, Sayan Deb Sarkar, Friedrich Fraundorfer, Vincent Lepetit | N/A | |
| Neural Dense Non-Rigid Structure from Motion with Latent Space Constraints | Vikramjit Sidhu, Edgar Tretschk, Vladislav Golyanik, Antonio Agudo, Christian Theobalt | N/A | |
| Multimodal Memorability: Modeling Effects of Semantics and Decay on Video Memorability | Anelise Newman, Camilo Fosco, Vincent Casser, Allen Lee, Barry McNamara, Aude Oliva | N/A | |
| Yet Another Intermediate-Level Attack | Qizhang Li, Yiwen Guo, Hao Chen | N/A | |
| Topology-Change-Aware Volumetric Fusion for Dynamic Scene Reconstruction | Chao Li, Xiaohu Guo | N/A | |
| Early Exit Or Not: Resource-Efficient Blind Quality Enhancement for Compressed Images | Qunliang Xing, Mai Xu, Tianyi Li, Zhenyu Guan | N/A | |
| PatchNets: Patch-Based Generalizable Deep Implicit 3D Shape Representations | Edgar Tretschk, Ayush Tewari, Vladislav Golyanik, Michael Zollhöfer, Carsten Stoll, Christian Theobalt | N/A | |
| How does Lipschitz Regularization Influence GAN Training? | Yipeng Qin, Niloy Mitra, Peter Wonka | N/A | |
| Infrastructure-based Multi-Camera Calibration using Radial Projections | Yukai Lin, Viktor Larsson, Marcel Geppert, Zuzana Kukelova, Marc Pollefeys, Torsten Sattler | N/A | |
| MotionSqueeze: Neural Motion Feature Learning for Video Understanding | Heeseung Kwon, Manjin Kim, Suha Kwak, Minsu Cho | N/A | |
| Polarized Optical-Flow Gyroscope | Masada Tzabari, Yoav Y. Schechner | N/A | |
| Online Meta-Learning for Multi-Source and Semi-Supervised Domain Adaptation | Da Li, Timothy Hospedales | N/A | |
| An Ensemble of Epoch-wise Empirical Bayes for Few-shot Learning | Yaoyao Liu, Bernt Schiele, Qianru Sun | N/A | |
| On the Effectiveness of Image Rotation for Open Set Domain Adaptation | Silvia Bucci, Mohammad Reza Loghmani, Tatiana Tommasi | N/A | |
| Combining Task Predictors via Enhancing Joint Predictability | Kwang In Kim, Christian Richardt, Hyung Jin Chang | N/A | |
| Multi-Scale Positive Sample Refinement for Few-Shot Object Detection | Jiaxi Wu, Songtao Liu, Di Huang, Yunhong Wang | N/A | |
| Single-Image Depth Prediction Makes Feature Matching Easier | Carl Toft, Daniyar Turmukhambetov, Torsten Sattler, Fredrik Kahl, Gabriel J. Brostow | N/A | |
| Deep Reinforced Attention Learning for Quality-Aware Visual Recognition | Duo Li, Qifeng Chen | N/A | |
| CFAD: Coarse-to-Fine Action Detector for Spatiotemporal Action Localization | Yuxi Li, Weiyao Lin, John See, Ning Xu, Shugong Xu, Ke Yan, Cong Yang | N/A | |
| Learning Joint Spatial-Temporal Transformations for Video Inpainting | Yanhong Zeng, Jianlong Fu, Hongyang Chao | N/A | |
| Single Path One-Shot Neural Architecture Search with Uniform Sampling | Zichao Guo, Xiangyu Zhang, Haoyuan Mu, Wen Heng, Zechun Liu, Yichen Wei, Jian Sun | N/A | |
| Learning to Generate Novel Domains for Domain Generalization | Kaiyang Zhou, Yongxin Yang, Timothy Hospedales, Tao Xiang | N/A | |
| Continuous Adaptation for Interactive Object Segmentation by Learning from Corrections | Theodora Kontogianni, Michael Gygli, Jasper Uijlings, Vittorio Ferrari | N/A | |
| Impact of base dataset design on few-shot image classification | Othman Sbai, Camille Couprie, Mathieu Aubry | N/A | |
| Invertible Zero-Shot Recognition Flows | Yuming Shen, Jie Qin, Lei Huang, Li Liu, Fan Zhu, Ling Shao | N/A | |
| GeoLayout: Geometry Driven Room Layout Estimation Based on Depth Maps of Planes | Weidong Zhang, Wei Zhang, Yinda Zhang | N/A | |
| Location Sensitive Image Retrieval and Tagging | Raul Gomez, Jaume Gibert, Lluis Gomez, Dimosthenis Karatzas | N/A | |
| Joint 3D Layout and Depth Prediction from a Single Indoor Panorama Image | Wei Zeng, Sezer Karaoglu, Theo Gevers | N/A | |
| Guessing State Tracking for Visual Dialogue | Wei Pang, Xiaojie Wang | N/A | |
| Memory-Efficient Incremental Learning Through Feature Adaptation | Ahmet Iscen, Jeffrey Zhang, Svetlana Lazebnik, Cordelia Schmid | N/A | |
| Neural Voice Puppetry: Audio-driven Facial Reenactment | Justus Thies, Mohamed Elgharib, Ayush Tewari, Christian Theobalt, Matthias Nießner | N/A | |
| One-Shot Unsupervised Cross-Domain Detection | Antonio D’Innocente, Francesco Cappio Borlino, Silvia Bucci, Barbara Caputo, Tatiana Tommasi | N/A | |
| Stochastic Frequency Masking to Improve Super-Resolution and Denoising Networks | Majed El Helou, Ruofan Zhou, Sabine Süsstrunk | N/A | |
| Probabilistic Future Prediction for Video Scene Understanding | Anthony Hu, Fergal Cotter, Nikhil Mohan, Corina Gurau, Alex Kendall | N/A | |
| Suppressing Mislabeled Data via Grouping and Self-Attention | Xiaojiang Peng, Kai Wang, Zhaoyang Zeng, Qing Li, Jianfei Yang, Yu Qiao | N/A | |
| Class-wise Dynamic Graph Convolution for Semantic Segmentation | Hanzhe Hu, Deyi Ji, Weihao Gan, Shuai Bai, Wei Wu, Junjie Yan | N/A | |
| Character-Preserving Coherent Story Visualization | Yun-Zhu Song, Zhi Rui Tam, Hung-Jen Chen, Huiao-Han Lu, Hong-Han Shuai | N/A | |
| GINet: Graph Interaction Network for Scene Parsing | Tianyi Wu, Yu Lu, Yu Zhu, Chuang Zhang, Ming Wu, Zhanyu Ma, Guodong Guo | N/A | |
| Tensor Low-Rank Reconstruction for Semantic Segmentation | Wanli Chen, Xinge Zhu, Ruoqi Sun, Junjun He, Ruiyu Li, Xiaoyong Shen, Bei Yu | N/A | |
| Attentive Normalization | Xilai Li, Wei Sun, Tianfu Wu | N/A | |
| Count- and Similarity-aware R-CNN for Pedestrian Detection | Jin Xie, Hisham Cholakkal, Rao Muhammad Anwer, Fahad Shahbaz Khan, Yanwei Pang, Ling Shao, Mubarak Shah | N/A | |
| TRADI: Tracking Deep Neural network Weight Distributions | Gianni Franchi, Andrei Bursuc, Emanuel Aldea, Séverine Dubuisson, Isabelle Bloch | N/A | |
| Spatiotemporal Attacks for Embodied Agents | Aishan Liu, Tairan Huang, Xianglong Liu, Yitao Xu, Yuqing Ma, Xinyun Chen, Stephen J. Maybank, Dacheng Tao | N/A | |
| Caption-Supervised Face Recognition: Training a State-of-the-Art Face Model without Manual Annotation | Qingqiu Huang, Lei Yang, Huaiyi Huang, Tong Wu, Dahua Lin | N/A | |
| Unselfie: Translating Selfies to Neutral-pose Portraits in the Wild | Liqian Ma, Zhe Lin, Connelly Barnes, Alexei A Efros, Jingwan Lu | N/A | |
| Design and Interpretation of Universal Adversarial Patches in Face Detection | Xiao Yang, Fangyun Wei, Hongyang Zhang, Jun Zhu | N/A | |
| Few-Shot Object Detection and Viewpoint Estimation for Objects in the Wild | Yang Xiao, Renaud Marlet | N/A | |
| Weakly Supervised 3D Hand Pose Estimation via Biomechanical Constraints | Adrian Spurr, Umar Iqbal, Pavlo Molchanov, Otmar Hilliges, Jan Kautz | N/A | |
| Dynamic Dual-Attentive Aggregation Learning for Visible-Infrared Person Re-Identification | Mang Ye, Jianbing Shen, David J. Crandall, Ling Shao, Jiebo Luo | N/A | |
| Contextual Heterogeneous Graph Network for Human-Object Interaction Detection | Hai Wang, Wei-Shi Zheng, Ling Yingbiao | N/A | |
| Zero-Shot Image Super-Resolution with Depth Guided Internal Degradation Learning | Xi Cheng, Zhenyong Fu, Jian Yang | N/A | |
| A Closest Point Proposal for MCMC-based Probabilistic Surface Registration | Dennis Madsen, Andreas Morel-Forster, Patrick Kahr, Dana Rahbani, Thomas Vetter, Marcel Lüthi | N/A | |
| Interactive Video Object Segmentation Using Global and Local Transfer Modules | Yuk Heo, Yeong Jun Koh, Chang-Su Kim | N/A | |
| End-to-end Interpretable Learning of Non-blind Image Deblurring | Thomas Eboli, Jian Sun, Jean Ponce | N/A | |
| Employing Multi-Estimations for Weakly-Supervised Semantic Segmentation | Junsong Fan, Zhaoxiang Zhang, Tieniu Tan | N/A | |
| Learning Noise-Aware Encoder-Decoder from Noisy Labels by Alternating Back-Propagation for Saliency Detection | Jing Zhang, Jianwen Xie, Nick Barnes | N/A | |
| Rethinking Image Deraining via Rain Streaks and Vapors | Yinglong Wang, Yibing Song, Chao Ma, Bing Zeng | N/A | |
| Finding Non-Uniform Quantization Schemes using Multi-Task Gaussian Processes | Marcelo Gennari do Nascimento, Theo W. Costain, Victor Adrian Prisacariu | N/A | |
| Is Sharing of Egocentric Video Giving Away Your Biometric Signature? | Daksh Thapar, Chetan Arora, Aditya Nigam | N/A | |
| Captioning Images Taken by People Who Are Blind | Danna Gurari, Yinan Zhao, Meng Zhang, Nilavra Bhattacharya | N/A | |
| Improving Semantic Segmentation via Decoupled Body and Edge Supervision | Xiangtai Li, Xia Li, Li Zhang, Guangliang Cheng, Jianping Shi, Zhouchen Lin, Shaohua Tan, Yunhai Tong | N/A | |
| Conditional Entropy Coding for Efficient Video Compression | Jerry Liu, Shenlong Wang, Wei-Chiu Ma, Meet Shah, Rui Hu, Pranaab Dhawan, Raquel Urtasun | N/A | |
| Differentiable Feature Aggregation Search for Knowledge Distillation | Yushuo Guan, Pengyu Zhao, Bingxuan Wang, Yuanxing Zhang, Cong Yao, Kaigui Bian, Jian Tang | N/A | |
| Attention Guided Anomaly Localization in Images | Shashanka Venkataramanan, Kuan-Chuan Peng, Rajat Vikram Singh, Abhijit Mahalanobis | N/A | |
| Self-supervised Video Representation Learning by Pace Prediction | Jiangliu Wang, Jianbo Jiao, Yun-Hui Liu | N/A | |
| Full-Body Awareness from Partial Observations | Chris Rockwell, David F. Fouhey | N/A | |
| Reinforced Axial Refinement Network for Monocular 3D Object Detection | Lijie Liu, Chufan Wu, Jiwen Lu, Lingxi Xie, Jie Zhou, Qi Tian | N/A | |
| Self-Supervised Multi-Task Procedure Learning from Instructional Videos | Ehsan Elhamifar, Dat Huynh | N/A | |
| CosyPose: Consistent multi-view multi-object 6D pose estimation | Yann Labbé, Justin Carpentier, Mathieu Aubry, Josef Sivic | N/A | |
| In-Domain GAN Inversion for Real Image Editing | Jiapeng Zhu, Yujun Shen, Deli Zhao, Bolei Zhou | N/A | |
| Key Frame Proposal Network for Efficient Pose Estimation in Videos | Yuexi Zhang, Yin Wang, Octavia Camps, Mario Sznaier | N/A | |
| Exchangeable Deep Neural Networks for Set-to-Set Matching and Learning | Yuki Saito, Takuma Nakamura, Hirotaka Hachiya, Kenji Fukumizu | N/A | |
| Making Sense of CNNs: Interpreting Deep Representations & Their Invariances with INNs | Robin Rombach, Patrick Esser, Björn Ommer | N/A | |
| Cross-Modal Weighting Network for RGB-D Salient Object Detection | Gongyang Li, Zhi Liu, Linwei Ye, Yang Wang, Haibin Ling | N/A | |
| Open-set Adversarial Defense | Rui Shao, Pramuditha Perera, Pong C. Yuen, Vishal M. Patel | N/A | |
| Deep Image Compression using Decoder Side Information | Sharon Ayzik, Shai Avidan | N/A | |
| Meta-Sim2: Unsupervised Learning of Scene Structure for Synthetic Data Generation | Jeevan Devaranjan, Amlan Kar, Sanja Fidler | N/A | |
| A Generic Visualization Approach for Convolutional Neural Networks | Ahmed Taha, Xitong Yang, Abhinav Shrivastava, Larry Davis | N/A | |
| Interactive Annotation of 3D Object Geometry using 2D Scribbles | Tianchang Shen, Jun Gao, Amlan Kar, Sanja Fidler | N/A | |
| Hierarchical Kinematic Human Mesh Recovery | Georgios Georgakis, Ren Li, Srikrishna Karanam, Terrence Chen, Jana Košecká, Ziyan Wu | N/A | |
| Multi-Loss Rebalancing Algorithm for Monocular Depth Estimation | Jae-Han Lee, Chang-Su Kim | N/A | |
| 3D Bird Reconstruction: a Dataset, Model, and Shape Recovery from a Single View | Marc Badger, Yufu Wang, Adarsh Modh, Ammon Perkes, Nikos Kolotouros, Bernd G. Pfrommer, Marc F. Schmidt, Kostas Daniilidis | N/A | |
| We Have So Much In Common: Modeling Semantic Relational Set Abstractions in Videos | Alex Andonian, Camilo Fosco, Mathew Monfort, Allen Lee, Rogerio Feris, Carl Vondrick, Aude Oliva | N/A | |
| Joint Optimization for Multi-Person Shape Models from Markerless 3D-Scans | Samuel Zeitvogel, Johannes Dornheim, Astrid Laubenheimer | N/A | |
| Accurate RGB-D Salient Object Detection via Collaborative Learning | Wei Ji, Jingjing Li, Miao Zhang, Yongri Piao, Huchuan Lu | N/A | |
| Finding Your (3D) Center: 3D Object Detection Using a Learned Loss | David Griffiths, Jan Boehm, Tobias Ritschel | N/A | |
| Collaborative Training between Region Proposal Localization and Classification for Domain Adaptive Object Detection | Ganlong Zhao, Guanbin Li, Ruijia Xu, Liang Lin | N/A | |
| Two Stream Active Query Suggestion for Active Learning in Connectomics | Zudi Lin, Donglai Wei, Won-Dong Jang, Siyan Zhou, Xupeng Chen, Xueying Wang, Richard Schalek, Daniel Berger, Brian Matejek, Lee Kamentsky, Adi Peleg, Daniel Haehn, Thouis Jones, Toufiq Parag, Jeff Lichtman, Hanspeter Pfister | N/A | |
| Pix2Surf: Learning Parametric 3D Surface Models of Objects from Images | Jiahui Lei, Srinath Sridhar, Paul Guerrero, Minhyuk Sung, Niloy Mitra, Leonidas J. Guibas | N/A | |
| 6D Camera Relocalization in Ambiguous Scenes via Continuous Multimodal Inference | Mai Bui, Tolga Birdal, Haowen Deng, Shadi Albarqouni, Leonidas Guibas, Slobodan Ilic, Nassir Navab | N/A | |
| Modeling Artistic Workflows for Image Generation and Editing | Hung-Yu Tseng, Matthew Fisher, Jingwan Lu, Yijun Li, Vladimir Kim, Ming-Hsuan Yang | N/A | |
| A Large-scale Annotated Mechanical Components Benchmark for Classification and Retrieval Tasks with Deep Neural Networks | Sangpil Kim, Hyung-gun Chi, Xiao Hu, Qixing Huang, Karthik Ramani | N/A | |
| Hidden Footprints: Learning Contextual Walkability from 3D Human Trails | Jin Sun, Hadar Averbuch-Elor, Qianqian Wang, Noah Snavely | N/A | |
| Self-Supervised Learning of Audio-Visual Objects from Video | Triantafyllos Afouras, Andrew Owens, Joon Son Chung, Andrew Zisserman | N/A | |
| GAN-based Garment Generation Using Sewing Pattern Images | Yu Shen, Junbang Liang, Ming C. Lin | N/A | |
| Style Transfer for Co-Speech Gesture Animation: A Multi-Speaker Conditional-Mixture Approach | Chaitanya Ahuja, Dong Won Lee, Yukiko I. Nakano, Louis-Philippe Morency | N/A | |
| An LSTM Approach to Temporal 3D Object Detection in LiDAR Point Clouds | Rui Huang, Wanyue Zhang, Abhijit Kundu, Caroline Pantofaru, David A Ross, Thomas Funkhouser, Alireza Fathi | N/A | |
| Monotonicity Prior for Cloud Tomography | Tamar Loeub, Aviad Levis, Vadim Holodovsky, Yoav Y. Schechner | N/A | |
| Learning Trailer Moments in Full-Length Movies with Co-Contrastive Attention | Lezi Wang, Dong Liu, Rohit Puri, Dimitris N. Metaxas | N/A | |
| Preserving Semantic Neighborhoods for Robust Cross-modal Retrieval | Christopher Thomas, Adriana Kovashka | N/A | |
| Large-scale Pretraining for Visual Dialog: A Simple State-of-the-Art Baseline | Vishvak Murahari, Dhruv Batra, Devi Parikh, Abhishek Das | N/A | |
| Learning to Generate Grounded Visual Captions without Localization Supervision | Chih-Yao Ma, Yannis Kalantidis, Ghassan AlRegib, Peter Vajda, Marcus Rohrbach, Zsolt Kira | N/A | |
| Neural Hair Rendering | Menglei Chai, Jian Ren, Sergey Tulyakov | N/A | |
| JNR: Joint-based Neural Rig Representation for Compact 3D Face Modeling | Noranart Vesdapunt, Mitch Rundle, HsiangTao Wu, Baoyuan Wang | N/A | |
| On Disentangling Spoof Trace for Generic Face Anti-Spoofing | Yaojie Liu, Joel Stehouwer, Xiaoming Liu | N/A | |
| Streaming Object Detection for 3-D Point Clouds | Wei Han, Zhengdong Zhang, Benjamin Caine, Brandon Yang, Christoph Sprunk, Ouais Alsharif, Jiquan Ngiam, Vijay Vasudevan, Jonathon Shlens, Zhifeng Chen | N/A | |
| NAS-DIP: Learning Deep Image Prior with Neural Architecture Search | Yun-Chun Chen, Chen Gao, Esther Robb, Jia-Bin Huang | N/A | |
| Learning to Learn in a Semi-Supervised Fashion | Yun-Chun Chen, Chao-Te Chou, Yu-Chiang Frank Wang | N/A | |
| FeatMatch: Feature-Based Augmentation for Semi-Supervised Learning | Chia-Wen Kuo, Chih-Yao Ma, Jia-Bin Huang, Zsolt Kira | N/A | |
| RadarNet: Exploiting Radar for Robust Perception of Dynamic Objects | Bin Yang, Runsheng Guo, Ming Liang, Sergio Casas, Raquel Urtasun | N/A | |
| Seeing the Un-Scene: Learning Amodal Semantic Maps for Room Navigation | Medhini Narasimhan, Erik Wijmans, Xinlei Chen, Trevor Darrell, Dhruv Batra, Devi Parikh, Amanpreet Singh | N/A | |
| Learning to Separate: Detecting Heavily-Occluded Objects in Urban Scenes | Chenhongyi Yang, Vitaly Ablavsky, Kaihong Wang, Qi Feng, Margrit Betke | N/A | |
| Towards causal benchmarking of bias in face analysis algorithms | Guha Balakrishnan, Yuanjun Xiong, Wei Xia, Pietro Perona | N/A | |
| Learning and Memorizing Representative Prototypes for 3D Point Cloud Semantic and Instance Segmentation | Tong He, Dong Gong, Zhi Tian, Chunhua Shen | N/A | |
| Knowledge-Based Video Question Answering with Unsupervised Scene Descriptions | Noa Garcia, Yuta Nakashima | N/A | |
| Transformation Consistency Regularization – A Semi-Supervised Paradigm for Image-to-Image Translation | Aamir Mustafa, Rafal K. Mantiuk | N/A | |
| LIRA: Lifelong Image Restoration from Unknown Blended Distortions | Jianzhao Liu, Jianxin Lin, Xin Li, Wei Zhou, Sen Liu, Zhibo Chen | N/A | |
| HDNet: Human Depth Estimation for Multi-Person Camera-Space Localization | Jiahao Lin, Gim Hee Lee | N/A | |
| SOLO: Segmenting Objects by Locations | Xinlong Wang, Tao Kong, Chunhua Shen, Yuning Jiang, Lei Li | N/A | |
| Learning to See in the Dark with Events | Song Zhang, Yu Zhang, Zhe Jiang, Dongqing Zou, Jimmy Ren, Bin Zhou | N/A | |
| Trajectron++: Dynamically-Feasible Trajectory Forecasting With Heterogeneous Data | Tim Salzmann, Boris Ivanovic, Punarjay Chakravarty, Marco Pavone | N/A | |
| Context-Gated Convolution | Xudong Lin, Lin Ma, Wei Liu, Shih-Fu Chang | N/A | |
| Polynomial Regression Network for Variable-Number Lane Detection | Bingke Wang, Zilei Wang, Yixin Zhang | N/A | |
| Structural Deep Metric Learning for Room Layout Estimation | Wenzhao Zheng, Jiwen Lu, Jie Zhou | N/A | |
| Adaptive Task Sampling for Meta-Learning | Chenghao Liu, Zhihao Wang, Doyen Sahoo, Yuan Fang, Kun Zhang, Steven C.H. Hoi | N/A | |
| Deep Complementary Joint Model for Complex Scene Registration and Few-shot Segmentation on Medical Images | Yuting He, Tiantian Li, Guanyu Yang, Youyong Kong, Yang Chen, Huazhong Shu, Jean-Louis Coatrieux, Jean-Louis Dillenseger, Shuo Li | N/A | |
| Improving Multispectral Pedestrian Detection by Addressing Modality Imbalance Problems | Kailai Zhou, Linsen Chen, Xun Cao | N/A | |
| High-Resolution Image Inpainting with Iterative Confidence Feedback and Guided Upsampling | Yu Zeng, Zhe Lin, Jimei Yang, Jianming Zhang, Eli Shechtman, Huchuan Lu | N/A | |
| Online Ensemble Model Compression using Knowledge Distillation | Devesh Walawalkar, Zhiqiang Shen, Marios Savvides | N/A | |
| Deep Learning-based Pupil Center Detection for Fast and Accurate Eye Tracking System | Kang Il Lee, Jung Ho Jeon, Byung Cheol Song | N/A | |
| Efficient Residue Number System Based Winograd Convolution | Zhi-Gang Liu, Matthew Mattina | N/A | |
| Robust Tracking against Adversarial Attacks | Shuai Jia, Chao Ma, Yibing Song, Xiaokang Yang | N/A | |
| Single-Shot Neural Relighting and SVBRDF Estimation | Shen Sang, Manmohan Chandraker | N/A | |
| Unsupervised 3D Human Pose Representation with Viewpoint and Pose Disentanglement | Qiang Nie, Ziwei Liu, Yunhui Liu | N/A | |
| Angle-based Search Space Shrinking for Neural Architecture Search | Yiming Hu, Yuding Liang, Zichao Guo, Ruosi Wan, Xiangyu Zhang, Yichen Wei, Qingyi Gu, Jian Sun | N/A | |
| RobustScanner: Dynamically Enhancing Positional Clues for Robust Text Recognition | Xiaoyu Yue, Zhanghui Kuang, Chenhao Lin, Hongbin Sun, Wayne Zhang | N/A | |
| Towards Fast, Accurate and Stable 3D Dense Face Alignment | Jianzhu Guo, Xiangyu Zhu, Yang Yang, Fan Yang, Zhen Lei, Stan Z. Li | N/A | |
| Iterative Feature Transformation for Fast and Versatile Universal Style Transfer | Tai-Yin Chiu, Danna Gurari | N/A | |
| CATCH: Context-based Meta Reinforcement Learning for Transferrable Architecture Search | Xin Chen, Yawen Duan, Zewei Chen, Hang Xu, Zihao Chen, Xiaodan Liang, Tong Zhang, Zhenguo Li | N/A | |
| Toward Faster and Simpler Matrix Normalization via Rank-1 Update | Tan Yu, Yunfeng Cai, Ping Li | N/A | |
| Accurate Polarimetric BRDF for Real Polarization Scene Rendering | Yuhi Kondo, Taishi Ono, Legong Sun, Yasutaka Hirasawa, Jun Murayama | N/A | |
| Lensless Imaging with Focusing Sparse URA Masks in Long-Wave Infrared and its Application for Human Detection | Ilya Reshetouski, Hideki Oyaizu, Kenichiro Nakamura, Ryuta Satoh, Suguru Ushiki, Ryuichi Tadano, Atsushi Ito, Jun Murayama | N/A | |
| Topology-Preserving Class-Incremental Learning | Xiaoyu Tao, Xinyuan Chang, Xiaopeng Hong, Xing Wei, Yihong Gong | N/A | |
| Inter-Image Communication for Weakly Supervised Localization | Xiaolin Zhang, Yunchao Wei, Yi Yang | N/A | |
| UFO²: A Unified Framework towards Omni-supervised Object Detection | Zhongzheng Ren, Zhiding Yu, Xiaodong Yang, Ming-Yu Liu, Alexander G. Schwing, Jan Kautz | N/A | |
| iCaps: An Interpretable Classifier via Disentangled Capsule Networks | Dahuin Jung, Jonghyun Lee, Jihun Yi, Sungroh Yoon | N/A | |
| Detecting Natural Disasters, Damage, and Incidents in the Wild | Ethan Weber, Nuria Marzo, Dim P. Papadopoulos, Aritro Biswas, Agata Lapedriza, Ferda Ofli, Muhammad Imran, Antonio Torralba | N/A | |
| Dynamic ReLU | Yinpeng Chen, Xiyang Dai, Mengchen Liu, Dongdong Chen, Lu Yuan, Zicheng Liu | N/A | |
| Acquiring Dynamic Light Fields through Coded Aperture Camera | Kohei Sakai, Keita Takahashi, Toshiaki Fujii, Hajime Nagahara | N/A | |
| Gait Recognition from a Single Image using a Phase-Aware Gait Cycle Reconstruction Network | Chi Xu, Yasushi Makihara, Xiang Li, Yasushi Yagi, Jianfeng Lu | N/A | |
| Informative Sample Mining Network for Multi-Domain Image-to-Image Translation | Jie Cao, Huaibo Huang, Yi Li, Ran He, Zhenan Sun | N/A | |
| Spherical Feature Transform for Deep Metric Learning | Yuke Zhu, Yan Bai, Yichen Wei | N/A | |
| Semantic Equivalent Adversarial Data Augmentation for Visual Question Answering | Ruixue Tang, Chao Ma, Wei Emma Zhang, Qi Wu, Xiaokang Yang | N/A | |
| Unsupervised Multi-View CNN for Salient View Selection of 3D Objects and Scenes | Ran Song, Wei Zhang, Yitian Zhao, Yonghuai Liu | N/A | |
| Representation Sharing for Fast Object Detector Search and Beyond | Yujie Zhong, Zelu Deng, Sheng Guo, Matthew R. Scott, Weilin Huang | N/A | |
| Peeking into occluded joints: A novel framework for crowd pose estimation | Lingteng Qiu, Xuanye Zhang, Yanran Li, Guanbin Li, Xiaojun Wu, Zixiang Xiong, Xiaoguang Han, Shuguang Cui | N/A | |
| RubiksNet: Learnable 3D-Shift for Efficient Video Action Recognition | Linxi Fan, Shyamal Buch, Guanzhi Wang, Ryan Cao, Yuke Zhu, Juan Carlos Niebles, Li Fei-Fei | N/A | |
| Deep Hashing with Active Pairwise Supervision | Ziwei Wang, Quan Zheng, Jiwen Lu, Jie Zhou | N/A | |
| Graph Edit Distance Reward: Learning to Edit Scene Graph | Lichang Chen, Guosheng Lin, Shijie Wang, Qingyao Wu | N/A | |
| Malleable 2.5D Convolution: Learning Receptive Fields along the Depth-axis for RGB-D Scene Parsing | Yajie Xing, Jingbo Wang, Gang Zeng | N/A | |
| Feature-metric Loss for Self-supervised Learning of Depth and Egomotion | Chang Shu, Kun Yu, Zhixiang Duan, Kuiyuan Yang | N/A | |
| Propagating Over Phrase Relations for One-Stage Visual Grounding | Sibei Yang, Guanbin Li, Yizhou Yu | N/A | |
| Adversarial Semantic Data Augmentation for Human Pose Estimation | Yanrui Bin, Xuan Cao, Xinya Chen, Yanhao Ge, Ying Tai, Chengjie Wang, Jilin Li, Feiyue Huang, Changxin Gao, Nong Sang | N/A | |
| Free View Synthesis | Gernot Riegler, Vladlen Koltun | N/A | |
| Face Anti-Spoofing via Disentangled Representation Learning | Ke-Yue Zhang, Taiping Yao, Jian Zhang, Ying Tai, Shouhong Ding, Jilin Li, Feiyue Huang, Haichuan Song, Lizhuang Ma | N/A | |
| Prime-Aware Adaptive Distillation | Youcai Zhang, Zhonghao Lan, Yuchen Dai, Fangao Zeng, Yan Bai, Jie Chang, Yichen Wei | N/A | |
| Meta-Learning with Network Pruning | Hongduan Tian, Bo Liu, Xiao-Tong Yuan, Qingshan Liu | N/A | |
| Spiral Generative Network for Image Extrapolation | Dongsheng Guo, Hongzhi Liu, Haoru Zhao, Yunhao Cheng, Qingwei Song, Zhaorui Gu, Haiyong Zheng, Bing Zheng | N/A | |
| SceneSketcher: Fine-Grained Image Retrieval with Scene Sketches | Fang Liu, Changqing Zou, Xiaoming Deng, Ran Zuo, Yu-Kun Lai, Cuixia Ma, Yong-Jin Liu, Hongan Wang | N/A | |
| Few-shot Compositional Font Generation with Dual Memory | Junbum Cha, Sanghyuk Chun, Gayoung Lee, Bado Lee, Seonghyeon Kim, Hwalsuk Lee | N/A | |
| PUGeo-Net: A Geometry-centric Network for 3D Point Cloud Upsampling | Yue Qian, Junhui Hou, Sam Kwong, Ying He | N/A | |
| Handcrafted Outlier Detection Revisited | Luca Cavalli, Viktor Larsson, Martin Ralf Oswald, Torsten Sattler, Marc Pollefeys | N/A | |
| The Average Mixing Kernel Signature | Luca Cosmo, Giorgia Minello, Michael Bronstein, Luca Rossi, Andrea Torsello | N/A | |
| BCNet: Learning Body and Cloth Shape from A Single Image | Boyi Jiang, Juyong Zhang, Yang Hong, Jinhao Luo, Ligang Liu, Hujun Bao | N/A | |
| Self-supervised Keypoint Correspondences for Multi-Person Pose Estimation and Tracking in Videos | Umer Rafi, Andreas Doering, Bastian Leibe, Juergen Gall | N/A | |
| Interactive Multi-Dimension Modulation with Dynamic Controllable Residual Learning for Image Restoration | Jingwen He, Chao Dong, Yu Qiao | N/A | |
| Polysemy Deciphering Network for Human-Object Interaction Detection | Xubin Zhong, Changxing Ding, Xian Qu, Dacheng Tao | N/A | |
| PODNet: Pooled Outputs Distillation for Small-Tasks Incremental Learning | Arthur Douillard, Matthieu Cord, Charles Ollion, Thomas Robert, Eduardo Valle | N/A | |
| Learning Graph-Convolutional Representations for Point Cloud Denoising | Francesca Pistilli, Giulia Fracastoro, Diego Valsesia, Enrico Magli | N/A | |
| Semantic Line Detection Using Mirror Attention and Comparative Ranking and Matching | Dongkwon Jin, Jun-Tae Lee, Chang-Su Kim | N/A | |
| A Differentiable Recurrent Surface for Asynchronous Event-Based Data | Marco Cannici, Marco Ciccone, Andrea Romanoni, Matteo Matteucci | N/A | |
| Fine-Grained Visual Classification via Progressive Multi-Granularity Training of Jigsaw Patches | Ruoyi Du, Dongliang Chang, Ayan Kumar Bhunia, Jiyang Xie, Zhanyu Ma, Yi-Zhe Song, Jun Guo | N/A | |
| LiteFlowNet3: Resolving Correspondence Ambiguity for More Accurate Optical Flow Estimation | Tak-Wai Hui, Chen Change Loy | N/A | |
| Microscopy Image Restoration with Deep Wiener-Kolmogorov Filters | Valeriya Pronina, Filippos Kokkinos, Dmitry V. Dylov, Stamatios Lefkimmiatis | N/A | |
| ScanRefer: 3D Object Localization in RGB-D Scans using Natural Language | Dave Zhenyu Chen, Angel X. Chang, Matthias Nießner | N/A | |
| JSENet: Joint Semantic Segmentation and Edge Detection Network for 3D Point Clouds | Zeyu Hu, Mingmin Zhen, Xuyang Bai, Hongbo Fu, Chiew-lan Tai | N/A | |
| Motion-Excited Sampler: Video Adversarial Attack with Sparked Prior | Hu Zhang, Linchao Zhu, Yi Zhu, Yi Yang | N/A | |
| An Inference Algorithm for Multi-Label MRF-MAP Problems with Clique Size 100 | Ishant Shanu, Siddhant Bharti, Chetan Arora, S. N. Maheshwari | N/A | |
| Dual Refinement Underwater Object Detection Network | Baojie Fan, Wei Chen, Yang Cong, Jiandong Tian | N/A | |
| Multiple Sound Sources Localization from Coarse to Fine | Rui Qian, Di Hu, Heinrich Dinkel, Mengyue Wu, Ning Xu, Weiyao Lin | N/A | |
| Task-Aware Quantization Network for JPEG Image Compression | Jinyoung Choi, Bohyung Han | N/A | |
| Energy-Based Models for Deep Probabilistic Regression | Fredrik K. Gustafsson, Martin Danelljan, Goutam Bhat, Thomas B. Schön | N/A | |
| CLOTH3D: Clothed 3D Humans | Hugo Bertiche, Meysam Madadi, Sergio Escalera | N/A | |
| Encoding Structure-Texture Relation with P-Net for Anomaly Detection in Retinal Images | Kang Zhou, Yuting Xiao, Jianlong Yang, Jun Cheng, Wen Liu, Weixin Luo, Zaiwang Gu, Jiang Liu, Shenghua Gao | N/A | |
| CLNet: A Compact Latent Network for Fast Adjusting Siamese Trackers | Xingping Dong, Jianbing Shen, Ling Shao, Fatih Porikli | N/A | |
| Occlusion-Aware Siamese Network for Human Pose Estimation | Lu Zhou, Yingying Chen, Yunze Gao, Jinqiao Wang, Hanqing Lu | N/A | |
| Learning to Predict Salient Faces: A Novel Visual-Audio Saliency Model | Yufan Liu, Minglang Qiao, Mai Xu, Bing Li, Weiming Hu, Ali Borji | N/A | |
| NormalGAN: Learning Detailed 3D Human from a Single RGB-D Image | Lizhen Wang, Xiaochen Zhao, Tao Yu, Songtao Wang, Yebin Liu | N/A | |
| Model-based occlusion disentanglement for image-to-image translation | Fabio Pizzati, Pietro Cerri, Raoul de Charette | N/A | |
| Rotation-robust Intersection over Union for 3D Object Detection | Yu Zheng, Danyang Zhang, Sinan Xie, Jiwen Lu, Jie Zhou | N/A | |
| New Threats against Object Detector with Non-local Block | Yi Huang, Fan Wang, Adams Wai-Kin Kong, Kwok-Yan Lam | N/A | |
| Self-Supervised CycleGAN for Object-Preserving Image-to-Image Domain Adaptation | Xinpeng Xie, Jiawei Chen, Yuexiang Li, Linlin Shen, Kai Ma, Yefeng Zheng | N/A | |
| On the Usage of the Trifocal Tensor in Motion Segmentation | Federica Arrigoni, Luca Magri, Tomas Pajdla | N/A | |
| 3D-Rotation-Equivariant Quaternion Neural Networks | Wen Shen, Binbin Zhang, Shikun Huang, Zhihua Wei, Quanshi Zhang | N/A | |
| InterHand2.6M: A Dataset and Baseline for 3D Interacting Hand Pose Estimation from a Single RGB Image | Gyeongsik Moon, Shoou-I Yu, He Wen, Takaaki Shiratori, Kyoung Mu Lee | N/A | |
| Active Crowd Counting with Limited Supervision | Zhen Zhao, Miaojing Shi, Xiaoxiao Zhao, Li Li | N/A | |
| Self-Supervised Monocular Depth Estimation: Solving the Dynamic Object Problem by Semantic Guidance | Marvin Klingner, Jan-Aike Termöhlen, Jonas Mikolajczyk, Tim Fingscheidt | N/A | |
| Hierarchical Visual-Textual Graph for Temporal Activity Localization via Language | Shaoxiang Chen, Yu-Gang Jiang | N/A | |
| Do Not Mask What You Do Not Need to Mask: a Parser-Free Virtual Try-On | Thibaut Issenhuth, Jérémie Mary, Clément Calauzènes | N/A | |
| NODIS: Neural Ordinary Differential Scene Understanding | Yuren Cong, Hanno Ackermann, Wentong Liao, Michael Ying Yang, Bodo Rosenhahn | N/A | |
| AssembleNet++: Assembling Modality Representations via Attention Connections | Michael S. Ryoo, AJ Piergiovanni, Juhana Kangaspunta, Anelia Angelova | N/A | |
| Learning Propagation Rules for Attribution Map Generation | Yiding Yang, Jiayan Qiu, Mingli Song, Dacheng Tao, Xinchao Wang | N/A | |
| Reparameterizing Convolutions for Incremental Multi-Task Learning without Task Interference | Menelaos Kanakis, David Bruggemann, Suman Saha, Stamatios Georgoulis, Anton Obukhov, Luc Van Gool | N/A | |
| Learning Predictive Models from Observation and Interaction | Karl Schmeckpeper, Annie Xie, Oleh Rybkin, Stephen Tian, Kostas Daniilidis, Sergey Levine, Chelsea Finn | N/A | |
| Unifying Deep Local and Global Features for Image Search | Bingyi Cao, André Araujo, Jack Sim | N/A | |
| Human Body Model Fitting by Learned Gradient Descent | Jie Song, Xu Chen, Otmar Hilliges | N/A | |
| DDGCN: A Dynamic Directed Graph Convolutional Network for Action Recognition | Matthew Korban, Xin Li | N/A | |
| Learning latent representations across multiple data domains using Lifelong VAEGAN | Fei Ye, Adrian G. Bors | N/A | |
| DVI: Depth Guided Video Inpainting for Autonomous Driving | Miao Liao, Feixiang Lu, Dingfu Zhou, Sibo Zhang, Wei Li, Ruigang Yang | N/A | |
| Incorporating Reinforced Adversarial Learning in Autoregressive Image Generation | Kenan E. Ak, Ning Xu, Zhe Lin, Yilin Wang | N/A | |
| APRICOT: A Dataset of Physical Adversarial Attacks on Object Detection | A. Braunegg, Amartya Chakraborty, Michael Krumdick, Nicole Lape, Sara Leary, Keith Manville, Elizabeth Merkhofer, Laura Strickhart, Matthew Walmer | N/A | |
| Visual Question Answering on Image Sets | Ankan Bansal, Yuting Zhang, Rama Chellappa | N/A | |
| Object as Hotspots: An Anchor-Free 3D Object Detection Approach via Firing of Hotspots | Qi Chen, Lin Sun, Zhixin Wang, Kui Jia, Alan Yuille | N/A | |
| Placepedia: Comprehensive Place Understanding with Multi-Faceted Annotations | Huaiyi Huang, Yuqi Zhang, Qingqiu Huang, Zhengkui Guo, Ziwei Liu, Dahua Lin | N/A | |
| DELTAS: Depth Estimation by Learning Triangulation And densification of Sparse points | Ayan Sinha, Zak Murez, James Bartolozzi, Vijay Badrinarayanan, Andrew Rabinovich | N/A | |
| Dynamic Low-light Imaging with Quanta Image Sensors | Yiheng Chi, Abhiram Gnanasambandam, Vladlen Koltun, Stanley H. Chan | N/A | |
| Disambiguating Monocular Depth Estimation with a Single Transient | Mark Nishimura, David B. Lindell, Christopher Metzler, Gordon Wetzstein | N/A | |
| DSDNet: Deep Structured self-Driving Network | Wenyuan Zeng, Shenlong Wang, Renjie Liao, Yun Chen, Bin Yang, Raquel Urtasun | N/A | |
| QuEST: Quantized Embedding Space for Transferring Knowledge | Himalaya Jain, Spyros Gidaris, Nikos Komodakis, Patrick Pérez, Matthieu Cord | N/A | |
| EGDCL: An Adaptive Curriculum Learning Framework for Unbiased Glaucoma Diagnosis | Rongchang Zhao, Xuanlin Chen, Zailiang Chen, Shuo Li | N/A | |
| Backpropagated Gradient Representations for Anomaly Detection | Gukyeong Kwon, Mohit Prabhushankar, Dogancan Temel, Ghassan AlRegib | N/A | |
| Dense RepPoints: Representing Visual Objects with Dense Point Sets | Ze Yang, Yinghao Xu, Han Xue, Zheng Zhang, Raquel Urtasun, Liwei Wang, Stephen Lin, Han Hu | N/A | |
| On Dropping Clusters to Regularize Graph Convolutional Neural Networks | Xikun Zhang, Chang Xu, Dacheng Tao | N/A | |
| Adaptive Video Highlight Detection by Learning from User History | Mrigank Rochan, Mahesh Kumar Krishna Reddy, Linwei Ye, Yang Wang | N/A | |
| Improving 3D Object Detection through Progressive Population Based Augmentation | Shuyang Cheng, Zhaoqi Leng, Ekin Dogus Cubuk, Barret Zoph, Chunyan Bai, Jiquan Ngiam, Yang Song, Benjamin Caine, Vijay Vasudevan, Congcong Li, Quoc V. Le, Jonathon Shlens, Dragomir Anguelov | N/A | |
| DR-KFS: A Differentiable Visual Similarity Metric for 3D Shape Reconstruction | Jiongchao Jin, Akshay Gadi Patil, Zhang Xiong, Hao Zhang | N/A | |
| SPAN: Spatial Pyramid Attention Network for Image Manipulation Localization | Xuefeng Hu, Zhihan Zhang, Zhenye Jiang, Syomantak Chaudhuri, Zhenheng Yang, Ram Nevatia | N/A | |
| Adversarial Learning for Zero-shot Domain Adaptation | Jinghua Wang, Jianmin Jiang | N/A | |
| YOLO in the Dark - Domain Adaptation Method for Merging Multiple Models - | Yukihiro Sasagawa, Hajime Nagahara | N/A | |
| Identity-Aware Multi-Sentence Video Description | Jae Sung Park, Trevor Darrell, Anna Rohrbach | N/A | |
| VQA-LOL: Visual Question Answering under the Lens of Logic | Tejas Gokhale, Pratyay Banerjee, Chitta Baral, Yezhou Yang | N/A | |
| Piggyback GAN: Efficient Lifelong Learning for Image Conditioned Generation | Mengyao Zhai, Lei Chen, Jiawei He, Megha Nawhal, Frederick Tung, Greg Mori | N/A | |
| TRRNet: Tiered Relation Reasoning for Compositional Visual Question Answering | Xiaofeng Yang, Guosheng Lin, Fengmao Lv, Fayao Liu | N/A | |
| Mining Inter-Video Proposal Relations for Video Object Detection | Mingfei Han, Yali Wang, Xiaojun Chang, Yu Qiao | N/A | |
| TVR: A Large-Scale Dataset for Video-Subtitle Moment Retrieval | Jie Lei, Licheng Yu, Tamara L. Berg, Mohit Bansal | N/A | |
| Minimum Class Confusion for Versatile Domain Adaptation | Ying Jin, Ximei Wang, Mingsheng Long, Jianmin Wang | N/A | |
| Large Batch Optimization for Object Detection: Training COCO in 12 Minutes | Tong Wang, Yousong Zhu, Chaoyang Zhao, Wei Zeng, Yaowei Wang, Jinqiao Wang, Ming Tang | N/A | |
| Towards Practical and Efficient High-Resolution HDR Deghosting with CNN | K. Ram Prabhakar, Susmit Agrawal, Durgesh Kumar Singh, Balraj Ashwath, R. Venkatesh Babu | N/A | |
| Monocular Differentiable Rendering for Self-Supervised 3D Object Detection | Deniz Beker, Hiroharu Kato, Mihai Adrian Morariu, Takahiro Ando, Toru Matsuoka, Wadim Kehl, Adrien Gaidon | N/A | |
| Shape Prior Deformation for Categorical 6D Object Pose and Size Estimation | Meng Tian, Marcelo H Ang Jr, Gim Hee Lee | N/A | |
| Dynamic and Static Context-aware LSTM for Multi-agent Motion Prediction | Chaofan Tao, Qinhong Jiang, Lixin Duan, Ping Luo | N/A | |
| Image-based table recognition: data, model, and evaluation | Xu Zhong, Elaheh ShafieiBavani, Antonio Jimeno Yepes | N/A | |
| Group Activity Prediction with Sequential Relational Anticipation Model | Junwen Chen, Wentao Bao, Yu Kong | N/A | |
| PiP: Planning-informed Trajectory Prediction for Autonomous Driving | Haoran Song, Wenchao Ding, Yuxuan Chen, Shaojie Shen, Michael Yu Wang, Qifeng Chen | N/A | |
| PSConv: Squeezing Feature Pyramid into One Compact Poly-Scale Convolutional Layer | Duo Li, Anbang Yao, Qifeng Chen | N/A | |
| Hierarchical Context Embedding for Region-based Object Detection | Zhao-Min Chen, Xin Jin, Borui Zhao, Xiu-Shen Wei, Yanwen Guo | N/A | |
| Attention-Driven Dynamic Graph Convolutional Network for Multi-Label Image Recognition | Jin Ye, Junjun He, Xiaojiang Peng, Wenhao Wu, Yu Qiao | N/A | |
| Gen-LaneNet: A Generalized and Scalable Approach for 3D Lane Detection | Yuliang Guo, Guang Chen, Peitao Zhao, Weide Zhang, Jinghao Miao, Jingao Wang, Tae Eun Choe | N/A | |
| Sparse-to-Dense Depth Completion Revisited: Sampling Strategy and Graph Construction | Xin Xiong, Haipeng Xiong, Ke Xian, Chen Zhao, Zhiguo Cao, Xin Li | N/A | |
| MEAD: A Large-scale Audio-visual Dataset for Emotional Talking-face Generation | Kaisiyuan Wang, Qianyi Wu, Linsen Song, Zhuoqian Yang, Wayne Wu, Chen Qian, Ran He, Yu Qiao, Chen Change Loy | N/A | |
| Detecting Human-Object Interactions with Action Co-occurrence Priors | Dong-Jin Kim, Xiao Sun, Jinsoo Choi, Stephen Lin, In So Kweon | N/A | |
| Learning Connectivity of Neural Networks from a Topological Perspective | Kun Yuan, Quanquan Li, Jing Shao, Junjie Yan | N/A | |
| JSTASR: Joint Size and Transparency-Aware Snow Removal Algorithm Based on Modified Partial Convolution and Veiling Effect Removal | Wei-Ting Chen, Hao-Yu Fang, Jian-Jiun Ding, Cheng-Che Tsai, Sy-Yen Kuo | N/A | |
| Ocean: Object-aware Anchor-free Tracking | Zhipeng Zhang, Houwen Peng, Jianlong Fu, Bing Li, Weiming Hu | N/A | |
| Object Tracking using Spatio-Temporal Networks for Future Prediction Location | Yuan Liu, Ruoteng Li, Yu Cheng, Robby T. Tan, Xiubao Sui | N/A | |
| Pillar-based Object Detection for Autonomous Driving | Yue Wang, Alireza Fathi, Abhijit Kundu, David A. Ross, Caroline Pantofaru, Tom Funkhouser, Justin Solomon | N/A | |
| Sparse Adversarial Attack via Perturbation Factorization | Yanbo Fan, Baoyuan Wu, Tuanhui Li, Yong Zhang, Mingyang Li, Zhifeng Li, Yujiu Yang | N/A | |
| 3D Scene Reconstruction from a Single Viewport | Maximilian Denninger, Rudolph Triebel | N/A | |
| Learning to Optimize Domain Specific Normalization for Domain Generalization | Seonguk Seo, Yumin Suh, Dongwan Kim, Geeho Kim, Jongwoo Han, Bohyung Han | N/A | |
| Self-supervised Outdoor Scene Relighting | Ye Yu, Abhimitra Meka, Mohamed Elgharib, Hans-Peter Seidel, Christian Theobalt, William A. P. Smith | N/A | |
| Privacy Preserving Visual SLAM | Mikiya Shibuya, Shinya Sumikura, Ken Sakurada | N/A | |
| Leveraging Acoustic Images for Effective Self-Supervised Audio Representation Learning | Valentina Sanguineti, Pietro Morerio, Niccolò Pozzetti, Danilo Greco, Marco Cristani, Vittorio Murino | N/A | |
| Learning Joint Visual Semantic Matching Embeddings for Language-guided Retrieval | Yanbei Chen, Loris Bazzani | N/A | |
| Globally Optimal and Efficient Vanishing Point Estimation in Atlanta World | Haoang Li, Pyojin Kim, Ji Zhao, Kyungdon Joo, Zhipeng Cai, Zhe Liu, Yun-Hui Liu | N/A | |
| StyleGAN2 Distillation for Feed-forward Image Manipulation | Yuri Viazovetskyi, Vladimir Ivashkin, Evgeny Kashin | N/A | |
| Self-Prediction for Joint Instance and Semantic Segmentation of Point Clouds | Jinxian Liu, Minghui Yu, Bingbing Ni, Ye Chen | N/A | |
| Learning Disentangled Representations via Mutual Information Estimation | Eduardo Hugo Sanchez, Mathieu Serrurier, Mathias Ortner | N/A | |
| Challenge-Aware RGBT Tracking | Chenglong Li, Lei Liu, Andong Lu, Qing Ji, Jin Tang | N/A | |
| Fully Trainable and Interpretable Non-Local Sparse Models for Image Restoration | Bruno Lecouat, Jean Ponce, Julien Mairal | N/A | |
| AutoSimulate: (Quickly) Learning Synthetic Data Generation | Harkirat Singh Behl, Atilim Güneş Baydin, Ran Gal, Philip H.S. Torr, Vibhav Vineet | N/A | |
| LatticeNet: Towards Lightweight Image Super-resolution with Lattice Block | Xiaotong Luo, Yuan Xie, Yulun Zhang, Yanyun Qu, Cuihua Li, Yun Fu | N/A | |
| Learning from Scale-Invariant Examples for Domain Adaptation in Semantic Segmentation | M. Naseer Subhani, Mohsen Ali | N/A | |
| Active Visual Information Gathering for Vision-Language Navigation | Hanqing Wang, Wenguan Wang, Tianmin Shu, Wei Liang, Jianbing Shen | N/A | |
| Deep Hough-Transform Line Priors | Yancong Lin, Silvia L. Pintea, Jan C. van Gemert | N/A | |
| Unsupervised Shape and Pose Disentanglement for 3D Meshes | Keyang Zhou, Bharat Lal Bhatnagar, Gerard Pons-Moll | N/A | |
| CLAWS: Clustering Assisted Weakly Supervised Learning with Normalcy Suppression for Anomalous Event Detection | Muhammad Zaigham Zaheer, Arif Mahmood, Marcella Astrid, Seung-Ik Lee | N/A | |
| Inclusive GAN: Improving Data and Minority Coverage in Generative Models | Ning Yu, Ke Li, Peng Zhou, Jitendra Malik, Larry Davis, Mario Fritz | N/A | |
| SESAME: Semantic Editing of Scenes by Adding, Manipulating or Erasing Objects | Evangelos Ntavelis, Andrés Romero, Iason Kastanis, Luc Van Gool, Radu Timofte | N/A | |
| Dive Deeper Into Box for Object Detection | Ran Chen, Yong Liu, Mengdan Zhang, Shu Liu, Bei Yu, Yu-Wing Tai | N/A | |
| PG-Net: Pixel to Global Matching Network for Visual Tracking | Bingyan Liao, Chenye Wang, Yayun Wang, Yaonong Wang, Jun Yin | N/A | |
| Why Are Deep Representations Good Perceptual Quality Features? | Taimoor Tariq, Okan Tarhan Tursun, Munchurl Kim, Piotr Didyk | N/A | |
| Geometric Estimation via Robust Subspace Recovery | Aoxiang Fan, Xingyu Jiang, Yang Wang, Junjun Jiang, Jiayi Ma | N/A | |
| Latent Embedding Feedback and Discriminative Features for Zero-Shot Classification | Sanath Narayan, Akshita Gupta, Fahad Shahbaz Khan, Cees G. M. Snoek, Ling Shao | N/A | |
| Human Correspondence Consensus for 3D Object Semantic Understanding | Yujing Lou, Yang You, Chengkun Li, Zhoujun Cheng, Liangwei Li, Lizhuang Ma, Weiming Wang, Cewu Lu | N/A | |
| Learning Memory Augmented Cascading Network for Compressed Sensing of Images | Jiwei Chen, Yubao Sun, Qingshan Liu, Rui Huang | N/A | |
| Least squares surface reconstruction on arbitrary domains | Dizhong Zhu, William A. P. Smith | N/A | |
| Task-conditioned Domain Adaptation for Pedestrian Detection in Thermal Imagery | My Kieu, Andrew D. Bagdanov, Marco Bertini, Alberto del Bimbo | N/A | |
| Improving the Transferability of Adversarial Examples with Resized-Diverse-Inputs, Diversity-Ensemble and Region Fitting | Junhua Zou, Zhisong Pan, Junyang Qiu, Xin Liu, Ting Rui, Wei Li | N/A | |
| DADA: Differentiable Automatic Data Augmentation | Yonggang Li, Guosheng Hu, Yongtao Wang, Timothy Hospedales, Neil M. Robertson, Yongxin Yang | N/A | |
| SceneCAD: Predicting Object Alignments and Layouts in RGB-D Scans | Armen Avetisyan, Tatiana Khanova, Christopher Choy, Denver Dash, Angela Dai, Matthias Nießner | N/A | |
| Kinship Identification through Joint Learning using Kinship Verification Ensembles | Wei Wang, Shaodi You, Theo Gevers | N/A | |
| Kernelized Memory Network for Video Object Segmentation | Hongje Seong, Junhyuk Hyun, Euntai Kim | N/A | |
| A Single Stream Network for Robust and Real-time RGB-D Salient Object Detection | Xiaoqi Zhao, Lihe Zhang, Youwei Pang, Huchuan Lu, Lei Zhang | N/A | |
| Splitting vs. Merging: Mining Object Regions with Discrepancy and Intersection Loss for Weakly Supervised Semantic Segmentation | Tianyi Zhang, Guosheng Lin, Weide Liu, Jianfei Cai, Alex Kot | N/A | |
| Temporal Keypoint Matching and Refinement Network for Pose Estimation and Tracking | Chunluan Zhou, Zhou Ren, Gang Hua | N/A | |
| Neural Point-Based Graphics | Kara-Ali Aliev, Artem Sevastopolsky, Maria Kolos, Dmitry Ulyanov, Victor Lempitsky | N/A | |
| FHDe²Net: Full High Definition Demoireing Network | Bin He, Ce Wang, Boxin Shi, Ling-Yu Duan | N/A | |
| Learning Structural Similarity of User Interface Layouts using Graph Networks | Dipu Manandhar, Dan Ruta, John Collomosse | N/A | |
| NAS-Count: Counting-by-Density with Neural Architecture Search | Yutao Hu, Xiaolong Jiang, Xuhui Liu, Baochang Zhang, Jungong Han, Xianbin Cao, David Doermann | N/A | |
| Towards Generalization Across Depth for Monocular 3D Object Detection | Andrea Simonelli, Samuel Rota Buló, Lorenzo Porzi, Elisa Ricci, Peter Kontschieder | N/A | |
| Margin-Mix: Semi-Supervised Learning for Face Expression Recognition | Corneliu Florea, Mihai Badea, Laura Florea, Andrei Racoviteanu, Constantin Vertan | N/A | |
| Principal Feature Visualisation in Convolutional Neural Networks | Marianne Bakken, Johannes Kvam, Alexey A. Stepanov, Asbjørn Berge | N/A | |
| Progressive Refinement Network for Occluded Pedestrian Detection | Xiaolin Song, Kaili Zhao, Wen-Sheng Chu, Honggang Zhang, Jun Guo | N/A | |
| Monocular Real-Time Volumetric Performance Capture | Ruilong Li, Yuliang Xiu, Shunsuke Saito, Zeng Huang, Kyle Olszewski, Hao Li | N/A | |
| The Mapillary Traffic Sign Dataset for Detection and Classification on a Global Scale | Christian Ertler, Jerneja Mislej, Tobias Ollmann, Lorenzo Porzi, Gerhard Neuhold, Yubin Kuang | N/A | |
| Measuring Generalisation to Unseen Viewpoints, Articulations, Shapes and Objects for 3D Hand Pose Estimation under Hand-Object Interaction | Anil Armagan, Guillermo Garcia-Hernando, Seungryul Baek, Shreyas Hampali, Mahdi Rad, Zhaohui Zhang, Shipeng Xie, MingXiu Chen, Boshen Zhang, Fu Xiong, Yang Xiao, Zhiguo Cao, Junsong Yuan, Pengfei Ren, Weiting Huang, Haifeng Sun, Marek Hrúz, Jakub Kanis, Zdeněk Krňoul, Qingfu Wan, Shile Li, Linlin Yang, Dongheui Lee, Angela Yao, Weiguo Zhou, Sijia Mei, Yunhui Liu, Adrian Spurr, Umar Iqbal, Pavlo Molchanov, Philippe Weinzaepfel, Romain Brégier, Grégory Rogez, Vincent Lepetit, Tae-Kyun Kim | N/A | |
| Disentangling Multiple Features in Video Sequences using Gaussian Processes in Variational Autoencoders | Sarthak Bhagat, Shagun Uppal, Zhuyun Yin, Nengli Lim | N/A | |
| SEN: A Novel Feature Normalization Dissimilarity Measure for Prototypical Few-Shot Learning Networks | Van Nhan Nguyen, Sigurd Løkse, Kristoffer Wickstrøm, Michael Kampffmeyer, Davide Roverso, Robert Jenssen | N/A | |
| Kinematic 3D Object Detection in Monocular Video | Garrick Brazil, Gerard Pons-Moll, Xiaoming Liu, Bernt Schiele | N/A | |
| Describing Unseen Videos via Multi-Modal Cooperative Dialog Agents | Ye Zhu, Yu Wu, Yi Yang, Yan Yan | N/A | |
| SACA Net: Cybersickness Assessment of Individual Viewers for VR Content via Graph-based Symptom Relation Embedding | Sangmin Lee, Jung Uk Kim, Hak Gu Kim, Seongyeop Kim, Yong Man Ro | N/A | |
| End-to-End Low Cost Compressive Spectral Imaging with Spatial-Spectral Self-Attention | Ziyi Meng, Jiawei Ma, Xin Yuan | N/A | |
| Know Your Surroundings: Exploiting Scene Information for Object Tracking | Goutam Bhat, Martin Danelljan, Luc Van Gool, Radu Timofte | N/A | |
| Practical Detection of Trojan Neural Networks: Data-Limited and Data-Free Cases | Ren Wang, Gaoyuan Zhang, Sijia Liu, Pin-Yu Chen, Jinjun Xiong, Meng Wang | N/A | |
| Anatomy-Aware Siamese Network: Exploiting Semantic Asymmetry for Accurate Pelvic Fracture Detection in X-ray Images | Haomin Chen, Yirui Wang, Kang Zheng, Weijian Li, Chi-Tung Chang, Adam P. Harrison, Jing Xiao, Gregory D. Hager, Le Lu, Chien-Hung Liao, Shun Miao | N/A | |
| DeepLandscape: Adversarial Modeling of Landscape Videos | Elizaveta Logacheva, Roman Suvorov, Oleg Khomenko, Anton Mashikhin, Victor Lempitsky | N/A | |
| GANwriting: Content-Conditioned Generation of Styled Handwritten Word Images | Lei Kang, Pau Riba, Yaxing Wang, Marçal Rusiñol, Alicia Fornés, Mauricio Villegas | N/A | |
| Spatial-Angular Interaction for Light Field Image Super-Resolution | Yingqian Wang, Longguang Wang, Jungang Yang, Wei An, Jingyi Yu, Yulan Guo | N/A | |
| BATS: Binary ArchitecTure Search | Adrian Bulat, Brais Martinez, Georgios Tzimiropoulos | N/A | |
| A Closer Look at Local Aggregation Operators in Point Cloud Analysis | Ze Liu, Han Hu, Yue Cao, Zheng Zhang, Xin Tong | N/A | |
| Look here! A parametric learning based approach to redirect visual attention | Youssef A. Mejjati, Celso F. Gomez, Kwang In Kim, Eli Shechtman, Zoya Bylinskii | N/A | |
| Variational Diffusion Autoencoders with Random Walk Sampling | Henry Li, Ofir Lindenbaum, Xiuyuan Cheng, Alexander Cloninger | N/A | |
| Adaptive Variance Based Label Distribution Learning For Facial Age Estimation | Xin Wen, Biying Li, Haiyun Guo, Zhiwei Liu, Guosheng Hu, Ming Tang, Jinqiao Wang | N/A | |
| Connecting the Dots: Detecting Adversarial Perturbations Using Context Inconsistency | Shasha Li, Shitong Zhu, Sudipta Paul, Amit Roy-Chowdhury, Chengyu Song, Srikanth Krishnamurthy, Ananthram Swami, Kevin S Chan | N/A | |
| Perceive, Predict, and Plan: Safe Motion Planning Through Interpretable Semantic Representations | Abbas Sadat, Sergio Casas, Mengye Ren, Xinyu Wu, Pranaab Dhawan, Raquel Urtasun | N/A | |
| VarSR: Variational Super-Resolution Network for Very Low Resolution Images | Sangeek Hyun, Jae-Pil Heo | N/A | |
| Co-Heterogeneous and Adaptive Segmentation from Multi-Source and Multi-Phase CT Imaging Data: A Study on Pathological Liver and Lesion Segmentation | Ashwin Raju, Chi-Tung Cheng, Yuankai Huo, Jinzheng Cai, Junzhou Huang, Jing Xiao, Le Lu, Chien-Hung Liao, Adam P. Harrison | N/A | |
| Towards Recognizing Unseen Categories in Unseen Domains | Massimiliano Mancini, Zeynep Akata, Elisa Ricci, Barbara Caputo | N/A | |
| Square Attack: a query-efficient black-box adversarial attack via random search | Maksym Andriushchenko, Francesco Croce, Nicolas Flammarion, Matthias Hein | N/A | |
| You Are Here: Geolocation by Embedding Maps and Images | Noe Samano, Mengjie Zhou, Andrew Calway | N/A | |
| Segmentations-Leak: Membership Inference Attacks and Defenses in Semantic Image Segmentation | Yang He, Shadi Rahimian, Bernt Schiele, Mario Fritz | N/A | |
| From Image to Stability: Learning Dynamics from Human Pose | Jesse Scott, Bharadwaj Ravichandran, Christopher Funk, Robert T. Collins, Yanxi Liu | N/A | |
| LevelSet R-CNN: A Deep Variational Method for Instance Segmentation | Namdar Homayounfar, Yuwen Xiong, Justin Liang, Wei-Chiu Ma, Raquel Urtasun | N/A | |
| Efficient Scale-Permuted Backbone with Learned Resource Distribution | Xianzhi Du, Tsung-Yi Lin, Pengchong Jin, Yin Cui, Mingxing Tan, Quoc Le, Xiaodan Song | N/A | |
| Reducing Distributional Uncertainty by Mutual Information Maximisation and Transferable Feature Learning | Jian Gao, Yang Hua, Guosheng Hu, Chi Wang, Neil M. Robertson | N/A | |
| Bridging Knowledge Graphs to Generate Scene Graphs | Alireza Zareian, Svebor Karaman, Shih-Fu Chang | N/A | |
| Implicit Latent Variable Model for Scene-Consistent Motion Forecasting | Sergio Casas, Cole Gulino, Simon Suo, Katie Luo, Renjie Liao, Raquel Urtasun | N/A | |
| Learning Visual Commonsense for Robust Scene Graph Generation | Alireza Zareian, Zhecan Wang, Haoxuan You, Shih-Fu Chang | N/A | |
| MPCC: Matching Priors and Conditionals for Clustering | Nicolás Astorga, Pablo Huijse, Pavlos Protopapas, Pablo Estévez | N/A | |
| PointAR: Efficient Lighting Estimation for Mobile Augmented Reality | Yiqin Zhao, Tian Guo | N/A | |
| Discrete Point Flow Networks for Efficient Point Cloud Generation | Roman Klokov, Edmond Boyer, Jakob Verbeek | N/A | |
| Accelerating Deep Learning with Millions of Classes | Zhuoning Yuan, Zhishuai Guo, Xiaotian Yu, Xiaoyu Wang, Tianbao Yang | N/A | |
| Password-conditioned Anonymization and Deanonymization with Face Identity Transformers | Xiuye Gu, Weixin Luo, Michael S. Ryoo, Yong Jae Lee | N/A | |
| Inertial Safety from Structured Light | Sizhuo Ma, Mohit Gupta | N/A | |
| PointTriNet: Learned Triangulation of 3D Point Sets | Nicholas Sharp, Maks Ovsjanikov | N/A | |
| Toward Unsupervised, Multi-Object Discovery in Large-Scale Image Collections | Huy V. Vo, Patrick Pérez, Jean Ponce | N/A | |
| Deep Novel View Synthesis from Colored 3D Point Clouds | Zhenbo Song, Wayne Chen, Dylan Campbell, Hongdong Li | N/A | |
| Consensus-Aware Visual-Semantic Embedding for Image-Text Matching | Haoran Wang, Ying Zhang, Zhong Ji, Yanwei Pang, Lin Ma | N/A | |
| Spatial Hierarchy Aware Residual Pyramid Network for Time-of-Flight Depth Denoising | Guanting Dong, Yueyi Zhang, Zhiwei Xiong | N/A | |
| Sat2Graph: Road Graph Extraction through Graph-Tensor Encoding | Songtao He, Favyen Bastani, Satvat Jagwani, Mohammad Alizadeh, Hari Balakrishnan, Sanjay Chawla, Mohamed M. Elshrif, Samuel Madden, Mohammad Amin Sadeghi | N/A | |
| Cross-Task Transfer for Geotagged Audiovisual Aerial Scene Recognition | Di Hu, Xuhong Li, Lichao Mou, Pu Jin, Dong Chen, Liping Jing, Xiaoxiang Zhu, Dejing Dou | N/A | |
| Polarimetric Multi-View Inverse Rendering | Jinyu Zhao, Yusuke Monno, Masatoshi Okutomi | N/A | |
| SideInfNet: A Deep Neural Network for Semi-Automatic Semantic Segmentation with Side Information | Jing Yu Koh, Duc Thanh Nguyen, Quang-Trung Truong, Sai-Kit Yeung, Alexander Binder | N/A | |
| Improving Face Recognition by Clustering Unlabeled Faces in the Wild | Aruni RoyChowdhury, Xiang Yu, Kihyuk Sohn, Erik Learned-Miller, Manmohan Chandraker | N/A | |
| NeuRoRA: Neural Robust Rotation Averaging | Pulak Purkait, Tat-Jun Chin, Ian Reid | N/A | |
| SG-VAE: Scene Grammar Variational Autoencoder to generate new indoor scenes | Pulak Purkait, Christopher Zach, Ian Reid | N/A | |
| Unsupervised Learning of Optical Flow with Deep Feature Similarity | Woobin Im, Tae-Kyun Kim, Sung-Eui Yoon | N/A | |
| Blended Grammar Network for Human Parsing | Xiaomei Zhang, Yingying Chen, Bingke Zhu, Jinqiao Wang, Ming Tang | N/A | |
| P²Net: Patch-match and Plane-regularization for Unsupervised Indoor Depth Estimation | Zehao Yu, Lei Jin, Shenghua Gao | N/A | |
| Efficient Attention Mechanism for Visual Dialog that can Handle All the Interactions between Multiple Inputs | Van-Quang Nguyen, Masanori Suganuma, Takayuki Okatani | N/A | |
| Adaptive Mixture Regression Network with Local Counting Map for Crowd Counting | Xiyang Liu, Jie Yang, Wenrui Ding, Tieqiang Wang, Zhijin Wang, Junjun Xiong | N/A | |
| BIRNAT: Bidirectional Recurrent Neural Networks with Adversarial Training for Video Snapshot Compressive Imaging | Ziheng Cheng, Ruiying Lu, Zhengjue Wang, Hao Zhang, Bo Chen, Ziyi Meng, Xin Yuan | N/A | |
| Ultra Fast Structure-aware Deep Lane Detection | Zequn Qin, Huanyu Wang, Xi Li | N/A | |
| Cross-Identity Motion Transfer for Arbitrary Objects through Pose-Attentive Video Reassembling | Subin Jeon, Seonghyeon Nam, Seoung Wug Oh, Seon Joo Kim | N/A | |
| Domain Adaptive Object Detection via Asymmetric Tri-way Faster-RCNN | Zhenwei He, Lei Zhang | N/A | |
| Exclusivity-Consistency Regularized Knowledge Distillation for Face Recognition | Xiaobo Wang, Tianyu Fu, Shengcai Liao, Shuo Wang, Zhen Lei, Tao Mei | N/A | |
| Learning Camera-Aware Noise Models | Ke-Chi Chang, Ren Wang, Hung-Jin Lin, Yu-Lun Liu, Chia-Ping Chen, Yu-Lin Chang, Hwann-Tzong Chen | N/A | |
| Towards Precise Completion of Deformable Shapes | Oshri Halimi, Ido Imanuel, Or Litany, Giovanni Trappolini, Emanuele Rodolà, Leonidas Guibas, Ron Kimmel | N/A | |
| Iterative Distance-Aware Similarity Matrix Convolution with Mutual-Supervised Point Elimination for Efficient Point Cloud Registration | Jiahao Li, Changhao Zhang, Ziyao Xu, Hangning Zhou, Chi Zhang | N/A | |
| Pairwise Similarity Knowledge Transfer for Weakly Supervised Object Localization | Amir Rahimi, Amirreza Shaban, Thalaiyasingam Ajanthan, Richard Hartley, Byron Boots | N/A | |
| Environment-agnostic Multitask Learning for Natural Language Grounded Navigation | Xin Eric Wang, Vihan Jain, Eugene Ie, William Yang Wang, Zornitsa Kozareva, Sujith Ravi | N/A | |
| TPFN: Applying Outer Product along Time to Multimodal Sentiment Analysis Fusion on Incomplete Data | Binghua Li, Chao Li, Feng Duan, Ning Zheng, Qibin Zhao | N/A | |
| ProxyNCA++: Revisiting and Revitalizing Proxy Neighborhood Component Analysis | Eu Wern Teh, Terrance DeVries, Graham W. Taylor | N/A | |
| Learning with Privileged Information for Efficient Image Super-Resolution | Wonkyung Lee, Junghyup Lee, Dohyung Kim, Bumsub Ham | N/A | |
| Joint Visual and Temporal Consistency for Unsupervised Domain Adaptive Person Re-Identification | Jianing Li, Shiliang Zhang | N/A | |
| Autoencoder-based Graph Construction for Semi-supervised Learning | Mingeun Kang, Kiwon Lee, Yong H. Lee, Changho Suh | N/A | |
| Virtual Multi-view Fusion for 3D Semantic Segmentation | Abhijit Kundu, Xiaoqi Yin, Alireza Fathi, David Ross, Brian Brewington, Thomas Funkhouser, Caroline Pantofaru | N/A | |
| Decoupling GCN with DropGraph Module for Skeleton-Based Action Recognition | Ke Cheng, Yifan Zhang, Congqi Cao, Lei Shi, Jian Cheng, Hanqing Lu | N/A | |
| Deep Shape from Polarization | Yunhao Ba, Alex Gilbert, Franklin Wang, Jinfa Yang, Rui Chen, Yiqin Wang, Lei Yan, Boxin Shi, Achuta Kadambi | N/A | |
| A Boundary Based Out-of-Distribution Classifier for Generalized Zero-Shot Learning | Xingyu Chen, Xuguang Lan, Fuchun Sun, Nanning Zheng | N/A | |
| Mind the Discriminability: Asymmetric Adversarial Domain Adaptation | Jianfei Yang, Han Zou, Yuxun Zhou, Zhaoyang Zeng, Lihua Xie | N/A | |
| SeqXY2SeqZ: Structure Learning for 3D Shapes by Sequentially Predicting 1D Occupancy Segments From 2D Coordinates | Zhizhong Han, Guanhui Qiao, Yu-Shen Liu, Matthias Zwicker | N/A | |
| Simultaneous Detection and Tracking with Motion Modelling for Multiple Object Tracking | ShiJie Sun, Naveed Akhtar, XiangYu Song, HuanSheng Song, Ajmal Mian, Mubarak Shah | N/A | |
| Deep FusionNet for Point Cloud Semantic Segmentation | Feihu Zhang, Jin Fang, Benjamin Wah, Philip Torr | N/A | |
| Deep Material Recognition in Light-Fields via Disentanglement of Spatial and Angular Information | Bichuan Guo, Jiangtao Wen, Yuxing Han | N/A | |
| Dual Adversarial Network for Deep Active Learning | Shuo Wang, Yuexiang Li, Kai Ma, Ruhui Ma, Haibing Guan, Yefeng Zheng | N/A | |
| Fully Convolutional Networks for Continuous Sign Language Recognition | Ka Leong Cheng, Zhaoyang Yang, Qifeng Chen, Yu-Wing Tai | N/A | |
| Self-adapting confidence estimation for stereo | Matteo Poggi, Filippo Aleotti, Fabio Tosi, Giulio Zaccaroni, Stefano Mattoccia | N/A | |
| Deep Surface Normal Estimation on the 2-Sphere with Confidence Guided Semantic Attention | Quewei Li, Jie Guo, Yang Fei, Qinyu Tang, Wenxiu Sun, Jin Zeng, Yanwen Guo | N/A | |
| AutoSTR: Efficient Backbone Search for Scene Text Recognition | Hui Zhang, Quanming Yao, Mingkun Yang, Yongchao Xu, Xiang Bai | N/A | |
| Mitigating Embedding and Class Assignment Mismatch in Unsupervised Image Classification | Sungwon Han, Sungwon Park, Sungkyu Park, Sundong Kim, Meeyoung Cha | N/A | |
| Adversarial Training with Bi-directional Likelihood Regularization for Visual Classification | Weitao Wan, Jiansheng Chen, Ming-Hsuan Yang | N/A | |
| Faster AutoAugment: Learning Augmentation Strategies Using Backpropagation | Ryuichiro Hataya, Zdenek Jan, Kazuki Yoshizoe, Hideki Nakayama | N/A | |
| Hand-Transformer: Non-Autoregressive Structured Modeling for 3D Hand Pose Estimation | Lin Huang, Jianchao Tan, Ji Liu, Junsong Yuan | N/A | |
| Boundary-Aware Cascade Networks for Temporal Action Segmentation | Zhenzhi Wang, Ziteng Gao, Limin Wang, Zhifeng Li, Gangshan Wu | N/A | |
| Towards Content-Independent Multi-Reference Super-Resolution: Adaptive Pattern Matching and Feature Aggregation | Xu Yan, Weibing Zhao, Kun Yuan, Ruimao Zhang, Zhen Li, Shuguang Cui | N/A | |
| Inference Graphs for CNN Interpretation | Yael Konforti, Alon Shpigler, Boaz Lerner, Aharon Bar-Hillel | N/A | |
| An End-to-End OCR Text Re-organization Sequence Learning for Rich-text Detail Image Comprehension | Liangcheng Li, Feiyu Gao, Jiajun Bu, Yongpan Wang, Zhi Yu, Qi Zheng | N/A | |
| Improving Query Efficiency of Black-box Adversarial Attack | Yang Bai, Yuyuan Zeng, Yong Jiang, Yisen Wang, Shu-Tao Xia, Weiwei Guo | N/A | |
| Self-similarity Student for Partial Label Histopathology Image Segmentation | Hsien-Tzu Cheng, Chun-Fu Yeh, Po-Chen Kuo, Andy Wei, Keng-Chi Liu, Mong-Chi Ko, Kuan-Hua Chao, Yu-Ching Peng, Tyng-Luh Liu | N/A | |
| BioMetricNet: deep unconstrained face verification through learning of metrics regularized onto Gaussian distributions | Arslan Ali, Matteo Testa, Tiziano Bianchi, Enrico Magli | N/A | |
| A Decoupled Learning Scheme for Real-world Burst Denoising from Raw Images | Zhetong Liang, Shi Guo, Hong Gu, Huaqi Zhang, Lei Zhang | N/A | |
| Global-and-Local Relative Position Embedding for Unsupervised Video Summarization | Yunjae Jung, Donghyeon Cho, Sanghyun Woo, In So Kweon | N/A | |
| Real-World Blur Dataset for Learning and Benchmarking Deblurring Algorithms | Jaesung Rim, Haeyun Lee, Jucheol Won, Sunghyun Cho | N/A | |
| SPARK: Spatial-aware Online Incremental Attack Against Visual Tracking | Qing Guo, Xiaofei Xie, Felix Juefei-Xu, Lei Ma, Zhongguo Li, Wanli Xue, Wei Feng, Yang Liu | N/A | |
| CenterNet Heatmap Propagation for Real-time Video Object Detection | Zhujun Xu, Emir Hrustic, Damien Vivet | N/A | |
| Hierarchical Dynamic Filtering Network for RGB-D Salient Object Detection | Youwei Pang, Lihe Zhang, Xiaoqi Zhao, Huchuan Lu | N/A | |
| SOLAR: Second-Order Loss and Attention for Image Retrieval | Tony Ng, Vassileios Balntas, Yurun Tian, Krystian Mikolajczyk | N/A | |
| Fixing Localization Errors to Improve Image Classification | Guolei Sun, Salman Khan, Wen Li, Hisham Cholakkal, Fahad Shahbaz Khan, Luc Van Gool | N/A | |
| PatchPerPix for Instance Segmentation | Lisa Mais, Peter Hirsch, Dagmar Kainmueller | N/A | |
| Attend and Segment: Attention Guided Active Semantic Segmentation | Soroush Seifi, Tinne Tuytelaars | N/A | |
| Accelerating CNN Training by Pruning Activation Gradients | Xucheng Ye, Pengcheng Dai, Junyu Luo, Xin Guo, Yingjie Qi, Jianlei Yang, Yiran Chen | N/A | |
| Global and Local Enhancement Networks for Paired and Unpaired Image Enhancement | Han-Ul Kim, Young Jun Koh, Chang-Su Kim | N/A | |
| Probabilistic Anchor Assignment with IoU Prediction for Object Detection | Kang Kim, Hee Seok Lee | N/A | |
| Eyeglasses 3D shape reconstruction from a single face image | Yating Wang, Quan Wang, Feng Xu | N/A | |
| Temporal Complementary Learning for Video Person Re-Identification | Ruibing Hou, Hong Chang, Bingpeng Ma, Shiguang Shan, Xilin Chen | N/A | |
| HoughNet: Integrating near and long-range evidence for bottom-up object detection | Nermin Samet, Samet Hicsonmez, Emre Akbas | N/A | |
| Graph Wasserstein Correlation Analysis for Movie Retrieval | Xueya Zhang, Tong Zhang, Xiaobin Hong, Zhen Cui, Jian Yang | N/A | |
| Context-Aware RCNN: A Baseline for Action Detection in Videos | Jianchao Wu, Zhanghui Kuang, Limin Wang, Wayne Zhang, Gangshan Wu | N/A | |
| Full-Time Monocular Road Detection Using Zero-Distribution Prior of Angle of Polarization | Ning Li, Yongqiang Zhao, Quan Pan, Seong G. Kong, Jonathan Cheung-Wai Chan | N/A | |
| A Flexible Recurrent Residual Pyramid Network for Video Frame Interpolation | Haoxian Zhang, Yang Zhao, Ronggang Wang | N/A | |
| Learning Enriched Features for Real Image Restoration and Enhancement | Syed Waqas Zamir, Aditya Arora, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, Ming-Hsuan Yang, Ling Shao | N/A | |
| Detail Preserved Point Cloud Completion via Separated Feature Aggregation | Wenxiao Zhang, Qingan Yan, Chunxia Xiao | N/A | |
| LabelEnc: A New Intermediate Supervision Method for Object Detection | Miao Hao, Yitao Liu, Xiangyu Zhang, Jian Sun | N/A | |
| Unsupervised Learning of Category-Specific Symmetric 3D Keypoints from Point Sets | Clara Fernandez-Labrador, Ajad Chhatkuli, Danda Pani Paudel, Jose J. Guerrero, Cédric Demonceaux, Luc Van Gool | N/A | |
| PAMS: Quantized Super-Resolution via Parameterized Max Scale | Huixia Li, Chenqian Yan, Shaohui Lin, Xiawu Zheng, Baochang Zhang, Fan Yang, Rongrong Ji | N/A | |
| SSN: Shape Signature Networks for Multi-class Object Detection from Point Clouds | Xinge Zhu, Yuexin Ma, Tai Wang, Yan Xu, Jianping Shi, Dahua Lin | N/A | |
| OID: Outlier Identifying and Discarding in Blind Image Deblurring | Liang Chen, Faming Fang, Jiawei Zhang, Jun Liu, Guixu Zhang | N/A | |
| Few-Shot Single-View 3-D Object Reconstruction with Compositional Priors | Mateusz Michalkiewicz, Sarah Parisot, Stavros Tsogkas, Mahsa Baktashmotlagh, Anders Eriksson, Eugene Belilovsky | N/A | |
| Enhanced Sparse Model for Blind Deblurring | Liang Chen, Faming Fang, Shen Lei, Fang Li, Guixu Zhang | N/A | |
| SumGraph: Video Summarization via Recursive Graph Modeling | Jungin Park, Jiyoung Lee, Ig-Jae Kim, Kwanghoon Sohn | N/A | |
| Feature Normalized Knowledge Distillation for Image Classification | Kunran Xu, Lai Rui, Yishi Li, Lin Gu | N/A | |
| A Metric Learning Reality Check | Kevin Musgrave, Serge Belongie, Ser-Nam Lim | N/A | |
| FTL: A universal framework for training low-bit DNNs via Feature Transfer | Kunyuan Du, Ya Zhang, Haibing Guan, Qi Tian, Shenggan Cheng, James Lin | N/A | |
| XingGAN for Person Image Generation | Hao Tang, Song Bai, Li Zhang, Philip H.S. Torr, Nicu Sebe | N/A | |
| GATCluster: Self-Supervised Gaussian-Attention Network for Image Clustering | Chuang Niu, Jun Zhang, Ge Wang, Jimin Liang | N/A | |
| VCNet: A Robust Approach to Blind Image Inpainting | Yi Wang, Ying-Cong Chen, Xin Tao, Jiaya Jia | N/A | |
| Learning to Predict Context-adaptive Convolution for Semantic Segmentation | Jianbo Liu, Junjun He, Yu Qiao, Jimmy S. Ren, Hongsheng Li | N/A | |
| EfficientFCN: Holistically-guided Decoding for Semantic Segmentation | Jianbo Liu, Junjun He, Jiawei Zhang, Jimmy S. Ren, Hongsheng Li | N/A | |
| GroSS: Group-Size Series Decomposition for Grouped Architecture Search | Henry Howard-Jenkins, Yiwen Li, Victor Adrian Prisacariu | N/A | |
| Efficient Adversarial Attacks for Visual Object Tracking | Siyuan Liang, Xingxing Wei, Siyuan Yao, Xiaochun Cao | N/A | |
| Globally-Optimal Event Camera Motion Estimation | Xin Peng, Yifu Wang, Ling Gao, Laurent Kneip | N/A | |
| Weakly-supervised Learning of Human Dynamics | Petrissa Zell, Bodo Rosenhahn, Bastian Wandt | N/A | |
| Journey Towards Tiny Perceptual Super-Resolution | Royson Lee, Łukasz Dudziak, Mohamed Abdelfattah, Stylianos I. Venieris, Hyeji Kim, Hongkai Wen, Nicholas D. Lane | N/A | |
| What makes fake images detectable? Understanding properties that generalize | Lucy Chai, David Bau, Ser-Nam Lim, Phillip Isola | N/A | |
| Embedding Propagation: Smoother Manifold for Few-Shot Classification | Pau Rodríguez, Issam Laradji, Alexandre Drouin, Alexandre Lacoste | N/A | |
| Category Level Object Pose Estimation via Neural Analysis-by-Synthesis | Xu Chen, Zijian Dong, Jie Song, Andreas Geiger, Otmar Hilliges | N/A | |
| High-Fidelity Synthesis with Disentangled Representation | Wonkwang Lee, Donggyun Kim, Seunghoon Hong, Honglak Lee | N/A | |
| PL₁P - Point-line Minimal Problems under Partial Visibility in Three Views | Timothy Duff, Kathlén Kohn, Anton Leykin, Tomas Pajdla | N/A | |
| Prediction and Recovery for Adaptive Low-Resolution Person Re-Identification | Ke Han, Yan Huang, Zerui Chen, Liang Wang, Tieniu Tan | N/A | |
| Learning Canonical Representations for Scene Graph to Image Generation | Roei Herzig, Amir Bar, Huijuan Xu, Gal Chechik, Trevor Darrell, Amir Globerson | N/A | |
| Adversarial Robustness on In- and Out-Distribution Improves Explainability | Maximilian Augustin, Alexander Meinke, Matthias Hein | N/A | |
| Deformable Style Transfer | Sunnie S. Y. Kim, Nicholas Kolkin, Jason Salavon, Gregory Shakhnarovich | N/A | |
| Aligning Videos in Space and Time | Senthil Purushwalkam, Tian Ye, Saurabh Gupta, Abhinav Gupta | N/A | |
| Neural Wireframe Renderer: Learning Wireframe to Image Translations | Yuan Xue, Zihan Zhou, Xiaolei Huang | N/A | |
| RBF-Softmax: Learning Deep Representative Prototypes with Radial Basis Function Softmax | Xiao Zhang, Rui Zhao, Yu Qiao, Hongsheng Li | N/A | |
| Testing the Safety of Self-driving Vehicles by Simulating Perception and Prediction | Kelvin Wong, Qiang Zhang, Ming Liang, Bin Yang, Renjie Liao, Abbas Sadat, Raquel Urtasun | N/A | |
| Determining the Relevance of Features for Deep Neural Networks | Christian Reimers, Jakob Runge, Joachim Denzler | N/A | |
| Weakly Supervised Semantic Segmentation with Boundary Exploration | Liyi Chen, Weiwei Wu, Chenchen Fu, Xiao Han, Yuntao Zhang | N/A | |
| GANHopper: Multi-Hop GAN for Unsupervised Image-to-Image Translation | Wallace Lira, Johannes Merz, Daniel Ritchie, Daniel Cohen-Or, Hao Zhang | N/A | |
| DOPE: Distillation Of Part Experts for whole-body 3D pose estimation in the wild | Philippe Weinzaepfel, Romain Brégier, Hadrien Combaluzier, Vincent Leroy, Grégory Rogez | N/A | |
| Multi-view adaptive graph convolutions for graph classification | Nikolas Adaloglou, Nicholas Vretos, Petros Daras | N/A | |
| Instance Adaptive Self-Training for Unsupervised Domain Adaptation | Ke Mei, Chuang Zhu, Jiaqi Zou, Shanghang Zhang | N/A | |
| Weight Decay Scheduling and Knowledge Distillation for Active Learning | Juseung Yun, Byungjoo Kim, Junmo Kim | N/A | |
| HMQ: Hardware Friendly Mixed Precision Quantization Block for CNNs | Hai Victor Habi, Roy H. Jennings, Arnon Netzer | N/A | |
| Truncated Inference for Latent Variable Optimization Problems: Application to Robust Estimation and Learning | Christopher Zach, Huu Le | N/A | |
| Geometry Constrained Weakly Supervised Object Localization | Weizeng Lu, Xi Jia, Weicheng Xie, Linlin Shen, Yicong Zhou, Jinming Duan | N/A | |
| Duality Diagram Similarity: a generic framework for initialization selection in task transfer learning | Kshitij Dwivedi, Jiahui Huang, Radoslaw Martin Cichy, Gemma Roig | N/A | |
| OneGAN: Simultaneous Unsupervised Learning of Conditional Image Generation, Foreground Segmentation, and Fine-Grained Clustering | Yaniv Benny, Lior Wolf | N/A | |
| Mining self-similarity: Label super-resolution with epitomic representations | Nikolay Malkin, Anthony Ortiz, Nebojsa Jojic | N/A | |
| AE-OT-GAN: Training GANs from data specific latent distribution | Dongsheng An, Yang Guo, Min Zhang, Xin Qi, Na Lei, Xianfang Gu | N/A | |
| Null-sampling for Interpretable and Fair Representations | Thomas Kehrenberg, Myles Bartlett, Oliver Thomas, Novi Quadrianto | N/A | |
| Guiding Monocular Depth Estimation Using Depth-Attention Volume | Lam Huynh, Phong Nguyen-Ha, Jiri Matas, Esa Rahtu, Janne Heikkilä | N/A | |
| Tracking Emerges by Looking Around Static Scenes, with Neural 3D Mapping | Adam W. Harley, Shrinidhi Kowshika Lakshmikanth, Paul Schydlo, Katerina Fragkiadaki | N/A | |
| Boosting Weakly Supervised Object Detection with Progressive Knowledge Transfer | Yuanyi Zhong, Jianfeng Wang, Jian Peng, Lei Zhang | N/A | |
| BézierSketch: A generative model for scalable vector sketches | Ayan Das, Yongxin Yang, Timothy Hospedales, Tao Xiang, Yi-Zhe Song | N/A | |
| Semantic Relation Preserving Knowledge Distillation for Image-to-Image Translation | Zeqi Li, Ruowei Jiang, Parham Aarabi | N/A | |
| Domain Adaptation Through Task Distillation | Brady Zhou, Nimit Kalra, Philipp Krähenbühl | N/A | |
| PatchAttack: A Black-box Texture-based Attack with Reinforcement Learning | Chenglin Yang, Adam Kortylewski, Cihang Xie, Yinzhi Cao, Alan Yuille | N/A | |
| More Classifiers, Less Forgetting: A Generic Multi-classifier Paradigm for Incremental Learning | Yu Liu, Sarah Parisot, Gregory Slabaugh, Xu Jia, Ales Leonardis, Tinne Tuytelaars | N/A | |
| Extending and Analyzing Self-Supervised Learning Across Domains | Bram Wallace, Bharath Hariharan | N/A | |
| Multi-Source Open-Set Deep Adversarial Domain Adaptation | Sayan Rakshit, Dipesh Tamboli, Pragati Shuddhodhan Meshram, Biplab Banerjee, Gemma Roig, Subhasis Chaudhuri | N/A | |
| Neural Batch Sampling with Reinforcement Learning for Semi-Supervised Anomaly Detection | Wen-Hsuan Chu, Kris M. Kitani | N/A | |
| LEMMA: A Multi-view Dataset for LEarning Multi-agent Multi-task Activities | Baoxiong Jia, Yixin Chen, Siyuan Huang, Yixin Zhu, Song-Chun Zhu | N/A | |
| Teaching Cameras to Feel: Estimating Tactile Physical Properties of Surfaces From Images | Matthew Purri, Kristin Dana | N/A | |
| Accurate Optimization of Weighted Nuclear Norm for Non-Rigid Structure from Motion | José Pedro Iglesias, Carl Olsson, Marcus Valtonen Örnhag | N/A | |
| Proposal-based Video Completion | Yuan-Ting Hu, Heng Wang, Nicolas Ballas, Kristen Grauman, Alexander G. Schwing | N/A | |
| HGNet: Hybrid Generative Network for Zero-shot Domain Adaptation | Haifeng Xia, Zhengming Ding | N/A | |
| Beyond Monocular Deraining: Stereo Image Deraining via Semantic Understanding | Kaihao Zhang, Wenhan Luo, Wenqi Ren, Jingwen Wang, Fang Zhao, Lin Ma, Hongdong Li | N/A | |
| DBQ: A Differentiable Branch Quantizer for Lightweight Deep Neural Networks | Hassan Dbouk, Hetul Sanghvi, Mahesh Mehendale, Naresh Shanbhag | N/A | |
| All at Once: Temporally Adaptive Multi-Frame Interpolation with Advanced Motion Modeling | Zhixiang Chi, Rasoul Mohammadi Nasiri, Zheng Liu, Juwei Lu, Jin Tang, Konstantinos N Plataniotis | N/A | |
| A Broader Study of Cross-Domain Few-Shot Learning | Yunhui Guo, Noel C. Codella, Leonid Karlinsky, James V. Codella, John R. Smith, Kate Saenko, Tajana Rosing, Rogerio Feris | N/A | |
| Practical Poisoning Attacks on Neural Networks | Junfeng Guo, Cong Liu | N/A | |
| Unsupervised Domain Adaptation in the Dissimilarity Space for Person Re-identification | Djebril Mekhazni, Amran Bhuiyan, George Ekladious, Eric Granger | N/A | |
| Learn distributed GAN with Temporary Discriminators | Hui Qu, Yikai Zhang, Qi Chang, Zhennan Yan, Chao Chen, Dimitris Metaxas | N/A | |
| SemifreddoNets: Partially Frozen Neural Networks for Efficient Computer Vision Systems | Leo F Isikdogan, Bhavin V Nayak, Chyuan-Tyng Wu, Joao Peralta Moreira, Sushma Rao, Gilad Michael | N/A | |
| Improving Adversarial Robustness by Enforcing Local and Global Compactness | Anh Bui, Trung Le, He Zhao, Paul Montague, Olivier deVel, Tamas Abraham, Dinh Phung | N/A | |
| TopoAL: An Adversarial Learning Approach for Topology-Aware Road Segmentation | Subeesh Vasu, Mateusz Kozinski, Leonardo Citraro, Pascal Fua | N/A | |
| Channel selection using Gumbel Softmax | Charles Herrmann, Richard Strong Bowen, Ramin Zabih | N/A | |
| Exploiting Temporal Coherence for Self-Supervised One-shot Video Re-identification | Dripta S. Raychaudhuri, Amit K. Roy-Chowdhury | N/A | |
| An Efficient Training Framework for Reversible Neural Architectures | Zixuan Jiang, Keren Zhu, Mingjie Liu, Jiaqi Gu, David Z. Pan | N/A | |
| Box2Seg: Attention Weighted Loss and Discriminative Feature Learning for Weakly Supervised Segmentation | Viveka Kulharia, Siddhartha Chandra, Amit Agrawal, Philip Torr, Ambrish Tyagi | N/A | |
| FreeCam3D: Snapshot Structured Light 3D with Freely-Moving Cameras | Yicheng Wu, Vivek Boominathan, Xuan Zhao, Jacob T. Robinson, Hiroshi Kawasaki, Aswin Sankaranarayanan, Ashok Veeraraghavan | N/A | |
| One-Pixel Signature: Characterizing CNN Models for Backdoor Detection | Shanjiaoyang Huang, Weiqi Peng, Zhiwei Jia, Zhuowen Tu | N/A | |
| Learning to Transfer Learn: Reinforcement Learning-Based Selection for Adaptive Transfer Learning | Linchao Zhu, Sercan Ö. Arık, Yi Yang, Tomas Pfister | N/A | |
| Structure-Aware Generation Network for Recipe Generation from Images | Hao Wang, Guosheng Lin, Steven C. H. Hoi, Chunyan Miao | N/A | |
| A Simple and Effective Framework for Pairwise Deep Metric Learning | Qi Qi, Yan Yan, Zixuan Wu, Xiaoyu Wang, Tianbao Yang | N/A | |
| Meta-rPPG: Remote Heart Rate Estimation Using a Transductive Meta-Learner | Eugene Lee, Evan Chen, Chen-Yi Lee | N/A | |
| A Recurrent Transformer Network for Novel View Action Synthesis | Kara Marie Schatz, Erik Quintanilla, Shruti Vyas, Yogesh S Rawat | N/A | |
| Multi-view Action Recognition using Cross-view Video Prediction | Shruti Vyas, Yogesh S Rawat, Mubarak Shah | N/A | |
| Learning Discriminative Feature with CRF for Unsupervised Video Object Segmentation | Mingmin Zhen, Shiwei Li, Lei Zhou, Jiaxiang Shang, Haoan Feng, Tian Fang, Long Quan | N/A | |
| SMART: Simultaneous Multi-Agent Recurrent Trajectory Prediction | Sriram N N, Buyu Liu, Francesco Pittaluga, Manmohan Chandraker | N/A | |
| Label-Driven Reconstruction for Domain Adaptation in Semantic Segmentation | Jinyu Yang, Weizhi An, Sheng Wang, Xinliang Zhu, Chaochao Yan, Junzhou Huang | N/A | |
| Efficient Outdoor 3D Point Cloud Semantic Segmentation for Critical Road Objects and Distributed Contexts | Chi-Chong Wong, Chi-Man Vong | N/A | |
| Attributional Robustness Training using Input-Gradient Spatial Alignment | Mayank Singh, Nupur Kumari, Puneet Mangla, Abhishek Sinha, Vineeth N Balasubramanian, Balaji Krishnamurthy | N/A | |
| Reducing the Sim-to-Real Gap for Event Cameras | Timo Stoffregen, Cedric Scheerlinck, Davide Scaramuzza, Tom Drummond, Nick Barnes, Lindsay Kleeman, Robert Mahony | N/A | |
| Spatial Geometric Reasoning for Room Layout Estimation via Deep Reinforcement Learning | Liangliang Ren, Yangyang Song, Jiwen Lu, Jie Zhou | N/A | |
| Learning Data Augmentation Strategies for Object Detection | Barret Zoph, Ekin D. Cubuk, Golnaz Ghiasi, Tsung-Yi Lin, Jonathon Shlens, Quoc V. Le | N/A | |
| DA-NAS: Data Adapted Pruning for Efficient Neural Architecture Search | Xiyang Dai, Dongdong Chen, Mengchen Liu, Yinpeng Chen, Lu Yuan | N/A | |
| A Closer Look at Generalisation in RAVEN | Steven Spratley, Krista Ehinger, Tim Miller | N/A | |
| Supervised Edge Attention Network for Accurate Image Instance Segmentation | Xier Chen, Yanchao Lian, Licheng Jiao, Haoran Wang, YanJie Gao, Shi Lingling | N/A | |
| Discriminative Partial Domain Adversarial Network | Jian Hu, Hongya Tuo, Chao Wang, Lingfeng Qiao, Haowen Zhong, Junchi Yan, Zhongliang Jing, Henry Leung | N/A | |
| Differentiable Programming for Hyperspectral Unmixing using a Physics-based Dispersion Model | John Janiczek, Parth Thaker, Gautam Dasarathy, Christopher S. Edwards, Philip Christensen, Suren Jayasuriya | N/A | |
| Deep Cross-species Feature Learning for Animal Face Recognition via Residual Interspecies Equivariant Network | Xiao Shi, Chenxue Yang, Xue Xia, Xiujuan Chai | N/A | |
| Guidance and Evaluation: Semantic-Aware Image Inpainting for Mixed Scenes | Liang Liao, Jing Xiao, Zheng Wang, Chia-Wen Lin, Shin’ichi Satoh | N/A | |
| Sound2Sight: Generating Visual Dynamics from Sound and Context | Moitreya Chatterjee, Anoop Cherian | N/A | |
| 3D-CVF: Generating Joint Camera and LiDAR Features Using Cross-View Spatial Feature Fusion for 3D Object Detection | Jin Hyeok Yoo, Yecheol Kim, Jisong Kim, Jun Won Choi | N/A | |
| NoiseRank: Unsupervised Label Noise Reduction with Dependence Models | Karishma Sharma, Pinar Donmez, Enming Luo, Yan Liu, I. Zeki Yalniz | N/A | |
| Fast Adaptation to Super-Resolution Networks via Meta-Learning | Seobin Park, Jinsu Yoo, Donghyeon Cho, Jiwon Kim, Tae Hyun Kim | N/A | |
| TP-LSD: Tri-Points Based Line Segment Detector | Siyu Huang, Fangbo Qin, Pengfei Xiong, Ning Ding, Yijia He, Xiao Liu | N/A | |
| SqueezeSegV3: Spatially-Adaptive Convolution for Efficient Point-Cloud Segmentation | Chenfeng Xu, Bichen Wu, Zining Wang, Wei Zhan, Peter Vajda, Kurt Keutzer, Masayoshi Tomizuka | N/A | |
| An Attention-driven Two-stage Clustering Method for Unsupervised Person Re-Identification | Zilong Ji, Xiaolong Zou, Xiaohan Lin, Xiao Liu, Tiejun Huang, Si Wu | N/A | |
| Toward Fine-grained Facial Expression Manipulation | Jun Ling, Han Xue, Li Song, Shuhui Yang, Rong Xie, Xiao Gu | N/A | |
| Adaptive Object Detection with Dual Multi-Label Prediction | Zhen Zhao, Yuhong Guo, Haifeng Shen, Jieping Ye | N/A | |
| Table Structure Recognition using Top-Down and Bottom-Up Cues | Sachin Raja, Ajoy Mondal, C V Jawahar | N/A | |
| Novel View Synthesis on Unpaired Data by Conditional Deformable Variational Auto-Encoder | Mingyu Yin, Li Sun, Qingli Li | N/A | |
| Beyond the Nav-Graph: Vision-and-Language Navigation in Continuous Environments | Jacob Krantz, Erik Wijmans, Arjun Majumdar, Dhruv Batra, Stefan Lee | N/A | |
| Boundary Content Graph Neural Network for Temporal Action Proposal Generation | Yueran Bai, Yingying Wang, Yunhai Tong, Yang Yang, Qiyue Liu, Junhui Liu | N/A | |
| Pose Augmentation: Class-agnostic Object Pose Transformation for Object Recognition | Yunhao Ge, Jiaping Zhao, Laurent Itti | N/A | |
| VLANet: Video-Language Alignment Network for Weakly-Supervised Video Moment Retrieval | Minuk Ma, Sunjae Yoon, Junyeong Kim, Youngjoon Lee, Sunghun Kang, Chang D. Yoo | N/A | |
| Attention-Based Query Expansion Learning | Albert Gordo, Filip Radenovic, Tamara Berg | N/A | |
| Interpretable Foreground Object Search As Knowledge Distillation | Boren Li, Po-Yu Zhuang, Jian Gu, Mingyang Li, Ping Tan | N/A | |
| Improving Knowledge Distillation via Category Structure | Zailiang Chen, Xianxian Zheng, Hailan Shen, Ziyang Zeng, Yukun Zhou, Rongchang Zhao | N/A | |
| High Resolution Zero-Shot Domain Adaptation of Synthetically Rendered Face Images | Stephan J. Garbin, Marek Kowalski, Matthew Johnson, Jamie Shotton | N/A | |
| Attentive Prototype Few-shot Learning with Capsule Network-based Embedding | Fangyu Wu, Jeremy S. Smith, Wenjin Lu, Chaoyi Pang, Bailing Zhang | N/A | |
| Weakly Supervised Instance Segmentation by Learning Annotation Consistent Instances | Aditya Arun, C.V. Jawahar, M. Pawan Kumar | N/A | |
| DA4AD: End-to-End Deep Attention-based Visual Localization for Autonomous Driving | Yao Zhou, Guowei Wan, Shenhua Hou, Li Yu, Gang Wang, Xiaofei Rui, Shiyu Song | N/A | |
| Visual-Relation Conscious Image Generation from Structured-Text | Duc Minh Vo, Akihiro Sugimoto | N/A | |
| Patch-wise Attack for Fooling Deep Neural Network | Lianli Gao, Qilong Zhang, Jingkuan Song, Xianglong Liu, Heng Tao Shen | N/A | |
| Feature Pyramid Transformer | Dong Zhang, Hanwang Zhang, Jinhui Tang, Meng Wang, Xiansheng Hua, Qianru Sun | N/A | |
| MABNet: A Lightweight Stereo Network Based on Multibranch Adjustable Bottleneck Module | Jiabin Xing, Zhi Qi, Jiying Dong, Jiaxuan Cai, Hao Liu | N/A | |
| Guided Saliency Feature Learning for Person Re-identification in Crowded Scenes | Lingxiao He, Wu Liu | N/A | |
| Asymmetric Two-Stream Architecture for Accurate RGB-D Saliency Detection | Miao Zhang, Sun Xiao Fei, Jie Liu, Shuang Xu, Yongri Piao, Huchuan Lu | N/A | |
| Explaining Image Classifiers using Statistical Fault Localization | Youcheng Sun, Hana Chockler, Xiaowei Huang, Daniel Kroening | N/A | |
| Deep Graph Matching via Blackbox Differentiation of Combinatorial Solvers | Michal Rolínek, Paul Swoboda, Dominik Zietlow, Anselm Paulus, Vít Musil, Georg Martius | N/A | |
| Learning Video Representations by Transforming Time | Simon Jenni, Givi Meishvili, Paolo Favaro | N/A | |
| Unsupervised Monocular Depth Estimation for Night-time Images using Adversarial Domain Feature Adaptation | Madhu Vankadari, Sourav Garg, Anima Majumder, Swagat Kumar, Ardhendu Behera | N/A | |
| Variational Connectionist Temporal Classification | Linlin Chao, Jingdong Chen, Wei Chu | N/A | |
| End-to-end Dynamic Matching Network for Multi-view Multi-person 3D Pose Estimation | Congzhentao Huang, Shuai Jiang, Yang Li, Ziyue Zhang, Jason Traish, Chen Deng, Sam Ferguson, Richard Yi Da Xu | N/A | |
| Orderly Disorder in Point Cloud Domain | Morteza Ghahremani, Bernard Tiddeman, Yonghuai Liu, Ardhendu Behera | N/A | |
| Deep Decomposition Learning for Inverse Imaging Problems | Dongdong Chen, Mike E. Davies | N/A | |
| FLOT: Scene Flow on Point Clouds guided by Optimal Transport | Gilles Puy, Alexandre Boulch, Renaud Marlet | N/A | |
| Accurate Reconstruction of Oriented 3D Points using Affine Correspondences | Carolina Raposo, Joao P. Barreto | N/A | |
| Volumetric Transformer Networks | Seungryong Kim, Sabine Süsstrunk, Mathieu Salzmann | N/A | |
| 360° Camera Alignment via Segmentation | Benjamin Davidson, Mohsan S. Alvi, João F. Henriques | N/A | |
| A Novel Line Integral Transform for 2D Affine-Invariant Shape Retrieval | Bin Wang, Yongsheng Gao | N/A | |
| Explanation-based Weakly-supervised Learning of Visual Relations with Graph Networks | Federico Baldassarre, Kevin Smith, Josephine Sullivan, Hossein Azizpour | N/A | |
| Guided Semantic Flow | Sangryul Jeon, Dongbo Min, Seungryong Kim, Jihwan Choe, Kwanghoon Sohn | N/A | |
| Document Structure Extraction using Prior based High Resolution Hierarchical Semantic Segmentation | Mausoom Sarkar, Milan Aggarwal, Arneh Jain, Hiresh Gupta, Balaji Krishnamurthy | N/A | |
| Measuring the Importance of Temporal Features in Video Saliency | Matthias Tangemann, Matthias Kümmerer, Thomas S.A. Wallis, Matthias Bethge | N/A | |
| Searching Efficient 3D Architectures with Sparse Point-Voxel Convolution | Haotian Tang, Zhijian Liu, Shengyu Zhao, Yujun Lin, Ji Lin, Hanrui Wang, Song Han | N/A | |
| Towards Reliable Evaluation of Algorithms for Road Network Reconstruction from Aerial Images | Leonardo Citraro, Mateusz Koziński, Pascal Fua | N/A | |
| Online Continual Learning under Extreme Memory Constraints | Enrico Fini, Stéphane Lathuilière, Enver Sangineto, Moin Nabi, Elisa Ricci | N/A | |
| Learning to Cluster under Domain Shift | Willi Menapace, Stéphane Lathuilière, Elisa Ricci | N/A | |
| Defense Against Adversarial Attacks via Controlling Gradient Leaking on Embedded Manifolds | Yueru Li, Shuyu Cheng, Hang Su, Jun Zhu | N/A | |
| Improving Optical Flow on a Pyramid Level | Markus Hofinger, Samuel Rota Bulò, Lorenzo Porzi, Arno Knapitsch, Thomas Pock, Peter Kontschieder | N/A | |
| Procrustean Regression Networks: Learning 3D Structure of Non-Rigid Objects from 2D Annotations | Sungheon Park, Minsik Lee, Nojun Kwak | N/A | |
| Learning to Learn Parameterized Classification Networks for Scalable Input Images | Duo Li, Anbang Yao, Qifeng Chen | N/A | |
| Stereo Event-based Particle Tracking Velocimetry for 3D Fluid Flow Reconstruction | Yuanhao Wang, Ramzi Idoughi, Wolfgang Heidrich | N/A | |
| Simplicial Complex based Point Correspondence between Images warped onto Manifolds | Charu Sharma, Manohar Kaul | N/A | |
| Representation Learning on Visual-Symbolic Graphs for Video Understanding | Effrosyni Mavroudi, Benjamín Béjar Haro, René Vidal | N/A | |
| Distance-Normalized Unified Representation for Monocular 3D Object Detection | Xuepeng Shi, Zhixiang Chen, Tae-Kyun Kim | N/A | |
| Sequential Deformation for Accurate Scene Text Detection | Shanyu Xiao, Liangrui Peng, Ruijie Yan, Keyu An, Gang Yao, Jaesik Min | N/A | |
| Where to Explore Next? ExHistCNN for History-aware Autonomous 3D Exploration | Yiming Wang, Alessio Del Bue | N/A | |
| Semi-Supervised Segmentation based on Error-Correcting Supervision | Robert Mendel, Luis Antonio de Souza Jr, David Rauber, João Paulo Papa, Christoph Palm | N/A | |
| Quantum-soft QUBO Suppression for Accurate Object Detection | Junde Li, Swaroop Ghosh | N/A | |
| Label-similarity Curriculum Learning | Ürün Dogan, Aniket Anand Deshmukh, Marcin Bronislaw Machura, Christian Igel | N/A | |
| Recurrent Image Annotation With Explicit Inter-Label Dependencies | Ayushi Dutta, Yashaswi Verma, C.V. Jawahar | N/A | |
| Cross-Attention in Coupled Unmixing Nets for Unsupervised Hyperspectral Super-Resolution | Jing Yao, Danfeng Hong, Jocelyn Chanussot, Deyu Meng, Xiaoxiang Zhu, Zongben Xu | N/A | |
| SimPose: Effectively Learning DensePose and Surface Normals of People from Simulated Data | Tyler Zhu, Per Karlsson, Christoph Bregler | N/A | |
| ByeGlassesGAN: Identity Preserving Eyeglasses Removal for Face Images | Yu-Hui Lee, Shang-Hong Lai | N/A | |
| Differentiable Joint Pruning and Quantization for Hardware Efficiency | Ying Wang, Yadong Lu, Tijmen Blankevoort | N/A | |
| Learning to Generate Customized Dynamic 3D Facial Expressions | Rolandos Alexandros Potamias, Jiali Zheng, Stylianos Ploumpis, Giorgos Bouritsas, Evangelos Ververas, Stefanos Zafeiriou | N/A | |
| LandscapeAR: Large Scale Outdoor Augmented Reality by Matching Photographs with Terrain Models Using Learned Descriptors | Jan Brejcha, Michal Lukáč, Yannick Hold-Geoffroy, Oliver Wang, Martin Čadík | N/A | |
| Learning Disentangled Feature Representation for Hybrid-distorted Image Restoration | Xin Li, Xin Jin, Jianxin Lin, Sen Liu, Yaojun Wu, Tao Yu, Wei Zhou, Zhibo Chen | N/A | |
| Jointly De-biasing Face Recognition and Demographic Attribute Estimation | Sixue Gong, Xiaoming Liu, Anil K. Jain | N/A | |
| Regularized Loss for Weakly Supervised Single Class Semantic Segmentation | Olga Veksler | N/A | |
| Spike-FlowNet: Event-based Optical Flow Estimation with Energy-Efficient Hybrid Neural Networks | Chankyu Lee, Adarsh Kumar Kosta, Alex Zihao Zhu, Kenneth Chaney, Kostas Daniilidis, Kaushik Roy | N/A | |
| Forgetting Outside the Box: Scrubbing Deep Networks of Information Accessible from Input-Output Observations | Aditya Golatkar, Alessandro Achille, Stefano Soatto | N/A | |
| Inherent Adversarial Robustness of Deep Spiking Neural Networks: Effects of Discrete Input Encoding and Non-Linear Activations | Saima Sharmin, Nitin Rathi, Priyadarshini Panda, Kaushik Roy | N/A | |
| Synthesizing Coupled 3D Face Modalities by Trunk-Branch Generative Adversarial Networks | Baris Gecer, Alexandros Lattas, Stylianos Ploumpis, Jiankang Deng, Athanasios Papaioannou, Stylianos Moschoglou, Stefanos Zafeiriou | N/A | |
| Learning to Learn Words from Visual Scenes | Dídac Surís, Dave Epstein, Heng Ji, Shih-Fu Chang, Carl Vondrick | N/A | |
| On Transferability of Histological Tissue Labels in Computational Pathology | Mahdi S. Hosseini, Lyndon Chan, Weimin Huang, Yichen Wang, Danial Hasan, Corwyn Rowsell, Savvas Damaskinos, Konstantinos N. Plataniotis | N/A | |
| Learning Actionness via Long-range Temporal Order Verification | Dimitri Zhukov, Jean-Baptiste Alayrac, Ivan Laptev, Josef Sivic | N/A | |
| Fully Embedding Fast Convolutional Networks on Pixel Processor Arrays | Laurie Bose, Piotr Dudek, Jianing Chen, Stephen J. Carey, Walterio W. Mayol-Cuevas | N/A | |
| Character Region Attention For Text Spotting | Youngmin Baek, Seung Shin, Jeonghun Baek, Sungrae Park, Junyeop Lee, Daehyun Nam, Hwalsuk Lee | N/A | |
| Stable Low-rank Tensor Decomposition for Compression of Convolutional Neural Network | Anh-Huy Phan, Konstantin Sobolev, Konstantin Sozykin, Dmitry Ermilov, Julia Gusak, Petr Tichavský, Valeriy Glukhov, Ivan Oseledets, Andrzej Cichocki | N/A | |
| Dual Mixup Regularized Learning for Adversarial Domain Adaptation | Yuan Wu, Diana Inkpen, Ahmed El-Roby | N/A | |
| Robust and On-the-fly Dataset Denoising for Image Classification | Jiaming Song, Yann Dauphin, Michael Auli, Tengyu Ma | N/A | |
| Imaging Behind Occluders Using Two-Bounce Light | Connor Henley, Tomohiro Maeda, Tristan Swedish, Ramesh Raskar | N/A | |
| Improving Object Detection with Selective Self-Supervised Self-Training | Yandong Li, Di Huang, Danfeng Qin, Liqiang Wang, Boqing Gong | N/A | |
| Deep Local Shapes: Learning Local SDF Priors for Detailed 3D Reconstruction | Rohan Chabra, Jan E. Lenssen, Eddy Ilg, Tanner Schmidt, Julian Straub, Steven Lovegrove, Richard Newcombe | N/A | |
| Info3D: Representation Learning on 3D Objects using Mutual Information Maximization and Contrastive Learning | Aditya Sanghi | N/A | |
| Adversarial Data Augmentation via Deformation Statistics | Sahin Olut, Zhengyang Shen, Zhenlin Xu, Samuel Gerber, Marc Niethammer | N/A | |
| Neural Predictor for Neural Architecture Search | Wei Wen, Hanxiao Liu, Yiran Chen, Hai Li, Gabriel Bender, Pieter-Jan Kindermans | N/A | |
| Learning Permutation Invariant Representations using Memory Networks | Shivam Kalra, Mohammed Adnan, Graham Taylor, H.R. Tizhoosh | N/A | |
| Feature Space Augmentation for Long-Tailed Data | Peng Chu, Xiao Bian, Shaopeng Liu, Haibin Ling | N/A | |
| Laying the Foundations of Deep Long-Term Crowd Flow Prediction | Samuel S. Sohn, Honglu Zhou, Seonghyeon Moon, Sejong Yoon, Vladimir Pavlovic, Mubbasir Kapadia | N/A | |
| Weakly-Supervised Action Localization with Expectation-Maximization Multi-Instance Learning | Zhekun Luo, Devin Guillory, Baifeng Shi, Wei Ke, Fang Wan, Trevor Darrell, Huijuan Xu | N/A | |
| Fairness by Learning Orthogonal Disentangled Representations | Mhd Hasan Sarhan, Nassir Navab, Abouzar Eslami, Shadi Albarqouni | N/A | |
| Self-supervision with Superpixels: Training Few-shot Medical Image Segmentation without Annotation | Cheng Ouyang, Carlo Biffi, Chen Chen, Turkay Kart, Huaqi Qiu, Daniel Rueckert | N/A | |
| On Diverse Asynchronous Activity Anticipation | He Zhao, Richard P. Wildes | N/A | |
| Representative-Discriminative Learning for Open-set Land Cover Classification of Satellite Imagery | Razieh Kaviani Baghbaderani, Ying Qu, Hairong Qi, Craig Stutts | N/A | |
| Structure-Aware Human-Action Generation | Ping Yu, Yang Zhao, Chunyuan Li, Junsong Yuan, Changyou Chen | N/A | |
| Towards Efficient Coarse-to-Fine Networks for Action and Gesture Recognition | Niamul Quader, Juwei Lu, Peng Dai, Wei Li | N/A | |
| S³Net: Semantic-Aware Self-supervised Depth Estimation with Monocular Videos and Synthetic Data | Bin Cheng, Inderjot Singh Saggu, Raunak Shah, Gaurav Bansal, Dinesh Bharadia | N/A | |
| Leveraging Seen and Unseen Semantic Relationships for Generative Zero-Shot Learning | Maunil R Vyas, Hemanth Venkateswara, Sethuraman Panchanathan | N/A | |
| Weight Excitation: Built-in Attention Mechanisms in Convolutional Neural Networks | Niamul Quader, Md Mafijul Islam Bhuiyan, Juwei Lu, Peng Dai, Wei Li | N/A | |
| UNITER: UNiversal Image-TExt Representation Learning | Yen-Chun Chen, Linjie Li, Licheng Yu, Ahmed El Kholy, Faisal Ahmed, Zhe Gan, Yu Cheng, Jingjing Liu | N/A | |
| Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks | Xiujun Li, Xi Yin, Chunyuan Li, Pengchuan Zhang, Xiaowei Hu, Lei Zhang, Lijuan Wang, Houdong Hu, Li Dong, Furu Wei, Yejin Choi, Jianfeng Gao | N/A | |
| Improving Face Recognition from Hard Samples via Distribution Distillation Loss | Yuge Huang, Pengcheng Shen, Ying Tai, Shaoxin Li, Xiaoming Liu, Jilin Li, Feiyue Huang, Rongrong Ji | N/A | |
| Extract and Merge: Superpixel Segmentation with Regional Attributes | Jianqiao An, Yucheng Shi, Yahong Han, Meijun Sun, Qi Tian | N/A | |
| Spatial-Adaptive Network for Single Image Denoising | Meng Chang, Qi Li, Huajun Feng, Zhihai Xu | N/A | |
| Physics-based Feature Dehazing Networks | Jiangxin Dong, Jinshan Pan | N/A | |
| Learning Surrogates via Deep Embedding | Yash Patel, Tomáš Hodaň, Jiří Matas | N/A | |
| An Asymmetric Modeling for Action Assessment | Jibin Gao, Wei-Shi Zheng, Jia-Hui Pan, Chengying Gao, Yaowei Wang, Wei Zeng, Jianhuang Lai | N/A | |
| High-quality Single-model Deep Video Compression with Frame-Conv3D and Multi-frame Differential Modulation | Wenyu Sun, Chen Tang, Weigui Li, Zhuqing Yuan, Huazhong Yang, Yongpan Liu | N/A | |
| Instance-Aware Embedding for Point Cloud Instance Segmentation | Tong He, Yifan Liu, Chunhua Shen, Xinlong Wang, Changming Sun | N/A | |
| Self-Paced Deep Regression Forests with Consideration on Underrepresented Examples | Lili Pan, Shijie Ai, Yazhou Ren, Zenglin Xu | N/A | |
| Manifold Projection for Adversarial Defense on Face Recognition | Jianli Zhou, Chao Liang, Jun Chen | N/A | |
| Weakly Supervised Learning with Side Information for Noisy Labeled Images | Lele Cheng, Xiangzeng Zhou, Liming Zhao, Dangwei Li, Hong Shang, Yun Zheng, Pan Pan, Yinghui Xu | N/A | |
| Not only Look, but also Listen: Learning Multimodal Violence Detection under Weak Supervision | Peng Wu, Jing Liu, Yujia Shi, Yujia Sun, Fangtao Shao, Zhaoyang Wu, Zhiwei Yang | N/A | |
| SNE-RoadSeg: Incorporating Surface Normal Information into Semantic Segmentation for Accurate Freespace Detection | Rui Fan, Hengli Wang, Peide Cai, Ming Liu | N/A | |
| Modeling the Space of Point Landmark Constrained Diffeomorphisms | Chengfeng Wen, Yang Guo, Xianfeng Gu | N/A | |
| PieNet: Personalized Image Enhancement Network | Han-Ul Kim, Young Jun Koh, Chang-Su Kim | N/A | |
| Rotational Outlier Identification in Pose Graphs Using Dual Decomposition | Arman Karimian, Ziqi Yang, Roberto Tron | N/A | |
| Speech-driven Facial Animation using Cascaded GANs for Learning of Motion and Texture | Dipanjan Das, Sandika Biswas, Sanjana Sinha, Brojeshwar Bhowmick | N/A | |
| Solving Phase Retrieval with a Learned Reference | Rakib Hyder, Zikui Cai, M. Salman Asif | N/A | |
| Dual Grid Net: Hand Mesh Vertex Regression from Single Depth Maps | Chengde Wan, Thomas Probst, Luc Van Gool, Angela Yao | N/A | |
| Modeling Varying Camera-IMU Time Offset in Optimization-Based Visual-Inertial Odometry | Ling, Yonggen and Bao, Linchao and Jie, Zequn and Zhu, Fengming and Li, Ziyang and Tang, Shanmin and Liu, Yongsheng and Liu, Wei and Zhang, Tong | N/A | |
| Pose Partition Networks for Multi-Person Pose Estimation | Nie, Xuecheng and Feng, Jiashi and Xing, Junliang and Yan, Shuicheng | N/A | |
| Consensus-Driven Propagation in Massive Unlabeled Data for Face Recognition | Zhan, Xiaohang and Liu, Ziwei and Yan, Junjie and Lin, Dahua and Change Loy, Chen | N/A | |
| Open-World Stereo Video Matching with Deep RNN | Zhong, Yiran and Li, Hongdong and Dai, Yuchao | N/A | |
| Deep Cross-Modal Projection Learning for Image-Text Matching | Zhang, Ying and Lu, Huchuan | N/A | |
| Gray-box Adversarial Training | Vivek, B. S. and Reddy Mopuri, Konda and Venkatesh Babu, R. | N/A | |
| Multi-Class Model Fitting by Energy Minimization and Mode-Seeking | Barath, Daniel and Matas, Jiri | N/A | |
| MRF Optimization with Separable Convex Prior on Partially Ordered Labels | Domokos, Csaba and Schmidt, Frank R. and Cremers, Daniel | N/A | |
| VQA-E: Explaining, Elaborating, and Enhancing Your Answers for Visual Questions | Li, Qing and Tao, Qingyi and Joty, Shafiq and Cai, Jianfei and Luo, Jiebo | N/A | |
| Context Refinement for Object Detection | Chen, Zhe and Huang, Shaoli and Tao, Dacheng | N/A | |
| Depth Estimation via Affinity Learned with Convolutional Spatial Propagation Network | Cheng, Xinjing and Wang, Peng and Yang, Ruigang | N/A | |
| Zero-Annotation Object Detection with Web Knowledge Transfer | Tao, Qingyi and Yang, Hao and Cai, Jianfei | N/A | |
| Fast Light Field Reconstruction With Deep Coarse-To-Fine Modeling of Spatial-Angular Clues | Wing Fung Yeung, Henry and Hou, Junhui and Chen, Jie and Ying Chung, Yuk and Chen, Xiaoming | N/A | |
| AGIL: Learning Attention from Human for Visuomotor Tasks | Zhang, Ruohan and Liu, Zhuode and Zhang, Luxin and Whritner, Jake A. and Muller, Karl S. and Hayhoe, Mary M. and Ballard, Dana H. | N/A | |
| Physical Primitive Decomposition | Liu, Zhijian and Freeman, William T. and Tenenbaum, Joshua B. and Wu, Jiajun | N/A | |
| Deep Expander Networks: Efficient Deep Networks from Graph Theory | Prabhu, Ameya and Varma, Girish and Namboodiri, Anoop | N/A | |
| Real-Time MDNet | Jung, Ilchae and Son, Jeany and Baek, Mooyeol and Han, Bohyung | N/A | |
| The Mutex Watershed: Efficient, Parameter-Free Image Partitioning | Wolf, Steffen and Pape, Constantin and Bailoni, Alberto and Rahaman, Nasim and Kreshuk, Anna and Kothe, Ullrich and Hamprecht, Fred A. | N/A | |
| MVSNet: Depth Inference for Unstructured Multi-view Stereo | Yao, Yao and Luo, Zixin and Li, Shiwei and Fang, Tian and Quan, Long | N/A | |
| Audio-Visual Event Localization in Unconstrained Videos | Tian, Yapeng and Shi, Jing and Li, Bochen and Duan, Zhiyao and Xu, Chenliang | N/A | |
| Attend and Rectify: a gated attention mechanism for fine-grained recovery | Rodriguez, Pau and Gonfaus, Josep M. and Cucurull, Guillem and Xavier Roca, F. and Gonzalez, Jordi | N/A | |
| PyramidBox: A Context-assisted Single Shot Face Detector | Tang, Xu and Du, Daniel K. and He, Zeqiang and Liu, Jingtuo | N/A | |
| RT-GENE: Real-Time Eye Gaze Estimation in Natural Environments | Fischer, Tobias and Jin Chang, Hyung and Demiris, Yiannis | N/A | |
| Contemplating Visual Emotions: Understanding and Overcoming Dataset Bias | Panda, Rameswar and Zhang, Jianming and Li, Haoxiang and Lee, Joon-Young and Lu, Xin and Roy-Chowdhury, Amit K. | N/A | |
| Highly-Economized Multi-View Binary Compression for Scalable Image Clustering | Zhang, Zheng and Liu, Li and Qin, Jie and Zhu, Fan and Shen, Fumin and Xu, Yong and Shao, Ling and Tao Shen, Heng | N/A | |
| Deep Kalman Filtering Network for Video Compression Artifact Reduction | Lu, Guo and Ouyang, Wanli and Xu, Dong and Zhang, Xiaoyun and Gao, Zhiyong and Sun, Ming-Ting | N/A | |
| DeepGUM: Learning Deep Robust Regression with a Gaussian-Uniform Mixture Model | Lathuiliere, Stephane and Mesejo, Pablo and Alameda-Pineda, Xavier and Horaud, Radu | N/A | |
| ISNN: Impact Sound Neural Network for Audio-Visual Object Classification | Sterling, Auston and Wilson, Justin and Lowe, Sam and Lin, Ming C. | N/A | |
| Deep Cross-modality Adaptation via Semantics Preserving Adversarial Learning for Sketch-based 3D Shape Retrieval | Chen, Jiaxin and Fang, Yi | N/A | |
| Learning to Blend Photos | Hung, Wei-Chih and Zhang, Jianming and Shen, Xiaohui and Lin, Zhe and Lee, Joon-Young and Yang, Ming-Hsuan | N/A | |
| Second-order Democratic Aggregation | Lin, Tsung-Yu and Maji, Subhransu and Koniusz, Piotr | N/A | |
| Recurrent Fusion Network for Image captioning | Jiang, Wenhao and Ma, Lin and Jiang, Yu-Gang and Liu, Wei and Zhang, Tong | N/A | |
| Grounding Visual Explanations | Anne Hendricks, Lisa and Hu, Ronghang and Darrell, Trevor and Akata, Zeynep | N/A | |
| A Dataset of Flash and Ambient Illumination Pairs from the Crowd | Aksoy, Yagiz and Kim, Changil and Kellnhofer, Petr and Paris, Sylvain and Elgharib, Mohamed and Pollefeys, Marc and Matusik, Wojciech | N/A | |
| Deep Continuous Fusion for Multi-Sensor 3D Object Detection | Liang, Ming and Yang, Bin and Wang, Shenlong and Urtasun, Raquel | N/A | |
| BusterNet: Detecting Copy-Move Image Forgery with Source/Target Localization | Wu, Yue and Abd-Almageed, Wael and Natarajan, Prem | N/A | |
| Parallel Feature Pyramid Network for Object Detection | Kim, Seung-Wook and Kook, Hyong-Keun and Sun, Jee-Young and Kang, Mun-Cheon and Ko, Sung-Jea | N/A | |
| Learning Region Features for Object Detection | Gu, Jiayuan and Hu, Han and Wang, Liwei and Wei, Yichen and Dai, Jifeng | N/A | |
| AMC: AutoML for Model Compression and Acceleration on Mobile Devices | He, Yihui and Lin, Ji and Liu, Zhijian and Wang, Hanrui and Li, Li-Jia and Han, Song | N/A | |
| PSDF Fusion: Probabilistic Signed Distance Function for On-the-fly 3D Data Fusion and Scene Reconstruction | Dong, Wei and Wang, Qiuyuan and Wang, Xin and Zha, Hongbin | N/A | |
| Penalizing Top Performers: Conservative Loss for Semantic Segmentation Adaptation | Zhu, Xinge and Zhou, Hui and Yang, Ceyuan and Shi, Jianping and Lin, Dahua | N/A | |
| Switchable Temporal Propagation Network | Liu, Sifei and Zhong, Guangyu and De Mello, Shalini and Gu, Jinwei and Jampani, Varun and Yang, Ming-Hsuan and Kautz, Jan | N/A | |
| Sampling Algebraic Varieties for Robust Camera Autocalibration | Pani Paudel, Danda and Van Gool, Luc | N/A | |
| Image Reassembly Combining Deep Learning and Shortest Path Problem | Paumard, Marie-Morgane and Picard, David and Tabia, Hedi | N/A | |
| Diverse Conditional Image Generation by Stochastic Regression with Latent Drop-Out Codes | He, Yang and Schiele, Bernt and Fritz, Mario | N/A | |
| Incremental Non-Rigid Structure-from-Motion with Unknown Focal Length | Probst, Thomas and Pani Paudel, Danda and Chhatkuli, Ajad and Van Gool, Luc | N/A | |
| PS-FCN: A Flexible Learning Framework for Photometric Stereo | Chen, Guanying and Han, Kai and Wong, Kwan-Yee K. | N/A | |
| Instance-level Human Parsing via Part Grouping Network | Gong, Ke and Liang, Xiaodan and Li, Yicheng and Chen, Yimin and Yang, Ming and Lin, Liang | N/A | |
| Normalized Blind Deconvolution | Jin, Meiguang and Roth, Stefan and Favaro, Paolo | N/A | |
| Constrained Optimization Based Low-Rank Approximation of Deep Neural Networks | Li, Chong and Richard Shi, C. J. | N/A | |
| Dense Pose Transfer | Neverova, Natalia and Alp Guler, Riza and Kokkinos, Iasonas | N/A | |
| RCAA: Relational Context-Aware Agents for Person Search | Chang, Xiaojun and Huang, Po-Yao and Shen, Yi-Dong and Liang, Xiaodan and Yang, Yi and Hauptmann, Alexander G. | N/A | |
| Deep Discriminative Model for Video Classification | Tavakolian, Mohammad and Hadid, Abdenour | N/A | |
| DeepKSPD: Learning Kernel-matrix-based SPD Representation for Fine-grained Image Recognition | Engin, Melih and Wang, Lei and Zhou, Luping and Liu, Xinwang | N/A | |
| Deep Pictorial Gaze Estimation | Park, Seonwook and Spurr, Adrian and Hilliges, Otmar | N/A | |
| CTAP: Complementary Temporal Action Proposal Generation | Gao, Jiyang and Chen, Kan and Nevatia, Ram | N/A | |
| Neural Network Encapsulation | Li, Hongyang and Guo, Xiaoyang and Dai, Bo and Ouyang, Wanli and Wang, Xiaogang | N/A | |
| Recovering 3D Planes from a Single Image via Convolutional Neural Networks | Yang, Fengting and Zhou, Zihan | N/A | |
| Dist-GAN: An Improved GAN using Distance Constraints | Tran, Ngoc-Trung and Bui, Tuan-Anh and Cheung, Ngai-Man | N/A | |
| Retrospective Encoders for Video Summarization | Zhang, Ke and Grauman, Kristen and Sha, Fei | N/A | |
| Tracking Emerges by Colorizing Videos | Vondrick, Carl and Shrivastava, Abhinav and Fathi, Alireza and Guadarrama, Sergio and Murphy, Kevin | N/A | |
| Task-Aware Image Downscaling | Kim, Heewon and Choi, Myungsub and Lim, Bee and Mu Lee, Kyoung | N/A | |
| Product Quantization Network for Fast Image Retrieval | Yu, Tan and Yuan, Junsong and Fang, Chen and Jin, Hailin | N/A | |
| Supervising the new with the old: learning SFM from SFM | Klodt, Maria and Vedaldi, Andrea | N/A | |
| Towards End-to-End License Plate Detection and Recognition: A Large Dataset and Baseline | Xu, Zhenbo and Yang, Wei and Meng, Ajin and Lu, Nanxue and Huang, Huan and Ying, Changchun and Huang, Liusheng | N/A | |
| Ask, Acquire, and Attack: Data-free UAP Generation using Class Impressions | Reddy Mopuri, Konda and Krishna Uppala, Phani and Venkatesh Babu, R. | N/A | |
| Separating Reflection and Transmission Images in the Wild | Wieschollek, Patrick and Gallo, Orazio and Gu, Jinwei and Kautz, Jan | N/A | |
| Hard-Aware Point-to-Set Deep Metric for Person Re-identification | Yu, Rui and Dou, Zhiyong and Bai, Song and Zhang, Zhaoxiang and Xu, Yongchao and Bai, Xiang | N/A | |
| Cross-Modal and Hierarchical Modeling of Video and Text | Zhang, Bowen and Hu, Hexiang and Sha, Fei | N/A | |
| StarMap for Category-Agnostic Keypoint and Viewpoint Estimation | Zhou, Xingyi and Karpur, Arjun and Luo, Linjie and Huang, Qixing | N/A | |
| Improving DNN Robustness to Adversarial Attacks using Jacobian Regularization | Jakubovitz, Daniel and Giryes, Raja | N/A | |
| RelocNet: Continuous Metric Learning Relocalisation using Neural Nets | Balntas, Vassileios and Li, Shuda and Prisacariu, Victor | N/A | |
| Mancs: A Multi-task Attentional Network with Curriculum Sampling for Person Re-identification | Wang, Cheng and Zhang, Qian and Huang, Chang and Liu, Wenyu and Wang, Xinggang | N/A | |
| Recurrent Tubelet Proposal and Recognition Networks for Action Detection | Li, Dong and Qiu, Zhaofan and Dai, Qi and Yao, Ting and Mei, Tao | N/A | |
| Estimating Depth from RGB and Sparse Sensing | Chen, Zhao and Badrinarayanan, Vijay and Drozdov, Gilad and Rabinovich, Andrew | N/A | |
| Folded Recurrent Neural Networks for Future Video Prediction | Oliu, Marc and Selva, Javier and Escalera, Sergio | N/A | |
| Holistic 3D Scene Parsing and Reconstruction from a Single RGB Image | Huang, Siyuan and Qi, Siyuan and Zhu, Yixin and Xiao, Yinxue and Xu, Yuanlu and Zhu, Song-Chun | N/A | |
| Joint Task-Recursive Learning for Semantic Segmentation and Depth Estimation | Zhang, Zhenyu and Cui, Zhen and Xu, Chunyan and Jie, Zequn and Li, Xiang and Yang, Jian | N/A | |
| A New Large Scale Dynamic Texture Dataset with Application to ConvNet Understanding | Hadji, Isma and Wildes, Richard P. | N/A | |
| Compositing-aware Image Search | Zhao, Hengshuang and Shen, Xiaohui and Lin, Zhe and Sunkavalli, Kalyan and Price, Brian and Jia, Jiaya | N/A | |
| Extreme Network Compression via Filter Group Approximation | Peng, Bo and Tan, Wenming and Li, Zheyang and Zhang, Shun and Xie, Di and Pu, Shiliang | N/A | |
| Audio-Visual Scene Analysis with Self-Supervised Multisensory Features | Owens, Andrew and Efros, Alexei A. | N/A | |
| Look Before You Leap: Bridging Model-Free and Model-Based Reinforcement Learning for Planned-Ahead Vision-and-Language Navigation | Wang, Xin and Xiong, Wenhan and Wang, Hongmin and Yang Wang, William | N/A | |
| Structure-from-Motion-Aware PatchMatch for Adaptive Optical Flow Estimation | Maurer, Daniel and Marniok, Nico and Goldluecke, Bastian and Bruhn, Andres | N/A | |
| ShuffleNet V2: Practical Guidelines for Efficient CNN Architecture Design | Ma, Ningning and Zhang, Xiangyu and Zheng, Hai-Tao and Sun, Jian | N/A | |
| Attention-GAN for Object Transfiguration in Wild Images | Chen, Xinyuan and Xu, Chang and Yang, Xiaokang and Tao, Dacheng | N/A | |
| Joint Representation and Truncated Inference Learning for Correlation Filter based Tracking | Yao, Yingjie and Wu, Xiaohe and Zhang, Lei and Shan, Shiguang and Zuo, Wangmeng | N/A | |
| Lambda Twist: An Accurate Fast Robust Perspective Three Point (P3P) Solver | Persson, Mikael and Nordberg, Klas | N/A | |
| StereoNet: Guided Hierarchical Refinement for Real-Time Edge-Aware Depth Prediction | Khamis, Sameh and Fanello, Sean and Rhemann, Christoph and Kowdle, Adarsh and Valentin, Julien and Izadi, Shahram | N/A | |
| Robust Optical Flow in Rainy Scenes | Li, Ruoteng and Tan, Robby T. and Cheong, Loong-Fah | N/A | |
| Scale Aggregation Network for Accurate and Efficient Crowd Counting | Cao, Xinkun and Wang, Zhipeng and Zhao, Yanyun and Su, Fei | N/A | |
| Deep Feature Factorization For Concept Discovery | Collins, Edo and Achanta, Radhakrishna and Susstrunk, Sabine | N/A | |
| Object-centered image stitching | Herrmann, Charles and Wang, Chen and Strong Bowen, Richard and Keyder, Emil and Zabih, Ramin | N/A | |
| A Style-Aware Content Loss for Real-time HD Style Transfer | Sanakoyeu, Artsiom and Kotovenko, Dmytro and Lang, Sabine and Ommer, Bjorn | N/A | |
| Recurrent Squeeze-and-Excitation Context Aggregation Net for Single Image Deraining | Li, Xia and Wu, Jianlong and Lin, Zhouchen and Liu, Hong and Zha, Hongbin | N/A | |
| Acquisition of Localization Confidence for Accurate Object Detection | Jiang, Borui and Luo, Ruixuan and Mao, Jiayuan and Xiao, Tete and Jiang, Yuning | N/A | |
| Joint 3D Face Reconstruction and Dense Alignment with Position Map Regression Network | Feng, Yao and Wu, Fan and Shao, Xiaohu and Wang, Yanfeng and Zhou, Xi | N/A | |
| Salient Objects in Clutter: Bringing Salient Object Detection to the Foreground | Fan, Deng-Ping and Cheng, Ming-Ming and Liu, Jiang-Jiang and Gao, Shang-Hua and Hou, Qibin and Borji, Ali | N/A | |
| Multimodal Unsupervised Image-to-image Translation | Huang, Xun and Liu, Ming-Yu and Belongie, Serge and Kautz, Jan | N/A | |
| Diverse feature visualizations reveal invariances in early layers of deep neural networks | Cadena, Santiago A. and Weis, Marissa A. and Gatys, Leon A. and Bethge, Matthias and Ecker, Alexander S. | N/A | |
| "Factual" or "Emotional": Stylized Image Captioning with Adaptive Learning and Attention | Chen, Tianlang and Zhang, Zhongping and You, Quanzeng and Fang, Chen and Wang, Zhaowen and Jin, Hailin and Luo, Jiebo | N/A | |
| Deblurring Natural Image Using Super-Gaussian Fields | Liu, Yuhang and Dong, Wenyong and Gong, Dong and Zhang, Lei and Shi, Qinfeng | N/A | |
| Dense Semantic and Topological Correspondence of 3D Faces without Landmarks | Fan, Zhenfeng and Hu, Xiyuan and Chen, Chen and Peng, Silong | N/A | |
| OmniDepth: Dense Depth Estimation for Indoors Spherical Panoramas | Zioulis, Nikolaos and Karakottas, Antonis and Zarpalas, Dimitrios and Daras, Petros | N/A | |
| On Regularized Losses for Weakly-supervised CNN Segmentation | Tang, Meng and Perazzi, Federico and Djelouah, Abdelaziz and Ben Ayed, Ismail and Schroers, Christopher and Boykov, Yuri | N/A | |
| Learning Dynamic Memory Networks for Object Tracking | Yang, Tianyu and Chan, Antoni B. | N/A | |
| Zero-Shot Deep Domain Adaptation | Peng, Kuan-Chuan and Wu, Ziyan and Ernst, Jan | N/A | |
| SphereNet: Learning Spherical Representations for Detection and Classification in Omnidirectional Images | Coors, Benjamin and Paul Condurache, Alexandru and Geiger, Andreas | N/A | |
| Graininess-Aware Deep Feature Learning for Pedestrian Detection | Lin, Chunze and Lu, Jiwen and Wang, Gang and Zhou, Jie | N/A | |
| Learning to Forecast and Refine Residual Motion for Image-to-Video Generation | Zhao, Long and Peng, Xi and Tian, Yu and Kapadia, Mubbasir and Metaxas, Dimitris | N/A | |
| ML-LocNet: Improving Object Localization with Multi-view Learning Network | Zhang, Xiaopeng and Yang, Yang and Feng, Jiashi | N/A | |
| Statistically-motivated Second-order Pooling | Yu, Kaicheng and Salzmann, Mathieu | N/A | |
| Improving Generalization via Scalable Neighborhood Component Analysis | Wu, Zhirong and Efros, Alexei A. and Yu, Stella X. | N/A | |
| Monocular Depth Estimation with Affinity, Vertical Pooling, and Label Enhancement | Gan, Yukang and Xu, Xiangyu and Sun, Wenxiu and Lin, Liang | N/A | |
| Learning to Anonymize Faces for Privacy Preserving Action Detection | Ren, Zhongzheng and Jae Lee, Yong and Ryoo, Michael S. | N/A | |
| Distractor-aware Siamese Networks for Visual Object Tracking | Zhu, Zheng and Wang, Qiang and Li, Bo and Wu, Wei and Yan, Junjie and Hu, Weiming | N/A | |
| Question Type Guided Attention in Visual Question Answering | Shi, Yang and Furlanello, Tommaso and Zha, Sheng and Anandkumar, Animashree | N/A | |
| Escaping from Collapsing Modes in a Constrained Space | Chang, Chia-Che and Hubert Lin, Chieh and Lee, Che-Rung and Juan, Da-Cheng and Wei, Wei and Chen, Hwann-Tzong | N/A | |
| Light Structure from Pin Motion: Simple and Accurate Point Light Calibration for Physics-based Modeling | Santo, Hiroaki and Waechter, Michael and Samejima, Masaki and Sugano, Yusuke and Matsushita, Yasuyuki | N/A | |
| Bayesian Semantic Instance Segmentation in Open Set World | Pham, Trung and Kumar, Vijay B. G. and Do, Thanh-Toan and Carneiro, Gustavo and Reid, Ian | N/A | |
| HybridNet: Classification and Reconstruction Cooperation for Semi-Supervised Learning | Robert, Thomas and Thome, Nicolas and Cord, Matthieu | N/A | |
| Uncertainty Estimates and Multi-Hypotheses Networks for Optical Flow | Ilg, Eddy and Cicek, Ozgun and Galesso, Silvio and Klein, Aaron and Makansi, Osama and Hutter, Frank and Brox, Thomas | N/A | |
| Discriminative Region Proposal Adversarial Networks for High-Quality Image-to-Image Translation | Wang, Chao and Zheng, Haiyong and Yu, Zhibin and Zheng, Ziqiang and Gu, Zhaorui and Zheng, Bing | N/A | |
| Transductive Semi-Supervised Deep Learning using Min-Max Features | Shi, Weiwei and Gong, Yihong and Ding, Chris and Ma, Zhiheng and Tao, Xiaoyu and Zheng, Nanning | N/A | |
| Interpolating Convolutional Neural Networks Using Batch Normalization | Wesley Putra Data, Gratianus and Ngu, Kirjon and William Murray, David and Adrian Prisacariu, Victor | N/A | |
| Learning Blind Video Temporal Consistency | Lai, Wei-Sheng and Huang, Jia-Bin and Wang, Oliver and Shechtman, Eli and Yumer, Ersin and Yang, Ming-Hsuan | N/A | |
| Learning Class Prototypes via Structure Alignment for Zero-Shot Recognition | Jiang, Huajie and Wang, Ruiping and Shan, Shiguang and Chen, Xilin | N/A | |
| Fine-grained Video Categorization with Redundancy Reduction Attention | Zhu, Chen and Tan, Xiao and Zhou, Feng and Liu, Xiao and Yue, Kaiyu and Ding, Errui and Ma, Yi | N/A | |
| Object Detection in Video with Spatiotemporal Sampling Networks | Bertasius, Gedas and Torresani, Lorenzo and Shi, Jianbo | N/A | |
| Graph Distillation for Action Detection with Privileged Modalities | Luo, Zelun and Hsieh, Jun-Ting and Jiang, Lu and Niebles, Juan Carlos and Fei-Fei, Li | N/A | |
| Efficient Uncertainty Estimation for Semantic Segmentation in Videos | Huang, Po-Yu and Hsu, Wan-Ting and Chiu, Chun-Yueh and Wu, Ting-Fan and Sun, Min | N/A | |
| Saliency Preservation in Low-Resolution Grayscale Images | Yohanandan, Shivanthan and Song, Andy and Dyer, Adrian G. and Tao, Dacheng | N/A | |
| Polarimetric Three-View Geometry | Chen, Lixiong and Zheng, Yinqiang and Subpa-asa, Art and Sato, Imari | N/A | |
| Deep Imbalanced Attribute Classification using Visual Attention Aggregation | Sarafianos, Nikolaos and Xu, Xiang and Kakadiaris, Ioannis A. | N/A | |
| Adding Attentiveness to the Neurons in Recurrent Neural Networks | Zhang, Pengfei and Xue, Jianru and Lan, Cuiling and Zeng, Wenjun and Gao, Zhanning and Zheng, Nanning | N/A | |
| Seeing Deeply and Bidirectionally: A Deep Learning Approach for Single Image Reflection Removal | Yang, Jie and Gong, Dong and Liu, Lingqiao and Shi, Qinfeng | N/A | |
| Fast and Accurate Camera Covariance Computation for Large 3D Reconstruction | Polic, Michal and Forstner, Wolfgang and Pajdla, Tomas | N/A | |
| Dynamic Multimodal Instance Segmentation Guided by Natural Language Queries | Margffoy-Tuay, Edgar and Perez, Juan C. and Botero, Emilio and Arbelaez, Pablo | N/A | |
| Learning SO(3) Equivariant Representations with Spherical CNNs | Esteves, Carlos and Allen-Blanchette, Christine and Makadia, Ameesh and Daniilidis, Kostas | N/A | |
| Out-of-Distribution Detection Using an Ensemble of Self Supervised Leave-out Classifiers | Vyas, Apoorv and Jammalamadaka, Nataraj and Zhu, Xia and Das, Dipankar and Kaul, Bharat and Willke, Theodore L. | N/A | |
| Interaction-aware Spatio-temporal Pyramid Attention Networks for Action Classification | Du, Yang and Yuan, Chunfeng and Li, Bing and Zhao, Lili and Li, Yangxi and Hu, Weiming | N/A | |
| T2Net: Synthetic-to-Realistic Translation for Solving Single-Image Depth Estimation Tasks | Zheng, Chuanxia and Cham, Tat-Jen and Cai, Jianfei | N/A | |
| Video Object Detection with an Aligned Spatial-Temporal Memory | Xiao, Fanyi and Jae Lee, Yong | N/A | |
| CGIntrinsics: Better Intrinsic Image Decomposition through Physically-Based Rendering | Li, Zhengqi and Snavely, Noah | N/A | |
| Partial Adversarial Domain Adaptation | Cao, Zhangjie and Ma, Lijia and Long, Mingsheng and Wang, Jianmin | N/A | |
| Diverse and Coherent Paragraph Generation from Images | Chatterjee, Moitreya and Schwing, Alexander G. | N/A | |
| Diverse Image-to-Image Translation via Disentangled Representations | Lee, Hsin-Ying and Tseng, Hung-Yu and Huang, Jia-Bin and Singh, Maneesh and Yang, Ming-Hsuan | N/A | |
| BOP: Benchmark for 6D Object Pose Estimation | Hodan, Tomas and Michel, Frank and Brachmann, Eric and Kehl, Wadim and Glent Buch, Anders and Kraft, Dirk and Drost, Bertram and Vidal, Joel and Ihrke, Stephan and Zabulis, Xenophon and Sahin, Caner and Manhardt, Fabian and Tombari, Federico and Kim, Tae-Kyun and Matas, Jiri and Rother, Carsten | N/A | |
| Generative Domain-Migration Hashing for Sketch-to-Image Retrieval | Zhang, Jingyi and Shen, Fumin and Liu, Li and Zhu, Fan and Yu, Mengyang and Shao, Ling and Tao Shen, Heng and Van Gool, Luc | N/A | |
| Multimodal image alignment through a multiscale chain of neural networks with application to remote sensing | Zampieri, Armand and Charpiat, Guillaume and Girard, Nicolas and Tarabalka, Yuliya | N/A | |
| FloorNet: A Unified Framework for Floorplan Reconstruction from 3D Scans | Liu, Chen and Wu, Jiaye and Furukawa, Yasutaka | N/A | |
| Unsupervised Hard Example Mining from Videos for Improved Object Detection | Jin, SouYoung and RoyChowdhury, Aruni and Jiang, Huaizu and Singh, Ashish and Prasad, Aditya and Chakraborty, Deep and Learned-Miller, Erik | N/A | |
| A Deeply-initialized Coarse-to-fine Ensemble of Regression Trees for Face Alignment | Valle, Roberto and Buenaposada, Jose M. and Valdes, Antonio and Baumela, Luis | N/A | |
| Transferring GANs: generating images from limited data | Wang, Yaxing and Wu, Chenshen and Herranz, Luis and van de Weijer, Joost and Gonzalez-Garcia, Abel and Raducanu, Bogdan | N/A | |
| Cross-Modal Ranking with Soft Consistency and Noisy Labels for Robust RGB-T Tracking | Li, Chenglong and Zhu, Chengli and Huang, Yan and Tang, Jin and Wang, Liang | N/A | |
| Broadcasting Convolutional Network for Visual Relational Reasoning | Chang, Simyung and Yang, John and Park, SeongUk and Kwak, Nojun | N/A | |
| DF-Net: Unsupervised Joint Learning of Depth and Flow using Cross-Task Consistency | Zou, Yuliang and Luo, Zelun and Huang, Jia-Bin | N/A | |
| K-convexity shape priors for segmentation | Isack, Hossam and Gorelick, Lena and Ng, Karin and Veksler, Olga and Boykov, Yuri | N/A | |
| Fine-Grained Visual Categorization using Meta-Learning Optimization with Sample Selection of Auxiliary Data | Zhang, Yabin and Tang, Hui and Jia, Kui | N/A | |
| Hierarchical Bilinear Pooling for Fine-Grained Visual Recognition | Yu, Chaojian and Zhao, Xinyi and Zheng, Qi and Zhang, Peng and You, Xinge | N/A | |
| Unpaired Image Captioning by Language Pivoting | Gu, Jiuxiang and Joty, Shafiq and Cai, Jianfei and Wang, Gang | N/A | |
| Face De-Spoofing: Anti-Spoofing via Noise Modeling | Jourabloo, Amin and Liu, Yaojie and Liu, Xiaoming | N/A | |
| Unsupervised Geometry-Aware Representation for 3D Human Pose Estimation | Rhodin, Helge and Salzmann, Mathieu and Fua, Pascal | N/A | |
| Comparator Networks | Xie, Weidi and Shen, Li and Zisserman, Andrew | N/A | |
| Quaternion Convolutional Neural Networks | Zhu, Xuanyu and Xu, Yi and Xu, Hongteng and Chen, Changjian | N/A | |
| Learning Priors for Semantic 3D Reconstruction | Cherabier, Ian and Schonberger, Johannes L. and Oswald, Martin R. and Pollefeys, Marc and Geiger, Andreas | N/A | |
| Joint Map and Symmetry Synchronization | Sun, Yifan and Liang, Zhenxiao and Huang, Xiangru and Huang, Qixing | N/A | |
| Start, Follow, Read: End-to-End Full-Page Handwriting Recognition | Wigington, Curtis and Tensmeyer, Chris and Davis, Brian and Barrett, William and Price, Brian and Cohen, Scott | N/A | |
| Reverse Attention for Salient Object Detection | Chen, Shuhan and Tan, Xiuli and Wang, Ben and Hu, Xuelong | N/A | |
| TextSnake: A Flexible Representation for Detecting Text of Arbitrary Shapes | Long, Shangbang and Ruan, Jiaqiang and Zhang, Wenjie and He, Xin and Wu, Wenhao and Yao, Cong | N/A | |
| Linear Span Network for Object Skeleton Detection | Liu, Chang and Ke, Wei and Qin, Fei and Ye, Qixiang | N/A | |
| Efficient Relative Attribute Learning using Graph Neural Networks | Meng, Zihang and Adluru, Nagesh and Kim, Hyunwoo J. and Fung, Glenn and Singh, Vikas | N/A | |
| Model-free Consensus Maximization for Non-Rigid Shapes | Probst, Thomas and Chhatkuli, Ajad and Pani Paudel, Danda and Van Gool, Luc | N/A | |
| U-PC: Unsupervised Planogram Compliance | Ray, Archan and Kumar, Nishant and Shaw, Avishek and Prasad Mukherjee, Dipti | N/A | |
| Predicting Future Instance Segmentation by Forecasting Convolutional Features | Luc, Pauline and Couprie, Camille and LeCun, Yann and Verbeek, Jakob | N/A | |
| Person Search by Multi-Scale Matching | Lan, Xu and Zhu, Xiatian and Gong, Shaogang | N/A | |
| Flow-Grounded Spatial-Temporal Video Prediction from Still Images | Li, Yijun and Fang, Chen and Yang, Jimei and Wang, Zhaowen and Lu, Xin and Yang, Ming-Hsuan | N/A | |
| Liquid Pouring Monitoring via Rich Sensory Inputs | Wu, Tz-Ying and Lin, Juan-Ting and Wang, Tsun-Hsuang and Hu, Chan-Wei and Niebles, Juan Carlos and Sun, Min | N/A | |
| Exploiting temporal information for 3D human pose estimation | Rayat Imtiaz Hossain, Mir and Little, James J. | N/A | |
| Unsupervised CNN-based Co-Saliency Detection with Graphical Optimization | Hsu, Kuang-Jui and Tsai, Chung-Chi and Lin, Yen-Yu and Qian, Xiaoning and Chuang, Yung-Yu | N/A | |
| Localization Recall Precision (LRP): A New Performance Metric for Object Detection | Oksuz, Kemal and Can Cam, Baris and Akbas, Emre and Kalkan, Sinan | N/A | |
| Attentive Semantic Alignment with Offset-Aware Correlation Kernels | Hongsuck Seo, Paul and Lee, Jongmin and Jung, Deunsol and Han, Bohyung and Cho, Minsu | N/A | |
| Learning 3D Human Pose from Structure and Motion | Dabral, Rishabh and Mundhada, Anurag and Kusupati, Uday and Afaque, Safeer and Sharma, Abhishek and Jain, Arjun | N/A | |
| ForestHash: Semantic Hashing With Shallow Random Forests and Tiny Convolutional Networks | Qiu, Qiang and Lezama, Jose and Bronstein, Alex and Sapiro, Guillermo | N/A | |
| Online Detection of Action Start in Untrimmed, Streaming Videos | Shou, Zheng and Pan, Junting and Chan, Jonathan and Miyazawa, Kazuyuki and Mansour, Hassan and Vetro, Anthony and Giro-i-Nieto, Xavier and Chang, Shih-Fu | N/A | |
| Exploring the Limits of Weakly Supervised Pretraining | Mahajan, Dhruv and Girshick, Ross and Ramanathan, Vignesh and He, Kaiming and Paluri, Manohar and Li, Yixuan and Bharambe, Ashwin and van der Maaten, Laurens | N/A | |
| Revisiting RCNN: On Awakening the Classification Power of Faster RCNN | Cheng, Bowen and Wei, Yunchao and Shi, Honghui and Feris, Rogerio and Xiong, Jinjun and Huang, Thomas | N/A | |
| HandMap: Robust Hand Pose Estimation via Intermediate Dense Guidance Map Supervision | Wu, Xiaokun and Finnegan, Daniel and O'Neill, Eamonn and Yang, Yong-Liang | N/A | |
| Unsupervised Learning of Multi-Frame Optical Flow with Occlusions | Janai, Joel and Guney, Fatma and Ranjan, Anurag and Black, Michael and Geiger, Andreas | N/A | |
| Integrating Egocentric Videos in Top-view Surveillance Videos: Joint Identification and Temporal Alignment | Ardeshir, Shervin and Borji, Ali | N/A | |
| Attribute-Guided Face Generation Using Conditional CycleGAN | Lu, Yongyi and Tai, Yu-Wing and Tang, Chi-Keung | N/A | |
| Distortion-Aware Convolutional Filters for Dense Prediction in Panoramic Images | Tateno, Keisuke and Navab, Nassir and Tombari, Federico | N/A | |
| Joint Camera Spectral Sensitivity Selection and Hyperspectral Image Recovery | Fu, Ying and Zhang, Tao and Zheng, Yinqiang and Zhang, Debing and Huang, Hua | N/A | |
| Monocular Depth Estimation Using Whole Strip Masking and Reliability-Based Refinement | Heo, Minhyeok and Lee, Jaehan and Kim, Kyung-Rae and Kim, Han-Ul and Kim, Chang-Su | N/A | |
| Analyzing Clothing Layer Deformation Statistics of 3D Human Motions | Yang, Jinlong and Franco, Jean-Sebastien and Hetroy-Wheeler, Franck and Wuhrer, Stefanie | N/A | |
| Image Super-Resolution Using Very Deep Residual Channel Attention Networks | Zhang, Yulun and Li, Kunpeng and Li, Kai and Wang, Lichen and Zhong, Bineng and Fu, Yun | N/A | |
| Semi-Supervised Generative Adversarial Hashing for Image Retrieval | Wang, Guan'an and Hu, Qinghao and Cheng, Jian and Hou, Zengguang | N/A | |
| Learning Single-View 3D Reconstruction with Limited Pose Supervision | Yang, Guandao and Cui, Yin and Belongie, Serge and Hariharan, Bharath | N/A | |
| Materials for Masses: SVBRDF Acquisition with a Single Mobile Phone Image | Li, Zhengqin and Sunkavalli, Kalyan and Chandraker, Manmohan | N/A | |
| Multi-Scale Spatially-Asymmetric Recalibration for Image Classification | Wang, Yan and Xie, Lingxi and Qiao, Siyuan and Zhang, Ya and Zhang, Wenjun and Yuille, Alan L. | N/A | |
| Graph Adaptive Knowledge Transfer for Unsupervised Domain Adaptation | Ding, Zhengming and Li, Sheng and Shao, Ming and Fu, Yun | N/A | |
| Improving Sequential Determinantal Point Processes for Supervised Video Summarization | Sharghi, Aidean and Borji, Ali and Li, Chengtao and Yang, Tianbao and Gong, Boqing | N/A | |
| Specular-to-Diffuse Translation for Multi-View Reconstruction | Wu, Shihao and Huang, Hui and Portenier, Tiziano and Sela, Matan and Cohen-Or, Daniel and Kimmel, Ron and Zwicker, Matthias | N/A | |
| RESOUND: Towards Action Recognition without Representation Bias | Li, Yingwei and Li, Yi and Vasconcelos, Nuno | N/A | |
| A Framework for Evaluating 6-DOF Object Trackers | Garon, Mathieu and Laurendeau, Denis and Lalonde, Jean-Francois | N/A | |
| Extending Layered Models to 3D Motion | Lao, Dong and Sundaramoorthi, Ganesh | N/A | |
| Long-term Tracking in the Wild: a Benchmark | Valmadre, Jack and Bertinetto, Luca and Henriques, Joao F. and Tao, Ran and Vedaldi, Andrea and Smeulders, Arnold W.M. and Torr, Philip H.S. and Gavves, Efstratios | N/A | |
| Human Motion Analysis with Deep Metric Learning | Coskun, Huseyin and Joseph Tan, David and Conjeti, Sailesh and Navab, Nassir and Tombari, Federico | N/A | |
| Adaptive Affinity Fields for Semantic Segmentation | Ke, Tsung-Wei and Hwang, Jyh-Jing and Liu, Ziwei and Yu, Stella X. | N/A | |
| Hierarchy of Alternating Specialists for Scene Recognition | Jin Kim, Hyo and Frahm, Jan-Michael | N/A | |
| Multi-Scale Structure-Aware Network for Human Pose Estimation | Ke, Lipeng and Chang, Ming-Ching and Qi, Honggang and Lyu, Siwei | N/A | |
| License Plate Detection and Recognition in Unconstrained Scenarios | Montazzolli Silva, Sergio and Rosito Jung, Claudio | N/A | |
| Where Will They Go? Predicting Fine-Grained Adversarial Multi-Agent Motion using Conditional Variational Autoencoders | Felsen, Panna and Lucey, Patrick and Ganguly, Sujoy | N/A | |
| Deep Multi-Task Learning to Recognise Subtle Facial Expressions of Mental States | Hu, Guosheng and Liu, Li and Yuan, Yang and Yu, Zehao and Hua, Yang and Zhang, Zhihong and Shen, Fumin and Shao, Ling and Hospedales, Timothy and Robertson, Neil and Yang, Yongxin | N/A | |
| PlaneMatch: Patch Coplanarity Prediction for Robust RGB-D Reconstruction | Shi, Yifei and Xu, Kai and Niessner, Matthias and Rusinkiewicz, Szymon and Funkhouser, Thomas | N/A | |
| PPF-FoldNet: Unsupervised Learning of Rotation Invariant 3D Local Descriptors | Deng, Haowen and Birdal, Tolga and Ilic, Slobodan | N/A | |
| HBE: Hand Branch Ensemble Network for Real-time 3D Hand Pose Estimation | Zhou, Yidan and Lu, Jian and Du, Kuo and Lin, Xiangbo and Sun, Yi and Ma, Xiaohong | N/A | |
| ShapeStacks: Learning Vision-Based Physical Intuition for Generalised Object Stacking | Groth, Oliver and Fuchs, Fabian B. and Posner, Ingmar and Vedaldi, Andrea | N/A | |
| Grassmann Pooling as Compact Homogeneous Bilinear Pooling for Fine-Grained Visual Classification | Wei, Xing and Zhang, Yue and Gong, Yihong and Zhang, Jiawei and Zheng, Nanning | N/A | |
| Deep Generative Models for Weakly-Supervised Multi-Label Classification | Chu, Hong-Min and Yeh, Chih-Kuan and Frank Wang, Yu-Chiang | N/A | |
| SRDA: Generating Instance Segmentation Annotation via Scanning, Reasoning and Domain Adaptation | Xu, Wenqiang and Li, Yonglu and Lu, Cewu | N/A | |
| MPLP++: Fast, Parallel Dual Block-Coordinate Ascent for Dense Graphical Models | Tourani, Siddharth and Shekhovtsov, Alexander and Rother, Carsten and Savchynskyy, Bogdan | N/A | |
| Unsupervised Video Object Segmentation using Motion Saliency-Guided Spatio-Temporal Propagation | Hu, Yuan-Ting and Huang, Jia-Bin and Schwing, Alexander G. | N/A | |
| Semi-Supervised Deep Learning with Memory | Chen, Yanbei and Zhu, Xiatian and Gong, Shaogang | N/A | |
| Deep Reinforcement Learning with Iterative Shift for Visual Tracking | Ren, Liangliang and Yuan, Xin and Lu, Jiwen and Yang, Ming and Zhou, Jie | N/A | |
| X2Face: A network for controlling face generation using images, audio, and pose codes | Wiles, Olivia and Sophia Koepke, A. and Zisserman, Andrew | N/A | |
| Correcting the Triplet Selection Bias for Triplet Loss | Yu, Baosheng and Liu, Tongliang and Gong, Mingming and Ding, Changxing and Tao, Dacheng | N/A | |
| Women also Snowboard: Overcoming Bias in Captioning Models | Anne Hendricks, Lisa and Burns, Kaylee and Saenko, Kate and Darrell, Trevor and Rohrbach, Anna | N/A | |
| GAL: Geometric Adversarial Loss for Single-View 3D-Object Reconstruction | Jiang, Li and Shi, Shaoshuai and Qi, Xiaojuan and Jia, Jiaya | N/A | |
| Contextual-based Image Inpainting: Infer, Match, and Translate | Song, Yuhang and Yang, Chao and Lin, Zhe and Liu, Xiaofeng and Huang, Qin and Li, Hao and Jay Kuo, C.-C. | N/A | |
| Inner Space Preserving Generative Pose Machine | Liu, Shuangjun and Ostadabbas, Sarah | N/A | |
| SOD-MTGAN: Small Object Detection via Multi-Task Generative Adversarial Network | Bai, Yancheng and Zhang, Yongqiang and Ding, Mingli and Ghanem, Bernard | N/A | |
| Dependency-aware Attention Control for Unconstrained Face Recognition with Image Sets | Liu, Xiaofeng and Vijaya Kumar, B.V.K and Yang, Chao and Tang, Qingming and You, Jane | N/A | |
| End-to-end View Synthesis for Light Field Imaging with Pseudo 4DCNN | Wang, Yunlong and Liu, Fei and Wang, Zilei and Hou, Guangqi and Sun, Zhenan and Tan, Tieniu | N/A | |
| Iterative Crowd Counting | Ranjan, Viresh and Le, Hieu and Hoai, Minh | N/A | |
| DeepPhys: Video-Based Physiological Measurement Using Convolutional Attention Networks | Chen, Weixuan and McDuff, Daniel | N/A | |
| On the Solvability of Viewing Graphs | Trager, Matthew and Osserman, Brian and Ponce, Jean | N/A | |
| A Systematic DNN Weight Pruning Framework using Alternating Direction Method of Multipliers | Zhang, Tianyun and Ye, Shaokai and Zhang, Kaiqi and Tang, Jian and Wen, Wujie and Fardad, Makan and Wang, Yanzhi | N/A | |
| Multimodal Dual Attention Memory for Video Story Question Answering | Kim, Kyung-Min and Choi, Seong-Ho and Kim, Jin-Hwa and Zhang, Byoung-Tak | N/A | |
| SAN: Learning Relationship between Convolutional Features for Multi-Scale Object Detection | Kim, Yonghyun and Kang, Bong-Nam and Kim, Daijin | N/A | |
| Single Shot Scene Text Retrieval | Gomez, Lluis and Mafla, Andres and Rusinol, Marcal and Karatzas, Dimosthenis | N/A | |
| Dynamic Task Prioritization for Multitask Learning | Guo, Michelle and Haque, Albert and Huang, De-An and Yeung, Serena and Fei-Fei, Li | N/A | |
| Self-supervised Knowledge Distillation Using Singular Value Decomposition | Hyun Lee, Seung and Ha Kim, Dae and Cheol Song, Byung | N/A | |
| Transductive Centroid Projection for Semi-supervised Large-scale Recognition | Liu, Yu and Song, Guanglu and Shao, Jing and Jin, Xiao and Wang, Xiaogang | N/A | |
| Deep Shape Matching | Radenovic, Filip and Tolias, Giorgos and Chum, Ondrej | N/A | |
| Fast, Accurate, and Lightweight Super-Resolution with Cascading Residual Network | Ahn, Namhyuk and Kang, Byungkon and Sohn, Kyung-Ah | N/A | |
| CIRL: Controllable Imitative Reinforcement Learning for Vision-based Self-driving | Liang, Xiaodan and Wang, Tairui and Yang, Luona and Xing, Eric | N/A | |
| EC-Net: an Edge-aware Point set Consolidation Network | Yu, Lequan and Li, Xianzhi and Fu, Chi-Wing and Cohen-Or, Daniel and Heng, Pheng-Ann | N/A | |
| Part-Activated Deep Reinforcement Learning for Action Prediction | Chen, Lei and Lu, Jiwen and Song, Zhanjie and Zhou, Jie | N/A | |
| Learning to Navigate for Fine-grained Classification | Yang, Ze and Luo, Tiange and Wang, Dong and Hu, Zhiqiang and Gao, Jun and Wang, Liwei | N/A | |
| Single Image Highlight Removal with a Sparse and Low-Rank Reflection Model | Guo, Jie and Zhou, Zuojian and Wang, Limin | N/A | |
| Improving Shape Deformation in Unsupervised Image-to-Image Translation | Gokaslan, Aaron and Ramanujan, Vivek and Ritchie, Daniel and In Kim, Kwang and Tompkin, James | N/A | |
| Scalable Exemplar-based Subspace Clustering on Class-Imbalanced Data | You, Chong and Li, Chi and Robinson, Daniel P. and Vidal, Rene | N/A | |
| 3D Ego-Pose Estimation via Imitation Learning | Yuan, Ye and Kitani, Kris | N/A | |
| Visual Coreference Resolution in Visual Dialog using Neural Module Networks | Kottur, Satwik and Moura, Jose M. F. and Parikh, Devi and Batra, Dhruv and Rohrbach, Marcus | N/A | |
| LSQ++: Lower running time and higher recall in multi-codebook quantization | Martinez, Julieta and Zakhmi, Shobhit and Hoos, Holger H. and Little, James J. | N/A | |
| A Hybrid Model for Identity Obfuscation by Face Replacement | Sun, Qianru and Tewari, Ayush and Xu, Weipeng and Fritz, Mario and Theobalt, Christian and Schiele, Bernt | N/A | |
| Depth-aware CNN for RGB-D Segmentation | Wang, Weiyue and Neumann, Ulrich | N/A | |
| BiSeNet: Bilateral Segmentation Network for Real-time Semantic Segmentation | Yu, Changqian and Wang, Jingbo and Peng, Chao and Gao, Changxin and Yu, Gang and Sang, Nong | N/A | |
| ActiveStereoNet: End-to-End Self-Supervised Learning for Active Stereo Systems | Zhang, Yinda and Khamis, Sameh and Rhemann, Christoph and Valentin, Julien and Kowdle, Adarsh and Tankovich, Vladimir and Schoenberg, Michael and Izadi, Shahram and Funkhouser, Thomas and Fanello, Sean | N/A | |
| Weakly- and Semi-Supervised Panoptic Segmentation | Li, Qizhu and Arnab, Anurag and Torr, Philip H.S. | N/A | |
| Selfie Video Stabilization | Yu, Jiyang and Ramamoorthi, Ravi | N/A | |
| Double JPEG Detection in Mixed JPEG Quality Factors using Deep Convolutional Neural Network | Park, Jinseok and Cho, Donghyeon and Ahn, Wonhyuk and Lee, Heung-Kyu | N/A | |
| Incremental Multi-graph Matching via Diversity and Randomness based Graph Clustering | Yu, Tianshu and Yan, Junchi and Liu, Wei and Li, Baoxin | N/A | |
| DeepTAM: Deep Tracking and Mapping | Zhou, Huizhong and Ummenhofer, Benjamin and Brox, Thomas | N/A | |
| R2P2: A ReparameteRized Pushforward Policy for Diverse, Precise Generative Path Forecasting | Rhinehart, Nicholas and Kitani, Kris M. and Vernaza, Paul | N/A | |
| SpiderCNN: Deep Learning on Point Sets with Parameterized Convolutional Filters | Xu, Yifan and Fan, Tianqi and Xu, Mingye and Zeng, Long and Qiao, Yu | N/A | |
| CurriculumNet: Weakly Supervised Learning from Large-Scale Web Images | Guo, Sheng and Huang, Weilin and Zhang, Haozhi and Zhuang, Chenfan and Dong, Dengke and Scott, Matthew R. and Huang, Dinglong | N/A | |
| Occlusions, Motion and Depth Boundaries with a Generic Network for Disparity, Optical Flow or Scene Flow Estimation | Ilg, Eddy and Saikia, Tonmoy and Keuper, Margret and Brox, Thomas | N/A | |
| Quantization Mimic: Towards Very Tiny CNN for Object Detection | Wei, Yi and Pan, Xinyu and Qin, Hongwei and Ouyang, Wanli and Yan, Junjie | N/A | |
| Learning Rigidity in Dynamic Scenes with a Moving Camera for 3D Motion Field Estimation | Lv, Zhaoyang and Kim, Kihwan and Troccoli, Alejandro and Sun, Deqing and Rehg, James M. and Kautz, Jan | N/A | |
| Proximal Dehaze-Net: A Prior Learning-Based Deep Network for Single Image Dehazing | Yang, Dong and Sun, Jian | N/A | |
| Textual Explanations for Self-Driving Vehicles | Kim, Jinkyu and Rohrbach, Anna and Darrell, Trevor and Canny, John and Akata, Zeynep | N/A | |
| Focus, Segment and Erase: An Efficient Network for Multi-Label Brain Tumor Segmentation | Chen, Xuan and Hao Liew, Jun and Xiong, Wei and Chui, Chee-Kong and Ong, Sim-Heng | N/A | |
| Local Orthogonal-Group Testing | Iscen, Ahmet and Chum, Ondrej | N/A | |
| Connecting Gaze, Scene, and Attention: Generalized Attention Estimation via Joint Modeling of Gaze and Scene Saliency | Chong, Eunji and Ruiz, Nataniel and Wang, Yongxin and Zhang, Yun and Rozga, Agata and Rehg, James M. | N/A | |
| Show, Tell and Discriminate: Image Captioning by Self-retrieval with Partially Labeled Data | Liu, Xihui and Li, Hongsheng and Shao, Jing and Chen, Dapeng and Wang, Xiaogang | N/A | |
| VideoMatch: Matching based Video Object Segmentation | Hu, Yuan-Ting and Huang, Jia-Bin and Schwing, Alexander G. | N/A | |
| Unsupervised Video Object Segmentation with Motion-based Bilateral Networks | Li, Siyang and Seybold, Bryan and Vorobyov, Alexey and Lei, Xuejing and Jay Kuo, C.-C. | N/A | |
| 3D Vehicle Trajectory Reconstruction in Monocular Video Data Using Environment Structure Constraints | Bullinger, Sebastian and Bodensteiner, Christoph and Arens, Michael and Stiefelhagen, Rainer | N/A | |
| Shuffle-Then-Assemble: Learning Object-Agnostic Visual Relationship Features | Yang, Xu and Zhang, Hanwang and Cai, Jianfei | N/A | |
| Revisiting the Inverted Indices for Billion-Scale Approximate Nearest Neighbors | Baranchuk, Dmitry and Babenko, Artem and Malkov, Yury | N/A | |
| Eliminating the Blind Spot: Adapting 3D Object Detection and Monocular Depth Estimation to 360° Panoramic Imagery | Payen de La Garanderie, Gregoire and Atapour Abarghouei, Amir and Breckon, Toby P. | N/A | |
| Towards Realistic Predictors | Wang, Pei and Vasconcelos, Nuno | N/A | |
| Learning Deep Representations with Probabilistic Knowledge Transfer | Passalis, Nikolaos and Tefas, Anastasios | N/A | |
| DFT-based Transformation Invariant Pooling Layer for Visual Classification | Ryu, Jongbin and Yang, Ming-Hsuan and Lim, Jongwoo | N/A | |
| Objects that Sound | Arandjelovic, Relja and Zisserman, Andrew | N/A | |
| End-to-End Incremental Learning | Castro, Francisco M. and Marin-Jimenez, Manuel J. and Guil, Nicolas and Schmid, Cordelia and Alahari, Karteek | N/A | |
| SaaS: Speed as a Supervisor for Semi-supervised Learning | Cicek, Safa and Fawzi, Alhussein and Soatto, Stefano | N/A | |
| Deep Video Quality Assessor: From Spatio-temporal Visual Sensitivity to A Convolutional Neural Aggregation Network | Kim, Woojae and Kim, Jongyoo and Ahn, Sewoong and Kim, Jinwoo and Lee, Sanghoon | N/A | |
| Straight to the Facts: Learning Knowledge Base Retrieval for Factual Visual Question Answering | Narasimhan, Medhini and Schwing, Alexander G. | N/A | |
| Deep Volumetric Video From Very Sparse Multi-View Performance Capture | Huang, Zeng and Li, Tianye and Chen, Weikai and Zhao, Yajie and Xing, Jun and LeGendre, Chloe and Luo, Linjie and Ma, Chongyang and Li, Hao | N/A | |
| Neural Procedural Reconstruction for Residential Buildings | Zeng, Huayi and Wu, Jiaye and Furukawa, Yasutaka | N/A | |
| Deformable Pose Traversal Convolution for 3D Action and Gesture Recognition | Weng, Junwu and Liu, Mengyuan and Jiang, Xudong and Yuan, Junsong | N/A | |
| A Trilateral Weighted Sparse Coding Scheme for Real-World Image Denoising | Xu, Jun and Zhang, Lei and Zhang, David | N/A | |
| Orthogonal Deep Features Decomposition for Age-Invariant Face Recognition | Wang, Yitong and Gong, Dihong and Zhou, Zheng and Ji, Xing and Wang, Hao and Li, Zhifeng and Liu, Wei and Zhang, Tong | N/A | |
| Pyramid Dilated Deeper ConvLSTM for Video Salient Object Detection | Song, Hongmei and Wang, Wenguan and Zhao, Sanyuan and Shen, Jianbing and Lam, Kin-Man | N/A | |
| Deep Burst Denoising | Godard, Clement and Matzen, Kevin and Uyttendaele, Matt | N/A | |
| Learning to Separate Object Sounds by Watching Unlabeled Video | Gao, Ruohan and Feris, Rogerio and Grauman, Kristen | N/A | |
| Learnable PINs: Cross-Modal Embeddings for Person Identity | Nagrani, Arsha and Albanie, Samuel and Zisserman, Andrew | N/A | |
| Multi-object Tracking with Neural Gating Using Bilinear LSTM | Kim, Chanho and Li, Fuxin and Rehg, James M. | N/A | |
| Recovering Accurate 3D Human Pose in The Wild Using IMUs and a Moving Camera | von Marcard, Timo and Henschel, Roberto and Black, Michael J. and Rosenhahn, Bodo and Pons-Moll, Gerard | N/A | |
| Model Adaptation with Synthetic and Real Data for Semantic Dense Foggy Scene Understanding | Sakaridis, Christos and Dai, Dengxin and Hecker, Simon and Van Gool, Luc | N/A | |
| NetAdapt: Platform-Aware Neural Network Adaptation for Mobile Applications | Yang, Tien-Ju and Howard, Andrew and Chen, Bo and Zhang, Xiao and Go, Alec and Sandler, Mark and Sze, Vivienne and Adam, Hartwig | N/A | |
| MT-VAE: Learning Motion Transformations to Generate Multimodal Human Dynamics | Yan, Xinchen and Rastogi, Akash and Villegas, Ruben and Sunkavalli, Kalyan and Shechtman, Eli and Hadap, Sunil and Yumer, Ersin and Lee, Honglak | N/A | |
| Affine Correspondences between Central Cameras for Rapid Relative Pose Estimation | Eichhardt, Ivan and Chetverikov, Dmitry | N/A | |
| Lifting Layers: Analysis and Applications | Ochs, Peter and Meinhardt, Tim and Leal-Taixe, Laura and Moeller, Michael | N/A | |
| Remote Photoplethysmography Correspondence Feature for 3D Mask Face Presentation Attack Detection | Liu, Si-Qi and Lan, Xiangyuan and Yuen, Pong C. | N/A | |
| Beyond Part Models: Person Retrieval with Refined Part Pooling (and A Strong Convolutional Baseline) | Sun, Yifan and Zheng, Liang and Yang, Yi and Tian, Qi and Wang, Shengjin | N/A | |
| Generative Adversarial Network with Spatial Attention for Face Attribute Editing | Zhang, Gang and Kan, Meina and Shan, Shiguang and Chen, Xilin | N/A | |
| Pairwise Body-Part Attention for Recognizing Human-Object Interactions | Fang, Hao-Shu and Cao, Jinkun and Tai, Yu-Wing and Lu, Cewu | N/A | |
| Person Search via A Mask-guided Two-stream CNN Model | Chen, Di and Zhang, Shanshan and Ouyang, Wanli and Yang, Jian and Tai, Ying | N/A | |
| Saliency Benchmarking Made Easy: Separating Models, Maps and Metrics | Kummerer, Matthias and Wallis, Thomas S. A. and Bethge, Matthias | N/A | |
| Sub-GAN: An Unsupervised Generative Model via Subspaces | Liang, Jie and Yang, Jufeng and Lee, Hsin-Ying and Wang, Kai and Yang, Ming-Hsuan | N/A | |
| Improving Spatiotemporal Self-Supervision by Deep Reinforcement Learning | Buchler, Uta and Brattoli, Biagio and Ommer, Bjorn | N/A | |
| The Unmanned Aerial Vehicle Benchmark: Object Detection and Tracking | Du, Dawei and Qi, Yuankai and Yu, Hongyang and Yang, Yifan and Duan, Kaiwen and Li, Guorong and Zhang, Weigang and Huang, Qingming and Tian, Qi | N/A | |
| Beyond local reasoning for stereo confidence estimation with deep learning | Tosi, Fabio and Poggi, Matteo and Benincasa, Antonio and Mattoccia, Stefano | N/A | |
| ConvNets and ImageNet Beyond Accuracy: Understanding Mistakes and Uncovering Biases | Stock, Pierre and Cisse, Moustapha | N/A | |
| Making Deep Heatmaps Robust to Partial Occlusions for 3D Object Pose Estimation | Oberweger, Markus and Rad, Mahdi and Lepetit, Vincent | N/A | |
| CBAM: Convolutional Block Attention Module | Woo, Sanghyun and Park, Jongchan and Lee, Joon-Young and So Kweon, In | N/A | |
| Spatio-temporal Transformer Network for Video Restoration | Hyun Kim, Tae and Sajjadi, Mehdi S. M. and Hirsch, Michael and Scholkopf, Bernhard | N/A | |
| stagNet: An Attentive Semantic RNN for Group Activity Recognition | Qi, Mengshi and Qin, Jie and Li, Annan and Wang, Yunhong and Luo, Jiebo and Van Gool, Luc | N/A | |
| Learning Discriminative Video Representations Using Adversarial Perturbations | Wang, Jue and Cherian, Anoop | N/A | |
| On Offline Evaluation of Vision-based Driving Models | Codevilla, Felipe and Lopez, Antonio M. and Koltun, Vladlen and Dosovitskiy, Alexey | N/A | |
| Real-to-Virtual Domain Unification for End-to-End Autonomous Driving | Yang, Luona and Liang, Xiaodan and Wang, Tairui and Xing, Eric | N/A | |
| How Local is the Local Diversity? Reinforcing Sequential Determinantal Point Processes with Dynamic Ground Sets for Supervised Video Summarization | Li, Yandong and Wang, Liqiang and Yang, Tianbao and Gong, Boqing | N/A | |
| Piggyback: Adapting a Single Network to Multiple Tasks by Learning to Mask Weights | Mallya, Arun and Davis, Dillon and Lazebnik, Svetlana | N/A | |
| PSANet: Point-wise Spatial Attention Network for Scene Parsing | Zhao, Hengshuang and Zhang, Yi and Liu, Shu and Shi, Jianping and Change Loy, Chen and Lin, Dahua and Jia, Jiaya | N/A | |
| X-ray Computed Tomography Through Scatter | Geva, Adam and Schechner, Yoav Y. and Chernyak, Yonatan and Gupta, Rajiv | N/A | |
| Image Generation from Sketch Constraint Using Contextual GAN | Lu, Yongyi and Wu, Shangzhe and Tai, Yu-Wing and Tang, Chi-Keung | N/A | |
| Weakly-supervised 3D Hand Pose Estimation from Monocular RGB Images | Cai, Yujun and Ge, Liuhao and Cai, Jianfei and Yuan, Junsong | N/A | |
| SkipNet: Learning Dynamic Routing in Convolutional Networks | Wang, Xin and Yu, Fisher and Dou, Zi-Yi and Darrell, Trevor and Gonzalez, Joseph E. | N/A | |
| Point-to-Point Regression PointNet for 3D Hand Pose Estimation | Ge, Liuhao and Ren, Zhou and Yuan, Junsong | N/A | |
| Deeply Learned Compositional Models for Human Pose Estimation | Tang, Wei and Yu, Pei and Wu, Ying | N/A | |
| Compound Memory Networks for Few-shot Video Classification | Zhu, Linchao and Yang, Yi | N/A | |
| 3D Recurrent Neural Networks with Context Fusion for Point Cloud Semantic Segmentation | Ye, Xiaoqing and Li, Jiamao and Huang, Hexiao and Du, Liang and Zhang, Xiaolin | N/A | |
| Unsupervised Person Re-identification by Deep Learning Tracklet Association | Li, Minxian and Zhu, Xiatian and Gong, Shaogang | N/A | |
| Deep Boosting for Image Denoising | Chen, Chang and Xiong, Zhiwei and Tian, Xinmei and Wu, Feng | N/A | |
| The Contextual Loss for Image Transformation with Non-Aligned Data | Mechrez, Roey and Talmi, Itamar and Zelnik-Manor, Lihi | N/A | |
| Actor-centric Relation Network | Sun, Chen and Shrivastava, Abhinav and Vondrick, Carl and Murphy, Kevin and Sukthankar, Rahul and Schmid, Cordelia | N/A | |
| Fully-Convolutional Point Networks for Large-Scale Point Clouds | Rethage, Dario and Wald, Johanna and Sturm, Jurgen and Navab, Nassir and Tombari, Federico | N/A | |
| Joint optimization for compressive video sensing and reconstruction under hardware constraints | Yoshida, Michitaka and Torii, Akihiko and Okutomi, Masatoshi and Endo, Kenta and Sugiyama, Yukinobu and Taniguchi, Rin-ichiro and Nagahara, Hajime | N/A | |
| Improved Structure from Motion Using Fiducial Marker Matching | DeGol, Joseph and Bretl, Timothy and Hoiem, Derek | N/A | |
| Deep Autoencoder for Combined Human Pose Estimation and Body Model Upscaling | Trumble, Matthew and Gilbert, Andrew and Hilton, Adrian and Collomosse, John | N/A | |
| Integral Human Pose Regression | Sun, Xiao and Xiao, Bin and Wei, Fangyin and Liang, Shuang and Wei, Yichen | N/A | |
| Convolutional Networks with Adaptive Inference Graphs | Veit, Andreas and Belongie, Serge | N/A | |
| A Dataset and Architecture for Visual Reasoning with a Working Memory | Robert Yang, Guangyu and Ganichev, Igor and Wang, Xiao-Jing and Shlens, Jonathon and Sussillo, David | N/A | |
| Video Compression through Image Interpolation | Wu, Chao-Yuan and Singhal, Nayan and Krahenbuhl, Philipp | N/A | |
| Composition Loss for Counting, Density Map Estimation and Localization in Dense Crowds | Idrees, Haroon and Tayyab, Muhmmad and Athrey, Kishan and Zhang, Dong and Al-Maadeed, Somaya and Rajpoot, Nasir and Shah, Mubarak | N/A | |
| Affinity Derivation and Graph Merge for Instance Segmentation | Liu, Yiding and Yang, Siyu and Li, Bin and Zhou, Wengang and Xu, Jizheng and Li, Houqiang and Lu, Yan | N/A | |
| Progressive Structure from Motion | Locher, Alex and Havlena, Michal and Van Gool, Luc | N/A | |
| MultiPoseNet: Fast Multi-Person Pose Estimation using Pose Residual Network | Kocabas, Muhammed and Karagoz, Salih and Akbas, Emre | N/A | |
| Self-Calibrating Isometric Non-Rigid Structure-from-Motion | Parashar, Shaifali and Bartoli, Adrien and Pizarro, Daniel | N/A | |
| Using Object Information for Spotting Text | Prasad, Shitala and Wai Kin Kong, Adams | N/A | |
| Modality Distillation with Multiple Stream Networks for Action Recognition | Garcia, Nuno C. and Morerio, Pietro and Murino, Vittorio | N/A | |
| Stereo Vision-based Semantic 3D Object and Ego-motion Tracking for Autonomous Driving | Li, Peiliang and Qin, Tong and Shen, Shaojie | N/A | |
| AutoLoc: Weakly-supervised Temporal Action Localization in Untrimmed Videos | Shou, Zheng and Gao, Hang and Zhang, Lei and Miyazawa, Kazuyuki and Chang, Shih-Fu | N/A | |
| Unsupervised Domain Adaptation for 3D Keypoint Estimation via View Consistency | Zhou, Xingyi and Karpur, Arjun and Gan, Chuang and Luo, Linjie and Huang, Qixing | N/A | |
| Visual-Inertial Object Detection and Mapping | Fei, Xiaohan and Soatto, Stefano | N/A | |
| FishEyeRecNet: A Multi-Context Collaborative Deep Network for Fisheye Image Rectification | Yin, Xiaoqing and Wang, Xinchao and Yu, Jun and Zhang, Maojun and Fua, Pascal and Tao, Dacheng | N/A | |
| Semi-supervised FusedGAN for Conditional Image Generation | Bodla, Navaneeth and Hua, Gang and Chellappa, Rama | N/A | |
| Group Normalization | Wu, Yuxin and He, Kaiming | N/A | |
| Conditional Image-Text Embedding Networks | Plummer, Bryan A. and Kordas, Paige and Hadi Kiapour, M. and Zheng, Shuai and Piramuthu, Robinson and Lazebnik, Svetlana | N/A | |
| Deep Co-Training for Semi-Supervised Image Recognition | Qiao, Siyuan and Shen, Wei and Zhang, Zhishuai and Wang, Bo and Yuille, Alan | N/A | |
| Object Level Visual Reasoning in Videos | Baradel, Fabien and Neverova, Natalia and Wolf, Christian and Mille, Julien and Mori, Greg | N/A | |
| In the Eye of Beholder: Joint Learning of Gaze and Actions in First Person Video | Li, Yin and Liu, Miao and Rehg, James M. | N/A | |
| Deep Factorised Inverse-Sketching | Pang, Kaiyue and Li, Da and Song, Jifei and Song, Yi-Zhe and Xiang, Tao and Hospedales, Timothy M. | N/A | |
| A Joint Sequence Fusion Model for Video Question Answering and Retrieval | Yu, Youngjae and Kim, Jongseok and Kim, Gunhee | N/A | |
| View-graph Selection Framework for SfM | Shah, Rajvi and Chari, Visesh and J Narayanan, P | N/A | |
| Synthetically Supervised Feature Learning for Scene Text Recognition | Liu, Yang and Wang, Zhaowen and Jin, Hailin and Wassell, Ian | N/A | |
| Deep Clustering for Unsupervised Learning of Visual Features | Caron, Mathilde and Bojanowski, Piotr and Joulin, Armand and Douze, Matthijs | N/A | |
| Is Robustness the Cost of Accuracy? -- A Comprehensive Study on the Robustness of 18 Deep Image Classification Models | Su, Dong and Zhang, Huan and Chen, Hongge and Yi, Jinfeng and Chen, Pin-Yu and Gao, Yupeng | N/A | |
| Lifelong Learning via Progressive Distillation and Retrospection | Hou, Saihui and Pan, Xinyu and Change Loy, Chen and Wang, Zilei and Lin, Dahua | N/A | |
| Urban Zoning Using Higher-Order Markov Random Fields on Multi-View Imagery Data | Feng, Tian and Truong, Quang-Trung and Thanh Nguyen, Duc and Yu Koh, Jing and Yu, Lap-Fai and Binder, Alexander and Yeung, Sai-Kit | N/A | |
| Progressive Neural Architecture Search | Liu, Chenxi and Zoph, Barret and Neumann, Maxim and Shlens, Jonathon and Hua, Wei and Li, Li-Jia and Fei-Fei, Li and Yuille, Alan and Huang, Jonathan and Murphy, Kevin | N/A | |
| Single Image Water Hazard Detection using FCN with Reflection Attention Units | Han, Xiaofeng and Nguyen, Chuong and You, Shaodi and Lu, Jianfeng | N/A | |
| Predicting Gaze in Egocentric Video by Learning Task-dependent Attention Transition | Huang, Yifei and Cai, Minjie and Li, Zhenqiang and Sato, Yoichi | N/A | |
| Joint Learning of Intrinsic Images and Semantic Segmentation | Baslamisli, Anil S. and Groenestege, Thomas T. and Das, Partha and Le, Hoang-An and Karaoglu, Sezer and Gevers, Theo | N/A | |
| Towards Robust Neural Networks via Random Self-ensemble | Liu, Xuanqing and Cheng, Minhao and Zhang, Huan and Hsieh, Cho-Jui | N/A | |
| Programmable Triangulation Light Curtains | Wang, Jian and Bartels, Joseph and Whittaker, William and Sankaranarayanan, Aswin C. and Narasimhan, Srinivasa G. | N/A | |
| Find and Focus: Retrieve and Localize Video Events with Natural Language Queries | Shao, Dian and Xiong, Yu and Zhao, Yue and Huang, Qingqiu and Qiao, Yu and Lin, Dahua | N/A | |
| Rethinking the Form of Latent States in Image Captioning | Dai, Bo and Ye, Deming and Lin, Dahua | N/A | |
| CubeNet: Equivariance to 3D Rotation and Translation | Worrall, Daniel and Brostow, Gabriel | N/A | |
| DeepWrinkles: Accurate and Realistic Clothing Modeling | Lahner, Zorah and Cremers, Daniel and Tung, Tony | N/A | |
| Bidirectional Feature Pyramid Network with Recurrent Attention Residual Modules for Shadow Detection | Zhu, Lei and Deng, Zijun and Hu, Xiaowei and Fu, Chi-Wing and Xu, Xuemiao and Qin, Jing and Heng, Pheng-Ann | N/A | |
| Deep Regression Tracking with Shrinkage Loss | Lu, Xiankai and Ma, Chao and Ni, Bingbing and Yang, Xiaokang and Reid, Ian and Yang, Ming-Hsuan | N/A | |
| Super-Resolution and Sparse View CT Reconstruction | Zang, Guangming and Aly, Mohamed and Idoughi, Ramzi and Wonka, Peter and Heidrich, Wolfgang | N/A | |
| Mask TextSpotter: An End-to-End Trainable Neural Network for Spotting Text with Arbitrary Shapes | Lyu, Pengyuan and Liao, Minghui and Yao, Cong and Wu, Wenhao and Bai, Xiang | N/A | |
| Learning to Dodge A Bullet: Concyclic View Morphing via Deep Learning | Jin, Shi and Liu, Ruiyang and Ji, Yu and Ye, Jinwei and Yu, Jingyi | N/A | |
| Deterministic Consensus Maximization with Biconvex Programming | Cai, Zhipeng and Chin, Tat-Jun and Le, Huu and Suter, David | N/A | |
| Practical Black-box Attacks on Deep Neural Networks using Efficient Query Mechanisms | Nitin Bhagoji, Arjun and He, Warren and Li, Bo and Song, Dawn | N/A | |
| Propagating LSTM: 3D Pose Estimation based on Joint Interdependency | Lee, Kyoungoh and Lee, Inwoong and Lee, Sanghoon | N/A | |
| Associating Inter-Image Salient Instances for Weakly Supervised Semantic Segmentation | Fan, Ruochen and Hou, Qibin and Cheng, Ming-Ming and Yu, Gang and Martin, Ralph R. and Hu, Shi-Min | N/A | |
| Jointly Discovering Visual Objects and Spoken Words from Raw Sensory Input | Harwath, David and Recasens, Adria and Suris, Didac and Chuang, Galen and Torralba, Antonio and Glass, James | N/A | |
| Visual Tracking via Spatially Aligned Correlation Filters Network | Zhang, Mengdan and Wang, Qiang and Xing, Junliang and Gao, Jin and Peng, Peixi and Hu, Weiming and Maybank, Steve | N/A | |
| HairNet: Single-View Hair Reconstruction using Convolutional Neural Networks | Zhou, Yi and Hu, Liwen and Xing, Jun and Chen, Weikai and Kung, Han-Wei and Tong, Xin and Li, Hao | N/A | |
| The Sound of Pixels | Zhao, Hang and Gan, Chuang and Rouditchenko, Andrew and Vondrick, Carl and McDermott, Josh and Torralba, Antonio | N/A | |
| Shape Reconstruction Using Volume Sweeping and Learned Photoconsistency | Leroy, Vincent and Franco, Jean-Sebastien and Boyer, Edmond | N/A | |
| Quantized Densely Connected U-Nets for Efficient Landmark Localization | Tang, Zhiqiang and Peng, Xi and Geng, Shijie and Wu, Lingfei and Zhang, Shaoting and Metaxas, Dimitris | N/A | |
| Joint 3D tracking of a deformable object in interaction with a hand | Tsoli, Aggeliki and Argyros, Antonis A. | N/A | |
| Move Forward and Tell: A Progressive Generator of Video Descriptions | Xiong, Yilei and Dai, Bo and Lin, Dahua | N/A | |
| Face Recognition with Contrastive Convolution | Han, Chunrui and Shan, Shiguang and Kan, Meina and Wu, Shuzhe and Chen, Xilin | N/A | |
| Repeatability Is Not Enough: Learning Affine Regions via Discriminability | Mishkin, Dmytro and Radenovic, Filip and Matas, Jiri | N/A | |
| Tackling 3D ToF Artifacts Through Learning and the FLAT Dataset | Guo, Qi and Frosio, Iuri and Gallo, Orazio and Zickler, Todd and Kautz, Jan | N/A | |
| Using LIP to Gloss Over Faces in Single-Stage Face Detection Networks | Yang, Siqi and Wiliem, Arnold and Chen, Shaokang and Lovell, Brian C. | N/A | |
| Motion Feature Network: Fixed Motion Filter for Action Recognition | Lee, Myunggi and Lee, Seungeui and Son, Sungjoon and Park, Gyutae and Kwak, Nojun | N/A | |
| Towards Privacy-Preserving Visual Recognition via Adversarial Training: A Pilot Study | Wu, Zhenyu and Wang, Zhangyang and Wang, Zhaowen and Jin, Hailin | N/A | |
| Learning Compression from Limited Unlabeled Data | He, Xiangyu and Cheng, Jian | N/A | |
| DeepVS: A Deep Learning Based Video Saliency Prediction Approach | Jiang, Lai and Xu, Mai and Liu, Tie and Qiao, Minglang and Wang, Zulin | N/A | |
| ADVIO: An Authentic Dataset for Visual-Inertial Odometry | Cortes, Santiago and Solin, Arno and Rahtu, Esa and Kannala, Juho | N/A | |
| Adversarial Geometry-Aware Human Motion Prediction | Gui, Liang-Yan and Wang, Yu-Xiong and Liang, Xiaodan and Moura, Jose M. F. | N/A | |
| Online Dictionary Learning for Approximate Archetypal Analysis | Mei, Jieru and Wang, Chunyu and Zeng, Wenjun | N/A | |
| Rendering Portraitures from Monocular Camera and Beyond | Xu, Xiangyu and Sun, Deqing and Liu, Sifei and Ren, Wenqi and Zhang, Yu-Jin and Yang, Ming-Hsuan and Sun, Jian | N/A | |
| Attributes as Operators: Factorizing Unseen Attribute-Object Compositions | Nagarajan, Tushar and Grauman, Kristen | N/A | |
| Scaling Egocentric Vision: The EPIC-KITCHENS Dataset | Damen, Dima and Doughty, Hazel and Maria Farinella, Giovanni and Fidler, Sanja and Furnari, Antonino and Kazakos, Evangelos and Moltisanti, Davide and Munro, Jonathan and Perrett, Toby and Price, Will and Wray, Michael | N/A | |
| Realtime Time Synchronized Event-based Stereo | Zihao Zhu, Alex and Chen, Yibo and Daniilidis, Kostas | N/A | |
| Memory Aware Synapses: Learning what (not) to forget | Aljundi, Rahaf and Babiloni, Francesca and Elhoseiny, Mohamed and Rohrbach, Marcus and Tuytelaars, Tinne | N/A | |
| Learning and Matching Multi-View Descriptors for Registration of Point Clouds | Zhou, Lei and Zhu, Siyu and Luo, Zixin and Shen, Tianwei and Zhang, Runze and Zhen, Mingmin and Fang, Tian and Quan, Long | N/A | |
| Semi-Dense 3D Reconstruction with a Stereo Event Camera | Zhou, Yi and Gallego, Guillermo and Rebecq, Henri and Kneip, Laurent and Li, Hongdong and Scaramuzza, Davide | N/A | |
| Scale-Awareness of Light Field Camera based Visual Odometry | Zeller, Niclas and Quint, Franz and Stilla, Uwe | N/A | |
| Revisiting Autofocus for Smartphone Cameras | Abuolaim, Abdullah and Punnappurath, Abhijith and Brown, Michael S. | N/A | |
| Deep Adversarial Attention Alignment for Unsupervised Domain Adaptation: the Benefit of Target Expectation Maximization | Kang, Guoliang and Zheng, Liang and Yan, Yan and Yang, Yi | N/A | |
| Efficient 6-DoF Tracking of Handheld Objects from an Egocentric Viewpoint | Pandey, Rohit and Pidlypenskyi, Pavel and Yang, Shuoran and Kaeser-Chen, Christine | N/A | |
| How good is my GAN? | Shmelkov, Konstantin and Schmid, Cordelia and Alahari, Karteek | N/A | |
| Superpixel Sampling Networks | Jampani, Varun and Sun, Deqing and Liu, Ming-Yu and Yang, Ming-Hsuan and Kautz, Jan | N/A | |
| Effective Use of Synthetic Data for Urban Scene Semantic Segmentation | Sadat Saleh, Fatemeh and Sadegh Aliakbarian, Mohammad and Salzmann, Mathieu and Petersson, Lars and Alvarez, Jose M. | N/A | |
| Generating 3D Faces using Convolutional Mesh Autoencoders | Ranjan, Anurag and Bolkart, Timo and Sanyal, Soubhik and Black, Michael J. | N/A | |
| 3D Face Reconstruction from Light Field Images: A Model-free Approach | Feng, Mingtao and Zulqarnain Gilani, Syed and Wang, Yaonan and Mian, Ajmal | N/A | |
| Attention-aware Deep Adversarial Hashing for Cross-Modal Retrieval | Zhang, Xi and Lai, Hanjiang and Feng, Jiashi | N/A | |
| Museum Exhibit Identification Challenge for the Supervised Domain Adaptation and Beyond | Koniusz, Piotr and Tas, Yusuf and Zhang, Hongguang and Harandi, Mehrtash and Porikli, Fatih and Zhang, Rui | N/A | |
| End-to-End Deep Structured Models for Drawing Crosswalks | Liang, Justin and Urtasun, Raquel | N/A | |
| Learning Visual Question Answering by Bootstrapping Hard Attention | Malinowski, Mateusz and Doersch, Carl and Santoro, Adam and Battaglia, Peter | N/A | |
| Deep Adaptive Attention for Joint Facial Action Unit Detection and Face Alignment | Shao, Zhiwen and Liu, Zhilei and Cai, Jianfei and Ma, Lizhuang | N/A | |
| Data-Driven Sparse Structure Selection for Deep Neural Networks | Huang, Zehao and Wang, Naiyan | N/A | |
| To learn image super-resolution, use a GAN to learn how to do image degradation first | Bulat, Adrian and Yang, Jing and Tzimiropoulos, Georgios | N/A | |
| Self-Supervised Relative Depth Learning for Urban Scene Understanding | Jiang, Huaizu and Larsson, Gustav and Maire, Michael and Shakhnarovich, Greg and Learned-Miller, Erik | N/A | |
| End-to-End Joint Semantic Segmentation of Actors and Actions in Video | Ji, Jingwei and Buch, Shyamal and Soto, Alvaro and Niebles, Juan Carlos | N/A | |
| Deep Texture and Structure Aware Filtering Network for Image Smoothing | Lu, Kaiyue and You, Shaodi and Barnes, Nick | N/A | |
| Pairwise Relational Networks for Face Recognition | Kang, Bong-Nam and Kim, Yonghyun and Kim, Daijin | N/A | |
| LAPRAN: A Scalable Laplacian Pyramid Reconstructive Adversarial Network for Flexible Compressive Sensing Reconstruction | Xu, Kai and Zhang, Zhikang and Ren, Fengbo | N/A | |
| Learning Warped Guidance for Blind Face Restoration | Li, Xiaoming and Liu, Ming and Ye, Yuting and Zuo, Wangmeng and Lin, Liang and Yang, Ruigang | N/A | |
| Shift-Net: Image Inpainting via Deep Feature Rearrangement | Yan, Zhaoyi and Li, Xiaoming and Li, Mu and Zuo, Wangmeng and Shan, Shiguang | N/A | |
| Question-Guided Hybrid Convolution for Visual Question Answering | Gao, Peng and Li, Hongsheng and Li, Shuang and Lu, Pan and Li, Yikang and Hoi, Steven C.H. and Wang, Xiaogang | N/A | |
| Disentangling Factors of Variation with Cycle-Consistent Variational Auto-Encoders | Jha, Ananya Harsh and Anand, Saket and Singh, Maneesh and Veeravasarapu, VSR | N/A | |
| Deep Fundamental Matrix Estimation | Ranftl, Rene and Koltun, Vladlen | N/A | |
| Where are the blobs: Counting by Localization with Point Supervision | Laradji, Issam H. and Rostamzadeh, Negar and Pinheiro, Pedro O. and Vazquez, David and Schmidt, Mark | N/A | |
| Pose Guided Human Video Generation | Yang, Ceyuan and Wang, Zhe and Zhu, Xinge and Huang, Chen and Shi, Jianping and Lin, Dahua | N/A | |
| Real-time 'Actor-Critic' Tracking | Chen, Boyu and Wang, Dong and Li, Peixia and Wang, Shuang and Lu, Huchuan | N/A | |
| Estimating the Success of Unsupervised Image to Image Translation | Benaim, Sagie and Galanti, Tomer and Wolf, Lior | N/A | |
| Deep Bilevel Learning | Jenni, Simon and Favaro, Paolo | N/A | |
| Sparsely Aggregated Convolutional Networks | Zhu, Ligeng and Deng, Ruizhi and Maire, Michael and Deng, Zhiwei and Mori, Greg and Tan, Ping | N/A | |
| Interpretable Intuitive Physics Model | Ye, Tian and Wang, Xiaolong and Davidson, James and Gupta, Abhinav | N/A | |
| Appearance-Based Gaze Estimation via Evaluation-Guided Asymmetric Regression | Cheng, Yihua and Lu, Feng and Zhang, Xucong | N/A | |
| ADVISE: Symbolism and External Knowledge for Decoding Advertisements | Ye, Keren and Kovashka, Adriana | N/A | |
| Toward Characteristic-Preserving Image-based Virtual Try-On Network | Wang, Bochao and Zheng, Huabin and Liang, Xiaodan and Chen, Yimin and Lin, Liang and Yang, Meng | N/A | |
| A Closed-form Solution to Photorealistic Image Stylization | Li, Yijun and Liu, Ming-Yu and Li, Xueting and Yang, Ming-Hsuan and Kautz, Jan | N/A | |
| Understanding Degeneracies and Ambiguities in Attribute Transfer | Szabo, Attila and Hu, Qiyang and Portenier, Tiziano and Zwicker, Matthias and Favaro, Paolo | N/A | |
| Bi-Real Net: Enhancing the Performance of 1-bit CNNs with Improved Representational Capability and Advanced Training Algorithm | Liu, Zechun and Wu, Baoyuan and Luo, Wenhan and Yang, Xin and Liu, Wei and Cheng, Kwang-Ting | N/A | |
| Temporal Modular Networks for Retrieving Complex Compositional Activities in Videos | Liu, Bingbin and Yeung, Serena and Chou, Edward and Huang, De-An and Fei-Fei, Li and Niebles, Juan Carlos | N/A | |
| Neural Stereoscopic Image Style Transfer | Gong, Xinyu and Huang, Haozhi and Ma, Lin and Shen, Fumin and Liu, Wei and Zhang, Tong | N/A | |
| HiDDeN: Hiding Data with Deep Networks | Zhu, Jiren and Kaplan, Russell and Johnson, Justin and Fei-Fei, Li | N/A | |
| Occlusion-aware Hand Pose Estimation Using Hierarchical Mixture Density Network | Ye, Qi and Kim, Tae-Kyun | N/A | |
| Conditional Prior Networks for Optical Flow | Yang, Yanchao and Soatto, Stefano | N/A | |
| Learning 3D Keypoint Descriptors for Non-Rigid Shape Matching | Wang, Hanyu and Guo, Jianwei and Yan, Dong-Ming and Quan, Weize and Zhang, Xiaopeng | N/A | |
| Stacked Cross Attention for Image-Text Matching | Lee, Kuang-Huei and Chen, Xi and Hua, Gang and Hu, Houdong and He, Xiaodong | N/A | |
| Video Summarization Using Fully Convolutional Sequence Networks | Rochan, Mrigank and Ye, Linwei and Wang, Yang | N/A | |
| Unveiling the Power of Deep Tracking | Bhat, Goutam and Johnander, Joakim and Danelljan, Martin and Shahbaz Khan, Fahad and Felsberg, Michael | N/A | |
| Weakly Supervised Region Proposal Network and Object Detection | Tang, Peng and Wang, Xinggang and Wang, Angtian and Yan, Yongluan and Liu, Wenyu and Huang, Junzhou and Yuille, Alan | N/A | |
| The Devil of Face Recognition is in the Noise | Wang, Fei and Chen, Liren and Li, Cheng and Huang, Shiyao and Chen, Yanjie and Qian, Chen and Change Loy, Chen | N/A | |
| SwapNet: Garment Transfer in Single View Images | Raj, Amit and Sangkloy, Patsorn and Chang, Huiwen and Lu, Jingwan and Ceylan, Duygu and Hays, James | N/A | |
| Egocentric Activity Prediction via Event Modulated Attention | Shen, Yang and Ni, Bingbing and Li, Zefan and Zhuang, Ning | N/A | |
| Person Search in Videos with One Portrait Through Visual and Temporal Links | Huang, Qingqiu and Liu, Wentao and Lin, Dahua | N/A | |
| Stereo Computation for a Single Mixture Image | Zhong, Yiran and Dai, Yuchao and Li, Hongdong | N/A | |
| Value-aware Quantization for Training and Inference of Neural Networks | Park, Eunhyeok and Yoo, Sungjoo and Vajda, Peter | N/A | |
| Explainable Neural Computation via Stack Neural Module Networks | Hu, Ronghang and Andreas, Jacob and Darrell, Trevor and Saenko, Kate | N/A | |
| Semi-supervised Adversarial Learning to Generate Photorealistic Face Images of New Identities from 3D Morphable Model | Gecer, Baris and Bhattarai, Binod and Kittler, Josef and Kim, Tae-Kyun | N/A | |
| TBN: Convolutional Neural Network with Ternary Inputs and Binary Weights | Wan, Diwen and Shen, Fumin and Liu, Li and Zhu, Fan and Qin, Jie and Shao, Ling and Tao Shen, Heng | N/A | |
| Verisimilar Image Synthesis for Accurate Detection and Recognition of Texts in Scenes | Zhan, Fangneng and Lu, Shijian and Xue, Chuhui | N/A | |
| What do I Annotate Next? An Empirical Study of Active Learning for Action Localization | Caba Heilbron, Fabian and Lee, Joon-Young and Jin, Hailin and Ghanem, Bernard | N/A | |
| An Adversarial Approach to Hard Triplet Generation | Zhao, Yiru and Jin, Zhongming and Qi, Guo-jun and Lu, Hongtao and Hua, Xian-sheng | N/A | |
| Interactive Boundary Prediction for Object Selection | Le, Hoang and Mai, Long and Price, Brian and Cohen, Scott and Jin, Hailin and Liu, Feng | N/A | |
| TrackingNet: A Large-Scale Dataset and Benchmark for Object Tracking in the Wild | Muller, Matthias and Bibi, Adel and Giancola, Silvio and Alsubaihi, Salman and Ghanem, Bernard | N/A | |
| Concept Mask: Large-Scale Segmentation from Semantic Concepts | Wang, Yufei and Lin, Zhe and Shen, Xiaohui and Zhang, Jianming and Cohen, Scott | N/A | |
| Simultaneous 3D Reconstruction for Water Surface and Underwater Scene | Qian, Yiming and Zheng, Yinqiang and Gong, Minglun and Yang, Yee-Hong | N/A | |
| SegStereo: Exploiting Semantic Information for Disparity Estimation | Yang, Guorun and Zhao, Hengshuang and Shi, Jianping and Deng, Zhidong and Jia, Jiaya | N/A | |
| 3D-CODED: 3D Correspondences by Deep Deformation | Groueix, Thibault and Fisher, Matthew and Kim, Vladimir G. and Russell, Bryan C. and Aubry, Mathieu | N/A | |
| Deep Virtual Stereo Odometry: Leveraging Deep Depth Prediction for Monocular Direct Sparse Odometry | Yang, Nan and Wang, Rui and Stuckler, Jorg and Cremers, Daniel | N/A | |
| Single Image Intrinsic Decomposition without a Single Intrinsic Image | Ma, Wei-Chiu and Chu, Hang and Zhou, Bolei and Urtasun, Raquel and Torralba, Antonio | N/A | |
| Deep Model-Based 6D Pose Refinement in RGB | Manhardt, Fabian and Kehl, Wadim and Navab, Nassir and Tombari, Federico | N/A | |
| Learning-based Video Motion Magnification | Oh, Tae-Hyun and Jaroensri, Ronnachai and Kim, Changil and Elgharib, Mohamed and Durand, Frédo and Freeman, William T. and Matusik, Wojciech | N/A | |
| DeepJDOT: Deep Joint Distribution Optimal Transport for Unsupervised Domain Adaptation | Bhushan Damodaran, Bharath and Kellenberger, Benjamin and Flamary, Remi and Tuia, Devis and Courty, Nicolas | N/A | |
| Pose Proposal Networks | Sekii, Taiki | N/A | |
| Deep Regionlets for Object Detection | Xu, Hongyu and Lv, Xutao and Wang, Xiaoyu and Ren, Zhou and Bodla, Navaneeth and Chellappa, Rama | N/A | |
| Learning with Biased Complementary Labels | Yu, Xiyu and Liu, Tongliang and Gong, Mingming and Tao, Dacheng | N/A | |
| BSN: Boundary Sensitive Network for Temporal Action Proposal Generation | Lin, Tianwei and Zhao, Xu and Su, Haisheng and Wang, Chongjing and Yang, Ming | N/A | |
| Visual Reasoning with Multi-hop Feature Modulation | Strub, Florian and Seurin, Mathieu and Perez, Ethan and de Vries, Harm and Mary, Jeremie and Preux, Philippe and Courville, Aaron and Pietquin, Olivier | N/A | |
| Multiresolution Tree Networks for 3D Point Cloud Processing | Gadelha, Matheus and Wang, Rui and Maji, Subhransu | N/A | |
| Seeing Tree Structure from Vibration | Xue, Tianfan and Wu, Jiajun and Zhang, Zhoutong and Zhang, Chengkai and Tenenbaum, Joshua B. and Freeman, William T. | N/A | |
| DDRNet: Depth Map Denoising and Refinement for Consumer Depth Cameras Using Cascaded CNNs | Yan, Shi and Wu, Chenglei and Wang, Lizhen and Xu, Feng and An, Liang and Guo, Kaiwen and Liu, Yebin | N/A | |
| Probabilistic Video Generation using Holistic Attribute Control | He, Jiawei and Lehrmann, Andreas and Marino, Joseph and Mori, Greg and Sigal, Leonid | N/A | |
| Video Re-localization | Feng, Yang and Ma, Lin and Liu, Wei and Zhang, Tong and Luo, Jiebo | N/A | |
| Adversarial Open-World Person Re-Identification | Li, Xiang and Wu, Ancong and Zheng, Wei-Shi | N/A | |
| Geometric Constrained Joint Lane Segmentation and Lane Boundary Detection | Zhang, Jie and Xu, Yi and Ni, Bingbing and Duan, Zhenyu | N/A | |
| A Geometric Perspective on Structured Light Coding | Gupta, Mohit and Nakhate, Nikhil | N/A | |
| Modular Generative Adversarial Networks | Zhao, Bo and Chang, Bo and Jie, Zequn and Sigal, Leonid | N/A | |
| SRFeat: Single Image Super-Resolution with Feature Discrimination | Park, Seong-Jin and Son, Hyeongseok and Cho, Sunghyun and Hong, Ki-Sang and Lee, Seungyong | N/A | |
| Skeleton-Based Action Recognition with Spatial Reasoning and Temporal Stack Learning | Si, Chenyang and Jing, Ya and Wang, Wei and Wang, Liang and Tan, Tieniu | N/A | |
| Self-produced Guidance for Weakly-supervised Object Localization | Zhang, Xiaolin and Wei, Yunchao and Kang, Guoliang and Yang, Yi and Huang, Thomas | N/A | |
| Self-Calibration of Cameras with Euclidean Image Plane in Case of Two Views and Known Relative Rotation Angle | Martyushev, Evgeniy | N/A | |
| RIDI: Robust IMU Double Integration | Yan, Hang and Shan, Qi and Furukawa, Yasutaka | N/A | |
| Learning Monocular Depth by Distilling Cross-domain Stereo Networks | Guo, Xiaoyang and Li, Hongsheng and Yi, Shuai and Ren, Jimmy and Wang, Xiaogang | N/A | |
| Fully Motion-Aware Network for Video Object Detection | Wang, Shiyao and Zhou, Yucong and Yan, Junjie and Deng, Zhidong | N/A | |
| GridFace: Face Rectification via Learning Local Homography Transformations | Zhou, Erjin and Cao, Zhimin and Sun, Jian | N/A | |
| Deep Feature Pyramid Reconfiguration for Object Detection | Kong, Tao and Sun, Fuchun and Tan, Chuanqi and Liu, Huaping and Huang, Wenbing | N/A | |
| Does Haze Removal Help CNN-based Image Classification? | Pei, Yanting and Huang, Yaping and Zou, Qi and Lu, Yuhang and Wang, Song | N/A | |
| Multi-modal Cycle-consistent Generalized Zero-Shot Learning | Felix, Rafael and Kumar, Vijay B. G. and Reid, Ian and Carneiro, Gustavo | N/A | |
| YouTube-VOS: Sequence-to-Sequence Video Object Segmentation | Xu, Ning and Yang, Linjie and Fan, Yuchen and Yang, Jianchao and Yue, Dingcheng and Liang, Yuchen and Price, Brian and Cohen, Scott and Huang, Thomas | N/A | |
| Generalizing A Person Retrieval Model Hetero- and Homogeneously | Zhong, Zhun and Zheng, Liang and Li, Shaozi and Yang, Yi | N/A | |
| DYAN: A Dynamical Atoms-Based Network For Video Prediction | Liu, Wenqian and Sharma, Abhishek and Camps, Octavia and Sznaier, Mario | N/A | |
| 3DMV: Joint 3D-Multi-View Prediction for 3D Semantic Scene Segmentation | Dai, Angela and Niessner, Matthias | N/A | |
| WildDash - Creating Hazard-Aware Benchmarks | Zendel, Oliver and Honauer, Katrin and Murschitz, Markus and Steininger, Daniel and Fernandez Dominguez, Gustavo | N/A | |
| Adaptively Transforming Graph Matching | Wang, Fudong and Xue, Nan and Zhang, Yipeng and Bai, Xiang and Xia, Gui-Song | N/A | |
| Learning to Look around Objects for Top-View Representations of Outdoor Scenes | Schulter, Samuel and Zhai, Menghua and Jacobs, Nathan and Chandraker, Manmohan | N/A | |
| Visual Psychophysics for Making Face Recognition Algorithms More Explainable | RichardWebster, Brandon and Yon Kwon, So and Clarizio, Christopher and Anthony, Samuel E. and Scheirer, Walter J. | N/A | |
| Eigendecomposition-free Training of Deep Networks with Zero Eigenvalue-based Losses | Dang, Zheng and Moo Yi, Kwang and Hu, Yinlin and Wang, Fei and Fua, Pascal and Salzmann, Mathieu | N/A | |
| Deep Domain Generalization via Conditional Invariant Adversarial Networks | Li, Ya and Tian, Xinmei and Gong, Mingming and Liu, Yajing and Liu, Tongliang and Zhang, Kun and Tao, Dacheng | N/A | |
| Local Spectral Graph Convolution for Point Set Feature Learning | Wang, Chu and Samari, Babak and Siddiqi, Kaleem | N/A | |
| Fighting Fake News: Image Splice Detection via Learned Self-Consistency | Huh, Minyoung and Liu, Andrew and Owens, Andrew and Efros, Alexei A. | N/A | |
| Receptive Field Block Net for Accurate and Fast Object Detection | Liu, Songtao and Huang, Di and Wang, Yunhong | N/A | |
| Dual-Agent Deep Reinforcement Learning for Deformable Face Tracking | Guo, Minghao and Lu, Jiwen and Zhou, Jie | N/A | |
| Online Multi-Object Tracking with Dual Matching Attention Networks | Zhu, Ji and Yang, Hua and Liu, Nian and Kim, Minyoung and Zhang, Wenjun and Yang, Ming-Hsuan | N/A | |
| Simultaneous Edge Alignment and Learning | Yu, Zhiding and Liu, Weiyang and Zou, Yang and Feng, Chen and Ramalingam, Srikumar and Vijaya Kumar, B. V. K. and Kautz, Jan | N/A | |
| Weakly-supervised Video Summarization using Variational Encoder-Decoder and Web Prior | Cai, Sijia and Zuo, Wangmeng and Davis, Larry S. and Zhang, Lei | N/A | |
| Toward Scale-Invariance and Position-Sensitive Region Proposal Networks | Lu, Hsueh-Fu and Du, Xiaofei and Chang, Ping-Lin | N/A | |
| Visual Question Answering as a Meta Learning Task | Teney, Damien and van den Hengel, Anton | N/A | |
| Generative Semantic Manipulation with Mask-Contrasting GAN | Liang, Xiaodan and Zhang, Hao and Lin, Liang and Xing, Eric | N/A | |
| End-to-End Learning of Driving Models with Surround-View Cameras and Route Planners | Hecker, Simon and Dai, Dengxin and Van Gool, Luc | N/A | |
| Deep High Dynamic Range Imaging with Large Foreground Motions | Wu, Shangzhe and Xu, Jiarui and Tai, Yu-Wing and Tang, Chi-Keung | N/A | |
| Hierarchical Relational Networks for Group Activity Recognition and Retrieval | Ibrahim, Mostafa S. and Mori, Greg | N/A | |
| GeoDesc: Learning Local Descriptors by Integrating Geometry Constraints | Luo, Zixin and Shen, Tianwei and Zhou, Lei and Zhu, Siyu and Zhang, Runze and Yao, Yao and Fang, Tian and Quan, Long | N/A | |
| SDC-Net: Video prediction using spatially-displaced convolution | Reda, Fitsum A. and Liu, Guilin and Shih, Kevin J. and Kirby, Robert and Barker, Jon and Tarjan, David and Tao, Andrew and Catanzaro, Bryan | N/A | |
| Efficient Sliding Window Computation for NN-Based Template Matching | Talker, Lior and Moses, Yael and Shimshoni, Ilan | N/A | |
| RefocusGAN: Scene Refocusing using a Single Image | Sakurikar, Parikshit and Mehta, Ishit and Balasubramanian, Vineeth N. and Narayanan, P. J. | N/A | |
| Action Search: Spotting Actions in Videos and Its Application to Temporal Action Localization | Alwassel, Humam and Caba Heilbron, Fabian and Ghanem, Bernard | N/A | |
| Joint Blind Motion Deblurring and Depth Estimation of Light Field | Lee, Dongwoo and Park, Haesol and Kyu Park, In and Mu Lee, Kyoung | N/A | |
| Mutual Learning to Adapt for Joint Human Parsing and Pose Estimation | Nie, Xuecheng and Feng, Jiashi and Yan, Shuicheng | N/A | |
| DOCK: Detecting Objects by transferring Common-sense Knowledge | Kumar Singh, Krishna and Divvala, Santosh and Farhadi, Ali and Jae Lee, Yong | N/A | |
| Simple Baselines for Human Pose Estimation and Tracking | Xiao, Bin and Wu, Haiping and Wei, Yichen | N/A | |
| PM-GANs: Discriminative Representation Learning for Action Recognition Using Partial-modalities | Wang, Lan and Gao, Chenqiang and Yang, Luyu and Zhao, Yue and Zuo, Wangmeng and Meng, Deyu | N/A | |
| CAR-Net: Clairvoyant Attentive Recurrent Network | Sadeghian, Amir and Legros, Ferdinand and Voisin, Maxime and Vesel, Ricky and Alahi, Alexandre and Savarese, Silvio | N/A | |
| Dynamic Filtering with Large Sampling Field for ConvNets | Wu, Jialin and Li, Dai and Yang, Yu and Bajaj, Chandrajit and Ji, Xiangyang | N/A | |
| Learning Category-Specific Mesh Reconstruction from Image Collections | Kanazawa, Angjoo and Tulsiani, Shubham and Efros, Alexei A. and Malik, Jitendra | N/A | |
| Clustering Convolutional Kernels to Compress Deep Neural Networks | Son, Sanghyun and Nah, Seungjun and Mu Lee, Kyoung | N/A | |
| CornerNet: Detecting Objects as Paired Keypoints | Law, Hei and Deng, Jia | N/A | |
| Efficient Dense Point Cloud Object Reconstruction using Deformation Vector Fields | Li, Kejie and Pham, Trung and Zhan, Huangying and Reid, Ian | N/A | |
| Choose Your Neuron: Incorporating Domain Knowledge through Neuron-Importance | Selvaraju, Ramprasaath R. and Chattopadhyay, Prithvijit and Elhoseiny, Mohamed and Sharma, Tilak and Batra, Dhruv and Parikh, Devi and Lee, Stefan | N/A | |
| Hashing with Binary Matrix Pursuit | Cakir, Fatih and He, Kun and Sclaroff, Stan | N/A | |
| Recognition in Terra Incognita | Beery, Sara and Van Horn, Grant and Perona, Pietro | N/A | |
| Fast and Accurate Intrinsic Symmetry Detection | Nagar, Rajendra and Raman, Shanmuganathan | N/A | |
| Massively Parallel Video Networks | Carreira, Joao and Patraucean, Viorica and Mazare, Laurent and Zisserman, Andrew and Osindero, Simon | N/A | |
| ExFuse: Enhancing Feature Fusion for Semantic Segmentation | Zhang, Zhenli and Zhang, Xiangyu and Peng, Chao and Xue, Xiangyang and Sun, Jian | N/A | |
| Collaborative Deep Reinforcement Learning for Multi-Object Tracking | Ren, Liangliang and Lu, Jiwen and Wang, Zifeng and Tian, Qi and Zhou, Jie | N/A | |
| Deep Variational Metric Learning | Lin, Xudong and Duan, Yueqi and Dong, Qiyuan and Lu, Jiwen and Zhou, Jie | N/A | |
| MVTec D2S: Densely Segmented Supermarket Dataset | Follmann, Patrick and Bottger, Tobias and Hartinger, Philipp and Konig, Rebecca and Ulrich, Markus | N/A | |
| Robust fitting in computer vision: easy or hard? | Chin, Tat-Jun and Cai, Zhipeng and Neumann, Frank | N/A | |
| Visual Question Generation for Class Acquisition of Unknown Objects | Uehara, Kohei and Tejero-De-Pablos, Antonio and Ushiku, Yoshitaka and Harada, Tatsuya | N/A | |
| Image Manipulation with Perceptual Discriminators | Sungatullina, Diana and Zakharov, Egor and Ulyanov, Dmitry and Lempitsky, Victor | N/A | |
| Pairwise Confusion for Fine-Grained Visual Classification | Dubey, Abhimanyu and Gupta, Otkrist and Guo, Pei and Raskar, Ramesh and Farrell, Ryan and Naik, Nikhil | N/A | |
| Combining 3D Model Contour Energy and Keypoints for Object Tracking | Bugaev, Bogdan and Kryshchenko, Anton and Belov, Roman | N/A | |
| Quadtree Convolutional Neural Networks | Kumar Jayaraman, Pradeep and Mei, Jianhan and Cai, Jianfei and Zheng, Jianmin | N/A | |
| Deep Recursive HDRI: Inverse Tone Mapping using Generative Adversarial Networks | Lee, Siyeong and Hwan An, Gwon and Kang, Suk-Ju | N/A | |
| Open Set Learning with Counterfactual Images | Neal, Lawrence and Olson, Matthew and Fern, Xiaoli and Wong, Weng-Keen and Li, Fuxin | N/A | |
| Implicit 3D Orientation Learning for 6D Object Detection from RGB Images | Sundermeyer, Martin and Marton, Zoltan-Csaba and Durner, Maximilian and Brucker, Manuel and Triebel, Rudolph | N/A | |
| Compressing the Input for CNNs with the First-Order Scattering Transform | Oyallon, Edouard and Belilovsky, Eugene and Zagoruyko, Sergey and Valko, Michal | N/A | |
| Part-Aligned Bilinear Representations for Person Re-Identification | Suh, Yumin and Wang, Jingdong and Tang, Siyu and Mei, Tao and Mu Lee, Kyoung | N/A | |
| Sidekick Policy Learning for Active Visual Exploration | Ramakrishnan, Santhosh K. and Grauman, Kristen | N/A | |
| HGMR: Hierarchical Gaussian Mixtures for Adaptive 3D Registration | Eckart, B. and Kim, K. and Kautz, J. | N/A | |
| Multi-Attention Multi-Class Constraint for Fine-grained Image Recognition | Sun, Ming and Yuan, Yuchen and Zhou, Feng and Ding, Errui | N/A | |
| From Face Recognition to Models of Identity: A Bayesian Approach to Learning about Unknown Identities from Unsupervised Data | Coelho de Castro, Daniel and Nowozin, Sebastian | N/A | |
| Semi-convolutional Operators for Instance Segmentation | Novotny, David and Albanie, Samuel and Larlus, Diane and Vedaldi, Andrea | N/A | |
| Bi-box Regression for Pedestrian Detection and Occlusion Estimation | Zhou, Chunluan and Yuan, Junsong | N/A | |
| Learning Data Terms for Non-blind Deblurring | Dong, Jiangxin and Pan, Jinshan and Sun, Deqing and Su, Zhixun and Yang, Ming-Hsuan | N/A | |
| Unified Perceptual Parsing for Scene Understanding | Xiao, Tete and Liu, Yingcheng and Zhou, Bolei and Jiang, Yuning and Sun, Jian | N/A | |
| Face Super-resolution Guided by Facial Component Heatmaps | Yu, Xin and Fernando, Basura and Ghanem, Bernard and Porikli, Fatih and Hartley, Richard | N/A | |
| Descending, lifting or smoothing: Secrets of robust cost optimization | Zach, Christopher and Bourmaud, Guillaume | N/A | |
| ExplainGAN: Model Explanation via Decision Boundary Crossing Transformations | Samangouei, Pouya and Saeedi, Ardavan and Nakagawa, Liam and Silberman, Nathan | N/A | |
| A Unified Framework for Multi-View Multi-Class Object Pose Estimation | Li, Chi and Bai, Jin and Hager, Gregory D. | N/A | |
| Spatio-Temporal Channel Correlation Networks for Action Classification | Diba, Ali and Fayyaz, Mohsen and Sharma, Vivek and Mahdi Arzani, M. and Yousefzadeh, Rahman and Gall, Juergen and Van Gool, Luc | N/A | |
| Learning to Reconstruct High-quality 3D Shapes with Cascaded Fully Convolutional Networks | Cao, Yan-Pei and Liu, Zheng-Ning and Kuang, Zheng-Fei and Kobbelt, Leif and Hu, Shi-Min | N/A | |
| Characterizing Adversarial Examples Based on Spatial Consistency Information for Semantic Segmentation | Xiao, Chaowei and Deng, Ruizhi and Li, Bo and Yu, Fisher and Liu, Mingyan and Song, Dawn | N/A | |
| Deep Bilinear Learning for RGB-D Action Recognition | Hu, Jian-Fang and Zheng, Wei-Shi and Pan, Jiahui and Lai, Jianhuang and Zhang, Jianguo | N/A | |
| Coded Two-Bucket Cameras for Computer Vision | Wei, Mian and Sarhangnejad, Navid and Xia, Zhengfan and Gusev, Nikita and Katic, Nikola and Genov, Roman and Kutulakos, Kiriakos N. | N/A | |
| Few-Shot Human Motion Prediction via Meta-Learning | Gui, Liang-Yan and Wang, Yu-Xiong and Ramanan, Deva and Moura, Jose M. F. | N/A | |
| Recycle-GAN: Unsupervised Video Retargeting | Bansal, Aayush and Ma, Shugao and Ramanan, Deva and Sheikh, Yaser | N/A | |
| Two at Once: Enhancing Learning and Generalization Capacities via IBN-Net | Pan, Xingang and Luo, Ping and Shi, Jianping and Tang, Xiaoou | N/A | |
| Learning Shape Priors for Single-View 3D Completion and Reconstruction | Wu, Jiajun and Zhang, Chengkai and Zhang, Xiuming and Zhang, Zhoutong and Freeman, William T. and Tenenbaum, Joshua B. | N/A | |
| Unsupervised Image-to-Image Translation with Stacked Cycle-Consistent Adversarial Networks | Li, Minjun and Huang, Haozhi and Ma, Lin and Liu, Wei and Zhang, Tong and Jiang, Yugang | N/A | |
| Hierarchical Metric Learning and Matching for 2D and 3D Geometric Correspondences | Fathy, Mohammed E. and Tran, Quoc-Huy and Zeeshan Zia, M. and Vernaza, Paul and Chandraker, Manmohan | N/A | |
| A Minimal Closed-Form Solution for Multi-Perspective Pose Estimation using Points and Lines | Miraldo, Pedro and Dias, Tiago and Ramalingam, Srikumar | N/A | |
| Key-Word-Aware Network for Referring Expression Image Segmentation | Shi, Hengcan and Li, Hongliang and Meng, Fanman and Wu, Qingbo | N/A | |
| Dynamic Conditional Networks for Few-Shot Learning | Zhao, Fang and Zhao, Jian and Yan, Shuicheng and Feng, Jiashi | N/A | |
| Burst Image Deblurring Using Permutation Invariant Convolutional Neural Networks | Aittala, Miika and Durand, Fredo | N/A | |
| Learning Type-Aware Embeddings for Fashion Compatibility | Vasileva, Mariya I. and Plummer, Bryan A. and Dusad, Krishna and Rajpal, Shreya and Kumar, Ranjitha and Forsyth, David | N/A | |
| Learning to Fuse Proposals from Multiple Scanline Optimizations in Semi-Global Matching | Schonberger, Johannes L. and Sinha, Sudipta N. and Pollefeys, Marc | N/A | |
| Dividing and Aggregating Network for Multi-view Action Recognition | Wang, Dongang and Ouyang, Wanli and Li, Wen and Xu, Dong | N/A | |
| Joint & Progressive Learning from High-Dimensional Data for Multi-Label Classification | Hong, Danfeng and Yokoya, Naoto and Xu, Jian and Zhu, Xiaoxiang | N/A | |
| Image Inpainting for Irregular Holes Using Partial Convolutions | Liu, Guilin and Reda, Fitsum A. and Shih, Kevin J. and Wang, Ting-Chun and Tao, Andrew and Catanzaro, Bryan | N/A | |
| CPlaNet: Enhancing Image Geolocalization by Combinatorial Partitioning of Maps | Hongsuck Seo, Paul and Weyand, Tobias and Sim, Jack and Han, Bohyung | N/A | |
| Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation | Chen, Liang-Chieh and Zhu, Yukun and Papandreou, George and Schroff, Florian and Adam, Hartwig | N/A | |
| Large Scale Urban Scene Modeling from MVS Meshes | Zhu, Lingjie and Shen, Shuhan and Gao, Xiang and Hu, Zhanyi | N/A | |
| Generalized Loss-Sensitive Adversarial Learning with Manifold Margins | Edraki, Marzieh and Qi, Guo-Jun | N/A | |
| Learning to Detect and Track Visible and Occluded Body Joints in a Virtual World | Fabbri, Matteo and Lanzi, Fabio and Calderara, Simone and Palazzi, Andrea and Vezzani, Roberto and Cucchiara, Rita | N/A | |
| W-TALC: Weakly-supervised Temporal Activity Localization and Classification | Paul, Sujoy and Roy, Sourya and Roy-Chowdhury, Amit K. | N/A | |
| Viewpoint Estimation - Insights & Model | Divon, Gilad and Tal, Ayellet | N/A | |
| Relaxation-Free Deep Hashing via Policy Gradient | Yuan, Xin and Ren, Liangliang and Lu, Jiwen and Zhou, Jie | N/A | |
| Rolling Shutter Pose and Ego-motion Estimation using Shape-from-Template | Lao, Yizhen and Ait-Aider, Omar and Bartoli, Adrien | N/A | |
| Learning to Capture Light Fields through a Coded Aperture Camera | Inagaki, Yasutaka and Kobayashi, Yuto and Takahashi, Keita and Fujii, Toshiaki and Nagahara, Hajime | N/A | |
| Variable Ring Light Imaging: Capturing Transient Subsurface Scattering with An Ordinary Camera | Nishino, Ko and Subpa-asa, Art and Asano, Yuta and Shimano, Mihoko and Sato, Imari | N/A | |
| Deep Video Generation, Prediction and Completion of Human Action Sequences | Cai, Haoye and Bai, Chunyan and Tai, Yu-Wing and Tang, Chi-Keung | N/A | |
| PersonLab: Person Pose Estimation and Instance Segmentation with a Bottom-Up, Part-Based, Geometric Embedding Model | Papandreou, George and Zhu, Tyler and Chen, Liang-Chieh and Gidaris, Spyros and Tompson, Jonathan and Murphy, Kevin | N/A | |
| Robust image stitching with multiple registrations | Herrmann, Charles and Wang, Chen and Strong Bowen, Richard and Keyder, Emil and Krainin, Michael and Liu, Ce and Zabih, Ramin | N/A | |
| Learning to Solve Nonlinear Least Squares for Monocular Stereo | Clark, Ronald and Bloesch, Michael and Czarnowski, Jan and Leutenegger, Stefan and Davison, Andrew J. | N/A | |
| Direct Sparse Odometry With Rolling Shutter | Schubert, David and Demmel, Nikolaus and Usenko, Vladyslav and Stuckler, Jorg and Cremers, Daniel | N/A | |
| A Zero-Shot Framework for Sketch based Image Retrieval | Kiran Yelamarthi, Sasi and Krishna Reddy, Shiva and Mishra, Ashish and Mittal, Anurag | N/A | |
| Structured Siamese Network for Real-Time Visual Tracking | Zhang, Yunhua and Wang, Lijun and Qi, Jinqing and Wang, Dong and Feng, Mengyang and Lu, Huchuan | N/A | |
| Selective Zero-Shot Classification with Augmented Attributes | Song, Jie and Shen, Chengchao and Lei, Jie and Zeng, An-Xiang and Ou, Kairi and Tao, Dacheng and Song, Mingli | N/A | |
| Deep Attention Neural Tensor Network for Visual Question Answering | Bai, Yalong and Fu, Jianlong and Zhao, Tiejun and Mei, Tao | N/A | |
| Zero-Shot Object Detection | Bansal, Ankan and Sikka, Karan and Sharma, Gaurav and Chellappa, Rama and Divakaran, Ajay | N/A | |
| Asynchronous, Photometric Feature Tracking using Events and Frames | Gehrig, Daniel and Rebecq, Henri and Gallego, Guillermo and Scaramuzza, Davide | N/A | |
| Unsupervised Class-Specific Deblurring | Madam Nimisha, Thekke and Sunil, Kumar and Rajagopalan, A. N. | N/A | |
| Imagine This! Scripts to Compositions to Videos | Gupta, Tanmay and Schwenk, Dustin and Farhadi, Ali and Hoiem, Derek and Kembhavi, Aniruddha | N/A | |
| Deep Structure Inference Network for Facial Action Unit Recognition | Corneanu, Ciprian and Madadi, Meysam and Escalera, Sergio | N/A | |
| Action Anticipation with RBF Kernelized Feature Mapping RNN | Shi, Yuge and Fernando, Basura and Hartley, Richard | N/A | |
| CNN-PS: CNN-based Photometric Stereo for General Non-Convex Surfaces | Ikehata, Satoshi | N/A | |
| Small-scale Pedestrian Detection Based on Topological Line Localization and Temporal Feature Aggregation | Song, Tao and Sun, Leiyu and Xie, Di and Sun, Haiming and Pu, Shiliang | N/A | |
| Summarizing First-Person Videos from Third Persons' Points of View | Ho, Hsuan-I and Chiu, Wei-Chen and Frank Wang, Yu-Chiang | N/A | |
| Snap Angle Prediction for 360° Panoramas | Xiong, Bo and Grauman, Kristen | N/A | |
| Zoom-Net: Mining Deep Feature Interactions for Visual Relationship Recognition | Yin, Guojun and Sheng, Lu and Liu, Bin and Yu, Nenghai and Wang, Xiaogang and Shao, Jing and Change Loy, Chen | N/A | |
| Pixel2Mesh: Generating 3D Mesh Models from Single RGB Images | Wang, Nanyang and Zhang, Yinda and Li, Zhuwen and Fu, Yanwei and Liu, Wei and Jiang, Yu-Gang | N/A | |
| Factorizable Net: An Efficient Subgraph-based Framework for Scene Graph Generation | Li, Yikang and Ouyang, Wanli and Zhou, Bolei and Shi, Jianping and Zhang, Chao and Wang, Xiaogang | N/A | |
| Reconstruction-based Pairwise Depth Dataset for Depth Image Enhancement Using CNN | Jeon, Junho and Lee, Seungyong | N/A | |
| Coded Illumination and Imaging for Fluorescence Based Classification | Asano, Yuta and Meguro, Misaki and Wang, Chao and Lam, Antony and Zheng, Yinqiang and Okabe, Takahiro and Sato, Imari | N/A | |
| Multi-view to Novel view: Synthesizing novel views with Self-Learned Confidence | Sun, Shao-Hua and Huh, Minyoung and Liao, Yuan-Hong and Zhang, Ning and Lim, Joseph J. | N/A | |
| Robust Anchor Embedding for Unsupervised Video Person Re-Identification in the Wild | Ye, Mang and Lan, Xiangyuan and Yuen, Pong C. | N/A | |
| Training Binary Weight Networks via Semi-Binary Decomposition | Hu, Qinghao and Li, Gang and Wang, Peisong and Zhang, Yifan and Cheng, Jian | N/A | |
| Hand Pose Estimation via Latent 2.5D Heatmap Regression | Iqbal, Umar and Molchanov, Pavlo and Breuel, Thomas and Gall, Juergen and Kautz, Jan | N/A | |
| LQ-Nets: Learned Quantization for Highly Accurate and Compact Deep Neural Networks | Zhang, Dongqing and Yang, Jiaolong and Ye, Dongqiangzi and Hua, Gang | N/A | |
| Deep Randomized Ensembles for Metric Learning | Xuan, Hong and Souvenir, Richard and Pless, Robert | N/A | |
| ECO: Efficient Convolutional Network for Online Video Understanding | Zolfaghari, Mohammadreza and Singh, Kamaljeet and Brox, Thomas | N/A | |
| Proxy Clouds for Live RGB-D Stream Processing and Consolidation | Kaiser, Adrien and Alonso Ybanez Zepeda, Jose and Boubekeur, Tamy | N/A | |
| Neural Graph Matching Networks for Fewshot 3D Action Recognition | Guo, Michelle and Chou, Edward and Huang, De-An and Song, Shuran and Yeung, Serena and Fei-Fei, Li | N/A | |
| Stereo relative pose from line and point feature triplets | Vakhitov, Alexander and Lempitsky, Victor and Zheng, Yinqiang | N/A | |
| A-Contrario Horizon-First Vanishing Point Detection Using Second-Order Grouping Laws | Simon, Gilles and Fond, Antoine and Berger, Marie-Odile | N/A | |
| Learning to Zoom: a Saliency-Based Sampling Layer for Neural Networks | Recasens, Adria and Kellnhofer, Petr and Stent, Simon and Matusik, Wojciech and Torralba, Antonio | N/A | |
| Improving Deep Visual Representation for Person Re-identification by Global and Local Image-language Association | Chen, Dapeng and Li, Hongsheng and Liu, Xihui and Shen, Yantao and Shao, Jing and Yuan, Zejian and Wang, Xiaogang | N/A | |
| Less is More: Picking Informative Frames for Video Captioning | Chen, Yangyu and Wang, Shuhui and Zhang, Weigang and Huang, Qingming | N/A | |
| BodyNet: Volumetric Inference of 3D Human Body Shapes | Varol, Gul and Ceylan, Duygu and Russell, Bryan and Yang, Jimei and Yumer, Ersin and Laptev, Ivan and Schmid, Cordelia | N/A | |
| Towards Human-Level License Plate Recognition | Zhuang, Jiafan and Hou, Saihui and Wang, Zilei and Zha, Zheng-Jun | N/A | |
| A Dataset for Lane Instance Segmentation in Urban Environments | Roberts, Brook and Kaltwang, Sebastian and Samangooei, Sina and Pender-Bare, Mark and Tertikas, Konstantinos and Redford, John | N/A | |
| DeepIM: Deep Iterative Matching for 6D Pose Estimation | Li, Yi and Wang, Gu and Ji, Xiangyang and Xiang, Yu and Fox, Dieter | N/A | |
| Riemannian Walk for Incremental Learning: Understanding Forgetting and Intransigence | Chaudhry, Arslan and Dokania, Puneet K. and Ajanthan, Thalaiyasingam and Torr, Philip H. S. | N/A | |
| Meta-Tracker: Fast and Robust Online Adaptation for Visual Object Trackers | Park, Eunbyung and Berg, Alexander C. | N/A | |
| ESPNet: Efficient Spatial Pyramid of Dilated Convolutions for Semantic Segmentation | Mehta, Sachin and Rastegari, Mohammad and Caspi, Anat and Shapiro, Linda and Hajishirzi, Hannaneh | N/A | |
| Wasserstein Divergence for GANs | Wu, Jiqing and Huang, Zhiwu and Thoma, Janine and Acharya, Dinesh and Van Gool, Luc | N/A | |
| Evaluating Capability of Deep Neural Networks for Image Classification via Information Plane | Cheng, Hao and Lian, Dongze and Gao, Shenghua and Geng, Yanlin | N/A | |
| C-WSL: Count-guided Weakly Supervised Localization | Gao, Mingfei and Li, Ang and Yu, Ruichi and Morariu, Vlad I. and Davis, Larry S. | N/A | |
| Goal-Oriented Visual Question Generation via Intermediate Rewards | Zhang, Junjie and Wu, Qi and Shen, Chunhua and Zhang, Jian and Lu, Jianfeng and van den Hengel, Anton | N/A | |
| ICNet for Real-Time Semantic Segmentation on High-Resolution Images | Zhao, Hengshuang and Qi, Xiaojuan and Shen, Xiaoyong and Shi, Jianping and Jia, Jiaya | N/A | |
| Multi-Fiber Networks for Video Recognition | Chen, Yunpeng and Kalantidis, Yannis and Li, Jianshu and Yan, Shuicheng and Feng, Jiashi | N/A | |
| TS2C: Tight Box Mining with Surrounding Segmentation Context for Weakly Supervised Object Detection | Wei, Yunchao and Shen, Zhiqiang and Cheng, Bowen and Shi, Honghui and Xiong, Jinjun and Feng, Jiashi and Huang, Thomas | N/A | |
| PARN: Pyramidal Affine Regression Networks for Dense Semantic Correspondence | Jeon, Sangryul and Kim, Seungryong and Min, Dongbo and Sohn, Kwanghoon | N/A | |
| Super-Identity Convolutional Neural Network for Face Hallucination | Zhang, Kaipeng and Zhang, Zhanpeng and Cheng, Chia-Wen and Hsu, Winston H. and Qiao, Yu and Liu, Wei and Zhang, Tong | N/A | |
| Look Deeper into Depth: Monocular Depth Estimation with Semantic Booster and Attention-Driven Loss | Jiao, Jianbo and Cao, Ying and Song, Yibing and Lau, Rynson | N/A | |
| Rethinking Spatiotemporal Feature Learning: Speed-Accuracy Trade-offs in Video Classification | Xie, Saining and Sun, Chen and Huang, Jonathan and Tu, Zhuowen and Murphy, Kevin | N/A | |
| Domain Adaptation through Synthesis for Unsupervised Person Re-identification | Bak, Slawomir and Carr, Peter and Lalonde, Jean-Francois | N/A | |
| Learning to Predict Crisp Boundaries | Deng, Ruoxi and Shen, Chunhua and Liu, Shengjun and Wang, Huibing and Liu, Xinru | N/A | |
| Learning Efficient Single-stage Pedestrian Detectors by Asymptotic Localization Fitting | Liu, Wei and Liao, Shengcai and Hu, Weidong and Liang, Xuezhi and Chen, Xiao | N/A | |
| Attention-based Ensemble for Deep Metric Learning | Kim, Wonsik and Goyal, Bhavya and Chawla, Kunal and Lee, Jungmin and Kwon, Keunjoo | N/A | |
| 3DFeat-Net: Weakly Supervised Local 3D Features for Point Cloud Registration | Jian Yew, Zi and Hee Lee, Gim | N/A | |
| DCAN: Dual Channel-wise Alignment Networks for Unsupervised Scene Adaptation | Wu, Zuxuan and Han, Xintong and Lin, Yen-Liang and Gokhan Uzunbas, Mustafa and Goldstein, Tom and Nam Lim, Ser and Davis, Larry S. | N/A | |
| NNEval: Neural Network based Evaluation Metric for Image Captioning | Sharif, Naeha and White, Lyndon and Bennamoun, Mohammed and Afaq Ali Shah, Syed | N/A | |
| Learning to Segment via Cut-and-Paste | Remez, Tal and Huang, Jonathan and Brown, Matthew | N/A | |
| Real-Time Hair Rendering using Sequential Adversarial Networks | Wei, Lingyu and Hu, Liwen and Kim, Vladimir and Yumer, Ersin and Li, Hao | N/A | |
| Learning Human-Object Interactions by Graph Parsing Neural Networks | Qi, Siyuan and Wang, Wenguan and Jia, Baoxiong and Shen, Jianbing and Zhu, Song-Chun | N/A | |
| Occlusion-aware R-CNN: Detecting Pedestrians in a Crowd | Zhang, Shifeng and Wen, Longyin and Bian, Xiao and Lei, Zhen and Li, Stan Z. | N/A | |
| Linear RGB-D SLAM for Planar Environments | Kim, Pyojin and Coltin, Brian and Jin Kim, H. | N/A | |
| NAM: Non-Adversarial Unsupervised Domain Mapping | Hoshen, Yedid and Wolf, Lior | N/A | |
| CrossNet: An End-to-end Reference-based Super Resolution Network using Cross-scale Warping | Zheng, Haitian and Ji, Mengqi and Wang, Haoqian and Liu, Yebin and Fang, Lu | N/A | |
| Video Object Segmentation with Joint Re-identification and Attention-Aware Mask Propagation | Li, Xiaoxiao and Change Loy, Chen | N/A | |
| Layer-structured 3D Scene Inference via View Synthesis | Tulsiani, Shubham and Tucker, Richard and Snavely, Noah | N/A | |
| Facial Expression Recognition with Inconsistently Annotated Datasets | Zeng, Jiabei and Shan, Shiguang and Chen, Xilin | N/A | |
| Exploiting Vector Fields for Geometric Rectification of Distorted Document Images | Meng, Gaofeng and Su, Yuanqi and Wu, Ying and Xiang, Shiming and Pan, Chunhong | N/A | |
| A+D Net: Training a Shadow Detector with Adversarial Shadow Attenuation | Le, Hieu and Yago Vicente, Tomas F. and Nguyen, Vu and Hoai, Minh and Samaras, Dimitris | N/A | |
| Lip Movements Generation at a Glance | Chen, Lele and Li, Zhiheng and K Maddox, Ross and Duan, Zhiyao and Xu, Chenliang | N/A | |
| Domain transfer through deep activation matching | Huang, Haoshuo and Huang, Qixing and Krahenbuhl, Philipp | N/A | |
| Geolocation Estimation of Photos using a Hierarchical Model and Scene Classification | Muller-Budack, Eric and Pustu-Iren, Kader and Ewerth, Ralph | N/A | |
| Temporal Relational Reasoning in Videos | Zhou, Bolei and Andonian, Alex and Oliva, Aude and Torralba, Antonio | N/A | |
| Leveraging Motion Priors in Videos for Improving Human Segmentation | Chen, Yu-Ting and Chang, Wen-Yen and Lu, Hai-Lun and Wu, Tingfan and Sun, Min | N/A | |
| Sequential Clique Optimization for Video Object Segmentation | Jun Koh, Yeong and Lee, Young-Yoon and Kim, Chang-Su | N/A | |
| 3D Scene Flow from 4D Light Field Gradients | Ma, Sizhuo and Smith, Brandon M. and Gupta, Mohit | N/A | |
| Accelerating Dynamic Programs via Nested Benders Decomposition with Application to Multi-Person Pose Estimation | Wang, Shaofei and Ihler, Alexander and Kording, Konrad and Yarkony, Julian | N/A | |
| Multi-scale Residual Network for Image Super-Resolution | Li, Juncheng and Fang, Faming and Mei, Kangfu and Zhang, Guixu | N/A | |
| Efficient Global Point Cloud Registration by Matching Rotation Invariant Features Through Translation Search | Liu, Yinlong and Wang, Chen and Song, Zhijian and Wang, Manning | N/A | |
| Stroke Controllable Fast Style Transfer with Adaptive Receptive Fields | Jing, Yongcheng and Liu, Yang and Yang, Yezhou and Feng, Zunlei and Yu, Yizhou and Tao, Dacheng and Song, Mingli | N/A | |
| A Modulation Module for Multi-task Learning with Applications in Image Retrieval | Zhao, Xiangyun and Li, Haoxiang and Shen, Xiaohui and Liang, Xiaodan and Wu, Ying | N/A | |
| Deforming Autoencoders: Unsupervised Disentangling of Shape and Appearance | Shu, Zhixin and Sahasrabudhe, Mihir and Alp Guler, Riza and Samaras, Dimitris and Paragios, Nikos and Kokkinos, Iasonas | N/A | |
| ShapeCodes: Self-Supervised Feature Learning by Lifting Views to Viewgrids | Jayaraman, Dinesh and Gao, Ruohan and Grauman, Kristen | N/A | |
| Triplet Loss in Siamese Network for Object Tracking | Dong, Xingping and Shen, Jianbing | N/A | |
| Person Re-identification with Deep Similarity-Guided Graph Neural Network | Shen, Yantao and Li, Hongsheng and Yi, Shuai and Chen, Dapeng and Wang, Xiaogang | N/A | |
| VSO: Visual Semantic Odometry | Lianos, Konstantinos-Nektarios and Schonberger, Johannes L. and Pollefeys, Marc and Sattler, Torsten | N/A | |
| Volumetric performance capture from minimal camera viewpoints | Gilbert, Andrew and Volino, Marco and Collomosse, John and Hilton, Adrian | N/A | |
| Videos as Space-Time Region Graphs | Wang, Xiaolong and Gupta, Abhinav | N/A | |
| Faces as Lighting Probes via Unsupervised Deep Highlight Extraction | Yi, Renjiao and Zhu, Chenyang and Tan, Ping and Lin, Stephen | N/A | |
| Unsupervised holistic image generation from key local patches | Lee, Donghoon and Yun, Sangdoo and Choi, Sungjoon and Yoo, Hwiyeon and Yang, Ming-Hsuan and Oh, Songhwai | N/A | |
| Visual Text Correction | Mazaheri, Amir and Shah, Mubarak | N/A | |
| ELEGANT: Exchanging Latent Encodings with GAN for Transferring Multiple Face Attributes | Xiao, Taihong and Hong, Jiapeng and Ma, Jinwen | N/A | |
| Coloring with Words: Guiding Image Colorization Through Text-based Palette Generation | Bahng, Hyojin and Yoo, Seungjoo and Cho, Wonwoong and Keetae Park, David and Wu, Ziming and Ma, Xiaojuan and Choo, Jaegul | N/A | |
| Teaching Machines to Understand Baseball Games: Large-Scale Baseball Video Database for Multiple Video Understanding Tasks | Shim, Minho and Hwi Kim, Young and Kim, Kyungmin and Joo Kim, Seon | N/A | |
| Into the Twilight Zone: Depth Estimation using Joint Structure-Stereo Optimization | Sharma, Aashish and Cheong, Loong-Fah | N/A | |
| Learning 3D Shapes as Multi-Layered Height-maps using 2D Convolutional Networks | Sarkar, Kripasindhu and Hampiholi, Basavaraj and Varanasi, Kiran and Stricker, Didier | N/A | |
| Coreset-Based Neural Network Compression | Dubey, Abhimanyu and Chatterjee, Moitreya and Ahuja, Narendra | N/A | |
| Variational Wasserstein Clustering | Mi, Liang and Zhang, Wen and Gu, Xianfeng and Wang, Yalin | N/A | |
| Joint Person Segmentation and Identification in Synchronized First- and Third-person Videos | Xu, Mingze and Fan, Chenyou and Wang, Yuchen and Ryoo, Michael S. and Crandall, David J. | N/A | |
| Zero-shot keyword spotting for visual speech recognition in-the-wild | Stafylakis, Themos and Tzimiropoulos, Georgios | N/A | |
| ContextVP: Fully Context-Aware Video Prediction | Byeon, Wonmin and Wang, Qin and Kumar Srivastava, Rupesh and Koumoutsakos, Petros | N/A | |
| Open Set Domain Adaptation by Backpropagation | Saito, Kuniaki and Yamamoto, Shohei and Ushiku, Yoshitaka and Harada, Tatsuya | N/A | |
| Learn-to-Score: Efficient 3D Scene Exploration by Predicting View Utility | Hepp, Benjamin and Dey, Debadeepta and Sinha, Sudipta N. and Kapoor, Ashish and Joshi, Neel and Hilliges, Otmar | N/A | |
| Accurate Scene Text Detection through Border Semantics Awareness and Bootstrapping | Xue, Chuhui and Lu, Shijian and Zhan, Fangneng | N/A | |
| Deep Image Demosaicking using a Cascade of Convolutional Residual Denoising Networks | Kokkinos, Filippos and Lefkimmiatis, Stamatios | N/A | |
| Good Line Cutting: towards Accurate Pose Tracking of Line-assisted VO/VSLAM | Zhao, Yipu and Vela, Patricio A. | N/A | |
| Constraint-Aware Deep Neural Network Compression | Chen, Changan and Tung, Frederick and Vedula, Naveen and Mori, Greg | N/A | |
| Boosted Attention: Leveraging Human Attention for Image Captioning | Chen, Shi and Zhao, Qi | N/A | |
| Understanding Perceptual and Conceptual Fluency at a Large Scale | Hu, Shengli and Borji, Ali | N/A | |
| MaskConnect: Connectivity Learning by Gradient Descent | Ahmed, Karim and Torresani, Lorenzo | N/A | |
| Exploring Visual Relationship for Image Captioning | Yao, Ting and Pan, Yingwei and Li, Yehao and Mei, Tao | N/A | |
| Diagnosing Error in Temporal Action Detectors | Alwassel, Humam and Caba Heilbron, Fabian and Escorcia, Victor and Ghanem, Bernard | N/A | |
| Efficient Semantic Scene Completion Network with Spatial Group Convolution | Zhang, Jiahui and Zhao, Hao and Yao, Anbang and Chen, Yurong and Zhang, Li and Liao, Hongen | N/A | |
| Task-driven Webpage Saliency | Zheng, Quanlong and Jiao, Jianbo and Cao, Ying and Lau, Rynson W.H. | N/A | |
| Multi-Scale Context Intertwining for Semantic Segmentation | Lin, Di and Ji, Yuanfeng and Lischinski, Dani and Cohen-Or, Daniel and Huang, Hui | N/A | |
| Multiple-gaze geometry: Inferring novel 3D locations from gazes observed in monocular video | Brau, Ernesto and Guan, Jinyan and Jeffries, Tanya and Barnard, Kobus | N/A | |
| HybridFusion: Real-Time Performance Capture Using a Single Depth Sensor and Sparse IMUs | Zheng, Zerong and Yu, Tao and Li, Hao and Guo, Kaiwen and Dai, Qionghai and Fang, Lu and Liu, Yebin | N/A | |
| Macro-Micro Adversarial Network for Human Parsing | Luo, Yawei and Zheng, Zhedong and Zheng, Liang and Guan, Tao and Yu, Junqing and Yang, Yi | N/A | |
| Pivot Correlational Neural Network for Multimodal Video Categorization | Kang, Sunghun and Kim, Junyeong and Choi, Hyunsoo and Kim, Sungjin and Yoo, Chang D. | N/A | |
| Semantically Aware Urban 3D Reconstruction with Plane-Based Regularization | Holzmann, Thomas and Maurer, Michael and Fraundorfer, Friedrich and Bischof, Horst | N/A | |
| AugGAN: Cross Domain Adaptation with GAN-based Data Augmentation | Huang, Sheng-Wei and Lin, Che-Tsung and Chen, Shu-Ping and Wu, Yen-Yi and Hsu, Po-Hao and Lai, Shang-Hong | N/A | |
| Unsupervised Domain Adaptation for Semantic Segmentation via Class-Balanced Self-Training | Zou, Yang and Yu, Zhiding and Vijaya Kumar, B.V.K. and Wang, Jinsong | N/A | |
| Fictitious GAN: Training GANs with Historical Models | Ge, Hao and Xia, Yin and Chen, Xu and Berry, Randall and Wu, Ying | N/A | |
| Perturbation Robust Representations of Topological Persistence Diagrams | Som, Anirudh and Thopalli, Kowshik and Natesan Ramamurthy, Karthikeyan and Venkataraman, Vinay and Shukla, Ankita and Turaga, Pavan | N/A | |
| DPP-Net: Device-aware Progressive Search for Pareto-optimal Neural Architectures | Dong, Jin-Dong and Cheng, An-Chieh and Juan, Da-Cheng and Wei, Wei and Sun, Min | N/A | |
| SketchyScene: Richly-Annotated Scene Sketches | Zou, Changqing and Yu, Qian and Du, Ruofei and Mo, Haoran and Song, Yi-Zhe and Xiang, Tao and Gao, Chengying and Chen, Baoquan and Zhang, Hao | N/A | |
| Contour Knowledge Transfer for Salient Object Detection | Li, Xin and Yang, Fan and Cheng, Hong and Liu, Wei and Shen, Dinggang | N/A | |
| Scenes-Objects-Actions: A Multi-Task, Multi-Label Video Dataset | Ray, Jamie and Wang, Heng and Tran, Du and Wang, Yufei and Feiszli, Matt and Torresani, Lorenzo and Paluri, Manohar | N/A | |
| Saliency Detection in 360° Videos | Zhang, Ziheng and Xu, Yanyu and Yu, Jingyi and Gao, Shenghua | N/A | |
| DetNet: Design Backbone for Object Detection | Li, Zeming and Peng, Chao and Yu, Gang and Zhang, Xiangyu and Deng, Yangdong and Sun, Jian | N/A | |
| Facial Dynamics Interpreter Network: What are the Important Relations between Local Dynamics for Facial Trait Estimation? | Tae Kim, Seong and Man Ro, Yong | N/A | |
| Video Object Segmentation by Learning Location-Sensitive Embeddings | Ci, Hai and Wang, Chunyu and Wang, Yizhou | N/A | |
| Transferable Adversarial Perturbations | Zhou, Wen and Hou, Xin and Chen, Yongjun and Tang, Mengyun and Huang, Xiangqi and Gan, Xiang and Yang, Yong | N/A | |
| A Segmentation-aware Deep Fusion Network for Compressed Sensing MRI | Fan, Zhiwen and Sun, Liyan and Ding, Xinghao and Huang, Yue and Cai, Congbo and Paisley, John | N/A | |
| GANimation: Anatomically-aware Facial Animation from a Single Image | Pumarola, Albert and Agudo, Antonio and Martinez, Aleix M. and Sanfeliu, Alberto and Moreno-Noguer, Francesc | N/A | |
| Graph R-CNN for Scene Graph Generation | Yang, Jianwei and Lu, Jiasen and Lee, Stefan and Batra, Dhruv and Parikh, Devi | N/A | |
| Interpretable Basis Decomposition for Visual Explanation | Zhou, Bolei and Sun, Yiyou and Bau, David and Torralba, Antonio | N/A | |
| Reinforced Temporal Attention and Split-Rate Transfer for Depth-Based Person Re-Identification | Karianakis, Nikolaos and Liu, Zicheng and Chen, Yinpeng and Soatto, Stefano | N/A | |
| ArticulatedFusion: Real-time Reconstruction of Motion, Geometry and Segmentation Using a Single Depth Camera | Li, Chao and Zhao, Zheheng and Guo, Xiaohu | N/A | |
| Deep Metric Learning with Hierarchical Triplet Loss | Ge, Weifeng | N/A | |
| Deep Directional Statistics: Pose Estimation with Uncertainty Quantification | Prokudin, Sergey and Gehler, Peter and Nowozin, Sebastian | N/A | |
| Semantic Match Consistency for Long-Term Visual Localization | Toft, Carl and Stenborg, Erik and Hammarstrand, Lars and Brynte, Lucas and Pollefeys, Marc and Sattler, Torsten and Kahl, Fredrik | N/A | |
| Decouple Learning for Parameterized Image Operators | Fan, Qingnan and Chen, Dongdong and Yuan, Lu and Hua, Gang and Yu, Nenghai and Chen, Baoquan | N/A | |
| Structural Consistency and Controllability for Diverse Colorization | Messaoud, Safa and Forsyth, David and Schwing, Alexander G. | N/A | |
| Deep Component Analysis via Alternating Direction Neural Networks | Murdock, Calvin and Chang, MingFang and Lucey, Simon | N/A | |
| Maximum Margin Metric Learning Over Discriminative Nullspace for Person Re-identification | M Feroz Ali, T and Chaudhuri, Subhasis | N/A | |
| Pose-Normalized Image Generation for Person Re-identification | Qian, Xuelin and Fu, Yanwei and Xiang, Tao and Wang, Wenxuan and Qiu, Jie and Wu, Yang and Jiang, Yu-Gang and Xue, Xiangyang | N/A | |
| Cross-Modal Hamming Hashing | Cao, Yue and Liu, Bin and Long, Mingsheng and Wang, Jianmin | N/A | |
| Modeling Visual Context is Key to Augmenting Object Detection Datasets | Dvornik, Nikita and Mairal, Julien and Schmid, Cordelia | N/A | |
| ReenactGAN: Learning to Reenact Faces via Boundary Transfer | Wu, Wayne and Zhang, Yunxuan and Li, Cheng and Qian, Chen and Change Loy, Chen | N/A | |
| Universal Sketch Perceptual Grouping | Li, Ke and Pang, Kaiyue and Song, Jifei and Song, Yi-Zhe and Xiang, Tao and Hospedales, Timothy M. and Zhang, Honggang | N/A | |
| Compositional Learning for Human Object Interaction | Kato, Keizo and Li, Yin and Gupta, Abhinav | N/A | |
| Quaternion Equivariant Capsule Networks for 3D Point Clouds | Yongheng Zhao, Tolga Birdal, Jan Eric Lenssen, Emanuele Menegatti, Leonidas Guibas, Federico Tombari | N/A | |
| DeepFit: 3D Surface Fitting via Neural Network Weighted Least Squares | Yizhak Ben-Shabat, Stephen Gould | N/A | |
| NSGANetV2: Evolutionary Multi-Objective Surrogate-Assisted Neural Architecture Search | Zhichao Lu, Kalyanmoy Deb, Erik Goodman, Wolfgang Banzhaf, Vishnu Naresh Boddeti | N/A | |
| Describing Textures using Natural Language | Chenyun Wu, Mikayla Timm, Subhransu Maji | N/A | |
| Empowering Relational Network by Self-Attention Augmented Conditional Random Fields for Group Activity Recognition | Rizard Renanda Adhi Pramono, Yie Tarng Chen, Wen Hsien Fang | N/A | |
| AiR: Attention with Reasoning Capability | Shi Chen, Ming Jiang, Jinhui Yang, Qi Zhao | N/A | |
| Self6D: Self-Supervised Monocular 6D Object Pose Estimation | Gu Wang, Fabian Manhardt, Jianzhun Shao, Xiangyang Ji, Nassir Navab, Federico Tombari | N/A | |
| Invertible Image Rescaling | Mingqing Xiao, Shuxin Zheng, Chang Liu, Yaolong Wang, Di He, Guolin Ke, Jiang Bian, Zhouchen Lin, Tie-Yan Liu | N/A | |
| Synthesize then Compare: Detecting Failures and Anomalies for Semantic Segmentation | Yingda Xia, Yi Zhang, Fengze Liu, Wei Shen, Alan L. Yuille | N/A | |
| House-GAN: Relational Generative Adversarial Networks for Graph-constrained House Layout Generation | Nelson Nauata, Kai-Hung Chang, Chin-Yi Cheng, Greg Mori, Yasutaka Furukawa | N/A | |
| Crowdsampling the Plenoptic Function | Zhengqi Li, Wenqi Xian, Abe Davis, Noah Snavely | N/A | |
| VoxelPose: Towards Multi-Camera 3D Human Pose Estimation in Wild Environment | Hanyue Tu, Chunyu Wang, Wenjun Zeng | N/A | |
| End-to-End Object Detection with Transformers | Nicolas Carion, Francisco Massa, Gabriel Synnaeve, Nicolas Usunier, Alexander Kirillov, Sergey Zagoruyko | N/A | |
| DeepSFM: Structure From Motion Via Deep Bundle Adjustment | Xingkui Wei, Yinda Zhang, Zhuwen Li, Yanwei Fu, Xiangyang Xue | N/A | |
| Ladybird: Quasi-Monte Carlo Sampling for Deep Implicit Field Based 3D Reconstruction with Symmetry | Yifan Xu, Tianqi Fan, Yi Yuan, Gurprit Singh | N/A | |
| Segment as Points for Efficient Online Multi-Object Tracking and Segmentation | Zhenbo Xu, Wei Zhang, Xiao Tan, Wei Yang, Huan Huang, Shilei Wen, Errui Ding, Liusheng Huang | N/A | |
| Conditional Convolutions for Instance Segmentation | Zhi Tian, Chunhua Shen, Hao Chen | N/A | |
| MutualNet: Adaptive ConvNet via Mutual Learning from Network Width and Resolution | Taojiannan Yang, Sijie Zhu, Chen Chen, Shen Yan, Mi Zhang, Andrew Willis | N/A | |
| Fashionpedia: Ontology, Segmentation, and an Attribute Localization Dataset | Menglin Jia, Mengyun Shi, Mikhail Sirotenko, Yin Cui, Claire Cardie, Bharath Hariharan, Hartwig Adam, Serge Belongie | N/A | |
| Privacy Preserving Structure-from-Motion | Marcel Geppert, Viktor Larsson, Pablo Speciale, Johannes L. Schönberger, Marc Pollefeys | N/A | |
| Rewriting a Deep Generative Model | David Bau, Steven Liu, Tongzhou Wang, Jun-Yan Zhu, Antonio Torralba | N/A | |
| Compare and Reweight: Distinctive Image Captioning Using Similar Images Sets | Jiuniu Wang, Wenjia Xu, Qingzhong Wang, Antoni B. Chan | N/A | |
| Long-term Human Motion Prediction with Scene Context | Zhe Cao, Hang Gao, Karttikeya Mangalam, Qi-Zhi Cai, Minh Vo, Jitendra Malik | N/A | |
| NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis | Ben Mildenhall, Pratul P. Srinivasan, Matthew Tancik, Jonathan T. Barron, Ravi Ramamoorthi, Ren Ng | N/A | |
| ReferIt3D: Neural Listeners for Fine-Grained 3D Object Identification in Real-World Scenes | Panos Achlioptas, Ahmed Abdelreheem, Fei Xia, Mohamed Elhoseiny, Leonidas Guibas | N/A | |
| MatryODShka: Real-time 6DoF Video View Synthesis using Multi-Sphere Images | Benjamin Attal, Selena Ling, Aaron Gokaslan, Christian Richardt, James Tompkin | N/A | |
| Learning and Aggregating Deep Local Descriptors for Instance-level Recognition | Giorgos Tolias, Tomas Jenicek, Ondřej Chum | N/A | |
| A Consistently Fast and Globally Optimal Solution to the Perspective-n-Point Problem | George Terzakis, Manolis Lourakis | N/A | |
| Learn to Recover Visible Color for Video Surveillance in a Day | Guangming Wu, Yinqiang Zheng, Zhiling Guo, Zekun Cai, Xiaodan Shi, Xin Ding, Yifei Huang, Yimin Guo, Ryosuke Shibasaki | N/A | |
| Deep Fashion3D: A Dataset and Benchmark for 3D Garment Reconstruction from Single Images | Heming Zhu, Yu Cao, Hang Jin, Weikai Chen, Dong Du, Zhangye Wang, Shuguang Cui, Xiaoguang Han | N/A | |
| Spatially Adaptive Inference with Stochastic Feature Sampling and Interpolation | Zhenda Xie, Zheng Zhang, Xizhou Zhu, Gao Huang, Stephen Lin | N/A | |
| BorderDet: Border Feature for Dense Object Detection | Han Qiu, Yuchen Ma, Zeming Li, Songtao Liu, Jian Sun | N/A | |
| Regularization with Latent Space Virtual Adversarial Training | Genki Osada, Budrul Ahsan, Revoti Prasad Bora, Takashi Nishide | N/A | |
| Du²Net: Learning Depth Estimation from Dual-Cameras and Dual-Pixels | Yinda Zhang, Neal Wadhwa, Sergio Orts-Escolano, Christian Häne, Sean Fanello, Rahul Garg | N/A | |
| Model-Agnostic Boundary-Adversarial Sampling for Test-Time Generalization in Few-Shot learning | Jaekyeom Kim, Hyoungseok Kim, Gunhee Kim | N/A | |
| Targeted Attack for Deep Hashing based Retrieval | Jiawang Bai, Bin Chen, Yiming Li, Dongxian Wu, Weiwei Guo, Shu-Tao Xia, En-Hui Yang | N/A | |
| Gradient Centralization: A New Optimization Technique for Deep Neural Networks | Hongwei Yong, Jianqiang Huang, Xiansheng Hua, Lei Zhang | N/A | |
| Content-Aware Unsupervised Deep Homography Estimation | Jirong Zhang, Chuan Wang, Shuaicheng Liu, Lanpeng Jia, Nianjin Ye, Jue Wang, Ji Zhou, Jian Sun | N/A | |
| Multi-View Optimization of Local Feature Geometry | Mihai Dusmanu, Johannes L. Schönberger, Marc Pollefeys | N/A | |
| The Phong Surface: Efficient 3D Model Fitting using Lifted Optimization | Jingjing Shen, Thomas J. Cashman, Qi Ye, Tim Hutton, Toby Sharp, Federica Bogo, Andrew Fitzgibbon, Jamie Shotton | N/A | |
| Forecasting Human-Object Interaction: Joint Prediction of Motor Attention and Actions in First Person Video | Miao Liu, Siyu Tang, Yin Li, James M. Rehg | N/A | |
| Learning Stereo from Single Images | Jamie Watson, Oisin Mac Aodha, Daniyar Turmukhambetov, Gabriel J. Brostow, Michael Firman | N/A | |
| Prototype Rectification for Few-Shot Learning | Jinlu Liu, Liang Song, Yongqiang Qin | N/A | |
| Learning Feature Descriptors using Camera Pose Supervision | Qianqian Wang, Xiaowei Zhou, Bharath Hariharan, Noah Snavely | N/A | |
| Semantic Flow for Fast and Accurate Scene Parsing | Xiangtai Li, Ansheng You, Zhen Zhu, Houlong Zhao, Maoke Yang, Kuiyuan Yang, Shaohua Tan, Yunhai Tong | N/A | |
| Appearance Consensus Driven Self-Supervised Human Mesh Recovery | Jogendra Nath Kundu, Mugalodi Rakesh, Varun Jampani, Rahul Mysore Venkatesh, R. Venkatesh Babu | N/A | |
| Diffraction Line Imaging | Mark Sheinin, Dinesh N. Reddy, Matthew O’Toole, Srinivasa G. Narasimhan | N/A | |
| Aligning and Projecting Images to Class-conditional Generative Networks | Minyoung Huh, Richard Zhang, Jun-Yan Zhu, Sylvain Paris, Aaron Hertzmann | N/A | |
| Suppress and Balance: A Simple Gated Network for Salient Object Detection | Xiaoqi Zhao, Youwei Pang, Lihe Zhang, Huchuan Lu, Lei Zhang | N/A | |
| Visual Memorability for Robotic Interestingness via Unsupervised Online Learning | Chen Wang, Wenshan Wang, Yuheng Qiu, Yafei Hu, Sebastian Scherer | N/A | |
| Post-Training Piecewise Linear Quantization for Deep Neural Networks | Jun Fang, Ali Shafiee, Hamzah Abdel-Aziz, David Thorsley, Georgios Georgiadis, Joseph H. Hassoun | N/A | |
| Joint Disentangling and Adaptation for Cross-Domain Person Re-Identification | Yang Zou, Xiaodong Yang, Zhiding Yu, B.V.K. Vijaya Kumar, Jan Kautz | N/A | |
| In-Home Daily-Life Captioning Using Radio Signals | Lijie Fan, Tianhong Li, Yuan Yuan, Dina Katabi | N/A | |
| Self-Challenging Improves Cross-Domain Generalization | Zeyi Huang, Haohan Wang, Eric P. Xing, Dong Huang | N/A | |
| A Competence-aware Curriculum for Visual Concepts Learning via Question Answering | Qing Li, Siyuan Huang, Yining Hong, Song-Chun Zhu | N/A | |
| Multitask Learning Strengthens Adversarial Robustness | Chengzhi Mao, Amogh Gupta, Vikram Nitin, Baishakhi Ray, Shuran Song, Junfeng Yang, Carl Vondrick | N/A | |
| S2DNAS: Transforming Static CNN Model for Dynamic Inference via Neural Architecture Search | Zhihang Yuan, Bingzhe Wu, Guangyu Sun, Zheng Liang, Shiwan Zhao, Weichen Bi | N/A | |
| Improving Deep Video Compression by Resolution-adaptive Flow Coding | Zhihao Hu, Zhenghao Chen, Dong Xu, Guo Lu, Wanli Ouyang, Shuhang Gu | N/A | |
| Motion Capture from Internet Videos | Junting Dong, Qing Shuai, Yuanqing Zhang, Xian Liu, Xiaowei Zhou, Hujun Bao | N/A | |
| Appearance-Preserving 3D Convolution for Video-based Person Re-identification | Xinqian Gu, Hong Chang, Bingpeng Ma, Hongkai Zhang, Xilin Chen | N/A | |
| Solving the Blind Perspective-n-Point Problem End-To-End With Robust Differentiable Geometric Optimization | Dylan Campbell, Liu Liu, Stephen Gould | N/A | |
| Exploiting Deep Generative Prior for Versatile Image Restoration and Manipulation | Xingang Pan, Xiaohang Zhan, Bo Dai, Dahua Lin, Chen Change Loy, Ping Luo | N/A | |
| Deep Spatial-angular Regularization for Compressive Light Field Reconstruction over Coded Apertures | Mantang Guo, Junhui Hou, Jing Jin, Jie Chen, Lap-Pui Chau | N/A | |
| Video-based Remote Physiological Measurement via Cross-verified Feature Disentangling | Xuesong Niu, Zitong Yu, Hu Han, Xiaobai Li, Shiguang Shan, Guoying Zhao | N/A | |
| Combining Implicit Function Learning and Parametric Models for 3D Human Reconstruction | Bharat Lal Bhatnagar, Cristian Sminchisescu, Christian Theobalt, Gerard Pons-Moll | N/A | |
| Orientation-aware Vehicle Re-identification with Semantics-guided Part Attention Network | Tsai-Shien Chen, Chih-Ting Liu, Chih-Wei Wu, Shao-Yi Chien | N/A | |
| Mining Cross-Image Semantics for Weakly Supervised Semantic Segmentation | Guolei Sun, Wenguan Wang, Jifeng Dai, Luc Van Gool | N/A | |
| CoReNet: Coherent 3D Scene Reconstruction from a Single RGB Image | Stefan Popov, Pablo Bauszat, Vittorio Ferrari | N/A | |
| Layer-wise Conditioning Analysis in Exploring the Learning Dynamics of DNNs | Lei Huang, Jie Qin, Li Liu, Fan Zhu, Ling Shao | N/A | |
| RAFT: Recurrent All-Pairs Field Transforms for Optical Flow | Zachary Teed, Jia Deng | N/A | |
| Domain-invariant Stereo Matching Networks | Feihu Zhang, Xiaojuan Qi, Ruigang Yang, Victor Prisacariu, Benjamin Wah, Philip Torr | N/A | |
| DeepHandMesh: A Weakly-supervised Deep Encoder-Decoder Framework for High-fidelity Hand Mesh Modeling | Gyeongsik Moon, Takaaki Shiratori, Kyoung Mu Lee | N/A | |
| Content Adaptive and Error Propagation Aware Deep Video Compression | Guo Lu, Chunlei Cai, Xiaoyun Zhang, Li Chen, Wanli Ouyang, Dong Xu, Zhiyong Gao | N/A | |
| Towards Streaming Perception | Mengtian Li, Yu-Xiong Wang, Deva Ramanan | N/A | |
| Towards Automated Testing and Robustification by Semantic Adversarial Data Generation | Rakshith Shetty, Mario Fritz, Bernt Schiele | N/A | |
| Adversarial Generative Grammars for Human Activity Prediction | AJ Piergiovanni, Anelia Angelova, Alexander Toshev, Michael S. Ryoo | N/A | |
| GDumb: A Simple Approach that Questions Our Progress in Continual Learning | Ameya Prabhu, Philip H. S. Torr, Puneet K. Dokania | N/A | |
| Learning Lane Graph Representations for Motion Forecasting | Ming Liang, Bin Yang, Rui Hu, Yun Chen, Renjie Liao, Song Feng, Raquel Urtasun | N/A | |
| What Matters in Unsupervised Optical Flow | Rico Jonschkowski, Austin Stone, Jonathan T. Barron, Ariel Gordon, Kurt Konolige, Anelia Angelova | N/A | |
| Synthesis and Completion of Facades from Satellite Imagery | Xiaowei Zhang, Christopher May, Daniel Aliaga | N/A | |
| Mapillary Planet-Scale Depth Dataset | Manuel López Antequera, Pau Gargallo, Markus Hofinger, Samuel Rota Bulò, Yubin Kuang, Peter Kontschieder | N/A | |
| V2VNet: Vehicle-to-Vehicle Communication for Joint Perception and Prediction | Tsun-Hsuan Wang, Sivabalan Manivasagam, Ming Liang, Bin Yang, Wenyuan Zeng, Raquel Urtasun | N/A | |
| Training Interpretable Convolutional Neural Networks by Differentiating Class-specific Filters | Haoyu Liang, Zhihao Ouyang, Yuyuan Zeng, Hang Su, Zihao He, Shu-Tao Xia, Jun Zhu, Bo Zhang | N/A | |
| EagleEye: Fast Sub-net Evaluation for Efficient Neural Network Pruning | Bailin Li, Bowen Wu, Jiang Su, Guangrun Wang | N/A | |
| Intrinsic Point Cloud Interpolation via Dual Latent Space Navigation | Marie-Julie Rakotosaona, Maks Ovsjanikov | N/A | |
| Cross-Domain Cascaded Deep Translation | Oren Katzir, Dani Lischinski, Daniel Cohen-Or | N/A | |
| “Look Ma, no landmarks!” – Unsupervised, Model-based Dense Face Alignment | Tatsuro Koizumi, William A. P. Smith | N/A | |
| Online Invariance Selection for Local Feature Descriptors | Rémi Pautrat, Viktor Larsson, Martin R. Oswald, Marc Pollefeys | N/A | |
| Rethinking Image Inpainting via a Mutual Encoder-Decoder with Feature Equalizations | Hongyu Liu, Bin Jiang, Yibing Song, Wei Huang, Chao Yang | N/A | |
| TextCaps: a Dataset for Image Captioning with Reading Comprehension | Oleksii Sidorov, Ronghang Hu, Marcus Rohrbach, Amanpreet Singh | N/A | |
| It is not the Journey but the Destination: Endpoint Conditioned Trajectory Prediction | Karttikeya Mangalam, Harshayu Girase, Shreyas Agarwal, Kuan-Hui Lee, Ehsan Adeli, Jitendra Malik, Adrien Gaidon | N/A | |
| Learning What to Learn for Video Object Segmentation | Goutam Bhat, Felix Järemo Lawin, Martin Danelljan, Andreas Robinson, Michael Felsberg, Luc Van Gool, Radu Timofte | N/A | |
| SIZER: A Dataset and Model for Parsing 3D Clothing and Learning Size Sensitive 3D Clothing | Garvita Tiwari, Bharat Lal Bhatnagar, Tony Tung, Gerard Pons-Moll | N/A | |
| LIMP: Learning Latent Shape Representations with Metric Preservation Priors | Luca Cosmo, Antonio Norelli, Oshri Halimi, Ron Kimmel, Emanuele Rodolà | N/A | |
| Unsupervised Sketch to Photo Synthesis | Runtao Liu, Qian Yu, Stella X. Yu | N/A | |
| A Simple Way to Make Neural Networks Robust Against Diverse Image Corruptions | Evgenia Rusak, Lukas Schott, Roland S. Zimmermann, Julian Bitterwolf, Oliver Bringmann, Matthias Bethge, Wieland Brendel | N/A | |
| SoftPoolNet: Shape Descriptor for Point Cloud Completion and Classification | Yida Wang, David Joseph Tan, Nassir Navab, Federico Tombari | N/A | |
| Hierarchical Face Aging through Disentangled Latent Characteristics | Peipei Li, Huaibo Huang, Yibo Hu, Xiang Wu, Ran He, Zhenan Sun | N/A | |
| Hybrid Models for Open Set Recognition | Hongjie Zhang, Ang Li, Jie Guo, Yanwen Guo | N/A | |
| TopoGAN: A Topology-Aware Generative Adversarial Network | Fan Wang, Huidong Liu, Dimitris Samaras, Chao Chen | N/A | |
| Learning to Localize Actions from Moments | Fuchen Long, Ting Yao, Zhaofan Qiu, Xinmei Tian, Jiebo Luo, Tao Mei | N/A | |
| ForkGAN: Seeing into the Rainy Night | Ziqiang Zheng, Yang Wu, Xinran Han, Jianbo Shi | N/A | |
| TCGM: An Information-Theoretic Framework for Semi-Supervised Multi-Modality Learning | Xinwei Sun, Yilun Xu, Peng Cao, Yuqing Kong, Lingjing Hu, Shanghang Zhang, Yizhou Wang | N/A | |
| ExchNet: A Unified Hashing Network for Large-Scale Fine-Grained Image Retrieval | Quan Cui, Qing-Yuan Jiang, Xiu-Shen Wei, Wu-Jun Li, Osamu Yoshie | N/A | |
| TSIT: A Simple and Versatile Framework for Image-to-Image Translation | Liming Jiang, Changxu Zhang, Mingyang Huang, Chunxiao Liu, Jianping Shi, Chen Change Loy | N/A | |
| ProxyBNN: Learning Binarized Neural Networks via Proxy Matrices | Xiangyu He, Zitao Mo, Ke Cheng, Weixiang Xu, Qinghao Hu, Peisong Wang, Qingshan Liu, Jian Cheng | N/A | |
| HMOR: Hierarchical Multi-Person Ordinal Relations for Monocular Multi-Person 3D Pose Estimation | Can Wang, Jiefeng Li, Wentao Liu, Chen Qian, Cewu Lu | N/A | |
| Mask2CAD: 3D Shape Prediction by Learning to Segment and Retrieve | Weicheng Kuo, Anelia Angelova, Tsung-Yi Lin, Angela Dai | N/A | |
| A Unified Framework of Surrogate Loss by Refactoring and Interpolation | Lanlan Liu, Mingzhe Wang, Jia Deng | N/A | |
| Deep Reflectance Volumes: Relightable Reconstructions from Multi-View Photometric Images | Sai Bi, Zexiang Xu, Kalyan Sunkavalli, Miloš Hašan, Yannick Hold-Geoffroy, David Kriegman, Ravi Ramamoorthi | N/A | |
| Memory-augmented Dense Predictive Coding for Video Representation Learning | Tengda Han, Weidi Xie, Andrew Zisserman | N/A | |
| PointMixup: Augmentation for Point Clouds | Yunlu Chen, Vincent Tao Hu, Efstratios Gavves, Thomas Mensink, Pascal Mettes, Pengwan Yang, Cees G. M. Snoek | N/A | |
| Identity-Guided Human Semantic Parsing for Person Re-Identification | Kuan Zhu, Haiyun Guo, Zhiwei Liu, Ming Tang, Jinqiao Wang | N/A | |
| Learning Gradient Fields for Shape Generation | Ruojin Cai, Guandao Yang, Hadar Averbuch-Elor, Zekun Hao, Serge Belongie, Noah Snavely, Bharath Hariharan | N/A | |
| COCO-FUNIT: Few-Shot Unsupervised Image Translation with a Content Conditioned Style Encoder | Kuniaki Saito, Kate Saenko, Ming-Yu Liu | N/A | |
| Corner Proposal Network for Anchor-free, Two-stage Object Detection | Kaiwen Duan, Lingxi Xie, Honggang Qi, Song Bai, Qingming Huang, Qi Tian | N/A | |
| PhraseClick: Toward Achieving Flexible Interactive Segmentation by Phrase and Click | Henghui Ding, Scott Cohen, Brian Price, Xudong Jiang | N/A | |
| Unified Multisensory Perception: Weakly-Supervised Audio-Visual Video Parsing | Yapeng Tian, Dingzeyu Li, Chenliang Xu | N/A | |
| Learning Delicate Local Representations for Multi-Person Pose Estimation | Yuanhao Cai, Zhicheng Wang, Zhengxiong Luo, Binyi Yin, Angang Du, Haoqian Wang, Xiangyu Zhang, Xinyu Zhou, Erjin Zhou, Jian Sun | N/A | |
| Learning to Plan with Uncertain Topological Maps | Edward Beeching, Jilles Dibangoye, Olivier Simonin, Christian Wolf | N/A | |
| Neural Design Network: Graphic Layout Generation with Constraints | Hsin-Ying Lee, Lu Jiang, Irfan Essa, Phuong B Le, Haifeng Gong, Ming-Hsuan Yang, Weilong Yang | N/A | |
| Learning Open Set Network with Discriminative Reciprocal Points | Guangyao Chen, Limeng Qiao, Yemin Shi, Peixi Peng, Jia Li, Tiejun Huang, Shiliang Pu, Yonghong Tian | N/A | |
| Convolutional Occupancy Networks | Songyou Peng, Michael Niemeyer, Lars Mescheder, Marc Pollefeys, Andreas Geiger | N/A | |
| Multi-person 3D Pose Estimation in Crowded Scenes Based on Multi-View Geometry | He Chen, Pengfei Guo, Pengfei Li, Gim Hee Lee, Gregory Chirikjian | N/A | |
| TIDE: A General Toolbox for Identifying Object Detection Errors | Daniel Bolya, Sean Foley, James Hays, Judy Hoffman | N/A | |
| PointContrast: Unsupervised Pre-training for 3D Point Cloud Understanding | Saining Xie, Jiatao Gu, Demi Guo, Charles R. Qi, Leonidas Guibas, Or Litany | N/A | |
| DSA: More Efficient Budgeted Pruning via Differentiable Sparsity Allocation | Xuefei Ning, Tianchen Zhao, Wenshuo Li, Peng Lei, Yu Wang, Huazhong Yang | N/A | |
| Circumventing Outliers of AutoAugment with Knowledge Distillation | Longhui Wei, An Xiao, Lingxi Xie, Xiaopeng Zhang, Xin Chen, Qi Tian | N/A | |
| S2DNet: Learning Image Features for Accurate Sparse-to-Dense Matching | Hugo Germain, Guillaume Bourmaud, Vincent Lepetit | N/A | |
| RTM3D: Real-time Monocular 3D Detection from Object Keypoints for Autonomous Driving | Peixuan Li, Huaici Zhao, Pengfei Liu, Feidao Cao | N/A | |
| Video Object Segmentation with Episodic Graph Memory Networks | Xiankai Lu, Wenguan Wang, Martin Danelljan, Tianfei Zhou, Jianbing Shen, Luc Van Gool | N/A | |
| Rethinking Bottleneck Structure for Efficient Mobile Network Design | Daquan Zhou, Qibin Hou, Yunpeng Chen, Jiashi Feng, Shuicheng Yan | N/A | |
| Side-Tuning: A Baseline for Network Adaptation via Additive Side Networks | Jeffrey O. Zhang, Alexander Sax, Amir Zamir, Leonidas Guibas, Jitendra Malik | N/A | |
| Towards Part-aware Monocular 3D Human Pose Estimation: An Architecture Search Approach | Zerui Chen, Yan Huang, Hongyuan Yu, Bin Xue, Ke Han, Yiru Guo, Liang Wang | N/A | |
| REVISE: A Tool for Measuring and Mitigating Bias in Visual Datasets | Angelina Wang, Arvind Narayanan, Olga Russakovsky | N/A | |
| Contrastive Learning for Weakly Supervised Phrase Grounding | Tanmay Gupta, Arash Vahdat, Gal Chechik, Xiaodong Yang, Jan Kautz, Derek Hoiem | N/A | |
| Collaborative Learning of Gesture Recognition and 3D Hand Pose Estimation with Multi-Order Feature Analysis | Siyuan Yang, Jun Liu, Shijian Lu, Meng Hwa Er, Alex C. Kot | N/A | |
| Making an Invisibility Cloak: Real World Adversarial Attacks on Object Detectors | Zuxuan Wu, Ser-Nam Lim, Larry S. Davis, Tom Goldstein | N/A | |
| TuiGAN: Learning Versatile Image-to-Image Translation with Two Unpaired Images | Jianxin Lin, Yingxue Pang, Yingce Xia, Zhibo Chen, Jiebo Luo | N/A | |
| Semi-Siamese Training for Shallow Face Learning | Hang Du, Hailin Shi, Yuchi Liu, Jun Wang, Zhen Lei, Dan Zeng, Tao Mei | N/A | |
| GAN Slimming: All-in-One GAN Compression by A Unified Optimization Framework | Haotao Wang, Shupeng Gui, Haichuan Yang, Ji Liu, Zhangyang Wang | N/A | |
| Human Interaction Learning on 3D Skeleton Point Clouds for Video Violence Recognition | Yukun Su, Guosheng Lin, Jinhui Zhu, Qingyao Wu | N/A | |
| Binarized Neural Network for Single Image Super Resolution | Jingwei Xin, Nannan Wang, Xinrui Jiang, Jie Li, Heng Huang, Xinbo Gao | N/A | |
| Axial-DeepLab: Stand-Alone Axial-Attention for Panoptic Segmentation | Huiyu Wang, Yukun Zhu, Bradley Green, Hartwig Adam, Alan Yuille, Liang-Chieh Chen | N/A | |
| Adaptive Computationally Efficient Network for Monocular 3D Hand Pose Estimation | Zhipeng Fan, Jun Liu, Yao Wang | N/A | |
| Chained-Tracker: Chaining Paired Attentive Regression Results for End-to-End Joint Multiple-Object Detection and Tracking | Jinlong Peng, Changan Wang, Fangbin Wan, Yang Wu, Yabiao Wang, Ying Tai, Chengjie Wang, Jilin Li, Feiyue Huang, Yanwei Fu | N/A | |
| Distribution-Balanced Loss for Multi-Label Classification in Long-Tailed Datasets | Tong Wu, Qingqiu Huang, Ziwei Liu, Yu Wang, Dahua Lin | N/A | |
| Hamiltonian Dynamics for Real-World Shape Interpolation | Marvin Eisenberger, Daniel Cremers | N/A | |
| Learning to Scale Multilingual Representations for Vision-Language Tasks | Andrea Burns, Donghyun Kim, Derry Wijaya, Kate Saenko, Bryan A. Plummer | N/A | |
| Multi-modal Transformer for Video Retrieval | Valentin Gabeur, Chen Sun, Karteek Alahari, Cordelia Schmid | N/A | |
| Feature Representation Matters: End-to-End Learning for Reference-based Image Super-resolution | Yanchun Xie, Jimin Xiao, Mingjie Sun, Chao Yao, Kaizhu Huang | N/A | |
| RobustFusion: Human Volumetric Capture with Data-driven Visual Cues using a RGBD Camera | Zhuo Su, Lan Xu, Zerong Zheng, Tao Yu, Yebin Liu, Lu Fang | N/A | |
| Surface Normal Estimation of Tilted Images via Spatial Rectifier | Tien Do, Khiem Vuong, Stergios I. Roumeliotis, Hyun Soo Park | N/A | |
| Multimodal Shape Completion via Conditional Generative Adversarial Networks | Rundi Wu, Xuelin Chen, Yixin Zhuang, Baoquan Chen | N/A | |
| Generative Sparse Detection Networks for 3D Single-shot Object Detection | JunYoung Gwak, Christopher Choy, Silvio Savarese | N/A | |
| Grounded Situation Recognition | Sarah Pratt, Mark Yatskar, Luca Weihs, Ali Farhadi, Aniruddha Kembhavi | N/A | |
| Learning Modality Interaction for Temporal Sentence Localization and Event Captioning in Videos | Shaoxiang Chen, Wenhao Jiang, Wei Liu, Yu-Gang Jiang | N/A | |
| Unpaired Learning of Deep Image Denoising | Xiaohe Wu, Ming Liu, Yue Cao, Dongwei Ren, Wangmeng Zuo | N/A | |
| Self-supervising Fine-grained Region Similarities for Large-scale Image Localization | Yixiao Ge, Haibo Wang, Feng Zhu, Rui Zhao, Hongsheng Li | N/A | |
| Rotationally-Temporally Consistent Novel View Synthesis of Human Performance Video | Youngjoong Kwon, Stefano Petrangeli, Dahun Kim, Haoliang Wang, Eunbyung Park, Viswanathan Swaminathan, Henry Fuchs | N/A | |
| Side-Aware Boundary Localization for More Precise Object Detection | Jiaqi Wang, Wenwei Zhang, Yuhang Cao, Kai Chen, Jiangmiao Pang, Tao Gong, Jianping Shi, Chen Change Loy, Dahua Lin | N/A | |
| SF-Net: Single-Frame Supervision for Temporal Action Localization | Fan Ma, Linchao Zhu, Yi Yang, Shengxin Zha, Gourab Kundu, Matt Feiszli, Zheng Shou | N/A | |
| Negative Margin Matters: Understanding Margin in Few-shot Classification | Bin Liu, Yue Cao, Yutong Lin, Qi Li, Zheng Zhang, Mingsheng Long, Han Hu | N/A | |
| Particularity beyond Commonality: Unpaired Identity Transfer with Multiple References | Ruizheng Wu, Xin Tao, Yingcong Chen, Xiaoyong Shen, Jiaya Jia | N/A | |
| Tracking Objects as Points | Xingyi Zhou, Vladlen Koltun, Philipp Krähenbühl | N/A | |
| CPGAN: Content-Parsing Generative Adversarial Networks for Text-to-Image Synthesis | Jiadong Liang, Wenjie Pei, Feng Lu | N/A | |
| Transporting Labels via Hierarchical Optimal Transport for Semi-Supervised Learning | Fariborz Taherkhani, Ali Dabouei, Sobhan Soleymani, Jeremy Dawson, Nasser M. Nasrabadi | N/A | |
| MTI-Net: Multi-Scale Task Interaction Networks for Multi-Task Learning | Simon Vandenhende, Stamatios Georgoulis, Luc Van Gool | N/A | |
| Learning to Factorize and Relight a City | Andrew Liu, Shiry Ginosar, Tinghui Zhou, Alexei A. Efros, Noah Snavely | N/A | |
| Region Graph Embedding Network for Zero-Shot Learning | Guo-Sen Xie, Li Liu, Fan Zhu, Fang Zhao, Zheng Zhang, Yazhou Yao, Jie Qin, Ling Shao | N/A | |
| GRAB: A Dataset of Whole-Body Human Grasping of Objects | Omid Taheri, Nima Ghorbani, Michael J. Black, Dimitrios Tzionas | N/A | |
| DEMEA: Deep Mesh Autoencoders for Non-Rigidly Deforming Objects | Edgar Tretschk, Ayush Tewari, Michael Zollhöfer, Vladislav Golyanik, Christian Theobalt | N/A | |
| RANSAC-Flow: Generic Two-stage Image Alignment | Xi Shen, François Darmon, Alexei A. Efros, Mathieu Aubry | N/A | |
| Semantic Object Prediction and Spatial Sound Super-Resolution with Binaural Sounds | Arun Balajee Vasudevan, Dengxin Dai, Luc Van Gool | N/A | |
| Neural Object Learning for 6D Pose Estimation Using a Few Cluttered Images | Kiru Park, Timothy Patten, Markus Vincze | N/A | |
| Dense Hybrid Recurrent Multi-view Stereo Net with Dynamic Consistency Checking | Jianfeng Yan, Zizhuang Wei, Hongwei Yi, Mingyu Ding, Runze Zhang, Yisong Chen, Guoping Wang, Yu-Wing Tai | N/A | |
| Pixel-Pair Occlusion Relationship Map (P2ORM): Formulation, Inference & Application | Xuchong Qiu, Yang Xiao, Chaohui Wang, Renaud Marlet | N/A | |
| MovieNet: A Holistic Dataset for Movie Understanding | Qingqiu Huang, Yu Xiong, Anyi Rao, Jiaze Wang, Dahua Lin | N/A | |
| Short-Term and Long-Term Context Aggregation Network for Video Inpainting | Ang Li, Shanshan Zhao, Xingjun Ma, Mingming Gong, Jianzhong Qi, Rui Zhang, Dacheng Tao, Ramamohanarao Kotagiri | N/A | |
| DH3D: Deep Hierarchical 3D Descriptors for Robust Large-Scale 6DoF Relocalization | Juan Du, Rui Wang, Daniel Cremers | N/A | |
| Face Super-Resolution Guided by 3D Facial Priors | Xiaobin Hu, Wenqi Ren, John LaMaster, Xiaochun Cao, Xiaoming Li, Zechao Li, Bjoern Menze, Wei Liu | N/A | |
| Label Propagation with Augmented Anchors: A Simple Semi-Supervised Learning baseline for Unsupervised Domain Adaptation | Yabin Zhang, Bin Deng, Kui Jia, Lei Zhang | N/A | |
| Are Labels Necessary for Neural Architecture Search? | Chenxi Liu, Piotr Dollár, Kaiming He, Ross Girshick, Alan Yuille, Saining Xie | N/A | |
| BLSM: A Bone-Level Skinned Model of the Human Mesh | Haoyang Wang, Riza Alp Güler, Iasonas Kokkinos, George Papandreou, Stefanos Zafeiriou | N/A | |
| Associative Alignment for Few-shot Image Classification | Arman Afrasiyabi, Jean-François Lalonde, Christian Gagné | N/A | |
| Cyclic Functional Mapping: Self-supervised Correspondence between Non-isometric Deformable Shapes | Dvir Ginzburg, Dan Raviv | N/A | |
| View-Invariant Probabilistic Embedding for Human Pose | Jennifer J. Sun, Jiaping Zhao, Liang-Chieh Chen, Florian Schroff, Hartwig Adam, Ting Liu | N/A | |
| Contact and Human Dynamics from Monocular Video | Davis Rempe, Leonidas J. Guibas, Aaron Hertzmann, Bryan Russell, Ruben Villegas, Jimei Yang | N/A | |
| PointPWC-Net: Cost Volume on Point Clouds for (Self-)Supervised Scene Flow Estimation | Wenxuan Wu, Zhi Yuan Wang, Zhuwen Li, Wei Liu, Li Fuxin | N/A | |
| Points2Surf: Learning Implicit Surfaces from Point Clouds | Philipp Erler, Paul Guerrero, Stefan Ohrhallinger, Niloy J. Mitra, Michael Wimmer | N/A | |
| Few-Shot Scene-Adaptive Anomaly Detection | Yiwei Lu, Frank Yu, Mahesh Kumar Krishna Reddy, Yang Wang | N/A | |
| Personalized Face Modeling for Improved Face Reconstruction and Motion Retargeting | Bindita Chaudhuri, Noranart Vesdapunt, Linda Shapiro, Baoyuan Wang | N/A | |
| Entropy Minimisation Framework for Event-based Vision Model Estimation | Urbano Miguel Nunes, Yiannis Demiris | N/A | |
| Reconstructing NBA Players | Luyang Zhu, Konstantinos Rematas, Brian Curless, Steven M. Seitz, Ira Kemelmacher-Shlizerman | N/A | |
| PIoU Loss: Towards Accurate Oriented Object Detection in Complex Environments | Zhiming Chen, Kean Chen, Weiyao Lin, John See, Hui Yu, Yan Ke, Cong Yang | N/A | |
| TENet: Triple Excitation Network for Video Salient Object Detection | Sucheng Ren, Chu Han, Xin Yang, Guoqiang Han, Shengfeng He | N/A | |
| Deep Feedback Inverse Problem Solver | Wei-Chiu Ma, Shenlong Wang, Jiayuan Gu, Sivabalan Manivasagam, Antonio Torralba, Raquel Urtasun | N/A | |
| Learning From Multiple Experts: Self-paced Knowledge Distillation for Long-tailed Classification | Liuyu Xiang, Guiguang Ding, Jungong Han | N/A | |
| Hallucinating Visual Instances in Total Absentia | Jiayan Qiu, Yiding Yang, Xinchao Wang, Dacheng Tao | N/A | |
| Weakly-supervised 3D Shape Completion in the Wild | Jiayuan Gu, Wei-Chiu Ma, Sivabalan Manivasagam, Wenyuan Zeng, Zihao Wang, Yuwen Xiong, Hao Su, Raquel Urtasun | N/A | |
| DTVNet: Dynamic Time-lapse Video Generation via Single Still Image | Jiangning Zhang, Chao Xu, Liang Liu, Mengmeng Wang, Xia Wu, Yong Liu, Yunliang Jiang | N/A | |
| CLIFFNet for Monocular Depth Estimation with Hierarchical Embedding Loss | Lijun Wang, Jianming Zhang, Yifan Wang, Huchuan Lu, Xiang Ruan | N/A | |
| Collaborative Video Object Segmentation by Foreground-Background Integration | Zongxin Yang, Yunchao Wei, Yi Yang | N/A | |
| Adaptive Margin Diversity Regularizer for handling Data Imbalance in Zero-Shot SBIR | Titir Dutta, Anurag Singh, Soma Biswas | N/A | |
| ETH-XGaze: A Large Scale Dataset for Gaze Estimation under Extreme Head Pose and Gaze Variation | Xucong Zhang, Seonwook Park, Thabo Beeler, Derek Bradley, Siyu Tang, Otmar Hilliges | N/A | |
| Calibration-free Structure-from-Motion with Calibrated Radial Trifocal Tensors | Viktor Larsson, Nicolas Zobernig, Kasim Taskin, Marc Pollefeys | N/A | |
| Occupancy Anticipation for Efficient Exploration and Navigation | Santhosh K. Ramakrishnan, Ziad Al-Halah, Kristen Grauman | N/A | |
| Unified Image and Video Saliency Modeling | Richard Droste, Jianbo Jiao, J. Alison Noble | N/A | |
| TAO: A Large-Scale Benchmark for Tracking Any Object | Achal Dave, Tarasha Khurana, Pavel Tokmakov, Cordelia Schmid, Deva Ramanan | N/A | |
| A Generalization of Otsu’s Method and Minimum Error Thresholding | Jonathan T. Barron | N/A | |
| A Cordial Sync: Going Beyond Marginal Policies for Multi-Agent Embodied Tasks | Unnat Jain, Luca Weihs, Eric Kolve, Ali Farhadi, Svetlana Lazebnik, Aniruddha Kembhavi, Alexander Schwing | N/A | |
| Big Transfer (BiT): General Visual Representation Learning | Alexander Kolesnikov, Lucas Beyer, Xiaohua Zhai, Joan Puigcerver, Jessica Yung, Sylvain Gelly, Neil Houlsby | N/A | |
| VisualCOMET: Reasoning about the Dynamic Context of a Still Image | Jae Sung Park, Chandra Bhagavatula, Roozbeh Mottaghi, Ali Farhadi, Yejin Choi | N/A | |
| Few-shot Action Recognition with Permutation-invariant Attention | Hongguang Zhang, Li Zhang, Xiaojuan Qi, Hongdong Li, Philip H. S. Torr, Piotr Koniusz | N/A | |
| Character Grounding and Re-Identification in Story of Videos and Text Descriptions | Youngjae Yu, Jongseok Kim, Heeseung Yun, Jiwan Chung, Gunhee Kim | N/A | |
| AABO: Adaptive Anchor Box Optimization for Object Detection via Bayesian Sub-sampling | Wenshuo Ma, Tingzhong Tian, Hang Xu, Yimin Huang, Zhenguo Li | N/A | |
| Learning Visual Context by Comparison | Minchul Kim, Jongchan Park, Seil Na, Chang Min Park, Donggeun Yoo | N/A | |
| Large Scale Holistic Video Understanding | Ali Diba, Mohsen Fayyaz, Vivek Sharma, Manohar Paluri, Jürgen Gall, Rainer Stiefelhagen, Luc Van Gool | N/A | |
| Indirect Local Attacks for Context-aware Semantic Segmentation Networks | Krishna Kanth Nakka, Mathieu Salzmann | N/A | |
| Predicting Visual Overlap of Images Through Interpretable Non-Metric Box Embeddings | Anita Rau, Guillermo Garcia-Hernando, Danail Stoyanov, Gabriel J. Brostow, Daniyar Turmukhambetov | N/A | |
| Connecting Vision and Language with Localized Narratives | Jordi Pont-Tuset, Jasper Uijlings, Soravit Changpinyo, Radu Soricut, Vittorio Ferrari | N/A | |
| Adversarial T-shirt! Evading Person Detectors in A Physical World | Kaidi Xu, Gaoyuan Zhang, Sijia Liu, Quanfu Fan, Mengshu Sun, Hongge Chen, Pin-Yu Chen, Yanzhi Wang, Xue Lin | N/A | |
| Bounding-box Channels for Visual Relationship Detection | Sho Inayoshi, Keita Otani, Antonio Tejero-de-Pablos, Tatsuya Harada | N/A | |
| Minimal Rolling Shutter Absolute Pose with Unknown Focal Length and Radial Distortion | Zuzana Kukelova, Cenek Albl, Akihiro Sugimoto, Konrad Schindler, Tomas Pajdla | N/A | |
| SRFlow: Learning the Super-Resolution Space with Normalizing Flow | Andreas Lugmayr, Martin Danelljan, Luc Van Gool, Radu Timofte | N/A | |
| DeepGMR: Learning Latent Gaussian Mixture Models for Registration | Wentao Yuan, Benjamin Eckart, Kihwan Kim, Varun Jampani, Dieter Fox, Jan Kautz | N/A | |
| Active Perception using Light Curtains for Autonomous Driving | Siddharth Ancha, Yaadhav Raaj, Peiyun Hu, Srinivasa G. Narasimhan, David Held | N/A | |
| Invertible Neural BRDF for Object Inverse Rendering | Zhe Chen, Shohei Nobuhara, Ko Nishino | N/A | |
| Semi-supervised Semantic Segmentation via Strong-weak Dual-branch Network | Wenfeng Luo, Meng Yang | N/A | |
| Practical Deep Raw Image Denoising on Mobile Devices | Yuzhi Wang, Haibin Huang, Qin Xu, Jiaming Liu, Yiqun Liu, Jue Wang | N/A | |
| SoundSpaces: Audio-Visual Navigation in 3D Environments | Changan Chen, Unnat Jain, Carl Schissler, Sebastia Vicenc Amengual Gari, Ziad Al-Halah, Vamsi Krishna Ithapu, Philip Robinson, Kristen Grauman | N/A | |
| Two-Stream Consensus Network for Weakly-Supervised Temporal Action Localization | Yuanhao Zhai, Le Wang, Wei Tang, Qilin Zhang, Junsong Yuan, Gang Hua | N/A | |
| Erasing Appearance Preservation in Optimization-based Smoothing | Lvmin Zhang, Chengze Li, Yi Ji, Chunping Liu, Tien-tsin Wong | N/A | |
| Counterfactual Vision-and-Language Navigation via Adversarial Path Sampler | Tsu-Jui Fu, Xin Eric Wang, Matthew F. Peterson, Scott T. Grafton, Miguel P. Eckstein, William Yang Wang | N/A | |
| Guided Deep Decoder: Unsupervised Image Pair Fusion | Tatsumi Uezato, Danfeng Hong, Naoto Yokoya, Wei He | N/A | |
| Filter Style Transfer between Photos | Jonghwa Yim, Jisung Yoo, Won-joon Do, Beomsu Kim, Jihwan Choe | N/A | |
| JGR-P2O: Joint Graph Reasoning based Pixel-to-Offset Prediction Network for 3D Hand Pose Estimation from a Single Depth Image | Linpu Fang, Xingyan Liu, Li Liu, Hang Xu, Wenxiong Kang | N/A | |
| Dynamic Group Convolution for Accelerating Convolutional Neural Networks | Zhuo Su, Linpu Fang, Wenxiong Kang, Dewen Hu, Matti Pietikäinen, Li Liu | N/A | |
| RD-GAN: Few/Zero-Shot Chinese Character Style Transfer via Radical Decomposition and Rendering | Yaoxiong Huang, Mengchao He, Lianwen Jin, Yongpan Wang | N/A | |
| Object-Contextual Representations for Semantic Segmentation | Yuhui Yuan, Xilin Chen, Jingdong Wang | N/A | |
| Efficient Spatio-Temporal Recurrent Neural Network for Video Deblurring | Zhihang Zhong, Ye Gao, Yinqiang Zheng, Bo Zheng | N/A | |
| Joint Semantic Instance Segmentation on Graphs with the Semantic Mutex Watershed | Steffen Wolf, Yuyan Li, Constantin Pape, Alberto Bailoni, Anna Kreshuk, Fred A. Hamprecht | N/A | |
| Photon-Efficient 3D Imaging with A Non-Local Neural Network | Jiayong Peng, Zhiwei Xiong, Xin Huang, Zheng-Ping Li, Dong Liu, Feihu Xu | N/A | |
| GeLaTO: Generative Latent Textured Objects | Ricardo Martin-Brualla, Rohit Pandey, Sofien Bouaziz, Matthew Brown, Dan B Goldman | N/A | |
| Improving Vision-and-Language Navigation with Image-Text Pairs from the Web | Arjun Majumdar, Ayush Shrivastava, Stefan Lee, Peter Anderson, Devi Parikh, Dhruv Batra | N/A | |
| Directional Temporal Modeling for Action Recognition | Xinyu Li, Bing Shuai, Joseph Tighe | N/A | |
| Shonan Rotation Averaging: Global Optimality by Surfing SO(p)^n | Frank Dellaert, David M. Rosen, Jing Wu, Robert Mahony, Luca Carlone | N/A | |
| Semantic Curiosity for Active Visual Learning | Devendra Singh Chaplot, Helen Jiang, Saurabh Gupta, Abhinav Gupta | N/A | |
| Multi-Temporal Recurrent Neural Networks For Progressive Non-Uniform Single Image Deblurring With Incremental Temporal Training | Dongwon Park, Dong Un Kang, Jisoo Kim, Se Young Chun | N/A | |
| ProgressFace: Scale-Aware Progressive Learning for Face Detection | Jiashu Zhu, Dong Li, Tiantian Han, Lu Tian, Yi Shan | N/A | |
| Learning Multi-layer Latent Variable Model via Variational Optimization of Short Run MCMC for Approximate Inference | Erik Nijkamp, Bo Pang, Tian Han, Linqi Zhou, Song-Chun Zhu, Ying Nian Wu | N/A | |
| CoTeRe-Net: Discovering Collaborative Ternary Relations in Videos | Zhensheng Shi, Cheng Guan, Liangjie Cao, Qianqian Li, Ju Liang, Zhaorui Gu, Haiyong Zheng, Bing Zheng | N/A | |
| Modeling the Effects of Windshield Refraction for Camera Calibration | Frank Verbiest, Marc Proesmans, Luc Van Gool | N/A | |
| Unsupervised Domain Adaptation for Semantic Segmentation of NIR Images through Generative Latent Search | Prashant Pandey, Aayush Kumar Tyagi, Sameer Ambekar, Prathosh AP | N/A | |
| PROFIT: A Novel Training Method for sub-4-bit MobileNet Models | Eunhyeok Park, Sungjoo Yoo | N/A | |
| Visual Relation Grounding in Videos | Junbin Xiao, Xindi Shang, Xun Yang, Sheng Tang, Tat-Seng Chua | N/A | |
| Weakly Supervised 3D Human Pose and Shape Reconstruction with Normalizing Flows | Andrei Zanfir, Eduard Gabriel Bazavan, Hongyi Xu, William T. Freeman, Rahul Sukthankar, Cristian Sminchisescu | N/A | |
| Controlling Style and Semantics in Weakly-Supervised Image Generation | Dario Pavllo, Aurelien Lucchi, Thomas Hofmann | N/A | |
| Jointly learning visual motion and confidence from local patches in event cameras | Daniel R. Kepple, Daewon Lee, Colin Prepsius, Volkan Isler, Il Memming Park, Daniel D. Lee | N/A | |
| SODA: Story Oriented Dense Video Captioning Evaluation Framework | Soichiro Fujita, Tsutomu Hirao, Hidetaka Kamigaito, Manabu Okumura, Masaaki Nagata | N/A | |
| Sketch-Guided Object Localization in Natural Images | Aditay Tripathi, Rajath R. Dani, Anand Mishra, Anirban Chakraborty | N/A | |
| A unifying mutual information view of metric learning: cross-entropy vs. pairwise losses | Malik Boudiaf, Jérôme Rony, Imtiaz Masud Ziko, Eric Granger, Marco Pedersoli, Pablo Piantanida, Ismail Ben Ayed | N/A | |
| Behind the Scene: Revealing the Secrets of Pre-trained Vision-and-Language Models | Jize Cao, Zhe Gan, Yu Cheng, Licheng Yu, Yen-Chun Chen, Jingjing Liu | N/A | |
| The Hessian Penalty: A Weak Prior for Unsupervised Disentanglement | William Peebles, John Peebles, Jun-Yan Zhu, Alexei Efros, Antonio Torralba | N/A | |
| STAR: Sparse Trained Articulated Human Body Regressor | Ahmed A. A. Osman, Timo Bolkart, Michael J. Black | N/A | |
| Optical Flow Distillation: Towards Efficient and Stable Video Style Transfer | Xinghao Chen, Yiman Zhang, Yunhe Wang, Han Shu, Chunjing Xu, Chang Xu | N/A | |
| Collaboration by Competition: Self-coordinated Knowledge Amalgamation for Multi-talent Student Learning | Sihui Luo, Wenwen Pan, Xinchao Wang, Dazhou Wang, Haihong Tang, Mingli Song | N/A | |
| Do Not Disturb Me: Person Re-identification Under the Interference of Other Pedestrians | Shizhen Zhao, Changxin Gao, Jun Zhang, Hao Cheng, Chuchu Han, Xinyang Jiang, Xiaowei Guo, Wei-Shi Zheng, Nong Sang, Xing Sun | N/A | |
| Learning 3D Part Assembly from a Single Image | Yichen Li, Kaichun Mo, Lin Shao, Minhyuk Sung, Leonidas Guibas | N/A | |
| PT2PC: Learning to Generate 3D Point Cloud Shapes from Part Tree Conditions | Kaichun Mo, He Wang, Xinchen Yan, Leonidas Guibas | N/A | |
| Highly Efficient Salient Object Detection with 100K Parameters | Shang-Hua Gao, Yong-Qiang Tan, Ming-Ming Cheng, Chengze Lu, Yunpeng Chen, Shuicheng Yan | N/A | |
| HardGAN: A Haze-Aware Representation Distillation GAN for Single Image Dehazing | Qili Deng, Ziling Huang, Chung-Chi Tsai, Chia-Wen Lin | N/A | |
| Lifespan Age Transformation Synthesis | Roy Or-El, Soumyadip Sengupta, Ohad Fried, Eli Shechtman, Ira Kemelmacher-Shlizerman | N/A | |
| Domain2Vec: Domain Embedding for Unsupervised Domain Adaptation | Xingchao Peng, Yichen Li, Kate Saenko | N/A | |
| Simulating Content Consistent Vehicle Datasets with Attribute Descent | Yue Yao, Liang Zheng, Xiaodong Yang, Milind Naphade, Tom Gedeon | N/A | |
| Multiview Detection with Feature Perspective Transformation | Yunzhong Hou, Liang Zheng, Stephen Gould | N/A | |
| Learning Object Relation Graph and Tentative Policy for Visual Navigation | Heming Du, Xin Yu, Liang Zheng | N/A | |
| Adversarial Self-Supervised Learning for Semi-Supervised 3D Action Recognition | Chenyang Si, Xuecheng Nie, Wei Wang, Liang Wang, Tieniu Tan, Jiashi Feng | N/A | |
| Across Scales & Across Dimensions: Temporal Super-Resolution using Deep Internal Learning | Liad Pollak Zuckerman, Eyal Naor, George Pisha, Shai Bagon, Michal Irani | N/A | |
| Inducing Optimal Attribute Representations for Conditional GANs | Binod Bhattarai, Tae-Kyun Kim | N/A | |
| AR-Net: Adaptive Frame Resolution for Efficient Action Recognition | Yue Meng, Chung-Ching Lin, Rameswar Panda, Prasanna Sattigeri, Leonid Karlinsky, Aude Oliva, Kate Saenko, Rogerio Feris | N/A | |
| Image-to-Voxel Model Translation for 3D Scene Reconstruction and Segmentation | Vladimir V. Kniaz, Vladimir A. Knyaz, Fabio Remondino, Artem Bordodymov, Petr Moshkantsev | N/A | |
| Consistency Guided Scene Flow Estimation | Yuhua Chen, Luc Van Gool, Cordelia Schmid, Cristian Sminchisescu | N/A | |
| Autoregressive Unsupervised Image Segmentation | Yassine Ouali, Céline Hudelot, Myriam Tami | N/A | |
| Controllable Image Synthesis via SegVAE | Yen-Chi Cheng, Hsin-Ying Lee, Min Sun, Ming-Hsuan Yang | N/A | |
| Off-Policy Reinforcement Learning for Efficient and Effective GAN Architecture Search | Yuan Tian, Qin Wang, Zhiwu Huang, Wen Li, Dengxin Dai, Minghao Yang, Jun Wang, Olga Fink | N/A | |
| Efficient Non-Line-of-Sight Imaging from Transient Sinograms | Mariko Isogawa, Dorian Chan, Ye Yuan, Kris Kitani, Matthew O’Toole | N/A | |
| Texture Hallucination for Large-Factor Painting Super-Resolution | Yulun Zhang, Zhifei Zhang, Stephen DiVerdi, Zhaowen Wang, Jose Echevarria, Yun Fu | N/A | |
| Learning Progressive Joint Propagation for Human Motion Prediction | Yujun Cai, Lin Huang, Yiwei Wang, Tat-Jen Cham, Jianfei Cai, Junsong Yuan, Jun Liu, Xu Yang, Yiheng Zhu, Xiaohui Shen, Ding Liu, Jing Liu, Nadia Magnenat Thalmann | N/A | |
| Image Stitching and Rectification for Hand-Held Cameras | Bingbing Zhuang, Quoc-Huy Tran | N/A | |
| ParSeNet: A Parametric Surface Fitting Network for 3D Point Clouds | Gopal Sharma, Difan Liu, Subhransu Maji, Evangelos Kalogerakis, Siddhartha Chaudhuri, Radomír Měch | N/A | |
| The Group Loss for Deep Metric Learning | Ismail Elezi, Sebastiano Vascon, Alessandro Torcinovich, Marcello Pelillo, Laura Leal-Taixé | N/A | |
| Learning Object Depth from Camera Motion and Video Object Segmentation | Brent A. Griffin, Jason J. Corso | N/A | |
| OnlineAugment: Online Data Augmentation with Less Domain Knowledge | Zhiqiang Tang, Yunhe Gao, Leonid Karlinsky, Prasanna Sattigeri, Rogerio Feris, Dimitris Metaxas | N/A | |
| Learning Pairwise Inter-Plane Relations for Piecewise Planar Reconstruction | Yiming Qian, Yasutaka Furukawa | N/A | |
| Intra-class Feature Variation Distillation for Semantic Segmentation | Yukang Wang, Wei Zhou, Tao Jiang, Xiang Bai, Yongchao Xu | N/A | |
| Temporal Distinct Representation Learning for Action Recognition | Junwu Weng, Donghao Luo, Yabiao Wang, Ying Tai, Chengjie Wang, Jilin Li, Feiyue Huang, Xudong Jiang, Junsong Yuan | N/A | |
| Representative Graph Neural Network | Changqian Yu, Yifan Liu, Changxin Gao, Chunhua Shen, Nong Sang | N/A | |
| Deformation-Aware 3D Model Embedding and Retrieval | Mikaela Angelina Uy, Jingwei Huang, Minhyuk Sung, Tolga Birdal, Leonidas Guibas | N/A | |
| Atlas: End-to-End 3D Scene Reconstruction from Posed Images | Zak Murez, Tarrence van As, James Bartolozzi, Ayan Sinha, Vijay Badrinarayanan, Andrew Rabinovich | N/A | |
| Multiple Class Novelty Detection Under Data Distribution Shift | Poojan Oza, Hien V. Nguyen, Vishal M. Patel | N/A | |
| Colorization of Depth Map via Disentanglement | Chung-Sheng Lai, Zunzhi You, Ching-Chun Huang, Yi-Hsuan Tsai, Wei-Chen Chiu | N/A | |
| Beyond Controlled Environments: 3D Camera Re-Localization in Changing Indoor Scenes | Johanna Wald, Torsten Sattler, Stuart Golodetz, Tommaso Cavallari, Federico Tombari | N/A | |
| GeoGraph: Graph-based multi-view object detection with geometric cues end-to-end | Ahmed Samy Nassar, Stefano D’Aronco, Sébastien Lefèvre, Jan D. Wegner | N/A | |
| Localizing the Common Action Among a Few Videos | Pengwan Yang, Vincent Tao Hu, Pascal Mettes, Cees G. M. Snoek | N/A | |
| TAFSSL: Task-Adaptive Feature Sub-Space Learning for few-shot classification | Moshe Lichtenstein, Prasanna Sattigeri, Rogerio Feris, Raja Giryes, Leonid Karlinsky | N/A | |
| Traffic Accident Benchmark for Causality Recognition | Tackgeun You, Bohyung Han | N/A | |
| Face Anti-Spoofing with Human Material Perception | Zitong Yu, Xiaobai Li, Xuesong Niu, Jingang Shi, Guoying Zhao | N/A | |
| How Can I See My Future? FvTraj: Using First-person View for Pedestrian Trajectory Prediction | Huikun Bi, Ruisi Zhang, Tianlu Mao, Zhigang Deng, Zhaoqi Wang | N/A | |
| Multiple Expert Brainstorming for Domain Adaptive Person Re-identification | Yunpeng Zhai, Qixiang Ye, Shijian Lu, Mengxi Jia, Rongrong Ji, Yonghong Tian | N/A | |
| NASA: Neural Articulated Shape Approximation | Boyang Deng, JP Lewis, Timothy Jeruzalski, Gerard Pons-Moll, Geoffrey Hinton, Mohammad Norouzi, Andrea Tagliasacchi | N/A | |
| Towards Unique and Informative Captioning of Images | Zeyu Wang, Berthy Feng, Karthik Narasimhan, Olga Russakovsky | N/A | |
| When Does Self-supervision Improve Few-shot Learning? | Jong-Chyi Su, Subhransu Maji, Bharath Hariharan | N/A | |
| Two-branch Recurrent Network for Isolating Deepfakes in Videos | Iacopo Masi, Aditya Killekar, Royston Marian Mascarenhas, Shenoy Pratik Gurudatt, Wael AbdAlmageed | N/A | |
| Incremental Few-Shot Meta-Learning via Indirect Discriminant Alignment | Qing Liu, Orchid Majumder, Alessandro Achille, Avinash Ravichandran, Rahul Bhotika, Stefano Soatto | N/A | |
| BigNAS: Scaling Up Neural Architecture Search with Big Single-Stage Models | Jiahui Yu, Pengchong Jin, Hanxiao Liu, Gabriel Bender, Pieter-Jan Kindermans, Mingxing Tan, Thomas Huang, Xiaodan Song, Ruoming Pang, Quoc Le | N/A | |
| Differentiable Hierarchical Graph Grouping for Multi-Person Pose Estimation | Sheng Jin, Wentao Liu, Enze Xie, Wenhai Wang, Chen Qian, Wanli Ouyang, Ping Luo | N/A | |
| Global Distance-distributions Separation for Unsupervised Person Re-identification | Xin Jin, Cuiling Lan, Wenjun Zeng, Zhibo Chen | N/A | |
| I2L-MeshNet: Image-to-Lixel Prediction Network for Accurate 3D Human Pose and Mesh Estimation from a Single RGB Image | Gyeongsik Moon, Kyoung Mu Lee | N/A | |
| Pose2Mesh: Graph Convolutional Network for 3D Human Pose and Mesh Recovery from a 2D Human Pose | Hongsuk Choi, Gyeongsik Moon, Kyoung Mu Lee | N/A | |
| ALRe: Outlier Detection for Guided Refinement | Mingzhu Zhu, Zhang Gao, Junzhi Yu, Bingwei He, Jiantao Liu | N/A | |
| Weakly-Supervised Crowd Counting Learns from Sorting rather than Locations | Yifan Yang, Guorong Li, Zhe Wu, Li Su, Qingming Huang, Nicu Sebe | N/A | |
| Unsupervised Domain Attention Adaptation Network for Caricature Attribute Recognition | Wen Ji, Kelei He, Jing Huo, Zheng Gu, Yang Gao | N/A | |
| Many-shot from Low-shot: Learning to Annotate using Mixed Supervision for Object Detection | Carlo Biffi, Steven McDonagh, Philip Torr, Aleš Leonardis, Sarah Parisot | N/A | |
| Curriculum DeepSDF | Yueqi Duan, Haidong Zhu, He Wang, Li Yi, Ram Nevatia, Leonidas J. Guibas | N/A | |
| Meshing Point Clouds with Predicted Intrinsic-Extrinsic Ratio Guidance | Minghua Liu, Xiaoshuai Zhang, Hao Su | N/A | |
| Improved Adversarial Training via Learned Optimizer | Yuanhao Xiong, Cho-Jui Hsieh | N/A | |
| Component Divide-and-Conquer for Real-World Image Super-Resolution | Pengxu Wei, Ziwei Xie, Hannan Lu, Zongyuan Zhan, Qixiang Ye, Wangmeng Zuo, Liang Lin | N/A | |
| Enabling Deep Residual Networks for Weakly Supervised Object Detection | Yunhang Shen, Rongrong Ji, Yan Wang, Zhiwei Chen, Feng Zheng, Feiyue Huang, Yunsheng Wu | N/A | |
| Deep near-light photometric stereo for spatially varying reflectances | Hiroaki Santo, Michael Waechter, Yasuyuki Matsushita | N/A | |
| Learning Visual Representations with Caption Annotations | Mert Bulent Sariyildiz, Julien Perez, Diane Larlus | N/A | |
| Solving Long-tailed Recognition with Deep Realistic Taxonomic Classifier | Tz-Ying Wu, Pedro Morgado, Pei Wang, Chih-Hui Ho, Nuno Vasconcelos | N/A | |
| Regression of Instance Boundary by Aggregated CNN and GCN | Yanda Meng, Wei Meng, Dongxu Gao, Yitian Zhao, Xiaoyun Yang, Xiaowei Huang, Yalin Zheng | N/A | |
| Social Adaptive Module for Weakly-supervised Group Activity Recognition | Rui Yan, Lingxi Xie, Jinhui Tang, Xiangbo Shu, Qi Tian | N/A | |
| RGB-D Salient Object Detection with Cross-Modality Modulation and Selection | Chongyi Li, Runmin Cong, Yongri Piao, Qianqian Xu, Chen Change Loy | N/A | |
| RetrieveGAN: Image Synthesis via Differentiable Patch Retrieval | Hung-Yu Tseng, Hsin-Ying Lee, Lu Jiang, Ming-Hsuan Yang, Weilong Yang | N/A | |
| Cheaper Pre-training Lunch: An Efficient Paradigm for Object Detection | Dongzhan Zhou, Xinchi Zhou, Hongwen Zhang, Shuai Yi, Wanli Ouyang | N/A | |
| Faster Person Re-Identification | Guan’an Wang, Shaogang Gong, Jian Cheng, Zengguang Hou | N/A | |
| Quantization Guided JPEG Artifact Correction | Max Ehrlich, Ser-Nam Lim, Larry Davis, Abhinav Shrivastava | N/A | |
| 3PointTM: Faster Measurement of High-Dimensional Transmission Matrices | Yujun Chen, Manoj Kumar Sharma, Ashutosh Sabharwal, Ashok Veeraraghavan, Aswin C. Sankaranarayanan | N/A | |
| Joint Bilateral Learning for Real-time Universal Photorealistic Style Transfer | Xide Xia, Meng Zhang, Tianfan Xue, Zheng Sun, Hui Fang, Brian Kulis, Jiawen Chen | N/A | |
| Beyond 3DMM Space: Towards Fine-grained 3D Face Reconstruction | Xiangyu Zhu, Fan Yang, Di Huang, Chang Yu, Hao Wang, Jianzhu Guo, Zhen Lei, Stan Z. Li | N/A | |
| World-Consistent Video-to-Video Synthesis | Arun Mallya, Ting-Chun Wang, Karan Sapra, Ming-Yu Liu | N/A | |
| Commonality-Parsing Network across Shape and Appearance for Partially Supervised Instance Segmentation | Qi Fan, Lei Ke, Wenjie Pei, Chi-Keung Tang, Yu-Wing Tai | N/A | |
| GMNet: Graph Matching Network for Large Scale Part Semantic Segmentation in the Wild | Umberto Michieli, Edoardo Borsato, Luca Rossi, Pietro Zanuttigh | N/A | |
| Event-based Asynchronous Sparse Convolutional Networks | Nico Messikommer, Daniel Gehrig, Antonio Loquercio, Davide Scaramuzza | N/A | |
| AtlantaNet: Inferring the 3D Indoor Layout from a Single 360° Image beyond the Manhattan World Assumption | Giovanni Pintore, Marco Agus, Enrico Gobbetti | N/A | |
| AttentionNAS: Spatiotemporal Attention Cell Search for Video Classification | Xiaofang Wang, Xuehan Xiong, Maxim Neumann, AJ Piergiovanni, Michael S. Ryoo, Anelia Angelova, Kris M. Kitani, Wei Hua | N/A | |
| REMIND Your Neural Network to Prevent Catastrophic Forgetting | Tyler L. Hayes, Kushal Kafle, Robik Shrestha, Manoj Acharya, Christopher Kanan | N/A | |
| Image Classification in the Dark using Quanta Image Sensors | Abhiram Gnanasambandam, Stanley H. Chan | N/A | |
| n-Reference Transfer Learning for Saliency Prediction | Yan Luo, Yongkang Wong, Mohan S. Kankanhalli, Qi Zhao | N/A | |
| Progressively Guided Alternate Refinement Network for RGB-D Salient Object Detection | Shuhan Chen, Yun Fu | N/A | |
| Bottom-Up Temporal Action Localization with Mutual Regularization | Peisen Zhao, Lingxi Xie, Chen Ju, Ya Zhang, Yanfeng Wang, Qi Tian | N/A | |
| On Modulating the Gradient for Meta-Learning | Christian Simon, Piotr Koniusz, Richard Nock, Mehrtash Harandi | N/A | |
| Domain-Specific Mappings for Generative Adversarial Style Transfer | Hsin-Yu Chang, Zhixiang Wang, Yung-Yu Chuang | N/A | |
| DiVA: Diverse Visual Feature Aggregation for Deep Metric Learning | Timo Milbich, Karsten Roth, Homanga Bharadhwaj, Samarth Sinha, Yoshua Bengio, Björn Ommer, Joseph Paul Cohen | N/A | |
| DHP: Differentiable Meta Pruning via HyperNetworks | Yawei Li, Shuhang Gu, Kai Zhang, Luc Van Gool, Radu Timofte | N/A | |
| Deep Transferring Quantization | Zheng Xie, Zhiquan Wen, Jing Liu, Zhiqiang Liu, Xixian Wu, Mingkui Tan | N/A | |
| Deep Credible Metric Learning for Unsupervised Domain Adaptation Person Re-identification | Guangyi Chen, Yuhao Lu, Jiwen Lu, Jie Zhou | N/A | |
| Temporal Coherence or Temporal Motion: Which is More Critical for Video-based Person Re-identification? | Guangyi Chen, Yongming Rao, Jiwen Lu, Jie Zhou | N/A | |
| Arbitrary-Oriented Object Detection with Circular Smooth Label | Xue Yang, Junchi Yan | N/A | |
| Learning Event-Driven Video Deblurring and Interpolation | Songnan Lin, Jiawei Zhang, Jinshan Pan, Zhe Jiang, Dongqing Zou, Yongtian Wang, Jing Chen, Jimmy Ren | N/A | |
| Vectorizing World Buildings: Planar Graph Reconstruction by Primitive Detection and Relationship Inference | Nelson Nauata, Yasutaka Furukawa | N/A | |
| Learning to Combine: Knowledge Aggregation for Multi-Source Domain Adaptation | Hang Wang, Minghao Xu, Bingbing Ni, Wenjun Zhang | N/A | |
| CSCL: Critical Semantic-Consistent Learning for Unsupervised Domain Adaptation | Jiahua Dong, Yang Cong, Gan Sun, Yuyang Liu, Xiaowei Xu | N/A | |
| Prototype Mixture Models for Few-shot Semantic Segmentation | Boyu Yang, Chang Liu, Bohao Li, Jianbin Jiao, Qixiang Ye | N/A | |
| Webly Supervised Image Classification with Self-Contained Confidence | Jingkang Yang, Litong Feng, Weirong Chen, Xiaopeng Yan, Huabin Zheng, Ping Luo, Wayne Zhang | N/A | |
| Search What You Want: Barrier Panelty NAS for Mixed Precision Quantization | Haibao Yu, Qi Han, Jianbo Li, Jianping Shi, Guangliang Cheng, Bin Fan | N/A | |
| Monocular 3D Object Detection via Feature Domain Adaptation | Xiaoqing Ye, Liang Du, Yifeng Shi, Yingying Li, Xiao Tan, Jianfeng Feng, Errui Ding, Shilei Wen | N/A | |
| Talking-head Generation with Rhythmic Head Motion | Lele Chen, Guofeng Cui, Celong Liu, Zhong Li, Ziyi Kou, Yi Xu, Chenliang Xu | N/A | |
| AUTO3D: Novel view synthesis through unsupervisely learned variational viewpoint and global 3D representation | Xiaofeng Liu, Tong Che, Yiqun Lu, Chao Yang, Site Li, Jane You | N/A | |
| VPN: Learning Video-Pose Embedding for Activities of Daily Living | Srijan Das, Saurav Sharma, Rui Dai, François Brémond, Monique Thonnat | N/A | |
| Soft Anchor-Point Object Detection | Chenchen Zhu, Fangyi Chen, Zhiqiang Shen, Marios Savvides | N/A | |
| Beyond Fixed Grid: Learning Geometric Image Representation with a Deformable Grid | Jun Gao, Zian Wang, Jinchen Xuan, Sanja Fidler | N/A | |
| Soft Expert Reward Learning for Vision-and-Language Navigation | Hu Wang, Qi Wu, Chunhua Shen | N/A | |
| Part-aware Prototype Network for Few-shot Semantic Segmentation | Yongfei Liu, Xiangyi Zhang, Songyang Zhang, Xuming He | N/A | |
| Learning from Extrinsic and Intrinsic Supervisions for Domain Generalization | Shujun Wang, Lequan Yu, Caizi Li, Chi-Wing Fu, Pheng-Ann Heng | N/A | |
| Joint Learning of Social Groups, Individuals Action and Sub-group Activities in Videos | Mahsa Ehsanpour, Alireza Abedin, Fatemeh Saleh, Javen Shi, Ian Reid, Hamid Rezatofighi | N/A | |
| Whole-Body Human Pose Estimation in the Wild | Sheng Jin, Lumin Xu, Jin Xu, Can Wang, Wentao Liu, Chen Qian, Wanli Ouyang, Ping Luo | N/A | |
| Relative Pose Estimation of Calibrated Cameras with Known SE(3) Invariants | Bo Li, Evgeniy Martyushev, Gim Hee Lee | N/A | |
| Sequential Convolution and Runge-Kutta Residual Architecture for Image Compressed Sensing | Runkai Zheng, Yinqi Zhang, Daolang Huang, Qingliang Chen | N/A | |
| Deep Hough Transform for Semantic Line Detection | Qi Han, Kai Zhao, Jun Xu, Ming-Ming Cheng | N/A | |
| Structured Landmark Detection via Topology-Adapting Deep Graph Learning | Weijian Li, Yuhang Lu, Kang Zheng, Haofu Liao, Chihung Lin, Jiebo Luo, Chi-Tung Cheng, Jing Xiao, Le Lu, Chang-Fu Kuo, Shun Miao | N/A | |
| 3D Human Shape and Pose from a Single Low-Resolution Image with Self-Supervised Learning | Xiangyu Xu, Hao Chen, Francesc Moreno-Noguer, László A. Jeni, Fernando De la Torre | N/A | |
| Learning to Balance Specificity and Invariance for In and Out of Domain Generalization | Prithvijit Chattopadhyay, Yogesh Balaji, Judy Hoffman | N/A | |
| Contrastive Learning for Unpaired Image-to-Image Translation | Taesung Park, Alexei A. Efros, Richard Zhang, Jun-Yan Zhu | N/A | |
| DLow: Diversifying Latent Flows for Diverse Human Motion Prediction | Ye Yuan, Kris Kitani | N/A | |
| GRNet: Gridding Residual Network for Dense Point Cloud Completion | Haozhe Xie, Hongxun Yao, Shangchen Zhou, Jiageng Mao, Shengping Zhang, Wenxiu Sun | N/A | |
| Gait Lateral Network: Learning Discriminative and Compact Representations for Gait Recognition | Saihui Hou, Chunshui Cao, Xu Liu, Yongzhen Huang | N/A | |
| Blind Face Restoration via Deep Multi-scale Component Dictionaries | Xiaoming Li, Chaofeng Chen, Shangchen Zhou, Xianhui Lin, Wangmeng Zuo, Lei Zhang | N/A | |
| Robust Neural Networks inspired by Strong Stability Preserving Runge-Kutta methods | Byungjoo Kim, Bryce Chudomelka, Jinyoung Park, Jaewoo Kang, Youngjoon Hong, Hyunwoo J. Kim | N/A | |
| Inequality-Constrained and Robust 3D Face Model Fitting | Evangelos Sariyanidi, Casey J. Zampella, Robert T. Schultz, Birkan Tunc | N/A | |
| Gabor Layers Enhance Network Robustness | Juan C. Pérez, Motasem Alfarra, Guillaume Jeanneret, Adel Bibi, Ali Thabet, Bernard Ghanem, Pablo Arbeláez | N/A | |
| Conditional Image Repainting via Semantic Bridge and Piecewise Value Function | Shuchen Weng, Wenbo Li, Dawei Li, Hongxia Jin, Boxin Shi | N/A | |
| Learnable Cost Volume Using the Cayley Representation | Taihong Xiao, Jinwei Yuan, Deqing Sun, Qifei Wang, Xin-Yu Zhang, Kehan Xu, Ming-Hsuan Yang | N/A | |
| HALO: Hardware-Aware Learning to Optimize | Chaojian Li, Tianlong Chen, Haoran You, Zhangyang Wang, Yingyan Lin | N/A | |
| Structured3D: A Large Photo-realistic Dataset for Structured 3D Modeling | Jia Zheng, Junfei Zhang, Jing Li, Rui Tang, Shenghua Gao, Zihan Zhou | N/A | |
| BroadFace: Looking at Tens of Thousands of People at Once for Face Recognition | Yonghyun Kim, Wonpyo Park, Jongju Shin | N/A | |
| Interpretable Visual Reasoning via Probabilistic Formulation under Natural Supervision | Xinzhe Han, Shuhui Wang, Chi Su, Weigang Zhang, Qingming Huang, Qi Tian | N/A | |
| Domain Adaptive Semantic Segmentation Using Weak Labels | Sujoy Paul, Yi-Hsuan Tsai, Samuel Schulter, Amit K. Roy-Chowdhury, Manmohan Chandraker | N/A | |
| Knowledge Distillation Meets Self-Supervision | Guodong Xu, Ziwei Liu, Xiaoxiao Li, Chen Change Loy | N/A | |
| Efficient Neighbourhood Consensus Networks via Submanifold Sparse Convolutions | Ignacio Rocco, Relja Arandjelović, Josef Sivic | N/A | |
| Reconstructing the Noise Variance Manifold for Image Denoising | Ioannis Marras, Grigorios G. Chrysos, Ioannis Alexiou, Gregory Slabaugh, Stefanos Zafeiriou | N/A | |
| Occlusion-Aware Depth Estimation with Adaptive Normal Constraints | Xiaoxiao Long, Lingjie Liu, Christian Theobalt, Wenping Wang | N/A | |
| VisualEchoes: Spatial Image Representation Learning through Echolocation | Ruohan Gao, Changan Chen, Ziad Al-Halah, Carl Schissler, Kristen Grauman | N/A | |
| Smooth-AP: Smoothing the Path Towards Large-Scale Image Retrieval | Andrew Brown, Weidi Xie, Vicky Kalogeiton, Andrew Zisserman | N/A | |
| Naive-Student: Leveraging Semi-Supervised Learning in Video Sequences for Urban Scene Segmentation | Liang-Chieh Chen, Raphael Gontijo Lopes, Bowen Cheng, Maxwell D. Collins, Ekin D. Cubuk, Barret Zoph, Hartwig Adam, Jonathon Shlens | N/A | |
| Spatially Aware Multimodal Transformers for TextVQA | Yash Kant, Dhruv Batra, Peter Anderson, Alexander Schwing, Devi Parikh, Jiasen Lu, Harsh Agrawal | N/A | |
| Every Pixel Matters: Center-aware Feature Alignment for Domain Adaptive Object Detector | Cheng-Chun Hsu, Yi-Hsuan Tsai, Yen-Yu Lin, Ming-Hsuan Yang | N/A | |
| URIE: Universal Image Enhancement for Visual Recognition in the Wild | Taeyoung Son, Juwon Kang, Namyup Kim, Sunghyun Cho, Suha Kwak | N/A | |
| Pyramid Multi-view Stereo Net with Self-adaptive View Aggregation | Hongwei Yi, Zizhuang Wei, Mingyu Ding, Runze Zhang, Yisong Chen, Guoping Wang, Yu-Wing Tai | N/A | |
| SPL-MLL: Selecting Predictable Landmarks for Multi-Label Learning | Junbing Li, Changqing Zhang, Pengfei Zhu, Baoyuan Wu, Lei Chen, Qinghua Hu | N/A | |
| Unpaired Image-to-Image Translation using Adversarial Consistency Loss | Yihao Zhao, Ruihai Wu, Hao Dong | N/A | |
| Discriminability Distillation in Group Representation Learning | Manyuan Zhang, Guanglu Song, Hang Zhou, Yu Liu | N/A | |
| Monocular Expressive Body Regression through Body-Driven Attention | Vasileios Choutas, Georgios Pavlakos, Timo Bolkart, Dimitrios Tzionas, Michael J. Black | N/A | |
| Dual Adversarial Network: Toward Real-world Noise Removal and Noise Generation | Zongsheng Yue, Qian Zhao, Lei Zhang, Deyu Meng | N/A | |
| Linguistic Structure Guided Context Modeling for Referring Image Segmentation | Tianrui Hui, Si Liu, Shaofei Huang, Guanbin Li, Sansi Yu, Faxi Zhang, Jizhong Han | N/A | |
| Federated Visual Classification with Real-World Data Distribution | Tzu-Ming Harry Hsu, Hang Qi, Matthew Brown | N/A | |
| Robust Re-Identification by Multiple Views Knowledge Distillation | Angelo Porrello, Luca Bergamini, Simone Calderara | N/A | |
| Defocus Deblurring Using Dual-Pixel Data | Abdullah Abuolaim, Michael S. Brown | N/A | |
| RhyRNN: Rhythmic RNN for Recognizing Events in Long and Complex Videos | Tianshu Yu, Yikang Li, Baoxin Li | N/A | |
| Take an Emotion Walk: Perceiving Emotions from Gaits Using Hierarchical Attention Pooling and Affective Mapping | Uttaran Bhattacharya, Christian Roncal, Trisha Mittal, Rohan Chandra, Kyra Kapsaskis, Kurt Gray, Aniket Bera, Dinesh Manocha | N/A | |
| Weighing Counts: Sequential Crowd Counting by Reinforcement Learning | Liang Liu, Hao Lu, Hongwei Zou, Haipeng Xiong, Zhiguo Cao, Chunhua Shen | N/A | |
| Reflection Backdoor: A Natural Backdoor Attack on Deep Neural Networks | Yunfei Liu, Xingjun Ma, James Bailey, Feng Lu | N/A | |
| Learning to Learn with Variational Information Bottleneck for Domain Generalization | Yingjun Du, Jun Xu, Huan Xiong, Qiang Qiu, Xiantong Zhen, Cees G. M. Snoek, Ling Shao | N/A | |
| Deep Positional and Relational Feature Learning for Rotation-Invariant Point Cloud Analysis | Ruixuan Yu, Xin Wei, Federico Tombari, Jian Sun | N/A | |
| Thanks for Nothing: Predicting Zero-Valued Activations with Lightweight Convolutional Neural Networks | Gil Shomron, Ron Banner, Moran Shkolnik, Uri Weiser | N/A | |
| Layered Neighborhood Expansion for Incremental Multiple Graph Matching | Zixuan Chen, Zhihui Xie, Junchi Yan, Yinqiang Zheng, Xiaokang Yang | N/A | |
| SCAN: Learning to Classify Images without Labels | Wouter Van Gansbeke, Simon Vandenhende, Stamatios Georgoulis, Marc Proesmans, Luc Van Gool | N/A | |
| Graph convolutional networks for learning with few clean and many noisy labels | Ahmet Iscen, Giorgos Tolias, Yannis Avrithis, Ondřej Chum, Cordelia Schmid | N/A | |
| Object-and-Action Aware Model for Visual Language Navigation | Yuankai Qi, Zizheng Pan, Shengping Zhang, Anton van den Hengel, Qi Wu | N/A | |
| A Comprehensive Study of Weight Sharing in Graph Networks for 3D Human Pose Estimation | Kenkun Liu, Rongqi Ding, Zhiming Zou, Le Wang, Wei Tang | N/A | |
| MuCAN: Multi-Correspondence Aggregation Network for Video Super-Resolution | Wenbo Li, Xin Tao, Taian Guo, Lu Qi, Jiangbo Lu, Jiaya Jia | N/A | |
| Efficient Semantic Video Segmentation with Per-frame Inference | Yifan Liu, Chunhua Shen, Changqian Yu, Jingdong Wang | N/A | |
| Increasing the Robustness of Semantic Segmentation Models with Painting-by-Numbers | Christoph Kamann, Carsten Rother | N/A | |
| Deep Spiking Neural Network: Energy Efficiency Through Time based Coding | Bing Han, Kaushik Roy | N/A | |
| InfoFocus: 3D Object Detection for Autonomous Driving with Dynamic Information Modeling | Jun Wang, Shiyi Lan, Mingfei Gao, Larry S. Davis | N/A | |
| Utilizing Patch-level Category Activation Patterns for Multiple Class Novelty Detection | Poojan Oza, Vishal M. Patel | N/A | |
| People as Scene Probes | Yifan Wang, Brian L. Curless, Steven M. Seitz | N/A | |
| Mapping in a Cycle: Sinkhorn Regularized Unsupervised Learning for Point Cloud Shapes | Lei Yang, Wenxi Liu, Zhiming Cui, Nenglun Chen, Wenping Wang | N/A | |
| Label-Efficient Learning on Point Clouds using Approximate Convex Decompositions | Matheus Gadelha, Aruni RoyChowdhury, Gopal Sharma, Evangelos Kalogerakis, Liangliang Cao, Erik Learned-Miller, Rui Wang, Subhransu Maji | N/A | |
| TexMesh: Reconstructing Detailed Human Texture and Geometry from RGB-D Video | Tiancheng Zhi, Christoph Lassner, Tony Tung, Carsten Stoll, Srinivasa G. Narasimhan, Minh Vo | N/A | |
| Consistency-based Semi-supervised Active Learning: Towards Minimizing Labeling Cost | Mingfei Gao, Zizhao Zhang, Guo Yu, Sercan Ö. Arık, Larry S. Davis, Tomas Pfister | N/A | |
| Point-Set Anchors for Object Detection, Instance Segmentation and Pose Estimation | Fangyun Wei, Xiao Sun, Hongyang Li, Jingdong Wang, Stephen Lin | N/A | |
| Modeling 3D Shapes by Reinforcement Learning | Cheng Lin, Tingxiang Fan, Wenping Wang, Matthias Nießner | N/A | |
| LST-Net: Learning a Convolutional Neural Network with a Learnable Sparse Transform | Lida Li, Kun Wang, Shuai Li, Xiangchu Feng, Lei Zhang | N/A | |
| Learning What Makes a Difference from Counterfactual Examples and Gradient Supervision | Damien Teney, Ehsan Abbasnedjad, Anton van den Hengel | N/A | |
| CN: Channel Normalization For Point Cloud Recognition | Zetong Yang, Yanan Sun, Shu Liu, Xiaojuan Qi, Jiaya Jia | N/A | |
| Rethinking the Defocus Blur Detection Problem and A Real-Time Deep DBD Model | Ning Zhang, Junchi Yan | N/A | |
| AutoMix: Mixup Networks for Sample Interpolation via Cooperative Barycenter Learning | Jianchao Zhu, Liangliang Shi, Junchi Yan, Hongyuan Zha | N/A | |
| Scene Text Image Super-resolution in the wild | Wenjia Wang, Enze Xie, Xuebo Liu, Wenhai Wang, Ding Liang, Chunhua Shen, Xiang Bai | N/A | |
| Coupling Explicit and Implicit Surface Representations for Generative 3D Modeling | Omid Poursaeed, Matthew Fisher, Noam Aigerman, Vladimir G. Kim | N/A | |
| Learning Disentangled Representations with Latent Variation Predictability | Xinqi Zhu, Chang Xu, Dacheng Tao | N/A | |
| Deep Space-Time Video Upsampling Networks | Jaeyeon Kang, Younghyun Jo, Seoung Wug Oh, Peter Vajda, Seon Joo Kim | N/A | |
| Large-Scale Few-Shot Learning via Multi-Modal Knowledge Discovery | Shuo Wang, Jun Yue, Jianzhuang Liu, Qi Tian, Meng Wang | N/A | |
| Fast Video Object Segmentation using the Global Context Module | Yu Li, Zhuoran Shen, Ying Shan | N/A | |
| Uncertainty-Aware Weakly Supervised Action Detection from Untrimmed Videos | Anurag Arnab, Chen Sun, Arsha Nagrani, Cordelia Schmid | N/A | |
| Selecting Relevant Features from a Multi-domain Representation for Few-shot Classification | Nikita Dvornik, Cordelia Schmid, Julien Mairal | N/A | |
| MessyTable: Instance Association in Multiple Camera Views | Zhongang Cai, Junzhe Zhang, Daxuan Ren, Cunjun Yu, Haiyu Zhao, Shuai Yi, Chai Kiat Yeo, Chen Change Loy | N/A | |
| A Unified Framework for Shot Type Classification Based on Subject Centric Lens | Anyi Rao, Jiaze Wang, Linning Xu, Xuekun Jiang, Qingqiu Huang, Bolei Zhou, Dahua Lin | N/A | |
| BSL-1K: Scaling up co-articulated sign language recognition using mouthing cues | Samuel Albanie, Gül Varol, Liliane Momeni, Triantafyllos Afouras, Joon Son Chung, Neil Fox, Andrew Zisserman | N/A | |
| HTML: A Parametric Hand Texture Model for 3D Hand Reconstruction and Personalization | Neng Qian, Jiayi Wang, Franziska Mueller, Florian Bernard, Vladislav Golyanik, Christian Theobalt | N/A | |
| CycAs: Self-supervised Cycle Association for Learning Re-identifiable Descriptions | Zhongdao Wang, Jingwei Zhang, Liang Zheng, Yixuan Liu, Yifan Sun, Yali Li, Shengjin Wang | N/A | |
| Open-Edit: Open-Domain Image Manipulation with Open-Vocabulary Instructions | Xihui Liu, Zhe Lin, Jianming Zhang, Handong Zhao, Quan Tran, Xiaogang Wang, Hongsheng Li | N/A | |
| Towards Real-Time Multi-Object Tracking | Zhongdao Wang, Liang Zheng, Yixuan Liu, Yali Li, Shengjin Wang | N/A | |
| A Balanced and Uncertainty-aware Approach for Partial Domain Adaptation | Jian Liang, Yunbo Wang, Dapeng Hu, Ran He, Jiashi Feng | N/A | |
| Unsupervised Deep Metric Learning with Transformed Attention Consistency and Contrastive Clustering Loss | Yang Li, Shichao Kan, Zhihai He | N/A | |
| STEm-Seg: Spatio-temporal Embeddings for Instance Segmentation in Videos | Ali Athar, Sabarinath Mahadevan, Aljosa Osep, Laura Leal-Taixé, Bastian Leibe | N/A | |
| Hierarchical Style-based Networks for Motion Synthesis | Jingwei Xu, Huazhe Xu, Bingbing Ni, Xiaokang Yang, Xiaolong Wang, Trevor Darrell | N/A | |
| Who Left the Dogs Out? 3D Animal Reconstruction with Expectation Maximization in the Loop | Benjamin Biggs, Oliver Boyne, James Charles, Andrew Fitzgibbon, Roberto Cipolla | N/A | |
| Learning to Count in the Crowd from Limited Labeled Data | Vishwanath A. Sindagi, Rajeev Yasarla, Deepak Sam Babu, R. Venkatesh Babu, Vishal M. Patel | N/A | |
| SPOT: Selective Point Cloud Voting for Better Proposal in Point Cloud Object Detection | Hongyuan Du, Linjun Li, Bo Liu, Nuno Vasconcelos | N/A | |
| Explainable Face Recognition | Jonathan R. Williford, Brandon B. May, Jeffrey Byrne | N/A | |
| From Shadow Segmentation to Shadow Removal | Hieu Le, Dimitris Samaras | N/A | |
| Diverse and Admissible Trajectory Prediction through Multimodal Context Understanding | Seong Hyeon Park, Gyubok Lee, Jimin Seo, Manoj Bhat, Minseok Kang, Jonathan Francis, Ashwin Jadhav, Paul Pu Liang, Louis-Philippe Morency | N/A | |
| CONFIG: Controllable Neural Face Image Generation | Marek Kowalski, Stephan J. Garbin, Virginia Estellers, Tadas Baltrušaitis, Matthew Johnson, Jamie Shotton | N/A | |
| Single View Metrology in the Wild | Rui Zhu, Xingyi Yang, Yannick Hold-Geoffroy, Federico Perazzi, Jonathan Eisenmann, Kalyan Sunkavalli, Manmohan Chandraker | N/A | |
| Procedure Planning in Instructional Videos | Chien-Yi Chang, De-An Huang, Danfei Xu, Ehsan Adeli, Li Fei-Fei, Juan Carlos Niebles | N/A | |
| Funnel Activation for Visual Recognition | Ningning Ma, Xiangyu Zhang, Jian Sun | N/A | |
| GIQA: Generated Image Quality Assessment | Shuyang Gu, Jianmin Bao, Dong Chen, Fang Wen | N/A | |
| Adversarial Continual Learning | Sayna Ebrahimi, Franziska Meier, Roberto Calandra, Trevor Darrell, Marcus Rohrbach | N/A | |
| Adapting Object Detectors with Conditional Domain Normalization | Peng Su, Kun Wang, Xingyu Zeng, Shixiang Tang, Dapeng Chen, Di Qiu, Xiaogang Wang | N/A | |
| HARD-Net: Hardness-AwaRe Discrimination Network for 3D Early Activity Prediction | Tianjiao Li, Jun Liu, Wei Zhang, Lingyu Duan | N/A | |
| Pseudo RGB-D for Self-Improving Monocular SLAM and Depth Prediction | Lokender Tiwari, Pan Ji, Quoc-Huy Tran, Bingbing Zhuang, Saket Anand, Manmohan Chandraker | N/A | |
| Interpretable and Generalizable Person Re-Identification with Query-Adaptive Convolution and Temporal Lifting | Shengcai Liao, Ling Shao | N/A | |
| Self-supervised Bayesian Deep Learning for Image Recovery with Applications to Compressive Sensing | Tongyao Pang, Yuhui Quan, Hui Ji | N/A | |
| Graph-PCNN: Two Stage Human Pose Estimation with Graph Pose Refinement | Jian Wang, Xiang Long, Yuan Gao, Errui Ding, Shilei Wen | N/A | |
| Semi-supervised Learning with a Teacher-student Network for Generalized Attribute Prediction | Minchul Shin | N/A | |
| Unsupervised Domain Adaptation with Noise Resistible Mutual-Training for Person Re-identification | Fang Zhao, Shengcai Liao, Guo-Sen Xie, Jian Zhao, Kaihao Zhang, Ling Shao | N/A | |
| DPDist: Comparing Point Clouds Using Deep Point Cloud Distance | Dahlia Urbach, Yizhak Ben-Shabat, Michael Lindenbaum | N/A | |
| Bi-directional Cross-Modality Feature Propagation with Separation-and-Aggregation Gate for RGB-D Semantic Segmentation | Xiaokang Chen, Kwan-Yee Lin, Jingbo Wang, Wayne Wu, Chen Qian, Hongsheng Li, Gang Zeng | N/A | |
| DataMix: Efficient Privacy-Preserving Edge-Cloud Inference | Zhijian Liu, Zhanghao Wu, Chuang Gan, Ligeng Zhu, Song Han | N/A | |
| Neural Re-Rendering of Humans from a Single Image | Kripasindhu Sarkar, Dushyant Mehta, Weipeng Xu, Vladislav Golyanik, Christian Theobalt | N/A | |
| Reversing the cycle: self-supervised deep stereo through enhanced monocular distillation | Filippo Aleotti, Fabio Tosi, Li Zhang, Matteo Poggi, Stefano Mattoccia | N/A | |
| PIPAL: a Large-Scale Image Quality Assessment Dataset for Perceptual Image Restoration | Jinjin Gu, Haoming Cai, Haoyu Chen, Xiaoxing Ye, Jimmy S. Ren, Chao Dong | N/A | |
| Why do These Match? Explaining the Behavior of Image Similarity Models | Bryan A. Plummer, Mariya I. Vasileva, Vitali Petsiuk, Kate Saenko, David Forsyth | N/A | |
| CooGAN: A Memory-Efficient Framework for High-Resolution Facial Attribute Editing | Xuanhong Chen, Bingbing Ni, Naiyuan Liu, Ziang Liu, Yiliu Jiang, Loc Truong, Qi Tian | N/A | |
| Progressive Transformers for End-to-End Sign Language Production | Ben Saunders, Necati Cihan Camgoz, Richard Bowden | N/A | |
| Mask TextSpotter v3: Segmentation Proposal Network for Robust Scene Text Spotting | Minghui Liao, Guan Pang, Jing Huang, Tal Hassner, Xiang Bai | N/A | |
| Making Affine Correspondences Work in Camera Geometry Computation | Daniel Barath, Michal Polic, Wolfgang Förstner, Torsten Sattler, Tomas Pajdla, Zuzana Kukelova | N/A | |
| Sub-center ArcFace: Boosting Face Recognition by Large-scale Noisy Web Faces | Jiankang Deng, Jia Guo, Tongliang Liu, Mingming Gong, Stefanos Zafeiriou | N/A | |
| Foley Music: Learning to Generate Music from Videos | Chuang Gan, Deng Huang, Peihao Chen, Joshua B. Tenenbaum, Antonio Torralba | N/A | |
| Contrastive Multiview Coding | Yonglong Tian, Dilip Krishnan, Phillip Isola | N/A | |
| Regional Homogeneity: Towards Learning Transferable Universal Adversarial Perturbations Against Defenses | Yingwei Li, Song Bai, Cihang Xie, Zhenyu Liao, Xiaohui Shen, Alan Yuille | N/A | |
| Generative Low-bitwidth Data Free Quantization | Shoukai Xu, Haokun Li, Bohan Zhuang, Jing Liu, Jiezhang Cao, Chuangrun Liang, Mingkui Tan | N/A | |
| Local Correlation Consistency for Knowledge Distillation | Xiaojie Li, Jianlong Wu, Hongyu Fang, Yue Liao, Fei Wang, Chen Qian | N/A | |
| Perceiving 3D Human-Object Spatial Arrangements from a Single Image in the Wild | Jason Y. Zhang, Sam Pepose, Hanbyul Joo, Deva Ramanan, Jitendra Malik, Angjoo Kanazawa | N/A | |
| Sep-Stereo: Visually Guided Stereophonic Audio Generation by Associating Source Separation | Hang Zhou, Xudong Xu, Dahua Lin, Xiaogang Wang, Ziwei Liu | N/A | |
| CelebA-Spoof: Large-Scale Face Anti-Spoofing Dataset with Rich Annotations | Yuanhan Zhang, ZhenFei Yin, Yidong Li, Guojun Yin, Junjie Yan, Jing Shao, Ziwei Liu | N/A | |
| Thinking in Frequency: Face Forgery Detection by Mining Frequency-aware Clues | Yuyang Qian, Guojun Yin, Lu Sheng, Zixuan Chen, Jing Shao | N/A | |
| Weakly-Supervised Cell Tracking via Backward-and-Forward Propagation | Kazuya Nishimura, Junya Hayashida, Chenyang Wang, Dai Fei Elmer Ker, Ryoma Bise | N/A | |
| SeqHAND: RGB-Sequence-Based 3D Hand Pose and Shape Estimation | John Yang, Hyung Jin Chang, Seungeui Lee, Nojun Kwak | N/A | |
| Rethinking the Distribution Gap of Person Re-identification with Camera-based Batch Normalization | Zijie Zhuang, Longhui Wei, Lingxi Xie, Tianyu Zhang, Hengheng Zhang, Haozhe Wu, Haizhou Ai, Qi Tian | N/A | |
| AMLN: Adversarial-based Mutual Learning Network for Online Knowledge Distillation | Xiaobing Zhang, Shijian Lu, Haigang Gong, Zhipeng Luo, Ming Liu | N/A | |
| Online Multi-modal Person Search in Videos | Jiangyue Xia, Anyi Rao, Qingqiu Huang, Linning Xu, Jiangtao Wen, Dahua Lin | N/A | |
| Single Image Super-Resolution via a Holistic Attention Network | Ben Niu, Weilei Wen, Wenqi Ren, Xiangde Zhang, Lianping Yang, Shuzhen Wang, Kaihao Zhang, Xiaochun Cao, Haifeng Shen | N/A | |
| Can You Read Me Now? Content Aware Rectification using Angle Supervision | Amir Markovitz, Inbal Lavi, Or Perel, Shai Mazor, Roee Litman | N/A | |
| Momentum Batch Normalization for Deep Learning with Small Batch Size | Hongwei Yong, Jianqiang Huang, Deyu Meng, Xiansheng Hua, Lei Zhang | N/A | |
| AdvPC: Transferable Adversarial Perturbations on 3D Point Clouds | Abdullah Hamdi, Sara Rojas, Ali Thabet, Bernard Ghanem | N/A | |
| Edge-aware Graph Representation Learning and Reasoning for Face Parsing | Gusi Te, Yinglu Liu, Wei Hu, Hailin Shi, Tao Mei | N/A | |
| BBS-Net: RGB-D Salient Object Detection with a Bifurcated Backbone Strategy Network | Deng-Ping Fan, Yingjie Zhai, Ali Borji, Jufeng Yang, Ling Shao | N/A | |
| G-LBM: Generative Low-dimensional Background Model Estimation from Video Sequences | Behnaz Rezaei, Amirreza Farnoosh, Sarah Ostadabbas | N/A | |
| H3DNet: 3D Object Detection Using Hybrid Geometric Primitives | Zaiwei Zhang, Bo Sun, Haitao Yang, Qixing Huang | N/A | |
| Expressive Telepresence via Modular Codec Avatars | Hang Chu, Shugao Ma, Fernando De la Torre, Sanja Fidler, Yaser Sheikh | N/A | |
| Cascade Graph Neural Networks for RGB-D Salient Object Detection | Ao Luo, Xin Li, Fan Yang, Zhicheng Jiao, Hong Cheng, Siwei Lyu | N/A | |
| FairALM: Augmented Lagrangian Method for Training Fair Models with Little Regret | Vishnu Suresh Lokhande, Aditya Kumar Akash, Sathya N. Ravi, Vikas Singh | N/A | |
| Generating Videos of Zero-Shot Compositions of Actions and Objects | Megha Nawhal, Mengyao Zhai, Andreas Lehrmann, Leonid Sigal, Greg Mori | N/A | |
| ViTAA: Visual-Textual Attributes Alignment in Person Search by Natural Language | Zhe Wang, Zhiyuan Fang, Jun Wang, Yezhou Yang | N/A | |
| Renovating Parsing R-CNN for Accurate Multiple Human Parsing | Lu Yang, Qing Song, Zhihui Wang, Mengjie Hu, Chun Liu, Xueshi Xin, Wenhe Jia, Songcen Xu | N/A | |
| Multi-Task Curriculum Framework for Open-Set Semi-Supervised Learning | Qing Yu, Daiki Ikami, Go Irie, Kiyoharu Aizawa | N/A | |
| Gradient-Induced Co-Saliency Detection | Zhao Zhang, Wenda Jin, Jun Xu, Ming-Ming Cheng | N/A | |
| Nighttime Defogging Using High-Low Frequency Decomposition and Grayscale-Color Networks | Wending Yan, Robby T. Tan, Dengxin Dai | N/A | |
| SegFix: Model-Agnostic Boundary Refinement for Segmentation | Yuhui Yuan, Jingyi Xie, Xilin Chen, Jingdong Wang | N/A | |
| Spatio-Temporal Graph Transformer Networks for Pedestrian Trajectory Prediction | Cunjun Yu, Xiao Ma, Jiawei Ren, Haiyu Zhao, Shuai Yi | N/A | |
| Fast Bi-layer Neural Synthesis of One-Shot Realistic Head Avatars | Egor Zakharov, Aleksei Ivakhnenko, Aliaksandra Shysheya, Victor Lempitsky | N/A | |
| Neural Geometric Parser for Single Image Camera Calibration | Jinwoo Lee, Minhyuk Sung, Hyunjoon Lee, Junho Kim | N/A | |
| Learning Flow-based Feature Warping for Face Frontalization with Illumination Inconsistent Supervision | Yuxiang Wei, Ming Liu, Haolin Wang, Ruifeng Zhu, Guosheng Hu, Wangmeng Zuo | N/A | |
| Learning Architectures for Binary Networks | Dahyun Kim, Kunal Pratap Singh, Jonghyun Choi | N/A | |
| Semantic View Synthesis | Hsin-Ping Huang, Hung-Yu Tseng, Hsin-Ying Lee, Jia-Bin Huang | N/A | |
| An Analysis of Sketched IRLS for Accelerated Sparse Residual Regression | Daichi Iwata, Michael Waechter, Wen-Yan Lin, Yasuyuki Matsushita | N/A | |
| Relative Pose from Deep Learned Depth and a Single Affine Correspondence | Ivan Eichhardt, Daniel Barath | N/A | |
| Video Super-Resolution with Recurrent Structure-Detail Network | Takashi Isobe, Xu Jia, Shuhang Gu, Songjiang Li, Shengjin Wang, Qi Tian | N/A | |
| Shape Adaptor: A Learnable Resizing Module | Shikun Liu, Zhe Lin, Yilin Wang, Jianming Zhang, Federico Perazzi, Edward Johns | N/A | |
| Shuffle and Attend: Video Domain Adaptation | Jinwoo Choi, Gaurav Sharma, Samuel Schulter, Jia-Bin Huang | N/A | |
| DRG: Dual Relation Graph for Human-Object Interaction Detection | Chen Gao, Jiarui Xu, Yuliang Zou, Jia-Bin Huang | N/A | |
| Flow-edge Guided Video Completion | Chen Gao, Ayush Saraf, Jia-Bin Huang, Johannes Kopf | N/A | |
| End-to-End Trainable Deep Active Contour Models for Automated Image Segmentation: Delineating Buildings in Aerial Imagery | Ali Hatamizadeh, Debleena Sengupta, Demetri Terzopoulos | N/A | |
| Towards End-to-end Video-based Eye-Tracking | Seonwook Park, Emre Aksan, Xucong Zhang, Otmar Hilliges | N/A | |
| Generating Handwriting via Decoupled Style Descriptors | Atsunobu Kotani, Stefanie Tellex, James Tompkin | N/A | |
| LEED: Label-Free Expression Editing via Disentanglement | Rongliang Wu, Shijian Lu | N/A | |
| Fashion Captioning: Towards Generating Accurate Descriptions with Semantic Rewards | Xuewen Yang, Heming Zhang, Di Jin, Yingru Liu, Chi-Hao Wu, Jianchao Tan, Dongliang Xie, Jue Wang, Xin Wang | N/A | |
| Reducing Language Biases in Visual Question Answering with Visually-Grounded Question Encoder | Gouthaman KV, Anurag Mittal | N/A | |
| Unsupervised Cross-Modal Alignment for Multi-Person 3D Pose Estimation | Jogendra Nath Kundu, Ambareesh Revanur, Govind Vitthal Waghmare, Rahul Mysore Venkatesh, R. Venkatesh Babu | N/A | |
| Class-Incremental Domain Adaptation | Jogendra Nath Kundu, Rahul Mysore Venkatesh, Naveen Venkat, Ambareesh Revanur, R. Venkatesh Babu | N/A | |
| Anti-Bandit Neural Architecture Search for Model Defense | Hanlin Chen, Baochang Zhang, Song Xue, Xuan Gong, Hong Liu, Rongrong Ji, David Doermann | N/A | |
| Wavelet-Based Dual-Branch Network for Image Demoiréing | Lin Liu, Jianzhuang Liu, Shanxin Yuan, Gregory Slabaugh, Aleš Leonardis, Wengang Zhou, Qi Tian | N/A | |
| Low Light Video Enhancement using Synthetic Data Produced with an Intermediate Domain Mapping | Danai Triantafyllidou, Sean Moran, Steven McDonagh, Sarah Parisot, Gregory Slabaugh | N/A | |
| Non-Local Spatial Propagation Network for Depth Completion | Jinsun Park, Kyungdon Joo, Zhe Hu, Chi-Kuei Liu, In So Kweon | N/A | |
| DanbooRegion: An Illustration Region Dataset | Lvmin Zhang, Yi Ji, Chunping Liu | N/A | |
| Event Enhanced High-Quality Image Recovery | Bishan Wang, Jingwei He, Lei Yu, Gui-Song Xia, Wen Yang | N/A | |
| PackDet: Packed Long-Head Object Detector | Kun Ding, Guojin He, Huxiang Gu, Zisha Zhong, Shiming Xiang, Chunhong Pan | N/A | |
| A Generic Graph-based Neural Architecture Encoding Scheme for Predictor-based NAS | Xuefei Ning, Yin Zheng, Tianchen Zhao, Yu Wang, Huazhong Yang | N/A | |
| Learning Semantic Neural Tree for Human Parsing | Ruyi Ji, Dawei Du, Libo Zhang, Longyin Wen, Yanjun Wu, Chen Zhao, Feiyue Huang, Siwei Lyu | N/A | |
| Sketching Image Gist: Human-Mimetic Hierarchical Scene Graph Generation | Wenbin Wang, Ruiping Wang, Shiguang Shan, Xilin Chen | N/A | |
| Burst Denoising via Temporally Shifted Wavelet Transforms | Xuejian Rong, Denis Demandolx, Kevin Matzen, Priyam Chatterjee, Yingli Tian | N/A | |
| JSSR: A Joint Synthesis, Segmentation, and Registration System for 3D Multi-Modal Image Alignment of Large-scale Pathological CT Scans | Fengze Liu, Jinzheng Cai, Yuankai Huo, Chi-Tung Cheng, Ashwin Raju, Dakai Jin, Jing Xiao, Alan Yuille, Le Lu, ChienHung Liao, Adam P. Harrison | N/A | |
| SimAug: Learning Robust Representations from Simulation for Trajectory Prediction | Junwei Liang, Lu Jiang, Alexander Hauptmann | N/A | |
| ScribbleBox: Interactive Annotation Framework for Video Object Segmentation | Bowen Chen, Huan Ling, Xiaohui Zeng, Jun Gao, Ziyue Xu, Sanja Fidler | N/A | |
| Rethinking Pseudo-LiDAR Representation | Xinzhu Ma, Shinan Liu, Zhiyi Xia, Hongwen Zhang, Xingyu Zeng, Wanli Ouyang | N/A | |
| Deep Multi Depth Panoramas for View Synthesis | Kai-En Lin, Zexiang Xu, Ben Mildenhall, Pratul P. Srinivasan, Yannick Hold-Geoffroy, Stephen DiVerdi, Qi Sun, Kalyan Sunkavalli, Ravi Ramamoorthi | N/A | |
| MINI-Net: Multiple Instance Ranking Network for Video Highlight Detection | Fa-Ting Hong, Xuanteng Huang, Wei-Hong Li, Wei-Shi Zheng | N/A | |
| ContactPose: A Dataset of Grasps with Object Contact and Hand Pose | Samarth Brahmbhatt, Chengcheng Tang, Christopher D. Twigg, Charles C. Kemp, James Hays | N/A | |
| API-Net: Robust Generative Classifier via a Single Discriminator | Xinshuai Dong, Hong Liu, Rongrong Ji, Liujuan Cao, Qixiang Ye, Jianzhuang Liu, Qi Tian | N/A | |
| Bias-based Universal Adversarial Patch Attack for Automatic Check-out | Aishan Liu, Jiakai Wang, Xianglong Liu, Bowen Cao, Chongzhi Zhang, Hang Yu | N/A | |
| Imbalanced Continual Learning with Partitioning Reservoir Sampling | Chris Dongjoo Kim, Jinseo Jeong, Gunhee Kim | N/A | |
| Guided Collaborative Training for Pixel-wise Semi-Supervised Learning | Zhanghan Ke, Di Qiu, Kaican Li, Qiong Yan, Rynson W.H. Lau | N/A | |
| Stacking Networks Dynamically for Image Restoration Based on the Plug-and-Play Framework | Haixin Wang, Tianhao Zhang, Muzhi Yu, Jinan Sun, Wei Ye, Chen Wang, Shikun Zhang | N/A | |
| Efficient Transfer Learning via Joint Adaptation of Network Architecture and Weight | Ming Sun, Haoxuan Dou, Junjie Yan | N/A | |
| Spatial Attention Pyramid Network for Unsupervised Domain Adaptation | Congcong Li, Dawei Du, Libo Zhang, Longyin Wen, Tiejian Luo, Yanjun Wu, Pengfei Zhu | N/A | |
| GSIR: Generalizable 3D Shape Interpretation and Reconstruction | Jianren Wang, Zhaoyuan Fang | N/A | |
| Weakly Supervised 3D Object Detection from Lidar Point Cloud | Qinghao Meng, Wenguan Wang, Tianfei Zhou, Jianbing Shen, Luc Van Gool, Dengxin Dai | N/A | |
| Two-phase Pseudo Label Densification for Self-training based Domain Adaptation | Inkyu Shin, Sanghyun Woo, Fei Pan, In So Kweon | N/A | |
| Adaptive Offline Quintuplet Loss for Image-Text Matching | Tianlang Chen, Jiajun Deng, Jiebo Luo | N/A | |
| Learning Object Placement by Inpainting for Compositional Data Augmentation | Lingzhi Zhang, Tarmily Wen, Jie Min, Jiancong Wang, David Han, Jianbo Shi | N/A | |
| Deep Vectorization of Technical Drawings | Vage Egiazarian, Oleg Voynov, Alexey Artemov, Denis Volkhonskiy, Aleksandr Safin, Maria Taktasheva, Denis Zorin, Evgeny Burnaev | N/A | |
| CAD-Deform: Deformable Fitting of CAD Models to 3D Scans | Vladislav Ishimtsev, Alexey Bokhovkin, Alexey Artemov, Savva Ignatyev, Matthias Niessner, Denis Zorin, Evgeny Burnaev | N/A | |
| An Image Enhancing Pattern-based Sparsity for Real-time Inference on Mobile Devices | Xiaolong Ma, Wei Niu, Tianyun Zhang, Sijia Liu, Sheng Lin, Hongjia Li, Wujie Wen, Xiang Chen, Jian Tang, Kaisheng Ma, Bin Ren, Yanzhi Wang | N/A | |
| AutoTrajectory: Label-free Trajectory Extraction and Prediction from Videos using Dynamic Points | Yuexin Ma, Xinge Zhu, Xinjing Cheng, Ruigang Yang, Jiming Liu, Dinesh Manocha | N/A | |
| Multi-Agent Embodied Question Answering in Interactive Environments | Sinan Tan, Weilai Xiang, Huaping Liu, Di Guo, Fuchun Sun | N/A | |
| Conditional Sequential Modulation for Efficient Global Image Retouching | Jingwen He, Yihao Liu, Yu Qiao, Chao Dong | N/A | |
| Segmenting Transparent Objects in the Wild | Enze Xie, Wenjia Wang, Wenhai Wang, Mingyu Ding, Chunhua Shen, Ping Luo | N/A | |
| Length-Controllable Image Captioning | Chaorui Deng, Ning Ding, Mingkui Tan, Qi Wu | N/A | |
| Few-Shot Semantic Segmentation with Democratic Attention Networks | Haochen Wang, Xudong Zhang, Yutao Hu, Yandan Yang, Xianbin Cao, Xiantong Zhen | N/A | |
| Defocus Blur Detection via Depth Distillation | Xiaodong Cun, Chi-Man Pun | N/A | |
| Motion Guided 3D Pose Estimation from Videos | Jingbo Wang, Sijie Yan, Yuanjun Xiong, Dahua Lin | N/A | |
| Reflection Separation via Multi-bounce Polarization State Tracing | Rui Li, Simeng Qiu, Guangming Zang, Wolfgang Heidrich | N/A | |
| SipMask: Spatial Information Preservation for Fast Image and Video Instance Segmentation | Jiale Cao, Rao Muhammad Anwer, Hisham Cholakkal, Fahad Shahbaz Khan, Yanwei Pang, Ling Shao | N/A | |
| SemanticAdv: Generating Adversarial Examples via Attribute-conditioned Image Editing | Haonan Qiu, Chaowei Xiao, Lei Yang, Xinchen Yan, Honglak Lee, Bo Li | N/A | |
| Learning with Noisy Class Labels for Instance Segmentation | Longrong Yang, Fanman Meng, Hongliang Li, Qingbo Wu, Qishang Cheng | N/A | |
| Deep Image Clustering with Category-Style Representation | Junjie Zhao, Donghuan Lu, Kai Ma, Yu Zhang, Yefeng Zheng | N/A | |
| Self-supervised Motion Representation via Scattering Local Motion Cues | Yuan Tian, Zhaohui Che, Wenbo Bao, Guangtao Zhai, Zhiyong Gao | N/A | |
| Improving Monocular Depth Estimation by Leveraging Structural Awareness and Complementary Datasets | Tian Chen, Shijie An, Yuan Zhang, Chongyang Ma, Huayan Wang, Xiaoyan Guo, Wen Zheng | N/A | |
| BMBC: Bilateral Motion Estimation with Bilateral Cost Volume for Video Interpolation | Junheum Park, Keunsoo Ko, Chul Lee, Chang-Su Kim | N/A | |
| Hard negative examples are hard, but useful | Hong Xuan, Abby Stylianou, Xiaotong Liu, Robert Pless | N/A | |
| ReActNet: Towards Precise Binary Neural Network with Generalized Activation Functions | Zechun Liu, Zhiqiang Shen, Marios Savvides, Kwang-Ting Cheng | N/A | |
| Video Object Detection via Object-level Temporal Aggregation | Chun-Han Yao, Chen Fang, Xiaohui Shen, Yangyue Wan, Ming-Hsuan Yang | N/A | |
| Object Detection with a Unified Label Space from Multiple Datasets | Xiangyun Zhao, Samuel Schulter, Gaurav Sharma, Yi-Hsuan Tsai, Manmohan Chandraker, Ying Wu | N/A | |
| Lift, Splat, Shoot: Encoding Images from Arbitrary Camera Rigs by Implicitly Unprojecting to 3D | Jonah Philion, Sanja Fidler | N/A | |
| Comprehensive Image Captioning via Scene Graph Decomposition | Yiwu Zhong, Liwei Wang, Jianshu Chen, Dong Yu, Yin Li | N/A | |
| Symbiotic Adversarial Learning for Attribute-based Person Search | Yu-Tong Cao, Jingya Wang, Dacheng Tao | N/A | |
| Amplifying Key Cues for Human-Object-Interaction Detection | Yang Liu, Qingchao Chen, Andrew Zisserman | N/A | |
| Rethinking Few-shot Image Classification: A Good Embedding is All You Need? | Yonglong Tian, Yue Wang, Dilip Krishnan, Joshua B. Tenenbaum, Phillip Isola | N/A | |
| Adversarial Background-Aware Loss for Weakly-supervised Temporal Activity Localization | Kyle Min, Jason J. Corso | N/A | |
| Action Localization through Continual Predictive Learning | Sathyanarayanan Aakur, Sudeep Sarkar | N/A | |
| Generative View-Correlation Adaptation for Semi-Supervised Multi-View Learning | Yunyu Liu, Lichen Wang, Yue Bai, Can Qin, Zhengming Ding, Yun Fu | N/A | |
| READ: Reciprocal Attention Discriminator for Image-to-Video Re-Identification | Minho Shim, Hsuan-I Ho, Jinhyung Kim, Dongyoon Wee | N/A | |
| 3D Human Shape Reconstruction from a Polarization Image | Shihao Zou, Xinxin Zuo, Yiming Qian, Sen Wang, Chi Xu, Minglun Gong, Li Cheng | N/A | |
| The Devil is in the Details: Self-Supervised Attention for Vehicle Re-Identification | Pirazh Khorramshahi, Neehar Peri, Jun-cheng Chen, Rama Chellappa | N/A | |
| Improving One-stage Visual Grounding by Recursive Sub-query Construction | Zhengyuan Yang, Tianlang Chen, Liwei Wang, Jiebo Luo | N/A | |
| Multi-level Wavelet-based Generative Adversarial Network for Perceptual Quality Enhancement of Compressed Video | Jianyi Wang, Xin Deng, Mai Xu, Congyong Chen, Yuhang Song | N/A | |
| Example-Guided Image Synthesis using Masked Spatial-Channel Attention and Self-Supervision | Haitian Zheng, Haofu Liao, Lele Chen, Wei Xiong, Tianlang Chen, Jiebo Luo | N/A | |
| Content-Consistent Matching for Domain Adaptive Semantic Segmentation | Guangrui Li, Guoliang Kang, Wu Liu, Yunchao Wei, Yi Yang | N/A | |
| AE TextSpotter: Learning Visual and Linguistic Representation for Ambiguous Text Spotting | Wenhai Wang, Xuebo Liu, Xiaozhong Ji, Enze Xie, Ding Liang, ZhiBo Yang, Tong Lu, Chunhua Shen, Ping Luo | N/A | |
| History Repeats Itself: Human Motion Prediction via Motion Attention | Wei Mao, Miaomiao Liu, Mathieu Salzmann | N/A | |
| Unsupervised Video Object Segmentation with Joint Hotspot Tracking | Lu Zhang, Jianming Zhang, Zhe Lin, Radomír Měch, Huchuan Lu, You He | N/A | |
| SRNet: Improving Generalization in 3D Human Pose Estimation with a Split-and-Recombine Approach | Ailing Zeng, Xiao Sun, Fuyang Huang, Minhao Liu, Qiang Xu, Stephen Lin | N/A | |
| CAFE-GAN: Arbitrary Face Attribute Editing with Complementary Attention Feature | Jeong gi Kwak, David K. Han, Hanseok Ko | N/A | |
| MimicDet: Bridging the Gap Between One-Stage and Two-Stage Object Detection | Xin Lu, Quanquan Li, Buyu Li, Junjie Yan | N/A | |
| Latent Topic-aware Multi-Label Classification | Jianghong Ma, Yang Liu | N/A | |
| Finding It at Another Side: A Viewpoint-Adapted Matching Encoder for Change Captioning | Xiangxi Shi, Xu Yang, Jiuxiang Gu, Shafiq Joty, Jianfei Cai | N/A | |
| Attract, Perturb, and Explore: Learning a Feature Alignment Network for Semi-supervised Domain Adaptation | Taekyung Kim, Changick Kim | N/A | |
| Curriculum Manager for Source Selection in Multi-Source Domain Adaptation | Luyu Yang, Yogesh Balaji, Ser-Nam Lim, Abhinav Shrivastava | N/A | |
| Powering One-shot Topological NAS with Stabilized Share-parameter Proxy | Ronghao Guo, Chen Lin, Chuming Li, Keyu Tian, Ming Sun, Lu Sheng, Junjie Yan | N/A | |
| Classes Matter: A Fine-grained Adversarial Approach to Cross-domain Semantic Segmentation | Haoran Wang, Tong Shen, Wei Zhang, Ling-Yu Duan, Tao Mei | N/A | |
| Boundary-preserving Mask R-CNN | Tianheng Cheng, Xinggang Wang, Lichao Huang, Wenyu Liu | N/A | |
| Self-supervised Single-view 3D Reconstruction via Semantic Consistency | Xueting Li, Sifei Liu, Kihwan Kim, Shalini De Mello, Varun Jampani, Ming-Hsuan Yang, Jan Kautz | N/A | |
| MetaDistiller: Network Self-Boosting via Meta-Learned Top-Down Distillation | Benlin Liu, Yongming Rao, Jiwen Lu, Jie Zhou, Cho-Jui Hsieh | N/A | |
| Learning Monocular Visual Odometry via Self-Supervised Long-Term Modeling | Yuliang Zou, Pan Ji, Quoc-Huy Tran, Jia-Bin Huang, Manmohan Chandraker | N/A | |
| The Devil is in Classification: A Simple Framework for Long-tail Instance Segmentation | Tao Wang, Yu Li, Bingyi Kang, Junnan Li, Junhao Liew, Sheng Tang, Steven Hoi, Jiashi Feng | N/A | |
| What is Learned in Deep Uncalibrated Photometric Stereo? | Guanying Chen, Michael Waechter, Boxin Shi, Kwan-Yee K. Wong, Yasuyuki Matsushita | N/A | |
| Prior-based Domain Adaptive Object Detection for Hazy and Rainy Conditions | Vishwanath A. Sindagi, Poojan Oza, Rajeev Yasarla, Vishal M. Patel | N/A | |
| Adversarial Ranking Attack and Defense | Mo Zhou, Zhenxing Niu, Le Wang, Qilin Zhang, Gang Hua | N/A | |
| ReDro: Efficiently Learning Large-sized SPD Visual Representation | Saimunur Rahman, Lei Wang, Changming Sun, Luping Zhou | N/A | |
| Graph-Based Social Relation Reasoning | Wanhua Li, Yueqi Duan, Jiwen Lu, Jianjiang Feng, Jie Zhou | N/A | |
| EPNet: Enhancing Point Features with Image Semantics for 3D Object Detection | Tengteng Huang, Zhe Liu, Xiwu Chen, Xiang Bai | N/A | |
| Self-Supervised Monocular 3D Face Reconstruction by Occlusion-Aware Multi-view Geometry Consistency | Jiaxiang Shang, Tianwei Shen, Shiwei Li, Lei Zhou, Mingmin Zhen, Tian Fang, Long Quan | N/A | |
| Asynchronous Interaction Aggregation for Action Detection | Jiajun Tang, Jin Xia, Xinzhi Mu, Bo Pang, Cewu Lu | N/A | |
| Shape and Viewpoint without Keypoints | Shubham Goel, Angjoo Kanazawa, Jitendra Malik | N/A | |
| Learning Attentive and Hierarchical Representations for 3D Shape Recognition | Jiaxin Chen, Jie Qin, Yuming Shen, Li Liu, Fan Zhu, Ling Shao | N/A | |
| TF-NAS: Rethinking Three Search Freedoms of Latency-Constrained Differentiable Neural Architecture Search | Yibo Hu, Xiang Wu, Ran He | N/A | |
| Associative3D: Volumetric Reconstruction from Sparse Views | Shengyi Qian, Linyi Jin, David F. Fouhey | N/A | |
| PlugNet: Degradation Aware Scene Text Recognition Supervised by a Pluggable Super-Resolution Unit | Yongqiang Mou, Lei Tan, Hui Yang, Jingying Chen, Leyuan Liu, Rui Yan, Yaohong Huang | N/A | |
| Memory Selection Network for Video Propagation | Ruizheng Wu, Huaijia Lin, Xiaojuan Qi, Jiaya Jia | N/A | |
| Disentangled Non-local Neural Networks | Minghao Yin, Zhuliang Yao, Yue Cao, Xiu Li, Zheng Zhang, Stephen Lin, Han Hu | N/A | |
| URVOS: Unified Referring Video Object Segmentation Network with a Large-Scale Benchmark | Seonguk Seo, Joon-Young Lee, Bohyung Han | N/A | |
| Generalizing Person Re-Identification by Camera-Aware Invariance Learning and Cross-Domain Mixup | Chuanchen Luo, Chunfeng Song, Zhaoxiang Zhang | N/A | |
| Semi-Supervised Crowd Counting via Self-Training on Surrogate Tasks | Yan Liu, Lingqiao Liu, Peng Wang, Pingping Zhang, Yinjie Lei | N/A | |
| Dynamic R-CNN: Towards High Quality Object Detection via Dynamic Training | Hongkai Zhang, Hong Chang, Bingpeng Ma, Naiyan Wang, Xilin Chen | N/A | |
| Boosting Decision-based Black-box Adversarial Attacks with Random Sign Flip | Weilun Chen, Zhaoxiang Zhang, Xiaolin Hu, Baoyuan Wu | N/A | |
| Knowledge Transfer via Dense Cross-Layer Mutual-Distillation | Anbang Yao, Dawei Sun | N/A | |
| Matching Guided Distillation | Kaiyu Yue, Jiangfan Deng, Feng Zhou | N/A | |
| Clustering Driven Deep Autoencoder for Video Anomaly Detection | Yunpeng Chang, Zhigang Tu, Wei Xie, Junsong Yuan | N/A | |
| Learning to Compose Hypercolumns for Visual Correspondence | Juhong Min, Jongmin Lee, Jean Ponce, Minsu Cho | N/A | |
| Stochastic Bundle Adjustment for Efficient and Scalable 3D Reconstruction | Lei Zhou, Zixin Luo, Mingmin Zhen, Tianwei Shen, Shiwei Li, Zhuofei Huang, Tian Fang, Long Quan | N/A | |
| Object-based Illumination Estimation with Rendering-aware Neural Networks | Xin Wei, Guojun Chen, Yue Dong, Stephen Lin, Xin Tong | N/A | |
| Progressive Point Cloud Deconvolution Generation Network | Le Hui, Rui Xu, Jin Xie, Jianjun Qian, Jian Yang | N/A | |
| SSCGAN: Facial Attribute Editing via Style Skip Connections | Wenqing Chu, Ying Tai, Chengjie Wang, Jilin Li, Feiyue Huang, Rongrong Ji | N/A | |
| Negative Pseudo Labeling using Class Proportion for Semantic Segmentation in Pathology | Hiroki Tokunaga, Brian Kenji Iwana, Yuki Teramoto, Akihiko Yoshizawa, Ryoma Bise | N/A | |
| Learn to Propagate Reliably on Noisy Affinity Graphs | Lei Yang, Qingqiu Huang, Huaiyi Huang, Linning Xu, Dahua Lin | N/A | |
| Fair DARTS: Eliminating Unfair Advantages in Differentiable Architecture Search | Xiangxiang Chu, Tianbao Zhou, Bo Zhang, Jixiang Li | N/A | |
| TANet: Towards Fully Automatic Tooth Arrangement | Guodong Wei, Zhiming Cui, Yumeng Liu, Nenglun Chen, Runnan Chen, Guiqing Li, Wenping Wang | N/A | |
| UnionDet: Union-Level Detector Towards Real-Time Human-Object Interaction Detection | Bumsoo Kim, Taeho Choi, Jaewoo Kang, Hyunwoo J. Kim | N/A | |
| GSNet: Joint Vehicle Pose and Shape Reconstruction with Geometrical and Scene-aware Supervision | Lei Ke, Shichao Li, Yanan Sun, Yu-Wing Tai, Chi-Keung Tang | N/A | |
| Resolution Switchable Networks for Runtime Efficient Image Recognition | Yikai Wang, Fuchun Sun, Duo Li, Anbang Yao | N/A | |
| SMAP: Single-Shot Multi-Person Absolute 3D Pose Estimation | Jianan Zhen, Qi Fang, Jiaming Sun, Wentao Liu, Wei Jiang, Hujun Bao, Xiaowei Zhou | N/A | |
| Learning to Detect Open Classes for Universal Domain Adaptation | Bo Fu, Zhangjie Cao, Mingsheng Long, Jianmin Wang | N/A | |
| Visual Compositional Learning for Human-Object Interaction Detection | Zhi Hou, Xiaojiang Peng, Yu Qiao, Dacheng Tao | N/A | |
| Deep Plastic Surgery: Robust and Controllable Image Editing with Human-Drawn Sketches | Shuai Yang, Zhangyang Wang, Jiaying Liu, Zongming Guo | N/A | |
| Rethinking Class Activation Mapping for Weakly Supervised Object Localization | Wonho Bae, Junhyug Noh, Gunhee Kim | N/A | |
| OS2D: One-Stage One-Shot Object Detection by Matching Anchor Features | Anton Osokin, Denis Sumin, Vasily Lomakin | N/A | |
| Interpretable Neural Network Decoupling | Yuchao Li, Rongrong Ji, Shaohui Lin, Baochang Zhang, Chenqian Yan, Yongjian Wu, Feiyue Huang, Ling Shao | N/A | |
| Omni-sourced Webly-supervised Learning for Video Recognition | Haodong Duan, Yue Zhao, Yuanjun Xiong, Wentao Liu, Dahua Lin | N/A | |
| CurveLane-NAS: Unifying Lane-Sensitive Architecture Search and Adaptive Point Blending | Hang Xu, Shaoju Wang, Xinyue Cai, Wei Zhang, Xiaodan Liang, Zhenguo Li | N/A | |
| Contextual-Relation Consistent Domain Adaptation for Semantic Segmentation | Jiaxing Huang, Shijian Lu, Dayan Guan, Xiaobing Zhang | N/A | |
| Estimating People Flows to Better Count Them in Crowded Scenes | Weizhe Liu, Mathieu Salzmann, Pascal Fua | N/A | |
| Generate to Adapt: Resolution Adaption Network for Surveillance Face Recognition | Han Fang, Weihong Deng, Yaoyao Zhong, Jiani Hu | N/A | |
| Learning Feature Embeddings for Discriminant Model based Tracking | Linyu Zheng, Ming Tang, Yingying Chen, Jinqiao Wang, Hanqing Lu | N/A | |
| WeightNet: Revisiting the Design Space of Weight Networks | Ningning Ma, Xiangyu Zhang, Jiawei Huang, Jian Sun | N/A | |
| Partially-Shared Variational Auto-encoders for Unsupervised Domain Adaptation with Target Shift | Ryuhei Takahashi, Atsushi Hashimoto, Motoharu Sonogashira, Masaaki Iiyama | N/A | |
| Learning Where to Focus for Efficient Video Object Detection | Zhengkai Jiang, Yu Liu, Ceyuan Yang, Jihao Liu, Peng Gao, Qian Zhang, Shiming Xiang, Chunhong Pan | N/A | |
| Learning Object Permanence from Video | Aviv Shamsian, Ofri Kleinfeld, Amir Globerson, Gal Chechik | N/A | |
| Adaptive Text Recognition through Visual Matching | Chuhan Zhang, Ankush Gupta, Andrew Zisserman | N/A | |
| Actions as Moving Points | Yixuan Li, Zixu Wang, Limin Wang, Gangshan Wu | N/A | |
| Learning to Exploit Multiple Vision Modalities by Using Grafted Networks | Yuhuang Hu, Tobi Delbruck, Shih-Chii Liu | N/A | |
| Geometric Correspondence Fields: Learned Differentiable Rendering for 3D Pose Refinement in the Wild | Alexander Grabner, Yaming Wang, Peizhao Zhang, Peihong Guo, Tong Xiao, Peter Vajda, Peter M. Roth, Vincent Lepetit | N/A | |
| 3D Fluid Flow Reconstruction Using Compact Light Field PIV | Zhong Li, Yu Ji, Jingyi Yu, Jinwei Ye | N/A | |
| Contextual Diversity for Active Learning | Sharat Agarwal, Himanshu Arora, Saket Anand, Chetan Arora | N/A | |
| Temporal Aggregate Representations for Long-Range Video Understanding | Fadime Sener, Dipika Singhania, Angela Yao | N/A | |
| Stochastic Fine-grained Labeling of Multi-state Sign Glosses for Continuous Sign Language Recognition | Zhe Niu, Brian Mak | N/A | |
| General 3D Room Layout from a Single View by Render-and-Compare | Sinisa Stekovic, Shreyas Hampali, Mahdi Rad, Sayan Deb Sarkar, Friedrich Fraundorfer, Vincent Lepetit | N/A | |
| Neural Dense Non-Rigid Structure from Motion with Latent Space Constraints | Vikramjit Sidhu, Edgar Tretschk, Vladislav Golyanik, Antonio Agudo, Christian Theobalt | N/A | |
| Multimodal Memorability: Modeling Effects of Semantics and Decay on Video Memorability | Anelise Newman, Camilo Fosco, Vincent Casser, Allen Lee, Barry McNamara, Aude Oliva | N/A | |
| Yet Another Intermediate-Level Attack | Qizhang Li, Yiwen Guo, Hao Chen | N/A | |
| Topology-Change-Aware Volumetric Fusion for Dynamic Scene Reconstruction | Chao Li, Xiaohu Guo | N/A | |
| Early Exit Or Not: Resource-Efficient Blind Quality Enhancement for Compressed Images | Qunliang Xing, Mai Xu, Tianyi Li, Zhenyu Guan | N/A | |
| PatchNets: Patch-Based Generalizable Deep Implicit 3D Shape Representations | Edgar Tretschk, Ayush Tewari, Vladislav Golyanik, Michael Zollhöfer, Carsten Stoll, Christian Theobalt | N/A | |
| How does Lipschitz Regularization Influence GAN Training? | Yipeng Qin, Niloy Mitra, Peter Wonka | N/A | |
| Infrastructure-based Multi-Camera Calibration using Radial Projections | Yukai Lin, Viktor Larsson, Marcel Geppert, Zuzana Kukelova, Marc Pollefeys, Torsten Sattler | N/A | |
| MotionSqueeze: Neural Motion Feature Learning for Video Understanding | Heeseung Kwon, Manjin Kim, Suha Kwak, Minsu Cho | N/A | |
| Polarized Optical-Flow Gyroscope | Masada Tzabari, Yoav Y. Schechner | N/A | |
| Online Meta-Learning for Multi-Source and Semi-Supervised Domain Adaptation | Da Li, Timothy Hospedales | N/A | |
| An Ensemble of Epoch-wise Empirical Bayes for Few-shot Learning | Yaoyao Liu, Bernt Schiele, Qianru Sun | N/A | |
| On the Effectiveness of Image Rotation for Open Set Domain Adaptation | Silvia Bucci, Mohammad Reza Loghmani, Tatiana Tommasi | N/A | |
| Combining Task Predictors via Enhancing Joint Predictability | Kwang In Kim, Christian Richardt, Hyung Jin Chang | N/A | |
| Multi-Scale Positive Sample Refinement for Few-Shot Object Detection | Jiaxi Wu, Songtao Liu, Di Huang, Yunhong Wang | N/A | |
| Single-Image Depth Prediction Makes Feature Matching Easier | Carl Toft, Daniyar Turmukhambetov, Torsten Sattler, Fredrik Kahl, Gabriel J. Brostow | N/A | |
| Deep Reinforced Attention Learning for Quality-Aware Visual Recognition | Duo Li, Qifeng Chen | N/A | |
| CFAD: Coarse-to-Fine Action Detector for Spatiotemporal Action Localization | Yuxi Li, Weiyao Lin, John See, Ning Xu, Shugong Xu, Ke Yan, Cong Yang | N/A | |
| Learning Joint Spatial-Temporal Transformations for Video Inpainting | Yanhong Zeng, Jianlong Fu, Hongyang Chao | N/A | |
| Single Path One-Shot Neural Architecture Search with Uniform Sampling | Zichao Guo, Xiangyu Zhang, Haoyuan Mu, Wen Heng, Zechun Liu, Yichen Wei, Jian Sun | N/A | |
| Learning to Generate Novel Domains for Domain Generalization | Kaiyang Zhou, Yongxin Yang, Timothy Hospedales, Tao Xiang | N/A | |
| Continuous Adaptation for Interactive Object Segmentation by Learning from Corrections | Theodora Kontogianni, Michael Gygli, Jasper Uijlings, Vittorio Ferrari | N/A | |
| Impact of base dataset design on few-shot image classification | Othman Sbai, Camille Couprie, Mathieu Aubry | N/A | |
| Invertible Zero-Shot Recognition Flows | Yuming Shen, Jie Qin, Lei Huang, Li Liu, Fan Zhu, Ling Shao | N/A | |
| GeoLayout: Geometry Driven Room Layout Estimation Based on Depth Maps of Planes | Weidong Zhang, Wei Zhang, Yinda Zhang | N/A | |
| Location Sensitive Image Retrieval and Tagging | Raul Gomez, Jaume Gibert, Lluis Gomez, Dimosthenis Karatzas | N/A | |
| Joint 3D Layout and Depth Prediction from a Single Indoor Panorama Image | Wei Zeng, Sezer Karaoglu, Theo Gevers | N/A | |
| Guessing State Tracking for Visual Dialogue | Wei Pang, Xiaojie Wang | N/A | |
| Memory-Efficient Incremental Learning Through Feature Adaptation | Ahmet Iscen, Jeffrey Zhang, Svetlana Lazebnik, Cordelia Schmid | N/A | |
| Neural Voice Puppetry: Audio-driven Facial Reenactment | Justus Thies, Mohamed Elgharib, Ayush Tewari, Christian Theobalt, Matthias Nießner | N/A | |
| One-Shot Unsupervised Cross-Domain Detection | Antonio D’Innocente, Francesco Cappio Borlino, Silvia Bucci, Barbara Caputo, Tatiana Tommasi | N/A | |
| Stochastic Frequency Masking to Improve Super-Resolution and Denoising Networks | Majed El Helou, Ruofan Zhou, Sabine Süsstrunk | N/A | |
| Probabilistic Future Prediction for Video Scene Understanding | Anthony Hu, Fergal Cotter, Nikhil Mohan, Corina Gurau, Alex Kendall | N/A | |
| Suppressing Mislabeled Data via Grouping and Self-Attention | Xiaojiang Peng, Kai Wang, Zhaoyang Zeng, Qing Li, Jianfei Yang, Yu Qiao | N/A | |
| Class-wise Dynamic Graph Convolution for Semantic Segmentation | Hanzhe Hu, Deyi Ji, Weihao Gan, Shuai Bai, Wei Wu, Junjie Yan | N/A | |
| Character-Preserving Coherent Story Visualization | Yun-Zhu Song, Zhi Rui Tam, Hung-Jen Chen, Huiao-Han Lu, Hong-Han Shuai | N/A | |
| GINet: Graph Interaction Network for Scene Parsing | Tianyi Wu, Yu Lu, Yu Zhu, Chuang Zhang, Ming Wu, Zhanyu Ma, Guodong Guo | N/A | |
| Tensor Low-Rank Reconstruction for Semantic Segmentation | Wanli Chen, Xinge Zhu, Ruoqi Sun, Junjun He, Ruiyu Li, Xiaoyong Shen, Bei Yu | N/A | |
| Attentive Normalization | Xilai Li, Wei Sun, Tianfu Wu | N/A | |
| Count- and Similarity-aware R-CNN for Pedestrian Detection | Jin Xie, Hisham Cholakkal, Rao Muhammad Anwer, Fahad Shahbaz Khan, Yanwei Pang, Ling Shao, Mubarak Shah | N/A | |
| TRADI: Tracking Deep Neural network Weight Distributions | Gianni Franchi, Andrei Bursuc, Emanuel Aldea, Séverine Dubuisson, Isabelle Bloch | N/A | |
| Spatiotemporal Attacks for Embodied Agents | Aishan Liu, Tairan Huang, Xianglong Liu, Yitao Xu, Yuqing Ma, Xinyun Chen, Stephen J. Maybank, Dacheng Tao | N/A | |
| Caption-Supervised Face Recognition: Training a State-of-the-Art Face Model without Manual Annotation | Qingqiu Huang, Lei Yang, Huaiyi Huang, Tong Wu, Dahua Lin | N/A | |
| Unselfie: Translating Selfies to Neutral-pose Portraits in the Wild | Liqian Ma, Zhe Lin, Connelly Barnes, Alexei A Efros, Jingwan Lu | N/A | |
| Design and Interpretation of Universal Adversarial Patches in Face Detection | Xiao Yang, Fangyun Wei, Hongyang Zhang, Jun Zhu | N/A | |
| Few-Shot Object Detection and Viewpoint Estimation for Objects in the Wild | Yang Xiao, Renaud Marlet | N/A | |
| Weakly Supervised 3D Hand Pose Estimation via Biomechanical Constraints | Adrian Spurr, Umar Iqbal, Pavlo Molchanov, Otmar Hilliges, Jan Kautz | N/A | |
| Dynamic Dual-Attentive Aggregation Learning for Visible-Infrared Person Re-Identification | Mang Ye, Jianbing Shen, David J. Crandall, Ling Shao, Jiebo Luo | N/A | |
| Contextual Heterogeneous Graph Network for Human-Object Interaction Detection | Hai Wang, Wei-shi Zheng, Ling Yingbiao | N/A | |
| Zero-Shot Image Super-Resolution with Depth Guided Internal Degradation Learning | Xi Cheng, Zhenyong Fu, Jian Yang | N/A | |
| A Closest Point Proposal for MCMC-based Probabilistic Surface Registration | Dennis Madsen, Andreas Morel-Forster, Patrick Kahr, Dana Rahbani, Thomas Vetter, Marcel Lüthi | N/A | |
| Interactive Video Object Segmentation Using Global and Local Transfer Modules | Yuk Heo, Yeong Jun Koh, Chang-Su Kim | N/A | |
| End-to-end Interpretable Learning of Non-blind Image Deblurring | Thomas Eboli, Jian Sun, Jean Ponce | N/A | |
| Employing Multi-Estimations for Weakly-Supervised Semantic Segmentation | Junsong Fan, Zhaoxiang Zhang, Tieniu Tan | N/A | |
| Learning Noise-Aware Encoder-Decoder from Noisy Labels by Alternating Back-Propagation for Saliency Detection | Jing Zhang, Jianwen Xie, Nick Barnes | N/A | |
| Rethinking Image Deraining via Rain Streaks and Vapors | Yinglong Wang, Yibing Song, Chao Ma, Bing Zeng | N/A | |
| Finding Non-Uniform Quantization Schemes using Multi-Task Gaussian Processes | Marcelo Gennari do Nascimento, Theo W. Costain, Victor Adrian Prisacariu | N/A | |
| Is Sharing of Egocentric Video Giving Away Your Biometric Signature? | Daksh Thapar, Chetan Arora, Aditya Nigam | N/A | |
| Captioning Images Taken by People Who Are Blind | Danna Gurari, Yinan Zhao, Meng Zhang, Nilavra Bhattacharya | N/A | |
| Improving Semantic Segmentation via Decoupled Body and Edge Supervision | Xiangtai Li, Xia Li, Li Zhang, Guangliang Cheng, Jianping Shi, Zhouchen Lin, Shaohua Tan, Yunhai Tong | N/A | |
| Conditional Entropy Coding for Efficient Video Compression | Jerry Liu, Shenlong Wang, Wei-Chiu Ma, Meet Shah, Rui Hu, Pranaab Dhawan, Raquel Urtasun | N/A | |
| Differentiable Feature Aggregation Search for Knowledge Distillation | Yushuo Guan, Pengyu Zhao, Bingxuan Wang, Yuanxing Zhang, Cong Yao, Kaigui Bian, Jian Tang | N/A | |
| Attention Guided Anomaly Localization in Images | Shashanka Venkataramanan, Kuan-Chuan Peng, Rajat Vikram Singh, Abhijit Mahalanobis | N/A | |
| Self-supervised Video Representation Learning by Pace Prediction | Jiangliu Wang, Jianbo Jiao, Yun-Hui Liu | N/A | |
| Full-Body Awareness from Partial Observations | Chris Rockwell, David F. Fouhey | N/A | |
| Reinforced Axial Refinement Network for Monocular 3D Object Detection | Lijie Liu, Chufan Wu, Jiwen Lu, Lingxi Xie, Jie Zhou, Qi Tian | N/A | |
| Self-Supervised Multi-Task Procedure Learning from Instructional Videos | Ehsan Elhamifar, Dat Huynh | N/A | |
| CosyPose: Consistent multi-view multi-object 6D pose estimation | Yann Labbé, Justin Carpentier, Mathieu Aubry, Josef Sivic | N/A | |
| In-Domain GAN Inversion for Real Image Editing | Jiapeng Zhu, Yujun Shen, Deli Zhao, Bolei Zhou | N/A | |
| Key Frame Proposal Network for Efficient Pose Estimation in Videos | Yuexi Zhang, Yin Wang, Octavia Camps, Mario Sznaier | N/A | |
| Exchangeable Deep Neural Networks for Set-to-Set Matching and Learning | Yuki Saito, Takuma Nakamura, Hirotaka Hachiya, Kenji Fukumizu | N/A | |
| Making Sense of CNNs: Interpreting Deep Representations & Their Invariances with INNs | Robin Rombach, Patrick Esser, Björn Ommer | N/A | |
| Cross-Modal Weighting Network for RGB-D Salient Object Detection | Gongyang Li, Zhi Liu, Linwei Ye, Yang Wang, Haibin Ling | N/A | |
| Open-set Adversarial Defense | Rui Shao, Pramuditha Perera, Pong C. Yuen, Vishal M. Patel | N/A | |
| Deep Image Compression using Decoder Side Information | Sharon Ayzik, Shai Avidan | N/A | |
| Meta-Sim2: Unsupervised Learning of Scene Structure for Synthetic Data Generation | Jeevan Devaranjan, Amlan Kar, Sanja Fidler | N/A | |
| A Generic Visualization Approach for Convolutional Neural Networks | Ahmed Taha, Xitong Yang, Abhinav Shrivastava, Larry Davis | N/A | |
| Interactive Annotation of 3D Object Geometry using 2D Scribbles | Tianchang Shen, Jun Gao, Amlan Kar, Sanja Fidler | N/A | |
| Hierarchical Kinematic Human Mesh Recovery | Georgios Georgakis, Ren Li, Srikrishna Karanam, Terrence Chen, Jana Košecká, Ziyan Wu | N/A | |
| Multi-Loss Rebalancing Algorithm for Monocular Depth Estimation | Jae-Han Lee, Chang-Su Kim | N/A | |
| 3D Bird Reconstruction: a Dataset, Model, and Shape Recovery from a Single View | Marc Badger, Yufu Wang, Adarsh Modh, Ammon Perkes, Nikos Kolotouros, Bernd G. Pfrommer, Marc F. Schmidt, Kostas Daniilidis | N/A | |
| We Have So Much In Common: Modeling Semantic Relational Set Abstractions in Videos | Alex Andonian, Camilo Fosco, Mathew Monfort, Allen Lee, Rogerio Feris, Carl Vondrick, Aude Oliva | N/A | |
| Joint Optimization for Multi-Person Shape Models from Markerless 3D-Scans | Samuel Zeitvogel, Johannes Dornheim, Astrid Laubenheimer | N/A | |
| Accurate RGB-D Salient Object Detection via Collaborative Learning | Wei Ji, Jingjing Li, Miao Zhang, Yongri Piao, Huchuan Lu | N/A | |
| Finding Your (3D) Center: 3D Object Detection Using a Learned Loss | David Griffiths, Jan Boehm, Tobias Ritschel | N/A | |
| Collaborative Training between Region Proposal Localization and Classification for Domain Adaptive Object Detection | Ganlong Zhao, Guanbin Li, Ruijia Xu, Liang Lin | N/A | |
| Two Stream Active Query Suggestion for Active Learning in Connectomics | Zudi Lin, Donglai Wei, Won-Dong Jang, Siyan Zhou, Xupeng Chen, Xueying Wang, Richard Schalek, Daniel Berger, Brian Matejek, Lee Kamentsky, Adi Peleg, Daniel Haehn, Thouis Jones, Toufiq Parag, Jeff Lichtman, Hanspeter Pfister | N/A | |
| Pix2Surf: Learning Parametric 3D Surface Models of Objects from Images | Jiahui Lei, Srinath Sridhar, Paul Guerrero, Minhyuk Sung, Niloy Mitra, Leonidas J. Guibas | N/A | |
| 6D Camera Relocalization in Ambiguous Scenes via Continuous Multimodal Inference | Mai Bui, Tolga Birdal, Haowen Deng, Shadi Albarqouni, Leonidas Guibas, Slobodan Ilic, Nassir Navab | N/A | |
| Modeling Artistic Workflows for Image Generation and Editing | Hung-Yu Tseng, Matthew Fisher, Jingwan Lu, Yijun Li, Vladimir Kim, Ming-Hsuan Yang | N/A | |
| A Large-scale Annotated Mechanical Components Benchmark for Classification and Retrieval Tasks with Deep Neural Networks | Sangpil Kim, Hyung-gun Chi, Xiao Hu, Qixing Huang, Karthik Ramani | N/A | |
| Hidden Footprints: Learning Contextual Walkability from 3D Human Trails | Jin Sun, Hadar Averbuch-Elor, Qianqian Wang, Noah Snavely | N/A | |
| Self-Supervised Learning of Audio-Visual Objects from Video | Triantafyllos Afouras, Andrew Owens, Joon Son Chung, Andrew Zisserman | N/A | |
| GAN-based Garment Generation Using Sewing Pattern Images | Yu Shen, Junbang Liang, Ming C. Lin | N/A | |
| Style Transfer for Co-Speech Gesture Animation: A Multi-Speaker Conditional-Mixture Approach | Chaitanya Ahuja, Dong Won Lee, Yukiko I. Nakano, Louis-Philippe Morency | N/A | |
| An LSTM Approach to Temporal 3D Object Detection in LiDAR Point Clouds | Rui Huang, Wanyue Zhang, Abhijit Kundu, Caroline Pantofaru, David A Ross, Thomas Funkhouser, Alireza Fathi | N/A | |
| Monotonicity Prior for Cloud Tomography | Tamar Loeub, Aviad Levis, Vadim Holodovsky, Yoav Y. Schechner | N/A | |
| Learning Trailer Moments in Full-Length Movies with Co-Contrastive Attention | Lezi Wang, Dong Liu, Rohit Puri, Dimitris N. Metaxas | N/A | |
| Preserving Semantic Neighborhoods for Robust Cross-modal Retrieval | Christopher Thomas, Adriana Kovashka | N/A | |
| Large-scale Pretraining for Visual Dialog: A Simple State-of-the-Art Baseline | Vishvak Murahari, Dhruv Batra, Devi Parikh, Abhishek Das | N/A | |
| Learning to Generate Grounded Visual Captions without Localization Supervision | Chih-Yao Ma, Yannis Kalantidis, Ghassan AlRegib, Peter Vajda, Marcus Rohrbach, Zsolt Kira | N/A | |
| Neural Hair Rendering | Menglei Chai, Jian Ren, Sergey Tulyakov | N/A | |
| JNR: Joint-based Neural Rig Representation for Compact 3D Face Modeling | Noranart Vesdapunt, Mitch Rundle, HsiangTao Wu, Baoyuan Wang | N/A | |
| On Disentangling Spoof Trace for Generic Face Anti-Spoofing | Yaojie Liu, Joel Stehouwer, Xiaoming Liu | N/A | |
| Streaming Object Detection for 3-D Point Clouds | Wei Han, Zhengdong Zhang, Benjamin Caine, Brandon Yang, Christoph Sprunk, Ouais Alsharif, Jiquan Ngiam, Vijay Vasudevan, Jonathon Shlens, Zhifeng Chen | N/A | |
| NAS-DIP: Learning Deep Image Prior with Neural Architecture Search | Yun-Chun Chen, Chen Gao, Esther Robb, Jia-Bin Huang | N/A | |
| Learning to Learn in a Semi-Supervised Fashion | Yun-Chun Chen, Chao-Te Chou, Yu-Chiang Frank Wang | N/A | |
| FeatMatch: Feature-Based Augmentation for Semi-Supervised Learning | Chia-Wen Kuo, Chih-Yao Ma, Jia-Bin Huang, Zsolt Kira | N/A | |
| RadarNet: Exploiting Radar for Robust Perception of Dynamic Objects | Bin Yang, Runsheng Guo, Ming Liang, Sergio Casas, Raquel Urtasun | N/A | |
| Seeing the Un-Scene: Learning Amodal Semantic Maps for Room Navigation | Medhini Narasimhan, Erik Wijmans, Xinlei Chen, Trevor Darrell, Dhruv Batra, Devi Parikh, Amanpreet Singh | N/A | |
| Learning to Separate: Detecting Heavily-Occluded Objects in Urban Scenes | Chenhongyi Yang, Vitaly Ablavsky, Kaihong Wang, Qi Feng, Margrit Betke | N/A | |
| Towards causal benchmarking of bias in face analysis algorithms | Guha Balakrishnan, Yuanjun Xiong, Wei Xia, Pietro Perona | N/A | |
| Learning and Memorizing Representative Prototypes for 3D Point Cloud Semantic and Instance Segmentation | Tong He, Dong Gong, Zhi Tian, Chunhua Shen | N/A | |
| Knowledge-Based Video Question Answering with Unsupervised Scene Descriptions | Noa Garcia, Yuta Nakashima | N/A | |
| Transformation Consistency Regularization – A Semi-Supervised Paradigm for Image-to-Image Translation | Aamir Mustafa, Rafal K. Mantiuk | N/A | |
| LIRA: Lifelong Image Restoration from Unknown Blended Distortions | Jianzhao Liu, Jianxin Lin, Xin Li, Wei Zhou, Sen Liu, Zhibo Chen | N/A | |
| HDNet: Human Depth Estimation for Multi-Person Camera-Space Localization | Jiahao Lin, Gim Hee Lee | N/A | |
| SOLO: Segmenting Objects by Locations | Xinlong Wang, Tao Kong, Chunhua Shen, Yuning Jiang, Lei Li | N/A | |
| Learning to See in the Dark with Events | Song Zhang, Yu Zhang, Zhe Jiang, Dongqing Zou, Jimmy Ren, Bin Zhou | N/A | |
| Trajectron++: Dynamically-Feasible Trajectory Forecasting With Heterogeneous Data | Tim Salzmann, Boris Ivanovic, Punarjay Chakravarty, Marco Pavone | N/A | |
| Context-Gated Convolution | Xudong Lin, Lin Ma, Wei Liu, Shih-Fu Chang | N/A | |
| Polynomial Regression Network for Variable-Number Lane Detection | Bingke Wang, Zilei Wang, Yixin Zhang | N/A | |
| Structural Deep Metric Learning for Room Layout Estimation | Wenzhao Zheng, Jiwen Lu, Jie Zhou | N/A | |
| Adaptive Task Sampling for Meta-Learning | Chenghao Liu, Zhihao Wang, Doyen Sahoo, Yuan Fang, Kun Zhang, Steven C.H. Hoi | N/A | |
| Deep Complementary Joint Model for Complex Scene Registration and Few-shot Segmentation on Medical Images | Yuting He, Tiantian Li, Guanyu Yang, Youyong Kong, Yang Chen, Huazhong Shu, Jean-Louis Coatrieux, Jean-Louis Dillenseger, Shuo Li | N/A | |
| Improving Multispectral Pedestrian Detection by Addressing Modality Imbalance Problems | Kailai Zhou, Linsen Chen, Xun Cao | N/A | |
| High-Resolution Image Inpainting with Iterative Confidence Feedback and Guided Upsampling | Yu Zeng, Zhe Lin, Jimei Yang, Jianming Zhang, Eli Shechtman, Huchuan Lu | N/A | |
| Online Ensemble Model Compression using Knowledge Distillation | Devesh Walawalkar, Zhiqiang Shen, Marios Savvides | N/A | |
| Deep Learning-based Pupil Center Detection for Fast and Accurate Eye Tracking System | Kang Il Lee, Jung Ho Jeon, Byung Cheol Song | N/A | |
| Efficient Residue Number System Based Winograd Convolution | Zhi-Gang Liu, Matthew Mattina | N/A | |
| Robust Tracking against Adversarial Attacks | Shuai Jia, Chao Ma, Yibing Song, Xiaokang Yang | N/A | |
| Single-Shot Neural Relighting and SVBRDF Estimation | Shen Sang, Manmohan Chandraker | N/A | |
| Unsupervised 3D Human Pose Representation with Viewpoint and Pose Disentanglement | Qiang Nie, Ziwei Liu, Yunhui Liu | N/A | |
| Angle-based Search Space Shrinking for Neural Architecture Search | Yiming Hu, Yuding Liang, Zichao Guo, Ruosi Wan, Xiangyu Zhang, Yichen Wei, Qingyi Gu, Jian Sun | N/A | |
| RobustScanner: Dynamically Enhancing Positional Clues for Robust Text Recognition | Xiaoyu Yue, Zhanghui Kuang, Chenhao Lin, Hongbin Sun, Wayne Zhang | N/A | |
| Towards Fast, Accurate and Stable 3D Dense Face Alignment | Jianzhu Guo, Xiangyu Zhu, Yang Yang, Fan Yang, Zhen Lei, Stan Z. Li | N/A | |
| Iterative Feature Transformation for Fast and Versatile Universal Style Transfer | Tai-Yin Chiu, Danna Gurari | N/A | |
| CATCH: Context-based Meta Reinforcement Learning for Transferrable Architecture Search | Xin Chen, Yawen Duan, Zewei Chen, Hang Xu, Zihao Chen, Xiaodan Liang, Tong Zhang, Zhenguo Li | N/A | |
| Toward Faster and Simpler Matrix Normalization via Rank-1 Update | Tan Yu, Yunfeng Cai, Ping Li | N/A | |
| Accurate Polarimetric BRDF for Real Polarization Scene Rendering | Yuhi Kondo, Taishi Ono, Legong Sun, Yasutaka Hirasawa, Jun Murayama | N/A | |
| Lensless Imaging with Focusing Sparse URA Masks in Long-Wave Infrared and its Application for Human Detection | Ilya Reshetouski, Hideki Oyaizu, Kenichiro Nakamura, Ryuta Satoh, Suguru Ushiki, Ryuichi Tadano, Atsushi Ito, Jun Murayama | N/A | |
| Topology-Preserving Class-Incremental Learning | Xiaoyu Tao, Xinyuan Chang, Xiaopeng Hong, Xing Wei, Yihong Gong | N/A | |
| Inter-Image Communication for Weakly Supervised Localization | Xiaolin Zhang, Yunchao Wei, Yi Yang | N/A | |
| UFO²: A Unified Framework towards Omni-supervised Object Detection | Zhongzheng Ren, Zhiding Yu, Xiaodong Yang, Ming-Yu Liu, Alexander G. Schwing, Jan Kautz | N/A | |
| iCaps: An Interpretable Classifier via Disentangled Capsule Networks | Dahuin Jung, Jonghyun Lee, Jihun Yi, Sungroh Yoon | N/A | |
| Detecting Natural Disasters, Damage, and Incidents in the Wild | Ethan Weber, Nuria Marzo, Dim P. Papadopoulos, Aritro Biswas, Agata Lapedriza, Ferda Ofli, Muhammad Imran, Antonio Torralba | N/A | |
| Dynamic ReLU | Yinpeng Chen, Xiyang Dai, Mengchen Liu, Dongdong Chen, Lu Yuan, Zicheng Liu | N/A | |
| Acquiring Dynamic Light Fields through Coded Aperture Camera | Kohei Sakai, Keita Takahashi, Toshiaki Fujii, Hajime Nagahara | N/A | |
| Gait Recognition from a Single Image using a Phase-Aware Gait Cycle Reconstruction Network | Chi Xu, Yasushi Makihara, Xiang Li, Yasushi Yagi, Jianfeng Lu | N/A | |
| Informative Sample Mining Network for Multi-Domain Image-to-Image Translation | Jie Cao, Huaibo Huang, Yi Li, Ran He, Zhenan Sun | N/A | |
| Spherical Feature Transform for Deep Metric Learning | Yuke Zhu, Yan Bai, Yichen Wei | N/A | |
| Semantic Equivalent Adversarial Data Augmentation for Visual Question Answering | Ruixue Tang, Chao Ma, Wei Emma Zhang, Qi Wu, Xiaokang Yang | N/A | |
| Unsupervised Multi-View CNN for Salient View Selection of 3D Objects and Scenes | Ran Song, Wei Zhang, Yitian Zhao, Yonghuai Liu | N/A | |
| Representation Sharing for Fast Object Detector Search and Beyond | Yujie Zhong, Zelu Deng, Sheng Guo, Matthew R. Scott, Weilin Huang | N/A | |
| Peeking into occluded joints: A novel framework for crowd pose estimation | Lingteng Qiu, Xuanye Zhang, Yanran Li, Guanbin Li, Xiaojun Wu, Zixiang Xiong, Xiaoguang Han, Shuguang Cui | N/A | |
| RubiksNet: Learnable 3D-Shift for Efficient Video Action Recognition | Linxi Fan, Shyamal Buch, Guanzhi Wang, Ryan Cao, Yuke Zhu, Juan Carlos Niebles, Li Fei-Fei | N/A | |
| Deep Hashing with Active Pairwise Supervision | Ziwei Wang, Quan Zheng, Jiwen Lu, Jie Zhou | N/A | |
| Graph Edit Distance Reward: Learning to Edit Scene Graph | Lichang Chen, Guosheng Lin, Shijie Wang, Qingyao Wu | N/A | |
| Malleable 2.5D Convolution: Learning Receptive Fields along the Depth-axis for RGB-D Scene Parsing | Yajie Xing, Jingbo Wang, Gang Zeng | N/A | |
| Feature-metric Loss for Self-supervised Learning of Depth and Egomotion | Chang Shu, Kun Yu, Zhixiang Duan, Kuiyuan Yang | N/A | |
| Propagating Over Phrase Relations for One-Stage Visual Grounding | Sibei Yang, Guanbin Li, Yizhou Yu | N/A | |
| Adversarial Semantic Data Augmentation for Human Pose Estimation | Yanrui Bin, Xuan Cao, Xinya Chen, Yanhao Ge, Ying Tai, Chengjie Wang, Jilin Li, Feiyue Huang, Changxin Gao, Nong Sang | N/A | |
| Free View Synthesis | Gernot Riegler, Vladlen Koltun | N/A | |
| Face Anti-Spoofing via Disentangled Representation Learning | Ke-Yue Zhang, Taiping Yao, Jian Zhang, Ying Tai, Shouhong Ding, Jilin Li, Feiyue Huang, Haichuan Song, Lizhuang Ma | N/A | |
| Prime-Aware Adaptive Distillation | Youcai Zhang, Zhonghao Lan, Yuchen Dai, Fangao Zeng, Yan Bai, Jie Chang, Yichen Wei | N/A | |
| Meta-Learning with Network Pruning | Hongduan Tian, Bo Liu, Xiao-Tong Yuan, Qingshan Liu | N/A | |
| Spiral Generative Network for Image Extrapolation | Dongsheng Guo, Hongzhi Liu, Haoru Zhao, Yunhao Cheng, Qingwei Song, Zhaorui Gu, Haiyong Zheng, Bing Zheng | N/A | |
| SceneSketcher: Fine-Grained Image Retrieval with Scene Sketches | Fang Liu, Changqing Zou, Xiaoming Deng, Ran Zuo, Yu-Kun Lai, Cuixia Ma, Yong-Jin Liu, Hongan Wang | N/A | |
| Few-shot Compositional Font Generation with Dual Memory | Junbum Cha, Sanghyuk Chun, Gayoung Lee, Bado Lee, Seonghyeon Kim, Hwalsuk Lee | N/A | |
| PUGeo-Net: A Geometry-centric Network for 3D Point Cloud Upsampling | Yue Qian, Junhui Hou, Sam Kwong, Ying He | N/A | |
| Handcrafted Outlier Detection Revisited | Luca Cavalli, Viktor Larsson, Martin Ralf Oswald, Torsten Sattler, Marc Pollefeys | N/A | |
| The Average Mixing Kernel Signature | Luca Cosmo, Giorgia Minello, Michael Bronstein, Luca Rossi, Andrea Torsello | N/A | |
| BCNet: Learning Body and Cloth Shape from A Single Image | Boyi Jiang, Juyong Zhang, Yang Hong, Jinhao Luo, Ligang Liu, Hujun Bao | N/A | |
| Self-supervised Keypoint Correspondences for Multi-Person Pose Estimation and Tracking in Videos | Umer Rafi, Andreas Doering, Bastian Leibe, Juergen Gall | N/A | |
| Interactive Multi-Dimension Modulation with Dynamic Controllable Residual Learning for Image Restoration | Jingwen He, Chao Dong, Yu Qiao | N/A | |
| Polysemy Deciphering Network for Human-Object Interaction Detection | Xubin Zhong, Changxing Ding, Xian Qu, Dacheng Tao | N/A | |
| PODNet: Pooled Outputs Distillation for Small-Tasks Incremental Learning | Arthur Douillard, Matthieu Cord, Charles Ollion, Thomas Robert, Eduardo Valle | N/A | |
| Learning Graph-Convolutional Representations for Point Cloud Denoising | Francesca Pistilli, Giulia Fracastoro, Diego Valsesia, Enrico Magli | N/A | |
| Semantic Line Detection Using Mirror Attention and Comparative Ranking and Matching | Dongkwon Jin, Jun-Tae Lee, Chang-Su Kim | N/A | |
| A Differentiable Recurrent Surface for Asynchronous Event-Based Data | Marco Cannici, Marco Ciccone, Andrea Romanoni, Matteo Matteucci | N/A | |
| Fine-Grained Visual Classification via Progressive Multi-Granularity Training of Jigsaw Patches | Ruoyi Du, Dongliang Chang, Ayan Kumar Bhunia, Jiyang Xie, Zhanyu Ma, Yi-Zhe Song, Jun Guo | N/A | |
| LiteFlowNet3: Resolving Correspondence Ambiguity for More Accurate Optical Flow Estimation | Tak-Wai Hui, Chen Change Loy | N/A | |
| Microscopy Image Restoration with Deep Wiener-Kolmogorov Filters | Valeriya Pronina, Filippos Kokkinos, Dmitry V. Dylov, Stamatios Lefkimmiatis | N/A | |
| ScanRefer: 3D Object Localization in RGB-D Scans using Natural Language | Dave Zhenyu Chen, Angel X. Chang, Matthias Nießner | N/A | |
| JSENet: Joint Semantic Segmentation and Edge Detection Network for 3D Point Clouds | Zeyu Hu, Mingmin Zhen, Xuyang Bai, Hongbo Fu, Chiew-lan Tai | N/A | |
| Motion-Excited Sampler: Video Adversarial Attack with Sparked Prior | Hu Zhang, Linchao Zhu, Yi Zhu, Yi Yang | N/A | |
| An Inference Algorithm for Multi-Label MRF-MAP Problems with Clique Size 100 | Ishant Shanu, Siddhant Bharti, Chetan Arora, S. N. Maheshwari | N/A | |
| Dual Refinement Underwater Object Detection Network | Baojie Fan, Wei Chen, Yang Cong, Jiandong Tian | N/A | |
| Multiple Sound Sources Localization from Coarse to Fine | Rui Qian, Di Hu, Heinrich Dinkel, Mengyue Wu, Ning Xu, Weiyao Lin | N/A | |
| Task-Aware Quantization Network for JPEG Image Compression | Jinyoung Choi, Bohyung Han | N/A | |
| Energy-Based Models for Deep Probabilistic Regression | Fredrik K. Gustafsson, Martin Danelljan, Goutam Bhat, Thomas B. Schön | N/A | |
| CLOTH3D: Clothed 3D Humans | Hugo Bertiche, Meysam Madadi, Sergio Escalera | N/A | |
| Encoding Structure-Texture Relation with P-Net for Anomaly Detection in Retinal Images | Kang Zhou, Yuting Xiao, Jianlong Yang, Jun Cheng, Wen Liu, Weixin Luo, Zaiwang Gu, Jiang Liu, Shenghua Gao | N/A | |
| CLNet: A Compact Latent Network for Fast Adjusting Siamese Trackers | Xingping Dong, Jianbing Shen, Ling Shao, Fatih Porikli | N/A | |
| Occlusion-Aware Siamese Network for Human Pose Estimation | Lu Zhou, Yingying Chen, Yunze Gao, Jinqiao Wang, Hanqing Lu | N/A | |
| Learning to Predict Salient Faces: A Novel Visual-Audio Saliency Model | Yufan Liu, Minglang Qiao, Mai Xu, Bing Li, Weiming Hu, Ali Borji | N/A | |
| NormalGAN: Learning Detailed 3D Human from a Single RGB-D Image | Lizhen Wang, Xiaochen Zhao, Tao Yu, Songtao Wang, Yebin Liu | N/A | |
| Model-based occlusion disentanglement for image-to-image translation | Fabio Pizzati, Pietro Cerri, Raoul de Charette | N/A | |
| Rotation-robust Intersection over Union for 3D Object Detection | Yu Zheng, Danyang Zhang, Sinan Xie, Jiwen Lu, Jie Zhou | N/A | |
| New Threats against Object Detector with Non-local Block | Yi Huang, Fan Wang, Adams Wai-Kin Kong, Kwok-Yan Lam | N/A | |
| Self-Supervised CycleGAN for Object-Preserving Image-to-Image Domain Adaptation | Xinpeng Xie, Jiawei Chen, Yuexiang Li, Linlin Shen, Kai Ma, Yefeng Zheng | N/A | |
| On the Usage of the Trifocal Tensor in Motion Segmentation | Federica Arrigoni, Luca Magri, Tomas Pajdla | N/A | |
| 3D-Rotation-Equivariant Quaternion Neural Networks | Wen Shen, Binbin Zhang, Shikun Huang, Zhihua Wei, Quanshi Zhang | N/A | |
| InterHand2.6M: A Dataset and Baseline for 3D Interacting Hand Pose Estimation from a Single RGB Image | Gyeongsik Moon, Shoou-I Yu, He Wen, Takaaki Shiratori, Kyoung Mu Lee | N/A | |
| Active Crowd Counting with Limited Supervision | Zhen Zhao, Miaojing Shi, Xiaoxiao Zhao, Li Li | N/A | |
| Self-Supervised Monocular Depth Estimation: Solving the Dynamic Object Problem by Semantic Guidance | Marvin Klingner, Jan-Aike Termöhlen, Jonas Mikolajczyk, Tim Fingscheidt | N/A | |
| Hierarchical Visual-Textual Graph for Temporal Activity Localization via Language | Shaoxiang Chen, Yu-Gang Jiang | N/A | |
| Do Not Mask What You Do Not Need to Mask: a Parser-Free Virtual Try-On | Thibaut Issenhuth, Jérémie Mary, Clément Calauzènes | N/A | |
| NODIS: Neural Ordinary Differential Scene Understanding | Yuren Cong, Hanno Ackermann, Wentong Liao, Michael Ying Yang, Bodo Rosenhahn | N/A | |
| AssembleNet++: Assembling Modality Representations via Attention Connections | Michael S. Ryoo, AJ Piergiovanni, Juhana Kangaspunta, Anelia Angelova | N/A | |
| Learning Propagation Rules for Attribution Map Generation | Yiding Yang, Jiayan Qiu, Mingli Song, Dacheng Tao, Xinchao Wang | N/A | |
| Reparameterizing Convolutions for Incremental Multi-Task Learning without Task Interference | Menelaos Kanakis, David Bruggemann, Suman Saha, Stamatios Georgoulis, Anton Obukhov, Luc Van Gool | N/A | |
| Learning Predictive Models from Observation and Interaction | Karl Schmeckpeper, Annie Xie, Oleh Rybkin, Stephen Tian, Kostas Daniilidis, Sergey Levine, Chelsea Finn | N/A | |
| Unifying Deep Local and Global Features for Image Search | Bingyi Cao, André Araujo, Jack Sim | N/A | |
| Human Body Model Fitting by Learned Gradient Descent | Jie Song, Xu Chen, Otmar Hilliges | N/A | |
| DDGCN: A Dynamic Directed Graph Convolutional Network for Action Recognition | Matthew Korban, Xin Li | N/A | |
| Learning latent representations across multiple data domains using Lifelong VAEGAN | Fei Ye, Adrian G. Bors | N/A | |
| DVI: Depth Guided Video Inpainting for Autonomous Driving | Miao Liao, Feixiang Lu, Dingfu Zhou, Sibo Zhang, Wei Li, Ruigang Yang | N/A | |
| Incorporating Reinforced Adversarial Learning in Autoregressive Image Generation | Kenan E. Ak, Ning Xu, Zhe Lin, Yilin Wang | N/A | |
| APRICOT: A Dataset of Physical Adversarial Attacks on Object Detection | A. Braunegg, Amartya Chakraborty, Michael Krumdick, Nicole Lape, Sara Leary, Keith Manville, Elizabeth Merkhofer, Laura Strickhart, Matthew Walmer | N/A | |
| Visual Question Answering on Image Sets | Ankan Bansal, Yuting Zhang, Rama Chellappa | N/A | |
| Object as Hotspots: An Anchor-Free 3D Object Detection Approach via Firing of Hotspots | Qi Chen, Lin Sun, Zhixin Wang, Kui Jia, Alan Yuille | N/A | |
| Placepedia: Comprehensive Place Understanding with Multi-Faceted Annotations | Huaiyi Huang, Yuqi Zhang, Qingqiu Huang, Zhengkui Guo, Ziwei Liu, Dahua Lin | N/A | |
| DELTAS: Depth Estimation by Learning Triangulation And densification of Sparse points | Ayan Sinha, Zak Murez, James Bartolozzi, Vijay Badrinarayanan, Andrew Rabinovich | N/A | |
| Dynamic Low-light Imaging with Quanta Image Sensors | Yiheng Chi, Abhiram Gnanasambandam, Vladlen Koltun, Stanley H. Chan | N/A | |
| Disambiguating Monocular Depth Estimation with a Single Transient | Mark Nishimura, David B. Lindell, Christopher Metzler, Gordon Wetzstein | N/A | |
| DSDNet: Deep Structured self-Driving Network | Wenyuan Zeng, Shenlong Wang, Renjie Liao, Yun Chen, Bin Yang, Raquel Urtasun | N/A | |
| QuEST: Quantized Embedding Space for Transferring Knowledge | Himalaya Jain, Spyros Gidaris, Nikos Komodakis, Patrick Pérez, Matthieu Cord | N/A | |
| EGDCL: An Adaptive Curriculum Learning Framework for Unbiased Glaucoma Diagnosis | Rongchang Zhao, Xuanlin Chen, Zailiang Chen, Shuo Li | N/A | |
| Backpropagated Gradient Representations for Anomaly Detection | Gukyeong Kwon, Mohit Prabhushankar, Dogancan Temel, Ghassan AlRegib | N/A | |
| Dense RepPoints: Representing Visual Objects with Dense Point Sets | Ze Yang, Yinghao Xu, Han Xue, Zheng Zhang, Raquel Urtasun, Liwei Wang, Stephen Lin, Han Hu | N/A | |
| On Dropping Clusters to Regularize Graph Convolutional Neural Networks | Xikun Zhang, Chang Xu, Dacheng Tao | N/A | |
| Adaptive Video Highlight Detection by Learning from User History | Mrigank Rochan, Mahesh Kumar Krishna Reddy, Linwei Ye, Yang Wang | N/A | |
| Improving 3D Object Detection through Progressive Population Based Augmentation | Shuyang Cheng, Zhaoqi Leng, Ekin Dogus Cubuk, Barret Zoph, Chunyan Bai, Jiquan Ngiam, Yang Song, Benjamin Caine, Vijay Vasudevan, Congcong Li, Quoc V. Le, Jonathon Shlens, Dragomir Anguelov | N/A | |
| DR-KFS: A Differentiable Visual Similarity Metric for 3D Shape Reconstruction | Jiongchao Jin, Akshay Gadi Patil, Zhang Xiong, Hao Zhang | N/A | |
| SPAN: Spatial Pyramid Attention Network for Image Manipulation Localization | Xuefeng Hu, Zhihan Zhang, Zhenye Jiang, Syomantak Chaudhuri, Zhenheng Yang, Ram Nevatia | N/A | |
| Adversarial Learning for Zero-shot Domain Adaptation | Jinghua Wang, Jianmin Jiang | N/A | |
| YOLO in the Dark - Domain Adaptation Method for Merging Multiple Models - | Yukihiro Sasagawa, Hajime Nagahara | N/A | |
| Identity-Aware Multi-Sentence Video Description | Jae Sung Park, Trevor Darrell, Anna Rohrbach | N/A | |
| VQA-LOL: Visual Question Answering under the Lens of Logic | Tejas Gokhale, Pratyay Banerjee, Chitta Baral, Yezhou Yang | N/A | |
| Piggyback GAN: Efficient Lifelong Learning for Image Conditioned Generation | Mengyao Zhai, Lei Chen, Jiawei He, Megha Nawhal, Frederick Tung, Greg Mori | N/A | |
| TRRNet: Tiered Relation Reasoning for Compositional Visual Question Answering | Xiaofeng Yang, Guosheng Lin, Fengmao Lv, Fayao Liu | N/A | |
| Mining Inter-Video Proposal Relations for Video Object Detection | Mingfei Han, Yali Wang, Xiaojun Chang, Yu Qiao | N/A | |
| TVR: A Large-Scale Dataset for Video-Subtitle Moment Retrieval | Jie Lei, Licheng Yu, Tamara L. Berg, Mohit Bansal | N/A | |
| Minimum Class Confusion for Versatile Domain Adaptation | Ying Jin, Ximei Wang, Mingsheng Long, Jianmin Wang | N/A | |
| Large Batch Optimization for Object Detection: Training COCO in 12 Minutes | Tong Wang, Yousong Zhu, Chaoyang Zhao, Wei Zeng, Yaowei Wang, Jinqiao Wang, Ming Tang | N/A | |
| Towards Practical and Efficient High-Resolution HDR Deghosting with CNN | K. Ram Prabhakar, Susmit Agrawal, Durgesh Kumar Singh, Balraj Ashwath, R. Venkatesh Babu | N/A | |
| Monocular Differentiable Rendering for Self-Supervised 3D Object Detection | Deniz Beker, Hiroharu Kato, Mihai Adrian Morariu, Takahiro Ando, Toru Matsuoka, Wadim Kehl, Adrien Gaidon | N/A | |
| Shape Prior Deformation for Categorical 6D Object Pose and Size Estimation | Meng Tian, Marcelo H Ang Jr, Gim Hee Lee | N/A | |
| Dynamic and Static Context-aware LSTM for Multi-agent Motion Prediction | Chaofan Tao, Qinhong Jiang, Lixin Duan, Ping Luo | N/A | |
| Image-based table recognition: data, model, and evaluation | Xu Zhong, Elaheh ShafieiBavani, Antonio Jimeno Yepes | N/A | |
| Group Activity Prediction with Sequential Relational Anticipation Model | Junwen Chen, Wentao Bao, Yu Kong | N/A | |
| PiP: Planning-informed Trajectory Prediction for Autonomous Driving | Haoran Song, Wenchao Ding, Yuxuan Chen, Shaojie Shen, Michael Yu Wang, Qifeng Chen | N/A | |
| PSConv: Squeezing Feature Pyramid into One Compact Poly-Scale Convolutional Layer | Duo Li, Anbang Yao, Qifeng Chen | N/A | |
| Hierarchical Context Embedding for Region-based Object Detection | Zhao-Min Chen, Xin Jin, Borui Zhao, Xiu-Shen Wei, Yanwen Guo | N/A | |
| Attention-Driven Dynamic Graph Convolutional Network for Multi-Label Image Recognition | Jin Ye, Junjun He, Xiaojiang Peng, Wenhao Wu, Yu Qiao | N/A | |
| Gen-LaneNet: A Generalized and Scalable Approach for 3D Lane Detection | Yuliang Guo, Guang Chen, Peitao Zhao, Weide Zhang, Jinghao Miao, Jingao Wang, Tae Eun Choe | N/A | |
| Sparse-to-Dense Depth Completion Revisited: Sampling Strategy and Graph Construction | Xin Xiong, Haipeng Xiong, Ke Xian, Chen Zhao, Zhiguo Cao, Xin Li | N/A | |
| MEAD: A Large-scale Audio-visual Dataset for Emotional Talking-face Generation | Kaisiyuan Wang, Qianyi Wu, Linsen Song, Zhuoqian Yang, Wayne Wu, Chen Qian, Ran He, Yu Qiao, Chen Change Loy | N/A | |
| Detecting Human-Object Interactions with Action Co-occurrence Priors | Dong-Jin Kim, Xiao Sun, Jinsoo Choi, Stephen Lin, In So Kweon | N/A | |
| Learning Connectivity of Neural Networks from a Topological Perspective | Kun Yuan, Quanquan Li, Jing Shao, Junjie Yan | N/A | |
| JSTASR: Joint Size and Transparency-Aware Snow Removal Algorithm Based on Modified Partial Convolution and Veiling Effect Removal | Wei-Ting Chen, Hao-Yu Fang, Jian-Jiun Ding, Cheng-Che Tsai, Sy-Yen Kuo | N/A | |
| Ocean: Object-aware Anchor-free Tracking | Zhipeng Zhang, Houwen Peng, Jianlong Fu, Bing Li, Weiming Hu | N/A | |
| Object Tracking using Spatio-Temporal Networks for Future Prediction Location | Yuan Liu, Ruoteng Li, Yu Cheng, Robby T. Tan, Xiubao Sui | N/A | |
| Pillar-based Object Detection for Autonomous Driving | Yue Wang, Alireza Fathi, Abhijit Kundu, David A. Ross, Caroline Pantofaru, Tom Funkhouser, Justin Solomon | N/A | |
| Sparse Adversarial Attack via Perturbation Factorization | Yanbo Fan, Baoyuan Wu, Tuanhui Li, Yong Zhang, Mingyang Li, Zhifeng Li, Yujiu Yang | N/A | |
| 3D Scene Reconstruction from a Single Viewport | Maximilian Denninger, Rudolph Triebel | N/A | |
| Learning to Optimize Domain Specific Normalization for Domain Generalization | Seonguk Seo, Yumin Suh, Dongwan Kim, Geeho Kim, Jongwoo Han, Bohyung Han | N/A | |
| Self-supervised Outdoor Scene Relighting | Ye Yu, Abhimitra Meka, Mohamed Elgharib, Hans-Peter Seidel, Christian Theobalt, William A. P. Smith | N/A | |
| Privacy Preserving Visual SLAM | Mikiya Shibuya, Shinya Sumikura, Ken Sakurada | N/A | |
| Leveraging Acoustic Images for Effective Self-Supervised Audio Representation Learning | Valentina Sanguineti, Pietro Morerio, Niccolò Pozzetti, Danilo Greco, Marco Cristani, Vittorio Murino | N/A | |
| Learning Joint Visual Semantic Matching Embeddings for Language-guided Retrieval | Yanbei Chen, Loris Bazzani | N/A | |
| Globally Optimal and Efficient Vanishing Point Estimation in Atlanta World | Haoang Li, Pyojin Kim, Ji Zhao, Kyungdon Joo, Zhipeng Cai, Zhe Liu, Yun-Hui Liu | N/A | |
| StyleGAN2 Distillation for Feed-forward Image Manipulation | Yuri Viazovetskyi, Vladimir Ivashkin, Evgeny Kashin | N/A | |
| Self-Prediction for Joint Instance and Semantic Segmentation of Point Clouds | Jinxian Liu, Minghui Yu, Bingbing Ni, Ye Chen | N/A | |
| Learning Disentangled Representations via Mutual Information Estimation | Eduardo Hugo Sanchez, Mathieu Serrurier, Mathias Ortner | N/A | |
| Challenge-Aware RGBT Tracking | Chenglong Li, Lei Liu, Andong Lu, Qing Ji, Jin Tang | N/A | |
| Fully Trainable and Interpretable Non-Local Sparse Models for Image Restoration | Bruno Lecouat, Jean Ponce, Julien Mairal | N/A | |
| AutoSimulate: (Quickly) Learning Synthetic Data Generation | Harkirat Singh Behl, Atilim Güneş Baydin, Ran Gal, Philip H.S. Torr, Vibhav Vineet | N/A | |
| LatticeNet: Towards Lightweight Image Super-resolution with Lattice Block | Xiaotong Luo, Yuan Xie, Yulun Zhang, Yanyun Qu, Cuihua Li, Yun Fu | N/A | |
| Learning from Scale-Invariant Examples for Domain Adaptation in Semantic Segmentation | M. Naseer Subhani, Mohsen Ali | N/A | |
| Active Visual Information Gathering for Vision-Language Navigation | Hanqing Wang, Wenguan Wang, Tianmin Shu, Wei Liang, Jianbing Shen | N/A | |
| Deep Hough-Transform Line Priors | Yancong Lin, Silvia L. Pintea, Jan C. van Gemert | N/A | |
| Unsupervised Shape and Pose Disentanglement for 3D Meshes | Keyang Zhou, Bharat Lal Bhatnagar, Gerard Pons-Moll | N/A | |
| CLAWS: Clustering Assisted Weakly Supervised Learning with Normalcy Suppression for Anomalous Event Detection | Muhammad Zaigham Zaheer, Arif Mahmood, Marcella Astrid, Seung-Ik Lee | N/A | |
| Inclusive GAN: Improving Data and Minority Coverage in Generative Models | Ning Yu, Ke Li, Peng Zhou, Jitendra Malik, Larry Davis, Mario Fritz | N/A | |
| SESAME: Semantic Editing of Scenes by Adding, Manipulating or Erasing Objects | Evangelos Ntavelis, Andrés Romero, Iason Kastanis, Luc Van Gool, Radu Timofte | N/A | |
| Dive Deeper Into Box for Object Detection | Ran Chen, Yong Liu, Mengdan Zhang, Shu Liu, Bei Yu, Yu-Wing Tai | N/A | |
| PG-Net: Pixel to Global Matching Network for Visual Tracking | Bingyan Liao, Chenye Wang, Yayun Wang, Yaonong Wang, Jun Yin | N/A | |
| Why Are Deep Representations Good Perceptual Quality Features? | Taimoor Tariq, Okan Tarhan Tursun, Munchurl Kim, Piotr Didyk | N/A | |
| Geometric Estimation via Robust Subspace Recovery | Aoxiang Fan, Xingyu Jiang, Yang Wang, Junjun Jiang, Jiayi Ma | N/A | |
| Latent Embedding Feedback and Discriminative Features for Zero-Shot Classification | Sanath Narayan, Akshita Gupta, Fahad Shahbaz Khan, Cees G. M. Snoek, Ling Shao | N/A | |
| Human Correspondence Consensus for 3D Object Semantic Understanding | Yujing Lou, Yang You, Chengkun Li, Zhoujun Cheng, Liangwei Li, Lizhuang Ma, Weiming Wang, Cewu Lu | N/A | |
| Learning Memory Augmented Cascading Network for Compressed Sensing of Images | Jiwei Chen, Yubao Sun, Qingshan Liu, Rui Huang | N/A | |
| Least squares surface reconstruction on arbitrary domains | Dizhong Zhu, William A. P. Smith | N/A | |
| Task-conditioned Domain Adaptation for Pedestrian Detection in Thermal Imagery | My Kieu, Andrew D. Bagdanov, Marco Bertini, Alberto del Bimbo | N/A | |
| Improving the Transferability of Adversarial Examples with Resized-Diverse-Inputs, Diversity-Ensemble and Region Fitting | Junhua Zou, Zhisong Pan, Junyang Qiu, Xin Liu, Ting Rui, Wei Li | N/A | |
| DADA: Differentiable Automatic Data Augmentation | Yonggang Li, Guosheng Hu, Yongtao Wang, Timothy Hospedales, Neil M. Robertson, Yongxin Yang | N/A | |
| SceneCAD: Predicting Object Alignments and Layouts in RGB-D Scans | Armen Avetisyan, Tatiana Khanova, Christopher Choy, Denver Dash, Angela Dai, Matthias Nießner | N/A | |
| Kinship Identification through Joint Learning using Kinship Verification Ensembles | Wei Wang, Shaodi You, Theo Gevers | N/A | |
| Kernelized Memory Network for Video Object Segmentation | Hongje Seong, Junhyuk Hyun, Euntai Kim | N/A | |
| A Single Stream Network for Robust and Real-time RGB-D Salient Object Detection | Xiaoqi Zhao, Lihe Zhang, Youwei Pang, Huchuan Lu, Lei Zhang | N/A | |
| Splitting vs. Merging: Mining Object Regions with Discrepancy and Intersection Loss for Weakly Supervised Semantic Segmentation | Tianyi Zhang, Guosheng Lin, Weide Liu, Jianfei Cai, Alex Kot | N/A | |
| Temporal Keypoint Matching and Refinement Network for Pose Estimation and Tracking | Chunluan Zhou, Zhou Ren, Gang Hua | N/A | |
| Neural Point-Based Graphics | Kara-Ali Aliev, Artem Sevastopolsky, Maria Kolos, Dmitry Ulyanov, Victor Lempitsky | N/A | |
| FHDe²Net: Full High Definition Demoireing Network | Bin He, Ce Wang, Boxin Shi, Ling-Yu Duan | N/A | |
| Learning Structural Similarity of User Interface Layouts using Graph Networks | Dipu Manandhar, Dan Ruta, John Collomosse | N/A | |
| NAS-Count: Counting-by-Density with Neural Architecture Search | Yutao Hu, Xiaolong Jiang, Xuhui Liu, Baochang Zhang, Jungong Han, Xianbin Cao, David Doermann | N/A | |
| Towards Generalization Across Depth for Monocular 3D Object Detection | Andrea Simonelli, Samuel Rota Buló, Lorenzo Porzi, Elisa Ricci, Peter Kontschieder | N/A | |
| Margin-Mix: Semi-Supervised Learning for Face Expression Recognition | Corneliu Florea, Mihai Badea, Laura Florea, Andrei Racoviteanu, Constantin Vertan | N/A | |
| Principal Feature Visualisation in Convolutional Neural Networks | Marianne Bakken, Johannes Kvam, Alexey A. Stepanov, Asbjørn Berge | N/A | |
| Progressive Refinement Network for Occluded Pedestrian Detection | Xiaolin Song, Kaili Zhao, Wen-Sheng Chu, Honggang Zhang, Jun Guo | N/A | |
| Monocular Real-Time Volumetric Performance Capture | Ruilong Li, Yuliang Xiu, Shunsuke Saito, Zeng Huang, Kyle Olszewski, Hao Li | N/A | |
| The Mapillary Traffic Sign Dataset for Detection and Classification on a Global Scale | Christian Ertler, Jerneja Mislej, Tobias Ollmann, Lorenzo Porzi, Gerhard Neuhold, Yubin Kuang | N/A | |
| Measuring Generalisation to Unseen Viewpoints, Articulations, Shapes and Objects for 3D Hand Pose Estimation under Hand-Object Interaction | Anil Armagan, Guillermo Garcia-Hernando, Seungryul Baek, Shreyas Hampali, Mahdi Rad, Zhaohui Zhang, Shipeng Xie, MingXiu Chen, Boshen Zhang, Fu Xiong, Yang Xiao, Zhiguo Cao, Junsong Yuan, Pengfei Ren, Weiting Huang, Haifeng Sun, Marek Hrúz, Jakub Kanis, Zdeněk Krňoul, Qingfu Wan, Shile Li, Linlin Yang, Dongheui Lee, Angela Yao, Weiguo Zhou, Sijia Mei, Yunhui Liu, Adrian Spurr, Umar Iqbal, Pavlo Molchanov, Philippe Weinzaepfel, Romain Brégier, Grégory Rogez, Vincent Lepetit, Tae-Kyun Kim | N/A | |
| Disentangling Multiple Features in Video Sequences using Gaussian Processes in Variational Autoencoders | Sarthak Bhagat, Shagun Uppal, Zhuyun Yin, Nengli Lim | N/A | |
| SEN: A Novel Feature Normalization Dissimilarity Measure for Prototypical Few-Shot Learning Networks | Van Nhan Nguyen, Sigurd Løkse, Kristoffer Wickstrøm, Michael Kampffmeyer, Davide Roverso, Robert Jenssen | N/A | |
| Kinematic 3D Object Detection in Monocular Video | Garrick Brazil, Gerard Pons-Moll, Xiaoming Liu, Bernt Schiele | N/A | |
| Describing Unseen Videos via Multi-Modal Cooperative Dialog Agents | Ye Zhu, Yu Wu, Yi Yang, Yan Yan | N/A | |
| SACA Net: Cybersickness Assessment of Individual Viewers for VR Content via Graph-based Symptom Relation Embedding | Sangmin Lee, Jung Uk Kim, Hak Gu Kim, Seongyeop Kim, Yong Man Ro | N/A | |
| End-to-End Low Cost Compressive Spectral Imaging with Spatial-Spectral Self-Attention | Ziyi Meng, Jiawei Ma, Xin Yuan | N/A | |
| Know Your Surroundings: Exploiting Scene Information for Object Tracking | Goutam Bhat, Martin Danelljan, Luc Van Gool, Radu Timofte | N/A | |
| Practical Detection of Trojan Neural Networks: Data-Limited and Data-Free Cases | Ren Wang, Gaoyuan Zhang, Sijia Liu, Pin-Yu Chen, Jinjun Xiong, Meng Wang | N/A | |
| Anatomy-Aware Siamese Network: Exploiting Semantic Asymmetry for Accurate Pelvic Fracture Detection in X-ray Images | Haomin Chen, Yirui Wang, Kang Zheng, Weijian Li, Chi-Tung Chang, Adam P. Harrison, Jing Xiao, Gregory D. Hager, Le Lu, Chien-Hung Liao, Shun Miao | N/A | |
| DeepLandscape: Adversarial Modeling of Landscape Videos | Elizaveta Logacheva, Roman Suvorov, Oleg Khomenko, Anton Mashikhin, Victor Lempitsky | N/A | |
| GANwriting: Content-Conditioned Generation of Styled Handwritten Word Images | Lei Kang, Pau Riba, Yaxing Wang, Marçal Rusiñol, Alicia Fornés, Mauricio Villegas | N/A | |
| Spatial-Angular Interaction for Light Field Image Super-Resolution | Yingqian Wang, Longguang Wang, Jungang Yang, Wei An, Jingyi Yu, Yulan Guo | N/A | |
| BATS: Binary ArchitecTure Search | Adrian Bulat, Brais Martinez, Georgios Tzimiropoulos | N/A | |
| A Closer Look at Local Aggregation Operators in Point Cloud Analysis | Ze Liu, Han Hu, Yue Cao, Zheng Zhang, Xin Tong | N/A | |
| Look here! A parametric learning based approach to redirect visual attention | Youssef A. Mejjati, Celso F. Gomez, Kwang In Kim, Eli Shechtman, Zoya Bylinskii | N/A | |
| Variational Diffusion Autoencoders with Random Walk Sampling | Henry Li, Ofir Lindenbaum, Xiuyuan Cheng, Alexander Cloninger | N/A | |
| Adaptive Variance Based Label Distribution Learning For Facial Age Estimation | Xin Wen, Biying Li, Haiyun Guo, Zhiwei Liu, Guosheng Hu, Ming Tang, Jinqiao Wang | N/A | |
| Connecting the Dots: Detecting Adversarial Perturbations Using Context Inconsistency | Shasha Li, Shitong Zhu, Sudipta Paul, Amit Roy-Chowdhury, Chengyu Song, Srikanth Krishnamurthy, Ananthram Swami, Kevin S Chan | N/A | |
| Perceive, Predict, and Plan: Safe Motion Planning Through Interpretable Semantic Representations | Abbas Sadat, Sergio Casas, Mengye Ren, Xinyu Wu, Pranaab Dhawan, Raquel Urtasun | N/A | |
| VarSR: Variational Super-Resolution Network for Very Low Resolution Images | Sangeek Hyun, Jae-Pil Heo | N/A | |
| Co-Heterogeneous and Adaptive Segmentation from Multi-Source and Multi-Phase CT Imaging Data: A Study on Pathological Liver and Lesion Segmentation | Ashwin Raju, Chi-Tung Cheng, Yuankai Huo, Jinzheng Cai, Junzhou Huang, Jing Xiao, Le Lu, ChienHung Liao, Adam P. Harrison | N/A | |
| Towards Recognizing Unseen Categories in Unseen Domains | Massimiliano Mancini, Zeynep Akata, Elisa Ricci, Barbara Caputo | N/A | |
| Square Attack: a query-efficient black-box adversarial attack via random search | Maksym Andriushchenko, Francesco Croce, Nicolas Flammarion, Matthias Hein | N/A | |
| You Are Here: Geolocation by Embedding Maps and Images | Noe Samano, Mengjie Zhou, Andrew Calway | N/A | |
| Segmentations-Leak: Membership Inference Attacks and Defenses in Semantic Image Segmentation | Yang He, Shadi Rahimian, Bernt Schiele, Mario Fritz | N/A | |
| From Image to Stability: Learning Dynamics from Human Pose | Jesse Scott, Bharadwaj Ravichandran, Christopher Funk, Robert T. Collins, Yanxi Liu | N/A | |
| LevelSet R-CNN: A Deep Variational Method for Instance Segmentation | Namdar Homayounfar, Yuwen Xiong, Justin Liang, Wei-Chiu Ma, Raquel Urtasun | N/A | |
| Efficient Scale-Permuted Backbone with Learned Resource Distribution | Xianzhi Du, Tsung-Yi Lin, Pengchong Jin, Yin Cui, Mingxing Tan, Quoc Le, Xiaodan Song | N/A | |
| Reducing Distributional Uncertainty by Mutual Information Maximisation and Transferable Feature Learning | Jian Gao, Yang Hua, Guosheng Hu, Chi Wang, Neil M. Robertson | N/A | |
| Bridging Knowledge Graphs to Generate Scene Graphs | Alireza Zareian, Svebor Karaman, Shih-Fu Chang | N/A | |
| Implicit Latent Variable Model for Scene-Consistent Motion Forecasting | Sergio Casas, Cole Gulino, Simon Suo, Katie Luo, Renjie Liao, Raquel Urtasun | N/A | |
| Learning Visual Commonsense for Robust Scene Graph Generation | Alireza Zareian, Zhecan Wang, Haoxuan You, Shih-Fu Chang | N/A | |
| MPCC: Matching Priors and Conditionals for Clustering | Nicolás Astorga, Pablo Huijse, Pavlos Protopapas, Pablo Estévez | N/A | |
| PointAR: Efficient Lighting Estimation for Mobile Augmented Reality | Yiqin Zhao, Tian Guo | N/A | |
| Discrete Point Flow Networks for Efficient Point Cloud Generation | Roman Klokov, Edmond Boyer, Jakob Verbeek | N/A | |
| Accelerating Deep Learning with Millions of Classes | Zhuoning Yuan, Zhishuai Guo, Xiaotian Yu, Xiaoyu Wang, Tianbao Yang | N/A | |
| Password-conditioned Anonymization and Deanonymization with Face Identity Transformers | Xiuye Gu, Weixin Luo, Michael S. Ryoo, Yong Jae Lee | N/A | |
| Inertial Safety from Structured Light | Sizhuo Ma, Mohit Gupta | N/A | |
| PointTriNet: Learned Triangulation of 3D Point Sets | Nicholas Sharp, Maks Ovsjanikov | N/A | |
| Toward Unsupervised, Multi-Object Discovery in Large-Scale Image Collections | Huy V. Vo, Patrick Pérez, Jean Ponce | N/A | |
| Deep Novel View Synthesis from Colored 3D Point Clouds | Zhenbo Song, Wayne Chen, Dylan Campbell, Hongdong Li | N/A | |
| Consensus-Aware Visual-Semantic Embedding for Image-Text Matching | Haoran Wang, Ying Zhang, Zhong Ji, Yanwei Pang, Lin Ma | N/A | |
| Spatial Hierarchy Aware Residual Pyramid Network for Time-of-Flight Depth Denoising | Guanting Dong, Yueyi Zhang, Zhiwei Xiong | N/A | |
| Sat2Graph: Road Graph Extraction through Graph-Tensor Encoding | Songtao He, Favyen Bastani, Satvat Jagwani, Mohammad Alizadeh, Hari Balakrishnan, Sanjay Chawla, Mohamed M. Elshrif, Samuel Madden, Mohammad Amin Sadeghi | N/A | |
| Cross-Task Transfer for Geotagged Audiovisual Aerial Scene Recognition | Di Hu, Xuhong Li, Lichao Mou, Pu Jin, Dong Chen, Liping Jing, Xiaoxiang Zhu, Dejing Dou | N/A | |
| Polarimetric Multi-View Inverse Rendering | Jinyu Zhao, Yusuke Monno, Masatoshi Okutomi | N/A | |
| SideInfNet: A Deep Neural Network for Semi-Automatic Semantic Segmentation with Side Information | Jing Yu Koh, Duc Thanh Nguyen, Quang-Trung Truong, Sai-Kit Yeung, Alexander Binder | N/A | |
| Improving Face Recognition by Clustering Unlabeled Faces in the Wild | Aruni RoyChowdhury, Xiang Yu, Kihyuk Sohn, Erik Learned-Miller, Manmohan Chandraker | N/A | |
| NeuRoRA: Neural Robust Rotation Averaging | Pulak Purkait, Tat-Jun Chin, Ian Reid | N/A | |
| SG-VAE: Scene Grammar Variational Autoencoder to generate new indoor scenes | Pulak Purkait, Christopher Zach, Ian Reid | N/A | |
| Unsupervised Learning of Optical Flow with Deep Feature Similarity | Woobin Im, Tae-Kyun Kim, Sung-Eui Yoon | N/A | |
| Blended Grammar Network for Human Parsing | Xiaomei Zhang, Yingying Chen, Bingke Zhu, Jinqiao Wang, Ming Tang | N/A | |
| P²Net: Patch-match and Plane-regularization for Unsupervised Indoor Depth Estimation | Zehao Yu, Lei Jin, Shenghua Gao | N/A | |
| Efficient Attention Mechanism for Visual Dialog that can Handle All the Interactions between Multiple Inputs | Van-Quang Nguyen, Masanori Suganuma, Takayuki Okatani | N/A | |
| Adaptive Mixture Regression Network with Local Counting Map for Crowd Counting | Xiyang Liu, Jie Yang, Wenrui Ding, Tieqiang Wang, Zhijin Wang, Junjun Xiong | N/A | |
| BIRNAT: Bidirectional Recurrent Neural Networks with Adversarial Training for Video Snapshot Compressive Imaging | Ziheng Cheng, Ruiying Lu, Zhengjue Wang, Hao Zhang, Bo Chen, Ziyi Meng, Xin Yuan | N/A | |
| Ultra Fast Structure-aware Deep Lane Detection | Zequn Qin, Huanyu Wang, Xi Li | N/A | |
| Cross-Identity Motion Transfer for Arbitrary Objects through Pose-Attentive Video Reassembling | Subin Jeon, Seonghyeon Nam, Seoung Wug Oh, Seon Joo Kim | N/A | |
| Domain Adaptive Object Detection via Asymmetric Tri-way Faster-RCNN | Zhenwei He, Lei Zhang | N/A | |
| Exclusivity-Consistency Regularized Knowledge Distillation for Face Recognition | Xiaobo Wang, Tianyu Fu, Shengcai Liao, Shuo Wang, Zhen Lei, Tao Mei | N/A | |
| Learning Camera-Aware Noise Models | Ke-Chi Chang, Ren Wang, Hung-Jin Lin, Yu-Lun Liu, Chia-Ping Chen, Yu-Lin Chang, Hwann-Tzong Chen | N/A | |
| Towards Precise Completion of Deformable Shapes | Oshri Halimi, Ido Imanuel, Or Litany, Giovanni Trappolini, Emanuele Rodolà, Leonidas Guibas, Ron Kimmel | N/A | |
| Iterative Distance-Aware Similarity Matrix Convolution with Mutual-Supervised Point Elimination for Efficient Point Cloud Registration | Jiahao Li, Changhao Zhang, Ziyao Xu, Hangning Zhou, Chi Zhang | N/A | |
| Pairwise Similarity Knowledge Transfer for Weakly Supervised Object Localization | Amir Rahimi, Amirreza Shaban, Thalaiyasingam Ajanthan, Richard Hartley, Byron Boots | N/A | |
| Environment-agnostic Multitask Learning for Natural Language Grounded Navigation | Xin Eric Wang, Vihan Jain, Eugene Ie, William Yang Wang, Zornitsa Kozareva, Sujith Ravi | N/A | |
| TPFN: Applying Outer Product along Time to Multimodal Sentiment Analysis Fusion on Incomplete Data | Binghua Li, Chao Li, Feng Duan, Ning Zheng, Qibin Zhao | N/A | |
| ProxyNCA++: Revisiting and Revitalizing Proxy Neighborhood Component Analysis | Eu Wern Teh, Terrance DeVries, Graham W. Taylor | N/A | |
| Learning with Privileged Information for Efficient Image Super-Resolution | Wonkyung Lee, Junghyup Lee, Dohyung Kim, Bumsub Ham | N/A | |
| Joint Visual and Temporal Consistency for Unsupervised Domain Adaptive Person Re-Identification | Jianing Li, Shiliang Zhang | N/A | |
| Autoencoder-based Graph Construction for Semi-supervised Learning | Mingeun Kang, Kiwon Lee, Yong H. Lee, Changho Suh | N/A | |
| Virtual Multi-view Fusion for 3D Semantic Segmentation | Abhijit Kundu, Xiaoqi Yin, Alireza Fathi, David Ross, Brian Brewington, Thomas Funkhouser, Caroline Pantofaru | N/A | |
| Decoupling GCN with DropGraph Module for Skeleton-Based Action Recognition | Ke Cheng, Yifan Zhang, Congqi Cao, Lei Shi, Jian Cheng, Hanqing Lu | N/A | |
| Deep Shape from Polarization | Yunhao Ba, Alex Gilbert, Franklin Wang, Jinfa Yang, Rui Chen, Yiqin Wang, Lei Yan, Boxin Shi, Achuta Kadambi | N/A | |
| A Boundary Based Out-of-Distribution Classifier for Generalized Zero-Shot Learning | Xingyu Chen, Xuguang Lan, Fuchun Sun, Nanning Zheng | N/A | |
| Mind the Discriminability: Asymmetric Adversarial Domain Adaptation | Jianfei Yang, Han Zou, Yuxun Zhou, Zhaoyang Zeng, Lihua Xie | N/A | |
| SeqXY2SeqZ: Structure Learning for 3D Shapes by Sequentially Predicting 1D Occupancy Segments From 2D Coordinates | Zhizhong Han, Guanhui Qiao, Yu-Shen Liu, Matthias Zwicker | N/A | |
| Simultaneous Detection and Tracking with Motion Modelling for Multiple Object Tracking | ShiJie Sun, Naveed Akhtar, XiangYu Song, HuanSheng Song, Ajmal Mian, Mubarak Shah | N/A | |
| Deep FusionNet for Point Cloud Semantic Segmentation | Feihu Zhang, Jin Fang, Benjamin Wah, Philip Torr | N/A | |
| Deep Material Recognition in Light-Fields via Disentanglement of Spatial and Angular Information | Bichuan Guo, Jiangtao Wen, Yuxing Han | N/A | |
| Dual Adversarial Network for Deep Active Learning | Shuo Wang, Yuexiang Li, Kai Ma, Ruhui Ma, Haibing Guan, Yefeng Zheng | N/A | |
| Fully Convolutional Networks for Continuous Sign Language Recognition | Ka Leong Cheng, Zhaoyang Yang, Qifeng Chen, Yu-Wing Tai | N/A | |
| Self-adapting confidence estimation for stereo | Matteo Poggi, Filippo Aleotti, Fabio Tosi, Giulio Zaccaroni, Stefano Mattoccia | N/A | |
| Deep Surface Normal Estimation on the 2-Sphere with Confidence Guided Semantic Attention | Quewei Li, Jie Guo, Yang Fei, Qinyu Tang, Wenxiu Sun, Jin Zeng, Yanwen Guo | N/A | |
| AutoSTR: Efficient Backbone Search for Scene Text Recognition | Hui Zhang, Quanming Yao, Mingkun Yang, Yongchao Xu, Xiang Bai | N/A | |
| Mitigating Embedding and Class Assignment Mismatch in Unsupervised Image Classification | Sungwon Han, Sungwon Park, Sungkyu Park, Sundong Kim, Meeyoung Cha | N/A | |
| Adversarial Training with Bi-directional Likelihood Regularization for Visual Classification | Weitao Wan, Jiansheng Chen, Ming-Hsuan Yang | N/A | |
| Faster AutoAugment: Learning Augmentation Strategies Using Backpropagation | Ryuichiro Hataya, Zdenek Jan, Kazuki Yoshizoe, Hideki Nakayama | N/A | |
| Hand-Transformer: Non-Autoregressive Structured Modeling for 3D Hand Pose Estimation | Lin Huang, Jianchao Tan, Ji Liu, Junsong Yuan | N/A | |
| Boundary-Aware Cascade Networks for Temporal Action Segmentation | Zhenzhi Wang, Ziteng Gao, Limin Wang, Zhifeng Li, Gangshan Wu | N/A | |
| Towards Content-Independent Multi-Reference Super-Resolution: Adaptive Pattern Matching and Feature Aggregation | Xu Yan, Weibing Zhao, Kun Yuan, Ruimao Zhang, Zhen Li, Shuguang Cui | N/A | |
| Inference Graphs for CNN Interpretation | Yael Konforti, Alon Shpigler, Boaz Lerner, Aharon Bar-Hillel | N/A | |
| An End-to-End OCR Text Re-organization Sequence Learning for Rich-text Detail Image Comprehension | Liangcheng Li, Feiyu Gao, Jiajun Bu, Yongpan Wang, Zhi Yu, Qi Zheng | N/A | |
| Improving Query Efficiency of Black-box Adversarial Attack | Yang Bai, Yuyuan Zeng, Yong Jiang, Yisen Wang, Shu-Tao Xia, Weiwei Guo | N/A | |
| Self-similarity Student for Partial Label Histopathology Image Segmentation | Hsien-Tzu Cheng, Chun-Fu Yeh, Po-Chen Kuo, Andy Wei, Keng-Chi Liu, Mong-Chi Ko, Kuan-Hua Chao, Yu-Ching Peng, Tyng-Luh Liu | N/A | |
| BioMetricNet: deep unconstrained face verification through learning of metrics regularized onto Gaussian distributions | Arslan Ali, Matteo Testa, Tiziano Bianchi, Enrico Magli | N/A | |
| A Decoupled Learning Scheme for Real-world Burst Denoising from Raw Images | Zhetong Liang, Shi Guo, Hong Gu, Huaqi Zhang, Lei Zhang | N/A | |
| Global-and-Local Relative Position Embedding for Unsupervised Video Summarization | Yunjae Jung, Donghyeon Cho, Sanghyun Woo, In So Kweon | N/A | |
| Real-World Blur Dataset for Learning and Benchmarking Deblurring Algorithms | Jaesung Rim, Haeyun Lee, Jucheol Won, Sunghyun Cho | N/A | |
| SPARK: Spatial-aware Online Incremental Attack Against Visual Tracking | Qing Guo, Xiaofei Xie, Felix Juefei-Xu, Lei Ma, Zhongguo Li, Wanli Xue, Wei Feng, Yang Liu | N/A | |
| CenterNet Heatmap Propagation for Real-time Video Object Detection | Zhujun Xu, Emir Hrustic, Damien Vivet | N/A | |
| Hierarchical Dynamic Filtering Network for RGB-D Salient Object Detection | Youwei Pang, Lihe Zhang, Xiaoqi Zhao, Huchuan Lu | N/A | |
| SOLAR: Second-Order Loss and Attention for Image Retrieval | Tony Ng, Vassileios Balntas, Yurun Tian, Krystian Mikolajczyk | N/A | |
| Fixing Localization Errors to Improve Image Classification | Guolei Sun, Salman Khan, Wen Li, Hisham Cholakkal, Fahad Shahbaz Khan, Luc Van Gool | N/A | |
| PatchPerPix for Instance Segmentation | Lisa Mais, Peter Hirsch, Dagmar Kainmueller | N/A | |
| Attend and Segment: Attention Guided Active Semantic Segmentation | Soroush Seifi, Tinne Tuytelaars | N/A | |
| Accelerating CNN Training by Pruning Activation Gradients | Xucheng Ye, Pengcheng Dai, Junyu Luo, Xin Guo, Yingjie Qi, Jianlei Yang, Yiran Chen | N/A | |
| Global and Local Enhancement Networks for Paired and Unpaired Image Enhancement | Han-Ul Kim, Young Jun Koh, Chang-Su Kim | N/A | |
| Probabilistic Anchor Assignment with IoU Prediction for Object Detection | Kang Kim, Hee Seok Lee | N/A | |
| Eyeglasses 3D shape reconstruction from a single face image | Yating Wang, Quan Wang, Feng Xu | N/A | |
| Temporal Complementary Learning for Video Person Re-Identification | Ruibing Hou, Hong Chang, Bingpeng Ma, Shiguang Shan, Xilin Chen | N/A | |
| HoughNet: Integrating near and long-range evidence for bottom-up object detection | Nermin Samet, Samet Hicsonmez, Emre Akbas | N/A | |
| Graph Wasserstein Correlation Analysis for Movie Retrieval | Xueya Zhang, Tong Zhang, Xiaobin Hong, Zhen Cui, Jian Yang | N/A | |
| Context-Aware RCNN: A Baseline for Action Detection in Videos | Jianchao Wu, Zhanghui Kuang, Limin Wang, Wayne Zhang, Gangshan Wu | N/A | |
| Full-Time Monocular Road Detection Using Zero-Distribution Prior of Angle of Polarization | Ning Li, Yongqiang Zhao, Quan Pan, Seong G. Kong, Jonathan Cheung-Wai Chan | N/A | |
| A Flexible Recurrent Residual Pyramid Network for Video Frame Interpolation | Haoxian Zhang, Yang Zhao, Ronggang Wang | N/A | |
| Learning Enriched Features for Real Image Restoration and Enhancement | Syed Waqas Zamir, Aditya Arora, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, Ming-Hsuan Yang, Ling Shao | N/A | |
| Detail Preserved Point Cloud Completion via Separated Feature Aggregation | Wenxiao Zhang, Qingan Yan, Chunxia Xiao | N/A | |
| LabelEnc: A New Intermediate Supervision Method for Object Detection | Miao Hao, Yitao Liu, Xiangyu Zhang, Jian Sun | N/A | |
| Unsupervised Learning of Category-Specific Symmetric 3D Keypoints from Point Sets | Clara Fernandez-Labrador, Ajad Chhatkuli, Danda Pani Paudel, Jose J. Guerrero, Cédric Demonceaux, Luc Van Gool | N/A | |
| PAMS: Quantized Super-Resolution via Parameterized Max Scale | Huixia Li, Chenqian Yan, Shaohui Lin, Xiawu Zheng, Baochang Zhang, Fan Yang, Rongrong Ji | N/A | |
| SSN: Shape Signature Networks for Multi-class Object Detection from Point Clouds | Xinge Zhu, Yuexin Ma, Tai Wang, Yan Xu, Jianping Shi, Dahua Lin | N/A | |
| OID: Outlier Identifying and Discarding in Blind Image Deblurring | Liang Chen, Faming Fang, Jiawei Zhang, Jun Liu, Guixu Zhang | N/A | |
| Few-Shot Single-View 3-D Object Reconstruction with Compositional Priors | Mateusz Michalkiewicz, Sarah Parisot, Stavros Tsogkas, Mahsa Baktashmotlagh, Anders Eriksson, Eugene Belilovsky | N/A | |
| Enhanced Sparse Model for Blind Deblurring | Liang Chen, Faming Fang, Shen Lei, Fang Li, Guixu Zhang | N/A | |
| SumGraph: Video Summarization via Recursive Graph Modeling | Jungin Park, Jiyoung Lee, Ig-Jae Kim, Kwanghoon Sohn | N/A | |
| Feature Normalized Knowledge Distillation for Image Classification | Kunran Xu, Lai Rui, Yishi Li, Lin Gu | N/A | |
| A Metric Learning Reality Check | Kevin Musgrave, Serge Belongie, Ser-Nam Lim | N/A | |
| FTL: A universal framework for training low-bit DNNs via Feature Transfer | Kunyuan Du, Ya Zhang, Haibing Guan, Qi Tian, Shenggan Cheng, James Lin | N/A | |
| XingGAN for Person Image Generation | Hao Tang, Song Bai, Li Zhang, Philip H.S. Torr, Nicu Sebe | N/A | |
| GATCluster: Self-Supervised Gaussian-Attention Network for Image Clustering | Chuang Niu, Jun Zhang, Ge Wang, Jimin Liang | N/A | |
| VCNet: A Robust Approach to Blind Image Inpainting | Yi Wang, Ying-Cong Chen, Xin Tao, Jiaya Jia | N/A | |
| Learning to Predict Context-adaptive Convolution for Semantic Segmentation | Jianbo Liu, Junjun He, Yu Qiao, Jimmy S. Ren, Hongsheng Li | N/A | |
| EfficientFCN: Holistically-guided Decoding for Semantic Segmentation | Jianbo Liu, Junjun He, Jiawei Zhang, Jimmy S. Ren, Hongsheng Li | N/A | |
| GroSS: Group-Size Series Decomposition for Grouped Architecture Search | Henry Howard-Jenkins, Yiwen Li, Victor Adrian Prisacariu | N/A | |
| Efficient Adversarial Attacks for Visual Object Tracking | Siyuan Liang, Xingxing Wei, Siyuan Yao, Xiaochun Cao | N/A | |
| Globally-Optimal Event Camera Motion Estimation | Xin Peng, Yifu Wang, Ling Gao, Laurent Kneip | N/A | |
| Weakly-supervised Learning of Human Dynamics | Petrissa Zell, Bodo Rosenhahn, Bastian Wandt | N/A | |
| Journey Towards Tiny Perceptual Super-Resolution | Royson Lee, Łukasz Dudziak, Mohamed Abdelfattah, Stylianos I. Venieris, Hyeji Kim, Hongkai Wen, Nicholas D. Lane | N/A | |
| What makes fake images detectable? Understanding properties that generalize | Lucy Chai, David Bau, Ser-Nam Lim, Phillip Isola | N/A | |
| Embedding Propagation: Smoother Manifold for Few-Shot Classification | Pau Rodríguez, Issam Laradji, Alexandre Drouin, Alexandre Lacoste | N/A | |
| Category Level Object Pose Estimation via Neural Analysis-by-Synthesis | Xu Chen, Zijian Dong, Jie Song, Andreas Geiger, Otmar Hilliges | N/A | |
| High-Fidelity Synthesis with Disentangled Representation | Wonkwang Lee, Donggyun Kim, Seunghoon Hong, Honglak Lee | N/A | |
| PL₁P - Point-line Minimal Problems under Partial Visibility in Three Views | Timothy Duff, Kathlén Kohn, Anton Leykin, Tomas Pajdla | N/A | |
| Prediction and Recovery for Adaptive Low-Resolution Person Re-Identification | Ke Han, Yan Huang, Zerui Chen, Liang Wang, Tieniu Tan | N/A | |
| Learning Canonical Representations for Scene Graph to Image Generation | Roei Herzig, Amir Bar, Huijuan Xu, Gal Chechik, Trevor Darrell, Amir Globerson | N/A | |
| Adversarial Robustness on In- and Out-Distribution Improves Explainability | Maximilian Augustin, Alexander Meinke, Matthias Hein | N/A | |
| Deformable Style Transfer | Sunnie S. Y. Kim, Nicholas Kolkin, Jason Salavon, Gregory Shakhnarovich | N/A | |
| Aligning Videos in Space and Time | Senthil Purushwalkam, Tian Ye, Saurabh Gupta, Abhinav Gupta | N/A | |
| Neural Wireframe Renderer: Learning Wireframe to Image Translations | Yuan Xue, Zihan Zhou, Xiaolei Huang | N/A | |
| RBF-Softmax: Learning Deep Representative Prototypes with Radial Basis Function Softmax | Xiao Zhang, Rui Zhao, Yu Qiao, Hongsheng Li | N/A | |
| Testing the Safety of Self-driving Vehicles by Simulating Perception and Prediction | Kelvin Wong, Qiang Zhang, Ming Liang, Bin Yang, Renjie Liao, Abbas Sadat, Raquel Urtasun | N/A | |
| Determining the Relevance of Features for Deep Neural Networks | Christian Reimers, Jakob Runge, Joachim Denzler | N/A | |
| Weakly Supervised Semantic Segmentation with Boundary Exploration | Liyi Chen, Weiwei Wu, Chenchen Fu, Xiao Han, Yuntao Zhang | N/A | |
| GANHopper: Multi-Hop GAN for Unsupervised Image-to-Image Translation | Wallace Lira, Johannes Merz, Daniel Ritchie, Daniel Cohen-Or, Hao Zhang | N/A | |
| DOPE: Distillation Of Part Experts for whole-body 3D pose estimation in the wild | Philippe Weinzaepfel, Romain Brégier, Hadrien Combaluzier, Vincent Leroy, Grégory Rogez | N/A | |
| Multi-view adaptive graph convolutions for graph classification | Nikolas Adaloglou, Nicholas Vretos, Petros Daras | N/A | |
| Instance Adaptive Self-Training for Unsupervised Domain Adaptation | Ke Mei, Chuang Zhu, Jiaqi Zou, Shanghang Zhang | N/A | |
| Weight Decay Scheduling and Knowledge Distillation for Active Learning | Juseung Yun, Byungjoo Kim, Junmo Kim | N/A | |
| HMQ: Hardware Friendly Mixed Precision Quantization Block for CNNs | Hai Victor Habi, Roy H. Jennings, Arnon Netzer | N/A | |
| Truncated Inference for Latent Variable Optimization Problems: Application to Robust Estimation and Learning | Christopher Zach, Huu Le | N/A | |
| Geometry Constrained Weakly Supervised Object Localization | Weizeng Lu, Xi Jia, Weicheng Xie, Linlin Shen, Yicong Zhou, Jinming Duan | N/A | |
| Duality Diagram Similarity: a generic framework for initialization selection in task transfer learning | Kshitij Dwivedi, Jiahui Huang, Radoslaw Martin Cichy, Gemma Roig | N/A | |
| OneGAN: Simultaneous Unsupervised Learning of Conditional Image Generation, Foreground Segmentation, and Fine-Grained Clustering | Yaniv Benny, Lior Wolf | N/A | |
| Mining self-similarity: Label super-resolution with epitomic representations | Nikolay Malkin, Anthony Ortiz, Nebojsa Jojic | N/A | |
| AE-OT-GAN: Training GANs from data specific latent distribution | Dongsheng An, Yang Guo, Min Zhang, Xin Qi, Na Lei, Xianfang Gu | N/A | |
| Null-sampling for Interpretable and Fair Representations | Thomas Kehrenberg, Myles Bartlett, Oliver Thomas, Novi Quadrianto | N/A | |
| Guiding Monocular Depth Estimation Using Depth-Attention Volume | Lam Huynh, Phong Nguyen-Ha, Jiri Matas, Esa Rahtu, Janne Heikkilä | N/A | |
| Tracking Emerges by Looking Around Static Scenes, with Neural 3D Mapping | Adam W. Harley, Shrinidhi Kowshika Lakshmikanth, Paul Schydlo, Katerina Fragkiadaki | N/A | |
| Boosting Weakly Supervised Object Detection with Progressive Knowledge Transfer | Yuanyi Zhong, Jianfeng Wang, Jian Peng, Lei Zhang | N/A | |
| BézierSketch: A generative model for scalable vector sketches | Ayan Das, Yongxin Yang, Timothy Hospedales, Tao Xiang, Yi-Zhe Song | N/A | |
| Semantic Relation Preserving Knowledge Distillation for Image-to-Image Translation | Zeqi Li, Ruowei Jiang, Parham Aarabi | N/A | |
| Domain Adaptation Through Task Distillation | Brady Zhou, Nimit Kalra, Philipp Krähenbühl | N/A | |
| PatchAttack: A Black-box Texture-based Attack with Reinforcement Learning | Chenglin Yang, Adam Kortylewski, Cihang Xie, Yinzhi Cao, Alan Yuille | N/A | |
| More Classifiers, Less Forgetting: A Generic Multi-classifier Paradigm for Incremental Learning | Yu Liu, Sarah Parisot, Gregory Slabaugh, Xu Jia, Ales Leonardis, Tinne Tuytelaars | N/A | |
| Extending and Analyzing Self-Supervised Learning Across Domains | Bram Wallace, Bharath Hariharan | N/A | |
| Multi-Source Open-Set Deep Adversarial Domain Adaptation | Sayan Rakshit, Dipesh Tamboli, Pragati Shuddhodhan Meshram, Biplab Banerjee, Gemma Roig, Subhasis Chaudhuri | N/A | |
| Neural Batch Sampling with Reinforcement Learning for Semi-Supervised Anomaly Detection | Wen-Hsuan Chu, Kris M. Kitani | N/A | |
| LEMMA: A Multi-view Dataset for LEarning Multi-agent Multi-task Activities | Baoxiong Jia, Yixin Chen, Siyuan Huang, Yixin Zhu, Song-Chun Zhu | N/A | |
| Teaching Cameras to Feel: Estimating Tactile Physical Properties of Surfaces From Images | Matthew Purri, Kristin Dana | N/A | |
| Accurate Optimization of Weighted Nuclear Norm for Non-Rigid Structure from Motion | José Pedro Iglesias, Carl Olsson, Marcus Valtonen Örnhag | N/A | |
| Proposal-based Video Completion | Yuan-Ting Hu, Heng Wang, Nicolas Ballas, Kristen Grauman, Alexander G. Schwing | N/A | |
| HGNet: Hybrid Generative Network for Zero-shot Domain Adaptation | Haifeng Xia, Zhengming Ding | N/A | |
| Beyond Monocular Deraining: Stereo Image Deraining via Semantic Understanding | Kaihao Zhang, Wenhan Luo, Wenqi Ren, Jingwen Wang, Fang Zhao, Lin Ma, Hongdong Li | N/A | |
| DBQ: A Differentiable Branch Quantizer for Lightweight Deep Neural Networks | Hassan Dbouk, Hetul Sanghvi, Mahesh Mehendale, Naresh Shanbhag | N/A | |
| All at Once: Temporally Adaptive Multi-Frame Interpolation with Advanced Motion Modeling | Zhixiang Chi, Rasoul Mohammadi Nasiri, Zheng Liu, Juwei Lu, Jin Tang, Konstantinos N Plataniotis | N/A | |
| A Broader Study of Cross-Domain Few-Shot Learning | Yunhui Guo, Noel C. Codella, Leonid Karlinsky, James V. Codella, John R. Smith, Kate Saenko, Tajana Rosing, Rogerio Feris | N/A | |
| Practical Poisoning Attacks on Neural Networks | Junfeng Guo, Cong Liu | N/A | |
| Unsupervised Domain Adaptation in the Dissimilarity Space for Person Re-identification | Djebril Mekhazni, Amran Bhuiyan, George Ekladious, Eric Granger | N/A | |
| Learn distributed GAN with Temporary Discriminators | Hui Qu, Yikai Zhang, Qi Chang, Zhennan Yan, Chao Chen, Dimitris Metaxas | N/A | |
| SemifreddoNets: Partially Frozen Neural Networks for Efficient Computer Vision Systems | Leo F Isikdogan, Bhavin V Nayak, Chyuan-Tyng Wu, Joao Peralta Moreira, Sushma Rao, Gilad Michael | N/A | |
| Improving Adversarial Robustness by Enforcing Local and Global Compactness | Anh Bui, Trung Le, He Zhao, Paul Montague, Olivier deVel, Tamas Abraham, Dinh Phung | N/A | |
| TopoAL: An Adversarial Learning Approach for Topology-Aware Road Segmentation | Subeesh Vasu, Mateusz Kozinski, Leonardo Citraro, Pascal Fua | N/A | |
| Channel selection using Gumbel Softmax | Charles Herrmann, Richard Strong Bowen, Ramin Zabih | N/A | |
| Exploiting Temporal Coherence for Self-Supervised One-shot Video Re-identification | Dripta S. Raychaudhuri, Amit K. Roy-Chowdhury | N/A | |
| An Efficient Training Framework for Reversible Neural Architectures | Zixuan Jiang, Keren Zhu, Mingjie Liu, Jiaqi Gu, David Z. Pan | N/A | |
| Box2Seg: Attention Weighted Loss and Discriminative Feature Learning for Weakly Supervised Segmentation | Viveka Kulharia, Siddhartha Chandra, Amit Agrawal, Philip Torr, Ambrish Tyagi | N/A | |
| FreeCam3D: Snapshot Structured Light 3D with Freely-Moving Cameras | Yicheng Wu, Vivek Boominathan, Xuan Zhao, Jacob T. Robinson, Hiroshi Kawasaki, Aswin Sankaranarayanan, Ashok Veeraraghavan | N/A | |
| One-Pixel Signature: Characterizing CNN Models for Backdoor Detection | Shanjiaoyang Huang, Weiqi Peng, Zhiwei Jia, Zhuowen Tu | N/A | |
| Learning to Transfer Learn: Reinforcement Learning-Based Selection for Adaptive Transfer Learning | Linchao Zhu, Sercan Ö. Arık, Yi Yang, Tomas Pfister | N/A | |
| Structure-Aware Generation Network for Recipe Generation from Images | Hao Wang, Guosheng Lin, Steven C. H. Hoi, Chunyan Miao | N/A | |
| A Simple and Effective Framework for Pairwise Deep Metric Learning | Qi Qi, Yan Yan, Zixuan Wu, Xiaoyu Wang, Tianbao Yang | N/A | |
| Meta-rPPG: Remote Heart Rate Estimation Using a Transductive Meta-Learner | Eugene Lee, Evan Chen, Chen-Yi Lee | N/A | |
| A Recurrent Transformer Network for Novel View Action Synthesis | Kara Marie Schatz, Erik Quintanilla, Shruti Vyas, Yogesh S Rawat | N/A | |
| Multi-view Action Recognition using Cross-view Video Prediction | Shruti Vyas, Yogesh S Rawat, Mubarak Shah | N/A | |
| Learning Discriminative Feature with CRF for Unsupervised Video Object Segmentation | Mingmin Zhen, Shiwei Li, Lei Zhou, Jiaxiang Shang, Haoan Feng, Tian Fang, Long Quan | N/A | |
| SMART: Simultaneous Multi-Agent Recurrent Trajectory Prediction | Sriram N N, Buyu Liu, Francesco Pittaluga, Manmohan Chandraker | N/A | |
| Label-Driven Reconstruction for Domain Adaptation in Semantic Segmentation | Jinyu Yang, Weizhi An, Sheng Wang, Xinliang Zhu, Chaochao Yan, Junzhou Huang | N/A | |
| Efficient Outdoor 3D Point Cloud Semantic Segmentation for Critical Road Objects and Distributed Contexts | Chi-Chong Wong, Chi-Man Vong | N/A | |
| Attributional Robustness Training using Input-Gradient Spatial Alignment | Mayank Singh, Nupur Kumari, Puneet Mangla, Abhishek Sinha, Vineeth N Balasubramanian, Balaji Krishnamurthy | N/A | |
| Reducing the Sim-to-Real Gap for Event Cameras | Timo Stoffregen, Cedric Scheerlinck, Davide Scaramuzza, Tom Drummond, Nick Barnes, Lindsay Kleeman, Robert Mahony | N/A | |
| Spatial Geometric Reasoning for Room Layout Estimation via Deep Reinforcement Learning | Liangliang Ren, Yangyang Song, Jiwen Lu, Jie Zhou | N/A | |
| Learning Data Augmentation Strategies for Object Detection | Barret Zoph, Ekin D. Cubuk, Golnaz Ghiasi, Tsung-Yi Lin, Jonathon Shlens, Quoc V. Le | N/A | |
| DA-NAS: Data Adapted Pruning for Efficient Neural Architecture Search | Xiyang Dai, Dongdong Chen, Mengchen Liu, Yinpeng Chen, Lu Yuan | N/A | |
| A Closer Look at Generalisation in RAVEN | Steven Spratley, Krista Ehinger, Tim Miller | N/A | |
| Supervised Edge Attention Network for Accurate Image Instance Segmentation | Xier Chen, Yanchao Lian, Licheng Jiao, Haoran Wang, YanJie Gao, Shi Lingling | N/A | |
| Discriminative Partial Domain Adversarial Network | Jian Hu, Hongya Tuo, Chao Wang, Lingfeng Qiao, Haowen Zhong, Junchi Yan, Zhongliang Jing, Henry Leung | N/A | |
| Differentiable Programming for Hyperspectral Unmixing using a Physics-based Dispersion Model | John Janiczek, Parth Thaker, Gautam Dasarathy, Christopher S. Edwards, Philip Christensen, Suren Jayasuriya | N/A | |
| Deep Cross-species Feature Learning for Animal Face Recognition via Residual Interspecies Equivariant Network | Xiao Shi, Chenxue Yang, Xue Xia, Xiujuan Chai | N/A | |
| Guidance and Evaluation: Semantic-Aware Image Inpainting for Mixed Scenes | Liang Liao, Jing Xiao, Zheng Wang, Chia-Wen Lin, Shin’ichi Satoh | N/A | |
| Sound2Sight: Generating Visual Dynamics from Sound and Context | Moitreya Chatterjee, Anoop Cherian | N/A | |
| 3D-CVF: Generating Joint Camera and LiDAR Features Using Cross-View Spatial Feature Fusion for 3D Object Detection | Jin Hyeok Yoo, Yecheol Kim, Jisong Kim, Jun Won Choi | N/A | |
| NoiseRank: Unsupervised Label Noise Reduction with Dependence Models | Karishma Sharma, Pinar Donmez, Enming Luo, Yan Liu, I. Zeki Yalniz | N/A | |
| Fast Adaptation to Super-Resolution Networks via Meta-Learning | Seobin Park, Jinsu Yoo, Donghyeon Cho, Jiwon Kim, Tae Hyun Kim | N/A | |
| TP-LSD: Tri-Points Based Line Segment Detector | Siyu Huang, Fangbo Qin, Pengfei Xiong, Ning Ding, Yijia He, Xiao Liu | N/A | |
| SqueezeSegV3: Spatially-Adaptive Convolution for Efficient Point-Cloud Segmentation | Chenfeng Xu, Bichen Wu, Zining Wang, Wei Zhan, Peter Vajda, Kurt Keutzer, Masayoshi Tomizuka | N/A | |
| An Attention-driven Two-stage Clustering Method for Unsupervised Person Re-Identification | Zilong Ji, Xiaolong Zou, Xiaohan Lin, Xiao Liu, Tiejun Huang, Si Wu | N/A | |
| Toward Fine-grained Facial Expression Manipulation | Jun Ling, Han Xue, Li Song, Shuhui Yang, Rong Xie, Xiao Gu | N/A | |
| Adaptive Object Detection with Dual Multi-Label Prediction | Zhen Zhao, Yuhong Guo, Haifeng Shen, Jieping Ye | N/A | |
| Table Structure Recognition using Top-Down and Bottom-Up Cues | Sachin Raja, Ajoy Mondal, C V Jawahar | N/A | |
| Novel View Synthesis on Unpaired Data by Conditional Deformable Variational Auto-Encoder | Mingyu Yin, Li Sun, Qingli Li | N/A | |
| Beyond the Nav-Graph: Vision-and-Language Navigation in Continuous Environments | Jacob Krantz, Erik Wijmans, Arjun Majumdar, Dhruv Batra, Stefan Lee | N/A | |
| Boundary Content Graph Neural Network for Temporal Action Proposal Generation | Yueran Bai, Yingying Wang, Yunhai Tong, Yang Yang, Qiyue Liu, Junhui Liu | N/A | |
| Pose Augmentation: Class-agnostic Object Pose Transformation for Object Recognition | Yunhao Ge, Jiaping Zhao, Laurent Itti | N/A | |
| VLANet: Video-Language Alignment Network for Weakly-Supervised Video Moment Retrieval | Minuk Ma, Sunjae Yoon, Junyeong Kim, Youngjoon Lee, Sunghun Kang, Chang D. Yoo | N/A | |
| Attention-Based Query Expansion Learning | Albert Gordo, Filip Radenovic, Tamara Berg | N/A | |
| Interpretable Foreground Object Search As Knowledge Distillation | Boren Li, Po-Yu Zhuang, Jian Gu, Mingyang Li, Ping Tan | N/A | |
| Improving Knowledge Distillation via Category Structure | Zailiang Chen, Xianxian Zheng, Hailan Shen, Ziyang Zeng, Yukun Zhou, Rongchang Zhao | N/A | |
| High Resolution Zero-Shot Domain Adaptation of Synthetically Rendered Face Images | Stephan J. Garbin, Marek Kowalski, Matthew Johnson, Jamie Shotton | N/A | |
| Attentive Prototype Few-shot Learning with Capsule Network-based Embedding | Fangyu Wu, Jeremy S. Smith, Wenjin Lu, Chaoyi Pang, Bailing Zhang | N/A | |
| Weakly Supervised Instance Segmentation by Learning Annotation Consistent Instances | Aditya Arun, C.V. Jawahar, M. Pawan Kumar | N/A | |
| DA4AD: End-to-End Deep Attention-based Visual Localization for Autonomous Driving | Yao Zhou, Guowei Wan, Shenhua Hou, Li Yu, Gang Wang, Xiaofei Rui, Shiyu Song | N/A | |
| Visual-Relation Conscious Image Generation from Structured-Text | Duc Minh Vo, Akihiro Sugimoto | N/A | |
| Patch-wise Attack for Fooling Deep Neural Network | Lianli Gao, Qilong Zhang, Jingkuan Song, Xianglong Liu, Heng Tao Shen | N/A | |
| Feature Pyramid Transformer | Dong Zhang, Hanwang Zhang, Jinhui Tang, Meng Wang, Xiansheng Hua, Qianru Sun | N/A | |
| MABNet: A Lightweight Stereo Network Based on Multibranch Adjustable Bottleneck Module | Jiabin Xing, Zhi Qi, Jiying Dong, Jiaxuan Cai, Hao Liu | N/A | |
| Guided Saliency Feature Learning for Person Re-identification in Crowded Scenes | Lingxiao He, Wu Liu | N/A | |
| Asymmetric Two-Stream Architecture for Accurate RGB-D Saliency Detection | Miao Zhang, Sun Xiao Fei, Jie Liu, Shuang Xu, Yongri Piao, Huchuan Lu | N/A | |
| Explaining Image Classifiers using Statistical Fault Localization | Youcheng Sun, Hana Chockler, Xiaowei Huang, Daniel Kroening | N/A | |
| Deep Graph Matching via Blackbox Differentiation of Combinatorial Solvers | Michal Rolínek, Paul Swoboda, Dominik Zietlow, Anselm Paulus, Vít Musil, Georg Martius | N/A | |
| Learning Video Representations by Transforming Time | Simon Jenni, Givi Meishvili, Paolo Favaro | N/A | |
| Unsupervised Monocular Depth Estimation for Night-time Images using Adversarial Domain Feature Adaptation | Madhu Vankadari, Sourav Garg, Anima Majumder, Swagat Kumar, Ardhendu Behera | N/A | |
| Variational Connectionist Temporal Classification | Linlin Chao, Jingdong Chen, Wei Chu | N/A | |
| End-to-end Dynamic Matching Network for Multi-view Multi-person 3d Pose Estimation | Congzhentao Huang, Shuai Jiang, Yang Li, Ziyue Zhang, Jason Traish, Chen Deng, Sam Ferguson, Richard Yi Da Xu | N/A | |
| Orderly Disorder in Point Cloud Domain | Morteza Ghahremani, Bernard Tiddeman, Yonghuai Liu, Ardhendu Behera | N/A | |
| Deep Decomposition Learning for Inverse Imaging Problems | Dongdong Chen, Mike E. Davies | N/A | |
| FLOT: Scene Flow on Point Clouds guided by Optimal Transport | Gilles Puy, Alexandre Boulch, Renaud Marlet | N/A | |
| Accurate Reconstruction of Oriented 3D Points using Affine Correspondences | Carolina Raposo, Joao P. Barreto | N/A | |
| Volumetric Transformer Networks | Seungryong Kim, Sabine Süsstrunk, Mathieu Salzmann | N/A | |
| 360° Camera Alignment via Segmentation | Benjamin Davidson, Mohsan S. Alvi, João F. Henriques | N/A | |
| A Novel Line Integral Transform for 2D Affine-Invariant Shape Retrieval | Bin Wang, Yongsheng Gao | N/A | |
| Explanation-based Weakly-supervised Learning of Visual Relations with Graph Networks | Federico Baldassarre, Kevin Smith, Josephine Sullivan, Hossein Azizpour | N/A | |
| Guided Semantic Flow | Sangryul Jeon, Dongbo Min, Seungryong Kim, Jihwan Choe, Kwanghoon Sohn | N/A | |
| Document Structure Extraction using Prior based High Resolution Hierarchical Semantic Segmentation | Mausoom Sarkar, Milan Aggarwal, Arneh Jain, Hiresh Gupta, Balaji Krishnamurthy | N/A | |
| Measuring the Importance of Temporal Features in Video Saliency | Matthias Tangemann, Matthias Kümmerer, Thomas S.A. Wallis, Matthias Bethge | N/A | |
| Searching Efficient 3D Architectures with Sparse Point-Voxel Convolution | Haotian Tang, Zhijian Liu, Shengyu Zhao, Yujun Lin, Ji Lin, Hanrui Wang, Song Han | N/A | |
| Towards Reliable Evaluation of Algorithms for Road Network Reconstruction from Aerial Images | Leonardo Citraro, Mateusz Koziński, Pascal Fua | N/A | |
| Online Continual Learning under Extreme Memory Constraints | Enrico Fini, Stéphane Lathuilière, Enver Sangineto, Moin Nabi, Elisa Ricci | N/A | |
| Learning to Cluster under Domain Shift | Willi Menapace, Stéphane Lathuilière, Elisa Ricci | N/A | |
| Defense Against Adversarial Attacks via Controlling Gradient Leaking on Embedded Manifolds | Yueru Li, Shuyu Cheng, Hang Su, Jun Zhu | N/A | |
| Improving Optical Flow on a Pyramid Level | Markus Hofinger, Samuel Rota Bulò, Lorenzo Porzi, Arno Knapitsch, Thomas Pock, Peter Kontschieder | N/A | |
| Procrustean Regression Networks: Learning 3D Structure of Non-Rigid Objects from 2D Annotations | Sungheon Park, Minsik Lee, Nojun Kwak | N/A | |
| Learning to Learn Parameterized Classification Networks for Scalable Input Images | Duo Li, Anbang Yao, Qifeng Chen | N/A | |
| Stereo Event-based Particle Tracking Velocimetry for 3D Fluid Flow Reconstruction | Yuanhao Wang, Ramzi Idoughi, Wolfgang Heidrich | N/A | |
| Simplicial Complex based Point Correspondence between Images warped onto Manifolds | Charu Sharma, Manohar Kaul | N/A | |
| Representation Learning on Visual-Symbolic Graphs for Video Understanding | Effrosyni Mavroudi, Benjamín Béjar Haro, René Vidal | N/A | |
| Distance-Normalized Unified Representation for Monocular 3D Object Detection | Xuepeng Shi, Zhixiang Chen, Tae-Kyun Kim | N/A | |
| Sequential Deformation for Accurate Scene Text Detection | Shanyu Xiao, Liangrui Peng, Ruijie Yan, Keyu An, Gang Yao, Jaesik Min | N/A | |
| Where to Explore Next? ExHistCNN for History-aware Autonomous 3D Exploration | Yiming Wang, Alessio Del Bue | N/A | |
| Semi-Supervised Segmentation based on Error-Correcting Supervision | Robert Mendel, Luis Antonio de Souza Jr, David Rauber, João Paulo Papa, Christoph Palm | N/A | |
| Quantum-soft QUBO Suppression for Accurate Object Detection | Junde Li, Swaroop Ghosh | N/A | |
| Label-similarity Curriculum Learning | Ürün Dogan, Aniket Anand Deshmukh, Marcin Bronislaw Machura, Christian Igel | N/A | |
| Recurrent Image Annotation With Explicit Inter-Label Dependencies | Ayushi Dutta, Yashaswi Verma, C.V. Jawahar | N/A | |
| Cross-Attention in Coupled Unmixing Nets for Unsupervised Hyperspectral Super-Resolution | Jing Yao, Danfeng Hong, Jocelyn Chanussot, Deyu Meng, Xiaoxiang Zhu, Zongben Xu | N/A | |
| SimPose: Effectively Learning DensePose and Surface Normals of People from Simulated Data | Tyler Zhu, Per Karlsson, Christoph Bregler | N/A | |
| ByeGlassesGAN: Identity Preserving Eyeglasses Removal for Face Images | Yu-Hui Lee, Shang-Hong Lai | N/A | |
| Differentiable Joint Pruning and Quantization for Hardware Efficiency | Ying Wang, Yadong Lu, Tijmen Blankevoort | N/A | |
| Learning to Generate Customized Dynamic 3D Facial Expressions | Rolandos Alexandros Potamias, Jiali Zheng, Stylianos Ploumpis, Giorgos Bouritsas, Evangelos Ververas, Stefanos Zafeiriou | N/A | |
| LandscapeAR: Large Scale Outdoor Augmented Reality by Matching Photographs with Terrain Models Using Learned Descriptors | Jan Brejcha, Michal Lukáč, Yannick Hold-Geoffroy, Oliver Wang, Martin Čadík | N/A | |
| Learning Disentangled Feature Representation for Hybrid-distorted Image Restoration | Xin Li, Xin Jin, Jianxin Lin, Sen Liu, Yaojun Wu, Tao Yu, Wei Zhou, Zhibo Chen | N/A | |
| Jointly De-biasing Face Recognition and Demographic Attribute Estimation | Sixue Gong, Xiaoming Liu, Anil K. Jain | N/A | |
| Regularized Loss for Weakly Supervised Single Class Semantic Segmentation | Olga Veksler | N/A | |
| Spike-FlowNet: Event-based Optical Flow Estimation with Energy-Efficient Hybrid Neural Networks | Chankyu Lee, Adarsh Kumar Kosta, Alex Zihao Zhu, Kenneth Chaney, Kostas Daniilidis, Kaushik Roy | N/A | |
| Forgetting Outside the Box: Scrubbing Deep Networks of Information Accessible from Input-Output Observations | Aditya Golatkar, Alessandro Achille, Stefano Soatto | N/A | |
| Inherent Adversarial Robustness of Deep Spiking Neural Networks: Effects of Discrete Input Encoding and Non-Linear Activations | Saima Sharmin, Nitin Rathi, Priyadarshini Panda, Kaushik Roy | N/A | |
| Synthesizing Coupled 3D Face Modalities by Trunk-Branch Generative Adversarial Networks | Baris Gecer, Alexandros Lattas, Stylianos Ploumpis, Jiankang Deng, Athanasios Papaioannou, Stylianos Moschoglou, Stefanos Zafeiriou | N/A | |
| Learning to Learn Words from Visual Scenes | Dídac Surís, Dave Epstein, Heng Ji, Shih-Fu Chang, Carl Vondrick | N/A | |
| On Transferability of Histological Tissue Labels in Computational Pathology | Mahdi S. Hosseini, Lyndon Chan, Weimin Huang, Yichen Wang, Danial Hasan, Corwyn Rowsell, Savvas Damaskinos, Konstantinos N. Plataniotis | N/A | |
| Learning Actionness via Long-range Temporal Order Verification | Dimitri Zhukov, Jean-Baptiste Alayrac, Ivan Laptev, Josef Sivic | N/A | |
| Fully Embedding Fast Convolutional Networks on Pixel Processor Arrays | Laurie Bose, Piotr Dudek, Jianing Chen, Stephen J. Carey, Walterio W. Mayol-Cuevas | N/A | |
| Character Region Attention For Text Spotting | Youngmin Baek, Seung Shin, Jeonghun Baek, Sungrae Park, Junyeop Lee, Daehyun Nam, Hwalsuk Lee | N/A | |
| Stable Low-rank Tensor Decomposition for Compression of Convolutional Neural Network | Anh-Huy Phan, Konstantin Sobolev, Konstantin Sozykin, Dmitry Ermilov, Julia Gusak, Petr Tichavský, Valeriy Glukhov, Ivan Oseledets, Andrzej Cichocki | N/A | |
| Dual Mixup Regularized Learning for Adversarial Domain Adaptation | Yuan Wu, Diana Inkpen, Ahmed El-Roby | N/A | |
| Robust and On-the-fly Dataset Denoising for Image Classification | Jiaming Song, Yann Dauphin, Michael Auli, Tengyu Ma | N/A | |
| Imaging Behind Occluders Using Two-Bounce Light | Connor Henley, Tomohiro Maeda, Tristan Swedish, Ramesh Raskar | N/A | |
| Improving Object Detection with Selective Self-Supervised Self-Training | Yandong Li, Di Huang, Danfeng Qin, Liqiang Wang, Boqing Gong | N/A | |
| Deep Local Shapes: Learning Local SDF Priors for Detailed 3D Reconstruction | Rohan Chabra, Jan E. Lenssen, Eddy Ilg, Tanner Schmidt, Julian Straub, Steven Lovegrove, Richard Newcombe | N/A | |
| Info3D: Representation Learning on 3D Objects using Mutual Information Maximization and Contrastive Learning | Aditya Sanghi | N/A | |
| Adversarial Data Augmentation via Deformation Statistics | Sahin Olut, Zhengyang Shen, Zhenlin Xu, Samuel Gerber, Marc Niethammer | N/A | |
| Neural Predictor for Neural Architecture Search | Wei Wen, Hanxiao Liu, Yiran Chen, Hai Li, Gabriel Bender, Pieter-Jan Kindermans | N/A | |
| Learning Permutation Invariant Representations using Memory Networks | Shivam Kalra, Mohammed Adnan, Graham Taylor, H.R. Tizhoosh | N/A | |
| Feature Space Augmentation for Long-Tailed Data | Peng Chu, Xiao Bian, Shaopeng Liu, Haibin Ling | N/A | |
| Laying the Foundations of Deep Long-Term Crowd Flow Prediction | Samuel S. Sohn, Honglu Zhou, Seonghyeon Moon, Sejong Yoon, Vladimir Pavlovic, Mubbasir Kapadia | N/A | |
| Weakly-Supervised Action Localization with Expectation-Maximization Multi-Instance Learning | Zhekun Luo, Devin Guillory, Baifeng Shi, Wei Ke, Fang Wan, Trevor Darrell, Huijuan Xu | N/A | |
| Fairness by Learning Orthogonal Disentangled Representations | Mhd Hasan Sarhan, Nassir Navab, Abouzar Eslami, Shadi Albarqouni | N/A | |
| Self-supervision with Superpixels: Training Few-shot Medical Image Segmentation without Annotation | Cheng Ouyang, Carlo Biffi, Chen Chen, Turkay Kart, Huaqi Qiu, Daniel Rueckert | N/A | |
| On Diverse Asynchronous Activity Anticipation | He Zhao, Richard P. Wildes | N/A | |
| Representative-Discriminative Learning for Open-set Land Cover Classification of Satellite Imagery | Razieh Kaviani Baghbaderani, Ying Qu, Hairong Qi, Craig Stutts | N/A | |
| Structure-Aware Human-Action Generation | Ping Yu, Yang Zhao, Chunyuan Li, Junsong Yuan, Changyou Chen | N/A | |
| Towards Efficient Coarse-to-Fine Networks for Action and Gesture Recognition | Niamul Quader, Juwei Lu, Peng Dai, Wei Li | N/A | |
| S³Net: Semantic-Aware Self-supervised Depth Estimation with Monocular Videos and Synthetic Data | Bin Cheng, Inderjot Singh Saggu, Raunak Shah, Gaurav Bansal, Dinesh Bharadia | N/A | |
| Leveraging Seen and Unseen Semantic Relationships for Generative Zero-Shot Learning | Maunil R Vyas, Hemanth Venkateswara, Sethuraman Panchanathan | N/A | |
| Weight Excitation: Built-in Attention Mechanisms in Convolutional Neural Networks | Niamul Quader, Md Mafijul Islam Bhuiyan, Juwei Lu, Peng Dai, Wei Li | N/A | |
| UNITER: UNiversal Image-TExt Representation Learning | Yen-Chun Chen, Linjie Li, Licheng Yu, Ahmed El Kholy, Faisal Ahmed, Zhe Gan, Yu Cheng, Jingjing Liu | N/A | |
| Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks | Xiujun Li, Xi Yin, Chunyuan Li, Pengchuan Zhang, Xiaowei Hu, Lei Zhang, Lijuan Wang, Houdong Hu, Li Dong, Furu Wei, Yejin Choi, Jianfeng Gao | N/A | |
| Improving Face Recognition from Hard Samples via Distribution Distillation Loss | Yuge Huang, Pengcheng Shen, Ying Tai, Shaoxin Li, Xiaoming Liu, Jilin Li, Feiyue Huang, Rongrong Ji | N/A | |
| Extract and Merge: Superpixel Segmentation with Regional Attributes | Jianqiao An, Yucheng Shi, Yahong Han, Meijun Sun, Qi Tian | N/A | |
| Spatial-Adaptive Network for Single Image Denoising | Meng Chang, Qi Li, Huajun Feng, Zhihai Xu | N/A | |
| Physics-based Feature Dehazing Networks | Jiangxin Dong, Jinshan Pan | N/A | |
| Learning Surrogates via Deep Embedding | Yash Patel, Tomáš Hodaň, Jiří Matas | N/A | |
| An Asymmetric Modeling for Action Assessment | Jibin Gao, Wei-Shi Zheng, Jia-Hui Pan, Chengying Gao, Yaowei Wang, Wei Zeng, Jianhuang Lai | N/A | |
| High-quality Single-model Deep Video Compression with Frame-Conv3D and Multi-frame Differential Modulation | Wenyu Sun, Chen Tang, Weigui Li, Zhuqing Yuan, Huazhong Yang, Yongpan Liu | N/A | |
| Instance-Aware Embedding for Point Cloud Instance Segmentation | Tong He, Yifan Liu, Chunhua Shen, Xinlong Wang, Changming Sun | N/A | |
| Self-Paced Deep Regression Forests with Consideration on Underrepresented Examples | Lili Pan, Shijie Ai, Yazhou Ren, Zenglin Xu | N/A | |
| Manifold Projection for Adversarial Defense on Face Recognition | Jianli Zhou, Chao Liang, Jun Chen | N/A | |
| Weakly Supervised Learning with Side Information for Noisy Labeled Images | Lele Cheng, Xiangzeng Zhou, Liming Zhao, Dangwei Li, Hong Shang, Yun Zheng, Pan Pan, Yinghui Xu | N/A | |
| Not only Look, but also Listen: Learning Multimodal Violence Detection under Weak Supervision | Peng Wu, Jing Liu, Yujia Shi, Yujia Sun, Fangtao Shao, Zhaoyang Wu, Zhiwei Yang | N/A | |
| SNE-RoadSeg: Incorporating Surface Normal Information into Semantic Segmentation for Accurate Freespace Detection | Rui Fan, Hengli Wang, Peide Cai, Ming Liu | N/A | |
| Modeling the Space of Point Landmark Constrained Diffeomorphisms | Chengfeng Wen, Yang Guo, Xianfeng Gu | N/A | |
| PieNet: Personalized Image Enhancement Network | Han-Ul Kim, Young Jun Koh, Chang-Su Kim | N/A | |
| Rotational Outlier Identification in Pose Graphs Using Dual Decomposition | Arman Karimian, Ziqi Yang, Roberto Tron | N/A | |
| Speech-driven Facial Animation using Cascaded GANs for Learning of Motion and Texture | Dipanjan Das, Sandika Biswas, Sanjana Sinha, Brojeshwar Bhowmick | N/A | |
| Solving Phase Retrieval with a Learned Reference | Rakib Hyder, Zikui Cai, M. Salman Asif | N/A | |
| Dual Grid Net: Hand Mesh Vertex Regression from Single Depth Maps | Chengde Wan, Thomas Probst, Luc Van Gool, Angela Yao | N/A |
ECCV 2022
| Title | Author | PDF_Link | Code_URL |
|---|---|---|---|
| Contrastive Deep Supervision | Linfeng Zhang (Tsinghua University)*; Xin Chen (Intel Corp.); Junbo Zhang (Tsinghua University); Runpei Dong (Xi’an Jiaotong University); Kaisheng Ma (Tsinghua University) | N/A | N/A |
| Towards Grand Unification of Object Tracking | Bin Yan (Dalian University of Technology)*; Yi Jiang (Bytedance); Peize Sun (The University of Hong Kong); Dong Wang (Dalian University of Technology); Zehuan Yuan (Bytedance.Inc); Ping Luo (The University of Hong Kong); Huchuan Lu (Dalian University of Technology) | N/A | N/A |
| SeqFormer: Sequential Transformer for Video Instance Segmentation | Junfeng Wu (Huazhong University of Science and Technology); Yi Jiang (Bytedance); Song Bai (University of Oxford); Wenqing Zhang (Huazhong University of Science and Technology); Xiang Bai (Huazhong University of Science and Technology)* | N/A | N/A |
| Estimating Spatially-Varying Lighting in Urban Scenes with Disentangled Representation | Jiajun Tang (Peking University); Yongjie Zhu (Beijing University of Posts and Telecommunications); Haoyu Wang (Peking University); Jun Hoong Chan (Peking University); Si Li (Beijing University of Posts and Telecommunications); Boxin Shi (Peking University)* | N/A | N/A |
| In Defense of Online Models for Video Instance Segmentation | Junfeng Wu (Huazhong University of Science and Technology); Qihao Liu (Johns Hopkins University); Yi Jiang (Bytedance); Song Bai (University of Oxford); Alan Yuille (Johns Hopkins University); Xiang Bai (Huazhong University of Science and Technology)* | N/A | N/A |
| HuMMan: Multi-Modal 4D Human Dataset for Versatile Sensing and Modeling | Zhongang Cai (SenseTime International Pte Ltd)*; Daxuan Ren (Nanyang Technological University); Ailing Zeng (The Chinese University of Hong Kong); Zhengyu Lin (SenseTime); Tao Yu (Tsinghua University); Wenjia Wang (SenseTime); Xiangyu Fan (Sensetime); Yang Gao (Sensetime); Yifan Yu (ETH Zurich); Liang Pan (Nanyang Technological University); Fangzhou Hong (Nanyang Technological University); Mingyuan Zhang (Nanyang Technological University); Chen Change Loy (Nanyang Technological University); Lei Yang (Sensetime Group Limited); Ziwei Liu (Nanyang Technological University) | N/A | N/A |
| Graph R-CNN: Towards Accurate 3D Object Detection with Semantic-Decorated Local Graph | Honghui Yang (Zhejiang University)*; Zili Liu (ZJU); Xiaopei Wu (Zhejiang University); Wenxiao Wang (State Key Lab of CAD&CG, Zhejiang University); Wei Qian (Fabu Inc.); Xiaofei He (Zhejiang University); Deng Cai (ZJU) | N/A | N/A |
| PointScatter: Point Set Representation for Tubular Structure Extraction | Dong Wang (Peking University)*; Zhao Zhang (Peking University); Ziwei Zhao (Peking University); Yuhang Liu (Yizhun Medical AI Co., Ltd); Yihong Chen (Peking University); Liwei Wang (Peking University) | N/A | N/A |
| D&D: Learning Human Dynamics from Dynamic Camera | Jiefeng Li (Shanghai Jiao Tong University)*; Siyuan Bian (Shanghai Jiao Tong University); Chao Xu (Tencent); Gang Liu (Tencent inc.); Gang Yu (Tencent); Cewu Lu (Shanghai Jiao Tong University) | N/A | N/A |
| On Mitigating Hard Clusters for Face Clustering | Yingjie Chen (Peking University); Huasong Zhong (Damo Academy, Alibaba Group); Chong Chen (Alibaba Group)*; Chen Shen (Alibaba Group); Jianqiang Huang (Damo Academy, Alibaba Group); Tao Wang (Peking University); Yun Liang (Peking University); Qianru Sun (Singapore Management University) | N/A | N/A |
| Recurrent Bilinear Optimization for Binary Neural Networks | Sheng Xu (Beihang University)*; Yanjing Li (Beihang University); Tiancheng Wang (Beihang University); Teli Ma (Shanghai Artificial Intelligence Laboratory); Baochang Zhang (Beihang University); Peng Gao (Chinese University of Hong Kong); Yu Qiao (Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences); Jinhu Lu (Beihang University, Beijing, China); Guodong Guo (IDL, Baidu Research) | N/A | N/A |
| Particle Video Revisited: Tracking Through Occlusions Using Point Trajectories | Adam Harley (Carnegie Mellon University)*; Zhaoyuan Fang (Carnegie Mellon University); Katerina Fragkiadaki (Carnegie Mellon University) | N/A | N/A |
| Open-Set Semi-Supervised Object Detection | Yen-Cheng Liu (Georgia Institute of Technology)*; Chih-Yao Ma (Facebook); Xiaoliang Dai (Facebook); Junjiao Tian (Georgia Institute of Technology); Peter Vajda (Facebook); Zijian He (Facebook); Zsolt Kira (Georgia Institute of Technology) | N/A | N/A |
| Semantic-Aware Implicit Neural Audio-Driven Video Portrait Generation | Xian Liu (The Chinese University of Hong Kong)*; Yinghao Xu (Chinese University of Hong Kong); Qianyi Wu (Monash University); Hang Zhou (The Chinese University of Hong Kong); Wayne Wu (SenseTime Research); Bolei Zhou (UCLA) | N/A | N/A |
| Long-tail Detection with Effective Class-Margins | Jang Hyun Cho (The University of Texas at Austin)*; Philipp Kraehenbuehl (UT Austin) | N/A | N/A |
| SeqTR: A Simple yet Universal Network for Visual Grounding | Chaoyang Zhu (Xiamen University)*; Yiyi Zhou (Xiamen University); Yunhang Shen (Xiamen University); Gen Luo (Xiamen University); Xingjia Pan (Momenta.ai); Mingbao Lin (Xiamen University, China); Chao Chen (Youtu Laboratory); Liujuan Cao (Xiamen University); Xiaoshuai Sun (Xiamen University); Rongrong Ji (Xiamen University, China) | N/A | N/A |
| ECLIPSE: Efficient Long-range Video Retrieval using Sight and Sound | Yan-Bo Lin (UNC Chapel Hill)*; Jie Lei (UNC Chapel Hill); Mohit Bansal (University of North Carolina at Chapel Hill); Gedas Bertasius (UNC Chapel Hill) | N/A | N/A |
| KING: Generating Safety-Critical Driving Scenarios for Robust Imitation via Kinematics Gradients | Niklas Hanselmann (Mercedes-Benz AG)*; Katrin Renz (University of Tuebingen); Kashyap Chitta (MPI-IS and University of Tuebingen); Apratim Bhattacharyya (Max Planck Institute for Informatics); Andreas Geiger (University of Tuebingen) | N/A | N/A |
| Extract Free Dense Labels from CLIP | Chong Zhou (Nanyang Technological University)*; Chen Change Loy (Nanyang Technological University); Bo Dai (Shanghai AI Lab) | N/A | N/A |
| Frequency Domain Model Augmentation for Adversarial Attack | Yuyang Long (University of Electronic Science and Technology of China)*; Qilong Zhang (University of Electronic Science and Technology of China); Boheng Zeng (University of Electronic Science and Technology of China); Lianli Gao (The University of Electronic Science and Technology of China); Xianglong Liu (BUAA); Jian Zhang (College of Computer Science and Electronic Engineering, HNU); Jingkuan Song (UESTC) | N/A | N/A |
| Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors | Oran Gafni (Meta AI Research)*; Adam Polyak (Facebook); Oron Ashual (Facebook AI Research); Shelly Sheynin (Meta); Devi Parikh (Georgia Tech & Facebook AI Research); Yaniv Taigman (Facebook) | N/A | N/A |
| Weakly Supervised Grounding for VQA in Vision-Language Transformers | Aisha Urooj (University of Central Florida)*; Hilde Kuehne (University of Frankfurt); Chuang Gan (MIT-IBM Watson AI Lab); Niels da Vitoria Lobo (University of Central Florida); Mubarak Shah (University of Central Florida) | N/A | N/A |
| Practical and Scalable Desktop-based High-Quality Facial Capture | Alexandros Lattas (Imperial College London)*; Yiming Lin (Imperial college); Jayanth Kannan (Lumirithmic); Ekin Ozturk (Imperial College London); Luca Filipi (Lumirithmic); Giuseppe Claudio Guarnera (University of York); Gaurav Chawla (Lumirithmic Limited); Abhijeet Ghosh (Imperial College London) | N/A | N/A |
| Tracking Objects as Pixel-wise Distributions | Zelin Zhao (The Chinese University of Hong Kong)*; Ze Wu (Megvii); Yueqing Zhuang (Megvii Inc Company); Boxun Li (Megvii Inc.); Jiaya Jia (Chinese University of Hong Kong) | N/A | N/A |
| CMD: Self-supervised 3D Action Representation Learning with Cross-modal Mutual Distillation | Yunyao Mao (University of Science and Technology of China)*; Wengang Zhou (University of Science and Technology of China); Zhenbo Lu (Institute of Artificial Intelligence, Hefei Comprehensive National Science Center); Jiajun Deng (University of Science and Technology of China); Houqiang Li (University of Science and Technology of China) | N/A | N/A |
| Open-Vocabulary DETR with Conditional Matching | Yuhang Zang (Nanyang Technological University)*; Wei Li (Nanyang Technological University); Kaiyang Zhou (Nanyang Technological University); Chen Huang (Apple); Chen Change Loy (Nanyang Technological University) | N/A | N/A |
| Towards Calibrated Hyper-sphere Representation via Distribution Overlap Coefficient for Long-tailed Learning | Hualiang Wang (Zhejiang University)*; Siming FU (Zhejiang University); Xiaoxuan He (Zhejiang University); Hangxiang Fang (Zhejiang University); Zuozhu Liu (Zhejiang-UIUC Institute); Haoji Hu (Zhejiang University, China) | N/A | N/A |
| FBNet: Feedback Network for Point Cloud Completion | Xuejun Yan (Hikvision Research Institute)*; Hongyu Yan (Sichuan University); Jingjing Wang (Hikvision Research Institute); Hang Du (Hikvision Research Institute); Zhihong Wu (Sichuan University); Di Xie (Hikvision Research Institute); Shiliang Pu (Hikvision Research Institute); Li Lu (Sichuan University) | N/A | N/A |
| Physically-Based Editing of Indoor Scene Lighting from a Single Image | Zhengqin Li (Meta)*; Jia Shi (Carnegie Mellon University); Sai Bi (Adobe Research); Rui Zhu (University of California San Diego); Kalyan Sunkavalli (Adobe Research); Milos Hasan (Adobe Research); Zexiang Xu (Adobe Research); Ravi Ramamoorthi (University of California San Diego); Manmohan Chandraker (UC San Diego) | N/A | N/A |
| GLASS: Global to Local Attention for Scene-Text Spotting | Roi Ronen (Technion)*; Shahar Tsiper (Amazon); Oron Anschel (AWS); Inbal Lavi (Amazon); Amir Markovitz (Amazon); R. Manmatha (Amazon) | N/A | N/A |
| Drive&Segment: Unsupervised Semantic Segmentation of Urban Scenes via Cross-modal Distillation | Antonin Vobecky (Czech Technical University in Prague)*; David Hurych (Valeo.ai); Oriane Siméoni (valeo.ai); Spyros Gidaris (valeo.ai); Andrei Bursuc (valeo.ai); Patrick Pérez (Valeo.ai); Josef Sivic (Czech Technical University) | N/A | N/A |
| Expanding Language-Image Pretrained Models for General Video Recognition | Bolin Ni (Institute of Automation, Chinese Academy of Sciences); Houwen Peng (Microsoft Research)*; Minghao Chen (Stony Brook University); Songyang Zhang (University of Rochester); Gaofeng Meng (Chinese Academy of Sciences); Jianlong Fu (Microsoft Research); SHIMING XIANG (Chinese Academy of Sciences, China); Haibin Ling (Stony Brook University) | N/A | N/A |
| Box2Mask: Weakly Supervised 3D Semantic Instance Segmentation Using Bounding Boxes | Julian Chibane (Max Planck Institute for Informatics, University of Wuerzburg)*; Francis Engelmann (ETH AI Center); Anh Tuan Tran (Max Planck Institute for Informatics, Saarland University); Gerard Pons-Moll (University of Tübingen) | N/A | N/A |
| Pose-NDF: Modelling Human Pose Manifolds with Neural Distance Fields | Garvita Tiwari (MPI-INF, University of Tübingen)*; Dimitrije Antic (University of Tuebingen); Jan E. Lenssen (TU Dortmund); Nikolaos Sarafianos (Facebook Reality Labs); Tony Tung (Facebook Reality Labs); Gerard Pons-Moll (University of Tübingen) | N/A | N/A |
| Multimodal Object Detection via Probabilistic Ensembling | Yi-Ting Chen (University of Maryland); Jinghao Shi (Carnegie Mellon University); Zelin Ye (CMU); Mertz Christoph (CMU); Deva Ramanan (Carnegie Mellon University); Shu Kong (Carnegie Mellon University)* | N/A | N/A |
| CenterFormer: Center-based Transformer for 3D Object Detection | Zixiang Zhou (University of Central Florida)*; Xiangchen Zhao (Tusimple); Yu Wang (Tusimple); Panqu Wang (TuSimple, Inc); Hassan Foroosh (University of Central Florida) | N/A | N/A |
| Revisiting a kNN-based Image Classification System with High-capacity Storage | Kengo Nakata (Kioxia Corporation)*; Youyang Ng (Kioxia Corporation); Daisuke Miyashita (Kioxia Corporation); Asuka Maki (Kioxia Corporation); Yu-Chieh Lin (Kioxia Corporation); Jun Deguchi (Kioxia Corporation) | N/A | N/A |
| TransFGU: A Top-down Approach to Fine-Grained Unsupervised Semantic Segmentation | Zhaoyuan Yin (Hunan University)*; Pichao Wang (Alibaba Group); Fan Wang (Alibaba Group); Xianzhe Xu (Alibaba Group); Hanling Zhang (Hunan University); Hao Li (Alibaba Group); Rong Jin (Alibaba Group) | N/A | N/A |
| VQFR: Blind Face Restoration with Vector-Quantized Dictionary and Parallel Decoder | Yu-Chao Gu (Nankai University)*; Xintao Wang (Tencent); Liangbin Xie (Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, China); Chao Dong (SIAT); Gen LI (Tencent); Ying Shan (Tencent); Ming-Ming Cheng (Nankai University) | N/A | N/A |
| CLIFF: Carrying Location Information in Full Frames into Human Pose and Shape Estimation | Zhihao Li (Huawei Noah’s Ark Lab)*; Jianzhuang Liu (Huawei Noah’s Ark Lab); Zhensong Zhang (Huawei Noah’s Ark Lab); Songcen Xu (Huawei Noah’s Ark Lab); Youliang Yan (Huawei Noah’s Ark Lab) | N/A | N/A |
| Pointly-Supervised Panoptic Segmentation | Junsong Fan (Chinese Academy of Sciences, China)*; Zhaoxiang Zhang (Chinese Academy of Sciences, China); Tieniu Tan (NLPR, China) | N/A | N/A |
| Registration based Few-Shot Anomaly Detection | Chaoqin Huang (Shanghai Jiao Tong University)*; Haoyan Guan (King’s College London); Aofan Jiang (Shanghai Jiao Tong University); Ya Zhang (Cooperative Medianet Innovation Center, Shanghai Jiao Tong University); Michael W Spratling (King’s College London); Yan-Feng Wang (Cooperative medianet innovation center of Shanghai Jiao Tong University) | N/A | N/A |
| A Level Set Theory for Neural Implicit Evolution under Explicit Flows | Ishit Mehta (University of California San Diego)*; Manmohan Chandraker (UC San Diego); Ravi Ramamoorthi (University of California San Diego) | N/A | N/A |
| Improving Robustness by Enhancing Weak Subnets | Yong Guo (Max Planck Institute for Informatics)*; David Stutz (Max Planck Institute for Informatics); Bernt Schiele (MPI Informatics) | N/A | N/A |
| TO-Scene: A Large-scale Dataset for Understanding 3D Tabletop Scenes | Mutian Xu (The Chinese University of Hong Kong (Shenzhen))*; Pei Chen (the Chinese University of Hong Kong (Shenzhen)); Haolin Liu (The Chinese University of Hong Kong, Shenzhen); Xiaoguang Han (Shenzhen Research Institute of Big Data, the Chinese University of Hong Kong (Shenzhen)) | N/A | N/A |
| PersFormer: 3D Lane Detection via Perspective Transformer and the OpenLane Benchmark | Li Chen (Shanghai AI Laboratory)*; Chonghao Sima (Purdue University); Yang Li (SenseTime); Zehan Zheng (Shanghai AI Laboratory); Jiajie Xu (Carnegie Mellon University); Xiangwei Geng (SenseTime); Hongyang Li (SenseTime); Conghui He (Shanghai AI Lab); Jianping Shi (Sensetime Group Limited); Yu Qiao (Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences); Junchi Yan (Shanghai Jiao Tong University) | N/A | N/A |
| Language Matters: A Weakly Supervised Vision-Language Pre-training Approach for Scene Text Detection and Spotting | Chuhui Xue (Nanyang Technological University); Wenqing Zhang (ByteDance); Yu Hao (Bytedance Inc.); Shijian Lu (Nanyang Technological University); Philip Torr (University of Oxford); Song Bai (University of Oxford)* | N/A | N/A |
| Adaptive Patch Exiting for Scalable Single Image Super-Resolution | Shizun Wang (Beijing University of Posts and Telecommunications)*; Jiaming Liu (Peking University); Kaixin Chen (Beijing University of Posts and Telecommunications); Xiaoqi Li (Columbia university in the city of New york); Ming Lu (Intel Labs China); Yandong Guo (OPPO Research Institute) | N/A | N/A |
| Perceptual Artifacts Localization for Inpainting | Lingzhi Zhang (University of Pennsylvania)*; Yuqian Zhou (Adobe); Connelly Barnes (Adobe); Zhe Lin (Adobe Research); Eli Shechtman (Adobe Research, US); Sohrab Amirghodsi (Adobe Research); Jianbo Shi (University of Pennsylvania) | N/A | N/A |
| Adversarially-Aware Robust Object Detector | ZiYi Dong (Sun Yat-Sen University)*; Pengxu Wei (Sun Yat-sen University); Liang Lin (Sun Yat-sen University) | N/A | N/A |
| RFNet-4D: Joint Object Reconstruction and Flow Estimation from 4D Point Clouds | Tuan-Anh Vu (The Hong Kong University of Science and Technology)*; Thanh Nguyen (Deakin University, Australia); Binh-Son Hua (VinAI Research); Quang Hieu Pham (Woven Planet North America); Sai-Kit Yeung (Hong Kong University of Science and Technology) | N/A | N/A |
| Generalizable Patch-Based Neural Rendering | Mohammed Suhail (University of British Columbia)*; Carlos Esteves (Google Research); Leonid Sigal (University of British Columbia); Ameesh Makadia (Google Research) | N/A | N/A |
| A Perturbation-Constrained Adversarial Attack for Evaluating the Robustness of Optical Flow | Jenny Schmalfuss (University of Stuttgart)*; Philipp Scholze (University of Stuttgart); Andrés Bruhn (University of Stuttgart) | N/A | N/A |
| Contrastive Monotonic Pixel-Level Modulation | Kun Lu (Zhejiang University)*; Rongpeng Li (Zhejiang University); Honggang Zhang (Zhejiang University) | N/A | N/A |
| Social-SSL: Self-Supervised Cross-Sequence Representation Learning Based on Transformers for Multi-Agent Trajectory Prediction | Li-Wu Tsao (National Chiao Tung University)*; Yan-Kai Wang (National Chiao Tung University); Hao-Siang Lin (National Chiao Tung University); Hong-Han Shuai (National Yang Ming Chiao Tung University); Lai-Kuan Wong (Multimedia University); Wen-Huang Cheng (National Chiao Tung University) | N/A | N/A |
| SpOT: Spatiotemporal Modeling for 3D Object Tracking | Colton Stearns (Stanford University)*; Davis Rempe (Stanford University); Jie Li (Toyota Research Institute); Rareș A Ambruș (Toyota Research Institute); Sergey Zakharov (Toyota Research Institute); Vitor Guizilini (Toyota Research Institute); Yanchao Yang (Stanford University); Leonidas Guibas (Stanford University) | N/A | N/A |
| Toward Understanding WordArt: Corner-Guided Transformer for Scene Text Recognition | Xudong Xie (Huazhong University of Science and Technology)*; LING FU (Huazhong University of Science and Technology); Zhifei Zhang (Adobe Research); Zhaowen Wang (Adobe Research); Xiang Bai (Huazhong University of Science and Technology) | N/A | N/A |
| Monocular 3D Object Detection with Depth from Motion | Tai Wang (The Chinese University of Hong Kong)*; Jiangmiao Pang (CUHK); Dahua Lin (The Chinese University of Hong Kong) | N/A | N/A |
| Fine-Grained Scene Graph Generation with Data Transfer | Ao Zhang (National University of Singapore)*; Yuan Yao (Tsinghua University); qianyu chen (Tsinghua University); Wei Ji (National University of Singapore); Zhiyuan Liu (Tsinghua University); Maosong Sun (Tsinghua University); Tat-Seng Chua (National university of Singapore) | N/A | N/A |
| Balancing Stability and Plasticity through Advanced Null Space in Continual Learning | Yajing Kong (The University of Sydney)*; Liu Liu (The University of Sydney); Zhen Wang (The University of Sydney); Dacheng Tao (JD.com) | N/A | N/A |
| OccamNets: Mitigating Dataset Bias by Favoring Simpler Hypotheses | Robik S Shrestha (Rochester Institute of Technology)*; Kushal Kafle (Adobe Research); Christopher Kanan (University of Rochester) | N/A | N/A |
| DisCo: Remedying Self-supervised Learning on Lightweight Models with Distilled Contrastive Learning | Yuting Gao (Tencent)*; Jia-Xin Zhuang (Sun Yat-sen University); Shaohui Lin (East China Normal University); Hao Cheng (Tencent); Xing Sun (Shopee); Ke Li (Tencent); Chunhua Shen (University of Adelaide, Australia) | N/A | N/A |
| Diverse Human Motion Prediction Guided by Multi-Level Spatial-Temporal Anchors | Sirui Xu (University of Illinois Urbana-Champaign)*; Yu-Xiong Wang (University of Illinois at Urbana-Champaign); Liangyan Gui (University of Illinois Urbana-Champaign) | N/A | N/A |
| InfiniteNature-Zero: Learning Perpetual View Generation of Natural Scenes from Single Images | Zhengqi Li (Google Inc.)*; Qianqian Wang (Cornell); Noah Snavely (Google); Angjoo Kanazawa (University of California Berkeley) | N/A | N/A |
| CT^2: Colorization Transformer via Color Tokens | Shuchen Weng (Peking University)*; Jimeng Sun (Beijing University of Posts and Telecommunications); Yu Li (International Digital Economy Academy); Si Li (Beijing University of Posts and Telecommunications); Boxin Shi (Peking University) | N/A | N/A |
| PCW-Net: Pyramid Combination and Warping Cost Volume for Stereo Matching | Zhelun Shen (Baidu Research)*; Yuchao Dai (Northwestern Polytechnical University); Xibin Song (Baidu); ZhiBo Rao (Northwestern Polytechnical University); Dingfu Zhou (Baidu); Liangjun Zhang (Baidu Research Institute) | N/A | N/A |
| Discovering Transferable Forensic Features for CNN-generated Images Detection | Keshigeyan Chandrasegaran (Singapore University of Technology and Design)*; Ngoc-Trung Tran (Singapore University of Technology and Design); Alexander Binder (University of Oslo); Ngai-Man Cheung (Singapore University of Technology and Design) | N/A | N/A |
| Domain Adaptive Person Search | Junjie Li (Shanghai Jiao Tong University); Yichao Yan (Shanghai Jiao Tong University)*; Guanshuo Wang (Tencent Youtu Lab); Fufu Yu (Tencent Youtu); Qiong Jia (Tencent Youtu Lab); Shouhong Ding (Tencent) | N/A | N/A |
| Text2LIVE: Text-Driven Layered Image and Video Editing | Omer Bar Tal (Weizmann Institute of Science)*; Dolev Ofri-Amar (Weizmann Institute of Science); Rafail Fridman (Weizmann Institute of Science); Yoni Kasten (Weizmann Institute); Tali Dekel (Weizmann Institute of Science) | N/A | N/A |
| Event-Based Fusion for Motion Deblurring with Cross-modal Attention | Lei Sun (Zhejiang University); Christos Sakaridis (ETH Zurich); Jingyun Liang (ETH Zurich); Qi Jiang (Zhejiang University); Kailun Yang (Karlsruhe Institute of Technology); Peng Sun (Zhejiang University); Yaozu Ye (State Key Laboratory of Modern Optical Instrumentation, Zhejiang University); Kaiwei Wang (State Key Laboratory of Modern Optical Instrumentation, Zhejiang University)*; Luc Van Gool (ETH Zurich) | N/A | N/A |
| AutoMix: Unveiling the Power of Mixup | Zicheng Liu (Westlake University)*; Siyuan Li (Westlake University); di wu (Westlake University); Zihan Liu (Westlake University); Zhiyuan Chen (Shanghai AI Lab); Lirong Wu (Westlake University); Stan Z. Li (Westlake University) | N/A | N/A |
| Synergistic Self-Supervised and Quantization Learning | Yunhao Cao (Nanjing University)*; Peiqin Sun (MEGVII Technology); Yechang Huang (MEGVII Technology); Jianxin Wu (Nanjing University); Shuchang Zhou (MEGVII Technology) | N/A | N/A |
| Auto-regressive Image Synthesis with Integrated Quantization | Fangneng Zhan (Max Planck Institute for Informatics); Yingchen Yu (Nanyang Technological University); Rongliang WU (Nanyang Technological University); Jiahui Zhang (Nanyang Technological University); Kaiwen Cui (Nanyang Technological University); Changgong Zhang (Amazon); Shijian Lu (Nanyang Technological University)* | N/A | N/A |
| Event-guided Deblurring of Unknown Exposure Time Videos | Taewoo Kim (KAIST)*; Jeongmin Lee (KAIST); Lin Wang (HKUST); Kuk-Jin Yoon (KAIST) | N/A | N/A |
| Learning Disentanglement with Decoupled Labels for Vision-Language Navigation | Wenhao Cheng (Beijing Institute of Technology); Xingping Dong (Inception Institute of Artificial Intelligence); Salman Khan (MBZUAI/ANU); Jianbing Shen (Inception Institute of Artificial Intelligence)* | N/A | N/A |
| 3D CoMPaT: Composition of Materials on Parts of 3D Things | Yuchen Li (King Abdullah University of Science and Technology (KAUST)); Ujjwal Upadhyay (KAUST); Habib Slim (KAUST); Tezuesh Varshney (KAUST); Ahmed Abdelreheem (KAUST); Arpit Prajapati (Poly9); Suhail S Pothigara (Poly9 Inc); Peter Wonka (KAUST); Mohamed Elhoseiny (KAUST)* | N/A | N/A |
| Exploring Gradient-based Multi-directional Controls in GANs | Zikun Chen (ModiFace Inc. )*; Ruowei Jiang (ModiFace Inc.); Brendan Duke (ModiFace Inc); Han Zhao (University of Illinois at Urbana-Champaign); Parham Aarabi (ModiFace Inc.) | N/A | N/A |
| OPD: Single-view 3D Openable Part Detection | Hanxiao Jiang (Simon Fraser University)*; Yongsen Mao (Simon Fraser University); Manolis Savva (Simon Fraser University); Angel X Chang (SFU) | N/A | N/A |
| Unpaired Image Translation via Vector Symbolic Architectures | Justin Theiss (University of California, Berkeley)*; Jay Leverett (Meta); Daeil Kim (Meta); Aayush Prakash (Meta) | N/A | N/A |
| CCPL: Contrastive Coherence Preserving Loss for Versatile Style Transfer | Zijie Wu (Huazhong University of Science and Technology)*; Zhen Zhu (University of Illinois at Urbana-Champaign); Junping Du (Beijing University of Posts and Telecommunications); Xiang Bai (Huazhong University of Science and Technology) | N/A | N/A |
| Decoupled Adversarial Contrastive Learning for Self-supervised Adversarial Robustness | Chaoning Zhang (KAIST)*; Kang Zhang (KAIST); Chenshuang Zhang (KAIST); Axi Niu (Northwestern Polytechnical University ); Jiu Feng (Sichuan University); Chang D. Yoo (KAIST); In So Kweon (KAIST) | N/A | N/A |
| Secrets of Event-Based Optical Flow | Shintaro Shiba (Keio University)*; Yoshimitsu Aoki (Keio University); Guillermo Gallego (TU Berlin) | N/A | N/A |
| Synthesizing Light Field Video from Monocular Video | Shrisudhan Govindarajan (Indian Institute of Technology Madras); Prasan A Shedligeri (Indian Institute of Technology Madras)*; Sarah Sarah (Indian Institute of Technology, Madras); Kaushik Mitra (IIT Madras) | N/A | N/A |
| LESS: Label-Efficient Semantic Segmentation for LiDAR Point Clouds | Minghua Liu (UCSD)*; Yin Zhou (Waymo); Charles R. Qi (Waymo); Boqing Gong (Google); Hao Su (UCSD); Dragomir Anguelov (Waymo) | N/A | N/A |
| 3D-Aware Indoor Scene Synthesis with Depth Priors | Zifan SHI (HKUST)*; Yujun Shen (Dept. of IE, CUHK); Jiapeng Zhu (HKUST); Dit-Yan Yeung (HKUST); Qifeng Chen (HKUST) | N/A | N/A |
| Restore Globally, Refine Locally: A Mask-Guided Scheme to Accelerate Super-Resolution Networks | xiaotao hu (Nankai University); Jun Xu (Nankai University)*; Shuhang Gu (ETH Zurich, Switzerland); Ming-Ming Cheng (Nankai University); Li Liu (the inception institute of artificial intelligence) | N/A | N/A |
| Modeling Mask Uncertainty in Hyperspectral Image Reconstruction | jiamian wang (Santa Clara University)*; Yulun Zhang (ETH Zurich); Xin Yuan (Westlake University); Ziyi Meng (Kuaishou Technology); Zhiqiang Tao (Santa Clara University) | N/A | N/A |
| Perceiving and Modeling Density for Image Dehazing | Tian Ye (Jimei University)*; Yunchen Zhang (China Design Group Ltd.Co); Erkang Chen (Jimei University); MingChao Jiang (JOYY.INC); Yun Liu (Southwest University); Liang Chen (Fujian Normal University); Sixiang Chen (JiMei University) | N/A | N/A |
| ROBIN: A Benchmark for Robustness to Individual Nuisances in Real-World Out-of-Distribution Shifts | Bingchen Zhao (University of Edinburgh)*; Shaozuo Yu (Tongji University); Wufei Ma (Purdue University); Mingxin Yu (Peking University); Shenxiao Mei (Johns Hopkins University); Angtian Wang (Johns Hopkins University); Ju He (Johns Hopkins University); Alan Yuille (Johns Hopkins University); Adam Kortylewski (Max Planck Institute for Informatics) | N/A | N/A |
| Delving into Details: Synopsis-to-Detail Networks for Video Recognition | Shuxian Liang (Zhejiang University)*; Xu Shen (Alibaba Group); Jianqiang Huang (Alibaba Group); Xian-Sheng Hua (Alibaba Group) | N/A | N/A |
| Bringing Rolling Shutter Images Alive with Dual Reversed Distortion | Zhihang Zhong (The University of Tokyo); Mingdeng Cao (Tsinghua University); Xiao Sun (Microsoft Research Asia); Zhirong Wu (Microsoft Research); Zhongyi Zhou (The University of Tokyo); Yinqiang Zheng (The University of Tokyo)*; Stephen Lin (Microsoft Research); Imari Sato (National Institute of Informatics) | N/A | N/A |
| SimCC: a Simple Coordinate Classification Perspective for Human Pose Estimation | Yanjie Li (Tsinghua University)*; Sen Yang (Southeast University); Peidong Liu (Tsinghua University); 寿奎 张 (meituan); Yunxiao Wang (Tsinghua University); Zhicheng Wang (Nreal); Wankou Yang (Southeast University); Shu-Tao Xia (Tsinghua University) | N/A | N/A |
| Generative Multiplane Images: Making a 2D GAN 3D-Aware | Xiaoming Zhao (University of Illinois at Urbana-Champaign)*; Fangchang Ma (Apple Inc.); David Güera (Apple Inc.); Zhile Ren (Apple Inc.); Alexander Schwing (UIUC); Alex Colburn (Apple Inc.) | N/A | N/A |
| Self-supervised Social Relation Representation for Human Group Detection | Jiacheng Li (College of Intelligence and Computing, Tianjin University); Ruize Han (College of Intelligence and Computing, Tianjin University)*; Haomin Yan (Tianjin University); Zekun Qian (College of Intelligence and Computing, Tianjin University); Wei Feng (College of Intelligence and Computing, Tianjin University, China); Song Wang (University of South Carolina) | N/A | N/A |
| Stripformer: Strip Transformer for Fast Image Deblurring | Fu-Jen Tsai (National Tsing Hua University)*; Yan-Tsung Peng (National Chengchi University); Yen-Yu Lin (National Yang Ming Chiao Tung University); Chung-Chi Tsai (Qualcomm Technology); Chia-Wen Lin (National Tsing Hua University) | N/A | N/A |
| Deep Fourier-based Exposure Correction Network with Spatial-Frequency Interaction | Jie Huang (University of Science and Technology of China); Yajing Liu (USTC); Feng Zhao (University of Science and Technology of China)*; Keyu Yan (University of Science and Technology of China); Jinghao Zhang (University of Science and Technology of China); Yukun Huang (University of Science and Technology of China); man zhou (University of Science and Technology of China); Zhiwei Xiong (University of Science and Technology of China) | N/A | N/A |
| Organic Priors in Non-Rigid Structure from Motion | Suryansh Kumar (ETH Zurich)*; Luc Van Gool (ETH Zurich) | N/A | N/A |
| TEMOS: Generating diverse human motions from textual descriptions | Mathis Petrovich (Ecole des Ponts)*; Michael Black (Max Planck Institute for Intelligent Systems); Gul Varol (Ecole des Ponts ParisTech) | N/A | N/A |
| Semantic-Aware Fine-Grained Correspondence | Yingdong Hu (Tsinghua University); Renhao Wang (Tsinghua University); Kaifeng Zhang (Tsinghua University); Yang Gao (Tsinghua University)* | N/A | N/A |
| Layered Controllable Video Generation | Jiahui Huang (University of British Columbia)*; Yuhe Jin (University of British Columbia); Kwang Moo Yi (University of British Columbia); Leonid Sigal (University of British Columbia) | N/A | N/A |
| GraphVid: It Only Takes a Few Nodes to Understand a Video | Eitan Kosman (Bosch AI)*; Dotan Di Castro (Bosch) | N/A | N/A |
| Cross-Modality Knowledge Distillation Network for Monocular 3D Object Detection | Yu Hong (Zhejiang University); Hang Dai (Mohamed bin Zayed University of Artificial Intelligence)*; Yong Ding (Zhejiang University) | N/A | N/A |
| Adaptive Token Sampling For Efficient Vision Transformers | Mohsen Fayyaz (Microsoft)*; Soroush Abbasi Koohpayegani (University of Maryland Baltimore County); Farnoush Rezaei Jafari (Technische Universität Berlin); Sunando Sengupta (Microsoft); HAMID VAEZI JOZE (Microsoft); Eric Sommerlade (Microsoft); Hamed Pirsiavash (University of California Davis); Jürgen Gall (University of Bonn) | N/A | N/A |
| Implicit Field Supervision For Robust Non-Rigid Shape Matching | Ramana S Sundararaman (Ecole Polytechnique)*; Gautam Pai (École Polytechnique); Maks Ovsjanikov (Ecole polytechnique) | N/A | N/A |
| NeuMesh: Learning Disentangled Neural Mesh-based Implicit Field for Geometry and Texture Editing | Bangbang Yang (Zhejiang University); Chong Bao (Zhejiang University); Junyi Zeng (Zhejiang University); Hujun Bao (Zhejiang University); Yinda Zhang (Google); Zhaopeng Cui (Zhejiang University); Guofeng Zhang (Zhejiang University)* | N/A | N/A |
| KXNet: A Model-Driven Deep Neural Network for Blind Super-Resolution | Jiahong Fu (Xi’an Jiaotong University)*; Hong Wang (Jarvis Lab,Tencent ); Qi Xie (Xi’an Jiaotong University); Qian Zhao (Xi’an Jiaotong University); Deyu Meng (Xi’an Jiaotong University); Zongben Xu (Xi’an Jiaotong University) | N/A | N/A |
| RealFlow: EM-based Realistic Optical Flow Datasets Generation from Videos | Yunhui Han (THU;Megvii); Kunming Luo (Megvii); Ao Luo (Megvii); Jiangyu Liu (megvii inc); Haoqiang Fan (Megvii Inc(face++)); Guiming Luo (School of Software, Tsinghua University); Shuaicheng Liu (UESTC; Megvii)* | N/A | N/A |
| Semi-supervised Object Detection via Virtual Category Learning | Changrui Chen (University of Warwick); Kurt Debattista (University of Warwick, UK); Jungong Han (Aberystwyth University)* | N/A | N/A |
| PrivHAR: Recognizing Human Actions From Privacy-preserving Lens | Carlos Hinojosa (Universidad Industrial de Santander)*; Miguel A Marquez (UIS Colombia); Henry Arguello (Universidad Industrial Santander); Ehsan Adeli (Stanford University); Li Fei-Fei (Stanford University); Juan Carlos Niebles (Salesforce & Stanford University) | N/A | N/A |
| Solution Space Analysis of Essential Matrix based on Algebraic Error Minimization | Gaku Nakano (NEC Corporation)* | N/A | N/A |
| EvAC3D: From Event-based Apparent Contours to 3D Models via Continuous Visual Hulls | Ziyun Wang (University of Pennsylvania)*; Kenneth Chaney (University of Pennsylvania); Kostas Daniilidis (University of Pennsylvania) | N/A | N/A |
| DCCF: Deep Comprehensible Color Filter Learning Framework for High-Resolution Image Harmonization | Ben Xue (Peking University); Shenghui Ran (Alibaba Group); Quan Chen (Alibaba Group)*; Rongfei Jia (Alibaba Group); Binqiang Zhao (Alibaba); Xing Tang (Alibaba Group) | N/A | N/A |
| UniTAB: Unifying Text and Box Outputs for Grounded Vision-Language Modeling | Zhengyuan Yang (Microsoft)*; Zhe Gan (Microsoft); Jianfeng Wang (Microsoft); Xiaowei Hu (Microsoft); Faisal Ahmed (Microsoft); Zicheng Liu (Microsoft); Yumao Lu (Microsoft); Lijuan Wang (Microsoft) | N/A | N/A |
| Grasp’D: Differentiable Contact-rich Grasp Synthesis for Multi-fingered Hands | Dylan Turpin (University of Toronto)*; Liquan Wang (University of Toronto); Eric Heiden (University of Southern California); Yun-Chun Chen (University of Toronto ); Miles Macklin (NVIDIA); Stavros Tsogkas (University of Toronto); Sven Dickinson (University of Toronto); Animesh Garg (University of Toronto, Vector Institute, Nvidia) | N/A | N/A |
| The Abduction of Sherlock Holmes: A Dataset for Visual Abductive Reasoning | Jack Hessel (Allen Institute for AI)*; Jena D Hwang (Allen Institute for AI); Jae Sung Park (University of Washington); Rowan Zellers (University of Washington); Chandra Bhagavatula (AllenAI); Anna Rohrbach (UC Berkeley); Kate Saenko (Boston University); Yejin Choi (University of Washington) | N/A | N/A |
| Cross-Modal Knowledge Transfer Without Task-Relevant Source Data | SK MIRAJ AHMED (University of California Riverside); Suhas Lohit (Mitsubishi Electric Research Laboratories)*; Kuan-Chuan Peng (Mitsubishi Electric Research Laboratories (MERL)); Michael J Jones (MERL); Amit K. Roy-Chowdhury (University of California, Riverside) | N/A | N/A |
| Approximate Differentiable Rendering with Algebraic Surfaces | Leonid Keselman (Carnegie Mellon University)*; Martial Hebert (Carnegie Mellon School of Computer Science) | N/A | N/A |
| Sim-2-Sim Transfer for Vision-and-Language Navigation in Continuous Environments | Jacob Krantz (Oregon State University)*; Stefan Lee (Oregon State University) | N/A | N/A |
| Uncertainty-DTW for Time Series and Sequences | Lei Wang (The Australian National University); Piotr Koniusz (ANU College of Engineering and Computer Science)* | N/A | N/A |
| Affine Correspondences between Multi-Camera Systems for 6DOF Relative Pose Estimation | Banglei Guan (National University of Defense Technology)*; Ji Zhao (Huazhong University of Science and Technology) | N/A | N/A |
| Improving Self-supervised Lightweight Model Learning via Hard-aware Metric Distillation | Hao Liu (Beijing Institute of Technology); Mang Ye (Wuhan University)* | N/A | N/A |
| NÜWA: Visual Synthesis Pre-training for Neural visUal World creAtion | Chenfei Wu (Microsoft)*; Jian Liang (Peking University); Lei Ji (Microsoft); Fan Yang (MSRA); Yuejian Fang (Peking University); Daxin Jiang (Microsoft, Beijing, China); Nan Duan (Microsoft Research) | N/A | N/A |
| BATMAN: Bilateral Attention Transformer in Motion-Appearance Neighboring Space for Video Object Segmentation | Ye Yu (Microsoft)*; Jialin Yuan (Oregon State University); Gaurav Mittal (Microsoft); Li Fuxin (Oregon State University); Mei Chen (Microsoft) | N/A | N/A |
| DiffuStereo: High Quality Human Reconstruction via Diffusion-based Stereo Using Sparse Cameras | Ruizhi Shao (Tsinghua University); Zerong Zheng (Tsinghua University); Hongwen Zhang (Tsinghua University); Jingxiang Sun (University of Illinois Urbana-Champaign); Yebin Liu (Tsinghua University)* | N/A | N/A |
| The Challenges of Continuous Self-Supervised Learning | Senthil Purushwalkam (Carnegie Mellon University); Pedro Morgado (CMU)*; Abhinav Gupta (CMU/FAIR) | N/A | N/A |
| Deep Radial Embedding for Visual Sequence Learning | Yuecong Min (Institute of Computing Technology, Chinese Academy of Sciences); Peiqi Jiao (Institute of Computing Technology, Chinese Academy of Sciences); Yanan Li (Xiaomi); Wang Xiaotao (Xiaomi); LEI LEI (Xiaomi); Xiujuan Chai (Agricultural Information Institute, Chinese); Xilin Chen (Institute of Computing Technology, Chinese Academy of Sciences)* | N/A | N/A |
| Shape-Pose Disentanglement using SE(3)-equivariant Vector Neurons | Oren Katzir (Tel Aviv University)*; Dani Lischinski (The Hebrew University of Jerusalem); Danny Cohen-Or (Tel Aviv University) | N/A | N/A |
| 3D Object Detection with a Self-supervised Lidar Scene Flow Backbone | Emeç Erçelik (Technical University of Munich)*; Ekim Yurtsever (The Ohio State University); Mingyu Liu (TUM); Zhijie Yang (Technical University of Munich); Hanzhen Zhang (TUM); Pınar Topçam (Technical University of Munich); Maximilian Listl (Technical University of Munich); Yılmaz Kaan Çaylı (Technical University of Munich); Alois C. Knoll (Robotics and Embedded Systems) | N/A | N/A |
| FH-Net: A Fast Hierarchical Network for Scene Flow Estimation on Real-world Point Clouds | lihe Ding (Beijing Institute of Technology)*; Shaocong Dong (Beijing Institute of Technology); Tingfa Xu (Beijing Institute of Technology); xinli Xu (Beijing Institute of Technology); Jie Wang (Beijing Institute of Technology); Jianan Li (Beijing Institute of Technology) | N/A | N/A |
| Vote from the Center: 6 DoF Pose Estimation in RGB-D Images by Radial Keypoint Voting | Yangzheng Wu (Queen’s University)*; Mohsen Zand (Queen’s University); Ali Etemad (Queen’s University); Michael Alan Greenspan (Queen’s University) | N/A | N/A |
| Flow graph to Video Grounding for Weakly-supervised Multi-Step Localization | NIKITA DVORNIK (Samsung)*; Isma Hadji (Samsung AI Center – Toronto); Hai X Pham (Samsung AI Center); Dhaivat Bhatt (Samsung); Brais Martinez (Samsung AI Center); Afsaneh Fazly (SAIC Toronto); Allan D Jepson (Samsung Toronto AIC) | N/A | N/A |
| Neural Radiance Transfer Fields for Relightable Novel-view Synthesis with Global Illumination | Linjie Lyu (MPII)*; Ayush Tewari (MIT); Thomas Leimkuehler (MPI Informatik); Marc Habermann (Max Planck Institute for Informatics); Christian Theobalt (MPI Informatik) | N/A | N/A |
| Learning Topological Interactions for Multi-Class Medical Image Segmentation | Saumya Gupta (Stony Brook University)*; Xiaoling Hu (Stony Brook University); James Kaan (Stony Brook University); Michael Jin (Stony Brook University Hospital); Mutshipay Christian Mpoy (SUNY Stony Brook Medicine); Katherine Chung (Stony Brook University Hospital); Gagandeep Singh (RWJBarnabas Health); Mary Saltz (Stony Brook); Tahsin Kurc (Stony Brook University); Joel Saltz (Stony Brook University); APOSTOLOS K TASSIOPOULOS (Stony Brook University); Prateek Prasanna (Stony Brook University); Chao Chen (Stony Brook University) | N/A | N/A |
| Look Both Ways: Self-Supervising Driver Gaze Estimation and Road Scene Saliency | Isaac H Kasahara (University of Minnesota); Simon Stent (Toyota Research Institute); Hyun Soo Park (The University of Minnesota)* | N/A | N/A |
| ObjectBox: From Centers to Boxes for Anchor-Free Object Detection | Mohsen Zand (Queen’s University)*; Ali Etemad (Queen’s University); Michael Alan Greenspan (Queen’s University) | N/A | N/A |
| Unsupervised Segmentation in Real-World Images via Spelke Object Inference | Honglin Chen (Stanford University); Rahul M V (Stanford University); Yoni I Friedman (MIT); Jiajun Wu (Stanford University); Joshua Tenenbaum (MIT); Daniel Yamins (Stanford University); Daniel Bear (Stanford University)* | N/A | N/A |
| A Dense Material Segmentation Dataset for Indoor and Outdoor Scene Parsing | Paul Upchurch (Apple)*; Ransen Niu (Apple) | N/A | N/A |
| Pixel-wise Energy-biased Abstention Learning for Anomaly Segmentation on Complex Urban Driving Scenes | Yu Tian (Australian Institute for Machine Learning, University of Adelaide ); Yuyuan Liu (University of Adelaide); Guansong Pang (Singapore Management University)*; Fengbei Liu (University of Adelaide); Yuanhong Chen (University of Adelaide); Gustavo Carneiro (University of Adelaide) | N/A | N/A |
| Identifying Hard Noise in Long-Tailed Sample Distribution | Xuanyu Yi (Nanyang Technological University)*; Kaihua Tang (Nanyang Technological University); Xian-Sheng Hua (Damo Academy, Alibaba Group); Joo-Hwee Lim (Institute for Infocomm Research); Hanwang Zhang (Nanyang Technological University) | N/A | N/A |
| PressureVision: Estimating Hand Pressure from a Single RGB Image | Patrick L Grady (Georgia Institute of Technology)*; Chengcheng Tang (Facebook Reality Labs); Samarth Brahmbhatt (Intel); Christopher D Twigg (Meta); Chengde Wan (Facebook Reality Lab); James Hays (Georgia Institute of Technology, USA); Charlie Kemp (Georgia Institute of Technology) | N/A | N/A |
| PACTran: PAC-Bayesian Metrics for Estimating the Transferability of Pretrained Models to Classification Tasks | Nan Ding (Google)*; Xi Chen (Google Research); Tomer Levinboim (Google); Soravit Changpinyo (Google Research); Radu Soricut (Google) | N/A | N/A |
| Beyond Periodicity: Towards a Unifying Framework for Activations in Coordinate-MLPs | Sameera Ramasinghe (University of Adelaide)*; Simon Lucey (University of Adelaide) | N/A | N/A |
| Pose for Everything: Towards Category-Agnostic Pose Estimation | Lumin XU (The Chinese University of Hong Kong)*; Sheng Jin (The University of Hong Kong); Wang ZENG (The Chinese University of Hong Kong); Wentao Liu (Sensetime); Chen Qian (SenseTime); Wanli Ouyang (The University of Sydney); Ping Luo (The University of Hong Kong); Xiaogang Wang (Chinese University of Hong Kong, Hong Kong) | N/A | N/A |
| UIA-ViT: Unsupervised Inconsistency-Aware Method based on Vision Transformer for Face Forgery Detection | Wanyi Zhuang (University of Science and Technology of China); Qi Chu (University of Science and Technology of China)*; Zhentao Tan (University of Science and Technology of China); Qiankun Liu (University of Science and Technology of China); Haojie Yuan (University of Science and Technology of China); Changtao Miao (University of Science and Technology of China); Zixiang Luo (University of Science and Technology of China); Nenghai Yu (University of Science and Technology of China) | N/A | N/A |
| PREF: Predictability Regularized Neural Motion Fields | Liangchen Song (University at Buffalo)*; Xuan Gong (University at Buffalo); Benjamin Planche (United Imaging Intelligence); Meng Zheng (United Imaging Intelligence); David Doermann (University at Buffalo); Junsong Yuan (State University of New York at Buffalo, USA); Terrence Chen (United Imaging Intelligence); Ziyan Wu (United Imaging Intelligence) | N/A | N/A |
| Bi-PointFlowNet: Bidirectional Learning for Point Cloud Based Scene Flow Estimation | WENCAN CHENG (Sungkyunkwan University); Jong Hwan Ko (Sungkyunkwan University)* | N/A | N/A |
| Bayesian Tracking of Video Graphs Using Joint Kalman Smoothing and Registration | Aditi Basu Bal (Florida State University)*; Ramy A Mounir (University of South Florida); Sathyanarayanan N Aakur (OK State); Sudeep Sarkar (University of South Florida, Tampa); Anuj Srivastava (Florida State University) | N/A | N/A |
| Semidefinite Relaxations of Truncated Least-Squares in Robust Rotation Search: Tight or Not | Liangzu Peng (Johns Hopkins University)*; Mahyar Fazlyab (Johns Hopkins University); Rene Vidal (Johns Hopkins University, USA) | N/A | N/A |
| Lottery Ticket Hypothesis for Spiking Neural Networks | Youngeun Kim (Yale University)*; Yuhang Li (Yale University); Hyoungseob Park (Yale University); Yeshwanth Venkatesha (Yale university); Ruokai Yin (Yale University); Priyadarshini Panda (Yale University) | N/A | N/A |
| Multi-domain Learning for Updating Face Anti-spoofing Models | Xiao Guo (Michigan State University)*; Yaojie Liu (Google Research); Anil Jain (Michigan State University); Xiaoming Liu (Michigan State University) | N/A | N/A |
| Towards Realistic Semi-Supervised Learning | Mamshad Nayeem Rizve (University of Central Florida)*; Navid Kardan (University of Central Florida); Mubarak Shah (University of Central Florida) | N/A | N/A |
| Unsupervised Pose-aware Part Decomposition for Man-made Articulated Objects | Yuki Kawana (The University of Tokyo)*; Yusuke Mukuta (The University of Tokyo); Tatsuya Harada (The University of Tokyo / RIKEN) | N/A | N/A |
| Cartoon Explanations of Image Classifiers | Stefan Kolek (LMU)*; Duc Anh Nguyen (LMU Munich); Ron Levie (Technion); Joan Bruna (Courant Institute of Mathematical Sciences, NYU, USA); Gitta Kutyniok (Ludwig Maximilian University of Munich) | N/A | N/A |
| RRSR: Reciprocal Reference-based Image Super-Resolution with Progressive Feature Alignment and Selection | Lin Zhang (CASIA); Xin Li (Baidu); Dongliang He (Baidu)*; Fu Li (Baidu); Yili Wang (Tsinghua University); Zhaoxiang Zhang (Chinese Academy of Sciences, China) | N/A | N/A |
| Gaussian Activated Neural Radiance Fields for High Fidelity Reconstruction & Pose Estimation | Shin-Fang Chng (The University of Adelaide)*; Sameera Ramasinghe (University of Adelaide); Jamie Sherrah (AIML); Simon Lucey (University of Adelaide) | N/A | N/A |
| Unbiased Gradient Estimation for Differentiable Surface Splatting via Poisson Sampling | Jan U. Müller (University of Bonn)*; Michael Weinmann (TU Delft); Reinhard Klein (University of Bonn) | N/A | N/A |
| “This is my unicorn, Fluffy”: Personalizing frozen vision-language representations | Niv Cohen (The Hebrew University of Jerusalem)*; Rinon Gal (Tel Aviv University); Eli Meirom (NVIDIA Research); Gal Chechik (NVIDIA); Yuval Atzmon (NVIDIA Research) | N/A | N/A |
| Learning Uncoupled-Modulation CVAE for 3D Action-Conditioned Human Motion Synthesis | Chongyang Zhong (Institute of Computing Technology, Chinese Academy of Sciences)*; Lei Hu (Institute of Computing Technology, Chinese Academy of Sciences ); Zihao Zhang (Institute of Computing Technology, Chinese Academy of Sciences); Shihong Xia (institute of computing technology of the Chinese academy of sciences) | N/A | N/A |
| Generative Domain Adaptation for Face Anti-Spoofing | Qianyu Zhou (Shanghai Jiao Tong University)*; Ke-Yue Zhang (YouTu Lab, Tencent); Taiping Yao (Tencent YouTu); Ran Yi (Shanghai Jiao Tong University); Kekai Sheng (Youtu Lab, Tencent Inc.); Shouhong Ding (Tencent); Lizhuang Ma (Shanghai Jiao Tong University) | N/A | N/A |
| Learning Depth from Focus in the Wild | Changyeon Won (GIST)*; Hae-Gon Jeon (GIST) | N/A | N/A |
| Relighting4D: Neural Relightable Human from Videos | Zhaoxi Chen (Nanyang Technological University )*; Ziwei Liu (Nanyang Technological University) | N/A | N/A |
| PPT: token-Pruned Pose Transformer for monocular and multi-view human pose estimation | Haoyu Ma (University of California, Irvine)*; Zhe Wang (UC-Irvine); Yifei Chen (Tencent); Deying Kong (university of california, irvine); Liangjian Chen (Reality Labs); Xingwei Liu (University of California Irvine); Xiangyi Yan (University of California, Irvine); Hao Tang (University of California Irvine); Xiaohui Xie (University of California, Irvine) | N/A | N/A |
| Understanding the Dynamics of DNNs Using Graph Modularity | Yao Lu (Zhejiang University of Technology)*; Wen Yang (Zhejiang University of Technology); Yunzhe Zhang (Zhejiang University of Technology); Zuohui Chen (Zhejiang University of Technology); Jinyin Chen (Zhejiang University of Technology); Qi Xuan (Zhejiang University of Technology); Zhen Wang (Northwestern Polytechnical University); Xiaoniu Yang (Zhejiang University of Technology; Science and Technology on Communication Information Security Control Laboratory) | N/A | N/A |
| Discriminability-Transferability Trade-Off: An Information-Theoretic Perspective | Quan Cui (Waseda University)*; Bingchen Zhao (University of Edinburgh); Zhao-Min Chen (NanJing University); Borui Zhao (Megvii Technology); Renjie Song (Megvii Inc.); Boyan Zhou (ByteDance); Jiajun Liang (Megvii); Osamu Yoshie (Waseda University) | N/A | N/A |
| Learning-based Point Cloud Registration for 6D Object Pose Estimation in the Real World | Zheng Dang (EPFL)*; Lizhou Wang (Xi’an Jiaotong University); Yu Guo (School of Software Engineering, Xi’an Jiaotong University); Mathieu Salzmann (EPFL) | N/A | N/A |
| AvatarPoser: Articulated Full-Body Pose Tracking from Sparse Motion Sensing | Jiaxi Jiang (ETH Zurich)*; Paul Streli (ETH Zurich); Huajian Qiu (EPFL); Andreas R Fender (ETH Zurich); Larissa Laich (Facebook Reality Labs); Patrick Snape (Meta); Christian Holz (ETH Zürich) | N/A | N/A |
| Knowledge Condensation Distillation | chenxin li (Xiamen University)*; Mingbao Lin (Xiamen University, China); Zhiyuan Ding (Xiamen University); Nie Lin (Hunan University); Yihong Zhuang (Xiamen University); Yue Huang (Xiamen University); Xinghao Ding (Xiamen University); Liujuan Cao (Xiamen University) | N/A | N/A |
| CAR: Class-aware Regularizations for Semantic Segmentation | Ye Huang (University of Technology Sydney)*; Di Kang (Tencent); Liang Chen (Fujian Normal University); Xuefei Zhe (Tencent AI lab); Wenjing Jia (University of Technology Sydney); Linchao Bao (Tencent AI Lab); Xiangjian He (University of Nottingham Ningbo China) | N/A | N/A |
| Style-Hallucinated Dual Consistency Learning for Domain Generalized Semantic Segmentation | Yuyang Zhao (National University of Singapore)*; Zhun Zhong (University of Trento); Na Zhao (NUS); Nicu Sebe (University of Trento); Gim Hee Lee (National University of Singapore) | N/A | N/A |
| Reducing Information Loss for Spiking Neural Networks | Yufei Guo (The Second Academy of China Aerospace Science and Industry Corporation)*; Yuanpei Chen (X LAB,The Second Academy of CASIC,Beijing); Liwen Zhang (X Lab, the Second Academy of CASIC, Beijing); YingLei Wang (CASIC); Xiaode Liu (X Lab, The Second Academy of China Aerospace Science and Industry Corporation); Xinyi Tong (The Second Academy of China Aerospace Science and Industry Corporation); Yuanyuan Ou (Chongqing University); Xuhui Huang (X Lab, The Second Academy of CASIC); Zhe Ma (Xlab, the Second Academy of CASIC, Beijing) | N/A | N/A |
| Real-Time Intermediate Flow Estimation for Video Frame Interpolation | Zhewei Huang (MEGVII)*; Tianyuan Zhang (Carnegie Mellon University); Wen Heng (Megvii inc.); Boxin Shi (Peking University); Shuchang Zhou (MEGVII Technology) | N/A | N/A |
| Class-incremental Novel Class Discovery | Subhankar Roy (University of Trento); Mingxuan Liu (University of Trento); Zhun Zhong (University of Trento)*; Nicu Sebe (University of Trento); Elisa Ricci (University of Trento) | N/A | N/A |
| PixelFolder: An Efficient Progressive Pixel Synthesis Network for Image Generation | Jing He (Xiamen university)*; Yiyi Zhou (Xiamen University); Qi Zhang (Tencent); Jun Peng (Xiamen University); Yunhang Shen (Xiamen University); Xiaoshuai Sun (Xiamen University); Chao Chen (Youtu Laboratory); Rongrong Ji (Xiamen University, China) | N/A | N/A |
| Minimal Neural Atlas: Parameterizing Complex Surfaces with Minimal Charts and Distortion | Weng Fei Low (National University of Singapore)*; Gim Hee Lee (National University of Singapore) | N/A | N/A |
| Contrastive Prototypical Network with Wasserstein Confidence Penalty | Haoqing Wang (Peking University)*; Zhi-Hong Deng (Peking University) | N/A | N/A |
| Privacy-Preserving Face Recognition with Learnable Privacy Budgets in Frequency Domain | Jiazhen Ji (Tencent)*; Huan Wang (Xiamen University); Yuge Huang (Tencent YouTu); Jiaxiang Wu (Tencent); Xingkun Xu (Tencent); Shouhong Ding (Tencent); ShengChuan Zhang (Xiamen University); Liujuan Cao (Xiamen University); Rongrong Ji (Xiamen University, China) | N/A | N/A |
| An End-to-End Transformer Model for Crowd Localization | Dingkang Liang (Huazhong University of Science and Technology)*; Wei Xu (Beijing University of Posts and Telecommunications); Xiang Bai (Huazhong University of Science and Technology) | N/A | N/A |
| Deformable Feature Aggregation for Dynamic Multi-Modal 3D Object Detection | Zehui Chen (University of Science and Technology of China); Zhenyu Li (Harbin Institute of Technology); Shiquan Zhang (SenseTime Research); Liangji Fang (Sensetime Research); Qinhong Jiang (SenseTime Research; Shanghai AI Laboratory); Feng Zhao (University of Science and Technology of China)* | N/A | N/A |
| Masked Generative Distillation | Zhendong Yang (Graduate school at ShenZhen,Tsinghua university)*; Zhe Li (Bytedance Inc.); Shao Mingqi (Graduate school at ShenZhen, Tsinghua university); Dachuan Shi (Graduate school at ShenZhen, Tsinghua University); Zehuan Yuan (Bytedance.Inc); Chun Yuan (Graduate school at ShenZhen,Tsinghua university) | N/A | N/A |
| Saliency Hierarchy Modeling via Generative Kernels for Salient Object Detection | Wenhu Zhang (Zhejiang University)*; Liangli Zheng (Zhejiang University); Huanyu Wang (Zhejiang University); Xintian Wu (Zhejiang University); Xi Li (Zhejiang University) | N/A | N/A |
| Tip-Adapter: Training-free Adaption of CLIP for Few-shot Classification | Renrui Zhang (Shanghai AI Lab)*; Zhang Wei (Shanghai AI-Lab); Rongyao Fang (Chinese University of Hong Kong); Peng Gao (Chinese university of hong kong); Kunchang Li (Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences); Jifeng Dai (SenseTime); Yu Qiao (Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences); Hongsheng Li (The Chinese University of Hong Kong) | N/A | N/A |
| Temporal Lift Pooling for Continuous Sign Language Recognition | Lianyu Hu (Tianjin University)*; Liqing Gao (College of Intelligence and Computing,Tianjin University); Zekang Liu (College of Intelligence and Computing, Tianjin University); Wei Feng (College of Intelligence and Computing, Tianjin University, China) | N/A | N/A |
| MORE: Multi-Order RElation Mining for Dense Captioning in 3D Scenes | Yang Jiao (Fudan University)*; Shaoxiang Chen (Fudan University); Zequn Jie (Meituan inc.); Jingjing Chen (Fudan University); Lin Ma (Meituan); Yu-Gang Jiang (Fudan University) | N/A | N/A |
| JPEG Artifacts Removal via Contrastive Representation Learning | Xi Wang (University of Science and Technology of China); Xueyang Fu (University of Science and Technology of China)*; Yurui Zhu (University of Science and Technology of China); Zheng-Jun Zha (University of Science and Technology of China) | N/A | N/A |
| Tackling Long-Tailed Category Distribution Under Domain Shifts | Xiao Gu (Imperial College London)*; Yao Guo (Shanghai Jiao Tong University); Zeju Li (Imperial College London); Jianing Qiu (Imperial College London); DOU QI (The Chinese University of Hong Kong); Yuxuan Liu (Institute of Medical Robotics, Shanghai Jiao Tong University); Benny P L Lo (Imperial College London); Guang-Zhong Yang (SJTU) | N/A | N/A |
| WeLSA: Learning To Predict 6D Pose From Weakly Labeled Data Using Shape Alignment | Shishir Reddy Vutukur (TU Munich / Siemens Technology)*; Ivan Shugurov (TU Munich / Siemens Corporate Technology); Benjamin Busam (Technical University of Munich); ANDREAS HUTTER (Siemens Corporate Technology, Germany); Slobodan Ilic (TUM) | N/A | N/A |
| Fine-grained Data Distribution Alignment for Post-Training Quantization | Yunshan Zhong (xiamen university)*; Mingbao Lin (Xiamen University, China); Mengzhao Chen (Xiamen University); Ke Li (Tencent); Yunhang Shen (Xiamen University); Fei Chao (Xiamen University); Yongjian Wu (Tencent Technology (Shanghai) Co.,Ltd); Rongrong Ji (Xiamen University, China) | N/A | N/A |
| Few-shot Single-view 3D Reconstruction with Memory Prior Contrastive Network | Zhen Xing (Fudan University)*; Yijiang Chen (Fudan University); Zhixin Ling (Fudan University); Xiangdong Zhou (Fudan University); Yu Xiang (The University of Texas at Dallas) | N/A | N/A |
| ExtrudeNet: Unsupervised Inverse Sketch-and-Extrude for Shape Parsing | Daxuan Ren (Nanyang Technological University)*; Jianmin Zheng (Nanyang Technological University); Jianfei Cai (Monash University); jiatong j li (Sensetime); Junzhe Zhang (Nanyang Technological University) | N/A | N/A |
| P-STMO: Pre-Trained Spatial Temporal Many-to-One Model for 3D Human Pose Estimation | Wenkang Shan (Peking University)*; Zhenhua Liu (Peking University); xinfeng zhang (University of Chinese Academy of Sciences); Shanshe Wang (Peking University); Siwei Ma (Peking University, China); Wen Gao (PKU) | N/A | N/A |
| Contrast-Phys: Unsupervised Video-based Remote Physiological Measurement via Spatiotemporal Contrast | Zhaodong Sun (University of Oulu)*; Xiaobai Li (University of Oulu) | N/A | N/A |
| Panoptic Scene Graph Generation | Jingkang Yang (Nanyang Technological University)*; Yi Zhe Ang (Nanyang Technological University); Zujin GUO (Nanyang Technological University); Kaiyang Zhou (Nanyang Technological University); Wayne Zhang (SenseTime Research); Ziwei Liu (Nanyang Technological University) | N/A | N/A |
| StyleSwap: Style-Based Generator Empowers Robust Face Swapping | Zhiliang Xu (Baidu Inc.); Hang Zhou (The Chinese University of Hong Kong)*; Zhibin Hong (Baidu Inc.); Ziwei Liu (Nanyang Technological University); Jiaming Liu (Baidu Inc.); zhizhi guo (Department of Computer Vision Technology (VIS), Baidu Inc); Junyu Han (Baidu Inc.); jingtuo liu (baidu); Errui Ding (Baidu Inc.); Jingdong Wang (Baidu) | N/A | N/A |
| Boosting Event Stream Super-Resolution with A Recurrent Neural Network | Wenming Weng (University of Science and Technology of China)*; Yueyi Zhang (University of Science and Technology of China); Zhiwei Xiong (University of Science and Technology of China) | N/A | N/A |
| Unknown-Oriented Learning for Open Set Domain Adaptation | jie liu (City University of Hong Kong)*; Xiaoqing Guo (City University of Hong Kong); Yixuan YUAN (City University of Hong Kong) | N/A | N/A |
| Unpaired Deep Image Dehazing Using Contrastive Disentanglement Learning | Xiang Chen (Nanjing University of Science and Technology)*; Zhentao Fan (Shenyang Aerospace University); Pengpeng Li (Dalian Polytechnic University); Longgang Dai (Shenyang Aerospace University); Caihua Kong (Shenyang Aerospace University); Zhuoran Zheng (Nanjing University of Science and Technology ); Yufeng Huang (Shenyang Aerospace University); Yufeng Li (Shenyang Aerospace University) | N/A | N/A |
| Check and Link: Pairwise Lesion Correspondence Guides Mammogram Mass Detection | Ziwei Zhao (Peking University)*; Dong Wang (Peking University); Yihong Chen (Peking University); Ziteng Wang (Yizhun-ai); Liwei Wang (Peking University) | N/A | N/A |
| Generative Subgraph Contrast for Self-Supervised Graph Representation Learning | yuehui han (njust)*; Le Hui (Nanjing University of Science and Technology); Haobo Jiang (Nanjing University of Science and Technology); Jianjun Qian (Nanjing University of Science and Technology); Jin Xie (Nanjing University of Science and Technology) | N/A | N/A |
| DVS-Voltmeter: Stochastic Process-based Event Simulator for Dynamic Vision Sensors | SongNan Lin (Nanyang Technological University)*; Ye Ma (McGill University); Zhenhua Guo (Alibaba Group); Bihan Wen (Nanyang Technological University) | N/A | N/A |
| Prototype-Guided Continual Adaptation for Class-Incremental Unsupervised Domain Adaptation | Hongbin Lin (South China University of Technology); Yifan Zhang (National University of Singapore); Zhen Qiu (South China University of Technology); Shuaicheng Niu (South China University of Technology); Chuang Gan (MIT-IBM Watson AI Lab); Yanxia Liu (South China University of Technology); Mingkui Tan (South China University of Technology)* | N/A | N/A |
| SiRi: A Simple Selective Retraining Mechanism for Transformer-based Visual Grounding | Mengxue Qu (Beijing Jiaotong University)*; Yu Wu (Princeton University); Wu Liu (AI Research of JD.com); Qiqi Gong (Beijing Jiaotong University); Xiaodan Liang (Sun Yat-sen University); Olga Russakovsky (Princeton University); Yao Zhao (Beijing Jiaotong University); Yunchao Wei (UTS) | N/A | N/A |
| Benchmarking Omni-Vision Representation through the Lens of Visual Realms | Yuanhan Zhang (Nanyang Technological University); Zhenfei Yin (Sensetime); Jing Shao (Sensetime); Ziwei Liu (Nanyang Technological University)* | N/A | N/A |
| Paint2Pix: Interactive Painting based Progressive Image Synthesis and Editing | Jaskirat Singh (Australian National University)*; Liang Zheng (Australian National University); Cameron Y Smith (Adobe Research); Jose Echevarria (Adobe System Inc.) | N/A | N/A |
| BEAT: A Large-Scale Semantic and Emotional Multi-Modal Dataset for Conversational Gestures Synthesis | Haiyang Liu (The University of Tokyo)*; Zihao Zhu (Keio University); Naoya Iwamoto (Huawei Technologies Japan K.K.); Yichen Peng (Japan Advanced Institute of Science and Technology); Zhengqing Li (Huawei Japan K.K.); YOU ZHOU (Tokyo Research Center, Huawei); Elif Bozkurt (Huawei Turkey R&D Center, Istanbul, Turkey); Bo Zheng (Huawei) | N/A | N/A |
| Active Pointly-Supervised Instance Segmentation | Chufeng Tang (Tsinghua University)*; Lingxi Xie (Huawei Inc.); Gang Zhang (Tsinghua University); xiaopeng zhang (Huawei Cloud EI ); Qi Tian (Huawei Cloud & AI); Xiaolin Hu (Tsinghua University) | N/A | N/A |
| DecoupleNet: Decoupled Network for Domain Adaptive Semantic Segmentation | Xin Lai (The Chinese University of Hong Kong)*; Zhuotao Tian (The Chinese University of Hong Kong); Xiaogang XU (The Chinese University of Hong Kong); Yingcong Chen (Hong Kong University of Science and Technology); Shu Liu (SmartMore); Hengshuang Zhao (University of Oxford); Liwei Wang (CUHK); Jiaya Jia (Chinese University of Hong Kong) | N/A | N/A |
| ByteTrack: Multi-Object Tracking by Associating Every Detection Box | Yifu Zhang (Huazhong University of Science and Technology); Peize Sun (The University of Hong Kong); Yi Jiang (Bytedance); Dongdong Yu (ByteDance Inc.); Fucheng Weng (Huazhong University of Science and Technology); Zehuan Yuan (Bytedance.Inc); Ping Luo (The University of Hong Kong); Wenyu Liu (Huazhong University of Science and Technology); Xinggang Wang (Huazhong University of Science and Technology)* | N/A | N/A |
| Robust Multi-Object Tracking by Marginal Inference | Yifu Zhang (Huazhong University of Science and Technology); Chunyu Wang (Microsoft Research asia); Xinggang Wang (Huazhong University of Science and Technology)*; Wenjun Zeng (EIT Institute for Advanced Study); Wenyu Liu (Huazhong University of Science and Technology) | N/A | N/A |
| Doubly-Fused ViT: Fuse Information from Vision Transformer Doubly with Local Representation | Li Gao (Wuhan University)*; Dong Nie (UNC); Bo Li (Alibaba Group); Xiaofeng Ren (alibaba group) | N/A | N/A |
| CATRE: Iterative Point Clouds Alignment for Category-level Object Pose Refinement | Xingyu Liu (Tsinghua University); Gu Wang (JD.COM); Yi Li (University of Washington); Xiangyang Ji (Tsinghua University)* | N/A | N/A |
| Spatiotemporal Self-attention Modeling with Temporal Patch Shift for Action Recognition | Wangmeng Xiang (The Hong Kong Polytechnic University)*; Chao Li (Alibaba); Biao Wang (Alibaba); Xihan Wei (Alibaba); Xian-Sheng Hua (Damo Academy, Alibaba Group); Lei Zhang (Hong Kong Polytechnic University, Hong Kong, China) | N/A | N/A |
| Efficient Long-Range Attention Network for Image Super-resolution | Xindong Zhang (The Hong Kong Polytechnic University)*; Hui Zeng (OPPO); Shi Guo (The Hong Kong Polytechnic University); Lei Zhang (Hong Kong Polytechnic University, Hong Kong, China) | N/A | N/A |
| DID-M3D: Decoupling Instance Depth for Monocular 3D Object Detection | Liang Peng (ZJU)*; Xiaopei Wu (Zhejiang University); Zheng Yang (FABU); Haifeng Liu (ZJU); Deng Cai (ZJU) | N/A | N/A |
| FlowFormer: A Transformer Architecture for Optical Flow | Zhaoyang Huang (Chinese University of HongKong)*; Xiaoyu Shi (CUHK); Chao Zhang (Samsung Telecommunication Research Institute); Qiang Wang (Samsung Research China, Beijing); Ka Chun Cheung (Nvidia); Hongwei Qin (Sensetime); Jifeng Dai (SenseTime); Hongsheng Li (The Chinese University of Hong Kong) | N/A | N/A |
| Coarse-to-Fine Sparse Transformer for Hyperspectral Image Reconstruction | Yuanhao Cai (Tsinghua University, Tsinghua Shenzhen International Graduate School); Jing Lin (Tsinghua University, Tsinghua Shenzhen International Graduate School)*; Xiaowan Hu (Tsinghua University, Tsinghua Shenzhen International Graduate School); Haoqian Wang (Tsinghua Shenzhen International Graduate School, Tsinghua University); Xin Yuan (Westlake University); Yulun Zhang (ETH Zurich); Radu Timofte (University of Wurzburg & ETH Zurich); Luc Van Gool (ETH Zurich) | N/A | N/A |
| An Embedded Feature Whitening Approach to Deep Neural Network Optimization | Hongwei Yong (The Hong Kong Polytechnic University)*; Lei Zhang (Hong Kong Polytechnic University, Hong Kong, China) | N/A | N/A |
| Optimization over Disentangled Encoding: Unsupervised Cross-Domain Point Cloud Completion via Occlusion Factor Manipulation | Jingyu Gong (Shanghai Jiao Tong University)*; Fengqi Liu (Shanghai Jiao Tong University); Jiachen Xu (Shanghai Jiao Tong University); Min Wang (Sensetime Group); Xin Tan (Shanghai Jiao Tong University); Zhizhong Zhang (East China Normal University); Ran Yi (Shanghai Jiao Tong University); Haichuan Song (East China Normal University); Yuan Xie (East China Normal University); Lizhuang Ma (Shanghai Jiao Tong University) | N/A | N/A |
| Source-Free Domain Adaptation with Contrastive Domain Alignment and Self-supervised Exploration for Face Anti-Spoofing | Yuchen Liu (Shanghai Jiao Tong university)*; Yabo Chen (Shanghai Jiao Tong University ); Wenrui Dai (Shanghai Jiao Tong University); Mengran Gou (Qualcomm); Chun-Ting Huang (Qualcomm); Hongkai Xiong (Shanghai Jiao Tong University) | N/A | N/A |
| MPPNet: Multi-Frame Feature Intertwining with Proxy Points for 3D Temporal Object Detection | Xuesong Chen (The Chinese University of Hong Kong)*; Shaoshuai Shi (MPI Informatics); Benjin Zhu (MEGVII); Ka Chun Cheung (Nvidia); Hang Xu (Huawei Noah’s Ark Lab); Hongsheng Li (The Chinese University of Hong Kong) | N/A | N/A |
| SdAE: Self-distillated Masked Autoencoder | Yabo Chen (Shanghai Jiao Tong University ); Yuchen Liu (Shanghai Jiao Tong university); Dongsheng Jiang (Huawei Cloud & AI); xiaopeng zhang (Huawei Cloud EI )*; Wenrui Dai (Shanghai Jiao Tong University); Hongkai Xiong (Shanghai Jiao Tong University); Qi Tian (Huawei Cloud & AI) | N/A | N/A |
| A Transformer-based Decoder for Semantic Segmentation with Multi-level Context Mining | Bowen Shi (Shanghai Jiao Tong University)*; Dongsheng Jiang (Huawei Cloud & AI); xiaopeng zhang (Huawei Cloud EI ); Han Li (Shanghai Jiao Tong University); Wenrui Dai (Shanghai Jiao Tong University); Junni Zou (Shanghai Jiao Tong University); Hongkai Xiong (Shanghai Jiao Tong University); Qi Tian (Huawei Cloud & AI) | N/A | N/A |
| Graph-constrained Contrastive Regularization for Semi-weakly Volumetric Segmentation | Simon Reiß (Karlsruhe Institute of Technology)*; Constantin Marc Seibold (Karlsruhe Institute of Technology); Alexander Freytag (Carl Zeiss AG, Jena, Germany); Rodner Erik (University of Applied Sciences Berlin); Rainer Stiefelhagen (Karlsruhe Institute of Technology) | N/A | N/A |
| Improving Vision Transformers by Revisiting High-frequency Components | Jiawang Bai (Tsinghua University)*; Li Yuan (Peking University); Shu-Tao Xia (Tsinghua University); Shuicheng Yan (Sea AI Labs); Zhifeng Li (Tencent AI Lab); Wei Liu (Tencent) | N/A | N/A |
| Adaptive Co-Teaching for Unsupervised Monocular Depth Estimation | Weisong Ren (Dalian University of Technology); Lijun Wang (Dalian University of Technology)*; Yongri Piao (Dalian University of Technology); Miao Zhang (Dalian University of Technology); Huchuan Lu (Dalian University of Technology); Ting Liu (Alibaba) | N/A | N/A |
| FurryGAN: High quality foreground-aware image synthesis | Jeongmin Bae (Yonsei University); Mingi Kwon (Yonsei University); Youngjung Uh (Yonsei University)* | N/A | N/A |
| An Efficient Spatio-Temporal Pyramid Transformer for Action Detection | Yuetian Weng (Monash University); Zizheng Pan (Monash University); Mingfei Han (Monash University; DATA61, CSIRO); Xiaojun Chang (University of Technology Sydney); Bohan Zhuang (Monash University)* | N/A | N/A |
| LocVTP: Video-Text Pre-training for Temporal Localization | Meng Cao (Peking University); Tianyu Yang (Tencent AI Lab); Junwu Weng (Tencent AI Lab); Can Zhang (Peking University); Jue Wang (Tencent AI Lab); Yuexian Zou (Peking University)* | N/A | N/A |
| Fusing Local Similarities for Retrieval-based 3D Orientation Estimation of Unseen Objects | Chen Zhao (EPFL)*; Yinlin Hu (EPFL); Mathieu Salzmann (EPFL) | N/A | N/A |
| Online Segmentation of LiDAR Sequences: Dataset and Algorithm | Romain Loiseau (École des ponts ParisTech)*; Mathieu Aubry (École des ponts ParisTech); loic landrieu (IGN) | N/A | N/A |
| MVSTER: Epipolar Transformer for Efficient Multi-View Stereo | Xiaofeng Wang (Institute of Automation, Chinese Academy of Sciences; School of Artificial Intelligence, University of Chinese Academy of Sciences)*; Zheng Zhu (Tsinghua University); Guan Huang (Institute of Automation, Chinese Academy of Sciences); Fangbo Qin (Institute of Automation, Chinese Academy of Sciences); Yun Ye (XForwardAI Technology Co., Ltd, Beijing, China); Yijia He (Beijing Kuaishou Technology Co., Ltd); Xu Chi (Phigent Robotics); Xingang Wang (Institute of Automation, CAS) | N/A | N/A |
| Unsupervised Learning of 3D Semantic Keypoints with Mutual Reconstruction | Haocheng Yuan (Northwestern Polytechnical University); Chen Zhao (EPFL); Shichao Fan (Northwestern Polytechnical University); Jiaxi Jiang (Northwestern Polytechnical University); Jiaqi Yang (Northwestern Polytechnical University)* | N/A | N/A |
| Generalizable Medical Image Segmentation via Random Amplitude Mixup and Domain-Specific Image Restoration | Ziqi Zhou (Nanjing University)*; Lei Qi (Southeast University); Yinghuan Shi (Nanjing University) | N/A | N/A |
| Demystifying Unsupervised Semantic Correspondence Estimation | Mehmet Aygün (The University of Edinburgh)*; Oisin Mac Aodha (University of Edinburgh) | N/A | N/A |
| Learning Shadow Correspondence for Video Shadow Detection | Xinpeng Ding (The Hong Kong University of Science and Technology); Jingwen Yang (The Hong Kong University of Science and Technology); Xiaowei Hu (Shanghai AI Laboratory); Xiaomeng Li (The Hong Kong University of Science and Technology)* | N/A | N/A |
| PolarMOT: How far can geometric relations take us in 3D multi-object tracking? | Aleksandr Kim (Technical University of Munich); Guillem Brasó (TUM); Aljosa Osep (TUM Munich)*; Laura Leal-Taixé (TUM) | N/A | N/A |
| Few-Shot End-to-End Object Detection via Constantly Concentrated Encoding across Heads | Jiawei Ma (Columbia University)*; Guangxing Han (Columbia University); Shiyuan Huang (Columbia University); Yuncong Yang (Columbia University); Shih-Fu Chang (Columbia University) | N/A | N/A |
| MVDECOR: Multi-view Dense Correspondence Learning for Fine-grained 3D Segmentation | Gopal Sharma (University of Massachusetts Amherst)*; Kangxue Yin (NVIDIA); Subhransu Maji (University of Massachusetts, Amherst); Evangelos Kalogerakis (UMass Amherst); Or Litany (NVIDIA); Sanja Fidler (University of Toronto, NVIDIA) | N/A | N/A |
| Implicit Neural Representations for Image Compression | Yannick Strümpler (ETH Zürich)*; Janis Postels (ETH Zurich); Ren Yang (ETH Zurich); Luc Van Gool (ETH Zurich); Federico Tombari (Google, TU Munich) | N/A | N/A |
| Cross-modal Prototype Driven Network for Radiology Report Generation | Jun Wang (University of Warwick)*; Abhir Bhalerao (University of Warwick); Yulan He (University of Warwick) | N/A | N/A |
| Scene Text Recognition with Permuted Autoregressive Sequence Models | Darwin Bautista (University of the Philippines)*; Rowel Atienza (University of the Philippines) | N/A | N/A |
| XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model | Ho Kei Cheng (University of Illinois Urbana-Champaign)*; Alexander Schwing (UIUC) | N/A | N/A |
| SUPR: A Sparse Unified Part-Based Human Body Model | Ahmed A A Osman (Max Planck Institute for Intelligent Systems)*; Michael J. Black (Max Planck Institute for Intelligent Systems); Timo Bolkart (Max Planck Institute for Intelligent Systems); Dimitrios Tzionas (University of Amsterdam) | N/A | N/A |
| SCAM! Transferring humans between images with Semantic Cross Attention Modulation | Nicolas Dufour (ENPC)*; David Picard (ENPC); Vicky Kalogeiton (Ecole Polytechnique) | N/A | N/A |
| Q-FW: A Hybrid Classical-Quantum Frank-Wolfe for Quadratic Binary Optimization | Alp Yurtsever (Umeå University); Tolga Birdal (TU Munich)*; Vladislav Golyanik (MPI for Informatics) | N/A | N/A |
| Revisiting Point Cloud Simplification: A Learnable Feature Preserving Approach | Rolandos Alexandros Potamias (Imperial College London)*; Giorgos Bouritsas (Imperial College London); Stefanos Zafeiriou (Imperial College London) | N/A | N/A |
| Neural Architecture Search for Spiking Neural Networks | Youngeun Kim (Yale University)*; Yuhang Li (Yale University); Hyoungseob Park (Yale University); Yeshwanth Venkatesha (Yale university); Priyadarshini Panda (Yale University) | N/A | N/A |
| Neuromorphic Data Augmentation for Training Spiking Neural Networks | Yuhang Li (Yale University)*; Youngeun Kim (Yale University); Hyoungseob Park (Yale University); Tamar Geller (Yale University); Priyadarshini Panda (Yale University) | N/A | N/A |
| RelPose: Predicting Probabilistic Relative Rotation for Single Objects in the Wild | Jason Y Zhang (Carnegie Mellon University)*; Deva Ramanan (Carnegie Mellon University); Shubham Tulsiani (Carnegie Mellon University) | N/A | N/A |
| Human Trajectory Prediction via Neural Social Physics | Jiangbei Yue (Leeds University); Dinesh Manocha (University of Maryland at College Park)*; He Wang (Leeds University) | N/A | N/A |
| Explicit Occlusion Reasoning for Multi-person 3D Human Pose Estimation | Qihao Liu (Johns Hopkins University); Yi Zhang (Johns Hopkins University); Song Bai (University of Oxford); Alan Yuille (Johns Hopkins University)* | N/A | N/A |
| R2L: Distilling Neural Radiance Field to Neural Light Field for Efficient Novel View Synthesis | Huan Wang (Northeastern University); Jian Ren (Snap Inc.); Zeng Huang (Snap Inc.)*; Kyle B Olszewski (Snap Inc.); Menglei Chai (Snap Inc.); YUN FU (Northeastern University); Sergey Tulyakov (Snap Inc) | N/A | N/A |
| Towards Open Set Video Anomaly Detection | Yuansheng Zhu (Rochester Institute of Technology)*; Wentao Bao (Rochester Institute of Technology); Qi Yu (Rochester Institute of Technology) | N/A | N/A |
| Object-Compositional Neural Implicit Surfaces | Qianyi Wu (Monash University)*; Xian Liu (The Chinese University of Hong Kong); Yuedong Chen (Monash University); Kejie Li (University of Oxford); Chuanxia Zheng (Monash University); Jianfei Cai (Monash University); Jianmin Zheng (Nanyang Technological University) | N/A | N/A |
| Sem2NeRF: Converting Single-View Semantic Masks to Neural Radiance Fields | Yuedong Chen (Monash University)*; Qianyi Wu (Monash University); Chuanxia Zheng (Monash University); Tat-Jen Cham (Nanyang Technological University); Jianfei Cai (Monash University) | N/A | N/A |
| WaveGAN: Frequency-aware GAN for High-Fidelity Few-shot Image Generation | Mengping Yang (East China University of Science and Technology)*; Zhe Wang (East China University of Science and Technology); Ziqiu Chi (East China University of Science and Technology); Wenyi Feng (East China University of Science and Technology) | N/A | N/A |
| Class-Agnostic Object Counting Robust to Intraclass Diversity | Shenjian Gong (Nanjing University of Science and Technology)*; Shanshan Zhang (Nanjing University of Science and Technology); Jian Yang (Nanjing University of Science and Technology); Dengxin Dai (MPI for Informatics ); Bernt Schiele (MPI Informatics) | N/A | N/A |
| TM2T: Stochastic and Tokenized Modeling for the Reciprocal Generation of 3D Human Motions and Texts | Chuan Guo (University of Alberta)*; Xinxin Zuo (University of Alberta); Sen Wang (University of Alberta); Li Cheng (ECE dept., University of Alberta) | N/A | N/A |
| Self-Distillation for Robust LiDAR Semantic Segmentation in Autonomous Driving | Jiale Li (Zhejiang University); Hang Dai (Mohamed bin Zayed University of Artificial Intelligence)*; Yong Ding (Zhejiang University) | N/A | N/A |
| Semi-Supervised Monocular 3D Object Detection by Multi-View Consistency | Qing Lian (Hong Kong University of Science and Technology )*; Yanbo XU (The Hong Kong University of Science and Technology); Weilong Yao (Shanghai Xiantu Intelligent Technology Co., Ltd.); Yingcong Chen (Hong Kong University of Science and Technology); Tong Zhang (Hong Kong University of Science and Technology) | N/A | N/A |
| Lidar Point Cloud Guided Monocular 3D Object Detection | Liang Peng (ZJU)*; Fei Liu (Zhejiang University); Zhengxu Yu (Zhejiang University); Senbo Yan (Zhejiang University); Dan Deng (FABU); Zheng Yang (FABU); Haifeng Liu (ZJU); Deng Cai (ZJU) | N/A | N/A |
| Structural Causal 3D Reconstruction | Weiyang Liu (University of Cambridge)*; Zhen Liu (Mila, University of Montreal); Liam Paull (Université de Montréal); Adrian Weller (University of Cambridge); Bernhard Schölkopf (MPI for Intelligent Systems, Tübingen) | N/A | N/A |
| KD-MVS: Knowledge Distillation Based Self-supervised Learning for Multi-view Stereo | Yikang Ding (Tsinghua University)*; Qingtian Zhu (Peking University); Xiangyue Liu (Beihang University); Wentao Yuan (Peking University); Haotian Zhang (Megvii); Chi Zhang (Megvii Inc.) | N/A | N/A |
| When Counting Meets HMER: Counting-Aware Network for Handwritten Mathematical Expression Recognition | Bohan Li (Huazhong University of Science and Technology)*; Ye Yuan (Tomorrow Advancing Life); Dingkang Liang (Huazhong University of Science and Technology); Xiao Liu (Tencent); zhilong ji (Tomorrow Advancing Life); Jinfeng Bai (TAL); Wenyu Liu (Huazhong University of Science and Technology); Xiang Bai (Huazhong University of Science and Technology) | N/A | N/A |
| Shape Matters: Deformable Patch Attack | Zhaoyu Chen (Fudan University); Bo Li (Nanjing University)*; Shuang Wu (Tencent); Jianghe Xu (Tencent Youtu Lab); Shouhong Ding (Tencent); Wenqiang Zhang (Fudan University) | N/A | N/A |
| PTSEFormer: Progressive Temporal-Spatial Enhanced TransFormer Towards Video Object Detection | Han Wang (Shanghai Jiao Tong University)*; Jun Tang (hikvision); Xiaodong Liu (Hikvision); Shanyan Guan (Shanghai Jiao Tong University); Rong Xie (Shanghai Jiao Tong University); Li Song (Shanghai Jiao Tong University) | N/A | N/A |
| BEVFormer: Learning Bird-Eye-View Representations from Multi-View Images via Spatiotemporal Transformer | Zhiqi Li (Nanjing University); Wenhai Wang (Nanjing University); Hongyang Li (SenseTime); Enze Xie (The University of Hong Kong); Chonghao Sima (Purdue University); Tong Lu (Nanjing University); Yu Qiao (Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences); Jifeng Dai (SenseTime)* | N/A | N/A |
| Detecting Tampered Scene Text in the Wild | YuXin Wang (University of Science and Technology of China)*; Hongtao Xie (University of Science and Technology of China); Mengting Xing (University of Science and Technology of China); Jing Wang (Huawei Cloud & AI); Shenggao Zhu (Huawei); Yongdong Zhang (University of Science and Technology of China) | N/A | N/A |
| Projective Parallel Single-pixel Imaging to Overcome Global Illumination in 3D Structure Light Scanning | Yuxi Li (Beihang University)*; Huijie Zhao (Beihang University); Hongzhi Jiang (Beihang University); Xudong Li (Beihang University) | N/A | N/A |
| CelebV-HQ: A Large-Scale Video Facial Attributes Dataset | Hao Zhu (SenseTime Research)*; Wayne Wu (SenseTime Research); Wentao Zhu (Peking University); Liming Jiang (Nanyang Technological University); Siwei Tang (Sensetime research); Li Zhang (Sensetime); Ziwei Liu (Nanyang Technological University); Chen Change Loy (Nanyang Technological University) | N/A | N/A |
| Open-world Semantic Segmentation for LIDAR Point Clouds | Jun CEN (The Hong Kong University of Science and Technology)*; Peng YUN (Hong Kong University of Science and Technology); Shiwei Zhang (DAMO Academy, Alibaba Group); Junhao CAI (HKUST); Di LUAN (Hong Kong University of Science and Technology); Mingqian Tang (Alibaba Group); Michael Yu Wang (HKUST); Ming Liu (HKUST) | N/A | N/A |
| Burn After Reading: Online Adaptation for Cross-domain Streaming Data | Luyu Yang (University of Maryland, College Park)*; Mingfei Gao (Apple); Zeyuan Chen (Salesforce Research); Ran Xu (Salesforce Research); Abhinav Shrivastava (University of Maryland); Chetan Ramaiah (Salesforce Research) | N/A | N/A |
| CLOSE: Curriculum Learning On the Sharing Extent Towards Better One-shot NAS | Zixuan Zhou (Tsinghua University)*; Xuefei Ning (Tsinghua University); Yi Cai (Tsinghua University); Jiashu Han (None); Yiping Deng (Huawei); Yuhan Dong (Tsinghua University); Huazhong Yang (Tsinghua University); Yu Wang (Tsinghua University) | N/A | N/A |
| RigNet: Repetitive Image Guided Network for Depth Completion | Zhiqiang Yan (Nanjing University of Science and Technology)*; Kun Wang (Nanjing University of Science and Technology); Xiang Li (Nanjing University of Science and Technology); Zhenyu Zhang (Tencent); Jun Li (Nanjing University of Science and Technology); Jian Yang (Nanjing University of Science and Technology) | N/A | N/A |
| Streamable Neural Fields | Junwoo Cho (Sungkyunkwan University)*; Seungtae Nam (Sungkyunkwan University); Daniel Rho (Sungkyunkwan University); Jong Hwan Ko (Sungkyunkwan University); Eunbyung Park (Sungkyunkwan University) | N/A | N/A |
| 2DPASS: 2D Priors Assisted Semantic Segmentation on LiDAR Point Clouds | Xu Yan (The Chinese University of Hong Kong, Shenzhen); Jiantao Gao (Shanghai University); Chaoda Zheng (The Chinese University of Hong Kong, Shen Zhen); chao zheng (Tencent); Ruimao Zhang (The Chinese University of Hong Kong, Shenzhen); Shuguang Cui (The Chinese University of Hong Kong, Shenzhen ); Zhen Li (The Chinese University of Hong Kong, Shenzhen)* | N/A | N/A |
| Where to Focus: Investigating Hierarchical Attention Relationship for Fine-Grained Visual Classification | Yang Liu (Beihang University); Lei Zhou (Beihang University)*; Pengcheng Zhang (Beihang University); Xiao Bai (Beihang University); Lin Gu (RIKEN,AIP / The University of Tokyo); Xiaohan Yu (Griffith University); Jun Zhou (Griffith University); Hancock Edwin (University of York, UK) | N/A | N/A |
| Mind the Gap in Distilling StyleGANs | Guodong Xu (The Chinese University of Hong Kong)*; Yuenan HOU (Shanghai AI Lab); Ziwei Liu (Nanyang Technological University); Chen Change Loy (Nanyang Technological University) | N/A | N/A |
| End-to-End Active Speaker Detection | Juan C Leon (KAUST)*; Moritz Cordes (Leuphana University of Lüneburg); Chen Zhao (KAUST); Bernard Ghanem (KAUST) | N/A | N/A |
| Joint-Modal Label Denoising for Weakly-Supervised Audio-Visual Video Parsing | Haoyue Cheng (Nanjing University); Zhaoyang Liu (SenseTime Research); Hang Zhou (The Chinese University of Hong Kong); Chen Qian (SenseTime); Wayne Wu (SenseTime Research); Limin Wang (Nanjing University)* | N/A | N/A |
| Learn-to-Decompose: Cascaded Decomposition Network for Cross-Domain Few-Shot Facial Expression Recognition | Xinyi Zou (Xiamen University); Yan Yan (Xiamen University)*; Jing-Hao Xue (University College London); Si Chen (Xiamen University of Technology); Hanzi Wang (Xiamen University) | N/A | N/A |
| Learning with Recoverable Forgetting | Jingwen Ye (National University of Singapore)*; Fu Yifang (National University of Singapore); Jie Song (Zhejiang University); Xingyi Yang (National University of Singapore); Songhua Liu (National University of Singapore); Xin Jin (University of Science and Technology of China); Mingli Song (Zhejiang University); Xinchao Wang (National University of Singapore) | N/A | N/A |
| Masked Autoencoders for Point Cloud Self-supervised Learning | Yatian Pang (National University of Singapore); Wenxiao Wang (State Key Lab of CAD&CG, Zhejiang University); Francis EH Tay (National University of Singapore); Wei Liu (Tencent); Yonghong Tian (Peking University); Li Yuan (Peking University)* | N/A | N/A |
| RamGAN: Region Attentive Morphing GAN for Region-Level Makeup Transfer | Jianfeng Xiang (ShenZhen University)*; Junliang Chen (Shenzhen University); Wenshuang Liu (Shenzhen University); Xianxu Hou (Shenzhen University); Linlin Shen (Shenzhen University) | N/A | N/A |
| Efficient One Pass Self-distillation with Zipf’s Label Smoothing | Jiajun Liang (Megvii)*; Linze Li (MEGVII Technology); Zhaodong Bing (Megvii Technology); Borui Zhao (Megvii Technology); Yao Tang (Peking University); Bo Lin (MEGVII Technology); Haoqiang Fan (Megvii Inc(face++)) | N/A | N/A |
| DaViT: Dual Attention Vision Transformers | Mingyu Ding (The University of Hong Kong)*; Bin Xiao (Microsoft); Noel C Codella (Microsoft); Ping Luo (The University of Hong Kong); Jingdong Wang (Baidu); Lu Yuan (Microsoft) | N/A | N/A |
| OneFace: One Threshold for All | Jiaheng Liu (Beihang University); zhipeng yu (University of Chinese Academy of Sciences); Haoyu Qin (SenseTime); Yichao Wu (Sensetime Group Limited); Ding Liang (Sensetime Group Limited); Gangming Zhao (The University of Hong Kong); Ke Xu (Beihang University)* | N/A | N/A |
| Semantic-Sparse Colorization Network for Deep Exemplar-based Colorization | Yunpeng Bai (Tsinghua University )*; Chao Dong (SIAT); Zenghao Chai (Tsinghua University); Andong Wang (Tsinghua University); Zhengzhuo Xu (Tsinghua University); Chun Yuan (Graduate school at ShenZhen,Tsinghua university) | N/A | N/A |
| Vibration-based Uncertainty Estimation for Learning from Limited Supervision | Hengtong Hu (Hefei University of Technology)*; Lingxi Xie (Huawei Inc.); Xinyue Huo (University of Science and Technology of China); Richang Hong (HeFei University of Technology); Qi Tian (Huawei Cloud & AI) | N/A | N/A |
| SOS! Self-supervised Learning Over Sets Of Handled Objects In Egocentric Action Recognition | Victor A Escorcia (Samsung AI Center)*; Ricardo Guerrero (Samsung AI Center Cambridge); Xiatian Zhu (Samsung AI Centre); Brais Martinez (Samsung AI Center) | N/A | N/A |
| FADE: Fusing the Assets of Decoder and Encoder for Task-Agnostic Upsampling | Hao Lu (Huazhong University of Science and Technology); Wenze Liu (Huazhong university of science and technology); Hongtao Fu (Huazhong university of Science and Technology); Zhiguo Cao (Huazhong Univ. of Sci.&Tech.)* | N/A | N/A |
| VTC: Improving Video-Text Retrieval with User Comments | Laura Hanu (Unitary)*; James Thewlis (Unitary); Yuki M Asano (University of Amsterdam); Christian Rupprecht (University of Oxford) | N/A | N/A |
| Less than Few: Self-Shot Video Instance Segmentation | Pengwan Yang (University of Amsterdam)*; Yuki M Asano (University of Amsterdam); Pascal Mettes (University of Amsterdam); Cees Snoek (University of Amsterdam) | N/A | N/A |
| End-to-End Visual Editing with a Generatively Pre-Trained Artist | Andrew Brown (University of Oxford)*; Cheng-Yang Fu (Facebook.com); Omkar M Parkhi (Facebook); Tamara Berg (Facebook AI Research); Andrea Vedaldi (University of Oxford / Facebook AI Research) | N/A | N/A |
| COUCH: Towards Controllable Human-chair Interactions | Xiaohan Zhang (University of Tübingen, MPI Informatics); Bharat Lal Bhatnagar (University of Tübingen, MPI informatik); Sebastian Starke (University of Edinburgh); Vladimir Guzov (University of Tuebingen); Gerard Pons-Moll (University of Tübingen)* | N/A | N/A |
| MovieCuts: A New Dataset and Benchmark for Cut Type Recognition | Alejandro Pardo (KAUST)*; Fabian Caba (Adobe Research); Juan C Leon (KAUST); Ali K Thabet (Facebook); Bernard Ghanem (KAUST) | N/A | N/A |
| High-fidelity GAN Inversion with Padding Space | Qingyan Bai (Tsinghua University)*; Yinghao Xu (Chinese University of Hong Kong); Jiapeng Zhu (HKUST); Weihao Xia (University College London); Yujiu Yang (Tsinghua University); Yujun Shen (Dept. of IE, CUHK) | N/A | N/A |
| LiDAL: Inter-frame Uncertainty Based Active Learning for 3D LiDAR Semantic Segmentation | ZEYU HU (Hong Kong University of Science and Technology)*; Xuyang Bai (HKUST); Runze Zhang (Tencent); Xin Wang (Tencent); Guangyuan Sun (TENCENT); Hongbo Fu (City University of Hong Kong); Chiew-Lan Tai (Hong Kong University of Science & Technology) | N/A | N/A |
| Optimal Boxes: Boosting End-to-End Scene Text Recognition by Adjusting Annotated Bounding Boxes via Reinforcement Learning | Jingqun Tang (Ant Group)*; wenming qian (Huazhong University of Science and Technology); Luchuan Song (University of Science and Technology of China); Xiena Dong (Hangzhou Dianzi University); lan li (Wuhan University); Xiang Bai (Huazhong University of Science and Technology) | N/A | N/A |
| Concurrent Subsidiary Supervision for Unsupervised Source-Free Domain Adaptation | Jogendra Nath Kundu (Indian Institute of Science)*; Suvaansh Bhambri (Indian Institute of Science); Akshay R Kulkarni (Indian Institute of Science); Hiran Sarkar (Indian Institute of Science); Varun Jampani (Google); Venkatesh Babu RADHAKRISHNAN (Indian Institute of Science) | N/A | N/A |
| Designing One Unified Framework for High-Fidelity Face Reenactment and Swapping | Chao Xu (Zhejiang University)*; Jiangning Zhang (Zhejiang University); Yue Han (Zhejiang University); Guanzhong Tian (Ningbo Research Institute, Zhejiang University); xianfang zeng (Zhejiang University); Ying Tai (Tencent YouTu); Yabiao Wang (Tencent); Chengjie Wang (Tencent; Shanghai Jiao Tong University); Yong Liu (Zhejiang University) | N/A | N/A |
| Category-Level 6D Object Pose and Size Estimation using Self-Supervised Deep Prior Deformation Networks | Jiehong Lin (South China University of Technology)*; Zewei Wei (South China University of Technology); Changxing Ding (South China University of Technology); Kui Jia (South China University of Technology) | N/A | N/A |
| Intrinsic Neural Fields: Learning Functions on Manifolds | Lukas Koestler (Technical University of Munich)*; Daniel Grittner (Technische Universität München); Michael Moeller (University of Siegen); Daniel Cremers (TU Munich); Zorah Laehner (University of Siegen) | N/A | N/A |
| LaMAR: Benchmarking Localization and Mapping for Augmented Reality | Paul-Edouard Sarlin (ETH Zurich); Mihai Dusmanu (ETH Zurich)*; Johannes L Schönberger (Microsoft); Pablo Speciale (Microsoft); Lukas Gruber (Microsoft); Viktor Larsson (Lund University); Ondrej Miksik (Microsoft); Marc Pollefeys (ETH Zurich / Microsoft) | N/A | N/A |
| 3D Compositional Zero-shot Learning with DeCompositional Consensus | Muhammad Ferjad Naeem (ETH Zürich)*; Evin Pınar Örnek (TU Munich); Yongqin Xian (ETH Zurich); Luc Van Gool (ETH Zurich); Federico Tombari (Google, TU Munich) | N/A | N/A |
| Video Mask Transfiner for High-Quality Video Instance Segmentation | Lei Ke (HKUST)*; Henghui Ding (ETH Zurich); Martin Danelljan (ETH Zurich); Yu-Wing Tai (Kuaishou Technology / HKUST); Chi-Keung Tang (Hong Kong University of Science and Technology); Fisher Yu (ETH Zurich) | N/A | N/A |
| FashionViL: Fashion-Focused Vision-and-Language Representation Learning | Xiao Han (University of Surrey)*; Licheng Yu (Facebook); Xiatian Zhu (University of Surrey); Li Zhang (Fudan University); Yi-Zhe Song (University of Surrey); Tao Xiang (University of Surrey) | N/A | N/A |
| Adaptive Face Forgery Detection in Cross Domain | Luchuan Song (University of Science and Technology of China)*; Zheng Fang (Beihang University); Xiaodan Li (Alibaba Group); Xiaoyi Dong (University of Science and Technology of China); Zhenchao Jin (University of Science and Technology of China); Yuefeng Chen (Alibaba Group); Siwei Lyu (University at Buffalo) | N/A | N/A |
| LiP-Flow: Learning Inference-time Priors for Codec Avatars via Normalizing Flows in Latent Space | Emre Aksan (ETH Zurich)*; Shugao Ma (Facebook); Akin Caliskan (Center for Vision Speech and Signal Processing – University of Surrey); Stanislav Pidhorskyi (Facebook Inc.); Alexander Richard (Facebook Reality Labs); Shih-En Wei (Facebook); Jason Saragih (Facebook); Otmar Hilliges (ETH Zurich) | N/A | N/A |
| Dense Teacher: Dense Pseudo-Labels for Semi-supervised Object Detection | Hongyu Zhou (Megvii)*; Songtao Liu (MEGVII); Zeming Li (Megvii(Face++) Inc); Jian Sun (Megvii Technology); Weixin Mao (waseda university); Zheng Ge (MEGVII Technology); haiyan yu (Harbin Institute of Technology) | N/A | N/A |
| Metric Learning based Interactive Modulation for Real-World Super-Resolution | Chong Mou (Peking University Shenzhen Graduate School)*; Yanze Wu (Tencent); Xintao Wang (Tencent); Chao Dong (SIAT); Jian Zhang (Peking University Shenzhen Graduate School); Ying Shan (Tencent) | N/A | N/A |
| Optimal Transport for Label-Efficient Visible-Infrared Person Re-Identification | Jiangming Wang (East China Normal University); Zhizhong Zhang (East China Normal University); Mingang Chen (Shanghai Development Center of Computer Software Technology); yi zhang (zhejianglab); Cong Wang (Huawei Technologies); Bin Sheng (Shanghai Jiao Tong University); Yanyun Qu (XMU); Yuan Xie (East China Normal University)* | N/A | N/A |
| Proposal-Free Temporal Action Detection via Global Segmentation Mask Learning | Sauradip Nag (University of Surrey)*; Xiatian Zhu (University of Surrey); Yi-Zhe Song (University of Surrey); Tao Xiang (University of Surrey) | N/A | N/A |
| Sobolev Training for Implicit Neural Representations with Approximated Image Derivatives | Wentao Yuan (Peking University)*; Qingtian Zhu (Peking University); Xiangyue Liu (Beihang University); Yikang Ding (Tsinghua University); Haotian Zhang (Megvii); Chi Zhang (Megvii Inc.) | N/A | N/A |
| Unsupervised Night Image Enhancement: When Layer Decomposition Meets Light-Effects Suppression | Yeying Jin (National University of Singapore)*; Wenhan Yang (NTU); Robby T. Tan (National University of Singapore) | N/A | N/A |
| Point-to-Box Network for Accurate Object Detection via Single Point Supervision | Pengfei Chen (University of Chinese Academy of Sciences); Xuehui Yu (University of Chinese Academy of Sciences); Xumeng Han (University of Chinese Academy of Sciences); Najmul Hassan (University of Oregon); Kai Wang (U of Oregon); Jiachen Li (UIUC); Jian Zhao (Institute of North Electronic Equipment); Humphrey Shi (U of Oregon \| UIUC \| PAIR); Zhenjun Han (University of Chinese Academy of Sciences)*; Qixiang Ye (University of Chinese Academy of Sciences, China) | N/A | N/A |
| Dynamic Dual Trainable Bounds for Ultra-low Precision Super-Resolution Networks | Yunshan Zhong (xiamen university)*; Mingbao Lin (Xiamen University, China); xunchao li (Xiamen University); Ke Li (Tencent); Yunhang Shen (Xiamen University); Fei Chao (Xiamen University); Yongjian Wu (Tencent Technology (Shanghai) Co.,Ltd); Rongrong Ji (Xiamen University, China) | N/A | N/A |
| Locality Guidance for Improving Vision Transformers on Tiny Datasets | Kehan Li (Peking University); Runyi Yu (Peking University); Zhennan Wang (Peng Cheng Laboratory); Li Yuan (Peking University); Guoli Song (Peng Cheng Laboratory); Jie Chen (Peking University)* | N/A | N/A |
| Weakly Supervised Object Localization through Inter-class Feature Similarity and Intra-class Appearance Consistency | Jun Wei (The Chinese University of Hong Kong, Shenzhen); Sheng Wang (Shanghai Zelixir Biotech); S. Kevin Zhou (USTC); Shuguang Cui (The Chinese University of Hong Kong, Shenzhen ); Zhen Li (The Chinese University of Hong Kong, Shenzhen)* | N/A | N/A |
| Semi-Supervised Temporal Action Detection with Proposal-Free Masking | Sauradip Nag (University of Surrey)*; Xiatian Zhu (University of Surrey); Yi-Zhe Song (University of Surrey); Tao Xiang (University of Surrey) | N/A | N/A |
| Neighborhood Collective Estimation for Noisy Label Identification and Correction | Jichang Li (The University of Hong Kong)*; Guanbin Li (Sun Yat-sen University); Feng Liu (Deepwise AI Lab); Yizhou Yu (The University of Hong Kong) | N/A | N/A |
| Zero-Shot Temporal Action Detection via Vision-Language Prompting | Sauradip Nag (University of Surrey)*; Xiatian Zhu (University of Surrey); Yi-Zhe Song (University of Surrey); Tao Xiang (University of Surrey) | N/A | N/A |
| Dual-Stream Knowledge-Preserving Hashing for Unsupervised Video Retrieval | Pandeng Li (University of Science and Technology of China)*; Hongtao Xie (University of Science and Technology of China); Jiannan Ge (University of Science and Technology of China); Lei Zhang (Kuaishou); Shaobo Min (tencent); Yongdong Zhang (University of Science and Technology of China) | N/A | N/A |
| Discover and Mitigate Unknown Biases with Debiasing Alternate Networks | Zhiheng Li (University of Rochester)*; Anthony Hoogs (Kitware); Chenliang Xu (University of Rochester) | N/A | N/A |
| Hierarchical Memory Learning for Fine-Grained Scene Graph Generation | Youming Deng (Wuhan University); Yansheng Li (Wuhan University)*; Yongjun Zhang (Wuhan University); Xiang Xiang (Huazhong University of Science and Technology); Jian Wang (Ant Group); Jingdong Chen (Ant Group); Jiayi Ma (Wuhan University) | N/A | N/A |
| Improving Test-Time Adaptation via Shift-agnostic Weight Regularization and Nearest Source Prototypes | Sungha Choi (Qualcomm AI Research)*; Seunghan Yang (Qualcomm AI Research); Seokeon Choi (Qualcomm AI research); Sungrack Yun (Qualcomm AI Research) | N/A | N/A |
| Automatic dense annotation of large-vocabulary sign language videos | Liliane Momeni (University of Oxford)*; Hannah Bull (LIMSI (CNRS)); Prajwal K R (VGG, Oxford); Samuel Albanie (University of Cambridge); Gul Varol (Ecole des Ponts ParisTech); Andrew Zisserman (University of Oxford) | N/A | N/A |
| Few-shot Class-incremental Learning via Entropy-regularized Data-free Replay | Huan Liu (McMaster University)*; Li Gu (Huawei Canada); Zhixiang Chi (Huawei Noah’s Ark Laboratory); Yuanhao Yu (Huawei Noah’s Ark Laboratory); Yang Wang (Concordia University); Jun Chen (McMaster University); Jin Tang ( Huawei Noah’s Ark Laboratory) | N/A | N/A |
| Learning Instance-Specific Adaptation for Cross-Domain Segmentation | Yuliang Zou (Virginia Tech)*; Zizhao Zhang (Google); Chun-Liang Li (Google); Han Zhang (Google); Tomas Pfister (Google); Jia-Bin Huang (Facebook ) | N/A | N/A |
| SALVe: Semantic Alignment Verification for Floorplan Reconstruction from Sparse Panoramas | John W Lambert (Georgia Institute of Technology)*; Yuguang Li (Zillow Group); Ivaylo Boyadzhiev (Zillow Group); Lambert Wixson (Zillow Group); Manjunath Narayana (Zillow group); Will A Hutchcroft (Zillow Group); James Hays (Georgia Institute of Technology, USA); Frank Dellaert (Georgia Tech); Sing Bing Kang (Zillow Group) | N/A | N/A |
| Active Learning Strategies for Weakly-Supervised Object Detection | Huy V. Vo (Ecole Normale Supérieure – INRIA – Valeo.ai)*; Oriane Siméoni (valeo.ai); Spyros Gidaris (valeo.ai); Andrei Bursuc (valeo.ai); Patrick Pérez (Valeo.ai); Jean Ponce (Inria) | N/A | N/A |
| 3D Human Pose Estimation Using Möbius Graph Convolutional Networks | Niloofar Azizi (ICG department of TU Graz)*; Horst Possegger (Graz University of Technology); Emanuele Rodola (Sapienza University of Rome); Horst Bischof (Graz University of Technology) | N/A | N/A |
| Real-time Online Video Detection with Temporal Smoothing Transformers | Yue Zhao (University of Texas at Austin)*; Philipp Kraehenbuehl (UT Austin) | N/A | N/A |
| 3D-FM GAN: Towards 3D-Controllable Face Manipulation | Yuchen Liu (Princeton University)*; Zhixin Shu (Adobe Research); Yijun Li (Adobe Research); Zhe Lin (Adobe Research); Richard Zhang (Adobe); Sun-Yuan Kung (Princeton University) | N/A | N/A |
| SinNeRF: Training Neural Radiance Field on Complex Scene from a Single Image | Dejia Xu (University of Texas at Austin)*; Yifan Jiang (University of Texas at Austin); Peihao Wang (University of Texas at Austin); Zhiwen Fan (University of Texas at Austin); Humphrey Shi (U of Oregon \| UIUC \| PAIR); Zhangyang Wang (University of Texas at Austin) | N/A | N/A |
| Entropy-driven Sampling and Training Scheme for Conditional Diffusion Generation | Guangcong Zheng (Zhejiang University); Shengming Li (Zhejiang University); Hui Wang (Zhejiang University); Taiping Yao (Tencent YouTu); Yang Chen (Tencent); Shouhong Ding (Tencent); Xi Li (Zhejiang University)* | N/A | N/A |
| Identity-aware Hand Mesh Estimation and Personalization from RGB Images | Deying Kong (university of california, irvine)*; Linguang Zhang (Facebook Reality Labs); Liangjian Chen (Reality Labs); Haoyu Ma (University of California, Irvine); Xiangyi Yan (University of California, Irvine); shanlin sun (University of California, Irvine); Xingwei Liu (University of California Irvine); Kun Han (University of California Irvine); Xiaohui Xie (University of California, Irvine) | N/A | N/A |
| TALLFormer: Temporal Action Localization with a Long-memory Transformer | Feng Cheng (University of North Carolina ch); Gedas Bertasius (UNC Chapel Hill)* | N/A | N/A |
| Unsupervised and Semi-supervised Bias Benchmarking in Face Recognition | Siqi Deng (Amazon)*; Alexandra Chouldechova (CMU); Yongxin Wang (Amazon); Wei Xia (Amazon); Pietro Perona (California Institute of Technology) | N/A | N/A |
| Domain Adaptive Hand Keypoint and Pixel Localization in the Wild | Takehiko Ohkawa (The University of Tokyo)*; Yu-Jhe Li (Carnegie Mellon University); Qichen Fu (Carnegie Mellon University); Ryosuke Furuta (The University of Tokyo); Kris Kitani (Carnegie Mellon University); Yoichi Sato (University of Tokyo) | N/A | N/A |
| Skeleton-free Pose Transfer for Stylized 3D Characters | Zhouyingcheng Liao (Saarland University)*; Jimei Yang (Adobe); Jun Saito (Adobe); Gerard Pons-Moll (University of Tübingen); Yang Zhou (Adobe Research) | N/A | N/A |
| Differentiable Raycasting for Self-supervised Occupancy Forecasting | Tarasha Khurana (Carnegie Mellon University)*; Peiyun Hu (Carnegie Mellon University); Achal D Dave (Amazon); Jason P Ziglar (Argo AI); David Held (); Deva Ramanan (Carnegie Mellon University) | N/A | N/A |
| InAction: Interpretable Action Decision Making for Autonomous Driving | Taotao Jing (Tulane University)*; Haifeng Xia (Tulane University); Renran Tian (Indiana University-Purdue University Indianapolis); Haoran Ding (IUPUI); Xiao Luo (IUPUI); Joshua E Domeyer (Toyota Motor North America); Rini Sherony (Toyota CSRC); Zhengming Ding (Tulane University) | N/A | N/A |
| CramNet: Camera-Radar Fusion with Ray-Constrained Cross-Attention for Robust 3D Object Detection | Jyh-Jing Hwang (Waymo)*; Henrik Kretzschmar (Waymo); Joshua M Manela (Waymo); Sean Rafferty (Waymo); Nicholas Armstrong-Crews (Waymo); Tiffany Chen (Waymo); Dragomir Anguelov (Waymo) | N/A | N/A |
| CycDA: Unsupervised Cycle Domain Adaptation to Learn from Image to Video | Wei Lin (Graz University of Technology)*; Anna Kukleva (MPII); Kunyang Sun (Southeast University); Horst Possegger (Graz University of Technology); Hilde Kuehne (University of Frankfurt); Horst Bischof (Graz University of Technology) | N/A | N/A |
| Latent Discriminant deterministic Uncertainty | Gianni Franchi (ENSTA Paris)*; Xuanlong Yu (ENSTA Paris); Andrei Bursuc (valeo.ai); Emanuel Aldea (Paris-Saclay University); Severine Dubuisson (Aix-Marseille University); David Filliat (ENSTA Paris) | N/A | N/A |
| Auto-FedRL: Federated Hyperparameter Optimization for Multi-institutional Medical Image Segmentation | Pengfei Guo (Johns Hopkins University)*; Dong Yang (NVIDIA Corporation); Ali Hatamizadeh (NVIDIA Corporation); An Xu (University of Pittsburgh); Ziyue Xu (NVIDIA); Wenqi Li (NVIDIA); Can Zhao (Nvidia); Daguang Xu (NVIDIA Corporation); Stephanie Anne Harmon (National Cancer Institute); Evrim Turkbey (NIH); Baris Turkbey (National Cancer Institute); Bradford J Wood (National Institutes of Health); Francesca Patella (ASST Santi Paolo e Carlo); Elvira Stellato (University of Milan); Gianpaolo Carrafiello (University of Milan); Vishal Patel (Johns Hopkins University); Holger R Roth (NVIDIA) | N/A | N/A |
| Image-based CLIP-Guided Essence Transfer | Hila Chefer (Tel Aviv University)*; Sagie Benaim (University of Copenhagen); Roni Paiss (Tel Aviv University, Google); Lior Wolf (Tel Aviv University, Israel) | N/A | N/A |
| Prune Your Model Before Distill It | JinHyuk Park (Hongik University); Albert No (Hongik University)* | N/A | N/A |
| S2N: Suppression-Strengthen Network for Event-based Recognition under Variant Illuminations | zengyu wan (University of Science and Technology of China)*; Yang Wang (University of Science and Technology of China); Ganchao Tan (University of Science and Technology of China); Yang Cao (University of Science and Technology of China); Zheng-Jun Zha (University of Science and Technology of China) | N/A | N/A |
| MILES: Visual BERT Pre-training with Injected Language Semantics for Video-text Retrieval | Yuying Ge (The University of Hong Kong)*; Yixiao Ge (Tencent); Xihui Liu (UC Berkeley); Jinpeng Wang (National University of Singapore); Jianping Wu (Tsinghua University); Ying Shan (Tencent); Xiaohu Qie (Tencent); Ping Luo (The University of Hong Kong) | N/A | N/A |
| PASS: Part-Aware Self-Supervised Pre-Training for Person Re-Identification | Kuan Zhu (Institute of Automation, Chinese Academy of Sciences)*; Haiyun Guo (CASIA); Tianyi Yan (Institute of Automation, Chinese Academy of Sciences; School of Artificial Intelligence, University of Chinese Academy of Sciences); Yousong Zhu (Institute of Automation, Chinese Academy of Sciences); Jinqiao Wang (Institute of Automation, Chinese Academy of Sciences); Ming Tang (Institute of Automation, Chinese Academy of Sciences) | N/A | N/A |
| RegionCL: Exploring Contrastive Region Pairs for Self-supervised Representation Learning | YUFEI XU (University of sydney)*; Qiming Zhang (The University of Sydney); Jing Zhang (The University of Sydney); Dacheng Tao (JD.com) | N/A | N/A |
| Towards Data-Efficient Detection Transformers | Wen Wang (University of Science and Technology of China)*; Jing Zhang (The University of Sydney); Yang Cao (University of Science and Technology of China); Yongliang Shen (Zhejiang University); Dacheng Tao (JD.com) | N/A | N/A |
| Label2Label: A Language Modeling Framework for Multi-Attribute Learning | Wanhua Li (Tsinghua University); Zhexuan Cao (Tsinghua University); Jianjiang Feng (Tsinghua University); Jie Zhou (Tsinghua University); Jiwen Lu (Tsinghua University)* | N/A | N/A |
| Anti-Retroactive Interference for Lifelong Learning | Runqi Wang (Beihang University); Yuxiang Bao (Beihang University); Baochang Zhang (Beihang University)*; Jianzhuang Liu (Huawei Noah’s Ark Lab); Wentao Zhu (Amazon); Guodong Guo (IDL, Baidu Research) | N/A | N/A |
| Emotion Recognition for Multiple Context Awareness | Dingkang Yang (Fudan University); shuai huang (Fudan university); Shunli Wang (Fudan University); Yang Liu (Fudan University); Peng Zhai (Fudan university); Liuzhen Su (Fudan University); Mingcheng Li (Fudan University); Lihua Zhang (Fudan University)* | N/A | N/A |
| Box-supervised Instance Segmentation with Level Set Evolution | Wentong Li (Zhejiang University); Wenyu Liu (Zhejiang University); Jianke Zhu (Zhejiang University)*; Miaomiao Cui (Alibaba-inc); Xian-Sheng Hua (Damo Academy, Alibaba Group); Lei Zhang (Hong Kong Polytechnic University, Hong Kong, China) | N/A | N/A |
| mc-BEiT: Multi-choice Discretization for Image BERT Pre-training | Xiaotong Li (Peking University)*; Yixiao Ge (Tencent); Kun Yi (Nanjing University); Zixuan Hu (Peking University); Ying Shan (Tencent); Lingyu Duan (Peking University) | N/A | N/A |
| Adaptive Cross-Domain Learning for Generalizable Person Re-Identification | Pengyi Zhang (Zhejiang University)*; Huanzhang Dou (Zhejiang University); Yunlong Yu (Zhejiang University); Xi Li (Zhejiang University) | N/A | N/A |
| MetaGait: Learning to Learn an Omni Sample Adaptive Representation for Gait Recognition | Huanzhang Dou (Zhejiang University)*; Pengyi Zhang (Zhejiang University); Wei Su (Zhejiang University); Yunlong Yu (Zhejiang University); Xi Li (Zhejiang University) | N/A | N/A |
| Bootstrapped Masked Autoencoders for Vision BERT Pretraining | Xiaoyi Dong (University of Science and Technology of China)*; Jianmin Bao (Microsoft Research Asia); Ting Zhang (MSRA); Dongdong Chen (Microsoft Cloud AI); Weiming Zhang (University of Science and Technology of China); Lu Yuan (Microsoft); Dong Chen (Microsoft Research Asia); Fang Wen (Microsoft Research Asia ); Nenghai Yu (University of Science and Technology of China) | N/A | N/A |
| Masked Discrimination for Self-Supervised Learning on Point Clouds | Haotian Liu (University of Wisconsin-Madison)*; Mu Cai (University of Wisconsin-Madison); Yong Jae Lee (University of Wisconsin-Madison) | N/A | N/A |
| GEB+: A Benchmark for Generic Event Boundary Captioning, Grounding and Retrieval | Yuxuan Wang (National University of Singapore); Difei Gao (NUS); Licheng Yu (Facebook); Stan Weixian Lei (National University of Singapore); Matt Feiszli (Facebook Research); Mike Zheng Shou (National University of Singapore)* | N/A | N/A |
| FAST-VQA: Efficient End-to-end Video Quality Assessment with Fragment Sampling | Haoning Wu (Nanyang Technological University)*; Chaofeng Chen (Nanyang Technological University); Jingwen Hou (Nanyang Technological University); Liang Liao (Nanyang Technological University); Annan Wang (Nanyang Technological University); Wenxiu Sun (SenseTime Research and Tetras.AI); Qiong Yan (SenseTime Group Limited); Weisi Lin (Nanyang Technological University, Singapore) | N/A | N/A |
| Learning to train a point cloud reconstruction network without matching | Tianxin Huang (Zhejiang University)*; Xuemeng Yang (Zhejiang University); Jiangning Zhang (Zhejiang University); Jinhao Cui (Zhejiang University); Hao Zou (Zhejiang University); Jun Chen (Zhejiang University); Xiangrui Zhao (Zhejiang University); Yong Liu (Zhejiang University) | N/A | N/A |
| Long-Tailed Class Incremental Learning | Xialei Liu (Nankai University)*; Yusong Hu (Nankai University); Xu-Sheng Cao (Nankai University); Andy Bagdanov (University of Florence, Italy); Ke Li (Tencent); Ming-Ming Cheng (Nankai University) | N/A | N/A |
| CODA: A Real-World Road Corner Case Dataset for Object Detection in Autonomous Driving | Kaican Li (Huawei Noah’s Ark Lab)*; Kai Chen (HKUST); Haoyu Wang (Purdue University); Lanqing Hong (Huawei Noah’s Ark Lab); Chaoqiang Ye (Huawei); Jianhua Han (Huawei Noah’s Ark Lab); Yukuai Chen (Huawei Intelligent Automotive Solution BU); Wei Zhang (Noah’s Ark Lab, Huawei Technologies); Chunjing Xu (Huawei Noah’s Ark Lab); Dit-Yan Yeung (HKUST); Xiaodan Liang (Sun Yat-sen University); Zhenguo Li (Huawei Noah’s Ark Lab); Hang Xu (Huawei Noah’s Ark Lab) | N/A | N/A |
| CMT: Context-Matching-Guided Transformer for 3D Tracking in Point Clouds | Zhiyang Guo (University of Science and Technology of China)*; Yunyao Mao (University of Science and Technology of China); Wengang Zhou (University of Science and Technology of China); Min Wang (Institute of Artificial Intelligence, Hefei Comprehensive National Science Center); Houqiang Li (University of Science and Technology of China) | N/A | N/A |
| Motion Inspired Unsupervised Perception and Prediction in Autonomous Driving | Mahyar Najibi (Waymo LLC); Jingwei Ji (Waymo); Yin Zhou (Waymo)*; Charles R. Qi (Waymo); Xinchen Yan (Waymo); Scott Ettinger (Waymo); Dragomir Anguelov (Waymo) | N/A | N/A |
| Unitail: Detecting, Reading, and Matching in Retail Scene | Fangyi Chen (Carnegie Mellon University)*; Han Zhang (CMU); zaiwang li (pitt); Jiachen Dou (Carnegie Mellon University); Shentong Mo (Carnegie Mellon University); Hao Chen (Carnegie Mellon University); Yong-Xin Zhang (Tsinghua University); Uzair Ahmed (Carnegie Mellon University); Chenchen Zhu (Meta AI); Marios Savvides (Carnegie Mellon University) | N/A | N/A |
| DODA: Data-oriented Sim-to-Real Domain Adaptation for 3D Semantic Segmentation | Runyu Ding (The University of Hong Kong)*; Jihan Yang (The University of Hong Kong); Li Jiang (Max Planck Institute for Informatics); Xiaojuan Qi (The University of Hong Kong) | N/A | N/A |
| Learning to Drive by Watching YouTube Videos: Action-Conditioned Contrastive Policy Pretraining | Qihang Zhang (Chinese University of Hong Kong); Zhenghao Peng (Chinese University of Hong Kong); Bolei Zhou (UCLA)* | N/A | N/A |
| Multi-Curve Translator for High-Resolution Photorealistic Image Translation | Yuda Song (Zhejiang University); Hui Qian (Zhejiang University); Xin Du (Zhejiang University)* | N/A | N/A |
| Dynamic Metric Learning with Cross-Level Concept Distillation | Wenzhao Zheng (Tsinghua University)*; Yuanhui Huang (Tsinghua University); Borui Zhang (Tsinghua University); Jie Zhou (Tsinghua University); Jiwen Lu (Tsinghua University) | N/A | N/A |
| Deep Bayesian Video Frame Interpolation | Zhiyang Yu (Harbin Institute of Technology)*; Yu Zhang (Beihang University); Xujie Xiang (Beihang University); Dongqing Zou (SenseTime Research;Qing Yuan Research Institute, Shanghai Jiao Tong University); Xijun Chen (Harbin Institute of Technology); Jimmy Ren (SenseTime Research;Qing Yuan Research Institute, Shanghai Jiao Tong University) | N/A | N/A |
| PanoFormer: Panorama Transformer for Indoor 360° Depth Estimation | Zhijie Shen (Beijing Jiaotong University); Chunyu Lin (Beijing Jiaotong University)*; Kang Liao (Beijing Jiaotong University); Lang Nie (Beijing Jiaotong University); Zishuo Zheng (Beijing Jiaotong University); Yao Zhao (Beijing Jiaotong University) | N/A | N/A |
| Cross Attention Based Style Distribution for Controllable Person Image Synthesis | Xinyue Zhou (East China Normal University); Mingyu Yin (East China Normal University); Xinyuan Chen (Shanghai AI Laboratory); Li Sun (East China Normal University)*; Changxin Gao (Huazhong University of Science and Technology); Qingli Li (East China Normal University) | N/A | N/A |
| Generative Meta-Adversarial Network for Unseen Object Navigation | Sixian Zhang (ICT, China Academy of Science)*; Weijie Li (ICT, China Academy of Sciences); Xinhang Song (ICT); Yubing Bai (ICT, China Academy of Science); Shuqiang Jiang (ICT, China Academy of Science) | N/A | N/A |
| Unsupervised Visual Representation Learning by Synchronous Momentum Grouping | Bo Pang (Shanghai Jiao Tong University)*; Yifan Zhang (Shanghai Jiao Tong University); Yaoyi Li (Huawei); Jia Cai (Huawei); Cewu Lu (Shanghai Jiao Tong University) | N/A | N/A |
| OSFormer: One-Stage Camouflaged Instance Segmentation with Transformers | Jialun Pei (Huazhong University of Science and Technology); Tianyang Cheng (Huazhong University of Science and Technology); Deng-Ping Fan (ETH Zurich)*; He Tang (Huazhong University of Science and Technology); Chuanbo Chen (Huazhong University of Science and Technology); Luc Van Gool (ETH Zürich) | N/A | N/A |
| Highly Accurate Dichotomous Image Segmentation | Xuebin Qin (University of Alberta); Hang Dai (Mohamed bin Zayed University of Artificial Intelligence); Xiaobin Hu (Technische Universität München); Deng-Ping Fan (ETH Zurich)*; Ling Shao (Terminus Group); Luc Van Gool (ETH Zurich) | N/A | N/A |
| KeypointNeRF: Generalizing Image-based Volumetric Avatars using Relative Spatial Encoding of Keypoints | Marko Mihajlovic (ETH Zurich)*; Aayush Bansal (Carnegie Mellon University); Michael Zollhöfer (Facebook Reality Labs); Siyu Tang (ETH Zurich); Shunsuke Saito (Facebook) | N/A | N/A |
| MENet: a Memory-Based Network with Dual-Branch for Efficient Event Stream Processing | Linhui Sun (CASIA)*; Yifan Zhang (Institute of Automation, Chinese Academy of Sciences); Ke Cheng (Institute of Automation, Chinese Academy of Sciences); Jian Cheng (Chinese Academy of Sciences, China); Hanqing Lu (NLPR, Institute of Automation, CAS) | N/A | N/A |
| Making Heads or Tails: Towards Semantically Consistent Visual Counterfactuals | Simon Vandenhende (KU Leuven)*; Dhruv Mahajan (Facebook); Filip Radenovic (Facebook AI); Deepti Ghadiyaram (Facebook) | N/A | N/A |
| LEDNet: Joint Low-light Enhancement and Deblurring in the Dark | Shangchen Zhou (Nanyang Technological University)*; Chongyi Li (Nanyang Technological University); Chen Change Loy (Nanyang Technological University) | N/A | N/A |
| RC-MVSNet: Unsupervised Multi-View Stereo with Neural Rendering | Di Chang (Technical University of Munich)*; Aljaz Bozic (Technical University Munich); Tong Zhang (EPFL); Qingsong Yan (hong kong university of science and technology); Yingcong Chen (Hong Kong University of Science and Technology); Sabine Süsstrunk (EPFL); Matthias Niessner (Technical University of Munich) | N/A | N/A |
| StretchBEV: Stretching Future Instance Prediction Spatially and Temporally | Kaan Adil Akan (Koc University); Fatma Guney (Koc University)* | N/A | N/A |
| AgeTransGAN for Facial Age Transformation with Rectified Performance Metrics | Gee-Sern Hsu (National Taiwan University of Science and Technology)*; Rui-Cang Xie (National Taiwan University of Science and Technology); Zhi-Ting Chen (National Taiwan University of Science and Technology); Yu-Hong Lin (National Taiwan University of Science and Technology) | N/A | N/A |
| Boosting Supervised Dehazing Methods via Bi-level Patch Reweighting | Xingyu Jiang (beihang); Hongkun Dou (Beihang University); Chengwei Fu (beihang); Bingquan Dai (Beihang); Tianrun Xu (North China University of Technology); Yue Deng (Samsung Research America)* | N/A | N/A |
| Detecting and Recovering Sequential DeepFake Manipulation | Rui Shao (Nanyang Technological University)*; Tianxing Wu (Nanyang Technological University); Ziwei Liu (Nanyang Technological University) | N/A | N/A |
| MTFormer: Multi-Task Learning via Transformer and Cross-Task Reasoning | Xiaogang XU (The Chinese University of Hong Kong)*; Hengshuang Zhao (University of Oxford); Vibhav Vineet (Microsoft Research); Ser-Nam Lim (Meta AI); Antonio Torralba (MIT) | N/A | N/A |
| Prediction-Guided Distillation for Dense Object Detection | Chenhongyi Yang (University of Edinburgh)*; Mateusz Ochal (Heriot Watt University); Amos Storkey (U Edinburgh); Elliot J Crowley (University of Edinburgh) | N/A | N/A |
| Towards Generic 3D Tracking in RGBD Videos: Benchmark and Baseline | Jinyu Yang (Southern University of Science and Technology)*; Zhongqun Zhang (University of Birmingham); Zhe LI (SUSTech); Hyung Jin Chang (University of Birmingham); Ales Leonardis (University of Birmingham); Feng Zheng (SUSTech) | N/A | N/A |
| C3P: Cross-domain Pose Prior Propagation for Weakly Supervised 3D Human Pose Estimation | cunlin wu (Huazhong University of Science and Technology); Yang Xiao (Huazhong Univ. of Sci.&Tech.); Boshen Zhang (Tencent); Mingyang Zhang (Huazhong Univ. of Sci.&Tech); Zhiguo Cao (Huazhong Univ. of Sci.&Tech.); Joey Tianyi Zhou (ASTAR Centre for Frontier AI Research (CFAR)) | N/A | N/A |
| Adaptive Fine-Grained Sketch-Based Image Retrieval | Ayan Kumar Bhunia (University of Surrey)*; Aneeshan Sain (University of Surrey); Parth Hiren Shah (Indian Institute of Technology Guwahati); Animesh Gupta (Thapar University); Pinaki Nath Chowdhury (University of Surrey); Tao Xiang (University of Surrey); Yi-Zhe Song (University of Surrey) | N/A | N/A |
| Learning Ego 3D Representation as Ray Tracing | Jiachen Lu (Fudan University); Zheyuan Zhou (Fudan University); Xiatian Zhu (University of Surrey); Hang Xu (Huawei Noah’s Ark Lab); Li Zhang (Fudan University)* | N/A | N/A |
| Accelerating Score-based Generative Models with Preconditioned Diffusion Sampling | Hengyuan Ma (Fudan University); Li Zhang (Fudan University)*; Xiatian Zhu (University of Surrey); Jianfeng Feng (Fudan University) | N/A | N/A |
| RCLane: Relay Chain Prediction for Lane Detection | Shenghua Xu (Fudan University); Xinyue Cai (Huawei Noah’s Ark Lab); Bin Zhao (Fudan University); Li Zhang (Fudan University)*; Hang Xu (Huawei Noah’s Ark Lab); Yanwei Fu (Fudan University); Xiangyang Xue (Fudan University) | N/A | N/A |
| Point Primitive Transformer for Long-Term 4D Point Cloud Video Understanding | Hao Wen (Tsinghua University); Yunze Liu (Tsinghua University)*; Jingwei Huang (Huawei); Bo Duan (Huawei); Li Yi (Tsinghua University) | N/A | N/A |
| Towards Efficient Adversarial Training on Vision Transformers | Boxi Wu (Zhejiang University)*; Jindong Gu (University of Munich); Zhifeng Li (Tencent AI Lab); Deng Cai (ZJU); Xiaofei He (Zhejiang University); Wei Liu (Tencent) | N/A | N/A |
| Adaptive Agent Transformer for Few-shot Segmentation | Yuan Wang (University of Science and Technology of China)*; Rui Sun (University of Science and Technology of China); Zhe Zhang (Lunar Exploration and Space Engineering Center of CNSA); Tianzhu Zhang (University of Science and Technology of China) | N/A | N/A |
| Improving Few-Shot Part Segmentation using Coarse Supervision | Oindrila Saha (University of Massachusetts Amherst)*; Zezhou Cheng (University of Massachusetts, Amherst); Subhransu Maji (University of Massachusetts, Amherst) | N/A | N/A |
| Mining Relations among Cross-Frame Affinities for Video Semantic Segmentation | Guolei Sun (ETH Zurich); Yun Liu (ETH Zurich)*; Hao Tang (ETH Zurich); Ajad Chhatkuli (ETH Zurich); Le Zhang (University of Electronic Science and Technology of China); Luc Van Gool (ETH Zurich) | N/A | N/A |
| Out-of-distribution Detection with Boundary Aware Learning | Sen Pei (Institute of Automation, Chinese Academy of Sciences)*; Xin Zhang (Institute of Automation, Chinese Academy of Sciences, University of Chinese Academy of Sciences); Bin Fan (University of Science and Technology Beijing); Gaofeng Meng (Chinese Academy of Sciences) | N/A | N/A |
| NeILF: Neural Incident Light Field for Physically-based Material Estimation | Yao Yao (Apple Inc.); Jingyang Zhang (The Hong Kong University of Science and Technology)*; Jingbo Liu (Apple Inc.); Yihang Qu (Apple Inc.); Tian Fang (Apple); David N McKinnon (Apple); Yanghai Tsin (Apple Inc); Long Quan (Apple) | N/A | N/A |
| ViewFormer: NeRF-free Neural Rendering from Few Images Using Transformers | Jonáš Kulhánek (Czech Technical University in Prague)*; Erik Derner (CTU CIIRC); Torsten Sattler (Czech Technical University in Prague); Robert Babuska (TU Delft) | N/A | N/A |
| L-Tracing: Fast Light Visibility Estimation on Neural Surfaces by Sphere Tracing | Ziyu Chen (Shanghai Jiao Tong University)*; Chenjing Ding (Sensetime Group Limited); Jianfei Guo (Shanghai AI Laboratory); Dongliang Wang (SenseTime Group Limited); Yikang Li (Shanghai AI Lab); Xuan Xiao (SenseTime Group Limited); Wei Wu (SenseTime Group Limited); Li Song (Shanghai Jiao Tong University) | N/A | N/A |
| ARF: Artistic Radiance fields | Kai Zhang (Cornell University)*; Nicholas I Kolkin (Adobe Research); Sai Bi (Adobe Research); Fujun Luan (Adobe Research); Zexiang Xu (Adobe Research); Eli Shechtman (Adobe Research, US); Noah Snavely (Cornell University and Google AI) | N/A | N/A |
| Multiview Stereo with Cascaded Epipolar RAFT | Zeyu Ma (Princeton University)*; Zachary Teed (Princeton University); Jia Deng (Princeton University) | N/A | N/A |
| What to Hide from Your Students: Attention-Guided Masked Image Modeling | Ioannis Kakogeorgiou (National Technical University of Athens)*; Spyros Gidaris (valeo.ai); Bill Psomas (National Technical University of Athens); Yannis Avrithis (IARAI, Athena RC); Andrei Bursuc (valeo.ai); Konstantinos Karantzalos (National Technical University of Athens); Nikos Komodakis (University of Crete) | N/A | N/A |
| Static and Dynamic Concepts for Self-supervised Video Representation Learning | Rui Qian (The Chinese University of Hong Kong)*; Shuangrui Ding (Shanghai Jiao Tong University); Xian Liu (The Chinese University of Hong Kong); Dahua Lin (The Chinese University of Hong Kong) | N/A | N/A |
| Deep Partial Updating: Towards Communication Efficient Updating for On-device Inference | Zhongnan Qu (ETH Zurich)*; Cong Liu (University of Texas at Dallas); Lothar Thiele (ETH Zürich) | N/A | N/A |
| Gradient-based Uncertainty for Monocular Depth Estimation | Julia Hornauer (Ulm University)*; Vasileios Belagiannis (Otto von Guericke University Magdeburg) | N/A | N/A |
| Flow-Guided Transformer for Video Inpainting | Kaidong Zhang (University of Science and Technology of China); Jingjing Fu (Microsoft)*; Dong Liu (University of Science and Technology of China) | N/A | N/A |
| Relationformer: A Unified Framework for Image-to-Graph Generation | Suprosanna Shit (TUM)*; Rajat Koner (Ludwig Maximilian University of Munich); Bastian Wittmann (Technical University of Munich); Johannes C. Paetzold (TUM); Ivan Ezhov (TUM); Hongwei Li (Technical University of Munich); Jiazhen Pan (Technical University of Munich); Sahand Sharifzadeh (Ludwig Maximilian University of Munich); Georgios Kaissis (Technische Universität München); Volker Tresp (LMU); Bjoern Menze (TUM) | N/A | N/A |
| ARAH: Animatable Volume Rendering of Articulated Human SDFs | Shaofei wang (ETH Zurich)*; Katja Schwarz (MPI Tuebingen); Andreas Geiger (University of Tuebingen); Siyu Tang (ETH Zurich) | N/A | N/A |
| Learning Hierarchy Aware Features for Reducing Mistake Severity | Ashima Garg (IIIT Delhi)*; Depanshu Sani (Indraprastha Institute of Information Technology); Saket Anand (Indraprastha Institute of Information Technology Delhi) | N/A | N/A |
| Exploiting Unlabeled Data with Vision and Language Models for Object Detection | Shiyu Zhao (Rutgers University)*; Zhixing Zhang (Rutgers University); Samuel Schulter (NEC Laboratories America); Long Zhao (Google Research); Vijay Kumar B G (NEC Laboratories America); Anastasis Stathopoulos (Rutgers University); Manmohan Chandraker (UC San Diego); Dimitris N. Metaxas (Rutgers) | N/A | N/A |
| A Simple and Robust Correlation Filtering method for text-based person search | Wei Suo (Northwestern Polytechnical University); MengYang Sun (Northwestern Polytechnical University); Kai Niu (Northwestern Polytechnical University); Yiqi Gao (Northwestern Polytechnical University); Peng Wang (Northwestern Polytechnical University); Yanning Zhang (Northwestern Polytechnical University)*; Qi Wu (University of Adelaide) | N/A | N/A |
| Hunting Group Clues with Transformers for Social Group Activity Recognition | Masato Tamura (Hitachi America, Ltd.)*; Rahul Vishwakarma (Hitachi America Ltd.); Ravigopal Vennelakanti (Hitachi America, Ltd.) | N/A | N/A |
| Quantized GAN for Complex Music Generation from Dance Videos | Ye Zhu (Illinois Institute of Technology)*; Kyle B Olszewski (Snap Inc.); Yu Wu (Princeton University); Panos Achlioptas (Stanford University); Menglei Chai (Snap Inc.); Yan Yan (Illinois Institute of Technology); Sergey Tulyakov (Snap Inc) | N/A | N/A |
| Not Just Streaks: Towards Ground Truth for Single Image Deraining | Yunhao Ba (UCLA)*; Howard Zhang (UCLA); Ethan Yang (UCLA); Akira Suzuki (UCLA); Arnold J Pfahnl (University of California, Los Angeles); Chethan Chinder Chandrappa (University of California – Los Angeles); Celso de Melo (Army Research Laboratory); Suya You (US Army Research Laboratory); Stefano Soatto (UCLA); Alex Wong (Yale University); Achuta Kadambi (UCLA) | N/A | N/A |
| HIVE: Evaluating the Human Interpretability of Visual Explanations | Sunnie S. Y. Kim (Princeton University)*; Nicole Meister (Princeton University); Vikram V. Ramaswamy (Princeton University); Ruth C Fong (Princeton University); Olga Russakovsky (Princeton University) | N/A | N/A |
| GAMa: Cross-view Video Geo-localization | Shruti Vyas (University of Central Florida)*; Chen Chen (University of Central Florida); Mubarak Shah (University of Central Florida) | N/A | N/A |
| Meta-Sampler: Almost-Universal yet Task-Oriented Sampling for Point Clouds | Ta-Ying Cheng (University of Oxford); Qingyong Hu (University of Oxford)*; Qian Xie (University of Oxford); Niki Trigoni (University of Oxford); Andrew Markham (University of Oxford) | N/A | N/A |
| Multi-Query Video Retrieval | Zeyu Wang (Princeton University)*; Yu Wu (Princeton University); Karthik Narasimhan (Princeton University); Olga Russakovsky (Princeton University) | N/A | N/A |
| Waymo Open Dataset: Panoramic Video Panoptic Segmentation | Jieru Mei (Johns Hopkins University); Alex Zhu (Waymo)*; Xinchen Yan (Waymo); Hang Yan (Waymo LLC); Siyuan Qiao (Google); Yukun Zhu (Google Inc.); Liang-Chieh Chen (Google Inc.); Henrik Kretzschmar (Waymo) | N/A | N/A |
| MIME: Minority Inclusion for Majority Group Enhancement of AI Performance | Pradyumna Chari (UCLA); Yunhao Ba (UCLA)*; Shreeram Athreya (UCLA); Achuta Kadambi (UCLA) | N/A | N/A |
| Self-supervised Human Mesh Recovery with Cross-Representation Alignment | Xuan Gong (University at Buffalo); Meng Zheng (United Imaging Intelligence); Benjamin Planche (United Imaging Intelligence); Srikrishna Karanam (Adobe Research); Terrence Chen (United Imaging Intelligence); David Doermann (University at Buffalo); Ziyan Wu (United Imaging Intelligence)* | N/A | N/A |
| TL;DW? Summarizing Instructional Videos with Task Relevance & Cross-Modal Saliency | Medhini Narasimhan (UC Berkeley)*; Arsha Nagrani (Google); Chen Sun (Brown University); Michael Rubinstein (Google); Trevor Darrell (UC Berkeley); Anna Rohrbach (UC Berkeley); Cordelia Schmid (Google) | N/A | N/A |
| A Perceptual Quality Metric for Video Frame Interpolation | Qiqi Hou (Portland State University)*; Abhijay Ghildyal (Portland State University); Feng Liu (Portland State University) | N/A | N/A |
| Adaptive Feature Interpolation for Low-Shot Image Generation | Mengyu Dai (Microsoft Corporation)*; Haibin Hang (Amazon.com); Xiaoyang Guo (Facebook) | N/A | N/A |
| Rethinking Learning Approaches for Long-Term Action Anticipation | Megha Nawhal (Simon Fraser University)*; Akash Abdu Jyothi (Simon Fraser University); Greg Mori (Simon Fraser University / Borealis AI) | N/A | N/A |
| Object Manipulation via Visual Target Localization | Kiana Ehsani (Allen Institute for Artificial Intelligence)*; Ali Farhadi (University of Washington, Apple); Aniruddha Kembhavi (Allen Institute for Artificial Intelligence); Roozbeh Mottaghi (Allen Institute for AI) | N/A | N/A |
| AlignSDF: Pose-Aligned Signed Distance Fields for Hand-Object Reconstruction | Zerui Chen (Inria Paris); Yana Hasson (Inria); Cordelia Schmid (Inria/Google)*; Ivan Laptev (INRIA Paris) | N/A | N/A |
| Shift-tolerant Perceptual Similarity Metric | Abhijay Ghildyal (Portland State University)*; Feng Liu (Portland State University) | N/A | N/A |
| Making the Most of Text Semantics to Improve Biomedical Vision-Language Processing | Benedikt Boecking (Carnegie Mellon University); Naoto Usuyama (Microsoft Research); Shruthi J Bannur (Microsoft Research); Daniel Coelho de Castro (Microsoft Research); Anton Schwaighofer (Microsoft Research); Stephanie Hyland (Microsoft Research); Maria Teodora A Wetscherek (Microsoft); Tristan Naumann (Microsoft Research Redmond, US); Aditya Nori (Microsoft Research); Javier Alvarez-Valle (Microsoft Research); Hoifung Poon (Microsoft Research); Ozan Oktay (Microsoft Research)* | N/A | N/A |
| Self-Supervised Sparse Representation for Video Anomaly Detection | Jhih-Ciang Wu (Academia Sinica)*; He-Yen Hsieh (Academia Sinica); Ding-Jie Chen (Academia Sinica); Chiou-Shann Fuh (National Taiwan University); Tyng-Luh Liu (Academia Sinica) | N/A | N/A |
| CPO: Change Robust Panorama to Point Cloud Localization | Junho Kim (Seoul National University)*; Hojun Jang (Seoul National University); Changwoon Choi (Seoul National University); Young Min Kim (Seoul National University) | N/A | N/A |
| MonoPLFlowNet: Permutohedral Lattice FlowNet for Real-Scale 3D Scene Flow Estimation with Monocular Images | Runfa Li (UC San Diego)*; Truong Nguyen (UC San Diego) | N/A | N/A |
| DLCFT: Deep Linear Continual Fine-Tuning for General Incremental Learning | Hyounguk Shon (KAIST)*; Janghyeon Lee (LG AI Research); Seung Hwan Kim (LG AI Research); Junmo Kim (KAIST) | N/A | N/A |
| Contrastive Positive Mining for Unsupervised 3D Action Representation Learning | Haoyuan Zhang (Tianjin University)*; Yonghong Hou (Tianjin University); Wenjing Zhang (Tianjin University); Wanqing Li (University of Wollongong) | N/A | N/A |
| Patch Similarity Aware Data-Free Quantization for Vision Transformers | Zhikai Li (Institute of Automation, Chinese Academy of Sciences; School of Artificial Intelligence, University of Chinese Academy of Sciences); Liping Ma (Institute of Automation, Chinese Academy of Sciences); Mengjuan Chen (Center of Precision Sensing and Control, Institute of Automation, Chinese Academy of Sciences); Junrui Xiao (Institute of Automation, Chinese Academy of Sciences; School of Artificial Intelligence, University of Chinese Academy of Sciences); Qingyi Gu (Institute of Automation, Chinese Academy of Sciences)* | N/A | N/A |
| Perception-Distortion Balanced ADMM Optimization for Single-Image Super-Resolution | Yuehan Zhang (National University of Singapore)*; Bo Ji (National University of Singapore); Jia Hao (HiSilicon (Shanghai) Technologies Co., Ltd); Angela Yao (National University of Singapore) | N/A | N/A |
| DualFormer: Local-Global Stratified Transformer for Efficient Video Recognition | Yuxuan Liang (National University of Singapore)*; Pan Zhou (Sea AI Lab); Roger Zimmermann (NUS); Shuicheng Yan (Sea AI Labs) | N/A | N/A |
| Hierarchical Contrastive Inconsistency Learning for Deepfake Video Detection | Zhihao Gu (Shanghai Jiao Tong University)*; Taiping Yao (Tencent YouTu); Yang Chen (Tencent); Shouhong Ding (Tencent); Lizhuang Ma (Shanghai Jiao Tong University) | N/A | N/A |
| Watermark Vaccine: Adversarial Attacks to Prevent Watermark Removal | Xinwei Liu (Institute of Information Engineering, Chinese Academy of Sciences)*; Jian Liu (Ant Group); Yang Bai (Tsinghua); Jindong Gu (University of Munich); Tao Chen (Ant Group); Xiaojun Jia (Institute of Information Engineering, Chinese Academy of Sciences); Xiaochun Cao (Sun Yat-sen University) | N/A | N/A |
| ECCV Caption: Correcting False Negatives by Collecting Machine-and-Human-verified Image-Caption Associations for MS-COCO | Sanghyuk Chun (NAVER AI Lab)*; Wonjae Kim (NAVER AI Lab); Song Park (NAVER AI Lab); Minsuk Chang (NAVER AI Lab); Seong Joon Oh (Naver AI Lab) | N/A | N/A |
| Personalizing Federated Medical Image Segmentation via Local Calibration | Jiacheng Wang (Xiamen University); Yueming Jin (The Chinese University of Hong Kong); Liansheng Wang (Xiamen University)* | N/A | N/A |
| Learning to Detect Every Thing in an Open World | Kuniaki Saito (Boston University)*; Ping Hu (Boston University); Trevor Darrell (UC Berkeley); Kate Saenko (Boston University) | N/A | N/A |
| MVP: Multimodality-guided Visual Pre-training | Longhui Wei (University of Science and Technology of China)*; Lingxi Xie (Huawei Inc.); Wengang Zhou (University of Science and Technology of China); Houqiang Li (University of Science and Technology of China); Qi Tian (Huawei Cloud & AI) | N/A | N/A |
| Uncertainty Learning in Kernel Estimation for Multi-Stage Blind Image Super-Resolution | Zhenxuan Fang (Xidian University); Weisheng Dong (Xidian University)*; Xin Li (West Virginia University); Jinjian Wu (Xidian University); Leida Li (Xidian University); Guangming Shi (Xidian University) | N/A | N/A |
| Physical Attack on Monocular Depth Estimation in Autonomous Driving with Optimal Adversarial Patches | Zhiyuan Cheng (Purdue University)*; James C Liang (Rochester Institute of Technology); Hongjun Choi (Purdue University); Guanhong Tao (Purdue University); Zhiwen Cao (Purdue University); Dongfang Liu (Rochester Institute of Technology); Xiangyu Zhang (Purdue University) | N/A | N/A |
| KVT: $k$-NN Attention for Boosting Vision Transformers | Pichao Wang (Alibaba Group)*; Xue Wang (Alibaba DAMO Academy); Fan Wang (Alibaba Group); Ming Lin (Alibaba Group); Shuning Chang (Alibaba Group); Hao Li (Alibaba Group); rong jin (alibaba group) | N/A | N/A |
| Locally Varying Distance Transform for Unsupervised Visual Anomaly Detection | Wen-Yan Lin (SMU); Zhonghang Liu (SMU); Siying Liu (I2R Singapore)* | N/A | N/A |
| Hierarchical Feature Alignment Network for Unsupervised Video Object Segmentation | Gensheng Pei (Nanjing University of Science and Technology)*; Fumin Shen (UESTC); Yazhou Yao (Nanjing University of Science and Technology); Guo-Sen Xie (Nanjing University of Science and Technology); Zhenmin Tang (Nanjing University of Science and Technology); Jinhui Tang (Nanjing University of Science and Technology) | N/A | N/A |
| PalGAN: Image Colorization with Palette Generative Adversarial Networks | Yi Wang (Shanghai AI Laboratory)*; Menghan Xia (Tencent AI lab); Lu Qi (The Chinese University of Hong Kong); Jing Shao (Sensetime); Yu Qiao (Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences) | N/A | N/A |
| Fast-Vid2Vid: Spatial-Temporal Compression for Video-to-Video Synthesis | Long Zhuo (Shanghai AI Lab)*; Guangcong Wang (Nanyang Technological University); Shikai Li (SenseTime Research); Wayne Wu (SenseTime Research); Ziwei Liu (Nanyang Technological University) | N/A | N/A |
| Generative Negative Text Replay for Continual Vision-Language Pretraining | Shipeng Yan (ShanghaiTech University)*; Lanqing Hong (Huawei Noah’s Ark Lab); Hang Xu (Huawei Noah’s Ark Lab); Jianhua Han (Huawei Noah’s Ark Lab); Tinne Tuytelaars (KU Leuven); Zhenguo Li (Huawei Noah’s Ark Lab); Xuming He (ShanghaiTech University) | N/A | N/A |
| Learning Spatio-Temporal Downsampling for Effective Video Upscaling | Xiaoyu Xiang (Meta Platforms Inc.)*; Yapeng Tian (University of Texas at Dallas); Vijay Rengarajan (Meta Platforms Inc.); Lucas D Young (Facebook); Bo Zhu (Meta Platforms, Inc.); Rakesh Ranjan (Facebook) | N/A | N/A |
| Geometric Representation Learning for Document Image Rectification | Hao Feng (University of Science and Technology of China)*; Wengang Zhou (University of Science and Technology of China); Jiajun Deng (University of Science and Technology of China); Yuechen Wang (University of Science and Technology of China); Houqiang Li (University of Science and Technology of China) | N/A | N/A |
| ASpanFormer: Detector-Free Image Matching with Adaptive Span Transformer | Hongkai Chen (HKUST)*; Zixin Luo (Apple Inc.); Lei Zhou (Apple); Yurun Tian (Apple); Zhen Mingmin (Apple Inc.); Tian Fang (Apple); David N McKinnon (Apple); Yanghai Tsin (Apple Inc); Long Quan (Apple) | N/A | N/A |
| Egocentric Activity Recognition and Localization on a 3D Map | Miao Liu (Georgia Institute of Technology)*; Lingni Ma (Facebook Reality Labs); Kiran Somasundaram (Facebook Reality Labs); Yin Li (University of Wisconsin-Madison); Kristen Grauman (Facebook AI Research & UT Austin); James Rehg (Georgia Institute of Technology); Chao Li (Facebook Reality Labs) | N/A | N/A |
| Generative Adversarial Network for Future Hand Segmentation from Egocentric Video | Wenqi Jia (Georgia Institute of Technology)*; Miao Liu (Georgia Institute of Technology); James Rehg (Georgia Institute of Technology) | N/A | N/A |
| One-Shot Medical Landmark Localization by Edge-Guided Transform and Noisy Landmark Refinement | Zihao Yin (Center for Data Science, Peking University); Ping Gong (Deepwise AI Lab); Chunyu Wang (Microsoft Research asia); Yizhou Yu (The University of Hong Kong); Yizhou Wang (PKU)* | N/A | N/A |
| Learning Prior Feature and Attention Enhanced Image Inpainting | chenjie cao (fudan.edu.cn)*; Qiaole Dong (Fudan University); Yanwei Fu (Fudan University) | N/A | N/A |
| AdaAfford: Learning to Adapt Manipulation Affordance for 3D Articulated Objects via Few-shot Interactions | Yian Wang (Peking university); Ruihai Wu (Peking University); Kaichun Mo (Stanford); Jiaqi Ke (Peking University); Qingnan Fan (Tencent AI Lab); Leonidas Guibas (Stanford University); Hao Dong (Peking University)* | N/A | N/A |
| Video Graph Transformer for Video Question Answering | Junbin Xiao (National University of Singapore)*; Pan Zhou (Sea AI Lab); Tat-Seng Chua (National Univ. of Singapore); Shuicheng Yan (Sea AI Labs) | N/A | N/A |
| A Reliable Online Method for Joint Estimation of Focal Length and Camera Rotation | Yiming Qian (Osaka University)*; James Elder (York University) | N/A | N/A |
| Learning Local Implicit Fourier Representation for Image Warping | Jaewon Lee (DGIST)*; Kwang Pyo Choi (Samsung Electronics); Kyong Hwan Jin (DGIST) | N/A | N/A |
| SepLUT: Separable Image-adaptive Lookup Tables for Real-time Image Enhancement | Canqian Yang (Shanghai Jiao Tong University); Meiguang Jin (Alibaba Group); Yi Xu (Shanghai Jiao Tong University)*; Rui Zhang (Shanghai Jiao Tong University); Ying Chen (Alibaba Group); Huaida Liu (Alibaba) | N/A | N/A |
| Temporal-MPI: Enabling Multi-Plane Images for Dynamic Scene Modelling via Temporal Basis Learning | Wenpeng Xing (Hong Kong Baptist University); Jie Chen (Hong Kong Baptist University)* | N/A | N/A |
| Blind Image Decomposition | Junlin Han (CSIRO)*; Weihao Li (Data61, CSIRO); Pengfei Fang (The Australian National University); Chunyi Sun (Australian National University); Jie Hong (Australian National University); Mohammad Ali Armin (CSIRO (Data61)); Lars Petersson (Data61/CSIRO); Hongdong Li (Australian National University, Australia) | N/A | N/A |
| INT: Towards Infinite-frames 3D Detection with An Efficient Framework | Jianyun Xu (DAMO Academy, Alibaba Group)*; Zhenwei Miao (DAMO Academy, Alibaba Group); Da Zhang (UC Santa Barbara); Hongyu Pan (DAMO Academy, Alibaba Group); Kaixuan Liu (DAMO Academy, Alibaba Group); Peihan Hao (DAMO Academy, Alibaba Group); Jun Zhu (DAMO Academy, Alibaba Group); Zhengyang Sun (Tsinghua University); Li Hongmin (Huawei TCS lab); Xin Zhan (DAMO Academy, Alibaba Group) | N/A | N/A |
| MuLUT: Cooperating Multiple Look-Up Tables for Efficient Image Super-Resolution | Jiacheng Li (University of Science and Technology of China); Chang Chen (Huawei Noah’s Ark Lab); Zhen Cheng (University of Science and Technology of China); Zhiwei Xiong (University of Science and Technology of China)* | N/A | N/A |
| NDF: Neural Deformable Fields for Dynamic Human Modelling | Ruiqi Zhang (Hong Kong Baptist University); Jie Chen (Hong Kong Baptist University)* | N/A | N/A |
| MPIB: An MPI-Based Bokeh Rendering Framework for Realistic Partial Occlusion Effects | Juewen Peng (Huazhong University of Science and Technology); Jianming Zhang (Adobe Research); Xianrui Luo (Huazhong University of Science and Technology); Hao Lu (Huazhong University of Science and Technology); Ke Xian (Huazhong University of Science and Technology)*; Zhiguo Cao (Huazhong Univ. of Sci.&Tech.) | N/A | N/A |
| Neural Density-Distance Fields | Itsuki Ueda (University of Tsukuba)*; Yoshihiro Fukuhara (Waseda University); Hirokatsu Kataoka (National Institute of Advanced Industrial Science and Technology (AIST)); Hiroaki Aizawa (Hiroshima University); Hidehiko Shishido (University of Tsukuba); Itaru Kitahara (University of Tsukuba) | N/A | N/A |
| MoDA: Map style transfer for self-supervised Domain Adaptation of embodied agents | Eun Sun Lee (Seoul National University)*; Junho Kim (Seoul National University); Sangwon Park (Seoul Nat’l University); Young Min Kim (Seoul National University) | N/A | N/A |
| L3: Accelerator-Friendly Lossless Image Format for High-Resolution, High-Throughput DNN Training | Jonghyun Bae (Seoul National University)*; Woohyeon Baek (Seoul National University); Tae Jun Ham (Seoul National University); Jae W. Lee (Seoul National University) | N/A | N/A |
| Prior-Guided Adversarial Initialization for Fast Adversarial Training | Xiaojun Jia (Institute of Information Engineering, Chinese Academy of Sciences)*; Yong Zhang (Tencent AI Lab); Xingxing Wei (Beihang University); Baoyuan Wu (The Chinese University of Hong Kong, Shenzhen; Shenzhen Research Institute of Big Data); Ke Ma (UCAS); Jue Wang (Tencent AI Lab); Xiaochun Cao (Sun Yat-sen University) | N/A | N/A |
| Housekeep: Tidying Virtual Households using Commonsense Reasoning | Yash Mukund Kant (University of Toronto)*; Arun Ramachandran (Georgia Institute of Technology); Sriram Yenamandra (Georgia Institute of Technology); Igor Gilitschenski (University of Toronto); Dhruv Batra (Georgia Tech & Facebook AI Research); Andrew Szot (Georgia Institute of Technology); Harsh Agrawal (Georgia Institute of Technology) | N/A | N/A |
| Real-RawVSR: Real-World Raw Video Super-Resolution with a Benchmark Dataset | Huanjing Yue (Tianjin University)*; Zhiming Zhang (Tianjin University); Jingyu Yang (Tianjin University) | N/A | N/A |
| ST-P3: End-to-end Vision-based Autonomous Driving via Spatial-Temporal Feature Learning | Shengchao Hu (Shanghai Jiao Tong University)*; Li Chen (Shanghai AI Laboratory); Penghao Wu (Shanghai Jiao Tong University); Hongyang Li (SenseTime); Junchi Yan (Shanghai Jiao Tong University); Dacheng Tao (JD.com) | N/A | N/A |
| NeXT: Towards High Quality Neural Radiance Fields via Multi-Skip Transformer | Yunxiao Wang (Tsinghua University); Yanjie Li (Tsinghua University)*; Peidong Liu (Tsinghua University); Tao Dai (Shenzhen University); Shu-Tao Xia (Tsinghua University) | N/A | N/A |
| Learning Spatiotemporal Frequency-Transformer for Compressed Video Super-Resolution | Zhongwei Qiu (University of Science and Technology Beijing); Huan Yang (Microsoft Research)*; Jianlong Fu (Microsoft Research); Dongmei Fu (University of Science and Technology Beijing) | N/A | N/A |
| Adversarial Partial Domain Adaptation by Cycle Inconsistency | Kun-Yu Lin (Sun Yat-sen University); Jiaming Zhou (Sun Yat-sen University); Yukun Qiu (Sun Yat-sen University); Wei-Shi Zheng (Sun Yat-sen University, China)* | N/A | N/A |
| BayesCap: Bayesian Identity Cap for Calibrated Uncertainty in Frozen Neural Networks | Uddeshya Upadhyay (University of Tübingen)*; Shyamgopal Karthik (University of Tübingen); Massimiliano Mancini (University of Tübingen); Yanbei Chen (University of Tübingen); Zeynep Akata (University of Tübingen) | N/A | N/A |
| Domain Randomization-Enhanced Depth Simulation and Restoration for Perceiving and Grasping Specular and Transparent Objects | Qiyu Dai (Peking University); Jiyao Zhang (Xi’an Jiaotong University); Qiwei Li (Peking University); Tianhao Wu (Peking University); Hao Dong (Peking University); Ziyuan Liu (Huawei group); Ping Tan (Simon Fraser University); He Wang (Peking University)* | N/A | N/A |
| PS-NeRF: Neural Inverse Rendering for Multi-view Photometric Stereo | Wenqi Yang (The University of Hong Kong)*; Guanying Chen (The Chinese University of Hong Kong, Shenzhen); Chaofeng Chen (Nanyang Technological University); Zhenfang Chen (MIT-IBM Watson AI Lab); Kwan-Yee K. Wong (The University of Hong Kong) | N/A | N/A |
| DeciWatch: A Simple Baseline for 10× Efficient 2D and 3D Pose Estimation | Ailing Zeng (The Chinese University of Hong Kong)*; Xuan Ju (The Chinese University of Hong Kong); Lei Yang (Sensetime Group Limited); Ruiyuan Gao (The Chinese University of Hong Kong); Xizhou Zhu (SenseTime); Bo Dai (Shanghai AI Lab); Qiang Xu (The Chinese University of Hong Kong) | N/A | N/A |
| Hierarchical Latent Structure for Multi-Modal Vehicle Trajectory Forecasting | Dooseop Choi (ETRI)*; KyoungWook Min (ETRI) | N/A | N/A |
| SmoothNet: A Plug-and-Play Network for Refining Human Poses in Videos | Ailing Zeng (The Chinese University of Hong Kong)*; Lei Yang (Sensetime Group Limited); Xuan Ju (The Chinese University of Hong Kong); Jiefeng Li (Shanghai Jiao Tong University); Jianyi Wang (Nanyang Technological University); Qiang Xu (The Chinese University of Hong Kong) | N/A | N/A |
| Share With Thy Neighbors: Single-View Reconstruction by Cross-Instance Consistency | Tom Monnier (École des ponts Paristech)*; Matthew Fisher (Adobe Research); Alexei A Efros (UC Berkeley); Mathieu Aubry (École des ponts ParisTech) | N/A | N/A |
| End-to-End Weakly Supervised Object Detection with Sparse Proposal Evolution | Mingxiang Liao (University of Chinese Academy of Sciences); Fang Wan (University of Chinese Academy of Sciences)*; Yuan Yao (University of Chinese Academy of Sciences); Zhenjun Han (University of Chinese Academy of Sciences); Zou Jialing (University of Chinese Academy of Sciences); Yuze Wang (Huawei Noah’s Ark Lab); Bailan Feng (Huawei Noah’s Ark Lab); Peng Yuan (Huawei Noah’s Ark Lab); Qixiang Ye (University of Chinese Academy of Sciences, China) | N/A | N/A |
| PAC-Net: Highlight Your Video via History Preference Modeling | Hang Wang (Huawei HiSilicon)*; Penghao Zhou (ByteDance); Chong Zhou (Nanyang Technological University); Zhao Zhang (Nankai University); Xing Sun (Shopee) | N/A | N/A |
| Efficient Point Cloud Analysis Using Hilbert Curve | Wanli Chen (CUHK)*; Xinge Zhu (The Chinese University of Hong Kong); Guojin Chen (The Chinese University of Hong Kong); Bei Yu (CUHK) | N/A | N/A |
| Learning Online Multi-Sensor Depth Fusion | Erik Sandström (ETH Zürich)*; Martin R. Oswald (ETH Zurich); Suryansh Kumar (ETH Zurich); Silvan Weder (ETH Zürich); Fisher Yu (ETH Zurich); Cristian Sminchisescu (Lund University); Luc Van Gool (ETH Zurich) | N/A | N/A |
| Self-Support Few-Shot Semantic Segmentation | Qi Fan (HKUST)*; Wenjie Pei (Harbin Institute of Technology, Shenzhen); Yu-Wing Tai (Kuaishou Technology / HKUST); Chi-Keung Tang (Hong Kong University of Science and Technology) | N/A | N/A |
| Few-Shot Object Detection with Model Calibration | Qi Fan (HKUST)*; Chi-Keung Tang (Hong Kong University of Science and Technology); Yu-Wing Tai (Kuaishou Technology / HKUST) | N/A | N/A |
| S2-VER: Semi-Supervised Visual Emotion Recognition | Guoli Jia (Nankai University); Jufeng Yang (Nankai University)* | N/A | N/A |
| Self-Supervision Can Be a Good Few-Shot Learner | Yuning Lu (USTC); Liangjian Wen (the Noah’s Ark Lab, Huawei Technologies Company Limited); Jianzhuang Liu (Huawei Noah’s Ark Lab); Yajing Liu (USTC); Xinmei Tian (USTC)* | N/A | N/A |
| My View is the Best View: Procedure Learning from Egocentric Videos | Siddhant Bansal (IIIT, Hyderabad)*; Chetan Arora (Indian Institute of Technology Delhi); C.V. Jawahar (IIIT-Hyderabad) | N/A | N/A |
| Trace Controlled Text to Image Generation | Kun Yan (Beihang University)*; Lei Ji (Microsoft); Chenfei Wu (Microsoft); Jianmin Bao (Microsoft); Ming Zhou (Sinovation Ventures); Nan Duan (Microsoft Research); Shuai Ma (Beihang University) | N/A | N/A |
| Towards Comprehensive Representation Enhancement in Semantics-guided Self-supervised Monocular Depth Estimation | Jingyuan Ma (Hikvision Research Institute)*; Xiangyu Lei (Hikvision Research Institute); Nan Liu (Hikvision); Zhao Xian (Hikvision); Shiliang Pu (Hikvision Research Institute) | N/A | N/A |
| Calibration-free Multi-view Crowd Counting | Qi Zhang (City University of Hong Kong, Hong Kong)*; Antoni Chan (City University of Hong Kong, Hong Kong) | N/A | N/A |
| Unsupervised Domain Adaptation for Monocular 3D Object Detection via Self-Training | Zhenyu Li (Harbin Institute of Technology)*; Zehui Chen (University of Science and Technology of China); Ang Li (SenseTime Research); Liangji Fang (Sensetime Research); Qinhong Jiang (SenseTime Research; Shanghai AI Laboratory); Xianming Liu (Harbin Institute of Technology); Junjun Jiang (Harbin Institute of Technology) | N/A | N/A |
| Online Continual Learning with Contrastive Vision Transformer | Zhen Wang (The University of Sydney)*; Liu Liu (The University of Sydney); Yajing Kong (The University of Sydney); Jiaxian Guo (The University of Sydney); Dacheng Tao (JD.com) | N/A | N/A |
| COO: Comic Onomatopoeia Dataset for Recognizing Arbitrary or Truncated Texts | Jeonghun Baek (The University of Tokyo)*; Yusuke Matsui (The University of Tokyo); Kiyoharu Aizawa (The University of Tokyo) | N/A | N/A |
| BungeeNeRF: Progressive Neural Radiance Field for Extreme Multiscale Scene Rendering | Yuanbo Xiangli (Chinese University of Hong Kong)*; Linning Xu (CUHK); Xingang Pan (Max Planck Institute for Informatics); Nanxuan Zhao (University of Bath); Anyi Rao (The Chinese University of Hong Kong); Christian Theobalt (MPI Informatik); Bo Dai (Shanghai AI Lab); Dahua Lin (The Chinese University of Hong Kong) | N/A | N/A |
| AiATrack: Attention in Attention for Transformer Visual Tracking | Shenyuan Gao (Huazhong University of Science and Technology)*; Chunluan Zhou (Wormpex AI Research); Chao Ma (Shanghai Jiao Tong University); Xinggang Wang (Huazhong University of Science and Technology); Junsong Yuan (State University of New York at Buffalo, USA) | N/A | N/A |
| Learning Invariant Visual Representations for Compositional Zero-Shot Learning | Tian Zhang (Beijing University of Posts and Telecommunications); Kongming Liang (Beijing University of Posts and Telecommunications)*; Ruoyi Du (Beijing University of Posts and Telecommunications); Xian Sun (Aerospace Information Research Institute, Chinese Academy of Sciences); Zhanyu Ma (Beijing University of Posts and Telecommunications); Jun Guo (Beijing University of Posts and Telecommunications) | N/A | N/A |
| Image Coding for Machines with Omnipotent Feature Learning | Ruoyu Feng (University of Science and Technology of China)*; Xin Jin (University of Science and Technology of China); Zongyu Guo (University of Science and Technology of China); Runsen Feng (University of Science and Technology of China); Yixin Gao (University of Science and Technology of China); Tianyu He (Microsoft Research Asia); Zhizheng Zhang (Microsoft Research); Simeng Sun (University of Science and Technology of China); Zhibo Chen (University of Science and Technology of China) | N/A | N/A |
| MOTCOM: The Multi-Object Tracking Dataset Complexity Metric | Malte Pedersen (Aalborg University)*; Joakim Bruslund Haurum (Aalborg University); Patrick Dendorfer (TUM); Thomas B. Moeslund (Aalborg University) | N/A | N/A |
| How Severe is Benchmark-Sensitivity in Video Self-Supervised Learning? | Fida Mohammad Thoker (University of Amsterdam)*; Hazel Doughty (University of Amsterdam); Piyush Nitin Bagad (University of Amsterdam); Cees Snoek (University of Amsterdam) | N/A | N/A |
| Rethinking Robust Representation Learning Under Fine-grained Noisy Faces | Bingqi Ma (Sensetime Group Limited)*; Guanglu Song (Sensetime); Boxiao Liu (Institute of Computing Technology, Chinese Academy of Sciences); Yu Liu (SenseTime Group LTD) | N/A | N/A |
| Feature Representation Learning for Unsupervised Cross-domain Image Retrieval | Conghui Hu (National University of Singapore)*; Gim Hee Lee (National University of Singapore) | N/A | N/A |
| Cost Aggregation with 4D Convolutional Swin Transformer for Few-Shot Segmentation | Sunghwan Hong (Korea University); Seokju Cho (Korea University); Jisu Nam (Korea University); Stephen Lin (Microsoft Research); Seungryong Kim (Korea University)* | N/A | N/A |
| Spatial-Frequency Domain Information Integration for Pan-sharpening | Man Zhou (University of Science and Technology of China); Jie Huang (University of Science and Technology of China); Keyu Yan (University of Science and Technology of China); Hu Yu (University of Science and Technology of China); Xueyang Fu (University of Science and Technology of China); Aiping Liu (University of Science and Technology of China); Xian Wei (East China Normal University); Feng Zhao (University of Science and Technology of China)* | N/A | N/A |
| TOCH: Spatio-Temporal Object-to-Hand Correspondence for Motion Refinement | Keyang Zhou (University of Tübingen)*; Bharat Lal Bhatnagar (University of Tübingen, MPI informatik); Jan E. Lenssen (TU Dortmund); Gerard Pons-Moll (University of Tübingen) | N/A | N/A |
| HRDA: Context-Aware High-Resolution Domain-Adaptive Semantic Segmentation | Lukas Hoyer (ETH Zurich)*; Dengxin Dai (ETH Zurich); Luc Van Gool (ETH Zurich) | N/A | N/A |
| Combating Label Distribution Shift for Active Domain Adaptation | Sehyun Hwang (POSTECH)*; Sohyun Lee (POSTECH); Sungyeon Kim (POSTECH); Jungseul Ok (POSTECH); Suha Kwak (POSTECH) | N/A | N/A |
| GIPSO: Geometrically Informed Propagation for Online Adaptation in 3D LiDAR Segmentation | Cristiano Saltori (University of Trento)*; Evgeny Krivosheev (University of Trento); Stéphane Lathuilière (Telecom-Paris); Nicu Sebe (University of Trento); Fabio Galasso (Sapienza University); Giuseppe Fiameni (NVIDIA); Elisa Ricci (University of Trento); Fabio Poiesi (Fondazione Bruno Kessler) | N/A | N/A |
| SuperLine3D: Self-supervised Line Segmentation and Description for LiDAR Point Cloud | Xiangrui Zhao (Zhejiang University)*; Sheng Yang (Alibaba Group); Tianxin Huang (Zhejiang University); Jun Chen (Zhejiang University); Teng Ma (Alibaba Group); Mingyang Li (Alibaba A.I. Labs); Yong Liu (Zhejiang University) | N/A | N/A |
| Efficient Meta-Tuning for Content-aware Neural Video Delivery | Xiaoqi Li (Columbia University in the City of New York)*; Jiaming Liu (Peking University); Shizun Wang (Beijing University of Posts and Telecommunications); Cheng Lyu (Beijing University of Posts and Telecommunications); Ming Lu (Intel Labs China); Yurong Chen (Intel Labs China); Anbang Yao (Intel Labs China); Yandong Guo (OPPO Research Institute); Shanghang Zhang (University of California, Berkeley) | N/A | N/A |
| PoseTrans: A Simple Yet Effective Pose Transformation Augmentation for Human Pose Estimation | Wentao Jiang (Beihang University)*; Sheng Jin (The University of Hong Kong); Wentao Liu (Sensetime); Chen Qian (SenseTime); Ping Luo (The University of Hong Kong); Si Liu (Beihang University) | N/A | N/A |
| 3D-Aware Semantic-Guided Generative Model for Human Synthesis | Jichao Zhang (University of Trento)*; Enver Sangineto (University of Modena and Reggio Emilia); Hao Tang (ETH Zurich); Aliaksandr Siarohin (Snapchat); Zhun Zhong (University of Trento); Nicu Sebe (University of Trento); Wei Wang (EPFL) | N/A | N/A |
| Improving Covariance Conditioning of the SVD Meta-layer by Orthogonality | Yue Song (University of Trento)*; Nicu Sebe (University of Trento); Wei Wang (EPFL) | N/A | N/A |
| CoSMix: Compositional Semantic Mix for Domain Adaptation in 3D LiDAR Segmentation | Cristiano Saltori (University of Trento)*; Fabio Galasso (Sapienza University); Giuseppe Fiameni (NVIDIA); Nicu Sebe (University of Trento); Elisa Ricci (University of Trento); Fabio Poiesi (Fondazione Bruno Kessler) | N/A | N/A |
| Streaming Multiscale Deep Equilibrium Models | Can Ufuk Ertenli (Middle East Technical University)*; Emre Akbas (METU); Ramazan Gokberk Cinbis (METU) | N/A | N/A |
| AvatarCap: Animatable Avatar Conditioned Monocular Human Volumetric Capture | Zhe Li (Tsinghua University)*; Zerong Zheng (Tsinghua University); Hongwen Zhang (Tsinghua University); Chaonan Ji (Tsinghua University); Yebin Liu (Tsinghua University) | N/A | N/A |
| Hierarchical Average Precision Training for Pertinent Image Retrieval | Elias Ramzi (Conservatoire National des Arts et Métiers)*; Nicolas Audebert (Cnam); Nicolas Thome (CNAM, Paris); Clément Rambour (Cnam); Xavier B Bitot (Coexya) | N/A | N/A |
| Fashionformer: A Simple, Effective and Unified Baseline for Human Fashion Segmentation and Recognition | Shilin Xu (Peking University); Xiangtai Li (Peking University)*; Jingbo Wang (The Chinese University of Hong Kong); Guangliang Cheng (Sensetime Group Limited); Yunhai Tong (Peking University); Dacheng Tao (JD.com) | N/A | N/A |
| Out-of-Distribution Detection with Semantic Mismatch under Masking | Yijun Yang (The Chinese University of Hong Kong)*; Ruiyuan Gao (The Chinese University of Hong Kong); Qiang Xu (The Chinese University of Hong Kong) | N/A | N/A |
| Target-absent Human Attention | Zhibo Yang (Stony Brook University)*; Sounak Mondal (Stony Brook University); Seoyoung Ahn (Stony Brook University); Gregory Zelinsky (Stony Brook University); Minh Hoai (Stony Brook University); Dimitris Samaras (Stony Brook University) | N/A | N/A |
| Reference-based Image Super-Resolution with Deformable Attention Transformer | Jiezhang Cao (ETH Zürich)*; Jingyun Liang (ETH Zurich); Kai Zhang (ETH Zurich); Yawei Li (ETH Zurich); Yulun Zhang (ETH Zurich); Wenguan Wang (Eidgenössische Technische Hochschule Zürich); Luc Van Gool (ETH Zurich) | N/A | N/A |
| Cross-Attention of Disentangled Modalities for 3D Human Mesh Recovery with Transformers | Junhyeong Cho (POSTECH)*; Kim Youwang (POSTECH); Tae-Hyun Oh (POSTECH) | N/A | N/A |
| Learning to Generate Realistic LiDAR Point Cloud | Vlas Zyrianov (University of Illinois Urbana Champaign); Xiyue Zhu (University of Illinois); Shenlong Wang (UIUC)* | N/A | N/A |
| GeoRefine: Self-Supervised Online Depth Refinement for Accurate Dense Mapping | Pan Ji (OPPO US Research Center)*; Qingan Yan (OPPO US Research Center); Yuxin Ma (Wing LLC); Yi Xu (OPPO US Research Center) | N/A | N/A |
| Transform your Smartphone into a DSLR Camera: Learning the ISP in the Wild | Ardhendu Shekhar Tripathi (ETH Zurich)*; Martin Danelljan (ETH Zurich); Samarth Shukla (ETH Zurich); Radu Timofte (University of Wurzburg & ETH Zurich); Luc Van Gool (ETH Zurich) | N/A | N/A |
| Uncertainty-Based Spatial-Temporal Attention for Online Action Detection | Hongji Guo (Rensselaer Polytechnic Institute)*; Zhou Ren (Wormpex AI Research); Yi Wu (Wormpex AI Research); Gang Hua (Wormpex AI Research); Qiang Ji (Rensselaer Polytechnic Institute) | N/A | N/A |
| Video Question Answering with Iterative Video-Text Co-Tokenization | AJ Piergiovanni (Google)*; Kairo Morton (Massachusetts Institute of Technology); Weicheng Kuo (Google); Michael S Ryoo (Google; Stony Brook University); Anelia Angelova (Google) | N/A | N/A |
| LaTeRF: Label and Text Driven Object Radiance Fields | Ashkan Mirzaei (University of Toronto)*; Yash Mukund Kant (University of Toronto); Jonathan Kelly (University of Toronto); Igor Gilitschenski (University of Toronto) | N/A | N/A |
| Temporally Consistent Semantic Video Editing | Yiran Xu (University of Maryland, College Park)*; Badour A Sh AlBahar (Virginia Tech); Jia-Bin Huang (Facebook) | N/A | N/A |
| SPot-the-Difference Self-Supervised Pre-training for Anomaly Detection and Segmentation | Yang Zou (Amazon AI)*; Jongheon Jeong (KAIST); Latha Pemula (Amazon); Dongqing Zhang (Amazon); Onkar Dabeer (Amazon) | N/A | N/A |
| Exploring Plain Vision Transformer Backbones for Object Detection | Yanghao Li (Facebook AI Research)*; Hanzi Mao (Facebook AI Research); Ross Girshick (FAIR); Kaiming He (Facebook AI Research) | N/A | N/A |
| Fine-grained Egocentric Hand-Object Segmentation: Dataset, Model, and Applications | Lingzhi Zhang (University of Pennsylvania)*; Shenghao Zhou (University of Pennsylvania); Simon Stent (Toyota Research Institute); Jianbo Shi (University of Pennsylvania) | N/A | N/A |
| Is It Necessary to Transfer Temporal Knowledge for Domain Adaptive Video Semantic Segmentation? | Xinyi Wu (University of South Carolina); Zhenyao Wu (University of South Carolina)*; Jin Wan (Beijing Jiaotong University); Lili Ju (University of South Carolina); Song Wang (University of South Carolina) | N/A | N/A |
| GIMO: Gaze-Informed Human Motion Prediction in Context | Yang Zheng (Tsinghua University); Yanchao Yang (Stanford University)*; Kaichun Mo (Stanford); Jiaman Li (University of Southern California); Tao Yu (Tsinghua University); Yebin Liu (Tsinghua University); Karen Liu (Stanford); Leonidas Guibas (Stanford University) | N/A | N/A |
| Error Compensation Framework for Flow-Guided Video Inpainting | Jaeyeon Kang (Yonsei University); Seoung Wug Oh (Adobe Research); Seon Joo Kim (Yonsei University)* | N/A | N/A |
| Decomposing The Tangent of Occluding Boundaries According to Curvatures and Torsions | Huizong Yang (Georgia Institute of Technology)*; Anthony Yezzi (Georgia Institute of Technology) | N/A | N/A |
| CPrune: Compiler-Informed Model Pruning for Efficient Target-Aware DNN Execution | Taeho Kim (University of Colorado at Boulder)*; Yongin Kwon (Electronics and Telecommunications Research Institute); Jemin Lee (Electronics and Telecommunications Research Institute); Taeho Kim (Electronics and Telecommunications Research Institute); Sangtae Ha (University of Colorado at Boulder) | N/A | N/A |
| Scraping Textures from Natural Images for Synthesis and Editing | Xueting Li (University of California, Merced)*; Xiaolong Wang (UCSD); Ming-Hsuan Yang (University of California at Merced); Alexei A Efros (UC Berkeley); Sifei Liu (NVIDIA) | N/A | N/A |
| Self-supervised Learning of Visual Graph Matching | Chang Liu (Shanghai Jiao Tong University); Shaofeng Zhang (Shanghai Jiao Tong University); Xiaokang Yang (Shanghai Jiao Tong University of China); Junchi Yan (Shanghai Jiao Tong University)* | N/A | N/A |
| Disentangling Architecture and Training for Optical Flow | Deqing Sun (Google)*; Charles Herrmann (Google); Fitsum Reda (Google); Michael Rubinstein (Google); David J Fleet (University of Toronto); William T Freeman (Google) | N/A | N/A |
| PointFix: Learning to Fix Domain Bias for Robust Online Stereo Adaptation | Kwonyoung Kim (Yonsei University); JungIn Park (Yonsei University); Jiyoung Lee (NAVER AI Lab); Dongbo Min (Ewha Womans University); Kwanghoon Sohn (Yonsei Univ.)* | N/A | N/A |
| Teaching Where to Look: Attention Similarity Knowledge Distillation for Low Resolution Face Recognition | Sungho Shin (Gwangju Institute of Science and Technology); Joosoon Lee (Gwangju Institute of Science and Technology); Junseok Lee (GIST (Gwangju Institute of Science and Technology)); Yeonguk Yu (Gwangju Institute of Science and Technology); Kyoobin Lee (Gwangju Institute of Science and Technology)* | N/A | N/A |
| Iwin: Human-Object Interaction Detection via Transformer with Irregular Windows | Danyang Tu (Shanghai Jiao Tong University)*; Xiongkuo Min (Shanghai Jiao Tong University); Huiyu Duan (Shanghai Jiao Tong University); Guodong Guo (Baidu); Guangtao Zhai (Shanghai Jiao Tong University); Wei Shen (Shanghai Jiao Tong University) | N/A | N/A |
| Single Stage Virtual Try-on via Deformable Attention Flows | Shuai Bai (Alibaba Group)*; Huiling Zhou (Alibaba); Zhikang Li (DAMO Academy, Alibaba Group); Chang Zhou (Alibaba Group); Hongxia Yang (Alibaba Group) | N/A | N/A |
| Learning Deep Non-Blind Image Deconvolution Without Ground Truths | Yuhui Quan (South China University of Technology)*; Zhuojie Chen (South China University of Technology); Huan Zheng (National University of Singapore); Hui Ji (National University of Singapore) | N/A | N/A |
| Rethinking Zero-shot Action Recognition: Learning from Latent Atomic Actions | Yijun Qian (Carnegie Mellon University)*; Lijun Yu (Carnegie Mellon University); Wenhe Liu (Carnegie Mellon University); Alexander Hauptmann (Carnegie Mellon University) | N/A | N/A |
| NeuRIS: Neural Reconstruction of Indoor Scenes Using Normal Priors | Jiepeng Wang (The University of Hong Kong); Peng Wang (The University of Hong Kong); Xiaoxiao Long (The University of Hong Kong); Christian Theobalt (MPI Informatik); Taku Komura (The University of Hong Kong); Lingjie Liu (Max Planck Institute for Informatics); Wenping Wang (The University of Hong Kong)* | N/A | N/A |
| Rethinking Data Augmentation for Robust Visual Question Answering | Long Chen (Columbia University)*; Yuhang Zheng (Zhejiang University); Jun Xiao (Zhejiang University) | N/A | N/A |
| Dual-Domain Self-Supervised Learning and Model Adaption for Deep Compressive Imaging | Yuhui Quan (South China University of Technology)*; Xinran Qin (South China University of Technology); Tongyao Pang (National University of Singapore); Hui Ji (National University of Singapore) | N/A | N/A |
| Explicit Image Caption Editing | Zhen Wang (Zhejiang University); Long Chen (Columbia University)*; Wenbo Ma (Zhejiang University); Guangxing Han (Columbia University); Yulei Niu (Columbia University); Jian Shao (Zhejiang University); Jun Xiao (Zhejiang University) | N/A | N/A |
| SphereFed: Hyperspherical Federated Learning | Xin Dong (Harvard University)*; Sai Qian Zhang (Harvard University); Ang Li (Google DeepMind); H.T. Kung (Harvard University) | N/A | N/A |
| Local Color Distributions Prior for Image Enhancement | Haoyuan Wang (City University of Hong Kong)*; Ke Xu (City University of Hong Kong); Rynson W.H. Lau (City University of Hong Kong) | N/A | N/A |
| Teaching with Soft Label Smoothing for Mitigating Noisy Labels in Facial Expressions | Tohar Lukov (National University of Singapore)*; Na Zhao (NUS); Gim Hee Lee (National University of Singapore); Ser-Nam Lim (Facebook AI) | N/A | N/A |
| Multi-Modal Masked Pre-Training for Monocular Panoramic Depth Completion | Zhiqiang Yan (Nanjing University of Science and Technology)*; Xiang Li (Nanjing University of Science and Technology); Kun Wang (Nanjing University of Science and Technology); Zhenyu Zhang (Tencent); Jun Li (Nanjing University of Science and Technology); Jian Yang (Nanjing University of Science and Technology) | N/A | N/A |
| 2D Amodal Instance Segmentation Guided by 3D Shape Prior | Zhixuan Li (Peking University); Weining Ye (Peking University); Tingting Jiang (Peking University)*; Tiejun Huang (Peking University) | N/A | N/A |
| How to Synthesize a Large-Scale and Trainable Micro-Expression Dataset? | Yuchi Liu (Australian National University)*; Zhongdao Wang (Tsinghua University); Tom Gedeon (The Australian National University); Liang Zheng (Australian National University) | N/A | N/A |
| HEAD: HEtero-Assists Distillation for Heterogeneous Object Detectors | Luting Wang (Beihang University)*; Xiaojie Li (SenseTime); Yue Liao (Beihang University); Zeren Jiang (ETH Zurich); Jianlong Wu (Shandong University); Fei Wang (University of Science and Technology of China); Chen Qian (SenseTime); Si Liu (Beihang University) | N/A | N/A |
| Meta Spatio-Temporal Debiasing for Video Scene Graph Generation | Li Xu (Singapore University of Technology and Design)*; Haoxuan Qu (Singapore University of Technology and Design); Jason Kuen (Adobe Research); Jiuxiang Gu (Adobe Research); Jun Liu (Singapore University of Technology and Design) | N/A | N/A |
| A Sliding Window Scheme for Online Temporal Action Localization | Young Hwi Kim (Yonsei University); Hyolim Kang (Yonsei University); Seon Joo Kim (Yonsei University)* | N/A | N/A |
| Ultra-high-resolution unpaired stain transformation via Kernelized Instance Normalization | Ming-Yang Ho (aetherAI)*; Min-Sheng Wu (aetherAI); Che-Ming Wu (aetherAI) | N/A | N/A |
| SESS: Saliency Enhancing with Scaling and Sliding | Osman Tursun (Queensland University of Technology)*; Simon Denman (Queensland University of Technology, Australia); Sridha Sridharan (QUT); Clinton Fookes (Queensland University of Technology) | N/A | N/A |
| Data Efficient 3D Learner via Knowledge Transferred from 2D Model | Ping-Chung Yu (National Tsing Hua University)*; Cheng Sun (National Tsing Hua University); Min Sun (NTHU) | N/A | N/A |
| MeshMAE: Masked Autoencoders for 3D Mesh Data Analysis | Yaqian Liang (Wuhan University); Shanshan Zhao (JD.COM); Baosheng Yu (The University of Sydney); Jing Zhang (The University of Sydney); Fazhi He (Wuhan University)* | N/A | N/A |
| ERA: Expert Retrieval and Assembly for Early Action Prediction | Lin Geng Foo (Singapore University of Technology and Design)*; Tianjiao Li (Singapore University of Technology and Design); Hossein Rahmani (Lancaster University); Qiuhong Ke (Monash University); Jun Liu (Singapore University of Technology and Design) | N/A | N/A |
| Mining Cross-Person Cues for Body-Part Interactiveness Learning in HOI Detection | Xiaoqian Wu (Shanghai Jiao Tong University); Yong-Lu Li (Shanghai Jiao Tong University); Xinpeng Liu (Shanghai Jiao Tong University); Junyi Zhang (Shanghai Jiao Tong University); Yuzhe Wu (DongHua University); Cewu Lu (Shanghai Jiao Tong University)* | N/A | N/A |
| Improving GANs for Long-Tailed Data through Group Spectral Regularization | Harsh Rangwani (Indian Institute of Science)*; Naman Jaswani (Indian Institute of Science); Tejan Karmali (Indian Institute of Science, Bengaluru); Varun Jampani (Google); Venkatesh Babu Radhakrishnan (Indian Institute of Science) | N/A | N/A |
| Hierarchical Semantic Regularization of Latent Spaces in StyleGANs | Tejan Karmali (Indian Institute of Science, Bengaluru)*; Rishubh Parihar (Indian Institute of Science, Bangalore); Susmit Agrawal (Indian Institute of Science); Harsh Rangwani (Indian Institute of Science); Varun Jampani (Google); Maneesh K Singh (Motive Technologies); Venkatesh Babu Radhakrishnan (Indian Institute of Science) | N/A | N/A |
| Symmetry Regularization and Saturating Nonlinearity for Robust Quantization | Sein Park (POSTECH); Yeongsang Jang (POSTECH); Eunhyeok Park (POSTECH)* | N/A | N/A |
| IntereStyle: Encoding an Interest Region for Robust StyleGAN Inversion | Seung Jun Moon (KAIST)*; Gyeong-Moon Park (Kyung Hee University) | N/A | N/A |
| Improving RGB-D Point Cloud Registration by Learning Multi-scale Local Linear Transformation | Ziming Wang (Beihang University); Xiaoliang Huo (Beihang University); Zhenghao Chen (University of Sydney); Jing Zhang (Beihang University); Lu Sheng (Beihang University)*; Dong Xu (The University of Hong Kong) | N/A | N/A |
| Learning Dynamic Facial Radiance Fields for Few-Shot Talking Head Synthesis | Shuai Shen (Tsinghua University); Wanhua Li (Tsinghua University); Zheng Zhu (Tsinghua University); Yueqi Duan (Tsinghua University); Jie Zhou (Tsinghua University); Jiwen Lu (Tsinghua University)* | N/A | N/A |
| StyleLight: HDR Panorama Generation for Lighting Estimation and Editing | Guangcong Wang (Nanyang Technological University)*; Yinuo Yang (Nanyang Technological University); Chen Change Loy (Nanyang Technological University); Ziwei Liu (Nanyang Technological University) | N/A | N/A |
| You Should Look at All Objects | Zhenchao Jin (University of Science and Technology of China)*; Dongdong Yu (ByteDance Inc.); Luchuan Song (University of Science and Technology of China); Zehuan Yuan (Bytedance.Inc); Lequan Yu (The University of Hong Kong) | N/A | N/A |
| BRNet: Exploring Comprehensive Features for Monocular Depth Estimation | Wencheng Han (Beijing Institute of Technology)*; Junbo Yin (Beijing Institute of Technology); Xiaogang Jin (Zhejiang University); dai xiangdong (oppo); Jianbing Shen (Inception Institute of Artificial Intelligence) | N/A | N/A |
| CoupleFace: Relation Matters for Face Recognition Distillation | Jiaheng Liu (Beihang University)*; Haoyu Qin (SenseTime); Yichao Wu (Sensetime Group Limited); Jinyang Guo (The University of Sydney); Ding Liang (Sensetime Group Limited); Ke Xu (Beihang University) | N/A | N/A |
| Collaborating Domain-shared and Target-specific Feature Clustering for Cross-domain 3D Action Recognition | Qinying Liu (University of Science and Technology of China); Zilei Wang (University of Science and Technology of China)* | N/A | N/A |
| Adaptive Spatial-BCE Loss for Weakly Supervised Semantic Segmentation | Tong Wu (Beijing Institute of Technology); Guangyu Ryan Gao (Beijing Institute of Technology)*; junshi huang (Meituan); Xiaolin Wei (Meituan); Xiaoming Wei (Meituan); Chi Harold Liu (Beijing Institute of Technology) | N/A | N/A |
| Multi-Person 3D Pose and Shape Estimation via Inverse Kinematics and Refinement | Junuk Cha (UNIST)*; Muhammad Saqlain (Ulsan National Institute of Science and Technology); GeonU Kim (UNIST); Mingyu Shin (ULSAN NATIONAL INSTITUTE OF SCIENCE AND TECHNOLOGY); Seungryul Baek (UNIST) | N/A | N/A |
| Explaining Deepfake Detection by Analysing Image Matching | Shichao Dong (Megvii); Jin Wang (Megvii); Haoqiang Fan (Megvii Inc(face++)); Jiajun Liang (Megvii); Renhe Ji (Megvii)* | N/A | N/A |
| L-CoDer: Language-based Colorization with Color-object Decoupling Transformer | Zheng Chang (Beijing University of Posts and Telecommunications); Shuchen Weng (Peking University)*; Yu Li (International Digital Economy Academy); Si Li (Beijing University of Posts and Telecommunications); Boxin Shi (Peking University) | N/A | N/A |
| GitNet: Geometric Prior-based Transformation for Birds-Eye-View Segmentation | Shi Gong (Huazhong University of Science and Technology); Xiaoqing Ye (Baidu Inc.); Xiao Tan (Baidu Inc.); Jingdong Wang (Baidu); Errui Ding (Baidu Inc.); Yu Zhou (Huazhong University of Science and Technology)*; Xiang Bai (Huazhong University of Science and Technology) | N/A | N/A |
| Unsupervised Deep Multi-Shape Matching | Dongliang Cao (Technical University of Munich); Florian Bernard (University of Bonn)* | N/A | N/A |
| GaitEdge: Beyond Plain End-to-end Gait Recognition for Better Practicality | Junhao Liang (Southern University of Science and Technology in China)*; Chao Fan (SUSTech); Saihui Hou (Beijing Normal University); Chuanfu Shen (Southern University of Science and Technology); Yongzhen Huang (School of Artificial Intelligence, Beijing Normal University); Shiqi Yu (Southern University of Science and Technology) | N/A | N/A |
| EAutoDet: Efficient Architecture Search for Object Detection | Xiaoxing Wang (Shanghai Jiao Tong University); Jiale Lin (Shanghai Jiao Tong University); Juanping Zhao (Guangdong OPPO Mobile Telecommunications Co., Ltd.); Xiaokang Yang (Shanghai Jiao Tong University of China); Junchi Yan (Shanghai Jiao Tong University)* | N/A | N/A |
| A Max-Flow based Approach for Neural Architecture Search | Chao Xue (beijing university of posts and telecommunications)*; Xiaoxing Wang (Shanghai Jiao Tong University); Junchi Yan (Shanghai Jiao Tong University); Chun-Guang Li (Beijing University of Posts & Telecommunications) | N/A | N/A |
| Can Shuffling Video Benefit Temporal Bias Problem: A Novel Training Framework for Temporal Grounding | Jiachang Hao (Beijing University of Posts and Telecommunications)*; Haifeng Sun (Beijing university of posts and telecommunications); Pengfei Ren (Beijing University of Posts and Telecommunications); Jingyu Wang (Beijing University of Posts and Telecommunications); Qi Qi (Beijing University of Posts and Telecommunications); Jianxin Liao (beijing university of posts and telecommunications) | N/A | N/A |
| tSF: Transformer-based Semantic Filter for Few-Shot Learning | Jinxiang Lai (Tencent)*; Siqian Yang (Tencent); Wenlong Liu (Tencent); Yi Zeng (Tencent); Zhongyi Huang (Tencent); Wenlong Wu (Tencent); Jun Liu (Tencent); Bin-Bin Gao (Tencent); Chengjie Wang (Tencent; Shanghai Jiao Tong University) | N/A | N/A |
| Dense Gaussian Processes for Few-Shot Segmentation | Joakim Johnander (Linköping University)*; Johan Edstedt (Linköping University); Fahad Shahbaz Khan (MBZUAI); Michael Felsberg (Linköping University); Martin Danelljan (ETH Zurich) | N/A | N/A |
| Adversarial Feature Augmentation for Cross-domain Few-shot Classification | Yanxu Hu (Sun Yat-sen University); Andy J Ma (Sun Yat-sen University)* | N/A | N/A |
| Real-Time Neural Character Rendering with Pose-Guided Multiplane Images | Hao Ouyang (HKUST)*; Bo Zhang (Microsoft Research Asia); Pan Zhang (Shanghai AI Laboratory); Hao Yang (Microsoft Research Asia); Dong Chen (Microsoft Research Asia); Jiaolong Yang (Microsoft Research); Qifeng Chen (HKUST); Fang Wen (Microsoft Research Asia) | N/A | N/A |
| Constructing Balance from Imbalance for Long-tailed Image Recognition | Yue Xu (Shanghai Jiao Tong University); Yong-Lu Li (Shanghai Jiao Tong University); Jiefeng Li (Shanghai Jiao Tong University); Cewu Lu (Shanghai Jiao Tong University)* | N/A | N/A |
| SparseNeuS: Fast Generalizable Neural Surface Reconstruction from Sparse Views | Xiaoxiao Long (The University of Hong Kong)*; Cheng Lin (Tencent); Peng Wang (The University of Hong Kong); Taku Komura (The University of Hong Kong); Wenping Wang (The University of Hong Kong) | N/A | N/A |
| Dual Perspective Network for Audio Visual Event Localization | Varshanth Rao (Huawei Technologies)*; Md Ibrahim Khalil (Huawei Noah’s Ark Laboratory); Haoda Li (University of California, Berkeley); Peng Dai (Huawei Technologies Inc.Canada); Juwei Lu (Huawei Noah’s Ark Lab) | N/A | N/A |
| SiamDoGe: Domain Generalizable Semantic Segmentation using Siamese Network | Zhenyao Wu (University of South Carolina)*; Xinyi Wu (University of South Carolina); Xiaoping Zhang (Wuhan University); Song Wang (University of South Carolina); Lili Ju (University of South Carolina) | N/A | N/A |
| Is Appearance Free Action Recognition Possible? | Filip Ilic (Graz University of Technology)*; Rick Wildes (York University); Thomas Pock (Graz University of Technology) | N/A | N/A |
| Detecting Twenty-thousand Classes using Image-level Supervision | Xingyi Zhou (The University of Texas at Austin)*; Rohit Girdhar (Facebook AI Research); Armand Joulin (Facebook AI Research); Philipp Kraehenbuehl (UT Austin); Ishan Misra (Facebook AI Research) | N/A | N/A |
| DCL-Net: Deep Correspondence Learning Network for 6D Pose Estimation | Hongyang Li (South China University of Technology)*; Jiehong Lin (South China University of Technology); Kui Jia (South China University of Technology) | N/A | N/A |
| Learning Cross-Video Neural Representations for High-Quality Frame Interpolation | Wentao Shangguan (Washington University in St Louis); Yu Sun (Washington University in St. Louis); Weijie Gan (Washington University in St. Louis); Ulugbek S. Kamilov (Washington University in St. Louis)* | N/A | N/A |
| Learning Visibility for Robust Dense Human Body Estimation | Chun-Han Yao (University of California at Merced)*; Jimei Yang (Adobe); Duygu Ceylan (Adobe Research); Yi Zhou (Adobe Research); Yang Zhou (Adobe Research); Ming-Hsuan Yang (University of California at Merced) | N/A | N/A |
| Texturify: Generating Textures on 3D Shape Surfaces | Yawar Siddiqui (Technical University of Munich)*; Justus Thies (Max Planck Institute for Intelligent Systems); Fangchang Ma (Apple Inc.); Qi Shan (Apple Inc.); Matthias Niessner (Technical University of Munich); Angela Dai (Technical University of Munich) | N/A | N/A |
| Unsupervised Selective Labeling for More Effective Semi-Supervised Learning | Xudong Wang (UC Berkeley / ICSI); Long Lian (UC Berkeley / ICSI); Stella X Yu (UC Berkeley / ICSI)* | N/A | N/A |
| Reliable Visual Question Answering: Abstain Rather Than Answer Incorrectly | Spencer Whitehead (Meta AI)*; Suzanne Petryk (UC Berkeley); Vedaad Shakib (UC Berkeley); Joseph E Gonzalez (UC Berkeley); Trevor Darrell (UC Berkeley); Anna Rohrbach (UC Berkeley); Marcus Rohrbach (Facebook AI Research) | N/A | N/A |
| Studying Bias in GANs through the Lens of Race | Vongani H Maluleke (University of California, Berkeley); Neerja Thakkar (University of California, Berkeley)*; Tim Brooks (UC Berkeley); Ethan Weber (UC Berkeley); Trevor Darrell (UC Berkeley); Alexei A Efros (UC Berkeley); Angjoo Kanazawa (University of California Berkeley); Devin Guillory (UC Berkeley) | N/A | N/A |
| On Multi-Domain Long-Tailed Recognition, Imbalanced Domain Generalization and Beyond | Yuzhe Yang (MIT)*; Hao Wang (Rutgers University); Dina Katabi (Massachusetts Institute of Technology) | N/A | N/A |
| Disentangling Object Motion and Occlusion for Unsupervised Multi-frame Monocular Depth | Ziyue Feng (Clemson University)*; Liang Yang (Apple Inc); Longlong Jing (Waymo LLC); Haiyan Wang (The City College of New York); YingLi Tian (City University of New York); Bing Li (Clemson University) | N/A | N/A |
| Autoregressive 3D Shape Generation via Canonical Mapping | An-Chieh Cheng (National Tsing Hua University); Xueting Li (University of California, Merced); Sifei Liu (NVIDIA)*; Min Sun (NTHU); Ming-Hsuan Yang (University of California at Merced) | N/A | N/A |
| Learning Continuous Implicit Representation for Near-Periodic Patterns | Bowei Chen (CMU)*; Tiancheng Zhi (ByteDance); Martial Hebert (cmu); Srinivasa Narasimhan (Carnegie Mellon University, USA) | N/A | N/A |
| Robust Landmark-based Stent Tracking in X-ray Fluoroscopy | Luojie Huang (Johns Hopkins University); Yikang Liu (United Imaging Intelligence America); Li Chen (University of Washington); Eric Z. Chen (United Imaging Intelligence America); Xiao Chen (United Imaging Intelligence America); Shanhui Sun (United Imaging Intelligence America)* | N/A | N/A |
| Depth Field Networks for Generalizable Multi-view Scene Representation | Vitor Guizilini (Toyota Research Institute)*; Igor Vasiljevic (Toyota Research Institute); Jiading Fang (Toyota Technological Institute at Chicago); Rareș A Ambruș (Toyota Research Institute); Greg Shakhnarovich (Toyota Technological Institute at Chicago); Matthew Walter (Toyota Technological Institute at Chicago); Adrien Gaidon (Toyota Research Institute) | N/A | N/A |
| Max Pooling with Vision Transformers reconciles class and shape in weakly supervised semantic segmentation | Simone Rossetti (Sapienza University); Damiano Zappia (Deepplants S.r.l.); Marta Sanzari (Sapienza University of Rome); Marco Schaerf (Sapienza University of Rome); fiora pirri (University of Rome, Sapienza)* | N/A | N/A |
| GRIT: Faster and Better Image captioning Transformer Using Dual Visual Features | Van-Quang Nguyen (Tohoku University)*; Masanori Suganuma (Tohoku University / RIKEN AIP); Takayuki Okatani (Tohoku University/RIKEN AIP) | N/A | N/A |
| Learning Semantic Correspondence with Sparse Annotations | Shuaiyi Huang (University of Maryland, College Park)*; Luyu Yang (University of Maryland, College Park); Bo He (University of Maryland); Songyang Zhang (Shanghai AI Laboratory); Xuming He (ShanghaiTech University); Abhinav Shrivastava (University of Maryland) | N/A | N/A |
| A Real World Dataset for Multi-view 3D Reconstruction | Rakesh Shrestha (Simon Fraser University)*; Siqi Hu (Alibaba damo academy); Minghao Gou (Shanghai Jiao Tong University); Ziyuan Liu (Huawei group); Ping Tan (Simon Fraser University) | N/A | N/A |
| Social ODE: Multi-Agent Trajectory Forecasting with Neural Ordinary Differential Equations | Song Wen (Rutgers University)*; Hao Wang (Rutgers University); Dimitris N. Metaxas (Rutgers) | N/A | N/A |
| 3D Instances as 1D Kernels | Yizheng Wu (Huazhong Univ. of Sci.&Tech.); Min Shi (Huazhong University of Science and Technology); Shuaiyuan Du (Huazhong Univ. of Sci.&Tech.); Hao Lu (Huazhong University of Science and Technology); Zhiguo Cao (Huazhong Univ. of Sci.&Tech.)*; Weicai Zhong (Huawei CBG Consumer Cloud Service Big Data Platform Dept.) | N/A | N/A |
| Context-Aware Streaming Perception in Dynamic Environments | Gur-Eyal Sela (UC Berkeley)*; Ionel Gog (UC Berkeley); Justin Wong (UC Berkeley); Kumar Krishna Agrawal (UC Berkeley); Xiangxi Mo (UC Berkeley); Sukrit Kalra (UC Berkeley); Peter Schafhalter (UC Berkeley); Eric Leong (UC Berkeley); Xin Wang (Microsoft Research); Bharathan Balaji (Amazon); Joseph E Gonzalez (UC Berkeley); Ion Stoica (UC Berkeley) | N/A | N/A |
| PointTree: Transformation-Robust Point Cloud Encoder with Relaxed K-D Trees | Jun-Kun Chen (University of Illinois at Urbana-Champaign)*; Yu-Xiong Wang (University of Illinois at Urbana-Champaign) | N/A | N/A |
| Dense Siamese Network for Dense Unsupervised Learning | Wenwei Zhang (NTU)*; Jiangmiao Pang (CUHK); Kai Chen (SenseTime Research); Chen Change Loy (Nanyang Technological University) | N/A | N/A |
| Uncertainty-aware Multi-modal Learning via Cross-modal Random Network Prediction | Hu Wang (the University of Adelaide)*; Jianpeng Zhang (Northwestern Polytechnical University); Yuanhong Chen (University of Adelaide); Congbo Ma (The University of Adelaide); Jodie C Avery (University of Adelaide); Mary L Hull (University of Adelaide); Gustavo Carneiro (University of Adelaide) | N/A | N/A |
| Enhanced Accuracy and Robustness via Multi-Teacher Adversarial Distillation | Shiji Zhao (Beihang University); Jie Yu (Beihang University); Zhenlong Sun (Tencent Technology Co.Ltd); Bo Zhang (WeChat Search Application Department, Tencent); Xingxing Wei (Beihang University)* | N/A | N/A |
| End-to-end graph-constrained vectorized floorplan generation with panoptic refinement | Jiachen Liu (Pennsylvania State University)*; Yuan Xue (Johns Hopkins University); Jose P. Duarte (Penn State University); Krishnendra Shekhawat (BITS Pilani); Zihan Zhou (Manycore Tech Inc.); Sharon Xiaolei Huang (The Pennsylvania State University) | N/A | N/A |
| Context Enhanced Stereo Transformer | weiyu Guo (University of Chinese Academy of Sciences)*; Zhaoshuo Li (Johns Hopkins University); Yongkui Yang (Shenzhen Institute of Advanced Technology,Chinese Academy of Sciences); Zheng Wang (Shenzhen Institutes of Advanced Technology); Russ Taylor (Johns Hopkins University); Mathias Unberath (Johns Hopkins University); Alan Yuille (Johns Hopkins University); Yingwei Li (Johns Hopkins University) | N/A | N/A |
| NSNet: Non-saliency Suppression Sampler for Efficient Video Recognition | Boyang Xia (Institute of Computing Technology, Chinese Academy of Science); Wenhao Wu (Baidu)*; Haoran Wang (Baidu); RUI SU (the University of Sydney); Dongliang He (Baidu); Haosen Yang (Harbin Institute of Technology); Xiaoran Fan (Institute of Computing Technology, Chinese Academy of Sciences); Wanli Ouyang (The University of Sydney) | N/A | N/A |
| Hierarchically Self-Supervised Transformer for Human Skeleton Representation Learning | Yuxiao Chen (Rutgers University)*; Long Zhao (Google Research); Jianbo Yuan (Bytedance); Yu Tian (Rutgers); zhaoyang xia (Rutgers University); Shijie Geng (Rutgers University); Ligong Han (Rutgers University); Dimitris N. Metaxas (Rutgers) | N/A | N/A |
| Few-Shot Video Object Detection | Qi Fan (HKUST)*; Chi-Keung Tang (Hong Kong University of Science and Technology); Yu-Wing Tai (Kuaishou Technology / HKUST) | N/A | N/A |
| Improving the Reliability for Confidence Estimation | Haoxuan Qu (Singapore University of Technology and Design)*; Yanchao Li (Singapore University of Technology and Design); Lin Geng Foo (Singapore University of Technology and Design); Jason Kuen (Adobe Research); Jiuxiang Gu (Adobe Research); Jun Liu (Singapore University of Technology and Design) | N/A | N/A |
| Selective Query-guided Debiasing for Video Corpus Moment Retrieval | Sunjae Yoon (KAIST)*; Ji Woo Hong (KAIST); Eunseop Yoon (KAIST); DaHyun Kim (KAIST); Junyeong Kim (Chung-Ang University); Hee Suk Yoon (KAIST); Chang D. Yoo (KAIST) | N/A | N/A |
| Posterior Refinement on Metric Matrix Improves Generalization in Metric Learning | Mingda Wang (Shanghai Jiao Tong University); Canqian Yang (Shanghai Jiao Tong University); Yi Xu (Shanghai Jiao Tong University)* | N/A | N/A |
| DISP6D: Disentangled Implicit Shape and Pose Learning for Scalable 6D Pose Estimation | Yilin Wen (The University of Hong Kong)*; Xiangyu Li (Brown University); Hao Pan (Microsoft Research); Lei Yang (The University of Hong Kong); Zheng Wang (SUSTech); Taku Komura (The University of Hong Kong); Wenping Wang (The University of Hong Kong) | N/A | N/A |
| Few-shot Image Generation with Mixup-based Distance Learning | Chaerin Kong (Seoul National University); Jeesoo Kim (Naver Webtoon AI); Donghoon Han (Seoul National University); Nojun Kwak (Seoul National University)* | N/A | N/A |
| Data-Free Neural Architecture Search via Recursive Label Calibration | Zechun Liu (Carnegie Mellon University); Zhiqiang Shen (Carnegie Mellon University)*; Yun Long (Google); Eric Xing (MBZUAI, CMU, and Petuum Inc.); Kwang-Ting Cheng (Hong Kong University of Science and Technology); Chas H Leichner (Google) | N/A | N/A |
| Distilling Object Detectors With Global Knowledge | Sanli Tang (Hikvision Research Institute); Zhongyu Zhang (Hikvision Research Institute); Zhanzhan Cheng (Zhejiang University & Hikvision Research Institute)*; Jing Lu (Hikvision Research Institute); Yunlu Xu (Hikvision Research Institute); Yi Niu (Hikvision Research Institute); Fan He (Shanghai Jiao Tong University) | N/A | N/A |
| NEST: Neural Event Stack for Event-based Image Enhancement | Minggui Teng (Peking University)*; Chu Zhou (Peking University); Hanyue Lou (Peking University); Boxin Shi (Peking University) | N/A | N/A |
| Multi-Granularity Distillation Scheme Towards Lightweight Semi-Supervised Semantic Segmentation | Jie Qin (School of Artificial Intelligence, University of Chinese Academy of Sciences; Institute of Automation,Chinese Academy of Sciences)*; Jie Wu (ByteDance Inc); Ming Li (Xiamen University); Xuefeng Xiao (ByteDance Inc); Min Zheng (ByteDance); Xingang Wang (Institute of Automation, CAS) | N/A | N/A |
| A Style-Based GAN Encoder for High Fidelity Reconstruction of Images and Videos | Xu YAO (Telecom ParisTech)*; Alasdair Newson (Telecom Paris); Yann Gousseau (Telecom Paris); PIERRE HELLIER (Interdigital (Technicolor)) | N/A | N/A |
| Unifying Visual Perception by Dispersible Points Learning | Jianming Liang (Beihang University)*; Guanglu Song (Sensetime); Biao Leng (Beihang University); Yu Liu (SenseTime Group LTD) | N/A | N/A |
| Towards High-Fidelity Single-view Holistic Reconstruction of Indoor Scenes | Haolin Liu (The Chinese University of Hong Kong, Shenzhen)*; Yujian Zheng (The Chinese University of Hong Kong, Shenzhen); Guanying CHEN (The Chinese University of Hong Kong, Shenzhen); Shuguang Cui (The Chinese University of Hong Kong, Shenzhen); Xiaoguang Han (Shenzhen Research Institute of Big Data, the Chinese University of Hong Kong (Shenzhen)) | N/A | N/A |
| Multimodal Transformer for Automatic 3D Annotation and Object Detection | Chang Liu (The University of Hong Kong)*; Xiaoyan QIAN (The University of Hong Kong); Binxiao Huang (The University of Hong Kong); Xiaojuan Qi (The University of Hong Kong); Edmund Lam (The University of Hong Kong); Siew-Chong Tan (Nil); Ngai Wong (The University of Hong Kong) | N/A | N/A |
| SP-Net: Slowly Progressing Dynamic Inference Networks | Huanyu Wang (Zhejiang University)*; Wenhu Zhang (Zhejiang University); Shihao Su (Zhejiang University); Hui Wang (Zhejiang University); Zhenwei Miao (DAMO Academy, Alibaba Group); Xin Zhan (DAMO Academy, Alibaba Group); Xi Li (Zhejiang University) | N/A | N/A |
| No Token Left Behind: Explainability-Aided Image Classification and Generation | Roni Paiss (Tel Aviv University, Google); Hila Chefer (Tel Aviv University)*; Lior Wolf (Tel Aviv University, Israel) | N/A | N/A |
| Dynamically Transformed Instance Normalization Network for Generalizable Person Re-Identification | BingLiang Jiao (Northwestern Polytechnical University); Lingqiao Liu (University of Adelaide); Liying Gao (Northwestern Polytechnical University); Guosheng Lin (Nanyang Technological University); Lu Yang (Northwestern Polytechnical University); Shizhou Zhang (Northwestern Polytechnical University); Peng Wang (Northwestern Polytechnical University)*; Yanning Zhang (Northwestern Polytechnical University) | N/A | N/A |
| Editable Indoor Lighting Estimation | Henrique Weber (Université Laval)*; Mathieu Garon (Depix); Jean-Francois Lalonde (Université Laval) | N/A | N/A |
| PseCo: Pseudo Labeling and Consistency Training for Semi-Supervised Object Detection | Gang Li (Nanjing University of Science and Technology)*; Xiang Li (Nanjing University of Science and Technology); Yujie Wang (Sensetime Research); Yichao Wu (Sensetime Group Limited); Ding Liang (Sensetime Group Limited); Shanshan Zhang (Max Planck Institute for Informatics) | N/A | N/A |
| CompNVS: Novel View Synthesis with Scene Completion | Zuoyue Li (ETH Zurich)*; Tianxing Fan (Zhejiang University); Zhenqiang Li (The University of Tokyo); Zhaopeng Cui (Zhejiang University); Yoichi Sato (University of Tokyo); Marc Pollefeys (ETH Zurich / Microsoft); Martin R. Oswald (ETH Zurich) | N/A | N/A |
| Dynamic 3D Scene Analysis by Point Cloud Accumulation | Shengyu Huang (ETH Zürich)*; Zan Gojcic (NVIDIA); Jiahui Huang (Tsinghua University); Andreas Wieser (ETH Zürich); Konrad Schindler (ETH Zurich) | N/A | N/A |
| FakeCLR: Exploring Contrastive Learning for Solving Latent Discontinuity in Data-Efficient GANs | Ziqiang Li (University of Science and Technology of China)*; Chaoyue Wang (JD.com); Heliang Zheng (JD Explore Academy, JD.com); Jing Zhang (The University of Sydney); Bin Li (University of Science and Technology of China) | N/A | N/A |
| Resolving Copycat Problems in Visual Imitation Learning via Residual Action Prediction | Chia-Chi Chuang (Tsinghua University); Donglin Yang (Tsinghua University); Chuan Wen (Tsinghua University)*; Yang Gao (Tsinghua University) | N/A | N/A |
| REALY: Rethinking the Evaluation of 3D Face Reconstruction | Zenghao Chai (Tsinghua University); Haoxian Zhang (Tencent); Jing Ren (ETH Zurich); Di Kang (Tencent); Zhengzhuo Xu (Tsinghua University); Xuefei Zhe (Tencent AI lab); Chun Yuan (Graduate school at ShenZhen,Tsinghua university); Linchao Bao (Tencent AI Lab)* | N/A | N/A |
| TransMatting: Enhancing Transparent Objects Matting with Transformers | huanqia cai (University of Chinese Academy of Sciences)*; Fanglei Xue (University of Chinese Academy of Sciences); Lele Xu (Key Laboratory of Space Utilization, Technology and Engineering Center for space Utilization, Chinese Academy of Sciences.); lili guo (Key Laboratory of Space Utilization, Technology and Engineering Center for space Utilization, Chinese Academy of Sciences.) | N/A | N/A |
| Diverse Image Inpainting with Normalizing Flow | Cairong Wang (Graduate school at Shenzhen, Tsinghua University)*; Yiming M Zhu (Graduate school at ShenZhen,Tsinghua university); Chun Yuan (Graduate school at ShenZhen,Tsinghua university) | N/A | N/A |
| Video Activity Localisation with Uncertainties in Temporal Boundary | Jiabo Huang (Queen Mary University of London)*; Hailin Jin (Adobe Research); Shaogang Gong (Queen Mary University of London); Yang Liu (Peking University) | N/A | N/A |
| SketchSampler: Sketch-based 3D Reconstruction via View-dependent Depth Sampling | Chenjian Gao (Beihang University); Qian Yu (Beihang University)*; Lu Sheng (Beihang University); Yi-Zhe Song (University of Surrey); Dong Xu (The University of Hong Kong) | N/A | N/A |
| Exploring Resolution and Degradation Clues as Self-supervised Signal for Low Quality Object Detection | Ziteng Cui (The University of Tokyo); Yingying Zhu (University of Texas Arlington); Lin Gu (RIKEN,AIP / The University of Tokyo)*; Guo-Jun Qi (Futurewei Technologies); Xiaoxiao Li (The University of British Columbia); Renrui Zhang (Shanghai AI Lab); Zenghui Zhang (Shanghai Jiao Tong university); Tatsuya Harada (The University of Tokyo / RIKEN) | N/A | N/A |
| CP2: Copy-Paste Contrastive Pretraining for Semantic Segmentation | Feng Wang (Tsinghua University)*; Huiyu Wang (JHU); Chen Wei (Johns Hopkins University); Alan Yuille (Johns Hopkins University); Wei Shen (Shanghai Jiao Tong University) | N/A | N/A |
| Learning from Multiple Annotator Noisy Labels via Sample-wise Label Fusion | Zhengqi Gao (MIT)*; Fan-Keng Sun (MIT); Mingran Yang (MIT); Sucheng Ren (South China University of Technology); Zikai Xiong (Massachusetts Institute of Technology); Marc Engeler (Takeda); Antonio Burazer (Takeda); Linda Wildling (Takeda Pharmaceuticals International AG); Luca Daniel (Massachusetts Institute of Technology); Duane Boning (MIT) | N/A | N/A |
| Robust Category-Level 6D Pose Estimation with Coarse-to-Fine Rendering of Neural Features | Wufei Ma (Purdue University)*; Angtian Wang (Johns Hopkins University); Alan Yuille (Johns Hopkins University); Adam Kortylewski (Max Planck Institute for Informatics) | N/A | N/A |
| A Unified Framework for Domain Adaptive Pose Estimation | Donghyun Kim (MIT-IBM Watson AI Lab)*; Kaihong Wang (Boston University); Stan Sclaroff (Boston University); Margrit Betke (Boston University); Kate Saenko (Boston University) | N/A | N/A |
| A Broad Study of Pre-training for Domain Generalization and Adaptation | Donghyun Kim (MIT-IBM Watson AI Lab)*; Kaihong Wang (Boston University); Stan Sclaroff (Boston University); Kate Saenko (Boston University) | N/A | N/A |
| BlobGAN: Spatially Disentangled Scene Representations | Dave Epstein (UC Berkeley)*; Taesung Park (Adobe Research); Richard Zhang (Adobe); Eli Shechtman (Adobe Research, US); Alexei A Efros (UC Berkeley) | N/A | N/A |
| LGV: Boosting Adversarial Example Transferability from Large Geometric Vicinity | Martin Gubri (University of Luxembourg)*; Maxime Cordy (University of Luxembourg); Mike Papadakis (University of Luxembourg); Yves Le Traon (University of Luxembourg); Koushik Sen (University of California, Berkeley) | N/A | N/A |
| LocalBins: Improving Depth Estimation by Learning Local Distributions | Shariq F Bhat (KAUST)*; Ibraheem Alhashim (National Center for Artificial Intelligence (NCAI), Saudi Data and Artificial Intelligence Authority (SDAIA), Riyadh, Kingdom of Saudi Arabia); Peter Wonka (KAUST) | N/A | N/A |
| Prior Knowledge Guided Unsupervised Domain Adaptation | Tao Sun (Stony Brook University)*; Cheng Lu (Xiaopeng); Haibin Ling (Stony Brook University) | N/A | N/A |
| Fast Two-step Blind Optical Aberration Correction | Thomas Eboli (ENS Paris-Saclay)*; Jean-Michel Morel (Centre Borelli ENS Paris-Saclay); Gabriele Facciolo (ENS Paris – Saclay) | N/A | N/A |
| Controllable and Guided Face Synthesis for Unconstrained Face Recognition | Feng Liu (Michigan State University)*; Minchul Kim (Michigan State University); Anil Jain (Michigan State University); Xiaoming Liu (Michigan State University) | N/A | N/A |
| 2D GANs Meet Unsupervised Single-view 3D Reconstruction | Feng Liu (Michigan State University)*; Xiaoming Liu (Michigan State University) | N/A | N/A |
| Seeing Far in the Dark with Patterned Flash | Zhanghao Sun (Stanford University)*; Jian Wang (Snap); Yicheng Wu (Snap Inc.); Shree Nayar (Snap) | N/A | N/A |
| Unified Implicit Neural Stylization | Zhiwen Fan (University of Texas at Austin)*; Yifan Jiang (University of Texas at Austin); Peihao Wang (University of Texas at Austin); Xinyu Gong (University of Texas at Austin); Dejia Xu (University of Texas at Austin); Zhangyang Wang (University of Texas at Austin) | N/A | N/A |
| Improved Masked Image Generation with Token-Critic | Jose Lezama (Google Research)*; Huiwen Chang (Google); Lu Jiang (Google Research); Irfan Essa (Google) | N/A | N/A |
| UNIF: United Neural Implicit Functions for Clothed Human Reconstruction and Animation | Shenhan Qian (ShanghaiTech University)*; Jiale Xu (ShanghaiTech University); Ziwei Liu (Nanyang Technological University); Liqian Ma (ZMO AI); Shenghua Gao (Shanghaitech University) | N/A | N/A |
| PseudoClick: Interactive Image Segmentation with Click Imitation | Qin Liu (UNC)*; Meng Zheng (United Imaging Intelligence); Benjamin Planche (United Imaging Intelligence); Srikrishna Karanam (Adobe Research); Terrence Chen (United Imaging Intelligence); Marc Niethammer (UNC); Ziyan Wu (United Imaging Intelligence) | N/A | N/A |
| CoSCL: Cooperation of Small Continual Learners is Stronger than a Big One | Liyuan Wang (Tsinghua University)*; Xingxing Zhang (Tsinghua University); Qian Li (Tsinghua University); Jun Zhu (Tsinghua University); Yi Zhong (Tsinghua University) | N/A | N/A |
| Scalable Learning to Optimize: A Learned Optimizer Can Train Big Models | Xuxi Chen (University of Texas at Austin)*; Tianlong Chen (University of Texas at Austin); Yu Cheng (Microsoft Research); Weizhu Chen (Microsoft); Ahmed Awadallah (Microsoft); Zhangyang Wang (University of Texas at Austin) | N/A | N/A |
| PRIF: Primary Ray-based Implicit Function | Brandon Yushan Feng (University of Maryland, College Park)*; Yinda Zhang (Google); Danhang Tang (Google); Ruofei Du (Google); Amitabh Varshney (University of Maryland) | N/A | N/A |
| From Face to Natural Image: Learning Real Degradation for Blind Image Super-Resolution | Xiaoming Li (Harbin Institute of Technology); Chaofeng Chen (Nanyang Technological University); Xianhui Lin (Alibaba Group); Wangmeng Zuo (Harbin Institute of Technology, China)*; Lei Zhang (Hong Kong Polytechnic University, Hong Kong, China) | N/A | N/A |
| QISTA-ImageNet: A Deep Compressive Image Sensing Framework Solving Lq-Norm Optimization Problem | Gang-Xuan Lin (Academia Sinica); Shih-Wei Hu (National Taiwan University); Chun-Shien Lu (Academia Sinica)* | N/A | N/A |
| Trust, but Verify: Using Self-Supervised Probing to Improve Trustworthiness | Ailin Deng (National University of Singapore)*; Shen Li (National University of Singapore); Miao Xiong (National University of Singapore); Zhirui Chen (National University of Singapore); Bryan Hooi (National University of Singapore) | N/A | N/A |
| Spatial and Visual Perspective-Taking via View Rotation and Relation Reasoning for Embodied Reference Understanding | Cheng Shi (ShanghaiTech University); Sibei Yang (ShanghaiTech University)* | N/A | N/A |
| Med-DANet: Dynamic Architecture Network for Efficient Medical Volumetric Segmentation | Wenxuan Wang (University of Science and Technology Beijing)*; Chen Chen (University of Central Florida); Jing Wang (University of Science and Technology Beijing); Sen Zha (University of Science and Technology Beijing); Yan Zhang (University of Science and Technology Beijing); Jiangyun Li (University of Science and Technology Beijing) | N/A | N/A |
| Worst Case Matters for Few-Shot Recognition | Minghao Fu (Nanjing University); Yunhao Cao (Nanjing University); Jianxin Wu (Nanjing University)* | N/A | N/A |
| Self-Filtering: A Noise-Aware Sample Selection for Label Noise with Confidence Penalization | Qi Wei (Shandong University)*; Haoliang Sun (Shandong University); Xiankai Lu (Shandong University); Yilong Yin (Shandong University) | N/A | N/A |
| Point Cloud Domain Adaptation via Masked Local 3D Structure Prediction | Hanxue Liang (University of Texas at Austin)*; Hehe Fan (NUS); Zhiwen Fan (University of Texas at Austin); Yi Wang (University of Texas at Austin); Tianlong Chen (University of Texas at Austin); Yu Cheng (Microsoft Research); Zhangyang Wang (University of Texas at Austin) | N/A | N/A |
| Translation, Scale and Rotation: Cross-Modal Alignment Meets RGB-Infrared Vehicle Detection | Maoxun Yuan (Beihang University); Yinyan Wang (Beihang University); Xingxing Wei (Beihang University)* | N/A | N/A |
| Simple Baselines for Image Restoration | Liangyu Chen (Megvii Technology)*; Xiaojie Chu (Megvii Technology); Xiangyu Zhang (Megvii Technology); Jian Sun (Megvii Technology) | N/A | N/A |
| RDA: Reciprocal Distribution Alignment for Robust Semi-supervised Learning | Yue Duan (Nanjing University)*; Lei Qi (Southeast University); Lei Wang (University of Wollongong, Australia); Luping Zhou (University of Sydney); Yinghuan Shi (Nanjing University) | N/A | N/A |
| Exploring Hierarchical Graph Representation for Large-Scale Zero-Shot Image Classification | Kai Yi (King Abdullah University of Science and Technology)*; Xiaoqian Shen (King Abdullah University of Science and Technology); Yunhao Gou (Hong Kong University of Science and Technology); Mohamed Elhoseiny (KAUST) | N/A | N/A |
| Doubly Deformable Aggregation of Covariance Matrices for Few-shot Segmentation | Zhitong Xiong (Technical University of Munich)*; Haopeng Li (The University of Melbourne); Xiaoxiang Zhu (Technical University of Munich (TUM); German Aerospace Center (DLR)) | N/A | N/A |
| MemSAC: Memory Augmented Sample Consistency for Large Scale Domain Adaptation | Tarun Kalluri (UC San Diego)*; Astuti Sharma (UCSD); Manmohan Chandraker (UC San Diego) | N/A | N/A |
| GCISG: Guided Causal Invariant Learning for Improved Syn-to-real Generalization | Gilhyun Nam (Agency for Defense Development)*; Gyeongjae Choi (Agency for Defense Development); Kyungmin Lee (Agency for Defense Development) | N/A | N/A |
| Temporal Saliency Query Network for Efficient Video Recognition | Boyang Xia (Institute of Computing Technology, Chinese Academy of Science); Zhihao Wang (Institute of Computing Technology, Chinese Academy of Sciences); Wenhao Wu (Baidu)*; Haoran Wang (Baidu); Jungong Han (Aberystwyth University) | N/A | N/A |
| Towards Interpretable Video Super-Resolution via Alternating Optimization | Jiezhang Cao (ETH Zürich)*; Jingyun Liang (ETH Zurich); Kai Zhang (ETH Zurich); Wenguan Wang (Eidgenössische Technische Hochschule Zürich); Qin Wang (ETH Zurich); Yulun Zhang (ETH Zurich); Hao Tang (ETH Zurich); Luc Van Gool (ETH Zurich) | N/A | N/A |
| R-DFCIL: Relation-Guided Representation Learning for Data-Free Class Incremental Learning | Qiankun Gao (Peking University Shenzhen Graduate School)*; Chen Zhao (KAUST); Bernard Ghanem (KAUST); Jian Zhang (Peking University Shenzhen Graduate School) | N/A | N/A |
| Spike Transformer: Monocular Depth Estimation for Spiking Camera | Jiyuan Zhang (Peking University)*; Lulu Tang (Tsinghua University); Zhaofei Yu (Peking University); Jiwen Lu (Tsinghua University); Tiejun Huang (Peking University) | N/A | N/A |
| Towards Robust Face Recognition with Comprehensive Search | Manyuan Zhang (Sensetime)*; Guanglu Song (Sensetime); Yu Liu (SenseTime Group LTD); Hongsheng Li (The Chinese University of Hong Kong) | N/A | N/A |
| Improving Image Restoration by Revisiting Global Information Aggregation | Xiaojie Chu (Megvii Technology)*; Liangyu Chen (Megvii Technology); Chengpeng Chen (Megvii); Xin Lu (Megvii Technology) | N/A | N/A |
| Learning Pedestrian Group Representations for Multi-modal Trajectory Prediction | Inhwan Bae (Gwangju Institute of Science and Technology)*; Jin-Hwi Park (GIST); Hae-Gon Jeon (GIST) | N/A | N/A |
| RFLA: Gaussian Receptive Field based Label Assignment for Tiny Object Detection | Chang Xu (Wuhan University); Jinwang Wang (Huawei Technology); Wen Yang (Wuhan University)*; Huai Yu (Wuhan University); Lei Yu (Wuhan University); Gui-Song Xia (Wuhan University) | N/A | N/A |
| Semi-supervised Single-view 3D Reconstruction via Prototype Shape Priors | Zhen Xing (Fudan University)*; Hengduo Li (University of Maryland, College Park); Zuxuan Wu (UMD); Yu-Gang Jiang (Fudan University) | N/A | N/A |
| Sequential Multi-View Fusion Network for Fast LiDAR Point Motion Estimation | Gang Zhang (Damo Academy, Alibaba Group)*; Xiaoyan Li (Beijing University of Technology); Zhenhua Wang (DAMO Academy, Alibaba Group) | N/A | N/A |
| A Large-scale Multiple-objective Method for Black-box Attack against Object Detection | Siyuan Liang (Chinese Academy of Sciences); Longkang Li (Mohamed bin Zayed University of Artificial Intelligence); Yanbo Fan (Tencent AI Lab); Xiaojun Jia (Institute of Information Engineering, Chinese Academy of Sciences); Jingzhi Li (Institute of Information Engineering, CAS); Baoyuan Wu (The Chinese University of Hong Kong, Shenzhen)*; Xiaochun Cao (Sun Yat-sen University) | N/A | N/A |
| GradAuto: Energy-oriented Attack on Dynamic Neural Networks | Jianhong Pan (Singapore University of Technology and Design)*; Qichen Zheng (Singapore University of Technology and Design); Zhipeng Fan (NYU TANDON SCHOOL OF ENGINEERING); Hossein Rahmani (Lancaster University); Qiuhong Ke (Monash University); Jun Liu (Singapore University of Technology and Design) | N/A | N/A |
| Semantic-guided Multi-Mask Image Harmonization | Xuqian Ren (Watrix Technology); Yifan Liu (University of Adelaide)* | N/A | N/A |
| Manifold Adversarial Learning for Cross-domain 3D Shape Representation | Hao Huang (New York University); Cheng Chen (New York University); Yi Fang (New York University)* | N/A | N/A |
| GAN with Multivariate Disentangling for Controllable Hair Editing | Xuyang Guo (Institute of Computing Technology, Chinese Academy of Sciences); Meina Kan (Institute of Computing Technology, Chinese Academy of Sciences); Tianle Chen (Institute of Computing Technology, Chinese Academy of Sciences); Shiguang Shan (Institute of Computing Technology, Chinese Academy of Sciences)* | N/A | N/A |
| Fast-MoCo: Boost Momentum-based Contrastive Learning with Combinatorial Patches | Yuanzheng Ci (The University of Sydney)*; Chen Lin (University of Oxford); Lei Bai (Shanghai AI Laboratory); Wanli Ouyang (The University of Sydney) | N/A | N/A |
| Dense Cross-Query-and-Support Attention Weighted Mask Aggregation for Few-Shot Segmentation | Xinyu Shi (School of Computer Science and Engineering, Southeast University); DONG WEI (Tencent Jarvis Lab)*; Yu Zhang (Southeast University); Donghuan Lu (Tencent); Munan Ning (Tencent); Jiashun Chen (School of Computer Science and Engineering, Southeast University); Kai Ma (Tencent); Yefeng Zheng (Tencent) | N/A | N/A |
| Acknowledging the Unknown for Multi-label Learning with Single Positive Labels | Donghao Zhou (Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences)*; Pengfei Chen (The Chinese University of Hong Kong); Qiong Wang (Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences); Guangyong Chen (Shenzhen Institutes of Advanced Technology); Pheng-Ann Heng (The Chinese University of Hong Kong) | N/A | N/A |
| LoRD: Local 4D Implicit Representation for High-Fidelity Dynamic Human Modeling | Boyan Jiang (Fudan University)*; Xinlin Ren (Fudan University); Mingsong Dou (Google Inc.); Xiangyang Xue (Fudan University); Yanwei Fu (Fudan University); Yinda Zhang (Google) | N/A | N/A |
| Bilateral Normal Integration | Xu Cao (Osaka University)*; Hiroaki Santo (Osaka University); Boxin Shi (Peking University); Fumio Okura (Osaka University); Yasuyuki Matsushita (Osaka University) | N/A | N/A |
| Harmonizer: Learning to Perform White-Box Image and Video Harmonization | Zhanghan Ke (City University of Hong Kong)*; Chunyi Sun (Australian National University); Lei Zhu (City University of Hong Kong); Ke Xu (City University of Hong Kong); Rynson W.H. Lau (City University of Hong Kong) | N/A | N/A |
| On the Versatile Uses of Partial Distance Correlation in Deep Learning | Xingjian Zhen (University of Wisconsin-Madison)*; Zihang Meng (University of Wisconsin Madison); Rudrasis Chakraborty (Butlr); Vikas Singh (University of Wisconsin Madison) | N/A | N/A |
| Object-Centric Unsupervised Image Captioning | Zihang Meng (University of Wisconsin Madison)*; David Yang (Facebook); Xuefei Cao (Facebook); Ashish Shah (Facebook AI); Ser-Nam Lim (Meta AI) | N/A | N/A |
| Pose2Room: Understanding 3D Scenes from Human Activities | Yinyu Nie (Technical University of Munich)*; Angela Dai (Technical University of Munich); Xiaoguang Han (Shenzhen Research Institute of Big Data, the Chinese University of Hong Kong (Shenzhen)); Matthias Niessner (Technical University of Munich) | N/A | N/A |
| Capturing, Reconstructing, and Simulating: the UrbanScene3D Dataset | Liqiang Lin (Shenzhen University); Yilin Liu (Shenzhen University); Yue Hu (Shenzhen University); Xingguang Yan (Shenzhen University); Ke Xie (Shenzhen University); Hui Huang (Shenzhen University)* | N/A | N/A |
| A Spectral View of Randomized Smoothing under Common Corruptions: Benchmarking and Improving Certified Robustness | Jiachen Sun (University of Michigan)*; Akshay Mehra (Tulane University); Bhavya Kailkhura (Lawrence Livermore National Laboratory); Pin-Yu Chen (IBM Research); Dan Hendrycks (UC Berkeley); Jihun Hamm (Tulane University); Zhuoqing Morley Mao (University of Michigan) | N/A | N/A |
| CLIP-Actor: Text-Driven Recommendation and Stylization for Animating Human Meshes | Kim Youwang (POSTECH)*; Ji-Yeon Kim (POSTECH); Tae-Hyun Oh (POSTECH) | N/A | N/A |
| Interpretable Image Classification with Differentiable Prototypes Assignment | Dawid Damian Rymarczyk (Jagiellonian University)*; Łukasz Struski (Jagiellonian University); Michał Górszczak (Jagiellonian University); Koryna Lewandowska (Jagiellonian University); Jacek Tabor (Jagiellonian University); Bartosz Zieliński (Jagiellonian University) | N/A | N/A |
| Efficient One-stage Video Object Detection by Exploiting Temporal Consistency | Guanxiong Sun (Queen’s University Belfast); Yang Hua (Queen’s University Belfast)*; Guosheng Hu (Oosto); Neil Robertson (Queen’s University Belfast) | N/A | N/A |
| ConCL: Concept Contrastive Learning for Dense Prediction Pre-training in Pathology Images | Jiawei Yang (UCLA)*; Hanbo Chen (Tencent AI Lab); Yuan Liang (UCLA); Junzhou Huang (University of Texas at Arlington); Lei He (UCLA); Jianhua Yao (National Institutes of Health) | N/A | N/A |
| Leveraging Action Affinity and Continuity for Semi-supervised Temporal Action Segmentation | Guodong Ding (National University of Singapore)*; Angela Yao (National University of Singapore) | N/A | N/A |
| Fast and High Quality Image Denoising via Malleable Convolution | Yifan Jiang (University of Texas at Austin)*; Bartlomiej Wronski (Google Research); Ben Mildenhall (Google Research); Jonathan T Barron (Google Research); Zhangyang Wang (University of Texas at Austin); Tianfan Xue (Google) | N/A | N/A |
| Data Association between Event Streams and Intensity Frames under Diverse Baselines | Dehao Zhang (Peking University)*; Qiankun Ding (Peking University); Peiqi Duan (Peking University); Chu Zhou (Peking University); Boxin Shi (Peking University) | N/A | N/A |
| Self-Regulated Feature Learning via Teacher-free Feature Distillation | Lujun Li (Chinese Academy of Science)* | N/A | N/A |
| TS2-Net: Token Shift and Selection Transformer for Text-Video Retrieval | Yuqi Liu (Renmin University of China)*; Pengfei Xiong (Shopee); Luhui Xu (Tencent); Cao Shengming (Tencent); Qin Jin (Renmin University of China) | N/A | N/A |
| TAPE: Task-Agnostic Prior Embedding for Image Restoration | Lin Liu (University of Science and Technology of China)*; Lingxi Xie (Huawei Inc.); Xiaopeng Zhang (Noah’s Ark Lab, Huawei Inc.); Shanxin Yuan (Huawei Noah’s Ark Lab); Xiangyu Chen (University of Macau; SIAT); Wengang Zhou (University of Science and Technology of China); Houqiang Li (University of Science and Technology of China); Qi Tian (Huawei Cloud & AI) | N/A | N/A |
| MVSalNet: Multi-View Augmentation for RGB-D Salient Object Detection | JiaYuan Zhou (Dalian University of Technology)*; Lijun Wang (Dalian University of Technology); Huchuan Lu (Dalian University of Technology); Kaining Huang (huang kaining); Xinchu Shi (Meituan Group); Bocong Liu (Meituan) | N/A | N/A |
| Rethinking IoU-based Optimization for Single-stage 3D Object Detection | Hualian Sheng (College of Information Science and Electronic Engineering, Zhejiang University; DAMO Academy, Alibaba Group)*; Sijia Cai (DAMO Academy, Alibaba Group); Na Zhao (NUS); Bing Deng (Damo Academy, Alibaba Group); Jianqiang Huang (Damo Academy, Alibaba Group); Xian-Sheng Hua (Damo Academy, Alibaba Group); Min-Jian Zhao (Zhejiang University); Gim Hee Lee (National University of Singapore) | N/A | N/A |
| Uncertainty Inspired Underwater Image Enhancement | Zhenqi Fu (Xiamen University)*; Wu Wang (Xiamen University); Yue Huang (Xiamen University); Xinghao Ding (Xiamen University); Kai-Kuang Ma (Nanyang Technological University, Singapore) | N/A | N/A |
| k-means Mask Transformer | Qihang Yu (Johns Hopkins University)*; Huiyu Wang (JHU); Siyuan Qiao (Google); Maxwell D Collins (Google Inc.); Yukun Zhu (Google Inc.); Hartwig Adam (Google); Alan Yuille (Johns Hopkins University); Liang-Chieh Chen (Google Inc.) | N/A | N/A |
| Contrastive Vision-Language Pre-training with Limited Resources | Quan Cui (Waseda University)*; Boyan Zhou (ByteDance); Yu Guo (Fudan University); Weidong Yin (UBC); Hao Wu (Bytedance Inc.); Osamu Yoshie (Waseda University); Yubo Chen (Bytedance) | N/A | N/A |
| Learning Linguistic Association Towards Efficient Text-Video Retrieval | Sheng Fang (ICT); Shuhui Wang (VIPL, ICT, Chinese Academy of Sciences)*; Junbao Zhuo (ICT CAS); Xinzhe Han (University of Chinese Academy of Sciences); Qingming Huang (University of Chinese Academy of Sciences) | N/A | N/A |
| United Defocus Blur Detection and Deblurring via Adversarial Promoting Learning | Wenda Zhao (Dalian University of Technology)*; Fei Wei (Dalian University of Technology); You He (Naval Aviation University); Huchuan Lu (Dalian University of Technology) | N/A | N/A |
| Unstructured Feature Decoupling for Vehicle Re-Identification | Wen Qian (Institute of Automation, Chinese Academy of Sciences)*; Hao Luo (Alibaba Group); Silong Peng (Chinese Academy of Sciences); Fan Wang (Alibaba Group); Chen Chen (Chinese Academy of Sciences); Hao Li (Alibaba Group) | N/A | N/A |
| Improving Adversarial Robustness of 3D Point Cloud Classification Models | Guanlin Li (Nanyang Technological University)*; Guowen Xu (Nanyang Technological University); Han Qiu (Tsinghua University); Ruan HE (Tencent); Jiwei Li (Shannon.AI); Tianwei Zhang (Nanyang Technological University) | N/A | N/A |
| ASSISTER: Assistive Navigation via Conditional Instruction Generation | Zanming Huang (Boston University); Zhongkai Shangguan (Boston University); Jimuyang Zhang (Boston University); Gilad Bar (Rutgers University – Camden); Matthew Boyd (Boston University); Eshed Ohn-Bar (Boston University)* | N/A | N/A |
| Deep Hash Distillation for Image Retrieval | Young Kyun Jang (Seoul National University)*; Geonmo Gu (NAVER corp); Byungsoo Ko (NAVER/LINE Corp.); Isaac Kang (Seoul National University); Nam Ik Cho (Seoul National University) | N/A | N/A |
| Learning Spatial-Preserved Skeleton Representations for Few-Shot Action Recognition | Ning Ma (Zhejiang University)*; Hongyi Zhang (Zhejiang University); Xuhui Li (Zhejiang University); Sheng Zhou (Zhejiang University); Zhen Zhang (National University of Singapore); Jun Wen (Harvard University); Haifeng Li (Zhejiang University); Jingjun Gu (Zhejiang University); Jiajun Bu (Zhejiang University) | N/A | N/A |
| Digging into Radiance Grid for Real-Time View Synthesis with Detail Preservation | Jian Zhang (Alibaba Group); Jinchi Huang (Alibaba Group); Bowen Cai (Alibaba Group); Huan Fu (Alibaba Group)*; Mingming Gong (University of Melbourne); Chaohui Wang (Laboratoire d’Informatique Gaspard Monge, Université Paris-Est); Jiaming Wang (Alibaba Group); Hongchen Luo (Alibaba Group); Rongfei Jia (Alibaba Group); Binqiang Zhao (Alibaba); Xing Tang (Alibaba Group) | N/A | N/A |
| S^2Contact: Graph-based Network for 3D Hand-Object Contact Estimation with Semi-Supervised Learning | Tze Ho Elden Tse (University of Birmingham)*; Zhongqun Zhang (University of Birmingham); Kwang In Kim (UNIST); Ales Leonardis (University of Birmingham); Feng Zheng (SUSTech); Hyung Jin Chang (University of Birmingham) | N/A | N/A |
| TD-Road: Top-Down Road Network Extraction with Holistic Graph Construction | Yang He (Amazon)*; Ravi Garg (Amazon com services inc); Amber Roy Chowdhury (Amazon) | N/A | N/A |
| StyleGAN-Human: A Data-Centric Odyssey of Human Generation | Jianglin Fu (SenseTime)*; Shikai Li (SenseTime Research); Yuming Jiang (Nanyang Technological University); Kwan-Yee Lin (SenseTime Research); Chen Qian (SenseTime); Chen Change Loy (Nanyang Technological University); Wayne Wu (SenseTime Research); Ziwei Liu (Nanyang Technological University) | N/A | N/A |
| Hourglass Attention Network for Image Inpainting | Ye Deng (Xi’an Jiaotong University)*; Siqi Hui (Xi’an Jiaotong University); Rongye Meng (IAIR, Xi’an Jiaotong University); Sanping Zhou (Xi’an Jiaotong University); Jinjun Wang (Xi’an Jiaotong University) | N/A | N/A |
| MaxViT: Multi-Axis Vision Transformer | Zhengzhong Tu (University of Texas at Austin)*; Hossein Talebi (Google); Han Zhang (Google); Feng Yang (Google Research); Peyman Milanfar (Google); Alan Bovik (University of Texas at Austin); Yinxiao Li (Google) | N/A | N/A |
| Gen6D: Generalizable Model-Free 6-DoF Object Pose Estimation from RGB Images | Yuan Liu (The University of Hong Kong)*; Yilin Wen (The University of Hong Kong); Sida Peng (Zhejiang University); Cheng Lin (Tencent); Xiaoxiao Long (The University of Hong Kong); Taku Komura (The University of Hong Kong); Wenping Wang (The University of Hong Kong) | N/A | N/A |
| ColorFormer: Image Colorization via Color Memory assisted Hybrid-attention Transformer | Xiaozhong Ji (Tencent)*; Boyuan Jiang (Tencent Youtu Lab); Donghao Luo (Tencent); Guangpin Tao (Nanjing University); Wenqing Chu (Tencent); Zhifeng Xie (Shanghai University); Chengjie Wang (Tencent; Shanghai Jiao Tong University); Ying Tai (Tencent YouTu) | N/A | N/A |
| Spotting Temporally Precise, Fine-Grained Events in Video | James Hong (Stanford University)*; Haotian Zhang (Stanford University); Michaël Gharbi (Adobe Research); Matthew Fisher (Adobe Research); Kayvon Fatahalian (Stanford) | N/A | N/A |
| SegPGD: An Effective and Efficient Adversarial Attack for Evaluating and Boosting Segmentation Robustness | Jindong Gu (University of Munich)*; Hengshuang Zhao (University of Oxford); Volker Tresp (Siemens AG and Ludwig Maximilian University of Munich); Philip Torr (University of Oxford) | N/A | N/A |
| Adversarial Erasing Framework via Triplet with Gated Pyramid Pooling Layer for Weakly Supervised Semantic Segmentation | Sung-Hoon Yoon (KAIST)*; Hyeokjun Kweon (KAIST); Jegyeong Cho (KAIST); Shinjeong Kim (KAIST); Kuk-Jin Yoon (KAIST) | N/A | N/A |
| Semi-Supervised Vision Transformers | Zejia Weng (Fudan University)*; Xitong Yang (University of Maryland); Ang Li (Google DeepMind); Zuxuan Wu (UMD); Yu-Gang Jiang (Fudan University) | N/A | N/A |
| Learning an Isometric Surface Parameterization for Texture Unwrapping | Sagnik Das (Stony Brook University)*; Ke Ma (Stony Brook University); Zhixin Shu (Adobe Research); Dimitris Samaras (Stony Brook University) | N/A | N/A |
| Mimic Embedding via Adaptive Aggregation: Learning Generalizable Person Re-identification | Boqiang Xu (University of Chinese Academy of Sciences; Institute of Automation, Chinese Academy of Sciences)*; Jian Liang (CASIA); He Lingxiao (nlpr,cripac); Zhenan Sun (Chinese Academy of Sciences) | N/A | N/A |
| CryoAI: Amortized Inference of Poses for Ab Initio Reconstruction of 3D Molecular Volumes from Real Cryo-EM Images | Axel Levy (Stanford University); Frederic Poitevin (SLAC National Accelerator Laboratory); Julien N. P. Martel (Stanford University); Youssef Nashed (SLAC National Accelerator Laboratory); Ariana Peck (SLAC National Accelerator Laboratory); Nina Miolane (UCSB); Daniel Ratner (Stanford University); Mike Dunne (SLAC National Accelerator Laboratory); Gordon Wetzstein (Stanford University)* | N/A | N/A |
| EAGAN: Efficient Two-stage Evolutionary Architecture Search for GANs | Guohao Ying (University of Southern California); Xin He (Hong Kong Baptist University); Bin Gao (National University of Singapore); Bo Han (HKBU / RIKEN); Xiaowen Chu (Hong Kong University of Science and Technology)* | N/A | N/A |
| ScalableViT: Rethinking the Context-oriented Generalization of Vision Transformer | Rui Yang (Tsinghua University)*; Hailong Ma (ByteDance Inc); Jie Wu (ByteDance Inc); Yansong Tang (Tsinghua University); Xuefeng Xiao (ByteDance Inc); Min Zheng (ByteDance); Xiu Li (Tsinghua University) | N/A | N/A |
| PlaneFormers: From Sparse View Planes to 3D Reconstruction | Samir Agarwala (University of Michigan)*; Linyi Jin (University of Michigan); Chris Rockwell (University of Michigan); David Fouhey (University of Michigan) | N/A | N/A |
| Domain Adaptive Video Segmentation via Temporal Pseudo Supervision | Yun Xing (Nanyang Technological University); Dayan Guan (Mohamed bin Zayed University of Artificial Intelligence); Jiaxing Huang (Nanyang Technological University); Shijian Lu (Nanyang Technological University)* | N/A | N/A |
| Diverse Learner: Exploring Diverse Supervision for Semi-supervised Object Detection | Linfeng Li (Baidu)*; Minyue Jiang (Baidu Inc.); Yue Yu (Baidu.Inc.); Wei Zhang (Baidu Inc); Xiangru Lin (Baidu Inc.); Yingying Li (Baidu); Xiao Tan (Baidu Inc.); Jingdong Wang (Baidu); Errui Ding (Baidu Inc.) | N/A | N/A |
| Overlooked Poses Actually Make Sense: Distilling Privileged Knowledge for Human Motion Prediction | Xiaoning Sun (Nanjing University of Science and Technology)*; Qiongjie Cui (Nanjing University of Science and Technology); Huaijiang Sun (Nanjing University of Science and Technology); Bin Li (Tianjin AiForward Science and Technology); Weiqing Li (Nanjing University of Science and Technology); Jianfeng Lu (Nanjing University of Science and Technology) | N/A | N/A |
| Towards Hard-Positive Query Mining for DETR-based Human-Object Interaction Detection | Xubin Zhong (South China University of Technology); Changxing Ding (South China University of Technology)*; Zijian Li (South China University of Technology); Shaoli Huang (Tencent AI-Lab) | N/A | N/A |
| Learning Extremely Lightweight and Robust Model with Differentiable Constraints on Sparsity and Condition Number | Xian Wei (East China Normal University); Yangyu Xu (Fujian Institute of Research on the Structure of Matter, Chinese Academy of Sciences; University of Chinese Academy of Sciences); Yanhui Huang (Fuzhou University); Hairong Lv (Tsinghua University); Hai Lan (Fujian Institute of Research on the Structure of Matter, Chinese Academy of Sciences); Mingsong Chen (East China Normal University); Xuan Tang (East China Normal University)* | N/A | N/A |
| Structural Triangulation: A Closed-Form Solution to Constrained 3D Human Pose Estimation | Zhuo Chen (Shanghai Jiao Tong University)*; Xu Zhao (Shanghai Jiao Tong University); Xiaoyue Wan (Shanghai Jiao Tong University) | N/A | N/A |
| Latency-Aware Collaborative Perception | Zixing Lei (Shanghai Jiao Tong University)*; Shunli Ren (Shanghai Jiao Tong University); Yue Hu (Shanghai Jiao Tong University); Wenjun Zhang (Shanghai Jiao Tong University); Siheng Chen (Shanghai Jiao Tong University) | N/A | N/A |
| Homogeneous Multi-modal Feature Fusion and Interaction for 3D Object Detection | Xin Li (East China Normal University)*; Botian Shi (Shanghai AI Lab); Yuenan Hou (Shanghai AI Lab); Xingjiao Wu (East China Normal University); Tianlong Ma (East China Normal University); Yikang Li (Shanghai AI Lab); Liang He (ECNU) | N/A | N/A |
| Unfolded Deep Kernel Estimation for Blind Image Super-resolution | Hongyi Zheng (The Hong Kong Polytechnic University); Hongwei Yong (The Hong Kong Polytechnic University); Lei Zhang (Hong Kong Polytechnic University, Hong Kong, China)* | N/A | N/A |
| Rethinking Clustering-Based Pseudo-Labeling for Unsupervised Meta-Learning | Xingping Dong (Inception Institute of Artificial Intelligence)*; Jianbing Shen (Inception Institute of Artificial Intelligence); Ling Shao (Terminus Group) | N/A | N/A |
| Continual Semantic Segmentation via Structure Preserving and Projected Feature Alignment | Zihan Lin (University of Science and Technology of China); Zilei Wang (University of Science and Technology of China)*; Yixin Zhang (University of Science and Technology of China) | N/A | N/A |
| SC-wLS: Towards Interpretable Feed-forward Camera Re-localization | Xin Wu (Peking University)*; Hao Zhao (Intel Labs China); Shunkai Li (Peking University); Yingdian Cao (Peking University); Hongbin Zha (Peking University, China) | N/A | N/A |
| Weakly-Supervised Stitching Network for Real-World Panoramic Image Generation | Dae-Young Song (Chungnam National University); Geonsoo Lee (Chungnam National University); HeeKyung Lee (ETRI (Electronics and Telecommunications Research Institute)); Gi-Mun Um (ETRI (Electronics and Telecommunications Research Institute)); Donghyeon Cho (Chungnam National University)* | N/A | N/A |
| FloatingFusion: Depth from ToF and Image-stabilized Stereo Cameras | Andreas Meuleman (KAIST); Hakyeong Kim (KAIST); James Tompkin (Brown University); Min H. Kim (KAIST)* | N/A | N/A |
| Dual-Evidential Learning for Weakly-supervised Temporal Action Localization | Mengyuan Chen (Institute of Automation, Chinese Academy of Sciences)*; Junyu Gao (CASIA); Shicai Yang (Hikvision Research Institute); Changsheng Xu (CASIA) | N/A | N/A |
| DynaST: Dynamic Sparse Transformer for Exemplar-Guided Image Generation | Songhua Liu (National University of Singapore)*; Jingwen Ye (National University of Singapore); Sucheng Ren (South China University of Technology); Xinchao Wang (National University of Singapore) | N/A | N/A |
| D2HNet: Joint Denoising and Deblurring with Hierarchical Network for Robust Night Image Restoration | Yuzhi Zhao (City University of Hong Kong)*; Yongzhe Xu (SenseTime Group Limited); Qiong Yan (SenseTime Group Limited); DINGDONG YANG (University of Michigan); Xuehui Wang (Shanghai Jiao Tong University); Lai-Man Po (CITY UNIVERSITY OF HONG KONG) | N/A | N/A |
| DELTAR: Depth Estimation from a Light-weight ToF Sensor and RGB Image | Yijin Li (Zhejiang University); Yinda Zhang (Google); Xinyang Liu (Zhejiang University); Wenqi Dong (Zhejiang University); Han Zhou (Zhejiang University); Hujun Bao (Zhejiang University); Guofeng Zhang (Zhejiang University); Zhaopeng Cui (Zhejiang University)* | N/A | N/A |
| ERA: Enhanced Rational Activations | Martin Trimmel (Lund University)*; Mihai Zanfir (Google); Richard I Hartley (Google); Cristian Sminchisescu (Google) | N/A | N/A |
| FrequencyLowCut pooling – Plug & Play against Catastrophic Overfitting | Julia Grabinski (University of Siegen)*; Janis Keuper (Fraunhofer); Margret Keuper (University of Mannheim); Steffen Jung (MPII) | N/A | N/A |
| Interclass Prototype Relation for Few-Shot Segmentation | Atsuro Okazawa (SoftBank Corp.)* | N/A | N/A |
| Multi-Faceted Distillation of Base-Novel Commonality for Few-shot Object Detection | Shuang Wu (Harbin Institute of Technology, Shenzhen); Wenjie Pei (Harbin Institute of Technology, Shenzhen); Dianwen Mei (Harbin Institute of Technology, Shenzhen); Fanglin Chen (Harbin Institute of Technology, Shenzhen); Jiandong Tian (CAS); Guangming Lu (Harbin Institute of Technology, Shenzhen)* | N/A | N/A |
| X-DETR: A Versatile Architecture for Instance-wise Vision-Language Tasks | Zhaowei Cai (Amazon)*; Gukyeong Kwon (Amazon); Avinash Ravichandran (Amazon); Erhan Bas (Amazon); Zhuowen Tu (UC San Diego); Rahul Bhotika (Amazon); Stefano Soatto (UCLA) | N/A | N/A |
| Equivariance and Invariance Inductive Bias for Learning from Insufficient Data | Tan Wang (Nanyang Technological University)*; Qianru Sun (Singapore Management University); Sugiri Pranata (Panasonic R&D Center Singapore); Karlekar Jayashree (Panasonic); Hanwang Zhang (Nanyang Technological University) | N/A | N/A |
| Multimodal Conditional Image Synthesis with Product-of-Experts GANs | Xun Huang (NVIDIA)*; Arun Mallya (NVIDIA); Ting-Chun Wang (NVIDIA); Ming-Yu Liu (NVIDIA) | N/A | N/A |
| Balancing between Forgetting and Acquisition in Incremental Subpopulation Learning | Mingfu Liang (Northwestern University)*; JIAHUAN ZHOU (Peking University); Wei Wei (Northwestern University); Ying Wu (Northwestern University) | N/A | N/A |
| TensoRF: Tensorial Radiance Fields | Anpei Chen (ShanghaiTech University)*; Zexiang Xu (Adobe Research); Andreas Geiger (University of Tuebingen); Jingyi Yu (Shanghai Tech University); Hao Su (UCSD) | N/A | N/A |
| PointCLM: A Contrastive Learning-based Framework for Multi-instance Point Cloud Registration | Mingzhi Yuan (Fudan University)*; Zhihao Li (Fudan); Qiuye Jin (Fudan University); Xinrong Chen (Fudan University); Manning Wang (Fudan University) | N/A | N/A |
| Slim Scissors: Segmenting Thin Object from Synthetic Background | Kunyang Han (Beijing Jiaotong University)*; Jun Hao Liew (ByteDance); Jiashi Feng (ByteDance); Huawei Tian (People’s Public Security University of China); Yao Zhao (Beijing Jiaotong University); Yunchao Wei (UTS) | N/A | N/A |
| CLASTER: Clustering with Reinforcement Learning for Zero-Shot Action Recognition | Shreyank N Gowda (University of Edinburgh)*; Laura Sevilla-Lara (Facebook); Frank Keller (University of Edinburgh); Marcus Rohrbach (Facebook AI Research) | N/A | N/A |
| Discovering Human-Object Interaction Concepts via Self-Compositional Learning | Zhi Hou (The University of Sydney)*; Baosheng Yu (The University of Sydney); Dacheng Tao (The University of Sydney) | N/A | N/A |
| Mixed-Precision Neural Network Quantization via Learned Layer-wise Importance | Chen Tang (Tsinghua University)*; Kai Ouyang (Tsinghua University); Zhi Wang (Tsinghua University); Yifei Zhu (Shanghai Jiao Tong University); Wen Ji (Institute of Computing Technology, Chinese Academy of Sciences); Yaowei Wang (PengCheng Laboratory); Wenwu Zhu (Tsinghua University) | N/A | N/A |
| TREND: Truncated Generalized Normal Density Estimation of Inception Embeddings for GAN Evaluation | Junghyuk Lee (School of Integrated Technology, Yonsei University); Jong-Seok Lee (Yonsei University, Korea)* | N/A | N/A |
| 3D Room Layout Estimation from a Cubemap of Panorama Image via Deep Manhattan Hough Transform | Yining Zhao (Tsinghua University); Chao Wen (Bytedance); Zhou Xue (Bytedance); Yue Gao (Tsinghua University)* | N/A | N/A |
| JoJoGAN: One Shot Face Stylization | Min Jin Chong (University of Illinois at Urbana-Champaign)*; David Forsyth (University of Illinois at Urbana-Champaign) | N/A | N/A |
| Convolutional Embedding Makes Hierarchical Vision Transformer Stronger | Cong Wang (OPPO); Hongmin Xu (OPPO)*; Xiong Zhang (Neolix Autonomous Vehicle); Li Wang (North China University of Technology); Zhitong Zheng (OPPO); Haifeng Liu (OPPO) | N/A | N/A |
| Weakly Supervised Object Localization via Transformer with Implicit Spatial Calibration | Haotian Bai (The Chinese University of Hong Kong, Shenzhen); Ruimao Zhang (The Chinese University of Hong Kong, Shenzhen)*; Jiong Wang (The Chinese University of Hong Kong, Shenzhen); Xiang Wan (Shenzhen Research Institute of Big Data, the Chinese University of Hong Kong (Shenzhen)) | N/A | N/A |
| Few-shot Class-incremental Learning for 3D Point Cloud Objects | Townim Faisal Chowdhury (North South University); Ali Cheraghian (Australian National University (ANU)); Sameera Chandimal Ramasinghe (Australian National University); Sahar Ahmadi (University of Technology Sydney); Morteza Saberi (University of Technology, Sydney); Shafin Rahman (North South University)* | N/A | N/A |
| Learning Graph Neural Networks for Image Style Transfer | Yongcheng Jing (The University of Sydney); Yining Mao (Zhejiang University); Yiding Yang (Wormpex AI Research); Yibing Zhan (JD Explore Academy); Mingli Song (Zhejiang University); Xinchao Wang (National University of Singapore)*; Dacheng Tao (JD.com) | N/A | N/A |
| JPerceiver: Joint Perception Network for Depth, Pose and Layout Estimation in Driving Scenes | Haimei Zhao (The University of Sydney)*; Jing Zhang (The University of Sydney); Sen Zhang (The University of Sydney); Dacheng Tao (JD.com) | N/A | N/A |
| Meta-Learning with Less Forgetting on Large-Scale Non-Stationary Task Distributions | Zhenyi Wang (University at Buffalo)*; Li Shen (JD Explore Academy); Le Fang (University at Buffalo); Qiuling Suo (State University of New York at Buffalo); Donglin Zhan (Columbia University); Tiehang Duan (Facebook); Mingchen Gao (University at Buffalo, SUNY) | N/A | N/A |
| Semi-supervised 3D Object Detection with Proficient Teachers | Junbo Yin (Beijing Institute of Technology); Jin Fang (Baidu ); Dingfu Zhou (Baidu); Wenguan Wang (Eidgenössische Technische Hochschule Zürich); Liangjun Zhang (baidu); Cheng-Zhong Xu (University of Macau); Jianbing Shen (Inception Institute of Artificial Intelligence)* | N/A | N/A |
| NeFSAC: Neurally Filtered Minimal Samples | Luca Cavalli (ETH Zurich)*; Marc Pollefeys (ETH Zurich / Microsoft); Daniel Barath (ETH Zürich) | N/A | N/A |
| Domain Generalization by Mutual-Information Regularization with Pre-trained Models | Junbum Cha (Kakaobrain)*; Kyungjae Lee (Chung-Ang University); Sungrae Park (Upstage AI Research, Upstage AI); Sanghyuk Chun (NAVER AI Lab) | N/A | N/A |
| AcroFOD: An Adaptive Method for Cross-domain Few-shot Object Detection | Yipeng Gao (Sun Yat-sen University, China); Lingxiao YANG (Sun Yat-sen University); Yunmu Huang (Huawei Technologies Co., Ltd.); Song Xie (Huawei Technologies Co., Ltd.); Shiyong Li (AI Application Research Center, Huawei Technologies Co., Ltd); WEI-SHI ZHENG (Sun Yat-sen University, China)* | N/A | N/A |
| Primitive-based Shape Abstraction via Nonparametric Bayesian Inference | Yuwei Wu (National University of Singapore)*; Weixiao Liu (National University of Singapore); Sipu Ruan (National University of Singapore); Gregory S Chirikjian (National University of Singapore) | N/A | N/A |
| Active label correction using robust parameter update and entropy propagation | Kwang In Kim (UNIST)* | N/A | N/A |
| E-Graph: Minimal Solution for Rigid Rotation with Extensibility Graphs | Yanyan Li (tum)*; Federico Tombari (Google, TU Munich) | N/A | N/A |
| Unified Fully and Timestamp Supervised Temporal Action Segmentation via Sequence to Sequence Translation | Nadine Behrmann (Bosch Center for Artificial Intelligence)*; S. Alireza Golestaneh (Google); Zico Kolter (Carnegie Mellon University); Jürgen Gall (University of Bonn); Mehdi Noroozi (Bosch GmbH) | N/A | N/A |
| Counterfactual Intervention Feature Transfer for Visible-Infrared Person Re-identification | Xulin Li (University of Science and Technology of China); Yan Lu (University of Sydney); Bin Liu (University of Science and Technology of China)*; Yating Liu (USTC); Guojun Yin (University of Science and Technology of China); Qi Chu (University of Science and Technology of China); Jinyang Huang (University Of Science And Technology Of China); Feng Zhu (University of Science and Technology of China); Rui Zhao (SenseTime Group Limited); Nenghai Yu (University of Science and Technology of China) | N/A | N/A |
| A Closer Look at Invariances in Self-supervised Pre-training for 3D Vision | Lanxiao Li (Karlsruher Institut fuer Technologie)*; Michael Heizmann (Karlsruher Institut fuer Technologie) | N/A | N/A |
| VecGAN: Image-to-Image Translation with Interpretable Latent Directions | Yusuf Dalva (Bilkent University); Said F Altındiş (Bilkent University); Aysegul Dundar (Bilkent University)* | N/A | N/A |
| SNeS: Learning Probably Symmetric Neural Surfaces from Incomplete Data | Eldar Insafutdinov (University of Oxford); Dylan Campbell (University of Oxford)*; Joao F Henriques (University of Oxford); Andrea Vedaldi (Oxford University) | N/A | N/A |
| Three things everyone should know about Vision Transformers | Hugo Touvron (Facebook AI Research)*; Matthieu Cord (Sorbonne University); Alaaeldin M El-Nouby (Facebook AI Research); Jakob Verbeek (Facebook); Herve Jegou (Facebook AI Research) | N/A | N/A |
| DeiT III: Revenge of the ViT | Hugo Touvron (Facebook AI Research)*; Matthieu Cord (Sorbonne University); Herve Jegou (Facebook AI Research) | N/A | N/A |
| Any-resolution Training for High-resolution Image Synthesis | Lucy Chai (MIT)*; Michaël Gharbi (Adobe Research); Eli Shechtman (Adobe Research, US); Phillip Isola (MIT); Richard Zhang (Adobe) | N/A | N/A |
| HDR-Plenoxels: Self-Calibrating High Dynamic Range Radiance Fields | Kim Jun-Seong (POSTECH)*; Kim Yu-Ji (POSTECH); Moon Ye-Bin (POSTECH); Tae-Hyun Oh (POSTECH) | N/A | N/A |
| PartImageNet: A Large, High-Quality Dataset of Parts | Ju He (Johns Hopkins University)*; Shuo Yang (University of Technology Sydney); Shaokang Yang (ByteDance); Adam Kortylewski (Max Planck Institute for Informatics); Xiaoding Yuan (Johns Hopkins University); Jie-Neng Chen (Johns Hopkins University); shuai liu (ByteDance Inc.); Cheng Yang (ByteDance Inc.); Qihang Yu (Johns Hopkins University); Alan Yuille (Johns Hopkins University) | N/A | N/A |
| Abstracting Sketches through Simple Primitives | Stephan Alaniz (University of Tübingen)*; Massimiliano Mancini (University of Tübingen); Anjan Dutta (University of Surrey); Diego Marcos (Wageningen University); Zeynep Akata (University of Tübingen) | N/A | N/A |
| MTTrans: Cross-Domain Object Detection with Mean Teacher Transformer | Jinze Yu (Beihang University); Jiaming Liu (Peking University); Xiaobao Wei (Beihang University); Haoyi Zhou (Beihang University); Yohei Nakata (Panasonic Corporation); Denis A Gudovskiy (Panasonic); Tomoyuki Okuno (Panasonic); Jianxin Li (Beihang University); Kurt Keutzer (UC Berkeley); Shanghang Zhang (University of California, Berkeley)* | N/A | N/A |
| TAFIM: Targeted Adversarial Attacks against Facial Image Manipulations | Shivangi Aneja (Technical University of Munich)*; Lev Markhasin (Sony Europe); Matthias Niessner (Technical University of Munich) | N/A | N/A |
| NeuMan: Neural Human Radiance Field from a Single Video | Wei Jiang (University of British Columbia)*; Kwang Moo Yi (University of British Columbia); Golnoosh Samei (UBC); Oncel Tuzel (Apple); Anurag Ranjan (Apple) | N/A | N/A |
| Learning Implicit Templates for Point-Based Clothed Human Modeling | Siyou Lin (Tsinghua University)*; Hongwen Zhang (Tsinghua University); Zerong Zheng (Tsinghua University); Ruizhi Shao (Tsinghua University); Yebin Liu (Tsinghua University) | N/A | N/A |
| Event Neural Networks | Matthew Dutson (University of Wisconsin-Madison)*; Yin Li (University of Wisconsin-Madison); Mohit Gupta (University of Wisconsin-Madison, USA) | N/A | N/A |
| Learning to Censor by Noisy Sampling | Ayush Chopra (MIT)*; Abhinav Java (Adobe, MDSR Labs); Abhishek Singh (MIT); Vivek Sharma (MIT); Ramesh Raskar (Massachusetts Institute of Technology) | N/A | N/A |
| ConMatch: Semi-Supervised Learning with Confidence-Guided Consistency Regularization | Jiwon Kim (Korea University)*; Youngjo Min (Korea University); Daehwan Kim (Samsung electro mechanics); Gyuseong Lee (Korea University); Junyoung Seo (Korea University); Kwangrok Ryoo (Korea University); Seungryong Kim (Korea University) | N/A | N/A |
| Granularity-aware Adaptation for Image Retrieval over Multiple Tasks | Jon Almazan (Naver Labs); Byungsoo Ko (NAVER/LINE Corp.); Geonmo Gu (NAVER corp); Diane Larlus (Naver Labs Europe); Yannis Kalantidis (NAVER LABS Europe)* | N/A | N/A |
| EdgeViTs: Competing Light-weight CNNs on Mobile Devices with Vision Transformers | Junting Pan (The Chinese University of Hong Kong); Adrian Bulat (Samsung AI Center, Cambridge); Fuwen Tan (Samsung AI Center, Cambridge); Xiatian Zhu (University of Surrey); Lukasz Dudziak (Samsung AI Center Cambridge); Hongsheng Li (The Chinese University of Hong Kong); Georgios Tzimiropoulos (Queen Mary University of London); Brais Martinez (Samsung AI Center)* | N/A | N/A |
| Multi-Domain Multi-Definition Landmark Localization for Small Datasets | David Ferman (AI Foundation); Gaurav Bharaj (AI Foundation)* | N/A | N/A |
| TAVA: Template-free Animatable Volumetric Actors | Ruilong Li (UC Berkeley)*; Julian Tanke (University of Bonn); Minh P Vo (Facebook Reality Labs); Michael Zollhöfer (Facebook Reality Labs); Jürgen Gall (University of Bonn); Angjoo Kanazawa (University of California Berkeley); Christoph Lassner (Meta Reality Labs Research) | N/A | N/A |
| Stereo Depth Estimation with Echoes | Chenghao Zhang (National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, China)*; Kun Tian (Institute of Automation, Chinese Academy of Sciences); Bolin Ni (Institute of Automation, Chinese Academy of Sciences); Gaofeng Meng (Chinese Academy of Sciences); Bin Fan (University of Science and Technology Beijing); Zhaoxiang Zhang (Chinese Academy of Sciences, China); Chunhong Pan (Institute of Automation, Chinese Academy of Sciences) | N/A | N/A |
| EASNet: Searching Elastic and Accurate Network Architecture for Stereo Matching | Qiang Wang (Harbin Institute of Technology (Shenzhen))*; Shaohuai Shi (The Hong Kong University of Science and Technology); Kaiyong Zhao (Hong Kong Baptist University); Xiaowen Chu (Hong Kong University of Science and Technology) | N/A | N/A |
| DEVIANT: Depth EquiVarIAnt NeTwork for Monocular 3D Object Detection | Abhinav Kumar (Michigan State University)*; Garrick Brazil (Facebook); Enrique Corona (Ford Motor Company); Armin Parchami (Ford Motor Company); Xiaoming Liu (Michigan State University) | N/A | N/A |
| RBP-Pose: Residual Bounding Box Projection for Category-Level Pose Estimation | Ruida Zhang (Tsinghua University)*; Yan Di (Technical University of Munich); Zhiqiang Lou (Tsinghua University); Fabian Manhardt (Google); Federico Tombari (Google, TU Munich); Xiangyang Ji (Tsinghua University) | N/A | N/A |
| Levenshtein OCR | Cheng Da (Alibaba DAMO Academy)*; Wang Peng (Alibaba DAMO Academy); Cong Yao (Alibaba DAMO Academy) | N/A | N/A |
| Multi-Granularity Prediction for Scene Text Recognition | Wang Peng (Alibaba DAMO Academy); Cheng Da (Alibaba DAMO Academy)*; Cong Yao (Alibaba DAMO Academy) | N/A | N/A |
| MixSKD: Self-Knowledge Distillation from Mixup for Image Recognition | Chuanguang Yang (Institute of Computing Technology, Chinese Academy of Sciences )*; Zhulin An (Institute of Computing Technology, Chinese Academy of Sciences); Helong Zhou (Beijing Horizon Information Technology Co.,Ltd); linhang cai (Institute of Computing Technology, Chinese Academy of Sciences); Xiang Zhi (Institute of Computing Technology, Chinese Academy of Sciences); Jiwen Wu (Institute of Computing Technology, Chinese Academy of Sciences); yongjun xu (Institute of Computing Technology, Chinese Academy of Sciences); Qian Zhang (Horizon Robotics) | N/A | N/A |
| Switch-BERT: Learning to Model Multimodal Interactions by Switching Attention and Input | Qingpei Guo (Ant Financial Services Group)*; Kaisheng Yao (Amazon); Wei Chu (Ant Group) | N/A | N/A |
| Efficient Video Transformers with Spatial-temporal Token Selection | Junke Wang (Fudan University)*; Xitong Yang (University of Maryland); Hengduo Li (University of Maryland, College Park ); Li Liu (BirenTech Research); Zuxuan Wu (UMD); Yu-Gang Jiang (Fudan University) | N/A | N/A |
| DAS: Densely-Anchored Sampling for Deep Metric Learning | Lizhao Liu (South China University of Technology); Shangxin Huang (South China University of Technology); Zhuangwei Zhuang (South China University of Technology); Ran Yang (South China University of Technology); Mingkui Tan (South China University of Technology)*; Yaowei Wang (PengCheng Laboratory) | N/A | N/A |
| ReCoNet: Recurrent Correction Network for Fast and Efficient Multi-modality Image Fusion | Zhanbo Huang (Dalian University of Technology); Jinyuan Liu (Dalian University of Technology); Xin Fan (Dalian University of Technology)*; Risheng Liu (Dalian University of Technology); Wei Zhong (Dalian University of Technology); Zhongxuan Luo (DALIAN UNIVERSITY OF TECHNOLOGY) | N/A | N/A |
| RIBAC: Towards Robust and Imperceptible Backdoor Attack against Compact DNN | Huy Phan (Rutgers University)*; Cong Shi (Rutgers University); Yi Xie (Rutgers University); Tianfang Zhang (Rutgers University, New Brunswick); Zhuohang Li (University of Tennessee, Knoxville); Tianming Zhao (Temple University); Jian Liu (The University of Tennessee, Knoxville); Yan Wang (Temple University); Yingying Chen (Rutgers University); bo yuan (rutgers university) | N/A | N/A |
| Point Cloud Compression with Sibling Context and Surface Priors | Zhili CHEN (HKUST); Zian Qian (HKUST); Sukai Wang (HKUST); Qifeng Chen (HKUST)* | N/A | N/A |
| Self-Feature Distillation with Uncertainty Modeling for Degraded Image Recognition | zhou yang (Xidian University); Weisheng Dong (Xidian University)*; Xin Li (West Virginia University); Jinjian Wu (Xidian University); Leida Li (Xidian University); Guangming Shi (Xidian University) | N/A | N/A |
| Point Cloud Compression using Range Image-based Entropy Model for Autonomous Driving | Sukai Wang (HKUST)*; Ming Liu (HKUST) | N/A | N/A |
| CANF-VC: Conditional Augmented Normalizing Flows for Video Compression | Yung-Han Ho (NCTU); Chih-Peng Chang (National Chiao Tung University); Peng-Yu Chen (NYCU); Alessandro Gnutti (University of Brescia); Wen-Hsiao Peng (National Yang Ming Chiao Tung University)* | N/A | N/A |
| Bi-level Feature Alignment for Versatile Image Translation and Manipulation | Fangneng Zhan (Max Planck Institute for Informatics); Yingchen Yu (Nanyang Technological University); Rongliang WU (Nanyang Technological University); Jiahui Zhang (Nanyang Technological University); Kaiwen Cui (Nanyang Technological University); Aoran Xiao (Nanyang Technological University); Shijian Lu (Nanyang Technological University)*; Chunyan Miao (NTU) | N/A | N/A |
| Lane Detection Transformer based on Multi-frame Horizontal and Vertical Attention and Visual Transformer Module | Han Zhang (Beihang University)*; Yunchao Gu (BUAA); Xinliang Wang (BUAA); Junjun Pan (Beihang University); Minghui Wang (Beihang University) | N/A | N/A |
| Label-Guided Auxiliary Training Improves 3D Object Detector | yaomin huang (East China Normal University); Xinmei Liu (East China Normal University)*; Yichen Zhu (Midea Group); Zhiyuan Xu (Midea Group); Chaomin Shen (East China Normal University); Zhengping Che (Midea Group); Guixu Zhang (East China Normal University); Yaxin Peng (Department of Mathematics, School of Science, Shanghai University); Feifei Feng (Midea Group); Jian Tang (Midea Group) | N/A | N/A |
| FedX: Unsupervised Federated Learning with Cross Knowledge Distillation | Sungwon Han (KAIST)*; Sungwon Park (KAIST); Fangzhao Wu (MSRA); Sundong Kim (Institute for Basic Science); Chuhan Wu (Tsinghua University); Xing Xie (Microsoft Research Asia); Meeyoung Cha (Institute for Basic Science) | N/A | N/A |
| ProposalContrast: Unsupervised Pre-training for LiDAR-based 3D Object Detection | Junbo Yin (Beijing Institute of Technology); Wenguan Wang (Eidgenössische Technische Hochschule Zürich); Dingfu Zhou (Baidu); Jin Fang (Baidu ); Liangjun Zhang (baidu); Cheng-Zhong Xu (University of Macau); Jianbing Shen (Inception Institute of Artificial Intelligence)* | N/A | N/A |
| Audio-Driven Stylized Gesture Generation with Flow-Based Model | Sheng Ye (Tsinghua University)*; Yu-Hui Wen (Tsinghua University); Yanan Sun (Tsinghua University); Ying He (Nanyang Technological University); Ziyang Zhang (HUAWEI TECHNOLOGIES CO.LTD); Yaoyuan Wang (Huawei Technologies Co., Ltd.); Weihua He (Tsinghua University); Yong-Jin Liu (Tsinghua University) | N/A | N/A |
| Unsupervised Domain Adaptation for One-Stage Object Detector using Offsets to Bounding Box | Jayeon Yoo (Seoul National University); Inseop Chung (Seoul National University); Nojun Kwak (Seoul National University)* | N/A | N/A |
| Joint Feature Learning and Relation Modeling for Tracking: A One-Stream Framework | Botao Ye (Institute of Computing Technology, Chinese Academy of Sciences)*; Hong Chang (Chinese Academy of Sciences); Bingpeng MA (University of Chinese Academy of Sciences); Shiguang Shan (Institute of Computing Technology, Chinese Academy of Sciences); Xilin Chen (Institute of Computing Technology, Chinese Academy of Sciences) | N/A | N/A |
| PreTraM: Self-Supervised Pre-training via Connecting Trajectory and Map | Chenfeng Xu (UC Berkeley)*; Tian Li (University of California, San Diego); Chen Tang (UC Berkeley); Lingfeng Sun (UC Berkeley); Kurt Keutzer (EECS, UC Berkeley); Masayoshi TOMIZUKA (MSC Lab); Alireza Fathi (Google); Wei Zhan (University of California, Berkeley) | N/A | N/A |
| DeepPS2: Revisiting Photometric Stereo using Two Differently Illuminated Images | Ashish Tiwari (Indian Institute of Technology Gandhinagar)*; Shanmuganathan Raman (Indian Institute of Technology (IIT) Gandhinagar) | N/A | N/A |
| Learn From All: Erasing Attention Consistency for Noisy Label Facial Expression Recognition | Yuhang Zhang (Beijing University of Posts and Telecommunications); Chengrui Wang (Beijing University of Posts and Telecommunications); Xu Ling (Beijing University of Posts and Telecommunications); Weihong Deng (Beijing University of Posts and Telecommunications)* | N/A | N/A |
| Novel Class Discovery without Forgetting | Joseph K J (Indian Institute of Technology, Hyderabad)*; Sujoy Paul (Google Research); Gaurav Aggarwal (Google); Soma Biswas (Indian Institute of Science, Bangalore); Piyush Rai (IIT Kanpur); Kai Han (The University of Hong Kong); Vineeth N Balasubramanian (Indian Institute of Technology, Hyderabad) | N/A | N/A |
| Self-Constrained Inference Optimization on Structural Groups for Human Pose Estimation | ZheHan Kan (Southern University of Science and Technology); Shuoshuo Chen (Southern University of Science and Technology); Zeng Li (Southern University of Science and Technology); Zhihai He (Southern University of Science and Technology)* | N/A | N/A |
| Predicting is not Understanding: Recognizing and Addressing Underspecification in Machine Learning | Damien Teney (University of Adelaide)*; Maxime Peyrard (EPFL); Ehsan M Abbasnejad (The University of Adelaide) | N/A | N/A |
| A Non-isotropic Probabilistic Take on Proxy-based Deep Metric Learning | Michael Kirchhof (University of Tübingen)*; Karsten Roth (University of Tuebingen); Zeynep Akata (University of Tübingen); Enkelejda Kasneci (University of Tuebingen) | N/A | N/A |
| Relative Pose from SIFT Features | Daniel Barath (ETH Zürich)*; Zuzana Kukelova (Czech Technical University in Prague) | N/A | N/A |
| Monocular 3D Object Reconstruction with GAN Inversion | Junzhe Zhang (Nanyang Technological University)*; Daxuan Ren (Nanyang Technological University); Zhongang Cai (SenseTime International Pte Ltd); Chai Kiat Yeo (Nanyang Technological University); Bo Dai (Shanghai AI Lab); Chen Change Loy (Nanyang Technological University) | N/A | N/A |
| PromptDet: Towards Open-vocabulary Detection using Uncurated Images | Chengjian Feng (Meituan inc.)*; Yujie Zhong (University of Oxford); Zequn Jie (Meituan inc.); Xiangxiang Chu (Meituan); Haibing Ren (Meituan Inc.); Xiaolin Wei (Meituan); Weidi Xie (Shanghai Jiao Tong University); Lin Ma (Meituan) | N/A | N/A |
| Densely Constrained Depth Estimator for Monocular 3D Object Detection | Yingyan Li (CASIA)*; Yuntao Chen (TuSimple); Jiawei He (Institute of Automation, Chinese Academy of Sciences); Zhaoxiang Zhang (Chinese Academy of Sciences, China) | N/A | N/A |
| Content Adaptive Latents and Decoder for Neural Image Compression | Guanbo Pan (Beihang University)*; Guo Lu (Beijing Institute of Technology); Zhihao Hu (Beihang University); Dong Xu (The University of Hong Kong) | N/A | N/A |
| High-Fidelity Image Inpainting with GAN Inversion | Yongsheng YU (University of Chinese Academy of Sciences); Libo Zhang (Institute of Software Chinese Academy of Sciences)*; Heng Fan (University of North Texas); Tiejian Luo (University of Chinese Academy of Sciences) | N/A | N/A |
| Spatially Invariant Unsupervised 3D Object-Centric Learning and Scene Decomposition | Tianyu Wang (The Australian National University); Miaomiao Liu (The Australian National University)*; Kee Siong Ng (The Australian National University) | N/A | N/A |
| W2N: Switching From Weak Supervision to Noisy Supervision for Object Detection | Zitong Huang (Harbin Institute of Technology); Yiping Bao (Megvii(Face++) Inc); Bowen Dong (Harbin Institute of Technology); erjin zhou (megvii); Wangmeng Zuo (Harbin Institute of Technology, China)* | N/A | N/A |
| UnrealEgo: A New Dataset for Robust Egocentric 3D Human Motion Capture | Hiroyasu Akada (Max Planck Institute for Informatics, Keio University); Jian Wang (Max Planck Institute for Informatics); Soshi Shimada (MPI for Informatics); Masaki Takahashi (Keio University); Christian Theobalt (MPI Informatik); Vladislav Golyanik (MPI for Informatics)* | N/A | N/A |
| MotionCLIP: Exposing Human Motion Generation to CLIP Space | Guy Tevet (Tel Aviv University)*; Brian Gordon (Tel Aviv University); Amir Hertz (Tel Aviv University); Amit H Bermano (Tel-Aviv University); Danny Cohen-Or (Tel Aviv University) | N/A | N/A |
| Efficient and Degradation-Adaptive Network for Real-World Image Super-Resolution | Jie Liang (The Hong Kong Polytechnic University)*; Hui Zeng (OPPO); Lei Zhang (Hong Kong Polytechnic University, Hong Kong, China) | N/A | N/A |
| Unidirectional Video Denoising by Mimicking Backward Recurrent Modules with Look-ahead Forward Ones | Junyi Li (Harbin Institute of Technology); Xiaohe Wu (Harbin Institute of technology); zhenxing niu (Alibaba Group-Machine Intelligence Technology); Wangmeng Zuo (Harbin Institute of Technology, China)* | N/A | N/A |
| Map-free Visual Relocalization: Metric Pose Relative to a Single Image | Eduardo Arnold (University of Warwick); Jamie M Wynn (Niantic); Sara Vicente (Niantic); Guillermo Garcia-Hernando (Niantic); Aron Monszpart (Niantic); Victor A Prisacariu (Niantic Labs); Daniyar Turmukhambetov (Niantic); Eric Brachmann (Niantic)* | N/A | N/A |
| DeltaGAN: Towards Diverse Few-shot Image Generation with Sample-Specific Delta | Yan Hong (Shanghai Jiao Tong University); Li Niu (Shanghai Jiao Tong University)*; Jianfu Zhang (Shanghai Jiao Tong University); Liqing Zhang (Shanghai Jiao Tong University) | N/A | N/A |
| Sample-Adaptive Augmentation for Long-Tailed Image Classification | Yan Hong (Shanghai Jiao Tong University); Jianfu Zhang (Shanghai Jiao Tong University)*; Zhongyi Sun (Tencent); Ke Yan (Tencent) | N/A | N/A |
| TokenMix: Rethinking Image Mixing for Data Augmentation in Vision Transformers | Jihao Liu (Sensetime)*; Boxiao Liu (Institute of Computing Technology, Chinese Academy of Sciences); Hang Zhou (The Chinese University of Hong Kong); Hongsheng Li (The Chinese University of Hong Kong); Yu Liu (SenseTime Group LTD) | N/A | N/A |
| UFO: Unified Feature Optimization | Teng Xi (Baidu Inc.)*; Yifan Sun (Baidu Research); Deli Yu (Baidu Inc. ); Bi Li (Baidu Inc.); Nan Peng (Baidu Inc.); gang zhang (Baidu Inc.); Xinyu Zhang (Baidu Inc.); Zhigang Wang (shanghai AI lab); jinwen chen (Baidu Inc.); Jian Wang (Baidu Inc.); liu lufei (Baidu Inc); Haocheng Feng (Baidu Inc.); Junyu Han (Baidu Inc.); jingtuo liu (baidu); Errui Ding (Baidu Inc.); Jingdong Wang (Baidu) | N/A | N/A |
| Master of All: Simultaneous Generalization of Urban-Scene Segmentation to All Adverse Weather Conditions | Nikhil Reddy (IIT Delhi)*; Abhinav Singhal (Indian Institute of Technology, Delhi); Abhishek Kumar (IIT Delhi); Mahsa Baktashmotlagh (University of Queensland); Chetan Arora (Indian Institute of Technology Delhi) | N/A | N/A |
| PalQuant: Accelerating High-precision Networks on Low-precision Accelerators | Qinghao Hu (Institute of Automation, Chinese Academy of Sciences)*; gang li (shanghai jiao tong university); Qiman Wu (Baidu Inc.); Jian Cheng (Chinese Academy of Sciences, China) | N/A | N/A |
| Self-Supervised Learning for Real-World Super-Resolution from Dual Zoomed Observations | Zhilu Zhang (Harbin Institute of Technology); Ruohao Wang (Harbin Institute of Technology); Hongzhi Zhang (Harbin Institute of Technology); Yunjin Chen (ULSee Inc.); Wangmeng Zuo (Harbin Institute of Technology, China)* | N/A | N/A |
| UniMiSS: Universal Medical Self-Supervised Learning via Breaking Dimensionality Barrier | Yutong Xie (University of Adelaide)*; Jianpeng Zhang (Northwestern Polytechnical University); Yong Xia (Northwestern Polytechnical University, Research & Development Institute of Northwestern Polytechnical University in Shenzhen); Qi Wu (University of Adelaide) | N/A | N/A |
| Self-distilled Feature Aggregation for Self-supervised Monocular Depth Estimation | Zhengming Zhou (NLPR-IA-CAS); Qiulei Dong (NLPR-IA-CAS)* | N/A | N/A |
| Negative Samples are at Large: Leveraging Hard-distance Elastic Loss for Re-identification | Hyungtae Lee (DEVCOM Army Research Laboratory)*; Sungmin Eum (Booz Allen Hamilton Inc.); Heesung Kwon (U.S. Army Research Laboratory) | N/A | N/A |
| Global-local Motion Transformer for Unsupervised Skeleton-based Action Learning | Boeun Kim (Seoul National University)*; Hyung Jin Chang (University of Birmingham); Jungho Kim (KETI); Jin Young Choi (Seoul National University) | N/A | N/A |
| Towards Efficient and Scale-Robust Ultra-High-Definition Image Demoiréing | Xin Yu (The University of Hong Kong)*; Peng Dai (The University of Hong Kong); Wenbo Li (The Chinese University of Hong Kong); Lan Ma (TCL Corporate Research); Jiajun Shen (TCL Research); Jia Li (Sun Yat-Sen University); Xiaojuan Qi (The University of Hong Kong) | N/A | N/A |
| Instance Contour Adjustment via Structure-driven CNN | Shuchen Weng (Peking University)*; Yi Wei (Samsung Research America Inc.); Ming-Ching Chang (University at Albany – SUNY); Boxin Shi (Peking University) | N/A | N/A |
| ERDN: Equivalent Receptive Field Deformable Network for Video Deblurring | Bangrui Jiang (Tsinghua University)*; zhihuai xie (Tencent); Zhen Xia (Tencent); Songnan Li (Tencent); Shan Liu (Tencent America) | N/A | N/A |
| Localizing Visual Sounds the Easy Way | Shentong Mo (Carnegie Mellon University); Pedro Morgado (CMU)* | N/A | N/A |
| Polarimetric Pose Prediction | Daoyi Gao (Technical University of Munich)*; Yitong Li (Technical University of Munich); Patrick Ruhkamp (Technical University of Munich); Iuliia Skobleva (Technical University of Munich); Magdalena Wysocki (Technical University of Munich); HyunJun Jung (Technical University of Munich); Pengyuan Wang (TUM); Arturo Guridi (Technical University of Munich); Benjamin Busam (Technical University of Munich) | N/A | N/A |
| DFNet: Enhance Absolute Pose Regression with Direct Feature Matching | Shuai Chen (University of Oxford)*; Xinghui Li (University of Oxford); Zirui Wang (University of Oxford); Victor Adrian Prisacariu (University of Oxford) | N/A | N/A |
| A-OKVQA: A Benchmark for Visual Question Answering using World Knowledge | Dustin Schwenk (Allen Institute for Artificial Intelligence); Apoorv Khandelwal (Allen Institute for AI); Christopher A Clark (Allen Institute for AI); Kenneth Marino (CMU); Roozbeh Mottaghi (Allen Institute for AI)* | N/A | N/A |
| Sound Localization by Self-Supervised Time Delay Estimation | Ziyang Chen (University of Michigan)*; David Fouhey (University of Michigan); Andrew Owens (U Michigan) | N/A | N/A |
| AdaFocus V3: On Unified Spatial-temporal Dynamic Video Recognition | Yulin Wang (Tsinghua University); Yang Yue (Tsinghua University); Xinhong Xu (Tsinghua University); Ali Hassani (University of Oregon); Victor Kulikov (Picsart); Nikita Orlov (PicsArt); Shiji Song (Department of Automation, Tsinghua University); Humphrey Shi (U of Oregon \| UIUC \| PAIR); Gao Huang (Tsinghua)* | N/A | N/A |
| Discrete-Constrained Regression for Local Counting Models | Haipeng Xiong (National University of Singapore)*; Angela Yao (National University of Singapore) | N/A | N/A |
| Towards Regression-Free Neural Networks for Diverse Compute Platforms | Rahul Duggal (Georgia Tech); Hao Zhou (Amazon); Shuo Yang (Amazon); Jun Fang (Amazon)*; Yuanjun Xiong (Amazon); Wei Xia (Amazon) | N/A | N/A |
| Selection and Cross Similarity for Event-Image Deep Stereo | Hoonhee Cho (KAIST)*; Kuk-Jin Yoon (KAIST) | N/A | N/A |
| Long Movie Clip Classification with State-Space Video Models | Md Mohaiminul Islam (UNC Chapel Hill)*; Gedas Bertasius (UNC Chapel Hill) | N/A | N/A |
| Relationship Spatialization for Depth Estimation | xiaoyu xu (University of Waterloo)*; Jiayan Qiu (University of Waterloo); Xinchao Wang (National University of Singapore); Zhou Wang (University of Waterloo) | N/A | N/A |
| Breadcrumbs: Adversarial Class-Balanced Sampling for Long-tailed Recognition | Bo Liu (Wormpex AI Research)*; Haoxiang Li (Wormpex AI Research); Hao Kang (Wormpex AI Research); Gang Hua (Wormpex AI Research); Nuno Vasconcelos (UCSD, USA) | N/A | N/A |
| Image2Point: 3D Point-Cloud Understanding with 2D Image Pretrained Models | Chenfeng Xu (UC Berkeley)*; Shijia Yang (UC Berkeley); Tomer Galanti (Massachusetts Institute of Technology); Bichen Wu (Facebook Research); Xiangyu Yue (University of California, Berkeley); Bohan Zhai (UC Berkeley); Wei Zhan (University of California, Berkeley); Kurt Keutzer (EECS, UC Berkeley); Peter Vajda (Facebook); Masayoshi Tomizuka (University of California, Berkeley) | N/A | N/A |
| Visual Prompt Tuning | Menglin Jia (Cornell University)*; Luming Tang (Cornell University); Bor-Chun Chen (Facebook AI); Claire T Cardie (Cornell University); Serge Belongie (University of Copenhagen); Bharath Hariharan (Cornell University); Ser-Nam Lim (Meta AI) | N/A | N/A |
| Multi-scale and Cross-scale Contrastive Learning for Semantic Segmentation | THEODOROS PISSAS (University College London)*; Claudio S Ravasio (King’s College London (KCL)); Lyndon DaCruz (Moorfields Eye Hospital / University College London); Christos Bergeles (Kings College London) | N/A | N/A |
| Rethinking Generic Camera Models for Deep Single Image Camera Calibration to Recover Rotation and Fisheye Distortion | Nobuhiko Wakai (Panasonic Corporation)*; Satoshi Sato (Panasonic Corporation); Yasunori Ishii (Panasonic Holdings); Takayoshi Yamashita (Chubu University) | N/A | N/A |
| Neural-Sim: Learning to Generate Training Data with NeRF | Yunhao Ge (University of Southern California)*; Harkirat Behl (University of Oxford); Jiashu Xu (USC); Suriya Gunasekar (Microsoft Research); Neel Joshi (MICROSOFT RESEARCH); Yale Song (FAIR); Xin Wang (Microsoft Research); Laurent Itti (University of Southern California); Vibhav Vineet (Microsoft Research) | N/A | N/A |
| Word-Level Fine-Grained Story Visualization | Bowen Li (University of Oxford)* | N/A | N/A |
| Chairs Can be Stood on: Overcoming Object Bias in Human-Object Interaction Detection | Guangzhi Wang (National University of Singapore)*; Yangyang Guo (National University of Singapore); Yongkang Wong (National University of Singapore); Mohan Kankanhalli (National University of Singapore) | N/A | N/A |
| GOCA: Guided Online Cluster Assignment for Self Supervised Video Representation Learning | HUSEYIN COSKUN (Technical University of Munich)*; Alireza Zareian (Snap Inc.); Joshua L Moore (Snapchat); Federico Tombari (Google, TU Munich); Chen Wang (Snap Inc.) | N/A | N/A |
| Learning Audio-Video Modalities from Image Captions | Arsha Nagrani (Google )*; Paul Hongsuck Seo (Google); Bryan Seybold (Google); Anja Hauth (Google AI); Santiago Manen (Google); Chen Sun (Brown University); Cordelia Schmid (Google) | N/A | N/A |
| Inverted Pyramid Multi-task Transformer for Dense Scene Understanding | Hanrong Ye (The Hong Kong University of Science and Technology)*; Dan Xu (The Hong Kong University of Science and Technology) | N/A | N/A |
| Image Inpainting with Cascaded Modulation GAN and Object-Aware Training | Haitian Zheng (University of Rochester)*; Zhe Lin (Adobe Research); Jingwan Lu (Adobe Research); Scott Cohen (Adobe Research); Eli Shechtman (Adobe Research, US); Connelly Barnes (Adobe); Jianming Zhang (Adobe Research); Ning Xu (Adobe Research); Sohrab Amirghodsi (Adobe Research); Jiebo Luo (U. Rochester) | N/A | N/A |
| Planes vs. Chairs: Category-guided 3D shape learning without any 3D cues | Zixuan Huang (Georgia Institute of Technology)*; Stefan Stojanov (Georgia Institute of Technology); Anh Thai (Georgia Institute of Technology); Varun Jampani (Google); James Rehg (Georgia Institute of Technology) | N/A | N/A |
| ART-SS: An Adaptive Rejection Technique for Semi-Supervised restoration for adverse weather-affected images | Rajeev Yasarla (AIBEE)*; Carey E Priebe (Johns Hopkins University); Vishal Patel (Johns Hopkins University) | N/A | N/A |
| Skeleton-Parted Graph Scattering Networks for 3D Human Motion Prediction | Maosen Li (Cooperative Medianet Innovation Center, Shanghai Jiao Tong University)*; Siheng Chen (Shanghai Jiao Tong University); Zijing Zhang (Zhejiang University); Lingxi Xie (Huawei Inc.); Qi Tian (Huawei Cloud & AI); Ya Zhang (Cooperative Medianet Innovation Center, Shanghai Jiao Tong University) | N/A | N/A |
| MHR-Net: Multiple-Hypothesis Reconstruction of Non-Rigid Shapes from 2D Views | Haitian Zeng (University of Technology Sydney)*; Xin Yu (University of Technology Sydney); Jiaxu Miao (Zhejiang University); Yi Yang (Zhejiang University) | N/A | N/A |
| Unifying Event Detection and Captioning as Sequence Generation via Pre-Training | Qi Zhang (Renmin University of China)*; Yuqing Song (Renmin University of China); Qin Jin (Renmin University of China) | N/A | N/A |
| Depth Map Decomposition for Monocular Depth Estimation | Jinyoung Jun (Korea University)*; Jae-Han Lee (Gauss Labs Inc.); Chul Lee (Dongguk University); Chang-Su Kim (Korea University) | N/A | N/A |
| Human-centric Image Cropping with Partition-aware and Content-preserving Features | Bo Zhang (Shanghai Jiao Tong University)*; Li Niu (Shanghai Jiao Tong University); Xing Zhao (Shanghai Jiao Tong University); Liqing Zhang (Shanghai Jiao Tong University) | N/A | N/A |
| Backbone is All Your Need: A Simplified Architecture for Visual Object Tracking | Boyu Chen (The University of Sydney); Peixia Li (The University of Sydney)*; Lei Bai (Shanghai AI Laboratory); Lei Qiao (SenseTime Group Limited); Qiuhong Shen (Harbin Institute of Technology (Shenzhen)); Bo Li (SenseTime Group Limited); Weihao Gan (SenseTime Group Limited); Wei Wu (SenseTime Group Limited); Wanli Ouyang (The University of Sydney) | N/A | N/A |
| StyleFace: Towards Identity-Disentangled Face Generation on Megapixels | Yuchen Luo (Shanghai Jiao Tong University)*; Junwei Zhu (Tencent); Keke He (Tencent); Wenqing Chu (Tencent); Ying Tai (Tencent YouTu); Junchi Yan (Shanghai Jiao Tong University); Chengjie Wang (Tencent; Shanghai Jiao Tong University) | N/A | N/A |
| Fusion from Decomposition: A Self-Supervised Decomposition Approach for Image Fusion | Pengwei Liang (Harbin Institute of Technology)*; Junjun Jiang (Harbin Institute of Technology); Xianming Liu (Harbin Institute of Technology); Jiayi Ma (Wuhan University) | N/A | N/A |
| Learning Degradation Representations for Image Deblurring | Dasong Li (Chinese University of Hong Kong)*; Yi Zhang (CUHK); Ka Chun Cheung (Nvidia); Xiaogang Wang (Chinese University of Hong Kong, Hong Kong); Hongwei Qin (Sensetime); Hongsheng Li (The Chinese University of Hong Kong) | N/A | N/A |
| Aware of the History: Trajectory Forecasting with the Local Behavior Data | Yiqi Zhong (University of Southern California)*; Zhenyang Ni (Shanghai Jiao Tong University); Siheng Chen (Shanghai Jiao Tong University); Ulrich Neumann (USC) | N/A | N/A |
| FAR: Fourier Aerial Video Recognition | Divya Kothandaraman (University of Maryland College Park)*; Tianrui Guan (University of Maryland, College Park); Xijun Wang (University of Maryland, College Park); Shuowen Hu (US Army Research Laboratory); Ming C Lin (UMD-CP & UNC-CH); Dinesh Manocha (University of Maryland at College Park) | N/A | N/A |
| X-Learner: Learning Cross Sources and Tasks for Universal Visual Representation | Yinan He (Beijing University of Posts and Telecommunications)*; Gengshi Huang (School of Electronics and Information Technology, Sun Yat-sen University); Siyu Chen (Carnegie Mellon University); Jianing Teng (SenseTime); Kun Wang (SenseTime Group Limited); Zhenfei Yin (Sensetime); Lu Sheng (Beihang University); Ziwei Liu (Nanyang Technological University); Yu Qiao (Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences); Jing Shao (Sensetime) | N/A | N/A |
| Disentangled Differentiable Network Pruning | Shangqian Gao (University of Pittsburgh)*; Feihu Huang (University of Pittsburgh); Yanfu Zhang (University of Pittsburgh); Heng Huang (University of Pittsburgh) | N/A | N/A |
| Video Extrapolation in Space and Time | Yunzhi Zhang (Stanford University)*; Jiajun Wu (Stanford University) | N/A | N/A |
| IDa-Det: An Information Discrepancy-aware Distillation for 1-bit Detectors | Sheng Xu (Beihang University)*; Yanjing Li (Beihang University); Bohan Zeng (Beihang University); Teli Ma (Shanghai Artificial Intelligence Laboratory); Baochang Zhang (Beihang University); Xianbin Cao (Beihang University, China); Peng Gao (Chinese University of Hong Kong); Jinhu Lu (Beihang University, Beijing, China) | N/A | N/A |
| Multimodal Transformer with Variable-length Memory for Vision-and-Language Navigation | Chuang Lin (Monash University)*; Yi Jiang (Bytedance); Jianfei Cai (Monash University); Lizhen Qu (Monash University); Reza Haffari (Monash University, Australia); Zehuan Yuan (Bytedance.Inc) | N/A | N/A |
| DnA: Improving Few-shot Transfer Learning with Low-Rank Decomposition and Alignment | Ziyu Jiang (Texas A&M University)*; Tianlong Chen (University of Texas at Austin); Xuxi Chen (University of Texas at Austin); Yu Cheng (Microsoft Research); Luowei Zhou (Microsoft); Lu Yuan (Microsoft); Ahmed Awadallah (Microsoft); Zhangyang Wang (University of Texas at Austin) | N/A | N/A |
| Translating a Visual LEGO Manual to a Machine-Executable Plan | Ruocheng Wang (Stanford University)*; Yunzhi Zhang (Stanford University); Jiayuan Mao (MIT); Chin-Yi Cheng (Google Research); Jiajun Wu (Stanford University) | N/A | N/A |
| Cornerformer: Purifying Instances for Corner-based Detectors | Haoran Wei (University of Chinese Academy of Sciences)*; Xin Chen (Huawei Inc.); Lingxi Xie (Huawei Inc.); Qi Tian (Huawei Cloud & AI) | N/A | N/A |
| Contributions of Shape, Texture, and Color in Visual Recognition | Yunhao Ge (University of Southern California)*; Yao Xiao (University of Southern California); Zhi Xu (University of Southern California); Xingrui Wang (University of Southern California); Laurent Itti (University of Southern California) | N/A | N/A |
| Monitored Distillation for Positive Congruent Depth Completion | Tian Yu Liu (UCLA); Parth Agrawal (UCLA); Allison Y Chen (University of California, Los Angeles); Byung-Woo Hong (Chung-Ang University); Alex Wong (Yale University)* | N/A | N/A |
| Towards Unbiased Label Distribution Learning for Facial Pose Estimation Using Anisotropic Spherical Gaussian | Zhiwen Cao (Purdue University); Dongfang Liu (Rochester Institute of Technology)*; Qifan Wang (Meta AI); Yingjie Victor Chen (Purdue University) | N/A | N/A |
| AirDet: Few-Shot Detection without Fine-tuning for Autonomous Exploration | Bowen Li (Tongji University)*; Chen Wang (Carnegie Mellon University); Pranay Reddy Anthireddy (Indian Institute of Information Technology, Design and Manufacturing, Jabalpur); Seungchan Kim (Carnegie Mellon University); Sebastian Scherer (Carnegie Mellon University) | N/A | N/A |
| Learning to Weight Samples for Dynamic Early-exiting Networks | Yizeng Han (Tsinghua University); Yifan Pu (Tsinghua University); Zihang Lai (CMU); Chaofei Wang (Tsinghua University); Shiji Song (Department of Automation, Tsinghua University); Cao Junfeng (CMRI); Wenhui Huang (CMRI); Chao Deng (China Mobile Research Institute); Gao Huang (Tsinghua)* | N/A | N/A |
| Constrained Mean Shift Using Distant Yet Related Neighbors for Representation Learning | K L Navaneet (University of California, Davis); Soroush Abbasi Koohpayegani (University of Maryland Baltimore County)*; Ajinkya B Tejankar (UMBC); Kossar Pourahmadi Meibodi (University of Maryland, Baltimore County); Akshayvarun Subramanya (UMBC); Hamed Pirsiavash (University of California Davis) | N/A | N/A |
| SLIP: Self-supervision meets Language-Image Pre-training | Norman Mu (University of California, Berkeley)*; Alexander Kirillov (Facebook AI Research); David Wagner (UC Berkeley); Saining Xie (Facebook AI Research) | N/A | N/A |
| Learning Visual Styles from Audio-Visual Associations | Tingle Li (Tsinghua University)*; Yichen Liu (Tsinghua University); Andrew Owens (U Michigan); Hang Zhao (Tsinghua University) | N/A | N/A |
| Dynamic Low-Resolution Distillation for Cost-Efficient End-to-End Text Spotting | Ying Chen (Hikvision Research Institute); Liang Qiao (Zhejiang University & Hikvision Research Institute)*; Zhanzhan Cheng (Zhejiang University & Hikvision Research Institute); Shiliang Pu (Hikvision Research Institute); Yi Niu (Hikvision Research Institute); Xi Li (Zhejiang University) | N/A | N/A |
| Prompting Visual-Language Models for Efficient Video Understanding | Chen Ju (Cooperative Medianet Innovation Center, Shanghai Jiao Tong University); Tengda Han (University of Oxford); Kunhao Zheng (Shanghai Jiaotong University); Ya Zhang (Cooperative Medianet Innovation Center, Shanghai Jiao Tong University); Weidi Xie (Shanghai Jiao Tong University)* | N/A | N/A |
| One-Trimap Video Matting | Hongje Seong (Yonsei University)*; Seoung Wug Oh (Adobe Research); Brian Price (Adobe); Euntai Kim (Yonsei University); Joon-Young Lee (Adobe Research) | N/A | N/A |
| Contrastive Learning for Diverse Disentangled Foreground Generation | Yuheng Li (UW Madison)*; Yijun Li (Adobe Research); Jingwan Lu (Adobe Research); Eli Shechtman (Adobe Research, US); Yong Jae Lee (University of Wisconsin-Madison); Krishna Kumar Singh (Adobe Research) | N/A | N/A |
| Resolution-free Point Cloud Sampling Network with Data Distillation | Tianxin Huang (Zhejiang University)*; Jiangning Zhang (Zhejiang University); Jun Chen (Zhejiang University); Yuang Liu (Zhejiang University); Yong Liu (Zhejiang University) | N/A | N/A |
| BIPS: Bi-modal Indoor Panorama Synthesis via Residual Depth-aided Adversarial Learning | Changgyoon Oh (KAIST)*; Wonjune Cho (NAVER LABS); Yujeong Chae (KAIST); Daehee Park (KAIST); Lin Wang (HKUST); Kuk-Jin Yoon (KAIST) | N/A | N/A |
| Augmentation of rPPG Benchmark Datasets: Learning to Remove and Embed rPPG Signals via Double Cycle Consistent Learning from Unpaired Facial Videos | WEI-HAO Chung (National Tsing Hua University)*; CHENG-JU HSIEH (National Tsing Hua University); Chiou-Ting Hsu (National Tsing Hua University) | N/A | N/A |
| Fabric Material Recovery from Video Using Multi-Scale Geometric Auto-Encoder | Junbang Liang (University of Maryland, College Park)*; Ming C Lin (UMD-CP & UNC-CH) | N/A | N/A |
| An Invisible Black-box Backdoor Attack through Frequency Domain | Tong Wang (Nanjing University); Yuan Yao (Nanjing University)*; Feng Xu (Nanjing University); Shengwei An (Purdue University); Hanghang Tong (University of Illinois at Urbana-Champaign); Ting Wang (Penn State) | N/A | N/A |
| Learning Mutual Modulation for Self-Supervised Cross-Modal Super-Resolution | Xiaoyu Dong (The University of Tokyo / RIKEN AIP); Naoto Yokoya (The University of Tokyo)*; Longguang Wang (National University of Defense Technology); Tatsumi Uezato (Hitachi, Ltd) | N/A | N/A |
| TransGrasp: Grasp Pose Estimation of a Category of Objects by Transferring Grasps from Only One Labeled Instance | Hongtao Wen (Dalian University of Technology); Jianhang Yan (Dalian University of Technology); Wanli Peng (Dalian University of Technology)*; Yi Sun (Dalian University of Technology) | N/A | N/A |
| Learning Instance and Task-Aware Dynamic Kernels for Few-shot Learning | Rongkai Ma (Monash University)*; Pengfei Fang (The Australian National University); Gil Avraham (Monash University); Yan Zuo (CSIRO); Tianyu Zhu (Monash University); Tom Drummond (University of Melbourne); Mehrtash Harandi (Monash University) | N/A | N/A |
| PillarNet: Real-Time and High-Performance Pillar-based 3D Object Detection | Guangsheng Shi (Harbin Institute of Technology)*; Ruifeng Li (Harbin Institute of Technology); Chao Ma (Shanghai Jiao Tong University) | N/A | N/A |
| Robust Object Detection With Inaccurate Bounding Boxes | Chengxin Liu (Huazhong University of Science and Technology); Kewei Wang (Huazhong Univ. of Sci.&Tech.); Hao Lu (Huazhong University of Science and Technology); Zhiguo Cao (Huazhong Univ. of Sci.&Tech.)*; Ziming Zhang (Worcester Polytechnic Institute) | N/A | N/A |
| Revisiting the Critical Factors of Augmentation-Invariant Representation Learning | Junqiang Huang (MEGVII Technology)*; Xiangwen Kong (MEGVII Technology); Xiangyu Zhang (Megvii Technology) | N/A | N/A |
| A Fast Knowledge Distillation Framework for Visual Recognition | Zhiqiang Shen (Carnegie Mellon University)*; Eric Xing (MBZUAI, CMU, and Petuum Inc.) | N/A | N/A |
| MegBA: A GPU-Based Distributed Library for Large-Scale Bundle Adjustment | Jie Ren (Megvii Inc.); Wenteng Liang (Megvii); Ran Yan (Megvii)*; Luo Mai (University of Edinburgh); Shiwen Liu (Megvii); Xiao Liu (Megvii Inc) | N/A | N/A |
| Spectrum-aware and Transferable Architecture Search for Hyperspectral Image Restoration | Wei He (Wuhan University)*; Quanming Yao (Tsinghua University); Naoto Yokoya (The University of Tokyo); Tatsumi Uezato (Hitachi, Ltd); Hongyan Zhang (Wuhan University); Liangpei Zhang (Wuhan University) | N/A | N/A |
| Boosting Transferability of Targeted Adversarial Examples via Hierarchical Generative Networks | Xiao Yang (Tsinghua University)*; Yinpeng Dong (Tsinghua University); Tianyu Pang (Sea AI Lab); Hang Su (Tsinghua University); Jun Zhu (Tsinghua University) | N/A | N/A |
| Exploring the Devil in Graph Spectral Domain for 3D Point Cloud Attacks | Qianjiang Hu (Peking University); Daizong Liu (Peking University); Wei Hu (Peking University)* | N/A | N/A |
| Geometry-aware Single-image Full-body Human Relighting | Chaonan Ji (Tsinghua University); Tao Yu (Tsinghua University); Kaiwen Guo (Google); JINGXIN LIU (OPPO); Yebin Liu (Tsinghua University)* | N/A | N/A |
| Optical Flow Training under Limited Label Budget via Active Learning | Shuai Yuan (Duke University)*; Xian Sun (Duke University); Hannah H Kim (Duke University); Shuzhi Yu (Duke University); Carlo Tomasi (Duke University) | N/A | N/A |
| RVSL: Robust Vehicle Similarity Learning in Real Hazy Scenes Based on Semi-supervised Learning | Wei-Ting Chen (National Taiwan University)*; I-HSIANG CHEN (National Taiwan University); CHIH-YUAN YEH (National Taiwan University); Hao-Hsiang Yang (National Taiwan University); Hua-En Chang (National Taiwan University); Jian-Jiun Ding (National Taiwan University); Sy-Yen Kuo (National Taiwan University) | N/A | N/A |
| Hierarchical Feature Embedding for Visual Tracking | Zhixiong Pi (Huazhong University of Science and Technology)*; Weitao Wan (Tencent); Chong Sun (Tencent Wechat); Changxin Gao (Huazhong University of Science and Technology); Nong Sang (Huazhong University of Science and Technology); Chen Li (Tencent) | N/A | N/A |
| Neural Color Operators for Sequential Image Retouching | YILI WANG (Tsinghua University); Xin Li (Baidu); Kun Xu (Tsinghua University)*; Dongliang He (Baidu); Qi Zhang (Baidu); Fu Li (Baidu); Errui Ding (Baidu Inc.) | N/A | N/A |
| Optimizing Image Compression via Joint Learning with Denoising | Ka Leong Cheng (The Hong Kong University of Science and Technology); Yueqi Xie (The Hong Kong University of Science and Technology); Qifeng Chen (HKUST)* | N/A | N/A |
| DICE: Leveraging Sparsification for Out-of-Distribution Detection | Yiyou Sun (University of Wisconsin Madison); Yixuan Li (University of Wisconsin-Madison)* | N/A | N/A |
| DeMFI: Deep Joint Deblurring and Multi-Frame Interpolation with Flow-Guided Attentive Correlation and Recursive Boosting | Jihyong Oh (KAIST)*; Munchurl Kim (Korea Advanced Institute of Science and Technology) | N/A | N/A |
| Invariant Feature Learning for Generalized Long-Tailed Classification | Kaihua Tang (Nanyang Technological University)*; Mingyuan Tao (Damo Academy, Alibaba Group); Jiaxin Qi (Nanyang Technological University); Zhenguang Liu (Zhejiang University); Hanwang Zhang (Nanyang Technological University) | N/A | N/A |
| Fine-Grained Visual Entailment | Christopher L Thomas (Columbia University)*; Yipeng Zhang (Columbia University); Shih-Fu Chang (Columbia University) | N/A | N/A |
| Sliced Recursive Transformer | Zhiqiang Shen (Carnegie Mellon University)*; Zechun Liu (Carnegie Mellon University); Eric Xing (MBZUAI, CMU, and Petuum Inc.) | N/A | N/A |
| Lightweight Attentional Feature Fusion: A New Baseline for Text-to-Video Retrieval | Fan Hu (Renmin University of China); Aozhu Chen (Renmin University of China); Ziyue Wang (Renmin University of China); Fangming Zhou (Renmin University of China); Jianfeng Dong (Zhejiang Gongshang University); Xirong Li (Renmin University of China)* | N/A | N/A |
| Asymmetric Relation Consistency Reasoning for Video Relation Grounding | Huan Li (Xi’an Jiaotong University); Ping Wei (Xi’an Jiaotong University)*; Jiapeng Li (Xi’an Jiaotong University); Zeyu Ma (Xi’an Jiaotong University); Jiahui Shang (Xi’an Jiaotong University); Nanning Zheng (Xi’an Jiaotong University) | N/A | N/A |
| PETR: Position Embedding Transformation for Multi-View 3D Object Detection | Yingfei Liu (Megvii Technology); Tiancai Wang (Megvii Technology)*; Xiangyu Zhang (Megvii Technology); Jian Sun (Megvii Technology) | N/A | N/A |
| Contextual Text Block Detection towards Scene Text Understanding | Chuhui Xue (Nanyang Technological University); Jiaxing Huang (Nanyang Technological University); Wenqing Zhang (ByteDance); Shijian Lu (Nanyang Technological University)*; Changhu Wang (ByteDance.Inc); Song Bai (University of Oxford) | N/A | N/A |
| Structure-aware Editable Morphable Model for 3D Facial Detail Animation and Manipulation | Jingwang Ling (Tsinghua University); Zhibo Wang (Tsinghua University); Ming Lu (Intel Labs China); Quan Wang (Sensetime); Chen Qian (SenseTime); Feng Xu (Tsinghua University)* | N/A | N/A |
| UniNet: Unified Architecture Search with Convolution, Transformer, and MLP | Jihao Liu (Sensetime)*; Xin Huang (Waseda University); Guanglu Song (Sensetime); Hongsheng Li (The Chinese University of Hong Kong); Yu Liu (SenseTime Group LTD) | N/A | N/A |
| Efficient Decoder-free Object Detection with Transformers | Peixian Chen (Youtu Tencent); Mengdan Zhang (Youtu, Tencent); Yunhang Shen (Xiamen University); Kekai Sheng (Youtu Lab, Tencent Inc.); Yuting Gao (Tencent); Xing Sun (Shopee); Ke Li (Tencent)*; Chunhua Shen (University of Adelaide, Australia) | N/A | N/A |
| Rethinking Keypoint Representations: Modeling Keypoints and Poses as Objects for Multi-Person Human Pose Estimation | William McNally (University of Waterloo)*; Kanav Vats (University of Waterloo); Alexander Wong (University of Waterloo); John McPhee (University of Waterloo) | N/A | N/A |
| CA-SSL: Class-Agnostic Semi-Supervised Learning for Detection and Segmentation | Lu Qi (The Chinese University of Hong Kong)*; Jason Kuen (Adobe Research); Zhe Lin (Adobe Research); Jiuxiang Gu (Adobe Research); Fengyun Rao (Tencent); Dian Li (Tencent.com); Weidong Guo (Tencent); Zhen Wen (Tencent Technology (Shenzhen) Co., Ltd); Ming-Hsuan Yang (University of California at Merced); Jiaya Jia (Chinese University of Hong Kong) | N/A | N/A |
| StARformer: Transformer with State-Action-Reward Representations for Visual Reinforcement Learning | Jinghuan Shang (Stony Brook University)*; Kumara Kahatapitiya (Stony Brook University); Xiang Li (Stony Brook University); Michael S Ryoo (Stony Brook/Google) | N/A | N/A |
| S2Net: Stochastic Sequential Pointcloud Forecasting | Xinshuo Weng (NVIDIA Research)*; Junyu Nan (Carnegie Mellon University); Kuan-Hui Lee (Toyota Research Institute); Rowan McAllister (Toyota Research Institute); Adrien Gaidon (Toyota Research Institute); Nicholas Rhinehart (UC Berkeley); Kris Kitani (Carnegie Mellon University) | N/A | N/A |
| D3Net: A Unified Speaker-Listener Architecture for 3D Dense Captioning and Visual Grounding | Zhenyu Chen (Technical University of Munich)*; Qirui Wu (Simon Fraser University); Matthias Niessner (Technical University of Munich); Angel X Chang (Simon Fraser University) | N/A | N/A |
| AMixer: Adaptive Weight Mixing for Self-Attention Free Vision Transformers | Yongming Rao (Tsinghua University); Wenliang Zhao (Tsinghua University); Jie Zhou (Tsinghua University); Jiwen Lu (Tsinghua University)* | N/A | N/A |
| Neural Image Representations for Multi-Image Fusion and Layer Separation | Seonghyeon Nam (York University); Marcus A Brubaker (York University); Michael S Brown (York University)* | N/A | N/A |
| Panoramic Human Activity Recognition | Ruize Han (College of Intelligence and Computing, Tianjin University); Haomin Yan (Tianjin University); Jiacheng Li (College of Intelligence and Computing, Tianjin University); Songmiao Wang (Tianjin University); Wei Feng (College of Intelligence and Computing, Tianjin University, China)*; Song Wang (University of South Carolina) | N/A | N/A |
| Compiler-Aware Neural Architecture Search for On-Mobile Real-time Super-Resolution | Yushu Wu (Northeastern University)*; Yifan Gong (Northeastern University); Pu Zhao (Northeastern University); Yanyu Li (Northeastern University); Zheng Zhan (Northeastern University); Wei Niu (William & Mary); Hao Tang (ETH Zurich); Minghai Qin (Western Digital Research); Bin Ren (William & Mary); Yanzhi Wang (Northeastern University) | N/A | N/A |
| Dual Adaptive Transformations for Weakly Supervised Point Cloud Segmentation | Zhonghua Wu (Nanyang Technological University)*; Yicheng Wu (Monash University); Guosheng Lin (Nanyang Technological University); Jianfei Cai (Monash University); Chen Qian (SenseTime) | N/A | N/A |
| Modality Synergy Complement Learning with Cascaded Aggregation for Visible-Infrared Person Re-Identification | Yiyuan Zhang (Beijing Institute of Technology); Sanyuan Zhao (Beijing Institute of Technology)*; Yuhao Kang (Beijing Institute of Technology); Jianbing Shen (Inception Institute of Artificial Intelligence) | N/A | N/A |
| RA-Depth: Resolution Adaptive Self-Supervised Monocular Depth Estimation | Mu He (Nanjing University of Science and Technology)*; Le Hui (Nanjing University of Science and Technology); Yikai Bian (Nanjing University of Science and Technology); Jian Ren (Nanjing University of Science and Technology); Jin Xie (Nanjing University of Science and Technology); Jian Yang (Nanjing University of Science and Technology) | N/A | N/A |
| MoFaNeRF: Morphable Facial Neural Radiance Field | Yiyu Zhuang (Nanjing University); Hao Zhu (Nanjing University)*; Xusen Sun (Nanjing University); Xun Cao (Nanjing University) | N/A | N/A |
| Visual Cross-View Metric Localization with Dense Uncertainty Estimates | Zimin Xia (Delft University of Technology)*; Olaf Booij (TomTom); Marco Manfredi (TomTom); Julian F P Kooij (Delft University of Technology) | N/A | N/A |
| The One Where They Reconstructed 3D Humans and Environments in TV Shows | Georgios Pavlakos (UC Berkeley)*; Ethan Weber (UC Berkeley); Matthew Tancik (UC Berkeley); Angjoo Kanazawa (University of California Berkeley) | N/A | N/A |
| PointInst3D: Segmenting 3D Instances by Points | Tong He (University of Adelaide)*; Wei Yin (University of Adelaide); Chunhua Shen (University of Adelaide, Australia); Anton van den Hengel (University of Adelaide) | N/A | N/A |
| PolyphonicFormer: Unified Query Learning for Depth-aware Video Panoptic Segmentation | Haobo Yuan (Wuhan University)*; Xiangtai Li (Peking University); Yibo Yang (Peking University); Guangliang Cheng (Sensetime Group Limited); Jing Zhang (The University of Sydney); Yunhai Tong (Peking University); Lefei Zhang (Wuhan University); Dacheng Tao (JD.com) | N/A | N/A |
| Quasi-Balanced Self-Training on Noise-Aware Synthesis of Object Point Clouds for Closing Domain Gap | Yongwei Chen (South China University of Technology); ZiHao Wang (South China University of Technology); Longkun Zou (South China University of Technology); Ke Chen (South China University of Technology); Kui Jia (South China University of Technology)* | N/A | N/A |
| TinyViT: Fast Pretraining Distillation for Small Vision Transformers | Kan Wu (Sun Yat-sen University); Jinnian Zhang (University of Wisconsin Madison); Houwen Peng (Microsoft Research)*; Mengchen Liu (Microsoft); Bin Xiao (Microsoft); Jianlong Fu (Microsoft Research); Lu Yuan (Microsoft) | N/A | N/A |
| VirtualPose: Learning Generalizable 3D Human Pose Models from Virtual Data | Jiajun Su (Peking University)*; Chunyu Wang (Microsoft Research Asia); Xiaoxuan Ma (Peking University); Wenjun Zeng (EIT Institute for Advanced Study); Yizhou Wang (PKU) | N/A | N/A |
| Poseur: Direct Human Pose Regression with Transformers | Weian Mao (The University of Adelaide)*; Yongtao Ge (The University of Adelaide); Chunhua Shen (University of Adelaide, Australia); Xinlong Wang (University of Adelaide); Zhi Tian (Meituan); Zhibin Wang (Alibaba Group); Anton van den Hengel (University of Adelaide) | N/A | N/A |
| Adaptive Image Transformations for Transfer-based Adversarial Attack | Zheng Yuan (Institute of Computing Technology, Chinese Academy of Sciences); Jie Zhang (ICT, CAS)*; Shiguang Shan (Institute of Computing Technology, Chinese Academy of Sciences) | N/A | N/A |
| D2ADA: Dynamic Density-aware Active Domain Adaptation for Semantic Segmentation | Tsung-Han Wu (National Taiwan University)*; Yi-Syuan Liou (National Taiwan University); Shao-Ji Yuan (National Taiwan University); Hsin-Ying Lee (National Taiwan University); Tung-I Chen (National Taiwan University); Kuan-Chih Huang (National Taiwan University); Winston H. Hsu (National Taiwan University) | N/A | N/A |
| SQN: Weakly-Supervised Semantic Segmentation of Large-Scale 3D Point Clouds | Qingyong Hu (University of Oxford); Bo Yang (The Hong Kong Polytechnic University)*; Guangchi Fang (Sun Yat-sen University); Yulan Guo (Sun Yat-sen University); Ales Leonardis (University of Birmingham); Niki Trigoni (University of Oxford); Andrew Markham (University of Oxford) | N/A | N/A |
| Deep Portrait Delighting | Joshua William Weir (Victoria University of Wellington)*; Junhong Zhao (CMIC); Andrew Chalmers (CMIC); Taehyun Rhee (Victoria University of Wellington) | N/A | N/A |
| Vector Quantized Image-to-Image Translation | Yu-Jie Chen (National Chiao Tung University); Shin-I Cheng (National Chiao Tung University); Wei-Chen Chiu (National Chiao Tung University)*; Hung-Yu Tseng (Facebook); Hsin-Ying Lee (Snap Inc) | N/A | N/A |
| PointMixer: MLP-Mixer for Point Cloud Understanding | Jaesung Choe (KAIST)*; Chunghyun Park (POSTECH); Francois Rameau (KAIST); Jaesik Park (POSTECH); In So Kweon (KAIST) | N/A | N/A |
| V2X-ViT: Vehicle-to-Everything Cooperative Perception with Vision Transformer | Runsheng Xu (University of California, Los Angeles); Hao Xiang (University of California, Los Angeles); Zhengzhong Tu (University of Texas at Austin); Xin Xia (University of California, Los Angeles); Ming-Hsuan Yang (University of California at Merced); Jiaqi Ma (University of California, Los Angeles)* | N/A | N/A |
| Cross-Domain Ensemble Distillation for Domain Generalization | Kyungmoon Lee (POSTECH)*; Sungyeon Kim (POSTECH); Suha Kwak (POSTECH) | N/A | N/A |
| Cross-Modal 3D Shape Generation and Manipulation | Zezhou Cheng (University of Massachusetts, Amherst)*; Menglei Chai (Snap Inc.); Jian Ren (Snap Inc.); Hsin-Ying Lee (Snap Inc); Kyle B Olszewski (Snap Inc.); Zeng Huang (Snap Inc.); Subhransu Maji (University of Massachusetts, Amherst); Sergey Tulyakov (Snap Inc) | N/A | N/A |
| Latent Partition Implicit with Surface Codes for 3D Representation | Chao Chen (Tsinghua University); Yu-Shen Liu (Tsinghua University)*; Zhizhong Han (Wayne State University) | N/A | N/A |
| FILM: Frame Interpolation for Large Motion | Fitsum Reda (Google)*; Janne Kontkanen (Google); Eric Tabellion (Google); Deqing Sun (Google); Caroline Pantofaru (Google Research); Brian Curless (University of Washington) | N/A | N/A |
| Facial Depth and Normal Estimation using Single Dual-Pixel Camera | Minjun Kang (KAIST)*; Jaesung Choe (KAIST); Hyowon Ha (Facebook); Hae-Gon Jeon (GIST); Sunghoon Im (DGIST); In So Kweon (KAIST); Kuk-Jin Yoon (KAIST) | N/A | N/A |
| Initialization and Alignment for Adversarial Texture Optimization | Xiaoming Zhao (University of Illinois at Urbana-Champaign)*; Zhizhen Zhao (University of Illinois at Urbana-Champaign); Alexander Schwing (UIUC) | N/A | N/A |
| Regularizing Vector Embedding in Bottom-Up Human Pose Estimation | Haixin Wang (School of Artificial Intelligence, University of Chinese Academy of Sciences)*; Lu Zhou (CASIA); Yingying Chen (CASIA); Ming Tang (Institute of Automation, Chinese Academy of Sciences); Jinqiao Wang (Institute of Automation, Chinese Academy of Sciences) | N/A | N/A |
| Equivariant Hypergraph Neural Networks | Jinwoo Kim (KAIST); Saeyoon Oh (KAIST); Sungjun Cho (LG AI Research); Seunghoon Hong (KAIST)* | N/A | N/A |
| Learning Quality-aware Dynamic Memory for Video Object Segmentation | Yong Liu (Tsinghua University)*; Ran Yu (Tsinghua University); Fei Yin (Tsinghua University); Xinyuan Zhao (Huawei); Wei Zhao (Huawei); Weihao Xia (University College London); Yujiu Yang (Tsinghua University) | N/A | N/A |
| Neural Scene Decoration from a Single Photograph | Hong Wing Pang (The Hong Kong University of Science and Technology)*; Yingshu Chen (The Hong Kong University of Science and Technology); Phuoc-Hieu T. Le (VinAI Research); Binh-Son Hua (VinAI Research); Thanh Nguyen (Deakin University, Australia); Sai-Kit Yeung (Hong Kong University of Science and Technology) | N/A | N/A |
| Bottom Up Top Down Detection Transformers for Language Grounding in Images and Point Clouds | Ayush Jain (Carnegie Mellon University)*; Nikolaos Gkanatsios (Carnegie Mellon University); Ishita Mediratta (Meta AI); Katerina Fragkiadaki (Carnegie Mellon University) | N/A | N/A |
| CIRCLE: Convolutional Implicit Reconstruction and Completion for Large-scale Indoor Scene | Hao-Xiang Chen (Tsinghua University)*; Jiahui Huang (Tsinghua University); Tai-Jiang Mu (Tsinghua University); Shi-Min Hu (Tsinghua University) | N/A | N/A |
| Discovering Deformable Keypoint Pyramids | Jianing Qian (University of Pennsylvania)*; Anastasios Panagopoulos (University of Pennsylvania); Dinesh Jayaraman (University of Pennsylvania) | N/A | N/A |
| TIDEE: Tidying Up Novel Rooms using Visuo-Semantic Commonsense Priors | Gabriel Sarch (Carnegie Mellon University)*; Zhaoyuan Fang (Carnegie Mellon University); Adam Harley (Carnegie Mellon University); Paul Schydlo (Carnegie Mellon University); Michael J Tarr (Carnegie Mellon University); Saurabh Gupta (UIUC); Katerina Fragkiadaki (Carnegie Mellon University) | N/A | N/A |
| MOTR: End-to-End Multiple-Object Tracking with TRansformer | Fangao Zeng (Megvii Technology); Bin Dong (Megvii Technology); Yuang Zhang (Shanghai Jiao Tong University); Tiancai Wang (Megvii Technology)*; Xiangyu Zhang (Megvii Technology); Yichen Wei (Megvii Research Shanghai) | N/A | N/A |
| K-centered Patch Sampling for Efficient Video Recognition | Seong Hyeon Park (KAIST AI)*; Jihoon Tack (KAIST); Byeongho Heo (NAVER AI LAB); Jung-Woo Ha (NAVER CLOVA AI Lab); Jinwoo Shin (KAIST) | N/A | N/A |
| Learning Implicit Feature Alignment Function for Semantic Segmentation | Hanzhe Hu (Peking University)*; Yinbo Chen (UC San Diego); Jiarui Xu (University of California San Diego); Shubhankar Borse (Qualcomm AI Research); Hong Cai (Qualcomm AI Research); Fatih Porikli (Qualcomm AI Research); Xiaolong Wang (UCSD) | N/A | N/A |
| A Visual Navigation Perspective for Category-Level Object Pose Estimation | Jiaxin Guo (Zhejiang University)*; Yiyi Liao (MPI-IS and University of Tübingen); Zhong Fangxun (CUHK); Rong Xiong (Zhejiang University); Yunhui Liu (CUHK); Yue Wang (Zhejiang University) | N/A | N/A |
| ScaleNet: Searching for the Model to Scale | Jiyang Xie (Huawei Noah’s Ark Lab); Xiu Su (University of Sydney); Shan You (SenseTime); Zhanyu Ma (Beijing University of Posts and Telecommunications)*; Fei Wang (University of Science and Technology of China); Chen Qian (SenseTime) | N/A | N/A |
| Centrality and Consistency: Two-Stage Clean Samples Identification for Learning with Instance-Dependent Noisy Labels | Ganlong Zhao (The University of Hong Kong); Guanbin Li (Sun Yat-sen University)*; Yipeng Qin (Cardiff University); Feng Liu (Deepwise AI Lab); Yizhou Yu (The University of Hong Kong) | N/A | N/A |
| GALA: Toward Geometry-and-Lighting-Aware Object Search for Compositing | Sijie Zhu (University of Central Florida)*; Zhe Lin (Adobe Research); Scott Cohen (Adobe Research); Jason Kuen (Adobe Research); Zhifei Zhang (Adobe Research); Chen Chen (University of Central Florida) | N/A | N/A |
| FairGRAPE: Fairness-aware GRAdient Pruning mEthod for Face Attribute Classification | Xiaofeng Lin (University of California – Los Angeles); Seungbae Kim (University of South Florida); Jungseock Joo (University of California Los Angeles)* | N/A | N/A |
| Tackling Background Distraction in Video Object Segmentation | Suhwan Cho (Yonsei University)*; Heansung Lee (Yonsei University); Minhyeok Lee (Yonsei University); Chaewon Park (Yonsei University); Sungjun Jang (Yonsei University); Minjung Kim (Yonsei University); Sangyoun Lee (Yonsei University) | N/A | N/A |
| Hyperspherical Learning in Multi-Label Classification | Bo Ke (Tencent Youtu Lab)*; Yunquan Zhu (Tencent YouTu Lab); Mengtian Li (East China Normal University); Xiujun Shu (Tencent Youtu Lab); Ruizhi Qiao (Tencent Youtu Lab); Bo Ren (Tencent) | N/A | N/A |
| The Surprisingly Straightforward Scene Text Removal Method With Gated Attention and Region of Interest Generation: A Comprehensive Prominent Model Analysis | Hyeonsu Lee (Naver Corporation)*; Chankyu Choi (Naver Corporation) | N/A | N/A |
| FingerprintNet: Synthesized Fingerprints for Generated Image Detection | Yonghyun Jeong (NAVER CLOVA)*; Doyeon Kim (Line+); Youngmin Ro (Samsung SDS); Pyounggeon Kim (SDS); Jongwon Choi (Chung-Ang University) | N/A | N/A |
| ParticleSfM: Exploiting Dense Point Trajectories for Localizing Moving Cameras in the Wild | Wang Zhao (Tsinghua University)*; Shaohui Liu (ETH Zurich); Hengkai Guo (ByteDance AI Lab); Wenping Wang (The University of Hong Kong); Yong-Jin Liu (Tsinghua University) | N/A | N/A |
| Free-Viewpoint RGB-D Human Performance Capture and Rendering | Phong Ha Nguyen (University of Oulu)*; Nikolaos Sarafianos (Facebook Reality Labs); Christoph Lassner (Meta Reality Labs Research); Janne Heikkila (University of Oulu, Finland); Tony Tung (Facebook) | N/A | N/A |
| When Active Learning Meets Implicit Semantic Data Augmentation | Zhuangzhuang Chen (Shenzhen University); Jin Zhang (Shenzhen University); Pan Wang (Shenzhen University); Jie Chen (Shenzhen University); Jianqiang Li (Shenzhen University)* | N/A | N/A |
| Multiview Regenerative Morphing with Dual Flows | Chih-Jung Tsai (National Tsing Hua University); Cheng Sun (National Tsing Hua University); Hwann-Tzong Chen (National Tsing Hua University)* | N/A | N/A |
| Frequency and Spatial Dual Guidance for Image Dehazing | Hu Yu (University of Science and Technology of China); Naishan Zheng (University of Science and Technology of China); Man Zhou (University of Science and Technology of China); Jie Huang (University of Science and Technology of China); Zeyu Xiao (University of Science and Technology of China); Feng Zhao (University of Science and Technology of China)* | N/A | N/A |
| The Anatomy of Video Editing: A Dataset and Benchmark Suite for AI-Assisted Video Editing | Dawit Mureja Argaw (KAIST)*; Fabian Caba (Adobe Research); Joon-Young Lee (Adobe Research); Markus Woodson (Adobe); In So Kweon (KAIST) | N/A | N/A |
| Hallucinating Pose-Compatible Scenes | Tim Brooks (UC Berkeley)*; Alexei A Efros (UC Berkeley) | N/A | N/A |
| Faster VoxelPose: Real-time 3D Human Pose Estimation by Orthographic Projection | Hang Ye (Peking University); Wentao Zhu (Peking University)*; Chunyu Wang (Microsoft Research Asia); Rujie Wu (Peking University); Yizhou Wang (PKU) | N/A | N/A |
| Video Interpolation by Event-driven Anisotropic Adjustment of Optical Flow | Song Wu (Huawei Technologies Co., Ltd.); Kaichao You (Tsinghua Univ); Weihua He (Tsinghua University)*; Chen Yang (Peking University); Yang Tian (Tsinghua University); Yaoyuan Wang (Huawei Technologies Co., Ltd.); Jianxing Liao (HUAWEI TECHNOLOGIES CO.LTD); Ziyang Zhang (HUAWEI TECHNOLOGIES CO.LTD) | N/A | N/A |
| Motion and Appearance Adaptation for Cross-Domain Motion Transfer | Borun Xu (University of Electronic Science and Technology of China)*; Biao Wang (Alibaba Group); Jinhong Deng (University of Electronic Science and Technology of China); Jiale Tao (University of Electronic Science and Technology of China); Tiezheng Ge (Alibaba Group); Yuning Jiang (Alibaba Group); Wen Li (University of Electronic Science and Technology of China); Lixin Duan (University of Electronic Science and Technology of China) | N/A | N/A |
| AdaBin: Improving Binary Neural Networks with Adaptive Binary Sets | Zhijun Tu (Institute of Artificial Intelligence and Robotics, Xi’an Jiaotong university)*; Xinghao Chen (Huawei Noah’s Ark Lab); Pengju Ren (Institute of Artificial Intelligence at Xi’an Jiaotong University); Yunhe Wang (Huawei Technologies) | N/A | N/A |
| Social-Implicit: Rethinking Trajectory Prediction Evaluation and The Effectiveness of Implicit Maximum Likelihood Estimation | Abduallah A Mohamed (Meta)*; Deyao Zhu (King Abdullah University of Science and Technology); Warren Vu (The University of Texas at Austin); Mohamed Elhoseiny (KAUST); Christian Claudel (The University of Texas at Austin) | N/A | N/A |
| A Generalized & Robust Framework For Timestamp Supervision in Temporal Action Segmentation | Rahul Rahaman (National University of Singapore)*; Dipika Singhania (National University of Singapore); Alex Thiery (National University of Singapore); Angela Yao (National University of Singapore) | N/A | N/A |
| A Deep Moving-camera Background Model | Guy Erez (Ben-Gurion University)*; Ron A Shapira Weber (Ben-Gurion University); Oren Freifeld (Ben-Gurion University) | N/A | N/A |
| DLME: Deep Local-flatness Manifold Embedding | Zelin Zang (Zhejiang University & Westlake University)*; Siyuan Li (Westlake University); Di Wu (Westlake University); Ge Wang (Westlake University); Kai Wang (National University of Singapore); Lei Shang (Alibaba Group); Baigui Sun (Alibaba Group); Hao Li (Alibaba Group); Stan Z. Li (Westlake University) | N/A | N/A |
| Neural Video Compression using GANs for Detail Synthesis and Propagation | Fabian Mentzer (Google)*; Eirikur Agustsson (Google); Johannes Ballé (Google); David Minnen (Google Inc.); Nick Johnston (Google); George Toderici (Google Research) | N/A | N/A |
| Few-shot Action Recognition with Hierarchical Matching and Contrastive Learning | Sipeng Zheng (Renmin University of China)*; Shizhe Chen (INRIA); Qin Jin (Renmin University of China) | N/A | N/A |
| Perspective Flow Aggregation for Data-Limited 6D Object Pose Estimation | Yinlin Hu (EPFL)*; Pascal Fua (EPFL, Switzerland); Mathieu Salzmann (EPFL) | N/A | N/A |
| TALISMAN: Targeted Active Learning for Object Detection with Rare Classes and Slices using Submodular Mutual Information | Suraj Kothawade (UT Dallas)*; Saikat Ghosh (University of Texas at Dallas); Sumit Shekhar (Adobe Research); Yu Xiang (The University of Texas at Dallas); Rishabh Iyer (University of Texas at Dallas) | N/A | N/A |
| New Datasets and Models for Contextual Reasoning in Visual Dialog | Yifeng Zhang (University of Minnesota, Twin Cities); Ming Jiang (University of Minnesota); Qi Zhao (University of Minnesota)* | N/A | N/A |
| Remote Respiration Monitoring of Moving Person Using Radio Signals | Jae-Ho Choi (Pohang University of Science and Technology)*; KIBONG KANG (POSTECH); Kyung-Tae Kim (Pohang University of Science and Technology) | N/A | N/A |
| AdvDO: Realistic Adversarial Attacks for Trajectory Prediction | Yulong Cao (University of Michigan, Ann Arbor)*; Chaowei Xiao (NVIDIA); Anima Anandkumar (NVIDIA/Caltech); Danfei Xu (Stanford University); Marco Pavone (Stanford University) | N/A | N/A |
| Cross-Modality Transformer for Visible-Infrared Person Re-Identification | Kongzhu Jiang (University of Science and Technology of China)*; Tianzhu Zhang (University of Science and Technology of China); Xiang Liu (Dongguan University of Technology); Bingqiao Qian (University of Science and Technology of China); Yongdong Zhang (University of Science and Technology of China); Feng Wu (University of Science and Technology of China) | N/A | N/A |
| VL-LTR: Learning Class-wise Visual-Linguistic Representation for Long-Tailed Visual Recognition | Changyao Tian (Chinese University of Hong Kong); Wenhai Wang (Nanjing University); Xizhou Zhu (SenseTime); Jifeng Dai (SenseTime)*; Yu Qiao (Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences) | N/A | N/A |
| Self-Supervised Classification Network | Elad Amrani (IBM / Technion)*; Leonid Karlinsky (IBM-Research); Alex Bronstein (Technion) | N/A | N/A |
| DevNet: Self-supervised Monocular Depth Learning via Density Volume Construction | Kaichen Zhou (University of Oxford)*; Lanqing Hong (Huawei Noah’s Ark Lab); Changhao Chen (National University of Defense Technology); Hang Xu (Huawei Noah’s Ark Lab); Chaoqiang Ye (Huawei); Qingyong Hu (University of Oxford); Zhenguo Li (Huawei Noah’s Ark Lab) | N/A | N/A |
| Bayesian Optimization with Clustering and Rollback for CNN Auto Pruning | Hanwei FAN (HKUST)*; Jiandong MU (HKUST); Wei Zhang (Hong Kong University of Science and Technology) | N/A | N/A |
| Towards Real-World HDRTV Reconstruction: A Data Synthesis-based Approach | Zhen Cheng (University of Science and Technology of China)*; Tao Wang (Huawei Noah’s Ark Lab); Yong Li (Huawei Noah’s Ark Lab); Fenglong Song (Huawei Noah’s Ark Lab); Chang Chen (Huawei Noah’s Ark Lab); Zhiwei Xiong (University of Science and Technology of China) | N/A | N/A |
| Quantum Motion Segmentation | Federica Arrigoni (University of Trento)*; Willi Menapace (University of Trento); Marcel Seelbach Benkner (University of Siegen); Elisa Ricci (University of Trento); Vladislav Golyanik (MPI for Informatics) | N/A | N/A |
| Open-world Semantic Segmentation via Contrasting and Clustering Vision-language Embedding | Quande Liu (The Chinese University of Hong Kong)*; Youpeng Wen (Dalian University of Technology); Jianhua Han (Huawei Noah’s Ark Lab); Chunjing Xu (Huawei Noah’s Ark Lab); Hang Xu (Huawei Noah’s Ark Lab); Xiaodan Liang (Sun Yat-sen University) | N/A | N/A |
| Custom Structure Preservation in Face Aging | Guillermo Gomez-Trenado (University of Granada)*; Stéphane Lathuilière (Telecom-Paris); Pablo Mesejo (University of Granada); Oscar Cordón García (University of Granada) | N/A | N/A |
| DANBO: Disentangled Articulated Neural Body Representations via Graph Neural Networks | Shih-Yang Su (University of British Columbia)*; Timur Bagautdinov (Facebook); Helge Rhodin (UBC) | N/A | N/A |
| Class Is Invariant to Context and Vice Versa: On Learning Invariance for Out-Of-Distribution Generalization | Jiaxin Qi (Nanyang Technological University)*; Kaihua Tang (Nanyang Technological University); Qianru Sun (Singapore Management University); Xian-Sheng Hua (Damo Academy, Alibaba Group); Hanwang Zhang (Nanyang Technological University) | N/A | N/A |
| Spatio-Temporal Deformable Attention Network for Video Deblurring | Huicong Zhang (Harbin Institute of Technology)*; Haozhe Xie (Tencent AI Lab); Hongxun Yao (Harbin Institute of Technology) | N/A | N/A |
| CHORE: Contact, Human and Object REconstruction from a single RGB image | Xianghui Xie (Saarland University)*; Bharat Lal Bhatnagar (University of Tübingen, MPI informatik); Gerard Pons-Moll (University of Tübingen) | N/A | N/A |
| Complementing Brightness Constancy with Deep Networks for Optical Flow Prediction | Vincent LE GUEN (EDF R&D, CNAM)*; Clément Rambour (Cnam); Nicolas Thome (CNAM, Paris) | N/A | N/A |
| Learning Discriminative Shrinkage Deep Networks for Image Deconvolution | Pin-Hung Kuo (National Taiwan University)*; Jinshan Pan (Nanjing University of Science and Technology); Shao-Yi Chien (National Taiwan University); Ming-Hsuan Yang (University of California at Merced) | N/A | N/A |
| Camera Pose Estimation and Localization with Active Audio Sensing | Karren D Yang (MIT); Michael Firman (Niantic); Eric Brachmann (Niantic)*; Clement LJC Godard (Niantic) | N/A | N/A |
| Learning Efficient Multi-Agent Cooperative Visual Exploration | Chao Yu (Tsinghua University); Xinyi Yang (Tsinghua University)*; Jiaxuan Gao (Tsinghua University); Huazhong Yang (Tsinghua University); Yu Wang (Tsinghua University); Yi Wu (Tsinghua University) | N/A | N/A |
| 4DContrast: Contrastive Learning with Dynamic Correspondences for 3D Scene Understanding | Yujin Chen (Technical University of Munich)*; Matthias Niessner (Technical University of Munich); Angela Dai (Technical University of Munich) | N/A | N/A |
| Learned Vertex Descent: A New Direction for 3D Human Model Fitting | Enric Corona (IRI)*; Gerard Pons-Moll (University of Tübingen); Guillem Alenyà (IRI); Francesc Moreno (IRI) | N/A | N/A |
| Hierarchical Semi-Supervised Contrastive Learning for Contamination-Resistant Anomaly Detection | Gaoang Wang (Zhejiang University); Yibing Zhan (JD Explore Academy); Xinchao Wang (National University of Singapore); Mingli Song (Zhejiang University)*; Klara Nahrstedt (University of Illinois at Urbana-Champaign) | N/A | N/A |
| Learning to Fit Morphable Models | Vasileios Choutas (ETH Zurich)*; Federica Bogo (Meta); Jingjing Shen (Microsoft); Julien Valentin (Microsoft) | N/A | N/A |
| Few-Shot Classification with Contrastive Learning | Zhanyuan Yang (Shenzhen University); Jinghua Wang (Harbin Institute of Technology); Yingying Zhu (Shenzhen University)* | N/A | N/A |
| ARM: Any-Time Super-Resolution Method | Bohong Chen (Xiamen University)*; Mingbao Lin (Xiamen University, China); Kekai Sheng (Youtu Lab, Tencent Inc.); Mengdan Zhang (Youtu, Tencent); Peixian Chen (Youtu Tencent); Ke Li (Tencent); Liujuan Cao (Xiamen University); Rongrong Ji (Xiamen University, China) | N/A | N/A |
| Tracking Every Thing in the Wild | Siyuan Li (ETH Zurich)*; Martin Danelljan (ETH Zurich); Henghui Ding (ETH Zurich); Thomas E Huang (ETH Zürich); Fisher Yu (ETH Zurich) | N/A | N/A |
| Learning Self-prior for Mesh Denoising using Dual Graph Convolutional Networks | Shota Hattori (The University of Tokyo)*; Tatsuya Yatagawa (The University of Tokyo); Yutaka Ohtake (The University of Tokyo); Suzuki Hiromasa (The University of Tokyo) | N/A | N/A |
| Few Zero Level Set-Shot Learning of Shape Signed Distance Functions in Feature Space | Amine Ouasfi (IMT Atlantique); Adnane Boukhayma (Inria)* | N/A | N/A |
| Attention-aware Learning for Hyperparameters Prediction in Image Processing Pipelines | Haina Qin (University of Chinese Academy of Sciences); Longfei Han (Beijing Technology and Business University); Juan Wang (Institute of Automation, Chinese Academy of Sciences); Congxuan Zhang (Nanchang Hangkong University); Bing Li (National Laboratory of Pattern Recognition (NLPR), Institute of Automation, Chinese Academy of Sciences)*; Weiming Hu (Institute of Automation, Chinese Academy of Sciences); Yanwei Li (Zeku Technology (Shanghai) Corp., Ltd.) | N/A | N/A |
| Attaining Class-level Forgetting in Pretrained Model using Few Samples | Pravendra Singh (IIT Roorkee); Pratik Mazumder (Indian Institute of Technology Jodhpur)*; Mohammed Asad Karim (Carnegie Mellon University) | N/A | N/A |
| Data Invariants to Understand Unsupervised Out-of-Distribution Detection | Lars Doorenbos (University of Bern)*; Raphael Sznitman (University of Bern); Pablo Márquez Neila (University of Bern) | N/A | N/A |
| STEEX: Steering Counterfactual Explanations with Semantics | Paul Jacob (École Polytechnique); Eloi Zablocki (Valeo.ai)*; Hedi Ben-younes (Valeo.ai); Mickael Chen (Valeo.ai); Patrick Pérez (Valeo.ai); Matthieu Cord (Sorbonne University) | N/A | N/A |
| Outpainting by Queries | Kai Yao (Xi’an Jiaotong-Liverpool University); Penglei Gao (Xi’an Jiaotong-Liverpool University); Xi Yang (Xi’an Jiaotong-Liverpool University); Jie Sun (Xi’an Jiaotong-Liverpool University); Rui Zhang (Xi’an Jiaotong-Liverpool University); Kaizhu Huang (Duke Kunshan University)* | N/A | N/A |
| HULC: 3D HUman Motion Capture with Pose Manifold SampLing and Dense Contact Guidance | Soshi Shimada (MPI for Informatics)*; Vladislav Golyanik (MPI for Informatics); Zhi Li (Max Planck Institute for Informatics); Patrick Pérez (Valeo.ai); Weipeng Xu (Reality Labs Research); Christian Theobalt (MPI Informatik) | N/A | N/A |
| Interpretable Open-Set Domain Adaptation via Angular Margin Separation | Xinhao Li (University of Electronic Science and Technology of China); Jingjing Li (University of Electronic Science and Technology of China)*; Zhekai Du (University of Electronic Science and Technology of China); Lei Zhu (Shandong Normal University); Wen Li (University of Electronic Science and Technology of China) | N/A | N/A |
| EgoBody: Human Body Shape and Motion of Interacting People from Head-Mounted Devices | Siwei Zhang (ETH Zurich)*; Qianli Ma (Max Planck Institute for Intelligent Systems); Yan Zhang (ETH Zurich); Zhiyin Qian (ETH Zürich); Taein Kwon (ETH Zurich); Marc Pollefeys (ETH Zurich / Microsoft); Federica Bogo (Meta); Siyu Tang (ETH Zurich) | N/A | N/A |
| ViTAS: Vision Transformer Architecture Search | Xiu Su (University of Sydney); Shan You (SenseTime)*; Jiyang Xie (Huawei Noah’s Ark Lab); Mingkai Zheng (The University of Sydney); Fei Wang (University of Science and Technology of China); Chen Qian (SenseTime); Changshui Zhang (Tsinghua University); Xiaogang Wang (Chinese University of Hong Kong, Hong Kong); Chang Xu (University of Sydney) | N/A | N/A |
| LaLaLoc++: Global Floor Plan Comprehension for Layout Localisation in Unvisited Environments | Henry Howard-Jenkins (University of Oxford)*; Victor Adrian Prisacariu (University of Oxford) | N/A | N/A |
| diffConv: Analyzing Irregular Point Clouds with an Irregular View | Manxi Lin (Technical University of Denmark)*; Aasa Feragen (Technical University of Denmark) | N/A | N/A |
| ReAct: Temporal Action Detection with Relational Action Queries | Dingfeng Shi (Beihang University)*; Yujie Zhong (University of Oxford); Qiong Cao (JD.com); Jing Zhang (The University of Sydney); Lin Ma (Meituan); Jia Li (Beihang University); Dacheng Tao (JD.com) | N/A | N/A |
| StyleBabel: Artistic Style Tagging and Captioning | Dan Ruta (University of Surrey)*; Andrew Gilbert (University of Surrey); Pranav V Aggarwal (Adobe Inc.); Naveen Marri (Adobe Inc); Ajinkya Kale (Adobe); Jo Briggs (University of Northumbria); Chris Speed (University of Edinburgh); Hailin Jin (Adobe Research); Baldo Faieta (Adobe); Alex Filipkowski (Adobe); Zhe Lin (Adobe Research); John Collomosse (Adobe Research) | N/A | N/A |
| TACS: Taxonomy Adaptive Cross-Domain Semantic Segmentation | RUI GONG (ETH Zurich)*; Martin Danelljan (ETH Zurich); Dengxin Dai (ETH Zurich); Danda Pani Paudel (ETH Zürich); Ajad Chhatkuli (ETH Zurich); Fisher Yu (ETH Zurich); Luc Van Gool (ETH Zurich) | N/A | N/A |
| Domain Invariant Autoencoders for Self-supervised Learning from Multi-domains | Haiyang Yang (Nanjing University)*; Shixiang Tang (The University of Sydney); Meilin Chen (Zhejiang University); Yizhou Wang (Zhejiang University); Feng Zhu (University of Science and Technology of China); Lei Bai (Shanghai AI Laboratory); Rui Zhao (SenseTime Group Limited); Wanli Ouyang (The University of Sydney) | N/A | N/A |
| Learned Variational Video Color Propagation | Markus Hofinger (Graz University of Technology)*; Erich Kobler (University Hospital Bonn); Alexander Effland (University of Bonn); Thomas Pock (Graz University of Technology) | N/A | N/A |
| PD-Flow: A Point Cloud Denoising Framework with Normalizing Flows | Aihua Mao (South China University of Technology)*; Zihui Du (South China University of Technology); Yu-Hui Wen (Tsinghua University); Jun Xuan (South China University of Technology); Yong-Jin Liu (Tsinghua University) | N/A | N/A |
| Prototypical Contrast Adaptation for Domain Adaptive Semantic Segmentation | ZhengKai Jiang (Tencent Youtu Lab)*; Yuxi Li (Tencent); Ceyuan Yang (Chinese University of Hong Kong); Peng Gao (Chinese University of Hong Kong); Yabiao Wang (Tencent); Ying Tai (Tencent YouTu); Chengjie Wang (Tencent; Shanghai Jiao Tong University) | N/A | N/A |
| Adversarial Contrastive Learning via Asymmetric InfoNCE | Qiying Yu (Tsinghua University)*; Jieming Lou (Harbin Institute of Technology); Xianyuan Zhan (Tsinghua University); Qizhang Li (Harbin Institute of Technology); Wangmeng Zuo (Harbin Institute of Technology, China); Yang Liu (Tsinghua University); Jingjing Liu (Tsinghua University) | N/A | N/A |
| NeRF for Outdoor Scene Relighting | Viktor Rudnev (Max Planck Institute for Informatics)*; Mohamed Elgharib (Max Planck Institute for Informatics); William Smith (University of York); Lingjie Liu (Max Planck Institute for Informatics); Vladislav Golyanik (MPI for Informatics); Christian Theobalt (MPI Informatik) | N/A | N/A |
| FusionVAE: A Deep Hierarchical Variational Autoencoder for RGB Image Fusion | Fabian Duffhauss (Bosch Center for Artificial Intelligence)*; Vien Anh Ngo (Bosch Center for Artificial Intelligence); Hanna Ziesche (Bosch Center for AI); Gerhard Neumann (Karlsruhe Institute of Technology (KIT), Karlsruhe, Germany) | N/A | N/A |
| Self-calibrating Photometric Stereo by Neural Inverse Rendering | Junxuan Li (Australian National University)*; HONGDONG LI (Australian National University, Australia) | N/A | N/A |
| Time-rEversed diffusioN tEnsor Transformer: A new TENET of Few-Shot Object Detection | Shan Zhang (Australian National University); Naila Murray (Naver Labs); Lei Wang (University of Wollongong, Australia); Piotr Koniusz (ANU College of Engineering and Computer Science)* | N/A | N/A |
| Detecting Generated Images by Real Images | Bo Liu (Chongqing University of Posts and Telecommunications); Fan Yang (Chongqing University of Posts and Telecommunications); Xiuli Bi (Chongqing University of Posts and Telecommunications); Bin Xiao (Chongqing University of Posts and Telecommunications)*; Weisheng Li (Chongqing University of Posts and Telecommunications); Xinbo Gao (Chongqing University of Posts and Telecommunications) | N/A | N/A |
| VisageSynTalk: Unseen Speaker Video-to-Speech Synthesis via Speech-Visage Feature Selection | Joanna Hong (KAIST)*; Minsu Kim (KAIST); Yong Man Ro (KAIST) | N/A | N/A |
| Delta Distillation for Efficient Video Processing | Amirhossein Habibian (Qualcomm AI Research)*; Haitam Ben Yahia (Qualcomm AI Research); Davide Abati (Qualcomm AI Research); Efstratios Gavves (University of Amsterdam); Fatih Porikli (Qualcomm AI Research) | N/A | N/A |
| PANDORA: A Panoramic Detection Dataset for Object with Orientation | Hang Xu (Hangzhou Dianzi University; The Institute of Computing Technology of the Chinese Academy of Sciences); Qiang Zhao (The Institute of Computing Technology of the Chinese Academy of Sciences); Yike Ma (Institute of Computing Technology, Chinese Academy of Sciences); Xiaodong Li (Huawei Noah’s Ark Lab); Peng Yuan (Huawei Noah’s Ark Lab); Bailan Feng (Huawei Noah’s Ark Lab); Chenggang Yan (Hangzhou Dianzi University); Feng Dai (Institute of Computing Technology, Chinese Academy of Sciences)* | N/A | N/A |
| Instance As Identity: A Generic Online Paradigm for Video Instance Segmentation | Feng Zhu (University of Technology Sydney)*; Zongxin Yang (Zhejiang University); Xin Yu (University of Technology Sydney); Yi Yang (Zhejiang University); Yunchao Wei (UTS) | N/A | N/A |
| Audio-Visual Mismatch-Aware Video Retrieval via Association and Adjustment | Sangmin Lee (KAIST)*; Sungjune Park (KAIST); Yong Man Ro (KAIST) | N/A | N/A |
| 3D Clothed Human Reconstruction in the Wild | Gyeongsik Moon (Seoul National University); Hyeongjin Nam (Seoul National University); Takaaki Shiratori (Meta Reality Labs Research); Kyoung Mu Lee (Seoul National University)* | N/A | N/A |
| Classification-Regression for Chart Comprehension | Matan Levy (The Hebrew University of Jerusalem)*; Rami Ben-Ari (OriginAI); Dani Lischinski (The Hebrew University of Jerusalem) | N/A | N/A |
| Zero-Shot Category-Level Object Pose Estimation | Walter Goodwin (University of Oxford)*; Sagar Vaze (Visual Geometry Group, University of Oxford); Ioannis Havoutis (Oxford Robotics Institute, University of Oxford); Ingmar Posner (Oxford University) | N/A | N/A |
| AssistQ: Affordance-centric Question-driven Task Completion for Egocentric Assistant | Benita Wong (National University of Singapore)*; Joya Chen (National University of Singapore); You Wu (Harvard University); Stan Weixian Lei (National University of Singapore); Dongxing Mao (National University of Singapore); Difei Gao (NUS); Mike Zheng Shou (National University of Singapore) | N/A | N/A |
| Laplace Mesh Transformer: Dual Attention and Topology Aware Network for 3D mesh Classification and Segmentation | Xiao-Juan Li (Institute of Computing Technology, Chinese Academy of Sciences); Jie Yang (Institute of Computing Technology, Chinese Academy of Sciences)*; Fang-Lue Zhang (Victoria University of Wellington) | N/A | N/A |
| CoMER: Modeling Coverage for Transformer-based Handwritten Mathematical Expression Recognition | Wenqi Zhao (Peking University)*; Liangcai Gao (Peking University) | N/A | N/A |
| RBC: Rectifying the Biased Context in Continual Semantic Segmentation | Hanbin Zhao (Zhejiang University)*; Fengyu Yang (University of Michigan); Xinghe Fu (Zhejiang University); Xi Li (Zhejiang University) | N/A | N/A |
| Don’t Forget Me: Accurate Background Recovery for Text Removal via Modeling Local-Global Context | Chongyu Liu (South China University of Technology); Lianwen Jin (South China University of Technology)*; Yuliang Liu (Huazhong University of Science and Technology); Canjie Luo (South China University of Technology); Bangdong Chen (South China University of Technology); Fengjun Guo (IntSig Information Co. Ltd); Kai Ding (IntSig Information Co., Ltd) | N/A | N/A |
| Semi-Supervised Keypoint Detector and Descriptor for Retinal Image Matching | Jiazhen Liu (Renmin University of China); Xirong Li (Renmin University of China)*; Qijie Wei (Vistel Inc.); Jie Xu (Beijing Tongren Hospital); Dayong Ding (Vistel Inc.) | N/A | N/A |
| Memory-Augmented Model-Driven Network for Pansharpening | Keyu Yan (Hefei Institutes of Physical Science, Chinese Academy of Sciences)*; Man Zhou (Chinese Academy of Sciences); Li Zhang (Chinese Academy of Sciences); Chengjun Xie (Institute of Intelligent Machines, Chinese Academy of Sciences, China) | N/A | N/A |
| Factorizing Knowledge in Neural Networks | Xingyi Yang (National University of Singapore)*; Jingwen Ye (National University of Singapore); Xinchao Wang (National University of Singapore) | N/A | N/A |
| Unleashing Transformers: Parallel Token Prediction with Discrete Absorbing Diffusion for Fast High-Resolution Image Generation from Vector-Quantized Codes | Sam Bond-Taylor (Durham University)*; Peter Hessey (Durham University); Hiroshi Sasaki (Durham University); Toby P Breckon (Durham University); Chris G. Willcocks (Durham University) | N/A | N/A |
| Contrastive Vicinal Space for Unsupervised Domain Adaptation | Jaemin Na (Ajou University)*; Dongyoon Han (NAVER AI Lab); Hyung Jin Chang (University of Birmingham); Wonjun Hwang (Ajou University) | N/A | N/A |
| Weight Fixing Networks | Chris Subia-Waud (University of Southampton)*; Srinandan Dasmahapatra (University of Southampton) | N/A | N/A |
| Sim-to-Real 6D Object Pose Estimation via Iterative Self-training for Robotic Bin Picking | Kai Chen (The Chinese University of Hong Kong); Rui Cao (The Chinese University of Hong Kong); Stephen L James (UC Berkeley); YICHUAN LI (CUHK); Yunhui Liu (CUHK); Pieter Abbeel (UC Berkeley); Qi Dou (The Chinese University of Hong Kong)* | N/A | N/A |
| ChunkyGAN: Real Image Inversion via Segments | Adéla Šubrtová (Czech Technical University); David Futschik (Czech Technical University in Prague, FEE); Jan Čech (Czech Technical University in Prague); Michal Lukáč (Adobe Research); Eli Shechtman (Adobe Research, US); Daniel Sýkora (Czech Technical University in Prague)* | N/A | N/A |
| Towards Sequence-Level Training for Visual Tracking | Minji Kim (Seoul National University)*; Seungkwan Lee (POSTECH); Jungseul Ok (POSTECH); Bohyung Han (Seoul National University); Minsu Cho (POSTECH) | N/A | N/A |
| Scale-aware Spatio-temporal Relation Learning for Video Anomaly Detection | Guoqiu Li (Tsinghua Shenzhen International Graduate School, Tsinghua University)*; Guanxiong Cai (Shenzhen SenseTime Technology Co., Ltd); Xingyu ZENG (SenseTime Group Limited); Rui Zhao (SenseTime Group Limited) | N/A | N/A |
| Tracking by Associating Clips | Sanghyun Woo (KAIST)*; Kwanyong Park (KAIST); Seoung Wug Oh (Adobe Research); In So Kweon (KAIST); Joon-Young Lee (Adobe Research) | N/A | N/A |
| An Information Theoretic Approach for Attention-Driven Face Forgery Detection | Ke Sun (Xiamen University)*; Hong Liu (National Institute of Informatics); Taiping Yao (Tencent YouTu); Xiaoshuai Sun (Xiamen University); Shen Chen (Tencent YouTu Lab); Shouhong Ding (Tencent); Rongrong Ji (Xiamen University, China) | N/A | N/A |
| Compound Prototype Matching for Few-shot Action Recognition | Yifei Huang (The University of Tokyo)*; Lijin Yang (The University of Tokyo); Yoichi Sato (University of Tokyo) | N/A | N/A |
| Self-Promoted Supervision for Few-Shot Transformer | Bowen Dong (Harbin Institute of Technology); Pan Zhou (NUS); Shuicheng Yan (National University of Singapore, Department of Electrical and Computer Engineering); Wangmeng Zuo (Harbin Institute of Technology, China)* | N/A | N/A |
| Completely Self-Supervised Crowd Counting via Distribution Matching | Deepak Babu Sam (Indian Institute of Science)*; Abhinav Agarwalla (Carnegie Mellon University); Jimmy Joseph (Stony Brook University); Vishwanath Sindagi (Johns Hopkins University); Venkatesh Babu RADHAKRISHNAN (Indian Institute of Science); Vishal Patel (Johns Hopkins University) | N/A | N/A |
| Geodesic-Former: a Geodesic-Guided Few-shot 3D Point Cloud Instance Segmenter | Tuan Duc Ngo (VinAI Research)*; Khoi Nguyen (VinAI Research) | N/A | N/A |
| SeedFormer: Patch Seeds based Point Cloud Completion with Upsample Transformer | Haoran Zhou (Nanjing University)*; Yun Cao (Tencent); Wenqing Chu (Tencent); Junwei Zhu (Tencent); Tong Lu (Nanjing University); Ying Tai (Tencent YouTu); Chengjie Wang (Tencent; Shanghai Jiao Tong University) | N/A | N/A |
| 3D-PL: Domain Adaptive Depth Estimation with 3D-aware Pseudo-Labeling | Yu-Ting Yen (National Chiao Tung University, Phiar Technologies)*; Chia-Ni Lu (National Chiao Tung University); Wei-Chen Chiu (National Chiao Tung University); Yi-Hsuan Tsai (Phiar Technologies) | N/A | N/A |
| Towards Accurate Active Camera Localization | Qihang Fang (Shandong University); Yingda Yin (Peking University); Qingnan Fan (Tencent AI Lab)*; Fei Xia (Google Inc); Siyan Dong (Shandong University); Sheng Wang (3vjia); Jue Wang (Tencent AI Lab); Leonidas Guibas (Stanford University); Baoquan Chen (Peking University) | N/A | N/A |
| Few-shot Object Counting and Detection | Thanh Van Nguyen (VinAI Research)*; Chau Hai Pham (VinAI Research); Khoi Nguyen (VinAI Research); Minh Hoai (Stony Brook University) | N/A | N/A |
| RealPatch: A Statistical Matching Framework for Model Patching with Real Samples | Sara Romiti (University of Sussex)*; Christopher Inskip (University of Sussex); Viktoriia Sharmanska (University of Sussex and Imperial College London); Novi Quadrianto (University of Sussex and Basque Center for Applied Mathematics) | N/A | N/A |
| GAN Cocktail: mixing GANs without dataset access | Omri Avrahami (The Hebrew University of Jerusalem)*; Dani Lischinski (The Hebrew University of Jerusalem); Ohad Fried (IDC Herzliya) | N/A | N/A |
| Coarse-To-Fine Incremental Few-Shot Learning | Xiang Xiang (Huazhong University of Science and Technology)*; Yuwen Tan (Huazhong University of Science and Technology); Qian Wan (Wuhan Research Institute of Posts and Telecommunications); Jing Ma (Huazhong University of Science and Technology); Alan Yuille (Johns Hopkins University); Gregory D. Hager (The Johns Hopkins University) | N/A | N/A |
| Learning Unbiased Transferability for Domain Adaptation by Uncertainty Modeling | Jian Hu (Queen Mary University of London)*; Haowen Zhong (Zhejiang Lab); Fei Yang (Zhejiang Lab); Shaogang Gong (Queen Mary University of London); Guile Wu (Queen Mary University of London); Junchi Yan (Shanghai Jiao Tong University) | N/A | N/A |
| Camera Pose Auto-Encoders for Improving Pose Regression | Yoli Shavit (Faculty of Engineering, Bar Ilan University); Yosi Keller (Bar Ilan University)* | N/A | N/A |
| CoGS: Controllable Generation and Search from Sketch and Style | Cusuh Ham (Georgia Institute of Technology)*; Gemma Canet Tarrés (CVSSP, University of Surrey); Tu Bui (University of Surrey); James Hays (Georgia Institute of Technology, USA); Zhe Lin (Adobe Research); John Collomosse (Adobe Research) | N/A | N/A |
| Active Audio-Visual Separation of Dynamic Sound Sources | Sagnik Majumder (University of Texas at Austin)*; Kristen Grauman (Facebook AI Research & UT Austin) | N/A | N/A |
| AU-aware 3D Face Reconstruction through Personalized AU-specific Blendshape Learning | Chenyi Kuang (Rensselaer Polytechnic Institute)*; Zijun Cui (Rensselaer Polytechnic Institute); Jeffrey Kephart (IBM Research, USA); Qiang Ji (Rensselaer Polytechnic Institute) | N/A | N/A |
| Directed Ray Distance Functions for 3D Scene Reconstruction | Nilesh Kulkarni (University of Michigan)*; Justin Johnson (University of Michigan); David Fouhey (University of Michigan) | N/A | N/A |
| Background-Insensitive Scene Text Recognition with Text Semantic Segmentation | Liang Zhao (University of South Carolina)*; Zhenyao Wu (University of South Carolina); Xinyi Wu (University of South Carolina); Greg Wilsbacher (University of South Carolina); Song Wang (University of South Carolina) | N/A | N/A |
| Geometry-Guided Progressive NeRF for Generalizable and Efficient Neural Human Rendering | Mingfei Chen (University of Washington)*; Jianfeng Zhang (NUS); Xiangyu Xu (Sea AI Lab); Lijuan Liu (SEA AI LAB); Yujun Cai (Nanyang Technological University); Jiashi Feng (ByteDance); Shuicheng Yan (Sea AI Labs) | N/A | N/A |
| MorphMLP: An Efficient MLP-Like Backbone for Spatial-Temporal Representation Learning | David Junhao Zhang (National University of Singapore)*; Kunchang Li (Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences); Yali Wang (Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences); Yunpeng Chen (National University of Singapore); Shashwat Chandra (National University of Singapore); Yu Qiao (Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences); Luoqi Liu (meitu); Mike Zheng Shou (National University of Singapore) | N/A | N/A |
| Continual Variational Autoencoder Learning via Online Cooperative Memorization | Fei Ye (University of York)*; Adrian Bors (University of York) | N/A | N/A |
| Semantic Novelty Detection via Relational Reasoning | Francesco Cappio Borlino (Politecnico di Torino); Silvia Bucci (Italian Institute of Technology)*; Tatiana Tommasi (Politecnico di Torino) | N/A | N/A |
| FindIt: Generalized Localization with Natural Language Queries | Weicheng Kuo (Google)*; Fred Bertsch (Google); Wei Li (GOOGLE INC); AJ Piergiovanni (Google); Mohammad Saffar (Google); Anelia Angelova (Google) | N/A | N/A |
| SelectionConv: Convolutional Neural Networks for Non-rectilinear Image Data | David M Hart (Brigham Young University)*; Michael Whitney (Brigham Young University); Bryan S Morse (Brigham Young University) | N/A | N/A |
| HairNet: Hairstyle Transfer with Pose Changes | Peihao Zhu (KAUST)*; Rameen Abdal (KAUST); JOHN C FEMIANI (Miami University); Peter Wonka (KAUST) | N/A | N/A |
| Learn2Augment: Learning to Composite Videos for Data Augmentation in Action Recognition | Shreyank N Gowda (University of Edinburgh)*; Marcus Rohrbach (Facebook AI Research); Frank Keller (University of Edinburgh); Laura Sevilla-Lara (Facebook) | N/A | N/A |
| Action-based Contrastive Learning for Trajectory Prediction | Marah Halawa (Technische Universität Berlin)*; Olaf Hellwich (Technical University Berlin); Pia Bideau (TU Berlin) | N/A | N/A |
| Scaling Open-vocabulary Image Segmentation with Image-level Labels | Golnaz Ghiasi (Google Brain)*; Xiuye Gu (Google); Yin Cui (Google); Tsung-Yi Lin (Nvidia Research) | N/A | N/A |
| Improving Closed and Open-Vocabulary Attribute Prediction using Transformers | Khoi Pham (University of Maryland, College Park)*; Kushal Kafle (Adobe Research); Zhe Lin (Adobe Research); Zhihong Ding (Adobe Research); Scott Cohen (Adobe Research); Quan Hung Tran (Adobe Research); Abhinav Shrivastava (University of Maryland) | N/A | N/A |
| FS-COCO: Towards Understanding of Freehand Sketches of Common Objects in Context | Pinaki Nath Chowdhury (University of Surrey)*; Aneeshan Sain (University of Surrey); Ayan Kumar Bhunia (University of Surrey); Tao Xiang (University of Surrey); Yulia Gryaditskaya (University of Surrey); Yi-Zhe Song (University of Surrey) | N/A | N/A |
| A Contrastive Objective for Learning Disentangled Representations | Jonathan Kahana (Hebrew University of Jerusalem)*; Yedid Hoshen (The Hebrew University of Jerusalem) | N/A | N/A |
| Unbiased Multi-Modality Guidance for Image Inpainting | Yongsheng YU (University of Chinese Academy of Sciences); Dawei Du (Kitware, Inc.)*; Libo Zhang (Institute of Software Chinese Academy of Sciences); Tiejian Luo (University of Chinese Academy of Sciences) | N/A | N/A |
| Learned Monocular Depth Priors in Visual-Inertial Initialization | Yunwen Zhou (Google)*; Abhishek Kar (Google); Eric L Turner (GOOGLE LLC); Adarsh Kowdle (Google); Chao Guo (Google Inc.); Ryan DuToit (Google); Konstantine Tsotsos (Google) | N/A | N/A |
| DexMV: Imitation Learning for Dexterous Manipulation from Human Videos | Yuzhe Qin (University of California San Diego)*; Yueh-Hua Wu (UCSD); Shaowei Liu (UIUC); Hanwen Jiang (UT Austin); Ruihan Yang (UC San Diego); Yang Fu (UCSD); Xiaolong Wang (UCSD) | N/A | N/A |
| Exploring Fine-grained Audiovisual Categorization with the SSW60 Dataset | Grant Van Horn (Cornell University)*; Rui Qian (Cornell University); Kimberly Wilber (Google); Hartwig Adam (Google); Oisin Mac Aodha (University of Edinburgh); Serge Belongie (University of Copenhagen) | N/A | N/A |
| Radatron: Accurate Detection Using Multi-Resolution Cascaded MIMO Radar | Sohrab Madani (UIUC)*; Junfeng Guan (UIUC); Waleed Ahmed (UIUC); Saurabh Gupta (UIUC); Haitham Hassanieh (UIUC) | N/A | N/A |
| COMPOSER: Compositional Reasoning of Group Activity in Videos with Keypoint-Only Modality | Honglu Zhou (Rutgers University)*; Asim Kadav (NEC Labs); Aviv Shamsian (Bar Ilan University); Shijie Geng (Rutgers University); Farley Lai (NEC Laboratories America, Inc.); Long Zhao (Google Research); Ting Liu (Google Research); Mubbasir Kapadia (Rutgers University); Hans Peter Graf (NEC Labs) | N/A | N/A |
| The Fish Counting Dataset: A Benchmark for Multiple Object Tracking and Counting | Justin Kay (Caltech, Ai.Fish); Peter Kulits (Caltech); Suzanne C Stathatos (Caltech); Siqi Deng (Amazon); Erik Young (Trout Unlimited); Sara M Beery (Caltech); Grant Van Horn (Cornell University)*; Pietro Perona (California Institute of Technology) | N/A | N/A |
| Object Level Depth Reconstruction for Category Level 6D Object Pose Estimation From Monocular RGB Image | Zhaoxin Fan (Renmin University of China)*; Zhenbo Song (Nanjing University of Science and Technology); Jian Xu (Nreal); Zhicheng Wang (Nreal); Kejian Wu (Nreal); Hongyan Liu (Tsinghua University); Jun He (Renmin University of China) | N/A | N/A |
| DeepMend: Learning Occupancy Functions to Represent Shape for Repair | Nikolas Lamb (Clarkson University)*; Sean Banerjee (Clarkson University); Natasha Kholgade Banerjee (Clarkson University) | N/A | N/A |
| Graph Neural Network for Cell Tracking in Microscopy Videos | Tal Ben-Haim (School of Electrical and Computer Engineering, Ben-Gurion University)*; Tammy Riklin Raviv (BGU) | N/A | N/A |
| Anti-Neuron Watermarking: Protecting Personal Data Against Unauthorized Neural Networks | Zihang Zou (University of Central Florida)*; Boqing Gong (Google); Liqiang Wang (University of Central Florida) | N/A | N/A |
| PACS: A Dataset for Physical Audiovisual Commonsense Reasoning | Samuel Yu (Carnegie Mellon University)*; Peter Wu (UC Berkeley); Paul Pu Liang (Carnegie Mellon University); Ruslan Salakhutdinov (Carnegie Mellon University); Louis-Philippe Morency (Carnegie Mellon University) | N/A | N/A |
| Intelli-Paint: Towards Developing More Human-Intelligible Painting Agents | Jaskirat Singh (Australian National University)*; Cameron Y Smith (Adobe Research); Jose Echevarria (Adobe System Inc.); Liang Zheng (Australian National University) | N/A | N/A |
| Rethinking Few-Shot Object Detection on A Multi-Domain Benchmark | Kibok Lee (Yonsei University); Hao Yang (Amazon)*; Satyaki Chakraborty (Amazon ); Zhaowei Cai (Amazon); Gurumurthy Swaminathan (Amazon); Avinash Ravichandran (Amazon); Onkar Dabeer (Amazon) | N/A | N/A |
| LidarNAS: Unifying and Searching Neural Architectures for 3D Point Clouds | Chenxi Liu (Waymo)*; Zhaoqi Leng (Waymo); Pei Sun (Waymo); Shuyang Cheng (Waymo LLC); Charles R. Qi (Waymo); Yin Zhou (Waymo); Mingxing Tan (Waymo); Dragomir Anguelov (Waymo) | N/A | N/A |
| Improving the Intra-class Long-tail in 3D Detection via Rare Example Mining | Chiyu Jiang (Waymo)*; Mahyar Najibi (Waymo LLC); Charles R. Qi (Waymo); Yin Zhou (Waymo); Dragomir Anguelov (Waymo) | N/A | N/A |
| Learning to Learn with Smooth Regularization | Yuanhao Xiong (UCLA)*; Cho-Jui Hsieh (UCLA) | N/A | N/A |
| A Dataset for Interactive Vision-Language Navigation with Unknown Command Feasibility | Andrea Burns (Boston University)*; Deniz Arsan (University of Illinois at Urbana Champaign); Sanjna Agrawal (Boston University); Ranjitha Kumar (UIUC: CS); Kate Saenko (Boston University); Bryan Plummer (Boston University) | N/A | N/A |
| CoVisPose: Co-Visibility Pose Transformer for Wide-Baseline Relative Pose Estimation in 360 Indoor Panoramas | Will A Hutchcroft (Zillow Group)*; Yuguang Li (Zillow Group); Ivaylo Boyadzhiev (Zillow Group); Zhiqiang Wan (Zillow); Haiyan Wang (The City College of New York); Sing Bing Kang (Zillow Group) | N/A | N/A |
| PT4AL: Using Self-Supervised Pretext Tasks for Active Learning | John Seon Keun Yi (Georgia Institute of Technology)*; Minseok Seo (si-analytics); Jongchan Park (Lunit); Dong-Geol Choi (Hanbat National University) | N/A | N/A |
| Uncertainty Quantification in Depth Estimation via Constrained Ordinal Regression | Dongting Hu (The University of Melbourne); Liuhua Peng (The University of Melbourne); Tingjin Chu (University of Melbourne); Xiaoxing Zhang (Meituan); Yinian Mao (Meituan-Dianping Group ); Howard Bondell (University of Melbourne); Mingming Gong (University of Melbourne)* | N/A | N/A |
| All You Need is RAW: Defending Against Adversarial Attacks with Camera Image Pipelines | Yuxuan Zhang (Princeton University)*; Bo Dong (Princeton University); Felix Heide (Princeton University) | N/A | N/A |
| ParC-Net: Position Aware Circular Convolution with Merits from ConvNets and Transformer | Haokui Zhang (Lighthouse Co.Ltd)*; Wenze Hu (Lighthouse Co.Ltd); Xiaoyu Wang (The Chinese University of Hong Kong (Shenzhen)) | N/A | N/A |
| BézierPalm: A Free lunch for Palmprint Recognition | KAI ZHAO (UCLA)*; Lei Shen (Tencent); Yingyi Zhang (Tencent); Chuhan Zhou (Tencent & VIA University College); Tao Wang (Tencent YouTu Lab); Ruixin Zhang (Tencent); Shouhong Ding (Tencent); Wei Jia (Hefei University of Technology); Wei Shen (Shanghai Jiao Tong University) | N/A | N/A |
| A Repulsive Force Unit for Garment Collision Handling in Neural Networks | Qingyang Tan (UMD)*; Yi Zhou (Adobe Research); Tuanfeng Wang (adobe research); Duygu Ceylan (Adobe Research); Xin Sun (Adobe Research); Dinesh Manocha (University of Maryland at College Park) | N/A | N/A |
| CYBORGS: Contrastively Bootstrapping Object Representations by Grounding in Segmentation | Renhao Wang (Tsinghua University)*; Hang Zhao (Tsinghua University); Yang Gao (Tsinghua University) | N/A | N/A |
| Connecting Compression Spaces with Transformer for Approximate Nearest Neighbor Search | Haokui Zhang (Lighthouse Co.Ltd)*; Buzhou Tang (Harbin Institute of Technology, China); Wenze Hu (Lighthouse Co.Ltd); Xiaoyu Wang (The Chinese University of Hong Kong (Shenzhen)) | N/A | N/A |
| Training Vision Transformers with Only 2040 Images | Yunhao Cao (Nanjing University); Hao Yu (Nanjing University); Jianxin Wu (Nanjing University)* | N/A | N/A |
| Black-box Few-shot Knowledge Distillation | Dang Nguyen (Deakin University)*; Sunil Gupta (Deakin University, Australia); Kien Duc Do (Deakin University); Svetha Venkatesh (Deakin University) | N/A | N/A |
| AutoAvatar: Autoregressive Neural Fields for Dynamic Avatar Modeling | Ziqian Bai (Simon Fraser University)*; Timur Bagautdinov (Facebook); Javier Romero (Facebook); Michael Zollhöfer (Facebook Reality Labs); Ping Tan (Simon Fraser University); Shunsuke Saito (Facebook) | N/A | N/A |
| Ghost-free High Dynamic Range Imaging with Context-aware Transformer | Zhen Liu (Sichuan University; Megvii ); Yinglong Wang (Huawei Noah’s Ark Lab); Bing Zeng (University of Electronic Science and Technology of China); Shuaicheng Liu (UESTC; Megvii)* | N/A | N/A |
| Cross-Domain Cross-Set Few-Shot Learning via Learning Compact and Aligned Representations | Wentao Chen (University of Science and Technology of China)*; Zhang Zhang (Institute of Automation, Chinese Academy of Sciences); Wei Wang (Institute of Automation Chinese Academy of Sciences); Liang Wang (NLPR, China); Zilei Wang (University of Science and Technology of China); Tieniu Tan (NLPR, China) | N/A | N/A |
| Motion Transformer for Unsupervised Image Animation | Jiale Tao (University of Electronic Science and Technology of China)*; Biao Wang (Alibaba Group); Tiezheng Ge (Alibaba Group); Yuning Jiang (Alibaba Group); Wen Li (University of Electronic Science and Technology of China); Lixin Duan (University of Electronic Science and Technology of China) | N/A | N/A |
| LiDAR Distillation: Bridging the Beam-Induced Domain Gap for 3D Object Detection | Yi Wei (Tsinghua University)*; Zibu Wei (Tsinghua University); Yongming Rao (Tsinghua University); Jiaxin Li (Gaussian Robotics); Jie Zhou (Tsinghua University); Jiwen Lu (Tsinghua University) | N/A | N/A |
| PSS: Progressive Sample Selection for Open-World Visual Representation Learning | Tianyue Cao (Shanghai Jiao Tong University); Yongxin Wang (Amazon)*; Yifan Xing (AMAZON CORPORATE LLC); Tianjun Xiao (Amazon); Tong He (Amazon); Zheng Zhang (AWS); Hao Zhou (Amazon); Joseph Tighe (Amazon) | N/A | N/A |
| Self-slimmed Vision Transformer | Zhuofan Zong (Beihang University)*; Kunchang Li (Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences); Guanglu Song (Sensetime); Yali Wang (Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences); Yu Qiao (Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences); Biao Leng (Beihang University); Yu Liu (SenseTime Group LTD) | N/A | N/A |
| Switchable Online Knowledge Distillation | Biao Qian (Hefei University of Technology); Yang Wang (Hefei University of Technology)*; Hongzhi Yin (The University of Queensland); Richang Hong (Hefei University of Technology); Meng Wang (Hefei University of Technology) | N/A | N/A |
| Adaptive Transformers for Robust Few-shot Cross-domain Face Anti-spoofing | Hsin-Ping Huang (University of California, Merced)*; Deqing Sun (Google); Yaojie Liu (Google); Wen-Sheng Chu (Google); Taihong Xiao (University of California at Merced); Jinwei Yuan (Google); Hartwig Adam (Google); Ming-Hsuan Yang (University of California at Merced) | N/A | N/A |
| GraphFit: Learning Multi-scale Graph-Convolutional Representation for Point Cloud Normal Estimation | Keqiang Li (Institute of Automation, Chinese Academy of Sciences; School of Artificial Intelligence, University of Chinese Academy of Sciences)*; Mingyang Zhao (University of Chinese Academy of Sciences & Beijing Academy of Artificial Intelligence); Huaiyu Wu (Institute of Automation, Chinese Academy of Sciences); Dong-Ming Yan (NLPR, CASIA); Zhen Shen (Institute of Automation, Chinese Academy of Sciences/Qingdao Academy of Intelligent Industries); Fei-Yue Wang (Institute of Automation, Chinese Academy of Sciences); gang xiong (CASIA) | N/A | N/A |
| Are Vision Transformers Robust to Patch-wise Perturbations? | Jindong Gu (University of Munich)*; Volker Tresp (Siemens AG and Ludwig Maximilian University of Munich ); Yao Qin (Google) | N/A | N/A |
| DualPrompt: Complementary Prompting for Rehearsal-free Continual Learning | Zifeng Wang (Northeastern University)*; Zizhao Zhang (Google); Sayna Ebrahimi (Google); Ruoxi Sun (Google); Han Zhang (Google); Chen-Yu Lee (Google); Xiaoqi Ren (Google); Guolong Su (Google); Vincent Perot (Google AI); Jennifer Dy (Northeastern); Tomas Pfister (Google) | N/A | N/A |
| EleGANt: Exquisite and Locally Editable GAN for Makeup Transfer | Chenyu Yang (Tsinghua University)*; Wanrong He (Tsinghua University); Yingqing Xu (Tsinghua University); Yang Gao (Tsinghua University) | N/A | N/A |
| Union-set Multi-source Model Adaptation for Semantic Segmentation | Zongyao Li (Hokkaido University)*; Ren Togo (Hokkaido University); Takahiro Ogawa (Hokkaido University); Miki Haseyama (Hokkaido University) | N/A | N/A |
| Bridging Images and Videos: A Simple Learning Framework for Large Vocabulary Video Object Detection | Sanghyun Woo (KAIST)*; Kwanyong Park (KAIST); Seoung Wug Oh (Adobe Research); In So Kweon (KAIST); Joon-Young Lee (Adobe Research) | N/A | N/A |
| TDAM: Top-Down Attention Module for Contextually Guided Feature Selection in CNNs | Shantanu Jaiswal (Agency for Science, Technology and Research ); Basura Fernando (Agency for Science, Technology and Research, ASTAR, Singapore); Cheston Tan (Institute for Infocomm Research, Singapore) | N/A | N/A |
| Exploring Disentangled Content Information for Face Forgery Detection | Jiahao Liang (Beijing University of Posts and Telecommunications)*; Huafeng Shi (SenseTime Group Limited); Weihong Deng (Beijing University of Posts and Telecommunications) | N/A | N/A |
| Object Discovery via Contrastive Learning for Weakly Supervised Object Detection | Jinhwan Seo (Pohang University of Science and Technology)*; Wonho Bae (University of British Columbia); Danica J. Sutherland (University of British Columbia); Junhyug Noh (Lawrence Livermore National Laboratory); Daijin Kim (Pohang University of Science and Technology) | N/A | N/A |
| Unifying Vision Unsupervised Contrastive Learning from a Graph Perspective | Shixiang Tang (The University of Sydney)*; Feng Zhu (University of Science and Technology of China); Lei Bai (Shanghai AI Laboratory); Rui Zhao (SenseTime Group Limited); Chenyu Wang (University of Sydney, Sydney Neuroimaging Analysis Centre); Wanli Ouyang (The University of Sydney) | N/A | N/A |
| E-NeRV: Expedite Neural Video Representation with Disentangled Spatial-Temporal Context | Zizhang Li (Zhejiang University)*; Mengmeng Wang (Zhejiang University); Huaijin Pi (Zhejiang University); Kechun Xu (Zhejiang University); Jianbiao Mei (Zhejiang University); Yong Liu (Zhejiang University) | N/A | N/A |
| $\ell_\infty$-Robustness and Beyond: Unleashing Efficient Adversarial Training | Hadi Mohaghegh Dolatabadi (University of Melbourne)*; Sarah Erfani (University of Melbourne); Christopher Leckie (University of Melbourne) | N/A | N/A |
| Spatial-Separated Curve Rendering Network for Efficient and High-Resolution Image Harmonization | Jingtang Liang (University of Macau)*; Xiaodong Cun (Tencent AI Lab); Chi-Man Pun (University of Macau); Jue Wang (Tencent AI Lab) | N/A | N/A |
| Point MixSwap: Attentional Point Cloud Mixing via Swapping Matched Structural Divisions | Ardian Umam (NYCU)*; Cheng-Kun Yang (National Taiwan University); Yung-Yu Chuang (National Taiwan University); Jen-Hui Chuang (National Chiao Tung University ); Yen-Yu Lin (National Yang Ming Chiao Tung University) | N/A | N/A |
| One Size Does NOT Fit All: Data-Adaptive Adversarial Training | Shuo Yang (University of Sydney)*; Chang Xu (University of Sydney) | N/A | N/A |
| IS-MVSNet: Importance Sampling-based MVSNet | Likang Wang (HKUST)*; Yue Gong (Huawei Technologies Co., Ltd.); Xinjun Ma (Huawei); Qirui Wang (Huawei Technologies Co., Ltd.); Kaixuan Zhou (Huawei ); Lei Chen (Hong Kong University of Science and Technology) | N/A | N/A |
| Multi-Granularity Pruning for Model Acceleration on Mobile Devices | Tianli Zhao (Institute of Automation, Chinese Academy of Sciences; University of Chinese Academy of Sciences); Xi Sheryl Zhang (Institute of Automation, Chinese Academy of Sciences); Wentao Zhu (Amazon); Jiaxing Wang (Institute of Automation, Chinese Academy of Sciences); Sen Yang (Kuaishou); Ji Liu (Kwai Inc.); Jian Cheng (Chinese Academy of Sciences, China)* | N/A | N/A |
| Style-Agnostic Reinforcement Learning | Juyong Lee (POSTECH); Seokjun Ahn (POSTECH); Jaesik Park (POSTECH)* | N/A | N/A |
| Editing Out-of-domain GAN Inversion via Differential Activations | Haorui Song (South China University of Technology); Yong Du (Ocean University of China); Tianyi Xiang (South China University of Technology); Junyu Dong (Ocean University of China); Jing Qin (The Hong Kong Polytechnic University); Shengfeng He (South China University of Technology)* | N/A | N/A |
| Bagging Regional Classification Activation Maps for Weakly Supervised Object Localization | Lei Zhu (Beijing University of Posts and Telecommunications); Qian Chen (University of Science and Technology of China); Lujia Jin (Peking University); yunfei you (Peking University); Yanye Lu (Peking University)* | N/A | N/A |
| Mutually Reinforcing Structure with Proposal Contrastive Consistency for Few-Shot Object Detection | TianXue Ma (East China Normal University)*; Mingwei Bi (Tencent); Jian Zhang (Tencent Youtu); Wang Yuan (East China Normal University); Zhizhong Zhang (East China Normal University); Yuan Xie (East China Normal University); Shouhong Ding (Tencent); Lizhuang Ma (Shanghai Jiao Tong University) | N/A | N/A |
| Panoptic-PartFormer: Learning a Unified model for Panoptic Part Segmentation | Xiangtai Li (Peking University)*; Shilin Xu (Peking University); Yibo Yang (Peking University); Guangliang Cheng (Sensetime Group Limited); Yunhai Tong (Peking University); Dacheng Tao (JD.com) | N/A | N/A |
| TextAdaIN: Paying Attention to Shortcut Learning in Text Recognizers | Oren Nuriel (Amazon)*; Ron Litman (Amazon); Sharon Fogel (Amazon) | N/A | N/A |
| Speaker-adaptive Lip Reading with User-dependent Padding | Minsu Kim (KAIST)*; Hyunjun Kim (KAIST); Yong Man Ro (KAIST) | N/A | N/A |
| Online Domain Adaptation for Semantic Segmentation in Ever-Changing Conditions | Theodoros Panagiotakopoulos (KTH Royal Institute of Technology in Stockholm); Pier Luigi Dovesi (Univrses); Linus Härenstam-Nielsen (Artisense); Matteo Poggi (University of Bologna)* | N/A | N/A |
| Point Scene Understanding via Disentangled Instance Mesh Reconstruction | Jiaxiang Tang (Peking University)*; Xiaokang Chen (Peking University); Jingbo Wang (The Chinese University of Hong Kong); Gang Zeng (Peking University) | N/A | N/A |
| Dual Contrastive Learning with Anatomical Auxiliary Supervision for Few-shot Medical Image Segmentation | Huisi Wu (Shenzhen University)*; Fangyan Xiao (Shenzhen University); Chongxin Liang (Shenzhen University) | N/A | N/A |
| An Efficient Person Clustering Algorithm for Open Checkout-free Groceries | Junde Morsen Wu (Purdue University); Yu Zhang (Harbin Institute of Technology); RAO FU (None); Yuanpei Liu (Beijing Institute of Technology); Jing Gao (Purdue University)* | N/A | N/A |
| Face2Face^ρ: Real-Time High-Resolution One-Shot Face Reenactment | Kewei Yang (NetEase Games AI Lab)*; Kang Chen (NetEase Games AI Lab); Daoliang Guo (NetEase Games AI Lab); Song-Hai Zhang (Tsinghua University); Yuan-Chen Guo (Tsinghua University); Weidong Zhang (Netease Games AI Lab) | N/A | N/A |
| Decoupled Contrastive Learning | Chun-Hsiao Yeh (Academia Sinica / UC Berkeley)*; Cheng-Yao Hong (Academia Sinica); Yen-Chi Hsu (Academia Sinica); Tyng-Luh Liu (Academia Sinica); Yubei Chen (Berkeley AI Research, UC Berkeley); yann lecun (Facebook) | N/A | N/A |
| Learning Algebraic Representation for Systematic Generalization in Abstract Reasoning | Chi Zhang (University of California, Los Angeles)*; Sirui Xie (UCLA); Baoxiong Jia (UCLA); Ying Nian Wu (University of California, Los Angeles); Song-Chun Zhu (UCLA); Yixin Zhu (Peking University) | N/A | N/A |
| On the Robustness of Quality Measures for GANs | Motasem Alfarra (KAUST)*; Juan C Perez (KAUST); Anna Fruehstueck (KAUST); Philip Torr (University of Oxford); Peter Wonka (KAUST); Bernard Ghanem (KAUST) | N/A | N/A |
| Automatic Check-Out via Prototype-based Classifier Learning from Single-Product Exemplars | Hao Chen (Nanjing University of Science and Technology)*; Xiu-Shen Wei (Nanjing University of Science and Technology); Faen Zhang (AInnovation Co. Ltd.); Yang Shen (Nanjing University of Science and Technology); Hui Xu (QINGDAO AINNOVATION TECHNOLOGY GROUP CO., LTD); liang xiao (nanjing university of science and technology) | N/A | N/A |
| TDViT: Temporal Dilated Transformer for Dense Video Tasks | Guanxiong Sun (Queen’s University Belfast); Yang Hua (Queen’s University Belfast)*; Guosheng Hu (Oosto); Neil Robertson (Queen’s University Belfast) | N/A | N/A |
| POP: Mining POtential Performance of new fashion products via webly cross-modal query expansion | Christian Joppi (Humatics srl)*; Geri Skenderi (University of Verona); Marco Cristani (University of Verona) | N/A | N/A |
| BRACE: The Breakdancing Competition Dataset for Dance Motion Synthesis | Davide Moltisanti (University of Edinburgh)*; Jinyi Wu (S-Lab Nanyang Technological University); Bo Dai (Shanghai AI Lab); Chen Change Loy (Nanyang Technological University) | N/A | N/A |
| Towards Racially Unbiased Skin Tone Estimation via Scene Disambiguation | Haiwen Feng (Max Planck Institute for Intelligent Systems); Timo Bolkart (Max Planck Institute for Intelligent Systems); Joachim Tesch (Max Planck Institute for Intelligent Systems); Michael J. Black (Max Planck Institute for Intelligent Systems); Victoria Fernandez Abrevaya (Max Planck Institute)* | N/A | N/A |
| Style-Guided Shadow Removal | Jin Wan (Beijing Jiaotong University); Hui Yin (Beijing Jiaotong University)*; Zhenyao Wu (University of South Carolina); Xinyi Wu (University of South Carolina); Yanting Liu (Yanting Liu); Song Wang (University of South Carolina) | N/A | N/A |
| Sound-guided Semantic Video Generation | Seung Hyun Lee (Korea University)*; Gyeongrok Oh (Korea University); Wonmin Byeon (NVIDIA Research); Jihyun Bae (Korea University); Chanyoung Kim (Korea University); Won Jeong Ryoo (Korea University); Sang Ho Yoon (KAIST); Hyunjun Cho (Korea University); Jinkyu Kim (Korea University); Sangpil Kim (Korea University) | N/A | N/A |
| Robust Visual Tracking by Segmentation | Matthieu Paul (ETH Zurich)*; Martin Danelljan (ETH Zurich); Christoph Mayer (ETH Zurich); Luc Van Gool (ETH Zurich) | N/A | N/A |
| Semi-Supervised Learning of Optical Flow by Flow Supervisor | Woobin Im (KAIST); Sebin Lee (KAIST); Sungeui Yoon (KAIST)* | N/A | N/A |
| Joint Learning of Localized Representations from Medical Images and Reports | Philip Müller (Technical University of Munich)*; Georgios Kaissis (Technische Universität München); congyu zou (Klinikum Rechts der Isar Technische Universität München ); Daniel Rueckert (Technische Universität München) | N/A | N/A |
| D2C-SR: A Divergence to Convergence Approach for Real-World Image Super-Resolution | Youwei Li (Megvii); Haibin Huang (Kuaishou Technology); lanpeng jia (GWM); Haoqiang Fan (Megvii Inc(face++)); Shuaicheng Liu (UESTC; Megvii)* | N/A | N/A |
| Continual 3D Convolutional Neural Networks for Real-time Processing of Videos | Lukas Hedegaard (Aarhus University)*; Alexandros Iosifidis (Aarhus University) | N/A | N/A |
| Salient Object Detection for Point Clouds | Songlin Fan (Peking University ); Wei Gao (SECE, Shenzhen Graduate School, Peking University)*; Ge Li (Peking University) | N/A | N/A |
| Deep ensemble learning by diverse knowledge distillation for fine-grained object classification | Naoki Okamoto (Chubu University)*; Tsubasa Hirakawa (Chubu University); Takayoshi Yamashita (Chubu University); Hironobu Fujiyoshi (Chubu University) | N/A | N/A |
| Source-free Video Domain Adaptation by Learning Temporal Consistency for Action Recognition | Yuecong Xu (Institute for Infocomm Research, ASTAR, Singapore); Jianfei Yang (Nanyang Technological University); Haozhi Cao (Nanyang Technological University); Keyu Wu (Institute for Infocomm Research, ASTAR, Singapore); Min Wu (Institute for Infocomm Research, ASTAR, Singapore); Zhenghua Chen (Institute for Infocomm Research, A*STAR, Singapore) | N/A | N/A |
| GRIT-VLP: Grouped Mini-batch Sampling for Efficient Vision and Language Pre-training | Jaeseok Byun (Seoul National University); Taebaek Hwang (M.IN.D Lab); Jianlong Fu (Microsoft Research); Taesup Moon (Seoul National University)* | N/A | N/A |
| Pose Forecasting in Industrial Human-Robot Collaboration | Alessio Sampieri (Sapienza University)*; Guido Maria D’Amely di Melendugno (Sapienza University); ANDREA AVOGARO (University of Verona); Federico Cunico (University of Verona); Francesco Setti (University of Verona); Geri Skenderi (University of Verona); Marco Cristani (University of Verona); Fabio Galasso (Sapienza University) | N/A | N/A |
| MeshLoc: Mesh-Based Visual Localization | Vojtech Panek (CTU in Prague, FEE, CIIRC)*; Zuzana Kukelova (Czech Technical University in Prague); Torsten Sattler (Czech Technical University in Prague) | N/A | N/A |
| Dress Code: High-Resolution Multi-Category Virtual Try-On | Davide Morelli (UNIMORE); Matteo Fincato (Università degli Studi di Modena e Reggio Emilia); Marcella Cornia (University of Modena and Reggio Emilia)*; Federico Landi (University of Modena and Reggio Emilia); Fabio Cesari (YOOX Net-A-Porter Group S.p.A.); Rita Cucchiara (Università di Modena e Reggio Emilia) | N/A | N/A |
| UC-OWOD: Unknown-Classified Open World Object Detection | Zhiheng Wu (Institute of Automation, Chinese Academy of Sciences (CASIA))*; Yue Lu (Institute of Automation, Chinese Academy of Sciences(CASIA)); Xingyu Chen (Xiaobing.AI); Zhengxing Wu (CASIA); Liwen Kang (Institute of Automation, Chinese Academy of Sciences (CASIA)); Junzhi Yu (CASIA) | N/A | N/A |
| Helpful or Harmful: Inter-Task Association in Continual Learning | Hyundong Jin (Chung-Ang University ); Eunwoo Kim (Chung-Ang University)* | N/A | N/A |
| RayTran: 3D pose estimation and shape reconstruction of multiple objects from videos with ray-traced transformers | Michał J Tyszkiewicz (EPFL); Kevis-Kokitsi Maninis (Google Research)*; Stefan Popov (Google Research); Vittorio Ferrari (Google Research) | N/A | N/A |
| Efficient Point Cloud Segmentation with Geometry-aware Sparse Networks | Maosheng Ye (HKUST)*; Rui Wan (Deeproute.ai); Shuangjie Xu (HKUST); Tongyi Cao (Deeproute.ai); Qifeng Chen (HKUST) | N/A | N/A |
| Dynamic Spatio-Temporal Specialization Learning for Fine-Grained Action Recognition | Tianjiao Li (Singapore University of Technology and Design)*; Lin Geng Foo (Singapore University of Technology and Design); Qiuhong Ke (Monash University); Hossein Rahmani (Lancaster University); Anran Wang (Bytedance); Jinghua Wang (Harbin Institute of Technology); Jun Liu (Singapore University of Technology and Design) | N/A | N/A |
| TISE: Bag of Metrics for Text-to-Image Synthesis Evaluation | Tan Minh Dinh (VinAI Research)*; Rang NGUYEN (VinAI Research); Binh-Son Hua (VinAI Research) | N/A | N/A |
| CostDCNet: Cost Volume based Depth Completion for a Single RGB-D Image | Jaewon Kam (POSTECH); Jungeon Kim (POSTECH); Soongjin Kim (POSTECH); Jaesik Park (POSTECH); Seungyong Lee (POSTECH)* | N/A | N/A |
| Efficient Video Deblurring Guided by Motion Magnitude | Yusheng Wang (The University of Tokyo)*; Yunfan Lu (Hong Kong University of Science and Technology); Ye Gao (Honor Technologies Japan); Lin Wang (HKUST); Zhihang Zhong (The University of Tokyo); Yinqiang Zheng (The University of Tokyo); Atsushi Yamashita (The University of Tokyo) | N/A | N/A |
| Space-Partitioning RANSAC | Daniel Barath (ETH Zürich)*; Gábor Valasek (ELTE) | N/A | N/A |
| Towards Accurate Binary Neural Networks via Modeling Contextual Dependencies | Xingrun Xing (Beihang University); Yangguang Li (SenseTime Group Limited); Wei Li (Nanyang Technological University); Wenrui Ding (Beihang University); Yalong Jiang (Beihang University)*; Yufeng Wang (Beihang University); Jing Shao (Sensetime); Chunlei Liu (Beihang University); Xianglong Liu (BUAA) | N/A | N/A |
| Overcoming Shortcut Learning in a Target Domain by Generalizing Basic Visual Factors from a Source Domain | Piyapat Saranrittichai (Bosch Center for Artificial Intelligence)*; Chaithanya Kumar Mummadi (Bosch Center for Artificial Intelligence); Claudia Blaiotta (Bosch Center for Artificial Intelligence); Mauricio Munoz (Bosch Center for Artificial Intelligence); Volker Fischer (Bosch Center for Artificial Intelligence) | N/A | N/A |
| SimpleRecon: 3D Reconstruction Without 3D Convolutions | Mohamed Sayed (University College London)*; John Gibson (Niantic, Inc.); Jamie Watson (Niantic); Victor A Prisacariu (Niantic Labs); Michael Firman (Niantic); Clement LJC Godard (Niantic) | N/A | N/A |
| SemAug: Semantically Meaningful Image Augmentations for Object Detection Through Language Grounding | Morgan L Heisler (Huawei Technologies Canada Co., Ltd.)*; Amin Banitalebi-Dehkordi (Huawei Technologies Canada Co., Ltd.); Yong Zhang (Huawei Technologies Canada Co., Ltd.) | N/A | N/A |
| A data-centric approach for improving ambiguous labels with combined semi-supervised classification and clustering | Lars Schmarje (Kiel University)*; Monty Santarossa (Kiel University); Simon-Martin Schröder (Kiel University); Claudius Zelenka (Kiel University); Rainer Kiko (Laboratoire d’Océanographie de Villefranche-sur-Mer); Jenny Stracke (University of Bonn); Nina Volkmann (University of Veterinary Medicine Hannover); Reinhard Koch (Kiel University) | N/A | N/A |
| SPIN: An Empirical Evaluation on Sharing Parameters of Isotropic Networks | Anish J Prabhu (Apple)*; Chien-Yu Lin (University of Washington); Thomas Merth (Apple); Sachin Mehta (University of Washington); Anurag Ranjan (Apple); Maxwell C Horton (Apple, Xnor.Ai and University of Washington); Mohammad Rastegari (University of Washington) | N/A | N/A |
| SAGA: Stochastic Whole-Body Grasping With Contact | Yan Wu (ETH Zurich); Jiahao Wang (Max Planck Institute for Informatics); Yan Zhang (ETH Zurich); Siwei Zhang (ETH Zurich); Otmar Hilliges (ETH Zurich); Fisher Yu (ETH Zurich); Siyu Tang (ETH Zurich)* | N/A | N/A |
| GTCaR: Graph Transformer for Camera Re-localization | Xinyi Li (Magic Leap)*; Haibin Ling (Stony Brook University) | N/A | N/A |
| Actor-centered Representations for Action Localization in Streaming Videos | Sathyanarayanan N Aakur (OK State)*; Sudeep Sarkar (University of South Florida, Tampa) | N/A | N/A |
| Photo-realistic Neural Domain Randomization | Sergey Zakharov (Toyota Research Institute)*; Rareș A Ambruș (Toyota Research Institute); Vitor Guizilini (Toyota Research Institute); Wadim Kehl (Woven Planet); Adrien Gaidon (Toyota Research Institute) | N/A | N/A |
| ShAPO: Implicit Representations for Multi-Object Shape, Appearance, and Pose Optimization | Muhammad Zubair Irshad (Georgia Institute of Technology)*; Sergey Zakharov (Toyota Research Institute); Rareș A Ambruș (Toyota Research Institute); Thomas Kollar (Toyota Research Institute); Zsolt Kira (Georgia Institute of Technology); Adrien Gaidon (Toyota Research Institute) | N/A | N/A |
| Structure and Motion for Casual Videos | Zhoutong Zhang (MIT)*; Forrester Cole (Google Research); Zhengqi Li (Google Inc.); Noah Snavely (Google); Michael Rubinstein (Google); William T Freeman (Google) | N/A | N/A |
| Single Frame Atmospheric Turbulence Mitigation: A Benchmark Study and A New Physics-Inspired Transformer Model | Zhiyuan Mao (Purdue University)*; AJAY KUMAR JAISWAL (UT Austin); Zhangyang Wang (University of Texas at Austin); Stanley Chan (Purdue University, USA) | N/A | N/A |
| Incremental Task Learning with Incremental Rank Updates | Rakib Hyder (University of California, Riverside)*; Ken Shao (UCR); Boyu Hou (The University of California, Riverside ); Panagiotis Markopoulos (RIT); Ashley Prater-Bennette (Air Force Research Laboratory); M. Salman Asif (University of California, Riverside) | N/A | N/A |
| Bandwidth-Aware Adaptive Codec for DNN Inference Offloading in IoT | Xiufeng Xie (Kwai Inc.)*; Ning Zhou (Amazon); Wentao Zhu (Amazon); Ji Liu (Kwai Inc.) | N/A | N/A |
| Inpainting at Modern Camera Resolution by Guided PatchMatch with Auto-Curation | Connelly Barnes (Adobe)*; Lingzhi Zhang (University of Pennsylvania); Jianbo Shi (University of Pennsylvania); Zhe Lin (Adobe Research); Eli Shechtman (Adobe Research, US); Sohrab Amirghodsi (Adobe Research); Kevin Wampler (Adobe Systems Inc.) | N/A | N/A |
| Controllable Video Generation through Global and Local Motion Dynamics | Aram Davtyan (University of Bern)*; Paolo Favaro (University of Bern) | N/A | N/A |
| UniCR: Universally Approximated Certified Robustness via Randomized Smoothing | Hanbin Hong (University of Connecticut)*; Binghui Wang (Illinois Institute of Technology); Yuan Hong (University of Connecticut) | N/A | N/A |
| 3D Siamese Transformer Network for Single Object Tracking on Point Clouds | Le Hui (Nanjing University of Science and Technology)*; Lingpeng Wang (Nanjing University of Science and Technology); Linghua Tang (Nanjing University of Science and Technology); Kaihao Lan (Nanjing University of Science and Technology); Jin Xie (Nanjing University of Science and Technology); Jian Yang (Nanjing University of Science and Technology) | N/A | N/A |
| Hardly Perceptible Trojan Attack against Neural Networks with Bit Flips | Jiawang Bai (Tsinghua University)*; Kuofeng Gao (Tsinghua University); Dihong Gong (Tencent AI Lab); Shu-Tao Xia (Tsinghua University); Zhifeng Li (Tencent AI Lab); Wei Liu (Tencent) | N/A | N/A |
| StyleHEAT: One-Shot High-Resolution Editable Talking Face Generation via Pre-trained StyleGAN | Fei Yin (Tsinghua University)*; Yong Zhang (Tencent AI Lab); Xiaodong Cun (Tencent AI Lab); Mingdeng Cao (Tsinghua University); Yanbo Fan (Tencent AI Lab); Xuan Wang (Tencent AI Lab); Qingyan Bai (Tsinghua University); Baoyuan Wu (The Chinese University of Hong Kong, Shenzhen); Jue Wang (Tencent AI Lab); Yujiu Yang (Tsinghua University) | N/A | N/A |
| Referring Object Manipulation of Natural Images with Conditional Classifier-Free Guidance | Myungsub Choi (Google)* | N/A | N/A |
| Self-Supervised Interactive Object Segmentation Through a Singulation-and-Grasping Approach | Houjian Yu (University of Minnesota, Twin Cities)*; Changhyun Choi (University of Minnesota, Twin Cities) | N/A | N/A |
| BigColor: Colorization using a Generative Color Prior for Natural Images | Geonung Kim (POSTECH); Kyoungkook Kang (POSTECH); Seongtae Kim (POSTECH); Hwayoon Lee (POSTECH); Sehoon Kim (Samsung Electronics Co. Ltd.); Jonghyun Kim (Samsung Electronics); Seung-Hwan Baek (POSTECH); Sunghyun Cho (POSTECH)* | N/A | N/A |
| Object Wake-up: 3D Object Rigging from a Single Image | Ji Yang (University of Alberta)*; Xinxin Zuo (University of Alberta); Sen Wang (University of Alberta); Zhenbo Yu (Shanghai Jiao Tong University); Xingyu Li (University of Alberta); Bingbing Ni (Shanghai Jiao Tong University); Minglun Gong (University of Guelph); Li Cheng (ECE dept., University of Alberta) | N/A | N/A |
| ClearPose: Large-scale Transparent Object Dataset and Benchmark | Xiaotong Chen (University of Michigan, Ann Arbor)*; Huijie Zhang (University of Michigan, Ann Arbor); Zeren Yu (University of Michigan–Ann Arbor); Anthony Opipari (University of Michigan); Odest Chadwicke Jenkins (University of Michigan) | N/A | N/A |
| Domain Knowledge-Informed Self-Supervised Representations for Workout Form Assessment | Paritosh Parmar (University of British Columbia)*; Amol Gharat (Flex A.I.); Helge Rhodin (UBC) | N/A | N/A |
| Neural Capture of Animatable 3D Human from Monocular Video | Gusi Te (Peking University); Xiu Li (Tencent); Xiao Li (Microsoft Research Asia)*; Jinglu Wang (Microsoft Research Asia); Wei Hu (Peking University); Yan Lu (Microsoft Research Asia) | N/A | N/A |
| Open Vocabulary Object Detection with Pseudo Bounding-Box Labels | Mingfei Gao (Apple)*; Chen Xing (Salesforce Research); Juan Carlos Niebles (Salesforce & Stanford University); Junnan Li (Salesforce); Ran Xu (Salesforce Research); Wenhao Liu (Salesforce Metamind); Caiming Xiong (Salesforce Research) | N/A | N/A |
| BoundaryFace: A mining framework with noise label self-correction for Face Recognition | Shijie Wu (Southwest Jiaotong University)*; Xun Gong (Southwest Jiaotong University) | N/A | N/A |
| IntegratedPIFu: Integrated Pixel Aligned Implicit Function for Single-view Human Reconstruction | Kennard Chan Yanting (Nanyang Technological University)*; Guosheng Lin (Nanyang Technological University); Haiyu Zhao (SenseTime International Pte Ltd); Weisi Lin (Nanyang Technological University, Singapore) | N/A | N/A |
| BMD: A General Class-balanced Multicentric Dynamic Prototype Strategy for Source-free Domain Adaptation | Sanqing Qu (Tongji University); Guang Chen (Tongji University)*; Jing Zhang (The University of Sydney); Zhijun Li (University of Science and Technology of China); Wei He (University of Science and Technology Beijing); Dacheng Tao (JD.com) | N/A | N/A |
| What Matters for 3D Scene Flow Network | Guangming Wang (Shanghai Jiao Tong University); Yunzhe Hu (Shanghai Jiao Tong University); Zhe Liu (University of Cambridge); Yiyang Zhou (UC Berkeley ); Masayoshi TOMIZUKA (MSC Lab); Wei Zhan (University of California, Berkeley); Hesheng Wang (SJTU)* | N/A | N/A |
| Controllable Shadow Generation Using Pixel Height Maps | Yichen Sheng (Purdue University)*; Yifan Liu (University of Adelaide); Jianming Zhang (Adobe Research); Wei Yin (University of Adelaide); A. Cengiz Oztireli (University of Cambridge, Google); He Zhang (Adobe); Zhe Lin (Adobe Research); Eli Shechtman (Adobe Research, US); Bedrich Benes (Purdue University) | N/A | N/A |
| CADyQ: Content-Aware Dynamic Quantization for Image Super-Resolution | Cheeun Hong (Seoul National University); Sungyong Baik (Hanyang University); Heewon Kim (Seoul National University); Seungjun Nah (NVIDIA); Kyoung Mu Lee (Seoul National University)* | N/A | N/A |
| SPSN: Superpixel Prototype Sampling Network for RGB-D Salient Object Detection | Minhyeok Lee (Yonsei University)*; Chaewon Park (Yonsei University); Suhwan Cho (Yonsei University); Sangyoun Lee (Yonsei University) | N/A | N/A |
| Long Video Generation with Time-Agnostic VQGAN and Time-Sensitive Transformer | Songwei Ge (University of Maryland)*; Thomas F Hayes (Meta); Harry Yang (Facebook); Xi Yin (Facebook); Guan Pang (Facebook); David Jacobs (University of Maryland, USA); Jia-Bin Huang (Facebook ); Devi Parikh (Georgia Tech & Facebook AI Research) | N/A | N/A |
| Combining Internal and External Constraints for Unrolling Shutter in Videos | Eyal Naor (Weizmann Institute)*; Itai Antebi (Weizmann); Shai Bagon (Weizmann Institute of Science); Michal Irani (Weizmann Institute, Israel) | N/A | N/A |
| Global Spectral Filter Memory Network for Video Object Segmentation | Yong Liu (Tsinghua University)*; Ran Yu (Tsinghua University); Jiahao Wang (Tsinghua University); Xinyuan Zhao (Huawei); Yitong Wang (Bytedance); Yansong Tang (Tsinghua University); Yujiu Yang (Tsinghua University) | N/A | N/A |
| SEMICON: A Learning-to-hash Solution for Large-scale Fine-grained Image Retrieval | Yang Shen (Nanjing University of Science and Technology); Xu Hao XH SUN (Nanjing University Of Science And Technology); Xiu-Shen Wei (Nanjing University of Science and Technology)*; Qing-Yuan Jiang (HuaWei); Jian Yang (Nanjing University of Science and Technology) | N/A | N/A |
| Batch-efficient EigenDecomposition for Small and Medium Matrices | Yue Song (University of Trento)*; Nicu Sebe (University of Trento); Wei Wang (EPFL) | N/A | N/A |
| General Object Pose Transformation Network from Unpaired Data | Yukun Su (South China University of Technology)*; Guosheng Lin (Nanyang Technological University); RuiZhou Sun (South China University of Technology); Qingyao Wu (South China University of Technology) | N/A | N/A |
| Robust Network Architecture Search via Feature Distortion Restraining | Yaguan QIAN (Zhejiang University of Science and Technology)*; Shenghui Huang (Zhejiang University of Science and Technology); Bin WANG (Network and Information Security Laboratory of Hangzhou Hikvision Digital Technology Co.); Xiang Ling (Institute of Software, Chinese Academy of Sciences); Xiaohui Guan (Zhejiang University of Water Resources and Electric Power); Zhaoquan Gu (Guangzhou University); Shaoning Zeng (Yangtze Delta Region Institute (Huzhou), University of Electronic Science and Technology of China); Wujie Zhou (Zhejiang University of Science and Technology); Haijiang Wang (Zhejiang University of Science and Technology) | N/A | N/A |
| Correspondence Reweighted Translation Averaging | Lalit Manam (Indian Institute of Science Bengaluru)*; Venu Madhav Govindu (Indian Institute of Science) | N/A | N/A |
| RepMix: Representation Mixing for Robust Attribution of Synthesized Images | Tu Bui (University of Surrey)*; Ning Yu (Salesforce Research); John Collomosse (Adobe Research) | N/A | N/A |
| When Deep Classifiers Agree: Analyzing Correlations between Learning Order and Image Statistics | Iuliia Pliushch (Goethe University)*; Martin Mundt (TU Darmstadt); Nicolas Lupp (Goethe University Frankfurt); Visvanathan Ramesh (Goethe University) | N/A | N/A |
| S2F2: Single-Stage Flow Forecasting for Future Multiple Trajectories Prediction | YU-WEN CHEN (National Tsing Hua University); Hsuan-Kung Yang (National Tsing Hua University); Chu-Chi Chiu (National Tsing Hua University); Chun-Yi Lee (National Tsing Hua University)* | N/A | N/A |
| Few-Shot Object Detection by Knowledge Distillation Using Bag-of-Visual-Words Representations | Wenjie Pei (Harbin Institute of Technology, Shenzhen); Shuang Wu (Harbin Institute of Technology, Shenzhen); Dianwen Mei (Harbin Institute of Technology, Shenzhen); Fanglin Chen (Harbin Institute of Technology, Shenzhen); Jiandong Tian (CAS); Guangming Lu (Harbin Institute of Technology, Shenzhen)* | N/A | N/A |
| Stochastic Consensus: Enhancing Semi-Supervised Learning with Consistency of Stochastic Classifiers | Hui Tang (South China University of Technology)*; Kui Jia (South China University of Technology); Lin Sun (Magic Leap) | N/A | N/A |
| Learning Where To Look – Generative NAS is Surprisingly Efficient | Jovita Lukasik (University of Mannheim)*; Steffen Jung (MPII); Margret Keuper (University of Mannheim) | N/A | N/A |
| Realistic One-shot Mesh-based Head Avatars | Taras Khakhulin (Skolkovo Institute of Science and Technology)*; Vanessa Valerievna Skliarova (Skoltech); Victor Lempitsky (Yandex); Egor Zakharov (Skolkovo Institute of Science and Technology) | N/A | N/A |
| Ensemble Knowledge Guided Sub-network Search and Fine-tuning for Filter Pruning | Seunghyun Lee (Inha University); Byung Cheol Song (Inha University)* | N/A | N/A |
| SALISA: Saliency-based Input Sampling for Efficient Video Object Detection | Babak Ehteshami Bejnordi (Qualcomm AI Research)*; Amir Ghodrati (Qualcomm AI Research); Fatih Porikli (Qualcomm AI Research); Amirhossein Habibian (Qualcomm AI Research) | N/A | N/A |
| Video Instance Segmentation via Multi-Scale Spatio-Temporal Split Attention Transformer | Omkar Thawakar (MBZUAI)*; Sanath Narayan (Inception Institute of Artificial Intelligence); Jiale Cao (Tianjin University); Hisham Cholakkal (MBZUAI); Rao Muhammad Anwer (MBZUAI/AALTO); Muhammad Haris Khan (Muhammad Bin Zayed University of Artificial Intelligence); Salman Khan (MBZUAI/ANU); Michael Felsberg (Linköping University); Fahad Shahbaz Khan (MBZUAI) | N/A | N/A |
| RankSeg: Adaptive Pixel Classification with Image Category Ranking for Segmentation | Haodi He (University of Science and Technology of China); Yuhui Yuan (Microsoft Research)*; Xiangyu Yue (University of California, Berkeley); Han Hu (Microsoft Research Asia) | N/A | N/A |
| Contextformer: A Transformer with Spatio-Channel Attention for Context Modeling in Learned Image Compression | Ahmet Burakhan Koyuncu (Technical University of Munich)*; Han Gao (Tencent America); Atanas Boev (Huawei Technologies Duesseldorf GmbH); Georgii Gaikov (Huawei Moscow Research Center); Elena Alshina (Huawei Technologies); Eckehard Steinbach (TUM) | N/A | N/A |
| Image Super-Resolution with Deep Dictionary | Shunta Maeda (Navier Inc.)* | N/A | N/A |
| ECO-TR: Efficient Correspondences Finding Via Coarse-to-Fine Refinement | Dongli Tan (Xiamen University)*; Jiang-Jiang Liu (Nankai University); Xingyu Chen (Youtu Lab); Chao Chen (Youtu Laboratory); Ruixin Zhang (Tencent); Yunhang Shen (Xiamen University); Shouhong Ding (Tencent); Rongrong Ji (Xiamen University, China) | N/A | N/A |
| Responsive Listening Head Generation: A Benchmark Dataset and Baseline | Mohan Zhou (Harbin Institute of Technology)*; Yalong Bai (JD AI Research); Wei Zhang (JD AI Research); Ting Yao (JD AI Research); Tiejun Zhao (Harbin Institute of Technology); Tao Mei (AI Research of JD.com) | N/A | N/A |
| WISE: Whitebox Image Stylization by Example-based Learning | Winfried Lötzsch (Merantix Momentum); Max Reimann (Hasso-Plattner-Institute)*; Martin Büßemeyer (Hasso-Plattner-Institut); Amir Semmo (Digital Masterpieces GmbH); Jürgen Döllner (Hasso-Plattner-Institut); Matthias Trapp (Hasso Plattner Institute, University of Potsdam) | N/A | N/A |
| 3D Equivariant Graph Implicit Functions | Yunlu Chen (University of Amsterdam); Basura Fernando (Agency for Science, Technology and Research, ASTAR, Singapore); Hakan Bilen (University of Edinburgh); Matthias Niessner (Technical University of Munich); Efstratios Gavves (University of Amsterdam ) | N/A | N/A |
| AnimeCeleb: Large-Scale Animation CelebHeads Dataset for Head Reenactment | Kangyeol Kim (KAIST)*; Sunghyun Park (KAIST); Jaeseong Lee (KAIST); Sunghyo Chung (Korea University); Junsoo Lee (NAVER WEBTOON Ltd.); Jaegul Choo (Korea Advanced Institute of Science and Technology) | N/A | N/A |
| Towards Scale-Aware, Robust, and Generalizable Unsupervised Monocular Depth Estimation by Integrating IMU Motion Dynamics | Sen Zhang (The University of Sydney); Jing Zhang (The University of Sydney)*; Dacheng Tao (The University of Sydney) | N/A | N/A |
| Dynamic Local Aggregation Network with Adaptive Clusterer for Anomaly Detection | Zhiwei Yang (Xidian University)*; Peng Wu (Xidian University); Jing Liu (Xidian University); Xiaotao Liu (Xidian University) | N/A | N/A |
| Learning Semantic Segmentation from Multiple Datasets with Label Shifts | Dongwan Kim (Seoul National University)*; Yi-Hsuan Tsai (Phiar Technologies); Yumin Suh (NEC Labs America); Masoud Faraki (NEC Labs); Sparsh Garg (NEC Labs America); Manmohan Chandraker (UC San Diego); Bohyung Han (Seoul National University) | N/A | N/A |
| SecretGen: Privacy Recovery on Pre-trained Models via Distribution Discrimination | Zhuowen Yuan (UIUC); Fan Wu (UIUC); Yunhui Long (University of Illinois); Chaowei Xiao (NVIDIA); Bo Li (UIUC)* | N/A | N/A |
| A Kendall Shape Space Approach to 3D Shape Estimation from 2D Landmarks | Martha Paskin (Zuse Institute Berlin); Daniel Baum (Zuse Institute Berlin); Mason N Dean (City University of Hong Kong); Christoph von Tycowicz (Zuse Institute Berlin)* | N/A | N/A |
| Temporally Consistent Transformer for Video Denoising | Mingyang Song (ETH Zurich)*; Yang Zhang (Disney Research Studios); Tunç Aydin (Disney Research) | N/A | N/A |
| Action Quality Assessment with Temporal Parsing Transformer | Yang Bai (Durham University); Desen Zhou (Baidu, Inc.)*; Songyang Zhang (Shanghai AI Laboratory); Jian Wang (Baidu); Errui Ding (Baidu Inc.); Yu Guan (University of Warwick); Yang Long (Durham University); Jingdong Wang (Baidu) | N/A | N/A |
| A study of Pre-training strategies and datasets for facial representation learning | Adrian Bulat (Samsung AI Center, Cambridge)*; Shiyang Cheng (Samsung); Jing Yang (University of Nottingham); Andrew Garbett (Samsung AI Center); Enrique Sanchez (Samsung AI Centre); Georgios Tzimiropoulos (Queen Mary University of London) | N/A | N/A |
| Neural Strands: Learning Hair Geometry and Appearance from Multi-View Images | Radu Alexandru Rosu (University of Bonn); Shunsuke Saito (Facebook); Ziyan Wang (Carnegie Mellon University); Chenglei Wu (Facebook Reality Labs); Sven Behnke (University of Bonn); Giljoo Nam (Facebook Inc.)* | N/A | N/A |
| Conditional Stroke Recovery for Fine-Grained Sketch-Based Image Retrieval | Zhixin Ling (Fudan University)*; Zhen Xing (Fudan University); Jian Zhou (Fudan University); Xiangdong Zhou (Fudan University) | N/A | N/A |
| Generalized Brain Image Synthesis with Transferable Convolutional Sparse Coding Networks | Yawen Huang (Tencent)*; Feng Zheng (SUSTech); Xu Sun (Tencent); Yuexiang Li (Jarvis Lab, Tencent); Ling Shao (Terminus Group); Yefeng Zheng (Tencent) | N/A | N/A |
| Wave-ViT: Unifying Wavelet and Transformers for Visual Representation Learning | Ting Yao (JD AI Research); Yingwei Pan (JD AI Research)*; Yehao Li (JD AI Research); Chong-Wah Ngo (Singapore Management University); Tao Mei (AI Research of JD.com) | N/A | N/A |
| GraphCSPN: Geometry-Aware Depth Completion via Dynamic GCNs | Xin Liu (Tsinghua University)*; Xiaofei Shao (Deptrum); Bo Wang (Deptrum); Ya-Li Li (Tsinghua University); Shengjin Wang (Tsinghua University) | N/A | N/A |
| Revisiting Batch Norm Initialization | Jim Davis (Ohio State University); Logan Frank (Ohio State University)* | N/A | N/A |
| NewsStories: Illustrating articles with visual summaries | Reuben Tan (Boston University)*; Bryan Plummer (Boston University); Kate Saenko (Boston University); J.P. Lewis (Google Research); Avneesh Sud (Google); Thomas Leung (Google) | N/A | N/A |
| Improving Few-Shot Learning through Multi-task Representation Learning Theory | Quentin Bouniot (CEA, LIST)*; Ievgen Redko (Laboratoire Hubert Curien); Romaric Audigier (CEA LIST); Angélique Loesch (CEA LIST); Amaury Habrard (University of St-Etienne, Lab. H. Curien) | N/A | N/A |
| Deep Semantic Statistics Matching (D2SM) Denoising Network | Kangfu Mei (Johns Hopkins University)*; Vishal Patel (Johns Hopkins University); Rui Huang (The Chinese University of Hong Kong, Shenzhen) | N/A | N/A |
| Long-tailed Instance Segmentation using Gumbel Optimized Loss | Konstantinos P Alexandridis (University of Liverpool)*; Jiankang Deng (Imperial College London); Anh Nguyen (University of Liverpool); Shan Luo (University of Liverpool) | N/A | N/A |
| DetMatch: Two Teachers are Better Than One for Joint 2D and 3D Semi-Supervised Object Detection | Jinhyung Park (Carnegie Mellon University)*; Chenfeng Xu (UC Berkeley); Yiyang Zhou (UC Berkeley ); Masayoshi TOMIZUKA (MSC Lab); Wei Zhan (University of California, Berkeley) | N/A | N/A |
| 3D Scene Inference from Transient Histograms | Sacha Jungerman (University of Wisconsin-Madison)*; Atul N Ingle (University of Wisconsin-Madison); Yin Li (University of Wisconsin-Madison); Mohit Gupta (University of Wisconsin-Madison, USA) | N/A | N/A |
| SSBNet: Improving Visual Recognition Efficiency by Adaptive Sampling | Ho Man Kwan (The Hong Kong University of Science and Technology)*; S.H. Song (HKUST) | N/A | N/A |
| Deep 360° Optical Flow Estimation by Multi-Projection Fusion | Yiheng Li (Victoria University of Wellington); Connelly Barnes (Adobe); Kun Huang (Victoria University of Wellington); Fang-Lue Zhang (Victoria University of Wellington)* | N/A | N/A |
| Neural Space-filling Curves | Hanyu Wang (University of Maryland – College Park)*; Kamal Gupta (University of Maryland); Larry Davis (University of Maryland); Abhinav Shrivastava (University of Maryland) | N/A | N/A |
| MFIM: Megapixel Facial Identity Manipulation | Sanghyeon Na (kakaobrain)* | N/A | N/A |
| Objects Can Move: 3D Change Detection by Geometric Transformation Consistency | Aikaterini Adam (National Technical University of Athens)*; Torsten Sattler (Czech Technical University in Prague); Konstantinos Karantzalos (National Technical University of Athens); Tomas Pajdla (Czech Technical University in Prague) | N/A | N/A |
| MUGEN: A Playground for Video-Audio-Text Multimodal Understanding and GENeration | Thomas F Hayes (Meta); Songyang Zhang (University of Rochester)*; Xi Yin (Facebook); Guan Pang (Facebook); Sasha Sheng (Meta Platforms); Harry Yang (Facebook); Songwei Ge (University of Maryland, College Park); Qiyuan Hu (Facebook AI Research); Devi Parikh (Georgia Tech & Facebook AI Research) | N/A | N/A |
| PatchRD: Detail-Preserving Shape Completion by Learning Patch Retrieval and Deformation | Bo Sun (UT Austin)*; Vladimir Kim (Adobe); Qixing Huang (The University of Texas at Austin); Noam Aigerman (Adobe); Siddhartha Chaudhuri (Adobe Research) | N/A | N/A |
| Network Binarization via Contrastive Learning | Yuzhang Shang (Illinois Institute of Technology)*; Dan Xu (The Hong Kong University of Science and Technology); Ziliang Zong (Texas State University); Liqiang Nie (Harbin Institute of Technology (Shenzhen)); Yan Yan (Illinois Institute of Technology) | N/A | N/A |
| Lipschitz Continuity Retained Binary Neural Network | Yuzhang Shang (Illinois Institute of Technology)*; Dan Xu (The Hong Kong University of Science and Technology); Bin Duan (Illinois Institute of Technology); Ziliang Zong (Texas State University); Liqiang Nie (Harbin Institute of Technology (Shenzhen)); Yan Yan (Illinois Institute of Technology) | N/A | N/A |
| Is Geometry Enough for Matching in Visual Localization? | Qunjie Zhou (Technical University of Munich)*; Sérgio Agostinho (Institute for Systems and Robotics, Instituto Superior Técnico, Universidade de Lisboa); Aljosa Osep (TUM Munich); Laura Leal-Taixé (TUM) | N/A | N/A |
| Webly Supervised Concept Expansion for General Purpose Vision Models | Amita Kamath (Allen Institute for Artificial Intelligence); Christopher A Clark (Allen Institute for AI)*; Tanmay Gupta (Allen Institute for Artificial Intelligence); Eric Kolve (Allen AI); Derek Hoiem (University of Illinois at Urbana-Champaign); Aniruddha Kembhavi (Allen Institute for Artificial Intelligence) | N/A | N/A |
| Compositional Human-Scene Interaction Synthesis with Semantic Control | Kaifeng Zhao (ETH Zurich)*; Shaofei Wang (ETH Zurich); Yan Zhang (ETH Zurich); Thabo Beeler (Disney Research \| Studios); Siyu Tang (ETH Zurich) | N/A | N/A |
| MaCLR: Motion-aware Contrastive Learning of Representations for Videos | Fanyi Xiao (Meta); Joseph Tighe (Amazon); Davide Modolo (Amazon)* | N/A | N/A |
| Transformers as Meta-Learners for Implicit Neural Representations | Yinbo Chen (UC San Diego)*; Xiaolong Wang (UCSD) | N/A | N/A |
| RAWtoBit: A Fully End-to-end Camera ISP Network | Wooseok Jeong (Korea University); Seung-Won Jung (Korea University)* | N/A | N/A |
| SpatialDETR: Robust Scalable Transformer-Based 3D Object Detection from Multi-View Camera Images with Global Cross-Sensor Attention | Simon Doll (University of Tübingen)*; Richard Schulz (Mercedes Benz); Lukas Schneider (Daimler); Viviane Benzin (Mercedes-Benz AG); Markus Enzweiler (Esslingen University of Applied Sciences); Hendrik P. A. Lensch (University of Tübingen) | N/A | N/A |
| 3D Face Reconstruction with Dense Landmarks | Erroll Wood (Microsoft)*; Tadas Baltrusaitis (Microsoft); Charlie Hewitt (Microsoft); Matthew A Johnson (Microsoft); Jingjing Shen (Microsoft); Nikola Milosavljevic (Microsoft); Daniel S Wilde (Microsoft); Stephan J Garbin (University College London); Toby Sharp (Microsoft); Ivan Stojiljkovic (Microsoft); Tom Cashman (Microsoft); Julien Valentin (Microsoft) | N/A | N/A |
| SWFormer: Sparse Window Transformer for 3D Object Detection in Point Clouds | Pei Sun (Waymo)*; Mingxing Tan (Waymo); Weiyue Wang (Waymo); Chenxi Liu (Waymo); Fei Xia (Waymo); Zhaoqi Leng (Waymo); Dragomir Anguelov (Waymo) | N/A | N/A |
| Incomplete Multi-view Domain Adaptation via Channel Enhancement and Knowledge Transfer | Haifeng Xia (Tulane University)*; Pu Wang (MERL); Zhengming Ding (Tulane University) | N/A | N/A |
| Exposure-Aware Dynamic Weighted Learning for Single-Shot HDR Imaging | An Gia Vien (Dongguk University); Chul Lee (Dongguk University)* | N/A | N/A |
| Seeing through a Black Box: Toward High-Quality Terahertz Imaging via Subspace-and-Attention Guided Restoration | Weng-Tai Su (National Tsing Hua University); Yi-Chun Hung (University of California, Los Angeles); Po-Jen Yu (National Tsing Hua University); Shang-Hua Yang (National Tsing Hua University); Chia-Wen Lin (National Tsing Hua University)* | N/A | N/A |
| SPViT: Enabling Faster Vision Transformers via Soft Token Pruning | Zhenglun Kong (Northeastern University)*; Peiyan Dong (Northeastern University); Xiaolong Ma (Clemson University); Xin Meng (Peking University); Wei Niu (William & Mary); Mengshu Sun (Northeastern University); Xuan Shen (Northeastern University); Geng Yuan (Northeastern University); Bin Ren (William & Mary); Hao Tang (ETH Zurich); Minghai Qin (Western Digital Research); Yanzhi Wang (Northeastern University) | N/A | N/A |
| Soft Masking for Cost-Constrained Channel Pruning | Ryan Humble (Stanford University)*; Maying Shen (NVIDIA); Jorge Albericio Latorre (NVIDIA); Eric Darve (Stanford University); Jose M. Alvarez (NVIDIA) | N/A | N/A |
| Ensemble Learning Priors Driven Deep Unfolding for Scalable Snapshot Compressive Imaging | Chengshuai Yang (Westlake University)*; Shiyu Zhang (Westlake University); Xin Yuan (Westlake University) | N/A | N/A |
| A Simple Baseline for Open Vocabulary Semantic Segmentation with Pre-trained Vision-language Model | Mengde Xu (Huazhong University of Science and Tech.); Zheng Zhang (MSRA)*; Fangyun Wei (Microsoft Research Asia); Yutong Lin (Xi’an Jiaotong University); Yue Cao (Microsoft Research); Han Hu (Microsoft Research Asia); Xiang Bai (Huazhong University of Science and Technology) | N/A | N/A |
| Triangle Attack: A Query-efficient Decision-based Adversarial Attack | Xiaosen Wang (Huazhong University of Science and Technology)*; Zeliang Zhang (Huazhong University of Sci. & Technology); Kangheng Tong (Huazhong University of Science and Technology); Dihong Gong (Tencent AI Lab); Kun He (Huazhong University of Science and Technology); Zhifeng Li (Tencent AI Lab); Wei Liu (Tencent) | N/A | N/A |
| Tailoring Self-Supervision for Supervised Learning | WonJun Moon (Sungkyunkwan University)*; Jihwan Kim (Sungkyunkwan University); Jae-Pil Heo (Sungkyunkwan University) | N/A | N/A |
| Difficulty-Aware Simulator for Open Set Recognition | WonJun Moon (Sungkyunkwan University)*; Jun ho Park (Sungkyunkwan University); Hyun Seok Seong (Sungkyunkwan University); Cheol-Ho Cho (Sungkyunkwan University); Jae-Pil Heo (Sungkyunkwan University) | N/A | N/A |
| Non-Uniform Step Size Quantization for Accurate Post-Training Quantization | Sangyun Oh (UNIST)*; Hyeonuk Sim (UNIST); Jounghyun Kim (UNIST); Jongeun Lee (UNIST) | N/A | N/A |
| FedVLN: Privacy-preserving Federated Vision-and-Language Navigation | Kaiwen Zhou (University of California, Santa Cruz)*; Xin Eric Wang (University of California, Santa Cruz) | N/A | N/A |
| Data-free Backdoor Removal Based on Channel Lipschitzness | Runkai Zheng (Chinese University of Hong Kong (Shenzhen)); Rongjun Tang (The Chinese University of Hong Kong, Shenzhen); Jianze Li (Shenzhen Research Institute of Big Data, The Chinese University of Hong Kong, Shenzhen); Li Liu (Shenzhen Research Institute of Big Data, the chinese university of hong kong shenzhen)* | N/A | N/A |
| SuperTickets: Drawing Task-Agnostic Lottery Tickets from Supernets via Jointly Architecture Searching and Parameter Pruning | Haoran You (Rice University)*; Baopu Li (Baidu ); Zhanyi Sun (Rice University); Xu Ouyang (Rice University); Yingyan Lin (Rice University) | N/A | N/A |
| PCR-CG: Point Cloud Registration via Deep Explicit Color and Geometry | Yu Zhang (Shanghai Jiaotong University )*; Yu Junle (HangZhou dianzi university); Xiaolin Huang (Shanghai Jiao Tong University); Wenhui Zhou (Hangzhou Dianzi University); Ji Hou (Meta Reality Labs) | N/A | N/A |
| DistPro: Searching A Fast Knowledge Distillation Process via Meta Optimization | Xueqing Deng (University of California, Merced); Dawei Sun (University of Illinois Urbana-Champaign); Shawn Newsam (UC Merced); Peng Wang (Bytedance USA LLC.)* | N/A | N/A |
| Tomography of Turbulence Strength Based on Scintillation Imaging | Nir Shaul (Technion)*; Schechner Yoav (Technion) | N/A | N/A |
| Realistic Blur Synthesis for Learning Image Deblurring | Jaesung Rim (POSTECH); Geonung Kim (POSTECH); Jungeon Kim (POSTECH); Junyong Lee (POSTECH); Seungyong Lee (POSTECH); Sunghyun Cho (POSTECH)* | N/A | N/A |
| GLAMD: Global and Local Attention Mask Distillation for Object Detectors | YounHo Jang (Kyung Hee University); Wheemyung Shin (Kyung Hee University); Jinbeom Kim (Sungkyunkwan University (SKKU)); Sung-Ho Bae (Kyung Hee University)*; Simon S Woo (Sungkyunkwan University (SKKU)) | N/A | N/A |
| Meta-GF: Training Dynamic-Depth Neural Networks Harmoniously | Yi Sun (National University of Defense Technology); Jian Li (NUDT); Xin Xu (National University of Defense Technology)* | N/A | N/A |
| CXR Segmentation by AdaIN-based Domain Adaptation and Knowledge Distillation | Yujin Oh (Kim Jaechul Graduate School of AI, KAIST, Korea); Jong Chul Ye (Kim Jaechul Graduate School of AI, KAIST, Korea)* | N/A | N/A |
| Emotion-aware Multi-view Contrastive Learning for Facial Emotion Recognition | Daeha Kim (Inha University); Byung Cheol Song (Inha University)* | N/A | N/A |
| FCAF3D: Fully Convolutional Anchor-Free 3D Object Detection | Danila Rukhovich (Samsung AI Center Moscow); Anna Vorontsova (Samsung AI Center)*; Anton S. Konushin (Samsung AI Center Moscow) | N/A | N/A |
| Video Dialog as Conversation about Objects Living in Space-Time | Hoang-Anh Pham (Deakin University)*; Thao Minh Le (Deakin University); Vuong Le (Deakin University); Tu Minh Phuong (Posts and Telecommunications Institute of Technology); Truyen Tran (Deakin University) | N/A | N/A |
| Few-Shot Class-Incremental Learning from an Open-Set Perspective | Can Peng (the University of Queensland)*; Kun Zhao (Sullivan Nicolaides Pathology); Tianren Wang (The University of Queensland); Meng Li (The University of Queensland); Brian C Lovell (University of Queensland) | N/A | N/A |
| ML-BPM: Multi-teacher Learning with Bidirectional Photometric Mixing for Open Compound Domain Adaptation in Semantic Segmentation | Fei Pan (KAIST)*; Sungsu Hur (KAIST); Seokju Lee (KENTECH); Junsik Kim (Harvard University); In So Kweon (KAIST) | N/A | N/A |
| DRCNet: Dynamic Image Restoration Contrastive Network | Fei Li (China Agricultural University)*; Lingfeng Shen (Tencent AI Lab); YANG MI (China Agricultural University); Zhenbo Li (China Agricultural University) | N/A | N/A |
| Order Learning Using Partially Ordered Data via Chainization | Seon-Ho Lee (MCL, Korea University); Chang-Su Kim (Korea university)* | N/A | N/A |
| Style Your Hair: Latent Optimization for Pose-Invariant Hairstyle Transfer via Local-Style-Aware Hair Alignment | Chaeyeon Chung ( Korea Advanced Institute of Science and Technology)*; Taewoo Kim (Korea Advanced Institute of Science and Technology ); Yoonseo Kim (KAIST); Sunghyun Park (KAIST); Kangyeol Kim (KAIST); Jaegul Choo (Korea Advanced Institute of Science and Technology) | N/A | N/A |
| High-Resolution Virtual Try-On with Misalignment and Occlusion-Handled Conditions | SangYun Lee (Soongsil University); Gyojung Gu (Korea Advanced Institute of Science and Technology)*; Sunghyun Park (KAIST); Seunghwan Choi (Korea Advanced Institute of Science and Technology ); Jaegul Choo (Korea Advanced Institute of Science and Technology) | N/A | N/A |
| Zero-Shot Learning for Reflection Removal of Single 360-Degree Image | Byeong-Ju Han (Ulsan National Institute of Science and Technology ); Jae-Young Sim (Ulsan National Institute of Science and Technology)* | N/A | N/A |
| A Codec Information Assisted Framework for Efficient Compressed Video Super-Resolution | Hengsheng Zhang (Shanghai Jiao Tong University)*; Xueyi Zou (Huawei Noah’s Ark Lab); Jiaming Guo (Huawei Noah’s Ark Lab); Youliang Yan (Huawei Noah’s Ark Lab); Rong Xie (Shanghai Jiao Tong University); Li Song (Shanghai Jiao Tong University) | N/A | N/A |
| Towards Ultra Low Latency Spiking Neural Networks for Vision and Sequential Tasks Using Temporal Pruning | Sayeed Shafayet Chowdhury (Purdue University)*; Nitin Rathi (Purdue University); Kaushik Roy (Purdue University) | N/A | N/A |
| MimicME: A Large Scale Diverse 4D Database for Facial Expression Analysis | Athanasios Papaioannou (Huawei)*; Baris Gecer (Huawei); Shiyang Cheng (Samsung); Grigorios Chrysos (EPFL); Jiankang Deng (Imperial College London); Eftychia Fotiadou (Imperial College London); Christos Kampouris (ApolloXR); Dimitrios Kollias (Queen Mary University London); Stylianos Moschoglou (Huawei Technologies Co. Ltd); Kritaphat Songsri-In (Imperial College London); Stylianos Ploumpis (Huawei Technologies Co. Ltd); George Trigeorgis (Imperial College London ); Panagiotis Tzirakis (Imperial College London); Evangelos Ververas (Imperial College London); Yuxiang Zhou (Deepmind, Google); Allan Ponniah (NHS); Anastasios Roussos (Institute of Computer Science, Foundation for Research and Technology Hellas); Stefanos Zafeiriou (Imperial College London) | N/A | N/A |
| Black-Box Dissector: Towards Erasing-based Hard-Label Model Stealing Attack | Yixu Wang (Xiamen University)*; Jie Li (Xiamen University); Hong Liu (National Institute of Informatics ); Yan Wang (Pinterest); Yongjian Wu (Tencent Technology (Shanghai) Co.,Ltd); Feiyue Huang (Tencent); Rongrong Ji (Xiamen University, China) | N/A | N/A |
| Video Anomaly Detection by Solving Decoupled Spatio-Temporal Jigsaw Puzzles | Guodong Wang (Beihang University)*; Yunhong Wang (State Key Laboratory of Virtual Reality Technology and System, Beihang University, Beijing 100191, China); Jie Qin (Nanjing University of Aeronautics and Astronautics); Dongming Zhang ( National Computer Network Emergency Response Technical Team/Coordination Center of China ); Xiuguo bao (National Computer Network Emergency Response Technical Team/Coordination Center of China); Di Huang (Beihang University, China) | N/A | N/A |
| Towards Accurate Network Quantization with Equivalent Smooth Regularizers | Kirill Solodskikh (Huawei Noah’s Ark Lab, MSU)*; Vladimir Chikin (Huawei Noah’s Ark Lab); Ruslan Aydarkhanov (Huawei Noah’s Ark Lab); Dehua Song (Huawei Noah’s Ark Lab); Irina Zhelavskaya (Skolkovo Institute of Science and Technology (Skoltech)); Jiansheng Wei (Huawei Technologies Co. Ltd.) | N/A | N/A |
| DiffuseMorph: Unsupervised Deformable Image Registration Using Diffusion Model | Boah Kim (KAIST)*; Inhwa Han (KAIST); Jong Chul Ye (Kim Jaechul Graduate School of AI, KAIST, Korea) | N/A | N/A |
| An Impartial Take to the CNN vs Transformer Robustness Contest | Francesco Pinto (University of Oxford)*; Philip Torr (University of Oxford); Puneet Dokania (University of Oxford) | N/A | N/A |
| CODER: Coupled Diversity-Sensitive Momentum Contrastive Learning for Image-Text Retrieval | Haoran Wang (Baidu)*; Dongliang He (Baidu); Wenhao Wu (Baidu); Boyang Xia (Institute of Computing Technology, Chinese Academy of Science); Min Yang (Baidu); Fu Li (Baidu); Yunlong Yu (Zhejiang University); Zhong Ji (Tianjin University); Errui Ding (Baidu Inc.); Jingdong Wang (Baidu) | N/A | N/A |
| Weakly Supervised 3D Scene Segmentation with Region-Level Boundary Awareness and Instance Discrimination | Kangcheng LIU (The Chinese University of Hong Kong)*; Yuzhi Zhao (City University of Hong Kong); Qiang Nie (Tencent Youtu Lab); Zhi Gao (NUS); Ben M. Chen (Chinese University of Hong Kong) | N/A | N/A |
| FOSTER: Feature Boosting and Compression for Class-Incremental Learning | Fu-Yun Wang (Nanjing University)*; Da-Wei Zhou (Nanjing University); Han-Jia Ye (Nanjing University); De-Chuan Zhan (Nanjing University) | N/A | N/A |
| Delving into Universal Lesion Segmentation: Method, Dataset, and Benchmark | Yu Qiu (Nankai University)*; Jing Xu (Nankai University) | N/A | N/A |
| Explicit Model Size Control and Relaxation via Smooth Regularization for Mixed-Precision Quantization | Vladimir Chikin (Huawei Noah’s Ark Lab)*; Kirill Solodskikh (Huawei Noah’s Ark Lab, MSU); Irina Zhelavskaya (Skolkovo Institute of Science and Technology (Skoltech)) | N/A | N/A |
| Large scale Real-world Multi Person Tracking | Bing Shuai (Amazon)*; Alessandro Bergamo (Amazon); Uta Büchler (Amazon); Andrew G Berneshawi (Amazon); Alyssa Boden (Amazon Web Services); Joseph Tighe (Amazon) | N/A | N/A |
| Class-agnostic Object Detection with Multi-modal Transformer | Muhammad Maaz (MBZUAI)*; Hanoona Abdul Rasheed (MBZUAI); Salman Khan (MBZUAI/ANU); Fahad Shahbaz Khan (MBZUAI); Rao Muhammad Anwer (MBZUAI/AALTO); Ming-Hsuan Yang (University of California at Merced) | N/A | N/A |
| Language-Grounded Indoor 3D Semantic Segmentation in the Wild | Dávid Rozenberszki (Technische Universitat Munchen)*; Or Litany (Stanford); Angela Dai (Technical University of Munich) | N/A | N/A |
| Injecting 3D Perception of Controllable NeRF-GAN into StyleGAN for Editable Portrait Image Synthesis | Jeong-gi Kwak (Korea University); Yuanming Li (Korea University); Dongsik Yoon (Korea University); Donghyeon Kim (Korea university); David K Han (Drexel University); Hanseok Ko (Korea University)* | N/A | N/A |
| BASQ: Branch-wise Activation-clipping Search Quantization for Sub-4-bit Neural Networks | Han-Byul Kim (Seoul National University)*; Eunhyeok Park (POSTECH); Sungjoo Yoo (Seoul National University) | N/A | N/A |
| AdaNeRF: Adaptive Sampling for Real-time Rendering of Neural Radiance Fields | Andreas Kurz (Graz University of Technology)*; Thomas Neff (Graz University of Technology); Zhaoyang Lv (Facebook); Michael Zollhöfer (Facebook Reality Labs); Markus Steinberger (Graz University of Technology) | N/A | N/A |
| Neural Light Field Estimation for Street Scenes with Differentiable Virtual Object Insertion | Zian Wang (University of Toronto)*; Wenzheng Chen (University of Toronto); David Acuna (University of Toronto, NVIDIA); Jan Kautz (NVIDIA); Sanja Fidler (University of Toronto, NVIDIA) | N/A | N/A |
| Tree Structure-Aware Few-Shot Image Classification via Hierarchical Aggregation | Min Zhang (Zhejiang University)*; Siteng Huang (Westlake University); Wenbin Li (Nanjing University); Donglin Wang (Westlake University) | N/A | N/A |
| PoseScript: 3D Human Poses from Natural Language | Ginger Delmas (NAVER LABS EUROPE)*; Philippe Weinzaepfel (NAVER LABS Europe); Thomas LUCAS (Naver); Francesc Moreno (IRI); Gregory Rogez (NAVER LABS Europe) | N/A | N/A |
| Learning Energy-Based Models With Adversarial Training | Xuwang Yin (University of Virginia)*; Shiying Li (University of North Carolina, Chapel Hill); Gustavo Rohde (University of Virginia) | N/A | N/A |
| You Already Have It: A Generator-Free Low-Precision DNN Training Framework using Stochastic Rounding | Geng Yuan (Northeastern University)*; Sung-En Chang (Northeastern University); Qing Jin (Northeastern University); Alec Lu (Simon Fraser University ); Yanyu Li (Northeastern University); Yushu Wu (Northeastern University); Zhenglun Kong (Northeastern University); Yanyue Xie (Northeastern University); Peiyan Dong (Northeastern University); Minghai Qin (Western Digital Research); Xiaolong Ma (Clemson University); Xulong Tang (University of Pittsburgh); Zhenman Fang (Simon Fraser University); Yanzhi Wang (Northeastern University) | N/A | N/A |
| TIPS: Text-Induced Pose Synthesis | Prasun Roy (University of Technology Sydney)*; Subhankar Ghosh (University of Technology Sydney ); Saumik Bhattacharya (Indian Institute of Technology Kharagpur ); Umapada Pal (Indian Statistical Institute, Kolkata); Michael Blumenstein (University of Technology Sydney) | N/A | N/A |
| Unsupervised High-Fidelity Facial Texture Generation and Reconstruction | Ron Slossberg (Technion)*; Ibrahim Jubran (The University of Haifa); Ron Kimmel (Technion) | N/A | N/A |
| Addressing Heterogeneity in Federated Learning via Distributional Transformation | Haolin Yuan (Johns Hopkins University); Bo Hui (Johns Hopkins University); Yuchen Yang (Johns Hopkins University); Philippe Burlina (JHU/APL/CS/SOM); Neil Zhenqiang Gong (Duke University); Yinzhi Cao (JHU)* | N/A | N/A |
| Adversarial Label Poisoning Attack on Graph Neural Networks via Label Propagation | Ganlin Liu (The University of Liverpool)*; Xiaowei Huang (Liverpool University); Xinping Yi (University of Liverpool) | N/A | N/A |
| Approximate Discrete Optimal Transport Plan with Auxiliary Measure Method | Dongsheng An (Stony Brook University)*; Na Lei (Dalian University of Technology); Xianfeng GU (Stony Brook University) | N/A | N/A |
| Visual Knowledge Tracing | Neehar Kondapaneni (Caltech)*; Pietro Perona (California Institute of Technology); Oisin Mac Aodha (University of Edinburgh) | N/A | N/A |
| Semi-Leak: Membership Inference Attacks Against Semi-supervised Learning | Xinlei He (CISPA Helmholtz Center for Information Security)*; Hongbin Liu (Duke University); Neil Zhenqiang Gong (Duke University); Yang Zhang (CISPA Helmholtz Center for Information Security) | N/A | N/A |
| DProST: Dynamic Projective Spatial Transformer Network for 6D Pose Estimation | Jaewoo Park (Seoul National University); Nam Ik Cho (Seoul National University)* | N/A | N/A |
| Accurate Detection of Proteins in Cryo-Electron Tomograms from Sparse Labels | Qinwen Huang (Duke University)*; Alberto Bartesaghi (Duke University); Ye Zhou (Duke University); Hsuan-Fu Liu (Duke University) | N/A | N/A |
| Subspace Diffusion Generative Models | Bowen Jing (Massachusetts Institute of Technology)*; Gabriele Corso (MIT); Renato Berlinghieri (MIT); Tommi Jaakkola (MIT) | N/A | N/A |
| Multi-modal Text Recognition Networks: Interactive Enhancements between Visual and Semantic Features | Byeonghu Na (KAIST); Yoonsik Kim (Clova AI Research, NAVER Corp.); Sungrae Park (Upstage AI Research, Upstage AI)* | N/A | N/A |
| Inductive and Transductive Few-Shot Video Classification via Appearance and Temporal Alignments | Khoi D. Nguyen (VinAI Research)*; Quoc-Huy Tran (Retrocausal, Inc.); Khoi Nguyen (VinAI Research); Binh-Son Hua (VinAI Research); Rang NGUYEN (VinAI Research) | N/A | N/A |
| Learning Long-Term Spatial-Temporal Graphs for Active Speaker Detection | Kyle Min (Intel Labs); Sourya Roy (University of California, Riverside); Subarna Tripathi (Intel Labs)*; Tanaya Guha (University of Glasgow); Somdeb Majumdar (Intel Labs) | N/A | N/A |
| Relative Contrastive Loss for Unsupervised Representation Learning | Shixiang Tang (The University of Sydney)*; Feng Zhu (University of Science and Technology of China); Lei Bai (Shanghai AI Laboratory); Rui Zhao (SenseTime Group Limited); Wanli Ouyang (The University of Sydney) | N/A | N/A |
| Personalized Education: Blind Knowledge Distillation | Xiang Deng (State University of New York at Binghamton)*; Jian Zheng (Amazon); Zhongfei Zhang (Binghamton University) | N/A | N/A |
| Fast Two-View Motion Segmentation Using Christoffel Polynomials | Bengisu Ozbay (Northeastern University); Octavia Camps (Northeastern University); Mario Sznaier (Northeastern University)* | N/A | N/A |
| Real Spike: Learning Real-valued Spikes for Spiking Neural Networks | Yufei Guo (The Second Academy of China Aerospace Science and Industry Corporation)*; Liwen Zhang (X Lab, the Second Academy of CASIC, Beijing); Yuanpei Chen (X LAB,The Second Academy of CASIC,Beijing); Xinyi Tong (The Second Academy of China Aerospace Science and Industry Corporation); Xiaode Liu (X Lab, The Second Academy of China Aerospace Science and Industry Corporation); YingLei Wang (CASIC); Xuhui Huang (X Lab, The Second Academy of CASIC); Zhe Ma (Xlab, the Second Academy of CASIC, Beijing) | N/A | N/A |
| Language-Driven Artistic Style Transfer | Tsu-Jui Fu (UCSB)*; Xin Eric Wang (University of California, Santa Cruz); William Yang Wang (UC Santa Barbara) | N/A | N/A |
| FedLTN: Federated Learning for Sparse and Personalized Lottery Ticket Networks | Vaikkunth Mugunthan (DynamoFL)*; Eric Lin (DynamoFL); Vignesh Gokul (University of California San Diego); Christian Lau (DynamoFL); Lalana Kagal (MIT); Steve Pieper (Isomics, Inc.) | N/A | N/A |
| Transformer with Implicit Edges for Particle-based Physics Simulation | Yidi Shao (Nanyang Technological University)*; Chen Change Loy (Nanyang Technological University); Bo Dai (Shanghai AI Lab) | N/A | N/A |
| Improving the Perceptual Quality of 2D Animation Interpolation | Shuhong Chen (University of Maryland – College Park)*; Matthias Zwicker (University of Maryland) | N/A | N/A |
| Towards Open-vocabulary Scene Graph Generation with Prompt-based Finetuning | Tao He (Monash University); Lianli Gao (The University of Electronic Science and Technology of China); Jingkuan Song (UESTC); Yuan-Fang Li (Monash University)* | N/A | N/A |
| S3C: Self-Supervised Stochastic Classifiers for Few-Shot Class-Incremental Learning | Jayateja Kalla (Indian Institute of Science); Soma Biswas (Indian Institute of Science, Bangalore)* | N/A | N/A |
| Entry-Flipped Transformer for Inference and Prediction of Participant Behavior | BO HU (Nanyang Technological University)*; Tat-Jen Cham (Nanyang Technological University) | N/A | N/A |
| OpenLDN: Learning to Discover Novel Classes for Open-World Semi-Supervised Learning | Mamshad Nayeem Rizve (University of Central Florida)*; Navid Kardan (University of Central Florida); Salman Khan (MBZUAI/ANU); Fahad Shahbaz Khan (MBZUAI); Mubarak Shah (University of Central Florida) | N/A | N/A |
| Fine-grained Fashion Representation Learning by Online Deep Clustering | Yang Jiao (Amazon)*; Ning Xie (Amazon); Yan Gao (Amazon); Chien-Chih Wang (Amazon); Yi Sun (Amazon) | N/A | N/A |
| Perspective Phase Angle Model for Polarimetric 3D Reconstruction | Guangcheng Chen (Guangdong University of Technology)*; Li He (Southern University of Science and Technology); Yisheng Guan (Guangdong University of Technology); Hong Zhang (University of Alberta) | N/A | N/A |
| Selective TransHDR: Transformer-based selective HDR Imaging using Ghost Region Mask | Jou Won Song (Sogang University); Ye-In Park (Sogang University); Kyeongbo Kong (Pukyong National University); Jaeho Kwak (Sogang University); Suk-Ju Kang (Sogang University)* | N/A | N/A |
| 3D Interacting Hand Pose Estimation by Hand De-occlusion and Removal | Hao Meng (BeiHang University); Sheng Jin (The University of Hong Kong)*; Wentao Liu (Sensetime); Chen Qian (SenseTime); Mengxiang Lin (Beihang University); Wanli Ouyang (The University of Sydney); Ping Luo (The University of Hong Kong) | N/A | N/A |
| Recover Fair Deep Classification Models via Altering Pre-trained Structure | Yanfu Zhang (University of Pittsburgh)*; Shangqian Gao (University of Pittsburgh); Heng Huang (University of Pittsburgh) | N/A | N/A |
| Improving Fine-Grained Visual Recognition in Low Data Regimes via Self-Boosting Attention Mechanism | Yangyang Shu (University of Adelaide); Lingqiao Liu (University of Adelaide)*; Baosheng Yu (The University of Sydney); Haiming Xu (The University of Adelaide) | N/A | N/A |
| VSA: Learning Varied-Size Window Attention in Vision Transformers | Qiming Zhang (The University of Sydney)*; YUFEI XU (University of sydney); Jing Zhang (The University of Sydney); Dacheng Tao (JD.com) | N/A | N/A |
| PoseGPT: Quantization-based 3D Human Motion Generation and Forecasting | Thomas LUCAS (Naver)*; Fabien Baradel (Naver Labs Europe); Philippe Weinzaepfel (NAVER LABS Europe); Gregory Rogez (NAVER LABS Europe) | N/A | N/A |
| CAViT: Contextual Alignment Vision Transformer for Video Object Re-identification | jinlin wu (Institute of Automation, Chinese Academy of Sciences, Beijing, China)*; He Lingxiao (nlpr,cripac); Wu Liu (AI Research of JD.com); Yang Yang (Institute of Automation, Chinese Academy of Sciences); Zhen Lei (NLPR, CASIA, China); Tao Mei (AI Research of JD.com); Stan Z. Li (Westlake University) | N/A | N/A |
| Learning Series-Parallel Lookup Tables for Efficient Image Super-Resolution | Cheng Ma (Tsinghua University); Jingyi Zhang (Tsinghua University); Jie Zhou (Tsinghua University); Jiwen Lu (Tsinghua University)* | N/A | N/A |
| Frozen CLIP Models are Efficient Video Learners | Ziyi Lin (The Chinese University of Hong Kong)*; Shijie Geng (Rutgers University); Renrui Zhang (Shanghai AI Lab); Peng Gao (Chinese university of hong kong); Gerard de Melo (Hasso Plattner Institute); Xiaogang Wang (Chinese University of Hong Kong, Hong Kong); Jifeng Dai (SenseTime); Yu Qiao (Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences); Hongsheng Li (The Chinese University of Hong Kong) | N/A | N/A |
| Deforming Radiance Fields with Cages | Tianhan Xu (The University of Tokyo)*; Tatsuya Harada (The University of Tokyo / RIKEN) | N/A | N/A |
| GeoAug: Data Augmentation for Few-Shot NeRF with Geometry Constraints | Di Chen (Alibaba Group)*; Yu Liu (Alibaba Group); Lianghua Huang (Alibaba Group); bin wang (alibaba group); Pan Pan (Alibaba Group) | N/A | N/A |
| DoodleFormer: Creative Sketch Drawing with Transformers | Ankan Kumar Bhunia (MBZUAI)*; Salman Khan (MBZUAI/ANU); Hisham Cholakkal (MBZUAI); Rao Muhammad Anwer (MBZUAI/AALTO); Fahad Shahbaz Khan (MBZUAI); Jorma Laaksonen (Aalto University); Michael Felsberg (Linköping University) | N/A | N/A |
| Implicit Neural Representations for Variable Length Human Motion Generation | Pablo Alberto Cervantes Baque (Tokyo Institute of Technology)*; Yusuke Sekikawa (Denso IT Laboratory); Ikuro Sato (Tokyo Institute of Technology / Denso IT Laboratory); Koichi SHINODA (Tokyo Institute of Technology) | N/A | N/A |
| FLEX: Extrinsic Parameters-free Multi-view 3D Human Motion Reconstruction | Brian Gordon (Tel Aviv University); Sigal Raab (Tel Aviv University)*; Guy Azov (Tel Aviv University); Raja Giryes (Tel Aviv University); Danny Cohen-Or (Tel Aviv University) | N/A | N/A |
| Pairwise Contrastive Learning Network for Action Quality Assessment | Mingzhe Li (Huaqiao University); Hong-Bo Zhang (Huaqiao University)*; Qing Lei (Huaqiao University); Zongwen Fan (Huaqiao University); Jinghua Liu (Huaqiao University); Ji-Xiang Du (Huaqiao University) | N/A | N/A |
| Large-displacement 3D Object Tracking with Hybrid Non-local Optimization | Xuhui Tian (Shandong University)*; Xinran Lin (Shandong University); Fan Zhong (Shandong University); Xueying Qin (Shandong University) | N/A | N/A |
| Learning Object Placement via Dual-path Graph Completion | Siyuan Zhou (Shanghai Jiao Tong University)*; Liu Liu (Shanghai Jiao Tong University); Li Niu (Shanghai Jiao Tong University); Liqing Zhang (Shanghai Jiao Tong University) | N/A | N/A |
| Unbiased Manifold Augmentation for Coarse Class Subdivision | Baoming Yan (Alibaba Group)*; KE GAO (alibaba-inc); Bo Gao (Alibaba Group); Lin Wang (Alibaba-inc); Jiang Yang (Alibaba Group); Xiaobo Li (Alibaba) | N/A | N/A |
| Rethinking Video Rain Streak Removal: A New Synthesis Model and A Deraining Network with Video Rain Prior | Shuai Wang ( College of Intelligence and Computing, Tianjin University); Lei Zhu (The Hong Kong University of Science and Technology (Guangzhou))*; Huazhu Fu (IHPC, ASTAR); Jing Qin (The Hong Kong Polytechnic University); Carola-Bibiane B Schönlieb (Cambridge University); Wei Feng (School of Computer Science and Technology, Tianjin University); Song Wang (University of South Carolina) | N/A | N/A |
| Expanded Adaptive Scaling Normalization for End to End Image Compression | Chajin Shin (Yonsei University)*; Hyeongmin Lee (Yonsei University ); Hanbin Son (Yonsei Univ.); Sangjin Lee (Yonsei University); Dogyoon Lee (Yonsei University); Sangyoun Lee (Yonsei University) | N/A | N/A |
| Embedding contrastive unsupervised features to cluster in- and out-of-distribution noise in corrupted image datasets | Paul Albert (Insight Centre for Data Analytics (DCU))*; Eric Arazo (Insight Centre for Data Analytics (DCU)); Noel O Connor (Home); Kevin McGuinness (DCU) | N/A | N/A |
| Filter Pruning via Feature Discrimination in Deep Neural Networks | Zhiqiang He (Zhejiang University of Science and Technology)*; Yaguan QIAN (Zhejiang University of Science and Technology); Yuqi Wang (Zhejiang University of Science and Technology); Bin WANG (Network and Information Security Laboratory of Hangzhou Hikvision Digital Technology Co.); Xiaohui Guan (Zhejiang University of Water Resources and Electric Power); Zhaoquan Gu (Guangzhou University); Xiang Ling (Institute of Software, Chinese Academy of Sciences); Shaoning Zeng (Yangtze Delta Region Institute (Huzhou), University of Electronic Science and Technology of China); Haijiang Wang (Zhejiang University of Science and Technology); Wujie Zhou (Zhejiang University of Science and Technology) | N/A | N/A |
| VoViT: Low Latency Graph-based Audio-Visual Voice Separation Transformer | Juan Felipe Montesinos (Universitat Pompeu Fabra)*; Venkatesh Shenoy Kadandale (Universitat Pompeu Fabra); Gloria Haro (Universitat Pompeu Fabra) | N/A | N/A |
| SGBANet: Semantic GAN and Balanced Attention Network for Arbitrarily Oriented Scene Text Recognition | Dajian Zhong (East China Normal University)*; Shujing Lv (East China Normal University); Palaiahnakote Shivakumara (University of Malaya); Bing Yin (IFLYTEK Co.,Ltd); Jiajia Wu (IFLYTEK Co.,Ltd); Umapada Pal (Indian Statistical Institute, Kolkata); Yue Lu (East China Normal University) | N/A | N/A |
| DenseHybrid: Hybrid Anomaly Detection for Dense Open-set Recognition | Matej Grcić (University of Zagreb, Faculty of Electrical Engineering and Computing)*; Petra Bevandić (Faculty of Electrical Engineering and Computing); Sinisa Segvic (UniZg-FER) | N/A | N/A |
| D2-TPred: Discontinuous Dependency for Trajectory Prediction under Traffic Lights | Yuzhen Zhang (Zhengzhou University); Wentong Wang (Zhengzhou University); weizhi guo (zhengzhou university); Pei Lv (Zhengzhou University)*; Mingliang Xu (Zhengzhou University); Wei Chen (State Key Lab of CAD&CG, Zhejiang University); Dinesh Manocha (University of Maryland at College Park) | N/A | N/A |
| Where in the World is this Image? Transformer-based Geo-localization in the Wild | Shraman Pramanick (Johns Hopkins University)*; Ewa M Nowara (Meta Reality Labs); Joshua Gleason (Univ of Maryland); Carlos Castillo (Johns Hopkins University); Rama Chellappa (Johns Hopkins University) | N/A | N/A |
| MODE: Multi-view Omnidirectional Depth Estimation with 360-degree Cameras | Ming Li (NanJing University)*; Xueqian Jin (Nanjing University); Xuejiao Hu (Nanjing University); Jingzhao Dai (Nanjing University); Sidan Du (Nanjing University); Yang Li (NanJing University) | N/A | N/A |
| NashAE: Disentangling Representations through Adversarial Covariance Minimization | Eric C Yeats (Duke University)*; Frank Liu (Oak Ridge National Lab); David Womble (Oak Ridge National Laboratory); Hai Li (Duke University) | N/A | N/A |
| Rethinking Confidence Calibration for Failure Prediction | Fei Zhu (Institute of Automation of Chinese Academy of Sciences)*; Zhen Cheng (Institute of Automation of Chinese Academy of Sciences); Xu-Yao Zhang (Institute of Automation of Chinese Academy of Sciences); Cheng-Lin Liu (Institute of Automation of Chinese Academy of Sciences) | N/A | N/A |
| Colorization for in situ marine plankton images | Guannan Guo (Shenzhen Institute of Advanced Technology ,Chinese Academy of Sciences); Qi Lin (Xiamen University); Tao Chen (Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences); Zhenghui Feng (Harbin Institute of Technology, Shenzhen); Zheng Wang (Shenzhen Institutes of Advanced Technology); Jianping Li (Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences)* | N/A | N/A |
| PIP: Physical Interaction Prediction via Mental Simulation with Span Selection | Jiafei Duan (University of Washington, Seattle)*; Samson Yu (Agency for Science, Technology and Research); Soujanya Poria (Singapore University of Technology and Design); Bihan Wen (Nanyang Technological University); Cheston Tan (Institute for Infocomm Research, Singapore) | N/A | N/A |
| Generator Knows What Discriminator Should Learn in Unconditional GANs | Gayoung Lee (NAVER AI Lab)*; Hyunsu Kim (NAVER AI Lab); Junho Kim (NAVER AI Lab); Seonghyeon Kim (Clova AI Research, NAVER Corp.); Jung-Woo Ha (NAVER CLOVA AI Lab); Yunjey Choi (NAVER AI Lab) | N/A | N/A |
| A Gyrovector Space Approach for Symmetric Positive Semi-definite Matrix Learning | Xuan Son Nguyen (Ensea)* | N/A | N/A |
| Compositional Visual Generation with Composable Diffusion Models | Nan Liu (University of Illinois at Urbana-Champaign); Shuang Li (MIT); Yilun Du (MIT)*; Antonio Torralba (MIT); Joshua Tenenbaum (MIT) | N/A | N/A |
| Temporal and cross-modal attention for audio-visual zero-shot learning | Otniel-Bogdan Mercea (University of Tübingen)*; Thomas Hummel (University of Tübingen); A. Sophia Koepke (University of Tübingen); Zeynep Akata (University of Tübingen) | N/A | N/A |
| Telepresence Video Quality Assessment | Zhenqiang Ying (The University of Texas at Austin)*; Deepti Ghadiyaram (Facebook); Alan Bovik (University of Texas at Austin) | N/A | N/A |
| Enhancing Multi-modal Features Using Local Self-attention for 3D Object Detection | hao li (Hikvision Digital Technology Co. Ltd)*; Zehan Zhang (Shanghai Jiao Tong University & Hangzhou Hikvision Digital Technology Co. Ltd); Zhao Xian (Hikvision); yulong wang (Hikvision Digital Technology Co. Ltd); Yuxi Shen (Hikvision); Shiliang Pu (Hikvision Research Institute); Hui Mao (Hangzhou hikvision digital technology Co.,Ltd) | N/A | N/A |
| Totems: Physical Objects for Verifying Visual Integrity | Jingwei Ma (University of Washington)*; Lucy Chai (MIT); Minyoung Huh (MIT); Tongzhou Wang (MIT); Ser-Nam Lim (Meta AI); Phillip Isola (MIT); Antonio Torralba (MIT) | N/A | N/A |
| ManiFest: manifold deformation for few-shot image translation | Fabio Pizzati (Inria / Vislab)*; Jean-Francois Lalonde (Université Laval); Raoul de Charette (Inria) | N/A | N/A |
| 3D Shape Sequence of Human Comparison and Classification using Current and Varifolds | Emery Pierson (Université de Lille)*; Mohamed Daoudi (IMT Lille Douai); Sylvain Arguillere (Institute Camille Jordan) | N/A | N/A |
| Decouple-and-Sample: Protecting sensitive information in task agnostic data release | Abhishek Singh (MIT)*; Ethan Garza (MIT); Ayush Chopra (MIT); Praneeth Vepakomma (MIT); Vivek Sharma (MIT); Ramesh Raskar (Massachusetts Institute of Technology) | N/A | N/A |
| Not All Models Are Equal: Predicting Model Transferability in a Self-challenging Fisher Space | Wenqi Shao (The Chinese University of HongKong)*; Xun Zhao (Tencent Company); Yixiao Ge (Tencent); Zhaoyang Zhang (The Chinese University of Hong Kong); Lei Yang (Tencent); Xiaogang Wang (Chinese University of Hong Kong, Hong Kong); Ying Shan (Tencent); Ping Luo (The University of Hong Kong) | N/A | N/A |
| Object Detection as Probabilistic Set Prediction | Georg Hess (Chalmers University of Technology)*; Christoffer Petersson (Zenseact); Lennart Svensson (Chalmers University of Technology) | N/A | N/A |
| k-SALSA: k-anonymous synthetic averaging of retinal images via local style alignment | Minkyu Jeon (Korea University)*; Hyeonjin Park (Korea University); Hyunwoo J Kim (Korea University); Michael G Morley (Ophthalmic Consultants of Boston); Hyunghoon Cho (Broad Institute of MIT and Harvard) | N/A | N/A |
| Uncertainty-guided Source-free Domain Adaptation | Subhankar Roy (University of Trento)*; Martin Trapp (Aalto University ); Andrea Pilzer (Aalto University); Juho Kannala (Aalto University, Finland); Nicu Sebe (University of Trento); Elisa Ricci (University of Trento); Arno Solin (Aalto University) | N/A | N/A |
| LA3: Efficient Label-Aware AutoAugment | Mingjun Zhao (University of Alberta)*; Shan Lu (University of Alberta); Zixuan Wang (Tencent Inc.); Xiaoli Wang (Tencent); Di Niu (University of Alberta) | N/A | N/A |
| Weakly-Supervised Temporal Action Detection for Fine-Grained Videos with Hierarchical Atomic Actions | Zhi Li (University of California, Berkeley)*; Lu He (Tencent America); Huijuan Xu (Pennsylvania State University) | N/A | N/A |
| Geometric Features Informed Multi-person Human-object Interaction Recognition in Videos | Tanqiu Qiao (Durham University); Qianhui Men (University of Oxford); Frederick W. B. Li (University of Durham); Yoshiki Kubotani (Waseda University); Shigeo Morishima (Waseda Research Institute for Science and Engineering); Hubert P. H. Shum (Durham University)* | N/A | N/A |
| FEAR: Fast, Efficient, Accurate and Robust Visual Tracker | Vasyl Borsuk (Ukrainian Catholic University); Roman Vei (Ukrainian Catholic University); Orest Kupyn (Ukrainian Catholic University); Tetiana Martyniuk (Ukrainian Catholic University)*; Igor Krashenyi (Piñata Farms); Jiri Matas (CMP CTU FEE) | N/A | N/A |
| Variance-Aware Weight Initialization for Point Convolutional Neural Networks | Pedro Hermosilla Casajus (Ulm University)*; Michael Schelling (Ulm University – Institute of Media Informatics); Tobias Ritschel (UCL); Timo Ropinski (Ulm University) | N/A | N/A |
| Learning Visual Representation from Modality-Shared Contrastive Language-Image Pre-training | Haoxuan You (Columbia University)*; Luowei Zhou (Microsoft); Bin Xiao (Microsoft); Noel C Codella (Microsoft); Yu Cheng (Microsoft Research); Ruochen Xu (Microsoft); Shih-Fu Chang (Columbia University); Lu Yuan (Microsoft) | N/A | N/A |
| Single-Stream Multi-Level Alignment for Vision-Language Pretraining | Zaid Khan (Northeastern University)*; Vijay Kumar B G (NEC Laboratories America); Xiang Yu (NEC Labs); Samuel Schulter (NEC Laboratories America); Manmohan Chandraker (UC San Diego); YUN FU (Northeastern University) | N/A | N/A |
| Revisiting Outer Optimization in Adversarial Training | Ali Dabouei (West Virginia university)*; Fariborz Taherkhani (Carnegie Mellon University); Sobhan Soleymani (West Virginia University); Nasser Nasrabadi (West Virginia University) | N/A | N/A |
| Supervised Attribute Information Removal and Reconstruction for Image Manipulation | Nannan Li (Boston University)*; Bryan Plummer (Boston University) | N/A | N/A |
| Conditional-Flow NeRF: Accurate 3D Modelling with Reliable Uncertainty Quantification | Jianxiong Shen (IRI, CSIC-UPC)*; Antonio Agudo (Institut de Robotica i Informatica Industrial, CSIC-UPC); Francesc Moreno (IRI); Adria Ruiz (Seedtag) | N/A | N/A |
| BLT: Bidirectional Layout Transformer for Controllable Layout Generation | Xiang Kong (Carnegie Mellon University)*; Lu Jiang (Google Research); Huiwen Chang (Google); Han Zhang (Google); Yuan Hao (Google); Haifeng Gong (Google Inc.); Irfan Essa (Google) | N/A | N/A |
| Neural Correspondence Field for Object Pose Estimation | Lin Huang (University at Buffalo); Tomas Hodan (Facebook Reality Labs)*; Lingni Ma (Facebook Reality Labs); Linguang Zhang (Facebook Reality Labs); Luan Tran (Facebook); Christopher D Twigg (Meta); PO-CHEN WU (Meta Inc.); Junsong Yuan (State University of New York at Buffalo, USA); Cem Keskin (Facebook); Robert Wang (Facebook Reality Labs) | N/A | N/A |
| The Missing Link: Finding label relations across datasets | Jasper Uijlings (Google Research)*; Thomas Mensink (Google Research); Vittorio Ferrari (Google Research) | N/A | N/A |
| On Label Granularity and Object Localization | Elijah Cole (Caltech)*; Kimberly Wilber (Google); Grant Van Horn (Cornell University); Xuan Yang (Google); Marco Fornoni (Google); Pietro Perona (California Institute of Technology); Serge Belongie (University of Copenhagen); Andrew Howard (Google); Oisin Mac Aodha (University of Edinburgh) | N/A | N/A |
| RadioTransformer: A Cascaded Global-Focal Transformer for Visual Attention-guided Disease Classification | Moinak Bhattacharya (Stony Brook University)*; Shubham Jain (Stony Brook University); Prateek Prasanna (Stony Brook University) | N/A | N/A |
| OIMNet++: Prototypical Normalization and Localization-aware Learning for Person Search | Sanghoon Lee (Yonsei University); Youngmin Oh (Yonsei University); Donghyeon Baek (Yonsei University); Junghyup Lee (Yonsei University); Bumsub Ham (Yonsei University)* | N/A | N/A |
| Most and Least Retrievable Images in Visual-Language Query Systems | Liuwan Zhu (Old Dominion University)*; Rui Ning (Old Dominion University); Jiang Li (Old Dominion University); Chunsheng Xin (Old Dominion University); Hongyi Wu (University of Arizona) | N/A | N/A |
| Contrasting quadratic assignments for set-based representation learning | Artem Moskalev (University of Amsterdam)*; Ivan Sosnovik (University of Amsterdam); Volker Fischer (Bosch Center for Artificial Intelligence); Arnold W.M. Smeulders (University of Amsterdam) | N/A | N/A |
| How stable are Transferability Metrics evaluations? | Andrea Agostinelli (Google)*; Michal Pandy (University of Cambridge); Jasper Uijlings (Google Research); Thomas Mensink (Google Research); Vittorio Ferrari (Google Research) | N/A | N/A |
| A Comparative Study of Graph Matching Algorithms in Computer Vision | Stefan Haller (Heidelberg University)*; Lorenz Feineis (Heidelberg University); Lisa Hutschenreiter (Heidelberg University); Florian Bernard (University of Bonn); Carsten Rother (University of Heidelberg); Dagmar Kainmueller (MDC); Paul Swoboda (MPI fuer Informatik, Saarbruecken); Bogdan Savchynskyy (Heidelberg University) | N/A | N/A |
| HM: Hybrid Masking for Few-Shot Segmentation | Seonghyeon Moon (Rutgers University)*; Samuel S Sohn (Rutgers University); Honglu Zhou (Rutgers University); Sejong Yoon (The College of New Jersey); Vladimir Pavlovic (Rutgers University); Muhammad Haris Khan (Muhammad Bin Zayed University of Artificial Intelligence); Mubbasir Kapadia (Rutgers) | N/A | N/A |
| UCTNet: Uncertainty-aware Cross-modal Transformer Network for Indoor RGB-D Semantic Segmentation | Xiaowen Ying (Lehigh University)*; Mooi Choo Chuah (Lehigh University) | N/A | N/A |
| Learning Omnidirectional Flow in 360° Video via Siamese Representation | Keshav Bhandari (Texas State University)*; Bin Duan (Illinois Institute of Technology); Gaowen Liu (Cisco Research); Hugo M Latapie (Cisco); Ziliang Zong (Texas State University); Yan Yan (Illinois Institute of Technology) | N/A | N/A |
| Improving Generalization in Federated Learning by Seeking Flat Minima | Debora Caldarola (Politecnico di Torino)*; Barbara Caputo (Politecnico di Torino); Marco Ciccone (Politecnico di Torino) | N/A | N/A |
| Efficient Deep Visual and Inertial Odometry with Adaptive Visual Modality Selection | Mingyu Yang (University of Michigan)*; Yu Chen (University of Michigan); Hun Seok Kim (Nil) | N/A | N/A |
| MultiMAE: Multi-modal Multi-task Masked Autoencoders | Roman Bachmann (EPFL)*; David Mizrahi (EPFL); Andrei Atanov (EPFL); Amir Zamir (Swiss Federal Institute of Technology (EPFL)) | N/A | N/A |
| GigaDepth: Learning Depth from Structured Light with Branching Neural Networks | Simon Schreiberhuber (TUWien)*; Jean-Baptiste Weibel (TU Wien); Timothy Patten (University of Technology Sydney); Markus Vincze (TU Wien) | N/A | N/A |
| Diverse Generation from a Single Video Made Possible | Niv Haim (Weizmann Institute of Science)*; Ben Feinstein (Weizmann Institute of Science); Niv Granot (Weizmann Institute of Science); Assaf Shocher (Weizmann Institute of Science); Shai Bagon (Weizmann Institute of Science); Tali Dekel (Weizmann Institute of Science); Michal Irani (Weizmann Institute, Israel) | N/A | N/A |
| Privacy-Preserving Action Recognition via Motion Difference Quantization | Sudhakar Kumawat (Osaka University)*; Hajime Nagahara (Osaka University) | N/A | N/A |
| Learning Phase Mask for Privacy-Preserving Passive Depth Estimation | Zaid Tasneem (Rice University); Giovanni Milione (4 independence Way, Princeton, NJ 08540); Yi-Hsuan Tsai (Phiar Technologies); Xiang Yu (NEC Labs); Ashok Veeraraghavan (Rice University); Manmohan Chandraker (UC San Diego); Francesco Pittaluga (NEC Laboratories America)* | N/A | N/A |
| DuelGAN: A Duel Between Two Discriminators Stabilizes the GAN Training | Jiaheng Wei (UCSC)*; Minghao Liu (UCSC); Jiahao Luo (UCSC); Andrew Zhu (UCSC); James E Davis (UC Santa Cruz); Yang Liu (UC Santa Cruz) | N/A | N/A |
| Should All Proposals be Treated Equally in Object Detection? | Yunsheng Li (UCSD)*; Yinpeng Chen (Microsoft); Xiyang Dai (Microsoft); DongDong Chen (Microsoft Cloud AI); Mengchen Liu (Microsoft); Pei Yu (); Ying Jin (Microsoft); Lu Yuan (Microsoft); Zicheng Liu (Microsoft); Nuno Vasconcelos (UC San Diego) | N/A | N/A |
| Interpretations Steered Network Pruning via Amortized Inferred Saliency Maps | Alireza Ganjdanesh (University of Pittsburgh); Shangqian Gao (University of Pittsburgh); Heng Huang (University of Pittsburgh)* | N/A | N/A |
| Out-of-Distribution Identification: Let Detector Tell Which I Am Not Sure | Ruoqi Li (SJTU); Chongyang Zhang (Shanghai Jiao Tong University)*; Hao Zhou (Shanghai Jiao Tong University); Chao Shi (Shanghai Jiao Tong University); Yan Luo (Shanghai Jiao Tong University) | N/A | N/A |
| Unsupervised Few-Shot Image Classification by Learning Features into Clustering Space | Shuo Li (Xidian University); Fang Liu (Xidian University)*; Zehua Hao (Xidian University); Kaibo Zhao (Xidian University); Licheng Jiao (Xidian University) | N/A | N/A |
| ViP: Unified Certified Detection and Recovery for Patch Attack with Vision Transformers | Junbo Li (UC Santa Cruz); Huan Zhang (UCLA); Cihang Xie (University of California, Santa Cruz)* | N/A | N/A |
| Panoramic Vision Transformer for Saliency Detection in 360 Videos | Heeseung Yun (Seoul National University)*; Sehun Lee (Seoul National University); Gunhee Kim (Seoul National University) | N/A | N/A |
| ActiveNeRF: Learning where to See with Uncertainty Estimation | Xuran Pan (Tsinghua University); Zihang Lai (CMU); Shiji Song (Department of Automation, Tsinghua University); Gao Huang (Tsinghua)* | N/A | N/A |
| incDFM: Incremental Deep Feature Modeling for Continual Novelty Detection | Amanda S Rios (University of Southern California; Intel )*; Nilesh A Ahuja (Intel); Ibrahima Ndiour (Intel); Ergin U Genc (Intel); Laurent Itti (University of Southern California); Omesh Tickoo (Intel) | N/A | N/A |
| BA-Net: Bridge Attention for Deep Convolutional Neural Networks | Yue Zhao (Sun Yat-sen University); Junzhou Chen (Sun Yat-sen University)*; Zhang Zirui (Sun Yat-sen University); Ronghui Zhang (Sun Yat-Sen University) | N/A | N/A |
| Super-Resolution by Predicting Offsets: An Ultra-Efficient Super-Resolution Network for Rasterized Images | Jinjin Gu (The University of Sydney)*; Haoming CAI (University of Maryland, College Park); Chenyu Dong (Graduate school at Shenzhen , Tsinghua University); Ruofan Zhang (Tsinghua University); Yulun Zhang (ETH Zurich); Wenming Yang (Tsinghua University); Chun Yuan (Graduate school at ShenZhen,Tsinghua university) | N/A | N/A |
| Animation from Blur: Multi-modal Blur Decomposition with Motion Guidance | Zhihang Zhong (The University of Tokyo); Xiao Sun (Microsoft Research Asia); Zhirong Wu (Microsoft Research); Yinqiang Zheng (The University of Tokyo); Stephen Lin (Microsoft Research)*; Imari Sato (National Institute of Informatics) | N/A | N/A |
| Zero-Shot Attribute Attacks on Fine-Grained Recognition Models | Nasim Shafiee (Northeastern University)*; Ehsan Elhamifar (Northeastern University) | N/A | N/A |
| Break and Make: Interactive Structural Understanding Using LEGO Bricks | Aaron T Walsman (University of Washington)*; Muru Zhang (University of Washington); Klemen Kotar (Allen Institute for AI); Karthik Desingh (University of Washington); Dieter Fox (NVIDIA Research / University of Washington); Ali Farhadi (University of Washington, Allen Institute for AI, Apple) | N/A | N/A |
| PoserNet: Refining Relative Camera Poses Exploiting Object Detections | Matteo Taiana (Istituto Italiano di Tecnologia)*; Matteo Toso (Istituto Italiano di Tecnologia); Stuart James (Istituto Italiano di Tecnologia (IIT)); Alessio Del Bue (Istituto Italiano di Tecnologia (IIT)) | N/A | N/A |
| Towards Effective and Robust Neural Trojan Defenses via Input Filtering | Kien Duc Do (Deakin University)*; Haripriya Harikumar (Deakin University); Hung Le (Deakin University); Dung Nguyen (Deakin University); Truyen Tran (Deakin University); Santu Rana (Deakin University, Australia); Dang Nguyen (Deakin University); Willy Susilo (University of Wollongong); Svetha Venkatesh (Deakin University) | N/A | N/A |
| View Vertically: A Hierarchical Network for Trajectory Prediction via Fourier Spectrums | Conghao Wong (Huazhong University of Science and Technology); Beihao Xia (Huazhong University of Science and Technology); Ziming Hong (Huazhong University of Science and Technology); Qinmu Peng (Huazhong University of Science and Technology); Wei Yuan (Huazhong University of Science and Technology); Qiong Cao (JD.com); Yibo Yang (Peking University); Xinge YOU (Huazhong University of Science and Technology)* | N/A | N/A |
| Bi-directional Contrastive Learning for Domain Adaptive Semantic Segmentation | Geon Lee (Yonsei University); Chanho Eom (Yonsei University); Wonkyung Lee (PS Analytics); Hyekang Park (Yonsei University); Bumsub Ham (Yonsei University)* | N/A | N/A |
| Rayleigh EigenDirections (REDs): Nonlinear GAN latent space traversals for multidimensional features | Guha Balakrishnan (Rice University)*; Raghudeep Gadde (Amazon); Aleix M Martinez (Amazon); Pietro Perona (Amazon Web Services (AWS)) | N/A | N/A |
| ActionFormer: Localizing Moments of Actions with Transformers | Chen-Lin Zhang (4Paradigm, Inc); Jianxin Wu (Nanjing University); Yin Li (University of Wisconsin-Madison)* | N/A | N/A |
| Theoretical Understanding of the Information Flow on Continual Learning Performance | Joshua J Andle (University of Maine); Salimeh Yasaei Sekeh (University of Maine)* | N/A | N/A |
| 3DG-STFM: 3D Geometric Guided Student-Teacher Feature Matching | Runyu Mao (Purdue University)*; Chen Bai (Xpeng Motors); yatong an (xm); Fengqing Maggie Zhu (Purdue University, USA); Cheng Lu (Xiaopeng) | N/A | N/A |
| Pure Transformer with Integrated Experts for Scene Text Recognition | Yew Lee Tan (Nanyang Technological University)*; Wai-Kin Adams Kong (Nanyang Technological University); Jung Jae Kim (I2R) | N/A | N/A |
| AudioScopeV2: Audio-Visual Attention Architectures for Calibrated Open-Domain On-Screen Sound Separation | Efthymios Tzinis (University of Illinois at Urbana-Champaign); Scott Wisdom (Google)*; Tal Remez (Google); John Hershey (Google) | N/A | N/A |
| Bridging the Domain Gap towards Generalization in Automatic Colorization | Hyejin Lee (Kookmin University); Daehee Kim (Naver Corp.); Daeun Lee (Korea university); Jinkyu Kim (Korea University); Jaekoo Lee (Kookmin University)* | N/A | N/A |
| Learning with Free Object Segments for Long-Tailed Instance Segmentation | Cheng Zhang (Carnegie Mellon University)*; Tai-Yu Pan (The Ohio State University); tianle chen (The Ohio State University); Jike Zhong (The Ohio State University); Wenjin Fu (The Ohio State University); Wei-Lun Chao (The Ohio State University) | N/A | N/A |
| Rethinking Closed-loop Training for Autonomous Driving | Chris Zhang (Waabi / University of Toronto)*; Runsheng Guo (University of Waterloo); Wenyuan Zeng (Waabi, University of Toronto); Yuwen Xiong (University of Toronto); Binbin Dai (Waabi); Rui Hu (Waabi); Mengye Ren (NYU / Google); Raquel Urtasun (Uber ATG) | N/A | N/A |
| Autoregressive Uncertainty Modeling for 3D Bounding Box Prediction | YuXuan Liu (Covariant.ai, UC Berkeley)*; Nikhil Mishra (Covariant.ai, UC Berkeley); Maximilian Sieb (Covariant.ai); Fred Shentu (UC Berkeley); Pieter Abbeel (UC Berkeley); Peter Chen (COVARIANT.AI) | N/A | N/A |
| Learning Regional Purity for Instance Segmentation on 3D Point Clouds | Shichao Dong (Nanyang Technological University)*; Guosheng Lin (Nanyang Technological University); Tzu-Yi HUNG (Delta Research Center) | N/A | N/A |
| Learning from Unlabeled 3D Environments for Vision-and-Language Navigation | Shizhe Chen (INRIA)*; Pierre-Louis Guhur (Inria); Makarand Tapaswi (Wadhwani AI, IIIT Hyderabad); Cordelia Schmid (Inria/Google); Ivan Laptev (INRIA Paris) | N/A | N/A |
| A Dataset Generation Framework for Evaluating Megapixel Image Classifiers & their Explanations | Gautam B Machiraju (Stanford University)*; Sylvia Plevritis (Stanford University); Parag Mallick (Stanford University) | N/A | N/A |
| Sports Video Analysis on Large-Scale Data | Dekun Wu (University of Pittsburgh)*; He Zhao (York University); Xingce Bao (EPFL); Rick Wildes (York University) | N/A | N/A |
| Audio-Visual Segmentation | Jinxing Zhou (Hefei University of Technology); Jianyuan Wang (Chinese University of Hong Kong); Jiayi Zhang (BeiHang University); Weixuan Sun (Australian National University); Jing Zhang (Australian National University); Stan Birchfield (NVIDIA); Dan Guo (Hefei University of Technology); Lingpeng Kong (The University of Hong Kong); Meng Wang (Hefei University of Technology); Yiran Zhong (Australian National University)* | N/A | N/A |
| SLiDE: Self-supervised LiDAR De-snowing through Reconstruction Difficulty | Gwangtak Bae (Seoul National University)*; Byungjun Kim (Seoul National University); Seongyong Ahn (Agency for Defense Development); jihong Min (Agency for Defense Development); Inwook Shim (Inha University) | N/A | N/A |
| On the Angular Update and Hyperparameter Tuning of a Scale-Invariant Network | Juseung Yun (KAIST)*; Janghyeon Lee (LG AI Research); Hyounguk Shon (KAIST); Eojindl Yi (KAIST); Seung Hwan Kim (LG AI Research); Junmo Kim (KAIST) | N/A | N/A |
| IGFormer: Interaction Graph Transformer for Skeleton-based Human Interaction Recognition | Yunsheng Pang (University of Melbourne)*; Qiuhong Ke (Monash University); Hossein Rahmani (Lancaster University); James Bailey (THE UNIVERSITY OF MELBOURNE); Jun Liu (Singapore University of Technology and Design) | N/A | N/A |
| LANA: Latency Aware Network Acceleration | Pavlo Molchanov (NVIDIA)*; James B Hall (Microsoft Research); Hongxu Yin (NVIDIA ); Nicolo Fusi (Microsoft Research); Jan Kautz (NVIDIA); Arash Vahdat (NVIDIA) | N/A | N/A |
| A Sketch Is Worth a Thousand Words: Image Retrieval with Text and Sketch | Patsorn Sangkloy (Georgia Institute of Technology)*; Wittawat Jitkrittum (Google Research); Diyi Yang (Georgia Institute of Technology); James Hays (Georgia Institute of Technology, USA) | N/A | N/A |
| HVC-Net: Unifying Homography, Visibility, and Confidence Learning for Planar Object Tracking | Haoxian Zhang (Tencent)*; Yonggen Ling (Tencent) | N/A | N/A |
| 3D Random Occlusion and Multi-Layer Projection for Deep Multi-Camera Pedestrian Localization | Rui Qiu (Xi’an Jiaotong-Liverpool University, University of Liverpool); Ming Xu (Xi’an Jiaotong-Liverpool University)*; Yuyao Yan (Xi’an Jiaotong-Liverpool University); Jeremy S Smith (University of Liverpool); Xi Yang (Xi’an Jiaotong Liverpool University ) | N/A | N/A |
| Masked Siamese Networks for Label-Efficient Learning | Mahmoud Assran (Facebook AI)*; Mathilde Caron (Facebook Artificial Intelligence Research); Ishan Misra (Facebook AI Research); Piotr Bojanowski (Facebook); Florian Bordes (MILA); Pascal Vincent (Facebook FAIR & MILA Université de Montréal); Armand Joulin (Facebook AI Research); Mike Rabbat (Facebook FAIR); Nicolas Ballas (Facebook FAIR) | N/A | N/A |
| A Simple Single-Scale Vision Transformer for Object Detection and Instance Segmentation | Wuyang Chen (University of Texas at Austin)*; Xianzhi Du (Google Brain); Fan Yang (Google); Lucas Beyer (Google Brain); Xiaohua Zhai (Google Brain); Tsung-Yi Lin (Google Brain); Huizhong Chen (Google); Jing Li (Google Brain); Xiaodan Song (Google Brain); Zhangyang Wang (University of Texas at Austin); Denny Zhou (Google Brain) | N/A | N/A |
| A Cloud 3D Dataset and Application-Specific Learned Image Compression in Cloud 3D | Tianyi Liu (The University of Texas at San Antonio)*; Sen He (The University of Texas at San Antonio); Vinodh Kumaran Jayakumar (UTSA); Wei Wang (The University of Texas at San Antonio) | N/A | N/A |
| Cross-Domain Few-Shot Semantic Segmentation | Shuo Lei (Virginia Tech)*; Xuchao Zhang (NEC Labs America); Jianfeng He (Virginia Tech); Fanglan Chen (Virginia Tech); Bowen Du (Beihang University); Chang-Tien Lu (Virginia Tech, USA) | N/A | N/A |
| VizWiz-FewShot: Locating Objects in Images Taken by People With Visual Impairments | Yu-Yun Tseng (University of Colorado Boulder)*; Alexander Bell (IVC Group); Danna Gurari (University of Colorado Boulder) | N/A | N/A |
| Towards Metrical Reconstruction of Human Faces | Wojciech Zielonka (Max Planck Institute for Intelligent Systems); Timo Bolkart (Max Planck Institute for Intelligent Systems); Justus Thies (Max Planck Institute for Intelligent Systems)* | N/A | N/A |
| DeepShadow: Neural Shape from Shadow | Asaf Karnieli (Reichman University)*; Yacov Hel-Or (The Interdisciplinary Center); Ohad Fried (IDC Herzliya) | N/A | N/A |
| Class-Incremental Learning with Cross-Space Clustering and Controlled Transfer | Arjun Ashok (Indian Institute of Technology, Hyderabad)*; Joseph K J (Indian Institute of Technology, Hyderabad); Vineeth N Balasubramanian (Indian Institute of Technology, Hyderabad) | N/A | N/A |
| Object discovery and representation networks | Olivier Henaff (DeepMind)*; Skanda Koppula (DeepMind); Evan Shelhamer (DeepMind); Daniel Zoran (DeepMind); Andrew Jaegle (DeepMind); Andrew Zisserman (Oxford University); Joao Carreira (DeepMind); Relja Arandjelović (DeepMind) | N/A | N/A |
| MeshUDF: Fast and Differentiable Meshing of Unsigned Distance Field Networks | Benoit Guillard (EPFL)*; Federico Stella (EPFL); Pascal Fua (EPFL, Switzerland) | N/A | N/A |
| Natural Synthetic Anomalies for Self-Supervised Anomaly Detection and Localization | Hannah M Schlueter (Imperial College London)*; Jeremy Tan (Imperial College London); Benjamin Hou (Imperial College London); Bernhard Kainz (Imperial College London, FAU Erlangen-Nürnberg) | N/A | N/A |
| Shap-CAM: Visual Explanations for Convolutional Neural Networks based on Shapley Value | Quan Zheng (Tsinghua University); Ziwei Wang (Tsinghua University); Jie Zhou (Tsinghua University); Jiwen Lu (Tsinghua University)* | N/A | N/A |
| Simple Open-Vocabulary Object Detection with Vision Transformers | Matthias Minderer (Google Research)*; Alexey Gritsenko (Google Brain); Austin C Stone (Google); Maxim Neumann (Google); Dirk Weißenborn (German Research Center for Artificial Intelligence); Alexey Dosovitskiy (Inceptive); Aravindh Mahendran (Google); Anurag Arnab (Google); Mostafa Dehghani (Google Brain); Zhuoran Shen (Pony.ai); Xiao Wang (Google); Xiaohua Zhai (Google Brain); Thomas Kipf (Google Brain); Neil Houlsby (Google) | N/A | N/A |
| Video Restoration Framework and its Meta-adaptations to Data-poor Conditions | Prashant W Patil (Deakin University)*; Sunil Gupta (Deakin University, Australia); Santu Rana (Deakin University, Australia); Svetha Venkatesh (Deakin University) | N/A | N/A |
| PRIME: A Few Primitives Can Boost Robustness to Common Corruptions | Apostolos Modas (EPFL)*; Rahul Shekhar Rade (EthonAI); Guillermo Ortiz-Jimenez (EPFL); Seyed-Mohsen Moosavi-Dezfooli (Imperial College London); Pascal Frossard (EPFL) | N/A | N/A |
| AlphaVC: High-Performance and Efficient Learned Video Compression | Yibo Shi (Huawei); Yunying Ge (Huawei Technologies); Jing Wang (Huawei)*; Jue Mao (Huawei technologies) | N/A | N/A |
| Content-Oriented Learned Image Compression | Meng Li (Huawei); Shangyin Gao (Huawei); Yihui Feng (HUAWEI Technology Co., Ltd); Yibo Shi (Huawei); Jing Wang (Huawei)* | N/A | N/A |
| Generating Natural Images with Direct Patch Distributions Matching | Ariel Elnekave (Hebrew University of Jerusalem)*; Yair Weiss (Hebrew University) | N/A | N/A |
| Latent Space Smoothing for Individually Fair Representations | Momchil Peychev (ETH Zurich)*; Anian Ruoss (DeepMind); Mislav Balunovic (ETH Zurich); Maximilian Baader (ETH Zürich); Martin Vechev (ETH Zurich) | N/A | N/A |
| SAU: Smooth activation function using convolution with approximate identities | Koushik Biswas (Indraprastha Institute of Information Technology, New Delhi, India)*; Sandeep Kumar (Shaheed Bhagat Singh College, University of Delhi, Delhi); Shilpak Banerjee (Indian Institute of Technology Tirupati); Ashish Kumar Pandey (Indraprastha Institute of Information Technology, New Delhi, India) | N/A | N/A |
| TRoVE: Transforming Road Scene Datasets into Photorealistic Virtual Environments | Shubham Dokania (IIIT Hyderabad)*; Anbumani Subramanian (IIIT-Hyderabad); Manmohan Chandraker (UC San Diego); C.V. Jawahar (IIIT-Hyderabad) | N/A | N/A |
| Motion Sensitive Contrastive Learning for Self-supervised Video Representation | JingCheng Ni (Beihang University)*; Nan Zhou (Beihang University); Jie Qin (Nanjing University of Aeronautics and Astronautics); Qian Wu (Megvii); Junqi Liu (Megvii); Boxun Li (Megvii Inc.); Di Huang (Beihang University, China) | N/A | N/A |
| Scaling Adversarial Training to Large Perturbation Bounds | Sravanti Addepalli (Indian Institute of Science)*; Samyak Jain (Indian Institute of Technology (BHU), Varanasi); Gaurang Sriramanan (University of Maryland, College Park); Venkatesh Babu RADHAKRISHNAN (Indian Institute of Science) | N/A | N/A |
| RDO-Q: Extremely Fine-Grained Channel-Wise Quantization via Rate-Distortion Optimization | Zhe Wang (Institute for Infocomm Research, Singapore); Jie Lin (Institute for Infocomm Research (I2R), Singapore); Xue Geng (I2R, ASTAR); Mohamed M. Sabry Aly (Nanyang Technological University); Vijay R. Chandrasekhar (Institute for Infocomm Research) | N/A | N/A |
| Camera Auto-calibration from the Steiner Conic of the Fundamental Matrix | Yu LIU (United International College, BNU-HKBU)*; Hui Zhang (UIC) | N/A | N/A |
| Understanding Collapse in Non-Contrastive Siamese Representation Learning | Alexander C Li (Carnegie Mellon University)*; Alexei A Efros (UC Berkeley); Deepak Pathak (Carnegie Mellon University) | N/A | N/A |
| AutoTransition: Learning to Recommend Video Transition Effects | Yaojie Shen (Institute of Software, Chinese Academy of Sciences); Libo Zhang (Institute of Software Chinese Academy of Sciences); Kai Xu (ByteDance Inc); Xiaojie Jin (Bytedance Inc. USA)* | N/A | N/A |
| SPE-Net: Boosting Point Cloud Analysis via Rotation Robustness Enhancement | Zhaofan Qiu (JD.com); Yehao Li (JD AI Research); Yu Wang (JD AI Research); Yingwei Pan (JD AI Research); Ting Yao (JD AI Research)*; Tao Mei (AI Research of JD.com) | N/A | N/A |
| Text-based Temporal Localization of Novel Events | Sudipta Paul (University of California, Riverside)*; Niluthpol C Mithun (SRI International); Amit K. Roy-Chowdhury (University of California, Riverside) | N/A | N/A |
| Effective Presentation Attack Detection Driven by Face Related Task | Wentian Zhang (Shenzhen University); Haozhe Liu (King Abdullah University of Science and Technology); Feng Liu (Shenzhen University)*; Raghavendra Ramachandra (NTNU, Norway); Christoph Busch (Norwegian University of Science and Technology) | N/A | N/A |
| LWGNet – Learned Wirtinger Gradients for Fourier Ptychographic Phase Retrieval | Atreyee Saha (Indian Institute of Technology Madras)*; Salman Siddique Khan (IIT Madras); Sagar Sehrawat (IIT Madras); Sanjana S Prabhu (Indian Institute of Technology Madras); Shanti Bhattacharya (IIT Madras); Kaushik Mitra (IIT Madras) | N/A | N/A |
| Federated Self-supervised Learning for Video Understanding | Yasar Rehman (TCL Corporate Research(Hong Kong) Co. Ltd); Yan Gao (University of Cambridge)*; Jiajun Shen (TCL Research); Pedro Gusmao (University of Cambridge); Nicholas Lane (University of Cambridge and Samsung AI) | N/A | N/A |
| Reliability-Aware Prediction via Uncertainty Learning for Person Image Retrieval | Zhaopeng Dou (Tsinghua University)*; Zhongdao Wang (Tsinghua University); Weihua Chen (alibaba group); Ya-Li Li (Tsinghua University); Shengjin Wang (Tsinghua University) | N/A | N/A |
| The Shape Part Slot Machine: Contact-based Reasoning for Generating 3D Shapes from Parts | Kai Wang (Brown University)*; Paul Guerrero (Adobe); Vladimir Kim (Adobe); Siddhartha Chaudhuri (Adobe Research); Minhyuk Sung (KAIST); Daniel Ritchie (Brown University) | N/A | N/A |
| Attention Diversification for Domain Generalization | Rang Meng (Hikvision Research Institute)*; Xianfeng Li (Hikvision Research Institute ); Weijie Chen (Zhejiang University); Shicai Yang (Hikvision Research Institute); Jie Song (Zhejiang University); Xinchao Wang (National University of Singapore); Lei Zhang (Chongqing University); Mingli Song (Zhejiang University); Di Xie (Hikvision Research Institute); Shiliang Pu (Hikvision Research Institute) | N/A | N/A |
| Exploiting the local parabolic landscapes of adversarial losses to accelerate black-box adversarial attack | Hoang Tran (Oak Ridge National Laboratory); Dan Lu (Oak Ridge National Laboratory); Guannan Zhang (Oak Ridge National Laboratory)* | N/A | N/A |
| Towards Efficient and Effective Self-Supervised Learning of Visual Representations | Sravanti Addepalli (Indian Institute of Science)*; Kaushal Bhogale (Indian Institute of Technology, Madras); Priyam Dey (Indian Institute of Science); Venkatesh Babu RADHAKRISHNAN (Indian Institute of Science) | N/A | N/A |
| TransVLAD: Focusing on Locally Aggregated Descriptors for Few-Shot Learning | Haoquan Li (Southern University of Science and Technology)*; Laoming Zhang (Southern University of Science and Technology); Daoan Zhang (Southern University of Science and Technology); Lang Fu (Southern University of Science and Technology); Peng Yang (Southern University of Science and Technology); Jianguo Zhang (Southern University of Science and Technology) | N/A | N/A |
| Rotation Regularization Without Rotation | Takumi Kobayashi (National Institute of Advanced Industrial Science and Technology)* | N/A | N/A |
| Parameterized Temperature Scaling for Boosting the Expressive Power in Post-Hoc Uncertainty Calibration | Christian Tomani (TUM)*; Daniel Cremers (TU Munich); Florian Buettner (German Cancer Research Center and Frankfurt University) | N/A | N/A |
| FairStyle: Debiasing StyleGAN2 with Style Channel Manipulations | Cemre Efe Karakas (Bogazici University); Alara Dirik (Bogazici University); Eylül Yalçınkaya (Bogazici University); Pinar Yanardag (Bogazici University)* | N/A | N/A |
| Dynamic Temporal Filtering in Video Models | Fuchen Long (JD.com); Zhaofan Qiu (JD.com); Yingwei Pan (JD AI Research)*; Ting Yao (JD AI Research); Chong-Wah Ngo (Singapore Management University); Tao Mei (AI Research of JD.com) | N/A | N/A |
| DH-AUG: DH Forward Kinematics Model Driven Augmentation for 3D Human Pose Estimation | linzhi huang (Beijing University of Posts and Telecommunications)*; Jiahao Liang (Beijing University of Posts and Telecommunications); Weihong Deng (Beijing University of Posts and Telecommunications) | N/A | N/A |
| Super-resolution 3D Human Shape from a Single Low-Resolution Image | Marco Pesavento (University of Surrey)*; Marco Volino (University of Surrey); Adrian Hilton (University of Surrey) | N/A | N/A |
| Trading Positional Complexity vs Deepness in Coordinate Networks | Jianqiao Zheng (University of Adelaide)*; Sameera Ramasinghe (University of Adelaide); Xueqian Li (Carnegie Mellon University); Simon Lucey (University of Adelaide) | N/A | N/A |
| ESS: Learning Event-based Semantic Segmentation from Still Images | Zhaoning Sun (ETH Zürich); Nico Messikommer (University of Zurich & ETH Zurich)*; Daniel Gehrig (University of Zurich & ETH Zurich); Davide Scaramuzza (University of Zurich & ETH Zurich, Switzerland) | N/A | N/A |
| U-Boost NAS: Utilization-Boosted Differentiable Neural Architecture Search | Ahmet Yüzügüler (EPFL)*; Nikolaos Dimitriadis (EPFL); Pascal Frossard (EPFL) | N/A | N/A |
| MonteBoxFinder: Detecting and Filtering Primitives to Fit a Noisy Point Cloud | Michaël Ramamonjisoa (Ecole des Ponts)*; Sinisa Stekovic (Graz University of Technology); Vincent Lepetit (Ecole des Ponts ParisTech) | N/A | N/A |
| Trapped in texture bias? A large scale comparison of deep instance segmentation | Johannes Theodoridis (Hochschule der Medien Stuttgart)*; Jessica Hofmann (Hochschule der Medien); Johannes Maucher (Media University Stuttgart); Andreas G Schilling (University of Tübingen) | N/A | N/A |
| MVDG: A Unified Multi-view Framework for Domain Generalization | Jian Zhang (Nanjing University)*; Lei Qi (Southeast University); Yinghuan Shi (Nanjing University); Yang Gao (Nanjing University) | N/A | N/A |
| MINER: Multiscale Implicit Neural Representation | Vishwanath Saragadam (Rice University)*; Jasper T Tan (Rice University); Guha Balakrishnan (Rice University); Richard Baraniuk (Rice University); Ashok Veeraraghavan (Rice University) | N/A | N/A |
| PTQ4ViT: Post-Training Quantization for Vision Transformers with Twin Uniform Quantization | Zhihang Yuan (Peking University)*; Chenhao Xue (Peking University); Yiqi Chen (Peking University); Qiang Wu (HOUMO.AI); Guangyu Sun (Peking University) | N/A | N/A |
| Context-Consistent Semantic Image Editing with Style-Preserved Modulation | Wuyang Luo (School of Computer Science, Fudan University); Su Yang (School of Computer Science, Fudan University)*; Hong Wang (School of Computer Science, Fudan University); Bo Long (School of Computer Science, Fudan University); Weishan Zhang (Department of Software Engineering, China University of Petroleum) | N/A | N/A |
| Distilling the Undistillable: Learning from a Nasty Teacher | Surgan Jandial (MDSR Labs, Adobe)*; Yash Khasbage (Indian Institute of Technology, Hyderabad); Arghya Pal (Harvard University); Vineeth N Balasubramanian (Indian Institute of Technology, Hyderabad); Balaji Krishnamurthy | N/A | N/A |
| Grounding Visual Representations with Texts for Domain Generalization | Seonwoo Min (LG AI Research)*; Nokyung Park (Korea University); Siwon Kim (Seoul National University); Seunghyun Park (Clova AI Research, NAVER Corp.); Jinkyu Kim (Korea University) | N/A | N/A |
| Towards Accurate Open-Set Recognition via Background-Class Regularization | Wonwoo Cho (Korea Advanced Institute of Science and Technology)*; Jaegul Choo (Korea Advanced Institute of Science and Technology) | N/A | N/A |
| In Defense of Image Pre-Training for Spatiotemporal Recognition | Xianhang Li (University of California, Santa Cruz)*; Huiyu Wang (JHU); Chen Wei (Johns Hopkins University); Jieru Mei (Johns Hopkins University); Alan Yuille (Johns Hopkins University); Yuyin Zhou (UC Santa Cruz); Cihang Xie (University of California, Santa Cruz) | N/A | N/A |
| SocialVAE: Human Trajectory Prediction using Timewise Latents | Pei Xu (Clemson University)*; Jean-Bernard Hayet (CIMAT); Ioannis Karamouzas (Clemson University) | N/A | N/A |
| BodySLAM: Joint Camera Localisation, Mapping, and Human Motion Tracking | Dorian F Henning (Imperial College London)*; Tristan Laidlow (Imperial College London); Stefan Leutenegger (TU Munich) | N/A | N/A |
| Eliminating Gradient Conflict in Reference-based Line-Art Colorization | Zekun Li (University of Electronic Science and Technology of China)*; Zhengyang Geng (Peking University); Zhao Kang (University of Electronic Science and Technology of China); Wenyu Chen (University of Electronic Science and Technology of China); Yibo Yang (Peking University) | N/A | N/A |
| Transfer without Forgetting | Matteo Boschini (University of Modena and Reggio Emilia)*; Lorenzo Bonicelli (University of Modena and Reggio Emilia); Angelo Porrello (University of Modena and Reggio Emilia); Giovanni Bellitto (University of Catania); Matteo Pennisi (University of Catania); Simone Palazzo (University of Catania); Concetto Spampinato (University of Catania); Simone Calderara (University of Modena and Reggio Emilia, Italy) | N/A | N/A |
| DSR — A dual subspace re-projection network for surface anomaly detection | Vitjan Zavrtanik (University of Ljubljana)*; Matej Kristan (University of Ljubljana); Danijel Skocaj (University of Ljubljana) | N/A | N/A |
| Multi-Exit Semantic Segmentation Networks | Alexandros Kouris (Imperial College London and Samsung AI)*; Stylianos Venieris (Samsung AI); Stefanos Laskaridis (Samsung AI); Nicholas Lane (University of Cambridge and Samsung AI) | N/A | N/A |
| Almost-Orthogonal Layers for Efficient General-Purpose Lipschitz Networks | Bernd Prach (IST Austria)*; Christoph H Lampert (IST Austria) | N/A | N/A |
| Bridging the visual semantic gap in VLN via semantically richer instructions | Joaquín Ignacio Ossandón (Universidad Catolica de Chile)*; Benjamín Earle (Universidad Católica de Chile); Alvaro Soto (Universidad Catolica de Chile) | N/A | N/A |
| Kernel Relative-prototype Spectral Filtering for Few-shot Learning | Tao Zhang (Chengdu Techman Software Co., Ltd.)*; Wu Huang (Sichuan University) | N/A | N/A |
| StoryDALL-E: Adapting Pretrained Text-to-image Transformers for Story Continuation | Adyasha Maharana (UNC Chapel Hill)*; Darryl Hannan (University of North Carolina at Chapel Hill); Mohit Bansal (University of North Carolina at Chapel Hill) | N/A | N/A |
| Unsupervised Learning of Efficient Geometry-Aware Neural Articulated Representations | Atsuhiro Noguchi (The University of Tokyo)*; Xiao Sun (Microsoft Research Asia); Stephen Lin (Microsoft Research); Tatsuya Harada (The University of Tokyo / RIKEN) | N/A | N/A |
| PANDORA: Polarization-Aided Neural Decomposition Of Radiance | Akshat Dave (Rice University)*; Yongyi Zhao (Rice University); Ashok Veeraraghavan (Rice University) | N/A | N/A |
| OCR-free Document Understanding Transformer | Geewook Kim (NAVER Corporation)*; Teakgyu Hong (Upstage AI); Moonbin Yim (Clova AI Research, NAVER Corp.); Jeongyeon Nam (Naver); Jinyoung Park (TmaxAI); Jinyeong Yim (Google); Wonseok Hwang (LBox); Sangdoo Yun (NAVER AI LAB); Dongyoon Han (NAVER AI Lab); Seunghyun Park (Clova AI Research, NAVER Corp.) | N/A | N/A |
| VQGAN-CLIP: Open Domain Image Generation and Manipulation Using Natural Language | Katherine B Crowson (EleutherAI); Stella R Biderman (Booz Allen Hamilton)*; Daniel Kornis (Eleuther.ai); Dashiell Stander (Eleuther AI); Eric Hallahan (EleutherAI); Louis J Castricato (Georgia Tech); Edward Raff (Booz Allen Hamilton) | N/A | N/A |
| Learning to use unlabeled data in data augmentation for 3D detection | Zhaoqi Leng (Waymo)*; Shuyang Cheng (Waymo LLC); Ben Caine (Google); Weiyue Wang (Waymo); Xiao Zhang (Cruise); Jonathon Shlens (Google); Mingxing Tan (Waymo); Dragomir Anguelov (Waymo) | N/A | N/A |
| Differentiable Zooming for Multiple Instance Learning on Whole-Slide Images | Kevin Thandiackal (ETH Zurich / IBM Research)*; Boqi Chen (ETH Zurich); Pushpak Pati (IBM Research Zurich); Guillaume Jaume (Harvard); Drew Williamson (Pathology, Brigham and Women’s Hospital, Harvard Medical School); Maria Gabrani (IBM Research); Orcun Goksel (ETH Zurich) | N/A | N/A |
| Towards Learning Neural Representations from Shadows | Kushagra Tiwary (MIT)*; Tzofi M Klinghoffer (Massachusetts Institute of Technology); Ramesh Raskar (Massachusetts Institute of Technology) | N/A | N/A |
| Augmenting Deep Classifiers with Polynomial Neural Networks | Grigorios Chrysos (EPFL)*; Markos Georgopoulos (Imperial College London); Jiankang Deng (Imperial College London); Jean Kossaifi (NVIDIA); Yannis Panagakis (University of Athens); Animashree Anandkumar (Caltech) | N/A | N/A |
| AdaBest: Minimizing Client Drift in Federated Learning via Adaptive Bias Estimation | Farshid Varno (Dalhousie/Imagia)*; Marzie Saghayi (Dalhousie University); Laya Rafiee Sevyeri (Concordia); Sharut Gupta (MILA, Imagia, Indian Institute of Technology Delhi (IIT Delhi)); Stan Matwin (Dalhousie University); Mohammad Havaei (Imagia) | N/A | N/A |
| A Simple Approach and Benchmark for 21,000-Category Object Detection | Yutong Lin (Xi’an Jiaotong University); Chen Li (Xi’an Jiaotong University); Yue Cao (Microsoft Research); Zheng Zhang (MSRA); Jianfeng Wang (Microsoft); Lijuan Wang (Microsoft); Zicheng Liu (Microsoft); Han Hu (Microsoft Research Asia)* | N/A | N/A |
| Bitwidth-Adaptive Quantization-Aware Neural Network Training: A Meta-Learning Approach | Jiseok Youn (Seoul National University)*; Jaehun Song (Seoul National University); Hyung-Sin Kim (Seoul National University); Saewoong Bahk (Seoul National University) | N/A | N/A |
| Learning with Noisy Labels by Efficient Transition Matrix Estimation to Combat Label Miscorrection | Seong Min Kye (KAIST); Kwanghee Choi (Sogang University); Joonyoung Yi (Hyperconnect); Buru Chang (Hyperconnect)* | N/A | N/A |
| Online Task-free Continual Learning with Dynamic Sparse Distributed Memory | Julien Pourcel (ENSEA)*; Ngoc-Son Vu (ETIS/Université Paris Seine, Université Cergy-Pontoise, ENSEA, CNRS/ 95000-Cergy); Robert M French (CNRS) | N/A | N/A |
ECCV 2024
| Title | Author | PDF_Link | Code_URL |
|---|---|---|---|
| 4D Contrastive Superflows are Dense 3D Representation Learners | Unknown | N/A | |
| Octopus: Embodied Vision-Language Programmer from Environmental Feedback | Unknown | N/A | |
| ItTakesTwo: Leveraging Peer Representations for Semi-supervised LiDAR Semantic Segmentation | Unknown | N/A | |
| Weakly Supervised 3D Object Detection via Multi-Level Visual Guidance | Unknown | N/A | |
| Modeling and Driving Human Body Soundfields through Acoustic Primitives | Unknown | N/A | |
| Motion Mamba: Efficient and Long Sequence Motion Generation | Unknown | N/A | |
| Learning to Generate Conditional Tri-plane for 3D-aware Expression Controllable Portrait Animation | Unknown | N/A | |
| SAGS: Structure-Aware 3D Gaussian Splatting | Unknown | N/A | |
| MSD: A Benchmark Dataset for Floor Plan Generation of Building Complexes | Unknown | N/A | |
| Guide-and-Rescale: Self-Guidance Mechanism for Effective Tuning-Free Real Image Editing | Unknown | N/A | |
| 3DGazeNet: Generalizing Gaze Estimation with Weak Supervision from Synthetic Views | Unknown | N/A | |
| Generating Physically Realistic and Directable Human Motions from Multi-Modal Inputs | Unknown | N/A | |
| Enhancing Perceptual Quality in Video Super-Resolution through Temporally-Consistent Detail Synthesis using Diffusion Models | Unknown | N/A | |
| Disentangling Masked Autoencoders for Unsupervised Domain Generalization | Unknown | N/A | |
| SemGrasp: Semantic Grasp Generation via Language Aligned Discretization | Unknown | N/A | |
| BAM-DETR: Boundary-Aligned Moment Detection Transformer for Temporal Sentence Grounding in Videos | Unknown | N/A | |
| Optimizing Factorized Encoder Models: Time and Memory Reduction for Scalable and Efficient Action Recognition | Unknown | N/A | |
| MarineInst: A Foundation Model for Marine Image Analysis with Instance Visual Description | Unknown | N/A | |
| BRAVE: Broadening the visual encoding of vision-language models | Unknown | N/A | |
| Motion-prior Contrast Maximization for Dense Continuous-Time Motion Estimation | Unknown | N/A | |
| SplatFields: Neural Gaussian Splats for Sparse 3D and 4D Reconstruction | Unknown | N/A | |
| CPT-VR: Improving Surface Rendering via Closest Point Transform with View-Reflection Appearance | Unknown | N/A | |
| OGNI-DC: Robust Depth Completion with Optimization-Guided Neural Iterations | Unknown | N/A | |
| MapDistill: Boosting Efficient Camera-based HD Map Construction via Camera-LiDAR Fusion Model Distillation | Unknown | N/A | |
| High-Resolution and Few-shot View Synthesis from Asymmetric Dual-lens Inputs | Unknown | N/A | |
| AFreeCA: Annotation-Free Counting for All | Unknown | N/A | |
| Adversarially Robust Distillation by Reducing the Student-Teacher Variance Gap | Unknown | N/A | |
| LN3Diff: Scalable Latent Neural Fields Diffusion for Speedy 3D Generation | Unknown | N/A | |
| Motion and Structure from Event-based Normal Flow | Unknown | N/A | |
| Hierarchical Temporal Context Learning for Camera-based Semantic Scene Completion | Unknown | N/A | |
| DiscoMatch: Fast Discrete Optimisation for Geometrically Consistent 3D Shape Matching | Unknown | N/A | |
| When Pedestrian Detection Meets Multi-Modal Learning: Generalist Model and Benchmark Dataset | Unknown | N/A | |
| HIMO: A New Benchmark for Full-Body Human Interacting with Multiple Objects | Unknown | N/A | |
| You Only Learn One Query: Learning Unified Human Query for Single-Stage Multi-Person Multi-Task Human-Centric Perception | Unknown | N/A | |
| Contrastive ground-level image and remote sensing pre-training improves representation learning for natural world imagery | Unknown | N/A | |
| Instance-dependent Noisy-label Learning with Graphical Model Based Noise-rate Estimation | Unknown | N/A | |
| GKGNet: Group K-Nearest Neighbor based Graph Convolutional Network for Multi-Label Image Recognition | Unknown | N/A | |
| LayoutDETR: Detection Transformer Is a Good Multimodal Layout Designer | Unknown | N/A | |
| Merlin: Empowering Multimodal LLMs with Foresight Minds | Unknown | N/A | |
| E.T. the Exceptional Trajectory: Text-to-camera-trajectory generation with character awareness | Unknown | N/A | |
| Nuvo: Neural UV Mapping for Unruly 3D Representations | Unknown | N/A | |
| Towards Neuro-Symbolic Video Understanding | Unknown | N/A | |
| SceneGraphLoc: Cross-Modal Coarse Visual Localization on 3D Scene Graphs | Unknown | N/A | |
| Improving 2D Feature Representations by 3D-Aware Fine-Tuning | Unknown | N/A | |
| Diffusion Bridges for 3D Point Cloud Denoising | Unknown | N/A | |
| AttnZero: Efficient Attention Discovery for Vision Transformers | Unknown | N/A | |
| Auto-GAS: Automated Proxy Discovery for Training-free Generative Architecture Search | Unknown | N/A | |
| Auto-DAS: Automated Proxy Discovery for Training-free Distillation-aware Architecture Search | Unknown | N/A | |
| Spectral Subsurface Scattering for Material Classification | Unknown | N/A | |
| HeadGaS: Real-Time Animatable Head Avatars via 3D Gaussian Splatting | Unknown | N/A | |
| Eyes Closed, Safety On: Protecting Multimodal LLMs via Image-to-Text Transformation | Unknown | N/A | |
| nuCraft: Crafting High Resolution 3D Semantic Occupancy for Unified 3D Scene Understanding | Unknown | N/A | |
| HumanRefiner: Benchmarking Abnormal Human Generation and Refining with Coarse-to-fine Pose-Reversible Guidance | Unknown | N/A | |
| CarFormer: Self-Driving with Learned Object-Centric Representations | Unknown | N/A | |
| Text-Guided Video Masked Autoencoder | Unknown | N/A | |
| PanGu-Draw: Advancing Resource-Efficient Text-to-Image Synthesis with Time-Decoupled Training and Reusable Coop-Diffusion | Unknown | N/A | |
| BAD-Gaussians: Bundle Adjusted Deblur Gaussian Splatting | Unknown | N/A | |
| Textual-Visual Logic Challenge: Understanding and Reasoning in Text-to-Image Generation | Unknown | N/A | |
| ShareGPT4V: Improving Large Multi-Modal Models with Better Captions | Unknown | N/A | |
| EvSign: Sign Language Recognition and Translation with Streaming Events | Unknown | N/A | |
| MetaAug: Meta-Data Augmentation for Post-Training Quantization | Unknown | N/A | |
| QUAR-VLA: Vision-Language-Action Model for Quadruped Robots | Unknown | N/A | |
| Towards Latent Masked Image Modeling for Self-Supervised Visual Representation Learning | Unknown | N/A | |
| UNIKD: UNcertainty-Filtered Incremental Knowledge Distillation for Neural Implicit Representation | Unknown | N/A | |
| PartSTAD: 2D-to-3D Part Segmentation Task Adaptation | Unknown | N/A | |
| FutureDepth: Learning to Predict the Future Improves Video Depth Estimation | Unknown | N/A | |
| Cross-Input Certified Training for Universal Perturbations | Unknown | N/A | |
| Rethinking and Improving Visual Prompt Selection for In-Context Learning Segmentation Framework | Unknown | N/A | |
| LiDAR-Event Stereo Fusion with Hallucinations | Unknown | N/A | |
| X-Former: Unifying Contrastive and Reconstruction Learning for MLLMs | Unknown | N/A | |
| Multi-Granularity Sparse Relationship Matrix Prediction Network for End-to-End Scene Graph Generation | Unknown | N/A | |
| Revisiting Supervision for Continual Representation Learning | Unknown | N/A | |
| Dolphins: Multimodal Language Model for Driving | Unknown | N/A | |
| MMBENCH: Is Your Multi-Modal Model an All-around Player? | Unknown | N/A | |
| HUMOS: Human Motion Model Conditioned on Body Shape | Unknown | N/A | |
| ZipLoRA: Any Subject in Any Style by Effectively Merging LoRAs | Unknown | N/A | |
| Implicit Filtering for Learning Neural Signed Distance Functions from 3D Point Clouds | Unknown | N/A | |
| Unsupervised Exposure Correction | Unknown | N/A | |
| SceneScript: Reconstructing Scenes With An Autoregressive Structured Language Model | Unknown | N/A | |
| External Knowledge Enhanced 3D Scene Generation from Sketch | Unknown | N/A | |
| GlobalPointer: Large-Scale Plane Adjustment with Bi-Convex Relaxation | Unknown | N/A | |
| DreamScene360: Unconstrained Text-to-3D Scene Generation with Panoramic Gaussian Splatting | Unknown | N/A | |
| Frequency-Spatial Entanglement Learning for Camouflaged Object Detection | Unknown | N/A | |
| 3D Congealing: 3D-Aware Image Alignment in the Wild | Unknown | N/A | |
| Adversarial Robustification via Text-to-Image Diffusion Models | Unknown | N/A | |
| CoMo: Controllable Motion Generation through Language Guided Pose Code Editing | Unknown | N/A | |
| MVDiffHD: A Dense High-resolution Multi-view Diffusion Model for Single or Sparse-view 3D Object Reconstruction | Unknown | N/A | |
| Semi-Supervised Teacher-Reference-Student Architecture for Action Quality Assessment | Unknown | N/A | |
| VisionTrap: Vision-Augmented Trajectory Prediction Guided by Textual Descriptions | Unknown | N/A | |
| Meta-Prompting for Automating Zero-shot Visual Recognition with LLMs | Unknown | N/A | |
| Occluded Gait Recognition with Mixture of Experts: An Action Detection Perspective | Unknown | N/A | |
| Benchmarking the Robustness of Cross-view Geo-localization Models | Unknown | N/A | |
| Groma: Localized Visual Tokenization for Grounding Multimodal Large Language Models | Unknown | N/A | |
| Model Stock: All we need is just a few fine-tuned models | Unknown | N/A | |
| Rejection Sampling IMLE: Designing Priors for Better Few-Shot Image Synthesis | Unknown | N/A | |
| Asynchronous Bioplausible Neuron for Spiking Neural Networks for Event-Based Vision | Unknown | N/A | |
| Formula-Supervised Visual-Geometric Pre-training | Unknown | N/A | |
| MAGR: Manifold-Aligned Graph Regularization for Continual Action Quality Assessment | Unknown | N/A | |
| DG-PIC: Domain Generalized Point-In-Context Learning for Point Cloud Understanding | Unknown | N/A | |
| Correspondences of the Third Kind: Camera Pose Estimation from Object Reflection | Unknown | N/A | |
| SEA-RAFT: Simple, Efficient, Accurate RAFT for Optical Flow | Unknown | N/A | |
| TF-FAS: Twofold-Element Fine-Grained Semantic Guidance for Generalizable Face Anti-Spoofing | Unknown | N/A | |
| Robust Fitting on a Gate Quantum Computer | Unknown | N/A | |
| Defect Spectrum: A Granular Look of Large-scale Defect Datasets with Rich Semantics | Unknown | N/A | |
| Unveiling Advanced Frequency Disentanglement Paradigm for Low-Light Image Enhancement | Unknown | N/A | |
| Large-scale Reinforcement Learning for Diffusion Models | Unknown | N/A | |
| RAPiD-Seg: Range-Aware Pointwise Distance Distribution Networks for 3D LiDAR Segmentation | Unknown | N/A | |
| 3D Single-object Tracking in Point Clouds with High Temporal Variation | Unknown | N/A | |
| Self-supervised Shape Completion via Involution and Implicit Correspondences | Unknown | N/A | |
| Stepwise Multi-grained Boundary Detector for Point-supervised Temporal Action Localization | Unknown | N/A | |
| Imaging Interiors: An Implicit Solution to Electromagnetic Inverse Scattering Problems | Unknown | N/A | |
| Gaussian Splatting on the Move: Blur and Rolling Shutter Compensation for Natural Camera Motion | Unknown | N/A | |
| iHuman: Instant Animatable Digital Humans From Monocular Videos | Unknown | N/A | |
| LoA-Trans: Enhancing Visual Grounding by Location-Aware Transformers | Unknown | N/A | |
| HAC: Hash-grid Assisted Context for 3D Gaussian Splatting Compression | Unknown | N/A | |
| Energy-induced Explicit quantification for Multi-modality MRI fusion | Unknown | N/A | |
| Characterizing Model Robustness via Natural Input Gradients | Unknown | N/A | |
| ColorPeel: Color Prompt Learning with Diffusion Models via Color and Shape Disentanglement | Unknown | N/A | |
| GTPT: Group-based Token Pruning Transformer for Efficient Human Pose Estimation | Unknown | N/A | |
| FreeMotion: A Unified Framework for Number-free Text-to-Motion Synthesis | Unknown | N/A | |
| SPVLoc: Semantic Panoramic Viewport Matching for 6D Camera Localization in Unseen Environments | Unknown | N/A | |
| Resolving Scale Ambiguity in Multi-view 3D Reconstruction using Dual-Pixel Sensors | Unknown | N/A | |
| FSD-BEV: Foreground Self-Distillation for Multi-view 3D Object Detection | Unknown | N/A | |
| BugNIST - a Large Volumetric Dataset for Detection under Domain Shift | Unknown | N/A | |
| Salience-Based Adaptive Masking: Revisiting Token Dynamics for Enhanced Pre-training | Unknown | N/A | |
| ScanReason: Empowering 3D Visual Grounding with Reasoning Capabilities | Unknown | N/A | |
| See and Think: Embodied Agent in Virtual Environment | Unknown | N/A | |
| Scalar Function Topology Divergence: Comparing Topology of 3D Objects | Unknown | N/A | |
| VisFocus: Prompt-Guided Vision Encoders for OCR-Free Dense Document Understanding | Unknown | N/A | |
| GazeXplain: Learning to Predict Natural Language Explanations of Visual Scanpaths | Unknown | N/A | |
| Towards Robust Full Low-bit Quantization of Super Resolution Networks | Unknown | N/A | |
| When Do We Not Need Larger Vision Models? | Unknown | N/A | |
| Sync from the Sea: Retrieving Alignable Videos from Large-Scale Datasets | Unknown | N/A | |
| GVGEN: Text-to-3D Generation with Volumetric Representation | Unknown | N/A | |
| Omni-Recon: Harnessing Image-based Rendering for General-Purpose Neural Radiance Fields | Unknown | N/A | |
| UNIC: Universal Classification Models via Multi-teacher Distillation | Unknown | N/A | |
| MaRINeR: Enhancing Novel Views by Matching Rendered Images with Nearby References | Unknown | N/A | |
| ReLoo: Reconstructing Humans Dressed in Loose Garments from Monocular Video in the Wild | Unknown | N/A | |
| LEGO: Learning EGOcentric Action Frame Generation via Visual Instruction Tuning | Unknown | N/A | |
| PointNeRF++: A multi-scale, point-based Neural Radiance Field | Unknown | N/A | |
| Convex Relaxations for Manifold-Valued Markov Random Fields with Approximation Guarantees | Unknown | N/A | |
| Listen to Look into the Future: Audio-Visual Egocentric Gaze Anticipation | Unknown | N/A | |
| Differentiable Convex Polyhedra Optimization from Multi-view Images | Unknown | N/A | |
| WHAC: World-grounded Humans and Cameras | Unknown | N/A | |
| SceneVerse: Scaling 3D Vision-Language Learning for Grounded Scene Understanding | Unknown | N/A | |
| V-IRL: Grounding Virtual Intelligence in Real Life | Unknown | N/A | |
| SENC: Handling Self-collision in Neural Cloth Simulation | Unknown | N/A | |
| TrojVLM: Backdoor Attack Against Vision Language Models | Unknown | N/A | |
| Dataset Growth | Unknown | N/A | |
| m&m’s: A Benchmark to Evaluate Tool-Use for multi-step multi-modal Tasks | Unknown | N/A | |
| Avatar Fingerprinting for Authorized Use of Synthetic Talking-Head Videos | Unknown | N/A | |
| ReMamber: Referring Image Segmentation with Mamba Twister | Unknown | N/A | |
| Plain-Det: A Plain Multi-Dataset Object Detector | Unknown | N/A | |
| Pix2Gif: Motion-Guided Diffusion for GIF Generation | Unknown | N/A | |
| OpenPSG: Open-set Panoptic Scene Graph Generation via Large Multimodal Models | Unknown | N/A | |
| Integrating Markov Blanket Discovery into Causal Representation Learning for Domain Generalization | Unknown | N/A | |
| Plug-and-Play Learned Proximal Trajectory for 3D Sparse-View X-Ray Computed Tomography | Unknown | N/A | |
| LEIA: Latent View-invariant Embeddings for Implicit 3D Articulation | Unknown | N/A | |
| Beta-Tuned Timestep Diffusion Model | Unknown | N/A | |
| Bayesian Evidential Deep Learning for Online Action Detection | Unknown | N/A | |
| Local All-Pair Correspondence for Point Tracking | Unknown | N/A | |
| Fast Context-Based Low-Light Image Enhancement via Neural Implicit Representations | Unknown | N/A | |
| SEED: A Simple and Effective 3D DETR in Point Clouds | Unknown | N/A | |
| Intrinsic Single-Image HDR Reconstruction | Unknown | N/A | |
| DCDM: Diffusion-Conditioned-Diffusion Model for Scene Text Image Super-Resolution | Unknown | N/A | |
| Pathology-knowledge Enhanced Multi-instance Prompt Learning for Few-shot Whole Slide Image Classification | Unknown | N/A | |
| LaRa: Efficient Large-Baseline Radiance Fields | Unknown | N/A | |
| XPSR: Cross-modal Priors for Diffusion-based Image Super-Resolution | Unknown | N/A | |
| MobileNetV4: Universal Models for the Mobile Ecosystem | Unknown | N/A | |
| Efficient Snapshot Spectral Imaging: Calibration-Free Parallel Structure with Aperture Diffraction Fusion | Unknown | N/A | |
| AutoEval-Video: An Automatic Benchmark for Assessing Large Vision Language Models in Open-Ended Video Question Answering | Unknown | N/A | |
| DC-Solver: Improving Predictor-Corrector Diffusion Sampler via Dynamic Compensation | Unknown | N/A | |
| MutDet: Mutually Optimizing Pre-training for Remote Sensing Object Detection | Unknown | N/A | |
| Rethinking Data Augmentation for Robust LiDAR Semantic Segmentation in Adverse Weather | Unknown | N/A | |
| DiffiT: Diffusion Vision Transformers for Image Generation | Unknown | N/A | |
| Parrot: Pareto-optimal Multi-Reward Reinforcement Learning Framework for Text-to-Image Generation | Unknown | N/A | |
| DreamDissector: Learning Disentangled Text-to-3D Generation from 2D Diffusion Priors | Unknown | N/A | |
| Prioritized Semantic Learning for Zero-shot Instance Navigation | Unknown | N/A | |
| Flash-Splat: 3D Reflection Removal with Flash Cues and Gaussian Splats | Unknown | N/A | |
| Can OOD Object Detectors Learn from Foundation Models? | Unknown | N/A | |
| 2S-ODIS: Two-Stage Omni-Directional Image Synthesis by Geometric Distortion Correction | Unknown | N/A | |
| RadEdit: stress-testing biomedical vision models via diffusion image editing | Unknown | N/A | |
| Towards Real-world Event-guided Low-light Video Enhancement and Deblurring | Unknown | N/A | |
| Referring Atomic Video Action Recognition | Unknown | N/A | |
| Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation | Unknown | N/A | |
| TrackNeRF: Bundle Adjusting NeRF from Sparse and Noisy Views via Feature Tracks | Unknown | N/A | |
| SpatialFormer: Towards Generalizable Vision Transformers with Explicit Spatial Understanding | Unknown | N/A | |
| Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection | Unknown | N/A | |
| OccWorld: Learning a 3D Occupancy World Model for Autonomous Driving | Unknown | N/A | |
| MyVLM: Personalizing VLMs for User-Specific Queries | Unknown | N/A | |
| SignAvatars: A Large-scale 3D Sign Language Holistic Motion Dataset and Benchmark | Unknown | N/A | |
| AMEGO: Active Memory from long EGOcentric videos | Unknown | N/A | |
| Camera-LiDAR Cross-modality Gait Recognition | Unknown | N/A | |
| Diffusion-Generated Pseudo-Observations for High-Quality Sparse-View Reconstruction | Unknown | N/A | |
| Adaptive Correspondence Scoring for Unsupervised Medical Image Registration | Unknown | N/A | |
| VFusion3D: Learning Scalable 3D Generative Models from Video Diffusion Models | Unknown | N/A | |
| An Adaptive Screen-Space Meshing Approach for Normal Integration | Unknown | N/A | |
| Collaborative Control for Geometry-Conditioned PBR Image Generation | Unknown | N/A | |
| Open-set Domain Adaptation via Joint Error based Multi-class Positive and Unlabeled Learning | Unknown | N/A | |
| Quantization-Friendly Winograd Transformations for Convolutional Neural Networks | Unknown | N/A | |
| Look Around and Learn: Self-Training Object Detection by Exploration | Unknown | N/A | |
| Co-synthesis of Histopathology Nuclei Image-Label Pairs using a Context-Conditioned Joint Diffusion Model | Unknown | N/A | |
| Regularizing Dynamic Radiance Fields with Kinematic Fields | Unknown | N/A | |
| SpaceJAM: a Lightweight and Regularization-free Method for Fast Joint Alignment of Images | Unknown | N/A | |
| Rethinking Video-Text Understanding: Retrieval from Counterfactually Augmented Data | Unknown | N/A | |
| Risk-Aware Self-Consistent Imitation Learning for Trajectory Planning in Autonomous Driving | Unknown | N/A | |
| Smoothness, Synthesis, and Sampling: Re-thinking Unsupervised Multi-View Stereo with DIV Loss | Unknown | N/A | |
| DreamDrone: Text-to-Image Diffusion Models are Zero-shot Perpetual View Generators | Unknown | N/A | |
| Overcoming Distribution Mismatch in Quantizing Image Super-Resolution Networks | Unknown | N/A | |
| Large Motion Model for Unified Multi-Modal Motion Generation | Unknown | N/A | |
| Memory-Efficient Fine-Tuning for Quantized Diffusion Model | Unknown | N/A | |
| WaSt-3D: Wasserstein-2 Distance for Scene-to-Scene Stylization on 3D Gaussians | Unknown | N/A | |
| Label-anticipated Event Disentanglement for Audio-Visual Video Parsing | Unknown | N/A | |
| Unified Local-Cloud Decision-Making via Reinforcement Learning | Unknown | N/A | |
| Think before Placement: Common Sense Enhanced Transformer for Object Placement | Unknown | N/A | |
| The Hard Positive Truth about Vision-Language Compositionality | Unknown | N/A | |
| Eta Inversion: Designing an Optimal Eta Function for Diffusion-based Real Image Editing | Unknown | N/A | |
| GaussCtrl: Multi-View Consistent Text-Driven 3D Gaussian Splatting Editing | Unknown | N/A | |
| Concise Plane Arrangements for Low-Poly Surface and Volume Modelling | Unknown | N/A | |
| Prompting Language-Informed Distribution for Compositional Zero-Shot Learning | Unknown | N/A | |
| 3iGS: Factorised Tensorial Illumination for 3D Gaussian Splatting | Unknown | N/A | |
| Camera Height Doesn't Change: Unsupervised Training for Metric Monocular Road-Scene Depth Estimation | Unknown | N/A | |
| AEDNet: Adaptive Embedding and Multiview-Aware Disentanglement for Point Cloud Completion | Unknown | N/A | |
| Wavelength-Embedding-guided Filter-Array Transformer for Spectral Demosaicing | Unknown | N/A | |
| GAURA: Generalizable Approach for Unified Restoration and Rendering of Arbitrary Views | Unknown | N/A | |
| Efficient Bias Mitigation Without Privileged Information | Unknown | N/A | |
| MixDQ: Memory-Efficient Few-Step Text-to-Image Diffusion Models with Metric-Decoupled Mixed Precision Quantization | Unknown | N/A | |
| Towards Open-Ended Visual Recognition with Large Language Models | Unknown | N/A | |
| Be Yourself: Bounded Attention for Multi-Subject Text-to-Image Generation | Unknown | N/A | |
| MotionLCM: Real-time Controllable Motion Generation via Latent Consistency Model | Unknown | N/A | |
| IFTR: An Instance-Level Fusion Transformer for Visual Collaborative Perception | Unknown | N/A | |
| On the Utility of 3D Hand Poses for Action Recognition | Unknown | N/A | |
| RodinHD: High-Fidelity 3D Avatar Generation with Diffusion Models | Unknown | N/A | |
| IRGen: Generative Modeling for Image Retrieval | Unknown | N/A | |
| A Simple Latent Diffusion Approach for Panoptic Segmentation and Mask Inpainting | Unknown | N/A | |
| LayeredFlow: A Real-World Benchmark for Non-Lambertian Multi-Layer Optical Flow | Unknown | N/A | |
| VISA: Reasoning Video Object Segmentation via Large Language Model | Unknown | N/A | |
| Learning Representations of Satellite Images From Metadata Supervision | Unknown | N/A | |
| Adaptive Parametric Activation | Unknown | N/A | |
| Scaling Backwards: Minimal Synthetic Pre-training? | Unknown | N/A | |
| Learned Neural Physics Simulation for Articulated 3D Human Pose Reconstruction | Unknown | N/A | |
| Towards Multi-modal Transformers in Federated Learning | Unknown | N/A | |
| Latent-INR: A Flexible Framework for Implicit Representations of Videos with Discriminative Semantics | Unknown | N/A | |
| InstaStyle: Inversion Noise of a Stylized Image is Secretly a Style Adviser | Unknown | N/A | |
| ViC-MAE: Self-Supervised Representation Learning from Images and Video with Contrastive Masked Autoencoders | Unknown | N/A | |
| DreamMover: Leveraging the Prior of Diffusion Models for Image Interpolation with Large Motion | Unknown | N/A | |
| Image-Feature Weak-to-Strong Consistency: An Enhanced Paradigm for Semi-Supervised Learning | Unknown | N/A | |
| FisherRF: Active View Selection and Mapping with Radiance Fields using Fisher Information | Unknown | N/A | |
| General and Task-Oriented Video Segmentation | Unknown | N/A | |
| Open Vocabulary 3D Scene Understanding via Geometry Guided Self-Distillation | Unknown | N/A | |
| Benchmarking Object Detectors with COCO: A New Path Forward | Unknown | N/A | |
| Diffusion Model is a Good Pose Estimator from 3D RF-Vision | Unknown | N/A | |
| UPose3D: Uncertainty-Aware 3D Human Pose Estimation with Cross-View and Temporal Cues | Unknown | N/A | |
| Grounding Language Models for Visual Entity Recognition | Unknown | N/A | |
| Soft Shadow Diffusion (SSD): Physics-inspired Learning for 3D Computational Periscopy | Unknown | N/A | |
| Learning 3D-aware GANs from Unposed Images with Template Feature Field | Unknown | N/A | |
| Token Compensator: Altering Inference Cost of Vision Transformer without Re-Tuning | Unknown | N/A | |
| DεpS: Delayed ε-Shrinking for Faster Once-For-All Training | Unknown | N/A | |
| Propose, Assess, Search: Harnessing LLMs for Goal-Oriented Planning in Instructional Videos | Unknown | N/A | |
| Human Hair Reconstruction with Strand-Aligned 3D Gaussians | Unknown | N/A | |
| SA-DVAE: Improving Zero-Shot Skeleton-Based Action Recognition by Disentangled Variational Autoencoders | Unknown | N/A | |
| Bridge Past and Future: Overcoming Information Asymmetry in Incremental Object Detection | Unknown | N/A | |
| Global-to-Pixel Regression for Human Mesh Recovery | Unknown | N/A | |
| CIC-BART-SSA: Controllable Image Captioning with Structured Semantic Augmentation | Unknown | N/A | |
| Rethinking Image Super Resolution from Training Data Perspectives | Unknown | N/A | |
| MarvelOVD: Marrying Object Recognition and Vision-Language Models for Robust Open-Vocabulary Object Detection | Unknown | N/A | |
| Interactive 3D Object Detection with Prompts | Unknown | N/A | |
| Learning to Robustly Reconstruct Dynamic Scenes from Low-light Spike Streams | Unknown | N/A | |
| Neural Volumetric World Models for Autonomous Driving | Unknown | N/A | |
| Bad Students Make Great Teachers: Active Learning Accelerates Large-Scale Visual Understanding | Unknown | N/A | |
| COIN: Control-Inpainting Diffusion Prior for Human and Camera Motion Estimation | Unknown | N/A | |
| ControlLLM: Augment Language Models with Tools by Searching on Graphs | Unknown | N/A | |
| Analytic-Splatting: Anti-Aliased 3D Gaussian Splatting via Analytic Integration | Unknown | N/A | |
| Portrait4D-v2: Pseudo Multi-View Data Creates Better 4D Head Synthesizer | Unknown | N/A | |
| Learning from the Web: Language Drives Weakly-Supervised Incremental Learning for Semantic Segmentation | Unknown | N/A | |
| Uni3DL: A Unified Model for 3D Vision-Language Understanding | Unknown | N/A | |
| G3R: Gradient Guided Generalizable Reconstruction | Unknown | N/A | |
| Goldfish: Vision-Language Understanding of Arbitrarily Long Videos | Unknown | N/A | |
| T-MAE: Temporal Masked Autoencoders for Point Cloud Representation Learning | Unknown | N/A | |
| HAT: History-Augmented Anchor Transformer for Online Temporal Action Localization | Unknown | N/A | |
| Invertible Neural Warp for NeRF | Unknown | N/A | |
| AddBiomechanics Dataset: Capturing the Physics of Human Motion at Scale | Unknown | N/A | |
| Efficient and Versatile Robust Fine-Tuning of Zero-shot Models | Unknown | N/A | |
| MVSGaussian: Fast Generalizable Gaussian Splatting Reconstruction from Multi-View Stereo | Unknown | N/A | |
| Language-Image Pre-training with Long Captions | Unknown | N/A | |
| SuperFedNAS: Cost-Efficient Federated Neural Architecture Search for On-Device Inference | Unknown | N/A | |
| CoReS: Orchestrating the Dance of Reasoning and Segmentation | Unknown | N/A | |
| MambaIR: A Simple Baseline for Image Restoration with State-Space Model | Unknown | N/A | |
| EBDM: Exemplar-guided Image Translation with Brownian-bridge Diffusion Models | Unknown | N/A | |
| I Can't Believe It's Not Scene Flow! | Unknown | N/A | |
| Compress3D: a Compressed Latent Space for 3D Generation from a Single Image | Unknown | N/A | |
| Bi-directional Contextual Attention for 3D Dense Captioning | Unknown | N/A | |
| Scalable Group Choreography via Variational Phase Manifold Learning | Unknown | N/A | |
| Quality Assured: Rethinking Annotation Strategies in Imaging AI | Unknown | N/A | |
| Distribution-Aware Robust Learning from Long-Tailed Data with Noisy Labels | Unknown | N/A | |
| TPA3D: Triplane Attention for Fast Text-to-3D Generation | Unknown | N/A | |
| Augmented Neural Fine-tuning for Efficient Backdoor Purification | Unknown | N/A | |
| Human Pose Recognition via Occlusion-Preserving Abstract Images | Unknown | N/A | |
| AID-AppEAL: Automatic Image Dataset and Algorithm for Content Appeal Enhancement and Assessment Labeling | Unknown | N/A | |
| Retrieval Robust to Object Motion Blur | Unknown | N/A | |
| Rethinking Deep Unrolled Model for Accelerated MRI Reconstruction | Unknown | N/A | |
| Occlusion-Aware Seamless Segmentation | Unknown | N/A | |
| TTT-MIM: Test-Time Training with Masked Image Modeling for Denoising Distribution Shifts | Unknown | N/A | |
| Diffusion Models for Open-Vocabulary Segmentation | Unknown | N/A | |
| Rethinking Unsupervised Outlier Detection via Multiple Thresholding | Unknown | N/A | |
| OpenKD: Opening Prompt Diversity for Zero- and Few-shot Keypoint Detection | Unknown | N/A | |
| Stream Query Denoising for Vectorized HD-Map Construction | Unknown | N/A | |
| Learn to Preserve and Diversify: Parameter-Efficient Group with Orthogonal Regularization for Domain Generalization | Unknown | N/A | |
| Videoshop: Localized Semantic Video Editing with Noise-Extrapolated Diffusion Inversion | Unknown | N/A | |
| Photorealistic Object Insertion with Diffusion-Guided Inverse Rendering | Unknown | N/A | |
| Beat-It: Beat-Synchronized Multi-Condition 3D Dance Generation | Unknown | N/A | |
| SkyMask: Attack-agnostic Robust Federated Learning with Fine-grained Learnable Masks | Unknown | N/A | |
| PYRA: Parallel Yielding Re-Activation for Training-Inference Efficient Task Adaptation | Unknown | N/A | |
| Pixel-GS: Density Control with Pixel-aware Gradient for 3D Gaussian Splatting | Unknown | N/A | |
| WorldPose: A World Cup Dataset for Global 3D Human Pose Estimation | Unknown | N/A | |
| Mahalanobis Distance-based Multi-view Optimal Transport for Multi-view Crowd Localization | Unknown | N/A | |
| Language-Driven 6-DoF Grasp Detection Using Negative Prompt Guidance | Unknown | N/A | |
| SHINE: Saliency-aware HIerarchical NEgative Ranking for Compositional Temporal Grounding | Unknown | N/A | |
| Quantized Prompt for Efficient Generalization of Vision-Language Models | Unknown | N/A | |
| Modality Translation for Object Detection Adaptation without forgetting prior knowledge | Unknown | N/A | |
| How Video Meetings Change Your Expression | Unknown | N/A | |
| Audio-driven Talking Face Generation with Stabilized Synchronization Loss | Unknown | N/A | |
| Learning to Obstruct Few-Shot Image Classification over Restricted Classes | Unknown | N/A | |
| Train Till You Drop: Towards Stable and Robust Source-free Unsupervised 3D Domain Adaptation | Unknown | N/A | |
| L-DiffER: Single Image Reflection Removal with Language-based Diffusion Model | Unknown | N/A | |
| DreamStruct: Understanding Slides and User Interfaces via Synthetic Data Generation | Unknown | N/A | |
| Distilling Diffusion Models into Conditional GANs | Unknown | N/A | |
| UMBRAE: Unified Multimodal Brain Decoding | Unknown | N/A | |
| AdaShield: Safeguarding Multimodal Large Language Models from Structure-based Attack via Adaptive Shield Prompting | Unknown | N/A | |
| Model Breadcrumbs: Scaling Multi-Task Model Merging with Sparse Masks | Unknown | N/A | |
| HYDRA: A Hyper Agent for Dynamic Compositional Visual Reasoning | Unknown | N/A | |
| BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion | Unknown | N/A | |
| Multiscale Graph Texture Network | Unknown | N/A | |
| LetsMap: Unsupervised Representation Learning for Label-Efficient Semantic BEV Mapping | Unknown | N/A | |
| Bottom-Up Domain Prompt Tuning for Generalized Face Anti-Spoofing | Unknown | N/A | |
| Blind image deblurring with noise-robust kernel estimation | Unknown | N/A | |
| Free-Viewpoint Video of Outdoor Sports Using a Drone | Unknown | N/A | |
| RePOSE: 3D Human Pose Estimation via Spatio-Temporal Depth Relational Consistency | Unknown | N/A | |
| Binomial Self-compensation for Motion Error in Dynamic 3D Scanning | Unknown | N/A | |
| Distill Gold from Massive Ores: Bi-level Data Pruning towards Efficient Dataset Distillation | Unknown | N/A | |
| Momentum Auxiliary Network for Supervised Local Learning | Unknown | N/A | |
| HPFF: Hierarchical Locally Supervised Learning with Patch Feature Fusion | Unknown | N/A | |
| Style-Extracting Diffusion Models for Semi-Supervised Histopathology Segmentation | Unknown | N/A | |
| Rethinking LiDAR Domain Generalization: Single Source as Multiple Density Domains | Unknown | N/A | |
| PQ-SAM: Post-training Quantization for Segment Anything Model | Unknown | N/A | |
| COHO: Context-Sensitive City-Scale Hierarchical Urban Layout Generation | Unknown | N/A | |
| Diffusion-Based Image-to-Image Translation by Noise Correction via Prompt Interpolation | Unknown | N/A | |
| TalkingGaussian: Structure-Persistent 3D Talking Head Synthesis via Gaussian Splatting | Unknown | N/A | |
| Improving Zero-Shot Generalization for CLIP with Variational Adapter | Unknown | N/A | |
| LaWa: Using Latent Space for In-Generation Image Watermarking | Unknown | N/A | |
| Topology-Preserving Downsampling of Binary Images | Unknown | N/A | |
| Cocktail Universal Adversarial Attack on Deep Neural Networks | Unknown | N/A | |
| Hypernetworks for Generalizable BRDF Representation | Unknown | N/A | |
| ColorMAE: Exploring data-independent masking strategies in Masked AutoEncoders | Unknown | N/A | |
| Classification Matters: Improving Video Action Detection with Class-Specific Attention | Unknown | N/A | |
| Improving Medical Multi-modal Contrastive Learning with Expert Annotations | Unknown | N/A | |
| Pose-Aware Self-Supervised Learning with Viewpoint Trajectory Regularization | Unknown | N/A | |
| AccDiffusion: An Accurate Method for Higher-Resolution Image Generation | Unknown | N/A | |
| Leveraging temporal contextualization for video action recognition | Unknown | N/A | |
| AdaGlimpse: Active Visual Exploration with Arbitrary Glimpse Position and Scale | Unknown | N/A | |
| ZigMa: A DiT-style Zigzag Mamba Diffusion Model | Unknown | N/A | |
| Deep Nets with Subsampling Layers Unwittingly Discard Useful Activations at Test-Time | Unknown | N/A | |
| Safe-Sim: Safety-Critical Closed-Loop Traffic Simulation with Diffusion-Controllable Adversaries | Unknown | N/A | |
| MVSplat: Efficient 3D Gaussian Splatting from Sparse Multi-View Images | Unknown | N/A | |
| Data Collection-free Masked Video Modeling | Unknown | N/A | |
| Resilience of Entropy Model in Distributed Neural Networks | Unknown | N/A | |
| Implicit Concept Removal of Diffusion Models | Unknown | N/A | |
| VideoAgent: A Memory-augmented Multimodal Agent for Video Understanding | Unknown | N/A | |
| Restoring Images in Adverse Weather Conditions via Histogram Transformer | Unknown | N/A | |
| PosFormer: Recognizing Complex Handwritten Mathematical Expression with Position Forest Transformer | Unknown | N/A | |
| NGP-RT: Fusing Multi-Level Hash Features with Lightweight Attention for Real-Time Novel View Synthesis | Unknown | N/A | |
| G2fR: Frequency Regularization in Grid-based Feature Encoding Neural Radiance Fields | Unknown | N/A | |
| Getting it Right: Improving Spatial Consistency in Text-to-Image Models | Unknown | N/A | |
| Generating 3D House Wireframes with Semantics | Unknown | N/A | |
| SegPoint: Segment Any Point Cloud via Large Language Model | Unknown | N/A | |
| Navigation Instruction Generation with BEV Perception and Large Language Models | Unknown | N/A | |
| The Fabrication of Reality and Fantasy: Scene Generation with LLM-Assisted Prompt Interpretation | Unknown | N/A | |
| FlashSplat: 2D to 3D Gaussian Splatting Segmentation Solved Optimally | Unknown | N/A | |
| Eliminating Feature Ambiguity for Few-Shot Segmentation | Unknown | N/A | |
| Alternate Diverse Teaching for Semi-supervised Medical Image Segmentation | Unknown | N/A | |
| GENIXER: Empowering Multimodal Large Language Models as a Powerful Data Generator | Unknown | N/A | |
| BLINK: Multimodal Large Language Models Can See but Not Perceive | Unknown | N/A | |
| PreLAR: World Model Pre-training with Learnable Action Representation | Unknown | N/A | |
| Multi-HMR: Multi-Person Whole-Body Human Mesh Recovery in a Single Shot | Unknown | N/A | |
| Diffusion Models for Monocular Depth Estimation: Overcoming Challenging Conditions | Unknown | N/A | |
| FreestyleRet: Retrieving Images from Style-Diversified Queries | Unknown | N/A | |
| Raindrop Clarity: A Dual-Focused Dataset for Day and Night Raindrop Removal | Unknown | N/A | |
| ReGround: Improving Textual and Spatial Grounding at No Cost | Unknown | N/A | |
| CardiacNet: Learning to Reconstruct Abnormalities for Cardiac Disease Assessment from Echocardiogram Videos | Unknown | N/A | |
| Per-Gaussian Embedding-Based Deformation for Deformable 3D Gaussian Splatting | Unknown | N/A | |
| Efficient Image Pre-Training with Siamese Cropped Masked Autoencoders | Unknown | N/A | |
| Harnessing Text-to-Image Diffusion Models for Category-Agnostic Pose Estimation | Unknown | N/A | |
| Image Demoireing in RAW and sRGB Domains | Unknown | N/A | |
| Reliability in Semantic Segmentation: Can We Use Synthetic Data? | Unknown | N/A | |
| Prompting Future Driven Diffusion Model for Hand Motion Prediction | Unknown | N/A | |
| Elevating All Zero-Shot Sketch-Based Image Retrieval Through Multimodal Prompt Learning | Unknown | N/A | |
| 3DFG-PIFu: 3D Feature Grids for Human Digitization from Sparse Views | Unknown | N/A | |
| Lazy Diffusion Transformer for Interactive Image Editing | Unknown | N/A | |
| Robust Calibration of Large Vision-Language Adapters | Unknown | N/A | |
| Leveraging Hierarchical Feature Sharing for Efficient Dataset Condensation | Unknown | N/A | |
| Improving Domain Generalization in Self-Supervised Monocular Depth Estimation via Stabilized Adversarial Training | Unknown | N/A | |
| AugDETR: Improving Multi-scale Learning for Detection Transformer | Unknown | N/A | |
| Spherical World-Locking for Audio-Visual Localization in Egocentric Videos | Unknown | N/A | |
| SIGMA: Sinkhorn-Guided Masked Video Modeling | Unknown | N/A | |
| Generative Camera Dolly: Extreme Monocular Dynamic Novel View Synthesis | Unknown | N/A | |
| Distribution Alignment for Fully Test-Time Adaptation with Dynamic Online Data Streams | Unknown | N/A | |
| Understanding Physical Dynamics with Counterfactual World Modeling | Unknown | N/A | |
| SemTrack: A Large-scale Dataset for Semantic Tracking in the Wild | Unknown | N/A | |
| VideoMamba: Spatio-Temporal Selective State Space Model | Unknown | N/A | |
| Text to Layer-wise 3D Clothed Human Generation | Unknown | N/A | |
| Fully Sparse 3D Occupancy Prediction | Unknown | N/A | |
| CG-SLAM: Efficient Dense RGB-D SLAM in a Consistent Uncertainty-aware 3D Gaussian Field | Unknown | N/A | |
| High-Fidelity 3D Textured Shapes Generation by Sparse Encoding and Adversarial Decoding | Unknown | N/A | |
| PointLLM: Empowering Large Language Models to Understand Point Clouds | Unknown | N/A | |
| Omni6D: Large-Vocabulary 3D Object Dataset for Category-Level 6D Object Pose Estimation | Unknown | N/A | |
| Forest2Seq: Revitalizing Order Prior for Sequential Indoor Scene Synthesis | Unknown | N/A | |
| AnimatableDreamer: Text-Guided Non-rigid 3D Model Generation and Reconstruction with Canonical Score Distillation | Unknown | N/A | |
| Spatially-Variant Degradation Model for Dataset-free Super-resolution | Unknown | N/A | |
| Learning Exhaustive Correlation for Spectral Super-Resolution: Where Spatial-Spectral Attention Meets Linear Dependence | Unknown | N/A | |
| SUMix: Mixup with Semantic and Uncertain Information | Unknown | N/A | |
| Local Action-Guided Motion Diffusion Model for Text-to-Motion Generation | Unknown | N/A | |
| EAFormer: Scene Text Segmentation with Edge-Aware Transformers | Unknown | N/A | |
| DySeT: a Dynamic Masked Self-distillation Approach for Robust Trajectory Prediction | Unknown | N/A | |
| LaPose: Laplacian Mixture Shape Modeling for RGB-Based Category-Level Object Pose Estimation | Unknown | N/A | |
| Upper-body Hierarchical Graph for Skeleton Based Emotion Recognition in Assistive Driving | Unknown | N/A | |
| Fine-Grained Scene Graph Generation via Sample-Level Bias Prediction | Unknown | N/A | |
| Zero-Shot Detection of AI-Generated Images | Unknown | N/A | |
| Boosting 3D Single Object Tracking with 2D Matching Distillation and 3D Pre-training | Unknown | N/A | |
| Exploring Guided Sampling of Conditional GANs | Unknown | N/A | |
| TCC-Det: Temporarily consistent cues for weakly-supervised 3D detection | Unknown | N/A | |
| Radiative Gaussian Splatting for Efficient X-ray Novel View Synthesis | Unknown | N/A | |
| OPEN: Object-wise Position Embedding for Multi-view 3D Object Detection | Unknown | N/A | |
| Early Preparation Pays Off: New Classifier Pre-tuning for Class Incremental Semantic Segmentation | Unknown | N/A | |
| Kalman-Inspired Feature Propagation for Video Face Super-Resolution | Unknown | N/A | |
| Select and Distill: Selective Dual-Teacher Knowledge Transfer for Continual Learning on Vision-Language Models | Unknown | N/A | |
| VideoMamba: State Space Model for Efficient Video Understanding | Unknown | N/A | |
| Heterogeneous Graph Learning for Scene Graph Prediction in 3D Point Clouds | Unknown | N/A | |
| Omniview-Tuning: Boosting Viewpoint Invariance of Vision-Language Pre-training Models | Unknown | N/A | |
| DINO-Tracker: Taming DINO for Self-Supervised Point Tracking in a Single Video | Unknown | N/A | |
| Improving Intervention Efficacy via Concept Realignment in Concept Bottleneck Models | Unknown | N/A | |
| Brain Netflix: Scaling Data to Reconstruct Videos from Brain Signals | Unknown | N/A | |
| Source Prompt Disentangled Inversion for Boosting Image Editability with Diffusion Models | Unknown | N/A | |
| Equivariant Spatio-Temporal Self-Supervision for LiDAR Object Detection | Unknown | N/A | |
| FreeAugment: Data Augmentation Search Across All Degrees of Freedom | Unknown | N/A | |
| I2-SLAM: Inverting Imaging Process for Robust Photorealistic Dense SLAM | Unknown | N/A | |
| FlashTex: Fast Relightable Mesh Texturing with LightControlNet | Unknown | N/A | |
| GS-Pose: Category-Level Object Pose Estimation via Geometric and Semantic Correspondence | Unknown | N/A | |
| ArtVLM: Attribute Recognition Through Vision-Based Prefix Language Modeling | Unknown | N/A | |
| PanoFree: Tuning-Free Holistic Multi-view Image Generation with Cross-view Self-Guidance | Unknown | N/A | |
| SOS: Segment Object System for Open-World Instance Segmentation With Object Priors | Unknown | N/A | |
| Lagrangian Hashing for Compressed Neural Field Representations | Unknown | N/A | |
| Thermal3D-GS: Physics-induced 3D Gaussians for Thermal Infrared Novel-view Synthesis | Unknown | N/A | |
| Gaze Target Detection Based on Head-Local-Global Coordination | Unknown | N/A | |
| 3DSA: Multi-View 3D Human Pose Estimation With 3D Space Attention Mechanisms | Unknown | N/A | |
| An Economic Framework for 6-DoF Grasp Detection | Unknown | N/A | |
| GaussianFormer: Scene as Gaussians for Vision-Based 3D Semantic Occupancy Prediction | Unknown | N/A | |
| PromptCCD: Learning Gaussian Mixture Prompt Pool for Continual Category Discovery | Unknown | N/A | |
| Multi-Label Cluster Discrimination for Visual Representation Learning | Unknown | N/A | |
| Plan, Posture and Go: Towards Open-vocabulary Text-to-Motion Generation | Unknown | N/A | |
| DAMSDet: Dynamic Adaptive Multispectral Detection Transformer with Competitive Query Selection and Adaptive Feature Fusion | Unknown | N/A | |
| CLIP-Guided Generative Networks for Transferable Targeted Adversarial Attacks | Unknown | N/A | |
| Flash Cache: Reducing Bias in Radiance Cache Based Inverse Rendering | Unknown | N/A | |
| RGNet: A Unified Clip Retrieval and Grounding Network for Long Videos | Unknown | N/A | |
| Progressive Classifier and Feature Extractor Adaptation for Unsupervised Domain Adaptation on Point Clouds | Unknown | N/A | |
| RISurConv: Rotation Invariant Surface Attention-Augmented Convolutions for 3D Point Cloud Classification and Segmentation | Unknown | N/A | |
| StyleTokenizer: Defining Image Style by a Single Instance for Controlling Diffusion Models | Unknown | N/A | |
| Deblur e-NeRF: NeRF from Motion-Blurred Events under High-speed or Low-light Conditions | Unknown | N/A | |
| Alignist: CAD-Informed Orientation Distribution Estimation by Fusing Shape and Correspondences | Unknown | N/A | |
| Preventing Catastrophic Overfitting in Fast Adversarial Training: A Bi-level Optimization Perspective | Unknown | N/A | |
| Projecting Points to Axes: Oriented Object Detection via Point-Axis Representation | Unknown | N/A | |
| MagicEraser: Erasing Any Objects via Semantics-Aware Control | Unknown | N/A | |
| Reliable Spatial-Temporal Voxels For Multi-Modal Test-Time Adaptation | Unknown | N/A | |
| SparseSSP: 3D Subcellular Structure Prediction from Sparse-View Transmitted Light Images | Unknown | N/A | |
| NL2Contact: Natural Language Guided 3D Hand-Object Contact Modeling with Diffusion Model | Unknown | N/A | |
| Self-Adapting Large Visual-Language Models to Edge Devices across Visual Modalities | Unknown | N/A | |
| 3D Small Object Detection with Dynamic Spatial Pruning | Unknown | N/A | |
| Semantically Guided Representation Learning For Action Anticipation | Unknown | N/A | |
| MemBN: Robust Test-Time Adaptation via Batch Norm with Statistics Memory | Unknown | N/A | |
| ScanTalk: 3D Talking Heads from Unregistered Scans | Unknown | N/A | |
| FreeInit: Bridging Initialization Gap in Video Diffusion Models | Unknown | N/A | |
| Synchronous Diffusion for Unsupervised Smooth Non-Rigid 3D Shape Matching | Unknown | N/A | |
| Controllable Navigation Instruction Generation with Chain of Thought Prompting | Unknown | N/A | |
| TimeCraft: Navigate Weakly-Supervised Temporal Grounded Video Question Answering via Bi-directional Reasoning | Unknown | N/A | |
| LiveHPS++: Robust and Coherent Motion Capture in Dynamic Free Environment | Unknown | N/A | |
| EgoPoser: Robust Real-Time Egocentric Pose Estimation from Sparse and Intermittent Observations Everywhere | Unknown | N/A | |
| SuperGaussian: Repurposing Video Models for 3D Super Resolution | Unknown | N/A | |
| Towards Model-Agnostic Dataset Condensation by Heterogeneous Models | Unknown | N/A | |
| Decoupling Common and Unique Representations for Multimodal Self-supervised Learning | Unknown | N/A | |
| MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training | Unknown | N/A | |
| Optimizing Diffusion Models for Joint Trajectory Prediction and Controllable Generation | Unknown | N/A | |
| Open-Vocabulary 3D Semantic Segmentation with Text-to-Image Diffusion Models | Unknown | N/A | |
| Tracking Meets LoRA: Faster Training, Larger Model, Stronger Performance | Unknown | N/A | |
| CLR-GAN: Improving GANs Stability and Quality via Consistent Latent Representation and Reconstruction | Unknown | N/A | |
| D-SCo: Dual-Stream Conditional Diffusion for Monocular Hand-Held Object Reconstruction | Unknown | N/A | |
| Pairwise Distance Distillation for Unsupervised Real-World Image Super-Resolution | Unknown | N/A | |
| Decomposed Vector-Quantized Variational Autoencoder for Human Grasp Generation | Unknown | N/A | |
| UniFS: Universal Few-shot Instance Perception with Point Representations | Unknown | N/A | |
| Linearly Controllable GAN: Unsupervised Feature Categorization and Decomposition for Image Generation and Manipulation | Unknown | N/A | |
| Physics-Based Interaction with 3D Objects via Video Generation | Unknown | N/A | |
| Taming Latent Diffusion Model for Neural Radiance Field Inpainting | Unknown | N/A | |
| Shedding More Light on Robust Classifiers under the lens of Energy-based Models | Unknown | N/A | |
| CoherentGS: Sparse Novel View Synthesis with Coherent 3D Gaussians | Unknown | N/A | |
| Unleashing the Power of Prompt-driven Nucleus Instance Segmentation | Unknown | N/A | |
| FREST: Feature RESToration for Semantic Segmentation under Multiple Adverse Conditions | Unknown | N/A | |
| 3DEgo: 3D Editing on the Go! | Unknown | N/A | |
| Domain-adaptive Video Deblurring via Test-time Blurring | Unknown | N/A | |
| NeuroNCAP: Photorealistic Closed-loop Safety Testing for Autonomous Driving | Unknown | N/A | |
| Progressive Pretext Task Learning for Human Trajectory Prediction | Unknown | N/A | |
| Hyperion – A fast, versatile symbolic Gaussian Belief Propagation framework for Continuous-Time SLAM | Unknown | N/A | |
| Isomorphic Pruning for Vision Models | Unknown | N/A | |
| Reprojection Errors as Prompts for Efficient Scene Coordinate Regression | Unknown | N/A | |
| GTP-4o: Modality-prompted Heterogeneous Graph Learning for Omni-modal Biomedical Representation | Unknown | N/A | |
| DreamMotion: Space-Time Self-Similar Score Distillation for Zero-Shot Video Editing | Unknown | N/A | |
| VideoClusterNet: Self-Supervised and Adaptive Face Clustering for Videos | Unknown | N/A | |
| Hiding Imperceptible Noise in Curvature-Aware Patches for 3D Point Cloud Attack | Unknown | N/A | |
| Interleaving One-Class and Weakly-Supervised Models with Adaptive Thresholding for Unsupervised Video Anomaly Detection | Unknown | N/A | |
| YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information | Unknown | N/A | |
| Cross-Domain Learning for Video Anomaly Detection with Limited Supervision | Unknown | N/A | |
| Unsupervised Multi-modal Medical Image Registration via Invertible Translation | Unknown | N/A | |
| CRM: Single Image to 3D Textured Mesh with Convolutional Reconstruction Model | Unknown | N/A | |
| SGS-SLAM: Semantic Gaussian Splatting For Neural Dense SLAM | Unknown | N/A | |
| View Selection for 3D Captioning via Diffusion Ranking | Unknown | N/A | |
| OMG: Occlusion-friendly Personalized Multi-concept Generation in Diffusion Models | Unknown | N/A | |
| WeCromCL: Weakly Supervised Cross-Modality Contrastive Learning for Transcription-only Supervised Text Spotting | Unknown | N/A | |
| Enhancing Optimization Robustness in 1-bit Neural Networks through Stochastic Sign Descent | Unknown | N/A | |
| WildVidFit: Video Virtual Try-On in the Wild via Image-Based Controlled Diffusion Models | Unknown | N/A | |
| BeNeRF: Neural Radiance Fields from a Single Blurry Image and Event Stream | Unknown | N/A | |
| DreamDiffusion: High-Quality EEG-to-Image Generation with Temporal Masked Signal Modeling and CLIP Alignment | Unknown | N/A | |
| SCP-Diff: Spatial-Categorical Joint Prior for Diffusion Based Semantic Image Synthesis | Unknown | N/A | |
| PoseAugment: Generative Human Pose Data Augmentation with Physical Plausibility for IMU-based Motion Capture | Unknown | N/A | |
| PixArt-Sigma: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation | Unknown | N/A | |
| GiT: Towards Generalist Vision Transformer through Universal Language Interface | Unknown | N/A | |
| Hierarchical Gaussian Mixture Normalizing Flow Modeling for Unified Anomaly Detection | Unknown | N/A | |
| Improving Unsupervised Domain Adaptation: A Pseudo-Candidate Set Approach | Unknown | N/A | |
| Surface-Centric Modeling for High-Fidelity Generalizable Neural Surface Reconstruction | Unknown | N/A | |
| BaSIC: BayesNet Structure Learning for Computational Scalable Neural Image Compression | Unknown | N/A | |
| Integer-Valued Training and Spike-driven Inference Spiking Neural Network for High-performance and Energy-efficient Object Detection | Unknown | N/A | |
| Group Testing for Accurate and Efficient Range-Based Near Neighbor Search for Plagiarism Detection | Unknown | N/A | |
| CoR-GS: Sparse-View 3D Gaussian Splatting via Co-Regularization | Unknown | N/A | |
| SMILe: Leveraging Submodular Mutual Information For Robust Few-Shot Object Detection | Unknown | N/A | |
| Customize-A-Video: One-Shot Motion Customization of Text-to-Video Diffusion Models | Unknown | N/A | |
| S-JEPA: A Joint Embedding Predictive Architecture for Skeletal Action Recognition | Unknown | N/A | |
| SwapAnything: Enabling Arbitrary Object Swapping in Personalized Image Editing | Unknown | N/A | |
| ProTIP: Probabilistic Robustness Verification on Text-to-Image Diffusion Models against Stochastic Perturbation | Unknown | N/A | |
| OvSW: Overcoming Silent Weights for Accurate Binary Neural Networks | Unknown | N/A | |
| Leveraging Near-Field Lighting for Monocular Depth Estimation from Endoscopy Videos | Unknown | N/A | |
| Mamba-ND: Selective State Space Modeling for Multi-Dimensional Data | Unknown | N/A | |
| Click Prompt Learning with Optimal Transport for Interactive Segmentation | Unknown | N/A | |
| T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy | Unknown | N/A | |
| 3D Human Pose Estimation via Non-Causal Retentive Networks | Unknown | N/A | |
| 6DoF Head Pose Estimation through Explicit Bidirectional Interaction with Face Geometry | Unknown | N/A | |
| Enhancing Tampered Text Detection through Frequency Feature Fusion and Decomposition | Unknown | N/A | |
| DynoSurf: Neural Deformation-based Temporally Consistent Dynamic Surface Reconstruction | Unknown | N/A | |
| Learning Diffusion Models for Multi-View Anomaly Detection | Unknown | N/A | |
| Masked Angle-Aware Autoencoder for Remote Sensing Images | Unknown | N/A | |
| Multi-modal Relation Distillation for Unified 3D Representation Learning | Unknown | N/A | |
| LongVLM: Efficient Long Video Understanding via Large Language Models | Unknown | N/A | |
| The All-Seeing Project V2: Towards General Relation Comprehension of the Open World | Unknown | N/A | |
| Diff3DETR: Agent-based Diffusion Model for Semi-supervised 3D Object Detection | Unknown | N/A | |
| Layout-Corrector: Alleviating Layout Sticking Phenomenon in Discrete Diffusion Model | Unknown | N/A | |
| Light-in-Flight for a World-in-Motion | Unknown | N/A | |
| Segment3D: Learning Fine-Grained Class-Agnostic 3D Segmentation without Manual Labels | Unknown | N/A | |
| Learning with Unmasked Tokens Drives Stronger Vision Learners | Unknown | N/A | |
| Efficient Training of Spiking Neural Networks with Multi-Parallel Implicit Stream Architecture | Unknown | N/A | |
| Deep Patch Visual SLAM | Unknown | N/A | |
| LiteSAM is Actually what you Need for segment Everything | Unknown | N/A | |
| GarmentAligner: Text-to-Garment Generation via Retrieval-augmented Multi-level Corrections | Unknown | N/A | |
| Visual Prompting via Partial Optimal Transport | Unknown | N/A | |
| AdaCLIP: Adapting CLIP with Hybrid Learnable Prompts for Zero-Shot Anomaly Detection | Unknown | N/A | |
| Pathformer3D: A 3D Scanpath Transformer for 360° Images | Unknown | N/A | |
| Visual Grounding for Object-Level Generalization in Reinforcement Learning | Unknown | N/A | |
| TransFusion -- A Transparency-Based Diffusion Model for Anomaly Detection | Unknown | N/A | |
| SparseLIF: High-Performance Sparse LiDAR-Camera Fusion for 3D Object Detection | Unknown | N/A | |
| Asymmetric Mask Scheme for Self-Supervised Real Image Denoising | Unknown | N/A | |
| FlexAttention for Efficient High-Resolution Vision-Language Models | Unknown | N/A | |
| EGIC: Enhanced Low-Bit-Rate Generative Image Compression Guided by Semantic Segmentation | Unknown | N/A | |
| EMDM: Efficient Motion Diffusion Model for Fast, High-Quality Human Motion Generation | Unknown | N/A | |
| Learning Differentially Private Diffusion Models via Stochastic Adversarial Distillation | Unknown | N/A | |
| PPAD: Iterative Interactions of Prediction and Planning for End-to-end Autonomous Driving | Unknown | N/A | |
| Temporal Event Stereo via Joint Learning with Stereoscopic Flow | Unknown | N/A | |
| H-V2X: A Large Scale Highway Dataset for BEV Perception | Unknown | N/A | |
| ManiGaussian: Dynamic Gaussian Splatting for Multi-task Robotic Manipulation | Unknown | N/A | |
| QueryCDR: Query-based Controllable Distortion Rectification Network for Fisheye Images | Unknown | N/A | |
| Global-Local Collaborative Inference with LLM for Lidar-Based Open-Vocabulary Detection | Unknown | N/A | |
| E3V-K5: An Authentic Benchmark for Redefining Video-Based Energy Expenditure Estimation | Unknown | N/A | |
| InstructIR: High-Quality Image Restoration Following Human Instructions | Unknown | N/A | |
| Make a Cheap Scaling: A Self-Cascade Diffusion Model for Higher-Resolution Adaptation | Unknown | N/A | |
| LayoutFlow: Flow Matching for Layout Generation | Unknown | N/A | |
| Making Large Language Models Better Planners with Reasoning-Decision Alignment | Unknown | N/A | |
| Continual Learning for Remote Physiological Measurement: Minimize Forgetting and Simplify Inference | Unknown | N/A | |
| PACE: Pose Annotations in Cluttered Environments | Unknown | N/A | |
| InfMAE: A Foundation Model in The Infrared Modality | Unknown | N/A | |
| Rawformer: Unpaired Raw-to-Raw Translation for Learnable Camera ISPs | Unknown | N/A | |
| STAG4D: Spatial-Temporal Anchored Generative 4D Gaussians | Unknown | N/A | |
| Robust Incremental Structure-from-Motion with Hybrid Features | Unknown | N/A | |
| FouriScale: A Frequency Perspective on Training-Free High-Resolution Image Synthesis | Unknown | N/A | |
| UniCal: Unified Neural Sensor Calibration | Unknown | N/A | |
| Mind the Interference: Retaining Pre-trained Knowledge in Parameter Efficient Continual Learning of Vision-Language Models | Unknown | N/A | |
| Urban Waterlogging Detection: A Challenging Benchmark and Large-Small Model Co-Adapter | Unknown | N/A | |
| ReMoS: 3D Motion-Conditioned Reaction Synthesis for Two-Person Interactions | Unknown | N/A | |
| Trajectory-aligned Space-time Tokens for Few-shot Action Recognition | Unknown | N/A | |
| Synchronization of Projective Transformations | Unknown | N/A | |
| U-COPE: Taking a Further Step to Universal 9D Category-level Object Pose Estimation | Unknown | N/A | |
| Insect Identification in the Wild: The AMI Dataset | Unknown | N/A | |
| Test-time Model Adaptation for Image Reconstruction Using Self-supervised Adaptive Layers | Unknown | N/A | |
| CMTA: Cross-Modal Temporal Alignment for Event-guided Video Deblurring | Unknown | N/A | |
| This Probably Looks Exactly Like That: An Invertible Prototypical Network | Unknown | N/A | |
| GenRC: Generative 3D Room Completion from Sparse Image Collections | Unknown | N/A | |
| Towards Open-ended Visual Quality Comparison | Unknown | N/A | |
| EgoPet: Egomotion and Interaction Data from an Animal's Perspective | Unknown | N/A | |
| Neural graphics texture compression supporting random access | Unknown | N/A | |
| Contrastive Learning with Synthetic Positives | Unknown | N/A | |
| GeneralAD: Anomaly Detection Across Domains by Attending to Distorted Features | Unknown | N/A | |
| DIM: Dyadic Interaction Modeling for Social Behavior Generation | Unknown | N/A | |
| ControlCap: Controllable Region-level Captioning | Unknown | N/A | |
| MaxFusion: Plug&Play Multi-Modal Generation in Text-to-Image Diffusion Models | Unknown | N/A | |
| Watch Your Steps: Local Image and Scene Editing by Text Instructions | Unknown | N/A | |
| Forget More to Learn More: Domain-specific Feature Unlearning for Semi-supervised and Unsupervised Domain Adaptation | Unknown | N/A | |
| LLM as Dataset Analyst: Subpopulation Structure Discovery with Large Language Model | Unknown | N/A | |
| 3x2: 3D Object Part Segmentation by 2D Semantic Correspondences | Unknown | N/A | |
| CityGaussian: Real-time High-quality Large-Scale Scene Rendering with Gaussians | Unknown | N/A | |
| Fisher Calibration for Backdoor-Robust Heterogeneous Federated Learning | Unknown | N/A | |
| A Semantic Space is Worth 256 Language Descriptions: Make Stronger Segmentation Models with Descriptive Properties | Unknown | N/A | |
| Fast View Synthesis of Casual Videos with Soup-of-Planes | Unknown | N/A | |
| Confidence Self-Calibration for Multi-Label Class-Incremental Learning | Unknown | N/A | |
| Video Question Answering with Procedural Programs | Unknown | N/A | |
| DGR-MIL: Exploring Diverse Global Representation in Multiple Instance Learning for Whole Slide Image Classification | Unknown | N/A | |
| Elegantly Written: Disentangling Writer and Character Styles for Enhancing Online Chinese Handwriting | Unknown | N/A | |
| SlotLifter: Slot-guided Feature Lifting for Learning Object-Centric Radiance Fields | Unknown | N/A | |
| Representation Enhancement-Stabilization: Reducing Bias-Variance of Domain Generalization | Unknown | N/A | |
| LLMGA: Multimodal Large Language Model based Generation Assistant | Unknown | N/A | |
| Shape from Heat Conduction | Unknown | N/A | |
| Learn from the Learnt: Source-Free Active Domain Adaptation via Contrastive Sampling and Visual Persistence | Unknown | N/A | |
| HandDGP: Camera-Space Hand Mesh Prediction with Differentiable Global Positioning | Unknown | N/A | |
| Mutual Learning for Acoustic Matching and Dereverberation via Visual Scene-driven Diffusion | Unknown | N/A | |
| Towards High-Quality 3D Motion Transfer with Realistic Apparel Animation | Unknown | N/A | |
| AnyHome: Open-Vocabulary Large-Scale Indoor Scene Generation with First-Person View Exploration | Unknown | N/A | |
| Better Call SAL: Towards Learning to Segment Anything in Lidar | Unknown | N/A | |
| DGInStyle: Domain-Generalizable Semantic Segmentation with Image Diffusion Models and Stylized Semantic Control | Unknown | N/A | |
| iMatching: Imperative Correspondence Learning | Unknown | N/A | |
| Appearance-based Refinement for Object-Centric Motion Segmentation | Unknown | N/A | |
| Open Panoramic Segmentation | Unknown | N/A | |
| Open Vocabulary Multi-Label Video Classification | Unknown | N/A | |
| Shape-guided Configuration-aware Learning for Endoscopic-image-based Pose Estimation of Flexible Robotic Instruments | Unknown | N/A | |
| MICDrop: Masking Image and Depth Features via Complementary Dropout for Domain-Adaptive Semantic Segmentation | Unknown | N/A | |
| GeoWizard: Unleashing the Diffusion Priors for 3D Geometry Estimation from a Single Image | Unknown | N/A | |
| Efficient Pre-training for Localized Instruction Generation of Procedural Videos | Unknown | N/A | |
| MTKD: Multi-Teacher Knowledge Distillation for Image Super-Resolution | Unknown | N/A | |
| DEAL: Disentangle and Localize Concept-level Explanations for VLMs | Unknown | N/A | |
| RoadPainter: Points Are Ideal Navigators for Topology transformER | Unknown | N/A | |
| Surf-D: Generating High-Quality Surfaces of Arbitrary Topologies Using Diffusion Models | Unknown | N/A | |
| Diffusion-Refined VQA Annotations for Semi-Supervised Gaze Following | Unknown | N/A | |
| IMMA: Immunizing text-to-image Models against Malicious Adaptation | Unknown | N/A | |
| ReALFRED: An Embodied Instruction Following Benchmark in Photo-Realistic Environments | Unknown | N/A | |
| SPAMming Labels: Efficient Annotations for the Trackers of Tomorrow | Unknown | N/A | |
| GeoCalib: Learning Single-image Calibration with Geometric Optimization | Unknown | N/A | |
| 3D Open-Vocabulary Panoptic Segmentation with 2D-3D Vision-Language Distillation | Unknown | N/A | |
| ReMatching: Low-Resolution Representations for Scalable Shape Correspondence | Unknown | N/A | |
| Semicalibrated Relative Pose from an Affine Correspondence and Monodepth | Unknown | N/A | |
| Global Structure-from-Motion Revisited | Unknown | N/A | |
| Gravity-aligned Rotation Averaging with Circular Regression | Unknown | N/A | |
| MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation | Unknown | N/A | |
| Find n' Propagate: Open-Vocabulary 3D Object Detection in Urban Environments | Unknown | N/A | |
| Quanta Video Restoration | Unknown | N/A | |
| Concept Sliders: LoRA Adaptors for Precise Control in Diffusion Models | Unknown | N/A | |
| A Probability-guided Sampler for Neural Implicit Surface Rendering | Unknown | N/A | |
| CAT-SAM: Conditional Tuning for Few-Shot Adaptation of Segment Anything Model | Unknown | N/A | |
| ScribblePrompt: Fast and Flexible Interactive Segmentation for Any Biomedical Image | Unknown | N/A | |
| FinePseudo: Improving Pseudo-Labelling through Temporal-Alignablity for Semi-Supervised Fine-Grained Action Recognition | Unknown | N/A | |
| POCA: Post-training Quantization with Temporal Alignment for Codec Avatars | Unknown | N/A | |
| HYPE: Hyperbolic Entailment Filtering for Underspecified Images and Texts | Unknown | N/A | |
| Receler: Reliable Concept Erasing of Text-to-Image Diffusion Models via Lightweight Erasers | Unknown | N/A | |
| Bridging the Gap: Studio-like Avatar Creation from a Monocular Phone Capture | Unknown | N/A | |
| A Secure Image Watermarking Framework with Statistical Guarantees via Adversarial Attacks on Secret Key Networks | Unknown | N/A | |
| HiT-SR: Hierarchical Transformer for Efficient Image Super-Resolution | Unknown | N/A | |
| Audio-Synchronized Visual Animation | Unknown | N/A | |
| Expressive Whole-Body 3D Gaussian Avatar | Unknown | N/A | |
| Reason2Drive: Towards Interpretable and Chain-based Reasoning for Autonomous Driving | Unknown | N/A | |
| DoughNet: A Visual Predictive Model for Topological Manipulation of Deformable Objects | Unknown | N/A | |
| PAV: Personalized Head Avatar from Unstructured Video Collection | Unknown | N/A | |
| Strike a Balance in Continual Panoptic Segmentation | Unknown | N/A | |
| MultiDelete for Multimodal Machine Unlearning | Unknown | N/A | |
| Stitched ViTs are Flexible Vision Backbones | Unknown | N/A | |
| Robo-ABC: Affordance Generalization Beyond Categories via Semantic Correspondence for Robot Manipulation | Unknown | N/A | |
| TrajPrompt: Aligning Color Trajectory with Vision-Language Representations | Unknown | N/A | |
| Stable Preference: Redefining training paradigm of human preference model for Text-to-Image Synthesis | Unknown | N/A | |
| CountFormer: Multi-View Crowd Counting Transformer | Unknown | N/A | |
| SemReg: Semantics Constrained Point Cloud Registration | Unknown | N/A | |
| You Only Need One Step: Fast Super-Resolution with Stable Diffusion via Scale Distillation | Unknown | N/A | |
| MonoWAD: Weather-Adaptive Diffusion Model for Robust Monocular 3D Object Detection | Unknown | N/A | |
| Benchmarks and Challenges in Pose Estimation for Egocentric Hand Interactions with Objects | Unknown | N/A | |
| Cascade-Zero123: One Image to Highly Consistent 3D with Self-Prompted Nearby Views | Unknown | N/A | |
| RoScenes: A Large-scale Multi-view 3D Dataset for Roadside Perception | Unknown | N/A | |
| R^2-Tuning: Efficient Image-to-Video Transfer Learning for Video Temporal Grounding | Unknown | N/A | |
| SkateFormer: Skeletal-Temporal Transformer for Human Action Recognition | Unknown | N/A | |
| Tree-D Fusion: Simulation-Ready Tree Dataset from Single Images with Diffusion Priors | Unknown | N/A | |
| ActionVOS: Actions as Prompts for Video Object Segmentation | Unknown | N/A | |
| DomainFusion: Generalizing To Unseen Domains with Latent Diffusion Models | Unknown | N/A | |
| One-stage Prompt-based Continual Learning | Unknown | N/A | |
| Unsqueeze [CLS] Bottleneck to Learn Rich Representations | Unknown | N/A | |
| Robust Multimodal Learning via Representation Decoupling | Unknown | N/A | |
| Object-Conditioned Energy-Based Attention Map Alignment in Text-to-Image Diffusion Models | Unknown | N/A | |
| Long-Tail Temporal Action Segmentation with Group-wise Temporal Logit Adjustment | Unknown | N/A | |
| WiMANS: A Benchmark Dataset for WiFi-based Multi-user Activity Sensing | Unknown | N/A | |
| Learning Trimodal Relation for Audio-Visual Question Answering with Missing Modality | Unknown | N/A | |
| Three Things We Need to Know About Transferring Stable Diffusion to Visual Dense Prediction Tasks | Unknown | N/A | |
| A Direct Approach to Viewing Graph Solvability | Unknown | N/A | |
| Effective Lymph Nodes Detection in CT Scans Using Location Debiased Query Selection and Contrastive Query Representation in Transformer | Unknown | N/A | |
| Look Hear: Gaze Prediction for Speech-directed Human Attention | Unknown | N/A | |
| Raising the Ceiling: Conflict-Free Local Feature Matching with Dynamic View Switching | Unknown | N/A | |
| Long-range Turbulence Mitigation: A Large-scale Dataset and A Coarse-to-fine Framework | Unknown | N/A | |
| SparseCtrl: Adding Sparse Controls to Text-to-Video Diffusion Models | Unknown | N/A | |
| Parrot Captions Teach CLIP to Spot Text | Unknown | N/A | |
| Versatile Incremental Learning: Towards Class and Domain-Agnostic Incremental Learning | Unknown | N/A | |
| Solving Motion Planning Tasks with a Scalable Generative Model | Unknown | N/A | |
| Rotary Position Embedding for Vision Transformer | Unknown | N/A | |
| Rebalancing Using Estimated Class Distribution for Imbalanced Semi-Supervised Learning under Class Distribution Mismatch | Unknown | N/A | |
| Griffon: Spelling out All Object Locations at Any Granularity with Large Language Models | Unknown | N/A | |
| ReNoise: Real Image Inversion Through Iterative Noising | Unknown | N/A | |
| Vision-Language Action Knowledge Learning for Semantic-Aware Action Quality Assessment | Unknown | N/A | |
| Leveraging Thermal Modality to Enhance Reconstruction in Low-Light Conditions | Unknown | N/A | |
| PLOT: Text-based Person Search with Part Slot Attention for Corresponding Part Discovery | Unknown | N/A | |
| Knowledge Transfer with Simulated Inter-Image Erasing for Weakly Supervised Semantic Segmentation | Unknown | N/A | |
| Recursive Visual Programming | Unknown | N/A | |
| Prompt-Driven Contrastive Learning for Transferable Adversarial Attacks | Unknown | N/A | |
| Learning to Adapt SAM for Segmenting Cross-domain Point Clouds | Unknown | N/A | |
| Take A Step Back: Rethinking the Two Stages in Visual Reasoning | Unknown | N/A | |
| Human-in-the-Loop Visual Re-ID for Population Size Estimation | Unknown | N/A | |
| Finding Visual Task Vectors | Unknown | N/A | |
| ShapeLLM: Universal 3D Object Understanding for Embodied Interaction | Unknown | N/A | |
| Tensorial template matching for fast cross-correlation with rotations and its application for tomography | Unknown | N/A | |
| Event Camera Data Dense Pre-training | Unknown | N/A | |
| Distractors-Immune Representation Learning with Cross-modal Contrastive Regularization for Change Captioning | Unknown | N/A | |
| DECap: Towards Generalized Explicit Caption Editing via Diffusion Mechanism | Unknown | N/A | |
| EgoLifter: Open-world 3D Segmentation for Egocentric Perception | Unknown | N/A | |
| MoVideo: Motion-Aware Video Generation with Diffusion Models | Unknown | N/A | |
| ComFusion: Enhancing Personalized Generation by Instance-Scene Compositing and Fusion | Unknown | N/A | |
| Where am I? Scene Retrieval with Language | Unknown | N/A | |
| SHERL: Synthesizing High Accuracy and Efficient Memory for Resource-Limited Transfer Learning | Unknown | N/A | |
| RangeLDM: Fast Realistic LiDAR Point Cloud Generation | Unknown | N/A | |
| Be-Your-Outpainter: Mastering Video Outpainting through Input-Specific Adaptation | Unknown | N/A | |
| Physically Plausible Color Correction for Neural Radiance Fields | Unknown | N/A | |
| Unifying 3D Vision-Language Understanding via Promptable Queries | Unknown | N/A | |
| LLM as Copilot for Coarse-grained Vision-and-Language Navigation | Unknown | N/A | |
| Revisiting Calibration of Wide-Angle Radially Symmetric Cameras | Unknown | N/A | |
| Motion-Guided Latent Diffusion for Temporally Consistent Real-world Video Super-resolution | Unknown | N/A | |
| PoseCrafter: One-Shot Personalized Video Synthesis Following Flexible Pose Control | Unknown | N/A | |
| MAD-DR: Map Compression for Visual Localization with Matchness Aware Descriptor Dimension Reduction | Unknown | N/A | |
| A New Dataset and Framework for Real-World Blurred Images Super-Resolution | Unknown | N/A | |
| Lane Graph as Path: Continuity-preserving Path-wise Modeling for Online Lane Graph Construction | Unknown | N/A | |
| Unleashing the Potential of the Semantic Latent Space in Diffusion Models for Image Dehazing | Unknown | N/A | |
| Uncertainty-aware sign language video retrieval with probability distribution modeling | Unknown | N/A | |
| NeRMo: Learning Implicit Neural Representations for 3D Human Motion Prediction | Unknown | N/A | |
| SAFARI: Adaptive Sequence Transformer for Weakly Supervised Referring Expression Segmentation | Unknown | N/A | |
| Adversarial Prompt Tuning for Vision-Language Models | Unknown | N/A | |
| BlazeBVD: Make Scale-Time Equalization Great Again for Blind Video Deflickering | Unknown | N/A | |
| A Closer Look at GAN Priors: Exploiting Intermediate Features for Enhanced Model Inversion Attacks | Unknown | N/A | |
| CC-SAM: Enhancing SAM with Cross-feature Attention and Context for Ultrasound Image Segmentation | Unknown | N/A | |
| Relightable 3D Gaussians: Realistic Point Cloud Relighting with BRDF Decomposition and Ray Tracing | Unknown | N/A | |
| An Efficient and Effective Transformer Decoder-Based Framework for Multi-Task Visual Grounding | Unknown | N/A | |
| X-InstructBLIP: A Framework for Aligning Image, 3D, Audio, Video to LLMs and its Emergent Cross-modal Reasoning | Unknown | N/A | |
| Operational Open-Set Recognition and PostMax Refinement | Unknown | N/A | |
| Betrayed by Attention: A Simple yet Effective Approach for Self-supervised Video Object Segmentation | Unknown | N/A | |
| Text2Place: Affordance-aware Text Guided Human Placement | Unknown | N/A | |
| REFRAME: Reflective Surface Real-Time Rendering for Mobile Devices | Unknown | N/A | |
| Self-Training Room Layout via Geometry-aware Ray-casting | Unknown | N/A | |
| TAPTR: Tracking Any Point with Transformers as Detection | Unknown | N/A | |
| Adaptive Multi-task Learning for Few-shot Object Detection | Unknown | N/A | |
| Closed-Loop Unsupervised Representation Disentanglement with $\beta$-VAE Distillation and Diffusion Probabilistic Feedback | Unknown | N/A | |
| ZoLA: Zero-Shot Creative Long Animation Generation with Short Video Model | Unknown | N/A | |
| Restore Anything with Masks: Leveraging Mask Image Modeling for Blind All-in-One Image Restoration | Unknown | N/A | |
| CamoTeacher: Dual-Rotation Consistency Learning for Semi-Supervised Camouflaged Object Detection | Unknown | N/A | |
| Textual Grounding for Open-vocabulary Visual Information Extraction in Layout-diversified Documents | Unknown | N/A | |
| Textual Knowledge Matters: Cross-Modality Co-Teaching for Generalized Visual Class Discovery | Unknown | N/A | |
| Multimodal Cross-Domain Few-Shot Learning for Egocentric Action Recognition | Unknown | N/A | |
| D4-VTON: Dynamic Semantics Disentangling for Differential Diffusion based Virtual Try-On | Unknown | N/A | |
| TC4D: Trajectory-Conditioned Text-to-4D Generation | Unknown | N/A | |
| RAW-Adapter: Adapting Pretrained Visual Model to Camera RAW Images | Unknown | N/A | |
| Blind Image Deconvolution by Generative-based Kernel Prior and Initializer via Latent Encoding | Unknown | N/A | |
| Dataset Enhancement with Instance-Level Augmentations | Unknown | N/A | |
| AdvDiff: Generating Unrestricted Adversarial Examples using Diffusion Models | Unknown | N/A | |
| Personalized Federated Domain-Incremental Learning based on Adaptive Knowledge Matching | Unknown | N/A | |
| ST-LDM: A Universal Framework for Text-Grounded Object Generation in Real Images | Unknown | N/A | |
| Category Adaptation Meets Projected Distillation in Generalized Continual Category Discovery | Unknown | N/A | |
| SLIM: Spuriousness Mitigation with Minimal Human Annotations | Unknown | N/A | |
| Uncertainty Calibration with Energy Based Instance-wise Scaling in the Wild Dataset | Unknown | N/A | |
| X-Pose: Detecting Any Keypoints | Unknown | N/A | |
| MIGS: Multi-Identity Gaussian Splatting via Tensor Decomposition | Unknown | N/A | |
| ∞-Brush: Controllable Large Image Synthesis with Diffusion Models in Infinite Dimensions | Unknown | N/A | |
| OLAF: A Plug-and-Play Framework for Enhanced Multi-object Multi-part Scene Parsing | Unknown | N/A | |
| UniMD: Towards Unifying Moment Retrieval and Temporal Action Detection | Unknown | N/A | |
| MetaCap: Meta-learning Priors from Multi-View Imagery for Sparse-view Human Performance Capture and Rendering | Unknown | N/A | |
| DiffPMAE: Diffusion Masked Autoencoders for Point Cloud Reconstruction | Unknown | N/A | |
| Motion Aware Event Representation-driven Image Deblurring | Unknown | N/A | |
| Walker: Self-supervised Multiple Object Tracking by Walking on Temporal Object Appearance Graphs | Unknown | N/A | |
| WildRefer: 3D Object Localization in Large-scale Dynamic Scenes with Multi-modal Visual Data and Natural Language | Unknown | N/A | |
| Text-Anchored Score Composition: Tackling Condition Misalignment in Text-to-Image Diffusion Models | Unknown | N/A | |
| GroupDiff: Diffusion-based Group Portrait Editing | Unknown | N/A | |
| Privacy-Preserving Adaptive Re-Identification without Image Transfer | Unknown | N/A | |
| Make Your ViT-based Multi-view 3D Detectors Faster via Token Compression | Unknown | N/A | |
| UCIP: A Universal Framework for Compressed Image Super-Resolution using Dynamic Prompt | Unknown | N/A | |
| TexDreamer: Towards Zero-Shot High-Fidelity 3D Human Texture Generation | Unknown | N/A | |
| MVPGS: Excavating Multi-view Priors for Gaussian Splatting from Sparse Input Views | Unknown | N/A | |
| Towards More Practical Group Activity Detection: A New Benchmark and Model | Unknown | N/A | |
| Depicting Beyond Scores: Advancing Image Quality Assessment through Multi-modal Language Models | Unknown | N/A | |
| Zero-Shot Image Feature Consensus with Deep Functional Maps | Unknown | N/A | |
| Geospecific View Generation - Geometry-Context Aware High-resolution Ground View Inference from Satellite Views | Unknown | N/A | |
| City-on-Web: Real-time Neural Rendering of Large-scale Scenes on the Web | Unknown | N/A | |
| Co-Student: Collaborating Strong and Weak Students for Sparsely Annotated Object Detection | Unknown | N/A | |
| SeiT++: Masked Token Modeling Improves Storage-efficient Training | Unknown | N/A | |
| Revisiting Feature Disentanglement Strategy in Diffusion Training and Breaking Conditional Independence Assumption in Sampling | Unknown | N/A | |
| ProMerge: Prompt and Merge for Unsupervised Instance Segmentation | Unknown | N/A | |
| Open-Vocabulary Camouflaged Object Segmentation | Unknown | N/A | |
| CanonicalFusion: Generating Drivable 3D Human Avatars from Multiple Images | Unknown | N/A | |
| PetFace: A Large-Scale Dataset and Benchmark for Animal Identification | Unknown | N/A | |
| A Simple Low-bit Quantization Framework for Video Snapshot Compressive Imaging | Unknown | N/A | |
| InterFusion: Text-Driven Generation of 3D Human-Object Interaction | Unknown | N/A | |
| GLARE: Low Light Image Enhancement via Generative Latent Feature based Codebook Retrieval | Unknown | N/A | |
| Flow-Assisted Motion Learning Network for Weakly-Supervised Group Activity Recognition | Unknown | N/A | |
| Learning Anomalies with Normality Prior for Unsupervised Video Anomaly Detection | Unknown | N/A | |
| Multi-Memory Matching for Unsupervised Visible-Infrared Person Re-Identification | Unknown | N/A | |
| Compositional Substitutivity of Visual Reasoning for Visual Question Answering | Unknown | N/A | |
| DNI: Dilutional Noise Initialization for Diffusion Video Editing | Unknown | N/A | |
| Fully Authentic Visual Question Answering Dataset from Online Communities | Unknown | N/A | |
| Towards Physical World Backdoor Attacks against Skeleton Action Recognition | Unknown | N/A | |
| Active Generation for Image Classification | Unknown | N/A | |
| Panel-Specific Degradation Representation for Raw Under-Display Camera Image Restoration | Unknown | N/A | |
| Unlocking Textual and Visual Wisdom: Open-Vocabulary 3D Object Detection Enhanced by Comprehensive Guidance from Text and Image | Unknown | N/A | |
| Diffusion-Guided Weakly Supervised Semantic Segmentation | Unknown | N/A | |
| DetailSemNet: Elevating Signature Verification through Detail-Semantic Integration | Unknown | N/A | |
| Real-time Holistic Robot Pose Estimation with Unknown States | Unknown | N/A | |
| Online Vectorized HD Map Construction using Geometry | Unknown | N/A | |
| Tendency-driven Mutual Exclusivity for Weakly Supervised Incremental Semantic Segmentation | Unknown | N/A | |
| Click-Gaussian: Interactive Segmentation to Any 3D Gaussians | Unknown | N/A | |
| Is user feedback always informative? Retrieval Latent Defending for Semi-Supervised Domain Adaptation without Source Data | Unknown | N/A | |
| Sparse Beats Dense: Rethinking Supervision in Radar-Camera Depth Completion | Unknown | N/A | |
| Improving Virtual Try-On with Garment-focused Diffusion Models | Unknown | N/A | |
| MANIKIN: Biomechanically Accurate Neural Inverse Kinematics for Human Motion Estimation | Unknown | N/A | |
| Disentangled Generation and Aggregation for Robust Radiance Fields | Unknown | N/A | |
| MoAI: Mixture of All Intelligence for Large Language and Vision Models | Unknown | N/A | |
| SMooDi: Stylized Motion Diffusion Model | Unknown | N/A | |
| Online Temporal Action Localization with Memory-Augmented Transformer | Unknown | N/A | |
| JointDreamer: Ensuring Geometry Consistency and Text Congruence in Text-to-3D Generation via Joint Score Distillation | Unknown | N/A | |
| TexGen: Text-Guided 3D Texture Generation with Multi-view Sampling and Resampling | Unknown | N/A | |
| SDPT: Synchronous Dual Prompt Tuning for Fusion-based Visual-Language Pre-trained Models | Unknown | N/A | |
| Learning Video Context as Interleaved Multimodal Sequences | Unknown | N/A | |
| Dense Multimodal Alignment for Open-Vocabulary 3D Scene Understanding | Unknown | N/A | |
| FLAT: Flux-aware Imperceptible Adversarial Attacks on 3D Point Clouds | Unknown | N/A | |
| Deep Feature Surgery: Towards Accurate and Efficient Multi-Exit Networks | Unknown | N/A | |
| Multi-branch Collaborative Learning Network for 3D Visual Grounding | Unknown | N/A | |
| Progressive Proxy Anchor Propagation for Unsupervised Semantic Segmentation | Unknown | N/A | |
| Within the Dynamic Context: Inertia-aware 3D Human Modeling with Pose Sequence | Unknown | N/A | |
| Revisit Human-Scene Interaction via Space Occupancy | Unknown | N/A | |
| Face Adapter for Pre-Trained Diffusion Models with Fine-Grained ID and Attribute Control | Unknown | N/A | |
| WeConvene: Learned Image Compression with Wavelet-Domain Convolution and Entropy Model | Unknown | N/A | |
| Mitigating Background Shift in Class-Incremental Semantic Segmentation | Unknown | N/A | |
| Relation DETR: Exploring Explicit Position Relation Prior for Object Detection | Unknown | N/A | |
| BKDSNN: Enhancing the Performance of Learning-based Spiking Neural Networks Training with Blurred Knowledge Distillation | Unknown | N/A | |
| Object-Oriented Anchoring and Modal Alignment in Multimodal Learning | Unknown | N/A | |
| SPIRE: Semantic Prompt-Driven Image Restoration | Unknown | N/A | |
| Boosting the Power of Small Multimodal Reasoning Models to Match Larger Models with Self-Consistency Training | Unknown | N/A | |
| SemiVL: Semi-Supervised Semantic Segmentation with Vision-Language Guidance | Unknown | N/A | |
| Towards Stable 3D Object Detection | Unknown | N/A | |
| FYI: Flip Your Images for Dataset Distillation | Unknown | N/A | |
| On-the-fly Category Discovery for LiDAR Semantic Segmentation | Unknown | N/A | |
| Dual-Camera Smooth Zoom on Mobile Phones | Unknown | N/A | |
| Attention Decomposition for Cross-Domain Semantic Segmentation | Unknown | N/A | |
| CONDA: Condensed Deep Association Learning for Co-Salient Object Detection | Unknown | N/A | |
| PolyRoom: Room-aware Transformer for Floorplan Reconstruction | Unknown | N/A | |
| BenchLMM: Benchmarking Cross-style Visual Capability of Large Multimodal Models | Unknown | N/A | |
| SMFANet: A Lightweight Self-Modulation Feature Aggregation Network for Efficient Image Super-Resolution | Unknown | N/A | |
| AUFormer: Vision Transformers are Parameter-Efficient Facial Action Unit Detectors | Unknown | N/A | |
| Improving Video Segmentation via Dynamic Anchor Queries | Unknown | N/A | |
| Controllable Contextualized Image Captioning: Directing the Visual Narrative through User-Defined Highlights | Unknown | N/A | |
| Diffusion Models as Optimizers for Efficient Planning in Offline RL | Unknown | N/A | |
| How Many Unicorns Are in This Image? A Safety Evaluation Benchmark for Vision LLMs | Unknown | N/A | |
| Coarse-to-Fine Implicit Representation Learning for 3D Hand-Object Reconstruction from a Single RGB-D Image | Unknown | N/A | |
| Enhancing Recipe Retrieval with Foundation Models: A Data Augmentation Perspective | Unknown | N/A | |
| Flatness-aware Sequential Learning Generates Resilient Backdoors | Unknown | N/A | |
| PapMOT: Exploring Adversarial Patch Attack against Multiple Object Tracking | Unknown | N/A | |
| HiDiffusion: Unlocking Higher-Resolution Creativity and Efficiency in Pretrained Diffusion Models | Unknown | N/A | |
| SCOMatch: Alleviating Overtrusting in Open-set Semi-supervised Learning | Unknown | N/A | |
| Region-aware Distribution Contrast: A Novel Approach to Multi-Task Partially Supervised Learning | Unknown | N/A | |
| An Incremental Unified Framework for Small Defect Inspection | Unknown | N/A | |
| Self-supervised Feature Adaptation for 3D Industrial Anomaly Detection | Unknown | N/A | |
| MasterWeaver: Taming Editability and Face Identity for Personalized Text-to-Image Generation | Unknown | N/A | |
| PointRegGPT: Boosting 3D Point Cloud Registration using Generative Point-Cloud Pairs for Training | Unknown | N/A | |
| Real-time 3D-aware Portrait Editing from a Single Image | Unknown | N/A | |
| Dolfin: Diffusion Layout Transformers without Autoencoder | Unknown | N/A | |
| Image Compression for Machine and Human Vision With Spatial-Frequency Adaptation | Unknown | N/A | |
| Platypus: A Generalized Specialist Model for Reading Text in Various Forms | Unknown | N/A | |
| DIFFender: Diffusion-Based Adversarial Defense against Patch Attacks | Unknown | N/A | |
| Hybrid Video Diffusion Models with 2D Triplane and 3D Wavelet Representation | Unknown | N/A | |
| Efficient Diffusion-Driven Corruption Editor for Test-Time Adaptation | Unknown | N/A | |
| Emergent Visual-Semantic Hierarchies in Image-Text Representations | Unknown | N/A | |
| DriveLM: Driving with Graph Visual Question Answering | Unknown | N/A | |
| Beyond Viewpoint: Robust 3D Object Recognition under Arbitrary Views through Joint Multi-Part Representation | Unknown | N/A | |
| LiFT: A Surprisingly Simple Lightweight Feature Transform for Dense ViT Descriptors | Unknown | N/A | |
| Learning Non-Linear Invariants for Unsupervised Out-of-Distribution Detection | Unknown | N/A | |
| Real Appearance Modeling for More General Deepfake Detection | Unknown | N/A | |
| 6DGS: 6D Pose Estimation from a Single Image and a 3D Gaussian Splatting Model | Unknown | N/A | |
| Q&A Prompts: Discovering Rich Visual Clues through Mining Question-Answer Prompts for VQA requiring Diverse World Knowledge | Unknown | N/A | |
| Event Trojan: Asynchronous Event-based Backdoor Attacks | Unknown | N/A | |
| V2X-Real: a Large-Scale Dataset for Vehicle-to-Everything Cooperative Perception | Unknown | N/A | |
| VQ-HPS: Human Pose and Shape Estimation in a Vector-Quantized Latent Space | Unknown | N/A | |
| CatchBackdoor: Backdoor Detection via Critical Trojan Neural Path Fuzzing | Unknown | N/A | |
| GPSFormer: A Global Perception and Local Structure Fitting-based Transformer for Point Cloud Understanding | Unknown | N/A | |
| Any2Point: Empowering Any-modality Transformers for Efficient 3D Understanding | Unknown | N/A | |
| HARIVO: Harnessing Text-to-Image Models for Video Generation | Unknown | N/A | |
| Deep Online Probability Aggregation Clustering | Unknown | N/A | |
| WRIM-Net: Wide-Ranging Information Mining Network for Visible-Infrared Person Re-Identification | Unknown | N/A | |
| Reliable and Efficient Concept Erasure of Text-to-Image Diffusion Models | Unknown | N/A | |
| Length-Aware Motion Synthesis via Latent Diffusion | Unknown | N/A | |
| Attention-Challenging Multiple Instance Learning for Whole Slide Image Classification | Unknown | N/A | |
| Free Lunch for Gait Recognition: A Novel Relation Descriptor | Unknown | N/A | |
| OneTrack: Demystifying the Conflict Between Detection and Tracking in End-to-End 3D Trackers | Unknown | N/A | |
| An Optimal Control View of LoRA and Binary Controller Design for Vision Transformers | Unknown | N/A | |
| Disentangled Clothed Avatar Generation from Text Descriptions | Unknown | N/A | |
| Exploring Phrase-Level Grounding with Text-to-Image Diffusion Model | Unknown | N/A | |
| Exemplar-free Continual Representation Learning via Learnable Drift Compensation | Unknown | N/A | |
| Improving image synthesis with diffusion-negative sampling | Unknown | N/A | |
| AvatarPose: Avatar-guided 3D Pose Estimation of Close Human Interaction from Sparse Multi-view Videos | Unknown | N/A | |
| FedVAD: Enhancing Federated Video Anomaly Detection with GPT-Driven Semantic Distillation | Unknown | N/A | |
| SignGen: End-to-End Sign Language Video Generation with Latent Diffusion | Unknown | N/A | |
| Diffusion Prior-Based Amortized Variational Inference for Noisy Inverse Problems | Unknown | N/A | |
| Idling Neurons, Appropriately Lenient Workload During Fine-tuning Leads to Better Generalization | Unknown | N/A | |
| Temporal Residual Guided Diffusion Framework for Event-Driven Video Reconstruction | Unknown | N/A | |
| S^3D-NeRF: Single-Shot Speech-Driven Neural Radiance Field for High Fidelity Talking Head Synthesis | Unknown | N/A | |
| The Gaussian Discriminant Variational Autoencoder (GdVAE): A Self-Explainable Model with Counterfactual Explanations | Unknown | N/A | |
| FreeCompose: Generic Zero-Shot Image Composition with Diffusion Prior | Unknown | N/A | |
| Accelerating Image Generation with Sub-path Linear Approximation Model | Unknown | N/A | |
| Revisit Event Generation Model: Self-Supervised Learning of Event-to-Video Reconstruction with Implicit Neural Representations | Unknown | N/A | |
| SFPNet: Sparse Focal Point Network for Semantic Segmentation on General LiDAR Point Clouds | Unknown | N/A | |
| Safe-CLIP: Removing NSFW Concepts from Vision-and-Language Models | Unknown | N/A | |
| TRAM: Global Trajectory and Motion of 3D Humans from in-the-wild Videos | Unknown | N/A | |
| TetraDiffusion: Tetrahedral Diffusion Models for 3D Shape Generation | Unknown | N/A | |
| Camera Calibration using a Collimator System | Unknown | N/A | |
| GRA: Detecting Oriented Objects through Group-wise Rotating and Attention | Unknown | N/A | |
| Track Everything Everywhere Fast and Robustly | Unknown | N/A | |
| AutoDIR: Automatic All-in-One Image Restoration with Latent Diffusion | Unknown | N/A | |
| Label-free Neural Semantic Image Synthesis | Unknown | N/A | |
| Exploring Reliable Matching with Phase Enhancement for Night-time Semantic Segmentation | Unknown | N/A | |
| Switch Diffusion Transformer: Synergizing Denoising Tasks with Sparse Mixture-of-Experts | Unknown | N/A | |
| Power Variable Projection for Initialization-Free Large-Scale Bundle Adjustment | Unknown | N/A | |
| FARSE-CNN: Fully Asynchronous, Recurrent and Sparse Event-Based CNN | Unknown | N/A | |
| ConDense: Consistent 2D-3D Pre-training for Dense and Sparse Features from Multi-View Images | Unknown | N/A | |
| Event-Aided Time-To-Collision Estimation for Autonomous Driving | Unknown | N/A | |
| MTA-CLIP: Language-Guided Semantic Segmentation with Mask-Text Alignment | Unknown | N/A | |
| The Devil is in the Statistics: Mitigating and Exploiting Statistics Difference for Generalizable Semi-supervised Medical Image Segmentation | Unknown | N/A | |
| Powerful and Flexible: Personalized Text-to-Image Generation via Reinforcement Learning | Unknown | N/A | |
| VEON: Vocabulary-Enhanced Occupancy Prediction | Unknown | N/A | |
| Adapt without Forgetting: Distill Proximity from Dual Teachers in Vision-Language Models | Unknown | N/A | |
| HiEI: A Universal Framework for Generating High-quality Emerging Images from Natural Images | Unknown | N/A | |
| Nonverbal Interaction Detection | Unknown | N/A | |
| The Sky's the Limit: Relightable Outdoor Scenes via a Sky-pixel Constrained Illumination Prior and Outside-In Visibility | Unknown | N/A | |
| DiffFAS: Face Anti-Spoofing via Generative Diffusion Models | Unknown | N/A | |
| Simplifying Source-Free Domain Adaptation for Object Detection: Effective Self-Training Strategies and Performance Insights | Unknown | N/A | |
| I-MedSAM: Implicit Medical Image Segmentation with Segment Anything | Unknown | N/A | |
| Neural Spectral Decomposition for Dataset Distillation | Unknown | N/A | |
| Collaborative Vision-Text Representation Optimizing for Open-Vocabulary Segmentation | Unknown | N/A | |
| Region-Adaptive Transform with Segmentation Prior for Image Compression | Unknown | N/A | |
| SmartControl: Enhancing ControlNet for Handling Rough Visual Conditions | Unknown | N/A | |
| Cascade Prompt Learning for Visual-Language Model Adaptation | Unknown | N/A | |
| Class-Incremental Learning with CLIP: Adaptive Representation Adjustment and Parameter Fusion | Unknown | N/A | |
| cDP-MIL: Robust Multiple Instance Learning via Cascaded Dirichlet Process | Unknown | N/A | |
| DetToolChain: A New Prompting Paradigm to Unleash Detection Ability of MLLM | Unknown | N/A | |
| Causality-inspired Discriminative Feature Learning in Triple Domains for Gait Recognition | Unknown | N/A | |
| Delving Deep into Engagement Prediction of Short Videos | Unknown | N/A | |
| CLEO: Continual Learning of Evolving Ontologies | Unknown | N/A | |
| ByteEdit: Boost, Comply and Accelerate Generative Image Editing | Unknown | N/A | |
| BK-SDM: A Lightweight, Fast, and Cheap Version of Stable Diffusion | Unknown | N/A | |
| Leveraging scale- and orientation-covariant features for planar motion estimation | Unknown | N/A | |
| MacDiff: Unified Skeleton Modeling with Masked Conditional Diffusion | Unknown | N/A | |
| MultiGen: Zero-shot Image Generation from Multi-modal Prompts | Unknown | N/A | |
| Understanding and Mitigating Human-Labelling Errors in Supervised Contrastive Learning | Unknown | N/A | |
| VEGS: View Extrapolation of Urban Scenes in 3D Gaussian Splatting using Learned Priors | Unknown | N/A | |
| SWinGS: Sliding Windows for Dynamic 3D Gaussian Splatting | Unknown | N/A | |
| Shape2Scene: 3D Scene Representation Learning Through Pre-training on Shape Data | Unknown | N/A | |
| Refine, Discriminate and Align: Stealing Encoders via Sample-Wise Prototypes and Multi-Relational Extraction | Unknown | N/A | |
| Mew: Multiplexed Immunofluorescence Image Analysis through an Efficient Multiplex Network | Unknown | N/A | |
| AdaDistill: Adaptive Knowledge Distillation for Deep Face Recognition | Unknown | N/A | |
| HERGen: Elevating Radiology Report Generation with Longitudinal Data | Unknown | N/A | |
| Labeled Data Selection for Category Discovery | Unknown | N/A | |
| Hierarchical Unsupervised Relation Distillation for Source Free Domain Adaptation | Unknown | N/A | |
| Dependency-aware Differentiable Neural Architecture Search | Unknown | N/A | |
| CLIFF: Continual Latent Diffusion for Open-Vocabulary Object Detection | Unknown | N/A | |
| GMT: Enhancing Generalizable Neural Rendering via Geometry-Driven Multi-Reference Texture Transfer | Unknown | N/A | |
| SNeRV: Spectra-preserving Neural Representation for Video | Unknown | N/A | |
| COMO: Compact Mapping and Odometry | Unknown | N/A | |
| SelfSwapper: Self-Supervised Face Swapping via Shape Agnostic Masked AutoEncoder | Unknown | N/A | |
| EgoPoseFormer: A Simple Baseline for Stereo Egocentric 3D Human Pose Estimation | Unknown | N/A | |
| An Information Theoretical View for Out-Of-Distribution Detection | Unknown | N/A | |
| HENet: Hybrid Encoding for End-to-end Multi-task 3D Perception from Multi-view Cameras | Unknown | N/A | |
| Adapting Fine-Grained Cross-View Localization to Areas without Fine Ground Truth | Unknown | N/A | |
| WPS-SAM: Towards Weakly-Supervised Part Segmentation with Foundation Models | Unknown | N/A | |
| SILC: Improving Vision Language Pretraining with Self-Distillation | Unknown | N/A | |
| Analysis-by-Synthesis Transformer for Single-View 3D Reconstruction | Unknown | N/A | |
| DMiT: Deformable Mipmapped Tri-Plane Representation for Dynamic Scenes | Unknown | N/A | |
| Transferable 3D Adversarial Shape Completion using Diffusion Models | Unknown | N/A | |
| Gradient-Aware for Class-Imbalanced Semi-supervised Medical Image Segmentation | Unknown | N/A | |
| Exploiting Dual-Correlation for Multi-frame Time-of-Flight Denoising | Unknown | N/A | |
| Event-Adapted Video Super-Resolution | Unknown | N/A | |
| HowToCaption: Prompting LLMs to Transform Video Annotations at Scale | Unknown | N/A | |
| ControlNet++: Improving Conditional Controls with Efficient Consistency Feedback | Unknown | N/A | |
| UniDream: Unifying Diffusion Priors for Relightable Text-to-3D Generation | Unknown | N/A | |
| LabelDistill: Label-guided Cross-modal Knowledge Distillation for Camera-based 3D Object Detection | Unknown | N/A | |
| Beyond the Data Imbalance: Employing the Heterogeneous Datasets for Vehicle Maneuver Prediction | Unknown | N/A | |
| On Pretraining Data Diversity for Self-Supervised Learning | Unknown | N/A | |
| Bayesian Self-Training for Semi-Supervised 3D Segmentation | Unknown | N/A | |
| Tri^{2}-plane: Thinking Head Avatar via Feature Pyramid | Unknown | N/A | |
| Motion-Oriented Compositional Neural Radiance Fields for Monocular Dynamic Human Modeling | Unknown | N/A | |
| Learning 3D Geometry and Feature Consistent Gaussian Splatting for Object Removal | Unknown | N/A | |
| ParCo: Part-Coordinating Text-to-Motion Synthesis | Unknown | N/A | |
| Learning to Complement and to Defer to Multiple Users | Unknown | N/A | |
| Tiny Models are the Computational Saver for Large Models | Unknown | N/A | |
| Multi-Sentence Grounding for Long-term Instructional Video | Unknown | N/A | |
| AddressCLIP: Empowering Vision-Language Models for City-wide Image Address Localization | Unknown | N/A | |
| Unveiling Privacy Risks in Stochastic Neural Networks Training: Effective Image Reconstruction from Gradients | Unknown | N/A | |
| Head360: Learning a Parametric 3D Full-Head for Free-View Synthesis in 360° | Unknown | N/A | |
| KMTalk: Speech-Driven 3D Facial Animation with Key Motion Embedding | Unknown | N/A | |
| Rate-Distortion-Cognition Controllable Versatile Neural Image Compression | Unknown | N/A | |
| Temporal As a Plugin: Unsupervised Video Denoising with Pre-Trained Image Denoisers | Unknown | N/A | |
| MM-SafetyBench: A Benchmark for Safety Evaluation of Multimodal Large Language Models | Unknown | N/A | |
| ML-SemReg: Boosting Point Cloud Registration with Multi-level Semantic Consistency | Unknown | N/A | |
| PanoVOS: Bridging Non-panoramic and Panoramic Views with Transformer for Video Segmentation | Unknown | N/A | |
| CrossGLG: LLM Guides One-shot Skeleton-based 3D Action Recognition in a Cross-level Manner | Unknown | N/A | |
| Vista3D: Unravel the 3D Darkside of a Single Image | Unknown | N/A | |
| Scene Coordinate Reconstruction: Posing of Image Collections via Incremental Learning of a Relocalizer | Unknown | N/A | |
| Post-training Quantization with Progressive Calibration and Activation Relaxing for Text-to-Image Diffusion Models | Unknown | N/A | |
| Diffusion Models are Geometry Critics: Single Image 3D Editing Using Pre-Trained Diffusion Priors | Unknown | N/A | |
| Weakly Supervised Co-training with Swapping Assignments for Semantic Segmentation | Unknown | N/A | |
| StoryImager: A Unified and Efficient Framework for Coherent Story Visualization and Completion | Unknown | N/A | |
| DISCO: Embodied Navigation and Interaction via Differentiable Scene Semantics and Dual-level Control | Unknown | N/A | |
| Segment and Recognize Anything at Any Granularity | Unknown | N/A | |
| ST-LLM: Large Language Models Are Effective Temporal Learners | Unknown | N/A | |
| LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents | Unknown | N/A | |
| Align before Collaborate: Mitigating Feature Misalignment for Robust Multi-Agent Perception | Unknown | N/A | |
| LLaVA-Grounding: Grounded Visual Chat with Large Multimodal Models | Unknown | N/A | |
| A Simple Baseline for Spoken Language to Sign Language Translation with 3D Avatars | Unknown | N/A | |
| Exact Diffusion Inversion via Bidirectional Integration Approximation | Unknown | N/A | |
| Textual Query-Driven Mask Transformer for Domain Generalized Segmentation | Unknown | N/A | |
| EmoTalk3D: High-Fidelity Free-View Synthesis of Emotional 3D Talking Head | Unknown | N/A | |
| SegGen: Supercharging Segmentation Models with Text2Mask and Mask2Img Synthesis | Unknown | N/A | |
| Arbitrary-Scale Video Super-Resolution with Structural and Textural Priors | Unknown | N/A | |
| Object-Centric Diffusion for Efficient Video Editing | Unknown | N/A | |
| Single-Mask Inpainting for Voxel-based Neural Radiance Fields | Unknown | N/A | |
| Freeview Sketching: View-Aware Fine-Grained Sketch-Based Image Retrieval | Unknown | N/A | |
| SLAck: Semantic, Location, and Appearance Aware Open-Vocabulary Tracking | Unknown | N/A | |
| Adapt2Reward: Adapting Video-Language Models to Generalizable Robotic Rewards via Failure Prompts | Unknown | N/A | |
| Agglomerative Token Clustering | Unknown | N/A | |
| CMD: A Cross Mechanism Domain Adaptation Dataset for 3D Object Detection | Unknown | N/A | |
| NAMER: Non-Autoregressive Modeling for Handwritten Mathematical Expression Recognition | Unknown | N/A | |
| GIVT: Generative Infinite-Vocabulary Transformers | Unknown | N/A | |
| SV3D: Novel Multi-view Synthesis and 3D Generation from a Single Image using Latent Video Diffusion | Unknown | N/A | |
| Mismatch Quest: Visual and Textual Feedback for Image-Text Misalignment | Unknown | N/A | |
| Regulating Model Reliance on Non-Robust Features by Smoothing Input Marginal Density | Unknown | N/A | |
| Multi-Modal Video Dialog State Tracking in the Wild | Unknown | N/A | |
| Factorized Diffusion: Perceptual Illusions by Noise Decomposition | Unknown | N/A | |
| Combining Generative and Geometry Priors for Wide-Angle Portrait Correction | Unknown | N/A | |
| To Generate or Not? Safety-Driven Unlearned Diffusion Models Are Still Easy To Generate Unsafe Images ... For Now | Unknown | N/A | |
| CAT: Enhancing Multimodal Large Language Model to Answer Questions in Dynamic Audio-Visual Scenarios | Unknown | N/A | |
| Dissecting Dissonance: Benchmarking Large Multimodal Models Against Self-Contradictory Instructions | Unknown | N/A | |
| StereoGlue: Joint Feature Matching and Robust Estimation | Unknown | N/A | |
| Boosting Transferability in Vision-Language Attacks via Diversification along the Intersection Region of Adversarial Trajectory | Unknown | N/A | |
| Leveraging Enhanced Queries of Point Sets for Vectorized Map Construction | Unknown | N/A | |
| Foster Adaptivity and Balance in Learning with Noisy Labels | Unknown | N/A | |
| Ray Denoising: Depth-aware Hard Negative Sampling for Multi-view 3D Object Detection | Unknown | N/A | |
| Robust Zero-Shot Crowd Counting and Localization with Adaptive Resolution SAM | Unknown | N/A | |
| AWOL: Analysis WithOut synthesis using Language | Unknown | N/A | |
| OneVOS: Unifying Video Object Segmentation with All-in-One Transformer Framework | Unknown | N/A | |
| MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model | Unknown | N/A | |
| Temporal Residual Jacobians for Rig-free Motion Transfer | Unknown | N/A | |
| Object-Aware NIR-to-Visible Translation | Unknown | N/A | |
| Taming Lookup Tables for Efficient Image Retouching | Unknown | N/A | |
| DualDn: Dual-domain Denoising via Differentiable ISP | Unknown | N/A | |
| From Fake to Real: Pretraining on Balanced Synthetic Images to Prevent Spurious Correlations in Image Recognition | Unknown | N/A | |
| Cross-Domain Few-Shot Object Detection via Enhanced Open-Set Object Detector | Unknown | N/A | |
| NICP: Neural ICP for 3D Human Registration at Scale | Unknown | N/A | |
| Syn-to-Real Domain Adaptation for Point Cloud Completion via Part-based Approach | Unknown | N/A | |
| PredBench: Benchmarking Spatio-Temporal Prediction across Diverse Disciplines | Unknown | N/A | |
| LiDAR-based All-weather 3D Object Detection via Prompting and Distilling 4D Radar | Unknown | N/A | |
| FontStudio: Shape-Adaptive Diffusion Model for Coherent and Consistent Font Effect Generation | Unknown | N/A | |
| Finding Meaning in Points: Weakly Supervised Semantic Segmentation for Event Cameras | Unknown | N/A | |
| StableDrag: Stable Dragging for Point-based Image Editing | Unknown | N/A | |
| Phase Concentration and Shortcut Suppression for Weakly Supervised Semantic Segmentation | Unknown | N/A | |
| Scaling Up Personalized Image Aesthetic Assessment via Task Vector Customization | Unknown | N/A | |
| Unlocking Attributes' Contribution to Successful Camouflage: A Combined Textual and Visual Analysis Strategy | Unknown | N/A | |
| Improving Feature Stability during Upsampling -- Spectral Artifacts and the Importance of Spatial Context | Unknown | N/A | |
| Teddy: Efficient Large-Scale Dataset Distillation via Taylor-Approximated Matching | Unknown | N/A | |
| Monocular Occupancy Prediction for Scalable Indoor Scenes | Unknown | N/A | |
| Neural Surface Detection for Unsigned Distance Fields | Unknown | N/A | |
| Embedding-Free Transformer with Inference Spatial Reduction for Efficient Semantic Segmentation | Unknown | N/A | |
| Random Walk on Pixel Manifolds for Anomaly Segmentation of Complex Driving Scenes | Unknown | N/A | |
| Event-Based Motion Magnification | Unknown | N/A | |
| AdaIFL: Adaptive Image Forgery Localization via a Dynamic and Importance-aware Transformer Network | Unknown | N/A | |
| Improving Neural Surface Reconstruction with Feature Priors from Multi-View Images | Unknown | N/A | |
| Towards Multimodal Sentiment Analysis Debiasing via Bias Purification | Unknown | N/A | |
| ActionSwitch: Class-agnostic Detection of Simultaneous Actions in Streaming Videos | Unknown | N/A | |
| MUSES: The Multi-Sensor Semantic Perception Dataset for Driving under Uncertainty | Unknown | N/A | |
| PromptIQA: Boosting the Performance and Generalization for No-Reference Image Quality Assessment via Prompts | Unknown | N/A | |
| Open-Vocabulary SAM: Segment and Recognize Twenty-thousand Classes Interactively | Unknown | N/A | |
| Event-based Head Pose Estimation: Benchmark and Method | Unknown | N/A | |
| UniTraj: A Unified Framework for Scalable Vehicle Trajectory Prediction | Unknown | N/A | |
| PSALM: Pixelwise Segmentation with Large Multi-modal Model | Unknown | N/A | |
| Latent Diffusion Prior Enhanced Deep Unfolding for Snapshot Spectral Compressive Imaging | Unknown | N/A | |
| Discovering Novel Actions from Open World Egocentric Videos with Object-Grounded Visual Commonsense Reasoning | Unknown | N/A | |
| Robustness Tokens: Towards Adversarial Robustness of Transformers | Unknown | N/A | |
| DecentNeRFs: Decentralized Neural Radiance Fields from Crowdsourced Images | Unknown | N/A | |
| DreamMesh: Jointly Manipulating and Texturing Triangle Meshes for Text-to-3D Generation | Unknown | N/A | |
| Unveiling Typographic Deceptions: Insights of the Typographic Vulnerability in Large Vision-Language Models | Unknown | N/A | |
| PairingNet: A Learning-based Pair-searching and -matching Network for Image Fragments | Unknown | N/A | |
| Towards Multimodal Open-Set Domain Generalization and Adaptation through Self-supervision | Unknown | N/A | |
| Iterative Ensemble Training with Anti-Gradient Control for Mitigating Memorization in Diffusion Models | Unknown | N/A | |
| EINet: Point Cloud Completion via Extrapolation and Interpolation | Unknown | N/A | |
| Bridging the Gap Between Human Motion and Action Semantics via Kinematics Phrases | Unknown | N/A | |
| Dual-level Adaptive Self-Labeling for Novel Class Discovery in Point Cloud Segmentation | Unknown | N/A | |
| ReCON: Training-Free Acceleration for Text-to-Image Synthesis with Retrieval of Concept Prompt Trajectories | Unknown | N/A | |
| AMES: Asymmetric and Memory-Efficient Similarity Estimation for Instance-level Retrieval | Unknown | N/A | |
| TCAN: Animating Human Images with Temporally Consistent Pose Guidance using Diffusion Models | Unknown | N/A | |
| DiffuX2CT: Diffusion Learning to Reconstruct CT Images from Biplanar X-Rays | Unknown | N/A | |
| StyleCity: Large-Scale 3D Urban Scenes Stylization | Unknown | N/A | |
| Free-ATM: Harnessing Free Attention Masks for Representation Learning on Diffusion-Generated Images | Unknown | N/A | |
| ViG-Bias: Visually Grounded Bias Discovery and Mitigation | Unknown | N/A | |
| Learning Unsigned Distance Functions from Multi-view Images with Volume Rendering Priors | Unknown | N/A | |
| DiffBIR: Toward Blind Image Restoration with Generative Diffusion Prior | Unknown | N/A | |
| Repaint123: Fast and High-quality One Image to 3D Generation with Progressive Controllable Repainting | Unknown | N/A | |
| Relightable Neural Actor with Intrinsic Decomposition and Pose Control | Unknown | N/A | |
| Assessing Sample Quality via the Latent Space of Generative Models | Unknown | N/A | |
| Enhancing Vectorized Map Perception with Historical Rasterized Maps | Unknown | N/A | |
| Bidirectional Stereo Image Compression with Cross-Dimensional Entropy Model | Unknown | N/A | |
| Pseudo-keypoint RKHS Learning for Self-supervised 6DoF Pose Estimation | Unknown | N/A | |
| M2D2M: Multi-Motion Generation from Text with Discrete Diffusion Models | Unknown | N/A | |
| Responsible Visual Editing | Unknown | N/A | |
| Consistent 3D Line Mapping | Unknown | N/A | |
| Distributed Active Client Selection With Noisy Clients Using Model Association Scores | Unknown | N/A | |
| PixOOD: Pixel-Level Out-of-Distribution Detection | Unknown | N/A | |
| SeFlow: A Self-Supervised Scene Flow Method in Autonomous Driving | Unknown | N/A | |
| ScaleDreamer: Scalable Text-to-3D Synthesis with Asynchronous Score Distillation | Unknown | N/A | |
| MathVerse: Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems? | Unknown | N/A | |
| Editable Image Elements for Controllable Synthesis | Unknown | N/A | |
| General Geometry-aware Weakly Supervised 3D Object Detection | Unknown | N/A | |
| F-HOI: Toward Fine-grained Semantic-Aligned 3D Human-Object Interactions | Unknown | N/A | |
| SCPNet: Unsupervised Cross-modal Homography Estimation via Intra-modal Self-supervised Learning | Unknown | N/A | |
| EA-VTR: Event-Aware Video-Text Retrieval | Unknown | N/A | |
| GarmentCodeData: A Dataset of 3D Made-to-Measure Garments With Sewing Patterns | Unknown | N/A | |
| POA: Pre-training Once for Models of All Sizes | Unknown | N/A | |
| Towards a Density Preserving Objective Function for Learning on Point Sets | Unknown | N/A | |
| VF-NeRF: Viewshed Fields for Rigid NeRF Registration | Unknown | N/A | |
| RSL-BA: Rolling Shutter Line Bundle Adjustment | Unknown | N/A | |
| Task-Driven Uncertainty Quantification in Inverse Problems via Conformal Prediction | Unknown | N/A | |
| Trainable Highly-expressive Activation Functions | Unknown | N/A | |
| MesonGS: Post-training Compression of 3D Gaussians via Efficient Attribute Transformation | Unknown | N/A | |
| RealViformer: Investigating Attention for Real-World Video Super-Resolution | Unknown | N/A | |
| Do text-free diffusion models learn discriminative visual representations? | Unknown | N/A | |
| Part2Object: Hierarchical Unsupervised 3D Instance Segmentation | Unknown | N/A | |
| Clearer Frames, Anytime: Resolving Velocity Ambiguity in Video Frame Interpolation | Unknown | N/A | |
| Training-Free Model Merging for Multi-target Domain Adaptation | Unknown | N/A | |
| MagDiff: Multi-Alignment Diffusion for High-Fidelity Video Generation and Editing | Unknown | N/A | |
| OV-Uni3DETR: Towards Unified Open-Vocabulary 3D Object Detection via Cycle-Modality Propagation | Unknown | N/A | |
| Vary: Scaling up the Vision Vocabulary for Large Vision-Language Models | Unknown | N/A | |
| Instant 3D Human Avatar Generation using Image Diffusion Models | Unknown | N/A | |
| MotionDirector: Motion Customization of Text-to-Video Diffusion Models | Unknown | N/A | |
| DOCCI: Descriptions of Connected and Contrasting Images | Unknown | N/A | |
| Drag Anything: Motion Control for Anything using Entity Representation | Unknown | N/A | |
| RepVF: A Unified Vector Fields Representation for Multi-task 3D Perception | Unknown | N/A | |
| ConceptExpress: Harnessing Diffusion Models for Single-image Unsupervised Concept Extraction | Unknown | N/A | |
| A Rotation-invariant Texture ViT for Fine-Grained Recognition of Esophageal Cancer Endoscopic Ultrasound Images | Unknown | N/A | |
| EAS-SNN: End-to-End Adaptive Sampling and Representation for Event-based Detection with Recurrent Spiking Neural Networks | Unknown | N/A | |
| AttentionHand: Text-driven Controllable Hand Image Generation for 3D Hand Reconstruction in the Wild | Unknown | N/A | |
| ClusteringSDF: Self-Organized Neural Implicit Surfaces for 3D Decomposition | Unknown | N/A | |
| LogoSticker: Inserting Logos into Diffusion Models for Customized Generation | Unknown | N/A | |
| R3D-AD: Reconstruction via Diffusion for 3D Anomaly Detection | Unknown | N/A | |
| McGrids: Monte Carlo-Driven Adaptive Grids for Iso-Surface Extraction | Unknown | N/A | |
| OccGen: Generative Multi-modal 3D Occupancy Prediction for Autonomous Driving | Unknown | N/A | |
| LEROjD: Lidar Extended Radar-Only Object Detection | Unknown | N/A | |
| ProCreate, Don't Reproduce! Propulsive Energy Diffusion for Creative Generation | Unknown | N/A | |
| Probabilistic Image-Driven Traffic Modeling via Remote Sensing | Unknown | N/A | |
| VideoStudio: Generating Consistent-Content and Multi-Scene Videos | Unknown | N/A | |
| Semantic Residual Prompts for Continual Learning | Unknown | N/A | |
| EgoExo-Fitness: Towards Egocentric and Exocentric Full-Body Action Understanding | Unknown | N/A | |
| DreamView: Injecting View-specific Text Guidance into Text-to-3D Generation | Unknown | N/A | |
| TransCAD: A Hierarchical Transformer for CAD Sequence Inference from Point Clouds | Unknown | N/A | |
| Mixture of Efficient Diffusion Experts Through Automatic Interval and Sub-Network Selection | Unknown | N/A | |
| Learning Modality-agnostic Representation for Semantic Segmentation from Any Modalities | Unknown | N/A | |
| Occupancy as Set of Points | Unknown | N/A | |
| UAV First-Person Viewers Are Radiance Field Learners | Unknown | N/A | |
| Towards Natural Language-Guided Drones: GeoText-1652 Benchmark with Spatial Relation Matching | Unknown | N/A | |
| A Fair Ranking and New Model for Panoptic Scene Graph Generation | Unknown | N/A | |
| ProSub: Probabilistic Open-Set Semi-Supervised Learning with Subspace-Based Out-of-Distribution Detection | Unknown | N/A | |
| DyFADet: Dynamic Feature Aggregation for Temporal Action Detection | Unknown | N/A | |
| Knowledge-enhanced Visual-Language Pretraining for Computational Pathology | Unknown | N/A | |
| Pick-a-back: Selective Device-to-Device Knowledge Transfer in Federated Continual Learning | Unknown | N/A | |
| Situated Instruction Following | Unknown | N/A | |
| M3DBench: Towards Omni 3D Assistant with Interleaved Multi-modal Instructions | Unknown | N/A | |
| FALIP: Visual Prompt as Foveal Attention Boosts CLIP Zero-Shot Performance | Unknown | N/A | |
| Holodepth: Programmable Depth-Varying Projection via Computer-Generated Holography | Unknown | N/A | |
| Efficient Diffusion Transformer with Step-wise Dynamic Attention Mediators | Unknown | N/A | |
| Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance | Unknown | N/A | |
| GalLop: Learning global and local prompts for vision-language models | Unknown | N/A | |
| Depth on Demand: Streaming Dense Depth from a Low Frame Rate Active Sensor | Unknown | N/A | |
| Two-Stage Video Shadow Detection via Temporal-Spatial Adaption | Unknown | N/A | |
| N2F2: Hierarchical Scene Understanding with Nested Neural Feature Fields | Unknown | N/A | |
| Semi-Supervised Video Desnowing Network via Temporal Decoupling Experts and Distribution-Driven Contrastive Regularization | Unknown | N/A | |
| Bidirectional Uncertainty-Based Active Learning for Open-Set Annotation | Unknown | N/A | |
| Lossy Image Compression with Foundation Diffusion Models | Unknown | N/A | |
| UniM2AE: Multi-modal Masked Autoencoders with Unified 3D Representation for 3D Perception in Autonomous Driving | Unknown | N/A | |
| CLIP-DINOiser: Teaching CLIP a few DINO tricks for open-vocabulary semantic segmentation | Unknown | N/A | |
| FMBoost: Boosting Latent Diffusion with Flow Matching | Unknown | N/A | |
| Learning High-resolution Vector Representation from Multi-Camera Images for 3D Object Detection | Unknown | N/A | |
| M^2Depth: Self-supervised Two-Frame Multi-camera Metric Depth Estimation | Unknown | N/A | |
| Shifted Autoencoders for Point Annotation Restoration in Object Counting | Unknown | N/A | |
| An Optimization Framework to Enforce Multi-View Consistency for Texturing 3D Meshes | Unknown | N/A | |
| Kernel Diffusion: An Alternate Approach to Blind Deconvolution | Unknown | N/A | |
| FoundPose: Unseen Object Pose Estimation with Foundation Features | Unknown | N/A | |
| LNL+K: Enhancing Learning with Noisy Labels Through Noise Source Knowledge Integration | Unknown | N/A | |
| Diffusion Models as Data Mining Tools | Unknown | N/A | |
| Graph Neural Network Causal Explanation via Neural Causal Models | Unknown | N/A | |
| SAMFusion: Sensor-Adaptive Multimodal Fusion for 3D Object Detection in Adverse Weather | Unknown | N/A | |
| SPHINX: A Mixer of Weights, Visual Embeddings and Image Scales for Multi-modal Large Language Models | Unknown | N/A | |
| PathMMU: A Massive Multimodal Expert-Level Benchmark for Understanding and Reasoning in Pathology | Unknown | N/A | |
| Improving Adversarial Transferability via Model Alignment | Unknown | N/A | |
| RealGen: Retrieval Augmented Generation for Controllable Traffic Scenarios | Unknown | N/A | |
| ADen: Adaptive Density Representations for Sparse-view Camera Pose Estimation | Unknown | N/A | |
| Embodied Understanding of Driving Scenarios | Unknown | N/A | |
| NeuroPictor: Refining fMRI-to-Image Reconstruction via Multi-individual Pretraining and Multi-level Modulation | Unknown | N/A | |
| ViLA: Efficient Video-Language Alignment for Video Question Answering | Unknown | N/A | |
| OpenIns3D: Snap and Lookup for 3D Open-vocabulary Instance Segmentation | Unknown | N/A | |
| Factorizing Text-to-Video Generation by Explicit Image Conditioning | Unknown | N/A | |
| MobileDiffusion: Instant Text-to-Image Generation on Mobile Devices | Unknown | N/A | |
| Open-Set Biometrics: Beyond Good Closed-Set Models | Unknown | N/A | |
| Osmosis: RGBD Diffusion Prior for Underwater Image Restoration | Unknown | N/A | |
| Towards Adaptive Pseudo-label Learning for Semi-Supervised Temporal Action Localization | Unknown | N/A | |
| Computing the Lipschitz constant needed for fast scene recovery from CASSI measurements | Unknown | N/A | |
| DatasetNeRF: Efficient 3D-aware Data Factory with Generative Radiance Fields | Unknown | N/A | |
| Flowed Time of Flight Radiance Fields | Unknown | N/A | |
| Cut out the Middleman: Revisiting Pose-based Gait Recognition | Unknown | N/A | |
| 3D-GOI: 3D GAN Omni-Inversion for Multifaceted and Multi-object Editing | Unknown | N/A | |
| Fast Registration of Photorealistic Avatars for VR Facial Animation | Unknown | N/A | |
| CoPT: Unsupervised Domain Adaptive Segmentation using Domain-Agnostic Text Embeddings | Unknown | N/A | |
| HiFi-Score: Fine-grained Image Description Evaluation with Hierarchical Parsing Graphs | Unknown | N/A | |
| FedHARM: Harmonizing Model Architectural Diversity in Federated Learning | Unknown | N/A | |
| Thinking Outside the BBox: Unconstrained Generative Object Compositing | Unknown | N/A | |
| EAGLES: Efficient Accelerated 3D Gaussians with Lightweight EncodingS | Unknown | N/A | |
| Improving Geo-diversity of Generated Images with Contextualized Vendi Score Guidance | Unknown | N/A | |
| TCLC-GS: Tightly Coupled LiDAR-Camera Gaussian Splatting for Autonomous Driving | Unknown | N/A | |
| RT-Pose: A 4D Radar-Tensor based 3D Human Pose Estimation and Localization Benchmark | Unknown | N/A | |
| EditShield: Protecting Unauthorized Image Editing by Instruction-guided Diffusion Models | Unknown | N/A | |
| RICA^2: Rubric-Informed, Calibrated Assessment of Actions | Unknown | N/A | |
| Commonly Interesting Images | Unknown | N/A | |
| Contrasting Deepfakes Diffusion via Contrastive Learning and Global-Local Similarities | Unknown | N/A | |
| CriSp: Leveraging Tread Depth Maps for Enhanced Crime-Scene Shoeprint Matching | Unknown | N/A | |
| Caltech Aerial RGB-Thermal Dataset in the Wild | Unknown | N/A | |
| Diffusion Soup: Model Merging for Text-to-Image Diffusion Models | Unknown | N/A | |
| CityGuessr: City-Level Video Geo-Localization on a Global Scale | Unknown | N/A | |
| Bayesian Detector Combination for Object Detection with Crowdsourced Annotations | Unknown | N/A | |
| Revising Densification in Gaussian Splatting | Unknown | N/A | |
| FlexiEdit: Frequency-Aware Latent Refinement for Enhanced Non-Rigid Editing | Unknown | N/A | |
| Text Motion Translator: A Bi-Directional Model for Enhanced 3D Human Motion Generation from Open-Vocabulary Descriptions | Unknown | N/A | |
| UL-VIO: Ultra-lightweight Visual-Inertial Odometry with Noise Robust Test-time Adaptation | Unknown | N/A | |
| PolyOculus: Simultaneous Multi-view Image-based Novel View Synthesis | Unknown | N/A | |
| A Graph-Based Approach for Category-Agnostic Pose Estimation | Unknown | N/A | |
| Depth-guided NeRF Training via Earth Mover’s Distance | Unknown | N/A | |
| INTRA: Interaction Relationship-aware Weakly Supervised Affordance Grounding | Unknown | N/A | |
| DEPICT: Diffusion-Enabled Permutation Importance for Image Classification Tasks | Unknown | N/A | |
| Diagnosing and Re-learning for Balanced Multimodal Learning | Unknown | N/A | |
| Meerkat: Audio-Visual Large Language Model for Grounding in Space and Time | Unknown | N/A | |
| Elucidating the Hierarchical Nature of Behavior with Masked Autoencoders | Unknown | N/A | |
| Contribution-based Low-Rank Adaptation with Pre-training Model for Real Image Restoration | Unknown | N/A | |
| BeyondScene: Higher-Resolution Human-Centric Scene Generation With Pretrained Diffusion | Unknown | N/A | |
| MMEarth: Exploring Multi-Modal Pretext Tasks For Geospatial Representation Learning | Unknown | N/A | |
| Ferret-UI: Grounded Mobile UI Understanding with Multimodal LLMs | Unknown | N/A | |
| Bridging the Pathology Domain Gap: Efficiently Adapting CLIP for Pathology Image Analysis with Limited Labeled Data | Unknown | N/A | |
| AugUndo: Scaling Up Augmentations for Monocular Depth Completion and Estimation | Unknown | N/A | |
| CARB-Net: Camera-Assisted Radar-Based Network for Vulnerable Road User Detection | Unknown | N/A | |
| SAH-SCI: Self-Supervised Adapter for Efficient Hyperspectral Snapshot Compressive Imaging | Unknown | N/A | |
| Minimalist Vision with Freeform Pixels | Unknown | N/A | |
| All You Need is Your Voice: Emotional Face Representation with Audio Perspective for Emotional Talking Face Generation | Unknown | N/A | |
| LatentEditor: Text Driven Local Editing of 3D Scenes | Unknown | N/A | |
| POET: Prompt Offset Tuning for Continual Human Action Adaptation | Unknown | N/A | |
| IG Captioner: Information Gain Captioners are Strong Zero-shot Classifiers | Unknown | N/A | |
| Cross-Domain Semantic Segmentation on Inconsistent Taxonomy using VLMs | Unknown | N/A | |
| TrafficNight: An Aerial Multimodal Benchmark For Nighttime Vehicle Surveillance | Unknown | N/A | |
| Towards Open Domain Text-Driven Synthesis of Multi-Person Motions | Unknown | N/A | |
| Loc3Diff: Local Diffusion for 3D Human Head Synthesis and Editing | Unknown | N/A | |
| Generative End-to-End Autonomous Driving | Unknown | N/A | |
| Learning to Distinguish Samples for Generalized Category Discovery | Unknown | N/A | |
| COM Kitchens: An Unedited Overhead-view Procedural Videos Dataset as a Vision-Language Benchmark | Unknown | N/A | |
| Diff-Reg: Diffusion Model in Doubly Stochastic Matrix Space for Registration Problem | Unknown | N/A | |
| WBP: Training-time Backdoor Attacks through Hardware-based Weight Bit Poisoning | Unknown | N/A | |
| Towards Dual Transparent Liquid Level Estimation in Biomedical Lab: Dataset, Methods and Practice | Unknown | N/A | |
| Encapsulating Knowledge in One Prompt | Unknown | N/A | |
| Delving into Adversarial Robustness on Document Tampering Localization | Unknown | N/A | |
| Adaptive Selection of Sampling-Reconstruction in Fourier Compressed Sensing | Unknown | N/A | |
| Confidence-Based Iterative Generation for Real-World Image Super-Resolution | Unknown | N/A | |
| Seeing Faces in Things: A Model and Dataset for Pareidolia | Unknown | N/A | |
| Gaussian Frosting: Editable Complex Radiance Fields with Real-Time Rendering | Unknown | N/A | |
| AMD: Automatic Multi-step Distillation of Large-scale Vision Models | Unknown | N/A | |
| FairViT: Fair Vision Transformer via Adaptive Masking | Unknown | N/A | |
| VisionLLaMA: A Unified LLaMA Backbone for Vision Tasks | Unknown | N/A | |
| Frugal 3D Point Cloud Model Training via Progressive Near Point Filtering and Fused Aggregation | Unknown | N/A | |
| HVCLIP: High-dimensional Vector in CLIP for Unsupervised Domain Adaptation | Unknown | N/A | |
| Improving 3D Semi-supervised Learning by Effectively Utilizing All Unlabelled Data | Unknown | N/A | |
| Expanding Scene Graph Boundaries: Fully Open-vocabulary Scene Graph Generation via Visual-Concept Alignment and Retention | Unknown | N/A | |
| MART: MultiscAle Relational Transformer Networks for Multi-agent Trajectory Prediction | Unknown | N/A | |
| Investigating Style Similarity in Diffusion Models | Unknown | N/A | |
| JDT3D: Addressing the Gaps in LiDAR-Based Tracking-by-Attention | Unknown | N/A | |
| MagicMirror: Fast and High-Quality Avatar Generation with Constrained Search Space | Unknown | N/A | |
| EntAugment: Entropy-Driven Adaptive Data Augmentation Framework for Image Classification | Unknown | N/A | |
| SPARO: Selective Attention for Robust and Compositional Transformer Encodings for Vision | Unknown | N/A | |
| Out-of-Bounding-Box Triggers: A Stealthy Approach to Cheat Object Detectors | Unknown | N/A | |
| GTMS: A Gradient-driven Tree-guided Mask-free Referring Image Segmentation Method | Unknown | N/A | |
| SUP-NeRF: A Streamlined Unification of Pose Estimation and NeRF for Monocular 3D Object Reconstruction | Unknown | N/A | |
| VQA-Diff: Exploiting VQA and Diffusion for Zero-Shot Image-to-3D Vehicle Asset Generation in Autonomous Driving | Unknown | N/A | |
| Leveraging Text Localization for Scene Text Removal via Text-aware Masked Image Modeling | Unknown | N/A | |
| Unmasking Bias in Diffusion Model Training | Unknown | N/A | |
| Multimodal Label Relevance Ranking via Reinforcement Learning | Unknown | N/A | |
| A Simple Background Augmentation Method for Object Detection with Diffusion Model | Unknown | N/A | |
| BlinkVision: A Benchmark for Optical Flow, Scene Flow and Point Tracking Estimation using RGB Frames and Events | Unknown | N/A | |
| A Unified Anomaly Synthesis Strategy with Gradient Ascent for Industrial Anomaly Detection and Localization | Unknown | N/A | |
| Deep Polarization Cues for Single-shot Shape and Subsurface Scattering Estimation | Unknown | N/A | |
| Sparse Refinement for Efficient High-Resolution Semantic Segmentation | Unknown | N/A | |
| Safeguard Text-to-Image Diffusion Models with Human Feedback Inversion | Unknown | N/A | |
| An Explainable Vision Question Answer Model via Diffusion Chain-of-Thought | Unknown | N/A | |
| Fast Sprite Decomposition from Animated Graphics | Unknown | N/A | |
| Learning Unified Reference Representation for Unsupervised Multi-class Anomaly Detection | Unknown | N/A | |
| IRSAM: Advancing Segment Anything Model for Infrared Small Target Detection | Unknown | N/A | |
| PatchRefiner: Leveraging Synthetic Data for Real-Domain High-Resolution Monocular Metric Depth Estimation | Unknown | N/A | |
| Towards Robust Event-based Networks for Nighttime via Unpaired Day-to-Night Event Translation | Unknown | N/A | |
| CLAMP-ViT: Contrastive Data-Free Learning for Adaptive Post-Training Quantization of ViTs | Unknown | N/A | |
| UGG: Unified Generative Grasping | Unknown | N/A | |
| A Riemannian Approach for Spatiotemporal Analysis and Generation of 4D Tree-shaped Structures | Unknown | N/A | |
| FrePolad: Frequency-Rectified Point Latent Diffusion for Point Cloud Generation | Unknown | N/A | |
| Learning to Detect Multi-class Anomalies with Just One Normal Image Prompt | Unknown | N/A | |
| GAMMA-FACE: GAussian Mixture Models Amend Diffusion Models for Bias Mitigation in Face Images | Unknown | N/A | |
| Pseudo-RIS: Distinctive Pseudo-supervision Generation for Referring Image Segmentation | Unknown | N/A | |
| Training-free Composite Scene Generation for Layout-to-Image Synthesis | Unknown | N/A | |
| Robustness Preserving Fine-tuning using Neuron Importance | Unknown | N/A | |
| ProxyCLIP: Proxy Attention Improves CLIP for Open-Vocabulary Segmentation | Unknown | N/A | |
| PEA-Diffusion: Parameter-Efficient Adapter with Knowledge Distillation in non-English Text-to-Image Generation | Unknown | N/A | |
| Similarity of Neural Architectures using Adversarial Attack Transferability | Unknown | N/A | |
| Dual-Rain: Video Rain Removal using Assertive and Gentle Teachers | Unknown | N/A | |
| PMT: Progressive Mean Teacher via Exploring Temporal Consistency for Semi-Supervised Medical Image Segmentation | Unknown | N/A | |
| Unsupervised Variational Translator for Bridging Image Restoration and High-Level Vision Tasks | Unknown | N/A | |
| Fast Point Cloud Geometry Compression with Context-based Residual Coding and INR-based Refinement | Unknown | N/A | |
| Scene-Conditional 3D Object Stylization and Composition | Unknown | N/A | |
| GenView: Enhancing View Quality with Pretrained Generative Model for Self-Supervised Learning | Unknown | N/A | |
| Revisit Anything: Visual Place Recognition via Image Segment Retrieval | Unknown | N/A | |
| Semantic Diversity-aware Prototype-based Learning for Unbiased Scene Graph Generation | Unknown | N/A | |
| DiffuMatting: Synthesizing Arbitrary Objects with Matting-level Annotation | Unknown | N/A | |
| Self-Guided Generation of Minority Samples Using Diffusion Models | Unknown | N/A | |
| DEVIAS: Learning Disentangled Video Representations of Action and Scene | Unknown | N/A | |
| RoomTex: Texturing Compositional Indoor Scenes via Iterative Inpainting | Unknown | N/A | |
| Class-Agnostic Object Counting with Text-to-Image Diffusion Model | Unknown | N/A | |
| Mask2Map: Vectorized HD Map Construction Using Bird's Eye View Segmentation Masks | Unknown | N/A | |
| Forbes: Face Obfuscation Rendering via Backpropagation Refinement Scheme | Unknown | N/A | |
| Masked Generative Video-to-Audio Transformers with Enhanced Synchronicity | Unknown | N/A | |
| Information Bottleneck Based Data Correction in Continual Learning | Unknown | N/A | |
| A Watermark-Conditioned Diffusion Model for IP Protection | Unknown | N/A | |
| Finding NeMo: Negative-mined Mosaic Augmentation for Referring Image Segmentation | Unknown | N/A | |
| SAFT: Towards Out-of-Distribution Generalization in Fine-Tuning | Unknown | N/A | |
| FTBC: Forward Temporal Bias Correction for Optimizing ANN-SNN Conversion | Unknown | N/A | |
| Centering the Value of Every Modality: Towards Efficient and Resilient Modality-agnostic Semantic Segmentation | Unknown | N/A | |
| On Spectral Properties of Gradient-based Explanation Methods | Unknown | N/A | |
| DIAL: Dense Image-text ALignment for Weakly Supervised Semantic Segmentation | Unknown | N/A | |
| Generalizing to Unseen Domains via Text-guided Augmentation | Unknown | N/A | |
| Contextual Correspondence Matters: Bidirectional Graph Matching for Video Summarization | Unknown | N/A | |
| VCP-CLIP: A visual context prompting model for zero-shot anomaly segmentation | Unknown | N/A | |
| Lost in Translation: Latent Concept Misalignment in Text-to-Image Diffusion Models | Unknown | N/A | |
| Zero-shot Text-guided Infinite Image Synthesis with LLM guidance | Unknown | N/A | |
| Learning Dual-Level Deformable Implicit Representation for Real-World Scale Arbitrary Super-Resolution | Unknown | N/A | |
| Boosting Gaze Object Prediction via Pixel-level Supervision from Vision Foundation Model | Unknown | N/A | |
| Pro2SAM: Mask Prompt to SAM with Grid Points for Weakly Supervised Object Localization | Unknown | N/A | |
| Adaptive Multi-head Contrastive Learning | Unknown | N/A | |
| Rotated Orthographic Projection for Self-Supervised 3D Human Pose Estimation | Unknown | N/A | |
| Easing 3D Pattern Reasoning with Side-view Features for Semantic Scene Completion | Unknown | N/A | |
| MO-EMT-NAS: Multi-Objective Continuous Transfer of Architectural Knowledge Between Tasks from Different Datasets | Unknown | N/A | |
| Text-to-Sticker: Style Tailoring Latent Diffusion Models for Human Expression | Unknown | N/A | |
| Adaptive Annealing for Robust Averaging | Unknown | N/A | |
| MaxMI: A Maximal Mutual Information Criterion for Manipulation Concept Discovery | Unknown | N/A | |
| High-Quality Mesh Blendshape Generation from Face Videos via Neural Inverse Rendering | Unknown | N/A | |
| Early Anticipation of Driving Maneuvers | Unknown | N/A | |
| SG-NeRF: Neural Surface Reconstruction with Scene Graph Optimization | Unknown | N/A | |
| On the Evaluation Consistency of Attribution-based Explanations | Unknown | N/A | |
| Unified Embedding Alignment for Open-Vocabulary Video Instance Segmentation | Unknown | N/A | |
| InfoNorm: Mutual Information Shaping of Normals for Sparse-View Reconstruction | Unknown | N/A | |
| DreamReward: Aligning Human Preference in Text-to-3D Generation | Unknown | N/A | |
| Action2Sound: Ambient-Aware Generation of Action Sounds from Egocentric Videos | Unknown | N/A | |
| MTMamba: Enhancing Multi-Task Dense Scene Understanding by Mamba-Based Decoders | Unknown | N/A | |
| Skeleton-based Group Activity Recognition via Spatial-Temporal Panoramic Graph | Unknown | N/A | |
| VITATECS: A Diagnostic Dataset for Temporal Concept Understanding of Video-Language Models | Unknown | N/A | |
| Learning a Dynamic Privacy-preserving Camera Robust to Inversion Attacks | Unknown | N/A | |
| CadVLM: Bridging Language and Vision in the Generation of Parametric CAD Sketches | Unknown | N/A | |
| Towards Image Ambient Lighting Normalization | Unknown | N/A | |
| FedHide: Federated Learning by Hiding in the Neighbors | Unknown | N/A | |
| Self-Cooperation Knowledge Distillation for Novel Class Discovery | Unknown | N/A | |
| SelEx: Self-Expertise in Fine-Grained Generalized Category Discovery | Unknown | N/A | |
| EventBind: Learning a Unified Representation to Bind Them All for Event-based Open-world Understanding | Unknown | N/A | |
| GLAD: Towards Better Reconstruction with Global and Local Adaptive Diffusion Models for Unsupervised Anomaly Detection | Unknown | N/A | |
| Are Synthetic Data Useful for Egocentric Hand-Object Interaction Detection? | Unknown | N/A | |
| A Comparative Study of Image Restoration Networks for General Backbone Network Design | Unknown | N/A | |
| HoloADMM: High-Quality Holographic Complex Field Recovery | Unknown | N/A | |
| Synthesizing Time-varying BRDFs via Latent Space | Unknown | N/A | |
| Fundamental Matrix Estimation Using Relative Depths | Unknown | N/A | |
| MTaDCS: Moving Trace and Feature Density-based Confidence Sample Selection under Label Noise | Unknown | N/A | |
| Towards Open-World Object-based Anomaly Detection via Self-Supervised Outlier Synthesis | Unknown | N/A | |
| DataDream: Few-shot Guided Dataset Generation | Unknown | N/A | |
| LPViT: Low-Power Semi-structured Pruning for Vision Transformers | Unknown | N/A | |
| Weighted Ensemble Models Are Strong Continual Learners | Unknown | N/A | |
| GGRt: Towards Generalizable 3D Gaussians without Pose Priors in Real-Time | Unknown | N/A | |
| Learning Equilibrium Transformation for Gamut Expansion and Color Restoration | Unknown | N/A | |
| Physics-informed Knowledge Transfer for Underwater Monocular Depth Estimation | Unknown | N/A | |
| Robust Nearest Neighbors for Source-Free Domain Adaptation under Class Distribution Shift | Unknown | N/A | |
| Chains of Diffusion Models | Unknown | N/A | |
| Feature Diversification and Adaptation for Federated Domain Generalization | Unknown | N/A | |
| TP2O: Creative Text Pair-to-Object Generation using Balance Swap-Sampling | Unknown | N/A | |
| Dataset Distillation by Automatic Training Trajectories | Unknown | N/A | |
| RoDUS: Robust Decomposition of Static and Dynamic Elements in Urban Scenes | Unknown | N/A | |
| RecurrentBEV: A Long-term Temporal Fusion Framework for Multi-view 3D Detection | Unknown | N/A | |
| Learning Neural Deformation Representation for 4D Dynamic Shape Generation | Unknown | N/A | |
| Synchronization is All You Need: Exocentric-to-Egocentric Transfer for Temporal Action Segmentation with Unlabeled Synchronized Video Pairs | Unknown | N/A | |
| LAPT: Label-driven Automated Prompt Tuning for OOD Detection with Vision-Language Models | Unknown | N/A | |
| Domain Shifting: A Generalized Solution for Heterogeneous Cross-Modality Person Re-Identification | Unknown | N/A | |
| Self-Supervised Video Desmoking for Laparoscopic Surgery | Unknown | N/A | |
| Removing Rows and Columns of Tokens in Vision Transformer enables Faster Dense Prediction without Retraining | Unknown | N/A | |
| Continuity Preserving Online CenterLine Graph Learning | Unknown | N/A | |
| Decomposition of Neural Discrete Representations for Large-Scale 3D Mapping | Unknown | N/A | |
| MirrorGaussian: Reflecting 3D Gaussians for Reconstructing Mirror Reflections | Unknown | N/A | |
| Leveraging Representations from Intermediate Encoder-blocks for Synthetic Image Detection | Unknown | N/A | |
| AnatoMask: Enhancing Medical Image Segmentation with Reconstruction-guided Self-masking | Unknown | N/A | |
| HSR: Holistic 3D Human-Scene Reconstruction from Monocular Videos | Unknown | N/A | |
| Online Video Quality Enhancement with Spatial-Temporal Look-up Tables | Unknown | N/A | |
| PARIS3D: Reasoning-based 3D Part Segmentation Using Large Multimodal Model | Unknown | N/A | |
| Self-Rectifying Diffusion Sampling with Perturbed-Attention Guidance | Unknown | N/A | |
| Oulu Remote-photoplethysmography Physical Domain Attacks Database (ORPDAD) | Unknown | N/A | |
| Leveraging Imperfect Restoration for Data Availability Attack | Unknown | N/A | |
| DoubleTake: Geometry Guided Depth Estimation | Unknown | N/A | |
| Empowering Embodied Visual Tracking with Visual Foundation Models and Offline RL | Unknown | N/A | |
| Images are Achilles' Heel of Alignment: Exploiting Visual Vulnerabilities for Jailbreaking Multimodal Large Language Models | Unknown | N/A | |
| Close, But Not There: Boosting Geographic Distance Sensitivity in Visual Place Recognition | Unknown | N/A | |
| HiFi-123: Towards High-fidelity One Image to 3D Content Generation | Unknown | N/A | |
| Revisiting Adaptive Cellular Recognition Under Domain Shifts: A Contextual Correspondence View | Unknown | N/A | |
| Good Teachers Explain: Explanation-Enhanced Knowledge Distillation | Unknown | N/A | |
| FRDiff: Feature Reuse for Universal Training-free Acceleration of Diffusion Models | Unknown | N/A | |
| Möbius Transform for Mitigating Perspective Distortions in Representation Learning | Unknown | N/A | |
| TAG: Text Prompt Augmentation for Zero-Shot Out-of-Distribution Detection | Unknown | N/A | |
| CVT-Occ: Cost Volume Temporal Fusion for 3D Occupancy Prediction | Unknown | N/A | |
| Continual Learning and Unknown Object Discovery in 3D Scenes via Self-Distillation | Unknown | N/A | |
| DiffCD: A Symmetric Differentiable Chamfer Distance for Neural Implicit Surface Fitting | Unknown | N/A | |
| Lost and Found: Overcoming Detector Failures in Online Multi-Object Tracking | Unknown | N/A | |
| Local Occupancy-Enhanced Object Grasping with Multiple Triplanar Projection | Unknown | N/A | |
| Region-Native Visual Tokenization | Unknown | N/A | |
| The Lottery Ticket Hypothesis in Denoising: Towards Semantic-Driven Initialization | Unknown | N/A | |
| Diffusion for Out-of-Distribution Detection on Road Scenes and Beyond | Unknown | N/A | |
| Rethinking Directional Parameterization in Neural Implicit Surface Reconstruction | Unknown | N/A | |
| A Comprehensive Study of Multimodal Large Language Models for Image Quality Assessment | Unknown | N/A | |
| Ref-AVS: Refer and Segment Objects in Audio-Visual Scenes | Unknown | N/A | |
| DreamScene: 3D Gaussian-based Text-to-3D Scene Generation via Formation Pattern Sampling | Unknown | N/A | |
| Multi-modal Crowd Counting via a Broker Modality | Unknown | N/A | |
| FastPCI: Motion-Structure Guided Fast Point Cloud Frame Interpolation | Unknown | N/A | |
| Made to Order: Discovering monotonic temporal changes via self-supervised video ordering | Unknown | N/A | |
| MeshVPR: Citywide Visual Place Recognition Using 3D Meshes | Unknown | N/A | |
| Can Textual Semantics Mitigate Sounding Object Segmentation Preference? | Unknown | N/A | |
| ViPer: Visual Personalization of Generative Models via Individual Preference Learning | Unknown | N/A | |
| MLPHand: Real Time Multi-View 3D Hand Reconstruction via MLP Modeling | Unknown | N/A | |
| LHRS-Bot: Empowering Remote Sensing with VGI-Enhanced Large Multimodal Language Model | Unknown | N/A | |
| How Far Can a 1-Pixel Camera Go? Solving Vision Tasks using Photoreceptors and Computationally Designed Visual Morphology | Unknown | N/A | |
| MONTRAGE: Monitoring Training for Attribution of Generative Diffusion Models | Unknown | N/A | |
| Affective Visual Dialog: A Large-Scale Benchmark for Emotional Reasoning Based on Visually Grounded Conversations | Unknown | N/A | |
| Self-supervised visual learning from interactions with objects | Unknown | N/A | |
| OP-Align: Object-level and Part-level Alignment for Self-supervised Category-level Articulated Object Pose Estimation | Unknown | N/A | |
| BAFFLE: A Baseline of Backpropagation-Free Federated Learning | Unknown | N/A | |
| OmniNOCS: A unified NOCS dataset and model for 3D lifting of 2D objects | Unknown | N/A | |
| Omni6DPose: A Benchmark and Model for Universal 6D Object Pose Estimation and Tracking | Unknown | N/A | |
| Diverse Text-to-3D Synthesis with Augmented Text Embedding | Unknown | N/A | |
| LLMCO4MR: LLMs-aided Neural Combinatorial Optimization for Ancient Manuscript Restoration from Fragments with Case Studies on Dunhuang | Unknown | N/A | |
| AdversariaLeak: External Information Leakage Attack Using Adversarial Samples on Face Recognition Systems | Unknown | N/A | |
| SphereHead: Stable 3D Full-head Synthesis with Spherical Tri-plane Representation | Unknown | N/A | |
| Beyond Pixels: Semi-Supervised Semantic Segmentation with a Multi-scale Patch-based Multi-Label Classifier | Unknown | N/A | |
| Enhanced Sparsification via Stimulative Training | Unknown | N/A | |
| Solving the inverse problem of microscopy deconvolution with a residual Beylkin-Coifman-Rokhlin neural network | Unknown | N/A | |
| FreeZe: Training-free zero-shot 6D pose estimation with geometric and vision foundation models | Unknown | N/A | |
| Weighting Pseudo-Labels via High-Activation Feature Index Similarity and Object Detection for Semi-Supervised Segmentation | Unknown | N/A | |
| WTS: A Pedestrian-Centric Traffic Video Dataset for Fine-grained Spatial-Temporal Understanding | Unknown | N/A | |
| Spiking Wavelet Transformer | Unknown | N/A | |
| WAVE: Warping DDIM Inversion Features for Zero-shot Text-to-Video Editing | Unknown | N/A | |
| PDT Uav Target Detection Dataset for Pests and Diseases Tree | Unknown | N/A | |
| Any Target Can be Offense: Adversarial Example Generation via Generalized Latent Infection | Unknown | N/A | |
| COD: Learning Conditional Invariant Representation for Domain Adaptation Regression | Unknown | N/A | |
| RANRAC: Robust Neural Scene Representations via Random Ray Consensus | Unknown | N/A | |
| LayerDiff: Exploring Text-guided Multi-layered Composable Image Synthesis via Layer-Collaborative Diffusion Model | Unknown | N/A | |
| Four Ways to Improve Verbo-visual Fusion for Dense 3D Visual Grounding | Unknown | N/A | |
| SIMBA: Split Inference - Mechanisms, Benchmarks and Attacks | Unknown | N/A | |
| DQ-DETR: DETR with Dynamic Query for Tiny Object Detection | Unknown | N/A | |
| SWAG: Splatting in the Wild images with Appearance-conditioned Gaussians | Unknown | N/A | |
| Gaussian in the wild: 3D Gaussian Splatting for Unconstrained Image Collections | Unknown | N/A | |
| Few-shot Defect Image Generation based on Consistency Modeling | Unknown | N/A | |
| CLIP-DPO: Vision-Language Models as a Source of Preference for Fixing Hallucinations in LVLMs | Unknown | N/A | |
| Video Editing via Factorized Diffusion Distillation | Unknown | N/A | |
| Trackastra: Transformer-based cell tracking for live-cell microscopy | Unknown | N/A | |
| SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers | Unknown | N/A | |
| Learn to Memorize and to Forget: A Continual Learning Perspective of Dynamic SLAM | Unknown | N/A | |
| Forecasting Future Videos from Novel Views via Disentangled 3D Scene Representation | Unknown | N/A | |
| GMM-IKRS: Gaussian Mixture Models for Interpretable Keypoint Refinement and Scoring | Unknown | N/A | |
| Get Your Embedding Space in Order: Domain-Adaptive Regression for Forest Monitoring | Unknown | N/A | |
| ObjectDrop: Bootstrapping Counterfactuals for Photorealistic Object Removal and Insertion | Unknown | N/A | |
| Curved Diffusion: A Generative Model With Optical Geometry Control | Unknown | N/A | |
| CoDA: Instructive Chain-of-Domain Adaptation with Severity-Aware Visual Prompt Tuning | Unknown | N/A | |
| OTSeg: Multi-prompt Sinkhorn Attention for Zero-Shot Semantic Segmentation | Unknown | N/A | |
| Skeleton Recall Loss for Connectivity Conserving and Resource Efficient Segmentation of Thin Tubular Structures | Unknown | N/A | |
| Conceptual Codebook Learning for Vision-Language Models | Unknown | N/A | |
| AnimateMe: 4D Facial Expressions via Diffusion Models | Unknown | N/A | |
| LingoQA: Video Question Answering for Autonomous Driving | Unknown | N/A | |
| HaloQuest: A Visual Hallucination Dataset for Advancing Multimodal Reasoning | Unknown | N/A | |
| LATTE3D: Large-scale Amortized Text-To-Enhanced3D Synthesis | Unknown | N/A | |
| Unveiling and Mitigating Memorization in Text-to-image Diffusion Models through Cross Attention | Unknown | N/A | |
| PreSight: Enhancing Autonomous Vehicle Perception with City-Scale NeRF Priors | Unknown | N/A | |
| iNeMo: Incremental Neural Mesh Models for Robust Class-Incremental Learning | Unknown | N/A | |
| Context Diffusion: In-Context Aware Image Generation | Unknown | N/A | |
| Pose Guided Fine-Grained Sign Language Video Generation | Unknown | N/A | |
| RAP: Retrieval-Augmented Planner for Adaptive Procedure Planning in Instructional Videos | Unknown | N/A | |
| Certifiably Robust Image Watermark | Unknown | N/A | |
| Discover-then-Name: Task-Agnostic Concept Bottlenecks via Automated Concept Discovery | Unknown | N/A | |
| Online Zero-Shot Classification with CLIP | Unknown | N/A | |
| SeA: Semantic Adversarial Augmentation for Last Layer Features from Unsupervised Representation Learning | Unknown | N/A | |
| Unlocking the Potential of Federated Learning: The Symphony of Dataset Distillation via Deep Generative Latents | Unknown | N/A | |
| BRIDGE: Bridging Gaps in Image Captioning Evaluation with Stronger Visual Cues | Unknown | N/A | |
| Enhancing Plausibility Evaluation for Generated Designs with Denoising Autoencoder | Unknown | N/A | |
| Weakly-Supervised 3D Hand Reconstruction with Knowledge Prior and Uncertainty Guidance | Unknown | N/A | |
| 3D Reconstruction of Objects in Hands without Real World 3D Supervision | Unknown | N/A | |
| To Supervise or Not to Supervise: Understanding and Addressing the Key Challenges of Point Cloud Transfer Learning | Unknown | N/A | |
| Mitigating Perspective Distortion-induced Shape Ambiguity in Image Crops | Unknown | N/A | |
| Parameterized Quasi-Physical Simulators for Dexterous Manipulations Transfer | Unknown | N/A | |
| Optimization-based Uncertainty Attribution Via Learning Informative Perturbations | Unknown | N/A | |
| Semi-supervised Segmentation of Histopathology Images with Noise-Aware Topological Consistency | Unknown | N/A | |
| Adaptive Compressed Sensing with Diffusion-Based Posterior Sampling | Unknown | N/A | |
| MetaAT: Active Testing for Label-Efficient Evaluation of Dense Recognition Tasks | Unknown | N/A | |
| Explorative Inbetweening of Time and Space | Unknown | N/A | |
| A Diffusion Model for Simulation Ready Coronary Anatomy with Morpho-skeletal Control | Unknown | N/A | |
| Learning to Make Keypoints Sub-Pixel Accurate | Unknown | N/A | |
| Imaging with Confidence: Uncertainty Quantification for High-dimensional Undersampled MR Images | Unknown | N/A | |
| Generalizable Human Gaussians for Sparse View Synthesis | Unknown | N/A | |
| Evaluating the Adversarial Robustness of Semantic Segmentation: Trying Harder Pays Off | Unknown | N/A | |
| GSD: View-Guided Gaussian Splatting Diffusion for 3D Reconstruction | Unknown | N/A | |
| AdaDiff: Accelerating Diffusion Models through Step-Wise Adaptive Computation | Unknown | N/A | |
| PFedEdit: Personalized Federated Learning via Automated Model Editing | Unknown | N/A | |
| De-Confusing Pseudo-Labels in Source-Free Domain Adaptation | Unknown | N/A | |
| Towards Reliable Evaluation and Fast Training of Robust Semantic Segmentation Models | Unknown | N/A | |
| Merging and Splitting Diffusion Paths for Semantically Coherent Panoramas | Unknown | N/A | |
| Animal Avatars: Reconstructing Animatable 3D Animals from Casual Videos | Unknown | N/A | |
| Perceptual Evaluation of Audio-Visual Synchrony Grounded in Viewers’ Opinion Scores | Unknown | N/A | |
| MMVR: Millimeter-wave Multi-View Radar Dataset and Benchmark for Indoor Perception | Unknown | N/A | |
| EpipolarGAN: Omnidirectional Image Synthesis with Explicit Camera Control | Unknown | N/A | |
| Photorealistic Video Generation with Diffusion Models | Unknown | N/A | |
| RAVE: Residual Vector Embedding for CLIP-Guided Backlit Image Enhancement | Unknown | N/A | |
| TIBET: Identifying and Evaluating Biases in Text-to-Image Generative Models | Unknown | N/A | |
| Object-Aware Query Perturbation for Cross-Modal Image-Text Retrieval | Unknown | N/A | |
| Ex2Eg-MAE: A Framework for Adaptation of Exocentric Video Masked Autoencoders for Egocentric Social Role Understanding | Unknown | N/A | |
| Self-Supervised Audio-Visual Soundscape Stylization | Unknown | N/A | |
| Meta-optimized Angular Margin Contrastive Framework for Video-Language Representation Learning | Unknown | N/A | |
| Source-Free Domain-Invariant Performance Prediction | Unknown | N/A | |
| Improving Robustness to Model Inversion Attacks via Sparse Coding Architectures | Unknown | N/A | |
| Constructing Concept-based Models to Mitigate Spurious Correlations with Minimal Human Effort | Unknown | N/A | |
| Direct Distillation between Different Domains | Unknown | N/A | |
| GRiT: A Generative Region-to-text Transformer for Object Understanding | Unknown | N/A | |
| LRSLAM: Low-rank Representation of Signed Distance Fields in Dense Visual SLAM System | Unknown | N/A | |
| Learning Representation for Multitask Learning through Self-Supervised Auxiliary Learning | Unknown | N/A | |
| Neural Poisson Solver: A Universal and Continuous Framework for Natural Signal Blending | Unknown | N/A | |
| Geometry Fidelity for Spherical Images | Unknown | N/A | |
| BAGS: Blur Agnostic Gaussian Splatting through Multi-Scale Kernel Modeling | Unknown | N/A | |
| CroMo-Mixup: Augmenting Cross-Model Representations for Continual Self-Supervised Learning | Unknown | N/A | |
| Free-Editor: Zero-shot Text-driven 3D Scene Editing | Unknown | N/A | |
| DPA-Net: Structured 3D Abstraction from Sparse Views via Differentiable Primitive Assembly | Unknown | N/A | |
| An Empirical Study and Analysis of Text-to-Image Generation Using Large Language Model-Powered Textual Representation | Unknown | N/A | |
| An Image is Worth 1/2 Tokens After Layer 2: Plug-and-Play Inference Acceleration for Large Vision-Language Models | Unknown | N/A | |
| Bridging Different Language Models and Generative Vision Models for Text-to-Image Generation | Unknown | N/A | |
| Generalizable Symbolic Optimizer Learning | Unknown | N/A | |
| Tackling Structural Hallucination in Image Translation with Local Diffusion | Unknown | N/A | |
| Unified Medical Image Pre-training in Language-Guided Common Semantic Space | Unknown | N/A | |
| On the Vulnerability of Skip Connections to Model Inversion Attacks | Unknown | N/A | |
| Comprehensive Attribution: Inherently Explainable Vision Model with Feature Detector | Unknown | N/A | |
| Reinforcement Learning via Auxiliary Task Distillation | Unknown | N/A | |
| DHR: Dual Features-Driven Hierarchical Rebalancing in Inter- and Intra-Class Regions for Weakly-Supervised Semantic Segmentation | Unknown | N/A | |
| View-Consistent Hierarchical 3D Segmentation Using Ultrametric Feature Fields | Unknown | N/A | |
| Plug and Play: A Representation Enhanced Domain Adapter for Collaborative Perception | Unknown | N/A | |
| Follow the Rules: Reasoning for Video Anomaly Detection with Large Language Models | Unknown | N/A | |
| STAMP: Outlier-Aware Test-Time Adaptation with Stable Memory Replay | Unknown | N/A | |
| Fairness-aware Vision Transformer via Debiased Self-Attention | Unknown | N/A | |
| Remove Projective LiDAR Depthmap Artifacts via Exploiting Epipolar Geometry | Unknown | N/A | |
| Exploring Conditional Multi-Modal Prompts for Zero-shot HOI Detection | Unknown | N/A | |
| Training-free Video Temporal Grounding using Large-scale Pre-trained Models | Unknown | N/A | |
| Efficient Learning of Event-based Dense Representation using Hierarchical Memories with Adaptive Update | Unknown | N/A | |
| SNP: Structured Neuron-level Pruning to Preserve Attention Scores | Unknown | N/A | |
| PALM: Predicting Actions through Language Models | Unknown | N/A | |
| Motion Keyframe Interpolation for Any Human Skeleton using Point Cloud-based Human Motion Data Homogenisation | Unknown | N/A | |
| Learning to Localize Actions in Instructional Videos with LLM-Based Multi-Pathway Text-Video Alignment | Unknown | N/A | |
| SwiftBrush v2: Make Your One-step Diffusion Model Better Than Its Teacher | Unknown | N/A | |
| Improving Hyperbolic Representations via Gromov-Wasserstein Regularization | Unknown | N/A | |
| VSViG: Real-time Video-based Seizure Detection via Skeleton-based Spatiotemporal ViG | Unknown | N/A | |
| DiffSurf: A Transformer-based Diffusion Model for Generating and Reconstructing 3D Surfaces in Pose | Unknown | N/A | |
| Exploiting Supervised Poison Vulnerability to Strengthen Self-Supervised Defense | Unknown | N/A | |
| Dense Hand-Object(HO) GraspNet with Full Grasping Taxonomy and Dynamics | Unknown | N/A | |
| PhysGen: Rigid-Body Physics-Grounded Image-to-Video Generation | Unknown | N/A | |
| Depth-Aware Blind Image Decomposition for Real-World Adverse Weather Recovery | Unknown | N/A | |
| DreamSampler: Unifying Diffusion Sampling and Score Distillation for Image Manipulation | Unknown | N/A | |
| Reshaping the Online Data Buffering and Organizing Mechanism for Continual Test-Time Adaptation | Unknown | N/A | |
| PosterLlama: Bridging Design Ability of Language Model to Content-Aware Layout Generation | Unknown | N/A | |
| PreciseControl: Enhancing Text-To-Image Diffusion Models with Fine-Grained Attribute Control | Unknown | N/A | |
| LG-Gaze: Learning Geometry-aware Continuous Prompts for Language-Guided Gaze Estimation | Unknown | N/A | |
| Efficient Training with Denoised Neural Weights | Unknown | N/A | |
| Integration of Global and Local Representations for Fine-grained Cross-modal Alignment | Unknown | N/A | |
| Local and Global Flatness for Federated Domain Generalization | Unknown | N/A | |
| SRPose: Two-view Relative Pose Estimation with Sparse Keypoints | Unknown | N/A | |
| Deep Reward Supervisions for Tuning Text-to-Image Diffusion Models | Unknown | N/A | |
| Paying More Attention to Images: A Training-Free Method for Alleviating Hallucination in LVLMs | Unknown | N/A | |
| Few-Shot Anomaly-Driven Generation for Anomaly Classification and Segmentation | Unknown | N/A | |
| Boost Your NeRF: A Model-Agnostic Mixture of Experts Framework for High Quality and Efficient Rendering | Unknown | N/A | |
| EMO: Emote Portrait Alive - Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions | Unknown | N/A | |
| LMT-GP: Combined Latent Mean-Teacher and Gaussian Process for Semi-supervised Low-light Image Enhancement | Unknown | N/A | |
| Efficient Vision Transformers with Partial Attention | Unknown | N/A | |
| Generalized Coverage for More Robust Low-Budget Active Learning | Unknown | N/A | |
| Rasterized Edge Gradients: Handling Discontinuities Differentially | Unknown | N/A | |
| Kinetic Typography Diffusion Model | Unknown | N/A | |
| Enhancing Cross-Subject fMRI-to-Video Decoding with Global-Local Functional Alignment | Unknown | N/A | |
| ZeroI2V: Zero-Cost Adaptation of Pre-Trained Transformers from Image to Video | Unknown | N/A | |
| Zero-Shot Adaptation for Approximate Posterior Sampling of Diffusion Models in Inverse Problems | Unknown | N/A | |
| R.A.C.E.: Robust Adversarial Concept Erasure for Secure Text-to-Image Diffusion Model | Unknown | N/A | |
| OpenSight: A Simple Open-Vocabulary Framework for LiDAR-Based Object Detection | Unknown | N/A | |
| Few-Shot Image Generation by Conditional Relaxing Diffusion Inversion | Unknown | N/A | |
| Data Poisoning Quantization Backdoor Attack | Unknown | N/A | |
| T-CorresNet: Template Guided 3D Point Cloud Completion with Correspondence Pooling Query Generation Strategy | Unknown | N/A | |
| DailyDVS-200: A Comprehensive Benchmark Dataset for Event-Based Action Recognition | Unknown | N/A | |
| A high-quality robust diffusion framework for corrupted dataset | Unknown | N/A | |
| Efficient 3D-Aware Facial Image Editing via Attribute-Specific Prompt Learning | Unknown | N/A | |
| Distilling Knowledge from Large-Scale Image Models for Object Detection | Unknown | N/A | |
| Embracing Events and Frames with Hierarchical Feature Refinement Network for Object Detection | Unknown | N/A | |
| TimeLens-XL: Real-time Event-based Video Frame Interpolation with Large Motion | Unknown | N/A | |
| Scene-Graph ViT: End-to-End Open-Vocabulary Visual Relationship Detection | Unknown | N/A | |
| Enriching Information and Preserving Semantic Consistency in Expanding Curvilinear Object Segmentation Datasets | Unknown | N/A | |
| Unsupervised Representation Learning by Balanced Self Attention Matching | Unknown | N/A | |
| Identity-Consistent Diffusion Network for Grading Knee Osteoarthritis Progression in Radiographic Imaging | Unknown | N/A | |
| Enhancing Source-Free Domain Adaptive Object Detection with Low-confidence Pseudo Label Distillation | Unknown | N/A | |
| Fast Training of Diffusion Transformer with Extreme Masking for 3D Point Clouds Generation | Unknown | N/A | |
| Make-Your-3D: Fast and Consistent Subject-Driven 3D Content Generation | Unknown | N/A | |
| SCOD: From Heuristics to Theory | Unknown | N/A | |
| Segment, Lift and Fit: Automatic 3D Shape Labeling from 2D Prompts | Unknown | N/A | |
| Teach CLIP to Develop a Number Sense for Ordinal Regression | Unknown | N/A | |
| Improving Zero-shot Generalization of Learned Prompts via Unsupervised Knowledge Distillation | Unknown | N/A | |
| Compact 3D Scene Representation via Self-Organizing Gaussian Grids | Unknown | N/A | |
| VETRA: A Dataset for Vehicle Tracking in Aerial Imagery - New Challenges for Multi-Object Tracking | Unknown | N/A | |
| SelfGeo: Self-supervised and Geodesic-consistent Estimation of Keypoints on Deformable Shapes | Unknown | N/A | |
| Beyond Prompt Learning: Continual Adapter for Efficient Rehearsal-Free Continual Learning | Unknown | N/A | |
| T2IShield: Defending Against Backdoors on Text-to-Image Diffusion Models | Unknown | N/A | |
| Towards Certifiably Robust Face Recognition | Unknown | N/A | |
| Linking in Style: Understanding learned features in deep learning models | Unknown | N/A | |
| Stable Video Portraits | Unknown | N/A | |
| CliffPhys: Camera-based Respiratory Measurement using Clifford Neural Networks | Unknown | N/A | |
| Learned Rate Control for Frame-Level Adaptive Neural Video Compression via Dynamic Neural Network | Unknown | N/A | |
| PDiscoFormer: Relaxing Part Discovery Constraints with Vision Transformers | Unknown | N/A | |
| Instant Uncertainty Calibration of NeRFs Using a Meta-Calibrator | Unknown | N/A | |
| SHIC: Shape-Image Correspondences with no Keypoint Supervision | Unknown | N/A | |
| Vision-Language Dual-Pattern Matching for Out-of-Distribution Detection | Unknown | N/A | |
| Weight Conditioning for Smooth Optimization of Neural Networks | Unknown | N/A | |
| Energy-Calibrated VAE with Test Time Free Lunch | Unknown | N/A | |
| SceneTeller: Language-to-3D Scene Generation | Unknown | N/A | |
| MagMax: Leveraging Model Merging for Seamless Continual Learning | Unknown | N/A | |
| Physics-Free Spectrally Multiplexed Photometric Stereo under Unknown Spectral Composition | Unknown | N/A | |
| InternVideo2: Scaling Foundation Models for Multimodal Video Understanding | Unknown | N/A | |
| Debiasing surgeon: fantastic weights and how to find them | Unknown | N/A | |
| Denoising Vision Transformers | Unknown | N/A | |
| Differentiable Product Quantization for Memory Efficient Camera Relocalization | Unknown | N/A | |
| Spline-based Transformers | Unknown | N/A | |
| Learning Pseudo 3D Guidance for View-consistent Texturing with 2D Diffusion | Unknown | N/A | |
| SparseRadNet: Sparse Perception Neural Network on Subsampled Radar Data | Unknown | N/A | |
| TreeSBA: Tree-Transformer for Self-Supervised Sequential Brick Assembly | Unknown | N/A | |
| Efficient NeRF Optimization - Not All Samples Remain Equally Hard | Unknown | N/A | |
| Enhancing Semantic Fidelity in Text-to-Image Synthesis: Attention Regulation in Diffusion Models | Unknown | N/A | |
| Catastrophic Overfitting: A Potential Blessing in Disguise | Unknown | N/A | |
| Adversarial Diffusion Distillation | Unknown | N/A | |
| Fake It till You Make It: Curricular Dynamic Forgery Augmentations towards General Deepfake Detection | Unknown | N/A | |
| Explain via Any Concept: Concept Bottleneck Model with Open Vocabulary Concepts | Unknown | N/A | |
| Explore the Potential of CLIP for Training-Free Open Vocabulary Semantic Segmentation | Unknown | N/A | |
| A Multimodal Benchmark Dataset and Model for Crop Disease Diagnosis | Unknown | N/A | |
| Missing Modality Prediction for Unpaired Multimodal Learning via Joint Embedding of Unimodal Models | Unknown | N/A | |
| Learning Where to Look: Self-supervised Viewpoint Selection for Active Localization using Geometrical Information | Unknown | N/A | |
| Text-Conditioned Resampler For Long Form Video Understanding | Unknown | N/A | |
| Using My Artistic Style? You Must Obtain My Authorization | Unknown | N/A | |
| Fast Diffusion-Based Counterfactuals for Shortcut Removal and Generation | Unknown | N/A | |
| UMERegRobust – Universal Manifold Embedding Compatible Features for Robust Point Cloud Registration | Unknown | N/A | |
| Non-transferable Pruning | Unknown | N/A | |
| A Compact Dynamic 3D Gaussian Representation for Real-Time Dynamic View Synthesis | Unknown | N/A | |
| Toward Open Vocabulary Aerial Object Detection with CLIP-Activated Student-Teacher Learning | Unknown | N/A | |
| Affine steerers for structured keypoint description | Unknown | N/A | |
| FipTR: A Simple yet Effective Transformer Framework for Future Instance Prediction in Autonomous Driving | Unknown | N/A | |
| GroCo: Ground Constraint for Metric Self-Supervised Monocular Depth | Unknown | N/A | |
| EMIE-MAP: Large-Scale Road Surface Reconstruction Based on Explicit Mesh and Implicit Encoding | Unknown | N/A | |
| UniIR: Training and Benchmarking Universal Multimodal Information Retrievers | Unknown | N/A | |
| Bones Can't Be Triangles: Accurate and Efficient Vertebrae Keypoint Estimation through Collaborative Error Revision | Unknown | N/A | |
| latentSplat: Autoencoding Variational Gaussians for Fast Generalizable 3D Reconstruction | Unknown | N/A | |
| HyperSpaceX: Radial and Angular Exploration of HyperSpherical Dimensions | Unknown | N/A | |
| HandDAGT: A Denoising Adaptive Graph Transformer for 3D Hand Pose Estimation | Unknown | N/A | |
| InstructGIE: Towards Generalizable Image Editing | Unknown | N/A | |
| Correspondence-Free SE(3) Point Cloud Registration in RKHS via Unsupervised Equivariant Learning | Unknown | N/A | |
| CTRLorALTer: Conditional LoRAdapter for Efficient 0-Shot Control & Altering of T2I Models | Unknown | N/A | |
| Nickel and Diming Your GAN: A Dual-Method Approach to Enhancing GAN Efficiency via Knowledge Distillation | Unknown | N/A | |
| Towards Scene Graph Anticipation | Unknown | N/A | |
| Distributed Semantic Segmentation with Efficient Joint Source and Task Decoding | Unknown | N/A | |
| NePhi: Neural Deformation Fields for Approximately Diffeomorphic Medical Image Registration | Unknown | N/A | |
| Introducing Routing Functions to Vision-Language Parameter-Efficient Fine-Tuning with Low-Rank Bottlenecks | Unknown | N/A | |
| Concept Arithmetics for Circumventing Concept Inhibition in Diffusion Models | Unknown | N/A | |
| DeTra: A Unified Model for Object Detection and Trajectory Forecasting | Unknown | N/A | |
| ControlNet-XS: Rethinking the Control of Text-to-Image Diffusion Models as Feedback-Control Systems | Unknown | N/A | |
| Adaptive Bounding Box Uncertainties via Two-Step Conformal Prediction | Unknown | N/A | |
| Common Sense Reasoning for Deep Fake Detection | Unknown | N/A | |
| GOEmbed: Gradient Origin Embeddings for Representation Agnostic 3D Feature Learning | Unknown | N/A | |
| Tight and Efficient Upper Bound on Spectral Norm of Convolutional Layers | Unknown | N/A | |
| Deciphering the Role of Representation Disentanglement: Investigating Compositional Generalization in CLIP Models | Unknown | N/A | |
| FroSSL: Frobenius Norm Minimization for Efficient Multiview Self-Supervised Learning | Unknown | N/A | |
| Learning Multimodal Latent Generative Models with Energy-Based Prior | Unknown | N/A | |
| Hierarchical Conditioning of Diffusion Models Using Tree-of-Life for Studying Species Evolution | Unknown | N/A | |
| Markov Knowledge Distillation: Make Nasty Teachers trained by Self-undermining Knowledge Distillation Fully Distillable | Unknown | N/A | |
| CARFF: Conditional Auto-encoded Radiance Field for 3D Scene Forecasting | Unknown | N/A | |
| Snuffy: Efficient Whole Slide Image Classifier | Unknown | N/A | |
| Learning to Build by Building Your Own Instructions | Unknown | N/A | |
| Exploring Active Learning in Meta-Learning: Enhancing Context Set Labeling | Unknown | N/A | |
| BlenderAlchemy: Editing 3D Graphics with Vision-Language Models | Unknown | N/A | |
| CoTracker: It is Better to Track Together | Unknown | N/A | |
| Mesh2NeRF: Direct Mesh Supervision for Neural Radiance Field Representation and Generation | Unknown | N/A | |
| Unleashing Text-to-Image Diffusion Prior for Zero-Shot Image Captioning | Unknown | N/A | |
| Improving Text-guided Object Inpainting with Semantic Pre-inpainting | Unknown | N/A | |
| SpaRP: Fast 3D Object Reconstruction and Pose Estimation from Sparse Views | Unknown | N/A | |
| SLEDGE: Synthesizing Driving Environments with Generative Models and Rule-Based Traffic | Unknown | N/A | |
| LISO: Lidar-only Self-Supervised 3D Object Detection | Unknown | N/A | |
| Frontier-enhanced Topological Memory with Improved Exploration Awareness for Embodied Visual Navigation | Unknown | N/A | |
| Think2Drive: Efficient Reinforcement Learning by Thinking with Latent World Model for Autonomous Driving (in CARLA-v2) | Unknown | N/A | |
| LookupViT: Compressing visual information to a limited number of tokens | Unknown | N/A | |
| Pixel-Aware Stable Diffusion for Realistic Image Super-Resolution and Personalized Stylization | Unknown | N/A | |
| REDIR: Refocus-free Event-based De-occlusion Image Reconstruction | Unknown | N/A | |
| Towards compact reversible image representations for neural style transfer | Unknown | N/A | |
| InsMapper: Exploring Inner-instance Information for Vectorized HD Mapping | Unknown | N/A | |
| Exploring Vulnerabilities in Spiking Neural Networks: Direct Adversarial Attacks on Raw Event Data | Unknown | N/A | |
| MRSP: Learn Multi-Representations of Single Primitive for Compositional Zero-Shot Learning | Unknown | N/A | |
| GRIDS: Grouped Multiple-Degradation Restoration with Image Degradation Similarity | Unknown | N/A | |
| KDProR: A Knowledge-Decoupling Probabilistic Framework for Video-Text Retrieval | Unknown | N/A | |
| The First to Know: How Token Distributions Reveal Hidden Knowledge in Large Vision-Language Models? | Unknown | N/A | |
| VLAD-BuFF: Burst-aware Fast Feature Aggregation for Visual Place Recognition | Unknown | N/A | |
| Mask as Supervision: Leveraging Unified Mask Information for Unsupervised 3D Pose Estimation | Unknown | N/A | |
| Better Regression Makes Better Test-time Adaptive 3D Object Detection | Unknown | N/A | |
| Temporally Consistent Stereo Matching | Unknown | N/A | |
| ViGoR: Improving Visual Grounding of Large Vision Language Models with Fine-Grained Reward Modeling | Unknown | N/A | |
| Learning Scalable Model Soup on a Single GPU: An Efficient Subspace Training Strategy | Unknown | N/A | |
| ScatterFormer: Efficient Voxel Transformer with Scattered Linear Attention | Unknown | N/A | |
| Asynchronous Large Language Model Enhanced Planner for Autonomous Driving | Unknown | N/A | |
| Benchmarking Spurious Bias in Few-Shot Image Classifiers | Unknown | N/A | |
| Deep Companion Learning: Enhancing Generalization Through Historical Consistency | Unknown | N/A | |
| WSI-VQA: Interpreting Whole Slide Images by Generative Visual Question Answering | Unknown | N/A | |
| Straightforward Layer-wise Pruning for More Efficient Visual Adaptation | Unknown | N/A | |
| ABC Easy as 123: A Blind Counter for Exemplar-Free Multi-Class Class-agnostic Counting | Unknown | N/A | |
| CrossScore: A Multi-View Approach to Image Evaluation and Scoring | Unknown | N/A | |
| CPM: Class-conditional Prompting Machine for Audio-visual Segmentation | Unknown | N/A | |
| DiffClass: Diffusion-Based Class Incremental Learning | Unknown | N/A | |
| Dual-Decoupling Learning and Metric-Adaptive Thresholding for Semi-Supervised Multi-Label Learning | Unknown | N/A | |
| PromptFusion: Decoupling Stability and Plasticity for Continual Learning | Unknown | N/A | |
| SEGIC: Unleashing the Emergent Correspondence for In-Context Segmentation | Unknown | N/A | |
| Infinite-ID: Identity-preserved Personalization via ID-semantics Decoupling Paradigm | Unknown | N/A | |
| PFGS: High Fidelity Point Cloud Rendering via Feature Splatting | Unknown | N/A | |
| DGE: Direct Gaussian 3D Editing by Consistent Multi-view Editing | Unknown | N/A | |
| Handling The Non-Smooth Challenge in Tensor SVD: A Multi-Objective Tensor Recovery Framework | Unknown | N/A | |
| DA-BEV: Unsupervised Domain Adaptation for Bird's Eye View Perception | Unknown | N/A | |
| PILoRA: Prototype Guided Incremental LoRA for Federated Class-Incremental Learning | Unknown | N/A | |
| Exploring the Feature Extraction and Relation Modeling For Light-Weight Transformer Tracking | Unknown | N/A | |
| Data Augmentation via Latent Diffusion for Saliency Prediction | Unknown | N/A | |
| PiTe: Pixel-Temporal Alignment for Large Video-Language Model | Unknown | N/A | |
| 3D Gaussian Parametric Head Model | Unknown | N/A | |
| Dynamic Neural Radiance Field From Defocused Monocular Video | Unknown | N/A | |
| Retargeting Visual Data with Deformation Fields | Unknown | N/A | |
| Ray-Distance Volume Rendering for Neural Scene Reconstruction | Unknown | N/A | |
| 4Diff: 3D-Aware Diffusion Model for Third-to-First Viewpoint Translation | Unknown | N/A | |
| Spike-Temporal Latent Representation for Energy-Efficient Event-to-Video Reconstruction | Unknown | N/A | |
| Sur^2f: A Hybrid Representation for High-Quality and Efficient Surface Reconstruction from Multi-view Images | Unknown | N/A | |
| ProDepth: Boosting Self-Supervised Multi-Frame Monocular Depth with Probabilistic Fusion | Unknown | N/A | |
| Realistic Human Motion Generation with Cross-Diffusion Models | Unknown | N/A | |
| SAM4MLLM: Enhance Multi-Modal Large Language Model for Referring Expression Segmentation | Unknown | N/A | |
| Continuous Memory Representation for Anomaly Detection | Unknown | N/A | |
| UniTalker: Scaling up Audio-Driven 3D Facial Animation through A Unified Model | Unknown | N/A | |
| Diffusion Reward: Learning Rewards via Conditional Video Diffusion | Unknown | N/A | |
| Efficient Depth-Guided Urban View Synthesis | Unknown | N/A | |
| OneRestore: A Universal Restoration Framework for Composite Degradation | Unknown | N/A | |
| Accelerating Online Mapping and Behavior Prediction via Direct BEV Feature Attention | Unknown | N/A | |
| Beyond MOT: Semantic Multi-Object Tracking | Unknown | N/A | |
| PartCraft: Crafting Creative Objects by Parts | Unknown | N/A | |
| WordRobe: Text-Guided Generation of Textured 3D Garments | Unknown | N/A | |
| ZeST: Zero-Shot Material Transfer from a Single Image | Unknown | N/A | |
| AnyControl: Create Your Artwork with Versatile Control on Text-to-Image Generation | Unknown | N/A | |
| UDA-Bench: Revisiting Common Assumptions in Unsupervised Domain Adaptation Using a Standardized Framework | Unknown | N/A | |
| Online Continuous Generalized Category Discovery | Unknown | N/A | |
| AddMe: Zero-shot Group-photo Synthesis by Inserting People into Scenes | Unknown | N/A | |
| Ponymation: Learning Articulated 3D Animal Motions from Unlabeled Online Videos | Unknown | N/A | |
| Challenging Forgets: Unveiling the Worst-Case Forget Sets in Machine Unlearning | Unknown | N/A | |
| KFD-NeRF: Rethinking Dynamic NeRF with Kalman Filter | Unknown | N/A | |
| MERLiN: Single-Shot Material Estimation and Relighting for Photometric Stereo | Unknown | N/A | |
| MC-PanDA: Mask Confidence for Panoptic Domain Adaptation | Unknown | N/A | |
| GaussianImage: 1000 FPS Image Representation and Compression by 2D Gaussian Splatting | Unknown | N/A | |
| Data Overfitting for On-Device Super-Resolution with Dynamic Algorithm and Compiler Co-Design | Unknown | N/A | |
| BI-MDRG: Bridging Image History in Multimodal Dialogue Response Generation | Unknown | N/A | |
| PRET: Planning with Directed Fidelity Trajectory for Vision and Language Navigation | Unknown | N/A | |
| Rethinking Few-shot Class-incremental Learning: Learning from Yourself | Unknown | N/A | |
| VISAGE: Video Instance Segmentation with Appearance-Guided Enhancement | Unknown | N/A | |
| STSP: Spatial-Temporal Subspace Projection for Video Class-incremental Learning | Unknown | N/A | |
| Teaching Tailored to Talent: Adverse Weather Restoration via Prompt Pool and Depth-Anything Constraint | Unknown | N/A | |
| AlignZeg: Mitigating Objective Misalignment for Zero-shot Semantic Segmentation | Unknown | N/A | |
| UMG-CLIP: A Unified Multi-Granularity Vision Generalist for Open-World Understanding | Unknown | N/A | |
| Long-CLIP: Unlocking the Long-Text Capability of CLIP | Unknown | N/A | |
| RoGUENeRF: A Robust Geometry-Consistent Universal Enhancer for NeRF | Unknown | N/A | |
| Diffusion Model for Robust Multi-Sensor Fusion in 3D Object Detection and BEV Segmentation | Unknown | N/A | |
| FuseTeacher: Modality-fused Encoders are Strong Vision Supervisors | Unknown | N/A | |
| MVDD: Multi-View Depth Diffusion Models | Unknown | N/A | |
| Dataset Quantization with Active Learning based Adaptive Sampling | Unknown | N/A | |
| Interpretability-Guided Test-Time Adversarial Defense | Unknown | N/A | |
| Self-Supervised Representation Learning for Adversarial Attack Detection | Unknown | N/A | |
| GroundUp: Rapid Sketch-Based 3D City Massing | Unknown | N/A | |
| Photon Inhibition for Energy-Efficient Single-Photon Imaging | Unknown | N/A | |
| CLOSER: Towards Better Representation Learning for Few-Shot Class-Incremental Learning | Unknown | N/A | |
| Learning with Counterfactual Explanations for Radiology Report Generation | Unknown | N/A | |
| Pseudo-Embedding for Generalized Few-Shot Point Cloud Segmentation | Unknown | N/A | |
| Wavelet Convolutions for Large Receptive Fields | Unknown | N/A | |
| AdaLog: Post-Training Quantization for Vision Transformers with Adaptive Logarithm Quantizer | Unknown | N/A | |
| Gradient-based Out-of-Distribution Detection | Unknown | N/A | |
| Veil Privacy on Visual Data: Concealing Privacy for Humans, Unveiling for DNNs | Unknown | N/A | |
| Non-Exemplar Domain Incremental Learning via Cross-Domain Concept Integration | Unknown | N/A | |
| Data-to-Model Distillation: Data-Efficient Learning Framework | Unknown | N/A | |
| Simple Unsupervised Knowledge Distillation With Space Similarity | Unknown | N/A | |
| 3D Weakly Supervised Semantic Segmentation with 2D Vision-Language Guidance | Unknown | N/A | |
| DSMix: Distortion-Induced Saliency Map Based Pre-training for No-Reference Image Quality Assessment | Unknown | N/A | |
| Learning Natural Consistency Representation for Face Forgery Video Detection | Unknown | N/A | |
| DragVideo: Interactive Drag-style Video Editing | Unknown | N/A | |
| Brain-ID: Learning Contrast-agnostic Anatomical Representations for Brain Imaging | Unknown | N/A | |
| One-Shot Diffusion Mimicker for Handwritten Text Generation | Unknown | N/A | |
| Diffusion-Driven Data Replay: A Novel Approach to Combat Forgetting in Federated Class Continual Learning | Unknown | N/A | |
| Multi-Person Pose Forecasting with Individual Interaction Perceptron and Prior Learning | Unknown | N/A | |
| FunQA: Towards Surprising Video Comprehension | Unknown | N/A | |
| Cross-view image geo-localization with Panorama-BEV Co-Retrieval Network | Unknown | N/A | |
| UpFusion: Novel View Diffusion from Unposed Sparse View Observations | Unknown | N/A | |
| EDformer: Transformer-Based Event Denoising Across Varied Noise Levels | Unknown | N/A | |
| UniVoxel: Fast Inverse Rendering by Unified Voxelization of Scene Representation | Unknown | N/A | |
| View-Consistent 3D Editing with Gaussian Splatting | Unknown | N/A | |
| Few-shot NeRF by Adaptive Rendering Loss Regularization | Unknown | N/A | |
| HO-Gaussian: Hybrid Optimization of 3D Gaussian Splatting for Urban Scenes | Unknown | N/A | |
| FAMOUS: High-Fidelity Monocular 3D Human Digitization Using View Synthesis | Unknown | N/A | |
| Generating Human Interaction Motions in Scenes with Text Control | Unknown | N/A | |
| Optimizing Illuminant Estimation in Dual-Exposure HDR Imaging | Unknown | N/A | |
| MeshSegmenter: Zero-Shot Mesh Segmentation via Texture Synthesis | Unknown | N/A | |
| VCD-Texture: Variance Alignment based 3D-2D Co-Denoising for Text-Guided Texturing | Unknown | N/A | |
| MapTracker: Tracking with Strided Memory Fusion for Consistent Vector HD Mapping | Unknown | N/A | |
| FSGS: Real-Time Few-shot View Synthesis using Gaussian Splatting | Unknown | N/A | |
| VersatileGaussian: Real-time Neural Rendering for Versatile Tasks using Gaussian Splatting | Unknown | N/A | |
| Instruction Tuning-free Visual Token Complement for Multimodal LLMs | Unknown | N/A | |
| Improving Point-based Crowd Counting and Localization Based on Auxiliary Point Guidance | Unknown | N/A | |
| Pyramid Diffusion for Fine 3D Large Scene Generation | Unknown | N/A | |
| Chat-Edit-3D: Interactive 3D Scene Editing via Text Prompts | Unknown | N/A | |
| MotionChain: Conversational Motion Controllers via Multimodal Prompts | Unknown | N/A | |
| Synthesizing Environment-Specific People in Photographs | Unknown | N/A | |
| Open-World Dynamic Prompt and Continual Visual Representation Learning | Unknown | N/A | |
| Masked Motion Prediction with Semantic Contrast for Point Cloud Sequence Learning | Unknown | N/A | |
| Customized Generation Reimagined: Fidelity and Editability Harmonized | Unknown | N/A | |
| HybridBooth: Hybrid Prompt Inversion for Efficient Subject-Driven Generation | Unknown | N/A | |
| Text2LiDAR: Text-guided LiDAR Point Clouds Generation via Equirectangular Transformer | Unknown | N/A | |
| Co-speech Gesture Video Generation with 3D Human Meshes | Unknown | N/A | |
| SC4D: Sparse-Controlled Video-to-4D Generation and Motion Transfer | Unknown | N/A | |
| NeRF-MAE: Masked AutoEncoders for Self-Supervised 3D Representation Learning for Neural Radiance Fields | Unknown | N/A | |
| DiffusionPen: Towards Controlling the Style of Handwritten Text Generation | Unknown | N/A | |
| From Pixels to Objects: A Hierarchical Approach for Part and Object Segmentation Using Local and Global Aggregation | Unknown | N/A | |
| PoseEmbroider: Towards a 3D, Visual, Semantic-aware Human Pose Representation | Unknown | N/A | |
| MonoTTA: Fully Test-Time Adaptation for Monocular 3D Object Detection | Unknown | N/A | |
| Revisit Self-supervision with Local Structure-from-Motion | Unknown | N/A | |
| On the Viability of Monocular Depth Pre-training for Semantic Segmentation | Unknown | N/A | |
| Weakly-supervised Camera Localization by Ground-to-satellite Image Registration | Unknown | N/A | |
| NeuSDFusion: A Spatial-Aware Generative Model for 3D Shape Completion, Reconstruction, and Generation | Unknown | N/A | |
| Latent Guard: a Safety Framework for Text-to-image Generation | Unknown | N/A | |
| TextDiffuser-2: Unleashing the Power of Language Models for Text Rendering | Unknown | N/A | |
| GraphBEV: Towards Robust BEV Feature Alignment for Multi-Modal 3D Object Detection | Unknown | N/A | |
| ViewFormer: Exploring Spatiotemporal Modeling for Multi-View 3D Occupancy Perception via View-Guided Transformers | Unknown | N/A | |
| ProtoComp: Diverse Point Cloud Completion with Controllable Prototype | Unknown | N/A | |
| FAFA: Frequency-Aware Flow-Aided Self-Supervision for Underwater Object Pose Estimation | Unknown | N/A | |
| Physical-Based Event Camera Simulator | Unknown | N/A | |
| Topo4D: Topology-Preserving Gaussian Splatting for High-Fidelity 4D Head Capture | Unknown | N/A | |
| EDTalk: Efficient Disentanglement for Emotional Talking Head Synthesis | Unknown | N/A | |
| Implicit Steganography Beyond the Constraints of Modality | Unknown | N/A | |
| Put Myself in Your Shoes: Lifting the Egocentric Perspective from Exocentric Videos | Unknown | N/A | |
| Volumetric Rendering with Baked Quadrature Fields | Unknown | N/A | |
| Flying with Photons: Rendering Novel Views of Propagating Light | Unknown | N/A | |
| LivePhoto: Real Image Animation with Text-guided Motion Control | Unknown | N/A | |
| Wear-Any-Way: Manipulable Virtual Try-on via Sparse Correspondence Alignment | Unknown | N/A | |
| High-Fidelity and Transferable NeRF Editing by Frequency Decomposition | Unknown | N/A | |
| Implicit Style-Content Separation using B-LoRA | Unknown | N/A | |
| Inf-DiT: Upsampling any-resolution image with memory-efficient diffusion transformer | Unknown | N/A | |
| CogView3: Finer and Faster Text-to-Image Generation via Relay Diffusion | Unknown | N/A | |
| Deep Diffusion Image Prior for Efficient OOD Adaptation in 3D Inverse Problems | Unknown | N/A | |
| OAPT: Offset-Aware Partition Transformer for Double JPEG Artifacts Removal | Unknown | N/A | |
| Seeing the Unseen: A Frequency Prompt Guided Transformer for Image Restoration | Unknown | N/A | |
| Understanding Multi-compositional learning in Vision and Language models via Category Theory | Unknown | N/A | |
| Animate Your Motion: Turning Still Images into Dynamic Videos | Unknown | N/A | |
| Spatial-Temporal Multi-level Association for Video Object Segmentation | Unknown | N/A | |
| Point-supervised Panoptic Segmentation via Estimating Pseudo Labels from Learnable Distance | Unknown | N/A | |
| CSOT: Cross-Scan Object Transfer for Semi-Supervised LiDAR Object Detection | Unknown | N/A | |
| Context-Aware Action Recognition: Introducing a Comprehensive Dataset for Behavior Contrast | Unknown | N/A | |
| NVS-Adapter: Plug-and-Play Novel View Synthesis from a Single Image | Unknown | N/A | |
| Face Reconstruction Transfer Attack as Out-of-Distribution Generalization | Unknown | N/A | |
| Rethinking Image-to-Video Adaptation: An Object-centric Perspective | Unknown | N/A | |
| Texture-GS: Disentangle the Geometry and Texture for 3D Gaussian Splatting Editing | Unknown | N/A | |
| Noise Calibration: Plug-and-play Content-Preserving Video Enhancement using Pre-trained Video Diffusion Models | Unknown | N/A | |
| DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors | Unknown | N/A | |
| UniProcessor: A Text-induced Unified Low-level Image Processor | Unknown | N/A | |
| Bridging Synthetic and Real Worlds for Pre-training Scene Text Detectors | Unknown | N/A | |
| Tokenize Anything via Prompting | Unknown | N/A | |
| Visual Alignment Pre-training for Sign Language Translation | Unknown | N/A | |
| GRACE: Graph-Based Contextual Debiasing for Fair Visual Question Answering | Unknown | N/A | |
| Learning Chain of Counterfactual Thought for Bias-Robust Vision-Language Reasoning | Unknown | N/A | |
| Reflective Instruction Tuning: Mitigating Hallucinations in Large Vision-Language Models | Unknown | N/A | |
| FocusDiffuser: Perceiving Local Disparities for Camouflaged Object Detection | Unknown | N/A | |
| Efficient Unsupervised Visual Representation Learning with Explicit Cluster Balancing | Unknown | N/A | |
| Evaluating Text-to-Visual Generation with Image-to-Text Generation | Unknown | N/A | |
| Removing Distributional Discrepancies in Captions Improves Image-Text Alignment | Unknown | N/A | |
| Arc2Face: A Foundation Model for ID-Consistent Human Faces | Unknown | N/A | |
| Let the Avatar Talk using Texts without Paired Training Data | Unknown | N/A | |
| LaMI-DETR: Open-Vocabulary Detection with Language Model Instruction | Unknown | N/A | |
| Region-centric Image-Language Pretraining for Open-Vocabulary Detection | Unknown | N/A | |
| DECOLLAGE: 3D Detailization by Controllable, Localized, and Learned Geometry Enhancement | Unknown | N/A | |
| Learning Camouflaged Object Detection from Noisy Pseudo Label | Unknown | N/A | |
| PartImageNet++ Dataset: Scaling up Part-based Models for Robust Recognition | Unknown | N/A | |
| SpecFormer: Guarding Vision Transformer Robustness via Maximum Singular Value Penalization | Unknown | N/A | |
| Attention Beats Linear for Fast Implicit Neural Representation Generation | Unknown | N/A | |
| WebRPG: Automatic Web Rendering Parameters Generation for Visual Presentation | Unknown | N/A | |
| Timestep-Aware Correction for Quantized Diffusion Models | Unknown | N/A | |
| LightenDiffusion: Unsupervised Low-Light Image Enhancement with Latent-Retinex Diffusion Models | Unknown | N/A | |
| Prompt-Based Test-Time Real Image Dehazing: A Novel Pipeline | Unknown | N/A | |
| RCS-Prompt: Learning Prompt to Rearrange Class Space for Prompt-based Continual Learning | Unknown | N/A | |
| FedTSA: A Cluster-based Two-Stage Aggregation Method for Model-heterogeneous Federated Learning | Unknown | N/A | |
| Dynamic Guidance Adversarial Distillation with Enhanced Teacher Knowledge | Unknown | N/A | |
| Emerging Property of Masked Token for Effective Pre-training | Unknown | N/A | |
| OmniSSR: Zero-shot Omnidirectional Image Super-Resolution using Stable Diffusion Model | Unknown | N/A | |
| Hierarchical Separable Video Transformer for Snapshot Compressive Imaging | Unknown | N/A | |
| Gaussian Grouping: Segment and Edit Anything in 3D Scenes | Unknown | N/A | |
| 3D Hand Sequence Recovery from Real Blurry Images and Event Stream | Unknown | N/A | |
| Sapiens: Foundation for Human Vision Models | Unknown | N/A | |
| Rethinking Video Deblurring with Wavelet-Aware Dynamic Transformer and Diffusion Model | Unknown | N/A | |
| SweepNet: Unsupervised Learning Shape Abstraction via Neural Sweepers | Unknown | N/A | |
| Active Coarse-to-Fine Segmentation of Moveable Parts from Real Images | Unknown | N/A | |
| ShoeModel: Learning to Wear on the User-specified Shoes via Diffusion Model | Unknown | N/A | |
| Equi-GSPR: Equivariant SE(3) Graph Network Model for Sparse Point Cloud Registration | Unknown | N/A | |
| Segmentation-guided Layer-wise Image Vectorization with Gradient Fills | Unknown | N/A | |
| IntrinsicAnything: Learning Diffusion Priors for Inverse Rendering Under Unknown Illumination | Unknown | N/A | |
| SAM-guided Graph Cut for 3D Instance Segmentation | Unknown | N/A | |
| GRM: Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation | Unknown | N/A | |
| A Task is Worth One Word: Learning with Task Prompts for High-Quality Versatile Image Inpainting | Unknown | N/A | |
| HeadStudio: Text to Animatable Head Avatars with 3D Gaussian Splatting | Unknown | N/A | |
| VividDreamer: Invariant Score Distillation for Hyper-Realistic Text-to-3D Generation | Unknown | N/A | |
| Explicitly Guided Information Interaction Network for Cross-modal Point Cloud Completion | Unknown | N/A | |
| TLControl: Trajectory and Language Control for Human Motion Synthesis | Unknown | N/A | |
| StructLDM: Structured Latent Diffusion for 3D Human Generation | Unknown | N/A | |
| LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation | Unknown | N/A | |
| ComboVerse: Compositional 3D Assets Creation Using Spatially-Aware Diffusion Guidance | Unknown | N/A | |
| ReSyncer: Rewiring Style-based Generator for Unified Audio-Visually Synced Facial Performer | Unknown | N/A | |
| High-Fidelity Modeling of Generalizable Wrinkle Deformation | Unknown | N/A | |
| COMPOSE: Comprehensive Portrait Shadow Editing | Unknown | N/A | |
| GeoGaussian: Geometry-aware Gaussian Splatting for Scene Rendering | Unknown | N/A | |
| EchoScene: Indoor Scene Generation via Information Echo over Scene Graph Diffusion | Unknown | N/A | |
| PhysAvatar: Learning the Physics of Dressed 3D Avatars from Visual Observations | Unknown | N/A | |
| Learning Representations from Foundation Models for Domain Generalized Stereo Matching | Unknown | N/A | |
| Distractor-Free Novel View Synthesis via Exploiting Memorization Effect in Optimization | Unknown | N/A | |
| PARE-Net: Position-Aware Rotation-Equivariant Networks for Robust Point Cloud Registration | Unknown | N/A | |
| MAP-ADAPT: Real-Time Quality-Adaptive Semantic 3D Maps | Unknown | N/A | |
| NeRF-XL: NeRF at Any Scale with Multi-GPU | Unknown | N/A | |
| NOVUM: Neural Object Volumes for Robust Object Classification | Unknown | N/A | |
| De-confounded Gaze Estimation | Unknown | N/A | |
| 3D Hand Pose Estimation in Everyday Egocentric Images | Unknown | N/A | |
| Masked Video and Body-worn IMU Autoencoder for Egocentric Action Recognition | Unknown | N/A | |
| Controllable Human-Object Interaction Synthesis | Unknown | N/A | |
| Nymeria: A Massive Collection of Egocentric Multi-modal Human Motion in the Wild | Unknown | N/A | |
| Category-level Object Detection, Pose Estimation and Reconstruction from Stereo Images | Unknown | N/A | |
| DriveDreamer: Towards Real-world-driven World Models for Autonomous Driving | Unknown | N/A | |
| DiffusionDepth: Diffusion Denoising Approach for Monocular Depth Estimation | Unknown | N/A | |
| GAReT: Cross-view Video Geolocalization with Adapters and Auto-Regressive Transformers | Unknown | N/A | |
| SegVG: Transferring Object Bounding Box to Segmentation for Visual Grounding | Unknown | N/A | |
| DrivingDiffusion: Layout-Guided Multi-View Driving Scenarios Video Generation with Latent Diffusion Model | Unknown | N/A | |
| Zero-Shot Multi-Object Scene Completion | Unknown | N/A | |
| Approaching Outside: Scaling Unsupervised 3D Object Detection from 2D Scene | Unknown | N/A | |
| Localization and Expansion: A Decoupled Framework for Point Cloud Few-shot Semantic Segmentation | Unknown | N/A | |
| DVLO: Deep Visual-LiDAR Odometry with Local-to-Global Feature Fusion and Bi-Directional Structure Alignment | Unknown | N/A | |
| Personalized Video Relighting With an At-Home Light Stage | Unknown | N/A | |
| Six-Point Method for Multi-Camera Systems with Reduced Solution Space | Unknown | N/A | |
| UniINR: Event-guided Unified Rolling Shutter Correction, Deblurring, and Interpolation | Unknown | N/A | |
| Tuning-Free Image Customization with Image and Text Guidance | Unknown | N/A | |
| Stripe Observation Guided Inference Cost-free Attention Mechanism | Unknown | N/A | |
| MegaScenes: Scene-Level View Synthesis at Scale | Unknown | N/A | |
| GS-LRM: Large Reconstruction Model for 3D Gaussian Splatting | Unknown | N/A | |
| Mono-ViFI: A Unified Learning Framework for Self-supervised Single- and Multi-frame Monocular Depth Estimation | Unknown | N/A | |
| SAFNet: Selective Alignment Fusion Network for Efficient HDR Imaging | Unknown | N/A | |
| FreeDiff: Progressive Frequency Truncation for Image Editing with Diffusion Models | Unknown | N/A | |
| Non-parametric Sensor Noise Modeling and Synthesis | Unknown | N/A | |
| Learned Image Enhancement via Color Naming | Unknown | N/A | |
| Idea2Img: Iterative Self-Refinement with GPT-4V for Automatic Image Design and Generation | Unknown | N/A | |
| Preventing Catastrophic Forgetting through Memory Networks in Continuous Detection | Unknown | N/A | |
| Navigating Text-to-Image Generative Bias across Indic Languages | Unknown | N/A | |
| Learning Semantic Latent Directions for Accurate and Controllable Human Motion Prediction | Unknown | N/A | |
| Improving Diffusion Models for Authentic Virtual Try-on in the Wild | Unknown | N/A | |
| LCM-Lookahead for Encoder-based Text-to-Image Personalization | Unknown | N/A | |
| COIN-Matting: Confounder Intervention for Image Matting | Unknown | N/A | |
| GaussReg: Fast 3D Registration with Gaussian Splatting | Unknown | N/A | |
| PartGLEE: A Foundation Model for Recognizing and Parsing Any Objects | Unknown | N/A | |
| Score Distillation Sampling with Learned Manifold Corrective | Unknown | N/A | |
| WAS: Dataset and Methods for Artistic Text Segmentation | Unknown | N/A | |
| Rethinking Weakly-supervised Video Temporal Grounding From a Game Perspective | Unknown | N/A | |
| BAMM: Bidirectional Autoregressive Motion Model | Unknown | N/A | |
| AdaDiffSR: Adaptive Region-aware Dynamic acceleration Diffusion Model for Real-World Image Super-Resolution | Unknown | N/A | |
| Image-adaptive 3D Lookup Tables for Real-time Image Enhancement with Bilateral Grids | Unknown | N/A | |
| Region-Aware Sequence-to-Sequence Learning for Hyperspectral Denoising | Unknown | N/A | |
| ColorMNet: A Memory-based Deep Spatial-Temporal Feature Propagation Network for Video Colorization | Unknown | N/A | |
| Bi-TTA: Bidirectional Test-Time Adapter for Remote Physiological Measurement | Unknown | N/A | |
| Parameterization-driven Neural Surface Reconstruction for Object-oriented Editing in Neural Rendering | Unknown | N/A | |
| Idempotent Unsupervised Representation Learning for Skeleton-Based Action Recognition | Unknown | N/A | |
| Agent Attention: On the Integration of Softmax and Linear Attention | Unknown | N/A | |
| Fine-grained Dynamic Network for Generic Event Boundary Detection | Unknown | N/A | |
| Adaptive Multi-modal Fusion of Spatially Variant Kernel Refinement with Diffusion Model for Blind Image Super-Resolution | Unknown | N/A | |
| Domesticating SAM for Breast Ultrasound Image Segmentation via Spatial-frequency Fusion and Uncertainty Correction | Unknown | N/A | |
| VP-SAM: Taming Segment Anything Model for Video Polyp Segmentation via Disentanglement and Spatio-temporal Side Network | Unknown | N/A | |
| GraspXL: Generating Grasping Motions for Diverse Objects at Scale | Unknown | N/A | |
| Spatio-Temporal Proximity-Aware Dual-Path Model for Panoramic Activity Recognition | Unknown | N/A | |
| Interaction-centric Spatio-Temporal Context Reasoning for Multi-Person Video HOI Recognition | Unknown | N/A | |
| IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation | Unknown | N/A | |
| Divide and Fuse: Body Part Mesh Recovery from Partially Visible Human Images | Unknown | N/A | |
| LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models | Unknown | N/A | |
| Audio-visual Generalized Zero-shot Learning the Easy Way | Unknown | N/A | |
| Pre-trained Visual Dynamics Representations for Efficient Policy Learning | Unknown | N/A | |
| Reinforcement Learning Friendly Vision-Language Model for Minecraft | Unknown | N/A | |
| GRAPE: Generalizable and Robust Multi-view Facial Capture | Unknown | N/A | |
| R^2-Bench: Benchmarking the Robustness of Referring Perception Models under Perturbations | Unknown | N/A | |
| Agent3D-Zero: An Agent for Zero-shot 3D Understanding | Unknown | N/A | |
| Multiscale Sliced Wasserstein Distances as Perceptual Color Difference Measures | Unknown | N/A | |
| SPIN: Hierarchical Segmentation with Subpart Granularity in Natural Images | Unknown | N/A | |
| SQ-LLaVA: Self-Questioning for Large Vision-Language Assistant | Unknown | N/A | |
| BEAF: Observing BEfore-AFter Changes to Evaluate Hallucination in Vision-language Models | Unknown | N/A | |
| Structured-NeRF: Hierarchical Scene Graph with Neural Representation | Unknown | N/A | |
| MetaWeather: Few-Shot Weather-Degraded Image Restoration | Unknown | N/A | |
| Street Gaussians: Modeling Dynamic Urban Scenes with Gaussian Splatting | Unknown | N/A | |
| TOD3Cap: Towards 3D Dense Captioning in Outdoor Scenes | Unknown | N/A | |
| APL: Anchor-based Prompt Learning for One-stage Weakly Supervised Referring Expression Comprehension | Unknown | N/A | |
| ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference | Unknown | N/A | |
| DeCo: Decoupled Human-Centered Diffusion Video Editing with Motion Consistency | Unknown | N/A | |
| MeshFeat: Multi-Resolution Features for Neural Fields on Meshes | Unknown | N/A | |
| TTD: Text-Tag Self-Distillation Enhancing Image-Text Alignment in CLIP to Alleviate Single Tag Bias | Unknown | N/A | |
| DragAPart: Learning a Part-Level Motion Prior for Articulated Objects | Unknown | N/A | |
| Surface Reconstruction for 3D Gaussian Splatting via Local Structural Hints | Unknown | N/A | |
| PCF-Lift: Panoptic Lifting by Probabilistic Contrastive Fusion | Unknown | N/A | |
| Learning to Unlearn for Robust Machine Unlearning | Unknown | N/A | |
| Diff-Tracker: Text-to-Image Diffusion Models are Unsupervised Trackers | Unknown | N/A | |
| Echoes of the Past: Boosting Long-tail Recognition via Reflective Learning | Unknown | N/A | |
| MinD-3D: Reconstruct High-quality 3D objects in Human Brain | Unknown | N/A | |
| Lego: Learning to Disentangle and Invert Personalized Concepts Beyond Object Appearance in Text-to-Image Diffusion Models | Unknown | N/A | |
| Taming CLIP for Fine-grained and Structured Visual Understanding of Museum Exhibits | Unknown | N/A | |
| Visual Text Generation in the Wild | Unknown | N/A | |
| Unrolled Decomposed Unpaired Learning for Controllable Low-Light Video Enhancement | Unknown | N/A | |
| E3M: Zero-Shot Spatio-Temporal Video Grounding with Expectation-Maximization Multimodal Modulation | Unknown | N/A | |
| A Unified Image Compression Method for Human Perception and Multiple Vision Tasks | Unknown | N/A | |
| Diffusion for Natural Image Matting | Unknown | N/A | |
| Eliminating Warping Shakes for Unsupervised Online Video Stitching | Unknown | N/A | |
| FairDomain: Achieving Fairness in Cross-Domain Medical Image Segmentation and Classification | Unknown | N/A | |
| Facial Affective Behavior Analysis with Instruction Tuning | Unknown | N/A | |
| Dissolving Is Amplifying: Towards Fine-Grained Anomaly Detection | Unknown | N/A | |
| Learning Quantized Adaptive Conditions for Diffusion Models | Unknown | N/A | |
| Learn to Optimize Denoising Scores: A Unified and Improved Diffusion Prior for 3D Generation | Unknown | N/A | |
| Discovering Unwritten Visual Classifiers with Large Language Models | Unknown | N/A | |
| Enhancing Diffusion Models with Text-Encoder Reinforcement Learning | Unknown | N/A | |
| GenQ: Quantization in Low Data Regimes with Generative Synthetic Data | Unknown | N/A | |
| Parameter-Efficient and Memory-Efficient Tuning for Vision Transformer: A Disentangled Approach | Unknown | N/A | |
| Hetecooper: Feature Collaboration Graph for Heterogeneous Collaborative Perception | Unknown | N/A | |
| DATENeRF: Depth-Aware Text-based Editing of NeRFs | Unknown | N/A | |
| Soft Prompt Generation for Domain Generalization | Unknown | N/A | |
| Efficient Inference of Vision Instruction-Following Models with Elastic Cache | Unknown | N/A | |
| Dynamic Data Selection for Efficient SSL via Coarse-to-Fine Refinement | Unknown | N/A | |
| On the Approximation Risk of Few-Shot Class-Incremental Learning | Unknown | N/A | |
| Time-Efficient and Identity-Consistent Virtual Try-On Using A Variant of Altered Diffusion Models | Unknown | N/A | |
| In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation | Unknown | N/A | |
| Fast Encoding and Decoding for Implicit Video Representation | Unknown | N/A | |
| SAIR: Learning Semantic-aware Implicit Representation | Unknown | N/A | |
| Just a Hint: Point-Supervised Camouflaged Object Detection | Unknown | N/A | |
| Rethinking Normalization Layers for Domain Generalizable Person Re-identification | Unknown | N/A | |
| URS-NeRF: Unordered Rolling Shutter Bundle Adjustment for Neural Radiance Fields | Unknown | N/A | |
| Hierarchically Structured Neural Bones for Reconstructing Animatable Objects from Casual Videos | Unknown | N/A | |
| Efficient Cascaded Multiscale Adaptive Network for Image Restoration | Unknown | N/A | |
| ConGeo: Robust Cross-view Geo-localization across Ground View Variations | Unknown | N/A | |
| Learning to Drive via Asymmetric Self-Play | Unknown | N/A | |
| Event-based Mosaicing Bundle Adjustment | Unknown | N/A | |
| Robust-Wide: Robust Watermarking against Instruction-driven Image Editing | Unknown | N/A | |
| FineMatch: Aspect-based Fine-grained Image and Text Mismatch Detection and Correction | Unknown | N/A | |
| Protecting NeRFs' Copyright via Plug-And-Play Watermarking Base Model | Unknown | N/A | |
| MOD-UV: Learning Mobile Object Detectors from Unlabeled Videos | Unknown | N/A | |
| V-Trans4Style: Visual Transition Recommendation for Video Production Style Adaptation | Unknown | N/A | |
| OmniSat: Self-Supervised Modality Fusion for Earth Observation | Unknown | N/A | |
| WoVoGen: World Volume-aware Diffusion for Controllable Multi-camera Driving Scene Generation | Unknown | N/A | |
| TriNeRFLet: A Wavelet Based Triplane NeRF Representation | Unknown | N/A | |
| Uncertainty-Driven Spectral Compressive Imaging with Spatial-Frequency Transformer | Unknown | N/A | |
| milliFlow: Scene Flow Estimation on mmWave Radar Point Cloud for Human Motion Sensing | Unknown | N/A | |
| Weakly-Supervised Spatio-Temporal Video Grounding with Variational Cross-Modal Alignment | Unknown | N/A | |
| Toward Tiny and High-quality Facial Makeup with Data Amplify Learning | Unknown | N/A | |
| Chronologically Accurate Retrieval for Temporal Grounding of Motion-Language Models | Unknown | N/A | |
| Bidirectional Progressive Transformer for Interaction Intention Anticipation | Unknown | N/A | |
| Semantic-guided Robustness Tuning for Few-Shot Transfer Across Extreme Domain Shift | Unknown | N/A | |
| SlimFlow: Training Smaller One-Step Diffusion Models with Rectified Flow | Unknown | N/A | |
| Domain Reduction Strategy for Non-Line-of-Sight Imaging | Unknown | N/A | |
| Learning to Enhance Aperture Phasor Field for Non-Line-of-Sight Imaging | Unknown | N/A | |
| EcoMatcher: Efficient Clustering Oriented Matcher for Detector-free Image Matching | Unknown | N/A | |
| Learning the Unlearned: Mitigating Feature Suppression in Contrastive Learning | Unknown | N/A | |
| FRI-Net: Floorplan Reconstruction via Room-wise Implicit Representation | Unknown | N/A | |
| C2C: Component-to-Composition Learning for Zero-Shot Compositional Action Recognition | Unknown | N/A | |
| Vamos: Versatile Action Models for Video Understanding | Unknown | N/A | |
| A Framework for Efficient Model Evaluation through Stratification, Sampling, and Estimation | Unknown | N/A | |
| DECIDER: Leveraging Foundation Model Priors for Improved Model Failure Detection and Explanation | Unknown | N/A | |
| AlignDiff: Aligning Diffusion Models for General Few-Shot Segmentation | Unknown | N/A | |
| ExMatch: Self-guided Exploitation for Semi-Supervised Learning with Scarce Labeled Samples | Unknown | N/A | |
| TIP: Tabular-Image Pre-training for Multimodal Classification with Incomplete Data | Unknown | N/A | |
| CaesarNeRF: Calibrated Semantic Representation for Few-Shot Generalizable Neural Rendering | Unknown | N/A | |
| CoSIGN: Few-Step Guidance of ConSIstency Model to Solve General INverse Problems | Unknown | N/A | |
| Open-Vocabulary RGB-Thermal Semantic Segmentation | Unknown | N/A | |
| RaFE: Generative Radiance Fields Restoration | Unknown | N/A | |
| denoiSplit: a method for joint microscopy image splitting and unsupervised denoising | Unknown | N/A | |
| UNIT: Backdoor Mitigation via Automated Neural Distribution Tightening | Unknown | N/A | |
| Efficient Neural Video Representation with Temporally Coherent Modulation | Unknown | N/A | |
| Contourlet Residual for Prompt Learning Enhanced Infrared Image Super-Resolution | Unknown | N/A | |
| Unsupervised Moving Object Segmentation with Atmospheric Turbulence | Unknown | N/A | |
| Modeling Label Correlations with Latent Context for Multi-Label Recognition | Unknown | N/A | |
| Language-Driven Physics-Based Scene Synthesis and Editing via Feature Splatting | Unknown | N/A | |
| MeshAvatar: Learning High-quality Triangular Human Avatars from Multi-view Videos | Unknown | N/A | |
| WindPoly: Polygonal Mesh Reconstruction via Winding Numbers | Unknown | N/A | |
| AdaNAT: Exploring Adaptive Policy for Token-Based Image Generation | Unknown | N/A | |
| Towards Reliable Advertising Image Generation Using Human Feedback | Unknown | N/A | |
| Distributionally Robust Loss for Long-Tailed Multi-Label Image Classification | Unknown | N/A | |
| Weak-to-Strong Compositional Learning from Generative Models for Language-based Object Detection | Unknown | N/A | |
| Inter-Class Topology Alignment for Efficient Black-Box Substitute Attacks | Unknown | N/A | |
| TurboEdit: Real-time text-based disentangled real image editing | Unknown | N/A | |
| The Role of Masking for Efficient Supervised Knowledge Distillation of Vision Transformers | Unknown | N/A | |
| Improving Vision and Language Concepts Understanding with Multimodal Counterfactual Samples | Unknown | N/A | |
| Functional Transform-Based Low-Rank Tensor Factorization for Multi-Dimensional Data Recovery | Unknown | N/A | |
| Harmonizing knowledge Transfer in Neural Network with Unified Distillation | Unknown | N/A | |
| MoEAD: A Parameter-efficient Model for Multi-class Anomaly Detection | Unknown | N/A | |
| Clean & Compact: Efficient Data-Free Backdoor Defense with Model Compactness | Unknown | N/A | |
| Context-Guided Spatial Feature Reconstruction for Efficient Semantic Segmentation | Unknown | N/A | |
| EraseDraw: Learning to Insert Objects by Erasing Them from Images | Unknown | N/A | |
| Language-Assisted Skeleton Action Understanding for Skeleton-Based Temporal Action Segmentation | Unknown | N/A | |
| Spherical Linear Interpolation and Text-Anchoring for Zero-shot Composed Image Retrieval | Unknown | N/A | |
| Attention Prompting on Image for Large Vision-Language Models | Unknown | N/A | |
| ELSE: Efficient Deep Neural Network Inference through Line-based Sparsity Exploration | Unknown | N/A | |
| Personalized Privacy Protection Mask Against Unauthorized Facial Recognition | Unknown | N/A | |
| Content-Aware Radiance Fields: Aligning Model Complexity with Scene Intricacy Through Learned Bitwidth Quantization | Unknown | N/A | |
| A Cephalometric Landmark Regression Method based on Dual-encoder for High-resolution X-ray Image | Unknown | N/A | |
| HGL: Hierarchical Geometry Learning for Test-time Adaptation in 3D Point Cloud Segmentation | Unknown | N/A | |
| Track2Act: Predicting Point Tracks from Internet Videos enables Generalizable Robot Manipulation | Unknown | N/A | |
| Viewpoint textual inversion: discovering scene representations and 3D view control in 2D diffusion models | Unknown | N/A | |
| A Geometric Distortion Immunized Deep Watermarking Framework with Robustness Generalizability | Unknown | N/A | |
| CipherDM: Secure Three-Party Inference for Diffusion Model Sampling | Unknown | N/A | |
| How to Train the Teacher Model for Effective Knowledge Distillation | Unknown | N/A | |
| LineFit: A Geometric Approach for Fitting Line Segments in Images | Unknown | N/A | |
| CompGS: Smaller and Faster Gaussian Splatting with Vector Quantization | Unknown | N/A | |
| Global Counterfactual Directions | Unknown | N/A | |
| VideoAgent: Long-form Video Understanding with Large Language Model as Agent | Unknown | N/A | |
| RoofDiffusion: Constructing Roofs from Severely Corrupted Point Data via Diffusion | Unknown | N/A | |
| Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering | Unknown | N/A | |
| ChEX: Interactive Localization and Region Description in Chest X-rays | Unknown | N/A | |
| MoE-DiffIR: Task-customized Diffusion Priors for Universal Compressed Image Restoration | Unknown | N/A | |
| Grounding Image Matching in 3D with MASt3R | Unknown | N/A | |
| COSMU: Complete 3D human shape from monocular unconstrained images | Unknown | N/A | |
| LASS3D: Language-Assisted Semi-Supervised 3D Semantic Segmentation with Progressive Unreliable Data Exploitation | Unknown | N/A | |
| Efficient Active Domain Adaptation for Semantic Segmentation by Selecting Information-rich Superpixels | Unknown | N/A | |
| Adaptive Human Trajectory Prediction via Latent Corridors | Unknown | N/A | |
| Generalizable Facial Expression Recognition | Unknown | N/A | |
| RS-NeRF: Neural Radiance Fields from Rolling Shutter Images | Unknown | N/A | |
| MARs: Multi-view Attention Regularizations for Patch-based Feature Recognition of Space Terrain | Unknown | N/A | |
| Do Generalised Classifiers really work on Human Drawn Sketches? | Unknown | N/A | |
| Representing Topological Self-Similarity Using Fractal Feature Maps for Accurate Segmentation of Tubular Structures | Unknown | N/A | |
| Grid-Attention: Enhancing Computational Efficiency of Large Vision Models without Fine-Tuning | Unknown | N/A | |
| Detecting As Labeling: Rethinking LiDAR-camera Fusion in 3D Object Detection | Unknown | N/A | |
| GS2Mesh: Surface Reconstruction from Gaussian Splatting via Novel Stereo Views | Unknown | N/A | |
| Enhanced Motion Forecasting with Visual Relation Reasoning | Unknown | N/A | |
| Multi-scale Cross Distillation for Object Detection in Aerial Images | Unknown | N/A | |
| Beyond the Contact: Discovering Comprehensive Affordance for 3D Objects from Pre-trained 2D Diffusion Models | Unknown | N/A | |
| DSA: Discriminative Scatter Analysis for Early Smoke Segmentation | Unknown | N/A | |
| Long-term Temporal Context Gathering for Neural Video Compression | Unknown | N/A | |
| DualBEV: Unifying Dual View Transformation with Probabilistic Correspondences | Unknown | N/A | |
| Continuous SO(3) Equivariant Convolution for 3D Point Cloud Analysis | Unknown | N/A | |
| SemanticHuman-HD: High Resolution Semantic disentangled 3D Human Generation | Unknown | N/A | |
| MedRAT: Unpaired Medical Report Generation via Auxiliary Tasks | Unknown | N/A | |
| Towards Unified Representation of Invariant-Specific Features in Missing Modality Face Anti-Spoofing | Unknown | N/A | |
| Norface: Improving Facial Expression Analysis by Identity Normalization | Unknown | N/A | |
| Exploiting Semantic Reconstruction to Mitigate Hallucinations in Vision-Language Models | Unknown | N/A | |
| Bucketed Ranking-based Losses for Efficient Training of Object Detectors | Unknown | N/A | |
| OmniACT: A Dataset and Benchmark for Enabling Multimodal Generalist Autonomous Agents for Desktop and Web | Unknown | N/A | |
| Test-Time Stain Adaptation with Diffusion Models for Histopathology Image Classification | Unknown | N/A | |
| Self-Supervised Underwater Caustics Removal and Descattering via Deep Monocular SLAM | Unknown | N/A | |
| Dyn-Adapter: Towards Disentangled Representation for Efficient Visual Recognition | Unknown | N/A | |
| The Nerfect Match: Exploring NeRF Features for Visual Localization | Unknown | N/A | |
| SparseCraft: Few-Shot Neural Reconstruction through Stereopsis Guided Geometric Linearization | Unknown | N/A | |
| Image Manipulation Detection With Implicit Neural Representation and Limited Supervision | Unknown | N/A | |
| Adapting to Shifting Correlations with Unlabeled Data Calibration | Unknown | N/A | |
| SCAPE: A Simple and Strong Category-Agnostic Pose Estimator | Unknown | N/A | |
| FedRA: A Random Allocation Strategy for Federated Tuning to Unleash the Power of Heterogeneous Clients | Unknown | N/A | |
| Modelling Competitive Behaviors in Autonomous Driving Under Generative World Model | Unknown | N/A | |
| Image-to-Lidar Relational Distillation for Autonomous Driving Data | Unknown | N/A | |
| EgoCVR: An Egocentric Benchmark for Fine-Grained Composed Video Retrieval | Unknown | N/A | |
| Domain-Adaptive 2D Human Pose Estimation via Dual Teachers in Extremely Low-Light Conditions | Unknown | N/A | |
| Learning-based Axial Video Motion Magnification | Unknown | N/A | |
| IGNORE: Information Gap-based False Negative Loss Rejection for Single Positive Multi-Label Learning | Unknown | N/A | |
| Every Pixel Has its Moments: Ultra-High-Resolution Unpaired Image-to-Image Translation via Dense Normalization | Unknown | N/A | |
| AD3: Introducing a score for Anomaly Detection Dataset Difficulty assessment using VIADUCT dataset | Unknown | N/A | |
| RegionDrag: Fast Region-Based Image Editing with Diffusion Models | Unknown | N/A | |
| FlowCon: Out-of-Distribution Detection using Flow-based Contrastive Learning | Unknown | N/A | |
| CLAP: Isolating Content from Style through Contrastive Learning with Augmented Prompts | Unknown | N/A | |
| CoLA: Conditional Dropout and Language-driven Robust Dual-modal Salient Object Detection | Unknown | N/A | |
| Siamese Vision Transformers are Scalable Audio-visual Learners | Unknown | N/A | |
| Rectify the Regression Bias in Long-Tailed Object Detection | Unknown | N/A | |
| Learning Neural Volumetric Pose Features for Camera Localization | Unknown | N/A | |
| Overcome Modal Bias in Multi-modal Federated Learning via Balanced Modality Selection | Unknown | N/A | |
| Visual Relationship Transformation | Unknown | N/A | |
| Scene-aware Human Motion Forecasting via Mutual Distance Prediction | Unknown | N/A | |
| Occlusion Handling in 3D Human Pose Estimation with Perturbed Positional Encoding | Unknown | N/A | |
| AFF-ttention! Affordances and Attention models for Short-Term Object Interaction Anticipation | Unknown | N/A | |
| Visible and Clear: Finding Tiny Objects in Difference Map | Unknown | N/A | |
| Elysium: Exploring Object-level Perception in Videos through Semantic Integration Using MLLMs | Unknown | N/A | |
| Sequential Representation Learning via Static-Dynamic Conditional Disentanglement | Unknown | N/A | |
| Temporal-Mapping Photography for Event Cameras | Unknown | N/A | |
| RGBD GS-ICP SLAM | Unknown | N/A | |
| SAVE: Protagonist Diversification with Structure Agnostic Video Editing | Unknown | N/A | |
| Rethinking Data Bias: Dataset Copyright Protection via Embedding Class-wise Hidden Bias | Unknown | N/A | |
| Federated Learning with Local Openset Noisy Labels | Unknown | N/A | |
| Match-Stereo-Videos: Bidirectional Alignment for Consistent Dynamic Stereo Matching | Unknown | N/A | |
| End-to-End Rate-Distortion Optimized 3D Gaussian Representation | Unknown | N/A | |
| Multistain Pretraining for Slide Representation Learning in Pathology | Unknown | N/A | |
| Efficient Few-Shot Action Recognition via Multi-Level Post-Reasoning | Unknown | N/A | |
| Connecting Consistency Distillation to Score Distillation for Text-to-3D Generation | Unknown | N/A | |
| GenerateCT: Text-Conditional Generation of 3D Chest CT Volumes | Unknown | N/A | |
| 3R-INN: How to be climate friendly while consuming/delivering videos? | Unknown | N/A | |
| ADMap: Anti-disturbance Framework for Vectorized HD Map Construction | Unknown | N/A | |
| GeometrySticker: Enabling Ownership Claim of Recolorized Neural Radiance Fields | Unknown | N/A | |
| OAT: Object-Level Attention Transformer for Gaze Scanpath Prediction | Unknown | N/A | |
| Self-supervised co-salient object detection via feature correspondences at multiple scales | Unknown | N/A | |
| Improving Knowledge Distillation via Regularizing Feature Direction and Norm | Unknown | N/A | |
| DynMF: Neural Motion Factorization for Real-time Dynamic View Synthesis with 3D Gaussian Splatting | Unknown | N/A | |
| UDiffText: A Unified Framework for High-quality Text Synthesis in Arbitrary Images via Character-aware Diffusion Models | Unknown | N/A | |
| Reinforcement Learning Meets Visual Odometry | Unknown | N/A | |
| PoseSOR: Human Pose Can Guide Our Attention | Unknown | N/A | |
| Canonical Shape Projection is All You Need for 3D Few-shot Class Incremental Learning | Unknown | N/A | |
| SpeedUpNet: A Plug-and-Play Adapter Network for Accelerating Text-to-Image Diffusion Models | Unknown | N/A | |
| Edge-Guided Fusion and Motion Augmentation for Event-Image Stereo | Unknown | N/A | |
| Rethinking Tree-Ring Watermarking for Enhanced Multi-Key Identification | Unknown | N/A | |
| DenseNets Reloaded: Paradigm Shift Beyond ResNets and ViTs | Unknown | N/A | |
| NavGPT-2: Unleashing Navigational Reasoning Capability for Large Vision-Language Models | Unknown | N/A | |
| Dual-stage Hyperspectral Image Classification Model with Spectral Supertoken | Unknown | N/A | |
| Optimal Transport of Diverse Unsupervised Tasks for Robust Learning from Noisy Few-Shot Data | Unknown | N/A | |
| Non-Line-of-Sight Estimation of Fast Human Motion with Slow Scanning Imagers | Unknown | N/A | |
| HPE-Li: WiFi-enabled Lightweight Dual Selective Kernel Convolution for Human Pose Estimation | Unknown | N/A | |
| LITA: Language Instructed Temporal-Localization Assistant | Unknown | N/A | |
| Improving Agent Behaviors with RL Fine-tuning for Autonomous Driving | Unknown | N/A | |
| Enhancing Tracking Robustness with Auxiliary Adversarial Defense Networks | Unknown | N/A | |
| BurstM: Deep Burst Multi-scale SR using Fourier Space with Optical Flow | Unknown | N/A | |
| MEVG: Multi-event Video Generation with Text-to-Video Models | Unknown | N/A | |
| Unsupervised Dense Prediction using Differentiable Normalized Cuts | Unknown | N/A | |
| RPBG: Towards Robust Neural Point-based Graphics in the Wild | Unknown | N/A | |
| uCAP: An Unsupervised Prompting Method for Vision-Language Models | Unknown | N/A | |
| CoLeaF: A Contrastive-Collaborative Learning Framework for Weakly Supervised Audio-Visual Video Parsing | Unknown | N/A | |
| Flexible Distribution Alignment: Towards Long-tailed Semi-supervised Learning with Proper Calibration | Unknown | N/A | |
| PaPr: Training-Free One-Step Patch Pruning with Lightweight ConvNets for Faster Inference | Unknown | N/A | |
| Human Motion Forecasting in Dynamic Domain Shifts: A Homeostatic Continual Test-time Adaptation Framework | Unknown | N/A | |
| Placing Objects in Context via Inpainting for Out-of-distribution Segmentation | Unknown | N/A | |
| Efficient Frequency-Domain Image Deraining with Contrastive Regularization | Unknown | N/A | |
| EgoBody3M: Egocentric Body Tracking on a VR Headset using a Diverse Dataset | Unknown | N/A | |
| Deep Cost Ray Fusion for Sparse Depth Video Completion | Unknown | N/A | |
| Background Adaptation with Residual Modeling for Exemplar-Free Class-Incremental Semantic Segmentation | Unknown | N/A | |
| SSL-Cleanse: Trojan Detection and Mitigation in Self-Supervised Learning | Unknown | N/A | |
| Large-Scale Multi-Hypotheses Cell Tracking Using Ultrametric Contours Maps | Unknown | N/A | |
| Prediction Exposes Your Face: Black-box Model Inversion via Prediction Alignment | Unknown | N/A | |
| Norma: A Noise Robust Memory-Augmented Framework for Whole Slide Image Classification | Unknown | N/A | |
| Aligning Neuronal Coding of Dynamic Visual Scenes with Foundation Vision Models | Unknown | N/A | |
| SAM-COD: SAM-guided Unified Framework for Weakly-Supervised Camouflaged Object Detection | Unknown | N/A | |
| Adaptive High-Frequency Transformer for Diverse Wildlife Re-Identification | Unknown | N/A | |
| Noise-assisted Prompt Learning for Image Forgery Detection and Localization | Unknown | N/A | |
| Strengthening Multimodal Large Language Model with Bootstrapped Preference Optimization | Unknown | N/A | |
| Dynamic Retraining-Updating Mean Teacher for Source-Free Object Detection | Unknown | N/A | |
| Cs2K: Class-specific and Class-shared Knowledge Guidance for Incremental Semantic Segmentation | Unknown | N/A | |
| An accurate detection is not all you need to combat label noise in web-noisy datasets | Unknown | N/A | |
| Self-Supervised Video Copy Localization with Regional Token Representation | Unknown | N/A | |
| Crowd-SAM: SAM as a smart annotator for object detection in crowded scenes | Unknown | N/A | |
| Multi-RoI Human Mesh Recovery with Camera Consistency and Contrastive Losses | Unknown | N/A | |
| Free-VSC: Free Semantics from Visual Foundation Models for Unsupervised Video Semantic Compression | Unknown | N/A | |
| Zero-shot Object Counting with Good Exemplars | Unknown | N/A | |
| On Learning Discriminative Features from Synthesized Data for Self-Supervised Fine-Grained Visual Recognition | Unknown | N/A | |
| Synergy of Sight and Semantics: Visual Intention Understanding with CLIP | Unknown | N/A | |
| FreeMotion: MoCap-Free Human Motion Synthesis with Multimodal Large Language Models | Unknown | N/A | |
| Mini-Splatting: Representing Scenes with a Constrained Number of Gaussians | Unknown | N/A | |
| Single-Photon 3D Imaging with Equi-Depth Photon Histograms | Unknown | N/A | |
| Contrastive Region Guidance: Improving Grounding in Vision-Language Models without Training | Unknown | N/A | |
| Anytime Continual Learning for Open Vocabulary Classification | Unknown | N/A | |
| Gated Temporal Diffusion for Stochastic Long-term Dense Anticipation | Unknown | N/A | |
| Domain Generalization of 3D Object Detection by Density-Resampling | Unknown | N/A | |
| Toward INT4 Fixed-Point Training via Exploring Quantization Error for Gradients | Unknown | N/A | |
| O2V-Mapping: Online Open-Vocabulary Mapping with Neural Implicit Representation | Unknown | N/A | |
| On the Error Analysis of 3D Gaussian Splatting and an Optimal Projection Strategy | Unknown | N/A | |
| Not Just Change the Labels, Learn the Features: Watermarking Deep Neural Networks with Multi-View Data | Unknown | N/A | |
| REVISION: Rendering Tools Enable Spatial Fidelity in Vision-Language Models | Unknown | N/A | |
| Causal Subgraphs and Information Bottlenecks: Redefining OOD Robustness in Graph Neural Networks | Unknown | N/A | |
| Multi-Task Domain Adaptation for Language Grounding with 3D Objects | Unknown | N/A | |
| PISR: Polarimetric Neural Implicit Surface Reconstruction for Textureless and Specular Objects | Unknown | N/A | |
| Watching it in Dark: A Target-aware Representation Learning Framework for High-Level Vision Tasks in Low Illumination | Unknown | N/A | |
| SINDER: Repairing the Singular Defects of DINOv2 | Unknown | N/A | |
| Reconstruction and Simulation of Elastic Objects with Spring-Mass 3D Gaussians | Unknown | N/A | |
| Revisiting Domain-Adaptive Object Detection in Adverse Weather by the Generation and Composition of High-Quality Pseudo-Labels | Unknown | N/A | |
| Open-Set Recognition in the Age of Vision-Language Models | Unknown | N/A | |
| Two-Stage Active Learning for Efficient Temporal Action Segmentation | Unknown | N/A | |
| High-Precision Self-Supervised Monocular Depth Estimation with Rich-Resource Prior | Unknown | N/A | |
| ARoFace: Alignment Robustness to Improve Low-quality Face Recognition | Unknown | N/A | |
| Un-EVIMO: Unsupervised Event-based Independent Motion Segmentation | Unknown | N/A | |
| Finding a needle in a haystack: A Black-Box Approach to Invisible Watermark Detection | Unknown | N/A | |
| Is Retain Set All You Need in Machine Unlearning? Restoring Performance of Unlearned Models with Out-Of-Distribution Images | Unknown | N/A | |
| CloudFixer: Test-Time Adaptation for 3D Point Clouds via Diffusion-Guided Geometric Transformation | Unknown | N/A | |
| Faceptor: A Generalist Model for Face Perception | Unknown | N/A | |
| Shapefusion: 3D localized human diffusion models | Unknown | N/A | |
| LLaVA-UHD: an LMM Perceiving any Aspect Ratio and High-Resolution Images | Unknown | N/A | |
| Training A Secure Model against Data-Free Model Extraction | Unknown | N/A | |
| VeCLIP: Improving CLIP Training via Visual-enriched Captions | Unknown | N/A | |
| Towards Real-World Adverse Weather Image Restoration: Enhancing Clearness and Semantics with Vision-Language Models | Unknown | N/A | |
| Skews in the Phenomenon Space Hinder Generalization in Text-to-Image Generation | Unknown | N/A | |
| Neural Metamorphosis | Unknown | N/A | |
| Superpixel-informed Implicit Neural Representation for Multi-Dimensional Data | Unknown | N/A | |
| UniCode: Learning a Unified Codebook for Multimodal Large Language Models | Unknown | N/A | |
| When Fast Fourier Transform Meets Transformer for Image Restoration | Unknown | N/A | |
| DGD: Dynamic 3D Gaussians Distillation | Unknown | N/A | |
| OMR: Occlusion-Aware Memory-Based Refinement for Video Lane Detection | Unknown | N/A | |
| Subspace Prototype Guidance for Mitigating Class Imbalance in Point Cloud Semantic Segmentation | Unknown | N/A | |
| HyTAS: A Hyperspectral Image Transformer Architecture Search Benchmark and Analysis | Unknown | N/A | |
| Chameleon: A Data-Efficient Generalist for Dense Visual Prediction in the Wild | Unknown | N/A | |
| Dual-Path Adversarial Lifting for Domain Shift Correction in Online Test-time Adaptation | Unknown | N/A | |
| Dropout Mixture Low-Rank Adaptation for Visual Parameters-Efficient Fine-Tuning | Unknown | N/A | |
| Learning Cross-hand Policies of High-DOF Reaching and Grasping | Unknown | N/A | |
| Stepping Stones: A Progressive Training Strategy for Audio-Visual Semantic Segmentation | Unknown | N/A | |
| Learning by Aligning 2D Skeleton Sequences and Multi-Modality Fusion | Unknown | N/A | |
| Probabilistic Weather Forecasting with Deterministic Guidance-based Diffusion Model | Unknown | N/A | |
| Which Model Generated This Image? A Model-Agnostic Approach for Origin Attribution | Unknown | N/A | |
| Keypoint Promptable Re-Identification | Unknown | N/A | |
| When and How do negative prompts take effect? | Unknown | N/A | |
| Rethinking Features-Fused-Pyramid-Neck for Object Detection | Unknown | N/A | |
| Training A Small Emotional Vision Language Model for Visual Art Comprehension | Unknown | N/A | |
| Learning Local Pattern Modularization for Point Cloud Reconstruction from Unseen Classes | Unknown | N/A | |
| Learned HDR Image Compression for Perceptually Optimal Storage and Display | Unknown | N/A | |
| FastCAD: Real-Time CAD Retrieval and Alignment from Scans and Videos | Unknown | N/A | |
| On the Topology Awareness and Generalization Performance of Graph Neural Networks | Unknown | N/A | |
| Accelerating Image Super-Resolution Networks with Pixel-Level Classification | Unknown | N/A | |
| On Calibration of Object Detectors: Pitfalls, Evaluation and Baselines | Unknown | N/A | |
| IVTP: Instruction-guided Visual Token Pruning for Large Vision-Language Models | Unknown | N/A | |
| SimPB: A Single Model for 2D and 3D Object Detection from Multiple Cameras | Unknown | N/A | |
| Compensation Sampling for Improved Convergence in Diffusion Models | Unknown | N/A | |
| Rethinking Fast Adversarial Training: A Splitting Technique To Overcome Catastrophic Overfitting | Unknown | N/A | |
| SkyScenes: A Synthetic Dataset for Aerial Scene Understanding | Unknown | N/A | |
| RING-NeRF: Rethinking Inductive Biases for Versatile and Efficient Neural Fields | Unknown | N/A | |
| SCLIP: Rethinking Self-Attention for Dense Vision-Language Inference | Unknown | N/A | |
| Scissorhands: Scrub Data Influence via Connection Sensitivity in Networks | Unknown | N/A | |
| KeypointDETR: An End-to-End 3D Keypoint Detector | Unknown | N/A | |
| CoMusion: Towards Consistent Stochastic Human Motion Prediction via Motion Diffusion | Unknown | N/A | |
| Real-data-driven 2000 FPS Color Video from Mosaicked Chromatic Spikes | Unknown | N/A | |
| IAM-VFI: Interpolate Any Motion for Video Frame Interpolation with motion complexity map | Unknown | N/A | |
| Implicit Neural Models to Extract Heart Rate from Video | Unknown | N/A | |
| Self-Supervised Any-Point Tracking by Contrastive Random Walks | Unknown | N/A | |
| Cross-Platform Video Person ReID: A New Benchmark Dataset and Adaptation Approach | Unknown | N/A | |
| OphNet: A Large-Scale Video Benchmark for Ophthalmic Surgical Workflow Understanding | Unknown | N/A | |
| Turbo: Informativity-Driven Acceleration Plug-In for Vision-Language Large Models | Unknown | N/A | |
| Unsupervised, Online and On-The-Fly Anomaly Detection For Non-Stationary Image Distributions | Unknown | N/A | |
| Statewide Visual Geolocalization in the Wild | Unknown | N/A | |
| Deblurring 3D Gaussian Splatting | Unknown | N/A | |
| SEDiff: Structure Extraction for Domain Adaptive Depth Estimation via Denoising Diffusion Models | Unknown | N/A | |
| Joint RGB-Spectral Decomposition Model Guided Image Enhancement in Mobile Photography | Unknown | N/A | |
| Layer-Wise Relevance Propagation with Conservation Property for ResNet | Unknown | N/A | |
| Layered Rendering Diffusion Model for Controllable Zero-Shot Image Synthesis | Unknown | N/A | |
| Sketch2Vox: Learning 3D Reconstruction from a Single Monocular Sketch Image | Unknown | N/A | |
| Few-shot Class Incremental Learning with Attention-Aware Self-Adaptive Prompt | Unknown | N/A | |
| Decomposition Betters Tracking Everything Everywhere | Unknown | N/A | |
| R3DS: Reality-linked 3D Scenes for Panoramic Scene Understanding | Unknown | N/A | |
| Make a Strong Teacher with Label Assistance: A Novel Knowledge Distillation Approach for Semantic Segmentation | Unknown | N/A | |
| Lost in Translation: Modern Neural Networks Still Struggle With Small Realistic Image Transformations | Unknown | N/A | |
| Controlling the World by Sleight of Hand | Unknown | N/A | |
| Pseudo-Labelling Should Be Aware of Disguising Channel Activations | Unknown | N/A | |
| Towards Architecture-Agnostic Untrained Networks Priors for Image Reconstruction with Frequency Regularization | Unknown | N/A | |
EMNLP 2020
| Title | Author | PDF_Link | Code_URL |
|---|---|---|---|
| UniGen: Universal Domain Generalization for Sentiment Classification via Zero-shot Dataset Generation | Juhwan Choi, Yeonghwa Kim, Seunguk Yu, JungMin Yun, YoungBin Kim | N/A | N/A |
| Multi-News+: Cost-efficient Dataset Cleansing via LLM-based Data Annotation | Juhwan Choi, JungMin Yun, Kyohoon Jin, YoungBin Kim | N/A | N/A |
| FIZZ: Factual Inconsistency Detection by Zoom-in Summary and Zoom-out Document | Joonho Yang, Seunghyun Yoon, ByeongJeong Kim, Hwanhee Lee | N/A | N/A |
| Prompts have evil twins | Rimon Melamed, Lucas Hurley McCabe, Tanay Wakhare, Yejin Kim, H. Howie Huang, Enric Boix-Adserà | N/A | N/A |
| Table Question Answering for Low-resourced Indic Languages | Vaishali Pal, Evangelos Kanoulas, Andrew Yates, Maarten de Rijke | N/A | N/A |
| ImageInWords: Unlocking Hyper-Detailed Image Descriptions | Roopal Garg, Andrea Burns, Burcu Karagol Ayan, Yonatan Bitton, Ceslee Montgomery, Yasumasa Onoe, Andrew Bunner, Ranjay Krishna, Jason Michael Baldridge, Radu Soricut | N/A | N/A |
| LLM-Based Agent Society Investigation: Collaboration and Confrontation in Avalon Gameplay | Yihuai Lan, Zhiqiang Hu, Lei Wang, Yang Wang, Deheng Ye, Peilin Zhao, Ee-Peng Lim, Hui Xiong, Hao Wang | N/A | N/A |
| When LLMs Meets Acoustic Landmarks: An Efficient Approach to Integrate Speech into Large Language Models for Depression Detection | Xiangyu Zhang, Hexin Liu, Kaishuai Xu, Qiquan Zhang, Daijiao Liu, Beena Ahmed, Julien Epps | N/A | N/A |
| Speaking in Wavelet Domain: A Simple and Efficient Approach to Speed up Speech Diffusion Model | Xiangyu Zhang, Daijiao Liu, Hexin Liu, Qiquan Zhang, Hanyu Meng, Leibny Paola Garcia Perera, EngSiong Chng, Lina Yao | N/A | N/A |
| Hateful Word in Context Classification | Sanne Hoeken, Sina Zarrieß, Özge Alacam | N/A | N/A |
| Eyes Don’t Lie: Subjective Hate Annotation and Detection with Gaze | Özge Alacam, Sanne Hoeken, Sina Zarrieß | N/A | N/A |
| NumeroLogic: Number Encoding for Enhanced LLMs’ Numerical Reasoning | Eli Schwartz, Leshem Choshen, Joseph Shtok, Sivan Doveh, Leonid Karlinsky, Assaf Arbelle | N/A | N/A |
| Thinking Fair and Slow: On the Efficacy of Structured Prompts for Debiasing Language Models | Shaz Furniturewala, Surgan Jandial, Abhinav Java, Pragyan Banerjee, Simra Shahid, Sumit Bhatia, Kokil Jaidka | N/A | N/A |
| A Usage-centric Take on Intent Understanding in E-Commerce | Wendi Zhou, Tianyi Li, Pavlos Vougiouklis, Mark Steedman, Jeff Z. Pan | N/A | N/A |
| Fine-Tuning or Retrieval? Comparing Knowledge Injection in LLMs | Oded Ovadia, Menachem Brief, Moshik Mishaeli, Oren Elisha | N/A | N/A |
| Systematic Biases in LLM Simulations of Debates | Amir Taubenfeld, Yaniv Dover, Roi Reichart, Ariel Goldstein | N/A | N/A |
| Studying and Mitigating Biases in Sign Language Understanding Models | Katherine Atwell, Danielle Bragg, Malihe Alikhani | N/A | N/A |
| Uncertainty in Language Models: Assessment through Rank-Calibration | Xinmeng Huang, Shuo Li, Mengxin Yu, Matteo Sesia, Hamed Hassani, Insup Lee, Osbert Bastani, Edgar Dobriban | N/A | N/A |
| RoTBench: A Multi-Level Benchmark for Evaluating the Robustness of Large Language Models in Tool Learning | Junjie Ye, Yilong Wu, Songyang Gao, Caishuang Huang, Sixian Li, Guanyu Li, Xiaoran Fan, Qi Zhang, Tao Gui, Xuanjing Huang | N/A | N/A |
| Learning Planning-based Reasoning by Trajectories Collection and Process Reward Synthesizing | Fangkai Jiao, Chengwei Qin, Zhengyuan Liu, Nancy F. Chen, Shafiq Joty | N/A | N/A |
| Scaling Properties of Speech Language Models | Santiago Cuervo, Ricard Marxer | N/A | N/A |
| “We Demand Justice!”: Towards Social Context Grounding of Political Texts | Rajkumar Pujari, Chengfei Wu, Dan Goldwasser | N/A | N/A |
| An Experimental Analysis on Evaluating Patent Citations | Rabindra Nath Nandi, Suman Maity, Brian Uzzi, Sourav Medya | N/A | N/A |
| Fine-Tuning Large Language Models to Translate: Will a Touch of Noisy Data in Misaligned Languages Suffice? | Dawei Zhu, Pinzhen Chen, Miaoran Zhang, Barry Haddow, Xiaoyu Shen, Dietrich Klakow | N/A | N/A |
| Consolidating Ranking and Relevance Predictions of Large Language Models through Post-Processing | Le Yan, Zhen Qin, Honglei Zhuang, Rolf Jagerman, Xuanhui Wang, Michael Bendersky, Harrie Oosterhuis | N/A | N/A |
| Strength Lies in Differences! Towards Effective Non-collaborative Dialogues via Tailored Strategy Planning | Tong Zhang, Chen Huang, Yang Deng, Hongru Liang, Jia Liu, Zujie Wen, Wenqiang Lei, Tat-Seng Chua | N/A | N/A |
| Impeding LLM-assisted Cheating in Introductory Programming Assignments via Adversarial Perturbation | Saiful Islam Salim, Rubin Yuchan Yang, Alexander Cooper, Suryashree Ray, Saumya Debray, Sazzadur Rahaman | N/A | N/A |
| Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation | Yuan Ge, Yilun Liu, Chi Hu, Weibin Meng, Shimin Tao, Xiaofeng Zhao, Mahongxia, Zhang Li, Boxing Chen, Hao Yang, Bei Li, Tong Xiao, JingBo Zhu | N/A | N/A |
| On the Influence of Gender and Race in Romantic Relationship Prediction from Large Language Models | Abhilasha Sancheti, Haozhe An, Rachel Rudinger | N/A | N/A |
| EmphAssess: a Prosodic Benchmark on Assessing Emphasis Transfer in Speech-to-Speech Models | Maureen de Seyssel, Antony D’Avirro, Adina Williams, Emmanuel Dupoux | N/A | N/A |
| On Fake News Detection with LLM Enhanced Semantics Mining | Xiaoxiao Ma, Yuchen Zhang, Kaize Ding, Jian Yang, Jia Wu, Hao Fan | N/A | N/A |
| On Sensitivity of Learning with Limited Labelled Data to the Effects of Randomness: Impact of Interactions and Systematic Choices | Branislav Pecher, Ivan Srba, Maria Bielikova | N/A | N/A |
| Evaluating the Instruction-Following Robustness of Large Language Models to Prompt Injection | Zekun Li, Baolin Peng, Pengcheng He, Xifeng Yan | N/A | N/A |
| A Study of Nationality Bias in Names and Perplexity using Off-the-Shelf Affect-related Tweet Classifiers | Valentin Barriere, Sebastian Cifuentes | N/A | N/A |
| Mitigating the Alignment Tax of RLHF | Yong Lin, Hangyu Lin, Wei Xiong, Shizhe Diao, Jianmeng Liu, Jipeng Zhang, Rui Pan, Haoxiang Wang, Wenbin Hu, Hanning Zhang, Hanze Dong, Renjie Pi, Han Zhao, Nan Jiang, Heng Ji, Yuan Yao, Tong Zhang | N/A | N/A |
| Evaluating Readability and Faithfulness of Concept-based Explanations | Meng Li, Haoran Jin, Ruixuan Huang, Zhihao Xu, Defu Lian, Zijia Lin, Di Zhang, Xiting Wang | N/A | N/A |
| Personality-aware Student Simulation for Conversational Intelligent Tutoring Systems | Zhengyuan Liu, Stella Xin Yin, Geyu Lin, Nancy F. Chen | N/A | N/A |
| MSI-Agent: Incorporating Multi-Scale Insight into Embodied Agents for Superior Planning and Decision-Making | Dayuan Fu, Biqing Qi, Yihuai Gao, Che Jiang, Guanting Dong, Bowen Zhou | N/A | N/A |
| CoCoLoFa: A Dataset of News Comments with Common Logical Fallacies Written by LLM-Assisted Crowds | Min-Hsuan Yeh, Ruyuan Wan, Ting-Hao Kenneth Huang | N/A | N/A |
| Tokenization Is More Than Compression | Craig W Schmidt, Varshini Reddy, Haoran Zhang, Alec Alameddine, Omri Uzan, Yuval Pinter, Chris Tanner | N/A | N/A |
| FLIRT: Feedback Loop In-context Red Teaming | Ninareh Mehrabi, Palash Goyal, Christophe Dupuy, Qian Hu, Shalini Ghosh, Richard Zemel, Kai-Wei Chang, Aram Galstyan, Rahul Gupta | N/A | N/A |
| Successfully Guiding Humans with Imperfect Instructions by Highlighting Potential Errors and Suggesting Corrections | Lingjun Zhao, Khanh Xuan Nguyen, Hal Daumé III | N/A | N/A |
| Parameter-Efficient Sparsity Crafting from Dense to Mixture-of-Experts for Instruction Tuning on General Tasks | Haoyuan Wu, Haisheng Zheng, Zhuolun He, Bei Yu | N/A | N/A |
| GeoGPT4V: Towards Geometric Multi-modal Large Language Models with Geometric Image Generation | Shihao Cai, Keqin Bao, Hangyu Guo, Jizhi Zhang, Jun Song, Bo Zheng | N/A | N/A |
| Improved Learned Sparse Retrieval with Entity Vocabulary | Thong Nguyen, Shubham Chatterjee, Sean MacAvaney, Iain Mackie, Jeff Dalton, Andrew Yates | N/A | N/A |
| Let the Expert Stick to His Last: Expert-Specialized Fine-Tuning for Sparse Architectural Large Language Models | Zihan Wang, Deli Chen, Damai Dai, Runxin Xu, Zhuoshu Li, Yu Wu | N/A | N/A |
| LongEmbed: Extending Embedding Models for Long Context Retrieval | Dawei Zhu, Liang Wang, Nan Yang, Yifan Song, Wenhao Wu, Furu Wei, Sujian Li | N/A | N/A |
| Making Large Language Models Better Reasoners with Orchestrated Streaming Experiences | Xiangyang Liu, Junliang He, Xipeng Qiu | N/A | N/A |
| Overcome Noise and Bias: Segmentation-Aided Multi-Granularity Denoising and Debiasing for Enhanced Quarduples Extraction in Dialogue | Xianlong Luo, Yihao Wang, Meng Yang | N/A | N/A |
| Integrating Plutchik’s Theory with Mixture of Experts for Enhancing Emotion Classification | Dongjun Lim, Yun-Gyung Cheong | N/A | N/A |
| In-context Contrastive Learning for Event Causality Identification | Chao Liang, Wei Xiang, Bang Wang | N/A | N/A |
| What’s Mine becomes Yours: Defining, Annotating and Detecting Context-Dependent Paraphrases in News Interview Dialogs | Anna Wegmann, Tijs A. van den Broek, Dong Nguyen | N/A | N/A |
| Language Models Learn Rare Phenomena from Less Rare Phenomena: The Case of the Missing AANNs | Kanishka Misra, Kyle Mahowald | N/A | N/A |
| Large Language Models for Data Annotation: A Survey | Zhen Tan, Dawei Li, Song Wang, Alimohammad Beigi, Bohan Jiang, Amrita Bhattacharjee, Mansooreh Karami, Jundong Li, Lu Cheng, Huan Liu | N/A | N/A |
| Chain-of-Dictionary Prompting Elicits Translation in Large Language Models | Hongyuan Lu, Haoran Yang, Haoyang Huang, Dongdong Zhang, Wai Lam, Furu Wei | N/A | N/A |
| AdaZeta: Adaptive Zeroth-Order Tensor-Train Adaption for Memory-Efficient Large Language Models Fine-Tuning | Yifan Yang, Kai Zhen, Ershad Banijamali, Athanasios Mouchtaris, Zheng Zhang | N/A | N/A |
| RoseLoRA: Row and Column-wise Sparse Low-rank Adaptation of Pre-trained Language Model for Knowledge Editing and Fine-tuning | Haoyu Wang, Tianci Liu, Ruirui Li, Monica Xiao Cheng, Tuo Zhao, Jing Gao | N/A | N/A |
| BlendFilter: Advancing Retrieval-Augmented Large Language Models via Query Generation Blending and Knowledge Filtering | Haoyu Wang, Ruirui Li, Haoming Jiang, Jinjin Tian, Zhengyang Wang, Chen Luo, Xianfeng Tang, Monica Xiao Cheng, Tuo Zhao, Jing Gao | N/A | N/A |
| HEART-felt Narratives: Tracing Empathy and Narrative Style in Personal Stories with LLMs | Jocelyn J Shen, Joel Mire, Hae Won Park, Cynthia Breazeal, Maarten Sap | N/A | N/A |
| Eliminating Biased Length Reliance of Direct Preference Optimization via Down-Sampled KL Divergence | Junru Lu, Jiazheng Li, Siyu An, Meng Zhao, Yulan He, Di Yin, Xing Sun | N/A | N/A |
| Bridging Cultures in the Kitchen: A Framework and Benchmark for Cross-Cultural Recipe Retrieval | Tianyi Hu, Maria Maistro, Daniel Hershcovich | N/A | N/A |
| RULE: Reliable Multimodal RAG for Factuality in Medical Vision Language Models | Peng Xia, Kangyu Zhu, Haoran Li, Hongtu Zhu, Yun Li, Gang Li, Linjun Zhang, Huaxiu Yao | N/A | N/A |
| A Reflective LLM-based Agent to Guide Zero-shot Cryptocurrency Trading | Yuan Li, Bingqiao Luo, Qian Wang, Nuo Chen, Xu Liu, Bingsheng He | N/A | N/A |
| A Survey on In-context Learning | Qingxiu Dong, Lei Li, Damai Dai, Ce Zheng, Jingyuan Ma, Rui Li, Heming Xia, Jingjing Xu, Zhiyong Wu, Baobao Chang, Xu Sun, Lei Li, Zhifang Sui | N/A | N/A |
| DocHieNet: A Large and Diverse Dataset for Document Hierarchy Parsing | Hangdi Xing, Changxu Cheng, Feiyu Gao, Zirui Shao, Zhi Yu, Jiajun Bu, Qi Zheng, Cong Yao | N/A | N/A |
| AMR-Evol: Adaptive Modular Response Evolution Elicits Better Knowledge Distillation for Large Language Models in Code Generation | Ziyang Luo, Xin Li, Hongzhan Lin, Jing Ma, Lidong Bing | N/A | N/A |
| EFUF: Efficient Fine-Grained Unlearning Framework for Mitigating Hallucinations in Multimodal Large Language Models | Shangyu Xing, Fei Zhao, Zhen Wu, Tuo An, Weihao Chen, Chunhui Li, Jianbing Zhang, Xinyu Dai | N/A | N/A |
| Rethinking Pruning Large Language Models: Benefits and Pitfalls of Reconstruction Error Minimization | Sungbin Shin, Wonpyo Park, Jaeho Lee, Namhoon Lee | N/A | N/A |
| LLMs Are Zero-Shot Context-Aware Simultaneous Translators | Roman Koshkin, Katsuhito Sudoh, Satoshi Nakamura | N/A | N/A |
| AgentReview: Exploring Peer Review Dynamics with LLM Agents | Yiqiao Jin, Qinlin Zhao, Yiyang Wang, Hao Chen, Kaijie Zhu, Yijia Xiao, Jindong Wang | N/A | N/A |
| ChatRetriever: Adapting Large Language Models for Generalized and Robust Conversational Dense Retrieval | Kelong Mao, Chenlong Deng, Haonan Chen, Fengran Mo, Zheng Liu, Tetsuya Sakai, Zhicheng Dou | N/A | N/A |
| Fairer Preferences Elicit Improved Human-Aligned Large Language Model Judgments | Han Zhou, Xingchen Wan, Yinhong Liu, Nigel Collier, Ivan Vulić, Anna Korhonen | N/A | N/A |
| Learning Interpretable Legal Case Retrieval via Knowledge-Guided Case Reformulation | Chenlong Deng, Kelong Mao, Zhicheng Dou | N/A | N/A |
| Effective Demonstration Annotation for In-Context Learning via Language Model-Based Determinantal Point Process | Peng Wang, Xiaobin Wang, Chao Lou, Shengyu Mao, Pengjun Xie, Yong Jiang | N/A | N/A |
| Pre-trained Language Models Do Not Help Auto-regressive Text-to-Image Generation | Yuhui Zhang, Brandon McKinzie, Zhe Gan, Vaishaal Shankar, Alexander T Toshev | N/A | N/A |
| QUDSELECT: Selective Decoding for Questions Under Discussion Parsing | Ashima Suvarna, Xiao Liu, Tanmay Parekh, Kai-Wei Chang, Nanyun Peng | N/A | N/A |
| Mitigating Language Bias of LMMs in Social Intelligence Understanding with Virtual Counterfactual Calibration | Peng Chen, Xiao-Yu Guo, Yuan-Fang Li, Xiaowang Zhang, Zhiyong Feng | N/A | N/A |
| Model Balancing Helps Low-data Training and Fine-tuning | Zihang Liu, Yuanzhe Hu, Tianyu Pang, Yefan Zhou, Pu Ren, Yaoqing Yang | N/A | N/A |
| Reuse Your Rewards: Reward Model Transfer for Zero-Shot Cross-Lingual Alignment | Zhaofeng Wu, Ananth Balashankar, Yoon Kim, Jacob Eisenstein, Ahmad Beirami | N/A | N/A |
| Large Language Models as Foundations for Next-Gen Dense Retrieval: A Comprehensive Empirical Assessment | Kun Luo, Minghao Qin, Zheng Liu, Shitao Xiao, Jun Zhao, Kang Liu | N/A | N/A |
| A New Pipeline for Knowledge Graph Reasoning Enhanced by Large Language Models Without Fine-Tuning | Zhongwu Chen, Long Bai, Zixuan Li, Zhen Huang, Xiaolong Jin, Yong Dou | N/A | N/A |
| Towards Tool Use Alignment of Large Language Models | Zhi-Yuan Chen, Shiqi Shen, Guangyao Shen, Gong Zhi, Xu Chen, Yankai Lin | N/A | N/A |
| DecorateLM: Data Engineering through Corpus Rating, Tagging, and Editing with Language Models | Ranchi Zhao, Zhen Leng Thai, Yifan Zhang, Shengding Hu, Jie Zhou, Yunqi Ba, Jie Cai, Zhiyuan Liu, Maosong Sun | N/A | N/A |
| Lookback Lens: Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps | Yung-Sung Chuang, Linlu Qiu, Cheng-Yu Hsieh, Ranjay Krishna, Yoon Kim, James R. Glass | N/A | N/A |
| Controllable Preference Optimization: Toward Controllable Multi-Objective Alignment | Yiju Guo, Ganqu Cui, Lifan Yuan, Ning Ding, Zexu Sun, Bowen Sun, Huimin Chen, Ruobing Xie, Jie Zhou, Yankai Lin, Zhiyuan Liu, Maosong Sun | N/A | N/A |
| Mitigating Matthew Effect: Multi-Hypergraph Boosted Multi-Interest Self-Supervised Learning for Conversational Recommendation | Yongsen Zheng, Ruilin Xu, Guohua Wang, Liang Lin | N/A | N/A |
| Advancing Event Causality Identification via Heuristic Semantic Dependency Inquiry Network | Haoran Li, Qiang Gao, Hongmei Wu, Li Huang | N/A | N/A |
| Exploring Union and Intersection of Visual Regions for Generating Questions, Answers, and Distractors | Wenjian Ding, Yao Zhang, Jun Wang, Adam Jatowt, Zhenglu Yang | N/A | N/A |
| UniFashion: A Unified Vision-Language Model for Multimodal Fashion Retrieval and Generation | Xiangyu Zhao, Yuehan Zhang, Wenlong Zhang, Xiao-Ming Wu | N/A | N/A |
| Tracking the perspectives of interacting language models | Hayden Helm, Brandon Duderstadt, Youngser Park, Carey Priebe | N/A | N/A |
| MAR: Matching-Augmented Reasoning for Enhancing Visual-based Entity Question Answering | Zhengxuan Zhang, Yin Wu, Yuyu Luo, Nan Tang | N/A | N/A |
| Can Large Language Models Always Solve Easy Problems if They Can Solve Harder Ones? | Zhe Yang, Yichang Zhang, Tianyu Liu, Jian Yang, Junyang Lin, Chang Zhou, Zhifang Sui | N/A | N/A |
| Watch Every Step! LLM Agent Learning via Iterative Step-level Process Refinement | Weimin Xiong, Yifan Song, Xiutian Zhao, Wenhao Wu, Xun Wang, Ke Wang, Cheng Li, Wei Peng, Sujian Li | N/A | N/A |
| Standardize: Aligning Language Models with Expert-Defined Standards for Content Generation | Joseph Marvin Imperial, Gail Forey, Harish Tayyar Madabushi | N/A | N/A |
| Cross-domain NER with Generated Task-Oriented Knowledge: An Empirical Study from Information Density Perspective | Zhihao Zhang, Sophia Yat Mei Lee, Junshuang Wu, Dong Zhang, Shoushan Li, Erik Cambria, Guodong Zhou | N/A | N/A |
| “Glue pizza and eat rocks” - Exploiting Vulnerabilities in Retrieval-Augmented Generative Models | Zhen Tan, Chengshuai Zhao, Raha Moraffah, Yifan Li, Song Wang, Jundong Li, Tianlong Chen, Huan Liu | N/A | N/A |
| Predicate Debiasing in Vision-Language Models Integration for Scene Graph Generation Enhancement | Yuxuan Wang, Xiaoyuan Liu | N/A | N/A |
| SHIELD: Evaluation and Defense Strategies for Copyright Compliance in LLM Text Generation | Xiaoze Liu, Ting Sun, Tianyang Xu, Feijie Wu, Cunxiang Wang, Xiaoqian Wang, Jing Gao | N/A | N/A |
| MatchTime: Towards Automatic Soccer Game Commentary Generation | Jiayuan Rao, Haoning Wu, Chang Liu, Yanfeng Wang, Weidi Xie | N/A | N/A |
| Rethinking Token Reduction for State Space Models | Zheng Zhan, Yushu Wu, Zhenglun Kong, Changdi Yang, Yifan Gong, Xuan Shen, Xue Lin, Pu Zhao, Yanzhi Wang | N/A | N/A |
| Triad: A Framework Leveraging a Multi-Role LLM-based Agent to Solve Knowledge Base Question Answering | Chang Zong, Yuchen Yan, Weiming Lu, Jian Shao, Yongfeng Huang, Heng Chang, Yueting Zhuang | N/A | N/A |
| MetaGPT: Merging Large Language Models Using Model Exclusive Task Arithmetic | Yuyan Zhou, Liang Song, Bingning Wang, Weipeng Chen | N/A | N/A |
| Event Causality Identification with Synthetic Control | Haoyu Wang, Fengze Liu, Jiayao Zhang, Dan Roth, Kyle Richardson | N/A | N/A |
| Retrieved Sequence Augmentation for Protein Representation Learning | Chang Ma, Haiteng Zhao, Lin Zheng, Jiayi Xin, Qintong Li, Lijun Wu, Zhihong Deng, Yang Young Lu, Qi Liu, Sheng Wang, Lingpeng Kong | N/A | N/A |
| HELPD: Mitigating Hallucination of LVLMs by Hierarchical Feedback Learning with Vision-enhanced Penalty Decoding | Fan Yuan, Chi Qin, Xiaogang Xu, Piji Li | N/A | N/A |
| TopViewRS: Vision-Language Models as Top-View Spatial Reasoners | Chengzu Li, Caiqi Zhang, Han Zhou, Nigel Collier, Anna Korhonen, Ivan Vulić | N/A | N/A |
| DA$^3$: A Distribution-Aware Adversarial Attack against Language Models | Yibo Wang, Xiangjue Dong, James Caverlee, Philip S. Yu | N/A | N/A |
| Evaluating Psychological Safety of Large Language Models | Xingxuan Li, Yutong Li, Lin Qiu, Shafiq Joty, Lidong Bing | N/A | N/A |
| An Effective Deployment of Diffusion LM for Data Augmentation in Low-Resource Sentiment Classification | Zhuowei Chen, Lianxi Wang, Yuben Wu, Xinfeng Liao, Yujia Tian, Junyang Zhong | N/A | N/A |
| Self-Bootstrapped Visual-Language Model for Knowledge Selection and Question Answering | Dongze Hao, Qunbo Wang, Longteng Guo, Jie Jiang, Jing Liu | N/A | N/A |
| PsFuture: A Pseudo-Future-based Zero-Shot Adaptive Policy for Simultaneous Machine Translation | Libo Zhao, Jing Li, Ziqian Zeng | N/A | N/A |
| TinyChart: Efficient Chart Understanding with Program-of-Thoughts Learning and Visual Token Merging | Liang Zhang, Anwen Hu, Haiyang Xu, Ming Yan, Yichen Xu, Qin Jin, Ji Zhang, Fei Huang | N/A | N/A |
| Do We Need Language-Specific Fact-Checking Models? The Case of Chinese | Caiqi Zhang, Zhijiang Guo, Andreas Vlachos | N/A | N/A |
| Enhancing Advanced Visual Reasoning Ability of Large Language Models | Zhiyuan Li, Dongnan Liu, Chaoyi Zhang, Heng Wang, Tengfei Xue, Weidong Cai | N/A | N/A |
| CMD: a framework for Context-aware Model self-Detoxification | Zecheng Tang, Keyan Zhou, Juntao Li, Yuyang Ding, Pinzheng Wang, Yan Bowen, Renjie Hua, Min Zhang | N/A | N/A |
| Embedding and Gradient Say Wrong: A White-Box Method for Hallucination Detection | Xiaomeng Hu, Yiming Zhang, Ru Peng, Haozhe Zhang, Chenwei Wu, Gang Chen, Junbo Zhao | N/A | N/A |
| TCSinger: Zero-Shot Singing Voice Synthesis with Style Transfer and Multi-Level Style Control | Yu Zhang, Ziyue Jiang, Ruiqi Li, Changhao Pan, Jinzheng He, Rongjie Huang, Chuxin Wang, Zhou Zhao | N/A | N/A |
| Be Helpful but Don’t Talk too Much - Enhancing Helpfulness in Conversations through Relevance in Multi-Turn Emotional Support | Li Junlin, Bo Peng, Yu-Yin Hsu, Chu-Ren Huang | N/A | N/A |
| Aligning Language Models to Explicitly Handle Ambiguity | Hyuhng Joon Kim, Youna Kim, Cheonbok Park, Junyeob Kim, Choonghyun Park, Kang Min Yoo, Sang-goo Lee, Taeuk Kim | N/A | N/A |
| Tag-grounded Visual Instruction Tuning with Retrieval Augmentation | Daiqing Qi, Handong Zhao, Zijun Wei, Sheng Li | N/A | N/A |
| GLaPE: Gold Label-agnostic Prompt Evaluation for Large Language Models | Xuanchang Zhang, Zhuosheng Zhang, Hai Zhao | N/A | N/A |
| Decoding the Echoes of Vision from fMRI: Memory Disentangling for Past Semantic Information | Runze Xia, Congchi Yin, Piji Li | N/A | N/A |
| Optimizing Code Retrieval: High-Quality and Scalable Dataset Annotation through Large Language Models | Rui Li, Qi Liu, Liyang He, Zheng Zhang, Hao Zhang, Shengyu Ye, Junyu Lu, Zhenya Huang | N/A | N/A |
| Towards Difficulty-Agnostic Efficient Transfer Learning for Vision-Language Models | Yongjin Yang, Jongwoo Ko, Se-Young Yun | N/A | N/A |
| Advancing Process Verification for Large Language Models via Tree-Based Preference Learning | Mingqian He, Yongliang Shen, Wenqi Zhang, Zeqi Tan, Weiming Lu | N/A | N/A |
| An Inversion Attack Against Obfuscated Embedding Matrix in Language Model Inference | Yu Lin, Qizhi Zhang, Quanwei Cai, Jue Hong, Wu Ye, Huiqi Liu, Bing Duan | N/A | N/A |
| MantisScore: A Reliable Fine-grained Metric for Video Generation | Xuan He, Dongfu Jiang, Ge Zhang, Max Ku, Achint Soni, Sherman Siu, Haonan Chen, Abhranil Chandra, Ziyan Jiang, Aaran Arulraj, Kai Wang, Quy Duc Do, Yuansheng Ni, Bohan Lyu, Yaswanth Narsupalli, Rongqi Fan, Zhiheng Lyu, Bill Yuchen Lin, Wenhu Chen | N/A | N/A |
| A ∧ B ⇔ B ∧ A: Evaluating and Improving Logical Reasoning Ability of Large Language Models | Yuxuan Wan, Wenxuan Wang, Yiliu Yang, Youliang Yuan, Jen-tse Huang, Pinjia He, Wenxiang Jiao, Michael Lyu | N/A | N/A |
| Integrating Structural Semantic Knowledge for Enhanced Information Extraction Pre-training | Xiaoyang Yi, Yuru Bao, Jian Zhang, Yifang Qin, Faxin Lin | N/A | N/A |
| FuseGen: PLM Fusion for Data-generation based Zero-shot Learning | Tianyuan Zou, Yang Liu, Peng Li, Jianqing Zhang, Jingjing Liu, Ya-Qin Zhang | N/A | N/A |
| I Need Help! Evaluating LLM’s Ability to Ask for Users’ Support: A Case Study on Text-to-SQL Generation | Cheng-Kuang Wu, Zhi Rui Tam, Chao-Chung Wu, Chieh-Yen Lin, Hung-yi Lee, Yun-Nung Chen | N/A | N/A |
| Oddballs and Misfits: Detecting Implicit Abuse in Which Identity Groups are Depicted as Deviating from the Norm | Michael Wiegand, Josef Ruppenhofer | N/A | N/A |
| By My Eyes: Grounding Multimodal Large Language Models with Sensor Data via Visual Prompting | Hyungjun Yoon, Biniyam Aschalew Tolera, Taesik Gong, Kimin Lee, Sung-Ju Lee | N/A | N/A |
| Prefixing Attention Sinks can Mitigate Activation Outliers for Large Language Model Quantization | Seungwoo Son, Wonpyo Park, Woohyun Han, Kyuyeun Kim, Jaeho Lee | N/A | N/A |
| CHIQ: Contextual History Enhancement for Improving Query Rewriting in Conversational Search | Fengran Mo, Abbas Ghaddar, Kelong Mao, Mehdi Rezagholizadeh, Boxing Chen, Qun Liu, Jian-Yun Nie | N/A | N/A |
| Towards Low-Resource Harmful Meme Detection with LMM Agents | Jianzhao Huang, Hongzhan Lin, Ziyan Liu, Ziyang Luo, Guang Chen, Jing Ma | N/A | N/A |
| VIVA: A Benchmark for Vision-Grounded Decision-Making with Human Values | Zhe Hu, Yixiao Ren, Jing Li, Yu Yin | N/A | N/A |
| Direct Multi-Turn Preference Optimization for Language Agents | Wentao Shi, Mengqi Yuan, Junkang Wu, Qifan Wang, Fuli Feng | N/A | N/A |
| Self-Refine Instruction-Tuning for Aligning Reasoning in Language Models | Leonardo Ranaldi, Andre Freitas | N/A | N/A |
| In Search of the Long-Tail: Systematic Generation of Long-Tail Inferential Knowledge via Logical Rule Guided Search | Huihan Li, Yuting Ning, Zeyi Liao, Siyuan Wang, Xiang Lorraine Li, Ximing Lu, Wenting Zhao, Faeze Brahman, Yejin Choi, Xiang Ren | N/A | N/A |
| AutoScraper: A Progressive Understanding Web Agent for Web Scraper Generation | Wenhao Huang, Zhouhong Gu, Chenghao Peng, Jiaqing Liang, Zhixu Li, Yanghua Xiao, Liqian Wen, Zulong Chen | N/A | N/A |
| Backward Lens: Projecting Language Model Gradients into the Vocabulary Space | Shahar Katz, Yonatan Belinkov, Mor Geva, Lior Wolf | N/A | N/A |
| Selective Vision is the Challenge for Visual Reasoning: A Benchmark for Visual Argument Understanding | Jiwan Chung, Sungjae Lee, Minseo Kim, Seungju Han, Ashkan Yousefpour, Jack Hessel, Youngjae Yu | N/A | N/A |
| Can visual language models resolve textual ambiguity with visual cues? Let visual puns tell you! | Jiwan Chung, Seungwon Lim, Jaehyun Jeon, Seungbeen Lee, Youngjae Yu | N/A | N/A |
| Reusing Transferable Weight Increments for Low-resource Style Generation | Chunzhen Jin, Eliot Huang, Heng Chang, Yaqi Wang, Peng Cao, Osmar Zaiane | N/A | N/A |
| Large Language Model as an Assignment Evaluator: Insights, Feedback, and Challenges in a 1000+ Student Course | Cheng-Han Chiang, Wei-Chih Chen, Chun-Yi Kuan, Chienchou Yang, Hung-yi Lee | N/A | N/A |
| Seemingly Plausible Distractors in Multi-Hop Reasoning: Are Large Language Models Attentive Readers? | Neeladri Bhuiya, Viktor Schlegel, Stefan Winkler | N/A | N/A |
| Instruction Pre-Training: Language Models are Supervised Multitask Learners | Daixuan Cheng, Yuxian Gu, Shaohan Huang, Junyu Bi, Minlie Huang, Furu Wei | N/A | N/A |
| LEMoE: Advanced Mixture of Experts Adaptor for Lifelong Model Editing of Large Language Models | Renzhi Wang, Piji Li | N/A | N/A |
| Collaborative Performance Prediction for Large Language Models | Qiyuan Zhang, Fuyuan Lyu, Xue Liu, Chen Ma | N/A | N/A |
| Surveying the Dead Minds: Historical-Psychological Text Analysis with Contextualized Construct Representation (CCR) for Classical Chinese | Yuqi Chen, Sixuan Li, Ying Li, Mohammad Atari | N/A | N/A |
| Knowledge Verification to Nip Hallucination in the Bud | Fanqi Wan, Xinting Huang, Leyang Cui, Xiaojun Quan, Wei Bi, Shuming Shi | N/A | N/A |
| QUITE: Quantifying Uncertainty in Natural Language Text in Bayesian Reasoning Scenarios | Timo Pierre Schrader, Lukas Lange, Simon Razniewski, Annemarie Friedrich | N/A | N/A |
| African or European Swallow? Benchmarking Large Vision-Language Models for Fine-Grained Object Classification | Gregor Geigle, Radu Timofte, Goran Glavaš | N/A | N/A |
| Whispers that Shake Foundations: Analyzing and Mitigating False Premise Hallucinations in Large Language Models | Hongbang Yuan, Pengfei Cao, Zhuoran Jin, Yubo Chen, Daojian Zeng, Kang Liu, Jun Zhao | N/A | N/A |
| To Word Senses and Beyond: Inducing Concepts with Contextualized Language Models | Bastien Liétard, Pascal Denis, Mikaela Keller | N/A | N/A |
| ASETF: A Novel Method for Jailbreak Attack on LLMs through Translate Suffix Embeddings | Hao Wang, Hao Li, Minlie Huang, Lei Sha | N/A | N/A |
| An Electoral Approach to Diversify LLM-based Multi-Agent Collective Decision-Making | Xiutian Zhao, Ke Wang, Wei Peng | N/A | N/A |
| Does Object Grounding Really Reduce Hallucination of Large Vision-Language Models? | Gregor Geigle, Radu Timofte, Goran Glavaš | N/A | N/A |
| Take Off the Training Wheels! Progressive In-Context Learning for Effective Alignment | Zhenyu Liu, Dongfang Li, Xinshuo Hu, Xinping Zhao, Yibin Chen, Baotian Hu, Min Zhang | N/A | N/A |
| MoDULA: Mixture of Domain-Specific and Universal LoRA for Multi-Task Learning | Yufei Ma, Zihan Liang, Huangyu Dai, Ben Chen, Dehong Gao, Zhuoran Ran, Zihan Wang, Linbo Jin, Wen Jiang, Guannan Zhang, Xiaoyan Cai, Libin Yang | N/A | N/A |
| Message Passing on Semantic-Anchor-Graphs for Fine-grained Emotion Representation Learning and Classification | Pinyi Zhang, Jingyang Chen, Junchen Shen, Zijie Zhai, Ping Li, Jie Zhang, Kai Zhang | N/A | N/A |
| PhiloGPT: A Philology-Oriented Large Language Model for Ancient Chinese Manuscripts with Dunhuang as Case Study | Yuqing Zhang, Baoyi He, Yihan Chen, Hangqi Li, Han Yue, Shengyu Zhang, Huaiyong Dou, Junchi Yan, Zemin Liu, Yongquan Zhang, Fei Wu | N/A | N/A |
| Alignment-Enhanced Decoding: Defending via Token-Level Adaptive Refining of Probability Distributions | Quan Liu, Zhenhong Zhou, Longzhu He, Yi Liu, Wei Zhang, Sen Su | N/A | N/A |
| MiniConGTS: A Near Ultimate Minimalist Contrastive Grid Tagging Scheme for Aspect Sentiment Triplet Extraction | Qiao Sun, Liujia Yang, Minghao Ma, Nanyang Ye, Qinying Gu | N/A | N/A |
| Evaluating Large Language Models via Linguistic Profiling | Alessio Miaschi, Felice Dell’Orletta, Giulia Venturi | N/A | N/A |
| With Ears to See and Eyes to Hear: Sound Symbolism Experiments with Multimodal Large Language Models | Tyler Loakman, Yucheng Li, Chenghua Lin | N/A | N/A |
| KB-Plugin: A Plug-and-play Framework for Large Language Models to Induce Programs over Low-resourced Knowledge Bases | Jiajie Zhang, Shulin Cao, Linmei Hu, Ling Feng, Lei Hou, Juanzi Li | N/A | N/A |
| Understanding Higher-Order Correlations Among Semantic Components in Embeddings | Momose Oyama, Hiroaki Yamagiwa, Hidetoshi Shimodaira | N/A | N/A |
| DGLF: A Dual Graph-based Learning Framework for Multi-modal Sarcasm Detection | Zhihong Zhu, Kefan Shen, Zhaorun Chen, Yunyan Zhang, Yuyan Chen, Xiaoqi Jiao, Zhongwei Wan, Wei Liu, Xian Wu, Shaorong Xie, Yefeng Zheng | N/A | N/A |
| Evaluating D-MERIT of Partial-annotation on Information Retrieval | Royi Rassin, Yaron Fairstein, Oren Kalinsky, Guy Kushilevitz, Nachshon Cohen, Alexander Libov, Yoav Goldberg | N/A | N/A |
| Verification and Refinement of Natural Language Explanations through LLM-Symbolic Theorem Proving | Xin Quan, Marco Valentino, Louise A. Dennis, Andre Freitas | N/A | N/A |
| Calibrating the Confidence of Large Language Models by Eliciting Fidelity | Mozhi Zhang, Mianqiu Huang, Rundong Shi, Linsen Guo, Chong Peng, Peng Yan, Yaqian Zhou, Xipeng Qiu | N/A | N/A |
| Exploring Reward Model Strength’s Impact on Language Models | Yanjun Chen, Dawei Zhu, Yirong Sun, Xinghao Chen, Wei Zhang, Xiaoyu Shen | N/A | N/A |
| How Hard is this Test Set? NLI Characterization by Exploiting Training Dynamics | Adrian Cosma, Stefan Ruseti, Mihai Dascalu, Cornelia Caragea | N/A | N/A |
| Zero-shot Cross-Lingual Transfer for Synthetic Data Generation in Grammatical Error Detection | Gaetan Lopez Latouche, Marc-André Carbonneau, Benjamin Swanson | N/A | N/A |
| CUTE: Measuring LLMs’ Understanding of Their Tokens | Lukas Edman, Helmut Schmid, Alexander Fraser | N/A | N/A |
| SEER: Self-Aligned Evidence Extraction for Retrieval-Augmented Generation | Xinping Zhao, Dongfang Li, Yan Zhong, Boren Hu, Yibin Chen, Baotian Hu, Min Zhang | N/A | N/A |
| On The Role of Context in Reading Time Prediction | Andreas Opedal, Eleanor Chodroff, Ryan Cotterell, Ethan Wilcox | N/A | N/A |
| BC-Prover: Backward Chaining Prover for Formal Theorem Proving | Yuhang He, Jihai Zhang, Jianzhu Bao, Fangquan Lin, Cheng Yang, Bing Qin, Ruifeng Xu, Wotao Yin | N/A | N/A |
| From Insights to Actions: The Impact of Interpretability and Analysis Research on NLP | Marius Mosbach, Vagrant Gautam, Tomás Vergara Browne, Dietrich Klakow, Mor Geva | N/A | N/A |
| Dual Modalities of Text: Visual and Textual Generative Pre-Training | Yekun Chai, Qingyi Liu, Jingwu Xiao, Shuohuan Wang, Yu Sun, Hua Wu | N/A | N/A |
| On Training Data Influence of GPT Models | Qingyi Liu, Yekun Chai, Shuohuan Wang, Yu Sun, Qiwei Peng, Hua Wu | N/A | N/A |
| Understanding “Democratization” in NLP and ML Research | Arjun Subramonian, Vagrant Gautam, Dietrich Klakow, Zeerak Talat | N/A | N/A |
| DocKD: Knowledge Distillation from LLMs for Open-World Document Understanding Models | Sungnyun Kim, Haofu Liao, Srikar Appalaraju, Peng Tang, Zhuowen Tu, Ravi Kumar Satzoda, R. Manmatha, Vijay Mahadevan, Stefano Soatto | N/A | N/A |
| Cross-lingual Transfer for Automatic Question Generation by Learning Interrogative Structures in Target Languages | Seonjeong Hwang, Yunsu Kim, Gary Lee | N/A | N/A |
| ScalingFilter: Assessing Data Quality through Inverse Utilization of Scaling Laws | Ruihang Li, Yixuan Wei, Miaosen Zhang, Nenghai Yu, Han Hu, Houwen Peng | N/A | N/A |
| Word Alignment as Preference for Machine Translation | Qiyu Wu, Masaaki Nagata, Zhongtao Miao, Yoshimasa Tsuruoka | N/A | N/A |
| Improving Multi-party Dialogue Generation via Topic and Rhetorical Coherence | Yaxin Fan, Peifeng Li, Qiaoming Zhu | N/A | N/A |
| SEEKR: Selective Attention-Guided Knowledge Retention for Continual Learning of Large Language Models | Jinghan He, Haiyun Guo, Kuan Zhu, Zihan Zhao, Ming Tang, Jinqiao Wang | N/A | N/A |
| Neuron-Level Knowledge Attribution in Large Language Models | Zeping Yu, Sophia Ananiadou | N/A | N/A |
| How do Large Language Models Learn In-Context? Query and Key Matrices of In-Context Heads are Two Towers for Metric Learning | Zeping Yu, Sophia Ananiadou | N/A | N/A |
| Interpreting Arithmetic Mechanism in Large Language Models through Comparative Neuron Analysis | Zeping Yu, Sophia Ananiadou | N/A | N/A |
| Pixology: Probing the Linguistic and Visual Knowledge of Pixel-based Language Models | Kushal Tatariya, Vladimir Araujo, Thomas Bauwens, Miryam de Lhoneux | N/A | N/A |
| GoldCoin: Grounding Large Language Models in Privacy Laws via Contextual Integrity Theory | Wei Fan, Haoran Li, Zheye Deng, Weiqi Wang, Yangqiu Song | N/A | N/A |
| Noise, Novels, Numbers. A Framework for Detecting and Categorizing Noise in Danish and Norwegian Literature | Ali Allaith, Daniel Hershcovich, Jens Bjerring-Hansen, Jakob Ingemann Parby, Alexander Conroy, Timothy R Tangherlini | N/A | N/A |
| QUIK: Towards End-to-end 4-Bit Inference on Generative Large Language Models | Saleh Ashkboos, Ilia Markov, Elias Frantar, Tingxuan Zhong, Xincheng Wang, Jie Ren, Torsten Hoefler, Dan Alistarh | N/A | N/A |
| Fine-Grained Prediction of Reading Comprehension from Eye Movements | Omer Shubi, Yoav Meiri, Cfir Avraham Hadar, Yevgeni Berzak | N/A | N/A |
| Efficient Retriever for Multi-Hop Retrieval Question Answering | Ziyuan Zhuang, Zhiyang Zhang, Sitao Cheng, Fangkai Yang, Jia Liu, Shujian Huang, Qingwei Lin, Saravan Rajmohan, Dongmei Zhang, Qi Zhang | N/A | N/A |
| Unsupervised Human Preference Learning | Sumuk Shashidhar, Abhinav Chinta, Vaibhav Sahai, Dilek Hakkani Tur | N/A | N/A |
| Is Safer Better? The Impact of Guardrails on the Argumentative Strength of LLMs in Hate Speech Countering | Helena Bonaldi, Greta Damo, Nicolás Benjamín Ocampo, Elena Cabrio, Serena Villata, Marco Guerini | N/A | N/A |
| Leading Whitespaces of Language Models’ Subword Vocabulary Poses a Confound for Calculating Word Probabilities | Byung-Doh Oh, William Schuler | N/A | N/A |
| LLM4Decompile: Decompiling Binary Code with Large Language Models | Hanzhuo Tan, Qi Luo, Jing Li, Yuqun Zhang | N/A | N/A |
| From Bottom to Top: Extending the Potential of Parameter Efficient Fine-Tuning | Jihao Gu, Zelin Wang, Yibo Zhang, Ziji Zhang, Ping Gong | N/A | N/A |
| CoTKR: Chain-of-Thought Enhanced Knowledge Rewriting for Complex Knowledge Graph Question Answering | Yike Wu, Yi Huang, Nan Hu, Yuncheng Hua, Guilin Qi, Jiaoyan Chen, Jeff Z. Pan | N/A | N/A |
| MTLS: Making Texts into Linguistic Symbols | Wenlong Fei, Xiaohua Wang, Min Hu, Qingyu Zhang, Hongbo Li | N/A | N/A |
| D2R: Dual-Branch Dynamic Routing Network for Multimodal Sentiment Detection | Yifan Chen, Kuntao Li, Weixing Mai, Qiaofeng Wu, Yun Xue, Fenghuan Li | N/A | N/A |
| A Generic Method for Fine-grained Category Discovery in Natural Language Texts | Chang Tian, Matthew B. Blaschko, Wenpeng Yin, Mingzhe Xing, Yinliang Yue, Marie-Francine Moens | N/A | N/A |
| Toxicity Detection is NOT all you Need: Measuring the Gaps to Supporting Volunteer Content Moderators through a User-Centric Method | Yang Trista Cao, Lovely-Frances Domingo, Sarah Gilbert, Michelle L. Mazurek, Katherine Shilton, Hal Daumé III | N/A | N/A |
| A User-Centric Multi-Intent Benchmark for Evaluating Large Language Models | Jiayin Wang, Fengran Mo, Weizhi Ma, Peijie Sun, Min Zhang, Jian-Yun Nie | N/A | N/A |
| Decompose and Compare Consistency: Measuring VLMs’ Answer Reliability via Task-Decomposition Consistency Comparison | Qian Yang, Weixiang Yan, Aishwarya Agrawal | N/A | N/A |
| Learn to Refuse: Making Large Language Models More Controllable and Reliable through Knowledge Scope Limitation and Refusal Mechanism | Lang Cao | N/A | N/A |
| VGBench: A Comprehensive Benchmark of Vector Graphics Understanding and Generation for Large Language Models | Bocheng Zou, Mu Cai, Jianrui Zhang, Yong Jae Lee | N/A | N/A |
| What do large language models need for machine translation evaluation? | Shenbin Qian, Archchana Sindhujan, Minnie Kabra, Diptesh Kanojia, Constantin Orasan, Tharindu Ranasinghe, Fred Blain | N/A | N/A |
| Performance-Guided LLM Knowledge Distillation for Efficient Text Classification at Scale | Flavio Di Palo, Prateek Singhi, Bilal H Fadlallah | N/A | N/A |
| External Knowledge-Driven Argument Mining: Leveraging Attention-Enhanced Multi-Network Models | Debela Gemechu, Chris Reed | N/A | N/A |
| C3PA: An Open Dataset of Expert-Annotated and Regulation-Aware Privacy Policies to Enable Scalable Regulatory Compliance Audits | Maaz Bin Musa, Rishab Nithyanand, Padmini Srinivasan, Mihailis E. Diamantis, Steven M. Winston, Garrison Allen, Jacob Schiller, Kevin Moore, Sean Quick, Johnathan Melvin | N/A | N/A |
| MPT: Multimodal Prompt Tuning for Zero-shot Instruction Learning | Taowen Wang, Yiyang Liu, James Chenhao Liang, Junhan Zhao, Yiming Cui, Yuning Mao, Shaoliang Nie, Jiahao Liu, Fuli Feng, Zenglin Xu, Cheng Han, Lifu Huang, Qifan Wang, Dongfang Liu | N/A | N/A |
| Text Grafting: Near-Distribution Weak Supervision for Minority Classes in Text Classification | Letian Peng, Yi Gu, Chengyu Dong, Zihan Wang, Jingbo Shang | N/A | N/A |
| Incubating Text Classifiers Following User Instruction with Nothing but LLM | Letian Peng, Zilong Wang, Jingbo Shang | N/A | N/A |
| PTD-SQL: Partitioning and Targeted Drilling with LLMs in Text-to-SQL | Ruilin Luo, Liyuan Wang, Binghuai Lin, Zicheng Lin, Yujiu Yang | N/A | N/A |
| Conditional and Modal Reasoning in Large Language Models | Wesley H. Holliday, Matthew Mandelkern, Cedegao E. Zhang | N/A | N/A |
| Advancing Large Language Model Attribution through Self-Improving | Lei Huang, Xiaocheng Feng, Weitao Ma, Liang Zhao, Yuchun Fan, Weihong Zhong, Dongliang Xu, Qing Yang, Hongtao Liu, Bing Qin | N/A | N/A |
| AlignCap: Aligning Speech Emotion Captioning to Human Preferences | Ziqi Liang, Haoxiang Shi, Hanhui Chen | N/A | N/A |
| Interpretability-based Tailored Knowledge Editing in Transformers | Yihuai Hong, Aldo Lipani | N/A | N/A |
| PRompt Optimization in Multi-Step Tasks (PROMST): Integrating Human Feedback and Heuristic-based Sampling | Yongchao Chen, Jacob Arkin, Yilun Hao, Yang Zhang, Nicholas Roy, Chuchu Fan | N/A | N/A |
| Empowering Large Language Model for Continual Video Question Answering with Collaborative Prompting | Chen Cai, Zheng Wang, Jianjun Gao, Wenyang Liu, Ye Lu, Runzhong Zhang, Kim-Hui Yap | N/A | N/A |
| Dissecting Fine-Tuning Unlearning in Large Language Models | Yihuai Hong, Yuelin Zou, Lijie Hu, Ziqian Zeng, Di Wang, Haiqin Yang | N/A | N/A |
| Dancing in Chains: Reconciling Instruction Following and Faithfulness in Language Models | Zhengxuan Wu, Yuhao Zhang, Peng Qi, Yumo Xu, Rujun Han, Yian Zhang, Jifan Chen, Bonan Min, Zhiheng Huang | N/A | N/A |
| Where is the signal in tokenization space? | Renato Geh, Honghua Zhang, Kareem Ahmed, Benjie Wang, Guy Van den Broeck | N/A | N/A |
| Private Language Models via Truncated Laplacian Mechanism | Tianhao Huang, Tao Yang, Ivan Habernal, Lijie Hu, Di Wang | N/A | N/A |
| Estimating Knowledge in Large Language Models Without Generating a Single Token | Daniela Gottesman, Mor Geva | N/A | N/A |
| Consistent Autoformalization for Constructing Mathematical Libraries | Lan Zhang, Xin Quan, Andre Freitas | N/A | N/A |
| Contextual and Parametric Knowledge: More Context, More Focus | Yufei Tao, Adam Hiatt, Erik Haake, Antonie J. Jetter, Ameeta Agrawal | N/A | N/A |
| Semantic Training Signals Promote Hierarchical Syntactic Generalization in Transformers | Aditya Yedetore, Najoung Kim | N/A | N/A |
| When Is Multilinguality a Curse? Language Modeling for 250 High- and Low-Resource Languages | Tyler A. Chang, Catherine Arnett, Zhuowen Tu, Ben Bergen | N/A | N/A |
| Teaching Embodied Reinforcement Learning Agents: Informativeness and Diversity of Language Use | Jiajun Xi, Yinong He, Jianing Yang, Yinpei Dai, Joyce Chai | N/A | N/A |
| MiTTenS: A Dataset for Evaluating Gender Mistranslation | Kevin Robinson, Sneha Kudugunta, Romina Stella, Sunipa Dev, Jasmijn Bastings | N/A | N/A |
| Teaching LLMs to Abstain across Languages via Multilingual Feedback | Shangbin Feng, Weijia Shi, Yike Wang, Wenxuan Ding, Orevaoghene Ahia, Shuyue Stella Li, Vidhisha Balachandran, Sunayana Sitaram, Yulia Tsvetkov | N/A | N/A |
| Modular Pluralism: Pluralistic Alignment via Multi-LLM Collaboration | Shangbin Feng, Taylor Sorensen, Yuhan Liu, Jillian Fisher, Chan Young Park, Yejin Choi, Yulia Tsvetkov | N/A | N/A |
| StyleRemix: Interpretable Authorship Obfuscation via Distillation and Perturbation of Style Elements | Jillian Fisher, Skyler Hallinan, Ximing Lu, Mitchell L Gordon, Zaid Harchaoui, Yejin Choi | N/A | N/A |
| I Could’ve Asked That: Reformulating Unanswerable Questions | Wenting Zhao, Ge Gao, Claire Cardie, Alexander M Rush | N/A | N/A |
| STOP! Benchmarking Large Language Models with Sensitivity Testing on Offensive Progressions | Robert Morabito, Sangmitra Madhusudan, Tyler McDonald, Ali Emami | N/A | N/A |
| Hidden Persuaders: How LLM Political Bias Could Sway Our Elections | Yujin Potter, Shiyang Lai, Junsol Kim, James Evans, Dawn Song | N/A | N/A |
| SOUL: Unlocking the Power of Second-Order Optimization for LLM Unlearning | Jinghan Jia, Yihua Zhang, Yimeng Zhang, Jiancheng Liu, Bharat Runwal, James Diffenderfer, Bhavya Kailkhura, Sijia Liu | N/A | N/A |
| When Reasoning Meets Information Aggregation: A Case Study with Sports Narratives | Yebowen Hu, Kaiqiang Song, Sangwoo Cho, Xiaoyang Wang, Wenlin Yao, Hassan Foroosh, Dong Yu, Fei Liu | N/A | N/A |
| An Analysis of Multilingual FActScore | Vu Trong Kim, Michael Krumdick, Varshini Reddy, Franck Dernoncourt, Viet Dac Lai | N/A | N/A |
| Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models | Seungone Kim, Juyoung Suk, Shayne Longpre, Bill Yuchen Lin, Jamin Shin, Sean Welleck, Graham Neubig, Moontae Lee, Kyungjae Lee, Minjoon Seo | N/A | N/A |
| RAG-QA Arena: Evaluating Domain Robustness for Long-form Retrieval Augmented Question Answering | Rujun Han, Yuhao Zhang, Peng Qi, Yumo Xu, Jenyuan Wang, Lan Liu, William Yang Wang, Bonan Min, Vittorio Castelli | N/A | N/A |
| PromptReps: Prompting Large Language Models to Generate Dense and Sparse Representations for Zero-Shot Document Retrieval | Shengyao Zhuang, Xueguang Ma, Bevan Koopman, Jimmy Lin, Guido Zuccon | N/A | N/A |
| Voices Unheard: NLP Resources and Models for Yorùbá Regional Dialects | Orevaoghene Ahia, Anuoluwapo Aremu, Diana Abagyan, Hila Gonen, David Ifeoluwa Adelani, Daud Abolade, Noah A. Smith, Yulia Tsvetkov | N/A | N/A |
| ARES: Alternating Reinforcement Learning and Supervised Fine-Tuning for Enhanced Multi-Modal Chain-of-Thought Reasoning Through Diverse AI Feedback | Ju-Seung Byun, Jiyun Chun, Jihyung Kil, Andrew Perrault | N/A | N/A |
| Order of Magnitude Speedups for LLM Membership Inference | Rongting Zhang, Martin Andres Bertran, Aaron Roth | N/A | N/A |
| VIMI: Grounding Video Generation through Multi-modal Instruction | Yuwei Fang, Willi Menapace, Aliaksandr Siarohin, Tsai-Shien Chen, Kuan-Chieh Wang, Ivan Skorokhodov, Graham Neubig, Sergey Tulyakov | N/A | N/A |
| F$^2$RL: Factuality and Faithfulness Reinforcement Learning Framework for Claim-Guided Evidence-Supported Counterspeech Generation | Haiyang Wang, Yuchen Pan, Xin Song, Xuechen Zhao, Minghao Hu, Bin Zhou | N/A | N/A |
| Deciphering Rumors: A Multi-Task Learning Approach with Intent-aware Hierarchical Contrastive Learning | Chang Yang, Peng Zhang, Hui Gao, Jing Zhang | N/A | N/A |
| Visual Prompting in LLMs for Enhancing Emotion Recognition | Qixuan Zhang, Zhifeng Wang, Dylan Zhang, Yang Liu, Zhenyue Qin, Wenjia Niu, Sabrina Caldwell, Tom Gedeon | N/A | N/A |
| IDEAW: Robust Neural Audio Watermarking with Invertible Dual-Embedding | Pengcheng Li, Xulong Zhang, Jing Xiao, Jianzong Wang | N/A | N/A |
| Leveraging Conflicts in Social Media Posts: Unintended Offense Dataset | Che Wei Tsai, Yen-Hao Huang, Tsu-keng Liao, Didier Fernando Salazar Estrada, Retnani Latifah, Yi-Shin Chen | N/A | N/A |
| Outcome-Constrained Large Language Models for Countering Hate Speech | Lingzi Hong, Pengcheng Luo, Eduardo Blanco, Xiaoying Song | N/A | N/A |
| Multiple Sources are Better Than One: Incorporating External Knowledge in Low-Resource Glossing | Changbing Yang, Garrett Nicolai, Miikka Silfverberg | N/A | N/A |
| Adaptive Immune-based Sound-Shape Code Substitution for Adversarial Chinese Text Attacks | Ao Wang, Xinghao Yang, Chen Li, Bao-di Liu, Weifeng Liu | N/A | N/A |
| Bootstrapped Policy Learning for Task-oriented Dialogue through Goal Shaping | Yangyang Zhao, Ben Niu, Mehdi Dastani, Shihan Wang | N/A | N/A |
| PsyGUARD: An Automated System for Suicide Detection and Risk Assessment in Psychological Counseling | Huachuan Qiu, Lizhi Ma, Zhenzhong Lan | N/A | N/A |
| World to Code: Multi-modal Data Generation via Self-Instructed Compositional Captioning and Filtering | Jiacong Wang, Bohong Wu, Haiyong Jiang, Haoyuan Guo, Xin Xiao, Zhou Xun, Jun Xiao | N/A | N/A |
| DVD: Dynamic Contrastive Decoding for Knowledge Amplification in Multi-Document Question Answering | Jing Jin, Houfeng Wang, Hao Zhang, Xiaoguang Li, Zhijiang Guo | N/A | N/A |
| How Do Humans Write Code? Large Models Do It the Same Way Too | Long Li, Xuzheng He, Haozhe Wang, Linlin Wang, Liang He | N/A | N/A |
| Retrospex: Language Agent Meets Offline Reinforcement Learning Critic | Yufei Xiang, Yiqun Shen, Yeqin Zhang, Nguyen Cam-Tu | N/A | N/A |
| Forgetting Curve: A Reliable Method for Evaluating Memorization Capability for Long-Context Models | Xinyu Liu, Runsong Zhao, Pengcheng Huang, Chunyang Xiao, Bei Li, Jingang Wang, Tong Xiao, JingBo Zhu | N/A | N/A |
| Retrieve-Plan-Generation: An Iterative Planning and Answering Framework for Knowledge-Intensive LLM Generation | Yuanjie Lyu, Zihan Niu, Zheyong Xie, Chao Zhang, Tong Xu, Yang Wang, Enhong Chen | N/A | N/A |
| CoEvol: Constructing Better Responses for Instruction Finetuning through Multi-Agent Cooperation | Renhao Li, Minghuan Tan, Derek F. Wong, Min Yang | N/A | N/A |
| A Peek into Token Bias: Large Language Models Are Not Yet Genuine Reasoners | Bowen Jiang, Yangxinyu Xie, Zhuoqun Hao, Xiaomeng Wang, Tanwi Mallick, Weijie J Su, Camillo Jose Taylor, Dan Roth | N/A | N/A |
| Bayesian Calibration of Win Rate Estimation with LLM Evaluators | Yicheng Gao, Gonghan Xu, Zhe Wang, Arman Cohan | N/A | N/A |
| MuMath-Code: Combining Tool-Use Large Language Models with Multi-perspective Data Augmentation for Mathematical Reasoning | Shuo Yin, Weihao You, Zhilong Ji, Guoqiang Zhong, Jinfeng Bai | N/A | N/A |
| Seeing the Forest through the Trees: Data Leakage from Partial Transformer Gradients | Weijun Li, Qiongkai Xu, Mark Dras | N/A | N/A |
| RWKV-CLIP: A Robust Vision-Language Representation Learner | Tiancheng Gu, Kaicheng Yang, Xiang An, Ziyong Feng, Dongnan Liu, Weidong Cai, Jiankang Deng | N/A | N/A |
| KidLM: Advancing Language Models for Children – Early Insights and Future Directions | Mir Tafseer Nayeem, Davood Rafiei | N/A | N/A |
| Using Language Models to Disambiguate Lexical Choices in Translation | Josh Barua, Sanjay Subramanian, Kayo Yin, Alane Suhr | N/A | N/A |
| How Does the Disclosure of AI Assistance Affect the Perceptions of Writing? | Zhuoyan Li, Chen Liang, Jing Peng, Ming Yin | N/A | N/A |
| An Unsupervised Approach to Achieve Supervised-Level Explainability in Healthcare Records | Joakim Edin, Maria Maistro, Lars Maaløe, Lasse Borgholt, Jakob Drachmann Havtorn, Tuukka Ruotsalo | N/A | N/A |
| Crafting Personalized Agents through Retrieval-Augmented Generation on Editable Memory Graphs | Zheng Wang, Zhongyang Li, Jiang Zeren, Dandan Tu, Wei Shi | N/A | N/A |
| EVEDIT: Event-based Knowledge Editing for Deterministic Knowledge Propagation | Jiateng Liu, Pengfei Yu, Yuji Zhang, Sha Li, Zixuan Zhang, Ruhi Sarikaya, Kevin Small, Heng Ji | N/A | N/A |
| Predicting Nonnative Sentence Processing with L2LMs | Tatsuya Aoyama, Nathan Schneider | N/A | N/A |
| From the Least to the Most: Building a Plug-and-Play Visual Reasoner via Data Synthesis | Chuanqi Cheng, Jian Guan, Wei Wu, Rui Yan | N/A | N/A |
| Quality Matters: Evaluating Synthetic Data for Tool-Using LLMs | Shadi Iskander, Sofia Tolmach, Ori Shapira, Nachshon Cohen, Zohar Karnin | N/A | N/A |
| Cross-Domain Audio Deepfake Detection: Dataset and Analysis | Yuang Li, Min Zhang, Mengxin Ren, Xiaosong Qiao, Miaomiao Ma, Daimeng Wei, Hao Yang | N/A | N/A |
| MaPPER: Multimodal Prior-guided Parameter Efficient Tuning for Referring Expression Comprehension | Ting Liu, Zunnan Xu, Zhiqiang Wang, Yue Hu, Liangtao Shi, Quanjun Yin | N/A | N/A |
| Investigating How Large Language Models Leverage Internal Knowledge to Perform Complex Reasoning | Miyoung Ko, Sue Hyun Park, Joonsuk Park, Minjoon Seo | N/A | N/A |
| Aligning Translation-Specific Understanding to General Understanding in Large Language Models | Yichong Huang, Baohang Li, Xiaocheng Feng, Wenshuai Huo, Chengpeng Fu, Ting Liu, Bing Qin | N/A | N/A |
| FOOL ME IF YOU CAN! An Adversarial Dataset to Investigate the Robustness of LMs in Word Sense Disambiguation | Mohamad Ballout, Anne Dedert, Nohayr Muhammad Abdelmoneim, Ulf Krumnack, Gunther Heidemann, Kai-Uwe Kühnberger | N/A | N/A |
| Concept-skill Transferability-based Data Selection for Large Vision-Language Models | Jaewoo Lee, Boyang Li, Sung Ju Hwang | N/A | N/A |
| LLMs Assist NLP Researchers: Critique Paper (Meta-)Reviewing | Jiangshu Du, Yibo Wang, Wenting Zhao, Zhongfen Deng, Shuaiqi LIU, Renze Lou, Henry Peng Zou, Pranav Narayanan Venkit, Nan Zhang, Mukund Srinath, Haoran Ranran Zhang, Vipul Gupta, Yinghui Li, Tao Li, Fei Wang, Qin Liu, Tianlin Liu, Pengzhi Gao, Congying Xia, Chen Xing, Cheng Jiayang, Zhaowei Wang, Ying Su, Raj Sanjay Shah, Ruohao Guo, Jing Gu, Haoran Li, Kangda Wei, Zihao Wang, Lu Cheng, Surangika Ranathunga, Meng Fang, Jie Fu, Fei Liu, Ruihong Huang, Eduardo Blanco, Yixin Cao, Rui Zhang, Philip S. Yu, Wenpeng Yin | N/A | N/A |
| Academics Can Contribute to Domain-Specialized Language Models | Mark Dredze, Genta Indra Winata, Prabhanjan Kambadur, Shijie Wu, Ozan Irsoy, Steven Lu, Vadim Dabravolski, David S Rosenberg, Sebastian Gehrmann | N/A | N/A |
| Beyond Reference: Evaluating High Quality Translations Better than Human References | Keonwoong Noh, Seokjin Oh, Woohwan Jung | N/A | N/A |
| Unveiling the Lexical Sensitivity of LLMs: Combinatorial Optimization for Prompt Enhancement | Pengwei Zhan, Zhen Xu, Qian Tan, Jie Song, Ru Xie | N/A | N/A |
| SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages | Holy Lovenia, Rahmad Mahendra, Salsabil Maulana Akbar, Lester James Validad Miranda, Jennifer Santoso, Elyanah Aco, Akhdan Fadhilah, Jonibek Mansurov, Joseph Marvin Imperial, Onno P. Kampman, Joel Ruben Antony Moniz, Muhammad Ravi Shulthan Habibi, Frederikus Hudi, Jann Railey Montalan, Ryan Ignatius Hadiwijaya, Joanito Agili Lopo, William Nixon, Börje F. Karlsson, James Jaya, Ryandito Diandaru, Yuze GAO, Patrick Amadeus Irawan, Bin Wang, Jan Christian Blaise Cruz, Chenxi Whitehouse, Ivan Halim Parmonangan, Maria Khelli, Wenyu Zhang, Lucky Susanto, Reynard Adha Ryanda, Sonny Lazuardi Hermawan, Dan John Velasco, Muhammad Dehan Al Kautsar, Willy Fitra Hendria, Yasmin Moslem, Noah Flynn, Muhammad Farid Adilazuarda, Haochen Li, Johanes Lee, R. Damanhuri, Shuo Sun, Muhammad Reza Qorib, Amirbek Djanibekov, Wei Qi Leong, Quyet V. Do, Niklas Muennighoff, Tanrada Pansuwan, Ilham Firdausi Putra, Yan Xu, Tai Ngee Chia, Ayu Purwarianti, Sebastian Ruder, William Chandra Tjhi, Peerat Limkonchotiwat, Alham Fikri Aji, Sedrick Keh, Genta Indra Winata, Ruochen Zhang, Fajri Koto, Zheng Xin Yong, Samuel Cahyawijaya | N/A | N/A |
| Induct-Learn: Short Phrase Prompting with Instruction Induction | Po-Chun Chen, Sheng-Lun Wei, Hen-Hsen Huang, Hsin-Hsi Chen | N/A | N/A |
| Multi-Granularity History and Entity Similarity Learning for Temporal Knowledge Graph Reasoning | Shi Mingcong, Chunjiang Zhu, Detian Zhang, Shiting Wen, Qing Li | N/A | N/A |
| LUQ: Long-text Uncertainty Quantification for LLMs | Caiqi Zhang, Fangyu Liu, Marco Basaldella, Nigel Collier | N/A | N/A |
| Pretraining Data Detection for Large Language Models: A Divergence-based Calibration Method | Weichao Zhang, Ruqing Zhang, Jiafeng Guo, Maarten de Rijke, Yixing Fan, Xueqi Cheng | N/A | N/A |
| Scaling Synthetic Logical Reasoning Datasets with Context-Sensitive Declarative Grammars | Damien Sileo | N/A | N/A |
| Improving Spoken Language Modeling with Phoneme Classification: A Simple Fine-tuning Approach | Maxime Poli, Emmanuel Chemla, Emmanuel Dupoux | N/A | N/A |
| Safely Learning with Private Data: A Federated Learning Framework for Large Language Model | Jia-Ying Zheng, Hainan Zhang, Lingxiang Wang, Wangjie Qiu, Hong-Wei Zheng, Zhi-Ming Zheng | N/A | N/A |
| Formality Favored: Unraveling the Learning Preferences of Large Language Models on Data with Conflicting Knowledge | Jiahuan Li, Yiqing Cao, Shujian Huang, Jiajun Chen | N/A | N/A |
| How Does the Textual Information Affect the Retrieval of Multimodal In-Context Learning? | Yang Luo, Zangwei Zheng, Zirui Zhu, Yang You | N/A | N/A |
| How Far Can We Extract Diverse Perspectives from Large Language Models? | Shirley Anugrah Hayati, Minhwa Lee, Dheeraj Rajagopal, Dongyeop Kang | N/A | N/A |
| EXPLORA: Efficient Exemplar Subset Selection for Complex Reasoning | Kiran Purohit, Venktesh V, Raghuram Devalla, Krishna Mohan Yerragorla, Sourangshu Bhattacharya, Avishek Anand | N/A | N/A |
| An LLM Feature-based Framework for Dialogue Constructiveness Assessment | Lexin Zhou, Youmna Farag, Andreas Vlachos | N/A | N/A |
| Relevance Is a Guiding Light: Relevance-aware Adaptive Learning for End-to-end Task-oriented Dialogue System | Zhanpeng Chen, Zhihong Zhu, Wanshi Xu, Xianwei Zhuang, Yuexian Zou | N/A | N/A |
| Dialog2Flow: Pre-training Action-Driven Sentence Embeddings for Automatic Dialog Flow Extraction | Sergio Burdisso, Srikanth Madikeri, Petr Motlicek | N/A | N/A |
| Words Worth a Thousand Pictures: Measuring and Understanding Perceptual Variability in Text-to-Image Generation | Raphael Tang, Crystina Zhang, Lixinyu Xu, Yao Lu, Wenyan Li, Pontus Stenetorp, Jimmy Lin, Ferhan Ture | N/A | N/A |
| Investigating LLMs as Voting Assistants via Contextual Augmentation: A Case Study on the European Parliament Elections 2024 | Ilias Chalkidis | N/A | N/A |
| Adaption-of-Thought: Learning Question Difficulty Improves Large Language Models for Reasoning | Mayi Xu, Yongqi Li, Ke Sun, Tieyun Qian | N/A | N/A |
| LogicST: A Logical Self-Training Framework for Document-Level Relation Extraction with Incomplete Annotations | Shengda Fan, Yanting Wang, Shasha Mo, Jianwei Niu | N/A | N/A |
| Concept Space Alignment in Multilingual LLMs | Qiwei Peng, Anders Søgaard | N/A | N/A |
| Predicting Rewards Alongside Tokens: Non-disruptive Parameter Insertion for Efficient Inference Intervention in Large Language Model | Chenhan Yuan, Fei Huang, Ru Peng, Keming Lu, Bowen Yu, Chang Zhou, Jingren Zhou | N/A | N/A |
| NLEBench+NorGLM: A Comprehensive Empirical Analysis and Benchmark Dataset for Generative Language Models in Norwegian | Peng Liu, Lemei Zhang, Terje Farup, Even W. Lauvrak, Jon Espen Ingvaldsen, Simen Eide, Jon Atle Gulla, Zhirong Yang | N/A | N/A |
| RSA-Control: A Pragmatics-Grounded Lightweight Controllable Text Generation Framework | Yifan Wang, Vera Demberg | N/A | N/A |
| Scaling Laws Across Model Architectures: A Comparative Analysis of Dense and MoE Models in Large Language Models | Siqi Wang, Zhengyu Chen, Bei Li, Keqing He, Min Zhang, Jingang Wang | N/A | N/A |
| Synergizing In-context Learning with Hints for End-to-end Task-oriented Dialog Systems | Vishal Vivek Saley, Rocktim Jyoti Das, Dinesh Raghu, Mausam | N/A | N/A |
| REAR: A Relevance-Aware Retrieval-Augmented Framework for Open-Domain Question Answering | Yuhao Wang, Ruiyang Ren, Junyi Li, Xin Zhao, Jing Liu, Ji-Rong Wen | N/A | N/A |
| Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA | Minzheng Wang, Longze Chen, Cheng Fu, Liaoshengyi, Xinghua Zhang, Bingliwu, Haiyang Yu, Nan Xu, Lei Zhang, Run Luo, Yunshui Li, Min Yang, Fei Huang, Yongbin Li | N/A | N/A |
| On Mitigating Performance Disparities in Multilingual Speech Recognition | Monorama Swain, Anna Katrine van Zee, Anders Søgaard | N/A | N/A |
| Thinking Outside of the Differential Privacy Box: A Case Study in Text Privatization with Language Model Prompting | Stephen Meisenbacher, Florian Matthes | N/A | N/A |
| From Coarse to Fine: Impacts of Feature-Preserving and Feature-Compressing Connectors on Perception in Multimodal Models | Junyan Lin, Haoran Chen, Dawei Zhu, Xiaoyu Shen | N/A | N/A |
| What is “Typological Diversity” in NLP? | Esther Ploeger, Wessel Poelman, Miryam de Lhoneux, Johannes Bjerva | N/A | N/A |
| The Computational Anatomy of Humility: Modeling Intellectual Humility in Online Public Discourse | Xiaobo Guo, Neil Potnis, Melody Yu, Nabeel Gillani, Soroush Vosoughi | N/A | N/A |
| Consistent Bidirectional Language Modelling: Expressive Power and Representational Conciseness | Georgi Shopov, Stefan Gerdjikov | N/A | N/A |
| Benchmarking Vision Language Models for Cultural Understanding | Shravan Nayak, Kanishk Jain, Rabiul Awal, Siva Reddy, Sjoerd van Steenkiste, Lisa Anne Hendricks, Karolina Stanczak, Aishwarya Agrawal | N/A | N/A |
| Methods of Automatic Matrix Language Determination for Code-Switched Speech | Olga Iakovenko, Thomas Hain | N/A | N/A |
| Analyzing Key Factors Influencing Emotion Prediction Performance of VLLMs in Conversational Contexts | Jaewook Lee, Yeajin Jang, Hongjin KIM, Woojin Lee, Harksoo Kim | N/A | N/A |
| Context-Aware Assistant Selection for Improved Inference Acceleration with Large Language Models | Jerry Huang, Prasanna Parthasarathi, Mehdi Rezagholizadeh, Sarath Chandar | N/A | N/A |
| Teaching Small Language Models Reasoning through Counterfactual Distillation | FengTao, Yicheng Li, Li Chenglin, Hao Chen, Fei Yu, Yin Zhang | N/A | N/A |
| Do Not Worry if You Do Not Have Data: Building Pretrained Language Models Using Translationese | Meet Doshi, Raj Dabre, Pushpak Bhattacharyya | N/A | N/A |
| Quantifying the Gap Between Machine Translation and Native Language in Training for Multimodal, Multilingual Retrieval | Kyle Buettner, Adriana Kovashka | N/A | N/A |
| MTA4DPR: Multi-Teaching-Assistants Based Iterative Knowledge Distillation for Dense Passage Retrieval | Qixi Lu, Gongbo Tang | N/A | N/A |
| Fine-Grained Detection of Solidarity for Women and Migrants in 155 Years of German Parliamentary Debates | Aida Kostikova, Dominik Beese, Benjamin Paassen, Ole Pütz, Gregor Wiedemann, Steffen Eger | N/A | N/A |
| CItruS: Chunked Instruction-aware State Eviction for Long Sequence Modeling | Yu Bai, Xiyuan Zou, Heyan Huang, Sanxing Chen, Marc-Antoine Rondeau, Yang Gao, Jackie CK Cheung | N/A | N/A |
| Story Embeddings — Narrative-Focused Representations of Fictional Stories | Hans Ole Hatzel, Chris Biemann | N/A | N/A |
| C-LLM: Learn to Check Chinese Spelling Errors Character by Character | Kunting Li, Yong Hu, Liang He, Fandong Meng, Jie Zhou | N/A | N/A |
| PSC: Extending Context Window of Large Language Models via Phase Shift Calibration | Wenqiao Zhu, Chao Xu, Lulu Wang, Jun Wu | N/A | N/A |
| Video-LLaVA: Learning United Visual Representation by Alignment Before Projection | Bin Lin, Yang Ye, Bin Zhu, Jiaxi Cui, Munan Ning, Peng Jin, Li Yuan | N/A | N/A |
| SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales | Tianyang Xu, Shujin Wu, Shizhe Diao, Xiaoze Liu, Xingyao Wang, Yangyi Chen, Jing Gao | N/A | N/A |
| Mitigating Frequency Bias and Anisotropy in Language Model Pre-Training with Syntactic Smoothing | Richard Diehl Martinez, Zebulon Goriely, Andrew Caines, Paula Buttery, Lisa Beinborn | N/A | N/A |
| ToxiCloakCN: Evaluating Robustness of Offensive Language Detection in Chinese with Cloaking Perturbations | Yunze Xiao, Yujia Hu, Kenny Tsu Wei Choo, Roy Ka-Wei Lee | N/A | N/A |
| Boosting Scientific Concepts Understanding: Can Analogies from Teacher Models Empower Student Models? | Siyu Yuan, Cheng Jiayang, Lin Qiu, Deqing Yang | N/A | N/A |
| Model Internals-based Answer Attribution for Trustworthy Retrieval-Augmented Generation | Jirui Qi, Gabriele Sarti, Raquel Fernández, Arianna Bisazza | N/A | N/A |
| Do Large Language Models Know How Much They Know? | Gabriele Prato, Jerry Huang, Prasanna Parthasarathi, Shagun Sodhani, Sarath Chandar | N/A | N/A |
| Investigating Mysteries of CoT-Augmented Distillation | Somin Wadhwa, Silvio Amir, Byron C Wallace | N/A | N/A |
| SciPrompt: Knowledge-Augmented Prompting for Fine-Grained Categorization of Scientific Topics | Zhiwen You, Kanyao Han, Haotian Zhu, Bertram Ludaescher, Jana Diesner | N/A | N/A |
| Distilling Knowledge from Text-to-Image Generative Models Improves Visio-Linguistic Reasoning in CLIP | Samyadeep Basu, Shell Xu Hu, Maziar Sanjabi, Daniela Massiceti, Soheil Feizi | N/A | N/A |
| Learning from Natural Language Explanations for Generalizable Entity Matching | Somin Wadhwa, ADIT KRISHNAN, Runhui Wang, Byron C Wallace, Luyang Kong | N/A | N/A |
| Do You Know What You Are Talking About? Characterizing Query-Knowledge Relevance For Reliable Retrieval Augmented Generation | Zhuohang Li, Jiaxin Zhang, Chao Yan, Kamalika Das, Sricharan Kumar, Murat Kantarcioglu, Bradley A. Malin | N/A | N/A |
| On the Reliability of Psychological Scales on Large Language Models | Jen-tse Huang, Wenxuan Wang, Man Ho LAM, Eric John Li, Wenxiang Jiao, Michael Lyu | N/A | N/A |
| Contrastive Entity Coreference and Disambiguation for Historical Texts | Abhishek Arora, Emily Silcock, Melissa Dell, Leander Heldring | N/A | N/A |
| Finer: Investigating and Enhancing Fine-Grained Visual Concept Recognition in Large Vision Language Models | Jeonghwan Kim, Heng Ji | N/A | N/A |
| Evaluating LLMs for Targeted Concept Simplification for Domain-Specific Texts | Sumit Asthana, Hannah Rashkin, Elizabeth Clark, Fantine Huot, Mirella Lapata | N/A | N/A |
| VLFeedback: A Large-Scale AI Feedback Dataset for Large Vision-Language Models Alignment | Lei Li, Zhihui Xie, Mukai Li, Shunian Chen, Peiyi Wang, Liang Chen, Yazheng Yang, Benyou Wang, Lingpeng Kong, Qi Liu | N/A | N/A |
| Focused Large Language Models are Stable Many-Shot Learners | Peiwen Yuan, Shaoxiong Feng, Yiwei Li, Xinglin Wang, Yueqi Zhang, Chuyi Tan, Boyuan Pan, Heda Wang, Yao Hu, Kan Li | N/A | N/A |
| Reconsidering Sentence-Level Sign Language Translation | Garrett Tanzer, Maximus Shengelia, Ken Harrenstien, David Uthus | N/A | N/A |
| GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities | Sreyan Ghosh, Sonal Kumar, Ashish Seth, Chandra Kiran Reddy Evuru, Utkarsh Tyagi, S Sakshi, Oriol Nieto, Ramani Duraiswami, Dinesh Manocha | N/A | N/A |
| Verba volant, scripta volant? Don’t worry! There are computational solutions for protoword reconstruction | Liviu P Dinu, Ana Sabina Uban, Alina Maria Cristea, Ioan-Bogdan Iordache, Teodor-George Marchitan, Simona Georgescu, Laurentiu Zoicas | N/A | N/A |
| ChatGPT Doesn’t Trust LA Chargers Fans: Guardrail Sensitivity in Context | Victoria R Li, Yida Chen, Naomi Saphra | N/A | N/A |
| Personas as a Way to Model Truthfulness in Language Models | Nitish Joshi, Javier Rando, Abulhair Saparov, Najoung Kim, He He | N/A | N/A |
| Satyrn: A Platform for Analytics Augmented Generation | Marko Sterbentz, Cameron Barrie, Shubham Shahi, Abhratanu Dutta, Donna Hooshmand, Harper Pack, Kristian J Hammond | N/A | N/A |
| EH-MAM: Easy-to-Hard Masked Acoustic Modeling for Self-Supervised Speech Representation Learning | Ashish Seth, Ramaneswaran S, S Sakshi, Sonal Kumar, Sreyan Ghosh, Dinesh Manocha | N/A | N/A |
| EPO: Hierarchical LLM Agents with Environment Preference Optimization | Qi Zhao, Haotian Fu, Chen Sun, George Konidaris | N/A | N/A |
| Detection and Measurement of Syntactic Templates in Generated Text | Chantal Shaib, Yanai Elazar, Junyi Jessy Li, Byron C Wallace | N/A | N/A |
| UOUO: Uncontextualized Uncommon Objects for Measuring Knowledge Horizons of Vision Language Models | Xinyu Pi, Mingyuan Wu, Jize Jiang, Haozhen Zheng, Beitong Tian, ChengXiang Zhai, Klara Nahrstedt, Zhiting Hu | N/A | N/A |
| Optimized Speculative Sampling for GPU Hardware Accelerators | Dominik Wagner, Seanie Lee, Ilja Baumann, Philipp Seeberger, Korbinian Riedhammer, Tobias Bocklet | N/A | N/A |
| Personalized Pieces: Efficient Personalized Large Language Models through Collaborative Efforts | Zhaoxuan Tan, Zheyuan Liu, Meng Jiang | N/A | N/A |
| Democratizing Large Language Models via Personalized Parameter-Efficient Fine-tuning | Zhaoxuan Tan, Qingkai Zeng, Yijun Tian, Zheyuan Liu, Bing Yin, Meng Jiang | N/A | N/A |
| Unifying Multimodal Retrieval via Document Screenshot Embedding | Xueguang Ma, Sheng-Chieh Lin, Minghan Li, Wenhu Chen, Jimmy Lin | N/A | N/A |
| Neuron Specialization: Leveraging Intrinsic Task Modularity for Multilingual Machine Translation | Shaomu Tan, Di Wu, Christof Monz | N/A | N/A |
| An Audit on the Perspectives and Challenges of Hallucinations in NLP | Pranav Narayanan Venkit, Tatiana Chakravorti, Vipul Gupta, Heidi Biggs, Mukund Srinath, Koustava Goswami, Sarah Rajtmajer, Shomir Wilson | N/A | N/A |
| Discovering Knowledge-Critical Subnetworks in Pretrained Language Models | Deniz Bayazit, Negar Foroutan, Zeming Chen, Gail Weiss, Antoine Bosselut | N/A | N/A |
| Reconstruct Your Previous Conversations! Comprehensively Investigating Privacy Leakage Risks in Conversations with GPT Models | Junjie Chu, Zeyang Sha, Michael Backes, Yang Zhang | N/A | N/A |
| Right for Right Reasons: Large Language Models for Verifiable Commonsense Knowledge Graph Question Answering | Armin Toroghi, Willis Guo, Mohammad Mahdi Abdollah Pour, Scott Sanner | N/A | N/A |
| Verifiable, Debuggable, and Repairable Commonsense Logical Reasoning via LLM-based Theory Resolution | Armin Toroghi, Willis Guo, Ali Pesaranghader, Scott Sanner | N/A | N/A |
| Understanding and Mitigating Language Confusion in LLMs | Kelly Marchisio, Wei-Yin Ko, Alexandre Berard, Théo Dehaze, Sebastian Ruder | N/A | N/A |
| Can Large Language Models Learn Independent Causal Mechanisms? | Gael Gendron, Bao Trung Nguyen, Alex Yuxuan Peng, Michael Witbrock, Gillian Dobbie | N/A | N/A |
| MirrorStories: Reflecting Diversity through Personalized Narrative Generation with Large Language Models | Sarfaroz Yunusov, Hamza Sidat, Ali Emami | N/A | N/A |
| InterIntent: Investigating Social Intelligence of LLMs via Intention Understanding in an Interactive Game Context | Ziyi Liu, Abhishek Anand, Pei Zhou, Jen-tse Huang, Jieyu Zhao | N/A | N/A |
| Locating Information Gaps and Narrative Inconsistencies Across Languages: A Case Study of LGBT People Portrayals on Wikipedia | Farhan Samir, Chan Young Park, Vered Shwartz, Anjalie Field, Yulia Tsvetkov | N/A | N/A |
| From Local Concepts to Universals: Evaluating the Multicultural Understanding of Vision-Language Models | Mehar Bhatia, Sahithya Ravi, Aditya Chinchure, EunJeong Hwang, Vered Shwartz | N/A | N/A |
| Dynamic Multi-Reward Weighting for Multi-Style Controllable Generation | Karin De Langis, Ryan Koo, Dongyeop Kang | N/A | N/A |
| MMNeuron: Discovering Neuron-Level Domain-Specific Interpretation in Multimodal Large Language Model | Jiahao Huo, Yibo Yan, Boren Hu, Yutao Yue, Xuming Hu | N/A | N/A |
| Learning to Extract Structured Entities Using Language Models | Haolun Wu, Ye Yuan, Liana Mikaelyan, Alexander Meulemans, Xue Liu, James Hensman, Bhaskar Mitra | N/A | N/A |
| Efficient LLM Comparative Assessment: A Product of Experts Framework for Pairwise Comparisons | Adian Liusie, Vatsal Raina, Yassir Fathullah, Mark Gales | N/A | N/A |
| A Survey of AMR Applications | Shira Wein, Juri Opitz | N/A | N/A |
| Beyond Embeddings: The Promise of Visual Table in Visual Reasoning | Yiwu Zhong, Zi-Yuan Hu, Michael Lyu, Liwei Wang | N/A | N/A |
| CareCorpus+: Expanding and Augmenting Caregiver Strategy Data to Support Pediatric Rehabilitation | Shahla Farzana, Ivana Lucero, Vivian Villegas, Vera C Kaelin, Mary Khetani, Natalie Parde | N/A | N/A |
| Secured Weight Release for Large Language Models via Taylor Expansion | Guanchu Wang, Yu-Neng Chuang, Ruixiang Tang, Shaochen Zhong, Jiayi Yuan, Hongye Jin, Zirui Liu, Vipin Chaudhary, Shuai Xu, James Caverlee, Xia Hu | N/A | N/A |
| TimeR$^4$: Time-aware Retrieval-Augmented Large Language Models for Temporal Knowledge Graph Question Answering | Xinying Qian, Ying Zhang, Yu Zhao, Baohang Zhou, Xuhui Sui, Li Zhang, Kehui Song | N/A | N/A |
| Knowledge-Centric Hallucination Detection | Xiangkun Hu, Dongyu Ru, Lin Qiu, Qipeng Guo, Tianhang Zhang, Yang Xu, Yun Luo, Pengfei Liu, Yue Zhang, Zheng Zhang | N/A | N/A |
| Revealing the Parallel Multilingual Learning within Large Language Models | Yongyu Mu, Peinan Feng, Zhiquan Cao, Yuzhang Wu, Bei Li, Chenglong Wang, Tong Xiao, Kai Song, Tongran Liu, Chunliang Zhang, JingBo Zhu | N/A | N/A |
| Automatic Instruction Evolving for Large Language Models | Weihao Zeng, Can Xu, Yingxiu Zhao, Jian-Guang Lou, Weizhu Chen | N/A | N/A |
| RepEval: Effective Text Evaluation with LLM Representation | Shuqian Sheng, Yi Xu, Tianhang Zhang, Zanwei Shen, Luoyi Fu, Jiaxin Ding, Lei Zhou, Xiaoying Gan, Xinbing Wang, Chenghu Zhou | N/A | N/A |
| Generative Models for Automatic Medical Decision Rule Extraction from Text | Yuxin He, Buzhou Tang, Xiaoling Wang | N/A | N/A |
| Encoding and Controlling Global Semantics for Long-form Video Question Answering | Thong Thanh Nguyen, Zhiyuan Hu, Xiaobao Wu, Cong-Duy T Nguyen, See-Kiong Ng, Anh Tuan Luu | N/A | N/A |
| Towards Understanding Jailbreak Attacks in LLMs: A Representation Space Analysis | Yuping Lin, Pengfei He, Han Xu, Yue Xing, Makoto Yamada, Hui Liu, Jiliang Tang | N/A | N/A |
| Enhancing Legal Case Retrieval via Scaling High-quality Synthetic Query-Candidate Pairs | Cheng Gao, Chaojun Xiao, Zhenghao Liu, Huimin Chen, Zhiyuan Liu, Maosong Sun | N/A | N/A |
| Does Large Language Model Contain Task-Specific Neurons? | Ran Song, Shizhu He, Shuting Jiang, Yantuan Xian, Shengxiang Gao, Kang Liu, Zhengtao Yu | N/A | N/A |
| Liar, Liar, Logical Mire: A Benchmark for Suppositional Reasoning in Large Language Models | Philipp Mondorf, Barbara Plank | N/A | N/A |
| Advancing Test-Time Adaptation in Wild Acoustic Test Settings | Hongfu Liu, Hengguan Huang, Ye Wang | N/A | N/A |
| Learning to Retrieve Iteratively for In-Context Learning | Yunmo Chen, Tongfei Chen, Harsh Jhamtani, Patrick Xia, Richard Shin, Jason Eisner, Benjamin Van Durme | N/A | N/A |
| Taxonomy-guided Semantic Indexing for Academic Paper Search | SeongKu Kang, Yunyi Zhang, Pengcheng Jiang, Dongha Lee, Jiawei Han, Hwanjo Yu | N/A | N/A |
| Python is Not Always the Best Choice: Embracing Multilingual Program of Thoughts | Xianzhen Luo, Qingfu Zhu, Zhiming Zhang, Libo Qin, Xuanyu Zhang, Qing Yang, Dongliang Xu, Wanxiang Che | N/A | N/A |
| Advancing Adversarial Suffix Transfer Learning on Aligned Large Language Models | Hongfu Liu, Yuxi Xie, Ye Wang, Michael Shieh | N/A | N/A |
| Incomplete Utterance Rewriting with Editing Operation Guidance and Utterance Augmentation | Zhiyu Cao, PEIFENG LI, Yaxin FAN, Qiaoming Zhu | N/A | N/A |
| FRoG: Evaluating Fuzzy Reasoning of Generalized Quantifiers in LLMs | Yiyuan Li, Shichao Sun, Pengfei Liu | N/A | N/A |
| Aligning Large Language Models with Diverse Political Viewpoints | Dominik Stammbach, Philine Widmer, Eunjung Cho, Caglar Gulcehre, Elliott Ash | N/A | N/A |
| “You Gotta be a Doctor, Lin”: An Investigation of Name-Based Bias of Large Language Models in Employment Recommendations | Huy Nghiem, John Prindle, Jieyu Zhao, Hal Daumé III | N/A | N/A |
| Extending Context Window of Large Language Models from a Distributional Perspective | Yingsheng Wu, Yuxuan Gu, Xiaocheng Feng, Weihong Zhong, Dongliang Xu, Qing Yang, Hongtao Liu, Bing Qin | N/A | N/A |
| Leveraging pre-trained language models for linguistic analysis: A case of argument structure constructions | Hakyung Sung, Kristopher Kyle | N/A | N/A |
| MAgIC: Investigation of Large Language Model Powered Multi-Agent in Cognition, Adaptability, Rationality and Collaboration | Lin Xu, Zhiyuan Hu, Daquan Zhou, Hongyu Ren, Zhen Dong, Kurt Keutzer, See-Kiong Ng, Jiashi Feng | N/A | N/A |
| Position Engineering: Boosting Large Language Models through Positional Information Manipulation | Zhiyuan He, Huiqiang Jiang, Zilong Wang, Yuqing Yang, Luna K. Qiu, Lili Qiu | N/A | N/A |
| Towards Injecting Medical Visual Knowledge into Multimodal LLMs at Scale | Junying Chen, Chi Gui, OuyangRuyi, Anningzhe Gao, Shunian Chen, Guiming Hardy Chen, Xidong Wang, Zhenyang Cai, Ke Ji, Xiang Wan, Benyou Wang | N/A | N/A |
| ADELIE: Aligning Large Language Models on Information Extraction | Yunjia Qi, Hao Peng, Xiaozhi Wang, Bin Xu, Lei Hou, Juanzi Li | N/A | N/A |
| Unveiling Factual Recall Behaviors of Large Language Models through Knowledge Neurons | Yifei Wang, Yuheng Chen, Wanting Wen, Yu Sheng, Linjing Li, Daniel Dajun Zeng | N/A | N/A |
| Lexically Grounded Subword Segmentation | Jindřich Libovický, Jindřich Helcl | N/A | N/A |
| EAGLE-2: Faster Inference of Language Models with Dynamic Draft Trees | Yuhui Li, Fangyun Wei, Chao Zhang, Hongyang Zhang | N/A | N/A |
| Do Text-to-Vis Benchmarks Test Real Use of Visualizations? | Hy Nguyen, Xuefei He, Andrew Reeson, Cecile Paris, Josiah Poon, Jonathan K. Kummerfeld | N/A | N/A |
| Gold Panning in Vocabulary: An Adaptive Method for Vocabulary Expansion of Domain-Specific LLMs | Chengyuan Liu, Shihang Wang, Lizhi Qing, Kun Kuang, Yangyang Kang, Changlong Sun, Fei Wu | N/A | N/A |
| Strategic Demonstration Selection for Improved Fairness in LLM In-Context Learning | Jingyu Hu, Weiru Liu, Mengnan Du | N/A | N/A |
| Multi-Dialect Vietnamese: Task, Dataset, Baseline Models and Challenges | Nguyen Van Dinh, Thanh Chi Dang, Luan Thanh Nguyen, Kiet Van Nguyen | N/A | N/A |
| Is LLM-as-a-Judge Robust? Investigating Universal Adversarial Attacks on Zero-shot LLM Assessment | Vyas Raina, Adian Liusie, Mark Gales | N/A | N/A |
| Rethinking the Reversal Curse of LLMs: a Prescription from Human Knowledge Reversal | Zhicong Lu, Li Jin, Peiguang Li, Yu Tian, Linhao Zhang, Sirui Wang, Guangluan Xu, Changyuan Tian, Xunliang Cai | N/A | N/A |
| More Than Catastrophic Forgetting: Integrating General Capabilities For Domain-Specific LLMs | Chengyuan Liu, Shihang Wang, Yangyang Kang, Lizhi Qing, Fubang Zhao, Chao Wu, Changlong Sun, Kun Kuang, Fei Wu | N/A | N/A |
| Muting Whisper: A Universal Acoustic Adversarial Attack on Speech Foundation Models | Vyas Raina, Rao Ma, Charles McGhee, Kate Knill, Mark Gales | N/A | N/A |
| GENRA: Enhancing Zero-shot Retrieval with Rank Aggregation | Georgios Katsimpras, Georgios Paliouras | N/A | N/A |
| XplainLLM: A Knowledge-Augmented Dataset for Reliable Grounded Explanations in LLMs | Zichen Chen, Jianda Chen, Ambuj Singh, Misha Sra | N/A | N/A |
| Divide and Conquer Radiology Report Generation via Observation Level Fine-grained Pretraining and Prompt Tuning | Yuanpin Zhou, Huogen Wang | N/A | N/A |
| SURf: Teaching Large Vision-Language Models to Selectively Utilize Retrieved Information | Jiashuo Sun, Jihai Zhang, Yucheng Zhou, Zhaochen Su, Xiaoye Qu, Yu Cheng | N/A | N/A |
| UNO Arena for Evaluating Sequential Decision-Making Capability of Large Language Models | Zhanyue Qin, Haochuan Wang, Deyuan Liu, Ziyang Song, Cunhang Fan, Zhao Lv, Jinlin Wu, Zhen Lei, Zhiying Tu, Dianhui Chu, Xiaoyan Yu, Dianbo Sui | N/A | N/A |
| Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments | Yu Gu, Yiheng Shu, Hao Yu, Xiao Liu, Yuxiao Dong, Jie Tang, Jayanth Srinivasa, Hugo Latapie, Yu Su | N/A | N/A |
| MORPHEUS: Modeling Role from Personalized Dialogue History by Exploring and Utilizing Latent Space | Yihong Tang, Bo Wang, Dongming Zhao, Jinxiaojia, Zhangjijun, Ruifang He, Yuexian Hou | N/A | N/A |
| KnowledgeSG: Privacy-Preserving Synthetic Text Generation With Knowledge Distillation From Server | WenHao Wang, Xiaoyu Liang, Rui Ye, Jingyi Chai, Siheng Chen, Yanfeng Wang | N/A | N/A |
| DAMRO: Dive into the Attention Mechanism of LVLM to Reduce Object Hallucination | Xuan Gong, Tianshi Ming, Xinpeng Wang, Zhihua Wei | N/A | N/A |
| Unlocking the Future: Exploring Look-Ahead Planning Mechanistic Interpretability in Large Language Models | Tianyi Men, Pengfei Cao, Zhuoran Jin, Yubo Chen, Kang Liu, Jun Zhao | N/A | N/A |
| Breaking Language Barriers: Cross-Lingual Continual Pre-Training at Scale | Wenzhen Zheng, Wenbo Pan, Xu Xu, Libo Qin, Li Yue, Ming Zhou | N/A | N/A |
| An Empirical Study of Multilingual Reasoning Distillation for Question Answering | Patomporn Payoungkhamdee, Peerat Limkonchotiwat, Jinheon Baek, Potsawee Manakul, Can Udomcharoenchaikit, Ekapol Chuangsuwanich, Sarana Nutanong | N/A | N/A |
| Can Large Language Models Faithfully Express Their Intrinsic Uncertainty in Words? | Gal Yona, Roee Aharoni, Mor Geva | N/A | N/A |
| Does Fine-Tuning LLMs on New Knowledge Encourage Hallucinations? | Zorik Gekhman, Gal Yona, Roee Aharoni, Matan Eyal, Amir Feder, Roi Reichart, Jonathan Herzig | N/A | N/A |
| Bridging Modalities: Enhancing Cross-Modality Hate Speech Detection with Few-Shot In-Context Learning | Ming Shan Hee, Aditi Kumaresan, Roy Ka-Wei Lee | N/A | N/A |
| MIND: Multimodal Shopping Intention Distillation from Large Vision-language Models for E-commerce Purchase Understanding | Baixuan Xu, Weiqi Wang, Haochen Shi, Wenxuan Ding, Huihao JING, Tianqing Fang, Jiaxin Bai, Xin Liu, Changlong Yu, Zheng Li, Chen Luo, Qingyu Yin, Bing Yin, Long Chen, Yangqiu Song | N/A | N/A |
| ECON: On the Detection and Resolution of Evidence Conflicts | Cheng Jiayang, Qianqian Zhuang, Chunkit Chan, Lin Qiu, Tianhang Zhang, Tengxiao Liu, Yangqiu Song, Yue Zhang, Pengfei Liu, Zheng Zhang | N/A | N/A |
| “Image, Tell me your story!” Predicting the original meta-context of visual misinformation | Jonathan Tonglet, Marie-Francine Moens, Iryna Gurevych | N/A | N/A |
| Improving Retrieval-augmented Text-to-SQL with AST-based Ranking and Schema Pruning | Zhili Shen, Pavlos Vougiouklis, Chenxin Diao, Kaustubh Vyas, Yuanyi Ji, Jeff Z. Pan | N/A | N/A |
| Mixture-of-Subspaces in Low-Rank Adaptation | Taiqiang Wu, Jiahao Wang, Zhe Zhao, Ngai Wong | N/A | N/A |
| A Large-Scale Investigation of Human-LLM Evaluator Agreement on Multilingual and Multi-Cultural Data | Ishaan Watts, Varun Gumma, Aditya Yadavalli, Vivek Seshadri, Manohar Swaminathan, Sunayana Sitaram | N/A | N/A |
| LawBench: Benchmarking Legal Knowledge of Large Language Models | Zhiwei Fei, Xiaoyu Shen, Dawei Zhu, Fengzhe Zhou, Zhuo Han, Alan Huang, Songyang Zhang, Kai Chen, Zhixin Yin, Zongwen Shen, Jidong Ge, Vincent Ng | N/A | N/A |
| Efficient Performance Tracking: Leveraging Large Language Models for Automated Construction of Scientific Leaderboards | Furkan Şahinuç, Thy Thy Tran, Yulia Grishina, Yufang Hou, Bei Chen, Iryna Gurevych | N/A | N/A |
| Efficient Vision-Language pre-training via domain-specific learning for human activities | Adrian Bulat, Yassine Ouali, Ricardo Guerrero, Brais Martinez, Georgios Tzimiropoulos | N/A | N/A |
| Empowering Backbone Models for Visual Text Generation with Input Granularity Control and Glyph-Aware Training | Wenbo Li, Guohao Li, Zhibin Lan, Xue Xu, Wanru Zhuang, Jiachen Liu, Xinyan Xiao, Jinsong Su | N/A | N/A |
| Evaluating Character Understanding of Large Language Models via Character Profiling from Fictional Works | Xinfeng Yuan, Siyu Yuan, Yuhan Cui, Tianhe Lin, Xintao Wang, Rui Xu, Jiangjie Chen, Deqing Yang | N/A | N/A |
| Getting More from Less: Large Language Models are Good Spontaneous Multilingual Learners | Shimao Zhang, Changjiang Gao, Wenhao Zhu, Jiajun Chen, Xin Huang, Xue Han, Junlan Feng, Chao Deng, Shujian Huang | N/A | N/A |
| AdaSwitch: Adaptive Switching between Small and Large Agents for Effective Cloud-Local Collaborative Learning | Hao Sun, Jiayi Wu, Hengyi Cai, Xiaochi Wei, Yue Feng, Bo Wang, Shuaiqiang Wang, Yan Zhang, Dawei Yin | N/A | N/A |
| CoBa: Convergence Balancer for Multitask Finetuning of Large Language Models | Zi Gong, Hang Yu, Cong Liao, Bingchang Liu, Chaoyu Chen, Jianguo Li | N/A | N/A |
| mDPO: Conditional Preference Optimization for Multimodal Large Language Models | Fei Wang, Wenxuan Zhou, James Y. Huang, Nan Xu, Sheng Zhang, Hoifung Poon, Muhao Chen | N/A | N/A |
| Data Advisor: Data Curation with Foresight for Safety Alignment of Large Language Models | Fei Wang, Ninareh Mehrabi, Palash Goyal, Rahul Gupta, Kai-Wei Chang, Aram Galstyan | N/A | N/A |
| Language-to-Code Translation with a Single Labeled Example | Kaj Bostrom, Harsh Jhamtani, Hao Fang, Sam Thomson, Richard Shin, Patrick Xia, Benjamin Van Durme, Jason Eisner, Jacob Andreas | N/A | N/A |
| Attribute or Abstain: Large Language Models as Long Document Assistants | Jan Buchmann, Xiao Liu, Iryna Gurevych | N/A | N/A |
| FEDKIM: Adaptive Federated Knowledge Injection into Medical Foundation Models | Xiaochen Wang, Jiaqi Wang, Houping Xiao, Jinghui Chen, Fenglong Ma | N/A | N/A |
| Retrieved In-Context Principles from Previous Mistakes | Hao Sun, Yong Jiang, Bo Wang, Yingyan Hou, Yan Zhang, Pengjun Xie, Fei Huang | N/A | N/A |
| EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control | Haozhe Chen, Run Chen, Julia Hirschberg | N/A | N/A |
| VPTQ: Extreme Low-bit Vector Post-Training Quantization for Large Language Models | Yifei Liu, Jicheng Wen, Yang Wang, Shengyu Ye, Li Lyna Zhang, Ting Cao, Cheng Li, Mao Yang | N/A | N/A |
| Deterministic Weighted L* Algorithm | Clemente Pasti, Talu Karagöz, Franz Nowak, Anej Svete, Ryan Cotterell | N/A | N/A |
| Towards Verifiable Text Generation with Evolving Memory and Self-Reflection | Hao Sun, Hengyi Cai, Bo Wang, Yingyan Hou, Xiaochi Wei, Shuaiqiang Wang, Yan Zhang, Dawei Yin | N/A | N/A |
| Pelican: Correcting Hallucination in Vision-LLMs via Claim Decomposition and Program of Thought Verification | Pritish Sahu, Karan Sikka, Ajay Divakaran | N/A | N/A |
| Resampled Datasets Are Not Enough: Mitigating Societal Bias Beyond Single Attributes | Yusuke Hirota, Jerone Andrews, Dora Zhao, Orestis Papakyriakopoulos, Apostolos Modas, Yuta Nakashima, Alice Xiang | N/A | N/A |
| RealVul: Can We Detect Vulnerabilities in Web Applications with LLM? | Di Cao, Yong Liao, Xiuwei Shang | N/A | N/A |
| Unsupervised End-to-End Task-Oriented Dialogue with LLMs: The Power of the Noisy Channel | Brendan King, Jeffrey Flanigan | N/A | N/A |
| Humans or LLMs as the Judge? A Study on Judgement Bias | Guiming Hardy Chen, Shunian Chen, Ziche Liu, Feng Jiang, Benyou Wang | N/A | N/A |
| WPO: Enhancing RLHF with Weighted Preference Optimization | Wenxuan Zhou, Ravi Agrawal, Shujian Zhang, Sathish Reddy Indurthi, Sanqiang Zhao, Kaiqiang Song, Silei Xu, Chenguang Zhu | N/A | N/A |
| Walking in Others’ Shoes: How Perspective-Taking Guides Large Language Models in Reducing Toxicity and Bias | Rongwu Xu, Zian Zhou, Tianwei Zhang, Zehan Qi, Su Yao, Ke Xu, Wei Xu, Han Qiu | N/A | N/A |
| MetaReflection: Learning Instructions for Language Agents using Past Reflections | Priyanshu Gupta, Shashank Kirtania, Ananya Singha, Sumit Gulwani, Arjun Radhakrishna, Gustavo Soares, Sherry Shi | N/A | N/A |
| Stepwise Verification and Remediation of Student Reasoning Errors with Large Language Model Tutors | Nico Daheim, Jakub Macina, Manu Kapur, Iryna Gurevych, Mrinmaya Sachan | N/A | N/A |
| On Eliciting Syntax from Language Models via Hashing | Yiran Wang, Masao Utiyama | N/A | N/A |
| CliMedBench: A Large-Scale Chinese Benchmark for Evaluating Medical Large Language Models in Clinical Scenarios | Zetian Ouyang, Yishuai Qiu, Linlin Wang, Gerard de Melo, Ya Zhang, Yanfeng Wang, Liang He | N/A | N/A |
| The Best Defense is Attack: Repairing Semantics in Textual Adversarial Examples | Heng Yang | N/A | N/A |
| CSSL: Contrastive Self-Supervised Learning for Dependency Parsing on Relatively Free Word Ordered and Morphologically Rich Low Resource Languages | Pretam Ray, Jivnesh Sandhan, Amrith Krishna, Pawan Goyal | N/A | N/A |
| Perceptions of Linguistic Uncertainty by Language Models and Humans | Catarina G Belém, Markelle Kelly, Mark Steyvers, Sameer Singh, Padhraic Smyth | N/A | N/A |
| Explaining and Improving Contrastive Decoding by Extrapolating the Probabilities of a Huge and Hypothetical LM | Haw-Shiuan Chang, Nanyun Peng, Mohit Bansal, Anil Ramakrishna, Tagyoung Chung | N/A | N/A |
| Zero-shot Cross-domain Dialogue State Tracking via Context-aware Auto-prompting and Instruction-following Contrastive Decoding | Xiaoyu Dong, Yujie Feng, Zexin Lu, Guangyuan Shi, Xiao-Ming Wu | N/A | N/A |
| Knowledge Conflicts for LLMs: A Survey | Rongwu Xu, Zehan Qi, Zhijiang Guo, Cunxiang Wang, Hongru WANG, Yue Zhang, Wei Xu | N/A | N/A |
| Generative AI in the Era of “Alternative Facts” | Saadia Gabriel, Liang Lyu, James Siderius, Marzyeh Ghassemi, Jacob Andreas, Asuman E. Ozdaglar | N/A | N/A |
| MEANT: Multimodal Encoder for Antecedent Information | Benjamin Irving, Annika Marie Schoene | N/A | N/A |
| A Thorough Examination of Decoding Methods in the Era of LLMs | Chufan Shi, Haoran Yang, Deng Cai, Zhisong Zhang, Yifan Wang, Yujiu Yang, Wai Lam | N/A | N/A |
| AGRaME: Any-Granularity Ranking with Multi-Vector Embeddings | Revanth Gangi Reddy, Omar Attia, Yunyao Li, Heng Ji, Saloni Potdar | N/A | N/A |
| FIRST: Faster Improved Listwise Reranking with Single Token Decoding | Revanth Gangi Reddy, JaeHyeok Doo, Yifei Xu, Md Arafat Sultan, Deevya Swain, Avirup Sil, Heng Ji | N/A | N/A |
| Exploring Nested Named Entity Recognition with Large Language Models: Methods, Challenges, and Insights | Hongjin KIM, Jai-Eun Kim, Harksoo Kim | N/A | N/A |
| ReCaLL: Membership Inference via Relative Conditional Log-Likelihoods | Roy Xie, Junlin Wang, Ruomin Huang, Minxing Zhang, Rong Ge, Jian Pei, Neil Zhenqiang Gong, Bhuwan Dhingra | N/A | N/A |
| “Flex Tape Can’t Fix That”: Bias and Misinformation in Edited Language Models | Karina H Halevy, Anna Sotnikova, Badr AlKhamissi, Syrielle Montariol, Antoine Bosselut | N/A | N/A |
| Revisiting Who’s Harry Potter: Towards Targeted Unlearning from a Causal Intervention Perspective | Yujian Liu, Yang Zhang, Tommi Jaakkola, Shiyu Chang | N/A | N/A |
| LIONs: An Empirically Optimized Approach to Align Language Models | Xiao Yu, Qingyang Wu, Yu Li, Zhou Yu | N/A | N/A |
| Jellyfish: Instruction-Tuning Local Large Language Models for Data Preprocessing | Haochen Zhang, Yuyang Dong, Chuan Xiao, Masafumi Oyamada | N/A | N/A |
| A Comprehensive Survey of Scientific Large Language Models and Their Applications in Scientific Discovery | Yu Zhang, Xiusi Chen, Bowen Jin, Sheng Wang, Shuiwang Ji, Wei Wang, Jiawei Han | N/A | N/A |
| MiniCheck: Efficient Fact-Checking of LLMs on Grounding Documents | Liyan Tang, Philippe Laban, Greg Durrett | N/A | N/A |
| Beyond Label Attention: Transparency in Language Models for Automated Medical Coding via Dictionary Learning | John Wu, David Wu, Jimeng Sun | N/A | N/A |
| MOSEL: Inference Serving Using Dynamic Modality Selection | Bodun Hu, Le Xu, Jeongyoon Moon, Neeraja J Yadwadkar, Aditya Akella | N/A | N/A |
| From RAG to Riches: Retrieval Interlaced with Sequence Generation | Palak Jain, Livio Baldini Soares, Tom Kwiatkowski | N/A | N/A |
| Task Arithmetic can Mitigate Synthetic-to-Real Gap in Automatic Speech Recognition | Hsuan Su, Hua Farn, Fan-Yun Sun, Shang-Tse Chen, Hung-yi Lee | N/A | N/A |
| Learning to Correct for QA Reasoning with Black-box LLMs | Jaehyung Kim, Dongyoung Kim, Yiming Yang | N/A | N/A |
| AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks? | Ori Yoran, Samuel Joseph Amouyal, Chaitanya Malaviya, Ben Bogin, Ofir Press, Jonathan Berant | N/A | N/A |
| PostMark: A Robust Blackbox Watermark for Large Language Models | Yapei Chang, Kalpesh Krishna, Amir Houmansadr, John Frederick Wieting, Mohit Iyyer | N/A | N/A |
| Assessing “Implicit” Retrieval Robustness of Large Language Models | Xiaoyu Shen, Rexhina Blloshmi, Dawei Zhu, Jiahuan Pei, Wei Zhang | N/A | N/A |
| On the Relationship between Truth and Political Bias in Language Models | Suyash Fulay, William Brannon, Shrestha Mohanty, Cassandra Overney, Elinor Poole-Dayan, Deb Roy, Jad Kabbara | N/A | N/A |
| Can Active Label Correction Improve LLM-based Modular AI Systems? | Karan Taneja, Ashok Goel | N/A | N/A |
| Statistical Uncertainty in Word Embeddings: GloVe-V | Andrea Vallebueno, Cassandra Handan-Nader, Christopher D Manning, Daniel E. Ho | N/A | N/A |
| Annotation alignment: Comparing LLM and human annotations of conversational safety | Rajiv Movva, Pang Wei Koh, Emma Pierson | N/A | N/A |
| DiVERT: Distractor Generation with Variational Errors Represented as Text for Math Multiple-choice Questions | Nigel Fernandez, Alexander Scarlatos, Digory Smith, Simon Woodhead, Nancy Otero Ornelas, Andrew Lan | N/A | N/A |
| The Factuality Tax of Diversity-Intervened Text-to-Image Generation: Benchmark and Fact-Augmented Intervention | Yixin Wan, Di Wu, Haoran Wang, Kai-Wei Chang | N/A | N/A |
| CleanGen: Mitigating Backdoor Attacks for Generation Tasks in Large Language Models | Yuetai Li, Zhangchen Xu, Fengqing Jiang, Luyao Niu, Dinuka Sahabandu, Bhaskar Ramasubramanian, Radha Poovendran | N/A | N/A |
| Enhancing Reinforcement Learning with Intrinsic Rewards from Language Model Critique | Meng Cao, Lei Shu, Lei Yu, Yun Zhu, Nevan Wichers, Yinxiao Liu, Lei Meng | N/A | N/A |
| Words Matter: Reducing Stigma in Online Conversations about Substance Use with Large Language Models | Layla Bouzoubaa, Elham Aghakhani, Shadi Rezapour | N/A | N/A |
| Efficient Sequential Decision Making with Large Language Models | Dingyang Chen, Qi Zhang, Yinglun Zhu | N/A | N/A |
| SignCLIP: Connecting Text and Sign Language by Contrastive Learning | Zifan Jiang, Gerard Sant, Amit Moryossef, Mathias Müller, Rico Sennrich, Sarah Ebling | N/A | N/A |
| APPLS: Evaluating Evaluation Metrics for Plain Language Summarization | Yue Guo, Tal August, Gondy Leroy, Trevor Cohen, Lucy Lu Wang | N/A | N/A |
| Ontologically Faithful Generation of Non-Player Character Dialogues | Nathaniel Weir, Ryan Thomas, Randolph d’Amore, Kellie Hill, Benjamin Van Durme, Harsh Jhamtani | N/A | N/A |
| LLM See, LLM Do: Leveraging Active Inheritance to Target Non-Differentiable Objectives | Luísa Shimabucoro, Sebastian Ruder, Julia Kreutzer, Marzieh Fadaee, Sara Hooker | N/A | N/A |
| RuBLiMP: Russian Benchmark of Linguistic Minimal Pairs | Ekaterina Taktasheva, Maxim Bazhukov, Kirill Koncha, Alena Fenogenova, Ekaterina Artemova, Vladislav Mikhailov | N/A | N/A |
| Text-Tuple-Table: Towards Information Integration in Text-to-Table Generation via Global Tuple Extraction | Zheye Deng, Chunkit Chan, Weiqi Wang, Yuxi Sun, Wei Fan, Tianshi Zheng, Yauwai Yim, Yangqiu Song | N/A | N/A |
| Toward Compositional Behavior in Neural Models: A Survey of Current Views | Kate McCurdy, Paul Soulos, Paul Smolensky | N/A | N/A |
| Optimizing Instructions and Demonstrations for Multi-Stage Language Model Programs | Krista Opsahl-Ong, Michael J Ryan, Josh Purtell, David Broman, Christopher Potts, Matei Zaharia, Omar Khattab | N/A | N/A |
| Reverse-Engineering the Reader | Samuel Kiegeland, Ethan Wilcox, Afra Amini, David Robert Reich, Ryan Cotterell | N/A | N/A |
| Synchronous Faithfulness Monitoring for Trustworthy Retrieval-Augmented Generation | Di Wu, Jia-Chen Gu, Fan Yin, Nanyun Peng, Kai-Wei Chang | N/A | N/A |
| Structure Guided Prompt: Instructing Large Language Model in Multi-Step Reasoning by Exploring Graph Structure of the Text | Kewei Cheng, Nesreen K. Ahmed, Theodore L. Willke, Yizhou Sun | N/A | N/A |
| Less is More: Parameter-Efficient Selection of Intermediate Tasks for Transfer Learning | David Schulte, Felix Hamborg, Alan Akbik | N/A | N/A |
| The effects of distance on NPI illusive effects in BERT | So Young Lee, Mai Ha Vu | N/A | N/A |
| Enhancing Systematic Decompositional Natural Language Inference Using Informal Logic | Nathaniel Weir, Kate Sanders, Orion Weller, Shreya Sharma, Dongwei Jiang, Zhengping Jiang, Bhavana Dalvi Mishra, Oyvind Tafjord, Peter Jansen, Peter Clark, Benjamin Van Durme | N/A | N/A |
| Susu Box or Piggy Bank: Assessing Cultural Commonsense Knowledge between Ghana and the US | Christabel Acquaye, Haozhe An, Rachel Rudinger | N/A | N/A |
| Read Anywhere Pointed: Layout-aware GUI Screen Reading with Tree-of-Lens Grounding | Yue Fan, Lei Ding, Ching-Chen Kuo, Shan Jiang, Yang Zhao, Xinze Guan, Jie Yang, Yi Zhang, Xin Eric Wang | N/A | N/A |
| Ranking Manipulation for Conversational Search Engines | Samuel Pfrommer, Yatong Bai, Tanmay Gautam, Somayeh Sojoudi | N/A | N/A |
| Fast Forwarding Low-Rank Training | Adir Rahamim, Naomi Saphra, Sara Kangaslahti, Yonatan Belinkov | N/A | N/A |
| Precise Model Benchmarking with Only a Few Observations | Riccardo Fogliato, Pratik Patil, Nil-Jana Akpinar, Mathew Monfort | N/A | N/A |
| Attribute Diversity Determines the Systematicity Gap in VQA | Ian Berlot-Attwell, Kumar Krishna Agrawal, Annabelle Michael Carrell, Yash Sharma, Naomi Saphra | N/A | N/A |
| “Rows, Columns and Values, Oh My!” Synthesizing Scientific Literature into Tables using Language Models | Benjamin Newman, Yoonjoo Lee, Aakanksha Naik, Pao Siangliulue, Raymond Fok, Juho Kim, Daniel S Weld, Joseph Chee Chang, Kyle Lo | N/A | N/A |
| Development of Cognitive Intelligence in Pre-trained Language Models | Raj Sanjay Shah, Khushi Bhardwaj, Sashank Varma | N/A | N/A |
| Modeling Layout Reading Order as Ordering Relations for Visually-rich Document Understanding | Chong Zhang, Yi Tu, Yixi Zhao, Chenshu Yuan, Huan Chen, Yue Zhang, Mingxu Chai, Ya Guo, Huijia Zhu, Qi Zhang, Tao Gui | N/A | N/A |
| Birdie: Advancing State Space Models with a Minimalist Architecture and Novel Pre-training Objectives | Sam Blouir, Jimmy T.H. Smith, Antonios Anastasopoulos, Amarda Shehu | N/A | N/A |
| Is It Good Data for Multilingual Instruction Tuning or Just Bad Multilingual Evaluation for Large Language Models? | Pinzhen Chen, Simon Yu, Zhicheng Guo, Barry Haddow | N/A | N/A |
| Token Erasure as a Footprint of Implicit Vocabulary Items in LLMs | Sheridan Feucht, David Atkinson, Byron C Wallace, David Bau | N/A | N/A |
| TraveLER: A Modular Multi-LMM Agent Framework for Video Question-Answering | Chuyi Shang, Amos You, Sanjay Subramanian, Trevor Darrell, Roei Herzig | N/A | N/A |
| Evaluating the Effectiveness of Large Language Models in Establishing Conversational Grounding | Biswesh Mohapatra, Manav Nitin Kapadnis, Laurent Romary, Justine Cassell | N/A | N/A |
| Unlocking Memorization in Large Language Models with Dynamic Soft Prompting | Zhepeng Wang, Runxue Bao, Yawen Wu, Jackson Taylor, Cao Xiao, Feng Zheng, Weiwen Jiang, Shangqian Gao, Yanfu Zhang | N/A | N/A |
| If CLIP Could Talk: Understanding Vision-Language Model Representations Through Their Preferred Concept Descriptions | Reza Esfandiarpoor, Cristina Menghini, Stephen Bach | N/A | N/A |
| Extract, Define, Canonicalize: An LLM-based Framework for Knowledge Graph Construction | Bowen Zhang, Harold Soh | N/A | N/A |
| MQuinE: a Cure for “Z-paradox” in Knowledge Graph Embedding | Yang Liu, Huang Fang, Yunfeng Cai, Mingming Sun | N/A | N/A |
| Can Transformer Language Models Learn $n$-gram Language Models? | Anej Svete, Nadav Borenstein, Mike Zhou, Ryan Cotterell | N/A | N/A |
| StablePrompt: Automatic Prompt Tuning using Reinforcement Learning for Large Language Model | Minchan Kwon, Gaeun Kim, Jongsuk Kim, Haeil Lee, Junmo Kim | N/A | N/A |
| Summary of a Haystack: A Challenge to Long-Context LLMs and RAG Systems | Philippe Laban, Alexander Fabbri, Caiming Xiong, Chien-Sheng Wu | N/A | N/A |
| Multi-pass Decoding for Grammatical Error Correction | Xiaoying Wang, Lingling Mu, Jingyi Zhang, Hongfei Xu | N/A | N/A |
| Into the Unknown Unknowns: Engaged Human Learning through Participation in Language Model Agent Conversations | Yucheng Jiang, Yijia Shao, Dekun Ma, Sina Semnani, Monica Lam | N/A | N/A |
| SCOI: Syntax-augmented Coverage-based In-context Example Selection for Machine Translation | Chenming Tang, Zhixiang Wang, Yunfang Wu | N/A | N/A |
| Efficient Temporal Extrapolation of Multimodal Large Language Models with Temporal Grounding Bridge | Yuxuan Wang, Yueqian Wang, Pengfei Wu, Jianxin Liang, Dongyan Zhao, Yang Liu, Zilong Zheng | N/A | N/A |
| STORYSUMM: Evaluating Faithfulness in Story Summarization | Melanie Subbiah, Faisal Ladhak, Akankshya Mishra, Griffin Thomas Adams, Lydia Chilton, Kathleen McKeown | N/A | N/A |
| MMOE: Enhancing Multimodal Models with Mixtures of Multimodal Interaction Experts | Haofei Yu, Zhengyang Qi, Lawrence Keunho Jang, Russ Salakhutdinov, Louis-Philippe Morency, Paul Pu Liang | N/A | N/A |
| OmAgent: A Multi-modal Agent Framework for Complex Video Understanding with Task Divide-and-Conquer | Lu Zhang, Tiancheng Zhao, Heting Ying, Yibo Ma, Kyusong Lee | N/A | N/A |
| Enhancing Pre-Trained Generative Language Models with Question Attended Span Extraction on Machine Reading Comprehension | Lin Ai, Zheng Hui, Zizhou Liu, Julia Hirschberg | N/A | N/A |
| CommonIT: Commonality-Aware Instruction Tuning for Large Language Models via Data Partitions | Jun Rao, Xuebo Liu, Lian Lian, Shengjun Cheng, Yunjie Liao, Min Zhang | N/A | N/A |
| ESC: Efficient Speech Coding with Cross-Scale Residual Vector Quantized Transformers | Yuzhe Gu, Enmao Diao | N/A | N/A |
| Breaking ReLU Barrier: Generalized MoEfication for Dense Pretrained Models | Jaeseong Lee, seung-won hwang, Wonpyo Park, Mingi Ji | N/A | N/A |
| Detecting Subtle Differences between Human and Model Languages Using Spectrum of Relative Likelihood | Yang Xu, Yu Wang, Hao An, Yongyuan Li, Zhichen Liu | N/A | N/A |
| Optimizing Language Models with Fair and Stable Reward Composition in Reinforcement Learning | Jiahui Li, Hanlin Zhang, Fengda Zhang, Tai-Wei Chang, Kun Kuang, Long Chen, Jun Zhou | N/A | N/A |
| Fine-grained Pluggable Gradient Ascent for Knowledge Unlearning in Language Models | XiaoHua Feng, Chaochao Chen, Yuyuan Li, Zibin Lin | N/A | N/A |
| ARM: An Alignment-and-Replacement Module for Chinese Spelling Check Based on LLMs | Changchun Liu, Kai Zhang, Junzhe Jiang, Zirui Liu, Hanqing Tao, Min Gao, Enhong Chen | N/A | N/A |
| On the In-context Generation of Language Models | Zhongtao Jiang, Yuanzhe Zhang, Kun Luo, Xiaowei Yuan, Jun Zhao, Kang Liu | N/A | N/A |
| Atomic Inference for NLI with Generated Facts as Atoms | Joe Stacey, Pasquale Minervini, Haim Dubossarsky, Oana-Maria Camburu, Marek Rei | N/A | N/A |
| Towards Robust Speech Representation Learning for Thousands of Languages | William Chen, Wangyou Zhang, Yifan Peng, Xinjian Li, Jinchuan Tian, Jiatong Shi, Xuankai Chang, Soumi Maiti, Karen Livescu, Shinji Watanabe | N/A | N/A |
| I Learn Better If You Speak My Language: Understanding the Superior Performance of Fine-Tuning Large Language Models with LLM-Generated Responses | Xuan Ren, Biao Wu, Lingqiao Liu | N/A | N/A |
| PreAlign: Boosting Cross-Lingual Transfer by Early Establishment of Multilingual Alignment | Jiahuan Li, Shujian Huang, Aarron Ching, Xinyu Dai, Jiajun Chen | N/A | N/A |
| An image speaks a thousand words, but can everyone listen? On image transcreation for cultural relevance | Simran Khanuja, Sathyanarayanan Ramamoorthy, Yueqi Song, Graham Neubig | N/A | N/A |
| When Parts are Greater Than Sums: Individual LLM Components Can Outperform Full Models | Ting-Yun Chang, Jesse Thomason, Robin Jia | N/A | N/A |
| Multimodal Clickbait Detection by De-confounding Biases Using Causal Representation Inference | Jianxing Yu, Shiqi Wang, Han Yin, Zhenlong Sun, Ruobing Xie, Bo Zhang, Yanghui Rao | N/A | N/A |
| Matryoshka-Adaptor: Unsupervised and Supervised Tuning for Smaller Embedding Dimensions | Jinsung Yoon, Rajarishi Sinha, Sercan O Arik, Tomas Pfister | N/A | N/A |
| KNN-Instruct: Automatic Instruction Construction with K Nearest Neighbor Deduction | Jianshang Kou, Benfeng Xu, Chiwei Zhu, Zhendong Mao | N/A | N/A |
| Contextualized Sequence Likelihood: Enhanced Confidence Scores for Natural Language Generation | Zhen Lin, Shubhendu Trivedi, Jimeng Sun | N/A | N/A |
| MixGR: Enhancing Retriever Generalization for Scientific Domain through Complementary Granularity | Fengyu Cai, Xinran Zhao, Tong Chen, Sihao Chen, Hongming Zhang, Iryna Gurevych, Heinz Koeppl | N/A | N/A |
| CARER - ClinicAl Reasoning-Enhanced Representation for Temporal Health Risk Prediction | Tuan Dung Nguyen, Thanh Trung Huynh, Minh Hieu Phan, Quoc Viet Hung Nguyen, Phi Le Nguyen | N/A | N/A |
| “In-Dialogues We Learn”: Towards Personalized Dialogue Without Pre-defined Profiles through In-Dialogue Learning | Chuanqi Cheng, Quan Tu, Wei Wu, Shuo Shang, Cunli Mao, Zhengtao Yu, Rui Yan | N/A | N/A |
| Encourage or Inhibit Monosemanticity? Revisit Monosemanticity from a Feature Decorrelation Perspective | Hanqi Yan, Yanzheng Xiang, Guangyi Chen, Yifei Wang, Lin Gui, Yulan He | N/A | N/A |
| Enhancing Language Model Factuality via Activation-Based Confidence Calibration and Guided Decoding | Xin Liu, Farima Fatahi Bayat, Lu Wang | N/A | N/A |
| Reasoning Robustness of LLMs to Adversarial Typographical Errors | Esther Gan, Yiran Zhao, Liying Cheng, Mao Yancan, Anirudh Goyal, Kenji Kawaguchi, Min-Yen Kan, Michael Shieh | N/A | N/A |
| InferAligner: Inference-Time Alignment for Harmlessness through Cross-Model Guidance | Pengyu Wang, Dong Zhang, Linyang Li, Chenkun Tan, Xinghao Wang, Mozhi Zhang, Ke Ren, Botian Jiang, Xipeng Qiu | N/A | N/A |
| Belief Revision: The Adaptability of Large Language Models Reasoning | Bryan Wilie, Samuel Cahyawijaya, Etsuko Ishii, Junxian He, Pascale Fung | N/A | N/A |
| Fisher Information-based Efficient Curriculum Federated Learning with Large Language Models | Ji Liu, Jiaxiang Ren, Ruoming Jin, Zijie Zhang, Yang Zhou, Patrick Valduriez, Dejing Dou | N/A | N/A |
| Bio-RFX: Refining Biomedical Extraction via Advanced Relation Classification and Structural Constraints | Minjia Wang, Fangzhou Liu, Xiuxing Li, Bowen Dong, Zhenyu Li, Tengyu Pan, Jianyong Wang | N/A | N/A |
| Decoding Matters: Addressing Amplification Bias and Homogeneity Issue in Recommendations for Large Language Models | Keqin Bao, Jizhi Zhang, Yang Zhang, Xinyue Huo, Chong Chen, Fuli Feng | N/A | N/A |
| LLMs Are Prone to Fallacies in Causal Inference | Nitish Joshi, Abulhair Saparov, Yixin Wang, He He | N/A | N/A |
| Roleplay-doh: Enabling Domain-Experts to Create LLM-simulated Patients via Eliciting and Adhering to Principles | Ryan Louie, Ananjan Nandi, William Fang, Cheng Chang, Emma Brunskill, Diyi Yang | N/A | N/A |
| The Lou Dataset - Exploring the Impact of Gender-Fair Language in German Text Classification | Andreas Waldis, Joel Birrer, Anne Lauscher, Iryna Gurevych | N/A | N/A |
| When Generative Adversarial Networks Meet Sequence Labeling Challenges | Yu Tong, Ge Chen, Guokai Zheng, Rui Li, Jiang Dazhi | N/A | N/A |
| Evidence-Focused Fact Summarization for Knowledge-Augmented Zero-Shot Question Answering | Sungho Ko, Hyunjin Cho, Hyungjoo Chae, Jinyoung Yeo, Dongha Lee | N/A | N/A |
| Speechworthy Instruction-tuned Language Models | Hyundong Justin Cho, Nicolaas Paul Jedema, Leonardo F. R. Ribeiro, Karishma Sharma, Pedro Szekely, Alessandro Moschitti, Ruben Janssen, Jonathan May | N/A | N/A |
| Data, Data Everywhere: A Guide for Pretraining Dataset Construction | Jupinder Parmar, Shrimai Prabhumoye, Joseph Jennings, Bo Liu, Aastha Jhunjhunwala, Zhilin Wang, Mostofa Patwary, Mohammad Shoeybi, Bryan Catanzaro | N/A | N/A |
| Fine-Tuning and Prompt Optimization: Two Good Steps that Work Better Together | Dilara Soylu, Christopher Potts, Omar Khattab | N/A | N/A |
| Demystifying Verbatim Memorization in Large Language Models | Jing Huang, Diyi Yang, Christopher Potts | N/A | N/A |
| AmbigNLG: Addressing Task Ambiguity in Instruction for NLG | Ayana Niwa, Hayate Iso | N/A | N/A |
| Distributional Properties of Subword Regularization | Marco Cognetta, Vilém Zouhar, Naoaki Okazaki | N/A | N/A |
| DataTales: A Benchmark for Real-World Intelligent Data Narration | Yajing Yang, Qian Liu, Min-Yen Kan | N/A | N/A |
| Towards Fast Multilingual LLM Inference: Speculative Decoding and Specialized Drafters | Euiin Yi, Taehyeon Kim, Hongseok Jeung, Du-Seong Chang, Se-Young Yun | N/A | N/A |
| GlobeSumm: A Challenging Benchmark Towards Unifying Multi-lingual, Cross-lingual and Multi-document News Summarization | Yangfan Ye, Xiachong Feng, Xiaocheng Feng, Weitao Ma, Libo Qin, Dongliang Xu, Qing Yang, Hongtao Liu, Bing Qin | N/A | N/A |
| Breaking the Curse of Multilinguality with Cross-lingual Expert Language Models | Terra Blevins, Tomasz Limisiewicz, Suchin Gururangan, Margaret Li, Hila Gonen, Noah A. Smith, Luke Zettlemoyer | N/A | N/A |
| More Insightful Feedback for Tutoring: Enhancing Generation Mechanisms and Automatic Evaluation | Wencke Liermann, Jin-Xia Huang, Yohan Lee, Kong Joo Lee | N/A | N/A |
| Stable Language Model Pre-training by Reducing Embedding Variability | Woojin Chung, Jiwoo Hong, Na Min An, James Thorne, Se-Young Yun | N/A | N/A |
| What is lost in Normalization? Exploring Pitfalls in Multilingual ASR Model Evaluations | Kavya Manohar, Leena G Pillai | N/A | N/A |
| Diversity Over Size: On the Effect of Sample and Topic Sizes for Topic-Dependent Argument Mining Datasets | Benjamin Schiller, Johannes Daxenberger, Andreas Waldis, Iryna Gurevych | N/A | N/A |
| Kiss up, Kick down: Exploring Behavioral Changes in Multi-modal Large Language Models with Assigned Visual Personas | Seungjong Sun, Eungu Lee, Seo Yeon Baek, Seunghyun Hwang, Lee Wonbyung, Dongyan Nan, Bernard J Jansen, Jang Hyun Kim | N/A | N/A |
| ATM: Adversarial Tuning Multi-agent System Makes a Robust Retrieval-Augmented Generator | Junda Zhu, Lingyong Yan, Haibo Shi, Dawei Yin, Lei Sha | N/A | N/A |
| Dynamic Multi-granularity Attribution Network for Aspect-based Sentiment Analysis | Yanjiang Chen, Kai Zhang, hufeng, Xianquan Wang, Ruikang Li, Qi Liu | N/A | N/A |
| Unlabeled Debiasing in Downstream Tasks via Class-wise Low Variance Regularization | Shahed Masoudian, Markus Frohmann, Navid Rekabsaz, Markus Schedl | N/A | N/A |
| Large Language Models Know What is Key Visual Entity: An LLM-assisted Multimodal Retrieval for VQA | Pu Jian, Donglei Yu, Jiajun Zhang | N/A | N/A |
| Towards Probing Speech-Specific Risks in Large Multimodal Models: A Taxonomy, Benchmark, and Insights | Hao Yang, Lizhen Qu, Ehsan Shareghi, Reza Haf | N/A | N/A |
| Self-AMPLIFY: Improving Small Language Models with Self Post Hoc Explanations | Milan BHAN, Jean-Noël Vittaut, Nicolas CHESNEAU, Marie-Jeanne Lesot | N/A | N/A |
| What are the Generator Preferences for End-to-end Task-Oriented Dialog System? | Wanshi Xu, Xianwei Zhuang, Zhanpeng Chen, Zhihong Zhu, Xuxin Cheng, Yuexian Zou | N/A | N/A |
| Paraphrase Types Elicit Prompt Engineering Capabilities | Jan Philip Wahle, Terry Ruas, Yang Xu, Bela Gipp | N/A | N/A |
| VLEU: a Method for Automatic Evaluation for Generalizability of Text-to-Image Models | Jingtao Cao, Zhang Zheng, Hongru WANG, Kam-Fai Wong | N/A | N/A |
| Towards Online Continuous Sign Language Recognition and Translation | Ronglai Zuo, Fangyun Wei, Brian Mak | N/A | N/A |
| Mitigate Extrinsic Social Bias in Pre-trained Language Models via Continuous Prompts Adjustment | Yiwei Dai, Hengrui Gu, Ying Wang, Xin Wang | N/A | N/A |
| Split and Merge: Aligning Position Biases in LLM-based Evaluators | Zongjie Li, Chaozheng Wang, Pingchuan Ma, Daoyuan Wu, Shuai Wang, Cuiyun Gao, Yang Liu | N/A | N/A |
| Integrating Argumentation and Hate-Speech-based Techniques for Countering Misinformation | Sougata Saha, Rohini Srihari | N/A | N/A |
| BPO: Supercharging Online Preference Learning by Adhering to the Proximity of Behavior LLM | Wenda Xu, Jiachen Li, William Yang Wang, Lei Li | N/A | N/A |
| One2Set + Large Language Model: Best Partners for Keyphrase Generation | Liangying Shao, Liang Zhang, Minlong Peng, Guoqi Ma, Hao Yue, Mingming Sun, Jinsong Su | N/A | N/A |
| Unlocking Markets: A Multilingual Benchmark to Cross-Market Question Answering | Yifei Yuan, Yang Deng, Anders Søgaard, Mohammad Aliannejadi | N/A | N/A |
| ORPO: Monolithic Preference Optimization without Reference Model | Jiwoo Hong, Noah Lee, James Thorne | N/A | N/A |
| A Multi-Perspective Analysis of Memorization in Large Language Models | Bowen Chen, Namgi Han, Yusuke Miyao | N/A | N/A |
| Do LLMs suffer from Multi-Party Hangover? A Diagnostic Approach to Addressee Recognition and Response Selection in Conversations | Nicolò Penzo, Maryam Sajedinia, Bruno Lepri, Sara Tonelli, Marco Guerini | N/A | N/A |
| Code Prompting Elicits Conditional Reasoning Abilities in Text+Code LLMs | Haritz Puerto, Martin Tutek, Somak Aditya, Xiaodan Zhu, Iryna Gurevych | N/A | N/A |
| Unveiling the Role of Pretraining in Direct Speech Translation | Belen Alastruey, Gerard I. Gállego, Marta R. Costa-jussà | N/A | N/A |
| PCQPR: Proactive Conversational Question Planning with Reflection | Shasha Guo | N/A | N/A |
| CodeAgent: Autonomous Communicative Agents for Code Review | Xunzhu Tang, Kisub Kim, Yewei Song, Cedric Lothritz, Bei Li, Saad Ezzini, Haoye Tian, Jacques Klein, Tegawendé F. Bissyandé | N/A | N/A |
| TroL: Traversal of Layers for Large Language and Vision Models | Byung-Kwan Lee, Sangyun Chung, Chae Won Kim, Beomchan Park, Yong Man Ro | N/A | N/A |
| MMTE: Corpus and Metrics for Evaluating Machine Translation Quality of Metaphorical Language | Shun Wang, Ge Zhang, Han Wu, Tyler Loakman, Wenhao Huang, Chenghua Lin | N/A | N/A |
| Revisiting Supertagging for faster HPSG parsing | Olga Zamaraeva, Carlos Gómez-Rodríguez | N/A | N/A |
| Improve Dense Passage Retrieval with Entailment Tuning | Lu Dai, Hao Liu, Hui Xiong | N/A | N/A |
| ToolBeHonest: A Multi-level Hallucination Diagnostic Benchmark for Tool-Augmented Large Language Models | Yuxiang Zhang, Jing Chen, Junjie Wang, Yaxin Liu, Cheng Yang, Chufan Shi, Xinyu Zhu, Zihao Lin, Hanwen Wan, Yujiu Yang, Tetsuya Sakai, Tian Feng, Hayato Yamana | N/A | N/A |
| TEMA: Token Embeddings Mapping for Enriching Low-Resource Language Models | Rodolfo Zevallos, Núria Bel, Mireia Farrús | N/A | N/A |
| DECOR: Improving Coherence in L2 English Writing with a Novel Benchmark for Incoherence Detection, Reasoning, and Rewriting | Xuanming Zhang, Anthony Diaz, Zixun Chen, Qingyang Wu, Kun Qian, Erik Voss, Zhou Yu | N/A | N/A |
| Text2Chart31: Instruction Tuning for Chart Generation with Automatic Feedback | Fatemeh Pesaran zadeh, Juyeon Kim, Jin-Hwa Kim, Gunhee Kim | N/A | N/A |
| PrExMe: Large Scale Prompt Exploration of Open Source LLMs for Machine Translation and Summarization Evaluation | Christoph Leiter, Steffen Eger | N/A | N/A |
| Universal Vulnerabilities in Large Language Models: Backdoor Attacks for In-context Learning | Shuai Zhao, Meihuizi Jia, Anh Tuan Luu, Fengjun Pan, Jinming Wen | N/A | N/A |
| Repairs in a Block World: A New Benchmark for Handling User Corrections with Multi-Modal Language Models | Javier Chiyah-Garcia, Alessandro Suglia, Arash Eshghi | N/A | N/A |
| Beyond the Turn-Based Game: Enabling Real-Time Conversations with Duplex Models | Xinrong Zhang, Yingfa Chen, Shengding Hu, Xu Han, Zihang Xu, Yuanwei Xu, Weilin Zhao, Maosong Sun, Zhiyuan Liu | N/A | N/A |
| Strengthening Structural Inductive Biases by Pre-training to Perform Syntactic Transformations | Matthias Lindemann, Alexander Koller, Ivan Titov | N/A | N/A |
| Puzzle Solving using Reasoning of Large Language Models: A Survey | Panagiotis Giadikiaroglou, Maria Lymperaiou, Giorgos Filandrianos, Giorgos Stamou | N/A | N/A |
| SciEx: Benchmarking Large Language Models on Scientific Exams with Human Expert Grading and Automatic Grading | Tu Anh Dinh, Carlos Mullov, Leonard Bärmann, Zhaolin Li, Danni Liu, Simon Reiß, Jueun Lee, Nathan Lerzer, Jianfeng Gao, Fabian Peller-Konrad, Alexander Waibel, Tamim Asfour, Michael Beigl, Rainer Stiefelhagen, Carsten Dachsbacher, Klemens Böhm, Jan Niehues | N/A | N/A |
| Red Teaming Language Models for Processing Contradictory Dialogues | Xiaofei Wen, Bangzheng Li, Tenghao Huang, Muhao Chen | N/A | N/A |
| Fishing for Magikarp: Automatically Detecting Under-trained Tokens in Large Language Models | Sander Land, Max Bartolo | N/A | N/A |
| Reasoning or a Semblance of it? A Diagnostic Study of Transitive Reasoning in LLMs | Houman Mehrafarin, Arash Eshghi, Ioannis Konstas | N/A | N/A |
| Don’t Underestimate the Octopus - Why The Symbol Grounding Problem Does Not Apply to LLMs | Reto Gubelmann | N/A | N/A |
| Major Entity Identification: A Generalizable Alternative to Coreference Resolution | Kawshik Manikantan, Shubham Toshniwal, Makarand Tapaswi, Vineet Gandhi | N/A | N/A |
| Enhancing High-order Interaction Awareness in LLM-based Recommender Model | Xinfeng Wang, Jin Cui, Fumiyo Fukumoto, Yoshimi Suzuki | N/A | N/A |
| What Are the Odds? Language Models Are Capable of Probabilistic Reasoning | Akshay Paruchuri, Jake Garrison, Shun Liao, John B Hernandez, Jacob Sunshine, Tim Althoff, Xin Liu, Daniel McDuff | N/A | N/A |
| MARE: Multi-Aspect Rationale Extractor on Unsupervised Rationale Extraction | Han Jiang, Junwen Duan, Zhe Qu, Jianxin Wang | N/A | N/A |
| LoRA-Guard: Parameter-Efficient Guardrail Adaptation for Content Moderation of Large Language Models | Hayder Elesedy, Pedro M Esperanca, Silviu Vlad Oprea, Mete Ozay | N/A | N/A |
| “A good pun is its own reword”: Can Large Language Models Understand Puns? | Zhijun Xu, Siyu Yuan, Lingjie Chen, Deqing Yang | N/A | N/A |
| QGEval: Benchmarking Multi-dimensional Evaluation for Question Generation | Weiping Fu, Bifan Wei, Jianxiang Hu, Zhongmin Cai, Jun Liu | N/A | N/A |
| Dependency Graph Parsing as Sequence Labeling | Ana Ezquerro, David Vilares, Carlos Gómez-Rodríguez | N/A | N/A |
| NuNER: Entity Recognition Encoder Pre-training via LLM-Annotated Data | Sergei Bogdanov, Alexandre Constantin, Timothée Bernard, Benoit Crabbé, Etienne P Bernard | N/A | N/A |
| Towards a Greek Proverb Atlas: Computational Spatial Exploration and Attribution of Greek Proverbs | John Pavlopoulos, Panos Louridas, Panagiotis Filos | N/A | N/A |
| Unraveling Babel: Exploring Multilingual Activation Patterns of LLMs and Their Applications | Weize Liu, Yinlong Xu, Hongxia Xu, Jintai Chen, Xuming Hu, Jian Wu | N/A | N/A |
| Advancing Semantic Textual Similarity Modeling: A Regression Framework with Translated ReLU and Smooth K2 Loss | Bowen Zhang, Chunping Li | N/A | N/A |
| Rationalizing Transformer Predictions via End-To-End Differentiable Self-Training | Marc Felix Brinner, Sina Zarrieß | N/A | N/A |
| Segment Any Text: A Universal Approach for Robust, Efficient and Adaptable Sentence Segmentation | Markus Frohmann, Igor Sterner, Ivan Vulić, Benjamin Minixhofer, Markus Schedl | N/A | N/A |
| Applying Contrastive Learning to Code Vulnerability Type Classification | Chen Ji, Su Yang, Hongyu Sun, Yuqing Zhang | N/A | N/A |
| TheoremLlama: Transforming General-Purpose LLMs into Lean4 Experts | Ruida Wang, Jipeng Zhang, Yizhen Jia, Rui Pan, Shizhe Diao, Renjie Pi, Tong Zhang | N/A | N/A |
| Multi-Level Cross-Modal Alignment for Speech Relation Extraction | Liang Zhang, Zhen Yang, Biao Fu, Ziyao Lu, Liangying Shao, Shiyu Liu, Fandong Meng, Jie Zhou, Xiaoli Wang, Jinsong Su | N/A | N/A |
| Self-Training for Sample-Efficient Active Learning for Text Classification with Pre-Trained Language Models | Christopher Schröder, Gerhard Heyer | N/A | N/A |
| PANDA: Persona Attributes Navigation for Detecting and Alleviating Overuse Problem in Large Language Models | Jinsung Kim, Seonmin Koo, Heuiseok Lim | N/A | N/A |
| The Multilingual Alignment Prism: Aligning Global and Local Preferences to Reduce Harm | Aakanksha, Arash Ahmadian, Beyza Ermis, Seraphina Goldfarb-Tarrant, Julia Kreutzer, Marzieh Fadaee, Sara Hooker | N/A | N/A |
| Subword Segmentation in LLMs: Looking at Inflection and Consistency | Marion Di Marco, Alexander Fraser | N/A | N/A |
| Explicit, Implicit, and Scattered: Revisiting Event Extraction to Capture Complex Arguments | Omar Sharif, Joseph Gatto, Madhusudan Basak, Sarah Masud Preum | N/A | N/A |
| Let Me Teach You: Pedagogical Foundations of Feedback for Language Models | Beatriz Borges, Niket Tandon, Tanja Käser, Antoine Bosselut | N/A | N/A |
| Unknown Claims: Generation of Fact-Checking Training Examples from Unstructured and Structured Data | Jean-Flavien Bussotti, Luca Ragazzi, Giacomo Frisoni, Gianluca Moro, Paolo Papotti | N/A | N/A |
| TL-CL: Task And Language Incremental Continual Learning | Shrey Satapara, P. K. Srijith | N/A | N/A |
| Medical Adaptation of Large Language and Vision-Language Models: Are We Making Progress? | Daniel P Jeong, Saurabh Garg, Zachary Chase Lipton, Michael Oberst | N/A | N/A |
| Empowering Multi-step Reasoning across Languages via Program-Aided Language Models | Leonardo Ranaldi, Giulia Pucci | N/A | N/A |
| Do LLMs Overcome Shortcut Learning? An Evaluation of Shortcut Challenges in Large Language Models | Yu Yuan, Lili Zhao, Kai Zhang, Guangting Zheng, Qi Liu | N/A | N/A |
| ControlMath: Controllable Data Generation Promotes Math Generalist Models | Nuo Chen, Ning Wu, Jianhui Chang, Ming Gong, Linjun Shou, Dongmei Zhang, Jia Li | N/A | N/A |
| Where Am I From? Identifying Origin of LLM-generated Content | Liying Li, Yihan Bai, Minhao Cheng | N/A | N/A |
| ReadMe++: Benchmarking Multilingual Language Models for Multi-Domain Readability Assessment | Tarek Naous, Michael J Ryan, Anton Lavrouk, Mohit Chandra, Wei Xu | N/A | N/A |
| GlossLM: A Massively Multilingual Corpus and Pretrained Model for Interlinear Glossed Text | Michael Ginn, Lindia Tjuatja, Taiqi He, Enora Rice, Graham Neubig, Alexis Palmer, Lori Levin | N/A | N/A |
| GDTB: Genre Diverse Data for English Shallow Discourse Parsing across Modalities, Text Types, and Domains | Yang Janet Liu, Tatsuya Aoyama, Wesley Scivetti, Yilun Zhu, Shabnam Behzad, Lauren Elizabeth Levine, Jessica Lin, Devika Tiwari, Amir Zeldes | N/A | N/A |
| RA2FD: Distilling Faithfulness into Efficient Dialogue Systems | Zhiyuan Zhu, Yusheng Liao, Chenxin Xu, Yunfeng Guan, Yanfeng Wang, Yu Wang | N/A | N/A |
| Subjective Topic meets LLMs: Unleashing Comprehensive, Reflective and Creative Thinking through the Negation of Negation | Fangrui Lv, Kaixiong Gong, Jian Liang, Xinyu Pang, Changshui Zhang | N/A | N/A |
| Experimental Contexts Can Facilitate Robust Semantic Property Inference in Language Models, but Inconsistently | Kanishka Misra, Allyson Ettinger, Kyle Mahowald | N/A | N/A |
| Leveraging Estimated Transferability Over Human Intuition for Model Selection in Text Ranking | Jun Bai, Zhuofan Chen, Zhenzi Li, Hanhua Hong, Jianfei Zhang, Chen Li, Chenghua Lin, Wenge Rong | N/A | N/A |
| A Coordinate System for In-Context Learning | Anhao Zhao, Fanghua Ye, Jinlan Fu, Xiaoyu Shen | N/A | N/A |
| Self-Powered LLM Modality Expansion for Large Speech-Text Models | Tengfei Yu, Xuebo Liu, Zhiyi Hou, Liang Ding, Dacheng Tao, Min Zhang | N/A | N/A |
| ABSEval: An Agent-based Framework for Script Evaluation | Sirui Liang, Baoli Zhang, Jun Zhao, Kang Liu | N/A | N/A |
| Latent Concept-based Explanation of NLP Models | Xuemin Yu, Fahim Dalvi, Nadir Durrani, Marzia Nouri, Hassan Sajjad | N/A | N/A |
| Decoding with Limited Teacher Supervision Requires Understanding When to Trust the Teacher | Hyunjong Ok, Jegwang Ryu, Jaeho Lee | N/A | N/A |
| Enhancing Data Quality through Simple De-duplication: Navigating Responsible Computational Social Science Research | Yida Mu, Mali Jin, Xingyi Song, Nikolaos Aletras | N/A | N/A |
| The Mystery of the Pathological Path-star Task for Language Models | Arvid Frydenlund | N/A | N/A |
| Voices in a Crowd: Searching for clusters of unique perspectives | Nikolas Vitsakis, Amit Parekh, Ioannis Konstas | N/A | N/A |
| Neeko: Leveraging Dynamic LoRA for Efficient Multi-Character Role-Playing Agent | Xiaoyan Yu, Tongxu Luo, Yifan Wei, Fangyu Lei, Yiming Huang, Hao Peng, Liehuang Zhu | N/A | N/A |
| SLANG: New Concept Comprehension of Large Language Models | Lingrui Mei, Shenghua Liu, Yiwei Wang, Baolong Bi, Xueqi Cheng | N/A | N/A |
| Towards Interpretable Sequence Continuation: Analyzing Shared Circuits in Large Language Models | Michael Lan, Philip Torr, Fazl Barez | N/A | N/A |
| Why Does New Knowledge Create Messy Ripple Effects in LLMs? | Jiaxin Qin, Zixuan Zhang, Chi Han, Pengfei Yu, Manling Li, Heng Ji | N/A | N/A |
| Lifelong Event Detection via Optimal Transport | Viet Dao, Van-Cuong Pham, Quyen Tran, Thanh-Thien Le, Linh Van Ngo, Thien Huu Nguyen | N/A | N/A |
| SUPER: Evaluating Agents on Setting Up and Executing Tasks from Research Repositories | Ben Bogin, Kejuan Yang, Shashank Gupta, Kyle Richardson, Erin Bransom, Peter Clark, Ashish Sabharwal, Tushar Khot | N/A | N/A |
| FIRST: Teach A Reliable Large Language Model Through Efficient Trustworthy Distillation | KaShun Shum, Minrui Xu, Jianshu Zhang, Zixin Chen, Shizhe Diao, Hanze Dong, Jipeng Zhang, Muhammad Omer Raza | N/A | N/A |
| Domain adapted machine translation: What does catastrophic forgetting forget and why? | Danielle Saunders, Steve DeNeefe | N/A | N/A |
| Enhancing AI Assisted Writing with One-Shot Implicit Negative Feedback | Benjamin Towle, Ke Zhou | N/A | N/A |
| Atomic Self-Consistency for Better Long Form Generations | Raghuveer Thirukovalluru, Yukun Huang, Bhuwan Dhingra | N/A | N/A |
| “Global is Good, Local is Bad?”: Understanding Brand Bias in LLMs | Mahammed Kamruzzaman, Hieu Minh Nguyen, Gene Louis Kim | N/A | N/A |
| Optimizing Rare Word Accuracy in Direct Speech Translation with a Retrieval-and-Demonstration Approach | Siqi Li, Danni Liu, Jan Niehues | N/A | N/A |
| ACE: A LLM-based Negotiation Coaching System | Ryan Shea, Aymen Kallala, Xin Lucy Liu, Michael W. Morris, Zhou Yu | N/A | N/A |
| TransferTOD: A Generalizable Chinese Multi-Domain Task-Oriented Dialogue System with Transfer Capabilities | Ming Zhang, Caishuang Huang, Yilong Wu, Shichun Liu, Huiyuan Zheng, Yurui Dong, Yujiong Shen, Shihan Dou, Jun Zhao, Junjie Ye, Qi Zhang, Tao Gui, Xuanjing Huang | N/A | N/A |
| PATIENT-Ψ: Using Large Language Models to Simulate Patients for Training Mental Health Professionals | Ruiyi Wang, Stephanie Milani, Jamie C. Chiu, Jiayin Zhi, Shaun M. Eack, Travis Labrum, Samuel M Murphy, Nev Jones, Kate V Hardy, Hong Shen, Fei Fang, Zhiyu Chen | N/A | N/A |
| DKEC: Domain Knowledge Enhanced Multi-Label Classification for Diagnosis Prediction | Xueren Ge, Abhishek Satpathy, Ronald Dean Williams, John Stankovic, Homa Alemzadeh | N/A | N/A |
| $\texttt{ModSCAN}$: Measuring Stereotypical Bias in Large Vision-Language Models from Vision and Language Modalities | Yukun Jiang, Zheng Li, Xinyue Shen, Yugeng Liu, Michael Backes, Yang Zhang | N/A | N/A |
| Large Language Models Can Self-Correct with Key Condition Verification | Zhenyu Wu, Qingkai Zeng, Zhihan Zhang, Zhaoxuan Tan, Chao Shen, Meng Jiang | N/A | N/A |
| Learning to Write Rationally: How Information Is Distributed in Non-native Speakers’ Essays | Zixin Tang, Janet van Hell | N/A | N/A |
| Defending Against Social Engineering Attacks in the Age of LLMs | Lin Ai, Tharindu Sandaruwan Kumarage, Amrita Bhattacharjee, Zizhou Liu, Zheng Hui, Michael S. Davinroy, James Cook, Laura Cassani, Kirill Trapeznikov, Matthias Kirchner, Arslan Basharat, Anthony Hoogs, Joshua Garland, Huan Liu, Julia Hirschberg | N/A | N/A |
| Heterogeneous LoRA for Federated Fine-tuning of On-Device Foundation Models | Yae Jee Cho, Luyang Liu, Zheng Xu, Aldi Fahrezi, Gauri Joshi | N/A | N/A |
| Make Some Noise: Unlocking Language Model Parallel Inference Capability through Noisy Training | Yixuan Wang, Xianzhen Luo, Fuxuan Wei, Yijun Liu, Qingfu Zhu, Xuanyu Zhang, Qing Yang, Dongliang Xu, Wanxiang Che | N/A | N/A |
| Target-Aware Language Modeling via Granular Data Sampling | Ernie Chang, Pin-Jie Lin, Yang Li, Changsheng Zhao, Daeil Kim, Rastislav Rabatin, Zechun Liu, Yangyang Shi, Vikas Chandra | N/A | N/A |
| SPEED++: A Multilingual Event Extraction Framework for Epidemic Prediction and Preparedness | Tanmay Parekh, Jeffrey Kwan, Jiarui Yu, Sparsh Johri, Hyosang Ahn, Sreya Muppalla, Kai-Wei Chang, Wei Wang, Nanyun Peng | N/A | N/A |
| Learning from Feedback with Coupled Comprehension and Generation | Mustafa Omer Gul, Yoav Artzi | N/A | N/A |
| UNICORN: A Unified Causal Video-Oriented Language-Modeling Framework for Temporal Video-Language Tasks | Yuanhao Xiong, Yixin Nie, Haotian Liu, Boxin Wang, Jun Chen, Rong Jin, Cho-Jui Hsieh, Lorenzo Torresani, Jie Lei | N/A | N/A |
| Story Morals: Surfacing value-driven narrative schemas using large language models | David G Hobson, Haiqi Zhou, Derek Ruths, Andrew Piper | N/A | N/A |
| OATH-Frames: Characterizing Online Attitudes Towards Homelessness with LLM Assistants | Jaspreet Ranjit, Brihi Joshi, Rebecca Dorn, Laura Petry, Olga Koumoundouros, Jayne Bottarini, Peichen Liu, Eric Rice, Swabha Swayamdipta | N/A | N/A |
| AnaloBench: Benchmarking the Identification of Abstract and Long-context Analogies | Xiao Ye, Andrew Wang, Jacob Choi, Yining Lu, Shreya Sharma, Lingfeng Shen, Vijay Murari Tiyyala, Nicholas Andrews, Daniel Khashabi | N/A | N/A |
| SciER: An Entity and Relation Extraction Dataset for Datasets, Methods, and Tasks in Scientific Documents | Qi Zhang, Zhijia Chen, Huitong Pan, Cornelia Caragea, Longin Jan Latecki, Eduard Dragut | N/A | N/A |
| Analysis of Plan-based Retrieval for Grounded Text Generation | Ameya Godbole, Nicholas Monath, Seungyeon Kim, Ankit Singh Rawat, Andrew McCallum, Manzil Zaheer | N/A | N/A |
| Detecting Errors through Ensembling Prompts (DEEP): An End-to-End LLM Framework for Detecting Factual Errors | Alex Chandler, Devesh Surve, Hui Su | N/A | N/A |
| RLHF Can Speak Many Languages: Unlocking Multilingual Preference Optimization for LLMs | John Dang, Arash Ahmadian, Kelly Marchisio, Julia Kreutzer, Ahmet Üstün, Sara Hooker | N/A | N/A |
| Improving Logical Fallacy Reasoning with Logical Structure Tree | Yuanyuan Lei, Ruihong Huang | N/A | N/A |
| Chain and Causal Attention for Efficient Entity Tracking | Erwan Fagnou, Paul Caillon, Blaise Delattre, Alexandre Allauzen | N/A | N/A |
| BEEAR: Embedding-based Adversarial Removal of Safety Backdoors in Instruction-tuned Language Models | Yi Zeng, Weiyu Sun, Tran Ngoc Huynh, Dawn Song, Bo Li, Ruoxi Jia | N/A | N/A |
| A Bayesian Approach to Harnessing the Power of LLMs in Authorship Attribution | Zhengmian Hu, Tong Zheng, Heng Huang | N/A | N/A |
| FAC$^2$E: Better Understanding Large Language Model Capabilities by Dissociating Language and Cognition | Xiaoqiang Wang, Lingfei Wu, Tengfei Ma, Bang Liu | N/A | N/A |
| OpenSep: Leveraging Large Language Models with Textual Inversion for Open World Audio Separation | Tanvir Mahmud, Diana Marculescu | N/A | N/A |
| Language Concept Erasure for Language-invariant Dense Retrieval | Zhiqi Huang, Puxuan Yu, Shauli Ravfogel, James Allan | N/A | N/A |
| Learning Personalized Alignment for Evaluating Open-ended Text Generation | Danqing Wang, Kevin Yang, Hanlin Zhu, Xiaomeng Yang, Andrew Cohen, Lei Li, Yuandong Tian | N/A | N/A |
| Large Language Models Are Involuntary Truth-Tellers: Exploiting Fallacy Failure for Jailbreak Attacks | Yue Zhou, Henry Peng Zou, Barbara Di Eugenio, Yang Zhang | N/A | N/A |
| Turn Waste into Worth: Rectifying Top-$k$ Router of MoE | Zhiyuan Zeng, Qipeng Guo, Zhaoye Fei, Zhangyue Yin, Yunhua Zhou, Linyang Li, Tianxiang Sun, Hang Yan, Dahua Lin, Xipeng Qiu | N/A | N/A |
| Null-Shot Prompting: Rethinking Prompting Large Language Models With Hallucination | Pittawat Taveekitworachai, Febri Abdullah, Ruck Thawonmas | N/A | N/A |
| CommVQA: Situating Visual Question Answering in Communicative Contexts | Nandita Shankar Naik, Christopher Potts, Elisa Kreiss | N/A | N/A |
| Ouroboros: Generating Longer Drafts Phrase by Phrase for Faster Speculative Decoding | Weilin Zhao, Yuxiang Huang, Xu Han, Wang Xu, Chaojun Xiao, Xinrong Zhang, Yewei Fang, Kaihuo Zhang, Zhiyuan Liu, Maosong Sun | N/A | N/A |
| 1+1>2: Can Large Language Models Serve as Cross-Lingual Knowledge Aggregators? | Yue Huang, Chenrui Fan, Yuan Li, Siyuan Wu, Tianyi Zhou, Xiangliang Zhang, Lichao Sun | N/A | N/A |
| How to Leverage Demonstration Data in Alignment for Large Language Model? A Self-Imitation Learning Perspective | Teng Xiao, Mingxiao Li, Yige Yuan, Huaisheng Zhu, Chao Cui, Vasant G Honavar | N/A | N/A |
| Style-Specific Neurons for Steering LLMs in Text Style Transfer | Wen Lai, Viktor Hangya, Alexander Fraser | N/A | N/A |
| Adaptive Query Rewriting: Aligning Rewriters through Marginal Probability of Conversational Answers | Tianhua Zhang, Kun Li, Hongyin Luo, Xixin Wu, James R. Glass, Helen M. Meng | N/A | N/A |
| Grasping the Essentials: Tailoring Large Language Models for Zero-Shot Relation Extraction | Sizhe Zhou, Yu Meng, Bowen Jin, Jiawei Han | N/A | N/A |
| DA-Code: Agent Data Science Code Generation Benchmark for Large Language Models | Yiming Huang, Jianwen Luo, Yan Yu, Yitong Zhang, Fangyu Lei, Yifan Wei, Shizhu He, Lifu Huang, Xiao Liu, Jun Zhao, Kang Liu | N/A | N/A |
| Leveraging Context-aware Prompting for Commit Message Generation | Zhihua Jiang, Jianwei Chen, Dongning Rao, Guanghui Ye | N/A | N/A |
| Linguistic Bias in ChatGPT: Language Models Reinforce Dialect Discrimination | Eve Fleisig, Genevieve Smith, Madeline Bossi, Ishita Rustagi, Xavier Yin, Dan Klein | N/A | N/A |
| Lifelong Knowledge Editing for LLMs with Retrieval-Augmented Continuous Prompt Learning | Qizhou Chen, Taolin Zhang, Xiaofeng He, Dongyang Li, Chengyu Wang, Longtao Huang, Hui Xue’ | N/A | N/A |
| A Learning Rate Path Switching Training Paradigm for Version Updates of Large Language Models | Zhihao Wang, Shiyu Liu, Jianheng Huang, Wang Zheng, YiXuan Liao, Xiaoxin Chen, Junfeng Yao, Jinsong Su | N/A | N/A |
| Zero-Shot Cross-Lingual NER Using Phonemic Representations for Low-Resource Languages | Jimin Sohn, Haeji Jung, Alex Cheng, Jooeon Kang, Yilin Du, David R Mortensen | N/A | N/A |
| An Analysis and Mitigation of the Reversal Curse | Ang Lv, Kaiyi Zhang, Shufang Xie, Quan Tu, Yuhan Chen, Ji-Rong Wen, Rui Yan | N/A | N/A |
| Exploring the Practicality of Generative Retrieval on Dynamic Corpora | Soyoung Yoon, Chaeeun Kim, Hyunji Lee, Joel Jang, Sohee Yang, Minjoon Seo | N/A | N/A |
| OneNet: A Fine-Tuning Free Framework for Few-Shot Entity Linking via Large Language Model Prompting | Xukai Liu, Ye Liu, Kai Zhang, Kehang Wang, Qi Liu, Enhong Chen | N/A | N/A |
| Gotcha! Don’t trick me with unanswerable questions! Self-aligning Large Language Models for Proactively Responding to Unknown Questions | Yang Deng, Yong Zhao, Moxin Li, See-Kiong Ng, Tat-Seng Chua | N/A | N/A |
| Fewer is More: Boosting Math Reasoning with Reinforced Context Pruning | Xijie Huang, Li Lyna Zhang, Kwang-Ting Cheng, Fan Yang, Mao Yang | N/A | N/A |
| Large Language Models in the Clinic: A Comprehensive Benchmark | Fenglin Liu, Zheng Li, Qingyu Yin, Jingfeng Yang, Xianfeng Tang, Chen Luo, Ming Zeng, Haoming Jiang, Yifan Gao, Priyanka Nigam, Sreyashi Nag, Hongjian Zhou, Yining Hua, Xuan Zhou, Omid Rohanian, Anshul Thakur, Lei Clifton, Bing Yin, David A. Clifton | N/A | N/A |
| Holistic Automated Red Teaming for Large Language Models through Top-Down Test Case Generation and Multi-turn Interaction | Jinchuan Zhang, Yan Zhou, Yaxin Liu, Ziming Li, Songlin Hu | N/A | N/A |
| Householder Pseudo-Rotation: A Novel Approach to Activation Editing in LLMs with Direction-Magnitude Perspective | Van-Cuong Pham, Thien Huu Nguyen | N/A | N/A |
| DynamicER: Resolving Emerging Mentions to Dynamic Entities for RAG | Jinyoung Kim, Dayoon Ko, Gunhee Kim | N/A | N/A |
| Preserving Generalization of Language models in Few-shot Continual Relation Extraction | Quyen Tran, Nguyen Xuan Thanh, Nguyen Hoang Anh, Nam Le Hai, Trung Le, Linh Van Ngo, Thien Huu Nguyen | N/A | N/A |
| A Systematic Survey and Critical Review on Evaluating Large Language Models: Challenges, Limitations, and Recommendations | Md Tahmid Rahman Laskar, Sawsan Alqahtani, M Saiful Bari, Mizanur Rahman, Mohammad Abdullah Matin Khan, Haidar Khan, Israt Jahan, Amran Bhuiyan, Chee Wei Tan, Md Rizwan Parvez, Enamul Hoque, Shafiq Joty, Jimmy Huang | N/A | N/A |
| Consecutive Batch Model Editing with HooK Layers | Shuaiyi Li, Yang Deng, Deng Cai, Hongyuan Lu, Liang Chen, Wai Lam | N/A | N/A |
| Topic-Oriented Open Relation Extraction with A Priori Seed Generation | Linyi Ding, Jinfeng Xiao, Sizhe Zhou, Chaoqi Yang, Jiawei Han | N/A | N/A |
| Related Work and Citation Text Generation: A Survey | Xiangci Li, Jessica Ouyang | N/A | N/A |
| Curriculum Consistency Learning for Conditional Sentence Generation | Liangxin Liu, Xuebo Liu, Lian Lian, Shengjun Cheng, Jun Rao, Tengfei Yu, Hexuan Deng, Min Zhang | N/A | N/A |
| A Systematic Analysis of Large Language Models as Soft Reasoners: The Case of Syllogistic Inferences | Leonardo Bertolazzi, Albert Gatt, Raffaella Bernardi | N/A | N/A |
| Pre-training Cross-lingual Open Domain Question Answering with Large-scale Synthetic Supervision | Fan Jiang, Tom Drummond, Trevor Cohn | N/A | N/A |
| Towards an Open-Source Speech Foundation Model for EU: 950,000 Hours of Open-Source Compliant Speech Data for EU Languages | Marco Gaido, Sara Papi, Luisa Bentivogli, Alessio Brutti, Mauro Cettolo, Roberto Gretter, Marco Matassoni, Mohamed Nabih, Matteo Negri | N/A | N/A |
| Improving Knowledge Graph Completion with Structure-Aware Supervised Contrastive Learning | Jiashi Lin, Lifang Wang, Xinyu Lu, Zhongtian Hu, Wei Zhang, Wenxuan Lu | N/A | N/A |
| Contribution of Linguistic Typology to Universal Dependency Parsing: An Empirical Investigation | Ali Basirat, Navid Baradaran Hemmati | N/A | N/A |
| TRoTR: A Framework for Evaluating the Re-contextualization of Text Reuse | Francesco Periti, Pierluigi Cassotti, Stefano Montanelli, Nina Tahmasebi, Dominik Schlechtweg | N/A | N/A |
| Structured Optimal Brain Pruning for Large Language Models | Jiateng Wei, Quan Lu, Ning Jiang, Siqi Li, Jingyang Xiang, Jun Chen, Yong Liu | N/A | N/A |
| Automatically Generated Definitions and their utility for Modeling Word Meaning | Francesco Periti, David Alfter, Nina Tahmasebi | N/A | N/A |
| How Do Your Code LLMs perform? Empowering Code Instruction Tuning with Really Good Data | Yejie Wang, Keqing He, Dayuan Fu, Zhuoma GongQue, Heyang Xu, Yanxu Chen, Zhexu Wang, Yujia Fu, Guanting Dong, Muxi Diao, Jingang Wang, Mengdi Zhang, Xunliang Cai, Weiran Xu | N/A | N/A |
| MINT: A Benchmark for Evaluating Instructed Information Retrieval | Weiwei Sun, Zhengliang Shi, Wu Jiu Long, Lingyong Yan, Xinyu Ma, Yiding Liu, Min Cao, Dawei Yin, Zhaochun Ren | N/A | N/A |
| Rethinking the Evaluation of In-Context Learning for LLMs | Guoxin Yu, Lemao Liu, Mo Yu, Yue Yu, Xiang Ao | N/A | N/A |
| Cluster-Norm for Unsupervised Probing of Knowledge | Walter Laurito, Sharan Maiya, Grégoire Dhimoïla, Owen Ho Wan Yeung, Kaarel Hänni | N/A | N/A |
| Hopping Too Late: Exploring the Limitations of Large Language Models on Multi-Hop Queries | Eden Biran, Daniela Gottesman, Sohee Yang, Mor Geva, Amir Globerson | N/A | N/A |
| Enhancing Training Data Attribution for Large Language Models with Fitting Error Consideration | Kangxi Wu, Liang Pang, Huawei Shen, Xueqi Cheng | N/A | N/A |
| Where am I? Large Language Models Wandering between Semantics and Structures in Long Contexts | Seonmin Koo, Jinsung Kim, YoungJoon Jang, Chanjun Park, Heuiseok Lim | N/A | N/A |
| KARL: Knowledge-Aware Retrieval and Representations aid Retention and Learning in Students | Matthew Shu, Nishant Balepur, Shi Feng, Jordan Lee Boyd-Graber | N/A | N/A |
| Large Language Models Can Be Contextual Privacy Protection Learners | Yijia Xiao, Yiqiao Jin, Yushi Bai, Yue Wu, Xianjun Yang, Xiao Luo, Wenchao Yu, Xujiang Zhao, Yanchi Liu, Quanquan Gu, Haifeng Chen, Wei Wang, Wei Cheng | N/A | N/A |
| A SMART Mnemonic Sounds like “Glue Tonic”: Mixing LLMs with Student Feedback to Make Mnemonic Learning Stick | Nishant Balepur, Matthew Shu, Alexander Hoyle, Alison Robey, Shi Feng, Seraphina Goldfarb-Tarrant, Jordan Lee Boyd-Graber | N/A | N/A |
| Mixture-of-Skills: Learning to Optimize Data Usage for Fine-Tuning Large Language Models | Minghao Wu, Thuy-Trang Vu, Lizhen Qu, Reza Haf | N/A | N/A |
| MolTRES: Improving Chemical Language Representation Learning for Molecular Property Prediction | Jun-Hyung Park, Yeachan Kim, Mingyu Lee, Hyuntae Park, SangKeun Lee | N/A | N/A |
| First Heuristic Then Rational: Dynamic Use of Heuristics in Language Model Reasoning | Yoichi Aoki, Keito Kudo, Tatsuki Kuribayashi, Shusaku Sone, Masaya Taniguchi, Keisuke Sakaguchi, Kentaro Inui | N/A | N/A |
| Tools Fail: Detecting Silent Errors in Faulty Tools | Jimin Sun, So Yeon Min, Yingshan Chang, Yonatan Bisk | N/A | N/A |
| Pcc-tuning: Breaking the Contrastive Learning Ceiling in Semantic Textual Similarity | Bowen Zhang, Chunping Li | N/A | N/A |
| Cross-lingual Back-Parsing: Utterance Synthesis from Meaning Representation for Zero-Resource Semantic Parsing | Deokhyung Kang, Seonjeong Hwang, Yunsu Kim, Gary Lee | N/A | N/A |
| Shaking Up VLMs: Comparing Transformers and Structured State Space Models for Vision & Language Modeling | Georgios Pantazopoulos, Malvina Nikandrou, Alessandro Suglia, Oliver Lemon, Arash Eshghi | N/A | N/A |
| Are LLMs Good Zero-Shot Fallacy Classifiers? | Fengjun Pan, Xiaobao Wu, Zongrui Li, Anh Tuan Luu | N/A | N/A |
| The Mystery of In-Context Learning: A Comprehensive Survey on Interpretation and Analysis | Yuxiang Zhou, Jiazheng Li, Yanzheng Xiang, Hanqi Yan, Lin Gui, Yulan He | N/A | N/A |
| More DWUGs: Extending and Evaluating Word Usage Graph Datasets in Multiple Languages | Dominik Schlechtweg, Pierluigi Cassotti, Bill Noble, David Alfter, Sabine Schulte im Walde, Nina Tahmasebi | N/A | N/A |
| Vision-Language Model Fine-Tuning via Simple Parameter-Efficient Modification | Ming Li, Jike Zhong, Chenxin Li, Liuzhuozheng Li, Nie Lin, Masashi Sugiyama | N/A | N/A |
| ECIS-VQG: Generation of Entity-centric Information-seeking Questions from Videos | Arpan Phukan, Manish Gupta, Asif Ekbal | N/A | N/A |
| Distractor Generation in Multiple-Choice Tasks: A Survey of Methods, Datasets, and Evaluation | Elaf Alhazmi, Quan Z. Sheng, Wei Emma Zhang, Munazza Zaib, Ahoud Alhazmi | N/A | N/A |
| Evaluating $n$-Gram Novelty of Language Models Using Rusty-DAWG | William Merrill, Noah A. Smith, Yanai Elazar | N/A | N/A |
| ASL STEMpedia: Dataset and Benchmark for Interpreting STEM Articles | Kayo Yin, Chinmay Singh, Fyodor O Minakov, Vanessa Milan, Hal Daumé III, Cyril Zhang, Alex Xijie Lu, Danielle Bragg | N/A | N/A |
| Can Automatic Metrics Assess High-Quality Translations? | Sweta Agrawal, António Farinhas, Ricardo Rei, Andre Martins | N/A | N/A |
| Modeling User Preferences with Automatic Metrics: Creating a High-Quality Preference Dataset for Machine Translation | Sweta Agrawal, José G. C. de Souza, Ricardo Rei, António Farinhas, Gonçalo Faria, Patrick Fernandes, Nuno M Guerreiro, Andre Martins | N/A | N/A |
| DC-Instruct: An Effective Framework for Generative Multi-intent Spoken Language Understanding | Bowen Xing, Lizi Liao, Minlie Huang, Ivor Tsang | N/A | N/A |
| KnowTuning: Knowledge-aware Fine-tuning for Large Language Models | Yougang Lyu, Lingyong Yan, Shuaiqiang Wang, Haibo Shi, Dawei Yin, Pengjie Ren, Zhumin Chen, Maarten de Rijke, Zhaochun Ren | N/A | N/A |
| SecCoder: Towards Generalizable and Robust Secure Code Generation | Boyu Zhang, Tianyu Du, Junkai Tong, Xuhong Zhang, Kingsum Chow, Sheng Cheng, Xun Wang, Jianwei Yin | N/A | N/A |
| Nash CoT: Multi-Path Inference with Preference Equilibrium | Ziqi Zhang, Cunxiang Wang, Xiao Xiong, Yue Zhang, Donglin Wang | N/A | N/A |
| Scalable Efficient Training of Large Language Models with Low-dimensional Projected Attention | Xingtai Lv, Ning Ding, Kaiyan Zhang, Ermo Hua, Ganqu Cui, Bowen Zhou | N/A | N/A |
| Small Agent Can Also Rock! Empowering Small Language Models as Hallucination Detector | Xiaoxue Cheng, Junyi Li, Xin Zhao, Hongzhi Zhang, Fuzheng Zhang, Di ZHANG, Kun Gai, Ji-Rong Wen | N/A | N/A |
| Interpretable Composition Attribution Enhancement for Visio-linguistic Compositional Understanding | Wei Li, Zhen Huang, Xinmei Tian, Le Lu, Houqiang Li, Xu Shen, Jieping Ye | N/A | N/A |
| LLM Task Interference: An Initial Study on the Impact of Task-Switch in Conversational History | Akash Gupta, Ivaxi Sheth, Vyas Raina, Mark Gales, Mario Fritz | N/A | N/A |
| Social Bias Probing: Fairness Benchmarking for Language Models | Marta Marchiori Manerba, Karolina Stanczak, Riccardo Guidotti, Isabelle Augenstein | N/A | N/A |
| Chain-of-Note: Enhancing Robustness in Retrieval-Augmented Language Models | Wenhao Yu, Hongming Zhang, Xiaoman Pan, Peixin Cao, Kaixin Ma, Jian Li, Hongwei Wang, Dong Yu | N/A | N/A |
| DynaThink: Fast or Slow? A Dynamic Decision-Making Framework for Large Language Models | Jiabao Pan, Yan Zhang, Chen Zhang, Zuozhu Liu, Hongwei Wang, Haizhou Li | N/A | N/A |
| Revisiting Automated Evaluation for Long-form Table Question Answering in the Era of Large Language Models | Yuqi Wang, Lyuhao Chen, Yilun Zhao | N/A | N/A |
| Weak Reward Model Transforms Generative Models into Robust Causal Event Extraction Systems | Italo Luis da Silva, Hanqi Yan, Lin Gui, Yulan He | N/A | N/A |
| Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning | Zhihan Zhang, Tao Ge, Zhenwen Liang, Wenhao Yu, Dian Yu, Mengzhao Jia, Dong Yu, Meng Jiang | N/A | N/A |
| FinDVer: Explainable Claim Verification over Long and Hybrid-content Financial Documents | Yilun Zhao, Yitao Long, Tintin Jiang, Weiyuan Chen, Chengye Wang, Hongjun Liu, Xiangru Tang, Yiming Zhang, Chen Zhao, Arman Cohan | N/A | N/A |
| Extracting Prompts by Inverting LLM Outputs | Collin Zhang, John Xavier Morris, Vitaly Shmatikov | N/A | N/A |
| BiasAlert: A Plug-and-play Tool for Social Bias Detection in LLMs | Zhiting Fan, Ruizhe Chen, Ruiling Xu, Zuozhu Liu | N/A | N/A |
| VHASR: A Multimodal Speech Recognition System With Vision Hotwords | Jiliang Hu, Zuchao Li, Ping Wang, Haojun Ai, Lefei Zhang, Hai Zhao | N/A | N/A |
| A Fundamental Trade-off in Aligned Language Models and its Relation to Sampling Adaptors | Naaman Tan, Josef Valvoda, Tianyu Liu, Anej Svete, Yanxia Qin, Min-Yen Kan, Ryan Cotterell | N/A | N/A |
| Bridging Local Details and Global Context in Text-Attributed Graphs | Yaoke Wang, Yun Zhu, Wenqiao Zhang, Yueting Zhuang, Yunfei Li, Siliang Tang | N/A | N/A |
| Building Resources for Emakhuwa: Machine Translation and News Classification Benchmarks | Felermino D. M. A. Ali, Henrique Lopes Cardoso, Rui Sousa-Silva | N/A | N/A |
| RepMatch: Quantifying Cross-Instance Similarities in Representation Space | Mohammad Reza Modarres, Sina Abbasi, Mohammad Taher Pilehvar | N/A | N/A |
| Commonsense Knowledge Editing Based on Free-Text in LLMs | Xiusheng Huang, Yequan Wang, Jun Zhao, Kang Liu | N/A | N/A |
| A Closer Look at Multidimensional Online Political Incivility | Sagi Pendzel, Nir Lotan, Alon Zoizner, Einat Minkov | N/A | N/A |
| Leveraging BERT and TFIDF Features for Short Text Clustering via Alignment-Promoting Co-Training | Zetong Li, Qinliang Su, Shijing Si, Jianxing Yu | N/A | N/A |
| Applying Intrinsic Debiasing on Downstream Tasks: Challenges and Considerations for Machine Translation | Bar Iluz, Yanai Elazar, Asaf Yehudai, Gabriel Stanovsky | N/A | N/A |
| Unsupervised Named Entity Disambiguation for Low Resource Domains | Debarghya Datta, Soumajit Pramanik | N/A | N/A |
| SparseGrad: A Selective Method for Efficient Fine-tuning of MLP Layers | Viktoriia A. Chekalina, Anna Rudenko, Gleb Mezentsev, Aleksandr Mikhalev, Alexander Panchenko, Ivan Oseledets | N/A | N/A |
| MoCoKGC: Momentum Contrast Entity Encoding for Knowledge Graph Completion | Qingyang Li, Yanru Zhong, Yuchu Qin | N/A | N/A |
| ActPlan-1K: Benchmarking the Procedural Planning Ability of Visual Language Models in Household Activities | Ying Su, Zhan Ling, Haochen Shi, Cheng Jiayang, Yauwai Yim, Yangqiu Song | N/A | N/A |
| Shortcuts Arising from Contrast: Towards Effective and Lightweight Clean-Label Attacks in Prompt-Based Learning | Xiaopeng Xie, Ming YAN, Xiwen Zhou, Chenlong Zhao, Suli Wang, Yong Zhang, Joey Tianyi Zhou | N/A | N/A |
| GRASS: Compute Efficient Low-Memory LLM Training with Structured Sparse Gradients | Aashiq Muhamed, Oscar Li, David Woodruff, Mona T. Diab, Virginia Smith | N/A | N/A |
| RaTEScore: A Metric for Entity-Aware Radiology Text Similarity | Weike Zhao, Chaoyi Wu, Xiaoman Zhang, Ya Zhang, Weidi Xie | N/A | N/A |
| HalluMeasure: Fine-grained Hallucination Measurement Using Chain-of-Thought Reasoning | Shayan Ali Akbar, Md Mosharaf Hossain, Tess Wood, Si-Chi Chin, Victor Alvarez, Erica M Salinas, Erwin Cornejo | N/A | N/A |
| Learning to Rank Salient Content for Query-focused Summarization | Sajad Sotudeh, Nazli Goharian | N/A | N/A |
| Are Large Language Models Good Classifiers? A Study on Edit Intent Classification in Scientific Document Revisions | Qian Ruan, Ilia Kuznetsov, Iryna Gurevych | N/A | N/A |
| LitSearch: A Retrieval Benchmark for Scientific Literature Search | Anirudh Ajith, Mengzhou Xia, Alexis Chevalier, Tanya Goyal, Danqi Chen, Tianyu Gao | N/A | N/A |
| Open-world Multi-label Text Classification with Extremely Weak Supervision | Xintong Li, Jinya Jiang, Ria Dharmani, Jayanth Srinivasa, Gaowen Liu, Jingbo Shang | N/A | N/A |
| LMs learn governing principles of dynamical systems, revealing an in-context neural scaling law | Toni J.B. Liu, Nicolas Boulle, Raphaël Sarfati, Christopher Earls | N/A | N/A |
| AKEW: Assessing Knowledge Editing in the Wild | Xiaobao Wu, Liangming Pan, William Yang Wang, Anh Tuan Luu | N/A | N/A |
| CopyBench: Measuring Literal and Non-Literal Reproduction of Copyright-Protected Text in Language Model Generation | Tong Chen, Akari Asai, Niloofar Mireshghallah, Sewon Min, James Grimmelmann, Yejin Choi, Hannaneh Hajishirzi, Luke Zettlemoyer, Pang Wei Koh | N/A | N/A |
| Dense X Retrieval: What Retrieval Granularity Should We Use? | Tong Chen, Hongwei Wang, Sihao Chen, Wenhao Yu, Kaixin Ma, Xinran Zhao, Hongming Zhang, Dong Yu | N/A | N/A |
| Decoding Susceptibility: Modeling Misbelief to Misinformation Through a Computational Approach | Yanchen Liu, Mingyu Derek Ma, Wenna Qin, Azure Zhou, Jiaao Chen, Weiyan Shi, Wei Wang, Diyi Yang | N/A | N/A |
| Layer by Layer: Uncovering Where Multi-Task Learning Happens in Instruction-Tuned Large Language Models | Zheng Zhao, Yftah Ziser, Shay B Cohen | N/A | N/A |
| XDetox: Text Detoxification with Token-Level Toxicity Explanations | Beomseok Lee, Hyunwoo Kim, Keon Kim, Yong Suk Choi | N/A | N/A |
| Optimizing Chinese Lexical Simplification Across Word Types: A Hybrid Approach | ZiHao Xiao, Jiefu Gong, Shijin Wang, Wei Song | N/A | N/A |
| Evaluating LLMs’ Capability in Satisfying Lexical Constraints | Bingxuan Li, Yiwei Wang, Tao Meng, Nanyun Peng, Kai-Wei Chang | N/A | N/A |
| Joint Pre-Encoding Representation and Structure Embedding for Efficient and Low-Resource Knowledge Graph Completion | Chenyu Qiu, Pengjiang Qian, Chuang Wang, Jian Yao, Li Liu, Fang Wei, Eddie Y.K. Eddie | N/A | N/A |
| Improving Discriminative Capability of Reward Models in RLHF Using Contrastive Learning | Lu Chen, Rui Zheng, Binghai Wang, Senjie Jin, Caishuang Huang, Junjie Ye, Zhihao Zhang, Yuhao Zhou, Zhiheng Xi, Tao Gui, Qi Zhang, Xuanjing Huang | N/A | N/A |
| RoCEL: Advancing Table Entity Linking through Distinctive Row and Column Contexts | Yuanzheng Wang, Yixing Fan, Jiafeng Guo, Ruqing Zhang, Xueqi Cheng | N/A | N/A |
| Exploring the Role of Reasoning Structures for Constructing Proofs in Multi-Step Natural Language Reasoning with Large Language Models | Zi’ou Zheng, Christopher Malon, Martin Renqiang Min, Xiaodan Zhu | N/A | N/A |
| Efficient Overshadowed Entity Disambiguation by Mitigating Shortcut Learning | Panuthep Tasawong, Peerat Limkonchotiwat, Potsawee Manakul, Can Udomcharoenchaikit, Ekapol Chuangsuwanich, Sarana Nutanong | N/A | N/A |
| MetaBench: Planning of Multiple APIs from Various APPs for Complex User Instruction | Hongru WANG, Rui Wang, Boyang XUE, Heming Xia, Jingtao Cao, Zeming Liu, Jeff Z. Pan, Kam-Fai Wong | N/A | N/A |
| Not Everything is All You Need: Toward Low-Redundant Optimization for Large Language Model Alignment | Zhipeng Chen, Kun Zhou, Xin Zhao, Jingyuan Wang, Ji-Rong Wen | N/A | N/A |
| AudioVSR: Enhancing Video Speech Recognition with Audio Data | Xiaoda Yang, Xize Cheng, Jiaqi Duan, Hongshun Qiu, Minjie Hong, Minghui Fang, Shengpeng Ji, Jialong Zuo, Zhiqing Hong, Zhimeng Zhang, Tao Jin | N/A | N/A |
| ECCO: Can We Improve Model-Generated Code Efficiency Without Sacrificing Functional Correctness? | Siddhant Waghjale, Vishruth Veerendranath, Zhiruo Wang, Daniel Fried | N/A | N/A |
| Ladder: A Model-Agnostic Framework Boosting LLM-based Machine Translation to the Next Level | Zhaopeng Feng, Ruizhe Chen, Yan Zhang, Zijie Meng, Zuozhu Liu | N/A | N/A |
| Re-ReST: Reflection-Reinforced Self-Training for Language Agents | Zi-Yi Dou, Cheng-Fu Yang, Xueqing Wu, Kai-Wei Chang, Nanyun Peng | N/A | N/A |
| Effective Synthetic Data and Test-Time Adaptation for OCR Correction | Shuhao Guan, Cheng Xu, Moule Lin, Derek Greene | N/A | N/A |
| SRF: Enhancing Document-Level Relation Extraction with a Novel Secondary Reasoning Framework | Fu Zhang, Qi Miao, Jingwei Cheng, Hongsen Yu, Yi Yan, Xin Li, Yongxue Wu | N/A | N/A |
| FineCops-Ref: A new Dataset and Task for Fine-Grained Compositional Referring Expression Comprehension | Junzhuo Liu, Xuzheng Yang, WEIWEI LI, Peng Wang | N/A | N/A |
| Exploring the Learning Capabilities of Language Models using LEVERWORLDS | Eitan Wagner, Amir Feder, Omri Abend | N/A | N/A |
| CONTESTS: a Framework for Consistency Testing of Span Probabilities in Language Models | Eitan Wagner, Yuli Slavutsky, Omri Abend | N/A | N/A |
| DocEditAgent: Document Structure Editing Via Multimodal LLM Grounding | Manan Suri, Puneet Mathur, Franck Dernoncourt, Rajiv Jain, Vlad I Morariu, Ramit Sawhney, Preslav Nakov, Dinesh Manocha | N/A | N/A |
| DogeRM: Equipping Reward Models with Domain Knowledge through Model Merging | Tzu-Han Lin, Chen-An Li, Hung-yi Lee, Yun-Nung Chen | N/A | N/A |
| Understanding Slang with LLMs: Modelling Cross-Cultural Nuances through Paraphrasing | Ifeoluwa Wuraola, Nina Dethlefs, Daniel Marciniak | N/A | N/A |
| Unlocking Anticipatory Text Generation: A Constrained Approach for Large Language Models Decoding | Lifu Tu, Semih Yavuz, Jin Qu, Jiacheng Xu, Rui Meng, Caiming Xiong, Yingbo Zhou | N/A | N/A |
| Re-Reading Improves Reasoning in Large Language Models | Xiaohan Xu, Chongyang Tao, Tao Shen, Can Xu, Hongbo Xu, Guodong Long, Jian-Guang Lou, Shuai Ma | N/A | N/A |
| Adaptive Axes: A Pipeline for In-domain Social Stereotype Analysis | Qingcheng Zeng, Mingyu Jin, Rob Voigt | N/A | N/A |
| ERVQA: A Dataset to Benchmark the Readiness of Large Vision Language Models in Hospital Environments | Sourjyadip Ray, Kushal Gupta, Soumi Kundu, Dr Payal Arvind Kasat, Somak Aditya, Pawan Goyal | N/A | N/A |
| Human-LLM Hybrid Text Answer Aggregation for Crowd Annotations | Jiyi Li | N/A | N/A |
| Improve Student’s Reasoning Generalizability through Cascading Decomposed CoTs Distillation | Chengwei Dai, Kun Li, Wei Zhou, Songlin Hu | N/A | N/A |
| Revisiting Supervised Contrastive Learning for Microblog Classification | Junbo Huang, Ricardo Usbeck | N/A | N/A |
| BaitAttack: Alleviating Intention Shift in Jailbreak Attacks via Adaptive Bait Crafting | Rui Pu, Chaozhuo Li, Rui Ha, Litian Zhang, Lirong Qiu, Xi Zhang | N/A | N/A |
| Images Speak Louder than Words: Understanding and Mitigating Bias in Vision-Language Model from a Causal Mediation Perspective | Zhaotian Weng, Zijun Gao, Jerone Andrews, Jieyu Zhao | N/A | N/A |
| Mitigating the Language Mismatch and Repetition Issues in LLM-based Machine Translation via Model Editing | Weichuan Wang, Zhaoyi Li, Defu Lian, Chen Ma, Linqi Song, Ying Wei | N/A | N/A |
| SciAgent: Tool-augmented Language Models for Scientific Reasoning | Yubo Ma, Zhibin Gou, Junheng Hao, Ruochen Xu, Shuohang Wang, Liangming Pan, Yujiu Yang, Yixin Cao, Aixin Sun | N/A | N/A |
| Global Reward to Local Rewards: Multimodal-Guided Decomposition for Improving Dialogue Agents | Dong Won Lee, Hae Won Park, Yoon Kim, Cynthia Breazeal, Louis-Philippe Morency | N/A | N/A |
| Towards Measuring and Modeling “Culture” in LLMs: A Survey | Muhammad Farid Adilazuarda, Sagnik Mukherjee, Pradhyumna Lavania, Siddhant Shivdutt Singh, Alham Fikri Aji, Jacki O’Neill, Ashutosh Modi, Monojit Choudhury | N/A | N/A |
| ESC-Eval: Evaluating Emotion Support Conversations in Large Language Models | Haiquan Zhao, Lingyu Li, Shisong Chen, Shuqi Kong, Jiaan Wang, Kexin Huang, Tianle Gu, Yixu Wang, Jian Wang, Liang Dandan, Zhixu Li, Yan Teng, Yanghua Xiao, Yingchun Wang | N/A | N/A |
| Cultural Conditioning or Placebo? On the Effectiveness of Socio-Demographic Prompting | Sagnik Mukherjee, Muhammad Farid Adilazuarda, Sunayana Sitaram, Kalika Bali, Alham Fikri Aji, Monojit Choudhury | N/A | N/A |
| Text Fluoroscopy: Detecting LLM-Generated Text through Intrinsic Features | Xiao Yu, Kejiang Chen, Qi Yang, Weiming Zhang, Nenghai Yu | N/A | N/A |
| Hate Personified: Investigating the role of LLMs in content moderation pipeline for hate speech | Sarah Masud, Sahajpreet Singh, Viktor Hangya, Alexander Fraser, Tanmoy Chakraborty | N/A | N/A |
| Temporally Consistent Factuality Probing for Large Language Models | Ashutosh Bajpai, Aaryan Goyal, Atif Anwer, Tanmoy Chakraborty | N/A | N/A |
| A Comparison of Language Modeling and Translation as Multilingual Pretraining Objectives | Zihao Li, Shaoxiong Ji, Timothee Mickus, Vincent Segonne, Jörg Tiedemann | N/A | N/A |
| Can LLMs replace Neil deGrasse Tyson? Evaluating the Reliability of LLMs as Science Communicators | Prasoon Bajpai, Niladri Chatterjee, Subhabrata Dutta, Tanmoy Chakraborty | N/A | N/A |
| LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-Training | Tong Zhu, Xiaoye Qu, Daize Dong, Jiacheng Ruan, Jingqi Tong, Conghui He, Yu Cheng | N/A | N/A |
| Themis: A Reference-free NLG Evaluation Language Model with Flexibility and Interpretability | Xinyu Hu, Li Lin, Mingqi Gao, Xunjian Yin, Xiaojun Wan | N/A | N/A |
| Mitigating Training Imbalance in LLM Fine-Tuning via Selective Parameter Merging | Yiming Ju, Ziyi Ni, Xingrun Xing, Zhixiong Zeng, Hanyu Zhao, Siqi Fan, Zheng Zhang | N/A | N/A |
| Generating Demonstrations for In-Context Compositional Generalization in Grounded Language Learning | Sam Spilsbury, Pekka Marttinen, Alexander Ilin | N/A | N/A |
| FAME: Factual Multi-task Model Editing Benchmark | Li Zeng, Yingyu Shan, Zeming Liu, Jiashu Yao, Yuhang Guo | N/A | N/A |
| MLLM-Protector: Ensuring MLLM’s Safety without Hurting Performance | Renjie Pi, Tianyang Han, Jianshu Zhang, Yueqi XIE, Rui Pan, Qing LIAN, Hanze Dong, Jipeng Zhang, Tong Zhang | N/A | N/A |
| Leveraging Large Language Models for NLG Evaluation: Advances and Challenges | Zhen Li, Xiaohan Xu, Tao Shen, Can Xu, Jia-Chen Gu, Yuxuan Lai, Chongyang Tao, Shuai Ma | N/A | N/A |
| InfiniPot: Infinite Context Processing on Memory-Constrained LLMs | Minsoo Kim, Kyuhong Shim, Jungwook Choi, Simyung Chang | N/A | N/A |
| VideoCLIP-XL: Advancing Long Description Understanding for Video CLIP Models | Jiapeng Wang, Chengyu Wang, Kunzhe Huang, Jun Huang, Lianwen Jin | N/A | N/A |
| CorrSynth - A Correlated Sampling Method for Diverse Dataset Generation from LLMs | Abhishek Divekar, Suhas S Kowshik, Vijit Malik | N/A | N/A |
| Defining Knowledge: Bridging Epistemology and Large Language Models | Constanza Fierro, Ruchira Dhar, Filippos Stamatiou, Nicolas Garneau, Anders Søgaard | N/A | N/A |
| TKGT: Redefinition and A New Way of Text-to-Table Tasks Based on Real World Demands and Knowledge Graphs Augmented LLMs | Peiwen Jiang, Zibo Zhao, Xinbo Lin, Ruhui Ma, Yvonne Jie Chen, Jinhua Cheng | N/A | N/A |
| Free your mouse! Command Large Language Models to Generate Code to Format Word Documents | Shihao Rao, Liang Li, Jiapeng Liu, Guan Weixin, Xiyan Gao, Bing Lim | N/A | N/A |
| CMR Scaling Law: Predicting Critical Mixture Ratios for Continual Pre-training of Language Models | Jiawei Gu, Zacc Yang, Chuanghao Ding, Rui Zhao, Fei Tan | N/A | N/A |
| The Instinctive Bias: Spurious Images lead to Hallucination in MLLMs | Tianyang Han, Qing LIAN, Rui Pan, Renjie Pi, Jipeng Zhang, Shizhe Diao, Yong Lin, Tong Zhang | N/A | N/A |
| Rationale-Aware Answer Verification by Pairwise Self-Evaluation | Akira Kawabata, Saku Sugawara | N/A | N/A |
| On the Robustness of Editing Large Language Models | Xinbei Ma, Tianjie Ju, Jiyang Qiu, Zhuosheng Zhang, Hai Zhao, Lifeng Liu, Yulong Wang | N/A | N/A |
| IM-BERT: Enhancing Robustness of BERT through the Implicit Euler Method | MiHyeon Kim, Juhyoung Park, YoungBin Kim | N/A | N/A |
| Distract Large Language Models for Automatic Jailbreak Attack | Zeguan Xiao, Yan Yang, Guanhua Chen, Yun Chen | N/A | N/A |
| Exploring Space Efficiency in a Tree-based Linear Model for Extreme Multi-label Classification | He-Zhe Lin, Cheng-Hung Liu, Chih-Jen Lin | N/A | N/A |
| WorryWords: Norms of Anxiety Association for 44,450 English Words | Saif M. Mohammad | N/A | N/A |
| Finding Blind Spots in Evaluator LLMs with Interpretable Checklists | Sumanth Doddapaneni, Mohammed Safi Ur Rahman Khan, Sshubam Verma, Mitesh M Khapra | N/A | N/A |
| LONGAGENT: Achieving Question Answering for 128k-Token-Long Documents through Multi-Agent Collaboration | Jun Zhao, Can Zu, Xu Hao, Yi Lu, Wei He, Yiwen Ding, Tao Gui, Qi Zhang, Xuanjing Huang | N/A | N/A |
| AutoPersuade: A Framework for Evaluating and Explaining Persuasive Arguments | Till Raphael Saenger, Musashi Hinck, Justin Grimmer, Brandon M. Stewart | N/A | N/A |
| Towards Cross-Cultural Machine Translation with Retrieval-Augmented Generation from Multilingual Knowledge Graphs | Simone Conia, Daniel Lee, Min Li, Umar Farooq Minhas, Saloni Potdar, Yunyao Li | N/A | N/A |
| Exploring the Compositional Deficiency of Large Language Models in Mathematical Reasoning Through Trap Problems | Jun Zhao, Jingqi Tong, Yurong Mou, Ming Zhang, Qi Zhang, Xuanjing Huang | N/A | N/A |
| Scaling Laws for Linear Complexity Language Models | Xuyang Shen, Dong Li, Ruitao Leng, Zhen Qin, Weigao Sun, Yiran Zhong | N/A | N/A |
| Autoregressive Multi-trait Essay Scoring via Reinforcement Learning with Scoring-aware Multiple Rewards | Heejin Do, Sangwon Ryu, Gary Lee | N/A | N/A |
| Intrinsic Self-correction for Enhanced Morality: An Analysis of Internal Mechanisms and the Superficial Hypothesis | Guangliang Liu, Haitao Mao, Jiliang Tang, Kristen Johnson | N/A | N/A |
| ATAP: Automatic Template-Augmented Commonsense Knowledge Graph Completion via Pre-Trained Language Models | Fu Zhang, Yifan Ding, Jingwei Cheng | N/A | N/A |
| LM2: A Simple Society of Language Models Solves Complex Reasoning | Gurusha Juneja, Subhabrata Dutta, Tanmoy Chakraborty | N/A | N/A |
| Towards a Semantically-aware Surprisal Theory | Clara Meister, Mario Giulianelli, Tiago Pimentel | N/A | N/A |
| Multi-Level Information Retrieval Augmented Generation for Knowledge-based Visual Question Answering | Adjali Omar, Olivier Ferret, Sahar Ghannay, Hervé Le Borgne | N/A | N/A |
| Can We Trust the Performance Evaluation of Uncertainty Estimation Methods in Text Summarization? | Jianfeng He, Runing Yang, Linlin Yu, Changbin Li, Ruoxi Jia, Feng Chen, Ming Jin, Chang-Tien Lu | N/A | N/A |
| Is It Really Long Context if All You Need Is Retrieval? Towards Genuinely Difficult Long Context NLP | Omer Goldman, Alon Jacovi, Aviv Slobodkin, Aviya Maimon, Ido Dagan, Reut Tsarfaty | N/A | N/A |
| BPE Gets Picky: Efficient Vocabulary Refinement During Tokenizer Training | Pavel Chizhov, Catherine Arnett, Elizaveta Korotkova, Ivan P. Yamshchikov | N/A | N/A |
| SEGMENT+: Long Text Processing with Short-Context Language Models | Wei Shi, Shuang Li, Kerun Yu, Jinglei Chen, Zujie Liang, Xinhui Wu, Yuxi Qian, Feng Wei, Bo Zheng, Jiaqing Liang, Jiangjie Chen, Yanghua Xiao | N/A | N/A |
| Explicit Memory Learning with Expectation Maximization | Zhangyue Yin, Qiushi Sun, Qipeng Guo, Zhiyuan Zeng, Qinyuan Cheng, Xipeng Qiu, Xuanjing Huang | N/A | N/A |
| Learning to Generate Writing Feedback via Language Model Simulated Student Revisions | Inderjeet Jayakumar Nair, Jiaye Tan, Xiaotian Su, Anne Gere, Xu Wang, Lu Wang | N/A | N/A |
| Small LLMs Are Weak Tool Learners: A Multi-LLM Agent | Weizhou Shen, Chenliang Li, Hongzhan Chen, Ming Yan, Xiaojun Quan, Hehong Chen, Ji Zhang, Fei Huang | N/A | N/A |
| Interpreting Context Look-ups in Transformers: Investigating Attention-MLP Interactions | Clement Neo, Shay B Cohen, Fazl Barez | N/A | N/A |
| Still Not Quite There! Assessing Large Language Models for Comorbid Mental Health Diagnosis | Amey Hengle, Atharva Kulkarni, Shantanu Deepak Patankar, Rashmi Gupta | N/A | N/A |
| The Odyssey of Commonsense Causality: From Foundational Benchmarks to Cutting-Edge Reasoning | Shaobo Cui, Zhijing Jin, Bernhard Schölkopf, Boi Faltings | N/A | N/A |
| Investigating Large Language Models for Complex Word Identification in Multilingual and Multidomain Setups | Răzvan-Alexandru Smădu, David-Gabriel ION, Dumitru-Clementin Cercel, Florin Pop, Mihaela-Claudia Cercel | N/A | N/A |
| Model Editing Harms General Abilities of Large Language Models: Regularization to the Rescue | Jia-Chen Gu, Hao-Xiang Xu, Jun-Yu Ma, Pan Lu, Zhen-Hua Ling, Kai-Wei Chang, Nanyun Peng | N/A | N/A |
| Are Large Language Models In-Context Personalized Summarizers? Get an iCOPERNICUS Test Done! | Divya Patel, Pathik Patel, Ankush Chander, Sourish Dasgupta, Tanmoy Chakraborty | N/A | N/A |
| MediTOD: An English Dialogue Dataset for Medical History Taking with Comprehensive Annotations | Vishal Vivek Saley, Goonjan Saha, Rocktim Jyoti Das, Dinesh Raghu, Mausam | N/A | N/A |
| YesBut | Abhilash Nandy, Yash Agarwal, Ashish Patwa, Millon Madhur Das, Aman Bansal, ANKIT RAJ, Pawan Goyal, Niloy Ganguly | N/A | N/A |
| Scaling Cognitive Limits: Identifying Working Memory Limits in LLMs | Chunhui Zhang, Yiren Jian, Zhongyu Ouyang, Soroush Vosoughi | N/A | N/A |
| RAFT: Realistic Attacks to Fool Text Detectors | James Liyuan Wang, Ran Li, Junfeng Yang, Chengzhi Mao | N/A | N/A |
| LLM-Evolve: Evaluation for LLM’s Evolving Capability on Benchmarks | Jiaxuan You, Mingjie Liu, Shrimai Prabhumoye, Mostofa Patwary, Mohammad Shoeybi, Bryan Catanzaro | N/A | N/A |
| FFN-SkipLLM: A Hidden Gem for Autoregressive Decoding with Adaptive Feed Forward Skipping | AJAY KUMAR JAISWAL, Bodun Hu, Lu Yin, Yeonju Ro, Tianlong Chen, Shiwei Liu, Aditya Akella | N/A | N/A |
| LLM-based Code-Switched Text Generation for Grammatical Error Correction | Tom Potter, Zheng Yuan | N/A | N/A |
| Deciphering the Interplay of Parametric and Non-Parametric Memory in RAG Models | Mehrdad Farahani, Richard Johansson | N/A | N/A |
| On Efficient Language and Vision Assistants for Visually-Situated Natural Language Understanding: What Matters in Reading and Reasoning | Geewook Kim, Minjoon Seo | N/A | N/A |
| Community-Cross-Instruct: Unsupervised Instruction Generation for Aligning Large Language Models to Online Communities | Zihao He, Rebecca Dorn, Minh Duc Chu, Siyi Guo, Kristina Lerman | N/A | N/A |
| Mathador-LM: A Dynamic Benchmark for Mathematical Reasoning on Large Language Models | Eldar Kurtic, Amir Moeini, Dan Alistarh | N/A | N/A |
| Reasoning Paths with Reference Objects Elicit Quantitative Spatial Reasoning in Large Vision-Language Models | Yuan-Hong Liao, Rafid Mahmood, Sanja Fidler, David Acuna | N/A | N/A |
| One Thousand and One Pairs: A “novel” challenge for long-context language models | Marzena Karpinska, Katherine Thai, Kyle Lo, Tanya Goyal, Mohit Iyyer | N/A | N/A |
| Foundational Autoraters: Taming Large Language Models for Better Automatic Evaluation | Tu Vu, Kalpesh Krishna, Salaheddin Alzubi, Chris Tar, Manaal Faruqui, Yun-Hsuan Sung | N/A | N/A |
| Do LLMs learn a true syntactic universal? | John T. Hale, Miloš Stanojević | N/A | N/A |
| GDPO: Learning to Align Language Models with Diversity Using GFlowNets | Oh Joon Kwon, Daiki E. Matsunaga, Kee-Eung Kim | N/A | N/A |
| How Susceptible are Large Language Models to Ideological Manipulation? | Kai Chen, Zihao He, Jun Yan, Taiwei Shi, Kristina Lerman | N/A | N/A |
| Measuring Psychological Depth in Language Models | Fabrice Y Harel-Canada, Hanyu Zhou, Sreya Muppalla, Zeynep Senahan Yildiz, Miryung Kim, Nanyun Peng, Amit Sahai | N/A | N/A |
| Media Attitude Detection via Framing Analysis with Events and their Relations | Jin Zhao, Jingxuan Tu, Han Du, Nianwen Xue | N/A | N/A |
| Fill In The Gaps: Model Calibration and Generalization with Synthetic Data | Yang Ba, Michelle V Mancenido, Rong Pan | N/A | N/A |
| Adaptive Question Answering: Enhancing Language Model Proficiency for Addressing Knowledge Conflicts with Source Citations | Sagi Shaier, Ari Kobren, Philip V. Ogren | N/A | N/A |
| Granular Privacy Control for Geolocation with Vision Language Models | Ethan Mendes, Yang Chen, James Hays, Sauvik Das, Wei Xu, Alan Ritter | N/A | N/A |
| MedReadMe: A Systematic Study for Fine-grained Sentence Readability in Medical Domain | Chao Jiang, Wei Xu | N/A | N/A |
| MemeCLIP: Leveraging CLIP Representations for Multimodal Meme Classification | Siddhant Bikram Shah, Shuvam Shiwakoti, Maheep Chaudhary, Haohan Wang | N/A | N/A |
| FlipGuard: Defending Preference Alignment against Update Regression with Constrained Optimization | Mingye Zhu, Yi Liu, Quan Wang, Junbo Guo, Zhendong Mao | N/A | N/A |
| StorySpark: Expert-Annotated QA Pairs with Real-World Knowledge for Children Storytelling | Jiaju Chen, Yuxuan Lu, Shao Zhang, Bingsheng Yao, Yuanzhe Dong, Ying Xu, Yunyao Li, Qianwen Wang, Dakuo Wang, Yuling Sun | N/A | N/A |
| MedCoT: Medical Chain of Thought via Hierarchical Expert | Jiaxiang Liu, Yuan Wang, Jiawei Du, Joey Tianyi Zhou, Zuozhu Liu | N/A | N/A |
| Varying Sentence Representations via Condition-Specified Routers | Ziyong Lin, Quansen Wang, Zixia Jia, Zilong Zheng | N/A | N/A |
| Inductive-Deductive Strategy Reuse for Multi-Turn Instructional Dialogues | Jiao Ou, Jiayu Wu, Che Liu, Fuzheng Zhang, Di ZHANG, Kun Gai | N/A | N/A |
| Information Flow Routes: Automatically Interpreting Language Models at Scale | Javier Ferrando, Elena Voita | N/A | N/A |
| A Simple yet Effective Training-free Prompt-free Approach to Chinese Spelling Correction Based on Large Language Models | Houquan Zhou, Zhenghua Li, Bo Zhang, Chen Li, Shaopeng Lai, Ji Zhang, Fei Huang, Min Zhang | N/A | N/A |
| Low-rank Subspace for Binding in Large Language Models | Qin Dai, Benjamin Heinzerling, Kentaro Inui | N/A | N/A |
| CoSafe: Evaluating Large Language Model Safety in Multi-Turn Dialogue Coreference | Erxin Yu, Jing Li, Ming Liao, Siqi Wang, GAO Zuchen, Fei Mi, Lanqing HONG | N/A | N/A |
| ClimRetrieve: A Benchmarking Dataset for Information Retrieval from Corporate Climate Disclosures | Tobias Schimanski, Jingwei Ni, Roberto Spacey Martín, Nicola Ranger, Markus Leippold | N/A | N/A |
| Context-Aware Adapter Tuning for Few-Shot Relation Learning in Knowledge Graphs | LIU Ran, Zhongzhou Liu, Xiaoli Li, Yuan Fang | N/A | N/A |
| Zero-Shot Detection of LLM-Generated Text using Token Cohesiveness | Shixuan Ma, Quan Wang | N/A | N/A |
| Dual-oriented Disentangled Network with Counterfactual Intervention for Multimodal Intent Detection | Zhanpeng Chen, Zhihong Zhu, Xianwei Zhuang, Zhiqi Huang, Yuexian Zou | N/A | N/A |
| From LLMs to MLLMs: Exploring the Landscape of Multimodal Jailbreaking | Siyuan Wang, Zhuohan Long, Zhihao Fan, Zhongyu Wei | N/A | N/A |
| Symbolic Working Memory Enhances Language Models for Complex Rule Application | Siyuan Wang, Zhongyu Wei, Yejin Choi, Xiang Ren | N/A | N/A |
| LLoCO: Learning Long Contexts Offline | Sijun Tan, Xiuyu Li, Shishir G Patil, Ziyang Wu, Tianjun Zhang, Kurt Keutzer, Joseph E. Gonzalez, Raluca Popa | N/A | N/A |
| Don’t Forget Your Reward Values: Language Model Alignment via Value-based Calibration | Xin Mao, Feng-Lin Li, Huimin Xu, Wei Zhang, WANG CHEN, Anh Tuan Luu | N/A | N/A |
| Mentor-KD: Making Small Language Models Better Multi-step Reasoners | Hojae Lee, Junho Kim, SangKeun Lee | N/A | N/A |
| Are Large Language Models Capable of Generating Human-Level Narratives? | Yufei Tian, Tenghao Huang, Miri Liu, Derek Jiang, Alexander Spangher, Muhao Chen, Jonathan May, Nanyun Peng | N/A | N/A |
| MP2D: An Automated Topic Shift Dialogue Generation Framework Leveraging Knowledge Graphs | Yerin Hwang, Yongil Kim, Yunah Jang, Jeesoo Bang, Hyunkyung Bae, Kyomin Jung | N/A | N/A |
| Can Large Language Models Enhance Predictions of Disease Progression? Investigating Through Disease Network Link Prediction | Haohui Lu, Usman Naseem | N/A | N/A |
| Searching for Best Practices in Retrieval-Augmented Generation | Xiaohua Wang, Zhenghua Wang, Xuan Gao, Feiran Zhang, Yixin Wu, Zhibo Xu, Tianyuan Shi, Zhengyuan Wang, Shizheng Li, Qi Qian, Ruicheng Yin, Changze Lv, Xiaoqing Zheng, Xuanjing Huang | N/A | N/A |
| Moral Foundations of Large Language Models | Marwa Abdulhai, Gregory Serapio-García, Clement CREPY, Daria Valter, John Canny, Natasha Jaques | N/A | N/A |
| The Zeno’s Paradox of ‘Low-Resource’ Languages | Hellina Hailu Nigatu, Atnafu Lambebo Tonja, Benjamin Rosman, Thamar Solorio, Monojit Choudhury | N/A | N/A |
| Knowledge Planning in Large Language Models for Domain-Aligned Counseling Summarization | Aseem Srivastava, Smriti Joshi, Tanmoy Chakraborty, Md Shad Akhtar | N/A | N/A |
| Enhancing Post-Hoc Attributions in Long Document Comprehension via Coarse Grained Answer Decomposition | Pritika Ramu, Koustava Goswami, Apoorv Saxena, Balaji Vasan Srinivasan | N/A | N/A |
| From Descriptive Richness to Bias: Unveiling the Dark Side of Generative Image Caption Enrichment | Yusuke Hirota, Ryo Hachiuma, Chao-Han Huck Yang, Yuta Nakashima | N/A | N/A |
| Pruning via Merging: Compressing LLMs via Manifold Alignment Based Layer Merging | Deyuan Liu, Zhanyue Qin, Hairu Wang, Zhao Yang, Zecheng Wang, Fangying Rong, Qingbin Liu, Yanchao Hao, Bo Li, Xi Chen, Cunhang Fan, Zhao Lv, Dianhui Chu, Zhiying Tu, Dianbo Sui | N/A | N/A |
| Embedded Named Entity Recognition using Probing Classifiers | Nicholas Popovic, Michael Färber | N/A | N/A |
| Unleashing the Power of Emojis in Texts via Self-supervised Graph Pre-Training | Zhou Zhang, Dongzeng Tan, Jiaan Wang, Yilong Chen, Jiarong Xu | N/A | N/A |
| Data Contamination Can Cross Language Barriers | Feng Yao, Yufan Zhuang, Zihao Sun, Sunan Xu, Animesh Kumar, Jingbo Shang | N/A | N/A |
| Automated Essay Scoring: A Reflection on the State of the Art | Shengjie Li, Vincent Ng | N/A | N/A |
| Encouraging Divergent Thinking in Large Language Models through Multi-Agent Debate | Tian Liang, Zhiwei He, Wenxiang Jiao, Xing Wang, Yan Wang, Rui Wang, Yujiu Yang, Shuming Shi, Zhaopeng Tu | N/A | N/A |
| Unveiling and Consulting Core Experts in Retrieval-Augmented MoE-based LLMs | Xin Zhou, Ping Nie, Yiwen Guo, Haojie Wei, Zhanqiu Zhang, Pasquale Minervini, Ruotian Ma, Tao Gui, Qi Zhang, Xuanjing Huang | N/A | N/A |
| CURE: Context- and Uncertainty-Aware Mental Disorder Detection | Migyeong Kang, Goun Choi, Hyolim Jeon, Ji hyun An, Daejin Choi, Jinyoung Han | N/A | N/A |
| PepRec: Progressive Enhancement of Prompting for Recommendation | Yakun Yu, Shi-ang Qi, Baochun Li, Di Niu | N/A | N/A |
| In-Context Compositional Generalization for Large Vision-Language Models | Chuanhao Li, Chenchen Jing, Zhen Li, Mingliang Zhai, Yuwei Wu, Yunde Jia | N/A | N/A |
| Improving Zero-shot LLM Re-Ranker with Risk Minimization | Xiaowei Yuan, Zhao Yang, Yequan Wang, Jun Zhao, Kang Liu | N/A | N/A |
| Game on Tree: Visual Hallucination Mitigation via Coarse-to-Fine View Tree and Game Theory | Xianwei Zhuang, Zhihong Zhu, Zhanpeng Chen, Yuxin Xie, Liming Liang, Yuexian Zou | N/A | N/A |
| Label Confidence Weighted Learning for Target-level Sentence Simplification | Jingshen Zhang, Xin Ying Qiu | N/A | N/A |
| Quantum Recurrent Architectures for Text Classification | Wenduan Xu, Stephen Clark, Douglas Brown, Gabriel Matos, Konstantinos Meichanetzidis | N/A | N/A |
| Tree of Problems: Improving structured problem solving with compositionality | Armel Randy Zebaze, Benoît Sagot, Rachel Bawden | N/A | N/A |
| What the Harm? Quantifying the Tangible Impact of Gender Bias in Machine Translation with a Human-centered Study | Beatrice Savoldi, Sara Papi, Matteo Negri, Ana Guerberof-Arenas, Luisa Bentivogli | N/A | N/A |
| Seg2Act: Global Context-aware Action Generation for Document Logical Structuring | Zichao Li, Shaojie He, Meng Liao, Xuanang Chen, Yaojie Lu, Hongyu Lin, Yanxiong Lu, Xianpei Han, Le Sun | N/A | N/A |
| Is C4 Dataset Enough for Pruning? An Investigation of Calibration Data for LLM Pruning | Abhinav Bandari, Lu Yin, Cheng-Yu Hsieh, Ajay Kumar Jaiswal, Tianlong Chen, Li Shen, Ranjay Krishna, Shiwei Liu | N/A | N/A |
| Revisiting the Robustness of Watermarking to Paraphrasing Attacks | Saksham Rastogi, Danish Pruthi | N/A | N/A |
| A Survey of Ontology Expansion for Conversational Understanding | Jinggui Liang, Yuxia Wu, Yuan Fang, Hao Fei, Lizi Liao | N/A | N/A |
| Calibrating Language Models with Adaptive Temperature Scaling | Johnathan Xie, Annie S Chen, Yoonho Lee, Eric Mitchell, Chelsea Finn | N/A | N/A |
| Which Programming Language and What Features at Pre-training Stage Affect Downstream Logical Inference Performance? | Fumiya Uchiyama, Takeshi Kojima, Andrew Gambardella, Qi Cao, Yusuke Iwasawa, Yutaka Matsuo | N/A | N/A |
| Why do objects have many names? A study on word informativeness in language use and lexical systems. | Eleonora Gualdoni, Gemma Boleda | N/A | N/A |
| Dual-Space Knowledge Distillation for Large Language Models | Songming Zhang, Xue Zhang, Zengkui Sun, Yufeng Chen, Jinan Xu | N/A | N/A |
| NoiseBench: Benchmarking the Impact of Real Label Noise on Named Entity Recognition | Elena Merdjanovska, Ansar Aynetdinov, Alan Akbik | N/A | N/A |
| On the Universal Truthfulness Hyperplane Inside LLMs | Junteng Liu, Shiqi Chen, Yu Cheng, Junxian He | N/A | N/A |
| PairDistill: Pairwise Relevance Distillation for Dense Retrieval | Chao-Wei Huang, Yun-Nung Chen | N/A | N/A |
| User Inference Attacks on Large Language Models | Nikhil Kandpal, Krishna Pillutla, Alina Oprea, Peter Kairouz, Christopher A. Choquette-Choo, Zheng Xu | N/A | N/A |
| HiFT: A Hierarchical Full Parameter Fine-Tuning Strategy | YongKang Liu, Yiqun Zhang, Qian Li, Tong Liu, Shi Feng, Daling Wang, Yifei Zhang, Hinrich Schuetze | N/A | N/A |
| Investigating and Mitigating Object Hallucinations in Pretrained Vision-Language (CLIP) Models | Yufang Liu, Tao Ji, Changzhi Sun, Yuanbin Wu, Aimin Zhou | N/A | N/A |
| Simultaneous Masking, Not Prompting Optimization: A Paradigm Shift in Fine-tuning LLMs for Simultaneous Translation | Matthew Raffel, Victor Agostinelli, Lizhong Chen | N/A | N/A |
| ToolPlanner: A Tool Augmented LLM for Multi Granularity Instructions with Path Planning and Feedback | Qinzhuo Wu, Wei Liu, Jian Luan, Bin Wang | N/A | N/A |
| Please note that I’m just an AI: Analysis of Behavior Patterns of LLMs in (Non-)offensive Speech Identification | Esra Dönmez, Thang Vu, Agnieszka Falenska | N/A | N/A |
| How to Compute the Probability of a Word | Tiago Pimentel, Clara Meister | N/A | N/A |
| A linguistically-motivated evaluation methodology for unraveling model’s abilities in reading comprehension tasks | Elie Antoine, Frederic Bechet, Géraldine Damnati, Philippe Langlais | N/A | N/A |
| GuardBench: A Large-Scale Benchmark for Guardrail Models | Elias Bassani, Ignacio Sanchez | N/A | N/A |
| Generate-on-Graph: Treat LLM as both Agent and KG for Incomplete Knowledge Graph Question Answering | Yao Xu, Shizhu He, Jiabei Chen, Zihao Wang, Yangqiu Song, Hanghang Tong, Guang Liu, Jun Zhao, Kang Liu | N/A | N/A |
| Language models and brains align due to more than next-word prediction and word-level information | Gabriele Merlin, Mariya Toneva | N/A | N/A |
| LLMEdgeRefine: Enhancing Text Clustering with LLM-Based Boundary Point Refinement | Zijin Feng, Luyang Lin, Lingzhi Wang, Hong Cheng, Kam-Fai Wong | N/A | N/A |
| CasiMedicos-Arg: A Medical Question Answering Dataset Annotated with Explanatory Argumentative Structures | Ekaterina Sviridova, Anar Yeginbergen, Ainara Estarrona, Elena Cabrio, Serena Villata, Rodrigo Agerri | N/A | N/A |
| A Simple and Effective $L_2$ Norm-Based Strategy for KV Cache Compression | Alessio Devoto, Yu Zhao, Simone Scardapane, Pasquale Minervini | N/A | N/A |
| GOME: Grounding-based Metaphor Binding With Conceptual Elaboration For Figurative Language Illustration | Linhao Zhang, Jintao Liu, Li Jin, Hao Wang, Kaiwen Wei, Guangluan Xu | N/A | N/A |
| D3CODE: Disentangling Disagreements in Data across Cultures on Offensiveness Detection and Evaluation | Aida Mostafazadeh Davani, Mark Diaz, Dylan K Baker, Vinodkumar Prabhakaran | N/A | N/A |
| PALM: Few-Shot Prompt Learning for Audio Language Models | Asif Hanif, Maha Tufail Agro, Mohammad Areeb Qazi, Hanan Aldarmaki | N/A | N/A |
| Annotator-Centric Active Learning for Subjective NLP Tasks | Michiel van der Meer, Neele Falk, Pradeep K. Murukannaiah, Enrico Liscio | N/A | N/A |
| Lost in Tokenization: How to Measure Word Surprisal From LM Token Probabilities | Luca Malagutti, Juan Luis Gastaldi, Brian DuSell, Tim Vieira, Ryan Cotterell, Mario Giulianelli | N/A | N/A |
| Enhanced Hallucination Detection in Neural Machine Translation through Simple Detector Aggregation | Anas Himmi, Guillaume Staerman, Marine Picot, Pierre Colombo, Nuno M Guerreiro | N/A | N/A |
| Jailbreaking LLMs with Arabic Transliteration and Arabizi | Mansour Al Ghanim, Saleh Almohaimeed, Mengxin Zheng, Yan Solihin, Qian Lou | N/A | N/A |
| Who is better at math, Jenny or Jingzhen? Uncovering Stereotypes in Large Language Models | Zara Siddique, Liam Turner, Luis Espinosa-Anke | N/A | N/A |
| Instruction Matters, a Simple yet Effective Task Selection Approach in Instruction Tuning for Specific Tasks | Changho Lee, Janghoon Han, Seonghyeon Ye, Stanley Jungkyu Choi, Honglak Lee, Kyunghoon Bae | N/A | N/A |
| Recurrent Alignment with Hard Attention for Hierarchical Text Rating | Chenxi Lin, Ren Jiayu, Guoxiu He, Zhuoren Jiang, Haiyan Yu, Xiaomin Zhu | N/A | N/A |
| CHESS: Optimizing LLM Inference via Channel-Wise Thresholding and Selective Sparsification | Junhui He, Shangyu Wu, Weidong Wen, Chun Jason Xue, Qingan Li | N/A | N/A |
| Semformer: Transformer Language Models with Semantic Planning | Yongjing Yin, Junran Ding, Kai Song, Yue Zhang | N/A | N/A |
| DocCGen: Document-based Controlled Code Generation | Sameer Pimparkhede, Mehant Kammakomati, Srikanth G. Tamilselvam, Prince Kumar, Ashok Pon Kumar, Pushpak Bhattacharyya | N/A | N/A |
| Semantics and Sentiment: Cross-lingual Variations in Emoji Use | Giulio Zhou, Sydelle de Souza, Ella Markham, Oghenetekevwe Kwakpovwe, Sumin Zhao | N/A | N/A |
| The Emergence of Compositional Languages in Multi-entity Referential Games: from Image to Graph Representations | Daniel Akkerman, Phong Le, Raquel G. Alhama | N/A | N/A |
| Transformers are Multi-State RNNs | Matanel Oren, Michael Hassid, Nir Yarden, Yossi Adi, Roy Schwartz | N/A | N/A |
| Evaluating Large Language Models along Dimensions of Language Variation: A Systematik Invesdigatiom uv Cross-lingual Generalization | Niyati Bafna, Kenton Murray, David Yarowsky | N/A | N/A |
| Fuse to Forget: Bias Reduction and Selective Memorization through Model Fusion | Kerem Zaman, Leshem Choshen, Shashank Srivastava | N/A | N/A |
| Collective Critics for Creative Story Generation | Minwook Bae, Hyounghun Kim | N/A | N/A |
| Surprisal Curves of Discourse | Eleftheria Tsipidi, Franz Nowak, Ryan Cotterell, Ethan Wilcox, Mario Giulianelli, Alex Warstadt | N/A | N/A |
| Model-based Preference Optimization in Abstractive Summarization without Human Feedback | Jaepill Choi, Kyubyung Chae, Jiwoo Song, Yohan Jo, Taesup Kim | N/A | N/A |
| Are Data Augmentation Methods in Named Entity Recognition Applicable for Uncertainty Estimation? | Wataru Hashimoto, Hidetaka Kamigaito, Taro Watanabe | N/A | N/A |
| NeuroTrialNER: An Annotated Corpus for Neurological Diseases and Therapies in Clinical Trial Registries | Simona Emilova Doneva, Tilia Ellendorff, Jean-Philippe Goldman, Amelia Elaine Cannon, Gerold Schneider, Beate Sick, Benjamin Victor Ineichen | N/A | N/A |
| Do Explanations Help or Hurt? Saliency Maps vs Natural Language Explanations in a Clinical Decision-Support Setting | Maxime Guillaume Kayser, Bayar Menzat, Cornelius Emde, Bogdan Alexandru Bercean, Alex Novak, Abdalá Trinidad Espinosa Morgado, Bartlomiej Papiez, Susanne Gaube, Thomas Lukasiewicz, Oana-Maria Camburu | N/A | N/A |
| Towards Faithful Knowledge Graph Explanation Through Deep Alignment in Commonsense Question Answering | Weihe Zhai, Arkaitz Zubiaga, Bingquan Liu, Chengjie Sun, Yalong Zhao | N/A | N/A |
| Generation with Dynamic Vocabulary | Yanting Liu, Tao Ji, Yuanbin Wu, Xiaoling Wang, Changzhi Sun | N/A | N/A |
| Argument Relation Classification through Discourse Markers and Adversarial Training | Michele Luca Contalbo, Francesco Guerra, Matteo Paganelli | N/A | N/A |
| Getting The Most Out of Your Training Data: Exploring Unsupervised Tasks for Morphological Inflection | Abhishek Purushothama, Adam Wiemerslage, Katharina von der Wense | N/A | N/A |
| Link, Synthesize, Retrieve: Universal Document Linking for Zero-Shot Information Retrieval | Dae Yon Hwang, Bilal Taha, Harshit Pande, Yaroslav Nechaev | N/A | N/A |
| Efficient Unseen Language Adaptation for Multilingual Pre-Trained Language Models | Po-Heng Chen, Yun-Nung Chen | N/A | N/A |
| Prove Your Point!: Bringing Proof-Enhancement Principles to Argumentative Essay Generation | Ruiyu Xiao, Lei Wu, Yuhang Gou, Weinan Zhang, Ting Liu | N/A | N/A |
| TV-TREES: Multimodal Entailment Trees for Neuro-Symbolic Video Reasoning | Kate Sanders, Nathaniel Weir, Benjamin Van Durme | N/A | N/A |
| Unsupervised Extraction of Dialogue Policies from Conversations | Makesh Narsimhan Sreedhar, Traian Rebedea, Christopher Parisien | N/A | N/A |
| GRIZAL: Generative Prior-guided Zero-Shot Temporal Action Localization | Onkar Kishor Susladkar, Gayatri Sudhir Deshmukh, Vandan Gorade, Sparsh Mittal | N/A | N/A |
| Preserving Multi-Modal Capabilities of Pre-trained VLMs for Improving Vision-Linguistic Compositionality | Youngtaek Oh, Jae Won Cho, Dong-Jin Kim, In So Kweon, Junmo Kim | N/A | N/A |
| FoodieQA: A Multimodal Dataset for Fine-Grained Understanding of Chinese Food Culture | Wenyan Li, Crystina Zhang, Jiaang Li, Qiwei Peng, Raphael Tang, Li Zhou, Weijia Zhang, Guimin Hu, Yifei Yuan, Anders Søgaard, Daniel Hershcovich, Desmond Elliott | N/A | N/A |
| A Two-Step Approach for Data-Efficient French Pronunciation Learning | Hoyeon Lee, Hyeeun Jang, Jonghwan Kim, Jaemin Kim | N/A | N/A |
| Exploring Intra and Inter-language Consistency in Embeddings with ICA | Rongzhi Li, Takeru Matsuda, Hitomi Yanaka | N/A | N/A |
| DetoxLLM: A Framework for Detoxification with Explanations | Md Tawkat Islam Khondaker, Muhammad Abdul-Mageed, Laks V. S. Lakshmanan | N/A | N/A |
| Building a Multi-Platform, BERT Classifier for Detecting Connective Language | Josephine Lukito, Bin Chen, Gina M. Masullo, Natalie Jomini Stroud | N/A | N/A |
| ShadowLLM: Predictor-based Contextual Sparsity for Large Language Models | Yash Akhauri, Ahmed F AbouElhamayed, Jordan Dotzel, Zhiru Zhang, Alexander M Rush, Safeen Huda, Mohamed S Abdelfattah | N/A | N/A |
| Emotion Granularity from Text: An Aggregate-Level Indicator of Mental Health | Krishnapriya Vishnubhotla, Daniela Teodorescu, Mallory J Feldman, Kristen Lindquist, Saif M. Mohammad | N/A | N/A |
| BLSP-Emo: Towards Empathetic Large Speech-Language Models | Chen Wang, Minpeng Liao, Zhongqiang Huang, Junhong Wu, Chengqing Zong, Jiajun Zhang | N/A | N/A |
| SynthesizRR: Generating Diverse Datasets with Retrieval Augmentation | Abhishek Divekar, Greg Durrett | N/A | N/A |
| Multimodal Self-Instruct: Synthetic Abstract Image and Visual Reasoning Instruction Using Language Model | Wenqi Zhang, Zhenglin Cheng, Yuanyu He, Mengna Wang, Yongliang Shen, Zeqi Tan, Guiyang Hou, Mingqian He, Yanna Ma, Weiming Lu, Yueting Zhuang | N/A | N/A |
| DataNarrative: Automated Data-Driven Storytelling with Visualizations and Texts | Mohammed Saidul Islam, Md Tahmid Rahman Laskar, Md Rizwan Parvez, Enamul Hoque, Shafiq Joty | N/A | N/A |
| DEM: Distribution Edited Model for Training with Mixed Data Distributions | Dhananjay Ram, Aditya Rawal, Momchil Hardalov, Nikolaos Pappas, Sheng Zha | N/A | N/A |
| Altogether: Image Captioning via Re-aligning Alt-text | Hu Xu, Po-Yao Huang, Xiaoqing Tan, Ching-Feng Yeh, Jacob Kahn, Christine Jou, Gargi Ghosh, Omer Levy, Luke Zettlemoyer, Wen-tau Yih, Shang-Wen Li, Saining Xie, Christoph Feichtenhofer | N/A | N/A |
| VerifyMatch: A Semi-Supervised Learning Paradigm for Natural Language Inference with Confidence-Aware MixUp | Seo Yeon Park, Cornelia Caragea | N/A | N/A |
| CaT-Bench: Benchmarking Language Model Understanding of Causal and Temporal Dependencies in Plans | Yash Kumar Lal, Vanya Cohen, Nathanael Chambers, Niranjan Balasubramanian, Ray Mooney | N/A | N/A |
| Mitigating the Impact of Reference Quality on Evaluation of Summarization Systems with Reference-Free Metrics | Théo Gigant, Camille Guinaudeau, Marc Decombas, Frederic Dufaux | N/A | N/A |
| An Empirical Analysis of the Writing Styles of Persona-Assigned LLMs | Manuj Malik, Jing Jiang, Kian Ming A. Chai | N/A | N/A |
| Investigating the Role of Instruction Variety and Task Difficulty in Robotic Manipulation Tasks | Amit Parekh, Nikolas Vitsakis, Alessandro Suglia, Ioannis Konstas | N/A | N/A |
| GPT vs RETRO: Exploring the Intersection of Retrieval and Parameter-Efficient Fine-Tuning | Aleksander Ficek, Jiaqi Zeng, Oleksii Kuchaiev | N/A | N/A |
| CoCoST: Automatic Complex Code Generation with Online Searching and Correctness Testing | Xinyi He, Jiaru Zou, Yun Lin, Mengyu Zhou, Shi Han, Zejian Yuan, Dongmei Zhang | N/A | N/A |
| Sequential API Function Calling Using GraphQL Schema | Avirup Saha, Lakshmi Mandal, Balaji Ganesan, Sambit Ghosh, Renuka Sindhgatta, Carlos Eberhardt, Dan Debrunner, Sameep Mehta | N/A | N/A |
| The Illusion of Competence: Evaluating the Effect of Explanations on Users’ Mental Models of Visual Question Answering Systems | Judith Sieker, Simeon Junker, Ronja Utescher, Nazia Attari, Heiko Wersing, Hendrik Buschmeier, Sina Zarrieß | N/A | N/A |
| Re-Evaluating Evaluation for Multilingual Summarization | Jessica Zosa Forde, Ruochen Zhang, Lintang Sutawika, Alham Fikri Aji, Samuel Cahyawijaya, Genta Indra Winata, Minghao Wu, Carsten Eickhoff, Stella Biderman, Ellie Pavlick | N/A | N/A |
| Video-Text Prompting for Weakly Supervised Spatio-Temporal Video Grounding | Heng Zhao, Zhao Yinjie, Bihan Wen, Yew-Soon Ong, Joey Tianyi Zhou | N/A | N/A |
| A Fast and Sound Tagging Method for Discontinuous Named-Entity Recognition | Caio Filippo Corro | N/A | N/A |
| Factuality of Large Language Models in the Year 2024 | Yuxia Wang, Minghan Wang, Muhammad Arslan Manzoor, Fei Liu, Georgi Nenkov Georgiev, Rocktim Jyoti Das, Preslav Nakov | N/A | N/A |
| Discovering Biases in Information Retrieval Models Using Relevance Thesaurus as Global Explanation | Youngwoo Kim, Razieh Rahimi, James Allan | N/A | N/A |
| Adaptable Moral Stances of Large Language Models on Sexist Content: Implications for Society and Gender Discourse | Rongchen Guo, Isar Nejadgholi, Hillary Dawkins, Kathleen C. Fraser, Svetlana Kiritchenko | N/A | N/A |
| DISCERN: Decoding Systematic Errors in Natural Language for Text Classifiers | Rakesh R Menon, Shashank Srivastava | N/A | N/A |
| IntCoOp: Interpretability-Aware Vision-Language Prompt Tuning | Soumya Suvra Ghosal, Samyadeep Basu, Soheil Feizi, Dinesh Manocha | N/A | N/A |
| Scope-enhanced Compositional Semantic Parsing for DRT | Xiulin Yang, Jonas Groschwitz, Alexander Koller, Johan Bos | N/A | N/A |
| The Generation Gap: Exploring Age Bias Underlying in the Value Systems of Large Language Models | Siyang Liu, Trisha Maturi, Bowen Yi, Siqi Shen, Rada Mihalcea | N/A | N/A |
| TempoFormer: A Transformer for Temporally-aware Representations in Change Detection | Talia Tseriotou, Adam Tsakalidis, Maria Liakata | N/A | N/A |
| Pron vs Prompt: Can Large Language Models already Challenge a World-Class Fiction Author at Creative Text Writing? | Guillermo Marco, Julio Gonzalo, M. Teresa Mateo-Girona, Ramón del Castillo Santos | N/A | N/A |
| Evaluating Diversity in Automatic Poetry Generation | Yanran Chen, Hannes Gröner, Sina Zarrieß, Steffen Eger | N/A | N/A |
| Evaluating Short-Term Temporal Fluctuations of Social Biases in Social Media Data and Masked Language Models | Yi Zhou, Danushka Bollegala, Jose Camacho-Collados | N/A | N/A |
| Delving into Qualitative Implications of Synthetic Data for Hate Speech Detection | Camilla Casula, Sebastiano Vecellio Salto, Alan Ramponi, Sara Tonelli | N/A | N/A |
| Grounding Language in Multi-Perspective Referential Communication | Zineng Tang, Lingjun Mao, Alane Suhr | N/A | N/A |
| Threshold-driven Pruning with Segmented Maximum Term Weights for Approximate Cluster-based Sparse Retrieval | Yifan Qiao, Parker Carlson, Shanxiu He, Yingrui Yang, Tao Yang | N/A | N/A |
| Error Analysis of Multilingual Language Models in Machine Translation for Low-resource Languages: A Case Study of Amharic to English Bi-directional Machine Translation | Hizkiel Mitiku Alemayehu, Hamada M Zahera, Axel-Cyrille Ngonga Ngomo | N/A | N/A |
| MIPD: Exploring Manipulation and Intention In a Novel Corpus of Polish Disinformation | Arkadiusz Modzelewski, Giovanni Da San Martino, Pavel Savov, Magdalena Anna Wilczyńska, Adam Wierzbicki | N/A | N/A |
| Unsupervised Discrete Representations of American Sign Language | Artem Abzaliev, Rada Mihalcea | N/A | N/A |
| Perceptions to Beliefs: Exploring Precursory Inferences for Theory of Mind in Large Language Models | Chani Jung, Dongkwan Kim, Jiho Jin, Jiseon Kim, Yeon Seonwoo, Yejin Choi, Alice Oh, Hyunwoo Kim | N/A | N/A |
| Towards Enhancing Coherence in Extractive Summarization: Dataset and Experiments with LLMs | Mihir Parmar, Hanieh Deilamsalehy, Franck Dernoncourt, Seunghyun Yoon, Ryan A. Rossi, Trung Bui | N/A | N/A |
| Jump Starting Bandits with LLM-Generated Prior Knowledge | Parand A. Alamdari, Yanshuai Cao, Kevin H. Wilson | N/A | N/A |
| Adaptation Odyssey in LLMs: Why Does Additional Pretraining Sometimes Fail to Improve? | Fırat Öncel, Matthias Bethge, Beyza Ermis, Mirco Ravanelli, Cem Subakan, Çağatay Yıldız | N/A | N/A |
| Not All Contexts Are Equal: Teaching LLMs Credibility-aware Generation | Ruotong Pan, Boxi Cao, Hongyu Lin, Xianpei Han, Jia Zheng, Sirui Wang, Xunliang Cai, Le Sun | N/A | N/A |
| Virtual Personas for Language Models via an Anthology of Backstories | Suhong Moon, Marwa Abdulhai, Minwoo Kang, Joseph Suh, Widyadewi Soedarmadji, Eran Kohen Behar, David Chan | N/A | N/A |
| Step-by-Step Reasoning to Solve Grid Puzzles: Where do LLMs Falter? | Nemika Tyagi, Mihir Parmar, Mohith Kulkarni, Aswin RRV, Nisarg Patel, Mutsumi Nakamura, Arindam Mitra, Chitta Baral | N/A | N/A |
| Reasoning in Token Economies: Budget-Aware Evaluation of LLM Reasoning Strategies | Junlin Wang, Siddhartha Jain, Dejiao Zhang, Baishakhi Ray, Varun Kumar, Ben Athiwaratkun | N/A | N/A |
| The Empirical Variability of Narrative Perceptions of Social Media Texts | Joel Mire, Maria Antoniak, Elliott Ash, Andrew Piper, Maarten Sap | N/A | N/A |
| Which questions should I answer? Salience Prediction of Inquisitive Questions | Yating Wu, Ritika Rajesh Mangla, Alex Dimakis, Greg Durrett, Junyi Jessy Li | N/A | N/A |
| Revealing Personality Traits: A New Benchmark Dataset for Explainable Personality Recognition on Dialogues | Lei Sun, Jinming Zhao, Qin Jin | N/A | N/A |
| Continual Test-time Adaptation for End-to-end Speech Recognition on Noisy Speech | Guan-Ting Lin, Wei Ping Huang, Hung-yi Lee | N/A | N/A |
| Whiteboard-of-Thought: Thinking Step-by-Step Across Modalities | Sachit Menon, Richard Zemel, Carl Vondrick | N/A | N/A |
| CodeJudge: Evaluating Code Generation with Large Language Models | Weixi Tong, Tianyi Zhang | N/A | N/A |
| Self-Training Large Language and Vision Assistant for Medical | Guohao Sun, Can Qin, Huazhu Fu, Linwei Wang, Zhiqiang Tao | N/A | N/A |
| SYNFAC-EDIT: Synthetic Imitation Edit Feedback for Factual Alignment in Clinical Summarization | Prakamya Mishra, Zonghai Yao, Parth Vashisht, Feiyun Ouyang, Beining Wang, Vidhi Dhaval Mody, Hong Yu | N/A | N/A |
| Defending Jailbreak Prompts via In-Context Adversarial Game | Yujun Zhou, Yufei Han, Haomin Zhuang, Kehan Guo, Zhenwen Liang, Hongyan Bao, Xiangliang Zhang | N/A | N/A |
| Detecting Online Community Practices with Large Language Models: A Case Study of Pro-Ukrainian Publics on Twitter | Kateryna Kasianenko, Shima Khanehzar, Stephen Wan, Ehsan Dehghan, Axel Bruns | N/A | N/A |
| Multilingual Topic Classification in X: Dataset and Analysis | Dimosthenis Antypas, Asahi Ushio, Francesco Barbieri, Jose Camacho-Collados | N/A | N/A |
| MT-Eval: A Multi-Turn Capabilities Evaluation Benchmark for Large Language Models | Wai-Chung Kwan, Xingshan Zeng, Yuxin Jiang, Yufei Wang, Liangyou Li, Lifeng Shang, Xin Jiang, Qun Liu, Kam-Fai Wong | N/A | N/A |
| Updating CLIP to Prefer Descriptions Over Captions | Amir Zur, Elisa Kreiss, Karel D’Oosterlinck, Christopher Potts, Atticus Geiger | N/A | N/A |
| CmdCaliper: A Semantic-Aware Command-Line Embedding Model and Dataset for Security Research | Sian-Yao Huang, Cheng-Lin Yang, Che-Yu Lin, Chun-Ying Huang | N/A | N/A |
| Back to School: Translation Using Grammar Books | Jonathan Hus, Antonios Anastasopoulos | N/A | N/A |
| VIEWS: Entity-Aware News Video Captioning | Hammad Ayyubi, Tianqi Liu, Arsha Nagrani, Xudong Lin, Mingda Zhang, Anurag Arnab, Feng Han, Yukun Zhu, Xuande Feng, Kevin Zhang, Jialu Liu, Shih-Fu Chang | N/A | N/A |
| Towards Aligning Language Models with Textual Feedback | Saüc Abadal Lloret, Shehzaad Dhuliawala, Keerthiram Murugesan, Mrinmaya Sachan | N/A | N/A |
| ATPO: Automatic Tree-Structured Prompt Optimization | Sheng Yang, Yurong Wu, Yan Gao, Zineng Zhou, Xiaodi Sun, Bin Benjamin Zhu, Jian-Guang Lou, Zhiming Ding, Anbang Hu, Yuan Fang, Yunsong Li, Junyan Chen, Linjun Yang | N/A | N/A |
| DeMPT: Decoding-enhanced Multi-phase Prompt Tuning for Making LLMs Be Better Context-aware Translators | Xinglin Lyu, Junhui Li, Yanqing Zhao, Min Zhang, Daimeng Wei, Shimin Tao, Hao Yang, Min Zhang | N/A | N/A |
| DEFT-UCS: Data Efficient Fine-Tuning for Pre-Trained Language Models via Unsupervised Core-Set Selection | Devleena Das, Vivek Khetan | N/A | N/A |
| Unveiling Multi-level and Multi-modal Semantic Representations in the Human Brain using Large Language Models | Yuko Nakagi, Takuya Matsuyama, Naoko Koide-Majima, Hiroto Q. Yamaguchi, Rieko Kubo, Shinji Nishimoto, Yu Takagi | N/A | N/A |
| “They are uncultured”: Unveiling Covert Harms and Social Threats in LLM Generated Conversations | Preetam Prabhu Srikar Dammu, Hayoung Jung, Anjali Singh, Monojit Choudhury, Tanu Mitra | N/A | N/A |
| Multi-expert Prompting Improves Reliability, Safety and Usefulness of Large Language Models | Do Xuan Long, Duong Ngoc Yen, Anh Tuan Luu, Kenji Kawaguchi, Min-Yen Kan, Nancy F. Chen | N/A | N/A |
| Will LLMs Replace the Encoder-Only Models in Temporal Relation Classification? | Gabriel Roccabruna, Massimo Rizzoli, Giuseppe Riccardi | N/A | N/A |
| Eliciting In-Context Learning in Vision-Language Models for Videos Through Curated Data Distributional Properties | Keunwoo Peter Yu, Zheyuan Zhang, Fengyuan Hu, Shane Storks, Joyce Chai | N/A | N/A |
| Framework for Robust and Scalable Text Watermarking | Gregory Kang Ruey Lau, Xinyuan Niu, Hieu Dao, Jiangwei Chen, Chuan-Sheng Foo, Bryan Kian Hsiang Low | N/A | N/A |
| MASIVE: Open-Ended Affective State Identification in English and Spanish | Nicholas Deas, Elsbeth Turcan, Ivan Ernesto Perez Mejia, Kathleen McKeown | N/A | N/A |
| You Make me Feel like a Natural Question: Training QA Systems on Transformed Trivia Questions | Tasnim Kabir, Yoo Yeon Sung, Saptarashmi Bandyopadhyay, Hao Zou, Abhranil Chandra, Jordan Lee Boyd-Graber | N/A | N/A |
| AlphaExpert: Assigning LoRA Experts Based on Layer Training Quality | Peijun Qing, Chongyang Gao, Yefan Zhou, Xingjian Diao, Yaoqing Yang, Soroush Vosoughi | N/A | N/A |
| Flee the Flaw: Annotating the Underlying Logic of Fallacious Arguments Through Templates and Slot-filling | Irfan Robbani, Paul Reisert, Surawat Pothong, Naoya Inoue, Camélia Guerraoui, Wenzhi Wang, Shoichi Naito, Jungmin Choi, Kentaro Inui | N/A | N/A |
| Advancing Social Intelligence in AI Agents: Technical Challenges and Open Questions | Leena Mathur, Paul Pu Liang, Louis-Philippe Morency | N/A | N/A |
| RAt: Injecting Implicit Bias for Text-To-Image Prompt Refinement Models | Ziyi Kou, Shichao Pei, Meng Jiang, Xiangliang Zhang | N/A | N/A |
| Can LLM Generate Culturally Relevant Commonsense QA Data? Case Study in Indonesian and Sundanese | Rifki Afina Putri, Faiz Ghifari Haznitrama, Dea Adhista, Alice Oh | N/A | N/A |
| Learnability of Indirect Evidence in Language Models | Miyu Oba, Yohei Oseki, Akiyo Fukatsu, Akari Haga, Hiroki Ouchi, Taro Watanabe, Saku Sugawara | N/A | N/A |
| Do LLMs Know to Respect Copyright Notice? | Jialiang Xu, Shenglan Li, Zhaozhuo Xu, Denghui Zhang | N/A | N/A |
| SpecHub: Provable Acceleration to Multi-Draft Speculative Decoding | Hanchi Sun, Tianyi Zhou, Xun Chen, Lichao Sun | N/A | N/A |
| Interventional Speech Noise Injection for ASR Generalizable Spoken Language Understanding | YeonJoon Jung, Jaeseong Lee, Seungtaek Choi, Dohyeon Lee, Minsoo Kim, Seung-won Hwang | N/A | N/A |
| Rethinking the Role of Proxy Rewards in Language Model Alignment | Sungdong Kim, Minjoon Seo | N/A | N/A |
| Visual Text Matters: Improving Text-KVQA with Visual Text Entity Knowledge-aware Large Multimodal Assistant | Abhirama Subramanyam Penamakuri, Anand Mishra | N/A | N/A |
| How Good is my MT Metric? A Framework for the Interpretation of Metric Assessments | Stefano Perrella, Lorenzo Proietti, Pere-Lluís Huguet Cabot, Edoardo Barba, Roberto Navigli | N/A | N/A |
| IFCap: Image-like Retrieval and Frequency-based Entity Filtering for Zero-shot Captioning | Soeun Lee, Si-Woo Kim, Taewhan Kim, Dong-Jin Kim | N/A | N/A |
| SPREADSHEETLLM: Encoding Spreadsheets for Large Language Models | Haoyu Dong, Jianbo Zhao, Yuzhang Tian, Junyu Xiong, Shiyu Xia, Mengyu Zhou, Yun Lin, José Cambronero, Yeye He, Shi Han, Dongmei Zhang | N/A | N/A |
| Let’s discuss! Quality Dimensions and Annotated Datasets for Computational Argument Quality | Rositsa V Ivanova, Thomas Huber, Christina Niklaus | N/A | N/A |
| Automatic sentence segmentation of clinical record narratives in real-world data | Dongfang Xu, Davy Weissenbacher, Karen O’Connor, Siddharth Rawal, Graciela Gonzalez Hernandez | N/A | N/A |
| One-to-Many Communication and Compositionality in Emergent Communication | Heeyoung Lee | N/A | N/A |
| Bayesian Example Selection Improves In-Context Learning for Speech, Text, and Visual Modalities | Siyin Wang, Chao-Han Huck Yang, Ji Wu, Chao Zhang | N/A | N/A |
| Investigating Multilingual Instruction-Tuning: Do Polyglot Models Demand for Multilingual Instructions? | Alexander Arno Weber, Klaudia Thellmann, Jan Ebert, Nicolas Flores-Herr, Jens Lehmann, Michael Fromm, Mehdi Ali | N/A | N/A |
| Multi-LogiEval: Towards Evaluating Multi-Step Logical Reasoning Ability of Large Language Models | Nisarg Patel, Mohith Kulkarni, Mihir Parmar, Aashna Budhiraja, Mutsumi Nakamura, Neeraj Varshney, Chitta Baral | N/A | N/A |
| Contrastive Classification via Linear Layer Extrapolation | Mayukh Sharma, Sean O’Brien, Julian McAuley | N/A | N/A |
| Task Oriented In-Domain Data Augmentation | Xiao Liang, Xinyu Hu, Simiao Zuo, Yeyun Gong, Qiang Lou, Yi Liu, Shao-Lun Huang, Jian Jiao | N/A | N/A |
| SciDQA: A Deep Reading Comprehension Dataset over Scientific Papers | Shruti Singh, Nandan Sarkar, Arman Cohan | N/A | N/A |
| Mixture-of-Modules: Reinventing Transformers as Dynamic Assemblies of Modules | Zhuocheng Gong, Ang Lv, Jian Guan, Wei Wu, Huishuai Zhang, Minlie Huang, Dongyan Zhao, Rui Yan | N/A | N/A |
| No Culture Left Behind: ArtELingo-28, a Benchmark of WikiArt with Captions in 28 Languages | Youssef Mohamed, Runjia Li, Ibrahim Said Ahmad, Kilichbek Haydarov, Philip Torr, Kenneth Church, Mohamed Elhoseiny | N/A | N/A |
| PREDICT: Multi-Agent-based Debate Simulation for Generalized Hate Speech Detection | Someen Park, Jaehoon Kim, Seungwan Jin, Sohyun Park, Kyungsik Han | N/A | N/A |
| TokenVerse: Unifying Speech and NLP Tasks via Transducer-based ASR | Shashi Kumar, Srikanth Madikeri, Juan Pablo Zuluaga Gomez, Iuliia Thorbecke, Esaú Villatoro-Tello, Sergio Burdisso, Petr Motlicek, Karthik Pandia D S, Aravind Ganapathiraju | N/A | N/A |
| ApiQ: Finetuning of 2-Bit Quantized Large Language Model | Baohao Liao, Christian Herold, Shahram Khadivi, Christof Monz | N/A | N/A |
| Memorize Step by Step: Efficient Long-Context Prefilling with Incremental Memory and Decremental Chunk | Zhiyuan Zeng, Qipeng Guo, Xiaoran Liu, Zhangyue Yin, Wentao Shu, Mianqiu Huang, Bo Wang, Yunhua Zhou, Linlin Li, Qun Liu, Xipeng Qiu | N/A | N/A |
| A Morphology-Based Investigation of Positional Encodings | Poulami Ghosh, Shikhar Vashishth, Raj Dabre, Pushpak Bhattacharyya | N/A | N/A |
| I love pineapple on pizza != I hate pineapple on pizza: Stance-Aware Sentence Transformers for Opinion Mining | Vahid Ghafouri, Jose M. Such, Guillermo Suarez-Tangil | N/A | N/A |
| BiasWipe: Mitigating Unintended Bias in Text Classifiers through Model Interpretability | Mamta Mamta, Rishikant Chigrupaatii, Asif Ekbal | N/A | N/A |
| ArMeme: Propagandistic Content in Arabic Memes | Firoj Alam, Abul Hasnat, Fatema Ahmad, Md. Arid Hasan, Maram Hasanain | N/A | N/A |
| Language is Scary when Over-Analyzed: Unpacking Implied Misogynistic Reasoning with Argumentation Theory-Driven Prompts | Arianna Muti, Federico Ruggeri, Khalid Al Khatib, Alberto Barrón-Cedeño, Tommaso Caselli | N/A | N/A |
| Thoughts to Target: Enhance Planning for Target-driven Conversation | Zhonghua Zheng, Lizi Liao, Yang Deng, Ee-Peng Lim, Minlie Huang, Liqiang Nie | N/A | N/A |
| Scalable Data Ablation Approximations for Language Models through Modular Training and Merging | Clara Na, Ian Magnusson, Ananya Harsh Jha, Tom Sherborne, Emma Strubell, Jesse Dodge, Pradeep Dasigi | N/A | N/A |
| Exploring Intrinsic Language-specific Subspaces in Fine-tuning Multilingual Neural Machine Translation | Zhe Cao, Zhi Qu, Hidetaka Kamigaito, Taro Watanabe | N/A | N/A |
| Attention Score is not All You Need for Token Importance Indicator in KV Cache Reduction: Value Also Matters | Zhiyu Guo, Hidetaka Kamigaito, Taro Watanabe | N/A | N/A |
| Generative Subgraph Retrieval for Knowledge Graph–Grounded Dialog Generation | Jinyoung Park, Minseok Joo, Joo-Kyung Kim, Hyunwoo J. Kim | N/A | N/A |
| Adapters Mixup: Mixing Parameter-Efficient Adapters to Enhance the Adversarial Robustness of Fine-tuned Pre-trained Text Classifiers | Tuc Van Nguyen, Thai Le | N/A | N/A |
| Generalizing Clinical De-identification Models by Privacy-safe Data Augmentation using GPT-4 | Woojin Kim, Sungeun Hahm, Jaejin Lee | N/A | N/A |
| Connecting the Dots: Evaluating Abstract Reasoning Capabilities of LLMs Using the New York Times Connections Word Game | Prisha Samdarshi, Mariam Mustafa, Anushka Kulkarni, Raven Rothkopf, Tuhin Chakrabarty, Smaranda Muresan | N/A | N/A |
| GottBERT: a pure German Language Model | Raphael Scheible, Johann Frei, Fabian Thomczyk, Henry He, Patric Tippmann, Jochen Knaus, Victor Jaravine, Frank Kramer, Martin Boeker | N/A | N/A |
| Computational Meme Understanding: A Survey | Khoi P. N. Nguyen, Vincent Ng | N/A | N/A |
| CoverICL: Selective Annotation for In-Context Learning via Active Graph Coverage | Costas Mavromatis, Balasubramaniam Srinivasan, Zhengyuan Shen, Jiani Zhang, Huzefa Rangwala, Christos Faloutsos, George Karypis | N/A | N/A |
| Retrieval-enriched zero-shot image classification in low-resource domains | Nicola Dall’Asen, Yiming Wang, Enrico Fini, Elisa Ricci | N/A | N/A |
| I-AM-G: Interest Augmented Multimodal Generator for Item Personalization | Xianquan Wang, Likang Wu, Shukang Yin, Zhi Li, Yanjiang Chen, hufeng, Yu Su, Qi Liu | N/A | N/A |
| Twists, Humps, and Pebbles: Multilingual Speech Recognition Models Exhibit Gender Performance Gaps | Giuseppe Attanasio, Beatrice Savoldi, Dennis Fucci, Dirk Hovy | N/A | N/A |
| Enhancing Language Model Alignment: A Confidence-Based Approach to Label Smoothing | Baihe Huang, Hiteshi Sharma, Yi Mao | N/A | N/A |
| Contrastive Policy Gradient: Aligning LLMs on sequence-level scores in a supervised-friendly fashion | Yannis Flet-Berliac, Nathan Grinsztajn, Florian Strub, Eugene Choi, Bill Wu, Chris Cremer, Arash Ahmadian, Yash Chandak, Mohammad Gheshlaghi Azar, Olivier Pietquin, Matthieu Geist | N/A | N/A |
| Show and Guide: Instructional-Plan Grounded Vision and Language Model | Diogo Glória-Silva, David Semedo, Joao Magalhaes | N/A | N/A |
| Beyond Turn-Based Interfaces: Synchronous LLMs as Full-Duplex Dialogue Agents | Bandhav Veluri, Benjamin N Peloquin, Bokai YU, Hongyu Gong, Shyamnath Gollakota | N/A | N/A |
| QuBE: Question-based Belief Enhancement for Agentic LLM | Minsoo Kim, Jongyoon Kim, Jihyuk Kim, seung-won hwang | N/A | N/A |
| COMPACT: Compressing Retrieved Documents Actively for Question Answering | Chanwoong Yoon, Taewhoo Lee, Hyeon Hwang, Minbyul Jeong, Jaewoo Kang | N/A | N/A |
| An Empirical Analysis on Spatial Reasoning Capabilities of Large Multimodal Models | Fatemeh Shiri, Xiao-Yu Guo, Mona Golestan Far, Xin Yu, Reza Haf, Yuan-Fang Li | N/A | N/A |
| Synthetic Knowledge Ingestion: Towards Knowledge Refinement and Injection for Enhancing Large Language Models | Jiaxin Zhang, Wendi Cui, Yiran Huang, Kamalika Das, Sricharan Kumar | N/A | N/A |
| Local Contrastive Editing of Gender Stereotypes | Marlene Lutz, Rochelle Choenni, Markus Strohmaier, Anne Lauscher | N/A | N/A |
| De-Identification of Sensitive Personal Data in Datasets Derived from IIT-CDIP | Stefan Larson, Nicole Cornehl Lima, Santiago Pedroza Diaz, Amogh Manoj Joshi, Siddharth Betala, Jamiu Tunde Suleiman, Yash Mathur, Kaushal Kumar Prajapati, Ramla Alakraa, Junjie Shen, Temi Okotore, Kevin Leach | N/A | N/A |
| RAR: Retrieval Augmented Retrieval for Code Generation in Low Resource Languages | Avik Dutta, Mukul Singh, Gust Verbruggen, Sumit Gulwani, Vu Le | N/A | N/A |
| STAR: SocioTechnical Approach to Red Teaming Language Models | Laura Weidinger, John F J Mellor, Bernat Guillén Pegueroles, Nahema Marchal, Ravin Kumar, Kristian Lum, Canfer Akbulut, Mark Diaz, A. Stevie Bergman, Mikel D. Rodriguez, Verena Rieser, William Isaac | N/A | N/A |
| Do great minds think alike? Investigating Human-AI Complementarity for Question Answering | Maharshi Gor, Hal Daumé III, Tianyi Zhou, Jordan Lee Boyd-Graber | N/A | N/A |
| Memory-Efficient Fine-Tuning of Transformers via Token Selection | Antoine Simoulin, Namyong Park, Xiaoyi Liu, Grey Yang | N/A | N/A |
| Unveiling the mystery of visual attributes of concrete and abstract concepts: Variability, nearest neighbors, and challenging categories | Tarun Tater, Sabine Schulte im Walde, Diego Frassinelli | N/A | N/A |
| Evaluating Large Language Models on Time Series Feature Understanding: A Comprehensive Taxonomy and Benchmark | Elizabeth Fons, Rachneet Kaur, Soham Palande, Zhen Zeng, Tucker Balch, Manuela Veloso, Svitlana Vyetrenko | N/A | N/A |
| Can LLMs Learn Uncertainty on Their Own? Expressing Uncertainty Effectively in A Self-Training Manner | Shudong Liu, Zhaocong Li, Xuebo Liu, Runzhe Zhan, Derek F. Wong, Lidia S. Chao, Min zhang | N/A | N/A |
| Preference-Guided Reflective Sampling for Aligning Language Models | Hai Ye, Hwee Tou Ng | N/A | N/A |
| Metrics for What, Metrics for Whom: Assessing Actionability of Bias Evaluation Metrics in NLP | Pieter Delobelle, Giuseppe Attanasio, Debora Nozza, Su Lin Blodgett, Zeerak Talat | N/A | N/A |
| Is this the real life? Is this just fantasy? The Misleading Success of Simulating Social Interactions With LLMs | Xuhui Zhou, Zhe Su, Tiwalayo Eisape, Hyunwoo Kim, Maarten Sap | N/A | N/A |
| A Simple LLM Framework for Long-Range Video Question-Answering | Ce Zhang, Taixi Lu, Md Mohaiminul Islam, Ziyang Wang, Shoubin Yu, Mohit Bansal, Gedas Bertasius | N/A | N/A |
| Rebuilding ROME: Resolving Model Collapse during Sequential Model Editing | Akshat Gupta, Sidharth Baskaran, Gopala Anumanchipalli | N/A | N/A |
| Casablanca: Data and Models for Multidialectal Arabic Speech Recognition | Bashar Talafha, Karima Kadaoui, Samar Mohamed Magdy, Mariem Habiboullah, Chafei Mohamed Chafei, Ahmed Oumar El-Shangiti, Hiba Zayed, Mohamedou cheikh tourad, Rahaf Alhamouri, Rwaa Assi, Aisha Alraeesi, Hour Mohamed, Fakhraddin Alwajih, Abdelrahman Mohamed, Abdellah EL MEKKI, El Moatez Billah Nagoudi, Benelhadj Djelloul Mama Saadia, Hamzah A. Alsayadi, Walid Al-Dhabyani, Sara Shatnawi, Yasir ECH-CHAMMAKHY, AMAL MAKOUAR, Yousra Berrachedi, Mustafa Jarrar, Shady Shehata, Ismail Berrada, Muhammad Abdul-Mageed | N/A | N/A |
| Safety Arithmetic: A Framework for Test-time Safety Alignment of Language Models by Steering Parameters and Activations | Rima Hazra, Sayan Layek, Somnath Banerjee, Soujanya Poria | N/A | N/A |
| Communicating with Speakers and Listeners of Different Pragmatic Levels | Kata Naszadi, Frans A Oliehoek, Christof Monz | N/A | N/A |
| RECANTFormer: Referring Expression Comprehension with Varying Numbers of Targets | Bhathiya Hemanthage, Hakan Bilen, Phil Bartie, Christian Dondrup, Oliver Lemon | N/A | N/A |
| Sprout: Green Generative AI with Carbon-Efficient LLM Inference | Baolin Li, Yankai Jiang, Vijay Gadepally, Devesh Tiwari | N/A | N/A |
| Do LLMs Plan Like Human Writers? Comparing Journalist Coverage of Press Releases with LLMs | Alexander Spangher, Nanyun Peng, Sebastian Gehrmann, Mark Dredze | N/A | N/A |
| T-FREE: Tokenizer-Free Generative LLMs via Sparse Representations for Memory-Efficient Embeddings | Björn Deiseroth, Manuel Brack, Samuel Weinbach, Patrick Schramowski, Kristian Kersting | N/A | N/A |
| SpeechQE: Estimating the Quality of Direct Speech Translation | HyoJung Han, Kevin Duh, Marine Carpuat | N/A | N/A |
| Assessing and Verifying Task Utility in LLM-Powered Applications | Negar Arabzadeh, Siqing Huo, Nikhil Mehta, Qingyun Wu, Chi Wang, Ahmed Hassan Awadallah, Charles L. A. Clarke, Julia Kiseleva | N/A | N/A |
| Dynamic Rewarding with Prompt Optimization Enables Tuning-free Self-Alignment of Language Models | Somanshu Singla, Zhen Wang, Tianyang Liu, Abdullah Ashfaq, Zhiting Hu, Eric P. Xing | N/A | N/A |
| Accurate and Data-Efficient Toxicity Prediction when Annotators Disagree | Harbani Jaggi, Kashyap Coimbatore Murali, Eve Fleisig, Erdem Biyik | N/A | N/A |
| Adversarial Text Generation using Large Language Models for Dementia Detection | Youxiang Zhu, Nana Lin, Kiran Sandilya Balivada, Daniel Haehn, Xiaohui Liang | N/A | N/A |
| xCOMET-lite: Bridging the Gap Between Efficiency and Quality in Learned MT Evaluation Metrics | Daniil Larionov, Mikhail Seleznyov, Vasiliy Viskov, Alexander Panchenko, Steffen Eger | N/A | N/A |
| The Greatest Good Benchmark: Measuring LLMs’ Alignment with Utilitarian Moral Dilemmas | Giovanni Franco Gabriel Marraffini, Andrés Cotton, Noe Fabian Hsueh, Juan Wisznia, Axel Fridman, Luciano Del Corro | N/A | N/A |
| FairFlow: Mitigating Dataset Biases through Undecided Learning for Natural Language Understanding | Jiali Cheng, Hadi Amiri | N/A | N/A |
| Style-Shifting Behaviour of the Manosphere on Reddit | Jai Aggarwal, Suzanne Stevenson | N/A | N/A |
| The Death and Life of Great Prompts: Analyzing the Evolution of LLM Prompts from the Structural Perspective | Yihan Ma, Xinyue Shen, Yixin Wu, Boyang Zhang, Michael Backes, Yang Zhang | N/A | N/A |
| Holistic Evaluation for Interleaved Text-and-Image Generation | Minqian Liu, Zhiyang Xu, Zihao Lin, Trevor Ashby, Joy Rimchala, Jiaxin Zhang, Lifu Huang | N/A | N/A |
| FOLIO: Natural Language Reasoning with First-Order Logic | SIMENG HAN, Hailey Schoelkopf, Yilun Zhao, Zhenting Qi, Martin Riddell, Wenfei Zhou, James Coady, David Peng, Yujie Qiao, Luke Benson, Lucy Sun, Alexander Wardle-Solano, Hannah Szabó, Ekaterina Zubova, Matthew Burtell, Jonathan Fan, Yixin Liu, Brian Wong, Malcolm Sailor, Ansong Ni, Linyong Nan, Jungo Kasai, Tao Yu, Rui Zhang, Alexander Fabbri, Wojciech Maciej Kryscinski, Semih Yavuz, Ye Liu, Xi Victoria Lin, Shafiq Joty, Yingbo Zhou, Caiming Xiong, Rex Ying, Arman Cohan, Dragomir Radev | N/A | N/A |
| The LLM Effect: Are Humans Truly Using LLMs, or Are They Being Influenced By Them Instead? | Alexander Choi, Syeda Sabrina Akter, J.P. Singh, Antonios Anastasopoulos | N/A | N/A |
| Is Child-Directed Speech Effective Training Data for Language Models? | Steven Y. Feng, Noah Goodman, Michael Frank | N/A | N/A |
| RevMUX: Data Multiplexing with Reversible Adapters for Efficient LLM Batch Inference | Yige Xu, Xu Guo, Zhiwei Zeng, Chunyan Miao | N/A | N/A |
| HCEG: Improving the Abstraction Ability of Language Models with Hierarchical Conceptual Entailment Graphs | Juncai Li, Ru Li, Xiaoli Li, Qinghua Chai, Jeff Z. Pan | N/A | N/A |
| M3Hop-CoT: Misogynous Meme Identification with Multimodal Multi-hop Chain-of-Thought | Gitanjali Kumari, Kirtan Jain, Asif Ekbal | N/A | N/A |
| GPT-4 Jailbreaks Itself with Near-Perfect Success Using Self-Explanation | Govind Ramesh, Yao Dou, Wei Xu | N/A | N/A |
| RE-RAG: Improving Open-Domain QA Performance and Interpretability with Relevance Estimator in Retrieval-Augmented Generation | Kiseung Kim, Jay-Yoon Lee | N/A | N/A |
| Evaluating Concurrent Robustness of Language Models Across Diverse Challenge Sets | Vatsal Gupta, Pranshu Pandya, Tushar Kataria, Vivek Gupta, Dan Roth | N/A | N/A |
| Simul-MuST-C: Simultaneous Multilingual Speech Translation Corpus Using Large Language Model | Mana Makinae, Yusuke Sakai, Hidetaka Kamigaito, Taro Watanabe | N/A | N/A |
| Is This a Bad Table? A Closer Look at the Evaluation of Table Generation from Text | Pritika Ramu, Aparna Garimella, Sambaran Bandyopadhyay | N/A | N/A |
| On the Fragility of Active Learners for Text Classification | Abhishek Ghose, Emma Thuong Nguyen | N/A | N/A |
| BMRetriever: Tuning Large Language Models as Better Biomedical Text Retrievers | Ran Xu, Wenqi Shi, Yue Yu, Yuchen Zhuang, Yanqiao Zhu, May Dongmei Wang, Joyce C. Ho, Chao Zhang, Carl Yang | N/A | N/A |
| Comparing Neighbors Together Makes it Easy: Jointly Comparing Multiple Candidates for Efficient and Effective Retrieval | Jonghyun Song, Cheyon Jin, Wenlong Zhao, Jay-Yoon Lee | N/A | N/A |
| M3D: MultiModal MultiDocument Fine-Grained Inconsistency Detection | Chia-Wei Tang, Ting-Chih Chen, Alvi Md Ishmam, Kiet A. Nguyen, Kazi Sajeed Mehrab, Chris Thomas | N/A | N/A |
| MedAdapter: Efficient Test-Time Adaptation of Large Language Models Towards Medical Reasoning | Wenqi Shi, Ran Xu, Yuchen Zhuang, Yue Yu, Haotian Sun, Hang Wu, Carl Yang, May Dongmei Wang | N/A | N/A |
| EHRAgent: Code Empowers Large Language Models for Few-shot Complex Tabular Reasoning on Electronic Health Records | Wenqi Shi, Ran Xu, Yuchen Zhuang, Yue Yu, Jieyu Zhang, Hang Wu, Yuanda Zhu, Joyce C. Ho, Carl Yang, May Dongmei Wang | N/A | N/A |
| SimLLM: Detecting Sentences Generated by Large Language Models Using Similarity between the Generation and its Re-generation | Hoang-Quoc Nguyen-Son, Minh-Son Dao, Koji Zettsu | N/A | N/A |
| CELLO: Causal Evaluation of Large Vision-Language Models | Meiqi Chen, Bo Peng, Yan Zhang, Chaochao Lu | N/A | N/A |
| Simultaneous Interpretation Corpus Construction by Large Language Models in Distant Language Pair | Yusuke Sakai, Mana Makinae, Hidetaka Kamigaito, Taro Watanabe | N/A | N/A |
| Training-free Deep Concept Injection Enables Language Models for Video Question Answering | Xudong Lin, Manling Li, Richard Zemel, Heng Ji, Shih-Fu Chang | N/A | N/A |
| MIBench: Evaluating Multimodal Large Language Models over Multiple Images | Haowei Liu, Xi Zhang, Haiyang Xu, Yaya Shi, Chaoya Jiang, Ming Yan, Ji Zhang, Fei Huang, Chunfeng Yuan, Bing Li, Weiming Hu | N/A | N/A |
| ZEBRA: Zero-Shot Example-Based Retrieval Augmentation for Commonsense Question Answering | Francesco Maria Molfese, Simone Conia, Riccardo Orlando, Roberto Navigli | N/A | N/A |
| ABLE: Personalized Disability Support with Politeness and Empathy Integration | Kshitij Mishra, Manisha Burja, Asif Ekbal | N/A | N/A |
| Language Models as Compilers: Simulating Pseudocode Execution Improves Algorithmic Reasoning in Language Models | Hyungjoo Chae, Yeonghyeon Kim, Seungone Kim, Kai Tzu-iunn Ong, Beong-woo Kwak, Moohyeon Kim, Sunghwan Kim, Taeyoon Kwon, Jiwan Chung, Youngjae Yu, Jinyoung Yeo | N/A | N/A |
| Coffee-Gym: An Environment for Evaluating and Improving Natural Language Feedback on Erroneous Code | Hyungjoo Chae, Taeyoon Kwon, Seungjun Moon, Yongho Song, Dongjin Kang, Kai Tzu-iunn Ong, Beong-woo Kwak, Seonghyeon Bae, seung-won hwang, Jinyoung Yeo | N/A | N/A |
| Improving Minimum Bayes Risk Decoding with Multi-Prompt | David Heineman, Yao Dou, Wei Xu | N/A | N/A |
| Deciphering Cognitive Distortions in Patient-Doctor Mental Health Conversations: A Multimodal LLM-Based Detection and Reasoning Framework | gopendra Vikram singh, Sai Vardhan Vemulapalli, Mauajama Firdaus, Asif Ekbal | N/A | N/A |
| Nearest Neighbor Normalization Improves Multimodal Retrieval | Neil Chowdhury, Franklin Wang, Sumedh Shenoy, Douwe Kiela, Sarah Schwettmann, Tristan Thrush | N/A | N/A |
| Rethinking Pragmatics in Large Language Models: Towards Open-Ended Evaluation and Preference Tuning | Shengguang Wu, Shusheng Yang, Zhenglun Chen, Qi Su | N/A | N/A |
| LongRAG: A Dual-perspective Retrieval-Augmented Generation Paradigm for Long-Context Question Answering | Qingfei Zhao, Ruobing Wang, Yukuo Cen, Daren Zha, Shicheng Tan, Yuxiao Dong, Jie Tang | N/A | N/A |
| Context-aware Watermark with Semantic Balanced Green-red Lists for Large Language Models | Yuxuan Guo, Zhiliang Tian, YIPING SONG, Tianlun Liu, Liang Ding, Dongsheng Li | N/A | N/A |
| Knowledge Graph Enhanced Large Language Model Editing | Mengqi Zhang, Xiaotian Ye, Qiang Liu, Pengjie Ren, Shu Wu, Zhumin Chen | N/A | N/A |
| ‘Quis custodiet ipsos custodes?’ Who will watch the watchmen? On Detecting AI-generated peer-reviews | Sandeep Kumar, Mohit Sahu, Vardhan Gacche, Tirthankar Ghosal, Asif Ekbal | N/A | N/A |
| Mitigating Open-Vocabulary Caption Hallucinations | Assaf Ben-Kish, Moran Yanuka, Morris Alper, Raja Giryes, Hadar Averbuch-Elor | N/A | N/A |
| Initialization of Large Language Models via Reparameterization to Mitigate Loss Spikes | Kosuke Nishida, Kyosuke Nishida, Kuniko Saito | N/A | N/A |
| ALVIN: Active Learning Via INterpolation | Michalis Korakakis, Andreas Vlachos | N/A | N/A |
| Filtered Direct Preference Optimization | Tetsuro Morimura, Mitsuki Sakamoto, Yuu Jinnai, Kenshi Abe, Kaito Ariu | N/A | N/A |
| Instruction Fine-Tuning: Does Prompt Loss Matter? | Mathew Huerta-Enochian, Seung Yong Ko | N/A | N/A |
| Entity Insertion in Multilingual Linked Corpora: The Case of Wikipedia | Tomás Feith, Akhil Arora, Martin Gerlach, Debjit Paul, Robert West | N/A | N/A |
EMNLP 2021
| Title | Author | PDF_Link | Code_URL |
|---|---|---|---|
| UniGen: Universal Domain Generalization for Sentiment Classification via Zero-shot Dataset Generation | Juhwan Choi, Yeonghwa Kim, Seunguk Yu, JungMin Yun, YoungBin Kim | N/A | N/A |
| Multi-News+: Cost-efficient Dataset Cleansing via LLM-based Data Annotation | Juhwan Choi, JungMin Yun, Kyohoon Jin, YoungBin Kim | N/A | N/A |
| FIZZ: Factual Inconsistency Detection by Zoom-in Summary and Zoom-out Document | Joonho Yang, Seunghyun Yoon, ByeongJeong Kim, Hwanhee Lee | N/A | N/A |
| Prompts have evil twins | Rimon Melamed, Lucas Hurley McCabe, Tanay Wakhare, Yejin Kim, H. Howie Huang, Enric Boix-Adserà | N/A | N/A |
| Table Question Answering for Low-resourced Indic Languages | Vaishali Pal, Evangelos Kanoulas, Andrew Yates, Maarten de Rijke | N/A | N/A |
| ImageInWords: Unlocking Hyper-Detailed Image Descriptions | Roopal Garg, Andrea Burns, Burcu Karagol Ayan, Yonatan Bitton, Ceslee Montgomery, Yasumasa Onoe, Andrew Bunner, Ranjay Krishna, Jason Michael Baldridge, Radu Soricut | N/A | N/A |
| LLM-Based Agent Society Investigation: Collaboration and Confrontation in Avalon Gameplay | Yihuai Lan, Zhiqiang Hu, Lei Wang, Yang Wang, Deheng Ye, Peilin Zhao, Ee-Peng Lim, Hui Xiong, Hao Wang | N/A | N/A |
| When LLMs Meets Acoustic Landmarks: An Efficient Approach to Integrate Speech into Large Language Models for Depression Detection | Xiangyu Zhang, Hexin Liu, Kaishuai Xu, Qiquan Zhang, Daijiao Liu, Beena Ahmed, Julien Epps | N/A | N/A |
| Speaking in Wavelet Domain: A Simple and Efficient Approach to Speed up Speech Diffusion Model | Xiangyu Zhang, Daijiao Liu, Hexin Liu, Qiquan Zhang, Hanyu Meng, Leibny Paola Garcia Perera, EngSiong Chng, Lina Yao | N/A | N/A |
| Hateful Word in Context Classification | Sanne Hoeken, Sina Zarrieß, Özge Alacam | N/A | N/A |
| Eyes Don’t Lie: Subjective Hate Annotation and Detection with Gaze | Özge Alacam, Sanne Hoeken, Sina Zarrieß | N/A | N/A |
| NumeroLogic: Number Encoding for Enhanced LLMs’ Numerical Reasoning | Eli Schwartz, Leshem Choshen, Joseph Shtok, Sivan Doveh, Leonid Karlinsky, Assaf Arbelle | N/A | N/A |
| Thinking Fair and Slow: On the Efficacy of Structured Prompts for Debiasing Language Models | Shaz Furniturewala, Surgan Jandial, Abhinav Java, Pragyan Banerjee, Simra Shahid, Sumit Bhatia, Kokil Jaidka | N/A | N/A |
| A Usage-centric Take on Intent Understanding in E-Commerce | Wendi Zhou, Tianyi Li, Pavlos Vougiouklis, Mark Steedman, Jeff Z. Pan | N/A | N/A |
| Fine-Tuning or Retrieval? Comparing Knowledge Injection in LLMs | Oded Ovadia, Menachem Brief, Moshik Mishaeli, Oren Elisha | N/A | N/A |
| Systematic Biases in LLM Simulations of Debates | Amir Taubenfeld, Yaniv Dover, Roi Reichart, Ariel Goldstein | N/A | N/A |
| Studying and Mitigating Biases in Sign Language Understanding Models | Katherine Atwell, Danielle Bragg, Malihe Alikhani | N/A | N/A |
| Uncertainty in Language Models: Assessment through Rank-Calibration | Xinmeng Huang, Shuo Li, Mengxin Yu, Matteo Sesia, Hamed Hassani, Insup Lee, Osbert Bastani, Edgar Dobriban | N/A | N/A |
| RoTBench: A Multi-Level Benchmark for Evaluating the Robustness of Large Language Models in Tool Learning | Junjie Ye, Yilong Wu, Songyang Gao, Caishuang Huang, Sixian Li, Guanyu Li, Xiaoran Fan, Qi Zhang, Tao Gui, Xuanjing Huang | N/A | N/A |
| Learning Planning-based Reasoning by Trajectories Collection and Process Reward Synthesizing | Fangkai Jiao, Chengwei Qin, Zhengyuan Liu, Nancy F. Chen, Shafiq Joty | N/A | N/A |
| Scaling Properties of Speech Language Models | Santiago Cuervo, Ricard Marxer | N/A | N/A |
| “We Demand Justice!”: Towards Social Context Grounding of Political Texts | Rajkumar Pujari, Chengfei Wu, Dan Goldwasser | N/A | N/A |
| An Experimental Analysis on Evaluating Patent Citations | Rabindra Nath Nandi, Suman Maity, Brian Uzzi, Sourav Medya | N/A | N/A |
| Fine-Tuning Large Language Models to Translate: Will a Touch of Noisy Data in Misaligned Languages Suffice? | Dawei Zhu, Pinzhen Chen, Miaoran Zhang, Barry Haddow, Xiaoyu Shen, Dietrich Klakow | N/A | N/A |
| Consolidating Ranking and Relevance Predictions of Large Language Models through Post-Processing | Le Yan, Zhen Qin, Honglei Zhuang, Rolf Jagerman, Xuanhui Wang, Michael Bendersky, Harrie Oosterhuis | N/A | N/A |
| Strength Lies in Differences! Towards Effective Non-collaborative Dialogues via Tailored Strategy Planning | Tong Zhang, Chen Huang, Yang Deng, Hongru Liang, Jia Liu, zujie wen, Wenqiang Lei, Tat-Seng Chua | N/A | N/A |
| Impeding LLM-assisted Cheating in Introductory Programming Assignments via Adversarial Perturbation | Saiful Islam Salim, Rubin Yuchan Yang, Alexander Cooper, Suryashree Ray, Saumya Debray, Sazzadur Rahaman | N/A | N/A |
| Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation | Yuan Ge, Yilun Liu, Chi Hu, Weibin Meng, shimin tao, Xiaofeng Zhao, Mahongxia, Zhang Li, Boxing Chen, Hao Yang, Bei Li, Tong Xiao, JingBo Zhu | N/A | N/A |
| On the Influence of Gender and Race in Romantic Relationship Prediction from Large Language Models | Abhilasha Sancheti, Haozhe An, Rachel Rudinger | N/A | N/A |
| EmphAssess: a Prosodic Benchmark on Assessing Emphasis Transfer in Speech-to-Speech Models | Maureen de Seyssel, Antony D’Avirro, Adina Williams, Emmanuel Dupoux | N/A | N/A |
| On Fake News Detection with LLM Enhanced Semantics Mining | Xiaoxiao Ma, Yuchen Zhang, Kaize Ding, Jian Yang, Jia Wu, Hao Fan | N/A | N/A |
| On Sensitivity of Learning with Limited Labelled Data to the Effects of Randomness: Impact of Interactions and Systematic Choices | Branislav Pecher, Ivan Srba, Maria Bielikova | N/A | N/A |
| Evaluating the Instruction-Following Robustness of Large Language Models to Prompt Injection | Zekun Li, Baolin Peng, Pengcheng He, Xifeng Yan | N/A | N/A |
| A Study of Nationality Bias in Names and Perplexity using Off-the-Shelf Affect-related Tweet Classifiers | Valentin Barriere, Sebastian Cifuentes | N/A | N/A |
| Mitigating the Alignment Tax of RLHF | Yong Lin, Hangyu Lin, Wei Xiong, Shizhe Diao, Jianmeng Liu, Jipeng Zhang, Rui Pan, Haoxiang Wang, Wenbin Hu, Hanning Zhang, Hanze Dong, Renjie Pi, Han Zhao, Nan Jiang, Heng Ji, Yuan Yao, Tong Zhang | N/A | N/A |
| Evaluating Readability and Faithfulness of Concept-based Explanations | Meng Li, Haoran Jin, Ruixuan HUANG, Zhihao Xu, Defu Lian, Zijia Lin, Di ZHANG, Xiting Wang | N/A | N/A |
| Personality-aware Student Simulation for Conversational Intelligent Tutoring Systems | Zhengyuan Liu, Stella Xin Yin, Geyu Lin, Nancy F. Chen | N/A | N/A |
| MSI-Agent: Incorporating Multi-Scale Insight into Embodied Agents for Superior Planning and Decision-Making | Dayuan Fu, Biqing Qi, Yihuai Gao, Che Jiang, Guanting Dong, Bowen Zhou | N/A | N/A |
| CoCoLoFa: A Dataset of News Comments with Common Logical Fallacies Written by LLM-Assisted Crowds | Min-Hsuan Yeh, Ruyuan Wan, Ting-Hao Kenneth Huang | N/A | N/A |
| Tokenization Is More Than Compression | Craig W Schmidt, Varshini Reddy, Haoran Zhang, Alec Alameddine, Omri Uzan, Yuval Pinter, Chris Tanner | N/A | N/A |
| FLIRT: Feedback Loop In-context Red Teaming | Ninareh Mehrabi, Palash Goyal, Christophe Dupuy, Qian Hu, Shalini Ghosh, Richard Zemel, Kai-Wei Chang, Aram Galstyan, Rahul Gupta | N/A | N/A |
| Successfully Guiding Humans with Imperfect Instructions by Highlighting Potential Errors and Suggesting Corrections | Lingjun Zhao, Khanh Xuan Nguyen, Hal Daumé III | N/A | N/A |
| Parameter-Efficient Sparsity Crafting from Dense to Mixture-of-Experts for Instruction Tuning on General Tasks | Haoyuan WU, Haisheng Zheng, Zhuolun He, Bei Yu | N/A | N/A |
| GeoGPT4V: Towards Geometric Multi-modal Large Language Models with Geometric Image Generation | Shihao Cai, Keqin Bao, Hangyu Guo, Jizhi Zhang, Jun Song, Bo Zheng | N/A | N/A |
| Improved Learned Sparse Retrieval with Entity Vocabulary | Thong Nguyen, Shubham Chatterjee, Sean MacAvaney, Iain Mackie, Jeff Dalton, Andrew Yates | N/A | N/A |
| Let the Expert Stick to His Last: Expert-Specialized Fine-Tuning for Sparse Architectural Large Language Models | Zihan Wang, Deli Chen, Damai Dai, Runxin Xu, Zhuoshu Li, Yu Wu | N/A | N/A |
| LongEmbed: Extending Embedding Models for Long Context Retrieval | Dawei Zhu, Liang Wang, Nan Yang, Yifan Song, Wenhao Wu, Furu Wei, Sujian Li | N/A | N/A |
| Making Large Language Models Better Reasoners with Orchestrated Streaming Experiences | Xiangyang Liu, Junliang He, Xipeng Qiu | N/A | N/A |
| Overcome Noise and Bias: Segmentation-Aided Multi-Granularity Denoising and Debiasing for Enhanced Quarduples Extraction in Dialogue | Xianlong Luo, Yihao Wang, Meng Yang | N/A | N/A |
| Integrating Plutchik’s Theory with Mixture of Experts for Enhancing Emotion Classification | Dongjun LIM, Yun-Gyung Cheong | N/A | N/A |
| In-context Contrastive Learning for Event Causality Identification | 梁超, Wei Xiang, Bang Wang | N/A | N/A |
| What’s Mine becomes Yours: Defining, Annotating and Detecting Context-Dependent Paraphrases in News Interview Dialogs | Anna Wegmann, Tijs A. van den Broek, Dong Nguyen | N/A | N/A |
| Language Models Learn Rare Phenomena from Less Rare Phenomena: The Case of the Missing AANNs | Kanishka Misra, Kyle Mahowald | N/A | N/A |
| Large Language Models for Data Annotation: A Survey | Zhen Tan, Dawei Li, Song Wang, Alimohammad Beigi, Bohan Jiang, Amrita Bhattacharjee, Mansooreh Karami, Jundong Li, Lu Cheng, huan liu | N/A | N/A |
| Chain-of-Dictionary Prompting Elicits Translation in Large Language Models | Hongyuan Lu, HAORAN YANG, Haoyang Huang, Dongdong Zhang, Wai Lam, Furu Wei | N/A | N/A |
| AdaZeta: Adaptive Zeroth-Order Tensor-Train Adaption for Memory-Efficient Large Language Models Fine-Tuning | Yifan Yang, Kai Zhen, Ershad Banijamali, Athanasios Mouchtaris, Zheng Zhang | N/A | N/A |
| RoseLoRA: Row and Column-wise Sparse Low-rank Adaptation of Pre-trained Language Model for Knowledge Editing and Fine-tuning | Haoyu Wang, Tianci Liu, Ruirui Li, Monica Xiao Cheng, Tuo Zhao, Jing Gao | N/A | N/A |
| BlendFilter: Advancing Retrieval-Augmented Large Language Models via Query Generation Blending and Knowledge Filtering | Haoyu Wang, Ruirui Li, Haoming Jiang, Jinjin Tian, Zhengyang Wang, chen luo, Xianfeng Tang, Monica Xiao Cheng, Tuo Zhao, Jing Gao | N/A | N/A |
| HEART-felt Narratives: Tracing Empathy and Narrative Style in Personal Stories with LLMs | Jocelyn J Shen, Joel Mire, Hae Won Park, Cynthia Breazeal, Maarten Sap | N/A | N/A |
| Eliminating Biased Length Reliance of Direct Preference Optimization via Down-Sampled KL Divergence | Junru Lu, Jiazheng Li, Siyu An, Meng Zhao, Yulan He, di yin, Xing Sun | N/A | N/A |
| Bridging Cultures in the Kitchen: A Framework and Benchmark for Cross-Cultural Recipe Retrieval | Tianyi Hu, Maria Maistro, Daniel Hershcovich | N/A | N/A |
| RULE: Reliable Multimodal RAG for Factuality in Medical Vision Language Models | Peng Xia, Kangyu Zhu, Haoran Li, Hongtu Zhu, Yun Li, Gang Li, Linjun Zhang, Huaxiu Yao | N/A | N/A |
| A Reflective LLM-based Agent to Guide Zero-shot Cryptocurrency Trading | Yuan Li, Bingqiao Luo, Qian Wang, Nuo Chen, Xu Liu, Bingsheng He | N/A | N/A |
| A Survey on In-context Learning | Qingxiu Dong, Lei Li, Damai Dai, Ce Zheng, Jingyuan Ma, Rui Li, Heming Xia, Jingjing Xu, Zhiyong Wu, Baobao Chang, Xu Sun, Lei Li, Zhifang Sui | N/A | N/A |
| DocHieNet: A Large and Diverse Dataset for Document Hierarchy Parsing | Hangdi Xing, Changxu Cheng, Feiyu Gao, Zirui Shao, Zhi Yu, Jiajun Bu, Qi Zheng, Cong Yao | N/A | N/A |
| AMR-Evol: Adaptive Modular Response Evolution Elicits Better Knowledge Distillation for Large Language Models in Code Generation | Ziyang Luo, Xin Li, Hongzhan Lin, Jing Ma, Lidong Bing | N/A | N/A |
| EFUF: Efficient Fine-Grained Unlearning Framework for Mitigating Hallucinations in Multimodal Large Language Models | Shangyu Xing, Fei Zhao, Zhen Wu, Tuo An, Weihao Chen, Chunhui Li, Jianbing Zhang, Xinyu Dai | N/A | N/A |
| Rethinking Pruning Large Language Models: Benefits and Pitfalls of Reconstruction Error Minimization | Sungbin Shin, Wonpyo Park, Jaeho Lee, Namhoon Lee | N/A | N/A |
| LLMs Are Zero-Shot Context-Aware Simultaneous Translators | Roman Koshkin, Katsuhito Sudoh, Satoshi Nakamura | N/A | N/A |
| AgentReview: Exploring Peer Review Dynamics with LLM Agents | Yiqiao Jin, Qinlin Zhao, Yiyang Wang, Hao Chen, Kaijie Zhu, Yijia Xiao, Jindong Wang | N/A | N/A |
| ChatRetriever: Adapting Large Language Models for Generalized and Robust Conversational Dense Retrieval | Kelong Mao, Chenlong Deng, Haonan Chen, Fengran Mo, Zheng Liu, Tetsuya Sakai, Zhicheng Dou | N/A | N/A |
| Fairer Preferences Elicit Improved Human-Aligned Large Language Model Judgments | Han Zhou, Xingchen Wan, Yinhong Liu, Nigel Collier, Ivan Vulić, Anna Korhonen | N/A | N/A |
| Learning Interpretable Legal Case Retrieval via Knowledge-Guided Case Reformulation | Chenlong Deng, Kelong Mao, Zhicheng Dou | N/A | N/A |
| Effective Demonstration Annotation for In-Context Learning via Language Model-Based Determinantal Point Process | Peng Wang, Xiaobin Wang, Chao Lou, Shengyu Mao, Pengjun Xie, Yong Jiang | N/A | N/A |
| Pre-trained Language Models Do Not Help Auto-regressive Text-to-Image Generation | Yuhui Zhang, Brandon McKinzie, Zhe Gan, Vaishaal Shankar, Alexander T Toshev | N/A | N/A |
| QUDSELECT: Selective Decoding for Questions Under Discussion Parsing | Ashima Suvarna, Xiao Liu, Tanmay Parekh, Kai-Wei Chang, Nanyun Peng | N/A | N/A |
| Mitigating Language Bias of LMMs in Social Intelligence Understanding with Virtual Counterfactual Calibration | Peng Chen, Xiao-Yu Guo, Yuan-Fang Li, Xiaowang Zhang, Zhiyong Feng | N/A | N/A |
| Model Balancing Helps Low-data Training and Fine-tuning | Zihang Liu, Yuanzhe Hu, Tianyu Pang, Yefan Zhou, Pu Ren, Yaoqing Yang | N/A | N/A |
| Reuse Your Rewards: Reward Model Transfer for Zero-Shot Cross-Lingual Alignment | Zhaofeng Wu, Ananth Balashankar, Yoon Kim, Jacob Eisenstein, Ahmad Beirami | N/A | N/A |
| Large Language Models as Foundations for Next-Gen Dense Retrieval: A Comprehensive Empirical Assessment | Kun Luo, Minghao Qin, Zheng Liu, Shitao Xiao, Jun Zhao, Kang Liu | N/A | N/A |
| A New Pipeline for Knowledge Graph Reasoning Enhanced by Large Language Models Without Fine-Tuning | Zhongwu Chen, Long Bai, Zixuan Li, Zhen Huang, Xiaolong Jin, Yong Dou | N/A | N/A |
| Towards Tool Use Alignment of Large Language Models | Zhi-Yuan Chen, Shiqi Shen, Guangyao Shen, Gong Zhi, Xu Chen, Yankai Lin | N/A | N/A |
| DecorateLM: Data Engineering through Corpus Rating, Tagging, and Editing with Language Models | Ranchi Zhao, Zhen Leng Thai, Yifan Zhang, Shengding Hu, Jie Zhou, Yunqi Ba, Jie Cai, Zhiyuan Liu, Maosong Sun | N/A | N/A |
| Lookback Lens: Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps | Yung-Sung Chuang, Linlu Qiu, Cheng-Yu Hsieh, Ranjay Krishna, Yoon Kim, James R. Glass | N/A | N/A |
| Controllable Preference Optimization: Toward Controllable Multi-Objective Alignment | Yiju Guo, Ganqu Cui, Lifan Yuan, Ning Ding, Zexu Sun, Bowen Sun, Huimin Chen, Ruobing Xie, Jie Zhou, Yankai Lin, Zhiyuan Liu, Maosong Sun | N/A | N/A |
| Mitigating Matthew Effect: Multi-Hypergraph Boosted Multi-Interest Self-Supervised Learning for Conversational Recommendation | Yongsen Zheng, Ruilin Xu, Guohua Wang, Liang Lin | N/A | N/A |
| Advancing Event Causality Identification via Heuristic Semantic Dependency Inquiry Network | Haoran Li, Qiang Gao, Hongmei Wu, Li Huang | N/A | N/A |
| Exploring Union and Intersection of Visual Regions for Generating Questions, Answers, and Distractors | Wenjian Ding, YAO ZHANG, Jun Wang, Adam Jatowt, Zhenglu Yang | N/A | N/A |
| UniFashion: A Unified Vision-Language Model for Multimodal Fashion Retrieval and Generation | Xiangyu Zhao, Yuehan Zhang, zhangwenlong, Xiao-Ming Wu | N/A | N/A |
| Tracking the perspectives of interacting language models | Hayden Helm, Brandon Duderstadt, Youngser Park, Carey Priebe | N/A | N/A |
| MAR: Matching-Augmented Reasoning for Enhancing Visual-based Entity Question Answering | Zhengxuan Zhang, Yin WU, Yuyu Luo, Nan Tang | N/A | N/A |
| Can Large Language Models Always Solve Easy Problems if They Can Solve Harder Ones? | Zhe Yang, Yichang Zhang, Tianyu Liu, Jian Yang, Junyang Lin, Chang Zhou, Zhifang Sui | N/A | N/A |
| Watch Every Step! LLM Agent Learning via Iterative Step-level Process Refinement | Weimin Xiong, Yifan Song, Xiutian Zhao, Wenhao Wu, Xun Wang, Ke Wang, Cheng Li, Wei Peng, Sujian Li | N/A | N/A |
| Standardize: Aligning Language Models with Expert-Defined Standards for Content Generation | Joseph Marvin Imperial, Gail Forey, Harish Tayyar Madabushi | N/A | N/A |
| Cross-domain NER with Generated Task-Oriented Knowledge: An Empirical Study from Information Density Perspective | Zhihao Zhang, Sophia Yat Mei Lee, Junshuang Wu, Dong Zhang, Shoushan Li, Erik Cambria, Guodong Zhou | N/A | N/A |
| “Glue pizza and eat rocks” - Exploiting Vulnerabilities in Retrieval-Augmented Generative Models | Zhen Tan, Chengshuai Zhao, Raha Moraffah, Yifan Li, Song Wang, Jundong Li, Tianlong Chen, Huan Liu | N/A | N/A |
| Predicate Debiasing in Vision-Language Models Integration for Scene Graph Generation Enhancement | Yuxuan Wang, Xiaoyuan Liu | N/A | N/A |
| SHIELD: Evaluation and Defense Strategies for Copyright Compliance in LLM Text Generation | Xiaoze Liu, Ting Sun, Tianyang Xu, Feijie Wu, Cunxiang Wang, Xiaoqian Wang, Jing Gao | N/A | N/A |
| MatchTime: Towards Automatic Soccer Game Commentary Generation | Jiayuan Rao, Haoning Wu, Chang Liu, Yanfeng Wang, Weidi Xie | N/A | N/A |
| Rethinking Token Reduction for State Space Models | Zheng Zhan, Yushu Wu, Zhenglun Kong, Changdi Yang, Yifan Gong, Xuan Shen, Xue Lin, Pu Zhao, Yanzhi Wang | N/A | N/A |
| Triad: A Framework Leveraging a Multi-Role LLM-based Agent to Solve Knowledge Base Question Answering | Chang Zong, Yuchen Yan, Weiming Lu, Jian Shao, Yongfeng Huang, Heng Chang, Yueting Zhuang | N/A | N/A |
| MetaGPT: Merging Large Language Models Using Model Exclusive Task Arithmetic | Yuyan Zhou, Liang Song, Bingning Wang, Weipeng Chen | N/A | N/A |
| Event Causality Identification with Synthetic Control | Haoyu Wang, Fengze Liu, Jiayao Zhang, Dan Roth, Kyle Richardson | N/A | N/A |
| Retrieved Sequence Augmentation for Protein Representation Learning | Chang Ma, Haiteng Zhao, Lin Zheng, Jiayi Xin, Qintong Li, Lijun Wu, Zhihong Deng, Yang Young Lu, Qi Liu, Sheng Wang, Lingpeng Kong | N/A | N/A |
| HELPD: Mitigating Hallucination of LVLMs by Hierarchical Feedback Learning with Vision-enhanced Penalty Decoding | Fan Yuan, Chi Qin, Xiaogang Xu, Piji Li | N/A | N/A |
| TopViewRS: Vision-Language Models as Top-View Spatial Reasoners | Chengzu Li, Caiqi Zhang, Han Zhou, Nigel Collier, Anna Korhonen, Ivan Vulić | N/A | N/A |
| DA$^3$: A Distribution-Aware Adversarial Attack against Language Models | Yibo Wang, Xiangjue Dong, James Caverlee, Philip S. Yu | N/A | N/A |
| Evaluating Psychological Safety of Large Language Models | Xingxuan Li, Yutong Li, Lin Qiu, Shafiq Joty, Lidong Bing | N/A | N/A |
| An Effective Deployment of Diffusion LM for Data Augmentation in Low-Resource Sentiment Classification | Zhuowei Chen, Lianxi Wang, Yuben Wu, Xinfeng Liao, Yujia Tian, Junyang Zhong | N/A | N/A |
| Self-Bootstrapped Visual-Language Model for Knowledge Selection and Question Answering | Dongze Hao, Qunbo Wang, Longteng Guo, Jie Jiang, Jing Liu | N/A | N/A |
| PsFuture: A Pseudo-Future-based Zero-Shot Adaptive Policy for Simultaneous Machine Translation | Libo Zhao, Jing Li, Ziqian Zeng | N/A | N/A |
| TinyChart: Efficient Chart Understanding with Program-of-Thoughts Learning and Visual Token Merging | Liang Zhang, Anwen Hu, Haiyang Xu, Ming Yan, Yichen Xu, Qin Jin, Ji Zhang, Fei Huang | N/A | N/A |
| Do We Need Language-Specific Fact-Checking Models? The Case of Chinese | Caiqi Zhang, Zhijiang Guo, Andreas Vlachos | N/A | N/A |
| Enhancing Advanced Visual Reasoning Ability of Large Language Models | Zhiyuan Li, Dongnan Liu, Chaoyi Zhang, Heng Wang, Tengfei Xue, Weidong Cai | N/A | N/A |
| CMD: a framework for Context-aware Model self-Detoxification | Zecheng Tang, Keyan Zhou, Juntao Li, Yuyang Ding, Pinzheng Wang, Yan Bowen, Renjie Hua, Min Zhang | N/A | N/A |
| Embedding and Gradient Say Wrong: A White-Box Method for Hallucination Detection | Xiaomeng Hu, Yiming Zhang, Ru Peng, Haozhe Zhang, Chenwei Wu, Gang Chen, Junbo Zhao | N/A | N/A |
| TCSinger: Zero-Shot Singing Voice Synthesis with Style Transfer and Multi-Level Style Control | Yu Zhang, Ziyue Jiang, Ruiqi Li, Changhao Pan, Jinzheng He, Rongjie Huang, Chuxin Wang, Zhou Zhao | N/A | N/A |
| Be Helpful but Don’t Talk too Much - Enhancing Helpfulness in Conversations through Relevance in Multi-Turn Emotional Support | Junlin Li, Bo Peng, Yu-Yin Hsu, Chu-Ren Huang | N/A | N/A |
| Aligning Language Models to Explicitly Handle Ambiguity | Hyuhng Joon Kim, Youna Kim, Cheonbok Park, Junyeob Kim, Choonghyun Park, Kang Min Yoo, Sang-goo Lee, Taeuk Kim | N/A | N/A |
| Tag-grounded Visual Instruction Tuning with Retrieval Augmentation | Daiqing Qi, Handong Zhao, Zijun Wei, Sheng Li | N/A | N/A |
| GLaPE: Gold Label-agnostic Prompt Evaluation for Large Language Models | Xuanchang Zhang, Zhuosheng Zhang, Hai Zhao | N/A | N/A |
| Decoding the Echoes of Vision from fMRI: Memory Disentangling for Past Semantic Information | Runze Xia, Congchi Yin, Piji Li | N/A | N/A |
| Optimizing Code Retrieval: High-Quality and Scalable Dataset Annotation through Large Language Models | Rui Li, Qi Liu, Liyang He, Zheng Zhang, Hao Zhang, Shengyu Ye, Junyu Lu, Zhenya Huang | N/A | N/A |
| Towards Difficulty-Agnostic Efficient Transfer Learning for Vision-Language Models | Yongjin Yang, Jongwoo Ko, Se-Young Yun | N/A | N/A |
| Advancing Process Verification for Large Language Models via Tree-Based Preference Learning | Mingqian He, Yongliang Shen, Wenqi Zhang, Zeqi Tan, Weiming Lu | N/A | N/A |
| An Inversion Attack Against Obfuscated Embedding Matrix in Language Model Inference | Yu Lin, Qizhi Zhang, Quanwei Cai, Jue Hong, Wu Ye, Huiqi Liu, Bing Duan | N/A | N/A |
| MantisScore: A Reliable Fine-grained Metric for Video Generation | Xuan He, Dongfu Jiang, Ge Zhang, Max Ku, Achint Soni, Sherman Siu, Haonan Chen, Abhranil Chandra, Ziyan Jiang, Aaran Arulraj, Kai Wang, Quy Duc Do, Yuansheng Ni, Bohan Lyu, Yaswanth Narsupalli, Rongqi Fan, Zhiheng Lyu, Bill Yuchen Lin, Wenhu Chen | N/A | N/A |
| A ∧ B ⇔ B ∧ A: Evaluating and Improving Logical Reasoning Ability of Large Language Models | Yuxuan Wan, Wenxuan Wang, Yiliu Yang, Youliang Yuan, Jen-tse Huang, Pinjia He, Wenxiang Jiao, Michael Lyu | N/A | N/A |
| Integrating Structural Semantic Knowledge for Enhanced Information Extraction Pre-training | Xiaoyang Yi, Yuru Bao, Jian Zhang, Yifang Qin, Faxin Lin | N/A | N/A |
| FuseGen: PLM Fusion for Data-generation based Zero-shot Learning | Tianyuan Zou, Yang Liu, Peng Li, Jianqing Zhang, Jingjing Liu, Ya-Qin Zhang | N/A | N/A |
| I Need Help! Evaluating LLM’s Ability to Ask for Users’ Support: A Case Study on Text-to-SQL Generation | Cheng-Kuang Wu, Zhi Rui Tam, Chao-Chung Wu, Chieh-Yen Lin, Hung-yi Lee, Yun-Nung Chen | N/A | N/A |
| Oddballs and Misfits: Detecting Implicit Abuse in Which Identity Groups are Depicted as Deviating from the Norm | Michael Wiegand, Josef Ruppenhofer | N/A | N/A |
| By My Eyes: Grounding Multimodal Large Language Models with Sensor Data via Visual Prompting | Hyungjun Yoon, Biniyam Aschalew Tolera, Taesik Gong, Kimin Lee, Sung-Ju Lee | N/A | N/A |
| Prefixing Attention Sinks can Mitigate Activation Outliers for Large Language Model Quantization | Seungwoo Son, Wonpyo Park, Woohyun Han, Kyuyeun Kim, Jaeho Lee | N/A | N/A |
| CHIQ: Contextual History Enhancement for Improving Query Rewriting in Conversational Search | Fengran Mo, Abbas Ghaddar, Kelong Mao, Mehdi Rezagholizadeh, Boxing Chen, Qun Liu, Jian-Yun Nie | N/A | N/A |
| Towards Low-Resource Harmful Meme Detection with LMM Agents | Jianzhao Huang, Hongzhan Lin, Ziyan Liu, Ziyang Luo, Guang Chen, Jing Ma | N/A | N/A |
| VIVA: A Benchmark for Vision-Grounded Decision-Making with Human Values | Zhe Hu, Yixiao Ren, Jing Li, Yu Yin | N/A | N/A |
| Direct Multi-Turn Preference Optimization for Language Agents | Wentao Shi, Mengqi Yuan, Junkang Wu, Qifan Wang, Fuli Feng | N/A | N/A |
| Self-Refine Instruction-Tuning for Aligning Reasoning in Language Models | Leonardo Ranaldi, Andre Freitas | N/A | N/A |
| In Search of the Long-Tail: Systematic Generation of Long-Tail Inferential Knowledge via Logical Rule Guided Search | Huihan Li, Yuting Ning, Zeyi Liao, Siyuan Wang, Xiang Lorraine Li, Ximing Lu, Wenting Zhao, Faeze Brahman, Yejin Choi, Xiang Ren | N/A | N/A |
| AutoScraper: A Progressive Understanding Web Agent for Web Scraper Generation | Wenhao Huang, Zhouhong Gu, Chenghao Peng, Jiaqing Liang, Zhixu Li, Yanghua Xiao, Liqian Wen, Zulong Chen | N/A | N/A |
| Backward Lens: Projecting Language Model Gradients into the Vocabulary Space | Shahar Katz, Yonatan Belinkov, Mor Geva, Lior Wolf | N/A | N/A |
| Selective Vision is the Challenge for Visual Reasoning: A Benchmark for Visual Argument Understanding | Jiwan Chung, Sungjae Lee, Minseo Kim, Seungju Han, Ashkan Yousefpour, Jack Hessel, Youngjae Yu | N/A | N/A |
| Can visual language models resolve textual ambiguity with visual cues? Let visual puns tell you! | Jiwan Chung, Seungwon Lim, Jaehyun Jeon, Seungbeen Lee, Youngjae Yu | N/A | N/A |
| Reusing Transferable Weight Increments for Low-resource Style Generation | Chunzhen Jin, Eliot Huang, Heng Chang, Yaqi Wang, Peng Cao, Osmar Zaiane | N/A | N/A |
| Large Language Model as an Assignment Evaluator: Insights, Feedback, and Challenges in a 1000+ Student Course | Cheng-Han Chiang, Wei-Chih Chen, Chun-Yi Kuan, Chienchou Yang, Hung-yi Lee | N/A | N/A |
| Seemingly Plausible Distractors in Multi-Hop Reasoning: Are Large Language Models Attentive Readers? | Neeladri Bhuiya, Viktor Schlegel, Stefan Winkler | N/A | N/A |
| Instruction Pre-Training: Language Models are Supervised Multitask Learners | Daixuan Cheng, Yuxian Gu, Shaohan Huang, Junyu Bi, Minlie Huang, Furu Wei | N/A | N/A |
| LEMoE: Advanced Mixture of Experts Adaptor for Lifelong Model Editing of Large Language Models | Renzhi Wang, Piji Li | N/A | N/A |
| Collaborative Performance Prediction for Large Language Models | Qiyuan Zhang, Fuyuan Lyu, Xue Liu, Chen Ma | N/A | N/A |
| Surveying the Dead Minds: Historical-Psychological Text Analysis with Contextualized Construct Representation (CCR) for Classical Chinese | Yuqi Chen, Sixuan Li, Ying Li, Mohammad Atari | N/A | N/A |
| Knowledge Verification to Nip Hallucination in the Bud | Fanqi Wan, Xinting Huang, Leyang Cui, Xiaojun Quan, Wei Bi, Shuming Shi | N/A | N/A |
| QUITE: Quantifying Uncertainty in Natural Language Text in Bayesian Reasoning Scenarios | Timo Pierre Schrader, Lukas Lange, Simon Razniewski, Annemarie Friedrich | N/A | N/A |
| African or European Swallow? Benchmarking Large Vision-Language Models for Fine-Grained Object Classification | Gregor Geigle, Radu Timofte, Goran Glavaš | N/A | N/A |
| Whispers that Shake Foundations: Analyzing and Mitigating False Premise Hallucinations in Large Language Models | Hongbang Yuan, Pengfei Cao, Zhuoran Jin, Yubo Chen, Daojian Zeng, Kang Liu, Jun Zhao | N/A | N/A |
| To Word Senses and Beyond: Inducing Concepts with Contextualized Language Models | Bastien Liétard, Pascal Denis, Mikaela Keller | N/A | N/A |
| ASETF: A Novel Method for Jailbreak Attack on LLMs through Translate Suffix Embeddings | Hao Wang, Hao Li, Minlie Huang, Lei Sha | N/A | N/A |
| An Electoral Approach to Diversify LLM-based Multi-Agent Collective Decision-Making | Xiutian Zhao, Ke Wang, Wei Peng | N/A | N/A |
| Does Object Grounding Really Reduce Hallucination of Large Vision-Language Models? | Gregor Geigle, Radu Timofte, Goran Glavaš | N/A | N/A |
| Take Off the Training Wheels! Progressive In-Context Learning for Effective Alignment | Zhenyu Liu, Dongfang Li, Xinshuo Hu, Xinping Zhao, Yibin Chen, Baotian Hu, Min Zhang | N/A | N/A |
| MoDULA: Mixture of Domain-Specific and Universal LoRA for Multi-Task Learning | Yufei Ma, Zihan Liang, Huangyu Dai, Ben Chen, Dehong Gao, Zhuoran Ran, Zihan Wang, Linbo Jin, Wen Jiang, Guannan Zhang, Xiaoyan Cai, Libin Yang | N/A | N/A |
| Message Passing on Semantic-Anchor-Graphs for Fine-grained Emotion Representation Learning and Classification | Pinyi Zhang, Jingyang Chen, Junchen Shen, Zijie Zhai, Ping Li, Jie Zhang, Kai Zhang | N/A | N/A |
| PhiloGPT: A Philology-Oriented Large Language Model for Ancient Chinese Manuscripts with Dunhuang as Case Study | Yuqing Zhang, Baoyi He, Yihan Chen, Hangqi Li, Han Yue, Shengyu Zhang, Huaiyong Dou, Junchi Yan, Zemin Liu, Yongquan Zhang, Fei Wu | N/A | N/A |
| Alignment-Enhanced Decoding: Defending via Token-Level Adaptive Refining of Probability Distributions | Quan Liu, Zhenhong Zhou, Longzhu He, Yi Liu, Wei Zhang, Sen Su | N/A | N/A |
| MiniConGTS: A Near Ultimate Minimalist Contrastive Grid Tagging Scheme for Aspect Sentiment Triplet Extraction | Qiao Sun, Liujia Yang, Minghao Ma, Nanyang Ye, Qinying Gu | N/A | N/A |
| Evaluating Large Language Models via Linguistic Profiling | Alessio Miaschi, Felice Dell’Orletta, Giulia Venturi | N/A | N/A |
| With Ears to See and Eyes to Hear: Sound Symbolism Experiments with Multimodal Large Language Models | Tyler Loakman, Yucheng Li, Chenghua Lin | N/A | N/A |
| KB-Plugin: A Plug-and-play Framework for Large Language Models to Induce Programs over Low-resourced Knowledge Bases | Jiajie Zhang, Shulin Cao, Linmei Hu, Ling Feng, Lei Hou, Juanzi Li | N/A | N/A |
| Understanding Higher-Order Correlations Among Semantic Components in Embeddings | Momose Oyama, Hiroaki Yamagiwa, Hidetoshi Shimodaira | N/A | N/A |
| DGLF: A Dual Graph-based Learning Framework for Multi-modal Sarcasm Detection | Zhihong Zhu, Kefan Shen, Zhaorun Chen, Yunyan Zhang, Yuyan Chen, Xiaoqi Jiao, Zhongwei Wan, Wei Liu, Xian Wu, Shaorong Xie, Yefeng Zheng | N/A | N/A |
| Evaluating D-MERIT of Partial-annotation on Information Retrieval | Royi Rassin, Yaron Fairstein, Oren Kalinsky, Guy Kushilevitz, Nachshon Cohen, Alexander Libov, Yoav Goldberg | N/A | N/A |
| Verification and Refinement of Natural Language Explanations through LLM-Symbolic Theorem Proving | Xin Quan, Marco Valentino, Louise A. Dennis, Andre Freitas | N/A | N/A |
| Calibrating the Confidence of Large Language Models by Eliciting Fidelity | Mozhi Zhang, Mianqiu Huang, Rundong Shi, Linsen Guo, Chong Peng, Peng Yan, Yaqian Zhou, Xipeng Qiu | N/A | N/A |
| Exploring Reward Model Strength’s Impact on Language Models | Yanjun Chen, Dawei Zhu, Yirong Sun, Xinghao Chen, Wei Zhang, Xiaoyu Shen | N/A | N/A |
| How Hard is this Test Set? NLI Characterization by Exploiting Training Dynamics | Adrian Cosma, Stefan Ruseti, Mihai Dascalu, Cornelia Caragea | N/A | N/A |
| Zero-shot Cross-Lingual Transfer for Synthetic Data Generation in Grammatical Error Detection | Gaetan Lopez Latouche, Marc-André Carbonneau, Benjamin Swanson | N/A | N/A |
| CUTE: Measuring LLMs’ Understanding of Their Tokens | Lukas Edman, Helmut Schmid, Alexander Fraser | N/A | N/A |
| SEER: Self-Aligned Evidence Extraction for Retrieval-Augmented Generation | Xinping Zhao, Dongfang Li, Yan Zhong, Boren Hu, Yibin Chen, Baotian Hu, Min Zhang | N/A | N/A |
| On The Role of Context in Reading Time Prediction | Andreas Opedal, Eleanor Chodroff, Ryan Cotterell, Ethan Wilcox | N/A | N/A |
| BC-Prover: Backward Chaining Prover for Formal Theorem Proving | Yuhang He, Jihai Zhang, Jianzhu Bao, Fangquan Lin, Cheng Yang, Bing Qin, Ruifeng Xu, Wotao Yin | N/A | N/A |
| From Insights to Actions: The Impact of Interpretability and Analysis Research on NLP | Marius Mosbach, Vagrant Gautam, Tomás Vergara Browne, Dietrich Klakow, Mor Geva | N/A | N/A |
| Dual Modalities of Text: Visual and Textual Generative Pre-Training | Yekun Chai, Qingyi Liu, Jingwu Xiao, Shuohuan Wang, Yu Sun, Hua Wu | N/A | N/A |
| On Training Data Influence of GPT Models | Qingyi Liu, Yekun Chai, Shuohuan Wang, Yu Sun, Qiwei Peng, Hua Wu | N/A | N/A |
| Understanding “Democratization” in NLP and ML Research | Arjun Subramonian, Vagrant Gautam, Dietrich Klakow, Zeerak Talat | N/A | N/A |
| DocKD: Knowledge Distillation from LLMs for Open-World Document Understanding Models | Sungnyun Kim, Haofu Liao, Srikar Appalaraju, Peng Tang, Zhuowen Tu, Ravi Kumar Satzoda, R. Manmatha, Vijay Mahadevan, Stefano Soatto | N/A | N/A |
| Cross-lingual Transfer for Automatic Question Generation by Learning Interrogative Structures in Target Languages | Seonjeong Hwang, Yunsu Kim, Gary Lee | N/A | N/A |
| ScalingFilter: Assessing Data Quality through Inverse Utilization of Scaling Laws | Ruihang Li, Yixuan Wei, Miaosen Zhang, Nenghai Yu, Han Hu, Houwen Peng | N/A | N/A |
| Word Alignment as Preference for Machine Translation | Qiyu Wu, Masaaki Nagata, Zhongtao Miao, Yoshimasa Tsuruoka | N/A | N/A |
| Improving Multi-party Dialogue Generation via Topic and Rhetorical Coherence | Yaxin Fan, Peifeng Li, Qiaoming Zhu | N/A | N/A |
| SEEKR: Selective Attention-Guided Knowledge Retention for Continual Learning of Large Language Models | Jinghan He, Haiyun Guo, Kuan Zhu, Zihan Zhao, Ming Tang, Jinqiao Wang | N/A | N/A |
| Neuron-Level Knowledge Attribution in Large Language Models | Zeping Yu, Sophia Ananiadou | N/A | N/A |
| How do Large Language Models Learn In-Context? Query and Key Matrices of In-Context Heads are Two Towers for Metric Learning | Zeping Yu, Sophia Ananiadou | N/A | N/A |
| Interpreting Arithmetic Mechanism in Large Language Models through Comparative Neuron Analysis | Zeping Yu, Sophia Ananiadou | N/A | N/A |
| Pixology: Probing the Linguistic and Visual Knowledge of Pixel-based Language Models | Kushal Tatariya, Vladimir Araujo, Thomas Bauwens, Miryam de Lhoneux | N/A | N/A |
| GoldCoin: Grounding Large Language Models in Privacy Laws via Contextual Integrity Theory | Wei Fan, Haoran Li, Zheye Deng, Weiqi Wang, Yangqiu Song | N/A | N/A |
| Noise, Novels, Numbers. A Framework for Detecting and Categorizing Noise in Danish and Norwegian Literature | Ali Allaith, Daniel Hershcovich, Jens Bjerring-Hansen, Jakob Ingemann Parby, Alexander Conroy, Timothy R Tangherlini | N/A | N/A |
| QUIK: Towards End-to-end 4-Bit Inference on Generative Large Language Models | Saleh Ashkboos, Ilia Markov, Elias Frantar, Tingxuan Zhong, Xincheng Wang, Jie Ren, Torsten Hoefler, Dan Alistarh | N/A | N/A |
| Fine-Grained Prediction of Reading Comprehension from Eye Movements | Omer Shubi, Yoav Meiri, Cfir Avraham Hadar, Yevgeni Berzak | N/A | N/A |
| Efficient Retriever for Multi-Hop Retrieval Question Answering | Ziyuan Zhuang, Zhiyang Zhang, Sitao Cheng, Fangkai Yang, Jia Liu, Shujian Huang, Qingwei Lin, Saravan Rajmohan, Dongmei Zhang, Qi Zhang | N/A | N/A |
| Unsupervised Human Preference Learning | Sumuk Shashidhar, Abhinav Chinta, Vaibhav Sahai, Dilek Hakkani Tur | N/A | N/A |
| Is Safer Better? The Impact of Guardrails on the Argumentative Strength of LLMs in Hate Speech Countering | Helena Bonaldi, Greta Damo, Nicolás Benjamín Ocampo, Elena Cabrio, Serena Villata, Marco Guerini | N/A | N/A |
| Leading Whitespaces of Language Models’ Subword Vocabulary Poses a Confound for Calculating Word Probabilities | Byung-Doh Oh, William Schuler | N/A | N/A |
| LLM4Decompile: Decompiling Binary Code with Large Language Models | Hanzhuo Tan, Qi Luo, Jing Li, Yuqun Zhang | N/A | N/A |
| From Bottom to Top: Extending the Potential of Parameter Efficient Fine-Tuning | Jihao Gu, Zelin Wang, Yibo Zhang, Ziji Zhang, Ping Gong | N/A | N/A |
| CoTKR: Chain-of-Thought Enhanced Knowledge Rewriting for Complex Knowledge Graph Question Answering | Yike Wu, Yi Huang, Nan Hu, Yuncheng Hua, Guilin Qi, Jiaoyan Chen, Jeff Z. Pan | N/A | N/A |
| MTLS: Making Texts into Linguistic Symbols | Wenlong Fei, Xiaohua Wang, Min Hu, Qingyu Zhang, Hongbo Li | N/A | N/A |
| D2R: Dual-Branch Dynamic Routing Network for Multimodal Sentiment Detection | Yifan Chen, Kuntao Li, Weixing Mai, Qiaofeng Wu, Yun Xue, Fenghuan Li | N/A | N/A |
| A Generic Method for Fine-grained Category Discovery in Natural Language Texts | Chang Tian, Matthew B. Blaschko, Wenpeng Yin, Mingzhe Xing, Yinliang Yue, Marie-Francine Moens | N/A | N/A |
| Toxicity Detection is NOT all you Need: Measuring the Gaps to Supporting Volunteer Content Moderators through a User-Centric Method | Yang Trista Cao, Lovely-Frances Domingo, Sarah Gilbert, Michelle L. Mazurek, Katherine Shilton, Hal Daumé III | N/A | N/A |
| A User-Centric Multi-Intent Benchmark for Evaluating Large Language Models | Jiayin Wang, Fengran Mo, Weizhi Ma, Peijie Sun, Min Zhang, Jian-Yun Nie | N/A | N/A |
| Decompose and Compare Consistency: Measuring VLMs’ Answer Reliability via Task-Decomposition Consistency Comparison | Qian Yang, Weixiang Yan, Aishwarya Agrawal | N/A | N/A |
| Learn to Refuse: Making Large Language Models More Controllable and Reliable through Knowledge Scope Limitation and Refusal Mechanism | Lang Cao | N/A | N/A |
| VGBench: A Comprehensive Benchmark of Vector Graphics Understanding and Generation for Large Language Models | Bocheng Zou, Mu Cai, Jianrui Zhang, Yong Jae Lee | N/A | N/A |
| What do large language models need for machine translation evaluation? | Shenbin Qian, Archchana Sindhujan, Minnie Kabra, Diptesh Kanojia, Constantin Orasan, Tharindu Ranasinghe, Fred Blain | N/A | N/A |
| Performance-Guided LLM Knowledge Distillation for Efficient Text Classification at Scale | Flavio Di Palo, Prateek Singhi, Bilal H Fadlallah | N/A | N/A |
| External Knowledge-Driven Argument Mining: Leveraging Attention-Enhanced Multi-Network Models | Debela Gemechu, Chris Reed | N/A | N/A |
| C3PA: An Open Dataset of Expert-Annotated and Regulation-Aware Privacy Policies to Enable Scalable Regulatory Compliance Audits | Maaz Bin Musa, Rishab Nithyanand, Padmini Srinivasan, Mihailis E. Diamantis, Steven M. Winston, Garrison Allen, Jacob Schiller, Kevin Moore, Sean Quick, Johnathan Melvin | N/A | N/A |
| MPT: Multimodal Prompt Tuning for Zero-shot Instruction Learning | Taowen Wang, Yiyang Liu, James Chenhao Liang, Junhan Zhao, Yiming Cui, Yuning Mao, Shaoliang Nie, Jiahao Liu, Fuli Feng, Zenglin Xu, Cheng Han, Lifu Huang, Qifan Wang, Dongfang Liu | N/A | N/A |
| Text Grafting: Near-Distribution Weak Supervision for Minority Classes in Text Classification | Letian Peng, Yi Gu, Chengyu Dong, Zihan Wang, Jingbo Shang | N/A | N/A |
| Incubating Text Classifiers Following User Instruction with Nothing but LLM | Letian Peng, Zilong Wang, Jingbo Shang | N/A | N/A |
| PTD-SQL: Partitioning and Targeted Drilling with LLMs in Text-to-SQL | Ruilin Luo, Liyuan Wang, Binghuai Lin, Zicheng Lin, Yujiu Yang | N/A | N/A |
| Conditional and Modal Reasoning in Large Language Models | Wesley H. Holliday, Matthew Mandelkern, Cedegao E. Zhang | N/A | N/A |
| Advancing Large Language Model Attribution through Self-Improving | Lei Huang, Xiaocheng Feng, Weitao Ma, Liang Zhao, Yuchun Fan, Weihong Zhong, Dongliang Xu, Qing Yang, Hongtao Liu, Bing Qin | N/A | N/A |
| AlignCap: Aligning Speech Emotion Captioning to Human Preferences | Ziqi Liang, Haoxiang Shi, Hanhui Chen | N/A | N/A |
| Interpretability-based Tailored Knowledge Editing in Transformers | Yihuai Hong, Aldo Lipani | N/A | N/A |
| PRompt Optimization in Multi-Step Tasks (PROMST): Integrating Human Feedback and Heuristic-based Sampling | Yongchao Chen, Jacob Arkin, Yilun Hao, Yang Zhang, Nicholas Roy, Chuchu Fan | N/A | N/A |
| Empowering Large Language Model for Continual Video Question Answering with Collaborative Prompting | Chen Cai, Zheng Wang, Jianjun Gao, Wenyang Liu, Ye Lu, Runzhong Zhang, Kim-Hui Yap | N/A | N/A |
| Dissecting Fine-Tuning Unlearning in Large Language Models | Yihuai Hong, Yuelin Zou, Lijie Hu, Ziqian Zeng, Di Wang, Haiqin Yang | N/A | N/A |
| Dancing in Chains: Reconciling Instruction Following and Faithfulness in Language Models | Zhengxuan Wu, Yuhao Zhang, Peng Qi, Yumo Xu, Rujun Han, Yian Zhang, Jifan Chen, Bonan Min, Zhiheng Huang | N/A | N/A |
| Where is the signal in tokenization space? | Renato Geh, Honghua Zhang, Kareem Ahmed, Benjie Wang, Guy Van den Broeck | N/A | N/A |
| Private Language Models via Truncated Laplacian Mechanism | Tianhao Huang, Tao Yang, Ivan Habernal, Lijie Hu, Di Wang | N/A | N/A |
| Estimating Knowledge in Large Language Models Without Generating a Single Token | Daniela Gottesman, Mor Geva | N/A | N/A |
| Consistent Autoformalization for Constructing Mathematical Libraries | Lan Zhang, Xin Quan, Andre Freitas | N/A | N/A |
| Contextual and Parametric Knowledge: More Context, More Focus | Yufei Tao, Adam Hiatt, Erik Haake, Antonie J. Jetter, Ameeta Agrawal | N/A | N/A |
| Semantic Training Signals Promote Hierarchical Syntactic Generalization in Transformers | Aditya Yedetore, Najoung Kim | N/A | N/A |
| When Is Multilinguality a Curse? Language Modeling for 250 High- and Low-Resource Languages | Tyler A. Chang, Catherine Arnett, Zhuowen Tu, Ben Bergen | N/A | N/A |
| Teaching Embodied Reinforcement Learning Agents: Informativeness and Diversity of Language Use | Jiajun Xi, Yinong He, Jianing Yang, Yinpei Dai, Joyce Chai | N/A | N/A |
| MiTTenS: A Dataset for Evaluating Gender Mistranslation | Kevin Robinson, Sneha Kudugunta, Romina Stella, Sunipa Dev, Jasmijn Bastings | N/A | N/A |
| Teaching LLMs to Abstain across Languages via Multilingual Feedback | Shangbin Feng, Weijia Shi, Yike Wang, Wenxuan Ding, Orevaoghene Ahia, Shuyue Stella Li, Vidhisha Balachandran, Sunayana Sitaram, Yulia Tsvetkov | N/A | N/A |
| Modular Pluralism: Pluralistic Alignment via Multi-LLM Collaboration | Shangbin Feng, Taylor Sorensen, Yuhan Liu, Jillian Fisher, Chan Young Park, Yejin Choi, Yulia Tsvetkov | N/A | N/A |
| StyleRemix: Interpretable Authorship Obfuscation via Distillation and Perturbation of Style Elements | Jillian Fisher, Skyler Hallinan, Ximing Lu, Mitchell L Gordon, Zaid Harchaoui, Yejin Choi | N/A | N/A |
| I Could’ve Asked That: Reformulating Unanswerable Questions | Wenting Zhao, Ge Gao, Claire Cardie, Alexander M Rush | N/A | N/A |
| STOP! Benchmarking Large Language Models with Sensitivity Testing on Offensive Progressions | Robert Morabito, Sangmitra Madhusudan, Tyler McDonald, Ali Emami | N/A | N/A |
| Hidden Persuaders: How LLM Political Bias Could Sway Our Elections | Yujin Potter, Shiyang Lai, Junsol Kim, James Evans, Dawn Song | N/A | N/A |
| SOUL: Unlocking the Power of Second-Order Optimization for LLM Unlearning | Jinghan Jia, Yihua Zhang, Yimeng Zhang, Jiancheng Liu, Bharat Runwal, James Diffenderfer, Bhavya Kailkhura, Sijia Liu | N/A | N/A |
| When Reasoning Meets Information Aggregation: A Case Study with Sports Narratives | Yebowen Hu, Kaiqiang Song, Sangwoo Cho, Xiaoyang Wang, Wenlin Yao, Hassan Foroosh, Dong Yu, Fei Liu | N/A | N/A |
| An Analysis of Multilingual FActScore | Vu Trong Kim, Michael Krumdick, Varshini Reddy, Franck Dernoncourt, Viet Dac Lai | N/A | N/A |
| Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models | Seungone Kim, Juyoung Suk, Shayne Longpre, Bill Yuchen Lin, Jamin Shin, Sean Welleck, Graham Neubig, Moontae Lee, Kyungjae Lee, Minjoon Seo | N/A | N/A |
| RAG-QA Arena: Evaluating Domain Robustness for Long-form Retrieval Augmented Question Answering | Rujun Han, Yuhao Zhang, Peng Qi, Yumo Xu, Jenyuan Wang, Lan Liu, William Yang Wang, Bonan Min, Vittorio Castelli | N/A | N/A |
| PromptReps: Prompting Large Language Models to Generate Dense and Sparse Representations for Zero-Shot Document Retrieval | Shengyao Zhuang, Xueguang Ma, Bevan Koopman, Jimmy Lin, Guido Zuccon | N/A | N/A |
| Voices Unheard: NLP Resources and Models for Yorùbá Regional Dialects | Orevaoghene Ahia, Anuoluwapo Aremu, Diana Abagyan, Hila Gonen, David Ifeoluwa Adelani, Daud Abolade, Noah A. Smith, Yulia Tsvetkov | N/A | N/A |
| ARES: Alternating Reinforcement Learning and Supervised Fine-Tuning for Enhanced Multi-Modal Chain-of-Thought Reasoning Through Diverse AI Feedback | Ju-Seung Byun, Jiyun Chun, Jihyung Kil, Andrew Perrault | N/A | N/A |
| Order of Magnitude Speedups for LLM Membership Inference | Rongting Zhang, Martin Andres Bertran, Aaron Roth | N/A | N/A |
| VIMI: Grounding Video Generation through Multi-modal Instruction | Yuwei Fang, Willi Menapace, Aliaksandr Siarohin, Tsai-Shien Chen, Kuan-Chieh Wang, Ivan Skorokhodov, Graham Neubig, Sergey Tulyakov | N/A | N/A |
| F$^2$RL: Factuality and Faithfulness Reinforcement Learning Framework for Claim-Guided Evidence-Supported Counterspeech Generation | Haiyang Wang, Yuchen Pan, Xin Song, Xuechen Zhao, Minghao Hu, Bin Zhou | N/A | N/A |
| Deciphering Rumors: A Multi-Task Learning Approach with Intent-aware Hierarchical Contrastive Learning | Chang Yang, Peng Zhang, Hui Gao, Jing Zhang | N/A | N/A |
| Visual Prompting in LLMs for Enhancing Emotion Recognition | Qixuan Zhang, Zhifeng Wang, Dylan Zhang, Yang Liu, Zhenyue Qin, Wenjia Niu, Sabrina Caldwell, Tom Gedeon | N/A | N/A |
| IDEAW: Robust Neural Audio Watermarking with Invertible Dual-Embedding | Pengcheng Li, Xulong Zhang, Jing Xiao, Jianzong Wang | N/A | N/A |
| Leveraging Conflicts in Social Media Posts: Unintended Offense Dataset | Che Wei Tsai, Yen-Hao Huang, Tsu-keng Liao, Didier Fernando Salazar Estrada, Retnani Latifah, Yi-Shin Chen | N/A | N/A |
| Outcome-Constrained Large Language Models for Countering Hate Speech | Lingzi Hong, Pengcheng Luo, Eduardo Blanco, Xiaoying Song | N/A | N/A |
| Multiple Sources are Better Than One: Incorporating External Knowledge in Low-Resource Glossing | Changbing Yang, Garrett Nicolai, Miikka Silfverberg | N/A | N/A |
| Adaptive Immune-based Sound-Shape Code Substitution for Adversarial Chinese Text Attacks | Ao Wang, Xinghao Yang, Chen Li, Bao-di Liu, Weifeng Liu | N/A | N/A |
| Bootstrapped Policy Learning for Task-oriented Dialogue through Goal Shaping | Yangyang Zhao, Ben Niu, Mehdi Dastani, Shihan Wang | N/A | N/A |
| PsyGUARD: An Automated System for Suicide Detection and Risk Assessment in Psychological Counseling | Huachuan Qiu, Lizhi Ma, Zhenzhong Lan | N/A | N/A |
| World to Code: Multi-modal Data Generation via Self-Instructed Compositional Captioning and Filtering | Jiacong Wang, Bohong Wu, Haiyong Jiang, Haoyuan Guo, Xin Xiao, Zhou Xun, Jun Xiao | N/A | N/A |
| DVD: Dynamic Contrastive Decoding for Knowledge Amplification in Multi-Document Question Answering | Jing Jin, Houfeng Wang, Hao Zhang, Xiaoguang Li, Zhijiang Guo | N/A | N/A |
| How Do Humans Write Code? Large Models Do It the Same Way Too | Long Li, Xuzheng He, Haozhe Wang, Linlin Wang, Liang He | N/A | N/A |
| Retrospex: Language Agent Meets Offline Reinforcement Learning Critic | Yufei Xiang, Yiqun Shen, Yeqin Zhang, Nguyen Cam-Tu | N/A | N/A |
| Forgetting Curve: A Reliable Method for Evaluating Memorization Capability for Long-Context Models | Xinyu Liu, Runsong Zhao, Pengcheng Huang, Chunyang Xiao, Bei Li, Jingang Wang, Tong Xiao, Jingbo Zhu | N/A | N/A |
| Retrieve-Plan-Generation: An Iterative Planning and Answering Framework for Knowledge-Intensive LLM Generation | Yuanjie Lyu, Zihan Niu, Zheyong Xie, Chao Zhang, Tong Xu, Yang Wang, Enhong Chen | N/A | N/A |
| CoEvol: Constructing Better Responses for Instruction Finetuning through Multi-Agent Cooperation | Renhao Li, Minghuan Tan, Derek F. Wong, Min Yang | N/A | N/A |
| A Peek into Token Bias: Large Language Models Are Not Yet Genuine Reasoners | Bowen Jiang, Yangxinyu Xie, Zhuoqun Hao, Xiaomeng Wang, Tanwi Mallick, Weijie J Su, Camillo Jose Taylor, Dan Roth | N/A | N/A |
| Bayesian Calibration of Win Rate Estimation with LLM Evaluators | Yicheng Gao, Gonghan Xu, Zhe Wang, Arman Cohan | N/A | N/A |
| MuMath-Code: Combining Tool-Use Large Language Models with Multi-perspective Data Augmentation for Mathematical Reasoning | Shuo Yin, Weihao You, Zhilong Ji, Guoqiang Zhong, Jinfeng Bai | N/A | N/A |
| Seeing the Forest through the Trees: Data Leakage from Partial Transformer Gradients | Weijun Li, Qiongkai Xu, Mark Dras | N/A | N/A |
| RWKV-CLIP: A Robust Vision-Language Representation Learner | Tiancheng Gu, Kaicheng Yang, Xiang An, Ziyong Feng, Dongnan Liu, Weidong Cai, Jiankang Deng | N/A | N/A |
| KidLM: Advancing Language Models for Children – Early Insights and Future Directions | Mir Tafseer Nayeem, Davood Rafiei | N/A | N/A |
| Using Language Models to Disambiguate Lexical Choices in Translation | Josh Barua, Sanjay Subramanian, Kayo Yin, Alane Suhr | N/A | N/A |
| How Does the Disclosure of AI Assistance Affect the Perceptions of Writing? | Zhuoyan Li, Chen Liang, Jing Peng, Ming Yin | N/A | N/A |
| An Unsupervised Approach to Achieve Supervised-Level Explainability in Healthcare Records | Joakim Edin, Maria Maistro, Lars Maaløe, Lasse Borgholt, Jakob Drachmann Havtorn, Tuukka Ruotsalo | N/A | N/A |
| Crafting Personalized Agents through Retrieval-Augmented Generation on Editable Memory Graphs | Zheng Wang, Zhongyang Li, Jiang Zeren, Dandan Tu, Wei Shi | N/A | N/A |
| EVEDIT: Event-based Knowledge Editing for Deterministic Knowledge Propagation | Jiateng Liu, Pengfei Yu, Yuji Zhang, Sha Li, Zixuan Zhang, Ruhi Sarikaya, Kevin Small, Heng Ji | N/A | N/A |
| Predicting Nonnative Sentence Processing with L2LMs | Tatsuya Aoyama, Nathan Schneider | N/A | N/A |
| From the Least to the Most: Building a Plug-and-Play Visual Reasoner via Data Synthesis | Chuanqi Cheng, Jian Guan, Wei Wu, Rui Yan | N/A | N/A |
| Quality Matters: Evaluating Synthetic Data for Tool-Using LLMs | Shadi Iskander, Sofia Tolmach, Ori Shapira, Nachshon Cohen, Zohar Karnin | N/A | N/A |
| Cross-Domain Audio Deepfake Detection: Dataset and Analysis | Yuang Li, Min Zhang, Mengxin Ren, Xiaosong Qiao, Miaomiao Ma, Daimeng Wei, Hao Yang | N/A | N/A |
| MaPPER: Multimodal Prior-guided Parameter Efficient Tuning for Referring Expression Comprehension | Ting Liu, Zunnan Xu, Zhiqiang Wang, Yue Hu, Liangtao Shi, Quanjun Yin | N/A | N/A |
| Investigating How Large Language Models Leverage Internal Knowledge to Perform Complex Reasoning | Miyoung Ko, Sue Hyun Park, Joonsuk Park, Minjoon Seo | N/A | N/A |
| Aligning Translation-Specific Understanding to General Understanding in Large Language Models | Yichong Huang, Baohang Li, Xiaocheng Feng, Wenshuai Huo, Chengpeng Fu, Ting Liu, Bing Qin | N/A | N/A |
| FOOL ME IF YOU CAN! An Adversarial Dataset to Investigate the Robustness of LMs in Word Sense Disambiguation | Mohamad Ballout, Anne Dedert, Nohayr Muhammad Abdelmoneim, Ulf Krumnack, Gunther Heidemann, Kai-Uwe Kühnberger | N/A | N/A |
| Concept-skill Transferability-based Data Selection for Large Vision-Language Models | Jaewoo Lee, Boyang Li, Sung Ju Hwang | N/A | N/A |
| LLMs Assist NLP Researchers: Critique Paper (Meta-)Reviewing | Jiangshu Du, Yibo Wang, Wenting Zhao, Zhongfen Deng, Shuaiqi LIU, Renze Lou, Henry Peng Zou, Pranav Narayanan Venkit, Nan Zhang, Mukund Srinath, Haoran Ranran Zhang, Vipul Gupta, Yinghui Li, Tao Li, Fei Wang, Qin Liu, Tianlin Liu, Pengzhi Gao, Congying Xia, Chen Xing, Cheng Jiayang, Zhaowei Wang, Ying Su, Raj Sanjay Shah, Ruohao Guo, Jing Gu, Haoran Li, Kangda Wei, Zihao Wang, Lu Cheng, Surangika Ranathunga, Meng Fang, Jie Fu, Fei Liu, Ruihong Huang, Eduardo Blanco, Yixin Cao, Rui Zhang, Philip S. Yu, Wenpeng Yin | N/A | N/A |
| Academics Can Contribute to Domain-Specialized Language Models | Mark Dredze, Genta Indra Winata, Prabhanjan Kambadur, Shijie Wu, Ozan Irsoy, Steven Lu, Vadim Dabravolski, David S Rosenberg, Sebastian Gehrmann | N/A | N/A |
| Beyond Reference: Evaluating High Quality Translations Better than Human References | Keonwoong Noh, Seokjin Oh, Woohwan Jung | N/A | N/A |
| Unveiling the Lexical Sensitivity of LLMs: Combinatorial Optimization for Prompt Enhancement | Pengwei Zhan, Zhen Xu, Qian Tan, Jie Song, Ru Xie | N/A | N/A |
| SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages | Holy Lovenia, Rahmad Mahendra, Salsabil Maulana Akbar, Lester James Validad Miranda, Jennifer Santoso, Elyanah Aco, Akhdan Fadhilah, Jonibek Mansurov, Joseph Marvin Imperial, Onno P. Kampman, Joel Ruben Antony Moniz, Muhammad Ravi Shulthan Habibi, Frederikus Hudi, Jann Railey Montalan, Ryan Ignatius Hadiwijaya, Joanito Agili Lopo, William Nixon, Börje F. Karlsson, James Jaya, Ryandito Diandaru, Yuze GAO, Patrick Amadeus Irawan, Bin Wang, Jan Christian Blaise Cruz, Chenxi Whitehouse, Ivan Halim Parmonangan, Maria Khelli, Wenyu Zhang, Lucky Susanto, Reynard Adha Ryanda, Sonny Lazuardi Hermawan, Dan John Velasco, Muhammad Dehan Al Kautsar, Willy Fitra Hendria, Yasmin Moslem, Noah Flynn, Muhammad Farid Adilazuarda, Haochen Li, Johanes Lee, R. Damanhuri, Shuo Sun, Muhammad Reza Qorib, Amirbek Djanibekov, Wei Qi Leong, Quyet V. Do, Niklas Muennighoff, Tanrada Pansuwan, Ilham Firdausi Putra, Yan Xu, Tai Ngee Chia, Ayu Purwarianti, Sebastian Ruder, William Chandra Tjhi, Peerat Limkonchotiwat, Alham Fikri Aji, Sedrick Keh, Genta Indra Winata, Ruochen Zhang, Fajri Koto, Zheng Xin Yong, Samuel Cahyawijaya | N/A | N/A |
| Induct-Learn: Short Phrase Prompting with Instruction Induction | Po-Chun Chen, Sheng-Lun Wei, Hen-Hsen Huang, Hsin-Hsi Chen | N/A | N/A |
| Multi-Granularity History and Entity Similarity Learning for Temporal Knowledge Graph Reasoning | Shi Mingcong, Chunjiang Zhu, Detian Zhang, Shiting Wen, Qing Li | N/A | N/A |
| LUQ: Long-text Uncertainty Quantification for LLMs | Caiqi Zhang, Fangyu Liu, Marco Basaldella, Nigel Collier | N/A | N/A |
| Pretraining Data Detection for Large Language Models: A Divergence-based Calibration Method | Weichao Zhang, Ruqing Zhang, Jiafeng Guo, Maarten de Rijke, Yixing Fan, Xueqi Cheng | N/A | N/A |
| Scaling Synthetic Logical Reasoning Datasets with Context-Sensitive Declarative Grammars | Damien Sileo | N/A | N/A |
| Improving Spoken Language Modeling with Phoneme Classification: A Simple Fine-tuning Approach | Maxime Poli, Emmanuel Chemla, Emmanuel Dupoux | N/A | N/A |
| Safely Learning with Private Data: A Federated Learning Framework for Large Language Model | Jia-Ying Zheng, Hainan Zhang, Lingxiang Wang, Wangjie Qiu, Hong-Wei Zheng, Zhi-Ming Zheng | N/A | N/A |
| Formality Favored: Unraveling the Learning Preferences of Large Language Models on Data with Conflicting Knowledge | Jiahuan Li, Yiqing Cao, Shujian Huang, Jiajun Chen | N/A | N/A |
| How Does the Textual Information Affect the Retrieval of Multimodal In-Context Learning? | Yang Luo, Zangwei Zheng, Zirui Zhu, Yang You | N/A | N/A |
| How Far Can We Extract Diverse Perspectives from Large Language Models? | Shirley Anugrah Hayati, Minhwa Lee, Dheeraj Rajagopal, Dongyeop Kang | N/A | N/A |
| EXPLORA: Efficient Exemplar Subset Selection for Complex Reasoning | Kiran Purohit, Venktesh V, Raghuram Devalla, Krishna Mohan Yerragorla, Sourangshu Bhattacharya, Avishek Anand | N/A | N/A |
| An LLM Feature-based Framework for Dialogue Constructiveness Assessment | Lexin Zhou, Youmna Farag, Andreas Vlachos | N/A | N/A |
| Relevance Is a Guiding Light: Relevance-aware Adaptive Learning for End-to-end Task-oriented Dialogue System | Zhanpeng Chen, Zhihong Zhu, Wanshi Xu, Xianwei Zhuang, Yuexian Zou | N/A | N/A |
| Dialog2Flow: Pre-training Action-Driven Sentence Embeddings for Automatic Dialog Flow Extraction | Sergio Burdisso, Srikanth Madikeri, Petr Motlicek | N/A | N/A |
| Words Worth a Thousand Pictures: Measuring and Understanding Perceptual Variability in Text-to-Image Generation | Raphael Tang, Crystina Zhang, Lixinyu Xu, Yao Lu, Wenyan Li, Pontus Stenetorp, Jimmy Lin, Ferhan Ture | N/A | N/A |
| Investigating LLMs as Voting Assistants via Contextual Augmentation: A Case Study on the European Parliament Elections 2024 | Ilias Chalkidis | N/A | N/A |
| Adaption-of-Thought: Learning Question Difficulty Improves Large Language Models for Reasoning | Mayi Xu, Yongqi Li, Ke Sun, Tieyun Qian | N/A | N/A |
| LogicST: A Logical Self-Training Framework for Document-Level Relation Extraction with Incomplete Annotations | Shengda Fan, Yanting Wang, Shasha Mo, Jianwei Niu | N/A | N/A |
| Concept Space Alignment in Multilingual LLMs | Qiwei Peng, Anders Søgaard | N/A | N/A |
| Predicting Rewards Alongside Tokens: Non-disruptive Parameter Insertion for Efficient Inference Intervention in Large Language Model | Chenhan Yuan, Fei Huang, Ru Peng, Keming Lu, Bowen Yu, Chang Zhou, Jingren Zhou | N/A | N/A |
| NLEBench+NorGLM: A Comprehensive Empirical Analysis and Benchmark Dataset for Generative Language Models in Norwegian | Peng Liu, Lemei Zhang, Terje Farup, Even W. Lauvrak, Jon Espen Ingvaldsen, Simen Eide, Jon Atle Gulla, Zhirong Yang | N/A | N/A |
| RSA-Control: A Pragmatics-Grounded Lightweight Controllable Text Generation Framework | Yifan Wang, Vera Demberg | N/A | N/A |
| Scaling Laws Across Model Architectures: A Comparative Analysis of Dense and MoE Models in Large Language Models | Siqi Wang, Zhengyu Chen, Bei Li, Keqing He, Min Zhang, Jingang Wang | N/A | N/A |
| Synergizing In-context Learning with Hints for End-to-end Task-oriented Dialog Systems | Vishal Vivek Saley, Rocktim Jyoti Das, Dinesh Raghu, Mausam | N/A | N/A |
| REAR: A Relevance-Aware Retrieval-Augmented Framework for Open-Domain Question Answering | Yuhao Wang, Ruiyang Ren, Junyi Li, Xin Zhao, Jing Liu, Ji-Rong Wen | N/A | N/A |
| Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA | Minzheng Wang, Longze Chen, Cheng Fu, Liao Shengyi, Xinghua Zhang, Bingli Wu, Haiyang Yu, Nan Xu, Lei Zhang, Run Luo, Yunshui Li, Min Yang, Fei Huang, Yongbin Li | N/A | N/A |
| On Mitigating Performance Disparities in Multilingual Speech Recognition | Monorama Swain, Anna Katrine van Zee, Anders Søgaard | N/A | N/A |
| Thinking Outside of the Differential Privacy Box: A Case Study in Text Privatization with Language Model Prompting | Stephen Meisenbacher, Florian Matthes | N/A | N/A |
| From Coarse to Fine: Impacts of Feature-Preserving and Feature-Compressing Connectors on Perception in Multimodal Models | Junyan Lin, Haoran Chen, Dawei Zhu, Xiaoyu Shen | N/A | N/A |
| What is “Typological Diversity” in NLP? | Esther Ploeger, Wessel Poelman, Miryam de Lhoneux, Johannes Bjerva | N/A | N/A |
| The Computational Anatomy of Humility: Modeling Intellectual Humility in Online Public Discourse | Xiaobo Guo, Neil Potnis, Melody Yu, Nabeel Gillani, Soroush Vosoughi | N/A | N/A |
| Consistent Bidirectional Language Modelling: Expressive Power and Representational Conciseness | Georgi Shopov, Stefan Gerdjikov | N/A | N/A |
| Benchmarking Vision Language Models for Cultural Understanding | Shravan Nayak, Kanishk Jain, Rabiul Awal, Siva Reddy, Sjoerd van Steenkiste, Lisa Anne Hendricks, Karolina Stanczak, Aishwarya Agrawal | N/A | N/A |
| Methods of Automatic Matrix Language Determination for Code-Switched Speech | Olga Iakovenko, Thomas Hain | N/A | N/A |
| Analyzing Key Factors Influencing Emotion Prediction Performance of VLLMs in Conversational Contexts | Jaewook Lee, Yeajin Jang, Hongjin KIM, Woojin Lee, Harksoo Kim | N/A | N/A |
| Context-Aware Assistant Selection for Improved Inference Acceleration with Large Language Models | Jerry Huang, Prasanna Parthasarathi, Mehdi Rezagholizadeh, Sarath Chandar | N/A | N/A |
| Teaching Small Language Models Reasoning through Counterfactual Distillation | Feng Tao, Yicheng Li, Li Chenglin, Hao Chen, Fei Yu, Yin Zhang | N/A | N/A |
| Do Not Worry if You Do Not Have Data: Building Pretrained Language Models Using Translationese | Meet Doshi, Raj Dabre, Pushpak Bhattacharyya | N/A | N/A |
| Quantifying the Gap Between Machine Translation and Native Language in Training for Multimodal, Multilingual Retrieval | Kyle Buettner, Adriana Kovashka | N/A | N/A |
| MTA4DPR: Multi-Teaching-Assistants Based Iterative Knowledge Distillation for Dense Passage Retrieval | Qixi Lu, Gongbo Tang | N/A | N/A |
| Fine-Grained Detection of Solidarity for Women and Migrants in 155 Years of German Parliamentary Debates | Aida Kostikova, Dominik Beese, Benjamin Paassen, Ole Pütz, Gregor Wiedemann, Steffen Eger | N/A | N/A |
| CItruS: Chunked Instruction-aware State Eviction for Long Sequence Modeling | Yu Bai, Xiyuan Zou, Heyan Huang, Sanxing Chen, Marc-Antoine Rondeau, Yang Gao, Jackie CK Cheung | N/A | N/A |
| Story Embeddings — Narrative-Focused Representations of Fictional Stories | Hans Ole Hatzel, Chris Biemann | N/A | N/A |
| C-LLM: Learn to Check Chinese Spelling Errors Character by Character | Kunting Li, Yong Hu, Liang He, Fandong Meng, Jie Zhou | N/A | N/A |
| PSC: Extending Context Window of Large Language Models via Phase Shift Calibration | Wenqiao Zhu, Chao Xu, Lulu Wang, Jun Wu | N/A | N/A |
| Video-LLaVA: Learning United Visual Representation by Alignment Before Projection | Bin Lin, Yang Ye, Bin Zhu, Jiaxi Cui, Munan Ning, Peng Jin, Li Yuan | N/A | N/A |
| SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales | Tianyang Xu, Shujin Wu, Shizhe Diao, Xiaoze Liu, Xingyao Wang, Yangyi Chen, Jing Gao | N/A | N/A |
| Mitigating Frequency Bias and Anisotropy in Language Model Pre-Training with Syntactic Smoothing | Richard Diehl Martinez, Zebulon Goriely, Andrew Caines, Paula Buttery, Lisa Beinborn | N/A | N/A |
| ToxiCloakCN: Evaluating Robustness of Offensive Language Detection in Chinese with Cloaking Perturbations | Yunze Xiao, Yujia Hu, Kenny Tsu Wei Choo, Roy Ka-Wei Lee | N/A | N/A |
| Boosting Scientific Concepts Understanding: Can Analogies from Teacher Models Empower Student Models? | Siyu Yuan, Cheng Jiayang, Lin Qiu, Deqing Yang | N/A | N/A |
| Model Internals-based Answer Attribution for Trustworthy Retrieval-Augmented Generation | Jirui Qi, Gabriele Sarti, Raquel Fernández, Arianna Bisazza | N/A | N/A |
| Do Large Language Models Know How Much They Know? | Gabriele Prato, Jerry Huang, Prasanna Parthasarathi, Shagun Sodhani, Sarath Chandar | N/A | N/A |
| Investigating Mysteries of CoT-Augmented Distillation | Somin Wadhwa, Silvio Amir, Byron C Wallace | N/A | N/A |
| SciPrompt: Knowledge-Augmented Prompting for Fine-Grained Categorization of Scientific Topics | Zhiwen You, Kanyao Han, Haotian Zhu, Bertram Ludaescher, Jana Diesner | N/A | N/A |
| Distilling Knowledge from Text-to-Image Generative Models Improves Visio-Linguistic Reasoning in CLIP | Samyadeep Basu, Shell Xu Hu, Maziar Sanjabi, Daniela Massiceti, Soheil Feizi | N/A | N/A |
| Learning from Natural Language Explanations for Generalizable Entity Matching | Somin Wadhwa, Adit Krishnan, Runhui Wang, Byron C Wallace, Luyang Kong | N/A | N/A |
| Do You Know What You Are Talking About? Characterizing Query-Knowledge Relevance For Reliable Retrieval Augmented Generation | Zhuohang Li, Jiaxin Zhang, Chao Yan, Kamalika Das, Sricharan Kumar, Murat Kantarcioglu, Bradley A. Malin | N/A | N/A |
| On the Reliability of Psychological Scales on Large Language Models | Jen-tse Huang, Wenxuan Wang, Man Ho LAM, Eric John Li, Wenxiang Jiao, Michael Lyu | N/A | N/A |
| Contrastive Entity Coreference and Disambiguation for Historical Texts | Abhishek Arora, Emily Silcock, Melissa Dell, Leander Heldring | N/A | N/A |
| Finer: Investigating and Enhancing Fine-Grained Visual Concept Recognition in Large Vision Language Models | Jeonghwan Kim, Heng Ji | N/A | N/A |
| Evaluating LLMs for Targeted Concept Simplification for Domain-Specific Texts | Sumit Asthana, Hannah Rashkin, Elizabeth Clark, Fantine Huot, Mirella Lapata | N/A | N/A |
| VLFeedback: A Large-Scale AI Feedback Dataset for Large Vision-Language Models Alignment | Lei Li, Zhihui Xie, Mukai Li, Shunian Chen, Peiyi Wang, Liang Chen, Yazheng Yang, Benyou Wang, Lingpeng Kong, Qi Liu | N/A | N/A |
| Focused Large Language Models are Stable Many-Shot Learners | Peiwen Yuan, Shaoxiong Feng, Yiwei Li, Xinglin Wang, Yueqi Zhang, Chuyi Tan, Boyuan Pan, Heda Wang, Yao Hu, Kan Li | N/A | N/A |
| Reconsidering Sentence-Level Sign Language Translation | Garrett Tanzer, Maximus Shengelia, Ken Harrenstien, David Uthus | N/A | N/A |
| GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities | Sreyan Ghosh, Sonal Kumar, Ashish Seth, Chandra Kiran Reddy Evuru, Utkarsh Tyagi, S Sakshi, Oriol Nieto, Ramani Duraiswami, Dinesh Manocha | N/A | N/A |
| Verba volant, scripta volant? Don’t worry! There are computational solutions for protoword reconstruction | Liviu P Dinu, Ana Sabina Uban, Alina Maria Cristea, Ioan-Bogdan Iordache, Teodor-George Marchitan, Simona Georgescu, Laurentiu Zoicas | N/A | N/A |
| ChatGPT Doesn’t Trust LA Chargers Fans: Guardrail Sensitivity in Context | Victoria R Li, Yida Chen, Naomi Saphra | N/A | N/A |
| Personas as a Way to Model Truthfulness in Language Models | Nitish Joshi, Javier Rando, Abulhair Saparov, Najoung Kim, He He | N/A | N/A |
| Satyrn: A Platform for Analytics Augmented Generation | Marko Sterbentz, Cameron Barrie, Shubham Shahi, Abhratanu Dutta, Donna Hooshmand, Harper Pack, Kristian J Hammond | N/A | N/A |
| EH-MAM: Easy-to-Hard Masked Acoustic Modeling for Self-Supervised Speech Representation Learning | Ashish Seth, Ramaneswaran S, S Sakshi, Sonal Kumar, Sreyan Ghosh, Dinesh Manocha | N/A | N/A |
| EPO: Hierarchical LLM Agents with Environment Preference Optimization | Qi Zhao, Haotian Fu, Chen Sun, George Konidaris | N/A | N/A |
| Detection and Measurement of Syntactic Templates in Generated Text | Chantal Shaib, Yanai Elazar, Junyi Jessy Li, Byron C Wallace | N/A | N/A |
| UOUO: Uncontextualized Uncommon Objects for Measuring Knowledge Horizons of Vision Language Models | Xinyu Pi, Mingyuan Wu, Jize Jiang, Haozhen Zheng, Beitong Tian, ChengXiang Zhai, Klara Nahrstedt, Zhiting Hu | N/A | N/A |
| Optimized Speculative Sampling for GPU Hardware Accelerators | Dominik Wagner, Seanie Lee, Ilja Baumann, Philipp Seeberger, Korbinian Riedhammer, Tobias Bocklet | N/A | N/A |
| Personalized Pieces: Efficient Personalized Large Language Models through Collaborative Efforts | Zhaoxuan Tan, Zheyuan Liu, Meng Jiang | N/A | N/A |
| Democratizing Large Language Models via Personalized Parameter-Efficient Fine-tuning | Zhaoxuan Tan, Qingkai Zeng, Yijun Tian, Zheyuan Liu, Bing Yin, Meng Jiang | N/A | N/A |
| Unifying Multimodal Retrieval via Document Screenshot Embedding | Xueguang Ma, Sheng-Chieh Lin, Minghan Li, Wenhu Chen, Jimmy Lin | N/A | N/A |
| Neuron Specialization: Leveraging Intrinsic Task Modularity for Multilingual Machine Translation | Shaomu Tan, Di Wu, Christof Monz | N/A | N/A |
| An Audit on the Perspectives and Challenges of Hallucinations in NLP | Pranav Narayanan Venkit, Tatiana Chakravorti, Vipul Gupta, Heidi Biggs, Mukund Srinath, Koustava Goswami, Sarah Rajtmajer, Shomir Wilson | N/A | N/A |
| Discovering Knowledge-Critical Subnetworks in Pretrained Language Models | Deniz Bayazit, Negar Foroutan, Zeming Chen, Gail Weiss, Antoine Bosselut | N/A | N/A |
| Reconstruct Your Previous Conversations! Comprehensively Investigating Privacy Leakage Risks in Conversations with GPT Models | Junjie Chu, Zeyang Sha, Michael Backes, Yang Zhang | N/A | N/A |
| Right for Right Reasons: Large Language Models for Verifiable Commonsense Knowledge Graph Question Answering | Armin Toroghi, Willis Guo, Mohammad Mahdi Abdollah Pour, Scott Sanner | N/A | N/A |
| Verifiable, Debuggable, and Repairable Commonsense Logical Reasoning via LLM-based Theory Resolution | Armin Toroghi, Willis Guo, Ali Pesaranghader, Scott Sanner | N/A | N/A |
| Understanding and Mitigating Language Confusion in LLMs | Kelly Marchisio, Wei-Yin Ko, Alexandre Berard, Théo Dehaze, Sebastian Ruder | N/A | N/A |
| Can Large Language Models Learn Independent Causal Mechanisms? | Gael Gendron, Bao Trung Nguyen, Alex Yuxuan Peng, Michael Witbrock, Gillian Dobbie | N/A | N/A |
| MirrorStories: Reflecting Diversity through Personalized Narrative Generation with Large Language Models | Sarfaroz Yunusov, Hamza Sidat, Ali Emami | N/A | N/A |
| InterIntent: Investigating Social Intelligence of LLMs via Intention Understanding in an Interactive Game Context | Ziyi Liu, Abhishek Anand, Pei Zhou, Jen-tse Huang, Jieyu Zhao | N/A | N/A |
| Locating Information Gaps and Narrative Inconsistencies Across Languages: A Case Study of LGBT People Portrayals on Wikipedia | Farhan Samir, Chan Young Park, Vered Shwartz, Anjalie Field, Yulia Tsvetkov | N/A | N/A |
| From Local Concepts to Universals: Evaluating the Multicultural Understanding of Vision-Language Models | Mehar Bhatia, Sahithya Ravi, Aditya Chinchure, EunJeong Hwang, Vered Shwartz | N/A | N/A |
| Dynamic Multi-Reward Weighting for Multi-Style Controllable Generation | Karin De Langis, Ryan Koo, Dongyeop Kang | N/A | N/A |
| MMNeuron: Discovering Neuron-Level Domain-Specific Interpretation in Multimodal Large Language Model | Jiahao Huo, Yibo Yan, Boren Hu, Yutao Yue, Xuming Hu | N/A | N/A |
| Learning to Extract Structured Entities Using Language Models | Haolun Wu, Ye Yuan, Liana Mikaelyan, Alexander Meulemans, Xue Liu, James Hensman, Bhaskar Mitra | N/A | N/A |
| Efficient LLM Comparative Assessment: A Product of Experts Framework for Pairwise Comparisons | Adian Liusie, Vatsal Raina, Yassir Fathullah, Mark Gales | N/A | N/A |
| A Survey of AMR Applications | Shira Wein, Juri Opitz | N/A | N/A |
| Beyond Embeddings: The Promise of Visual Table in Visual Reasoning | Yiwu Zhong, Zi-Yuan Hu, Michael Lyu, Liwei Wang | N/A | N/A |
| CareCorpus+: Expanding and Augmenting Caregiver Strategy Data to Support Pediatric Rehabilitation | Shahla Farzana, Ivana Lucero, Vivian Villegas, Vera C Kaelin, Mary Khetani, Natalie Parde | N/A | N/A |
| Secured Weight Release for Large Language Models via Taylor Expansion | Guanchu Wang, Yu-Neng Chuang, Ruixiang Tang, Shaochen Zhong, Jiayi Yuan, Hongye Jin, Zirui Liu, Vipin Chaudhary, Shuai Xu, James Caverlee, Xia Hu | N/A | N/A |
| TimeR⁴: Time-aware Retrieval-Augmented Large Language Models for Temporal Knowledge Graph Question Answering | Xinying Qian, Ying Zhang, Yu Zhao, Baohang Zhou, Xuhui Sui, Li Zhang, Kehui Song | N/A | N/A |
| Knowledge-Centric Hallucination Detection | Xiangkun Hu, Dongyu Ru, Lin Qiu, Qipeng Guo, Tianhang Zhang, Yang Xu, Yun Luo, Pengfei Liu, Yue Zhang, Zheng Zhang | N/A | N/A |
| Revealing the Parallel Multilingual Learning within Large Language Models | Yongyu Mu, Peinan Feng, Zhiquan Cao, Yuzhang Wu, Bei Li, Chenglong Wang, Tong Xiao, Kai Song, Tongran Liu, Chunliang Zhang, JingBo Zhu | N/A | N/A |
| Automatic Instruction Evolving for Large Language Models | Weihao Zeng, Can Xu, Yingxiu Zhao, Jian-Guang Lou, Weizhu Chen | N/A | N/A |
| RepEval: Effective Text Evaluation with LLM Representation | Shuqian Sheng, Yi Xu, Tianhang Zhang, Zanwei Shen, Luoyi Fu, Jiaxin Ding, Lei Zhou, Xiaoying Gan, Xinbing Wang, Chenghu Zhou | N/A | N/A |
| Generative Models for Automatic Medical Decision Rule Extraction from Text | Yuxin He, Buzhou Tang, Xiaoling Wang | N/A | N/A |
| Encoding and Controlling Global Semantics for Long-form Video Question Answering | Thong Thanh Nguyen, Zhiyuan Hu, Xiaobao Wu, Cong-Duy T Nguyen, See-Kiong Ng, Anh Tuan Luu | N/A | N/A |
| Towards Understanding Jailbreak Attacks in LLMs: A Representation Space Analysis | Yuping Lin, Pengfei He, Han Xu, Yue Xing, Makoto Yamada, Hui Liu, Jiliang Tang | N/A | N/A |
| Enhancing Legal Case Retrieval via Scaling High-quality Synthetic Query-Candidate Pairs | Cheng Gao, Chaojun Xiao, Zhenghao Liu, Huimin Chen, Zhiyuan Liu, Maosong Sun | N/A | N/A |
| Does Large Language Model Contain Task-Specific Neurons? | Ran Song, Shizhu He, Shuting Jiang, Yantuan Xian, Shengxiang Gao, Kang Liu, Zhengtao Yu | N/A | N/A |
| Liar, Liar, Logical Mire: A Benchmark for Suppositional Reasoning in Large Language Models | Philipp Mondorf, Barbara Plank | N/A | N/A |
| Advancing Test-Time Adaptation in Wild Acoustic Test Settings | Hongfu Liu, Hengguan Huang, Ye Wang | N/A | N/A |
| Learning to Retrieve Iteratively for In-Context Learning | Yunmo Chen, Tongfei Chen, Harsh Jhamtani, Patrick Xia, Richard Shin, Jason Eisner, Benjamin Van Durme | N/A | N/A |
| Taxonomy-guided Semantic Indexing for Academic Paper Search | SeongKu Kang, Yunyi Zhang, Pengcheng Jiang, Dongha Lee, Jiawei Han, Hwanjo Yu | N/A | N/A |
| Python is Not Always the Best Choice: Embracing Multilingual Program of Thoughts | Xianzhen Luo, Qingfu Zhu, Zhiming Zhang, Libo Qin, Xuanyu Zhang, Qing Yang, Dongliang Xu, Wanxiang Che | N/A | N/A |
| Advancing Adversarial Suffix Transfer Learning on Aligned Large Language Models | Hongfu Liu, Yuxi Xie, Ye Wang, Michael Shieh | N/A | N/A |
| Incomplete Utterance Rewriting with Editing Operation Guidance and Utterance Augmentation | Zhiyu Cao, Peifeng Li, Yaxin Fan, Qiaoming Zhu | N/A | N/A |
| FRoG: Evaluating Fuzzy Reasoning of Generalized Quantifiers in LLMs | Yiyuan Li, Shichao Sun, Pengfei Liu | N/A | N/A |
| Aligning Large Language Models with Diverse Political Viewpoints | Dominik Stammbach, Philine Widmer, Eunjung Cho, Caglar Gulcehre, Elliott Ash | N/A | N/A |
| “You Gotta be a Doctor, Lin”: An Investigation of Name-Based Bias of Large Language Models in Employment Recommendations | Huy Nghiem, John Prindle, Jieyu Zhao, Hal Daumé III | N/A | N/A |
| Extending Context Window of Large Language Models from a Distributional Perspective | Yingsheng Wu, Yuxuan Gu, Xiaocheng Feng, Weihong Zhong, Dongliang Xu, Qing Yang, Hongtao Liu, Bing Qin | N/A | N/A |
| Leveraging pre-trained language models for linguistic analysis: A case of argument structure constructions | Hakyung Sung, Kristopher Kyle | N/A | N/A |
| MAgIC: Investigation of Large Language Model Powered Multi-Agent in Cognition, Adaptability, Rationality and Collaboration | Lin Xu, Zhiyuan Hu, Daquan Zhou, Hongyu Ren, Zhen Dong, Kurt Keutzer, See-Kiong Ng, Jiashi Feng | N/A | N/A |
| Position Engineering: Boosting Large Language Models through Positional Information Manipulation | Zhiyuan He, Huiqiang Jiang, Zilong Wang, Yuqing Yang, Luna K. Qiu, Lili Qiu | N/A | N/A |
| Towards Injecting Medical Visual Knowledge into Multimodal LLMs at Scale | Junying Chen, Chi Gui, Ouyang Ruyi, Anningzhe Gao, Shunian Chen, Guiming Hardy Chen, Xidong Wang, Zhenyang Cai, Ke Ji, Xiang Wan, Benyou Wang | N/A | N/A |
| ADELIE: Aligning Large Language Models on Information Extraction | Yunjia Qi, Hao Peng, Xiaozhi Wang, Bin Xu, Lei Hou, Juanzi Li | N/A | N/A |
| Unveiling Factual Recall Behaviors of Large Language Models through Knowledge Neurons | Yifei Wang, Yuheng Chen, Wanting Wen, Yu Sheng, Linjing Li, Daniel Dajun Zeng | N/A | N/A |
| Lexically Grounded Subword Segmentation | Jindřich Libovický, Jindřich Helcl | N/A | N/A |
| EAGLE-2: Faster Inference of Language Models with Dynamic Draft Trees | Yuhui Li, Fangyun Wei, Chao Zhang, Hongyang Zhang | N/A | N/A |
| Do Text-to-Vis Benchmarks Test Real Use of Visualizations? | Hy Nguyen, Xuefei He, Andrew Reeson, Cecile Paris, Josiah Poon, Jonathan K. Kummerfeld | N/A | N/A |
| Gold Panning in Vocabulary: An Adaptive Method for Vocabulary Expansion of Domain-Specific LLMs | Chengyuan Liu, Shihang Wang, Lizhi Qing, Kun Kuang, Yangyang Kang, Changlong Sun, Fei Wu | N/A | N/A |
| Strategic Demonstration Selection for Improved Fairness in LLM In-Context Learning | Jingyu Hu, Weiru Liu, Mengnan Du | N/A | N/A |
| Multi-Dialect Vietnamese: Task, Dataset, Baseline Models and Challenges | Nguyen Van Dinh, Thanh Chi Dang, Luan Thanh Nguyen, Kiet Van Nguyen | N/A | N/A |
| Is LLM-as-a-Judge Robust? Investigating Universal Adversarial Attacks on Zero-shot LLM Assessment | Vyas Raina, Adian Liusie, Mark Gales | N/A | N/A |
| Rethinking the Reversal Curse of LLMs: a Prescription from Human Knowledge Reversal | Zhicong Lu, Li Jin, Peiguang Li, Yu Tian, Linhao Zhang, Sirui Wang, Guangluan Xu, Changyuan Tian, Xunliang Cai | N/A | N/A |
| More Than Catastrophic Forgetting: Integrating General Capabilities For Domain-Specific LLMs | Chengyuan Liu, Shihang Wang, Yangyang Kang, Lizhi Qing, Fubang Zhao, Chao Wu, Changlong Sun, Kun Kuang, Fei Wu | N/A | N/A |
| Muting Whisper: A Universal Acoustic Adversarial Attack on Speech Foundation Models | Vyas Raina, Rao Ma, Charles McGhee, Kate Knill, Mark Gales | N/A | N/A |
| GENRA: Enhancing Zero-shot Retrieval with Rank Aggregation | Georgios Katsimpras, Georgios Paliouras | N/A | N/A |
| XplainLLM: A Knowledge-Augmented Dataset for Reliable Grounded Explanations in LLMs | Zichen Chen, Jianda Chen, Ambuj Singh, Misha Sra | N/A | N/A |
| Divide and Conquer Radiology Report Generation via Observation Level Fine-grained Pretraining and Prompt Tuning | Yuanpin Zhou, Huogen Wang | N/A | N/A |
| SURf: Teaching Large Vision-Language Models to Selectively Utilize Retrieved Information | Jiashuo Sun, Jihai Zhang, Yucheng Zhou, Zhaochen Su, Xiaoye Qu, Yu Cheng | N/A | N/A |
| UNO Arena for Evaluating Sequential Decision-Making Capability of Large Language Models | Zhanyue Qin, Haochuan Wang, Deyuan Liu, Ziyang Song, Cunhang Fan, Zhao Lv, Jinlin Wu, Zhen Lei, Zhiying Tu, Dianhui Chu, Xiaoyan Yu, Dianbo Sui | N/A | N/A |
| Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments | Yu Gu, Yiheng Shu, Hao Yu, Xiao Liu, Yuxiao Dong, Jie Tang, Jayanth Srinivasa, Hugo Latapie, Yu Su | N/A | N/A |
| MORPHEUS: Modeling Role from Personalized Dialogue History by Exploring and Utilizing Latent Space | Yihong Tang, Bo Wang, Dongming Zhao, Jin Xiaojia, Zhang Jijun, Ruifang He, Yuexian Hou | N/A | N/A |
| KnowledgeSG: Privacy-Preserving Synthetic Text Generation With Knowledge Distillation From Server | WenHao Wang, Xiaoyu Liang, Rui Ye, Jingyi Chai, Siheng Chen, Yanfeng Wang | N/A | N/A |
| DAMRO: Dive into the Attention Mechanism of LVLM to Reduce Object Hallucination | Xuan Gong, Tianshi Ming, Xinpeng Wang, Zhihua Wei | N/A | N/A |
| Unlocking the Future: Exploring Look-Ahead Planning Mechanistic Interpretability in Large Language Models | Tianyi Men, Pengfei Cao, Zhuoran Jin, Yubo Chen, Kang Liu, Jun Zhao | N/A | N/A |
| Breaking Language Barriers: Cross-Lingual Continual Pre-Training at Scale | Wenzhen Zheng, Wenbo Pan, Xu Xu, Libo Qin, Li Yue, Ming Zhou | N/A | N/A |
| An Empirical Study of Multilingual Reasoning Distillation for Question Answering | Patomporn Payoungkhamdee, Peerat Limkonchotiwat, Jinheon Baek, Potsawee Manakul, Can Udomcharoenchaikit, Ekapol Chuangsuwanich, Sarana Nutanong | N/A | N/A |
| Can Large Language Models Faithfully Express Their Intrinsic Uncertainty in Words? | Gal Yona, Roee Aharoni, Mor Geva | N/A | N/A |
| Does Fine-Tuning LLMs on New Knowledge Encourage Hallucinations? | Zorik Gekhman, Gal Yona, Roee Aharoni, Matan Eyal, Amir Feder, Roi Reichart, Jonathan Herzig | N/A | N/A |
| Bridging Modalities: Enhancing Cross-Modality Hate Speech Detection with Few-Shot In-Context Learning | Ming Shan Hee, Aditi Kumaresan, Roy Ka-Wei Lee | N/A | N/A |
| MIND: Multimodal Shopping Intention Distillation from Large Vision-language Models for E-commerce Purchase Understanding | Baixuan Xu, Weiqi Wang, Haochen Shi, Wenxuan Ding, Huihao JING, Tianqing Fang, Jiaxin Bai, Xin Liu, Changlong Yu, Zheng Li, Chen Luo, Qingyu Yin, Bing Yin, Long Chen, Yangqiu Song | N/A | N/A |
| ECON: On the Detection and Resolution of Evidence Conflicts | Cheng Jiayang, Qianqian Zhuang, Chunkit Chan, Lin Qiu, Tianhang Zhang, Tengxiao Liu, Yangqiu Song, Yue Zhang, Pengfei Liu, Zheng Zhang | N/A | N/A |
| “Image, Tell me your story!” Predicting the original meta-context of visual misinformation | Jonathan Tonglet, Marie-Francine Moens, Iryna Gurevych | N/A | N/A |
| Improving Retrieval-augmented Text-to-SQL with AST-based Ranking and Schema Pruning | Zhili Shen, Pavlos Vougiouklis, Chenxin Diao, Kaustubh Vyas, Yuanyi Ji, Jeff Z. Pan | N/A | N/A |
| Mixture-of-Subspaces in Low-Rank Adaptation | Taiqiang Wu, Jiahao Wang, Zhe Zhao, Ngai Wong | N/A | N/A |
| A Large-Scale Investigation of Human-LLM Evaluator Agreement on Multilingual and Multi-Cultural Data | Ishaan Watts, Varun Gumma, Aditya Yadavalli, Vivek Seshadri, Manohar Swaminathan, Sunayana Sitaram | N/A | N/A |
| LawBench: Benchmarking Legal Knowledge of Large Language Models | Zhiwei Fei, Xiaoyu Shen, Dawei Zhu, Fengzhe Zhou, Zhuo Han, Alan Huang, Songyang Zhang, Kai Chen, Zhixin Yin, Zongwen Shen, Jidong Ge, Vincent Ng | N/A | N/A |
| Efficient Performance Tracking: Leveraging Large Language Models for Automated Construction of Scientific Leaderboards | Furkan Şahinuç, Thy Thy Tran, Yulia Grishina, Yufang Hou, Bei Chen, Iryna Gurevych | N/A | N/A |
| Efficient Vision-Language pre-training via domain-specific learning for human activities | Adrian Bulat, Yassine Ouali, Ricardo Guerrero, Brais Martinez, Georgios Tzimiropoulos | N/A | N/A |
| Empowering Backbone Models for Visual Text Generation with Input Granularity Control and Glyph-Aware Training | Wenbo Li, Guohao Li, Zhibin Lan, Xue Xu, Wanru Zhuang, Jiachen Liu, Xinyan Xiao, Jinsong Su | N/A | N/A |
| Evaluating Character Understanding of Large Language Models via Character Profiling from Fictional Works | Xinfeng Yuan, Siyu Yuan, Yuhan Cui, Tianhe Lin, Xintao Wang, Rui Xu, Jiangjie Chen, Deqing Yang | N/A | N/A |
| Getting More from Less: Large Language Models are Good Spontaneous Multilingual Learners | Shimao Zhang, Changjiang Gao, Wenhao Zhu, Jiajun Chen, Xin Huang, Xue Han, Junlan Feng, Chao Deng, Shujian Huang | N/A | N/A |
| AdaSwitch: Adaptive Switching between Small and Large Agents for Effective Cloud-Local Collaborative Learning | Hao Sun, Jiayi Wu, Hengyi Cai, Xiaochi Wei, Yue Feng, Bo Wang, Shuaiqiang Wang, Yan Zhang, Dawei Yin | N/A | N/A |
| CoBa: Convergence Balancer for Multitask Finetuning of Large Language Models | Zi Gong, Hang Yu, Cong Liao, Bingchang Liu, Chaoyu Chen, Jianguo Li | N/A | N/A |
| mDPO: Conditional Preference Optimization for Multimodal Large Language Models | Fei Wang, Wenxuan Zhou, James Y. Huang, Nan Xu, Sheng Zhang, Hoifung Poon, Muhao Chen | N/A | N/A |
| Data Advisor: Data Curation with Foresight for Safety Alignment of Large Language Models | Fei Wang, Ninareh Mehrabi, Palash Goyal, Rahul Gupta, Kai-Wei Chang, Aram Galstyan | N/A | N/A |
| Language-to-Code Translation with a Single Labeled Example | Kaj Bostrom, Harsh Jhamtani, Hao Fang, Sam Thomson, Richard Shin, Patrick Xia, Benjamin Van Durme, Jason Eisner, Jacob Andreas | N/A | N/A |
| Attribute or Abstain: Large Language Models as Long Document Assistants | Jan Buchmann, Xiao Liu, Iryna Gurevych | N/A | N/A |
| FEDKIM: Adaptive Federated Knowledge Injection into Medical Foundation Models | Xiaochen Wang, Jiaqi Wang, Houping Xiao, Jinghui Chen, Fenglong Ma | N/A | N/A |
| Retrieved In-Context Principles from Previous Mistakes | Hao Sun, Yong Jiang, Bo Wang, Yingyan Hou, Yan Zhang, Pengjun Xie, Fei Huang | N/A | N/A |
| EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control | Haozhe Chen, Run Chen, Julia Hirschberg | N/A | N/A |
| VPTQ: Extreme Low-bit Vector Post-Training Quantization for Large Language Models | Yifei Liu, Jicheng Wen, Yang Wang, Shengyu Ye, Li Lyna Zhang, Ting Cao, Cheng Li, Mao Yang | N/A | N/A |
| Deterministic Weighted L* Algorithm | Clemente Pasti, Talu Karagöz, Franz Nowak, Anej Svete, Ryan Cotterell | N/A | N/A |
| Towards Verifiable Text Generation with Evolving Memory and Self-Reflection | Hao Sun, Hengyi Cai, Bo Wang, Yingyan Hou, Xiaochi Wei, Shuaiqiang Wang, Yan Zhang, Dawei Yin | N/A | N/A |
| Pelican: Correcting Hallucination in Vision-LLMs via Claim Decomposition and Program of Thought Verification | Pritish Sahu, Karan Sikka, Ajay Divakaran | N/A | N/A |
| Resampled Datasets Are Not Enough: Mitigating Societal Bias Beyond Single Attributes | Yusuke Hirota, Jerone Andrews, Dora Zhao, Orestis Papakyriakopoulos, Apostolos Modas, Yuta Nakashima, Alice Xiang | N/A | N/A |
| RealVul: Can We Detect Vulnerabilities in Web Applications with LLM? | Di Cao, Yong Liao, Xiuwei Shang | N/A | N/A |
| Unsupervised End-to-End Task-Oriented Dialogue with LLMs: The Power of the Noisy Channel | Brendan King, Jeffrey Flanigan | N/A | N/A |
| Humans or LLMs as the Judge? A Study on Judgement Bias | Guiming Hardy Chen, Shunian Chen, Ziche Liu, Feng Jiang, Benyou Wang | N/A | N/A |
| WPO: Enhancing RLHF with Weighted Preference Optimization | Wenxuan Zhou, Ravi Agrawal, Shujian Zhang, Sathish Reddy Indurthi, Sanqiang Zhao, Kaiqiang Song, Silei Xu, Chenguang Zhu | N/A | N/A |
| Walking in Others’ Shoes: How Perspective-Taking Guides Large Language Models in Reducing Toxicity and Bias | Rongwu Xu, Zian Zhou, Tianwei Zhang, Zehan Qi, Su Yao, Ke Xu, Wei Xu, Han Qiu | N/A | N/A |
| MetaReflection: Learning Instructions for Language Agents using Past Reflections | Priyanshu Gupta, Shashank Kirtania, Ananya Singha, Sumit Gulwani, Arjun Radhakrishna, Gustavo Soares, Sherry Shi | N/A | N/A |
| Stepwise Verification and Remediation of Student Reasoning Errors with Large Language Model Tutors | Nico Daheim, Jakub Macina, Manu Kapur, Iryna Gurevych, Mrinmaya Sachan | N/A | N/A |
| On Eliciting Syntax from Language Models via Hashing | Yiran Wang, Masao Utiyama | N/A | N/A |
| CliMedBench: A Large-Scale Chinese Benchmark for Evaluating Medical Large Language Models in Clinical Scenarios | Zetian Ouyang, Yishuai Qiu, Linlin Wang, Gerard de Melo, Ya Zhang, Yanfeng Wang, Liang He | N/A | N/A |
| The Best Defense is Attack: Repairing Semantics in Textual Adversarial Examples | Heng Yang | N/A | N/A |
| CSSL: Contrastive Self-Supervised Learning for Dependency Parsing on Relatively Free Word Ordered and Morphologically Rich Low Resource Languages | Pretam Ray, Jivnesh Sandhan, Amrith Krishna, Pawan Goyal | N/A | N/A |
| Perceptions of Linguistic Uncertainty by Language Models and Humans | Catarina G Belém, Markelle Kelly, Mark Steyvers, Sameer Singh, Padhraic Smyth | N/A | N/A |
| Explaining and Improving Contrastive Decoding by Extrapolating the Probabilities of a Huge and Hypothetical LM | Haw-Shiuan Chang, Nanyun Peng, Mohit Bansal, Anil Ramakrishna, Tagyoung Chung | N/A | N/A |
| Zero-shot Cross-domain Dialogue State Tracking via Context-aware Auto-prompting and Instruction-following Contrastive Decoding | Xiaoyu Dong, Yujie Feng, Zexin Lu, Guangyuan Shi, Xiao-Ming Wu | N/A | N/A |
| Knowledge Conflicts for LLMs: A Survey | Rongwu Xu, Zehan Qi, Zhijiang Guo, Cunxiang Wang, Hongru WANG, Yue Zhang, Wei Xu | N/A | N/A |
| Generative AI in the Era of “Alternative Facts” | Saadia Gabriel, Liang Lyu, James Siderius, Marzyeh Ghassemi, Jacob Andreas, Asuman E. Ozdaglar | N/A | N/A |
| MEANT: Multimodal Encoder for Antecedent Information | Benjamin Irving, Annika Marie Schoene | N/A | N/A |
| A Thorough Examination of Decoding Methods in the Era of LLMs | Chufan Shi, Haoran Yang, Deng Cai, Zhisong Zhang, Yifan Wang, Yujiu Yang, Wai Lam | N/A | N/A |
| AGRaME: Any-Granularity Ranking with Multi-Vector Embeddings | Revanth Gangi Reddy, Omar Attia, Yunyao Li, Heng Ji, Saloni Potdar | N/A | N/A |
| FIRST: Faster Improved Listwise Reranking with Single Token Decoding | Revanth Gangi Reddy, JaeHyeok Doo, Yifei Xu, Md Arafat Sultan, Deevya Swain, Avirup Sil, Heng Ji | N/A | N/A |
| Exploring Nested Named Entity Recognition with Large Language Models: Methods, Challenges, and Insights | Hongjin KIM, Jai-Eun Kim, Harksoo Kim | N/A | N/A |
| ReCaLL: Membership Inference via Relative Conditional Log-Likelihoods | Roy Xie, Junlin Wang, Ruomin Huang, Minxing Zhang, Rong Ge, Jian Pei, Neil Zhenqiang Gong, Bhuwan Dhingra | N/A | N/A |
| “Flex Tape Can’t Fix That”: Bias and Misinformation in Edited Language Models | Karina H Halevy, Anna Sotnikova, Badr AlKhamissi, Syrielle Montariol, Antoine Bosselut | N/A | N/A |
| Revisiting Who’s Harry Potter: Towards Targeted Unlearning from a Causal Intervention Perspective | Yujian Liu, Yang Zhang, Tommi Jaakkola, Shiyu Chang | N/A | N/A |
| LIONs: An Empirically Optimized Approach to Align Language Models | Xiao Yu, Qingyang Wu, Yu Li, Zhou Yu | N/A | N/A |
| Jellyfish: Instruction-Tuning Local Large Language Models for Data Preprocessing | Haochen Zhang, Yuyang Dong, Chuan Xiao, Masafumi Oyamada | N/A | N/A |
| A Comprehensive Survey of Scientific Large Language Models and Their Applications in Scientific Discovery | Yu Zhang, Xiusi Chen, Bowen Jin, Sheng Wang, Shuiwang Ji, Wei Wang, Jiawei Han | N/A | N/A |
| MiniCheck: Efficient Fact-Checking of LLMs on Grounding Documents | Liyan Tang, Philippe Laban, Greg Durrett | N/A | N/A |
| Beyond Label Attention: Transparency in Language Models for Automated Medical Coding via Dictionary Learning | John Wu, David Wu, Jimeng Sun | N/A | N/A |
| MOSEL: Inference Serving Using Dynamic Modality Selection | Bodun Hu, Le Xu, Jeongyoon Moon, Neeraja J Yadwadkar, Aditya Akella | N/A | N/A |
| From RAG to Riches: Retrieval Interlaced with Sequence Generation | Palak Jain, Livio Baldini Soares, Tom Kwiatkowski | N/A | N/A |
| Task Arithmetic can Mitigate Synthetic-to-Real Gap in Automatic Speech Recognition | Hsuan Su, Hua Farn, Fan-Yun Sun, Shang-Tse Chen, Hung-yi Lee | N/A | N/A |
| Learning to Correct for QA Reasoning with Black-box LLMs | Jaehyung Kim, Dongyoung Kim, Yiming Yang | N/A | N/A |
| AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks? | Ori Yoran, Samuel Joseph Amouyal, Chaitanya Malaviya, Ben Bogin, Ofir Press, Jonathan Berant | N/A | N/A |
| PostMark: A Robust Blackbox Watermark for Large Language Models | Yapei Chang, Kalpesh Krishna, Amir Houmansadr, John Frederick Wieting, Mohit Iyyer | N/A | N/A |
| Assessing “Implicit” Retrieval Robustness of Large Language Models | Xiaoyu Shen, Rexhina Blloshmi, Dawei Zhu, Jiahuan Pei, Wei Zhang | N/A | N/A |
| On the Relationship between Truth and Political Bias in Language Models | Suyash Fulay, William Brannon, Shrestha Mohanty, Cassandra Overney, Elinor Poole-Dayan, Deb Roy, Jad Kabbara | N/A | N/A |
| Can Active Label Correction Improve LLM-based Modular AI Systems? | Karan Taneja, Ashok Goel | N/A | N/A |
| Statistical Uncertainty in Word Embeddings: GloVe-V | Andrea Vallebueno, Cassandra Handan-Nader, Christopher D Manning, Daniel E. Ho | N/A | N/A |
| Annotation alignment: Comparing LLM and human annotations of conversational safety | Rajiv Movva, Pang Wei Koh, Emma Pierson | N/A | N/A |
| DiVERT: Distractor Generation with Variational Errors Represented as Text for Math Multiple-choice Questions | Nigel Fernandez, Alexander Scarlatos, Digory Smith, Simon Woodhead, Nancy Otero Ornelas, Andrew Lan | N/A | N/A |
| The Factuality Tax of Diversity-Intervened Text-to-Image Generation: Benchmark and Fact-Augmented Intervention | Yixin Wan, Di Wu, Haoran Wang, Kai-Wei Chang | N/A | N/A |
| CleanGen: Mitigating Backdoor Attacks for Generation Tasks in Large Language Models | Yuetai Li, Zhangchen Xu, Fengqing Jiang, Luyao Niu, Dinuka Sahabandu, Bhaskar Ramasubramanian, Radha Poovendran | N/A | N/A |
| Enhancing Reinforcement Learning with Intrinsic Rewards from Language Model Critique | Meng Cao, Lei Shu, Lei Yu, Yun Zhu, Nevan Wichers, Yinxiao Liu, Lei Meng | N/A | N/A |
| Words Matter: Reducing Stigma in Online Conversations about Substance Use with Large Language Models | Layla Bouzoubaa, Elham Aghakhani, Shadi Rezapour | N/A | N/A |
| Efficient Sequential Decision Making with Large Language Models | Dingyang Chen, Qi Zhang, Yinglun Zhu | N/A | N/A |
| SignCLIP: Connecting Text and Sign Language by Contrastive Learning | Zifan Jiang, Gerard Sant, Amit Moryossef, Mathias Müller, Rico Sennrich, Sarah Ebling | N/A | N/A |
| APPLS: Evaluating Evaluation Metrics for Plain Language Summarization | Yue Guo, Tal August, Gondy Leroy, Trevor Cohen, Lucy Lu Wang | N/A | N/A |
| Ontologically Faithful Generation of Non-Player Character Dialogues | Nathaniel Weir, Ryan Thomas, Randolph d’Amore, Kellie Hill, Benjamin Van Durme, Harsh Jhamtani | N/A | N/A |
| LLM See, LLM Do: Leveraging Active Inheritance to Target Non-Differentiable Objectives | Luísa Shimabucoro, Sebastian Ruder, Julia Kreutzer, Marzieh Fadaee, Sara Hooker | N/A | N/A |
| RuBLiMP: Russian Benchmark of Linguistic Minimal Pairs | Ekaterina Taktasheva, Maxim Bazhukov, Kirill Koncha, Alena Fenogenova, Ekaterina Artemova, Vladislav Mikhailov | N/A | N/A |
| Text-Tuple-Table: Towards Information Integration in Text-to-Table Generation via Global Tuple Extraction | Zheye Deng, Chunkit Chan, Weiqi Wang, Yuxi Sun, Wei Fan, Tianshi Zheng, Yauwai Yim, Yangqiu Song | N/A | N/A |
| Toward Compositional Behavior in Neural Models: A Survey of Current Views | Kate McCurdy, Paul Soulos, Paul Smolensky | N/A | N/A |
| Optimizing Instructions and Demonstrations for Multi-Stage Language Model Programs | Krista Opsahl-Ong, Michael J Ryan, Josh Purtell, David Broman, Christopher Potts, Matei Zaharia, Omar Khattab | N/A | N/A |
| Reverse-Engineering the Reader | Samuel Kiegeland, Ethan Wilcox, Afra Amini, David Robert Reich, Ryan Cotterell | N/A | N/A |
| Synchronous Faithfulness Monitoring for Trustworthy Retrieval-Augmented Generation | Di Wu, Jia-Chen Gu, Fan Yin, Nanyun Peng, Kai-Wei Chang | N/A | N/A |
| Structure Guided Prompt: Instructing Large Language Model in Multi-Step Reasoning by Exploring Graph Structure of the Text | Kewei Cheng, Nesreen K. Ahmed, Theodore L. Willke, Yizhou Sun | N/A | N/A |
| Less is More: Parameter-Efficient Selection of Intermediate Tasks for Transfer Learning | David Schulte, Felix Hamborg, Alan Akbik | N/A | N/A |
| The effects of distance on NPI illusive effects in BERT | So Young Lee, Mai Ha Vu | N/A | N/A |
| Enhancing Systematic Decompositional Natural Language Inference Using Informal Logic | Nathaniel Weir, Kate Sanders, Orion Weller, Shreya Sharma, Dongwei Jiang, Zhengping Jiang, Bhavana Dalvi Mishra, Oyvind Tafjord, Peter Jansen, Peter Clark, Benjamin Van Durme | N/A | N/A |
| Susu Box or Piggy Bank: Assessing Cultural Commonsense Knowledge between Ghana and the US | Christabel Acquaye, Haozhe An, Rachel Rudinger | N/A | N/A |
| Read Anywhere Pointed: Layout-aware GUI Screen Reading with Tree-of-Lens Grounding | Yue Fan, Lei Ding, Ching-Chen Kuo, Shan Jiang, Yang Zhao, Xinze Guan, Jie Yang, Yi Zhang, Xin Eric Wang | N/A | N/A |
| Ranking Manipulation for Conversational Search Engines | Samuel Pfrommer, Yatong Bai, Tanmay Gautam, Somayeh Sojoudi | N/A | N/A |
| Fast Forwarding Low-Rank Training | Adir Rahamim, Naomi Saphra, Sara Kangaslahti, Yonatan Belinkov | N/A | N/A |
| Precise Model Benchmarking with Only a Few Observations | Riccardo Fogliato, Pratik Patil, Nil-Jana Akpinar, Mathew Monfort | N/A | N/A |
| Attribute Diversity Determines the Systematicity Gap in VQA | Ian Berlot-Attwell, Kumar Krishna Agrawal, Annabelle Michael Carrell, Yash Sharma, Naomi Saphra | N/A | N/A |
| “Rows, Columns and Values, Oh My!” Synthesizing Scientific Literature into Tables using Language Models | Benjamin Newman, Yoonjoo Lee, Aakanksha Naik, Pao Siangliulue, Raymond Fok, Juho Kim, Daniel S Weld, Joseph Chee Chang, Kyle Lo | N/A | N/A |
| Development of Cognitive Intelligence in Pre-trained Language Models | Raj Sanjay Shah, Khushi Bhardwaj, Sashank Varma | N/A | N/A |
| Modeling Layout Reading Order as Ordering Relations for Visually-rich Document Understanding | Chong Zhang, Yi Tu, Yixi Zhao, Chenshu Yuan, Huan Chen, Yue Zhang, Mingxu Chai, Ya Guo, Huijia Zhu, Qi Zhang, Tao Gui | N/A | N/A |
| Birdie: Advancing State Space Models with a Minimalist Architecture and Novel Pre-training Objectives | Sam Blouir, Jimmy T.H. Smith, Antonios Anastasopoulos, Amarda Shehu | N/A | N/A |
| Is It Good Data for Multilingual Instruction Tuning or Just Bad Multilingual Evaluation for Large Language Models? | Pinzhen Chen, Simon Yu, Zhicheng Guo, Barry Haddow | N/A | N/A |
| Token Erasure as a Footprint of Implicit Vocabulary Items in LLMs | Sheridan Feucht, David Atkinson, Byron C Wallace, David Bau | N/A | N/A |
| TraveLER: A Modular Multi-LMM Agent Framework for Video Question-Answering | Chuyi Shang, Amos You, Sanjay Subramanian, Trevor Darrell, Roei Herzig | N/A | N/A |
| Evaluating the Effectiveness of Large Language Models in Establishing Conversational Grounding | Biswesh Mohapatra, Manav Nitin Kapadnis, Laurent Romary, Justine Cassell | N/A | N/A |
| Unlocking Memorization in Large Language Models with Dynamic Soft Prompting | Zhepeng Wang, Runxue Bao, Yawen Wu, Jackson Taylor, Cao Xiao, Feng Zheng, Weiwen Jiang, Shangqian Gao, Yanfu Zhang | N/A | N/A |
| If CLIP Could Talk: Understanding Vision-Language Model Representations Through Their Preferred Concept Descriptions | Reza Esfandiarpoor, Cristina Menghini, Stephen Bach | N/A | N/A |
| Extract, Define, Canonicalize: An LLM-based Framework for Knowledge Graph Construction | Bowen Zhang, Harold Soh | N/A | N/A |
| MQuinE: a Cure for “Z-paradox” in Knowledge Graph Embedding | Yang Liu, Huang Fang, Yunfeng Cai, Mingming Sun | N/A | N/A |
| Can Transformer Language Models Learn $n$-gram Language Models? | Anej Svete, Nadav Borenstein, Mike Zhou, Ryan Cotterell | N/A | N/A |
| StablePrompt: Automatic Prompt Tuning using Reinforcement Learning for Large Language Model | Minchan Kwon, Gaeun Kim, Jongsuk Kim, Haeil Lee, Junmo Kim | N/A | N/A |
| Summary of a Haystack: A Challenge to Long-Context LLMs and RAG Systems | Philippe Laban, Alexander Fabbri, Caiming Xiong, Chien-Sheng Wu | N/A | N/A |
| Multi-pass Decoding for Grammatical Error Correction | Xiaoying Wang, Lingling Mu, Jingyi Zhang, Hongfei Xu | N/A | N/A |
| Into the Unknown Unknowns: Engaged Human Learning through Participation in Language Model Agent Conversations | Yucheng Jiang, Yijia Shao, Dekun Ma, Sina Semnani, Monica Lam | N/A | N/A |
| SCOI: Syntax-augmented Coverage-based In-context Example Selection for Machine Translation | Chenming Tang, Zhixiang Wang, Yunfang Wu | N/A | N/A |
| Efficient Temporal Extrapolation of Multimodal Large Language Models with Temporal Grounding Bridge | Yuxuan Wang, Yueqian Wang, Pengfei Wu, Jianxin Liang, Dongyan Zhao, Yang Liu, Zilong Zheng | N/A | N/A |
| STORYSUMM: Evaluating Faithfulness in Story Summarization | Melanie Subbiah, Faisal Ladhak, Akankshya Mishra, Griffin Thomas Adams, Lydia Chilton, Kathleen McKeown | N/A | N/A |
| MMOE: Enhancing Multimodal Models with Mixtures of Multimodal Interaction Experts | Haofei Yu, Zhengyang Qi, Lawrence Keunho Jang, Russ Salakhutdinov, Louis-Philippe Morency, Paul Pu Liang | N/A | N/A |
| OmAgent: A Multi-modal Agent Framework for Complex Video Understanding with Task Divide-and-Conquer | Lu Zhang, Tiancheng Zhao, Heting Ying, Yibo Ma, Kyusong Lee | N/A | N/A |
| Enhancing Pre-Trained Generative Language Models with Question Attended Span Extraction on Machine Reading Comprehension | Lin Ai, Zheng Hui, Zizhou Liu, Julia Hirschberg | N/A | N/A |
| CommonIT: Commonality-Aware Instruction Tuning for Large Language Models via Data Partitions | Jun Rao, Xuebo Liu, Lian Lian, Shengjun Cheng, Yunjie Liao, Min Zhang | N/A | N/A |
| ESC: Efficient Speech Coding with Cross-Scale Residual Vector Quantized Transformers | Yuzhe Gu, Enmao Diao | N/A | N/A |
| Breaking ReLU Barrier: Generalized MoEfication for Dense Pretrained Models | Jaeseong Lee, seung-won hwang, Wonpyo Park, Mingi Ji | N/A | N/A |
| Detecting Subtle Differences between Human and Model Languages Using Spectrum of Relative Likelihood | Yang Xu, Yu Wang, Hao An, Yongyuan Li, Zhichen Liu | N/A | N/A |
| Optimizing Language Models with Fair and Stable Reward Composition in Reinforcement Learning | Jiahui Li, Hanlin Zhang, Fengda Zhang, Tai-Wei Chang, Kun Kuang, Long Chen, Jun Zhou | N/A | N/A |
| Fine-grained Pluggable Gradient Ascent for Knowledge Unlearning in Language Models | XiaoHua Feng, Chaochao Chen, Yuyuan Li, Zibin Lin | N/A | N/A |
| ARM: An Alignment-and-Replacement Module for Chinese Spelling Check Based on LLMs | Changchun Liu, Kai Zhang, Junzhe Jiang, Zirui Liu, Hanqing Tao, Min Gao, Enhong Chen | N/A | N/A |
| On the In-context Generation of Language Models | Zhongtao Jiang, Yuanzhe Zhang, Kun Luo, Xiaowei Yuan, Jun Zhao, Kang Liu | N/A | N/A |
| Atomic Inference for NLI with Generated Facts as Atoms | Joe Stacey, Pasquale Minervini, Haim Dubossarsky, Oana-Maria Camburu, Marek Rei | N/A | N/A |
| Towards Robust Speech Representation Learning for Thousands of Languages | William Chen, Wangyou Zhang, Yifan Peng, Xinjian Li, Jinchuan Tian, Jiatong Shi, Xuankai Chang, Soumi Maiti, Karen Livescu, Shinji Watanabe | N/A | N/A |
| I Learn Better If You Speak My Language: Understanding the Superior Performance of Fine-Tuning Large Language Models with LLM-Generated Responses | Xuan Ren, Biao Wu, Lingqiao Liu | N/A | N/A |
| PreAlign: Boosting Cross-Lingual Transfer by Early Establishment of Multilingual Alignment | Jiahuan Li, Shujian Huang, Aarron Ching, Xinyu Dai, Jiajun Chen | N/A | N/A |
| An image speaks a thousand words, but can everyone listen? On image transcreation for cultural relevance | Simran Khanuja, Sathyanarayanan Ramamoorthy, Yueqi Song, Graham Neubig | N/A | N/A |
| When Parts are Greater Than Sums: Individual LLM Components Can Outperform Full Models | Ting-Yun Chang, Jesse Thomason, Robin Jia | N/A | N/A |
| Multimodal Clickbait Detection by De-confounding Biases Using Causal Representation Inference | Jianxing Yu, Shiqi Wang, Han Yin, Zhenlong Sun, Ruobing Xie, Bo Zhang, Yanghui Rao | N/A | N/A |
| Matryoshka-Adaptor: Unsupervised and Supervised Tuning for Smaller Embedding Dimensions | Jinsung Yoon, Rajarishi Sinha, Sercan O Arik, Tomas Pfister | N/A | N/A |
| KNN-Instruct: Automatic Instruction Construction with K Nearest Neighbor Deduction | Jianshang Kou, Benfeng Xu, Chiwei Zhu, Zhendong Mao | N/A | N/A |
| Contextualized Sequence Likelihood: Enhanced Confidence Scores for Natural Language Generation | Zhen Lin, Shubhendu Trivedi, Jimeng Sun | N/A | N/A |
| MixGR: Enhancing Retriever Generalization for Scientific Domain through Complementary Granularity | Fengyu Cai, Xinran Zhao, Tong Chen, Sihao Chen, Hongming Zhang, Iryna Gurevych, Heinz Koeppl | N/A | N/A |
| CARER - ClinicAl Reasoning-Enhanced Representation for Temporal Health Risk Prediction | Tuan Dung Nguyen, Thanh Trung Huynh, Minh Hieu Phan, Quoc Viet Hung Nguyen, Phi Le Nguyen | N/A | N/A |
| “In-Dialogues We Learn”: Towards Personalized Dialogue Without Pre-defined Profiles through In-Dialogue Learning | Chuanqi Cheng, Quan Tu, Wei Wu, Shuo Shang, Cunli Mao, Zhengtao Yu, Rui Yan | N/A | N/A |
| Encourage or Inhibit Monosemanticity? Revisit Monosemanticity from a Feature Decorrelation Perspective | Hanqi Yan, Yanzheng Xiang, Guangyi Chen, Yifei Wang, Lin Gui, Yulan He | N/A | N/A |
| Enhancing Language Model Factuality via Activation-Based Confidence Calibration and Guided Decoding | Xin Liu, Farima Fatahi Bayat, Lu Wang | N/A | N/A |
| Reasoning Robustness of LLMs to Adversarial Typographical Errors | Esther Gan, Yiran Zhao, Liying Cheng, Mao Yancan, Anirudh Goyal, Kenji Kawaguchi, Min-Yen Kan, Michael Shieh | N/A | N/A |
| InferAligner: Inference-Time Alignment for Harmlessness through Cross-Model Guidance | Pengyu Wang, Dong Zhang, Linyang Li, Chenkun Tan, Xinghao Wang, Mozhi Zhang, Ke Ren, Botian Jiang, Xipeng Qiu | N/A | N/A |
| Belief Revision: The Adaptability of Large Language Models Reasoning | Bryan Wilie, Samuel Cahyawijaya, Etsuko Ishii, Junxian He, Pascale Fung | N/A | N/A |
| Fisher Information-based Efficient Curriculum Federated Learning with Large Language Models | Ji Liu, Jiaxiang Ren, Ruoming Jin, Zijie Zhang, Yang Zhou, Patrick Valduriez, Dejing Dou | N/A | N/A |
| Bio-RFX: Refining Biomedical Extraction via Advanced Relation Classification and Structural Constraints | Minjia Wang, Fangzhou Liu, Xiuxing Li, Bowen Dong, Zhenyu Li, Tengyu Pan, Jianyong Wang | N/A | N/A |
| Decoding Matters: Addressing Amplification Bias and Homogeneity Issue in Recommendations for Large Language Models | Keqin Bao, Jizhi Zhang, Yang Zhang, Xinyue Huo, Chong Chen, Fuli Feng | N/A | N/A |
| LLMs Are Prone to Fallacies in Causal Inference | Nitish Joshi, Abulhair Saparov, Yixin Wang, He He | N/A | N/A |
| Roleplay-doh: Enabling Domain-Experts to Create LLM-simulated Patients via Eliciting and Adhering to Principles | Ryan Louie, Ananjan Nandi, William Fang, Cheng Chang, Emma Brunskill, Diyi Yang | N/A | N/A |
| The Lou Dataset - Exploring the Impact of Gender-Fair Language in German Text Classification | Andreas Waldis, Joel Birrer, Anne Lauscher, Iryna Gurevych | N/A | N/A |
| When Generative Adversarial Networks Meet Sequence Labeling Challenges | Yu Tong, Ge Chen, Guokai Zheng, Rui Li, Jiang Dazhi | N/A | N/A |
| Evidence-Focused Fact Summarization for Knowledge-Augmented Zero-Shot Question Answering | Sungho Ko, Hyunjin Cho, Hyungjoo Chae, Jinyoung Yeo, Dongha Lee | N/A | N/A |
| Speechworthy Instruction-tuned Language Models | Hyundong Justin Cho, Nicolaas Paul Jedema, Leonardo F. R. Ribeiro, Karishma Sharma, Pedro Szekely, Alessandro Moschitti, Ruben Janssen, Jonathan May | N/A | N/A |
| Data, Data Everywhere: A Guide for Pretraining Dataset Construction | Jupinder Parmar, Shrimai Prabhumoye, Joseph Jennings, Bo Liu, Aastha Jhunjhunwala, Zhilin Wang, Mostofa Patwary, Mohammad Shoeybi, Bryan Catanzaro | N/A | N/A |
| Fine-Tuning and Prompt Optimization: Two Good Steps that Work Better Together | Dilara Soylu, Christopher Potts, Omar Khattab | N/A | N/A |
| Demystifying Verbatim Memorization in Large Language Models | Jing Huang, Diyi Yang, Christopher Potts | N/A | N/A |
| AmbigNLG: Addressing Task Ambiguity in Instruction for NLG | Ayana Niwa, Hayate Iso | N/A | N/A |
| Distributional Properties of Subword Regularization | Marco Cognetta, Vilém Zouhar, Naoaki Okazaki | N/A | N/A |
| DataTales: A Benchmark for Real-World Intelligent Data Narration | Yajing Yang, Qian Liu, Min-Yen Kan | N/A | N/A |
| Towards Fast Multilingual LLM Inference: Speculative Decoding and Specialized Drafters | Euiin Yi, Taehyeon Kim, Hongseok Jeung, Du-Seong Chang, Se-Young Yun | N/A | N/A |
| GlobeSumm: A Challenging Benchmark Towards Unifying Multi-lingual, Cross-lingual and Multi-document News Summarization | Yangfan Ye, Xiachong Feng, Xiaocheng Feng, Weitao Ma, Libo Qin, Dongliang Xu, Qing Yang, Hongtao Liu, Bing Qin | N/A | N/A |
| Breaking the Curse of Multilinguality with Cross-lingual Expert Language Models | Terra Blevins, Tomasz Limisiewicz, Suchin Gururangan, Margaret Li, Hila Gonen, Noah A. Smith, Luke Zettlemoyer | N/A | N/A |
| More Insightful Feedback for Tutoring: Enhancing Generation Mechanisms and Automatic Evaluation | Wencke Liermann, Jin-Xia Huang, Yohan Lee, Kong Joo Lee | N/A | N/A |
| Stable Language Model Pre-training by Reducing Embedding Variability | Woojin Chung, Jiwoo Hong, Na Min An, James Thorne, Se-Young Yun | N/A | N/A |
| What is lost in Normalization? Exploring Pitfalls in Multilingual ASR Model Evaluations | Kavya Manohar, Leena G Pillai | N/A | N/A |
| Diversity Over Size: On the Effect of Sample and Topic Sizes for Topic-Dependent Argument Mining Datasets | Benjamin Schiller, Johannes Daxenberger, Andreas Waldis, Iryna Gurevych | N/A | N/A |
| Kiss up, Kick down: Exploring Behavioral Changes in Multi-modal Large Language Models with Assigned Visual Personas | Seungjong Sun, Eungu Lee, Seo Yeon Baek, Seunghyun Hwang, Lee wonbyung, Dongyan Nan, Bernard J Jansen, Jang Hyun Kim | N/A | N/A |
| ATM: Adversarial Tuning Multi-agent System Makes a Robust Retrieval-Augmented Generator | Junda Zhu, Lingyong Yan, Haibo Shi, Dawei Yin, Lei Sha | N/A | N/A |
| Dynamic Multi-granularity Attribution Network for Aspect-based Sentiment Analysis | Yanjiang Chen, Kai Zhang, hufeng, Xianquan Wang, Ruikang Li, Qi Liu | N/A | N/A |
| Unlabeled Debiasing in Downstream Tasks via Class-wise Low Variance Regularization | Shahed Masoudian, Markus Frohmann, Navid Rekabsaz, Markus Schedl | N/A | N/A |
| Large Language Models Know What is Key Visual Entity: An LLM-assisted Multimodal Retrieval for VQA | Pu Jian, Donglei Yu, Jiajun Zhang | N/A | N/A |
| Towards Probing Speech-Specific Risks in Large Multimodal Models: A Taxonomy, Benchmark, and Insights | Hao Yang, Lizhen Qu, Ehsan Shareghi, Reza Haf | N/A | N/A |
| Self-AMPLIFY: Improving Small Language Models with Self Post Hoc Explanations | Milan BHAN, Jean-Noël Vittaut, Nicolas CHESNEAU, Marie-Jeanne Lesot | N/A | N/A |
| What are the Generator Preferences for End-to-end Task-Oriented Dialog System? | Wanshi Xu, Xianwei Zhuang, Zhanpeng Chen, Zhihong Zhu, Xuxin Cheng, Yuexian Zou | N/A | N/A |
| Paraphrase Types Elicit Prompt Engineering Capabilities | Jan Philip Wahle, Terry Ruas, Yang Xu, Bela Gipp | N/A | N/A |
| VLEU: a Method for Automatic Evaluation for Generalizability of Text-to-Image Models | Jingtao Cao, Zhang Zheng, Hongru WANG, Kam-Fai Wong | N/A | N/A |
| Towards Online Continuous Sign Language Recognition and Translation | Ronglai Zuo, Fangyun Wei, Brian Mak | N/A | N/A |
| Mitigate Extrinsic Social Bias in Pre-trained Language Models via Continuous Prompts Adjustment | Yiwei Dai, Hengrui Gu, Ying Wang, Xin Wang | N/A | N/A |
| Split and Merge: Aligning Position Biases in LLM-based Evaluators | Zongjie Li, Chaozheng Wang, Pingchuan Ma, Daoyuan Wu, Shuai Wang, Cuiyun Gao, Yang Liu | N/A | N/A |
| Integrating Argumentation and Hate-Speech-based Techniques for Countering Misinformation | Sougata Saha, Rohini Srihari | N/A | N/A |
| BPO: Supercharging Online Preference Learning by Adhering to the Proximity of Behavior LLM | Wenda Xu, Jiachen Li, William Yang Wang, Lei Li | N/A | N/A |
| One2Set + Large Language Model: Best Partners for Keyphrase Generation | Liangying Shao, Liang Zhang, Minlong Peng, Guoqi Ma, Hao Yue, Mingming Sun, Jinsong Su | N/A | N/A |
| Unlocking Markets: A Multilingual Benchmark to Cross-Market Question Answering | Yifei Yuan, Yang Deng, Anders Søgaard, Mohammad Aliannejadi | N/A | N/A |
| ORPO: Monolithic Preference Optimization without Reference Model | Jiwoo Hong, Noah Lee, James Thorne | N/A | N/A |
| A Multi-Perspective Analysis of Memorization in Large Language Models | Bowen Chen, Namgi Han, Yusuke Miyao | N/A | N/A |
| Do LLMs suffer from Multi-Party Hangover? A Diagnostic Approach to Addressee Recognition and Response Selection in Conversations | Nicolò Penzo, Maryam Sajedinia, Bruno Lepri, Sara Tonelli, Marco Guerini | N/A | N/A |
| Code Prompting Elicits Conditional Reasoning Abilities in Text+Code LLMs | Haritz Puerto, Martin Tutek, Somak Aditya, Xiaodan Zhu, Iryna Gurevych | N/A | N/A |
| Unveiling the Role of Pretraining in Direct Speech Translation | Belen Alastruey, Gerard I. Gállego, Marta R. Costa-jussà | N/A | N/A |
| PCQPR: Proactive Conversational Question Planning with Reflection | Shasha Guo | N/A | N/A |
| CodeAgent: Autonomous Communicative Agents for Code Review | Xunzhu Tang, Kisub Kim, Yewei Song, Cedric Lothritz, Bei Li, Saad Ezzini, Haoye Tian, Jacques Klein, Tegawendé F. Bissyandé | N/A | N/A |
| TroL: Traversal of Layers for Large Language and Vision Models | Byung-Kwan Lee, Sangyun Chung, Chae Won Kim, Beomchan Park, Yong Man Ro | N/A | N/A |
| MMTE: Corpus and Metrics for Evaluating Machine Translation Quality of Metaphorical Language | Shun Wang, Ge Zhang, Han Wu, Tyler Loakman, Wenhao Huang, Chenghua Lin | N/A | N/A |
| Revisiting Supertagging for faster HPSG parsing | Olga Zamaraeva, Carlos Gómez-Rodríguez | N/A | N/A |
| Improve Dense Passage Retrieval with Entailment Tuning | Lu Dai, Hao Liu, Hui Xiong | N/A | N/A |
| ToolBeHonest: A Multi-level Hallucination Diagnostic Benchmark for Tool-Augmented Large Language Models | Yuxiang Zhang, Jing Chen, Junjie Wang, Yaxin Liu, Cheng Yang, Chufan Shi, Xinyu Zhu, Zihao Lin, Hanwen WAN, Yujiu Yang, Tetsuya Sakai, Tian Feng, Hayato Yamana | N/A | N/A |
| TEMA: Token Embeddings Mapping for Enriching Low-Resource Language Models | Rodolfo Zevallos, Núria Bel, Mireia Farrús | N/A | N/A |
| DECOR: Improving Coherence in L2 English Writing with a Novel Benchmark for Incoherence Detection, Reasoning, and Rewriting | Xuanming Zhang, Anthony Diaz, Zixun Chen, Qingyang Wu, Kun Qian, Erik Voss, Zhou Yu | N/A | N/A |
| Text2Chart31: Instruction Tuning for Chart Generation with Automatic Feedback | Fatemeh Pesaran zadeh, Juyeon Kim, Jin-Hwa Kim, Gunhee Kim | N/A | N/A |
| PrExMe: Large Scale Prompt Exploration of Open Source LLMs for Machine Translation and Summarization Evaluation | Christoph Leiter, Steffen Eger | N/A | N/A |
| Universal Vulnerabilities in Large Language Models: Backdoor Attacks for In-context Learning | Shuai Zhao, Meihuizi Jia, Anh Tuan Luu, Fengjun Pan, Jinming Wen | N/A | N/A |
| Repairs in a Block World: A New Benchmark for Handling User Corrections with Multi-Modal Language Models | Javier Chiyah-Garcia, Alessandro Suglia, Arash Eshghi | N/A | N/A |
| Beyond the Turn-Based Game: Enabling Real-Time Conversations with Duplex Models | Xinrong Zhang, Yingfa Chen, Shengding Hu, Xu Han, Zihang Xu, Yuanwei Xu, Weilin Zhao, Maosong Sun, Zhiyuan Liu | N/A | N/A |
| Strengthening Structural Inductive Biases by Pre-training to Perform Syntactic Transformations | Matthias Lindemann, Alexander Koller, Ivan Titov | N/A | N/A |
| Puzzle Solving using Reasoning of Large Language Models: A Survey | Panagiotis Giadikiaroglou, Maria Lymperaiou, Giorgos Filandrianos, Giorgos Stamou | N/A | N/A |
| SciEx: Benchmarking Large Language Models on Scientific Exams with Human Expert Grading and Automatic Grading | Tu Anh Dinh, Carlos Mullov, Leonard Bärmann, Zhaolin Li, Danni Liu, Simon Reiß, Jueun Lee, Nathan Lerzer, Jianfeng Gao, Fabian Peller-Konrad, Alexander Waibel, Tamim Asfour, Michael Beigl, Rainer Stiefelhagen, Carsten Dachsbacher, Klemens Böhm, Jan Niehues | N/A | N/A |
| Red Teaming Language Models for Processing Contradictory Dialogues | Xiaofei Wen, Bangzheng Li, Tenghao Huang, Muhao Chen | N/A | N/A |
| Fishing for Magikarp: Automatically Detecting Under-trained Tokens in Large Language Models | Sander Land, Max Bartolo | N/A | N/A |
| Reasoning or a Semblance of it? A Diagnostic Study of Transitive Reasoning in LLMs | Houman Mehrafarin, Arash Eshghi, Ioannis Konstas | N/A | N/A |
| Don’t Underestimate the Octopus - Why The Symbol Grounding Problem Does Not Apply to LLMs | Reto Gubelmann | N/A | N/A |
| Major Entity Identification: A Generalizable Alternative to Coreference Resolution | Kawshik Manikantan, Shubham Toshniwal, Makarand Tapaswi, Vineet Gandhi | N/A | N/A |
| Enhancing High-order Interaction Awareness in LLM-based Recommender Model | Xinfeng Wang, Jin Cui, Fumiyo Fukumoto, Yoshimi Suzuki | N/A | N/A |
| What Are the Odds? Language Models Are Capable of Probabilistic Reasoning | Akshay Paruchuri, Jake Garrison, Shun Liao, John B Hernandez, Jacob Sunshine, Tim Althoff, Xin Liu, Daniel McDuff | N/A | N/A |
| MARE: Multi-Aspect Rationale Extractor on Unsupervised Rationale Extraction | Han Jiang, Junwen Duan, Zhe Qu, Jianxin Wang | N/A | N/A |
| LoRA-Guard: Parameter-Efficient Guardrail Adaptation for Content Moderation of Large Language Models | Hayder Elesedy, Pedro M Esperanca, Silviu Vlad Oprea, Mete Ozay | N/A | N/A |
| “A good pun is its own reword”: Can Large Language Models Understand Puns? | Zhijun Xu, Siyu Yuan, Lingjie Chen, Deqing Yang | N/A | N/A |
| QGEval: Benchmarking Multi-dimensional Evaluation for Question Generation | Weiping Fu, Bifan Wei, Jianxiang Hu, Zhongmin Cai, Jun Liu | N/A | N/A |
| Dependency Graph Parsing as Sequence Labeling | Ana Ezquerro, David Vilares, Carlos Gómez-Rodríguez | N/A | N/A |
| NuNER: Entity Recognition Encoder Pre-training via LLM-Annotated Data | Sergei Bogdanov, Alexandre Constantin, Timothée Bernard, Benoit Crabbé, Etienne P Bernard | N/A | N/A |
| Towards a Greek Proverb Atlas: Computational Spatial Exploration and Attribution of Greek Proverbs | John Pavlopoulos, Panos Louridas, Panagiotis Filos | N/A | N/A |
| Unraveling Babel: Exploring Multilingual Activation Patterns of LLMs and Their Applications | Weize Liu, Yinlong Xu, Hongxia Xu, Jintai Chen, Xuming Hu, Jian Wu | N/A | N/A |
| Advancing Semantic Textual Similarity Modeling: A Regression Framework with Translated ReLU and Smooth K2 Loss | Bowen Zhang, Chunping Li | N/A | N/A |
| Rationalizing Transformer Predictions via End-To-End Differentiable Self-Training | Marc Felix Brinner, Sina Zarrieß | N/A | N/A |
| Segment Any Text: A Universal Approach for Robust, Efficient and Adaptable Sentence Segmentation | Markus Frohmann, Igor Sterner, Ivan Vulić, Benjamin Minixhofer, Markus Schedl | N/A | N/A |
| Applying Contrastive Learning to Code Vulnerability Type Classification | Chen Ji, Su Yang, Hongyu Sun, Yuqing Zhang | N/A | N/A |
| TheoremLlama: Transforming General-Purpose LLMs into Lean4 Experts | Ruida WANG, Jipeng Zhang, Yizhen Jia, Rui Pan, Shizhe Diao, Renjie Pi, Tong Zhang | N/A | N/A |
| Multi-Level Cross-Modal Alignment for Speech Relation Extraction | Liang Zhang, Zhen Yang, Biao Fu, Ziyao Lu, Liangying Shao, Shiyu Liu, Fandong Meng, Jie Zhou, Xiaoli Wang, Jinsong Su | N/A | N/A |
| Self-Training for Sample-Efficient Active Learning for Text Classification with Pre-Trained Language Models | Christopher Schröder, Gerhard Heyer | N/A | N/A |
| PANDA: Persona Attributes Navigation for Detecting and Alleviating Overuse Problem in Large Language Models | Jinsung Kim, Seonmin Koo, Heuiseok Lim | N/A | N/A |
| The Multilingual Alignment Prism: Aligning Global and Local Preferences to Reduce Harm | Aakanksha, Arash Ahmadian, Beyza Ermis, Seraphina Goldfarb-Tarrant, Julia Kreutzer, Marzieh Fadaee, Sara Hooker | N/A | N/A |
| Subword Segmentation in LLMs: Looking at Inflection and Consistency | Marion Di Marco, Alexander Fraser | N/A | N/A |
| Explicit, Implicit, and Scattered: Revisiting Event Extraction to Capture Complex Arguments | Omar Sharif, Joseph Gatto, Madhusudan Basak, Sarah Masud Preum | N/A | N/A |
| Let Me Teach You: Pedagogical Foundations of Feedback for Language Models | Beatriz Borges, Niket Tandon, Tanja Käser, Antoine Bosselut | N/A | N/A |
| Unknown Claims: Generation of Fact-Checking Training Examples from Unstructured and Structured Data | Jean-Flavien Bussotti, Luca Ragazzi, Giacomo Frisoni, Gianluca Moro, Paolo Papotti | N/A | N/A |
| TL-CL: Task And Language Incremental Continual Learning | Shrey Satapara, P. K. Srijith | N/A | N/A |
| Medical Adaptation of Large Language and Vision-Language Models: Are We Making Progress? | Daniel P Jeong, Saurabh Garg, Zachary Chase Lipton, Michael Oberst | N/A | N/A |
| Empowering Multi-step Reasoning across Languages via Program-Aided Language Models | Leonardo Ranaldi, Giulia Pucci | N/A | N/A |
| Do LLMs Overcome Shortcut Learning? An Evaluation of Shortcut Challenges in Large Language Models | Yu Yuan, Lili Zhao, Kai Zhang, Guangting Zheng, Qi Liu | N/A | N/A |
| ControlMath: Controllable Data Generation Promotes Math Generalist Models | Nuo Chen, Ning Wu, Jianhui Chang, Ming Gong, Linjun Shou, Dongmei Zhang, Jia Li | N/A | N/A |
| Where Am I From? Identifying Origin of LLM-generated Content | Liying LI, Yihan Bai, Minhao Cheng | N/A | N/A |
| ReadMe++: Benchmarking Multilingual Language Models for Multi-Domain Readability Assessment | Tarek Naous, Michael J Ryan, Anton Lavrouk, Mohit Chandra, Wei Xu | N/A | N/A |
| GlossLM: A Massively Multilingual Corpus and Pretrained Model for Interlinear Glossed Text | Michael Ginn, Lindia Tjuatja, Taiqi He, Enora Rice, Graham Neubig, Alexis Palmer, Lori Levin | N/A | N/A |
| GDTB: Genre Diverse Data for English Shallow Discourse Parsing across Modalities, Text Types, and Domains | Yang Janet Liu, Tatsuya Aoyama, Wesley Scivetti, Yilun Zhu, Shabnam Behzad, Lauren Elizabeth Levine, Jessica Lin, Devika Tiwari, Amir Zeldes | N/A | N/A |
| RA2FD: Distilling Faithfulness into Efficient Dialogue Systems | Zhiyuan Zhu, Yusheng Liao, Chenxin Xu, Yunfeng Guan, Yanfeng Wang, Yu Wang | N/A | N/A |
| Subjective Topic meets LLMs: Unleashing Comprehensive, Reflective and Creative Thinking through the Negation of Negation | Fangrui Lv, Kaixiong Gong, Jian Liang, Xinyu Pang, Changshui Zhang | N/A | N/A |
| Experimental Contexts Can Facilitate Robust Semantic Property Inference in Language Models, but Inconsistently | Kanishka Misra, Allyson Ettinger, Kyle Mahowald | N/A | N/A |
| Leveraging Estimated Transferability Over Human Intuition for Model Selection in Text Ranking | Jun Bai, Zhuofan Chen, Zhenzi Li, Hanhua Hong, Jianfei Zhang, Chen Li, Chenghua Lin, Wenge Rong | N/A | N/A |
| A Coordinate System for In-Context Learning | Anhao Zhao, Fanghua Ye, Jinlan Fu, Xiaoyu Shen | N/A | N/A |
| Self-Powered LLM Modality Expansion for Large Speech-Text Models | Tengfei Yu, Xuebo Liu, Zhiyi Hou, Liang Ding, Dacheng Tao, Min Zhang | N/A | N/A |
| ABSEval: An Agent-based Framework for Script Evaluation | Sirui Liang, Baoli Zhang, Jun Zhao, Kang Liu | N/A | N/A |
| Latent Concept-based Explanation of NLP Models | Xuemin Yu, Fahim Dalvi, Nadir Durrani, Marzia Nouri, Hassan Sajjad | N/A | N/A |
| Decoding with Limited Teacher Supervision Requires Understanding When to Trust the Teacher | Hyunjong Ok, Jegwang Ryu, Jaeho Lee | N/A | N/A |
| Enhancing Data Quality through Simple De-duplication: Navigating Responsible Computational Social Science Research | Yida Mu, Mali Jin, Xingyi Song, Nikolaos Aletras | N/A | N/A |
| The Mystery of the Pathological Path-star Task for Language Models | Arvid Frydenlund | N/A | N/A |
| Voices in a Crowd: Searching for clusters of unique perspectives | Nikolas Vitsakis, Amit Parekh, Ioannis Konstas | N/A | N/A |
| Neeko: Leveraging Dynamic LoRA for Efficient Multi-Character Role-Playing Agent | Xiaoyan Yu, Tongxu Luo, Yifan Wei, Fangyu Lei, Yiming Huang, Hao Peng, Liehuang Zhu | N/A | N/A |
| SLANG: New Concept Comprehension of Large Language Models | Lingrui Mei, Shenghua Liu, Yiwei Wang, Baolong Bi, Xueqi Cheng | N/A | N/A |
| Towards Interpretable Sequence Continuation: Analyzing Shared Circuits in Large Language Models | Michael Lan, Philip Torr, Fazl Barez | N/A | N/A |
| Why Does New Knowledge Create Messy Ripple Effects in LLMs? | Jiaxin Qin, Zixuan Zhang, Chi Han, Pengfei Yu, Manling Li, Heng Ji | N/A | N/A |
| Lifelong Event Detection via Optimal Transport | Viet Dao, Van-Cuong Pham, Quyen Tran, Thanh-Thien Le, Linh Van Ngo, Thien Huu Nguyen | N/A | N/A |
| SUPER: Evaluating Agents on Setting Up and Executing Tasks from Research Repositories | Ben Bogin, Kejuan Yang, Shashank Gupta, Kyle Richardson, Erin Bransom, Peter Clark, Ashish Sabharwal, Tushar Khot | N/A | N/A |
| FIRST: Teach A Reliable Large Language Model Through Efficient Trustworthy Distillation | KaShun SHUM, Minrui Xu, Jianshu Zhang, Zixin CHEN, Shizhe Diao, Hanze Dong, Jipeng Zhang, Muhammad Omer Raza | N/A | N/A |
| Domain adapted machine translation: What does catastrophic forgetting forget and why? | Danielle Saunders, Steve DeNeefe | N/A | N/A |
| Enhancing AI Assisted Writing with One-Shot Implicit Negative Feedback | Benjamin Towle, Ke Zhou | N/A | N/A |
| Atomic Self-Consistency for Better Long Form Generations | Raghuveer Thirukovalluru, Yukun Huang, Bhuwan Dhingra | N/A | N/A |
| “Global is Good, Local is Bad?”: Understanding Brand Bias in LLMs | Mahammed Kamruzzaman, Hieu Minh Nguyen, Gene Louis Kim | N/A | N/A |
| Optimizing Rare Word Accuracy in Direct Speech Translation with a Retrieval-and-Demonstration Approach | Siqi Li, Danni Liu, Jan Niehues | N/A | N/A |
| ACE: A LLM-based Negotiation Coaching System | Ryan Shea, Aymen Kallala, Xin Lucy Liu, Michael W. Morris, Zhou Yu | N/A | N/A |
| TransferTOD: A Generalizable Chinese Multi-Domain Task-Oriented Dialogue System with Transfer Capabilities | Ming Zhang, Caishuang Huang, Yilong Wu, Shichun Liu, Huiyuan Zheng, Yurui Dong, Yujiong Shen, Shihan Dou, Jun Zhao, Junjie Ye, Qi Zhang, Tao Gui, Xuanjing Huang | N/A | N/A |
| PATIENT-Ψ: Using Large Language Models to Simulate Patients for Training Mental Health Professionals | Ruiyi Wang, Stephanie Milani, Jamie C. Chiu, Jiayin Zhi, Shaun M. Eack, Travis Labrum, Samuel M Murphy, Nev Jones, Kate V Hardy, Hong Shen, Fei Fang, Zhiyu Chen | N/A | N/A |
| DKEC: Domain Knowledge Enhanced Multi-Label Classification for Diagnosis Prediction | Xueren Ge, Abhishek Satpathy, Ronald Dean Williams, John Stankovic, Homa Alemzadeh | N/A | N/A |
| ModSCAN: Measuring Stereotypical Bias in Large Vision-Language Models from Vision and Language Modalities | Yukun Jiang, Zheng Li, Xinyue Shen, Yugeng Liu, Michael Backes, Yang Zhang | N/A | N/A |
| Large Language Models Can Self-Correct with Key Condition Verification | Zhenyu Wu, Qingkai Zeng, Zhihan Zhang, Zhaoxuan Tan, Chao Shen, Meng Jiang | N/A | N/A |
| Learning to Write Rationally: How Information Is Distributed in Non-native Speakers’ Essays | Zixin Tang, Janet van Hell | N/A | N/A |
| Defending Against Social Engineering Attacks in the Age of LLMs | Lin Ai, Tharindu Sandaruwan Kumarage, Amrita Bhattacharjee, Zizhou Liu, Zheng Hui, Michael S. Davinroy, James Cook, Laura Cassani, Kirill Trapeznikov, Matthias Kirchner, Arslan Basharat, Anthony Hoogs, Joshua Garland, Huan Liu, Julia Hirschberg | N/A | N/A |
| Heterogeneous LoRA for Federated Fine-tuning of On-Device Foundation Models | Yae Jee Cho, Luyang Liu, Zheng Xu, Aldi Fahrezi, Gauri Joshi | N/A | N/A |
| Make Some Noise: Unlocking Language Model Parallel Inference Capability through Noisy Training | Yixuan Wang, Xianzhen Luo, Fuxuan Wei, Yijun Liu, Qingfu Zhu, Xuanyu Zhang, Qing Yang, Dongliang Xu, Wanxiang Che | N/A | N/A |
| Target-Aware Language Modeling via Granular Data Sampling | Ernie Chang, Pin-Jie Lin, Yang Li, Changsheng Zhao, Daeil Kim, Rastislav Rabatin, Zechun Liu, Yangyang Shi, Vikas Chandra | N/A | N/A |
| SPEED++: A Multilingual Event Extraction Framework for Epidemic Prediction and Preparedness | Tanmay Parekh, Jeffrey Kwan, Jiarui Yu, Sparsh Johri, Hyosang Ahn, Sreya Muppalla, Kai-Wei Chang, Wei Wang, Nanyun Peng | N/A | N/A |
| Learning from Feedback with Coupled Comprehension and Generation | Mustafa Omer Gul, Yoav Artzi | N/A | N/A |
| UNICORN: A Unified Causal Video-Oriented Language-Modeling Framework for Temporal Video-Language Tasks | Yuanhao Xiong, Yixin Nie, Haotian Liu, Boxin Wang, Jun Chen, Rong Jin, Cho-Jui Hsieh, Lorenzo Torresani, Jie Lei | N/A | N/A |
| Story Morals: Surfacing value-driven narrative schemas using large language models | David G Hobson, Haiqi Zhou, Derek Ruths, Andrew Piper | N/A | N/A |
| OATH-Frames: Characterizing Online Attitudes Towards Homelessness with LLM Assistants | Jaspreet Ranjit, Brihi Joshi, Rebecca Dorn, Laura Petry, Olga Koumoundouros, Jayne Bottarini, Peichen Liu, Eric Rice, Swabha Swayamdipta | N/A | N/A |
| AnaloBench: Benchmarking the Identification of Abstract and Long-context Analogies | Xiao Ye, Andrew Wang, Jacob Choi, Yining Lu, Shreya Sharma, Lingfeng Shen, Vijay Murari Tiyyala, Nicholas Andrews, Daniel Khashabi | N/A | N/A |
| SciER: An Entity and Relation Extraction Dataset for Datasets, Methods, and Tasks in Scientific Documents | Qi Zhang, Zhijia Chen, Huitong Pan, Cornelia Caragea, Longin Jan Latecki, Eduard Dragut | N/A | N/A |
| Analysis of Plan-based Retrieval for Grounded Text Generation | Ameya Godbole, Nicholas Monath, Seungyeon Kim, Ankit Singh Rawat, Andrew McCallum, Manzil Zaheer | N/A | N/A |
| Detecting Errors through Ensembling Prompts (DEEP): An End-to-End LLM Framework for Detecting Factual Errors | Alex Chandler, Devesh Surve, Hui Su | N/A | N/A |
| RLHF Can Speak Many Languages: Unlocking Multilingual Preference Optimization for LLMs | John Dang, Arash Ahmadian, Kelly Marchisio, Julia Kreutzer, Ahmet Üstün, Sara Hooker | N/A | N/A |
| Improving Logical Fallacy Reasoning with Logical Structure Tree | Yuanyuan Lei, Ruihong Huang | N/A | N/A |
| Chain and Causal Attention for Efficient Entity Tracking | Erwan Fagnou, Paul Caillon, Blaise Delattre, Alexandre Allauzen | N/A | N/A |
| BEEAR: Embedding-based Adversarial Removal of Safety Backdoors in Instruction-tuned Language Models | Yi Zeng, Weiyu Sun, Tran Ngoc Huynh, Dawn Song, Bo Li, Ruoxi Jia | N/A | N/A |
| A Bayesian Approach to Harnessing the Power of LLMs in Authorship Attribution | Zhengmian Hu, Tong Zheng, Heng Huang | N/A | N/A |
| FAC²E: Better Understanding Large Language Model Capabilities by Dissociating Language and Cognition | Xiaoqiang Wang, Lingfei Wu, Tengfei Ma, Bang Liu | N/A | N/A |
| OpenSep: Leveraging Large Language Models with Textual Inversion for Open World Audio Separation | Tanvir Mahmud, Diana Marculescu | N/A | N/A |
| Language Concept Erasure for Language-invariant Dense Retrieval | Zhiqi Huang, Puxuan Yu, Shauli Ravfogel, James Allan | N/A | N/A |
| Learning Personalized Alignment for Evaluating Open-ended Text Generation | Danqing Wang, Kevin Yang, Hanlin Zhu, Xiaomeng Yang, Andrew Cohen, Lei Li, Yuandong Tian | N/A | N/A |
| Large Language Models Are Involuntary Truth-Tellers: Exploiting Fallacy Failure for Jailbreak Attacks | Yue Zhou, Henry Peng Zou, Barbara Di Eugenio, Yang Zhang | N/A | N/A |
| Turn Waste into Worth: Rectifying Top-k Router of MoE | Zhiyuan Zeng, Qipeng Guo, Zhaoye Fei, Zhangyue Yin, Yunhua Zhou, Linyang Li, Tianxiang Sun, Hang Yan, Dahua Lin, Xipeng Qiu | N/A | N/A |
| Null-Shot Prompting: Rethinking Prompting Large Language Models With Hallucination | Pittawat Taveekitworachai, Febri Abdullah, Ruck Thawonmas | N/A | N/A |
| CommVQA: Situating Visual Question Answering in Communicative Contexts | Nandita Shankar Naik, Christopher Potts, Elisa Kreiss | N/A | N/A |
| Ouroboros: Generating Longer Drafts Phrase by Phrase for Faster Speculative Decoding | Weilin Zhao, Yuxiang Huang, Xu Han, Wang Xu, Chaojun Xiao, Xinrong Zhang, Yewei Fang, Kaihuo Zhang, Zhiyuan Liu, Maosong Sun | N/A | N/A |
| 1+1>2: Can Large Language Models Serve as Cross-Lingual Knowledge Aggregators? | Yue Huang, Chenrui Fan, Yuan Li, Siyuan Wu, Tianyi Zhou, Xiangliang Zhang, Lichao Sun | N/A | N/A |
| How to Leverage Demonstration Data in Alignment for Large Language Model? A Self-Imitation Learning Perspective | Teng Xiao, Mingxiao Li, Yige Yuan, Huaisheng Zhu, Chao Cui, Vasant G Honavar | N/A | N/A |
| Style-Specific Neurons for Steering LLMs in Text Style Transfer | Wen Lai, Viktor Hangya, Alexander Fraser | N/A | N/A |
| Adaptive Query Rewriting: Aligning Rewriters through Marginal Probability of Conversational Answers | Tianhua Zhang, Kun LI, Hongyin Luo, Xixin Wu, James R. Glass, Helen M. Meng | N/A | N/A |
| Grasping the Essentials: Tailoring Large Language Models for Zero-Shot Relation Extraction | Sizhe Zhou, Yu Meng, Bowen Jin, Jiawei Han | N/A | N/A |
| DA-Code: Agent Data Science Code Generation Benchmark for Large Language Models | Yiming Huang, Jianwen Luo, Yan Yu, Yitong Zhang, Fangyu Lei, Yifan Wei, Shizhu He, Lifu Huang, Xiao Liu, Jun Zhao, Kang Liu | N/A | N/A |
| Leveraging Context-aware Prompting for Commit Message Generation | Zhihua Jiang, Jianwei Chen, Dongning Rao, Guanghui Ye | N/A | N/A |
| Linguistic Bias in ChatGPT: Language Models Reinforce Dialect Discrimination | Eve Fleisig, Genevieve Smith, Madeline Bossi, Ishita Rustagi, Xavier Yin, Dan Klein | N/A | N/A |
| Lifelong Knowledge Editing for LLMs with Retrieval-Augmented Continuous Prompt Learning | Qizhou Chen, Taolin Zhang, Xiaofeng He, Dongyang Li, Chengyu Wang, Longtao Huang, Hui Xue’ | N/A | N/A |
| A Learning Rate Path Switching Training Paradigm for Version Updates of Large Language Models | Zhihao Wang, Shiyu Liu, Jianheng Huang, Wang Zheng, YiXuan Liao, Xiaoxin Chen, Junfeng Yao, Jinsong Su | N/A | N/A |
| Zero-Shot Cross-Lingual NER Using Phonemic Representations for Low-Resource Languages | Jimin Sohn, Haeji Jung, Alex Cheng, Jooeon Kang, Yilin Du, David R Mortensen | N/A | N/A |
| An Analysis and Mitigation of the Reversal Curse | Ang Lv, Kaiyi Zhang, Shufang Xie, Quan Tu, Yuhan Chen, Ji-Rong Wen, Rui Yan | N/A | N/A |
| Exploring the Practicality of Generative Retrieval on Dynamic Corpora | Soyoung Yoon, Chaeeun Kim, Hyunji Lee, Joel Jang, Sohee Yang, Minjoon Seo | N/A | N/A |
| OneNet: A Fine-Tuning Free Framework for Few-Shot Entity Linking via Large Language Model Prompting | Xukai Liu, Ye Liu, Kai Zhang, Kehang Wang, Qi Liu, Enhong Chen | N/A | N/A |
| Gotcha! Don’t trick me with unanswerable questions! Self-aligning Large Language Models for Proactively Responding to Unknown Questions | Yang Deng, Yong Zhao, Moxin Li, See-Kiong Ng, Tat-Seng Chua | N/A | N/A |
| Fewer is More: Boosting Math Reasoning with Reinforced Context Pruning | Xijie Huang, Li Lyna Zhang, Kwang-Ting Cheng, Fan Yang, Mao Yang | N/A | N/A |
| Large Language Models in the Clinic: A Comprehensive Benchmark | Fenglin Liu, Zheng Li, Qingyu Yin, Jingfeng Yang, Xianfeng Tang, Chen Luo, Ming Zeng, Haoming Jiang, Yifan Gao, Priyanka Nigam, Sreyashi Nag, Hongjian Zhou, Yining Hua, Xuan Zhou, Omid Rohanian, Anshul Thakur, Lei Clifton, Bing Yin, David A. Clifton | N/A | N/A |
| Holistic Automated Red Teaming for Large Language Models through Top-Down Test Case Generation and Multi-turn Interaction | Jinchuan Zhang, Yan Zhou, Yaxin Liu, Ziming Li, Songlin Hu | N/A | N/A |
| Householder Pseudo-Rotation: A Novel Approach to Activation Editing in LLMs with Direction-Magnitude Perspective | Van-Cuong Pham, Thien Huu Nguyen | N/A | N/A |
| DynamicER: Resolving Emerging Mentions to Dynamic Entities for RAG | Jinyoung Kim, Dayoon Ko, Gunhee Kim | N/A | N/A |
| Preserving Generalization of Language models in Few-shot Continual Relation Extraction | Quyen Tran, Nguyen Xuan Thanh, Nguyen Hoang Anh, Nam Le Hai, Trung Le, Linh Van Ngo, Thien Huu Nguyen | N/A | N/A |
| A Systematic Survey and Critical Review on Evaluating Large Language Models: Challenges, Limitations, and Recommendations | Md Tahmid Rahman Laskar, Sawsan Alqahtani, M Saiful Bari, Mizanur Rahman, Mohammad Abdullah Matin Khan, Haidar Khan, Israt Jahan, Amran Bhuiyan, Chee Wei Tan, Md Rizwan Parvez, Enamul Hoque, Shafiq Joty, Jimmy Huang | N/A | N/A |
| Consecutive Batch Model Editing with HooK Layers | Shuaiyi Li, Yang Deng, Deng Cai, Hongyuan Lu, Liang CHEN, Wai Lam | N/A | N/A |
| Topic-Oriented Open Relation Extraction with A Priori Seed Generation | Linyi Ding, Jinfeng Xiao, Sizhe Zhou, Chaoqi Yang, Jiawei Han | N/A | N/A |
| Related Work and Citation Text Generation: A Survey | Xiangci Li, Jessica Ouyang | N/A | N/A |
| Curriculum Consistency Learning for Conditional Sentence Generation | Liangxin Liu, Xuebo Liu, Lian Lian, Shengjun Cheng, Jun Rao, Tengfei Yu, Hexuan Deng, Min Zhang | N/A | N/A |
| A Systematic Analysis of Large Language Models as Soft Reasoners: The Case of Syllogistic Inferences | Leonardo Bertolazzi, Albert Gatt, Raffaella Bernardi | N/A | N/A |
| Pre-training Cross-lingual Open Domain Question Answering with Large-scale Synthetic Supervision | Fan Jiang, Tom Drummond, Trevor Cohn | N/A | N/A |
| Towards an Open-Source Speech Foundation Model for EU: 950,000 Hours of Open-Source Compliant Speech Data for EU Languages | Marco Gaido, Sara Papi, Luisa Bentivogli, Alessio Brutti, Mauro Cettolo, Roberto Gretter, Marco Matassoni, Mohamed Nabih, Matteo Negri | N/A | N/A |
| Improving Knowledge Graph Completion with Structure-Aware Supervised Contrastive Learning | Jiashi Lin, Lifang Wang, Xinyu Lu, Zhongtian Hu, Wei Zhang, Wenxuan Lu | N/A | N/A |
| Contribution of Linguistic Typology to Universal Dependency Parsing: An Empirical Investigation | Ali Basirat, Navid Baradaran Hemmati | N/A | N/A |
| TRoTR: A Framework for Evaluating the Re-contextualization of Text Reuse | Francesco Periti, Pierluigi Cassotti, Stefano Montanelli, Nina Tahmasebi, Dominik Schlechtweg | N/A | N/A |
| Structured Optimal Brain Pruning for Large Language Models | Jiateng Wei, Quan Lu, Ning Jiang, Siqi Li, Jingyang Xiang, Jun Chen, Yong Liu | N/A | N/A |
| Automatically Generated Definitions and their utility for Modeling Word Meaning | Francesco Periti, David Alfter, Nina Tahmasebi | N/A | N/A |
| How Do Your Code LLMs perform? Empowering Code Instruction Tuning with Really Good Data | Yejie Wang, Keqing He, Dayuan Fu, Zhuoma GongQue, Heyang Xu, Yanxu Chen, Zhexu Wang, Yujia Fu, Guanting Dong, Muxi Diao, Jingang Wang, Mengdi Zhang, Xunliang Cai, Weiran Xu | N/A | N/A |
| MINT: A Benchmark for Evaluating Instructed Information Retrieval | Weiwei Sun, Zhengliang Shi, Wu Jiu Long, Lingyong Yan, Xinyu Ma, Yiding Liu, Min Cao, Dawei Yin, Zhaochun Ren | N/A | N/A |
| Rethinking the Evaluation of In-Context Learning for LLMs | Guoxin Yu, Lemao Liu, Mo Yu, Yue Yu, Xiang Ao | N/A | N/A |
| Cluster-Norm for Unsupervised Probing of Knowledge | Walter Laurito, Sharan Maiya, Grégoire DHIMOÏLA, Owen Ho Wan Yeung, Kaarel Hänni | N/A | N/A |
| Hopping Too Late: Exploring the Limitations of Large Language Models on Multi-Hop Queries | Eden Biran, Daniela Gottesman, Sohee Yang, Mor Geva, Amir Globerson | N/A | N/A |
| Enhancing Training Data Attribution for Large Language Models with Fitting Error Consideration | Kangxi Wu, Liang Pang, Huawei Shen, Xueqi Cheng | N/A | N/A |
| Where am I? Large Language Models Wandering between Semantics and Structures in Long Contexts | Seonmin Koo, Jinsung Kim, YoungJoon Jang, Chanjun Park, Heuiseok Lim | N/A | N/A |
| KARL: Knowledge-Aware Retrieval and Representations aid Retention and Learning in Students | Matthew Shu, Nishant Balepur, Shi Feng, Jordan Lee Boyd-Graber | N/A | N/A |
| Large Language Models Can Be Contextual Privacy Protection Learners | Yijia Xiao, Yiqiao Jin, Yushi Bai, Yue Wu, Xianjun Yang, Xiao Luo, Wenchao Yu, Xujiang Zhao, Yanchi Liu, Quanquan Gu, Haifeng Chen, Wei Wang, Wei Cheng | N/A | N/A |
| A SMART Mnemonic Sounds like “Glue Tonic”: Mixing LLMs with Student Feedback to Make Mnemonic Learning Stick | Nishant Balepur, Matthew Shu, Alexander Hoyle, Alison Robey, Shi Feng, Seraphina Goldfarb-Tarrant, Jordan Lee Boyd-Graber | N/A | N/A |
| Mixture-of-Skills: Learning to Optimize Data Usage for Fine-Tuning Large Language Models | Minghao Wu, Thuy-Trang Vu, Lizhen Qu, Reza Haf | N/A | N/A |
| MolTRES: Improving Chemical Language Representation Learning for Molecular Property Prediction | Jun-Hyung Park, Yeachan Kim, Mingyu Lee, Hyuntae Park, SangKeun Lee | N/A | N/A |
| First Heuristic Then Rational: Dynamic Use of Heuristics in Language Model Reasoning | Yoichi Aoki, Keito Kudo, Tatsuki Kuribayashi, Shusaku Sone, Masaya Taniguchi, Keisuke Sakaguchi, Kentaro Inui | N/A | N/A |
| Tools Fail: Detecting Silent Errors in Faulty Tools | Jimin Sun, So Yeon Min, Yingshan Chang, Yonatan Bisk | N/A | N/A |
| Pcc-tuning: Breaking the Contrastive Learning Ceiling in Semantic Textual Similarity | Bowen Zhang, Chunping Li | N/A | N/A |
| Cross-lingual Back-Parsing: Utterance Synthesis from Meaning Representation for Zero-Resource Semantic Parsing | Deokhyung Kang, Seonjeong Hwang, Yunsu Kim, Gary Lee | N/A | N/A |
| Shaking Up VLMs: Comparing Transformers and Structured State Space Models for Vision & Language Modeling | Georgios Pantazopoulos, Malvina Nikandrou, Alessandro Suglia, Oliver Lemon, Arash Eshghi | N/A | N/A |
| Are LLMs Good Zero-Shot Fallacy Classifiers? | Fengjun Pan, Xiaobao Wu, Zongrui Li, Anh Tuan Luu | N/A | N/A |
| The Mystery of In-Context Learning: A Comprehensive Survey on Interpretation and Analysis | Yuxiang Zhou, Jiazheng Li, Yanzheng Xiang, Hanqi Yan, Lin Gui, Yulan He | N/A | N/A |
| More DWUGs: Extending and Evaluating Word Usage Graph Datasets in Multiple Languages | Dominik Schlechtweg, Pierluigi Cassotti, Bill Noble, David Alfter, Sabine Schulte im Walde, Nina Tahmasebi | N/A | N/A |
| Vision-Language Model Fine-Tuning via Simple Parameter-Efficient Modification | Ming Li, Jike Zhong, Chenxin Li, Liuzhuozheng Li, Nie Lin, Masashi Sugiyama | N/A | N/A |
| ECIS-VQG: Generation of Entity-centric Information-seeking Questions from Videos | Arpan Phukan, Manish Gupta, Asif Ekbal | N/A | N/A |
| Distractor Generation in Multiple-Choice Tasks: A Survey of Methods, Datasets, and Evaluation | Elaf Alhazmi, Quan Z. Sheng, Wei Emma Zhang, Munazza Zaib, Ahoud Alhazmi | N/A | N/A |
| Evaluating n-Gram Novelty of Language Models Using Rusty-DAWG | William Merrill, Noah A. Smith, Yanai Elazar | N/A | N/A |
| ASL STEMpedia: Dataset and Benchmark for Interpreting STEM Articles | Kayo Yin, Chinmay Singh, Fyodor O Minakov, Vanessa Milan, Hal Daumé III, Cyril Zhang, Alex Xijie Lu, Danielle Bragg | N/A | N/A |
| Can Automatic Metrics Assess High-Quality Translations? | Sweta Agrawal, António Farinhas, Ricardo Rei, Andre Martins | N/A | N/A |
| Modeling User Preferences with Automatic Metrics: Creating a High-Quality Preference Dataset for Machine Translation | Sweta Agrawal, José G. C. de Souza, Ricardo Rei, António Farinhas, Gonçalo Faria, Patrick Fernandes, Nuno M Guerreiro, Andre Martins | N/A | N/A |
| DC-Instruct: An Effective Framework for Generative Multi-intent Spoken Language Understanding | Bowen Xing, Lizi Liao, Minlie Huang, Ivor Tsang | N/A | N/A |
| KnowTuning: Knowledge-aware Fine-tuning for Large Language Models | Yougang Lyu, Lingyong Yan, Shuaiqiang Wang, Haibo Shi, Dawei Yin, Pengjie Ren, Zhumin Chen, Maarten de Rijke, Zhaochun Ren | N/A | N/A |
| SecCoder: Towards Generalizable and Robust Secure Code Generation | Boyu Zhang, Tianyu Du, Junkai Tong, Xuhong Zhang, Kingsum Chow, Sheng Cheng, Xun Wang, Jianwei Yin | N/A | N/A |
| Nash CoT: Multi-Path Inference with Preference Equilibrium | Ziqi Zhang, Cunxiang Wang, Xiao Xiong, Yue Zhang, Donglin Wang | N/A | N/A |
| Scalable Efficient Training of Large Language Models with Low-dimensional Projected Attention | Xingtai Lv, Ning Ding, Kaiyan Zhang, Ermo Hua, Ganqu Cui, Bowen Zhou | N/A | N/A |
| Small Agent Can Also Rock! Empowering Small Language Models as Hallucination Detector | Xiaoxue Cheng, Junyi Li, Xin Zhao, Hongzhi Zhang, Fuzheng Zhang, Di ZHANG, Kun Gai, Ji-Rong Wen | N/A | N/A |
| Interpretable Composition Attribution Enhancement for Visio-linguistic Compositional Understanding | Wei Li, Zhen Huang, Xinmei Tian, Le Lu, Houqiang Li, Xu Shen, Jieping Ye | N/A | N/A |
| LLM Task Interference: An Initial Study on the Impact of Task-Switch in Conversational History | Akash Gupta, Ivaxi Sheth, Vyas Raina, Mark Gales, Mario Fritz | N/A | N/A |
| Social Bias Probing: Fairness Benchmarking for Language Models | Marta Marchiori Manerba, Karolina Stanczak, Riccardo Guidotti, Isabelle Augenstein | N/A | N/A |
| Chain-of-Note: Enhancing Robustness in Retrieval-Augmented Language Models | Wenhao Yu, Hongming Zhang, Xiaoman Pan, Peixin Cao, Kaixin Ma, Jian Li, Hongwei Wang, Dong Yu | N/A | N/A |
| DynaThink: Fast or Slow? A Dynamic Decision-Making Framework for Large Language Models | Jiabao Pan, Yan Zhang, Chen Zhang, Zuozhu Liu, Hongwei Wang, Haizhou Li | N/A | N/A |
| Revisiting Automated Evaluation for Long-form Table Question Answering in the Era of Large Language Models | Yuqi Wang, Lyuhao Chen, Yilun Zhao | N/A | N/A |
| Weak Reward Model Transforms Generative Models into Robust Causal Event Extraction Systems | Italo Luis da Silva, Hanqi Yan, Lin Gui, Yulan He | N/A | N/A |
| Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning | Zhihan Zhang, Tao Ge, Zhenwen Liang, Wenhao Yu, Dian Yu, Mengzhao Jia, Dong Yu, Meng Jiang | N/A | N/A |
| FinDVer: Explainable Claim Verification over Long and Hybrid-content Financial Documents | Yilun Zhao, Yitao Long, Tintin Jiang, Weiyuan Chen, Chengye Wang, Hongjun Liu, Xiangru Tang, Yiming Zhang, Chen Zhao, Arman Cohan | N/A | N/A |
| Extracting Prompts by Inverting LLM Outputs | Collin Zhang, John Xavier Morris, Vitaly Shmatikov | N/A | N/A |
| BiasAlert: A Plug-and-play Tool for Social Bias Detection in LLMs | Zhiting Fan, Ruizhe Chen, Ruiling Xu, Zuozhu Liu | N/A | N/A |
| VHASR: A Multimodal Speech Recognition System With Vision Hotwords | Jiliang Hu, Zuchao Li, Ping Wang, Haojun Ai, Lefei Zhang, Hai Zhao | N/A | N/A |
| A Fundamental Trade-off in Aligned Language Models and its Relation to Sampling Adaptors | Naaman Tan, Josef Valvoda, Tianyu Liu, Anej Svete, Yanxia Qin, Min-Yen Kan, Ryan Cotterell | N/A | N/A |
| Bridging Local Details and Global Context in Text-Attributed Graphs | Yaoke Wang, Yun Zhu, Wenqiao Zhang, Yueting Zhuang, liyunfei, Siliang Tang | N/A | N/A |
| Building Resources for Emakhuwa: Machine Translation and News Classification Benchmarks | Felermino D. M. A. Ali, Henrique Lopes Cardoso, Rui Sousa-Silva | N/A | N/A |
| RepMatch: Quantifying Cross-Instance Similarities in Representation Space | Mohammad Reza Modarres, Sina Abbasi, Mohammad Taher Pilehvar | N/A | N/A |
| Commonsense Knowledge Editing Based on Free-Text in LLMs | Xiusheng Huang, Yequan Wang, Jun Zhao, Kang Liu | N/A | N/A |
| A Closer Look at Multidimensional Online Political Incivility | Sagi Pendzel, Nir Lotan, Alon Zoizner, Einat Minkov | N/A | N/A |
| Leveraging BERT and TFIDF Features for Short Text Clustering via Alignment-Promoting Co-Training | Zetong Li, Qinliang Su, Shijing Si, Jianxing Yu | N/A | N/A |
| Applying Intrinsic Debiasing on Downstream Tasks: Challenges and Considerations for Machine Translation | Bar Iluz, Yanai Elazar, Asaf Yehudai, Gabriel Stanovsky | N/A | N/A |
| Unsupervised Named Entity Disambiguation for Low Resource Domains | Debarghya Datta, Soumajit Pramanik | N/A | N/A |
| SparseGrad: A Selective Method for Efficient Fine-tuning of MLP Layers | Viktoriia A. Chekalina, Anna Rudenko, Gleb Mezentsev, Aleksandr Mikhalev, Alexander Panchenko, Ivan Oseledets | N/A | N/A |
| MoCoKGC: Momentum Contrast Entity Encoding for Knowledge Graph Completion | Qingyang Li, Yanru Zhong, Yuchu Qin | N/A | N/A |
| ActPlan-1K: Benchmarking the Procedural Planning Ability of Visual Language Models in Household Activities | Ying Su, Zhan Ling, Haochen Shi, Cheng Jiayang, Yauwai Yim, Yangqiu Song | N/A | N/A |
| Shortcuts Arising from Contrast: Towards Effective and Lightweight Clean-Label Attacks in Prompt-Based Learning | Xiaopeng Xie, Ming YAN, Xiwen Zhou, Chenlong Zhao, Suli Wang, Yong Zhang, Joey Tianyi Zhou | N/A | N/A |
| GRASS: Compute Efficient Low-Memory LLM Training with Structured Sparse Gradients | Aashiq Muhamed, Oscar Li, David Woodruff, Mona T. Diab, Virginia Smith | N/A | N/A |
| RaTEScore: A Metric for Entity-Aware Radiology Text Similarity | Weike Zhao, Chaoyi Wu, Xiaoman Zhang, Ya Zhang, Weidi Xie | N/A | N/A |
| HalluMeasure: Fine-grained Hallucination Measurement Using Chain-of-Thought Reasoning | Shayan Ali Akbar, Md Mosharaf Hossain, Tess Wood, Si-Chi Chin, Victor Alvarez, Erica M Salinas, Erwin Cornejo | N/A | N/A |
| Learning to Rank Salient Content for Query-focused Summarization | Sajad Sotudeh, Nazli Goharian | N/A | N/A |
| Are Large Language Models Good Classifiers? A Study on Edit Intent Classification in Scientific Document Revisions | Qian Ruan, Ilia Kuznetsov, Iryna Gurevych | N/A | N/A |
| LitSearch: A Retrieval Benchmark for Scientific Literature Search | Anirudh Ajith, Mengzhou Xia, Alexis Chevalier, Tanya Goyal, Danqi Chen, Tianyu Gao | N/A | N/A |
| Open-world Multi-label Text Classification with Extremely Weak Supervision | Xintong Li, Jinya Jiang, Ria Dharmani, Jayanth Srinivasa, Gaowen Liu, Jingbo Shang | N/A | N/A |
| LMs learn governing principles of dynamical systems, revealing an in-context neural scaling law | Toni J.B. Liu, Nicolas Boulle, Raphaël Sarfati, Christopher Earls | N/A | N/A |
| AKEW: Assessing Knowledge Editing in the Wild | Xiaobao Wu, Liangming Pan, William Yang Wang, Anh Tuan Luu | N/A | N/A |
| CopyBench: Measuring Literal and Non-Literal Reproduction of Copyright-Protected Text in Language Model Generation | Tong Chen, Akari Asai, Niloofar Mireshghallah, Sewon Min, James Grimmelmann, Yejin Choi, Hannaneh Hajishirzi, Luke Zettlemoyer, Pang Wei Koh | N/A | N/A |
| Dense X Retrieval: What Retrieval Granularity Should We Use? | Tong Chen, Hongwei Wang, Sihao Chen, Wenhao Yu, Kaixin Ma, Xinran Zhao, Hongming Zhang, Dong Yu | N/A | N/A |
| Decoding Susceptibility: Modeling Misbelief to Misinformation Through a Computational Approach | Yanchen Liu, Mingyu Derek Ma, Wenna Qin, Azure Zhou, Jiaao Chen, Weiyan Shi, Wei Wang, Diyi Yang | N/A | N/A |
| Layer by Layer: Uncovering Where Multi-Task Learning Happens in Instruction-Tuned Large Language Models | Zheng Zhao, Yftah Ziser, Shay B Cohen | N/A | N/A |
| XDetox: Text Detoxification with Token-Level Toxicity Explanations | Beomseok Lee, Hyunwoo Kim, Keon Kim, Yong Suk Choi | N/A | N/A |
| Optimizing Chinese Lexical Simplification Across Word Types: A Hybrid Approach | ZiHao Xiao, Jiefu Gong, Shijin Wang, Wei Song | N/A | N/A |
| Evaluating LLMs’ Capability in Satisfying Lexical Constraints | Bingxuan Li, Yiwei Wang, Tao Meng, Nanyun Peng, Kai-Wei Chang | N/A | N/A |
| Joint Pre-Encoding Representation and Structure Embedding for Efficient and Low-Resource Knowledge Graph Completion | Chenyu Qiu, Pengjiang Qian, Chuang Wang, Jian Yao, Li Liu, Fang wei, Eddie Y.K. Eddie | N/A | N/A |
| Improving Discriminative Capability of Reward Models in RLHF Using Contrastive Learning | Lu Chen, Rui Zheng, Binghai Wang, Senjie Jin, Caishuang Huang, Junjie Ye, Zhihao Zhang, Yuhao Zhou, Zhiheng Xi, Tao Gui, Qi Zhang, Xuanjing Huang | N/A | N/A |
| RoCEL: Advancing Table Entity Linking through Distinctive Row and Column Contexts | Yuanzheng Wang, Yixing Fan, Jiafeng Guo, Ruqing Zhang, Xueqi Cheng | N/A | N/A |
| Exploring the Role of Reasoning Structures for Constructing Proofs in Multi-Step Natural Language Reasoning with Large Language Models | Zi’ou Zheng, Christopher Malon, Martin Renqiang Min, Xiaodan Zhu | N/A | N/A |
| Efficient Overshadowed Entity Disambiguation by Mitigating Shortcut Learning | Panuthep Tasawong, Peerat Limkonchotiwat, Potsawee Manakul, Can Udomcharoenchaikit, Ekapol Chuangsuwanich, Sarana Nutanong | N/A | N/A |
| MetaBench: Planning of Multiple APIs from Various APPs for Complex User Instruction | Hongru WANG, Rui Wang, Boyang XUE, Heming Xia, Jingtao Cao, Zeming Liu, Jeff Z. Pan, Kam-Fai Wong | N/A | N/A |
| Not Everything is All You Need: Toward Low-Redundant Optimization for Large Language Model Alignment | Zhipeng Chen, Kun Zhou, Xin Zhao, Jingyuan Wang, Ji-Rong Wen | N/A | N/A |
| AudioVSR: Enhancing Video Speech Recognition with Audio Data | Xiaoda Yang, Xize Cheng, Jiaqi Duan, Hongshun Qiu, Minjie Hong, Minghui Fang, Shengpeng Ji, Jialong Zuo, Zhiqing Hong, Zhimeng Zhang, Tao Jin | N/A | N/A |
| ECCO: Can We Improve Model-Generated Code Efficiency Without Sacrificing Functional Correctness? | Siddhant Waghjale, Vishruth Veerendranath, Zhiruo Wang, Daniel Fried | N/A | N/A |
| Ladder: A Model-Agnostic Framework Boosting LLM-based Machine Translation to the Next Level | Zhaopeng Feng, Ruizhe Chen, Yan Zhang, Zijie Meng, Zuozhu Liu | N/A | N/A |
| Re-ReST: Reflection-Reinforced Self-Training for Language Agents | Zi-Yi Dou, Cheng-Fu Yang, Xueqing Wu, Kai-Wei Chang, Nanyun Peng | N/A | N/A |
| Effective Synthetic Data and Test-Time Adaptation for OCR Correction | Shuhao Guan, Cheng Xu, Moule Lin, Derek Greene | N/A | N/A |
| SRF: Enhancing Document-Level Relation Extraction with a Novel Secondary Reasoning Framework | Fu Zhang, Qi Miao, Jingwei Cheng, Hongsen Yu, Yi Yan, Xin Li, Yongxue Wu | N/A | N/A |
| FineCops-Ref: A new Dataset and Task for Fine-Grained Compositional Referring Expression Comprehension | Junzhuo Liu, Xuzheng Yang, WEIWEI LI, Peng Wang | N/A | N/A |
| Exploring the Learning Capabilities of Language Models using LEVERWORLDS | Eitan Wagner, Amir Feder, Omri Abend | N/A | N/A |
| CONTESTS: a Framework for Consistency Testing of Span Probabilities in Language Models | Eitan Wagner, Yuli Slavutsky, Omri Abend | N/A | N/A |
| DocEditAgent: Document Structure Editing Via Multimodal LLM Grounding | Manan Suri, Puneet Mathur, Franck Dernoncourt, Rajiv Jain, Vlad I Morariu, Ramit Sawhney, Preslav Nakov, Dinesh Manocha | N/A | N/A |
| DogeRM: Equipping Reward Models with Domain Knowledge through Model Merging | Tzu-Han Lin, Chen-An Li, Hung-yi Lee, Yun-Nung Chen | N/A | N/A |
| Understanding Slang with LLMs: Modelling Cross-Cultural Nuances through Paraphrasing | Ifeoluwa Wuraola, Nina Dethlefs, Daniel Marciniak | N/A | N/A |
| Unlocking Anticipatory Text Generation: A Constrained Approach for Large Language Models Decoding | Lifu Tu, Semih Yavuz, Jin Qu, Jiacheng Xu, Rui Meng, Caiming Xiong, Yingbo Zhou | N/A | N/A |
| Re-Reading Improves Reasoning in Large Language Models | Xiaohan Xu, Chongyang Tao, Tao Shen, Can Xu, Hongbo Xu, Guodong Long, Jian-Guang Lou, Shuai Ma | N/A | N/A |
| Adaptive Axes: A Pipeline for In-domain Social Stereotype Analysis | Qingcheng Zeng, Mingyu Jin, Rob Voigt | N/A | N/A |
| ERVQA: A Dataset to Benchmark the Readiness of Large Vision Language Models in Hospital Environments | Sourjyadip Ray, Kushal Gupta, Soumi Kundu, Dr Payal Arvind Kasat, Somak Aditya, Pawan Goyal | N/A | N/A |
| Human-LLM Hybrid Text Answer Aggregation for Crowd Annotations | Jiyi Li | N/A | N/A |
| Improve Student’s Reasoning Generalizability through Cascading Decomposed CoTs Distillation | Chengwei Dai, Kun Li, Wei Zhou, Songlin Hu | N/A | N/A |
| Revisiting Supervised Contrastive Learning for Microblog Classification | Junbo Huang, Ricardo Usbeck | N/A | N/A |
| BaitAttack: Alleviating Intention Shift in Jailbreak Attacks via Adaptive Bait Crafting | Rui Pu, Chaozhuo Li, Rui Ha, Litian Zhang, Lirong Qiu, Xi Zhang | N/A | N/A |
| Images Speak Louder than Words: Understanding and Mitigating Bias in Vision-Language Model from a Causal Mediation Perspective | Zhaotian Weng, Zijun Gao, Jerone Andrews, Jieyu Zhao | N/A | N/A |
| Mitigating the Language Mismatch and Repetition Issues in LLM-based Machine Translation via Model Editing | Weichuan Wang, Zhaoyi Li, Defu Lian, Chen Ma, Linqi Song, Ying Wei | N/A | N/A |
| SciAgent: Tool-augmented Language Models for Scientific Reasoning | Yubo Ma, Zhibin Gou, Junheng Hao, Ruochen Xu, Shuohang Wang, Liangming Pan, Yujiu Yang, Yixin Cao, Aixin Sun | N/A | N/A |
| Global Reward to Local Rewards: Multimodal-Guided Decomposition for Improving Dialogue Agents | Dong Won Lee, Hae Won Park, Yoon Kim, Cynthia Breazeal, Louis-Philippe Morency | N/A | N/A |
| Towards Measuring and Modeling “Culture” in LLMs: A Survey | Muhammad Farid Adilazuarda, Sagnik Mukherjee, Pradhyumna Lavania, Siddhant Shivdutt Singh, Alham Fikri Aji, Jacki O’Neill, Ashutosh Modi, Monojit Choudhury | N/A | N/A |
| ESC-Eval: Evaluating Emotion Support Conversations in Large Language Models | Haiquan Zhao, Lingyu Li, Shisong Chen, Shuqi Kong, Jiaan Wang, Kexin Huang, Tianle Gu, Yixu Wang, Jian Wang, Liang Dandan, Zhixu Li, Yan Teng, Yanghua Xiao, Yingchun Wang | N/A | N/A |
| Cultural Conditioning or Placebo? On the Effectiveness of Socio-Demographic Prompting | Sagnik Mukherjee, Muhammad Farid Adilazuarda, Sunayana Sitaram, Kalika Bali, Alham Fikri Aji, Monojit Choudhury | N/A | N/A |
| Text Fluoroscopy: Detecting LLM-Generated Text through Intrinsic Features | Xiao Yu, Kejiang Chen, Qi Yang, Weiming Zhang, Nenghai Yu | N/A | N/A |
| Hate Personified: Investigating the role of LLMs in content moderation pipeline for hate speech | Sarah Masud, Sahajpreet Singh, Viktor Hangya, Alexander Fraser, Tanmoy Chakraborty | N/A | N/A |
| Temporally Consistent Factuality Probing for Large Language Models | Ashutosh Bajpai, Aaryan Goyal, Atif Anwer, Tanmoy Chakraborty | N/A | N/A |
| A Comparison of Language Modeling and Translation as Multilingual Pretraining Objectives | Zihao Li, Shaoxiong Ji, Timothee Mickus, Vincent Segonne, Jörg Tiedemann | N/A | N/A |
| Can LLMs replace Neil deGrasse Tyson? Evaluating the Reliability of LLMs as Science Communicators | Prasoon Bajpai, Niladri Chatterjee, Subhabrata Dutta, Tanmoy Chakraborty | N/A | N/A |
| LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-Training | Tong Zhu, Xiaoye Qu, Daize Dong, Jiacheng Ruan, Jingqi Tong, Conghui He, Yu Cheng | N/A | N/A |
| Themis: A Reference-free NLG Evaluation Language Model with Flexibility and Interpretability | Xinyu Hu, Li Lin, Mingqi Gao, Xunjian Yin, Xiaojun Wan | N/A | N/A |
| Mitigating Training Imbalance in LLM Fine-Tuning via Selective Parameter Merging | Yiming Ju, Ziyi Ni, Xingrun Xing, Zhixiong Zeng, hanyu Zhao, Siqi Fan, Zheng Zhang | N/A | N/A |
| Generating Demonstrations for In-Context Compositional Generalization in Grounded Language Learning | Sam Spilsbury, Pekka Marttinen, Alexander Ilin | N/A | N/A |
| FAME: Factual Multi-task Model Editing Benchmark | Li Zeng, Yingyu Shan, Zeming Liu, Jiashu Yao, Yuhang Guo | N/A | N/A |
| MLLM-Protector: Ensuring MLLM’s Safety without Hurting Performance | Renjie Pi, Tianyang Han, Jianshu Zhang, Yueqi XIE, Rui Pan, Qing LIAN, Hanze Dong, Jipeng Zhang, Tong Zhang | N/A | N/A |
| Leveraging Large Language Models for NLG Evaluation: Advances and Challenges | Zhen Li, Xiaohan Xu, Tao Shen, Can Xu, Jia-Chen Gu, Yuxuan Lai, Chongyang Tao, Shuai Ma | N/A | N/A |
| InfiniPot: Infinite Context Processing on Memory-Constrained LLMs | Minsoo Kim, Kyuhong Shim, Jungwook Choi, Simyung Chang | N/A | N/A |
| VideoCLIP-XL: Advancing Long Description Understanding for Video CLIP Models | Jiapeng Wang, Chengyu Wang, Kunzhe Huang, Jun Huang, Lianwen Jin | N/A | N/A |
| CorrSynth - A Correlated Sampling Method for Diverse dataset Generation from LLMs | Abhishek Divekar, Suhas S Kowshik, Vijit Malik | N/A | N/A |
| Defining Knowledge: Bridging Epistemology and Large Language Models | Constanza Fierro, Ruchira Dhar, Filippos Stamatiou, Nicolas Garneau, Anders Søgaard | N/A | N/A |
| TKGT: Redefinition and A New Way of Text-to-Table Tasks Based on Real World Demands and Knowledge Graphs Augmented LLMs | Peiwen Jiang, Zibo Zhao, Xinbo Lin, Ruhui Ma, Yvonne Jie Chen, Jinhua Cheng | N/A | N/A |
| Free your mouse! Command Large Language Models to Generate Code to Format Word Documents | Shihao Rao, Liang Li, Jiapeng Liu, Guan Weixin, Xiyan Gao, bing lim | N/A | N/A |
| CMR Scaling Law: Predicting Critical Mixture Ratios for Continual Pre-training of Language Models | Jiawei Gu, Zacc Yang, Chuanghao Ding, Rui Zhao, Fei Tan | N/A | N/A |
| The Instinctive Bias: Spurious Images lead to Hallucination in MLLMs | Tianyang Han, Qing LIAN, Rui Pan, Renjie Pi, Jipeng Zhang, Shizhe Diao, Yong Lin, Tong Zhang | N/A | N/A |
| Rationale-Aware Answer Verification by Pairwise Self-Evaluation | Akira Kawabata, Saku Sugawara | N/A | N/A |
| On the Robustness of Editing Large Language Models | Xinbei Ma, Tianjie Ju, Jiyang Qiu, Zhuosheng Zhang, hai zhao, lifeng Liu, Yulong Wang | N/A | N/A |
| IM-BERT: Enhancing Robustness of BERT through the Implicit Euler Method | MiHyeon Kim, Juhyoung Park, YoungBin Kim | N/A | N/A |
| Distract Large Language Models for Automatic Jailbreak Attack | Zeguan Xiao, Yan Yang, Guanhua Chen, Yun Chen | N/A | N/A |
| Exploring Space Efficiency in a Tree-based Linear Model for Extreme Multi-label Classification | He-Zhe Lin, Cheng-Hung Liu, Chih-Jen Lin | N/A | N/A |
| WorryWords: Norms of Anxiety Association for 44,450 English Words | Saif M. Mohammad | N/A | N/A |
| Finding Blind Spots in Evaluator LLMs with Interpretable Checklists | Sumanth Doddapaneni, Mohammed Safi Ur Rahman Khan, Sshubam Verma, Mitesh M Khapra | N/A | N/A |
| LONGAGENT: Achieving Question Answering for 128k-Token-Long Documents through Multi-Agent Collaboration | Jun Zhao, Can Zu, Xu Hao, Yi Lu, Wei He, Yiwen Ding, Tao Gui, Qi Zhang, Xuanjing Huang | N/A | N/A |
| AutoPersuade: A Framework for Evaluating and Explaining Persuasive Arguments | Till Raphael Saenger, Musashi Hinck, Justin Grimmer, Brandon M. Stewart | N/A | N/A |
| Towards Cross-Cultural Machine Translation with Retrieval-Augmented Generation from Multilingual Knowledge Graphs | Simone Conia, Daniel Lee, Min Li, Umar Farooq Minhas, Saloni Potdar, Yunyao Li | N/A | N/A |
| Exploring the Compositional Deficiency of Large Language Models in Mathematical Reasoning Through Trap Problems | Jun Zhao, Jingqi Tong, Yurong Mou, Ming Zhang, Qi Zhang, Xuanjing Huang | N/A | N/A |
| Scaling Laws for Linear Complexity Language Models | Xuyang Shen, Dong Li, Ruitao Leng, Zhen Qin, Weigao Sun, Yiran Zhong | N/A | N/A |
| Autoregressive Multi-trait Essay Scoring via Reinforcement Learning with Scoring-aware Multiple Rewards | Heejin Do, Sangwon Ryu, Gary Lee | N/A | N/A |
| Intrinsic Self-correction for Enhanced Morality: An Analysis of Internal Mechanisms and the Superficial Hypothesis | Guangliang Liu, Haitao Mao, Jiliang Tang, Kristen Johnson | N/A | N/A |
| ATAP: Automatic Template-Augmented Commonsense Knowledge Graph Completion via Pre-Trained Language Models | Fu Zhang, Yifan Ding, Jingwei Cheng | N/A | N/A |
| LM2: A Simple Society of Language Models Solves Complex Reasoning | Gurusha Juneja, Subhabrata Dutta, Tanmoy Chakraborty | N/A | N/A |
| Towards a Semantically-aware Surprisal Theory | Clara Meister, Mario Giulianelli, Tiago Pimentel | N/A | N/A |
| Multi-Level Information Retrieval Augmented Generation for Knowledge-based Visual Question Answering | Adjali Omar, Olivier Ferret, Sahar Ghannay, Hervé Le Borgne | N/A | N/A |
| Can We Trust the Performance Evaluation of Uncertainty Estimation Methods in Text Summarization? | Jianfeng He, Runing Yang, Linlin Yu, Changbin Li, Ruoxi Jia, Feng Chen, Ming Jin, Chang-Tien Lu | N/A | N/A |
| Is It Really Long Context if All You Need Is Retrieval? Towards Genuinely Difficult Long Context NLP | Omer Goldman, Alon Jacovi, Aviv Slobodkin, Aviya Maimon, Ido Dagan, Reut Tsarfaty | N/A | N/A |
| BPE Gets Picky: Efficient Vocabulary Refinement During Tokenizer Training | Pavel Chizhov, Catherine Arnett, Elizaveta Korotkova, Ivan P. Yamshchikov | N/A | N/A |
| SEGMENT+: Long Text Processing with Short-Context Language Models | Wei Shi, Shuang Li, Kerun Yu, Jinglei Chen, Zujie Liang, Xinhui Wu, Yuxi Qian, Feng Wei, Bo Zheng, Jiaqing Liang, Jiangjie Chen, Yanghua Xiao | N/A | N/A |
| Explicit Memory Learning with Expectation Maximization | Zhangyue Yin, Qiushi Sun, Qipeng Guo, Zhiyuan Zeng, Qinyuan Cheng, Xipeng Qiu, Xuanjing Huang | N/A | N/A |
| Learning to Generate Writing Feedback via Language Model Simulated Student Revisions | Inderjeet Jayakumar Nair, Jiaye Tan, Xiaotian Su, Anne Gere, Xu Wang, Lu Wang | N/A | N/A |
| Small LLMs Are Weak Tool Learners: A Multi-LLM Agent | Weizhou Shen, Chenliang Li, Hongzhan Chen, Ming Yan, Xiaojun Quan, Hehong Chen, Ji Zhang, Fei Huang | N/A | N/A |
| Interpreting Context Look-ups in Transformers: Investigating Attention-MLP Interactions | Clement Neo, Shay B Cohen, Fazl Barez | N/A | N/A |
| Still Not Quite There! Assessing Large Language Models for Comorbid Mental Health Diagnosis | Amey Hengle, Atharva Kulkarni, Shantanu Deepak Patankar, Rashmi Gupta | N/A | N/A |
| The Odyssey of Commonsense Causality: From Foundational Benchmarks to Cutting-Edge Reasoning | Shaobo Cui, Zhijing Jin, Bernhard Schölkopf, Boi Faltings | N/A | N/A |
| Investigating Large Language Models for Complex Word Identification in Multilingual and Multidomain Setups | Răzvan-Alexandru Smădu, David-Gabriel ION, Dumitru-Clementin Cercel, Florin Pop, Mihaela-Claudia Cercel | N/A | N/A |
| Model Editing Harms General Abilities of Large Language Models: Regularization to the Rescue | Jia-Chen Gu, Hao-Xiang Xu, Jun-Yu Ma, Pan Lu, Zhen-Hua Ling, Kai-Wei Chang, Nanyun Peng | N/A | N/A |
| Are Large Language Models In-Context Personalized Summarizers? Get an iCOPERNICUS Test Done! | Divya Patel, Pathik Patel, Ankush Chander, Sourish Dasgupta, Tanmoy Chakraborty | N/A | N/A |
| MediTOD: An English Dialogue Dataset for Medical History Taking with Comprehensive Annotations | Vishal Vivek Saley, Goonjan Saha, Rocktim Jyoti Das, Dinesh Raghu, Mausam | N/A | N/A |
| YesBut | Abhilash Nandy, Yash Agarwal, Ashish Patwa, Millon Madhur Das, Aman Bansal, ANKIT RAJ, Pawan Goyal, Niloy Ganguly | N/A | N/A |
| Scaling Cognitive Limits: Identifying Working Memory Limits in LLMs | Chunhui Zhang, Yiren Jian, Zhongyu Ouyang, Soroush Vosoughi | N/A | N/A |
| RAFT: Realistic Attacks to Fool Text Detectors | James Liyuan Wang, Ran Li, Junfeng Yang, Chengzhi Mao | N/A | N/A |
| LLM-Evolve: Evaluation for LLM’s Evolving Capability on Benchmarks | Jiaxuan You, Mingjie Liu, Shrimai Prabhumoye, Mostofa Patwary, Mohammad Shoeybi, Bryan Catanzaro | N/A | N/A |
| FFN-SkipLLM: A Hidden Gem for Autoregressive Decoding with Adaptive Feed Forward Skipping | AJAY KUMAR JAISWAL, Bodun Hu, Lu Yin, Yeonju Ro, Tianlong Chen, Shiwei Liu, Aditya Akella | N/A | N/A |
| LLM-based Code-Switched Text Generation for Grammatical Error Correction | Tom Potter, Zheng Yuan | N/A | N/A |
| Deciphering the Interplay of Parametric and Non-Parametric Memory in RAG Models | Mehrdad Farahani, Richard Johansson | N/A | N/A |
| On Efficient Language and Vision Assistants for Visually-Situated Natural Language Understanding: What Matters in Reading and Reasoning | Geewook Kim, Minjoon Seo | N/A | N/A |
| Community-Cross-Instruct: Unsupervised Instruction Generation for Aligning Large Language Models to Online Communities | Zihao He, Rebecca Dorn, Minh Duc Chu, Siyi Guo, Kristina Lerman | N/A | N/A |
| Mathador-LM: A Dynamic Benchmark for Mathematical Reasoning on Large Language Models | Eldar Kurtic, Amir Moeini, Dan Alistarh | N/A | N/A |
| Reasoning Paths with Reference Objects Elicit Quantitative Spatial Reasoning in Large Vision-Language Models | Yuan-Hong Liao, Rafid Mahmood, Sanja Fidler, David Acuna | N/A | N/A |
| One Thousand and One Pairs: A “novel” challenge for long-context language models | Marzena Karpinska, Katherine Thai, Kyle Lo, Tanya Goyal, Mohit Iyyer | N/A | N/A |
| Foundational Autoraters: Taming Large Language Models for Better Automatic Evaluation | Tu Vu, Kalpesh Krishna, Salaheddin Alzubi, Chris Tar, Manaal Faruqui, Yun-Hsuan Sung | N/A | N/A |
| Do LLMs learn a true syntactic universal? | John T. Hale, Miloš Stanojević | N/A | N/A |
| GDPO: Learning to Align Language Models with Diversity Using GFlowNets | Oh Joon Kwon, Daiki E. Matsunaga, Kee-Eung Kim | N/A | N/A |
| How Susceptible are Large Language Models to Ideological Manipulation? | Kai Chen, Zihao He, Jun Yan, Taiwei Shi, Kristina Lerman | N/A | N/A |
| Measuring Psychological Depth in Language Models | Fabrice Y Harel-Canada, Hanyu Zhou, Sreya Muppalla, Zeynep Senahan Yildiz, Miryung Kim, Nanyun Peng, Amit Sahai | N/A | N/A |
| Media Attitude Detection via Framing Analysis with Events and their Relations | Jin Zhao, Jingxuan Tu, Han Du, Nianwen Xue | N/A | N/A |
| Fill In The Gaps: Model Calibration and Generalization with Synthetic Data | Yang Ba, Michelle V Mancenido, Rong Pan | N/A | N/A |
| Adaptive Question Answering: Enhancing Language Model Proficiency for Addressing Knowledge Conflicts with Source Citations | Sagi Shaier, Ari Kobren, Philip V. Ogren | N/A | N/A |
| Granular Privacy Control for Geolocation with Vision Language Models | Ethan Mendes, Yang Chen, James Hays, Sauvik Das, Wei Xu, Alan Ritter | N/A | N/A |
| MedReadMe: A Systematic Study for Fine-grained Sentence Readability in Medical Domain | Chao Jiang, Wei Xu | N/A | N/A |
| MemeCLIP: Leveraging CLIP Representations for Multimodal Meme Classification | Siddhant Bikram Shah, Shuvam Shiwakoti, Maheep Chaudhary, Haohan Wang | N/A | N/A |
| FlipGuard: Defending Preference Alignment against Update Regression with Constrained Optimization | Mingye Zhu, Yi Liu, Quan Wang, Junbo Guo, Zhendong Mao | N/A | N/A |
| StorySpark: Expert-Annotated QA Pairs with Real-World Knowledge for Children Storytelling | Jiaju Chen, Yuxuan Lu, Shao Zhang, Bingsheng Yao, Yuanzhe Dong, Ying Xu, Yunyao Li, Qianwen Wang, Dakuo Wang, Yuling Sun | N/A | N/A |
| MedCoT: Medical Chain of Thought via Hierarchical Expert | Jiaxiang Liu, Yuan Wang, Jiawei Du, Joey Tianyi Zhou, Zuozhu Liu | N/A | N/A |
| Varying Sentence Representations via Condition-Specified Routers | Ziyong Lin, Quansen Wang, Zixia Jia, Zilong Zheng | N/A | N/A |
| Inductive-Deductive Strategy Reuse for Multi-Turn Instructional Dialogues | Jiao Ou, jiayu wu, Che Liu, Fuzheng Zhang, Di ZHANG, Kun Gai | N/A | N/A |
| Information Flow Routes: Automatically Interpreting Language Models at Scale | Javier Ferrando, Elena Voita | N/A | N/A |
| A Simple yet Effective Training-free Prompt-free Approach to Chinese Spelling Correction Based on Large Language Models | Houquan Zhou, Zhenghua Li, Bo Zhang, Chen Li, Shaopeng Lai, Ji Zhang, Fei Huang, Min Zhang | N/A | N/A |
| Low-rank Subspace for Binding in Large Language Models | Qin Dai, Benjamin Heinzerling, Kentaro Inui | N/A | N/A |
| CoSafe: Evaluating Large Language Model Safety in Multi-Turn Dialogue Coreference | Erxin Yu, Jing Li, Ming Liao, Siqi Wang, GAO Zuchen, Fei Mi, Lanqing HONG | N/A | N/A |
| ClimRetrieve: A Benchmarking Dataset for Information Retrieval from Corporate Climate Disclosures | Tobias Schimanski, Jingwei Ni, Roberto Spacey Martín, Nicola Ranger, Markus Leippold | N/A | N/A |
| Context-Aware Adapter Tuning for Few-Shot Relation Learning in Knowledge Graphs | LIU Ran, Zhongzhou Liu, Xiaoli Li, Yuan Fang | N/A | N/A |
| Zero-Shot Detection of LLM-Generated Text using Token Cohesiveness | Shixuan Ma, Quan Wang | N/A | N/A |
| Dual-oriented Disentangled Network with Counterfactual Intervention for Multimodal Intent Detection | Zhanpeng Chen, Zhihong Zhu, Xianwei Zhuang, Zhiqi Huang, Yuexian Zou | N/A | N/A |
| From LLMs to MLLMs: Exploring the Landscape of Multimodal Jailbreaking | Siyuan Wang, Zhuohan Long, Zhihao Fan, zhongyu wei | N/A | N/A |
| Symbolic Working Memory Enhances Language Models for Complex Rule Application | Siyuan Wang, zhongyu wei, Yejin Choi, Xiang Ren | N/A | N/A |
| LLoCO: Learning Long Contexts Offline | Sijun Tan, Xiuyu Li, Shishir G Patil, Ziyang Wu, Tianjun Zhang, Kurt Keutzer, Joseph E. Gonzalez, Raluca Popa | N/A | N/A |
| Don’t Forget Your Reward Values: Language Model Alignment via Value-based Calibration | Xin Mao, Feng-Lin Li, Huimin Xu, Wei Zhang, WANG CHEN, Anh Tuan Luu | N/A | N/A |
| Mentor-KD: Making Small Language Models Better Multi-step Reasoners | Hojae Lee, Junho Kim, SangKeun Lee | N/A | N/A |
| Are Large Language Models Capable of Generating Human-Level Narratives? | Yufei Tian, Tenghao Huang, Miri Liu, Derek Jiang, Alexander Spangher, Muhao Chen, Jonathan May, Nanyun Peng | N/A | N/A |
| MP2D: An Automated Topic Shift Dialogue Generation Framework Leveraging Knowledge Graphs | Yerin Hwang, Yongil Kim, Yunah Jang, Jeesoo Bang, Hyunkyung Bae, Kyomin Jung | N/A | N/A |
| Can Large Language Models Enhance Predictions of Disease Progression? Investigating Through Disease Network Link Prediction | Haohui Lu, Usman Naseem | N/A | N/A |
| Searching for Best Practices in Retrieval-Augmented Generation | Xiaohua Wang, Zhenghua Wang, Xuan Gao, Feiran Zhang, Yixin Wu, Zhibo Xu, Tianyuan Shi, Zhengyuan Wang, Shizheng Li, Qi Qian, Ruicheng Yin, Changze Lv, Xiaoqing Zheng, Xuanjing Huang | N/A | N/A |
| Moral Foundations of Large Language Models | Marwa Abdulhai, Gregory Serapio-García, Clement CREPY, Daria Valter, John Canny, Natasha Jaques | N/A | N/A |
| The Zeno’s Paradox of ‘Low-Resource’ Languages | Hellina Hailu Nigatu, Atnafu Lambebo Tonja, Benjamin Rosman, Thamar Solorio, Monojit Choudhury | N/A | N/A |
| Knowledge Planning in Large Language Models for Domain-Aligned Counseling Summarization | Aseem Srivastava, Smriti Joshi, Tanmoy Chakraborty, Md Shad Akhtar | N/A | N/A |
| Enhancing Post-Hoc Attributions in Long Document Comprehension via Coarse Grained Answer Decomposition | Pritika Ramu, Koustava Goswami, Apoorv Saxena, Balaji Vasan Srinivasan | N/A | N/A |
| From Descriptive Richness to Bias: Unveiling the Dark Side of Generative Image Caption Enrichment | Yusuke Hirota, Ryo Hachiuma, Chao-Han Huck Yang, Yuta Nakashima | N/A | N/A |
| Pruning via Merging: Compressing LLMs via Manifold Alignment Based Layer Merging | Deyuan Liu, Zhanyue Qin, Hairu Wang, Zhao Yang, Zecheng Wang, Fangying Rong, Qingbin Liu, Yanchao Hao, Bo Li, Xi Chen, Cunhang Fan, Zhao Lv, Dianhui Chu, Zhiying Tu, Dianbo Sui | N/A | N/A |
| Embedded Named Entity Recognition using Probing Classifiers | Nicholas Popovic, Michael Färber | N/A | N/A |
| Unleashing the Power of Emojis in Texts via Self-supervised Graph Pre-Training | Zhou Zhang, Dongzeng Tan, Jiaan Wang, Yilong Chen, Jiarong Xu | N/A | N/A |
| Data Contamination Can Cross Language Barriers | Feng Yao, Yufan Zhuang, Zihao Sun, Sunan Xu, Animesh Kumar, Jingbo Shang | N/A | N/A |
| Automated Essay Scoring: A Reflection on the State of the Art | Shengjie Li, Vincent Ng | N/A | N/A |
| Encouraging Divergent Thinking in Large Language Models through Multi-Agent Debate | Tian Liang, Zhiwei He, Wenxiang Jiao, Xing Wang, Yan Wang, Rui Wang, Yujiu Yang, Shuming Shi, Zhaopeng Tu | N/A | N/A |
| Unveiling and Consulting Core Experts in Retrieval-Augmented MoE-based LLMs | Xin Zhou, Ping Nie, Yiwen Guo, Haojie Wei, Zhanqiu Zhang, Pasquale Minervini, Ruotian Ma, Tao Gui, Qi Zhang, Xuanjing Huang | N/A | N/A |
| CURE: Context- and Uncertainty-Aware Mental Disorder Detection | Migyeong Kang, goun choi, Hyolim Jeon, Ji hyun An, Daejin Choi, Jinyoung Han | N/A | N/A |
| PepRec: Progressive Enhancement of Prompting for Recommendation | Yakun Yu, Shi-ang Qi, Baochun Li, Di Niu | N/A | N/A |
| In-Context Compositional Generalization for Large Vision-Language Models | Chuanhao Li, Chenchen Jing, Zhen Li, Mingliang Zhai, Yuwei Wu, Yunde Jia | N/A | N/A |
| Improving Zero-shot LLM Re-Ranker with Risk Minimization | Xiaowei Yuan, Zhao Yang, Yequan Wang, Jun Zhao, Kang Liu | N/A | N/A |
| Game on Tree: Visual Hallucination Mitigation via Coarse-to-Fine View Tree and Game Theory | Xianwei Zhuang, Zhihong Zhu, Zhanpeng Chen, Yuxin Xie, Liming Liang, Yuexian Zou | N/A | N/A |
| Label Confidence Weighted Learning for Target-level Sentence Simplification | Jingshen Zhang, Xin Ying Qiu | N/A | N/A |
| Quantum Recurrent Architectures for Text Classification | Wenduan Xu, Stephen Clark, Douglas Brown, Gabriel Matos, Konstantinos Meichanetzidis | N/A | N/A |
| Tree of Problems: Improving structured problem solving with compositionality | Armel Randy Zebaze, Benoît Sagot, Rachel Bawden | N/A | N/A |
| What the Harm? Quantifying the Tangible Impact of Gender Bias in Machine Translation with a Human-centered Study | Beatrice Savoldi, Sara Papi, Matteo Negri, Ana Guerberof-Arenas, Luisa Bentivogli | N/A | N/A |
| Seg2Act: Global Context-aware Action Generation for Document Logical Structuring | Zichao Li, Shaojie He, Meng Liao, Xuanang Chen, Yaojie Lu, Hongyu Lin, Yanxiong Lu, Xianpei Han, Le Sun | N/A | N/A |
| Is C4 Dataset Enough for Pruning? An Investigation of Calibration Data for LLM Pruning | Abhinav Bandari, Lu Yin, Cheng-Yu Hsieh, AJAY KUMAR JAISWAL, Tianlong Chen, Li Shen, Ranjay Krishna, Shiwei Liu | N/A | N/A |
| Revisiting the Robustness of Watermarking to Paraphrasing Attacks | Saksham Rastogi, Danish Pruthi | N/A | N/A |
| A Survey of Ontology Expansion for Conversational Understanding | Jinggui Liang, Yuxia Wu, Yuan Fang, Hao Fei, Lizi Liao | N/A | N/A |
| Calibrating Language Models with Adaptive Temperature Scaling | Johnathan Xie, Annie S Chen, Yoonho Lee, Eric Mitchell, Chelsea Finn | N/A | N/A |
| Which Programming Language and What Features at Pre-training Stage Affect Downstream Logical Inference Performance? | Fumiya Uchiyama, Takeshi Kojima, Andrew Gambardella, Qi Cao, Yusuke Iwasawa, Yutaka Matsuo | N/A | N/A |
| Why do objects have many names? A study on word informativeness in language use and lexical systems. | Eleonora Gualdoni, Gemma Boleda | N/A | N/A |
| Dual-Space Knowledge Distillation for Large Language Models | Songming Zhang, Xue Zhang, Zengkui Sun, Yufeng Chen, Jinan Xu | N/A | N/A |
| NoiseBench: Benchmarking the Impact of Real Label Noise on Named Entity Recognition | Elena Merdjanovska, Ansar Aynetdinov, Alan Akbik | N/A | N/A |
| On the Universal Truthfulness Hyperplane Inside LLMs | Junteng Liu, Shiqi Chen, Yu Cheng, Junxian He | N/A | N/A |
| PairDistill: Pairwise Relevance Distillation for Dense Retrieval | Chao-Wei Huang, Yun-Nung Chen | N/A | N/A |
| User Inference Attacks on Large Language Models | Nikhil Kandpal, Krishna Pillutla, Alina Oprea, Peter Kairouz, Christopher A. Choquette-Choo, Zheng Xu | N/A | N/A |
| HiFT: A Hierarchical Full Parameter Fine-Tuning Strategy | YongKang Liu, Yiqun Zhang, Qian Li, Tong Liu, Shi Feng, Daling Wang, Yifei Zhang, Hinrich Schuetze | N/A | N/A |
| Investigating and Mitigating Object Hallucinations in Pretrained Vision-Language (CLIP) Models | Yufang Liu, Tao Ji, Changzhi Sun, Yuanbin Wu, Aimin Zhou | N/A | N/A |
| Simultaneous Masking, Not Prompting Optimization: A Paradigm Shift in Fine-tuning LLMs for Simultaneous Translation | Matthew Raffel, Victor Agostinelli, Lizhong Chen | N/A | N/A |
| ToolPlanner: A Tool Augmented LLM for Multi Granularity Instructions with Path Planning and Feedback | Qinzhuo Wu, Wei Liu, Jian Luan, Bin Wang | N/A | N/A |
| Please note that I’m just an AI: Analysis of Behavior Patterns of LLMs in (Non-)offensive Speech Identification | Esra Dönmez, Thang Vu, Agnieszka Falenska | N/A | N/A |
| How to Compute the Probability of a Word | Tiago Pimentel, Clara Meister | N/A | N/A |
| A linguistically-motivated evaluation methodology for unraveling model’s abilities in reading comprehension tasks | Elie Antoine, Frederic Bechet, Géraldine Damnati, Philippe Langlais | N/A | N/A |
| GuardBench: A Large-Scale Benchmark for Guardrail Models | Elias Bassani, Ignacio Sanchez | N/A | N/A |
| Generate-on-Graph: Treat LLM as both Agent and KG for Incomplete Knowledge Graph Question Answering | Yao Xu, Shizhu He, Jiabei Chen, Zihao Wang, Yangqiu Song, Hanghang Tong, Guang Liu, Jun Zhao, Kang Liu | N/A | N/A |
| Language models and brains align due to more than next-word prediction and word-level information | Gabriele Merlin, Mariya Toneva | N/A | N/A |
| LLMEdgeRefine: Enhancing Text Clustering with LLM-Based Boundary Point Refinement | Zijin Feng, Luyang Lin, Lingzhi Wang, Hong Cheng, Kam-Fai Wong | N/A | N/A |
| CasiMedicos-Arg: A Medical Question Answering Dataset Annotated with Explanatory Argumentative Structures | Ekaterina Sviridova, Anar Yeginbergen, Ainara Estarrona, Elena Cabrio, Serena Villata, Rodrigo Agerri | N/A | N/A |
| A Simple and Effective $L_2$ Norm-Based Strategy for KV Cache Compression | Alessio Devoto, Yu Zhao, Simone Scardapane, Pasquale Minervini | N/A | N/A |
| GOME: Grounding-based Metaphor Binding With Conceptual Elaboration For Figurative Language Illustration | Linhao Zhang, Jintao Liu, Li Jin, Hao Wang, Kaiwen Wei, Guangluan Xu | N/A | N/A |
| D3CODE: Disentangling Disagreements in Data across Cultures on Offensiveness Detection and Evaluation | Aida Mostafazadeh Davani, Mark Diaz, Dylan K Baker, Vinodkumar Prabhakaran | N/A | N/A |
| PALM: Few-Shot Prompt Learning for Audio Language Models | Asif Hanif, Maha Tufail Agro, Mohammad Areeb Qazi, Hanan Aldarmaki | N/A | N/A |
| Annotator-Centric Active Learning for Subjective NLP Tasks | Michiel van der Meer, Neele Falk, Pradeep K. Murukannaiah, Enrico Liscio | N/A | N/A |
| Lost in Tokenization: How to Measure Word Surprisal From LM Token Probabilities | Luca Malagutti, Juan Luis Gastaldi, Brian DuSell, Tim Vieira, Ryan Cotterell, Mario Giulianelli | N/A | N/A |
| Enhanced Hallucination Detection in Neural Machine Translation through Simple Detector Aggregation | Anas Himmi, Guillaume Staerman, Marine Picot, Pierre Colombo, Nuno M Guerreiro | N/A | N/A |
| Jailbreaking LLMs with Arabic Transliteration and Arabizi | Mansour Al Ghanim, Saleh Almohaimeed, Mengxin Zheng, Yan Solihin, Qian Lou | N/A | N/A |
| Who is better at math, Jenny or Jingzhen? Uncovering Stereotypes in Large Language Models | Zara Siddique, Liam Turner, Luis Espinosa-Anke | N/A | N/A |
| Instruction Matters, a Simple yet Effective Task Selection Approach in Instruction Tuning for Specific Tasks | Changho Lee, Janghoon Han, Seonghyeon Ye, Stanley Jungkyu Choi, Honglak Lee, Kyunghoon Bae | N/A | N/A |
| Recurrent Alignment with Hard Attention for Hierarchical Text Rating | Chenxi Lin, Ren Jiayu, Guoxiu He, Zhuoren Jiang, Haiyan Yu, Xiaomin Zhu | N/A | N/A |
| CHESS: Optimizing LLM Inference via Channel-Wise Thresholding and Selective Sparsification | Junhui He, Shangyu Wu, Weidong Wen, Chun Jason Xue, Qingan Li | N/A | N/A |
| Semformer: Transformer Language Models with Semantic Planning | Yongjing Yin, Junran Ding, Kai Song, Yue Zhang | N/A | N/A |
| DocCGen: Document-based Controlled Code Generation | Sameer Pimparkhede, Mehant Kammakomati, Srikanth G. Tamilselvam, Prince Kumar, Ashok Pon Kumar, Pushpak Bhattacharyya | N/A | N/A |
| Semantics and Sentiment: Cross-lingual Variations in Emoji Use | Giulio Zhou, Sydelle de Souza, Ella Markham, Oghenetekevwe Kwakpovwe, Sumin Zhao | N/A | N/A |
| The Emergence of Compositional Languages in Multi-entity Referential Games: from Image to Graph Representations | Daniel Akkerman, Phong Le, Raquel G. Alhama | N/A | N/A |
| Transformers are Multi-State RNNs | Matanel Oren, Michael Hassid, Nir Yarden, Yossi Adi, Roy Schwartz | N/A | N/A |
| Evaluating Large Language Models along Dimensions of Language Variation: A Systematik Invesdigatiom uv Cross-lingual Generalization | Niyati Bafna, Kenton Murray, David Yarowsky | N/A | N/A |
| Fuse to Forget: Bias Reduction and Selective Memorization through Model Fusion | Kerem Zaman, Leshem Choshen, Shashank Srivastava | N/A | N/A |
| Collective Critics for Creative Story Generation | Minwook Bae, Hyounghun Kim | N/A | N/A |
| Surprisal Curves of Discourse | Eleftheria Tsipidi, Franz Nowak, Ryan Cotterell, Ethan Wilcox, Mario Giulianelli, Alex Warstadt | N/A | N/A |
| Model-based Preference Optimization in Abstractive Summarization without Human Feedback | Jaepill Choi, Kyubyung Chae, Jiwoo Song, Yohan Jo, Taesup Kim | N/A | N/A |
| Are Data Augmentation Methods in Named Entity Recognition Applicable for Uncertainty Estimation? | Wataru Hashimoto, Hidetaka Kamigaito, Taro Watanabe | N/A | N/A |
| NeuroTrialNER: An Annotated Corpus for Neurological Diseases and Therapies in Clinical Trial Registries | Simona Emilova Doneva, Tilia Ellendorff, Jean-Philippe Goldman, Amelia Elaine Cannon, Gerold Schneider, Beate Sick, Benjamin Victor Ineichen | N/A | N/A |
| Do Explanations Help or Hurt? Saliency Maps vs Natural Language Explanations in a Clinical Decision-Support Setting | Maxime Guillaume Kayser, Bayar Menzat, Cornelius Emde, Bogdan Alexandru Bercean, Alex Novak, Abdalá Trinidad Espinosa Morgado, Bartlomiej Papiez, Susanne Gaube, Thomas Lukasiewicz, Oana-Maria Camburu | N/A | N/A |
| Towards Faithful Knowledge Graph Explanation Through Deep Alignment in Commonsense Question Answering | Weihe Zhai, Arkaitz Zubiaga, Bingquan Liu, Chengjie Sun, Yalong Zhao | N/A | N/A |
| Generation with Dynamic Vocabulary | Yanting Liu, Tao Ji, Yuanbin Wu, Xiaoling Wang, Changzhi Sun | N/A | N/A |
| Argument Relation Classification through Discourse Markers and Adversarial Training | Michele Luca Contalbo, Francesco Guerra, Matteo Paganelli | N/A | N/A |
| Getting The Most Out of Your Training Data: Exploring Unsupervised Tasks for Morphological Inflection | Abhishek Purushothama, Adam Wiemerslage, Katharina von der Wense | N/A | N/A |
| Link, Synthesize, Retrieve: Universal Document Linking for Zero-Shot Information Retrieval | Dae Yon Hwang, Bilal Taha, Harshit Pande, Yaroslav Nechaev | N/A | N/A |
| Efficient Unseen Language Adaptation for Multilingual Pre-Trained Language Models | Po-Heng Chen, Yun-Nung Chen | N/A | N/A |
| Prove Your Point!: Bringing Proof-Enhancement Principles to Argumentative Essay Generation | Ruiyu Xiao, Lei Wu, Yuhang Gou, Weinan Zhang, Ting Liu | N/A | N/A |
| TV-TREES: Multimodal Entailment Trees for Neuro-Symbolic Video Reasoning | Kate Sanders, Nathaniel Weir, Benjamin Van Durme | N/A | N/A |
| Unsupervised Extraction of Dialogue Policies from Conversations | Makesh Narsimhan Sreedhar, Traian Rebedea, Christopher Parisien | N/A | N/A |
| GRIZAL: Generative Prior-guided Zero-Shot Temporal Action Localization | Onkar Kishor Susladkar, Gayatri Sudhir Deshmukh, Vandan Gorade, Sparsh Mittal | N/A | N/A |
| Preserving Multi-Modal Capabilities of Pre-trained VLMs for Improving Vision-Linguistic Compositionality | Youngtaek Oh, Jae Won Cho, Dong-Jin Kim, In So Kweon, Junmo Kim | N/A | N/A |
| FoodieQA: A Multimodal Dataset for Fine-Grained Understanding of Chinese Food Culture | Wenyan Li, Crystina Zhang, Jiaang Li, Qiwei Peng, Raphael Tang, Li Zhou, Weijia Zhang, Guimin Hu, Yifei Yuan, Anders Søgaard, Daniel Hershcovich, Desmond Elliott | N/A | N/A |
| A Two-Step Approach for Data-Efficient French Pronunciation Learning | Hoyeon Lee, Hyeeun Jang, Jonghwan Kim, Jaemin Kim | N/A | N/A |
| Exploring Intra and Inter-language Consistency in Embeddings with ICA | Rongzhi Li, Takeru Matsuda, Hitomi Yanaka | N/A | N/A |
| DetoxLLM: A Framework for Detoxification with Explanations | Md Tawkat Islam Khondaker, Muhammad Abdul-Mageed, Laks V. S. Lakshmanan | N/A | N/A |
| Building a Multi-Platform, BERT Classifier for Detecting Connective Language | Josephine Lukito, Bin Chen, Gina M. Masullo, Natalie Jomini Stroud | N/A | N/A |
| ShadowLLM: Predictor-based Contextual Sparsity for Large Language Models | Yash Akhauri, Ahmed F AbouElhamayed, Jordan Dotzel, Zhiru Zhang, Alexander M Rush, Safeen Huda, Mohamed S Abdelfattah | N/A | N/A |
| Emotion Granularity from Text: An Aggregate-Level Indicator of Mental Health | Krishnapriya Vishnubhotla, Daniela Teodorescu, Mallory J Feldman, Kristen Lindquist, Saif M. Mohammad | N/A | N/A |
| BLSP-Emo: Towards Empathetic Large Speech-Language Models | Chen Wang, Minpeng Liao, Zhongqiang Huang, Junhong Wu, Chengqing Zong, Jiajun Zhang | N/A | N/A |
| SynthesizRR: Generating Diverse Datasets with Retrieval Augmentation | Abhishek Divekar, Greg Durrett | N/A | N/A |
| Multimodal Self-Instruct: Synthetic Abstract Image and Visual Reasoning Instruction Using Language Model | Wenqi Zhang, Zhenglin Cheng, Yuanyu He, Mengna Wang, Yongliang Shen, Zeqi Tan, Guiyang Hou, Mingqian He, Yanna Ma, Weiming Lu, Yueting Zhuang | N/A | N/A |
| DataNarrative: Automated Data-Driven Storytelling with Visualizations and Texts | Mohammed Saidul Islam, Md Tahmid Rahman Laskar, Md Rizwan Parvez, Enamul Hoque, Shafiq Joty | N/A | N/A |
| DEM: Distribution Edited Model for Training with Mixed Data Distributions | Dhananjay Ram, Aditya Rawal, Momchil Hardalov, Nikolaos Pappas, Sheng Zha | N/A | N/A |
| Altogether: Image Captioning via Re-aligning Alt-text | Hu Xu, Po-Yao Huang, Xiaoqing Tan, Ching-Feng Yeh, Jacob Kahn, Christine Jou, Gargi Ghosh, Omer Levy, Luke Zettlemoyer, Wen-tau Yih, Shang-Wen Li, Saining Xie, Christoph Feichtenhofer | N/A | N/A |
| VerifyMatch: A Semi-Supervised Learning Paradigm for Natural Language Inference with Confidence-Aware MixUp | Seo Yeon Park, Cornelia Caragea | N/A | N/A |
| CaT-Bench: Benchmarking Language Model Understanding of Causal and Temporal Dependencies in Plans | Yash Kumar Lal, Vanya Cohen, Nathanael Chambers, Niranjan Balasubramanian, Ray Mooney | N/A | N/A |
| Mitigating the Impact of Reference Quality on Evaluation of Summarization Systems with Reference-Free Metrics | Théo Gigant, Camille Guinaudeau, Marc Decombas, Frederic Dufaux | N/A | N/A |
| An Empirical Analysis of the Writing Styles of Persona-Assigned LLMs | Manuj Malik, Jing Jiang, Kian Ming A. Chai | N/A | N/A |
| Investigating the Role of Instruction Variety and Task Difficulty in Robotic Manipulation Tasks | Amit Parekh, Nikolas Vitsakis, Alessandro Suglia, Ioannis Konstas | N/A | N/A |
| GPT vs RETRO: Exploring the Intersection of Retrieval and Parameter-Efficient Fine-Tuning | Aleksander Ficek, Jiaqi Zeng, Oleksii Kuchaiev | N/A | N/A |
| CoCoST: Automatic Complex Code Generation with Online Searching and Correctness Testing | Xinyi He, Jiaru Zou, Yun Lin, Mengyu Zhou, Shi Han, Zejian Yuan, Dongmei Zhang | N/A | N/A |
| Sequential API Function Calling Using GraphQL Schema | Avirup Saha, Lakshmi Mandal, Balaji Ganesan, Sambit Ghosh, Renuka Sindhgatta, Carlos Eberhardt, Dan Debrunner, Sameep Mehta | N/A | N/A |
| The Illusion of Competence: Evaluating the Effect of Explanations on Users’ Mental Models of Visual Question Answering Systems | Judith Sieker, Simeon Junker, Ronja Utescher, Nazia Attari, Heiko Wersing, Hendrik Buschmeier, Sina Zarrieß | N/A | N/A |
| Re-Evaluating Evaluation for Multilingual Summarization | Jessica Zosa Forde, Ruochen Zhang, Lintang Sutawika, Alham Fikri Aji, Samuel Cahyawijaya, Genta Indra Winata, Minghao Wu, Carsten Eickhoff, Stella Biderman, Ellie Pavlick | N/A | N/A |
| Video-Text Prompting for Weakly Supervised Spatio-Temporal Video Grounding | Heng Zhao, Zhao Yinjie, Bihan Wen, Yew-Soon Ong, Joey Tianyi Zhou | N/A | N/A |
| A Fast and Sound Tagging Method for Discontinuous Named-Entity Recognition | Caio Filippo Corro | N/A | N/A |
| Factuality of Large Language Models in the Year 2024 | Yuxia Wang, Minghan Wang, Muhammad Arslan Manzoor, Fei Liu, Georgi Nenkov Georgiev, Rocktim Jyoti Das, Preslav Nakov | N/A | N/A |
| Discovering Biases in Information Retrieval Models Using Relevance Thesaurus as Global Explanation | Youngwoo Kim, Razieh Rahimi, James Allan | N/A | N/A |
| Adaptable Moral Stances of Large Language Models on Sexist Content: Implications for Society and Gender Discourse | Rongchen Guo, Isar Nejadgholi, Hillary Dawkins, Kathleen C. Fraser, Svetlana Kiritchenko | N/A | N/A |
| DISCERN: Decoding Systematic Errors in Natural Language for Text Classifiers | Rakesh R Menon, Shashank Srivastava | N/A | N/A |
| IntCoOp: Interpretability-Aware Vision-Language Prompt Tuning | Soumya Suvra Ghosal, Samyadeep Basu, Soheil Feizi, Dinesh Manocha | N/A | N/A |
| Scope-enhanced Compositional Semantic Parsing for DRT | Xiulin Yang, Jonas Groschwitz, Alexander Koller, Johan Bos | N/A | N/A |
| The Generation Gap: Exploring Age Bias Underlying in the Value Systems of Large Language Models | Siyang Liu, Trisha Maturi, Bowen Yi, Siqi Shen, Rada Mihalcea | N/A | N/A |
| TempoFormer: A Transformer for Temporally-aware Representations in Change Detection | Talia Tseriotou, Adam Tsakalidis, Maria Liakata | N/A | N/A |
| Pron vs Prompt: Can Large Language Models already Challenge a World-Class Fiction Author at Creative Text Writing? | Guillermo Marco, Julio Gonzalo, M. Teresa Mateo-Girona, Ramón del Castillo Santos | N/A | N/A |
| Evaluating Diversity in Automatic Poetry Generation | Yanran Chen, Hannes Gröner, Sina Zarrieß, Steffen Eger | N/A | N/A |
| Evaluating Short-Term Temporal Fluctuations of Social Biases in Social Media Data and Masked Language Models | Yi Zhou, Danushka Bollegala, Jose Camacho-Collados | N/A | N/A |
| Delving into Qualitative Implications of Synthetic Data for Hate Speech Detection | Camilla Casula, Sebastiano Vecellio Salto, Alan Ramponi, Sara Tonelli | N/A | N/A |
| Grounding Language in Multi-Perspective Referential Communication | Zineng Tang, Lingjun Mao, Alane Suhr | N/A | N/A |
| Threshold-driven Pruning with Segmented Maximum Term Weights for Approximate Cluster-based Sparse Retrieval | Yifan Qiao, Parker Carlson, Shanxiu He, Yingrui Yang, Tao Yang | N/A | N/A |
| Error Analysis of Multilingual Language Models in Machine Translation for Low-resource Languages: A Case Study of Amharic to English Bi-directional Machine Translation | Hizkiel Mitiku Alemayehu, Hamada M Zahera, Axel-Cyrille Ngonga Ngomo | N/A | N/A |
| MIPD: Exploring Manipulation and Intention In a Novel Corpus of Polish Disinformation | Arkadiusz Modzelewski, Giovanni Da San Martino, Pavel Savov, Magdalena Anna Wilczyńska, Adam Wierzbicki | N/A | N/A |
| Unsupervised Discrete Representations of American Sign Language | Artem Abzaliev, Rada Mihalcea | N/A | N/A |
| Perceptions to Beliefs: Exploring Precursory Inferences for Theory of Mind in Large Language Models | Chani Jung, Dongkwan Kim, Jiho Jin, Jiseon Kim, Yeon Seonwoo, Yejin Choi, Alice Oh, Hyunwoo Kim | N/A | N/A |
| Towards Enhancing Coherence in Extractive Summarization: Dataset and Experiments with LLMs | Mihir Parmar, Hanieh Deilamsalehy, Franck Dernoncourt, Seunghyun Yoon, Ryan A. Rossi, Trung Bui | N/A | N/A |
| Jump Starting Bandits with LLM-Generated Prior Knowledge | Parand A. Alamdari, Yanshuai Cao, Kevin H. Wilson | N/A | N/A |
| Adaptation Odyssey in LLMs: Why Does Additional Pretraining Sometimes Fail to Improve? | Fırat Öncel, Matthias Bethge, Beyza Ermis, Mirco Ravanelli, Cem Subakan, Çağatay Yıldız | N/A | N/A |
| Not All Contexts Are Equal: Teaching LLMs Credibility-aware Generation | Ruotong Pan, Boxi Cao, Hongyu Lin, Xianpei Han, Jia Zheng, Sirui Wang, Xunliang Cai, Le Sun | N/A | N/A |
| Virtual Personas for Language Models via an Anthology of Backstories | Suhong Moon, Marwa Abdulhai, Minwoo Kang, Joseph Suh, Widyadewi Soedarmadji, Eran Kohen Behar, David Chan | N/A | N/A |
| Step-by-Step Reasoning to Solve Grid Puzzles: Where do LLMs Falter? | Nemika Tyagi, Mihir Parmar, Mohith Kulkarni, Aswin RRV, Nisarg Patel, Mutsumi Nakamura, Arindam Mitra, Chitta Baral | N/A | N/A |
| Reasoning in Token Economies: Budget-Aware Evaluation of LLM Reasoning Strategies | Junlin Wang, Siddhartha Jain, Dejiao Zhang, Baishakhi Ray, Varun Kumar, Ben Athiwaratkun | N/A | N/A |
| The Empirical Variability of Narrative Perceptions of Social Media Texts | Joel Mire, Maria Antoniak, Elliott Ash, Andrew Piper, Maarten Sap | N/A | N/A |
| Which questions should I answer? Salience Prediction of Inquisitive Questions | Yating Wu, Ritika Rajesh Mangla, Alex Dimakis, Greg Durrett, Junyi Jessy Li | N/A | N/A |
| Revealing Personality Traits: A New Benchmark Dataset for Explainable Personality Recognition on Dialogues | Lei Sun, Jinming Zhao, Qin Jin | N/A | N/A |
| Continual Test-time Adaptation for End-to-end Speech Recognition on Noisy Speech | Guan-Ting Lin, Wei Ping Huang, Hung-yi Lee | N/A | N/A |
| Whiteboard-of-Thought: Thinking Step-by-Step Across Modalities | Sachit Menon, Richard Zemel, Carl Vondrick | N/A | N/A |
| CodeJudge: Evaluating Code Generation with Large Language Models | Weixi Tong, Tianyi Zhang | N/A | N/A |
| Self-Training Large Language and Vision Assistant for Medical | Guohao Sun, Can Qin, Huazhu Fu, Linwei Wang, Zhiqiang Tao | N/A | N/A |
| SYNFAC-EDIT: Synthetic Imitation Edit Feedback for Factual Alignment in Clinical Summarization | Prakamya Mishra, Zonghai Yao, Parth Vashisht, Feiyun Ouyang, Beining Wang, Vidhi Dhaval Mody, Hong Yu | N/A | N/A |
| Defending Jailbreak Prompts via In-Context Adversarial Game | Yujun Zhou, Yufei Han, Haomin Zhuang, Kehan Guo, Zhenwen Liang, Hongyan Bao, Xiangliang Zhang | N/A | N/A |
| Detecting Online Community Practices with Large Language Models: A Case Study of Pro-Ukrainian Publics on Twitter | Kateryna Kasianenko, Shima Khanehzar, Stephen Wan, Ehsan Dehghan, Axel Bruns | N/A | N/A |
| Multilingual Topic Classification in X: Dataset and Analysis | Dimosthenis Antypas, Asahi Ushio, Francesco Barbieri, Jose Camacho-Collados | N/A | N/A |
| MT-Eval: A Multi-Turn Capabilities Evaluation Benchmark for Large Language Models | Wai-Chung Kwan, Xingshan Zeng, Yuxin Jiang, Yufei Wang, Liangyou Li, Lifeng Shang, Xin Jiang, Qun Liu, Kam-Fai Wong | N/A | N/A |
| Updating CLIP to Prefer Descriptions Over Captions | Amir Zur, Elisa Kreiss, Karel D’Oosterlinck, Christopher Potts, Atticus Geiger | N/A | N/A |
| CmdCaliper: A Semantic-Aware Command-Line Embedding Model and Dataset for Security Research | Sian-Yao Huang, Cheng-Lin Yang, Che-Yu Lin, Chun-Ying Huang | N/A | N/A |
| Back to School: Translation Using Grammar Books | Jonathan Hus, Antonios Anastasopoulos | N/A | N/A |
| VIEWS: Entity-Aware News Video Captioning | Hammad Ayyubi, Tianqi Liu, Arsha Nagrani, Xudong Lin, Mingda Zhang, Anurag Arnab, Feng Han, Yukun Zhu, Xuande Feng, Kevin Zhang, Jialu Liu, Shih-Fu Chang | N/A | N/A |
| Towards Aligning Language Models with Textual Feedback | Saüc Abadal Lloret, Shehzaad Dhuliawala, Keerthiram Murugesan, Mrinmaya Sachan | N/A | N/A |
| ATPO: Automatic Tree-Structured Prompt Optimization | Sheng Yang, Yurong Wu, Yan Gao, Zineng Zhou, Xiaodi Sun, Bin Benjamin Zhu, Jian-Guang Lou, Zhiming Ding, Anbang Hu, Yuan Fang, Yunsong Li, Junyan Chen, Linjun Yang | N/A | N/A |
| DeMPT: Decoding-enhanced Multi-phase Prompt Tuning for Making LLMs Be Better Context-aware Translators | Xinglin Lyu, Junhui Li, Yanqing Zhao, Min Zhang, Daimeng Wei, Shimin Tao, Hao Yang, Min Zhang | N/A | N/A |
| DEFT-UCS: Data Efficient Fine-Tuning for Pre-Trained Language Models via Unsupervised Core-Set Selection | Devleena Das, Vivek Khetan | N/A | N/A |
| Unveiling Multi-level and Multi-modal Semantic Representations in the Human Brain using Large Language Models | Yuko Nakagi, Takuya Matsuyama, Naoko Koide-Majima, Hiroto Q. Yamaguchi, Rieko Kubo, Shinji Nishimoto, Yu Takagi | N/A | N/A |
| “They are uncultured”: Unveiling Covert Harms and Social Threats in LLM Generated Conversations | Preetam Prabhu Srikar Dammu, Hayoung Jung, Anjali Singh, Monojit Choudhury, Tanu Mitra | N/A | N/A |
| Multi-expert Prompting Improves Reliability, Safety and Usefulness of Large Language Models | Do Xuan Long, Duong Ngoc Yen, Anh Tuan Luu, Kenji Kawaguchi, Min-Yen Kan, Nancy F. Chen | N/A | N/A |
| Will LLMs Replace the Encoder-Only Models in Temporal Relation Classification? | Gabriel Roccabruna, Massimo Rizzoli, Giuseppe Riccardi | N/A | N/A |
| Eliciting In-Context Learning in Vision-Language Models for Videos Through Curated Data Distributional Properties | Keunwoo Peter Yu, Zheyuan Zhang, Fengyuan Hu, Shane Storks, Joyce Chai | N/A | N/A |
| Framework for Robust and Scalable Text Watermarking | Gregory Kang Ruey Lau, Xinyuan Niu, Hieu Dao, Jiangwei Chen, Chuan-Sheng Foo, Bryan Kian Hsiang Low | N/A | N/A |
| MASIVE: Open-Ended Affective State Identification in English and Spanish | Nicholas Deas, Elsbeth Turcan, Ivan Ernesto Perez Mejia, Kathleen McKeown | N/A | N/A |
| You Make me Feel like a Natural Question: Training QA Systems on Transformed Trivia Questions | Tasnim Kabir, Yoo Yeon Sung, Saptarashmi Bandyopadhyay, Hao Zou, Abhranil Chandra, Jordan Lee Boyd-Graber | N/A | N/A |
| AlphaExpert: Assigning LoRA Experts Based on Layer Training Quality | Peijun Qing, Chongyang Gao, Yefan Zhou, Xingjian Diao, Yaoqing Yang, Soroush Vosoughi | N/A | N/A |
| Flee the Flaw: Annotating the Underlying Logic of Fallacious Arguments Through Templates and Slot-filling | Irfan Robbani, Paul Reisert, Surawat Pothong, Naoya Inoue, Camélia Guerraoui, Wenzhi Wang, Shoichi Naito, Jungmin Choi, Kentaro Inui | N/A | N/A |
| Advancing Social Intelligence in AI Agents: Technical Challenges and Open Questions | Leena Mathur, Paul Pu Liang, Louis-Philippe Morency | N/A | N/A |
| RAt: Injecting Implicit Bias for Text-To-Image Prompt Refinement Models | Ziyi Kou, Shichao Pei, Meng Jiang, Xiangliang Zhang | N/A | N/A |
| Can LLM Generate Culturally Relevant Commonsense QA Data? Case Study in Indonesian and Sundanese | Rifki Afina Putri, Faiz Ghifari Haznitrama, Dea Adhista, Alice Oh | N/A | N/A |
| Learnability of Indirect Evidence in Language Models | Miyu Oba, Yohei Oseki, Akiyo Fukatsu, Akari Haga, Hiroki Ouchi, Taro Watanabe, Saku Sugawara | N/A | N/A |
| Do LLMs Know to Respect Copyright Notice? | Jialiang Xu, Shenglan Li, Zhaozhuo Xu, Denghui Zhang | N/A | N/A |
| SpecHub: Provable Acceleration to Multi-Draft Speculative Decoding | Hanchi Sun, Tianyi Zhou, Xun Chen, Lichao Sun | N/A | N/A |
| Interventional Speech Noise Injection for ASR Generalizable Spoken Language Understanding | YeonJoon Jung, Jaeseong Lee, Seungtaek Choi, Dohyeon Lee, Minsoo Kim, seung-won hwang | N/A | N/A |
| Rethinking the Role of Proxy Rewards in Language Model Alignment | Sungdong Kim, Minjoon Seo | N/A | N/A |
| Visual Text Matters: Improving Text-KVQA with Visual Text Entity Knowledge-aware Large Multimodal Assistant | Abhirama Subramanyam Penamakuri, Anand Mishra | N/A | N/A |
| How Good is my MT Metric? A Framework for the Interpretation of Metric Assessments | Stefano Perrella, Lorenzo Proietti, Pere-Lluís Huguet Cabot, Edoardo Barba, Roberto Navigli | N/A | N/A |
| IFCap: Image-like Retrieval and Frequency-based Entity Filtering for Zero-shot Captioning | Soeun Lee, Si-Woo Kim, Taewhan Kim, Dong-Jin Kim | N/A | N/A |
| SPREADSHEETLLM: Encoding Spreadsheets for Large Language Models | Haoyu Dong, Jianbo Zhao, Yuzhang Tian, Junyu Xiong, Shiyu Xia, Mengyu Zhou, Yun Lin, José Cambronero, Yeye He, Shi Han, Dongmei Zhang | N/A | N/A |
| Let’s discuss! Quality Dimensions and Annotated Datasets for Computational Argument Quality | Rositsa V Ivanova, Thomas Huber, Christina Niklaus | N/A | N/A |
| Automatic sentence segmentation of clinical record narratives in real-world data | Dongfang Xu, Davy Weissenbacher, Karen O’Connor, Siddharth Rawal, Graciela Gonzalez Hernandez | N/A | N/A |
| One-to-Many Communication and Compositionality in Emergent Communication | Heeyoung Lee | N/A | N/A |
| Bayesian Example Selection Improves In-Context Learning for Speech, Text, and Visual Modalities | Siyin Wang, Chao-Han Huck Yang, Ji Wu, Chao Zhang | N/A | N/A |
| Investigating Multilingual Instruction-Tuning: Do Polyglot Models Demand for Multilingual Instructions? | Alexander Arno Weber, Klaudia Thellmann, Jan Ebert, Nicolas Flores-Herr, Jens Lehmann, Michael Fromm, Mehdi Ali | N/A | N/A |
| Multi-LogiEval: Towards Evaluating Multi-Step Logical Reasoning Ability of Large Language Models | Nisarg Patel, Mohith Kulkarni, Mihir Parmar, Aashna Budhiraja, Mutsumi Nakamura, Neeraj Varshney, Chitta Baral | N/A | N/A |
| Contrastive Classification via Linear Layer Extrapolation | Mayukh Sharma, Sean O’Brien, Julian McAuley | N/A | N/A |
| Task Oriented In-Domain Data Augmentation | Xiao Liang, Xinyu Hu, Simiao Zuo, Yeyun Gong, Qiang Lou, Yi Liu, Shao-Lun Huang, Jian Jiao | N/A | N/A |
| SciDQA: A Deep Reading Comprehension Dataset over Scientific Papers | Shruti Singh, Nandan Sarkar, Arman Cohan | N/A | N/A |
| Mixture-of-Modules: Reinventing Transformers as Dynamic Assemblies of Modules | Zhuocheng Gong, Ang Lv, Jian Guan, Wei Wu, Huishuai Zhang, Minlie Huang, Dongyan Zhao, Rui Yan | N/A | N/A |
| No Culture Left Behind: ArtELingo-28, a Benchmark of WikiArt with Captions in 28 Languages | Youssef Mohamed, Runjia Li, Ibrahim Said Ahmad, Kilichbek Haydarov, Philip Torr, Kenneth Church, Mohamed Elhoseiny | N/A | N/A |
| PREDICT: Multi-Agent-based Debate Simulation for Generalized Hate Speech Detection | Someen Park, Jaehoon Kim, Seungwan Jin, Sohyun Park, Kyungsik Han | N/A | N/A |
| TokenVerse: Unifying Speech and NLP Tasks via Transducer-based ASR | Shashi Kumar, Srikanth Madikeri, Juan Pablo Zuluaga Gomez, Iuliia Thorbecke, Esaú Villatoro-Tello, Sergio Burdisso, Petr Motlicek, Karthik Pandia D S, Aravind Ganapathiraju | N/A | N/A |
| ApiQ: Finetuning of 2-Bit Quantized Large Language Model | Baohao Liao, Christian Herold, Shahram Khadivi, Christof Monz | N/A | N/A |
| Memorize Step by Step: Efficient Long-Context Prefilling with Incremental Memory and Decremental Chunk | Zhiyuan Zeng, Qipeng Guo, Xiaoran Liu, Zhangyue Yin, Wentao Shu, Mianqiu Huang, Bo Wang, Yunhua Zhou, Linlin Li, Qun Liu, Xipeng Qiu | N/A | N/A |
| A Morphology-Based Investigation of Positional Encodings | Poulami Ghosh, Shikhar Vashishth, Raj Dabre, Pushpak Bhattacharyya | N/A | N/A |
| I love pineapple on pizza != I hate pineapple on pizza: Stance-Aware Sentence Transformers for Opinion Mining | Vahid Ghafouri, Jose M. Such, Guillermo Suarez-Tangil | N/A | N/A |
| BiasWipe: Mitigating Unintended Bias in Text Classifiers through Model Interpretability | Mamta Mamta, Rishikant Chigrupaatii, Asif Ekbal | N/A | N/A |
| ArMeme: Propagandistic Content in Arabic Memes | Firoj Alam, Abul Hasnat, Fatema Ahmad, Md. Arid Hasan, Maram Hasanain | N/A | N/A |
| Language is Scary when Over-Analyzed: Unpacking Implied Misogynistic Reasoning with Argumentation Theory-Driven Prompts | Arianna Muti, Federico Ruggeri, Khalid Al Khatib, Alberto Barrón-Cedeño, Tommaso Caselli | N/A | N/A |
| Thoughts to Target: Enhance Planning for Target-driven Conversation | Zhonghua Zheng, Lizi Liao, Yang Deng, Ee-Peng Lim, Minlie Huang, Liqiang Nie | N/A | N/A |
| Scalable Data Ablation Approximations for Language Models through Modular Training and Merging | Clara Na, Ian Magnusson, Ananya Harsh Jha, Tom Sherborne, Emma Strubell, Jesse Dodge, Pradeep Dasigi | N/A | N/A |
| Exploring Intrinsic Language-specific Subspaces in Fine-tuning Multilingual Neural Machine Translation | Zhe Cao, Zhi Qu, Hidetaka Kamigaito, Taro Watanabe | N/A | N/A |
| Attention Score is not All You Need for Token Importance Indicator in KV Cache Reduction: Value Also Matters | Zhiyu Guo, Hidetaka Kamigaito, Taro Watanabe | N/A | N/A |
| Generative Subgraph Retrieval for Knowledge Graph–Grounded Dialog Generation | Jinyoung Park, Minseok Joo, Joo-Kyung Kim, Hyunwoo J. Kim | N/A | N/A |
| Adapters Mixup: Mixing Parameter-Efficient Adapters to Enhance the Adversarial Robustness of Fine-tuned Pre-trained Text Classifiers | Tuc Van Nguyen, Thai Le | N/A | N/A |
| Generalizing Clinical De-identification Models by Privacy-safe Data Augmentation using GPT-4 | Woojin Kim, Sungeun Hahm, Jaejin Lee | N/A | N/A |
| Connecting the Dots: Evaluating Abstract Reasoning Capabilities of LLMs Using the New York Times Connections Word Game | Prisha Samdarshi, Mariam Mustafa, Anushka Kulkarni, Raven Rothkopf, Tuhin Chakrabarty, Smaranda Muresan | N/A | N/A |
| GottBERT: a pure German Language Model | Raphael Scheible, Johann Frei, Fabian Thomczyk, Henry He, Patric Tippmann, Jochen Knaus, Victor Jaravine, Frank Kramer, Martin Boeker | N/A | N/A |
| Computational Meme Understanding: A Survey | Khoi P. N. Nguyen, Vincent Ng | N/A | N/A |
| CoverICL: Selective Annotation for In-Context Learning via Active Graph Coverage | Costas Mavromatis, Balasubramaniam Srinivasan, Zhengyuan Shen, Jiani Zhang, Huzefa Rangwala, Christos Faloutsos, George Karypis | N/A | N/A |
| Retrieval-enriched zero-shot image classification in low-resource domains | Nicola Dall’Asen, Yiming Wang, Enrico Fini, Elisa Ricci | N/A | N/A |
| I-AM-G: Interest Augmented Multimodal Generator for Item Personalization | Xianquan Wang, Likang Wu, Shukang Yin, Zhi Li, Yanjiang Chen, hufeng, Yu Su, Qi Liu | N/A | N/A |
| Twists, Humps, and Pebbles: Multilingual Speech Recognition Models Exhibit Gender Performance Gaps | Giuseppe Attanasio, Beatrice Savoldi, Dennis Fucci, Dirk Hovy | N/A | N/A |
| Enhancing Language Model Alignment: A Confidence-Based Approach to Label Smoothing | Baihe Huang, Hiteshi Sharma, Yi Mao | N/A | N/A |
| Contrastive Policy Gradient: Aligning LLMs on sequence-level scores in a supervised-friendly fashion | Yannis Flet-Berliac, Nathan Grinsztajn, Florian Strub, Eugene Choi, Bill Wu, Chris Cremer, Arash Ahmadian, Yash Chandak, Mohammad Gheshlaghi Azar, Olivier Pietquin, Matthieu Geist | N/A | N/A |
| Show and Guide: Instructional-Plan Grounded Vision and Language Model | Diogo Glória-Silva, David Semedo, Joao Magalhaes | N/A | N/A |
| Beyond Turn-Based Interfaces: Synchronous LLMs as Full-Duplex Dialogue Agents | Bandhav Veluri, Benjamin N Peloquin, Bokai Yu, Hongyu Gong, Shyamnath Gollakota | N/A | N/A |
| QuBE: Question-based Belief Enhancement for Agentic LLM | Minsoo Kim, Jongyoon Kim, Jihyuk Kim, seung-won hwang | N/A | N/A |
| COMPACT: Compressing Retrieved Documents Actively for Question Answering | Chanwoong Yoon, Taewhoo Lee, Hyeon Hwang, Minbyul Jeong, Jaewoo Kang | N/A | N/A |
| An Empirical Analysis on Spatial Reasoning Capabilities of Large Multimodal Models | Fatemeh Shiri, Xiao-Yu Guo, Mona Golestan Far, Xin Yu, Reza Haf, Yuan-Fang Li | N/A | N/A |
| Synthetic Knowledge Ingestion: Towards Knowledge Refinement and Injection for Enhancing Large Language Models | Jiaxin Zhang, Wendi Cui, Yiran Huang, Kamalika Das, Sricharan Kumar | N/A | N/A |
| Local Contrastive Editing of Gender Stereotypes | Marlene Lutz, Rochelle Choenni, Markus Strohmaier, Anne Lauscher | N/A | N/A |
| De-Identification of Sensitive Personal Data in Datasets Derived from IIT-CDIP | Stefan Larson, Nicole Cornehl Lima, Santiago Pedroza Diaz, Amogh Manoj Joshi, Siddharth Betala, Jamiu Tunde Suleiman, Yash Mathur, Kaushal Kumar Prajapati, Ramla Alakraa, Junjie Shen, Temi Okotore, Kevin Leach | N/A | N/A |
| RAR: Retrieval Augmented Retrieval for Code Generation in Low Resource Languages | Avik Dutta, Mukul Singh, Gust Verbruggen, Sumit Gulwani, Vu Le | N/A | N/A |
| STAR: SocioTechnical Approach to Red Teaming Language Models | Laura Weidinger, John F J Mellor, Bernat Guillén Pegueroles, Nahema Marchal, Ravin Kumar, Kristian Lum, Canfer Akbulut, Mark Diaz, A. Stevie Bergman, Mikel D. Rodriguez, Verena Rieser, William Isaac | N/A | N/A |
| Do great minds think alike? Investigating Human-AI Complementarity for Question Answering | Maharshi Gor, Hal Daumé III, Tianyi Zhou, Jordan Lee Boyd-Graber | N/A | N/A |
| Memory-Efficient Fine-Tuning of Transformers via Token Selection | Antoine Simoulin, Namyong Park, Xiaoyi Liu, Grey Yang | N/A | N/A |
| Unveiling the mystery of visual attributes of concrete and abstract concepts: Variability, nearest neighbors, and challenging categories | Tarun Tater, Sabine Schulte im Walde, Diego Frassinelli | N/A | N/A |
| Evaluating Large Language Models on Time Series Feature Understanding: A Comprehensive Taxonomy and Benchmark | Elizabeth Fons, Rachneet Kaur, Soham Palande, Zhen Zeng, Tucker Balch, Manuela Veloso, Svitlana Vyetrenko | N/A | N/A |
| Can LLMs Learn Uncertainty on Their Own? Expressing Uncertainty Effectively in A Self-Training Manner | Shudong Liu, Zhaocong Li, Xuebo Liu, Runzhe Zhan, Derek F. Wong, Lidia S. Chao, Min Zhang | N/A | N/A |
| Preference-Guided Reflective Sampling for Aligning Language Models | Hai Ye, Hwee Tou Ng | N/A | N/A |
| Metrics for What, Metrics for Whom: Assessing Actionability of Bias Evaluation Metrics in NLP | Pieter Delobelle, Giuseppe Attanasio, Debora Nozza, Su Lin Blodgett, Zeerak Talat | N/A | N/A |
| Is this the real life? Is this just fantasy? The Misleading Success of Simulating Social Interactions With LLMs | Xuhui Zhou, Zhe Su, Tiwalayo Eisape, Hyunwoo Kim, Maarten Sap | N/A | N/A |
| A Simple LLM Framework for Long-Range Video Question-Answering | Ce Zhang, Taixi Lu, Md Mohaiminul Islam, Ziyang Wang, Shoubin Yu, Mohit Bansal, Gedas Bertasius | N/A | N/A |
| Rebuilding ROME: Resolving Model Collapse during Sequential Model Editing | Akshat Gupta, Sidharth Baskaran, Gopala Anumanchipalli | N/A | N/A |
| Casablanca: Data and Models for Multidialectal Arabic Speech Recognition | Bashar Talafha, Karima Kadaoui, Samar Mohamed Magdy, Mariem Habiboullah, Chafei Mohamed Chafei, Ahmed Oumar El-Shangiti, Hiba Zayed, Mohamedou Cheikh Tourad, Rahaf Alhamouri, Rwaa Assi, Aisha Alraeesi, Hour Mohamed, Fakhraddin Alwajih, Abdelrahman Mohamed, Abdellah El Mekki, El Moatez Billah Nagoudi, Benelhadj Djelloul Mama Saadia, Hamzah A. Alsayadi, Walid Al-Dhabyani, Sara Shatnawi, Yasir Ech-Chammakhy, Amal Makouar, Yousra Berrachedi, Mustafa Jarrar, Shady Shehata, Ismail Berrada, Muhammad Abdul-Mageed | N/A | N/A |
| Safety Arithmetic: A Framework for Test-time Safety Alignment of Language Models by Steering Parameters and Activations | Rima Hazra, Sayan Layek, Somnath Banerjee, Soujanya Poria | N/A | N/A |
| Communicating with Speakers and Listeners of Different Pragmatic Levels | Kata Naszadi, Frans A Oliehoek, Christof Monz | N/A | N/A |
| RECANTFormer: Referring Expression Comprehension with Varying Numbers of Targets | Bhathiya Hemanthage, Hakan Bilen, Phil Bartie, Christian Dondrup, Oliver Lemon | N/A | N/A |
| Sprout: Green Generative AI with Carbon-Efficient LLM Inference | Baolin Li, Yankai Jiang, Vijay Gadepally, Devesh Tiwari | N/A | N/A |
| Do LLMs Plan Like Human Writers? Comparing Journalist Coverage of Press Releases with LLMs | Alexander Spangher, Nanyun Peng, Sebastian Gehrmann, Mark Dredze | N/A | N/A |
| T-FREE: Tokenizer-Free Generative LLMs via Sparse Representations for Memory-Efficient Embeddings | Björn Deiseroth, Manuel Brack, Samuel Weinbach, Patrick Schramowski, Kristian Kersting | N/A | N/A |
| SpeechQE: Estimating the Quality of Direct Speech Translation | HyoJung Han, Kevin Duh, Marine Carpuat | N/A | N/A |
| Assessing and Verifying Task Utility in LLM-Powered Applications | Negar Arabzadeh, Siqing Huo, Nikhil Mehta, Qingyun Wu, Chi Wang, Ahmed Hassan Awadallah, Charles L. A. Clarke, Julia Kiseleva | N/A | N/A |
| Dynamic Rewarding with Prompt Optimization Enables Tuning-free Self-Alignment of Language Models | Somanshu Singla, Zhen Wang, Tianyang Liu, Abdullah Ashfaq, Zhiting Hu, Eric P. Xing | N/A | N/A |
| Accurate and Data-Efficient Toxicity Prediction when Annotators Disagree | Harbani Jaggi, Kashyap Coimbatore Murali, Eve Fleisig, Erdem Biyik | N/A | N/A |
| Adversarial Text Generation using Large Language Models for Dementia Detection | Youxiang Zhu, Nana Lin, Kiran Sandilya Balivada, Daniel Haehn, Xiaohui Liang | N/A | N/A |
| xCOMET-lite: Bridging the Gap Between Efficiency and Quality in Learned MT Evaluation Metrics | Daniil Larionov, Mikhail Seleznyov, Vasiliy Viskov, Alexander Panchenko, Steffen Eger | N/A | N/A |
| The Greatest Good Benchmark: Measuring LLMs’ Alignment with Utilitarian Moral Dilemmas | Giovanni Franco Gabriel Marraffini, Andrés Cotton, Noe Fabian Hsueh, Juan Wisznia, Axel Fridman, Luciano Del Corro | N/A | N/A |
| FairFlow: Mitigating Dataset Biases through Undecided Learning for Natural Language Understanding | Jiali Cheng, Hadi Amiri | N/A | N/A |
| Style-Shifting Behaviour of the Manosphere on Reddit | Jai Aggarwal, Suzanne Stevenson | N/A | N/A |
| The Death and Life of Great Prompts: Analyzing the Evolution of LLM Prompts from the Structural Perspective | Yihan Ma, Xinyue Shen, Yixin Wu, Boyang Zhang, Michael Backes, Yang Zhang | N/A | N/A |
| Holistic Evaluation for Interleaved Text-and-Image Generation | Minqian Liu, Zhiyang Xu, Zihao Lin, Trevor Ashby, Joy Rimchala, Jiaxin Zhang, Lifu Huang | N/A | N/A |
| FOLIO: Natural Language Reasoning with First-Order Logic | Simeng Han, Hailey Schoelkopf, Yilun Zhao, Zhenting Qi, Martin Riddell, Wenfei Zhou, James Coady, David Peng, Yujie Qiao, Luke Benson, Lucy Sun, Alexander Wardle-Solano, Hannah Szabó, Ekaterina Zubova, Matthew Burtell, Jonathan Fan, Yixin Liu, Brian Wong, Malcolm Sailor, Ansong Ni, Linyong Nan, Jungo Kasai, Tao Yu, Rui Zhang, Alexander Fabbri, Wojciech Maciej Kryscinski, Semih Yavuz, Ye Liu, Xi Victoria Lin, Shafiq Joty, Yingbo Zhou, Caiming Xiong, Rex Ying, Arman Cohan, Dragomir Radev | N/A | N/A |
| The LLM Effect: Are Humans Truly Using LLMs, or Are They Being Influenced By Them Instead? | Alexander Choi, Syeda Sabrina Akter, J.P. Singh, Antonios Anastasopoulos | N/A | N/A |
| Is Child-Directed Speech Effective Training Data for Language Models? | Steven Y. Feng, Noah Goodman, Michael Frank | N/A | N/A |
| RevMUX: Data Multiplexing with Reversible Adapters for Efficient LLM Batch Inference | Yige Xu, Xu Guo, Zhiwei Zeng, Chunyan Miao | N/A | N/A |
| HCEG: Improving the Abstraction Ability of Language Models with Hierarchical Conceptual Entailment Graphs | Juncai Li, Ru Li, Xiaoli Li, Qinghua Chai, Jeff Z. Pan | N/A | N/A |
| M3Hop-CoT: Misogynous Meme Identification with Multimodal Multi-hop Chain-of-Thought | Gitanjali Kumari, Kirtan Jain, Asif Ekbal | N/A | N/A |
| GPT-4 Jailbreaks Itself with Near-Perfect Success Using Self-Explanation | Govind Ramesh, Yao Dou, Wei Xu | N/A | N/A |
| RE-RAG: Improving Open-Domain QA Performance and Interpretability with Relevance Estimator in Retrieval-Augmented Generation | Kiseung Kim, Jay-Yoon Lee | N/A | N/A |
| Evaluating Concurrent Robustness of Language Models Across Diverse Challenge Sets | Vatsal Gupta, Pranshu Pandya, Tushar Kataria, Vivek Gupta, Dan Roth | N/A | N/A |
| Simul-MuST-C: Simultaneous Multilingual Speech Translation Corpus Using Large Language Model | Mana Makinae, Yusuke Sakai, Hidetaka Kamigaito, Taro Watanabe | N/A | N/A |
| Is This a Bad Table? A Closer Look at the Evaluation of Table Generation from Text | Pritika Ramu, Aparna Garimella, Sambaran Bandyopadhyay | N/A | N/A |
| On the Fragility of Active Learners for Text Classification | Abhishek Ghose, Emma Thuong Nguyen | N/A | N/A |
| BMRetriever: Tuning Large Language Models as Better Biomedical Text Retrievers | Ran Xu, Wenqi Shi, Yue Yu, Yuchen Zhuang, Yanqiao Zhu, May Dongmei Wang, Joyce C. Ho, Chao Zhang, Carl Yang | N/A | N/A |
| Comparing Neighbors Together Makes it Easy: Jointly Comparing Multiple Candidates for Efficient and Effective Retrieval | Jonghyun Song, Cheyon Jin, Wenlong Zhao, Jay-Yoon Lee | N/A | N/A |
| M3D: MultiModal MultiDocument Fine-Grained Inconsistency Detection | Chia-Wei Tang, Ting-Chih Chen, Alvi Md Ishmam, Kiet A. Nguyen, Kazi Sajeed Mehrab, Chris Thomas | N/A | N/A |
| MedAdapter: Efficient Test-Time Adaptation of Large Language Models Towards Medical Reasoning | Wenqi Shi, Ran Xu, Yuchen Zhuang, Yue Yu, Haotian Sun, Hang Wu, Carl Yang, May Dongmei Wang | N/A | N/A |
| EHRAgent: Code Empowers Large Language Models for Few-shot Complex Tabular Reasoning on Electronic Health Records | Wenqi Shi, Ran Xu, Yuchen Zhuang, Yue Yu, Jieyu Zhang, Hang Wu, Yuanda Zhu, Joyce C. Ho, Carl Yang, May Dongmei Wang | N/A | N/A |
| SimLLM: Detecting Sentences Generated by Large Language Models Using Similarity between the Generation and its Re-generation | Hoang-Quoc Nguyen-Son, Minh-Son Dao, Koji Zettsu | N/A | N/A |
| CELLO: Causal Evaluation of Large Vision-Language Models | Meiqi Chen, Bo Peng, Yan Zhang, Chaochao Lu | N/A | N/A |
| Simultaneous Interpretation Corpus Construction by Large Language Models in Distant Language Pair | Yusuke Sakai, Mana Makinae, Hidetaka Kamigaito, Taro Watanabe | N/A | N/A |
| Training-free Deep Concept Injection Enables Language Models for Video Question Answering | Xudong Lin, Manling Li, Richard Zemel, Heng Ji, Shih-Fu Chang | N/A | N/A |
| MIBench: Evaluating Multimodal Large Language Models over Multiple Images | Haowei Liu, Xi Zhang, Haiyang Xu, Yaya Shi, Chaoya Jiang, Ming Yan, Ji Zhang, Fei Huang, Chunfeng Yuan, Bing Li, Weiming Hu | N/A | N/A |
| ZEBRA: Zero-Shot Example-Based Retrieval Augmentation for Commonsense Question Answering | Francesco Maria Molfese, Simone Conia, Riccardo Orlando, Roberto Navigli | N/A | N/A |
| ABLE: Personalized Disability Support with Politeness and Empathy Integration | Kshitij Mishra, Manisha Burja, Asif Ekbal | N/A | N/A |
| Language Models as Compilers: Simulating Pseudocode Execution Improves Algorithmic Reasoning in Language Models | Hyungjoo Chae, Yeonghyeon Kim, Seungone Kim, Kai Tzu-iunn Ong, Beong-woo Kwak, Moohyeon Kim, Sunghwan Kim, Taeyoon Kwon, Jiwan Chung, Youngjae Yu, Jinyoung Yeo | N/A | N/A |
| Coffee-Gym: An Environment for Evaluating and Improving Natural Language Feedback on Erroneous Code | Hyungjoo Chae, Taeyoon Kwon, Seungjun Moon, Yongho Song, Dongjin Kang, Kai Tzu-iunn Ong, Beong-woo Kwak, Seonghyeon Bae, seung-won hwang, Jinyoung Yeo | N/A | N/A |
| Improving Minimum Bayes Risk Decoding with Multi-Prompt | David Heineman, Yao Dou, Wei Xu | N/A | N/A |
| Deciphering Cognitive Distortions in Patient-Doctor Mental Health Conversations: A Multimodal LLM-Based Detection and Reasoning Framework | Gopendra Vikram Singh, Sai Vardhan Vemulapalli, Mauajama Firdaus, Asif Ekbal | N/A | N/A |
| Nearest Neighbor Normalization Improves Multimodal Retrieval | Neil Chowdhury, Franklin Wang, Sumedh Shenoy, Douwe Kiela, Sarah Schwettmann, Tristan Thrush | N/A | N/A |
| Rethinking Pragmatics in Large Language Models: Towards Open-Ended Evaluation and Preference Tuning | Shengguang Wu, Shusheng Yang, Zhenglun Chen, Qi Su | N/A | N/A |
| LongRAG: A Dual-perspective Retrieval-Augmented Generation Paradigm for Long-Context Question Answering | Qingfei Zhao, Ruobing Wang, Yukuo Cen, Daren Zha, Shicheng Tan, Yuxiao Dong, Jie Tang | N/A | N/A |
| Context-aware Watermark with Semantic Balanced Green-red Lists for Large Language Models | Yuxuan Guo, Zhiliang Tian, Yiping Song, Tianlun Liu, Liang Ding, Dongsheng Li | N/A | N/A |
| Knowledge Graph Enhanced Large Language Model Editing | Mengqi Zhang, Xiaotian Ye, Qiang Liu, Pengjie Ren, Shu Wu, Zhumin Chen | N/A | N/A |
| ‘Quis custodiet ipsos custodes?’ Who will watch the watchmen? On Detecting AI-generated peer-reviews | Sandeep Kumar, Mohit Sahu, Vardhan Gacche, Tirthankar Ghosal, Asif Ekbal | N/A | N/A |
| Mitigating Open-Vocabulary Caption Hallucinations | Assaf Ben-Kish, Moran Yanuka, Morris Alper, Raja Giryes, Hadar Averbuch-Elor | N/A | N/A |
| Initialization of Large Language Models via Reparameterization to Mitigate Loss Spikes | Kosuke Nishida, Kyosuke Nishida, Kuniko Saito | N/A | N/A |
| ALVIN: Active Learning Via INterpolation | Michalis Korakakis, Andreas Vlachos | N/A | N/A |
| Filtered Direct Preference Optimization | Tetsuro Morimura, Mitsuki Sakamoto, Yuu Jinnai, Kenshi Abe, Kaito Ariu | N/A | N/A |
| Instruction Fine-Tuning: Does Prompt Loss Matter? | Mathew Huerta-Enochian, Seung Yong Ko | N/A | N/A |
| Entity Insertion in Multilingual Linked Corpora: The Case of Wikipedia | Tomás Feith, Akhil Arora, Martin Gerlach, Debjit Paul, Robert West | N/A | N/A |
EMNLP 2022
| Title | Author | PDF_Link | Code_URL |
|---|---|---|---|
| UniGen: Universal Domain Generalization for Sentiment Classification via Zero-shot Dataset Generation | Juhwan Choi, Yeonghwa Kim, Seunguk Yu, JungMin Yun, YoungBin Kim | N/A | N/A |
| Multi-News+: Cost-efficient Dataset Cleansing via LLM-based Data Annotation | Juhwan Choi, JungMin Yun, Kyohoon Jin, YoungBin Kim | N/A | N/A |
| FIZZ: Factual Inconsistency Detection by Zoom-in Summary and Zoom-out Document | Joonho Yang, Seunghyun Yoon, ByeongJeong Kim, Hwanhee Lee | N/A | N/A |
| Prompts have evil twins | Rimon Melamed, Lucas Hurley McCabe, Tanay Wakhare, Yejin Kim, H. Howie Huang, Enric Boix-Adserà | N/A | N/A |
| Table Question Answering for Low-resourced Indic Languages | Vaishali Pal, Evangelos Kanoulas, Andrew Yates, Maarten de Rijke | N/A | N/A |
| ImageInWords: Unlocking Hyper-Detailed Image Descriptions | Roopal Garg, Andrea Burns, Burcu Karagol Ayan, Yonatan Bitton, Ceslee Montgomery, Yasumasa Onoe, Andrew Bunner, Ranjay Krishna, Jason Michael Baldridge, Radu Soricut | N/A | N/A |
| LLM-Based Agent Society Investigation: Collaboration and Confrontation in Avalon Gameplay | Yihuai Lan, Zhiqiang Hu, Lei Wang, Yang Wang, Deheng Ye, Peilin Zhao, Ee-Peng Lim, Hui Xiong, Hao Wang | N/A | N/A |
| When LLMs Meets Acoustic Landmarks: An Efficient Approach to Integrate Speech into Large Language Models for Depression Detection | Xiangyu Zhang, Hexin Liu, Kaishuai Xu, Qiquan Zhang, Daijiao Liu, Beena Ahmed, Julien Epps | N/A | N/A |
| Speaking in Wavelet Domain: A Simple and Efficient Approach to Speed up Speech Diffusion Model | Xiangyu Zhang, Daijiao Liu, Hexin Liu, Qiquan Zhang, Hanyu Meng, Leibny Paola Garcia Perera, EngSiong Chng, Lina Yao | N/A | N/A |
| Hateful Word in Context Classification | Sanne Hoeken, Sina Zarrieß, Özge Alacam | N/A | N/A |
| Eyes Don’t Lie: Subjective Hate Annotation and Detection with Gaze | Özge Alacam, Sanne Hoeken, Sina Zarrieß | N/A | N/A |
| NumeroLogic: Number Encoding for Enhanced LLMs’ Numerical Reasoning | Eli Schwartz, Leshem Choshen, Joseph Shtok, Sivan Doveh, Leonid Karlinsky, Assaf Arbelle | N/A | N/A |
| Thinking Fair and Slow: On the Efficacy of Structured Prompts for Debiasing Language Models | Shaz Furniturewala, Surgan Jandial, Abhinav Java, Pragyan Banerjee, Simra Shahid, Sumit Bhatia, Kokil Jaidka | N/A | N/A |
| A Usage-centric Take on Intent Understanding in E-Commerce | Wendi Zhou, Tianyi Li, Pavlos Vougiouklis, Mark Steedman, Jeff Z. Pan | N/A | N/A |
| Fine-Tuning or Retrieval? Comparing Knowledge Injection in LLMs | Oded Ovadia, Menachem Brief, Moshik Mishaeli, Oren Elisha | N/A | N/A |
| Systematic Biases in LLM Simulations of Debates | Amir Taubenfeld, Yaniv Dover, Roi Reichart, Ariel Goldstein | N/A | N/A |
| Studying and Mitigating Biases in Sign Language Understanding Models | Katherine Atwell, Danielle Bragg, Malihe Alikhani | N/A | N/A |
| Uncertainty in Language Models: Assessment through Rank-Calibration | Xinmeng Huang, Shuo Li, Mengxin Yu, Matteo Sesia, Hamed Hassani, Insup Lee, Osbert Bastani, Edgar Dobriban | N/A | N/A |
| RoTBench: A Multi-Level Benchmark for Evaluating the Robustness of Large Language Models in Tool Learning | Junjie Ye, Yilong Wu, Songyang Gao, Caishuang Huang, Sixian Li, Guanyu Li, Xiaoran Fan, Qi Zhang, Tao Gui, Xuanjing Huang | N/A | N/A |
| Learning Planning-based Reasoning by Trajectories Collection and Process Reward Synthesizing | Fangkai Jiao, Chengwei Qin, Zhengyuan Liu, Nancy F. Chen, Shafiq Joty | N/A | N/A |
| Scaling Properties of Speech Language Models | Santiago Cuervo, Ricard Marxer | N/A | N/A |
| “We Demand Justice!”: Towards Social Context Grounding of Political Texts | Rajkumar Pujari, Chengfei Wu, Dan Goldwasser | N/A | N/A |
| An Experimental Analysis on Evaluating Patent Citations | Rabindra Nath Nandi, Suman Maity, Brian Uzzi, Sourav Medya | N/A | N/A |
| Fine-Tuning Large Language Models to Translate: Will a Touch of Noisy Data in Misaligned Languages Suffice? | Dawei Zhu, Pinzhen Chen, Miaoran Zhang, Barry Haddow, Xiaoyu Shen, Dietrich Klakow | N/A | N/A |
| Consolidating Ranking and Relevance Predictions of Large Language Models through Post-Processing | Le Yan, Zhen Qin, Honglei Zhuang, Rolf Jagerman, Xuanhui Wang, Michael Bendersky, Harrie Oosterhuis | N/A | N/A |
| Strength Lies in Differences! Towards Effective Non-collaborative Dialogues via Tailored Strategy Planning | Tong Zhang, Chen Huang, Yang Deng, Hongru Liang, Jia Liu, Zujie Wen, Wenqiang Lei, Tat-Seng Chua | N/A | N/A |
| Impeding LLM-assisted Cheating in Introductory Programming Assignments via Adversarial Perturbation | Saiful Islam Salim, Rubin Yuchan Yang, Alexander Cooper, Suryashree Ray, Saumya Debray, Sazzadur Rahaman | N/A | N/A |
| Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation | Yuan Ge, Yilun Liu, Chi Hu, Weibin Meng, Shimin Tao, Xiaofeng Zhao, Mahongxia, Zhang Li, Boxing Chen, Hao Yang, Bei Li, Tong Xiao, JingBo Zhu | N/A | N/A |
| On the Influence of Gender and Race in Romantic Relationship Prediction from Large Language Models | Abhilasha Sancheti, Haozhe An, Rachel Rudinger | N/A | N/A |
| EmphAssess: a Prosodic Benchmark on Assessing Emphasis Transfer in Speech-to-Speech Models | Maureen de Seyssel, Antony D’Avirro, Adina Williams, Emmanuel Dupoux | N/A | N/A |
| On Fake News Detection with LLM Enhanced Semantics Mining | Xiaoxiao Ma, Yuchen Zhang, Kaize Ding, Jian Yang, Jia Wu, Hao Fan | N/A | N/A |
| On Sensitivity of Learning with Limited Labelled Data to the Effects of Randomness: Impact of Interactions and Systematic Choices | Branislav Pecher, Ivan Srba, Maria Bielikova | N/A | N/A |
| Evaluating the Instruction-Following Robustness of Large Language Models to Prompt Injection | Zekun Li, Baolin Peng, Pengcheng He, Xifeng Yan | N/A | N/A |
| A Study of Nationality Bias in Names and Perplexity using Off-the-Shelf Affect-related Tweet Classifiers | Valentin Barriere, Sebastian Cifuentes | N/A | N/A |
| Mitigating the Alignment Tax of RLHF | Yong Lin, Hangyu Lin, Wei Xiong, Shizhe Diao, Jianmeng Liu, Jipeng Zhang, Rui Pan, Haoxiang Wang, Wenbin Hu, Hanning Zhang, Hanze Dong, Renjie Pi, Han Zhao, Nan Jiang, Heng Ji, Yuan Yao, Tong Zhang | N/A | N/A |
| Evaluating Readability and Faithfulness of Concept-based Explanations | Meng Li, Haoran Jin, Ruixuan Huang, Zhihao Xu, Defu Lian, Zijia Lin, Di Zhang, Xiting Wang | N/A | N/A |
| Personality-aware Student Simulation for Conversational Intelligent Tutoring Systems | Zhengyuan Liu, Stella Xin Yin, Geyu Lin, Nancy F. Chen | N/A | N/A |
| MSI-Agent: Incorporating Multi-Scale Insight into Embodied Agents for Superior Planning and Decision-Making | Dayuan Fu, Biqing Qi, Yihuai Gao, Che Jiang, Guanting Dong, Bowen Zhou | N/A | N/A |
| CoCoLoFa: A Dataset of News Comments with Common Logical Fallacies Written by LLM-Assisted Crowds | Min-Hsuan Yeh, Ruyuan Wan, Ting-Hao Kenneth Huang | N/A | N/A |
| Tokenization Is More Than Compression | Craig W Schmidt, Varshini Reddy, Haoran Zhang, Alec Alameddine, Omri Uzan, Yuval Pinter, Chris Tanner | N/A | N/A |
| FLIRT: Feedback Loop In-context Red Teaming | Ninareh Mehrabi, Palash Goyal, Christophe Dupuy, Qian Hu, Shalini Ghosh, Richard Zemel, Kai-Wei Chang, Aram Galstyan, Rahul Gupta | N/A | N/A |
| Successfully Guiding Humans with Imperfect Instructions by Highlighting Potential Errors and Suggesting Corrections | Lingjun Zhao, Khanh Xuan Nguyen, Hal Daumé III | N/A | N/A |
| Parameter-Efficient Sparsity Crafting from Dense to Mixture-of-Experts for Instruction Tuning on General Tasks | Haoyuan Wu, Haisheng Zheng, Zhuolun He, Bei Yu | N/A | N/A |
| GeoGPT4V: Towards Geometric Multi-modal Large Language Models with Geometric Image Generation | Shihao Cai, Keqin Bao, Hangyu Guo, Jizhi Zhang, Jun Song, Bo Zheng | N/A | N/A |
| Improved Learned Sparse Retrieval with Entity Vocabulary | Thong Nguyen, Shubham Chatterjee, Sean MacAvaney, Iain Mackie, Jeff Dalton, Andrew Yates | N/A | N/A |
| Let the Expert Stick to His Last: Expert-Specialized Fine-Tuning for Sparse Architectural Large Language Models | Zihan Wang, Deli Chen, Damai Dai, Runxin Xu, Zhuoshu Li, Yu Wu | N/A | N/A |
| LongEmbed: Extending Embedding Models for Long Context Retrieval | Dawei Zhu, Liang Wang, Nan Yang, Yifan Song, Wenhao Wu, Furu Wei, Sujian Li | N/A | N/A |
| Making Large Language Models Better Reasoners with Orchestrated Streaming Experiences | Xiangyang Liu, Junliang He, Xipeng Qiu | N/A | N/A |
| Overcome Noise and Bias: Segmentation-Aided Multi-Granularity Denoising and Debiasing for Enhanced Quarduples Extraction in Dialogue | Xianlong Luo, Yihao Wang, Meng Yang | N/A | N/A |
| Integrating Plutchik’s Theory with Mixture of Experts for Enhancing Emotion Classification | Dongjun Lim, Yun-Gyung Cheong | N/A | N/A |
| In-context Contrastive Learning for Event Causality Identification | Chao Liang, Wei Xiang, Bang Wang | N/A | N/A |
| What’s Mine becomes Yours: Defining, Annotating and Detecting Context-Dependent Paraphrases in News Interview Dialogs | Anna Wegmann, Tijs A. van den Broek, Dong Nguyen | N/A | N/A |
| Language Models Learn Rare Phenomena from Less Rare Phenomena: The Case of the Missing AANNs | Kanishka Misra, Kyle Mahowald | N/A | N/A |
| Large Language Models for Data Annotation: A Survey | Zhen Tan, Dawei Li, Song Wang, Alimohammad Beigi, Bohan Jiang, Amrita Bhattacharjee, Mansooreh Karami, Jundong Li, Lu Cheng, Huan Liu | N/A | N/A |
| Chain-of-Dictionary Prompting Elicits Translation in Large Language Models | Hongyuan Lu, Haoran Yang, Haoyang Huang, Dongdong Zhang, Wai Lam, Furu Wei | N/A | N/A |
| AdaZeta: Adaptive Zeroth-Order Tensor-Train Adaption for Memory-Efficient Large Language Models Fine-Tuning | Yifan Yang, Kai Zhen, Ershad Banijamali, Athanasios Mouchtaris, Zheng Zhang | N/A | N/A |
| RoseLoRA: Row and Column-wise Sparse Low-rank Adaptation of Pre-trained Language Model for Knowledge Editing and Fine-tuning | Haoyu Wang, Tianci Liu, Ruirui Li, Monica Xiao Cheng, Tuo Zhao, Jing Gao | N/A | N/A |
| BlendFilter: Advancing Retrieval-Augmented Large Language Models via Query Generation Blending and Knowledge Filtering | Haoyu Wang, Ruirui Li, Haoming Jiang, Jinjin Tian, Zhengyang Wang, Chen Luo, Xianfeng Tang, Monica Xiao Cheng, Tuo Zhao, Jing Gao | N/A | N/A |
| HEART-felt Narratives: Tracing Empathy and Narrative Style in Personal Stories with LLMs | Jocelyn J Shen, Joel Mire, Hae Won Park, Cynthia Breazeal, Maarten Sap | N/A | N/A |
| Eliminating Biased Length Reliance of Direct Preference Optimization via Down-Sampled KL Divergence | Junru Lu, Jiazheng Li, Siyu An, Meng Zhao, Yulan He, Di Yin, Xing Sun | N/A | N/A |
| Bridging Cultures in the Kitchen: A Framework and Benchmark for Cross-Cultural Recipe Retrieval | Tianyi Hu, Maria Maistro, Daniel Hershcovich | N/A | N/A |
| RULE: Reliable Multimodal RAG for Factuality in Medical Vision Language Models | Peng Xia, Kangyu Zhu, Haoran Li, Hongtu Zhu, Yun Li, Gang Li, Linjun Zhang, Huaxiu Yao | N/A | N/A |
| A Reflective LLM-based Agent to Guide Zero-shot Cryptocurrency Trading | Yuan Li, Bingqiao Luo, Qian Wang, Nuo Chen, Xu Liu, Bingsheng He | N/A | N/A |
| A Survey on In-context Learning | Qingxiu Dong, Lei Li, Damai Dai, Ce Zheng, Jingyuan Ma, Rui Li, Heming Xia, Jingjing Xu, Zhiyong Wu, Baobao Chang, Xu Sun, Lei Li, Zhifang Sui | N/A | N/A |
| DocHieNet: A Large and Diverse Dataset for Document Hierarchy Parsing | Hangdi Xing, Changxu Cheng, Feiyu Gao, Zirui Shao, Zhi Yu, Jiajun Bu, Qi Zheng, Cong Yao | N/A | N/A |
| AMR-Evol: Adaptive Modular Response Evolution Elicits Better Knowledge Distillation for Large Language Models in Code Generation | Ziyang Luo, Xin Li, Hongzhan Lin, Jing Ma, Lidong Bing | N/A | N/A |
| EFUF: Efficient Fine-Grained Unlearning Framework for Mitigating Hallucinations in Multimodal Large Language Models | Shangyu Xing, Fei Zhao, Zhen Wu, Tuo An, Weihao Chen, Chunhui Li, Jianbing Zhang, Xinyu Dai | N/A | N/A |
| Rethinking Pruning Large Language Models: Benefits and Pitfalls of Reconstruction Error Minimization | Sungbin Shin, Wonpyo Park, Jaeho Lee, Namhoon Lee | N/A | N/A |
| LLMs Are Zero-Shot Context-Aware Simultaneous Translators | Roman Koshkin, Katsuhito Sudoh, Satoshi Nakamura | N/A | N/A |
| AgentReview: Exploring Peer Review Dynamics with LLM Agents | Yiqiao Jin, Qinlin Zhao, Yiyang Wang, Hao Chen, Kaijie Zhu, Yijia Xiao, Jindong Wang | N/A | N/A |
| ChatRetriever: Adapting Large Language Models for Generalized and Robust Conversational Dense Retrieval | Kelong Mao, Chenlong Deng, Haonan Chen, Fengran Mo, Zheng Liu, Tetsuya Sakai, Zhicheng Dou | N/A | N/A |
| Fairer Preferences Elicit Improved Human-Aligned Large Language Model Judgments | Han Zhou, Xingchen Wan, Yinhong Liu, Nigel Collier, Ivan Vulić, Anna Korhonen | N/A | N/A |
| Learning Interpretable Legal Case Retrieval via Knowledge-Guided Case Reformulation | Chenlong Deng, Kelong Mao, Zhicheng Dou | N/A | N/A |
| Effective Demonstration Annotation for In-Context Learning via Language Model-Based Determinantal Point Process | Peng Wang, Xiaobin Wang, Chao Lou, Shengyu Mao, Pengjun Xie, Yong Jiang | N/A | N/A |
| Pre-trained Language Models Do Not Help Auto-regressive Text-to-Image Generation | Yuhui Zhang, Brandon McKinzie, Zhe Gan, Vaishaal Shankar, Alexander T Toshev | N/A | N/A |
| QUDSELECT: Selective Decoding for Questions Under Discussion Parsing | Ashima Suvarna, Xiao Liu, Tanmay Parekh, Kai-Wei Chang, Nanyun Peng | N/A | N/A |
| Mitigating Language Bias of LMMs in Social Intelligence Understanding with Virtual Counterfactual Calibration | Peng Chen, Xiao-Yu Guo, Yuan-Fang Li, Xiaowang Zhang, Zhiyong Feng | N/A | N/A |
| Model Balancing Helps Low-data Training and Fine-tuning | Zihang Liu, Yuanzhe Hu, Tianyu Pang, Yefan Zhou, Pu Ren, Yaoqing Yang | N/A | N/A |
| Reuse Your Rewards: Reward Model Transfer for Zero-Shot Cross-Lingual Alignment | Zhaofeng Wu, Ananth Balashankar, Yoon Kim, Jacob Eisenstein, Ahmad Beirami | N/A | N/A |
| Large Language Models as Foundations for Next-Gen Dense Retrieval: A Comprehensive Empirical Assessment | Kun Luo, Minghao Qin, Zheng Liu, Shitao Xiao, Jun Zhao, Kang Liu | N/A | N/A |
| A New Pipeline for Knowledge Graph Reasoning Enhanced by Large Language Models Without Fine-Tuning | Zhongwu Chen, Long Bai, Zixuan Li, Zhen Huang, Xiaolong Jin, Yong Dou | N/A | N/A |
| Towards Tool Use Alignment of Large Language Models | Zhi-Yuan Chen, Shiqi Shen, Guangyao Shen, Gong Zhi, Xu Chen, Yankai Lin | N/A | N/A |
| DecorateLM: Data Engineering through Corpus Rating, Tagging, and Editing with Language Models | Ranchi Zhao, Zhen Leng Thai, Yifan Zhang, Shengding Hu, Jie Zhou, Yunqi Ba, Jie Cai, Zhiyuan Liu, Maosong Sun | N/A | N/A |
| Lookback Lens: Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps | Yung-Sung Chuang, Linlu Qiu, Cheng-Yu Hsieh, Ranjay Krishna, Yoon Kim, James R. Glass | N/A | N/A |
| Controllable Preference Optimization: Toward Controllable Multi-Objective Alignment | Yiju Guo, Ganqu Cui, Lifan Yuan, Ning Ding, Zexu Sun, Bowen Sun, Huimin Chen, Ruobing Xie, Jie Zhou, Yankai Lin, Zhiyuan Liu, Maosong Sun | N/A | N/A |
| Mitigating Matthew Effect: Multi-Hypergraph Boosted Multi-Interest Self-Supervised Learning for Conversational Recommendation | Yongsen Zheng, Ruilin Xu, Guohua Wang, Liang Lin | N/A | N/A |
| Advancing Event Causality Identification via Heuristic Semantic Dependency Inquiry Network | Haoran Li, Qiang Gao, Hongmei Wu, Li Huang | N/A | N/A |
| Exploring Union and Intersection of Visual Regions for Generating Questions, Answers, and Distractors | Wenjian Ding, Yao Zhang, Jun Wang, Adam Jatowt, Zhenglu Yang | N/A | N/A |
| UniFashion: A Unified Vision-Language Model for Multimodal Fashion Retrieval and Generation | Xiangyu Zhao, Yuehan Zhang, zhangwenlong, Xiao-Ming Wu | N/A | N/A |
| Tracking the perspectives of interacting language models | Hayden Helm, Brandon Duderstadt, Youngser Park, Carey Priebe | N/A | N/A |
| MAR: Matching-Augmented Reasoning for Enhancing Visual-based Entity Question Answering | Zhengxuan Zhang, Yin Wu, Yuyu Luo, Nan Tang | N/A | N/A |
| Can Large Language Models Always Solve Easy Problems if They Can Solve Harder Ones? | Zhe Yang, Yichang Zhang, Tianyu Liu, Jian Yang, Junyang Lin, Chang Zhou, Zhifang Sui | N/A | N/A |
| Watch Every Step! LLM Agent Learning via Iterative Step-level Process Refinement | Weimin Xiong, Yifan Song, Xiutian Zhao, Wenhao Wu, Xun Wang, Ke Wang, Cheng Li, Wei Peng, Sujian Li | N/A | N/A |
| Standardize: Aligning Language Models with Expert-Defined Standards for Content Generation | Joseph Marvin Imperial, Gail Forey, Harish Tayyar Madabushi | N/A | N/A |
| Cross-domain NER with Generated Task-Oriented Knowledge: An Empirical Study from Information Density Perspective | Zhihao Zhang, Sophia Yat Mei Lee, Junshuang Wu, Dong Zhang, Shoushan Li, Erik Cambria, Guodong Zhou | N/A | N/A |
| “Glue pizza and eat rocks” - Exploiting Vulnerabilities in Retrieval-Augmented Generative Models | Zhen Tan, Chengshuai Zhao, Raha Moraffah, Yifan Li, Song Wang, Jundong Li, Tianlong Chen, Huan Liu | N/A | N/A |
| Predicate Debiasing in Vision-Language Models Integration for Scene Graph Generation Enhancement | Yuxuan Wang, Xiaoyuan Liu | N/A | N/A |
| SHIELD: Evaluation and Defense Strategies for Copyright Compliance in LLM Text Generation | Xiaoze Liu, Ting Sun, Tianyang Xu, Feijie Wu, Cunxiang Wang, Xiaoqian Wang, Jing Gao | N/A | N/A |
| MatchTime: Towards Automatic Soccer Game Commentary Generation | Jiayuan Rao, Haoning Wu, Chang Liu, Yanfeng Wang, Weidi Xie | N/A | N/A |
| Rethinking Token Reduction for State Space Models | Zheng Zhan, Yushu Wu, Zhenglun Kong, Changdi Yang, Yifan Gong, Xuan Shen, Xue Lin, Pu Zhao, Yanzhi Wang | N/A | N/A |
| Triad: A Framework Leveraging a Multi-Role LLM-based Agent to Solve Knowledge Base Question Answering | Chang Zong, Yuchen Yan, Weiming Lu, Jian Shao, Yongfeng Huang, Heng Chang, Yueting Zhuang | N/A | N/A |
| MetaGPT: Merging Large Language Models Using Model Exclusive Task Arithmetic | Yuyan Zhou, Liang Song, Bingning Wang, Weipeng Chen | N/A | N/A |
| Event Causality Identification with Synthetic Control | Haoyu Wang, Fengze Liu, Jiayao Zhang, Dan Roth, Kyle Richardson | N/A | N/A |
| Retrieved Sequence Augmentation for Protein Representation Learning | Chang Ma, Haiteng Zhao, Lin Zheng, Jiayi Xin, Qintong Li, Lijun Wu, Zhihong Deng, Yang Young Lu, Qi Liu, Sheng Wang, Lingpeng Kong | N/A | N/A |
| HELPD: Mitigating Hallucination of LVLMs by Hierarchical Feedback Learning with Vision-enhanced Penalty Decoding | Fan Yuan, Chi Qin, Xiaogang Xu, Piji Li | N/A | N/A |
| TopViewRS: Vision-Language Models as Top-View Spatial Reasoners | Chengzu Li, Caiqi Zhang, Han Zhou, Nigel Collier, Anna Korhonen, Ivan Vulić | N/A | N/A |
| DA$^3$: A Distribution-Aware Adversarial Attack against Language Models | Yibo Wang, Xiangjue Dong, James Caverlee, Philip S. Yu | N/A | N/A |
| Evaluating Psychological Safety of Large Language Models | Xingxuan Li, Yutong Li, Lin Qiu, Shafiq Joty, Lidong Bing | N/A | N/A |
| An Effective Deployment of Diffusion LM for Data Augmentation in Low-Resource Sentiment Classification | Zhuowei Chen, Lianxi Wang, Yuben Wu, Xinfeng Liao, Yujia Tian, Junyang Zhong | N/A | N/A |
| Self-Bootstrapped Visual-Language Model for Knowledge Selection and Question Answering | Dongze Hao, Qunbo Wang, Longteng Guo, Jie Jiang, Jing Liu | N/A | N/A |
| PsFuture: A Pseudo-Future-based Zero-Shot Adaptive Policy for Simultaneous Machine Translation | Libo Zhao, Jing Li, Ziqian Zeng | N/A | N/A |
| TinyChart: Efficient Chart Understanding with Program-of-Thoughts Learning and Visual Token Merging | Liang Zhang, Anwen Hu, Haiyang Xu, Ming Yan, Yichen Xu, Qin Jin, Ji Zhang, Fei Huang | N/A | N/A |
| Do We Need Language-Specific Fact-Checking Models? The Case of Chinese | Caiqi Zhang, Zhijiang Guo, Andreas Vlachos | N/A | N/A |
| Enhancing Advanced Visual Reasoning Ability of Large Language Models | Zhiyuan Li, Dongnan Liu, Chaoyi Zhang, Heng Wang, Tengfei Xue, Weidong Cai | N/A | N/A |
| CMD: a framework for Context-aware Model self-Detoxification | Zecheng Tang, Keyan Zhou, Juntao Li, Yuyang Ding, Pinzheng Wang, Yan Bowen, Renjie Hua, Min Zhang | N/A | N/A |
| Embedding and Gradient Say Wrong: A White-Box Method for Hallucination Detection | Xiaomeng Hu, Yiming Zhang, Ru Peng, Haozhe Zhang, Chenwei Wu, Gang Chen, Junbo Zhao | N/A | N/A |
| TCSinger: Zero-Shot Singing Voice Synthesis with Style Transfer and Multi-Level Style Control | Yu Zhang, Ziyue Jiang, Ruiqi Li, Changhao Pan, Jinzheng He, Rongjie Huang, Chuxin Wang, Zhou Zhao | N/A | N/A |
| Be Helpful but Don’t Talk too Much - Enhancing Helpfulness in Conversations through Relevance in Multi-Turn Emotional Support | LI Junlin, Bo Peng, Yu-Yin Hsu, Chu-Ren Huang | N/A | N/A |
| Aligning Language Models to Explicitly Handle Ambiguity | Hyuhng Joon Kim, Youna Kim, Cheonbok Park, Junyeob Kim, Choonghyun Park, Kang Min Yoo, Sang-goo Lee, Taeuk Kim | N/A | N/A |
| Tag-grounded Visual Instruction Tuning with Retrieval Augmentation | Daiqing Qi, Handong Zhao, Zijun Wei, Sheng Li | N/A | N/A |
| GLaPE: Gold Label-agnostic Prompt Evaluation for Large Language Models | Xuanchang Zhang, Zhuosheng Zhang, Hai Zhao | N/A | N/A |
| Decoding the Echoes of Vision from fMRI: Memory Disentangling for Past Semantic Information | Runze Xia, Congchi Yin, Piji Li | N/A | N/A |
| Optimizing Code Retrieval: High-Quality and Scalable Dataset Annotation through Large Language Models | Rui Li, Qi Liu, Liyang He, Zheng Zhang, Hao Zhang, Shengyu Ye, Junyu Lu, Zhenya Huang | N/A | N/A |
| Towards Difficulty-Agnostic Efficient Transfer Learning for Vision-Language Models | Yongjin Yang, Jongwoo Ko, Se-Young Yun | N/A | N/A |
| Advancing Process Verification for Large Language Models via Tree-Based Preference Learning | Mingqian He, Yongliang Shen, Wenqi Zhang, Zeqi Tan, Weiming Lu | N/A | N/A |
| An Inversion Attack Against Obfuscated Embedding Matrix in Language Model Inference | Yu Lin, Qizhi Zhang, Quanwei Cai, Jue Hong, Wu Ye, Huiqi Liu, Bing Duan | N/A | N/A |
| MantisScore: A Reliable Fine-grained Metric for Video Generation | Xuan He, Dongfu Jiang, Ge Zhang, Max Ku, Achint Soni, Sherman Siu, Haonan Chen, Abhranil Chandra, Ziyan Jiang, Aaran Arulraj, Kai Wang, Quy Duc Do, Yuansheng Ni, Bohan Lyu, Yaswanth Narsupalli, Rongqi Fan, Zhiheng Lyu, Bill Yuchen Lin, Wenhu Chen | N/A | N/A |
| A ∧ B ⇔ B ∧ A: Evaluating and Improving Logical Reasoning Ability of Large Language Models | Yuxuan WAN, Wenxuan Wang, Yiliu Yang, Youliang Yuan, Jen-tse Huang, Pinjia He, Wenxiang Jiao, Michael Lyu | N/A | N/A |
| Integrating Structural Semantic Knowledge for Enhanced Information Extraction Pre-training | Xiaoyang Yi, Yuru Bao, Jian Zhang, Yifang Qin, Faxin Lin | N/A | N/A |
| FuseGen: PLM Fusion for Data-generation based Zero-shot Learning | Tianyuan Zou, Yang Liu, Peng Li, Jianqing Zhang, Jingjing Liu, Ya-Qin Zhang | N/A | N/A |
| I Need Help! Evaluating LLM’s Ability to Ask for Users’ Support: A Case Study on Text-to-SQL Generation | Cheng-Kuang Wu, Zhi Rui Tam, Chao-Chung Wu, Chieh-Yen Lin, Hung-yi Lee, Yun-Nung Chen | N/A | N/A |
| Oddballs and Misfits: Detecting Implicit Abuse in Which Identity Groups are Depicted as Deviating from the Norm | Michael Wiegand, Josef Ruppenhofer | N/A | N/A |
| By My Eyes: Grounding Multimodal Large Language Models with Sensor Data via Visual Prompting | Hyungjun Yoon, Biniyam Aschalew Tolera, Taesik Gong, Kimin Lee, Sung-Ju Lee | N/A | N/A |
| Prefixing Attention Sinks can Mitigate Activation Outliers for Large Language Model Quantization | Seungwoo Son, Wonpyo Park, Woohyun Han, Kyuyeun Kim, Jaeho Lee | N/A | N/A |
| CHIQ: Contextual History Enhancement for Improving Query Rewriting in Conversational Search | Fengran Mo, Abbas Ghaddar, Kelong Mao, Mehdi Rezagholizadeh, Boxing Chen, Qun Liu, Jian-Yun Nie | N/A | N/A |
| Towards Low-Resource Harmful Meme Detection with LMM Agents | Jianzhao Huang, Hongzhan Lin, Ziyan Liu, Ziyang Luo, Guang Chen, Jing Ma | N/A | N/A |
| VIVA: A Benchmark for Vision-Grounded Decision-Making with Human Values | Zhe Hu, Yixiao Ren, Jing Li, Yu Yin | N/A | N/A |
| Direct Multi-Turn Preference Optimization for Language Agents | Wentao Shi, Mengqi Yuan, Junkang Wu, Qifan Wang, Fuli Feng | N/A | N/A |
| Self-Refine Instruction-Tuning for Aligning Reasoning in Language Models | Leonardo Ranaldi, Andre Freitas | N/A | N/A |
| In Search of the Long-Tail: Systematic Generation of Long-Tail Inferential Knowledge via Logical Rule Guided Search | Huihan Li, Yuting Ning, Zeyi Liao, Siyuan Wang, Xiang Lorraine Li, Ximing Lu, Wenting Zhao, Faeze Brahman, Yejin Choi, Xiang Ren | N/A | N/A |
| AutoScraper: A Progressive Understanding Web Agent for Web Scraper Generation | Wenhao Huang, Zhouhong Gu, Chenghao Peng, Jiaqing Liang, Zhixu Li, Yanghua Xiao, Liqian Wen, Zulong Chen | N/A | N/A |
| Backward Lens: Projecting Language Model Gradients into the Vocabulary Space | Shahar Katz, Yonatan Belinkov, Mor Geva, Lior Wolf | N/A | N/A |
| Selective Vision is the Challenge for Visual Reasoning: A Benchmark for Visual Argument Understanding | Jiwan Chung, Sungjae Lee, Minseo Kim, Seungju Han, Ashkan Yousefpour, Jack Hessel, Youngjae Yu | N/A | N/A |
| Can visual language models resolve textual ambiguity with visual cues? Let visual puns tell you! | Jiwan Chung, Seungwon Lim, Jaehyun Jeon, Seungbeen Lee, Youngjae Yu | N/A | N/A |
| Reusing Transferable Weight Increments for Low-resource Style Generation | Chunzhen Jin, Eliot Huang, Heng Chang, Yaqi Wang, Peng Cao, Osmar Zaiane | N/A | N/A |
| Large Language Model as an Assignment Evaluator: Insights, Feedback, and Challenges in a 1000+ Student Course | Cheng-Han Chiang, Wei-Chih Chen, Chun-Yi Kuan, Chienchou Yang, Hung-yi Lee | N/A | N/A |
| Seemingly Plausible Distractors in Multi-Hop Reasoning: Are Large Language Models Attentive Readers? | Neeladri Bhuiya, Viktor Schlegel, Stefan Winkler | N/A | N/A |
| Instruction Pre-Training: Language Models are Supervised Multitask Learners | Daixuan Cheng, Yuxian Gu, Shaohan Huang, Junyu Bi, Minlie Huang, Furu Wei | N/A | N/A |
| LEMoE: Advanced Mixture of Experts Adaptor for Lifelong Model Editing of Large Language Models | Renzhi Wang, Piji Li | N/A | N/A |
| Collaborative Performance Prediction for Large Language Models | Qiyuan Zhang, Fuyuan Lyu, Xue Liu, Chen Ma | N/A | N/A |
| Surveying the Dead Minds: Historical-Psychological Text Analysis with Contextualized Construct Representation (CCR) for Classical Chinese | Yuqi Chen, Sixuan Li, Ying Li, Mohammad Atari | N/A | N/A |
| Knowledge Verification to Nip Hallucination in the Bud | Fanqi Wan, Xinting Huang, Leyang Cui, Xiaojun Quan, Wei Bi, Shuming Shi | N/A | N/A |
| QUITE: Quantifying Uncertainty in Natural Language Text in Bayesian Reasoning Scenarios | Timo Pierre Schrader, Lukas Lange, Simon Razniewski, Annemarie Friedrich | N/A | N/A |
| African or European Swallow? Benchmarking Large Vision-Language Models for Fine-Grained Object Classification | Gregor Geigle, Radu Timofte, Goran Glavaš | N/A | N/A |
| Whispers that Shake Foundations: Analyzing and Mitigating False Premise Hallucinations in Large Language Models | Hongbang Yuan, Pengfei Cao, Zhuoran Jin, Yubo Chen, Daojian Zeng, Kang Liu, Jun Zhao | N/A | N/A |
| To Word Senses and Beyond: Inducing Concepts with Contextualized Language Models | Bastien Liétard, Pascal Denis, Mikaela Keller | N/A | N/A |
| ASETF: A Novel Method for Jailbreak Attack on LLMs through Translate Suffix Embeddings | Hao Wang, Hao Li, Minlie Huang, Lei Sha | N/A | N/A |
| An Electoral Approach to Diversify LLM-based Multi-Agent Collective Decision-Making | Xiutian Zhao, Ke Wang, Wei Peng | N/A | N/A |
| Does Object Grounding Really Reduce Hallucination of Large Vision-Language Models? | Gregor Geigle, Radu Timofte, Goran Glavaš | N/A | N/A |
| Take Off the Training Wheels! Progressive In-Context Learning for Effective Alignment | Zhenyu Liu, Dongfang Li, Xinshuo Hu, Xinping Zhao, Yibin Chen, Baotian Hu, Min Zhang | N/A | N/A |
| MoDULA: Mixture of Domain-Specific and Universal LoRA for Multi-Task Learning | Yufei Ma, Zihan Liang, Huangyu Dai, Ben Chen, Dehong Gao, Zhuoran Ran, Zihan Wang, Linbo Jin, Wen Jiang, Guannan Zhang, Xiaoyan Cai, Libin Yang | N/A | N/A |
| Message Passing on Semantic-Anchor-Graphs for Fine-grained Emotion Representation Learning and Classification | Pinyi Zhang, Jingyang Chen, Junchen Shen, Zijie Zhai, Ping Li, Jie Zhang, Kai Zhang | N/A | N/A |
| PhiloGPT: A Philology-Oriented Large Language Model for Ancient Chinese Manuscripts with Dunhuang as Case Study | Yuqing Zhang, Baoyi He, Yihan Chen, Hangqi Li, Han Yue, Shengyu Zhang, Huaiyong Dou, Junchi Yan, Zemin Liu, Yongquan Zhang, Fei Wu | N/A | N/A |
| Alignment-Enhanced Decoding: Defending via Token-Level Adaptive Refining of Probability Distributions | Quan Liu, Zhenhong Zhou, Longzhu He, Yi Liu, Wei Zhang, Sen Su | N/A | N/A |
| MiniConGTS: A Near Ultimate Minimalist Contrastive Grid Tagging Scheme for Aspect Sentiment Triplet Extraction | Qiao Sun, Liujia Yang, Minghao Ma, Nanyang Ye, Qinying Gu | N/A | N/A |
| Evaluating Large Language Models via Linguistic Profiling | Alessio Miaschi, Felice Dell’Orletta, Giulia Venturi | N/A | N/A |
| With Ears to See and Eyes to Hear: Sound Symbolism Experiments with Multimodal Large Language Models | Tyler Loakman, YUCHENG LI, Chenghua Lin | N/A | N/A |
| KB-Plugin: A Plug-and-play Framework for Large Language Models to Induce Programs over Low-resourced Knowledge Bases | Jiajie Zhang, Shulin Cao, Linmei Hu, Ling Feng, Lei Hou, Juanzi Li | N/A | N/A |
| Understanding Higher-Order Correlations Among Semantic Components in Embeddings | Momose Oyama, Hiroaki Yamagiwa, Hidetoshi Shimodaira | N/A | N/A |
| DGLF: A Dual Graph-based Learning Framework for Multi-modal Sarcasm Detection | Zhihong Zhu, Kefan Shen, Zhaorun Chen, Yunyan Zhang, Yuyan Chen, Xiaoqi Jiao, Zhongwei Wan, Wei Liu, Xian Wu, Shaorong Xie, Yefeng Zheng | N/A | N/A |
| Evaluating D-MERIT of Partial-annotation on Information Retrieval | Royi Rassin, Yaron Fairstein, Oren Kalinsky, Guy Kushilevitz, Nachshon Cohen, Alexander Libov, Yoav Goldberg | N/A | N/A |
| Verification and Refinement of Natural Language Explanations through LLM-Symbolic Theorem Proving | XIN QUAN, Marco Valentino, Louise A. Dennis, Andre Freitas | N/A | N/A |
| Calibrating the Confidence of Large Language Models by Eliciting Fidelity | Mozhi Zhang, Mianqiu Huang, Rundong Shi, Linsen Guo, Chong Peng, Peng Yan, Yaqian Zhou, Xipeng Qiu | N/A | N/A |
| Exploring Reward Model Strength’s Impact on Language Models | Yanjun Chen, Dawei Zhu, Yirong Sun, Xinghao Chen, Wei Zhang, Xiaoyu Shen | N/A | N/A |
| How Hard is this Test Set? NLI Characterization by Exploiting Training Dynamics | Adrian Cosma, Stefan Ruseti, Mihai Dascalu, Cornelia Caragea | N/A | N/A |
| Zero-shot Cross-Lingual Transfer for Synthetic Data Generation in Grammatical Error Detection | Gaetan Lopez Latouche, Marc-André Carbonneau, Benjamin Swanson | N/A | N/A |
| CUTE: Measuring LLMs’ Understanding of Their Tokens | Lukas Edman, Helmut Schmid, Alexander Fraser | N/A | N/A |
| SEER: Self-Aligned Evidence Extraction for Retrieval-Augmented Generation | Xinping Zhao, Dongfang Li, Yan Zhong, Boren Hu, Yibin Chen, Baotian Hu, Min Zhang | N/A | N/A |
| On The Role of Context in Reading Time Prediction | Andreas Opedal, Eleanor Chodroff, Ryan Cotterell, Ethan Wilcox | N/A | N/A |
| BC-Prover: Backward Chaining Prover for Formal Theorem Proving | Yuhang He, Jihai Zhang, Jianzhu Bao, Fangquan Lin, Cheng Yang, Bing Qin, Ruifeng Xu, Wotao Yin | N/A | N/A |
| From Insights to Actions: The Impact of Interpretability and Analysis Research on NLP | Marius Mosbach, Vagrant Gautam, Tomás Vergara Browne, Dietrich Klakow, Mor Geva | N/A | N/A |
| Dual Modalities of Text: Visual and Textual Generative Pre-Training | Yekun Chai, Qingyi Liu, Jingwu Xiao, Shuohuan Wang, Yu Sun, Hua Wu | N/A | N/A |
| On Training Data Influence of GPT Models | Qingyi Liu, Yekun Chai, Shuohuan Wang, Yu Sun, Qiwei Peng, Hua Wu | N/A | N/A |
| Understanding “Democratization” in NLP and ML Research | Arjun Subramonian, Vagrant Gautam, Dietrich Klakow, Zeerak Talat | N/A | N/A |
| DocKD: Knowledge Distillation from LLMs for Open-World Document Understanding Models | Sungnyun Kim, Haofu Liao, Srikar Appalaraju, Peng Tang, Zhuowen Tu, Ravi Kumar Satzoda, R. Manmatha, Vijay Mahadevan, Stefano Soatto | N/A | N/A |
| Cross-lingual Transfer for Automatic Question Generation by Learning Interrogative Structures in Target Languages | Seonjeong Hwang, Yunsu Kim, Gary Lee | N/A | N/A |
| ScalingFilter: Assessing Data Quality through Inverse Utilization of Scaling Laws | Ruihang Li, Yixuan Wei, Miaosen Zhang, Nenghai Yu, Han Hu, Houwen Peng | N/A | N/A |
| Word Alignment as Preference for Machine Translation | Qiyu Wu, Masaaki Nagata, Zhongtao Miao, Yoshimasa Tsuruoka | N/A | N/A |
| Improving Multi-party Dialogue Generation via Topic and Rhetorical Coherence | Yaxin FAN, PEIFENG LI, Qiaoming Zhu | N/A | N/A |
| SEEKR: Selective Attention-Guided Knowledge Retention for Continual Learning of Large Language Models | Jinghan He, Haiyun Guo, Kuan Zhu, Zihan Zhao, Ming Tang, Jinqiao Wang | N/A | N/A |
| Neuron-Level Knowledge Attribution in Large Language Models | ZEPING YU, Sophia Ananiadou | N/A | N/A |
| How do Large Language Models Learn In-Context? Query and Key Matrices of In-Context Heads are Two Towers for Metric Learning | ZEPING YU, Sophia Ananiadou | N/A | N/A |
| Interpreting Arithmetic Mechanism in Large Language Models through Comparative Neuron Analysis | ZEPING YU, Sophia Ananiadou | N/A | N/A |
| Pixology: Probing the Linguistic and Visual Knowledge of Pixel-based Language Models | Kushal Tatariya, Vladimir Araujo, Thomas Bauwens, Miryam de Lhoneux | N/A | N/A |
| GoldCoin: Grounding Large Language Models in Privacy Laws via Contextual Integrity Theory | Wei Fan, Haoran Li, Zheye Deng, Weiqi Wang, Yangqiu Song | N/A | N/A |
| Noise, Novels, Numbers. A Framework for Detecting and Categorizing Noise in Danish and Norwegian Literature | ALI ALLAITH, Daniel Hershcovich, Jens Bjerring-Hansen, Jakob Ingemann Parby, Alexander Conroy, Timothy R Tangherlini | N/A | N/A |
| QUIK: Towards End-to-end 4-Bit Inference on Generative Large Language Models | Saleh Ashkboos, Ilia Markov, Elias Frantar, Tingxuan Zhong, Xincheng Wang, Jie Ren, Torsten Hoefler, Dan Alistarh | N/A | N/A |
| Fine-Grained Prediction of Reading Comprehension from Eye Movements | Omer Shubi, Yoav Meiri, Cfir Avraham Hadar, Yevgeni Berzak | N/A | N/A |
| Efficient Retriever for Multi-Hop Retrieval Question Answering | Ziyuan Zhuang, Zhiyang Zhang, Sitao Cheng, Fangkai Yang, Jia Liu, Shujian Huang, Qingwei Lin, Saravan Rajmohan, Dongmei Zhang, Qi Zhang | N/A | N/A |
| Unsupervised Human Preference Learning | Sumuk Shashidhar, Abhinav Chinta, Vaibhav Sahai, Dilek Hakkani Tur | N/A | N/A |
| Is Safer Better? The Impact of Guardrails on the Argumentative Strength of LLMs in Hate Speech Countering | Helena Bonaldi, Greta Damo, Nicolás Benjamín Ocampo, Elena Cabrio, Serena Villata, Marco Guerini | N/A | N/A |
| Leading Whitespaces of Language Models’ Subword Vocabulary Poses a Confound for Calculating Word Probabilities | Byung-Doh Oh, William Schuler | N/A | N/A |
| LLM4Decompile: Decompiling Binary Code with Large Language Models | Hanzhuo Tan, Qi Luo, Jing Li, Yuqun Zhang | N/A | N/A |
| From Bottom to Top: Extending the Potential of Parameter Efficient Fine-Tuning | Jihao Gu, Zelin Wang, Yibo Zhang, Ziji Zhang, Ping Gong | N/A | N/A |
| CoTKR: Chain-of-Thought Enhanced Knowledge Rewriting for Complex Knowledge Graph Question Answering | Yike Wu, Yi Huang, Nan Hu, YUNCHENG HUA, Guilin Qi, Jiaoyan Chen, Jeff Z. Pan | N/A | N/A |
| MTLS: Making Texts into Linguistic Symbols | Wenlong Fei, Xiaohua Wang, Min Hu, Qingyu Zhang, Hongbo Li | N/A | N/A |
| D2R: Dual-Branch Dynamic Routing Network for Multimodal Sentiment Detection | Yifan Chen, Kuntao Li, Weixing Mai, Qiaofeng Wu, Yun Xue, Fenghuan Li | N/A | N/A |
| A Generic Method for Fine-grained Category Discovery in Natural Language Texts | Chang Tian, Matthew B. Blaschko, Wenpeng Yin, Mingzhe Xing, Yinliang Yue, Marie-Francine Moens | N/A | N/A |
| Toxicity Detection is NOT all you Need: Measuring the Gaps to Supporting Volunteer Content Moderators through a User-Centric Method | Yang Trista Cao, Lovely-Frances Domingo, Sarah Gilbert, Michelle L. Mazurek, Katherine Shilton, Hal Daumé III | N/A | N/A |
| A User-Centric Multi-Intent Benchmark for Evaluating Large Language Models | Jiayin Wang, Fengran Mo, Weizhi Ma, Peijie Sun, Min Zhang, Jian-Yun Nie | N/A | N/A |
| Decompose and Compare Consistency: Measuring VLMs’ Answer Reliability via Task-Decomposition Consistency Comparison | Qian Yang, Weixiang Yan, Aishwarya Agrawal | N/A | N/A |
| Learn to Refuse: Making Large Language Models More Controllable and Reliable through Knowledge Scope Limitation and Refusal Mechanism | Lang Cao | N/A | N/A |
| VGBench: A Comprehensive Benchmark of Vector Graphics Understanding and Generation for Large Language Models | Bocheng Zou, Mu Cai, Jianrui Zhang, Yong Jae Lee | N/A | N/A |
| What do large language models need for machine translation evaluation? | Shenbin Qian, Archchana Sindhujan, Minnie Kabra, Diptesh Kanojia, Constantin Orasan, Tharindu Ranasinghe, Fred Blain | N/A | N/A |
| Performance-Guided LLM Knowledge Distillation for Efficient Text Classification at Scale | Flavio Di Palo, Prateek Singhi, Bilal H Fadlallah | N/A | N/A |
| External Knowledge-Driven Argument Mining: Leveraging Attention-Enhanced Multi-Network Models | Debela Gemechu, Chris Reed | N/A | N/A |
| C3PA: An Open Dataset of Expert-Annotated and Regulation-Aware Privacy Policies to Enable Scalable Regulatory Compliance Audits | Maaz Bin Musa, Rishab Nithyanand, Padmini Srinivasan, Mihailis E. Diamantis, Steven M. Winston, Garrison Allen, Jacob Schiller, Kevin Moore, Sean Quick, Johnathan Melvin | N/A | N/A |
| MPT: Multimodal Prompt Tuning for Zero-shot Instruction Learning | Taowen Wang, Yiyang Liu, James Chenhao Liang, Junhan Zhao, Yiming Cui, Yuning Mao, Shaoliang Nie, Jiahao Liu, Fuli Feng, Zenglin Xu, Cheng Han, Lifu Huang, Qifan Wang, Dongfang Liu | N/A | N/A |
| Text Grafting: Near-Distribution Weak Supervision for Minority Classes in Text Classification | Letian Peng, Yi Gu, Chengyu Dong, Zihan Wang, Jingbo Shang | N/A | N/A |
| Incubating Text Classifiers Following User Instruction with Nothing but LLM | Letian Peng, Zilong Wang, Jingbo Shang | N/A | N/A |
| PTD-SQL: Partitioning and Targeted Drilling with LLMs in Text-to-SQL | Ruilin Luo, Liyuan Wang, Binghuai Lin, Zicheng Lin, Yujiu Yang | N/A | N/A |
| Conditional and Modal Reasoning in Large Language Models | Wesley H. Holliday, Matthew Mandelkern, Cedegao E. Zhang | N/A | N/A |
| Advancing Large Language Model Attribution through Self-Improving | Lei Huang, Xiaocheng Feng, Weitao Ma, Liang Zhao, Yuchun Fan, Weihong Zhong, Dongliang Xu, Qing Yang, Hongtao Liu, Bing Qin | N/A | N/A |
| AlignCap: Aligning Speech Emotion Captioning to Human Preferences | Ziqi Liang, Haoxiang Shi, Hanhui Chen | N/A | N/A |
| Interpretability-based Tailored Knowledge Editing in Transformers | Yihuai Hong, Aldo Lipani | N/A | N/A |
| PRompt Optimization in Multi-Step Tasks (PROMST): Integrating Human Feedback and Heuristic-based Sampling | Yongchao Chen, Jacob Arkin, Yilun Hao, Yang Zhang, Nicholas Roy, Chuchu Fan | N/A | N/A |
| Empowering Large Language Model for Continual Video Question Answering with Collaborative Prompting | Chen Cai, Zheng Wang, Jianjun Gao, Wenyang Liu, Ye Lu, Runzhong Zhang, Kim-Hui Yap | N/A | N/A |
| Dissecting Fine-Tuning Unlearning in Large Language Models | Yihuai Hong, Yuelin Zou, Lijie Hu, Ziqian Zeng, Di Wang, Haiqin Yang | N/A | N/A |
| Dancing in Chains: Reconciling Instruction Following and Faithfulness in Language Models | Zhengxuan Wu, Yuhao Zhang, Peng Qi, Yumo Xu, Rujun Han, Yian Zhang, Jifan Chen, Bonan Min, Zhiheng Huang | N/A | N/A |
| Where is the signal in tokenization space? | Renato Geh, Honghua Zhang, Kareem Ahmed, Benjie Wang, Guy Van den Broeck | N/A | N/A |
| Private Language Models via Truncated Laplacian Mechanism | Tianhao Huang, Tao Yang, Ivan Habernal, Lijie Hu, Di Wang | N/A | N/A |
| Estimating Knowledge in Large Language Models Without Generating a Single Token | Daniela Gottesman, Mor Geva | N/A | N/A |
| Consistent Autoformalization for Constructing Mathematical Libraries | Lan Zhang, XIN QUAN, Andre Freitas | N/A | N/A |
| Contextual and Parametric Knowledge: More Context, More Focus | Yufei Tao, Adam Hiatt, Erik Haake, Antonie J. Jetter, Ameeta Agrawal | N/A | N/A |
| Semantic Training Signals Promote Hierarchical Syntactic Generalization in Transformers | Aditya Yedetore, Najoung Kim | N/A | N/A |
| When Is Multilinguality a Curse? Language Modeling for 250 High- and Low-Resource Languages | Tyler A. Chang, Catherine Arnett, Zhuowen Tu, Ben Bergen | N/A | N/A |
| Teaching Embodied Reinforcement Learning Agents: Informativeness and Diversity of Language Use | Jiajun Xi, Yinong He, Jianing Yang, Yinpei Dai, Joyce Chai | N/A | N/A |
| MiTTenS: A Dataset for Evaluating Gender Mistranslation | Kevin Robinson, Sneha Kudugunta, Romina Stella, Sunipa Dev, Jasmijn Bastings | N/A | N/A |
| Teaching LLMs to Abstain across Languages via Multilingual Feedback | Shangbin Feng, Weijia Shi, Yike Wang, Wenxuan Ding, Orevaoghene Ahia, Shuyue Stella Li, Vidhisha Balachandran, Sunayana Sitaram, Yulia Tsvetkov | N/A | N/A |
| Modular Pluralism: Pluralistic Alignment via Multi-LLM Collaboration | Shangbin Feng, Taylor Sorensen, Yuhan Liu, Jillian Fisher, Chan Young Park, Yejin Choi, Yulia Tsvetkov | N/A | N/A |
| StyleRemix: Interpretable Authorship Obfuscation via Distillation and Perturbation of Style Elements | Jillian Fisher, Skyler Hallinan, Ximing Lu, Mitchell L Gordon, Zaid Harchaoui, Yejin Choi | N/A | N/A |
| I Could’ve Asked That: Reformulating Unanswerable Questions | Wenting Zhao, Ge Gao, Claire Cardie, Alexander M Rush | N/A | N/A |
| STOP! Benchmarking Large Language Models with Sensitivity Testing on Offensive Progressions | Robert Morabito, Sangmitra Madhusudan, Tyler McDonald, Ali Emami | N/A | N/A |
| Hidden Persuaders: How LLM Political Bias Could Sway Our Elections | Yujin Potter, Shiyang Lai, Junsol Kim, James Evans, Dawn Song | N/A | N/A |
| SOUL: Unlocking the Power of Second-Order Optimization for LLM Unlearning | Jinghan Jia, Yihua Zhang, Yimeng Zhang, Jiancheng Liu, Bharat Runwal, James Diffenderfer, Bhavya Kailkhura, Sijia Liu | N/A | N/A |
| When Reasoning Meets Information Aggregation: A Case Study with Sports Narratives | Yebowen Hu, Kaiqiang Song, Sangwoo Cho, Xiaoyang Wang, Wenlin Yao, Hassan Foroosh, Dong Yu, Fei Liu | N/A | N/A |
| An Analysis of Multilingual FActScore | Vu Trong Kim, Michael Krumdick, Varshini Reddy, Franck Dernoncourt, Viet Dac Lai | N/A | N/A |
| Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models | Seungone Kim, Juyoung Suk, Shayne Longpre, Bill Yuchen Lin, Jamin Shin, Sean Welleck, Graham Neubig, Moontae Lee, Kyungjae Lee, Minjoon Seo | N/A | N/A |
| RAG-QA Arena: Evaluating Domain Robustness for Long-form Retrieval Augmented Question Answering | Rujun Han, Yuhao Zhang, Peng Qi, Yumo Xu, Jenyuan Wang, Lan Liu, William Yang Wang, Bonan Min, Vittorio Castelli | N/A | N/A |
| PromptReps: Prompting Large Language Models to Generate Dense and Sparse Representations for Zero-Shot Document Retrieval | Shengyao Zhuang, Xueguang Ma, Bevan Koopman, Jimmy Lin, Guido Zuccon | N/A | N/A |
| Voices Unheard: NLP Resources and Models for Yorùbá Regional Dialects | Orevaoghene Ahia, Anuoluwapo Aremu, Diana Abagyan, Hila Gonen, David Ifeoluwa Adelani, Daud Abolade, Noah A. Smith, Yulia Tsvetkov | N/A | N/A |
| ARES: Alternating Reinforcement Learning and Supervised Fine-Tuning for Enhanced Multi-Modal Chain-of-Thought Reasoning Through Diverse AI Feedback | Ju-Seung Byun, Jiyun Chun, Jihyung Kil, Andrew Perrault | N/A | N/A |
| Order of Magnitude Speedups for LLM Membership Inference | Rongting Zhang, Martin Andres Bertran, Aaron Roth | N/A | N/A |
| VIMI: Grounding Video Generation through Multi-modal Instruction | Yuwei Fang, Willi Menapace, Aliaksandr Siarohin, Tsai-Shien Chen, Kuan-Chieh Wang, Ivan Skorokhodov, Graham Neubig, Sergey Tulyakov | N/A | N/A |
| F$^2$RL: Factuality and Faithfulness Reinforcement Learning Framework for Claim-Guided Evidence-Supported Counterspeech Generation | Haiyang Wang, Yuchen Pan, Xin Song, Xuechen Zhao, Minghao Hu, Bin Zhou | N/A | N/A |
| Deciphering Rumors: A Multi-Task Learning Approach with Intent-aware Hierarchical Contrastive Learning | Chang Yang, Peng Zhang, Hui Gao, Jing Zhang | N/A | N/A |
| Visual Prompting in LLMs for Enhancing Emotion Recognition | Qixuan Zhang, Zhifeng Wang, Dylan Zhang, Yang Liu, Zhenyue Qin, Wenjia Niu, Sabrina Caldwell, Tom Gedeon | N/A | N/A |
| IDEAW: Robust Neural Audio Watermarking with Invertible Dual-Embedding | Pengcheng Li, Xulong Zhang, Jing Xiao, Jianzong Wang | N/A | N/A |
| Leveraging Conflicts in Social Media Posts: Unintended Offense Dataset | Che Wei Tsai, Yen-Hao Huang, Tsu-keng Liao, Didier Fernando Salazar Estrada, Retnani Latifah, Yi-Shin Chen | N/A | N/A |
| Outcome-Constrained Large Language Models for Countering Hate Speech | Lingzi Hong, Pengcheng Luo, Eduardo Blanco, Xiaoying Song | N/A | N/A |
| Multiple Sources are Better Than One: Incorporating External Knowledge in Low-Resource Glossing | Changbing Yang, Garrett Nicolai, Miikka Silfverberg | N/A | N/A |
| Adaptive Immune-based Sound-Shape Code Substitution for Adversarial Chinese Text Attacks | Ao Wang, Xinghao Yang, Chen Li, Bao-di Liu, Weifeng Liu | N/A | N/A |
| Bootstrapped Policy Learning for Task-oriented Dialogue through Goal Shaping | Yangyang Zhao, Ben Niu, Mehdi Dastani, Shihan Wang | N/A | N/A |
| PsyGUARD: An Automated System for Suicide Detection and Risk Assessment in Psychological Counseling | Huachuan Qiu, Lizhi Ma, Zhenzhong Lan | N/A | N/A |
| World to Code: Multi-modal Data Generation via Self-Instructed Compositional Captioning and Filtering | Jiacong Wang, Bohong Wu, Haiyong Jiang, Haoyuan Guo, Xin Xiao, Zhou Xun, Jun Xiao | N/A | N/A |
| DVD: Dynamic Contrastive Decoding for Knowledge Amplification in Multi-Document Question Answering | Jing Jin, Houfeng Wang, Hao Zhang, Xiaoguang Li, Zhijiang Guo | N/A | N/A |
| How Do Humans Write Code? Large Models Do It the Same Way Too | Long Li, Xuzheng He, Haozhe Wang, Linlin Wang, Liang He | N/A | N/A |
| Retrospex: Language Agent Meets Offline Reinforcement Learning Critic | Yufei Xiang, Yiqun Shen, Yeqin Zhang, Nguyen Cam-Tu | N/A | N/A |
| Forgetting Curve: A Reliable Method for Evaluating Memorization Capability for Long-Context Models | Xinyu Liu, Runsong Zhao, Pengcheng Huang, Chunyang Xiao, Bei Li, Jingang Wang, Tong Xiao, JingBo Zhu | N/A | N/A |
| Retrieve-Plan-Generation: An Iterative Planning and Answering Framework for Knowledge-Intensive LLM Generation | Yuanjie Lyu, Zihan Niu, Zheyong Xie, Chao Zhang, Tong Xu, Yang Wang, Enhong Chen | N/A | N/A |
| CoEvol: Constructing Better Responses for Instruction Finetuning through Multi-Agent Cooperation | Renhao Li, Minghuan Tan, Derek F. Wong, Min Yang | N/A | N/A |
| A Peek into Token Bias: Large Language Models Are Not Yet Genuine Reasoners | Bowen Jiang, Yangxinyu Xie, Zhuoqun Hao, Xiaomeng Wang, Tanwi Mallick, Weijie J Su, Camillo Jose Taylor, Dan Roth | N/A | N/A |
| Bayesian Calibration of Win Rate Estimation with LLM Evaluators | Yicheng Gao, Gonghan Xu, Zhe Wang, Arman Cohan | N/A | N/A |
| MuMath-Code: Combining Tool-Use Large Language Models with Multi-perspective Data Augmentation for Mathematical Reasoning | Shuo Yin, Weihao You, Zhilong Ji, Guoqiang Zhong, Jinfeng Bai | N/A | N/A |
| Seeing the Forest through the Trees: Data Leakage from Partial Transformer Gradients | Weijun Li, Qiongkai Xu, Mark Dras | N/A | N/A |
| RWKV-CLIP: A Robust Vision-Language Representation Learner | Tiancheng Gu, Kaicheng Yang, Xiang An, Ziyong Feng, Dongnan Liu, Weidong Cai, Jiankang Deng | N/A | N/A |
| KidLM: Advancing Language Models for Children – Early Insights and Future Directions | Mir Tafseer Nayeem, Davood Rafiei | N/A | N/A |
| Using Language Models to Disambiguate Lexical Choices in Translation | Josh Barua, Sanjay Subramanian, Kayo Yin, Alane Suhr | N/A | N/A |
| How Does the Disclosure of AI Assistance Affect the Perceptions of Writing? | Zhuoyan Li, Chen Liang, Jing Peng, Ming Yin | N/A | N/A |
| An Unsupervised Approach to Achieve Supervised-Level Explainability in Healthcare Records | Joakim Edin, Maria Maistro, Lars Maaløe, Lasse Borgholt, Jakob Drachmann Havtorn, Tuukka Ruotsalo | N/A | N/A |
| Crafting Personalized Agents through Retrieval-Augmented Generation on Editable Memory Graphs | Zheng Wang, Zhongyang Li, Jiang Zeren, Dandan Tu, Wei Shi | N/A | N/A |
| EVEDIT: Event-based Knowledge Editing for Deterministic Knowledge Propagation | Jiateng Liu, Pengfei Yu, Yuji Zhang, Sha Li, Zixuan Zhang, Ruhi Sarikaya, Kevin Small, Heng Ji | N/A | N/A |
| Predicting Nonnative Sentence Processing with L2LMs | Tatsuya Aoyama, Nathan Schneider | N/A | N/A |
| From the Least to the Most: Building a Plug-and-Play Visual Reasoner via Data Synthesis | Chuanqi Cheng, Jian Guan, Wei Wu, Rui Yan | N/A | N/A |
| Quality Matters: Evaluating Synthetic Data for Tool-Using LLMs | Shadi Iskander, Sofia Tolmach, Ori Shapira, Nachshon Cohen, Zohar Karnin | N/A | N/A |
| Cross-Domain Audio Deepfake Detection: Dataset and Analysis | Yuang Li, Min Zhang, Mengxin Ren, Xiaosong Qiao, Miaomiao Ma, Daimeng Wei, Hao Yang | N/A | N/A |
| MaPPER: Multimodal Prior-guided Parameter Efficient Tuning for Referring Expression Comprehension | Ting Liu, Zunnan Xu, Zhiqiang Wang, Yue Hu, Liangtao Shi, Quanjun Yin | N/A | N/A |
| Investigating How Large Language Models Leverage Internal Knowledge to Perform Complex Reasoning | Miyoung Ko, Sue Hyun Park, Joonsuk Park, Minjoon Seo | N/A | N/A |
| Aligning Translation-Specific Understanding to General Understanding in Large Language Models | Yichong Huang, Baohang Li, Xiaocheng Feng, Wenshuai Huo, Chengpeng Fu, Ting Liu, Bing Qin | N/A | N/A |
| FOOL ME IF YOU CAN! An Adversarial Dataset to Investigate the Robustness of LMs in Word Sense Disambiguation | Mohamad Ballout, Anne Dedert, Nohayr Muhammad Abdelmoneim, Ulf Krumnack, Gunther Heidemann, Kai-Uwe Kühnberger | N/A | N/A |
| Concept-skill Transferability-based Data Selection for Large Vision-Language Models | Jaewoo Lee, Boyang Li, Sung Ju Hwang | N/A | N/A |
| LLMs Assist NLP Researchers: Critique Paper (Meta-)Reviewing | Jiangshu Du, Yibo Wang, Wenting Zhao, Zhongfen Deng, Shuaiqi LIU, Renze Lou, Henry Peng Zou, Pranav Narayanan Venkit, Nan Zhang, Mukund Srinath, Haoran Ranran Zhang, Vipul Gupta, Yinghui Li, Tao Li, Fei Wang, Qin Liu, Tianlin Liu, Pengzhi Gao, Congying Xia, Chen Xing, Cheng Jiayang, Zhaowei Wang, Ying Su, Raj Sanjay Shah, Ruohao Guo, Jing Gu, Haoran Li, Kangda Wei, Zihao Wang, Lu Cheng, Surangika Ranathunga, Meng Fang, Jie Fu, Fei Liu, Ruihong Huang, Eduardo Blanco, Yixin Cao, Rui Zhang, Philip S. Yu, Wenpeng Yin | N/A | N/A |
| Academics Can Contribute to Domain-Specialized Language Models | Mark Dredze, Genta Indra Winata, Prabhanjan Kambadur, Shijie Wu, Ozan Irsoy, Steven Lu, Vadim Dabravolski, David S Rosenberg, Sebastian Gehrmann | N/A | N/A |
| Beyond Reference: Evaluating High Quality Translations Better than Human References | Keonwoong Noh, Seokjin Oh, Woohwan Jung | N/A | N/A |
| Unveiling the Lexical Sensitivity of LLMs: Combinatorial Optimization for Prompt Enhancement | Pengwei Zhan, Zhen Xu, Qian Tan, Jie Song, Ru Xie | N/A | N/A |
| SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages | Holy Lovenia, Rahmad Mahendra, Salsabil Maulana Akbar, Lester James Validad Miranda, Jennifer Santoso, Elyanah Aco, Akhdan Fadhilah, Jonibek Mansurov, Joseph Marvin Imperial, Onno P. Kampman, Joel Ruben Antony Moniz, Muhammad Ravi Shulthan Habibi, Frederikus Hudi, Jann Railey Montalan, Ryan Ignatius Hadiwijaya, Joanito Agili Lopo, William Nixon, Börje F. Karlsson, James Jaya, Ryandito Diandaru, Yuze GAO, Patrick Amadeus Irawan, Bin Wang, Jan Christian Blaise Cruz, Chenxi Whitehouse, Ivan Halim Parmonangan, Maria Khelli, Wenyu Zhang, Lucky Susanto, Reynard Adha Ryanda, Sonny Lazuardi Hermawan, Dan John Velasco, Muhammad Dehan Al Kautsar, Willy Fitra Hendria, Yasmin Moslem, Noah Flynn, Muhammad Farid Adilazuarda, Haochen Li, Johanes Lee, R. Damanhuri, Shuo Sun, Muhammad Reza Qorib, Amirbek Djanibekov, Wei Qi Leong, Quyet V. Do, Niklas Muennighoff, Tanrada Pansuwan, Ilham Firdausi Putra, Yan Xu, Tai Ngee Chia, Ayu Purwarianti, Sebastian Ruder, William Chandra Tjhi, Peerat Limkonchotiwat, Alham Fikri Aji, Sedrick Keh, Genta Indra Winata, Ruochen Zhang, Fajri Koto, Zheng Xin Yong, Samuel Cahyawijaya | N/A | N/A |
| Induct-Learn: Short Phrase Prompting with Instruction Induction | Po-Chun Chen, Sheng-Lun Wei, Hen-Hsen Huang, Hsin-Hsi Chen | N/A | N/A |
| Multi-Granularity History and Entity Similarity Learning for Temporal Knowledge Graph Reasoning | Shi Mingcong, Chunjiang Zhu, Detian Zhang, Shiting Wen, Qing Li | N/A | N/A |
| LUQ: Long-text Uncertainty Quantification for LLMs | Caiqi Zhang, Fangyu Liu, Marco Basaldella, Nigel Collier | N/A | N/A |
| Pretraining Data Detection for Large Language Models: A Divergence-based Calibration Method | Weichao Zhang, Ruqing Zhang, Jiafeng Guo, Maarten de Rijke, Yixing Fan, Xueqi Cheng | N/A | N/A |
| Scaling Synthetic Logical Reasoning Datasets with Context-Sensitive Declarative Grammars | Damien Sileo | N/A | N/A |
| Improving Spoken Language Modeling with Phoneme Classification: A Simple Fine-tuning Approach | Maxime Poli, Emmanuel Chemla, Emmanuel Dupoux | N/A | N/A |
| Safely Learning with Private Data: A Federated Learning Framework for Large Language Model | Jia-Ying Zheng, Hainan Zhang, Lingxiang Wang, Wangjie Qiu, Hong-Wei Zheng, Zhi-Ming Zheng | N/A | N/A |
| Formality Favored: Unraveling the Learning Preferences of Large Language Models on Data with Conflicting Knowledge | Jiahuan Li, Yiqing Cao, Shujian Huang, Jiajun Chen | N/A | N/A |
| How Does the Textual Information Affect the Retrieval of Multimodal In-Context Learning? | Yang Luo, Zangwei Zheng, Zirui Zhu, Yang You | N/A | N/A |
| How Far Can We Extract Diverse Perspectives from Large Language Models? | Shirley Anugrah Hayati, Minhwa Lee, Dheeraj Rajagopal, Dongyeop Kang | N/A | N/A |
| EXPLORA: Efficient Exemplar Subset Selection for Complex Reasoning | Kiran Purohit, Venktesh V, Raghuram Devalla, Krishna Mohan Yerragorla, Sourangshu Bhattacharya, Avishek Anand | N/A | N/A |
| An LLM Feature-based Framework for Dialogue Constructiveness Assessment | Lexin Zhou, Youmna Farag, Andreas Vlachos | N/A | N/A |
| Relevance Is a Guiding Light: Relevance-aware Adaptive Learning for End-to-end Task-oriented Dialogue System | Zhanpeng Chen, Zhihong Zhu, Wanshi Xu, Xianwei Zhuang, Yuexian Zou | N/A | N/A |
| Dialog2Flow: Pre-training Action-Driven Sentence Embeddings for Automatic Dialog Flow Extraction | Sergio Burdisso, Srikanth Madikeri, Petr Motlicek | N/A | N/A |
| Words Worth a Thousand Pictures: Measuring and Understanding Perceptual Variability in Text-to-Image Generation | Raphael Tang, Crystina Zhang, Lixinyu Xu, Yao Lu, Wenyan Li, Pontus Stenetorp, Jimmy Lin, Ferhan Ture | N/A | N/A |
| Investigating LLMs as Voting Assistants via Contextual Augmentation: A Case Study on the European Parliament Elections 2024 | Ilias Chalkidis | N/A | N/A |
| Adaption-of-Thought: Learning Question Difficulty Improves Large Language Models for Reasoning | Mayi Xu, Yongqi Li, Ke Sun, Tieyun Qian | N/A | N/A |
| LogicST: A Logical Self-Training Framework for Document-Level Relation Extraction with Incomplete Annotations | Shengda Fan, Yanting Wang, Shasha Mo, Jianwei Niu | N/A | N/A |
| Concept Space Alignment in Multilingual LLMs | Qiwei Peng, Anders Søgaard | N/A | N/A |
| Predicting Rewards Alongside Tokens: Non-disruptive Parameter Insertion for Efficient Inference Intervention in Large Language Model | Chenhan Yuan, Fei Huang, Ru Peng, Keming Lu, Bowen Yu, Chang Zhou, Jingren Zhou | N/A | N/A |
| NLEBench+NorGLM: A Comprehensive Empirical Analysis and Benchmark Dataset for Generative Language Models in Norwegian | Peng Liu, Lemei Zhang, Terje Farup, Even W. Lauvrak, Jon Espen Ingvaldsen, Simen Eide, Jon Atle Gulla, Zhirong Yang | N/A | N/A |
| RSA-Control: A Pragmatics-Grounded Lightweight Controllable Text Generation Framework | Yifan Wang, Vera Demberg | N/A | N/A |
| Scaling Laws Across Model Architectures: A Comparative Analysis of Dense and MoE Models in Large Language Models | Siqi Wang, Zhengyu Chen, Bei Li, Keqing He, Min Zhang, Jingang Wang | N/A | N/A |
| Synergizing In-context Learning with Hints for End-to-end Task-oriented Dialog Systems | Vishal Vivek Saley, Rocktim Jyoti Das, Dinesh Raghu, Mausam | N/A | N/A |
| REAR: A Relevance-Aware Retrieval-Augmented Framework for Open-Domain Question Answering | Yuhao Wang, Ruiyang Ren, Junyi Li, Xin Zhao, Jing Liu, Ji-Rong Wen | N/A | N/A |
| Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA | Minzheng Wang, Longze Chen, Cheng Fu, Liaoshengyi, Xinghua Zhang, Bingliwu, Haiyang Yu, Nan Xu, Lei Zhang, Run Luo, Yunshui Li, Min Yang, Fei Huang, Yongbin Li | N/A | N/A |
| On Mitigating Performance Disparities in Multilingual Speech Recognition | Monorama Swain, Anna Katrine van Zee, Anders Søgaard | N/A | N/A |
| Thinking Outside of the Differential Privacy Box: A Case Study in Text Privatization with Language Model Prompting | Stephen Meisenbacher, Florian Matthes | N/A | N/A |
| From Coarse to Fine: Impacts of Feature-Preserving and Feature-Compressing Connectors on Perception in Multimodal Models | Junyan Lin, Haoran Chen, Dawei Zhu, Xiaoyu Shen | N/A | N/A |
| What is “Typological Diversity” in NLP? | Esther Ploeger, Wessel Poelman, Miryam de Lhoneux, Johannes Bjerva | N/A | N/A |
| The Computational Anatomy of Humility: Modeling Intellectual Humility in Online Public Discourse | Xiaobo Guo, Neil Potnis, Melody Yu, Nabeel Gillani, Soroush Vosoughi | N/A | N/A |
| Consistent Bidirectional Language Modelling: Expressive Power and Representational Conciseness | Georgi Shopov, Stefan Gerdjikov | N/A | N/A |
| Benchmarking Vision Language Models for Cultural Understanding | Shravan Nayak, Kanishk Jain, Rabiul Awal, Siva Reddy, Sjoerd van Steenkiste, Lisa Anne Hendricks, Karolina Stanczak, Aishwarya Agrawal | N/A | N/A |
| Methods of Automatic Matrix Language Determination for Code-Switched Speech | Olga Iakovenko, Thomas Hain | N/A | N/A |
| Analyzing Key Factors Influencing Emotion Prediction Performance of VLLMs in Conversational Contexts | Jaewook Lee, Yeajin Jang, Hongjin Kim, Woojin Lee, Harksoo Kim | N/A | N/A |
| Context-Aware Assistant Selection for Improved Inference Acceleration with Large Language Models | Jerry Huang, Prasanna Parthasarathi, Mehdi Rezagholizadeh, Sarath Chandar | N/A | N/A |
| Teaching Small Language Models Reasoning through Counterfactual Distillation | Feng Tao, Yicheng Li, Li Chenglin, Hao Chen, Fei Yu, Yin Zhang | N/A | N/A |
| Do Not Worry if You Do Not Have Data: Building Pretrained Language Models Using Translationese | Meet Doshi, Raj Dabre, Pushpak Bhattacharyya | N/A | N/A |
| Quantifying the Gap Between Machine Translation and Native Language in Training for Multimodal, Multilingual Retrieval | Kyle Buettner, Adriana Kovashka | N/A | N/A |
| MTA4DPR: Multi-Teaching-Assistants Based Iterative Knowledge Distillation for Dense Passage Retrieval | Qixi Lu, Gongbo Tang | N/A | N/A |
| Fine-Grained Detection of Solidarity for Women and Migrants in 155 Years of German Parliamentary Debates | Aida Kostikova, Dominik Beese, Benjamin Paassen, Ole Pütz, Gregor Wiedemann, Steffen Eger | N/A | N/A |
| CItruS: Chunked Instruction-aware State Eviction for Long Sequence Modeling | Yu Bai, Xiyuan Zou, Heyan Huang, Sanxing Chen, Marc-Antoine Rondeau, Yang Gao, Jackie CK Cheung | N/A | N/A |
| Story Embeddings — Narrative-Focused Representations of Fictional Stories | Hans Ole Hatzel, Chris Biemann | N/A | N/A |
| C-LLM: Learn to Check Chinese Spelling Errors Character by Character | Kunting Li, Yong Hu, Liang He, Fandong Meng, Jie Zhou | N/A | N/A |
| PSC: Extending Context Window of Large Language Models via Phase Shift Calibration | Wenqiao Zhu, Chao Xu, Lulu Wang, Jun Wu | N/A | N/A |
| Video-LLaVA: Learning United Visual Representation by Alignment Before Projection | Bin Lin, Yang Ye, Bin Zhu, Jiaxi Cui, Munan Ning, Peng Jin, Li Yuan | N/A | N/A |
| SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales | Tianyang Xu, Shujin Wu, Shizhe Diao, Xiaoze Liu, Xingyao Wang, Yangyi Chen, Jing Gao | N/A | N/A |
| Mitigating Frequency Bias and Anisotropy in Language Model Pre-Training with Syntactic Smoothing | Richard Diehl Martinez, Zebulon Goriely, Andrew Caines, Paula Buttery, Lisa Beinborn | N/A | N/A |
| ToxiCloakCN: Evaluating Robustness of Offensive Language Detection in Chinese with Cloaking Perturbations | Yunze Xiao, Yujia Hu, Kenny Tsu Wei Choo, Roy Ka-Wei Lee | N/A | N/A |
| Boosting Scientific Concepts Understanding: Can Analogies from Teacher Models Empower Student Models? | Siyu Yuan, Cheng Jiayang, Lin Qiu, Deqing Yang | N/A | N/A |
| Model Internals-based Answer Attribution for Trustworthy Retrieval-Augmented Generation | Jirui Qi, Gabriele Sarti, Raquel Fernández, Arianna Bisazza | N/A | N/A |
| Do Large Language Models Know How Much They Know? | Gabriele Prato, Jerry Huang, Prasanna Parthasarathi, Shagun Sodhani, Sarath Chandar | N/A | N/A |
| Investigating Mysteries of CoT-Augmented Distillation | Somin Wadhwa, Silvio Amir, Byron C Wallace | N/A | N/A |
| SciPrompt: Knowledge-Augmented Prompting for Fine-Grained Categorization of Scientific Topics | Zhiwen You, Kanyao Han, Haotian Zhu, Bertram Ludaescher, Jana Diesner | N/A | N/A |
| Distilling Knowledge from Text-to-Image Generative Models Improves Visio-Linguistic Reasoning in CLIP | Samyadeep Basu, Shell Xu Hu, Maziar Sanjabi, Daniela Massiceti, Soheil Feizi | N/A | N/A |
| Learning from Natural Language Explanations for Generalizable Entity Matching | Somin Wadhwa, Adit Krishnan, Runhui Wang, Byron C Wallace, Luyang Kong | N/A | N/A |
| Do You Know What You Are Talking About? Characterizing Query-Knowledge Relevance For Reliable Retrieval Augmented Generation | Zhuohang Li, Jiaxin Zhang, Chao Yan, Kamalika Das, Sricharan Kumar, Murat Kantarcioglu, Bradley A. Malin | N/A | N/A |
| On the Reliability of Psychological Scales on Large Language Models | Jen-tse Huang, Wenxuan Wang, Man Ho Lam, Eric John Li, Wenxiang Jiao, Michael Lyu | N/A | N/A |
| Contrastive Entity Coreference and Disambiguation for Historical Texts | Abhishek Arora, Emily Silcock, Melissa Dell, Leander Heldring | N/A | N/A |
| Finer: Investigating and Enhancing Fine-Grained Visual Concept Recognition in Large Vision Language Models | Jeonghwan Kim, Heng Ji | N/A | N/A |
| Evaluating LLMs for Targeted Concept Simplification for Domain-Specific Texts | Sumit Asthana, Hannah Rashkin, Elizabeth Clark, Fantine Huot, Mirella Lapata | N/A | N/A |
| VLFeedback: A Large-Scale AI Feedback Dataset for Large Vision-Language Models Alignment | Lei Li, Zhihui Xie, Mukai Li, Shunian Chen, Peiyi Wang, Liang Chen, Yazheng Yang, Benyou Wang, Lingpeng Kong, Qi Liu | N/A | N/A |
| Focused Large Language Models are Stable Many-Shot Learners | Peiwen Yuan, Shaoxiong Feng, Yiwei Li, Xinglin Wang, Yueqi Zhang, Chuyi Tan, Boyuan Pan, Heda Wang, Yao Hu, Kan Li | N/A | N/A |
| Reconsidering Sentence-Level Sign Language Translation | Garrett Tanzer, Maximus Shengelia, Ken Harrenstien, David Uthus | N/A | N/A |
| GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities | Sreyan Ghosh, Sonal Kumar, Ashish Seth, Chandra Kiran Reddy Evuru, Utkarsh Tyagi, S Sakshi, Oriol Nieto, Ramani Duraiswami, Dinesh Manocha | N/A | N/A |
| Verba volant, scripta volant? Don’t worry! There are computational solutions for protoword reconstruction | Liviu P Dinu, Ana Sabina Uban, Alina Maria Cristea, Ioan-Bogdan Iordache, Teodor-George Marchitan, Simona Georgescu, Laurentiu Zoicas | N/A | N/A |
| ChatGPT Doesn’t Trust LA Chargers Fans: Guardrail Sensitivity in Context | Victoria R Li, Yida Chen, Naomi Saphra | N/A | N/A |
| Personas as a Way to Model Truthfulness in Language Models | Nitish Joshi, Javier Rando, Abulhair Saparov, Najoung Kim, He He | N/A | N/A |
| Satyrn: A Platform for Analytics Augmented Generation | Marko Sterbentz, Cameron Barrie, Shubham Shahi, Abhratanu Dutta, Donna Hooshmand, Harper Pack, Kristian J Hammond | N/A | N/A |
| EH-MAM: Easy-to-Hard Masked Acoustic Modeling for Self-Supervised Speech Representation Learning | Ashish Seth, Ramaneswaran S, S Sakshi, Sonal Kumar, Sreyan Ghosh, Dinesh Manocha | N/A | N/A |
| EPO: Hierarchical LLM Agents with Environment Preference Optimization | Qi Zhao, Haotian Fu, Chen Sun, George Konidaris | N/A | N/A |
| Detection and Measurement of Syntactic Templates in Generated Text | Chantal Shaib, Yanai Elazar, Junyi Jessy Li, Byron C Wallace | N/A | N/A |
| UOUO: Uncontextualized Uncommon Objects for Measuring Knowledge Horizons of Vision Language Models | Xinyu Pi, Mingyuan Wu, Jize Jiang, Haozhen Zheng, Beitong Tian, ChengXiang Zhai, Klara Nahrstedt, Zhiting Hu | N/A | N/A |
| Optimized Speculative Sampling for GPU Hardware Accelerators | Dominik Wagner, Seanie Lee, Ilja Baumann, Philipp Seeberger, Korbinian Riedhammer, Tobias Bocklet | N/A | N/A |
| Personalized Pieces: Efficient Personalized Large Language Models through Collaborative Efforts | Zhaoxuan Tan, Zheyuan Liu, Meng Jiang | N/A | N/A |
| Democratizing Large Language Models via Personalized Parameter-Efficient Fine-tuning | Zhaoxuan Tan, Qingkai Zeng, Yijun Tian, Zheyuan Liu, Bing Yin, Meng Jiang | N/A | N/A |
| Unifying Multimodal Retrieval via Document Screenshot Embedding | Xueguang Ma, Sheng-Chieh Lin, Minghan Li, Wenhu Chen, Jimmy Lin | N/A | N/A |
| Neuron Specialization: Leveraging Intrinsic Task Modularity for Multilingual Machine Translation | Shaomu Tan, Di Wu, Christof Monz | N/A | N/A |
| An Audit on the Perspectives and Challenges of Hallucinations in NLP | Pranav Narayanan Venkit, Tatiana Chakravorti, Vipul Gupta, Heidi Biggs, Mukund Srinath, Koustava Goswami, Sarah Rajtmajer, Shomir Wilson | N/A | N/A |
| Discovering Knowledge-Critical Subnetworks in Pretrained Language Models | Deniz Bayazit, Negar Foroutan, Zeming Chen, Gail Weiss, Antoine Bosselut | N/A | N/A |
| Reconstruct Your Previous Conversations! Comprehensively Investigating Privacy Leakage Risks in Conversations with GPT Models | Junjie Chu, Zeyang Sha, Michael Backes, Yang Zhang | N/A | N/A |
| Right for Right Reasons: Large Language Models for Verifiable Commonsense Knowledge Graph Question Answering | Armin Toroghi, Willis Guo, Mohammad Mahdi Abdollah Pour, Scott Sanner | N/A | N/A |
| Verifiable, Debuggable, and Repairable Commonsense Logical Reasoning via LLM-based Theory Resolution | Armin Toroghi, Willis Guo, Ali Pesaranghader, Scott Sanner | N/A | N/A |
| Understanding and Mitigating Language Confusion in LLMs | Kelly Marchisio, Wei-Yin Ko, Alexandre Berard, Théo Dehaze, Sebastian Ruder | N/A | N/A |
| Can Large Language Models Learn Independent Causal Mechanisms? | Gael Gendron, Bao Trung Nguyen, Alex Yuxuan Peng, Michael Witbrock, Gillian Dobbie | N/A | N/A |
| MirrorStories: Reflecting Diversity through Personalized Narrative Generation with Large Language Models | Sarfaroz Yunusov, Hamza Sidat, Ali Emami | N/A | N/A |
| InterIntent: Investigating Social Intelligence of LLMs via Intention Understanding in an Interactive Game Context | Ziyi Liu, Abhishek Anand, Pei Zhou, Jen-tse Huang, Jieyu Zhao | N/A | N/A |
| Locating Information Gaps and Narrative Inconsistencies Across Languages: A Case Study of LGBT People Portrayals on Wikipedia | Farhan Samir, Chan Young Park, Vered Shwartz, Anjalie Field, Yulia Tsvetkov | N/A | N/A |
| From Local Concepts to Universals: Evaluating the Multicultural Understanding of Vision-Language Models | Mehar Bhatia, Sahithya Ravi, Aditya Chinchure, EunJeong Hwang, Vered Shwartz | N/A | N/A |
| Dynamic Multi-Reward Weighting for Multi-Style Controllable Generation | Karin De Langis, Ryan Koo, Dongyeop Kang | N/A | N/A |
| MMNeuron: Discovering Neuron-Level Domain-Specific Interpretation in Multimodal Large Language Model | Jiahao Huo, Yibo Yan, Boren Hu, Yutao Yue, Xuming Hu | N/A | N/A |
| Learning to Extract Structured Entities Using Language Models | Haolun Wu, Ye Yuan, Liana Mikaelyan, Alexander Meulemans, Xue Liu, James Hensman, Bhaskar Mitra | N/A | N/A |
| Efficient LLM Comparative Assessment: A Product of Experts Framework for Pairwise Comparisons | Adian Liusie, Vatsal Raina, Yassir Fathullah, Mark Gales | N/A | N/A |
| A Survey of AMR Applications | Shira Wein, Juri Opitz | N/A | N/A |
| Beyond Embeddings: The Promise of Visual Table in Visual Reasoning | Yiwu Zhong, Zi-Yuan Hu, Michael Lyu, Liwei Wang | N/A | N/A |
| CareCorpus+: Expanding and Augmenting Caregiver Strategy Data to Support Pediatric Rehabilitation | Shahla Farzana, Ivana Lucero, Vivian Villegas, Vera C Kaelin, Mary Khetani, Natalie Parde | N/A | N/A |
| Secured Weight Release for Large Language Models via Taylor Expansion | Guanchu Wang, Yu-Neng Chuang, Ruixiang Tang, Shaochen Zhong, Jiayi Yuan, Hongye Jin, Zirui Liu, Vipin Chaudhary, Shuai Xu, James Caverlee, Xia Hu | N/A | N/A |
| TimeR$^4$: Time-aware Retrieval-Augmented Large Language Models for Temporal Knowledge Graph Question Answering | Xinying Qian, Ying Zhang, Yu Zhao, Baohang Zhou, Xuhui Sui, Li Zhang, Kehui Song | N/A | N/A |
| Knowledge-Centric Hallucination Detection | Xiangkun Hu, Dongyu Ru, Lin Qiu, Qipeng Guo, Tianhang Zhang, Yang Xu, Yun Luo, Pengfei Liu, Yue Zhang, Zheng Zhang | N/A | N/A |
| Revealing the Parallel Multilingual Learning within Large Language Models | Yongyu Mu, Peinan Feng, Zhiquan Cao, Yuzhang Wu, Bei Li, Chenglong Wang, Tong Xiao, Kai Song, Tongran Liu, Chunliang Zhang, JingBo Zhu | N/A | N/A |
| Automatic Instruction Evolving for Large Language Models | Weihao Zeng, Can Xu, Yingxiu Zhao, Jian-Guang Lou, Weizhu Chen | N/A | N/A |
| RepEval: Effective Text Evaluation with LLM Representation | Shuqian Sheng, Yi Xu, Tianhang Zhang, Zanwei Shen, Luoyi Fu, Jiaxin Ding, Lei Zhou, Xiaoying Gan, Xinbing Wang, Chenghu Zhou | N/A | N/A |
| Generative Models for Automatic Medical Decision Rule Extraction from Text | Yuxin He, Buzhou Tang, Xiaoling Wang | N/A | N/A |
| Encoding and Controlling Global Semantics for Long-form Video Question Answering | Thong Thanh Nguyen, Zhiyuan Hu, Xiaobao Wu, Cong-Duy T Nguyen, See-Kiong Ng, Anh Tuan Luu | N/A | N/A |
| Towards Understanding Jailbreak Attacks in LLMs: A Representation Space Analysis | Yuping Lin, Pengfei He, Han Xu, Yue Xing, Makoto Yamada, Hui Liu, Jiliang Tang | N/A | N/A |
| Enhancing Legal Case Retrieval via Scaling High-quality Synthetic Query-Candidate Pairs | Cheng Gao, Chaojun Xiao, Zhenghao Liu, Huimin Chen, Zhiyuan Liu, Maosong Sun | N/A | N/A |
| Does Large Language Model Contain Task-Specific Neurons? | Ran Song, Shizhu He, Shuting Jiang, Yantuan Xian, Shengxiang Gao, Kang Liu, Zhengtao Yu | N/A | N/A |
| Liar, Liar, Logical Mire: A Benchmark for Suppositional Reasoning in Large Language Models | Philipp Mondorf, Barbara Plank | N/A | N/A |
| Advancing Test-Time Adaptation in Wild Acoustic Test Settings | Hongfu Liu, Hengguan Huang, Ye Wang | N/A | N/A |
| Learning to Retrieve Iteratively for In-Context Learning | Yunmo Chen, Tongfei Chen, Harsh Jhamtani, Patrick Xia, Richard Shin, Jason Eisner, Benjamin Van Durme | N/A | N/A |
| Taxonomy-guided Semantic Indexing for Academic Paper Search | SeongKu Kang, Yunyi Zhang, Pengcheng Jiang, Dongha Lee, Jiawei Han, Hwanjo Yu | N/A | N/A |
| Python is Not Always the Best Choice: Embracing Multilingual Program of Thoughts | Xianzhen Luo, Qingfu Zhu, Zhiming Zhang, Libo Qin, Xuanyu Zhang, Qing Yang, Dongliang Xu, Wanxiang Che | N/A | N/A |
| Advancing Adversarial Suffix Transfer Learning on Aligned Large Language Models | Hongfu Liu, Yuxi Xie, Ye Wang, Michael Shieh | N/A | N/A |
| Incomplete Utterance Rewriting with Editing Operation Guidance and Utterance Augmentation | Zhiyu Cao, Peifeng Li, Yaxin Fan, Qiaoming Zhu | N/A | N/A |
| FRoG: Evaluating Fuzzy Reasoning of Generalized Quantifiers in LLMs | Yiyuan Li, Shichao Sun, Pengfei Liu | N/A | N/A |
| Aligning Large Language Models with Diverse Political Viewpoints | Dominik Stammbach, Philine Widmer, Eunjung Cho, Caglar Gulcehre, Elliott Ash | N/A | N/A |
| “You Gotta be a Doctor, Lin”: An Investigation of Name-Based Bias of Large Language Models in Employment Recommendations | Huy Nghiem, John Prindle, Jieyu Zhao, Hal Daumé III | N/A | N/A |
| Extending Context Window of Large Language Models from a Distributional Perspective | Yingsheng Wu, Yuxuan Gu, Xiaocheng Feng, Weihong Zhong, Dongliang Xu, Qing Yang, Hongtao Liu, Bing Qin | N/A | N/A |
| Leveraging pre-trained language models for linguistic analysis: A case of argument structure constructions | Hakyung Sung, Kristopher Kyle | N/A | N/A |
| MAgIC: Investigation of Large Language Model Powered Multi-Agent in Cognition, Adaptability, Rationality and Collaboration | Lin Xu, Zhiyuan Hu, Daquan Zhou, Hongyu Ren, Zhen Dong, Kurt Keutzer, See-Kiong Ng, Jiashi Feng | N/A | N/A |
| Position Engineering: Boosting Large Language Models through Positional Information Manipulation | Zhiyuan He, Huiqiang Jiang, Zilong Wang, Yuqing Yang, Luna K. Qiu, Lili Qiu | N/A | N/A |
| Towards Injecting Medical Visual Knowledge into Multimodal LLMs at Scale | Junying Chen, Chi Gui, Ouyang Ruyi, Anningzhe Gao, Shunian Chen, Guiming Hardy Chen, Xidong Wang, Zhenyang Cai, Ke Ji, Xiang Wan, Benyou Wang | N/A | N/A |
| ADELIE: Aligning Large Language Models on Information Extraction | Yunjia Qi, Hao Peng, Xiaozhi Wang, Bin Xu, Lei Hou, Juanzi Li | N/A | N/A |
| Unveiling Factual Recall Behaviors of Large Language Models through Knowledge Neurons | Yifei Wang, Yuheng Chen, Wanting Wen, Yu Sheng, Linjing Li, Daniel Dajun Zeng | N/A | N/A |
| Lexically Grounded Subword Segmentation | Jindřich Libovický, Jindřich Helcl | N/A | N/A |
| EAGLE-2: Faster Inference of Language Models with Dynamic Draft Trees | Yuhui Li, Fangyun Wei, Chao Zhang, Hongyang Zhang | N/A | N/A |
| Do Text-to-Vis Benchmarks Test Real Use of Visualizations? | Hy Nguyen, Xuefei He, Andrew Reeson, Cecile Paris, Josiah Poon, Jonathan K. Kummerfeld | N/A | N/A |
| Gold Panning in Vocabulary: An Adaptive Method for Vocabulary Expansion of Domain-Specific LLMs | Chengyuan Liu, Shihang Wang, Lizhi Qing, Kun Kuang, Yangyang Kang, Changlong Sun, Fei Wu | N/A | N/A |
| Strategic Demonstration Selection for Improved Fairness in LLM In-Context Learning | Jingyu Hu, Weiru Liu, Mengnan Du | N/A | N/A |
| Multi-Dialect Vietnamese: Task, Dataset, Baseline Models and Challenges | Nguyen Van Dinh, Thanh Chi Dang, Luan Thanh Nguyen, Kiet Van Nguyen | N/A | N/A |
| Is LLM-as-a-Judge Robust? Investigating Universal Adversarial Attacks on Zero-shot LLM Assessment | Vyas Raina, Adian Liusie, Mark Gales | N/A | N/A |
| Rethinking the Reversal Curse of LLMs: a Prescription from Human Knowledge Reversal | Zhicong Lu, Li Jin, Peiguang Li, Yu Tian, Linhao Zhang, Sirui Wang, Guangluan Xu, Changyuan Tian, Xunliang Cai | N/A | N/A |
| More Than Catastrophic Forgetting: Integrating General Capabilities For Domain-Specific LLMs | Chengyuan Liu, Shihang Wang, Yangyang Kang, Lizhi Qing, Fubang Zhao, Chao Wu, Changlong Sun, Kun Kuang, Fei Wu | N/A | N/A |
| Muting Whisper: A Universal Acoustic Adversarial Attack on Speech Foundation Models | Vyas Raina, Rao Ma, Charles McGhee, Kate Knill, Mark Gales | N/A | N/A |
| GENRA: Enhancing Zero-shot Retrieval with Rank Aggregation | Georgios Katsimpras, Georgios Paliouras | N/A | N/A |
| XplainLLM: A Knowledge-Augmented Dataset for Reliable Grounded Explanations in LLMs | Zichen Chen, Jianda Chen, Ambuj Singh, Misha Sra | N/A | N/A |
| Divide and Conquer Radiology Report Generation via Observation Level Fine-grained Pretraining and Prompt Tuning | Yuanpin Zhou, Huogen Wang | N/A | N/A |
| SURf: Teaching Large Vision-Language Models to Selectively Utilize Retrieved Information | Jiashuo Sun, Jihai Zhang, Yucheng Zhou, Zhaochen Su, Xiaoye Qu, Yu Cheng | N/A | N/A |
| UNO Arena for Evaluating Sequential Decision-Making Capability of Large Language Models | Zhanyue Qin, Haochuan Wang, Deyuan Liu, Ziyang Song, Cunhang Fan, Zhao Lv, Jinlin Wu, Zhen Lei, Zhiying Tu, Dianhui Chu, Xiaoyan Yu, Dianbo Sui | N/A | N/A |
| Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments | Yu Gu, Yiheng Shu, Hao Yu, Xiao Liu, Yuxiao Dong, Jie Tang, Jayanth Srinivasa, Hugo Latapie, Yu Su | N/A | N/A |
| MORPHEUS: Modeling Role from Personalized Dialogue History by Exploring and Utilizing Latent Space | Yihong Tang, Bo Wang, Dongming Zhao, Jinxiaojia, Zhangjijun, Ruifang He, Yuexian Hou | N/A | N/A |
| KnowledgeSG: Privacy-Preserving Synthetic Text Generation With Knowledge Distillation From Server | WenHao Wang, Xiaoyu Liang, Rui Ye, Jingyi Chai, Siheng Chen, Yanfeng Wang | N/A | N/A |
| DAMRO: Dive into the Attention Mechanism of LVLM to Reduce Object Hallucination | Xuan Gong, Tianshi Ming, Xinpeng Wang, Zhihua Wei | N/A | N/A |
| Unlocking the Future: Exploring Look-Ahead Planning Mechanistic Interpretability in Large Language Models | Tianyi Men, Pengfei Cao, Zhuoran Jin, Yubo Chen, Kang Liu, Jun Zhao | N/A | N/A |
| Breaking Language Barriers: Cross-Lingual Continual Pre-Training at Scale | Wenzhen Zheng, Wenbo Pan, Xu Xu, Libo Qin, Li Yue, Ming Zhou | N/A | N/A |
| An Empirical Study of Multilingual Reasoning Distillation for Question Answering | Patomporn Payoungkhamdee, Peerat Limkonchotiwat, Jinheon Baek, Potsawee Manakul, Can Udomcharoenchaikit, Ekapol Chuangsuwanich, Sarana Nutanong | N/A | N/A |
| Can Large Language Models Faithfully Express Their Intrinsic Uncertainty in Words? | Gal Yona, Roee Aharoni, Mor Geva | N/A | N/A |
| Does Fine-Tuning LLMs on New Knowledge Encourage Hallucinations? | Zorik Gekhman, Gal Yona, Roee Aharoni, Matan Eyal, Amir Feder, Roi Reichart, Jonathan Herzig | N/A | N/A |
| Bridging Modalities: Enhancing Cross-Modality Hate Speech Detection with Few-Shot In-Context Learning | Ming Shan Hee, Aditi Kumaresan, Roy Ka-Wei Lee | N/A | N/A |
| MIND: Multimodal Shopping Intention Distillation from Large Vision-language Models for E-commerce Purchase Understanding | Baixuan Xu, Weiqi Wang, Haochen Shi, Wenxuan Ding, Huihao Jing, Tianqing Fang, Jiaxin Bai, Xin Liu, Changlong Yu, Zheng Li, Chen Luo, Qingyu Yin, Bing Yin, Long Chen, Yangqiu Song | N/A | N/A |
| ECON: On the Detection and Resolution of Evidence Conflicts | Cheng Jiayang, Qianqian Zhuang, Chunkit Chan, Lin Qiu, Tianhang Zhang, Tengxiao Liu, Yangqiu Song, Yue Zhang, Pengfei Liu, Zheng Zhang | N/A | N/A |
| “Image, Tell me your story!” Predicting the original meta-context of visual misinformation | Jonathan Tonglet, Marie-Francine Moens, Iryna Gurevych | N/A | N/A |
| Improving Retrieval-augmented Text-to-SQL with AST-based Ranking and Schema Pruning | Zhili Shen, Pavlos Vougiouklis, Chenxin Diao, Kaustubh Vyas, Yuanyi Ji, Jeff Z. Pan | N/A | N/A |
| Mixture-of-Subspaces in Low-Rank Adaptation | Taiqiang Wu, Jiahao Wang, Zhe Zhao, Ngai Wong | N/A | N/A |
| A Large-Scale Investigation of Human-LLM Evaluator Agreement on Multilingual and Multi-Cultural Data | Ishaan Watts, Varun Gumma, Aditya Yadavalli, Vivek Seshadri, Manohar Swaminathan, Sunayana Sitaram | N/A | N/A |
| LawBench: Benchmarking Legal Knowledge of Large Language Models | Zhiwei Fei, Xiaoyu Shen, Dawei Zhu, Fengzhe Zhou, Zhuo Han, Alan Huang, Songyang Zhang, Kai Chen, Zhixin Yin, Zongwen Shen, Jidong Ge, Vincent Ng | N/A | N/A |
| Efficient Performance Tracking: Leveraging Large Language Models for Automated Construction of Scientific Leaderboards | Furkan Şahinuç, Thy Thy Tran, Yulia Grishina, Yufang Hou, Bei Chen, Iryna Gurevych | N/A | N/A |
| Efficient Vision-Language pre-training via domain-specific learning for human activities | Adrian Bulat, Yassine Ouali, Ricardo Guerrero, Brais Martinez, Georgios Tzimiropoulos | N/A | N/A |
| Empowering Backbone Models for Visual Text Generation with Input Granularity Control and Glyph-Aware Training | Wenbo Li, Guohao Li, Zhibin Lan, Xue Xu, Wanru Zhuang, Jiachen Liu, Xinyan Xiao, Jinsong Su | N/A | N/A |
| Evaluating Character Understanding of Large Language Models via Character Profiling from Fictional Works | Xinfeng Yuan, Siyu Yuan, Yuhan Cui, Tianhe Lin, Xintao Wang, Rui Xu, Jiangjie Chen, Deqing Yang | N/A | N/A |
| Getting More from Less: Large Language Models are Good Spontaneous Multilingual Learners | Shimao Zhang, Changjiang Gao, Wenhao Zhu, Jiajun Chen, Xin Huang, Xue Han, Junlan Feng, Chao Deng, Shujian Huang | N/A | N/A |
| AdaSwitch: Adaptive Switching between Small and Large Agents for Effective Cloud-Local Collaborative Learning | Hao Sun, Jiayi Wu, Hengyi Cai, Xiaochi Wei, Yue Feng, Bo Wang, Shuaiqiang Wang, Yan Zhang, Dawei Yin | N/A | N/A |
| CoBa: Convergence Balancer for Multitask Finetuning of Large Language Models | Zi Gong, Hang Yu, Cong Liao, Bingchang Liu, Chaoyu Chen, Jianguo Li | N/A | N/A |
| mDPO: Conditional Preference Optimization for Multimodal Large Language Models | Fei Wang, Wenxuan Zhou, James Y. Huang, Nan Xu, Sheng Zhang, Hoifung Poon, Muhao Chen | N/A | N/A |
| Data Advisor: Data Curation with Foresight for Safety Alignment of Large Language Models | Fei Wang, Ninareh Mehrabi, Palash Goyal, Rahul Gupta, Kai-Wei Chang, Aram Galstyan | N/A | N/A |
| Language-to-Code Translation with a Single Labeled Example | Kaj Bostrom, Harsh Jhamtani, Hao Fang, Sam Thomson, Richard Shin, Patrick Xia, Benjamin Van Durme, Jason Eisner, Jacob Andreas | N/A | N/A |
| Attribute or Abstain: Large Language Models as Long Document Assistants | Jan Buchmann, Xiao Liu, Iryna Gurevych | N/A | N/A |
| FEDKIM: Adaptive Federated Knowledge Injection into Medical Foundation Models | Xiaochen Wang, Jiaqi Wang, Houping Xiao, Jinghui Chen, Fenglong Ma | N/A | N/A |
| Retrieved In-Context Principles from Previous Mistakes | Hao Sun, Yong Jiang, Bo Wang, Yingyan Hou, Yan Zhang, Pengjun Xie, Fei Huang | N/A | N/A |
| EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control | Haozhe Chen, Run Chen, Julia Hirschberg | N/A | N/A |
| VPTQ: Extreme Low-bit Vector Post-Training Quantization for Large Language Models | Yifei Liu, Jicheng Wen, Yang Wang, Shengyu Ye, Li Lyna Zhang, Ting Cao, Cheng Li, Mao Yang | N/A | N/A |
| Deterministic Weighted L* Algorithm | Clemente Pasti, Talu Karagöz, Franz Nowak, Anej Svete, Ryan Cotterell | N/A | N/A |
| Towards Verifiable Text Generation with Evolving Memory and Self-Reflection | Hao Sun, Hengyi Cai, Bo Wang, Yingyan Hou, Xiaochi Wei, Shuaiqiang Wang, Yan Zhang, Dawei Yin | N/A | N/A |
| Pelican: Correcting Hallucination in Vision-LLMs via Claim Decomposition and Program of Thought Verification | Pritish Sahu, Karan Sikka, Ajay Divakaran | N/A | N/A |
| Resampled Datasets Are Not Enough: Mitigating Societal Bias Beyond Single Attributes | Yusuke Hirota, Jerone Andrews, Dora Zhao, Orestis Papakyriakopoulos, Apostolos Modas, Yuta Nakashima, Alice Xiang | N/A | N/A |
| RealVul: Can We Detect Vulnerabilities in Web Applications with LLM? | Di Cao, Yong Liao, Xiuwei Shang | N/A | N/A |
| Unsupervised End-to-End Task-Oriented Dialogue with LLMs: The Power of the Noisy Channel | Brendan King, Jeffrey Flanigan | N/A | N/A |
| Humans or LLMs as the Judge? A Study on Judgement Bias | Guiming Hardy Chen, Shunian Chen, Ziche Liu, Feng Jiang, Benyou Wang | N/A | N/A |
| WPO: Enhancing RLHF with Weighted Preference Optimization | Wenxuan Zhou, Ravi Agrawal, Shujian Zhang, Sathish Reddy Indurthi, Sanqiang Zhao, Kaiqiang Song, Silei Xu, Chenguang Zhu | N/A | N/A |
| Walking in Others’ Shoes: How Perspective-Taking Guides Large Language Models in Reducing Toxicity and Bias | Rongwu Xu, Zian Zhou, Tianwei Zhang, Zehan Qi, Su Yao, Ke Xu, Wei Xu, Han Qiu | N/A | N/A |
| MetaReflection: Learning Instructions for Language Agents using Past Reflections | Priyanshu Gupta, Shashank Kirtania, Ananya Singha, Sumit Gulwani, Arjun Radhakrishna, Gustavo Soares, Sherry Shi | N/A | N/A |
| Stepwise Verification and Remediation of Student Reasoning Errors with Large Language Model Tutors | Nico Daheim, Jakub Macina, Manu Kapur, Iryna Gurevych, Mrinmaya Sachan | N/A | N/A |
| On Eliciting Syntax from Language Models via Hashing | Yiran Wang, Masao Utiyama | N/A | N/A |
| CliMedBench: A Large-Scale Chinese Benchmark for Evaluating Medical Large Language Models in Clinical Scenarios | Zetian Ouyang, Yishuai Qiu, Linlin Wang, Gerard de Melo, Ya Zhang, Yanfeng Wang, Liang He | N/A | N/A |
| The Best Defense is Attack: Repairing Semantics in Textual Adversarial Examples | Heng Yang | N/A | N/A |
| CSSL: Contrastive Self-Supervised Learning for Dependency Parsing on Relatively Free Word Ordered and Morphologically Rich Low Resource Languages | Pretam Ray, Jivnesh Sandhan, Amrith Krishna, Pawan Goyal | N/A | N/A |
| Perceptions of Linguistic Uncertainty by Language Models and Humans | Catarina G Belém, Markelle Kelly, Mark Steyvers, Sameer Singh, Padhraic Smyth | N/A | N/A |
| Explaining and Improving Contrastive Decoding by Extrapolating the Probabilities of a Huge and Hypothetical LM | Haw-Shiuan Chang, Nanyun Peng, Mohit Bansal, Anil Ramakrishna, Tagyoung Chung | N/A | N/A |
| Zero-shot Cross-domain Dialogue State Tracking via Context-aware Auto-prompting and Instruction-following Contrastive Decoding | Xiaoyu Dong, Yujie Feng, Zexin Lu, Guangyuan Shi, Xiao-Ming Wu | N/A | N/A |
| Knowledge Conflicts for LLMs: A Survey | Rongwu Xu, Zehan Qi, Zhijiang Guo, Cunxiang Wang, Hongru Wang, Yue Zhang, Wei Xu | N/A | N/A |
| Generative AI in the Era of “Alternative Facts” | Saadia Gabriel, Liang Lyu, James Siderius, Marzyeh Ghassemi, Jacob Andreas, Asuman E. Ozdaglar | N/A | N/A |
| MEANT: Multimodal Encoder for Antecedent Information | Benjamin Irving, Annika Marie Schoene | N/A | N/A |
| A Thorough Examination of Decoding Methods in the Era of LLMs | Chufan Shi, Haoran Yang, Deng Cai, Zhisong Zhang, Yifan Wang, Yujiu Yang, Wai Lam | N/A | N/A |
| AGRaME: Any-Granularity Ranking with Multi-Vector Embeddings | Revanth Gangi Reddy, Omar Attia, Yunyao Li, Heng Ji, Saloni Potdar | N/A | N/A |
| FIRST: Faster Improved Listwise Reranking with Single Token Decoding | Revanth Gangi Reddy, JaeHyeok Doo, Yifei Xu, Md Arafat Sultan, Deevya Swain, Avirup Sil, Heng Ji | N/A | N/A |
| Exploring Nested Named Entity Recognition with Large Language Models: Methods, Challenges, and Insights | Hongjin Kim, Jai-Eun Kim, Harksoo Kim | N/A | N/A |
| ReCaLL: Membership Inference via Relative Conditional Log-Likelihoods | Roy Xie, Junlin Wang, Ruomin Huang, Minxing Zhang, Rong Ge, Jian Pei, Neil Zhenqiang Gong, Bhuwan Dhingra | N/A | N/A |
| “Flex Tape Can’t Fix That”: Bias and Misinformation in Edited Language Models | Karina H Halevy, Anna Sotnikova, Badr AlKhamissi, Syrielle Montariol, Antoine Bosselut | N/A | N/A |
| Revisiting Who’s Harry Potter: Towards Targeted Unlearning from a Causal Intervention Perspective | Yujian Liu, Yang Zhang, Tommi Jaakkola, Shiyu Chang | N/A | N/A |
| LIONs: An Empirically Optimized Approach to Align Language Models | Xiao Yu, Qingyang Wu, Yu Li, Zhou Yu | N/A | N/A |
| Jellyfish: Instruction-Tuning Local Large Language Models for Data Preprocessing | Haochen Zhang, Yuyang Dong, Chuan Xiao, Masafumi Oyamada | N/A | N/A |
| A Comprehensive Survey of Scientific Large Language Models and Their Applications in Scientific Discovery | Yu Zhang, Xiusi Chen, Bowen Jin, Sheng Wang, Shuiwang Ji, Wei Wang, Jiawei Han | N/A | N/A |
| MiniCheck: Efficient Fact-Checking of LLMs on Grounding Documents | Liyan Tang, Philippe Laban, Greg Durrett | N/A | N/A |
| Beyond Label Attention: Transparency in Language Models for Automated Medical Coding via Dictionary Learning | John Wu, David Wu, Jimeng Sun | N/A | N/A |
| MOSEL: Inference Serving Using Dynamic Modality Selection | Bodun Hu, Le Xu, Jeongyoon Moon, Neeraja J Yadwadkar, Aditya Akella | N/A | N/A |
| From RAG to Riches: Retrieval Interlaced with Sequence Generation | Palak Jain, Livio Baldini Soares, Tom Kwiatkowski | N/A | N/A |
| Task Arithmetic can Mitigate Synthetic-to-Real Gap in Automatic Speech Recognition | Hsuan Su, Hua Farn, Fan-Yun Sun, Shang-Tse Chen, Hung-yi Lee | N/A | N/A |
| Learning to Correct for QA Reasoning with Black-box LLMs | Jaehyung Kim, Dongyoung Kim, Yiming Yang | N/A | N/A |
| AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks? | Ori Yoran, Samuel Joseph Amouyal, Chaitanya Malaviya, Ben Bogin, Ofir Press, Jonathan Berant | N/A | N/A |
| PostMark: A Robust Blackbox Watermark for Large Language Models | Yapei Chang, Kalpesh Krishna, Amir Houmansadr, John Frederick Wieting, Mohit Iyyer | N/A | N/A |
| Assessing “Implicit” Retrieval Robustness of Large Language Models | Xiaoyu Shen, Rexhina Blloshmi, Dawei Zhu, Jiahuan Pei, Wei Zhang | N/A | N/A |
| On the Relationship between Truth and Political Bias in Language Models | Suyash Fulay, William Brannon, Shrestha Mohanty, Cassandra Overney, Elinor Poole-Dayan, Deb Roy, Jad Kabbara | N/A | N/A |
| Can Active Label Correction Improve LLM-based Modular AI Systems? | Karan Taneja, Ashok Goel | N/A | N/A |
| Statistical Uncertainty in Word Embeddings: GloVe-V | Andrea Vallebueno, Cassandra Handan-Nader, Christopher D Manning, Daniel E. Ho | N/A | N/A |
| Annotation alignment: Comparing LLM and human annotations of conversational safety | Rajiv Movva, Pang Wei Koh, Emma Pierson | N/A | N/A |
| DiVERT: Distractor Generation with Variational Errors Represented as Text for Math Multiple-choice Questions | Nigel Fernandez, Alexander Scarlatos, Digory Smith, Simon Woodhead, Nancy Otero Ornelas, Andrew Lan | N/A | N/A |
| The Factuality Tax of Diversity-Intervened Text-to-Image Generation: Benchmark and Fact-Augmented Intervention | Yixin Wan, Di Wu, Haoran Wang, Kai-Wei Chang | N/A | N/A |
| CleanGen: Mitigating Backdoor Attacks for Generation Tasks in Large Language Models | Yuetai Li, Zhangchen Xu, Fengqing Jiang, Luyao Niu, Dinuka Sahabandu, Bhaskar Ramasubramanian, Radha Poovendran | N/A | N/A |
| Enhancing Reinforcement Learning with Intrinsic Rewards from Language Model Critique | Meng Cao, Lei Shu, Lei Yu, Yun Zhu, Nevan Wichers, Yinxiao Liu, Lei Meng | N/A | N/A |
| Words Matter: Reducing Stigma in Online Conversations about Substance Use with Large Language Models | Layla Bouzoubaa, Elham Aghakhani, Shadi Rezapour | N/A | N/A |
| Efficient Sequential Decision Making with Large Language Models | Dingyang Chen, Qi Zhang, Yinglun Zhu | N/A | N/A |
| SignCLIP: Connecting Text and Sign Language by Contrastive Learning | Zifan Jiang, Gerard Sant, Amit Moryossef, Mathias Müller, Rico Sennrich, Sarah Ebling | N/A | N/A |
| APPLS: Evaluating Evaluation Metrics for Plain Language Summarization | Yue Guo, Tal August, Gondy Leroy, Trevor Cohen, Lucy Lu Wang | N/A | N/A |
| Ontologically Faithful Generation of Non-Player Character Dialogues | Nathaniel Weir, Ryan Thomas, Randolph d’Amore, Kellie Hill, Benjamin Van Durme, Harsh Jhamtani | N/A | N/A |
| LLM See, LLM Do: Leveraging Active Inheritance to Target Non-Differentiable Objectives | Luísa Shimabucoro, Sebastian Ruder, Julia Kreutzer, Marzieh Fadaee, Sara Hooker | N/A | N/A |
| RuBLiMP: Russian Benchmark of Linguistic Minimal Pairs | Ekaterina Taktasheva, Maxim Bazhukov, Kirill Koncha, Alena Fenogenova, Ekaterina Artemova, Vladislav Mikhailov | N/A | N/A |
| Text-Tuple-Table: Towards Information Integration in Text-to-Table Generation via Global Tuple Extraction | Zheye Deng, Chunkit Chan, Weiqi Wang, Yuxi Sun, Wei Fan, Tianshi Zheng, Yauwai Yim, Yangqiu Song | N/A | N/A |
| Toward Compositional Behavior in Neural Models: A Survey of Current Views | Kate McCurdy, Paul Soulos, Paul Smolensky | N/A | N/A |
| Optimizing Instructions and Demonstrations for Multi-Stage Language Model Programs | Krista Opsahl-Ong, Michael J Ryan, Josh Purtell, David Broman, Christopher Potts, Matei Zaharia, Omar Khattab | N/A | N/A |
| Reverse-Engineering the Reader | Samuel Kiegeland, Ethan Wilcox, Afra Amini, David Robert Reich, Ryan Cotterell | N/A | N/A |
| Synchronous Faithfulness Monitoring for Trustworthy Retrieval-Augmented Generation | Di Wu, Jia-Chen Gu, Fan Yin, Nanyun Peng, Kai-Wei Chang | N/A | N/A |
| Structure Guided Prompt: Instructing Large Language Model in Multi-Step Reasoning by Exploring Graph Structure of the Text | Kewei Cheng, Nesreen K. Ahmed, Theodore L. Willke, Yizhou Sun | N/A | N/A |
| Less is More: Parameter-Efficient Selection of Intermediate Tasks for Transfer Learning | David Schulte, Felix Hamborg, Alan Akbik | N/A | N/A |
| The effects of distance on NPI illusive effects in BERT | So Young Lee, Mai Ha Vu | N/A | N/A |
| Enhancing Systematic Decompositional Natural Language Inference Using Informal Logic | Nathaniel Weir, Kate Sanders, Orion Weller, Shreya Sharma, Dongwei Jiang, Zhengping Jiang, Bhavana Dalvi Mishra, Oyvind Tafjord, Peter Jansen, Peter Clark, Benjamin Van Durme | N/A | N/A |
| Susu Box or Piggy Bank: Assessing Cultural Commonsense Knowledge between Ghana and the US | Christabel Acquaye, Haozhe An, Rachel Rudinger | N/A | N/A |
| Read Anywhere Pointed: Layout-aware GUI Screen Reading with Tree-of-Lens Grounding | Yue Fan, Lei Ding, Ching-Chen Kuo, Shan Jiang, Yang Zhao, Xinze Guan, Jie Yang, Yi Zhang, Xin Eric Wang | N/A | N/A |
| Ranking Manipulation for Conversational Search Engines | Samuel Pfrommer, Yatong Bai, Tanmay Gautam, Somayeh Sojoudi | N/A | N/A |
| Fast Forwarding Low-Rank Training | Adir Rahamim, Naomi Saphra, Sara Kangaslahti, Yonatan Belinkov | N/A | N/A |
| Precise Model Benchmarking with Only a Few Observations | Riccardo Fogliato, Pratik Patil, Nil-Jana Akpinar, Mathew Monfort | N/A | N/A |
| Attribute Diversity Determines the Systematicity Gap in VQA | Ian Berlot-Attwell, Kumar Krishna Agrawal, Annabelle Michael Carrell, Yash Sharma, Naomi Saphra | N/A | N/A |
| “Rows, Columns and Values, Oh My!” Synthesizing Scientific Literature into Tables using Language Models | Benjamin Newman, Yoonjoo Lee, Aakanksha Naik, Pao Siangliulue, Raymond Fok, Juho Kim, Daniel S Weld, Joseph Chee Chang, Kyle Lo | N/A | N/A |
| Development of Cognitive Intelligence in Pre-trained Language Models | Raj Sanjay Shah, Khushi Bhardwaj, Sashank Varma | N/A | N/A |
| Modeling Layout Reading Order as Ordering Relations for Visually-rich Document Understanding | Chong Zhang, Yi Tu, Yixi Zhao, Chenshu Yuan, Huan Chen, Yue Zhang, Mingxu Chai, Ya Guo, Huijia Zhu, Qi Zhang, Tao Gui | N/A | N/A |
| Birdie: Advancing State Space Models with a Minimalist Architecture and Novel Pre-training Objectives | Sam Blouir, Jimmy T.H. Smith, Antonios Anastasopoulos, Amarda Shehu | N/A | N/A |
| Is It Good Data for Multilingual Instruction Tuning or Just Bad Multilingual Evaluation for Large Language Models? | Pinzhen Chen, Simon Yu, Zhicheng Guo, Barry Haddow | N/A | N/A |
| Token Erasure as a Footprint of Implicit Vocabulary Items in LLMs | Sheridan Feucht, David Atkinson, Byron C Wallace, David Bau | N/A | N/A |
| TraveLER: A Modular Multi-LMM Agent Framework for Video Question-Answering | Chuyi Shang, Amos You, Sanjay Subramanian, Trevor Darrell, Roei Herzig | N/A | N/A |
| Evaluating the Effectiveness of Large Language Models in Establishing Conversational Grounding | Biswesh Mohapatra, Manav Nitin Kapadnis, Laurent Romary, Justine Cassell | N/A | N/A |
| Unlocking Memorization in Large Language Models with Dynamic Soft Prompting | Zhepeng Wang, Runxue Bao, Yawen Wu, Jackson Taylor, Cao Xiao, Feng Zheng, Weiwen Jiang, Shangqian Gao, Yanfu Zhang | N/A | N/A |
| If CLIP Could Talk: Understanding Vision-Language Model Representations Through Their Preferred Concept Descriptions | Reza Esfandiarpoor, Cristina Menghini, Stephen Bach | N/A | N/A |
| Extract, Define, Canonicalize: An LLM-based Framework for Knowledge Graph Construction | Bowen Zhang, Harold Soh | N/A | N/A |
| MQuinE: a Cure for “Z-paradox” in Knowledge Graph Embedding | Yang Liu, Huang Fang, Yunfeng Cai, Mingming Sun | N/A | N/A |
| Can Transformer Language Models Learn $n$-gram Language Models? | Anej Svete, Nadav Borenstein, Mike Zhou, Ryan Cotterell | N/A | N/A |
| StablePrompt: Automatic Prompt Tuning using Reinforcement Learning for Large Language Model | Minchan Kwon, Gaeun Kim, Jongsuk Kim, Haeil Lee, Junmo Kim | N/A | N/A |
| Summary of a Haystack: A Challenge to Long-Context LLMs and RAG Systems | Philippe Laban, Alexander Fabbri, Caiming Xiong, Chien-Sheng Wu | N/A | N/A |
| Multi-pass Decoding for Grammatical Error Correction | Xiaoying Wang, Lingling Mu, Jingyi Zhang, Hongfei Xu | N/A | N/A |
| Into the Unknown Unknowns: Engaged Human Learning through Participation in Language Model Agent Conversations | Yucheng Jiang, Yijia Shao, Dekun Ma, Sina Semnani, Monica Lam | N/A | N/A |
| SCOI: Syntax-augmented Coverage-based In-context Example Selection for Machine Translation | Chenming Tang, Zhixiang Wang, Yunfang Wu | N/A | N/A |
| Efficient Temporal Extrapolation of Multimodal Large Language Models with Temporal Grounding Bridge | Yuxuan Wang, Yueqian Wang, Pengfei Wu, Jianxin Liang, Dongyan Zhao, Yang Liu, Zilong Zheng | N/A | N/A |
| STORYSUMM: Evaluating Faithfulness in Story Summarization | Melanie Subbiah, Faisal Ladhak, Akankshya Mishra, Griffin Thomas Adams, Lydia Chilton, Kathleen McKeown | N/A | N/A |
| MMOE: Enhancing Multimodal Models with Mixtures of Multimodal Interaction Experts | Haofei Yu, Zhengyang Qi, Lawrence Keunho Jang, Russ Salakhutdinov, Louis-Philippe Morency, Paul Pu Liang | N/A | N/A |
| OmAgent: A Multi-modal Agent Framework for Complex Video Understanding with Task Divide-and-Conquer | Lu Zhang, Tiancheng Zhao, Heting Ying, Yibo Ma, Kyusong Lee | N/A | N/A |
| Enhancing Pre-Trained Generative Language Models with Question Attended Span Extraction on Machine Reading Comprehension | Lin Ai, Zheng Hui, Zizhou Liu, Julia Hirschberg | N/A | N/A |
| CommonIT: Commonality-Aware Instruction Tuning for Large Language Models via Data Partitions | Jun Rao, Xuebo Liu, Lian Lian, Shengjun Cheng, Yunjie Liao, Min Zhang | N/A | N/A |
| ESC: Efficient Speech Coding with Cross-Scale Residual Vector Quantized Transformers | Yuzhe Gu, Enmao Diao | N/A | N/A |
| Breaking ReLU Barrier: Generalized MoEfication for Dense Pretrained Models | Jaeseong Lee, Seung-won Hwang, Wonpyo Park, Mingi Ji | N/A | N/A |
| Detecting Subtle Differences between Human and Model Languages Using Spectrum of Relative Likelihood | Yang Xu, Yu Wang, Hao An, Yongyuan Li, Zhichen Liu | N/A | N/A |
| Optimizing Language Models with Fair and Stable Reward Composition in Reinforcement Learning | Jiahui Li, Hanlin Zhang, Fengda Zhang, Tai-Wei Chang, Kun Kuang, Long Chen, Jun Zhou | N/A | N/A |
| Fine-grained Pluggable Gradient Ascent for Knowledge Unlearning in Language Models | XiaoHua Feng, Chaochao Chen, Yuyuan Li, Zibin Lin | N/A | N/A |
| ARM: An Alignment-and-Replacement Module for Chinese Spelling Check Based on LLMs | Changchun Liu, Kai Zhang, Junzhe Jiang, Zirui Liu, Hanqing Tao, Min Gao, Enhong Chen | N/A | N/A |
| On the In-context Generation of Language Models | Zhongtao Jiang, Yuanzhe Zhang, Kun Luo, Xiaowei Yuan, Jun Zhao, Kang Liu | N/A | N/A |
| Atomic Inference for NLI with Generated Facts as Atoms | Joe Stacey, Pasquale Minervini, Haim Dubossarsky, Oana-Maria Camburu, Marek Rei | N/A | N/A |
| Towards Robust Speech Representation Learning for Thousands of Languages | William Chen, Wangyou Zhang, Yifan Peng, Xinjian Li, Jinchuan Tian, Jiatong Shi, Xuankai Chang, Soumi Maiti, Karen Livescu, Shinji Watanabe | N/A | N/A |
| I Learn Better If You Speak My Language: Understanding the Superior Performance of Fine-Tuning Large Language Models with LLM-Generated Responses | Xuan Ren, Biao Wu, Lingqiao Liu | N/A | N/A |
| PreAlign: Boosting Cross-Lingual Transfer by Early Establishment of Multilingual Alignment | Jiahuan Li, Shujian Huang, Aarron Ching, Xinyu Dai, Jiajun Chen | N/A | N/A |
| An image speaks a thousand words, but can everyone listen? On image transcreation for cultural relevance | Simran Khanuja, Sathyanarayanan Ramamoorthy, Yueqi Song, Graham Neubig | N/A | N/A |
| When Parts are Greater Than Sums: Individual LLM Components Can Outperform Full Models | Ting-Yun Chang, Jesse Thomason, Robin Jia | N/A | N/A |
| Multimodal Clickbait Detection by De-confounding Biases Using Causal Representation Inference | Jianxing Yu, Shiqi Wang, Han Yin, Zhenlong Sun, Ruobing Xie, Bo Zhang, Yanghui Rao | N/A | N/A |
| Matryoshka-Adaptor: Unsupervised and Supervised Tuning for Smaller Embedding Dimensions | Jinsung Yoon, Rajarishi Sinha, Sercan O Arik, Tomas Pfister | N/A | N/A |
| KNN-Instruct: Automatic Instruction Construction with K Nearest Neighbor Deduction | Jianshang Kou, Benfeng Xu, Chiwei Zhu, Zhendong Mao | N/A | N/A |
| Contextualized Sequence Likelihood: Enhanced Confidence Scores for Natural Language Generation | Zhen Lin, Shubhendu Trivedi, Jimeng Sun | N/A | N/A |
| MixGR: Enhancing Retriever Generalization for Scientific Domain through Complementary Granularity | Fengyu Cai, Xinran Zhao, Tong Chen, Sihao Chen, Hongming Zhang, Iryna Gurevych, Heinz Koeppl | N/A | N/A |
| CARER - ClinicAl Reasoning-Enhanced Representation for Temporal Health Risk Prediction | Tuan Dung Nguyen, Thanh Trung Huynh, Minh Hieu Phan, Quoc Viet Hung Nguyen, Phi Le Nguyen | N/A | N/A |
| “In-Dialogues We Learn”: Towards Personalized Dialogue Without Pre-defined Profiles through In-Dialogue Learning | Chuanqi Cheng, Quan Tu, Wei Wu, Shuo Shang, Cunli Mao, Zhengtao Yu, Rui Yan | N/A | N/A |
| Encourage or Inhibit Monosemanticity? Revisit Monosemanticity from a Feature Decorrelation Perspective | Hanqi Yan, Yanzheng Xiang, Guangyi Chen, Yifei Wang, Lin Gui, Yulan He | N/A | N/A |
| Enhancing Language Model Factuality via Activation-Based Confidence Calibration and Guided Decoding | Xin Liu, Farima Fatahi Bayat, Lu Wang | N/A | N/A |
| Reasoning Robustness of LLMs to Adversarial Typographical Errors | Esther Gan, Yiran Zhao, Liying Cheng, Mao Yancan, Anirudh Goyal, Kenji Kawaguchi, Min-Yen Kan, Michael Shieh | N/A | N/A |
| InferAligner: Inference-Time Alignment for Harmlessness through Cross-Model Guidance | Pengyu Wang, Dong Zhang, Linyang Li, Chenkun Tan, Xinghao Wang, Mozhi Zhang, Ke Ren, Botian Jiang, Xipeng Qiu | N/A | N/A |
| Belief Revision: The Adaptability of Large Language Models Reasoning | Bryan Wilie, Samuel Cahyawijaya, Etsuko Ishii, Junxian He, Pascale Fung | N/A | N/A |
| Fisher Information-based Efficient Curriculum Federated Learning with Large Language Models | Ji Liu, Jiaxiang Ren, Ruoming Jin, Zijie Zhang, Yang Zhou, Patrick Valduriez, Dejing Dou | N/A | N/A |
| Bio-RFX: Refining Biomedical Extraction via Advanced Relation Classification and Structural Constraints | Minjia Wang, Fangzhou Liu, Xiuxing Li, Bowen Dong, Zhenyu Li, Tengyu Pan, Jianyong Wang | N/A | N/A |
| Decoding Matters: Addressing Amplification Bias and Homogeneity Issue in Recommendations for Large Language Models | Keqin Bao, Jizhi Zhang, Yang Zhang, Xinyue Huo, Chong Chen, Fuli Feng | N/A | N/A |
| LLMs Are Prone to Fallacies in Causal Inference | Nitish Joshi, Abulhair Saparov, Yixin Wang, He He | N/A | N/A |
| Roleplay-doh: Enabling Domain-Experts to Create LLM-simulated Patients via Eliciting and Adhering to Principles | Ryan Louie, Ananjan Nandi, William Fang, Cheng Chang, Emma Brunskill, Diyi Yang | N/A | N/A |
| The Lou Dataset - Exploring the Impact of Gender-Fair Language in German Text Classification | Andreas Waldis, Joel Birrer, Anne Lauscher, Iryna Gurevych | N/A | N/A |
| When Generative Adversarial Networks Meet Sequence Labeling Challenges | Yu Tong, Ge Chen, Guokai Zheng, Rui Li, Jiang Dazhi | N/A | N/A |
| Evidence-Focused Fact Summarization for Knowledge-Augmented Zero-Shot Question Answering | Sungho Ko, Hyunjin Cho, Hyungjoo Chae, Jinyoung Yeo, Dongha Lee | N/A | N/A |
| Speechworthy Instruction-tuned Language Models | Hyundong Justin Cho, Nicolaas Paul Jedema, Leonardo F. R. Ribeiro, Karishma Sharma, Pedro Szekely, Alessandro Moschitti, Ruben Janssen, Jonathan May | N/A | N/A |
| Data, Data Everywhere: A Guide for Pretraining Dataset Construction | Jupinder Parmar, Shrimai Prabhumoye, Joseph Jennings, Bo Liu, Aastha Jhunjhunwala, Zhilin Wang, Mostofa Patwary, Mohammad Shoeybi, Bryan Catanzaro | N/A | N/A |
| Fine-Tuning and Prompt Optimization: Two Good Steps that Work Better Together | Dilara Soylu, Christopher Potts, Omar Khattab | N/A | N/A |
| Demystifying Verbatim Memorization in Large Language Models | Jing Huang, Diyi Yang, Christopher Potts | N/A | N/A |
| AmbigNLG: Addressing Task Ambiguity in Instruction for NLG | Ayana Niwa, Hayate Iso | N/A | N/A |
| Distributional Properties of Subword Regularization | Marco Cognetta, Vilém Zouhar, Naoaki Okazaki | N/A | N/A |
| DataTales: A Benchmark for Real-World Intelligent Data Narration | Yajing Yang, Qian Liu, Min-Yen Kan | N/A | N/A |
| Towards Fast Multilingual LLM Inference: Speculative Decoding and Specialized Drafters | Euiin Yi, Taehyeon Kim, Hongseok Jeung, Du-Seong Chang, Se-Young Yun | N/A | N/A |
| GlobeSumm: A Challenging Benchmark Towards Unifying Multi-lingual, Cross-lingual and Multi-document News Summarization | Yangfan Ye, Xiachong Feng, Xiaocheng Feng, Weitao Ma, Libo Qin, Dongliang Xu, Qing Yang, Hongtao Liu, Bing Qin | N/A | N/A |
| Breaking the Curse of Multilinguality with Cross-lingual Expert Language Models | Terra Blevins, Tomasz Limisiewicz, Suchin Gururangan, Margaret Li, Hila Gonen, Noah A. Smith, Luke Zettlemoyer | N/A | N/A |
| More Insightful Feedback for Tutoring: Enhancing Generation Mechanisms and Automatic Evaluation | Wencke Liermann, Jin-Xia Huang, Yohan Lee, Kong Joo Lee | N/A | N/A |
| Stable Language Model Pre-training by Reducing Embedding Variability | Woojin Chung, Jiwoo Hong, Na Min An, James Thorne, Se-Young Yun | N/A | N/A |
| What is lost in Normalization? Exploring Pitfalls in Multilingual ASR Model Evaluations | Kavya Manohar, Leena G Pillai | N/A | N/A |
| Diversity Over Size: On the Effect of Sample and Topic Sizes for Topic-Dependent Argument Mining Datasets | Benjamin Schiller, Johannes Daxenberger, Andreas Waldis, Iryna Gurevych | N/A | N/A |
| Kiss up, Kick down: Exploring Behavioral Changes in Multi-modal Large Language Models with Assigned Visual Personas | Seungjong Sun, Eungu Lee, Seo Yeon Baek, Seunghyun Hwang, Lee Wonbyung, Dongyan Nan, Bernard J Jansen, Jang Hyun Kim | N/A | N/A |
| ATM: Adversarial Tuning Multi-agent System Makes a Robust Retrieval-Augmented Generator | Junda Zhu, Lingyong Yan, Haibo Shi, Dawei Yin, Lei Sha | N/A | N/A |
| Dynamic Multi-granularity Attribution Network for Aspect-based Sentiment Analysis | Yanjiang Chen, Kai Zhang, hufeng, Xianquan Wang, Ruikang Li, Qi Liu | N/A | N/A |
| Unlabeled Debiasing in Downstream Tasks via Class-wise Low Variance Regularization | Shahed Masoudian, Markus Frohmann, Navid Rekabsaz, Markus Schedl | N/A | N/A |
| Large Language Models Know What is Key Visual Entity: An LLM-assisted Multimodal Retrieval for VQA | Pu Jian, Donglei Yu, Jiajun Zhang | N/A | N/A |
| Towards Probing Speech-Specific Risks in Large Multimodal Models: A Taxonomy, Benchmark, and Insights | Hao Yang, Lizhen Qu, Ehsan Shareghi, Reza Haf | N/A | N/A |
| Self-AMPLIFY: Improving Small Language Models with Self Post Hoc Explanations | Milan Bhan, Jean-Noël Vittaut, Nicolas Chesneau, Marie-Jeanne Lesot | N/A | N/A |
| What are the Generator Preferences for End-to-end Task-Oriented Dialog System? | Wanshi Xu, Xianwei Zhuang, Zhanpeng Chen, Zhihong Zhu, Xuxin Cheng, Yuexian Zou | N/A | N/A |
| Paraphrase Types Elicit Prompt Engineering Capabilities | Jan Philip Wahle, Terry Ruas, Yang Xu, Bela Gipp | N/A | N/A |
| VLEU: a Method for Automatic Evaluation for Generalizability of Text-to-Image Models | Jingtao Cao, Zhang Zheng, Hongru Wang, Kam-Fai Wong | N/A | N/A |
| Towards Online Continuous Sign Language Recognition and Translation | Ronglai Zuo, Fangyun Wei, Brian Mak | N/A | N/A |
| Mitigate Extrinsic Social Bias in Pre-trained Language Models via Continuous Prompts Adjustment | Yiwei Dai, Hengrui Gu, Ying Wang, Xin Wang | N/A | N/A |
| Split and Merge: Aligning Position Biases in LLM-based Evaluators | Zongjie Li, Chaozheng Wang, Pingchuan Ma, Daoyuan Wu, Shuai Wang, Cuiyun Gao, Yang Liu | N/A | N/A |
| Integrating Argumentation and Hate-Speech-based Techniques for Countering Misinformation | Sougata Saha, Rohini Srihari | N/A | N/A |
| BPO: Supercharging Online Preference Learning by Adhering to the Proximity of Behavior LLM | Wenda Xu, Jiachen Li, William Yang Wang, Lei Li | N/A | N/A |
| One2Set + Large Language Model: Best Partners for Keyphrase Generation | Liangying Shao, Liang Zhang, Minlong Peng, Guoqi Ma, Hao Yue, Mingming Sun, Jinsong Su | N/A | N/A |
| Unlocking Markets: A Multilingual Benchmark to Cross-Market Question Answering | Yifei Yuan, Yang Deng, Anders Søgaard, Mohammad Aliannejadi | N/A | N/A |
| ORPO: Monolithic Preference Optimization without Reference Model | Jiwoo Hong, Noah Lee, James Thorne | N/A | N/A |
| A Multi-Perspective Analysis of Memorization in Large Language Models | Bowen Chen, Namgi Han, Yusuke Miyao | N/A | N/A |
| Do LLMs suffer from Multi-Party Hangover? A Diagnostic Approach to Addressee Recognition and Response Selection in Conversations | Nicolò Penzo, Maryam Sajedinia, Bruno Lepri, Sara Tonelli, Marco Guerini | N/A | N/A |
| Code Prompting Elicits Conditional Reasoning Abilities in Text+Code LLMs | Haritz Puerto, Martin Tutek, Somak Aditya, Xiaodan Zhu, Iryna Gurevych | N/A | N/A |
| Unveiling the Role of Pretraining in Direct Speech Translation | Belen Alastruey, Gerard I. Gállego, Marta R. Costa-jussà | N/A | N/A |
| PCQPR: Proactive Conversational Question Planning with Reflection | Shasha Guo | N/A | N/A |
| CodeAgent: Autonomous Communicative Agents for Code Review | Xunzhu Tang, Kisub Kim, Yewei Song, Cedric Lothritz, Bei Li, Saad Ezzini, Haoye Tian, Jacques Klein, Tegawendé F. Bissyandé | N/A | N/A |
| TroL: Traversal of Layers for Large Language and Vision Models | Byung-Kwan Lee, Sangyun Chung, Chae Won Kim, Beomchan Park, Yong Man Ro | N/A | N/A |
| MMTE: Corpus and Metrics for Evaluating Machine Translation Quality of Metaphorical Language | Shun Wang, Ge Zhang, Han Wu, Tyler Loakman, Wenhao Huang, Chenghua Lin | N/A | N/A |
| Revisiting Supertagging for faster HPSG parsing | Olga Zamaraeva, Carlos Gómez-Rodríguez | N/A | N/A |
| Improve Dense Passage Retrieval with Entailment Tuning | Lu Dai, Hao Liu, Hui Xiong | N/A | N/A |
| ToolBeHonest: A Multi-level Hallucination Diagnostic Benchmark for Tool-Augmented Large Language Models | Yuxiang Zhang, Jing Chen, Junjie Wang, Yaxin Liu, Cheng Yang, Chufan Shi, Xinyu Zhu, Zihao Lin, Hanwen Wan, Yujiu Yang, Tetsuya Sakai, Tian Feng, Hayato Yamana | N/A | N/A |
| TEMA: Token Embeddings Mapping for Enriching Low-Resource Language Models | Rodolfo Zevallos, Núria Bel, Mireia Farrús | N/A | N/A |
| DECOR: Improving Coherence in L2 English Writing with a Novel Benchmark for Incoherence Detection, Reasoning, and Rewriting | Xuanming Zhang, Anthony Diaz, Zixun Chen, Qingyang Wu, Kun Qian, Erik Voss, Zhou Yu | N/A | N/A |
| Text2Chart31: Instruction Tuning for Chart Generation with Automatic Feedback | Fatemeh Pesaran zadeh, Juyeon Kim, Jin-Hwa Kim, Gunhee Kim | N/A | N/A |
| PrExMe: Large Scale Prompt Exploration of Open Source LLMs for Machine Translation and Summarization Evaluation | Christoph Leiter, Steffen Eger | N/A | N/A |
| Universal Vulnerabilities in Large Language Models: Backdoor Attacks for In-context Learning | Shuai Zhao, Meihuizi Jia, Anh Tuan Luu, Fengjun Pan, Jinming Wen | N/A | N/A |
| Repairs in a Block World: A New Benchmark for Handling User Corrections with Multi-Modal Language Models | Javier Chiyah-Garcia, Alessandro Suglia, Arash Eshghi | N/A | N/A |
| Beyond the Turn-Based Game: Enabling Real-Time Conversations with Duplex Models | Xinrong Zhang, Yingfa Chen, Shengding Hu, Xu Han, Zihang Xu, Yuanwei Xu, Weilin Zhao, Maosong Sun, Zhiyuan Liu | N/A | N/A |
| Strengthening Structural Inductive Biases by Pre-training to Perform Syntactic Transformations | Matthias Lindemann, Alexander Koller, Ivan Titov | N/A | N/A |
| Puzzle Solving using Reasoning of Large Language Models: A Survey | Panagiotis Giadikiaroglou, Maria Lymperaiou, Giorgos Filandrianos, Giorgos Stamou | N/A | N/A |
| SciEx: Benchmarking Large Language Models on Scientific Exams with Human Expert Grading and Automatic Grading | Tu Anh Dinh, Carlos Mullov, Leonard Bärmann, Zhaolin Li, Danni Liu, Simon Reiß, Jueun Lee, Nathan Lerzer, Jianfeng Gao, Fabian Peller-Konrad, Alexander Waibel, Tamim Asfour, Michael Beigl, Rainer Stiefelhagen, Carsten Dachsbacher, Klemens Böhm, Jan Niehues | N/A | N/A |
| Red Teaming Language Models for Processing Contradictory Dialogues | Xiaofei Wen, Bangzheng Li, Tenghao Huang, Muhao Chen | N/A | N/A |
| Fishing for Magikarp: Automatically Detecting Under-trained Tokens in Large Language Models | Sander Land, Max Bartolo | N/A | N/A |
| Reasoning or a Semblance of it? A Diagnostic Study of Transitive Reasoning in LLMs | Houman Mehrafarin, Arash Eshghi, Ioannis Konstas | N/A | N/A |
| Don’t Underestimate the Octopus - Why The Symbol Grounding Problem Does Not Apply to LLMs | Reto Gubelmann | N/A | N/A |
| Major Entity Identification: A Generalizable Alternative to Coreference Resolution | Kawshik Manikantan, Shubham Toshniwal, Makarand Tapaswi, Vineet Gandhi | N/A | N/A |
| Enhancing High-order Interaction Awareness in LLM-based Recommender Model | Xinfeng Wang, Jin Cui, Fumiyo Fukumoto, Yoshimi Suzuki | N/A | N/A |
| What Are the Odds? Language Models Are Capable of Probabilistic Reasoning | Akshay Paruchuri, Jake Garrison, Shun Liao, John B Hernandez, Jacob Sunshine, Tim Althoff, Xin Liu, Daniel McDuff | N/A | N/A |
| MARE: Multi-Aspect Rationale Extractor on Unsupervised Rationale Extraction | Han Jiang, Junwen Duan, Zhe Qu, Jianxin Wang | N/A | N/A |
| LoRA-Guard: Parameter-Efficient Guardrail Adaptation for Content Moderation of Large Language Models | Hayder Elesedy, Pedro M Esperanca, Silviu Vlad Oprea, Mete Ozay | N/A | N/A |
| “A good pun is its own reword”: Can Large Language Models Understand Puns? | Zhijun Xu, Siyu Yuan, Lingjie Chen, Deqing Yang | N/A | N/A |
| QGEval: Benchmarking Multi-dimensional Evaluation for Question Generation | Weiping Fu, Bifan Wei, Jianxiang Hu, Zhongmin Cai, Jun Liu | N/A | N/A |
| Dependency Graph Parsing as Sequence Labeling | Ana Ezquerro, David Vilares, Carlos Gómez-Rodríguez | N/A | N/A |
| NuNER: Entity Recognition Encoder Pre-training via LLM-Annotated Data | Sergei Bogdanov, Alexandre Constantin, Timothée Bernard, Benoit Crabbé, Etienne P Bernard | N/A | N/A |
| Towards a Greek Proverb Atlas: Computational Spatial Exploration and Attribution of Greek Proverbs | John Pavlopoulos, Panos Louridas, Panagiotis Filos | N/A | N/A |
| Unraveling Babel: Exploring Multilingual Activation Patterns of LLMs and Their Applications | Weize Liu, Yinlong Xu, Hongxia Xu, Jintai Chen, Xuming Hu, Jian Wu | N/A | N/A |
| Advancing Semantic Textual Similarity Modeling: A Regression Framework with Translated ReLU and Smooth K2 Loss | Bowen Zhang, Chunping Li | N/A | N/A |
| Rationalizing Transformer Predictions via End-To-End Differentiable Self-Training | Marc Felix Brinner, Sina Zarrieß | N/A | N/A |
| Segment Any Text: A Universal Approach for Robust, Efficient and Adaptable Sentence Segmentation | Markus Frohmann, Igor Sterner, Ivan Vulić, Benjamin Minixhofer, Markus Schedl | N/A | N/A |
| Applying Contrastive Learning to Code Vulnerability Type Classification | Chen Ji, Su Yang, Hongyu Sun, Yuqing Zhang | N/A | N/A |
| TheoremLlama: Transforming General-Purpose LLMs into Lean4 Experts | Ruida Wang, Jipeng Zhang, Yizhen Jia, Rui Pan, Shizhe Diao, Renjie Pi, Tong Zhang | N/A | N/A |
| Multi-Level Cross-Modal Alignment for Speech Relation Extraction | Liang Zhang, Zhen Yang, Biao Fu, Ziyao Lu, Liangying Shao, Shiyu Liu, Fandong Meng, Jie Zhou, Xiaoli Wang, Jinsong Su | N/A | N/A |
| Self-Training for Sample-Efficient Active Learning for Text Classification with Pre-Trained Language Models | Christopher Schröder, Gerhard Heyer | N/A | N/A |
| PANDA: Persona Attributes Navigation for Detecting and Alleviating Overuse Problem in Large Language Models | Jinsung Kim, Seonmin Koo, Heuiseok Lim | N/A | N/A |
| The Multilingual Alignment Prism: Aligning Global and Local Preferences to Reduce Harm | Aakanksha, Arash Ahmadian, Beyza Ermis, Seraphina Goldfarb-Tarrant, Julia Kreutzer, Marzieh Fadaee, Sara Hooker | N/A | N/A |
| Subword Segmentation in LLMs: Looking at Inflection and Consistency | Marion Di Marco, Alexander Fraser | N/A | N/A |
| Explicit, Implicit, and Scattered: Revisiting Event Extraction to Capture Complex Arguments | Omar Sharif, Joseph Gatto, Madhusudan Basak, Sarah Masud Preum | N/A | N/A |
| Let Me Teach You: Pedagogical Foundations of Feedback for Language Models | Beatriz Borges, Niket Tandon, Tanja Käser, Antoine Bosselut | N/A | N/A |
| Unknown Claims: Generation of Fact-Checking Training Examples from Unstructured and Structured Data | Jean-Flavien Bussotti, Luca Ragazzi, Giacomo Frisoni, Gianluca Moro, Paolo Papotti | N/A | N/A |
| TL-CL: Task And Language Incremental Continual Learning | Shrey Satapara, P. K. Srijith | N/A | N/A |
| Medical Adaptation of Large Language and Vision-Language Models: Are We Making Progress? | Daniel P Jeong, Saurabh Garg, Zachary Chase Lipton, Michael Oberst | N/A | N/A |
| Empowering Multi-step Reasoning across Languages via Program-Aided Language Models | Leonardo Ranaldi, Giulia Pucci | N/A | N/A |
| Do LLMs Overcome Shortcut Learning? An Evaluation of Shortcut Challenges in Large Language Models | Yu Yuan, Lili Zhao, Kai Zhang, Guangting Zheng, Qi Liu | N/A | N/A |
| ControlMath: Controllable Data Generation Promotes Math Generalist Models | Nuo Chen, Ning Wu, Jianhui Chang, Ming Gong, Linjun Shou, Dongmei Zhang, Jia Li | N/A | N/A |
| Where Am I From? Identifying Origin of LLM-generated Content | Liying Li, Yihan Bai, Minhao Cheng | N/A | N/A |
| ReadMe++: Benchmarking Multilingual Language Models for Multi-Domain Readability Assessment | Tarek Naous, Michael J Ryan, Anton Lavrouk, Mohit Chandra, Wei Xu | N/A | N/A |
| GlossLM: A Massively Multilingual Corpus and Pretrained Model for Interlinear Glossed Text | Michael Ginn, Lindia Tjuatja, Taiqi He, Enora Rice, Graham Neubig, Alexis Palmer, Lori Levin | N/A | N/A |
| GDTB: Genre Diverse Data for English Shallow Discourse Parsing across Modalities, Text Types, and Domains | Yang Janet Liu, Tatsuya Aoyama, Wesley Scivetti, Yilun Zhu, Shabnam Behzad, Lauren Elizabeth Levine, Jessica Lin, Devika Tiwari, Amir Zeldes | N/A | N/A |
| RA2FD: Distilling Faithfulness into Efficient Dialogue Systems | Zhiyuan Zhu, Yusheng Liao, Chenxin Xu, Yunfeng Guan, Yanfeng Wang, Yu Wang | N/A | N/A |
| Subjective Topic meets LLMs: Unleashing Comprehensive, Reflective and Creative Thinking through the Negation of Negation | Fangrui Lv, Kaixiong Gong, Jian Liang, Xinyu Pang, Changshui Zhang | N/A | N/A |
| Experimental Contexts Can Facilitate Robust Semantic Property Inference in Language Models, but Inconsistently | Kanishka Misra, Allyson Ettinger, Kyle Mahowald | N/A | N/A |
| Leveraging Estimated Transferability Over Human Intuition for Model Selection in Text Ranking | Jun Bai, Zhuofan Chen, Zhenzi Li, Hanhua Hong, Jianfei Zhang, Chen Li, Chenghua Lin, Wenge Rong | N/A | N/A |
| A Coordinate System for In-Context Learning | Anhao Zhao, Fanghua Ye, Jinlan Fu, Xiaoyu Shen | N/A | N/A |
| Self-Powered LLM Modality Expansion for Large Speech-Text Models | Tengfei Yu, Xuebo Liu, Zhiyi Hou, Liang Ding, Dacheng Tao, Min Zhang | N/A | N/A |
| ABSEval: An Agent-based Framework for Script Evaluation | Sirui Liang, Baoli Zhang, Jun Zhao, Kang Liu | N/A | N/A |
| Latent Concept-based Explanation of NLP Models | Xuemin Yu, Fahim Dalvi, Nadir Durrani, Marzia Nouri, Hassan Sajjad | N/A | N/A |
| Decoding with Limited Teacher Supervision Requires Understanding When to Trust the Teacher | Hyunjong Ok, Jegwang Ryu, Jaeho Lee | N/A | N/A |
| Enhancing Data Quality through Simple De-duplication: Navigating Responsible Computational Social Science Research | Yida Mu, Mali Jin, Xingyi Song, Nikolaos Aletras | N/A | N/A |
| The Mystery of the Pathological Path-star Task for Language Models | Arvid Frydenlund | N/A | N/A |
| Voices in a Crowd: Searching for clusters of unique perspectives | Nikolas Vitsakis, Amit Parekh, Ioannis Konstas | N/A | N/A |
| Neeko: Leveraging Dynamic LoRA for Efficient Multi-Character Role-Playing Agent | Xiaoyan Yu, Tongxu Luo, Yifan Wei, Fangyu Lei, Yiming Huang, Hao Peng, Liehuang Zhu | N/A | N/A |
| SLANG: New Concept Comprehension of Large Language Models | Lingrui Mei, Shenghua Liu, Yiwei Wang, Baolong Bi, Xueqi Cheng | N/A | N/A |
| Towards Interpretable Sequence Continuation: Analyzing Shared Circuits in Large Language Models | Michael Lan, Philip Torr, Fazl Barez | N/A | N/A |
| Why Does New Knowledge Create Messy Ripple Effects in LLMs? | Jiaxin Qin, Zixuan Zhang, Chi Han, Pengfei Yu, Manling Li, Heng Ji | N/A | N/A |
| Lifelong Event Detection via Optimal Transport | Viet Dao, Van-Cuong Pham, Quyen Tran, Thanh-Thien Le, Linh Van Ngo, Thien Huu Nguyen | N/A | N/A |
| SUPER: Evaluating Agents on Setting Up and Executing Tasks from Research Repositories | Ben Bogin, Kejuan Yang, Shashank Gupta, Kyle Richardson, Erin Bransom, Peter Clark, Ashish Sabharwal, Tushar Khot | N/A | N/A |
| FIRST: Teach A Reliable Large Language Model Through Efficient Trustworthy Distillation | KaShun Shum, Minrui Xu, Jianshu Zhang, Zixin Chen, Shizhe Diao, Hanze Dong, Jipeng Zhang, Muhammad Omer Raza | N/A | N/A |
| Domain adapted machine translation: What does catastrophic forgetting forget and why? | Danielle Saunders, Steve DeNeefe | N/A | N/A |
| Enhancing AI Assisted Writing with One-Shot Implicit Negative Feedback | Benjamin Towle, Ke Zhou | N/A | N/A |
| Atomic Self-Consistency for Better Long Form Generations | Raghuveer Thirukovalluru, Yukun Huang, Bhuwan Dhingra | N/A | N/A |
| “Global is Good, Local is Bad?”: Understanding Brand Bias in LLMs | Mahammed Kamruzzaman, Hieu Minh Nguyen, Gene Louis Kim | N/A | N/A |
| Optimizing Rare Word Accuracy in Direct Speech Translation with a Retrieval-and-Demonstration Approach | Siqi Li, Danni Liu, Jan Niehues | N/A | N/A |
| ACE: A LLM-based Negotiation Coaching System | Ryan Shea, Aymen Kallala, Xin Lucy Liu, Michael W. Morris, Zhou Yu | N/A | N/A |
| TransferTOD: A Generalizable Chinese Multi-Domain Task-Oriented Dialogue System with Transfer Capabilities | Ming Zhang, Caishuang Huang, Yilong Wu, Shichun Liu, Huiyuan Zheng, Yurui Dong, Yujiong Shen, Shihan Dou, Jun Zhao, Junjie Ye, Qi Zhang, Tao Gui, Xuanjing Huang | N/A | N/A |
| PATIENT-Ψ: Using Large Language Models to Simulate Patients for Training Mental Health Professionals | Ruiyi Wang, Stephanie Milani, Jamie C. Chiu, Jiayin Zhi, Shaun M. Eack, Travis Labrum, Samuel M Murphy, Nev Jones, Kate V Hardy, Hong Shen, Fei Fang, Zhiyu Chen | N/A | N/A |
| DKEC: Domain Knowledge Enhanced Multi-Label Classification for Diagnosis Prediction | Xueren Ge, Abhishek Satpathy, Ronald Dean Williams, John Stankovic, Homa Alemzadeh | N/A | N/A |
| ModSCAN: Measuring Stereotypical Bias in Large Vision-Language Models from Vision and Language Modalities | Yukun Jiang, Zheng Li, Xinyue Shen, Yugeng Liu, Michael Backes, Yang Zhang | N/A | N/A |
| Large Language Models Can Self-Correct with Key Condition Verification | Zhenyu Wu, Qingkai Zeng, Zhihan Zhang, Zhaoxuan Tan, Chao Shen, Meng Jiang | N/A | N/A |
| Learning to Write Rationally: How Information Is Distributed in Non-native Speakers’ Essays | Zixin Tang, Janet van Hell | N/A | N/A |
| Defending Against Social Engineering Attacks in the Age of LLMs | Lin Ai, Tharindu Sandaruwan Kumarage, Amrita Bhattacharjee, Zizhou Liu, Zheng Hui, Michael S. Davinroy, James Cook, Laura Cassani, Kirill Trapeznikov, Matthias Kirchner, Arslan Basharat, Anthony Hoogs, Joshua Garland, Huan Liu, Julia Hirschberg | N/A | N/A |
| Heterogeneous LoRA for Federated Fine-tuning of On-Device Foundation Models | Yae Jee Cho, Luyang Liu, Zheng Xu, Aldi Fahrezi, Gauri Joshi | N/A | N/A |
| Make Some Noise: Unlocking Language Model Parallel Inference Capability through Noisy Training | Yixuan Wang, Xianzhen Luo, Fuxuan Wei, Yijun Liu, Qingfu Zhu, Xuanyu Zhang, Qing Yang, Dongliang Xu, Wanxiang Che | N/A | N/A |
| Target-Aware Language Modeling via Granular Data Sampling | Ernie Chang, Pin-Jie Lin, Yang Li, Changsheng Zhao, Daeil Kim, Rastislav Rabatin, Zechun Liu, Yangyang Shi, Vikas Chandra | N/A | N/A |
| SPEED++: A Multilingual Event Extraction Framework for Epidemic Prediction and Preparedness | Tanmay Parekh, Jeffrey Kwan, Jiarui Yu, Sparsh Johri, Hyosang Ahn, Sreya Muppalla, Kai-Wei Chang, Wei Wang, Nanyun Peng | N/A | N/A |
| Learning from Feedback with Coupled Comprehension and Generation | Mustafa Omer Gul, Yoav Artzi | N/A | N/A |
| UNICORN: A Unified Causal Video-Oriented Language-Modeling Framework for Temporal Video-Language Tasks | Yuanhao Xiong, Yixin Nie, Haotian Liu, Boxin Wang, Jun Chen, Rong Jin, Cho-Jui Hsieh, Lorenzo Torresani, Jie Lei | N/A | N/A |
| Story Morals: Surfacing value-driven narrative schemas using large language models | David G Hobson, Haiqi Zhou, Derek Ruths, Andrew Piper | N/A | N/A |
| OATH-Frames: Characterizing Online Attitudes Towards Homelessness with LLM Assistants | Jaspreet Ranjit, Brihi Joshi, Rebecca Dorn, Laura Petry, Olga Koumoundouros, Jayne Bottarini, Peichen Liu, Eric Rice, Swabha Swayamdipta | N/A | N/A |
| AnaloBench: Benchmarking the Identification of Abstract and Long-context Analogies | Xiao Ye, Andrew Wang, Jacob Choi, Yining Lu, Shreya Sharma, Lingfeng Shen, Vijay Murari Tiyyala, Nicholas Andrews, Daniel Khashabi | N/A | N/A |
| SciER: An Entity and Relation Extraction Dataset for Datasets, Methods, and Tasks in Scientific Documents | Qi Zhang, Zhijia Chen, Huitong Pan, Cornelia Caragea, Longin Jan Latecki, Eduard Dragut | N/A | N/A |
| Analysis of Plan-based Retrieval for Grounded Text Generation | Ameya Godbole, Nicholas Monath, Seungyeon Kim, Ankit Singh Rawat, Andrew McCallum, Manzil Zaheer | N/A | N/A |
| Detecting Errors through Ensembling Prompts (DEEP): An End-to-End LLM Framework for Detecting Factual Errors | Alex Chandler, Devesh Surve, Hui Su | N/A | N/A |
| RLHF Can Speak Many Languages: Unlocking Multilingual Preference Optimization for LLMs | John Dang, Arash Ahmadian, Kelly Marchisio, Julia Kreutzer, Ahmet Üstün, Sara Hooker | N/A | N/A |
| Improving Logical Fallacy Reasoning with Logical Structure Tree | Yuanyuan Lei, Ruihong Huang | N/A | N/A |
| Chain and Causal Attention for Efficient Entity Tracking | Erwan Fagnou, Paul Caillon, Blaise Delattre, Alexandre Allauzen | N/A | N/A |
| BEEAR: Embedding-based Adversarial Removal of Safety Backdoors in Instruction-tuned Language Models | Yi Zeng, Weiyu Sun, Tran Ngoc Huynh, Dawn Song, Bo Li, Ruoxi Jia | N/A | N/A |
| A Bayesian Approach to Harnessing the Power of LLMs in Authorship Attribution | Zhengmian Hu, Tong Zheng, Heng Huang | N/A | N/A |
| FAC$^2$E: Better Understanding Large Language Model Capabilities by Dissociating Language and Cognition | Xiaoqiang Wang, Lingfei Wu, Tengfei Ma, Bang Liu | N/A | N/A |
| OpenSep: Leveraging Large Language Models with Textual Inversion for Open World Audio Separation | Tanvir Mahmud, Diana Marculescu | N/A | N/A |
| Language Concept Erasure for Language-invariant Dense Retrieval | Zhiqi Huang, Puxuan Yu, Shauli Ravfogel, James Allan | N/A | N/A |
| Learning Personalized Alignment for Evaluating Open-ended Text Generation | Danqing Wang, Kevin Yang, Hanlin Zhu, Xiaomeng Yang, Andrew Cohen, Lei Li, Yuandong Tian | N/A | N/A |
| Large Language Models Are Involuntary Truth-Tellers: Exploiting Fallacy Failure for Jailbreak Attacks | Yue Zhou, Henry Peng Zou, Barbara Di Eugenio, Yang Zhang | N/A | N/A |
| Turn Waste into Worth: Rectifying Top-$k$ Router of MoE | Zhiyuan Zeng, Qipeng Guo, Zhaoye Fei, Zhangyue Yin, Yunhua Zhou, Linyang Li, Tianxiang Sun, Hang Yan, Dahua Lin, Xipeng Qiu | N/A | N/A |
| Null-Shot Prompting: Rethinking Prompting Large Language Models With Hallucination | Pittawat Taveekitworachai, Febri Abdullah, Ruck Thawonmas | N/A | N/A |
| CommVQA: Situating Visual Question Answering in Communicative Contexts | Nandita Shankar Naik, Christopher Potts, Elisa Kreiss | N/A | N/A |
| Ouroboros: Generating Longer Drafts Phrase by Phrase for Faster Speculative Decoding | Weilin Zhao, Yuxiang Huang, Xu Han, Wang Xu, Chaojun Xiao, Xinrong Zhang, Yewei Fang, Kaihuo Zhang, Zhiyuan Liu, Maosong Sun | N/A | N/A |
| 1+1>2: Can Large Language Models Serve as Cross-Lingual Knowledge Aggregators? | Yue Huang, Chenrui Fan, Yuan Li, Siyuan Wu, Tianyi Zhou, Xiangliang Zhang, Lichao Sun | N/A | N/A |
| How to Leverage Demonstration Data in Alignment for Large Language Model? A Self-Imitation Learning Perspective | Teng Xiao, Mingxiao Li, Yige Yuan, Huaisheng Zhu, Chao Cui, Vasant G Honavar | N/A | N/A |
| Style-Specific Neurons for Steering LLMs in Text Style Transfer | Wen Lai, Viktor Hangya, Alexander Fraser | N/A | N/A |
| Adaptive Query Rewriting: Aligning Rewriters through Marginal Probability of Conversational Answers | Tianhua Zhang, Kun Li, Hongyin Luo, Xixin Wu, James R. Glass, Helen M. Meng | N/A | N/A |
| Grasping the Essentials: Tailoring Large Language Models for Zero-Shot Relation Extraction | Sizhe Zhou, Yu Meng, Bowen Jin, Jiawei Han | N/A | N/A |
| DA-Code: Agent Data Science Code Generation Benchmark for Large Language Models | Yiming Huang, Jianwen Luo, Yan Yu, Yitong Zhang, Fangyu Lei, Yifan Wei, Shizhu He, Lifu Huang, Xiao Liu, Jun Zhao, Kang Liu | N/A | N/A |
| Leveraging Context-aware Prompting for Commit Message Generation | Zhihua Jiang, Jianwei Chen, Dongning Rao, Guanghui Ye | N/A | N/A |
| Linguistic Bias in ChatGPT: Language Models Reinforce Dialect Discrimination | Eve Fleisig, Genevieve Smith, Madeline Bossi, Ishita Rustagi, Xavier Yin, Dan Klein | N/A | N/A |
| Lifelong Knowledge Editing for LLMs with Retrieval-Augmented Continuous Prompt Learning | Qizhou Chen, Taolin Zhang, Xiaofeng He, Dongyang Li, Chengyu Wang, Longtao Huang, Hui Xue | N/A | N/A |
| A Learning Rate Path Switching Training Paradigm for Version Updates of Large Language Models | Zhihao Wang, Shiyu Liu, Jianheng Huang, Wang Zheng, YiXuan Liao, Xiaoxin Chen, Junfeng Yao, Jinsong Su | N/A | N/A |
| Zero-Shot Cross-Lingual NER Using Phonemic Representations for Low-Resource Languages | Jimin Sohn, Haeji Jung, Alex Cheng, Jooeon Kang, Yilin Du, David R Mortensen | N/A | N/A |
| An Analysis and Mitigation of the Reversal Curse | Ang Lv, Kaiyi Zhang, Shufang Xie, Quan Tu, Yuhan Chen, Ji-Rong Wen, Rui Yan | N/A | N/A |
| Exploring the Practicality of Generative Retrieval on Dynamic Corpora | Soyoung Yoon, Chaeeun Kim, Hyunji Lee, Joel Jang, Sohee Yang, Minjoon Seo | N/A | N/A |
| OneNet: A Fine-Tuning Free Framework for Few-Shot Entity Linking via Large Language Model Prompting | Xukai Liu, Ye Liu, Kai Zhang, Kehang Wang, Qi Liu, Enhong Chen | N/A | N/A |
| Gotcha! Don’t trick me with unanswerable questions! Self-aligning Large Language Models for Proactively Responding to Unknown Questions | Yang Deng, Yong Zhao, Moxin Li, See-Kiong Ng, Tat-Seng Chua | N/A | N/A |
| Fewer is More: Boosting Math Reasoning with Reinforced Context Pruning | Xijie Huang, Li Lyna Zhang, Kwang-Ting Cheng, Fan Yang, Mao Yang | N/A | N/A |
| Large Language Models in the Clinic: A Comprehensive Benchmark | Fenglin Liu, Zheng Li, Qingyu Yin, Jingfeng Yang, Xianfeng Tang, Chen Luo, Ming Zeng, Haoming Jiang, Yifan Gao, Priyanka Nigam, Sreyashi Nag, Hongjian Zhou, Yining Hua, Xuan Zhou, Omid Rohanian, Anshul Thakur, Lei Clifton, Bing Yin, David A. Clifton | N/A | N/A |
| Holistic Automated Red Teaming for Large Language Models through Top-Down Test Case Generation and Multi-turn Interaction | Jinchuan Zhang, Yan Zhou, Yaxin Liu, Ziming Li, Songlin Hu | N/A | N/A |
| Householder Pseudo-Rotation: A Novel Approach to Activation Editing in LLMs with Direction-Magnitude Perspective | Van-Cuong Pham, Thien Huu Nguyen | N/A | N/A |
| DynamicER: Resolving Emerging Mentions to Dynamic Entities for RAG | Jinyoung Kim, Dayoon Ko, Gunhee Kim | N/A | N/A |
| Preserving Generalization of Language models in Few-shot Continual Relation Extraction | Quyen Tran, Nguyen Xuan Thanh, Nguyen Hoang Anh, Nam Le Hai, Trung Le, Linh Van Ngo, Thien Huu Nguyen | N/A | N/A |
| A Systematic Survey and Critical Review on Evaluating Large Language Models: Challenges, Limitations, and Recommendations | Md Tahmid Rahman Laskar, Sawsan Alqahtani, M Saiful Bari, Mizanur Rahman, Mohammad Abdullah Matin Khan, Haidar Khan, Israt Jahan, Amran Bhuiyan, Chee Wei Tan, Md Rizwan Parvez, Enamul Hoque, Shafiq Joty, Jimmy Huang | N/A | N/A |
| Consecutive Batch Model Editing with HooK Layers | Shuaiyi Li, Yang Deng, Deng Cai, Hongyuan Lu, Liang Chen, Wai Lam | N/A | N/A |
| Topic-Oriented Open Relation Extraction with A Priori Seed Generation | Linyi Ding, Jinfeng Xiao, Sizhe Zhou, Chaoqi Yang, Jiawei Han | N/A | N/A |
| Related Work and Citation Text Generation: A Survey | Xiangci Li, Jessica Ouyang | N/A | N/A |
| Curriculum Consistency Learning for Conditional Sentence Generation | Liangxin Liu, Xuebo Liu, Lian Lian, Shengjun Cheng, Jun Rao, Tengfei Yu, Hexuan Deng, Min Zhang | N/A | N/A |
| A Systematic Analysis of Large Language Models as Soft Reasoners: The Case of Syllogistic Inferences | Leonardo Bertolazzi, Albert Gatt, Raffaella Bernardi | N/A | N/A |
| Pre-training Cross-lingual Open Domain Question Answering with Large-scale Synthetic Supervision | Fan Jiang, Tom Drummond, Trevor Cohn | N/A | N/A |
| Towards an Open-Source Speech Foundation Model for EU: 950,000 Hours of Open-Source Compliant Speech Data for EU Languages | Marco Gaido, Sara Papi, Luisa Bentivogli, Alessio Brutti, Mauro Cettolo, Roberto Gretter, Marco Matassoni, Mohamed Nabih, Matteo Negri | N/A | N/A |
| Improving Knowledge Graph Completion with Structure-Aware Supervised Contrastive Learning | Jiashi Lin, Lifang Wang, Xinyu Lu, Zhongtian Hu, Wei Zhang, Wenxuan Lu | N/A | N/A |
| Contribution of Linguistic Typology to Universal Dependency Parsing: An Empirical Investigation | Ali Basirat, Navid Baradaran Hemmati | N/A | N/A |
| TRoTR: A Framework for Evaluating the Re-contextualization of Text Reuse | Francesco Periti, Pierluigi Cassotti, Stefano Montanelli, Nina Tahmasebi, Dominik Schlechtweg | N/A | N/A |
| Structured Optimal Brain Pruning for Large Language Models | Jiateng Wei, Quan Lu, Ning Jiang, Siqi Li, Jingyang Xiang, Jun Chen, Yong Liu | N/A | N/A |
| Automatically Generated Definitions and their utility for Modeling Word Meaning | Francesco Periti, David Alfter, Nina Tahmasebi | N/A | N/A |
| How Do Your Code LLMs perform? Empowering Code Instruction Tuning with Really Good Data | Yejie Wang, Keqing He, Dayuan Fu, Zhuoma GongQue, Heyang Xu, Yanxu Chen, Zhexu Wang, Yujia Fu, Guanting Dong, Muxi Diao, Jingang Wang, Mengdi Zhang, Xunliang Cai, Weiran Xu | N/A | N/A |
| MINT: A Benchmark for Evaluating Instructed Information Retrieval | Weiwei Sun, Zhengliang Shi, Wu Jiu Long, Lingyong Yan, Xinyu Ma, Yiding Liu, Min Cao, Dawei Yin, Zhaochun Ren | N/A | N/A |
| Rethinking the Evaluation of In-Context Learning for LLMs | Guoxin Yu, Lemao Liu, Mo Yu, Yue Yu, Xiang Ao | N/A | N/A |
| Cluster-Norm for Unsupervised Probing of Knowledge | Walter Laurito, Sharan Maiya, Grégoire Dhimoïla, Owen Ho Wan Yeung, Kaarel Hänni | N/A | N/A |
| Hopping Too Late: Exploring the Limitations of Large Language Models on Multi-Hop Queries | Eden Biran, Daniela Gottesman, Sohee Yang, Mor Geva, Amir Globerson | N/A | N/A |
| Enhancing Training Data Attribution for Large Language Models with Fitting Error Consideration | Kangxi Wu, Liang Pang, Huawei Shen, Xueqi Cheng | N/A | N/A |
| Where am I? Large Language Models Wandering between Semantics and Structures in Long Contexts | Seonmin Koo, Jinsung Kim, YoungJoon Jang, Chanjun Park, Heuiseok Lim | N/A | N/A |
| KARL: Knowledge-Aware Retrieval and Representations aid Retention and Learning in Students | Matthew Shu, Nishant Balepur, Shi Feng, Jordan Lee Boyd-Graber | N/A | N/A |
| Large Language Models Can Be Contextual Privacy Protection Learners | Yijia Xiao, Yiqiao Jin, Yushi Bai, Yue Wu, Xianjun Yang, Xiao Luo, Wenchao Yu, Xujiang Zhao, Yanchi Liu, Quanquan Gu, Haifeng Chen, Wei Wang, Wei Cheng | N/A | N/A |
| A SMART Mnemonic Sounds like “Glue Tonic”: Mixing LLMs with Student Feedback to Make Mnemonic Learning Stick | Nishant Balepur, Matthew Shu, Alexander Hoyle, Alison Robey, Shi Feng, Seraphina Goldfarb-Tarrant, Jordan Lee Boyd-Graber | N/A | N/A |
| Mixture-of-Skills: Learning to Optimize Data Usage for Fine-Tuning Large Language Models | Minghao Wu, Thuy-Trang Vu, Lizhen Qu, Reza Haf | N/A | N/A |
| MolTRES: Improving Chemical Language Representation Learning for Molecular Property Prediction | Jun-Hyung Park, Yeachan Kim, Mingyu Lee, Hyuntae Park, SangKeun Lee | N/A | N/A |
| First Heuristic Then Rational: Dynamic Use of Heuristics in Language Model Reasoning | Yoichi Aoki, Keito Kudo, Tatsuki Kuribayashi, Shusaku Sone, Masaya Taniguchi, Keisuke Sakaguchi, Kentaro Inui | N/A | N/A |
| Tools Fail: Detecting Silent Errors in Faulty Tools | Jimin Sun, So Yeon Min, Yingshan Chang, Yonatan Bisk | N/A | N/A |
| Pcc-tuning: Breaking the Contrastive Learning Ceiling in Semantic Textual Similarity | Bowen Zhang, Chunping Li | N/A | N/A |
| Cross-lingual Back-Parsing: Utterance Synthesis from Meaning Representation for Zero-Resource Semantic Parsing | Deokhyung Kang, Seonjeong Hwang, Yunsu Kim, Gary Lee | N/A | N/A |
| Shaking Up VLMs: Comparing Transformers and Structured State Space Models for Vision & Language Modeling | Georgios Pantazopoulos, Malvina Nikandrou, Alessandro Suglia, Oliver Lemon, Arash Eshghi | N/A | N/A |
| Are LLMs Good Zero-Shot Fallacy Classifiers? | Fengjun Pan, Xiaobao Wu, Zongrui Li, Anh Tuan Luu | N/A | N/A |
| The Mystery of In-Context Learning: A Comprehensive Survey on Interpretation and Analysis | Yuxiang Zhou, Jiazheng Li, Yanzheng Xiang, Hanqi Yan, Lin Gui, Yulan He | N/A | N/A |
| More DWUGs: Extending and Evaluating Word Usage Graph Datasets in Multiple Languages | Dominik Schlechtweg, Pierluigi Cassotti, Bill Noble, David Alfter, Sabine Schulte im Walde, Nina Tahmasebi | N/A | N/A |
| Vision-Language Model Fine-Tuning via Simple Parameter-Efficient Modification | Ming Li, Jike Zhong, Chenxin Li, Liuzhuozheng Li, Nie Lin, Masashi Sugiyama | N/A | N/A |
| ECIS-VQG: Generation of Entity-centric Information-seeking Questions from Videos | Arpan Phukan, Manish Gupta, Asif Ekbal | N/A | N/A |
| Distractor Generation in Multiple-Choice Tasks: A Survey of Methods, Datasets, and Evaluation | Elaf Alhazmi, Quan Z. Sheng, Wei Emma Zhang, Munazza Zaib, Ahoud Alhazmi | N/A | N/A |
| Evaluating $n$-Gram Novelty of Language Models Using Rusty-DAWG | William Merrill, Noah A. Smith, Yanai Elazar | N/A | N/A |
| ASL STEMpedia: Dataset and Benchmark for Interpreting STEM Articles | Kayo Yin, Chinmay Singh, Fyodor O Minakov, Vanessa Milan, Hal Daumé III, Cyril Zhang, Alex Xijie Lu, Danielle Bragg | N/A | N/A |
| Can Automatic Metrics Assess High-Quality Translations? | Sweta Agrawal, António Farinhas, Ricardo Rei, Andre Martins | N/A | N/A |
| Modeling User Preferences with Automatic Metrics: Creating a High-Quality Preference Dataset for Machine Translation | Sweta Agrawal, José G. C. de Souza, Ricardo Rei, António Farinhas, Gonçalo Faria, Patrick Fernandes, Nuno M Guerreiro, Andre Martins | N/A | N/A |
| DC-Instruct: An Effective Framework for Generative Multi-intent Spoken Language Understanding | Bowen Xing, Lizi Liao, Minlie Huang, Ivor Tsang | N/A | N/A |
| KnowTuning: Knowledge-aware Fine-tuning for Large Language Models | Yougang Lyu, Lingyong Yan, Shuaiqiang Wang, Haibo Shi, Dawei Yin, Pengjie Ren, Zhumin Chen, Maarten de Rijke, Zhaochun Ren | N/A | N/A |
| SecCoder: Towards Generalizable and Robust Secure Code Generation | Boyu Zhang, Tianyu Du, Junkai Tong, Xuhong Zhang, Kingsum Chow, Sheng Cheng, Xun Wang, Jianwei Yin | N/A | N/A |
| Nash CoT: Multi-Path Inference with Preference Equilibrium | Ziqi Zhang, Cunxiang Wang, Xiao Xiong, Yue Zhang, Donglin Wang | N/A | N/A |
| Scalable Efficient Training of Large Language Models with Low-dimensional Projected Attention | Xingtai Lv, Ning Ding, Kaiyan Zhang, Ermo Hua, Ganqu Cui, Bowen Zhou | N/A | N/A |
| Small Agent Can Also Rock! Empowering Small Language Models as Hallucination Detector | Xiaoxue Cheng, Junyi Li, Xin Zhao, Hongzhi Zhang, Fuzheng Zhang, Di Zhang, Kun Gai, Ji-Rong Wen | N/A | N/A |
| Interpretable Composition Attribution Enhancement for Visio-linguistic Compositional Understanding | Wei Li, Zhen Huang, Xinmei Tian, Le Lu, Houqiang Li, Xu Shen, Jieping Ye | N/A | N/A |
| LLM Task Interference: An Initial Study on the Impact of Task-Switch in Conversational History | Akash Gupta, Ivaxi Sheth, Vyas Raina, Mark Gales, Mario Fritz | N/A | N/A |
| Social Bias Probing: Fairness Benchmarking for Language Models | Marta Marchiori Manerba, Karolina Stanczak, Riccardo Guidotti, Isabelle Augenstein | N/A | N/A |
| Chain-of-Note: Enhancing Robustness in Retrieval-Augmented Language Models | Wenhao Yu, Hongming Zhang, Xiaoman Pan, Peixin Cao, Kaixin Ma, Jian Li, Hongwei Wang, Dong Yu | N/A | N/A |
| DynaThink: Fast or Slow? A Dynamic Decision-Making Framework for Large Language Models | Jiabao Pan, Yan Zhang, Chen Zhang, Zuozhu Liu, Hongwei Wang, Haizhou Li | N/A | N/A |
| Revisiting Automated Evaluation for Long-form Table Question Answering in the Era of Large Language Models | Yuqi Wang, Lyuhao Chen, Yilun Zhao | N/A | N/A |
| Weak Reward Model Transforms Generative Models into Robust Causal Event Extraction Systems | Italo Luis da Silva, Hanqi Yan, Lin Gui, Yulan He | N/A | N/A |
| Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning | Zhihan Zhang, Tao Ge, Zhenwen Liang, Wenhao Yu, Dian Yu, Mengzhao Jia, Dong Yu, Meng Jiang | N/A | N/A |
| FinDVer: Explainable Claim Verification over Long and Hybrid-content Financial Documents | Yilun Zhao, Yitao Long, Tintin Jiang, Weiyuan Chen, Chengye Wang, Hongjun Liu, Xiangru Tang, Yiming Zhang, Chen Zhao, Arman Cohan | N/A | N/A |
| Extracting Prompts by Inverting LLM Outputs | Collin Zhang, John Xavier Morris, Vitaly Shmatikov | N/A | N/A |
| BiasAlert: A Plug-and-play Tool for Social Bias Detection in LLMs | Zhiting Fan, Ruizhe Chen, Ruiling Xu, Zuozhu Liu | N/A | N/A |
| VHASR: A Multimodal Speech Recognition System With Vision Hotwords | Jiliang Hu, Zuchao Li, Ping Wang, Haojun Ai, Lefei Zhang, Hai Zhao | N/A | N/A |
| A Fundamental Trade-off in Aligned Language Models and its Relation to Sampling Adaptors | Naaman Tan, Josef Valvoda, Tianyu Liu, Anej Svete, Yanxia Qin, Min-Yen Kan, Ryan Cotterell | N/A | N/A |
| Bridging Local Details and Global Context in Text-Attributed Graphs | Yaoke Wang, Yun Zhu, Wenqiao Zhang, Yueting Zhuang, liyunfei, Siliang Tang | N/A | N/A |
| Building Resources for Emakhuwa: Machine Translation and News Classification Benchmarks | Felermino D. M. A. Ali, Henrique Lopes Cardoso, Rui Sousa-Silva | N/A | N/A |
| RepMatch: Quantifying Cross-Instance Similarities in Representation Space | Mohammad Reza Modarres, Sina Abbasi, Mohammad Taher Pilehvar | N/A | N/A |
| Commonsense Knowledge Editing Based on Free-Text in LLMs | Xiusheng Huang, Yequan Wang, Jun Zhao, Kang Liu | N/A | N/A |
| A Closer Look at Multidimensional Online Political Incivility | Sagi Pendzel, Nir Lotan, Alon Zoizner, Einat Minkov | N/A | N/A |
| Leveraging BERT and TFIDF Features for Short Text Clustering via Alignment-Promoting Co-Training | Zetong Li, Qinliang Su, Shijing Si, Jianxing Yu | N/A | N/A |
| Applying Intrinsic Debiasing on Downstream Tasks: Challenges and Considerations for Machine Translation | Bar Iluz, Yanai Elazar, Asaf Yehudai, Gabriel Stanovsky | N/A | N/A |
| Unsupervised Named Entity Disambiguation for Low Resource Domains | Debarghya Datta, Soumajit Pramanik | N/A | N/A |
| SparseGrad: A Selective Method for Efficient Fine-tuning of MLP Layers | Viktoriia A. Chekalina, Anna Rudenko, Gleb Mezentsev, Aleksandr Mikhalev, Alexander Panchenko, Ivan Oseledets | N/A | N/A |
| MoCoKGC: Momentum Contrast Entity Encoding for Knowledge Graph Completion | Qingyang Li, Yanru Zhong, Yuchu Qin | N/A | N/A |
| ActPlan-1K: Benchmarking the Procedural Planning Ability of Visual Language Models in Household Activities | Ying Su, Zhan Ling, Haochen Shi, Cheng Jiayang, Yauwai Yim, Yangqiu Song | N/A | N/A |
| Shortcuts Arising from Contrast: Towards Effective and Lightweight Clean-Label Attacks in Prompt-Based Learning | Xiaopeng Xie, Ming Yan, Xiwen Zhou, Chenlong Zhao, Suli Wang, Yong Zhang, Joey Tianyi Zhou | N/A | N/A |
| GRASS: Compute Efficient Low-Memory LLM Training with Structured Sparse Gradients | Aashiq Muhamed, Oscar Li, David Woodruff, Mona T. Diab, Virginia Smith | N/A | N/A |
| RaTEScore: A Metric for Entity-Aware Radiology Text Similarity | Weike Zhao, Chaoyi Wu, Xiaoman Zhang, Ya Zhang, Weidi Xie | N/A | N/A |
| HalluMeasure: Fine-grained Hallucination Measurement Using Chain-of-Thought Reasoning | Shayan Ali Akbar, Md Mosharaf Hossain, Tess Wood, Si-Chi Chin, Victor Alvarez, Erica M Salinas, Erwin Cornejo | N/A | N/A |
| Learning to Rank Salient Content for Query-focused Summarization | Sajad Sotudeh, Nazli Goharian | N/A | N/A |
| Are Large Language Models Good Classifiers? A Study on Edit Intent Classification in Scientific Document Revisions | Qian Ruan, Ilia Kuznetsov, Iryna Gurevych | N/A | N/A |
| LitSearch: A Retrieval Benchmark for Scientific Literature Search | Anirudh Ajith, Mengzhou Xia, Alexis Chevalier, Tanya Goyal, Danqi Chen, Tianyu Gao | N/A | N/A |
| Open-world Multi-label Text Classification with Extremely Weak Supervision | Xintong Li, Jinya Jiang, Ria Dharmani, Jayanth Srinivasa, Gaowen Liu, Jingbo Shang | N/A | N/A |
| LMs learn governing principles of dynamical systems, revealing an in-context neural scaling law | Toni J.B. Liu, Nicolas Boulle, Raphaël Sarfati, Christopher Earls | N/A | N/A |
| AKEW: Assessing Knowledge Editing in the Wild | Xiaobao Wu, Liangming Pan, William Yang Wang, Anh Tuan Luu | N/A | N/A |
| CopyBench: Measuring Literal and Non-Literal Reproduction of Copyright-Protected Text in Language Model Generation | Tong Chen, Akari Asai, Niloofar Mireshghallah, Sewon Min, James Grimmelmann, Yejin Choi, Hannaneh Hajishirzi, Luke Zettlemoyer, Pang Wei Koh | N/A | N/A |
| Dense X Retrieval: What Retrieval Granularity Should We Use? | Tong Chen, Hongwei Wang, Sihao Chen, Wenhao Yu, Kaixin Ma, Xinran Zhao, Hongming Zhang, Dong Yu | N/A | N/A |
| Decoding Susceptibility: Modeling Misbelief to Misinformation Through a Computational Approach | Yanchen Liu, Mingyu Derek Ma, Wenna Qin, Azure Zhou, Jiaao Chen, Weiyan Shi, Wei Wang, Diyi Yang | N/A | N/A |
| Layer by Layer: Uncovering Where Multi-Task Learning Happens in Instruction-Tuned Large Language Models | Zheng Zhao, Yftah Ziser, Shay B Cohen | N/A | N/A |
| XDetox: Text Detoxification with Token-Level Toxicity Explanations | Beomseok Lee, Hyunwoo Kim, Keon Kim, Yong Suk Choi | N/A | N/A |
| Optimizing Chinese Lexical Simplification Across Word Types: A Hybrid Approach | ZiHao Xiao, Jiefu Gong, Shijin Wang, Wei Song | N/A | N/A |
| Evaluating LLMs’ Capability in Satisfying Lexical Constraints | Bingxuan Li, Yiwei Wang, Tao Meng, Nanyun Peng, Kai-Wei Chang | N/A | N/A |
| Joint Pre-Encoding Representation and Structure Embedding for Efficient and Low-Resource Knowledge Graph Completion | Chenyu Qiu, Pengjiang Qian, Chuang Wang, Jian Yao, Li Liu, Fang wei, Eddie Y.K. Eddie | N/A | N/A |
| Improving Discriminative Capability of Reward Models in RLHF Using Contrastive Learning | Lu Chen, Rui Zheng, Binghai Wang, Senjie Jin, Caishuang Huang, Junjie Ye, Zhihao Zhang, Yuhao Zhou, Zhiheng Xi, Tao Gui, Qi Zhang, Xuanjing Huang | N/A | N/A |
| RoCEL: Advancing Table Entity Linking through Distinctive Row and Column Contexts | Yuanzheng Wang, Yixing Fan, Jiafeng Guo, Ruqing Zhang, Xueqi Cheng | N/A | N/A |
| Exploring the Role of Reasoning Structures for Constructing Proofs in Multi-Step Natural Language Reasoning with Large Language Models | Zi’ou Zheng, Christopher Malon, Martin Renqiang Min, Xiaodan Zhu | N/A | N/A |
| Efficient Overshadowed Entity Disambiguation by Mitigating Shortcut Learning | Panuthep Tasawong, Peerat Limkonchotiwat, Potsawee Manakul, Can Udomcharoenchaikit, Ekapol Chuangsuwanich, Sarana Nutanong | N/A | N/A |
| MetaBench: Planning of Multiple APIs from Various APPs for Complex User Instruction | Hongru WANG, Rui Wang, Boyang XUE, Heming Xia, Jingtao Cao, Zeming Liu, Jeff Z. Pan, Kam-Fai Wong | N/A | N/A |
| Not Everything is All You Need: Toward Low-Redundant Optimization for Large Language Model Alignment | Zhipeng Chen, Kun Zhou, Xin Zhao, Jingyuan Wang, Ji-Rong Wen | N/A | N/A |
| AudioVSR: Enhancing Video Speech Recognition with Audio Data | Xiaoda Yang, Xize Cheng, Jiaqi Duan, Hongshun Qiu, Minjie Hong, Minghui Fang, Shengpeng Ji, Jialong Zuo, Zhiqing Hong, Zhimeng Zhang, Tao Jin | N/A | N/A |
| ECCO: Can We Improve Model-Generated Code Efficiency Without Sacrificing Functional Correctness? | Siddhant Waghjale, Vishruth Veerendranath, Zhiruo Wang, Daniel Fried | N/A | N/A |
| Ladder: A Model-Agnostic Framework Boosting LLM-based Machine Translation to the Next Level | Zhaopeng Feng, Ruizhe Chen, Yan Zhang, Zijie Meng, Zuozhu Liu | N/A | N/A |
| Re-ReST: Reflection-Reinforced Self-Training for Language Agents | Zi-Yi Dou, Cheng-Fu Yang, Xueqing Wu, Kai-Wei Chang, Nanyun Peng | N/A | N/A |
| Effective Synthetic Data and Test-Time Adaptation for OCR Correction | Shuhao Guan, Cheng Xu, Moule Lin, Derek Greene | N/A | N/A |
| SRF: Enhancing Document-Level Relation Extraction with a Novel Secondary Reasoning Framework | Fu Zhang, Qi Miao, Jingwei Cheng, Hongsen Yu, Yi Yan, Xin Li, Yongxue Wu | N/A | N/A |
| FineCops-Ref: A new Dataset and Task for Fine-Grained Compositional Referring Expression Comprehension | Junzhuo Liu, Xuzheng Yang, WEIWEI LI, Peng Wang | N/A | N/A |
| Exploring the Learning Capabilities of Language Models using LEVERWORLDS | Eitan Wagner, Amir Feder, Omri Abend | N/A | N/A |
| CONTESTS: a Framework for Consistency Testing of Span Probabilities in Language Models | Eitan Wagner, Yuli Slavutsky, Omri Abend | N/A | N/A |
| DocEditAgent: Document Structure Editing Via Multimodal LLM Grounding | Manan Suri, Puneet Mathur, Franck Dernoncourt, Rajiv Jain, Vlad I Morariu, Ramit Sawhney, Preslav Nakov, Dinesh Manocha | N/A | N/A |
| DogeRM: Equipping Reward Models with Domain Knowledge through Model Merging | Tzu-Han Lin, Chen-An Li, Hung-yi Lee, Yun-Nung Chen | N/A | N/A |
| Understanding Slang with LLMs: Modelling Cross-Cultural Nuances through Paraphrasing | Ifeoluwa Wuraola, Nina Dethlefs, Daniel Marciniak | N/A | N/A |
| Unlocking Anticipatory Text Generation: A Constrained Approach for Large Language Models Decoding | Lifu Tu, Semih Yavuz, Jin Qu, Jiacheng Xu, Rui Meng, Caiming Xiong, Yingbo Zhou | N/A | N/A |
| Re-Reading Improves Reasoning in Large Language Models | Xiaohan Xu, Chongyang Tao, Tao Shen, Can Xu, Hongbo Xu, Guodong Long, Jian-Guang Lou, Shuai Ma | N/A | N/A |
| Adaptive Axes: A Pipeline for In-domain Social Stereotype Analysis | Qingcheng Zeng, Mingyu Jin, Rob Voigt | N/A | N/A |
| ERVQA: A Dataset to Benchmark the Readiness of Large Vision Language Models in Hospital Environments | Sourjyadip Ray, Kushal Gupta, Soumi Kundu, Dr Payal Arvind Kasat, Somak Aditya, Pawan Goyal | N/A | N/A |
| Human-LLM Hybrid Text Answer Aggregation for Crowd Annotations | Jiyi Li | N/A | N/A |
| Improve Student’s Reasoning Generalizability through Cascading Decomposed CoTs Distillation | Chengwei Dai, Kun Li, Wei Zhou, Songlin Hu | N/A | N/A |
| Revisiting Supervised Contrastive Learning for Microblog Classification | Junbo Huang, Ricardo Usbeck | N/A | N/A |
| BaitAttack: Alleviating Intention Shift in Jailbreak Attacks via Adaptive Bait Crafting | Rui Pu, Chaozhuo Li, Rui Ha, Litian Zhang, Lirong Qiu, Xi Zhang | N/A | N/A |
| Images Speak Louder than Words: Understanding and Mitigating Bias in Vision-Language Model from a Causal Mediation Perspective | Zhaotian Weng, Zijun Gao, Jerone Andrews, Jieyu Zhao | N/A | N/A |
| Mitigating the Language Mismatch and Repetition Issues in LLM-based Machine Translation via Model Editing | Weichuan Wang, Zhaoyi Li, Defu Lian, Chen Ma, Linqi Song, Ying Wei | N/A | N/A |
| SciAgent: Tool-augmented Language Models for Scientific Reasoning | Yubo Ma, Zhibin Gou, Junheng Hao, Ruochen Xu, Shuohang Wang, Liangming Pan, Yujiu Yang, Yixin Cao, Aixin Sun | N/A | N/A |
| Global Reward to Local Rewards: Multimodal-Guided Decomposition for Improving Dialogue Agents | Dong Won Lee, Hae Won Park, Yoon Kim, Cynthia Breazeal, Louis-Philippe Morency | N/A | N/A |
| Towards Measuring and Modeling “Culture” in LLMs: A Survey | Muhammad Farid Adilazuarda, Sagnik Mukherjee, Pradhyumna Lavania, Siddhant Shivdutt Singh, Alham Fikri Aji, Jacki O’Neill, Ashutosh Modi, Monojit Choudhury | N/A | N/A |
| ESC-Eval: Evaluating Emotion Support Conversations in Large Language Models | Haiquan Zhao, Lingyu Li, Shisong Chen, Shuqi Kong, Jiaan Wang, Kexin Huang, Tianle Gu, Yixu Wang, Jian Wang, Liang Dandan, Zhixu Li, Yan Teng, Yanghua Xiao, Yingchun Wang | N/A | N/A |
| Cultural Conditioning or Placebo? On the Effectiveness of Socio-Demographic Prompting | Sagnik Mukherjee, Muhammad Farid Adilazuarda, Sunayana Sitaram, Kalika Bali, Alham Fikri Aji, Monojit Choudhury | N/A | N/A |
| Text Fluoroscopy: Detecting LLM-Generated Text through Intrinsic Features | Xiao Yu, Kejiang Chen, Qi Yang, Weiming Zhang, Nenghai Yu | N/A | N/A |
| Hate Personified: Investigating the role of LLMs in content moderation pipeline for hate speech | Sarah Masud, Sahajpreet Singh, Viktor Hangya, Alexander Fraser, Tanmoy Chakraborty | N/A | N/A |
| Temporally Consistent Factuality Probing for Large Language Models | Ashutosh Bajpai, Aaryan Goyal, Atif Anwer, Tanmoy Chakraborty | N/A | N/A |
| A Comparison of Language Modeling and Translation as Multilingual Pretraining Objectives | Zihao Li, Shaoxiong Ji, Timothee Mickus, Vincent Segonne, Jörg Tiedemann | N/A | N/A |
| Can LLMs replace Neil deGrasse Tyson? Evaluating the Reliability of LLMs as Science Communicators | Prasoon Bajpai, Niladri Chatterjee, Subhabrata Dutta, Tanmoy Chakraborty | N/A | N/A |
| LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-Training | Tong Zhu, Xiaoye Qu, Daize Dong, Jiacheng Ruan, Jingqi Tong, Conghui He, Yu Cheng | N/A | N/A |
| Themis: A Reference-free NLG Evaluation Language Model with Flexibility and Interpretability | Xinyu Hu, Li Lin, Mingqi Gao, Xunjian Yin, Xiaojun Wan | N/A | N/A |
| Mitigating Training Imbalance in LLM Fine-Tuning via Selective Parameter Merging | Yiming Ju, Ziyi Ni, Xingrun Xing, Zhixiong Zeng, hanyu Zhao, Siqi Fan, Zheng Zhang | N/A | N/A |
| Generating Demonstrations for In-Context Compositional Generalization in Grounded Language Learning | Sam Spilsbury, Pekka Marttinen, Alexander Ilin | N/A | N/A |
| FAME: Factual Multi-task Model Editing Benchmark | Li Zeng, Yingyu Shan, Zeming Liu, Jiashu Yao, Yuhang Guo | N/A | N/A |
| MLLM-Protector: Ensuring MLLM’s Safety without Hurting Performance | Renjie Pi, Tianyang Han, Jianshu Zhang, Yueqi XIE, Rui Pan, Qing LIAN, Hanze Dong, Jipeng Zhang, Tong Zhang | N/A | N/A |
| Leveraging Large Language Models for NLG Evaluation: Advances and Challenges | Zhen Li, Xiaohan Xu, Tao Shen, Can Xu, Jia-Chen Gu, Yuxuan Lai, Chongyang Tao, Shuai Ma | N/A | N/A |
| InfiniPot: Infinite Context Processing on Memory-Constrained LLMs | Minsoo Kim, Kyuhong Shim, Jungwook Choi, Simyung Chang | N/A | N/A |
| VideoCLIP-XL: Advancing Long Description Understanding for Video CLIP Models | Jiapeng Wang, Chengyu Wang, Kunzhe Huang, Jun Huang, Lianwen Jin | N/A | N/A |
| CorrSynth - A Correlated Sampling Method for Diverse dataset Generation from LLMs | Abhishek Divekar, Suhas S Kowshik, Vijit Malik | N/A | N/A |
| Defining Knowledge: Bridging Epistemology and Large Language Models | Constanza Fierro, Ruchira Dhar, Filippos Stamatiou, Nicolas Garneau, Anders Søgaard | N/A | N/A |
| TKGT: Redefinition and A New Way of Text-to-Table Tasks Based on Real World Demands and Knowledge Graphs Augmented LLMs | Peiwen Jiang, Zibo Zhao, Xinbo Lin, Ruhui Ma, Yvonne Jie Chen, Jinhua Cheng | N/A | N/A |
| Free your mouse! Command Large Language Models to Generate Code to Format Word Documents | Shihao Rao, Liang Li, Jiapeng Liu, Guan Weixin, Xiyan Gao, bing lim | N/A | N/A |
| CMR Scaling Law: Predicting Critical Mixture Ratios for Continual Pre-training of Language Models | Jiawei Gu, Zacc Yang, Chuanghao Ding, Rui Zhao, Fei Tan | N/A | N/A |
| The Instinctive Bias: Spurious Images lead to Hallucination in MLLMs | Tianyang Han, Qing LIAN, Rui Pan, Renjie Pi, Jipeng Zhang, Shizhe Diao, Yong Lin, Tong Zhang | N/A | N/A |
| Rationale-Aware Answer Verification by Pairwise Self-Evaluation | Akira Kawabata, Saku Sugawara | N/A | N/A |
| On the Robustness of Editing Large Language Models | Xinbei Ma, Tianjie Ju, Jiyang Qiu, Zhuosheng Zhang, hai zhao, lifeng Liu, Yulong Wang | N/A | N/A |
| IM-BERT: Enhancing Robustness of BERT through the Implicit Euler Method | MiHyeon Kim, Juhyoung Park, YoungBin Kim | N/A | N/A |
| Distract Large Language Models for Automatic Jailbreak Attack | Zeguan Xiao, Yan Yang, Guanhua Chen, Yun Chen | N/A | N/A |
| Exploring Space Efficiency in a Tree-based Linear Model for Extreme Multi-label Classification | He-Zhe Lin, Cheng-Hung Liu, Chih-Jen Lin | N/A | N/A |
| WorryWords: Norms of Anxiety Association for 44,450 English Words | Saif M. Mohammad | N/A | N/A |
| Finding Blind Spots in Evaluator LLMs with Interpretable Checklists | Sumanth Doddapaneni, Mohammed Safi Ur Rahman Khan, Sshubam Verma, Mitesh M Khapra | N/A | N/A |
| LONGAGENT: Achieving Question Answering for 128k-Token-Long Documents through Multi-Agent Collaboration | Jun Zhao, Can Zu, Xu Hao, Yi Lu, Wei He, Yiwen Ding, Tao Gui, Qi Zhang, Xuanjing Huang | N/A | N/A |
| AutoPersuade: A Framework for Evaluating and Explaining Persuasive Arguments | Till Raphael Saenger, Musashi Hinck, Justin Grimmer, Brandon M. Stewart | N/A | N/A |
| Towards Cross-Cultural Machine Translation with Retrieval-Augmented Generation from Multilingual Knowledge Graphs | Simone Conia, Daniel Lee, Min Li, Umar Farooq Minhas, Saloni Potdar, Yunyao Li | N/A | N/A |
| Exploring the Compositional Deficiency of Large Language Models in Mathematical Reasoning Through Trap Problems | Jun Zhao, Jingqi Tong, Yurong Mou, Ming Zhang, Qi Zhang, Xuanjing Huang | N/A | N/A |
| Scaling Laws for Linear Complexity Language Models | Xuyang Shen, Dong Li, Ruitao Leng, Zhen Qin, Weigao Sun, Yiran Zhong | N/A | N/A |
| Autoregressive Multi-trait Essay Scoring via Reinforcement Learning with Scoring-aware Multiple Rewards | Heejin Do, Sangwon Ryu, Gary Lee | N/A | N/A |
| Intrinsic Self-correction for Enhanced Morality: An Analysis of Internal Mechanisms and the Superficial Hypothesis | Guangliang Liu, Haitao Mao, Jiliang Tang, Kristen Johnson | N/A | N/A |
| ATAP: Automatic Template-Augmented Commonsense Knowledge Graph Completion via Pre-Trained Language Models | Fu Zhang, Yifan Ding, Jingwei Cheng | N/A | N/A |
| LM2: A Simple Society of Language Models Solves Complex Reasoning | Gurusha Juneja, Subhabrata Dutta, Tanmoy Chakraborty | N/A | N/A |
| Towards a Semantically-aware Surprisal Theory | Clara Meister, Mario Giulianelli, Tiago Pimentel | N/A | N/A |
| Multi-Level Information Retrieval Augmented Generation for Knowledge-based Visual Question Answering | Adjali Omar, Olivier Ferret, Sahar Ghannay, Hervé Le Borgne | N/A | N/A |
| Can We Trust the Performance Evaluation of Uncertainty Estimation Methods in Text Summarization? | Jianfeng He, Runing Yang, Linlin Yu, Changbin Li, Ruoxi Jia, Feng Chen, Ming Jin, Chang-Tien Lu | N/A | N/A |
| Is It Really Long Context if All You Need Is Retrieval? Towards Genuinely Difficult Long Context NLP | Omer Goldman, Alon Jacovi, Aviv Slobodkin, Aviya Maimon, Ido Dagan, Reut Tsarfaty | N/A | N/A |
| BPE Gets Picky: Efficient Vocabulary Refinement During Tokenizer Training | Pavel Chizhov, Catherine Arnett, Elizaveta Korotkova, Ivan P. Yamshchikov | N/A | N/A |
| SEGMENT+: Long Text Processing with Short-Context Language Models | Wei Shi, Shuang Li, Kerun Yu, Jinglei Chen, Zujie Liang, Xinhui Wu, Yuxi Qian, Feng Wei, Bo Zheng, Jiaqing Liang, Jiangjie Chen, Yanghua Xiao | N/A | N/A |
| Explicit Memory Learning with Expectation Maximization | Zhangyue Yin, Qiushi Sun, Qipeng Guo, Zhiyuan Zeng, Qinyuan Cheng, Xipeng Qiu, Xuanjing Huang | N/A | N/A |
| Learning to Generate Writing Feedback via Language Model Simulated Student Revisions | Inderjeet Jayakumar Nair, Jiaye Tan, Xiaotian Su, Anne Gere, Xu Wang, Lu Wang | N/A | N/A |
| Small LLMs Are Weak Tool Learners: A Multi-LLM Agent | Weizhou Shen, Chenliang Li, Hongzhan Chen, Ming Yan, Xiaojun Quan, Hehong Chen, Ji Zhang, Fei Huang | N/A | N/A |
| Interpreting Context Look-ups in Transformers: Investigating Attention-MLP Interactions | Clement Neo, Shay B Cohen, Fazl Barez | N/A | N/A |
| Still Not Quite There! Assessing Large Language Models for Comorbid Mental Health Diagnosis | Amey Hengle, Atharva Kulkarni, Shantanu Deepak Patankar, Rashmi Gupta | N/A | N/A |
| The Odyssey of Commonsense Causality: From Foundational Benchmarks to Cutting-Edge Reasoning | Shaobo Cui, Zhijing Jin, Bernhard Schölkopf, Boi Faltings | N/A | N/A |
| Investigating Large Language Models for Complex Word Identification in Multilingual and Multidomain Setups | Răzvan-Alexandru Smădu, David-Gabriel ION, Dumitru-Clementin Cercel, Florin Pop, Mihaela-Claudia Cercel | N/A | N/A |
| Model Editing Harms General Abilities of Large Language Models: Regularization to the Rescue | Jia-Chen Gu, Hao-Xiang Xu, Jun-Yu Ma, Pan Lu, Zhen-Hua Ling, Kai-Wei Chang, Nanyun Peng | N/A | N/A |
| Are Large Language Models In-Context Personalized Summarizers? Get an iCOPERNICUS Test Done! | Divya Patel, Pathik Patel, Ankush Chander, Sourish Dasgupta, Tanmoy Chakraborty | N/A | N/A |
| MediTOD: An English Dialogue Dataset for Medical History Taking with Comprehensive Annotations | Vishal Vivek Saley, Goonjan Saha, Rocktim Jyoti Das, Dinesh Raghu, Mausam | N/A | N/A |
| YesBut | Abhilash Nandy, Yash Agarwal, Ashish Patwa, Millon Madhur Das, Aman Bansal, ANKIT RAJ, Pawan Goyal, Niloy Ganguly | N/A | N/A |
| Scaling Cognitive Limits: Identifying Working Memory Limits in LLMs | Chunhui Zhang, Yiren Jian, Zhongyu Ouyang, Soroush Vosoughi | N/A | N/A |
| RAFT: Realistic Attacks to Fool Text Detectors | James Liyuan Wang, Ran Li, Junfeng Yang, Chengzhi Mao | N/A | N/A |
| LLM-Evolve: Evaluation for LLM’s Evolving Capability on Benchmarks | Jiaxuan You, Mingjie Liu, Shrimai Prabhumoye, Mostofa Patwary, Mohammad Shoeybi, Bryan Catanzaro | N/A | N/A |
| FFN-SkipLLM: A Hidden Gem for Autoregressive Decoding with Adaptive Feed Forward Skipping | AJAY KUMAR JAISWAL, Bodun Hu, Lu Yin, Yeonju Ro, Tianlong Chen, Shiwei Liu, Aditya Akella | N/A | N/A |
| LLM-based Code-Switched Text Generation for Grammatical Error Correction | Tom Potter, Zheng Yuan | N/A | N/A |
| Deciphering the Interplay of Parametric and Non-Parametric Memory in RAG Models | Mehrdad Farahani, Richard Johansson | N/A | N/A |
| On Efficient Language and Vision Assistants for Visually-Situated Natural Language Understanding: What Matters in Reading and Reasoning | Geewook Kim, Minjoon Seo | N/A | N/A |
| Community-Cross-Instruct: Unsupervised Instruction Generation for Aligning Large Language Models to Online Communities | Zihao He, Rebecca Dorn, Minh Duc Chu, Siyi Guo, Kristina Lerman | N/A | N/A |
| Mathador-LM: A Dynamic Benchmark for Mathematical Reasoning on Large Language Models | Eldar Kurtic, Amir Moeini, Dan Alistarh | N/A | N/A |
| Reasoning Paths with Reference Objects Elicit Quantitative Spatial Reasoning in Large Vision-Language Models | Yuan-Hong Liao, Rafid Mahmood, Sanja Fidler, David Acuna | N/A | N/A |
| One Thousand and One Pairs: A “novel” challenge for long-context language models | Marzena Karpinska, Katherine Thai, Kyle Lo, Tanya Goyal, Mohit Iyyer | N/A | N/A |
| Foundational Autoraters: Taming Large Language Models for Better Automatic Evaluation | Tu Vu, Kalpesh Krishna, Salaheddin Alzubi, Chris Tar, Manaal Faruqui, Yun-Hsuan Sung | N/A | N/A |
| Do LLMs learn a true syntactic universal? | John T. Hale, Miloš Stanojević | N/A | N/A |
| GDPO: Learning to Align Language Models with Diversity Using GFlowNets | Oh Joon Kwon, Daiki E. Matsunaga, Kee-Eung Kim | N/A | N/A |
| How Susceptible are Large Language Models to Ideological Manipulation? | Kai Chen, Zihao He, Jun Yan, Taiwei Shi, Kristina Lerman | N/A | N/A |
| Measuring Psychological Depth in Language Models | Fabrice Y Harel-Canada, Hanyu Zhou, Sreya Muppalla, Zeynep Senahan Yildiz, Miryung Kim, Nanyun Peng, Amit Sahai | N/A | N/A |
| Media Attitude Detection via Framing Analysis with Events and their Relations | Jin Zhao, Jingxuan Tu, Han Du, Nianwen Xue | N/A | N/A |
| Fill In The Gaps: Model Calibration and Generalization with Synthetic Data | Yang Ba, Michelle V Mancenido, Rong Pan | N/A | N/A |
| Adaptive Question Answering: Enhancing Language Model Proficiency for Addressing Knowledge Conflicts with Source Citations | Sagi Shaier, Ari Kobren, Philip V. Ogren | N/A | N/A |
| Granular Privacy Control for Geolocation with Vision Language Models | Ethan Mendes, Yang Chen, James Hays, Sauvik Das, Wei Xu, Alan Ritter | N/A | N/A |
| MedReadMe: A Systematic Study for Fine-grained Sentence Readability in Medical Domain | Chao Jiang, Wei Xu | N/A | N/A |
| MemeCLIP: Leveraging CLIP Representations for Multimodal Meme Classification | Siddhant Bikram Shah, Shuvam Shiwakoti, Maheep Chaudhary, Haohan Wang | N/A | N/A |
| FlipGuard: Defending Preference Alignment against Update Regression with Constrained Optimization | Mingye Zhu, Yi Liu, Quan Wang, Junbo Guo, Zhendong Mao | N/A | N/A |
| StorySpark: Expert-Annotated QA Pairs with Real-World Knowledge for Children Storytelling | Jiaju Chen, Yuxuan Lu, Shao Zhang, Bingsheng Yao, Yuanzhe Dong, Ying Xu, Yunyao Li, Qianwen Wang, Dakuo Wang, Yuling Sun | N/A | N/A |
| MedCoT: Medical Chain of Thought via Hierarchical Expert | Jiaxiang Liu, Yuan Wang, Jiawei Du, Joey Tianyi Zhou, Zuozhu Liu | N/A | N/A |
| Varying Sentence Representations via Condition-Specified Routers | Ziyong Lin, Quansen Wang, Zixia Jia, Zilong Zheng | N/A | N/A |
| Inductive-Deductive Strategy Reuse for Multi-Turn Instructional Dialogues | Jiao Ou, jiayu wu, Che Liu, Fuzheng Zhang, Di ZHANG, Kun Gai | N/A | N/A |
| Information Flow Routes: Automatically Interpreting Language Models at Scale | Javier Ferrando, Elena Voita | N/A | N/A |
| A Simple yet Effective Training-free Prompt-free Approach to Chinese Spelling Correction Based on Large Language Models | Houquan Zhou, Zhenghua Li, Bo Zhang, Chen Li, Shaopeng Lai, Ji Zhang, Fei Huang, Min Zhang | N/A | N/A |
| Low-rank Subspace for Binding in Large Language Models | Qin Dai, Benjamin Heinzerling, Kentaro Inui | N/A | N/A |
| CoSafe: Evaluating Large Language Model Safety in Multi-Turn Dialogue Coreference | Erxin Yu, Jing Li, Ming Liao, Siqi Wang, GAO Zuchen, Fei Mi, Lanqing HONG | N/A | N/A |
| ClimRetrieve: A Benchmarking Dataset for Information Retrieval from Corporate Climate Disclosures | Tobias Schimanski, Jingwei Ni, Roberto Spacey Martín, Nicola Ranger, Markus Leippold | N/A | N/A |
| Context-Aware Adapter Tuning for Few-Shot Relation Learning in Knowledge Graphs | LIU Ran, Zhongzhou Liu, Xiaoli Li, Yuan Fang | N/A | N/A |
| Zero-Shot Detection of LLM-Generated Text using Token Cohesiveness | Shixuan Ma, Quan Wang | N/A | N/A |
| Dual-oriented Disentangled Network with Counterfactual Intervention for Multimodal Intent Detection | Zhanpeng Chen, Zhihong Zhu, Xianwei Zhuang, Zhiqi Huang, Yuexian Zou | N/A | N/A |
| From LLMs to MLLMs: Exploring the Landscape of Multimodal Jailbreaking | Siyuan Wang, Zhuohan Long, Zhihao Fan, zhongyu wei | N/A | N/A |
| Symbolic Working Memory Enhances Language Models for Complex Rule Application | Siyuan Wang, zhongyu wei, Yejin Choi, Xiang Ren | N/A | N/A |
| LLoCO: Learning Long Contexts Offline | Sijun Tan, Xiuyu Li, Shishir G Patil, Ziyang Wu, Tianjun Zhang, Kurt Keutzer, Joseph E. Gonzalez, Raluca Popa | N/A | N/A |
| Don’t Forget Your Reward Values: Language Model Alignment via Value-based Calibration | Xin Mao, Feng-Lin Li, Huimin Xu, Wei Zhang, WANG CHEN, Anh Tuan Luu | N/A | N/A |
| Mentor-KD: Making Small Language Models Better Multi-step Reasoners | Hojae Lee, Junho Kim, SangKeun Lee | N/A | N/A |
| Are Large Language Models Capable of Generating Human-Level Narratives? | Yufei Tian, Tenghao Huang, Miri Liu, Derek Jiang, Alexander Spangher, Muhao Chen, Jonathan May, Nanyun Peng | N/A | N/A |
| MP2D: An Automated Topic Shift Dialogue Generation Framework Leveraging Knowledge Graphs | Yerin Hwang, Yongil Kim, Yunah Jang, Jeesoo Bang, Hyunkyung Bae, Kyomin Jung | N/A | N/A |
| Can Large Language Models Enhance Predictions of Disease Progression? Investigating Through Disease Network Link Prediction | Haohui Lu, Usman Naseem | N/A | N/A |
| Searching for Best Practices in Retrieval-Augmented Generation | Xiaohua Wang, Zhenghua Wang, Xuan Gao, Feiran Zhang, Yixin Wu, Zhibo Xu, Tianyuan Shi, Zhengyuan Wang, Shizheng Li, Qi Qian, Ruicheng Yin, Changze Lv, Xiaoqing Zheng, Xuanjing Huang | N/A | N/A |
| Moral Foundations of Large Language Models | Marwa Abdulhai, Gregory Serapio-García, Clement CREPY, Daria Valter, John Canny, Natasha Jaques | N/A | N/A |
| The Zeno’s Paradox of ‘Low-Resource’ Languages | Hellina Hailu Nigatu, Atnafu Lambebo Tonja, Benjamin Rosman, Thamar Solorio, Monojit Choudhury | N/A | N/A |
| Knowledge Planning in Large Language Models for Domain-Aligned Counseling Summarization | Aseem Srivastava, Smriti Joshi, Tanmoy Chakraborty, Md Shad Akhtar | N/A | N/A |
| Enhancing Post-Hoc Attributions in Long Document Comprehension via Coarse Grained Answer Decomposition | Pritika Ramu, Koustava Goswami, Apoorv Saxena, Balaji Vasan Srinivasan | N/A | N/A |
| From Descriptive Richness to Bias: Unveiling the Dark Side of Generative Image Caption Enrichment | Yusuke Hirota, Ryo Hachiuma, Chao-Han Huck Yang, Yuta Nakashima | N/A | N/A |
| Pruning via Merging: Compressing LLMs via Manifold Alignment Based Layer Merging | Deyuan Liu, Zhanyue Qin, Hairu Wang, Zhao Yang, Zecheng Wang, Fangying Rong, Qingbin Liu, Yanchao Hao, Bo Li, Xi Chen, Cunhang Fan, Zhao Lv, Dianhui Chu, Zhiying Tu, Dianbo Sui | N/A | N/A |
| Embedded Named Entity Recognition using Probing Classifiers | Nicholas Popovic, Michael Färber | N/A | N/A |
| Unleashing the Power of Emojis in Texts via Self-supervised Graph Pre-Training | Zhou Zhang, Dongzeng Tan, Jiaan Wang, Yilong Chen, Jiarong Xu | N/A | N/A |
| Data Contamination Can Cross Language Barriers | Feng Yao, Yufan Zhuang, Zihao Sun, Sunan Xu, Animesh Kumar, Jingbo Shang | N/A | N/A |
| Automated Essay Scoring: A Reflection on the State of the Art | Shengjie Li, Vincent Ng | N/A | N/A |
| Encouraging Divergent Thinking in Large Language Models through Multi-Agent Debate | Tian Liang, Zhiwei He, Wenxiang Jiao, Xing Wang, Yan Wang, Rui Wang, Yujiu Yang, Shuming Shi, Zhaopeng Tu | N/A | N/A |
| Unveiling and Consulting Core Experts in Retrieval-Augmented MoE-based LLMs | Xin Zhou, Ping Nie, Yiwen Guo, Haojie Wei, Zhanqiu Zhang, Pasquale Minervini, Ruotian Ma, Tao Gui, Qi Zhang, Xuanjing Huang | N/A | N/A |
| CURE: Context- and Uncertainty-Aware Mental Disorder Detection | Migyeong Kang, goun choi, Hyolim Jeon, Ji hyun An, Daejin Choi, Jinyoung Han | N/A | N/A |
| PepRec: Progressive Enhancement of Prompting for Recommendation | Yakun Yu, Shi-ang Qi, Baochun Li, Di Niu | N/A | N/A |
| In-Context Compositional Generalization for Large Vision-Language Models | Chuanhao Li, Chenchen Jing, Zhen Li, Mingliang Zhai, Yuwei Wu, Yunde Jia | N/A | N/A |
| Improving Zero-shot LLM Re-Ranker with Risk Minimization | Xiaowei Yuan, Zhao Yang, Yequan Wang, Jun Zhao, Kang Liu | N/A | N/A |
| Game on Tree: Visual Hallucination Mitigation via Coarse-to-Fine View Tree and Game Theory | Xianwei Zhuang, Zhihong Zhu, Zhanpeng Chen, Yuxin Xie, Liming Liang, Yuexian Zou | N/A | N/A |
| Label Confidence Weighted Learning for Target-level Sentence Simplification | Jingshen Zhang, Xin Ying Qiu | N/A | N/A |
| Quantum Recurrent Architectures for Text Classification | Wenduan Xu, Stephen Clark, Douglas Brown, Gabriel Matos, Konstantinos Meichanetzidis | N/A | N/A |
| Tree of Problems: Improving structured problem solving with compositionality | Armel Randy Zebaze, Benoît Sagot, Rachel Bawden | N/A | N/A |
| What the Harm? Quantifying the Tangible Impact of Gender Bias in Machine Translation with a Human-centered Study | Beatrice Savoldi, Sara Papi, Matteo Negri, Ana Guerberof-Arenas, Luisa Bentivogli | N/A | N/A |
| Seg2Act: Global Context-aware Action Generation for Document Logical Structuring | Zichao Li, Shaojie He, Meng Liao, Xuanang Chen, Yaojie Lu, Hongyu Lin, Yanxiong Lu, Xianpei Han, Le Sun | N/A | N/A |
| Is C4 Dataset Enough for Pruning? An Investigation of Calibration Data for LLM Pruning | Abhinav Bandari, Lu Yin, Cheng-Yu Hsieh, AJAY KUMAR JAISWAL, Tianlong Chen, Li Shen, Ranjay Krishna, Shiwei Liu | N/A | N/A |
| Revisiting the Robustness of Watermarking to Paraphrasing Attacks | Saksham Rastogi, Danish Pruthi | N/A | N/A |
| A Survey of Ontology Expansion for Conversational Understanding | Jinggui Liang, Yuxia Wu, Yuan Fang, Hao Fei, Lizi Liao | N/A | N/A |
| Calibrating Language Models with Adaptive Temperature Scaling | Johnathan Xie, Annie S Chen, Yoonho Lee, Eric Mitchell, Chelsea Finn | N/A | N/A |
| Which Programming Language and What Features at Pre-training Stage Affect Downstream Logical Inference Performance? | Fumiya Uchiyama, Takeshi Kojima, Andrew Gambardella, Qi Cao, Yusuke Iwasawa, Yutaka Matsuo | N/A | N/A |
| Why do objects have many names? A study on word informativeness in language use and lexical systems. | Eleonora Gualdoni, Gemma Boleda | N/A | N/A |
| Dual-Space Knowledge Distillation for Large Language Models | Songming Zhang, Xue Zhang, Zengkui Sun, Yufeng Chen, Jinan Xu | N/A | N/A |
| NoiseBench: Benchmarking the Impact of Real Label Noise on Named Entity Recognition | Elena Merdjanovska, Ansar Aynetdinov, Alan Akbik | N/A | N/A |
| On the Universal Truthfulness Hyperplane Inside LLMs | Junteng Liu, Shiqi Chen, Yu Cheng, Junxian He | N/A | N/A |
| PairDistill: Pairwise Relevance Distillation for Dense Retrieval | Chao-Wei Huang, Yun-Nung Chen | N/A | N/A |
| User Inference Attacks on Large Language Models | Nikhil Kandpal, Krishna Pillutla, Alina Oprea, Peter Kairouz, Christopher A. Choquette-Choo, Zheng Xu | N/A | N/A |
| HiFT: A Hierarchical Full Parameter Fine-Tuning Strategy | YongKang Liu, Yiqun Zhang, Qian Li, Tong Liu, Shi Feng, Daling Wang, Yifei Zhang, Hinrich Schuetze | N/A | N/A |
| Investigating and Mitigating Object Hallucinations in Pretrained Vision-Language (CLIP) Models | Yufang Liu, Tao Ji, Changzhi Sun, Yuanbin Wu, Aimin Zhou | N/A | N/A |
| Simultaneous Masking, Not Prompting Optimization: A Paradigm Shift in Fine-tuning LLMs for Simultaneous Translation | Matthew Raffel, Victor Agostinelli, Lizhong Chen | N/A | N/A |
| ToolPlanner: A Tool Augmented LLM for Multi Granularity Instructions with Path Planning and Feedback | Qinzhuo Wu, Wei Liu, Jian Luan, Bin Wang | N/A | N/A |
| Please note that I’m just an AI: Analysis of Behavior Patterns of LLMs in (Non-)offensive Speech Identification | Esra Dönmez, Thang Vu, Agnieszka Falenska | N/A | N/A |
| How to Compute the Probability of a Word | Tiago Pimentel, Clara Meister | N/A | N/A |
| A linguistically-motivated evaluation methodology for unraveling model’s abilities in reading comprehension tasks | Elie Antoine, Frederic Bechet, Géraldine Damnati, Philippe Langlais | N/A | N/A |
| GuardBench: A Large-Scale Benchmark for Guardrail Models | Elias Bassani, Ignacio Sanchez | N/A | N/A |
| Generate-on-Graph: Treat LLM as both Agent and KG for Incomplete Knowledge Graph Question Answering | Yao Xu, Shizhu He, Jiabei Chen, Zihao Wang, Yangqiu Song, Hanghang Tong, Guang Liu, Jun Zhao, Kang Liu | N/A | N/A |
| Language models and brains align due to more than next-word prediction and word-level information | Gabriele Merlin, Mariya Toneva | N/A | N/A |
| LLMEdgeRefine: Enhancing Text Clustering with LLM-Based Boundary Point Refinement | Zijin Feng, Luyang Lin, Lingzhi Wang, Hong Cheng, Kam-Fai Wong | N/A | N/A |
| CasiMedicos-Arg: A Medical Question Answering Dataset Annotated with Explanatory Argumentative Structures | Ekaterina Sviridova, Anar Yeginbergen, Ainara Estarrona, Elena Cabrio, Serena Villata, Rodrigo Agerri | N/A | N/A |
| A Simple and Effective $L_2$ Norm-Based Strategy for KV Cache Compression | Alessio Devoto, Yu Zhao, Simone Scardapane, Pasquale Minervini | N/A | N/A |
| GOME: Grounding-based Metaphor Binding With Conceptual Elaboration For Figurative Language Illustration | Linhao Zhang, Jintao Liu, Li Jin, Hao Wang, kaiwen wei, Guangluan Xu | N/A | N/A |
| D3CODE: Disentangling Disagreements in Data across Cultures on Offensiveness Detection and Evaluation | Aida Mostafazadeh Davani, Mark Diaz, Dylan K Baker, Vinodkumar Prabhakaran | N/A | N/A |
| PALM: Few-Shot Prompt Learning for Audio Language Models | Asif Hanif, Maha Tufail Agro, Mohammad Areeb Qazi, Hanan Aldarmaki | N/A | N/A |
| Annotator-Centric Active Learning for Subjective NLP Tasks | Michiel van der Meer, Neele Falk, Pradeep K. Murukannaiah, Enrico Liscio | N/A | N/A |
| Lost in Tokenization: How to Measure Word Surprisal From LM Token Probabilities | Luca Malagutti, Juan Luis Gastaldi, Brian DuSell, Tim Vieira, Ryan Cotterell, Mario Giulianelli | N/A | N/A |
| Enhanced Hallucination Detection in Neural Machine Translation through Simple Detector Aggregation | Anas Himmi, Guillaume Staerman, Marine Picot, Pierre Colombo, Nuno M Guerreiro | N/A | N/A |
| Jailbreaking LLMs with Arabic Transliteration and Arabizi | Mansour Al Ghanim, Saleh Almohaimeed, Mengxin Zheng, Yan Solihin, Qian Lou | N/A | N/A |
| Who is better at math, Jenny or Jingzhen? Uncovering Stereotypes in Large Language Models | Zara Siddique, Liam Turner, Luis Espinosa-Anke | N/A | N/A |
| Instruction Matters, a Simple yet Effective Task Selection Approach in Instruction Tuning for Specific Tasks | Changho Lee, Janghoon Han, Seonghyeon Ye, Stanley Jungkyu Choi, Honglak Lee, Kyunghoon Bae | N/A | N/A |
| Recurrent Alignment with Hard Attention for Hierarchical Text Rating | Chenxi Lin, Ren Jiayu, Guoxiu He, Zhuoren Jiang, Haiyan Yu, Xiaomin Zhu | N/A | N/A |
| CHESS: Optimizing LLM Inference via Channel-Wise Thresholding and Selective Sparsification | Junhui He, Shangyu Wu, Weidong Wen, Chun Jason Xue, Qingan Li | N/A | N/A |
| Semformer: Transformer Language Models with Semantic Planning | Yongjing Yin, Junran Ding, Kai Song, Yue Zhang | N/A | N/A |
| DocCGen: Document-based Controlled Code Generation | Sameer Pimparkhede, Mehant Kammakomati, Srikanth G. Tamilselvam, Prince Kumar, Ashok Pon Kumar, Pushpak Bhattacharyya | N/A | N/A |
| Semantics and Sentiment: Cross-lingual Variations in Emoji Use | Giulio Zhou, Sydelle de Souza, Ella Markham, Oghenetekevwe Kwakpovwe, Sumin Zhao | N/A | N/A |
| The Emergence of Compositional Languages in Multi-entity Referential Games: from Image to Graph Representations | Daniel Akkerman, Phong Le, Raquel G. Alhama | N/A | N/A |
| Transformers are Multi-State RNNs | Matanel Oren, Michael Hassid, Nir Yarden, Yossi Adi, Roy Schwartz | N/A | N/A |
| Evaluating Large Language Models along Dimensions of Language Variation: A Systematik Invesdigatiom uv Cross-lingual Generalization | Niyati Bafna, Kenton Murray, David Yarowsky | N/A | N/A |
| Fuse to Forget: Bias Reduction and Selective Memorization through Model Fusion | Kerem Zaman, Leshem Choshen, Shashank Srivastava | N/A | N/A |
| Collective Critics for Creative Story Generation | Minwook Bae, Hyounghun Kim | N/A | N/A |
| Surprisal Curves of Discourse | Eleftheria Tsipidi, Franz Nowak, Ryan Cotterell, Ethan Wilcox, Mario Giulianelli, Alex Warstadt | N/A | N/A |
| Model-based Preference Optimization in Abstractive Summarization without Human Feedback | Jaepill Choi, Kyubyung Chae, Jiwoo Song, Yohan Jo, Taesup Kim | N/A | N/A |
| Are Data Augmentation Methods in Named Entity Recognition Applicable for Uncertainty Estimation? | Wataru Hashimoto, Hidetaka Kamigaito, Taro Watanabe | N/A | N/A |
| NeuroTrialNER: An Annotated Corpus for Neurological Diseases and Therapies in Clinical Trial Registries | Simona Emilova Doneva, Tilia Ellendorff, Jean-Philippe Goldman, Amelia Elaine Cannon, Gerold Schneider, Beate Sick, Benjamin Victor Ineichen | N/A | N/A |
| Do Explanations Help or Hurt? Saliency Maps vs Natural Language Explanations in a Clinical Decision-Support Setting | Maxime Guillaume Kayser, Bayar Menzat, Cornelius Emde, Bogdan Alexandru Bercean, Alex Novak, Abdalá Trinidad Espinosa Morgado, Bartlomiej Papiez, Susanne Gaube, Thomas Lukasiewicz, Oana-Maria Camburu | N/A | N/A |
| Towards Faithful Knowledge Graph Explanation Through Deep Alignment in Commonsense Question Answering | Weihe Zhai, Arkaitz Zubiaga, Bingquan Liu, Chengjie Sun, Yalong Zhao | N/A | N/A |
| Generation with Dynamic Vocabulary | Yanting Liu, Tao Ji, Yuanbin Wu, Xiaoling Wang, Changzhi Sun | N/A | N/A |
| Argument Relation Classification through Discourse Markers and Adversarial Training | Michele Luca Contalbo, Francesco Guerra, Matteo Paganelli | N/A | N/A |
| Getting The Most Out of Your Training Data: Exploring Unsupervised Tasks for Morphological Inflection | Abhishek Purushothama, Adam Wiemerslage, Katharina von der Wense | N/A | N/A |
| Link, Synthesize, Retrieve: Universal Document Linking for Zero-Shot Information Retrieval | Dae Yon Hwang, Bilal Taha, Harshit Pande, Yaroslav Nechaev | N/A | N/A |
| Efficient Unseen Language Adaptation for Multilingual Pre-Trained Language Models | Po-Heng Chen, Yun-Nung Chen | N/A | N/A |
| Prove Your Point!: Bringing Proof-Enhancement Principles to Argumentative Essay Generation | Ruiyu Xiao, Lei Wu, Yuhang Gou, Weinan Zhang, Ting Liu | N/A | N/A |
| TV-TREES: Multimodal Entailment Trees for Neuro-Symbolic Video Reasoning | Kate Sanders, Nathaniel Weir, Benjamin Van Durme | N/A | N/A |
| Unsupervised Extraction of Dialogue Policies from Conversations | Makesh Narsimhan Sreedhar, Traian Rebedea, Christopher Parisien | N/A | N/A |
| GRIZAL: Generative Prior-guided Zero-Shot Temporal Action Localization | Onkar Kishor Susladkar, Gayatri Sudhir Deshmukh, Vandan Gorade, Sparsh Mittal | N/A | N/A |
| Preserving Multi-Modal Capabilities of Pre-trained VLMs for Improving Vision-Linguistic Compositionality | Youngtaek Oh, Jae Won Cho, Dong-Jin Kim, In So Kweon, Junmo Kim | N/A | N/A |
| FoodieQA: A Multimodal Dataset for Fine-Grained Understanding of Chinese Food Culture | Wenyan Li, Crystina Zhang, Jiaang Li, Qiwei Peng, Raphael Tang, Li Zhou, Weijia Zhang, Guimin Hu, Yifei Yuan, Anders Søgaard, Daniel Hershcovich, Desmond Elliott | N/A | N/A |
| A Two-Step Approach for Data-Efficient French Pronunciation Learning | Hoyeon Lee, Hyeeun Jang, Jonghwan Kim, Jaemin Kim | N/A | N/A |
| Exploring Intra and Inter-language Consistency in Embeddings with ICA | Rongzhi Li, Takeru Matsuda, Hitomi Yanaka | N/A | N/A |
| DetoxLLM: A Framework for Detoxification with Explanations | Md Tawkat Islam Khondaker, Muhammad Abdul-Mageed, Laks V. S. Lakshmanan | N/A | N/A |
| Building a Multi-Platform, BERT Classifier for Detecting Connective Language | Josephine Lukito, Bin Chen, Gina M. Masullo, Natalie Jomini Stroud | N/A | N/A |
| ShadowLLM: Predictor-based Contextual Sparsity for Large Language Models | Yash Akhauri, Ahmed F AbouElhamayed, Jordan Dotzel, Zhiru Zhang, Alexander M Rush, Safeen Huda, Mohamed S Abdelfattah | N/A | N/A |
| Emotion Granularity from Text: An Aggregate-Level Indicator of Mental Health | Krishnapriya Vishnubhotla, Daniela Teodorescu, Mallory J Feldman, Kristen Lindquist, Saif M. Mohammad | N/A | N/A |
| BLSP-Emo: Towards Empathetic Large Speech-Language Models | Chen Wang, Minpeng Liao, Zhongqiang Huang, Junhong Wu, Chengqing Zong, Jiajun Zhang | N/A | N/A |
| SynthesizRR: Generating Diverse Datasets with Retrieval Augmentation | Abhishek Divekar, Greg Durrett | N/A | N/A |
| Multimodal Self-Instruct: Synthetic Abstract Image and Visual Reasoning Instruction Using Language Model | Wenqi Zhang, Zhenglin Cheng, Yuanyu He, Mengna Wang, Yongliang Shen, Zeqi Tan, Guiyang Hou, Mingqian He, Yanna Ma, Weiming Lu, Yueting Zhuang | N/A | N/A |
| DataNarrative: Automated Data-Driven Storytelling with Visualizations and Texts | Mohammed Saidul Islam, Md Tahmid Rahman Laskar, Md Rizwan Parvez, Enamul Hoque, Shafiq Joty | N/A | N/A |
| DEM: Distribution Edited Model for Training with Mixed Data Distributions | Dhananjay Ram, Aditya Rawal, Momchil Hardalov, Nikolaos Pappas, Sheng Zha | N/A | N/A |
| Altogether: Image Captioning via Re-aligning Alt-text | Hu Xu, Po-Yao Huang, Xiaoqing Tan, Ching-Feng Yeh, Jacob Kahn, Christine Jou, Gargi Ghosh, Omer Levy, Luke Zettlemoyer, Wen-tau Yih, Shang-Wen Li, Saining Xie, Christoph Feichtenhofer | N/A | N/A |
| VerifyMatch: A Semi-Supervised Learning Paradigm for Natural Language Inference with Confidence-Aware MixUp | Seo Yeon Park, Cornelia Caragea | N/A | N/A |
| CaT-Bench: Benchmarking Language Model Understanding of Causal and Temporal Dependencies in Plans | Yash Kumar Lal, Vanya Cohen, Nathanael Chambers, Niranjan Balasubramanian, Ray Mooney | N/A | N/A |
| Mitigating the Impact of Reference Quality on Evaluation of Summarization Systems with Reference-Free Metrics | Théo Gigant, Camille Guinaudeau, Marc Decombas, Frederic Dufaux | N/A | N/A |
| An Empirical Analysis of the Writing Styles of Persona-Assigned LLMs | Manuj Malik, Jing Jiang, Kian Ming A. Chai | N/A | N/A |
| Investigating the Role of Instruction Variety and Task Difficulty in Robotic Manipulation Tasks | Amit Parekh, Nikolas Vitsakis, Alessandro Suglia, Ioannis Konstas | N/A | N/A |
| GPT vs RETRO: Exploring the Intersection of Retrieval and Parameter-Efficient Fine-Tuning | Aleksander Ficek, Jiaqi Zeng, Oleksii Kuchaiev | N/A | N/A |
| CoCoST: Automatic Complex Code Generation with Online Searching and Correctness Testing | Xinyi He, Jiaru Zou, Yun Lin, Mengyu Zhou, Shi Han, Zejian Yuan, Dongmei Zhang | N/A | N/A |
| Sequential API Function Calling Using GraphQL Schema | Avirup Saha, Lakshmi Mandal, Balaji Ganesan, Sambit Ghosh, Renuka Sindhgatta, Carlos Eberhardt, Dan Debrunner, Sameep Mehta | N/A | N/A |
| The Illusion of Competence: Evaluating the Effect of Explanations on Users’ Mental Models of Visual Question Answering Systems | Judith Sieker, Simeon Junker, Ronja Utescher, Nazia Attari, Heiko Wersing, Hendrik Buschmeier, Sina Zarrieß | N/A | N/A |
| Re-Evaluating Evaluation for Multilingual Summarization | Jessica Zosa Forde, Ruochen Zhang, Lintang Sutawika, Alham Fikri Aji, Samuel Cahyawijaya, Genta Indra Winata, Minghao Wu, Carsten Eickhoff, Stella Biderman, Ellie Pavlick | N/A | N/A |
| Video-Text Prompting for Weakly Supervised Spatio-Temporal Video Grounding | Heng Zhao, Zhao Yinjie, Bihan Wen, Yew-Soon Ong, Joey Tianyi Zhou | N/A | N/A |
| A Fast and Sound Tagging Method for Discontinuous Named-Entity Recognition | Caio Filippo Corro | N/A | N/A |
| Factuality of Large Language Models in the Year 2024 | Yuxia Wang, Minghan Wang, Muhammad Arslan Manzoor, Fei Liu, Georgi Nenkov Georgiev, Rocktim Jyoti Das, Preslav Nakov | N/A | N/A |
| Discovering Biases in Information Retrieval Models Using Relevance Thesaurus as Global Explanation | Youngwoo Kim, Razieh Rahimi, James Allan | N/A | N/A |
| Adaptable Moral Stances of Large Language Models on Sexist Content: Implications for Society and Gender Discourse | Rongchen Guo, Isar Nejadgholi, Hillary Dawkins, Kathleen C. Fraser, Svetlana Kiritchenko | N/A | N/A |
| DISCERN: Decoding Systematic Errors in Natural Language for Text Classifiers | Rakesh R Menon, Shashank Srivastava | N/A | N/A |
| IntCoOp: Interpretability-Aware Vision-Language Prompt Tuning | Soumya Suvra Ghosal, Samyadeep Basu, Soheil Feizi, Dinesh Manocha | N/A | N/A |
| Scope-enhanced Compositional Semantic Parsing for DRT | Xiulin Yang, Jonas Groschwitz, Alexander Koller, Johan Bos | N/A | N/A |
| The Generation Gap: Exploring Age Bias Underlying in the Value Systems of Large Language Models | Siyang Liu, Trisha Maturi, Bowen Yi, Siqi Shen, Rada Mihalcea | N/A | N/A |
| TempoFormer: A Transformer for Temporally-aware Representations in Change Detection | Talia Tseriotou, Adam Tsakalidis, Maria Liakata | N/A | N/A |
| Pron vs Prompt: Can Large Language Models already Challenge a World-Class Fiction Author at Creative Text Writing? | Guillermo Marco, Julio Gonzalo, M. Teresa Mateo-Girona, Ramón del Castillo Santos | N/A | N/A |
| Evaluating Diversity in Automatic Poetry Generation | Yanran Chen, Hannes Gröner, Sina Zarrieß, Steffen Eger | N/A | N/A |
| Evaluating Short-Term Temporal Fluctuations of Social Biases in Social Media Data and Masked Language Models | Yi Zhou, Danushka Bollegala, Jose Camacho-Collados | N/A | N/A |
| Delving into Qualitative Implications of Synthetic Data for Hate Speech Detection | Camilla Casula, Sebastiano Vecellio Salto, Alan Ramponi, Sara Tonelli | N/A | N/A |
| Grounding Language in Multi-Perspective Referential Communication | Zineng Tang, Lingjun Mao, Alane Suhr | N/A | N/A |
| Threshold-driven Pruning with Segmented Maximum Term Weights for Approximate Cluster-based Sparse Retrieval | Yifan Qiao, Parker Carlson, Shanxiu He, Yingrui Yang, Tao Yang | N/A | N/A |
| Error Analysis of Multilingual Language Models in Machine Translation for Low-resource Languages: A Case Study of Amharic to English Bi-directional Machine Translation | Hizkiel Mitiku Alemayehu, Hamada M Zahera, Axel-Cyrille Ngonga Ngomo | N/A | N/A |
| MIPD: Exploring Manipulation and Intention In a Novel Corpus of Polish Disinformation | Arkadiusz Modzelewski, Giovanni Da San Martino, Pavel Savov, Magdalena Anna Wilczyńska, Adam Wierzbicki | N/A | N/A |
| Unsupervised Discrete Representations of American Sign Language | Artem Abzaliev, Rada Mihalcea | N/A | N/A |
| Perceptions to Beliefs: Exploring Precursory Inferences for Theory of Mind in Large Language Models | Chani Jung, Dongkwan Kim, Jiho Jin, Jiseon Kim, Yeon Seonwoo, Yejin Choi, Alice Oh, Hyunwoo Kim | N/A | N/A |
| Towards Enhancing Coherence in Extractive Summarization: Dataset and Experiments with LLMs | Mihir Parmar, Hanieh Deilamsalehy, Franck Dernoncourt, Seunghyun Yoon, Ryan A. Rossi, Trung Bui | N/A | N/A |
| Jump Starting Bandits with LLM-Generated Prior Knowledge | Parand A. Alamdari, Yanshuai Cao, Kevin H. Wilson | N/A | N/A |
| Adaptation Odyssey in LLMs: Why Does Additional Pretraining Sometimes Fail to Improve? | Fırat Öncel, Matthias Bethge, Beyza Ermis, Mirco Ravanelli, Cem Subakan, Çağatay Yıldız | N/A | N/A |
| Not All Contexts Are Equal: Teaching LLMs Credibility-aware Generation | Ruotong Pan, Boxi Cao, Hongyu Lin, Xianpei Han, Jia Zheng, Sirui Wang, Xunliang Cai, Le Sun | N/A | N/A |
| Virtual Personas for Language Models via an Anthology of Backstories | Suhong Moon, Marwa Abdulhai, Minwoo Kang, Joseph Suh, Widyadewi Soedarmadji, Eran Kohen Behar, David Chan | N/A | N/A |
| Step-by-Step Reasoning to Solve Grid Puzzles: Where do LLMs Falter? | Nemika Tyagi, Mihir Parmar, Mohith Kulkarni, Aswin RRV, Nisarg Patel, Mutsumi Nakamura, Arindam Mitra, Chitta Baral | N/A | N/A |
| Reasoning in Token Economies: Budget-Aware Evaluation of LLM Reasoning Strategies | Junlin Wang, Siddhartha Jain, Dejiao Zhang, Baishakhi Ray, Varun Kumar, Ben Athiwaratkun | N/A | N/A |
| The Empirical Variability of Narrative Perceptions of Social Media Texts | Joel Mire, Maria Antoniak, Elliott Ash, Andrew Piper, Maarten Sap | N/A | N/A |
| Which questions should I answer? Salience Prediction of Inquisitive Questions | Yating Wu, Ritika Rajesh Mangla, Alex Dimakis, Greg Durrett, Junyi Jessy Li | N/A | N/A |
| Revealing Personality Traits: A New Benchmark Dataset for Explainable Personality Recognition on Dialogues | Lei Sun, Jinming Zhao, Qin Jin | N/A | N/A |
| Continual Test-time Adaptation for End-to-end Speech Recognition on Noisy Speech | Guan-Ting Lin, Wei Ping Huang, Hung-yi Lee | N/A | N/A |
| Whiteboard-of-Thought: Thinking Step-by-Step Across Modalities | Sachit Menon, Richard Zemel, Carl Vondrick | N/A | N/A |
| CodeJudge: Evaluating Code Generation with Large Language Models | Weixi Tong, Tianyi Zhang | N/A | N/A |
| Self-Training Large Language and Vision Assistant for Medical | Guohao Sun, Can Qin, Huazhu Fu, Linwei Wang, Zhiqiang Tao | N/A | N/A |
| SYNFAC-EDIT: Synthetic Imitation Edit Feedback for Factual Alignment in Clinical Summarization | Prakamya Mishra, Zonghai Yao, Parth Vashisht, Feiyun Ouyang, Beining Wang, Vidhi Dhaval Mody, Hong Yu | N/A | N/A |
| Defending Jailbreak Prompts via In-Context Adversarial Game | Yujun Zhou, Yufei Han, Haomin Zhuang, Kehan Guo, Zhenwen Liang, Hongyan Bao, Xiangliang Zhang | N/A | N/A |
| Detecting Online Community Practices with Large Language Models: A Case Study of Pro-Ukrainian Publics on Twitter | Kateryna Kasianenko, Shima Khanehzar, Stephen Wan, Ehsan Dehghan, Axel Bruns | N/A | N/A |
| Multilingual Topic Classification in X: Dataset and Analysis | Dimosthenis Antypas, Asahi Ushio, Francesco Barbieri, Jose Camacho-Collados | N/A | N/A |
| MT-Eval: A Multi-Turn Capabilities Evaluation Benchmark for Large Language Models | Wai-Chung Kwan, Xingshan Zeng, Yuxin Jiang, Yufei Wang, Liangyou Li, Lifeng Shang, Xin Jiang, Qun Liu, Kam-Fai Wong | N/A | N/A |
| Updating CLIP to Prefer Descriptions Over Captions | Amir Zur, Elisa Kreiss, Karel D’Oosterlinck, Christopher Potts, Atticus Geiger | N/A | N/A |
| CmdCaliper: A Semantic-Aware Command-Line Embedding Model and Dataset for Security Research | Sian-Yao Huang, Cheng-Lin Yang, Che-Yu Lin, Chun-Ying Huang | N/A | N/A |
| Back to School: Translation Using Grammar Books | Jonathan Hus, Antonios Anastasopoulos | N/A | N/A |
| VIEWS: Entity-Aware News Video Captioning | Hammad Ayyubi, Tianqi Liu, Arsha Nagrani, Xudong Lin, Mingda Zhang, Anurag Arnab, Feng Han, Yukun Zhu, Xuande Feng, Kevin Zhang, Jialu Liu, Shih-Fu Chang | N/A | N/A |
| Towards Aligning Language Models with Textual Feedback | Saüc Abadal Lloret, Shehzaad Dhuliawala, Keerthiram Murugesan, Mrinmaya Sachan | N/A | N/A |
| ATPO: Automatic Tree-Structured Prompt Optimization | Sheng Yang, Yurong Wu, Yan Gao, Zineng Zhou, Xiaodi Sun, Bin Benjamin Zhu, Jian-Guang Lou, Zhiming Ding, Anbang Hu, Yuan Fang, Yunsong Li, Junyan Chen, Linjun Yang | N/A | N/A |
| DeMPT: Decoding-enhanced Multi-phase Prompt Tuning for Making LLMs Be Better Context-aware Translators | Xinglin Lyu, Junhui Li, Yanqing Zhao, Min Zhang, Daimeng Wei, Shimin Tao, Hao Yang, Min Zhang | N/A | N/A |
| DEFT-UCS: Data Efficient Fine-Tuning for Pre-Trained Language Models via Unsupervised Core-Set Selection | Devleena Das, Vivek Khetan | N/A | N/A |
| Unveiling Multi-level and Multi-modal Semantic Representations in the Human Brain using Large Language Models | Yuko Nakagi, Takuya Matsuyama, Naoko Koide-Majima, Hiroto Q. Yamaguchi, Rieko Kubo, Shinji Nishimoto, Yu Takagi | N/A | N/A |
| “They are uncultured”: Unveiling Covert Harms and Social Threats in LLM Generated Conversations | Preetam Prabhu Srikar Dammu, Hayoung Jung, Anjali Singh, Monojit Choudhury, Tanu Mitra | N/A | N/A |
| Multi-expert Prompting Improves Reliability, Safety and Usefulness of Large Language Models | Do Xuan Long, Duong Ngoc Yen, Anh Tuan Luu, Kenji Kawaguchi, Min-Yen Kan, Nancy F. Chen | N/A | N/A |
| Will LLMs Replace the Encoder-Only Models in Temporal Relation Classification? | Gabriel Roccabruna, Massimo Rizzoli, Giuseppe Riccardi | N/A | N/A |
| Eliciting In-Context Learning in Vision-Language Models for Videos Through Curated Data Distributional Properties | Keunwoo Peter Yu, Zheyuan Zhang, Fengyuan Hu, Shane Storks, Joyce Chai | N/A | N/A |
| Framework for Robust and Scalable Text Watermarking | Gregory Kang Ruey Lau, Xinyuan Niu, Hieu Dao, Jiangwei Chen, Chuan-Sheng Foo, Bryan Kian Hsiang Low | N/A | N/A |
| MASIVE: Open-Ended Affective State Identification in English and Spanish | Nicholas Deas, Elsbeth Turcan, Ivan Ernesto Perez Mejia, Kathleen McKeown | N/A | N/A |
| You Make me Feel like a Natural Question: Training QA Systems on Transformed Trivia Questions | Tasnim Kabir, Yoo Yeon Sung, Saptarashmi Bandyopadhyay, Hao Zou, Abhranil Chandra, Jordan Lee Boyd-Graber | N/A | N/A |
| AlphaExpert: Assigning LoRA Experts Based on Layer Training Quality | Peijun Qing, Chongyang Gao, Yefan Zhou, Xingjian Diao, Yaoqing Yang, Soroush Vosoughi | N/A | N/A |
| Flee the Flaw: Annotating the Underlying Logic of Fallacious Arguments Through Templates and Slot-filling | Irfan Robbani, Paul Reisert, Surawat Pothong, Naoya Inoue, Camélia Guerraoui, Wenzhi Wang, Shoichi Naito, Jungmin Choi, Kentaro Inui | N/A | N/A |
| Advancing Social Intelligence in AI Agents: Technical Challenges and Open Questions | Leena Mathur, Paul Pu Liang, Louis-Philippe Morency | N/A | N/A |
| RAt: Injecting Implicit Bias for Text-To-Image Prompt Refinement Models | Ziyi Kou, Shichao Pei, Meng Jiang, Xiangliang Zhang | N/A | N/A |
| Can LLM Generate Culturally Relevant Commonsense QA Data? Case Study in Indonesian and Sundanese | Rifki Afina Putri, Faiz Ghifari Haznitrama, Dea Adhista, Alice Oh | N/A | N/A |
| Learnability of Indirect Evidence in Language Models | Miyu Oba, Yohei Oseki, Akiyo Fukatsu, Akari Haga, Hiroki Ouchi, Taro Watanabe, Saku Sugawara | N/A | N/A |
| Do LLMs Know to Respect Copyright Notice? | Jialiang Xu, Shenglan Li, Zhaozhuo Xu, Denghui Zhang | N/A | N/A |
| SpecHub: Provable Acceleration to Multi-Draft Speculative Decoding | Hanchi Sun, Tianyi Zhou, Xun Chen, Lichao Sun | N/A | N/A |
| Interventional Speech Noise Injection for ASR Generalizable Spoken Language Understanding | YeonJoon Jung, Jaeseong Lee, Seungtaek Choi, Dohyeon Lee, Minsoo Kim, Seung-won Hwang | N/A | N/A |
| Rethinking the Role of Proxy Rewards in Language Model Alignment | Sungdong Kim, Minjoon Seo | N/A | N/A |
| Visual Text Matters: Improving Text-KVQA with Visual Text Entity Knowledge-aware Large Multimodal Assistant | Abhirama Subramanyam Penamakuri, Anand Mishra | N/A | N/A |
| How Good is my MT Metric? A Framework for the Interpretation of Metric Assessments | Stefano Perrella, Lorenzo Proietti, Pere-Lluís Huguet Cabot, Edoardo Barba, Roberto Navigli | N/A | N/A |
| IFCap: Image-like Retrieval and Frequency-based Entity Filtering for Zero-shot Captioning | Soeun Lee, Si-Woo Kim, Taewhan Kim, Dong-Jin Kim | N/A | N/A |
| SPREADSHEETLLM: Encoding Spreadsheets for Large Language Models | Haoyu Dong, Jianbo Zhao, Yuzhang Tian, Junyu Xiong, Shiyu Xia, Mengyu Zhou, Yun Lin, José Cambronero, Yeye He, Shi Han, Dongmei Zhang | N/A | N/A |
| Let’s discuss! Quality Dimensions and Annotated Datasets for Computational Argument Quality | Rositsa V Ivanova, Thomas Huber, Christina Niklaus | N/A | N/A |
| Automatic sentence segmentation of clinical record narratives in real-world data | Dongfang Xu, Davy Weissenbacher, Karen O’Connor, Siddharth Rawal, Graciela Gonzalez Hernandez | N/A | N/A |
| One-to-Many Communication and Compositionality in Emergent Communication | Heeyoung Lee | N/A | N/A |
| Bayesian Example Selection Improves In-Context Learning for Speech, Text, and Visual Modalities | Siyin Wang, Chao-Han Huck Yang, Ji Wu, Chao Zhang | N/A | N/A |
| Investigating Multilingual Instruction-Tuning: Do Polyglot Models Demand for Multilingual Instructions? | Alexander Arno Weber, Klaudia Thellmann, Jan Ebert, Nicolas Flores-Herr, Jens Lehmann, Michael Fromm, Mehdi Ali | N/A | N/A |
| Multi-LogiEval: Towards Evaluating Multi-Step Logical Reasoning Ability of Large Language Models | Nisarg Patel, Mohith Kulkarni, Mihir Parmar, Aashna Budhiraja, Mutsumi Nakamura, Neeraj Varshney, Chitta Baral | N/A | N/A |
| Contrastive Classification via Linear Layer Extrapolation | Mayukh Sharma, Sean O’Brien, Julian McAuley | N/A | N/A |
| Task Oriented In-Domain Data Augmentation | Xiao Liang, Xinyu Hu, Simiao Zuo, Yeyun Gong, Qiang Lou, Yi Liu, Shao-Lun Huang, Jian Jiao | N/A | N/A |
| SciDQA: A Deep Reading Comprehension Dataset over Scientific Papers | Shruti Singh, Nandan Sarkar, Arman Cohan | N/A | N/A |
| Mixture-of-Modules: Reinventing Transformers as Dynamic Assemblies of Modules | Zhuocheng Gong, Ang Lv, Jian Guan, Wei Wu, Huishuai Zhang, Minlie Huang, Dongyan Zhao, Rui Yan | N/A | N/A |
| No Culture Left Behind: ArtELingo-28, a Benchmark of WikiArt with Captions in 28 Languages | Youssef Mohamed, Runjia Li, Ibrahim Said Ahmad, Kilichbek Haydarov, Philip Torr, Kenneth Church, Mohamed Elhoseiny | N/A | N/A |
| PREDICT: Multi-Agent-based Debate Simulation for Generalized Hate Speech Detection | Someen Park, Jaehoon Kim, Seungwan Jin, Sohyun Park, Kyungsik Han | N/A | N/A |
| TokenVerse: Unifying Speech and NLP Tasks via Transducer-based ASR | Shashi Kumar, Srikanth Madikeri, Juan Pablo Zuluaga Gomez, Iuliia Thorbecke, Esaú Villatoro-Tello, Sergio Burdisso, Petr Motlicek, Karthik Pandia D S, Aravind Ganapathiraju | N/A | N/A |
| ApiQ: Finetuning of 2-Bit Quantized Large Language Model | Baohao Liao, Christian Herold, Shahram Khadivi, Christof Monz | N/A | N/A |
| Memorize Step by Step: Efficient Long-Context Prefilling with Incremental Memory and Decremental Chunk | Zhiyuan Zeng, Qipeng Guo, Xiaoran Liu, Zhangyue Yin, Wentao Shu, Mianqiu Huang, Bo Wang, Yunhua Zhou, Linlin Li, Qun Liu, Xipeng Qiu | N/A | N/A |
| A Morphology-Based Investigation of Positional Encodings | Poulami Ghosh, Shikhar Vashishth, Raj Dabre, Pushpak Bhattacharyya | N/A | N/A |
| I love pineapple on pizza != I hate pineapple on pizza: Stance-Aware Sentence Transformers for Opinion Mining | Vahid Ghafouri, Jose M. Such, Guillermo Suarez-Tangil | N/A | N/A |
| BiasWipe: Mitigating Unintended Bias in Text Classifiers through Model Interpretability | Mamta Mamta, Rishikant Chigrupaatii, Asif Ekbal | N/A | N/A |
| ArMeme: Propagandistic Content in Arabic Memes | Firoj Alam, Abul Hasnat, Fatema Ahmad, Md. Arid Hasan, Maram Hasanain | N/A | N/A |
| Language is Scary when Over-Analyzed: Unpacking Implied Misogynistic Reasoning with Argumentation Theory-Driven Prompts | Arianna Muti, Federico Ruggeri, Khalid Al Khatib, Alberto Barrón-Cedeño, Tommaso Caselli | N/A | N/A |
| Thoughts to Target: Enhance Planning for Target-driven Conversation | Zhonghua Zheng, Lizi Liao, Yang Deng, Ee-Peng Lim, Minlie Huang, Liqiang Nie | N/A | N/A |
| Scalable Data Ablation Approximations for Language Models through Modular Training and Merging | Clara Na, Ian Magnusson, Ananya Harsh Jha, Tom Sherborne, Emma Strubell, Jesse Dodge, Pradeep Dasigi | N/A | N/A |
| Exploring Intrinsic Language-specific Subspaces in Fine-tuning Multilingual Neural Machine Translation | Zhe Cao, Zhi Qu, Hidetaka Kamigaito, Taro Watanabe | N/A | N/A |
| Attention Score is not All You Need for Token Importance Indicator in KV Cache Reduction: Value Also Matters | Zhiyu Guo, Hidetaka Kamigaito, Taro Watanabe | N/A | N/A |
| Generative Subgraph Retrieval for Knowledge Graph–Grounded Dialog Generation | Jinyoung Park, Minseok Joo, Joo-Kyung Kim, Hyunwoo J. Kim | N/A | N/A |
| Adapters Mixup: Mixing Parameter-Efficient Adapters to Enhance the Adversarial Robustness of Fine-tuned Pre-trained Text Classifiers | Tuc Van Nguyen, Thai Le | N/A | N/A |
| Generalizing Clinical De-identification Models by Privacy-safe Data Augmentation using GPT-4 | Woojin Kim, Sungeun Hahm, Jaejin Lee | N/A | N/A |
| Connecting the Dots: Evaluating Abstract Reasoning Capabilities of LLMs Using the New York Times Connections Word Game | Prisha Samdarshi, Mariam Mustafa, Anushka Kulkarni, Raven Rothkopf, Tuhin Chakrabarty, Smaranda Muresan | N/A | N/A |
| GottBERT: a pure German Language Model | Raphael Scheible, Johann Frei, Fabian Thomczyk, Henry He, Patric Tippmann, Jochen Knaus, Victor Jaravine, Frank Kramer, Martin Boeker | N/A | N/A |
| Computational Meme Understanding: A Survey | Khoi P. N. Nguyen, Vincent Ng | N/A | N/A |
| CoverICL: Selective Annotation for In-Context Learning via Active Graph Coverage | Costas Mavromatis, Balasubramaniam Srinivasan, Zhengyuan Shen, Jiani Zhang, Huzefa Rangwala, Christos Faloutsos, George Karypis | N/A | N/A |
| Retrieval-enriched zero-shot image classification in low-resource domains | Nicola Dall’Asen, Yiming Wang, Enrico Fini, Elisa Ricci | N/A | N/A |
| I-AM-G: Interest Augmented Multimodal Generator for Item Personalization | Xianquan Wang, Likang Wu, Shukang Yin, Zhi Li, Yanjiang Chen, hufeng, Yu Su, Qi Liu | N/A | N/A |
| Twists, Humps, and Pebbles: Multilingual Speech Recognition Models Exhibit Gender Performance Gaps | Giuseppe Attanasio, Beatrice Savoldi, Dennis Fucci, Dirk Hovy | N/A | N/A |
| Enhancing Language Model Alignment: A Confidence-Based Approach to Label Smoothing | Baihe Huang, Hiteshi Sharma, Yi Mao | N/A | N/A |
| Contrastive Policy Gradient: Aligning LLMs on sequence-level scores in a supervised-friendly fashion | Yannis Flet-Berliac, Nathan Grinsztajn, Florian Strub, Eugene Choi, Bill Wu, Chris Cremer, Arash Ahmadian, Yash Chandak, Mohammad Gheshlaghi Azar, Olivier Pietquin, Matthieu Geist | N/A | N/A |
| Show and Guide: Instructional-Plan Grounded Vision and Language Model | Diogo Glória-Silva, David Semedo, Joao Magalhaes | N/A | N/A |
| Beyond Turn-Based Interfaces: Synchronous LLMs as Full-Duplex Dialogue Agents | Bandhav Veluri, Benjamin N Peloquin, Bokai Yu, Hongyu Gong, Shyamnath Gollakota | N/A | N/A |
| QuBE: Question-based Belief Enhancement for Agentic LLM | Minsoo Kim, Jongyoon Kim, Jihyuk Kim, Seung-won Hwang | N/A | N/A |
| COMPACT: Compressing Retrieved Documents Actively for Question Answering | Chanwoong Yoon, Taewhoo Lee, Hyeon Hwang, Minbyul Jeong, Jaewoo Kang | N/A | N/A |
| An Empirical Analysis on Spatial Reasoning Capabilities of Large Multimodal Models | Fatemeh Shiri, Xiao-Yu Guo, Mona Golestan Far, Xin Yu, Reza Haf, Yuan-Fang Li | N/A | N/A |
| Synthetic Knowledge Ingestion: Towards Knowledge Refinement and Injection for Enhancing Large Language Models | Jiaxin Zhang, Wendi Cui, Yiran Huang, Kamalika Das, Sricharan Kumar | N/A | N/A |
| Local Contrastive Editing of Gender Stereotypes | Marlene Lutz, Rochelle Choenni, Markus Strohmaier, Anne Lauscher | N/A | N/A |
| De-Identification of Sensitive Personal Data in Datasets Derived from IIT-CDIP | Stefan Larson, Nicole Cornehl Lima, Santiago Pedroza Diaz, Amogh Manoj Joshi, Siddharth Betala, Jamiu Tunde Suleiman, Yash Mathur, Kaushal Kumar Prajapati, Ramla Alakraa, Junjie Shen, Temi Okotore, Kevin Leach | N/A | N/A |
| RAR: Retrieval Augmented Retrieval for Code Generation in Low Resource Languages | Avik Dutta, Mukul Singh, Gust Verbruggen, Sumit Gulwani, Vu Le | N/A | N/A |
| STAR: SocioTechnical Approach to Red Teaming Language Models | Laura Weidinger, John F J Mellor, Bernat Guillén Pegueroles, Nahema Marchal, Ravin Kumar, Kristian Lum, Canfer Akbulut, Mark Diaz, A. Stevie Bergman, Mikel D. Rodriguez, Verena Rieser, William Isaac | N/A | N/A |
| Do great minds think alike? Investigating Human-AI Complementarity for Question Answering | Maharshi Gor, Hal Daumé III, Tianyi Zhou, Jordan Lee Boyd-Graber | N/A | N/A |
| Memory-Efficient Fine-Tuning of Transformers via Token Selection | Antoine Simoulin, Namyong Park, Xiaoyi Liu, Grey Yang | N/A | N/A |
| Unveiling the mystery of visual attributes of concrete and abstract concepts: Variability, nearest neighbors, and challenging categories | Tarun Tater, Sabine Schulte im Walde, Diego Frassinelli | N/A | N/A |
| Evaluating Large Language Models on Time Series Feature Understanding: A Comprehensive Taxonomy and Benchmark | Elizabeth Fons, Rachneet Kaur, Soham Palande, Zhen Zeng, Tucker Balch, Manuela Veloso, Svitlana Vyetrenko | N/A | N/A |
| Can LLMs Learn Uncertainty on Their Own? Expressing Uncertainty Effectively in A Self-Training Manner | Shudong Liu, Zhaocong Li, Xuebo Liu, Runzhe Zhan, Derek F. Wong, Lidia S. Chao, Min Zhang | N/A | N/A |
| Preference-Guided Reflective Sampling for Aligning Language Models | Hai Ye, Hwee Tou Ng | N/A | N/A |
| Metrics for What, Metrics for Whom: Assessing Actionability of Bias Evaluation Metrics in NLP | Pieter Delobelle, Giuseppe Attanasio, Debora Nozza, Su Lin Blodgett, Zeerak Talat | N/A | N/A |
| Is this the real life? Is this just fantasy? The Misleading Success of Simulating Social Interactions With LLMs | Xuhui Zhou, Zhe Su, Tiwalayo Eisape, Hyunwoo Kim, Maarten Sap | N/A | N/A |
| A Simple LLM Framework for Long-Range Video Question-Answering | Ce Zhang, Taixi Lu, Md Mohaiminul Islam, Ziyang Wang, Shoubin Yu, Mohit Bansal, Gedas Bertasius | N/A | N/A |
| Rebuilding ROME: Resolving Model Collapse during Sequential Model Editing | Akshat Gupta, Sidharth Baskaran, Gopala Anumanchipalli | N/A | N/A |
| Casablanca: Data and Models for Multidialectal Arabic Speech Recognition | Bashar Talafha, Karima Kadaoui, Samar Mohamed Magdy, Mariem Habiboullah, Chafei Mohamed Chafei, Ahmed Oumar El-Shangiti, Hiba Zayed, Mohamedou Cheikh Tourad, Rahaf Alhamouri, Rwaa Assi, Aisha Alraeesi, Hour Mohamed, Fakhraddin Alwajih, Abdelrahman Mohamed, Abdellah El Mekki, El Moatez Billah Nagoudi, Benelhadj Djelloul Mama Saadia, Hamzah A. Alsayadi, Walid Al-Dhabyani, Sara Shatnawi, Yasir Ech-Chammakhy, Amal Makouar, Yousra Berrachedi, Mustafa Jarrar, Shady Shehata, Ismail Berrada, Muhammad Abdul-Mageed | N/A | N/A |
| Safety Arithmetic: A Framework for Test-time Safety Alignment of Language Models by Steering Parameters and Activations | Rima Hazra, Sayan Layek, Somnath Banerjee, Soujanya Poria | N/A | N/A |
| Communicating with Speakers and Listeners of Different Pragmatic Levels | Kata Naszadi, Frans A Oliehoek, Christof Monz | N/A | N/A |
| RECANTFormer: Referring Expression Comprehension with Varying Numbers of Targets | Bhathiya Hemanthage, Hakan Bilen, Phil Bartie, Christian Dondrup, Oliver Lemon | N/A | N/A |
| Sprout: Green Generative AI with Carbon-Efficient LLM Inference | Baolin Li, Yankai Jiang, Vijay Gadepally, Devesh Tiwari | N/A | N/A |
| Do LLMs Plan Like Human Writers? Comparing Journalist Coverage of Press Releases with LLMs | Alexander Spangher, Nanyun Peng, Sebastian Gehrmann, Mark Dredze | N/A | N/A |
| T-FREE: Tokenizer-Free Generative LLMs via Sparse Representations for Memory-Efficient Embeddings | Björn Deiseroth, Manuel Brack, Samuel Weinbach, Patrick Schramowski, Kristian Kersting | N/A | N/A |
| SpeechQE: Estimating the Quality of Direct Speech Translation | HyoJung Han, Kevin Duh, Marine Carpuat | N/A | N/A |
| Assessing and Verifying Task Utility in LLM-Powered Applications | Negar Arabzadeh, Siqing Huo, Nikhil Mehta, Qingyun Wu, Chi Wang, Ahmed Hassan Awadallah, Charles L. A. Clarke, Julia Kiseleva | N/A | N/A |
| Dynamic Rewarding with Prompt Optimization Enables Tuning-free Self-Alignment of Language Models | Somanshu Singla, Zhen Wang, Tianyang Liu, Abdullah Ashfaq, Zhiting Hu, Eric P. Xing | N/A | N/A |
| Accurate and Data-Efficient Toxicity Prediction when Annotators Disagree | Harbani Jaggi, Kashyap Coimbatore Murali, Eve Fleisig, Erdem Biyik | N/A | N/A |
| Adversarial Text Generation using Large Language Models for Dementia Detection | Youxiang Zhu, Nana Lin, Kiran Sandilya Balivada, Daniel Haehn, Xiaohui Liang | N/A | N/A |
| xCOMET-lite: Bridging the Gap Between Efficiency and Quality in Learned MT Evaluation Metrics | Daniil Larionov, Mikhail Seleznyov, Vasiliy Viskov, Alexander Panchenko, Steffen Eger | N/A | N/A |
| The Greatest Good Benchmark: Measuring LLMs’ Alignment with Utilitarian Moral Dilemmas | Giovanni Franco Gabriel Marraffini, Andrés Cotton, Noe Fabian Hsueh, Juan Wisznia, Axel Fridman, Luciano Del Corro | N/A | N/A |
| FairFlow: Mitigating Dataset Biases through Undecided Learning for Natural Language Understanding | Jiali Cheng, Hadi Amiri | N/A | N/A |
| Style-Shifting Behaviour of the Manosphere on Reddit | Jai Aggarwal, Suzanne Stevenson | N/A | N/A |
| The Death and Life of Great Prompts: Analyzing the Evolution of LLM Prompts from the Structural Perspective | Yihan Ma, Xinyue Shen, Yixin Wu, Boyang Zhang, Michael Backes, Yang Zhang | N/A | N/A |
| Holistic Evaluation for Interleaved Text-and-Image Generation | Minqian Liu, Zhiyang Xu, Zihao Lin, Trevor Ashby, Joy Rimchala, Jiaxin Zhang, Lifu Huang | N/A | N/A |
| FOLIO: Natural Language Reasoning with First-Order Logic | Simeng Han, Hailey Schoelkopf, Yilun Zhao, Zhenting Qi, Martin Riddell, Wenfei Zhou, James Coady, David Peng, Yujie Qiao, Luke Benson, Lucy Sun, Alexander Wardle-Solano, Hannah Szabó, Ekaterina Zubova, Matthew Burtell, Jonathan Fan, Yixin Liu, Brian Wong, Malcolm Sailor, Ansong Ni, Linyong Nan, Jungo Kasai, Tao Yu, Rui Zhang, Alexander Fabbri, Wojciech Maciej Kryscinski, Semih Yavuz, Ye Liu, Xi Victoria Lin, Shafiq Joty, Yingbo Zhou, Caiming Xiong, Rex Ying, Arman Cohan, Dragomir Radev | N/A | N/A |
| The LLM Effect: Are Humans Truly Using LLMs, or Are They Being Influenced By Them Instead? | Alexander Choi, Syeda Sabrina Akter, J.P. Singh, Antonios Anastasopoulos | N/A | N/A |
| Is Child-Directed Speech Effective Training Data for Language Models? | Steven Y. Feng, Noah Goodman, Michael Frank | N/A | N/A |
| RevMUX: Data Multiplexing with Reversible Adapters for Efficient LLM Batch Inference | Yige Xu, Xu Guo, Zhiwei Zeng, Chunyan Miao | N/A | N/A |
| HCEG: Improving the Abstraction Ability of Language Models with Hierarchical Conceptual Entailment Graphs | Juncai Li, Ru Li, Xiaoli Li, Qinghua Chai, Jeff Z. Pan | N/A | N/A |
| M3Hop-CoT: Misogynous Meme Identification with Multimodal Multi-hop Chain-of-Thought | Gitanjali Kumari, Kirtan Jain, Asif Ekbal | N/A | N/A |
| GPT-4 Jailbreaks Itself with Near-Perfect Success Using Self-Explanation | Govind Ramesh, Yao Dou, Wei Xu | N/A | N/A |
| RE-RAG: Improving Open-Domain QA Performance and Interpretability with Relevance Estimator in Retrieval-Augmented Generation | Kiseung Kim, Jay-Yoon Lee | N/A | N/A |
| Evaluating Concurrent Robustness of Language Models Across Diverse Challenge Sets | Vatsal Gupta, Pranshu Pandya, Tushar Kataria, Vivek Gupta, Dan Roth | N/A | N/A |
| Simul-MuST-C: Simultaneous Multilingual Speech Translation Corpus Using Large Language Model | Mana Makinae, Yusuke Sakai, Hidetaka Kamigaito, Taro Watanabe | N/A | N/A |
| Is This a Bad Table? A Closer Look at the Evaluation of Table Generation from Text | Pritika Ramu, Aparna Garimella, Sambaran Bandyopadhyay | N/A | N/A |
| On the Fragility of Active Learners for Text Classification | Abhishek Ghose, Emma Thuong Nguyen | N/A | N/A |
| BMRetriever: Tuning Large Language Models as Better Biomedical Text Retrievers | Ran Xu, Wenqi Shi, Yue Yu, Yuchen Zhuang, Yanqiao Zhu, May Dongmei Wang, Joyce C. Ho, Chao Zhang, Carl Yang | N/A | N/A |
| Comparing Neighbors Together Makes it Easy: Jointly Comparing Multiple Candidates for Efficient and Effective Retrieval | Jonghyun Song, Cheyon Jin, Wenlong Zhao, Jay-Yoon Lee | N/A | N/A |
| M3D: MultiModal MultiDocument Fine-Grained Inconsistency Detection | Chia-Wei Tang, Ting-Chih Chen, Alvi Md Ishmam, Kiet A. Nguyen, Kazi Sajeed Mehrab, Chris Thomas | N/A | N/A |
| MedAdapter: Efficient Test-Time Adaptation of Large Language Models Towards Medical Reasoning | Wenqi Shi, Ran Xu, Yuchen Zhuang, Yue Yu, Haotian Sun, Hang Wu, Carl Yang, May Dongmei Wang | N/A | N/A |
| EHRAgent: Code Empowers Large Language Models for Few-shot Complex Tabular Reasoning on Electronic Health Records | Wenqi Shi, Ran Xu, Yuchen Zhuang, Yue Yu, Jieyu Zhang, Hang Wu, Yuanda Zhu, Joyce C. Ho, Carl Yang, May Dongmei Wang | N/A | N/A |
| SimLLM: Detecting Sentences Generated by Large Language Models Using Similarity between the Generation and its Re-generation | Hoang-Quoc Nguyen-Son, Minh-Son Dao, Koji Zettsu | N/A | N/A |
| CELLO: Causal Evaluation of Large Vision-Language Models | Meiqi Chen, Bo Peng, Yan Zhang, Chaochao Lu | N/A | N/A |
| Simultaneous Interpretation Corpus Construction by Large Language Models in Distant Language Pair | Yusuke Sakai, Mana Makinae, Hidetaka Kamigaito, Taro Watanabe | N/A | N/A |
| Training-free Deep Concept Injection Enables Language Models for Video Question Answering | Xudong Lin, Manling Li, Richard Zemel, Heng Ji, Shih-Fu Chang | N/A | N/A |
| MIBench: Evaluating Multimodal Large Language Models over Multiple Images | Haowei Liu, Xi Zhang, Haiyang Xu, Yaya Shi, Chaoya Jiang, Ming Yan, Ji Zhang, Fei Huang, Chunfeng Yuan, Bing Li, Weiming Hu | N/A | N/A |
| ZEBRA: Zero-Shot Example-Based Retrieval Augmentation for Commonsense Question Answering | Francesco Maria Molfese, Simone Conia, Riccardo Orlando, Roberto Navigli | N/A | N/A |
| ABLE: Personalized Disability Support with Politeness and Empathy Integration | Kshitij Mishra, Manisha Burja, Asif Ekbal | N/A | N/A |
| Language Models as Compilers: Simulating Pseudocode Execution Improves Algorithmic Reasoning in Language Models | Hyungjoo Chae, Yeonghyeon Kim, Seungone Kim, Kai Tzu-iunn Ong, Beong-woo Kwak, Moohyeon Kim, Sunghwan Kim, Taeyoon Kwon, Jiwan Chung, Youngjae Yu, Jinyoung Yeo | N/A | N/A |
| Coffee-Gym: An Environment for Evaluating and Improving Natural Language Feedback on Erroneous Code | Hyungjoo Chae, Taeyoon Kwon, Seungjun Moon, Yongho Song, Dongjin Kang, Kai Tzu-iunn Ong, Beong-woo Kwak, Seonghyeon Bae, Seung-won Hwang, Jinyoung Yeo | N/A | N/A |
| Improving Minimum Bayes Risk Decoding with Multi-Prompt | David Heineman, Yao Dou, Wei Xu | N/A | N/A |
| Deciphering Cognitive Distortions in Patient-Doctor Mental Health Conversations: A Multimodal LLM-Based Detection and Reasoning Framework | Gopendra Vikram Singh, Sai Vardhan Vemulapalli, Mauajama Firdaus, Asif Ekbal | N/A | N/A |
| Nearest Neighbor Normalization Improves Multimodal Retrieval | Neil Chowdhury, Franklin Wang, Sumedh Shenoy, Douwe Kiela, Sarah Schwettmann, Tristan Thrush | N/A | N/A |
| Rethinking Pragmatics in Large Language Models: Towards Open-Ended Evaluation and Preference Tuning | Shengguang Wu, Shusheng Yang, Zhenglun Chen, Qi Su | N/A | N/A |
| LongRAG: A Dual-perspective Retrieval-Augmented Generation Paradigm for Long-Context Question Answering | Qingfei Zhao, Ruobing Wang, Yukuo Cen, Daren Zha, Shicheng Tan, Yuxiao Dong, Jie Tang | N/A | N/A |
| Context-aware Watermark with Semantic Balanced Green-red Lists for Large Language Models | Yuxuan Guo, Zhiliang Tian, Yiping Song, Tianlun Liu, Liang Ding, Dongsheng Li | N/A | N/A |
| Knowledge Graph Enhanced Large Language Model Editing | Mengqi Zhang, Xiaotian Ye, Qiang Liu, Pengjie Ren, Shu Wu, Zhumin Chen | N/A | N/A |
| ‘Quis custodiet ipsos custodes?’ Who will watch the watchmen? On Detecting AI-generated peer-reviews | Sandeep Kumar, Mohit Sahu, Vardhan Gacche, Tirthankar Ghosal, Asif Ekbal | N/A | N/A |
| Mitigating Open-Vocabulary Caption Hallucinations | Assaf Ben-Kish, Moran Yanuka, Morris Alper, Raja Giryes, Hadar Averbuch-Elor | N/A | N/A |
| Initialization of Large Language Models via Reparameterization to Mitigate Loss Spikes | Kosuke Nishida, Kyosuke Nishida, Kuniko Saito | N/A | N/A |
| ALVIN: Active Learning Via INterpolation | Michalis Korakakis, Andreas Vlachos | N/A | N/A |
| Filtered Direct Preference Optimization | Tetsuro Morimura, Mitsuki Sakamoto, Yuu Jinnai, Kenshi Abe, Kaito Ariu | N/A | N/A |
| Instruction Fine-Tuning: Does Prompt Loss Matter? | Mathew Huerta-Enochian, Seung Yong Ko | N/A | N/A |
| Entity Insertion in Multilingual Linked Corpora: The Case of Wikipedia | Tomás Feith, Akhil Arora, Martin Gerlach, Debjit Paul, Robert West | N/A | N/A |
EMNLP 2023
| Title | Author | PDF_Link | Code_URL |
|---|---|---|---|
| UniGen: Universal Domain Generalization for Sentiment Classification via Zero-shot Dataset Generation | Juhwan Choi, Yeonghwa Kim, Seunguk Yu, JungMin Yun, YoungBin Kim | N/A | N/A |
| Multi-News+: Cost-efficient Dataset Cleansing via LLM-based Data Annotation | Juhwan Choi, JungMin Yun, Kyohoon Jin, YoungBin Kim | N/A | N/A |
| FIZZ: Factual Inconsistency Detection by Zoom-in Summary and Zoom-out Document | Joonho Yang, Seunghyun Yoon, ByeongJeong Kim, Hwanhee Lee | N/A | N/A |
| Prompts have evil twins | Rimon Melamed, Lucas Hurley McCabe, Tanay Wakhare, Yejin Kim, H. Howie Huang, Enric Boix-Adserà | N/A | N/A |
| Table Question Answering for Low-resourced Indic Languages | Vaishali Pal, Evangelos Kanoulas, Andrew Yates, Maarten de Rijke | N/A | N/A |
| ImageInWords: Unlocking Hyper-Detailed Image Descriptions | Roopal Garg, Andrea Burns, Burcu Karagol Ayan, Yonatan Bitton, Ceslee Montgomery, Yasumasa Onoe, Andrew Bunner, Ranjay Krishna, Jason Michael Baldridge, Radu Soricut | N/A | N/A |
| LLM-Based Agent Society Investigation: Collaboration and Confrontation in Avalon Gameplay | Yihuai Lan, Zhiqiang Hu, Lei Wang, Yang Wang, Deheng Ye, Peilin Zhao, Ee-Peng Lim, Hui Xiong, Hao Wang | N/A | N/A |
| When LLMs Meets Acoustic Landmarks: An Efficient Approach to Integrate Speech into Large Language Models for Depression Detection | Xiangyu Zhang, Hexin Liu, Kaishuai Xu, Qiquan Zhang, Daijiao Liu, Beena Ahmed, Julien Epps | N/A | N/A |
| Speaking in Wavelet Domain: A Simple and Efficient Approach to Speed up Speech Diffusion Model | Xiangyu Zhang, Daijiao Liu, Hexin Liu, Qiquan Zhang, Hanyu Meng, Leibny Paola Garcia Perera, EngSiong Chng, Lina Yao | N/A | N/A |
| Hateful Word in Context Classification | Sanne Hoeken, Sina Zarrieß, Özge Alacam | N/A | N/A |
| Eyes Don’t Lie: Subjective Hate Annotation and Detection with Gaze | Özge Alacam, Sanne Hoeken, Sina Zarrieß | N/A | N/A |
| NumeroLogic: Number Encoding for Enhanced LLMs’ Numerical Reasoning | Eli Schwartz, Leshem Choshen, Joseph Shtok, Sivan Doveh, Leonid Karlinsky, Assaf Arbelle | N/A | N/A |
| Thinking Fair and Slow: On the Efficacy of Structured Prompts for Debiasing Language Models | Shaz Furniturewala, Surgan Jandial, Abhinav Java, Pragyan Banerjee, Simra Shahid, Sumit Bhatia, Kokil Jaidka | N/A | N/A |
| A Usage-centric Take on Intent Understanding in E-Commerce | Wendi Zhou, Tianyi Li, Pavlos Vougiouklis, Mark Steedman, Jeff Z. Pan | N/A | N/A |
| Fine-Tuning or Retrieval? Comparing Knowledge Injection in LLMs | Oded Ovadia, Menachem Brief, Moshik Mishaeli, Oren Elisha | N/A | N/A |
| Systematic Biases in LLM Simulations of Debates | Amir Taubenfeld, Yaniv Dover, Roi Reichart, Ariel Goldstein | N/A | N/A |
| Studying and Mitigating Biases in Sign Language Understanding Models | Katherine Atwell, Danielle Bragg, Malihe Alikhani | N/A | N/A |
| Uncertainty in Language Models: Assessment through Rank-Calibration | Xinmeng Huang, Shuo Li, Mengxin Yu, Matteo Sesia, Hamed Hassani, Insup Lee, Osbert Bastani, Edgar Dobriban | N/A | N/A |
| RoTBench: A Multi-Level Benchmark for Evaluating the Robustness of Large Language Models in Tool Learning | Junjie Ye, Yilong Wu, Songyang Gao, Caishuang Huang, Sixian Li, Guanyu Li, Xiaoran Fan, Qi Zhang, Tao Gui, Xuanjing Huang | N/A | N/A |
| Learning Planning-based Reasoning by Trajectories Collection and Process Reward Synthesizing | Fangkai Jiao, Chengwei Qin, Zhengyuan Liu, Nancy F. Chen, Shafiq Joty | N/A | N/A |
| Scaling Properties of Speech Language Models | Santiago Cuervo, Ricard Marxer | N/A | N/A |
| “We Demand Justice!”: Towards Social Context Grounding of Political Texts | Rajkumar Pujari, Chengfei Wu, Dan Goldwasser | N/A | N/A |
| An Experimental Analysis on Evaluating Patent Citations | Rabindra Nath Nandi, Suman Maity, Brian Uzzi, Sourav Medya | N/A | N/A |
| Fine-Tuning Large Language Models to Translate: Will a Touch of Noisy Data in Misaligned Languages Suffice? | Dawei Zhu, Pinzhen Chen, Miaoran Zhang, Barry Haddow, Xiaoyu Shen, Dietrich Klakow | N/A | N/A |
| Consolidating Ranking and Relevance Predictions of Large Language Models through Post-Processing | Le Yan, Zhen Qin, Honglei Zhuang, Rolf Jagerman, Xuanhui Wang, Michael Bendersky, Harrie Oosterhuis | N/A | N/A |
| Strength Lies in Differences! Towards Effective Non-collaborative Dialogues via Tailored Strategy Planning | Tong Zhang, Chen Huang, Yang Deng, Hongru Liang, Jia Liu, Zujie Wen, Wenqiang Lei, Tat-Seng Chua | N/A | N/A |
| Impeding LLM-assisted Cheating in Introductory Programming Assignments via Adversarial Perturbation | Saiful Islam Salim, Rubin Yuchan Yang, Alexander Cooper, Suryashree Ray, Saumya Debray, Sazzadur Rahaman | N/A | N/A |
| Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation | Yuan Ge, Yilun Liu, Chi Hu, Weibin Meng, Shimin Tao, Xiaofeng Zhao, Mahongxia, Zhang Li, Boxing Chen, Hao Yang, Bei Li, Tong Xiao, JingBo Zhu | N/A | N/A |
| On the Influence of Gender and Race in Romantic Relationship Prediction from Large Language Models | Abhilasha Sancheti, Haozhe An, Rachel Rudinger | N/A | N/A |
| EmphAssess: a Prosodic Benchmark on Assessing Emphasis Transfer in Speech-to-Speech Models | Maureen de Seyssel, Antony D’Avirro, Adina Williams, Emmanuel Dupoux | N/A | N/A |
| On Fake News Detection with LLM Enhanced Semantics Mining | Xiaoxiao Ma, Yuchen Zhang, Kaize Ding, Jian Yang, Jia Wu, Hao Fan | N/A | N/A |
| On Sensitivity of Learning with Limited Labelled Data to the Effects of Randomness: Impact of Interactions and Systematic Choices | Branislav Pecher, Ivan Srba, Maria Bielikova | N/A | N/A |
| Evaluating the Instruction-Following Robustness of Large Language Models to Prompt Injection | Zekun Li, Baolin Peng, Pengcheng He, Xifeng Yan | N/A | N/A |
| A Study of Nationality Bias in Names and Perplexity using Off-the-Shelf Affect-related Tweet Classifiers | Valentin Barriere, Sebastian Cifuentes | N/A | N/A |
| Mitigating the Alignment Tax of RLHF | Yong Lin, Hangyu Lin, Wei Xiong, Shizhe Diao, Jianmeng Liu, Jipeng Zhang, Rui Pan, Haoxiang Wang, Wenbin Hu, Hanning Zhang, Hanze Dong, Renjie Pi, Han Zhao, Nan Jiang, Heng Ji, Yuan Yao, Tong Zhang | N/A | N/A |
| Evaluating Readability and Faithfulness of Concept-based Explanations | Meng Li, Haoran Jin, Ruixuan Huang, Zhihao Xu, Defu Lian, Zijia Lin, Di Zhang, Xiting Wang | N/A | N/A |
| Personality-aware Student Simulation for Conversational Intelligent Tutoring Systems | Zhengyuan Liu, Stella Xin Yin, Geyu Lin, Nancy F. Chen | N/A | N/A |
| MSI-Agent: Incorporating Multi-Scale Insight into Embodied Agents for Superior Planning and Decision-Making | Dayuan Fu, Biqing Qi, Yihuai Gao, Che Jiang, Guanting Dong, Bowen Zhou | N/A | N/A |
| CoCoLoFa: A Dataset of News Comments with Common Logical Fallacies Written by LLM-Assisted Crowds | Min-Hsuan Yeh, Ruyuan Wan, Ting-Hao Kenneth Huang | N/A | N/A |
| Tokenization Is More Than Compression | Craig W Schmidt, Varshini Reddy, Haoran Zhang, Alec Alameddine, Omri Uzan, Yuval Pinter, Chris Tanner | N/A | N/A |
| FLIRT: Feedback Loop In-context Red Teaming | Ninareh Mehrabi, Palash Goyal, Christophe Dupuy, Qian Hu, Shalini Ghosh, Richard Zemel, Kai-Wei Chang, Aram Galstyan, Rahul Gupta | N/A | N/A |
| Successfully Guiding Humans with Imperfect Instructions by Highlighting Potential Errors and Suggesting Corrections | Lingjun Zhao, Khanh Xuan Nguyen, Hal Daumé III | N/A | N/A |
| Parameter-Efficient Sparsity Crafting from Dense to Mixture-of-Experts for Instruction Tuning on General Tasks | Haoyuan Wu, Haisheng Zheng, Zhuolun He, Bei Yu | N/A | N/A |
| GeoGPT4V: Towards Geometric Multi-modal Large Language Models with Geometric Image Generation | Shihao Cai, Keqin Bao, Hangyu Guo, Jizhi Zhang, Jun Song, Bo Zheng | N/A | N/A |
| Improved Learned Sparse Retrieval with Entity Vocabulary | Thong Nguyen, Shubham Chatterjee, Sean MacAvaney, Iain Mackie, Jeff Dalton, Andrew Yates | N/A | N/A |
| Let the Expert Stick to His Last: Expert-Specialized Fine-Tuning for Sparse Architectural Large Language Models | Zihan Wang, Deli Chen, Damai Dai, Runxin Xu, Zhuoshu Li, Yu Wu | N/A | N/A |
| LongEmbed: Extending Embedding Models for Long Context Retrieval | Dawei Zhu, Liang Wang, Nan Yang, Yifan Song, Wenhao Wu, Furu Wei, Sujian Li | N/A | N/A |
| Making Large Language Models Better Reasoners with Orchestrated Streaming Experiences | Xiangyang Liu, Junliang He, Xipeng Qiu | N/A | N/A |
| Overcome Noise and Bias: Segmentation-Aided Multi-Granularity Denoising and Debiasing for Enhanced Quadruples Extraction in Dialogue | Xianlong Luo, Yihao Wang, Meng Yang | N/A | N/A |
| Integrating Plutchik’s Theory with Mixture of Experts for Enhancing Emotion Classification | Dongjun Lim, Yun-Gyung Cheong | N/A | N/A |
| In-context Contrastive Learning for Event Causality Identification | Chao Liang, Wei Xiang, Bang Wang | N/A | N/A |
| What’s Mine becomes Yours: Defining, Annotating and Detecting Context-Dependent Paraphrases in News Interview Dialogs | Anna Wegmann, Tijs A. van den Broek, Dong Nguyen | N/A | N/A |
| Language Models Learn Rare Phenomena from Less Rare Phenomena: The Case of the Missing AANNs | Kanishka Misra, Kyle Mahowald | N/A | N/A |
| Large Language Models for Data Annotation: A Survey | Zhen Tan, Dawei Li, Song Wang, Alimohammad Beigi, Bohan Jiang, Amrita Bhattacharjee, Mansooreh Karami, Jundong Li, Lu Cheng, Huan Liu | N/A | N/A |
| Chain-of-Dictionary Prompting Elicits Translation in Large Language Models | Hongyuan Lu, Haoran Yang, Haoyang Huang, Dongdong Zhang, Wai Lam, Furu Wei | N/A | N/A |
| AdaZeta: Adaptive Zeroth-Order Tensor-Train Adaption for Memory-Efficient Large Language Models Fine-Tuning | Yifan Yang, Kai Zhen, Ershad Banijamali, Athanasios Mouchtaris, Zheng Zhang | N/A | N/A |
| RoseLoRA: Row and Column-wise Sparse Low-rank Adaptation of Pre-trained Language Model for Knowledge Editing and Fine-tuning | Haoyu Wang, Tianci Liu, Ruirui Li, Monica Xiao Cheng, Tuo Zhao, Jing Gao | N/A | N/A |
| BlendFilter: Advancing Retrieval-Augmented Large Language Models via Query Generation Blending and Knowledge Filtering | Haoyu Wang, Ruirui Li, Haoming Jiang, Jinjin Tian, Zhengyang Wang, Chen Luo, Xianfeng Tang, Monica Xiao Cheng, Tuo Zhao, Jing Gao | N/A | N/A |
| HEART-felt Narratives: Tracing Empathy and Narrative Style in Personal Stories with LLMs | Jocelyn J Shen, Joel Mire, Hae Won Park, Cynthia Breazeal, Maarten Sap | N/A | N/A |
| Eliminating Biased Length Reliance of Direct Preference Optimization via Down-Sampled KL Divergence | Junru Lu, Jiazheng Li, Siyu An, Meng Zhao, Yulan He, Di Yin, Xing Sun | N/A | N/A |
| Bridging Cultures in the Kitchen: A Framework and Benchmark for Cross-Cultural Recipe Retrieval | Tianyi Hu, Maria Maistro, Daniel Hershcovich | N/A | N/A |
| RULE: Reliable Multimodal RAG for Factuality in Medical Vision Language Models | Peng Xia, Kangyu Zhu, Haoran Li, Hongtu Zhu, Yun Li, Gang Li, Linjun Zhang, Huaxiu Yao | N/A | N/A |
| A Reflective LLM-based Agent to Guide Zero-shot Cryptocurrency Trading | Yuan Li, Bingqiao Luo, Qian Wang, Nuo Chen, Xu Liu, Bingsheng He | N/A | N/A |
| A Survey on In-context Learning | Qingxiu Dong, Lei Li, Damai Dai, Ce Zheng, Jingyuan Ma, Rui Li, Heming Xia, Jingjing Xu, Zhiyong Wu, Baobao Chang, Xu Sun, Lei Li, Zhifang Sui | N/A | N/A |
| DocHieNet: A Large and Diverse Dataset for Document Hierarchy Parsing | Hangdi Xing, Changxu Cheng, Feiyu Gao, Zirui Shao, Zhi Yu, Jiajun Bu, Qi Zheng, Cong Yao | N/A | N/A |
| AMR-Evol: Adaptive Modular Response Evolution Elicits Better Knowledge Distillation for Large Language Models in Code Generation | Ziyang Luo, Xin Li, Hongzhan Lin, Jing Ma, Lidong Bing | N/A | N/A |
| EFUF: Efficient Fine-Grained Unlearning Framework for Mitigating Hallucinations in Multimodal Large Language Models | Shangyu Xing, Fei Zhao, Zhen Wu, Tuo An, Weihao Chen, Chunhui Li, Jianbing Zhang, Xinyu Dai | N/A | N/A |
| Rethinking Pruning Large Language Models: Benefits and Pitfalls of Reconstruction Error Minimization | Sungbin Shin, Wonpyo Park, Jaeho Lee, Namhoon Lee | N/A | N/A |
| LLMs Are Zero-Shot Context-Aware Simultaneous Translators | Roman Koshkin, Katsuhito Sudoh, Satoshi Nakamura | N/A | N/A |
| AgentReview: Exploring Peer Review Dynamics with LLM Agents | Yiqiao Jin, Qinlin Zhao, Yiyang Wang, Hao Chen, Kaijie Zhu, Yijia Xiao, Jindong Wang | N/A | N/A |
| ChatRetriever: Adapting Large Language Models for Generalized and Robust Conversational Dense Retrieval | Kelong Mao, Chenlong Deng, Haonan Chen, Fengran Mo, Zheng Liu, Tetsuya Sakai, Zhicheng Dou | N/A | N/A |
| Fairer Preferences Elicit Improved Human-Aligned Large Language Model Judgments | Han Zhou, Xingchen Wan, Yinhong Liu, Nigel Collier, Ivan Vulić, Anna Korhonen | N/A | N/A |
| Learning Interpretable Legal Case Retrieval via Knowledge-Guided Case Reformulation | Chenlong Deng, Kelong Mao, Zhicheng Dou | N/A | N/A |
| Effective Demonstration Annotation for In-Context Learning via Language Model-Based Determinantal Point Process | Peng Wang, Xiaobin Wang, Chao Lou, Shengyu Mao, Pengjun Xie, Yong Jiang | N/A | N/A |
| Pre-trained Language Models Do Not Help Auto-regressive Text-to-Image Generation | Yuhui Zhang, Brandon McKinzie, Zhe Gan, Vaishaal Shankar, Alexander T Toshev | N/A | N/A |
| QUDSELECT: Selective Decoding for Questions Under Discussion Parsing | Ashima Suvarna, Xiao Liu, Tanmay Parekh, Kai-Wei Chang, Nanyun Peng | N/A | N/A |
| Mitigating Language Bias of LMMs in Social Intelligence Understanding with Virtual Counterfactual Calibration | Peng Chen, Xiao-Yu Guo, Yuan-Fang Li, Xiaowang Zhang, Zhiyong Feng | N/A | N/A |
| Model Balancing Helps Low-data Training and Fine-tuning | Zihang Liu, Yuanzhe Hu, Tianyu Pang, Yefan Zhou, Pu Ren, Yaoqing Yang | N/A | N/A |
| Reuse Your Rewards: Reward Model Transfer for Zero-Shot Cross-Lingual Alignment | Zhaofeng Wu, Ananth Balashankar, Yoon Kim, Jacob Eisenstein, Ahmad Beirami | N/A | N/A |
| Large Language Models as Foundations for Next-Gen Dense Retrieval: A Comprehensive Empirical Assessment | Kun Luo, Minghao Qin, Zheng Liu, Shitao Xiao, Jun Zhao, Kang Liu | N/A | N/A |
| A New Pipeline for Knowledge Graph Reasoning Enhanced by Large Language Models Without Fine-Tuning | Zhongwu Chen, Long Bai, Zixuan Li, Zhen Huang, Xiaolong Jin, Yong Dou | N/A | N/A |
| Towards Tool Use Alignment of Large Language Models | Zhi-Yuan Chen, Shiqi Shen, Guangyao Shen, Gong Zhi, Xu Chen, Yankai Lin | N/A | N/A |
| DecorateLM: Data Engineering through Corpus Rating, Tagging, and Editing with Language Models | Ranchi Zhao, Zhen Leng Thai, Yifan Zhang, Shengding Hu, Jie Zhou, Yunqi Ba, Jie Cai, Zhiyuan Liu, Maosong Sun | N/A | N/A |
| Lookback Lens: Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps | Yung-Sung Chuang, Linlu Qiu, Cheng-Yu Hsieh, Ranjay Krishna, Yoon Kim, James R. Glass | N/A | N/A |
| Controllable Preference Optimization: Toward Controllable Multi-Objective Alignment | Yiju Guo, Ganqu Cui, Lifan Yuan, Ning Ding, Zexu Sun, Bowen Sun, Huimin Chen, Ruobing Xie, Jie Zhou, Yankai Lin, Zhiyuan Liu, Maosong Sun | N/A | N/A |
| Mitigating Matthew Effect: Multi-Hypergraph Boosted Multi-Interest Self-Supervised Learning for Conversational Recommendation | Yongsen Zheng, Ruilin Xu, Guohua Wang, Liang Lin | N/A | N/A |
| Advancing Event Causality Identification via Heuristic Semantic Dependency Inquiry Network | Haoran Li, Qiang Gao, Hongmei Wu, Li Huang | N/A | N/A |
| Exploring Union and Intersection of Visual Regions for Generating Questions, Answers, and Distractors | Wenjian Ding, Yao Zhang, Jun Wang, Adam Jatowt, Zhenglu Yang | N/A | N/A |
| UniFashion: A Unified Vision-Language Model for Multimodal Fashion Retrieval and Generation | Xiangyu Zhao, Yuehan Zhang, Wenlong Zhang, Xiao-Ming Wu | N/A | N/A |
| Tracking the perspectives of interacting language models | Hayden Helm, Brandon Duderstadt, Youngser Park, Carey Priebe | N/A | N/A |
| MAR: Matching-Augmented Reasoning for Enhancing Visual-based Entity Question Answering | Zhengxuan Zhang, Yin Wu, Yuyu Luo, Nan Tang | N/A | N/A |
| Can Large Language Models Always Solve Easy Problems if They Can Solve Harder Ones? | Zhe Yang, Yichang Zhang, Tianyu Liu, Jian Yang, Junyang Lin, Chang Zhou, Zhifang Sui | N/A | N/A |
| Watch Every Step! LLM Agent Learning via Iterative Step-level Process Refinement | Weimin Xiong, Yifan Song, Xiutian Zhao, Wenhao Wu, Xun Wang, Ke Wang, Cheng Li, Wei Peng, Sujian Li | N/A | N/A |
| Standardize: Aligning Language Models with Expert-Defined Standards for Content Generation | Joseph Marvin Imperial, Gail Forey, Harish Tayyar Madabushi | N/A | N/A |
| Cross-domain NER with Generated Task-Oriented Knowledge: An Empirical Study from Information Density Perspective | Zhihao Zhang, Sophia Yat Mei Lee, Junshuang Wu, Dong Zhang, Shoushan Li, Erik Cambria, Guodong Zhou | N/A | N/A |
| “Glue pizza and eat rocks” - Exploiting Vulnerabilities in Retrieval-Augmented Generative Models | Zhen Tan, Chengshuai Zhao, Raha Moraffah, Yifan Li, Song Wang, Jundong Li, Tianlong Chen, Huan Liu | N/A | N/A |
| Predicate Debiasing in Vision-Language Models Integration for Scene Graph Generation Enhancement | Yuxuan Wang, Xiaoyuan Liu | N/A | N/A |
| SHIELD: Evaluation and Defense Strategies for Copyright Compliance in LLM Text Generation | Xiaoze Liu, Ting Sun, Tianyang Xu, Feijie Wu, Cunxiang Wang, Xiaoqian Wang, Jing Gao | N/A | N/A |
| MatchTime: Towards Automatic Soccer Game Commentary Generation | Jiayuan Rao, Haoning Wu, Chang Liu, Yanfeng Wang, Weidi Xie | N/A | N/A |
| Rethinking Token Reduction for State Space Models | Zheng Zhan, Yushu Wu, Zhenglun Kong, Changdi Yang, Yifan Gong, Xuan Shen, Xue Lin, Pu Zhao, Yanzhi Wang | N/A | N/A |
| Triad: A Framework Leveraging a Multi-Role LLM-based Agent to Solve Knowledge Base Question Answering | Chang Zong, Yuchen Yan, Weiming Lu, Jian Shao, Yongfeng Huang, Heng Chang, Yueting Zhuang | N/A | N/A |
| MetaGPT: Merging Large Language Models Using Model Exclusive Task Arithmetic | Yuyan Zhou, Liang Song, Bingning Wang, Weipeng Chen | N/A | N/A |
| Event Causality Identification with Synthetic Control | Haoyu Wang, Fengze Liu, Jiayao Zhang, Dan Roth, Kyle Richardson | N/A | N/A |
| Retrieved Sequence Augmentation for Protein Representation Learning | Chang Ma, Haiteng Zhao, Lin Zheng, Jiayi Xin, Qintong Li, Lijun Wu, Zhihong Deng, Yang Young Lu, Qi Liu, Sheng Wang, Lingpeng Kong | N/A | N/A |
| HELPD: Mitigating Hallucination of LVLMs by Hierarchical Feedback Learning with Vision-enhanced Penalty Decoding | Fan Yuan, Chi Qin, Xiaogang Xu, Piji Li | N/A | N/A |
| TopViewRS: Vision-Language Models as Top-View Spatial Reasoners | Chengzu Li, Caiqi Zhang, Han Zhou, Nigel Collier, Anna Korhonen, Ivan Vulić | N/A | N/A |
| DA$^3$: A Distribution-Aware Adversarial Attack against Language Models | Yibo Wang, Xiangjue Dong, James Caverlee, Philip S. Yu | N/A | N/A |
| Evaluating Psychological Safety of Large Language Models | Xingxuan Li, Yutong Li, Lin Qiu, Shafiq Joty, Lidong Bing | N/A | N/A |
| An Effective Deployment of Diffusion LM for Data Augmentation in Low-Resource Sentiment Classification | Zhuowei Chen, Lianxi Wang, Yuben Wu, Xinfeng Liao, Yujia Tian, Junyang Zhong | N/A | N/A |
| Self-Bootstrapped Visual-Language Model for Knowledge Selection and Question Answering | Dongze Hao, Qunbo Wang, Longteng Guo, Jie Jiang, Jing Liu | N/A | N/A |
| PsFuture: A Pseudo-Future-based Zero-Shot Adaptive Policy for Simultaneous Machine Translation | Libo Zhao, Jing Li, Ziqian Zeng | N/A | N/A |
| TinyChart: Efficient Chart Understanding with Program-of-Thoughts Learning and Visual Token Merging | Liang Zhang, Anwen Hu, Haiyang Xu, Ming Yan, Yichen Xu, Qin Jin, Ji Zhang, Fei Huang | N/A | N/A |
| Do We Need Language-Specific Fact-Checking Models? The Case of Chinese | Caiqi Zhang, Zhijiang Guo, Andreas Vlachos | N/A | N/A |
| Enhancing Advanced Visual Reasoning Ability of Large Language Models | Zhiyuan Li, Dongnan Liu, Chaoyi Zhang, Heng Wang, Tengfei Xue, Weidong Cai | N/A | N/A |
| CMD: a framework for Context-aware Model self-Detoxification | Zecheng Tang, Keyan Zhou, Juntao Li, Yuyang Ding, Pinzheng Wang, Yan Bowen, Renjie Hua, Min Zhang | N/A | N/A |
| Embedding and Gradient Say Wrong: A White-Box Method for Hallucination Detection | Xiaomeng Hu, Yiming Zhang, Ru Peng, Haozhe Zhang, Chenwei Wu, Gang Chen, Junbo Zhao | N/A | N/A |
| TCSinger: Zero-Shot Singing Voice Synthesis with Style Transfer and Multi-Level Style Control | Yu Zhang, Ziyue Jiang, Ruiqi Li, Changhao Pan, Jinzheng He, Rongjie Huang, Chuxin Wang, Zhou Zhao | N/A | N/A |
| Be Helpful but Don’t Talk too Much - Enhancing Helpfulness in Conversations through Relevance in Multi-Turn Emotional Support | Junlin Li, Bo Peng, Yu-Yin Hsu, Chu-Ren Huang | N/A | N/A |
| Aligning Language Models to Explicitly Handle Ambiguity | Hyuhng Joon Kim, Youna Kim, Cheonbok Park, Junyeob Kim, Choonghyun Park, Kang Min Yoo, Sang-goo Lee, Taeuk Kim | N/A | N/A |
| Tag-grounded Visual Instruction Tuning with Retrieval Augmentation | Daiqing Qi, Handong Zhao, Zijun Wei, Sheng Li | N/A | N/A |
| GLaPE: Gold Label-agnostic Prompt Evaluation for Large Language Models | Xuanchang Zhang, Zhuosheng Zhang, Hai Zhao | N/A | N/A |
| Decoding the Echoes of Vision from fMRI: Memory Disentangling for Past Semantic Information | Runze Xia, Congchi Yin, Piji Li | N/A | N/A |
| Optimizing Code Retrieval: High-Quality and Scalable Dataset Annotation through Large Language Models | Rui Li, Qi Liu, Liyang He, Zheng Zhang, Hao Zhang, Shengyu Ye, Junyu Lu, Zhenya Huang | N/A | N/A |
| Towards Difficulty-Agnostic Efficient Transfer Learning for Vision-Language Models | Yongjin Yang, Jongwoo Ko, Se-Young Yun | N/A | N/A |
| Advancing Process Verification for Large Language Models via Tree-Based Preference Learning | Mingqian He, Yongliang Shen, Wenqi Zhang, Zeqi Tan, Weiming Lu | N/A | N/A |
| An Inversion Attack Against Obfuscated Embedding Matrix in Language Model Inference | Yu Lin, Qizhi Zhang, Quanwei Cai, Jue Hong, Wu Ye, Huiqi Liu, Bing Duan | N/A | N/A |
| MantisScore: A Reliable Fine-grained Metric for Video Generation | Xuan He, Dongfu Jiang, Ge Zhang, Max Ku, Achint Soni, Sherman Siu, Haonan Chen, Abhranil Chandra, Ziyan Jiang, Aaran Arulraj, Kai Wang, Quy Duc Do, Yuansheng Ni, Bohan Lyu, Yaswanth Narsupalli, Rongqi Fan, Zhiheng Lyu, Bill Yuchen Lin, Wenhu Chen | N/A | N/A |
| A ∧ B ⇔ B ∧ A: Evaluating and Improving Logical Reasoning Ability of Large Language Models | Yuxuan Wan, Wenxuan Wang, Yiliu Yang, Youliang Yuan, Jen-tse Huang, Pinjia He, Wenxiang Jiao, Michael Lyu | N/A | N/A |
| Integrating Structural Semantic Knowledge for Enhanced Information Extraction Pre-training | Xiaoyang Yi, Yuru Bao, Jian Zhang, Yifang Qin, Faxin Lin | N/A | N/A |
| FuseGen: PLM Fusion for Data-generation based Zero-shot Learning | Tianyuan Zou, Yang Liu, Peng Li, Jianqing Zhang, Jingjing Liu, Ya-Qin Zhang | N/A | N/A |
| I Need Help! Evaluating LLM’s Ability to Ask for Users’ Support: A Case Study on Text-to-SQL Generation | Cheng-Kuang Wu, Zhi Rui Tam, Chao-Chung Wu, Chieh-Yen Lin, Hung-yi Lee, Yun-Nung Chen | N/A | N/A |
| Oddballs and Misfits: Detecting Implicit Abuse in Which Identity Groups are Depicted as Deviating from the Norm | Michael Wiegand, Josef Ruppenhofer | N/A | N/A |
| By My Eyes: Grounding Multimodal Large Language Models with Sensor Data via Visual Prompting | Hyungjun Yoon, Biniyam Aschalew Tolera, Taesik Gong, Kimin Lee, Sung-Ju Lee | N/A | N/A |
| Prefixing Attention Sinks can Mitigate Activation Outliers for Large Language Model Quantization | Seungwoo Son, Wonpyo Park, Woohyun Han, Kyuyeun Kim, Jaeho Lee | N/A | N/A |
| CHIQ: Contextual History Enhancement for Improving Query Rewriting in Conversational Search | Fengran Mo, Abbas Ghaddar, Kelong Mao, Mehdi Rezagholizadeh, Boxing Chen, Qun Liu, Jian-Yun Nie | N/A | N/A |
| Towards Low-Resource Harmful Meme Detection with LMM Agents | Jianzhao Huang, Hongzhan Lin, Ziyan Liu, Ziyang Luo, Guang Chen, Jing Ma | N/A | N/A |
| VIVA: A Benchmark for Vision-Grounded Decision-Making with Human Values | Zhe Hu, Yixiao Ren, Jing Li, Yu Yin | N/A | N/A |
| Direct Multi-Turn Preference Optimization for Language Agents | Wentao Shi, Mengqi Yuan, Junkang Wu, Qifan Wang, Fuli Feng | N/A | N/A |
| Self-Refine Instruction-Tuning for Aligning Reasoning in Language Models | Leonardo Ranaldi, Andre Freitas | N/A | N/A |
| In Search of the Long-Tail: Systematic Generation of Long-Tail Inferential Knowledge via Logical Rule Guided Search | Huihan Li, Yuting Ning, Zeyi Liao, Siyuan Wang, Xiang Lorraine Li, Ximing Lu, Wenting Zhao, Faeze Brahman, Yejin Choi, Xiang Ren | N/A | N/A |
| AutoScraper: A Progressive Understanding Web Agent for Web Scraper Generation | Wenhao Huang, Zhouhong Gu, Chenghao Peng, Jiaqing Liang, Zhixu Li, Yanghua Xiao, Liqian Wen, Zulong Chen | N/A | N/A |
| Backward Lens: Projecting Language Model Gradients into the Vocabulary Space | Shahar Katz, Yonatan Belinkov, Mor Geva, Lior Wolf | N/A | N/A |
| Selective Vision is the Challenge for Visual Reasoning: A Benchmark for Visual Argument Understanding | Jiwan Chung, Sungjae Lee, Minseo Kim, Seungju Han, Ashkan Yousefpour, Jack Hessel, Youngjae Yu | N/A | N/A |
| Can visual language models resolve textual ambiguity with visual cues? Let visual puns tell you! | Jiwan Chung, Seungwon Lim, Jaehyun Jeon, Seungbeen Lee, Youngjae Yu | N/A | N/A |
| Reusing Transferable Weight Increments for Low-resource Style Generation | Chunzhen Jin, Eliot Huang, Heng Chang, Yaqi Wang, Peng Cao, Osmar Zaiane | N/A | N/A |
| Large Language Model as an Assignment Evaluator: Insights, Feedback, and Challenges in a 1000+ Student Course | Cheng-Han Chiang, Wei-Chih Chen, Chun-Yi Kuan, Chienchou Yang, Hung-yi Lee | N/A | N/A |
| Seemingly Plausible Distractors in Multi-Hop Reasoning: Are Large Language Models Attentive Readers? | Neeladri Bhuiya, Viktor Schlegel, Stefan Winkler | N/A | N/A |
| Instruction Pre-Training: Language Models are Supervised Multitask Learners | Daixuan Cheng, Yuxian Gu, Shaohan Huang, Junyu Bi, Minlie Huang, Furu Wei | N/A | N/A |
| LEMoE: Advanced Mixture of Experts Adaptor for Lifelong Model Editing of Large Language Models | Renzhi Wang, Piji Li | N/A | N/A |
| Collaborative Performance Prediction for Large Language Models | Qiyuan Zhang, Fuyuan Lyu, Xue Liu, Chen Ma | N/A | N/A |
| Surveying the Dead Minds: Historical-Psychological Text Analysis with Contextualized Construct Representation (CCR) for Classical Chinese | Yuqi Chen, Sixuan Li, Ying Li, Mohammad Atari | N/A | N/A |
| Knowledge Verification to Nip Hallucination in the Bud | Fanqi Wan, Xinting Huang, Leyang Cui, Xiaojun Quan, Wei Bi, Shuming Shi | N/A | N/A |
| QUITE: Quantifying Uncertainty in Natural Language Text in Bayesian Reasoning Scenarios | Timo Pierre Schrader, Lukas Lange, Simon Razniewski, Annemarie Friedrich | N/A | N/A |
| African or European Swallow? Benchmarking Large Vision-Language Models for Fine-Grained Object Classification | Gregor Geigle, Radu Timofte, Goran Glavaš | N/A | N/A |
| Whispers that Shake Foundations: Analyzing and Mitigating False Premise Hallucinations in Large Language Models | Hongbang Yuan, Pengfei Cao, Zhuoran Jin, Yubo Chen, Daojian Zeng, Kang Liu, Jun Zhao | N/A | N/A |
| To Word Senses and Beyond: Inducing Concepts with Contextualized Language Models | Bastien Liétard, Pascal Denis, Mikaela Keller | N/A | N/A |
| ASETF: A Novel Method for Jailbreak Attack on LLMs through Translate Suffix Embeddings | Hao Wang, Hao Li, Minlie Huang, Lei Sha | N/A | N/A |
| An Electoral Approach to Diversify LLM-based Multi-Agent Collective Decision-Making | Xiutian Zhao, Ke Wang, Wei Peng | N/A | N/A |
| Does Object Grounding Really Reduce Hallucination of Large Vision-Language Models? | Gregor Geigle, Radu Timofte, Goran Glavaš | N/A | N/A |
| Take Off the Training Wheels! Progressive In-Context Learning for Effective Alignment | Zhenyu Liu, Dongfang Li, Xinshuo Hu, Xinping Zhao, Yibin Chen, Baotian Hu, Min Zhang | N/A | N/A |
| MoDULA: Mixture of Domain-Specific and Universal LoRA for Multi-Task Learning | Yufei Ma, Zihan Liang, Huangyu Dai, Ben Chen, Dehong Gao, Zhuoran Ran, Zihan Wang, Linbo Jin, Wen Jiang, Guannan Zhang, Xiaoyan Cai, Libin Yang | N/A | N/A |
| Message Passing on Semantic-Anchor-Graphs for Fine-grained Emotion Representation Learning and Classification | Pinyi Zhang, Jingyang Chen, Junchen Shen, Zijie Zhai, Ping Li, Jie Zhang, Kai Zhang | N/A | N/A |
| PhiloGPT: A Philology-Oriented Large Language Model for Ancient Chinese Manuscripts with Dunhuang as Case Study | Yuqing Zhang, Baoyi He, Yihan Chen, Hangqi Li, Han Yue, Shengyu Zhang, Huaiyong Dou, Junchi Yan, Zemin Liu, Yongquan Zhang, Fei Wu | N/A | N/A |
| Alignment-Enhanced Decoding: Defending via Token-Level Adaptive Refining of Probability Distributions | Quan Liu, Zhenhong Zhou, Longzhu He, Yi Liu, Wei Zhang, Sen Su | N/A | N/A |
| MiniConGTS: A Near Ultimate Minimalist Contrastive Grid Tagging Scheme for Aspect Sentiment Triplet Extraction | Qiao Sun, Liujia Yang, Minghao Ma, Nanyang Ye, Qinying Gu | N/A | N/A |
| Evaluating Large Language Models via Linguistic Profiling | Alessio Miaschi, Felice Dell’Orletta, Giulia Venturi | N/A | N/A |
| With Ears to See and Eyes to Hear: Sound Symbolism Experiments with Multimodal Large Language Models | Tyler Loakman, Yucheng Li, Chenghua Lin | N/A | N/A |
| KB-Plugin: A Plug-and-play Framework for Large Language Models to Induce Programs over Low-resourced Knowledge Bases | Jiajie Zhang, Shulin Cao, Linmei Hu, Ling Feng, Lei Hou, Juanzi Li | N/A | N/A |
| Understanding Higher-Order Correlations Among Semantic Components in Embeddings | Momose Oyama, Hiroaki Yamagiwa, Hidetoshi Shimodaira | N/A | N/A |
| DGLF: A Dual Graph-based Learning Framework for Multi-modal Sarcasm Detection | Zhihong Zhu, Kefan Shen, Zhaorun Chen, Yunyan Zhang, Yuyan Chen, Xiaoqi Jiao, Zhongwei Wan, Wei Liu, Xian Wu, Shaorong Xie, Yefeng Zheng | N/A | N/A |
| Evaluating D-MERIT of Partial-annotation on Information Retrieval | Royi Rassin, Yaron Fairstein, Oren Kalinsky, Guy Kushilevitz, Nachshon Cohen, Alexander Libov, Yoav Goldberg | N/A | N/A |
| Verification and Refinement of Natural Language Explanations through LLM-Symbolic Theorem Proving | Xin Quan, Marco Valentino, Louise A. Dennis, Andre Freitas | N/A | N/A |
| Calibrating the Confidence of Large Language Models by Eliciting Fidelity | Mozhi Zhang, Mianqiu Huang, Rundong Shi, Linsen Guo, Chong Peng, Peng Yan, Yaqian Zhou, Xipeng Qiu | N/A | N/A |
| Exploring Reward Model Strength’s Impact on Language Models | Yanjun Chen, Dawei Zhu, Yirong Sun, Xinghao Chen, Wei Zhang, Xiaoyu Shen | N/A | N/A |
| How Hard is this Test Set? NLI Characterization by Exploiting Training Dynamics | Adrian Cosma, Stefan Ruseti, Mihai Dascalu, Cornelia Caragea | N/A | N/A |
| Zero-shot Cross-Lingual Transfer for Synthetic Data Generation in Grammatical Error Detection | Gaetan Lopez Latouche, Marc-André Carbonneau, Benjamin Swanson | N/A | N/A |
| CUTE: Measuring LLMs’ Understanding of Their Tokens | Lukas Edman, Helmut Schmid, Alexander Fraser | N/A | N/A |
| SEER: Self-Aligned Evidence Extraction for Retrieval-Augmented Generation | Xinping Zhao, Dongfang Li, Yan Zhong, Boren Hu, Yibin Chen, Baotian Hu, Min Zhang | N/A | N/A |
| On The Role of Context in Reading Time Prediction | Andreas Opedal, Eleanor Chodroff, Ryan Cotterell, Ethan Wilcox | N/A | N/A |
| BC-Prover: Backward Chaining Prover for Formal Theorem Proving | Yuhang He, Jihai Zhang, Jianzhu Bao, Fangquan Lin, Cheng Yang, Bing Qin, Ruifeng Xu, Wotao Yin | N/A | N/A |
| From Insights to Actions: The Impact of Interpretability and Analysis Research on NLP | Marius Mosbach, Vagrant Gautam, Tomás Vergara Browne, Dietrich Klakow, Mor Geva | N/A | N/A |
| Dual Modalities of Text: Visual and Textual Generative Pre-Training | Yekun Chai, Qingyi Liu, Jingwu Xiao, Shuohuan Wang, Yu Sun, Hua Wu | N/A | N/A |
| On Training Data Influence of GPT Models | Qingyi Liu, Yekun Chai, Shuohuan Wang, Yu Sun, Qiwei Peng, Hua Wu | N/A | N/A |
| Understanding “Democratization” in NLP and ML Research | Arjun Subramonian, Vagrant Gautam, Dietrich Klakow, Zeerak Talat | N/A | N/A |
| DocKD: Knowledge Distillation from LLMs for Open-World Document Understanding Models | Sungnyun Kim, Haofu Liao, Srikar Appalaraju, Peng Tang, Zhuowen Tu, Ravi Kumar Satzoda, R. Manmatha, Vijay Mahadevan, Stefano Soatto | N/A | N/A |
| Cross-lingual Transfer for Automatic Question Generation by Learning Interrogative Structures in Target Languages | Seonjeong Hwang, Yunsu Kim, Gary Lee | N/A | N/A |
| ScalingFilter: Assessing Data Quality through Inverse Utilization of Scaling Laws | Ruihang Li, Yixuan Wei, Miaosen Zhang, Nenghai Yu, Han Hu, Houwen Peng | N/A | N/A |
| Word Alignment as Preference for Machine Translation | Qiyu Wu, Masaaki Nagata, Zhongtao Miao, Yoshimasa Tsuruoka | N/A | N/A |
| Improving Multi-party Dialogue Generation via Topic and Rhetorical Coherence | Yaxin Fan, Peifeng Li, Qiaoming Zhu | N/A | N/A |
| SEEKR: Selective Attention-Guided Knowledge Retention for Continual Learning of Large Language Models | Jinghan He, Haiyun Guo, Kuan Zhu, Zihan Zhao, Ming Tang, Jinqiao Wang | N/A | N/A |
| Neuron-Level Knowledge Attribution in Large Language Models | Zeping Yu, Sophia Ananiadou | N/A | N/A |
| How do Large Language Models Learn In-Context? Query and Key Matrices of In-Context Heads are Two Towers for Metric Learning | Zeping Yu, Sophia Ananiadou | N/A | N/A |
| Interpreting Arithmetic Mechanism in Large Language Models through Comparative Neuron Analysis | Zeping Yu, Sophia Ananiadou | N/A | N/A |
| Pixology: Probing the Linguistic and Visual Knowledge of Pixel-based Language Models | Kushal Tatariya, Vladimir Araujo, Thomas Bauwens, Miryam de Lhoneux | N/A | N/A |
| GoldCoin: Grounding Large Language Models in Privacy Laws via Contextual Integrity Theory | Wei Fan, Haoran Li, Zheye Deng, Weiqi Wang, Yangqiu Song | N/A | N/A |
| Noise, Novels, Numbers. A Framework for Detecting and Categorizing Noise in Danish and Norwegian Literature | Ali Allaith, Daniel Hershcovich, Jens Bjerring-Hansen, Jakob Ingemann Parby, Alexander Conroy, Timothy R Tangherlini | N/A | N/A |
| QUIK: Towards End-to-end 4-Bit Inference on Generative Large Language Models | Saleh Ashkboos, Ilia Markov, Elias Frantar, Tingxuan Zhong, Xincheng Wang, Jie Ren, Torsten Hoefler, Dan Alistarh | N/A | N/A |
| Fine-Grained Prediction of Reading Comprehension from Eye Movements | Omer Shubi, Yoav Meiri, Cfir Avraham Hadar, Yevgeni Berzak | N/A | N/A |
| Efficient Retriever for Multi-Hop Retrieval Question Answering | Ziyuan Zhuang, Zhiyang Zhang, Sitao Cheng, Fangkai Yang, Jia Liu, Shujian Huang, Qingwei Lin, Saravan Rajmohan, Dongmei Zhang, Qi Zhang | N/A | N/A |
| Unsupervised Human Preference Learning | Sumuk Shashidhar, Abhinav Chinta, Vaibhav Sahai, Dilek Hakkani Tur | N/A | N/A |
| Is Safer Better? The Impact of Guardrails on the Argumentative Strength of LLMs in Hate Speech Countering | Helena Bonaldi, Greta Damo, Nicolás Benjamín Ocampo, Elena Cabrio, Serena Villata, Marco Guerini | N/A | N/A |
| Leading Whitespaces of Language Models’ Subword Vocabulary Poses a Confound for Calculating Word Probabilities | Byung-Doh Oh, William Schuler | N/A | N/A |
| LLM4Decompile: Decompiling Binary Code with Large Language Models | Hanzhuo Tan, Qi Luo, Jing Li, Yuqun Zhang | N/A | N/A |
| From Bottom to Top: Extending the Potential of Parameter Efficient Fine-Tuning | Jihao Gu, Zelin Wang, Yibo Zhang, Ziji Zhang, Ping Gong | N/A | N/A |
| CoTKR: Chain-of-Thought Enhanced Knowledge Rewriting for Complex Knowledge Graph Question Answering | Yike Wu, Yi Huang, Nan Hu, Yuncheng Hua, Guilin Qi, Jiaoyan Chen, Jeff Z. Pan | N/A | N/A |
| MTLS: Making Texts into Linguistic Symbols | Wenlong Fei, Xiaohua Wang, Min Hu, Qingyu Zhang, Hongbo Li | N/A | N/A |
| D2R: Dual-Branch Dynamic Routing Network for Multimodal Sentiment Detection | Yifan Chen, Kuntao Li, Weixing Mai, Qiaofeng Wu, Yun Xue, Fenghuan Li | N/A | N/A |
| A Generic Method for Fine-grained Category Discovery in Natural Language Texts | Chang Tian, Matthew B. Blaschko, Wenpeng Yin, Mingzhe Xing, Yinliang Yue, Marie-Francine Moens | N/A | N/A |
| Toxicity Detection is NOT all you Need: Measuring the Gaps to Supporting Volunteer Content Moderators through a User-Centric Method | Yang Trista Cao, Lovely-Frances Domingo, Sarah Gilbert, Michelle L. Mazurek, Katherine Shilton, Hal Daumé III | N/A | N/A |
| A User-Centric Multi-Intent Benchmark for Evaluating Large Language Models | Jiayin Wang, Fengran Mo, Weizhi Ma, Peijie Sun, Min Zhang, Jian-Yun Nie | N/A | N/A |
| Decompose and Compare Consistency: Measuring VLMs’ Answer Reliability via Task-Decomposition Consistency Comparison | Qian Yang, Weixiang Yan, Aishwarya Agrawal | N/A | N/A |
| Learn to Refuse: Making Large Language Models More Controllable and Reliable through Knowledge Scope Limitation and Refusal Mechanism | Lang Cao | N/A | N/A |
| VGBench: A Comprehensive Benchmark of Vector Graphics Understanding and Generation for Large Language Models | Bocheng Zou, Mu Cai, Jianrui Zhang, Yong Jae Lee | N/A | N/A |
| What do large language models need for machine translation evaluation? | Shenbin Qian, Archchana Sindhujan, Minnie Kabra, Diptesh Kanojia, Constantin Orasan, Tharindu Ranasinghe, Fred Blain | N/A | N/A |
| Performance-Guided LLM Knowledge Distillation for Efficient Text Classification at Scale | Flavio Di Palo, Prateek Singhi, Bilal H Fadlallah | N/A | N/A |
| External Knowledge-Driven Argument Mining: Leveraging Attention-Enhanced Multi-Network Models | Debela Gemechu, Chris Reed | N/A | N/A |
| C3PA: An Open Dataset of Expert-Annotated and Regulation-Aware Privacy Policies to Enable Scalable Regulatory Compliance Audits | Maaz Bin Musa, Rishab Nithyanand, Padmini Srinivasan, Mihailis E. Diamantis, Steven M. Winston, Garrison Allen, Jacob Schiller, Kevin Moore, Sean Quick, Johnathan Melvin | N/A | N/A |
| MPT: Multimodal Prompt Tuning for Zero-shot Instruction Learning | Taowen Wang, Yiyang Liu, James Chenhao Liang, Junhan Zhao, Yiming Cui, Yuning Mao, Shaoliang Nie, Jiahao Liu, Fuli Feng, Zenglin Xu, Cheng Han, Lifu Huang, Qifan Wang, Dongfang Liu | N/A | N/A |
| Text Grafting: Near-Distribution Weak Supervision for Minority Classes in Text Classification | Letian Peng, Yi Gu, Chengyu Dong, Zihan Wang, Jingbo Shang | N/A | N/A |
| Incubating Text Classifiers Following User Instruction with Nothing but LLM | Letian Peng, Zilong Wang, Jingbo Shang | N/A | N/A |
| PTD-SQL: Partitioning and Targeted Drilling with LLMs in Text-to-SQL | Ruilin Luo, Liyuan Wang, Binghuai Lin, Zicheng Lin, Yujiu Yang | N/A | N/A |
| Conditional and Modal Reasoning in Large Language Models | Wesley H. Holliday, Matthew Mandelkern, Cedegao E. Zhang | N/A | N/A |
| Advancing Large Language Model Attribution through Self-Improving | Lei Huang, Xiaocheng Feng, Weitao Ma, Liang Zhao, Yuchun Fan, Weihong Zhong, Dongliang Xu, Qing Yang, Hongtao Liu, Bing Qin | N/A | N/A |
| AlignCap: Aligning Speech Emotion Captioning to Human Preferences | Ziqi Liang, Haoxiang Shi, Hanhui Chen | N/A | N/A |
| Interpretability-based Tailored Knowledge Editing in Transformers | Yihuai Hong, Aldo Lipani | N/A | N/A |
| PRompt Optimization in Multi-Step Tasks (PROMST): Integrating Human Feedback and Heuristic-based Sampling | Yongchao Chen, Jacob Arkin, Yilun Hao, Yang Zhang, Nicholas Roy, Chuchu Fan | N/A | N/A |
| Empowering Large Language Model for Continual Video Question Answering with Collaborative Prompting | Chen Cai, Zheng Wang, Jianjun Gao, Wenyang Liu, Ye Lu, Runzhong Zhang, Kim-Hui Yap | N/A | N/A |
| Dissecting Fine-Tuning Unlearning in Large Language Models | Yihuai Hong, Yuelin Zou, Lijie Hu, Ziqian Zeng, Di Wang, Haiqin Yang | N/A | N/A |
| Dancing in Chains: Reconciling Instruction Following and Faithfulness in Language Models | Zhengxuan Wu, Yuhao Zhang, Peng Qi, Yumo Xu, Rujun Han, Yian Zhang, Jifan Chen, Bonan Min, Zhiheng Huang | N/A | N/A |
| Where is the signal in tokenization space? | Renato Geh, Honghua Zhang, Kareem Ahmed, Benjie Wang, Guy Van den Broeck | N/A | N/A |
| Private Language Models via Truncated Laplacian Mechanism | Tianhao Huang, Tao Yang, Ivan Habernal, Lijie Hu, Di Wang | N/A | N/A |
| Estimating Knowledge in Large Language Models Without Generating a Single Token | Daniela Gottesman, Mor Geva | N/A | N/A |
| Consistent Autoformalization for Constructing Mathematical Libraries | Lan Zhang, Xin Quan, Andre Freitas | N/A | N/A |
| Contextual and Parametric Knowledge: More Context, More Focus | Yufei Tao, Adam Hiatt, Erik Haake, Antonie J. Jetter, Ameeta Agrawal | N/A | N/A |
| Semantic Training Signals Promote Hierarchical Syntactic Generalization in Transformers | Aditya Yedetore, Najoung Kim | N/A | N/A |
| When Is Multilinguality a Curse? Language Modeling for 250 High- and Low-Resource Languages | Tyler A. Chang, Catherine Arnett, Zhuowen Tu, Ben Bergen | N/A | N/A |
| Teaching Embodied Reinforcement Learning Agents: Informativeness and Diversity of Language Use | Jiajun Xi, Yinong He, Jianing Yang, Yinpei Dai, Joyce Chai | N/A | N/A |
| MiTTenS: A Dataset for Evaluating Gender Mistranslation | Kevin Robinson, Sneha Kudugunta, Romina Stella, Sunipa Dev, Jasmijn Bastings | N/A | N/A |
| Teaching LLMs to Abstain across Languages via Multilingual Feedback | Shangbin Feng, Weijia Shi, Yike Wang, Wenxuan Ding, Orevaoghene Ahia, Shuyue Stella Li, Vidhisha Balachandran, Sunayana Sitaram, Yulia Tsvetkov | N/A | N/A |
| Modular Pluralism: Pluralistic Alignment via Multi-LLM Collaboration | Shangbin Feng, Taylor Sorensen, Yuhan Liu, Jillian Fisher, Chan Young Park, Yejin Choi, Yulia Tsvetkov | N/A | N/A |
| StyleRemix: Interpretable Authorship Obfuscation via Distillation and Perturbation of Style Elements | Jillian Fisher, Skyler Hallinan, Ximing Lu, Mitchell L Gordon, Zaid Harchaoui, Yejin Choi | N/A | N/A |
| I Could’ve Asked That: Reformulating Unanswerable Questions | Wenting Zhao, Ge Gao, Claire Cardie, Alexander M Rush | N/A | N/A |
| STOP! Benchmarking Large Language Models with Sensitivity Testing on Offensive Progressions | Robert Morabito, Sangmitra Madhusudan, Tyler McDonald, Ali Emami | N/A | N/A |
| Hidden Persuaders: How LLM Political Bias Could Sway Our Elections | Yujin Potter, Shiyang Lai, Junsol Kim, James Evans, Dawn Song | N/A | N/A |
| SOUL: Unlocking the Power of Second-Order Optimization for LLM Unlearning | Jinghan Jia, Yihua Zhang, Yimeng Zhang, Jiancheng Liu, Bharat Runwal, James Diffenderfer, Bhavya Kailkhura, Sijia Liu | N/A | N/A |
| When Reasoning Meets Information Aggregation: A Case Study with Sports Narratives | Yebowen Hu, Kaiqiang Song, Sangwoo Cho, Xiaoyang Wang, Wenlin Yao, Hassan Foroosh, Dong Yu, Fei Liu | N/A | N/A |
| An Analysis of Multilingual FActScore | Vu Trong Kim, Michael Krumdick, Varshini Reddy, Franck Dernoncourt, Viet Dac Lai | N/A | N/A |
| Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models | Seungone Kim, Juyoung Suk, Shayne Longpre, Bill Yuchen Lin, Jamin Shin, Sean Welleck, Graham Neubig, Moontae Lee, Kyungjae Lee, Minjoon Seo | N/A | N/A |
| RAG-QA Arena: Evaluating Domain Robustness for Long-form Retrieval Augmented Question Answering | Rujun Han, Yuhao Zhang, Peng Qi, Yumo Xu, Jenyuan Wang, Lan Liu, William Yang Wang, Bonan Min, Vittorio Castelli | N/A | N/A |
| PromptReps: Prompting Large Language Models to Generate Dense and Sparse Representations for Zero-Shot Document Retrieval | Shengyao Zhuang, Xueguang Ma, Bevan Koopman, Jimmy Lin, Guido Zuccon | N/A | N/A |
| Voices Unheard: NLP Resources and Models for Yorùbá Regional Dialects | Orevaoghene Ahia, Anuoluwapo Aremu, Diana Abagyan, Hila Gonen, David Ifeoluwa Adelani, Daud Abolade, Noah A. Smith, Yulia Tsvetkov | N/A | N/A |
| ARES: Alternating Reinforcement Learning and Supervised Fine-Tuning for Enhanced Multi-Modal Chain-of-Thought Reasoning Through Diverse AI Feedback | Ju-Seung Byun, Jiyun Chun, Jihyung Kil, Andrew Perrault | N/A | N/A |
| Order of Magnitude Speedups for LLM Membership Inference | Rongting Zhang, Martin Andres Bertran, Aaron Roth | N/A | N/A |
| VIMI: Grounding Video Generation through Multi-modal Instruction | Yuwei Fang, Willi Menapace, Aliaksandr Siarohin, Tsai-Shien Chen, Kuan-Chieh Wang, Ivan Skorokhodov, Graham Neubig, Sergey Tulyakov | N/A | N/A |
| F$^2$RL: Factuality and Faithfulness Reinforcement Learning Framework for Claim-Guided Evidence-Supported Counterspeech Generation | Haiyang Wang, Yuchen Pan, Xin Song, Xuechen Zhao, Minghao Hu, Bin Zhou | N/A | N/A |
| Deciphering Rumors: A Multi-Task Learning Approach with Intent-aware Hierarchical Contrastive Learning | Chang Yang, Peng Zhang, Hui Gao, Jing Zhang | N/A | N/A |
| Visual Prompting in LLMs for Enhancing Emotion Recognition | Qixuan Zhang, Zhifeng Wang, Dylan Zhang, Yang Liu, Zhenyue Qin, Wenjia Niu, Sabrina Caldwell, Tom Gedeon | N/A | N/A |
| IDEAW: Robust Neural Audio Watermarking with Invertible Dual-Embedding | Pengcheng Li, Xulong Zhang, Jing Xiao, Jianzong Wang | N/A | N/A |
| Leveraging Conflicts in Social Media Posts: Unintended Offense Dataset | Che Wei Tsai, Yen-Hao Huang, Tsu-keng Liao, Didier Fernando Salazar Estrada, Retnani Latifah, Yi-Shin Chen | N/A | N/A |
| Outcome-Constrained Large Language Models for Countering Hate Speech | Lingzi Hong, Pengcheng Luo, Eduardo Blanco, Xiaoying Song | N/A | N/A |
| Multiple Sources are Better Than One: Incorporating External Knowledge in Low-Resource Glossing | Changbing Yang, Garrett Nicolai, Miikka Silfverberg | N/A | N/A |
| Adaptive Immune-based Sound-Shape Code Substitution for Adversarial Chinese Text Attacks | Ao Wang, Xinghao Yang, Chen Li, Bao-di Liu, Weifeng Liu | N/A | N/A |
| Bootstrapped Policy Learning for Task-oriented Dialogue through Goal Shaping | Yangyang Zhao, Ben Niu, Mehdi Dastani, Shihan Wang | N/A | N/A |
| PsyGUARD: An Automated System for Suicide Detection and Risk Assessment in Psychological Counseling | Huachuan Qiu, Lizhi Ma, Zhenzhong Lan | N/A | N/A |
| World to Code: Multi-modal Data Generation via Self-Instructed Compositional Captioning and Filtering | Jiacong Wang, Bohong Wu, Haiyong Jiang, Haoyuan Guo, Xin Xiao, Zhou Xun, Jun Xiao | N/A | N/A |
| DVD: Dynamic Contrastive Decoding for Knowledge Amplification in Multi-Document Question Answering | Jing Jin, Houfeng Wang, Hao Zhang, Xiaoguang Li, Zhijiang Guo | N/A | N/A |
| How Do Humans Write Code? Large Models Do It the Same Way Too | Long Li, Xuzheng He, Haozhe Wang, Linlin Wang, Liang He | N/A | N/A |
| Retrospex: Language Agent Meets Offline Reinforcement Learning Critic | Yufei Xiang, Yiqun Shen, Yeqin Zhang, Nguyen Cam-Tu | N/A | N/A |
| Forgetting Curve: A Reliable Method for Evaluating Memorization Capability for Long-Context Models | Xinyu Liu, Runsong Zhao, Pengcheng Huang, Chunyang Xiao, Bei Li, Jingang Wang, Tong Xiao, JingBo Zhu | N/A | N/A |
| Retrieve-Plan-Generation: An Iterative Planning and Answering Framework for Knowledge-Intensive LLM Generation | Yuanjie Lyu, Zihan Niu, Zheyong Xie, Chao Zhang, Tong Xu, Yang Wang, Enhong Chen | N/A | N/A |
| CoEvol: Constructing Better Responses for Instruction Finetuning through Multi-Agent Cooperation | Renhao Li, Minghuan Tan, Derek F. Wong, Min Yang | N/A | N/A |
| A Peek into Token Bias: Large Language Models Are Not Yet Genuine Reasoners | Bowen Jiang, Yangxinyu Xie, Zhuoqun Hao, Xiaomeng Wang, Tanwi Mallick, Weijie J Su, Camillo Jose Taylor, Dan Roth | N/A | N/A |
| Bayesian Calibration of Win Rate Estimation with LLM Evaluators | Yicheng Gao, Gonghan Xu, Zhe Wang, Arman Cohan | N/A | N/A |
| MuMath-Code: Combining Tool-Use Large Language Models with Multi-perspective Data Augmentation for Mathematical Reasoning | Shuo Yin, Weihao You, Zhilong Ji, Guoqiang Zhong, Jinfeng Bai | N/A | N/A |
| Seeing the Forest through the Trees: Data Leakage from Partial Transformer Gradients | Weijun Li, Qiongkai Xu, Mark Dras | N/A | N/A |
| RWKV-CLIP: A Robust Vision-Language Representation Learner | Tiancheng Gu, Kaicheng Yang, Xiang An, Ziyong Feng, Dongnan Liu, Weidong Cai, Jiankang Deng | N/A | N/A |
| KidLM: Advancing Language Models for Children – Early Insights and Future Directions | Mir Tafseer Nayeem, Davood Rafiei | N/A | N/A |
| Using Language Models to Disambiguate Lexical Choices in Translation | Josh Barua, Sanjay Subramanian, Kayo Yin, Alane Suhr | N/A | N/A |
| How Does the Disclosure of AI Assistance Affect the Perceptions of Writing? | Zhuoyan Li, Chen Liang, Jing Peng, Ming Yin | N/A | N/A |
| An Unsupervised Approach to Achieve Supervised-Level Explainability in Healthcare Records | Joakim Edin, Maria Maistro, Lars Maaløe, Lasse Borgholt, Jakob Drachmann Havtorn, Tuukka Ruotsalo | N/A | N/A |
| Crafting Personalized Agents through Retrieval-Augmented Generation on Editable Memory Graphs | Zheng Wang, Zhongyang Li, Jiang Zeren, Dandan Tu, Wei Shi | N/A | N/A |
| EVEDIT: Event-based Knowledge Editing for Deterministic Knowledge Propagation | Jiateng Liu, Pengfei Yu, Yuji Zhang, Sha Li, Zixuan Zhang, Ruhi Sarikaya, Kevin Small, Heng Ji | N/A | N/A |
| Predicting Nonnative Sentence Processing with L2LMs | Tatsuya Aoyama, Nathan Schneider | N/A | N/A |
| From the Least to the Most: Building a Plug-and-Play Visual Reasoner via Data Synthesis | Chuanqi Cheng, Jian Guan, Wei Wu, Rui Yan | N/A | N/A |
| Quality Matters: Evaluating Synthetic Data for Tool-Using LLMs | Shadi Iskander, Sofia Tolmach, Ori Shapira, Nachshon Cohen, Zohar Karnin | N/A | N/A |
| Cross-Domain Audio Deepfake Detection: Dataset and Analysis | Yuang Li, Min Zhang, Mengxin Ren, Xiaosong Qiao, Miaomiao Ma, Daimeng Wei, Hao Yang | N/A | N/A |
| MaPPER: Multimodal Prior-guided Parameter Efficient Tuning for Referring Expression Comprehension | Ting Liu, Zunnan Xu, Zhiqiang Wang, Yue Hu, Liangtao Shi, Quanjun Yin | N/A | N/A |
| Investigating How Large Language Models Leverage Internal Knowledge to Perform Complex Reasoning | Miyoung Ko, Sue Hyun Park, Joonsuk Park, Minjoon Seo | N/A | N/A |
| Aligning Translation-Specific Understanding to General Understanding in Large Language Models | Yichong Huang, Baohang Li, Xiaocheng Feng, Wenshuai Huo, Chengpeng Fu, Ting Liu, Bing Qin | N/A | N/A |
| FOOL ME IF YOU CAN! An Adversarial Dataset to Investigate the Robustness of LMs in Word Sense Disambiguation | Mohamad Ballout, Anne Dedert, Nohayr Muhammad Abdelmoneim, Ulf Krumnack, Gunther Heidemann, Kai-Uwe Kühnberger | N/A | N/A |
| Concept-skill Transferability-based Data Selection for Large Vision-Language Models | Jaewoo Lee, Boyang Li, Sung Ju Hwang | N/A | N/A |
| LLMs Assist NLP Researchers: Critique Paper (Meta-)Reviewing | Jiangshu Du, Yibo Wang, Wenting Zhao, Zhongfen Deng, Shuaiqi LIU, Renze Lou, Henry Peng Zou, Pranav Narayanan Venkit, Nan Zhang, Mukund Srinath, Haoran Ranran Zhang, Vipul Gupta, Yinghui Li, Tao Li, Fei Wang, Qin Liu, Tianlin Liu, Pengzhi Gao, Congying Xia, Chen Xing, Cheng Jiayang, Zhaowei Wang, Ying Su, Raj Sanjay Shah, Ruohao Guo, Jing Gu, Haoran Li, Kangda Wei, Zihao Wang, Lu Cheng, Surangika Ranathunga, Meng Fang, Jie Fu, Fei Liu, Ruihong Huang, Eduardo Blanco, Yixin Cao, Rui Zhang, Philip S. Yu, Wenpeng Yin | N/A | N/A |
| Academics Can Contribute to Domain-Specialized Language Models | Mark Dredze, Genta Indra Winata, Prabhanjan Kambadur, Shijie Wu, Ozan Irsoy, Steven Lu, Vadim Dabravolski, David S Rosenberg, Sebastian Gehrmann | N/A | N/A |
| Beyond Reference: Evaluating High Quality Translations Better than Human References | Keonwoong Noh, Seokjin Oh, Woohwan Jung | N/A | N/A |
| Unveiling the Lexical Sensitivity of LLMs: Combinatorial Optimization for Prompt Enhancement | Pengwei Zhan, Zhen Xu, Qian Tan, Jie Song, Ru Xie | N/A | N/A |
| SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages | Holy Lovenia, Rahmad Mahendra, Salsabil Maulana Akbar, Lester James Validad Miranda, Jennifer Santoso, Elyanah Aco, Akhdan Fadhilah, Jonibek Mansurov, Joseph Marvin Imperial, Onno P. Kampman, Joel Ruben Antony Moniz, Muhammad Ravi Shulthan Habibi, Frederikus Hudi, Jann Railey Montalan, Ryan Ignatius Hadiwijaya, Joanito Agili Lopo, William Nixon, Börje F. Karlsson, James Jaya, Ryandito Diandaru, Yuze GAO, Patrick Amadeus Irawan, Bin Wang, Jan Christian Blaise Cruz, Chenxi Whitehouse, Ivan Halim Parmonangan, Maria Khelli, Wenyu Zhang, Lucky Susanto, Reynard Adha Ryanda, Sonny Lazuardi Hermawan, Dan John Velasco, Muhammad Dehan Al Kautsar, Willy Fitra Hendria, Yasmin Moslem, Noah Flynn, Muhammad Farid Adilazuarda, Haochen Li, Johanes Lee, R. Damanhuri, Shuo Sun, Muhammad Reza Qorib, Amirbek Djanibekov, Wei Qi Leong, Quyet V. Do, Niklas Muennighoff, Tanrada Pansuwan, Ilham Firdausi Putra, Yan Xu, Tai Ngee Chia, Ayu Purwarianti, Sebastian Ruder, William Chandra Tjhi, Peerat Limkonchotiwat, Alham Fikri Aji, Sedrick Keh, Genta Indra Winata, Ruochen Zhang, Fajri Koto, Zheng Xin Yong, Samuel Cahyawijaya | N/A | N/A |
| Induct-Learn: Short Phrase Prompting with Instruction Induction | Po-Chun Chen, Sheng-Lun Wei, Hen-Hsen Huang, Hsin-Hsi Chen | N/A | N/A |
| Multi-Granularity History and Entity Similarity Learning for Temporal Knowledge Graph Reasoning | Shi Mingcong, Chunjiang Zhu, Detian Zhang, Shiting Wen, Qing Li | N/A | N/A |
| LUQ: Long-text Uncertainty Quantification for LLMs | Caiqi Zhang, Fangyu Liu, Marco Basaldella, Nigel Collier | N/A | N/A |
| Pretraining Data Detection for Large Language Models: A Divergence-based Calibration Method | Weichao Zhang, Ruqing Zhang, Jiafeng Guo, Maarten de Rijke, Yixing Fan, Xueqi Cheng | N/A | N/A |
| Scaling Synthetic Logical Reasoning Datasets with Context-Sensitive Declarative Grammars | Damien Sileo | N/A | N/A |
| Improving Spoken Language Modeling with Phoneme Classification: A Simple Fine-tuning Approach | Maxime Poli, Emmanuel Chemla, Emmanuel Dupoux | N/A | N/A |
| Safely Learning with Private Data: A Federated Learning Framework for Large Language Model | Jia-Ying Zheng, Hainan Zhang, Lingxiang Wang, Wangjie Qiu, Hong-Wei Zheng, Zhi-Ming Zheng | N/A | N/A |
| Formality Favored: Unraveling the Learning Preferences of Large Language Models on Data with Conflicting Knowledge | Jiahuan Li, Yiqing Cao, Shujian Huang, Jiajun Chen | N/A | N/A |
| How Does the Textual Information Affect the Retrieval of Multimodal In-Context Learning? | Yang Luo, Zangwei Zheng, Zirui Zhu, Yang You | N/A | N/A |
| How Far Can We Extract Diverse Perspectives from Large Language Models? | Shirley Anugrah Hayati, Minhwa Lee, Dheeraj Rajagopal, Dongyeop Kang | N/A | N/A |
| EXPLORA: Efficient Exemplar Subset Selection for Complex Reasoning | Kiran Purohit, Venktesh V, Raghuram Devalla, Krishna Mohan Yerragorla, Sourangshu Bhattacharya, Avishek Anand | N/A | N/A |
| An LLM Feature-based Framework for Dialogue Constructiveness Assessment | Lexin Zhou, Youmna Farag, Andreas Vlachos | N/A | N/A |
| Relevance Is a Guiding Light: Relevance-aware Adaptive Learning for End-to-end Task-oriented Dialogue System | Zhanpeng Chen, Zhihong Zhu, Wanshi Xu, Xianwei Zhuang, Yuexian Zou | N/A | N/A |
| Dialog2Flow: Pre-training Action-Driven Sentence Embeddings for Automatic Dialog Flow Extraction | Sergio Burdisso, Srikanth Madikeri, Petr Motlicek | N/A | N/A |
| Words Worth a Thousand Pictures: Measuring and Understanding Perceptual Variability in Text-to-Image Generation | Raphael Tang, Crystina Zhang, Lixinyu Xu, Yao Lu, Wenyan Li, Pontus Stenetorp, Jimmy Lin, Ferhan Ture | N/A | N/A |
| Investigating LLMs as Voting Assistants via Contextual Augmentation: A Case Study on the European Parliament Elections 2024 | Ilias Chalkidis | N/A | N/A |
| Adaption-of-Thought: Learning Question Difficulty Improves Large Language Models for Reasoning | Mayi Xu, Yongqi Li, Ke Sun, Tieyun Qian | N/A | N/A |
| LogicST: A Logical Self-Training Framework for Document-Level Relation Extraction with Incomplete Annotations | Shengda Fan, Yanting Wang, Shasha Mo, Jianwei Niu | N/A | N/A |
| Concept Space Alignment in Multilingual LLMs | Qiwei Peng, Anders Søgaard | N/A | N/A |
| Predicting Rewards Alongside Tokens: Non-disruptive Parameter Insertion for Efficient Inference Intervention in Large Language Model | Chenhan Yuan, Fei Huang, Ru Peng, Keming Lu, Bowen Yu, Chang Zhou, Jingren Zhou | N/A | N/A |
| NLEBench+NorGLM: A Comprehensive Empirical Analysis and Benchmark Dataset for Generative Language Models in Norwegian | Peng Liu, Lemei Zhang, Terje Farup, Even W. Lauvrak, Jon Espen Ingvaldsen, Simen Eide, Jon Atle Gulla, Zhirong Yang | N/A | N/A |
| RSA-Control: A Pragmatics-Grounded Lightweight Controllable Text Generation Framework | Yifan Wang, Vera Demberg | N/A | N/A |
| Scaling Laws Across Model Architectures: A Comparative Analysis of Dense and MoE Models in Large Language Models | Siqi Wang, Zhengyu Chen, Bei Li, Keqing He, Min Zhang, Jingang Wang | N/A | N/A |
| Synergizing In-context Learning with Hints for End-to-end Task-oriented Dialog Systems | Vishal Vivek Saley, Rocktim Jyoti Das, Dinesh Raghu, Mausam | N/A | N/A |
| REAR: A Relevance-Aware Retrieval-Augmented Framework for Open-Domain Question Answering | Yuhao Wang, Ruiyang Ren, Junyi Li, Xin Zhao, Jing Liu, Ji-Rong Wen | N/A | N/A |
| Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA | Minzheng Wang, Longze Chen, Cheng Fu, Liao Shengyi, Xinghua Zhang, Bingli Wu, Haiyang Yu, Nan Xu, Lei Zhang, Run Luo, Yunshui Li, Min Yang, Fei Huang, Yongbin Li | N/A | N/A |
| On Mitigating Performance Disparities in Multilingual Speech Recognition | Monorama Swain, Anna Katrine van Zee, Anders Søgaard | N/A | N/A |
| Thinking Outside of the Differential Privacy Box: A Case Study in Text Privatization with Language Model Prompting | Stephen Meisenbacher, Florian Matthes | N/A | N/A |
| From Coarse to Fine: Impacts of Feature-Preserving and Feature-Compressing Connectors on Perception in Multimodal Models | Junyan Lin, Haoran Chen, Dawei Zhu, Xiaoyu Shen | N/A | N/A |
| What is “Typological Diversity” in NLP? | Esther Ploeger, Wessel Poelman, Miryam de Lhoneux, Johannes Bjerva | N/A | N/A |
| The Computational Anatomy of Humility: Modeling Intellectual Humility in Online Public Discourse | Xiaobo Guo, Neil Potnis, Melody Yu, Nabeel Gillani, Soroush Vosoughi | N/A | N/A |
| Consistent Bidirectional Language Modelling: Expressive Power and Representational Conciseness | Georgi Shopov, Stefan Gerdjikov | N/A | N/A |
| Benchmarking Vision Language Models for Cultural Understanding | Shravan Nayak, Kanishk Jain, Rabiul Awal, Siva Reddy, Sjoerd van Steenkiste, Lisa Anne Hendricks, Karolina Stanczak, Aishwarya Agrawal | N/A | N/A |
| Methods of Automatic Matrix Language Determination for Code-Switched Speech | Olga Iakovenko, Thomas Hain | N/A | N/A |
| Analyzing Key Factors Influencing Emotion Prediction Performance of VLLMs in Conversational Contexts | Jaewook Lee, Yeajin Jang, Hongjin Kim, Woojin Lee, Harksoo Kim | N/A | N/A |
| Context-Aware Assistant Selection for Improved Inference Acceleration with Large Language Models | Jerry Huang, Prasanna Parthasarathi, Mehdi Rezagholizadeh, Sarath Chandar | N/A | N/A |
| Teaching Small Language Models Reasoning through Counterfactual Distillation | Feng Tao, Yicheng Li, Li Chenglin, Hao Chen, Fei Yu, Yin Zhang | N/A | N/A |
| Do Not Worry if You Do Not Have Data: Building Pretrained Language Models Using Translationese | Meet Doshi, Raj Dabre, Pushpak Bhattacharyya | N/A | N/A |
| Quantifying the Gap Between Machine Translation and Native Language in Training for Multimodal, Multilingual Retrieval | Kyle Buettner, Adriana Kovashka | N/A | N/A |
| MTA4DPR: Multi-Teaching-Assistants Based Iterative Knowledge Distillation for Dense Passage Retrieval | Qixi Lu, Gongbo Tang | N/A | N/A |
| Fine-Grained Detection of Solidarity for Women and Migrants in 155 Years of German Parliamentary Debates | Aida Kostikova, Dominik Beese, Benjamin Paassen, Ole Pütz, Gregor Wiedemann, Steffen Eger | N/A | N/A |
| CItruS: Chunked Instruction-aware State Eviction for Long Sequence Modeling | Yu Bai, Xiyuan Zou, Heyan Huang, Sanxing Chen, Marc-Antoine Rondeau, Yang Gao, Jackie CK Cheung | N/A | N/A |
| Story Embeddings — Narrative-Focused Representations of Fictional Stories | Hans Ole Hatzel, Chris Biemann | N/A | N/A |
| C-LLM: Learn to Check Chinese Spelling Errors Character by Character | Kunting Li, Yong Hu, Liang He, Fandong Meng, Jie Zhou | N/A | N/A |
| PSC: Extending Context Window of Large Language Models via Phase Shift Calibration | Wenqiao Zhu, Chao Xu, Lulu Wang, Jun Wu | N/A | N/A |
| Video-LLaVA: Learning United Visual Representation by Alignment Before Projection | Bin Lin, Yang Ye, Bin Zhu, Jiaxi Cui, Munan Ning, Peng Jin, Li Yuan | N/A | N/A |
| SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales | Tianyang Xu, Shujin Wu, Shizhe Diao, Xiaoze Liu, Xingyao Wang, Yangyi Chen, Jing Gao | N/A | N/A |
| Mitigating Frequency Bias and Anisotropy in Language Model Pre-Training with Syntactic Smoothing | Richard Diehl Martinez, Zebulon Goriely, Andrew Caines, Paula Buttery, Lisa Beinborn | N/A | N/A |
| ToxiCloakCN: Evaluating Robustness of Offensive Language Detection in Chinese with Cloaking Perturbations | Yunze Xiao, Yujia Hu, Kenny Tsu Wei Choo, Roy Ka-Wei Lee | N/A | N/A |
| Boosting Scientific Concepts Understanding: Can Analogies from Teacher Models Empower Student Models? | Siyu Yuan, Cheng Jiayang, Lin Qiu, Deqing Yang | N/A | N/A |
| Model Internals-based Answer Attribution for Trustworthy Retrieval-Augmented Generation | Jirui Qi, Gabriele Sarti, Raquel Fernández, Arianna Bisazza | N/A | N/A |
| Do Large Language Models Know How Much They Know? | Gabriele Prato, Jerry Huang, Prasanna Parthasarathi, Shagun Sodhani, Sarath Chandar | N/A | N/A |
| Investigating Mysteries of CoT-Augmented Distillation | Somin Wadhwa, Silvio Amir, Byron C Wallace | N/A | N/A |
| SciPrompt: Knowledge-Augmented Prompting for Fine-Grained Categorization of Scientific Topics | Zhiwen You, Kanyao Han, Haotian Zhu, Bertram Ludaescher, Jana Diesner | N/A | N/A |
| Distilling Knowledge from Text-to-Image Generative Models Improves Visio-Linguistic Reasoning in CLIP | Samyadeep Basu, Shell Xu Hu, Maziar Sanjabi, Daniela Massiceti, Soheil Feizi | N/A | N/A |
| Learning from Natural Language Explanations for Generalizable Entity Matching | Somin Wadhwa, Adit Krishnan, Runhui Wang, Byron C Wallace, Luyang Kong | N/A | N/A |
| Do You Know What You Are Talking About? Characterizing Query-Knowledge Relevance For Reliable Retrieval Augmented Generation | Zhuohang Li, Jiaxin Zhang, Chao Yan, Kamalika Das, Sricharan Kumar, Murat Kantarcioglu, Bradley A. Malin | N/A | N/A |
| On the Reliability of Psychological Scales on Large Language Models | Jen-tse Huang, Wenxuan Wang, Man Ho Lam, Eric John Li, Wenxiang Jiao, Michael Lyu | N/A | N/A |
| Contrastive Entity Coreference and Disambiguation for Historical Texts | Abhishek Arora, Emily Silcock, Melissa Dell, Leander Heldring | N/A | N/A |
| Finer: Investigating and Enhancing Fine-Grained Visual Concept Recognition in Large Vision Language Models | Jeonghwan Kim, Heng Ji | N/A | N/A |
| Evaluating LLMs for Targeted Concept Simplification for Domain-Specific Texts | Sumit Asthana, Hannah Rashkin, Elizabeth Clark, Fantine Huot, Mirella Lapata | N/A | N/A |
| VLFeedback: A Large-Scale AI Feedback Dataset for Large Vision-Language Models Alignment | Lei Li, Zhihui Xie, Mukai Li, Shunian Chen, Peiyi Wang, Liang Chen, Yazheng Yang, Benyou Wang, Lingpeng Kong, Qi Liu | N/A | N/A |
| Focused Large Language Models are Stable Many-Shot Learners | Peiwen Yuan, Shaoxiong Feng, Yiwei Li, Xinglin Wang, Yueqi Zhang, Chuyi Tan, Boyuan Pan, Heda Wang, Yao Hu, Kan Li | N/A | N/A |
| Reconsidering Sentence-Level Sign Language Translation | Garrett Tanzer, Maximus Shengelia, Ken Harrenstien, David Uthus | N/A | N/A |
| GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities | Sreyan Ghosh, Sonal Kumar, Ashish Seth, Chandra Kiran Reddy Evuru, Utkarsh Tyagi, S Sakshi, Oriol Nieto, Ramani Duraiswami, Dinesh Manocha | N/A | N/A |
| Verba volant, scripta volant? Don’t worry! There are computational solutions for protoword reconstruction | Liviu P Dinu, Ana Sabina Uban, Alina Maria Cristea, Ioan-Bogdan Iordache, Teodor-George Marchitan, Simona Georgescu, Laurentiu Zoicas | N/A | N/A |
| ChatGPT Doesn’t Trust LA Chargers Fans: Guardrail Sensitivity in Context | Victoria R Li, Yida Chen, Naomi Saphra | N/A | N/A |
| Personas as a Way to Model Truthfulness in Language Models | Nitish Joshi, Javier Rando, Abulhair Saparov, Najoung Kim, He He | N/A | N/A |
| Satyrn: A Platform for Analytics Augmented Generation | Marko Sterbentz, Cameron Barrie, Shubham Shahi, Abhratanu Dutta, Donna Hooshmand, Harper Pack, Kristian J Hammond | N/A | N/A |
| EH-MAM: Easy-to-Hard Masked Acoustic Modeling for Self-Supervised Speech Representation Learning | Ashish Seth, Ramaneswaran S, S Sakshi, Sonal Kumar, Sreyan Ghosh, Dinesh Manocha | N/A | N/A |
| EPO: Hierarchical LLM Agents with Environment Preference Optimization | Qi Zhao, Haotian Fu, Chen Sun, George Konidaris | N/A | N/A |
| Detection and Measurement of Syntactic Templates in Generated Text | Chantal Shaib, Yanai Elazar, Junyi Jessy Li, Byron C Wallace | N/A | N/A |
| UOUO: Uncontextualized Uncommon Objects for Measuring Knowledge Horizons of Vision Language Models | Xinyu Pi, Mingyuan Wu, Jize Jiang, Haozhen Zheng, Beitong Tian, ChengXiang Zhai, Klara Nahrstedt, Zhiting Hu | N/A | N/A |
| Optimized Speculative Sampling for GPU Hardware Accelerators | Dominik Wagner, Seanie Lee, Ilja Baumann, Philipp Seeberger, Korbinian Riedhammer, Tobias Bocklet | N/A | N/A |
| Personalized Pieces: Efficient Personalized Large Language Models through Collaborative Efforts | Zhaoxuan Tan, Zheyuan Liu, Meng Jiang | N/A | N/A |
| Democratizing Large Language Models via Personalized Parameter-Efficient Fine-tuning | Zhaoxuan Tan, Qingkai Zeng, Yijun Tian, Zheyuan Liu, Bing Yin, Meng Jiang | N/A | N/A |
| Unifying Multimodal Retrieval via Document Screenshot Embedding | Xueguang Ma, Sheng-Chieh Lin, Minghan Li, Wenhu Chen, Jimmy Lin | N/A | N/A |
| Neuron Specialization: Leveraging Intrinsic Task Modularity for Multilingual Machine Translation | Shaomu Tan, Di Wu, Christof Monz | N/A | N/A |
| An Audit on the Perspectives and Challenges of Hallucinations in NLP | Pranav Narayanan Venkit, Tatiana Chakravorti, Vipul Gupta, Heidi Biggs, Mukund Srinath, Koustava Goswami, Sarah Rajtmajer, Shomir Wilson | N/A | N/A |
| Discovering Knowledge-Critical Subnetworks in Pretrained Language Models | Deniz Bayazit, Negar Foroutan, Zeming Chen, Gail Weiss, Antoine Bosselut | N/A | N/A |
| Reconstruct Your Previous Conversations! Comprehensively Investigating Privacy Leakage Risks in Conversations with GPT Models | Junjie Chu, Zeyang Sha, Michael Backes, Yang Zhang | N/A | N/A |
| Right for Right Reasons: Large Language Models for Verifiable Commonsense Knowledge Graph Question Answering | Armin Toroghi, Willis Guo, Mohammad Mahdi Abdollah Pour, Scott Sanner | N/A | N/A |
| Verifiable, Debuggable, and Repairable Commonsense Logical Reasoning via LLM-based Theory Resolution | Armin Toroghi, Willis Guo, Ali Pesaranghader, Scott Sanner | N/A | N/A |
| Understanding and Mitigating Language Confusion in LLMs | Kelly Marchisio, Wei-Yin Ko, Alexandre Berard, Théo Dehaze, Sebastian Ruder | N/A | N/A |
| Can Large Language Models Learn Independent Causal Mechanisms? | Gael Gendron, Bao Trung Nguyen, Alex Yuxuan Peng, Michael Witbrock, Gillian Dobbie | N/A | N/A |
| MirrorStories: Reflecting Diversity through Personalized Narrative Generation with Large Language Models | Sarfaroz Yunusov, Hamza Sidat, Ali Emami | N/A | N/A |
| InterIntent: Investigating Social Intelligence of LLMs via Intention Understanding in an Interactive Game Context | Ziyi Liu, Abhishek Anand, Pei Zhou, Jen-tse Huang, Jieyu Zhao | N/A | N/A |
| Locating Information Gaps and Narrative Inconsistencies Across Languages: A Case Study of LGBT People Portrayals on Wikipedia | Farhan Samir, Chan Young Park, Vered Shwartz, Anjalie Field, Yulia Tsvetkov | N/A | N/A |
| From Local Concepts to Universals: Evaluating the Multicultural Understanding of Vision-Language Models | Mehar Bhatia, Sahithya Ravi, Aditya Chinchure, EunJeong Hwang, Vered Shwartz | N/A | N/A |
| Dynamic Multi-Reward Weighting for Multi-Style Controllable Generation | Karin De Langis, Ryan Koo, Dongyeop Kang | N/A | N/A |
| MMNeuron: Discovering Neuron-Level Domain-Specific Interpretation in Multimodal Large Language Model | Jiahao Huo, Yibo Yan, Boren Hu, Yutao Yue, Xuming Hu | N/A | N/A |
| Learning to Extract Structured Entities Using Language Models | Haolun Wu, Ye Yuan, Liana Mikaelyan, Alexander Meulemans, Xue Liu, James Hensman, Bhaskar Mitra | N/A | N/A |
| Efficient LLM Comparative Assessment: A Product of Experts Framework for Pairwise Comparisons | Adian Liusie, Vatsal Raina, Yassir Fathullah, Mark Gales | N/A | N/A |
| A Survey of AMR Applications | Shira Wein, Juri Opitz | N/A | N/A |
| Beyond Embeddings: The Promise of Visual Table in Visual Reasoning | Yiwu Zhong, Zi-Yuan Hu, Michael Lyu, Liwei Wang | N/A | N/A |
| CareCorpus+: Expanding and Augmenting Caregiver Strategy Data to Support Pediatric Rehabilitation | Shahla Farzana, Ivana Lucero, Vivian Villegas, Vera C Kaelin, Mary Khetani, Natalie Parde | N/A | N/A |
| Secured Weight Release for Large Language Models via Taylor Expansion | Guanchu Wang, Yu-Neng Chuang, Ruixiang Tang, Shaochen Zhong, Jiayi Yuan, Hongye Jin, Zirui Liu, Vipin Chaudhary, Shuai Xu, James Caverlee, Xia Hu | N/A | N/A |
| TimeR$^4$: Time-aware Retrieval-Augmented Large Language Models for Temporal Knowledge Graph Question Answering | Xinying Qian, Ying Zhang, Yu Zhao, Baohang Zhou, Xuhui Sui, Li Zhang, Kehui Song | N/A | N/A |
| Knowledge-Centric Hallucination Detection | Xiangkun Hu, Dongyu Ru, Lin Qiu, Qipeng Guo, Tianhang Zhang, Yang Xu, Yun Luo, Pengfei Liu, Yue Zhang, Zheng Zhang | N/A | N/A |
| Revealing the Parallel Multilingual Learning within Large Language Models | Yongyu Mu, Peinan Feng, Zhiquan Cao, Yuzhang Wu, Bei Li, Chenglong Wang, Tong Xiao, Kai Song, Tongran Liu, Chunliang Zhang, JingBo Zhu | N/A | N/A |
| Automatic Instruction Evolving for Large Language Models | Weihao Zeng, Can Xu, Yingxiu Zhao, Jian-Guang Lou, Weizhu Chen | N/A | N/A |
| RepEval: Effective Text Evaluation with LLM Representation | Shuqian Sheng, Yi Xu, Tianhang Zhang, Zanwei Shen, Luoyi Fu, Jiaxin Ding, Lei Zhou, Xiaoying Gan, Xinbing Wang, Chenghu Zhou | N/A | N/A |
| Generative Models for Automatic Medical Decision Rule Extraction from Text | Yuxin He, Buzhou Tang, Xiaoling Wang | N/A | N/A |
| Encoding and Controlling Global Semantics for Long-form Video Question Answering | Thong Thanh Nguyen, Zhiyuan Hu, Xiaobao Wu, Cong-Duy T Nguyen, See-Kiong Ng, Anh Tuan Luu | N/A | N/A |
| Towards Understanding Jailbreak Attacks in LLMs: A Representation Space Analysis | Yuping Lin, Pengfei He, Han Xu, Yue Xing, Makoto Yamada, Hui Liu, Jiliang Tang | N/A | N/A |
| Enhancing Legal Case Retrieval via Scaling High-quality Synthetic Query-Candidate Pairs | Cheng Gao, Chaojun Xiao, Zhenghao Liu, Huimin Chen, Zhiyuan Liu, Maosong Sun | N/A | N/A |
| Does Large Language Model Contain Task-Specific Neurons? | Ran Song, Shizhu He, Shuting Jiang, Yantuan Xian, Shengxiang Gao, Kang Liu, Zhengtao Yu | N/A | N/A |
| Liar, Liar, Logical Mire: A Benchmark for Suppositional Reasoning in Large Language Models | Philipp Mondorf, Barbara Plank | N/A | N/A |
| Advancing Test-Time Adaptation in Wild Acoustic Test Settings | Hongfu Liu, Hengguan Huang, Ye Wang | N/A | N/A |
| Learning to Retrieve Iteratively for In-Context Learning | Yunmo Chen, Tongfei Chen, Harsh Jhamtani, Patrick Xia, Richard Shin, Jason Eisner, Benjamin Van Durme | N/A | N/A |
| Taxonomy-guided Semantic Indexing for Academic Paper Search | SeongKu Kang, Yunyi Zhang, Pengcheng Jiang, Dongha Lee, Jiawei Han, Hwanjo Yu | N/A | N/A |
| Python is Not Always the Best Choice: Embracing Multilingual Program of Thoughts | Xianzhen Luo, Qingfu Zhu, Zhiming Zhang, Libo Qin, Xuanyu Zhang, Qing Yang, Dongliang Xu, Wanxiang Che | N/A | N/A |
| Advancing Adversarial Suffix Transfer Learning on Aligned Large Language Models | Hongfu Liu, Yuxi Xie, Ye Wang, Michael Shieh | N/A | N/A |
| Incomplete Utterance Rewriting with Editing Operation Guidance and Utterance Augmentation | Zhiyu Cao, Peifeng Li, Yaxin Fan, Qiaoming Zhu | N/A | N/A |
| FRoG: Evaluating Fuzzy Reasoning of Generalized Quantifiers in LLMs | Yiyuan Li, Shichao Sun, Pengfei Liu | N/A | N/A |
| Aligning Large Language Models with Diverse Political Viewpoints | Dominik Stammbach, Philine Widmer, Eunjung Cho, Caglar Gulcehre, Elliott Ash | N/A | N/A |
| “You Gotta be a Doctor, Lin”: An Investigation of Name-Based Bias of Large Language Models in Employment Recommendations | Huy Nghiem, John Prindle, Jieyu Zhao, Hal Daumé III | N/A | N/A |
| Extending Context Window of Large Language Models from a Distributional Perspective | Yingsheng Wu, Yuxuan Gu, Xiaocheng Feng, Weihong Zhong, Dongliang Xu, Qing Yang, Hongtao Liu, Bing Qin | N/A | N/A |
| Leveraging pre-trained language models for linguistic analysis: A case of argument structure constructions | Hakyung Sung, Kristopher Kyle | N/A | N/A |
| MAgIC: Investigation of Large Language Model Powered Multi-Agent in Cognition, Adaptability, Rationality and Collaboration | Lin Xu, Zhiyuan Hu, Daquan Zhou, Hongyu Ren, Zhen Dong, Kurt Keutzer, See-Kiong Ng, Jiashi Feng | N/A | N/A |
| Position Engineering: Boosting Large Language Models through Positional Information Manipulation | Zhiyuan He, Huiqiang Jiang, Zilong Wang, Yuqing Yang, Luna K. Qiu, Lili Qiu | N/A | N/A |
| Towards Injecting Medical Visual Knowledge into Multimodal LLMs at Scale | Junying Chen, Chi Gui, Ouyang Ruyi, Anningzhe Gao, Shunian Chen, Guiming Hardy Chen, Xidong Wang, Zhenyang Cai, Ke Ji, Xiang Wan, Benyou Wang | N/A | N/A |
| ADELIE: Aligning Large Language Models on Information Extraction | Yunjia Qi, Hao Peng, Xiaozhi Wang, Bin Xu, Lei Hou, Juanzi Li | N/A | N/A |
| Unveiling Factual Recall Behaviors of Large Language Models through Knowledge Neurons | Yifei Wang, Yuheng Chen, Wanting Wen, Yu Sheng, Linjing Li, Daniel Dajun Zeng | N/A | N/A |
| Lexically Grounded Subword Segmentation | Jindřich Libovický, Jindřich Helcl | N/A | N/A |
| EAGLE-2: Faster Inference of Language Models with Dynamic Draft Trees | Yuhui Li, Fangyun Wei, Chao Zhang, Hongyang Zhang | N/A | N/A |
| Do Text-to-Vis Benchmarks Test Real Use of Visualizations? | Hy Nguyen, Xuefei He, Andrew Reeson, Cecile Paris, Josiah Poon, Jonathan K. Kummerfeld | N/A | N/A |
| Gold Panning in Vocabulary: An Adaptive Method for Vocabulary Expansion of Domain-Specific LLMs | Chengyuan Liu, Shihang Wang, Lizhi Qing, Kun Kuang, Yangyang Kang, Changlong Sun, Fei Wu | N/A | N/A |
| Strategic Demonstration Selection for Improved Fairness in LLM In-Context Learning | Jingyu Hu, Weiru Liu, Mengnan Du | N/A | N/A |
| Multi-Dialect Vietnamese: Task, Dataset, Baseline Models and Challenges | Nguyen Van Dinh, Thanh Chi Dang, Luan Thanh Nguyen, Kiet Van Nguyen | N/A | N/A |
| Is LLM-as-a-Judge Robust? Investigating Universal Adversarial Attacks on Zero-shot LLM Assessment | Vyas Raina, Adian Liusie, Mark Gales | N/A | N/A |
| Rethinking the Reversal Curse of LLMs: a Prescription from Human Knowledge Reversal | Zhicong Lu, Li Jin, Peiguang Li, Yu Tian, Linhao Zhang, Sirui Wang, Guangluan Xu, Changyuan Tian, Xunliang Cai | N/A | N/A |
| More Than Catastrophic Forgetting: Integrating General Capabilities For Domain-Specific LLMs | Chengyuan Liu, Shihang Wang, Yangyang Kang, Lizhi Qing, Fubang Zhao, Chao Wu, Changlong Sun, Kun Kuang, Fei Wu | N/A | N/A |
| Muting Whisper: A Universal Acoustic Adversarial Attack on Speech Foundation Models | Vyas Raina, Rao Ma, Charles McGhee, Kate Knill, Mark Gales | N/A | N/A |
| GENRA: Enhancing Zero-shot Retrieval with Rank Aggregation | Georgios Katsimpras, Georgios Paliouras | N/A | N/A |
| XplainLLM: A Knowledge-Augmented Dataset for Reliable Grounded Explanations in LLMs | Zichen Chen, Jianda Chen, Ambuj Singh, Misha Sra | N/A | N/A |
| Divide and Conquer Radiology Report Generation via Observation Level Fine-grained Pretraining and Prompt Tuning | Yuanpin Zhou, Huogen Wang | N/A | N/A |
| SURf: Teaching Large Vision-Language Models to Selectively Utilize Retrieved Information | Jiashuo Sun, Jihai Zhang, Yucheng Zhou, Zhaochen Su, Xiaoye Qu, Yu Cheng | N/A | N/A |
| UNO Arena for Evaluating Sequential Decision-Making Capability of Large Language Models | Zhanyue Qin, Haochuan Wang, Deyuan Liu, Ziyang Song, Cunhang Fan, Zhao Lv, Jinlin Wu, Zhen Lei, Zhiying Tu, Dianhui Chu, Xiaoyan Yu, Dianbo Sui | N/A | N/A |
| Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments | Yu Gu, Yiheng Shu, Hao Yu, Xiao Liu, Yuxiao Dong, Jie Tang, Jayanth Srinivasa, Hugo Latapie, Yu Su | N/A | N/A |
| MORPHEUS: Modeling Role from Personalized Dialogue History by Exploring and Utilizing Latent Space | Yihong Tang, Bo Wang, Dongming Zhao, Jin Xiaojia, Zhang Jijun, Ruifang He, Yuexian Hou | N/A | N/A |
| KnowledgeSG: Privacy-Preserving Synthetic Text Generation With Knowledge Distillation From Server | WenHao Wang, Xiaoyu Liang, Rui Ye, Jingyi Chai, Siheng Chen, Yanfeng Wang | N/A | N/A |
| DAMRO: Dive into the Attention Mechanism of LVLM to Reduce Object Hallucination | Xuan Gong, Tianshi Ming, Xinpeng Wang, Zhihua Wei | N/A | N/A |
| Unlocking the Future: Exploring Look-Ahead Planning Mechanistic Interpretability in Large Language Models | Tianyi Men, Pengfei Cao, Zhuoran Jin, Yubo Chen, Kang Liu, Jun Zhao | N/A | N/A |
| Breaking Language Barriers: Cross-Lingual Continual Pre-Training at Scale | Wenzhen Zheng, Wenbo Pan, Xu Xu, Libo Qin, Li Yue, Ming Zhou | N/A | N/A |
| An Empirical Study of Multilingual Reasoning Distillation for Question Answering | Patomporn Payoungkhamdee, Peerat Limkonchotiwat, Jinheon Baek, Potsawee Manakul, Can Udomcharoenchaikit, Ekapol Chuangsuwanich, Sarana Nutanong | N/A | N/A |
| Can Large Language Models Faithfully Express Their Intrinsic Uncertainty in Words? | Gal Yona, Roee Aharoni, Mor Geva | N/A | N/A |
| Does Fine-Tuning LLMs on New Knowledge Encourage Hallucinations? | Zorik Gekhman, Gal Yona, Roee Aharoni, Matan Eyal, Amir Feder, Roi Reichart, Jonathan Herzig | N/A | N/A |
| Bridging Modalities: Enhancing Cross-Modality Hate Speech Detection with Few-Shot In-Context Learning | Ming Shan Hee, Aditi Kumaresan, Roy Ka-Wei Lee | N/A | N/A |
| MIND: Multimodal Shopping Intention Distillation from Large Vision-language Models for E-commerce Purchase Understanding | Baixuan Xu, Weiqi Wang, Haochen Shi, Wenxuan Ding, Huihao Jing, Tianqing Fang, Jiaxin Bai, Xin Liu, Changlong Yu, Zheng Li, Chen Luo, Qingyu Yin, Bing Yin, Long Chen, Yangqiu Song | N/A | N/A |
| ECON: On the Detection and Resolution of Evidence Conflicts | Cheng Jiayang, Qianqian Zhuang, Chunkit Chan, Lin Qiu, Tianhang Zhang, Tengxiao Liu, Yangqiu Song, Yue Zhang, Pengfei Liu, Zheng Zhang | N/A | N/A |
| “Image, Tell me your story!” Predicting the original meta-context of visual misinformation | Jonathan Tonglet, Marie-Francine Moens, Iryna Gurevych | N/A | N/A |
| Improving Retrieval-augmented Text-to-SQL with AST-based Ranking and Schema Pruning | Zhili Shen, Pavlos Vougiouklis, Chenxin Diao, Kaustubh Vyas, Yuanyi Ji, Jeff Z. Pan | N/A | N/A |
| Mixture-of-Subspaces in Low-Rank Adaptation | Taiqiang Wu, Jiahao Wang, Zhe Zhao, Ngai Wong | N/A | N/A |
| A Large-Scale Investigation of Human-LLM Evaluator Agreement on Multilingual and Multi-Cultural Data | Ishaan Watts, Varun Gumma, Aditya Yadavalli, Vivek Seshadri, Manohar Swaminathan, Sunayana Sitaram | N/A | N/A |
| LawBench: Benchmarking Legal Knowledge of Large Language Models | Zhiwei Fei, Xiaoyu Shen, Dawei Zhu, Fengzhe Zhou, Zhuo Han, Alan Huang, Songyang Zhang, Kai Chen, Zhixin Yin, Zongwen Shen, Jidong Ge, Vincent Ng | N/A | N/A |
| Efficient Performance Tracking: Leveraging Large Language Models for Automated Construction of Scientific Leaderboards | Furkan Şahinuç, Thy Thy Tran, Yulia Grishina, Yufang Hou, Bei Chen, Iryna Gurevych | N/A | N/A |
| Efficient Vision-Language pre-training via domain-specific learning for human activities | Adrian Bulat, Yassine Ouali, Ricardo Guerrero, Brais Martinez, Georgios Tzimiropoulos | N/A | N/A |
| Empowering Backbone Models for Visual Text Generation with Input Granularity Control and Glyph-Aware Training | Wenbo Li, Guohao Li, Zhibin Lan, Xue Xu, Wanru Zhuang, Jiachen Liu, Xinyan Xiao, Jinsong Su | N/A | N/A |
| Evaluating Character Understanding of Large Language Models via Character Profiling from Fictional Works | Xinfeng Yuan, Siyu Yuan, Yuhan Cui, Tianhe Lin, Xintao Wang, Rui Xu, Jiangjie Chen, Deqing Yang | N/A | N/A |
| Getting More from Less: Large Language Models are Good Spontaneous Multilingual Learners | Shimao Zhang, Changjiang Gao, Wenhao Zhu, Jiajun Chen, Xin Huang, Xue Han, Junlan Feng, Chao Deng, Shujian Huang | N/A | N/A |
| AdaSwitch: Adaptive Switching between Small and Large Agents for Effective Cloud-Local Collaborative Learning | Hao Sun, Jiayi Wu, Hengyi Cai, Xiaochi Wei, Yue Feng, Bo Wang, Shuaiqiang Wang, Yan Zhang, Dawei Yin | N/A | N/A |
| CoBa: Convergence Balancer for Multitask Finetuning of Large Language Models | Zi Gong, Hang Yu, Cong Liao, Bingchang Liu, Chaoyu Chen, Jianguo Li | N/A | N/A |
| mDPO: Conditional Preference Optimization for Multimodal Large Language Models | Fei Wang, Wenxuan Zhou, James Y. Huang, Nan Xu, Sheng Zhang, Hoifung Poon, Muhao Chen | N/A | N/A |
| Data Advisor: Data Curation with Foresight for Safety Alignment of Large Language Models | Fei Wang, Ninareh Mehrabi, Palash Goyal, Rahul Gupta, Kai-Wei Chang, Aram Galstyan | N/A | N/A |
| Language-to-Code Translation with a Single Labeled Example | Kaj Bostrom, Harsh Jhamtani, Hao Fang, Sam Thomson, Richard Shin, Patrick Xia, Benjamin Van Durme, Jason Eisner, Jacob Andreas | N/A | N/A |
| Attribute or Abstain: Large Language Models as Long Document Assistants | Jan Buchmann, Xiao Liu, Iryna Gurevych | N/A | N/A |
| FEDKIM: Adaptive Federated Knowledge Injection into Medical Foundation Models | Xiaochen Wang, Jiaqi Wang, Houping Xiao, Jinghui Chen, Fenglong Ma | N/A | N/A |
| Retrieved In-Context Principles from Previous Mistakes | Hao Sun, Yong Jiang, Bo Wang, Yingyan Hou, Yan Zhang, Pengjun Xie, Fei Huang | N/A | N/A |
| EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control | Haozhe Chen, Run Chen, Julia Hirschberg | N/A | N/A |
| VPTQ: Extreme Low-bit Vector Post-Training Quantization for Large Language Models | Yifei Liu, Jicheng Wen, Yang Wang, Shengyu Ye, Li Lyna Zhang, Ting Cao, Cheng Li, Mao Yang | N/A | N/A |
| Deterministic Weighted L* Algorithm | Clemente Pasti, Talu Karagöz, Franz Nowak, Anej Svete, Ryan Cotterell | N/A | N/A |
| Towards Verifiable Text Generation with Evolving Memory and Self-Reflection | Hao Sun, Hengyi Cai, Bo Wang, Yingyan Hou, Xiaochi Wei, Shuaiqiang Wang, Yan Zhang, Dawei Yin | N/A | N/A |
| Pelican: Correcting Hallucination in Vision-LLMs via Claim Decomposition and Program of Thought Verification | Pritish Sahu, Karan Sikka, Ajay Divakaran | N/A | N/A |
| Resampled Datasets Are Not Enough: Mitigating Societal Bias Beyond Single Attributes | Yusuke Hirota, Jerone Andrews, Dora Zhao, Orestis Papakyriakopoulos, Apostolos Modas, Yuta Nakashima, Alice Xiang | N/A | N/A |
| RealVul: Can We Detect Vulnerabilities in Web Applications with LLM? | Di Cao, Yong Liao, Xiuwei Shang | N/A | N/A |
| Unsupervised End-to-End Task-Oriented Dialogue with LLMs: The Power of the Noisy Channel | Brendan King, Jeffrey Flanigan | N/A | N/A |
| Humans or LLMs as the Judge? A Study on Judgement Bias | Guiming Hardy Chen, Shunian Chen, Ziche Liu, Feng Jiang, Benyou Wang | N/A | N/A |
| WPO: Enhancing RLHF with Weighted Preference Optimization | Wenxuan Zhou, Ravi Agrawal, Shujian Zhang, Sathish Reddy Indurthi, Sanqiang Zhao, Kaiqiang Song, Silei Xu, Chenguang Zhu | N/A | N/A |
| Walking in Others’ Shoes: How Perspective-Taking Guides Large Language Models in Reducing Toxicity and Bias | Rongwu Xu, Zian Zhou, Tianwei Zhang, Zehan Qi, Su Yao, Ke Xu, Wei Xu, Han Qiu | N/A | N/A |
| MetaReflection: Learning Instructions for Language Agents using Past Reflections | Priyanshu Gupta, Shashank Kirtania, Ananya Singha, Sumit Gulwani, Arjun Radhakrishna, Gustavo Soares, Sherry Shi | N/A | N/A |
| Stepwise Verification and Remediation of Student Reasoning Errors with Large Language Model Tutors | Nico Daheim, Jakub Macina, Manu Kapur, Iryna Gurevych, Mrinmaya Sachan | N/A | N/A |
| On Eliciting Syntax from Language Models via Hashing | Yiran Wang, Masao Utiyama | N/A | N/A |
| CliMedBench: A Large-Scale Chinese Benchmark for Evaluating Medical Large Language Models in Clinical Scenarios | Zetian Ouyang, Yishuai Qiu, Linlin Wang, Gerard de Melo, Ya Zhang, Yanfeng Wang, Liang He | N/A | N/A |
| The Best Defense is Attack: Repairing Semantics in Textual Adversarial Examples | Heng Yang | N/A | N/A |
| CSSL: Contrastive Self-Supervised Learning for Dependency Parsing on Relatively Free Word Ordered and Morphologically Rich Low Resource Languages | Pretam Ray, Jivnesh Sandhan, Amrith Krishna, Pawan Goyal | N/A | N/A |
| Perceptions of Linguistic Uncertainty by Language Models and Humans | Catarina G Belém, Markelle Kelly, Mark Steyvers, Sameer Singh, Padhraic Smyth | N/A | N/A |
| Explaining and Improving Contrastive Decoding by Extrapolating the Probabilities of a Huge and Hypothetical LM | Haw-Shiuan Chang, Nanyun Peng, Mohit Bansal, Anil Ramakrishna, Tagyoung Chung | N/A | N/A |
| Zero-shot Cross-domain Dialogue State Tracking via Context-aware Auto-prompting and Instruction-following Contrastive Decoding | Xiaoyu Dong, Yujie Feng, Zexin Lu, Guangyuan Shi, Xiao-Ming Wu | N/A | N/A |
| Knowledge Conflicts for LLMs: A Survey | Rongwu Xu, Zehan Qi, Zhijiang Guo, Cunxiang Wang, Hongru Wang, Yue Zhang, Wei Xu | N/A | N/A |
| Generative AI in the Era of “Alternative Facts” | Saadia Gabriel, Liang Lyu, James Siderius, Marzyeh Ghassemi, Jacob Andreas, Asuman E. Ozdaglar | N/A | N/A |
| MEANT: Multimodal Encoder for Antecedent Information | Benjamin Irving, Annika Marie Schoene | N/A | N/A |
| A Thorough Examination of Decoding Methods in the Era of LLMs | Chufan Shi, Haoran Yang, Deng Cai, Zhisong Zhang, Yifan Wang, Yujiu Yang, Wai Lam | N/A | N/A |
| AGRaME: Any-Granularity Ranking with Multi-Vector Embeddings | Revanth Gangi Reddy, Omar Attia, Yunyao Li, Heng Ji, Saloni Potdar | N/A | N/A |
| FIRST: Faster Improved Listwise Reranking with Single Token Decoding | Revanth Gangi Reddy, JaeHyeok Doo, Yifei Xu, Md Arafat Sultan, Deevya Swain, Avirup Sil, Heng Ji | N/A | N/A |
| Exploring Nested Named Entity Recognition with Large Language Models: Methods, Challenges, and Insights | Hongjin Kim, Jai-Eun Kim, Harksoo Kim | N/A | N/A |
| ReCaLL: Membership Inference via Relative Conditional Log-Likelihoods | Roy Xie, Junlin Wang, Ruomin Huang, Minxing Zhang, Rong Ge, Jian Pei, Neil Zhenqiang Gong, Bhuwan Dhingra | N/A | N/A |
| “Flex Tape Can’t Fix That”: Bias and Misinformation in Edited Language Models | Karina H Halevy, Anna Sotnikova, Badr AlKhamissi, Syrielle Montariol, Antoine Bosselut | N/A | N/A |
| Revisiting Who’s Harry Potter: Towards Targeted Unlearning from a Causal Intervention Perspective | Yujian Liu, Yang Zhang, Tommi Jaakkola, Shiyu Chang | N/A | N/A |
| LIONs: An Empirically Optimized Approach to Align Language Models | Xiao Yu, Qingyang Wu, Yu Li, Zhou Yu | N/A | N/A |
| Jellyfish: Instruction-Tuning Local Large Language Models for Data Preprocessing | Haochen Zhang, Yuyang Dong, Chuan Xiao, Masafumi Oyamada | N/A | N/A |
| A Comprehensive Survey of Scientific Large Language Models and Their Applications in Scientific Discovery | Yu Zhang, Xiusi Chen, Bowen Jin, Sheng Wang, Shuiwang Ji, Wei Wang, Jiawei Han | N/A | N/A |
| MiniCheck: Efficient Fact-Checking of LLMs on Grounding Documents | Liyan Tang, Philippe Laban, Greg Durrett | N/A | N/A |
| Beyond Label Attention: Transparency in Language Models for Automated Medical Coding via Dictionary Learning | John Wu, David Wu, Jimeng Sun | N/A | N/A |
| MOSEL: Inference Serving Using Dynamic Modality Selection | Bodun Hu, Le Xu, Jeongyoon Moon, Neeraja J Yadwadkar, Aditya Akella | N/A | N/A |
| From RAG to Riches: Retrieval Interlaced with Sequence Generation | Palak Jain, Livio Baldini Soares, Tom Kwiatkowski | N/A | N/A |
| Task Arithmetic can Mitigate Synthetic-to-Real Gap in Automatic Speech Recognition | Hsuan Su, Hua Farn, Fan-Yun Sun, Shang-Tse Chen, Hung-yi Lee | N/A | N/A |
| Learning to Correct for QA Reasoning with Black-box LLMs | Jaehyung Kim, Dongyoung Kim, Yiming Yang | N/A | N/A |
| AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks? | Ori Yoran, Samuel Joseph Amouyal, Chaitanya Malaviya, Ben Bogin, Ofir Press, Jonathan Berant | N/A | N/A |
| PostMark: A Robust Blackbox Watermark for Large Language Models | Yapei Chang, Kalpesh Krishna, Amir Houmansadr, John Frederick Wieting, Mohit Iyyer | N/A | N/A |
| Assessing “Implicit” Retrieval Robustness of Large Language Models | Xiaoyu Shen, Rexhina Blloshmi, Dawei Zhu, Jiahuan Pei, Wei Zhang | N/A | N/A |
| On the Relationship between Truth and Political Bias in Language Models | Suyash Fulay, William Brannon, Shrestha Mohanty, Cassandra Overney, Elinor Poole-Dayan, Deb Roy, Jad Kabbara | N/A | N/A |
| Can Active Label Correction Improve LLM-based Modular AI Systems? | Karan Taneja, Ashok Goel | N/A | N/A |
| Statistical Uncertainty in Word Embeddings: GloVe-V | Andrea Vallebueno, Cassandra Handan-Nader, Christopher D Manning, Daniel E. Ho | N/A | N/A |
| Annotation alignment: Comparing LLM and human annotations of conversational safety | Rajiv Movva, Pang Wei Koh, Emma Pierson | N/A | N/A |
| DiVERT: Distractor Generation with Variational Errors Represented as Text for Math Multiple-choice Questions | Nigel Fernandez, Alexander Scarlatos, Digory Smith, Simon Woodhead, Nancy Otero Ornelas, Andrew Lan | N/A | N/A |
| The Factuality Tax of Diversity-Intervened Text-to-Image Generation: Benchmark and Fact-Augmented Intervention | Yixin Wan, Di Wu, Haoran Wang, Kai-Wei Chang | N/A | N/A |
| CleanGen: Mitigating Backdoor Attacks for Generation Tasks in Large Language Models | Yuetai Li, Zhangchen Xu, Fengqing Jiang, Luyao Niu, Dinuka Sahabandu, Bhaskar Ramasubramanian, Radha Poovendran | N/A | N/A |
| Enhancing Reinforcement Learning with Intrinsic Rewards from Language Model Critique | Meng Cao, Lei Shu, Lei Yu, Yun Zhu, Nevan Wichers, Yinxiao Liu, Lei Meng | N/A | N/A |
| Words Matter: Reducing Stigma in Online Conversations about Substance Use with Large Language Models | Layla Bouzoubaa, Elham Aghakhani, Shadi Rezapour | N/A | N/A |
| Efficient Sequential Decision Making with Large Language Models | Dingyang Chen, Qi Zhang, Yinglun Zhu | N/A | N/A |
| SignCLIP: Connecting Text and Sign Language by Contrastive Learning | Zifan Jiang, Gerard Sant, Amit Moryossef, Mathias Müller, Rico Sennrich, Sarah Ebling | N/A | N/A |
| APPLS: Evaluating Evaluation Metrics for Plain Language Summarization | Yue Guo, Tal August, Gondy Leroy, Trevor Cohen, Lucy Lu Wang | N/A | N/A |
| Ontologically Faithful Generation of Non-Player Character Dialogues | Nathaniel Weir, Ryan Thomas, Randolph d’Amore, Kellie Hill, Benjamin Van Durme, Harsh Jhamtani | N/A | N/A |
| LLM See, LLM Do: Leveraging Active Inheritance to Target Non-Differentiable Objectives | Luísa Shimabucoro, Sebastian Ruder, Julia Kreutzer, Marzieh Fadaee, Sara Hooker | N/A | N/A |
| RuBLiMP: Russian Benchmark of Linguistic Minimal Pairs | Ekaterina Taktasheva, Maxim Bazhukov, Kirill Koncha, Alena Fenogenova, Ekaterina Artemova, Vladislav Mikhailov | N/A | N/A |
| Text-Tuple-Table: Towards Information Integration in Text-to-Table Generation via Global Tuple Extraction | Zheye Deng, Chunkit Chan, Weiqi Wang, Yuxi Sun, Wei Fan, Tianshi Zheng, Yauwai Yim, Yangqiu Song | N/A | N/A |
| Toward Compositional Behavior in Neural Models: A Survey of Current Views | Kate McCurdy, Paul Soulos, Paul Smolensky | N/A | N/A |
| Optimizing Instructions and Demonstrations for Multi-Stage Language Model Programs | Krista Opsahl-Ong, Michael J Ryan, Josh Purtell, David Broman, Christopher Potts, Matei Zaharia, Omar Khattab | N/A | N/A |
| Reverse-Engineering the Reader | Samuel Kiegeland, Ethan Wilcox, Afra Amini, David Robert Reich, Ryan Cotterell | N/A | N/A |
| Synchronous Faithfulness Monitoring for Trustworthy Retrieval-Augmented Generation | Di Wu, Jia-Chen Gu, Fan Yin, Nanyun Peng, Kai-Wei Chang | N/A | N/A |
| Structure Guided Prompt: Instructing Large Language Model in Multi-Step Reasoning by Exploring Graph Structure of the Text | Kewei Cheng, Nesreen K. Ahmed, Theodore L. Willke, Yizhou Sun | N/A | N/A |
| Less is More: Parameter-Efficient Selection of Intermediate Tasks for Transfer Learning | David Schulte, Felix Hamborg, Alan Akbik | N/A | N/A |
| The effects of distance on NPI illusive effects in BERT | So Young Lee, Mai Ha Vu | N/A | N/A |
| Enhancing Systematic Decompositional Natural Language Inference Using Informal Logic | Nathaniel Weir, Kate Sanders, Orion Weller, Shreya Sharma, Dongwei Jiang, Zhengping Jiang, Bhavana Dalvi Mishra, Oyvind Tafjord, Peter Jansen, Peter Clark, Benjamin Van Durme | N/A | N/A |
| Susu Box or Piggy Bank: Assessing Cultural Commonsense Knowledge between Ghana and the US | Christabel Acquaye, Haozhe An, Rachel Rudinger | N/A | N/A |
| Read Anywhere Pointed: Layout-aware GUI Screen Reading with Tree-of-Lens Grounding | Yue Fan, Lei Ding, Ching-Chen Kuo, Shan Jiang, Yang Zhao, Xinze Guan, Jie Yang, Yi Zhang, Xin Eric Wang | N/A | N/A |
| Ranking Manipulation for Conversational Search Engines | Samuel Pfrommer, Yatong Bai, Tanmay Gautam, Somayeh Sojoudi | N/A | N/A |
| Fast Forwarding Low-Rank Training | Adir Rahamim, Naomi Saphra, Sara Kangaslahti, Yonatan Belinkov | N/A | N/A |
| Precise Model Benchmarking with Only a Few Observations | Riccardo Fogliato, Pratik Patil, Nil-Jana Akpinar, Mathew Monfort | N/A | N/A |
| Attribute Diversity Determines the Systematicity Gap in VQA | Ian Berlot-Attwell, Kumar Krishna Agrawal, Annabelle Michael Carrell, Yash Sharma, Naomi Saphra | N/A | N/A |
| “Rows, Columns and Values, Oh My!” Synthesizing Scientific Literature into Tables using Language Models | Benjamin Newman, Yoonjoo Lee, Aakanksha Naik, Pao Siangliulue, Raymond Fok, Juho Kim, Daniel S Weld, Joseph Chee Chang, Kyle Lo | N/A | N/A |
| Development of Cognitive Intelligence in Pre-trained Language Models | Raj Sanjay Shah, Khushi Bhardwaj, Sashank Varma | N/A | N/A |
| Modeling Layout Reading Order as Ordering Relations for Visually-rich Document Understanding | Chong Zhang, Yi Tu, Yixi Zhao, Chenshu Yuan, Huan Chen, Yue Zhang, Mingxu Chai, Ya Guo, Huijia Zhu, Qi Zhang, Tao Gui | N/A | N/A |
| Birdie: Advancing State Space Models with a Minimalist Architecture and Novel Pre-training Objectives | Sam Blouir, Jimmy T.H. Smith, Antonios Anastasopoulos, Amarda Shehu | N/A | N/A |
| Is It Good Data for Multilingual Instruction Tuning or Just Bad Multilingual Evaluation for Large Language Models? | Pinzhen Chen, Simon Yu, Zhicheng Guo, Barry Haddow | N/A | N/A |
| Token Erasure as a Footprint of Implicit Vocabulary Items in LLMs | Sheridan Feucht, David Atkinson, Byron C Wallace, David Bau | N/A | N/A |
| TraveLER: A Modular Multi-LMM Agent Framework for Video Question-Answering | Chuyi Shang, Amos You, Sanjay Subramanian, Trevor Darrell, Roei Herzig | N/A | N/A |
| Evaluating the Effectiveness of Large Language Models in Establishing Conversational Grounding | Biswesh Mohapatra, Manav Nitin Kapadnis, Laurent Romary, Justine Cassell | N/A | N/A |
| Unlocking Memorization in Large Language Models with Dynamic Soft Prompting | Zhepeng Wang, Runxue Bao, Yawen Wu, Jackson Taylor, Cao Xiao, Feng Zheng, Weiwen Jiang, Shangqian Gao, Yanfu Zhang | N/A | N/A |
| If CLIP Could Talk: Understanding Vision-Language Model Representations Through Their Preferred Concept Descriptions | Reza Esfandiarpoor, Cristina Menghini, Stephen Bach | N/A | N/A |
| Extract, Define, Canonicalize: An LLM-based Framework for Knowledge Graph Construction | Bowen Zhang, Harold Soh | N/A | N/A |
| MQuinE: a Cure for “Z-paradox” in Knowledge Graph Embedding | Yang Liu, Huang Fang, Yunfeng Cai, Mingming Sun | N/A | N/A |
| Can Transformer Language Models Learn $n$-gram Language Models? | Anej Svete, Nadav Borenstein, Mike Zhou, Ryan Cotterell | N/A | N/A |
| StablePrompt: Automatic Prompt Tuning using Reinforcement Learning for Large Language Model | Minchan Kwon, Gaeun Kim, Jongsuk Kim, Haeil Lee, Junmo Kim | N/A | N/A |
| Summary of a Haystack: A Challenge to Long-Context LLMs and RAG Systems | Philippe Laban, Alexander Fabbri, Caiming Xiong, Chien-Sheng Wu | N/A | N/A |
| Multi-pass Decoding for Grammatical Error Correction | Xiaoying Wang, Lingling Mu, Jingyi Zhang, Hongfei Xu | N/A | N/A |
| Into the Unknown Unknowns: Engaged Human Learning through Participation in Language Model Agent Conversations | Yucheng Jiang, Yijia Shao, Dekun Ma, Sina Semnani, Monica Lam | N/A | N/A |
| SCOI: Syntax-augmented Coverage-based In-context Example Selection for Machine Translation | Chenming Tang, Zhixiang Wang, Yunfang Wu | N/A | N/A |
| Efficient Temporal Extrapolation of Multimodal Large Language Models with Temporal Grounding Bridge | Yuxuan Wang, Yueqian Wang, Pengfei Wu, Jianxin Liang, Dongyan Zhao, Yang Liu, Zilong Zheng | N/A | N/A |
| STORYSUMM: Evaluating Faithfulness in Story Summarization | Melanie Subbiah, Faisal Ladhak, Akankshya Mishra, Griffin Thomas Adams, Lydia Chilton, Kathleen McKeown | N/A | N/A |
| MMOE: Enhancing Multimodal Models with Mixtures of Multimodal Interaction Experts | Haofei Yu, Zhengyang Qi, Lawrence Keunho Jang, Russ Salakhutdinov, Louis-Philippe Morency, Paul Pu Liang | N/A | N/A |
| OmAgent: A Multi-modal Agent Framework for Complex Video Understanding with Task Divide-and-Conquer | Lu Zhang, Tiancheng Zhao, Heting Ying, Yibo Ma, Kyusong Lee | N/A | N/A |
| Enhancing Pre-Trained Generative Language Models with Question Attended Span Extraction on Machine Reading Comprehension | Lin Ai, Zheng Hui, Zizhou Liu, Julia Hirschberg | N/A | N/A |
| CommonIT: Commonality-Aware Instruction Tuning for Large Language Models via Data Partitions | Jun Rao, Xuebo Liu, Lian Lian, Shengjun Cheng, Yunjie Liao, Min Zhang | N/A | N/A |
| ESC: Efficient Speech Coding with Cross-Scale Residual Vector Quantized Transformers | Yuzhe Gu, Enmao Diao | N/A | N/A |
| Breaking ReLU Barrier: Generalized MoEfication for Dense Pretrained Models | Jaeseong Lee, Seung-won Hwang, Wonpyo Park, Mingi Ji | N/A | N/A |
| Detecting Subtle Differences between Human and Model Languages Using Spectrum of Relative Likelihood | Yang Xu, Yu Wang, Hao An, Yongyuan Li, Zhichen Liu | N/A | N/A |
| Optimizing Language Models with Fair and Stable Reward Composition in Reinforcement Learning | Jiahui Li, Hanlin Zhang, Fengda Zhang, Tai-Wei Chang, Kun Kuang, Long Chen, Jun Zhou | N/A | N/A |
| Fine-grained Pluggable Gradient Ascent for Knowledge Unlearning in Language Models | XiaoHua Feng, Chaochao Chen, Yuyuan Li, Zibin Lin | N/A | N/A |
| ARM: An Alignment-and-Replacement Module for Chinese Spelling Check Based on LLMs | Changchun Liu, Kai Zhang, Junzhe Jiang, Zirui Liu, Hanqing Tao, Min Gao, Enhong Chen | N/A | N/A |
| On the In-context Generation of Language Models | Zhongtao Jiang, Yuanzhe Zhang, Kun Luo, Xiaowei Yuan, Jun Zhao, Kang Liu | N/A | N/A |
| Atomic Inference for NLI with Generated Facts as Atoms | Joe Stacey, Pasquale Minervini, Haim Dubossarsky, Oana-Maria Camburu, Marek Rei | N/A | N/A |
| Towards Robust Speech Representation Learning for Thousands of Languages | William Chen, Wangyou Zhang, Yifan Peng, Xinjian Li, Jinchuan Tian, Jiatong Shi, Xuankai Chang, Soumi Maiti, Karen Livescu, Shinji Watanabe | N/A | N/A |
| I Learn Better If You Speak My Language: Understanding the Superior Performance of Fine-Tuning Large Language Models with LLM-Generated Responses | Xuan Ren, Biao Wu, Lingqiao Liu | N/A | N/A |
| PreAlign: Boosting Cross-Lingual Transfer by Early Establishment of Multilingual Alignment | Jiahuan Li, Shujian Huang, Aarron Ching, Xinyu Dai, Jiajun Chen | N/A | N/A |
| An image speaks a thousand words, but can everyone listen? On image transcreation for cultural relevance | Simran Khanuja, Sathyanarayanan Ramamoorthy, Yueqi Song, Graham Neubig | N/A | N/A |
| When Parts are Greater Than Sums: Individual LLM Components Can Outperform Full Models | Ting-Yun Chang, Jesse Thomason, Robin Jia | N/A | N/A |
| Multimodal Clickbait Detection by De-confounding Biases Using Causal Representation Inference | Jianxing Yu, Shiqi Wang, Han Yin, Zhenlong Sun, Ruobing Xie, Bo Zhang, Yanghui Rao | N/A | N/A |
| Matryoshka-Adaptor: Unsupervised and Supervised Tuning for Smaller Embedding Dimensions | Jinsung Yoon, Rajarishi Sinha, Sercan O Arik, Tomas Pfister | N/A | N/A |
| KNN-Instruct: Automatic Instruction Construction with K Nearest Neighbor Deduction | Jianshang Kou, Benfeng Xu, Chiwei Zhu, Zhendong Mao | N/A | N/A |
| Contextualized Sequence Likelihood: Enhanced Confidence Scores for Natural Language Generation | Zhen Lin, Shubhendu Trivedi, Jimeng Sun | N/A | N/A |
| MixGR: Enhancing Retriever Generalization for Scientific Domain through Complementary Granularity | Fengyu Cai, Xinran Zhao, Tong Chen, Sihao Chen, Hongming Zhang, Iryna Gurevych, Heinz Koeppl | N/A | N/A |
| CARER - ClinicAl Reasoning-Enhanced Representation for Temporal Health Risk Prediction | Tuan Dung Nguyen, Thanh Trung Huynh, Minh Hieu Phan, Quoc Viet Hung Nguyen, Phi Le Nguyen | N/A | N/A |
| “In-Dialogues We Learn”: Towards Personalized Dialogue Without Pre-defined Profiles through In-Dialogue Learning | Chuanqi Cheng, Quan Tu, Wei Wu, Shuo Shang, Cunli Mao, Zhengtao Yu, Rui Yan | N/A | N/A |
| Encourage or Inhibit Monosemanticity? Revisit Monosemanticity from a Feature Decorrelation Perspective | Hanqi Yan, Yanzheng Xiang, Guangyi Chen, Yifei Wang, Lin Gui, Yulan He | N/A | N/A |
| Enhancing Language Model Factuality via Activation-Based Confidence Calibration and Guided Decoding | Xin Liu, Farima Fatahi Bayat, Lu Wang | N/A | N/A |
| Reasoning Robustness of LLMs to Adversarial Typographical Errors | Esther Gan, Yiran Zhao, Liying Cheng, Mao Yancan, Anirudh Goyal, Kenji Kawaguchi, Min-Yen Kan, Michael Shieh | N/A | N/A |
| InferAligner: Inference-Time Alignment for Harmlessness through Cross-Model Guidance | Pengyu Wang, Dong Zhang, Linyang Li, Chenkun Tan, Xinghao Wang, Mozhi Zhang, Ke Ren, Botian Jiang, Xipeng Qiu | N/A | N/A |
| Belief Revision: The Adaptability of Large Language Models Reasoning | Bryan Wilie, Samuel Cahyawijaya, Etsuko Ishii, Junxian He, Pascale Fung | N/A | N/A |
| Fisher Information-based Efficient Curriculum Federated Learning with Large Language Models | Ji Liu, Jiaxiang Ren, Ruoming Jin, Zijie Zhang, Yang Zhou, Patrick Valduriez, Dejing Dou | N/A | N/A |
| Bio-RFX: Refining Biomedical Extraction via Advanced Relation Classification and Structural Constraints | Minjia Wang, Fangzhou Liu, Xiuxing Li, Bowen Dong, Zhenyu Li, Tengyu Pan, Jianyong Wang | N/A | N/A |
| Decoding Matters: Addressing Amplification Bias and Homogeneity Issue in Recommendations for Large Language Models | Keqin Bao, Jizhi Zhang, Yang Zhang, Xinyue Huo, Chong Chen, Fuli Feng | N/A | N/A |
| LLMs Are Prone to Fallacies in Causal Inference | Nitish Joshi, Abulhair Saparov, Yixin Wang, He He | N/A | N/A |
| Roleplay-doh: Enabling Domain-Experts to Create LLM-simulated Patients via Eliciting and Adhering to Principles | Ryan Louie, Ananjan Nandi, William Fang, Cheng Chang, Emma Brunskill, Diyi Yang | N/A | N/A |
| The Lou Dataset - Exploring the Impact of Gender-Fair Language in German Text Classification | Andreas Waldis, Joel Birrer, Anne Lauscher, Iryna Gurevych | N/A | N/A |
| When Generative Adversarial Networks Meet Sequence Labeling Challenges | Yu Tong, Ge Chen, Guokai Zheng, Rui Li, Jiang Dazhi | N/A | N/A |
| Evidence-Focused Fact Summarization for Knowledge-Augmented Zero-Shot Question Answering | Sungho Ko, Hyunjin Cho, Hyungjoo Chae, Jinyoung Yeo, Dongha Lee | N/A | N/A |
| Speechworthy Instruction-tuned Language Models | Hyundong Justin Cho, Nicolaas Paul Jedema, Leonardo F. R. Ribeiro, Karishma Sharma, Pedro Szekely, Alessandro Moschitti, Ruben Janssen, Jonathan May | N/A | N/A |
| Data, Data Everywhere: A Guide for Pretraining Dataset Construction | Jupinder Parmar, Shrimai Prabhumoye, Joseph Jennings, Bo Liu, Aastha Jhunjhunwala, Zhilin Wang, Mostofa Patwary, Mohammad Shoeybi, Bryan Catanzaro | N/A | N/A |
| Fine-Tuning and Prompt Optimization: Two Good Steps that Work Better Together | Dilara Soylu, Christopher Potts, Omar Khattab | N/A | N/A |
| Demystifying Verbatim Memorization in Large Language Models | Jing Huang, Diyi Yang, Christopher Potts | N/A | N/A |
| AmbigNLG: Addressing Task Ambiguity in Instruction for NLG | Ayana Niwa, Hayate Iso | N/A | N/A |
| Distributional Properties of Subword Regularization | Marco Cognetta, Vilém Zouhar, Naoaki Okazaki | N/A | N/A |
| DataTales: A Benchmark for Real-World Intelligent Data Narration | Yajing Yang, Qian Liu, Min-Yen Kan | N/A | N/A |
| Towards Fast Multilingual LLM Inference: Speculative Decoding and Specialized Drafters | Euiin Yi, Taehyeon Kim, Hongseok Jeung, Du-Seong Chang, Se-Young Yun | N/A | N/A |
| GlobeSumm: A Challenging Benchmark Towards Unifying Multi-lingual, Cross-lingual and Multi-document News Summarization | Yangfan Ye, Xiachong Feng, Xiaocheng Feng, Weitao Ma, Libo Qin, Dongliang Xu, Qing Yang, Hongtao Liu, Bing Qin | N/A | N/A |
| Breaking the Curse of Multilinguality with Cross-lingual Expert Language Models | Terra Blevins, Tomasz Limisiewicz, Suchin Gururangan, Margaret Li, Hila Gonen, Noah A. Smith, Luke Zettlemoyer | N/A | N/A |
| More Insightful Feedback for Tutoring: Enhancing Generation Mechanisms and Automatic Evaluation | Wencke Liermann, Jin-Xia Huang, Yohan Lee, Kong Joo Lee | N/A | N/A |
| Stable Language Model Pre-training by Reducing Embedding Variability | Woojin Chung, Jiwoo Hong, Na Min An, James Thorne, Se-Young Yun | N/A | N/A |
| What is lost in Normalization? Exploring Pitfalls in Multilingual ASR Model Evaluations | Kavya Manohar, Leena G Pillai | N/A | N/A |
| Diversity Over Size: On the Effect of Sample and Topic Sizes for Topic-Dependent Argument Mining Datasets | Benjamin Schiller, Johannes Daxenberger, Andreas Waldis, Iryna Gurevych | N/A | N/A |
| Kiss up, Kick down: Exploring Behavioral Changes in Multi-modal Large Language Models with Assigned Visual Personas | Seungjong Sun, Eungu Lee, Seo Yeon Baek, Seunghyun Hwang, Lee Wonbyung, Dongyan Nan, Bernard J Jansen, Jang Hyun Kim | N/A | N/A |
| ATM: Adversarial Tuning Multi-agent System Makes a Robust Retrieval-Augmented Generator | Junda Zhu, Lingyong Yan, Haibo Shi, Dawei Yin, Lei Sha | N/A | N/A |
| Dynamic Multi-granularity Attribution Network for Aspect-based Sentiment Analysis | Yanjiang Chen, Kai Zhang, hufeng, Xianquan Wang, Ruikang Li, Qi Liu | N/A | N/A |
| Unlabeled Debiasing in Downstream Tasks via Class-wise Low Variance Regularization | Shahed Masoudian, Markus Frohmann, Navid Rekabsaz, Markus Schedl | N/A | N/A |
| Large Language Models Know What is Key Visual Entity: An LLM-assisted Multimodal Retrieval for VQA | Pu Jian, Donglei Yu, Jiajun Zhang | N/A | N/A |
| Towards Probing Speech-Specific Risks in Large Multimodal Models: A Taxonomy, Benchmark, and Insights | Hao Yang, Lizhen Qu, Ehsan Shareghi, Reza Haf | N/A | N/A |
| Self-AMPLIFY: Improving Small Language Models with Self Post Hoc Explanations | Milan Bhan, Jean-Noël Vittaut, Nicolas Chesneau, Marie-Jeanne Lesot | N/A | N/A |
| What are the Generator Preferences for End-to-end Task-Oriented Dialog System? | Wanshi Xu, Xianwei Zhuang, Zhanpeng Chen, Zhihong Zhu, Xuxin Cheng, Yuexian Zou | N/A | N/A |
| Paraphrase Types Elicit Prompt Engineering Capabilities | Jan Philip Wahle, Terry Ruas, Yang Xu, Bela Gipp | N/A | N/A |
| VLEU: a Method for Automatic Evaluation for Generalizability of Text-to-Image Models | Jingtao Cao, Zhang Zheng, Hongru Wang, Kam-Fai Wong | N/A | N/A |
| Towards Online Continuous Sign Language Recognition and Translation | Ronglai Zuo, Fangyun Wei, Brian Mak | N/A | N/A |
| Mitigate Extrinsic Social Bias in Pre-trained Language Models via Continuous Prompts Adjustment | Yiwei Dai, Hengrui Gu, Ying Wang, Xin Wang | N/A | N/A |
| Split and Merge: Aligning Position Biases in LLM-based Evaluators | Zongjie Li, Chaozheng Wang, Pingchuan Ma, Daoyuan Wu, Shuai Wang, Cuiyun Gao, Yang Liu | N/A | N/A |
| Integrating Argumentation and Hate-Speech-based Techniques for Countering Misinformation | Sougata Saha, Rohini Srihari | N/A | N/A |
| BPO: Supercharging Online Preference Learning by Adhering to the Proximity of Behavior LLM | Wenda Xu, Jiachen Li, William Yang Wang, Lei Li | N/A | N/A |
| One2Set + Large Language Model: Best Partners for Keyphrase Generation | Liangying Shao, Liang Zhang, Minlong Peng, Guoqi Ma, Hao Yue, Mingming Sun, Jinsong Su | N/A | N/A |
| Unlocking Markets: A Multilingual Benchmark to Cross-Market Question Answering | Yifei Yuan, Yang Deng, Anders Søgaard, Mohammad Aliannejadi | N/A | N/A |
| ORPO: Monolithic Preference Optimization without Reference Model | Jiwoo Hong, Noah Lee, James Thorne | N/A | N/A |
| A Multi-Perspective Analysis of Memorization in Large Language Models | Bowen Chen, Namgi Han, Yusuke Miyao | N/A | N/A |
| Do LLMs suffer from Multi-Party Hangover? A Diagnostic Approach to Addressee Recognition and Response Selection in Conversations | Nicolò Penzo, Maryam Sajedinia, Bruno Lepri, Sara Tonelli, Marco Guerini | N/A | N/A |
| Code Prompting Elicits Conditional Reasoning Abilities in Text+Code LLMs | Haritz Puerto, Martin Tutek, Somak Aditya, Xiaodan Zhu, Iryna Gurevych | N/A | N/A |
| Unveiling the Role of Pretraining in Direct Speech Translation | Belen Alastruey, Gerard I. Gállego, Marta R. Costa-jussà | N/A | N/A |
| PCQPR: Proactive Conversational Question Planning with Reflection | Shasha Guo | N/A | N/A |
| CodeAgent: Autonomous Communicative Agents for Code Review | Xunzhu Tang, Kisub Kim, Yewei Song, Cedric Lothritz, Bei Li, Saad Ezzini, Haoye Tian, Jacques Klein, Tegawendé F. Bissyandé | N/A | N/A |
| TroL: Traversal of Layers for Large Language and Vision Models | Byung-Kwan Lee, Sangyun Chung, Chae Won Kim, Beomchan Park, Yong Man Ro | N/A | N/A |
| MMTE: Corpus and Metrics for Evaluating Machine Translation Quality of Metaphorical Language | Shun Wang, Ge Zhang, Han Wu, Tyler Loakman, Wenhao Huang, Chenghua Lin | N/A | N/A |
| Revisiting Supertagging for faster HPSG parsing | Olga Zamaraeva, Carlos Gómez-Rodríguez | N/A | N/A |
| Improve Dense Passage Retrieval with Entailment Tuning | Lu Dai, Hao Liu, Hui Xiong | N/A | N/A |
| ToolBeHonest: A Multi-level Hallucination Diagnostic Benchmark for Tool-Augmented Large Language Models | Yuxiang Zhang, Jing Chen, Junjie Wang, Yaxin Liu, Cheng Yang, Chufan Shi, Xinyu Zhu, Zihao Lin, Hanwen Wan, Yujiu Yang, Tetsuya Sakai, Tian Feng, Hayato Yamana | N/A | N/A |
| TEMA: Token Embeddings Mapping for Enriching Low-Resource Language Models | Rodolfo Zevallos, Núria Bel, Mireia Farrús | N/A | N/A |
| DECOR: Improving Coherence in L2 English Writing with a Novel Benchmark for Incoherence Detection, Reasoning, and Rewriting | Xuanming Zhang, Anthony Diaz, Zixun Chen, Qingyang Wu, Kun Qian, Erik Voss, Zhou Yu | N/A | N/A |
| Text2Chart31: Instruction Tuning for Chart Generation with Automatic Feedback | Fatemeh Pesaran zadeh, Juyeon Kim, Jin-Hwa Kim, Gunhee Kim | N/A | N/A |
| PrExMe: Large Scale Prompt Exploration of Open Source LLMs for Machine Translation and Summarization Evaluation | Christoph Leiter, Steffen Eger | N/A | N/A |
| Universal Vulnerabilities in Large Language Models: Backdoor Attacks for In-context Learning | Shuai Zhao, Meihuizi Jia, Anh Tuan Luu, Fengjun Pan, Jinming Wen | N/A | N/A |
| Repairs in a Block World: A New Benchmark for Handling User Corrections with Multi-Modal Language Models | Javier Chiyah-Garcia, Alessandro Suglia, Arash Eshghi | N/A | N/A |
| Beyond the Turn-Based Game: Enabling Real-Time Conversations with Duplex Models | Xinrong Zhang, Yingfa Chen, Shengding Hu, Xu Han, Zihang Xu, Yuanwei Xu, Weilin Zhao, Maosong Sun, Zhiyuan Liu | N/A | N/A |
| Strengthening Structural Inductive Biases by Pre-training to Perform Syntactic Transformations | Matthias Lindemann, Alexander Koller, Ivan Titov | N/A | N/A |
| Puzzle Solving using Reasoning of Large Language Models: A Survey | Panagiotis Giadikiaroglou, Maria Lymperaiou, Giorgos Filandrianos, Giorgos Stamou | N/A | N/A |
| SciEx: Benchmarking Large Language Models on Scientific Exams with Human Expert Grading and Automatic Grading | Tu Anh Dinh, Carlos Mullov, Leonard Bärmann, Zhaolin Li, Danni Liu, Simon Reiß, Jueun Lee, Nathan Lerzer, Jianfeng Gao, Fabian Peller-Konrad, Alexander Waibel, Tamim Asfour, Michael Beigl, Rainer Stiefelhagen, Carsten Dachsbacher, Klemens Böhm, Jan Niehues | N/A | N/A |
| Red Teaming Language Models for Processing Contradictory Dialogues | Xiaofei Wen, Bangzheng Li, Tenghao Huang, Muhao Chen | N/A | N/A |
| Fishing for Magikarp: Automatically Detecting Under-trained Tokens in Large Language Models | Sander Land, Max Bartolo | N/A | N/A |
| Reasoning or a Semblance of it? A Diagnostic Study of Transitive Reasoning in LLMs | Houman Mehrafarin, Arash Eshghi, Ioannis Konstas | N/A | N/A |
| Don’t Underestimate the Octopus - Why The Symbol Grounding Problem Does Not Apply to LLMs | Reto Gubelmann | N/A | N/A |
| Major Entity Identification: A Generalizable Alternative to Coreference Resolution | Kawshik Manikantan, Shubham Toshniwal, Makarand Tapaswi, Vineet Gandhi | N/A | N/A |
| Enhancing High-order Interaction Awareness in LLM-based Recommender Model | Xinfeng Wang, Jin Cui, Fumiyo Fukumoto, Yoshimi Suzuki | N/A | N/A |
| What Are the Odds? Language Models Are Capable of Probabilistic Reasoning | Akshay Paruchuri, Jake Garrison, Shun Liao, John B Hernandez, Jacob Sunshine, Tim Althoff, Xin Liu, Daniel McDuff | N/A | N/A |
| MARE: Multi-Aspect Rationale Extractor on Unsupervised Rationale Extraction | Han Jiang, Junwen Duan, Zhe Qu, Jianxin Wang | N/A | N/A |
| LoRA-Guard: Parameter-Efficient Guardrail Adaptation for Content Moderation of Large Language Models | Hayder Elesedy, Pedro M Esperanca, Silviu Vlad Oprea, Mete Ozay | N/A | N/A |
| “A good pun is its own reword”: Can Large Language Models Understand Puns? | Zhijun Xu, Siyu Yuan, Lingjie Chen, Deqing Yang | N/A | N/A |
| QGEval: Benchmarking Multi-dimensional Evaluation for Question Generation | Weiping Fu, Bifan Wei, Jianxiang Hu, Zhongmin Cai, Jun Liu | N/A | N/A |
| Dependency Graph Parsing as Sequence Labeling | Ana Ezquerro, David Vilares, Carlos Gómez-Rodríguez | N/A | N/A |
| NuNER: Entity Recognition Encoder Pre-training via LLM-Annotated Data | Sergei Bogdanov, Alexandre Constantin, Timothée Bernard, Benoit Crabbé, Etienne P Bernard | N/A | N/A |
| Towards a Greek Proverb Atlas: Computational Spatial Exploration and Attribution of Greek Proverbs | John Pavlopoulos, Panos Louridas, Panagiotis Filos | N/A | N/A |
| Unraveling Babel: Exploring Multilingual Activation Patterns of LLMs and Their Applications | Weize Liu, Yinlong Xu, Hongxia Xu, Jintai Chen, Xuming Hu, Jian Wu | N/A | N/A |
| Advancing Semantic Textual Similarity Modeling: A Regression Framework with Translated ReLU and Smooth K2 Loss | Bowen Zhang, Chunping Li | N/A | N/A |
| Rationalizing Transformer Predictions via End-To-End Differentiable Self-Training | Marc Felix Brinner, Sina Zarrieß | N/A | N/A |
| Segment Any Text: A Universal Approach for Robust, Efficient and Adaptable Sentence Segmentation | Markus Frohmann, Igor Sterner, Ivan Vulić, Benjamin Minixhofer, Markus Schedl | N/A | N/A |
| Applying Contrastive Learning to Code Vulnerability Type Classification | Chen Ji, Su Yang, Hongyu Sun, Yuqing Zhang | N/A | N/A |
| TheoremLlama: Transforming General-Purpose LLMs into Lean4 Experts | Ruida Wang, Jipeng Zhang, Yizhen Jia, Rui Pan, Shizhe Diao, Renjie Pi, Tong Zhang | N/A | N/A |
| Multi-Level Cross-Modal Alignment for Speech Relation Extraction | Liang Zhang, Zhen Yang, Biao Fu, Ziyao Lu, Liangying Shao, Shiyu Liu, Fandong Meng, Jie Zhou, Xiaoli Wang, Jinsong Su | N/A | N/A |
| Self-Training for Sample-Efficient Active Learning for Text Classification with Pre-Trained Language Models | Christopher Schröder, Gerhard Heyer | N/A | N/A |
| PANDA: Persona Attributes Navigation for Detecting and Alleviating Overuse Problem in Large Language Models | Jinsung Kim, Seonmin Koo, Heuiseok Lim | N/A | N/A |
| The Multilingual Alignment Prism: Aligning Global and Local Preferences to Reduce Harm | Aakanksha, Arash Ahmadian, Beyza Ermis, Seraphina Goldfarb-Tarrant, Julia Kreutzer, Marzieh Fadaee, Sara Hooker | N/A | N/A |
| Subword Segmentation in LLMs: Looking at Inflection and Consistency | Marion Di Marco, Alexander Fraser | N/A | N/A |
| Explicit, Implicit, and Scattered: Revisiting Event Extraction to Capture Complex Arguments | Omar Sharif, Joseph Gatto, Madhusudan Basak, Sarah Masud Preum | N/A | N/A |
| Let Me Teach You: Pedagogical Foundations of Feedback for Language Models | Beatriz Borges, Niket Tandon, Tanja Käser, Antoine Bosselut | N/A | N/A |
| Unknown Claims: Generation of Fact-Checking Training Examples from Unstructured and Structured Data | Jean-Flavien Bussotti, Luca Ragazzi, Giacomo Frisoni, Gianluca Moro, Paolo Papotti | N/A | N/A |
| TL-CL: Task And Language Incremental Continual Learning | Shrey Satapara, P. K. Srijith | N/A | N/A |
| Medical Adaptation of Large Language and Vision-Language Models: Are We Making Progress? | Daniel P Jeong, Saurabh Garg, Zachary Chase Lipton, Michael Oberst | N/A | N/A |
| Empowering Multi-step Reasoning across Languages via Program-Aided Language Models | Leonardo Ranaldi, Giulia Pucci | N/A | N/A |
| Do LLMs Overcome Shortcut Learning? An Evaluation of Shortcut Challenges in Large Language Models | Yu Yuan, Lili Zhao, Kai Zhang, Guangting Zheng, Qi Liu | N/A | N/A |
| ControlMath: Controllable Data Generation Promotes Math Generalist Models | Nuo Chen, Ning Wu, Jianhui Chang, Ming Gong, Linjun Shou, Dongmei Zhang, Jia Li | N/A | N/A |
| Where Am I From? Identifying Origin of LLM-generated Content | Liying Li, Yihan Bai, Minhao Cheng | N/A | N/A |
| ReadMe++: Benchmarking Multilingual Language Models for Multi-Domain Readability Assessment | Tarek Naous, Michael J Ryan, Anton Lavrouk, Mohit Chandra, Wei Xu | N/A | N/A |
| GlossLM: A Massively Multilingual Corpus and Pretrained Model for Interlinear Glossed Text | Michael Ginn, Lindia Tjuatja, Taiqi He, Enora Rice, Graham Neubig, Alexis Palmer, Lori Levin | N/A | N/A |
| GDTB: Genre Diverse Data for English Shallow Discourse Parsing across Modalities, Text Types, and Domains | Yang Janet Liu, Tatsuya Aoyama, Wesley Scivetti, Yilun Zhu, Shabnam Behzad, Lauren Elizabeth Levine, Jessica Lin, Devika Tiwari, Amir Zeldes | N/A | N/A |
| RA2FD: Distilling Faithfulness into Efficient Dialogue Systems | Zhiyuan Zhu, Yusheng Liao, Chenxin Xu, Yunfeng Guan, Yanfeng Wang, Yu Wang | N/A | N/A |
| Subjective Topic meets LLMs: Unleashing Comprehensive, Reflective and Creative Thinking through the Negation of Negation | Fangrui Lv, Kaixiong Gong, Jian Liang, Xinyu Pang, Changshui Zhang | N/A | N/A |
| Experimental Contexts Can Facilitate Robust Semantic Property Inference in Language Models, but Inconsistently | Kanishka Misra, Allyson Ettinger, Kyle Mahowald | N/A | N/A |
| Leveraging Estimated Transferability Over Human Intuition for Model Selection in Text Ranking | Jun Bai, Zhuofan Chen, Zhenzi Li, Hanhua Hong, Jianfei Zhang, Chen Li, Chenghua Lin, Wenge Rong | N/A | N/A |
| A Coordinate System for In-Context Learning | Anhao Zhao, Fanghua Ye, Jinlan Fu, Xiaoyu Shen | N/A | N/A |
| Self-Powered LLM Modality Expansion for Large Speech-Text Models | Tengfei Yu, Xuebo Liu, Zhiyi Hou, Liang Ding, Dacheng Tao, Min Zhang | N/A | N/A |
| ABSEval: An Agent-based Framework for Script Evaluation | Sirui Liang, Baoli Zhang, Jun Zhao, Kang Liu | N/A | N/A |
| Latent Concept-based Explanation of NLP Models | Xuemin Yu, Fahim Dalvi, Nadir Durrani, Marzia Nouri, Hassan Sajjad | N/A | N/A |
| Decoding with Limited Teacher Supervision Requires Understanding When to Trust the Teacher | Hyunjong Ok, Jegwang Ryu, Jaeho Lee | N/A | N/A |
| Enhancing Data Quality through Simple De-duplication: Navigating Responsible Computational Social Science Research | Yida Mu, Mali Jin, Xingyi Song, Nikolaos Aletras | N/A | N/A |
| The Mystery of the Pathological Path-star Task for Language Models | Arvid Frydenlund | N/A | N/A |
| Voices in a Crowd: Searching for clusters of unique perspectives | Nikolas Vitsakis, Amit Parekh, Ioannis Konstas | N/A | N/A |
| Neeko: Leveraging Dynamic LoRA for Efficient Multi-Character Role-Playing Agent | Xiaoyan Yu, Tongxu Luo, Yifan Wei, Fangyu Lei, Yiming Huang, Hao Peng, Liehuang Zhu | N/A | N/A |
| SLANG: New Concept Comprehension of Large Language Models | Lingrui Mei, Shenghua Liu, Yiwei Wang, Baolong Bi, Xueqi Cheng | N/A | N/A |
| Towards Interpretable Sequence Continuation: Analyzing Shared Circuits in Large Language Models | Michael Lan, Philip Torr, Fazl Barez | N/A | N/A |
| Why Does New Knowledge Create Messy Ripple Effects in LLMs? | Jiaxin Qin, Zixuan Zhang, Chi Han, Pengfei Yu, Manling Li, Heng Ji | N/A | N/A |
| Lifelong Event Detection via Optimal Transport | Viet Dao, Van-Cuong Pham, Quyen Tran, Thanh-Thien Le, Linh Van Ngo, Thien Huu Nguyen | N/A | N/A |
| SUPER: Evaluating Agents on Setting Up and Executing Tasks from Research Repositories | Ben Bogin, Kejuan Yang, Shashank Gupta, Kyle Richardson, Erin Bransom, Peter Clark, Ashish Sabharwal, Tushar Khot | N/A | N/A |
| FIRST: Teach A Reliable Large Language Model Through Efficient Trustworthy Distillation | KaShun Shum, Minrui Xu, Jianshu Zhang, Zixin Chen, Shizhe Diao, Hanze Dong, Jipeng Zhang, Muhammad Omer Raza | N/A | N/A |
| Domain adapted machine translation: What does catastrophic forgetting forget and why? | Danielle Saunders, Steve DeNeefe | N/A | N/A |
| Enhancing AI Assisted Writing with One-Shot Implicit Negative Feedback | Benjamin Towle, Ke Zhou | N/A | N/A |
| Atomic Self-Consistency for Better Long Form Generations | Raghuveer Thirukovalluru, Yukun Huang, Bhuwan Dhingra | N/A | N/A |
| “Global is Good, Local is Bad?”: Understanding Brand Bias in LLMs | Mahammed Kamruzzaman, Hieu Minh Nguyen, Gene Louis Kim | N/A | N/A |
| Optimizing Rare Word Accuracy in Direct Speech Translation with a Retrieval-and-Demonstration Approach | Siqi Li, Danni Liu, Jan Niehues | N/A | N/A |
| ACE: A LLM-based Negotiation Coaching System | Ryan Shea, Aymen Kallala, Xin Lucy Liu, Michael W. Morris, Zhou Yu | N/A | N/A |
| TransferTOD: A Generalizable Chinese Multi-Domain Task-Oriented Dialogue System with Transfer Capabilities | Ming Zhang, Caishuang Huang, Yilong Wu, Shichun Liu, Huiyuan Zheng, Yurui Dong, Yujiong Shen, Shihan Dou, Jun Zhao, Junjie Ye, Qi Zhang, Tao Gui, Xuanjing Huang | N/A | N/A |
| PATIENT-Ψ: Using Large Language Models to Simulate Patients for Training Mental Health Professionals | Ruiyi Wang, Stephanie Milani, Jamie C. Chiu, Jiayin Zhi, Shaun M. Eack, Travis Labrum, Samuel M Murphy, Nev Jones, Kate V Hardy, Hong Shen, Fei Fang, Zhiyu Chen | N/A | N/A |
| DKEC: Domain Knowledge Enhanced Multi-Label Classification for Diagnosis Prediction | Xueren Ge, Abhishek Satpathy, Ronald Dean Williams, John Stankovic, Homa Alemzadeh | N/A | N/A |
| ModSCAN: Measuring Stereotypical Bias in Large Vision-Language Models from Vision and Language Modalities | Yukun Jiang, Zheng Li, Xinyue Shen, Yugeng Liu, Michael Backes, Yang Zhang | N/A | N/A |
| Large Language Models Can Self-Correct with Key Condition Verification | Zhenyu Wu, Qingkai Zeng, Zhihan Zhang, Zhaoxuan Tan, Chao Shen, Meng Jiang | N/A | N/A |
| Learning to Write Rationally: How Information Is Distributed in Non-native Speakers’ Essays | Zixin Tang, Janet van Hell | N/A | N/A |
| Defending Against Social Engineering Attacks in the Age of LLMs | Lin Ai, Tharindu Sandaruwan Kumarage, Amrita Bhattacharjee, Zizhou Liu, Zheng Hui, Michael S. Davinroy, James Cook, Laura Cassani, Kirill Trapeznikov, Matthias Kirchner, Arslan Basharat, Anthony Hoogs, Joshua Garland, Huan Liu, Julia Hirschberg | N/A | N/A |
| Heterogeneous LoRA for Federated Fine-tuning of On-Device Foundation Models | Yae Jee Cho, Luyang Liu, Zheng Xu, Aldi Fahrezi, Gauri Joshi | N/A | N/A |
| Make Some Noise: Unlocking Language Model Parallel Inference Capability through Noisy Training | Yixuan Wang, Xianzhen Luo, Fuxuan Wei, Yijun Liu, Qingfu Zhu, Xuanyu Zhang, Qing Yang, Dongliang Xu, Wanxiang Che | N/A | N/A |
| Target-Aware Language Modeling via Granular Data Sampling | Ernie Chang, Pin-Jie Lin, Yang Li, Changsheng Zhao, Daeil Kim, Rastislav Rabatin, Zechun Liu, Yangyang Shi, Vikas Chandra | N/A | N/A |
| SPEED++: A Multilingual Event Extraction Framework for Epidemic Prediction and Preparedness | Tanmay Parekh, Jeffrey Kwan, Jiarui Yu, Sparsh Johri, Hyosang Ahn, Sreya Muppalla, Kai-Wei Chang, Wei Wang, Nanyun Peng | N/A | N/A |
| Learning from Feedback with Coupled Comprehension and Generation | Mustafa Omer Gul, Yoav Artzi | N/A | N/A |
| UNICORN: A Unified Causal Video-Oriented Language-Modeling Framework for Temporal Video-Language Tasks | Yuanhao Xiong, Yixin Nie, Haotian Liu, Boxin Wang, Jun Chen, Rong Jin, Cho-Jui Hsieh, Lorenzo Torresani, Jie Lei | N/A | N/A |
| Story Morals: Surfacing value-driven narrative schemas using large language models | David G Hobson, Haiqi Zhou, Derek Ruths, Andrew Piper | N/A | N/A |
| OATH-Frames: Characterizing Online Attitudes Towards Homelessness with LLM Assistants | Jaspreet Ranjit, Brihi Joshi, Rebecca Dorn, Laura Petry, Olga Koumoundouros, Jayne Bottarini, Peichen Liu, Eric Rice, Swabha Swayamdipta | N/A | N/A |
| AnaloBench: Benchmarking the Identification of Abstract and Long-context Analogies | Xiao Ye, Andrew Wang, Jacob Choi, Yining Lu, Shreya Sharma, Lingfeng Shen, Vijay Murari Tiyyala, Nicholas Andrews, Daniel Khashabi | N/A | N/A |
| SciER: An Entity and Relation Extraction Dataset for Datasets, Methods, and Tasks in Scientific Documents | Qi Zhang, Zhijia Chen, Huitong Pan, Cornelia Caragea, Longin Jan Latecki, Eduard Dragut | N/A | N/A |
| Analysis of Plan-based Retrieval for Grounded Text Generation | Ameya Godbole, Nicholas Monath, Seungyeon Kim, Ankit Singh Rawat, Andrew McCallum, Manzil Zaheer | N/A | N/A |
| Detecting Errors through Ensembling Prompts (DEEP): An End-to-End LLM Framework for Detecting Factual Errors | Alex Chandler, Devesh Surve, Hui Su | N/A | N/A |
| RLHF Can Speak Many Languages: Unlocking Multilingual Preference Optimization for LLMs | John Dang, Arash Ahmadian, Kelly Marchisio, Julia Kreutzer, Ahmet Üstün, Sara Hooker | N/A | N/A |
| Improving Logical Fallacy Reasoning with Logical Structure Tree | Yuanyuan Lei, Ruihong Huang | N/A | N/A |
| Chain and Causal Attention for Efficient Entity Tracking | Erwan Fagnou, Paul Caillon, Blaise Delattre, Alexandre Allauzen | N/A | N/A |
| BEEAR: Embedding-based Adversarial Removal of Safety Backdoors in Instruction-tuned Language Models | Yi Zeng, Weiyu Sun, Tran Ngoc Huynh, Dawn Song, Bo Li, Ruoxi Jia | N/A | N/A |
| A Bayesian Approach to Harnessing the Power of LLMs in Authorship Attribution | Zhengmian Hu, Tong Zheng, Heng Huang | N/A | N/A |
| FAC$^2$E: Better Understanding Large Language Model Capabilities by Dissociating Language and Cognition | Xiaoqiang Wang, Lingfei Wu, Tengfei Ma, Bang Liu | N/A | N/A |
| OpenSep: Leveraging Large Language Models with Textual Inversion for Open World Audio Separation | Tanvir Mahmud, Diana Marculescu | N/A | N/A |
| Language Concept Erasure for Language-invariant Dense Retrieval | Zhiqi Huang, Puxuan Yu, Shauli Ravfogel, James Allan | N/A | N/A |
| Learning Personalized Alignment for Evaluating Open-ended Text Generation | Danqing Wang, Kevin Yang, Hanlin Zhu, Xiaomeng Yang, Andrew Cohen, Lei Li, Yuandong Tian | N/A | N/A |
| Large Language Models Are Involuntary Truth-Tellers: Exploiting Fallacy Failure for Jailbreak Attacks | Yue Zhou, Henry Peng Zou, Barbara Di Eugenio, Yang Zhang | N/A | N/A |
| Turn Waste into Worth: Rectifying Top-$k$ Router of MoE | Zhiyuan Zeng, Qipeng Guo, Zhaoye Fei, Zhangyue Yin, Yunhua Zhou, Linyang Li, Tianxiang Sun, Hang Yan, Dahua Lin, Xipeng Qiu | N/A | N/A |
| Null-Shot Prompting: Rethinking Prompting Large Language Models With Hallucination | Pittawat Taveekitworachai, Febri Abdullah, Ruck Thawonmas | N/A | N/A |
| CommVQA: Situating Visual Question Answering in Communicative Contexts | Nandita Shankar Naik, Christopher Potts, Elisa Kreiss | N/A | N/A |
| Ouroboros: Generating Longer Drafts Phrase by Phrase for Faster Speculative Decoding | Weilin Zhao, Yuxiang Huang, Xu Han, Wang Xu, Chaojun Xiao, Xinrong Zhang, Yewei Fang, Kaihuo Zhang, Zhiyuan Liu, Maosong Sun | N/A | N/A |
| 1+1>2: Can Large Language Models Serve as Cross-Lingual Knowledge Aggregators? | Yue Huang, Chenrui Fan, Yuan Li, Siyuan Wu, Tianyi Zhou, Xiangliang Zhang, Lichao Sun | N/A | N/A |
| How to Leverage Demonstration Data in Alignment for Large Language Model? A Self-Imitation Learning Perspective | Teng Xiao, Mingxiao Li, Yige Yuan, Huaisheng Zhu, Chao Cui, Vasant G Honavar | N/A | N/A |
| Style-Specific Neurons for Steering LLMs in Text Style Transfer | Wen Lai, Viktor Hangya, Alexander Fraser | N/A | N/A |
| Adaptive Query Rewriting: Aligning Rewriters through Marginal Probability of Conversational Answers | Tianhua Zhang, Kun Li, Hongyin Luo, Xixin Wu, James R. Glass, Helen M. Meng | N/A | N/A |
| Grasping the Essentials: Tailoring Large Language Models for Zero-Shot Relation Extraction | Sizhe Zhou, Yu Meng, Bowen Jin, Jiawei Han | N/A | N/A |
| DA-Code: Agent Data Science Code Generation Benchmark for Large Language Models | Yiming Huang, Jianwen Luo, Yan Yu, Yitong Zhang, Fangyu Lei, Yifan Wei, Shizhu He, Lifu Huang, Xiao Liu, Jun Zhao, Kang Liu | N/A | N/A |
| Leveraging Context-aware Prompting for Commit Message Generation | Zhihua Jiang, Jianwei Chen, Dongning Rao, Guanghui Ye | N/A | N/A |
| Linguistic Bias in ChatGPT: Language Models Reinforce Dialect Discrimination | Eve Fleisig, Genevieve Smith, Madeline Bossi, Ishita Rustagi, Xavier Yin, Dan Klein | N/A | N/A |
| Lifelong Knowledge Editing for LLMs with Retrieval-Augmented Continuous Prompt Learning | Qizhou Chen, Taolin Zhang, Xiaofeng He, Dongyang Li, Chengyu Wang, Longtao Huang, Hui Xue | N/A | N/A |
| A Learning Rate Path Switching Training Paradigm for Version Updates of Large Language Models | Zhihao Wang, Shiyu Liu, Jianheng Huang, Wang Zheng, YiXuan Liao, Xiaoxin Chen, Junfeng Yao, Jinsong Su | N/A | N/A |
| Zero-Shot Cross-Lingual NER Using Phonemic Representations for Low-Resource Languages | Jimin Sohn, Haeji Jung, Alex Cheng, Jooeon Kang, Yilin Du, David R Mortensen | N/A | N/A |
| An Analysis and Mitigation of the Reversal Curse | Ang Lv, Kaiyi Zhang, Shufang Xie, Quan Tu, Yuhan Chen, Ji-Rong Wen, Rui Yan | N/A | N/A |
| Exploring the Practicality of Generative Retrieval on Dynamic Corpora | Soyoung Yoon, Chaeeun Kim, Hyunji Lee, Joel Jang, Sohee Yang, Minjoon Seo | N/A | N/A |
| OneNet: A Fine-Tuning Free Framework for Few-Shot Entity Linking via Large Language Model Prompting | Xukai Liu, Ye Liu, Kai Zhang, Kehang Wang, Qi Liu, Enhong Chen | N/A | N/A |
| Gotcha! Don’t trick me with unanswerable questions! Self-aligning Large Language Models for Proactively Responding to Unknown Questions | Yang Deng, Yong Zhao, Moxin Li, See-Kiong Ng, Tat-Seng Chua | N/A | N/A |
| Fewer is More: Boosting Math Reasoning with Reinforced Context Pruning | Xijie Huang, Li Lyna Zhang, Kwang-Ting Cheng, Fan Yang, Mao Yang | N/A | N/A |
| Large Language Models in the Clinic: A Comprehensive Benchmark | Fenglin Liu, Zheng Li, Qingyu Yin, Jingfeng Yang, Xianfeng Tang, Chen Luo, Ming Zeng, Haoming Jiang, Yifan Gao, Priyanka Nigam, Sreyashi Nag, Hongjian Zhou, Yining Hua, Xuan Zhou, Omid Rohanian, Anshul Thakur, Lei Clifton, Bing Yin, David A. Clifton | N/A | N/A |
| Holistic Automated Red Teaming for Large Language Models through Top-Down Test Case Generation and Multi-turn Interaction | Jinchuan Zhang, Yan Zhou, Yaxin Liu, Ziming Li, Songlin Hu | N/A | N/A |
| Householder Pseudo-Rotation: A Novel Approach to Activation Editing in LLMs with Direction-Magnitude Perspective | Van-Cuong Pham, Thien Huu Nguyen | N/A | N/A |
| DynamicER: Resolving Emerging Mentions to Dynamic Entities for RAG | Jinyoung Kim, Dayoon Ko, Gunhee Kim | N/A | N/A |
| Preserving Generalization of Language models in Few-shot Continual Relation Extraction | Quyen Tran, Nguyen Xuan Thanh, Nguyen Hoang Anh, Nam Le Hai, Trung Le, Linh Van Ngo, Thien Huu Nguyen | N/A | N/A |
| A Systematic Survey and Critical Review on Evaluating Large Language Models: Challenges, Limitations, and Recommendations | Md Tahmid Rahman Laskar, Sawsan Alqahtani, M Saiful Bari, Mizanur Rahman, Mohammad Abdullah Matin Khan, Haidar Khan, Israt Jahan, Amran Bhuiyan, Chee Wei Tan, Md Rizwan Parvez, Enamul Hoque, Shafiq Joty, Jimmy Huang | N/A | N/A |
| Consecutive Batch Model Editing with HooK Layers | Shuaiyi Li, Yang Deng, Deng Cai, Hongyuan Lu, Liang Chen, Wai Lam | N/A | N/A |
| Topic-Oriented Open Relation Extraction with A Priori Seed Generation | Linyi Ding, Jinfeng Xiao, Sizhe Zhou, Chaoqi Yang, Jiawei Han | N/A | N/A |
| Related Work and Citation Text Generation: A Survey | Xiangci Li, Jessica Ouyang | N/A | N/A |
| Curriculum Consistency Learning for Conditional Sentence Generation | Liangxin Liu, Xuebo Liu, Lian Lian, Shengjun Cheng, Jun Rao, Tengfei Yu, Hexuan Deng, Min Zhang | N/A | N/A |
| A Systematic Analysis of Large Language Models as Soft Reasoners: The Case of Syllogistic Inferences | Leonardo Bertolazzi, Albert Gatt, Raffaella Bernardi | N/A | N/A |
| Pre-training Cross-lingual Open Domain Question Answering with Large-scale Synthetic Supervision | Fan Jiang, Tom Drummond, Trevor Cohn | N/A | N/A |
| Towards an Open-Source Speech Foundation Model for EU: 950,000 Hours of Open-Source Compliant Speech Data for EU Languages | Marco Gaido, Sara Papi, Luisa Bentivogli, Alessio Brutti, Mauro Cettolo, Roberto Gretter, Marco Matassoni, Mohamed Nabih, Matteo Negri | N/A | N/A |
| Improving Knowledge Graph Completion with Structure-Aware Supervised Contrastive Learning | Jiashi Lin, Lifang Wang, Xinyu Lu, Zhongtian Hu, Wei Zhang, Wenxuan Lu | N/A | N/A |
| Contribution of Linguistic Typology to Universal Dependency Parsing: An Empirical Investigation | Ali Basirat, Navid Baradaran Hemmati | N/A | N/A |
| TRoTR: A Framework for Evaluating the Re-contextualization of Text Reuse | Francesco Periti, Pierluigi Cassotti, Stefano Montanelli, Nina Tahmasebi, Dominik Schlechtweg | N/A | N/A |
| Structured Optimal Brain Pruning for Large Language Models | Jiateng Wei, Quan Lu, Ning Jiang, Siqi Li, Jingyang Xiang, Jun Chen, Yong Liu | N/A | N/A |
| Automatically Generated Definitions and their utility for Modeling Word Meaning | Francesco Periti, David Alfter, Nina Tahmasebi | N/A | N/A |
| How Do Your Code LLMs perform? Empowering Code Instruction Tuning with Really Good Data | Yejie Wang, Keqing He, Dayuan Fu, Zhuoma GongQue, Heyang Xu, Yanxu Chen, Zhexu Wang, Yujia Fu, Guanting Dong, Muxi Diao, Jingang Wang, Mengdi Zhang, Xunliang Cai, Weiran Xu | N/A | N/A |
| MINT: A Benchmark for Evaluating Instructed Information Retrieval | Weiwei Sun, Zhengliang Shi, Wu Jiu Long, Lingyong Yan, Xinyu Ma, Yiding Liu, Min Cao, Dawei Yin, Zhaochun Ren | N/A | N/A |
| Rethinking the Evaluation of In-Context Learning for LLMs | Guoxin Yu, Lemao Liu, Mo Yu, Yue Yu, Xiang Ao | N/A | N/A |
| Cluster-Norm for Unsupervised Probing of Knowledge | Walter Laurito, Sharan Maiya, Grégoire Dhimoïla, Owen Ho Wan Yeung, Kaarel Hänni | N/A | N/A |
| Hopping Too Late: Exploring the Limitations of Large Language Models on Multi-Hop Queries | Eden Biran, Daniela Gottesman, Sohee Yang, Mor Geva, Amir Globerson | N/A | N/A |
| Enhancing Training Data Attribution for Large Language Models with Fitting Error Consideration | Kangxi Wu, Liang Pang, Huawei Shen, Xueqi Cheng | N/A | N/A |
| Where am I? Large Language Models Wandering between Semantics and Structures in Long Contexts | Seonmin Koo, Jinsung Kim, YoungJoon Jang, Chanjun Park, Heuiseok Lim | N/A | N/A |
| KARL: Knowledge-Aware Retrieval and Representations aid Retention and Learning in Students | Matthew Shu, Nishant Balepur, Shi Feng, Jordan Lee Boyd-Graber | N/A | N/A |
| Large Language Models Can Be Contextual Privacy Protection Learners | Yijia Xiao, Yiqiao Jin, Yushi Bai, Yue Wu, Xianjun Yang, Xiao Luo, Wenchao Yu, Xujiang Zhao, Yanchi Liu, Quanquan Gu, Haifeng Chen, Wei Wang, Wei Cheng | N/A | N/A |
| A SMART Mnemonic Sounds like “Glue Tonic”: Mixing LLMs with Student Feedback to Make Mnemonic Learning Stick | Nishant Balepur, Matthew Shu, Alexander Hoyle, Alison Robey, Shi Feng, Seraphina Goldfarb-Tarrant, Jordan Lee Boyd-Graber | N/A | N/A |
| Mixture-of-Skills: Learning to Optimize Data Usage for Fine-Tuning Large Language Models | Minghao Wu, Thuy-Trang Vu, Lizhen Qu, Reza Haf | N/A | N/A |
| MolTRES: Improving Chemical Language Representation Learning for Molecular Property Prediction | Jun-Hyung Park, Yeachan Kim, Mingyu Lee, Hyuntae Park, SangKeun Lee | N/A | N/A |
| First Heuristic Then Rational: Dynamic Use of Heuristics in Language Model Reasoning | Yoichi Aoki, Keito Kudo, Tatsuki Kuribayashi, Shusaku Sone, Masaya Taniguchi, Keisuke Sakaguchi, Kentaro Inui | N/A | N/A |
| Tools Fail: Detecting Silent Errors in Faulty Tools | Jimin Sun, So Yeon Min, Yingshan Chang, Yonatan Bisk | N/A | N/A |
| Pcc-tuning: Breaking the Contrastive Learning Ceiling in Semantic Textual Similarity | Bowen Zhang, Chunping Li | N/A | N/A |
| Cross-lingual Back-Parsing: Utterance Synthesis from Meaning Representation for Zero-Resource Semantic Parsing | Deokhyung Kang, Seonjeong Hwang, Yunsu Kim, Gary Lee | N/A | N/A |
| Shaking Up VLMs: Comparing Transformers and Structured State Space Models for Vision & Language Modeling | Georgios Pantazopoulos, Malvina Nikandrou, Alessandro Suglia, Oliver Lemon, Arash Eshghi | N/A | N/A |
| Are LLMs Good Zero-Shot Fallacy Classifiers? | Fengjun Pan, Xiaobao Wu, Zongrui Li, Anh Tuan Luu | N/A | N/A |
| The Mystery of In-Context Learning: A Comprehensive Survey on Interpretation and Analysis | Yuxiang Zhou, Jiazheng Li, Yanzheng Xiang, Hanqi Yan, Lin Gui, Yulan He | N/A | N/A |
| More DWUGs: Extending and Evaluating Word Usage Graph Datasets in Multiple Languages | Dominik Schlechtweg, Pierluigi Cassotti, Bill Noble, David Alfter, Sabine Schulte im Walde, Nina Tahmasebi | N/A | N/A |
| Vision-Language Model Fine-Tuning via Simple Parameter-Efficient Modification | Ming Li, Jike Zhong, Chenxin Li, Liuzhuozheng Li, Nie Lin, Masashi Sugiyama | N/A | N/A |
| ECIS-VQG: Generation of Entity-centric Information-seeking Questions from Videos | Arpan Phukan, Manish Gupta, Asif Ekbal | N/A | N/A |
| Distractor Generation in Multiple-Choice Tasks: A Survey of Methods, Datasets, and Evaluation | Elaf Alhazmi, Quan Z. Sheng, Wei Emma Zhang, Munazza Zaib, Ahoud Alhazmi | N/A | N/A |
| Evaluating $n$-Gram Novelty of Language Models Using Rusty-DAWG | William Merrill, Noah A. Smith, Yanai Elazar | N/A | N/A |
| ASL STEMpedia: Dataset and Benchmark for Interpreting STEM Articles | Kayo Yin, Chinmay Singh, Fyodor O Minakov, Vanessa Milan, Hal Daumé III, Cyril Zhang, Alex Xijie Lu, Danielle Bragg | N/A | N/A |
| Can Automatic Metrics Assess High-Quality Translations? | Sweta Agrawal, António Farinhas, Ricardo Rei, Andre Martins | N/A | N/A |
| Modeling User Preferences with Automatic Metrics: Creating a High-Quality Preference Dataset for Machine Translation | Sweta Agrawal, José G. C. de Souza, Ricardo Rei, António Farinhas, Gonçalo Faria, Patrick Fernandes, Nuno M Guerreiro, Andre Martins | N/A | N/A |
| DC-Instruct: An Effective Framework for Generative Multi-intent Spoken Language Understanding | Bowen Xing, Lizi Liao, Minlie Huang, Ivor Tsang | N/A | N/A |
| KnowTuning: Knowledge-aware Fine-tuning for Large Language Models | Yougang Lyu, Lingyong Yan, Shuaiqiang Wang, Haibo Shi, Dawei Yin, Pengjie Ren, Zhumin Chen, Maarten de Rijke, Zhaochun Ren | N/A | N/A |
| SecCoder: Towards Generalizable and Robust Secure Code Generation | Boyu Zhang, Tianyu Du, Junkai Tong, Xuhong Zhang, Kingsum Chow, Sheng Cheng, Xun Wang, Jianwei Yin | N/A | N/A |
| Nash CoT: Multi-Path Inference with Preference Equilibrium | Ziqi Zhang, Cunxiang Wang, Xiao Xiong, Yue Zhang, Donglin Wang | N/A | N/A |
| Scalable Efficient Training of Large Language Models with Low-dimensional Projected Attention | Xingtai Lv, Ning Ding, Kaiyan Zhang, Ermo Hua, Ganqu Cui, Bowen Zhou | N/A | N/A |
| Small Agent Can Also Rock! Empowering Small Language Models as Hallucination Detector | Xiaoxue Cheng, Junyi Li, Xin Zhao, Hongzhi Zhang, Fuzheng Zhang, Di Zhang, Kun Gai, Ji-Rong Wen | N/A | N/A |
| Interpretable Composition Attribution Enhancement for Visio-linguistic Compositional Understanding | Wei Li, Zhen Huang, Xinmei Tian, Le Lu, Houqiang Li, Xu Shen, Jieping Ye | N/A | N/A |
| LLM Task Interference: An Initial Study on the Impact of Task-Switch in Conversational History | Akash Gupta, Ivaxi Sheth, Vyas Raina, Mark Gales, Mario Fritz | N/A | N/A |
| Social Bias Probing: Fairness Benchmarking for Language Models | Marta Marchiori Manerba, Karolina Stanczak, Riccardo Guidotti, Isabelle Augenstein | N/A | N/A |
| Chain-of-Note: Enhancing Robustness in Retrieval-Augmented Language Models | Wenhao Yu, Hongming Zhang, Xiaoman Pan, Peixin Cao, Kaixin Ma, Jian Li, Hongwei Wang, Dong Yu | N/A | N/A |
| DynaThink: Fast or Slow? A Dynamic Decision-Making Framework for Large Language Models | Jiabao Pan, Yan Zhang, Chen Zhang, Zuozhu Liu, Hongwei Wang, Haizhou Li | N/A | N/A |
| Revisiting Automated Evaluation for Long-form Table Question Answering in the Era of Large Language Models | Yuqi Wang, Lyuhao Chen, Yilun Zhao | N/A | N/A |
| Weak Reward Model Transforms Generative Models into Robust Causal Event Extraction Systems | Italo Luis da Silva, Hanqi Yan, Lin Gui, Yulan He | N/A | N/A |
| Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning | Zhihan Zhang, Tao Ge, Zhenwen Liang, Wenhao Yu, Dian Yu, Mengzhao Jia, Dong Yu, Meng Jiang | N/A | N/A |
| FinDVer: Explainable Claim Verification over Long and Hybrid-content Financial Documents | Yilun Zhao, Yitao Long, Tintin Jiang, Weiyuan Chen, Chengye Wang, Hongjun Liu, Xiangru Tang, Yiming Zhang, Chen Zhao, Arman Cohan | N/A | N/A |
| Extracting Prompts by Inverting LLM Outputs | Collin Zhang, John Xavier Morris, Vitaly Shmatikov | N/A | N/A |
| BiasAlert: A Plug-and-play Tool for Social Bias Detection in LLMs | Zhiting Fan, Ruizhe Chen, Ruiling Xu, Zuozhu Liu | N/A | N/A |
| VHASR: A Multimodal Speech Recognition System With Vision Hotwords | Jiliang Hu, Zuchao Li, Ping Wang, Haojun Ai, Lefei Zhang, Hai Zhao | N/A | N/A |
| A Fundamental Trade-off in Aligned Language Models and its Relation to Sampling Adaptors | Naaman Tan, Josef Valvoda, Tianyu Liu, Anej Svete, Yanxia Qin, Min-Yen Kan, Ryan Cotterell | N/A | N/A |
| Bridging Local Details and Global Context in Text-Attributed Graphs | Yaoke Wang, Yun Zhu, Wenqiao Zhang, Yueting Zhuang, liyunfei, Siliang Tang | N/A | N/A |
| Building Resources for Emakhuwa: Machine Translation and News Classification Benchmarks | Felermino D. M. A. Ali, Henrique Lopes Cardoso, Rui Sousa-Silva | N/A | N/A |
| RepMatch: Quantifying Cross-Instance Similarities in Representation Space | Mohammad Reza Modarres, Sina Abbasi, Mohammad Taher Pilehvar | N/A | N/A |
| Commonsense Knowledge Editing Based on Free-Text in LLMs | Xiusheng Huang, Yequan Wang, Jun Zhao, Kang Liu | N/A | N/A |
| A Closer Look at Multidimensional Online Political Incivility | Sagi Pendzel, Nir Lotan, Alon Zoizner, Einat Minkov | N/A | N/A |
| Leveraging BERT and TFIDF Features for Short Text Clustering via Alignment-Promoting Co-Training | Zetong Li, Qinliang Su, Shijing Si, Jianxing Yu | N/A | N/A |
| Applying Intrinsic Debiasing on Downstream Tasks: Challenges and Considerations for Machine Translation | Bar Iluz, Yanai Elazar, Asaf Yehudai, Gabriel Stanovsky | N/A | N/A |
| Unsupervised Named Entity Disambiguation for Low Resource Domains | Debarghya Datta, Soumajit Pramanik | N/A | N/A |
| SparseGrad: A Selective Method for Efficient Fine-tuning of MLP Layers | Viktoriia A. Chekalina, Anna Rudenko, Gleb Mezentsev, Aleksandr Mikhalev, Alexander Panchenko, Ivan Oseledets | N/A | N/A |
| MoCoKGC: Momentum Contrast Entity Encoding for Knowledge Graph Completion | Qingyang Li, Yanru Zhong, Yuchu Qin | N/A | N/A |
| ActPlan-1K: Benchmarking the Procedural Planning Ability of Visual Language Models in Household Activities | Ying Su, Zhan Ling, Haochen Shi, Cheng Jiayang, Yauwai Yim, Yangqiu Song | N/A | N/A |
| Shortcuts Arising from Contrast: Towards Effective and Lightweight Clean-Label Attacks in Prompt-Based Learning | Xiaopeng Xie, Ming Yan, Xiwen Zhou, Chenlong Zhao, Suli Wang, Yong Zhang, Joey Tianyi Zhou | N/A | N/A |
| GRASS: Compute Efficient Low-Memory LLM Training with Structured Sparse Gradients | Aashiq Muhamed, Oscar Li, David Woodruff, Mona T. Diab, Virginia Smith | N/A | N/A |
| RaTEScore: A Metric for Entity-Aware Radiology Text Similarity | Weike Zhao, Chaoyi Wu, Xiaoman Zhang, Ya Zhang, Weidi Xie | N/A | N/A |
| HalluMeasure: Fine-grained Hallucination Measurement Using Chain-of-Thought Reasoning | Shayan Ali Akbar, Md Mosharaf Hossain, Tess Wood, Si-Chi Chin, Victor Alvarez, Erica M Salinas, Erwin Cornejo | N/A | N/A |
| Learning to Rank Salient Content for Query-focused Summarization | Sajad Sotudeh, Nazli Goharian | N/A | N/A |
| Are Large Language Models Good Classifiers? A Study on Edit Intent Classification in Scientific Document Revisions | Qian Ruan, Ilia Kuznetsov, Iryna Gurevych | N/A | N/A |
| LitSearch: A Retrieval Benchmark for Scientific Literature Search | Anirudh Ajith, Mengzhou Xia, Alexis Chevalier, Tanya Goyal, Danqi Chen, Tianyu Gao | N/A | N/A |
| Open-world Multi-label Text Classification with Extremely Weak Supervision | Xintong Li, Jinya Jiang, Ria Dharmani, Jayanth Srinivasa, Gaowen Liu, Jingbo Shang | N/A | N/A |
| LMs learn governing principles of dynamical systems, revealing an in-context neural scaling law | Toni J.B. Liu, Nicolas Boulle, Raphaël Sarfati, Christopher Earls | N/A | N/A |
| AKEW: Assessing Knowledge Editing in the Wild | Xiaobao Wu, Liangming Pan, William Yang Wang, Anh Tuan Luu | N/A | N/A |
| CopyBench: Measuring Literal and Non-Literal Reproduction of Copyright-Protected Text in Language Model Generation | Tong Chen, Akari Asai, Niloofar Mireshghallah, Sewon Min, James Grimmelmann, Yejin Choi, Hannaneh Hajishirzi, Luke Zettlemoyer, Pang Wei Koh | N/A | N/A |
| Dense X Retrieval: What Retrieval Granularity Should We Use? | Tong Chen, Hongwei Wang, Sihao Chen, Wenhao Yu, Kaixin Ma, Xinran Zhao, Hongming Zhang, Dong Yu | N/A | N/A |
| Decoding Susceptibility: Modeling Misbelief to Misinformation Through a Computational Approach | Yanchen Liu, Mingyu Derek Ma, Wenna Qin, Azure Zhou, Jiaao Chen, Weiyan Shi, Wei Wang, Diyi Yang | N/A | N/A |
| Layer by Layer: Uncovering Where Multi-Task Learning Happens in Instruction-Tuned Large Language Models | Zheng Zhao, Yftah Ziser, Shay B Cohen | N/A | N/A |
| XDetox: Text Detoxification with Token-Level Toxicity Explanations | Beomseok Lee, Hyunwoo Kim, Keon Kim, Yong Suk Choi | N/A | N/A |
| Optimizing Chinese Lexical Simplification Across Word Types: A Hybrid Approach | ZiHao Xiao, Jiefu Gong, Shijin Wang, Wei Song | N/A | N/A |
| Evaluating LLMs’ Capability in Satisfying Lexical Constraints | Bingxuan Li, Yiwei Wang, Tao Meng, Nanyun Peng, Kai-Wei Chang | N/A | N/A |
| Joint Pre-Encoding Representation and Structure Embedding for Efficient and Low-Resource Knowledge Graph Completion | Chenyu Qiu, Pengjiang Qian, Chuang Wang, Jian Yao, Li Liu, Fang Wei, Eddie Y.K. Eddie | N/A | N/A |
| Improving Discriminative Capability of Reward Models in RLHF Using Contrastive Learning | Lu Chen, Rui Zheng, Binghai Wang, Senjie Jin, Caishuang Huang, Junjie Ye, Zhihao Zhang, Yuhao Zhou, Zhiheng Xi, Tao Gui, Qi Zhang, Xuanjing Huang | N/A | N/A |
| RoCEL: Advancing Table Entity Linking through Distinctive Row and Column Contexts | Yuanzheng Wang, Yixing Fan, Jiafeng Guo, Ruqing Zhang, Xueqi Cheng | N/A | N/A |
| Exploring the Role of Reasoning Structures for Constructing Proofs in Multi-Step Natural Language Reasoning with Large Language Models | Zi’ou Zheng, Christopher Malon, Martin Renqiang Min, Xiaodan Zhu | N/A | N/A |
| Efficient Overshadowed Entity Disambiguation by Mitigating Shortcut Learning | Panuthep Tasawong, Peerat Limkonchotiwat, Potsawee Manakul, Can Udomcharoenchaikit, Ekapol Chuangsuwanich, Sarana Nutanong | N/A | N/A |
| MetaBench: Planning of Multiple APIs from Various APPs for Complex User Instruction | Hongru Wang, Rui Wang, Boyang Xue, Heming Xia, Jingtao Cao, Zeming Liu, Jeff Z. Pan, Kam-Fai Wong | N/A | N/A |
| Not Everything is All You Need: Toward Low-Redundant Optimization for Large Language Model Alignment | Zhipeng Chen, Kun Zhou, Xin Zhao, Jingyuan Wang, Ji-Rong Wen | N/A | N/A |
| AudioVSR: Enhancing Video Speech Recognition with Audio Data | Xiaoda Yang, Xize Cheng, Jiaqi Duan, Hongshun Qiu, Minjie Hong, Minghui Fang, Shengpeng Ji, Jialong Zuo, Zhiqing Hong, Zhimeng Zhang, Tao Jin | N/A | N/A |
| ECCO: Can We Improve Model-Generated Code Efficiency Without Sacrificing Functional Correctness? | Siddhant Waghjale, Vishruth Veerendranath, Zhiruo Wang, Daniel Fried | N/A | N/A |
| Ladder: A Model-Agnostic Framework Boosting LLM-based Machine Translation to the Next Level | Zhaopeng Feng, Ruizhe Chen, Yan Zhang, Zijie Meng, Zuozhu Liu | N/A | N/A |
| Re-ReST: Reflection-Reinforced Self-Training for Language Agents | Zi-Yi Dou, Cheng-Fu Yang, Xueqing Wu, Kai-Wei Chang, Nanyun Peng | N/A | N/A |
| Effective Synthetic Data and Test-Time Adaptation for OCR Correction | Shuhao Guan, Cheng Xu, Moule Lin, Derek Greene | N/A | N/A |
| SRF: Enhancing Document-Level Relation Extraction with a Novel Secondary Reasoning Framework | Fu Zhang, Qi Miao, Jingwei Cheng, Hongsen Yu, Yi Yan, Xin Li, Yongxue Wu | N/A | N/A |
| FineCops-Ref: A new Dataset and Task for Fine-Grained Compositional Referring Expression Comprehension | Junzhuo Liu, Xuzheng Yang, Weiwei Li, Peng Wang | N/A | N/A |
| Exploring the Learning Capabilities of Language Models using LEVERWORLDS | Eitan Wagner, Amir Feder, Omri Abend | N/A | N/A |
| CONTESTS: a Framework for Consistency Testing of Span Probabilities in Language Models | Eitan Wagner, Yuli Slavutsky, Omri Abend | N/A | N/A |
| DocEditAgent: Document Structure Editing Via Multimodal LLM Grounding | Manan Suri, Puneet Mathur, Franck Dernoncourt, Rajiv Jain, Vlad I Morariu, Ramit Sawhney, Preslav Nakov, Dinesh Manocha | N/A | N/A |
| DogeRM: Equipping Reward Models with Domain Knowledge through Model Merging | Tzu-Han Lin, Chen-An Li, Hung-yi Lee, Yun-Nung Chen | N/A | N/A |
| Understanding Slang with LLMs: Modelling Cross-Cultural Nuances through Paraphrasing | Ifeoluwa Wuraola, Nina Dethlefs, Daniel Marciniak | N/A | N/A |
| Unlocking Anticipatory Text Generation: A Constrained Approach for Large Language Models Decoding | Lifu Tu, Semih Yavuz, Jin Qu, Jiacheng Xu, Rui Meng, Caiming Xiong, Yingbo Zhou | N/A | N/A |
| Re-Reading Improves Reasoning in Large Language Models | Xiaohan Xu, Chongyang Tao, Tao Shen, Can Xu, Hongbo Xu, Guodong Long, Jian-Guang Lou, Shuai Ma | N/A | N/A |
| Adaptive Axes: A Pipeline for In-domain Social Stereotype Analysis | Qingcheng Zeng, Mingyu Jin, Rob Voigt | N/A | N/A |
| ERVQA: A Dataset to Benchmark the Readiness of Large Vision Language Models in Hospital Environments | Sourjyadip Ray, Kushal Gupta, Soumi Kundu, Dr Payal Arvind Kasat, Somak Aditya, Pawan Goyal | N/A | N/A |
| Human-LLM Hybrid Text Answer Aggregation for Crowd Annotations | Jiyi Li | N/A | N/A |
| Improve Student’s Reasoning Generalizability through Cascading Decomposed CoTs Distillation | Chengwei Dai, Kun Li, Wei Zhou, Songlin Hu | N/A | N/A |
| Revisiting Supervised Contrastive Learning for Microblog Classification | Junbo Huang, Ricardo Usbeck | N/A | N/A |
| BaitAttack: Alleviating Intention Shift in Jailbreak Attacks via Adaptive Bait Crafting | Rui Pu, Chaozhuo Li, Rui Ha, Litian Zhang, Lirong Qiu, Xi Zhang | N/A | N/A |
| Images Speak Louder than Words: Understanding and Mitigating Bias in Vision-Language Model from a Causal Mediation Perspective | Zhaotian Weng, Zijun Gao, Jerone Andrews, Jieyu Zhao | N/A | N/A |
| Mitigating the Language Mismatch and Repetition Issues in LLM-based Machine Translation via Model Editing | Weichuan Wang, Zhaoyi Li, Defu Lian, Chen Ma, Linqi Song, Ying Wei | N/A | N/A |
| SciAgent: Tool-augmented Language Models for Scientific Reasoning | Yubo Ma, Zhibin Gou, Junheng Hao, Ruochen Xu, Shuohang Wang, Liangming Pan, Yujiu Yang, Yixin Cao, Aixin Sun | N/A | N/A |
| Global Reward to Local Rewards: Multimodal-Guided Decomposition for Improving Dialogue Agents | Dong Won Lee, Hae Won Park, Yoon Kim, Cynthia Breazeal, Louis-Philippe Morency | N/A | N/A |
| Towards Measuring and Modeling “Culture” in LLMs: A Survey | Muhammad Farid Adilazuarda, Sagnik Mukherjee, Pradhyumna Lavania, Siddhant Shivdutt Singh, Alham Fikri Aji, Jacki O’Neill, Ashutosh Modi, Monojit Choudhury | N/A | N/A |
| ESC-Eval: Evaluating Emotion Support Conversations in Large Language Models | Haiquan Zhao, Lingyu Li, Shisong Chen, Shuqi Kong, Jiaan Wang, Kexin Huang, Tianle Gu, Yixu Wang, Jian Wang, Liang Dandan, Zhixu Li, Yan Teng, Yanghua Xiao, Yingchun Wang | N/A | N/A |
| Cultural Conditioning or Placebo? On the Effectiveness of Socio-Demographic Prompting | Sagnik Mukherjee, Muhammad Farid Adilazuarda, Sunayana Sitaram, Kalika Bali, Alham Fikri Aji, Monojit Choudhury | N/A | N/A |
| Text Fluoroscopy: Detecting LLM-Generated Text through Intrinsic Features | Xiao Yu, Kejiang Chen, Qi Yang, Weiming Zhang, Nenghai Yu | N/A | N/A |
| Hate Personified: Investigating the role of LLMs in content moderation pipeline for hate speech | Sarah Masud, Sahajpreet Singh, Viktor Hangya, Alexander Fraser, Tanmoy Chakraborty | N/A | N/A |
| Temporally Consistent Factuality Probing for Large Language Models | Ashutosh Bajpai, Aaryan Goyal, Atif Anwer, Tanmoy Chakraborty | N/A | N/A |
| A Comparison of Language Modeling and Translation as Multilingual Pretraining Objectives | Zihao Li, Shaoxiong Ji, Timothee Mickus, Vincent Segonne, Jörg Tiedemann | N/A | N/A |
| Can LLMs replace Neil deGrasse Tyson? Evaluating the Reliability of LLMs as Science Communicators | Prasoon Bajpai, Niladri Chatterjee, Subhabrata Dutta, Tanmoy Chakraborty | N/A | N/A |
| LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-Training | Tong Zhu, Xiaoye Qu, Daize Dong, Jiacheng Ruan, Jingqi Tong, Conghui He, Yu Cheng | N/A | N/A |
| Themis: A Reference-free NLG Evaluation Language Model with Flexibility and Interpretability | Xinyu Hu, Li Lin, Mingqi Gao, Xunjian Yin, Xiaojun Wan | N/A | N/A |
| Mitigating Training Imbalance in LLM Fine-Tuning via Selective Parameter Merging | Yiming Ju, Ziyi Ni, Xingrun Xing, Zhixiong Zeng, Hanyu Zhao, Siqi Fan, Zheng Zhang | N/A | N/A |
| Generating Demonstrations for In-Context Compositional Generalization in Grounded Language Learning | Sam Spilsbury, Pekka Marttinen, Alexander Ilin | N/A | N/A |
| FAME: Factual Multi-task Model Editing Benchmark | Li Zeng, Yingyu Shan, Zeming Liu, Jiashu Yao, Yuhang Guo | N/A | N/A |
| MLLM-Protector: Ensuring MLLM’s Safety without Hurting Performance | Renjie Pi, Tianyang Han, Jianshu Zhang, Yueqi XIE, Rui Pan, Qing LIAN, Hanze Dong, Jipeng Zhang, Tong Zhang | N/A | N/A |
| Leveraging Large Language Models for NLG Evaluation: Advances and Challenges | Zhen Li, Xiaohan Xu, Tao Shen, Can Xu, Jia-Chen Gu, Yuxuan Lai, Chongyang Tao, Shuai Ma | N/A | N/A |
| InfiniPot: Infinite Context Processing on Memory-Constrained LLMs | Minsoo Kim, Kyuhong Shim, Jungwook Choi, Simyung Chang | N/A | N/A |
| VideoCLIP-XL: Advancing Long Description Understanding for Video CLIP Models | Jiapeng Wang, Chengyu Wang, Kunzhe Huang, Jun Huang, Lianwen Jin | N/A | N/A |
| CorrSynth - A Correlated Sampling Method for Diverse dataset Generation from LLMs | Abhishek Divekar, Suhas S Kowshik, Vijit Malik | N/A | N/A |
| Defining Knowledge: Bridging Epistemology and Large Language Models | Constanza Fierro, Ruchira Dhar, Filippos Stamatiou, Nicolas Garneau, Anders Søgaard | N/A | N/A |
| TKGT: Redefinition and A New Way of Text-to-Table Tasks Based on Real World Demands and Knowledge Graphs Augmented LLMs | Peiwen Jiang, Zibo Zhao, Xinbo Lin, Ruhui Ma, Yvonne Jie Chen, Jinhua Cheng | N/A | N/A |
| Free your mouse! Command Large Language Models to Generate Code to Format Word Documents | Shihao Rao, Liang Li, Jiapeng Liu, Guan Weixin, Xiyan Gao, Bing Lim | N/A | N/A |
| CMR Scaling Law: Predicting Critical Mixture Ratios for Continual Pre-training of Language Models | Jiawei Gu, Zacc Yang, Chuanghao Ding, Rui Zhao, Fei Tan | N/A | N/A |
| The Instinctive Bias: Spurious Images lead to Hallucination in MLLMs | Tianyang Han, Qing LIAN, Rui Pan, Renjie Pi, Jipeng Zhang, Shizhe Diao, Yong Lin, Tong Zhang | N/A | N/A |
| Rationale-Aware Answer Verification by Pairwise Self-Evaluation | Akira Kawabata, Saku Sugawara | N/A | N/A |
| On the Robustness of Editing Large Language Models | Xinbei Ma, Tianjie Ju, Jiyang Qiu, Zhuosheng Zhang, Hai Zhao, Lifeng Liu, Yulong Wang | N/A | N/A |
| IM-BERT: Enhancing Robustness of BERT through the Implicit Euler Method | MiHyeon Kim, Juhyoung Park, YoungBin Kim | N/A | N/A |
| Distract Large Language Models for Automatic Jailbreak Attack | Zeguan Xiao, Yan Yang, Guanhua Chen, Yun Chen | N/A | N/A |
| Exploring Space Efficiency in a Tree-based Linear Model for Extreme Multi-label Classification | He-Zhe Lin, Cheng-Hung Liu, Chih-Jen Lin | N/A | N/A |
| WorryWords: Norms of Anxiety Association for 44,450 English Words | Saif M. Mohammad | N/A | N/A |
| Finding Blind Spots in Evaluator LLMs with Interpretable Checklists | Sumanth Doddapaneni, Mohammed Safi Ur Rahman Khan, Sshubam Verma, Mitesh M Khapra | N/A | N/A |
| LONGAGENT: Achieving Question Answering for 128k-Token-Long Documents through Multi-Agent Collaboration | Jun Zhao, Can Zu, Xu Hao, Yi Lu, Wei He, Yiwen Ding, Tao Gui, Qi Zhang, Xuanjing Huang | N/A | N/A |
| AutoPersuade: A Framework for Evaluating and Explaining Persuasive Arguments | Till Raphael Saenger, Musashi Hinck, Justin Grimmer, Brandon M. Stewart | N/A | N/A |
| Towards Cross-Cultural Machine Translation with Retrieval-Augmented Generation from Multilingual Knowledge Graphs | Simone Conia, Daniel Lee, Min Li, Umar Farooq Minhas, Saloni Potdar, Yunyao Li | N/A | N/A |
| Exploring the Compositional Deficiency of Large Language Models in Mathematical Reasoning Through Trap Problems | Jun Zhao, Jingqi Tong, Yurong Mou, Ming Zhang, Qi Zhang, Xuanjing Huang | N/A | N/A |
| Scaling Laws for Linear Complexity Language Models | Xuyang Shen, Dong Li, Ruitao Leng, Zhen Qin, Weigao Sun, Yiran Zhong | N/A | N/A |
| Autoregressive Multi-trait Essay Scoring via Reinforcement Learning with Scoring-aware Multiple Rewards | Heejin Do, Sangwon Ryu, Gary Lee | N/A | N/A |
| Intrinsic Self-correction for Enhanced Morality: An Analysis of Internal Mechanisms and the Superficial Hypothesis | Guangliang Liu, Haitao Mao, Jiliang Tang, Kristen Johnson | N/A | N/A |
| ATAP: Automatic Template-Augmented Commonsense Knowledge Graph Completion via Pre-Trained Language Models | Fu Zhang, Yifan Ding, Jingwei Cheng | N/A | N/A |
| LM2: A Simple Society of Language Models Solves Complex Reasoning | Gurusha Juneja, Subhabrata Dutta, Tanmoy Chakraborty | N/A | N/A |
| Towards a Semantically-aware Surprisal Theory | Clara Meister, Mario Giulianelli, Tiago Pimentel | N/A | N/A |
| Multi-Level Information Retrieval Augmented Generation for Knowledge-based Visual Question Answering | Adjali Omar, Olivier Ferret, Sahar Ghannay, Hervé Le Borgne | N/A | N/A |
| Can We Trust the Performance Evaluation of Uncertainty Estimation Methods in Text Summarization? | Jianfeng He, Runing Yang, Linlin Yu, Changbin Li, Ruoxi Jia, Feng Chen, Ming Jin, Chang-Tien Lu | N/A | N/A |
| Is It Really Long Context if All You Need Is Retrieval? Towards Genuinely Difficult Long Context NLP | Omer Goldman, Alon Jacovi, Aviv Slobodkin, Aviya Maimon, Ido Dagan, Reut Tsarfaty | N/A | N/A |
| BPE Gets Picky: Efficient Vocabulary Refinement During Tokenizer Training | Pavel Chizhov, Catherine Arnett, Elizaveta Korotkova, Ivan P. Yamshchikov | N/A | N/A |
| SEGMENT+: Long Text Processing with Short-Context Language Models | Wei Shi, Shuang Li, Kerun Yu, Jinglei Chen, Zujie Liang, Xinhui Wu, Yuxi Qian, Feng Wei, Bo Zheng, Jiaqing Liang, Jiangjie Chen, Yanghua Xiao | N/A | N/A |
| Explicit Memory Learning with Expectation Maximization | Zhangyue Yin, Qiushi Sun, Qipeng Guo, Zhiyuan Zeng, Qinyuan Cheng, Xipeng Qiu, Xuanjing Huang | N/A | N/A |
| Learning to Generate Writing Feedback via Language Model Simulated Student Revisions | Inderjeet Jayakumar Nair, Jiaye Tan, Xiaotian Su, Anne Gere, Xu Wang, Lu Wang | N/A | N/A |
| Small LLMs Are Weak Tool Learners: A Multi-LLM Agent | Weizhou Shen, Chenliang Li, Hongzhan Chen, Ming Yan, Xiaojun Quan, Hehong Chen, Ji Zhang, Fei Huang | N/A | N/A |
| Interpreting Context Look-ups in Transformers: Investigating Attention-MLP Interactions | Clement Neo, Shay B Cohen, Fazl Barez | N/A | N/A |
| Still Not Quite There! Assessing Large Language Models for Comorbid Mental Health Diagnosis | Amey Hengle, Atharva Kulkarni, Shantanu Deepak Patankar, Rashmi Gupta | N/A | N/A |
| The Odyssey of Commonsense Causality: From Foundational Benchmarks to Cutting-Edge Reasoning | Shaobo Cui, Zhijing Jin, Bernhard Schölkopf, Boi Faltings | N/A | N/A |
| Investigating Large Language Models for Complex Word Identification in Multilingual and Multidomain Setups | Răzvan-Alexandru Smădu, David-Gabriel ION, Dumitru-Clementin Cercel, Florin Pop, Mihaela-Claudia Cercel | N/A | N/A |
| Model Editing Harms General Abilities of Large Language Models: Regularization to the Rescue | Jia-Chen Gu, Hao-Xiang Xu, Jun-Yu Ma, Pan Lu, Zhen-Hua Ling, Kai-Wei Chang, Nanyun Peng | N/A | N/A |
| Are Large Language Models In-Context Personalized Summarizers? Get an iCOPERNICUS Test Done! | Divya Patel, Pathik Patel, Ankush Chander, Sourish Dasgupta, Tanmoy Chakraborty | N/A | N/A |
| MediTOD: An English Dialogue Dataset for Medical History Taking with Comprehensive Annotations | Vishal Vivek Saley, Goonjan Saha, Rocktim Jyoti Das, Dinesh Raghu, Mausam | N/A | N/A |
| YesBut | Abhilash Nandy, Yash Agarwal, Ashish Patwa, Millon Madhur Das, Aman Bansal, Ankit Raj, Pawan Goyal, Niloy Ganguly | N/A | N/A |
| Scaling Cognitive Limits: Identifying Working Memory Limits in LLMs | Chunhui Zhang, Yiren Jian, Zhongyu Ouyang, Soroush Vosoughi | N/A | N/A |
| RAFT: Realistic Attacks to Fool Text Detectors | James Liyuan Wang, Ran Li, Junfeng Yang, Chengzhi Mao | N/A | N/A |
| LLM-Evolve: Evaluation for LLM’s Evolving Capability on Benchmarks | Jiaxuan You, Mingjie Liu, Shrimai Prabhumoye, Mostofa Patwary, Mohammad Shoeybi, Bryan Catanzaro | N/A | N/A |
| FFN-SkipLLM: A Hidden Gem for Autoregressive Decoding with Adaptive Feed Forward Skipping | Ajay Kumar Jaiswal, Bodun Hu, Lu Yin, Yeonju Ro, Tianlong Chen, Shiwei Liu, Aditya Akella | N/A | N/A |
| LLM-based Code-Switched Text Generation for Grammatical Error Correction | Tom Potter, Zheng Yuan | N/A | N/A |
| Deciphering the Interplay of Parametric and Non-Parametric Memory in RAG Models | Mehrdad Farahani, Richard Johansson | N/A | N/A |
| On Efficient Language and Vision Assistants for Visually-Situated Natural Language Understanding: What Matters in Reading and Reasoning | Geewook Kim, Minjoon Seo | N/A | N/A |
| Community-Cross-Instruct: Unsupervised Instruction Generation for Aligning Large Language Models to Online Communities | Zihao He, Rebecca Dorn, Minh Duc Chu, Siyi Guo, Kristina Lerman | N/A | N/A |
| Mathador-LM: A Dynamic Benchmark for Mathematical Reasoning on Large Language Models | Eldar Kurtic, Amir Moeini, Dan Alistarh | N/A | N/A |
| Reasoning Paths with Reference Objects Elicit Quantitative Spatial Reasoning in Large Vision-Language Models | Yuan-Hong Liao, Rafid Mahmood, Sanja Fidler, David Acuna | N/A | N/A |
| One Thousand and One Pairs: A “novel” challenge for long-context language models | Marzena Karpinska, Katherine Thai, Kyle Lo, Tanya Goyal, Mohit Iyyer | N/A | N/A |
| Foundational Autoraters: Taming Large Language Models for Better Automatic Evaluation | Tu Vu, Kalpesh Krishna, Salaheddin Alzubi, Chris Tar, Manaal Faruqui, Yun-Hsuan Sung | N/A | N/A |
| Do LLMs learn a true syntactic universal? | John T. Hale, Miloš Stanojević | N/A | N/A |
| GDPO: Learning to Align Language Models with Diversity Using GFlowNets | Oh Joon Kwon, Daiki E. Matsunaga, Kee-Eung Kim | N/A | N/A |
| How Susceptible are Large Language Models to Ideological Manipulation? | Kai Chen, Zihao He, Jun Yan, Taiwei Shi, Kristina Lerman | N/A | N/A |
| Measuring Psychological Depth in Language Models | Fabrice Y Harel-Canada, Hanyu Zhou, Sreya Muppalla, Zeynep Senahan Yildiz, Miryung Kim, Nanyun Peng, Amit Sahai | N/A | N/A |
| Media Attitude Detection via Framing Analysis with Events and their Relations | Jin Zhao, Jingxuan Tu, Han Du, Nianwen Xue | N/A | N/A |
| Fill In The Gaps: Model Calibration and Generalization with Synthetic Data | Yang Ba, Michelle V Mancenido, Rong Pan | N/A | N/A |
| Adaptive Question Answering: Enhancing Language Model Proficiency for Addressing Knowledge Conflicts with Source Citations | Sagi Shaier, Ari Kobren, Philip V. Ogren | N/A | N/A |
| Granular Privacy Control for Geolocation with Vision Language Models | Ethan Mendes, Yang Chen, James Hays, Sauvik Das, Wei Xu, Alan Ritter | N/A | N/A |
| MedReadMe: A Systematic Study for Fine-grained Sentence Readability in Medical Domain | Chao Jiang, Wei Xu | N/A | N/A |
| MemeCLIP: Leveraging CLIP Representations for Multimodal Meme Classification | Siddhant Bikram Shah, Shuvam Shiwakoti, Maheep Chaudhary, Haohan Wang | N/A | N/A |
| FlipGuard: Defending Preference Alignment against Update Regression with Constrained Optimization | Mingye Zhu, Yi Liu, Quan Wang, Junbo Guo, Zhendong Mao | N/A | N/A |
| StorySpark: Expert-Annotated QA Pairs with Real-World Knowledge for Children Storytelling | Jiaju Chen, Yuxuan Lu, Shao Zhang, Bingsheng Yao, Yuanzhe Dong, Ying Xu, Yunyao Li, Qianwen Wang, Dakuo Wang, Yuling Sun | N/A | N/A |
| MedCoT: Medical Chain of Thought via Hierarchical Expert | Jiaxiang Liu, Yuan Wang, Jiawei Du, Joey Tianyi Zhou, Zuozhu Liu | N/A | N/A |
| Varying Sentence Representations via Condition-Specified Routers | Ziyong Lin, Quansen Wang, Zixia Jia, Zilong Zheng | N/A | N/A |
| Inductive-Deductive Strategy Reuse for Multi-Turn Instructional Dialogues | Jiao Ou, Jiayu Wu, Che Liu, Fuzheng Zhang, Di Zhang, Kun Gai | N/A | N/A |
| Information Flow Routes: Automatically Interpreting Language Models at Scale | Javier Ferrando, Elena Voita | N/A | N/A |
| A Simple yet Effective Training-free Prompt-free Approach to Chinese Spelling Correction Based on Large Language Models | Houquan Zhou, Zhenghua Li, Bo Zhang, Chen Li, Shaopeng Lai, Ji Zhang, Fei Huang, Min Zhang | N/A | N/A |
| Low-rank Subspace for Binding in Large Language Models | Qin Dai, Benjamin Heinzerling, Kentaro Inui | N/A | N/A |
| CoSafe: Evaluating Large Language Model Safety in Multi-Turn Dialogue Coreference | Erxin Yu, Jing Li, Ming Liao, Siqi Wang, GAO Zuchen, Fei Mi, Lanqing HONG | N/A | N/A |
| ClimRetrieve: A Benchmarking Dataset for Information Retrieval from Corporate Climate Disclosures | Tobias Schimanski, Jingwei Ni, Roberto Spacey Martín, Nicola Ranger, Markus Leippold | N/A | N/A |
| Context-Aware Adapter Tuning for Few-Shot Relation Learning in Knowledge Graphs | LIU Ran, Zhongzhou Liu, Xiaoli Li, Yuan Fang | N/A | N/A |
| Zero-Shot Detection of LLM-Generated Text using Token Cohesiveness | Shixuan Ma, Quan Wang | N/A | N/A |
| Dual-oriented Disentangled Network with Counterfactual Intervention for Multimodal Intent Detection | Zhanpeng Chen, Zhihong Zhu, Xianwei Zhuang, Zhiqi Huang, Yuexian Zou | N/A | N/A |
| From LLMs to MLLMs: Exploring the Landscape of Multimodal Jailbreaking | Siyuan Wang, Zhuohan Long, Zhihao Fan, Zhongyu Wei | N/A | N/A |
| Symbolic Working Memory Enhances Language Models for Complex Rule Application | Siyuan Wang, Zhongyu Wei, Yejin Choi, Xiang Ren | N/A | N/A |
| LLoCO: Learning Long Contexts Offline | Sijun Tan, Xiuyu Li, Shishir G Patil, Ziyang Wu, Tianjun Zhang, Kurt Keutzer, Joseph E. Gonzalez, Raluca Popa | N/A | N/A |
| Don’t Forget Your Reward Values: Language Model Alignment via Value-based Calibration | Xin Mao, Feng-Lin Li, Huimin Xu, Wei Zhang, Wang Chen, Anh Tuan Luu | N/A | N/A |
| Mentor-KD: Making Small Language Models Better Multi-step Reasoners | Hojae Lee, Junho Kim, SangKeun Lee | N/A | N/A |
| Are Large Language Models Capable of Generating Human-Level Narratives? | Yufei Tian, Tenghao Huang, Miri Liu, Derek Jiang, Alexander Spangher, Muhao Chen, Jonathan May, Nanyun Peng | N/A | N/A |
| MP2D: An Automated Topic Shift Dialogue Generation Framework Leveraging Knowledge Graphs | Yerin Hwang, Yongil Kim, Yunah Jang, Jeesoo Bang, Hyunkyung Bae, Kyomin Jung | N/A | N/A |
| Can Large Language Models Enhance Predictions of Disease Progression? Investigating Through Disease Network Link Prediction | Haohui Lu, Usman Naseem | N/A | N/A |
| Searching for Best Practices in Retrieval-Augmented Generation | Xiaohua Wang, Zhenghua Wang, Xuan Gao, Feiran Zhang, Yixin Wu, Zhibo Xu, Tianyuan Shi, Zhengyuan Wang, Shizheng Li, Qi Qian, Ruicheng Yin, Changze Lv, Xiaoqing Zheng, Xuanjing Huang | N/A | N/A |
| Moral Foundations of Large Language Models | Marwa Abdulhai, Gregory Serapio-García, Clement CREPY, Daria Valter, John Canny, Natasha Jaques | N/A | N/A |
| The Zeno’s Paradox of ‘Low-Resource’ Languages | Hellina Hailu Nigatu, Atnafu Lambebo Tonja, Benjamin Rosman, Thamar Solorio, Monojit Choudhury | N/A | N/A |
| Knowledge Planning in Large Language Models for Domain-Aligned Counseling Summarization | Aseem Srivastava, Smriti Joshi, Tanmoy Chakraborty, Md Shad Akhtar | N/A | N/A |
| Enhancing Post-Hoc Attributions in Long Document Comprehension via Coarse Grained Answer Decomposition | Pritika Ramu, Koustava Goswami, Apoorv Saxena, Balaji Vasan Srinivasan | N/A | N/A |
| From Descriptive Richness to Bias: Unveiling the Dark Side of Generative Image Caption Enrichment | Yusuke Hirota, Ryo Hachiuma, Chao-Han Huck Yang, Yuta Nakashima | N/A | N/A |
| Pruning via Merging: Compressing LLMs via Manifold Alignment Based Layer Merging | Deyuan Liu, Zhanyue Qin, Hairu Wang, Zhao Yang, Zecheng Wang, Fangying Rong, Qingbin Liu, Yanchao Hao, Bo Li, Xi Chen, Cunhang Fan, Zhao Lv, Dianhui Chu, Zhiying Tu, Dianbo Sui | N/A | N/A |
| Embedded Named Entity Recognition using Probing Classifiers | Nicholas Popovic, Michael Färber | N/A | N/A |
| Unleashing the Power of Emojis in Texts via Self-supervised Graph Pre-Training | Zhou Zhang, Dongzeng Tan, Jiaan Wang, Yilong Chen, Jiarong Xu | N/A | N/A |
| Data Contamination Can Cross Language Barriers | Feng Yao, Yufan Zhuang, Zihao Sun, Sunan Xu, Animesh Kumar, Jingbo Shang | N/A | N/A |
| Automated Essay Scoring: A Reflection on the State of the Art | Shengjie Li, Vincent Ng | N/A | N/A |
| Encouraging Divergent Thinking in Large Language Models through Multi-Agent Debate | Tian Liang, Zhiwei He, Wenxiang Jiao, Xing Wang, Yan Wang, Rui Wang, Yujiu Yang, Shuming Shi, Zhaopeng Tu | N/A | N/A |
| Unveiling and Consulting Core Experts in Retrieval-Augmented MoE-based LLMs | Xin Zhou, Ping Nie, Yiwen Guo, Haojie Wei, Zhanqiu Zhang, Pasquale Minervini, Ruotian Ma, Tao Gui, Qi Zhang, Xuanjing Huang | N/A | N/A |
| CURE: Context- and Uncertainty-Aware Mental Disorder Detection | Migyeong Kang, Goun Choi, Hyolim Jeon, Ji hyun An, Daejin Choi, Jinyoung Han | N/A | N/A |
| PepRec: Progressive Enhancement of Prompting for Recommendation | Yakun Yu, Shi-ang Qi, Baochun Li, Di Niu | N/A | N/A |
| In-Context Compositional Generalization for Large Vision-Language Models | Chuanhao Li, Chenchen Jing, Zhen Li, Mingliang Zhai, Yuwei Wu, Yunde Jia | N/A | N/A |
| Improving Zero-shot LLM Re-Ranker with Risk Minimization | Xiaowei Yuan, Zhao Yang, Yequan Wang, Jun Zhao, Kang Liu | N/A | N/A |
| Game on Tree: Visual Hallucination Mitigation via Coarse-to-Fine View Tree and Game Theory | Xianwei Zhuang, Zhihong Zhu, Zhanpeng Chen, Yuxin Xie, Liming Liang, Yuexian Zou | N/A | N/A |
| Label Confidence Weighted Learning for Target-level Sentence Simplification | Jingshen Zhang, Xin Ying Qiu | N/A | N/A |
| Quantum Recurrent Architectures for Text Classification | Wenduan Xu, Stephen Clark, Douglas Brown, Gabriel Matos, Konstantinos Meichanetzidis | N/A | N/A |
| Tree of Problems: Improving structured problem solving with compositionality | Armel Randy Zebaze, Benoît Sagot, Rachel Bawden | N/A | N/A |
| What the Harm? Quantifying the Tangible Impact of Gender Bias in Machine Translation with a Human-centered Study | Beatrice Savoldi, Sara Papi, Matteo Negri, Ana Guerberof-Arenas, Luisa Bentivogli | N/A | N/A |
| Seg2Act: Global Context-aware Action Generation for Document Logical Structuring | Zichao Li, Shaojie He, Meng Liao, Xuanang Chen, Yaojie Lu, Hongyu Lin, Yanxiong Lu, Xianpei Han, Le Sun | N/A | N/A |
| Is C4 Dataset Enough for Pruning? An Investigation of Calibration Data for LLM Pruning | Abhinav Bandari, Lu Yin, Cheng-Yu Hsieh, Ajay Kumar Jaiswal, Tianlong Chen, Li Shen, Ranjay Krishna, Shiwei Liu | N/A | N/A |
| Revisiting the Robustness of Watermarking to Paraphrasing Attacks | Saksham Rastogi, Danish Pruthi | N/A | N/A |
| A Survey of Ontology Expansion for Conversational Understanding | Jinggui Liang, Yuxia Wu, Yuan Fang, Hao Fei, Lizi Liao | N/A | N/A |
| Calibrating Language Models with Adaptive Temperature Scaling | Johnathan Xie, Annie S Chen, Yoonho Lee, Eric Mitchell, Chelsea Finn | N/A | N/A |
| Which Programming Language and What Features at Pre-training Stage Affect Downstream Logical Inference Performance? | Fumiya Uchiyama, Takeshi Kojima, Andrew Gambardella, Qi Cao, Yusuke Iwasawa, Yutaka Matsuo | N/A | N/A |
| Why do objects have many names? A study on word informativeness in language use and lexical systems. | Eleonora Gualdoni, Gemma Boleda | N/A | N/A |
| Dual-Space Knowledge Distillation for Large Language Models | Songming Zhang, Xue Zhang, Zengkui Sun, Yufeng Chen, Jinan Xu | N/A | N/A |
| NoiseBench: Benchmarking the Impact of Real Label Noise on Named Entity Recognition | Elena Merdjanovska, Ansar Aynetdinov, Alan Akbik | N/A | N/A |
| On the Universal Truthfulness Hyperplane Inside LLMs | Junteng Liu, Shiqi Chen, Yu Cheng, Junxian He | N/A | N/A |
| PairDistill: Pairwise Relevance Distillation for Dense Retrieval | Chao-Wei Huang, Yun-Nung Chen | N/A | N/A |
| User Inference Attacks on Large Language Models | Nikhil Kandpal, Krishna Pillutla, Alina Oprea, Peter Kairouz, Christopher A. Choquette-Choo, Zheng Xu | N/A | N/A |
| HiFT: A Hierarchical Full Parameter Fine-Tuning Strategy | YongKang Liu, Yiqun Zhang, Qian Li, Tong Liu, Shi Feng, Daling Wang, Yifei Zhang, Hinrich Schuetze | N/A | N/A |
| Investigating and Mitigating Object Hallucinations in Pretrained Vision-Language (CLIP) Models | Yufang Liu, Tao Ji, Changzhi Sun, Yuanbin Wu, Aimin Zhou | N/A | N/A |
| Simultaneous Masking, Not Prompting Optimization: A Paradigm Shift in Fine-tuning LLMs for Simultaneous Translation | Matthew Raffel, Victor Agostinelli, Lizhong Chen | N/A | N/A |
| ToolPlanner: A Tool Augmented LLM for Multi Granularity Instructions with Path Planning and Feedback | Qinzhuo Wu, Wei Liu, Jian Luan, Bin Wang | N/A | N/A |
| Please note that I’m just an AI: Analysis of Behavior Patterns of LLMs in (Non-)offensive Speech Identification | Esra Dönmez, Thang Vu, Agnieszka Falenska | N/A | N/A |
| How to Compute the Probability of a Word | Tiago Pimentel, Clara Meister | N/A | N/A |
| A linguistically-motivated evaluation methodology for unraveling model’s abilities in reading comprehension tasks | Elie Antoine, Frederic Bechet, Géraldine Damnati, Philippe Langlais | N/A | N/A |
| GuardBench: A Large-Scale Benchmark for Guardrail Models | Elias Bassani, Ignacio Sanchez | N/A | N/A |
| Generate-on-Graph: Treat LLM as both Agent and KG for Incomplete Knowledge Graph Question Answering | Yao Xu, Shizhu He, Jiabei Chen, Zihao Wang, Yangqiu Song, Hanghang Tong, Guang Liu, Jun Zhao, Kang Liu | N/A | N/A |
| Language models and brains align due to more than next-word prediction and word-level information | Gabriele Merlin, Mariya Toneva | N/A | N/A |
| LLMEdgeRefine: Enhancing Text Clustering with LLM-Based Boundary Point Refinement | Zijin Feng, Luyang Lin, Lingzhi Wang, Hong Cheng, Kam-Fai Wong | N/A | N/A |
| CasiMedicos-Arg: A Medical Question Answering Dataset Annotated with Explanatory Argumentative Structures | Ekaterina Sviridova, Anar Yeginbergen, Ainara Estarrona, Elena Cabrio, Serena Villata, Rodrigo Agerri | N/A | N/A |
| A Simple and Effective $L_2$ Norm-Based Strategy for KV Cache Compression | Alessio Devoto, Yu Zhao, Simone Scardapane, Pasquale Minervini | N/A | N/A |
| GOME: Grounding-based Metaphor Binding With Conceptual Elaboration For Figurative Language Illustration | Linhao Zhang, Jintao Liu, Li Jin, Hao Wang, Kaiwen Wei, Guangluan Xu | N/A | N/A |
| D3CODE: Disentangling Disagreements in Data across Cultures on Offensiveness Detection and Evaluation | Aida Mostafazadeh Davani, Mark Diaz, Dylan K Baker, Vinodkumar Prabhakaran | N/A | N/A |
| PALM: Few-Shot Prompt Learning for Audio Language Models | Asif Hanif, Maha Tufail Agro, Mohammad Areeb Qazi, Hanan Aldarmaki | N/A | N/A |
| Annotator-Centric Active Learning for Subjective NLP Tasks | Michiel van der Meer, Neele Falk, Pradeep K. Murukannaiah, Enrico Liscio | N/A | N/A |
| Lost in Tokenization: How to Measure Word Surprisal From LM Token Probabilities | Luca Malagutti, Juan Luis Gastaldi, Brian DuSell, Tim Vieira, Ryan Cotterell, Mario Giulianelli | N/A | N/A |
| Enhanced Hallucination Detection in Neural Machine Translation through Simple Detector Aggregation | Anas Himmi, Guillaume Staerman, Marine Picot, Pierre Colombo, Nuno M Guerreiro | N/A | N/A |
| Jailbreaking LLMs with Arabic Transliteration and Arabizi | Mansour Al Ghanim, Saleh Almohaimeed, Mengxin Zheng, Yan Solihin, Qian Lou | N/A | N/A |
| Who is better at math, Jenny or Jingzhen? Uncovering Stereotypes in Large Language Models | Zara Siddique, Liam Turner, Luis Espinosa-Anke | N/A | N/A |
| Instruction Matters, a Simple yet Effective Task Selection Approach in Instruction Tuning for Specific Tasks | Changho Lee, Janghoon Han, Seonghyeon Ye, Stanley Jungkyu Choi, Honglak Lee, Kyunghoon Bae | N/A | N/A |
| Recurrent Alignment with Hard Attention for Hierarchical Text Rating | Chenxi Lin, Ren Jiayu, Guoxiu He, Zhuoren Jiang, Haiyan Yu, Xiaomin Zhu | N/A | N/A |
| CHESS: Optimizing LLM Inference via Channel-Wise Thresholding and Selective Sparsification | Junhui He, Shangyu Wu, Weidong Wen, Chun Jason Xue, Qingan Li | N/A | N/A |
| Semformer: Transformer Language Models with Semantic Planning | Yongjing Yin, Junran Ding, Kai Song, Yue Zhang | N/A | N/A |
| DocCGen: Document-based Controlled Code Generation | Sameer Pimparkhede, Mehant Kammakomati, Srikanth G. Tamilselvam, Prince Kumar, Ashok Pon Kumar, Pushpak Bhattacharyya | N/A | N/A |
| Semantics and Sentiment: Cross-lingual Variations in Emoji Use | Giulio Zhou, Sydelle de Souza, Ella Markham, Oghenetekevwe Kwakpovwe, Sumin Zhao | N/A | N/A |
| The Emergence of Compositional Languages in Multi-entity Referential Games: from Image to Graph Representations | Daniel Akkerman, Phong Le, Raquel G. Alhama | N/A | N/A |
| Transformers are Multi-State RNNs | Matanel Oren, Michael Hassid, Nir Yarden, Yossi Adi, Roy Schwartz | N/A | N/A |
| Evaluating Large Language Models along Dimensions of Language Variation: A Systematik Invesdigatiom uv Cross-lingual Generalization | Niyati Bafna, Kenton Murray, David Yarowsky | N/A | N/A |
| Fuse to Forget: Bias Reduction and Selective Memorization through Model Fusion | Kerem Zaman, Leshem Choshen, Shashank Srivastava | N/A | N/A |
| Collective Critics for Creative Story Generation | Minwook Bae, Hyounghun Kim | N/A | N/A |
| Surprisal Curves of Discourse | Eleftheria Tsipidi, Franz Nowak, Ryan Cotterell, Ethan Wilcox, Mario Giulianelli, Alex Warstadt | N/A | N/A |
| Model-based Preference Optimization in Abstractive Summarization without Human Feedback | Jaepill Choi, Kyubyung Chae, Jiwoo Song, Yohan Jo, Taesup Kim | N/A | N/A |
| Are Data Augmentation Methods in Named Entity Recognition Applicable for Uncertainty Estimation? | Wataru Hashimoto, Hidetaka Kamigaito, Taro Watanabe | N/A | N/A |
| NeuroTrialNER: An Annotated Corpus for Neurological Diseases and Therapies in Clinical Trial Registries | Simona Emilova Doneva, Tilia Ellendorff, Jean-Philippe Goldman, Amelia Elaine Cannon, Gerold Schneider, Beate Sick, Benjamin Victor Ineichen | N/A | N/A |
| Do Explanations Help or Hurt? Saliency Maps vs Natural Language Explanations in a Clinical Decision-Support Setting | Maxime Guillaume Kayser, Bayar Menzat, Cornelius Emde, Bogdan Alexandru Bercean, Alex Novak, Abdalá Trinidad Espinosa Morgado, Bartlomiej Papiez, Susanne Gaube, Thomas Lukasiewicz, Oana-Maria Camburu | N/A | N/A |
| Towards Faithful Knowledge Graph Explanation Through Deep Alignment in Commonsense Question Answering | WEIHE ZHAI, Arkaitz Zubiaga, Bingquan Liu, Chengjie Sun, Yalong Zhao | N/A | N/A |
| Generation with Dynamic Vocabulary | Yanting Liu, Tao Ji, Yuanbin Wu, Xiaoling Wang, Changzhi Sun | N/A | N/A |
| Argument Relation Classification through Discourse Markers and Adversarial Training | Michele Luca Contalbo, Francesco Guerra, Matteo Paganelli | N/A | N/A |
| Getting The Most Out of Your Training Data: Exploring Unsupervised Tasks for Morphological Inflection | Abhishek Purushothama, Adam Wiemerslage, Katharina von der Wense | N/A | N/A |
| Link, Synthesize, Retrieve: Universal Document Linking for Zero-Shot Information Retrieval | Dae Yon Hwang, Bilal Taha, Harshit Pande, Yaroslav Nechaev | N/A | N/A |
| Efficient Unseen Language Adaptation for Multilingual Pre-Trained Language Models | Po-Heng Chen, Yun-Nung Chen | N/A | N/A |
| Prove Your Point!: Bringing Proof-Enhancement Principles to Argumentative Essay Generation | Ruiyu Xiao, Lei Wu, Yuhang Gou, Weinan Zhang, Ting Liu | N/A | N/A |
| TV-TREES: Multimodal Entailment Trees for Neuro-Symbolic Video Reasoning | Kate Sanders, Nathaniel Weir, Benjamin Van Durme | N/A | N/A |
| Unsupervised Extraction of Dialogue Policies from Conversations | Makesh Narsimhan Sreedhar, Traian Rebedea, Christopher Parisien | N/A | N/A |
| GRIZAL: Generative Prior-guided Zero-Shot Temporal Action Localization | Onkar Kishor Susladkar, Gayatri Sudhir Deshmukh, Vandan Gorade, Sparsh Mittal | N/A | N/A |
| Preserving Multi-Modal Capabilities of Pre-trained VLMs for Improving Vision-Linguistic Compositionality | Youngtaek Oh, Jae Won Cho, Dong-Jin Kim, In So Kweon, Junmo Kim | N/A | N/A |
| FoodieQA: A Multimodal Dataset for Fine-Grained Understanding of Chinese Food Culture | Wenyan Li, Crystina Zhang, Jiaang Li, Qiwei Peng, Raphael Tang, Li Zhou, Weijia Zhang, Guimin Hu, Yifei Yuan, Anders Søgaard, Daniel Hershcovich, Desmond Elliott | N/A | N/A |
| A Two-Step Approach for Data-Efficient French Pronunciation Learning | Hoyeon Lee, Hyeeun Jang, JONGHWAN KIM, Jaemin Kim | N/A | N/A |
| Exploring Intra and Inter-language Consistency in Embeddings with ICA | Rongzhi Li, Takeru Matsuda, Hitomi Yanaka | N/A | N/A |
| DetoxLLM: A Framework for Detoxification with Explanations | Md Tawkat Islam Khondaker, Muhammad Abdul-Mageed, Laks V. S. Lakshmanan | N/A | N/A |
| Building a Multi-Platform, BERT Classifier for Detecting Connective Language | Josephine Lukito, Bin Chen, Gina M. Masullo, Natalie Jomini Stroud | N/A | N/A |
| ShadowLLM: Predictor-based Contextual Sparsity for Large Language Models | Yash Akhauri, Ahmed F AbouElhamayed, Jordan Dotzel, Zhiru Zhang, Alexander M Rush, Safeen Huda, Mohamed S Abdelfattah | N/A | N/A |
| Emotion Granularity from Text: An Aggregate-Level Indicator of Mental Health | Krishnapriya Vishnubhotla, Daniela Teodorescu, Mallory J Feldman, Kristen Lindquist, Saif M. Mohammad | N/A | N/A |
| BLSP-Emo: Towards Empathetic Large Speech-Language Models | Chen Wang, Minpeng Liao, Zhongqiang Huang, Junhong Wu, Chengqing Zong, Jiajun Zhang | N/A | N/A |
| SynthesizRR: Generating Diverse Datasets with Retrieval Augmentation | Abhishek Divekar, Greg Durrett | N/A | N/A |
| Multimodal Self-Instruct: Synthetic Abstract Image and Visual Reasoning Instruction Using Language Model | Wenqi Zhang, Zhenglin Cheng, Yuanyu He, Mengna Wang, Yongliang Shen, Zeqi Tan, Guiyang Hou, Mingqian He, Yanna Ma, Weiming Lu, Yueting Zhuang | N/A | N/A |
| DataNarrative: Automated Data-Driven Storytelling with Visualizations and Texts | Mohammed Saidul Islam, Md Tahmid Rahman Laskar, Md Rizwan Parvez, Enamul Hoque, Shafiq Joty | N/A | N/A |
| DEM: Distribution Edited Model for Training with Mixed Data Distributions | Dhananjay Ram, Aditya Rawal, Momchil Hardalov, Nikolaos Pappas, Sheng Zha | N/A | N/A |
| Altogether: Image Captioning via Re-aligning Alt-text | Hu Xu, Po-Yao Huang, Xiaoqing Tan, Ching-Feng Yeh, Jacob Kahn, Christine Jou, Gargi Ghosh, Omer Levy, Luke Zettlemoyer, Wen-tau Yih, Shang-Wen Li, Saining Xie, Christoph Feichtenhofer | N/A | N/A |
| VerifyMatch: A Semi-Supervised Learning Paradigm for Natural Language Inference with Confidence-Aware MixUp | Seo Yeon Park, Cornelia Caragea | N/A | N/A |
| CaT-Bench: Benchmarking Language Model Understanding of Causal and Temporal Dependencies in Plans | Yash Kumar Lal, Vanya Cohen, Nathanael Chambers, Niranjan Balasubramanian, Ray Mooney | N/A | N/A |
| Mitigating the Impact of Reference Quality on Evaluation of Summarization Systems with Reference-Free Metrics | Théo Gigant, Camille Guinaudeau, Marc decombas, Frederic Dufaux | N/A | N/A |
| An Empirical Analysis of the Writing Styles of Persona-Assigned LLMs | Manuj Malik, Jing Jiang, Kian Ming A. Chai | N/A | N/A |
| Investigating the Role of Instruction Variety and Task Difficulty in Robotic Manipulation Tasks | Amit Parekh, Nikolas Vitsakis, Alessandro Suglia, Ioannis Konstas | N/A | N/A |
| GPT vs RETRO: Exploring the Intersection of Retrieval and Parameter-Efficient Fine-Tuning | Aleksander Ficek, Jiaqi Zeng, Oleksii Kuchaiev | N/A | N/A |
| CoCoST: Automatic Complex Code Generation with Online Searching and Correctness Testing | Xinyi He, Jiaru Zou, Yun Lin, Mengyu Zhou, Shi Han, Zejian Yuan, Dongmei Zhang | N/A | N/A |
| Sequential API Function Calling Using GraphQL Schema | Avirup Saha, Lakshmi Mandal, Balaji Ganesan, Sambit Ghosh, Renuka Sindhgatta, Carlos Eberhardt, Dan Debrunner, Sameep Mehta | N/A | N/A |
| The Illusion of Competence: Evaluating the Effect of Explanations on Users’ Mental Models of Visual Question Answering Systems | Judith Sieker, Simeon Junker, Ronja Utescher, Nazia Attari, Heiko Wersing, Hendrik Buschmeier, Sina Zarrieß | N/A | N/A |
| Re-Evaluating Evaluation for Multilingual Summarization | Jessica Zosa Forde, Ruochen Zhang, Lintang Sutawika, Alham Fikri Aji, Samuel Cahyawijaya, Genta Indra Winata, Minghao Wu, Carsten Eickhoff, Stella Biderman, Ellie Pavlick | N/A | N/A |
| Video-Text Prompting for Weakly Supervised Spatio-Temporal Video Grounding | Heng zhao, Zhao Yinjie, Bihan Wen, Yew-Soon Ong, Joey Tianyi Zhou | N/A | N/A |
| A Fast and Sound Tagging Method for Discontinuous Named-Entity Recognition | Caio Filippo Corro | N/A | N/A |
| Factuality of Large Language Models in the Year 2024 | Yuxia Wang, Minghan Wang, Muhammad Arslan Manzoor, Fei Liu, Georgi Nenkov Georgiev, Rocktim Jyoti Das, Preslav Nakov | N/A | N/A |
| Discovering Biases in Information Retrieval Models Using Relevance Thesaurus as Global Explanation | Youngwoo Kim, Razieh Rahimi, James Allan | N/A | N/A |
| Adaptable Moral Stances of Large Language Models on Sexist Content: Implications for Society and Gender Discourse | Rongchen Guo, Isar Nejadgholi, Hillary Dawkins, Kathleen C. Fraser, Svetlana Kiritchenko | N/A | N/A |
| DISCERN: Decoding Systematic Errors in Natural Language for Text Classifiers | Rakesh R Menon, Shashank Srivastava | N/A | N/A |
| IntCoOp: Interpretability-Aware Vision-Language Prompt Tuning | Soumya Suvra Ghosal, Samyadeep Basu, Soheil Feizi, Dinesh Manocha | N/A | N/A |
| Scope-enhanced Compositional Semantic Parsing for DRT | Xiulin Yang, Jonas Groschwitz, Alexander Koller, Johan Bos | N/A | N/A |
| The Generation Gap: Exploring Age Bias Underlying in the Value Systems of Large Language Models | Siyang Liu, Trisha Maturi, Bowen Yi, Siqi Shen, Rada Mihalcea | N/A | N/A |
| TempoFormer: A Transformer for Temporally-aware Representations in Change Detection | Talia Tseriotou, Adam Tsakalidis, Maria Liakata | N/A | N/A |
| Pron vs Prompt: Can Large Language Models already Challenge a World-Class Fiction Author at Creative Text Writing? | Guillermo Marco, Julio Gonzalo, M.Teresa Mateo-Girona, Ramón del Castillo Santos | N/A | N/A |
| Evaluating Diversity in Automatic Poetry Generation | Yanran Chen, Hannes Gröner, Sina Zarrieß, Steffen Eger | N/A | N/A |
| Evaluating Short-Term Temporal Fluctuations of Social Biases in Social Media Data and Masked Language Models | Yi Zhou, Danushka Bollegala, Jose Camacho-Collados | N/A | N/A |
| Delving into Qualitative Implications of Synthetic Data for Hate Speech Detection | Camilla Casula, Sebastiano Vecellio Salto, Alan Ramponi, Sara Tonelli | N/A | N/A |
| Grounding Language in Multi-Perspective Referential Communication | Zineng Tang, Lingjun Mao, Alane Suhr | N/A | N/A |
| Threshold-driven Pruning with Segmented Maximum Term Weights for Approximate Cluster-based Sparse Retrieval | Yifan Qiao, Parker Carlson, Shanxiu He, Yingrui Yang, Tao Yang | N/A | N/A |
| Error Analysis of Multilingual Language Models in Machine Translation for Low-resource Languages: A Case Study of Amharic to English Bi-directional Machine Translation | Hizkiel Mitiku Alemayehu, Hamada M Zahera, Axel-Cyrille Ngonga Ngomo | N/A | N/A |
| MIPD: Exploring Manipulation and Intention In a Novel Corpus of Polish Disinformation | Arkadiusz Modzelewski, Giovanni Da San Martino, Pavel Savov, Magdalena Anna Wilczyńska, Adam Wierzbicki | N/A | N/A |
| Unsupervised Discrete Representations of American Sign Language | Artem Abzaliev, Rada Mihalcea | N/A | N/A |
| Perceptions to Beliefs: Exploring Precursory Inferences for Theory of Mind in Large Language Models | Chani Jung, Dongkwan Kim, Jiho Jin, Jiseon Kim, Yeon Seonwoo, Yejin Choi, Alice Oh, Hyunwoo Kim | N/A | N/A |
| Towards Enhancing Coherence in Extractive Summarization: Dataset and Experiments with LLMs | Mihir Parmar, Hanieh Deilamsalehy, Franck Dernoncourt, Seunghyun Yoon, Ryan A. Rossi, Trung Bui | N/A | N/A |
| Jump Starting Bandits with LLM-Generated Prior Knowledge | Parand A. Alamdari, Yanshuai Cao, Kevin H. Wilson | N/A | N/A |
| Adaptation Odyssey in LLMs: Why Does Additional Pretraining Sometimes Fail to Improve? | Fırat Öncel, Matthias Bethge, Beyza Ermis, Mirco Ravanelli, Cem Subakan, Çağatay Yıldız | N/A | N/A |
| Not All Contexts Are Equal: Teaching LLMs Credibility-aware Generation | Ruotong Pan, Boxi Cao, Hongyu Lin, Xianpei Han, Jia Zheng, Sirui Wang, Xunliang Cai, Le Sun | N/A | N/A |
| Virtual Personas for Language Models via an Anthology of Backstories | Suhong Moon, Marwa Abdulhai, Minwoo Kang, Joseph Suh, Widyadewi Soedarmadji, Eran Kohen Behar, David Chan | N/A | N/A |
| Step-by-Step Reasoning to Solve Grid Puzzles: Where do LLMs Falter? | Nemika Tyagi, Mihir Parmar, Mohith Kulkarni, Aswin RRV, Nisarg Patel, Mutsumi Nakamura, Arindam Mitra, Chitta Baral | N/A | N/A |
| Reasoning in Token Economies: Budget-Aware Evaluation of LLM Reasoning Strategies | Junlin Wang, Siddhartha Jain, Dejiao Zhang, Baishakhi Ray, Varun Kumar, Ben Athiwaratkun | N/A | N/A |
| The Empirical Variability of Narrative Perceptions of Social Media Texts | Joel Mire, Maria Antoniak, Elliott Ash, Andrew Piper, Maarten Sap | N/A | N/A |
| Which questions should I answer? Salience Prediction of Inquisitive Questions | Yating Wu, Ritika Rajesh Mangla, Alex Dimakis, Greg Durrett, Junyi Jessy Li | N/A | N/A |
| Revealing Personality Traits: A New Benchmark Dataset for Explainable Personality Recognition on Dialogues | Lei Sun, Jinming Zhao, Qin Jin | N/A | N/A |
| Continual Test-time Adaptation for End-to-end Speech Recognition on Noisy Speech | Guan-Ting Lin, Wei Ping Huang, Hung-yi Lee | N/A | N/A |
| Whiteboard-of-Thought: Thinking Step-by-Step Across Modalities | Sachit Menon, Richard Zemel, Carl Vondrick | N/A | N/A |
| CodeJudge: Evaluating Code Generation with Large Language Models | Weixi Tong, Tianyi Zhang | N/A | N/A |
| Self-Training Large Language and Vision Assistant for Medical | Guohao Sun, Can Qin, Huazhu Fu, Linwei Wang, ZHIQIANG TAO | N/A | N/A |
| SYNFAC-EDIT: Synthetic Imitation Edit Feedback for Factual Alignment in Clinical Summarization | Prakamya Mishra, Zonghai Yao, Parth Vashisht, Feiyun Ouyang, Beining Wang, Vidhi Dhaval Mody, hong yu | N/A | N/A |
| Defending Jailbreak Prompts via In-Context Adversarial Game | Yujun Zhou, Yufei Han, Haomin Zhuang, Kehan Guo, Zhenwen Liang, Hongyan Bao, Xiangliang Zhang | N/A | N/A |
| Detecting Online Community Practices with Large Language Models: A Case Study of Pro-Ukrainian Publics on Twitter | Kateryna Kasianenko, Shima Khanehzar, Stephen Wan, Ehsan Dehghan, Axel Bruns | N/A | N/A |
| Multilingual Topic Classification in X: Dataset and Analysis | Dimosthenis Antypas, Asahi Ushio, Francesco Barbieri, Jose Camacho-Collados | N/A | N/A |
| MT-Eval: A Multi-Turn Capabilities Evaluation Benchmark for Large Language Models | Wai-Chung Kwan, Xingshan Zeng, Yuxin Jiang, Yufei Wang, Liangyou Li, Lifeng Shang, Xin Jiang, Qun Liu, Kam-Fai Wong | N/A | N/A |
| Updating CLIP to Prefer Descriptions Over Captions | Amir Zur, Elisa Kreiss, Karel D’Oosterlinck, Christopher Potts, Atticus Geiger | N/A | N/A |
| CmdCaliper: A Semantic-Aware Command-Line Embedding Model and Dataset for Security Research | Sian-Yao Huang, Cheng-Lin Yang, Che-Yu Lin, Chun-Ying Huang | N/A | N/A |
| Back to School: Translation Using Grammar Books | Jonathan Hus, Antonios Anastasopoulos | N/A | N/A |
| VIEWS: Entity-Aware News Video Captioning | Hammad Ayyubi, Tianqi Liu, Arsha Nagrani, Xudong Lin, Mingda Zhang, Anurag Arnab, feng han, Yukun Zhu, Xuande Feng, Kevin Zhang, Jialu Liu, Shih-Fu Chang | N/A | N/A |
| Towards Aligning Language Models with Textual Feedback | Saüc Abadal Lloret, Shehzaad Dhuliawala, Keerthiram Murugesan, Mrinmaya Sachan | N/A | N/A |
| ATPO: Automatic Tree-Structured Prompt Optimization | Sheng Yang, Yurong Wu, Yan Gao, Zineng Zhou, Xiaodi Sun, Bin Benjamin Zhu, Jian-Guang Lou, Zhiming Ding, Anbang Hu, Yuan Fang, Yunsong Li, Junyan Chen, Linjun Yang | N/A | N/A |
| DeMPT: Decoding-enhanced Multi-phase Prompt Tuning for Making LLMs Be Better Context-aware Translators | Xinglin Lyu, Junhui Li, Yanqing Zhao, Min Zhang, Daimeng Wei, shimin tao, Hao Yang, Min Zhang | N/A | N/A |
| DEFT-UCS: Data Efficient Fine-Tuning for Pre-Trained Language Models via Unsupervised Core-Set Selection | Devleena Das, Vivek Khetan | N/A | N/A |
| Unveiling Multi-level and Multi-modal Semantic Representations in the Human Brain using Large Language Models | Yuko Nakagi, Takuya Matsuyama, Naoko Koide-Majima, Hiroto Q. Yamaguchi, Rieko Kubo, Shinji Nishimoto, Yu Takagi | N/A | N/A |
| “They are uncultured”: Unveiling Covert Harms and Social Threats in LLM Generated Conversations | Preetam Prabhu Srikar Dammu, Hayoung Jung, Anjali Singh, Monojit Choudhury, Tanu Mitra | N/A | N/A |
| Multi-expert Prompting Improves Reliability, Safety and Usefulness of Large Language Models | Do Xuan Long, Duong Ngoc Yen, Anh Tuan Luu, Kenji Kawaguchi, Min-Yen Kan, Nancy F. Chen | N/A | N/A |
| Will LLMs Replace the Encoder-Only Models in Temporal Relation Classification? | Gabriel Roccabruna, Massimo Rizzoli, giuseppe riccardi | N/A | N/A |
| Eliciting In-Context Learning in Vision-Language Models for Videos Through Curated Data Distributional Properties | Keunwoo Peter Yu, Zheyuan Zhang, Fengyuan Hu, Shane Storks, Joyce Chai | N/A | N/A |
| Framework for Robust and Scalable Text Watermarking | Gregory Kang Ruey Lau, Xinyuan Niu, Hieu Dao, Jiangwei Chen, Chuan-Sheng Foo, Bryan Kian Hsiang Low | N/A | N/A |
| MASIVE: Open-Ended Affective State Identification in English and Spanish | Nicholas Deas, Elsbeth Turcan, Ivan Ernesto Perez Mejia, Kathleen McKeown | N/A | N/A |
| You Make me Feel like a Natural Question: Training QA Systems on Transformed Trivia Questions | Tasnim Kabir, Yoo Yeon Sung, Saptarashmi Bandyopadhyay, Hao Zou, Abhranil Chandra, Jordan Lee Boyd-Graber | N/A | N/A |
| AlphaExpert: Assigning LoRA Experts Based on Layer Training Quality | Peijun Qing, Chongyang Gao, Yefan Zhou, Xingjian Diao, Yaoqing Yang, Soroush Vosoughi | N/A | N/A |
| Flee the Flaw: Annotating the Underlying Logic of Fallacious Arguments Through Templates and Slot-filling | Irfan Robbani, Paul Reisert, Surawat Pothong, Naoya Inoue, Camélia Guerraoui, Wenzhi Wang, Shoichi Naito, Jungmin Choi, Kentaro Inui | N/A | N/A |
| Advancing Social Intelligence in AI Agents: Technical Challenges and Open Questions | Leena Mathur, Paul Pu Liang, Louis-Philippe Morency | N/A | N/A |
| RAt: Injecting Implicit Bias for Text-To-Image Prompt Refinement Models | Ziyi Kou, Shichao Pei, Meng Jiang, Xiangliang Zhang | N/A | N/A |
| Can LLM Generate Culturally Relevant Commonsense QA Data? Case Study in Indonesian and Sundanese | Rifki Afina Putri, Faiz Ghifari Haznitrama, Dea Adhista, Alice Oh | N/A | N/A |
| Learnability of Indirect Evidence in Language Models | Miyu Oba, Yohei Oseki, Akiyo Fukatsu, Akari Haga, Hiroki Ouchi, Taro Watanabe, Saku Sugawara | N/A | N/A |
| Do LLMs Know to Respect Copyright Notice? | Jialiang Xu, SHENGLAN LI, Zhaozhuo Xu, Denghui Zhang | N/A | N/A |
| SpecHub: Provable Acceleration to Multi-Draft Speculative Decoding | Hanchi Sun, Tianyi Zhou, Xun Chen, Lichao Sun | N/A | N/A |
| Interventional Speech Noise Injection for ASR Generalizable Spoken Language Understanding | YeonJoon Jung, Jaeseong Lee, Seungtaek Choi, Dohyeon Lee, Minsoo Kim, seung-won hwang | N/A | N/A |
| Rethinking the Role of Proxy Rewards in Language Model Alignment | Sungdong Kim, Minjoon Seo | N/A | N/A |
| Visual Text Matters: Improving Text-KVQA with Visual Text Entity Knowledge-aware Large Multimodal Assistant | Abhirama Subramanyam Penamakuri, Anand Mishra | N/A | N/A |
| How Good is my MT Metric? A Framework for the Interpretation of Metric Assessments | Stefano Perrella, Lorenzo Proietti, Pere-Lluís Huguet Cabot, Edoardo Barba, Roberto Navigli | N/A | N/A |
| IFCap: Image-like Retrieval and Frequency-based Entity Filtering for Zero-shot Captioning | Soeun Lee, Si-Woo Kim, Taewhan Kim, Dong-Jin Kim | N/A | N/A |
| SPREADSHEETLLM: Encoding Spreadsheets for Large Language Models | Haoyu Dong, Jianbo Zhao, Yuzhang Tian, Junyu Xiong, Shiyu Xia, Mengyu Zhou, Yun Lin, José Cambronero, Yeye He, Shi Han, Dongmei Zhang | N/A | N/A |
| Let’s discuss! Quality Dimensions and Annotated Datasets for Computational Argument Quality | Rositsa V Ivanova, Thomas Huber, Christina Niklaus | N/A | N/A |
| Automatic sentence segmentation of clinical record narratives in real-world data | Dongfang Xu, Davy Weissenbacher, Karen O’Connor, Siddharth Rawal, Graciela Gonzalez Hernandez | N/A | N/A |
| One-to-Many Communication and Compositionality in Emergent Communication | Heeyoung Lee | N/A | N/A |
| Bayesian Example Selection Improves In-Context Learning for Speech, Text, and Visual Modalities | Siyin Wang, Chao-Han Huck Yang, Ji Wu, Chao Zhang | N/A | N/A |
| Investigating Multilingual Instruction-Tuning: Do Polyglot Models Demand for Multilingual Instructions? | Alexander Arno Weber, Klaudia Thellmann, Jan Ebert, Nicolas Flores-Herr, Jens Lehmann, Michael Fromm, Mehdi Ali | N/A | N/A |
| Multi-LogiEval: Towards Evaluating Multi-Step Logical Reasoning Ability of Large Language Models | Nisarg Patel, Mohith Kulkarni, Mihir Parmar, Aashna Budhiraja, Mutsumi Nakamura, Neeraj Varshney, Chitta Baral | N/A | N/A |
| Contrastive Classification via Linear Layer Extrapolation | Mayukh Sharma, Sean O’Brien, Julian McAuley | N/A | N/A |
| Task Oriented In-Domain Data Augmentation | Xiao Liang, Xinyu Hu, Simiao Zuo, Yeyun Gong, Qiang Lou, Yi Liu, Shao-Lun Huang, Jian Jiao | N/A | N/A |
| SciDQA: A Deep Reading Comprehension Dataset over Scientific Papers | Shruti Singh, Nandan Sarkar, Arman Cohan | N/A | N/A |
| Mixture-of-Modules: Reinventing Transformers as Dynamic Assemblies of Modules | Zhuocheng Gong, Ang Lv, Jian Guan, Wei Wu, Huishuai Zhang, Minlie Huang, Dongyan Zhao, Rui Yan | N/A | N/A |
| No Culture Left Behind: ArtELingo-28, a Benchmark of WikiArt with Captions in 28 Languages | Youssef Mohamed, Runjia Li, Ibrahim Said Ahmad, Kilichbek Haydarov, Philip Torr, Kenneth Church, Mohamed Elhoseiny | N/A | N/A |
| PREDICT: Multi-Agent-based Debate Simulation for Generalized Hate Speech Detection | Someen Park, Jaehoon Kim, Seungwan Jin, Sohyun Park, Kyungsik Han | N/A | N/A |
| TokenVerse: Unifying Speech and NLP Tasks via Transducer-based ASR | Shashi Kumar, Srikanth Madikeri, Juan Pablo Zuluaga Gomez, Iuliia Thorbecke, Esaú VILLATORO-TELLO, Sergio Burdisso, Petr Motlicek, Karthik Pandia D S, Aravind Ganapathiraju | N/A | N/A |
| ApiQ: Finetuning of 2-Bit Quantized Large Language Model | Baohao Liao, Christian Herold, Shahram Khadivi, Christof Monz | N/A | N/A |
| Memorize Step by Step: Efficient Long-Context Prefilling with Incremental Memory and Decremental Chunk | Zhiyuan Zeng, Qipeng Guo, Xiaoran Liu, Zhangyue Yin, Wentao Shu, Mianqiu Huang, Bo Wang, Yunhua Zhou, Linlin Li, Qun Liu, Xipeng Qiu | N/A | N/A |
| A Morphology-Based Investigation of Positional Encodings | Poulami Ghosh, Shikhar Vashishth, Raj Dabre, Pushpak Bhattacharyya | N/A | N/A |
| I love pineapple on pizza != I hate pineapple on pizza: Stance-Aware Sentence Transformers for Opinion Mining | Vahid Ghafouri, Jose M. Such, Guillermo Suarez-Tangil | N/A | N/A |
| BiasWipe: Mitigating Unintended Bias in Text Classifiers through Model Interpretability | Mamta Mamta, Rishikant Chigrupaatii, Asif Ekbal | N/A | N/A |
| ArMeme: Propagandistic Content in Arabic Memes | Firoj Alam, Abul Hasnat, Fatema Ahmad, Md. Arid Hasan, Maram Hasanain | N/A | N/A |
| Language is Scary when Over-Analyzed: Unpacking Implied Misogynistic Reasoning with Argumentation Theory-Driven Prompts | Arianna Muti, Federico Ruggeri, Khalid Al Khatib, Alberto Barrón-Cedeño, Tommaso Caselli | N/A | N/A |
| Thoughts to Target: Enhance Planning for Target-driven Conversation | Zhonghua Zheng, Lizi Liao, Yang Deng, Ee-Peng Lim, Minlie Huang, Liqiang Nie | N/A | N/A |
| Scalable Data Ablation Approximations for Language Models through Modular Training and Merging | Clara Na, Ian Magnusson, Ananya Harsh Jha, Tom Sherborne, Emma Strubell, Jesse Dodge, Pradeep Dasigi | N/A | N/A |
| Exploring Intrinsic Language-specific Subspaces in Fine-tuning Multilingual Neural Machine Translation | Zhe Cao, Zhi Qu, Hidetaka Kamigaito, Taro Watanabe | N/A | N/A |
| Attention Score is not All You Need for Token Importance Indicator in KV Cache Reduction: Value Also Matters | Zhiyu Guo, Hidetaka Kamigaito, Taro Watanabe | N/A | N/A |
| Generative Subgraph Retrieval for Knowledge Graph–Grounded Dialog Generation | Jinyoung Park, Minseok Joo, Joo-Kyung Kim, Hyunwoo J. Kim | N/A | N/A |
| Adapters Mixup: Mixing Parameter-Efficient Adapters to Enhance the Adversarial Robustness of Fine-tuned Pre-trained Text Classifiers | Tuc Van Nguyen, Thai Le | N/A | N/A |
| Generalizing Clinical De-identification Models by Privacy-safe Data Augmentation using GPT-4 | Woojin Kim, Sungeun Hahm, Jaejin Lee | N/A | N/A |
| Connecting the Dots: Evaluating Abstract Reasoning Capabilities of LLMs Using the New York Times Connections Word Game | Prisha Samdarshi, Mariam Mustafa, Anushka Kulkarni, Raven Rothkopf, Tuhin Chakrabarty, Smaranda Muresan | N/A | N/A |
| GottBERT: a pure German Language Model | Raphael Scheible, Johann Frei, Fabian Thomczyk, Henry He, Patric Tippmann, Jochen Knaus, Victor Jaravine, Frank Kramer, Martin Boeker | N/A | N/A |
| Computational Meme Understanding: A Survey | Khoi P. N. Nguyen, Vincent Ng | N/A | N/A |
| CoverICL: Selective Annotation for In-Context Learning via Active Graph Coverage | Costas Mavromatis, Balasubramaniam Srinivasan, Zhengyuan Shen, Jiani Zhang, Huzefa Rangwala, Christos Faloutsos, George Karypis | N/A | N/A |
| Retrieval-enriched zero-shot image classification in low-resource domains | Nicola Dall’Asen, Yiming Wang, Enrico Fini, Elisa Ricci | N/A | N/A |
| I-AM-G: Interest Augmented Multimodal Generator for Item Personalization | Xianquan Wang, Likang Wu, Shukang Yin, Zhi Li, Yanjiang Chen, hufeng, Yu Su, Qi Liu | N/A | N/A |
| Twists, Humps, and Pebbles: Multilingual Speech Recognition Models Exhibit Gender Performance Gaps | Giuseppe Attanasio, Beatrice Savoldi, Dennis Fucci, Dirk Hovy | N/A | N/A |
| Enhancing Language Model Alignment: A Confidence-Based Approach to Label Smoothing | Baihe Huang, Hiteshi Sharma, Yi Mao | N/A | N/A |
| Contrastive Policy Gradient: Aligning LLMs on sequence-level scores in a supervised-friendly fashion | Yannis Flet-Berliac, Nathan Grinsztajn, Florian Strub, Eugene Choi, Bill Wu, Chris Cremer, Arash Ahmadian, Yash Chandak, Mohammad Gheshlaghi Azar, Olivier Pietquin, Matthieu Geist | N/A | N/A |
| Show and Guide: Instructional-Plan Grounded Vision and Language Model | Diogo Glória-Silva, David Semedo, Joao Magalhaes | N/A | N/A |
| Beyond Turn-Based Interfaces: Synchronous LLMs as Full-Duplex Dialogue Agents | Bandhav Veluri, Benjamin N Peloquin, Bokai YU, Hongyu Gong, Shyamnath Gollakota | N/A | N/A |
| QuBE: Question-based Belief Enhancement for Agentic LLM | Minsoo Kim, Jongyoon Kim, Jihyuk Kim, seung-won hwang | N/A | N/A |
| COMPACT: Compressing Retrieved Documents Actively for Question Answering | Chanwoong Yoon, Taewhoo Lee, Hyeon Hwang, Minbyul Jeong, Jaewoo Kang | N/A | N/A |
| An Empirical Analysis on Spatial Reasoning Capabilities of Large Multimodal Models | Fatemeh Shiri, Xiao-Yu Guo, Mona Golestan Far, Xin Yu, Reza Haf, Yuan-Fang Li | N/A | N/A |
| Synthetic Knowledge Ingestion: Towards Knowledge Refinement and Injection for Enhancing Large Language Models | Jiaxin Zhang, Wendi Cui, Yiran Huang, Kamalika Das, Sricharan Kumar | N/A | N/A |
| Local Contrastive Editing of Gender Stereotypes | Marlene Lutz, Rochelle Choenni, Markus Strohmaier, Anne Lauscher | N/A | N/A |
| De-Identification of Sensitive Personal Data in Datasets Derived from IIT-CDIP | Stefan Larson, Nicole Cornehl Lima, Santiago Pedroza Diaz, Amogh Manoj Joshi, Siddharth Betala, Jamiu Tunde Suleiman, Yash Mathur, Kaushal Kumar Prajapati, Ramla Alakraa, Junjie Shen, Temi Okotore, Kevin Leach | N/A | N/A |
| RAR: Retrieval Augmented Retrieval for Code Generation in Low Resource Languages | Avik Dutta, Mukul Singh, Gust Verbruggen, Sumit Gulwani, Vu Le | N/A | N/A |
| STAR: SocioTechnical Approach to Red Teaming Language Models | Laura Weidinger, John F J Mellor, Bernat Guillén Pegueroles, Nahema Marchal, Ravin Kumar, Kristian Lum, Canfer Akbulut, Mark Diaz, A. Stevie Bergman, Mikel D. Rodriguez, Verena Rieser, William Isaac | N/A | N/A |
| Do great minds think alike? Investigating Human-AI Complementarity for Question Answering | Maharshi Gor, Hal Daumé III, Tianyi Zhou, Jordan Lee Boyd-Graber | N/A | N/A |
| Memory-Efficient Fine-Tuning of Transformers via Token Selection | Antoine Simoulin, Namyong Park, Xiaoyi Liu, Grey Yang | N/A | N/A |
| Unveiling the mystery of visual attributes of concrete and abstract concepts: Variability, nearest neighbors, and challenging categories | Tarun Tater, Sabine Schulte im Walde, Diego Frassinelli | N/A | N/A |
| Evaluating Large Language Models on Time Series Feature Understanding: A Comprehensive Taxonomy and Benchmark | Elizabeth Fons, Rachneet Kaur, Soham Palande, Zhen Zeng, Tucker Balch, Manuela Veloso, Svitlana Vyetrenko | N/A | N/A |
| Can LLMs Learn Uncertainty on Their Own? Expressing Uncertainty Effectively in A Self-Training Manner | Shudong Liu, Zhaocong Li, Xuebo Liu, Runzhe Zhan, Derek F. Wong, Lidia S. Chao, Min zhang | N/A | N/A |
| Preference-Guided Reflective Sampling for Aligning Language Models | Hai Ye, Hwee Tou Ng | N/A | N/A |
| Metrics for What, Metrics for Whom: Assessing Actionability of Bias Evaluation Metrics in NLP | Pieter Delobelle, Giuseppe Attanasio, Debora Nozza, Su Lin Blodgett, Zeerak Talat | N/A | N/A |
| Is this the real life? Is this just fantasy? The Misleading Success of Simulating Social Interactions With LLMs | Xuhui Zhou, Zhe Su, Tiwalayo Eisape, Hyunwoo Kim, Maarten Sap | N/A | N/A |
| A Simple LLM Framework for Long-Range Video Question-Answering | Ce Zhang, Taixi Lu, Md Mohaiminul Islam, Ziyang Wang, Shoubin Yu, Mohit Bansal, Gedas Bertasius | N/A | N/A |
| Rebuilding ROME: Resolving Model Collapse during Sequential Model Editing | Akshat Gupta, Sidharth Baskaran, Gopala Anumanchipalli | N/A | N/A |
| Casablanca: Data and Models for Multidialectal Arabic Speech Recognition | Bashar Talafha, Karima Kadaoui, Samar Mohamed Magdy, Mariem Habiboullah, Chafei Mohamed Chafei, Ahmed Oumar El-Shangiti, Hiba Zayed, Mohamedou cheikh tourad, Rahaf Alhamouri, Rwaa Assi, Aisha Alraeesi, Hour Mohamed, Fakhraddin Alwajih, Abdelrahman Mohamed, Abdellah EL MEKKI, El Moatez Billah Nagoudi, Benelhadj Djelloul Mama Saadia, Hamzah A. Alsayadi, Walid Al-Dhabyani, Sara Shatnawi, Yasir ECH-CHAMMAKHY, AMAL MAKOUAR, Yousra Berrachedi, Mustafa Jarrar, Shady Shehata, Ismail Berrada, Muhammad Abdul-Mageed | N/A | N/A |
| Safety Arithmetic: A Framework for Test-time Safety Alignment of Language Models by Steering Parameters and Activations | Rima Hazra, Sayan Layek, Somnath Banerjee, Soujanya Poria | N/A | N/A |
| Communicating with Speakers and Listeners of Different Pragmatic Levels | Kata Naszadi, Frans A Oliehoek, Christof Monz | N/A | N/A |
| RECANTFormer: Referring Expression Comprehension with Varying Numbers of Targets | Bhathiya Hemanthage, Hakan Bilen, Phil Bartie, Christian Dondrup, Oliver Lemon | N/A | N/A |
| Sprout: Green Generative AI with Carbon-Efficient LLM Inference | Baolin Li, Yankai Jiang, Vijay Gadepally, Devesh Tiwari | N/A | N/A |
| Do LLMs Plan Like Human Writers? Comparing Journalist Coverage of Press Releases with LLMs | Alexander Spangher, Nanyun Peng, Sebastian Gehrmann, Mark Dredze | N/A | N/A |
| T-FREE: Tokenizer-Free Generative LLMs via Sparse Representations for Memory-Efficient Embeddings | Björn Deiseroth, Manuel Brack, Samuel Weinbach, Patrick Schramowski, Kristian Kersting | N/A | N/A |
| SpeechQE: Estimating the Quality of Direct Speech Translation | HyoJung Han, Kevin Duh, Marine Carpuat | N/A | N/A |
| Assessing and Verifying Task Utility in LLM-Powered Applications | Negar Arabzadeh, Siqing Huo, Nikhil Mehta, Qingyun Wu, Chi Wang, Ahmed Hassan Awadallah, Charles L. A. Clarke, Julia Kiseleva | N/A | N/A |
| Dynamic Rewarding with Prompt Optimization Enables Tuning-free Self-Alignment of Language Models | Somanshu Singla, Zhen Wang, Tianyang Liu, Abdullah Ashfaq, Zhiting Hu, Eric P. Xing | N/A | N/A |
| Accurate and Data-Efficient Toxicity Prediction when Annotators Disagree | Harbani Jaggi, Kashyap Coimbatore Murali, Eve Fleisig, Erdem Biyik | N/A | N/A |
| Adversarial Text Generation using Large Language Models for Dementia Detection | Youxiang Zhu, Nana Lin, Kiran Sandilya Balivada, Daniel Haehn, Xiaohui Liang | N/A | N/A |
| xCOMET-lite: Bridging the Gap Between Efficiency and Quality in Learned MT Evaluation Metrics | Daniil Larionov, Mikhail Seleznyov, Vasiliy Viskov, Alexander Panchenko, Steffen Eger | N/A | N/A |
| The Greatest Good Benchmark: Measuring LLMs’ Alignment with Utilitarian Moral Dilemmas | Giovanni Franco Gabriel Marraffini, Andrés Cotton, Noe Fabian Hsueh, Juan Wisznia, Axel Fridman, Luciano Del Corro | N/A | N/A |
| FairFlow: Mitigating Dataset Biases through Undecided Learning for Natural Language Understanding | Jiali Cheng, Hadi Amiri | N/A | N/A |
| Style-Shifting Behaviour of the Manosphere on Reddit | Jai Aggarwal, Suzanne Stevenson | N/A | N/A |
| The Death and Life of Great Prompts: Analyzing the Evolution of LLM Prompts from the Structural Perspective | Yihan Ma, Xinyue Shen, Yixin Wu, Boyang Zhang, Michael Backes, Yang Zhang | N/A | N/A |
| Holistic Evaluation for Interleaved Text-and-Image Generation | Minqian Liu, Zhiyang Xu, Zihao Lin, Trevor Ashby, Joy Rimchala, Jiaxin Zhang, Lifu Huang | N/A | N/A |
| FOLIO: Natural Language Reasoning with First-Order Logic | SIMENG HAN, Hailey Schoelkopf, Yilun Zhao, Zhenting Qi, Martin Riddell, Wenfei Zhou, James Coady, David Peng, Yujie Qiao, Luke Benson, Lucy Sun, Alexander Wardle-Solano, Hannah Szabó, Ekaterina Zubova, Matthew Burtell, Jonathan Fan, Yixin Liu, Brian Wong, Malcolm Sailor, Ansong Ni, Linyong Nan, Jungo Kasai, Tao Yu, Rui Zhang, Alexander Fabbri, Wojciech Maciej Kryscinski, Semih Yavuz, Ye Liu, Xi Victoria Lin, Shafiq Joty, Yingbo Zhou, Caiming Xiong, Rex Ying, Arman Cohan, Dragomir Radev | N/A | N/A |
| The LLM Effect: Are Humans Truly Using LLMs, or Are They Being Influenced By Them Instead? | Alexander Choi, Syeda Sabrina Akter, J.P. Singh, Antonios Anastasopoulos | N/A | N/A |
| Is Child-Directed Speech Effective Training Data for Language Models? | Steven Y. Feng, Noah Goodman, Michael Frank | N/A | N/A |
| RevMUX: Data Multiplexing with Reversible Adapters for Efficient LLM Batch Inference | Yige Xu, Xu Guo, Zhiwei Zeng, Chunyan Miao | N/A | N/A |
| HCEG: Improving the Abstraction Ability of Language Models with Hierarchical Conceptual Entailment Graphs | Juncai Li, Ru Li, Xiaoli Li, Qinghua Chai, Jeff Z. Pan | N/A | N/A |
| M3Hop-CoT: Misogynous Meme Identification with Multimodal Multi-hop Chain-of-Thought | Gitanjali Kumari, Kirtan Jain, Asif Ekbal | N/A | N/A |
| GPT-4 Jailbreaks Itself with Near-Perfect Success Using Self-Explanation | Govind Ramesh, Yao Dou, Wei Xu | N/A | N/A |
| RE-RAG: Improving Open-Domain QA Performance and Interpretability with Relevance Estimator in Retrieval-Augmented Generation | Kiseung Kim, Jay-Yoon Lee | N/A | N/A |
| Evaluating Concurrent Robustness of Language Models Across Diverse Challenge Sets | Vatsal Gupta, Pranshu Pandya, Tushar Kataria, Vivek Gupta, Dan Roth | N/A | N/A |
| Simul-MuST-C: Simultaneous Multilingual Speech Translation Corpus Using Large Language Model | Mana Makinae, Yusuke Sakai, Hidetaka Kamigaito, Taro Watanabe | N/A | N/A |
| Is This a Bad Table? A Closer Look at the Evaluation of Table Generation from Text | Pritika Ramu, Aparna Garimella, Sambaran Bandyopadhyay | N/A | N/A |
| On the Fragility of Active Learners for Text Classification | Abhishek Ghose, Emma Thuong Nguyen | N/A | N/A |
| BMRetriever: Tuning Large Language Models as Better Biomedical Text Retrievers | Ran Xu, Wenqi Shi, Yue Yu, Yuchen Zhuang, Yanqiao Zhu, May Dongmei Wang, Joyce C. Ho, Chao Zhang, Carl Yang | N/A | N/A |
| Comparing Neighbors Together Makes it Easy: Jointly Comparing Multiple Candidates for Efficient and Effective Retrieval | Jonghyun Song, Cheyon Jin, Wenlong Zhao, Jay-Yoon Lee | N/A | N/A |
| M3D: MultiModal MultiDocument Fine-Grained Inconsistency Detection | Chia-Wei Tang, Ting-Chih Chen, Alvi Md Ishmam, Kiet A. Nguyen, Kazi Sajeed Mehrab, Chris Thomas | N/A | N/A |
| MedAdapter: Efficient Test-Time Adaptation of Large Language Models Towards Medical Reasoning | Wenqi Shi, Ran Xu, Yuchen Zhuang, Yue Yu, Haotian Sun, Hang Wu, Carl Yang, May Dongmei Wang | N/A | N/A |
| EHRAgent: Code Empowers Large Language Models for Few-shot Complex Tabular Reasoning on Electronic Health Records | Wenqi Shi, Ran Xu, Yuchen Zhuang, Yue Yu, Jieyu Zhang, Hang Wu, Yuanda Zhu, Joyce C. Ho, Carl Yang, May Dongmei Wang | N/A | N/A |
| SimLLM: Detecting Sentences Generated by Large Language Models Using Similarity between the Generation and its Re-generation | Hoang-Quoc Nguyen-Son, Minh-Son Dao, Koji Zettsu | N/A | N/A |
| CELLO: Causal Evaluation of Large Vision-Language Models | Meiqi Chen, Bo Peng, Yan Zhang, Chaochao Lu | N/A | N/A |
| Simultaneous Interpretation Corpus Construction by Large Language Models in Distant Language Pair | Yusuke Sakai, Mana Makinae, Hidetaka Kamigaito, Taro Watanabe | N/A | N/A |
| Training-free Deep Concept Injection Enables Language Models for Video Question Answering | Xudong Lin, Manling Li, Richard Zemel, Heng Ji, Shih-Fu Chang | N/A | N/A |
| MIBench: Evaluating Multimodal Large Language Models over Multiple Images | Haowei Liu, Xi Zhang, Haiyang Xu, Yaya Shi, Chaoya Jiang, Ming Yan, Ji Zhang, Fei Huang, Chunfeng Yuan, Bing Li, Weiming Hu | N/A | N/A |
| ZEBRA: Zero-Shot Example-Based Retrieval Augmentation for Commonsense Question Answering | Francesco Maria Molfese, Simone Conia, Riccardo Orlando, Roberto Navigli | N/A | N/A |
| ABLE: Personalized Disability Support with Politeness and Empathy Integration | Kshitij Mishra, Manisha Burja, Asif Ekbal | N/A | N/A |
| Language Models as Compilers: Simulating Pseudocode Execution Improves Algorithmic Reasoning in Language Models | Hyungjoo Chae, Yeonghyeon Kim, Seungone Kim, Kai Tzu-iunn Ong, Beong-woo Kwak, Moohyeon Kim, Sunghwan Kim, Taeyoon Kwon, Jiwan Chung, Youngjae Yu, Jinyoung Yeo | N/A | N/A |
| Coffee-Gym: An Environment for Evaluating and Improving Natural Language Feedback on Erroneous Code | Hyungjoo Chae, Taeyoon Kwon, Seungjun Moon, Yongho Song, Dongjin Kang, Kai Tzu-iunn Ong, Beong-woo Kwak, Seonghyeon Bae, seung-won hwang, Jinyoung Yeo | N/A | N/A |
| Improving Minimum Bayes Risk Decoding with Multi-Prompt | David Heineman, Yao Dou, Wei Xu | N/A | N/A |
| Deciphering Cognitive Distortions in Patient-Doctor Mental Health Conversations: A Multimodal LLM-Based Detection and Reasoning Framework | Gopendra Vikram Singh, Sai Vardhan Vemulapalli, Mauajama Firdaus, Asif Ekbal | N/A | N/A |
| Nearest Neighbor Normalization Improves Multimodal Retrieval | Neil Chowdhury, Franklin Wang, Sumedh Shenoy, Douwe Kiela, Sarah Schwettmann, Tristan Thrush | N/A | N/A |
| Rethinking Pragmatics in Large Language Models: Towards Open-Ended Evaluation and Preference Tuning | Shengguang Wu, Shusheng Yang, Zhenglun Chen, Qi Su | N/A | N/A |
| LongRAG: A Dual-perspective Retrieval-Augmented Generation Paradigm for Long-Context Question Answering | Qingfei Zhao, Ruobing Wang, Yukuo Cen, Daren Zha, Shicheng Tan, Yuxiao Dong, Jie Tang | N/A | N/A |
| Context-aware Watermark with Semantic Balanced Green-red Lists for Large Language Models | Yuxuan Guo, Zhiliang Tian, Yiping Song, Tianlun Liu, Liang Ding, Dongsheng Li | N/A | N/A |
| Knowledge Graph Enhanced Large Language Model Editing | Mengqi Zhang, Xiaotian Ye, Qiang Liu, Pengjie Ren, Shu Wu, Zhumin Chen | N/A | N/A |
| ‘Quis custodiet ipsos custodes?’ Who will watch the watchmen? On Detecting AI-generated peer-reviews | Sandeep Kumar, Mohit Sahu, Vardhan Gacche, Tirthankar Ghosal, Asif Ekbal | N/A | N/A |
| Mitigating Open-Vocabulary Caption Hallucinations | Assaf Ben-Kish, Moran Yanuka, Morris Alper, Raja Giryes, Hadar Averbuch-Elor | N/A | N/A |
| Initialization of Large Language Models via Reparameterization to Mitigate Loss Spikes | Kosuke Nishida, Kyosuke Nishida, Kuniko Saito | N/A | N/A |
| ALVIN: Active Learning Via INterpolation | Michalis Korakakis, Andreas Vlachos | N/A | N/A |
| Filtered Direct Preference Optimization | Tetsuro Morimura, Mitsuki Sakamoto, Yuu Jinnai, Kenshi Abe, Kaito Ariu | N/A | N/A |
| Instruction Fine-Tuning: Does Prompt Loss Matter? | Mathew Huerta-Enochian, Seung Yong Ko | N/A | N/A |
| Entity Insertion in Multilingual Linked Corpora: The Case of Wikipedia | Tomás Feith, Akhil Arora, Martin Gerlach, Debjit Paul, Robert West | N/A | N/A |
ICCV 2021
| Title | Author | PDF_Link | Code_URL |
|---|---|---|---|
| AdaSGN: Adapting Joint Number and Model Size for Efficient Skeleton-Based Action Recognition | Lei Shi, Yifan Zhang, Jian Cheng, Hanqing Lu | N/A | |
| C2N: Practical Generative Noise Modeling for Real-World Denoising | Geonwoon Jang, Wooseok Lee, Sanghyun Son, Kyoung Mu Lee | N/A | |
| Continual Learning on Noisy Data Streams via Self-Purified Replay | Chris Dongjoo Kim, Jinseo Jeong, Sangwoo Moon, Gunhee Kim | N/A | |
| FOVEA: Foveated Image Magnification for Autonomous Navigation | Chittesh Thavamani, Mengtian Li, Nicolas Cebron, Deva Ramanan | N/A | |
| PlenOctrees for Real-Time Rendering of Neural Radiance Fields | Alex Yu, Ruilong Li, Matthew Tancik, Hao Li, Ren Ng, Angjoo Kanazawa | N/A | |
| Entropy Maximization and Meta Classification for Out-of-Distribution Detection in Semantic Segmentation | Robin Chan, Matthias Rottmann, Hanno Gottschalk | N/A | |
| Specificity-Preserving RGB-D Saliency Detection | Tao Zhou, Huazhu Fu, Geng Chen, Yi Zhou, Deng-Ping Fan, Ling Shao | N/A | |
| 3DVG-Transformer: Relation Modeling for Visual Grounding on Point Clouds | Lichen Zhao, Daigang Cai, Lu Sheng, Dong Xu | N/A | |
| 4D-Net for Learned Multi-Modal Alignment | AJ Piergiovanni, Vincent Casser, Michael S. Ryoo, Anelia Angelova | N/A | |
| Patch Craft: Video Denoising by Deep Modeling and Patch Matching | Gregory Vaksman, Michael Elad, Peyman Milanfar | N/A | |
| Image Manipulation Detection by Multi-View Multi-Scale Supervision | Xinru Chen, Chengbo Dong, Jiaqi Ji, Juan Cao, Xirong Li | N/A | |
| Perturbed Self-Distillation: Weakly Supervised Large-Scale Point Cloud Semantic Segmentation | Yachao Zhang, Yanyun Qu, Yuan Xie, Zonghao Li, Shanshan Zheng, Cuihua Li | N/A | |
| Cherry-Picking Gradients: Learning Low-Rank Embeddings of Visual Data via Differentiable Cross-Approximation | Mikhail Usvyatsov, Anastasia Makarova, Rafael Ballester-Ripoll, Maxim Rakhuba, Andreas Krause, Konrad Schindler | N/A | |
| Ask&Confirm: Active Detail Enriching for Cross-Modal Retrieval With Partial Query | Guanyu Cai, Jun Zhang, Xinyang Jiang, Yifei Gong, Lianghua He, Fufu Yu, Pai Peng, Xiaowei Guo, Feiyue Huang, Xing Sun | N/A | |
| EventHands: Real-Time Neural 3D Hand Pose Estimation From an Event Stream | Viktor Rudnev, Vladislav Golyanik, Jiayi Wang, Hans-Peter Seidel, Franziska Mueller, Mohamed Elgharib, Christian Theobalt | N/A | |
| Composable Augmentation Encoding for Video Representation Learning | Chen Sun, Arsha Nagrani, Yonglong Tian, Cordelia Schmid | N/A | |
| Exploiting Explanations for Model Inversion Attacks | Xuejun Zhao, Wencan Zhang, Xiaokui Xiao, Brian Lim | N/A | |
| Semantic Diversity Learning for Zero-Shot Multi-Label Classification | Avi Ben-Cohen, Nadav Zamir, Emanuel Ben-Baruch, Itamar Friedman, Lihi Zelnik-Manor | N/A | |
| Describing and Localizing Multiple Changes With Transformers | Yue Qiu, Shintaro Yamamoto, Kodai Nakashima, Ryota Suzuki, Kenji Iwata, Hirokatsu Kataoka, Yutaka Satoh | N/A | |
| Score-Based Point Cloud Denoising | Shitong Luo, Wei Hu | N/A | |
| Panoptic Segmentation of Satellite Image Time Series With Convolutional Temporal Attention Networks | Vivien Sainte Fare Garnot, Loic Landrieu | N/A | |
| Focus on the Positives: Self-Supervised Learning for Biodiversity Monitoring | Omiros Pantazis, Gabriel J. Brostow, Kate E. Jones, Oisin Mac Aodha | N/A | |
| Bridging Unsupervised and Supervised Depth From Focus via All-in-Focus Supervision | Ning-Hsu Wang, Ren Wang, Yu-Lun Liu, Yu-Hao Huang, Yu-Lin Chang, Chia-Ping Chen, Kevin Jou | N/A | |
| Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction Without Convolutions | Wenhai Wang, Enze Xie, Xiang Li, Deng-Ping Fan, Kaitao Song, Ding Liang, Tong Lu, Ping Luo, Ling Shao | N/A | |
| DOLG: Single-Stage Image Retrieval With Deep Orthogonal Fusion of Local and Global Features | Min Yang, Dongliang He, Miao Fan, Baorong Shi, Xuetong Xue, Fu Li, Errui Ding, Jizhou Huang | N/A | |
| Light Source Guided Single-Image Flare Removal From Unpaired Data | Xiaotian Qiao, Gerhard P. Hancke, Rynson W.H. Lau | N/A | |
| Learning Bias-Invariant Representation by Cross-Sample Mutual Information Minimization | Wei Zhu, Haitian Zheng, Haofu Liao, Weijian Li, Jiebo Luo | N/A | |
| Selective Feature Compression for Efficient Activity Recognition Inference | Chunhui Liu, Xinyu Li, Hao Chen, Davide Modolo, Joseph Tighe | N/A | |
| Attention-Based Multi-Reference Learning for Image Super-Resolution | Marco Pesavento, Marco Volino, Adrian Hilton | N/A | |
| Spatial-Temporal Transformer for Dynamic Scene Graph Generation | Yuren Cong, Wentong Liao, Hanno Ackermann, Bodo Rosenhahn, Michael Ying Yang | N/A | |
| Deep Transport Network for Unsupervised Video Object Segmentation | Kaihua Zhang, Zicheng Zhao, Dong Liu, Qingshan Liu, Bo Liu | N/A | |
| RDI-Net: Relational Dynamic Inference Networks | Huanyu Wang, Songyuan Li, Shihao Su, Zequn Qin, Xi Li | N/A | |
| Densely Guided Knowledge Distillation Using Multiple Teacher Assistants | Wonchul Son, Jaemin Na, Junyong Choi, Wonjun Hwang | N/A | |
| Pi-NAS: Improving Neural Architecture Search by Reducing Supernet Training Consistency Shift | Jiefeng Peng, Jiqi Zhang, Changlin Li, Guangrun Wang, Xiaodan Liang, Liang Lin | N/A | |
| ARAPReg: An As-Rigid-As Possible Regularization Loss for Learning Deformable Shape Generators | Qixing Huang, Xiangru Huang, Bo Sun, Zaiwei Zhang, Junfeng Jiang, Chandrajit Bajaj | N/A | |
| Online Refinement of Low-Level Feature Based Activation Map for Weakly Supervised Object Localization | Jinheng Xie, Cheng Luo, Xiangping Zhu, Ziqi Jin, Weizeng Lu, Linlin Shen | N/A | |
| Grounding Consistency: Distilling Spatial Common Sense for Precise Visual Relationship Detection | Markos Diomataris, Nikolaos Gkanatsios, Vassilis Pitsikalis, Petros Maragos | N/A | |
| Long-Term Temporally Consistent Unpaired Video Translation From Simulated Surgical 3D Data | Dominik Rivoir, Micha Pfeiffer, Reuben Docea, Fiona Kolbinger, Carina Riediger, Jürgen Weitz, Stefanie Speidel | N/A | |
| Bridging the Gap Between Label- and Reference-Based Synthesis in Multi-Attribute Image-to-Image Translation | Qiusheng Huang, Zhilin Zheng, Xueqi Hu, Li Sun, Qingli Li | N/A | |
| A Broad Study on the Transferability of Visual Representations With Contrastive Learning | Ashraful Islam, Chun-Fu (Richard) Chen, Rameswar Panda, Leonid Karlinsky, Richard Radke, Rogerio Feris | N/A | |
| TempNet: Online Semantic Segmentation on Large-Scale Point Cloud Series | Yunsong Zhou, Hongzi Zhu, Chunqin Li, Tiankai Cui, Shan Chang, Minyi Guo | N/A | |
| Bayesian Deep Basis Fitting for Depth Completion With Uncertainty | Chao Qu, Wenxin Liu, Camillo J. Taylor | N/A | |
| Query Adaptive Few-Shot Object Detection With Heterogeneous Graph Convolutional Networks | Guangxing Han, Yicheng He, Shiyuan Huang, Jiawei Ma, Shih-Fu Chang | N/A | |
| ResRep: Lossless CNN Pruning via Decoupling Remembering and Forgetting | Xiaohan Ding, Tianxiang Hao, Jianchao Tan, Ji Liu, Jungong Han, Yuchen Guo, Guiguang Ding | N/A | |
| P2-Net: Joint Description and Detection of Local Features for Pixel and Point Matching | Bing Wang, Changhao Chen, Zhaopeng Cui, Jie Qin, Chris Xiaoxuan Lu, Zhengdi Yu, Peijun Zhao, Zhen Dong, Fan Zhu, Niki Trigoni, Andrew Markham | N/A | |
| Generalize Then Adapt: Source-Free Domain Adaptive Semantic Segmentation | Jogendra Nath Kundu, Akshay Kulkarni, Amit Singh, Varun Jampani, R. Venkatesh Babu | N/A | |
| Cross-Modality Person Re-Identification via Modality Confusion and Center Aggregation | Xin Hao, Sanyuan Zhao, Mang Ye, Jianbing Shen | N/A | |
| T-Net: Effective Permutation-Equivariant Network for Two-View Correspondence Learning | Zhen Zhong, Guobao Xiao, Linxin Zheng, Yan Lu, Jiayi Ma | N/A | |
| Temporal Cue Guided Video Highlight Detection With Low-Rank Audio-Visual Fusion | Qinghao Ye, Xiyue Shen, Yuan Gao, Zirui Wang, Qi Bi, Ping Li, Guang Yang | N/A | |
| Adversarial VQA: A New Benchmark for Evaluating the Robustness of VQA Models | Linjie Li, Jie Lei, Zhe Gan, Jingjing Liu | N/A | |
| S3VAADA: Submodular Subset Selection for Virtual Adversarial Active Domain Adaptation | Harsh Rangwani, Arihant Jain, Sumukh K Aithal, R. Venkatesh Babu | N/A | |
| Cross-Sentence Temporal and Semantic Relations in Video Activity Localisation | Jiabo Huang, Yang Liu, Shaogang Gong, Hailin Jin | N/A | |
| StructDepth: Leveraging the Structural Regularities for Self-Supervised Indoor Depth Estimation | Boying Li, Yuan Huang, Zeyu Liu, Danping Zou, Wenxian Yu | N/A | |
| Feature Interactive Representation for Point Cloud Registration | Bingli Wu, Jie Ma, Gaojie Chen, Pei An | N/A | |
| Walk in the Cloud: Learning Curves for Point Clouds Shape Analysis | Tiange Xiang, Chaoyi Zhang, Yang Song, Jianhui Yu, Weidong Cai | N/A | |
| LSG-CPD: Coherent Point Drift With Local Surface Geometry for Point Cloud Registration | Weixiao Liu, Hongtao Wu, Gregory S. Chirikjian | N/A | |
| ISD: Self-Supervised Learning by Iterative Similarity Distillation | Ajinkya Tejankar, Soroush Abbasi Koohpayegani, Vipin Pillai, Paolo Favaro, Hamed Pirsiavash | N/A | |
| An Empirical Study of the Collapsing Problem in Semi-Supervised 2D Human Pose Estimation | Rongchang Xie, Chunyu Wang, Wenjun Zeng, Yizhou Wang | N/A | |
| Self-Supervised Neural Networks for Spectral Snapshot Compressive Imaging | Ziyi Meng, Zhenming Yu, Kun Xu, Xin Yuan | N/A | |
| Group-Aware Contrastive Regression for Action Quality Assessment | Xumin Yu, Yongming Rao, Wenliang Zhao, Jiwen Lu, Jie Zhou | N/A | |
| The Road To Know-Where: An Object-and-Room Informed Sequential BERT for Indoor Vision-Language Navigation | Yuankai Qi, Zizheng Pan, Yicong Hong, Ming-Hsuan Yang, Anton van den Hengel, Qi Wu | N/A | |
| Support-Set Based Cross-Supervision for Video Grounding | Xinpeng Ding, Nannan Wang, Shiwei Zhang, De Cheng, Xiaomeng Li, Ziyuan Huang, Mingqian Tang, Xinbo Gao | N/A | |
| Sampling Network Guided Cross-Entropy Method for Unsupervised Point Cloud Registration | Haobo Jiang, Yaqi Shen, Jin Xie, Jun Li, Jianjun Qian, Jian Yang | N/A | |
| Voxel-Based Network for Shape Completion by Leveraging Edge Generation | Xiaogang Wang, Marcelo H Ang, Gim Hee Lee | N/A | |
| THUNDR: Transformer-Based 3D Human Reconstruction With Markers | Mihai Zanfir, Andrei Zanfir, Eduard Gabriel Bazavan, William T. Freeman, Rahul Sukthankar, Cristian Sminchisescu | N/A | |
| OadTR: Online Action Detection With Transformers | Xiang Wang, Shiwei Zhang, Zhiwu Qing, Yuanjie Shao, Zhengrong Zuo, Changxin Gao, Nong Sang | N/A | |
| Instance-Level Image Retrieval Using Reranking Transformers | Fuwen Tan, Jiangbo Yuan, Vicente Ordonez | N/A | |
| Mutual-Complementing Framework for Nuclei Detection and Segmentation in Pathology Image | Zunlei Feng, Zhonghua Wang, Xinchao Wang, Yining Mao, Thomas Li, Jie Lei, Yuexuan Wang, Mingli Song | N/A | |
| Accelerating Atmospheric Turbulence Simulation via Learned Phase-to-Space Transform | Zhiyuan Mao, Nicholas Chimitt, Stanley H. Chan | N/A | |
| Graph Constrained Data Representation Learning for Human Motion Segmentation | Mariella Dimiccoli, Lluís Garrido, Guillem Rodriguez-Corominas, Herwig Wendt | N/A | |
| Spatial and Semantic Consistency Regularizations for Pedestrian Attribute Recognition | Jian Jia, Xiaotang Chen, Kaiqi Huang | N/A | |
| Learning To Stylize Novel Views | Hsin-Ping Huang, Hung-Yu Tseng, Saurabh Saini, Maneesh Singh, Ming-Hsuan Yang | N/A | |
| Morphable Detector for Object Detection on Demand | Xiangyun Zhao, Xu Zou, Ying Wu | N/A | |
| Stacked Homography Transformations for Multi-View Pedestrian Detection | Liangchen Song, Jialian Wu, Ming Yang, Qian Zhang, Yuan Li, Junsong Yuan | N/A | |
| Env-QA: A Video Question Answering Benchmark for Comprehensive Understanding of Dynamic Environments | Difei Gao, Ruiping Wang, Ziyi Bai, Xilin Chen | N/A | |
| Region-Aware Contrastive Learning for Semantic Segmentation | Hanzhe Hu, Jinshi Cui, Liwei Wang | N/A | |
| Image Retrieval on Real-Life Images With Pre-Trained Vision-and-Language Models | Zheyuan Liu, Cristian Rodriguez-Opazo, Damien Teney, Stephen Gould | N/A | |
| Self-Supervised Real-to-Sim Scene Generation | Aayush Prakash, Shoubhik Debnath, Jean-Francois Lafleche, Eric Cameracci, Gavriel State, Stan Birchfield, Marc T. Law | N/A | |
| GP-S3Net: Graph-Based Panoptic Sparse Semantic Segmentation Network | Ryan Razani, Ran Cheng, Enxu Li, Ehsan Taghavi, Yuan Ren, Liu Bingbing | N/A | |
| Learning From Noisy Data With Robust Representation Learning | Junnan Li, Caiming Xiong, Steven C.H. Hoi | N/A | |
| Self-Supervised 3D Skeleton Action Representation Learning With Motion Consistency and Continuity | Yukun Su, Guosheng Lin, Qingyao Wu | N/A | |
| Feature Importance-Aware Transferable Adversarial Attacks | Zhibo Wang, Hengchang Guo, Zhifei Zhang, Wenxin Liu, Zhan Qin, Kui Ren | N/A | |
| Exploring Classification Equilibrium in Long-Tailed Object Detection | Chengjian Feng, Yujie Zhong, Weilin Huang | N/A | |
| Meta Gradient Adversarial Attack | Zheng Yuan, Jie Zhang, Yunpei Jia, Chuanqi Tan, Tao Xue, Shiguang Shan | N/A | |
| Differentiable Convolution Search for Point Cloud Processing | Xing Nie, Yongcheng Liu, Shaohong Chen, Jianlong Chang, Chunlei Huo, Gaofeng Meng, Qi Tian, Weiming Hu, Chunhong Pan | N/A | |
| Zero-Shot Day-Night Domain Adaptation With a Physics Prior | Attila Lengyel, Sourav Garg, Michael Milford, Jan C. van Gemert | N/A | |
| Sketch Your Own GAN | Sheng-Yu Wang, David Bau, Jun-Yan Zhu | N/A | |
| Minimal Solutions for Panoramic Stitching Given Gravity Prior | Yaqing Ding, Daniel Barath, Zuzana Kukelova | N/A | |
| iPOKE: Poking a Still Image for Controlled Stochastic Video Synthesis | Andreas Blattmann, Timo Milbich, Michael Dorkenwald, Björn Ommer | N/A | |
| Neural Radiance Flow for 4D View Synthesis and Video Processing | Yilun Du, Yinan Zhang, Hong-Xing Yu, Joshua B. Tenenbaum, Jiajun Wu | N/A | |
| Assignment-Space-Based Multi-Object Tracking and Segmentation | Anwesa Choudhuri, Girish Chowdhary, Alexander G. Schwing | N/A | |
| Vi2CLR: Video and Image for Visual Contrastive Learning of Representation | Ali Diba, Vivek Sharma, Reza Safdari, Dariush Lotfi, Saquib Sarfraz, Rainer Stiefelhagen, Luc Van Gool | N/A | |
| R-MSFM: Recurrent Multi-Scale Feature Modulation for Monocular Depth Estimating | Zhongkai Zhou, Xinnan Fan, Pengfei Shi, Yuanxue Xin | N/A | |
| Spatially Conditioned Graphs for Detecting Human-Object Interactions | Frederic Z. Zhang, Dylan Campbell, Stephen Gould | N/A | |
| G-DetKD: Towards General Distillation Framework for Object Detectors via Contrastive and Semantic-Guided Feature Imitation | Lewei Yao, Renjie Pi, Hang Xu, Wei Zhang, Zhenguo Li, Tong Zhang | N/A | |
| End-to-End Detection and Pose Estimation of Two Interacting Hands | Dong Uk Kim, Kwang In Kim, Seungryul Baek | N/A | |
| Fog Simulation on Real LiDAR Point Clouds for 3D Object Detection in Adverse Weather | Martin Hahner, Christos Sakaridis, Dengxin Dai, Luc Van Gool | N/A | |
| Revisiting Adversarial Robustness Distillation: Robust Soft Labels Make Student Better | Bojia Zi, Shihao Zhao, Xingjun Ma, Yu-Gang Jiang | N/A | |
| Normalization Matters in Weakly Supervised Object Localization | Jeesoo Kim, Junsuk Choe, Sangdoo Yun, Nojun Kwak | N/A | |
| Joint Inductive and Transductive Learning for Video Object Segmentation | Yunyao Mao, Ning Wang, Wengang Zhou, Houqiang Li | N/A | |
| Contrast and Order Representations for Video Self-Supervised Learning | Kai Hu, Jie Shao, Yuan Liu, Bhiksha Raj, Marios Savvides, Zhiqiang Shen | N/A | |
| Out-of-Core Surface Reconstruction via Global TGV Minimization | Nikolai Poliarnyi | N/A | |
| Skeleton Cloud Colorization for Unsupervised 3D Action Representation Learning | Siyuan Yang, Jun Liu, Shijian Lu, Meng Hwa Er, Alex C. Kot | N/A | |
| Latent Transformations via NeuralODEs for GAN-Based Image Editing | Valentin Khrulkov, Leyla Mirvakhabova, Ivan Oseledets, Artem Babenko | N/A | |
| DECA: Deep Viewpoint-Equivariant Human Pose Estimation Using Capsule Autoencoders | Nicola Garau, Niccolò Bisagno, Piotr Bródka, Nicola Conci | N/A | |
| Extensions of Karger's Algorithm: Why They Fail in Theory and How They Are Useful in Practice | Erik Jenner, Enrique Fita Sanmartín, Fred A. Hamprecht | N/A | |
| Geometry-Free View Synthesis: Transformers and No 3D Priors | Robin Rombach, Patrick Esser, Björn Ommer | N/A | |
| Scaling-Up Disentanglement for Image Translation | Aviv Gabbay, Yedid Hoshen | N/A | |
| MeshTalk: 3D Face Animation From Speech Using Cross-Modality Disentanglement | Alexander Richard, Michael Zollhöfer, Yandong Wen, Fernando de la Torre, Yaser Sheikh | N/A | |
| Learning a Single Network for Scale-Arbitrary Super-Resolution | Longguang Wang, Yingqian Wang, Zaiping Lin, Jungang Yang, Wei An, Yulan Guo | N/A | |
| Salient Object Ranking With Position-Preserved Attention | Hao Fang, Daoxin Zhang, Yi Zhang, Minghao Chen, Jiawei Li, Yao Hu, Deng Cai, Xiaofei He | N/A | |
| Paint Transformer: Feed Forward Neural Painting With Stroke Prediction | Songhua Liu, Tianwei Lin, Dongliang He, Fu Li, Ruifeng Deng, Xin Li, Errui Ding, Hao Wang | N/A | |
| DetCo: Unsupervised Contrastive Learning for Object Detection | Enze Xie, Jian Ding, Wenhai Wang, Xiaohang Zhan, Hang Xu, Peize Sun, Zhenguo Li, Ping Luo | N/A | |
| PR-RRN: Pairwise-Regularized Residual-Recursive Networks for Non-Rigid Structure-From-Motion | Haitian Zeng, Yuchao Dai, Xin Yu, Xiaohan Wang, Yi Yang | N/A | |
| YouRefIt: Embodied Reference Understanding With Language and Gesture | Yixin Chen, Qing Li, Deqian Kong, Yik Lun Kei, Song-Chun Zhu, Tao Gao, Yixin Zhu, Siyuan Huang | N/A | |
| Omniscient Video Super-Resolution | Peng Yi, Zhongyuan Wang, Kui Jiang, Junjun Jiang, Tao Lu, Xin Tian, Jiayi Ma | N/A | |
| Clothing Status Awareness for Long-Term Person Re-Identification | Yan Huang, Qiang Wu, JingSong Xu, Yi Zhong, ZhaoXiang Zhang | N/A | |
| Exploring Temporal Coherence for More General Video Face Forgery Detection | Yinglin Zheng, Jianmin Bao, Dong Chen, Ming Zeng, Fang Wen | N/A | |
| A Lazy Approach to Long-Horizon Gradient-Based Meta-Learning | Muhammad Abdullah Jamal, Liqiang Wang, Boqing Gong | N/A | |
| Representative Color Transform for Image Enhancement | Hanul Kim, Su-Min Choi, Chang-Su Kim, Yeong Jun Koh | N/A | |
| Image Synthesis via Semantic Composition | Yi Wang, Lu Qi, Ying-Cong Chen, Xiangyu Zhang, Jiaya Jia | N/A | |
| Temporally-Coherent Surface Reconstruction via Metric-Consistent Atlases | Jan Bednarik, Vladimir G. Kim, Siddhartha Chaudhuri, Shaifali Parashar, Mathieu Salzmann, Pascal Fua, Noam Aigerman | N/A | |
| Online Continual Learning With Natural Distribution Shifts: An Empirical Study With Visual Data | Zhipeng Cai, Ozan Sener, Vladlen Koltun | N/A | |
| Social Fabric: Tubelet Compositions for Video Relation Detection | Shuo Chen, Zenglin Shi, Pascal Mettes, Cees G. M. Snoek | N/A | |
| On Feature Decorrelation in Self-Supervised Learning | Tianyu Hua, Wenxiao Wang, Zihui Xue, Sucheng Ren, Yue Wang, Hang Zhao | N/A | |
| Multi-Scale Vision Longformer: A New Vision Transformer for High-Resolution Image Encoding | Pengchuan Zhang, Xiyang Dai, Jianwei Yang, Bin Xiao, Lu Yuan, Lei Zhang, Jianfeng Gao | N/A | |
| Viewing Graph Solvability via Cycle Consistency | Federica Arrigoni, Andrea Fusiello, Elisa Ricci, Tomas Pajdla | N/A | |
| Low Curvature Activations Reduce Overfitting in Adversarial Training | Vasu Singla, Sahil Singla, Soheil Feizi, David Jacobs | N/A | |
| SACoD: Sensor Algorithm Co-Design Towards Efficient CNN-Powered Intelligent PhlatCam | Yonggan Fu, Yang Zhang, Yue Wang, Zhihan Lu, Vivek Boominathan, Ashok Veeraraghavan, Yingyan Lin | N/A | |
| Adaptive Focus for Efficient Video Recognition | Yulin Wang, Zhaoxi Chen, Haojun Jiang, Shiji Song, Yizeng Han, Gao Huang | N/A | |
| Audio2Gestures: Generating Diverse Gestures From Speech Audio With Conditional Variational Autoencoders | Jing Li, Di Kang, Wenjie Pei, Xuefei Zhe, Ying Zhang, Zhenyu He, Linchao Bao | N/A | |
| SnowflakeNet: Point Cloud Completion by Snowflake Point Deconvolution With Skip-Transformer | Peng Xiang, Xin Wen, Yu-Shen Liu, Yan-Pei Cao, Pengfei Wan, Wen Zheng, Zhizhong Han | N/A | |
| LeViT: A Vision Transformer in ConvNet's Clothing for Faster Inference | Benjamin Graham, Alaaeldin El-Nouby, Hugo Touvron, Pierre Stock, Armand Joulin, Hervé Jégou, Matthijs Douze | N/A | |
| Active Universal Domain Adaptation | Xinhong Ma, Junyu Gao, Changsheng Xu | N/A | |
| FairNAS: Rethinking Evaluation Fairness of Weight Sharing Neural Architecture Search | Xiangxiang Chu, Bo Zhang, Ruijun Xu | N/A | |
| Keep CALM and Improve Visual Feature Attribution | Jae Myung Kim, Junsuk Choe, Zeynep Akata, Seong Joon Oh | N/A | |
| AdaMML: Adaptive Multi-Modal Learning for Efficient Video Recognition | Rameswar Panda, Chun-Fu (Richard) Chen, Quanfu Fan, Ximeng Sun, Kate Saenko, Aude Oliva, Rogerio Feris | N/A | |
| Rethinking 360° Image Visual Attention Modelling With Unsupervised Learning | Yasser Abdelaziz Dahou Djilali, Tarun Krishna, Kevin McGuinness, Noel E. O’Connor | N/A | |
| An End-to-End Transformer Model for 3D Object Detection | Ishan Misra, Rohit Girdhar, Armand Joulin | N/A | |
| Lipschitz Continuity Guided Knowledge Distillation | Yuzhang Shang, Bin Duan, Ziliang Zong, Liqiang Nie, Yan Yan | N/A | |
| Instance Similarity Learning for Unsupervised Feature Representation | Ziwei Wang, Yunsong Wang, Ziyi Wu, Jiwen Lu, Jie Zhou | N/A | |
| Mixed SIGNals: Sign Language Production via a Mixture of Motion Primitives | Ben Saunders, Necati Cihan Camgoz, Richard Bowden | N/A | |
| DocFormer: End-to-End Transformer for Document Understanding | Srikar Appalaraju, Bhavan Jasani, Bhargava Urala Kota, Yusheng Xie, R. Manmatha | N/A | |
| Spatially-Adaptive Image Restoration Using Distortion-Guided Networks | Kuldeep Purohit, Maitreya Suin, A. N. Rajagopalan, Vishnu Naresh Boddeti | N/A | |
| Exploiting Sample Correlation for Crowd Counting With Multi-Expert Network | Xinyan Liu, Guorong Li, Zhenjun Han, Weigang Zhang, Yifan Yang, Qingming Huang, Nicu Sebe | N/A | |
| Unlimited Neighborhood Interaction for Heterogeneous Trajectory Prediction | Fang Zheng, Le Wang, Sanping Zhou, Wei Tang, Zhenxing Niu, Nanning Zheng, Gang Hua | N/A | |
| Multi-Scale Matching Networks for Semantic Correspondence | Dongyang Zhao, Ziyang Song, Zhenghao Ji, Gangming Zhao, Weifeng Ge, Yizhou Yu | N/A | |
| LatentCLR: A Contrastive Learning Approach for Unsupervised Discovery of Interpretable Directions | Oğuz Kaan Yüksel, Enis Simsar, Ezgi Gülperi Er, Pinar Yanardag | N/A | |
| Few-Shot Visual Relationship Co-Localization | Revant Teotia, Vaibhav Mishra, Mayank Maheshwari, Anand Mishra | N/A | |
| RFNet: Recurrent Forward Network for Dense Point Cloud Completion | Tianxin Huang, Hao Zou, Jinhao Cui, Xuemeng Yang, Mengmeng Wang, Xiangrui Zhao, Jiangning Zhang, Yi Yuan, Yifan Xu, Yong Liu | N/A | |
| Towards Better Explanations of Class Activation Mapping | Hyungsik Jung, Youngrock Oh | N/A | |
| Domain Adaptive Video Segmentation via Temporal Consistency Regularization | Dayan Guan, Jiaxing Huang, Aoran Xiao, Shijian Lu | N/A | |
| PR-Net: Preference Reasoning for Personalized Video Highlight Detection | Runnan Chen, Penghao Zhou, Wenzhe Wang, Nenglun Chen, Pai Peng, Xing Sun, Wenping Wang | N/A | |
| PoinTr: Diverse Point Cloud Completion With Geometry-Aware Transformers | Xumin Yu, Yongming Rao, Ziyi Wang, Zuyan Liu, Jiwen Lu, Jie Zhou | N/A | |
| Learn-To-Race: A Multimodal Control Environment for Autonomous Racing | James Herman, Jonathan Francis, Siddha Ganju, Bingqing Chen, Anirudh Koul, Abhinav Gupta, Alexey Skabelkin, Ivan Zhukov, Max Kumskoy, Eric Nyberg | N/A | |
| SignBERT: Pre-Training of Hand-Model-Aware Representation for Sign Language Recognition | Hezhen Hu, Weichao Zhao, Wengang Zhou, Yuechen Wang, Houqiang Li | N/A | |
| Improving Low-Precision Network Quantization via Bin Regularization | Tiantian Han, Dong Li, Ji Liu, Lu Tian, Yi Shan | N/A | |
| Probabilistic Modeling for Human Mesh Recovery | Nikos Kolotouros, Georgios Pavlakos, Dinesh Jayaraman, Kostas Daniilidis | N/A | |
| Distilling Virtual Examples for Long-Tailed Recognition | Yin-Yin He, Jianxin Wu, Xiu-Shen Wei | N/A | |
| Understanding and Mitigating Annotation Bias in Facial Expression Recognition | Yunliang Chen · , · Jungseock Joo | N/A | |
| Large Scale Multi-Illuminant (LSMI) Dataset for Developing White Balance Algorithm Under Mixed Illumination | Dongyoung Kim · , · Jinwoo Kim · , · Seonghyeon Nam · , · Dongwoo Lee · , · Yeonkyung Lee · , · Nahyup Kang · , · Hyong-Euk Lee · , · ByungIn Yoo · , · Jae-Joon Han · , · Seon Joo Kim | N/A | |
| Reality Transform Adversarial Generators for Image Splicing Forgery Detection and Localization | Xiuli Bi · , · Zhipeng Zhang · , · Bin Xiao | N/A | |
| Learning To Regress Bodies From Images Using Differentiable Semantic Rendering | Sai Kumar Dwivedi · , · Nikos Athanasiou · , · Muhammed Kocabas · , · Michael J. Black | N/A | |
| Bifold and Semantic Reasoning for Pedestrian Behavior Prediction | Amir Rasouli · , · Mohsen Rohani · , · Jun Luo | N/A | |
| Learning Target Candidate Association To Keep Track of What Not To Track | Christoph Mayer · , · Martin Danelljan · , · Danda Pani Paudel · , · Luc Van Gool | N/A | |
| VolumeFusion: Deep Depth Fusion for 3D Scene Reconstruction | Jaesung Choe · , · Sunghoon Im · , · Francois Rameau · , · Minjun Kang · , · In So Kweon | N/A | |
| Black-Box Detection of Backdoor Attacks With Limited Information and Data | Yinpeng Dong · , · Xiao Yang · , · Zhijie Deng · , · Tianyu Pang · , · Zihao Xiao · , · Hang Su · , · Jun Zhu | N/A | |
| A Robust Loss for Point Cloud Registration | Zhi Deng · , · Yuxin Yao · , · Bailin Deng · , · Juyong Zhang | N/A | |
| Semantic Concentration for Domain Adaptation | Shuang Li · , · Mixue Xie · , · Fangrui Lv · , · Chi Harold Liu · , · Jian Liang · , · Chen Qin · , · Wei Li | N/A | |
| PICCOLO: Point Cloud-Centric Omnidirectional Localization | Junho Kim · , · Changwoon Choi · , · Hojun Jang · , · Young Min Kim | N/A | |
| Distributional Robustness Loss for Long-Tail Learning | Dvir Samuel · , · Gal Chechik | N/A | |
| NGC: A Unified Framework for Learning With Open-World Noisy Data | Zhi-Fan Wu · , · Tong Wei · , · Jianwen Jiang · , · Chaojie Mao · , · Mingqian Tang · , · Yu-Feng Li | N/A | |
| Superpoint Network for Point Cloud Oversegmentation | Le Hui · , · Jia Yuan · , · Mingmei Cheng · , · Jin Xie · , · Xiaoya Zhang · , · Jian Yang | N/A | |
| Exploring Simple 3D Multi-Object Tracking for Autonomous Driving | Chenxu Luo · , · Xiaodong Yang · , · Alan Yuille | N/A | |
| Looking Here or There? Gaze Following in 360-Degree Images | Yunhao Li · , · Wei Shen · , · Zhongpai Gao · , · Yucheng Zhu · , · Guangtao Zhai · , · Guodong Guo | N/A | |
| LoOp: Looking for Optimal Hard Negative Embeddings for Deep Metric Learning | Bhavya Vasudeva · , · Puneesh Deora · , · Saumik Bhattacharya · , · Umapada Pal · , · Sukalpa Chanda | N/A | |
| How To Train Neural Networks for Flare Removal | Yicheng Wu · , · Qiurui He · , · Tianfan Xue · , · Rahul Garg · , · Jiawen Chen · , · Ashok Veeraraghavan · , · Jonathan T. Barron | N/A | |
| Motion Basis Learning for Unsupervised Deep Homography Estimation With Subspace Projection | Nianjin Ye · , · Chuan Wang · , · Haoqiang Fan · , · Shuaicheng Liu | N/A | |
| DeepMultiCap: Performance Capture of Multiple Characters Using Sparse Multiview Cameras | Yang Zheng · , · Ruizhi Shao · , · Yuxiang Zhang · , · Tao Yu · , · Zerong Zheng · , · Qionghai Dai · , · Yebin Liu | N/A | |
| AI Choreographer: Music Conditioned 3D Dance Generation With AIST++ | Ruilong Li · , · Shan Yang · , · David A. Ross · , · Angjoo Kanazawa | N/A | |
| PU-EVA: An Edge-Vector Based Approximation Solution for Flexible-Scale Point Cloud Upsampling | Luqing Luo · , · Lulu Tang · , · Wanyi Zhou · , · Shizheng Wang · , · Zhi-Xin Yang | N/A | |
| Spatial Uncertainty-Aware Semi-Supervised Crowd Counting | Yanda Meng · , · Hongrun Zhang · , · Yitian Zhao · , · Xiaoyun Yang · , · Xuesheng Qian · , · Xiaowei Huang · , · Yalin Zheng | N/A | |
| SurfGen: Adversarial 3D Shape Synthesis With Explicit Surface Discriminators | Andrew Luo · , · Tianqin Li · , · Wen-Hao Zhang · , · Tai Sing Lee | N/A | |
| TransReID: Transformer-Based Object Re-Identification | Shuting He · , · Hao Luo · , · Pichao Wang · , · Fan Wang · , · Hao Li · , · Wei Jiang | N/A | |
| Batch Normalization Increases Adversarial Vulnerability and Decreases Adversarial Transferability: A Non-Robust Feature Perspective | Philipp Benz · , · Chaoning Zhang · , · In So Kweon | N/A | |
| Foreground Activation Maps for Weakly Supervised Object Localization | Meng Meng · , · Tianzhu Zhang · , · Qi Tian · , · Yongdong Zhang · , · Feng Wu | N/A | |
| Self-Mutual Distillation Learning for Continuous Sign Language Recognition | Aiming Hao · , · Yuecong Min · , · Xilin Chen | N/A | |
| SOMA: Solving Optical Marker-Based MoCap Automatically | Nima Ghorbani · , · Michael J. Black | N/A | |
| GLiT: Neural Architecture Search for Global and Local Image Transformer | Boyu Chen · , · Peixia Li · , · Chuming Li · , · Baopu Li · , · Lei Bai · , · Chen Lin · , · Ming Sun · , · Junjie Yan · , · Wanli Ouyang | N/A | |
| In-the-Wild Single Camera 3D Reconstruction Through Moving Water Surfaces | Jinhui Xiong · , · Wolfgang Heidrich | N/A | |
| DeepCAD: A Deep Generative Network for Computer-Aided Design Models | Rundi Wu · , · Chang Xiao · , · Changxi Zheng | N/A | |
| Understanding Robustness of Transformers for Image Classification | Srinadh Bhojanapalli · , · Ayan Chakrabarti · , · Daniel Glasner · , · Daliang Li · , · Thomas Unterthiner · , · Andreas Veit | N/A | |
| Learning Canonical View Representation for 3D Shape Recognition With Arbitrary Views | Xin Wei · , · Yifei Gong · , · Fudong Wang · , · Xing Sun · , · Jian Sun | N/A | |
| Fourier Space Losses for Efficient Perceptual Image Super-Resolution | Dario Fuoli · , · Luc Van Gool · , · Radu Timofte | N/A | |
| A Backdoor Attack Against 3D Point Cloud Classifiers | Zhen Xiang · , · David J. Miller · , · Siheng Chen · , · Xi Li · , · George Kesidis | N/A | |
| Guided Point Contrastive Learning for Semi-Supervised Point Cloud Semantic Segmentation | Li Jiang · , · Shaoshuai Shi · , · Zhuotao Tian · , · Xin Lai · , · Shu Liu · , · Chi-Wing Fu · , · Jiaya Jia | N/A | |
| Location-Aware Single Image Reflection Removal | Zheng Dong · , · Ke Xu · , · Yin Yang · , · Hujun Bao · , · Weiwei Xu · , · Rynson W.H. Lau | N/A | |
| Better Aggregation in Test-Time Augmentation | Divya Shanmugam · , · Davis Blalock · , · Guha Balakrishnan · , · John Guttag | N/A | |
| Self-Born Wiring for Neural Trees | Ying Chen · , · Feng Mao · , · Jie Song · , · Xinchao Wang · , · Huiqiong Wang · , · Mingli Song | N/A | |
| DenseTNT: End-to-End Trajectory Prediction From Dense Goal Sets | Junru Gu · , · Chen Sun · , · Hang Zhao | N/A | |
| Segmentation-Grounded Scene Graph Generation | Siddhesh Khandelwal · , · Mohammed Suhail · , · Leonid Sigal | N/A | |
| Detector-Free Weakly Supervised Grounding by Separation | Assaf Arbelle · , · Sivan Doveh · , · Amit Alfassy · , · Joseph Shtok · , · Guy Lev · , · Eli Schwartz · , · Hilde Kuehne · , · Hila Barak Levi · , · Prasanna Sattigeri · , · Rameswar Panda · , · Chun-Fu (Richard) Chen · , · Alex Bronstein · , · Kate Saenko · , · Shimon Ullman · , · Raja Giryes · , · Rogerio Feris · , · Leonid Karlinsky | N/A | |
| Geography-Aware Self-Supervised Learning | Kumar Ayush · , · Burak Uzkent · , · Chenlin Meng · , · Kumar Tanmay · , · Marshall Burke · , · David Lobell · , · Stefano Ermon | N/A | |
| CrossCLR: Cross-Modal Contrastive Learning for Multi-Modal Video Representations | Mohammadreza Zolfaghari · , · Yi Zhu · , · Peter Gehler · , · Thomas Brox | N/A | |
| Shape-Aware Multi-Person Pose Estimation From Multi-View Images | Zijian Dong · , · Jie Song · , · Xu Chen · , · Chen Guo · , · Otmar Hilliges | N/A | |
| Single Image Defocus Deblurring Using Kernel-Sharing Parallel Atrous Convolutions | Hyeongseok Son · , · Junyong Lee · , · Sunghyun Cho · , · Seungyong Lee | N/A | |
| Time-Multiplexed Coded Aperture Imaging: Learned Coded Aperture and Pixel Exposures for Compressive Imaging Systems | Edwin Vargas · , · Julien N. P. Martel · , · Gordon Wetzstein · , · Henry Arguello | N/A | |
| Motion-Aware Dynamic Architecture for Efficient Frame Interpolation | Myungsub Choi · , · Suyoung Lee · , · Heewon Kim · , · Kyoung Mu Lee | N/A | |
| Contrasting Contrastive Self-Supervised Representation Learning Pipelines | Klemen Kotar · , · Gabriel Ilharco · , · Ludwig Schmidt · , · Kiana Ehsani · , · Roozbeh Mottaghi | N/A | |
| Normalized Human Pose Features for Human Action Video Alignment | Jingyuan Liu · , · Mingyi Shi · , · Qifeng Chen · , · Hongbo Fu · , · Chiew-Lan Tai | N/A | |
| Learning Hierarchical Graph Neural Networks for Image Clustering | Yifan Xing · , · Tong He · , · Tianjun Xiao · , · Yongxin Wang · , · Yuanjun Xiong · , · Wei Xia · , · David Wipf · , · Zheng Zhang · , · Stefano Soatto | N/A | |
| Indoor Scene Generation From a Collection of Semantic-Segmented Depth Images | Ming-Jia Yang · , · Yu-Xiao Guo · , · Bin Zhou · , · Xin Tong | N/A | |
| Keypoint Communities | Duncan Zauss · , · Sven Kreiss · , · Alexandre Alahi | N/A | |
| Can Scale-Consistent Monocular Depth Be Learned in a Self-Supervised Scale-Invariant Manner? | Lijun Wang · , · Yifan Wang · , · Linzhao Wang · , · Yunlong Zhan · , · Ying Wang · , · Huchuan Lu | N/A | |
| Multi-Task Self-Training for Learning General Representations | Golnaz Ghiasi · , · Barret Zoph · , · Ekin D. Cubuk · , · Quoc V. Le · , · Tsung-Yi Lin | N/A | |
| Adaptive Unfolding Total Variation Network for Low-Light Image Enhancement | Chuanjun Zheng · , · Daming Shi · , · Wentian Shi | N/A | |
| Training Weakly Supervised Video Frame Interpolation With Events | Zhiyang Yu · , · Yu Zhang · , · Deyuan Liu · , · Dongqing Zou · , · Xijun Chen · , · Yebin Liu · , · Jimmy S. Ren | N/A | |
| TransView: Inside, Outside, and Across the Cropping View Boundaries | Zhiyu Pan · , · Zhiguo Cao · , · Kewei Wang · , · Hao Lu · , · Weicai Zhong | N/A | |
| Vis2Mesh: Efficient Mesh Reconstruction From Unstructured Point Clouds of Large Scenes With Learned Virtual View Visibility | Shuang Song · , · Zhaopeng Cui · , · Rongjun Qin | N/A | |
| ID-Reveal: Identity-Aware DeepFake Video Detection | Davide Cozzolino · , · Andreas Rössler · , · Justus Thies · , · Matthias Nießner · , · Luisa Verdoliva | N/A | |
| GAN-Control: Explicitly Controllable GANs | Alon Shoshan · , · Nadav Bhonker · , · Igor Kviatkovsky · , · Gérard Medioni | N/A | |
| A Closer Look at Rotation-Invariant Deep Point Cloud Analysis | Feiran Li · , · Kent Fujiwara · , · Fumio Okura · , · Yasuyuki Matsushita | N/A | |
| Relating Adversarially Robust Generalization to Flat Minima | David Stutz · , · Matthias Hein · , · Bernt Schiele | N/A | |
| Re-Energizing Domain Discriminator With Sample Relabeling for Adversarial Domain Adaptation | Xin Jin · , · Cuiling Lan · , · Wenjun Zeng · , · Zhibo Chen | N/A | |
| Learning To Adversarially Blur Visual Object Tracking | Qing Guo · , · Ziyi Cheng · , · Felix Juefei-Xu · , · Lei Ma · , · Xiaofei Xie · , · Yang Liu · , · Jianjun Zhao | N/A | |
| Few-Shot Image Classification: Just Use a Library of Pre-Trained Feature Extractors and a Simple Classifier | Arkabandhu Chowdhury · , · Mingchao Jiang · , · Swarat Chaudhuri · , · Chris Jermaine | N/A | |
| COMISR: Compression-Informed Video Super-Resolution | Yinxiao Li · , · Pengchong Jin · , · Feng Yang · , · Ce Liu · , · Ming-Hsuan Yang · , · Peyman Milanfar | N/A | |
| Bit-Mixer: Mixed-Precision Networks With Runtime Bit-Width Selection | Adrian Bulat · , · Georgios Tzimiropoulos | N/A | |
| Light Field Saliency Detection With Dual Local Graph Learning and Reciprocative Guidance | Nian Liu · , · Wangbo Zhao · , · Dingwen Zhang · , · Junwei Han · , · Ling Shao | N/A | |
| Finding Representative Interpretations on Convolutional Neural Networks | Peter Cho-Ho Lam · , · Lingyang Chu · , · Maxim Torgonskiy · , · Jian Pei · , · Yong Zhang · , · Lanjun Wang | N/A | |
| AINet: Association Implantation for Superpixel Segmentation | Yaxiong Wang · , · Yunchao Wei · , · Xueming Qian · , · Li Zhu · , · Yi Yang | N/A | |
| An Asynchronous Kalman Filter for Hybrid Event Cameras | Ziwei Wang · , · Yonhon Ng · , · Cedric Scheerlinck · , · Robert Mahony | N/A | |
| Orthogonal Projection Loss | Kanchana Ranasinghe · , · Muzammal Naseer · , · Munawar Hayat · , · Salman Khan · , · Fahad Shahbaz Khan | N/A | |
| Deep Virtual Markers for Articulated 3D Shapes | Hyomin Kim · , · Jungeon Kim · , · Jaewon Kam · , · Jaesik Park · , · Seungyong Lee | N/A | |
| Achieving On-Mobile Real-Time Super-Resolution With Neural Architecture and Pruning Search | Zheng Zhan · , · Yifan Gong · , · Pu Zhao · , · Geng Yuan · , · Wei Niu · , · Yushu Wu · , · Tianyun Zhang · , · Malith Jayaweera · , · David Kaeli · , · Bin Ren · , · Xue Lin · , · Yanzhi Wang | N/A | |
| One-Pass Multi-View Clustering for Large-Scale Data | Jiyuan Liu · , · Xinwang Liu · , · Yuexiang Yang · , · Li Liu · , · Siqi Wang · , · Weixuan Liang · , · Jiangyong Shi | N/A | |
| Knowledge-Enriched Distributional Model Inversion Attacks | Si Chen · , · Mostafa Kahla · , · Ruoxi Jia · , · Guo-Jun Qi | N/A | |
| Z-Score Normalization, Hubness, and Few-Shot Learning | Nanyi Fei · , · Yizhao Gao · , · Zhiwu Lu · , · Tao Xiang | N/A | |
| Dense Interaction Learning for Video-Based Person Re-Identification | Tianyu He · , · Xin Jin · , · Xu Shen · , · Jianqiang Huang · , · Zhibo Chen · , · Xian-Sheng Hua | N/A | |
| M3D-VTON: A Monocular-to-3D Virtual Try-On Network | Fuwei Zhao · , · Zhenyu Xie · , · Michael Kampffmeyer · , · Haoye Dong · , · Songfang Han · , · Tianxiang Zheng · , · Tao Zhang · , · Xiaodan Liang | N/A | |
| Explanations for Occluded Images | Hana Chockler · , · Daniel Kroening · , · Youcheng Sun | N/A | |
| Designing a Practical Degradation Model for Deep Blind Image Super-Resolution | Kai Zhang · , · Jingyun Liang · , · Luc Van Gool · , · Radu Timofte | N/A | |
| Unshuffling Data for Improved Generalization in Visual Question Answering | Damien Teney · , · Ehsan Abbasnejad · , · Anton van den Hengel | N/A | |
| Architecture Disentanglement for Deep Neural Networks | Jie Hu · , · Liujuan Cao · , · Tong Tong · , · Qixiang Ye · , · Shengchuan Zhang · , · Ke Li · , · Feiyue Huang · , · Ling Shao · , · Rongrong Ji | N/A | |
| Instances As Queries | Yuxin Fang · , · Shusheng Yang · , · Xinggang Wang · , · Yu Li · , · Chen Fang · , · Ying Shan · , · Bin Feng · , · Wenyu Liu | N/A | |
| Omni-GAN: On the Secrets of cGANs and Beyond | Peng Zhou · , · Lingxi Xie · , · Bingbing Ni · , · Cong Geng · , · Qi Tian | N/A | |
| ACDC: The Adverse Conditions Dataset With Correspondences for Semantic Driving Scene Understanding | Christos Sakaridis · , · Dengxin Dai · , · Luc Van Gool | N/A | |
| Improving De-Raining Generalization via Neural Reorganization | Jie Xiao · , · Man Zhou · , · Xueyang Fu · , · Aiping Liu · , · Zheng-Jun Zha | N/A | |
| 3D Shape Generation and Completion Through Point-Voxel Diffusion | Linqi Zhou · , · Yilun Du · , · Jiajun Wu | N/A | |
| Temporal Knowledge Consistency for Unsupervised Visual Representation Learning | Weixin Feng · , · Yuanjiang Wang · , · Lihua Ma · , · Ye Yuan · , · Chi Zhang | N/A | |
| Self-Conditioned Probabilistic Learning of Video Rescaling | Yuan Tian · , · Guo Lu · , · Xiongkuo Min · , · Zhaohui Che · , · Guangtao Zhai · , · Guodong Guo · , · Zhiyong Gao | N/A | |
| Unsupervised Image Generation With Infinite Generative Adversarial Networks | Hui Ying · , · He Wang · , · Tianjia Shao · , · Yin Yang · , · Kun Zhou | N/A | |
| SGPA: Structure-Guided Prior Adaptation for Category-Level 6D Object Pose Estimation | Kai Chen · , · Qi Dou | N/A | |
| Inferring High-Resolution Traffic Accident Risk Maps Based on Satellite Imagery and GPS Trajectories | Songtao He · , · Mohammad Amin Sadeghi · , · Sanjay Chawla · , · Mohammad Alizadeh · , · Hari Balakrishnan · , · Samuel Madden | N/A | |
| Self-Supervised Product Quantization for Deep Unsupervised Image Retrieval | Young Kyun Jang · , · Nam Ik Cho | N/A | |
| On Equivariant and Invariant Learning of Object Landmark Representations | Zezhou Cheng · , · Jong-Chyi Su · , · Subhransu Maji | N/A | |
| Rethinking Deep Image Prior for Denoising | Yeonsik Jo · , · Se Young Chun · , · Jonghyun Choi | N/A | |
| VariTex: Variational Neural Face Textures | Marcel C. Bühler · , · Abhimitra Meka · , · Gengyan Li · , · Thabo Beeler · , · Otmar Hilliges | N/A | |
| Domain Adaptive Semantic Segmentation With Self-Supervised Depth Estimation | Qin Wang · , · Dengxin Dai · , · Lukas Hoyer · , · Luc Van Gool · , · Olga Fink | N/A | |
| The Way to My Heart Is Through Contrastive Learning: Remote Photoplethysmography From Unlabelled Video | John Gideon · , · Simon Stent | N/A | |
| IICNet: A Generic Framework for Reversible Image Conversion | Ka Leong Cheng · , · Yueqi Xie · , · Qifeng Chen | N/A | |
| Deep Hough Voting for Robust Global Registration | Junha Lee · , · Seungwook Kim · , · Minsu Cho · , · Jaesik Park | N/A | |
| Image Synthesis From Layout With Locality-Aware Mask Adaption | Zejian Li · , · Jingyu Wu · , · Immanuel Koh · , · Yongchuan Tang · , · Lingyun Sun | N/A | |
| Generalized and Incremental Few-Shot Learning by Explicit Learning and Calibration Without Forgetting | Anna Kukleva · , · Hilde Kuehne · , · Bernt Schiele | N/A | |
| Scribble-Supervised Semantic Segmentation by Uncertainty Reduction on Neural Representation and Self-Supervision on Neural Eigenspace | Zhiyi Pan · , · Peng Jiang · , · Yunhai Wang · , · Changhe Tu · , · Anthony G. Cohn | N/A | |
| Unsupervised Domain Adaptive 3D Detection With Multi-Level Consistency | Zhipeng Luo · , · Zhongang Cai · , · Changqing Zhou · , · Gongjie Zhang · , · Haiyu Zhao · , · Shuai Yi · , · Shijian Lu · , · Hongsheng Li · , · Shanghang Zhang · , · Ziwei Liu | N/A | |
| Transporting Causal Mechanisms for Unsupervised Domain Adaptation | Zhongqi Yue · , · Qianru Sun · , · Xian-Sheng Hua · , · Hanwang Zhang | N/A | |
| Learning To Estimate Hidden Motions With Global Motion Aggregation | Shihao Jiang · , · Dylan Campbell · , · Yao Lu · , · Hongdong Li · , · Richard Hartley | N/A | |
| Predicting With Confidence on Unseen Distributions | Devin Guillory · , · Vaishaal Shankar · , · Sayna Ebrahimi · , · Trevor Darrell · , · Ludwig Schmidt | N/A | |
| TAM: Temporal Adaptive Module for Video Recognition | Zhaoyang Liu · , · Limin Wang · , · Wayne Wu · , · Chen Qian · , · Tong Lu | N/A | |
| Generating Masks From Boxes by Mining Spatio-Temporal Consistencies in Videos | Bin Zhao · , · Goutam Bhat · , · Martin Danelljan · , · Luc Van Gool · , · Radu Timofte | N/A | |
| TRAR: Routing the Attention Spans in Transformer for Visual Question Answering | Yiyi Zhou · , · Tianhe Ren · , · Chaoyang Zhu · , · Xiaoshuai Sun · , · Jianzhuang Liu · , · Xinghao Ding · , · Mingliang Xu · , · Rongrong Ji | N/A | |
| Embed Me if You Can: A Geometric Perceptron | Pavlo Melnyk · , · Michael Felsberg · , · Mårten Wadenbäck | N/A | |
| Learning Rare Category Classifiers on a Tight Labeling Budget | Ravi Teja Mullapudi · , · Fait Poms · , · William R. Mark · , · Deva Ramanan · , · Kayvon Fatahalian | N/A | |
| Persistent Homology Based Graph Convolution Network for Fine-Grained 3D Shape Segmentation | Chi-Chong Wong · , · Chi-Man Vong | N/A | |
| Hybrid Neural Fusion for Full-Frame Video Stabilization | Yu-Lun Liu · , · Wei-Sheng Lai · , · Ming-Hsuan Yang · , · Yung-Yu Chuang · , · Jia-Bin Huang | N/A | |
| HIRE-SNN: Harnessing the Inherent Robustness of Energy-Efficient Deep Spiking Neural Networks by Training With Crafted Input Noise | Souvik Kundu · , · Massoud Pedram · , · Peter A. Beerel | N/A | |
| CDNet: Centripetal Direction Network for Nuclear Instance Segmentation | Hongliang He · , · Zhongyi Huang · , · Yao Ding · , · Guoli Song · , · Lin Wang · , · Qian Ren · , · Pengxu Wei · , · Zhiqiang Gao · , · Jie Chen | N/A | |
| 3D Human Texture Estimation From a Single Image With Transformers | Xiangyu Xu · , · Chen Change Loy | N/A | |
| The Surprising Effectiveness of Visual Odometry Techniques for Embodied PointGoal Navigation | Xiaoming Zhao · , · Harsh Agrawal · , · Dhruv Batra · , · Alexander G. Schwing | N/A | |
| Rehearsal Revealed: The Limits and Merits of Revisiting Samples in Continual Learning | Eli Verwimp · , · Matthias De Lange · , · Tinne Tuytelaars | N/A | |
| Group-Free 3D Object Detection via Transformers | Ze Liu · , · Zheng Zhang · , · Yue Cao · , · Han Hu · , · Xin Tong | N/A | |
| Discover the Unknown Biased Attribute of an Image Classifier | Zhiheng Li · , · Chenliang Xu | N/A | |
| Learn To Cluster Faces via Pairwise Classification | Junfu Liu · , · Di Qiu · , · Pengfei Yan · , · Xiaolin Wei | N/A | |
| DAE-GAN: Dynamic Aspect-Aware GAN for Text-to-Image Synthesis | Shulan Ruan · , · Yong Zhang · , · Kun Zhang · , · Yanbo Fan · , · Fan Tang · , · Qi Liu · , · Enhong Chen | N/A | |
| Learning Facial Representations From the Cycle-Consistency of Face | Jia-Ren Chang · , · Yong-Sheng Chen · , · Wei-Chen Chiu | N/A | |
| Towards Memory-Efficient Neural Networks via Multi-Level In Situ Generation | Jiaqi Gu · , · Hanqing Zhu · , · Chenghao Feng · , · Mingjie Liu · , · Zixuan Jiang · , · Ray T. Chen · , · David Z. Pan | N/A | |
| Greedy Gradient Ensemble for Robust Visual Question Answering | Xinzhe Han · , · Shuhui Wang · , · Chi Su · , · Qingming Huang · , · Qi Tian | N/A | |
| Influence Selection for Active Learning | Zhuoming Liu · , · Hao Ding · , · Huaping Zhong · , · Weijia Li · , · Jifeng Dai · , · Conghui He | N/A | |
| Visual Alignment Constraint for Continuous Sign Language Recognition | Yuecong Min · , · Aiming Hao · , · Xiujuan Chai · , · Xilin Chen | N/A | |
| On the Hidden Treasure of Dialog in Video Question Answering | Deniz Engin · , · François Schnitzler · , · Ngoc Q. K. Duong · , · Yannis Avrithis | N/A | |
| From Culture to Clothing: Discovering the World Events Behind a Century of Fashion Images | Wei-Lin Hsiao · , · Kristen Grauman | N/A | |
| Contextually Plausible and Diverse 3D Human Motion Prediction | Sadegh Aliakbarian · , · Fatemeh Saleh · , · Lars Petersson · , · Stephen Gould · , · Mathieu Salzmann | N/A | |
| BN-NAS: Neural Architecture Search With Batch Normalization | Boyu Chen · , · Peixia Li · , · Baopu Li · , · Chen Lin · , · Chuming Li · , · Ming Sun · , · Junjie Yan · , · Wanli Ouyang | N/A | |
| Condensing a Sequence to One Informative Frame for Video Recognition | Zhaofan Qiu · , · Ting Yao · , · Yan Shu · , · Chong-Wah Ngo · , · Tao Mei | N/A | |
| The Benefit of Distraction: Denoising Camera-Based Physiological Measurements Using Inverse Attention | Ewa M. Nowara · , · Daniel McDuff · , · Ashok Veeraraghavan | N/A | |
| Collaborative and Adversarial Learning of Focused and Dispersive Representations for Semi-Supervised Polyp Segmentation | Huisi Wu · , · Guilian Chen · , · Zhenkun Wen · , · Jing Qin | N/A | |
| Active Domain Adaptation via Clustering Uncertainty-Weighted Embeddings | Viraj Prabhu · , · Arjun Chandrasekaran · , · Kate Saenko · , · Judy Hoffman | N/A | |
| Detail Me More: Improving GAN's Photo-Realism of Complex Scenes | Raghudeep Gadde · , · Qianli Feng · , · Aleix M. Martinez | N/A | |
| Rethinking Self-Supervised Correspondence Learning: A Video Frame-Level Similarity Perspective | Jiarui Xu · , · Xiaolong Wang | N/A | |
| Event Stream Super-Resolution via Spatiotemporal Constraint Learning | Siqi Li · , · Yutong Feng · , · Yipeng Li · , · Yu Jiang · , · Changqing Zou · , · Yue Gao | N/A | |
| PrimitiveNet: Primitive Instance Segmentation With Local Primitive Embedding Under Adversarial Metric | Jingwei Huang · , · Yanfeng Zhang · , · Mingwei Sun | N/A | |
| FuseFormer: Fusing Fine-Grained Information in Transformers for Video Inpainting | Rui Liu · , · Hanming Deng · , · Yangyi Huang · , · Xiaoyu Shi · , · Lewei Lu · , · Wenxiu Sun · , · Xiaogang Wang · , · Jifeng Dai · , · Hongsheng Li | N/A | |
| FASA: Feature Augmentation and Sampling Adaptation for Long-Tailed Instance Segmentation | Yuhang Zang · , · Chen Huang · , · Chen Change Loy | N/A | |
| Online-Trained Upsampler for Deep Low Complexity Video Compression | Jan P. Klopp · , · Keng-Chi Liu · , · Shao-Yi Chien · , · Liang-Gee Chen | N/A | |
| DTMNet: A Discrete Tchebichef Moments-Based Deep Neural Network for Multi-Focus Image Fusion | Bin Xiao · , · Haifeng Wu · , · Xiuli Bi | N/A | |
| Interactive Prototype Learning for Egocentric Action Recognition | Xiaohan Wang · , · Linchao Zhu · , · Heng Wang · , · Yi Yang | N/A | |
| MBA-VO: Motion Blur Aware Visual Odometry | Peidong Liu · , · Xingxing Zuo · , · Viktor Larsson · , · Marc Pollefeys | N/A | |
| Co2L: Contrastive Continual Learning | Hyuntak Cha · , · Jaeho Lee · , · Jinwoo Shin | N/A | |
| STR-GQN: Scene Representation and Rendering for Unknown Cameras Based on Spatial Transformation Routing | Wen-Cheng Chen · , · Min-Chun Hu · , · Chu-Song Chen | N/A | |
| Unconstrained Scene Generation With Locally Conditioned Radiance Fields | Terrance DeVries · , · Miguel Angel Bautista · , · Nitish Srivastava · , · Graham W. Taylor · , · Joshua M. Susskind | N/A | |
| 3D Human Pose Estimation With Spatial and Temporal Transformers | Ce Zheng · , · Sijie Zhu · , · Matias Mendieta · , · Taojiannan Yang · , · Chen Chen · , · Zhengming Ding | N/A | |
| Self-Supervised Representation Learning From Flow Equivariance | Yuwen Xiong · , · Mengye Ren · , · Wenyuan Zeng · , · Raquel Urtasun | N/A | |
| Continual Learning for Image-Based Camera Localization | Shuzhe Wang · , · Zakaria Laskar · , · Iaroslav Melekhov · , · Xiaotian Li · , · Juho Kannala | N/A | |
| Visual-Textual Attentive Semantic Consistency for Medical Report Generation | Yi Zhou · , · Lei Huang · , · Tao Zhou · , · Huazhu Fu · , · Ling Shao | N/A | |
| Revisiting Stereo Depth Estimation From a Sequence-to-Sequence Perspective With Transformers | Zhaoshuo Li · , · Xingtong Liu · , · Nathan Drenkow · , · Andy Ding · , · Francis X. Creighton · , · Russell H. Taylor · , · Mathias Unberath | N/A | |
| Augmenting Depth Estimation With Geospatial Context | Scott Workman · , · Hunter Blanton | N/A | |
| Explaining Local, Global, and Higher-Order Interactions in Deep Learning | Samuel Lerman · , · Charles Venuto · , · Henry Kautz · , · Chenliang Xu | N/A | |
| Learning Attribute-Driven Disentangled Representations for Interactive Fashion Retrieval | Yuxin Hou · , · Eleonora Vig · , · Michael Donoser · , · Loris Bazzani | N/A | |
| SemiHand: Semi-Supervised Hand Pose Estimation With Consistency | Linlin Yang · , · Shicheng Chen · , · Angela Yao | N/A | |
| Efficient Action Recognition via Dynamic Knowledge Propagation | Hanul Kim · , · Mihir Jain · , · Jun-Tae Lee · , · Sungrack Yun · , · Fatih Porikli | N/A | |
| Bias Loss for Mobile Neural Networks | Lusine Abrahamyan · , · Valentin Ziatchin · , · Yiming Chen · , · Nikos Deligiannis | N/A | |
| Visual Scene Graphs for Audio Source Separation | Moitreya Chatterjee · , · Jonathan Le Roux · , · Narendra Ahuja · , · Anoop Cherian | N/A | |
| Beyond Trivial Counterfactual Explanations With Diverse Valuable Explanations | Pau Rodríguez · , · Massimo Caccia · , · Alexandre Lacoste · , · Lee Zamparo · , · Issam Laradji · , · Laurent Charlin · , · David Vazquez | N/A | |
| Homogeneous Architecture Augmentation for Neural Predictor | Yuqiao Liu · , · Yehui Tang · , · Yanan Sun | N/A | |
| Co-Scale Conv-Attentional Image Transformers | Weijian Xu · , · Yifan Xu · , · Tyler Chang · , · Zhuowen Tu | N/A | |
| Impact of Aliasing on Generalization in Deep Convolutional Networks | Cristina Vasconcelos · , · Hugo Larochelle · , · Vincent Dumoulin · , · Rob Romijnders · , · Nicolas Le Roux · , · Ross Goroshin | N/A | |
| PARE: Part Attention Regressor for 3D Human Body Estimation | Muhammed Kocabas · , · Chun-Hao P. Huang · , · Otmar Hilliges · , · Michael J. Black | N/A | |
| RePOSE: Fast 6D Object Pose Refinement via Deep Texture Rendering | Shun Iwase · , · Xingyu Liu · , · Rawal Khirodkar · , · Rio Yokota · , · Kris M. Kitani | N/A | |
| How To Design a Three-Stage Architecture for Audio-Visual Active Speaker Detection in the Wild | Okan Köpüklü · , · Maja Taseska · , · Gerhard Rigoll | N/A | |
| Do Different Deep Metric Learning Losses Lead to Similar Learned Features? | Konstantin Kobs · , · Michael Steininger · , · Andrzej Dulny · , · Andreas Hotho | N/A | |
| Just Ask: Learning To Answer Questions From Millions of Narrated Videos | Antoine Yang · , · Antoine Miech · , · Josef Sivic · , · Ivan Laptev · , · Cordelia Schmid | N/A | |
| Towards Face Encryption by Generating Adversarial Identity Masks | Xiao Yang · , · Yinpeng Dong · , · Tianyu Pang · , · Hang Su · , · Jun Zhu · , · Yuefeng Chen · , · Hui Xue | N/A | |
| UniT: Multimodal Multitask Learning With a Unified Transformer | Ronghang Hu · , · Amanpreet Singh | N/A | |
| CSG-Stump: A Learning Friendly CSG-Like Representation for Interpretable Shape Parsing | Daxuan Ren · , · Jianmin Zheng · , · Jianfei Cai · , · Jiatong Li · , · Haiyong Jiang · , · Zhongang Cai · , · Junzhe Zhang · , · Liang Pan · , · Mingyuan Zhang · , · Haiyu Zhao · , · Shuai Yi | N/A | |
| Compressing Visual-Linguistic Model via Knowledge Distillation | Zhiyuan Fang · , · Jianfeng Wang · , · Xiaowei Hu · , · Lijuan Wang · , · Yezhou Yang · , · Zicheng Liu | N/A | |
| Full-Duplex Strategy for Video Object Segmentation | Ge-Peng Ji · , · Keren Fu · , · Zhe Wu · , · Deng-Ping Fan · , · Jianbing Shen · , · Ling Shao | N/A | |
| Semi-Supervised Learning of Visual Features by Non-Parametrically Predicting View Assignments With Support Samples | Mahmoud Assran · , · Mathilde Caron · , · Ishan Misra · , · Piotr Bojanowski · , · Armand Joulin · , · Nicolas Ballas · , · Michael Rabbat | N/A | |
| Unsupervised Non-Rigid Image Distortion Removal via Grid Deformation | Nianyi Li · , · Simron Thapa · , · Cameron Whyte · , · Albert W. Reed · , · Suren Jayasuriya · , · Jinwei Ye | N/A | |
| Stochastic Transformer Networks With Linear Competing Units: Application To End-to-End SL Translation | Andreas Voskou · , · Konstantinos P. Panousis · , · Dimitrios Kosmopoulos · , · Dimitris N. Metaxas · , · Sotirios Chatzis | N/A | |
| BlockCopy: High-Resolution Video Processing With Block-Sparse Feature Propagation and Online Policies | Thomas Verelst · , · Tinne Tuytelaars | N/A | |
| Telling the What While Pointing to the Where: Multimodal Queries for Image Retrieval | Soravit Changpinyo · , · Jordi Pont-Tuset · , · Vittorio Ferrari · , · Radu Soricut | N/A | |
| Unsupervised Learning of Fine Structure Generation for 3D Point Clouds by 2D Projections Matching | Chao Chen · , · Zhizhong Han · , · Yu-Shen Liu · , · Matthias Zwicker | N/A | |
| SS-IL: Separated Softmax for Incremental Learning | Hongjoon Ahn · , · Jihwan Kwak · , · Subin Lim · , · Hyeonsu Bang · , · Hyojun Kim · , · Taesup Moon | N/A | |
| Multiple Pairwise Ranking Networks for Personalized Video Summarization | Yassir Saquil · , · Da Chen · , · Yuan He · , · Chuan Li · , · Yong-Liang Yang | N/A | |
| Domain-Invariant Disentangled Network for Generalizable Object Detection | Chuang Lin · , · Zehuan Yuan · , · Sicheng Zhao · , · Peize Sun · , · Changhu Wang · , · Jianfei Cai | N/A | |
| Social NCE: Contrastive Learning of Socially-Aware Motion Representations | Yuejiang Liu · , · Qi Yan · , · Alexandre Alahi | N/A | |
| The Center of Attention: Center-Keypoint Grouping via Attention for Multi-Person Pose Estimation | Guillem Brasó · , · Nikita Kister · , · Laura Leal-Taixé | N/A | |
| FloW: A Dataset and Benchmark for Floating Waste Detection in Inland Waters | Yuwei Cheng · , · Jiannan Zhu · , · Mengxin Jiang · , · Jie Fu · , · Changsong Pang · , · Peidong Wang · , · Kris Sankaran · , · Olawale Onabola · , · Yimin Liu · , · Dianbo Liu · , · Yoshua Bengio | N/A | |
| Robust Trust Region for Weakly Supervised Segmentation | Dmitrii Marin · , · Yuri Boykov | N/A | |
| imGHUM: Implicit Generative Models of 3D Human Shape and Articulated Pose | Thiemo Alldieck · , · Hongyi Xu · , · Cristian Sminchisescu | N/A | |
| Dynamic High-Pass Filtering and Multi-Spectral Attention for Image Super-Resolution | Salma Abdel Magid · , · Yulun Zhang · , · Donglai Wei · , · Won-Dong Jang · , · Zudi Lin · , · Yun Fu · , · Hanspeter Pfister | N/A | |
| Self-Knowledge Distillation With Progressive Refinement of Targets | Kyungyul Kim · , · ByeongMoon Ji · , · Doyoung Yoon · , · Sangheum Hwang | N/A | |
| Towards Flexible Blind JPEG Artifacts Removal | Jiaxi Jiang · , · Kai Zhang · , · Radu Timofte | N/A | |
| Channel-Wise Topology Refinement Graph Convolution for Skeleton-Based Action Recognition | Yuxin Chen · , · Ziqi Zhang · , · Chunfeng Yuan · , · Bing Li · , · Ying Deng · , · Weiming Hu | N/A | |
| VSAC: Efficient and Accurate Estimator for H and F | Maksym Ivashechkin · , · Daniel Barath · , · Jiří Matas | N/A | |
| HPNet: Deep Primitive Segmentation Using Hybrid Representations | Siming Yan · , · Zhenpei Yang · , · Chongyang Ma · , · Haibin Huang · , · Etienne Vouga · , · Qixing Huang | N/A | |
| Fusion Moves for Graph Matching | Lisa Hutschenreiter · , · Stefan Haller · , · Lorenz Feineis · , · Carsten Rother · , · Dagmar Kainmüller · , · Bogdan Savchynskyy | N/A | |
| Universal-Prototype Enhancing for Few-Shot Object Detection | Aming Wu · , · Yahong Han · , · Linchao Zhu · , · Yi Yang | N/A | |
| I2UV-HandNet: Image-to-UV Prediction Network for Accurate and High-Fidelity 3D Hand Mesh Modeling | Ping Chen · , · Yujin Chen · , · Dong Yang · , · Fangyin Wu · , · Qin Li · , · Qingpei Xia · , · Yong Tan | N/A | |
| Fast Video Moment Retrieval | Junyu Gao · , · Changsheng Xu | N/A | |
| Self-Supervised Geometric Features Discovery via Interpretable Attention for Vehicle Re-Identification and Beyond | Ming Li · , · Xinming Huang · , · Ziming Zhang | N/A | |
| With a Little Help From My Friends: Nearest-Neighbor Contrastive Learning of Visual Representations | Debidatta Dwibedi · , · Yusuf Aytar · , · Jonathan Tompson · , · Pierre Sermanet · , · Andrew Zisserman | N/A | |
| Explainable Person Re-Identification With Attribute-Guided Metric Distillation | Xiaodong Chen · , · Xinchen Liu · , · Wu Liu · , · Xiao-Ping Zhang · , · Yongdong Zhang · , · Tao Mei | N/A | |
| Motion-Focused Contrastive Learning of Video Representations | Rui Li · , · Yiheng Zhang · , · Zhaofan Qiu · , · Ting Yao · , · Dong Liu · , · Tao Mei | N/A | |
| Motion Guided Region Message Passing for Video Captioning | Shaoxiang Chen · , · Yu-Gang Jiang | N/A | |
| Learning Causal Representation for Training Cross-Domain Pose Estimator via Generative Interventions | Xiheng Zhang · , · Yongkang Wong · , · Xiaofei Wu · , · Juwei Lu · , · Mohan Kankanhalli · , · Xiangdong Li · , · Weidong Geng | N/A | |
| Super-Resolving Cross-Domain Face Miniatures by Peeking at One-Shot Exemplar | Peike Li · , · Xin Yu · , · Yi Yang | N/A | |
| Webly Supervised Fine-Grained Recognition: Benchmark Datasets and an Approach | Zeren Sun · , · Yazhou Yao · , · Xiu-Shen Wei · , · Yongshun Zhang · , · Fumin Shen · , · Jianxin Wu · , · Jian Zhang · , · Heng Tao Shen | N/A | |
| Towards Interpretable Deep Networks for Monocular Depth Estimation | Zunzhi You · , · Yi-Hsuan Tsai · , · Wei-Chen Chiu · , · Guanbin Li | N/A | |
| Instance Segmentation in 3D Scenes Using Semantic Superpoint Tree Networks | Zhihao Liang · , · Zhihao Li · , · Songcen Xu · , · Mingkui Tan · , · Kui Jia | N/A | |
| Exploring Cross-Image Pixel Contrast for Semantic Segmentation | Wenguan Wang · , · Tianfei Zhou · , · Fisher Yu · , · Jifeng Dai · , · Ender Konukoglu · , · Luc Van Gool | N/A | |
| Geometric Granularity Aware Pixel-To-Mesh | Yue Shi · , · Bingbing Ni · , · Jinxian Liu · , · Dingyi Rong · , · Ye Qian · , · Wenjun Zhang | N/A | |
| Pixel Difference Networks for Efficient Edge Detection | Zhuo Su · , · Wenzhe Liu · , · Zitong Yu · , · Dewen Hu · , · Qing Liao · , · Qi Tian · , · Matti Pietikäinen · , · Li Liu | N/A | |
| Towards Understanding the Generative Capability of Adversarially Robust Classifiers | Yao Zhu · , · Jiacheng Ma · , · Jiacheng Sun · , · Zewei Chen · , · Rongxin Jiang · , · Yaowu Chen · , · Zhenguo Li | N/A | |
| Learning Efficient Photometric Feature Transform for Multi-View Stereo | Kaizhang Kang · , · Cihui Xie · , · Ruisheng Zhu · , · Xiaohe Ma · , · Ping Tan · , · Hongzhi Wu · , · Kun Zhou | N/A | |
| NEAT: Neural Attention Fields for End-to-End Autonomous Driving | Kashyap Chitta · , · Aditya Prakash · , · Andreas Geiger | N/A | |
| Modulated Periodic Activations for Generalizable Local Functional Representations | Ishit Mehta · , · Michaël Gharbi · , · Connelly Barnes · , · Eli Shechtman · , · Ravi Ramamoorthi · , · Manmohan Chandraker | N/A | |
| Neural Architecture Search for Joint Human Parsing and Pose Estimation | Dan Zeng · , · Yuhang Huang · , · Qian Bao · , · Junjie Zhang · , · Chi Su · , · Wu Liu | N/A | |
| Fast Light-Field Disparity Estimation With Multi-Disparity-Scale Cost Aggregation | Zhicong Huang · , · Xuemei Hu · , · Zhou Xue · , · Weizhu Xu · , · Tao Yue | N/A | |
| SemIE: Semantically-Aware Image Extrapolation | Bholeshwar Khurana · , · Soumya Ranjan Dash · , · Abhishek Bhatia · , · Aniruddha Mahapatra · , · Hrituraj Singh · , · Kuldeep Kulkarni | N/A | |
| Transformer-Based Dual Relation Graph for Multi-Label Image Recognition | Jiawei Zhao · , · Ke Yan · , · Yifan Zhao · , · Xiaowei Guo · , · Feiyue Huang · , · Jia Li | N/A | |
| Self-Supervised Transfer Learning for Hand Mesh Recovery From Binocular Images | Zheng Chen · , · Sihan Wang · , · Yi Sun · , · Xiaohong Ma | N/A | |
| Faster Multi-Object Segmentation Using Parallel Quadratic Pseudo-Boolean Optimization | Niels Jeppesen · , · Patrick M. Jensen · , · Anders N. Christensen · , · Anders B. Dahl · , · Vedrana A. Dahl | N/A | |
| Partial Off-Policy Learning: Balance Accuracy and Diversity for Human-Oriented Image Captioning | Jiahe Shi · , · Yali Li · , · Shengjin Wang | N/A | |
| Prior to Segment: Foreground Cues for Weakly Annotated Classes in Partially Supervised Instance Segmentation | David Biertimpel · , · Sindi Shkodrani · , · Anil S. Baslamisli · , · Nóra Baka | N/A | |
| Interpretation of Emergent Communication in Heterogeneous Collaborative Embodied Agents | Shivansh Patel · , · Saim Wani · , · Unnat Jain · , · Alexander G. Schwing · , · Svetlana Lazebnik · , · Manolis Savva · , · Angel X. Chang | N/A | |
| A Dark Flash Normal Camera | Zhihao Xia · , · Jason Lawrence · , · Supreeth Achar | N/A | |
| Dynamic CT Reconstruction From Limited Views With Implicit Neural Representations and Parametric Motion Fields | Albert W. Reed · , · Hyojin Kim · , · Rushil Anirudh · , · K. Aditya Mohan · , · Kyle Champley · , · Jingu Kang · , · Suren Jayasuriya | N/A | |
| Diverse Image Style Transfer via Invertible Cross-Space Mapping | Haibo Chen · , · Lei Zhao · , · Huiming Zhang · , · Zhizhong Wang · , · Zhiwen Zuo · , · Ailin Li · , · Wei Xing · , · Dongming Lu | N/A | |
| Variational Attention: Propagating Domain-Specific Knowledge for Multi-Domain Learning in Crowd Counting | Binghui Chen · , · Zhaoyi Yan · , · Ke Li · , · Pengyu Li · , · Biao Wang · , · Wangmeng Zuo · , · Lei Zhang | N/A | |
| Pri3D: Can 3D Priors Help 2D Representation Learning? | Ji Hou · , · Saining Xie · , · Benjamin Graham · , · Angela Dai · , · Matthias Nießner | N/A | |
| PoGO-Net: Pose Graph Optimization With Graph Neural Networks | Xinyi Li · , · Haibin Ling | N/A | |
| Federated Learning for Non-IID Data via Unified Feature Learning and Optimization Objective Alignment | Lin Zhang · , · Yong Luo · , · Yan Bai · , · Bo Du · , · Ling-Yu Duan | N/A | |
| Self-Supervised Video Object Segmentation by Motion Grouping | Charig Yang · , · Hala Lamdouar · , · Erika Lu · , · Andrew Zisserman · , · Weidi Xie | N/A | |
| End-to-End Piece-Wise Unwarping of Document Images | Sagnik Das · , · Kunwar Yashraj Singh · , · Jon Wu · , · Erhan Bas · , · Vijay Mahadevan · , · Rahul Bhotika · , · Dimitris Samaras | N/A | |
| 4D Cloud Scattering Tomography | Roi Ronen · , · Yoav Y. Schechner · , · Eshkol Eytan | N/A | |
| Weakly Supervised Representation Learning With Coarse Labels | Yuanhong Xu · , · Qi Qian · , · Hao Li · , · Rong Jin · , · Juhua Hu | N/A | |
| Asymmetric Bilateral Motion Estimation for Video Frame Interpolation | Junheum Park · , · Chul Lee · , · Chang-Su Kim | N/A | |
| LocalTrans: A Multiscale Local Transformer Network for Cross-Resolution Homography Estimation | Ruizhi Shao · , · Gaochang Wu · , · Yuemei Zhou · , · Ying Fu · , · Lu Fang · , · Yebin Liu | N/A | |
| De-Rendering Stylized Texts | Wataru Shimoda · , · Daichi Haraguchi · , · Seiichi Uchida · , · Kota Yamaguchi | N/A | |
| HRegNet: A Hierarchical Network for Large-Scale Outdoor LiDAR Point Cloud Registration | Fan Lu · , · Guang Chen · , · Yinlong Liu · , · Lijun Zhang · , · Sanqing Qu · , · Shu Liu · , · Rongqi Gu | N/A | |
| A Latent Transformer for Disentangled Face Editing in Images and Videos | Xu Yao · , · Alasdair Newson · , · Yann Gousseau · , · Pierre Hellier | N/A | |
| AD-NeRF: Audio Driven Neural Radiance Fields for Talking Head Synthesis | Yudong Guo · , · Keyu Chen · , · Sen Liang · , · Yong-Jin Liu · , · Hujun Bao · , · Juyong Zhang | N/A | |
| Graph-Based Asynchronous Event Processing for Rapid Object Recognition | Yijin Li · , · Han Zhou · , · Bangbang Yang · , · Ye Zhang · , · Zhaopeng Cui · , · Hujun Bao · , · Guofeng Zhang | N/A | |
| Learning With Noisy Labels via Sparse Regularization | Xiong Zhou · , · Xianming Liu · , · Chenyang Wang · , · Deming Zhai · , · Junjun Jiang · , · Xiangyang Ji | N/A | |
| Leveraging Auxiliary Tasks With Affinity Learning for Weakly Supervised Semantic Segmentation | Lian Xu · , · Wanli Ouyang · , · Mohammed Bennamoun · , · Farid Boussaid · , · Ferdous Sohel · , · Dan Xu | N/A | |
| Improving 3D Object Detection With Channel-Wise Transformer | Hualian Sheng · , · Sijia Cai · , · Yuan Liu · , · Bing Deng · , · Jianqiang Huang · , · Xian-Sheng Hua · , · Min-Jian Zhao | N/A | |
| CanvasVAE: Learning To Generate Vector Graphic Documents | Kota Yamaguchi | N/A | |
| Flow-Guided Video Inpainting With Scene Templates | Dong Lao · , · Peihao Zhu · , · Peter Wonka · , · Ganesh Sundaramoorthi | N/A | |
| Long Short View Feature Decomposition via Contrastive Video Representation Learning | Nadine Behrmann · , · Mohsen Fayyaz · , · Juergen Gall · , · Mehdi Noroozi | N/A | |
| TACo: Token-Aware Cascade Contrastive Learning for Video-Text Alignment | Jianwei Yang · , · Yonatan Bisk · , · Jianfeng Gao | N/A | |
| Meta Learning on a Sequence of Imbalanced Domains With Difficulty Awareness | Zhenyi Wang · , · Tiehang Duan · , · Le Fang · , · Qiuling Suo · , · Mingchen Gao | N/A | |
| Ranking Models in Unlabeled New Environments | Xiaoxiao Sun · , · Yunzhong Hou · , · Weijian Deng · , · Hongdong Li · , · Liang Zheng | N/A | |
| Adaptive Confidence Thresholding for Monocular Depth Estimation | Hyesong Choi · , · Hunsang Lee · , · Sunkyung Kim · , · Sunok Kim · , · Seungryong Kim · , · Kwanghoon Sohn · , · Dongbo Min | N/A | |
| Embedding Novel Views in a Single JPEG Image | Yue Wu · , · Guotao Meng · , · Qifeng Chen | N/A | |
| Channel Augmented Joint Learning for Visible-Infrared Recognition | Mang Ye · , · Weijian Ruan · , · Bo Du · , · Mike Zheng Shou | N/A | |
| OMNet: Learning Overlapping Mask for Partial-to-Partial Point Cloud Registration | Hao Xu · , · Shuaicheng Liu · , · Guangfu Wang · , · Guanghui Liu · , · Bing Zeng | N/A | |
| Learning Skeletal Graph Neural Networks for Hard 3D Pose Estimation | Ailing Zeng · , · Xiao Sun · , · Lei Yang · , · Nanxuan Zhao · , · Minhao Liu · , · Qiang Xu | N/A | |
| Recursively Conditional Gaussian for Ordinal Unsupervised Domain Adaptation | Xiaofeng Liu · , · Site Li · , · Yubin Ge · , · Pengyi Ye · , · Jane You · , · Jun Lu | N/A | |
| Learning Anchored Unsigned Distance Functions With Gradient Direction Alignment for Single-View Garment Reconstruction | Fang Zhao · , · Wenhao Wang · , · Shengcai Liao · , · Ling Shao | N/A | |
| TeachText: CrossModal Generalized Distillation for Text-Video Retrieval | Ioana Croitoru · , · Simion-Vlad Bogolin · , · Marius Leordeanu · , · Hailin Jin · , · Andrew Zisserman · , · Samuel Albanie · , · Yang Liu | N/A | |
| Geometry Uncertainty Projection Network for Monocular 3D Object Detection | Yan Lu · , · Xinzhu Ma · , · Lei Yang · , · Tianzhu Zhang · , · Yating Liu · , · Qi Chu · , · Junjie Yan · , · Wanli Ouyang | N/A | |
| OVANet: One-vs-All Network for Universal Domain Adaptation | Kuniaki Saito · , · Kate Saenko | N/A | |
| A Hybrid Frequency-Spatial Domain Model for Sparse Image Reconstruction in Scanning Transmission Electron Microscopy | Bintao He · , · Fa Zhang · , · Huanshui Zhang · , · Renmin Han | N/A | |
| Attentional Pyramid Pooling of Salient Visual Residuals for Place Recognition | Guohao Peng · , · Jun Zhang · , · Heshan Li · , · Danwei Wang | N/A | |
| Learning With Noisy Labels for Robust Point Cloud Segmentation | Shuquan Ye · , · Dongdong Chen · , · Songfang Han · , · Jing Liao | N/A | |
| Where Are You Heading? Dynamic Trajectory Prediction With Expert Goal Examples | He Zhao · , · Richard P. Wildes | N/A | |
| Planar Surface Reconstruction From Sparse Views | Linyi Jin · , · Shengyi Qian · , · Andrew Owens · , · David F. Fouhey | N/A | |
| Dynamic Divide-and-Conquer Adversarial Training for Robust Semantic Segmentation | Xiaogang Xu · , · Hengshuang Zhao · , · Jiaya Jia | N/A | |
| MixMix: All You Need for Data-Free Compression Are Feature and Data Mixing | Yuhang Li · , · Feng Zhu · , · Ruihao Gong · , · Mingzhu Shen · , · Xin Dong · , · Fengwei Yu · , · Shaoqing Lu · , · Shi Gu | N/A | |
| VidTr: Video Transformer Without Convolutions | Yanyi Zhang · , · Xinyu Li · , · Chunhui Liu · , · Bing Shuai · , · Yi Zhu · , · Biagio Brattoli · , · Hao Chen · , · Ivan Marsic · , · Joseph Tighe | N/A | |
| LocTex: Learning Data-Efficient Visual Representations From Localized Textual Supervision | Zhijian Liu · , · Simon Stent · , · Jie Li · , · John Gideon · , · Song Han | N/A | |
| Weakly Supervised Segmentation of Small Buildings With Point Labels | Jae-Hun Lee · , · ChanYoung Kim · , · Sanghoon Sull | N/A | |
| Online Knowledge Distillation for Efficient Pose Estimation | Zheng Li · , · Jingwen Ye · , · Mingli Song · , · Ying Huang · , · Zhigeng Pan | N/A | |
| HAA500: Human-Centric Atomic Action Dataset With Curated Videos | Jihoon Chung · , · Cheng-hsin Wuu · , · Hsuan-ru Yang · , · Yu-Wing Tai · , · Chi-Keung Tang | N/A | |
| Efficient Large Scale Inlier Voting for Geometric Vision Problems | Dror Aiger · , · Simon Lynen · , · Jan Hosang · , · Bernhard Zeisl | N/A | |
| From Goals, Waypoints & Paths to Long Term Human Trajectory Forecasting | Karttikeya Mangalam · , · Yang An · , · Harshayu Girase · , · Jitendra Malik | N/A | |
| DiscoBox: Weakly Supervised Instance Segmentation and Semantic Correspondence From Box Supervision | Shiyi Lan · , · Zhiding Yu · , · Christopher Choy · , · Subhashree Radhakrishnan · , · Guilin Liu · , · Yuke Zhu · , · Larry S. Davis · , · Anima Anandkumar | N/A | |
| VENet: Voting Enhancement Network for 3D Object Detection | Qian Xie · , · Yu-Kun Lai · , · Jing Wu · , · Zhoutao Wang · , · Dening Lu · , · Mingqiang Wei · , · Jun Wang | N/A | |
| Intrinsic-Extrinsic Preserved GANs for Unsupervised 3D Pose Transfer | Haoyu Chen · , · Hao Tang · , · Henglin Shi · , · Wei Peng · , · Nicu Sebe · , · Guoying Zhao | N/A | |
| Aggregation With Feature Detection | Shuyang Sun · , · Xiaoyu Yue · , · Xiaojuan Qi · , · Wanli Ouyang · , · Victor Adrian Prisacariu · , · Philip H.S. Torr | N/A | |
| Multi-Echo LiDAR for 3D Object Detection | Yunze Man · , · Xinshuo Weng · , · Prasanna Kumar Sivakumar · , · Matthew O'Toole · , · Kris M. Kitani | N/A | |
| Self-Regulation for Semantic Segmentation | Dong Zhang · , · Hanwang Zhang · , · Jinhui Tang · , · Xian-Sheng Hua · , · Qianru Sun | N/A | |
| Skeleton2Mesh: Kinematics Prior Injected Unsupervised Human Mesh Recovery | Zhenbo Yu · , · Junjie Wang · , · Jingwei Xu · , · Bingbing Ni · , · Chenglong Zhao · , · Minsi Wang · , · Wenjun Zhang | N/A | |
| ReCU: Reviving the Dead Weights in Binary Neural Networks | Zihan Xu · , · Mingbao Lin · , · Jianzhuang Liu · , · Jie Chen · , · Ling Shao · , · Yue Gao · , · Yonghong Tian · , · Rongrong Ji | N/A | |
| Membership Inference Attacks Are Easier on Difficult Problems | Avital Shafran · , · Shmuel Peleg · , · Yedid Hoshen | N/A | |
| Auxiliary Tasks and Exploration Enable ObjectGoal Navigation | Joel Ye · , · Dhruv Batra · , · Abhishek Das · , · Erik Wijmans | N/A | |
| Semantic-Embedded Unsupervised Spectral Reconstruction From Single RGB Images in the Wild | Zhiyu Zhu · , · Hui Liu · , · Junhui Hou · , · Huanqiang Zeng · , · Qingfu Zhang | N/A | |
| Foreground-Action Consistency Network for Weakly Supervised Temporal Action Localization | Linjiang Huang · , · Liang Wang · , · Hongsheng Li | N/A | |
| MixMo: Mixing Multiple Inputs for Multiple Outputs via Deep Subnetworks | Alexandre Ramé · , · Rémy Sun · , · Matthieu Cord | N/A | |
| CrowdDriven: A New Challenging Dataset for Outdoor Visual Localization | Ara Jafarzadeh · , · Manuel López Antequera · , · Pau Gargallo · , · Yubin Kuang · , · Carl Toft · , · Fredrik Kahl · , · Torsten Sattler | N/A | |
| PnP-DETR: Towards Efficient Visual Analysis With Transformers | Tao Wang · , · Li Yuan · , · Yunpeng Chen · , · Jiashi Feng · , · Shuicheng Yan | N/A | |
| PlaneTR: Structure-Guided Transformers for 3D Plane Recovery | Bin Tan · , · Nan Xue · , · Song Bai · , · Tianfu Wu · , · Gui-Song Xia | N/A | |
| Concept Generalization in Visual Representation Learning | Mert Bulent Sariyildiz · , · Yannis Kalantidis · , · Diane Larlus · , · Karteek Alahari | N/A | |
| Unsupervised Segmentation Incorporating Shape Prior via Generative Adversarial Networks | Dahye Kim · , · Byung-Woo Hong | N/A | |
| DRB-GAN: A Dynamic ResBlock Generative Adversarial Network for Artistic Style Transfer | Wenju Xu · , · Chengjiang Long · , · Ruisheng Wang · , · Guanghui Wang | N/A | |
| Act the Part: Learning Interaction Strategies for Articulated Object Part Discovery | Samir Yitzhak Gadre · , · Kiana Ehsani · , · Shuran Song | N/A | |
| DCT-SNN: Using DCT To Distribute Spatial Information Over Time for Low-Latency Spiking Neural Networks | Isha Garg · , · Sayeed Shafayet Chowdhury · , · Kaushik Roy | N/A | |
| Learning To Resize Images for Computer Vision Tasks | Hossein Talebi · , · Peyman Milanfar | N/A | |
| Self-Supervised Cryo-Electron Tomography Volumetric Image Restoration From Single Noisy Volume With Sparsity Constraint | Zhidong Yang · , · Fa Zhang · , · Renmin Han | N/A | |
| The Many Faces of Robustness: A Critical Analysis of Out-of-Distribution Generalization | Dan Hendrycks · , · Steven Basart · , · Norman Mu · , · Saurav Kadavath · , · Frank Wang · , · Evan Dorundo · , · Rahul Desai · , · Tyler Zhu · , · Samyak Parajuli · , · Mike Guo · , · Dawn Song · , · Jacob Steinhardt · , · Justin Gilmer | N/A | |
| Field of Junctions: Extracting Boundary Structure at Low SNR | Dor Verbin · , · Todd Zickler | N/A | |
| Crowd Counting With Partial Annotations in an Image | Yanyu Xu · , · Ziming Zhong · , · Dongze Lian · , · Jing Li · , · Zhengxin Li · , · Xinxing Xu · , · Shenghua Gao | N/A | |
| Continual Neural Mapping: Learning an Implicit Scene Representation From Sequential Observations | Zike Yan · , · Yuxin Tian · , · Xuesong Shi · , · Ping Guo · , · Peng Wang · , · Hongbin Zha | N/A | |
| LSD-StructureNet: Modeling Levels of Structural Detail in 3D Part Hierarchies | Dominic Roberts · , · Ara Danielyan · , · Hang Chu · , · Mani Golparvar-Fard · , · David Forsyth | N/A | |
| Weakly Supervised Temporal Anomaly Segmentation With Dynamic Time Warping | Dongha Lee · , · Sehun Yu · , · Hyunjun Ju · , · Hwanjo Yu | N/A | |
| Adaptive Label Noise Cleaning With Meta-Supervision for Deep Face Recognition | Yaobin Zhang · , · Weihong Deng · , · Yaoyao Zhong · , · Jiani Hu · , · Xian Li · , · Dongyue Zhao · , · Dongchao Wen | N/A | |
| Unsupervised Dense Deformation Embedding Network for Template-Free Shape Correspondence | Ronghan Chen · , · Yang Cong · , · Jiahua Dong | N/A | |
| Learning Action Completeness From Points for Weakly-Supervised Temporal Action Localization | Pilhyeon Lee · , · Hyeran Byun | N/A | |
| Re-Distributing Biased Pseudo Labels for Semi-Supervised Semantic Segmentation: A Baseline Investigation | Ruifei He · , · Jihan Yang · , · Xiaojuan Qi | N/A | |
| Visformer: The Vision-Friendly Transformer | Zhengsu Chen · , · Lingxi Xie · , · Jianwei Niu · , · Xuefeng Liu · , · Longhui Wei · , · Qi Tian | N/A | |
| Learning Indoor Inverse Rendering With 3D Spatially-Varying Lighting | Zian Wang · , · Jonah Philion · , · Sanja Fidler · , · Jan Kautz | N/A | |
| DeepGaze IIE: Calibrated Prediction in and Out-of-Domain for State-of-the-Art Saliency Modeling | Akis Linardos · , · Matthias Kümmerer · , · Ori Press · , · Matthias Bethge | N/A | |
| Learning To Drive From a World on Rails | Dian Chen · , · Vladlen Koltun · , · Philipp Krähenbühl | N/A | |
| Animatable Neural Radiance Fields for Modeling Dynamic Human Bodies | Sida Peng · , · Junting Dong · , · Qianqian Wang · , · Shangzhan Zhang · , · Qing Shuai · , · Xiaowei Zhou · , · Hujun Bao | N/A | |
| OpenGAN: Open-Set Recognition via Open Data Generation | Shu Kong · , · Deva Ramanan | N/A | |
| Learning To Reduce Defocus Blur by Realistically Modeling Dual-Pixel Data | Abdullah Abuolaim · , · Mauricio Delbracio · , · Damien Kelly · , · Michael S. Brown · , · Peyman Milanfar | N/A | |
| Uncertainty-Aware Pseudo Label Refinery for Domain Adaptive Semantic Segmentation | Yuxi Wang · , · Junran Peng · , · ZhaoXiang Zhang | N/A | |
| Mining Contextual Information Beyond Image for Semantic Segmentation | Zhenchao Jin · , · Tao Gong · , · Dongdong Yu · , · Qi Chu · , · Jian Wang · , · Changhu Wang · , · Jie Shao | N/A | |
| A General Recurrent Tracking Framework Without Real Data | Shuai Wang · , · Hao Sheng · , · Yang Zhang · , · Yubin Wu · , · Zhang Xiong | N/A | |
| Knowledge Mining and Transferring for Domain Adaptive Object Detection | Kun Tian · , · Chenghao Zhang · , · Ying Wang · , · Shiming Xiang · , · Chunhong Pan | N/A | |
| Cloud Transformers: A Universal Approach to Point Cloud Processing Tasks | Kirill Mazur · , · Victor Lempitsky | N/A | |
| TravelNet: Self-Supervised Physically Plausible Hand Motion Learning From Monocular Color Images | Zimeng Zhao · , · Xi Zhao · , · Yangang Wang | N/A | |
| Joint Visual Semantic Reasoning: Multi-Stage Decoder for Text Recognition | Ayan Kumar Bhunia · , · Aneeshan Sain · , · Amandeep Kumar · , · Shuvozit Ghose · , · Pinaki Nath Chowdhury · , · Yi-Zhe Song | N/A | |
| Video Self-Stitching Graph Network for Temporal Action Localization | Chen Zhao · , · Ali K. Thabet · , · Bernard Ghanem | N/A | |
| Dynamic Dual Gating Neural Networks | Fanrong Li · , · Gang Li · , · Xiangyu He · , · Jian Cheng | N/A | |
| Gravity-Aware Monocular 3D Human-Object Reconstruction | Rishabh Dabral · , · Soshi Shimada · , · Arjun Jain · , · Christian Theobalt · , · Vladislav Golyanik | N/A | |
| Exploring Relational Context for Multi-Task Dense Prediction | David Brüggemann · , · Menelaos Kanakis · , · Anton Obukhov · , · Stamatios Georgoulis · , · Luc Van Gool | N/A | |
| Going Deeper With Image Transformers | Hugo Touvron · , · Matthieu Cord · , · Alexandre Sablayrolles · , · Gabriel Synnaeve · , · Hervé Jégou | N/A | |
| UltraPose: Synthesizing Dense Pose With 1 Billion Points by Human-Body Decoupling 3D Model | Haonan Yan · , · Jiaqi Chen · , · Xujie Zhang · , · Shengkai Zhang · , · Nianhong Jiao · , · Xiaodan Liang · , · Tianxiang Zheng | N/A | |
| Hand Image Understanding via Deep Multi-Task Learning | Xiong Zhang · , · Hongsheng Huang · , · Jianchao Tan · , · Hongmin Xu · , · Cheng Yang · , · Guozhu Peng · , · Lei Wang · , · Ji Liu | N/A | |
| MFNet: Multi-Filter Directive Network for Weakly Supervised Salient Object Detection | Yongri Piao · , · Jian Wang · , · Miao Zhang · , · Huchuan Lu | N/A | |
| DRIVE: Deep Reinforced Accident Anticipation With Visual Explanation | Wentao Bao · , · Qi Yu · , · Yu Kong | N/A | |
| On the Importance of Distractors for Few-Shot Classification | Rajshekhar Das · , · Yu-Xiong Wang · , · José M. F. Moura | N/A | |
| Zero-Shot Natural Language Video Localization | Jinwoo Nam · , · Daechul Ahn · , · Dongyeop Kang · , · Seong Jong Ha · , · Jonghyun Choi | N/A | |
| Deep Halftoning With Reversible Binary Pattern | Menghan Xia · , · Wenbo Hu · , · Xueting Liu · , · Tien-Tsin Wong | N/A | |
| Robustness via Cross-Domain Ensembles | Teresa Yeo · , · Oğuzhan Fatih Kar · , · Amir Zamir | N/A | |
| Topic Scene Graph Generation by Attention Distillation From Caption | Wenbin Wang · , · Ruiping Wang · , · Xilin Chen | N/A | |
| FFT-OT: A Fast Algorithm for Optimal Transportation | Na Lei · , · Xianfeng Gu | N/A | |
| Contrastive Learning for Label Efficient Semantic Segmentation | Xiangyun Zhao · , · Raviteja Vemulapalli · , · Philip Andrew Mansfield · , · Boqing Gong · , · Bradley Green · , · Lior Shapira · , · Ying Wu | N/A | |
| Progressive Correspondence Pruning by Consensus Learning | Chen Zhao · , · Yixiao Ge · , · Feng Zhu · , · Rui Zhao · , · Hongsheng Li · , · Mathieu Salzmann | N/A | |
| BiMaL: Bijective Maximum Likelihood Approach to Domain Adaptation in Semantic Scene Segmentation | Thanh-Dat Truong · , · Chi Nhan Duong · , · Ngan Le · , · Son Lam Phung · , · Chase Rainwater · , · Khoa Luu | N/A | |
| Multiscale Vision Transformers | Haoqi Fan · , · Bo Xiong · , · Karttikeya Mangalam · , · Yanghao Li · , · Zhicheng Yan · , · Jitendra Malik · , · Christoph Feichtenhofer | N/A | |
| Robust Small Object Detection on the Water Surface Through Fusion of Camera and Millimeter Wave Radar | Yuwei Cheng · , · Hu Xu · , · Yimin Liu | N/A | |
| Just a Few Points Are All You Need for Multi-View Stereo: A Novel Semi-Supervised Learning Method for Multi-View Stereo | Taekyung Kim · , · Jaehoon Choi · , · Seokeon Choi · , · Dongki Jung · , · Changick Kim | N/A | |
| Multispectral Illumination Estimation Using Deep Unrolling Network | Yuqi Li · , · Qiang Fu · , · Wolfgang Heidrich | N/A | |
| GroupFormer: Group Activity Recognition With Clustered Spatial-Temporal Transformer | Shuaicheng Li · , · Qianggang Cao · , · Lingbo Liu · , · Kunlin Yang · , · Shinan Liu · , · Jun Hou · , · Shuai Yi | N/A | |
| BAPA-Net: Boundary Adaptation and Prototype Alignment for Cross-Domain Semantic Segmentation | Yahao Liu · , · Jinhong Deng · , · Xinchen Gao · , · Wen Li · , · Lixin Duan | N/A | |
| TRiPOD: Human Trajectory and Pose Dynamics Forecasting in the Wild | Vida Adeli · , · Mahsa Ehsanpour · , · Ian Reid · , · Juan Carlos Niebles · , · Silvio Savarese · , · Ehsan Adeli · , · Hamid Rezatofighi | N/A | |
| Conditional DETR for Fast Training Convergence | Depu Meng · , · Xiaokang Chen · , · Zejia Fan · , · Gang Zeng · , · Houqiang Li · , · Yuhui Yuan · , · Lei Sun · , · Jingdong Wang | N/A | |
| Distilling Global and Local Logits With Densely Connected Relations | Youmin Kim · , · Jinbae Park · , · YounHo Jang · , · Muhammad Ali · , · Tae-Hyun Oh · , · Sung-Ho Bae | N/A | |
| A Hierarchical Transformation-Discriminating Generative Model for Few Shot Anomaly Detection | Shelly Sheynin · , · Sagie Benaim · , · Lior Wolf | N/A | |
| MVTN: Multi-View Transformation Network for 3D Shape Recognition | Abdullah Hamdi · , · Silvio Giancola · , · Bernard Ghanem | N/A | |
| GNeRF: GAN-Based Neural Radiance Field Without Posed Camera | Quan Meng · , · Anpei Chen · , · Haimin Luo · , · Minye Wu · , · Hao Su · , · Lan Xu · , · Xuming He · , · Jingyi Yu | N/A | |
| ODAM: Object Detection, Association, and Mapping Using Posed RGB Video | Kejie Li, Daniel DeTone, Yu Fan (Steven) Chen, Minh Vo, Ian Reid, Hamid Rezatofighi, Chris Sweeney, Julian Straub, Richard Newcombe | N/A | |
| Learning Specialized Activation Functions With the Piecewise Linear Unit | Yucong Zhou, Zezhou Zhu, Zhao Zhong | N/A | |
| Viewpoint Invariant Dense Matching for Visual Geolocalization | Gabriele Berton, Carlo Masone, Valerio Paolicelli, Barbara Caputo | N/A | |
| Dual Contrastive Loss and Attention for GANs | Ning Yu, Guilin Liu, Aysegul Dundar, Andrew Tao, Bryan Catanzaro, Larry S. Davis, Mario Fritz | N/A | |
| Video Autoencoder: Self-Supervised Disentanglement of Static 3D Structure and Motion | Zihang Lai, Sifei Liu, Alexei A. Efros, Xiaolong Wang | N/A | |
| Adaptive Convolutions With Per-Pixel Dynamic Filter Atom | Ze Wang, Zichen Miao, Jun Hu, Qiang Qiu | N/A | |
| Video Pose Distillation for Few-Shot, Fine-Grained Sports Action Recognition | James Hong, Matthew Fisher, Michaël Gharbi, Kayvon Fatahalian | N/A | |
| Else-Net: Elastic Semantic Network for Continual Action Recognition From Skeleton Data | Tianjiao Li, Qiuhong Ke, Hossein Rahmani, Rui En Ho, Henghui Ding, Jun Liu | N/A | |
| Low-Shot Validation: Active Importance Sampling for Estimating Classifier Performance on Rare Categories | Fait Poms, Vishnu Sarukkai, Ravi Teja Mullapudi, Nimit S. Sohoni, William R. Mark, Deva Ramanan, Kayvon Fatahalian | N/A | |
| Deep Matching Prior: Test-Time Optimization for Dense Correspondence | Sunghwan Hong, Seungryong Kim | N/A | |
| DualPoseNet: Category-Level 6D Object Pose and Size Estimation Using Dual Pose Network With Refined Learning of Pose Consistency | Jiehong Lin, Zewei Wei, Zhihao Li, Songcen Xu, Kui Jia, Yuanqing Li | N/A | |
| MDETR - Modulated Detection for End-to-End Multi-Modal Understanding | Aishwarya Kamath, Mannat Singh, Yann LeCun, Gabriel Synnaeve, Ishan Misra, Nicolas Carion | N/A | |
| Calibrated and Partially Calibrated Semi-Generalized Homographies | Snehal Bhayani, Torsten Sattler, Daniel Barath, Patrik Beliansky, Janne Heikkilä, Zuzana Kukelova | N/A | |
| End-to-End Video Instance Segmentation via Spatial-Temporal Graph Neural Networks | Tao Wang, Ning Xu, Kean Chen, Weiyao Lin | N/A | |
| The Surprising Impact of Mask-Head Architecture on Novel Class Segmentation | Vighnesh Birodkar, Zhichao Lu, Siyang Li, Vivek Rathod, Jonathan Huang | N/A | |
| The Spatio-Temporal Poisson Point Process: A Simple Model for the Alignment of Event Camera Data | Cheng Gu, Erik Learned-Miller, Daniel Sheldon, Guillermo Gallego, Pia Bideau | N/A | |
| Learning Self-Similarity in Space and Time As Generalized Motion for Video Action Recognition | Heeseung Kwon, Manjin Kim, Suha Kwak, Minsu Cho | N/A | |
| Collaborative Optimization and Aggregation for Decentralized Domain Generalization and Adaptation | Guile Wu, Shaogang Gong | N/A | |
| CR-Fill: Generative Image Inpainting With Auxiliary Contextual Reconstruction | Yu Zeng, Zhe Lin, Huchuan Lu, Vishal M. Patel | N/A | |
| LookOut: Diverse Multi-Future Prediction and Planning for Self-Driving | Alexander Cui, Sergio Casas, Abbas Sadat, Renjie Liao, Raquel Urtasun | N/A | |
| Causal Attention for Unbiased Visual Recognition | Tan Wang, Chang Zhou, Qianru Sun, Hanwang Zhang | N/A | |
| EC-DARTS: Inducing Equalized and Consistent Optimization Into DARTS | Qinqin Zhou, Xiawu Zheng, Liujuan Cao, Bineng Zhong, Teng Xi, Gang Zhang, Errui Ding, Mingliang Xu, Rongrong Ji | N/A | |
| Detecting Persuasive Atypicality by Modeling Contextual Compatibility | Meiqi Guo, Rebecca Hwa, Adriana Kovashka | N/A | |
| Warp-Refine Propagation: Semi-Supervised Auto-Labeling via Cycle-Consistency | Aditya Ganeshan, Alexis Vallet, Yasunori Kudo, Shin-ichi Maeda, Tommi Kerola, Rares Ambrus, Dennis Park, Adrien Gaidon | N/A | |
| ILVR: Conditioning Method for Denoising Diffusion Probabilistic Models | Jooyoung Choi, Sungwon Kim, Yonghyun Jeong, Youngjune Gwon, Sungroh Yoon | N/A | |
| STVGBert: A Visual-Linguistic Transformer Based Framework for Spatio-Temporal Video Grounding | Rui Su, Qian Yu, Dong Xu | N/A | |
| Universal Representation Learning From Multiple Domains for Few-Shot Classification | Wei-Hong Li, Xialei Liu, Hakan Bilen | N/A | |
| Pseudo-Loss Confidence Metric for Semi-Supervised Few-Shot Learning | Kai Huang, Jie Geng, Wen Jiang, Xinyang Deng, Zhe Xu | N/A | |
| Learning Dual Priors for JPEG Compression Artifacts Removal | Xueyang Fu, Xi Wang, Aiping Liu, Junwei Han, Zheng-Jun Zha | N/A | |
| Searching for Two-Stream Models in Multivariate Space for Video Recognition | Xinyu Gong, Heng Wang, Mike Zheng Shou, Matt Feiszli, Zhangyang Wang, Zhicheng Yan | N/A | |
| Refining Activation Downsampling With SoftPool | Alexandros Stergiou, Ronald Poppe, Grigorios Kalliatakis | N/A | |
| Neural-GIF: Neural Generalized Implicit Functions for Animating People in Clothing | Garvita Tiwari, Nikolaos Sarafianos, Tony Tung, Gerard Pons-Moll | N/A | |
| Learning Multiple Pixelwise Tasks Based on Loss Scale Balancing | Jae-Han Lee, Chul Lee, Chang-Su Kim | N/A | |
| MLVSNet: Multi-Level Voting Siamese Network for 3D Visual Tracking | Zhoutao Wang, Qian Xie, Yu-Kun Lai, Jing Wu, Kun Long, Jun Wang | N/A | |
| ACE: Ally Complementary Experts for Solving Long-Tailed Recognition in One-Shot | Jiarui Cai, Yizhou Wang, Jenq-Neng Hwang | N/A | |
| Hyperspectral Image Denoising With Realistic Data | Tao Zhang, Ying Fu, Cheng Li | N/A | |
| Collaborative Learning With Disentangled Features for Zero-Shot Domain Adaptation | Won Young Jhoo, Jae-Pil Heo | N/A | |
| Rethinking Noise Synthesis and Modeling in Raw Denoising | Yi Zhang, Hongwei Qin, Xiaogang Wang, Hongsheng Li | N/A | |
| Disentangled Representation for Age-Invariant Face Recognition: A Mutual Information Minimization Perspective | Xuege Hou, Yali Li, Shengjin Wang | N/A | |
| Contact-Aware Retargeting of Skinned Motion | Ruben Villegas, Duygu Ceylan, Aaron Hertzmann, Jimei Yang, Jun Saito | N/A | |
| Box-Aware Feature Enhancement for Single Object Tracking on Point Clouds | Chaoda Zheng, Xu Yan, Jiantao Gao, Weibing Zhao, Wei Zhang, Zhen Li, Shuguang Cui | N/A | |
| FATNN: Fast and Accurate Ternary Neural Networks | Peng Chen, Bohan Zhuang, Chunhua Shen | N/A | |
| Multitask AET With Orthogonal Tangent Regularity for Dark Object Detection | Ziteng Cui, Guo-Jun Qi, Lin Gu, Shaodi You, Zenghui Zhang, Tatsuya Harada | N/A | |
| Field-Guide-Inspired Zero-Shot Learning | Utkarsh Mall, Bharath Hariharan, Kavita Bala | N/A | |
| Contrastive Attention Maps for Self-Supervised Co-Localization | Minsong Ki, Youngjung Uh, Junsuk Choe, Hyeran Byun | N/A | |
| ORBIT: A Real-World Few-Shot Dataset for Teachable Object Recognition | Daniela Massiceti, Luisa Zintgraf, John Bronskill, Lida Theodorou, Matthew Tobias Harris, Edward Cutrell, Cecily Morrison, Katja Hofmann, Simone Stumpf | N/A | |
| Domain Generalization via Gradient Surgery | Lucas Mansilla, Rodrigo Echeveste, Diego H. Milone, Enzo Ferrante | N/A | |
| Semi-Supervised Single-Stage Controllable GANs for Conditional Fine-Grained Image Generation | Tianyi Chen, Yi Liu, Yunfei Zhang, Si Wu, Yong Xu, Feng Liangbing, Hau San Wong | N/A | |
| Partner-Assisted Learning for Few-Shot Image Classification | Jiawei Ma, Hanchen Xie, Guangxing Han, Shih-Fu Chang, Aram Galstyan, Wael Abd-Almageed | N/A | |
| Contrastive Coding for Active Learning Under Class Distribution Mismatch | Pan Du, Suyun Zhao, Hui Chen, Shuwen Chai, Hong Chen, Cuiping Li | N/A | |
| Partial Video Domain Adaptation With Partial Adversarial Temporal Attentive Network | Yuecong Xu, Jianfei Yang, Haozhi Cao, Zhenghua Chen, Qi Li, Kezhi Mao | N/A | |
| Personalized and Invertible Face De-Identification by Disentangled Identity Information Manipulation | Jingyi Cao, Bo Liu, Yunqian Wen, Rong Xie, Li Song | N/A | |
| Learning Compatible Embeddings | Qiang Meng, Chixiang Zhang, Xiaoqiang Xu, Feng Zhou | N/A | |
| Seasonal Contrast: Unsupervised Pre-Training From Uncurated Remote Sensing Data | Oscar Mañas, Alexandre Lacoste, Xavier Giró-i-Nieto, David Vazquez, Pau Rodríguez | N/A | |
| Explain Me the Painting: Multi-Topic Knowledgeable Art Description Generation | Zechen Bai, Yuta Nakashima, Noa Garcia | N/A | |
| Unsupervised Curriculum Domain Adaptation for No-Reference Video Quality Assessment | Pengfei Chen, Leida Li, Jinjian Wu, Weisheng Dong, Guangming Shi | N/A | |
| GTT-Net: Learned Generalized Trajectory Triangulation | Xiangyu Xu, Enrique Dunn | N/A | |
| Contrastive Learning of Image Representations With Cross-Video Cycle-Consistency | Haiping Wu, Xiaolong Wang | N/A | |
| Learning To Remove Refractive Distortions From Underwater Images | Simron Thapa, Nianyi Li, Jinwei Ye | N/A | |
| BARF: Bundle-Adjusting Neural Radiance Fields | Chen-Hsuan Lin, Wei-Chiu Ma, Antonio Torralba, Simon Lucey | N/A | |
| Seeking Similarities Over Differences: Similarity-Based Domain Alignment for Adaptive Object Detection | Farzaneh Rezaeianaran, Rakshith Shetty, Rahaf Aljundi, Daniel Olmeda Reino, Shanshan Zhang, Bernt Schiele | N/A | |
| LapsCore: Language-Guided Person Search via Color Reasoning | Yushuang Wu, Zizheng Yan, Xiaoguang Han, Guanbin Li, Changqing Zou, Shuguang Cui | N/A | |
| Deep Permutation Equivariant Structure From Motion | Dror Moran, Hodaya Koslowsky, Yoni Kasten, Haggai Maron, Meirav Galun, Ronen Basri | N/A | |
| Collaborative Unsupervised Visual Representation Learning From Decentralized Data | Weiming Zhuang, Xin Gan, Yonggang Wen, Shuai Zhang, Shuai Yi | N/A | |
| DeepPRO: Deep Partial Point Cloud Registration of Objects | Donghoon Lee, Onur C. Hamsici, Steven Feng, Prachee Sharma, Thorsten Gernoth | N/A | |
| RECALL: Replay-Based Continual Learning in Semantic Segmentation | Andrea Maracani, Umberto Michieli, Marco Toldo, Pietro Zanuttigh | N/A | |
| Extending Neural P-Frame Codecs for B-Frame Coding | Reza Pourreza, Taco Cohen | N/A | |
| HAIR: Hierarchical Visual-Semantic Relational Reasoning for Video Question Answering | Fei Liu, Jing Liu, Weining Wang, Hanqing Lu | N/A | |
| Ensemble Attention Distillation for Privacy-Preserving Federated Learning | Xuan Gong, Abhishek Sharma, Srikrishna Karanam, Ziyan Wu, Terrence Chen, David Doermann, Arun Innanje | N/A | |
| Voxel Transformer for 3D Object Detection | Jiageng Mao, Yujing Xue, Minzhe Niu, Haoyue Bai, Jiashi Feng, Xiaodan Liang, Hang Xu, Chunjing Xu | N/A | |
| Out-of-Boundary View Synthesis Towards Full-Frame Video Stabilization | Yufei Xu, Jing Zhang, Dacheng Tao | N/A | |
| Multimodal Clustering Networks for Self-Supervised Learning From Unlabeled Videos | Brian Chen, Andrew Rouditchenko, Kevin Duarte, Hilde Kuehne, Samuel Thomas, Angie Boggust, Rameswar Panda, Brian Kingsbury, Rogerio Feris, David Harwath, James Glass, Michael Picheny, Shih-Fu Chang | N/A | |
| Towards a Universal Model for Cross-Dataset Crowd Counting | Zhiheng Ma, Xiaopeng Hong, Xing Wei, Yunfeng Qiu, Yihong Gong | N/A | |
| GAN Inversion for Out-of-Range Images With Geometric Transformations | Kyoungkook Kang, Seongtae Kim, Sunghyun Cho | N/A | |
| Scaling Up Instance Annotation via Label Propagation | Dim P. Papadopoulos, Ethan Weber, Antonio Torralba | N/A | |
| Learning RAW-to-sRGB Mappings With Inaccurately Aligned Supervision | Zhilu Zhang, Haolin Wang, Ming Liu, Ruohao Wang, Jiawei Zhang, Wangmeng Zuo | N/A | |
| Context Reasoning Attention Network for Image Super-Resolution | Yulun Zhang, Donglai Wei, Can Qin, Huan Wang, Hanspeter Pfister, Yun Fu | N/A | |
| FastNeRF: High-Fidelity Neural Rendering at 200FPS | Stephan J. Garbin, Marek Kowalski, Matthew Johnson, Jamie Shotton, Julien Valentin | N/A | |
| 3DeepCT: Learning Volumetric Scattering Tomography of Clouds | Yael Sde-Chen, Yoav Y. Schechner, Vadim Holodovsky, Eshkol Eytan | N/A | |
| EvIntSR-Net: Event Guided Multiple Latent Frames Reconstruction and Super-Resolution | Jin Han, Yixin Yang, Chu Zhou, Chao Xu, Boxin Shi | N/A | |
| Deep Structured Instance Graph for Distilling Object Detectors | Yixin Chen, Pengguang Chen, Shu Liu, Liwei Wang, Jiaya Jia | N/A | |
| Adversarial Attacks Are Reversible With Natural Supervision | Chengzhi Mao, Mia Chiquier, Hao Wang, Junfeng Yang, Carl Vondrick | N/A | |
| Hierarchical Graph Attention Network for Few-Shot Visual-Semantic Learning | Chengxiang Yin, Kun Wu, Zhengping Che, Bo Jiang, Zhiyuan Xu, Jian Tang | N/A | |
| Semantics Disentangling for Generalized Zero-Shot Learning | Zhi Chen, Yadan Luo, Ruihong Qiu, Sen Wang, Zi Huang, Jingjing Li, Zheng Zhang | N/A | |
| Space-Time-Separable Graph Convolutional Network for Pose Forecasting | Theodoros Sofianos, Alessio Sampieri, Luca Franco, Fabio Galasso | N/A | |
| 3D-FRONT: 3D Furnished Rooms With layOuts and semaNTics | Huan Fu, Bowen Cai, Lin Gao, Ling-Xiao Zhang, Jiaming Wang, Cao Li, Qixun Zeng, Chengyue Sun, Rongfei Jia, Binqiang Zhao, Hao Zhang | N/A | |
| Meta-Learning With Task-Adaptive Loss Function for Few-Shot Learning | Sungyong Baik, Janghoon Choi, Heewon Kim, Dohee Cho, Jaesik Min, Kyoung Mu Lee | N/A | |
| Learning To Track Objects From Unlabeled Videos | Jilai Zheng, Chao Ma, Houwen Peng, Xiaokang Yang | N/A | |
| (Just) A Spoonful of Refinements Helps the Registration Error Go Down | Sérgio Agostinho, Aljoša Ošep, Alessio Del Bue, Laura Leal-Taixé | N/A | |
| H2O: A Benchmark for Visual Human-Human Object Handover Analysis | Ruolin Ye, Wenqiang Xu, Zhendong Xue, Tutian Tang, Yanfeng Wang, Cewu Lu | N/A | |
| ECS-Net: Improving Weakly Supervised Semantic Segmentation by Using Connections Between Class Activation Maps | Kunyang Sun, Haoqing Shi, Zhengming Zhang, Yongming Huang | N/A | |
| Heterogeneous Relational Complement for Vehicle Re-Identification | Jiajian Zhao, Yifan Zhao, Jia Li, Ke Yan, Yonghong Tian | N/A | |
| Hierarchical Object-to-Zone Graph for Object Navigation | Sixian Zhang, Xinhang Song, Yubing Bai, Weijie Li, Yakui Chu, Shuqiang Jiang | N/A | |
| Information-Theoretic Regularization for Multi-Source Domain Adaptation | Geon Yeong Park, Sang Wan Lee | N/A | |
| Beyond Road Extraction: A Dataset for Map Update Using Aerial Images | Favyen Bastani, Samuel Madden | N/A | |
| Transfusion: A Novel SLAM Method Focused on Transparent Objects | Yifan Zhu, Jiaxiong Qiu, Bo Ren | N/A | |
| Physics-Based Human Motion Estimation and Synthesis From Videos | Kevin Xie, Tingwu Wang, Umar Iqbal, Yunrong Guo, Sanja Fidler, Florian Shkurti | N/A | |
| Hierarchical Memory Matching Network for Video Object Segmentation | Hongje Seong, Seoung Wug Oh, Joon-Young Lee, Seongwon Lee, Suhyeon Lee, Euntai Kim | N/A | |
| Pathdreamer: A World Model for Indoor Navigation | Jing Yu Koh, Honglak Lee, Yinfei Yang, Jason Baldridge, Peter Anderson | N/A | |
| Saliency-Associated Object Tracking | Zikun Zhou, Wenjie Pei, Xin Li, Hongpeng Wang, Feng Zheng, Zhenyu He | N/A | |
| Wanderlust: Online Continual Object Detection in the Real World | Jianren Wang, Xin Wang, Yue Shang-Guan, Abhinav Gupta | N/A | |
| Distilling Optimal Neural Networks: Rapid Search in Diverse Spaces | Bert Moons, Parham Noorzad, Andrii Skliar, Giovanni Mariani, Dushyant Mehta, Chris Lott, Tijmen Blankevoort | N/A | |
| Removing Adversarial Noise in Class Activation Feature Space | Dawei Zhou, Nannan Wang, Chunlei Peng, Xinbo Gao, Xiaoyu Wang, Jun Yu, Tongliang Liu | N/A | |
| Mutual Affine Network for Spatially Variant Kernel Estimation in Blind Image Super-Resolution | Jingyun Liang, Guolei Sun, Kai Zhang, Luc Van Gool, Radu Timofte | N/A | |
| Unifying Nonlocal Blocks for Neural Networks | Lei Zhu, Qi She, Duo Li, Yanye Lu, Xuejing Kang, Jie Hu, Changhu Wang | N/A | |
| Learning Realistic Human Reposing Using Cyclic Self-Supervision With 3D Shape, Pose, and Appearance Consistency | Soubhik Sanyal, Alex Vorobiov, Timo Bolkart, Matthew Loper, Betty Mohler, Larry S. Davis, Javier Romero, Michael J. Black | N/A | |
| MEDIRL: Predicting the Visual Attention of Drivers via Maximum Entropy Deep Inverse Reinforcement Learning | Sonia Baee, Erfan Pakdamanian, Inki Kim, Lu Feng, Vicente Ordonez, Laura Barnes | N/A | |
| Simpler Is Better: Few-Shot Semantic Segmentation With Classifier Weight Transformer | Zhihe Lu, Sen He, Xiatian Zhu, Li Zhang, Yi-Zhe Song, Tao Xiang | N/A | |
| Prediction by Anticipation: An Action-Conditional Prediction Method Based on Interaction Learning | Ershad Banijamali, Mohsen Rohani, Elmira Amirloo, Jun Luo, Pascal Poupart | N/A | |
| Scene Synthesis via Uncertainty-Driven Attribute Synchronization | Haitao Yang, Zaiwei Zhang, Siming Yan, Haibin Huang, Chongyang Ma, Yi Zheng, Chandrajit Bajaj, Qixing Huang | N/A | |
| Domain-Aware Universal Style Transfer | Kibeom Hong, Seogkyu Jeon, Huan Yang, Jianlong Fu, Hyeran Byun | N/A | |
| Adversarial Example Detection Using Latent Neighborhood Graph | Ahmed Abusnaina, Yuhang Wu, Sunpreet Arora, Yizhen Wang, Fei Wang, Hao Yang, David Mohaisen | N/A | |
| Dynamic View Synthesis From Dynamic Monocular Video | Chen Gao, Ayush Saraf, Johannes Kopf, Jia-Bin Huang | N/A | |
| Online Pseudo Label Generation by Hierarchical Cluster Dynamics for Adaptive Person Re-Identification | Yi Zheng, Shixiang Tang, Guolong Teng, Yixiao Ge, Kaijian Liu, Jing Qin, Donglian Qi, Dapeng Chen | N/A | |
| Learning To Match Features With Seeded Graph Matching Network | Hongkai Chen, Zixin Luo, Jiahui Zhang, Lei Zhou, Xuyang Bai, Zeyu Hu, Chiew-Lan Tai, Long Quan | N/A | |
| Aha! Adaptive History-Driven Attack for Decision-Based Black-Box Models | Jie Li, Rongrong Ji, Peixian Chen, Baochang Zhang, Xiaopeng Hong, Ruixin Zhang, Shaoxin Li, Jilin Li, Feiyue Huang, Yongjian Wu | N/A | |
| Anonymizing Egocentric Videos | Daksh Thapar, Aditya Nigam, Chetan Arora | N/A | |
| Modulated Graph Convolutional Network for 3D Human Pose Estimation | Zhiming Zou, Wei Tang | N/A | |
| Learning Self-Consistency for Deepfake Detection | Tianchen Zhao, Xiang Xu, Mingze Xu, Hui Ding, Yuanjun Xiong, Wei Xia | N/A | |
| PreDet: Large-Scale Weakly Supervised Pre-Training for Detection | Vignesh Ramanathan, Rui Wang, Dhruv Mahajan | N/A | |
| Real-Time Video Inference on Edge Devices via Adaptive Model Streaming | Mehrdad Khani, Pouya Hamadanian, Arash Nasr-Esfahany, Mohammad Alizadeh | N/A | |
| Learning Generative Models of Textured 3D Meshes From Real-World Images | Dario Pavllo, Jonas Kohler, Thomas Hofmann, Aurelien Lucchi | N/A | |
| Multi-Modality Associative Bridging Through Memory: Speech Sound Recollected From Face Video | Minsu Kim, Joanna Hong, Se Jin Park, Yong Man Ro | N/A | |
| Warp Consistency for Unsupervised Learning of Dense Correspondences | Prune Truong, Martin Danelljan, Fisher Yu, Luc Van Gool | N/A | |
| Towards Rotation Invariance in Object Detection | Agastya Kalra, Guy Stoppi, Bradley Brown, Rishav Agarwal, Achuta Kadambi | N/A | |
| Unlocking the Potential of Ordinary Classifier: Class-Specific Adversarial Erasing Framework for Weakly Supervised Semantic Segmentation | Hyeokjun Kweon, Sung-Hoon Yoon, Hyeonseong Kim, Daehee Park, Kuk-Jin Yoon | N/A | |
| Sparse-to-Dense Feature Matching: Intra and Inter Domain Cross-Modal Learning in Domain Adaptation for 3D Semantic Segmentation | Duo Peng, Yinjie Lei, Wen Li, Pingping Zhang, Yulan Guo | N/A | |
| Toward Spatially Unbiased Generative Models | Jooyoung Choi, Jungbeom Lee, Yonghyun Jeong, Sungroh Yoon | N/A | |
| ReconfigISP: Reconfigurable Camera Image Processing Pipeline | Ke Yu, Zexian Li, Yue Peng, Chen Change Loy, Jinwei Gu | N/A | |
| Multi-Expert Adversarial Attack Detection in Person Re-Identification Using Context Inconsistency | Xueping Wang, Shasha Li, Min Liu, Yaonan Wang, Amit K. Roy-Chowdhury | N/A | |
| Video Instance Segmentation With a Propose-Reduce Paradigm | Huaijia Lin, Ruizheng Wu, Shu Liu, Jiangbo Lu, Jiaya Jia | N/A | |
| Divide-and-Assemble: Learning Block-Wise Memory for Unsupervised Anomaly Detection | Jinlei Hou, Yingying Zhang, Qiaoyong Zhong, Di Xie, Shiliang Pu, Hong Zhou | N/A | |
| Dense Deep Unfolding Network With 3D-CNN Prior for Snapshot Compressive Imaging | Zhuoyuan Wu, Jian Zhang, Chong Mou | N/A | |
| SelfReg: Self-Supervised Contrastive Regularization for Domain Generalization | Daehee Kim, Youngjun Yoo, Seunghyun Park, Jinkyu Kim, Jaekoo Lee | N/A | |
| Towards the Unseen: Iterative Text Recognition by Distilling From Errors | Ayan Kumar Bhunia, Pinaki Nath Chowdhury, Aneeshan Sain, Yi-Zhe Song | N/A | |
| SA-ConvONet: Sign-Agnostic Optimization of Convolutional Occupancy Networks | Jiapeng Tang, Jiabao Lei, Dan Xu, Feiying Ma, Kui Jia, Lei Zhang | N/A | |
| Rethinking Spatial Dimensions of Vision Transformers | Byeongho Heo, Sangdoo Yun, Dongyoon Han, Sanghyuk Chun, Junsuk Choe, Seong Joon Oh | N/A | |
| Move2Hear: Active Audio-Visual Source Separation | Sagnik Majumder, Ziad Al-Halah, Kristen Grauman | N/A | |
| VIL-100: A New Dataset and a Baseline Model for Video Instance Lane Detection | Yujun Zhang, Lei Zhu, Wei Feng, Huazhu Fu, Mingqian Wang, Qingxia Li, Cheng Li, Song Wang | N/A | |
| Revitalizing Optimization for 3D Human Pose and Shape Estimation: A Sparse Constrained Formulation | Taosha Fan, Kalyan Vasudev Alwala, Donglai Xiang, Weipeng Xu, Todd Murphey, Mustafa Mukadam | N/A | |
| AA-RMVSNet: Adaptive Aggregation Recurrent Multi-View Stereo Network | Zizhuang Wei, Qingtian Zhu, Chen Min, Yisong Chen, Guoping Wang | N/A | |
| ME-PCN: Point Completion Conditioned on Mask Emptiness | Bingchen Gong, Yinyu Nie, Yiqun Lin, Xiaoguang Han, Yizhou Yu | N/A | |
| Full-Body Motion From a Single Head-Mounted Device: Generating SMPL Poses From Partial Observations | Andrea Dittadi, Sebastian Dziadzio, Darren Cosker, Ben Lundell, Thomas J. Cashman, Jamie Shotton | N/A | |
| LOKI: Long Term and Key Intentions for Trajectory Prediction | Harshayu Girase, Haiming Gang, Srikanth Malla, Jiachen Li, Akira Kanehara, Karttikeya Mangalam, Chiho Choi | N/A | |
| Three Steps to Multimodal Trajectory Prediction: Modality Clustering, Classification and Synthesis | Jianhua Sun, Yuxuan Li, Hao-Shu Fang, Cewu Lu | N/A | |
| Orthogonal Jacobian Regularization for Unsupervised Disentanglement in Image Generation | Yuxiang Wei, Yupeng Shi, Xiao Liu, Zhilong Ji, Yuan Gao, Zhongqin Wu, Wangmeng Zuo | N/A | |
| SGMNet: Learning Rotation-Invariant Point Cloud Representations via Sorted Gram Matrix | Jianyun Xu, Xin Tang, Yushi Zhu, Jie Sun, Shiliang Pu | N/A | |
| Instance-Wise Hard Negative Example Generation for Contrastive Learning in Unpaired Image-to-Image Translation | Weilun Wang, Wengang Zhou, Jianmin Bao, Dong Chen, Houqiang Li | N/A | |
| Multi-Target Adversarial Frameworks for Domain Adaptation in Semantic Segmentation | Antoine Saporta, Tuan-Hung Vu, Matthieu Cord, Patrick Pérez | N/A | |
| SSH: A Self-Supervised Framework for Image Harmonization | Yifan Jiang, He Zhang, Jianming Zhang, Yilin Wang, Zhe Lin, Kalyan Sunkavalli, Simon Chen, Sohrab Amirghodsi, Sarah Kong, Zhangyang Wang | N/A | |
| Context-Aware Scene Graph Generation With Seq2Seq Transformers | Yichao Lu, Himanshu Rai, Jason Chang, Boris Knyazev, Guangwei Yu, Shashank Shekhar, Graham W. Taylor, Maksims Volkovs | N/A | |
| Rethinking the Backdoor Attacks' Triggers: A Frequency Perspective | Yi Zeng, Won Park, Z. Morley Mao, Ruoxi Jia | N/A | |
| Learning Multi-Scene Absolute Pose Regression With Transformers | Yoli Shavit, Ron Ferens, Yosi Keller | N/A | |
| Self-Supervised 3D Face Reconstruction via Conditional Estimation | Yandong Wen, Weiyang Liu, Bhiksha Raj, Rita Singh | N/A | |
| Training Multi-Object Detector by Estimating Bounding Box Distribution for Input Image | Jaeyoung Yoo, Hojun Lee, Inseop Chung, Geonseok Seo, Nojun Kwak | N/A | |
| Defocus Map Estimation and Deblurring From a Single Dual-Pixel Image | Shumian Xin, Neal Wadhwa, Tianfan Xue, Jonathan T. Barron, Pratul P. Srinivasan, Jiawen Chen, Ioannis Gkioulekas, Rahul Garg | N/A | |
| DivAug: Plug-In Automated Data Augmentation With Explicit Diversity Maximization | Zirui Liu, Haifeng Jin, Ting-Hsiang Wang, Kaixiong Zhou, Xia Hu | N/A | |
| VMNet: Voxel-Mesh Network for Geodesic-Aware 3D Semantic Segmentation | Zeyu Hu, Xuyang Bai, Jiaxiang Shang, Runze Zhang, Jiayu Dong, Xin Wang, Guangyuan Sun, Hongbo Fu, Chiew-Lan Tai | N/A | |
| FMODetect: Robust Detection of Fast Moving Objects | Denys Rozumnyi, Jiří Matas, Filip Šroubek, Marc Pollefeys, Martin R. Oswald | N/A | |
| VideoLT: Large-Scale Long-Tailed Video Recognition | Xing Zhang, Zuxuan Wu, Zejia Weng, Huazhu Fu, Jingjing Chen, Yu-Gang Jiang, Larry S. Davis | N/A | |
| Self-Supervised Monocular Depth Estimation for All Day Images Using Domain Separation | Lina Liu, Xibin Song, Mengmeng Wang, Yong Liu, Liangjun Zhang | N/A | |
| An Empirical Study of Training Self-Supervised Vision Transformers | Xinlei Chen, Saining Xie, Kaiming He | N/A | |
| RangeDet: In Defense of Range View for LiDAR-Based 3D Object Detection | Lue Fan, Xuan Xiong, Feng Wang, Naiyan Wang, ZhaoXiang Zhang | N/A | |
| Data-Free Universal Adversarial Perturbation and Black-Box Attack | Chaoning Zhang, Philipp Benz, Adil Karjauv, In So Kweon | N/A | |
| Learning To Hallucinate Examples From Extrinsic and Intrinsic Supervision | Liangke Gui, Adrien Bardes, Ruslan Salakhutdinov, Alexander Hauptmann, Martial Hebert, Yu-Xiong Wang | N/A | |
| Multiresolution Deep Implicit Functions for 3D Shape Representation | Zhang Chen, Yinda Zhang, Kyle Genova, Sean Fanello, Sofien Bouaziz, Christian Häne, Ruofei Du, Cem Keskin, Thomas Funkhouser, Danhang Tang | N/A | |
| Single-Shot Hyperspectral-Depth Imaging With Learned Diffractive Optics | Seung-Hwan Baek, Hayato Ikoma, Daniel S. Jeon, Yuqi Li, Wolfgang Heidrich, Gordon Wetzstein, Min H. Kim | N/A | |
| Discriminative Region-Based Multi-Label Zero-Shot Learning | Sanath Narayan, Akshita Gupta, Salman Khan, Fahad Shahbaz Khan, Ling Shao, Mubarak Shah | N/A | |
| FaPN: Feature-Aligned Pyramid Network for Dense Image Prediction | Shihua Huang, Zhichao Lu, Ran Cheng, Cheng He | N/A | |
| Personalized Trajectory Prediction via Distribution Discrimination | Guangyi Chen, Junlong Li, Nuoxing Zhou, Liangliang Ren, Jiwen Lu | N/A | |
| GridToPix: Training Embodied Agents With Minimal Supervision | Unnat Jain, Iou-Jen Liu, Svetlana Lazebnik, Aniruddha Kembhavi, Luca Weihs, Alexander G. Schwing | N/A | |
| On the Robustness of Vision Transformers to Adversarial Examples | Kaleel Mahmood, Rigel Mahmood, Marten van Dijk | N/A | |
| HiFT: Hierarchical Feature Transformer for Aerial Tracking | Ziang Cao, Changhong Fu, Junjie Ye, Bowen Li, Yiming Li | N/A | |
| Tokens-to-Token ViT: Training Vision Transformers From Scratch on ImageNet | Li Yuan, Yunpeng Chen, Tao Wang, Weihao Yu, Yujun Shi, Zi-Hang Jiang, Francis E.H. Tay, Jiashi Feng, Shuicheng Yan | N/A | |
| Teacher-Student Adversarial Depth Hallucination To Improve Face Recognition | Hardik Uppal, Alireza Sepas-Moghaddam, Michael Greenspan, Ali Etemad | N/A | |
| Interaction via Bi-Directional Graph of Semantic Region Affinity for Scene Parsing | Henghui Ding, Hui Zhang, Jun Liu, Jiaxin Li, Zijian Feng, Xudong Jiang | N/A | |
| Online Multi-Granularity Distillation for GAN Compression | Yuxi Ren, Jie Wu, Xuefeng Xiao, Jianchao Yang | N/A | |
| Influence-Balanced Loss for Imbalanced Visual Classification | Seulki Park, Jongin Lim, Younghan Jeon, Jin Young Choi | N/A | |
| Consistency-Aware Graph Network for Human Interaction Understanding | Zhenhua Wang, Jiajun Meng, Dongyan Guo, Jianhua Zhang, Javen Qinfeng Shi, Shengyong Chen | N/A | |
| Where2Act: From Pixels to Actions for Articulated 3D Objects | Kaichun Mo, Leonidas J. Guibas, Mustafa Mukadam, Abhinav Gupta, Shubham Tulsiani | N/A | |
| Towers of Babel: Combining Images, Language, and 3D Geometry for Learning Multimodal Vision | Xiaoshi Wu, Hadar Averbuch-Elor, Jin Sun, Noah Snavely | N/A | |
| End-to-End Unsupervised Document Image Blind Denoising | Mehrdad J. Gangeh, Marcin Plata, Hamid R. Motahari Nezhad, Nigel P Duffy | N/A | |
| Differentiable Dynamic Wirings for Neural Networks | Kun Yuan, Quanquan Li, Shaopeng Guo, Dapeng Chen, Aojun Zhou, Fengwei Yu, Ziwei Liu | N/A | |
| A Simple Framework for 3D Lensless Imaging With Programmable Masks | Yucheng Zheng, Yi Hua, Aswin C. Sankaranarayanan, M. Salman Asif | N/A | |
| Dressing in Order: Recurrent Person Image Generation for Pose Transfer, Virtual Try-On and Outfit Editing | Aiyu Cui, Daniel McKee, Svetlana Lazebnik | N/A | |
| Attack As the Best Defense: Nullifying Image-to-Image Translation GANs via Limit-Aware Adversarial Attack | Chin-Yuan Yeh, Hsi-Wen Chen, Hong-Han Shuai, De-Nian Yang, Ming-Syan Chen | N/A | |
| Benchmark Platform for Ultra-Fine-Grained Visual Categorization Beyond Human Performance | Xiaohan Yu, Yang Zhao, Yongsheng Gao, Xiaohui Yuan, Shengwu Xiong | N/A | |
| JEM++: Improved Techniques for Training JEM | Xiulong Yang, Shihao Ji | N/A | |
| Contrast and Classify: Training Robust VQA Models | Yash Kant, Abhinav Moudgil, Dhruv Batra, Devi Parikh, Harsh Agrawal | N/A | |
| Photon-Starved Scene Inference Using Single Photon Cameras | Bhavya Goyal, Mohit Gupta | N/A | |
| Towards Learning Spatially Discriminative Feature Representations | Chaofei Wang, Jiayu Xiao, Yizeng Han, Qisen Yang, Shiji Song, Gao Huang | N/A | |
| Pyramid Spatial-Temporal Aggregation for Video-Based Person Re-Identification | Yingquan Wang, Pingping Zhang, Shang Gao, Xia Geng, Hu Lu, Dong Wang | N/A | |
| Context Decoupling Augmentation for Weakly Supervised Semantic Segmentation | Yukun Su, Ruizhou Sun, Guosheng Lin, Qingyao Wu | N/A | |
| CAPTRA: CAtegory-Level Pose Tracking for Rigid and Articulated Objects From Point Clouds | Yijia Weng, He Wang, Qiang Zhou, Yuzhe Qin, Yueqi Duan, Qingnan Fan, Baoquan Chen, Hao Su, Leonidas J. Guibas | N/A | |
| X-World: Accessibility, Vision, and Autonomy Meet | Jimuyang Zhang, Minglan Zheng, Matthew Boyd, Eshed Ohn-Bar | N/A | |
| Target Adaptive Context Aggregation for Video Scene Graph Generation | Yao Teng, Limin Wang, Zhifeng Li, Gangshan Wu | N/A | |
| Learnable Boundary Guided Adversarial Training | Jiequan Cui, Shu Liu, Liwei Wang, Jiaya Jia | N/A | |
| Memory-Augmented Dynamic Neural Relational Inference | Dong Gong, Frederic Z. Zhang, Javen Qinfeng Shi, Anton van den Hengel | N/A | |
| Physics-Based Differentiable Depth Sensor Simulation | Benjamin Planche, Rajat Vikram Singh | N/A | |
| Temporal Action Detection With Multi-Level Supervision | Baifeng Shi, Qi Dai, Judy Hoffman, Kate Saenko, Trevor Darrell, Huijuan Xu | N/A | |
| FACIAL: Synthesizing Dynamic Talking Face With Implicit Attribute Learning | Chenxu Zhang, Yifan Zhao, Yifei Huang, Ming Zeng, Saifeng Ni, Madhukar Budagavi, Xiaohu Guo | N/A | |
| Unsupervised Deep Video Denoising | Dev Yashpal Sheth, Sreyas Mohan, Joshua L. Vincent, Ramon Manzorro, Peter A. Crozier, Mitesh M. Khapra, Eero P. Simoncelli, Carlos Fernandez-Granda | N/A | |
| Making Higher Order MOT Scalable: An Efficient Approximate Solver for Lifted Disjoint Paths | Andrea Hornakova, Timo Kaiser, Paul Swoboda, Michal Rolinek, Bodo Rosenhahn, Roberto Henschel | N/A | |
| TMCOSS: Thresholded Multi-Criteria Online Subset Selection for Data-Efficient Autonomous Driving | Soumi Das, Harikrishna Patibandla, Suparna Bhattacharya, Kshounis Bera, Niloy Ganguly, Sourangshu Bhattacharya | N/A | |
| Efficient Visual Pretraining With Contrastive Detection | Olivier J. Hénaff, Skanda Koppula, Jean-Baptiste Alayrac, Aaron van den Oord, Oriol Vinyals, João Carreira | N/A | |
| Exploiting Scene Graphs for Human-Object Interaction Detection | Tao He, Lianli Gao, Jingkuan Song, Yuan-Fang Li | N/A | |
| Multi-VAE: Learning Disentangled View-Common and View-Peculiar Visual Representations for Multi-View Clustering | Jie Xu, Yazhou Ren, Huayi Tang, Xiaorong Pu, Xiaofeng Zhu, Ming Zeng, Lifang He | N/A | |
| TF-Blender: Temporal Feature Blender for Video Object Detection | Yiming Cui, Liqi Yan, Zhiwen Cao, Dongfang Liu | N/A | |
| Adversarial Robustness for Unsupervised Domain Adaptation | Muhammad Awais, Fengwei Zhou, Hang Xu, Lanqing Hong, Ping Luo, Sung-Ho Bae, Zhenguo Li | N/A | |
| Discovering 3D Parts From Image Collections | Chun-Han Yao · , · Wei-Chih Hung · , · Varun Jampani · , · Ming-Hsuan Yang | N/A | |
| ICE: Inter-Instance Contrastive Encoding for Unsupervised Person Re-Identification | Hao Chen · , · Benoit Lagadec · , · François Bremond | N/A | |
| PIRenderer: Controllable Portrait Image Generation via Semantic Neural Rendering | Yurui Ren · , · Ge Li · , · Yuanqi Chen · , · Thomas H. Li · , · Shan Liu | N/A | |
| Toward Human-Like Grasp: Dexterous Grasping via Semantic Representation of Object-Hand | Tianqiang Zhu · , · Rina Wu · , · Xiangbo Lin · , · Yi Sun | N/A | |
| MAAS: Multi-Modal Assignation for Active Speaker Detection | Juan Léon Alcázar · , · Fabian Caba · , · Ali K. Thabet · , · Bernard Ghanem | N/A | |
| Multi-Source Domain Adaptation for Object Detection | Xingxu Yao · , · Sicheng Zhao · , · Pengfei Xu · , · Jufeng Yang | N/A | |
| Learning Conditional Knowledge Distillation for Degraded-Reference Image Quality Assessment | Heliang Zheng · , · Huan Yang · , · Jianlong Fu · , · Zheng-Jun Zha · , · Jiebo Luo | N/A | |
| ShapeConv: Shape-Aware Convolutional Layer for Indoor RGB-D Semantic Segmentation | Jinming Cao · , · Hanchao Leng · , · Dani Lischinski · , · Daniel Cohen-Or · , · Changhe Tu · , · Yangyan Li | N/A | |
| GLoRIA: A Multimodal Global-Local Representation Learning Framework for Label-Efficient Medical Image Recognition | Shih-Cheng Huang · , · Liyue Shen · , · Matthew P. Lungren · , · Serena Yeung | N/A | |
| Summarize and Search: Learning Consensus-Aware Dynamic Convolution for Co-Saliency Detection | Ni Zhang · , · Junwei Han · , · Nian Liu · , · Ling Shao | N/A | |
| Visual Distant Supervision for Scene Graph Generation | Yuan Yao · , · Ao Zhang · , · Xu Han · , · Mengdi Li · , · Cornelius Weber · , · Zhiyuan Liu · , · Stefan Wermter · , · Maosong Sun | N/A | |
| Viewpoint-Agnostic Change Captioning With Cycle Consistency | Hoeseong Kim · , · Jongseok Kim · , · Hyungseok Lee · , · Hyunsung Park · , · Gunhee Kim | N/A | |
| Neural Video Portrait Relighting in Real-Time via Consistency Modeling | Longwen Zhang · , · Qixuan Zhang · , · Minye Wu · , · Jingyi Yu · , · Lan Xu | N/A | |
| Image Shape Manipulation From a Single Augmented Training Sample | Yael Vinker · , · Eliahu Horwitz · , · Nir Zabari · , · Yedid Hoshen | N/A | |
| SNARF: Differentiable Forward Skinning for Animating Non-Rigid Neural Implicit Shapes | Xu Chen · , · Yufeng Zheng · , · Michael J. Black · , · Otmar Hilliges · , · Andreas Geiger | N/A | |
| Curvature Generation in Curved Spaces for Few-Shot Learning | Zhi Gao · , · Yuwei Wu · , · Yunde Jia · , · Mehrtash Harandi | N/A | |
| Single Image 3D Shape Retrieval via Cross-Modal Instance and Category Contrastive Learning | Ming-Xian Lin · , · Jie Yang · , · He Wang · , · Yu-Kun Lai · , · Rongfei Jia · , · Binqiang Zhao · , · Lin Gao | N/A | |
| Omnidata: A Scalable Pipeline for Making Multi-Task Mid-Level Vision Datasets From 3D Scans | Ainaz Eftekhar · , · Alexander Sax · , · Jitendra Malik · , · Amir Zamir | N/A | |
| Single View Physical Distance Estimation Using Human Pose | Xiaohan Fei · , · Henry Wang · , · Lin Lee Cheong · , · Xiangyu Zeng · , · Meng Wang · , · Joseph Tighe | N/A | |
| Few-Shot Semantic Segmentation With Cyclic Memory Network | Guo-Sen Xie · , · Huan Xiong · , · Jie Liu · , · Yazhou Yao · , · Ling Shao | N/A | |
| Weakly-Supervised Action Segmentation and Alignment via Transcript-Aware Union-of-Subspaces Learning | Zijia Lu · , · Ehsan Elhamifar | N/A | |
| Not All Operations Contribute Equally: Hierarchical Operation-Adaptive Predictor for Neural Architecture Search | Ziye Chen · , · Yibing Zhan · , · Baosheng Yu · , · Mingming Gong · , · Bo Du | N/A | |
| SOTR: Segmenting Objects With Transformers | Ruohao Guo · , · Dantong Niu · , · Liao Qu · , · Zhenbo Li | N/A | |
| Adaptive Surface Normal Constraint for Depth Estimation | Xiaoxiao Long · , · Cheng Lin · , · Lingjie Liu · , · Wei Li · , · Christian Theobalt · , · Ruigang Yang · , · Wenping Wang | N/A | |
| Enriching Local and Global Contexts for Temporal Action Localization | Zixin Zhu · , · Wei Tang · , · Le Wang · , · Nanning Zheng · , · Gang Hua | N/A | |
| Hypergraph Neural Networks for Hypergraph Matching | Xiaowei Liao · , · Yong Xu · , · Haibin Ling | N/A | |
| DRAEM - A Discriminatively Trained Reconstruction Embedding for Surface Anomaly Detection | Vitjan Zavrtanik · , · Matej Kristan · , · Danijel Skočaj | N/A | |
| Gaussian Fusion: Accurate 3D Reconstruction via Geometry-Guided Displacement Interpolation | Duo Chen · , · Zixin Tang · , · Zhenyu Xu · , · Yunan Zheng · , · Yiguang Liu | N/A | |
| Frequency-Aware Spatiotemporal Transformers for Video Inpainting Detection | Bingyao Yu · , · Wanhua Li · , · Xiu Li · , · Jiwen Lu · , · Jie Zhou | N/A | |
| Virtual Multi-Modality Self-Supervised Foreground Matting for Human-Object Interaction | Bo Xu · , · Han Huang · , · Cheng Lu · , · Ziwen Li · , · Yandong Guo | N/A | |
| Mutual Supervision for Dense Object Detection | Ziteng Gao · , · Limin Wang · , · Gangshan Wu | N/A | |
| Orthographic-Perspective Epipolar Geometry | Viktor Larsson · , · Marc Pollefeys · , · Magnus Oskarsson | N/A | |
| Large Scale Interactive Motion Forecasting for Autonomous Driving: The Waymo Open Motion Dataset | Scott Ettinger · , · Shuyang Cheng · , · Benjamin Caine · , · Chenxi Liu · , · Hang Zhao · , · Sabeek Pradhan · , · Yuning Chai · , · Ben Sapp · , · Charles R. Qi · , · Yin Zhou · , · Zoey Yang · , · Aurélien Chouard · , · Pei Sun · , · Jiquan Ngiam · , · Vijay Vasudevan · , · Alexander McCauley · , · Jonathon Shlens · , · Dragomir Anguelov | N/A | |
| Seminar Learning for Click-Level Weakly Supervised Semantic Segmentation | Hongjun Chen · , · Jinbao Wang · , · Hong Cai Chen · , · Xiantong Zhen · , · Feng Zheng · , · Rongrong Ji · , · Ling Shao | N/A | |
| Retrieve in Style: Unsupervised Facial Feature Transfer and Retrieval | Min Jin Chong · , · Wen-Sheng Chu · , · Abhishek Kumar · , · David Forsyth | N/A | |
| Rethinking and Improving Relative Position Encoding for Vision Transformer | Kan Wu · , · Houwen Peng · , · Minghao Chen · , · Jianlong Fu · , · Hongyang Chao | N/A | |
| Meta-Aggregator: Learning To Aggregate for 1-Bit Graph Neural Networks | Yongcheng Jing · , · Yiding Yang · , · Xinchao Wang · , · Mingli Song · , · Dacheng Tao | N/A | |
| STRIVE: Scene Text Replacement in Videos | Vijay Kumar B G · , · Jeyasri Subramanian · , · Varnith Chordia · , · Eugene Bart · , · Shaobo Fang · , · Kelly Guan · , · Raja Bala | N/A | |
| Disentangled High Quality Salient Object Detection | Lv Tang · , · Bo Li · , · Yijie Zhong · , · Shouhong Ding · , · Mofei Song | N/A | |
| FREE: Feature Refinement for Generalized Zero-Shot Learning | Shiming Chen · , · Wenjie Wang · , · Beihao Xia · , · Qinmu Peng · , · Xinge You · , · Feng Zheng · , · Ling Shao | N/A | |
| Hypersim: A Photorealistic Synthetic Dataset for Holistic Indoor Scene Understanding | Mike Roberts · , · Jason Ramapuram · , · Anurag Ranjan · , · Atulit Kumar · , · Miguel Angel Bautista · , · Nathan Paczan · , · Russ Webb · , · Joshua M. Susskind | N/A | |
| Self-Supervised Object Detection via Generative Image Synthesis | Siva Karthik Mustikovela · , · Shalini De Mello · , · Aayush Prakash · , · Umar Iqbal · , · Sifei Liu · , · Thu Nguyen-Phuoc · , · Carsten Rother · , · Jan Kautz | N/A | |
| Action-Conditioned 3D Human Motion Synthesis With Transformer VAE | Mathis Petrovich · , · Michael J. Black · , · Gül Varol | N/A | |
| Why Approximate Matrix Square Root Outperforms Accurate SVD in Global Covariance Pooling? | Yue Song · , · Nicu Sebe · , · Wei Wang | N/A | |
| SUNet: Symmetric Undistortion Network for Rolling Shutter Correction | Bin Fan · , · Yuchao Dai · , · Mingyi He | N/A | |
| DWKS: A Local Descriptor of Deformations Between Meshes and Point Clouds | Robin Magnet · , · Maks Ovsjanikov | N/A | |
| PixelPyramids: Exact Inference Models From Lossless Image Pyramids | Shweta Mahajan · , · Stefan Roth | N/A | |
| Deep Blind Video Super-Resolution | Jinshan Pan · , · Haoran Bai · , · Jiangxin Dong · , · Jiawei Zhang · , · Jinhui Tang | N/A | |
| Deep Relational Metric Learning | Wenzhao Zheng · , · Borui Zhang · , · Jiwen Lu · , · Jie Zhou | N/A | |
| A Unified Objective for Novel Class Discovery | Enrico Fini · , · Enver Sangineto · , · Stéphane Lathuilière · , · Zhun Zhong · , · Moin Nabi · , · Elisa Ricci | N/A | |
| Provably Approximated Point Cloud Registration | Ibrahim Jubran · , · Alaa Maalouf · , · Ron Kimmel · , · Dan Feldman | N/A | |
| SAT: 2D Semantics Assisted Training for 3D Visual Grounding | Zhengyuan Yang · , · Songyang Zhang · , · Liwei Wang · , · Jiebo Luo | N/A | |
| Sample Efficient Detection and Classification of Adversarial Attacks via Self-Supervised Embeddings | Mazda Moayeri · , · Soheil Feizi | N/A | |
| Invisible Backdoor Attack With Sample-Specific Triggers | Yuezun Li · , · Yiming Li · , · Baoyuan Wu · , · Longkang Li · , · Ran He · , · Siwei Lyu | N/A | |
| Toward a Visual Concept Vocabulary for GAN Latent Space | Sarah Schwettmann · , · Evan Hernandez · , · David Bau · , · Samuel Klein · , · Jacob Andreas · , · Antonio Torralba | N/A | |
| Weakly Supervised 3D Semantic Segmentation Using Cross-Image Consensus and Inter-Voxel Affinity Relations | Xiaoyu Zhu · , · Jeffrey Chen · , · Xiangrui Zeng · , · Junwei Liang · , · Chengqi Li · , · Sinuo Liu · , · Sima Behpour · , · Min Xu | N/A | |
| Bootstrap Your Own Correspondences | Mohamed El Banani · , · Justin Johnson | N/A | |
| A Multi-Mode Modulator for Multi-Domain Few-Shot Classification | Yanbin Liu · , · Juho Lee · , · Linchao Zhu · , · Ling Chen · , · Humphrey Shi · , · Yi Yang | N/A | |
| SketchAA: Abstract Representation for Abstract Sketches | Lan Yang · , · Kaiyue Pang · , · Honggang Zhang · , · Yi-Zhe Song | N/A | |
| Detecting Human-Object Relationships in Videos | Jingwei Ji · , · Rishi Desai · , · Juan Carlos Niebles | N/A | |
| Adaptive Curriculum Learning | Yajing Kong · , · Liu Liu · , · Jun Wang · , · Dacheng Tao | N/A | |
| SurfaceNet: Adversarial SVBRDF Estimation From a Single Image | Giuseppe Vecchio · , · Simone Palazzo · , · Concetto Spampinato | N/A | |
| FloorPlanCAD: A Large-Scale CAD Drawing Dataset for Panoptic Symbol Spotting | Zhiwen Fan · , · Lingjie Zhu · , · Honghua Li · , · Xiaohao Chen · , · Siyu Zhu · , · Ping Tan | N/A | |
| TkML-AP: Adversarial Attacks to Top-k Multi-Label Learning | Shu Hu · , · Lipeng Ke · , · Xin Wang · , · Siwei Lyu | N/A | |
| Gradient Distribution Alignment Certificates Better Adversarial Domain Adaptation | Zhiqiang Gao · , · Shufei Zhang · , · Kaizhu Huang · , · Qiufeng Wang · , · Chaoliang Zhong | N/A | |
| Sparse-Shot Learning With Exclusive Cross-Entropy for Extremely Many Localisations | Andreas Panteli · , · Jonas Teuwen · , · Hugo Horlings · , · Efstratios Gavves | N/A | |
| Learning Latent Architectural Distribution in Differentiable Neural Architecture Search via Variational Information Maximization | Yaoming Wang · , · Yuchen Liu · , · Wenrui Dai · , · Chenglin Li · , · Junni Zou · , · Hongkai Xiong | N/A | |
| Motion Deblurring With Real Events | Fang Xu · , · Lei Yu · , · Bishan Wang · , · Wen Yang · , · Gui-Song Xia · , · Xu Jia · , · Zhendong Qiao · , · Jianzhuang Liu | N/A | |
| Episodic Transformer for Vision-and-Language Navigation | Alexander Pashevich · , · Cordelia Schmid · , · Chen Sun | N/A | |
| Change Is Everywhere: Single-Temporal Supervised Object Change Detection in Remote Sensing Imagery | Zhuo Zheng · , · Ailong Ma · , · Liangpei Zhang · , · Yanfei Zhong | N/A | |
| Visual Saliency Transformer | Nian Liu · , · Ni Zhang · , · Kaiyuan Wan · , · Ling Shao · , · Junwei Han | N/A | |
| Event-Based Video Reconstruction Using Transformer | Wenming Weng · , · Yueyi Zhang · , · Zhiwei Xiong | N/A | |
| Naturalistic Physical Adversarial Patch for Object Detectors | Yu-Chih-Tuan Hu · , · Bo-Han Kung · , · Daniel Stanley Tan · , · Jun-Cheng Chen · , · Kai-Lung Hua · , · Wen-Huang Cheng | N/A | |
| Attentive and Contrastive Learning for Joint Depth and Motion Field Estimation | Seokju Lee · , · Francois Rameau · , · Fei Pan · , · In So Kweon | N/A | |
| Graspness Discovery in Clutters for Fast and Accurate Grasp Detection | Chenxi Wang · , · Hao-Shu Fang · , · Minghao Gou · , · Hongjie Fang · , · Jin Gao · , · Cewu Lu | N/A | |
| Gradient Normalization for Generative Adversarial Networks | Yi-Lun Wu · , · Hong-Han Shuai · , · Zhi-Rui Tam · , · Hong-Yu Chiu | N/A | |
| Clustering by Maximizing Mutual Information Across Views | Kien Do · , · Truyen Tran · , · Svetha Venkatesh | N/A | |
| SIGN: Spatial-Information Incorporated Generative Network for Generalized Zero-Shot Semantic Segmentation | Jiaxin Cheng · , · Soumyaroop Nandi · , · Prem Natarajan · , · Wael Abd-Almageed | N/A | |
| Learning With Privileged Tasks | Yuru Song · , · Zan Lou · , · Shan You · , · Erkun Yang · , · Fei Wang · , · Chen Qian · , · Changshui Zhang · , · Xiaogang Wang | N/A | |
| Modelling Neighbor Relation in Joint Space-Time Graph for Video Correspondence Learning | Zixu Zhao · , · Yueming Jin · , · Pheng-Ann Heng | N/A | |
| DiagViB-6: A Diagnostic Benchmark Suite for Vision Models in the Presence of Shortcut and Generalization Opportunities | Elias Eulig · , · Piyapat Saranrittichai · , · Chaithanya Kumar Mummadi · , · Kilian Rambach · , · William Beluch · , · Xiahan Shi · , · Volker Fischer | N/A | |
| ASMR: Learning Attribute-Based Person Search With Adaptive Semantic Margin Regularizer | Boseung Jeong · , · Jicheol Park · , · Suha Kwak | N/A | |
| Learning Inner-Group Relations on Point Clouds | Haoxi Ran · , · Wei Zhuo · , · Jun Liu · , · Li Lu | N/A | |
| Swin Transformer: Hierarchical Vision Transformer Using Shifted Windows | Ze Liu · , · Yutong Lin · , · Yue Cao · , · Han Hu · , · Yixuan Wei · , · Zheng Zhang · , · Stephen Lin · , · Baining Guo | N/A | |
| Dynamic Cross Feature Fusion for Remote Sensing Pansharpening | Xiao Wu · , · Ting-Zhu Huang · , · Liang-Jian Deng · , · Tian-Jing Zhang | N/A | |
| Free-Form Description Guided 3D Visual Graph Network for Object Grounding in Point Cloud | Mingtao Feng · , · Zhen Li · , · Qi Li · , · Liang Zhang · , · XiangDong Zhang · , · Guangming Zhu · , · Hui Zhang · , · Yaonan Wang · , · Ajmal Mian | N/A | |
| The Devil Is in the Task: Exploiting Reciprocal Appearance-Localization Features for Monocular 3D Object Detection | Zhikang Zou · , · Xiaoqing Ye · , · Liang Du · , · Xianhui Cheng · , · Xiao Tan · , · Li Zhang · , · Jianfeng Feng · , · Xiangyang Xue · , · Errui Ding | N/A | |
| SimROD: A Simple Adaptation Method for Robust Object Detection | Rindra Ramamonjison · , · Amin Banitalebi-Dehkordi · , · Xinyu Kang · , · Xiaolong Bai · , · Yong Zhang | N/A | |
| Gated3D: Monocular 3D Object Detection From Temporal Illumination Cues | Frank Julca-Aguilar · , · Jason Taylor · , · Mario Bijelic · , · Fahim Mannan · , · Ethan Tseng · , · Felix Heide | N/A | |
| iMAP: Implicit Mapping and Positioning in Real-Time | Edgar Sucar · , · Shikun Liu · , · Joseph Ortiz · , · Andrew J. Davison | N/A | |
| Conditional Diffusion for Interactive Segmentation | Xi Chen · , · Zhiyan Zhao · , · Feiwu Yu · , · Yilei Zhang · , · Manni Duan | N/A | |
| Semi-Supervised Active Learning for Semi-Supervised Models: Exploit Adversarial Examples With Graph-Based Virtual Labels | Jiannan Guo · , · Haochen Shi · , · Yangyang Kang · , · Kun Kuang · , · Siliang Tang · , · Zhuoren Jiang · , · Changlong Sun · , · Fei Wu · , · Yueting Zhuang | N/A | |
| RobustNav: Towards Benchmarking Robustness in Embodied Navigation | Prithvijit Chattopadhyay · , · Judy Hoffman · , · Roozbeh Mottaghi · , · Aniruddha Kembhavi | N/A | |
| Conditional Variational Capsule Network for Open Set Recognition | Yunrui Guo · , · Guglielmo Camporese · , · Wenjing Yang · , · Alessandro Sperduti · , · Lamberto Ballan | N/A | |
| Towards Real-World Prohibited Item Detection: A Large-Scale X-Ray Benchmark | Boying Wang · , · Libo Zhang · , · Longyin Wen · , · Xianglong Liu · , · Yanjun Wu | N/A | |
| Let's See Clearly: Contaminant Artifact Removal for Moving Cameras | Xiaoyu Li · , · Bo Zhang · , · Jing Liao · , · Pedro V. Sander | N/A | |
| Generating Attribution Maps With Disentangled Masked Backpropagation | Adria Ruiz · , · Antonio Agudo · , · Francesc Moreno-Noguer | N/A | |
| A Simple Baseline for Weakly-Supervised Scene Graph Generation | Jing Shi · , · Yiwu Zhong · , · Ning Xu · , · Yin Li · , · Chenliang Xu | N/A | |
| Self-Supervised Vessel Segmentation via Adversarial Learning | Yuxin Ma · , · Yang Hua · , · Hanming Deng · , · Tao Song · , · Hao Wang · , · Zhengui Xue · , · Heng Cao · , · Ruhui Ma · , · Haibing Guan | N/A | |
| 3DStyleNet: Creating 3D Shapes With Geometric and Texture Style Variations | Kangxue Yin · , · Jun Gao · , · Maria Shugrina · , · Sameh Khamis · , · Sanja Fidler | N/A | |
| NeRD: Neural Reflectance Decomposition From Image Collections | Mark Boss · , · Raphael Braun · , · Varun Jampani · , · Jonathan T. Barron · , · Ce Liu · , · Hendrik P.A. Lensch | N/A | |
| Deep Hybrid Self-Prior for Full 3D Mesh Generation | Xingkui Wei · , · Zhengqing Chen · , · Yanwei Fu · , · Zhaopeng Cui · , · Yinda Zhang | N/A | |
| MSR-GCN: Multi-Scale Residual Graph Convolution Networks for Human Motion Prediction | Lingwei Dang · , · Yongwei Nie · , · Chengjiang Long · , · Qing Zhang · , · Guiqing Li | N/A | |
| Super Resolve Dynamic Scene From Continuous Spike Streams | Jing Zhao · , · Jiyu Xie · , · Ruiqin Xiong · , · Jian Zhang · , · Zhaofei Yu · , · Tiejun Huang | N/A | |
| Gait Recognition via Effective Global-Local Feature Representation and Local Temporal Aggregation | Beibei Lin · , · Shunli Zhang · , · Xin Yu | N/A | |
| Labels4Free: Unsupervised Segmentation Using StyleGAN | Rameen Abdal · , · Peihao Zhu · , · Niloy J. Mitra · , · Peter Wonka | N/A | |
| Harnessing the Conditioning Sensorium for Improved Image Translation | Cooper Nederhood · , · Nicholas Kolkin · , · Deqing Fu · , · Jason Salavon | N/A | |
| DRINet: A Dual-Representation Iterative Learning Network for Point Cloud Segmentation | Maosheng Ye · , · Shuangjie Xu · , · Tongyi Cao · , · Qifeng Chen | N/A | |
| Spectral Leakage and Rethinking the Kernel Size in CNNs | Nergis Tomen · , · Jan C. van Gemert | N/A | |
| High-Fidelity Pluralistic Image Completion With Transformers | Ziyu Wan · , · Jingbo Zhang · , · Dongdong Chen · , · Jing Liao | N/A | |
| Dance With Self-Attention: A New Look of Conditional Random Fields on Anomaly Detection in Videos | Didik Purwanto · , · Yie-Tarng Chen · , · Wen-Hsien Fang | N/A | |
| Text Is Text, No Matter What: Unifying Text Recognition Using Knowledge Distillation | Ayan Kumar Bhunia · , · Aneeshan Sain · , · Pinaki Nath Chowdhury · , · Yi-Zhe Song | N/A | |
| Unsupervised 3D Pose Estimation for Hierarchical Dance Video Recognition | Xiaodan Hu · , · Narendra Ahuja | N/A | |
| What You Can Learn by Staring at a Blank Wall | Prafull Sharma · , · Miika Aittala · , · Yoav Y. Schechner · , · Antonio Torralba · , · Gregory W. Wornell · , · William T. Freeman · , · Frédo Durand | N/A | |
| Improving Contrastive Learning by Visualizing Feature Transformation | Rui Zhu · , · Bingchen Zhao · , · Jingen Liu · , · Zhenglong Sun · , · Chang Wen Chen | N/A | |
| Complementary Patch for Weakly Supervised Semantic Segmentation | Fei Zhang · , · Chaochen Gu · , · Chenyue Zhang · , · Yuchao Dai | N/A | |
| Product1M: Towards Weakly Supervised Instance-Level Product Retrieval via Cross-Modal Pretraining | Xunlin Zhan · , · Yangxin Wu · , · Xiao Dong · , · Yunchao Wei · , · Minlong Lu · , · Yichi Zhang · , · Hang Xu · , · Xiaodan Liang | N/A | |
| StyleFormer: Real-Time Arbitrary Style Transfer via Parametric Style Composition | Xiaolei Wu · , · Zhihao Hu · , · Lu Sheng · , · Dong Xu | N/A | |
| Hypercorrelation Squeeze for Few-Shot Segmentation | Juhong Min · , · Dahyun Kang · , · Minsu Cho | N/A | |
| Digging Into Uncertainty in Self-Supervised Multi-View Stereo | Hongbin Xu · , · Zhipeng Zhou · , · Yali Wang · , · Wenxiong Kang · , · Baigui Sun · , · Hao Li · , · Yu Qiao | N/A | |
| A Style and Semantic Memory Mechanism for Domain Generalization | Yang Chen · , · Yu Wang · , · Yingwei Pan · , · Ting Yao · , · Xinmei Tian · , · Tao Mei | N/A | |
| EigenGAN: Layer-Wise Eigen-Learning for GANs | Zhenliang He · , · Meina Kan · , · Shiguang Shan | N/A | |
| Uncertainty-Aware Human Mesh Recovery From Video by Learning Part-Based 3D Dynamics | Gun-Hee Lee · , · Seong-Whan Lee | N/A | |
| Neural TMDlayer: Modeling Instantaneous Flow of Features via SDE Generators | Zihang Meng · , · Vikas Singh · , · Sathya N. Ravi | N/A | |
| Rethinking Transformer-Based Set Prediction for Object Detection | Zhiqing Sun · , · Shengcao Cao · , · Yiming Yang · , · Kris M. Kitani | N/A | |
| FIERY: Future Instance Prediction in Bird's-Eye View From Surround Monocular Cameras | Anthony Hu · , · Zak Murez · , · Nikhil Mohan · , · Sofía Dudas · , · Jeffrey Hawke · , · Vijay Badrinarayanan · , · Roberto Cipolla · , · Alex Kendall | N/A | |
| CLEAR: Clean-Up Sample-Targeted Backdoor in Neural Networks | Liuwan Zhu · , · Rui Ning · , · Chunsheng Xin · , · Chonggang Wang · , · Hongyi Wu | N/A | |
| Motion-Augmented Self-Training for Video Recognition at Smaller Scale | Kirill Gavrilyuk · , · Mihir Jain · , · Ilia Karmanov · , · Cees G. M. Snoek | N/A | |
| Generating Smooth Pose Sequences for Diverse Human Motion Prediction | Wei Mao · , · Miaomiao Liu · , · Mathieu Salzmann | N/A | |
| DensePose 3D: Lifting Canonical Surface Maps of Articulated Objects to the Third Dimension | Roman Shapovalov · , · David Novotny · , · Benjamin Graham · , · Patrick Labatut · , · Andrea Vedaldi | N/A | |
| Unpaired Learning for Deep Image Deraining With Rain Direction Regularizer | Yang Liu · , · Ziyu Yue · , · Jinshan Pan · , · Zhixun Su | N/A | |
| Self-Supervised Image Prior Learning With GMM From a Single Noisy Image | Haosen Liu · , · Xuan Liu · , · Jiangbo Lu · , · Shan Tan | N/A | |
| Exploiting a Joint Embedding Space for Generalized Zero-Shot Semantic Segmentation | Donghyeon Baek · , · Youngmin Oh · , · Bumsub Ham | N/A | |
| HeadGAN: One-Shot Neural Head Synthesis and Editing | Michail Christos Doukas · , · Stefanos Zafeiriou · , · Viktoriia Sharmanska | N/A | |
| Aligning Subtitles in Sign Language Videos | Hannah Bull · , · Triantafyllos Afouras · , · Gül Varol · , · Samuel Albanie · , · Liliane Momeni · , · Andrew Zisserman | N/A | |
| Variational Feature Disentangling for Fine-Grained Few-Shot Classification | Jingyi Xu · , · Hieu Le · , · Mingzhen Huang · , · ShahRukh Athar · , · Dimitris Samaras | N/A | |
| MultiSiam: Self-Supervised Multi-Instance Siamese Representation Learning for Autonomous Driving | Kai Chen · , · Lanqing Hong · , · Hang Xu · , · Zhenguo Li · , · Dit-Yan Yeung | N/A | |
| Pano-AVQA: Grounded Audio-Visual Question Answering on 360° Videos | Heeseung Yun · , · Youngjae Yu · , · Wonsuk Yang · , · Kangil Lee · , · Gunhee Kim | N/A | |
| Deep Implicit Surface Point Prediction Networks | Rahul Venkatesh · , · Tejan Karmali · , · Sarthak Sharma · , · Aurobrata Ghosh · , · R. Venkatesh Babu · , · László A. Jeni · , · Maneesh Singh | N/A | |
| Broaden Your Views for Self-Supervised Video Learning | Adrià Recasens · , · Pauline Luc · , · Jean-Baptiste Alayrac · , · Luyu Wang · , · Florian Strub · , · Corentin Tallec · , · Mateusz Malinowski · , · Viorica Pătrăucean · , · Florent Altché · , · Michal Valko · , · Jean-Bastien Grill · , · Aäron van den Oord · , · Andrew Zisserman | N/A | |
| Deep Metric Learning for Open World Semantic Segmentation | Jun Cen · , · Peng Yun · , · Junhao Cai · , · Michael Yu Wang · , · Ming Liu | N/A | |
| Boundary-Sensitive Pre-Training for Temporal Localization in Videos | Mengmeng Xu · , · Juan-Manuel Pérez-Rúa · , · Victor Escorcia · , · Brais Martínez · , · Xiatian Zhu · , · Li Zhang · , · Bernard Ghanem · , · Tao Xiang | N/A | |
| SO-Pose: Exploiting Self-Occlusion for Direct 6D Pose Estimation | Yan Di · , · Fabian Manhardt · , · Gu Wang · , · Xiangyang Ji · , · Nassir Navab · , · Federico Tombari | N/A | |
| Explainable Video Entailment With Grounded Visual Evidence | Junwen Chen · , · Yu Kong | N/A | |
| HiT: Hierarchical Transformer With Momentum Contrast for Video-Text Retrieval | Song Liu · , · Haoqi Fan · , · Shengsheng Qian · , · Yiru Chen · , · Wenkui Ding · , · Zhongyuan Wang | N/A | |
| Unsupervised Few-Shot Action Recognition via Action-Appearance Aligned Meta-Adaptation | Jay Patravali · , · Gaurav Mittal · , · Ye Yu · , · Fuxin Li · , · Mei Chen | N/A | |
| Poly-NL: Linear Complexity Non-Local Layers With 3rd Order Polynomials | Francesca Babiloni · , · Ioannis Marras · , · Filippos Kokkinos · , · Jiankang Deng · , · Grigorios Chrysos · , · Stefanos Zafeiriou | N/A | |
| PatchMatch-RL: Deep MVS With Pixelwise Depth, Normal, and Visibility | Jae Yong Lee · , · Joseph DeGol · , · Chuhang Zou · , · Derek Hoiem | N/A | |
| Distinctiveness Oriented Positional Equilibrium for Point Cloud Registration | Taewon Min · , · Chonghyuk Song · , · Eunseok Kim · , · Inwook Shim | N/A | |
| Deep Edge-Aware Interactive Colorization Against Color-Bleeding Effects | Eungyeup Kim · , · Sanghyeon Lee · , · Jeonghoon Park · , · Somi Choi · , · Choonghyun Seo · , · Jaegul Choo | N/A | |
| ELSD: Efficient Line Segment Detector and Descriptor | Haotian Zhang · , · Yicheng Luo · , · Fangbo Qin · , · Yijia He · , · Xiao Liu | N/A | |
| Separable Flow: Learning Motion Cost Volumes for Optical Flow Estimation | Feihu Zhang · , · Oliver J. Woodford · , · Victor Adrian Prisacariu · , · Philip H.S. Torr | N/A | |
| Learned Spatial Representations for Few-Shot Talking-Head Synthesis | Moustafa Meshry · , · Saksham Suri · , · Larry S. Davis · , · Abhinav Shrivastava | N/A | |
| Vision Transformers for Dense Prediction | René Ranftl · , · Alexey Bochkovskiy · , · Vladlen Koltun | N/A | |
| V-DESIRR: Very Fast Deep Embedded Single Image Reflection Removal | B H Pawan Prasad · , · Green Rosh K S · , · Lokesh R. Boregowda · , · Kaushik Mitra · , · Sanjoy Chowdhury | N/A | |
| CrackFormer: Transformer Network for Fine-Grained Crack Detection | Huajun Liu · , · Xiangyu Miao · , · Christoph Mertz · , · Chengzhong Xu · , · Hui Kong | N/A | |
| Factorizing Perception and Policy for Interactive Instruction Following | Kunal Pratap Singh · , · Suvaansh Bhambri · , · Byeonghwi Kim · , · Roozbeh Mottaghi · , · Jonghyun Choi | N/A | |
| Detecting Invisible People | Tarasha Khurana · , · Achal Dave · , · Deva Ramanan | N/A | |
| GANcraft: Unsupervised 3D Neural Rendering of Minecraft Worlds | Zekun Hao · , · Arun Mallya · , · Serge Belongie · , · Ming-Yu Liu | N/A | |
| Talk-To-Edit: Fine-Grained Facial Editing via Dialog | Yuming Jiang · , · Ziqi Huang · , · Xingang Pan · , · Chen Change Loy · , · Ziwei Liu | N/A | |
| AgentFormer: Agent-Aware Transformers for Socio-Temporal Multi-Agent Forecasting | Ye Yuan · , · Xinshuo Weng · , · Yanglan Ou · , · Kris M. Kitani | N/A | |
| Transparent Object Tracking Benchmark | Heng Fan · , · Halady Akhilesha Miththanthaya · , · Harshit · , · Siranjiv Ramana Rajan · , · Xiaoqiong Liu · , · Zhilin Zou · , · Yuewei Lin · , · Haibin Ling | N/A | |
| Boosting Weakly Supervised Object Detection via Learning Bounding Box Adjusters | Bowen Dong · , · Zitong Huang · , · Yuelin Guo · , · Qilong Wang · , · Zhenxing Niu · , · Wangmeng Zuo | N/A | |
| Group-Wise Inhibition Based Feature Regularization for Robust Classification | Haozhe Liu · , · Haoqian Wu · , · Weicheng Xie · , · Feng Liu · , · Linlin Shen | N/A | |
| Incorporating Convolution Designs Into Visual Transformers | Kun Yuan · , · Shaopeng Guo · , · Ziwei Liu · , · Aojun Zhou · , · Fengwei Yu · , · Wei Wu | N/A | |
| CDS: Cross-Domain Self-Supervised Pre-Training | Donghyun Kim · , · Kuniaki Saito · , · Tae-Hyun Oh · , · Bryan A. Plummer · , · Stan Sclaroff · , · Kate Saenko | N/A | |
| CaT: Weakly Supervised Object Detection With Category Transfer | Tianyue Cao, Lianyu Du, Xiaoyun Zhang, Siheng Chen, Ya Zhang, Yan-Feng Wang | N/A | |
| 4DComplete: Non-Rigid Motion Estimation Beyond the Observable Surface | Yang Li, Hikari Takehara, Takafumi Taketomi, Bo Zheng, Matthias Nießner | N/A | |
| Scaling Semantic Segmentation Beyond 1K Classes on a Single GPU | Shipra Jain, Danda Pani Paudel, Martin Danelljan, Luc Van Gool | N/A | |
| Searching for Robustness: Loss Learning for Noisy Classification Tasks | Boyan Gao, Henry Gouk, Timothy M. Hospedales | N/A | |
| On Compositions of Transformations in Contrastive Self-Supervised Learning | Mandela Patrick, Yuki M. Asano, Polina Kuznetsova, Ruth Fong, João F. Henriques, Geoffrey Zweig, Andrea Vedaldi | N/A | |
| Handwriting Transformers | Ankan Kumar Bhunia, Salman Khan, Hisham Cholakkal, Rao Muhammad Anwer, Fahad Shahbaz Khan, Mubarak Shah | N/A | |
| BV-Person: A Large-Scale Dataset for Bird-View Person Re-Identification | Cheng Yan, Guansong Pang, Lei Wang, Jile Jiao, Xuetao Feng, Chunhua Shen, Jingjing Li | N/A | |
| Hierarchical Kinematic Probability Distributions for 3D Human Shape and Pose Estimation From Images in the Wild | Akash Sengupta, Ignas Budvytis, Roberto Cipolla | N/A | |
| Dynamic DETR: End-to-End Object Detection With Dynamic Attention | Xiyang Dai, Yinpeng Chen, Jianwei Yang, Pengchuan Zhang, Lu Yuan, Lei Zhang | N/A | |
| DepthTrack: Unveiling the Power of RGBD Tracking | Song Yan, Jinyu Yang, Jani Käpylä, Feng Zheng, Aleš Leonardis, Joni-Kristian Kämäräinen | N/A | |
| XVFI: eXtreme Video Frame Interpolation | Hyeonjun Sim, Jihyong Oh, Munchurl Kim | N/A | |
| Cortical Surface Shape Analysis Based on Alexandrov Polyhedra | Min Zhang, Yang Guo, Na Lei, Zhou Zhao, Jianfeng Wu, Xiaoyin Xu, Yalin Wang, Xianfeng Gu | N/A | |
| Watch Only Once: An End-to-End Video Action Detection Framework | Shoufa Chen, Peize Sun, Enze Xie, Chongjian Ge, Jiannan Wu, Lan Ma, Jiajun Shen, Ping Luo | N/A | |
| Spatial-Temporal Consistency Network for Low-Latency Trajectory Forecasting | Shijie Li, Yanying Zhou, Jinhui Yi, Juergen Gall | N/A | |
| Point Cloud Augmentation With Weighted Local Transformations | Sihyeon Kim, Sanghyeok Lee, Dasol Hwang, Jaewon Lee, Seong Jae Hwang, Hyunwoo J. Kim | N/A | |
| Solving Inefficiency of Self-Supervised Representation Learning | Guangrun Wang, Keze Wang, Guangcong Wang, Philip H.S. Torr, Liang Lin | N/A | |
| Stochastic Scene-Aware Motion Prediction | Mohamed Hassan, Duygu Ceylan, Ruben Villegas, Jun Saito, Jimei Yang, Yi Zhou, Michael J. Black | N/A | |
| Estimating and Exploiting the Aleatoric Uncertainty in Surface Normal Estimation | Gwangbin Bae, Ignas Budvytis, Roberto Cipolla | N/A | |
| Explaining in Style: Training a GAN To Explain a Classifier in StyleSpace | Oran Lang, Yossi Gandelsman, Michal Yarom, Yoav Wald, Gal Elidan, Avinatan Hassidim, William T. Freeman, Phillip Isola, Amir Globerson, Michal Irani, Inbar Mosseri | N/A | |
| Exploring Visual Engagement Signals for Representation Learning | Menglin Jia, Zuxuan Wu, Austin Reiter, Claire Cardie, Serge Belongie, Ser-Nam Lim | N/A | |
| MUSIQ: Multi-Scale Image Quality Transformer | Junjie Ke, Qifei Wang, Yilin Wang, Peyman Milanfar, Feng Yang | N/A | |
| FcaNet: Frequency Channel Attention Networks | Zequn Qin, Pengyi Zhang, Fei Wu, Xi Li | N/A | |
| Matching in the Dark: A Dataset for Matching Image Pairs of Low-Light Scenes | Wenzheng Song, Masanori Suganuma, Xing Liu, Noriyuki Shimobayashi, Daisuke Maruta, Takayuki Okatani | N/A | |
| ReDAL: Region-Based and Diversity-Aware Active Learning for Point Cloud Semantic Segmentation | Tsung-Han Wu, Yueh-Cheng Liu, Yu-Kai Huang, Hsin-Ying Lee, Hung-Ting Su, Ping-Chia Huang, Winston H. Hsu | N/A | |
| Point Transformer | Hengshuang Zhao, Li Jiang, Jiaya Jia, Philip H.S. Torr, Vladlen Koltun | N/A | |
| Self-Motivated Communication Agent for Real-World Vision-Dialog Navigation | Yi Zhu, Yue Weng, Fengda Zhu, Xiaodan Liang, Qixiang Ye, Yutong Lu, Jianbin Jiao | N/A | |
| Learning Motion-Appearance Co-Attention for Zero-Shot Video Object Segmentation | Shu Yang, Lu Zhang, Jinqing Qi, Huchuan Lu, Shuo Wang, Xiaoxing Zhang | N/A | |
| Putting NeRF on a Diet: Semantically Consistent Few-Shot View Synthesis | Ajay Jain, Matthew Tancik, Pieter Abbeel | N/A | |
| CrossDet: Crossline Representation for Object Detection | Heqian Qiu, Hongliang Li, Qingbo Wu, Jianhua Cui, Zichen Song, Lanxiao Wang, Minjian Zhang | N/A | |
| Graph-to-3D: End-to-End Generation and Manipulation of 3D Scenes Using Scene Graphs | Helisa Dhamo, Fabian Manhardt, Nassir Navab, Federico Tombari | N/A | |
| Universal and Flexible Optical Aberration Correction Using Deep-Prior Based Deconvolution | Xiu Li, Jinli Suo, Weihang Zhang, Xin Yuan, Qionghai Dai | N/A | |
| E-ViL: A Dataset and Benchmark for Natural Language Explanations in Vision-Language Tasks | Maxime Kayser, Oana-Maria Camburu, Leonard Salewski, Cornelius Emde, Virginie Do, Zeynep Akata, Thomas Lukasiewicz | N/A | |
| Universal Cross-Domain Retrieval: Generalizing Across Classes and Domains | Soumava Paul, Titir Dutta, Soma Biswas | N/A | |
| Learning Unsupervised Metaformer for Anomaly Detection | Jhih-Ciang Wu, Ding-Jie Chen, Chiou-Shann Fuh, Tyng-Luh Liu | N/A | |
| Robust Object Detection via Instance-Level Temporal Cycle Confusion | Xin Wang, Thomas E. Huang, Benlin Liu, Fisher Yu, Xiaolong Wang, Joseph E. Gonzalez, Trevor Darrell | N/A | |
| HighlightMe: Detecting Highlights From Human-Centric Videos | Uttaran Bhattacharya, Gang Wu, Stefano Petrangeli, Viswanathan Swaminathan, Dinesh Manocha | N/A | |
| Procedure Planning in Instructional Videos via Contextual Modeling and Model-Based Policy Learning | Jing Bi, Jiebo Luo, Chenliang Xu | N/A | |
| Variable-Rate Deep Image Compression Through Spatially-Adaptive Feature Transform | Myungseo Song, Jinyoung Choi, Bohyung Han | N/A | |
| DeePSD: Automatic Deep Skinning and Pose Space Deformation for 3D Garment Animation | Hugo Bertiche, Meysam Madadi, Emilio Tylson, Sergio Escalera | N/A | |
| Structured Outdoor Architecture Reconstruction by Exploration and Classification | Fuyang Zhang, Xiang Xu, Nelson Nauata, Yasutaka Furukawa | N/A | |
| MG-GAN: A Multi-Generator Model Preventing Out-of-Distribution Samples in Pedestrian Trajectory Prediction | Patrick Dendorfer, Sven Elflein, Laura Leal-Taixé | N/A | |
| Rethinking Coarse-To-Fine Approach in Single Image Deblurring | Sung-Jin Cho, Seo-Won Ji, Jun-Pyo Hong, Seung-Won Jung, Sung-Jea Ko | N/A | |
| Multi-Instance Pose Networks: Rethinking Top-Down Pose Estimation | Rawal Khirodkar, Visesh Chari, Amit Agrawal, Ambrish Tyagi | N/A | |
| Rational Polynomial Camera Model Warping for Deep Learning Based Satellite Multi-View Stereo Matching | Jian Gao, Jin Liu, Shunping Ji | N/A | |
| A New Journey From SDRTV to HDRTV | Xiangyu Chen, Zhengwen Zhang, Jimmy S. Ren, Lynhoo Tian, Yu Qiao, Chao Dong | N/A | |
| Point-Set Distances for Learning Representations of 3D Point Clouds | Trung Nguyen, Quang-Hieu Pham, Tam Le, Tung Pham, Nhat Ho, Binh-Son Hua | N/A | |
| ELLIPSDF: Joint Object Pose and Shape Optimization With a Bi-Level Ellipsoid and Signed Distance Function Description | Mo Shan, Qiaojun Feng, You-Yi Jau, Nikolay Atanasov | N/A | |
| ARCH++: Animation-Ready Clothed Human Reconstruction Revisited | Tong He, Yuanlu Xu, Shunsuke Saito, Stefano Soatto, Tony Tung | N/A | |
| Vision-Language Transformer and Query Generation for Referring Segmentation | Henghui Ding, Chang Liu, Suchen Wang, Xudong Jiang | N/A | |
| Semantically Coherent Out-of-Distribution Detection | Jingkang Yang, Haoqi Wang, Litong Feng, Xiaopeng Yan, Huabin Zheng, Wayne Zhang, Ziwei Liu | N/A | |
| SCOUTER: Slot Attention-Based Classifier for Explainable Image Recognition | Liangzhi Li, Bowen Wang, Manisha Verma, Yuta Nakashima, Ryo Kawasaki, Hajime Nagahara | N/A | |
| RetrievalFuse: Neural 3D Scene Reconstruction With a Database | Yawar Siddiqui, Justus Thies, Fangchang Ma, Qi Shan, Matthias Nießner, Angela Dai | N/A | |
| Spatio-Temporal Self-Supervised Representation Learning for 3D Point Clouds | Siyuan Huang, Yichen Xie, Song-Chun Zhu, Yixin Zhu | N/A | |
| Learning To Know Where To See: A Visibility-Aware Approach for Occluded Person Re-Identification | Jinrui Yang, Jiawei Zhang, Fufu Yu, Xinyang Jiang, Mengdan Zhang, Xing Sun, Ying-Cong Chen, Wei-Shi Zheng | N/A | |
| Focal Frequency Loss for Image Reconstruction and Synthesis | Liming Jiang, Bo Dai, Wayne Wu, Chen Change Loy | N/A | |
| Calibrating Concepts and Operations: Towards Symbolic Reasoning on Real Images | Zhuowan Li, Elias Stengel-Eskin, Yixiao Zhang, Cihang Xie, Quan Hung Tran, Benjamin Van Durme, Alan Yuille | N/A | |
| Vision-Language Navigation With Random Environmental Mixup | Chong Liu, Fengda Zhu, Xiaojun Chang, Xiaodan Liang, Zongyuan Ge, Yi-Dong Shen | N/A | |
| Object Tracking by Jointly Exploiting Frame and Event Domain | Jiqing Zhang, Xin Yang, Yingkai Fu, Xiaopeng Wei, Baocai Yin, Bo Dong | N/A | |
| Learning To Generate Scene Graph From Natural Language Supervision | Yiwu Zhong, Jing Shi, Jianwei Yang, Chenliang Xu, Yin Li | N/A | |
| Editing Conditional Radiance Fields | Steven Liu, Xiuming Zhang, Zhoutong Zhang, Richard Zhang, Jun-Yan Zhu, Bryan Russell | N/A | |
| Global Pooling, More Than Meets the Eye: Position Information Is Encoded Channel-Wise in CNNs | Md Amirul Islam, Matthew Kowal, Sen Jia, Konstantinos G. Derpanis, Neil D. B. Bruce | N/A | |
| Testing Using Privileged Information by Adapting Features With Statistical Dependence | Kwang In Kim, James Tompkin | N/A | |
| CvT: Introducing Convolutions to Vision Transformers | Haiping Wu, Bin Xiao, Noel Codella, Mengchen Liu, Xiyang Dai, Lu Yuan, Lei Zhang | N/A | |
| Context-Sensitive Temporal Feature Learning for Gait Recognition | Xiaohu Huang, Duowang Zhu, Hao Wang, Xinggang Wang, Bo Yang, Botao He, Wenyu Liu, Bin Feng | N/A | |
| Pseudo-Mask Matters in Weakly-Supervised Semantic Segmentation | Yi Li, Zhanghui Kuang, Liyang Liu, Yimin Chen, Wayne Zhang | N/A | |
| COTR: Correspondence Transformer for Matching Across Images | Wei Jiang, Eduard Trulls, Jan Hosang, Andrea Tagliasacchi, Kwang Moo Yi | N/A | |
| CoMatch: Semi-Supervised Learning With Contrastive Graph Regularization | Junnan Li, Caiming Xiong, Steven C.H. Hoi | N/A | |
| End-to-End Semi-Supervised Object Detection With Soft Teacher | Mengde Xu, Zheng Zhang, Han Hu, Jianfeng Wang, Lijuan Wang, Fangyun Wei, Xiang Bai, Zicheng Liu | N/A | |
| Zen-NAS: A Zero-Shot NAS for High-Performance Image Recognition | Ming Lin, Pichao Wang, Zhenhong Sun, Hesen Chen, Xiuyu Sun, Qi Qian, Hao Li, Rong Jin | N/A | |
| Virtual Light Transport Matrices for Non-Line-of-Sight Imaging | Julio Marco, Adrian Jarabo, Ji Hyun Nam, Xiaochun Liu, Miguel Ángel Cosculluela, Andreas Velten, Diego Gutierrez | N/A | |
| DecentLaM: Decentralized Momentum SGD for Large-Batch Deep Training | Kun Yuan, Yiming Chen, Xinmeng Huang, Yingya Zhang, Pan Pan, Yinghui Xu, Wotao Yin | N/A | |
| Video Object Segmentation With Dynamic Memory Networks and Adaptive Object Alignment | Shuxian Liang, Xu Shen, Jianqiang Huang, Xian-Sheng Hua | N/A | |
| Augmented Lagrangian Adversarial Attacks | Jérôme Rony, Eric Granger, Marco Pedersoli, Ismail Ben Ayed | N/A | |
| Contrastive Multimodal Fusion With TupleInfoNCE | Yunze Liu, Qingnan Fan, Shanghang Zhang, Hao Dong, Thomas Funkhouser, Li Yi | N/A | |
| Deep Reparametrization of Multi-Frame Super-Resolution and Denoising | Goutam Bhat, Martin Danelljan, Fisher Yu, Luc Van Gool, Radu Timofte | N/A | |
| Always Be Dreaming: A New Approach for Data-Free Class-Incremental Learning | James Smith, Yen-Chang Hsu, Jonathan Balloch, Yilin Shen, Hongxia Jin, Zsolt Kira | N/A | |
| ViViT: A Video Vision Transformer | Anurag Arnab, Mostafa Dehghani, Georg Heigold, Chen Sun, Mario Lučić, Cordelia Schmid | N/A | |
| Generative Compositional Augmentations for Scene Graph Prediction | Boris Knyazev, Harm de Vries, Cătălina Cangea, Graham W. Taylor, Aaron Courville, Eugene Belilovsky | N/A | |
| StyleCLIP: Text-Driven Manipulation of StyleGAN Imagery | Or Patashnik, Zongze Wu, Eli Shechtman, Daniel Cohen-Or, Dani Lischinski | N/A | |
| Meta-Attack: Class-Agnostic and Model-Agnostic Physical Adversarial Attack | Weiwei Feng, Baoyuan Wu, Tianzhu Zhang, Yong Zhang, Yongdong Zhang | N/A | |
| Benchmarking Ultra-High-Definition Image Super-Resolution | Kaihao Zhang, Dongxu Li, Wenhan Luo, Wenqi Ren, Björn Stenger, Wei Liu, Hongdong Li, Ming-Hsuan Yang | N/A | |
| 3D Local Convolutional Neural Networks for Gait Recognition | Zhen Huang, Dixiu Xue, Xu Shen, Xinmei Tian, Houqiang Li, Jianqiang Huang, Xian-Sheng Hua | N/A | |
| Lucas-Kanade Reloaded: End-to-End Super-Resolution From Raw Image Bursts | Bruno Lecouat, Jean Ponce, Julien Mairal | N/A | |
| Learning Better Visual Data Similarities via New Grouplet Non-Euclidean Embedding | Yanfu Zhang, Lei Luo, Wenhan Xian, Heng Huang | N/A | |
| PR-GCN: A Deep Graph Convolutional Network With Point Refinement for 6D Pose Estimation | Guangyuan Zhou, Huiqun Wang, Jiaxin Chen, Di Huang | N/A | |
| Learning High-Fidelity Face Texture Completion Without Complete Face Texture | Jongyoo Kim, Jiaolong Yang, Xin Tong | N/A | |
| Product Quantizer Aware Inverted Index for Scalable Nearest Neighbor Search | Haechan Noh, Taeho Kim, Jae-Pil Heo | N/A | |
| RINDNet: Edge Detection for Discontinuity in Reflectance, Illumination, Normal and Depth | Mengyang Pu, Yaping Huang, Qingji Guan, Haibin Ling | N/A | |
| Track Without Appearance: Learn Box and Tracklet Embedding With Local and Global Motion Patterns for Vehicle Tracking | Gaoang Wang, Renshu Gu, Zuozhu Liu, Weijie Hu, Mingli Song, Jenq-Neng Hwang | N/A | |
| Are We Missing Confidence in Pseudo-LiDAR Methods for Monocular 3D Object Detection? | Andrea Simonelli, Samuel Rota Bulò, Lorenzo Porzi, Peter Kontschieder, Elisa Ricci | N/A | |
| On the Limits of Pseudo Ground Truth in Visual Camera Re-Localisation | Eric Brachmann, Martin Humenberger, Carsten Rother, Torsten Sattler | N/A | |
| An Elastica Geodesic Approach With Convexity Shape Prior | Da Chen, Laurent D. Cohen, Jean-Marie Mirebeau, Xue-Cheng Tai | N/A | |
| AdaAttN: Revisit Attention Mechanism in Arbitrary Neural Style Transfer | Songhua Liu, Tianwei Lin, Dongliang He, Fu Li, Meiling Wang, Xin Li, Zhengxing Sun, Qian Li, Errui Ding | N/A | |
| PASS: Protected Attribute Suppression System for Mitigating Bias in Face Recognition | Prithviraj Dhar, Joshua Gleason, Aniket Roy, Carlos D. Castillo, Rama Chellappa | N/A | |
| Adaptive Boundary Proposal Network for Arbitrary Shape Text Detection | Shi-Xue Zhang, Xiaobin Zhu, Chun Yang, Hongfa Wang, Xu-Cheng Yin | N/A | |
| Video Matting via Consistency-Regularized Graph Neural Networks | Tiantian Wang, Sifei Liu, Yapeng Tian, Kai Li, Ming-Hsuan Yang | N/A | |
| Probabilistic Monocular 3D Human Pose Estimation With Normalizing Flows | Tom Wehrbein, Marco Rudolph, Bodo Rosenhahn, Bastian Wandt | N/A | |
| Inverting a Rolling Shutter Camera: Bring Rolling Shutter Images to High Framerate Global Shutter Video | Bin Fan, Yuchao Dai | N/A | |
| Human Detection and Segmentation via Multi-View Consensus | Isinsu Katircioglu, Helge Rhodin, Jörg Spörri, Mathieu Salzmann, Pascal Fua | N/A | |
| GDP: Stabilized Neural Network Pruning via Gates With Differentiable Polarization | Yi Guo, Huan Yuan, Jianchao Tan, Zhangyang Wang, Sen Yang, Ji Liu | N/A | |
| From Two to One: A New Scene Text Recognizer With Visual Language Modeling Network | Yuxin Wang, Hongtao Xie, Shancheng Fang, Jing Wang, Shenggao Zhu, Yongdong Zhang | N/A | |
| GRF: Learning a General Radiance Field for 3D Representation and Rendering | Alex Trevithick, Bo Yang | N/A | |
| Neural Strokes: Stylized Line Drawing of 3D Shapes | Difan Liu, Matthew Fisher, Aaron Hertzmann, Evangelos Kalogerakis | N/A | |
| Multimodal Knowledge Expansion | Zihui Xue, Sucheng Ren, Zhengqi Gao, Hang Zhao | N/A | |
| Learning To Bundle-Adjust: A Graph Network Approach to Faster Optimization of Bundle Adjustment for Vehicular SLAM | Tetsuya Tanaka, Yukihiro Sasagawa, Takayuki Okatani | N/A | |
| MosaicOS: A Simple and Effective Use of Object-Centric Images for Long-Tailed Object Detection | Cheng Zhang, Tai-Yu Pan, Yandong Li, Hexiang Hu, Dong Xuan, Soravit Changpinyo, Boqing Gong, Wei-Lun Chao | N/A | |
| Bringing Events Into Video Deblurring With Non-Consecutively Blurry Frames | Wei Shang, Dongwei Ren, Dongqing Zou, Jimmy S. Ren, Ping Luo, Wangmeng Zuo | N/A | |
| SPG: Unsupervised Domain Adaptation for 3D Object Detection via Semantic Point Generation | Qiangeng Xu, Yin Zhou, Weiyue Wang, Charles R. Qi, Dragomir Anguelov | N/A | |
| Extreme-Quality Computational Imaging via Degradation Framework | Shiqi Chen, Huajun Feng, Keming Gao, Zhihai Xu, Yueting Chen | N/A | |
| Direct Differentiable Augmentation Search | Aoming Liu, Zehao Huang, Zhiwu Huang, Naiyan Wang | N/A | |
| The Functional Correspondence Problem | Zihang Lai, Senthil Purushwalkam, Abhinav Gupta | N/A | |
| Detection and Continual Learning of Novel Face Presentation Attacks | Mohammad Rostami, Leonidas Spinoulas, Mohamed Hussein, Joe Mathai, Wael Abd-Almageed | N/A | |
| Adaptive Adversarial Network for Source-Free Domain Adaptation | Haifeng Xia, Handong Zhao, Zhengming Ding | N/A | |
| Painting From Part | Dongsheng Guo, Haoru Zhao, Yunhao Cheng, Haiyong Zheng, Zhaorui Gu, Bing Zheng | N/A | |
| Attack-Guided Perceptual Data Generation for Real-World Re-Identification | Yukun Huang, Xueyang Fu, Zheng-Jun Zha | N/A | |
| Parallel Multi-Resolution Fusion Network for Image Inpainting | Wentao Wang, Jianfu Zhang, Li Niu, Haoyu Ling, Xue Yang, Liqing Zhang | N/A | |
| Joint Topology-Preserving and Feature-Refinement Network for Curvilinear Structure Segmentation | Mingfei Cheng, Kaili Zhao, Xuhong Guo, Yajing Xu, Jun Guo | N/A | |
| MT-ORL: Multi-Task Occlusion Relationship Learning | Panhe Feng, Qi She, Lei Zhu, Jiaxin Li, Lin Zhang, Zijian Feng, Changhu Wang, Chunpeng Li, Xuejing Kang, Anlong Ming | N/A | |
| Weakly Supervised Human-Object Interaction Detection in Video via Contrastive Spatiotemporal Regions | Shuang Li, Yilun Du, Antonio Torralba, Josef Sivic, Bryan Russell | N/A | |
| Generative Layout Modeling Using Constraint Graphs | Wamiq Para, Paul Guerrero, Tom Kelly, Leonidas J. Guibas, Peter Wonka | N/A | |
| ZFlow: Gated Appearance Flow-Based Virtual Try-On With 3D Priors | Ayush Chopra, Rishabh Jain, Mayur Hemani, Balaji Krishnamurthy | N/A | |
| Overfitting the Data: Compact Neural Video Delivery via Content-Aware Feature Modulation | Jiaming Liu, Ming Lu, Kaixin Chen, Xiaoqi Li, Shizun Wang, Zhaoqing Wang, Enhua Wu, Yurong Chen, Chuang Zhang, Ming Wu | N/A | |
| Unidentified Video Objects: A Benchmark for Dense, Open-World Segmentation | Weiyao Wang, Matt Feiszli, Heng Wang, Du Tran | N/A | |
| Weakly Supervised Relative Spatial Reasoning for Visual Question Answering | Pratyay Banerjee, Tejas Gokhale, Yezhou Yang, Chitta Baral | N/A | |
| Task-Aware Part Mining Network for Few-Shot Learning | Jiamin Wu, Tianzhu Zhang, Yongdong Zhang, Feng Wu | N/A | |
| Cascade Image Matting With Deformable Graph Refinement | Zijian Yu, Xuhui Li, Huijuan Huang, Wen Zheng, Li Chen | N/A | |
| Geometric Unsupervised Domain Adaptation for Semantic Segmentation | Vitor Guizilini, Jie Li, Rareș Ambruș, Adrien Gaidon | N/A | |
| A Hierarchical Variational Neural Uncertainty Model for Stochastic Video Prediction | Moitreya Chatterjee, Narendra Ahuja, Anoop Cherian | N/A | |
| PX-NET: Simple and Efficient Pixel-Wise Training of Photometric Stereo Networks | Fotios Logothetis, Ignas Budvytis, Roberto Mecca, Roberto Cipolla | N/A | |
| Dynamic Surface Function Networks for Clothed Human Bodies | Andrei Burov, Matthias Nießner, Justus Thies | N/A | |
| Preservational Learning Improves Self-Supervised Medical Image Models by Reconstructing Diverse Contexts | Hong-Yu Zhou, Chixiang Lu, Sibei Yang, Xiaoguang Han, Yizhou Yu | N/A | |
| Rethinking Counting and Localization in Crowds: A Purely Point-Based Framework | Qingyu Song, Changan Wang, Zhengkai Jiang, Yabiao Wang, Ying Tai, Chengjie Wang, Jilin Li, Feiyue Huang, Yang Wu | N/A | |
| The Right To Talk: An Audio-Visual Transformer Approach | Thanh-Dat Truong, Chi Nhan Duong, The De Vu, Hoang Anh Pham, Bhiksha Raj, Ngan Le, Khoa Luu | N/A | |
| Neural Image Compression via Attentional Multi-Scale Back Projection and Frequency Decomposition | Ge Gao, Pei You, Rong Pan, Shunyuan Han, Yuanyuan Zhang, Yuchao Dai, Hojae Lee | N/A | |
| Unpaired Learning for High Dynamic Range Image Tone Mapping | Yael Vinker, Inbar Huberman-Spiegelglas, Raanan Fattal | N/A | |
| Unsupervised Real-World Super-Resolution: A Domain Adaptation Perspective | Wei Wang, Haochen Zhang, Zehuan Yuan, Changhu Wang | N/A | |
| Unaligned Image-to-Image Translation by Learning to Reweight | Shaoan Xie, Mingming Gong, Yanwu Xu, Kun Zhang | N/A | |
| OSCAR-Net: Object-Centric Scene Graph Attention for Image Attribution | Eric Nguyen, Tu Bui, Viswanathan Swaminathan, John Collomosse | N/A | |
| A-SDF: Learning Disentangled Signed Distance Functions for Articulated Shape Representation | Jiteng Mu, Weichao Qiu, Adam Kortylewski, Alan Yuille, Nuno Vasconcelos, Xiaolong Wang | N/A | |
| Consistency-Sensitivity Guided Ensemble Black-Box Adversarial Attacks in Low-Dimensional Spaces | Jianhe Yuan, Zhihai He | N/A | |
| Vision Transformer With Progressive Sampling | Xiaoyu Yue, Shuyang Sun, Zhanghui Kuang, Meng Wei, Philip H.S. Torr, Wayne Zhang, Dahua Lin | N/A | |
| Exploring Long Tail Visual Relationship Recognition With Large Vocabulary | Sherif Abdelkarim, Aniket Agarwal, Panos Achlioptas, Jun Chen, Jiaji Huang, Boyang Li, Kenneth Church, Mohamed Elhoseiny | N/A | |
| EM-POSE: 3D Human Pose Estimation From Sparse Electromagnetic Trackers | Manuel Kaufmann, Yi Zhao, Chengcheng Tang, Lingling Tao, Christopher Twigg, Jie Song, Robert Wang, Otmar Hilliges | N/A | |
| On Exposing the Challenging Long Tail in Future Prediction of Traffic Actors | Osama Makansi, Özgün Çiçek, Yassine Marrakchi, Thomas Brox | N/A | |
| Video Geo-Localization Employing Geo-Temporal Feature Learning and GPS Trajectory Smoothing | Krishna Regmi, Mubarak Shah | N/A | |
| ICON: Learning Regular Maps Through Inverse Consistency | Hastings Greer, Roland Kwitt, François-Xavier Vialard, Marc Niethammer | N/A | |
| ELF-VC: Efficient Learned Flexible-Rate Video Coding | Oren Rippel, Alexander G. Anderson, Kedar Tatwawadi, Sanjay Nair, Craig Lytle, Lubomir Bourdev | N/A | |
| Structure-Preserving Deraining With Residue Channel Prior Guidance | Qiaosi Yi, Juncheng Li, Qinyan Dai, Faming Fang, Guixu Zhang, Tieyong Zeng | N/A | |
| Fast and Efficient DNN Deployment via Deep Gaussian Transfer Learning | Qi Sun, Chen Bai, Tinghuan Chen, Hao Geng, Xinyun Zhang, Yang Bai, Bei Yu | N/A | |
| Towards Complete Scene and Regular Shape for Distortion Rectification by Curve-Aware Extrapolation | Kang Liao, Chunyu Lin, Yunchao Wei, Feng Li, Shangrong Yang, Yao Zhao | N/A | |
| ViewNet: Unsupervised Viewpoint Estimation From Conditional Generation | Octave Mariotti, Oisin Mac Aodha, Hakan Bilen | N/A | |
| High-Resolution Optical Flow From 1D Attention and Correlation | Haofei Xu, Jiaolong Yang, Jianfei Cai, Juyong Zhang, Xin Tong | N/A | |
| RGB-D Saliency Detection via Cascaded Mutual Information Minimization | Jing Zhang, Deng-Ping Fan, Yuchao Dai, Xin Yu, Yiran Zhong, Nick Barnes, Ling Shao | N/A | |
| A Weakly Supervised Amodal Segmenter With Boundary Uncertainty Estimation | Khoi Nguyen, Sinisa Todorovic | N/A | |
| Cross-Camera Convolutional Color Constancy | Mahmoud Afifi, Jonathan T. Barron, Chloe LeGendre, Yun-Ta Tsai, Francois Bleibel | N/A | |
| Kernel Methods in Hyperbolic Spaces | Pengfei Fang, Mehrtash Harandi, Lars Petersson | N/A | |
| Towards Alleviating the Modeling Ambiguity of Unsupervised Monocular 3D Human Pose Estimation | Zhenbo Yu, Bingbing Ni, Jingwei Xu, Junjie Wang, Chenglong Zhao, Wenjun Zhang | N/A | |
| Geometric Deep Neural Network Using Rigid and Non-Rigid Transformations for Human Action Recognition | Rasha Friji, Hassen Drira, Faten Chaieb, Hamza Kchok, Sebastian Kurtek | N/A | |
| Enhanced Boundary Learning for Glass-Like Object Segmentation | Hao He, Xiangtai Li, Guangliang Cheng, Jianping Shi, Yunhai Tong, Gaofeng Meng, Véronique Prinet, LuBin Weng | N/A | |
| Self-Supervised Pretraining of 3D Features on Any Point-Cloud | Zaiwei Zhang, Rohit Girdhar, Armand Joulin, Ishan Misra | N/A | |
| N-ImageNet: Towards Robust, Fine-Grained Object Recognition With Event Cameras | Junho Kim, Jaehyeok Bae, Gangin Park, Dongsu Zhang, Young Min Kim | N/A | |
| Diagonal Attention and Style-Based GAN for Content-Style Disentanglement in Image Generation and Translation | Gihyun Kwon, Jong Chul Ye | N/A | |
| Who's Waldo? Linking People Across Text and Images | Yuqing Cui, Apoorv Khandelwal, Yoav Artzi, Noah Snavely, Hadar Averbuch-Elor | N/A | |
| Switchable K-Class Hyperplanes for Noise-Robust Representation Learning | Boxiao Liu, Guanglu Song, Manyuan Zhang, Haihang You, Yu Liu | N/A | |
| Transformer-Based Attention Networks for Continuous Pixel-Wise Prediction | Guanglei Yang, Hao Tang, Mingli Ding, Nicu Sebe, Elisa Ricci | N/A | |
| BlockPlanner: City Block Generation With Vectorized Graph Representation | Linning Xu, Yuanbo Xiangli, Anyi Rao, Nanxuan Zhao, Bo Dai, Ziwei Liu, Dahua Lin | N/A | |
| PCAM: Product of Cross-Attention Matrices for Rigid Registration of Point Clouds | Anh-Quan Cao, Gilles Puy, Alexandre Boulch, Renaud Marlet | N/A | |
| CCT-Net: Category-Invariant Cross-Domain Transfer for Medical Single-to-Multiple Disease Diagnosis | Yi Zhou, Lei Huang, Tao Zhou, Ling Shao | N/A | |
| FLAR: A Unified Prototype Framework for Few-Sample Lifelong Active Recognition | Lei Fan, Peixi Xiong, Wei Wei, Ying Wu | N/A | |
| VLGrammar: Grounded Grammar Induction of Vision and Language | Yining Hong, Qing Li, Song-Chun Zhu, Siyuan Huang | N/A | |
| Meta-Baseline: Exploring Simple Meta-Learning for Few-Shot Learning | Yinbo Chen, Zhuang Liu, Huijuan Xu, Trevor Darrell, Xiaolong Wang | N/A | |
| CPFN: Cascaded Primitive Fitting Networks for High-Resolution Point Clouds | Eric-Tuan Lê, Minhyuk Sung, Duygu Ceylan, Radomir Mech, Tamy Boubekeur, Niloy J. Mitra | N/A | |
| PARTS: Unsupervised Segmentation With Slots, Attention and Independence Maximization | Daniel Zoran, Rishabh Kabra, Alexander Lerchner, Danilo J. Rezende | N/A | |
| Fine-Grained Semantics-Aware Representation Enhancement for Self-Supervised Monocular Depth Estimation | Hyunyoung Jung, Eunhyeok Park, Sungjoo Yoo | N/A | |
| Learning Signed Distance Field for Multi-View Surface Reconstruction | Jingyang Zhang, Yao Yao, Long Quan | N/A | |
| Pose Correction for Highly Accurate Visual Localization in Large-Scale Indoor Spaces | Janghun Hyeon, Joohyung Kim, Nakju Doh | N/A | |
| A Hybrid Video Anomaly Detection Framework via Memory-Augmented Flow Reconstruction and Flow-Guided Frame Prediction | Zhian Liu, Yongwei Nie, Chengjiang Long, Qing Zhang, Guiqing Li | N/A | |
| PointBA: Towards Backdoor Attacks in 3D Point Cloud | Xinke Li, Zhirui Chen, Yue Zhao, Zekun Tong, Yabang Zhao, Andrew Lim, Joey Tianyi Zhou | N/A | |
| Linguistically Routing Capsule Network for Out-of-Distribution Visual Question Answering | Qingxing Cao, Wentao Wan, Keze Wang, Xiaodan Liang, Liang Lin | N/A | |
| Neural Articulated Radiance Field | Atsuhiro Noguchi, Xiao Sun, Stephen Lin, Tatsuya Harada | N/A | |
| Region Similarity Representation Learning | Tete Xiao, Colorado J Reed, Xiaolong Wang, Kurt Keutzer, Trevor Darrell | N/A | |
| Learning of Visual Relations: The Devil Is in the Tails | Alakh Desai, Tz-Ying Wu, Subarna Tripathi, Nuno Vasconcelos | N/A | |
| T-SVDNet: Exploring High-Order Prototypical Correlations for Multi-Source Domain Adaptation | Ruihuang Li, Xu Jia, Jianzhong He, Shuaijun Chen, Qinghua Hu | N/A | |
| BuildingNet: Learning To Label 3D Buildings | Pratheba Selvaraju, Mohamed Nabail, Marios Loizou, Maria Maslioukova, Melinos Averkiou, Andreas Andreou, Siddhartha Chaudhuri, Evangelos Kalogerakis | N/A | |
| Student Customized Knowledge Distillation: Bridging the Gap Between Student and Teacher | Yichen Zhu, Yi Wang | N/A | |
| A Machine Teaching Framework for Scalable Recognition | Pei Wang, Nuno Vasconcelos | N/A | |
| Divide and Conquer for Single-Frame Temporal Action Localization | Chen Ju, Peisen Zhao, Siheng Chen, Ya Zhang, Yanfeng Wang, Qi Tian | N/A | |
| Real-World Video Super-Resolution: A Benchmark Dataset and a Decomposition Based Learning Scheme | Xi Yang, Wangmeng Xiang, Hui Zeng, Lei Zhang | N/A | |
| Towards High Fidelity Monocular Face Reconstruction With Rich Reflectance Using Self-Supervised Learning and Ray Tracing | Abdallah Dib, Cédric Thébault, Junghyun Ahn, Philippe-Henri Gosselin, Christian Theobalt, Louis Chevallier | N/A | |
| Frequency Domain Image Translation: More Photo-Realistic, Better Identity-Preserving | Mu Cai, Hong Zhang, Huijuan Huang, Qichuan Geng, Yixuan Li, Gao Huang | N/A | |
| ASCNet: Self-Supervised Video Representation Learning With Appearance-Speed Consistency | Deng Huang, Wenhao Wu, Weiwen Hu, Xu Liu, Dongliang He, Zhihua Wu, Xiangmiao Wu, Mingkui Tan, Errui Ding | N/A | |
| Improving Generalization of Batch Whitening by Convolutional Unit Optimization | Yooshin Cho, Hanbyel Cho, Youngsoo Kim, Junmo Kim | N/A | |
| Motion Guided Attention Fusion To Recognize Interactions From Videos | Tae Soo Kim, Jonathan Jones, Gregory D. Hager | N/A | |
| Statistically Consistent Saliency Estimation | Shunyan Luo, Emre Barut, Fang Jin | N/A | |
| SLIDE: Single Image 3D Photography With Soft Layering and Depth-Aware Inpainting | Varun Jampani, Huiwen Chang, Kyle Sargent, Abhishek Kar, Richard Tucker, Michael Krainin, Dominik Kaeser, William T. Freeman, David Salesin, Brian Curless, Ce Liu | N/A | |
| Learning Spatio-Temporal Transformer for Visual Tracking | Bin Yan, Houwen Peng, Jianlong Fu, Dong Wang, Huchuan Lu | N/A | |
| From Contexts to Locality: Ultra-High Resolution Image Segmentation via Locality-Aware Contextual Correlation | Qi Li, Weixiang Yang, Wenxi Liu, Yuanlong Yu, Shengfeng He | N/A | |
| Channel-Wise Knowledge Distillation for Dense Prediction | Changyong Shu, Yifan Liu, Jianfei Gao, Zheng Yan, Chunhua Shen | N/A | |
| Multi-View 3D Reconstruction With Transformers | Dan Wang, Xinrui Cui, Xun Chen, Zhengxia Zou, Tianyang Shi, Septimiu Salcudean, Z. Jane Wang, Rabab Ward | N/A | |
| From General to Specific: Informative Scene Graph Generation via Balance Adjustment | Yuyu Guo, Lianli Gao, Xuanhan Wang, Yuxuan Hu, Xing Xu, Xu Lu, Heng Tao Shen, Jingkuan Song | N/A | |
| Learning Object-Compositional Neural Radiance Field for Editable Scene Rendering | Bangbang Yang · , · Yinda Zhang · , · Yinghao Xu · , · Yijin Li · , · Han Zhou · , · Hujun Bao · , · Guofeng Zhang · , · Zhaopeng Cui | N/A | |
| Practical Relative Order Attack in Deep Ranking | Mo Zhou · , · Le Wang · , · Zhenxing Niu · , · Qilin Zhang · , · Yinghui Xu · , · Nanning Zheng · , · Gang Hua | N/A | |
| Unsupervised Layered Image Decomposition Into Object Prototypes | Tom Monnier · , · Elliot Vincent · , · Jean Ponce · , · Mathieu Aubry | N/A | |
| Manifold Alignment for Semantically Aligned Style Transfer | Jing Huo · , · Shiyin Jin · , · Wenbin Li · , · Jing Wu · , · Yu-Kun Lai · , · Yinghuan Shi · , · Yang Gao | N/A | |
| Defending Against Universal Adversarial Patches by Clipping Feature Norms | Cheng Yu · , · Jiansheng Chen · , · Youze Xue · , · Yuyang Liu · , · Weitao Wan · , · Jiayu Bao · , · Huimin Ma | N/A | |
| Q-Match: Iterative Shape Matching via Quantum Annealing | Marcel Seelbach Benkner, Zorah Lähner, Vladislav Golyanik, Christof Wunderlich, Christian Theobalt, Michael Moeller | N/A | |
| Fast Convergence of DETR With Spatially Modulated Co-Attention | Peng Gao, Minghang Zheng, Xiaogang Wang, Jifeng Dai, Hongsheng Li | N/A | |
| Discovering Human Interactions With Large-Vocabulary Objects via Query and Multi-Scale Detection | Suchen Wang, Kim-Hui Yap, Henghui Ding, Jiyan Wu, Junsong Yuan, Yap-Peng Tan | N/A | |
| T-AutoML: Automated Machine Learning for Lesion Segmentation Using Transformers in 3D Medical Imaging | Dong Yang, Andriy Myronenko, Xiaosong Wang, Ziyue Xu, Holger R. Roth, Daguang Xu | N/A | |
| CM-NAS: Cross-Modality Neural Architecture Search for Visible-Infrared Person Re-Identification | Chaoyou Fu, Yibo Hu, Xiang Wu, Hailin Shi, Tao Mei, Ran He | N/A | |
| Learning To Better Segment Objects From Unseen Classes With Unlabeled Videos | Yuming Du, Yang Xiao, Vincent Lepetit | N/A | |
| SENTRY: Selective Entropy Optimization via Committee Consistency for Unsupervised Domain Adaptation | Viraj Prabhu, Shivam Khare, Deeksha Kartik, Judy Hoffman | N/A | |
| Self-Supervised Domain Adaptation for Forgery Localization of JPEG Compressed Images | Yuan Rao, Jiangqun Ni | N/A | |
| Unsupervised Point Cloud Object Co-Segmentation by Co-Contrastive Learning and Mutual Attention Sampling | Cheng-Kun Yang, Yung-Yu Chuang, Yen-Yu Lin | N/A | |
| Meta Pairwise Relationship Distillation for Unsupervised Person Re-Identification | Haoxuanye Ji, Le Wang, Sanping Zhou, Wei Tang, Nanning Zheng, Gang Hua | N/A | |
| Relational Embedding for Few-Shot Classification | Dahyun Kang, Heeseung Kwon, Juhong Min, Minsu Cho | N/A | |
| Globally Optimal and Efficient Manhattan Frame Estimation by Delimiting Rotation Search Space | Wuwei Ge, Yu Song, Baichao Zhang, Zehua Dong | N/A | |
| Robustness Certification for Point Cloud Models | Tobias Lorenz, Anian Ruoss, Mislav Balunović, Gagandeep Singh, Martin Vechev | N/A | |
| Square Root Marginalization for Sliding-Window Bundle Adjustment | Nikolaus Demmel, David Schubert, Christiane Sommer, Daniel Cremers, Vladyslav Usenko | N/A | |
| OpenForensics: Large-Scale Challenging Dataset for Multi-Face Forgery Detection and Segmentation In-the-Wild | Trung-Nghia Le, Huy H. Nguyen, Junichi Yamagishi, Isao Echizen | N/A | |
| Hierarchical Conditional Flow: A Unified Framework for Image Super-Resolution and Image Rescaling | Jingyun Liang, Andreas Lugmayr, Kai Zhang, Martin Danelljan, Luc Van Gool, Radu Timofte | N/A | |
| Deep Symmetric Network for Underexposed Image Enhancement With Recurrent Attentional Learning | Lin Zhao, Shao-Ping Lu, Tao Chen, Zhenglu Yang, Ariel Shamir | N/A | |
| Syncretic Modality Collaborative Learning for Visible Infrared Person Re-Identification | Ziyu Wei, Xi Yang, Nannan Wang, Xinbo Gao | N/A | |
| Estimating Egocentric 3D Human Pose in Global Space | Jian Wang, Lingjie Liu, Weipeng Xu, Kripasindhu Sarkar, Christian Theobalt | N/A | |
| Re-Aging GAN: Toward Personalized Face Age Transformation | Farkhod Makhmudkhujaev, Sungeun Hong, In Kyu Park | N/A | |
| SeLFVi: Self-Supervised Light-Field Video Reconstruction From Stereo Video | Prasan Shedligeri, Florian Schiffers, Sushobhan Ghosh, Oliver Cossairt, Kaushik Mitra | N/A | |
| Cross-Encoder for Unsupervised Gaze Representation Learning | Yunjia Sun, Jiabei Zeng, Shiguang Shan, Xilin Chen | N/A | |
| Towards Discriminative Representation Learning for Unsupervised Person Re-Identification | Takashi Isobe, Dong Li, Lu Tian, Weihua Chen, Yi Shan, Shengjin Wang | N/A | |
| Event-Intensity Stereo: Estimating Depth by the Best of Both Worlds | Mohammad Mostafavi, Kuk-Jin Yoon, Jonghyun Choi | N/A | |
| When Do GANs Replicate? On the Choice of Dataset Size | Qianli Feng, Chenqi Guo, Fabian Benitez-Quiroz, Aleix M. Martinez | N/A | |
| Just One Moment: Structural Vulnerability of Deep Action Recognition Against One Frame Attack | Jaehui Hwang, Jun-Hyuk Kim, Jun-Ho Choi, Jong-Seok Lee | N/A | |
| Big Self-Supervised Models Advance Medical Image Classification | Shekoofeh Azizi, Basil Mustafa, Fiona Ryan, Zachary Beaver, Jan Freyberg, Jonathan Deaton, Aaron Loh, Alan Karthikesalingam, Simon Kornblith, Ting Chen, Vivek Natarajan, Mohammad Norouzi | N/A | |
| Scene Context-Aware Salient Object Detection | Avishek Siris, Jianbo Jiao, Gary K.L. Tam, Xianghua Xie, Rynson W.H. Lau | N/A | |
| Learning Frequency-Aware Dynamic Network for Efficient Super-Resolution | Wenbin Xie, Dehua Song, Chang Xu, Chunjing Xu, Hui Zhang, Yunhe Wang | N/A | |
| Road Anomaly Detection by Partial Image Reconstruction With Segmentation Coupling | Tomas Vojir, Tomáš Šipka, Rahaf Aljundi, Nikolay Chumerin, Daniel Olmeda Reino, Jiri Matas | N/A | |
| BossNAS: Exploring Hybrid CNN-Transformers With Block-Wisely Self-Supervised Neural Architecture Search | Changlin Li, Tao Tang, Guangrun Wang, Jiefeng Peng, Bing Wang, Xiaodan Liang, Xiaojun Chang | N/A | |
| H2O: Two Hands Manipulating Objects for First Person Interaction Recognition | Taein Kwon, Bugra Tekin, Jan Stühmer, Federica Bogo, Marc Pollefeys | N/A | |
| Residual Attention: A Simple but Effective Method for Multi-Label Recognition | Ke Zhu, Jianxin Wu | N/A | |
| TransferI2I: Transfer Learning for Image-to-Image Translation From Small Datasets | Yaxing Wang, Héctor Laria, Joost van de Weijer, Laura Lopez-Fuentes, Bogdan Raducanu | N/A | |
| On Generating Transferable Targeted Perturbations | Muzammal Naseer, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, Fatih Porikli | N/A | |
| SynFace: Face Recognition With Synthetic Data | Haibo Qiu, Baosheng Yu, Dihong Gong, Zhifeng Li, Wei Liu, Dacheng Tao | N/A | |
| Camera Distortion-Aware 3D Human Pose Estimation in Video With Optimization-Based Meta-Learning | Hanbyel Cho, Yooshin Cho, Jaemyung Yu, Junmo Kim | N/A | |
| KiloNeRF: Speeding Up Neural Radiance Fields With Thousands of Tiny MLPs | Christian Reiser, Songyou Peng, Yiyi Liao, Andreas Geiger | N/A | |
| Do Image Classifiers Generalize Across Time? | Vaishaal Shankar, Achal Dave, Rebecca Roelofs, Deva Ramanan, Benjamin Recht, Ludwig Schmidt | N/A | |
| Refining Action Segmentation With Hierarchical Video Representations | Hyemin Ahn, Dongheui Lee | N/A | |
| Hierarchical Disentangled Representation Learning for Outdoor Illumination Estimation and Editing | Piaopiao Yu, Jie Guo, Fan Huang, Cheng Zhou, Hongwei Che, Xiao Ling, Yanwen Guo | N/A | |
| InSeGAN: A Generative Approach to Segmenting Identical Instances in Depth Images | Anoop Cherian, Gonçalo Dias Pais, Siddarth Jain, Tim K. Marks, Alan Sullivan | N/A | |
| High-Performance Discriminative Tracking With Transformers | Bin Yu, Ming Tang, Linyu Zheng, Guibo Zhu, Jinqiao Wang, Hao Feng, Xuetao Feng, Hanqing Lu | N/A | |
| GraphFPN: Graph Feature Pyramid Network for Object Detection | Gangming Zhao, Weifeng Ge, Yizhou Yu | N/A | |
| Self-Supervised 3D Hand Pose Estimation From Monocular RGB via Contrastive Learning | Adrian Spurr, Aneesh Dahiya, Xi Wang, Xucong Zhang, Otmar Hilliges | N/A | |
| NeuSpike-Net: High Speed Video Reconstruction via Bio-Inspired Neuromorphic Cameras | Lin Zhu, Jianing Li, Xiao Wang, Tiejun Huang, Yonghong Tian | N/A | |
| Admix: Enhancing the Transferability of Adversarial Attacks | Xiaosen Wang, Xuanran He, Jingdong Wang, Kun He | N/A | |
| ACAV100M: Automatic Curation of Large-Scale Datasets for Audio-Visual Video Representation Learning | Sangho Lee, Jiwan Chung, Youngjae Yu, Gunhee Kim, Thomas Breuel, Gal Chechik, Yale Song | N/A | |
| Local Temperature Scaling for Probability Calibration | Zhipeng Ding, Xu Han, Peirong Liu, Marc Niethammer | N/A | |
| RPVNet: A Deep and Efficient Range-Point-Voxel Fusion Network for LiDAR Point Cloud Segmentation | Jianyun Xu, Ruixiang Zhang, Jian Dou, Yushi Zhu, Jie Sun, Shiliang Pu | N/A | |
| WarpedGANSpace: Finding Non-Linear RBF Paths in GAN Latent Space | Christos Tzelepis, Georgios Tzimiropoulos, Ioannis Patras | N/A | |
| CodeNeRF: Disentangled Neural Radiance Fields for Object Categories | Wonbong Jang, Lourdes Agapito | N/A | |
| Infinite Nature: Perpetual View Generation of Natural Scenes From a Single Image | Andrew Liu, Richard Tucker, Varun Jampani, Ameesh Makadia, Noah Snavely, Angjoo Kanazawa | N/A | |
| Generic Attention-Model Explainability for Interpreting Bi-Modal and Encoder-Decoder Transformers | Hila Chefer, Shir Gur, Lior Wolf | N/A | |
| Real-Time Image Enhancer via Learnable Spatial-Aware 3D Lookup Tables | Tao Wang, Yong Li, Jingyang Peng, Yipeng Ma, Xian Wang, Fenglong Song, Youliang Yan | N/A | |
| STAR: A Structure-Aware Lightweight Transformer for Real-Time Image Enhancement | Zhaoyang Zhang, Yitong Jiang, Jun Jiang, Xiaogang Wang, Ping Luo, Jinwei Gu | N/A | |
| Continuous Copy-Paste for One-Stage Multi-Object Tracking and Segmentation | Zhenbo Xu, Ajin Meng, Zhenbo Shi, Wei Yang, Zhi Chen, Liusheng Huang | N/A | |
| Hand-Object Contact Consistency Reasoning for Human Grasps Generation | Hanwen Jiang, Shaowei Liu, Jiashun Wang, Xiaolong Wang | N/A | |
| FashionMirror: Co-Attention Feature-Remapping Virtual Try-On With Sequential Template Poses | Chieh-Yun Chen, Ling Lo, Pin-Jui Huang, Hong-Han Shuai, Wen-Huang Cheng | N/A | |
| Reconcile Prediction Consistency for Balanced Object Detection | Keyang Wang, Lei Zhang | N/A | |
| Confidence Calibration for Domain Generalization Under Covariate Shift | Yunye Gong, Xiao Lin, Yi Yao, Thomas G. Dietterich, Ajay Divakaran, Melinda Gervasio | N/A | |
| Self-Supervised Video Representation Learning With Meta-Contrastive Network | Yuanze Lin, Xun Guo, Yan Lu | N/A | |
| A Confidence-Based Iterative Solver of Depths and Surface Normals for Deep Multi-View Stereo | Wang Zhao, Shaohui Liu, Yi Wei, Hengkai Guo, Yong-Jin Liu | N/A | |
| Unsupervised Depth Completion With Calibrated Backprojection Layers | Alex Wong, Stefano Soatto | N/A | |
| Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval | Max Bain, Arsha Nagrani, Gül Varol, Andrew Zisserman | N/A | |
| LIRA: Learnable, Imperceptible and Robust Backdoor Attacks | Khoa Doan, Yingjie Lao, Weijie Zhao, Ping Li | N/A | |
| DnD: Dense Depth Estimation in Crowded Dynamic Indoor Scenes | Dongki Jung, Jaehoon Choi, Yonghan Lee, Deokhwa Kim, Changick Kim, Dinesh Manocha, Donghwan Lee | N/A | |
| Click To Move: Controlling Video Generation With Sparse Motion | Pierfrancesco Ardino, Marco De Nadai, Bruno Lepri, Elisa Ricci, Stéphane Lathuilière | N/A | |
| Towards Mixed-Precision Quantization of Neural Networks via Constrained Optimization | Weihan Chen, Peisong Wang, Jian Cheng | N/A | |
| Dual-Camera Super-Resolution With Aligned Attention Modules | Tengfei Wang, Jiaxin Xie, Wenxiu Sun, Qiong Yan, Qifeng Chen | N/A | |
| NASOA: Towards Faster Task-Oriented Online Fine-Tuning With a Zoo of Models | Hang Xu, Ning Kang, Gengwei Zhang, Chuanlong Xie, Xiaodan Liang, Zhenguo Li | N/A | |
| RandomRooms: Unsupervised Pre-Training From Synthetic Shapes and Randomized Layouts for 3D Object Detection | Yongming Rao, Benlin Liu, Yi Wei, Jiwen Lu, Cho-Jui Hsieh, Jie Zhou | N/A | |
| From Continuity to Editability: Inverting GANs With Consecutive Images | Yangyang Xu, Yong Du, Wenpeng Xiao, Xuemiao Xu, Shengfeng He | N/A | |
| GyroFlow: Gyroscope-Guided Unsupervised Optical Flow Learning | Haipeng Li, Kunming Luo, Shuaicheng Liu | N/A | |
| Towards Discovery and Attribution of Open-World GAN Generated Images | Sharath Girish, Saksham Suri, Sai Saketh Rambhatla, Abhinav Shrivastava | N/A | |
| Vector Neurons: A General Framework for SO(3)-Equivariant Networks | Congyue Deng, Or Litany, Yueqi Duan, Adrien Poulenard, Andrea Tagliasacchi, Leonidas J. Guibas | N/A | |
| Integer-Arithmetic-Only Certified Robustness for Quantized Neural Networks | Haowen Lin, Jian Lou, Li Xiong, Cyrus Shahabi | N/A | |
| Video-Based Person Re-Identification With Spatial and Temporal Memory Networks | Chanho Eom, Geon Lee, Junghyup Lee, Bumsub Ham | N/A | |
| Conformer: Local Features Coupling Global Representations for Visual Recognition | Zhiliang Peng, Wei Huang, Shanzhi Gu, Lingxi Xie, Yaowei Wang, Jianbin Jiao, Qixiang Ye | N/A | |
| Lightweight Multi-Person Total Motion Capture Using Sparse Multi-View Cameras | Yuxiang Zhang, Zhe Li, Liang An, Mengcheng Li, Tao Yu, Yebin Liu | N/A | |
| AGKD-BML: Defense Against Adversarial Attack by Attention Guided Knowledge Distillation and Bi-Directional Metric Learning | Hong Wang, Yuefan Deng, Shinjae Yoo, Haibin Ling, Yuewei Lin | N/A | |
| Revealing the Reciprocal Relations Between Self-Supervised Stereo and Monocular Depth Estimation | Zhi Chen, Xiaoqing Ye, Wei Yang, Zhenbo Xu, Xiao Tan, Zhikang Zou, Errui Ding, Xinming Zhang, Liusheng Huang | N/A | |
| MGSampler: An Explainable Sampling Strategy for Video Action Recognition | Yuan Zhi, Zhan Tong, Limin Wang, Gangshan Wu | N/A | |
| Robust 2D/3D Vehicle Parsing in Arbitrary Camera Views for CVIS | Hui Miao, Feixiang Lu, Zongdai Liu, Liangjun Zhang, Dinesh Manocha, Bin Zhou | N/A | |
| Recurrent Mask Refinement for Few-Shot Medical Image Segmentation | Hao Tang, Xingwei Liu, Shanlin Sun, Xiangyi Yan, Xiaohui Xie | N/A | |
| ECACL: A Holistic Framework for Semi-Supervised Domain Adaptation | Kai Li, Chang Liu, Handong Zhao, Yulun Zhang, Yun Fu | N/A | |
| WaveFill: A Wavelet-Based Generation Network for Image Inpainting | Yingchen Yu, Fangneng Zhan, Shijian Lu, Jianxiong Pan, Feiying Ma, Xuansong Xie, Chunyan Miao | N/A | |
| Egocentric Pose Estimation From Human Vision Span | Hao Jiang, Vamsi Krishna Ithapu | N/A | |
| Prototypical Matching and Open Set Rejection for Zero-Shot Semantic Segmentation | Hui Zhang, Henghui Ding | N/A | |
| GarmentNets: Category-Level Pose Estimation for Garments via Canonical Space Shape Completion | Cheng Chi, Shuran Song | N/A | |
| Reliably Fast Adversarial Training via Latent Adversarial Perturbation | Geon Yeong Park, Sang Wan Lee | N/A | |
| R-SLAM: Optimizing Eye Tracking From Rolling Shutter Video of the Retina | Jay Shenoy, James Fong, Jeffrey Tan, Austin Roorda, Ren Ng | N/A | |
| Inference of Black Hole Fluid-Dynamics From Sparse Interferometric Measurements | Aviad Levis, Daeyoung Lee, Joel A. Tropp, Charles F. Gammie, Katherine L. Bouman | N/A | |
| Monocular, One-Stage, Regression of Multiple 3D People | Yu Sun, Qian Bao, Wu Liu, Yili Fu, Michael J. Black, Tao Mei | N/A | |
| PIT: Position-Invariant Transform for Cross-FoV Domain Adaptation | Qiqi Gu, Qianyu Zhou, Minghao Xu, Zhengyang Feng, Guangliang Cheng, Xuequan Lu, Jianping Shi, Lizhuang Ma | N/A | |
| Beyond Question-Based Biases: Assessing Multimodal Shortcut Learning in Visual Question Answering | Corentin Dancette, Rémi Cadène, Damien Teney, Matthieu Cord | N/A | |
| H3D-Net: Few-Shot High-Fidelity 3D Head Reconstruction | Eduard Ramon, Gil Triginer, Janna Escur, Albert Pumarola, Jaime Garcia, Xavier Giró-i-Nieto, Francesc Moreno-Noguer | N/A | |
| Image Harmonization With Transformer | Zonghui Guo, Dongsheng Guo, Haiyong Zheng, Zhaorui Gu, Bing Zheng, Junyu Dong | N/A | |
| Video Question Answering Using Language-Guided Deep Compressed-Domain Video Feature | Nayoung Kim, Seong Jong Ha, Je-Won Kang | N/A | |
| Human Pose Regression With Residual Log-Likelihood Estimation | Jiefeng Li, Siyuan Bian, Ailing Zeng, Can Wang, Bo Pang, Wentao Liu, Cewu Lu | N/A | |
| Image2Reverb: Cross-Modal Reverb Impulse Response Synthesis | Nikhil Singh, Jeff Mentch, Jerry Ng, Matthew Beveridge, Iddo Drori | N/A | |
| Boosting Monocular Depth Estimation With Lightweight 3D Point Fusion | Lam Huynh, Phong Nguyen, Jiří Matas, Esa Rahtu, Janne Heikkilä | N/A | |
| Spatio-Temporal Dynamic Inference Network for Group Activity Recognition | Hangjie Yuan, Dong Ni, Mang Wang | N/A | |
| Removing the Bias of Integral Pose Regression | Kerui Gu, Linlin Yang, Angela Yao | N/A | |
| High Quality Disparity Remapping With Two-Stage Warping | Bing Li, Chia-Wen Lin, Cheng Zheng, Shan Liu, Junsong Yuan, Bernard Ghanem, C.-C. Jay Kuo | N/A | |
| TrivialAugment: Tuning-Free Yet State-of-the-Art Data Augmentation | Samuel G. Müller, Frank Hutter | N/A | |
| Learning To Discover Reflection Symmetry via Polar Matching Convolution | Ahyun Seo, Woohyeon Shim, Minsu Cho | N/A | |
| Holistic Pose Graph: Modeling Geometric Structure Among Objects in a Scene Using Graph Inference for 3D Object Prediction | Jiwei Xiao, Ruiping Wang, Xilin Chen | N/A | |
| Crossover Learning for Fast Online Video Instance Segmentation | Shusheng Yang, Yuxin Fang, Xinggang Wang, Yu Li, Chen Fang, Ying Shan, Bin Feng, Wenyu Liu | N/A | |
| 3DIAS: 3D Shape Reconstruction With Implicit Algebraic Surfaces | Mohsen Yavartanoo, Jaeyoung Chung, Reyhaneh Neshatavar, Kyoung Mu Lee | N/A | |
| DeFRCN: Decoupled Faster R-CNN for Few-Shot Object Detection | Limeng Qiao, Yuxuan Zhao, Zhiyuan Li, Xi Qiu, Jianan Wu, Chi Zhang | N/A | |
| Multiview Pseudo-Labeling for Semi-Supervised Learning From Video | Bo Xiong, Haoqi Fan, Kristen Grauman, Christoph Feichtenhofer | N/A | |
| SketchLattice: Latticed Representation for Sketch Manipulation | Yonggang Qi, Guoyao Su, Pinaki Nath Chowdhury, Mingkang Li, Yi-Zhe Song | N/A | |
| Aligning Latent and Image Spaces To Connect the Unconnectable | Ivan Skorokhodov, Grigorii Sotnikov, Mohamed Elhoseiny | N/A | |
| End-to-End Trainable Trident Person Search Network Using Adaptive Gradient Propagation | Byeong-Ju Han, Kuhyeun Ko, Jae-Young Sim | N/A | |
| HandFoldingNet: A 3D Hand Pose Estimation Network Using Multiscale-Feature Guided Folding of a 2D Hand Skeleton | Wencan Cheng, Jae Hyun Park, Jong Hwan Ko | N/A | |
| Learning Deep Local Features With Multiple Dynamic Attentions for Large-Scale Image Retrieval | Hui Wu, Min Wang, Wengang Zhou, Houqiang Li | N/A | |
| Polarimetric Helmholtz Stereopsis | Yuqi Ding, Yu Ji, Mingyuan Zhou, Sing Bing Kang, Jinwei Ye | N/A | |
| Motion Prediction Using Trajectory Cues | Zhenguang Liu, Pengxiang Su, Shuang Wu, Xuanjing Shen, Haipeng Chen, Yanbin Hao, Meng Wang | N/A | |
| Generalized Source-Free Domain Adaptation | Shiqi Yang, Yaxing Wang, Joost van de Weijer, Luis Herranz, Shangling Jui | N/A | |
| DisUnknown: Distilling Unknown Factors for Disentanglement Learning | Sitao Xiang, Yuming Gu, Pengda Xiang, Menglei Chai, Hao Li, Yajie Zhao, Mingming He | N/A | |
| Self-Mutating Network for Domain Adaptive Segmentation in Aerial Images | Kyungsu Lee, Haeyun Lee, Jae Youn Hwang | N/A | |
| 3D Building Reconstruction From Monocular Remote Sensing Images | Weijia Li, Lingxuan Meng, Jinwang Wang, Conghui He, Gui-Song Xia, Dahua Lin | N/A | |
| Multi-Scale Separable Network for Ultra-High-Definition Video Deblurring | Senyou Deng, Wenqi Ren, Yanyang Yan, Tao Wang, Fenglong Song, Xiaochun Cao | N/A | |
| Baking Neural Radiance Fields for Real-Time View Synthesis | Peter Hedman, Pratul P. Srinivasan, Ben Mildenhall, Jonathan T. Barron, Paul Debevec | N/A | |
| Parallel Rectangle Flip Attack: A Query-Based Black-Box Attack Against Object Detection | Siyuan Liang, Baoyuan Wu, Yanbo Fan, Xingxing Wei, Xiaochun Cao | N/A | |
| ALADIN: All Layer Adaptive Instance Normalization for Fine-Grained Style Similarity | Dan Ruta, Saeid Motiian, Baldo Faieta, Zhe Lin, Hailin Jin, Alex Filipkowski, Andrew Gilbert, John Collomosse | N/A | |
| Visio-Temporal Attention for Multi-Camera Multi-Target Association | Yu-Jhe Li, Xinshuo Weng, Yan Xu, Kris M. Kitani | N/A | |
| A Light Stage on Every Desk | Soumyadip Sengupta, Brian Curless, Ira Kemelmacher-Shlizerman, Steven M. Seitz | N/A | |
| Multi-Level Curriculum for Training a Distortion-Aware Barrel Distortion Rectification Model | Kang Liao, Chunyu Lin, Lixin Liao, Yao Zhao, Weiyao Lin | N/A | |
| DepthInSpace: Exploitation and Fusion of Multiple Video Frames for Structured-Light Depth Estimation | Mohammad Mahdi Johari, Camilla Carta, François Fleuret | N/A | |
| GeomNet: A Neural Network Based on Riemannian Geometries of SPD Matrix Space and Cholesky Space for 3D Skeleton-Based Interaction Recognition | Xuan Son Nguyen | N/A | |
| Learning Dynamic Interpolation for Extremely Sparse Light Fields With Wide Baselines | Mantang Guo, Jing Jin, Hui Liu, Junhui Hou | N/A | |
| Ultra-High-Definition Image HDR Reconstruction via Collaborative Bilateral Learning | Zhuoran Zheng, Wenqi Ren, Xiaochun Cao, Tao Wang, Xiuyi Jia | N/A | |
| Visual Relationship Detection Using Part-and-Sum Transformers With Composite Queries | Qi Dong, Zhuowen Tu, Haofu Liao, Yuting Zhang, Vijay Mahadevan, Stefano Soatto | N/A | |
| SLAMP: Stochastic Latent Appearance and Motion Prediction | Adil Kaan Akan, Erkut Erdem, Aykut Erdem, Fatma Güney | N/A | |
| Learning To Diversify for Single Domain Generalization | Zijian Wang, Yadan Luo, Ruihong Qiu, Zi Huang, Mahsa Baktashmotlagh | N/A | |
| CPF: Learning a Contact Potential Field To Model the Hand-Object Interaction | Lixin Yang, Xinyu Zhan, Kailin Li, Wenqiang Xu, Jiefeng Li, Cewu Lu | N/A | |
| Sensor-Guided Optical Flow | Matteo Poggi, Filippo Aleotti, Stefano Mattoccia | N/A | |
| Wasserstein Coupled Graph Learning for Cross-Modal Retrieval | Yun Wang, Tong Zhang, Xueya Zhang, Zhen Cui, Yuge Huang, Pengcheng Shen, Shaoxin Li, Jian Yang | N/A | |
| ADNet: Leveraging Error-Bias Towards Normal Direction in Face Alignment | Yangyu Huang, Hao Yang, Chong Li, Jongyoo Kim, Fangyun Wei | N/A | |
| Pixel-Perfect Structure-From-Motion With Featuremetric Refinement | Philipp Lindenberger, Paul-Edouard Sarlin, Viktor Larsson, Marc Pollefeys | N/A | |
| BiaSwap: Removing Dataset Bias With Bias-Tailored Swapping Augmentation | Eungyeup Kim, Jihyeon Lee, Jaegul Choo | N/A | |
| GistNet: A Geometric Structure Transfer Network for Long-Tailed Recognition | Bo Liu, Haoxiang Li, Hao Kang, Gang Hua, Nuno Vasconcelos | N/A | |
| Distance-Aware Quantization | Dohyung Kim, Junghyup Lee, Bumsub Ham | N/A | |
| Shape-Biased Domain Generalization via Shock Graph Embeddings | Maruthi Narayanan, Vickram Rajendran, Benjamin Kimia | N/A | |
| PixelSynth: Generating a 3D-Consistent Experience From a Single Image | Chris Rockwell, David F. Fouhey, Justin Johnson | N/A | |
| Non-Rigid Neural Radiance Fields: Reconstruction and Novel View Synthesis of a Dynamic Scene From Monocular Video | Edgar Tretschk, Ayush Tewari, Vladislav Golyanik, Michael Zollhöfer, Christoph Lassner, Christian Theobalt | N/A | |
| Learning To Cut by Watching Movies | Alejandro Pardo, Fabian Caba, Juan Léon Alcázar, Ali K. Thabet, Bernard Ghanem | N/A | |
| Sketch2Mesh: Reconstructing and Editing 3D Shapes From Sketches | Benoit Guillard, Edoardo Remelli, Pierre Yvernay, Pascal Fua | N/A | |
| Generic Event Boundary Detection: A Benchmark for Event Segmentation | Mike Zheng Shou, Stan Weixian Lei, Weiyao Wang, Deepti Ghadiyaram, Matt Feiszli | N/A | |
| How Shift Equivariance Impacts Metric Learning for Instance Segmentation | Josef Lorenz Rumberger, Xiaoyan Yu, Peter Hirsch, Melanie Dohmen, Vanessa Emanuela Guarino, Ashkan Mokarian, Lisa Mais, Jan Funke, Dagmar Kainmüller | N/A | |
| Calibrated Adversarial Refinement for Stochastic Semantic Segmentation | Elias Kassapis, Georgi Dikov, Deepak K. Gupta, Cedric Nugteren | N/A | |
| Self-Supervised Visual Representations Learning by Contrastive Mask Prediction | Yucheng Zhao, Guangting Wang, Chong Luo, Wenjun Zeng, Zheng-Jun Zha | N/A | |
| Personalized Image Semantic Segmentation | Yu Zhang, Chang-Bin Zhang, Peng-Tao Jiang, Ming-Ming Cheng, Feng Mao | N/A | |
| Fooling LiDAR Perception via Adversarial Trajectory Perturbation | Yiming Li, Congcong Wen, Felix Juefei-Xu, Chen Feng | N/A | |
| Extreme Structure From Motion for Indoor Panoramas Without Visual Overlaps | Mohammad Amin Shabani, Weilian Song, Makoto Odamaki, Hirochika Fujiki, Yasutaka Furukawa | N/A | |
| Dynamic Context-Sensitive Filtering Network for Video Salient Object Detection | Miao Zhang, Jie Liu, Yifei Wang, Yongri Piao, Shunyu Yao, Wei Ji, Jingjing Li, Huchuan Lu, Zhongxuan Luo | N/A | |
| Boosting the Generalization Capability in Cross-Domain Few-Shot Learning via Noise-Enhanced Supervised Autoencoder | Hanwen Liang, Qiong Zhang, Peng Dai, Juwei Lu | N/A | |
| Fake It Till You Make It: Face Analysis in the Wild Using Synthetic Data Alone | Erroll Wood, Tadas Baltrušaitis, Charlie Hewitt, Sebastian Dziadzio, Thomas J. Cashman, Jamie Shotton | N/A | |
| StereOBJ-1M: Large-Scale Stereo Image Dataset for 6D Object Pose Estimation | Xingyu Liu, Shun Iwase, Kris M. Kitani | N/A | |
| Predictive Feature Learning for Future Segmentation Prediction | Zihang Lin, Jiangxin Sun, Jian-Fang Hu, Qizhi Yu, Jian-Huang Lai, Wei-Shi Zheng | N/A | |
| PIAP-DF: Pixel-Interested and Anti Person-Specific Facial Action Unit Detection Net With Discrete Feedback Learning | Yang Tang, Wangding Zeng, Dafei Zhao, Honggang Zhang | N/A | |
| NPMs: Neural Parametric Models for 3D Deformable Shapes | Pablo Palafox, Aljaž Božič, Justus Thies, Matthias Nießner, Angela Dai | N/A | |
| Semantic Aware Data Augmentation for Cell Nuclei Microscopical Images With Artificial Neural Networks | Alireza Naghizadeh, Hongye Xu, Mohab Mohamed, Dimitris N. Metaxas, Dongfang Liu | N/A | |
| NerfingMVS: Guided Optimization of Neural Radiance Fields for Indoor Multi-View Stereo | Yi Wei, Shaohui Liu, Yongming Rao, Wang Zhao, Jiwen Lu, Jie Zhou | N/A | |
| When Pigs Fly: Contextual Reasoning in Synthetic and Natural Scenes | Philipp Bomatter, Mengmi Zhang, Dimitar Karev, Spandan Madan, Claire Tseng, Gabriel Kreiman | N/A | |
| You Don't Only Look Once: Constructing Spatial-Temporal Memory for Integrated 3D Object Detection and Tracking | Jiaming Sun, Yiming Xie, Siyu Zhang, Linghao Chen, Guofeng Zhang, Hujun Bao, Xiaowei Zhou | N/A | |
| Learning With Memory-Based Virtual Classes for Deep Metric Learning | Byungsoo Ko, Geonmo Gu, Han-Gyu Kim | N/A | |
| Excavating the Potential Capacity of Self-Supervised Monocular Depth Estimation | Rui Peng, Ronggang Wang, Yawen Lai, Luyang Tang, Yangang Cai | N/A | |
| SPatchGAN: A Statistical Feature Based Discriminator for Unsupervised Image-to-Image Translation | Xuning Shao, Weidong Zhang | N/A | |
| Sub-Bit Neural Networks: Learning To Compress and Accelerate Binary Neural Networks | Yikai Wang, Yi Yang, Fuchun Sun, Anbang Yao | N/A | |
| Interacting Two-Hand 3D Pose and Shape Reconstruction From Single Color Image | Baowen Zhang, Yangang Wang, Xiaoming Deng, Yinda Zhang, Ping Tan, Cuixia Ma, Hongan Wang | N/A | |
| CODEs: Chamfer Out-of-Distribution Examples Against Overconfidence Issue | Keke Tang, Dingruibo Miao, Weilong Peng, Jianpeng Wu, Yawen Shi, Zhaoquan Gu, Zhihong Tian, Wenping Wang | N/A | |
| Lifelong Infinite Mixture Model Based on Knowledge-Driven Dirichlet Process | Fei Ye, Adrian G. Bors | N/A | |
| LayoutTransformer: Layout Generation and Completion With Self-Attention | Kamal Gupta, Justin Lazarow, Alessandro Achille, Larry S. Davis, Vijay Mahadevan, Abhinav Shrivastava | N/A | |
| The Power of Points for Modeling Humans in Clothing | Qianli Ma, Jinlong Yang, Siyu Tang, Michael J. Black | N/A | |
| Physics-Enhanced Machine Learning for Virtual Fluorescence Microscopy | Colin L. Cooke, Fanjie Kong, Amey Chaware, Kevin C. Zhou, Kanghyun Kim, Rong Xu, D. Michael Ando, Samuel J. Yang, Pavan Chandra Konda, Roarke Horstmeyer | N/A | |
| Dynamic Attentive Graph Learning for Image Restoration | Chong Mou, Jian Zhang, Zhuoyuan Wu | N/A | |
| Adversarial Unsupervised Domain Adaptation With Conditional and Label Shift: Infer, Align and Iterate | Xiaofeng Liu, Zhenhua Guo, Site Li, Fangxu Xing, Jane You, C.-C. Jay Kuo, Georges El Fakhri, Jonghye Woo | N/A | |
| Exploring Geometry-Aware Contrast and Clustering Harmonization for Self-Supervised 3D Object Detection | Hanxue Liang · , · Chenhan Jiang · , · Dapeng Feng · , · Xin Chen · , · Hang Xu · , · Xiaodan Liang · , · Wei Zhang · , · Zhenguo Li · , · Luc Van Gool | N/A | |
| Learning Meta-Class Memory for Few-Shot Semantic Segmentation | Zhonghua Wu · , · Xiangxi Shi · , · Guosheng Lin · , · Jianfei Cai | N/A | |
| Semi-Supervised Active Learning With Temporal Output Discrepancy | Siyu Huang · , · Tianyang Wang · , · Haoyi Xiong · , · Jun Huan · , · Dejing Dou | N/A | |
| Learning Cross-Modal Contrastive Features for Video Domain Adaptation | Donghyun Kim · , · Yi-Hsuan Tsai · , · Bingbing Zhuang · , · Xiang Yu · , · Stan Sclaroff · , · Kate Saenko · , · Manmohan Chandraker | N/A | |
| Energy-Based Open-World Uncertainty Modeling for Confidence Calibration | Yezhen Wang · , · Bo Li · , · Tong Che · , · Kaiyang Zhou · , · Ziwei Liu · , · Dongsheng Li | N/A | |
| Sat2Vid: Street-View Panoramic Video Synthesis From a Single Satellite Image | Zuoyue Li · , · Zhenqiang Li · , · Zhaopeng Cui · , · Rongjun Qin · , · Marc Pollefeys · , · Martin R. Oswald | N/A | |
| NAS-OoD: Neural Architecture Search for Out-of-Distribution Generalization | Haoyue Bai · , · Fengwei Zhou · , · Lanqing Hong · , · Nanyang Ye · , · S.-H. Gary Chan · , · Zhenguo Li | N/A | |
| Hierarchical Aggregation for 3D Instance Segmentation | Shaoyu Chen · , · Jiemin Fang · , · Qian Zhang · , · Wenyu Liu · , · Xinggang Wang | N/A | |
| Large-Scale Robust Deep AUC Maximization: A New Surrogate Loss and Empirical Studies on Medical Image Classification | Zhuoning Yuan · , · Yan Yan · , · Milan Sonka · , · Tianbao Yang | N/A | |
| A Simple Baseline for Semi-Supervised Semantic Segmentation With Strong Data Augmentation | Jianlong Yuan · , · Yifan Liu · , · Chunhua Shen · , · Zhibin Wang · , · Hao Li | N/A | |
| Ground-Truth or DAER: Selective Re-Query of Secondary Information | Stephan J. Lemmer · , · Jason J. Corso | N/A | |
| Evidential Deep Learning for Open Set Action Recognition | Wentao Bao, Qi Yu, Yu Kong | N/A | |
| Perception-Aware Multi-Sensor Fusion for 3D LiDAR Semantic Segmentation | Zhuangwei Zhuang, Rong Li, Kui Jia, Qicheng Wang, Yuanqing Li, Mingkui Tan | N/A | |
| UVStyle-Net: Unsupervised Few-Shot Learning of 3D Style Similarity Measure for B-Reps | Peter Meltzer, Hooman Shayani, Amir Khasahmadi, Pradeep Kumar Jayaraman, Aditya Sanghi, Joseph Lambourne | N/A | |
| End-to-End Dense Video Captioning With Parallel Decoding | Teng Wang, Ruimao Zhang, Zhichao Lu, Feng Zheng, Ran Cheng, Ping Luo | N/A | |
| StarEnhancer: Learning Real-Time and Style-Aware Image Enhancement | Yuda Song, Hui Qian, Xin Du | N/A | |
| Can Shape Structure Features Improve Model Robustness Under Diverse Adversarial Settings? | Mingjie Sun, Zichao Li, Chaowei Xiao, Haonan Qiu, Bhavya Kailkhura, Mingyan Liu, Bo Li | N/A | |
| Multi-Class Cell Detection Using Spatial Context Representation | Shahira Abousamra, David Belinsky, John Van Arnam, Felicia Allard, Eric Yee, Rajarsi Gupta, Tahsin Kurc, Dimitris Samaras, Joel Saltz, Chao Chen | N/A | |
| Learning by Aligning: Visible-Infrared Person Re-Identification Using Cross-Modal Correspondences | Hyunjong Park, Sanghoon Lee, Junghyup Lee, Bumsub Ham | N/A | |
| Localize to Binauralize: Audio Spatialization From Visual Sound Source Localization | Kranthi Kumar Rachavarapu, Aakanksha, Vignesh Sundaresha, A. N. Rajagopalan | N/A | |
| ALL Snow Removed: Single Image Desnowing Algorithm Using Hierarchical Dual-Tree Complex Wavelet Representation and Contradict Channel Loss | Wei-Ting Chen, Hao-Yu Fang, Cheng-Lin Hsieh, Cheng-Che Tsai, I-Hsiang Chen, Jian-Jiun Ding, Sy-Yen Kuo | N/A | |
| MINE: Towards Continuous Depth MPI With NeRF for Novel View Synthesis | Jiaxin Li, Zijian Feng, Qi She, Henghui Ding, Changhu Wang, Gim Hee Lee | N/A | |
| LoFGAN: Fusing Local Representations for Few-Shot Image Generation | Zheng Gu, Wenbin Li, Jing Huo, Lei Wang, Yang Gao | N/A | |
| Grafit: Learning Fine-Grained Image Representations With Coarse Labels | Hugo Touvron, Alexandre Sablayrolles, Matthijs Douze, Matthieu Cord, Hervé Jégou | N/A | |
| Rethinking the Truly Unsupervised Image-to-Image Translation | Kyungjune Baek, Yunjey Choi, Youngjung Uh, Jaejun Yoo, Hyunjung Shim | N/A | |
| Point-Based Modeling of Human Clothing | Ilya Zakharkin, Kirill Mazur, Artur Grigorev, Victor Lempitsky | N/A | |
| Equivariant Imaging: Learning Beyond the Range Space | Dongdong Chen, Julián Tachella, Mike E. Davies | N/A | |
| Mitigating Intensity Bias in Shadow Detection via Feature Decomposition and Reweighting | Lei Zhu, Ke Xu, Zhanghan Ke, Rynson W.H. Lau | N/A | |
| Joint Representation Learning and Novel Category Discovery on Single- and Multi-Modal Data | Xuhui Jia, Kai Han, Yukun Zhu, Bradley Green | N/A | |
| Sparse Needlets for Lighting Estimation With Spherical Transport Loss | Fangneng Zhan, Changgong Zhang, Wenbo Hu, Shijian Lu, Feiying Ma, Xuansong Xie, Ling Shao | N/A | |
| CANet: A Context-Aware Network for Shadow Removal | Zipei Chen, Chengjiang Long, Ling Zhang, Chunxia Xiao | N/A | |
| Semantic Perturbations With Normalizing Flows for Improved Generalization | Oguz Kaan Yüksel, Sebastian U. Stich, Martin Jaggi, Tatjana Chavdarova | N/A | |
| Audio-Visual Floorplan Reconstruction | Senthil Purushwalkam, Sebastià Vicenc Amengual Garí, Vamsi Krishna Ithapu, Carl Schissler, Philip Robinson, Abhinav Gupta, Kristen Grauman | N/A | |
| MonoIndoor: Towards Good Practice of Self-Supervised Monocular Depth Estimation for Indoor Environments | Pan Ji, Runze Li, Bir Bhanu, Yi Xu | N/A | |
| Common Objects in 3D: Large-Scale Learning and Evaluation of Real-Life 3D Category Reconstruction | Jeremy Reizenstein, Roman Shapovalov, Philipp Henzler, Luca Sbordone, Patrick Labatut, David Novotny | N/A | |
| Reconstructing Hand-Object Interactions in the Wild | Zhe Cao, Ilija Radosavovic, Angjoo Kanazawa, Jitendra Malik | N/A | |
| TOOD: Task-Aligned One-Stage Object Detection | Chengjian Feng, Yujie Zhong, Yu Gao, Matthew R. Scott, Weilin Huang | N/A | |
| Generalizable Mixed-Precision Quantization via Attribution Rank Preservation | Ziwei Wang, Han Xiao, Jiwen Lu, Jie Zhou | N/A | |
| LabOR: Labeling Only if Required for Domain Adaptive Semantic Segmentation | Inkyu Shin, Dong-Jin Kim, Jae Won Cho, Sanghyun Woo, Kwanyong Park, In So Kweon | N/A | |
| SPEC: Seeing People in the Wild With an Estimated Camera | Muhammed Kocabas, Chun-Hao P. Huang, Joachim Tesch, Lea Müller, Otmar Hilliges, Michael J. Black | N/A | |
| Binocular Mutual Learning for Improving Few-Shot Classification | Ziqi Zhou, Xi Qiu, Jiangtao Xie, Jianan Wu, Chi Zhang | N/A | |
| Distilling Holistic Knowledge With Graph Neural Networks | Sheng Zhou, Yucheng Wang, Defang Chen, Jiawei Chen, Xin Wang, Can Wang, Jiajun Bu | N/A | |
| Towards Robustness of Deep Neural Networks via Regularization | Yao Li, Martin Renqiang Min, Thomas Lee, Wenchao Yu, Erik Kruus, Wei Wang, Cho-Jui Hsieh | N/A | |
| STEM: An Approach to Multi-Source Domain Adaptation With Guarantees | Van-Anh Nguyen, Tuan Nguyen, Trung Le, Quan Hung Tran, Dinh Phung | N/A | |
| Divide and Contrast: Self-Supervised Learning From Uncurated Data | Yonglong Tian, Olivier J. Hénaff, Aäron van den Oord | N/A | |
| Parallel Detection-and-Segmentation Learning for Weakly Supervised Instance Segmentation | Yunhang Shen, Liujuan Cao, Zhiwei Chen, Baochang Zhang, Chi Su, Yongjian Wu, Feiyue Huang, Rongrong Ji | N/A | |
| IntraTomo: Self-Supervised Learning-Based Tomography via Sinogram Synthesis and Prediction | Guangming Zang, Ramzi Idoughi, Rui Li, Peter Wonka, Wolfgang Heidrich | N/A | |
| Towards Real-World X-Ray Security Inspection: A High-Quality Benchmark and Lateral Inhibition Module for Prohibited Items Detection | Renshuai Tao, Yanlu Wei, Xiangjian Jiang, Hainan Li, Haotong Qin, Jiakai Wang, Yuqing Ma, Libo Zhang, Xianglong Liu | N/A | |
| Differentiable Surface Rendering via Non-Differentiable Sampling | Forrester Cole, Kyle Genova, Avneesh Sud, Daniel Vlasic, Zhoutong Zhang | N/A | |
| Distillation-Guided Image Inpainting | Maitreya Suin, Kuldeep Purohit, A. N. Rajagopalan | N/A | |
| Real-Time Instance Segmentation With Discriminative Orientation Maps | Wentao Du, Zhiyu Xiang, Shuya Chen, Chengyu Qiao, Yiman Chen, Tingming Bai | N/A | |
| Segmenter: Transformer for Semantic Segmentation | Robin Strudel, Ricardo Garcia, Ivan Laptev, Cordelia Schmid | N/A | |
| IDARTS: Interactive Differentiable Architecture Search | Song Xue, Runqi Wang, Baochang Zhang, Tian Wang, Guodong Guo, David Doermann | N/A | |
| AutoSpace: Neural Architecture Search With Less Human Interference | Daquan Zhou, Xiaojie Jin, Xiaochen Lian, Linjie Yang, Yujing Xue, Qibin Hou, Jiashi Feng | N/A | |
| Evolving Search Space for Neural Architecture Search | Yuanzheng Ci, Chen Lin, Ming Sun, Boyu Chen, Hongwen Zhang, Wanli Ouyang | N/A | |
| THDA: Treasure Hunt Data Augmentation for Semantic Navigation | Oleksandr Maksymets, Vincent Cartillier, Aaron Gokaslan, Erik Wijmans, Wojciech Galuba, Stefan Lee, Dhruv Batra | N/A | |
| Tripartite Information Mining and Integration for Image Matting | Yuhao Liu, Jiake Xie, Xiao Shi, Yu Qiao, Yujie Huang, Yong Tang, Xin Yang | N/A | |
| Stochastic Partial Swap: Enhanced Model Generalization and Interpretability for Fine-Grained Recognition | Shaoli Huang, Xinchao Wang, Dacheng Tao | N/A | |
| BEV-Net: Assessing Social Distancing Compliance by Joint People Localization and Geometric Reasoning | Zhirui Dai, Yuepeng Jiang, Yi Li, Bo Liu, Antoni B. Chan, Nuno Vasconcelos | N/A | |
| Pyramid Architecture Search for Real-Time Image Deblurring | Xiaobin Hu, Wenqi Ren, Kaicheng Yu, Kaihao Zhang, Xiaochun Cao, Wei Liu, Bjoern Menze | N/A | |
| TransForensics: Image Forgery Localization With Dense Self-Attention | Jing Hao, Zhixin Zhang, Shicai Yang, Di Xie, Shiliang Pu | N/A | |
| Joint Audio-Visual Deepfake Detection | Yipin Zhou, Ser-Nam Lim | N/A | |
| Objects As Cameras: Estimating High-Frequency Illumination From Shadows | Tristan Swedish, Connor Henley, Ramesh Raskar | N/A | |
| Time-Equivariant Contrastive Video Representation Learning | Simon Jenni, Hailin Jin | N/A | |
| Dynamical Pose Estimation | Heng Yang, Chris Doran, Jean-Jacques Slotine | N/A | |
| Graph-Based 3D Multi-Person Pose Estimation Using Multi-View Images | Size Wu, Sheng Jin, Wentao Liu, Lei Bai, Chen Qian, Dong Liu, Wanli Ouyang | N/A | |
| Learning Fast Sample Re-Weighting Without Reward Data | Zizhao Zhang, Tomas Pfister | N/A | |
| Multi-Anchor Active Domain Adaptation for Semantic Segmentation | Munan Ning, Donghuan Lu, Dong Wei, Cheng Bian, Chenglang Yuan, Shuang Yu, Kai Ma, Yefeng Zheng | N/A | |
| C3-SemiSeg: Contrastive Semi-Supervised Segmentation via Cross-Set Learning and Dynamic Class-Balancing | Yanning Zhou, Hang Xu, Wei Zhang, Bin Gao, Pheng-Ann Heng | N/A | |
| PyMAF: 3D Human Pose and Shape Regression With Pyramidal Mesh Alignment Feedback Loop | Hongwen Zhang, Yating Tian, Xinchi Zhou, Wanli Ouyang, Yebin Liu, Limin Wang, Zhenan Sun | N/A | |
| COOKIE: Contrastive Cross-Modal Knowledge Sharing Pre-Training for Vision-Language Representation | Keyu Wen, Jin Xia, Yuanyuan Huang, Linyang Li, Jiayan Xu, Jie Shao | N/A | |
| KoDF: A Large-Scale Korean DeepFake Detection Dataset | Patrick Kwon, Jaeseong You, Gyuhyeon Nam, Sungwoo Park, Gyeongsu Chae | N/A | |
| Radial Distortion Invariant Factorization for Structure From Motion | José Pedro Iglesias, Carl Olsson | N/A | |
| LaLaLoc: Latent Layout Localisation in Dynamic, Unvisited Environments | Henry Howard-Jenkins, Jose-Raul Ruiz-Sarmiento, Victor Adrian Prisacariu | N/A | |
| Learning Privacy-Preserving Optics for Human Pose Estimation | Carlos Hinojosa, Juan Carlos Niebles, Henry Arguello | N/A | |
| EPP-MVSNet: Epipolar-Assembling Based Depth Prediction for Multi-View Stereo | Xinjun Ma, Yue Gong, Qirui Wang, Jingwei Huang, Lei Chen, Fan Yu | N/A | |
| Full-Velocity Radar Returns by Radar-Camera Fusion | Yunfei Long, Daniel Morris, Xiaoming Liu, Marcos Castro, Punarjay Chakravarty, Praveen Narayanan | N/A | |
| Toward Realistic Single-View 3D Object Reconstruction With Unsupervised Learning From Multiple Images | Long-Nhat Ho, Anh Tuan Tran, Quynh Phung, Minh Hoai | N/A | |
| MVSNeRF: Fast Generalizable Radiance Field Reconstruction From Multi-View Stereo | Anpei Chen, Zexiang Xu, Fuqiang Zhao, Xiaoshuai Zhang, Fanbo Xiang, Jingyi Yu, Hao Su | N/A | |
| Transforms Based Tensor Robust PCA: Corrupted Low-Rank Tensors Recovery via Convex Optimization | Canyi Lu | N/A | |
| Mip-NeRF: A Multiscale Representation for Anti-Aliasing Neural Radiance Fields | Jonathan T. Barron, Ben Mildenhall, Matthew Tancik, Peter Hedman, Ricardo Martin-Brualla, Pratul P. Srinivasan | N/A | |
| Uniformity in Heterogeneity: Diving Deep Into Count Interval Partition for Crowd Counting | Changan Wang, Qingyu Song, Boshen Zhang, Yabiao Wang, Ying Tai, Xuyi Hu, Chengjie Wang, Jilin Li, Jiayi Ma, Yang Wu | N/A | |
| HDR Video Reconstruction: A Coarse-To-Fine Network and a Real-World Benchmark Dataset | Guanying Chen, Chaofeng Chen, Shi Guo, Zhetong Liang, Kwan-Yee K. Wong, Lei Zhang | N/A | |
| Self Supervision to Distillation for Long-Tailed Visual Recognition | Tianhao Li, Limin Wang, Gangshan Wu | N/A | |
| Learning To Track With Object Permanence | Pavel Tokmakov, Jie Li, Wolfram Burgard, Adrien Gaidon | N/A | |
| Uncertainty-Guided Transformer Reasoning for Camouflaged Object Detection | Fan Yang, Qiang Zhai, Xin Li, Rui Huang, Ao Luo, Hong Cheng, Deng-Ping Fan | N/A | |
| Deep Co-Training With Task Decomposition for Semi-Supervised Domain Adaptation | Luyu Yang, Yan Wang, Mingfei Gao, Abhinav Shrivastava, Kilian Q. Weinberger, Wei-Lun Chao, Ser-Nam Lim | N/A | |
| Dual Projection Generative Adversarial Networks for Conditional Image Generation | Ligong Han, Martin Renqiang Min, Anastasis Stathopoulos, Yu Tian, Ruijiang Gao, Asim Kadav, Dimitris N. Metaxas | N/A | |
| EventHPE: Event-Based 3D Human Pose and Shape Estimation | Shihao Zou, Chuan Guo, Xinxin Zuo, Sen Wang, Pengyu Wang, Xiaoqin Hu, Shoushun Chen, Minglun Gong, Li Cheng | N/A | |
| Synchronization of Group-Labelled Multi-Graphs | Andrea Porfiri Dal Cin, Luca Magri, Federica Arrigoni, Andrea Fusiello, Giacomo Boracchi | N/A | |
| UNISURF: Unifying Neural Implicit Surfaces and Radiance Fields for Multi-View Reconstruction | Michael Oechsle, Songyou Peng, Andreas Geiger | N/A | |
| CTRL-C: Camera Calibration TRansformer With Line-Classification | Jinwoo Lee, Hyunsung Go, Hyunjoon Lee, Sunghyun Cho, Minhyuk Sung, Junho Kim | N/A | |
| Parsing Table Structures in the Wild | Rujiao Long, Wen Wang, Nan Xue, Feiyu Gao, Zhibo Yang, Yongpan Wang, Gui-Song Xia | N/A | |
| Spatio-Temporal Representation Factorization for Video-Based Person Re-Identification | Abhishek Aich, Meng Zheng, Srikrishna Karanam, Terrence Chen, Amit K. Roy-Chowdhury, Ziyan Wu | N/A | |
| CondLaneNet: A Top-To-Down Lane Detection Framework Based on Conditional Convolution | Lizhe Liu, Xiaohao Chen, Siyu Zhu, Ping Tan | N/A | |
| Adversarial Attacks on Multi-Agent Communication | James Tu, Tsunhsuan Wang, Jingkang Wang, Sivabalan Manivasagam, Mengye Ren, Raquel Urtasun | N/A | |
| TransPose: Keypoint Localization via Transformer | Sen Yang, Zhibin Quan, Mu Nie, Wankou Yang | N/A | |
| Vector-Decomposed Disentanglement for Domain-Invariant Object Detection | Aming Wu, Rui Liu, Yahong Han, Linchao Zhu, Yi Yang | N/A | |
| Topologically Consistent Multi-View Face Inference Using Volumetric Sampling | Tianye Li, Shichen Liu, Timo Bolkart, Jiayi Liu, Hao Li, Yajie Zhao | N/A | |
| IDM: An Intermediate Domain Module for Domain Adaptive Person Re-ID | Yongxing Dai, Jun Liu, Yifan Sun, Zekun Tong, Chi Zhang, Ling-Yu Duan | N/A | |
| Robust Watermarking for Deep Neural Networks via Bi-Level Optimization | Peng Yang, Yingjie Lao, Ping Li | N/A | |
| Efficient Video Compression via Content-Adaptive Super-Resolution | Mehrdad Khani, Vibhaalakshmi Sivaraman, Mohammad Alizadeh | N/A | |
| Video Annotation for Visual Tracking via Selection and Refinement | Kenan Dai, Jie Zhao, Lijun Wang, Dong Wang, Jianhua Li, Huchuan Lu, Xuesheng Qian, Xiaoyun Yang | N/A | |
| A Unified 3D Human Motion Synthesis Model via Conditional Variational Auto-Encoder | Yujun Cai, Yiwei Wang, Yiheng Zhu, Tat-Jen Cham, Jianfei Cai, Junsong Yuan, Jun Liu, Chuanxia Zheng, Sijie Yan, Henghui Ding, Xiaohui Shen, Ding Liu, Nadia Magnenat Thalmann | N/A | |
| SaccadeCam: Adaptive Visual Attention for Monocular Depth Sensing | Brevin Tilmon, Sanjeev J. Koppal | N/A | |
| Safety-Aware Motion Prediction With Unseen Vehicles for Autonomous Driving | Xuanchi Ren, Tao Yang, Li Erran Li, Alexandre Alahi, Qifeng Chen | N/A | |
| Mesh Graphormer | Kevin Lin, Lijuan Wang, Zicheng Liu | N/A | |
| CrossNorm and SelfNorm for Generalization Under Distribution Shifts | Zhiqiang Tang, Yunhe Gao, Yi Zhu, Zhi Zhang, Mu Li, Dimitris N. Metaxas | N/A | |
| Elaborative Rehearsal for Zero-Shot Action Recognition | Shizhe Chen, Dong Huang | N/A | |
| CAG-QIL: Context-Aware Actionness Grouping via Q Imitation Learning for Online Temporal Action Localization | Hyolim Kang, Kyungmin Kim, Yumin Ko, Seon Joo Kim | N/A | |
| Joint Visual and Audio Learning for Video Highlight Detection | Taivanbat Badamdorj, Mrigank Rochan, Yang Wang, Li Cheng | N/A | |
| Shallow Bayesian Meta Learning for Real-World Few-Shot Recognition | Xueting Zhang, Debin Meng, Henry Gouk, Timothy M. Hospedales | N/A | |
| Towards Interpretable Deep Metric Learning With Structural Matching | Wenliang Zhao, Yongming Rao, Ziyi Wang, Jiwen Lu, Jie Zhou | N/A | |
| Weakly Supervised Text-Based Person Re-Identification | Shizhen Zhao, Changxin Gao, Yuanjie Shao, Wei-Shi Zheng, Nong Sang | N/A | |
| Learning Temporal Dynamics From Cycles in Narrated Video | Dave Epstein, Jiajun Wu, Cordelia Schmid, Chen Sun | N/A | |
| von Mises-Fisher Loss: An Exploration of Embedding Geometries for Supervised Learning | Tyler R. Scott, Andrew C. Gallagher, Michael C. Mozer | N/A | |
| Multiple Heads Are Better Than One: Few-Shot Font Generation With Multiple Localized Experts | Song Park, Sanghyuk Chun, Junbum Cha, Bado Lee, Hyunjung Shim | N/A | |
| Me-Momentum: Extracting Hard Confident Examples From Noisily Labeled Data | Yingbin Bai, Tongliang Liu | N/A | |
| mDALU: Multi-Source Domain Adaptation and Label Unification With Partial Datasets | Rui Gong, Dengxin Dai, Yuhua Chen, Wen Li, Luc Van Gool | N/A | |
| Collaging Class-Specific GANs for Semantic Image Synthesis | Yuheng Li, Yijun Li, Jingwan Lu, Eli Shechtman, Yong Jae Lee, Krishna Kumar Singh | N/A | |
| Meta Navigator: Search for a Good Adaptation Policy for Few-Shot Learning | Chi Zhang, Henghui Ding, Guosheng Lin, Ruibo Li, Changhu Wang, Chunhua Shen | N/A | |
| Occlusion-Aware Video Object Inpainting | Lei Ke, Yu-Wing Tai, Chi-Keung Tang | N/A | |
| TransFER: Learning Relation-Aware Facial Expression Representations With Transformers | Fanglei Xue, Qiangchang Wang, Guodong Guo | N/A | |
| Bayesian Triplet Loss: Uncertainty Quantification in Image Retrieval | Frederik Warburg, Martin Jørgensen, Javier Civera, Søren Hauberg | N/A | |
| Manifold Matching via Deep Metric Learning for Generative Modeling | Mengyu Dai, Haibin Hang | N/A | |
| ProFlip: Targeted Trojan Attack With Progressive Bit Flips | Huili Chen, Cheng Fu, Jishen Zhao, Farinaz Koushanfar | N/A | |
| AutoFormer: Searching Transformers for Visual Recognition | Minghao Chen, Houwen Peng, Jianlong Fu, Haibin Ling | N/A | |
| Mining Latent Classes for Few-Shot Segmentation | Lihe Yang, Wei Zhuo, Lei Qi, Yinghuan Shi, Yang Gao | N/A | |
| Active Learning for Deep Object Detection via Probabilistic Modeling | Jiwoong Choi, Ismail Elezi, Hyuk-Jae Lee, Clement Farabet, Jose M. Alvarez | N/A | |
| Occlude Them All: Occlusion-Aware Attention Network for Occluded Person Re-ID | Peixian Chen, Wenfeng Liu, Pingyang Dai, Jianzhuang Liu, Qixiang Ye, Mingliang Xu, Qi’an Chen, Rongrong Ji | N/A | |
| Towards Accurate Alignment in Real-Time 3D Hand-Mesh Reconstruction | Xiao Tang, Tianyu Wang, Chi-Wing Fu | N/A | |
| Searching for Controllable Image Restoration Networks | Heewon Kim, Sungyong Baik, Myungsub Choi, Janghoon Choi, Kyoung Mu Lee | N/A | |
| Cross-Category Video Highlight Detection via Set-Based Learning | Minghao Xu, Hang Wang, Bingbing Ni, Riheng Zhu, Zhenbang Sun, Changhu Wang | N/A | |
| Attention Is Not Enough: Mitigating the Distribution Discrepancy in Asynchronous Multimodal Sequence Fusion | Tao Liang, Guosheng Lin, Lei Feng, Yan Zhang, Fengmao Lv | N/A | |
| Seeing Dynamic Scene in the Dark: A High-Quality Video Dataset With Mechatronic Alignment | Ruixing Wang, Xiaogang Xu, Chi-Wing Fu, Jiangbo Lu, Bei Yu, Jiaya Jia | N/A | |
| AdvRush: Searching for Adversarially Robust Neural Architectures | Jisoo Mok, Byunggook Na, Hyeokjun Choe, Sungroh Yoon | N/A | |
| Amplitude-Phase Recombination: Rethinking Robustness of Convolutional Neural Networks in Frequency Domain | Guangyao Chen, Peixi Peng, Li Ma, Jia Li, Lin Du, Yonghong Tian | N/A | |
| Mean Shift for Self-Supervised Learning | Soroush Abbasi Koohpayegani, Ajinkya Tejankar, Hamed Pirsiavash | N/A | |
| Speech Drives Templates: Co-Speech Gesture Synthesis With Learned Templates | Shenhan Qian, Zhi Tu, Yihao Zhi, Wen Liu, Shenghua Gao | N/A | |
| Improving Robustness Against Common Corruptions With Frequency Biased Models | Tonmoy Saikia, Cordelia Schmid, Thomas Brox | N/A | |
| AdvDrop: Adversarial Attack to DNNs by Dropping Information | Ranjie Duan, Yuefeng Chen, Dantong Niu, Yun Yang, A. K. Qin, Yuan He | N/A | |
| HuMoR: 3D Human Motion Model for Robust Pose Estimation | Davis Rempe, Tolga Birdal, Aaron Hertzmann, Jimei Yang, Srinath Sridhar, Leonidas J. Guibas | N/A | |
| Class Semantics-Based Attention for Action Detection | Deepak Sridhar, Niamul Quader, Srikanth Muralidharan, Yaoxin Li, Peng Dai, Juwei Lu | N/A | |
| Adaptive Graph Convolution for Point Cloud Analysis | Haoran Zhou, Yidan Feng, Mingsheng Fang, Mingqiang Wei, Jing Qin, Tong Lu | N/A | |
| Adversarial Attack on Deep Cross-Modal Hamming Retrieval | Chao Li, Shangqian Gao, Cheng Deng, Wei Liu, Heng Huang | N/A | |
| UASNet: Uncertainty Adaptive Sampling Network for Deep Stereo Matching | Yamin Mao, Zhihua Liu, Weiming Li, Yuchao Dai, Qiang Wang, Yun-Tae Kim, Hong-Seok Lee | N/A | |
| Minimal Adversarial Examples for Deep Learning on 3D Point Clouds | Jaeyeon Kim, Binh-Son Hua, Thanh Nguyen, Sai-Kit Yeung | N/A | |
| MultiSports: A Multi-Person Video Dataset of Spatio-Temporally Localized Sports Actions | Yixuan Li, Lei Chen, Runyu He, Zhenzhi Wang, Gangshan Wu, Limin Wang | N/A | |
| Triggering Failures: Out-of-Distribution Detection by Learning From Local Adversarial Attacks in Semantic Segmentation | Victor Besnier, Andrei Bursuc, David Picard, Alexandre Briot | N/A | |
| Glimpse-Attend-and-Explore: Self-Attention for Active Visual Exploration | Soroush Seifi, Abhishek Jha, Tinne Tuytelaars | N/A | |
| Field Convolutions for Surface CNNs | Thomas W. Mitchel, Vladimir G. Kim, Michael Kazhdan | N/A | |
| SIMstack: A Generative Shape and Instance Model for Unordered Object Stacks | Zoe Landgraf, Raluca Scona, Tristan Laidlow, Stephen James, Stefan Leutenegger, Andrew J. Davison | N/A | |
| Weakly Supervised Person Search With Region Siamese Networks | Chuchu Han, Kai Su, Dongdong Yu, Zehuan Yuan, Changxin Gao, Nong Sang, Yi Yang, Changhu Wang | N/A | |
| Learning Icosahedral Spherical Probability Map Based on Bingham Mixture Model for Vanishing Point Estimation | Haoang Li, Kai Chen, Pyojin Kim, Kuk-Jin Yoon, Zhe Liu, Kyungdon Joo, Yun-Hui Liu | N/A | |
| Incorporating Learnable Membrane Time Constant To Enhance Learning of Spiking Neural Networks | Wei Fang, Zhaofei Yu, Yanqi Chen, Timothée Masquelier, Tiejun Huang, Yonghong Tian | N/A | |
| Real-Time Vanishing Point Detector Integrating Under-Parameterized RANSAC and Hough Transform | Jianping Wu, Liang Zhang, Ye Liu, Ke Chen | N/A | |
| Pose Invariant Topological Memory for Visual Navigation | Asuto Taniguchi, Fumihiro Sasaki, Ryota Yamashina | N/A | |
| Shape Self-Correction for Unsupervised Point Cloud Understanding | Ye Chen, Jinxian Liu, Bingbing Ni, Hang Wang, Jiancheng Yang, Ning Liu, Teng Li, Qi Tian | N/A | |
| ReStyle: A Residual-Based StyleGAN Encoder via Iterative Refinement | Yuval Alaluf, Or Patashnik, Daniel Cohen-Or | N/A | |
| Low-Rank Tensor Completion by Approximating the Tensor Average Rank | Zhanliang Wang, Junyu Dong, Xinguo Liu, Xueying Zeng | N/A | |
| Dissecting Image Crops | Basile Van Hoorick, Carl Vondrick | N/A | |
| Exploiting Multi-Object Relationships for Detecting Adversarial Attacks in Complex Scenes | Mingjun Yin, Shasha Li, Zikui Cai, Chengyu Song, M. Salman Asif, Amit K. Roy-Chowdhury, Srikanth V. Krishnamurthy | N/A | |
| Pixel Contrastive-Consistent Semi-Supervised Semantic Segmentation | Yuanyi Zhong, Bodi Yuan, Hong Wu, Zhiqiang Yuan, Jian Peng, Yu-Xiong Wang | N/A | |
| Standardized Max Logits: A Simple yet Effective Approach for Identifying Unexpected Road Obstacles in Urban-Scene Segmentation | Sanghun Jung, Jungsoo Lee, Daehoon Gwak, Sungha Choi, Jaegul Choo | N/A | |
| SIGNET: Efficient Neural Representation for Light Fields | Brandon Yushan Feng, Amitabh Varshney | N/A | |
| Cross-Descriptor Visual Localization and Mapping | Mihai Dusmanu, Ondrej Miksik, Johannes L. Schönberger, Marc Pollefeys | N/A | |
| Understanding and Evaluating Racial Biases in Image Captioning | Dora Zhao, Angelina Wang, Olga Russakovsky | N/A | |
| Panoptic Narrative Grounding | Cristina González, Nicolás Ayobi, Isabela Hernández, José Hernández, Jordi Pont-Tuset, Pablo Arbeláez | N/A | |
| Weakly-Supervised Video Anomaly Detection With Robust Temporal Feature Magnitude Learning | Yu Tian, Guansong Pang, Yuanhong Chen, Rajvinder Singh, Johan W. Verjans, Gustavo Carneiro | N/A | |
| Learning an Augmented RGB Representation With Cross-Modal Knowledge Distillation for Action Detection | Rui Dai, Srijan Das, François Bremond | N/A | |
| VaPiD: A Rapid Vanishing Point Detector via Learned Optimizers | Shichen Liu, Yichao Zhou, Yajie Zhao | N/A | |
| Deep Survival Analysis With Longitudinal X-Rays for COVID-19 | Michelle Shu, Richard Strong Bowen, Charles Herrmann, Gengmo Qi, Michele Santacatterina, Ramin Zabih | N/A | |
| Cluster-Promoting Quantization With Bit-Drop for Minimizing Network Quantization Loss | Jung Hyun Lee, Jihun Yun, Sung Ju Hwang, Eunho Yang | N/A | |
| Continual Prototype Evolution: Learning Online From Non-Stationary Data Streams | Matthias De Lange, Tinne Tuytelaars | N/A | |
| Iterative Label Cleaning for Transductive and Semi-Supervised Few-Shot Learning | Michalis Lazarou, Tania Stathaki, Yannis Avrithis | N/A | |
| Striking a Balance Between Stability and Plasticity for Class-Incremental Learning | Guile Wu, Shaogang Gong, Pan Li | N/A | |
| Few-Shot and Continual Learning With Attentive Independent Mechanisms | Eugene Lee, Cheng-Han Huang, Chen-Yi Lee | N/A | |
| Trash To Treasure: Harvesting OOD Data With Cross-Modal Matching for Open-Set Semi-Supervised Learning | Junkai Huang, Chaowei Fang, Weikai Chen, Zhenhua Chai, Xiaolin Wei, Pengxu Wei, Liang Lin, Guanbin Li | N/A | |
| AdaFit: Rethinking Learning-Based Normal Estimation on Point Clouds | Runsong Zhu, Yuan Liu, Zhen Dong, Yuan Wang, Tengping Jiang, Wenping Wang, Bisheng Yang | N/A | |
| Regularizing Nighttime Weirdness: Efficient Self-Supervised Monocular Depth Estimation in the Dark | Kun Wang, Zhenyu Zhang, Zhiqiang Yan, Xiang Li, Baobei Xu, Jun Li, Jian Yang | N/A | |
| Occluded Person Re-Identification With Single-Scale Global Representations | Cheng Yan, Guansong Pang, Jile Jiao, Xiao Bai, Xuetao Feng, Chunhua Shen | N/A | |
| Nerfies: Deformable Neural Radiance Fields | Keunhong Park, Utkarsh Sinha, Jonathan T. Barron, Sofien Bouaziz, Dan B Goldman, Steven M. Seitz, Ricardo Martin-Brualla | N/A | |
| Towards Novel Target Discovery Through Open-Set Domain Adaptation | Taotao Jing, Hongfu Liu, Zhengming Ding | N/A | |
| Interpretable Visual Reasoning via Induced Symbolic Space | Zhonghao Wang, Kai Wang, Mo Yu, Jinjun Xiong, Wen-mei Hwu, Mark Hasegawa-Johnson, Humphrey Shi | N/A | |
| Generalizing Gaze Estimation With Outlier-Guided Collaborative Adaptation | Yunfei Liu, Ruicong Liu, Haofei Wang, Feng Lu | N/A | |
| Language-Guided Global Image Editing via Cross-Modal Cyclic Mechanism | Wentao Jiang, Ning Xu, Jiayun Wang, Chen Gao, Jing Shi, Zhe Lin, Si Liu | N/A | |
| BioFors: A Large Biomedical Image Forensics Dataset | Ekraam Sabir, Soumyaroop Nandi, Wael Abd-Almageed, Prem Natarajan | N/A | |
| DAM: Discrepancy Alignment Metric for Face Recognition | Jiaheng Liu, Yudong Wu, Yichao Wu, Chuming Li, Xiaolin Hu, Ding Liang, Mengyu Wang | N/A | |
| Structure-Transformed Texture-Enhanced Network for Person Image Synthesis | Munan Xu, Yuanqi Chen, Shan Liu, Thomas H. Li, Ge Li | N/A | |
| RMSMP: A Novel Deep Neural Network Quantization Framework With Row-Wise Mixed Schemes and Multiple Precisions | Sung-En Chang, Yanyu Li, Mengshu Sun, Weiwen Jiang, Sijia Liu, Yanzhi Wang, Xue Lin | N/A | |
| Robust Small-Scale Pedestrian Detection With Cued Recall via Memory Learning | Jung Uk Kim, Sungjune Park, Yong Man Ro | N/A | |
| RAIN: Reinforced Hybrid Attention Inference Network for Motion Forecasting | Jiachen Li, Fan Yang, Hengbo Ma, Srikanth Malla, Masayoshi Tomizuka, Chiho Choi | N/A | |
| Multimodal Co-Attention Transformer for Survival Prediction in Gigapixel Whole Slide Images | Richard J. Chen, Ming Y. Lu, Wei-Hung Weng, Tiffany Y. Chen, Drew F.K. Williamson, Trevor Manz, Maha Shady, Faisal Mahmood | N/A | |
| AutoShape: Real-Time Shape-Aware Monocular 3D Object Detection | Zongdai Liu, Dingfu Zhou, Feixiang Lu, Jin Fang, Liangjun Zhang | N/A | |
| Coarsely-Labeled Data for Better Few-Shot Transfer | Cheng Perng Phoo, Bharath Hariharan | N/A | |
| Tune It the Right Way: Unsupervised Validation of Domain Adaptation via Soft Neighborhood Density | Kuniaki Saito, Donghyun Kim, Piotr Teterwak, Stan Sclaroff, Trevor Darrell, Kate Saenko | N/A | |
| Improving Neural Network Efficiency via Post-Training Quantization With Adaptive Floating-Point | Fangxin Liu, Wenbo Zhao, Zhezhi He, Yanzhi Wang, Zongwu Wang, Changzhi Dai, Xiaoyao Liang, Li Jiang | N/A | |
| Emerging Properties in Self-Supervised Vision Transformers | Mathilde Caron, Hugo Touvron, Ishan Misra, Hervé Jégou, Julien Mairal, Piotr Bojanowski, Armand Joulin | N/A | |
| Improving Robustness of Facial Landmark Detection by Defending Against Adversarial Attacks | Congcong Zhu, Xiaoqiang Li, Jide Li, Songmin Dai | N/A | |
| DeepPanoContext: Panoramic 3D Scene Understanding With Holistic Scene Context Graph and Relation-Based Optimization | Cheng Zhang, Zhaopeng Cui, Cai Chen, Shuaicheng Liu, Bing Zeng, Hujun Bao, Yinda Zhang | N/A | |
| Multi-View Radar Semantic Segmentation | Arthur Ouaknine, Alasdair Newson, Patrick Pérez, Florence Tupin, Julien Rebut | N/A | |
| Exploring Robustness of Unsupervised Domain Adaptation in Semantic Segmentation | Jinyu Yang, Chunyuan Li, Weizhi An, Hehuan Ma, Yuzhi Guo, Yu Rong, Peilin Zhao, Junzhou Huang | N/A | |
| Interpretable Image Recognition by Constructing Transparent Embedding Space | Jiaqi Wang, Huafeng Liu, Xinyue Wang, Liping Jing | N/A | |
| Synthesized Feature Based Few-Shot Class-Incremental Learning on a Mixture of Subspaces | Ali Cheraghian, Shafin Rahman, Sameera Ramasinghe, Pengfei Fang, Christian Simon, Lars Petersson, Mehrtash Harandi | N/A | |
| Pyramid Point Cloud Transformer for Large-Scale Place Recognition | Le Hui, Hang Yang, Mingmei Cheng, Jin Xie, Jian Yang | N/A | |
| Interpreting Attributions and Interactions of Adversarial Attacks | Xin Wang, Shuyun Lin, Hao Zhang, Yufei Zhu, Quanshi Zhang | N/A | |
| Neural Photofit: Gaze-Based Mental Image Reconstruction | Florian Strohm, Ekta Sood, Sven Mayer, Philipp Müller, Mihai Bâce, Andreas Bulling | N/A | |
| Efficient and Differentiable Shadow Computation for Inverse Problems | Linjie Lyu, Marc Habermann, Lingjie Liu, Mallikarjun B R, Ayush Tewari, Christian Theobalt | N/A | |
| Artificial Fingerprinting for Generative Models: Rooting Deepfake Attribution in Training Data | Ning Yu, Vladislav Skripniuk, Sahar Abdelnabi, Mario Fritz | N/A | |
| TokenPose: Learning Keypoint Tokens for Human Pose Estimation | Yanjie Li, Shoukui Zhang, Zhicheng Wang, Sen Yang, Wankou Yang, Shu-Tao Xia, Erjin Zhou | N/A | |
| Disentangled Lifespan Face Synthesis | Sen He, Wentong Liao, Michael Ying Yang, Yi-Zhe Song, Bodo Rosenhahn, Tao Xiang | N/A | |
| Dual Transfer Learning for Event-Based End-Task Prediction via Pluggable Event to Image Translation | Lin Wang, Yujeong Chae, Kuk-Jin Yoon | N/A | |
| Exploration and Estimation for Model Compression | Yanfu Zhang, Shangqian Gao, Heng Huang | N/A | |
| Task Switching Network for Multi-Task Learning | Guolei Sun, Thomas Probst, Danda Pani Paudel, Nikola Popović, Menelaos Kanakis, Jagruti Patel, Dengxin Dai, Luc Van Gool | N/A | |
| Interaction Compass: Multi-Label Zero-Shot Learning of Human-Object Interactions via Spatial Relations | Dat Huynh, Ehsan Elhamifar | N/A | |
| Unsupervised Point Cloud Pre-Training via Occlusion Completion | Hanchen Wang, Qi Liu, Xiangyu Yue, Joan Lasenby, Matt J. Kusner | N/A | |
| Structure-From-Sherds: Incremental 3D Reassembly of Axially Symmetric Pots From Unordered and Mixed Fragment Collections | Je Hyeong Hong, Seong Jong Yoo, Muhammad Arshad Zeeshan, Young Min Kim, Jinwook Kim | N/A | |
| Towards Vivid and Diverse Image Colorization With Generative Color Prior | Yanze Wu, Xintao Wang, Yu Li, Honglun Zhang, Xun Zhao, Ying Shan | N/A | |
| Asymmetric Loss for Multi-Label Classification | Tal Ridnik, Emanuel Ben-Baruch, Nadav Zamir, Asaf Noy, Itamar Friedman, Matan Protter, Lihi Zelnik-Manor | N/A | |
| The Pursuit of Knowledge: Discovering and Localizing Novel Categories Using Dual Memory | Sai Saketh Rambhatla, Rama Chellappa, Abhinav Shrivastava | N/A | |
| Unconditional Scene Graph Generation | Sarthak Garg, Helisa Dhamo, Azade Farshad, Sabrina Musatian, Nassir Navab, Federico Tombari | N/A | |
| Unified Graph Structured Models for Video Understanding | Anurag Arnab, Chen Sun, Cordelia Schmid | N/A | |
| Minimal Cases for Computing the Generalized Relative Pose Using Affine Correspondences | Banglei Guan, Ji Zhao, Daniel Barath, Friedrich Fraundorfer | N/A | |
| Towards Efficient Graph Convolutional Networks for Point Cloud Handling | Yawei Li, He Chen, Zhaopeng Cui, Radu Timofte, Marc Pollefeys, Gregory S. Chirikjian, Luc Van Gool | N/A | |
| Gait Recognition in the Wild: A Benchmark | Zheng Zhu, Xianda Guo, Tian Yang, Junjie Huang, Jiankang Deng, Guan Huang, Dalong Du, Jiwen Lu, Jie Zhou | N/A | |
| Structured Bird's-Eye-View Traffic Scene Understanding From Onboard Images | Yigit Baran Can, Alexander Liniger, Danda Pani Paudel, Luc Van Gool | N/A | |
| MOTSynth: How Can Synthetic Data Help Pedestrian Detection and Tracking? | Matteo Fabbri, Guillem Brasó, Gianluca Maugeri, Orcun Cetintas, Riccardo Gasparini, Aljoša Ošep, Simone Calderara, Laura Leal-Taixé, Rita Cucchiara | N/A | |
| MonteFloor: Extending MCTS for Reconstructing Accurate Large-Scale Floor Plans | Sinisa Stekovic, Mahdi Rad, Friedrich Fraundorfer, Vincent Lepetit | N/A | |
| Relaxed Transformer Decoders for Direct Action Proposal Generation | Jing Tan, Jiaqi Tang, Limin Wang, Gangshan Wu | N/A | |
| D2-Net: Weakly-Supervised Action Localization via Discriminative Embeddings and Denoised Activations | Sanath Narayan, Hisham Cholakkal, Munawar Hayat, Fahad Shahbaz Khan, Ming-Hsuan Yang, Ling Shao | N/A | |
| Auto Graph Encoder-Decoder for Neural Network Pruning | Sixing Yu, Arya Mazaheri, Ali Jannesari | N/A | |
| Adaptive Surface Reconstruction With Multiscale Convolutional Kernels | Benjamin Ummenhofer, Vladlen Koltun | N/A | |
| Localized Simple Multiple Kernel K-Means | Xinwang Liu, Sihang Zhou, Li Liu, Chang Tang, Siwei Wang, Jiyuan Liu, Yi Zhang | N/A | |
| SmartShadow: Artistic Shadow Drawing Tool for Line Drawings | Lvmin Zhang, Jinyue Jiang, Yi Ji, Chunping Liu | N/A | |
| PT-CapsNet: A Novel Prediction-Tuning Capsule Network Suitable for Deeper Architectures | Chenbin Pan, Senem Velipasalar | N/A | |
| In-Place Scene Labelling and Understanding With Implicit Scene Representation | Shuaifeng Zhi, Tristan Laidlow, Stefan Leutenegger, Andrew J. Davison | N/A | |
| TGRNet: A Table Graph Reconstruction Network for Table Structure Recognition | Wenyuan Xue, Baosheng Yu, Wen Wang, Dacheng Tao, Qingyong Li | N/A | |
| Mixture-Based Feature Space Learning for Few-Shot Image Classification | Arman Afrasiyabi, Jean-François Lalonde, Christian Gagné | N/A | |
| Learning a Sketch Tensor Space for Image Inpainting of Man-Made Scenes | Chenjie Cao, Yanwei Fu | N/A | |
| MicroNet: Improving Image Recognition With Extremely Low FLOPs | Yunsheng Li, Yinpeng Chen, Xiyang Dai, Dongdong Chen, Mengchen Liu, Lu Yuan, Zicheng Liu, Lei Zhang, Nuno Vasconcelos | N/A | |
| Learning Canonical 3D Object Representation for Fine-Grained Recognition | Sunghun Joung, Seungryong Kim, Minsu Kim, Ig-Jae Kim, Kwanghoon Sohn | N/A | |
| Multi-Class Multi-Instance Count Conditioned Adversarial Image Generation | Amrutha Saseendran, Kathrin Skubch, Margret Keuper | N/A | |
| Specialize and Fuse: Pyramidal Output Representation for Semantic Segmentation | Chi-Wei Hsiao, Cheng Sun, Hwann-Tzong Chen, Min Sun | N/A | |
| DC-ShadowNet: Single-Image Hard and Soft Shadow Removal Using Unsupervised Domain-Classifier Guided Network | Yeying Jin, Aashish Sharma, Robby T. Tan | N/A | |
| Scalable Vision Transformers With Hierarchical Pooling | Zizheng Pan, Bohan Zhuang, Jing Liu, Haoyu He, Jianfei Cai | N/A | |
| Learning Instance-Level Spatial-Temporal Patterns for Person Re-Identification | Min Ren, Lingxiao He, Xingyu Liao, Wu Liu, Yunlong Wang, Tieniu Tan | N/A | |
| EgoRenderer: Rendering Human Avatars From Egocentric Camera Images | Tao Hu, Kripasindhu Sarkar, Lingjie Liu, Matthias Zwicker, Christian Theobalt | N/A | |
| Generative Adversarial Registration for Improved Conditional Deformable Templates | Neel Dey, Mengwei Ren, Adrian V. Dalca, Guido Gerig | N/A | |
| Visual Graph Memory With Unsupervised Representation for Visual Navigation | Obin Kwon, Nuri Kim, Yunho Choi, Hwiyeon Yoo, Jeongho Park, Songhwai Oh | N/A | |
| MGNet: Monocular Geometric Scene Understanding for Autonomous Driving | Markus Schön, Michael Buchholz, Klaus Dietmayer | N/A | |
| Auto-Parsing Network for Image Captioning and Visual Question Answering | Xu Yang, Chongyang Gao, Hanwang Zhang, Jianfei Cai | N/A | |
| F-Drop&Match: GANs With a Dead Zone in the High-Frequency Domain | Shin'ya Yamaguchi, Sekitoshi Kanai | N/A | |
| CryoDRGN2: Ab Initio Neural Reconstruction of 3D Protein Structures From Real Cryo-EM Images | Ellen D. Zhong, Adam Lerer, Joseph H. Davis, Bonnie Berger | N/A | |
| Generalized Shuffled Linear Regression | Feiran Li, Kent Fujiwara, Fumio Okura, Yasuyuki Matsushita | N/A | |
| AESOP: Abstract Encoding of Stories, Objects, and Pictures | Hareesh Ravi, Kushal Kafle, Scott Cohen, Jonathan Brandt, Mubbasir Kapadia | N/A | |
| Unsupervised Semantic Segmentation by Contrasting Object Mask Proposals | Wouter Van Gansbeke, Simon Vandenhende, Stamatios Georgoulis, Luc Van Gool | N/A | |
| Graph Contrastive Clustering | Huasong Zhong, Jianlong Wu, Chong Chen, Jianqiang Huang, Minghua Deng, Liqiang Nie, Zhouchen Lin, Xian-Sheng Hua | N/A | |
| LFI-CAM: Learning Feature Importance for Better Visual Explanation | Kwang Hee Lee, Chaewon Park, Junghyun Oh, Nojun Kwak | N/A | |
| InstanceRefer: Cooperative Holistic Understanding for Visual Grounding on Point Clouds Through Instance Multi-Level Contextual Referring | Zhihao Yuan, Xu Yan, Yinghong Liao, Ruimao Zhang, Sheng Wang, Zhen Li, Shuguang Cui | N/A | |
| Temporal-Wise Attention Spiking Neural Networks for Event Streams Classification | Man Yao, Huanhuan Gao, Guangshe Zhao, Dingheng Wang, Yihan Lin, Zhaoxu Yang, Guoqi Li | N/A | |
| Encoder-Decoder With Multi-Level Attention for 3D Human Shape and Pose Estimation | Ziniu Wan, Zhengjia Li, Maoqing Tian, Jianbo Liu, Shuai Yi, Hongsheng Li | N/A | |
| Adaptive Hierarchical Graph Reasoning With Semantic Coherence for Video-and-Language Inference | Juncheng Li, Siliang Tang, Linchao Zhu, Haochen Shi, Xuanwen Huang, Fei Wu, Yi Yang, Yueting Zhuang | N/A | |
| Transductive Few-Shot Classification on the Oblique Manifold | Guodong Qi, Huimin Yu, Zhaohui Lu, Shuzhao Li | N/A | |
| iNAS: Integral NAS for Device-Aware Salient Object Detection | Yu-Chao Gu, Shang-Hua Gao, Xu-Sheng Cao, Peng Du, Shao-Ping Lu, Ming-Ming Cheng | N/A | |
| Pyramid R-CNN: Towards Better Performance and Adaptability for 3D Object Detection | Jiageng Mao, Minzhe Niu, Haoyue Bai, Xiaodan Liang, Hang Xu, Chunjing Xu | N/A | |
| Graph-BAS3Net: Boundary-Aware Semi-Supervised Segmentation Network With Bilateral Graph Convolution | Huimin Huang, Lanfen Lin, Yue Zhang, Yingying Xu, Jing Zheng, XiongWei Mao, Xiaohan Qian, Zhiyi Peng, Jianying Zhou, Yen-Wei Chen, Ruofeng Tong | N/A | |
| The Animation Transformer: Visual Correspondence via Segment Matching | Evan Casey, Víctor Pérez, Zhuoru Li | N/A | |
| CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification | Chun-Fu (Richard) Chen, Quanfu Fan, Rameswar Panda | N/A | |
| Weak Adaptation Learning: Addressing Cross-Domain Data Insufficiency With Weak Annotator | Shichao Xu, Lixu Wang, Yixuan Wang, Qi Zhu | N/A | |
| Building-GAN: Graph-Conditioned Architectural Volumetric Design Generation | Kai-Hung Chang, Chin-Yi Cheng, Jieliang Luo, Shingo Murata, Mehdi Nourbakhsh, Yoshito Tsuji | N/A | |
| Scribble-Supervised Semantic Segmentation Inference | Jingshan Xu, Chuanwei Zhou, Zhen Cui, Chunyan Xu, Yuge Huang, Pengcheng Shen, Shaoxin Li, Jian Yang | N/A | |
| Improve Unsupervised Pretraining for Few-Label Transfer | Suichan Li, Dongdong Chen, Yinpeng Chen, Lu Yuan, Lei Zhang, Qi Chu, Bin Liu, Nenghai Yu | N/A | |
| Image Inpainting via Conditional Texture and Structure Dual Generation | Xiefan Guo, Hongyu Yang, Di Huang | N/A | |
| Geometry-Aware Self-Training for Unsupervised Domain Adaptation on Object Point Clouds | Longkun Zou, Hui Tang, Ke Chen, Kui Jia | N/A | |
| Robustness and Generalization via Generative Adversarial Training | Omid Poursaeed, Tianxing Jiang, Harry Yang, Serge Belongie, Ser-Nam Lim | N/A | |
| Exploring Inter-Channel Correlation for Diversity-Preserved Knowledge Distillation | Li Liu, Qingle Huang, Sihao Lin, Hongwei Xie, Bing Wang, Xiaojun Chang, Xiaodan Liang | N/A | |
| Class-Incremental Learning for Action Recognition in Videos | Jaeyoo Park, Minsoo Kang, Bohyung Han | N/A | |
| Procrustean Training for Imbalanced Deep Learning | Han-Jia Ye, De-Chuan Zhan, Wei-Lun Chao | N/A | |
| Dynamic Network Quantization for Efficient Video Inference | Ximeng Sun, Rameswar Panda, Chun-Fu (Richard) Chen, Aude Oliva, Rogerio Feris, Kate Saenko | N/A | |
| Space-Time Crop & Attend: Improving Cross-Modal Video Representation Learning | Mandela Patrick, Po-Yao Huang, Ishan Misra, Florian Metze, Andrea Vedaldi, Yuki M. Asano, João F. Henriques | N/A | |
| RDA: Robust Domain Adaptation via Fourier Adversarial Attacking | Jiaxing Huang, Dayan Guan, Aoran Xiao, Shijian Lu | N/A | |
| WB-DETR: Transformer-Based Detector Without Backbone | Fanfan Liu, Haoran Wei, Wenzhe Zhao, Guozhen Li, Jingquan Peng, Zihao Li | N/A | |
| Worldsheet: Wrapping the World in a 3D Sheet for View Synthesis From a Single Image | Ronghang Hu, Nikhila Ravi, Alexander C. Berg, Deepak Pathak | N/A | |
| Patch2CAD: Patchwise Embedding Learning for In-the-Wild Shape Retrieval From a Single Image | Weicheng Kuo, Anelia Angelova, Tsung-Yi Lin, Angela Dai | N/A | |
| Perceptual Variousness Motion Deblurring With Light Global Context Refinement | Jichun Li, Weimin Tan, Bo Yan | N/A | |
| Self-Calibrating Neural Radiance Fields | Yoonwoo Jeong, Seokjun Ahn, Christopher Choy, Anima Anandkumar, Minsu Cho, Jaesik Park | N/A | |
| Motion Adaptive Pose Estimation From Compressed Videos | Zhipeng Fan, Jun Liu, Yao Wang | N/A | |
| Learning Motion Priors for 4D Human Body Capture in 3D Scenes | Siwei Zhang, Yan Zhang, Federica Bogo, Marc Pollefeys, Siyu Tang | N/A | |
| Robust Automatic Monocular Vehicle Speed Estimation for Traffic Surveillance | Jerome Revaud, Martin Humenberger | N/A | |
| Face Image Retrieval With Attribute Manipulation | Alireza Zaeemzadeh, Shabnam Ghadar, Baldo Faieta, Zhe Lin, Nazanin Rahnavard, Mubarak Shah, Ratheesh Kalarot | N/A | |
| RFNet: Region-Aware Fusion Network for Incomplete Multi-Modal Brain Tumor Segmentation | Yuhang Ding, Xin Yu, Yi Yang | N/A | |
| Weakly Supervised Contrastive Learning | Mingkai Zheng, Fei Wang, Shan You, Chen Qian, Changshui Zhang, Xiaogang Wang, Chang Xu | N/A | |
| SLIM: Self-Supervised LiDAR Scene Flow and Motion Segmentation | Stefan Andreas Baur, David Josef Emmerichs, Frank Moosmann, Peter Pinggera, Björn Ommer, Andreas Geiger | N/A | |
| Likelihood-Based Diverse Sampling for Trajectory Forecasting | Yecheng Jason Ma, Jeevana Priya Inala, Dinesh Jayaraman, Osbert Bastani | N/A | |
| In Defense of Scene Graphs for Image Captioning | Kien Nguyen, Subarna Tripathi, Bang Du, Tanaya Guha, Truong Q. Nguyen | N/A | |
| Rank & Sort Loss for Object Detection and Instance Segmentation | Kemal Oksuz, Baris Can Cam, Emre Akbas, Sinan Kalkan | N/A | |
| RANK-NOSH: Efficient Predictor-Based Architecture Search via Non-Uniform Successive Halving | Ruochen Wang, Xiangning Chen, Minhao Cheng, Xiaocheng Tang, Cho-Jui Hsieh | N/A | |
| Dual Path Learning for Domain Adaptation of Semantic Segmentation | Yiting Cheng, Fangyun Wei, Jianmin Bao, Dong Chen, Fang Wen, Wenqiang Zhang | N/A | |
| Synthesis of Compositional Animations From Textual Descriptions | Anindita Ghosh, Noshaba Cheema, Cennet Oguz, Christian Theobalt, Philipp Slusallek | N/A | |
| Dual Bipartite Graph Learning: A General Approach for Domain Adaptive Object Detection | Chaoqi Chen, Jiongcheng Li, Zebiao Zheng, Yue Huang, Xinghao Ding, Yizhou Yu | N/A | |
| Parametric Contrastive Learning | Jiequan Cui, Zhisheng Zhong, Shu Liu, Bei Yu, Jiaya Jia | N/A | |
| A Simple Feature Augmentation for Domain Generalization | Pan Li, Da Li, Wei Li, Shaogang Gong, Yanwei Fu, Timothy M. Hospedales | N/A | |
| Visual Transformers: Where Do Transformers Really Belong in Vision Models? | Bichen Wu, Chenfeng Xu, Xiaoliang Dai, Alvin Wan, Peizhao Zhang, Zhicheng Yan, Masayoshi Tomizuka, Joseph E. Gonzalez, Kurt Keutzer, Peter Vajda | N/A | |
| Is Pseudo-Lidar Needed for Monocular 3D Object Detection? | Dennis Park, Rares Ambrus, Vitor Guizilini, Jie Li, Adrien Gaidon | N/A | |
| TS-CAM: Token Semantic Coupled Attention Map for Weakly Supervised Object Localization | Wei Gao, Fang Wan, Xingjia Pan, Zhiliang Peng, Qi Tian, Zhenjun Han, Bolei Zhou, Qixiang Ye | N/A | |
| Geometry-Based Distance Decomposition for Monocular 3D Object Detection | Xuepeng Shi, Qi Ye, Xiaozhi Chen, Chuangrong Chen, Zhixiang Chen, Tae-Kyun Kim | N/A | |
| Deep 3D Mask Volume for View Synthesis of Dynamic Scenes | Kai-En Lin, Lei Xiao, Feng Liu, Guowei Yang, Ravi Ramamoorthi | N/A | |
| Unified Questioner Transformer for Descriptive Question Generation in Goal-Oriented Visual Dialogue | Shoya Matsumori, Kosuke Shingyouchi, Yuki Abe, Yosuke Fukuchi, Komei Sugiura, Michita Imai | N/A | |
| Human Trajectory Prediction via Counterfactual Analysis | Guangyi Chen, Junlong Li, Jiwen Lu, Jie Zhou | N/A | |
| Counterfactual Attention Learning for Fine-Grained Visual Categorization and Re-Identification | Yongming Rao, Guangyi Chen, Jiwen Lu, Jie Zhou | N/A | |
| Effectively Leveraging Attributes for Visual Similarity | Samarth Mishra, Zhongping Zhang, Yuan Shen, Ranjitha Kumar, Venkatesh Saligrama, Bryan A. Plummer | N/A | |
| Anticipative Video Transformer | Rohit Girdhar, Kristen Grauman | N/A | |
| Semantically Robust Unpaired Image Translation for Data With Unmatched Semantics Statistics | Zhiwei Jia, Bodi Yuan, Kangkang Wang, Hong Wu, David Clifford, Zhiqiang Yuan, Hao Su | N/A | |
| Progressive Seed Generation Auto-Encoder for Unsupervised Point Cloud Learning | Juyoung Yang, Pyunghwan Ahn, Doyeon Kim, Haeil Lee, Junmo Kim | N/A | |
| Waypoint Models for Instruction-Guided Navigation in Continuous Environments | Jacob Krantz, Aaron Gokaslan, Dhruv Batra, Stefan Lee, Oleksandr Maksymets | N/A | |
| Rethinking Preventing Class-Collapsing in Metric Learning With Margin-Based Losses | Elad Levi, Tete Xiao, Xiaolong Wang, Trevor Darrell | N/A | |
| HiNet: Deep Image Hiding by Invertible Network | Junpeng Jing, Xin Deng, Mai Xu, Jianyi Wang, Zhenyu Guan | N/A | |
| Rotation Averaging in a Split Second: A Primal-Dual Method and a Closed-Form for Cycle Graphs | Gabriel Moreira, Manuel Marques, João Paulo Costeira | N/A | |
| End-to-End Robust Joint Unsupervised Image Alignment and Clustering | Xiangrui Zeng, Gregory Howe, Min Xu | N/A | |
| BabelCalib: A Universal Approach to Calibrating Central Cameras | Yaroslava Lochman, Kostiantyn Liepieshov, Jianhui Chen, Michal Perdoch, Christopher Zach, James Pritts | N/A | |
| Curious Representation Learning for Embodied Intelligence | Yilun Du, Chuang Gan, Phillip Isola | N/A | |
| Multi-Modal Multi-Action Video Recognition | Zhensheng Shi, Ju Liang, Qianqian Li, Haiyong Zheng, Zhaorui Gu, Junyu Dong, Bing Zheng | N/A | |
| Cross-Patch Graph Convolutional Network for Image Denoising | Yao Li, Xueyang Fu, Zheng-Jun Zha | N/A | |
| ISNet: Integrate Image-Level and Semantic-Level Context for Semantic Segmentation | Zhenchao Jin, Bin Liu, Qi Chu, Nenghai Yu | N/A | |
| Body-Face Joint Detection via Embedding and Head Hook | Junfeng Wan, Jiangfan Deng, Xiaosong Qiu, Feng Zhou | N/A | |
| Enhancing Self-Supervised Video Representation Learning via Multi-Level Feature Optimization | Rui Qian, Yuxi Li, Huabin Liu, John See, Shuangrui Ding, Xian Liu, Dian Li, Weiyao Lin | N/A | |
| LIGA-Stereo: Learning LiDAR Geometry Aware Representations for Stereo-Based 3D Detector | Xiaoyang Guo, Shaoshuai Shi, Xiaogang Wang, Hongsheng Li | N/A | |
| Semi-Supervised Semantic Segmentation With Pixel-Level Contrastive Learning From a Class-Wise Memory Bank | Iñigo Alonso, Alberto Sabater, David Ferstl, Luis Montesano, Ana C. Murillo | N/A | |
| End-to-End Urban Driving by Imitating a Reinforcement Learning Coach | Zhejun Zhang, Alexander Liniger, Dengxin Dai, Fisher Yu, Luc Van Gool | N/A | |
| Interpolation-Aware Padding for 3D Sparse Convolutional Neural Networks | Yu-Qi Yang, Peng-Shuai Wang, Yang Liu | N/A | |
| Active Learning for Lane Detection: A Knowledge Distillation Approach | Fengchao Peng, Chao Wang, Jianzhuang Liu, Zhen Yang | N/A | |
| Once Quantization-Aware Training: High Performance Extremely Low-Bit Architecture Search | Mingzhu Shen, Feng Liang, Ruihao Gong, Yuhang Li, Chuming Li, Chen Lin, Fengwei Yu, Junjie Yan, Wanli Ouyang | N/A | |
| Learn To Match: Automatic Matching Network Design for Visual Tracking | Zhipeng Zhang, Yihao Liu, Xiao Wang, Bing Li, Weiming Hu | N/A | |
| Oriented R-CNN for Object Detection | Xingxing Xie, Gong Cheng, Jiabao Wang, Xiwen Yao, Junwei Han | N/A | |
| TransVG: End-to-End Visual Grounding With Transformers | Jiajun Deng, Zhengyuan Yang, Tianlang Chen, Wengang Zhou, Houqiang Li | N/A | |
| Airbert: In-Domain Pretraining for Vision-and-Language Navigation | Pierre-Louis Guhur, Makarand Tapaswi, Shizhe Chen, Ivan Laptev, Cordelia Schmid | N/A | |
| Internal Video Inpainting by Implicit Long-Range Propagation | Hao Ouyang, Tengfei Wang, Qifeng Chen | N/A | |
ICLR 2020
| Title | Author | PDF_Link | Code_URL |
|---|---|---|---|
| Computation Reallocation for Object Detection | Unknown | N/A | |
| At Stability's Edge: How to Adjust Hyperparameters to Preserve Minima Selection in Asynchronous Training of Neural Networks? | Unknown | N/A | |
| A Closer Look at the Optimization Landscapes of Generative Adversarial Networks | Unknown | N/A | |
| Hamiltonian Generative Networks | Unknown | N/A | |
| Stochastic Weight Averaging in Parallel: Large-Batch Training That Generalizes Well | Unknown | N/A | |
| Convergence of Gradient Methods on Bilinear Zero-Sum Games | Unknown | N/A | |
| Meta Reinforcement Learning with Autonomous Inference of Subtask Dependencies | Unknown | N/A | |
| Global Relational Models of Source Code | Unknown | N/A | |
| Continual learning with hypernetworks | Unknown | N/A | |
| Environmental drivers of systematicity and generalization in a situated agent | Unknown | N/A | |
| An Exponential Learning Rate Schedule for Deep Learning | Unknown | N/A | |
| Understanding the Limitations of Conditional Generative Models | Unknown | N/A | |
| ReClor: A Reading Comprehension Dataset Requiring Logical Reasoning | Unknown | N/A | |
| Adversarial AutoAugment | Unknown | N/A | |
| Neural Machine Translation with Universal Visual Representation | Unknown | N/A | |
| Is a Good Representation Sufficient for Sample Efficient Reinforcement Learning? | Unknown | N/A | |
| Once for All: Train One Network and Specialize it for Efficient Deployment | Unknown | N/A | |
| Dynamical Distance Learning for Semi-Supervised and Unsupervised Skill Discovery | Unknown | N/A | |
| Differentiation of Blackbox Combinatorial Solvers | Unknown | N/A | |
| Towards Better Understanding of Adaptive Gradient Algorithms in Generative Adversarial Nets | Unknown | N/A | |
| Dynamically Pruned Message Passing Networks for Large-scale Knowledge Graph Reasoning | Unknown | N/A | |
| Counterfactuals uncover the modular structure of deep generative models | Unknown | N/A | |
| Fast Neural Network Adaptation via Parameter Remapping and Architecture Search | Unknown | N/A | |
| Towards Stabilizing Batch Statistics in Backward Propagation of Batch Normalization | Unknown | N/A | |
| Learning to Group: A Bottom-Up Framework for 3D Part Discovery in Unseen Categories | Unknown | N/A | |
| Action Semantics Network: Considering the Effects of Actions in Multiagent Systems | Unknown | N/A | |
| Kernelized Wasserstein Natural Gradient | Unknown | N/A | |
| Learning from Explanations with Neural Execution Tree | Unknown | N/A | |
| Variance Reduction With Sparse Gradients | Unknown | N/A | |
| Batch-shaping for learning conditional channel gated networks | Unknown | N/A | |
| Self-Supervised Learning of Appliance Usage | Unknown | N/A | |
| CAQL: Continuous Action Q-Learning | Unknown | N/A | |
| Domain Adaptive Multibranch Networks | Unknown | N/A | |
| Simplified Action Decoder for Deep Multi-Agent Reinforcement Learning | Unknown | N/A | |
| Mirror-Generative Neural Machine Translation | Unknown | N/A | |
| FSNet: Compression of Deep Convolutional Neural Networks by Filter Summary | Unknown | N/A | |
| Explain Your Move: Understanding Agent Actions Using Salient and Relevant Feature Attribution | Unknown | N/A | |
| Convolutional Conditional Neural Processes | Unknown | N/A | |
| Regularizing activations in neural networks via distribution matching with the Wasserstein metric | Unknown | N/A | |
| VideoFlow: A Conditional Flow-Based Model for Stochastic Video Generation | Unknown | N/A | |
| Deep Orientation Uncertainty Learning based on a Bingham Loss | Unknown | N/A | |
| Scale-Equivariant Steerable Networks | Unknown | N/A | |
| The intriguing role of module criticality in the generalization of deep networks | Unknown | N/A | |
| A Theoretical Analysis of the Number of Shots in Few-Shot Learning | Unknown | N/A | |
| Graph Neural Networks Exponentially Lose Expressive Power for Node Classification | Unknown | N/A | |
| Provable Filter Pruning for Efficient Neural Networks | Unknown | N/A | |
| Option Discovery using Deep Skill Chaining | Unknown | N/A | |
| Deep Symbolic Superoptimization Without Human Knowledge | Unknown | N/A | |
| State Alignment-based Imitation Learning | Unknown | N/A | |
| Mogrifier LSTM | Unknown | N/A | |
| Prediction, Consistency, Curvature: Representation Learning for Locally-Linear Control | Unknown | N/A | |
| Target-Embedding Autoencoders for Supervised Representation Learning | Unknown | N/A | |
| Fair Resource Allocation in Federated Learning | Unknown | N/A | |
| Causal Discovery with Reinforcement Learning | Unknown | N/A | |
| Geom-GCN: Geometric Graph Convolutional Networks | Unknown | N/A | |
| Variational Hetero-Encoder Randomized GANs for Joint Image-Text Modeling | Unknown | N/A | |
| Sampling-Free Learning of Bayesian Quantized Neural Networks | Unknown | N/A | |
| On the Relationship between Self-Attention and Convolutional Layers | Unknown | N/A | |
| A Generalized Training Approach for Multiagent Learning | Unknown | N/A | |
| Simple and Effective Regularization Methods for Training on Noisily Labeled Data with Generalization Guarantee | Unknown | N/A | |
| Towards Verified Robustness under Text Deletion Interventions | Unknown | N/A | |
| Mixed Precision DNNs: All you need is a good parametrization | Unknown | N/A | |
| On Computation and Generalization of Generative Adversarial Imitation Learning | Unknown | N/A | |
| Demystifying Inter-Class Disentanglement | Unknown | N/A | |
| Progressive Learning and Disentanglement of Hierarchical Representations | Unknown | N/A | |
| Transferable Perturbations of Deep Feature Distributions | Unknown | N/A | |
| Hypermodels for Exploration | Unknown | N/A | |
| AssembleNet: Searching for Multi-Stream Neural Connectivity in Video Architectures | Unknown | N/A | |
| Semi-Supervised Generative Modeling for Controllable Speech Synthesis | Unknown | N/A | |
| You Only Train Once: Loss-Conditional Training of Deep Networks | Unknown | N/A | |
| Ranking Policy Gradient | Unknown | N/A | |
| Understanding and Robustifying Differentiable Architecture Search | Unknown | N/A | |
| On the interaction between supervision and self-play in emergent communication | Unknown | N/A | |
| Knowledge Consistency between Neural Networks and Beyond | Unknown | N/A | |
| Capsules with Inverted Dot-Product Attention Routing | Unknown | N/A | |
| Variational Autoencoders for Highly Multivariate Spatial Point Processes Intensities | Unknown | N/A | |
| Towards Fast Adaptation of Neural Architectures with Meta Learning | Unknown | N/A | |
| Stochastic Conditional Generative Networks with Basis Decomposition | Unknown | N/A | |
| Guiding Program Synthesis by Learning to Generate Examples | Unknown | N/A | |
| HiLLoC: lossless image compression with hierarchical latent variable models | Unknown | N/A | |
| Estimating counterfactual treatment outcomes over time through adversarially balanced representations | Unknown | N/A | |
| Denoising and Regularization via Exploiting the Structural Bias of Convolutional Generators | Unknown | N/A | |
| Identifying through Flows for Recovering Latent Representations | Unknown | N/A | |
| Learning Efficient Parameter Server Synchronization Policies for Distributed SGD | Unknown | N/A | |
| Watch, Try, Learn: Meta-Learning from Demonstrations and Rewards | Unknown | N/A | |
| Strategies for Pre-training Graph Neural Networks | Unknown | N/A | |
| Decoupling Representation and Classifier for Long-Tailed Recognition | Unknown | N/A | |
| Polylogarithmic width suffices for gradient descent to achieve arbitrarily small test error with shallow ReLU networks | Unknown | N/A | |
| Accelerating SGD with momentum for over-parameterized learning | Unknown | N/A | |
| PAC Confidence Sets for Deep Neural Networks via Calibrated Prediction | Unknown | N/A | |
| Inductive and Unsupervised Representation Learning on Graph Structured Objects | Unknown | N/A | |
| GraphSAINT: Graph Sampling Based Inductive Learning Method | Unknown | N/A | |
| Non-Autoregressive Dialog State Tracking | Unknown | N/A | |
| Disentangling neural mechanisms for perceptual grouping | Unknown | N/A | |
| A Probabilistic Formulation of Unsupervised Text Style Transfer | Unknown | N/A | |
| MEMO: A Deep Network for Flexible Combination of Episodic Memories | Unknown | N/A | |
| Neural Stored-program Memory | Unknown | N/A | |
| Asymptotics of Wide Networks from Feynman Diagrams | Unknown | N/A | |
| Optimistic Exploration even with a Pessimistic Initialisation | Unknown | N/A | |
| Gradient Descent Maximizes the Margin of Homogeneous Neural Networks | Unknown | N/A | |
| Duration-of-Stay Storage Assignment under Uncertainty | Unknown | N/A | |
| Continual Learning with Bayesian Neural Networks for Non-Stationary Data | Unknown | N/A | |
| Language GANs Falling Short | Unknown | N/A | |
| Neural Tangents: Fast and Easy Infinite Neural Networks in Python | Unknown | N/A | |
| Fooling Detection Alone is Not Enough: Adversarial Attack against Multiple Object Tracking | Unknown | N/A | |
| Gap-Aware Mitigation of Gradient Staleness | Unknown | N/A | |
| Finite Depth and Width Corrections to the Neural Tangent Kernel | Unknown | N/A | |
| Augmenting Genetic Algorithms with Deep Neural Networks for Exploring the Chemical Space | Unknown | N/A | |
| Maximum Likelihood Constraint Inference for Inverse Reinforcement Learning | Unknown | N/A | |
| SCALOR: Generative World Models with Scalable Object Representations | Unknown | N/A | |
| ReMixMatch: Semi-Supervised Learning with Distribution Matching and Augmentation Anchoring | Unknown | N/A | |
| Graph Constrained Reinforcement Learning for Natural Language Action Spaces | Unknown | N/A | |
| Learning Robust Representations via Multi-View Information Bottleneck | Unknown | N/A | |
| Dynamics-Aware Unsupervised Skill Discovery | Unknown | N/A | |
| Ridge Regression: Structure, Cross-Validation, and Sketching | Unknown | N/A | |
| Padé Activation Units: End-to-end Learning of Flexible Activation Functions in Deep Networks | Unknown | N/A | |
| Feature Interaction Interpretability: A Case for Explaining Ad-Recommendation Systems via Neural Interaction Detection | Unknown | N/A | |
| End to End Trainable Active Contours via Differentiable Rendering | Unknown | N/A | |
| Learning Disentangled Representations for CounterFactual Regression | Unknown | N/A | |
| Symplectic Recurrent Neural Networks | Unknown | N/A | |
| RNA Secondary Structure Prediction By Learning Unrolled Algorithms | Unknown | N/A | |
| BinaryDuo: Reducing Gradient Mismatch in Binary Activation Network by Coupling Binary Activations | Unknown | N/A | |
| Training individually fair ML models with sensitive subspace robustness | Unknown | N/A | |
| Mixed-curvature Variational Autoencoders | Unknown | N/A | |
| The Variational Bandwidth Bottleneck: Stochastic Evaluation on an Information Budget | Unknown | N/A | |
| Meta-Learning with Warped Gradient Descent | Unknown | N/A | |
| Towards a Deep Network Architecture for Structured Smoothness | Unknown | N/A | |
| Making Efficient Use of Demonstrations to Solve Hard Exploration Problems | Unknown | N/A | |
| Locally Constant Networks | Unknown | N/A | |
| Phase Transitions for the Information Bottleneck in Representation Learning | Unknown | N/A | |
| Lookahead: A Far-sighted Alternative of Magnitude-based Pruning | Unknown | N/A | |
| Decentralized Deep Learning with Arbitrary Communication Compression | Unknown | N/A | |
| Fast is better than free: Revisiting adversarial training | Unknown | N/A | |
| Structured Object-Aware Physics Prediction for Video Modeling and Planning | Unknown | N/A | |
| Fast Task Inference with Variational Intrinsic Successor Features | Unknown | N/A | |
| AE-OT: A New Generative Model Based on Extended Semi-Discrete Optimal Transport | Unknown | N/A | |
| Generative Ratio Matching Networks | Unknown | N/A | |
| ALBERT: A Lite BERT for Self-supervised Learning of Language Representations | Unknown | N/A | |
| The Early Phase of Neural Network Training | Unknown | N/A | |
| Meta-Dataset: A Dataset of Datasets for Learning to Learn from Few Examples | Unknown | N/A | |
| Pay Attention to Features, Transfer Learn Faster CNNs | Unknown | N/A | |
| Few-shot Text Classification with Distributional Signatures | Unknown | N/A | |
| Memory-Based Graph Networks | Unknown | N/A | |
| On the Equivalence between Positional Node Embeddings and Structural Graph Representations | Unknown | N/A | |
| Sub-policy Adaptation for Hierarchical Reinforcement Learning | Unknown | N/A | |
| DBA: Distributed Backdoor Attacks against Federated Learning | Unknown | N/A | |
| Gradient-Based Neural DAG Learning | Unknown | N/A | |
| Scalable Neural Methods for Reasoning With a Symbolic Knowledge Base | Unknown | N/A | |
| A closer look at the approximation capabilities of neural networks | Unknown | N/A | |
| Black-Box Adversarial Attack with Transferable Model-based Embedding | Unknown | N/A | |
| Online and stochastic optimization beyond Lipschitz continuity: A Riemannian approach | Unknown | N/A | |
| Self-labelling via simultaneous clustering and representation learning | Unknown | N/A | |
| Classification-Based Anomaly Detection for General Data | Unknown | N/A | |
| AugMix: A Simple Data Processing Method to Improve Robustness and Uncertainty | Unknown | N/A | |
| RNNs Incrementally Evolving on an Equilibrium Manifold: A Panacea for Vanishing and Exploding Gradients? | Unknown | N/A | |
| Implementing Inductive bias for different navigation tasks through diverse RNN attractors | Unknown | N/A | |
| Neural Text Generation With Unlikelihood Training | Unknown | N/A | |
| Kaleidoscope: An Efficient, Learnable Representation For All Structured Linear Maps | Unknown | N/A | |
| Rethinking Softmax Cross-Entropy Loss for Adversarial Robustness | Unknown | N/A | |
| Spike-based causal inference for weight alignment | Unknown | N/A | |
| Poly-encoders: Architectures and Pre-training Strategies for Fast and Accurate Multi-sentence Scoring | Unknown | N/A | |
| Empirical Studies on the Properties of Linear Regions in Deep Neural Networks | Unknown | N/A | |
| Expected Information Maximization: Using the I-Projection for Mixture Density Estimation | Unknown | N/A | |
| Meta-learning curiosity algorithms | Unknown | N/A | |
| Actor-Critic Provably Finds Nash Equilibria of Linear-Quadratic Mean-Field Games | Unknown | N/A | |
| Learning Execution Through Neural Code Fusion | Unknown | N/A | |
| Emergence of functional and structural properties of the head direction system by optimization of recurrent neural networks | Unknown | N/A | |
| Implicit Bias of Gradient Descent based Adversarial Training on Separable Data | Unknown | N/A | |
| Learning to Retrieve Reasoning Paths over Wikipedia Graph for Question Answering | Unknown | N/A | |
| Learning from Rules Generalizing Labeled Exemplars | Unknown | N/A | |
| Measuring Compositional Generalization: A Comprehensive Method on Realistic Data | Unknown | N/A | |
| SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference | Unknown | N/A | |
| Automated Relational Meta-learning | Unknown | N/A | |
| Reinforced Genetic Algorithm Learning for Optimizing Computation Graphs | Unknown | N/A | |
| RaPP: Novelty Detection with Reconstruction along Projection Pathway | Unknown | N/A | |
| Neural Execution of Graph Algorithms | Unknown | N/A | |
| Compositional Language Continual Learning | Unknown | N/A | |
| Observational Overfitting in Reinforcement Learning | Unknown | N/A | |
| Understanding Knowledge Distillation in Non-autoregressive Machine Translation | Unknown | N/A | |
| Learning to Plan in High Dimensions via Neural Exploration-Exploitation Trees | Unknown | N/A | |
| TabFact: A Large-scale Dataset for Table-based Fact Verification | Unknown | N/A | |
| Neural Arithmetic Units | Unknown | N/A | |
| Reconstructing continuous distributions of 3D protein structure from cryo-EM images | Unknown | N/A | |
| Generalization of Two-layer Neural Networks: An Asymptotic Viewpoint | Unknown | N/A | |
| SELF: Learning to Filter Noisy Labels with Self-Ensembling | Unknown | N/A | |
| Robust Reinforcement Learning for Continuous Control with Model Misspecification | Unknown | N/A | |
| GenDICE: Generalized Offline Estimation of Stationary Values | Unknown | N/A | |
| Unsupervised Model Selection for Variational Disentangled Representation Learning | Unknown | N/A | |
| Robust training with ensemble consensus | Unknown | N/A | |
| Improved Sample Complexities for Deep Neural Networks and Robust Classification via an All-Layer Margin | Unknown | N/A | |
| Functional vs. parametric equivalence of ReLU networks | Unknown | N/A | |
| Robust Subspace Recovery Layer for Unsupervised Anomaly Detection | Unknown | N/A | |
| A Constructive Prediction of the Generalization Error Across Scales | Unknown | N/A | |
| Learning deep graph matching with channel-independent embedding and Hungarian attention | Unknown | N/A | |
| Learning To Explore Using Active Neural SLAM | Unknown | N/A | |
| BlockSwap: Fisher-guided Block Substitution for Network Compression on a Budget | Unknown | N/A | |
| Tranquil Clouds: Neural Networks for Learning Temporally Coherent Features in Point Clouds | Unknown | N/A | |
| Why Not to Use Zero Imputation? Correcting Sparsity Bias in Training Neural Networks | Unknown | N/A | |
| Rényi Fair Inference | Unknown | N/A | |
| SAdam: A Variant of Adam for Strongly Convex Functions | Unknown | N/A | |
| SPACE: Unsupervised Object-Oriented Scene Representation via Spatial Attention and Decomposition | Unknown | N/A | |
| Hoppity: Learning Graph Transformations to Detect and Fix Bugs in Programs | Unknown | N/A | |
| Neural Symbolic Reader: Scalable Integration of Distributed and Symbolic Representations for Reading Comprehension | Unknown | N/A | |
| Adversarial Training and Provable Defenses: Bridging the Gap | Unknown | N/A | |
| Doubly Robust Bias Reduction in Infinite Horizon Off-Policy Estimation | Unknown | N/A | |
| Composing Task-Agnostic Policies with Deep Reinforcement Learning | Unknown | N/A | |
| NeurQuRI: Neural Question Requirement Inspector for Answerability Prediction in Machine Reading Comprehension | Unknown | N/A | |
| Lazy-CFR: fast and near-optimal regret minimization for extensive games with imperfect information | Unknown | N/A | |
| Semantically-Guided Representation Learning for Self-Supervised Monocular Depth | Unknown | N/A | |
| Vid2Game: Controllable Characters Extracted from Real-World Videos | Unknown | N/A | |
| Network Deconvolution | Unknown | N/A | |
| Pure and Spurious Critical Points: a Geometric Study of Linear Networks | Unknown | N/A | |
| Improving Generalization in Meta Reinforcement Learning using Learned Objectives | Unknown | N/A | |
| Meta Dropout: Learning to Perturb Latent Features for Generalization | Unknown | N/A | |
| A Theory of Usable Information under Computational Constraints | Unknown | N/A | |
| Gradientless Descent: High-Dimensional Zeroth-Order Optimization | Unknown | N/A | |
| Measuring and Improving the Use of Graph Information in Graph Neural Networks | Unknown | N/A | |
| Few-Shot Learning on Graphs via Super-Classes Based on Graph Spectral Measures | Unknown | N/A | |
| Keep Doing What Worked: Behavior Modelling Priors for Offline Reinforcement Learning | Unknown | N/A | |
| A Learning-based Iterative Method for Solving Vehicle Routing Problems | Unknown | N/A | |
| Bridging Mode Connectivity in Loss Landscapes and Adversarial Robustness | Unknown | N/A | |
| Understanding the Limitations of Variational Mutual Information Estimators | Unknown | N/A | |
| Generalized Convolutional Forest Networks for Domain Generalization and Visual Recognition | Unknown | N/A | |
| Q-learning with UCB Exploration is Sample Efficient for Infinite-Horizon MDP | Unknown | N/A | |
| Pitfalls of In-Domain Uncertainty Estimation and Ensembling in Deep Learning | Unknown | N/A | |
| Relational State-Space Model for Stochastic Multi-Object Systems | Unknown | N/A | |
| Deep Learning of Determinantal Point Processes via Proper Spectral Sub-gradient | Unknown | N/A | |
| Unrestricted Adversarial Examples via Semantic Manipulation | Unknown | N/A | |
| Data-Independent Neural Pruning via Coresets | Unknown | N/A | |
| Your classifier is secretly an energy based model and you should treat it like one | Unknown | N/A | |
| Generalization through Memorization: Nearest Neighbor Language Models | Unknown | N/A | |
| Piecewise linear activations substantially shape the loss surfaces of neural networks | Unknown | N/A | |
| Contrastive Learning of Structured World Models | Unknown | N/A | |
| On the Variance of the Adaptive Learning Rate and Beyond | Unknown | N/A | |
| Scalable Model Compression by Entropy Penalized Reparameterization | Unknown | N/A | |
| Ensemble Distribution Distillation | Unknown | N/A | |
| Low-Resource Knowledge-Grounded Dialogue Generation | Unknown | N/A | |
| Novelty Detection Via Blurring | Unknown | N/A | |
| A Signal Propagation Perspective for Pruning Neural Networks at Initialization | Unknown | N/A | |
| GLAD: Learning Sparse Graph Recovery | Unknown | N/A | |
| Cyclical Stochastic Gradient MCMC for Bayesian Deep Learning | Unknown | N/A | |
| Rethinking the Hyperparameters for Fine-tuning | Unknown | N/A | |
| Dynamics-Aware Embeddings | Unknown | N/A | |
| Unbiased Contrastive Divergence Algorithm for Training Energy-Based Latent Variable Models | Unknown | N/A | |
| Robustness Verification for Transformers | Unknown | N/A | |
| Improved memory in recurrent neural networks with sequential non-normal dynamics | Unknown | N/A | |
| Locality and Compositionality in Zero-Shot Learning | Unknown | N/A | |
| Extreme Classification via Adversarial Softmax Approximation | Unknown | N/A | |
| Economy Statistical Recurrent Units For Inferring Nonlinear Granger Causality | Unknown | N/A | |
| The Gambler's Problem and Beyond | Unknown | N/A | |
| DropEdge: Towards Deep Graph Convolutional Networks on Node Classification | Unknown | N/A | |
| Interpretable Complex-Valued Neural Networks for Privacy Protection | Unknown | N/A | |
| Geometric Insights into the Convergence of Nonlinear TD Learning | Unknown | N/A | |
| Learning Compositional Koopman Operators for Model-Based Control | Unknown | N/A | |
| On the Need for Topology-Aware Generative Models for Manifold-Based Defenses | Unknown | N/A | |
| Probabilistic Connection Importance Inference and Lossless Compression of Deep Neural Networks | Unknown | N/A | |
| Precision Gating: Improving Neural Network Efficiency with Dynamic Dual-Precision Activations | Unknown | N/A | |
| Robust anomaly detection and backdoor attack detection via differential privacy | Unknown | N/A | |
| Finding and Visualizing Weaknesses of Deep Reinforcement Learning Agents | Unknown | N/A | |
| Conditional Learning of Fair Representations | Unknown | N/A | |
| Infinite-Horizon Differentiable Model Predictive Control | Unknown | N/A | |
| Measuring the Reliability of Reinforcement Learning Algorithms | Unknown | N/A | |
| Disagreement-Regularized Imitation Learning | Unknown | N/A | |
| Playing the lottery with rewards and multiple languages: lottery tickets in RL and NLP | Unknown | N/A | |
| NAS evaluation is frustratingly hard | Unknown | N/A | |
| Curvature Graph Network | Unknown | N/A | |
| Compositional languages emerge in a neural iterated learning model | Unknown | N/A | |
| Depth-Adaptive Transformer | Unknown | N/A | |
| Plug and Play Language Models: A Simple Approach to Controlled Text Generation | Unknown | N/A | |
| NAS-Bench-1Shot1: Benchmarking and Dissecting One-shot Neural Architecture Search | Unknown | N/A | |
| Self-Adversarial Learning with Comparative Discrimination for Text Generation | Unknown | N/A | |
| Model Based Reinforcement Learning for Atari | Unknown | N/A | |
| Stochastic AUC Maximization with Deep Neural Networks | Unknown | N/A | |
| Compression based bound for non-compressed network: unified generalization error analysis of large compressible deep neural network | Unknown | N/A | |
| Toward Evaluating Robustness of Deep Reinforcement Learning with Continuous Control | Unknown | N/A | |
| GAT: Generative Adversarial Training for Adversarial Example Detection and Classification | Unknown | N/A | |
| DeepSphere: a graph-based spherical CNN | Unknown | N/A | |
| Learning-Augmented Data Stream Algorithms | Unknown | N/A | |
| Disentanglement by Nonlinear ICA with General Incompressible-flow Networks (GIN) | Unknown | N/A | |
| Drawing Early-Bird Tickets: Toward More Efficient Training of Deep Networks | Unknown | N/A | |
| The Shape of Data: Intrinsic Distance for Data Distributions | Unknown | N/A | |
| Implementation Matters in Deep RL: A Case Study on PPO and TRPO | Unknown | N/A | |
| Skip Connections Matter: On the Transferability of Adversarial Examples Generated with ResNets | Unknown | N/A | |
| Universal Approximation with Certified Networks | Unknown | N/A | |
| Deep Semi-Supervised Anomaly Detection | Unknown | N/A | |
| BayesOpt Adversarial Attack | Unknown | N/A | |
| Encoding word order in complex embeddings | Unknown | N/A | |
| N-BEATS: Neural basis expansion analysis for interpretable time series forecasting | Unknown | N/A | |
| Learning to Balance: Bayesian Meta-Learning for Imbalanced and Out-of-distribution Tasks | Unknown | N/A | |
| To Relieve Your Headache of Training an MRF, Take AdVIL | Unknown | N/A | |
| Learning to Link | Unknown | N/A | |
| Federated Learning with Matched Averaging | Unknown | N/A | |
| BERTScore: Evaluating Text Generation with BERT | Unknown | N/A | |
| Dream to Control: Learning Behaviors by Latent Imagination | Unknown | N/A | |
| What graph neural networks cannot learn: depth vs width | Unknown | N/A | |
| Provable Benefit of Orthogonal Initialization in Optimizing Deep Linear Networks | Unknown | N/A | |
| Efficient Probabilistic Logic Reasoning with Graph Neural Networks | Unknown | N/A | |
| Breaking Certified Defenses: Semantic Adversarial Examples With Spoofed Robustness Certificates | Unknown | N/A | |
| Biologically inspired sleep algorithm for increased generalization and adversarial robustness in deep neural networks | Unknown | N/A | |
| Deep neuroethology of a virtual rodent | Unknown | N/A | |
| DeepHoyer: Learning Sparser Neural Network with Differentiable Scale-Invariant Sparsity Measures | Unknown | N/A | |
| Pseudo-LiDAR++: Accurate Depth for 3D Object Detection in Autonomous Driving | Unknown | N/A | |
| Dynamic Time Lag Regression: Predicting What & When | Unknown | N/A | |
| On Mutual Information Maximization for Representation Learning | Unknown | N/A | |
| Lite Transformer with Long-Short Range Attention | Unknown | N/A | |
| Adversarial Policies: Attacking Deep Reinforcement Learning | Unknown | N/A | |
| A critical analysis of self-supervision, or what we can learn from a single image | Unknown | N/A | |
| Discovering Motor Programs by Recomposing Demonstrations | Unknown | N/A | |
| PairNorm: Tackling Oversmoothing in GNNs | Unknown | N/A | |
| On the Global Convergence of Training Deep Linear ResNets | Unknown | N/A | |
| Defending Against Physically Realizable Attacks on Image Classification | Unknown | N/A | |
| Learning to Coordinate Manipulation Skills via Skill Behavior Diversification | Unknown | N/A | |
| Learning the Arrow of Time for Problems in Reinforcement Learning | Unknown | N/A | |
| Deep probabilistic subsampling for task-adaptive compressed sensing | Unknown | N/A | |
| Nesterov Accelerated Gradient and Scale Invariance for Adversarial Attacks | Unknown | N/A | |
| Multiplicative Interactions and Where to Find Them | Unknown | N/A | |
| And the Bit Goes Down: Revisiting the Quantization of Neural Networks | Unknown | N/A | |
| Exploratory Not Explanatory: Counterfactual Analysis of Saliency Maps for Deep Reinforcement Learning | Unknown | N/A | |
| Curriculum Loss: Robust Learning and Generalization against Label Corruption | Unknown | N/A | |
| PC-DARTS: Partial Channel Connections for Memory-Efficient Architecture Search | Unknown | N/A | |
| On the Weaknesses of Reinforcement Learning for Neural Machine Translation | Unknown | N/A | |
| Contrastive Representation Distillation | Unknown | N/A | |
| Dynamic Model Pruning with Feedback | Unknown | N/A | |
| Sign Bits Are All You Need for Black-Box Attacks | Unknown | N/A | |
| Building Deep Equivariant Capsule Networks | Unknown | N/A | |
| U-GAT-IT: Unsupervised Generative Attentional Networks with Adaptive Layer-Instance Normalization for Image-to-Image Translation | Unknown | N/A | |
| B-Spline CNNs on Lie groups | Unknown | N/A | |
| Span Recovery for Deep Neural Networks with Applications to Input Obfuscation | Unknown | N/A | |
| Pretrained Encyclopedia: Weakly Supervised Knowledge-Pretrained Language Model | Unknown | N/A | |
| Sliced Cramer Synaptic Consolidation for Preserving Deeply Learned Representations | Unknown | N/A | |
| Critical initialisation in continuous approximations of binary neural networks | Unknown | N/A | |
| Learn to Explain Efficiently via Neural Logic Inductive Learning | Unknown | N/A | |
| Transformer-XH: Multi-Evidence Reasoning with eXtra Hop Attention | Unknown | N/A | |
| SpikeGrad: An ANN-equivalent Computation Model for Implementing Backpropagation with Spikes | Unknown | N/A | |
| Massively Multilingual Sparse Word Representations | Unknown | N/A | |
| Deep Audio Priors Emerge From Harmonic Convolutional Networks | Unknown | N/A | |
| The Local Elasticity of Neural Networks | Unknown | N/A | |
| RIDE: Rewarding Impact-Driven Exploration for Procedurally-Generated Environments | Unknown | N/A | |
| Uncertainty-guided Continual Learning with Bayesian Neural Networks | Unknown | N/A | |
| Differentiable Reasoning over a Virtual Knowledge Base | Unknown | N/A | |
| Understanding and Improving Information Transfer in Multi-Task Learning | Unknown | N/A | |
| StructPool: Structured Graph Pooling via Conditional Random Fields | Unknown | N/A | |
| Generative Models for Effective ML on Private, Decentralized Datasets | Unknown | N/A | |
| Rapid Learning or Feature Reuse? Towards Understanding the Effectiveness of MAML | Unknown | N/A | |
| Energy-based models for atomic-resolution protein conformations | Unknown | N/A | |
| Abductive Commonsense Reasoning | Unknown | N/A | |
| Training binary neural networks with real-to-binary convolutions | Unknown | N/A | |
| Maxmin Q-learning: Controlling the Estimation Bias of Q-learning | Unknown | N/A | |
| Discriminative Particle Filter Reinforcement Learning for Complex Partial observations | Unknown | N/A | |
| Thieves on Sesame Street! Model Extraction of BERT-based APIs | Unknown | N/A | |
| High Fidelity Speech Synthesis with Adversarial Networks | Unknown | N/A | |
| Program Guided Agent | Unknown | N/A | |
| Sharing Knowledge in Multi-Task Deep Reinforcement Learning | Unknown | N/A | |
| Controlling generative models with continuous factors of variations | Unknown | N/A | |
| Federated Adversarial Domain Adaptation | Unknown | N/A | |
| Symplectic ODE-Net: Learning Hamiltonian Dynamics with Control | Unknown | N/A | |
| AdvectiveNet: An Eulerian-Lagrangian Fluidic Reservoir for Point Cloud Processing | Unknown | N/A | |
| State-only Imitation with Transition Dynamics Mismatch | Unknown | N/A | |
| Lipschitz constant estimation of Neural Networks via sparse polynomial optimization | Unknown | N/A | |
| GraphAF: a Flow-based Autoregressive Model for Molecular Graph Generation | Unknown | N/A | |
| A Fair Comparison of Graph Neural Networks for Graph Classification | Unknown | N/A | |
| Deep Imitative Models for Flexible Inference, Planning, and Control | Unknown | N/A | |
| Augmenting Non-Collaborative Dialog Systems with Explicit Semantic and Strategic Dialog History | Unknown | N/A | |
| Distance-Based Learning from Errors for Confidence Calibration | Unknown | N/A | |
| Training Recurrent Neural Networks Online by Learning Explicit State Variables | Unknown | N/A | |
| Query2box: Reasoning over Knowledge Graphs in Vector Space Using Box Embeddings | Unknown | N/A | |
| CoPhy: Counterfactual Learning of Physical Dynamics | Unknown | N/A | |
| ES-MAML: Simple Hessian-Free Meta Learning | Unknown | N/A | |
| Combining Q-Learning and Search with Amortized Value Estimates | Unknown | N/A | |
| Budgeted Training: Rethinking Deep Neural Network Training Under Resource Constraints | Unknown | N/A | |
| Are Pre-trained Language Models Aware of Phrases? Simple but Strong Baselines for Grammar Induction | Unknown | N/A | |
| Additive Powers-of-Two Quantization: An Efficient Non-uniform Discretization for Neural Networks | Unknown | N/A | |
| Towards Stable and Efficient Training of Verifiably Robust Neural Networks | Unknown | N/A | |
| Neural Policy Gradient Methods: Global Optimality and Rates of Convergence | Unknown | N/A | |
| The Break-Even Point on Optimization Trajectories of Deep Neural Networks | Unknown | N/A | |
| Adjustable Real-time Style Transfer | Unknown | N/A | |
| vq-wav2vec: Self-Supervised Learning of Discrete Speech Representations | Unknown | N/A | |
| Learned Step Size Quantization | Unknown | N/A | |
| Low-dimensional statistical manifold embedding of directed graphs | Unknown | N/A | |
| Query-efficient Meta Attack to Deep Neural Networks | Unknown | N/A | |
| Efficient Riemannian Optimization on the Stiefel Manifold via the Cayley Transform | Unknown | N/A | |
| Learning Expensive Coordination: An Event-Based Deep RL Approach | Unknown | N/A | |
| Adversarial Lipschitz Regularization | Unknown | N/A | |
| Provable robustness against all adversarial $l_p$-perturbations for $p\geq 1$ | Unknown | N/A | |
| Effect of Activation Functions on the Training of Overparametrized Neural Nets | Unknown | N/A | |
| Transferring Optimality Across Data Distributions via Homotopy Methods | Unknown | N/A | |
| Population-Guided Parallel Policy Search for Reinforcement Learning | Unknown | N/A | |
| Explanation by Progressive Exaggeration | Unknown | N/A | |
| Enhancing Adversarial Defense by k-Winners-Take-All | Unknown | N/A | |
| Enhancing Transformation-Based Defenses Against Adversarial Attacks with a Distribution Classifier | Unknown | N/A | |
| Making Sense of Reinforcement Learning and Probabilistic Inference | Unknown | N/A | |
| Tree-Structured Attention with Hierarchical Accumulation | Unknown | N/A | |
| Optimal Strategies Against Generative Attacks | Unknown | N/A | |
| Deep Network Classification by Scattering and Homotopy Dictionary Learning | Unknown | N/A | |
| Physics-aware Difference Graph Networks for Sparsely-Observed Dynamics | Unknown | N/A | |
| Mixout: Effective Regularization to Finetune Large-scale Pretrained Language Models | Unknown | N/A | |
| On Universal Equivariant Set Networks | Unknown | N/A | |
| Neural tangent kernels, transportation mappings, and universal approximation | Unknown | N/A | |
| CLN2INV: Learning Loop Invariants with Continuous Logic Networks | Unknown | N/A | |
| Neural Epitome Search for Architecture-Agnostic Network Compression | Unknown | N/A | |
| Episodic Reinforcement Learning with Associative Memory | Unknown | N/A | |
| Improving Neural Language Generation with Spectrum Control | Unknown | N/A | |
| In Search for a SAT-friendly Binarized Neural Network Architecture | Unknown | N/A | |
| CLEVRER: Collision Events for Video Representation and Reasoning | Unknown | N/A | |
| Understanding Generalization in Recurrent Neural Networks | Unknown | N/A | |
| Harnessing Structures for Value-Based Planning and Reinforcement Learning | Unknown | N/A | |
| Multilingual Alignment of Contextual Word Representations | Unknown | N/A | |
| Mathematical Reasoning in Latent Space | Unknown | N/A | |
| Smooth markets: A basic mechanism for organizing gradient-based learners | Unknown | N/A | |
| Cross-lingual Alignment vs Joint Training: A Comparative Study and A Simple Unified Framework | Unknown | N/A | |
| Learning to solve the credit assignment problem | Unknown | N/A | |
| Permutation Equivariant Models for Compositional Generalization in Language | Unknown | N/A | |
| The Logical Expressiveness of Graph Neural Networks | Unknown | N/A | |
| Reducing Transformer Depth on Demand with Structured Dropout | Unknown | N/A | |
| On Identifiability in Transformers | Unknown | N/A | |
| Overlearning Reveals Sensitive Attributes | Unknown | N/A | |
| MACER: Attack-free and Scalable Robust Training via Maximizing Certified Radius | Unknown | N/A | |
| Meta-Learning Acquisition Functions for Transfer Learning in Bayesian Optimization | Unknown | N/A | |
| Sample Efficient Policy Gradient Methods with Recursive Variance Reduction | Unknown | N/A | |
| Mutual Information Gradient Estimation for Representation Learning | Unknown | N/A | |
| CM3: Cooperative Multi-goal Multi-stage Multi-agent Reinforcement Learning | Unknown | N/A | |
| Lagrangian Fluid Simulation with Continuous Convolutions | Unknown | N/A | |
| Pruned Graph Scattering Transforms | Unknown | N/A | |
| Influence-Based Multi-Agent Exploration | Unknown | N/A | |
| Learning Self-Correctable Policies and Value Functions from Demonstrations with Negative Sampling | Unknown | N/A | |
| DD-PPO: Learning Near-Perfect PointGoal Navigators from 2.5 Billion Frames | Unknown | N/A | |
| Four Things Everyone Should Know to Improve Batch Normalization | Unknown | N/A | |
| On Bonus Based Exploration Methods In The Arcade Learning Environment | Unknown | N/A | |
| Weakly Supervised Clustering by Exploiting Unique Class Count | Unknown | N/A | |
| Theory and Evaluation Metrics for Learning Disentangled Representations | Unknown | N/A | |
| A Target-Agnostic Attack on Deep Models: Exploiting Security Vulnerabilities of Transfer Learning | Unknown | N/A | |
| Rotation-invariant clustering of neuronal responses in primary visual cortex | Unknown | N/A | |
| Hierarchical Foresight: Self-Supervised Learning of Long-Horizon Tasks via Visual Subgoal Generation | Unknown | N/A | |
| EMPIR: Ensembles of Mixed Precision Deep Networks for Increased Robustness Against Adversarial Attacks | Unknown | N/A | |
| Intrinsic Motivation for Encouraging Synergistic Behavior | Unknown | N/A | |
| An Inductive Bias for Distances: Neural Nets that Respect the Triangle Inequality | Unknown | N/A | |
| Towards Hierarchical Importance Attribution: Explaining Compositional Semantics for Neural Sequence Models | Unknown | N/A | |
| Data-dependent Gaussian Prior Objective for Language Generation | Unknown | N/A | |
| Intensity-Free Learning of Temporal Point Processes | Unknown | N/A | |
| GraphZoom: A Multi-level Spectral Approach for Accurate and Scalable Graph Embedding | Unknown | N/A | |
| Masked Based Unsupervised Content Transfer | Unknown | N/A | |
| Adaptive Correlated Monte Carlo for Contextual Categorical Sequence Generation | Unknown | N/A | |
| Reinforcement Learning with Competitive Ensembles of Information-Constrained Primitives | Unknown | N/A | |
| A Meta-Transfer Objective for Learning to Disentangle Causal Mechanisms | Unknown | N/A | |
| Beyond Linearization: On Quadratic and Higher-Order Approximation of Wide Neural Networks | Unknown | N/A | |
| Truth or backpropaganda? An empirical investigation of deep learning theory | Unknown | N/A | |
| Mixup Inference: Better Exploiting Mixup to Defend Adversarial Attacks | Unknown | N/A | |
| Exploring Model-based Planning with Policy Networks | Unknown | N/A | |
| Deformable Kernels: Adapting Effective Receptive Fields for Object Deformation | Unknown | N/A | |
| Improving Adversarial Robustness Requires Revisiting Misclassified Examples | Unknown | N/A | |
| Evolutionary Population Curriculum for Scaling Multi-Agent Reinforcement Learning | Unknown | N/A | |
| Network Randomization: A Simple Technique for Generalization in Deep Reinforcement Learning | Unknown | N/A | |
| Coherent Gradients: An Approach to Understanding Generalization in Gradient Descent-based Optimization | Unknown | N/A | |
| FasterSeg: Searching for Faster Real-time Semantic Segmentation | Unknown | N/A | |
| Jacobian Adversarially Regularized Networks for Robustness | Unknown | N/A | |
| Distributed Bandit Learning: Near-Optimal Regret with Efficient Communication | Unknown | N/A | |
| Cross-Lingual Ability of Multilingual BERT: An Empirical Study | Unknown | N/A | |
| DivideMix: Learning with Noisy Labels as Semi-supervised Learning | Unknown | N/A | |
| Quantifying the Cost of Reliable Photo Authentication via High-Performance Learned Lossy Representations | Unknown | N/A | |
| VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning | Unknown | N/A | |
| Monotonic Multihead Attention | Unknown | N/A | |
| Residual Energy-Based Models for Text Generation | Unknown | N/A | |
| Fantastic Generalization Measures and Where to Find Them | Unknown | N/A | |
| Adversarially robust transfer learning | Unknown | N/A | |
| Adversarially Robust Representations with Smooth Encoders | Unknown | N/A | |
| On Generalization Error Bounds of Noisy Gradient Methods for Non-Convex Learning | Unknown | N/A | |
| Graph inference learning for semi-supervised classification | Unknown | N/A | |
| Discrepancy Ratio: Evaluating Model Performance When Even Experts Disagree on the Truth | Unknown | N/A | |
| Revisiting Self-Training for Neural Sequence Generation | Unknown | N/A | |
| Imitation Learning via Off-Policy Distribution Matching | Unknown | N/A | |
| Iterative energy-based projection on a normal data manifold for anomaly localization | Unknown | N/A | |
| A Closer Look at Deep Policy Gradients | Unknown | N/A | |
| Tensor Decompositions for Temporal Knowledge Base Completion | Unknown | N/A | |
| Progressive Memory Banks for Incremental Domain Adaptation | Unknown | N/A | |
| IMPACT: Importance Weighted Asynchronous Architectures with Clipped Target Networks | Unknown | N/A | |
| AtomNAS: Fine-Grained End-to-End Neural Architecture Search | Unknown | N/A | |
| Black-box Off-policy Estimation for Infinite-Horizon Reinforcement Learning | Unknown | N/A | |
| The Ingredients of Real World Robotic Reinforcement Learning | Unknown | N/A | |
| Frequency-based Search-control in Dyna | Unknown | N/A | |
| Exploration in Reinforcement Learning with Deep Covering Options | Unknown | N/A | |
| Projection-Based Constrained Policy Optimization | Unknown | N/A | |
| On Robustness of Neural Ordinary Differential Equations | Unknown | N/A | |
| Generalization bounds for deep convolutional neural networks | Unknown | N/A | |
| Learning to Control PDEs with Differentiable Physics | Unknown | N/A | |
| Real or Not Real, that is the Question | Unknown | N/A | |
| Deep Batch Active Learning by Diverse, Uncertain Gradient Lower Bounds | Unknown | N/A | |
| MMA Training: Direct Input Space Margin Maximization through Adversarial Training | Unknown | N/A | |
| Editable Neural Networks | Unknown | N/A | |
| Learning to Move with Affordance Maps | Unknown | N/A | |
| Model-Augmented Actor-Critic: Backpropagating through Paths | Unknown | N/A | |
| Multi-Agent Interactions Modeling with Correlated Policies | Unknown | N/A | |
| Model-based reinforcement learning for biological sequence design | Unknown | N/A | |
| Intriguing Properties of Adversarial Training at Scale | Unknown | N/A | |
| Behaviour Suite for Reinforcement Learning | Unknown | N/A | |
| Meta-Q-Learning | Unknown | N/A | |
| Deep Double Descent: Where Bigger Models and More Data Hurt | Unknown | N/A | |
| Understanding Architectures Learnt by Cell-based Neural Architecture Search | Unknown | N/A | |
| Extreme Tensoring for Low-Memory Preconditioning | Unknown | N/A | |
| Differentiable learning of numerical rules in knowledge graphs | Unknown | N/A | |
| Neural Network Branching for Neural Network Verification | Unknown | N/A | |
| Learning representations for binary-classification without backpropagation | Unknown | N/A | |
| NAS-Bench-201: Extending the Scope of Reproducible Neural Architecture Search | Unknown | N/A | |
| Why Gradient Clipping Accelerates Training: A Theoretical Justification for Adaptivity | Unknown | N/A | |
| Robust Local Features for Improving the Generalization of Adversarial Training | Unknown | N/A | |
| Shifted and Squeezed 8-bit Floating Point format for Low-Precision Training of Deep Neural Networks | Unknown | N/A | |
| A Neural Dirichlet Process Mixture Model for Task-Free Continual Learning | Unknown | N/A | |
| Learning Space Partitions for Nearest Neighbor Search | Unknown | N/A | |
| Geometric Analysis of Nonconvex Optimization Landscapes for Overcomplete Learning | Unknown | N/A | |
| Principled Weight Initialization for Hypernetworks | Unknown | N/A | |
| Order Learning and Its Application to Age Estimation | Unknown | N/A | |
| Recurrent neural circuits for contour detection | Unknown | N/A | |
| Hyper-SAGNN: a self-attention based graph neural network for hypergraphs | Unknown | N/A | |
| Detecting and Diagnosing Adversarial Images with Class-Conditional Capsule Reconstructions | Unknown | N/A | |
| DiffTaichi: Differentiable Programming for Physical Simulation | Unknown | N/A | |
| Efficient and Information-Preserving Future Frame Prediction and Beyond | Unknown | N/A | |
| Meta-Learning Deep Energy-Based Memory Models | Unknown | N/A | |
| Bayesian Meta Sampling for Fast Uncertainty Adaptation | Unknown | N/A | |
| Restricting the Flow: Information Bottlenecks for Attribution | Unknown | N/A | |
| Multi-agent Reinforcement Learning for Networked System Control | Unknown | N/A | |
| Composition-based Multi-Relational Graph Convolutional Networks | Unknown | N/A | |
| Gradient $\ell_1$ Regularization for Quantization Robustness | Unknown | N/A | |
| Towards neural networks that provably know when they don't know | Unknown | N/A | |
| Reanalysis of Variance Reduced Temporal Difference Learning | Unknown | N/A | |
| Quantum Algorithms for Deep Convolutional Neural Networks | Unknown | N/A | |
| Inductive representation learning on temporal graphs | Unknown | N/A | |
| Bounds on Over-Parameterization for Guaranteed Existence of Descent Paths in Shallow ReLU Networks | Unknown | N/A | |
| Consistency Regularization for Generative Adversarial Networks | Unknown | N/A | |
| Chameleon: Adaptive Code Optimization for Expedited Deep Neural Network Compilation | Unknown | N/A | |
| Learning to Guide Random Search | Unknown | N/A | |
| Emergent Tool Use From Multi-Agent Autocurricula | Unknown | N/A | |
| Can gradient clipping mitigate label noise? | Unknown | N/A | |
| From Inference to Generation: End-to-end Fully Self-supervised Generation of Human Face from Speech | Unknown | N/A | |
| LAMOL: LAnguage MOdeling for Lifelong Language Learning | Unknown | N/A | |
| Sparse Coding with Gated Learned ISTA | Unknown | N/A | |
| FSPool: Learning Set Representations with Featurewise Sort Pooling | Unknown | N/A | |
| Pre-training Tasks for Embedding-based Large-scale Retrieval | Unknown | N/A | |
| Robust And Interpretable Blind Image Denoising Via Bias-Free Convolutional Neural Networks | Unknown | N/A | |
| DeepV2D: Video to Depth with Differentiable Structure from Motion | Unknown | N/A | |
| AMRL: Aggregated Memory For Reinforcement Learning | Unknown | N/A | |
| Learning to Represent Programs with Property Signatures | Unknown | N/A | |
| V4D: 4D Convolutional Neural Networks for Video-level Representation Learning | Unknown | N/A | |
| Selection via Proxy: Efficient Data Selection for Deep Learning | Unknown | N/A | |
| PCMC-Net: Feature-based Pairwise Choice Markov Chains | Unknown | N/A | |
| BackPACK: Packing more into Backprop | Unknown | N/A | |
| Kernel of CycleGAN as a principal homogeneous space | Unknown | N/A | |
| Higher-Order Function Networks for Learning Composable 3D Object Representations | Unknown | N/A | |
| Decoding As Dynamic Programming For Recurrent Autoregressive Models | Unknown | N/A | |
| Variational Recurrent Models for Solving Partially Observable Control Tasks | Unknown | N/A | |
| Learning Heuristics for Quantified Boolean Formulas through Reinforcement Learning | Unknown | N/A | |
| VL-BERT: Pre-training of Generic Visual-Linguistic Representations | Unknown | N/A | |
| On the Convergence of FedAvg on Non-IID Data | Unknown | N/A | |
| Never Give Up: Learning Directed Exploration Strategies | Unknown | N/A | |
| Depth-Width Trade-offs for ReLU Networks via Sharkovsky's Theorem | Unknown | N/A | |
| Deep Learning For Symbolic Mathematics | Unknown | N/A | |
| Incorporating BERT into Neural Machine Translation | Unknown | N/A | |
| Escaping Saddle Points Faster with Stochastic Momentum | Unknown | N/A | |
| Learning from Unlabelled Videos Using Contrastive Predictive Neural 3D Mapping | Unknown | N/A | |
| Unpaired Point Cloud Completion on Real Scans using Adversarial Training | Unknown | N/A | |
| Dynamic Sparse Training: Find Efficient Sparse Network From Scratch With Trainable Masked Layers | Unknown | N/A | |
| ProxSGD: Training Structured Neural Networks under Regularization and Constraints | Unknown | N/A | |
| Single Episode Policy Transfer in Reinforcement Learning | Unknown | N/A | |
| Don't Use Large Mini-batches, Use Local SGD | Unknown | N/A | |
| Disentangling Factors of Variations Using Few Labels | Unknown | N/A | |
| A Latent Morphology Model for Open-Vocabulary Neural Machine Translation | Unknown | N/A | |
| Stable Rank Normalization for Improved Generalization in Neural Networks and GANs | Unknown | N/A | |
| A Framework for Robustness Certification of Smoothed Classifiers Using F-Divergences | Unknown | N/A | |
| Co-Attentive Equivariant Neural Networks: Focusing Equivariance On Transformations Co-Occurring in Data | Unknown | N/A | |
| Gradients as Features for Deep Representation Learning | Unknown | N/A | |
| Multi-Scale Representation Learning for Spatial Feature Distributions using Grid Cells | Unknown | N/A | |
| Large Batch Optimization for Deep Learning: Training BERT in 76 minutes | Unknown | N/A | |
| The asymptotic spectrum of the Hessian of DNN throughout training | Unknown | N/A | |
| Detecting Extrapolation with Local Ensembles | Unknown | N/A | |
| Spectral Embedding of Regularized Block Models | Unknown | N/A | |
| Empirical Bayes Transductive Meta-Learning with Synthetic Gradients | Unknown | N/A | |
| How much Position Information Do Convolutional Neural Networks Encode? | Unknown | N/A | |
| RGBD-GAN: Unsupervised 3D Representation Learning From Natural Image Datasets via RGBD Image Synthesis | Unknown | N/A | |
| Weakly Supervised Disentanglement with Guarantees | Unknown | N/A | |
| Directional Message Passing for Molecular Graphs | Unknown | N/A | |
| Information Geometry of Orthogonal Initializations and Training | Unknown | N/A | |
| BatchEnsemble: an Alternative Approach to Efficient Ensemble and Lifelong Learning | Unknown | N/A | |
| Minimizing FLOPs to Learn Efficient Sparse Representations | Unknown | N/A | |
| Quantifying Point-Prediction Uncertainty in Neural Networks via Residual Estimation with an I/O Kernel | Unknown | N/A | |
| A Mutual Information Maximization Perspective of Language Representation Learning | Unknown | N/A | |
| ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators | Unknown | N/A | |
| Diverse Trajectory Forecasting with Determinantal Point Processes | Unknown | N/A | |
| Graph Convolutional Reinforcement Learning | Unknown | N/A | |
| Picking Winning Tickets Before Training by Preserving Gradient Flow | Unknown | N/A | |
| Unsupervised Clustering using Pseudo-semi-supervised Learning | Unknown | N/A | |
| Learning transport cost from subset correspondence | Unknown | N/A | |
| Scalable and Order-robust Continual Learning with Additive Parameter Decomposition | Unknown | N/A | |
| Scaling Autoregressive Video Models | Unknown | N/A | |
| GENESIS: Generative Scene Inference and Sampling with Object-Centric Latent Representations | Unknown | N/A | |
| Understanding Why Neural Networks Generalize Well Through GSNR of Parameters | Unknown | N/A | |
| Linear Symmetric Quantization of Neural Networks for Low-precision Integer Hardware | Unknown | N/A | |
| Reformer: The Efficient Transformer | Unknown | N/A | |
| Automatically Discovering and Learning New Visual Categories with Ranking Statistics | Unknown | N/A | |
| Difference-Seeking Generative Adversarial Network--Unseen Sample Generation | Unknown | N/A | |
| Comparing Rewinding and Fine-tuning in Neural Network Pruning | Unknown | N/A | |
| I Am Going MAD: Maximum Discrepancy Competition for Comparing Classifiers Adaptively | Unknown | N/A | |
| Are Transformers universal approximators of sequence-to-sequence functions? | Unknown | N/A | |
| StructBERT: Incorporating Language Structures into Pre-training for Deep Language Understanding | Unknown | N/A | |
| Enabling Deep Spiking Neural Networks with Hybrid Conversion and Spike Timing Dependent Backpropagation | Unknown | N/A | |
| RTFM: Generalising to New Environment Dynamics via Reading | Unknown | N/A | |
| Certified Robustness for Top-k Predictions against Adversarial Perturbations via Randomized Smoothing | Unknown | N/A | |
| Triple Wins: Boosting Accuracy, Robustness and Efficiency Together by Enabling Input-Adaptive Inference | Unknown | N/A | |
| Analysis of Video Feature Learning in Two-Stream CNNs on the Example of Zebrafish Swim Bout Classification | Unknown | N/A | |
| You CAN Teach an Old Dog New Tricks! On Training Knowledge Graph Embeddings | Unknown | N/A | |
| SUMO: Unbiased Estimation of Log Marginal Probability for Latent Variable Models | Unknown | N/A | |
| Understanding l4-based Dictionary Learning: Interpretation, Stability, and Robustness | Unknown | N/A | |
| Inductive Matrix Completion Based on Graph Neural Networks | Unknown | N/A | |
| LambdaNet: Probabilistic Type Inference using Graph Neural Networks | Unknown | N/A | |
| Latent Normalizing Flows for Many-to-Many Cross-Domain Mappings | Unknown | N/A | |
| Conservative Uncertainty Estimation By Fitting Prior Networks | Unknown | N/A | |
| Compressive Transformers for Long-Range Sequence Modelling | Unknown | N/A | |
| word2ket: Space-efficient Word Embeddings inspired by Quantum Entanglement | Unknown | N/A | |
| Differentially Private Meta-Learning | Unknown | N/A | |
| Adaptive Structural Fingerprints for Graph Attention Networks | Unknown | N/A | |
| Neural Oblivious Decision Ensembles for Deep Learning on Tabular Data | Unknown | N/A | |
| InfoGraph: Unsupervised and Semi-supervised Graph-Level Representation Learning via Mutual Information Maximization | Unknown | N/A | |
| Infinite-horizon Off-Policy Policy Evaluation with Multiple Behavior Policies | Unknown | N/A | |
| Learning to Learn by Zeroth-Order Oracle | Unknown | N/A | |
| RaCT: Toward Amortized Ranking-Critical Training For Collaborative Filtering | Unknown | N/A | |
| Training Generative Adversarial Networks from Incomplete Observations using Factorised Discriminators | Unknown | N/A | |
| Reinforcement Learning Based Graph-to-Sequence Model for Natural Question Generation | Unknown | N/A | |
| Learning The Difference That Makes A Difference With Counterfactually-Augmented Data | Unknown | N/A | |
| White Noise Analysis of Neural Networks | Unknown | N/A | |
| Intrinsically Motivated Discovery of Diverse Patterns in Self-Organizing Systems | Unknown | N/A | |
| Neural Outlier Rejection for Self-Supervised Keypoint Learning | Unknown | N/A | |
| Estimating Gradients for Discrete Random Variables by Sampling without Replacement | Unknown | N/A | |
| Cross-Domain Few-Shot Classification via Learned Feature-Wise Transformation | Unknown | N/A | |
| Reinforced active learning for image segmentation | Unknown | N/A | |
| On Solving Minimax Optimization Locally: A Follow-the-Ridge Approach | Unknown | N/A | |
| A Stochastic Derivative Free Optimization Method with Momentum | Unknown | N/A | |
| Input Complexity and Out-of-distribution Detection with Likelihood-based Generative Models | Unknown | N/A | |
| Learning Hierarchical Discrete Linguistic Units from Visually-Grounded Speech | Unknown | N/A | |
| Sequential Latent Knowledge Selection for Knowledge-Grounded Dialogue | Unknown | N/A | |
| Watch the Unobserved: A Simple Approach to Parallelizing Monte Carlo Tree Search | Unknown | N/A | |
| Physics-as-Inverse-Graphics: Unsupervised Physical Parameter Estimation from Video | Unknown | N/A | |
| Jelly Bean World: A Testbed for Never-Ending Learning | Unknown | N/A | |
| Posterior sampling for multi-agent reinforcement learning: solving extensive games with imperfect information | Unknown | N/A | |
| FreeLB: Enhanced Adversarial Training for Natural Language Understanding | Unknown | N/A | |
| Neural Module Networks for Reasoning over Text | Unknown | N/A | |
| SlowMo: Improving Communication-Efficient Distributed SGD with Slow Momentum | Unknown | N/A | |
| AutoQ: Automated Kernel-Wise Neural Network Quantization | Unknown | N/A | |
| V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete and Continuous Control | Unknown | N/A | |
| Meta-Learning without Memorization | Unknown | N/A | |
| DDSP: Differentiable Digital Signal Processing | Unknown | N/A | |
| A Function Space View of Bounded Norm Infinite Width ReLU Nets: The Multivariate Case | Unknown | N/A | |
| What Can Neural Networks Reason About? | Unknown | N/A | |
| MetaPix: Few-Shot Video Retargeting | Unknown | N/A | |
| Functional Regularisation for Continual Learning with Gaussian Processes | Unknown | N/A | |
| Identity Crisis: Memorization and Generalization Under Extreme Overparameterization | Unknown | N/A | |
| Probability Calibration for Knowledge Graph Embedding Models | Unknown | N/A | |
| Thinking While Moving: Deep Reinforcement Learning with Concurrent Control | Unknown | N/A | |
| Smoothness and Stability in GANs | Unknown | N/A | |
| From Variational to Deterministic Autoencoders | Unknown | N/A | |
| SNOW: Subscribing to Knowledge via Channel Pooling for Transfer & Lifelong Learning of Convolutional Neural Networks | Unknown | N/A | |
| Mutual Mean-Teaching: Pseudo Label Refinery for Unsupervised Domain Adaptation on Person Re-identification | Unknown | N/A | |
| CATER: A diagnostic dataset for Compositional Actions & TEmporal Reasoning | Unknown | N/A | |
| Harnessing the Power of Infinitely Wide Deep Nets on Small-data Tasks | Unknown | N/A | |
| How to 0wn the NAS in Your Spare Time | Unknown | N/A | |
| Variational Template Machine for Data-to-Text Generation | Unknown | N/A | |
| Deep Graph Matching Consensus | Unknown | N/A | |
| Certified Defenses for Adversarial Patches | Unknown | N/A | |
| The Curious Case of Neural Text Degeneration | Unknown | N/A | |
| Learning Nearly Decomposable Value Functions Via Communication Minimization | Unknown | N/A | |
| Short and Sparse Deconvolution --- A Geometric Approach | Unknown | N/A | |
| Deep 3D Pan via Local adaptive "t-shaped" convolutions with global and local adaptive dilations | Unknown | N/A | |
| Distributionally Robust Neural Networks | Unknown | N/A | |
| DeFINE: Deep Factorized Input Token Embeddings for Neural Sequence Modeling | Unknown | N/A | |
| Sign-OPT: A Query-Efficient Hard-label Adversarial Attack | Unknown | N/A | |
| Evaluating The Search Phase of Neural Architecture Search | Unknown | N/A | |
| A Baseline for Few-Shot Image Classification | Unknown | N/A | |
| Abstract Diagrammatic Reasoning with Multiplex Graph Networks | Unknown | N/A | |
| SNODE: Spectral Discretization of Neural ODEs for System Identification | Unknown | N/A | |
| On the "steerability" of generative adversarial networks | Unknown | N/A | |
| SVQN: Sequential Variational Soft Q-Learning Networks | Unknown | N/A | |
| Automated curriculum generation through setter-solver interactions | Unknown | N/A | |
| One-Shot Pruning of Recurrent Neural Networks by Jacobian Spectrum Evaluation | Unknown | N/A | |
| Synthesizing Programmatic Policies that Inductively Generalize | Unknown | N/A | |
| The Implicit Bias of Depth: How Incremental Learning Drives Generalization | Unknown | N/A | |
| Prediction Poisoning: Towards Defenses Against DNN Model Stealing Attacks | Unknown | N/A | |
| Double Neural Counterfactual Regret Minimization | Unknown | N/A | |
| Continual Learning with Adaptive Weights (CLAW) | Unknown | N/A | |
| SQIL: Imitation Learning via Reinforcement Learning with Sparse Rewards | Unknown | N/A | |
| Logic and the 2-Simplicial Transformer | Unknown | N/A | |
| Image-guided Neural Object Rendering | Unknown | N/A |
ICLR 2021
| Title | Author | PDF_Link | Code_URL |
|---|---|---|---|
| UMEC: Unified model and embedding compression for efficient recommendation systems | Unknown | N/A | |
| Adversarially-Trained Deep Nets Transfer Better: Illustration on Image Classification | Unknown | N/A | |
| ResNet After All: Neural ODEs and Their Numerical Solution | Unknown | N/A | |
| Deep Partition Aggregation: Provable Defenses against General Poisoning Attacks | Unknown | N/A | |
| MetaNorm: Learning to Normalize Few-Shot Batches Across Domains | Unknown | N/A | |
| Fidelity-based Deep Adiabatic Scheduling | Unknown | N/A | |
| Witches' Brew: Industrial Scale Data Poisoning via Gradient Matching | Unknown | N/A | |
| CPT: Efficient Deep Neural Network Training via Cyclic Precision | Unknown | N/A | |
| Tilted Empirical Risk Minimization | Unknown | N/A | |
| Improved Estimation of Concentration Under $\ell_p$-Norm Distance Metrics Using Half Spaces | Unknown | N/A | |
| ALFWorld: Aligning Text and Embodied Environments for Interactive Learning | Unknown | N/A | |
| SEDONA: Search for Decoupled Neural Networks toward Greedy Block-wise Learning | Unknown | N/A | |
| Learning to Generate 3D Shapes with Generative Cellular Automata | Unknown | N/A | |
| Sliced Kernelized Stein Discrepancy | Unknown | N/A | |
| Learning to Deceive Knowledge Graph Augmented Models via Targeted Perturbation | Unknown | N/A | |
| VA-RED$^2$: Video Adaptive Redundancy Reduction | Unknown | N/A | |
| On InstaHide, Phase Retrieval, and Sparse Matrix Factorization | Unknown | N/A | |
| Greedy-GQ with Variance Reduction: Finite-time Analysis and Improved Complexity | Unknown | N/A | |
| Statistical inference for individual fairness | Unknown | N/A | |
| Emergent Symbols through Binding in External Memory | Unknown | N/A | |
| Early Stopping in Deep Networks: Double Descent and How to Eliminate it | Unknown | N/A | |
| On the Universality of the Double Descent Peak in Ridgeless Regression | Unknown | N/A | |
| Evaluation of Neural Architectures Trained With Square Loss vs Cross-Entropy in Classification Tasks | Unknown | N/A | |
| Explainable Subgraph Reasoning for Forecasting on Temporal Knowledge Graphs | Unknown | N/A | |
| Simple Spectral Graph Convolution | Unknown | N/A | |
| PolarNet: Learning to Optimize Polar Keypoints for Keypoint Based Object Detection | Unknown | N/A | |
| Deconstructing the Regularization of BatchNorm | Unknown | N/A | |
| RMSprop converges with proper hyper-parameter | Unknown | N/A | |
| Generative Scene Graph Networks | Unknown | N/A | |
| Learnable Embedding sizes for Recommender Systems | Unknown | N/A | |
| Overfitting for Fun and Profit: Instance-Adaptive Data Compression | Unknown | N/A | |
| Acting in Delayed Environments with Non-Stationary Markov Policies | Unknown | N/A | |
| ARMOURED: Adversarially Robust MOdels using Unlabeled data by REgularizing Diversity | Unknown | N/A | |
| Solving Compositional Reinforcement Learning Problems via Task Reduction | Unknown | N/A | |
| A Geometric Analysis of Deep Generative Image Models and Its Applications | Unknown | N/A | |
| A Discriminative Gaussian Mixture Model with Sparsity | Unknown | N/A | |
| Interpretable Neural Architecture Search via Bayesian Optimisation with Weisfeiler-Lehman Kernels | Unknown | N/A | |
| Certify or Predict: Boosting Certified Robustness with Compositional Architectures | Unknown | N/A | |
| Taming GANs with Lookahead-Minmax | Unknown | N/A | |
| Bowtie Networks: Generative Modeling for Joint Few-Shot Recognition and Novel-View Synthesis | Unknown | N/A | |
| Learning Subgoal Representations with Slow Dynamics | Unknown | N/A | |
| GAN2GAN: Generative Noise Learning for Blind Denoising with Single Noisy Images | Unknown | N/A | |
| CO2: Consistent Contrast for Unsupervised Visual Representation Learning | Unknown | N/A | |
| CPR: Classifier-Projection Regularization for Continual Learning | Unknown | N/A | |
| MARS: Markov Molecular Sampling for Multi-objective Drug Discovery | Unknown | N/A | |
| Fooling a Complete Neural Network Verifier | Unknown | N/A | |
| Representation Learning via Invariant Causal Mechanisms | Unknown | N/A | |
| Interpreting and Boosting Dropout from a Game-Theoretic View | Unknown | N/A | |
| Quantifying Differences in Reward Functions | Unknown | N/A | |
| BOIL: Towards Representation Change for Few-shot Learning | Unknown | N/A | |
| Generating Adversarial Computer Programs using Optimized Obfuscations | Unknown | N/A | |
| On the Curse of Memory in Recurrent Neural Networks: Approximation and Optimization Analysis | Unknown | N/A | |
| Stabilized Medical Image Attacks | Unknown | N/A | |
| FOCAL: Efficient Fully-Offline Meta-Reinforcement Learning via Distance Metric Learning and Behavior Regularization | Unknown | N/A | |
| Seq2Tens: An Efficient Representation of Sequences by Low-Rank Tensor Projections | Unknown | N/A | |
| Memory Optimization for Deep Networks | Unknown | N/A | |
| Neural Jump Ordinary Differential Equations: Consistent Continuous-Time Prediction and Filtering | Unknown | N/A | |
| Identifying nonlinear dynamical systems with multiple time scales and long-range dependencies | Unknown | N/A | |
| Hyperbolic Neural Networks++ | Unknown | N/A | |
| Coupled Oscillatory Recurrent Neural Network (coRNN): An accurate and (gradient) stable architecture for learning long time dependencies | Unknown | N/A | |
| Spatially Structured Recurrent Modules | Unknown | N/A | |
| Teaching Temporal Logics to Neural Networks | Unknown | N/A | |
| Self-supervised Visual Reinforcement Learning with Object-centric Representations | Unknown | N/A | |
| On Self-Supervised Image Representations for GAN Evaluation | Unknown | N/A | |
| Robust Learning of Fixed-Structure Bayesian Networks in Nearly-Linear Time | Unknown | N/A | |
| Reset-Free Lifelong Learning with Skill-Space Planning | Unknown | N/A | |
| Fast Geometric Projections for Local Robustness Certification | Unknown | N/A | |
| TropEx: An Algorithm for Extracting Linear Terms in Deep Neural Networks | Unknown | N/A | |
| Adapting to Reward Progressivity via Spectral Reinforcement Learning | Unknown | N/A | |
| Decentralized Attribution of Generative Models | Unknown | N/A | |
| Combining Physics and Machine Learning for Network Flow Estimation | Unknown | N/A | |
| Large Batch Simulation for Deep Reinforcement Learning | Unknown | N/A | |
| Byzantine-Resilient Non-Convex Stochastic Gradient Descent | Unknown | N/A | |
| Accurate Learning of Graph Representations with Graph Multiset Pooling | Unknown | N/A | |
| GraPPa: Grammar-Augmented Pre-Training for Table Semantic Parsing | Unknown | N/A | |
| Improving Zero-Shot Voice Style Transfer via Disentangled Representation Learning | Unknown | N/A | |
| HyperDynamics: Meta-Learning Object and Agent Dynamics with Hypernetworks | Unknown | N/A | |
| Anytime Sampling for Autoregressive Models via Ordered Autoencoding | Unknown | N/A | |
| Disentangling 3D Prototypical Networks for Few-Shot Concept Learning | Unknown | N/A | |
| Iterated learning for emergent systematicity in VQA | Unknown | N/A | |
| Implicit Under-Parameterization Inhibits Data-Efficient Deep Reinforcement Learning | Unknown | N/A | |
| Generating Furry Cars: Disentangling Object Shape and Appearance across Multiple Domains | Unknown | N/A | |
| Why Are Convolutional Nets More Sample-Efficient than Fully-Connected Nets? | Unknown | N/A | |
| Learning to Make Decisions via Submodular Regularization | Unknown | N/A | |
| Adaptive and Generative Zero-Shot Learning | Unknown | N/A | |
| CompOFA – Compound Once-For-All Networks for Faster Multi-Platform Deployment | Unknown | N/A | |
| LambdaNetworks: Modeling long-range Interactions without Attention | Unknown | N/A | |
| Orthogonalizing Convolutional Layers with the Cayley Transform | Unknown | N/A | |
| Training BatchNorm and Only BatchNorm: On the Expressive Power of Random Features in CNNs | Unknown | N/A | |
| Gradient Projection Memory for Continual Learning | Unknown | N/A | |
| More or Less: When and How to Build Convolutional Neural Network Ensembles | Unknown | N/A | |
| Efficient Empowerment Estimation for Unsupervised Stabilization | Unknown | N/A | |
| A Panda? No, It's a Sloth: Slowdown Attacks on Adaptive Multi-Exit Neural Network Inference | Unknown | N/A | |
| Combining Ensembles and Data Augmentation Can Harm Your Calibration | Unknown | N/A | |
| Fourier Neural Operator for Parametric Partial Differential Equations | Unknown | N/A | |
| Saliency is a Possible Red Herring When Diagnosing Poor Generalization | Unknown | N/A | |
| Provably robust classification of adversarial examples with detection | Unknown | N/A | |
| SAFENet: A Secure, Accurate and Fast Neural Network Inference | Unknown | N/A | |
| MONGOOSE: A Learnable LSH Framework for Efficient Neural Network Training | Unknown | N/A | |
| Improved Autoregressive Modeling with Distribution Smoothing | Unknown | N/A | |
| Supervised Contrastive Learning for Pre-trained Language Model Fine-tuning | Unknown | N/A | |
| Combining Label Propagation and Simple Models out-performs Graph Neural Networks | Unknown | N/A | |
| Local Search Algorithms for Rank-Constrained Convex Optimization | Unknown | N/A | |
| Pre-training Text-to-Text Transformers for Concept-centric Common Sense | Unknown | N/A | |
| Decoupling Global and Local Representations via Invertible Generative Flows | Unknown | N/A | |
| SCoRe: Pre-Training for Context Representation in Conversational Semantic Parsing | Unknown | N/A | |
| Evaluating the Disentanglement of Deep Generative Models through Manifold Topology | Unknown | N/A | |
| End-to-End Egospheric Spatial Memory | Unknown | N/A | |
| Neural Approximate Sufficient Statistics for Implicit Models | Unknown | N/A | |
| Bypassing the Ambient Dimension: Private SGD with Gradient Subspace Identification | Unknown | N/A | |
| BREEDS: Benchmarks for Subpopulation Shift | Unknown | N/A | |
| PAC Confidence Predictions for Deep Neural Network Classifiers | Unknown | N/A | |
| Dance Revolution: Long-Term Dance Generation with Music via Curriculum Learning | Unknown | N/A | |
| Hopper: Multi-hop Transformer for Spatiotemporal Reasoning | Unknown | N/A | |
| Why resampling outperforms reweighting for correcting sampling bias with stochastic gradients | Unknown | N/A | |
| In Defense of Pseudo-Labeling: An Uncertainty-Aware Pseudo-label Selection Framework for Semi-Supervised Learning | Unknown | N/A | |
| Vector-output ReLU Neural Network Problems are Copositive Programs: Convex Analysis of Two Layer Networks and Polynomial-time Algorithms | Unknown | N/A | |
| AdaFuse: Adaptive Temporal Fusion Network for Efficient Action Recognition | Unknown | N/A | |
| Dataset Meta-Learning from Kernel Ridge-Regression | Unknown | N/A | |
| Repurposing Pretrained Models for Robust Out-of-domain Few-Shot Learning | Unknown | N/A | |
| On Position Embeddings in BERT | Unknown | N/A | |
| Contrastive Explanations for Reinforcement Learning via Embedded Self Predictions | Unknown | N/A | |
| Sequential Density Ratio Estimation for Simultaneous Optimization of Speed and Accuracy | Unknown | N/A | |
| Uncertainty Sets for Image Classifiers using Conformal Prediction | Unknown | N/A | |
| Conditional Negative Sampling for Contrastive Learning of Visual Representations | Unknown | N/A | |
| Faster Binary Embeddings for Preserving Euclidean Distances | Unknown | N/A | |
| Model-Based Offline Planning | Unknown | N/A | |
| Neural Networks for Learning Counterfactual G-Invariances from Single Environments | Unknown | N/A | |
| Learning Energy-Based Models by Diffusion Recovery Likelihood | Unknown | N/A | |
| QPLEX: Duplex Dueling Multi-Agent Q-Learning | Unknown | N/A | |
| Does enhanced shape bias improve neural network robustness to common corruptions? | Unknown | N/A | |
| OPAL: Offline Primitive Discovery for Accelerating Offline Reinforcement Learning | Unknown | N/A | |
| PDE-Driven Spatiotemporal Disentanglement | Unknown | N/A | |
| Mapping the Timescale Organization of Neural Language Models | Unknown | N/A | |
| The Intrinsic Dimension of Images and Its Impact on Learning | Unknown | N/A | |
| Stochastic Security: Adversarial Defense Using Long-Run Dynamics of Energy-Based Models | Unknown | N/A | |
| CoCo: Controllable Counterfactuals for Evaluating Dialogue State Trackers | Unknown | N/A | |
| Long Range Arena: A Benchmark for Efficient Transformers | Unknown | N/A | |
| Recurrent Independent Mechanisms | Unknown | N/A | |
| Image GANs meet Differentiable Rendering for Inverse Graphics and Interpretable 3D Neural Rendering | Unknown | N/A | |
| SALD: Sign Agnostic Learning with Derivatives | Unknown | N/A | |
| WaveGrad: Estimating Gradients for Waveform Generation | Unknown | N/A | |
| Linear Last-iterate Convergence in Constrained Saddle-point Optimization | Unknown | N/A | |
| Go with the flow: Adaptive control for Neural ODEs | Unknown | N/A | |
| Understanding Over-parameterization in Generative Adversarial Networks | Unknown | N/A | |
| Multiscale Score Matching for Out-of-Distribution Detection | Unknown | N/A | |
| Random Feature Attention | Unknown | N/A | |
| Tradeoffs in Data Augmentation: An Empirical Study | Unknown | N/A | |
| Rapid Task-Solving in Novel Environments | Unknown | N/A | |
| Extracting Strong Policies for Robotics Tasks from Zero-Order Trajectory Optimizers | Unknown | N/A | |
| Unsupervised Discovery of 3D Physical Objects | Unknown | N/A | |
| Sample-Efficient Automated Deep Reinforcement Learning | Unknown | N/A | |
| Learning Structural Edits via Incremental Tree Transformations | Unknown | N/A | |
| Flowtron: an Autoregressive Flow-based Generative Network for Text-to-Speech Synthesis | Unknown | N/A | |
| Practical Real Time Recurrent Learning with a Sparse Approximation | Unknown | N/A | |
| Private Post-GAN Boosting | Unknown | N/A | |
| Modeling the Second Player in Distributionally Robust Optimization | Unknown | N/A | |
| HW-NAS-Bench: Hardware-Aware Neural Architecture Search Benchmark | Unknown | N/A | |
| Representation learning for improved interpretability and classification accuracy of clinical factors from EEG | Unknown | N/A | |
| GShard: Scaling Giant Models with Conditional Computation and Automatic Sharding | Unknown | N/A | |
| Multi-Level Local SGD: Distributed SGD for Heterogeneous Hierarchical Networks | Unknown | N/A | |
| R-GAP: Recursive Gradient Attack on Privacy | Unknown | N/A | |
| Share or Not? Learning to Schedule Language-Specific Capacity for Multilingual Translation | Unknown | N/A | |
| Isometric Transformation Invariant and Equivariant Graph Convolutional Networks | Unknown | N/A | |
| Chaos of Learning Beyond Zero-sum and Coordination via Game Decompositions | Unknown | N/A | |
| Grounding Language to Autonomously-Acquired Skills via Goal Generation | Unknown | N/A | |
| Trajectory Prediction using Equivariant Continuous Convolution | Unknown | N/A | |
| Image Augmentation Is All You Need: Regularizing Deep Reinforcement Learning from Pixels | Unknown | N/A | |
| On the role of planning in model-based deep reinforcement learning | Unknown | N/A | |
| A Hypergradient Approach to Robust Regression without Correspondence | Unknown | N/A | |
| Fast convergence of stochastic subgradient method under interpolation | Unknown | N/A | |
| Wasserstein Embedding for Graph Learning | Unknown | N/A | |
| Convex Potential Flows: Universal Probability Distributions with Optimal Transport and Convex Optimization | Unknown | N/A | |
| Shape or Texture: Understanding Discriminative Features in CNNs | Unknown | N/A | |
| Neurally Augmented ALISTA | Unknown | N/A | |
| Learning from Demonstration with Weakly Supervised Disentanglement | Unknown | N/A | |
| Score-Based Generative Modeling through Stochastic Differential Equations | Unknown | N/A | |
| On Data-Augmentation and Consistency-Based Semi-Supervised Learning | Unknown | N/A | |
| Learning N:M Fine-grained Structured Sparse Neural Networks From Scratch | Unknown | N/A | |
| The role of Disentanglement in Generalisation | Unknown | N/A | |
| Shapley Explanation Networks | Unknown | N/A | |
| C-Learning: Horizon-Aware Cumulative Accessibility Estimation | Unknown | N/A | |
| Multi-resolution modeling of a discrete stochastic process identifies causes of cancer | Unknown | N/A | |
| PC2WF: 3D Wireframe Reconstruction from Raw Point Clouds | Unknown | N/A | |
| Universal Weakly Supervised Segmentation by Pixel-to-Segment Contrastive Learning | Unknown | N/A | |
| DINO: A Conditional Energy-Based GAN for Domain Translation | Unknown | N/A | |
| HeteroFL: Computation and Communication Efficient Federated Learning for Heterogeneous Clients | Unknown | N/A | |
| AdaSpeech: Adaptive Text to Speech for Custom Voice | Unknown | N/A | |
| Simple Augmentation Goes a Long Way: ADRL for DNN Quantization | Unknown | N/A | |
| Into the Wild with AudioScope: Unsupervised Audio-Visual Separation of On-Screen Sounds | Unknown | N/A | |
| SkipW: Resource Adaptable RNN with Strict Upper Computational Limit | Unknown | N/A | |
| Pruning Neural Networks at Initialization: Why Are We Missing the Mark? | Unknown | N/A | |
| On the Origin of Implicit Regularization in Stochastic Gradient Descent | Unknown | N/A | |
| Transient Non-stationarity and Generalisation in Deep Reinforcement Learning | Unknown | N/A | |
| Adversarial score matching and improved sampling for image generation | Unknown | N/A | |
| LiftPool: Bidirectional ConvNet Pooling | Unknown | N/A | |
| Scalable Bayesian Inverse Reinforcement Learning | Unknown | N/A | |
| Return-Based Contrastive Representation Learning for Reinforcement Learning | Unknown | N/A | |
| Implicit Gradient Regularization | Unknown | N/A | |
| Variational Intrinsic Control Revisited | Unknown | N/A | |
| Understanding and Improving Lexical Choice in Non-Autoregressive Translation | Unknown | N/A | |
| Relating by Contrasting: A Data-efficient Framework for Multimodal Generative Models | Unknown | N/A | |
| Rapid Neural Architecture Search by Learning to Generate Graphs from Datasets | Unknown | N/A | |
| Complex Query Answering with Neural Link Predictors | Unknown | N/A | |
| Learning to Sample with Local and Global Contexts in Experience Replay Buffer | Unknown | N/A | |
| Gradient Origin Networks | Unknown | N/A | |
| Nonseparable Symplectic Neural Networks | Unknown | N/A | |
| Revisiting Locally Supervised Learning: an Alternative to End-to-end Training | Unknown | N/A | |
| Deep Repulsive Clustering of Ordered Data Based on Order-Identity Decomposition | Unknown | N/A | |
| Spatial Dependency Networks: Neural Layers for Improved Generative Image Modeling | Unknown | N/A | |
| Understanding and Improving Encoder Layer Fusion in Sequence-to-Sequence Learning | Unknown | N/A | |
| Reweighting Augmented Samples by Minimizing the Maximal Expected Loss | Unknown | N/A | |
| Robust early-learning: Hindering the memorization of noisy labels | Unknown | N/A | |
| Monte-Carlo Planning and Learning with Language Action Value Estimates | Unknown | N/A | |
| Drop-Bottleneck: Learning Discrete Compressed Representation for Noise-Robust Exploration | Unknown | N/A | |
| Augmenting Physical Models with Deep Networks for Complex Dynamics Forecasting | Unknown | N/A | |
| EigenGame: PCA as a Nash Equilibrium | Unknown | N/A | |
| DrNAS: Dirichlet Neural Architecture Search | Unknown | N/A | |
| Graph Edit Networks | Unknown | N/A | |
| Capturing Label Characteristics in VAEs | Unknown | N/A | |
| Neural Delay Differential Equations | Unknown | N/A | |
| A Better Alternative to Error Feedback for Communication-Efficient Distributed Learning | Unknown | N/A | |
| Group Equivariant Stand-Alone Self-Attention For Vision | Unknown | N/A | |
| Multivariate Probabilistic Time Series Forecasting via Conditioned Normalizing Flows | Unknown | N/A | |
| Undistillable: Making A Nasty Teacher That CANNOT teach students | Unknown | N/A | |
| Learning Hyperbolic Representations of Topological Features | Unknown | N/A | |
| Lipschitz Recurrent Neural Networks | Unknown | N/A | |
| Explaining the Efficacy of Counterfactually Augmented Data | Unknown | N/A | |
| Behavioral Cloning from Noisy Demonstrations | Unknown | N/A | |
| Layer-adaptive Sparsity for the Magnitude-based Pruning | Unknown | N/A | |
| Prototypical Representation Learning for Relation Extraction | Unknown | N/A | |
| Learning Reasoning Paths over Semantic Graphs for Video-grounded Dialogues | Unknown | N/A | |
| Deformable DETR: Deformable Transformers for End-to-End Object Detection | Unknown | N/A | |
| When does preconditioning help or hurt generalization? | Unknown | N/A | |
| Group Equivariant Conditional Neural Processes | Unknown | N/A | |
| Learning from Protein Structure with Geometric Vector Perceptrons | Unknown | N/A | |
| PSTNet: Point Spatio-Temporal Convolution on Point Cloud Sequences | Unknown | N/A | |
| Anchor & Transform: Learning Sparse Embeddings for Large Vocabularies | Unknown | N/A | |
| Molecule Optimization by Explainable Evolution | Unknown | N/A | |
| Predicting Inductive Biases of Pre-Trained Models | Unknown | N/A | |
| Learning Better Structured Representations Using Low-rank Adaptive Label Smoothing | Unknown | N/A | |
| Deep Encoder, Shallow Decoder: Reevaluating Non-autoregressive Machine Translation | Unknown | N/A | |
| Multi-timescale Representation Learning in LSTM Language Models | Unknown | N/A | |
| Adaptive Procedural Task Generation for Hard-Exploration Problems | Unknown | N/A | |
| DeepAveragers: Offline Reinforcement Learning By Solving Derived Non-Parametric MDPs | Unknown | N/A | |
| Extreme Memorization via Scale of Initialization | Unknown | N/A | |
| Prototypical Contrastive Learning of Unsupervised Representations | Unknown | N/A | |
| Learning from others' mistakes: Avoiding dataset biases without modeling them | Unknown | N/A | |
| LowKey: Leveraging Adversarial Attacks to Protect Social Media Users from Facial Recognition | Unknown | N/A | |
| WaNet - Imperceptible Warping-based Backdoor Attack | Unknown | N/A | |
| Neural representation and generation for RNA secondary structures | Unknown | N/A | |
| Can a Fruit Fly Learn Word Embeddings? | Unknown | N/A | |
| RNNLogic: Learning Logic Rules for Reasoning on Knowledge Graphs | Unknown | N/A | |
| Evaluations and Methods for Explanation through Robustness Analysis | Unknown | N/A | |
| gradSim: Differentiable simulation for system identification and visuomotor control | Unknown | N/A | |
| Deployment-Efficient Reinforcement Learning via Model-Based Offline Optimization | Unknown | N/A | |
| Isotropy in the Contextual Embedding Space: Clusters and Manifolds | Unknown | N/A | |
| Reinforcement Learning with Random Delays | Unknown | N/A | |
| Deep Learning meets Projective Clustering | Unknown | N/A | |
| Dual-mode ASR: Unify and Improve Streaming ASR with Full-context Modeling | Unknown | N/A | |
| On Graph Neural Networks versus Graph-Augmented MLPs | Unknown | N/A | |
| NeMo: Neural Mesh Models of Contrastive Features for Robust 3D Pose Estimation | Unknown | N/A | |
| Distributional Sliced-Wasserstein and Applications to Generative Modeling | Unknown | N/A | |
| MoVie: Revisiting Modulated Convolutions for Visual Counting and Beyond | Unknown | N/A | |
| NOVAS: Non-convex Optimization via Adaptive Stochastic Search for End-to-end Learning and Control | Unknown | N/A | |
| Understanding the effects of data parallelism and sparsity on neural network training | Unknown | N/A | |
| Planning from Pixels using Inverse Dynamics Models | Unknown | N/A | |
| Benchmarks for Deep Off-Policy Evaluation | Unknown | N/A | |
| Meta-Learning of Structured Task Distributions in Humans and Machines | Unknown | N/A | |
| Growing Efficient Deep Networks by Structured Continuous Sparsification | Unknown | N/A | |
| Training independent subnetworks for robust prediction | Unknown | N/A | |
| Better Fine-Tuning by Reducing Representational Collapse | Unknown | N/A | |
| Selective Classification Can Magnify Disparities Across Groups | Unknown | N/A | |
| Zero-shot Synthesis with Group-Supervised Learning | Unknown | N/A | |
| Learning Task-General Representations with Generative Neuro-Symbolic Modeling | Unknown | N/A | |
| BERTology Meets Biology: Interpreting Attention in Protein Language Models | Unknown | N/A | |
| Mathematical Reasoning via Self-supervised Skip-tree Training | Unknown | N/A | |
| AutoLRS: Automatic Learning-Rate Schedule by Bayesian Optimization on the Fly | Unknown | N/A | |
| BSQ: Exploring Bit-Level Sparsity for Mixed-Precision Neural Network Quantization | Unknown | N/A | |
| Economic Hyperparameter Optimization With Blended Search Strategy | Unknown | N/A | |
| Average-case Acceleration for Bilinear Games and Normal Matrices | Unknown | N/A | |
| Multi-Prize Lottery Ticket Hypothesis: Finding Accurate Binary Neural Networks by Pruning A Randomly Weighted Network | Unknown | N/A | |
| IsarStep: a Benchmark for High-level Mathematical Reasoning | Unknown | N/A | |
| Towards Faster and Stabilized GAN Training for High-fidelity Few-shot Image Synthesis | Unknown | N/A | |
| SMiRL: Surprise Minimizing Reinforcement Learning in Unstable Environments | Unknown | N/A | |
| Genetic Soft Updates for Policy Evolution in Deep Reinforcement Learning | Unknown | N/A | |
| On the mapping between Hopfield networks and Restricted Boltzmann Machines | Unknown | N/A | |
| Distance-Based Regularisation of Deep Networks for Fine-Tuning | Unknown | N/A | |
| Ringing ReLUs: Harmonic Distortion Analysis of Nonlinear Feedforward Networks | Unknown | N/A | |
| Generalization in data-driven models of primary visual cortex | Unknown | N/A | |
| Efficient Continual Learning with Modular Networks and Task-Driven Priors | Unknown | N/A | |
| Implicit Convex Regularizers of CNN Architectures: Convex Optimization of Two- and Three-Layer Networks in Polynomial Time | Unknown | N/A | |
| Activation-level uncertainty in deep neural networks | Unknown | N/A | |
| On Statistical Bias In Active Learning: How and When to Fix It | Unknown | N/A | |
| Local Convergence Analysis of Gradient Descent Ascent with Finite Timescale Separation | Unknown | N/A | |
| Scaling the Convex Barrier with Active Sets | Unknown | N/A | |
| NAS-Bench-ASR: Reproducible Neural Architecture Search for Speech Recognition | Unknown | N/A | |
| PseudoSeg: Designing Pseudo Labels for Semantic Segmentation | Unknown | N/A | |
| Symmetry-Aware Actor-Critic for 3D Molecular Design | Unknown | N/A | |
| Long Live the Lottery: The Existence of Winning Tickets in Lifelong Learning | Unknown | N/A | |
| Robust Overfitting may be mitigated by properly learned smoothening | Unknown | N/A | |
| Characterizing signal propagation to close the performance gap in unnormalized ResNets | Unknown | N/A | |
| Learning continuous-time PDEs from sparse data with graph neural networks | Unknown | N/A | |
| Latent Skill Planning for Exploration and Transfer | Unknown | N/A | |
| Uncertainty-aware Active Learning for Optimal Bayesian Classifier | Unknown | N/A | |
| Self-supervised Adversarial Robustness for the Low-label, High-data Regime | Unknown | N/A | |
| Single-Photon Image Classification | Unknown | N/A | |
| Unsupervised Object Keypoint Learning using Local Spatial Predictability | Unknown | N/A | |
| CcGAN: Continuous Conditional Generative Adversarial Networks for Image Generation | Unknown | N/A | |
| Differentially Private Learning Needs Better Features (or Much More Data) | Unknown | N/A | |
| Plan-Based Relaxed Reward Shaping for Goal-Directed Tasks | Unknown | N/A | |
| ANOCE: Analysis of Causal Effects with Multiple Mediators via Constrained Structural Learning | Unknown | N/A | |
| Long-tailed Recognition by Routing Diverse Distribution-Aware Experts | Unknown | N/A | |
| Grounded Language Learning Fast and Slow | Unknown | N/A | |
| Transformer protein language models are unsupervised structure learners | Unknown | N/A | |
| Uncertainty Estimation in Autoregressive Structured Prediction | Unknown | N/A | |
| Learning to live with Dale's principle: ANNs with separate excitatory and inhibitory units | Unknown | N/A | |
| An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale | Unknown | N/A | |
| CT-Net: Channel Tensorization Network for Video Classification | Unknown | N/A | |
| On the Universality of Rotation Equivariant Point Cloud Networks | Unknown | N/A | |
| Universal approximation power of deep residual neural networks via nonlinear control theory | Unknown | N/A | |
| Learning a Latent Search Space for Routing Problems using Variational Autoencoders | Unknown | N/A | |
| A teacher-student framework to distill future trajectories | Unknown | N/A | |
| The Traveling Observer Model: Multi-task Learning Through Spatial Variable Embeddings | Unknown | N/A | |
| What they do when in doubt: a study of inductive biases in seq2seq learners | Unknown | N/A | |
| Group Equivariant Generative Adversarial Networks | Unknown | N/A | |
| Robust Curriculum Learning: from clean label detection to noisy label self-correction | Unknown | N/A | |
| Support-set bottlenecks for video-text representation learning | Unknown | N/A | |
| Graph Information Bottleneck for Subgraph Recognition | Unknown | N/A | |
| Learning Deep Features in Instrumental Variable Regression | Unknown | N/A | |
| Neural Synthesis of Binaural Speech From Mono Audio | Unknown | N/A | |
| Grounding Physical Concepts of Objects and Events Through Dynamic Visual Reasoning | Unknown | N/A | |
| Differentiable Segmentation of Sequences | Unknown | N/A | |
| Auto Seg-Loss: Searching Metric Surrogates for Semantic Segmentation | Unknown | N/A | |
| Network Pruning That Matters: A Case Study on Retraining Variants | Unknown | N/A | |
| Degree-Quant: Quantization-Aware Training for Graph Neural Networks | Unknown | N/A | |
| Boost then Convolve: Gradient Boosting Meets Graph Neural Networks | Unknown | N/A | |
| Learning Associative Inference Using Fast Weight Memory | Unknown | N/A | |
| SaliencyMix: A Saliency Guided Data Augmentation Strategy for Better Regularization | Unknown | N/A | |
| Towards Robust Neural Networks via Close-loop Control | Unknown | N/A | |
| Differentiable Trust Region Layers for Deep Reinforcement Learning | Unknown | N/A | |
| Discovering Non-monotonic Autoregressive Orderings with Variational Inference | Unknown | N/A | |
| Rethinking Positional Encoding in Language Pre-training | Unknown | N/A | |
| PlasticineLab: A Soft-Body Manipulation Benchmark with Differentiable Physics | Unknown | N/A | |
| Improving Relational Regularized Autoencoders with Spherical Sliced Fused Gromov Wasserstein | Unknown | N/A | |
| DiffWave: A Versatile Diffusion Model for Audio Synthesis | Unknown | N/A | |
| Calibration of Neural Networks using Splines | Unknown | N/A | |
| Exploring Balanced Feature Spaces for Representation Learning | Unknown | N/A | |
| Measuring Massive Multitask Language Understanding | Unknown | N/A | |
| Kanerva++: Extending the Kanerva Machine With Differentiable, Locally Block Allocated Latent Memory | Unknown | N/A | |
| Aligning AI With Shared Human Values | Unknown | N/A | |
| Learning Manifold Patch-Based Representations of Man-Made Shapes | Unknown | N/A | |
| Filtered Inner Product Projection for Crosslingual Embedding Alignment | Unknown | N/A | |
| Correcting experience replay for multi-agent communication | Unknown | N/A | |
| How Benign is Benign Overfitting? | Unknown | N/A | |
| High-Capacity Expert Binary Networks | Unknown | N/A | |
| Structured Prediction as Translation between Augmented Natural Languages | Unknown | N/A | |
| Fuzzy Tiling Activations: A Simple Approach to Learning Sparse Representations Online | Unknown | N/A | |
| Remembering for the Right Reasons: Explanations Reduce Catastrophic Forgetting | Unknown | N/A | |
| Incremental few-shot learning via vector quantization in deep embedded space | Unknown | N/A | |
| In-N-Out: Pre-Training and Self-Training using Auxiliary Information for Out-of-Distribution Robustness | Unknown | N/A | |
| Reducing the Computational Cost of Deep Generative Models with Binary Neural Networks | Unknown | N/A | |
| MALI: A memory efficient and reverse accurate integrator for Neural ODEs | Unknown | N/A | |
| FedBE: Making Bayesian Model Ensemble Applicable to Federated Learning | Unknown | N/A | |
| Rethinking the Role of Gradient-based Attribution Methods for Model Interpretability | Unknown | N/A | |
| My Body is a Cage: the Role of Morphology in Graph-Based Incompatible Control | Unknown | N/A | |
| Neural Learning of One-of-Many Solutions for Combinatorial Problems in Structured Output Spaces | Unknown | N/A | |
| Adaptive Universal Generalized PageRank Graph Neural Network | Unknown | N/A | |
| Latent Convergent Cross Mapping | Unknown | N/A | |
| Semantic Re-tuning with Contrastive Tension | Unknown | N/A | |
| On the Theory of Implicit Deep Learning: Global Convergence with Implicit Layers | Unknown | N/A | |
| GANs Can Play Lottery Tickets Too | Unknown | N/A | |
| Efficient Conformal Prediction via Cascaded Inference with Expanded Admission | Unknown | N/A | |
| Disambiguating Symbolic Expressions in Informal Documents | Unknown | N/A | |
| Lossless Compression of Structured Convolutional Models via Lifting | Unknown | N/A | |
| Uncertainty in Gradient Boosting via Ensembles | Unknown | N/A | |
| An Unsupervised Deep Learning Approach for Real-World Image Denoising | Unknown | N/A | |
| Conformation-Guided Molecular Representation with Hamiltonian Neural Networks | Unknown | N/A | |
| Neural ODE Processes | Unknown | N/A | |
| Towards Robustness Against Natural Language Word Substitutions | Unknown | N/A | |
| Multi-Class Uncertainty Calibration via Mutual Information Maximization-based Binning | Unknown | N/A | |
| Effective Distributed Learning with Random Features: Improved Bounds and Algorithms | Unknown | N/A | |
| On Learning Universal Representations Across Languages | Unknown | N/A | |
| Minimum Width for Universal Approximation | Unknown | N/A | |
| Factorizing Declarative and Procedural Knowledge in Structured, Dynamical Environments | Unknown | N/A | |
| Regularization Matters in Policy Optimization - An Empirical Study on Continuous Control | Unknown | N/A | |
| Self-Supervised Learning of Compressed Video Representations | Unknown | N/A | |
| Initialization and Regularization of Factorized Neural Layers | Unknown | N/A | |
| Predicting Infectiousness for Proactive Contact Tracing | Unknown | N/A | |
| Are Neural Rankers still Outperformed by Gradient Boosted Decision Trees? | Unknown | N/A | |
| Trusted Multi-View Classification | Unknown | N/A | |
| Anatomy of Catastrophic Forgetting: Hidden Representations and Task Semantics | Unknown | N/A | |
| On Fast Adversarial Robustness Adaptation in Model-Agnostic Meta-Learning | Unknown | N/A | |
| Efficient Wasserstein Natural Gradients for Reinforcement Learning | Unknown | N/A | |
| Robust Pruning at Initialization | Unknown | N/A | |
| Parameter Efficient Multimodal Transformers for Video Representation Learning | Unknown | N/A | |
| Active Contrastive Learning of Audio-Visual Video Representations | Unknown | N/A | |
| Enforcing robust control guarantees within neural network policies | Unknown | N/A | |
| Contrastive Divergence Learning is a Time Reversal Adversarial Game | Unknown | N/A | |
| Unsupervised Representation Learning for Time Series with Temporal Neighborhood Coding | Unknown | N/A | |
| Domain-Robust Visual Imitation Learning with Mutual Information Constraints | Unknown | N/A | |
| Theoretical bounds on estimation error for meta-learning | Unknown | N/A | |
| Towards Impartial Multi-task Learning | Unknown | N/A | |
| Conditionally Adaptive Multi-Task Learning: Improving Transfer Learning in NLP Using Fewer Parameters & Less Data | Unknown | N/A | |
| Counterfactual Generative Networks | Unknown | N/A | |
| IOT: Instance-wise Layer Reordering for Transformer Structures | Unknown | N/A | |
| A statistical theory of cold posteriors in deep neural networks | Unknown | N/A | |
| The inductive bias of ReLU networks on orthogonally separable data | Unknown | N/A | |
| A Unified Approach to Interpreting and Boosting Adversarial Transferability | Unknown | N/A | |
| Contextual Transformation Networks for Online Continual Learning | Unknown | N/A | |
| Private Image Reconstruction from System Side Channels Using Generative Models | Unknown | N/A | |
| GAN "Steerability" without optimization | Unknown | N/A | |
| Model-based micro-data reinforcement learning: what are the crucial model properties and which model to choose? | Unknown | N/A | |
| HalentNet: Multimodal Trajectory Forecasting with Hallucinative Intents | Unknown | N/A | |
| Do 2D GANs Know 3D Shape? Unsupervised 3D Shape Reconstruction from 2D Image GANs | Unknown | N/A | |
| AdamP: Slowing Down the Slowdown for Momentum Optimizers on Scale-invariant Weights | Unknown | N/A | |
| Intrinsic-Extrinsic Convolution and Pooling for Learning on 3D Protein Structures | Unknown | N/A | |
| Non-asymptotic Confidence Intervals of Off-policy Evaluation: Primal and Dual Bounds | Unknown | N/A | |
| Blending MPC & Value Function Approximation for Efficient Reinforcement Learning | Unknown | N/A | |
| Using latent space regression to analyze and leverage compositionality in GANs | Unknown | N/A | |
| Shape-Texture Debiased Neural Network Training | Unknown | N/A | |
| Is Label Smoothing Truly Incompatible with Knowledge Distillation: An Empirical Study | Unknown | N/A | |
| DC3: A learning method for optimization with hard constraints | Unknown | N/A | |
| On the geometry of generalization and memorization in deep neural networks | Unknown | N/A | |
| Exploring the Uncertainty Properties of Neural Networks’ Implicit Priors in the Infinite-Width Limit | Unknown | N/A | |
| Usable Information and Evolution of Optimal Representations During Training | Unknown | N/A | |
| Learning Invariant Representations for Reinforcement Learning without Reconstruction | Unknown | N/A | |
| Parrot: Data-Driven Behavioral Priors for Reinforcement Learning | Unknown | N/A | |
| Zero-Cost Proxies for Lightweight NAS | Unknown | N/A | |
| Perceptual Adversarial Robustness: Defense Against Unseen Threat Models | Unknown | N/A | |
| Deep Neural Network Fingerprinting by Conferrable Adversarial Examples | Unknown | N/A | |
| Enjoy Your Editing: Controllable GANs for Image Editing via Latent Space Navigation | Unknown | N/A | |
| Noise or Signal: The Role of Image Backgrounds in Object Recognition | Unknown | N/A | |
| Shapley explainability on the data manifold | Unknown | N/A | |
| Improving Transformation Invariance in Contrastive Representation Learning | Unknown | N/A | |
| Learning "What-if" Explanations for Sequential Decision-Making | Unknown | N/A | |
| A Trainable Optimal Transport Embedding for Feature Aggregation and its Relationship to Attention | Unknown | N/A | |
| Interpreting Graph Neural Networks for NLP With Differentiable Edge Masking | Unknown | N/A | |
| Graph Convolution with Low-rank Learnable Local Filters | Unknown | N/A | |
| Meta-Learning with Neural Tangent Kernels | Unknown | N/A | |
| FedBN: Federated Learning on Non-IID Features via Local Batch Normalization | Unknown | N/A | |
| Autoregressive Dynamics Models for Offline Policy Evaluation and Optimization | Unknown | N/A | |
| Colorization Transformer | Unknown | N/A | |
| Human-Level Performance in No-Press Diplomacy via Equilibrium Search | Unknown | N/A | |
| Separation and Concentration in Deep Networks | Unknown | N/A | |
| Training GANs with Stronger Augmentations via Contrastive Discriminator | Unknown | N/A | |
| Locally Free Weight Sharing for Network Width Search | Unknown | N/A | |
| Language-Agnostic Representation Learning of Source Code from Structure and Context | Unknown | N/A | |
| Learning Mesh-Based Simulation with Graph Networks | Unknown | N/A | |
| Set Prediction without Imposing Structure as Conditional Density Estimation | Unknown | N/A | |
| Do not Let Privacy Overbill Utility: Gradient Embedding Perturbation for Private Learning | Unknown | N/A | |
| Loss Function Discovery for Object Detection via Convergence-Simulation Driven Search | Unknown | N/A | |
| What are the Statistical Limits of Offline RL with Linear Function Approximation? | Unknown | N/A | |
| Learning Accurate Entropy Model with Global Reference for Image Compression | Unknown | N/A | |
| What Makes Instance Discrimination Good for Transfer Learning? | Unknown | N/A | |
| Improving Adversarial Robustness via Channel-wise Activation Suppressing | Unknown | N/A | |
| A unifying view on implicit bias in training linear neural networks | Unknown | N/A | |
| Representation Learning for Sequence Data with Deep Autoencoding Predictive Components | Unknown | N/A | |
| Policy-Driven Attack: Learning to Query for Hard-label Black-box Adversarial Examples | Unknown | N/A | |
| Direction Matters: On the Implicit Bias of Stochastic Gradient Descent with Moderate Learning Rate | Unknown | N/A | |
| Federated Semi-Supervised Learning with Inter-Client Consistency & Disjoint Learning | Unknown | N/A | |
| Off-Dynamics Reinforcement Learning: Training for Transfer with Domain Classifiers | Unknown | N/A | |
| Learning Cross-Domain Correspondence for Control with Dynamics Cycle-Consistency | Unknown | N/A | |
| Neural Architecture Search on ImageNet in Four GPU Hours: A Theoretically Inspired Perspective | Unknown | N/A | |
| Do Wide and Deep Networks Learn the Same Things? Uncovering How Neural Network Representations Vary with Width and Depth | Unknown | N/A | |
| BUSTLE: Bottom-Up Program Synthesis Through Learning-Guided Exploration | Unknown | N/A | |
| UPDeT: Universal Multi-agent RL via Policy Decoupling with Transformers | Unknown | N/A | |
| A Good Image Generator Is What You Need for High-Resolution Video Synthesis | Unknown | N/A | |
| What Should Not Be Contrastive in Contrastive Learning | Unknown | N/A | |
| A Design Space Study for LISTA and Beyond | Unknown | N/A | |
| Rethinking Soft Labels for Knowledge Distillation: A Bias–Variance Tradeoff Perspective | Unknown | N/A | |
| Answering Complex Open-Domain Questions with Multi-Hop Dense Retrieval | Unknown | N/A | |
| Hierarchical Reinforcement Learning by Discovering Intrinsic Options | Unknown | N/A | |
| Denoising Diffusion Implicit Models | Unknown | N/A | |
| Intraclass clustering: an implicit learning ability that regularizes DNNs | Unknown | N/A | |
| Contrastive Learning with Hard Negative Samples | Unknown | N/A | |
| Discrete Graph Structure Learning for Forecasting Multiple Time Series | Unknown | N/A | |
| Selectivity considered harmful: evaluating the causal impact of class selectivity in DNNs | Unknown | N/A | |
| Watch-And-Help: A Challenge for Social Perception and Human-AI Collaboration | Unknown | N/A | |
| Data-Efficient Reinforcement Learning with Self-Predictive Representations | Unknown | N/A | |
| A Distributional Approach to Controlled Text Generation | Unknown | N/A | |
| A Block Minifloat Representation for Training Deep Neural Networks | Unknown | N/A | |
| On the Impossibility of Global Convergence in Multi-Loss Optimization | Unknown | N/A | |
| Self-supervised Representation Learning with Relative Predictive Coding | Unknown | N/A | |
| Very Deep VAEs Generalize Autoregressive Models and Can Outperform Them on Images | Unknown | N/A | |
| Heating up decision boundaries: isocapacitory saturation, adversarial scenarios and generalization bounds | Unknown | N/A | |
| Rethinking Architecture Selection in Differentiable NAS | Unknown | N/A | |
| CaPC Learning: Confidential and Private Collaborative Learning | Unknown | N/A | |
| Incorporating Symmetry into Deep Dynamics Models for Improved Generalization | Unknown | N/A | |
| Dataset Condensation with Gradient Matching | Unknown | N/A | |
| PMI-Masking: Principled masking of correlated spans | Unknown | N/A | |
| Sharpness-aware Minimization for Efficiently Improving Generalization | Unknown | N/A | |
| Learning with AMIGo: Adversarially Motivated Intrinsic Goals | Unknown | N/A | |
| Explaining by Imitating: Understanding Decisions by Interpretable Policy Learning | Unknown | N/A | |
| End-to-end Adversarial Text-to-Speech | Unknown | N/A | |
| SenSeI: Sensitive Set Invariance for Enforcing Individual Fairness | Unknown | N/A | |
| not-MIWAE: Deep Generative Modelling with Missing not at Random Data | Unknown | N/A | |
| Distilling Knowledge from Reader to Retriever for Question Answering | Unknown | N/A | |
| Adaptive Extra-Gradient Methods for Min-Max Optimization and Games | Unknown | N/A | |
| Training with Quantization Noise for Extreme Model Compression | Unknown | N/A | |
| The Role of Momentum Parameters in the Optimal Convergence of Adaptive Polyak's Heavy-ball Methods | Unknown | N/A | |
| IEPT: Instance-Level and Episode-Level Pretext Tasks for Few-Shot Learning | Unknown | N/A | |
| ChipNet: Budget-Aware Pruning with Heaviside Continuous Approximations | Unknown | N/A | |
| Learning Long-term Visual Dynamics with Region Proposal Interaction Networks | Unknown | N/A | |
| Co-Mixup: Saliency Guided Joint Mixup with Supermodular Diversity | Unknown | N/A | |
| Conditional Generative Modeling via Learning the Latent Space | Unknown | N/A | |
| When Optimizing $f$-Divergence is Robust with Label Noise | Unknown | N/A | |
| Contrastive Learning with Adversarial Perturbations for Conditional Text Generation | Unknown | N/A | |
| Self-Supervised Policy Adaptation during Deployment | Unknown | N/A | |
| Neural Attention Distillation: Erasing Backdoor Triggers from Deep Neural Networks | Unknown | N/A | |
| Large Scale Image Completion via Co-Modulated Generative Adversarial Networks | Unknown | N/A | |
| Geometry-aware Instance-reweighted Adversarial Training | Unknown | N/A | |
| Efficient Reinforcement Learning in Factored MDPs with Application to Constrained RL | Unknown | N/A | |
| Learning with Instance-Dependent Label Noise: A Sample Sieve Approach | Unknown | N/A | |
| Bag of Tricks for Adversarial Training | Unknown | N/A | |
| DynaTune: Dynamic Tensor Program Optimization in Deep Neural Network Compilation | Unknown | N/A | |
| The Risks of Invariant Risk Minimization | Unknown | N/A | |
| DOP: Off-Policy Multi-Agent Decomposed Policy Gradients | Unknown | N/A | |
| Generative Time-series Modeling with Fourier Flows | Unknown | N/A | |
| Individually Fair Gradient Boosting | Unknown | N/A | |
| Federated Learning Based on Dynamic Regularization | Unknown | N/A | |
| Contemplating Real-World Object Classification | Unknown | N/A | |
| When Do Curricula Work? | Unknown | N/A | |
| Learning Neural Event Functions for Ordinary Differential Equations | Unknown | N/A | |
| Mastering Atari with Discrete World Models | Unknown | N/A | |
| Getting a CLUE: A Method for Explaining Uncertainty Estimates | Unknown | N/A | |
| Single-Timescale Actor-Critic Provably Finds Globally Optimal Policy | Unknown | N/A | |
| DeLighT: Deep and Light-weight Transformer | Unknown | N/A | |
| Domain Generalization with MixStyle | Unknown | N/A | |
| Concept Learners for Few-Shot Learning | Unknown | N/A | |
| Creative Sketch Generation | Unknown | N/A | |
| Rethinking Embedding Coupling in Pre-trained Language Models | Unknown | N/A | |
| How Does Mixup Help With Robustness and Generalization? | Unknown | N/A | |
| Lifelong Learning of Compositional Structures | Unknown | N/A | |
| Debiasing Concept-based Explanations with Causal Analysis | Unknown | N/A | |
| Learning to Represent Action Values as a Hypergraph on the Action Vertices | Unknown | N/A | |
| Collective Robustness Certificates: Exploiting Interdependence in Graph Neural Networks | Unknown | N/A | |
| Rethinking Attention with Performers | Unknown | N/A | |
| Benefit of deep learning with non-convex noisy gradient descent: Provable excess risk bound and superiority to kernel methods | Unknown | N/A | |
| Rao-Blackwellizing the Straight-Through Gumbel-Softmax Gradient Estimator | Unknown | N/A | |
| Mutual Information State Intrinsic Control | Unknown | N/A | |
| Learning explanations that are hard to vary | Unknown | N/A | |
| Modelling Hierarchical Structure between Dialogue Policy and Natural Language Generator with Option Framework for Task-oriented Dialogue System | Unknown | N/A | |
| Physics-aware, probabilistic model order reduction with guaranteed stability | Unknown | N/A | |
| RODE: Learning Roles to Decompose Multi-Agent Tasks | Unknown | N/A | |
| Neural gradients are near-lognormal: improved quantized and sparse training | Unknown | N/A | |
| Neural Mechanics: Symmetry and Broken Conservation Laws in Deep Learning Dynamics | Unknown | N/A | |
| Property Controllable Variational Autoencoder via Invertible Mutual Dependence | Unknown | N/A | |
| Neural Thompson Sampling | Unknown | N/A | |
| Continuous Wasserstein-2 Barycenter Estimation without Minimax Optimization | Unknown | N/A | |
| Heteroskedastic and Imbalanced Deep Learning with Adaptive Regularization | Unknown | N/A | |
| Effective and Efficient Vote Attack on Capsule Networks | Unknown | N/A | |
| Information Laundering for Model Privacy | Unknown | N/A | |
| Isometric Propagation Network for Generalized Zero-shot Learning | Unknown | N/A | |
| Learning with Feature-Dependent Label Noise: A Progressive Approach | Unknown | N/A | |
| SEED: Self-supervised Distillation For Visual Representation | Unknown | N/A | |
| Unsupervised Audiovisual Synthesis via Exemplar Autoencoders | Unknown | N/A | |
| Learning Energy-Based Generative Models via Coarse-to-Fine Expanding and Sampling | Unknown | N/A | |
| DDPNOpt: Differential Dynamic Programming Neural Optimizer | Unknown | N/A | |
| Mirostat: A Neural Text Decoding Algorithm That Directly Controls Perplexity | Unknown | N/A | |
| Contextual Dropout: An Efficient Sample-Dependent Dropout Module | Unknown | N/A | |
| Proximal Gradient Descent-Ascent: Variable Convergence under KŁ Geometry | Unknown | N/A | |
| Beyond Fully-Connected Layers with Quaternions: Parameterization of Hypercomplex Multiplications with $1/n$ Parameters | Unknown | N/A | |
| Large Associative Memory Problem in Neurobiology and Machine Learning | Unknown | N/A | |
| Efficient Transformers in Reinforcement Learning using Actor-Learner Distillation | Unknown | N/A | |
| On the Dynamics of Training Attention Models | Unknown | N/A | |
| INT: An Inequality Benchmark for Evaluating Generalization in Theorem Proving | Unknown | N/A | |
| A Critique of Self-Expressive Deep Subspace Clustering | Unknown | N/A | |
| Learning to Recombine and Resample Data For Compositional Generalization | Unknown | N/A | |
| Learning Generalizable Visual Representations via Interactive Gameplay | Unknown | N/A | |
| Overparameterisation and worst-case generalisation: friend or foe? | Unknown | N/A | |
| Calibration tests beyond classification | Unknown | N/A | |
| On the Transfer of Disentangled Representations in Realistic Settings | Unknown | N/A | |
| Deep Neural Tangent Kernel and Laplace Kernel Have the Same RKHS | Unknown | N/A | |
| Revisiting Few-sample BERT Fine-tuning | Unknown | N/A | |
| Ask Your Humans: Using Human Instructions to Improve Generalization in Reinforcement Learning | Unknown | N/A | |
| SSD: A Unified Framework for Self-Supervised Outlier Detection | Unknown | N/A | |
| Long-tail learning via logit adjustment | Unknown | N/A | |
| Approximate Nearest Neighbor Negative Contrastive Learning for Dense Text Retrieval | Unknown | N/A | |
| Rank the Episodes: A Simple Approach for Exploration in Procedurally-Generated Environments | Unknown | N/A | |
| Federated Learning via Posterior Averaging: A New Perspective and Practical Algorithms | Unknown | N/A | |
| Understanding the role of importance weighting for deep learning | Unknown | N/A | |
| LEAF: A Learnable Frontend for Audio Classification | Unknown | N/A | |
| Monotonic Kronecker-Factored Lattice | Unknown | N/A | |
| Tomographic Auto-Encoder: Unsupervised Bayesian Recovery of Corrupted Data | Unknown | N/A | |
| Vulnerability-Aware Poisoning Mechanism for Online RL with Unknown Dynamics | Unknown | N/A | |
| Wasserstein-2 Generative Networks | Unknown | N/A | |
| Emergent Road Rules In Multi-Agent Driving Environments | Unknown | N/A | |
| Iterative Empirical Game Solving via Single Policy Best Response | Unknown | N/A | |
| Scalable Learning and MAP Inference for Nonsymmetric Determinantal Point Processes | Unknown | N/A | |
| Generative Language-Grounded Policy in Vision-and-Language Navigation with Bayes' Rule | Unknown | N/A | |
| Understanding the failure modes of out-of-distribution generalization | Unknown | N/A | |
| Uncertainty Estimation and Calibration with Finite-State Probabilistic RNNs | Unknown | N/A | |
| Hopfield Networks is All You Need | Unknown | N/A | |
| The Importance of Pessimism in Fixed-Dataset Policy Optimization | Unknown | N/A | |
| Representation Balancing Offline Model-based Reinforcement Learning | Unknown | N/A | |
| FairBatch: Batch Selection for Model Fairness | Unknown | N/A | |
| Inductive Representation Learning in Temporal Networks via Causal Anonymous Walks | Unknown | N/A | |
| Systematic generalisation with group invariant predictions | Unknown | N/A | |
| Efficient Inference of Flexible Interaction in Spiking-neuron Networks | Unknown | N/A | |
| Graph Coarsening with Neural Networks | Unknown | N/A | |
| Optimal Conversion of Conventional Artificial Neural Networks to Spiking Neural Networks | Unknown | N/A | |
| Are wider nets better given the same number of parameters? | Unknown | N/A | |
| Autoregressive Entity Retrieval | Unknown | N/A | |
| DARTS-: Robustly Stepping out of Performance Collapse Without Indicators | Unknown | N/A | |
| Adversarially Guided Actor-Critic | Unknown | N/A | |
| Balancing Constraints and Rewards with Meta-Gradient D4PG | Unknown | N/A | |
| Auxiliary Learning by Implicit Differentiation | Unknown | N/A | |
| Distributed Momentum for Byzantine-resilient Stochastic Gradient Descent | Unknown | N/A | |
| Large-width functional asymptotics for deep Gaussian neural networks | Unknown | N/A | |
| Deciphering and Optimizing Multi-Task Learning: a Random Matrix Approach | Unknown | N/A | |
| Free Lunch for Few-shot Learning: Distribution Calibration | Unknown | N/A | |
| Generalized Multimodal ELBO | Unknown | N/A | |
| Targeted Attack against Deep Neural Networks via Flipping Limited Weight Bits | Unknown | N/A | |
| Convex Regularization behind Neural Reconstruction | Unknown | N/A | |
| Efficient Certified Defenses Against Patch Attacks on Image Classifiers | Unknown | N/A | |
| Learning Neural Generative Dynamics for Molecular Conformation Generation | Unknown | N/A | |
| Individually Fair Rankings | Unknown | N/A | |
| Hierarchical Autoregressive Modeling for Neural Video Compression | Unknown | N/A | |
| Robust Reinforcement Learning on State Observations with Learned Optimal Adversary | Unknown | N/A | |
| How Much Over-parameterization Is Sufficient to Learn Deep ReLU Networks? | Unknown | N/A | |
| A Diffusion Theory For Deep Learning Dynamics: Stochastic Gradient Descent Exponentially Favors Flat Minima | Unknown | N/A | |
| Evaluation of Similarity-based Explanations | Unknown | N/A | |
| Geometry-Aware Gradient Algorithms for Neural Architecture Search | Unknown | N/A | |
| Open Question Answering over Tables and Text | Unknown | N/A | |
| The Unreasonable Effectiveness of Patches in Deep Convolutional Kernels Methods | Unknown | N/A | |
| VAEBM: A Symbiosis between Variational Autoencoders and Energy-based Models | Unknown | N/A | |
| Self-supervised Learning from a Multi-view Perspective | Unknown | N/A | |
| Fair Mixup: Fairness via Interpolation | Unknown | N/A | |
| Mind the Gap when Conditioning Amortised Inference in Sequential Latent-Variable Models | Unknown | N/A | |
| Contrastive Behavioral Similarity Embeddings for Generalization in Reinforcement Learning | Unknown | N/A | |
| Removing Undesirable Feature Contributions Using Out-of-Distribution Data | Unknown | N/A | |
| Meta-learning Symmetries by Reparameterization | Unknown | N/A | |
| How to Find Your Friendly Neighborhood: Graph Attention Design with Self-Supervision | Unknown | N/A | |
| For self-supervised learning, Rationality implies generalization, provably | Unknown | N/A | |
| A Temporal Kernel Approach for Deep Learning with Continuous-time Information | Unknown | N/A | |
| Improve Object Detection with Feature-based Knowledge Distillation: Towards Accurate and Efficient Detectors | Unknown | N/A | |
| Conservative Safety Critics for Exploration | Unknown | N/A | |
| Model-Based Visual Planning with Self-Supervised Functional Distances | Unknown | N/A | |
| GraphCodeBERT: Pre-training Code Representations with Data Flow | Unknown | N/A | |
| No MCMC for me: Amortized sampling for fast and stable training of energy-based models | Unknown | N/A | |
| BRECQ: Pushing the Limit of Post-Training Quantization by Block Reconstruction | Unknown | N/A | |
| Predicting Classification Accuracy When Adding New Unobserved Classes | Unknown | N/A | |
| Estimating and Evaluating Regression Predictive Uncertainty in Deep Object Detectors | Unknown | N/A | |
| Learning the Pareto Front with Hypernetworks | Unknown | N/A | |
| Optimal Rates for Averaged Stochastic Gradient Descent under Neural Tangent Kernel Regime | Unknown | N/A | |
| Projected Latent Markov Chain Monte Carlo: Conditional Sampling of Normalizing Flows | Unknown | N/A | |
| MODALS: Modality-agnostic Automated Data Augmentation in the Latent Space | Unknown | N/A | |
| Impact of Representation Learning in Linear Bandits | Unknown | N/A | |
| EEC: Learning to Encode and Regenerate Images for Continual Learning | Unknown | N/A | |
| What Can You Learn From Your Muscles? Learning Visual Representation from Human Interactions | Unknown | N/A | |
| Improving VAEs' Robustness to Adversarial Attack | Unknown | N/A | |
| The Deep Bootstrap Framework: Good Online Learners are Good Offline Generalizers | Unknown | N/A | |
| Control-Aware Representations for Model-based Reinforcement Learning | Unknown | N/A | |
| Scaling Symbolic Methods using Gradients for Neural Model Explanation | Unknown | N/A | |
| Empirical or Invariant Risk Minimization? A Sample Complexity Perspective | Unknown | N/A | |
| CausalWorld: A Robotic Manipulation Benchmark for Causal Structure and Transfer Learning | Unknown | N/A | |
| Deep symbolic regression: Recovering mathematical expressions from data via risk-seeking policy gradients | Unknown | N/A | |
| Gradient Descent on Neural Networks Typically Occurs at the Edge of Stability | Unknown | N/A | |
| The geometry of integration in text classification RNNs | Unknown | N/A | |
| On the Bottleneck of Graph Neural Networks and its Practical Implications | Unknown | N/A | |
| Learning to Reach Goals via Iterated Supervised Learning | Unknown | N/A | |
| On the Critical Role of Conventions in Adaptive Human-AI Collaboration | Unknown | N/A | |
| CopulaGNN: Towards Integrating Representational and Correlational Roles of Graphs in Graph Neural Networks | Unknown | N/A | |
| Discovering a set of policies for the worst case reward | Unknown | N/A | |
| Learning perturbation sets for robust machine learning | Unknown | N/A | |
| Primal Wasserstein Imitation Learning | Unknown | N/A | |
| A Universal Representation Transformer Layer for Few-Shot Image Classification | Unknown | N/A | |
| MoPro: Webly Supervised Learning with Momentum Prototypes | Unknown | N/A | |
| Signatory: differentiable computations of the signature and logsignature transforms, on both CPU and GPU | Unknown | N/A | |
| Optimism in Reinforcement Learning with Generalized Linear Function Approximation | Unknown | N/A | |
| DeBERTa: Decoding-enhanced BERT with Disentangled Attention | Unknown | N/A | |
| Variational Information Bottleneck for Effective Low-Resource Fine-Tuning | Unknown | N/A | |
| Expressive Power of Invariant and Equivariant Graph Neural Networks | Unknown | N/A | |
| On the Stability of Fine-tuning BERT: Misconceptions, Explanations, and Strong Baselines | Unknown | N/A | |
| Computational Separation Between Convolutional and Fully-Connected Networks | Unknown | N/A | |
| Probabilistic Numeric Convolutional Neural Networks | Unknown | N/A | |
| FairFil: Contrastive Neural Debiasing Method for Pretrained Text Encoders | Unknown | N/A | |
| Coping with Label Shift via Distributionally Robust Optimisation | Unknown | N/A | |
| MixKD: Towards Efficient Distillation of Large-scale Language Models | Unknown | N/A | |
| Learning a Latent Simplex in Input Sparsity Time | Unknown | N/A | |
| Teaching with Commentaries | Unknown | N/A | |
| CoDA: Contrast-enhanced and Diversity-promoting Data Augmentation for Natural Language Understanding | Unknown | N/A | |
| Fantastic Four: Differentiable and Efficient Bounds on Singular Values of Convolution Layers | Unknown | N/A | |
| Theoretical Analysis of Self-Training with Deep Networks on Unlabeled Data | Unknown | N/A | |
| Negative Data Augmentation | Unknown | N/A | |
| Scalable Transfer Learning with Expert Models | Unknown | N/A | |
| A Wigner-Eckart Theorem for Group Equivariant Convolution Kernels | Unknown | N/A | |
| Learning A Minimax Optimizer: A Pilot Study | Unknown | N/A | |
| Meta Back-Translation | Unknown | N/A | |
| Optimal Regularization can Mitigate Double Descent | Unknown | N/A | |
| Net-DNF: Effective Deep Modeling of Tabular Data | Unknown | N/A | |
| MultiModalQA: complex question answering over text, tables and images | Unknown | N/A | |
| Dynamic Tensor Rematerialization | Unknown | N/A | |
| AdaGCN: Adaboosting Graph Convolutional Networks into Deep Models | Unknown | N/A | |
| Few-Shot Learning via Learning the Representation, Provably | Unknown | N/A | |
| Wandering within a world: Online contextualized few-shot learning | Unknown | N/A | |
| WrapNet: Neural Net Inference with Ultra-Low-Precision Arithmetic | Unknown | N/A | |
| Nearest Neighbor Machine Translation | Unknown | N/A | |
| Knowledge distillation via softmax regression representation learning | Unknown | N/A | |
| Empirical Analysis of Unlabeled Entity Problem in Named Entity Recognition | Unknown | N/A | |
| Practical Massively Parallel Monte-Carlo Tree Search Applied to Molecular Design | Unknown | N/A | |
| Neural Pruning via Growing Regularization | Unknown | N/A | |
| Mixed-Features Vectors and Subspace Splitting | Unknown | N/A | |
| Graph-Based Continual Learning | Unknown | N/A | |
| Sparse Quantized Spectral Clustering | Unknown | N/A | |
| Taking Notes on the Fly Helps Language Pre-Training | Unknown | N/A | |
| Explainable Deep One-Class Classification | Unknown | N/A | |
| Revisiting Dynamic Convolution via Matrix Decomposition | Unknown | N/A | |
| BiPointNet: Binary Neural Network for Point Clouds | Unknown | N/A | |
| Prediction and generalisation over directed actions by grid cells | Unknown | N/A | |
| Continual learning in recurrent neural networks | Unknown | N/A | |
| Neural networks with late-phase weights | Unknown | N/A | |
| Sparse encoding for more-interpretable feature-selecting representations in probabilistic matrix factorization | Unknown | N/A | |
| HyperGrid Transformers: Towards A Single Model for Multiple Tasks | Unknown | N/A | |
| Learning Robust State Abstractions for Hidden-Parameter Block MDPs | Unknown | N/A | |
| Meta-GMVAE: Mixture of Gaussian VAE for Unsupervised Meta-Learning | Unknown | N/A | |
| Viewmaker Networks: Learning Views for Unsupervised Representation Learning | Unknown | N/A | |
| Optimizing Memory Placement using Evolutionary Graph Reinforcement Learning | Unknown | N/A | |
| Is Attention Better Than Matrix Decomposition? | Unknown | N/A | |
| Learning Incompressible Fluid Dynamics from Scratch - Towards Fast, Differentiable Fluid Models that Generalize | Unknown | N/A | |
| Refining Deep Generative Models via Discriminator Gradient Flow | Unknown | N/A | |
| Entropic gradient descent algorithms and wide flat minima | Unknown | N/A | |
| New Bounds For Distributed Mean Estimation and Variance Reduction | Unknown | N/A | |
| Learning Value Functions in Deep Policy Gradients using Residual Variance | Unknown | N/A | |
| Winning the L2RPN Challenge: Power Grid Management via Semi-Markov Afterstate Actor-Critic | Unknown | N/A | |
| No Cost Likelihood Manipulation at Test Time for Making Better Mistakes in Deep Networks | Unknown | N/A | |
| Parameter-Based Value Functions | Unknown | N/A | |
| CoCon: A Self-Supervised Approach for Controlled Text Generation | Unknown | N/A | |
| MELR: Meta-Learning via Modeling Episode-Level Relationships for Few-Shot Learning | Unknown | N/A | |
| Beyond Categorical Label Representations for Image Classification | Unknown | N/A | |
| Meta-learning with negative learning rates | Unknown | N/A | |
| Provable Rich Observation Reinforcement Learning with Combinatorial Latent States | Unknown | N/A | |
| Global optimality of softmax policy gradient with single hidden layer neural networks in the mean-field regime | Unknown | N/A | |
| Variational State-Space Models for Localisation and Dense 3D Mapping in 6 DoF | Unknown | N/A | |
| Contrastive Syn-to-Real Generalization | Unknown | N/A | |
| Evolving Reinforcement Learning Algorithms | Unknown | N/A | |
| Neural Topic Model via Optimal Transport | Unknown | N/A | |
| Class Normalization for (Continual)? Generalized Zero-Shot Learning | Unknown | N/A | |
| Revisiting Hierarchical Approach for Persistent Long-Term Video Prediction | Unknown | N/A | |
| A Gradient Flow Framework For Analyzing Network Pruning | Unknown | N/A | |
| A Mathematical Exploration of Why Language Models Help Solve Downstream Tasks | Unknown | N/A | |
| Bayesian Few-Shot Classification with One-vs-Each Pólya-Gamma Augmented Gaussian Processes | Unknown | N/A | |
| Knowledge Distillation as Semiparametric Inference | Unknown | N/A | |
| NBDT: Neural-Backed Decision Tree | Unknown | N/A | |
| Deep Equals Shallow for ReLU Networks in Kernel Regimes | Unknown | N/A | |
| Interactive Weak Supervision: Learning Useful Heuristics for Data Labeling | Unknown | N/A | |
| MiCE: Mixture of Contrastive Experts for Unsupervised Image Clustering | Unknown | N/A | |
| Mind the Pad -- CNNs Can Develop Blind Spots | Unknown | N/A | |
| A PAC-Bayesian Approach to Generalization Bounds for Graph Neural Networks | Unknown | N/A | |
| Gauge Equivariant Mesh CNNs: Anisotropic convolutions on geometric graphs | Unknown | N/A | |
| Interpretable Models for Granger Causality Using Self-explaining Neural Networks | Unknown | N/A | |
| Estimating Lipschitz constants of monotone deep equilibrium models | Unknown | N/A | |
| Probing BERT in Hyperbolic Spaces | Unknown | N/A | |
| Batch Reinforcement Learning Through Continuation Method | Unknown | N/A | |
| Towards Resolving the Implicit Bias of Gradient Descent for Matrix Factorization: Greedy Low-Rank Learning | Unknown | N/A | |
| Diverse Video Generation using a Gaussian Process Trigger | Unknown | N/A | |
| Learning and Evaluating Representations for Deep One-Class Classification | Unknown | N/A | |
| Fast and Complete: Enabling Complete Neural Network Verification with Rapid and Massively Parallel Incomplete Verifiers | Unknown | N/A | |
| Retrieval-Augmented Generation for Code Summarization via Hybrid GNN | Unknown | N/A | |
| Randomized Automatic Differentiation | Unknown | N/A | |
| Generalized Energy Based Models | Unknown | N/A | |
| Temporally-Extended ε-Greedy Exploration | Unknown | N/A | |
| Multiplicative Filter Networks | Unknown | N/A | |
| Clustering-friendly Representation Learning via Instance Discrimination and Feature Decorrelation | Unknown | N/A | |
| FedMix: Approximation of Mixup under Mean Augmented Federated Learning | Unknown | N/A | |
| Self-training For Few-shot Transfer Across Extreme Task Differences | Unknown | N/A | |
| VCNet and Functional Targeted Regularization For Learning Causal Effects of Continuous Treatments | Unknown | N/A | |
| Are Neural Nets Modular? Inspecting Functional Modularity Through Differentiable Weight Masks | Unknown | N/A | |
| Generalization bounds via distillation | Unknown | N/A | |
| Model Patching: Closing the Subgroup Performance Gap with Data Augmentation | Unknown | N/A | |
| A Learning Theoretic Perspective on Local Explainability | Unknown | N/A | |
| Sharper Generalization Bounds for Learning with Gradient-dominated Objective Functions | Unknown | N/A | |
| Learning to Set Waypoints for Audio-Visual Navigation | Unknown | N/A | |
| Partitioned Learned Bloom Filters | Unknown | N/A | |
| Unsupervised Meta-Learning through Latent-Space Interpolation in Generative Models | Unknown | N/A | |
| Generalized Variational Continual Learning | Unknown | N/A | |
| Robust and Generalizable Visual Representation Learning via Random Convolutions | Unknown | N/A | |
| One Network Fits All? Modular versus Monolithic Task Formulations in Neural Networks | Unknown | N/A | |
| X2T: Training an X-to-Text Typing Interface with Online Learning from User Feedback | Unknown | N/A | |
| Attentional Constellation Nets for Few-Shot Learning | Unknown | N/A | |
| InfoBERT: Improving Robustness of Language Models from An Information Theoretic Perspective | Unknown | N/A | |
| Bidirectional Variational Inference for Non-Autoregressive Text-to-Speech | Unknown | N/A | |
| Risk-Averse Offline Reinforcement Learning | Unknown | N/A | |
| Spatio-Temporal Graph Scattering Transform | Unknown | N/A | |
| How Neural Networks Extrapolate: From Feedforward to Graph Neural Networks | Unknown | N/A | |
| Async-RED: A Provably Convergent Asynchronous Block Parallel Stochastic Method using Deep Denoising Priors | Unknown | N/A | |
| Communication in Multi-Agent Reinforcement Learning: Intention Sharing | Unknown | N/A | |
| Few-Shot Bayesian Optimization with Deep Kernel Surrogates | Unknown | N/A | |
| Disentangled Recurrent Wasserstein Autoencoder | Unknown | N/A | |
| In Search of Lost Domain Generalization | Unknown | N/A | |
| Cross-Attentional Audio-Visual Fusion for Weakly-Supervised Action Localization | Unknown | N/A | |
| Implicit Normalizing Flows | Unknown | N/A | |
| VTNet: Visual Transformer Network for Object Goal Navigation | Unknown | N/A | |
| Learning Task Decomposition with Ordered Memory Policy Network | Unknown | N/A | |
| Deep Networks and the Multiple Manifold Problem | Unknown | N/A | |
| Learning What To Do by Simulating the Past | Unknown | N/A | |
| Progressive Skeletonization: Trimming more fat from a network at initialization | Unknown | N/A | |
| $i$-Mix: A Domain-Agnostic Strategy for Contrastive Representation Learning | Unknown | N/A | |
| Topology-Aware Segmentation Using Discrete Morse Theory | Unknown | N/A | |
| On Dyadic Fairness: Exploring and Mitigating Bias in Graph Connections | Unknown | N/A | |
| Tent: Fully Test-Time Adaptation by Entropy Minimization | Unknown | N/A | |
| Towards Nonlinear Disentanglement in Natural Data with Temporal Sparse Coding | Unknown | N/A | |
| Dataset Inference: Ownership Resolution in Machine Learning | Unknown | N/A | |
| Regularized Inverse Reinforcement Learning | Unknown | N/A | |
| Fast And Slow Learning Of Recurrent Independent Mechanisms | Unknown | N/A | |
| Learning Safe Multi-agent Control with Decentralized Neural Barrier Certificates | Unknown | N/A | |
| Semi-supervised Keypoint Localization | Unknown | N/A | |
| Representing Partial Programs with Blended Abstract Semantics | Unknown | N/A | |
| FastSpeech 2: Fast and High-Quality End-to-End Text to Speech | Unknown | N/A | |
| Unlearnable Examples: Making Personal Data Unexploitable | Unknown | N/A | |
| IDF++: Analyzing and Improving Integer Discrete Flows for Lossless Compression | Unknown | N/A | |
| Discovering Diverse Multi-Agent Strategic Behavior via Reward Randomization | Unknown | N/A | |
| Identifying Physical Law of Hamiltonian Systems via Meta-Learning | Unknown | N/A | |
| Text Generation by Learning from Demonstrations | Unknown | N/A | |
| Unbiased Teacher for Semi-Supervised Object Detection | Unknown | N/A | |
| Estimating informativeness of samples with Smooth Unique Information | Unknown | N/A | |
| Efficient Generalized Spherical CNNs | Unknown | N/A | |
| DialoGraph: Incorporating Interpretable Strategy-Graph Networks into Negotiation Dialogues | Unknown | N/A | |
| Global Convergence of Three-layer Neural Networks in the Mean Field Regime | Unknown | N/A | |
| Multi-Time Attention Networks for Irregularly Sampled Time Series | Unknown | N/A | |
| Linear Convergent Decentralized Optimization with Compression | Unknown | N/A | |
| Adaptive Federated Optimization | Unknown | N/A | |
| Auction Learning as a Two-Player Game | Unknown | N/A | |
| Analyzing the Expressive Power of Graph Neural Networks in a Spectral Perspective | Unknown | N/A | |
| Exemplary Natural Images Explain CNN Activations Better than State-of-the-Art Feature Visualization | Unknown | N/A | |
| Interpreting Knowledge Graph Relation Representation from Word Embeddings | Unknown | N/A | |
| Accelerating Convergence of Replica Exchange Stochastic Gradient MCMC via Variance Reduction | Unknown | N/A | |
| DICE: Diversity in Deep Ensembles via Conditional Redundancy Adversarial Estimation | Unknown | N/A | |
| Learning-based Support Estimation in Sublinear Time | Unknown | N/A | |
| Integrating Categorical Semantics into Unsupervised Domain Translation | Unknown | N/A | |
| The Recurrent Neural Tangent Kernel | Unknown | N/A | |
| C-Learning: Learning to Achieve Goals via Recursive Classification | Unknown | N/A | |
| Graph Traversal with Tensor Functionals: A Meta-Algorithm for Scalable Learning | Unknown | N/A | |
| What Matters for On-Policy Deep Actor-Critic Methods? A Large-Scale Study | Unknown | N/A | |
| Influence Estimation for Generative Adversarial Networks | Unknown | N/A | |
| Clairvoyance: A Pipeline Toolkit for Medical Time Series | Unknown | N/A | |
| Randomized Ensembled Double Q-Learning: Learning Fast Without a Model | Unknown | N/A | |
| Learning advanced mathematical computations from examples | Unknown | N/A | |
| Noise against noise: stochastic label noise helps combat inherent label noise | Unknown | N/A | |
| Achieving Linear Speedup with Partial Worker Participation in Non-IID Federated Learning | Unknown | N/A | |
| Protecting DNNs from Theft using an Ensemble of Diverse Models | Unknown | N/A | |
| You Only Need Adversarial Supervision for Semantic Image Synthesis | Unknown | N/A | |
| Neural Spatio-Temporal Point Processes | Unknown | N/A | |
| Linear Mode Connectivity in Multitask and Continual Learning | Unknown | N/A | |
| Fully Unsupervised Diversity Denoising with Convolutional Variational Autoencoders | Unknown | N/A | |
| Auxiliary Task Update Decomposition: The Good, the Bad and the Neutral | Unknown | N/A | |
| Influence Functions in Deep Learning Are Fragile | Unknown | N/A | |
| Categorical Normalizing Flows via Continuous Transformations | Unknown | N/A | |
| Offline Model-Based Optimization via Normalized Maximum Likelihood Estimation | Unknown | N/A | |
| Directed Acyclic Graph Neural Networks | Unknown | N/A | |
| Gradient Vaccine: Investigating and Improving Multi-task Optimization in Massively Multilingual Models | Unknown | N/A | |
| SOLAR: Sparse Orthogonal Learned and Random Embeddings | Unknown | N/A | |
| Bayesian Context Aggregation for Neural Processes | Unknown | N/A | |
| Cut out the annotator, keep the cutout: better segmentation with weak supervision | Unknown | N/A | |
| Effective Abstract Reasoning with Dual-Contrast Network | Unknown | N/A | |
| Personalized Federated Learning with First Order Model Optimization | Unknown | N/A | |
| Task-Agnostic Morphology Evolution | Unknown | N/A | |
| Learning Parametrised Graph Shift Operators | Unknown | N/A | |
| Online Adversarial Purification based on Self-supervised Learning | Unknown | N/A | |
ICLR 2022
| Title | Author | PDF_Link | Code_URL |
|---|---|---|---|
| Distributionally Robust Fair Principal Components via Geodesic Descents | Unknown | N/A | |
| Ancestral protein sequence reconstruction using a tree-structured Ornstein-Uhlenbeck variational autoencoder | Unknown | N/A | |
| The Efficiency Misnomer | Unknown | N/A | |
| The Role of Permutation Invariance in Linear Mode Connectivity of Neural Networks | Unknown | N/A | |
| Huber Additive Models for Non-stationary Time Series Analysis | Unknown | N/A | |
| Rethinking Supervised Pre-Training for Better Downstream Transferring | Unknown | N/A | |
| Optimization inspired Multi-Branch Equilibrium Models | Unknown | N/A | |
| Know Your Action Set: Learning Action Relations for Reinforcement Learning | Unknown | N/A | |
| On the Importance of Difficulty Calibration in Membership Inference Attacks | Unknown | N/A | |
| Do Users Benefit From Interpretable Vision? A User Study, Baseline, And Dataset | Unknown | N/A | |
| Rethinking Class-Prior Estimation for Positive-Unlabeled Learning | Unknown | N/A | |
| Enabling Arbitrary Translation Objectives with Adaptive Tree Search | Unknown | N/A | |
| Fine-grained Differentiable Physics: A Yarn-level Model for Fabrics | Unknown | N/A | |
| Transition to Linearity of Wide Neural Networks is an Emerging Property of Assembling Weak Models | Unknown | N/A | |
| SQuant: On-the-Fly Data-Free Quantization via Diagonal Hessian Approximation | Unknown | N/A | |
| Is Importance Weighting Incompatible with Interpolating Classifiers? | Unknown | N/A | |
| DIVA: Dataset Derivative of a Learning Task | Unknown | N/A | |
| Path Integral Sampler: A Stochastic Control Approach For Sampling | Unknown | N/A | |
| Feature Kernel Distillation | Unknown | N/A | |
| Representation Learning for Online and Offline RL in Low-rank MDPs | Unknown | N/A | |
| Tackling the Generative Learning Trilemma with Denoising Diffusion GANs | Unknown | N/A | |
| Strength of Minibatch Noise in SGD | Unknown | N/A | |
| Sample Selection with Uncertainty of Losses for Learning with Noisy Labels | Unknown | N/A | |
| Online Facility Location with Predictions | Unknown | N/A | |
| High Probability Bounds for a Class of Nonconvex Algorithms with AdaGrad Stepsize | Unknown | N/A | |
| Contrastive Fine-grained Class Clustering via Generative Adversarial Networks | Unknown | N/A | |
| TAMP-S2GCNets: Coupling Time-Aware Multipersistence Knowledge Representation with Spatio-Supra Graph Convolutional Networks for Time-Series Forecasting | Unknown | N/A | |
| D-CODE: Discovering Closed-form ODEs from Observed Trajectories | Unknown | N/A | |
| Declarative nets that are equilibrium models | Unknown | N/A | |
| Learn Locally, Correct Globally: A Distributed Algorithm for Training Graph Neural Networks | Unknown | N/A | |
| DAB-DETR: Dynamic Anchor Boxes are Better Queries for DETR | Unknown | N/A | |
| Graph-Relational Domain Adaptation | Unknown | N/A | |
| On the approximation properties of recurrent encoder-decoder architectures | Unknown | N/A | |
| Learning Object-Oriented Dynamics for Planning from Text | Unknown | N/A | |
| Quantitative Performance Assessment of CNN Units via Topological Entropy Calculation | Unknown | N/A | |
| The Hidden Convex Optimization Landscape of Regularized Two-Layer ReLU Networks: an Exact Characterization of Optimal Solutions | Unknown | N/A | |
| AlphaZero-based Proof Cost Network to Aid Game Solving | Unknown | N/A | |
| Open-World Semi-Supervised Learning | Unknown | N/A | |
| A generalization of the randomized singular value decomposition | Unknown | N/A | |
| Learning Graphon Mean Field Games and Approximate Nash Equilibria | Unknown | N/A | |
| A Neural Tangent Kernel Perspective of Infinite Tree Ensembles | Unknown | N/A | |
| Multitask Prompted Training Enables Zero-Shot Task Generalization | Unknown | N/A | |
| On Covariate Shift of Latent Confounders in Imitation and Reinforcement Learning | Unknown | N/A | |
| On the benefits of maximum likelihood estimation for Regression and Forecasting | Unknown | N/A | |
| Cold Brew: Distilling Graph Node Representations with Incomplete or Missing Neighborhoods | Unknown | N/A | |
| Universalizing Weak Supervision | Unknown | N/A | |
| Uncertainty Modeling for Out-of-Distribution Generalization | Unknown | N/A | |
| Generalization of Neural Combinatorial Solvers Through the Lens of Adversarial Robustness | Unknown | N/A | |
| Neural Link Prediction with Walk Pooling | Unknown | N/A | |
| On Lottery Tickets and Minimal Task Representations in Deep Reinforcement Learning | Unknown | N/A | |
| When Can We Learn General-Sum Markov Games with a Large Number of Players Sample-Efficiently? | Unknown | N/A | |
| Robbing the Fed: Directly Obtaining Private Data in Federated Learning with Modified Models | Unknown | N/A | |
| Crystal Diffusion Variational Autoencoder for Periodic Material Generation | Unknown | N/A | |
| High Probability Generalization Bounds with Fast Rates for Minimax Problems | Unknown | N/A | |
| Independent SE(3)-Equivariant Models for End-to-End Rigid Protein Docking | Unknown | N/A | |
| Bridging the Gap: Providing Post-Hoc Symbolic Explanations for Sequential Decision-Making Problems with Inscrutable Representations | Unknown | N/A | |
| Multimeasurement Generative Models | Unknown | N/A | |
| Generalized Natural Gradient Flows in Hidden Convex-Concave Games and GANs | Unknown | N/A | |
| Solving Inverse Problems in Medical Imaging with Score-Based Generative Models | Unknown | N/A | |
| switch-GLAT: Multilingual Parallel Machine Translation Via Code-Switch Decoder | Unknown | N/A | |
| MAML is a Noisy Contrastive Learner in Classification | Unknown | N/A | |
| A Conditional Point Diffusion-Refinement Paradigm for 3D Point Cloud Completion | Unknown | N/A | |
| Implicit Bias of Adversarial Training for Deep Neural Networks | Unknown | N/A | |
| Partial Wasserstein Adversarial Network for Non-rigid Point Set Registration | Unknown | N/A | |
| POETREE: Interpretable Policy Learning with Adaptive Decision Trees | Unknown | N/A | |
| A Johnson-Lindenstrauss Framework for Randomly Initialized CNNs | Unknown | N/A | |
| Online Ad Hoc Teamwork under Partial Observability | Unknown | N/A | |
| Global Convergence of Multi-Agent Policy Gradient in Markov Potential Games | Unknown | N/A | |
| Neural Networks as Kernel Learners: The Silent Alignment Effect | Unknown | N/A | |
| A Fine-Grained Analysis on Distribution Shift | Unknown | N/A | |
| The Inductive Bias of In-Context Learning: Rethinking Pretraining Example Design | Unknown | N/A | |
| Towards Evaluating the Robustness of Neural Networks Learned by Transduction | Unknown | N/A | |
| GradSign: Model Performance Inference with Theoretical Insights | Unknown | N/A | |
| Consistent Counterfactuals for Deep Models | Unknown | N/A | |
| The Three Stages of Learning Dynamics in High-dimensional Kernel Methods | Unknown | N/A | |
| Real-Time Neural Voice Camouflage | Unknown | N/A | |
| Sample Efficient Stochastic Policy Extragradient Algorithm for Zero-Sum Markov Game | Unknown | N/A | |
| Improved deterministic l2 robustness on CIFAR-10 and CIFAR-100 | Unknown | N/A | |
| The Effects of Invertibility on the Representational Complexity of Encoders in Variational Autoencoders | Unknown | N/A | |
| Extending the WILDS Benchmark for Unsupervised Adaptation | Unknown | N/A | |
| Efficiently Modeling Long Sequences with Structured State Spaces | Unknown | N/A | |
| Unified Visual Transformer Compression | Unknown | N/A | |
| NAS-Bench-Suite: NAS Evaluation is (Now) Surprisingly Easy | Unknown | N/A | |
| Learning to Remember Patterns: Pattern Matching Memory Networks for Traffic Forecasting | Unknown | N/A | |
| A Theory of Tournament Representations | Unknown | N/A | |
| Mapping conditional distributions for domain adaptation under generalized target shift | Unknown | N/A | |
| NeuPL: Neural Population Learning | Unknown | N/A | |
| An Operator Theoretic View On Pruning Deep Neural Networks | Unknown | N/A | |
| MetaMorph: Learning Universal Controllers with Transformers | Unknown | N/A | |
| PER-ETD: A Polynomially Efficient Emphatic Temporal Difference Learning Method | Unknown | N/A | |
| Modular Lifelong Reinforcement Learning via Neural Composition | Unknown | N/A | |
| TPU-GAN: Learning temporal coherence from dynamic point cloud sequences | Unknown | N/A | |
| Learning Hierarchical Structures with Differentiable Nondeterministic Stacks | Unknown | N/A | |
| Missingness Bias in Model Debugging | Unknown | N/A | |
| Accelerated Policy Learning with Parallel Differentiable Simulation | Unknown | N/A | |
| Causal Contextual Bandits with Targeted Interventions | Unknown | N/A | |
| Learning Altruistic Behaviours in Reinforcement Learning without External Rewards | Unknown | N/A | |
| FILIP: Fine-grained Interactive Language-Image Pre-Training | Unknown | N/A | |
| Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation | Unknown | N/A | |
| The Convex Geometry of Backpropagation: Neural Network Gradient Flows Converge to Extreme Points of the Dual Convex Program | Unknown | N/A | |
| Variational oracle guiding for reinforcement learning | Unknown | N/A | |
| P-Adapters: Robustly Extracting Factual Information from Language Models with Diverse Prompts | Unknown | N/A | |
| Demystifying Batch Normalization in ReLU Networks: Equivalent Convex Optimization Models and Implicit Regularization | Unknown | N/A | |
| Cross-Lingual Transfer with Class-Weighted Language-Invariant Representations | Unknown | N/A | |
| Reverse Engineering of Imperceptible Adversarial Image Perturbations | Unknown | N/A | |
| Pareto Policy Adaptation | Unknown | N/A | |
| Variational autoencoders in the presence of low-dimensional data: landscape and implicit bias | Unknown | N/A | |
| Amortized Tree Generation for Bottom-up Synthesis Planning and Synthesizable Molecular Design | Unknown | N/A | |
| C-Planning: An Automatic Curriculum for Learning Goal-Reaching Tasks | Unknown | N/A | |
| Understanding Domain Randomization for Sim-to-real Transfer | Unknown | N/A | |
| Model-Based Offline Meta-Reinforcement Learning with Regularization | Unknown | N/A | |
| Analyzing and Improving the Optimization Landscape of Noise-Contrastive Estimation | Unknown | N/A | |
| Hybrid Memoised Wake-Sleep: Approximate Inference at the Discrete-Continuous Interface | Unknown | N/A | |
| TRGP: Trust Region Gradient Projection for Continual Learning | Unknown | N/A | |
| Contextualized Scene Imagination for Generative Commonsense Reasoning | Unknown | N/A | |
| Shallow and Deep Networks are Near-Optimal Approximators of Korobov Functions | Unknown | N/A | |
| Phenomenology of Double Descent in Finite-Width Neural Networks | Unknown | N/A | |
| InfinityGAN: Towards Infinite-Pixel Image Synthesis | Unknown | N/A | |
| Discovering Nonlinear PDEs from Scarce Data with Physics-encoded Learning | Unknown | N/A | |
| COPA: Certifying Robust Policies for Offline Reinforcement Learning against Poisoning Attacks | Unknown | N/A | |
| How Much Can CLIP Benefit Vision-and-Language Tasks? | Unknown | N/A | |
| Learning Causal Models from Conditional Moment Restrictions by Importance Weighting | Unknown | N/A | |
| Learning Optimal Conformal Classifiers | Unknown | N/A | |
| Self-Supervision Enhanced Feature Selection with Correlated Gates | Unknown | N/A | |
| Understanding the Variance Collapse of SVGD in High Dimensions | Unknown | N/A | |
| Information Bottleneck: Exact Analysis of (Quantized) Neural Networks | Unknown | N/A | |
| CoST: Contrastive Learning of Disentangled Seasonal-Trend Representations for Time Series Forecasting | Unknown | N/A | |
| Object Dynamics Distillation for Scene Decomposition and Representation | Unknown | N/A | |
| $\beta$-Intact-VAE: Identifying and Estimating Causal Effects under Limited Overlap | Unknown | N/A | |
| Augmented Sliced Wasserstein Distances | Unknown | N/A | |
| Neural Processes with Stochastic Attention: Paying more attention to the context dataset | Unknown | N/A | |
| Learnability of convolutional neural networks for infinite dimensional input via mixed and anisotropic smoothness | Unknown | N/A | |
| Towards General Function Approximation in Zero-Sum Markov Games | Unknown | N/A | |
| Energy-Based Learning for Cooperative Games, with Applications to Valuation Problems in Machine Learning | Unknown | N/A | |
| Counterfactual Plans under Distributional Ambiguity | Unknown | N/A | |
| Boosted Curriculum Reinforcement Learning | Unknown | N/A | |
| TAda! Temporally-Adaptive Convolutions for Video Understanding | Unknown | N/A | |
| Towards Training Billion Parameter Graph Neural Networks for Atomic Simulations | Unknown | N/A | |
| $\mathrm{SO}(2)$-Equivariant Reinforcement Learning | Unknown | N/A | |
| Online Continual Learning on Class Incremental Blurry Task Configuration with Anytime Inference | Unknown | N/A | |
| Bayesian Modeling and Uncertainty Quantification for Learning to Optimize: What, Why, and How | Unknown | N/A | |
| Equivariant Graph Mechanics Networks with Constraints | Unknown | N/A | |
| THOMAS: Trajectory Heatmap Output with learned Multi-Agent Sampling | Unknown | N/A | |
| SPIRAL: Self-supervised Perturbation-Invariant Representation Learning for Speech Pre-Training | Unknown | N/A | |
| Understanding Dimensional Collapse in Contrastive Self-supervised Learning | Unknown | N/A | |
| Constrained Physical-Statistics Models for Dynamical System Identification and Prediction | Unknown | N/A | |
| Anti-Concentrated Confidence Bonuses For Scalable Exploration | Unknown | N/A | |
| Learning Distributionally Robust Models at Scale via Composite Optimization | Unknown | N/A | |
| On Predicting Generalization using GANs | Unknown | N/A | |
| Offline Neural Contextual Bandits: Pessimism, Optimization and Generalization | Unknown | N/A | |
| Deep Point Cloud Reconstruction | Unknown | N/A | |
| Graph-Augmented Normalizing Flows for Anomaly Detection of Multiple Time Series | Unknown | N/A | |
| Half-Inverse Gradients for Physical Deep Learning | Unknown | N/A | |
| Capacity of Group-invariant Linear Readouts from Equivariant Representations: How Many Objects can be Linearly Classified Under All Possible Views? | Unknown | N/A | |
| On the relation between statistical learning and perceptual distances | Unknown | N/A | |
| Graph-based Nearest Neighbor Search in Hyperbolic Spaces | Unknown | N/A | |
| Generalized rectifier wavelet covariance models for texture synthesis | Unknown | N/A | |
| Learning 3D Representations of Molecular Chirality with Invariance to Bond Rotations | Unknown | N/A | |
| PSA-GAN: Progressive Self Attention GANs for Synthetic Time Series | Unknown | N/A | |
| Variational Neural Cellular Automata | Unknown | N/A | |
| On the Pitfalls of Heteroscedastic Uncertainty Estimation with Probabilistic Neural Networks | Unknown | N/A | |
| Generalization Through the Lens of Leave-One-Out Error | Unknown | N/A | |
| Variational methods for simulation-based inference | Unknown | N/A | |
| Diurnal or Nocturnal? Federated Learning of Multi-branch Networks from Periodically Shifting Distributions | Unknown | N/A | |
| Value Function Spaces: Skill-Centric State Abstractions for Long-Horizon Reasoning | Unknown | N/A | |
| Practical Conditional Neural Process Via Tractable Dependent Predictions | Unknown | N/A | |
| CodeTrek: Flexible Modeling of Code using an Extensible Relational Representation | Unknown | N/A | |
| Selective Ensembles for Consistent Predictions | Unknown | N/A | |
| Multi-Stage Episodic Control for Strategic Exploration in Text Games | Unknown | N/A | |
| Surreal-GAN: Semi-Supervised Representation Learning via GAN for uncovering heterogeneous disease-related imaging patterns | Unknown | N/A | |
| A First-Occupancy Representation for Reinforcement Learning | Unknown | N/A | |
| Space-Time Graph Neural Networks | Unknown | N/A | |
| Offline Reinforcement Learning with Value-based Episodic Memory | Unknown | N/A | |
| The Boltzmann Policy Distribution: Accounting for Systematic Suboptimality in Human Models | Unknown | N/A | |
| Defending Against Image Corruptions Through Adversarial Augmentations | Unknown | N/A | |
| Object Pursuit: Building a Space of Objects via Discriminative Weight Generation | Unknown | N/A | |
| The Spectral Bias of Polynomial Neural Networks | Unknown | N/A | |
| Synchromesh: Reliable Code Generation from Pre-trained Language Models | Unknown | N/A | |
| Learning Audio-Visual Speech Representation by Masked Multimodal Cluster Prediction | Unknown | N/A | |
| Fast topological clustering with Wasserstein distance | Unknown | N/A | |
| Skill-based Meta-Reinforcement Learning | Unknown | N/A | |
| Differentiable Scaffolding Tree for Molecule Optimization | Unknown | N/A | |
| Revisiting Design Choices in Offline Model Based Reinforcement Learning | Unknown | N/A | |
| An Experimental Design Perspective on Model-Based Reinforcement Learning | Unknown | N/A | |
| The Evolution of Uncertainty of Learning in Games | Unknown | N/A | |
| CoMPS: Continual Meta Policy Search | Unknown | N/A | |
| Reinforcement Learning in Presence of Discrete Markovian Context Evolution | Unknown | N/A | |
| Provable Learning-based Algorithm For Sparse Recovery | Unknown | N/A | |
| Granger causal inference on DAGs identifies genomic loci regulating transcription | Unknown | N/A | |
| Curriculum learning as a tool to uncover learning principles in the brain | Unknown | N/A | |
| Learning Temporally Causal Latent Processes from General Temporal Data | Unknown | N/A | |
| Autoregressive Quantile Flows for Predictive Uncertainty Estimation | Unknown | N/A | |
| Bi-linear Value Networks for Multi-goal Reinforcement Learning | Unknown | N/A | |
| Deep ReLU Networks Preserve Expected Length | Unknown | N/A | |
| Chaos is a Ladder: A New Theoretical Understanding of Contrastive Learning via Augmentation Overlap | Unknown | N/A | |
| HyperDQN: A Randomized Exploration Method for Deep Reinforcement Learning | Unknown | N/A | |
| What Do We Mean by Generalization in Federated Learning? | Unknown | N/A | |
| A Comparison of Hamming Errors of Representative Variable Selection Methods | Unknown | N/A | |
| Model Zoo: A Growing Brain That Learns Continually | Unknown | N/A | |
| Distributional Reinforcement Learning with Monotonic Splines | Unknown | N/A | |
| Anisotropic Random Feature Regression in High Dimensions | Unknown | N/A | |
| Modeling Label Space Interactions in Multi-label Classification using Box Embeddings | Unknown | N/A | |
| Variational Predictive Routing with Nested Subjective Timescales | Unknown | N/A | |
| Training invariances and the low-rank phenomenon: beyond linear networks | Unknown | N/A | |
| A Non-Parametric Regression Viewpoint: Generalization of Overparametrized Deep ReLU Network under Noisy Observations | Unknown | N/A | |
| Top-N: Equivariant Set and Graph Generation without Exchangeability | Unknown | N/A | |
| LFPT5: A Unified Framework for Lifelong Few-shot Language Learning Based on Prompt Tuning of T5 | Unknown | N/A | |
| How Does SimSiam Avoid Collapse Without Negative Samples? A Unified Understanding with Self-supervised Contrastive Learning | Unknown | N/A | |
| FedChain: Chained Algorithms for Near-optimal Communication Cost in Federated Learning | Unknown | N/A | |
| Eigencurve: Optimal Learning Rate Schedule for SGD on Quadratic Objectives with Skewed Hessian Spectrums | Unknown | N/A | |
| Reinforcement Learning under a Multi-agent Predictive State Representation Model: Method and Theory | Unknown | N/A | |
| Unsupervised Learning of Full-Waveform Inversion: Connecting CNN and Partial Differential Equation in a Loop | Unknown | N/A | |
| Representational Continuity for Unsupervised Continual Learning | Unknown | N/A | |
| Context-Aware Sparse Deep Coordination Graphs | Unknown | N/A | |
| The Effects of Reward Misspecification: Mapping and Mitigating Misaligned Models | Unknown | N/A | |
| Pixelated Butterfly: Simple and Efficient Sparse training for Neural Network Models | Unknown | N/A | |
| Towards Understanding Generalization via Decomposing Excess Risk Dynamics | Unknown | N/A | |
| Chunked Autoregressive GAN for Conditional Waveform Synthesis | Unknown | N/A | |
| Topological Experience Replay | Unknown | N/A | |
| Generalisation in Lifelong Reinforcement Learning through Logical Composition | Unknown | N/A | |
| Learning Fast, Learning Slow: A General Continual Learning Method based on Complementary Learning System | Unknown | N/A | |
| Discovering Invariant Rationales for Graph Neural Networks | Unknown | N/A | |
| DiffSkill: Skill Abstraction from Differentiable Physics for Deformable Object Manipulations with Tools | Unknown | N/A | |
| Encoding Weights of Irregular Sparsity for Fixed-to-Fixed Model Compression | Unknown | N/A | |
| Normalization of Language Embeddings for Cross-Lingual Alignment | Unknown | N/A | |
| Machine Learning For Elliptic PDEs: Fast Rate Generalization Bound, Neural Scaling Law and Minimax Optimality | Unknown | N/A | |
| Do We Need Anisotropic Graph Neural Networks? | Unknown | N/A | |
| Learning to Guide and to be Guided in the Architect-Builder Problem | Unknown | N/A | |
| Escaping limit cycles: Global convergence for constrained nonconvex-nonconcave minimax problems | Unknown | N/A | |
| Towards a Unified View of Parameter-Efficient Transfer Learning | Unknown | N/A | |
| Few-Shot Backdoor Attacks on Visual Object Tracking | Unknown | N/A | |
| Relational Learning with Variational Bayes | Unknown | N/A | |
| Scattering Networks on the Sphere for Scalable and Rotationally Equivariant Spherical CNNs | Unknown | N/A | |
| GATSBI: Generative Adversarial Training for Simulation-Based Inference | Unknown | N/A | |
| Multiset-Equivariant Set Prediction with Approximate Implicit Differentiation | Unknown | N/A | |
| Increasing the Cost of Model Extraction with Calibrated Proof of Work | Unknown | N/A | |
| BAM: Bayes with Adaptive Memory | Unknown | N/A | |
| Open-Set Recognition: A Good Closed-Set Classifier is All You Need | Unknown | N/A | |
| Is High Variance Unavoidable in RL? A Case Study in Continuous Control | Unknown | N/A | |
| Predicting Physics in Mesh-reduced Space with Temporal Attention | Unknown | N/A | |
| Constructing a Good Behavior Basis for Transfer using Generalized Policy Updates | Unknown | N/A | |
| Lipschitz-constrained Unsupervised Skill Discovery | Unknown | N/A | |
| Task Relatedness-Based Generalization Bounds for Meta Learning | Unknown | N/A | |
| Reinforcement Learning with Sparse Rewards using Guidance from Offline Demonstration | Unknown | N/A | |
| Universal Approximation Under Constraints is Possible with Transformers | Unknown | N/A | |
| Unraveling Model-Agnostic Meta-Learning via The Adaptation Learning Rate | Unknown | N/A | |
| LORD: Lower-Dimensional Embedding of Log-Signature in Neural Rough Differential Equations | Unknown | N/A | |
| Improving Federated Learning Face Recognition via Privacy-Agnostic Clusters | Unknown | N/A | |
| On the Learning and Learnability of Quasimetrics | Unknown | N/A | |
| Efficient Learning of Safe Driving Policy via Human-AI Copilot Optimization | Unknown | N/A | |
| Disentanglement Analysis with Partial Information Decomposition | Unknown | N/A | |
| Emergent Communication at Scale | Unknown | N/A | |
| Associated Learning: an Alternative to End-to-End Backpropagation that Works on CNN, RNN, and Transformer | Unknown | N/A | |
| Optimal Transport for Long-Tailed Recognition with Learnable Cost Matrix | Unknown | N/A | |
| Adversarially Robust Conformal Prediction | Unknown | N/A | |
| Generative Pseudo-Inverse Memory | Unknown | N/A | |
| Improving the Accuracy of Learning Example Weights for Imbalance Classification | Unknown | N/A | |
| Learning meta-features for AutoML | Unknown | N/A | |
| Denoising Likelihood Score Matching for Conditional Score-based Data Generation | Unknown | N/A | |
| Meta Discovery: Learning to Discover Novel Classes given Very Limited Data | Unknown | N/A | |
| Neural Markov Controlled SDE: Stochastic Optimization for Continuous-Time Data | Unknown | N/A | |
| A New Perspective on "How Graph Neural Networks Go Beyond Weisfeiler-Lehman?" | Unknown | N/A | |
| Representation-Agnostic Shape Fields | Unknown | N/A | |
| MetaShift: A Dataset of Datasets for Evaluating Contextual Distribution Shifts and Training Conflicts | Unknown | N/A | |
| Looking Back on Learned Experiences For Class/task Incremental Learning | Unknown | N/A | |
| Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable | Unknown | N/A | |
| Acceleration of Federated Learning with Alleviated Forgetting in Local Training | Unknown | N/A | |
| Delaunay Component Analysis for Evaluation of Data Representations | Unknown | N/A | |
| Equivariant Transformers for Neural Network based Molecular Potentials | Unknown | N/A | |
| Long Expressive Memory for Sequence Modeling | Unknown | N/A | |
| Training Transition Policies via Distribution Matching for Complex Tasks | Unknown | N/A | |
| Spherical Message Passing for 3D Molecular Graphs | Unknown | N/A | |
| Wisdom of Committees: An Overlooked Approach To Faster and More Accurate Models | Unknown | N/A | |
| Eliminating Sharp Minima from SGD with Truncated Heavy-tailed Noise | Unknown | N/A | |
| Large-Scale Representation Learning on Graphs via Bootstrapping | Unknown | N/A | |
| VAE Approximation Error: ELBO and Exponential Families | Unknown | N/A | |
| Label Leakage and Protection in Two-party Split Learning | Unknown | N/A | |
| Online Adversarial Attacks | Unknown | N/A | |
| Discrepancy-Based Active Learning for Domain Adaptation | Unknown | N/A | |
| Bootstrapping Semantic Segmentation with Regional Contrast | Unknown | N/A | |
| Gradient Matching for Domain Generalization | Unknown | N/A | |
| Open-vocabulary Object Detection via Vision and Language Knowledge Distillation | Unknown | N/A | |
| DISSECT: Disentangled Simultaneous Explanations via Concept Traversals | Unknown | N/A | |
| Exploring Memorization in Adversarial Training | Unknown | N/A | |
| When Vision Transformers Outperform ResNets without Pre-training or Strong Data Augmentations | Unknown | N/A | |
| NODE-GAM: Neural Generalized Additive Model for Interpretable Deep Learning | Unknown | N/A | |
| Cross-Trajectory Representation Learning for Zero-Shot Generalization in RL | Unknown | N/A | |
| Incremental False Negative Detection for Contrastive Learning | Unknown | N/A | |
| Learning Curves for SGD on Structured Features | Unknown | N/A | |
| AdaMatch: A Unified Approach to Semi-Supervised Learning and Domain Adaptation | Unknown | N/A | |
| Generative Models as a Data Source for Multiview Representation Learning | Unknown | N/A | |
| Geometry-Consistent Neural Shape Representation with Implicit Displacement Fields | Unknown | N/A | |
| Reliable Adversarial Distillation with Unreliable Teachers | Unknown | N/A | |
| Automated Self-Supervised Learning for Graphs | Unknown | N/A | |
| Stein Latent Optimization for Generative Adversarial Networks | Unknown | N/A | |
| Is Homophily a Necessity for Graph Neural Networks? | Unknown | N/A | |
| Boosting Randomized Smoothing with Variance Reduced Classifiers | Unknown | N/A | |
| Do Not Escape From the Manifold: Discovering the Local Coordinates on the Latent Space of GANs | Unknown | N/A | |
| Query Embedding on Hyper-Relational Knowledge Graphs | Unknown | N/A | |
| Steerable Partial Differential Operators for Equivariant Neural Networks | Unknown | N/A | |
| Learning Multimodal VAEs through Mutual Supervision | Unknown | N/A | |
| Charformer: Fast Character Transformers via Gradient-based Subword Tokenization | Unknown | N/A | |
| Deep Ensembling with No Overhead for either Training or Testing: The All-Round Blessings of Dynamic Sparsity | Unknown | N/A | |
| The MultiBERTs: BERT Reproductions for Robustness Analysis | Unknown | N/A | |
| ViTGAN: Training GANs with Vision Transformers | Unknown | N/A | |
| Hidden Convexity of Wasserstein GANs: Interpretable Generative Models with Closed-Form Solutions | Unknown | N/A | |
| TAPEX: Table Pre-training via Learning a Neural SQL Executor | Unknown | N/A | |
| Mastering Visual Continuous Control: Improved Data-Augmented Reinforcement Learning | Unknown | N/A | |
| CycleMLP: A MLP-like Architecture for Dense Prediction | Unknown | N/A | |
| Heteroscedastic Temporal Variational Autoencoder For Irregularly Sampled Time Series | Unknown | N/A | |
| Perceiver IO: A General Architecture for Structured Inputs & Outputs | Unknown | N/A | |
| SphereFace2: Binary Classification is All You Need for Deep Face Recognition | Unknown | N/A | |
| Policy Gradients Incorporating the Future | Unknown | N/A | |
| SimVLM: Simple Visual Language Model Pretraining with Weak Supervision | Unknown | N/A | |
| Differentiable Prompt Makes Pre-trained Language Models Better Few-shot Learners | Unknown | N/A | |
| NASI: Label- and Data-agnostic Neural Architecture Search at Initialization | Unknown | N/A | |
| How to Inject Backdoors with Better Consistency: Logit Anchoring on Clean Data | Unknown | N/A | |
| Divisive Feature Normalization Improves Image Recognition Performance in AlexNet | Unknown | N/A | |
| Finetuned Language Models are Zero-Shot Learners | Unknown | N/A | |
| CDTrans: Cross-domain Transformer for Unsupervised Domain Adaptation | Unknown | N/A | |
| Should We Be Pre-training? An Argument for End-task Aware Training as an Alternative | Unknown | N/A | |
| Hindsight Foresight Relabeling for Meta-Reinforcement Learning | Unknown | N/A | |
| Trust Region Policy Optimisation in Multi-Agent Reinforcement Learning | Unknown | N/A | |
| Learning to Downsample for Segmentation of Ultra-High Resolution Images | Unknown | N/A | |
| Vitruvion: A Generative Model of Parametric CAD Sketches | Unknown | N/A | |
| IGLU: Efficient GCN Training via Lazy Updates | Unknown | N/A | |
| Random matrices in service of ML footprint: ternary random features with no performance loss | Unknown | N/A | |
| Geometric and Physical Quantities improve E(3) Equivariant Message Passing | Unknown | N/A | |
| PoNet: Pooling Network for Efficient Token Mixing in Long Sequences | Unknown | N/A | |
| Focus on the Common Good: Group Distributional Robustness Follows | Unknown | N/A | |
| Task Affinity with Maximum Bipartite Matching in Few-Shot Learning | Unknown | N/A | |
| 8-bit Optimizers via Block-wise Quantization | Unknown | N/A | |
| Graphon based Clustering and Testing of Networks: Algorithms and Theory | Unknown | N/A | |
| The Information Geometry of Unsupervised Reinforcement Learning | Unknown | N/A | |
| VC dimension of partially quantized neural networks in the overparametrized regime | Unknown | N/A | |
| EntQA: Entity Linking as Question Answering | Unknown | N/A | |
| Evaluating Model-Based Planning and Planner Amortization for Continuous Control | Unknown | N/A | |
| GNN is a Counter? Revisiting GNN for Question Answering | Unknown | N/A | |
| Gradient Step Denoiser for convergent Plug-and-Play | Unknown | N/A | |
| Planning in Stochastic Environments with a Learned Model | Unknown | N/A | |
| Creating Training Sets via Weak Indirect Supervision | Unknown | N/A | |
| Frame Averaging for Invariant and Equivariant Network Design | Unknown | N/A | |
| Taming Sparsely Activated Transformer with Stochastic Experts | Unknown | N/A | |
| From Stars to Subgraphs: Uplifting Any GNN with Local Structure Awareness | Unknown | N/A | |
| On Non-Random Missing Labels in Semi-Supervised Learning | Unknown | N/A | |
| PipeGCN: Efficient Full-Graph Training of Graph Convolutional Networks with Pipelined Feature Communication | Unknown | N/A | |
| Query Efficient Decision Based Sparse Attacks Against Black-Box Deep Learning Models | Unknown | N/A | |
| Which Shortcut Cues Will DNNs Choose? A Study from the Parameter-Space Perspective | Unknown | N/A | |
| EViT: Expediting Vision Transformers via Token Reorganizations | Unknown | N/A | |
| Understanding Intrinsic Robustness Using Label Uncertainty | Unknown | N/A | |
| Label Encoding for Regression Networks | Unknown | N/A | |
| Knowledge Infused Decoding | Unknown | N/A | |
| Deep Learning without Shortcuts: Shaping the Kernel with Tailored Rectifiers | Unknown | N/A | |
| Provably Robust Adversarial Examples | Unknown | N/A | |
| A Tale of Two Flows: Cooperative Learning of Langevin Flow and Normalizing Flow Toward Energy-Based Model | Unknown | N/A | |
| Sparsity Winning Twice: Better Robust Generalization from More Efficient Training | Unknown | N/A | |
| Measuring the Interpretability of Unsupervised Representations via Quantized Reversed Probing | Unknown | N/A | |
| Conditional Image Generation by Conditioning Variational Auto-Encoders | Unknown | N/A | |
| Post hoc Explanations may be Ineffective for Detecting Unknown Spurious Correlation | Unknown | N/A | |
| Generalized Decision Transformer for Offline Hindsight Information Matching | Unknown | N/A | |
| SUMNAS: Supernet with Unbiased Meta-Features for Neural Architecture Search | Unknown | N/A | |
| CrossMatch: Cross-Classifier Consistency Regularization for Open-Set Single Domain Generalization | Unknown | N/A | |
| Imitation Learning from Observations under Transition Model Disparity | Unknown | N/A | |
| Lossy Compression with Distribution Shift as Entropy Constrained Optimal Transport | Unknown | N/A | |
| How Did the Model Change? Efficiently Assessing Machine Learning API Shifts | Unknown | N/A | |
| Semi-relaxed Gromov-Wasserstein divergence and applications on graphs | Unknown | N/A | |
| On Bridging Generic and Personalized Federated Learning for Image Classification | Unknown | N/A | |
| Few-shot Learning via Dirichlet Tessellation Ensemble | Unknown | N/A | |
| Learning to Dequantise with Truncated Flows | Unknown | N/A | |
| Latent Image Animator: Learning to Animate Images via Latent Space Navigation | Unknown | N/A | |
| FALCON: Fast Visual Concept Learning by Integrating Images, Linguistic descriptions, and Conceptual Relations | Unknown | N/A | |
| Superclass-Conditional Gaussian Mixture Model For Learning Fine-Grained Embeddings | Unknown | N/A | |
| Unsupervised Vision-Language Grammar Induction with Shared Structure Modeling | Unknown | N/A | |
| Environment Predictive Coding for Visual Navigation | Unknown | N/A | |
| An Agnostic Approach to Federated Learning with Class Imbalance | Unknown | N/A | |
| Vision-Based Manipulators Need to Also See from Their Hands | Unknown | N/A | |
| A Program to Build E(N)-Equivariant Steerable CNNs | Unknown | N/A | |
| Deconstructing the Inductive Biases of Hamiltonian Neural Networks | Unknown | N/A | |
| A Zest of LIME: Towards Architecture-Independent Model Distances | Unknown | N/A | |
| W-CTC: a Connectionist Temporal Classification Loss with Wild Cards | Unknown | N/A | |
| DictFormer: Tiny Transformer with Shared Dictionary | Unknown | N/A | |
| PolyLoss: A Polynomial Expansion Perspective of Classification Loss Functions | Unknown | N/A | |
| RelViT: Concept-guided Vision Transformer for Visual Relational Reasoning | Unknown | N/A | |
| Equivariant Self-Supervised Learning: Encouraging Equivariance in Representations | Unknown | N/A | |
| iLQR-VAE: control-based learning of input-driven dynamics with applications to neural data | Unknown | N/A | |
| Information Gain Propagation: a New Way to Graph Active Learning with Soft Labels | Unknown | N/A | |
| Efficient Token Mixing for Transformers via Adaptive Fourier Neural Operators | Unknown | N/A | |
| Compositional Training for End-to-End Deep AUC Maximization | Unknown | N/A | |
| FedBABU: Toward Enhanced Representation for Federated Image Classification | Unknown | N/A | |
| Unsupervised Semantic Segmentation by Distilling Feature Correspondences | Unknown | N/A | |
| Efficient and Differentiable Conformal Prediction with General Function Classes | Unknown | N/A | |
| Pseudo Numerical Methods for Diffusion Models on Manifolds | Unknown | N/A | |
| Understanding Latent Correlation-Based Multiview Learning and Self-Supervision: An Identifiability Perspective | Unknown | N/A | |
| Learning Weakly-supervised Contrastive Representations | Unknown | N/A | |
| Fast Model Editing at Scale | Unknown | N/A | |
| DEGREE: Decomposition Based Explanation for Graph Neural Networks | Unknown | N/A | |
| Generalizing Few-Shot NAS with Gradient Matching | Unknown | N/A | |
| Sound Adversarial Audio-Visual Navigation | Unknown | N/A | |
| The Neural Data Router: Adaptive Control Flow in Transformers Improves Systematic Generalization | Unknown | N/A | |
| Illiterate DALL-E Learns to Compose | Unknown | N/A | |
| Pretraining Text Encoders with Adversarial Mixture of Training Signal Generators | Unknown | N/A | |
| Hindsight is 20/20: Leveraging Past Traversals to Aid 3D Perception | Unknown | N/A | |
| Linking Emergent and Natural Languages via Corpus Transfer | Unknown | N/A | |
| Know Thyself: Transferable Visual Control Policies Through Robot-Awareness | Unknown | N/A | |
| PiCO: Contrastive Label Disambiguation for Partial Label Learning | Unknown | N/A | |
| ZeroFL: Efficient On-Device Training for Federated Learning with Local Sparsity | Unknown | N/A | |
| Reversible Instance Normalization for Accurate Time-Series Forecasting against Distribution Shift | Unknown | N/A | |
| Model-augmented Prioritized Experience Replay | Unknown | N/A | |
| Bridging Recommendation and Marketing via Recurrent Intensity Modeling | Unknown | N/A | |
| Continual Learning with Filter Atom Swapping | Unknown | N/A | |
| One After Another: Learning Incremental Skills for a Changing World | Unknown | N/A | |
| DemoDICE: Offline Imitation Learning with Supplementary Imperfect Demonstrations | Unknown | N/A | |
| Learning to Schedule Learning rate with Graph Neural Networks | Unknown | N/A | |
| How Do Vision Transformers Work? | Unknown | N/A | |
| BDDM: Bilateral Denoising Diffusion Models for Fast and High-Quality Speech Synthesis | Unknown | N/A | |
| Recycling Model Updates in Federated Learning: Are Gradient Subspaces Low-Rank? | Unknown | N/A | |
| On Incorporating Inductive Biases into VAEs | Unknown | N/A | |
| Learning Generalizable Representations for Reinforcement Learning via Adaptive Meta-learner of Behavioral Similarities | Unknown | N/A | |
| Rethinking Network Design and Local Geometry in Point Cloud: A Simple Residual MLP Framework | Unknown | N/A | |
| MCMC Should Mix: Learning Energy-Based Model with Neural Transport Latent Space MCMC | Unknown | N/A | |
| Orchestrated Value Mapping for Reinforcement Learning | Unknown | N/A | |
| Gradient Information Matters in Policy Optimization by Back-propagating through Model | Unknown | N/A | |
| Robust Unlearnable Examples: Protecting Data Privacy Against Adversarial Learning | Unknown | N/A | |
| Evaluating Distributional Distortion in Neural Language Modeling | Unknown | N/A | |
| Mapping Language Models to Grounded Conceptual Spaces | Unknown | N/A | |
| UniFormer: Unified Transformer for Efficient Spatial-Temporal Representation Learning | Unknown | N/A | |
| On Robust Prefix-Tuning for Text Classification | Unknown | N/A | |
| Regularized Autoencoders for Isometric Representation Learning | Unknown | N/A | |
| Learning to Generalize across Domains on Single Test Samples | Unknown | N/A | |
| Relational Multi-Task Learning: Modeling Relations between Data and Tasks | Unknown | N/A | |
| ConFeSS: A Framework for Single Source Cross-Domain Few-Shot Learning | Unknown | N/A | |
| GeneDisco: A Benchmark for Experimental Design in Drug Discovery | Unknown | N/A | |
| Scene Transformer: A unified architecture for predicting future trajectories of multiple agents | Unknown | N/A | |
| Simple GNN Regularisation for 3D Molecular Property Prediction and Beyond | Unknown | N/A | |
| How Well Does Self-Supervised Pre-Training Perform with Streaming Data? | Unknown | N/A | |
| Learning Efficient Image Super-Resolution Networks via Structure-Regularized Pruning | Unknown | N/A | |
| Group-based Interleaved Pipeline Parallelism for Large-scale DNN Training | Unknown | N/A | |
| Practical Integration via Separable Bijective Networks | Unknown | N/A | |
| Switch to Generalize: Domain-Switch Learning for Cross-Domain Few-Shot Classification | Unknown | N/A | |
| Local Feature Swapping for Generalization in Reinforcement Learning | Unknown | N/A | |
| Language modeling via stochastic processes | Unknown | N/A | |
| Fine-Tuning can Distort Pretrained Features and Underperform Out-of-Distribution | Unknown | N/A | |
| Filling the G_ap_s: Multivariate Time Series Imputation by Graph Neural Networks | Unknown | N/A | |
| F8Net: Fixed-Point 8-bit Only Multiplication for Network Quantization | Unknown | N/A | |
| Neural Relational Inference with Node-Specific Information | Unknown | N/A | |
| DARA: Dynamics-Aware Reward Augmentation in Offline Reinforcement Learning | Unknown | N/A | |
| Training Data Generating Networks: Shape Reconstruction via Bi-level Optimization | Unknown | N/A | |
| GraphENS: Neighbor-Aware Ego Network Synthesis for Class-Imbalanced Node Classification | Unknown | N/A | |
| Retriever: Learning Content-Style Representation as a Token-Level Bipartite Graph | Unknown | N/A | |
| Using Graph Representation Learning with Schema Encoders to Measure the Severity of Depressive Symptoms | Unknown | N/A | |
| Hot-Refresh Model Upgrades with Regression-Free Compatible Training in Image Retrieval | Unknown | N/A | |
| Graph Auto-Encoder via Neighborhood Wasserstein Reconstruction | Unknown | N/A | |
| Neural Deep Equilibrium Solvers | Unknown | N/A | |
| Relational Surrogate Loss Learning | Unknown | N/A | |
| Graph-Enhanced Exploration for Goal-oriented Reinforcement Learning | Unknown | N/A | |
| Conditioning Sequence-to-sequence Networks with Learned Activations | Unknown | N/A | |
| Inductive Relation Prediction Using Analogy Subgraph Embeddings | Unknown | N/A | |
| Continual Normalization: Rethinking Batch Normalization for Online Continual Learning | Unknown | N/A | |
| Divergence-aware Federated Self-Supervised Learning | Unknown | N/A | |
| Controlling the Complexity and Lipschitz Constant improves Polynomial Nets | Unknown | N/A | |
| Generative Principal Component Analysis | Unknown | N/A | |
| Learning Towards The Largest Margins | Unknown | N/A | |
| Iterative Refinement Graph Neural Network for Antibody Sequence-Structure Co-design | Unknown | N/A | |
| Domain Adversarial Training: A Game Perspective | Unknown | N/A | |
| On the Uncomputability of Partition Functions in Energy-Based Sequence Models | Unknown | N/A | |
| Towards Model Agnostic Federated Learning Using Knowledge Distillation | Unknown | N/A | |
| Minimax Optimization with Smooth Algorithmic Adversaries | Unknown | N/A | |
| A Loss Curvature Perspective on Training Instabilities of Deep Learning Models | Unknown | N/A | |
| Backdoor Defense via Decoupling the Training Process | Unknown | N/A | |
| Learnability Lock: Authorized Learnability Control Through Adversarial Invertible Transformations | Unknown | N/A | |
| On Redundancy and Diversity in Cell-based Neural Architecture Search | Unknown | N/A | |
| Coordination Among Neural Modules Through a Shared Global Workspace | Unknown | N/A | |
| RotoGrad: Gradient Homogenization in Multitask Learning | Unknown | N/A | |
| cosFormer: Rethinking Softmax In Attention | Unknown | N/A | |
| Exploring the Limits of Large Scale Pre-training | Unknown | N/A | |
| Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning | Unknown | N/A | |
| KL Guided Domain Adaptation | Unknown | N/A | |
| ToM2C: Target-oriented Multi-agent Communication and Cooperation with Theory of Mind | Unknown | N/A | |
| Map Induction: Compositional spatial submap learning for efficient exploration in novel environments | Unknown | N/A | |
| Proving the Lottery Ticket Hypothesis for Convolutional Neural Networks | Unknown | N/A | |
| Natural Posterior Network: Deep Bayesian Predictive Uncertainty for Exponential Family Distributions | Unknown | N/A | |
| Meta-Learning with Fewer Tasks through Task Interpolation | Unknown | N/A | |
| Transformer-based Transform Coding | Unknown | N/A | |
| BEiT: BERT Pre-Training of Image Transformers | Unknown | N/A | |
| Zero Pixel Directional Boundary by Vector Transform | Unknown | N/A | |
| Gaussian Mixture Convolution Networks | Unknown | N/A | |
| Image BERT Pre-training with Online Tokenizer | Unknown | N/A | |
| Online Coreset Selection for Rehearsal-based Continual Learning | Unknown | N/A | |
| Non-Linear Operator Approximations for Initial Value Problems | Unknown | N/A | |
| Better Supervisory Signals by Observing Learning Paths | Unknown | N/A | |
| Bag of Instances Aggregation Boosts Self-supervised Distillation | Unknown | N/A | |
| Omni-Dimensional Dynamic Convolution | Unknown | N/A | |
| Learning State Representations via Retracing in Reinforcement Learning | Unknown | N/A | |
| The Unreasonable Effectiveness of Random Pruning: Return of the Most Naive Baseline for Sparse Training | Unknown | N/A | |
| Learning to Map for Active Semantic Goal Navigation | Unknown | N/A | |
| Transfer RL across Observation Feature Spaces via Model-Based Regularization | Unknown | N/A | |
| CROP: Certifying Robust Policies for Reinforcement Learning through Functional Smoothing | Unknown | N/A | |
| Adversarial Support Alignment | Unknown | N/A | |
| Boosting the Certified Robustness of L-infinity Distance Nets | Unknown | N/A | |
| Spanning Tree-based Graph Generation for Molecules | Unknown | N/A | |
| Complete Verification via Multi-Neuron Relaxation Guided Branch-and-Bound | Unknown | N/A | |
| Policy improvement by planning with Gumbel | Unknown | N/A | |
| Learning Scenario Representation for Solving Two-stage Stochastic Integer Programs | Unknown | N/A | |
| Measuring CLEVRness: Black-box Testing of Visual Reasoning Models | Unknown | N/A | |
| Scalable One-Pass Optimisation of High-Dimensional Weight-Update Hyperparameters by Implicit Differentiation | Unknown | N/A | |
| Subspace Regularizers for Few-Shot Class Incremental Learning | Unknown | N/A | |
| Structure-Aware Transformer Policy for Inhomogeneous Multi-Task Reinforcement Learning | Unknown | N/A | |
| Learning Super-Features for Image Retrieval | Unknown | N/A | |
| Auto-scaling Vision Transformers without Training | Unknown | N/A | |
| Minibatch vs Local SGD with Shuffling: Tight Convergence Bounds and Beyond | Unknown | N/A | |
| NASViT: Neural Architecture Search for Efficient Vision Transformers with Gradient Conflict aware Supernet Training | Unknown | N/A | |
| L0-Sparse Canonical Correlation Analysis | Unknown | N/A | |
| Exploiting Class Activation Value for Partial-Label Learning | Unknown | N/A | |
| Dual Lottery Ticket Hypothesis | Unknown | N/A | |
| Visual Representation Learning over Latent Domains | Unknown | N/A | |
| Towards Building A Group-based Unsupervised Representation Disentanglement Framework | Unknown | N/A | |
| Benchmarking the Spectrum of Agent Capabilities | Unknown | N/A | |
| Self-ensemble Adversarial Training for Improved Robustness | Unknown | N/A | |
| Learning Prototype-oriented Set Representations for Meta-Learning | Unknown | N/A | |
| Neural Program Synthesis with Query | Unknown | N/A | |
| Peek-a-Boo: What (More) is Disguised in a Randomly Weighted Neural Network, and How to Find It Efficiently | Unknown | N/A | |
| Coherence-based Label Propagation over Time Series for Accelerated Active Learning | Unknown | N/A | |
| Scaling Laws for Neural Machine Translation | Unknown | N/A | |
| Interacting Contour Stochastic Gradient Langevin Dynamics | Unknown | N/A | |
| GDA-AM: On the Effectiveness of Solving Minimax Optimization via Anderson Mixing | Unknown | N/A | |
| Minimax Optimality (Probably) Doesn't Imply Distribution Learning for GANs | Unknown | N/A | |
| Evolutionary Diversity Optimization with Clustering-based Selection for Reinforcement Learning | Unknown | N/A | |
| Adversarial Robustness Through the Lens of Causality | Unknown | N/A | |
| Stochastic Training is Not Necessary for Generalization | Unknown | N/A | |
| Conditional Object-Centric Learning from Video | Unknown | N/A | |
| Distributionally Robust Models with Parametric Likelihood Ratios | Unknown | N/A | |
| Connectome-constrained Latent Variable Model of Whole-Brain Neural Activity | Unknown | N/A | |
| Implicit Bias of MSE Gradient Optimization in Underparameterized Neural Networks | Unknown | N/A | |
| Igeood: An Information Geometry Approach to Out-of-Distribution Detection | Unknown | N/A | |
| The Rich Get Richer: Disparate Impact of Semi-Supervised Learning | Unknown | N/A | |
| GiraffeDet: A Heavy-Neck Paradigm for Object Detection | Unknown | N/A | |
| Generative Planning for Temporally Coordinated Exploration in Reinforcement Learning | Unknown | N/A | |
| Is Fairness Only Metric Deep? Evaluating and Addressing Subgroup Gaps in Deep Metric Learning | Unknown | N/A | |
| $\pi$BO: Augmenting Acquisition Functions with User Beliefs for Bayesian Optimization | Unknown | N/A | |
| R4D: Utilizing Reference Objects for Long-Range Distance Estimation | Unknown | N/A | |
| RegionViT: Regional-to-Local Attention for Vision Transformers | Unknown | N/A | |
| Efficient Computation of Deep Nonlinear Infinite-Width Neural Networks that Learn Features | Unknown | N/A | |
| Efficient Active Search for Combinatorial Optimization Problems | Unknown | N/A | |
| No One Representation to Rule Them All: Overlapping Features of Training Methods | Unknown | N/A | |
| Particle Stochastic Dual Coordinate Ascent: Exponential convergent algorithm for mean field neural network optimization | Unknown | N/A | |
| Deep Attentive Variational Inference | Unknown | N/A | |
| Towards Empirical Sandwich Bounds on the Rate-Distortion Function | Unknown | N/A | |
| Clean Images are Hard to Reblur: Exploiting the Ill-Posed Inverse Task for Dynamic Scene Deblurring | Unknown | N/A | |
| Data Efficient Language-Supervised Zero-Shot Recognition with Optimal Transport Distillation | Unknown | N/A | |
| Unsupervised Discovery of Object Radiance Fields | Unknown | N/A | |
| A Class of Short-term Recurrence Anderson Mixing Methods and Their Applications | Unknown | N/A | |
| Efficient Self-supervised Vision Transformers for Representation Learning | Unknown | N/A | |
| On the Role of Neural Collapse in Transfer Learning | Unknown | N/A | |
| Optimization and Adaptive Generalization of Three layer Neural Networks | Unknown | N/A | |
| On the Importance of Firth Bias Reduction in Few-Shot Classification | Unknown | N/A | |
| Memorizing Transformers | Unknown | N/A | |
| On the role of population heterogeneity in emergent communication | Unknown | N/A | |
| Plant 'n' Seek: Can You Find the Winning Ticket? | Unknown | N/A | |
| Neural Stochastic Dual Dynamic Programming | Unknown | N/A | |
| Discrete Representations Strengthen Vision Transformer Robustness | Unknown | N/A | |
| You are AllSet: A Multiset Function Framework for Hypergraph Neural Networks | Unknown | N/A | |
| Efficient Split-Mix Federated Learning for On-Demand and In-Situ Customization | Unknown | N/A | |
| GPT-Critic: Offline Reinforcement Learning for End-to-End Task-Oriented Dialogue Systems | Unknown | N/A | |
| Bootstrapped Meta-Learning | Unknown | N/A | |
| Pseudo-Labeled Auto-Curriculum Learning for Semi-Supervised Keypoint Localization | Unknown | N/A | |
| Efficient Sharpness-aware Minimization for Improved Training of Neural Networks | Unknown | N/A | |
| Bandit Learning with Joint Effect of Incentivized Sampling, Delayed Sampling Feedback, and Self-Reinforcing User Preferences | Unknown | N/A | |
| Efficient Neural Causal Discovery without Acyclicity Constraints | Unknown | N/A | |
| Path Auxiliary Proposal for MCMC in Discrete Space | Unknown | N/A | |
| PI3NN: Out-of-distribution-aware Prediction Intervals from Three Neural Networks | Unknown | N/A | |
| Approximation and Learning with Deep Convolutional Models: a Kernel Perspective | Unknown | N/A | |
| Fairness Guarantees under Demographic Shift | Unknown | N/A | |
| Do deep networks transfer invariances across classes? | Unknown | N/A | |
| Resolving Training Biases via Influence-based Data Relabeling | Unknown | N/A | |
| Pessimistic Model-based Offline Reinforcement Learning under Partial Coverage | Unknown | N/A | |
| On feature learning in neural networks with global convergence guarantees | Unknown | N/A | |
| Case-based reasoning for better generalization in textual reinforcement learning | Unknown | N/A | |
| Assessing Generalization of SGD via Disagreement | Unknown | N/A | |
| Churn Reduction via Distillation | Unknown | N/A | |
| Frequency-aware SGD for Efficient Embedding Learning with Provable Benefits | Unknown | N/A | |
| Finding an Unsupervised Image Segmenter in each of your Deep Generative Models | Unknown | N/A | |
| PAC Prediction Sets Under Covariate Shift | Unknown | N/A | |
| FP-DETR: Detection Transformer Advanced by Fully Pre-training | Unknown | N/A | |
| Generalized Kernel Thinning | Unknown | N/A | |
| Graph Neural Network Guided Local Search for the Traveling Salesperson Problem | Unknown | N/A | |
| Toward Efficient Low-Precision Training: Data Format Optimization and Hysteresis Quantization | Unknown | N/A | |
| Almost Tight L0-norm Certified Robustness of Top-k Predictions against Adversarial Perturbations | Unknown | N/A | |
| Amortized Implicit Differentiation for Stochastic Bilevel Optimization | Unknown | N/A | |
| Neural Models for Output-Space Invariance in Combinatorial Problems | Unknown | N/A | |
| Memory Augmented Optimizers for Deep Learning | Unknown | N/A | |
| Tracking the risk of a deployed model and detecting harmful distribution shifts | Unknown | N/A | |
| ExT5: Towards Extreme Multi-Task Scaling for Transfer Learning | Unknown | N/A | |
| Learning-Augmented $k$-means Clustering | Unknown | N/A | |
| Latent Variable Sequential Set Transformers for Joint Multi-Agent Motion Prediction | Unknown | N/A | |
| It Takes Four to Tango: Multiagent Self Play for Automatic Curriculum Generation | Unknown | N/A | |
| Wish you were here: Hindsight Goal Selection for long-horizon dexterous manipulation | Unknown | N/A | |
| Spike-inspired rank coding for fast and accurate recurrent neural networks | Unknown | N/A | |
| Attention-based Interpretability with Concept Transformers | Unknown | N/A | |
| Information-theoretic Online Memory Selection for Continual Learning | Unknown | N/A | |
| On Improving Adversarial Transferability of Vision Transformers | Unknown | N/A | |
| Programmatic Reinforcement Learning without Oracles | Unknown | N/A | |
| CADDA: Class-wise Automatic Differentiable Data Augmentation for EEG Signals | Unknown | N/A | |
| Understanding and Improving Graph Injection Attack by Promoting Unnoticeability | Unknown | N/A | |
| Learning transferable motor skills with hierarchical latent mixture policies | Unknown | N/A | |
| Responsible Disclosure of Generative Models Using Scalable Fingerprinting | Unknown | N/A | |
| A Fine-Tuning Approach to Belief State Modeling | Unknown | N/A | |
| Decentralized Learning for Overparameterized Problems: A Multi-Agent Kernel Approximation Approach | Unknown | N/A | |
| Learning to Annotate Part Segmentation with Gradient Matching | Unknown | N/A | |
| Collapse by Conditioning: Training Class-conditional GANs with Limited Data | Unknown | N/A | |
| Surrogate NAS Benchmarks: Going Beyond the Limited Search Spaces of Tabular NAS Benchmarks | Unknown | N/A | |
| DriPP: Driven Point Processes to Model Stimuli Induced Patterns in M/EEG Signals | Unknown | N/A | |
| Self-Supervised Graph Neural Networks for Improved Electroencephalographic Seizure Analysis | Unknown | N/A | |
| On the Certified Robustness for Ensemble Models and Beyond | Unknown | N/A | |
| Visual hyperacuity with moving sensor and recurrent neural computations | Unknown | N/A | |
| Implicit Bias of Projected Subgradient Method Gives Provable Robust Recovery of Subspaces of Unknown Codimension | Unknown | N/A | |
| CLEVA-Compass: A Continual Learning Evaluation Assessment Compass to Promote Research Transparency and Comparability | Unknown | N/A | |
| GRAND++: Graph Neural Diffusion with A Source Term | Unknown | N/A | |
| Can an Image Classifier Suffice For Action Recognition? | Unknown | N/A | |
| AS-MLP: An Axial Shifted MLP Architecture for Vision | Unknown | N/A | |
| Fairness in Representation for Multilingual NLP: Insights from Controlled Experiments on Conditional Language Modeling | Unknown | N/A | |
| MT3: Multi-Task Multitrack Music Transcription | Unknown | N/A | |
| Information Prioritization through Empowerment in Visual Model-based RL | Unknown | N/A | |
| VOS: Learning What You Don't Know by Virtual Outlier Synthesis | Unknown | N/A | |
| Reducing Excessive Margin to Achieve a Better Accuracy vs. Robustness Trade-off | Unknown | N/A | |
| Pre-training Molecular Graph Representation with 3D Geometry | Unknown | N/A | |
| A General Analysis of Example-Selection for Stochastic Gradient Descent | Unknown | N/A | |
| Omni-Scale CNNs: a simple and effective kernel size configuration for time series classification | Unknown | N/A | |
| Objects in Semantic Topology | Unknown | N/A | |
| Neural Spectral Marked Point Processes | Unknown | N/A | |
| Demystifying Limited Adversarial Transferability in Automatic Speech Recognition Systems | Unknown | N/A | |
| Label-Efficient Semantic Segmentation with Diffusion Models | Unknown | N/A | |
| EXACT: Scalable Graph Neural Networks Training via Extreme Activation Compression | Unknown | N/A | |
| Improving Mutual Information Estimation with Annealed and Energy-Based Bounds | Unknown | N/A | |
| An Autoregressive Flow Model for 3D Molecular Geometry Generation from Scratch | Unknown | N/A | |
| Learning Curves for Gaussian Process Regression with Power-Law Priors and Targets | Unknown | N/A | |
| Differentiable Gradient Sampling for Learning Implicit 3D Scene Reconstructions from a Single Image | Unknown | N/A | |
| Tighter Sparse Approximation Bounds for ReLU Neural Networks | Unknown | N/A | |
| The Uncanny Similarity of Recurrence and Depth | Unknown | N/A | |
| A Deep Variational Approach to Clustering Survival Data | Unknown | N/A | |
| Robust Learning Meets Generative Models: Can Proxy Distributions Improve Adversarial Robustness? | Unknown | N/A | |
| A Reduction-Based Framework for Conservative Bandits and Reinforcement Learning | Unknown | N/A | |
| Continuous-Time Meta-Learning with Forward Mode Differentiation | Unknown | N/A | |
| Geometric Transformers for Protein Interface Contact Prediction | Unknown | N/A | |
| Who Is Your Right Mixup Partner in Positive and Unlabeled Learning | Unknown | N/A | |
| Generative Modeling with Optimal Transport Maps | Unknown | N/A | |
| MaGNET: Uniform Sampling from Deep Generative Network Manifolds Without Retraining | Unknown | N/A | |
| Maximizing Ensemble Diversity in Deep Reinforcement Learning | Unknown | N/A | |
| ADAVI: Automatic Dual Amortized Variational Inference Applied To Pyramidal Bayesian Models | Unknown | N/A | |
| Monotonic Differentiable Sorting Networks | Unknown | N/A | |
| Interpretable Unsupervised Diversity Denoising and Artefact Removal | Unknown | N/A | |
| FedPara: Low-rank Hadamard Product for Communication-Efficient Federated Learning | Unknown | N/A | |
| Trivial or Impossible --- dichotomous data difficulty masks model differences (on ImageNet and beyond) | Unknown | N/A | |
| Neural Contextual Bandits with Deep Representation and Shallow Exploration | Unknown | N/A | |
| Learning Neural Contextual Bandits through Perturbed Rewards | Unknown | N/A | |
| Multi-Task Processes | Unknown | N/A | |
| The Role of Pretrained Representations for the OOD Generalization of RL Agents | Unknown | N/A | |
| Differentially Private Fine-tuning of Language Models | Unknown | N/A | |
| Anomaly Transformer: Time Series Anomaly Detection with Association Discrepancy | Unknown | N/A | |
| QDrop: Randomly Dropping Quantization for Extremely Low-bit Post-Training Quantization | Unknown | N/A | |
| Resonance in Weight Space: Covariate Shift Can Drive Divergence of SGD with Momentum | Unknown | N/A | |
| Stiffness-aware neural network for learning Hamiltonian systems | Unknown | N/A | |
| Topological Graph Neural Networks | Unknown | N/A | |
| CoordX: Accelerating Implicit Neural Representation with a Split MLP Architecture | Unknown | N/A | |
| Towards Deployment-Efficient Reinforcement Learning: Lower Bound and Optimality | Unknown | N/A | |
| Meta Learning Low Rank Covariance Factors for Energy Based Deterministic Uncertainty | Unknown | N/A | |
| Learning with Noisy Labels Revisited: A Study Using Real-World Human Annotations | Unknown | N/A | |
| Sample and Computation Redistribution for Efficient Face Detection | Unknown | N/A | |
| How to Robustify Black-Box ML Models? A Zeroth-Order Optimization Perspective | Unknown | N/A | |
| Closed-form Sample Probing for Learning Generative Models in Zero-shot Learning | Unknown | N/A | |
| Dive Deeper Into Integral Pose Regression | Unknown | N/A | |
| MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Transformer | Unknown | N/A | |
| On the Convergence of the Monte Carlo Exploring Starts Algorithm for Reinforcement Learning | Unknown | N/A | |
| SHINE: SHaring the INverse Estimate from the forward pass for bi-level optimization and implicit models | Unknown | N/A | |
| Leveraging Automated Unit Tests for Unsupervised Code Translation | Unknown | N/A | |
| Explainable GNN-Based Models over Knowledge Graphs | Unknown | N/A | |
| Byzantine-Robust Learning on Heterogeneous Datasets via Bucketing | Unknown | N/A | |
| DR3: Value-Based Deep Reinforcement Learning Requires Explicit Regularization | Unknown | N/A | |
| SDEdit: Guided Image Synthesis and Editing with Stochastic Differential Equations | Unknown | N/A | |
| IntSGD: Adaptive Floatless Compression of Stochastic Gradients | Unknown | N/A | |
| Understanding the Role of Self Attention for Efficient Speech Recognition | Unknown | N/A | |
| PAC-Bayes Information Bottleneck | Unknown | N/A | |
| When should agents explore? | Unknown | N/A | |
| Understanding and Leveraging Overparameterization in Recursive Value Estimation | Unknown | N/A | |
| Neural Collapse Under MSE Loss: Proximity to and Dynamics on the Central Path | Unknown | N/A | |
| Hindsight: Posterior-guided training of retrievers for improved open-ended generation | Unknown | N/A | |
| Imbedding Deep Neural Networks | Unknown | N/A | |
| PF-GNN: Differentiable particle filtering based approximation of universal graph representations | Unknown | N/A | |
| Mirror Descent Policy Optimization | Unknown | N/A | |
| Unrolling PALM for Sparse Semi-Blind Source Separation | Unknown | N/A | |
| MoReL: Multi-omics Relational Learning | Unknown | N/A | |
| Sparse DETR: Efficient End-to-End Object Detection with Learnable Sparsity | Unknown | N/A | |
| Hybrid Local SGD for Federated Learning with Heterogeneous Communications | Unknown | N/A | |
| How to Train Your MAML to Excel in Few-Shot Classification | Unknown | N/A | |
| Learning more skills through optimistic exploration | Unknown | N/A | |
| Understanding and Preventing Capacity Loss in Reinforcement Learning | Unknown | N/A | |
| Dealing with Non-Stationarity in MARL via Trust-Region Decomposition | Unknown | N/A | |
| Contact Points Discovery for Soft-Body Manipulations with Differentiable Physics | Unknown | N/A | |
| Cross-Domain Imitation Learning via Optimal Transport | Unknown | N/A | |
| Prototypical Contrastive Predictive Coding | Unknown | N/A | |
| LoRA: Low-Rank Adaptation of Large Language Models | Unknown | N/A | |
| Optimal ANN-SNN Conversion for High-accuracy and Ultra-low-latency Spiking Neural Networks | Unknown | N/A | |
| Train Short, Test Long: Attention with Linear Biases Enables Input Length Extrapolation | Unknown | N/A | |
| Axiomatic Explanations for Visual Search, Retrieval, and Similarity Learning | Unknown | N/A | |
| Optimizing Neural Networks with Gradient Lexicase Selection | Unknown | N/A | |
| Supervision Exists Everywhere: A Data Efficient Contrastive Language-Image Pre-training Paradigm | Unknown | N/A | |
| Value Gradient weighted Model-Based Reinforcement Learning | Unknown | N/A | |
| Learning Value Functions from Undirected State-only Experience | Unknown | N/A | |
| Vector-quantized Image Modeling with Improved VQGAN | Unknown | N/A | |
| Toward Faithful Case-based Reasoning through Learning Prototypes in a Nearest Neighbor-friendly Space | Unknown | N/A | |
| Near-optimal Offline Reinforcement Learning with Linear Representation: Leveraging Variance Information with Pessimism | Unknown | N/A | |
| Transformers Can Do Bayesian Inference | Unknown | N/A | |
| Filtered-CoPhy: Unsupervised Learning of Counterfactual Physics in Pixel Space | Unknown | N/A | |
| Network Augmentation for Tiny Deep Learning | Unknown | N/A | |
| Optimal Transport for Causal Discovery | Unknown | N/A | |
| GLASS: GNN with Labeling Tricks for Subgraph Representation Learning | Unknown | N/A | |
| New Insights on Reducing Abrupt Representation Change in Online Continual Learning | Unknown | N/A | |
| Decoupled Adaptation for Cross-Domain Object Detection | Unknown | N/A | |
| No Parameters Left Behind: Sensitivity Guided Adaptive Learning Rate for Training Large Transformer Models | Unknown | N/A | |
| Visual Correspondence Hallucination | Unknown | N/A | |
| Bayesian Framework for Gradient Leakage | Unknown | N/A | |
| Non-Transferable Learning: A New Approach for Model Ownership Verification and Applicability Authorization | Unknown | N/A | |
| Learning curves for continual learning in neural networks: Self-knowledge transfer and forgetting | Unknown | N/A | |
| How many degrees of freedom do we need to train deep networks: a loss landscape perspective | Unknown | N/A | |
| Effective Model Sparsification by Scheduled Grow-and-Prune Methods | Unknown | N/A | |
| Finite-Time Convergence and Sample Complexity of Multi-Agent Actor-Critic Reinforcement Learning with Average Reward | Unknown | N/A | |
| Energy-Inspired Molecular Conformation Optimization | Unknown | N/A | |
| An Explanation of In-context Learning as Implicit Bayesian Inference | Unknown | N/A | |
| The Close Relationship Between Contrastive Learning and Meta-Learning | Unknown | N/A | |
| Learning Transferable Reward for Query Object Localization with Policy Adaptation | Unknown | N/A | |
| Exposing the Implicit Energy Networks behind Masked Language Models via Metropolis-Hastings | Unknown | N/A | |
| You Mostly Walk Alone: Analyzing Feature Attribution in Trajectory Prediction | Unknown | N/A | |
| Shuffle Private Stochastic Convex Optimization | Unknown | N/A | |
| Maximum Entropy RL (Provably) Solves Some Robust RL Problems | Unknown | N/A | |
| Wiring Up Vision: Minimizing Supervised Synaptic Updates Needed to Produce a Primate Ventral Stream | Unknown | N/A | |
| Autonomous Learning of Object-Centric Abstractions for High-Level Planning | Unknown | N/A | |
| Fortuitous Forgetting in Connectionist Networks | Unknown | N/A | |
| A Generalized Weighted Optimization Method for Computational Learning and Inversion | Unknown | N/A | |
| Learning Synthetic Environments and Reward Networks for Reinforcement Learning | Unknown | N/A | |
| What Makes Better Augmentation Strategies? Augment Difficult but Not too Different | Unknown | N/A | |
| Embedded-model flows: Combining the inductive biases of model-free deep learning and explicit probabilistic modeling | Unknown | N/A | |
| FILM: Following Instructions in Language with Modular Methods | Unknown | N/A | |
| T-WaveNet: A Tree-Structured Wavelet Neural Network for Time Series Signal Analysis | Unknown | N/A | |
| Top-label calibration and multiclass-to-binary reductions | Unknown | N/A | |
| PEARL: Data Synthesis via Private Embeddings and Adversarial Reconstruction Learning | Unknown | N/A | |
| Leveraging unlabeled data to predict out-of-distribution performance | Unknown | N/A | |
| On the Limitations of Multimodal VAEs | Unknown | N/A | |
| Maximum n-times Coverage for Vaccine Design | Unknown | N/A | |
| Does your graph need a confidence boost? Convergent boosted smoothing on graphs with tabular node features | Unknown | N/A | |
| TRAIL: Near-Optimal Imitation Learning with Suboptimal Data | Unknown | N/A | |
| The Geometry of Memoryless Stochastic Policy Optimization in Infinite-Horizon POMDPs | Unknown | N/A | |
| A Theoretical Analysis on Feature Learning in Neural Networks: Emergence from Inputs and Advantage over Fixed Features | Unknown | N/A | |
| Recursive Disentanglement Network | Unknown | N/A | |
| Givens Coordinate Descent Methods for Rotation Matrix Learning in Trainable Embedding Indexes | Unknown | N/A | |
| Multi-objective Optimization by Learning Space Partition | Unknown | N/A | |
| Neural Structured Prediction for Inductive Node Classification | Unknown | N/A | |
| Knowledge Removal in Sampling-based Bayesian Inference | Unknown | N/A | |
| Einops: Clear and Reliable Tensor Manipulations with Einstein-like Notation | Unknown | N/A | |
| Language-biased image classification: evaluation based on semantic representations | Unknown | N/A | |
| What’s Wrong with Deep Learning in Tree Search for Combinatorial Optimization | Unknown | N/A | |
| Convergent and Efficient Deep Q Learning Algorithm | Unknown | N/A | |
| Constrained Policy Optimization via Bayesian World Models | Unknown | N/A | |
| Quadtree Attention for Vision Transformers | Unknown | N/A | |
| FastSHAP: Real-Time Shapley Value Estimation | Unknown | N/A | |
| Robust and Scalable SDE Learning: A Functional Perspective | Unknown | N/A | |
| Curvature-Guided Dynamic Scale Networks for Multi-View Stereo | Unknown | N/A | |
| StyleAlign: Analysis and Applications of Aligned StyleGAN Models | Unknown | N/A | |
| Autonomous Reinforcement Learning: Formalism and Benchmarking | Unknown | N/A | |
| Understanding over-squashing and bottlenecks on graphs via curvature | Unknown | N/A | |
| Exploring extreme parameter compression for pre-trained language models | Unknown | N/A | |
| Finding Biological Plausibility for Adversarially Robust Features via Metameric Tasks | Unknown | N/A | |
| Pix2seq: A Language Modeling Framework for Object Detection | Unknown | N/A | |
| AEVA: Black-box Backdoor Detection Using Adversarial Extreme Value Analysis | Unknown | N/A | |
| Scale Efficiently: Insights from Pretraining and Finetuning Transformers | Unknown | N/A | |
| Noisy Feature Mixup | Unknown | N/A | |
| Pretrained Language Model in Continual Learning: A Comparative Study | Unknown | N/A | |
| Fooling Explanations in Text Classifiers | Unknown | N/A | |
| Large Language Models Can Be Strong Differentially Private Learners | Unknown | N/A | |
| Data Poisoning Won’t Save You From Facial Recognition | Unknown | N/A | |
| Adaptive Wavelet Transformer Network for 3D Shape Representation Learning | Unknown | N/A | |
| Evidential Turing Processes | Unknown | N/A | |
| Invariant Causal Representation Learning for Out-of-Distribution Generalization | Unknown | N/A | |
| Enhancing Cross-lingual Transfer by Manifold Mixup | Unknown | N/A | |
| Why Propagate Alone? Parallel Use of Labels and Features on Graphs | Unknown | N/A | |
| Progressive Distillation for Fast Sampling of Diffusion Models | Unknown | N/A | |
| Iterated Reasoning with Mutual Information in Cooperative and Byzantine Decentralized Teaming | Unknown | N/A | |
| How Low Can We Go: Trading Memory for Error in Low-Precision Training | Unknown | N/A | |
| Towards Deepening Graph Neural Networks: A GNTK-based Optimization Perspective | Unknown | N/A | |
| Active Hierarchical Exploration with Stable Subgoal Representation Learning | Unknown | N/A | |
| SOSP: Efficiently Capturing Global Correlations by Second-Order Structured Pruning | Unknown | N/A | |
| Permutation-Based SGD: Is Random Optimal? | Unknown | N/A | |
| Towards Continual Knowledge Learning of Language Models | Unknown | N/A | |
| GeoDiff: A Geometric Diffusion Model for Molecular Conformation Generation | Unknown | N/A | |
| Learning to Complete Code with Sketches | Unknown | N/A | |
| Dynamic Token Normalization improves Vision Transformers | Unknown | N/A | |
| Evaluation Metrics for Graph Generative Models: Problems, Pitfalls, and Practical Solutions | Unknown | N/A | |
| Learning Discrete Structured Variational Auto-Encoder using Natural Evolution Strategies | Unknown | N/A | |
| Memory Replay with Data Compression for Continual Learning | Unknown | N/A | |
| VAT-Mart: Learning Visual Action Trajectory Proposals for Manipulating 3D ARTiculated Objects | Unknown | N/A | |
| Goal-Directed Planning via Hindsight Experience Replay | Unknown | N/A | |
| NodePiece: Compositional and Parameter-Efficient Representations of Large Knowledge Graphs | Unknown | N/A | |
| On the Convergence of mSGD and AdaGrad for Stochastic Optimization | Unknown | N/A | |
| Hybrid Random Features | Unknown | N/A | |
| Domino: Discovering Systematic Errors with Cross-Modal Embeddings | Unknown | N/A | |
| Learned Simulators for Turbulence | Unknown | N/A | |
| Scalable Sampling for Nonsymmetric Determinantal Point Processes | Unknown | N/A | |
| Deep AutoAugment | Unknown | N/A | |
| Towards Understanding the Robustness Against Evasion Attack on Categorical Data | Unknown | N/A | |
| Inverse Online Learning: Understanding Non-Stationary and Reactionary Policies | Unknown | N/A | |
| Diffusion-Based Voice Conversion with Fast Maximum Likelihood Sampling Scheme | Unknown | N/A | |
| Auto-Transfer: Learning to Route Transferable Representations | Unknown | N/A | |
| Hierarchical Few-Shot Imitation with Skill Transition Models | Unknown | N/A | |
| How Attentive are Graph Attention Networks? | Unknown | N/A | |
| FairCal: Fairness Calibration for Face Verification | Unknown | N/A | |
| Provably convergent quasistatic dynamics for mean-field two-player zero-sum games | Unknown | N/A | |
| ProtoRes: Proto-Residual Network for Pose Authoring via Learned Inverse Kinematics | Unknown | N/A | |
| Sparse Attention with Learning to Hash | Unknown | N/A | |
| Trans-Encoder: Unsupervised sentence-pair modelling through self- and mutual-distillations | Unknown | N/A | |
| AdaRL: What, Where, and How to Adapt in Transfer Reinforcement Learning | Unknown | N/A | |
| Optimizer Amalgamation | Unknown | N/A | |
| Entroformer: A Transformer-based Entropy Model for Learned Image Compression | Unknown | N/A | |
| Continual Learning with Recursive Gradient Optimization | Unknown | N/A | |
| HyAR: Addressing Discrete-Continuous Action Reinforcement Learning via Hybrid Action Representation | Unknown | N/A | |
| OntoProtein: Protein Pretraining With Gene Ontology Embedding | Unknown | N/A | |
| Permutation Compressors for Provably Faster Distributed Nonconvex Optimization | Unknown | N/A | |
| CrossBeam: Learning to Search in Bottom-Up Program Synthesis | Unknown | N/A | |
| RvS: What is Essential for Offline RL via Supervised Learning? | Unknown | N/A | |
| Learning Continuous Environment Fields via Implicit Functions | Unknown | N/A | |
| Fast AdvProp | Unknown | N/A | |
| Revisiting flow generative models for Out-of-distribution detection | Unknown | N/A | |
| Discovering and Explaining the Representation Bottleneck of DNNs | Unknown | N/A | |
| SURF: Semi-supervised Reward Learning with Data Augmentation for Feedback-efficient Preference-based Reinforcement Learning | Unknown | N/A | |
| Parallel Training of GRU Networks with a Multi-Grid Solver for Long Sequences | Unknown | N/A | |
| Poisoning and Backdooring Contrastive Learning | Unknown | N/A | |
| iFlood: A Stable and Effective Regularizer | Unknown | N/A | |
| Node Feature Extraction by Self-Supervised Multi-scale Neighborhood Prediction | Unknown | N/A | |
| Prospect Pruning: Finding Trainable Weights at Initialization using Meta-Gradients | Unknown | N/A | |
| When, Why, and Which Pretrained GANs Are Useful? | Unknown | N/A | |
| Near-Optimal Reward-Free Exploration for Linear Mixture MDPs with Plug-in Solver | Unknown | N/A | |
| RelaxLoss: Defending Membership Inference Attacks without Losing Utility | Unknown | N/A | |
| Compositional Attention: Disentangling Search and Retrieval | Unknown | N/A | |
| Anomaly Detection for Tabular Data with Internal Contrastive Learning | Unknown | N/A | |
| Variational Inference for Discriminative Learning with Generative Modeling of Feature Incompletion | Unknown | N/A | |
| Stability Regularization for Discrete Representation Learning | Unknown | N/A | |
| Fixed Neural Network Steganography: Train the images, not the network | Unknown | N/A | |
| Bregman Gradient Policy Optimization | Unknown | N/A | |
| X-model: Improving Data Efficiency in Deep Learning with A Minimax Model | Unknown | N/A | |
| Low-Budget Active Learning via Wasserstein Distance: An Integer Programming Approach | Unknown | N/A | |
| Target-Side Input Augmentation for Sequence to Sequence Generation | Unknown | N/A | |
| Graph Condensation for Graph Neural Networks | Unknown | N/A | |
| GreaseLM: Graph REASoning Enhanced Language Models | Unknown | N/A | |
| Provably Filtering Exogenous Distractors using Multistep Inverse Dynamics | Unknown | N/A | |
| MonoDistill: Learning Spatial Features for Monocular 3D Object Detection | Unknown | N/A | |
| Who Is the Strongest Enemy? Towards Optimal and Efficient Evasion Attacks in Deep RL | Unknown | N/A | |
| Language-driven Semantic Segmentation | Unknown | N/A | |
| Constructing Orthogonal Convolutions in an Explicit Manner | Unknown | N/A | |
| HTLM: Hyper-Text Pre-Training and Prompting of Language Models | Unknown | N/A | |
| Pyraformer: Low-Complexity Pyramidal Attention for Long-Range Time Series Modeling and Forecasting | Unknown | N/A | |
| BadPre: Task-agnostic Backdoor Attacks to Pre-trained NLP Foundation Models | Unknown | N/A | |
| Mind the Gap: Domain Gap Control for Single Shot Domain Adaptation for Generative Adversarial Networks | Unknown | N/A | |
| Neural Methods for Logical Reasoning over Knowledge Graphs | Unknown | N/A | |
| Data-Driven Offline Optimization for Architecting Hardware Accelerators | Unknown | N/A | |
| Towards Better Understanding and Better Generalization of Low-shot Classification in Histology Images with Contrastive Learning | Unknown | N/A | |
| Post-Training Detection of Backdoor Attacks for Two-Class and Multi-Attack Scenarios | Unknown | N/A | |
| Hidden Parameter Recurrent State Space Models For Changing Dynamics Scenarios | Unknown | N/A | |
| Learning Versatile Neural Architectures by Propagating Network Codes | Unknown | N/A | |
| A Relational Intervention Approach for Unsupervised Dynamics Generalization in Model-Based Reinforcement Learning | Unknown | N/A | |
| Continuously Discovering Novel Strategies via Reward-Switching Policy Optimization | Unknown | N/A | |
| Dynamics-Aware Comparison of Learned Reward Functions | Unknown | N/A | |
| In a Nutshell, the Human Asked for This: Latent Goals for Following Temporal Specifications | Unknown | N/A | |
| Patch-Fool: Are Vision Transformers Always Robust Against Adversarial Perturbations? | Unknown | N/A | |
| Back2Future: Leveraging Backfill Dynamics for Improving Real-time Predictions in Future | Unknown | N/A | |
| Direct then Diffuse: Incremental Unsupervised Skill Discovery for State Covering and Goal Reaching | Unknown | N/A | |
| Out-of-distribution Generalization in the Presence of Nuisance-Induced Spurious Correlations | Unknown | N/A | |
| Sound and Complete Neural Network Repair with Minimality and Locality Guarantees | Unknown | N/A | |
| Discriminative Similarity for Data Clustering | Unknown | N/A | |
| Generalized Demographic Parity for Group Fairness | Unknown | N/A | |
| Blaschke Product Neural Networks (BPNN): A Physics-Infused Neural Network for Phase Retrieval of Meromorphic Functions | Unknown | N/A | |
| StyleNeRF: A Style-based 3D Aware Generator for High-resolution Image Synthesis | Unknown | N/A | |
| Distribution Compression in Near-Linear Time | Unknown | N/A | |
| On the Connection between Local Attention and Dynamic Depth-wise Convolution | Unknown | N/A | |
| Explanations of Black-Box Models based on Directional Feature Interactions | Unknown | N/A | |
| Language model compression with weighted low-rank factorization | Unknown | N/A | |
| Prototype memory and attention mechanisms for few shot image generation | Unknown | N/A | |
| Surrogate Gap Minimization Improves Sharpness-Aware Training | Unknown | N/A | |
| Optimal Representations for Covariate Shift | Unknown | N/A | |
| Bundle Networks: Fiber Bundles, Local Trivializations, and a Generative Approach to Exploring Many-to-one Maps | Unknown | N/A | |
| Anytime Dense Prediction with Confidence Adaptivity | Unknown | N/A | |
| Trigger Hunting with a Topological Prior for Trojan Detection | Unknown | N/A | |
| Graph-Guided Network for Irregularly Sampled Multivariate Time Series | Unknown | N/A | |
| SketchODE: Learning neural sketch representation in continuous time | Unknown | N/A | |
| Convergent Graph Solvers | Unknown | N/A | |
| MIDI-DDSP: Detailed Control of Musical Performance via Hierarchical Modeling | Unknown | N/A | |
| From Intervention to Domain Transportation: A Novel Perspective to Optimize Recommendation | Unknown | N/A | |
| Concurrent Adversarial Learning for Large-Batch Training | Unknown | N/A | |
| CrossFormer: A Versatile Vision Transformer Hinging on Cross-scale Attention | Unknown | N/A | |
| Properties from mechanisms: an equivariance perspective on identifiable representation learning | Unknown | N/A | |
| A Unified Contrastive Energy-based Model for Understanding the Generative Ability of Adversarial Training | Unknown | N/A | |
| Likelihood Training of Schrödinger Bridge using Forward-Backward SDEs Theory | Unknown | N/A | |
| On the Pitfalls of Analyzing Individual Neurons in Language Models | Unknown | N/A | |
| Graph Neural Networks with Learnable Structural and Positional Representations | Unknown | N/A | |
| Step-unrolled Denoising Autoencoders for Text Generation | Unknown | N/A | |
| Sparse Communication via Mixed Distributions | Unknown | N/A | |
| Chemical-Reaction-Aware Molecule Representation Learning | Unknown | N/A | |
| CrowdPlay: Crowdsourcing Human Demonstrations for Offline Learning | Unknown | N/A | |
| Adversarial Retriever-Ranker for Dense Text Retrieval | Unknown | N/A | |
| Tuformer: Data-driven Design of Transformers for Improved Generalization or Efficiency | Unknown | N/A | |
| Handling Distribution Shifts on Graphs: An Invariance Perspective | Unknown | N/A | |
| Effect of scale on catastrophic forgetting in neural networks | Unknown | N/A | |
| Fast Differentiable Matrix Square Root | Unknown | N/A | |
| Topologically Regularized Data Embeddings | Unknown | N/A | |
| Neural Variational Dropout Processes | Unknown | N/A | |
| Learning Disentangled Representation by Exploiting Pretrained Generative Models: A Contrastive Learning View | Unknown | N/A | |
| Privacy Implications of Shuffling | Unknown | N/A | |
| Transform2Act: Learning a Transform-and-Control Policy for Efficient Agent Design | Unknown | N/A | |
| Doubly Adaptive Scaled Algorithm for Machine Learning Using Second-Order Information | Unknown | N/A | |
| Proof Artifact Co-Training for Theorem Proving with Language Models | Unknown | N/A | |
| Non-Parallel Text Style Transfer with Self-Parallel Supervision | Unknown | N/A | |
| A global convergence theory for deep ReLU implicit networks via over-parameterization | Unknown | N/A | |
| R5: Rule Discovery with Reinforced and Recurrent Relational Reasoning | Unknown | N/A | |
| Attacking deep networks with surrogate-based adversarial black-box methods is easy | Unknown | N/A | |
| Revisiting Over-smoothing in BERT from the Perspective of Graph | Unknown | N/A | |
| Neural Network Approximation based on Hausdorff distance of Tropical Zonotopes | Unknown | N/A | |
| Anti-Oversmoothing in Deep Vision Transformers via the Fourier Domain Analysis: From Theory to Practice | Unknown | N/A | |
| VICReg: Variance-Invariance-Covariance Regularization for Self-Supervised Learning | Unknown | N/A | |
| On-Policy Model Errors in Reinforcement Learning | Unknown | N/A | |
| Signing the Supermask: Keep, Hide, Invert | Unknown | N/A | |
| Diverse Client Selection for Federated Learning via Submodular Maximization | Unknown | N/A | |
| Unifying Likelihood-free Inference with Black-box Optimization and Beyond | Unknown | N/A | |
| COptiDICE: Offline Constrained Reinforcement Learning via Stationary Distribution Correction Estimation | Unknown | N/A | |
| Transformer Embeddings of Irregularly Spaced Events and Their Participants | Unknown | N/A | |
| ViDT: An Efficient and Effective Fully Transformer-based Object Detector | Unknown | N/A | |
| Network Insensitivity to Parameter Noise via Parameter Attack During Training | Unknown | N/A | |
| Rethinking Adversarial Transferability from a Data Distribution Perspective | Unknown | N/A | |
| Learning the Dynamics of Physical Systems from Sparse Observations with Finite Element Networks | Unknown | N/A | |
| Sequential Reptile: Inter-Task Gradient Alignment for Multilingual Learning | Unknown | N/A | |
| Neural graphical modelling in continuous-time: consistency guarantees and algorithms | Unknown | N/A | |
| Capturing Structural Locality in Non-parametric Language Models | Unknown | N/A | |
| Distilling GANs with Style-Mixed Triplets for X2I Translation with Limited Data | Unknown | N/A | |
| EE-Net: Exploitation-Exploration Neural Networks in Contextual Bandits | Unknown | N/A | |
| On the Existence of Universal Lottery Tickets | Unknown | N/A | |
| Sampling with Mirrored Stein Operators | Unknown | N/A | |
| A Statistical Framework for Efficient Out of Distribution Detection in Deep Neural Networks | Unknown | N/A | |
| Discovering Latent Concepts Learned in BERT | Unknown | N/A | |
| Provable Adaptation across Multiway Domains via Representation Learning | Unknown | N/A | |
| Temporal Alignment Prediction for Supervised Representation Learning and Few-Shot Sequence Classification | Unknown | N/A | |
| Group equivariant neural posterior estimation | Unknown | N/A | |
| Source-Free Adaptation to Measurement Shift via Bottom-Up Feature Restoration | Unknown | N/A | |
| Phase Collapse in Neural Networks | Unknown | N/A | |
| Federated Learning from Only Unlabeled Data with Class-conditional-sharing Clients | Unknown | N/A | |
| Message Passing Neural PDE Solvers | Unknown | N/A | |
| It Takes Two to Tango: Mixup for Deep Metric Learning | Unknown | N/A | |
| Communication-Efficient Actor-Critic Methods for Homogeneous Markov Games | Unknown | N/A | |
| Self-supervised Learning is More Robust to Dataset Imbalance | Unknown | N/A | |
| Contrastive Clustering to Mine Pseudo Parallel Data for Unsupervised Translation | Unknown | N/A | |
| A fast and accurate splitting method for optimal transport: analysis and implementation | Unknown | N/A | |
| Task-Induced Representation Learning | Unknown | N/A | |
| Triangle and Four Cycle Counting with Predictions in Graph Streams | Unknown | N/A | |
| ClimateGAN: Raising Climate Change Awareness by Generating Images of Floods | Unknown | N/A | |
| How unlabeled data improve generalization in self-training? A one-hidden-layer theoretical analysis | Unknown | N/A | |
| Expressiveness and Approximation Properties of Graph Neural Networks | Unknown | N/A | |
| How to deal with missing data in supervised deep learning? | Unknown | N/A | |
| Possibility Before Utility: Learning And Using Hierarchical Affordances | Unknown | N/A | |
| Autoregressive Diffusion Models | Unknown | N/A | |
| Expressivity of Emergent Languages is a Trade-off between Contextual Complexity and Unpredictability | Unknown | N/A | |
| Actor-Critic Policy Optimization in a Large-Scale Imperfect-Information Game | Unknown | N/A | |
| Fast Regression for Structured Inputs | Unknown | N/A | |
| Learning Efficient Online 3D Bin Packing on Packing Configuration Trees | Unknown | N/A | |
| Learning Long-Term Reward Redistribution via Randomized Return Decomposition | Unknown | N/A | |
| Equivariant and Stable Positional Encoding for More Powerful Graph Neural Networks | Unknown | N/A | |
| Learning Strides in Convolutional Neural Networks | Unknown | N/A | |
| Data-Efficient Graph Grammar Learning for Molecular Generation | Unknown | N/A | |
| Large Learning Rate Tames Homogeneity: Convergence and Balancing Effect | Unknown | N/A | |
| Certified Robustness for Deep Equilibrium Models via Interval Bound Propagation | Unknown | N/A | |
| On the Generalization of Models Trained with SGD: Information-Theoretic Bounds and Implications | Unknown | N/A | |
| ARTEMIS: Attention-based Retrieval with Text-Explicit Matching and Implicit Similarity | Unknown | N/A | |
| Learning Fast Samplers for Diffusion Models by Differentiating Through Sample Quality | Unknown | N/A | |
| Explaining Point Processes by Learning Interpretable Temporal Logic Rules | Unknown | N/A | |
| On Evaluation Metrics for Graph Generative Models | Unknown | N/A | |
| Probabilistic Implicit Scene Completion | Unknown | N/A | |
| Training Structured Neural Networks Through Manifold Identification and Variance Reduction | Unknown | N/A | |
| Natural Language Descriptions of Deep Features | Unknown | N/A | |
| Learning Pruning-Friendly Networks via Frank-Wolfe: One-Shot, Any-Sparsity, And No Retraining | Unknown | N/A | |
| Learning Vision-Guided Quadrupedal Locomotion End-to-End with Cross-Modal Transformers | Unknown | N/A | |
| Visual Representation Learning Does Not Generalize Strongly Within the Same Domain | Unknown | N/A | |
| miniF2F: a cross-system benchmark for formal Olympiad-level mathematics | Unknown | N/A | |
| Controlling Directions Orthogonal to a Classifier | Unknown | N/A | |
| Pareto Policy Pool for Model-based Offline Reinforcement Learning | Unknown | N/A | |
| Learning a subspace of policies for online adaptation in Reinforcement Learning | Unknown | N/A | |
| Neural Parameter Allocation Search | Unknown | N/A | |
| A Unified Wasserstein Distributional Robustness Framework for Adversarial Training | Unknown | N/A | |
| Fair Normalizing Flows | Unknown | N/A | |
| Self-Joint Supervised Learning | Unknown | N/A | |
| BiBERT: Accurate Fully Binarized BERT | Unknown | N/A | |
| LIGS: Learnable Intrinsic-Reward Generation Selection for Multi-Agent Learning | Unknown | N/A | |
| An Unconstrained Layer-Peeled Perspective on Neural Collapse | Unknown | N/A | |
| Hierarchical Variational Memory for Few-shot Learning Across Domains | Unknown | N/A | |
| Scarf: Self-Supervised Contrastive Learning using Random Feature Corruption | Unknown | N/A | |
| What Happens after SGD Reaches Zero Loss? A Mathematical Framework | Unknown | N/A | |
| Symbolic Learning to Optimize: Towards Interpretability and Scalability | Unknown | N/A | |
| PriorGrad: Improving Conditional Denoising Diffusion Models with Data-Dependent Adaptive Prior | Unknown | N/A | |
| Revisit Kernel Pruning with Lottery Regulated Grouped Convolutions | Unknown | N/A | |
| Temporal Efficient Training of Spiking Neural Network via Gradient Re-weighting | Unknown | N/A | |
| Improving Non-Autoregressive Translation Models Without Distillation | Unknown | N/A | |
| Evaluating Disentanglement of Structured Representations | Unknown | N/A | |
| Sqrt(d) Dimension Dependence of Langevin Monte Carlo | Unknown | N/A | |
| Neural Solvers for Fast and Accurate Numerical Optimal Control | Unknown | N/A | |
| Comparing Distributions by Measuring Differences that Affect Decision Making | Unknown | N/A | |
| Graph-less Neural Networks: Teaching Old MLPs New Tricks Via Distillation | Unknown | N/A | |
| Score-Based Generative Modeling with Critically-Damped Langevin Diffusion | Unknown | N/A | |
| Learning by Directional Gradient Descent | Unknown | N/A | |
| Spatial Graph Attention and Curiosity-driven Policy for Antiviral Drug Discovery | Unknown | N/A | |
| An Information Fusion Approach to Learning with Instance-Dependent Label Noise | Unknown | N/A | |
| Joint Shapley values: a measure of joint feature importance | Unknown | N/A | |
| Weighted Training for Cross-Task Learning | Unknown | N/A | |
| Adversarial Unlearning of Backdoors via Implicit Hypergradient | Unknown | N/A | |
| Pareto Set Learning for Neural Multi-Objective Combinatorial Optimization | Unknown | N/A | |
| Ada-NETS: Face Clustering via Adaptive Neighbour Discovery in the Structure Space | Unknown | N/A | |
| Learning Representation from Neural Fisher Kernel with Low-rank Approximation | Unknown | N/A | |
| Actor-critic is implicitly biased towards high entropy optimal policies | Unknown | N/A | |
| Self-Supervised Inference in State-Space Models | Unknown | N/A | |
| Overcoming The Spectral Bias of Neural Value Approximation | Unknown | N/A | |
| Automatic Loss Function Search for Predict-Then-Optimize Problems with Strong Ranking Property | Unknown | N/A | |
| IFR-Explore: Learning Inter-object Functional Relationships in 3D Indoor Scenes | Unknown | N/A | |
| Learning to Extend Molecular Scaffolds with Structural Motifs | Unknown | N/A | |
| Lossless Compression with Probabilistic Circuits | Unknown | N/A | |
| SGD Can Converge to Local Maxima | Unknown | N/A | |
| Representing Mixtures of Word Embeddings with Mixtures of Topic Embeddings | Unknown | N/A | |
| Meta-Imitation Learning by Watching Video Demonstrations | Unknown | N/A | |
| WeakM3D: Towards Weakly Supervised Monocular 3D Object Detection | Unknown | N/A | |
| Differentiable DAG Sampling | Unknown | N/A | |
| Hyperparameter Tuning with Renyi Differential Privacy | Unknown | N/A | |
| Understanding approximate and unrolled dictionary learning for pattern recovery | Unknown | N/A | |
| Constraining Linear-chain CRFs to Regular Languages | Unknown | N/A | |
| Conditional Contrastive Learning with Kernel | Unknown | N/A | |
| Evading Adversarial Example Detection Defenses with Orthogonal Projected Gradient Descent | Unknown | N/A | |
| Imitation Learning by Reinforcement Learning | Unknown | N/A | |
| Multi-Agent MDP Homomorphic Networks | Unknown | N/A | |
| Spread Spurious Attribute: Improving Worst-group Accuracy with Spurious Attribute Estimation | Unknown | N/A | |
| Nonlinear ICA Using Volume-Preserving Transformations | Unknown | N/A | |
| Online Hyperparameter Meta-Learning with Hypergradient Distillation | Unknown | N/A | |
| Relating transformers to models and neural representations of the hippocampal formation | Unknown | N/A | |
| Ab-Initio Potential Energy Surfaces by Pairing GNNs with Neural Wave Functions | Unknown | N/A | |
| Bayesian Neural Network Priors Revisited | Unknown | N/A | |
| AdaAug: Learning Class- and Instance-adaptive Data Augmentation Policies | Unknown | N/A | |
| Procedural generalization by planning with self-supervised world models | Unknown | N/A | |
| EigenGame Unloaded: When playing games is better than optimizing | Unknown | N/A | |
| End-to-End Learning of Probabilistic Hierarchies on Graphs | Unknown | N/A | |
| Asymmetry Learning for Counterfactually-invariant Classification in OOD Tasks | Unknown | N/A | |
| Dropout Q-Functions for Doubly Efficient Reinforcement Learning | Unknown | N/A | |
| Promoting Saliency From Depth: Deep Unsupervised RGB-D Saliency Detection | Unknown | N/A | |
| DeSKO: Stability-Assured Robust Control with a Deep Stochastic Koopman Operator | Unknown | N/A | |
| Differentially Private Fractional Frequency Moments Estimation with Polylogarithmic Space | Unknown | N/A | |
| On the Optimal Memorization Power of ReLU Neural Networks | Unknown | N/A | |
| Model Agnostic Interpretability for Multiple Instance Learning | Unknown | N/A | |
| GNN-LM: Language Modeling based on Global Contexts via GNN | Unknown | N/A | |
| Fast Generic Interaction Detection for Model Interpretability and Compression | Unknown | N/A | |
| Towards Understanding the Data Dependency of Mixup-style Training | Unknown | N/A | |
| DEPTS: Deep Expansion Learning for Periodic Time Series Forecasting | Unknown | N/A | |
| Safe Neurosymbolic Learning with Differentiable Symbolic Execution | Unknown | N/A | |
| Gradient Importance Learning for Incomplete Observations | Unknown | N/A | |
| A Biologically Interpretable Graph Convolutional Network to Link Genetic Risk Pathways and Imaging Phenotypes of Disease | Unknown | N/A | |
| Equivariant Subgraph Aggregation Networks | Unknown | N/A | |
| CoBERL: Contrastive BERT for Reinforcement Learning | Unknown | N/A | |
| Online Target Q-learning with Reverse Experience Replay: Efficiently finding the Optimal Policy for Linear MDPs | Unknown | N/A | |
| Unsupervised Disentanglement with Tensor Product Representations on the Torus | Unknown | N/A | |
| Multi-Mode Deep Matrix and Tensor Factorization | Unknown | N/A | |
| Zero-Shot Self-Supervised Learning for MRI Reconstruction | Unknown | N/A | |
| NASPY: Automated Extraction of Automated Machine Learning Models | Unknown | N/A | |
| Learning Guarantees for Graph Convolutional Networks on the Stochastic Block Model | Unknown | N/A | |
| Should I Run Offline Reinforcement Learning or Behavioral Cloning? | Unknown | N/A | |
| Mention Memory: incorporating textual knowledge into Transformers through entity mention attention | Unknown | N/A | |
| Learning Features with Parameter-Free Layers | Unknown | N/A | |
| Multi-Critic Actor Learning: Teaching RL Policies to Act with Style | Unknown | N/A | |
| GradMax: Growing Neural Networks using Gradient Information | Unknown | N/A | |
| Critical Points in Quantum Generative Models | Unknown | N/A | |
| ComPhy: Compositional Physical Reasoning of Objects and Events from Videos | Unknown | N/A | |
| Transferable Adversarial Attack based on Integrated Gradients | Unknown | N/A | |
| DKM: Differentiable k-Means Clustering Layer for Neural Network Compression | Unknown | N/A | |
| Scale Mixtures of Neural Network Gaussian Processes | Unknown | N/A | |
| Reward Uncertainty for Exploration in Preference-based Reinforcement Learning | Unknown | N/A | |
| Beyond ImageNet Attack: Towards Crafting Adversarial Examples for Black-box Domains | Unknown | N/A | |
| Policy Smoothing for Provably Robust Reinforcement Learning | Unknown | N/A | |
| FlexConv: Continuous Kernel Convolutions With Differentiable Kernel Sizes | Unknown | N/A | |
| CKConv: Continuous Kernel Convolution For Sequential Data | Unknown | N/A | |
| RISP: Rendering-Invariant State Predictor with Differentiable Simulation and Rendering for Cross-Domain Parameter Estimation | Unknown | N/A | |
| On the Convergence of Certified Robust Training with Interval Bound Propagation | Unknown | N/A | |
| Rethinking Goal-Conditioned Supervised Learning and Its Connection to Offline RL | Unknown | N/A | |
| Analytic-DPM: an Analytic Estimate of the Optimal Reverse Variance in Diffusion Probabilistic Models | Unknown | N/A | |
| On Distributed Adaptive Optimization with Gradient Compression | Unknown | N/A | |
| Sequence Approximation using Feedforward Spiking Neural Network for Spatiotemporal Learning: Theory and Optimization Methods | Unknown | N/A | |
| Zero-CL: Instance and Feature decorrelation for negative-free symmetric contrastive learning | Unknown | N/A | |
| Salient ImageNet: How to discover spurious features in Deep Learning? | Unknown | N/A | |
| Differentiable Expectation-Maximization for Set Representation Learning | Unknown | N/A | |
| Offline Reinforcement Learning with Implicit Q-Learning | Unknown | N/A | |
| Generating Videos with Dynamics-aware Implicit Generative Adversarial Networks | Unknown | N/A | |
ICLR 2023
| Title | Author | PDF_Link | Code_URL |
|---|---|---|---|
| Contrastive Audio-Visual Masked Autoencoder | Unknown | N/A | |
| Fairness-aware Contrastive Learning with Partially Annotated Sensitive Attributes | Unknown | N/A | |
| Approximation and non-parametric estimation of functions over high-dimensional spheres via deep ReLU networks | Unknown | N/A | |
| Statistical Guarantees for Consensus Clustering | Unknown | N/A | |
| Simplifying Model-based RL: Learning Representations, Latent-space Models, and Policies with One Objective | Unknown | N/A | |
| Seeing Differently, Acting Similarly: Heterogeneously Observable Imitation Learning | Unknown | N/A | |
| Humanly Certifying Superhuman Classifiers | Unknown | N/A | |
| Write and Paint: Generative Vision-Language Models are Unified Modal Learners | Unknown | N/A | |
| A law of adversarial risk, interpolation, and label noise | Unknown | N/A | |
| Learning Low Dimensional State Spaces with Overparameterized Recurrent Neural Nets | Unknown | N/A | |
| Unsupervised 3D Object Learning through Neuron Activity aware Plasticity | Unknown | N/A | |
| Amortised Invariance Learning for Contrastive Self-Supervision | Unknown | N/A | |
| Expressive Monotonic Neural Networks | Unknown | N/A | |
| In-sample Actor Critic for Offline Reinforcement Learning | Unknown | N/A | |
| The Implicit Bias of Minima Stability in Multivariate Shallow ReLU Networks | Unknown | N/A | |
| Fake It Until You Make It: Towards Accurate Near-Distribution Novelty Detection | Unknown | N/A | |
| CktGNN: Circuit Graph Neural Network for Electronic Design Automation | Unknown | N/A | |
| Jointly Learning Visual and Auditory Speech Representations from Raw Data | Unknown | N/A | |
| Collaborative Pure Exploration in Kernel Bandit | Unknown | N/A | |
| Beyond Lipschitz: Sharp Generalization and Excess Risk Bounds for Full-Batch GD | Unknown | N/A | |
| DELTA: Degradation-Free Fully Test-Time Adaptation | Unknown | N/A | |
| Priors, Hierarchy, and Information Asymmetry for Skill Transfer in Reinforcement Learning | Unknown | N/A | |
| Provable Sim-to-real Transfer in Continuous Domain with Partial Observations | Unknown | N/A | |
| Diagnosing and Rectifying Vision Models using Language | Unknown | N/A | |
| Provably Efficient Risk-Sensitive Reinforcement Learning: Iterated CVaR and Worst Path | Unknown | N/A | |
| Nearly Minimax Optimal Offline Reinforcement Learning with Linear Function Approximation: Single-Agent MDP and Markov Game | Unknown | N/A | |
| DensePure: Understanding Diffusion Models for Adversarial Robustness | Unknown | N/A | |
| Designing BERT for Convolutional Networks: Sparse and Hierarchical Masked Modeling | Unknown | N/A | |
| Learning ReLU networks to high uniform accuracy is intractable | Unknown | N/A | |
| DiffuSeq: Sequence to Sequence Text Generation with Diffusion Models | Unknown | N/A | |
| Budgeted Training for Vision Transformer | Unknown | N/A | |
| Hyperbolic Deep Reinforcement Learning | Unknown | N/A | |
| Interpretable Debiasing of Vectorized Language Representations with Iterative Orthogonalization | Unknown | N/A | |
| Rethinking Symbolic Regression: Morphology and Adaptability in the Context of Evolutionary Algorithms | Unknown | N/A | |
| Sequential Gradient Coding For Straggler Mitigation | Unknown | N/A | |
| How gradient estimator variance and bias impact learning in neural networks | Unknown | N/A | |
| TempCLR: Temporal Alignment Representation with Contrastive Learning | Unknown | N/A | |
| Optimal Conservative Offline RL with General Function Approximation via Augmented Lagrangian | Unknown | N/A | |
| Distilling Model Failures as Directions in Latent Space | Unknown | N/A | |
| Equivariant Hypergraph Diffusion Neural Operators | Unknown | N/A | |
| Learning to Grow Pretrained Models for Efficient Transformer Training | Unknown | N/A | |
| Is Attention All That NeRF Needs? | Unknown | N/A | |
| NeRF-SOS: Any-View Self-supervised Object Segmentation on Complex Scenes | Unknown | N/A | |
| Function-space regularized Rényi divergences | Unknown | N/A | |
| Finding the Global Semantic Representation in GAN through Fréchet Mean | Unknown | N/A | |
| DexDeform: Dexterous Deformable Object Manipulation with Human Demonstrations and Differentiable Physics | Unknown | N/A | |
| Effective passive membership inference attacks in federated learning against overparameterized models | Unknown | N/A | |
| CoRTX: Contrastive Framework for Real-time Explanation | Unknown | N/A | |
| Diminishing Return of Value Expansion Methods in Model-Based Reinforcement Learning | Unknown | N/A | |
| A Unified Approach to Reinforcement Learning, Quantal Response Equilibria, and Two-Player Zero-Sum Games | Unknown | N/A | |
| Forward Super-Resolution: How Can GANs Learn Hierarchical Generative Models for Real-World Distributions | Unknown | N/A | |
| Moderate Coreset: A Universal Method of Data Selection for Real-world Data-efficient Deep Learning | Unknown | N/A | |
| Harnessing Out-Of-Distribution Examples via Augmenting Content and Style | Unknown | N/A | |
| Using Language to Extend to Unseen Domains | Unknown | N/A | |
| A Holistic View of Label Noise Transition Matrix in Deep Learning and Beyond | Unknown | N/A | |
| Softened Symbol Grounding for Neuro-symbolic Systems | Unknown | N/A | |
| Learning Math Reasoning from Self-Sampled Correct and Partially-Correct Solutions | Unknown | N/A | |
| Logical Message Passing Networks with One-hop Inference on Atomic Formulas | Unknown | N/A | |
| Transformers Learn Shortcuts to Automata | Unknown | N/A | |
| Noise-Robust De-Duplication at Scale | Unknown | N/A | |
| Guiding Energy-based Models via Contrastive Latent Variables | Unknown | N/A | |
| Equivariance-aware Architectural Optimization of Neural Networks | Unknown | N/A | |
| Information Plane Analysis for Dropout Neural Networks | Unknown | N/A | |
| UniMax: Fairer and More Effective Language Sampling for Large-Scale Multilingual Pretraining | Unknown | N/A | |
| Unsupervised Meta-learning via Few-shot Pseudo-supervised Contrastive Learning | Unknown | N/A | |
| STUNT: Few-shot Tabular Learning with Self-generated Tasks from Unlabeled Tables | Unknown | N/A | |
| HiT-MDP: Learning the SMDP option framework on MDPs with Hidden Temporal Embeddings | Unknown | N/A | |
| Improved Learning-augmented Algorithms for k-means and k-medians Clustering | Unknown | N/A | |
| Progressive Mix-Up for Few-Shot Supervised Multi-Source Domain Transfer | Unknown | N/A | |
| Does Deep Learning Learn to Abstract? A Systematic Probing Framework | Unknown | N/A | |
| RandProx: Primal-Dual Optimization Algorithms with Randomized Proximal Updates | Unknown | N/A | |
| GOGGLE: Generative Modelling for Tabular Data by Learning Relational Structure | Unknown | N/A | |
| Quantized Compressed Sensing with Score-Based Generative Models | Unknown | N/A | |
| Latent Neural ODEs with Sparse Bayesian Multiple Shooting | Unknown | N/A | |
| $\mathcal{O}$-GNN: incorporating ring priors into molecular modeling | Unknown | N/A | |
| Characterizing the spectrum of the NTK via a power series expansion | Unknown | N/A | |
| Self-Stabilization: The Implicit Bias of Gradient Descent at the Edge of Stability | Unknown | N/A | |
| FIFA: Making Fairness More Generalizable in Classifiers Trained on Imbalanced Data | Unknown | N/A | |
| Equivariant Shape-Conditioned Generation of 3D Molecules for Ligand-Based Drug Design | Unknown | N/A | |
| VIPeR: Provably Efficient Algorithm for Offline RL with Neural Function Approximation | Unknown | N/A | |
| $\rm A^2Q$: Aggregation-Aware Quantization for Graph Neural Networks | Unknown | N/A | |
| Exponential Generalization Bounds with Near-Optimal Rates for $L_q$-Stable Algorithms | Unknown | N/A | |
| Learning to Induce Causal Structure | Unknown | N/A | |
| FoSR: First-order spectral rewiring for addressing oversquashing in GNNs | Unknown | N/A | |
| Achieving Near-Optimal Individual Regret & Low Communications in Multi-Agent Bandits | Unknown | N/A | |
| Online Boundary-Free Continual Learning by Scheduled Data Prior | Unknown | N/A | |
| A Higher Precision Algorithm for Computing the $1$-Wasserstein Distance | Unknown | N/A | |
| Bidirectional Language Models Are Also Few-shot Learners | Unknown | N/A | |
| Delving into Semantic Scale Imbalance | Unknown | N/A | |
| On the Trade-Off between Actionable Explanations and the Right to be Forgotten | Unknown | N/A | |
| Over-parameterized Model Optimization with Polyak-Łojasiewicz Condition | Unknown | N/A | |
| Continuous-time identification of dynamic state-space models by deep subspace encoding | Unknown | N/A | |
| AE-FLOW: Autoencoders with Normalizing Flows for Medical Images Anomaly Detection | Unknown | N/A | |
| SpeedyZero: Mastering Atari with Limited Data and Time | Unknown | N/A | |
| Probabilistically Robust Recourse: Navigating the Trade-offs between Costs and Robustness in Algorithmic Recourse | Unknown | N/A | |
| Sampling-based inference for large linear models, with application to linearised Laplace | Unknown | N/A | |
| Constructive TT-representation of the tensors given as index interaction functions with applications | Unknown | N/A | |
| Adaptive Robust Evidential Optimization For Open Set Detection from Imbalanced Data | Unknown | N/A | |
| Is Model Ensemble Necessary? Model-based RL via a Single Model with Lipschitz Regularized Value Function | Unknown | N/A | |
| Lossless Adaptation of Pretrained Vision Models For Robotic Manipulation | Unknown | N/A | |
| Self-Ensemble Protection: Training Checkpoints Are Good Data Protectors | Unknown | N/A | |
| CANIFE: Crafting Canaries for Empirical Privacy Measurement in Federated Learning | Unknown | N/A | |
| Learning to Estimate Single-View Volumetric Flow Motions without 3D Supervision | Unknown | N/A | |
| Variational Information Pursuit for Interpretable Predictions | Unknown | N/A | |
| DiffusER: Diffusion via Edit-based Reconstruction | Unknown | N/A | |
| Synthetic Data Generation of Many-to-Many Datasets via Random Graph Generation | Unknown | N/A | |
| One-Pixel Shortcut: On the Learning Preference of Deep Neural Networks | Unknown | N/A | |
| Unified Detoxifying and Debiasing in Language Generation via Inference-time Adaptive Optimization | Unknown | N/A | |
| Boosting Causal Discovery via Adaptive Sample Reweighting | Unknown | N/A | |
| Inequality phenomenon in $l_{\infty}$-adversarial training, and its unrealized threats | Unknown | N/A | |
| Understanding weight-magnitude hyperparameters in training binary networks | Unknown | N/A | |
| GeneFace: Generalized and High-Fidelity Audio-Driven 3D Talking Face Synthesis | Unknown | N/A | |
| Copy is All You Need | Unknown | N/A | |
| Grounding Graph Network Simulators using Physical Sensor Observations | Unknown | N/A | |
| Predictive Inference with Feature Conformal Prediction | Unknown | N/A | |
| The Curious Case of Benign Memorization | Unknown | N/A | |
| Accurate Bayesian Meta-Learning by Accurate Task Posterior Inference | Unknown | N/A | |
| Compositional Prompt Tuning with Motion Cues for Open-vocabulary Video Relation Detection | Unknown | N/A | |
| Guiding continuous operator learning through Physics-based boundary constraints | Unknown | N/A | |
| Preference Transformer: Modeling Human Preferences using Transformers for RL | Unknown | N/A | |
| LexMAE: Lexicon-Bottlenecked Pretraining for Large-Scale Retrieval | Unknown | N/A | |
| SWIFT: Rapid Decentralized Federated Learning via Wait-Free Model Communication | Unknown | N/A | |
| An Equal-Size Hard EM Algorithm for Diverse Dialogue Generation | Unknown | N/A | |
| A GNN-Guided Predict-and-Search Framework for Mixed-Integer Linear Programming | Unknown | N/A | |
| On Explaining Neural Network Robustness with Activation Path | Unknown | N/A | |
| Liquid Structural State-Space Models | Unknown | N/A | |
| Approximate Vanishing Ideal Computations at Scale | Unknown | N/A | |
| Multi-task Self-supervised Graph Neural Networks Enable Stronger Task Generalization | Unknown | N/A | |
| Empowering Graph Representation Learning with Test-Time Graph Transformation | Unknown | N/A | |
| A Statistical Framework for Personalized Federated Learning and Estimation: Theory, Algorithms, and Privacy | Unknown | N/A | |
| DeCap: Decoding CLIP Latents for Zero-Shot Captioning via Text-Only Training | Unknown | N/A | |
| Personalized Reward Learning with Interaction-Grounded Learning (IGL) | Unknown | N/A | |
| Ordered GNN: Ordering Message Passing to Deal with Heterophily and Over-smoothing | Unknown | N/A | |
| Defending against Adversarial Audio via Diffusion Model | Unknown | N/A | |
| Unsupervised Learning for Combinatorial Optimization Needs Meta Learning | Unknown | N/A | |
| Revisit Finetuning strategy for Few-Shot Learning to Transfer the Emdeddings | Unknown | N/A | |
| The Augmented Image Prior: Distilling 1000 Classes by Extrapolating from a Single Image | Unknown | N/A | |
| A Non-Asymptotic Analysis of Oversmoothing in Graph Neural Networks | Unknown | N/A | |
| LDMIC: Learning-based Distributed Multi-view Image Coding | Unknown | N/A | |
| Riemannian Metric Learning via Optimal Transport | Unknown | N/A | |
| Meta Learning to Bridge Vision and Language Models for Multimodal Few-Shot Learning | Unknown | N/A | |
| $\mathscr{N}$-WL: A New Hierarchy of Expressivity for Graph Neural Networks | Unknown | N/A | |
| Learning Input-agnostic Manipulation Directions in StyleGAN with Text Guidance | Unknown | N/A | |
| DAVA: Disentangling Adversarial Variational Autoencoder | Unknown | N/A | |
| Test-Time Adaptation via Self-Training with Nearest Neighbor Information | Unknown | N/A | |
| InCoder: A Generative Model for Code Infilling and Synthesis | Unknown | N/A | |
| Artificial Neuronal Ensembles with Learned Context Dependent Gating | Unknown | N/A | |
| A General Rank Preserving Framework for Asymmetric Image Retrieval | Unknown | N/A | |
| Preserving Pre-trained Features Helps Calibrate Fine-tuned Language Models | Unknown | N/A | |
| Can discrete information extraction prompts generalize across language models? | Unknown | N/A | |
| Score-based Continuous-time Discrete Diffusion Models | Unknown | N/A | |
| Any-scale Balanced Samplers for Discrete Space | Unknown | N/A | |
| Statistical Theory of Differentially Private Marginal-based Data Synthesis Algorithms | Unknown | N/A | |
| Reliability of CKA as a Similarity Measure in Deep Learning | Unknown | N/A | |
| When and Why Vision-Language Models Behave like Bags-Of-Words, and What to Do About It? | Unknown | N/A | |
| Order Matters: Agent-by-agent Policy Optimization | Unknown | N/A | |
| FastFill: Efficient Compatible Model Update | Unknown | N/A | |
| Mind the Pool: Convolutional Neural Networks Can Overfit Input Size | Unknown | N/A | |
| DFlow: Learning to Synthesize Better Optical Flow Datasets via a Differentiable Pipeline | Unknown | N/A | |
| Transformer-based World Models Are Happy With 100k Interactions | Unknown | N/A | |
| Differentiable Mathematical Programming for Object-Centric Representation Learning | Unknown | N/A | |
| MAST: Masked Augmentation Subspace Training for Generalizable Self-Supervised Priors | Unknown | N/A | |
| Scalable Subset Sampling with Neural Conditional Poisson Networks | Unknown | N/A | |
| Benchmarking Offline Reinforcement Learning on Real-Robot Hardware | Unknown | N/A | |
| A General Framework For Proving The Equivariant Strong Lottery Ticket Hypothesis | Unknown | N/A | |
| Re-weighting Based Group Fairness Regularization via Classwise Robust Optimization | Unknown | N/A | |
| Empowering Networks With Scale and Rotation Equivariance Using A Similarity Convolution | Unknown | N/A | |
| LMC: Fast Training of GNNs via Subgraph Sampling with Provable Convergence | Unknown | N/A | |
| Neural-based classification rule learning for sequential data | Unknown | N/A | |
| ODAM: Gradient-based Instance-Specific Visual Explanations for Object Detection | Unknown | N/A | |
| Learning with Auxiliary Activation for Memory-Efficient Training | Unknown | N/A | |
| Policy-Based Self-Competition for Planning Problems | Unknown | N/A | |
| Out-of-Distribution Detection based on In-Distribution Data Patterns Memorization with Modern Hopfield Energy | Unknown | N/A | |
| Kernel Neural Optimal Transport | Unknown | N/A | |
| Neural Optimal Transport | Unknown | N/A | |
| Interpretability in the Wild: a Circuit for Indirect Object Identification in GPT-2 Small | Unknown | N/A | |
| Discovering Generalizable Multi-agent Coordination Skills from Multi-task Offline Data | Unknown | N/A | |
| Teacher Guided Training: An Efficient Framework for Knowledge Transfer | Unknown | N/A | |
| Measure the Predictive Heterogeneity | Unknown | N/A | |
| Provable Defense Against Geometric Transformations | Unknown | N/A | |
| Tensor-Based Sketching Method for the Low-Rank Approximation of Data Streams | Unknown | N/A | |
| Data augmentation alone can improve adversarial training | Unknown | N/A | |
| CUTS: Neural Causal Discovery from Irregular Time-Series Data | Unknown | N/A | |
| LilNetX: Lightweight Networks with EXtreme Model Compression and Structured Sparsification | Unknown | N/A | |
| Valid P-Value for Deep Learning-driven Salient Region | Unknown | N/A | |
| Pitfalls of Gaussians as a noise distribution in NCE | Unknown | N/A | |
| Broken Neural Scaling Laws | Unknown | N/A | |
| Bridging the Gap between ANNs and SNNs by Calibrating Offset Spikes | Unknown | N/A | |
| Visually-Augmented Language Modeling | Unknown | N/A | |
| A VAE for Transformers with Nonparametric Variational Information Bottleneck | Unknown | N/A | |
| A theoretical study of inductive biases in contrastive learning | Unknown | N/A | |
| Minimum Variance Unbiased N:M Sparsity for the Neural Gradients | Unknown | N/A | |
| Incremental Learning of Structured Memory via Closed-Loop Transcription | Unknown | N/A | |
| Explaining Temporal Graph Models through an Explorer-Navigator Framework | Unknown | N/A | |
| Backpropagation at the Infinitesimal Inference Limit of Energy-Based Models: Unifying Predictive Coding, Equilibrium Propagation, and Contrastive Hebbian Learning | Unknown | N/A | |
| Knowledge-in-Context: Towards Knowledgeable Semi-Parametric Language Models | Unknown | N/A | |
| What Do Self-Supervised Vision Transformers Learn? | Unknown | N/A | |
| Benchmarking Constraint Inference in Inverse Reinforcement Learning | Unknown | N/A | |
| Enhancing the Inductive Biases of Graph Neural ODE for Modeling Physical Systems | Unknown | N/A | |
| ESCHER: Eschewing Importance Sampling in Games by Computing a History Value Function to Estimate Regret | Unknown | N/A | |
| Sampling-free Inference for Ab-Initio Potential Energy Surface Networks | Unknown | N/A | |
| That Label's got Style: Handling Label Style Bias for Uncertain Image Segmentation | Unknown | N/A | |
| On The Relative Error of Random Fourier Features for Preserving Kernel Distance | Unknown | N/A | |
| Towards Inferential Reproducibility of Machine Learning Research | Unknown | N/A | |
| Beyond calibration: estimating the grouping loss of modern neural networks | Unknown | N/A | |
| DeBERTaV3: Improving DeBERTa using ELECTRA-Style Pre-Training with Gradient-Disentangled Embedding Sharing | Unknown | N/A | |
| Weighted Clock Logic Point Process | Unknown | N/A | |
| Squeeze Training for Adversarial Robustness | Unknown | N/A | |
| Asymptotic Instance-Optimal Algorithms for Interactive Decision Making | Unknown | N/A | |
| Is Forgetting Less a Good Inductive Bias for Forward Transfer? | Unknown | N/A | |
| Near-Optimal Deployment Efficiency in Reward-Free Reinforcement Learning with Linear Function Approximation | Unknown | N/A | |
| Where to Begin? On the Impact of Pre-Training and Initialization in Federated Learning | Unknown | N/A | |
| Long-Tailed Partial Label Learning via Dynamic Rebalancing | Unknown | N/A | |
| Global Explainability of GNNs via Logic Combination of Learned Concepts | Unknown | N/A | |
| Task Ambiguity in Humans and Language Models | Unknown | N/A | |
| Discrete Predictor-Corrector Diffusion Models for Image Synthesis | Unknown | N/A | |
| Recon: Reducing Conflicting Gradients From the Root For Multi-Task Learning | Unknown | N/A | |
| More Centralized Training, Still Decentralized Execution: Multi-Agent Conditional Policy Factorization | Unknown | N/A | |
| PerFedMask: Personalized Federated Learning with Optimized Masking Vectors | Unknown | N/A | |
| Edgeformers: Graph-Empowered Transformers for Representation Learning on Textual-Edge Networks | Unknown | N/A | |
| Imbalanced Semi-supervised Learning with Bias Adaptive Classifier | Unknown | N/A | |
| On Compositional Uncertainty Quantification for Seq2seq Graph Parsing | Unknown | N/A | |
| Free Lunch for Domain Adversarial Training: Environment Label Smoothing | Unknown | N/A | |
| Mastering the Game of No-Press Diplomacy via Human-Regularized Reinforcement Learning and Planning | Unknown | N/A | |
| Brain-like representational straightening of natural movies in robust feedforward neural networks | Unknown | N/A | |
| A Simple Approach for Visual Room Rearrangement: 3D Mapping and Semantic Search | Unknown | N/A | |
| Generative Modeling Helps Weak Supervision (and Vice Versa) | Unknown | N/A | |
| Federated Learning from Small Datasets | Unknown | N/A | |
| Multi-level Protein Structure Pre-training via Prompt Learning | Unknown | N/A | |
| PD-MORL: Preference-Driven Multi-Objective Reinforcement Learning Algorithm | Unknown | N/A | |
| A Mixture-of-Expert Approach to RL-based Dialogue Management | Unknown | N/A | |
| M-L2O: Towards Generalizable Learning-to-Optimize by Test-Time Fast Self-Adaptation | Unknown | N/A | |
| Dichotomy of Control: Separating What You Can Control from What You Cannot | Unknown | N/A | |
| Re-calibrating Feature Attributions for Model Interpretation | Unknown | N/A | |
| Revisiting Populations in multi-agent Communication | Unknown | N/A | |
| Learning multi-scale local conditional probability models of images | Unknown | N/A | |
| Characterizing the Influence of Graph Elements | Unknown | N/A | |
| Disentangling Learning Representations with Density Estimation | Unknown | N/A | |
| Simple and Scalable Nearest Neighbor Machine Translation | Unknown | N/A | |
| Language Models Can Teach Themselves to Program Better | Unknown | N/A | |
| What Is Missing in IRM Training and Evaluation? Challenges and Solutions | Unknown | N/A | |
| A Differential Geometric View and Explainability of GNN on Evolving Graphs | Unknown | N/A | |
| TextGrad: Advancing Robustness Evaluation in NLP by Gradient-Driven Optimization | Unknown | N/A | |
| Consolidator: Mergable Adapter with Group Connections for Visual Adaptation | Unknown | N/A | |
| Cross-Level Distillation and Feature Denoising for Cross-Domain Few-Shot Classification | Unknown | N/A | |
| Continuous PDE Dynamics Forecasting with Implicit Neural Representations | Unknown | N/A | |
| Transfer NAS with Meta-learned Bayesian Surrogates | Unknown | N/A | |
| Hierarchical Sliced Wasserstein Distance | Unknown | N/A | |
| Supervision Complexity and its Role in Knowledge Distillation | Unknown | N/A | |
| LipsFormer: Introducing Lipschitz Continuity to Vision Transformers | Unknown | N/A | |
| Automatic Chain of Thought Prompting in Large Language Models | Unknown | N/A | |
| Near-Optimal Adversarial Reinforcement Learning with Switching Costs | Unknown | N/A | |
| Federated Nearest Neighbor Machine Translation | Unknown | N/A | |
| Language Models are Realistic Tabular Data Generators | Unknown | N/A | |
| Logical Entity Representation in Knowledge-Graphs for Differentiable Rule Learning | Unknown | N/A | |
| MLPInit: Embarrassingly Simple GNN Training Acceleration with MLP Initialization | Unknown | N/A | |
| Data Valuation Without Training of a Model | Unknown | N/A | |
| Words are all you need? Language as an approximation for human similarity judgments | Unknown | N/A | |
| Link Prediction with Non-Contrastive Learning | Unknown | N/A | |
| Impossibly Good Experts and How to Follow Them | Unknown | N/A | |
| Scaling Laws For Deep Learning Based Image Reconstruction | Unknown | N/A | |
| Equal Improvability: A New Fairness Notion Considering the Long-term Impact | Unknown | N/A | |
| Certifiably Robust Policy Learning against Adversarial Multi-Agent Communication | Unknown | N/A | |
| Implicit regularization in Heavy-ball momentum accelerated stochastic gradient descent | Unknown | N/A | |
| Hyperparameter Optimization through Neural Network Partitioning | Unknown | N/A | |
| Proto-Value Networks: Scaling Representation Learning with Auxiliary Tasks | Unknown | N/A | |
| Image to Sphere: Learning Equivariant Features for Efficient Pose Prediction | Unknown | N/A | |
| NANSY++: Unified Voice Synthesis with Neural Analysis and Synthesis | Unknown | N/A | |
| Sequential Attention for Feature Selection | Unknown | N/A | |
| Modeling Multimodal Aleatoric Uncertainty in Segmentation with Mixture of Stochastic Experts | Unknown | N/A | |
| Improved Sample Complexity for Reward-free Reinforcement Learning under Low-rank MDPs | Unknown | N/A | |
| Weakly-supervised HOI Detection via Prior-guided Bi-level Representation Learning | Unknown | N/A | |
| ResAct: Reinforcing Long-term Engagement in Sequential Recommendation with Residual Actor | Unknown | N/A | |
| Re-Imagen: Retrieval-Augmented Text-to-Image Generator | Unknown | N/A | |
| Towards Robustness Certification Against Universal Perturbations | Unknown | N/A | |
| Sparse Upcycling: Training Mixture-of-Experts from Dense Checkpoints | Unknown | N/A | |
| Meta-prediction Model for Distillation-Aware NAS on Unseen Datasets | Unknown | N/A | |
| Autoencoders as Cross-Modal Teachers: Can Pretrained 2D Image Transformers Help 3D Representation Learning? | Unknown | N/A | |
| HyperDeepONet: learning operator with complex target function space using the limited resources via hypernetwork | Unknown | N/A | |
| Self-Supervised Category-Level Articulated Object Pose Estimation with Part-Level SE(3) Equivariance | Unknown | N/A | |
| PiFold: Toward effective and efficient protein inverse folding | Unknown | N/A | |
| Capturing the Motion of Every Joint: 3D Human Pose and Shape Estimation with Independent Tokens | Unknown | N/A | |
| Mind the Gap: Offline Policy Optimization for Imperfect Rewards | Unknown | N/A | |
| Autoregressive Conditional Neural Processes | Unknown | N/A | |
| CASR: Generating Complex Sequences with Autoregressive Self-Boost Refinement | Unknown | N/A | |
| Bias Propagation in Federated Learning | Unknown | N/A | |
| Make-A-Video: Text-to-Video Generation without Text-Video Data | Unknown | N/A | |
| kNN-Diffusion: Image Generation via Large-Scale Retrieval | Unknown | N/A | |
| AudioGen: Textually Guided Audio Generation | Unknown | N/A | |
| DCI-ES: An Extended Disentanglement Framework with Connections to Identifiability | Unknown | N/A | |
| Is a Caption Worth a Thousand Images? A Study on Representation Learning | Unknown | N/A | |
| Almost Linear Constant-Factor Sketching for $\ell_1$ and Logistic Regression | Unknown | N/A | |
| Concept Gradient: Concept-based Interpretation Without Linear Assumption | Unknown | N/A | |
| Neural Networks Efficiently Learn Low-Dimensional Representations with SGD | Unknown | N/A | |
| Neural Agents Struggle to Take Turns in Bidirectional Emergent Communication | Unknown | N/A | |
| Complexity-Based Prompting for Multi-step Reasoning | Unknown | N/A | |
| Summarization Programs: Interpretable Abstractive Summarization with Neural Modular Trees | Unknown | N/A | |
| Human-Guided Fair Classification for Natural Language Processing | Unknown | N/A | |
| An Adaptive Policy to Employ Sharpness-Aware Minimization | Unknown | N/A | |
| NTK-SAP: Improving neural network pruning by aligning training dynamics | Unknown | N/A | |
| Revisiting the Assumption of Latent Separability for Backdoor Defenses | Unknown | N/A | |
| Holistic Adversarially Robust Pruning | Unknown | N/A | |
| Restricted Strong Convexity of Deep Learning Models with Smooth Activations | Unknown | N/A | |
| TVSPrune - Pruning Non-discriminative filters via Total Variation separability of intermediate representations without fine tuning | Unknown | N/A | |
| DFPC: Data flow driven pruning of coupled channels without data | Unknown | N/A | |
| Graph Neural Network-Inspired Kernels for Gaussian Processes in Semi-Supervised Learning | Unknown | N/A | |
| Minimum Description Length Control | Unknown | N/A | |
| RGI: robust GAN-inversion for mask-free image inpainting and unsupervised pixel-wise anomaly detection | Unknown | N/A | |
| Part-Based Models Improve Adversarial Robustness | Unknown | N/A | |
| Basic Binary Convolution Unit for Binarized Image Restoration Network | Unknown | N/A | |
| Knowledge Distillation based Degradation Estimation for Blind Super-Resolution | Unknown | N/A | |
| Scale-invariant Bayesian Neural Networks with Connectivity Tangent Kernel | Unknown | N/A | |
| Hybrid RL: Using both offline and online data can make RL efficient | Unknown | N/A | |
| Become a Proficient Player with Limited Data through Watching Pure Videos | Unknown | N/A | |
| Edge Guided GANs with Contrastive Learning for Semantic Image Synthesis | Unknown | N/A | |
| Masked Frequency Modeling for Self-Supervised Visual Pre-Training | Unknown | N/A | |
| On the Convergence of AdaGrad(Norm) on $\mathbb{R}^d$: Beyond Convexity, Non-Asymptotic Rate and Acceleration | Unknown | N/A | |
| Curriculum-based Co-design of Morphology and Control of Voxel-based Soft Robots | Unknown | N/A | |
| Instance-wise Batch Label Restoration via Gradients in Federated Learning | Unknown | N/A | |
| Truthful Self-Play | Unknown | N/A | |
| Zeroth-Order Optimization with Trajectory-Informed Derivative Estimation | Unknown | N/A | |
| Matching receptor to odorant with protein language and graph neural networks | Unknown | N/A | |
| Decompositional Generation Process for Instance-Dependent Partial Label Learning | Unknown | N/A | |
| A Graph Neural Network Approach to Automated Model Building in Cryo-EM Maps | Unknown | N/A | |
| Investigating Multi-task Pretraining and Generalization in Reinforcement Learning | Unknown | N/A | |
| Adversarial Imitation Learning with Preferences | Unknown | N/A | |
| Simple Emergent Action Representations from Multi-Task Policy Training | Unknown | N/A | |
| From $t$-SNE to UMAP with contrastive learning | Unknown | N/A | |
| On the Importance and Applicability of Pre-Training for Federated Learning | Unknown | N/A | |
| When to Make and Break Commitments? | Unknown | N/A | |
| Gromov-Wasserstein Autoencoders | Unknown | N/A | |
| 3D UX-Net: A Large Kernel Volumetric ConvNet Modernizing Hierarchical Transformer for Medical Image Segmentation | Unknown | N/A | |
| Neural Episodic Control with State Abstraction | Unknown | N/A | |
| The Surprising Computational Power of Nondeterministic Stack RNNs | Unknown | N/A | |
| Feature Reconstruction From Outputs Can Mitigate Simplicity Bias in Neural Networks | Unknown | N/A | |
| Dynamic Update-to-Data Ratio: Minimizing World Model Overfitting | Unknown | N/A | |
| Soft Neighbors are Positive Supporters in Contrastive Visual Representation Learning | Unknown | N/A | |
| FedFA: Federated Feature Augmentation | Unknown | N/A | |
| Efficient Planning in a Compact Latent Action Space | Unknown | N/A | |
| SketchKnitter: Vectorized Sketch Generation with Diffusion Models | Unknown | N/A | |
| Cheap Talk Discovery and Utilization in Multi-Agent Reinforcement Learning | Unknown | N/A | |
| MAESTRO: Open-Ended Environment Design for Multi-Agent Reinforcement Learning | Unknown | N/A | |
| On the Perils of Cascading Robust Classifiers | Unknown | N/A | |
| Conservative Bayesian Model-Based Value Expansion for Offline Policy Optimization | Unknown | N/A | |
| FairGBM: Gradient Boosting with Fairness Constraints | Unknown | N/A | |
| Efficient Model Updates for Approximate Unlearning of Graph-Structured Data | Unknown | N/A | |
| Does Learning from Decentralized Non-IID Unlabeled Data Benefit from Self Supervision? | Unknown | N/A | |
| Is the Performance of My Deep Network Too Good to Be True? A Direct Approach to Estimating the Bayes Error in Binary Classification | Unknown | N/A | |
| How I Learned to Stop Worrying and Love Retraining | Unknown | N/A | |
| Malign Overfitting: Interpolation and Invariance are Fundamentally at Odds | Unknown | N/A | |
| AGRO: Adversarial discovery of error-prone Groups for Robust Optimization | Unknown | N/A | |
| Discovering Latent Knowledge in Language Models Without Supervision | Unknown | N/A | |
| Tier Balancing: Towards Dynamic Fairness over Underlying Causal Factors | Unknown | N/A | |
| RLx2: Training a Sparse Deep Reinforcement Learning Model from Scratch | Unknown | N/A | |
| Deep Declarative Dynamic Time Warping for End-to-End Learning of Alignment Paths | Unknown | N/A | |
| PGrad: Learning Principal Gradients For Domain Generalization | Unknown | N/A | |
| Representational Dissimilarity Metric Spaces for Stochastic Neural Networks | Unknown | N/A | |
| Anamnesic Neural Differential Equations with Orthogonal Polynomial Projections | Unknown | N/A | |
| Neural Design for Genetic Perturbation Experiments | Unknown | N/A | |
| Topology-aware Robust Optimization for Out-of-Distribution Generalization | Unknown | N/A | |
| simpleKT: A Simple But Tough-to-Beat Baseline for Knowledge Tracing | Unknown | N/A | |
| DecAF: Joint Decoding of Answers and Logical Forms for Question Answering over Knowledge Bases | Unknown | N/A | |
| Dr.Spider: A Diagnostic Evaluation Benchmark towards Text-to-SQL Robustness | Unknown | N/A | |
| Disentangling the Mechanisms Behind Implicit Regularization in SGD | Unknown | N/A | |
| GLM-130B: An Open Bilingual Pre-trained Model | Unknown | N/A | |
| DEP-RL: Embodied Exploration for Reinforcement Learning in Overactuated and Musculoskeletal Systems | Unknown | N/A | |
| The Onset of Variance-Limited Behavior for Networks in the Lazy and Rich Regimes | Unknown | N/A | |
| Min-Max Multi-objective Bilevel Optimization with Applications in Robust Machine Learning | Unknown | N/A | |
| Vision Transformer Adapter for Dense Predictions | Unknown | N/A | |
| Learning Simultaneous Navigation and Construction in Grid Worlds | Unknown | N/A | |
| GAIN: On the Generalization of Instructional Action Understanding | Unknown | N/A | |
| Text Summarization with Oracle Expectation | Unknown | N/A | |
| Learning Sparse Group Models Through Boolean Relaxation | Unknown | N/A | |
| Combinatorial-Probabilistic Trade-Off: P-Values of Community Properties Test in the Stochastic Block Models | Unknown | N/A | |
| TEMPERA: Test-Time Prompt Editing via Reinforcement Learning | Unknown | N/A | |
| How to Train your HIPPO: State Space Models with Generalized Orthogonal Basis Projections | Unknown | N/A | |
| Sample Complexity of Nonparametric Off-Policy Evaluation on Low-Dimensional Manifolds using Deep Networks | Unknown | N/A | |
| Learning Kernelized Contextual Bandits in a Distributed and Asynchronous Environment | Unknown | N/A | |
| Representation Learning for Low-rank General-sum Markov Games | Unknown | N/A | |
| Deep Reinforcement Learning for Cost-Effective Medical Diagnosis | Unknown | N/A | |
| Layer Grafted Pre-training: Bridging Contrastive Learning And Masked Image Modeling For Label-Efficient Representations | Unknown | N/A | |
| Symmetries, Flat Minima, and the Conserved Quantities of Gradient Flow | Unknown | N/A | |
| Noise Injection Node Regularization for Robust Learning | Unknown | N/A | |
| Differentially Private $L_2$-Heavy Hitters in the Sliding Window Model | Unknown | N/A | |
| Temporal Domain Generalization with Drift-Aware Dynamic Neural Networks | Unknown | N/A | |
| Disparate Impact in Differential Privacy from Gradient Misalignment | Unknown | N/A | |
| Few-Shot Domain Adaptation For End-to-End Communication | Unknown | N/A | |
| Planckian Jitter: countering the color-crippling effects of color jitter on self-supervised training | Unknown | N/A | |
| Calibrating Transformers via Sparse Gaussian Processes | Unknown | N/A | |
| Associative Memory Augmented Asynchronous Spatiotemporal Representation Learning for Event-based Perception | Unknown | N/A | |
| ViewCo: Discovering Text-Supervised Segmentation Masks via Multi-View Semantic Consistency | Unknown | N/A | |
| SAM as an Optimal Relaxation of Bayes | Unknown | N/A | |
| The Dark Side of AutoML: Towards Architectural Backdoor Search | Unknown | N/A | |
| ISS: Image as Stepping Stone for Text-Guided 3D Shape Generation | Unknown | N/A | |
| BrainBERT: Self-supervised representation learning for intracranial recordings | Unknown | N/A | |
| Flow Matching for Generative Modeling | Unknown | N/A | |
| Guiding Safe Exploration with Weakest Preconditions | Unknown | N/A | |
| Accelerating Hamiltonian Monte Carlo via Chebyshev Integration Time | Unknown | N/A | |
| Causal Estimation for Text Data with (Apparent) Overlap Violations | Unknown | N/A | |
| Static Prediction of Runtime Errors by Learning to Execute Programs with External Resource Descriptions | Unknown | N/A | |
| TANGOS: Regularizing Tabular Neural Networks through Gradient Orthogonalization and Specialization | Unknown | N/A | |
| Hungry Hungry Hippos: Towards Language Modeling with State Space Models | Unknown | N/A | |
| Understanding new tasks through the lens of training data via exponential tilting | Unknown | N/A | |
| Perfectly Secure Steganography Using Minimum Entropy Coupling | Unknown | N/A | |
| Predictor-corrector algorithms for stochastic optimization under gradual distribution shift | Unknown | N/A | |
| A Call to Reflect on Evaluation Practices for Failure Detection in Image Classification | Unknown | N/A | |
| FunkNN: Neural Interpolation for Functional Generation | Unknown | N/A | |
| Continual Pre-training of Language Models | Unknown | N/A | |
| Accelerated Single-Call Methods for Constrained Min-Max Optimization | Unknown | N/A | |
| MOAT: Alternating Mobile Convolution and Attention Brings Strong Vision Models | Unknown | N/A | |
| Lower Bounds on the Depth of Integral ReLU Neural Networks via Lattice Polytopes | Unknown | N/A | |
| Implicit Bias of Large Depth Networks: a Notion of Rank for Nonlinear Functions | Unknown | N/A | |
| Learning to CROSS exchange to solve min-max vehicle routing problems | Unknown | N/A | |
| Evolving Populations of Diverse RL Agents with MAP-Elites | Unknown | N/A | |
| Equivariant Descriptor Fields: SE(3)-Equivariant Energy-Based Models for End-to-End Visual Robotic Manipulation Learning | Unknown | N/A | |
| PandA: Unsupervised Learning of Parts and Appearances in the Feature Maps of GANs | Unknown | N/A | |
| Outcome-directed Reinforcement Learning by Uncertainty & Temporal Distance-Aware Curriculum Goal Generation | Unknown | N/A | |
| Learning Adversarial Linear Mixture Markov Decision Processes with Bandit Feedback and Unknown Transition | Unknown | N/A | |
| The In-Sample Softmax for Offline Reinforcement Learning | Unknown | N/A | |
| Compositional Law Parsing with Latent Random Functions | Unknown | N/A | |
| Reversible Column Networks | Unknown | N/A | |
| Robust Algorithms on Adaptive Inputs from Bounded Adversaries | Unknown | N/A | |
| Selection-Inference: Exploiting Large Language Models for Interpretable Logical Reasoning | Unknown | N/A | |
| Transformers are Sample-Efficient World Models | Unknown | N/A | |
| Towards Minimax Optimal Reward-free Reinforcement Learning in Linear MDPs | Unknown | N/A | |
| Robust Explanation Constraints for Neural Networks | Unknown | N/A | |
| Domain Generalization via Heckman-type Selection Models | Unknown | N/A | |
| Quasi-optimal Reinforcement Learning with Continuous Actions | Unknown | N/A | |
| Generalization Bounds for Federated Learning: Fast Rates, Unparticipating Clients and Unbounded Losses | Unknown | N/A | |
| Agree to Disagree: Diversity through Disagreement for Better Transferability | Unknown | N/A | |
| Which Layer is Learning Faster? A Systematic Exploration of Layer-wise Convergence Rate for Deep Neural Networks | Unknown | N/A | |
| Strong inductive biases provably prevent harmless interpolation | Unknown | N/A | |
| Parametrizing Product Shape Manifolds by Composite Networks | Unknown | N/A | |
| Quantifying and Mitigating the Impact of Label Errors on Model Disparity Metrics | Unknown | N/A | |
| Ollivier-Ricci Curvature for Hypergraphs: A Unified Framework | Unknown | N/A | |
| Hidden Markov Transformer for Simultaneous Machine Translation | Unknown | N/A | |
| Sequential Latent Variable Models for Few-Shot High-Dimensional Time-Series Forecasting | Unknown | N/A | |
| Minimax Optimal Kernel Operator Learning via Multilevel Training | Unknown | N/A | |
| MeshDiffusion: Score-based Generative 3D Mesh Modeling | Unknown | N/A | |
| ChordMixer: A Scalable Neural Attention Model for Sequences with Different Length | Unknown | N/A | |
| FaiREE: fair classification with finite-sample and distribution-free guarantee | Unknown | N/A | |
| On the duality between contrastive and non-contrastive self-supervised learning | Unknown | N/A | |
| ImageNet-X: Understanding Model Mistakes with Factor of Variation Annotations | Unknown | N/A | |
| Emergence of Maps in the Memories of Blind Navigation Agents | Unknown | N/A | |
| Molecular Geometry Pretraining with SE(3)-Invariant Denoising Distance Matching | Unknown | N/A | |
| Scaleformer: Iterative Multi-scale Refining Transformers for Time Series Forecasting | Unknown | N/A | |
| Simplicial Hopfield networks | Unknown | N/A | |
| Adversarial Training of Self-supervised Monocular Depth Estimation against Physical-World Attacks | Unknown | N/A | |
| Turning the Curse of Heterogeneity in Federated Learning into a Blessing for Out-of-Distribution Detection | Unknown | N/A | |
| D4AM: A General Denoising Framework for Downstream Acoustic Models | Unknown | N/A | |
| Finding Actual Descent Directions for Adversarial Training | Unknown | N/A | |
| Interpretations of Domain Adaptations via Layer Variational Analysis | Unknown | N/A | |
| Data Continuity Matters: Improving Sequence Modeling with Lipschitz Regularizer | Unknown | N/A | |
| MapTR: Structured Modeling and Learning for Online Vectorized HD Map Construction | Unknown | N/A | |
| Policy Expansion for Bridging Offline-to-Online Reinforcement Learning | Unknown | N/A | |
| Mitigating Memorization of Noisy Labels via Regularization between Representations | Unknown | N/A | |
| ROSCOE: A Suite of Metrics for Scoring Step-by-Step Reasoning | Unknown | N/A | |
| Learning Cut Selection for Mixed-Integer Linear Programming via Hierarchical Sequence Model | Unknown | N/A | |
| Planning Goals for Exploration | Unknown | N/A | |
| DASHA: Distributed Nonconvex Optimization with Communication Compression and Optimal Oracle Complexity | Unknown | N/A | |
| Self-Guided Noise-Free Data Generation for Efficient Zero-Shot Learning | Unknown | N/A | |
| AutoGT: Automated Graph Transformer Architecture Search | Unknown | N/A | |
| Progress measures for grokking via mechanistic interpretability | Unknown | N/A | |
| Active Learning in Bayesian Neural Networks with Balanced Entropy Learning Principle | Unknown | N/A | |
| Variance-Aware Sparse Linear Bandits | Unknown | N/A | |
| Label-free Concept Bottleneck Models | Unknown | N/A | |
| The Role of ImageNet Classes in Fréchet Inception Distance | Unknown | N/A | |
| Embedding Fourier for Ultra-High-Definition Low-Light Image Enhancement | Unknown | N/A | |
| Optimal Transport for Offline Imitation Learning | Unknown | N/A | |
| Does Zero-Shot Reinforcement Learning Exist? | Unknown | N/A | |
| A Self-Attention Ansatz for Ab-initio Quantum Chemistry | Unknown | N/A | |
| A Neural Mean Embedding Approach for Back-door and Front-door Adjustment | Unknown | N/A | |
| TranSpeech: Speech-to-Speech Translation With Bilateral Perturbation | Unknown | N/A | |
| Certified Training: Small Boxes are All You Need | Unknown | N/A | |
| CLIP-ViP: Adapting Pre-trained Image-Text Model to Video-Language Alignment | Unknown | N/A | |
| BSTT: A Bayesian Spatial-Temporal Transformer for Sleep Staging | Unknown | N/A | |
| Optimizing Bi-Encoder for Named Entity Recognition via Contrastive Learning | Unknown | N/A | |
| Pre-training via Denoising for Molecular Property Prediction | Unknown | N/A | |
| AANG: Automating Auxiliary Learning | Unknown | N/A | |
| Dual Algorithmic Reasoning | Unknown | N/A | |
| SoftMatch: Addressing the Quantity-Quality Tradeoff in Semi-supervised Learning | Unknown | N/A | |
| Equivariant Energy-Guided SDE for Inverse Molecular Design | Unknown | N/A | |
| Quantile Risk Control: A Flexible Framework for Bounding the Probability of High-Loss Predictions | Unknown | N/A | |
| Revocable Deep Reinforcement Learning with Affinity Regularization for Outlier-Robust Graph Matching | Unknown | N/A | |
| On the Feasibility of Cross-Task Transfer with Model-Based Reinforcement Learning | Unknown | N/A | |
| Learning Continuous Normalizing Flows For Faster Convergence To Target Distribution via Ascent Regularizations | Unknown | N/A | |
| A Simple Yet Powerful Deep Active Learning With Snapshots Ensembles | Unknown | N/A | |
| Towards Interpretable Deep Reinforcement Learning with Human-Friendly Prototypes | Unknown | N/A | |
| The Trade-off between Universality and Label Efficiency of Representations from Contrastive Learning | Unknown | N/A | |
| Simplicial Embeddings in Self-Supervised Learning and Downstream Classification | Unknown | N/A | |
| Learning Label Encodings for Deep Regression | Unknown | N/A | |
| Offline Congestion Games: How Feedback Type Affects Data Coverage Requirement | Unknown | N/A | |
| Neural Implicit Shape Editing using Boundary Sensitivity | Unknown | N/A | |
| Multifactor Sequential Disentanglement via Structured Koopman Autoencoders | Unknown | N/A | |
| Asynchronous Distributed Bilevel Optimization | Unknown | N/A | |
| Towards Open Temporal Graph Neural Networks | Unknown | N/A | |
| VA-DepthNet: A Variational Approach to Single Image Depth Prediction | Unknown | N/A | |
| Graph Contrastive Learning for Skeleton-based Action Recognition | Unknown | N/A | |
| Strategic Classification with Graph Neural Networks | Unknown | N/A | |
| Memory Gym: Partially Observable Challenges to Memory-Based Agents | Unknown | N/A | |
| What learning algorithm is in-context learning? Investigations with linear models | Unknown | N/A | |
| Discovering Policies with DOMiNO: Diversity Optimization Maintaining Near Optimality | Unknown | N/A | |
| Learning to Extrapolate: A Transductive Approach | Unknown | N/A | |
| Learning Achievement Structure for Structured Exploration in Domains with Sparse Reward | Unknown | N/A | |
| How Sharpness-Aware Minimization Minimizes Sharpness? | Unknown | N/A | |
| Benign Overfitting in Classification: Provably Counter Label Noise with Larger Models | Unknown | N/A | |
| DreamFusion: Text-to-3D using 2D Diffusion | Unknown | N/A | |
| The Modality Focusing Hypothesis: Towards Understanding Crossmodal Knowledge Distillation | Unknown | N/A | |
| What Can we Learn From The Selective Prediction And Uncertainty Estimation Performance Of 523 Imagenet Classifiers? | Unknown | N/A | |
| The Symmetric Generalized Eigenvalue Problem as a Nash Equilibrium | Unknown | N/A | |
| Integrating Symmetry into Differentiable Planning with Steerable Convolutions | Unknown | N/A | |
| Deep Generative Symbolic Regression | Unknown | N/A | |
| Addressing Parameter Choice Issues in Unsupervised Domain Adaptation by Aggregation | Unknown | N/A | |
| Correlative Information Maximization Based Biologically Plausible Neural Networks for Correlated Source Separation | Unknown | N/A | |
| Batch Multivalid Conformal Prediction | Unknown | N/A | |
| Neural Networks and the Chomsky Hierarchy | Unknown | N/A | |
| SoftZoo: A Soft Robot Co-design Benchmark For Locomotion In Diverse Environments | Unknown | N/A | |
| A System for Morphology-Task Generalization via Unified Representation and Behavior Distillation | Unknown | N/A | |
| This Looks Like It Rather Than That: ProtoKNN For Similarity-Based Classifiers | Unknown | N/A | |
| DIFFormer: Scalable (Graph) Transformers Induced by Energy Constrained Diffusion | Unknown | N/A | |
| Towards Understanding GD with Hard and Conjugate Pseudo-labels for Test-Time Adaptation | Unknown | N/A | |
| Context-enriched molecule representations improve few-shot drug discovery | Unknown | N/A | |
| Boosting Adversarial Transferability using Dynamic Cues | Unknown | N/A | |
| Scalable and Equivariant Spherical CNNs by Discrete-Continuous (DISCO) Convolutions | Unknown | N/A | |
| When Source-Free Domain Adaptation Meets Learning with Noisy Labels | Unknown | N/A | |
| Training-Free Structured Diffusion Guidance for Compositional Text-to-Image Synthesis | Unknown | N/A | |
| CrAM: A Compression-Aware Minimizer | Unknown | N/A | |
| Semi-Implicit Variational Inference via Score Matching | Unknown | N/A | |
| Generative Augmented Flow Networks | Unknown | N/A | |
| Accurate Neural Training with 4-bit Matrix Multiplications at Standard Formats | Unknown | N/A | |
| Multiple sequence alignment as a sequence-to-sequence learning problem | Unknown | N/A | |
| A Laplace-inspired Distribution on SO(3) for Probabilistic Rotation Estimation | Unknown | N/A | |
| A Primal-Dual Framework for Transformers and Neural Networks | Unknown | N/A | |
| Solving Constrained Variational Inequalities via a First-order Interior Point-based Method | Unknown | N/A | |
| Spectral Augmentation for Self-Supervised Learning on Graphs | Unknown | N/A | |
| Neural Causal Models for Counterfactual Identification and Estimation | Unknown | N/A | |
| The Influence of Learning Rule on Representation Dynamics in Wide Neural Networks | Unknown | N/A | |
| Uni-Mol: A Universal 3D Molecular Representation Learning Framework | Unknown | N/A | |
| PASHA: Efficient HPO and NAS with Progressive Resource Allocation | Unknown | N/A | |
| Sign and Basis Invariant Networks for Spectral Graph Representation Learning | Unknown | N/A | |
| Betty: An Automatic Differentiation Library for Multilevel Optimization | Unknown | N/A | |
| EA-HAS-Bench: Energy-aware Hyperparameter and Architecture Search Benchmark | Unknown | N/A | |
| Structure by Architecture: Structured Representations without Regularization | Unknown | N/A | |
| A Non-monotonic Self-terminating Language Model | Unknown | N/A | |
| Learning topology-preserving data representations | Unknown | N/A | |
| Mitigating Gradient Bias in Multi-objective Learning: A Provably Convergent Approach | Unknown | N/A | |
| Quantifying Memorization Across Neural Language Models | Unknown | N/A | |
| Task-customized Masked Autoencoder via Mixture of Cluster-conditional Experts | Unknown | N/A | |
| KnowDA: All-in-One Knowledge Mixture Model for Data Augmentation in Low-Resource NLP | Unknown | N/A | |
| Scaffolding a Student to Instill Knowledge | Unknown | N/A | |
| Efficient Edge Inference by Selective Query | Unknown | N/A | |
| Graph Signal Sampling for Inductive One-Bit Matrix Completion: a Closed-form Solution | Unknown | N/A | |
| Faster Gradient-Free Methods for Escaping Saddle Points | Unknown | N/A | |
| Can Agents Run Relay Race with Strangers? Generalization of RL to Out-of-Distribution Trajectories | Unknown | N/A | |
| Semi-Parametric Inducing Point Networks and Neural Processes | Unknown | N/A | |
| Taking a Step Back with KCal: Multi-Class Kernel-Based Calibration for Deep Neural Networks | Unknown | N/A | |
| Emergent World Representations: Exploring a Sequence Model Trained on a Synthetic Task | Unknown | N/A | |
| Progressive Prompts: Continual Learning for Language Models | Unknown | N/A | |
| Hyperbolic Self-paced Learning for Self-supervised Skeleton-based Action Representations | Unknown | N/A | |
| Iterative Patch Selection for High-Resolution Image Recognition | Unknown | N/A | |
| ReAct: Synergizing Reasoning and Acting in Language Models | Unknown | N/A | |
| Efficient Offline Policy Optimization with a Learned Model | Unknown | N/A | |
| Learning to Solve Constraint Satisfaction Problems with Recurrent Transformer | Unknown | N/A | |
| Timing is Everything: Learning to Act Selectively with Costly Actions and Budgetary Constraints | Unknown | N/A | |
| New Insights for the Stability-Plasticity Dilemma in Online Continual Learning | Unknown | N/A | |
| Temporal Dependencies in Feature Importance for Time Series Prediction | Unknown | N/A | |
| Video Scene Graph Generation from Single-Frame Weak Supervision | Unknown | N/A | |
| Versatile Neural Processes for Learning Implicit Neural Representations | Unknown | N/A | |
| Human Motion Diffusion Model | Unknown | N/A | |
| Compressing multidimensional weather and climate data into neural networks | Unknown | N/A | |
| SGDA with shuffling: faster convergence for nonconvex-PŁ minimax optimization | Unknown | N/A | |
| Simplified State Space Layers for Sequence Modeling | Unknown | N/A | |
| Minimalistic Unsupervised Representation Learning with the Sparse Manifold Transform | Unknown | N/A | |
| SQA3D: Situated Question Answering in 3D Scenes | Unknown | N/A | |
| A Minimalist Dataset for Systematic Generalization of Perception, Syntax, and Semantics | Unknown | N/A | |
| Sparsity May Cry: Let Us Fail (Current) Sparse Neural Networks Together! | Unknown | N/A | |
| Rethinking the Expressive Power of GNNs via Graph Biconnectivity | Unknown | N/A | |
| Bayes Risk CTC: Controllable CTC Alignment in Sequence-to-Sequence Tasks | Unknown | N/A | |
| Efficiently Computing Nash Equilibria in Adversarial Team Markov Games | Unknown | N/A | |
| Unsupervised Model Selection for Time Series Anomaly Detection | Unknown | N/A | |
| Temporal Disentanglement of Representations for Improved Generalisation in Reinforcement Learning | Unknown | N/A | |
| Towards Understanding Ensemble, Knowledge Distillation and Self-Distillation in Deep Learning | Unknown | N/A | |
| NTFields: Neural Time Fields for Physics-Informed Robot Motion Planning | Unknown | N/A | |
| Backpropagation through Combinatorial Algorithms: Identity with Projection Works | Unknown | N/A | |
| DiffEdit: Diffusion-based semantic image editing with mask guidance | Unknown | N/A | |
| Towards Stable Test-time Adaptation in Dynamic Wild World | Unknown | N/A | |
| Denoising Masked Autoencoders Help Robust Classification | Unknown | N/A | |
| EquiMod: An Equivariance Module to Improve Visual Instance Discrimination | Unknown | N/A | |
| One Transformer Can Understand Both 2D & 3D Molecular Data | Unknown | N/A | |
| Warping the Space: Weight Space Rotation for Class-Incremental Few-Shot Learning | Unknown | N/A | |
| FreeMatch: Self-adaptive Thresholding for Semi-supervised Learning | Unknown | N/A | |
| Learning with Logical Constraints but without Shortcut Satisfaction | Unknown | N/A | |
| Robust and Controllable Object-Centric Learning through Energy-based Models | Unknown | N/A | |
| Do We Really Need Complicated Model Architectures For Temporal Networks? | Unknown | N/A | |
| DepthFL: Depthwise Federated Learning for Heterogeneous Clients | Unknown | N/A | |
| Over-Training with Mixup May Hurt Generalization | Unknown | N/A | |
| Self-supervised learning with rotation-invariant kernels | Unknown | N/A | |
| Towards Understanding and Mitigating Dimensional Collapse in Heterogeneous Federated Learning | Unknown | N/A | |
| Causal Balancing for Domain Generalization | Unknown | N/A | |
| Multitask Prompt Tuning Enables Parameter-Efficient Transfer Learning | Unknown | N/A | |
| Why (and When) does Local SGD Generalize Better than SGD? | Unknown | N/A | |
| Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization | Unknown | N/A | |
| Imitating Human Behaviour with Diffusion Models | Unknown | N/A | |
| Better Generative Replay for Continual Federated Learning | Unknown | N/A | |
| WikiWhy: Answering and Explaining Cause-and-Effect Questions | Unknown | N/A | |
| Exploring The Role of Mean Teachers in Self-supervised Masked Auto-Encoders | Unknown | N/A | |
| EVC: Towards Real-Time Neural Image Compression with Mask Decay | Unknown | N/A | |
| Sparse Token Transformer with Attention Back Tracking | Unknown | N/A | |
| Short-Term Memory Convolutions | Unknown | N/A | |
| MaskViT: Masked Visual Pre-Training for Video Prediction | Unknown | N/A | |
| Revisiting Pruning at Initialization Through the Lens of Ramanujan Graph | Unknown | N/A | |
| Sharper Bounds for Uniformly Stable Algorithms with Stationary Mixing Process | Unknown | N/A | |
| Analogy-Forming Transformers for Few-Shot 3D Parsing | Unknown | N/A | |
| FluidLab: A Differentiable Environment for Benchmarking Complex Fluid Manipulation | Unknown | N/A | |
| Meta Temporal Point Processes | Unknown | N/A | |
| Images as Weight Matrices: Sequential Image Generation Through Synaptic Learning Rules | Unknown | N/A | |
| How to prepare your task head for finetuning | Unknown | N/A | |
| Dataset Pruning: Reducing Training Data by Examining Generalization Influence | Unknown | N/A | |
| Graph Neural Networks are Inherently Good Generalizers: Insights by Bridging GNNs and MLPs | Unknown | N/A | |
| KwikBucks: Correlation Clustering with Cheap-Weak and Expensive-Strong Signals | Unknown | N/A | |
| Automated Data Augmentations for Graph Classification | Unknown | N/A | |
| Population-size-Aware Policy Optimization for Mean-Field Games | Unknown | N/A | |
| Learning Soft Constraints From Constrained Expert Demonstrations | Unknown | N/A | |
| Energy-based Out-of-Distribution Detection for Graph Neural Networks | Unknown | N/A | |
| Learning Fast and Slow for Online Time Series Forecasting | Unknown | N/A | |
| Towards Lightweight, Model-Agnostic and Diversity-Aware Active Anomaly Detection | Unknown | N/A | |
| An Image is Worth One Word: Personalizing Text-to-Image Generation using Textual Inversion | Unknown | N/A | |
| Neural Compositional Rule Learning for Knowledge Graph Reasoning | Unknown | N/A | |
| How robust is unsupervised representation learning to distribution shift? | Unknown | N/A | |
| Distilling Cognitive Backdoor Patterns within an Image | Unknown | N/A | |
| Iterative Circuit Repair Against Formal Specifications | Unknown | N/A | |
| MocoSFL: enabling cross-client collaborative self-supervised learning | Unknown | N/A | |
| Better Teacher Better Student: Dynamic Prior Knowledge for Knowledge Distillation | Unknown | N/A | |
| MECTA: Memory-Economic Continual Test-Time Model Adaptation | Unknown | N/A | |
| Continuous pseudo-labeling from the start | Unknown | N/A | |
| IDEAL: Query-Efficient Data-Free Learning from Black-Box Models | Unknown | N/A | |
| Offline Q-learning on Diverse Multi-Task Data Both Scales And Generalizes | Unknown | N/A | |
| DiGress: Discrete Denoising diffusion for graph generation | Unknown | N/A | |
| Deja Vu: Continual Model Generalization for Unseen Domains | Unknown | N/A | |
| Q-Pensieve: Boosting Sample Efficiency of Multi-Objective RL Through Memory Sharing of Q-Snapshots | Unknown | N/A | |
| Adversarial Diversity in Hanabi | Unknown | N/A | |
| Statistical Efficiency of Score Matching: The View from Isoperimetry | Unknown | N/A | |
| Leveraging Future Relationship Reasoning for Vehicle Trajectory Prediction | Unknown | N/A | |
| LMSeg: Language-guided Multi-dataset Segmentation | Unknown | N/A | |
| RoPAWS: Robust Semi-supervised Representation Learning from Uncurated Data | Unknown | N/A | |
| Toward Adversarial Training on Contextualized Language Representation | Unknown | N/A | |
| BC-IRL: Learning Generalizable Reward Functions from Demonstrations | Unknown | N/A | |
| Omnigrok: Grokking Beyond Algorithmic Data | Unknown | N/A | |
| GRACE-C: Generalized Rate Agnostic Causal Estimation via Constraints | Unknown | N/A | |
| Actionable Neural Representations: Grid Cells from Minimal Constraints | Unknown | N/A | |
| Optimal Activation Functions for the Random Features Regression Model | Unknown | N/A | |
| EUCLID: Towards Efficient Unsupervised Reinforcement Learning with Multi-choice Dynamics Model | Unknown | N/A | |
| Sparse tree-based Initialization for Neural Networks | Unknown | N/A | |
| A Closer Look at Model Adaptation using Feature Distortion and Simplicity Bias | Unknown | N/A | |
| Cycle to Clique (Cy2C) Graph Neural Network: A Sight to See beyond Neighborhood Aggregation | Unknown | N/A | |
| MaskFusion: Feature Augmentation for Click-Through Rate Prediction via Input-adaptive Mask Fusion | Unknown | N/A | |
| Rethinking Self-Supervised Visual Representation Learning in Pre-training for 3D Human Pose and Shape Estimation | Unknown | N/A | |
| Learned Index with Dynamic $\epsilon$ | Unknown | N/A | |
| Boosting Multiagent Reinforcement Learning via Permutation Invariant and Permutation Equivariant Networks | Unknown | N/A | |
| DAG Learning on the Permutahedron | Unknown | N/A | |
| UNICORN: A Unified Backdoor Trigger Inversion Framework | Unknown | N/A | |
| PV3D: A 3D Generative Model for Portrait Video Generation | Unknown | N/A | |
| UNIFIED-IO: A Unified Model for Vision, Language, and Multi-modal Tasks | Unknown | N/A | |
| On The Specialization of Neural Modules | Unknown | N/A | |
| Learning What and Where: Disentangling Location and Identity Tracking Without Supervision | Unknown | N/A | |
| Mass-Editing Memory in a Transformer | Unknown | N/A | |
| Few-shot Backdoor Attacks via Neural Tangent Kernels | Unknown | N/A | |
| NERDS: A General Framework to Train Camera Denoisers from Raw-RGB Noisy Image Pairs | Unknown | N/A | |
| Programmatically Grounded, Compositionally Generalizable Robotic Manipulation | Unknown | N/A | |
| Truncated Diffusion Probabilistic Models and Diffusion-based Adversarial Auto-Encoders | Unknown | N/A | |
| ILA-DA: Improving Transferability of Intermediate Level Attack with Data Augmentation | Unknown | N/A | |
| Contrastive Alignment of Vision to Language Through Parameter-Efficient Transfer Learning | Unknown | N/A | |
| Diffusion-GAN: Training GANs with Diffusion | Unknown | N/A | |
| BEEF: Bi-Compatible Class-Incremental Learning via Energy-Based Expansion and Fusion | Unknown | N/A | |
| QuAnt: Quantum Annealing with Learnt Couplings | Unknown | N/A | |
| MACTA: A Multi-agent Reinforcement Learning Approach for Cache Timing Attacks and Detection | Unknown | N/A | |
| Schema Inference for Interpretable Image Classification | Unknown | N/A | |
| Relative representations enable zero-shot latent space communication | Unknown | N/A | |
| On Achieving Optimal Adversarial Test Error | Unknown | N/A | |
| Self-Consistency Improves Chain of Thought Reasoning in Language Models | Unknown | N/A | |
| Spiking Convolutional Neural Networks for Text Classification | Unknown | N/A | |
| A Kernel Perspective of Skip Connections in Convolutional Networks | Unknown | N/A | |
| In-Situ Text-Only Adaptation of Speech Models with Low-Overhead Speech Imputations | Unknown | N/A | |
| Confidence-Conditioned Value Functions for Offline Reinforcement Learning | Unknown | N/A | |
| Unsupervised Semantic Segmentation with Self-supervised Object-centric Representations | Unknown | N/A | |
| Analyzing Tree Architectures in Ensembles via Neural Tangent Kernel | Unknown | N/A | |
| Aligning Model and Macaque Inferior Temporal Cortex Representations Improves Model-to-Human Behavioral Alignment and Adversarial Robustness | Unknown | N/A | |
| Progressive Voronoi Diagram Subdivision Enables Accurate Data-free Class-Incremental Learning | Unknown | N/A | |
| Domain-Indexing Variational Bayes: Interpretable Domain Index for Domain Adaptation | Unknown | N/A | |
| Explicitly Minimizing the Blur Error of Variational Autoencoders | Unknown | N/A | |
| Learning to Estimate Shapley Values with Vision Transformers | Unknown | N/A | |
| HotProtein: A Novel Framework for Protein Thermostability Prediction and Editing | Unknown | N/A | |
| Hyper-Decision Transformer for Efficient Online Policy Adaptation | Unknown | N/A | |
| Loss Landscapes are All You Need: Neural Network Generalization Can Be Explained Without the Implicit Bias of Gradient Descent | Unknown | N/A | |
| Sparsity-Constrained Optimal Transport | Unknown | N/A | |
| Real-Time Image Demoiréing on Mobile Devices | Unknown | N/A | |
| RPM: Generalizable Multi-Agent Policies for Multi-Agent Reinforcement Learning | Unknown | N/A | |
| Medical Image Understanding with Pretrained Vision Language Models: A Comprehensive Study | Unknown | N/A | |
| Understanding Influence Functions and Datamodels via Harmonic Analysis | Unknown | N/A | |
| Stochastic No-regret Learning for General Games with Variance Reduction | Unknown | N/A | |
| When Data Geometry Meets Deep Function: Generalizing Offline Reinforcement Learning | Unknown | N/A | |
| MIMT: Masked Image Modeling Transformer for Video Compression | Unknown | N/A | |
| Graph-based Deterministic Policy Gradient for Repetitive Combinatorial Optimization Problems | Unknown | N/A | |
| Efficient Discrete Multi Marginal Optimal Transport Regularization | Unknown | N/A | |
| Evaluating Representations with Readout Model Switching | Unknown | N/A | |
| Sequential Learning of Neural Networks for Prequential MDL | Unknown | N/A | |
| Explaining RL Decisions with Trajectories | Unknown | N/A | |
| Efficient Conditionally Invariant Representation Learning | Unknown | N/A | |
| Stable Target Field for Reduced Variance Score Estimation in Diffusion Models | Unknown | N/A | |
| Relative Behavioral Attributes: Filling the Gap between Symbolic Goal Specification and Reward Learning from Human Preferences | Unknown | N/A | |
| ROCO: A General Framework for Evaluating Robustness of Combinatorial Optimization Solvers on Graphs | Unknown | N/A | |
| DINO as a von Mises-Fisher mixture model | Unknown | N/A | |
| The Lie Derivative for Measuring Learned Equivariance | Unknown | N/A | |
| Continuized Acceleration for Quasar Convex Functions in Non-Convex Optimization | Unknown | N/A | |
| Meta-Learning in Games | Unknown | N/A | |
| Universal Approximation Theorems for Differentiable Geometric Deep Learning | Unknown | N/A | |
| Provable Memorization Capacity of Transformers | Unknown | N/A | |
| Bridge the Inference Gaps of Neural Processes via Expectation Maximization | Unknown | N/A | |
| Masked Vision and Language Modeling for Multi-modal Representation Learning | Unknown | N/A | |
| Massively Scaling Heteroscedastic Classifiers | Unknown | N/A | |
| CROM: Continuous Reduced-Order Modeling of PDEs Using Implicit Neural Representations | Unknown | N/A | |
| Certified Defences Against Adversarial Patch Attacks on Semantic Segmentation | Unknown | N/A | |
| Last Layer Re-Training is Sufficient for Robustness to Spurious Correlations | Unknown | N/A | |
| Decision Transformer under Random Frame Dropping | Unknown | N/A | |
| Visual Classification via Description from Large Language Models | Unknown | N/A | |
| De Novo Molecular Generation via Connection-aware Motif Mining | Unknown | N/A | |
| Targeted Hyperparameter Optimization with Lexicographic Preferences Over Multiple Objectives | Unknown | N/A | |
| Towards Smooth Video Composition | Unknown | N/A | |
| Revisiting Graph Adversarial Attack and Defense From a Data Distribution Perspective | Unknown | N/A | |
| How to Exploit Hyperspherical Embeddings for Out-of-Distribution Detection? | Unknown | N/A | |
| How Much Space Has Been Explored? Measuring the Chemical Space Covered by Databases and Machine-Generated Molecules | Unknown | N/A | |
| E-CRF: Embedded Conditional Random Field for Boundary-caused Class Weights Confusion in Semantic Segmentation | Unknown | N/A | |
| Online Bias Correction for Task-Free Continual Learning | Unknown | N/A | |
| Arbitrary Virtual Try-on Network: Characteristics Representation and Trade-off between Body and Clothing | Unknown | N/A | |
| Behind the Scenes of Gradient Descent: A Trajectory Analysis via Basis Function Decomposition | Unknown | N/A | |
| Don’t fear the unlabelled: safe semi-supervised learning via debiasing | Unknown | N/A | |
| Learning differentiable solvers for systems with hard constraints | Unknown | N/A | |
| DM-NeRF: 3D Scene Geometry Decomposition and Manipulation from 2D Images | Unknown | N/A | |
| SlotFormer: Unsupervised Visual Dynamics Simulation with Object-Centric Models | Unknown | N/A | |
| A framework for benchmarking Class-out-of-distribution detection and its application to ImageNet | Unknown | N/A | |
| Token Merging: Your ViT But Faster | Unknown | N/A | |
| NAGphormer: A Tokenized Graph Transformer for Node Classification in Large Graphs | Unknown | N/A | |
| Ensuring DNN Solution Feasibility for Optimization Problems with Linear Constraints | Unknown | N/A | |
| Retrieval-based Controllable Molecule Generation | Unknown | N/A | |
| Prompt-to-Prompt Image Editing with Cross-Attention Control | Unknown | N/A | |
| Localized Randomized Smoothing for Collective Robustness Certification | Unknown | N/A | |
| Hebbian Deep Learning Without Feedback | Unknown | N/A | |
| Draft, Sketch, and Prove: Guiding Formal Theorem Provers with Informal Proofs | Unknown | N/A | |
| Generalizing and Decoupling Neural Collapse via Hyperspherical Uniformity Gap | Unknown | N/A | |
| Distributionally Robust Post-hoc Classifiers under Prior Shifts | Unknown | N/A | |
| More ConvNets in the 2020s: Scaling up Kernels Beyond 51x51 using Sparsity | Unknown | N/A | |
| Stochastic Multi-Person 3D Motion Forecasting | Unknown | N/A | |
| TabPFN: A Transformer That Solves Small Tabular Classification Problems in a Second | Unknown | N/A | |
| Learning Structured Representations by Embedding Class Hierarchy | Unknown | N/A | |
| Multi-Objective Reinforcement Learning: Convexity, Stationarity and Pareto Optimality | Unknown | N/A | |
| Asynchronous Gradient Play in Zero-Sum Multi-agent Games | Unknown | N/A | |
| Novel View Synthesis with Diffusion Models | Unknown | N/A | |
| Harnessing Mixed Offline Reinforcement Learning Datasets via Trajectory Weighting | Unknown | N/A | |
| Behavior Prior Representation learning for Offline Reinforcement Learning | Unknown | N/A | |
| Trading Information between Latents in Hierarchical Variational Autoencoders | Unknown | N/A | |
| MEDFAIR: Benchmarking Fairness for Medical Imaging | Unknown | N/A | |
| Decoupled Training for Long-Tailed Classification With Stochastic Representations | Unknown | N/A | |
| Fuzzy Alignments in Directed Acyclic Graph for Non-Autoregressive Machine Translation | Unknown | N/A | |
| Building a Subspace of Policies for Scalable Continual Learning | Unknown | N/A | |
| Unbiased Stochastic Proximal Solver for Graph Neural Networks with Equilibrium States | Unknown | N/A | |
| 3D Segmenter: 3D Transformer based Semantic Segmentation via 2D Panoramic Distillation | Unknown | N/A | |
| TabCaps: A Capsule Neural Network for Tabular Data Classification with BoW Routing | Unknown | N/A | |
| GOOD: Exploring geometric cues for detecting objects in an open world | Unknown | N/A | |
| Average Sensitivity of Decision Tree Learning | Unknown | N/A | |
| Towards a Unified Theoretical Understanding of Non-contrastive Learning via Rank Differential Mechanism | Unknown | N/A | |
| Tailoring Language Generation Models under Total Variation Distance | Unknown | N/A | |
| H2RBox: Horizontal Box Annotation is All You Need for Oriented Object Detection | Unknown | N/A | |
| The KFIoU Loss for Rotated Object Detection | Unknown | N/A | |
| Systematic Rectification of Language Models via Dead-end Analysis | Unknown | N/A | |
| Anisotropic Message Passing: Graph Neural Networks with Directional and Long-Range Interactions | Unknown | N/A | |
| SYNC: Safety-Aware Neural Control for Stabilizing Stochastic Delay-Differential Equations | Unknown | N/A | |
| PatchDCT: Patch Refinement for High Quality Instance Segmentation | Unknown | N/A | |
| A Message Passing Perspective on Learning Dynamics of Contrastive Learning | Unknown | N/A | |
| A Model or 603 Exemplars: Towards Memory-Efficient Class-Incremental Learning | Unknown | N/A | |
| Improved Convergence of Differential Private SGD with Gradient Clipping | Unknown | N/A | |
| Toeplitz Neural Network for Sequence Modeling | Unknown | N/A | |
| Sub-Task Decomposition Enables Learning in Sequence to Sequence Tasks | Unknown | N/A | |
| On the Word Boundaries of Emergent Languages Based on Harris's Articulation Scheme | Unknown | N/A | |
| Effective Self-supervised Pre-training on Low-compute Networks without Distillation | Unknown | N/A | |
| On the Soft-Subnetwork for Few-Shot Class Incremental Learning | Unknown | N/A | |
| TiAda: A Time-scale Adaptive Algorithm for Nonconvex Minimax Optimization | Unknown | N/A | |
| Voint Cloud: Multi-View Point Cloud Representation for 3D Understanding | Unknown | N/A | |
| Approximate Nearest Neighbor Search through Modern Error-Correcting Codes | Unknown | N/A | |
| Boosting the Cycle Counting Power of Graph Neural Networks with I$^2$-GNNs | Unknown | N/A | |
| Offline Reinforcement Learning with Differentiable Function Approximation is Provably Efficient | Unknown | N/A | |
| Dense RGB SLAM with Neural Implicit Maps | Unknown | N/A | |
| Monocular Scene Reconstruction with 3D SDF Transformers | Unknown | N/A | |
| Learning Heterogeneous Interaction Strengths by Trajectory Prediction with Graph Neural Network | Unknown | N/A | |
| Online Low Rank Matrix Completion | Unknown | N/A | |
| Robust Fair Clustering: A Novel Fairness Attack and Defense Framework | Unknown | N/A | |
| Adaptive Budget Allocation for Parameter-Efficient Fine-Tuning | Unknown | N/A | |
| Generalize Learned Heuristics to Solve Large-scale Vehicle Routing Problems in Real-time | Unknown | N/A | |
| On the complexity of nonsmooth automatic differentiation | Unknown | N/A | |
| DaxBench: Benchmarking Deformable Object Manipulation with Differentiable Physics | Unknown | N/A | |
| Exploring perceptual straightness in learned visual representations | Unknown | N/A | |
| CO3: Cooperative Unsupervised 3D Representation Learning for Autonomous Driving | Unknown | N/A | |
| Understanding the Role of Nonlinearity in Training Dynamics of Contrastive Learning | Unknown | N/A | |
| Bag of Tricks for Unsupervised Text-to-Speech | Unknown | N/A | |
| Information-Theoretic Characterization of the Generalization Error for Iterative Semi-Supervised Learning | Unknown | N/A | |
| Composing Task Knowledge With Modular Successor Feature Approximators | Unknown | N/A | |
| Advancing Radiograph Representation Learning with Masked Record Modeling | Unknown | N/A | |
| Verifying the Union of Manifolds Hypothesis for Image Data | Unknown | N/A | |
| GAMR: A Guided Attention Model for (visual) Reasoning | Unknown | N/A | |
| Simple initialization and parametrization of sinusoidal networks via their kernel bandwidth | Unknown | N/A | |
| Energy-Based Test Sample Adaptation for Domain Generalization | Unknown | N/A | |
| On the Saturation Effect of Kernel Ridge Regression | Unknown | N/A | |
| Re-parameterizing Your Optimizers rather than Architectures | Unknown | N/A | |
| PLOT: Prompt Learning with Optimal Transport for Vision-Language Models | Unknown | N/A | |
| Linearly Mapping from Image to Text Space | Unknown | N/A | |
| Protein Representation Learning via Knowledge Enhanced Primary Structure Reasoning | Unknown | N/A | |
| The Provable Benefit of Unsupervised Data Sharing for Offline Reinforcement Learning | Unknown | N/A | |
| HomoDistil: Homotopic Task-Agnostic Distillation of Pre-trained Transformers | Unknown | N/A | |
| Exploring Active 3D Object Detection from a Generalization Perspective | Unknown | N/A | |
| Gradient Gating for Deep Multi-Rate Learning on Graphs | Unknown | N/A | |
| Diffusion Models Already Have A Semantic Latent Space | Unknown | N/A | |
| Masked Image Modeling with Denoising Contrast | Unknown | N/A | |
| GoBigger: A Scalable Platform for Cooperative-Competitive Multi-Agent Interactive Simulation | Unknown | N/A | |
| Contrastive Learning for Unsupervised Domain Adaptation of Time Series | Unknown | N/A | |
| TILP: Differentiable Learning of Temporal Logical Rules on Knowledge Graphs | Unknown | N/A | |
| Masked Unsupervised Self-training for Label-free Image Classification | Unknown | N/A | |
| Learning the Positions in CountSketch | Unknown | N/A | |
| Deep Ensembles for Graphs with Higher-order Dependencies | Unknown | N/A | |
| Globally Optimal Training of Neural Networks with Threshold Activation Functions | Unknown | N/A | |
| Sound Randomized Smoothing in Floating-Point Arithmetic | Unknown | N/A | |
| Partially Observable RL with B-Stability: Unified Structural Condition and Sharp Sample-Efficient Algorithms | Unknown | N/A | |
| Out-of-distribution Representation Learning for Time Series Classification | Unknown | N/A | |
| AnyDA: Anytime Domain Adaptation | Unknown | N/A | |
| Koopman Neural Operator Forecaster for Time-series with Temporal Distributional Shifts | Unknown | N/A | |
| Deep Generative Modeling on Limited Data with Regularization by Nontransferable Pre-trained Models | Unknown | N/A | |
| Sampling with Mollified Interaction Energy Descent | Unknown | N/A | |
| Multimodal Federated Learning via Contrastive Representation Ensemble | Unknown | N/A | |
| Leveraging Importance Weights in Subset Selection | Unknown | N/A | |
| Can CNNs Be More Robust Than Transformers? | Unknown | N/A | |
| Optimistic Exploration with Learned Features Provably Solves Markov Decision Processes with Neural Dynamics | Unknown | N/A | |
| Trainable Weight Averaging: Efficient Training by Optimizing Historical Solutions | Unknown | N/A | |
| Causal Imitation Learning via Inverse Reinforcement Learning | Unknown | N/A | |
| FIT: A Metric for Model Sensitivity | Unknown | N/A | |
| Pushing the Accuracy-Group Robustness Frontier with Introspective Self-play | Unknown | N/A | |
| Progressively Compressed Auto-Encoder for Self-supervised Representation Learning | Unknown | N/A | |
| Understanding Edge-of-Stability Training Dynamics with a Minimalist Example | Unknown | N/A | |
| Learning to Decompose Visual Features with Latent Textual Prompts | Unknown | N/A | |
| Dual Diffusion Implicit Bridges for Image-to-Image Translation | Unknown | N/A | |
| Calibrating Sequence likelihood Improves Conditional Language Generation | Unknown | N/A | |
| Learning Proximal Operators to Discover Multiple Optima | Unknown | N/A | |
| REPAIR: REnormalizing Permuted Activations for Interpolation Repair | Unknown | N/A | |
| Neural Radiance Field Codebooks | Unknown | N/A | |
| Discrete Contrastive Diffusion for Cross-Modal Music and Image Generation | Unknown | N/A | |
| Diffusion Probabilistic Modeling of Protein Backbones in 3D for the motif-scaffolding problem | Unknown | N/A | |
| ISAAC Newton: Input-based Approximate Curvature for Newton's Method | Unknown | N/A | |
| Revisiting adapters with adversarial training | Unknown | N/A | |
| Heterogeneous Neuronal and Synaptic Dynamics for Spike-Efficient Unsupervised Learning: Theory and Design Principles | Unknown | N/A | |
| Rethinking Graph Lottery Tickets: Graph Sparsity Matters | Unknown | N/A | |
| Private Federated Learning Without a Trusted Server: Optimal Algorithms for Convex Losses | Unknown | N/A | |
| Unicom: Universal and Compact Representation Learning for Image Retrieval | Unknown | N/A | |
| Zero-Shot Image Restoration Using Denoising Diffusion Null-Space Model | Unknown | N/A | |
| Symbolic Physics Learner: Discovering governing equations via Monte Carlo tree search | Unknown | N/A | |
| Flow Annealed Importance Sampling Bootstrap | Unknown | N/A | |
| Calibration Matters: Tackling Maximization Bias in Large-scale Advertising Recommendation Systems | Unknown | N/A | |
| Replicable Bandits | Unknown | N/A | |
| Eva: Practical Second-order Optimization with Kronecker-vectorized Approximation | Unknown | N/A | |
| On the Robustness of Safe Reinforcement Learning under Observational Perturbations | Unknown | N/A | |
| Image as Set of Points | Unknown | N/A | |
| Trainability Preserving Neural Pruning | Unknown | N/A | |
| Graph Neural Networks for Link Prediction with Subgraph Sketching | Unknown | N/A | |
| Learning Locality and Isotropy in Dialogue Modeling | Unknown | N/A | |
| A Unified Algebraic Perspective on Lipschitz Neural Networks | Unknown | N/A | |
| Information-Theoretic Diffusion | Unknown | N/A | |
| Evaluating Long-Term Memory in 3D Mazes | Unknown | N/A | |
| Contrastive Meta-Learning for Partially Observable Few-Shot Learning | Unknown | N/A | |
| Hebbian and Gradient-based Plasticity Enables Robust Memory and Rapid Learning in RNNs | Unknown | N/A | |
| Proactive Multi-Camera Collaboration for 3D Human Pose Estimation | Unknown | N/A | |
| Promptagator: Few-shot Dense Retrieval From 8 Examples | Unknown | N/A | |
| Backstepping Temporal Difference Learning | Unknown | N/A | |
| Human MotionFormer: Transferring Human Motions with Vision Transformers | Unknown | N/A | |
| Memorization-Dilation: Modeling Neural Collapse Under Noise | Unknown | N/A | |
| On Representing Mixed-Integer Linear Programs by Graph Neural Networks | Unknown | N/A | |
| Revisiting Robustness in Graph Machine Learning | Unknown | N/A | |
| Parallel Deep Neural Networks Have Zero Duality Gap | Unknown | N/A | |
| Reward Design with Language Models | Unknown | N/A | |
| Contrastive Corpus Attribution for Explaining Representations | Unknown | N/A | |
| Multi-domain image generation and translation with identifiability guarantees | Unknown | N/A | |
| Continual evaluation for lifelong learning: Identifying the stability gap | Unknown | N/A | |
| Can We Faithfully Represent Absence States to Compute Shapley Values on a DNN? | Unknown | N/A | |
| Dataless Knowledge Fusion by Merging Weights of Language Models | Unknown | N/A | |
| Long Range Language Modeling via Gated State Spaces | Unknown | N/A | |
| Transformer Meets Boundary Value Inverse Problems | Unknown | N/A | |
| Masked Distillation with Receptive Tokens | Unknown | N/A | |
| Universal Vision-Language Dense Retrieval: Learning A Unified Representation Space for Multi-Modal Retrieval | Unknown | N/A | |
| TextShield: Beyond Successfully Detecting Adversarial Sentences in text classification | Unknown | N/A | |
| Meta-learning Adaptive Deep Kernel Gaussian Processes for Molecular Property Prediction | Unknown | N/A | |
| Sparse Random Networks for Communication-Efficient Federated Learning | Unknown | N/A | |
| MetaGL: Evaluation-Free Selection of Graph Learning Models via Meta-Learning | Unknown | N/A | |
| Scalable Batch-Mode Deep Bayesian Active Learning via Equivalence Class Annealing | Unknown | N/A | |
| Spatial Attention Kinetic Networks with E(n)-Equivariance | Unknown | N/A | |
| Distributed Differential Privacy in Multi-Armed Bandits | Unknown | N/A | |
| Coverage-centric Coreset Selection for High Pruning Rates | Unknown | N/A | |
| Achieving Sub-linear Regret in Infinite Horizon Average Reward Constrained MDP with Linear Function Approximation | Unknown | N/A | |
| Encoding Recurrence into Transformers | Unknown | N/A | |
| Learning Hyper Label Model for Programmatic Weak Supervision | Unknown | N/A | |
| FedSpeed: Larger Local Interval, Less Communication Round, and Higher Generalization Accuracy | Unknown | N/A | |
| CLARE: Conservative Model-Based Reward Learning for Offline Inverse Reinforcement Learning | Unknown | N/A | |
| Generalized Precision Matrix for Scalable Estimation of Nonparametric Markov Networks | Unknown | N/A | |
| Learning to Jointly Share and Prune Weights for Grounding Based Vision and Language Models | Unknown | N/A | |
| Domain Generalisation via Domain Adaptation: An Adversarial Fourier Amplitude Approach | Unknown | N/A | |
| GReTo: Remedying dynamic graph topology-task discordance via target homophily | Unknown | N/A | |
| Everybody Needs Good Neighbours: An Unsupervised Locality-based Method for Bias Mitigation | Unknown | N/A | |
| Combating Exacerbated Heterogeneity for Robust Models in Federated Learning | Unknown | N/A | |
| Improving Deep Regression with Ordinal Entropy | Unknown | N/A | |
| Language Modelling with Pixels | Unknown | N/A | |
| Bitrate-Constrained DRO: Beyond Worst Case Robustness To Unknown Group Shifts | Unknown | N/A | |
| Neural Architecture Design and Robustness: A Dataset | Unknown | N/A | |
| Is Adversarial Training Really a Silver Bullet for Mitigating Data Poisoning? | Unknown | N/A | |
| Policy Pre-training for Autonomous Driving via Self-supervised Geometric Modeling | Unknown | N/A | |
| AutoTransfer: AutoML with Knowledge Transfer - An Application to Graph Neural Networks | Unknown | N/A | |
| The Asymmetric Maximum Margin Bias of Quasi-Homogeneous Neural Networks | Unknown | N/A | |
| Decentralized Optimistic Hyperpolicy Mirror Descent: Provably No-Regret Learning in Markov Games | Unknown | N/A | |
| Exploring the Limits of Differentially Private Deep Learning with Group-wise Clipping | Unknown | N/A | |
| HypeR: Multitask Hyper-Prompted Training Enables Large-Scale Retrieval Generalization | Unknown | N/A | |
| Learning without Prejudices: Continual Unbiased Learning via Benign and Malignant Forgetting | Unknown | N/A | |
| Selective Annotation Makes Language Models Better Few-Shot Learners | Unknown | N/A | |
| Switch-NeRF: Learning Scene Decomposition with Mixture of Experts for Large-scale Neural Radiance Fields | Unknown | N/A | |
| Scaling Forward Gradient With Local Losses | Unknown | N/A | |
| NORM: Knowledge Distillation via N-to-One Representation Matching | Unknown | N/A | |
| Critic Sequential Monte Carlo | Unknown | N/A | |
| (Certified!!) Adversarial Robustness for Free! | Unknown | N/A | |
| Measuring Forgetting of Memorized Training Examples | Unknown | N/A | |
| Multi-lingual Evaluation of Code Generation Models | Unknown | N/A | |
| wav2tok: Deep Sequence Tokenizer for Audio Retrieval | Unknown | N/A | |
| Deep Learning meets Nonparametric Regression: Are Weight-Decayed DNNs Locally Adaptive? | Unknown | N/A | |
| Robust Active Distillation | Unknown | N/A | |
| SeaFormer: Squeeze-enhanced Axial Transformer for Mobile Semantic Segmentation | Unknown | N/A | |
| Joint Edge-Model Sparse Learning is Provably Efficient for Graph Neural Networks | Unknown | N/A | |
| Near-optimal Policy Identification in Active Reinforcement Learning | Unknown | N/A | |
| Spherical Sliced-Wasserstein | Unknown | N/A | |
| InPL: Pseudo-labeling the Inliers First for Imbalanced Semi-supervised Learning | Unknown | N/A | |
| Subquadratic Algorithms for Kernel Matrices via Kernel Density Estimation | Unknown | N/A | |
| MixPro: Data Augmentation with MaskMix and Progressive Attention Labeling for Vision Transformer | Unknown | N/A | |
| Linear Convergence of Natural Policy Gradient Methods with Log-Linear Policies | Unknown | N/A | |
| SP2: A Second Order Stochastic Polyak Method | Unknown | N/A | |
| CLIP-Dissect: Automatic Description of Neuron Representations in Deep Vision Networks | Unknown | N/A | |
| Continual Transformers: Redundancy-Free Attention for Online Inference | Unknown | N/A | |
| Dirichlet-based Uncertainty Calibration for Active Domain Adaptation | Unknown | N/A | |
| Accurate Image Restoration with Attention Retractable Transformer | Unknown | N/A | |
| Predicting Cellular Responses with Variational Causal Inference and Refined Relational Information | Unknown | N/A | |
| Self-Supervised Set Representation Learning for Unsupervised Meta-Learning | Unknown | N/A | |
| Causal Representation Learning for Instantaneous and Temporal Effects in Interactive Systems | Unknown | N/A | |
| Visual Imitation Learning with Patch Rewards | Unknown | N/A | |
| CodeT: Code Generation with Generated Tests | Unknown | N/A | |
| Learning to Generate Columns with Application to Vertex Coloring | Unknown | N/A | |
| Interaction-Based Disentanglement of Entities for Object-Centric World Models | Unknown | N/A | |
| Learning where and when to reason in neuro-symbolic inference | Unknown | N/A | |
| On the Usefulness of Embeddings, Clusters and Strings for Text Generation Evaluation | Unknown | N/A | |
| StrucTexTv2: Masked Visual-Textual Prediction for Document Image Pre-training | Unknown | N/A | |
| Plateau in Monotonic Linear Interpolation --- A "Biased" View of Loss Landscape for Deep Networks | Unknown | N/A | |
| Improving Deep Policy Gradients with Value Function Search | Unknown | N/A | |
| Depth Separation with Multilayer Mean-Field Networks | Unknown | N/A | |
| Crossformer: Transformer Utilizing Cross-Dimension Dependency for Multivariate Time Series Forecasting | Unknown | N/A | |
| Individual Privacy Accounting with Gaussian Differential Privacy | Unknown | N/A | |
| Non-parametric Outlier Synthesis | Unknown | N/A | |
| General Neural Gauge Fields | Unknown | N/A | |
| Generate rather than Retrieve: Large Language Models are Strong Context Generators | Unknown | N/A | |
| Discovering Informative and Robust Positives for Video Domain Adaptation | Unknown | N/A | |
| Understanding Why Generalized Reweighting Does Not Improve Over ERM | Unknown | N/A | |
| Gradient-Guided Importance Sampling for Learning Binary Energy-Based Models | Unknown | N/A | |
| Neural Lagrangian Schrödinger Bridge: Diffusion Modeling for Population Dynamics | Unknown | N/A | |
| Is Conditional Generative Modeling all you need for Decision Making? | Unknown | N/A | |
| Fair Attribute Completion on Graph with Missing Attributes | Unknown | N/A | |
| Planning with Sequence Models through Iterative Energy Minimization | Unknown | N/A | |
| Composing Ensembles of Pre-trained Models via Iterative Consensus | Unknown | N/A | |
| Deep Ranking Ensembles for Hyperparameter Optimization | Unknown | N/A | |
| Robustness to corruption in pre-trained Bayesian neural networks | Unknown | N/A | |
| Neural Groundplans: Persistent Neural Scene Representations from a Single Image | Unknown | N/A | |
| Improved Training of Physics-Informed Neural Networks Using Energy-Based Priors: a Study on Electrical Impedance Tomography | Unknown | N/A | |
| Disentanglement with Biological Constraints: A Theory of Functional Cell Types | Unknown | N/A | |
| ERL-Re$^2$: Efficient Evolutionary Reinforcement Learning with Shared State Representation and Individual Policy Representation | Unknown | N/A | |
| Towards Understanding Why Mask Reconstruction Pretraining Helps in Downstream Tasks | Unknown | N/A | |
| Continual Unsupervised Disentangling of Self-Organizing Representations | Unknown | N/A | |
| Accelerating Guided Diffusion Sampling with Splitting Numerical Methods | Unknown | N/A | |
| Thalamus: a brain-inspired algorithm for biologically-plausible continual learning and disentangled representations | Unknown | N/A | |
| LiftedCL: Lifting Contrastive Learning for Human-Centric Perception | Unknown | N/A | |
| SIMPLE: A Gradient Estimator for k-Subset Sampling | Unknown | N/A | |
| Modeling content creator incentives on algorithm-curated platforms | Unknown | N/A | |
| Deep Variational Implicit Processes | Unknown | N/A | |
| Estimating individual treatment effects under unobserved confounding using binary instruments | Unknown | N/A | |
| Approximate Bayesian Inference with Stein Functional Variational Gradient Descent | Unknown | N/A | |
| Distributional Meta-Gradient Reinforcement Learning | Unknown | N/A | |
| Learning to Linearize Deep Neural Networks for Secure and Efficient Private Inference | Unknown | N/A | |
| Denoising Diffusion Error Correction Codes | Unknown | N/A | |
| Temperature Schedules for self-supervised contrastive methods on long-tail data | Unknown | N/A | |
| Meta Knowledge Condensation for Federated Learning | Unknown | N/A | |
| ManiSkill2: A Unified Benchmark for Generalizable Manipulation Skills | Unknown | N/A | |
| Dynamic Prompt Learning via Policy Gradient for Semi-structured Mathematical Reasoning | Unknown | N/A | |
| Neuro-Symbolic Procedural Planning with Commonsense Prompting | Unknown | N/A | |
| Learning Object-Language Alignments for Open-Vocabulary Object Detection | Unknown | N/A | |
| Time to augment self-supervised visual representation learning | Unknown | N/A | |
| Learning Sparse and Low-Rank Priors for Image Recovery via Iterative Reweighted Least Squares Minimization | Unknown | N/A | |
| Chasing All-Round Graph Representation Robustness: Model, Training, and Optimization | Unknown | N/A | |
| Personalized Federated Learning with Feature Alignment and Classifier Collaboration | Unknown | N/A | |
| Adversarial Attacks on Adversarial Bandits | Unknown | N/A | |
| Offline Reinforcement Learning via High-Fidelity Generative Behavior Modeling | Unknown | N/A | |
| Improving Object-centric Learning with Query Optimization | Unknown | N/A | |
| Phase transition for detecting a small community in a large network | Unknown | N/A | |
| Bi-level Physics-Informed Neural Networks for PDE Constrained Optimization using Broyden's Hypergradients | Unknown | N/A | |
| DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection | Unknown | N/A | |
| Noise Is Not the Main Factor Behind the Gap Between SGD and Adam on Transformers, But Sign Descent Might Be | Unknown | N/A | |
| Transformer-Patcher: One Mistake Worth One Neuron | Unknown | N/A | |
| Are More Layers Beneficial to Graph Transformers? | Unknown | N/A | |
| Searching Lottery Tickets in Graph Neural Networks: A Dual Perspective | Unknown | N/A | |
| Improving Out-of-distribution Generalization with Indirection Representations | Unknown | N/A | |
| Bort: Towards Explainable Neural Networks with Bounded Orthogonal Constraint | Unknown | N/A | |
| The Power of Regularization in Solving Extensive-Form Games | Unknown | N/A | |
| S-NeRF: Neural Radiance Fields for Street Views | Unknown | N/A | |
| Cycle-consistent Masked AutoEncoder for Unsupervised Domain Generalization | Unknown | N/A | |
| CFlowNets: Continuous Control with Generative Flow Networks | Unknown | N/A | |
| Limitless Stability for Graph Convolutional Networks | Unknown | N/A | |
| DBQ-SSD: Dynamic Ball Query for Efficient 3D Object Detection | Unknown | N/A | |
| Differentiable Gaussianization Layers for Inverse Problems Regularized by Deep Generative Models | Unknown | N/A | |
| Exploring Low-Rank Property in Multiple Instance Learning for Whole Slide Image Classification | Unknown | N/A | |
| Breaking Correlation Shift via Conditional Invariant Regularizer | Unknown | N/A | |
| Towards One-shot Neural Combinatorial Solvers: Theoretical and Empirical Notes on the Cardinality-Constrained Case | Unknown | N/A | |
| In-context Reinforcement Learning with Algorithm Distillation | Unknown | N/A | |
| Your Contrastive Learning Is Secretly Doing Stochastic Neighbor Embedding | Unknown | N/A | |
| Divide to Adapt: Mitigating Confirmation Bias for Domain Adaptation of Black-Box Predictors | Unknown | N/A | |
| Block and Subword-Scaling Floating-Point (BSFP): An Efficient Non-Uniform Quantization For Low Precision Inference | Unknown | N/A | |
| Semi-supervised Community Detection via Structural Similarity Metrics | Unknown | N/A | |
| Learning Symbolic Models for Graph-structured Physical Mechanism | Unknown | N/A | |
| DDM$^2$: Self-Supervised Diffusion MRI Denoising with Generative Diffusion Models | Unknown | N/A | |
| FiT: Parameter Efficient Few-shot Transfer Learning for Personalized and Federated Image Classification | Unknown | N/A | |
| Hard-Meta-Dataset++: Towards Understanding Few-Shot Performance on Difficult Tasks | Unknown | N/A | |
| Learning Zero-Shot Cooperation with Humans, Assuming Humans Are Biased | Unknown | N/A | |
| Multivariate Time-series Imputation with Disentangled Temporal Representations | Unknown | N/A | |
| Evidential Uncertainty and Diversity Guided Active Learning for Scene Graph Generation | Unknown | N/A | |
| Automating Nearest Neighbor Search Configuration with Constrained Optimization | Unknown | N/A | |
| OTOv2: Automatic, Generic, User-Friendly | Unknown | N/A | |
| Unified Discrete Diffusion for Simultaneous Vision-Language Generation | Unknown | N/A | |
| Win: Weight-Decay-Integrated Nesterov Acceleration for Adaptive Gradient Algorithms | Unknown | N/A | |
| Self-Distillation for Further Pre-training of Transformers | Unknown | N/A | |
| Statistical Inference for Fisher Market Equilibrium | Unknown | N/A | |
| Visual Recognition with Deep Nearest Centroids | Unknown | N/A | |
| LPT: Long-tailed Prompt Tuning for Image Classification | Unknown | N/A | |
| DamoFD: Digging into Backbone Design on Face Detection | Unknown | N/A | |
| Prompting GPT-3 To Be Reliable | Unknown | N/A | |
| Explicit Box Detection Unifies End-to-End Multi-Person Pose Estimation | Unknown | N/A | |
| Spikformer: When Spiking Neural Network Meets Transformer | Unknown | N/A | |
| Multimodal Analogical Reasoning over Knowledge Graphs | Unknown | N/A | |
| Voxurf: Voxel-based Efficient and Accurate Neural Surface Reconstruction | Unknown | N/A | |
| Conditional Positional Encodings for Vision Transformers | Unknown | N/A | |
| Guarded Policy Optimization with Imperfect Online Demonstrations | Unknown | N/A | |
| Contrastive Learning Can Find An Optimal Basis For Approximately View-Invariant Functions | Unknown | N/A | |
| Revisiting the Entropy Semiring for Neural Speech Recognition | Unknown | N/A | |
| Rethinking skip connection model as a learnable Markov chain | Unknown | N/A | |
| Measuring axiomatic soundness of counterfactual image models | Unknown | N/A | |
| Alternating Differentiation for Optimization Layers | Unknown | N/A | |
| Out-of-distribution Detection with Implicit Outlier Transformation | Unknown | N/A | |
| Extracting Robust Models with Uncertain Examples | Unknown | N/A | |
| Stochastic Differentially Private and Fair Learning | Unknown | N/A | |
| Volumetric Optimal Transportation by Fast Fourier Transform | Unknown | N/A | |
| Hierarchical Relational Learning for Few-Shot Knowledge Graph Completion | Unknown | N/A | |
| The Devil is in the Wrongly-classified Samples: Towards Unified Open-set Recognition | Unknown | N/A | |
| MCAL: Minimum Cost Human-Machine Active Labeling | Unknown | N/A | |
| Learnable Topological Features For Phylogenetic Inference via Graph Neural Networks | Unknown | N/A | |
| Rotamer Density Estimator is an Unsupervised Learner of the Effect of Mutations on Protein-Protein Interaction | Unknown | N/A | |
| Bit-Pruning: A Sparse Multiplication-Less Dot-Product | Unknown | N/A | |
| A Stable and Scalable Method for Solving Initial Value PDEs with Neural Networks | Unknown | N/A | |
| Is Synthetic Data from Generative Models Ready for Image Recognition? | Unknown | N/A | |
| Learning Domain-Agnostic Representation for Disease Diagnosis | Unknown | N/A | |
| BEVDistill: Cross-Modal BEV Distillation for Multi-View 3D Object Detection | Unknown | N/A | |
| A Multi-Grained Self-Interpretable Symbolic-Neural Model For Single/Multi-Labeled Text Classification | Unknown | N/A | |
| Suppressing the Heterogeneity: A Strong Feature Extractor for Few-shot Segmentation | Unknown | N/A | |
| Achieve the Minimum Width of Neural Networks for Universal Approximation | Unknown | N/A | |
| UniKGQA: Unified Retrieval and Reasoning for Solving Multi-hop Question Answering Over Knowledge Graph | Unknown | N/A | |
| On amortizing convex conjugates for optimal transport | Unknown | N/A | |
| Discovering Evolution Strategies via Meta-Black-Box Optimization | Unknown | N/A | |
| DualAfford: Learning Collaborative Visual Affordance for Dual-gripper Manipulation | Unknown | N/A | |
| SIMPLE: Specialized Model-Sample Matching for Domain Generalization | Unknown | N/A | |
| Contextual Image Masking Modeling via Synergized Contrasting without View Augmentation for Faster and Better Visual Pretraining | Unknown | N/A | |
| TDR-CL: Targeted Doubly Robust Collaborative Learning for Debiased Recommendations | Unknown | N/A | |
| Patch-Level Contrasting without Patch Correspondence for Accurate and Dense Contrastive Representation Learning | Unknown | N/A | |
| Continuous-Discrete Convolution for Geometry-Sequence Modeling in Proteins | Unknown | N/A | |
| Phenaki: Variable Length Video Generation from Open Domain Textual Descriptions | Unknown | N/A | |
| Human-level Atari 200x faster | Unknown | N/A | |
| Least-to-Most Prompting Enables Complex Reasoning in Large Language Models | Unknown | N/A | |
| Molecule Generation For Target Protein Binding with Structural Motifs | Unknown | N/A | |
| Neural Collapse Inspired Feature-Classifier Alignment for Few-Shot Class-Incremental Learning | Unknown | N/A | |
| Towards Robust Object Detection Invariant to Real-World Domain Shifts | Unknown | N/A | |
| Generating Diverse Cooperative Agents by Learning Incompatible Policies | Unknown | N/A | |
| Information-Theoretic Analysis of Unsupervised Domain Adaptation | Unknown | N/A | |
| Effects of Graph Convolutions in Multi-layer Networks | Unknown | N/A | |
| Diffusion Posterior Sampling for General Noisy Inverse Problems | Unknown | N/A | |
| Ask Me Anything: A simple strategy for prompting language models | Unknown | N/A | |
| Multi-skill Mobile Manipulation for Object Rearrangement | Unknown | N/A | |
| Post-hoc Concept Bottleneck Models | Unknown | N/A | |
| Neural Image-based Avatars: Generalizable Radiance Fields for Human Avatar Modeling | Unknown | N/A | |
| Corrupted Image Modeling for Self-Supervised Visual Pre-Training | Unknown | N/A | |
| Deep Learning on Implicit Neural Representations of Shapes | Unknown | N/A | |
| Socratic Models: Composing Zero-Shot Multimodal Reasoning with Language | Unknown | N/A | |
| Clean-image Backdoor: Attacking Multi-label Models with Poisoned Labels Only | Unknown | N/A | |
| Neural DAG Scheduling via One-Shot Priority Sampling | Unknown | N/A | |
| Efficient recurrent architectures through activity sparsity and sparse back-propagation through time | Unknown | N/A | |
| Provably Auditing Ordinary Least Squares in Low Dimensions | Unknown | N/A | |
| On Accelerated Perceptrons and Beyond | Unknown | N/A | |
| EVA3D: Compositional 3D Human Generation from 2D Image Collections | Unknown | N/A | |
| Single-shot General Hyper-parameter Optimization for Federated Learning | Unknown | N/A | |
| DocPrompting: Generating Code by Retrieving the Docs | Unknown | N/A | |
| The Surprising Effectiveness of Equivariant Models in Domains with Latent Symmetry | Unknown | N/A | |
| Fooling SHAP with Stealthily Biased Sampling | Unknown | N/A | |
| View Synthesis with Sculpted Neural Points | Unknown | N/A | |
| On Pre-training Language Model for Antibody | Unknown | N/A | |
| Represent to Control Partially Observed Systems: Representation Learning with Provable Sample Efficiency | Unknown | N/A | |
| Efficient Attention via Control Variates | Unknown | N/A | |
| CUDA: Curriculum of Data Augmentation for Long-tailed Recognition | Unknown | N/A | |
| Spacetime Representation Learning | Unknown | N/A | |
| Semantic Uncertainty: Linguistic Invariances for Uncertainty Estimation in Natural Language Generation | Unknown | N/A | |
| DiffDock: Diffusion Steps, Twists, and Turns for Molecular Docking | Unknown | N/A | |
| Mind's Eye: Grounded Language Model Reasoning through Simulation | Unknown | N/A | |
| Code Translation with Compiler Representations | Unknown | N/A | |
| Learnable Behavior Control: Breaking Atari Human World Records via Sample-Efficient Behavior Selection | Unknown | N/A | |
| Phase2vec: dynamical systems embedding with a physics-informed convolutional network | Unknown | N/A | |
| GPViT: A High Resolution Non-Hierarchical Vision Transformer with Group Propagation | Unknown | N/A | |
| Flow Straight and Fast: Learning to Generate and Transfer Data with Rectified Flow | Unknown | N/A | |
| Learning on Large-scale Text-attributed Graphs via Variational Inference | Unknown | N/A | |
| Metadata Archaeology: Unearthing Data Subsets by Leveraging Training Dynamics | Unknown | N/A | |
| Theoretical Characterization of the Generalization Performance of Overfitted Meta-Learning | Unknown | N/A | |
| StableDR: Stabilized Doubly Robust Learning for Recommendation on Data Missing Not at Random | Unknown | N/A | |
| ACMP: Allen-Cahn Message Passing with Attractive and Repulsive Forces for Graph Neural Networks | Unknown | N/A | |
| Fundamental Limits in Formal Verification of Message-Passing Neural Networks | Unknown | N/A | |
| Generative Modelling with Inverse Heat Dissipation | Unknown | N/A | |
| Improving the imputation of missing data with Markov Blanket discovery | Unknown | N/A | |
| Classically Approximating Variational Quantum Machine Learning with Random Fourier Features | Unknown | N/A | |
| Evolve Smoothly, Fit Consistently: Learning Smooth Latent Dynamics For Advection-Dominated Systems | Unknown | N/A | |
| Unmasking the Lottery Ticket Hypothesis: What's Encoded in a Winning Ticket's Mask? | Unknown | N/A | |
| Universal Few-shot Learning of Dense Prediction Tasks with Visual Token Matching | Unknown | N/A | |
| Is Reinforcement Learning (Not) for Natural Language Processing: Benchmarks, Baselines, and Building Blocks for Natural Language Policy Optimization | Unknown | N/A | |
| Sample-Efficient Reinforcement Learning by Breaking the Replay Ratio Barrier | Unknown | N/A | |
| Recitation-Augmented Language Models | Unknown | N/A | |
| PowerQuant: Automorphism Search for Non-Uniform Quantization | Unknown | N/A | |
| Powderworld: A Platform for Understanding Generalization via Rich Task Distributions | Unknown | N/A | |
| Fisher-Legendre (FishLeg) optimization of deep neural networks | Unknown | N/A | |
| Pseudo-label Training and Model Inertia in Neural Machine Translation | Unknown | N/A | |
| Choreographer: Learning and Adapting Skills in Imagination | Unknown | N/A | |
| Blurring Diffusion Models | Unknown | N/A | |
| MultiViz: Towards Visualizing and Understanding Multimodal Models | Unknown | N/A | |
| LAVA: Data Valuation without Pre-Specified Learning Algorithms | Unknown | N/A | |
| Modeling the Data-Generating Process is Necessary for Out-of-Distribution Generalization | Unknown | N/A | |
| Neuromechanical Autoencoders: Learning to Couple Elastic and Neural Network Nonlinearity | Unknown | N/A | |
| NeRN: Learning Neural Representations for Neural Networks | Unknown | N/A | |
| Proposal-Contrastive Pretraining for Object Detection from Fewer Data | Unknown | N/A | |
| PAC Reinforcement Learning for Predictive State Representations | Unknown | N/A | |
| Unsupervised Manifold Alignment with Joint Multidimensional Scaling | Unknown | N/A | |
| Sampling is as easy as learning the score: theory for diffusion models with minimal data assumptions | Unknown | N/A | |
| Conditional Antibody Design as 3D Equivariant Graph Translation | Unknown | N/A | |
| From Play to Policy: Conditional Behavior Generation from Uncurated Robot Data | Unknown | N/A | |
| A CMDP-within-online framework for Meta-Safe Reinforcement Learning | Unknown | N/A | |
| Relational Attention: Generalizing Transformers for Graph-Structured Tasks | Unknown | N/A | |
| Making Better Decision by Directly Planning in Continuous Control | Unknown | N/A | |
| Training language models to summarize narratives improves brain alignment | Unknown | N/A | |
| Rarity Score: A New Metric to Evaluate the Uncommonness of Synthesized Images | Unknown | N/A | |
| Neuroevolution is a Competitive Alternative to Reinforcement Learning for Skill Discovery | Unknown | N/A | |
| MARS: Meta-learning as Score Matching in the Function Space | Unknown | N/A | |
| BALTO: fast tensor program optimization with diversity-based active learning | Unknown | N/A | |
| Pseudoinverse-Guided Diffusion Models for Inverse Problems | Unknown | N/A | |
| Sparse MoE as the New Dropout: Scaling Dense and Self-Slimmable Transformers | Unknown | N/A | |
| HiViT: A Simpler and More Efficient Design of Hierarchical Vision Transformer | Unknown | N/A | |
| Moving Forward by Moving Backward: Embedding Action Impact over Action Semantics | Unknown | N/A | |
| Spatio-temporal point processes with deep non-stationary kernels | Unknown | N/A | |
| Modeling Sequential Sentence Relation to Improve Cross-lingual Dense Retrieval | Unknown | N/A | |
| Learning a Data-Driven Policy Network for Pre-Training Automated Feature Engineering | Unknown | N/A | |
| $O(T^{-1})$ Convergence of Optimistic-Follow-the-Regularized-Leader in Two-Player Zero-Sum Markov Games | Unknown | N/A | |
| Towards the Generalization of Contrastive Self-Supervised Learning | Unknown | N/A | |
| Deterministic training of generative autoencoders using invertible layers | Unknown | N/A | |
| Canary in a Coalmine: Better Membership Inference with Ensembled Adversarial Queries | Unknown | N/A | |
| Learning rigid dynamics with face interaction graph networks | Unknown | N/A | |
| Few-shot Cross-domain Image Generation via Inference-time Latent-code Learning | Unknown | N/A | |
| On the Sensitivity of Reward Inference to Misspecified Human Models | Unknown | N/A | |
| ArCL: Enhancing Contrastive Learning with Augmentation-Robust Representations | Unknown | N/A | |
| Pink Noise Is All You Need: Colored Noise Exploration in Deep Reinforcement Learning | Unknown | N/A | |
| Understanding and Adopting Rational Behavior by Bellman Score Estimation | Unknown | N/A | |
| SMART: Self-supervised Multi-task pretrAining with contRol Transformers | Unknown | N/A | |
| Learning with Stochastic Orders | Unknown | N/A | |
| LightGCL: Simple Yet Effective Graph Contrastive Learning for Recommendation | Unknown | N/A | |
| Extremely Simple Activation Shaping for Out-of-Distribution Detection | Unknown | N/A | |
| Extreme Q-Learning: MaxEnt RL without Entropy | Unknown | N/A | |
| Equiformer: Equivariant Graph Attention Transformer for 3D Atomistic Graphs | Unknown | N/A | |
| Learning About Progress From Experts | Unknown | N/A | |
| Nonlinear Reconstruction for Operator Learning of PDEs with Discontinuities | Unknown | N/A | |
| Fast and Precise: Adjusting Planning Horizon with Adaptive Subgoal Search | Unknown | N/A | |
| Git Re-Basin: Merging Models modulo Permutation Symmetries | Unknown | N/A | |
| SimPer: Simple Self-Supervised Learning of Periodic Targets | Unknown | N/A | |
| No Reason for No Supervision: Improved Generalization in Supervised Models | Unknown | N/A | |
| Towards Effective and Interpretable Human-Agent Collaboration in MOBA Games: A Communication Perspective | Unknown | N/A | |
| EV-GAN: Simulation of extreme events with ReLU neural networks | Unknown | N/A | |
| Learning Rates as a Function of Batch Size: A Random Matrix Theory Approach to Neural Network Training | Unknown | N/A | |
| Topologically penalized regression on manifolds | Unknown | N/A | |
| Principal Components Bias in Over-parameterized Linear Models, and its Manifestation in Deep Neural Networks | Unknown | N/A | |
| Composite Slice Transformer: An Efficient Transformer with Composition of Multi-Scale Multi-Range Attentions | Unknown | N/A | |
| OPTQ: Accurate Quantization for Generative Pre-trained Transformers | Unknown | N/A | |
| Faster Last-iterate Convergence of Policy Optimization in Zero-Sum Markov Games | Unknown | N/A | |
| Learning Controllable Adaptive Simulation for Multi-resolution Physics | Unknown | N/A | |
| The Tilted Variational Autoencoder: Improving Out-of-Distribution Detection | Unknown | N/A | |
| Constraining Representations Yields Models That Know What They Don't Know | Unknown | N/A | |
| Faster federated optimization under second-order similarity | Unknown | N/A | |
| Diffusion Models for Causal Discovery via Topological Ordering | Unknown | N/A | |
| Deconstructing Distributions: A Pointwise Framework of Learning | Unknown | N/A | |
| Direct Embedding of Temporal Network Edges via Time-Decayed Line Graphs | Unknown | N/A | |
| Gray-Box Gaussian Processes for Automated Reinforcement Learning | Unknown | N/A | |
| Machine Unlearning of Federated Clusters | Unknown | N/A | |
| Making Substitute Models More Bayesian Can Enhance Transferability of Adversarial Examples | Unknown | N/A | |
| MoDem: Accelerating Visual Model-Based Reinforcement Learning with Demonstrations | Unknown | N/A | |
| Understanding Embodied Reference with Touch-Line Transformer | Unknown | N/A | |
| Semi-supervised learning with a principled likelihood from a generative model of data curation | Unknown | N/A | |
| The hidden uniform cluster prior in self-supervised learning | Unknown | N/A | |
| STaSy: Score-based Tabular data Synthesis | Unknown | N/A | |
| Near-optimal Coresets for Robust Clustering | Unknown | N/A | |
| Learning Diffusion Bridges on Constrained Domains | Unknown | N/A | |
| Bayesian Oracle for bounding information gain in neural encoding models | Unknown | N/A | |
| Scaling Pareto-Efficient Decision Making via Offline Multi-Objective RL | Unknown | N/A | |
| On Representing Linear Programs by Graph Neural Networks | Unknown | N/A | |
| SLTUNET: A Simple Unified Model for Sign Language Translation | Unknown | N/A | |
| Confidential-PROFITT: Confidential PROof of FaIr Training of Trees | Unknown | N/A | |
| A Convergent Single-Loop Algorithm for Relaxation of Gromov-Wasserstein in Graph Data | Unknown | N/A | |
| Guess the Instruction! Flipped Learning Makes Language Models Stronger Zero-Shot Learners | Unknown | N/A | |
| Understanding Neural Coding on Latent Manifolds by Sharing Features and Dividing Ensembles | Unknown | N/A | |
| Model-based Causal Bayesian Optimization | Unknown | N/A | |
| Multi-objective optimization via equivariant deep hypervolume approximation | Unknown | N/A | |
| ExpressivE: A Spatio-Functional Embedding For Knowledge Graph Completion | Unknown | N/A | |
| Why adversarial training can hurt robust accuracy | Unknown | N/A | |
| Planning with Large Language Models for Code Generation | Unknown | N/A | |
| Where to Diffuse, How to Diffuse, and How to Get Back: Automated Learning for Multivariate Diffusions | Unknown | N/A | |
| EPISODE: Episodic Gradient Clipping with Periodic Resampled Corrections for Federated Learning with Heterogeneous Data | Unknown | N/A | |
| VIP: Towards Universal Visual Reward and Representation via Value-Implicit Pre-Training | Unknown | N/A | |
| How Does Semi-supervised Learning with Pseudo-labelers Work? A Case Study | Unknown | N/A | |
| Robust Scheduling with GFlowNets | Unknown | N/A | |
| Generating Sequences by Learning to Self-Correct | Unknown | N/A | |
| Feature selection and low test error in shallow low-rotation ReLU networks | Unknown | N/A | |
| A probabilistic framework for task-aligned intra- and inter-area neural manifold estimation | Unknown | N/A | |
| Behavior Proximal Policy Optimization | Unknown | N/A | |
| Bayes-MIL: A New Probabilistic Perspective on Attention-based Multiple Instance Learning for Whole Slide Images | Unknown | N/A | |
| Performance Bounds for Model and Policy Transfer in Hidden-parameter MDPs | Unknown | N/A | |
| Distributed Extra-gradient with Optimal Complexity and Communication Guarantees | Unknown | N/A | |
| Characteristic Neural Ordinary Differential Equation | Unknown | N/A | |
| Rhino: Deep Causal Temporal Relationship Learning with History-dependent Noise | Unknown | N/A | |
| Can We Find Nash Equilibria at a Linear Rate in Markov Games? | Unknown | N/A | |
| Causal Reasoning in the Presence of Latent Confounders via Neural ADMG Learning | Unknown | N/A | |
| Understanding DDPM Latent Codes Through Optimal Transport | Unknown | N/A | |
| Scaling up and Stabilizing Differentiable Planning with Implicit Differentiation | Unknown | N/A | |
| A Theory of Dynamic Benchmarks | Unknown | N/A | |
| Language models are multilingual chain-of-thought reasoners | Unknown | N/A | |
| FIGARO: Controllable Music Generation using Learned and Expert Features | Unknown | N/A | |
| Real-time variational method for learning neural trajectory and its dynamics | Unknown | N/A | |
| Label Propagation with Weak Supervision | Unknown | N/A | |
| MICN: Multi-scale Local and Global Context Modeling for Long-term Series Forecasting | Unknown | N/A | |
| f-DM: A Multi-stage Diffusion Model via Progressive Signal Transformation | Unknown | N/A | |
| Interpretable Geometric Deep Learning via Learnable Randomness Injection | Unknown | N/A | |
| Denoising Diffusion Samplers | Unknown | N/A | |
| Panning for Gold in Federated Learning: Targeted Text Extraction under Arbitrarily Large-Scale Aggregation | Unknown | N/A | |
| Model ensemble instead of prompt fusion: a sample-specific knowledge transfer method for few-shot prompt tuning | Unknown | N/A | |
| Learning Iterative Neural Optimizers for Image Steganography | Unknown | N/A | |
| Decomposed Prompting: A Modular Approach for Solving Complex Tasks | Unknown | N/A | |
| A Control-Centric Benchmark for Video Prediction | Unknown | N/A | |
| Enhancing Meta Learning via Multi-Objective Soft Improvement Functions | Unknown | N/A | |
| Pessimism in the Face of Confounders: Provably Efficient Offline Reinforcement Learning in Partially Observable Markov Decision Processes | Unknown | N/A | |
| Memorization Capacity of Neural Networks with Conditional Computation | Unknown | N/A | |
| Graph Domain Adaptation via Theory-Grounded Spectral Regularization | Unknown | N/A | |
| Characterizing intrinsic compositionality in transformers with Tree Projections | Unknown | N/A | |
| Stateful Active Facilitator: Coordination and Environmental Heterogeneity in Cooperative Multi-Agent Reinforcement Learning | Unknown | N/A | |
| Recursive Time Series Data Augmentation | Unknown | N/A | |
| Fast Nonlinear Vector Quantile Regression | Unknown | N/A | |
| Unveiling the sampling density in non-uniform geometric graphs | Unknown | N/A | |
| Test-Time Robust Personalization for Federated Learning | Unknown | N/A | |
| Solving stochastic weak Minty variational inequalities without increasing batch size | Unknown | N/A | |
| QAID: Question Answering Inspired Few-shot Intent Detection | Unknown | N/A | |
| Towards Addressing Label Skews in One-Shot Federated Learning | Unknown | N/A | |
| Contextual Convolutional Networks | Unknown | N/A | |
| Scenario-based Question Answering with Interacting Contextual Properties | Unknown | N/A | |
| EAGLE: Large-scale Learning of Turbulent Fluid Dynamics with Mesh Transformers | Unknown | N/A | |
| Neural ePDOs: Spatially Adaptive Equivariant Partial Differential Operator Based Networks | Unknown | N/A | |
| Learning Fair Graph Representations via Automated Data Augmentations | Unknown | N/A | |
| Learning Hierarchical Protein Representations via Complete 3D Graph Networks | Unknown | N/A | |
| Computing all Optimal Partial Transports | Unknown | N/A | |
| Learning Vortex Dynamics for Fluid Inference and Prediction | Unknown | N/A | |
| Analog Bits: Generating Discrete Data using Diffusion Models with Self-Conditioning | Unknown | N/A | |
| Filter-Recovery Network for Multi-Speaker Audio-Visual Speech Separation | Unknown | N/A | |
| Function-Consistent Feature Distillation | Unknown | N/A | |
| Decompose to Generalize: Species-Generalized Animal Pose Estimation | Unknown | N/A | |
| Scaling Up Probabilistic Circuits by Latent Variable Distillation | Unknown | N/A | |
| What shapes the loss landscape of self supervised learning? | Unknown | N/A | |
| Efficient approximation of neural population structure and correlations with probabilistic circuits | Unknown | N/A | |
| TypeT5: Seq2seq Type Inference using Static Analysis | Unknown | N/A | |
| Stay Moral and Explore: Learn to Behave Morally in Text-based Games | Unknown | N/A | |
| Improving Differentiable Neural Architecture Search by Encouraging Transferability | Unknown | N/A | |
| Auto-Encoding Goodness of Fit | Unknown | N/A | |
| Particle-based Variational Inference with Preconditioned Functional Gradient Flow | Unknown | N/A | |
| Markup-to-Image Diffusion Models with Scheduled Sampling | Unknown | N/A | |
| An Extensible Multi-modal Multi-task Object Dataset with Materials | Unknown | N/A | |
| DySR: Adaptive Super-Resolution via Algorithm and System Co-design | Unknown | N/A | |
| Can Neural Networks Learn Implicit Logic from Physical Reasoning? | Unknown | N/A | |
| ManyDG: Many-domain Generalization for Healthcare Applications | Unknown | N/A | |
| StyleMorph: Disentangled 3D-Aware Image Synthesis with a 3D Morphable StyleGAN | Unknown | N/A | |
| Packed Ensembles for efficient uncertainty estimation | Unknown | N/A | |
| Generalization and Estimation Error Bounds for Model-based Neural Networks | Unknown | N/A | |
| Towards convergence to Nash equilibria in two-team zero-sum games | Unknown | N/A | |
| Exploring Temporally Dynamic Data Augmentation for Video Recognition | Unknown | N/A | |
| Understanding Train-Validation Split in Meta-Learning with Neural Networks | Unknown | N/A | |
| Bispectral Neural Networks | Unknown | N/A | |
| A Learning Based Hypothesis Test for Harmful Covariate Shift | Unknown | N/A | |
| $k$NN Prompting: Beyond-Context Learning with Calibration-Free Nearest Neighbor Inference | Unknown | N/A | |
| Selective Frequency Network for Image Restoration | Unknown | N/A | |
| ImaginaryNet: Learning Object Detectors without Real Images and Annotations | Unknown | N/A | |
| Safe Reinforcement Learning From Pixels Using a Stochastic Latent Representation | Unknown | N/A | |
| Understanding the Generalization of Adam in Learning Neural Networks with Proper Regularization | Unknown | N/A | |
| FINDE: Neural Differential Equations for Finding and Preserving Invariant Quantities | Unknown | N/A | |
| Offline RL for Natural Language Generation with Implicit Language Q Learning | Unknown | N/A | |
| Fantastic Rewards and How to Tame Them: A Case Study on Reward Learning for Task-oriented Dialogue Systems | Unknown | N/A | |
| Human alignment of neural network representations | Unknown | N/A | |
| Risk-Aware Reinforcement Learning with Coherent Risk Measures and Non-linear Function Approximation | Unknown | N/A | |
| Self-supervision through Random Segments with Autoregressive Coding (RandSAC) | Unknown | N/A | |
| Efficiently Controlling Multiple Risks with Pareto Testing | Unknown | N/A | |
| DiffMimic: Efficient Motion Mimicking with Differentiable Physics | Unknown | N/A | |
| Causality Compensated Attention for Contextual Biased Visual Recognition | Unknown | N/A | |
| Sparse Mixture-of-Experts are Domain Generalizable Learners | Unknown | N/A | |
| Towards Better Selective Classification | Unknown | N/A | |
| Latent Bottlenecked Attentive Neural Processes | Unknown | N/A | |
| How Informative is the Approximation Error from Tensor Decomposition for Neural Network Compression? | Unknown | N/A | |
| Latent Graph Inference using Product Manifolds | Unknown | N/A | |
| Light Sampling Field and BRDF Representation for Physically-based Neural Rendering | Unknown | N/A | |
| FedExP: Speeding Up Federated Averaging via Extrapolation | Unknown | N/A | |
| Compositionality with Variation Reliably Emerges in Neural Networks | Unknown | N/A | |
| gDDIM: Generalized denoising diffusion implicit models | Unknown | N/A | |
| Fast Sampling of Diffusion Models with Exponential Integrator | Unknown | N/A | |
| Understanding The Robustness of Self-supervised Learning Through Topic Modeling | Unknown | N/A | |
| Interactive Portrait Harmonization | Unknown | N/A | |
| AIM: Adapting Image Models for Efficient Video Action Recognition | Unknown | N/A | |
| Parameter-Efficient Fine-Tuning Design Spaces | Unknown | N/A | |
| Not All Tasks Are Born Equal: Understanding Zero-Shot Generalization | Unknown | N/A | |
| Learning in temporally structured environments | Unknown | N/A | |
| Dilated convolution with learnable spacings | Unknown | N/A | |
| PEER: A Collaborative Language Model | Unknown | N/A | |
| Maximizing Spatio-Temporal Entropy of Deep 3D CNNs for Efficient Video Recognition | Unknown | N/A | |
| Formal Mathematics Statement Curriculum Learning | Unknown | N/A | |
| D4FT: A Deep Learning Approach to Kohn-Sham Density Functional Theory | Unknown | N/A | |
| Subsampling in Large Graphs Using Ricci Curvature | Unknown | N/A | |
| $\Lambda$-DARTS: Mitigating Performance Collapse by Harmonizing Operation Selection among Cells | Unknown | N/A | |
| UL2: Unifying Language Learning Paradigms | Unknown | N/A | |
| STREET: A Multi-Task Structured Reasoning and Explanation Benchmark | Unknown | N/A | |
| Efficient Certified Training and Robustness Verification of Neural ODEs | Unknown | N/A | |
| Active Learning for Object Detection with Evidential Deep Learning and Hierarchical Uncertainty Aggregation | Unknown | N/A | |
| Diffusion Probabilistic Fields | Unknown | N/A | |
| Globally Injective ReLU Networks | Unknown | N/A | |
| SCALE-UP: An Efficient Black-box Input-level Backdoor Detection via Analyzing Scaled Prediction Consistency | Unknown | N/A | |
| Greedification Operators for Policy Optimization: Investigating Forward and Reverse KL Divergences | Unknown | N/A | |
| Greedy Actor-Critic: A New Conditional Cross-Entropy Method for Policy Improvement | Unknown | N/A | |
| SemPPL: Predicting Pseudo-Labels for Better Contrastive Representations | Unknown | N/A | |
| Distributionally Robust Recourse Action | Unknown | N/A | |
| Bidirectional Propagation for Cross-Modal 3D Object Detection | Unknown | N/A | |
| On the Effectiveness of Out-of-Distribution Data in Self-Supervised Long-Tail Learning | Unknown | N/A | |
| Efficient Deep Reinforcement Learning Requires Regulating Overfitting | Unknown | N/A | |
| Using Both Demonstrations and Language Instructions to Efficiently Learn Robotic Tasks | Unknown | N/A | |
| Learning to Segment from Noisy Annotations: A Spatial Correction Approach | Unknown | N/A | |
| Diffusion Adversarial Representation Learning for Self-supervised Vessel Segmentation | Unknown | N/A | |
| DynaMS: Dynamic Margin Selection for Efficient Deep Learning | Unknown | N/A | |
| CogVideo: Large-scale Pretraining for Text-to-Video Generation via Transformers | Unknown | N/A | |
| Mid-Vision Feedback | Unknown | N/A | |
| HiCLIP: Contrastive Language-Image Pretraining with Hierarchy-aware Attention | Unknown | N/A | |
| Learning Rationalizable Equilibria in Multiplayer Games | Unknown | N/A | |
| Revisiting Intrinsic Reward for Exploration in Procedurally Generated Environments | Unknown | N/A | |
| Deep Learning From Crowdsourced Labels: Coupled Cross-Entropy Minimization, Identifiability, and Regularization | Unknown | N/A | |
| Efficient Federated Domain Translation | Unknown | N/A | |
| Diversify and Disambiguate: Out-of-Distribution Robustness via Disagreement | Unknown | N/A | |
| Agnostic Learning of General ReLU Activation Using Gradient Descent | Unknown | N/A | |
| Latent Variable Representation for Reinforcement Learning | Unknown | N/A | |
| Learning Probabilistic Topological Representations Using Discrete Morse Theory | Unknown | N/A | |
| Spectral Decomposition Representation for Reinforcement Learning | Unknown | N/A | |
| Value Memory Graph: A Graph-Structured World Model for Offline Reinforcement Learning | Unknown | N/A | |
| Robust Multivariate Time-Series Forecasting: Adversarial Attacks and Defense Mechanisms | Unknown | N/A | |
| MMVAE+: Enhancing the Generative Quality of Multimodal VAEs without Compromises | Unknown | N/A | |
| Causal Confusion and Reward Misidentification in Preference-Based Reward Learning | Unknown | N/A | |
| TaskPrompter: Spatial-Channel Multi-Task Prompting for Dense Scene Understanding | Unknown | N/A | |
| Open-Vocabulary Object Detection upon Frozen Vision and Language Models | Unknown | N/A | |
| Solving Continuous Control via Q-learning | Unknown | N/A | |
| Mutual Partial Label Learning with Competitive Label Noise | Unknown | N/A | |
| Partial Label Unsupervised Domain Adaptation with Class-Prototype Alignment | Unknown | N/A | |
| Max-Margin Works while Large Margin Fails: Generalization without Uniform Convergence | Unknown | N/A | |
| Implicit Regularization for Group Sparsity | Unknown | N/A | |
| A Generalized Projected Bellman Error for Off-policy Value Estimation in Reinforcement Learning | Unknown | N/A | |
| Martingale Posterior Neural Processes | Unknown | N/A | |
| ESD: Expected Squared Difference as a Tuning-Free Trainable Calibration Measure | Unknown | N/A | |
| Red PANDA: Disambiguating Image Anomaly Detection by Removing Nuisance Factors | Unknown | N/A | |
| Identifiability Results for Multimodal Contrastive Learning | Unknown | N/A | |
| Weakly Supervised Knowledge Transfer with Probabilistic Logical Reasoning for Object Detection | Unknown | N/A | |
| Learning Group Importance using the Differentiable Hypergeometric Distribution | Unknown | N/A | |
| PAC-NeRF: Physics Augmented Continuum Neural Radiance Fields for Geometry-Agnostic System Identification | Unknown | N/A | |
| A View From Somewhere: Human-Centric Face Representations | Unknown | N/A | |
| PaLI: A Jointly-Scaled Multilingual Language-Image Model | Unknown | N/A | |
| Learning Uncertainty for Unknown Domains with Zero-Target-Assumption | Unknown | N/A | |
| Weakly Supervised Explainable Phrasal Reasoning with Neural Fuzzy Logic | Unknown | N/A | |
| Self-Supervised Geometric Correspondence for Category-Level 6D Object Pose Estimation in the Wild | Unknown | N/A | |
| Federated Learning as Variational Inference: A Scalable Expectation Propagation Approach | Unknown | N/A | |
| MPCFormer: Fast, Performant and Private Transformer Inference with MPC | Unknown | N/A | |
| Building Normalizing Flows with Stochastic Interpolants | Unknown | N/A | |
| Clifford Neural Layers for PDE Modeling | Unknown | N/A | |
| Learning to Compose Soft Prompts for Compositional Zero-Shot Learning | Unknown | N/A | |
| LogicDP: Creating Labels for Graph Data via Inductive Logic Programming | Unknown | N/A | |
| Maximizing Communication Efficiency for Large-scale Training via 0/1 Adam | Unknown | N/A | |
| Temporal Coherent Test Time Optimization for Robust Video Classification | Unknown | N/A | |
| Multi-Objective Online Learning | Unknown | N/A | |
| Provable Robustness against Wasserstein Distribution Shifts via Input Randomization | Unknown | N/A | |
| CLIPSep: Learning Text-queried Sound Separation with Noisy Unlabeled Videos | Unknown | N/A | |
| On the Performance of Temporal Difference Learning With Neural Networks | Unknown | N/A | |
| Exploring and Exploiting Decision Boundary Dynamics for Adversarial Robustness | Unknown | N/A | |
| Scaling Laws for a Multi-Agent Reinforcement Learning Model | Unknown | N/A | |
| Transfer Learning with Deep Tabular Models | Unknown | N/A | |
| Anti-Symmetric DGN: a stable architecture for Deep Graph Networks | Unknown | N/A | |
| Protein Sequence and Structure Co-Design with Equivariant Translation | Unknown | N/A | |
| Differentially Private Adaptive Optimization with Delayed Preconditioners | Unknown | N/A | |
| SMART: Sentences as Basic Units for Text Evaluation | Unknown | N/A | |
| Fairness and Accuracy under Domain Generalization | Unknown | N/A | |
| Mitigating Dataset Bias by Using Per-Sample Gradient | Unknown | N/A | |
| Gradient Boosting Performs Gaussian Process Inference | Unknown | N/A | |
| A critical look at the evaluation of GNNs under heterophily: Are we really making progress? | Unknown | N/A | |
| Provably Efficient Lifelong Reinforcement Learning with Linear Representation | Unknown | N/A | |
| Confidence Estimation Using Unlabeled Data | Unknown | N/A | |
| Multi-Rate VAE: Train Once, Get the Full Rate-Distortion Curve | Unknown | N/A | |
| Diffusion-based Image Translation using disentangled style and content representation | Unknown | N/A | |
| Competitive Physics Informed Networks | Unknown | N/A | |
| Learnable Graph Convolutional Attention Networks | Unknown | N/A | |
| On the Data-Efficiency with Contrastive Image Transformation in Reinforcement Learning | Unknown | N/A | |
| Unbiased Supervised Contrastive Learning | Unknown | N/A | |
| Interneurons accelerate learning dynamics in recurrent neural networks for statistical adaptation | Unknown | N/A | |
| Bridging the Gap to Real-World Object-Centric Learning | Unknown | N/A | |
| Diffusion Policies as an Expressive Policy Class for Offline Reinforcement Learning | Unknown | N/A | |
| Understanding Zero-shot Adversarial Robustness for Large-Scale Models | Unknown | N/A | |
| TimesNet: Temporal 2D-Variation Modeling for General Time Series Analysis | Unknown | N/A | |
| 3D generation on ImageNet | Unknown | N/A | |
| TTN: A Domain-Shift Aware Batch Normalization in Test-Time Adaptation | Unknown | N/A | |
| Winning Both the Accuracy of Floating Point Activation and the Simplicity of Integer Arithmetic | Unknown | N/A | |
| Sparse Distributed Memory is a Continual Learner | Unknown | N/A | |
| Deep Transformers without Shortcuts: Modifying Self-attention for Faithful Signal Propagation | Unknown | N/A | |
| Symmetric Pruning in Quantum Neural Networks | Unknown | N/A | |
| On The Inadequacy of Optimizing Alignment and Uniformity in Contrastive Learning of Sentence Representations | Unknown | N/A | |
| GNNDelete: A General Strategy for Unlearning in Graph Neural Networks | Unknown | N/A | |
| Fundamental limits on the robustness of image classifiers | Unknown | N/A | |
| CodeBPE: Investigating Subtokenization Options for Large Language Model Pretraining on Source Code | Unknown | N/A | |
| Leveraging Unlabeled Data to Track Memorization | Unknown | N/A | |
| Learning to reason over visual objects | Unknown | N/A | |
| Weighted Ensemble Self-Supervised Learning | Unknown | N/A | |
| Agent-based Graph Neural Networks | Unknown | N/A | |
| Tuning Frequency Bias in Neural Network Training with Nonuniform Data | Unknown | N/A | |
| Treeformer: Dense Gradient Trees for Efficient Attention Computation | Unknown | N/A | |
| The Lazy Neuron Phenomenon: On Emergence of Activation Sparsity in Transformers | Unknown | N/A | |
| Augmentation with Projection: Towards an Effective and Efficient Data Augmentation Paradigm for Distillation | Unknown | N/A | |
| Protein Representation Learning by Geometric Structure Pretraining | Unknown | N/A | |
| DAG Matters! GFlowNets Enhanced Explainer for Graph Neural Networks | Unknown | N/A | |
| Don’t forget the nullspace! Nullspace occupancy as a mechanism for out of distribution failure | Unknown | N/A | |
| Uniform-in-time propagation of chaos for the mean-field gradient Langevin dynamics | Unknown | N/A | |
| Excess Risk of Two-Layer ReLU Neural Networks in Teacher-Student Settings and its Superiority to Kernel Methods | Unknown | N/A | |
| Confidence-Based Feature Imputation for Graphs with Partially Known Features | Unknown | N/A | |
| Imitating Graph-Based Planning with Goal-Conditioned Policies | Unknown | N/A | |
| Long-Tailed Learning Requires Feature Learning | Unknown | N/A | |
| FedDAR: Federated Domain-Aware Representation Learning | Unknown | N/A | |
| FLIP: A Provable Defense Framework for Backdoor Mitigation in Federated Learning | Unknown | N/A | |
| Factorized Fourier Neural Operators | Unknown | N/A | |
| Variational Latent Branching Model for Off-Policy Evaluation | Unknown | N/A | |
| 3D Equivariant Diffusion for Target-Aware Molecule Generation and Affinity Prediction | Unknown | N/A | |
| Transformer-based model for symbolic regression via joint supervised learning | Unknown | N/A | |
| LS-IQ: Implicit Reward Regularization for Inverse Reinforcement Learning | Unknown | N/A | |
| Share Your Representation Only: Guaranteed Improvement of the Privacy-Utility Tradeoff in Federated Learning | Unknown | N/A | |
| Mini-batch $k$-means terminates within $O(d/\epsilon)$ iterations | Unknown | N/A | |
| Disentanglement of Correlated Factors via Hausdorff Factorized Support | Unknown | N/A | |
| An efficient encoder-decoder architecture with top-down attention for speech separation | Unknown | N/A | |
| Specformer: Spectral Graph Neural Networks Meet Transformers | Unknown | N/A | |
| BigVGAN: A Universal Neural Vocoder with Large-Scale Training | Unknown | N/A | |
| ZiCo: Zero-shot NAS via inverse Coefficient of Variation on Gradients | Unknown | N/A | |
| Cross-Layer Retrospective Retrieving via Layer Attention | Unknown | N/A | |
| Decision S4: Efficient Sequence-Based RL via State Spaces Layers | Unknown | N/A | |
| Easy Differentially Private Linear Regression | Unknown | N/A | |
| Contextual bandits with concave rewards, and an application to fair ranking | Unknown | N/A | |
| Can BERT Refrain from Forgetting on Sequential Tasks? A Probing Study | Unknown | N/A | |
| A General Framework for Sample-Efficient Function Approximation in Reinforcement Learning | Unknown | N/A | |
| Mole-BERT: Rethinking Pre-training Graph Neural Networks for Molecules | Unknown | N/A | |
| DropIT: Dropping Intermediate Tensors for Memory-Efficient DNN Training | Unknown | N/A | |
| Latent State Marginalization as a Low-cost Approach for Improving Exploration | Unknown | N/A | |
| Implicit Bias in Leaky ReLU Networks Trained on High-Dimensional Data | Unknown | N/A | |
| GFlowNets and variational inference | Unknown | N/A | |
| Leveraging Large Language Models for Multiple Choice Question Answering | Unknown | N/A | |
| Understanding the Covariance Structure of Convolutional Filters | Unknown | N/A | |
| Regression with Label Differential Privacy | Unknown | N/A | |
| E3Bind: An End-to-End Equivariant Network for Protein-Ligand Docking | Unknown | N/A | |
| WiNeRT: Towards Neural Ray Tracing for Wireless Channel Modelling and Differentiable Simulations | Unknown | N/A | |
| MA-BERT: Towards Matrix Arithmetic-only BERT Inference by Eliminating Complex Non-Linear Functions | Unknown | N/A | |
| An Exact Poly-Time Membership-Queries Algorithm for Extracting a Three-Layer ReLU Network | Unknown | N/A | |
| On the Robustness to Misspecification of α-posteriors and Their Variational Approximations | Unknown | N/A | |
| Binding Language Models in Symbolic Languages | Unknown | N/A | |
| ContraNorm: A Contrastive Learning Perspective on Oversmoothing and Beyond | Unknown | N/A | |
| Compositional Semantic Parsing with Large Language Models | Unknown | N/A | |
| Coupled Multiwavelet Operator Learning for Coupled Differential Equations | Unknown | N/A | |
| POPGym: Benchmarking Partially Observable Reinforcement Learning | Unknown | N/A | |
| Rethinking the Effect of Data Augmentation in Adversarial Contrastive Learning | Unknown | N/A | |
| TrojText: Test-time Invisible Textual Trojan Insertion | Unknown | N/A | |
| Transferable Unlearnable Examples | Unknown | N/A | |
| Error Sensitivity Modulation based Experience Replay: Mitigating Abrupt Representation Drift in Continual Learning | Unknown | N/A | |
| Modelling Long Range Dependencies in $N$D: From Task-Specific to a General Purpose CNN | Unknown | N/A | |
| Task-Aware Information Routing from Common Representation Space in Lifelong Learning | Unknown | N/A | |
| GNNInterpreter: A Probabilistic Generative Model-Level Explanation for Graph Neural Networks | Unknown | N/A | |
| Pushing the Limits of Fewshot Anomaly Detection in Industry Vision: Graphcore | Unknown | N/A | |
| Combinatorial Pure Exploration of Causal Bandits | Unknown | N/A | |
| Pareto Invariant Risk Minimization: Towards Mitigating the Optimization Dilemma in Out-of-Distribution Generalization | Unknown | N/A | |
| What Makes Convolutional Models Great on Long Sequence Modeling? | Unknown | N/A | |
| Learning Multimodal Data Augmentation in Feature Space | Unknown | N/A | |
| Neural Systematic Binder | Unknown | N/A | |
| Active Image Indexing | Unknown | N/A | |
| How Much Data Are Augmentations Worth? An Investigation into Scaling Laws, Invariance, and Implicit Regularization | Unknown | N/A | |
| Learning Human-Compatible Representations for Case-Based Decision Support | Unknown | N/A | |
| Optimizing Spca-based Continual Learning: A Theoretical Approach | Unknown | N/A | |
| Decepticons: Corrupted Transformers Breach Privacy in Federated Learning for Language Models | Unknown | N/A | |
| The Best of Both Worlds: Accurate Global and Personalized Models through Federated Learning with Data-Free Hyper-Knowledge Distillation | Unknown | N/A | |
| Linear Connectivity Reveals Generalization Strategies | Unknown | N/A | |
| SCoMoE: Efficient Mixtures of Experts with Structured Communication | Unknown | N/A | |
| A view of mini-batch SGD via generating functions: conditions of convergence, phase transitions, benefit from negative momenta | Unknown | N/A | |
| The Role of Coverage in Online Reinforcement Learning | Unknown | N/A | |
| PINTO: Faithful Language Reasoning Using Prompt-Generated Rationales | Unknown | N/A | |
| GEASS: Neural causal feature selection for high-dimensional biological data | Unknown | N/A | |
| SmartFRZ: An Efficient Training Framework using Attention-Based Layer Freezing | Unknown | N/A | |
| Hierarchical Abstraction for Combinatorial Generalization in Object Rearrangement | Unknown | N/A | |
| Augmentation Component Analysis: Modeling Similarity via the Augmentation Overlaps | Unknown | N/A | |
| Neural Bregman Divergences for Distance Learning | Unknown | N/A | |
| Incompatibility Clustering as a Defense Against Backdoor Poisoning Attacks | Unknown | N/A | |
| A Theoretical Understanding of Shallow Vision Transformers: Learning, Generalization, and Sample Complexity | Unknown | N/A | |
| ChiroDiff: Modelling chirographic data with Diffusion Models | Unknown | N/A | |
| Out-of-Distribution Detection and Selective Generation for Conditional Language Models | Unknown | N/A | |
| A Unified Framework for Soft Threshold Pruning | Unknown | N/A | |
| Federated Neural Bandits | Unknown | N/A | |
| Compositional Task Representations for Large Language Models | Unknown | N/A | |
| An Additive Instance-Wise Approach to Multi-class Model Interpretation | Unknown | N/A | |
| Editing models with task arithmetic | Unknown | N/A | |
| Reparameterization through Spatial Gradient Scaling | Unknown | N/A | |
| Quality-Similar Diversity via Population Based Reinforcement Learning | Unknown | N/A | |
| Indiscriminate Poisoning Attacks on Unsupervised Contrastive Learning | Unknown | N/A | |
| Adaptive Optimization in the $\infty$-Width Limit | Unknown | N/A | |
| Surgical Fine-Tuning Improves Adaptation to Distribution Shifts | Unknown | N/A | |
| Avoiding spurious correlations via logit correction | Unknown | N/A | |
| Interpretability with full complexity by constraining feature information | Unknown | N/A | |
| User-Interactive Offline Reinforcement Learning | Unknown | N/A | |
| Safe Exploration Incurs Nearly No Additional Sample Complexity for Reward-Free RL | Unknown | N/A | |
| Large Language Models are Human-Level Prompt Engineers | Unknown | N/A | |
| Pruning Deep Neural Networks from a Sparsity Perspective | Unknown | N/A | |
| Spotlight: Mobile UI Understanding using Vision-Language Models with a Focus | Unknown | N/A | |
| Dual Student Networks for Data-Free Model Stealing | Unknown | N/A | |
| Data-Free One-Shot Federated Learning Under Very High Statistical Heterogeneity | Unknown | N/A | |
| Mosaic Representation Learning for Self-supervised Visual Pre-training | Unknown | N/A | |
| A Theoretical Framework for Inference and Learning in Predictive Coding Networks | Unknown | N/A | |
| Variance Reduction is an Antidote to Byzantines: Better Rates, Weaker Assumptions and Communication Compression as a Cherry on the Top | Unknown | N/A | |
| Energy-Inspired Self-Supervised Pretraining for Vision Models | Unknown | N/A | |
| Effectively Modeling Time Series with Simple Discrete State Spaces | Unknown | N/A | |
| A Time Series is Worth 64 Words: Long-term Forecasting with Transformers | Unknown | N/A | |
| Random Laplacian Features for Learning with Hyperbolic Space | Unknown | N/A | |
| Replay Memory as An Empirical MDP: Combining Conservative Estimation with Experience Replay | Unknown | N/A | |
| $\mathrm{SE}(3)$-Equivariant Attention Networks for Shape Reconstruction in Function Space | Unknown | N/A | |
| Language Models Are Greedy Reasoners: A Systematic Formal Analysis of Chain-of-Thought | Unknown | N/A | |
| Momentum Stiefel Optimizer, with Applications to Suitably-Orthogonal Attention, and Optimal Transport | Unknown | N/A | |
| Time Will Tell: New Outlooks and A Baseline for Temporal Multi-View 3D Object Detection | Unknown | N/A | |
| CircNet: Meshing 3D Point Clouds with Circumcenter Detection | Unknown | N/A | |
| Robust Graph Dictionary Learning | Unknown | N/A | |
| Unsupervised visualization of image datasets using contrastive learning | Unknown | N/A | |
| Learning Harmonic Molecular Representations on Riemannian Manifold | Unknown | N/A | |
| Learning Language Representations with Logical Inductive Bias | Unknown | N/A | |
| First Steps Toward Understanding the Extrapolation of Nonlinear Models to Unseen Domains | Unknown | N/A | |
| Concept-level Debugging of Part-Prototype Networks | Unknown | N/A | |
| CodeGen: An Open Large Language Model for Code with Multi-Turn Program Synthesis | Unknown | N/A | |
| Computational Language Acquisition with Theory of Mind | Unknown | N/A | |
| Projective Proximal Gradient Descent for Nonconvex Nonsmooth Optimization: Fast Convergence Without Kurdyka-Lojasiewicz (KL) Property | Unknown | N/A | |
| Mega: Moving Average Equipped Gated Attention | Unknown | N/A | |
| Wasserstein Auto-encoded MDPs: Formal Verification of Efficiently Distilled RL Policies with Many-sided Guarantees | Unknown | N/A | |
| Prototypical Calibration for Few-shot Learning of Language Models | Unknown | N/A | |
| Serving Graph Compression for Graph Neural Networks | Unknown | N/A | |
| Learning MLPs on Graphs: A Unified View of Effectiveness, Robustness, and Efficiency | Unknown | N/A | |
| Geometrically regularized autoencoders for non-Euclidean data | Unknown | N/A | |
| Calibrating the Rigged Lottery: Making All Tickets Reliable | Unknown | N/A | |
| A new characterization of the edge of stability based on a sharpness measure aware of batch gradient distribution | Unknown | N/A | |
| VoGE: A Differentiable Volume Renderer using Gaussian Ellipsoids for Analysis-by-Synthesis | Unknown | N/A | |
ICLR 2024
| Title | Author | Code URL | |
|---|---|---|---|
| Robot Fleet Learning via Policy Merging | Unknown | N/A | |
| GNNCert: Deterministic Certification of Graph Neural Networks against Adversarial Perturbations | Unknown | N/A | |
| Eureka: Human-Level Reward Design via Coding Large Language Models | Unknown | N/A | |
| Oracle Efficient Algorithms for Groupwise Regret | Unknown | N/A | |
| Self-Guided Masked Autoencoders for Domain-Agnostic Self-Supervised Learning | Unknown | N/A | |
| Topic Modeling as Multi-Objective Contrastive Optimization | Unknown | N/A | |
| Principled Architecture-aware Scaling of Hyperparameters | Unknown | N/A | |
| Set Learning for Accurate and Calibrated Models | Unknown | N/A | |
| T-MARS: Improving Visual Representations by Circumventing Text Feature Learning | Unknown | N/A | |
| Diffusion Sampling with Momentum for Mitigating Divergence Artifacts | Unknown | N/A | |
| PROGRAM: PROtotype GRAph Model based Pseudo-Label Learning for Test-Time Adaptation | Unknown | N/A | |
| LOQA: Learning with Opponent Q-Learning Awareness | Unknown | N/A | |
| Online Stabilization of Spiking Neural Networks | Unknown | N/A | |
| Meta Continual Learning Revisited: Implicitly Enhancing Online Hessian Approximation via Variance Reduction | Unknown | N/A | |
| Blending Imitation and Reinforcement Learning for Robust Policy Improvement | Unknown | N/A | |
| In-Context Learning Learns Label Relationships but Is Not Conventional Learning | Unknown | N/A | |
| Latent Trajectory Learning for Limited Timestamps under Distribution Shift over Time | Unknown | N/A | |
| ConR: Contrastive Regularizer for Deep Imbalanced Regression | Unknown | N/A | |
| Label-Noise Robust Diffusion Models | Unknown | N/A | |
| Exploring the cloud of feature interaction scores in a Rashomon set | Unknown | N/A | |
| Unveiling the Unseen: Identifiable Clusters in Trained Depthwise Convolutional Kernels | Unknown | N/A | |
| A Simple Romance Between Multi-Exit Vision Transformer and Token Reduction | Unknown | N/A | |
| Sign2GPT: Leveraging Large Language Models for Gloss-Free Sign Language Translation | Unknown | N/A | |
| Learning to Compose: Improving Object Centric Learning by Injecting Compositionality | Unknown | N/A | |
| Equivariant Matrix Function Neural Networks | Unknown | N/A | |
| Sparsistency for inverse optimal transport | Unknown | N/A | |
| Towards Poisoning Fair Representations | Unknown | N/A | |
| Order-Preserving GFlowNets | Unknown | N/A | |
| Zipformer: A faster and better encoder for automatic speech recognition | Unknown | N/A | |
| Looped Transformers are Better at Learning Learning Algorithms | Unknown | N/A | |
| Boosting Graph Anomaly Detection with Adaptive Message Passing | Unknown | N/A | |
| Towards Lossless Dataset Distillation via Difficulty-Aligned Trajectory Matching | Unknown | N/A | |
| MiniGPT-4: Enhancing Vision-Language Understanding with Advanced Large Language Models | Unknown | N/A | |
| Rethinking the Benefits of Steerable Features in 3D Equivariant Graph Neural Networks | Unknown | N/A | |
| Forward $\chi^2$ Divergence Based Variational Importance Sampling | Unknown | N/A | |
| NoiseDiffusion: Correcting Noise for Image Interpolation with Diffusion Models beyond Spherical Linear Interpolation | Unknown | N/A | |
| Effective Data Augmentation With Diffusion Models | Unknown | N/A | |
| Incremental Randomized Smoothing Certification | Unknown | N/A | |
| Training Graph Transformers via Curriculum-Enhanced Attention Distillation | Unknown | N/A | |
| FITS: Modeling Time Series with $10k$ Parameters | Unknown | N/A | |
| Robust Training of Federated Models with Extremely Label Deficiency | Unknown | N/A | |
| Towards Green AI in Fine-tuning Large Language Models via Adaptive Backpropagation | Unknown | N/A | |
| Continuous Field Reconstruction from Sparse Observations with Implicit Neural Networks | Unknown | N/A | |
| DiffusionNAG: Predictor-guided Neural Architecture Generation with Diffusion Models | Unknown | N/A | |
| Robust agents learn causal world models | Unknown | N/A | |
| The Effect of Intrinsic Dataset Properties on Generalization: Unraveling Learning Differences Between Natural and Medical Images | Unknown | N/A | |
| Scaling Convex Neural Networks with Burer-Monteiro Factorization | Unknown | N/A | |
| Differentially Private SGD Without Clipping Bias: An Error-Feedback Approach | Unknown | N/A | |
| Robust Model Based Reinforcement Learning Using $\mathcal{L}_1$ Adaptive Control | Unknown | N/A | |
| GeneOH Diffusion: Towards Generalizable Hand-Object Interaction Denoising via Denoising Diffusion | Unknown | N/A | |
| Reward Model Ensembles Help Mitigate Overoptimization | Unknown | N/A | |
| Mechanistically analyzing the effects of fine-tuning on procedurally defined tasks | Unknown | N/A | |
| Near-Optimal Solutions of Constrained Learning Problems | Unknown | N/A | |
| PARL: A Unified Framework for Policy Alignment in Reinforcement Learning from Human Feedback | Unknown | N/A | |
| Jumanji: a Diverse Suite of Scalable Reinforcement Learning Environments in JAX | Unknown | N/A | |
| Denoising Diffusion via Image-Based Rendering | Unknown | N/A | |
| MMD Graph Kernel: Effective Metric Learning for Graphs via Maximum Mean Discrepancy | Unknown | N/A | |
| On the Hardness of Online Nonconvex Optimization with Single Oracle Feedback | Unknown | N/A | |
| Weaker MVI Condition: Extragradient Methods with Multi-Step Exploration | Unknown | N/A | |
| Domain Randomization via Entropy Maximization | Unknown | N/A | |
| Constraint-Free Structure Learning with Smooth Acyclic Orientations | Unknown | N/A | |
| SEABO: A Simple Search-Based Method for Offline Imitation Learning | Unknown | N/A | |
| Duolando: Follower GPT with Off-Policy Reinforcement Learning for Dance Accompaniment | Unknown | N/A | |
| Consistent4D: Consistent 360° Dynamic Object Generation from Monocular Video | Unknown | N/A | |
| The Reversal Curse: LLMs trained on “A is B” fail to learn “B is A” | Unknown | N/A | |
| Identifying the Risks of LM Agents with an LM-Emulated Sandbox | Unknown | N/A | |
| On Bias-Variance Alignment in Deep Models | Unknown | N/A | |
| InstructDET: Diversifying Referring Object Detection with Generalized Instructions | Unknown | N/A | |
| Patched Denoising Diffusion Models For High-Resolution Image Synthesis | Unknown | N/A | |
| Teach LLMs to Phish: Stealing Private Information from Language Models | Unknown | N/A | |
| Customizable Combination of Parameter-Efficient Modules for Multi-Task Learning | Unknown | N/A | |
| How Do Transformers Learn In-Context Beyond Simple Functions? A Case Study on Learning with Representations | Unknown | N/A | |
| Noise-free Score Distillation | Unknown | N/A | |
| AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning | Unknown | N/A | |
| Efficient Integrators for Diffusion Generative Models | Unknown | N/A | |
| AttEXplore: Attribution for Explanation with model parameters eXploration | Unknown | N/A | |
| Symmetric Basis Convolutions for Learning Lagrangian Fluid Mechanics | Unknown | N/A | |
| You Only Query Once: An Efficient Label-Only Membership Inference Attack | Unknown | N/A | |
| The Marginal Value of Momentum for Small Learning Rate SGD | Unknown | N/A | |
| Fine-tuning Aligned Language Models Compromises Safety, Even When Users Do Not Intend To! | Unknown | N/A | |
| CLIP the Bias: How Useful is Balancing Data in Multimodal Learning? | Unknown | N/A | |
| DreamLLM: Synergistic Multimodal Comprehension and Creation | Unknown | N/A | |
| AffineQuant: Affine Transformation Quantization for Large Language Models | Unknown | N/A | |
| On the Power of the Weisfeiler-Leman Test for Graph Motif Parameters | Unknown | N/A | |
| Multisize Dataset Condensation | Unknown | N/A | |
| MVDream: Multi-view Diffusion for 3D Generation | Unknown | N/A | |
| Graph-based Virtual Sensing from Sparse and Partial Multivariate Observations | Unknown | N/A | |
| Point2SSM: Learning Morphological Variations of Anatomies from Point Clouds | Unknown | N/A | |
| Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation | Unknown | N/A | |
| ZeroFlow: Scalable Scene Flow via Distillation | Unknown | N/A | |
| Leftover Lunch: Advantage-based Offline Reinforcement Learning for Language Models | Unknown | N/A | |
| Gradual Optimization Learning for Conformational Energy Minimization | Unknown | N/A | |
| Communication-Efficient Gradient Descent-Ascent Methods for Distributed Variational Inequalities: Unified Analysis and Local Updates | Unknown | N/A | |
| Post-hoc bias scoring is optimal for fair classification | Unknown | N/A | |
| Conditional Information Bottleneck Approach for Time Series Imputation | Unknown | N/A | |
| Reasoning with Latent Diffusion in Offline Reinforcement Learning | Unknown | N/A | |
| COSA: Concatenated Sample Pretrained Vision-Language Foundation Model | Unknown | N/A | |
| FedWon: Triumphing Multi-domain Federated Learning Without Normalization | Unknown | N/A | |
| Grokking as the transition from lazy to rich training dynamics | Unknown | N/A | |
| Provable and Practical: Efficient Exploration in Reinforcement Learning via Langevin Monte Carlo | Unknown | N/A | |
| Hybrid LLM: Cost-Efficient and Quality-Aware Query Routing | Unknown | N/A | |
| QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Models | Unknown | N/A | |
| InfoCon: Concept Discovery with Generative and Discriminative Informativeness | Unknown | N/A | |
| Sparse Autoencoders Find Highly Interpretable Features in Language Models | Unknown | N/A | |
| Fixed Non-negative Orthogonal Classifier: Inducing Zero-mean Neural Collapse with Feature Dimension Separation | Unknown | N/A | |
| Self-supervised Representation Learning from Random Data Projectors | Unknown | N/A | |
| Dual-Encoders for Extreme Multi-label Classification | Unknown | N/A | |
| Privileged Sensing Scaffolds Reinforcement Learning | Unknown | N/A | |
| Self-contradictory Hallucinations of Large Language Models: Evaluation, Detection and Mitigation | Unknown | N/A | |
| Neural Auto-designer for Enhanced Quantum Kernels | Unknown | N/A | |
| Fully Hyperbolic Convolutional Neural Networks for Computer Vision | Unknown | N/A | |
| Cameras as Rays: Pose Estimation via Ray Diffusion | Unknown | N/A | |
| Open-ended VQA benchmarking of Vision-Language models by exploiting Classification datasets and their semantic hierarchy | Unknown | N/A | |
| Adversarial Supervision Makes Layout-to-Image Diffusion Models Thrive | Unknown | N/A | |
| ResFields: Residual Neural Fields for Spatiotemporal Signals | Unknown | N/A | |
| AgentVerse: Facilitating Multi-Agent Collaboration and Exploring Emergent Behaviors | Unknown | N/A | |
| Prompt Gradient Projection for Continual Learning | Unknown | N/A | |
| Vision-by-Language for Training-Free Compositional Image Retrieval | Unknown | N/A | |
| Parallelizing non-linear sequential models over the sequence length | Unknown | N/A | |
| Single Motion Diffusion | Unknown | N/A | |
| Group Preference Optimization: Few-Shot Alignment of Large Language Models | Unknown | N/A | |
| DeepSPF: Spherical SO(3)-Equivariant Patches for Scan-to-CAD Estimation | Unknown | N/A | |
| Cleanba: A Reproducible and Efficient Distributed Reinforcement Learning Platform | Unknown | N/A | |
| SuRe: Summarizing Retrievals using Answer Candidates for Open-domain QA of LLMs | Unknown | N/A | |
| Feature Collapse | Unknown | N/A | |
| WOODS: Benchmarks for Out-of-Distribution Generalization in Time Series | Unknown | N/A | |
| HypeBoy: Generative Self-Supervised Representation Learning on Hypergraphs | Unknown | N/A | |
| Multi-Resolution Diffusion Models for Time Series Forecasting | Unknown | N/A | |
| In-context Exploration-Exploitation for Reinforcement Learning | Unknown | N/A | |
| Non-negative Contrastive Learning | Unknown | N/A | |
| A Plug-and-Play Image Registration Network | Unknown | N/A | |
| Model Merging by Uncertainty-Based Gradient Matching | Unknown | N/A | |
| Idempotence and Perceptual Image Compression | Unknown | N/A | |
| DFormer: Rethinking RGBD Representation Learning for Semantic Segmentation | Unknown | N/A | |
| Does Progress On Object Recognition Benchmarks Improve Generalization on Crowdsourced, Global Data? | Unknown | N/A | |
| Zero-Shot Robustification of Zero-Shot Models | Unknown | N/A | |
| Uncertainty-aware Graph-based Hyperspectral Image Classification | Unknown | N/A | |
| SOTOPIA: Interactive Evaluation for Social Intelligence in Language Agents | Unknown | N/A | |
| Towards image compression with perfect realism at ultra-low bitrates | Unknown | N/A | |
| Learning Optimal Contracts: How to Exploit Small Action Spaces | Unknown | N/A | |
| Transferring Labels to Solve Annotation Mismatches Across Object Detection Datasets | Unknown | N/A | |
| Bias Runs Deep: Implicit Reasoning Biases in Persona-Assigned LLMs | Unknown | N/A | |
| PAC-FNO: Parallel-Structured All-Component Fourier Neural Operators for Recognizing Low-Quality Images | Unknown | N/A | |
| Self-Supervised High Dynamic Range Imaging with Multi-Exposure Images in Dynamic Scenes | Unknown | N/A | |
| An Efficient Membership Inference Attack for the Diffusion Model by Proximal Initialization | Unknown | N/A | |
| FLASK: Fine-grained Language Model Evaluation based on Alignment Skill Sets | Unknown | N/A | |
| BayesDiff: Estimating Pixel-wise Uncertainty in Diffusion via Bayesian Inference | Unknown | N/A | |
| Test-Time Training on Nearest Neighbors for Large Language Models | Unknown | N/A | |
| STanHop: Sparse Tandem Hopfield Model for Memory-Enhanced Time Series Prediction | Unknown | N/A | |
| Unified Projection-Free Algorithms for Adversarial DR-Submodular Optimization | Unknown | N/A | |
| Critical Learning Periods Emerge Even in Deep Linear Networks | Unknown | N/A | |
| Understanding the robustness difference between stochastic gradient descent and adaptive gradient methods | Unknown | N/A | |
| OVOR: OnePrompt with Virtual Outlier Regularization for Rehearsal-Free Class-Incremental Learning | Unknown | N/A | |
| Accelerated Convergence of Stochastic Heavy Ball Method under Anisotropic Gradient Noise | Unknown | N/A | |
| Leveraging Unpaired Data for Vision-Language Generative Models via Cycle Consistency | Unknown | N/A | |
| Mastering Symbolic Operations: Augmenting Language Models with Compiled Neural Networks | Unknown | N/A | |
| InfoBatch: Lossless Training Speed Up by Unbiased Dynamic Data Pruning | Unknown | N/A | |
| Str2Str: A Score-based Framework for Zero-shot Protein Conformation Sampling | Unknown | N/A | |
| PRES: Toward Scalable Memory-Based Dynamic Graph Neural Networks | Unknown | N/A | |
| The Lipschitz-Variance-Margin Tradeoff for Enhanced Randomized Smoothing | Unknown | N/A | |
| CoRe-GD: A Hierarchical Framework for Scalable Graph Visualization with GNNs | Unknown | N/A | |
| f-FERM: A Scalable Framework for Robust Fair Empirical Risk Minimization | Unknown | N/A | |
| Mitigating the Curse of Dimensionality for Certified Robustness via Dual Randomized Smoothing | Unknown | N/A | |
| Decoupled Marked Temporal Point Process using Neural Ordinary Differential Equations | Unknown | N/A | |
| Elastic Feature Consolidation For Cold Start Exemplar-Free Incremental Learning | Unknown | N/A | |
| BESA: Pruning Large Language Models with Blockwise Parameter-Efficient Sparsity Allocation | Unknown | N/A | |
| Understanding the Robustness of Multi-modal Contrastive Learning to Distribution Shift | Unknown | N/A | |
| Investigating the Benefits of Projection Head for Representation Learning | Unknown | N/A | |
| LLM-CXR: Instruction-Finetuned LLM for CXR Image Understanding and Generation | Unknown | N/A | |
| Stochastic Modified Equations and Dynamics of Dropout Algorithm | Unknown | N/A | |
| Implicit Neural Representations and the Algebra of Complex Wavelets | Unknown | N/A | |
| Fast-DetectGPT: Efficient Zero-Shot Detection of Machine-Generated Text via Conditional Probability Curvature | Unknown | N/A | |
| A Benchmark for Learning to Translate a New Language from One Grammar Book | Unknown | N/A | |
| Conformal Inductive Graph Neural Networks | Unknown | N/A | |
| Continual Momentum Filtering on Parameter Space for Online Test-time Adaptation | Unknown | N/A | |
| Consistency Models as a Rich and Efficient Policy Class for Reinforcement Learning | Unknown | N/A | |
| On the Limitations of Temperature Scaling for Distributions with Overlaps | Unknown | N/A | |
| Unveiling and Manipulating Prompt Influence in Large Language Models | Unknown | N/A | |
| FedP3: Federated Personalized and Privacy-friendly Network Pruning under Model Heterogeneity | Unknown | N/A | |
| LRM: Large Reconstruction Model for Single Image to 3D | Unknown | N/A | |
| Generative Sliced MMD Flows with Riesz Kernels | Unknown | N/A | |
| Kosmos-G: Generating Images in Context with Multimodal Large Language Models | Unknown | N/A | |
| AntGPT: Can Large Language Models Help Long-term Action Anticipation from Videos? | Unknown | N/A | |
| BaDExpert: Extracting Backdoor Functionality for Accurate Backdoor Input Detection | Unknown | N/A | |
| Self-Supervised Dataset Distillation for Transfer Learning | Unknown | N/A | |
| Large-scale Training of Foundation Models for Wearable Biosignals | Unknown | N/A | |
| Provably Robust Conformal Prediction with Improved Efficiency | Unknown | N/A | |
| Accelerating Distributed Stochastic Optimization via Self-Repellent Random Walks | Unknown | N/A | |
| Adaptive Federated Learning with Auto-Tuned Clients | Unknown | N/A | |
| LogicMP: A Neuro-symbolic Approach for Encoding First-order Logic Constraints | Unknown | N/A | |
| DP-SGD Without Clipping: The Lipschitz Neural Network Way | Unknown | N/A | |
| Pre-training Sequence, Structure, and Surface Features for Comprehensive Protein Representation Learning | Unknown | N/A | |
| OmniControl: Control Any Joint at Any Time for Human Motion Generation | Unknown | N/A | |
| Leave-one-out Distinguishability in Machine Learning | Unknown | N/A | |
| In defense of parameter sharing for model-compression | Unknown | N/A | |
| Enhancing One-Shot Federated Learning Through Data and Ensemble Co-Boosting | Unknown | N/A | |
| Accelerating Data Generation for Neural Operators via Krylov Subspace Recycling | Unknown | N/A | |
| How Well Do Supervised 3D Models Transfer to Medical Imaging Tasks? | Unknown | N/A | |
| Revisit and Outstrip Entity Alignment: A Perspective of Generative Models | Unknown | N/A | |
| OMNI: Open-endedness via Models of human Notions of Interestingness | Unknown | N/A | |
| Risk Bounds of Accelerated SGD for Overparameterized Linear Regression | Unknown | N/A | |
| The Hidden Language of Diffusion Models | Unknown | N/A | |
| Distinguished In Uniform: Self-Attention Vs. Virtual Nodes | Unknown | N/A | |
| Task Adaptation from Skills: Information Geometry, Disentanglement, and New Objectives for Unsupervised Reinforcement Learning | Unknown | N/A | |
| Consistent Multi-Class Classification from Multiple Unlabeled Datasets | Unknown | N/A | |
| Enhancing Instance-Level Image Classification with Set-Level Labels | Unknown | N/A | |
| Chain-of-Knowledge: Grounding Large Language Models via Dynamic Knowledge Adapting over Heterogeneous Sources | Unknown | N/A | |
| Coeditor: Leveraging Repo-level Diffs for Code Auto-editing | Unknown | N/A | |
| Unifying Feature and Cost Aggregation with Transformers for Semantic and Visual Correspondence | Unknown | N/A | |
| Causality-Inspired Spatial-Temporal Explanations for Dynamic Graph Neural Networks | Unknown | N/A | |
| Hyper Evidential Deep Learning to Quantify Composite Classification Uncertainty | Unknown | N/A | |
| MuSc: Zero-Shot Industrial Anomaly Classification and Segmentation with Mutual Scoring of the Unlabeled Images | Unknown | N/A | |
| Semantic Flow: Learning Semantic Fields of Dynamic Scenes from Monocular Videos | Unknown | N/A | |
| Leveraging Generative Models for Unsupervised Alignment of Neural Time Series Data | Unknown | N/A | |
| TD-MPC2: Scalable, Robust World Models for Continuous Control | Unknown | N/A | |
| AnyText: Multilingual Visual Text Generation and Editing | Unknown | N/A | |
| LLM Augmented LLMs: Expanding Capabilities through Composition | Unknown | N/A | |
| On the Provable Advantage of Unsupervised Pretraining | Unknown | N/A | |
| Pooling Image Datasets with Multiple Covariate Shift and Imbalance | Unknown | N/A | |
| Provably Efficient Iterated CVaR Reinforcement Learning with Function Approximation and Human Feedback | Unknown | N/A | |
| Domain-Agnostic Molecular Generation with Chemical Feedback | Unknown | N/A | |
| Towards Optimal Feature-Shaping Methods for Out-of-Distribution Detection | Unknown | N/A | |
| Quality-Diversity through AI Feedback | Unknown | N/A | |
| Relaxing the Additivity Constraints in Decentralized No-Regret High-Dimensional Bayesian Optimization | Unknown | N/A | |
| Bayesian Optimization through Gaussian Cox Process Models for Spatio-temporal Data | Unknown | N/A | |
| Safe and Robust Watermark Injection with a Single OoD Image | Unknown | N/A | |
| TOSS: High-quality Text-guided Novel View Synthesis from a Single Image | Unknown | N/A | |
| Elucidating the design space of classifier-guided diffusion generation | Unknown | N/A | |
| Learning Flexible Body Collision Dynamics with Hierarchical Contact Mesh Transformer | Unknown | N/A | |
| Periodicity Decoupling Framework for Long-term Series Forecasting | Unknown | N/A | |
| General Stability Analysis for Zeroth-Order Optimization Algorithms | Unknown | N/A | |
| The Cost of Scaling Down Large Language Models: Reducing Model Size Affects Memory before In-context Learning | Unknown | N/A | |
| Think before you speak: Training Language Models With Pause Tokens | Unknown | N/A | |
| DP-OPT: Make Large Language Model Your Privacy-Preserving Prompt Engineer | Unknown | N/A | |
| Context is Environment | Unknown | N/A | |
| Class Incremental Learning via Likelihood Ratio Based Task Prediction | Unknown | N/A | |
| Denoising Diffusion Step-aware Models | Unknown | N/A | |
| Noisy Interpolation Learning with Shallow Univariate ReLU Networks | Unknown | N/A | |
| Initializing Models with Larger Ones | Unknown | N/A | |
| HyperHuman: Hyper-Realistic Human Generation with Latent Structural Diffusion | Unknown | N/A | |
| AnomalyCLIP: Object-agnostic Prompt Learning for Zero-shot Anomaly Detection | Unknown | N/A | |
| Benchmarking and Improving Generator-Validator Consistency of Language Models | Unknown | N/A | |
| Language-Interfaced Tabular Oversampling via Progressive Imputation and Self-Authentication | Unknown | N/A | |
| Counterfactual Density Estimation using Kernel Stein Discrepancies | Unknown | N/A | |
| EQA-MX: Embodied Question Answering using Multimodal Expression | Unknown | N/A | |
| Proper Laplacian Representation Learning | Unknown | N/A | |
| Scaling Autoregressive Models for Content-Rich Text-to-Image Generation | Unknown | N/A | |
| Learning Stackable and Skippable LEGO Bricks for Efficient, Reconfigurable, and Variable-Resolution Diffusion Modeling | Unknown | N/A | |
| DNABERT-2: Efficient Foundation Model and Benchmark For Multi-Species Genomes | Unknown | N/A | |
| Towards Foundational Models for Molecular Learning on Large-Scale Multi-Task Datasets | Unknown | N/A | |
| Adaptive Regularization of Representation Rank as an Implicit Constraint of Bellman Equation | Unknown | N/A | |
| PanoDiffusion: 360-degree Panorama Outpainting via Diffusion | Unknown | N/A | |
| PerceptionCLIP: Visual Classification by Inferring and Conditioning on Contexts | Unknown | N/A | |
| Making LLaMA SEE and Draw with SEED Tokenizer | Unknown | N/A | |
| Removing Biases from Molecular Representations via Information Maximization | Unknown | N/A | |
| Online Continual Learning for Interactive Instruction Following Agents | Unknown | N/A | |
| Human Motion Diffusion as a Generative Prior | Unknown | N/A | |
| Closing the Curious Case of Neural Text Degeneration | Unknown | N/A | |
| GTA: A Geometry-Aware Attention Mechanism for Multi-View Transformers | Unknown | N/A | |
| Is attention required for ICL? Exploring the Relationship Between Model Architecture and In-Context Learning Ability | Unknown | N/A | |
| Stable Anisotropic Regularization | Unknown | N/A | |
| A Framework for Inference Inspired by Human Memory Mechanisms | Unknown | N/A | |
| SF(DA)$^2$: Source-free Domain Adaptation Through the Lens of Data Augmentation | Unknown | N/A | |
| PBADet: A One-Stage Anchor-Free Approach for Part-Body Association | Unknown | N/A | |
| Entropy is not Enough for Test-Time Adaptation: From the Perspective of Disentangled Factors | Unknown | N/A | |
| A Characterization Theorem for Equivariant Networks with Point-wise Activations | Unknown | N/A | |
| Mirage: Model-agnostic Graph Distillation for Graph Classification | Unknown | N/A | |
| Exploring the Common Appearance-Boundary Adaptation for Nighttime Optical Flow | Unknown | N/A | |
| Faster Approximation of Probabilistic and Distributional Values via Least Squares | Unknown | N/A | |
| Accelerating Sinkhorn algorithm with sparse Newton iterations | Unknown | N/A | |
| Circuit Component Reuse Across Tasks in Transformer Language Models | Unknown | N/A | |
| CORN: Contact-based Object Representation for Nonprehensile Manipulation of General Unseen Objects | Unknown | N/A | |
| Neural-Symbolic Recursive Machine for Systematic Generalization | Unknown | N/A | |
| FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor Cores | Unknown | N/A | |
| Towards Establishing Guaranteed Error for Learned Database Operations | Unknown | N/A | |
| Continuous Invariance Learning | Unknown | N/A | |
| Learning to solve Class-Constrained Bin Packing Problems via Encoder-Decoder Model | Unknown | N/A | |
| EmerDiff: Emerging Pixel-level Semantic Knowledge in Diffusion Models | Unknown | N/A | |
| Unmasking and Improving Data Credibility: A Study with Datasets for Training Harmless Language Models | Unknown | N/A | |
| Dichotomy of Early and Late Phase Implicit Biases Can Provably Induce Grokking | Unknown | N/A | |
| IceFormer: Accelerated Inference with Long-Sequence Transformers on CPUs | Unknown | N/A | |
| Dynamic Discounted Counterfactual Regret Minimization | Unknown | N/A | |
| Graphical Multioutput Gaussian Process with Attention | Unknown | N/A | |
| RetroBridge: Modeling Retrosynthesis with Markov Bridges | Unknown | N/A | |
| DrS: Learning Reusable Dense Rewards for Multi-Stage Tasks | Unknown | N/A | |
| Tensor Trust: Interpretable Prompt Injection Attacks from an Online Game | Unknown | N/A | |
| Few-Shot Detection of Machine-Generated Text using Style Representations | Unknown | N/A | |
| Improved Analysis of Sparse Linear Regression in Local Differential Privacy Model | Unknown | N/A | |
| Visual Data-Type Understanding does not emerge from scaling Vision-Language Models | Unknown | N/A | |
| Enhancing Tail Performance in Extreme Classifiers by Label Variance Reduction | Unknown | N/A | |
| Reward Design for Justifiable Sequential Decision-Making | Unknown | N/A | |
| Rethinking Complex Queries on Knowledge Graphs with Neural Link Predictors | Unknown | N/A | |
| Generative Pre-training for Speech with Flow Matching | Unknown | N/A | |
| Enhancing Neural Training via a Correlated Dynamics Model | Unknown | N/A | |
| Modeling state-dependent communication between brain regions with switching nonlinear dynamical systems | Unknown | N/A | |
| Some Fundamental Aspects about Lipschitz Continuity of Neural Networks | Unknown | N/A | |
| Structured Video-Language Modeling with Temporal Grouping and Spatial Grounding | Unknown | N/A | |
| Certified Adversarial Robustness for Rate Encoded Spiking Neural Networks | Unknown | N/A | |
| Graph Metanetworks for Processing Diverse Neural Architectures | Unknown | N/A | |
| Structuring Representation Geometry with Rotationally Equivariant Contrastive Learning | Unknown | N/A | |
| TEDDY: Trimming Edges with Degree-based Discrimination Strategy | Unknown | N/A | |
| A differentiable brain simulator bridging brain simulation and brain-inspired computing | Unknown | N/A | |
| From Zero to Turbulence: Generative Modeling for 3D Flow Simulation | Unknown | N/A | |
| RepoBench: Benchmarking Repository-Level Code Auto-Completion Systems | Unknown | N/A | |
| AlignDiff: Aligning Diverse Human Preferences via Behavior-Customisable Diffusion Model | Unknown | N/A | |
| Physics-Regulated Deep Reinforcement Learning: Invariant Embeddings | Unknown | N/A | |
| FedLoGe: Joint Local and Generic Federated Learning under Long-tailed Data | Unknown | N/A | |
| Efficient Planning with Latent Diffusion | Unknown | N/A | |
| PAC Prediction Sets Under Label Shift | Unknown | N/A | |
| Structural Fairness-aware Active Learning for Graph Neural Networks | Unknown | N/A | |
| Scalable Neural Network Kernels | Unknown | N/A | |
| Language Model Detectors Are Easily Optimized Against | Unknown | N/A | |
| How I Warped Your Noise: a Temporally-Correlated Noise Prior for Diffusion Models | Unknown | N/A | |
| Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning | Unknown | N/A | |
| Zero-Mean Regularized Spectral Contrastive Learning: Implicitly Mitigating Wrong Connections in Positive-Pair Graphs | Unknown | N/A | |
| Neural Field Classifiers via Target Encoding and Classification Loss | Unknown | N/A | |
| Variance-enlarged Poisson Learning for Graph-based Semi-Supervised Learning with Extremely Sparse Labeled Data | Unknown | N/A | |
| Solving Homogeneous and Heterogeneous Cooperative Tasks with Greedy Sequential Execution | Unknown | N/A | |
| Coordinate-Aware Modulation for Neural Fields | Unknown | N/A | |
| Neuroformer: Multimodal and Multitask Generative Pretraining for Brain Data | Unknown | N/A | |
| Do Generated Data Always Help Contrastive Learning? | Unknown | N/A | |
| Rigid Protein-Protein Docking via Equivariant Elliptic-Paraboloid Interface Prediction | Unknown | N/A | |
| An improved analysis of per-sample and per-update clipping in federated learning | Unknown | N/A | |
| Evoke: Evoking Critical Thinking Abilities in LLMs via Reviewer-Author Prompt Editing | Unknown | N/A | |
| CoVLM: Composing Visual Entities and Relationships in Large Language Models Via Communicative Decoding | Unknown | N/A | |
| WizardCoder: Empowering Code Large Language Models with Evol-Instruct | Unknown | N/A | |
| Numerical Accounting in the Shuffle Model of Differential Privacy | Unknown | N/A | |
| Language Model Decoding as Direct Metrics Optimization | Unknown | N/A | |
| On-Policy Distillation of Language Models: Learning from Self-Generated Mistakes | Unknown | N/A | |
| Provable Benefits of Multi-task RL under Non-Markovian Decision Making Processes | Unknown | N/A | |
| Like Oil and Water: Group Robustness Methods and Poisoning Defenses May Be at Odds | Unknown | N/A | |
| Geometrically Aligned Transfer Encoder for Inductive Transfer in Regression Tasks | Unknown | N/A | |
| Generative Learning for Solving Non-Convex Problem with Multi-Valued Input-Solution Mapping | Unknown | N/A | |
| Knowledge Fusion of Large Language Models | Unknown | N/A | |
| Learning Multi-Agent Communication from Graph Modeling Perspective | Unknown | N/A | |
| Evaluating Language Model Agency Through Negotiations | Unknown | N/A | |
| SCHEMA: State CHangEs MAtter for Procedure Planning in Instructional Videos | Unknown | N/A | |
| Overthinking the Truth: Understanding how Language Models Process False Demonstrations | Unknown | N/A | |
| BTR: Binary Token Representations for Efficient Retrieval Augmented Language Models | Unknown | N/A | |
| The mechanistic basis of data dependence and abrupt learning in an in-context classification task | Unknown | N/A | |
| A Probabilistic Framework for Modular Continual Learning | Unknown | N/A | |
| Uni-O4: Unifying Online and Offline Deep Reinforcement Learning with Multi-Step On-Policy Optimization | Unknown | N/A | |
| Universal Backdoor Attacks | Unknown | N/A | |
| DDMI: Domain-agnostic Latent Diffusion Models for Synthesizing High-Quality Implicit Neural Representations | Unknown | N/A | |
| Sentence-level Prompts Benefit Composed Image Retrieval | Unknown | N/A | |
| Can LLMs Keep a Secret? Testing Privacy Implications of Language Models via Contextual Integrity Theory | Unknown | N/A | |
| Are Human-generated Demonstrations Necessary for In-context Learning? | Unknown | N/A | |
| Enabling Language Models to Implicitly Learn Self-Improvement | Unknown | N/A | |
| L2MAC: Large Language Model Automatic Computer for Extensive Code Generation | Unknown | N/A | |
| Light-MILPopt: Solving Large-scale Mixed Integer Linear Programs with Lightweight Optimizer and Small-scale Training Dataset | Unknown | N/A | |
| Deep SE(3)-Equivariant Geometric Reasoning for Precise Placement Tasks | Unknown | N/A | |
| A Restoration Network as an Implicit Prior | Unknown | N/A | |
| SineNet: Learning Temporal Dynamics in Time-Dependent Partial Differential Equations | Unknown | N/A | |
| Inversion by Direct Iteration: An Alternative to Denoising Diffusion for Image Restoration | Unknown | N/A | |
| Scaling for Training Time and Post-hoc Out-of-distribution Detection Enhancement | Unknown | N/A | |
| Test-time Adaptation against Multi-modal Reliability Bias | Unknown | N/A | |
| INSIDE: LLMs' Internal States Retain the Power of Hallucination Detection | Unknown | N/A | |
| GOAt: Explaining Graph Neural Networks via Graph Output Attribution | Unknown | N/A | |
| RA-DIT: Retrieval-Augmented Dual Instruction Tuning | Unknown | N/A | |
| A Real-World WebAgent with Planning, Long Context Understanding, and Program Synthesis | Unknown | N/A | |
| Directly Fine-Tuning Diffusion Models on Differentiable Rewards | Unknown | N/A | |
| OpenNeRF: Open Set 3D Neural Scene Segmentation with Pixel-Wise Features and Rendered Novel Views | Unknown | N/A | |
| Efficiently Computing Similarities to Private Datasets | Unknown | N/A | |
| Domain constraints improve risk prediction when outcome data is missing | Unknown | N/A | |
| Towards Few-Shot Adaptation of Foundation Models via Multitask Finetuning | Unknown | N/A | |
| Multi-granularity Correspondence Learning from Long-term Noisy Videos | Unknown | N/A | |
| LEAP: Liberate Sparse-View 3D Modeling from Camera Poses | Unknown | N/A | |
| Outliers with Opposing Signals Have an Outsized Effect on Neural Network Optimization | Unknown | N/A | |
| Beyond Stationarity: Convergence Analysis of Stochastic Softmax Policy Gradient Methods | Unknown | N/A | |
| Skill or Luck? Return Decomposition via Advantage Functions | Unknown | N/A | |
| Transport meets Variational Inference: Controlled Monte Carlo Diffusions | Unknown | N/A | |
| Unsupervised Order Learning | Unknown | N/A | |
| Enhancing Transferable Adversarial Attacks on Vision Transformers through Gradient Normalization Scaling and High-Frequency Adaptation | Unknown | N/A | |
| Rethinking the symmetry-preserving circuits for constrained variational quantum algorithms | Unknown | N/A | |
| KITAB: Evaluating LLMs on Constraint Satisfaction for Information Retrieval | Unknown | N/A | |
| Attention Satisfies: A Constraint-Satisfaction Lens on Factual Errors of Language Models | Unknown | N/A | |
| Retrieval-Guided Reinforcement Learning for Boolean Circuit Minimization | Unknown | N/A | |
| Beyond Worst-case Attacks: Robust RL with Adaptive Defense via Non-dominated Policies | Unknown | N/A | |
| Rethinking Adversarial Policies: A Generalized Attack Formulation and Provable Defense in RL | Unknown | N/A | |
| Compressed Context Memory for Online Language Model Interaction | Unknown | N/A | |
| Revisiting Link Prediction: a data perspective | Unknown | N/A | |
| MINDE: Mutual Information Neural Diffusion Estimation | Unknown | N/A | |
| Realistic Evaluation of Semi-supervised Learning Algorithms in Open Environments | Unknown | N/A | |
| Achieving Human Parity in Content-Grounded Datasets Generation | Unknown | N/A | |
| Contrastive Difference Predictive Coding | Unknown | N/A | |
| Efficient-3Dim: Learning a Generalizable Single-image Novel-view Synthesizer in One Day | Unknown | N/A | |
| Unified Generative Modeling of 3D Molecules with Bayesian Flow Networks | Unknown | N/A | |
| Image2Sentence based Asymmetrical Zero-shot Composed Image Retrieval | Unknown | N/A | |
| Sample Efficient Myopic Exploration Through Multitask Reinforcement Learning with Diverse Tasks | Unknown | N/A | |
| FairerCLIP: Debiasing CLIP's Zero-Shot Predictions using Functions in RKHSs | Unknown | N/A | |
| BarLeRIa: An Efficient Tuning Framework for Referring Image Segmentation | Unknown | N/A | |
| Asymptotically Free Sketched Ridge Ensembles: Risks, Cross-Validation, and Tuning | Unknown | N/A | |
| CADS: Unleashing the Diversity of Diffusion Models through Condition-Annealed Sampling | Unknown | N/A | |
| Provably Efficient CVaR RL in Low-rank MDPs | Unknown | N/A | |
| Symphony: Symmetry-Equivariant Point-Centered Spherical Harmonics for 3D Molecule Generation | Unknown | N/A | |
| Rethinking Channel Dependence for Multivariate Time Series Forecasting: Learning from Leading Indicators | Unknown | N/A | |
| Improved Probabilistic Image-Text Representations | Unknown | N/A | |
| MetaTool Benchmark for Large Language Models: Deciding Whether to Use Tools and Which to Use | Unknown | N/A | |
| MMICL: Empowering Vision-language Model with Multi-Modal In-Context Learning | Unknown | N/A | |
| Spatio-Temporal Approximation: A Training-Free SNN Conversion for Transformers | Unknown | N/A | |
| Modulated Phase Diffusor: Content-Oriented Feature Synthesis for Detecting Unknown Objects | Unknown | N/A | |
| LanguageBind: Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment | Unknown | N/A | |
| MogaNet: Multi-order Gated Aggregation Network | Unknown | N/A | |
| Deep Generative Clustering with Multimodal Diffusion Variational Autoencoders | Unknown | N/A | |
| Towards Category Unification of 3D Single Object Tracking on Point Clouds | Unknown | N/A | |
| SemiReward: A General Reward Model for Semi-supervised Learning | Unknown | N/A | |
| Learning Hierarchical Polynomials with Three-Layer Neural Networks | Unknown | N/A | |
| Graph Neural Networks for Learning Equivariant Representations of Neural Networks | Unknown | N/A | |
| LEAD: Min-Max Optimization from a Physical Perspective | Unknown | N/A | |
| Kalman Filter for Online Classification of Non-Stationary Data | Unknown | N/A | |
| AlpaGasus: Training a Better Alpaca with Fewer Data | Unknown | N/A | |
| RDesign: Hierarchical Data-efficient Representation Learning for Tertiary Structure-based RNA Design | Unknown | N/A | |
| What does automatic differentiation compute for neural networks? | Unknown | N/A | |
| Unbiased Watermark for Large Language Models | Unknown | N/A | |
| Self-Consuming Generative Models Go MAD | Unknown | N/A | |
| Out-of-Distribution Detection with Negative Prompts | Unknown | N/A | |
| STARC: A General Framework For Quantifying Differences Between Reward Functions | Unknown | N/A | |
| Learning from Aggregate responses: Instance Level versus Bag Level Loss Functions | Unknown | N/A | |
| Theoretical Understanding of Learning from Adversarial Perturbations | Unknown | N/A | |
| Time Fairness in Online Knapsack Problems | Unknown | N/A | |
| Attacking Perceptual Similarity Metrics | Unknown | N/A | |
| A Robust Differential Neural ODE Optimizer | Unknown | N/A | |
| GPAvatar: Generalizable and Precise Head Avatar from Image(s) | Unknown | N/A | |
| Chain-of-Experts: When LLMs Meet Complex Operations Research Problems | Unknown | N/A | |
| StructComp: Substituting propagation with Structural Compression in Training Graph Contrastive Learning | Unknown | N/A | |
| TopoMLP: A Simple yet Strong Pipeline for Driving Topology Reasoning | Unknown | N/A | |
| Behaviour Distillation | Unknown | N/A | |
| $\infty$-Diff: Infinite Resolution Diffusion with Subsampled Mollified States | Unknown | N/A | |
| Enhancing Human Experience in Human-Agent Collaboration: A Human-Centered Modeling Approach Based on Positive Human Gain | Unknown | N/A | |
| Synapse: Trajectory-as-Exemplar Prompting with Memory for Computer Control | Unknown | N/A | |
| Debiasing Algorithm through Model Adaptation | Unknown | N/A | |
| On gauge freedom, conservativity and intrinsic dimensionality estimation in diffusion models | Unknown | N/A | |
| Clifford Group Equivariant Simplicial Message Passing Networks | Unknown | N/A | |
| A Flexible Generative Model for Heterogeneous Tabular EHR with Missing Modality | Unknown | N/A | |
| Don't Play Favorites: Minority Guidance for Diffusion Models | Unknown | N/A | |
| Mitigating Emergent Robustness Degradation while Scaling Graph Learning | Unknown | N/A | |
| Frequency-Aware Transformer for Learned Image Compression | Unknown | N/A | |
| Causally Aligned Curriculum Learning | Unknown | N/A | |
| From Molecules to Materials: Pre-training Large Generalizable Models for Atomic Property Prediction | Unknown | N/A | |
| ED-NeRF: Efficient Text-Guided Editing of 3D Scene With Latent Space NeRF | Unknown | N/A | |
| The Wasserstein Believer: Learning Belief Updates for Partially Observable Environments through Reliable Latent Space Models | Unknown | N/A | |
| FreeNoise: Tuning-Free Longer Video Diffusion via Noise Rescheduling | Unknown | N/A | |
| CAMBranch: Contrastive Learning with Augmented MILPs for Branching | Unknown | N/A | |
| Cycle Consistency Driven Object Discovery | Unknown | N/A | |
| YaRN: Efficient Context Window Extension of Large Language Models | Unknown | N/A | |
| CoT3DRef: Chain-of-Thoughts Data-Efficient 3D Visual Grounding | Unknown | N/A | |
| AgentBench: Evaluating LLMs as Agents | Unknown | N/A | |
| Understanding and Mitigating the Label Noise in Pre-training on Downstream Tasks | Unknown | N/A | |
| Towards Aligned Layout Generation via Diffusion Model with Aesthetic Constraints | Unknown | N/A | |
| What Makes Good Data for Alignment? A Comprehensive Study of Automatic Data Selection in Instruction Tuning | Unknown | N/A | |
| Leveraging Low-Rank and Sparse Recurrent Connectivity for Robust Closed-Loop Control | Unknown | N/A | |
| Can LLMs Express Their Uncertainty? An Empirical Evaluation of Confidence Elicitation in LLMs | Unknown | N/A | |
| A Multi-Level Framework for Accelerating Training Transformer Models | Unknown | N/A | |
| Supervised Knowledge Makes Large Language Models Better In-context Learners | Unknown | N/A | |
| VQ-TR: Vector Quantized Attention for Time Series Forecasting | Unknown | N/A | |
| Aligning Relational Learning with Lipschitz Fairness | Unknown | N/A | |
| Understanding In-Context Learning from Repetitions | Unknown | N/A | |
| Symmetric Neural-Collapse Representations with Supervised Contrastive Loss: The Impact of ReLU and Batching | Unknown | N/A | |
| Training Unbiased Diffusion Models From Biased Dataset | Unknown | N/A | |
| DreamGaussian: Generative Gaussian Splatting for Efficient 3D Content Creation | Unknown | N/A | |
| Separate and Diffuse: Using a Pretrained Diffusion Model for Better Source Separation | Unknown | N/A | |
| Conversational Drug Editing Using Retrieval and Domain Feedback | Unknown | N/A | |
| Discovering Temporally-Aware Reinforcement Learning Algorithms | Unknown | N/A | |
| A Precise Characterization of SGD Stability Using Loss Surface Geometry | Unknown | N/A | |
| The Expressive Power of Transformers with Chain of Thought | Unknown | N/A | |
| Posterior Sampling Based on Gradient Flows of the MMD with Negative Distance Kernel | Unknown | N/A | |
| ClimODE: Climate and Weather Forecasting with Physics-informed Neural ODEs | Unknown | N/A | |
| Input-gradient space particle inference for neural network ensembles | Unknown | N/A | |
| Towards Reliable and Efficient Backdoor Trigger Inversion via Decoupling Benign Features | Unknown | N/A | |
| Confidence-aware Reward Optimization for Fine-tuning Text-to-Image Models | Unknown | N/A | |
| A Data-Driven Measure of Relative Uncertainty for Misclassification Detection | Unknown | N/A | |
| What Matters to You? Towards Visual Representation Alignment for Robot Learning | Unknown | N/A | |
| MOFI: Learning Image Representations from Noisy Entity Annotated Images | Unknown | N/A | |
| Large Language Models as Automated Aligners for benchmarking Vision-Language Models | Unknown | N/A | |
| Sample-Efficient Linear Representation Learning from Non-IID Non-Isotropic Data | Unknown | N/A | |
| Hypergraph Dynamic System | Unknown | N/A | |
| Elucidating the Exposure Bias in Diffusion Models | Unknown | N/A | |
| SPDER: Semiperiodic Damping-Enabled Object Representation | Unknown | N/A | |
| Beyond Reverse KL: Generalizing Direct Preference Optimization with Diverse Divergence Constraints | Unknown | N/A | |
| MagicDrive: Street View Generation with Diverse 3D Geometry Control | Unknown | N/A | |
| Gaining Wisdom from Setbacks: Aligning Large Language Models via Mistake Analysis | Unknown | N/A | |
| GeoDiffusion: Text-Prompted Geometric Control for Object Detection Data Generation | Unknown | N/A | |
| DATS: Difficulty-Aware Task Sampler for Meta-Learning Physics-Informed Neural Networks | Unknown | N/A | |
| SparseDFF: Sparse-View Feature Distillation for One-Shot Dexterous Manipulation | Unknown | N/A | |
| Leveraging Uncertainty Estimates To Improve Classifier Performance | Unknown | N/A | |
| Learning Planning Abstractions from Language | Unknown | N/A | |
| Experimental Design for Multi-Channel Imaging via Task-Driven Feature Selection | Unknown | N/A | |
| Fast Ensembling with Diffusion Schrödinger Bridge | Unknown | N/A | |
| Benign Overfitting and Grokking in ReLU Networks for XOR Cluster Data | Unknown | N/A | |
| L2P-MIP: Learning to Presolve for Mixed Integer Programming | Unknown | N/A | |
| Neurosymbolic Grounding for Compositional World Models | Unknown | N/A | |
| Momentum Benefits Non-iid Federated Learning Simply and Provably | Unknown | N/A | |
| Making Pre-trained Language Models Great on Tabular Prediction | Unknown | N/A | |
| Multimodal Patient Representation Learning with Missing Modalities and Labels | Unknown | N/A | |
| Feature-aligned N-BEATS with Sinkhorn divergence | Unknown | N/A | |
| Exploiting Causal Graph Priors with Posterior Sampling for Reinforcement Learning | Unknown | N/A | |
| Video Decomposition Prior: Editing Videos Layer by Layer | Unknown | N/A | |
| New Insight of Variance reduce in Zero-Order Hard-Thresholding: Mitigating Gradient Error and Expansivity Contradictions | Unknown | N/A | |
| Subtractive Mixture Models via Squaring: Representation and Learning | Unknown | N/A | |
| AutoDAN: Generating Stealthy Jailbreak Prompts on Aligned Large Language Models | Unknown | N/A | |
| Bridging Vision and Language Spaces with Assignment Prediction | Unknown | N/A | |
| Modulate Your Spectrum in Self-Supervised Learning | Unknown | N/A | |
| LLM-Assisted Code Cleaning For Training Accurate Code Generators | Unknown | N/A | |
| Implicit bias of SGD in $L_2$-regularized linear DNNs: One-way jumps from high to low rank | Unknown | N/A | |
| Simple Minimax Optimal Byzantine Robust Algorithm for Nonconvex Objectives with Uniform Gradient Heterogeneity | Unknown | N/A | |
| On the Variance of Neural Network Training with respect to Test Sets and Distributions | Unknown | N/A | |
| Graph Parsing Networks | Unknown | N/A | |
| Optimal transport based adversarial patch to leverage large scale attack transferability | Unknown | N/A | |
| SalUn: Empowering Machine Unlearning via Gradient-based Weight Saliency in Both Image Classification and Generation | Unknown | N/A | |
| Orbit-Equivariant Graph Neural Networks | Unknown | N/A | |
| Perceptual Group Tokenizer: Building Perception with Iterative Grouping | Unknown | N/A | |
| R-EDL: Relaxing Nonessential Settings of Evidential Deep Learning | Unknown | N/A | |
| Early Neuron Alignment in Two-layer ReLU Networks with Small Initialization | Unknown | N/A | |
| Bootstrapping Variational Information Pursuit with Large Language and Vision Models for Interpretable Image Classification | Unknown | N/A | |
| Dropout-Based Rashomon Set Exploration for Efficient Predictive Multiplicity Estimation | Unknown | N/A | |
| More is Better: when Infinite Overparameterization is Optimal and Overfitting is Obligatory | Unknown | N/A | |
| An Agnostic View on the Cost of Overfitting in (Kernel) Ridge Regression | Unknown | N/A | |
| Attention-Guided Contrastive Role Representations for Multi-agent Reinforcement Learning | Unknown | N/A | |
| Masked Distillation Advances Self-Supervised Transformer Architecture Search | Unknown | N/A | |
| Learning with a Mole: Transferable latent spatial representations for navigation without reconstruction | Unknown | N/A | |
| End-to-End (Instance)-Image Goal Navigation through Correspondence as an Emergent Phenomenon | Unknown | N/A | |
| Don't Judge by the Look: Towards Motion Coherent Video Representation | Unknown | N/A | |
| Submodular Reinforcement Learning | Unknown | N/A | |
| Conserve-Update-Revise to Cure Generalization and Robustness Trade-off in Adversarial Training | Unknown | N/A | |
| Role of Locality and Weight Sharing in Image-Based Tasks: A Sample Complexity Separation between CNNs, LCNs, and FCNs | Unknown | N/A | |
| Noise Map Guidance: Inversion with Spatial Context for Real Image Editing | Unknown | N/A | |
| A Versatile Causal Discovery Framework to Allow Causally-Related Hidden Variables | Unknown | N/A | |
| Inherently Interpretable Time Series Classification via Multiple Instance Learning | Unknown | N/A | |
| Consistent algorithms for multi-label classification with macro-at-$k$ metrics | Unknown | N/A | |
| KW-Design: Pushing the Limit of Protein Design via Knowledge Refinement | Unknown | N/A | |
| MOTOR: A Time-to-Event Foundation Model For Structured Medical Records | Unknown | N/A | |
| Constructing Adversarial Examples for Vertical Federated Learning: Optimal Client Corruption through Multi-Armed Bandit | Unknown | N/A | |
| Mega-TTS 2: Boosting Prompting Mechanisms for Zero-Shot Speech Synthesis | Unknown | N/A | |
| Light Schrödinger Bridge | Unknown | N/A | |
| Learning with Mixture of Prototypes for Out-of-Distribution Detection | Unknown | N/A | |
| Kill Two Birds with One Stone: Rethinking Data Augmentation for Deep Long-tailed Learning | Unknown | N/A | |
| Can We Evaluate Domain Adaptation Models Without Target-Domain Labels? | Unknown | N/A | |
| Efficient Continual Finite-Sum Minimization | Unknown | N/A | |
| Adversarial Training Should Be Cast as a Non-Zero-Sum Game | Unknown | N/A | |
| Advancing the Lower Bounds: an Accelerated, Stochastic, Second-order Method with Optimal Adaptation to Inexactness | Unknown | N/A | |
| MathVista: Evaluating Mathematical Reasoning of Foundation Models in Visual Contexts | Unknown | N/A | |
| Magnitude Invariant Parametrizations Improve Hypernetwork Learning | Unknown | N/A | |
| Building Cooperative Embodied Agents Modularly with Large Language Models | Unknown | N/A | |
| Towards Robust Multi-Modal Reasoning via Model Selection | Unknown | N/A | |
| The optimality of kernel classifiers in Sobolev space | Unknown | N/A | |
| Proving Test Set Contamination in Black-Box Language Models | Unknown | N/A | |
| SpQR: A Sparse-Quantized Representation for Near-Lossless LLM Weight Compression | Unknown | N/A | |
| Is ImageNet worth 1 video? Learning strong image encoders from 1 long unlabelled video | Unknown | N/A | |
| CRITIC: Large Language Models Can Self-Correct with Tool-Interactive Critiquing | Unknown | N/A | |
| Learning model uncertainty as variance-minimizing instance weights | Unknown | N/A | |
| A Hard-to-Beat Baseline for Training-free CLIP-based Adaptation | Unknown | N/A | |
| Toward effective protection against diffusion-based mimicry through score distillation | Unknown | N/A | |
| Piecewise Linear Parametrization of Policies: Towards Interpretable Deep Reinforcement Learning | Unknown | N/A | |
| Multi-View Representation is What You Need for Point-Cloud Pre-Training | Unknown | N/A | |
| PandaLM: An Automatic Evaluation Benchmark for LLM Instruction Tuning Optimization | Unknown | N/A | |
| Skill Machines: Temporal Logic Skill Composition in Reinforcement Learning | Unknown | N/A | |
| Designing Skill-Compatible AI: Methodologies and Frameworks in Chess | Unknown | N/A | |
| Forward Learning with Top-Down Feedback: Empirical and Analytical Characterization | Unknown | N/A | |
| LDReg: Local Dimensionality Regularized Self-Supervised Learning | Unknown | N/A | |
| Large Brain Model for Learning Generic Representations with Tremendous EEG Data in BCI | Unknown | N/A | |
| Mixture of LoRA Experts | Unknown | N/A | |
| KoLA: Carefully Benchmarking World Knowledge of Large Language Models | Unknown | N/A | |
| CAMIL: Context-Aware Multiple Instance Learning for Cancer Detection and Subtyping in Whole Slide Images | Unknown | N/A | |
| Knowledge Distillation Based on Transformed Teacher Matching | Unknown | N/A | |
| Foundation Model-oriented Robustness: Robust Image Model Evaluation with Pretrained Models | Unknown | N/A | |
| Pseudo-Generalized Dynamic View Synthesis from a Video | Unknown | N/A | |
| Massively Scalable Inverse Reinforcement Learning in Google Maps | Unknown | N/A | |
| Improved sampling via learned diffusions | Unknown | N/A | |
| Learning Polynomial Problems with $SL(2, \mathbb{R})$-Equivariance | Unknown | N/A | |
| On the hardness of learning under symmetries | Unknown | N/A | |
| Talk like a Graph: Encoding Graphs for Large Language Models | Unknown | N/A | |
| Hybrid Distillation: Connecting Masked Autoencoders with Contrastive Learners | Unknown | N/A | |
| Image Inpainting via Iteratively Decoupled Probabilistic Modeling | Unknown | N/A | |
| Maximum Entropy Heterogeneous-Agent Reinforcement Learning | Unknown | N/A | |
| Nougat: Neural Optical Understanding for Academic Documents | Unknown | N/A | |
| Learning Personalized Causally Invariant Representations for Heterogeneous Federated Clients | Unknown | N/A | |
| ExeDec: Execution Decomposition for Compositional Generalization in Neural Program Synthesis | Unknown | N/A | |
| Tree-Planner: Efficient Close-loop Task Planning with Large Language Models | Unknown | N/A | |
| PINNsFormer: A Transformer-Based Framework For Physics-Informed Neural Networks | Unknown | N/A | |
| Deep Temporal Graph Clustering | Unknown | N/A | |
| DV-3DLane: End-to-end Multi-modal 3D Lane Detection with Dual-view Representation | Unknown | N/A | |
| On the Reliability of Watermarks for Large Language Models | Unknown | N/A | |
| Universal Guidance for Diffusion Models | Unknown | N/A | |
| NEFTune: Noisy Embeddings Improve Instruction Finetuning | Unknown | N/A | |
| Entropy Coding of Unordered Data Structures | Unknown | N/A | |
| A Semantic Invariant Robust Watermark for Large Language Models | Unknown | N/A | |
| Plug-and-Play Policy Planner for Large Language Model Powered Dialogue Agents | Unknown | N/A | |
| Long-tailed Diffusion Models with Oriented Calibration | Unknown | N/A | |
| When Do Prompting and Prefix-Tuning Work? A Theory of Capabilities and Limitations | Unknown | N/A | |
| QA-LoRA: Quantization-Aware Low-Rank Adaptation of Large Language Models | Unknown | N/A | |
| Hybrid Internal Model: Learning Agile Legged Locomotion with Simulated Robot Response | Unknown | N/A | |
| Retrieval is Accurate Generation | Unknown | N/A | |
| Illusory Attacks: Information-theoretic detectability matters in adversarial attacks | Unknown | N/A | |
| Consistency-guided Prompt Learning for Vision-Language Models | Unknown | N/A | |
| Differentiable Learning of Generalized Structured Matrices for Efficient Deep Neural Networks | Unknown | N/A | |
| ACRF: Compressing Explicit Neural Radiance Fields via Attribute Compression | Unknown | N/A | |
| Navigating Dataset Documentations in AI: A Large-Scale Analysis of Dataset Cards on HuggingFace | Unknown | N/A | |
| An Analytical Solution to Gauss-Newton Loss for Direct Image Alignment | Unknown | N/A | |
| De novo Protein Design Using Geometric Vector Field Networks | Unknown | N/A | |
| Sample-Efficient Quality-Diversity by Cooperative Coevolution | Unknown | N/A | |
| Weakly-supervised Audio Separation via Bi-modal Semantic Similarity | Unknown | N/A | |
| DecompOpt: Controllable and Decomposed Diffusion Models for Structure-based Molecular Optimization | Unknown | N/A | |
| Adversarial Feature Map Pruning for Backdoor | Unknown | N/A | |
| Toward Student-oriented Teacher Network Training for Knowledge Distillation | Unknown | N/A | |
| Independent-Set Design of Experiments for Estimating Treatment and Spillover Effects under Network Interference | Unknown | N/A | |
| Robust Adversarial Reinforcement Learning via Bounded Rationality Curricula | Unknown | N/A | |
| Plug-and-Play Posterior Sampling under Mismatched Measurement and Prior Models | Unknown | N/A | |
| Backdoor Contrastive Learning via Bi-level Trigger Optimization | Unknown | N/A | |
| Reward-Consistent Dynamics Models are Strongly Generalizable for Offline Reinforcement Learning | Unknown | N/A | |
| Towards Cheaper Inference in Deep Networks with Lower Bit-Width Accumulators | Unknown | N/A | |
| BioBridge: Bridging Biomedical Foundation Models via Knowledge Graphs | Unknown | N/A | |
| Backdoor Secrets Unveiled: Identifying Backdoor Data with Optimized Scaled Prediction Consistency | Unknown | N/A | |
| DiLu: A Knowledge-Driven Approach to Autonomous Driving with Large Language Models | Unknown | N/A | |
| DisenBooth: Identity-Preserving Disentangled Tuning for Subject-Driven Text-to-Image Generation | Unknown | N/A | |
| Parsing neural dynamics with infinite recurrent switching linear dynamical systems | Unknown | N/A | |
| Chain of Hindsight aligns Language Models with Feedback | Unknown | N/A | |
| Discovering Failure Modes of Text-guided Diffusion Models via Adversarial Search | Unknown | N/A | |
| Headless Language Models: Learning without Predicting with Contrastive Weight Tying | Unknown | N/A | |
| MAP IT to Visualize Representations | Unknown | N/A | |
| Waxing-and-Waning: a Generic Similarity-based Framework for Efficient Self-Supervised Learning | Unknown | N/A | |
| Revisiting Data Augmentation in Deep Reinforcement Learning | Unknown | N/A | |
| MINT: Evaluating LLMs in Multi-turn Interaction with Tools and Language Feedback | Unknown | N/A | |
| Doubly Robust Instance-Reweighted Adversarial Training | Unknown | N/A | |
| CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets | Unknown | N/A | |
| Generative Judge for Evaluating Alignment | Unknown | N/A | |
| Demystifying Linear MDPs and Novel Dynamics Aggregation Framework | Unknown | N/A | |
| Language Modeling Is Compression | Unknown | N/A | |
| The All-Seeing Project: Towards Panoptic Visual Recognition and Understanding of the Open World | Unknown | N/A | |
| Progressive3D: Progressively Local Editing for Text-to-3D Content Creation with Complex Semantic Prompts | Unknown | N/A | |
| An Extensible Framework for Open Heterogeneous Collaborative Perception | Unknown | N/A | |
| On the Joint Interaction of Models, Data, and Features | Unknown | N/A | |
| SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis | Unknown | N/A | |
| Accurate and Scalable Estimation of Epistemic Uncertainty for Graph Neural Networks | Unknown | N/A | |
| Interventional Fairness on Partially Known Causal Graphs: A Constrained Optimization Approach | Unknown | N/A | |
| Replay across Experiments: A Natural Extension of Off-Policy RL | Unknown | N/A | |
| FeatUp: A Model-Agnostic Framework for Features at Any Resolution | Unknown | N/A | |
| COCO-Periph: Bridging the Gap Between Human and Machine Perception in the Periphery | Unknown | N/A | |
| Training Socially Aligned Language Models on Simulated Social Interactions | Unknown | N/A | |
| Diverse Projection Ensembles for Distributional Reinforcement Learning | Unknown | N/A | |
| CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction | Unknown | N/A | |
| SafeDreamer: Safe Reinforcement Learning with World Models | Unknown | N/A | |
| Optimal Sample Complexity of Contrastive Learning | Unknown | N/A | |
| CLaM-TTS: Improving Neural Codec Language Model for Zero-Shot Text-to-Speech | Unknown | N/A | |
| Where We Have Arrived in Proving the Emergence of Sparse Interaction Primitives in DNNs | Unknown | N/A | |
| Enhanced Face Recognition using Intra-class Incoherence Constraint | Unknown | N/A | |
| Global Optimality for Non-linear Constrained Restoration Problems via Invexity | Unknown | N/A | |
| Towards Energy Efficient Spiking Neural Networks: An Unstructured Pruning Framework | Unknown | N/A | |
| ADDP: Learning General Representations for Image Recognition and Generation with Alternating Denoising Diffusion Process | Unknown | N/A | |
| GNeRP: Gaussian-guided Neural Reconstruction of Reflective Objects with Noisy Polarization Priors | Unknown | N/A | |
| CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity | Unknown | N/A | |
| Mask-Based Modeling for Neural Radiance Fields | Unknown | N/A | |
| MixSup: Mixed-grained Supervision for Label-efficient LiDAR-based 3D Object Detection | Unknown | N/A | |
| Towards Understanding Sycophancy in Language Models | Unknown | N/A | |
| Algorithms for Caching and MTS with reduced number of predictions | Unknown | N/A | |
| Learning to reconstruct signals from binary measurements alone | Unknown | N/A | |
| Multi-Source Diffusion Models for Simultaneous Music Generation and Separation | Unknown | N/A | |
| $\pi$2vec: Policy Representation with Successor Features | Unknown | N/A | |
| SWAP: Sparse Entropic Wasserstein Regression for Robust Network Pruning | Unknown | N/A | |
| Towards Identifiable Unsupervised Domain Translation: A Diversified Distribution Matching Approach | Unknown | N/A | |
| Conditional Variational Diffusion Models | Unknown | N/A | |
| Can LLM-Generated Misinformation Be Detected? | Unknown | N/A | |
| Pre-training LiDAR-based 3D Object Detectors through Colorization | Unknown | N/A | |
| On the Markov Property of Neural Algorithmic Reasoning: Analyses and Methods | Unknown | N/A | |
| Matrix Manifold Neural Networks++ | Unknown | N/A | |
| Neural Atoms: Propagating Long-range Interaction in Molecular Graphs through Efficient Communication Channel | Unknown | N/A | |
| Sum-Product-Set Networks: Deep Tractable Models for Tree-Structured Graphs | Unknown | N/A | |
| Optimistic Bayesian Optimization with Unknown Constraints | Unknown | N/A | |
| Less is More: One-shot Subgraph Reasoning on Large-scale Knowledge Graphs | Unknown | N/A | |
| Plan-Seq-Learn: Language Model Guided RL for Solving Long Horizon Robotics Tasks | Unknown | N/A | |
| Personalize Segment Anything Model with One Shot | Unknown | N/A | |
| On the generalization capacity of neural networks during generic multimodal reasoning | Unknown | N/A | |
| Rethinking and Extending the Probabilistic Inference Capacity of GNNs | Unknown | N/A | |
| The Expressive Power of Low-Rank Adaptation | Unknown | N/A | |
| Multi-Scale Representations by Varying Window Attention for Semantic Segmentation | Unknown | N/A | |
| Retrieval meets Long Context Large Language Models | Unknown | N/A | |
| Dynamic Layer Tying for Parameter-Efficient Transformers | Unknown | N/A | |
| Simplifying Transformer Blocks | Unknown | N/A | |
| Hiding in Plain Sight: Disguising Data Stealing Attacks in Federated Learning | Unknown | N/A | |
| A Branching Decoder for Set Generation | Unknown | N/A | |
| Enhancing High-Resolution 3D Generation through Pixel-wise Gradient Clipping | Unknown | N/A | |
| 3D Feature Prediction for Masked-AutoEncoder-Based Point Cloud Pretraining | Unknown | N/A | |
| A Graph is Worth 1-bit Spikes: When Graph Contrastive Learning Meets Spiking Neural Networks | Unknown | N/A | |
| Deep Reinforcement Learning Guided Improvement Heuristic for Job Shop Scheduling | Unknown | N/A | |
| Path Choice Matters for Clear Attributions in Path Methods | Unknown | N/A | |
| Instructive Decoding: Instruction-Tuned Large Language Models are Self-Refiner from Noisy Instructions | Unknown | N/A | |
| Estimating Shape Distances on Neural Representations with Limited Samples | Unknown | N/A | |
| DIAGNOSIS: Detecting Unauthorized Data Usages in Text-to-image Diffusion Models | Unknown | N/A | |
| Object centric architectures enable efficient causal representation learning | Unknown | N/A | |
| Pre-Training and Fine-Tuning Generative Flow Networks | Unknown | N/A | |
| Topological data analysis on noisy quantum computers | Unknown | N/A | |
| LiDAR-PTQ: Post-Training Quantization for Point Cloud 3D Object Detection | Unknown | N/A | |
| Variational Inference for SDEs Driven by Fractional Noise | Unknown | N/A | |
| VCR-Graphormer: A Mini-batch Graph Transformer via Virtual Connections | Unknown | N/A | |
| LaneSegNet: Map Learning with Lane Segment Perception for Autonomous Driving | Unknown | N/A | |
| Soft Robust MDPs and Risk-Sensitive MDPs: Equivalence, Policy Gradient, and Sample Complexity | Unknown | N/A | |
| Debiased Collaborative Filtering with Kernel-Based Causal Balancing | Unknown | N/A | |
| Towards Non-Asymptotic Convergence for Diffusion-Based Generative Models | Unknown | N/A | |
| Reverse Forward Curriculum Learning for Extreme Sample and Demo Efficiency | Unknown | N/A | |
| Langevin Monte Carlo for strongly log-concave distributions: Randomized midpoint revisited | Unknown | N/A | |
| S$^2$AC: Energy-Based Reinforcement Learning with Stein Soft Actor Critic | Unknown | N/A | |
| FedHyper: A Universal and Robust Learning Rate Scheduler for Federated Learning with Hypergradient Descent | Unknown | N/A | |
| A Symmetry-Aware Exploration of Bayesian Neural Network Posteriors | Unknown | N/A | |
| SEA: Sparse Linear Attention with Estimated Attention Mask | Unknown | N/A | |
| Depthwise Hyperparameter Transfer in Residual Networks: Dynamics and Scaling Limit | Unknown | N/A | |
| Goodhart's Law in Reinforcement Learning | Unknown | N/A | |
| ReFusion: Improving Natural Language Understanding with Computation-Efficient Retrieval Representation Fusion | Unknown | N/A | |
| An Emulator for Fine-tuning Large Language Models using Small Language Models | Unknown | N/A | |
| Neural Optimal Transport with General Cost Functionals | Unknown | N/A | |
| Bounding the Expected Robustness of Graph Neural Networks Subject to Node Feature Attacks | Unknown | N/A | |
| COLLIE: Systematic Construction of Constrained Text Generation Tasks | Unknown | N/A | |
| SWE-bench: Can Language Models Resolve Real-world Github Issues? | Unknown | N/A | |
| Cross-Modal Contextualized Diffusion Models for Text-Guided Visual Generation and Editing | Unknown | N/A | |
| At Which Training Stage Does Code Data Help LLMs Reasoning? | Unknown | N/A | |
| VQGraph: Rethinking Graph Representation Space for Bridging GNNs and MLPs | Unknown | N/A | |
| Expressive Losses for Verified Robustness via Convex Combinations | Unknown | N/A | |
| Language Model Cascades: Token-Level Uncertainty And Beyond | Unknown | N/A | |
| A Linear Algebraic Framework for Counterfactual Generation | Unknown | N/A | |
| Learning to Reject Meets Long-tail Learning | Unknown | N/A | |
| Faithful Rule Extraction for Differentiable Rule Learning Models | Unknown | N/A | |
| Quantifying Network Similarity using Graph Cumulants | Unknown | N/A | |
| Grokking as a First Order Phase Transition in Two Layer Networks | Unknown | N/A | |
| Feature emergence via margin maximization: case studies in algebraic tasks | Unknown | N/A | |
| JointNet: Extending Text-to-Image Diffusion for Dense Distribution Modeling | Unknown | N/A | |
| HyperAttention: Long-context Attention in Near-Linear Time | Unknown | N/A | |
| An operator preconditioning perspective on training in physics-informed machine learning | Unknown | N/A | |
| Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection | Unknown | N/A | |
| Confidential-DPproof: Confidential Proof of Differentially Private Training | Unknown | N/A | |
| Hindsight PRIORs for Reward Learning from Human Preferences | Unknown | N/A | |
| A Hierarchical Bayesian Model for Few-Shot Meta Learning | Unknown | N/A | |
| The Alignment Problem from a Deep Learning Perspective | Unknown | N/A | |
| Neural Fine-Tuning Search for Few-Shot Learning | Unknown | N/A | |
| FairTune: Optimizing Parameter Efficient Fine Tuning for Fairness in Medical Image Analysis | Unknown | N/A | |
| Align With Purpose: Optimize Desired Properties in CTC Models with a General Plug-and-Play Framework | Unknown | N/A | |
| DIG In: Evaluating Disparities in Image Generations with Indicators for Geographic Diversity | Unknown | N/A | |
| Memorization in Self-Supervised Learning Improves Downstream Generalization | Unknown | N/A | |
| Neural Active Learning Beyond Bandits | Unknown | N/A | |
| SPADE: Semi-supervised Anomaly Detection under Distribution Mismatch | Unknown | N/A | |
| CARD: Channel Aligned Robust Blend Transformer for Time Series Forecasting | Unknown | N/A | |
| Pathformer: Multi-scale Transformers with Adaptive Pathways for Time Series Forecasting | Unknown | N/A | |
| EasyTPP: Towards Open Benchmarking Temporal Point Processes | Unknown | N/A | |
| Explaining Time Series via Contrastive and Locally Sparse Perturbations | Unknown | N/A | |
| Pathologies of Predictive Diversity in Deep Ensembles | Unknown | N/A | |
| Detecting Pretraining Data from Large Language Models | Unknown | N/A | |
| In-Context Pretraining: Language Modeling Beyond Document Boundaries | Unknown | N/A | |
| RECOMP: Improving Retrieval-Augmented LMs with Context Compression and Selective Augmentation | Unknown | N/A | |
| A Fast and Provable Algorithm for Sparse Phase Retrieval | Unknown | N/A | |
| SILO Language Models: Isolating Legal Risk In a Nonparametric Datastore | Unknown | N/A | |
| Multilingual Jailbreak Challenges in Large Language Models | Unknown | N/A | |
| Lemur: Harmonizing Natural Language and Code for Language Agents | Unknown | N/A | |
| One-shot Active Learning Based on Lewis Weight Sampling for Multiple Deep Models | Unknown | N/A | |
| The Blessing of Randomness: SDE Beats ODE in General Diffusion-based Image Editing | Unknown | N/A | |
| Window Attention is Bugged: How not to Interpolate Position Embeddings | Unknown | N/A | |
| Sharpness-Aware Data Poisoning Attack | Unknown | N/A | |
| NetInfoF Framework: Measuring and Exploiting Network Usable Information | Unknown | N/A | |
| Faster Sampling from Log-Concave Densities over Polytopes via Efficient Linear Solvers | Unknown | N/A | |
| Learning to Reject with a Fixed Predictor: Application to Decontextualization | Unknown | N/A | |
| Demystifying Poisoning Backdoor Attacks from a Statistical Perspective | Unknown | N/A | |
| A 2-Dimensional State Space Layer for Spatial Inductive Bias | Unknown | N/A | |
| High-dimensional SGD aligns with emerging outlier eigenspaces | Unknown | N/A | |
| A Unified and General Framework for Continual Learning | Unknown | N/A | |
| Dynamic Neural Response Tuning | Unknown | N/A | |
| Det-CGD: Compressed Gradient Descent with Matrix Stepsizes for Non-Convex Optimization | Unknown | N/A | |
| BatchPrompt: Accomplish more with less | Unknown | N/A | |
| Code Representation Learning at Scale | Unknown | N/A | |
| Magic123: One Image to High-Quality 3D Object Generation Using Both 2D and 3D Diffusion Priors | Unknown | N/A | |
| Forward Learning of Graph Neural Networks | Unknown | N/A | |
| Towards Cross Domain Generalization of Hamiltonian Representation via Meta Learning | Unknown | N/A | |
| AutoChunk: Automated Activation Chunk for Memory-Efficient Deep Learning Inference | Unknown | N/A | |
| One Forward is Enough for Neural Network Training via Likelihood Ratio Method | Unknown | N/A | |
| TorchRL: A data-driven decision-making library for PyTorch | Unknown | N/A | |
| SweetDreamer: Aligning Geometric Priors in 2D diffusion for Consistent Text-to-3D | Unknown | N/A | |
| CellPLM: Pre-training of Cell Language Model Beyond Single Cells | Unknown | N/A | |
| MetaPhysiCa: Improving OOD Robustness in Physics-informed Machine Learning | Unknown | N/A | |
| EquiformerV2: Improved Equivariant Transformer for Scaling to Higher-Degree Representations | Unknown | N/A | |
| Counting Graph Substructures with Graph Neural Networks | Unknown | N/A | |
| Meta-VBO: Utilizing Prior Tasks in Optimizing Risk Measures with Gaussian Processes | Unknown | N/A | |
| Beyond Accuracy: Evaluating Self-Consistency of Code Large Language Models with IdentityChain | Unknown | N/A | |
| Beyond Imitation: Leveraging Fine-grained Quality Signals for Alignment | Unknown | N/A | |
| In-Context Learning through the Bayesian Prism | Unknown | N/A | |
| Improving Domain Generalization with Domain Relations | Unknown | N/A | |
| Analyzing and Mitigating Object Hallucination in Large Vision-Language Models | Unknown | N/A | |
| Zoology: Measuring and Improving Recall in Efficient Language Models | Unknown | N/A | |
| Protein-Ligand Interaction Prior for Binding-aware 3D Molecule Diffusion Models | Unknown | N/A | |
| TRAM: Bridging Trust Regions and Sharpness Aware Minimization | Unknown | N/A | |
| Holistic Evaluation of Language Models | Unknown | N/A | |
| MOFDiff: Coarse-grained Diffusion for Metal-Organic Framework Design | Unknown | N/A | |
| Temporal Generalization Estimation in Evolving Graphs | Unknown | N/A | |
| The Curse of Diversity in Ensemble-Based Exploration | Unknown | N/A | |
| Beam Enumeration: Probabilistic Explainability For Sample Efficient Self-conditioned Molecular Design | Unknown | N/A | |
| DynaVol: Unsupervised Learning for Dynamic Scenes through Object-Centric Voxelization | Unknown | N/A | |
| Self-Alignment with Instruction Backtranslation | Unknown | N/A | |
| Denoising Task Routing for Diffusion Models | Unknown | N/A | |
| Likelihood Training of Cascaded Diffusion Models via Hierarchical Volume-preserving Maps | Unknown | N/A | |
| DreamTime: An Improved Optimization Strategy for Diffusion-Guided 3D Generation | Unknown | N/A | |
| "What Data Benefits My Classifier?" Enhancing Model Performance and Interpretability through Influence-Based Data Selection | Unknown | N/A | |
| TapMo: Shape-aware Motion Generation of Skeleton-free Characters | Unknown | N/A | |
| Embarrassingly Simple Dataset Distillation | Unknown | N/A | |
| Pose Modulated Avatars from Video | Unknown | N/A | |
| SaProt: Protein Language Modeling with Structure-aware Vocabulary | Unknown | N/A | |
| DMV3D: Denoising Multi-view Diffusion Using 3D Large Reconstruction Model | Unknown | N/A | |
| MovingParts: Motion-based 3D Part Discovery in Dynamic Radiance Field | Unknown | N/A | |
| CausalTime: Realistically Generated Time-series for Benchmarking of Causal Discovery | Unknown | N/A | |
| Two-timescale Extragradient for Finding Local Minimax Points | Unknown | N/A | |
| Near-Optimal Quantum Algorithm for Minimizing the Maximal Loss | Unknown | N/A | |
| Contextual Bandits with Online Neural Regression | Unknown | N/A | |
| Predictive auxiliary objectives in deep RL mimic learning in the brain | Unknown | N/A | |
| Mathematical Justification of Hard Negative Mining via Isometric Approximation Theorem | Unknown | N/A | |
| Score Regularized Policy Optimization through Diffusion Behavior | Unknown | N/A | |
| Amortizing intractable inference in large language models | Unknown | N/A | |
| WildChat: 1M ChatGPT Interaction Logs in the Wild | Unknown | N/A | |
| Real-Fake: Effective Training Data Synthesis Through Distribution Matching | Unknown | N/A | |
| Be Aware of the Neighborhood Effect: Modeling Selection Bias under Interference | Unknown | N/A | |
| DAM: Towards a Foundation Model for Forecasting | Unknown | N/A | |
| CPPO: Continual Learning for Reinforcement Learning with Human Feedback | Unknown | N/A | |
| Understanding the Robustness of Randomized Feature Defense Against Query-Based Adversarial Attacks | Unknown | N/A | |
| Let 2D Diffusion Model Know 3D-Consistency for Robust Text-to-3D Generation | Unknown | N/A | |
| Tuning LayerNorm in Attention: Towards Efficient Multi-Modal LLM Finetuning | Unknown | N/A | |
| Grounded Object-Centric Learning | Unknown | N/A | |
| Generalization in diffusion models arises from geometry-adaptive harmonic representations | Unknown | N/A | |
| Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis | Unknown | N/A | |
| Robust NAS under adversarial training: benchmark, theory, and beyond | Unknown | N/A | |
| SE(3)-Stochastic Flow Matching for Protein Backbone Generation | Unknown | N/A | |
| Future Language Modeling from Temporal Document History | Unknown | N/A | |
| AutoCast++: Enhancing World Event Prediction with Zero-shot Ranking-based Context Retrieval | Unknown | N/A | |
| SGD Finds then Tunes Features in Two-Layer Neural Networks with near-Optimal Sample Complexity: A Case Study in the XOR problem | Unknown | N/A | |
| Query-Dependent Prompt Evaluation and Optimization with Offline Inverse RL | Unknown | N/A | |
| ODE Discovery for Longitudinal Heterogeneous Treatment Effects Inference | Unknown | N/A | |
| InstaFlow: One Step is Enough for High-Quality Diffusion-Based Text-to-Image Generation | Unknown | N/A | |
| One-shot Empirical Privacy Estimation for Federated Learning | Unknown | N/A | |
| Learning to Act from Actionless Videos through Dense Correspondences | Unknown | N/A | |
| The False Promise of Imitating Proprietary Language Models | Unknown | N/A | |
| Offline RL with Observation Histories: Analyzing and Improving Sample Complexity | Unknown | N/A | |
| Learning From Simplicial Data Based on Random Walks and 1D Convolutions | Unknown | N/A | |
| Strategic Preys Make Acute Predators: Enhancing Camouflaged Object Detectors by Generating Camouflaged Objects | Unknown | N/A | |
| Revisiting the Last-Iterate Convergence of Stochastic Gradient Methods | Unknown | N/A | |
| Large Language Models to Enhance Bayesian Optimization | Unknown | N/A | |
| PF-LRM: Pose-Free Large Reconstruction Model for Joint Pose and Shape Prediction | Unknown | N/A | |
| Approximating Nash Equilibria in Normal-Form Games via Stochastic Optimization | Unknown | N/A | |
| Towards Transparent Time Series Forecasting | Unknown | N/A | |
| Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective | Unknown | N/A | |
| REValueD: Regularised Ensemble Value-Decomposition for Factorisable Markov Decision Processes | Unknown | N/A | |
| Traveling Waves Encode The Recent Past and Enhance Sequence Learning | Unknown | N/A | |
| Adaptive Sharpness-Aware Pruning for Robust Sparse Networks | Unknown | N/A | |
| TACTiS-2: Better, Faster, Simpler Attentional Copulas for Multivariate Time Series | Unknown | N/A | |
| LLaMA-Adapter: Efficient Fine-tuning of Large Language Models with Zero-initialized Attention | Unknown | N/A | |
| GROOT: Learning to Follow Instructions by Watching Gameplay Videos | Unknown | N/A | |
| Decomposed Diffusion Sampler for Accelerating Large-Scale Inverse Problems | Unknown | N/A | |
| DreamFlow: High-quality text-to-3D generation by Approximating Probability Flow | Unknown | N/A | |
| Unpaired Image-to-Image Translation via Neural Schrödinger Bridge | Unknown | N/A | |
| Exploring Diffusion Time-steps for Unsupervised Representation Learning | Unknown | N/A | |
| DiffAR: Denoising Diffusion Autoregressive Model for Raw Speech Waveform Generation | Unknown | N/A | |
| FROSTER: Frozen CLIP is A Strong Teacher for Open-Vocabulary Action Recognition | Unknown | N/A | |
| Understanding Catastrophic Forgetting in Language Models via Implicit Inference | Unknown | N/A | |
| Training Diffusion Models with Reinforcement Learning | Unknown | N/A | |
| Shadow Cones: A Generalized Framework for Partial Order Embeddings | Unknown | N/A | |
| Curriculum reinforcement learning for quantum architecture search under hardware errors | Unknown | N/A | |
| Continual Learning in the Presence of Spurious Correlations: Analyses and a Simple Baseline | Unknown | N/A | |
| A Lightweight Method for Tackling Unknown Participation Statistics in Federated Averaging | Unknown | N/A | |
| Data Debugging with Shapley Importance over Machine Learning Pipelines | Unknown | N/A | |
| Probabilistically Rewired Message-Passing Neural Networks | Unknown | N/A | |
| Motif: Intrinsic Motivation from Artificial Intelligence Feedback | Unknown | N/A | |
| Empirical Likelihood for Fair Classification | Unknown | N/A | |
| SAFLEX: Self-Adaptive Augmentation via Feature Label Extrapolation | Unknown | N/A | |
| Deep Reinforcement Learning for Modelling Protein Complexes | Unknown | N/A | |
| Hypothesis Search: Inductive Reasoning with Language Models | Unknown | N/A | |
| LRR: Language-Driven Resamplable Continuous Representation against Adversarial Tracking Attacks | Unknown | N/A | |
| Unbalancedness in Neural Monge Maps Improves Unpaired Domain Translation | Unknown | N/A | |
| Learning to Embed Time Series Patches Independently | Unknown | N/A | |
| Horizon-Free Regret for Linear Markov Decision Processes | Unknown | N/A | |
| Soft Contrastive Learning for Time Series | Unknown | N/A | |
| Towards Faithful XAI Evaluation via Generalization-Limited Backdoor Watermark | Unknown | N/A | |
| Mixture of Weak and Strong Experts on Graphs | Unknown | N/A | |
| Learning Energy-Based Models by Cooperative Diffusion Recovery Likelihood | Unknown | N/A | |
| Balancing Act: Constraining Disparate Impact in Sparse Models | Unknown | N/A | |
| Machine Unlearning for Image-to-Image Generative Models | Unknown | N/A | |
| Flow Matching on General Geometries | Unknown | N/A | |
| LipVoicer: Generating Speech from Silent Videos Guided by Lip Reading | Unknown | N/A | |
| Emu: Generative Pretraining in Multimodality | Unknown | N/A | |
| Bespoke Solvers for Generative Flow Models | Unknown | N/A | |
| Candidate Label Set Pruning: A Data-centric Perspective for Deep Partial-label Learning | Unknown | N/A | |
| Kernelised Normalising Flows | Unknown | N/A | |
| Quick-Tune: Quickly Learning Which Pretrained Model to Finetune and How | Unknown | N/A | |
| Workflow Discovery from Dialogues in the Low Data Regime | Unknown | N/A | |
| Stabilizing Backpropagation Through Time to Learn Complex Physics | Unknown | N/A | |
| DittoGym: Learning to Control Soft Shape-Shifting Robots | Unknown | N/A | |
| Advancing Pose-Guided Image Synthesis with Progressive Conditional Diffusion Models | Unknown | N/A | |
| Zero-Shot Robotic Manipulation with Pre-Trained Image-Editing Diffusion Models | Unknown | N/A | |
| Dictionary Contrastive Learning for Efficient Local Supervision without Auxiliary Networks | Unknown | N/A | |
| VersVideo: Leveraging Enhanced Temporal Diffusion Models for Versatile Video Generation | Unknown | N/A | |
| SEPT: Towards Efficient Scene Representation Learning for Motion Prediction | Unknown | N/A | |
| Selective Visual Representations Improve Convergence and Generalization for Embodied AI | Unknown | N/A | |
| Generalization of Scaled Deep ResNets in the Mean-Field Regime | Unknown | N/A | |
| BrainLM: A foundation model for brain activity recordings | Unknown | N/A | |
| MgNO: Efficient Parameterization of Linear Operators via Multigrid | Unknown | N/A | |
| The Generalization Gap in Offline Reinforcement Learning | Unknown | N/A | |
| Principled Federated Domain Adaptation: Gradient Projection and Auto-Weighting | Unknown | N/A | |
| Teaching Language Models to Hallucinate Less with Synthetic Tasks | Unknown | N/A | |
| Fourier Transporter: Bi-Equivariant Robotic Manipulation in 3D | Unknown | N/A | |
| TEST: Text Prototype Aligned Embedding to Activate LLM's Ability for Time Series | Unknown | N/A | |
| Neural Polynomial Gabor Fields for Macro Motion Analysis | Unknown | N/A | |
| To Grok or not to Grok: Disentangling Generalization and Memorization on Corrupted Algorithmic Datasets | Unknown | N/A | |
| Faithful Explanations of Black-box NLP Models Using LLM-generated Counterfactuals | Unknown | N/A | |
| Teaching Large Language Models to Self-Debug | Unknown | N/A | |
| SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction | Unknown | N/A | |
| Decentralized Riemannian Conjugate Gradient Method on the Stiefel Manifold | Unknown | N/A | |
| Energy-Based Concept Bottleneck Models: Unifying Prediction, Concept Intervention, and Probabilistic Interpretations | Unknown | N/A | |
| What's in a Prior? Learned Proximal Networks for Inverse Problems | Unknown | N/A | |
| Large Language Models as Optimizers | Unknown | N/A | |
| Enhancing Contrastive Learning for Ordinal Regression via Ordinal Content Preserved Data Augmentation | Unknown | N/A | |
| SequenceMatch: Imitation Learning for Autoregressive Sequence Modelling with Backtracking | Unknown | N/A | |
| One Step of Gradient Descent is Provably the Optimal In-Context Learner with One Layer of Linear Self-Attention | Unknown | N/A | |
| Large Language Models Cannot Self-Correct Reasoning Yet | Unknown | N/A | |
| Incentivized Truthful Communication for Federated Bandits | Unknown | N/A | |
| Identifying Policy Gradient Subspaces | Unknown | N/A | |
| H-GAP: Humanoid Control with a Generalist Planner | Unknown | N/A | |
| Biased Temporal Convolution Graph Network for Time Series Forecasting with Missing Values | Unknown | N/A | |
| Take a Step Back: Evoking Reasoning via Abstraction in Large Language Models | Unknown | N/A | |
| Mixture-of-Experts Meets Instruction Tuning: A Winning Combination for Large Language Models | Unknown | N/A | |
| IRAD: Implicit Representation-driven Image Resampling against Adversarial Attacks | Unknown | N/A | |
| Long-Short-Range Message-Passing: A Physics-Informed Framework to Capture Non-Local Interaction for Scalable Molecular Dynamics Simulation | Unknown | N/A | |
| Decoding Natural Images from EEG for Object Recognition | Unknown | N/A | |
| On the Analysis of GAN-based Image-to-Image Translation with Gaussian Noise Injection | Unknown | N/A | |
| Explaining Kernel Clustering via Decision Trees | Unknown | N/A | |
| CAS: A Probability-Based Approach for Universal Condition Alignment Score | Unknown | N/A | |
| $\alpha$TC-VAE: On the relationship between Disentanglement and Diversity | Unknown | N/A | |
| Select to Perfect: Imitating desired behavior from large multi-agent data | Unknown | N/A | |
| Unleashing the Potential of Fractional Calculus in Graph Neural Networks with FROND | Unknown | N/A | |
| Understanding Expressivity of GNN in Rule Learning | Unknown | N/A | |
| When Semantic Segmentation Meets Frequency Aliasing | Unknown | N/A | |
| Follow-Up Differential Descriptions: Language Models Resolve Ambiguities for Image Classification | Unknown | N/A | |
| Idempotent Generative Network | Unknown | N/A | |
| ReTaSA: A Nonparametric Functional Estimation Approach for Addressing Continuous Target Shift | Unknown | N/A | |
| DePT: Decomposed Prompt Tuning for Parameter-Efficient Fine-tuning | Unknown | N/A | |
| Integrating Planning and Deep Reinforcement Learning via Automatic Induction of Task Substructures | Unknown | N/A | |
| On the Learnability of Watermarks for Language Models | Unknown | N/A | |
| Achieving Fairness in Multi-Agent MDP Using Reinforcement Learning | Unknown | N/A | |
| Stochastic Gradient Descent for Gaussian Processes Done Right | Unknown | N/A | |
| Achieving Sample and Computational Efficient Reinforcement Learning by Action Space Reduction via Grouping | Unknown | N/A | |
| Tractable Probabilistic Graph Representation Learning with Graph-Induced Sum-Product Networks | Unknown | N/A | |
| Image Inpainting via Tractable Steering of Diffusion Models | Unknown | N/A | |
| Multimarginal Generative Modeling with Stochastic Interpolants | Unknown | N/A | |
| Generalized Neural Sorting Networks with Error-Free Differentiable Swap Functions | Unknown | N/A | |
| Meaning Representations from Trajectories in Autoregressive Models | Unknown | N/A | |
| GIO: Gradient Information Optimization for Training Dataset Selection | Unknown | N/A | |
| Parameter-Efficient Multi-Task Model Fusion with Partial Linearization | Unknown | N/A | |
| SLiMe: Segment Like Me | Unknown | N/A | |
| Language Models Represent Space and Time | Unknown | N/A | |
| M3C: A Framework towards Convergent, Flexible, and Unsupervised Learning of Mixture Graph Matching and Clustering | Unknown | N/A | |
| MathCoder: Seamless Code Integration in LLMs for Enhanced Mathematical Reasoning | Unknown | N/A | |
| BadEdit: Backdooring Large Language Models by Model Editing | Unknown | N/A | |
| One For All: Towards Training One Graph Model For All Classification Tasks | Unknown | N/A | |
| Neural Monge Map estimation and its applications | Unknown | N/A | |
| Rethinking the Power of Graph Canonization in Graph Representation Learning with Stability | Unknown | N/A | |
| Symmetric Mean-field Langevin Dynamics for Distributional Minimax Problems | Unknown | N/A | |
| $t^3$-Variational Autoencoder: Learning Heavy-tailed Data with Student's t and Power Divergence | Unknown | N/A | |
| Consistency Trajectory Models: Learning Probability Flow ODE Trajectory of Diffusion | Unknown | N/A | |
| On the Sample Complexity of Lipschitz Constant Estimation | Unknown | N/A | |
| Increasing Model Capacity for Free: A Simple Strategy for Parameter Efficient Fine-tuning | Unknown | N/A | |
| Image Background Serves as Good Proxy for Out-of-distribution Data | Unknown | N/A | |
| FedTrans: Client-Transparent Utility Estimation for Robust Federated Learning | Unknown | N/A | |
| CLEX: Continuous Length Extrapolation for Large Language Models | Unknown | N/A | |
| Combining Axes Preconditioners through Kronecker Approximation for Deep Learning | Unknown | N/A | |
| The Update-Equivalence Framework for Decision-Time Planning | Unknown | N/A | |
| MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning | Unknown | N/A | |
| Inverse Approximation Theory for Nonlinear Recurrent Neural Networks | Unknown | N/A | |
| Less is More: Fewer Interpretable Region via Submodular Subset Selection | Unknown | N/A | |
| LEMON: Lossless model expansion | Unknown | N/A | |
| Sparse Model Soups: A Recipe for Improved Pruning via Model Averaging | Unknown | N/A | |
| Understanding Addition in Transformers | Unknown | N/A | |
| Deceptive Fairness Attacks on Graphs via Meta Learning | Unknown | N/A | |
| Fair and Efficient Contribution Valuation for Vertical Federated Learning | Unknown | N/A | |
| Federated Text-driven Prompt Generation for Vision-Language Models | Unknown | N/A | |
| Improved Active Learning via Dependent Leverage Score Sampling | Unknown | N/A | |
| Optimal Sketching for Residual Error Estimation for Matrix and Vector Norms | Unknown | N/A | |
| Crystalformer: Infinitely Connected Attention for Periodic Structure Encoding | Unknown | N/A | |
| Large Language Model Cascades with Mixture of Thought Representations for Cost-Efficient Reasoning | Unknown | N/A | |
| PAE: Reinforcement Learning from External Knowledge for Efficient Exploration | Unknown | N/A | |
| CodeChain: Towards Modular Code Generation Through Chain of Self-revisions with Representative Sub-modules | Unknown | N/A | |
| Image Clustering Conditioned on Text Criteria | Unknown | N/A | |
| How to Fine-Tune Vision Models with SGD | Unknown | N/A | |
| Towards 3D Molecule-Text Interpretation in Language Models | Unknown | N/A | |
| CompA: Addressing the Gap in Compositional Reasoning in Audio-Language Models | Unknown | N/A | |
| How Over-Parameterization Slows Down Gradient Descent in Matrix Sensing: The Curses of Symmetry and Initialization | Unknown | N/A | |
| MT-Ranker: Reference-free machine translation evaluation by inter-system ranking | Unknown | N/A | |
| Parametric Augmentation for Time Series Contrastive Learning | Unknown | N/A | |
| UniversalNER: Targeted Distillation from Large Language Models for Open Named Entity Recognition | Unknown | N/A | |
| Towards Robust Out-of-Distribution Generalization Bounds via Sharpness | Unknown | N/A | |
| A Unified Sampling Framework for Solver Searching of Diffusion Probabilistic Models | Unknown | N/A | |
| On the Effect of Batch Size in Byzantine-Robust Distributed Learning | Unknown | N/A | |
| Expressivity of ReLU-Networks under Convex Relaxations | Unknown | N/A | |
| DiffusionSat: A Generative Foundation Model for Satellite Imagery | Unknown | N/A | |
| Denoising Diffusion Bridge Models | Unknown | N/A | |
| Harnessing Explanations: LLM-to-LM Interpreter for Enhanced Text-Attributed Graph Representation Learning | Unknown | N/A | |
| ASMR: Activation-Sharing Multi-Resolution Coordinate Networks for Efficient Inference | Unknown | N/A | |
| Causal Inference with Conditional Front-Door Adjustment and Identifiable Variational Autoencoder | Unknown | N/A | |
| From Bricks to Bridges: Product of Invariances to Enhance Latent Space Communication | Unknown | N/A | |
| Batched Low-Rank Adaptation of Foundation Models | Unknown | N/A | |
| Graph-Constrained Diffusion for End-to-End Path Planning | Unknown | N/A | |
| Leveraging Hyperbolic Embeddings for Coarse-to-Fine Robot Design | Unknown | N/A | |
| Impact of Computation in Integral Reinforcement Learning for Continuous-Time Control | Unknown | N/A | |
| Weakly Supervised Virus Capsid Detection with Image-Level Annotations in Electron Microscopy Images | Unknown | N/A | |
| Generalized Schrödinger Bridge Matching | Unknown | N/A | |
| Repeated Random Sampling for Minimizing the Time-to-Accuracy of Learning | Unknown | N/A | |
| Patches Are All You Need? | Unknown | N/A | |
| Navigating the Design Space of Equivariant Diffusion-Based Generative Models for De Novo 3D Molecule Generation | Unknown | N/A | |
| Breaking Physical and Linguistic Borders: Multilingual Federated Prompt Tuning for Low-Resource Languages | Unknown | N/A | |
| Matcher: Segment Anything with One Shot Using All-Purpose Feature Matching | Unknown | N/A | |
| How Many Pretraining Tasks Are Needed for In-Context Learning of Linear Regression? | Unknown | N/A | |
| Ferret: Refer and Ground Anything Anywhere at Any Granularity | Unknown | N/A | |
| Demonstration-Regularized RL | Unknown | N/A | |
| Manifold Diffusion Fields | Unknown | N/A | |
| Language Control Diffusion: Efficiently Scaling through Space, Time, and Tasks | Unknown | N/A | |
| PromptAgent: Strategic Planning with Language Models Enables Expert-level Prompt Optimization | Unknown | N/A | |
| How do Language Models Bind Entities in Context? | Unknown | N/A | |
| Emergent mechanisms for long timescales depend on training curriculum and affect performance in memory tasks | Unknown | N/A | |
| Structural Estimation of Partially Observed Linear Non-Gaussian Acyclic Model: A Practical Approach with Identifiability | Unknown | N/A | |
| Score-based generative models break the curse of dimensionality in learning a family of sub-Gaussian distributions | Unknown | N/A | |
| FairSeg: A Large-Scale Medical Image Segmentation Dataset for Fairness Learning Using Segment Anything Model with Fair Error-Bound Scaling | Unknown | N/A | |
| Learning Performance-Improving Code Edits | Unknown | N/A | |
| Tensor Programs VI: Feature Learning in Infinite Depth Neural Networks | Unknown | N/A | |
| Privately Aligning Language Models with Reinforcement Learning | Unknown | N/A | |
| Dual RL: Unification and New Methods for Reinforcement and Imitation Learning | Unknown | N/A | |
| Let's Verify Step by Step | Unknown | N/A | |
| Complete and Efficient Graph Transformers for Crystal Material Property Prediction | Unknown | N/A | |
| Uncertainty Quantification via Stable Distribution Propagation | Unknown | N/A | |
| Exploring the Promise and Limits of Real-Time Recurrent Learning | Unknown | N/A | |
| Understanding Convergence and Generalization in Federated Learning through Feature Learning Theory | Unknown | N/A | |
| LLMs Meet VLMs: Boost Open Vocabulary Object Detection with Fine-grained Descriptors | Unknown | N/A | |
| HiGen: Hierarchical Graph Generative Networks | Unknown | N/A | |
| BadChain: Backdoor Chain-of-Thought Prompting for Large Language Models | Unknown | N/A | |
| Human Feedback is not Gold Standard | Unknown | N/A | |
| Bellman Optimal Stepsize Straightening of Flow-Matching Models | Unknown | N/A | |
| Embodied Active Defense: Leveraging Recurrent Feedback to Counter Adversarial Patches | Unknown | N/A | |
| Learning Implicit Representation for Reconstructing Articulated Objects | Unknown | N/A | |
| $\texttt{NAISR}$: A 3D Neural Additive Model for Interpretable Shape Representation | Unknown | N/A | |
| Towards Offline Opponent Modeling with In-context Learning | Unknown | N/A | |
| MVSFormer++: Revealing the Devil in Transformer's Details for Multi-View Stereo | Unknown | N/A | |
| SKILL-MIX: a Flexible and Expandable Family of Evaluations for AI Models | Unknown | N/A | |
| LoTa-Bench: Benchmarking Language-oriented Task Planners for Embodied Agents | Unknown | N/A | |
| Hybrid Directional Graph Neural Network for Molecules | Unknown | N/A | |
| DAFA: Distance-Aware Fair Adversarial Training | Unknown | N/A | |
| Fast-ELECTRA for Efficient Pre-training | Unknown | N/A | |
| Course Correcting Koopman Representations | Unknown | N/A | |
| Generating Pragmatic Examples to Train Neural Program Synthesizers | Unknown | N/A | |
| Data Filtering Networks | Unknown | N/A | |
| Parameter-Efficient Orthogonal Finetuning via Butterfly Factorization | Unknown | N/A | |
| Learning with Language-Guided State Abstractions | Unknown | N/A | |
| On Harmonizing Implicit Subpopulations | Unknown | N/A | |
| On Error Propagation of Diffusion Models | Unknown | N/A | |
| R&B: Region and Boundary Aware Zero-shot Grounded Text-to-image Generation | Unknown | N/A | |
| ScaleCrafter: Tuning-free Higher-Resolution Visual Generation with Diffusion Models | Unknown | N/A | |
| Out-Of-Domain Unlabeled Data Improves Generalization | Unknown | N/A | |
| Poly-View Contrastive Learning | Unknown | N/A | |
| Successor Heads: Recurring, Interpretable Attention Heads In The Wild | Unknown | N/A | |
| Image Clustering via the Principle of Rate Reduction in the Age of Pretrained Models | Unknown | N/A | |
| On the Expressivity of Objective-Specification Formalisms in Reinforcement Learning | Unknown | N/A | |
| Latent Representation and Simulation of Markov Processes via Time-Lagged Information Bottleneck | Unknown | N/A | |
| DreamClean: Restoring Clean Image Using Deep Diffusion Prior | Unknown | N/A | |
| Enhancing Neural Subset Selection: Integrating Background Information into Set Representations | Unknown | N/A | |
| Probabilistic Adaptation of Black-Box Text-to-Video Models | Unknown | N/A | |
| DREAM: Dual Structured Exploration with Mixup for Open-set Graph Domain Adaption | Unknown | N/A | |
| Partitioning Message Passing for Graph Fraud Detection | Unknown | N/A | |
| Meta-Evolve: Continuous Robot Evolution for One-to-many Policy Transfer | Unknown | N/A | |
| Chain-of-Table: Evolving Tables in the Reasoning Chain for Table Understanding | Unknown | N/A | |
| TAIL: Task-specific Adapters for Imitation Learning with Large Pretrained Models | Unknown | N/A | |
| Learning Interactive Real-World Simulators | Unknown | N/A | |
| Learning from Sparse Offline Datasets via Conservative Density Estimation | Unknown | N/A | |
| From Graphs to Hypergraphs: Hypergraph Projection and its Reconstruction | Unknown | N/A | |
| Concept Bottleneck Generative Models | Unknown | N/A | |
| Safe Collaborative Filtering | Unknown | N/A | |
| Object-Centric Learning with Slot Mixture Module | Unknown | N/A | |
| Spatio-Temporal Few-Shot Learning via Diffusive Neural Network Generation | Unknown | N/A | |
| Skip-Attention: Improving Vision Transformers by Paying Less Attention | Unknown | N/A | |
| Linear Log-Normal Attention with Unbiased Concentration | Unknown | N/A | |
| Reasoning on Graphs: Faithful and Interpretable Large Language Model Reasoning | Unknown | N/A | |
| Koopman-based generalization bound: New aspect for full-rank weights | Unknown | N/A | |
| Robust Model-Based Optimization for Challenging Fitness Landscapes | Unknown | N/A | |
| Analytically Tractable Hidden-States Inference in Bayesian Neural Networks | Unknown | N/A | |
| An interpretable error correction method for enhancing code-to-code translation | Unknown | N/A | |
| Random Feature Amplification: Feature Learning and Generalization in Neural Networks | Unknown | N/A | |
| Fiber Monte Carlo | Unknown | N/A | |
| NeRM: Learning Neural Representations for High-Framerate Human Motion Synthesis | Unknown | N/A | |
| A Unified Experiment Design Approach for Cyclic and Acyclic Causal Models | Unknown | N/A | |
| A Framework and Benchmark for Deep Batch Active Learning for Regression | Unknown | N/A | |
| Tackling the Data Heterogeneity in Asynchronous Federated Learning with Cached Update Calibration | Unknown | N/A | |
| ViDA: Homeostatic Visual Domain Adapter for Continual Test Time Adaptation | Unknown | N/A | |
| TextField3D: Towards Enhancing Open-Vocabulary 3D Generation with Noisy Text Fields | Unknown | N/A | |
| Automatic Functional Differentiation in JAX | Unknown | N/A | |
| Manipulating dropout reveals an optimal balance of efficiency and robustness in biological and machine visual systems | Unknown | N/A | |
| $\mathcal{B}$-Coder: Value-Based Deep Reinforcement Learning for Program Synthesis | Unknown | N/A | |
| Transformer-VQ: Linear-Time Transformers via Vector Quantization | Unknown | N/A | |
| What does the Knowledge Neuron Thesis Have to do with Knowledge? | Unknown | N/A | |
| Ghost on the Shell: An Expressive Representation of General 3D Shapes | Unknown | N/A | |
| Intriguing Properties of Generative Classifiers | Unknown | N/A | |
| Octavius: Mitigating Task Interference in MLLMs via LoRA-MoE | Unknown | N/A | |
| Learning semilinear neural operators: A unified recursive framework for prediction and data assimilation | Unknown | N/A | |
| ToRA: A Tool-Integrated Reasoning Agent for Mathematical Problem Solving | Unknown | N/A | |
| Sample-efficient Learning of Infinite-horizon Average-reward MDPs with General Function Approximation | Unknown | N/A | |
| Towards Robust Offline Reinforcement Learning under Diverse Data Corruption | Unknown | N/A | |
| Fine-tuning Multimodal LLMs to Follow Zero-shot Demonstrative Instructions | Unknown | N/A | |
| Towards domain-invariant Self-Supervised Learning with Batch Styles Standardization | Unknown | N/A | |
| Towards Diverse Behaviors: A Benchmark for Imitation Learning with Human Demonstrations | Unknown | N/A | |
| SNIP: Bridging Mathematical Symbolic and Numeric Realms with Unified Pre-training | Unknown | N/A | |
| Learning from Label Proportions: Bootstrapping Supervised Learners via Belief Propagation | Unknown | N/A | |
| Transformer-Modulated Diffusion Models for Probabilistic Multivariate Time Series Forecasting | Unknown | N/A | |
| Smooth ECE: Principled Reliability Diagrams via Kernel Smoothing | Unknown | N/A | |
| LiDAR: Sensing Linear Probing Performance in Joint Embedding SSL Architectures | Unknown | N/A | |
| Vanishing Gradients in Reinforcement Finetuning of Language Models | Unknown | N/A | |
| What Algorithms can Transformers Learn? A Study in Length Generalization | Unknown | N/A | |
| Neural Network-Based Score Estimation in Diffusion Models: Optimization and Generalization | Unknown | N/A | |
| Enhancing Small Medical Learners with Privacy-preserving Contextual Prompting | Unknown | N/A | |
| Optimal criterion for feature learning of two-layer linear neural network in high dimensional interpolation regime | Unknown | N/A | |
| On the Scalability and Memory Efficiency of Semidefinite Programs for Lipschitz Constant Estimation of Neural Networks | Unknown | N/A | |
| Learning to Act without Actions | Unknown | N/A | |
| Intelligent Switching for Reset-Free RL | Unknown | N/A | |
| PINNACLE: PINN Adaptive ColLocation and Experimental points selection | Unknown | N/A | |
| Quantifying the Sensitivity of Inverse Reinforcement Learning to Misspecification | Unknown | N/A | |
| Effective and Efficient Federated Tree Learning on Hybrid Data | Unknown | N/A | |
| Neural Processing of Tri-Plane Hybrid Neural Fields | Unknown | N/A | |
| Boosting the Adversarial Robustness of Graph Neural Networks: An OOD Perspective | Unknown | N/A | |
| Error Norm Truncation: Robust Training in the Presence of Data Noise for Text Generation Models | Unknown | N/A | |
| Byzantine Robust Cooperative Multi-Agent Reinforcement Learning as a Bayesian Game | Unknown | N/A | |
| SetCSE: Set Operations using Contrastive Learning of Sentence Embeddings | Unknown | N/A | |
| #InsTag: Instruction Tagging for Analyzing Supervised Fine-tuning of Large Language Models | Unknown | N/A | |
| Harnessing Joint Rain-/Detail-aware Representations to Eliminate Intricate Rains | Unknown | N/A | |
| Pre-training with Random Orthogonal Projection Image Modeling | Unknown | N/A | |
| Debiasing Attention Mechanism in Transformer without Demographics | Unknown | N/A | |
| Unsupervised Pretraining for Fact Verification by Language Model Distillation | Unknown | N/A | |
| Empirical Analysis of Model Selection for Heterogeneous Causal Effect Estimation | Unknown | N/A | |
| Image Translation as Diffusion Visual Programmers | Unknown | N/A | |
| Towards Assessing and Benchmarking Risk-Return Tradeoff of Off-Policy Evaluation | Unknown | N/A | |
| Adversarial Imitation Learning via Boosting | Unknown | N/A | |
| Bayes Conditional Distribution Estimation for Knowledge Distillation Based on Conditional Mutual Information | Unknown | N/A | |
| ValUES: A Framework for Systematic Validation of Uncertainty Estimation in Semantic Segmentation | Unknown | N/A | |
| Provable Reward-Agnostic Preference-Based Reinforcement Learning | Unknown | N/A | |
| Bidirectional Temporal Diffusion Model for Temporally Consistent Human Animation | Unknown | N/A | |
| Copula Conformal prediction for multi-step time series prediction | Unknown | N/A | |
| Transformers as Decision Makers: Provable In-Context Reinforcement Learning via Supervised Pretraining | Unknown | N/A | |
| Adapting Large Language Models via Reading Comprehension | Unknown | N/A | |
| Improving Convergence and Generalization Using Parameter Symmetries | Unknown | N/A | |
| COLEP: Certifiably Robust Learning-Reasoning Conformal Prediction via Probabilistic Circuits | Unknown | N/A | |
| PolyGCL: Graph Contrastive Learning via Learnable Spectral Polynomial Filters | Unknown | N/A | |
| Manifold Preserving Guided Diffusion | Unknown | N/A | |
| Motion Guidance: Diffusion-Based Image Editing with Differentiable Motion Estimators | Unknown | N/A | |
| Threaten Spiking Neural Networks through Combining Rate and Temporal Information | Unknown | N/A | |
| FLD: Fourier Latent Dynamics for Structured Motion Representation and Learning | Unknown | N/A | |
| Exploring Target Representations for Masked Autoencoders | Unknown | N/A | |
| Federated Recommendation with Additive Personalization | Unknown | N/A | |
| Neural Language of Thought Models | Unknown | N/A | |
| Text2Reward: Reward Shaping with Language Models for Reinforcement Learning | Unknown | N/A | |
| Towards Training Without Depth Limits: Batch Normalization Without Gradient Explosion | Unknown | N/A | |
| Statistical Rejection Sampling Improves Preference Optimization | Unknown | N/A | |
| Tell Your Model Where to Attend: Post-hoc Attention Steering for LLMs | Unknown | N/A | |
| Privacy Amplification for Matrix Mechanisms | Unknown | N/A | |
| Negative Label Guided OOD Detection with Pretrained Vision-Language Models | Unknown | N/A | |
| PTaRL: Prototype-based Tabular Representation Learning via Space Calibration | Unknown | N/A | |
| Improved baselines for vision-language pre-training | Unknown | N/A | |
| Constrained Bi-Level Optimization: Proximal Lagrangian Value Function Approach and Hessian-free Algorithm | Unknown | N/A | |
| Error Feedback Reloaded: From Quadratic to Arithmetic Mean of Smoothness Constants | Unknown | N/A | |
| Correlated Noise Provably Beats Independent Noise for Differentially Private Learning | Unknown | N/A | |
| ModuLoRA: Finetuning 2-Bit LLMs on Consumer GPUs by Integrating with Modular Quantizers | Unknown | N/A | |
| On the Stability of Expressive Positional Encodings for Graphs | Unknown | N/A | |
| PolyVoice: Language Models for Speech to Speech Translation | Unknown | N/A | |
| Benchmarking Algorithms for Federated Domain Generalization | Unknown | N/A | |
| Compositional Generative Inverse Design | Unknown | N/A | |
| Evaluating Representation Learning on the Protein Structure Universe | Unknown | N/A | |
| AutoVP: An Automated Visual Prompting Framework and Benchmark | Unknown | N/A | |
| On the Hardness of Constrained Cooperative Multi-Agent Reinforcement Learning | Unknown | N/A | |
| Linearity of Relation Decoding in Transformer Language Models | Unknown | N/A | |
| Causal Structure Recovery with Latent Variables under Milder Distributional and Graphical Assumptions | Unknown | N/A | |
| Information Retention via Learning Supplemental Features | Unknown | N/A | |
| Geometry-Aware Projective Mapping for Unbounded Neural Radiance Fields | Unknown | N/A | |
| Off-Policy Primal-Dual Safe Reinforcement Learning | Unknown | N/A | |
| Can Large Language Models Infer Causation from Correlation? | Unknown | N/A | |
| When should we prefer Decision Transformers for Offline Reinforcement Learning? | Unknown | N/A | |
| ARM: Refining Multivariate Forecasting with Adaptive Temporal-Contextual Learning | Unknown | N/A | |
| SAS: Structured Activation Sparsification | Unknown | N/A | |
| Learning Multi-Agent Communication with Contrastive Learning | Unknown | N/A | |
| Revisiting Plasticity in Visual Reinforcement Learning: Data, Modules and Training Stages | Unknown | N/A | |
| Xformer: Hybrid X-Shaped Transformer for Image Denoising | Unknown | N/A | |
| Efficient Backdoor Attacks for Deep Neural Networks in Real-world Scenarios | Unknown | N/A | |
| Dynamics-Informed Protein Design with Structure Conditioning | Unknown | N/A | |
| Identifiable Latent Polynomial Causal Models through the Lens of Change | Unknown | N/A | |
| SYMBOL: Generating Flexible Black-Box Optimizers through Symbolic Equation Learning | Unknown | N/A | |
| Graph Lottery Ticket Automated | Unknown | N/A | |
| Threshold-Consistent Margin Loss for Open-World Deep Metric Learning | Unknown | N/A | |
| The Expressive Leaky Memory Neuron: an Efficient and Expressive Phenomenological Neuron Model Can Solve Long-Horizon Tasks | Unknown | N/A | |
| DreamSmooth: Improving Model-based Reinforcement Learning via Reward Smoothing | Unknown | N/A | |
| Encoding Unitig-level Assembly Graphs with Heterophilous Constraints for Metagenomic Contigs Binning | Unknown | N/A | |
| Adaptive Regret for Bandits Made Possible: Two Queries Suffice | Unknown | N/A | |
| AdaMerging: Adaptive Model Merging for Multi-Task Learning | Unknown | N/A | |
| Statistically Optimal $K$-means Clustering via Nonnegative Low-rank Semidefinite Programming | Unknown | N/A | |
| AutomaTikZ: Text-Guided Synthesis of Scientific Vector Graphics with TikZ | Unknown | N/A | |
| Improved statistical and computational complexity of the mean-field Langevin dynamics under structured data | Unknown | N/A | |
| Zero Bubble (Almost) Pipeline Parallelism | Unknown | N/A | |
| Bridging Neural and Symbolic Representations with Transitional Dictionary Learning | Unknown | N/A | |
| Towards Characterizing Domain Counterfactuals for Invertible Latent Causal Models | Unknown | N/A | |
| Thin-Shell Object Manipulations With Differentiable Physics Simulations | Unknown | N/A | |
| Bayesian Coreset Optimization for Personalized Federated Learning | Unknown | N/A | |
| Beyond Spatio-Temporal Representations: Evolving Fourier Transform for Temporal Graphs | Unknown | N/A | |
| MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models | Unknown | N/A | |
| Towards Robust Fidelity for Evaluating Explainability of Graph Neural Networks | Unknown | N/A | |
| Hierarchical Context Merging: Better Long Context Understanding for Pre-trained LLMs | Unknown | N/A | |
| Towards Best Practices of Activation Patching in Language Models: Metrics and Methods | Unknown | N/A | |
| Scale-Adaptive Diffusion Model for Complex Sketch Synthesis | Unknown | N/A | |
| Model Tells You What to Discard: Adaptive KV Cache Compression for LLMs | Unknown | N/A | |
| On the Over-Memorization During Natural, Robust and Catastrophic Overfitting | Unknown | N/A | |
| Mastering Memory Tasks with World Models | Unknown | N/A | |
| Towards Principled Representation Learning from Videos for Reinforcement Learning | Unknown | N/A | |
| Expected flow networks in stochastic environments and two-player zero-sum games | Unknown | N/A | |
| Towards Unified Multi-Modal Personalization: Large Vision-Language Models for Generative Recommendation and Beyond | Unknown | N/A | |
| DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models | Unknown | N/A | |
| Energy-conserving equivariant GNN for elasticity of lattice architected metamaterials | Unknown | N/A | |
| SALMON: Self-Alignment with Instructable Reward Models | Unknown | N/A | |
| Get more for less: Principled Data Selection for Warming Up Fine-Tuning in LLMs | Unknown | N/A | |
| Augmenting Transformers with Recursively Composed Multi-grained Representations | Unknown | N/A | |
| Adversarial Training on Purification (AToP): Advancing Both Robustness and Generalization | Unknown | N/A | |
| Large Language Models as Generalizable Policies for Embodied Tasks | Unknown | N/A | |
| The Joint Effect of Task Similarity and Overparameterization on Catastrophic Forgetting — An Analytical Model | Unknown | N/A | |
| Fast Equilibrium of SGD in Generic Situations | Unknown | N/A | |
| Connect, Collapse, Corrupt: Learning Cross-Modal Tasks with Uni-Modal Data | Unknown | N/A | |
| Compositional Preference Models for Aligning LMs | Unknown | N/A | |
| Diffusion Posterior Sampling for Linear Inverse Problem Solving: A Filtering Perspective | Unknown | N/A | |
| Demystifying Local & Global Fairness Trade-offs in Federated Learning Using Partial Information Decomposition | Unknown | N/A | |
| Provable Offline Preference-Based Reinforcement Learning | Unknown | N/A | |
| Learning Conditional Invariances through Non-Commutativity | Unknown | N/A | |
| Generative Modeling with Phase Stochastic Bridge | Unknown | N/A | |
| Bandits Meet Mechanism Design to Combat Clickbait in Online Recommendation | Unknown | N/A | |
| RobustTSF: Towards Theory and Design of Robust Time Series Forecasting with Anomalies | Unknown | N/A | |
| Tailoring Self-Rationalizers with Multi-Reward Distillation | Unknown | N/A | |
| Controlling Vision-Language Models for Multi-Task Image Restoration | Unknown | N/A | |
| Frozen Transformers in Language Models Are Effective Visual Encoder Layers | Unknown | N/A | |
| VFLAIR: A Research Library and Benchmark for Vertical Federated Learning | Unknown | N/A | |
| Measuring Vision-Language STEM Skills of Neural Models | Unknown | N/A | |
| Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers | Unknown | N/A | |
| Whole-Song Hierarchical Generation of Symbolic Music Using Cascaded Diffusion Models | Unknown | N/A | |
| MCM: Masked Cell Modeling for Anomaly Detection in Tabular Data | Unknown | N/A | |
| GAIA: Zero-shot Talking Avatar Generation | Unknown | N/A | |
| NaturalSpeech 2: Latent Diffusion Models are Natural and Zero-Shot Speech and Singing Synthesizers | Unknown | N/A | |
| CLIP-MUSED: CLIP-Guided Multi-Subject Visual Neural Information Semantic Decoding | Unknown | N/A | |
| How connectivity structure shapes rich and lazy learning in neural circuits | Unknown | N/A | |
| ARGS: Alignment as Reward-Guided Search | Unknown | N/A | |
| PromptTTS 2: Describing and Generating Voices with Text Prompt | Unknown | N/A | |
| Let Models Speak Ciphers: Multiagent Debate through Embeddings | Unknown | N/A | |
| NeuroBack: Improving CDCL SAT Solving using Graph Neural Networks | Unknown | N/A | |
| Understanding when Dynamics-Invariant Data Augmentations Benefit Model-free Reinforcement Learning Updates | Unknown | N/A | |
| Selective Mixup Fine-Tuning for Optimizing Non-Decomposable Objectives | Unknown | N/A | |
| Revisiting Deep Audio-Text Retrieval Through the Lens of Transportation | Unknown | N/A | |
| Text-to-3D with Classifier Score Distillation | Unknown | N/A | |
| Transformers can optimally learn regression mixture models | Unknown | N/A | |
| Dirichlet-based Per-Sample Weighting by Transition Matrix for Noisy Label Learning | Unknown | N/A | |
| Branch-GAN: Improving Text Generation with (not so) Large Language Models | Unknown | N/A | |
| SocioDojo: Building Lifelong Analytical Agents with Real-world Text and Time Series | Unknown | N/A | |
| ECoFLaP: Efficient Coarse-to-Fine Layer-Wise Pruning for Vision-Language Models | Unknown | N/A | |
| Retroformer: Retrospective Large Language Agents with Policy Gradient Optimization | Unknown | N/A | |
| A unique M-pattern for micro-expression spotting in long videos | Unknown | N/A | |
| Progressive Fourier Neural Representation for Sequential Video Compilation | Unknown | N/A | |
| Internal Cross-layer Gradients for Extending Homogeneity to Heterogeneity in Federated Learning | Unknown | N/A | |
| iTransformer: Inverted Transformers Are Effective for Time Series Forecasting | Unknown | N/A | |
| Game-Theoretic Robust Reinforcement Learning Handles Temporally-Coupled Perturbations | Unknown | N/A | |
| A Mutual Information Perspective on Federated Contrastive Learning | Unknown | N/A | |
| Local Graph Clustering with Noisy Labels | Unknown | N/A | |
| DistillSpec: Improving Speculative Decoding via Knowledge Distillation | Unknown | N/A | |
| Faithful Vision-Language Interpretation via Concept Bottleneck Models | Unknown | N/A | |
| AuG-KD: Anchor-Based Mixup Generation for Out-of-Domain Knowledge Distillation | Unknown | N/A | |
| Stylized Offline Reinforcement Learning: Extracting Diverse High-Quality Behaviors from Heterogeneous Datasets | Unknown | N/A | |
| Variance-aware Regret Bounds for Stochastic Contextual Dueling Bandits | Unknown | N/A | |
| Demystifying Embedding Spaces using Large Language Models | Unknown | N/A | |
| A Newborn Embodied Turing Test for Comparing Object Segmentation Across Animals and Machines | Unknown | N/A | |
| DeepZero: Scaling Up Zeroth-Order Optimization for Deep Model Training | Unknown | N/A | |
| Closing the Gap between TD Learning and Supervised Learning - A Generalisation Point of View | Unknown | N/A | |
| Unveiling the Pitfalls of Knowledge Editing for Large Language Models | Unknown | N/A | |
| Learning Thresholds with Latent Values and Censored Feedback | Unknown | N/A | |
| Extending Power of Nature from Binary to Real-Valued Graph Learning in Real World | Unknown | N/A | |
| Robustifying and Boosting Training-Free Neural Architecture Search | Unknown | N/A | |
| Guess & Sketch: Language Model Guided Transpilation | Unknown | N/A | |
| Zero and Few-shot Semantic Parsing with Ambiguous Inputs | Unknown | N/A | |
| Boosting of Thoughts: Trial-and-Error Problem Solving with Large Language Models | Unknown | N/A | |
| Large-Vocabulary 3D Diffusion Model with Transformer | Unknown | N/A | |
| An Investigation of Representation and Allocation Harms in Contrastive Learning | Unknown | N/A | |
| Channel Vision Transformers: An Image Is Worth 1 x 16 x 16 Words | Unknown | N/A | |
| Solving High Frequency and Multi-Scale PDEs with Gaussian Processes | Unknown | N/A | |
| Adversarial Attacks on Fairness of Graph Neural Networks | Unknown | N/A | |
| Task structure and nonlinearity jointly determine learned representational geometry | Unknown | N/A | |
| Locality Sensitive Sparse Encoding for Learning World Models Online | Unknown | N/A | |
| ReLU Strikes Back: Exploiting Activation Sparsity in Large Language Models | Unknown | N/A | |
| Think-on-Graph: Deep and Responsible Reasoning of Large Language Model on Knowledge Graph | Unknown | N/A | |
| Tractable MCMC for Private Learning with Pure and Gaussian Differential Privacy | Unknown | N/A | |
| Jailbreak in pieces: Compositional Adversarial Attacks on Multi-Modal Language Models | Unknown | N/A | |
| Graph Transformers on EHRs: Better Representation Improves Downstream Performance | Unknown | N/A | |
| SelfCheck: Using LLMs to Zero-Shot Check Their Own Step-by-Step Reasoning | Unknown | N/A | |
| Stable Neural Stochastic Differential Equations in Analyzing Irregular Time Series Data | Unknown | N/A | |
| Scalable Modular Network: A Framework for Adaptive Learning via Agreement Routing | Unknown | N/A | |
| Improved Regret Bounds for Non-Convex Online-Within-Online Meta Learning | Unknown | N/A | |
| Compose and Conquer: Diffusion-Based 3D Depth Aware Composable Image Synthesis | Unknown | N/A | |
| Recursive Generalization Transformer for Image Super-Resolution | Unknown | N/A | |
| Reconciling Spatial and Temporal Abstractions for Goal Representation | Unknown | N/A | |
| Magnushammer: A Transformer-Based Approach to Premise Selection | Unknown | N/A | |
| Score Models for Offline Goal-Conditioned Reinforcement Learning | Unknown | N/A | |
| Treatment Effects Estimation By Uniform Transformer | Unknown | N/A | |
| Scaling Laws for Associative Memories | Unknown | N/A | |
| Entropy-MCMC: Sampling from Flat Basins with Ease | Unknown | N/A | |
| LMUFormer: Low Complexity Yet Powerful Spiking Model With Legendre Memory Units | Unknown | N/A | |
| Representation Deficiency in Masked Language Modeling | Unknown | N/A | |
| Sampling Multimodal Distributions with the Vanilla Score: Benefits of Data-Based Initialization | Unknown | N/A | |
| MaGIC: Multi-modality Guided Image Completion | Unknown | N/A | |
| CO2: Efficient Distributed Training with Full Communication-Computation Overlap | Unknown | N/A | |
| Learning Robust Generalizable Radiance Field with Visibility and Feature Augmented Point Representation | Unknown | N/A | |
| Alice Benchmarks: Connecting Real World Re-Identification with the Synthetic | Unknown | N/A | |
| HoloNets: Spectral Convolutions do extend to Directed Graphs | Unknown | N/A | |
| Searching for High-Value Molecules Using Reinforcement Learning and Transformers | Unknown | N/A | |
| Interpretable Meta-Learning of Physical Systems | Unknown | N/A | |
| An Image Is Worth 1000 Lies: Transferability of Adversarial Images across Prompts on Vision-Language Models | Unknown | N/A | |
| Fast Value Tracking for Deep Reinforcement Learning | Unknown | N/A | |
| Interpretable Sparse System Identification: Beyond Recent Deep Learning Techniques on Time-Series Prediction | Unknown | N/A | |
| FedInverse: Evaluating Privacy Leakage in Federated Learning | Unknown | N/A | |
| CircuitNet 2.0: An Advanced Dataset for Promoting Machine Learning Innovations in Realistic Chip Design Environment | Unknown | N/A | |
| Are Transformers with One Layer Self-Attention Using Low-Rank Weight Matrices Universal Approximators? | Unknown | N/A | |
| Self-Supervised Contrastive Learning for Long-term Forecasting | Unknown | N/A | |
| Federated Orthogonal Training: Mitigating Global Catastrophic Forgetting in Continual Federated Learning | Unknown | N/A | |
| Neuron Activation Coverage: Rethinking Out-of-distribution Detection and Generalization | Unknown | N/A | |
| OctoPack: Instruction Tuning Code Large Language Models | Unknown | N/A | |
| Skeleton-of-Thought: Prompting LLMs for Efficient Parallel Generation | Unknown | N/A | |
| Open the Black Box: Step-based Policy Updates for Temporally-Correlated Episodic Reinforcement Learning | Unknown | N/A | |
| Rethinking CNN’s Generalization to Backdoor Attack from Frequency Domain | Unknown | N/A | |
| Variance Reduced Halpern Iteration for Finite-Sum Monotone Inclusions | Unknown | N/A | |
| Flow to Better: Offline Preference-based Reinforcement Learning via Preferred Trajectory Generation | Unknown | N/A | |
| LLM Blueprint: Enabling Text-to-Image Generation with Complex and Detailed Prompts | Unknown | N/A | |
| VBH-GNN: Variational Bayesian Heterogeneous Graph Neural Networks for Cross-subject Emotion Recognition | Unknown | N/A | |
| Neural Rate Control for Learned Video Compression | Unknown | N/A | |
| PnP Inversion: Boosting Diffusion-based Editing with 3 Lines of Code | Unknown | N/A | |
| Harnessing Density Ratios for Online Reinforcement Learning | Unknown | N/A | |
| Sliced Denoising: A Physics-Informed Molecular Pre-Training Method | Unknown | N/A | |
| Is Self-Repair a Silver Bullet for Code Generation? | Unknown | N/A | |
| Unified Human-Scene Interaction via Prompted Chain-of-Contacts | Unknown | N/A | |
| Improved Efficiency Based on Learned Saccade and Continuous Scene Reconstruction From Foveated Visual Sampling | Unknown | N/A | |
| Grounding Multimodal Large Language Models to the World | Unknown | N/A | |
| Context-Aware Meta-Learning | Unknown | N/A | |
| A Poincaré Inequality and Consistency Results for Signal Sampling on Large Graphs | Unknown | N/A | |
| Poisoned Forgery Face: Towards Backdoor Attacks on Face Forgery Detection | Unknown | N/A | |
| MetaCoCo: A New Few-Shot Classification Benchmark with Spurious Correlation | Unknown | N/A | |
| Negatively Correlated Ensemble Reinforcement Learning for Online Diverse Game Level Generation | Unknown | N/A | |
| Free from Bellman Completeness: Trajectory Stitching via Model-based Return-conditioned Supervised Learning | Unknown | N/A | |
| Causal Fairness under Unobserved Confounding: A Neural Sensitivity Framework | Unknown | N/A | |
| Local Composite Saddle Point Optimization | Unknown | N/A | |
| ASID: Active Exploration for System Identification in Robotic Manipulation | Unknown | N/A | |
| Simple Hierarchical Planning with Diffusion | Unknown | N/A | |
| sRGB Real Noise Modeling via Noise-Aware Sampling with Normalizing Flows | Unknown | N/A | |
| BatteryML: An Open-source Platform for Machine Learning on Battery Degradation | Unknown | N/A | |
| Improving Generalization of Alignment with Human Preferences through Group Invariant Learning | Unknown | N/A | |
| Dynamic Sparse Training with Structured Sparsity | Unknown | N/A | |
| DENEVIL: Towards Deciphering and Navigating the Ethical Values of Large Language Models via Instruction Learning | Unknown | N/A | |
| Robustifying State-space Models for Long Sequences via Approximate Diagonalization | Unknown | N/A | |
| Generative Adversarial Equilibrium Solvers | Unknown | N/A | |
| Test-Time Adaptation with CLIP Reward for Zero-Shot Generalization in Vision-Language Models | Unknown | N/A | |
| FedImpro: Measuring and Improving Client Update in Federated Learning | Unknown | N/A | |
| What Makes a Good Prune? Maximal Unstructured Pruning for Maximal Cosine Similarity | Unknown | N/A | |
| Improving LoRA in Privacy-preserving Federated Learning | Unknown | N/A | |
| Efficient Inverse Multiagent Learning | Unknown | N/A | |
| Neural Neighborhood Search for Multi-agent Path Finding | Unknown | N/A | |
| Nemesis: Normalizing the Soft-prompt Vectors of Vision-Language Models | Unknown | N/A | |
| FasterViT: Fast Vision Transformers with Hierarchical Attention | Unknown | N/A | |
| C-TPT: Calibrated Test-Time Prompt Tuning for Vision-Language Models via Text Feature Dispersion | Unknown | N/A | |
| DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior | Unknown | N/A | |
| Plugin estimators for selective classification with out-of-distribution detection | Unknown | N/A | |
| P$^2$OT: Progressive Partial Optimal Transport for Deep Imbalanced Clustering | Unknown | N/A | |
| Unprocessing Seven Years of Algorithmic Fairness | Unknown | N/A | |
| Adaptive Stochastic Gradient Algorithm for Black-box Multi-Objective Learning | Unknown | N/A | |
| Intriguing Properties of Data Attribution on Diffusion Models | Unknown | N/A | |
| SpaCE: The Spatial Confounding Environment | Unknown | N/A | |
| How Does Unlabeled Data Provably Help Out-of-Distribution Detection? | Unknown | N/A | |
| AdjointDPM: Adjoint Sensitivity Method for Gradient Backpropagation of Diffusion Probabilistic Models | Unknown | N/A | |
| GlucoBench: Curated List of Continuous Glucose Monitoring Datasets with Prediction Benchmarks | Unknown | N/A | |
| Finite-Time Analysis of On-Policy Heterogeneous Federated Reinforcement Learning | Unknown | N/A | |
| Look, Remember and Reason: Grounded Reasoning in Videos with Language Models | Unknown | N/A | |
| It's Never Too Late: Fusing Acoustic Information into Large Language Models for Automatic Speech Recognition | Unknown | N/A | |
| MuSR: Testing the Limits of Chain-of-thought with Multistep Soft Reasoning | Unknown | N/A | |
| Pushing Boundaries: Mixup's Influence on Neural Collapse | Unknown | N/A | |
| Analyzing and Improving Optimal-Transport-based Adversarial Networks | Unknown | N/A | |
| Two-stage LLM Fine-tuning with Less Specialization and More Generalization | Unknown | N/A | |
| LLCP: Learning Latent Causal Processes for Reasoning-based Video Question Answer | Unknown | N/A | |
| Implicit regularization of deep residual networks towards neural ODEs | Unknown | N/A | |
| Provably Efficient UCB-type Algorithms For Learning Predictive State Representations | Unknown | N/A | |
| Inner Classifier-Free Guidance and Its Taylor Expansion for Diffusion Models | Unknown | N/A | |
| Protein Multimer Structure Prediction via Prompt Learning | Unknown | N/A | |
| Symbol as Points: Panoptic Symbol Spotting via Point-based Representation | Unknown | N/A | |
| Compressing Latent Space via Least Volume | Unknown | N/A | |
| CoLiDE: Concomitant Linear DAG Estimation | Unknown | N/A | |
| EBMDock: Neural Probabilistic Protein-Protein Docking via a Differentiable Energy Model | Unknown | N/A | |
| Going Beyond Neural Network Feature Similarity: The Network Feature Complexity and Its Interpretation Using Category Theory | Unknown | N/A | |
| Habitat 3.0: A Co-Habitat for Humans, Avatars, and Robots | Unknown | N/A | |
| A Unified Framework for Bayesian Optimization under Contextual Uncertainty | Unknown | N/A | |
| Learning Large DAGs is Harder than you Think: Many Losses are Minimal for the Wrong DAG | Unknown | N/A | |
| Nearly $d$-Linear Convergence Bounds for Diffusion Models via Stochastic Localization | Unknown | N/A | |
| Active Retrosynthetic Planning Aware of Route Quality | Unknown | N/A | |
| Solving Inverse Problems with Latent Diffusion Models via Hard Data Consistency | Unknown | N/A | |
| Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model | Unknown | N/A | |
| Merge, Then Compress: Demystify Efficient SMoE with Hints from Its Routing Policy | Unknown | N/A | |
| Non-Exchangeable Conformal Risk Control | Unknown | N/A | |
| Spoken Question Answering and Speech Continuation Using Spectrogram-Powered LLM | Unknown | N/A | |
| USB-NeRF: Unrolling Shutter Bundle Adjusted Neural Radiance Fields | Unknown | N/A | |
| Are Bert Family Good Instruction Followers? A Study on Their Potential And Limitations | Unknown | N/A | |
| Synergistic Patch Pruning for Vision Transformer: Unifying Intra- & Inter-Layer Patch Importance | Unknown | N/A | |
| Dissecting Sample Hardness: A Fine-Grained Analysis of Hardness Characterization Methods for Data-Centric AI | Unknown | N/A | |
| Epitopological learning and Cannistraci-Hebb network shape intelligence brain-inspired theory for ultra-sparse advantage in deep learning | Unknown | N/A | |
| Early Stopping Against Label Noise Without Validation Data | Unknown | N/A | |
| Contrastive Preference Learning: Learning from Human Feedback without Reinforcement Learning | Unknown | N/A | |
| Unknown Domain Inconsistency Minimization for Domain Generalization | Unknown | N/A | |
| Enhancing Transfer Learning with Flexible Nonparametric Posterior Sampling | Unknown | N/A | |
| Finite Scalar Quantization: VQ-VAE Made Simple | Unknown | N/A | |
| Fixed-Budget Differentially Private Best Arm Identification | Unknown | N/A | |
| Rethinking Backdoor Attacks on Dataset Distillation: A Kernel Method Perspective | Unknown | N/A | |
| Neural Contractive Dynamical Systems | Unknown | N/A | |
| Energy-based Automated Model Evaluation | Unknown | N/A | |
| Unraveling the Enigma of Double Descent: An In-depth Analysis through the Lens of Learned Feature Space | Unknown | N/A | |
| FreeDyG: Frequency Enhanced Continuous-Time Dynamic Graph Model for Link Prediction | Unknown | N/A | |
| Structural Inference with Dynamics Encoding and Partial Correlation Coefficients | Unknown | N/A | |
| SEAL: A Framework for Systematic Evaluation of Real-World Super-Resolution | Unknown | N/A | |
| Toward Optimal Policy Population Growth in Two-Player Zero-Sum Games | Unknown | N/A | |
| Towards Robust and Efficient Cloud-Edge Elastic Model Adaptation via Selective Entropy Distillation | Unknown | N/A | |
| Polynormer: Polynomial-Expressive Graph Transformer in Linear Time | Unknown | N/A | |
| Beyond task performance: evaluating and reducing the flaws of large multimodal models with in-context-learning | Unknown | N/A | |
| A Differentially Private Clustering Algorithm for Well-Clustered Graphs | Unknown | N/A | |
| The Trickle-down Impact of Reward Inconsistency on RLHF | Unknown | N/A | |
| Contrastive Learning is Spectral Clustering on Similarity Graph | Unknown | N/A | |
| Better Neural PDE Solvers Through Data-Free Mesh Movers | Unknown | N/A | |
| Weatherproofing Retrieval for Localization with Generative AI and Geometric Consistency | Unknown | N/A | |
| Memorization Capacity of Multi-Head Attention in Transformers | Unknown | N/A | |
| LUT-GEMM: Quantized Matrix Multiplication based on LUTs for Efficient Inference in Large-Scale Generative Language Models | Unknown | N/A | |
| Domain-Inspired Sharpness-Aware Minimization Under Domain Shifts | Unknown | N/A | |
| Adaptive Retrieval and Scalable Indexing for k-NN Search with Cross-Encoders | Unknown | N/A | |
| Enhancing Group Fairness in Online Settings Using Oblique Decision Forests | Unknown | N/A | |
| True Knowledge Comes from Practice: Aligning Large Language Models with Embodied Environments via Reinforcement Learning | Unknown | N/A | |
| Maximum Likelihood Estimation is All You Need for Well-Specified Covariate Shift | Unknown | N/A | |
| A Sublinear Adversarial Training Algorithm | Unknown | N/A | |
| PhyloGFN: Phylogenetic inference with generative flow networks | Unknown | N/A | |
| Understanding Certified Training with Interval Bound Propagation | Unknown | N/A | |
| Inducing High Energy-Latency of Large Vision-Language Models with Verbose Images | Unknown | N/A | |
| Performance Gaps in Multi-view Clustering under the Nested Matrix-Tensor Model | Unknown | N/A | |
| Language Model Beats Diffusion - Tokenizer is key to visual generation | Unknown | N/A | |
| ZeRO++: Extremely Efficient Collective Communication for Large Model Training | Unknown | N/A | |
| Multimodal Learning Without Labeled Multimodal Data: Guarantees and Applications | Unknown | N/A | |
| Dual Associated Encoder for Face Restoration | Unknown | N/A | |
| CausalLM is not optimal for in-context learning | Unknown | N/A | |
| An Unforgeable Publicly Verifiable Watermark for Large Language Models | Unknown | N/A | |
| Deep Orthogonal Hypersphere Compression for Anomaly Detection | Unknown | N/A | |
| IMPUS: Image Morphing with Perceptually-Uniform Sampling Using Diffusion Models | Unknown | N/A | |
| Evaluating the Zero-shot Robustness of Instruction-tuned Language Models | Unknown | N/A | |
| Does Writing with Language Models Reduce Content Diversity? | Unknown | N/A | |
| Class Probability Matching with Calibrated Networks for Label Shift Adaption | Unknown | N/A | |
| Few-shot Hybrid Domain Adaptation of Image Generator | Unknown | N/A | |
| Entity-Centric Reinforcement Learning for Object Manipulation from Pixels | Unknown | N/A | |
| Compositional Conservatism: A Transductive Approach in Offline Reinforcement Learning | Unknown | N/A | |
| Adaptive Rational Activations to Boost Deep Reinforcement Learning | Unknown | N/A | |
| InterpGNN: Understand and Improve Generalization Ability of Transductive GNNs through the Lens of Interplay between Train and Test Nodes | Unknown | N/A | |
| A Progressive Training Framework for Spiking Neural Networks with Learnable Multi-hierarchical Model | Unknown | N/A | |
| Memory-Assisted Sub-Prototype Mining for Universal Domain Adaptation | Unknown | N/A | |
| Bounding Box Stability against Feature Dropout Reflects Detector Generalization across Environments | Unknown | N/A | |
| CIFAR-10-Warehouse: Broad and More Realistic Testbeds in Model Generalization Analysis | Unknown | N/A | |
| Ito Diffusion Approximation of Universal Ito Chains for Sampling, Optimization and Boosting | Unknown | N/A | |
| OWL: A Large Language Model for IT Operations | Unknown | N/A | |
| Towards Meta-Pruning via Optimal Transport | Unknown | N/A | |
| Würstchen: An Efficient Architecture for Large-Scale Text-to-Image Diffusion Models | Unknown | N/A | |
| REFACTOR: Learning to Extract Theorems from Proofs | Unknown | N/A | |
| From Posterior Sampling to Meaningful Diversity in Image Restoration | Unknown | N/A | |
| Federated Q-Learning: Linear Regret Speedup with Low Communication Cost | Unknown | N/A | |
| Transformer Fusion with Optimal Transport | Unknown | N/A | |
| A Paradigm Shift in Machine Translation: Boosting Translation Performance of Large Language Models | Unknown | N/A | |
| Dynamic Neighborhood Construction for Structured Large Discrete Action Spaces | Unknown | N/A | |
| LMSYS-Chat-1M: A Large-Scale Real-World LLM Conversation Dataset | Unknown | N/A | |
| A Recipe for Improved Certifiable Robustness | Unknown | N/A | |
| Sparse MoE with Language Guided Routing for Multilingual Machine Translation | Unknown | N/A | |
| Neural Architecture Retrieval | Unknown | N/A | |
| Neural SDF Flow for 3D Reconstruction of Dynamic Scenes | Unknown | N/A | |
| Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback | Unknown | N/A | |
| Convolution Meets LoRA: Parameter Efficient Finetuning for Segment Anything Model | Unknown | N/A | |
| Teaching Arithmetic to Small Transformers | Unknown | N/A | |
| ADOPD: A Large-Scale Document Page Decomposition Dataset | Unknown | N/A | |
| Sample-Efficiency in Multi-Batch Reinforcement Learning: The Need for Dimension-Dependent Adaptivity | Unknown | N/A | |
| Compressing LLMs: The Truth is Rarely Pure and Never Simple | Unknown | N/A | |
| To the Cutoff... and Beyond? A Longitudinal Perspective on LLM Data Contamination | Unknown | N/A | |
| ToolChain*: Efficient Action Space Navigation in Large Language Models with A* Search | Unknown | N/A | |
| Large Language Models as Tool Makers | Unknown | N/A | |
| LayoutNUWA: Revealing the Hidden Layout Expertise of Large Language Models | Unknown | N/A | |
| Delphic Offline Reinforcement Learning under Nonidentifiable Hidden Confounding | Unknown | N/A | |
| Learning Mean Field Games on Sparse Graphs: A Hybrid Graphex Approach | Unknown | N/A | |
| P2Seg: Pointly-supervised Segmentation via Mutual Distillation | Unknown | N/A | |
| SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand Cores | Unknown | N/A | |
| Efficient Subgraph GNNs by Learning Effective Selection Policies | Unknown | N/A | |
| Consciousness-Inspired Spatio-Temporal Abstractions for Better Generalization in Reinforcement Learning | Unknown | N/A | |
| Label-free Node Classification on Graphs with Large Language Models (LLMs) | Unknown | N/A | |
| LLM-grounded Video Diffusion Models | Unknown | N/A | |
| Understanding In-Context Learning in Transformers and LLMs by Learning to Learn Discrete Functions | Unknown | N/A | |
| Multimodal Web Navigation with Instruction-Finetuned Foundation Models | Unknown | N/A | |
| Function-space Parameterization of Neural Networks for Sequential Learning | Unknown | N/A | |
| One-hot Generalized Linear Model for Switching Brain State Discovery | Unknown | N/A | |
| Large Language Models as Analogical Reasoners | Unknown | N/A | |
| Generative Modeling of Regular and Irregular Time Series Data via Koopman VAEs | Unknown | N/A | |
| Annealing Self-Distillation Rectification Improves Adversarial Training | Unknown | N/A | |
| H2O-SDF: Two-phase Learning for 3D Indoor Reconstruction using Object Surface Fields | Unknown | N/A | |
| Boundary Denoising for Video Activity Localization | Unknown | N/A | |
| Out-of-Distribution Detection by Leveraging Between-Layer Transformation Smoothness | Unknown | N/A | |
| RLIF: Interactive Imitation Learning as Reinforcement Learning | Unknown | N/A | |
| Diffusion in Diffusion: Cyclic One-Way Diffusion for Text-Vision-Conditioned Generation | Unknown | N/A | |
| On Trajectory Augmentations for Off-Policy Evaluation | Unknown | N/A | |
| How to Catch an AI Liar: Lie Detection in Black-Box LLMs by Asking Unrelated Questions | Unknown | N/A | |
| Alt-Text with Context: Improving Accessibility for Images on Twitter | Unknown | N/A | |
| Combinatorial Bandits for Maximum Value Reward Function under Value-Index Feedback | Unknown | N/A | |
| Reward-Free Curricula for Training Robust World Models | Unknown | N/A | |
| Yet Another ICU Benchmark: A Flexible Multi-Center Framework for Clinical ML | Unknown | N/A | |
| PoRF: Pose Residual Field for Accurate Neural Surface Reconstruction | Unknown | N/A | |
| PixArt-$\alpha$: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis | Unknown | N/A | |
| AGILE3D: Attention Guided Interactive Multi-object 3D Segmentation | Unknown | N/A | |
| Convergence of Bayesian Bilevel Optimization | Unknown | N/A | |
| Functional Interpolation for Relative Positions improves Long Context Transformers | Unknown | N/A | |
| Consistency Training with Learnable Data Augmentation for Graph Anomaly Detection with Limited Supervision | Unknown | N/A | |
| Rotation Has Two Sides: Evaluating Data Augmentation for Deep One-class Classification | Unknown | N/A | |
| Small-scale proxies for large-scale Transformer training instabilities | Unknown | N/A | |
| Lion Secretly Solves a Constrained Optimization: As Lyapunov Predicts | Unknown | N/A | |
| Improving equilibrium propagation without weight symmetry through Jacobian homeostasis | Unknown | N/A | |
| Symmetric Single Index Learning | Unknown | N/A | |
| Tight Rates in Supervised Outlier Transfer Learning | Unknown | N/A | |
| Node2ket: Efficient High-Dimensional Network Embedding in Quantum Hilbert Space | Unknown | N/A | |
| Towards LLM4QPE: Unsupervised Pretraining of Quantum Property Estimation and A Benchmark | Unknown | N/A | |
| Object-Aware Inversion and Reassembly for Image Editing | Unknown | N/A | |
| Analysis of Learning a Flow-based Generative Model from Limited Sample Complexity | Unknown | N/A | |
| What's In My Big Data? | Unknown | N/A | |
| Learning Over Molecular Conformer Ensembles: Datasets and Benchmarks | Unknown | N/A | |
| Minimum width for universal approximation using ReLU networks on compact domain | Unknown | N/A | |
| Provable Memory Efficient Self-Play Algorithm for Model-free Reinforcement Learning | Unknown | N/A | |
| Retro-fallback: retrosynthetic planning in an uncertain world | Unknown | N/A | |
| Generative Human Motion Stylization in Latent Space | Unknown | N/A | |
| TUVF: Learning Generalizable Texture UV Radiance Fields | Unknown | N/A | |
| Achieving the Pareto Frontier of Regret Minimization and Best Arm Identification in Multi-Armed Bandits | Unknown | N/A | |
| Monte Carlo guided Denoising Diffusion models for Bayesian linear inverse problems | Unknown | N/A | |
| Robustness of AI-Image Detectors: Fundamental Limits and Practical Attacks | Unknown | N/A | |
| Fast Imitation via Behavior Foundation Models | Unknown | N/A | |
| Peering Through Preferences: Unraveling Feedback Acquisition for Aligning Large Language Models | Unknown | N/A | |
| MEND: Meta Demonstration Distillation for Efficient and Effective In-Context Learning | Unknown | N/A | |
| Ins-DetCLIP: Aligning Detection Model to Follow Human-Language Instruction | Unknown | N/A | |
| ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIs | Unknown | N/A | |
| Only Pay for What Is Uncertain: Variance-Adaptive Thompson Sampling | Unknown | N/A | |
| The Consensus Game: Language Model Generation via Equilibrium Search | Unknown | N/A | |
| Tool-Augmented Reward Modeling | Unknown | N/A | |
| Procedural Fairness Through Decoupling Objectionable Data Generating Components | Unknown | N/A | |
| Generalized Policy Iteration using Tensor Approximation for Hybrid Control | Unknown | N/A | |
| Scaling Supervised Local Learning with Augmented Auxiliary Networks | Unknown | N/A | |
| Decongestion by Representation: Learning to Improve Economic Welfare in Marketplaces | Unknown | N/A | |
| Diffusion Models for Multi-Task Generative Modeling | Unknown | N/A | |
| OpenChat: Advancing Open-source Language Models with Mixed-Quality Data | Unknown | N/A | |
| Causal Modelling Agents: Causal Graph Discovery through Synergising Metadata- and Data-driven Reasoning | Unknown | N/A | |
| FedCompass: Efficient Cross-Silo Federated Learning on Heterogeneous Client Devices Using a Computing Power-Aware Scheduler | Unknown | N/A | |
| Bandits with Replenishable Knapsacks: the Best of both Worlds | Unknown | N/A | |
| Bridging State and History Representations: Understanding Self-Predictive RL | Unknown | N/A | |
| Latent 3D Graph Diffusion | Unknown | N/A | |
| State Representation Learning Using an Unbalanced Atlas | Unknown | N/A | |
| Scalable Monotonic Neural Networks | Unknown | N/A | |
| Towards a statistical theory of data selection under weak supervision | Unknown | N/A | |
| Multilinear Operator Networks | Unknown | N/A | |
| BayesPrompt: Prompting Large-Scale Pre-Trained Language Models on Few-shot Inference via Debiased Domain Abstraction | Unknown | N/A | |
| A Stable, Fast, and Fully Automatic Learning Algorithm for Predictive Coding Networks | Unknown | N/A | |
| Bongard-OpenWorld: Few-Shot Reasoning for Free-form Visual Concepts in the Real World | Unknown | N/A | |
| NuwaDynamics: Discovering and Updating in Causal Spatio-Temporal Modeling | Unknown | N/A | |
| Rethinking Information-theoretic Generalization: Loss Entropy Induced PAC Bounds | Unknown | N/A | |
| Decoupling Weighing and Selecting for Integrating Multiple Graph Pre-training Tasks | Unknown | N/A | |
| Democratizing Fine-grained Visual Recognition with Large Language Models | Unknown | N/A | |
| Analyzing Feed-Forward Blocks in Transformers through the Lens of Attention Maps | Unknown | N/A | |
| GAFormer: Enhancing Timeseries Transformers Through Group-Aware Embeddings | Unknown | N/A | |
| DNA-GPT: Divergent N-Gram Analysis for Training-Free Detection of GPT-Generated Text | Unknown | N/A | |
| SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models | Unknown | N/A | |
| Prompt Learning with Quaternion Networks | Unknown | N/A | |
| Prioritized Soft Q-Decomposition for Lexicographic Reinforcement Learning | Unknown | N/A | |
| A Simple Interpretable Transformer for Fine-Grained Image Classification and Analysis | Unknown | N/A | |
| MBR and QE Finetuning: Training-time Distillation of the Best and Most Expensive Decoding Methods | Unknown | N/A | |
| Generalizability of Adversarial Robustness Under Distribution Shifts | Unknown | N/A | |
| UC-NERF: Neural Radiance Field for Under-Calibrated Multi-View Cameras in Autonomous Driving | Unknown | N/A | |
| LUM-ViT: Learnable Under-sampling Mask Vision Transformer for Bandwidth Limited Optical Signal Acquisition | Unknown | N/A | |
| Facing the Elephant in the Room: Visual Prompt Tuning or Full finetuning? | Unknown | N/A | |
| Accelerated Sampling with Stacked Restricted Boltzmann Machines | Unknown | N/A | |
| BECLR: Batch Enhanced Contrastive Few-Shot Learning | Unknown | N/A | |
| DyST: Towards Dynamic Neural Scene Representations on Real-World Videos | Unknown | N/A | |
| Logical Languages Accepted by Transformer Encoders with Hard Attention | Unknown | N/A | |
| Meta-Learning Priors Using Unrolled Proximal Networks | Unknown | N/A | |
| NfgTransformer: Equivariant Representation Learning for Normal-form Games | Unknown | N/A | |
| A Topological Perspective on Demystifying GNN-Based Link Prediction Performance | Unknown | N/A | |
| Self-Supervised Speech Quality Estimation and Enhancement Using Only Clean Speech | Unknown | N/A | |
| Circumventing Concept Erasure Methods For Text-To-Image Generative Models | Unknown | N/A | |
| Continual Learning on a Diet: Learning from Sparsely Labeled Streams Under Constrained Computation | Unknown | N/A | |
| Energy-guided Entropic Neural Optimal Transport | Unknown | N/A | |
| On the Posterior Distribution in Denoising: Application to Uncertainty Quantification | Unknown | N/A | |
| Towards Eliminating Hard Label Constraints in Gradient Inversion Attacks | Unknown | N/A | |
| Soft Mixture Denoising: Beyond the Expressive Bottleneck of Diffusion Models | Unknown | N/A | |
| Horizon-free Reinforcement Learning in Adversarial Linear Mixture MDPs | Unknown | N/A | |
| Ensemble Distillation for Unsupervised Constituency Parsing | Unknown | N/A | |
| MAPE-PPI: Towards Effective and Efficient Protein-Protein Interaction Prediction via Microenvironment-Aware Protein Embedding | Unknown | N/A | |
| Approximately Piecewise E(3) Equivariant Point Networks | Unknown | N/A | |
| EventRPG: Event Data Augmentation with Relevance Propagation Guidance | Unknown | N/A | |
| LLMCarbon: Modeling the End-to-End Carbon Footprint of Large Language Models | Unknown | N/A | |
| Implicit Maximum a Posteriori Filtering via Adaptive Optimization | Unknown | N/A | |
| Multi-modal Gaussian Process Variational Autoencoders for Neural and Behavioral Data | Unknown | N/A | |
| ContextRef: Evaluating Referenceless Metrics for Image Description Generation | Unknown | N/A | |
| Diffusion Model for Dense Matching | Unknown | N/A | |
| BENO: Boundary-embedded Neural Operators for Elliptic PDEs | Unknown | N/A | |
| Kernel Metric Learning for In-Sample Off-Policy Evaluation of Deterministic RL Policies | Unknown | N/A | |
| On Double Descent in Reinforcement Learning with LSTD and Random Features | Unknown | N/A | |
| Bayesian Bi-clustering of Neural Spiking Activity with Latent Structures | Unknown | N/A | |
| Functional Bayesian Tucker Decomposition for Continuous-indexed Tensor Data | Unknown | N/A | |
| Doubly Robust Proximal Causal Learning for Continuous Treatments | Unknown | N/A | |
| SaNN: Simple Yet Powerful Simplicial-aware Neural Networks | Unknown | N/A | |
| DMBP: Diffusion model-based predictor for robust offline reinforcement learning against state observation perturbations | Unknown | N/A | |
| Bounds on Representation-Induced Confounding Bias for Treatment Effect Estimation | Unknown | N/A | |
| Making Retrieval-Augmented Language Models Robust to Irrelevant Context | Unknown | N/A | |
| GAIA: a benchmark for General AI Assistants | Unknown | N/A | |
| Polynomial Width is Sufficient for Set Representation with High-dimensional Features | Unknown | N/A | |
| AUC-CL: A Batchsize-Robust Framework for Self-Supervised Contrastive Representation Learning | Unknown | N/A | |
| Adversarial Causal Bayesian Optimization | Unknown | N/A | |
| A Neural Framework for Generalized Causal Sensitivity Analysis | Unknown | N/A | |
| Fast, Expressive $\mathrm{SE}(n)$ Equivariant Networks through Weight-Sharing in Position-Orientation Space | Unknown | N/A | |
| Most discriminative stimuli for functional cell type clustering | Unknown | N/A | |
| Efficient Backpropagation with Variance Controlled Adaptive Sampling | Unknown | N/A | |
| Improving Intrinsic Exploration by Creating Stationary Objectives | Unknown | N/A | |
| A Dynamical View of the Question of Why | Unknown | N/A | |
| SPTNet: An Efficient Alternative Framework for Generalized Category Discovery with Spatial Prompt Tuning | Unknown | N/A | |
| CoBIT: A Contrastive Bi-directional Image-Text Generation Model | Unknown | N/A | |
| Retrieval-based Disentangled Representation Learning with Natural Language Supervision | Unknown | N/A | |
| Large Language Models are Efficient Learners of Noise-Robust Speech Recognition | Unknown | N/A | |
| Composed Image Retrieval with Text Feedback via Multi-grained Uncertainty Regularization | Unknown | N/A | |
| Vision-Language Models are Zero-Shot Reward Models for Reinforcement Learning | Unknown | N/A | |
| Leveraging augmented-Lagrangian techniques for differentiating over infeasible quadratic programs in machine learning | Unknown | N/A | |
| FedDA: Faster Adaptive Gradient Methods for Federated Constrained Optimization | Unknown | N/A | |
| Conformal Language Modeling | Unknown | N/A | |
| GoLLIE: Annotation Guidelines improve Zero-Shot Information-Extraction | Unknown | N/A | |
| Space Group Constrained Crystal Generation | Unknown | N/A | |
| SliceGPT: Compress Large Language Models by Deleting Rows and Columns | Unknown | N/A | |
| Generating Images with 3D Annotations Using Diffusion Models | Unknown | N/A | |
| The importance of feature preprocessing for differentially private linear optimization | Unknown | N/A | |
| Conditional Instrumental Variable Regression with Representation Learning for Causal Inference | Unknown | N/A | |
| Stabilizing Contrastive RL: Techniques for Robotic Goal Reaching from Offline Data | Unknown | N/A | |
| Improving the Convergence of Dynamic NeRFs via Optimal Transport | Unknown | N/A | |
| Mean Field Theory in Deep Metric Learning | Unknown | N/A | |
| Prompt Risk Control: A Rigorous Framework for Responsible Deployment of Large Language Models | Unknown | N/A | |
| Abstractors and relational cross-attention: An inductive bias for explicit relational reasoning in Transformers | Unknown | N/A | |
| Ground-A-Video: Zero-shot Grounded Video Editing using Text-to-image Diffusion Models | Unknown | N/A | |
| Get What You Want, Not What You Don't: Image Content Suppression for Text-to-Image Diffusion Models | Unknown | N/A | |
| Confronting Reward Model Overoptimization with Constrained RLHF | Unknown | N/A | |
| Un-Mixing Test-Time Normalization Statistics: Combatting Label Temporal Correlation | Unknown | N/A | |
| InstructPix2NeRF: Instructed 3D Portrait Editing from a Single Image | Unknown | N/A | |
| Interpretable Diffusion via Information Decomposition | Unknown | N/A | |
| LQ-LoRA: Low-rank plus Quantized Matrix Decomposition for Efficient Language Model Finetuning | Unknown | N/A | |
| Adaptive Self-training Framework for Fine-grained Scene Graph Generation | Unknown | N/A | |
| Video Language Planning | Unknown | N/A | |
| Real-time Photorealistic Dynamic Scene Representation and Rendering with 4D Gaussian Splatting | Unknown | N/A | |
| Sparse Weight Averaging with Multiple Particles for Iterative Magnitude Pruning | Unknown | N/A | |
| Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts | Unknown | N/A | |
| Grounding Language Plans in Demonstrations Through Counterfactual Perturbations | Unknown | N/A | |
| Knowledge Card: Filling LLMs' Knowledge Gaps with Plug-in Specialized Language Models | Unknown | N/A | |
| Learning Decentralized Partially Observable Mean Field Control for Artificial Collective Behavior | Unknown | N/A | |
| Less or More From Teacher: Exploiting Trilateral Geometry For Knowledge Distillation | Unknown | N/A | |
| COPlanner: Plan to Roll Out Conservatively but to Explore Optimistically for Model-Based RL | Unknown | N/A | |
| Neuron-Enhanced AutoEncoder Matrix Completion and Collaborative Filtering: Theory and Practice | Unknown | N/A | |
| Efficient ConvBN Blocks for Transfer Learning and Beyond | Unknown | N/A | |
| Mind Your Augmentation: The Key to Decoupling Dense Self-Supervised Learning | Unknown | N/A | |
| RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval | Unknown | N/A | |
| Lipschitz Singularities in Diffusion Models | Unknown | N/A | |
| AUGCAL: Improving Sim2Real Adaptation by Uncertainty Calibration on Augmented Synthetic Images | Unknown | N/A | |
| RECOMBINER: Robust and Enhanced Compression with Bayesian Implicit Neural Representations | Unknown | N/A | |
| Understanding Augmentation-based Self-Supervised Representation Learning via RKHS Approximation and Regression | Unknown | N/A | |
| Adversarial Adaptive Sampling: Unify PINN and Optimal Transport for the Approximation of PDEs | Unknown | N/A | |
| Vision-Language Foundation Models as Effective Robot Imitators | Unknown | N/A | |
| Learning Nash Equilibria in Rank-1 Games | Unknown | N/A | |
| GTMGC: Using Graph Transformer to Predict Molecule’s Ground-State Conformation | Unknown | N/A | |
| FreeReg: Image-to-Point Cloud Registration Leveraging Pretrained Diffusion Models and Monocular Depth Estimators | Unknown | N/A | |
| Fake It Till Make It: Federated Learning with Consensus-Oriented Generation | Unknown | N/A | |
| Rephrase, Augment, Reason: Visual Grounding of Questions for Vision-Language Models | Unknown | N/A | |
| STREAM: Spatio-TempoRal Evaluation and Analysis Metric for Video Generative Models | Unknown | N/A | |
| METRA: Scalable Unsupervised RL with Metric-Aware Abstraction | Unknown | N/A | |
| Transferring Learning Trajectories of Neural Networks | Unknown | N/A | |
| Scalable and Effective Implicit Graph Neural Networks on Large Graphs | Unknown | N/A | |
| Rethinking Branching on Exact Combinatorial Optimization Solver: The First Deep Symbolic Discovery Framework | Unknown | N/A | |
| GraphPulse: Topological representations for temporal graph property prediction | Unknown | N/A | |
| Lewis's Signaling Game as beta-VAE For Natural Word Lengths and Segments | Unknown | N/A | |
| MERT: Acoustic Music Understanding Model with Large-Scale Self-supervised Training | Unknown | N/A | |
| Mediator Interpretation and Faster Learning Algorithms for Linear Correlated Equilibria in General Sequential Games | Unknown | N/A | |
| R-MAE: Regions Meet Masked Autoencoders | Unknown | N/A | |
| Label-Focused Inductive Bias over Latent Object Features in Visual Classification | Unknown | N/A | |
| Dissecting learning and forgetting in language model finetuning | Unknown | N/A | |
| Prediction without Preclusion: Recourse Verification with Reachable Sets | Unknown | N/A | |
| Gene Regulatory Network Inference in the Presence of Dropouts: a Causal View | Unknown | N/A | |
| Federated Causal Discovery from Heterogeneous Data | Unknown | N/A | |
| Amortized Network Intervention to Steer the Excitatory Point Processes | Unknown | N/A | |
| Decodable and Sample Invariant Continuous Object Encoder | Unknown | N/A | |
| Tangent Transformers for Composition, Privacy and Removal | Unknown | N/A | |
| Beyond Memorization: Violating Privacy via Inference with Large Language Models | Unknown | N/A | |
| Time Travel in LLMs: Tracing Data Contamination in Large Language Models | Unknown | N/A | |
| How Realistic Is Your Synthetic Data? Constraining Deep Generative Models for Tabular Data | Unknown | N/A | |
| CivRealm: A Learning and Reasoning Odyssey in Civilization for Decision-Making Agents | Unknown | N/A | |
| Masked Audio Generation using a Single Non-Autoregressive Transformer | Unknown | N/A | |
| Unconstrained Stochastic CCA: Unifying Multiview and Self-Supervised Learning | Unknown | N/A | |
| lpNTK: Better Generalisation with Less Data via Sample Interaction During Learning | Unknown | N/A | |
| Sudden Drops in the Loss: Syntax Acquisition, Phase Transitions, and Simplicity Bias in MLMs | Unknown | N/A | |
| Don't Trust: Verify -- Grounding LLM Quantitative Reasoning with Autoformalization | Unknown | N/A | |
| Sliced Wasserstein Estimation with Control Variates | Unknown | N/A | |
| VONet: Unsupervised Video Object Learning With Parallel U-Net Attention and Object-wise Sequential VAE | Unknown | N/A | |
| Relay Diffusion: Unifying diffusion process across resolutions for image synthesis | Unknown | N/A | |
| A representation-learning game for classes of prediction tasks | Unknown | N/A | |
| Query-Policy Misalignment in Preference-Based Reinforcement Learning | Unknown | N/A | |
| SparseFormer: Sparse Visual Recognition via Limited Latent Tokens | Unknown | N/A | |
| NECO: NEural Collapse Based Out-of-distribution detection | Unknown | N/A | |
| EmerNeRF: Emergent Spatial-Temporal Scene Decomposition via Self-Supervision | Unknown | N/A | |
| Pushing Mixture of Experts to the Limit: Extremely Parameter Efficient MoE for Instruction Tuning | Unknown | N/A | |
| Time-LLM: Time Series Forecasting by Reprogramming Large Language Models | Unknown | N/A | |
| Accurate Forgetting for Heterogeneous Federated Continual Learning | Unknown | N/A | |
| EControl: Fast Distributed Optimization with Compression and Error Control | Unknown | N/A | |
| Divide and not forget: Ensemble of selectively trained experts in Continual Learning | Unknown | N/A | |
| UniAdapter: Unified Parameter-Efficient Transfer Learning for Cross-modal Modeling | Unknown | N/A | |
| Learning No-Regret Sparse Generalized Linear Models with Varying Observation(s) | Unknown | N/A | |
| Nevis'22: A Stream of 100 Tasks Sampled from 30 Years of Computer Vision Research | Unknown | N/A | |
| Protein-ligand binding representation learning from fine-grained interactions | Unknown | N/A | |
| Universal Jailbreak Backdoors from Poisoned Human Feedback | Unknown | N/A | |
| VDT: General-purpose Video Diffusion Transformers via Mask Modeling | Unknown | N/A | |
| EfficientDM: Efficient Quantization-Aware Fine-Tuning of Low-Bit Diffusion Models | Unknown | N/A | |
| GraphCare: Enhancing Healthcare Predictions with Personalized Knowledge Graphs | Unknown | N/A | |
| Grokking in Linear Estimators -- A Solvable Model that Groks without Understanding | Unknown | N/A | |
| Constrained Decoding for Cross-lingual Label Projection | Unknown | N/A | |
| Efficient Heterogeneous Meta-Learning via Channel Shuffling Modulation | Unknown | N/A | |
| Tag2Text: Guiding Vision-Language Model via Image Tagging | Unknown | N/A | |
| Masked Autoencoders with Multi-Window Local-Global Attention Are Better Audio Learners | Unknown | N/A | |
| Multi-task Learning with 3D-Aware Regularization | Unknown | N/A | |
| Latent Intuitive Physics: Learning to Transfer Hidden Physics from A 3D Video | Unknown | N/A | |
| MixSATGEN: Learning Graph Mixing for SAT Instance Generation | Unknown | N/A | |
| On the Parameterization of Second-Order Optimization Effective towards the Infinite Width | Unknown | N/A | |
| Pareto Deep Long-Tailed Recognition: A Conflict-Averse Solution | Unknown | N/A | |
| Gradual Domain Adaptation via Gradient Flow | Unknown | N/A | |
| Can Sensitive Information Be Deleted From LLMs? Objectives for Defending Against Extraction Attacks | Unknown | N/A | |
| Layer-wise linear mode connectivity | Unknown | N/A | |
| Guiding Masked Representation Learning to Capture Spatio-Temporal Relationship of Electrocardiogram | Unknown | N/A | |
| Decoupling regularization from the action space | Unknown | N/A | |
| Source-Free and Image-Only Unsupervised Domain Adaptation for Category Level Object Pose Estimation | Unknown | N/A | |
| Learning in reverse causal strategic environments with ramifications on two sided markets | Unknown | N/A | |
| Learning to design protein-protein interactions with enhanced generalization | Unknown | N/A | |
| CLAP: Collaborative Adaptation for Patchwork Learning | Unknown | N/A | |
| Solving Challenging Math Word Problems Using GPT-4 Code Interpreter with Code-based Self-Verification | Unknown | N/A | |
| Towards Understanding Factual Knowledge of Large Language Models | Unknown | N/A | |
| Social Reward: Evaluating and Enhancing Generative AI through Million-User Feedback from an Online Creative Community | Unknown | N/A | |
| Invariance-based Learning of Latent Dynamics | Unknown | N/A | |
| Addressing Signal Delay in Deep Reinforcement Learning | Unknown | N/A | |
| GRANDE: Gradient-Based Decision Tree Ensembles for Tabular Data | Unknown | N/A | |
| ChatEval: Towards Better LLM-based Evaluators through Multi-Agent Debate | Unknown | N/A | |
| Differentially Private Synthetic Data via Foundation Model APIs 1: Images | Unknown | N/A | |
| On Differentially Private Federated Linear Contextual Bandits | Unknown | N/A | |
| Federated Wasserstein Distance | Unknown | N/A | |
| Win-Win: Training High-Resolution Vision Transformers from Two Windows | Unknown | N/A | |
| Efficient Episodic Memory Utilization of Cooperative Multi-Agent Reinforcement Learning | Unknown | N/A | |
| Privacy-Preserving In-Context Learning with Differentially Private Few-Shot Generation | Unknown | N/A | |
| Trajeglish: Traffic Modeling as Next-Token Prediction | Unknown | N/A | |
| Bayesian Low-rank Adaptation for Large Language Models | Unknown | N/A | |
| Detecting, Explaining, and Mitigating Memorization in Diffusion Models | Unknown | N/A | |
| An Intuitive Multi-Frequency Feature Representation for SO(3)-Equivariant Networks | Unknown | N/A | |
| CrossLoco: Human Motion Driven Control of Legged Robots via Guided Unsupervised Reinforcement Learning | Unknown | N/A | |
| Brain decoding: toward real-time reconstruction of visual perception | Unknown | N/A | |
| Cauchy-Schwarz Divergence Information Bottleneck for Regression | Unknown | N/A | |
| Adaptive Window Pruning for Efficient Local Motion Deblurring | Unknown | N/A | |
| Heterogeneous Personalized Federated Learning by Local-Global Updates Mixing via Convergence Rate | Unknown | N/A | |
| ViLMA: A Zero-Shot Benchmark for Linguistic and Temporal Grounding in Video-Language Models | Unknown | N/A | |
| TimeMixer: Decomposable Multiscale Mixing for Time Series Forecasting | Unknown | N/A | |
| Continuous-Multiple Image Outpainting in One-Step via Positional Query and A Diffusion-based Approach | Unknown | N/A | |
| Pre-Training Goal-based Models for Sample-Efficient Reinforcement Learning | Unknown | N/A | |
| A Primal-Dual Approach to Solving Variational Inequalities with General Constraints | Unknown | N/A | |
| Assessing Uncertainty in Similarity Scoring: Performance & Fairness in Face Recognition | Unknown | N/A | |
| CALICO: Self-Supervised Camera-LiDAR Contrastive Pre-training for BEV Perception | Unknown | N/A | |
| Robust Classification via Regression for Learning with Noisy Labels | Unknown | N/A | |
| Does CLIP’s generalization performance mainly stem from high train-test similarity? | Unknown | N/A | |
| GNNBoundary: Towards Explaining Graph Neural Networks through the Lens of Decision Boundaries | Unknown | N/A | |
| Flag Aggregator: Scalable Distributed Training under Failures and Augmented Losses using Convex Optimization | Unknown | N/A | |
| Understanding prompt engineering may not require rethinking generalization | Unknown | N/A | |
| Backdoor Federated Learning by Poisoning Backdoor-Critical Layers | Unknown | N/A | |
| Reverse Diffusion Monte Carlo | Unknown | N/A | |
| Spurious Feature Diversification Improves Out-of-distribution Generalization | Unknown | N/A | |
| Interpreting Robustness Proofs of Deep Neural Networks | Unknown | N/A | |
| OpenWebMath: An Open Dataset of High-Quality Mathematical Web Text | Unknown | N/A | |
| Incentive-Aware Federated Learning with Training-Time Model Rewards | Unknown | N/A | |
| SWAP-NAS: Sample-Wise Activation Patterns for Ultra-fast NAS | Unknown | N/A | |
| Escape Sky-high Cost: Early-stopping Self-Consistency for Multi-step Reasoning | Unknown | N/A | |
| Communication-Efficient Federated Non-Linear Bandit Optimization | Unknown | N/A | |
| Provable Robust Watermarking for AI-Generated Text | Unknown | N/A | |
| Respect the model: Fine-grained and Robust Explanation with Sharing Ratio Decomposition | Unknown | N/A | |
| Exploring Effective Stimulus Encoding via Vision System Modeling for Visual Prostheses | Unknown | N/A | |
| An LLM can Fool Itself: A Prompt-Based Adversarial Attack | Unknown | N/A | |
| WildFusion: Learning 3D-Aware Latent Diffusion Models in View Space | Unknown | N/A | |
| Prometheus: Inducing Fine-Grained Evaluation Capability in Language Models | Unknown | N/A | |
| Enhancing Human-AI Collaboration Through Logic-Guided Reasoning | Unknown | N/A | |
| Synaptic Weight Distributions Depend on the Geometry of Plasticity | Unknown | N/A | |
| ArchLock: Locking DNN Transferability at the Architecture Level with a Zero-Cost Binary Predictor | Unknown | N/A | |
| 3D-Aware Hypothesis & Verification for Generalizable Relative Object Pose Estimation | Unknown | N/A | |
| Outlier-Robust Subsampling Techniques for Persistent Homology | Unknown | N/A | |
| On Penalty Methods for Nonconvex Bilevel Optimization and First-Order Stochastic Approximation | Unknown | N/A | |
| ODEFormer: Symbolic Regression of Dynamical Systems with Transformers | Unknown | N/A | |
| VertiBench: Advancing Feature Distribution Diversity in Vertical Federated Learning Benchmarks | Unknown | N/A | |
| Efficient Video Diffusion Models via Content-Frame Motion-Latent Decomposition | Unknown | N/A | |
| Neuro-Inspired Information-Theoretic Hierarchical Perception for Multimodal Learning | Unknown | N/A | |
| Butterfly Effects of SGD Noise: Error Amplification in Behavior Cloning and Autoregression | Unknown | N/A | |
| Efficient Sharpness-Aware Minimization for Molecular Graph Transformer Models | Unknown | N/A | |
| When can transformers reason with abstract symbols? | Unknown | N/A | |
| Novel Quadratic Constraints for Extending LipSDP beyond Slope-Restricted Activations | Unknown | N/A | |
| DOS: Diverse Outlier Sampling for Out-of-Distribution Detection | Unknown | N/A | |
| Safe RLHF: Safe Reinforcement Learning from Human Feedback | Unknown | N/A | |
| Mayfly: a Neural Data Structure for Graph Stream Summarization | Unknown | N/A | |
| Steve-Eye: Equipping LLM-based Embodied Agents with Visual Perception in Open Worlds | Unknown | N/A | |
| Towards Faithful Explanations: Boosting Rationalization with Shortcuts Discovery | Unknown | N/A | |
| On Adversarial Training without Perturbing all Examples | Unknown | N/A | |
| ReLoRA: High-Rank Training Through Low-Rank Updates | Unknown | N/A | |
| VDC: Versatile Data Cleanser based on Visual-Linguistic Inconsistency by Multimodal Large Language Models | Unknown | N/A | |
| Provable Compositional Generalization for Object-Centric Learning | Unknown | N/A | |
| Neural Common Neighbor with Completion for Link Prediction | Unknown | N/A | |
| Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns | Unknown | N/A | |
| DataInf: Efficiently Estimating Data Influence in LoRA-tuned LLMs and Diffusion Models | Unknown | N/A | |
| Masked Completion via Structured Diffusion with White-Box Transformers | Unknown | N/A | |
| PubDef: Defending Against Transfer Attacks From Public Models | Unknown | N/A | |
| Is This the Subspace You Are Looking for? An Interpretability Illusion for Subspace Activation Patching | Unknown | N/A | |
| Efficient Streaming Language Models with Attention Sinks | Unknown | N/A | |
| Defining and extracting generalizable interaction primitives from DNNs | Unknown | N/A | |
| LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models | Unknown | N/A | |
| Fine-Tuning Language Models for Factuality | Unknown | N/A | |
| Minimax optimality of convolutional neural networks for infinite dimensional input-output problems and separation from kernel methods | Unknown | N/A | |
| Training Bayesian Neural Networks with Sparse Subspace Variational Inference | Unknown | N/A | |
| Spatially-Aware Transformers for Embodied Agents | Unknown | N/A | |
| DRSM: De-Randomized Smoothing on Malware Classifier Providing Certified Robustness | Unknown | N/A | |
| Seer: Language Instructed Video Prediction with Latent Diffusion Models | Unknown | N/A | |
| Let's do the time-warp-attend: Learning topological invariants of dynamical systems | Unknown | N/A | |
| Querying Easily Flip-flopped Samples for Deep Active Learning | Unknown | N/A | |
| Quasi-Monte Carlo for 3D Sliced Wasserstein | Unknown | N/A | |
| TAB: Temporal Accumulated Batch Normalization in Spiking Neural Networks | Unknown | N/A | |
| Large Content And Behavior Models To Understand, Simulate, And Optimize Content And Behavior | Unknown | N/A | |
| Improved algorithm and bounds for successive projection | Unknown | N/A | |
| Statistical Perspective of Top-K Sparse Softmax Gating Mixture of Experts | Unknown | N/A | |
| Mol-Instructions: A Large-Scale Biomolecular Instruction Dataset for Large Language Models | Unknown | N/A | |
| GeoLLM: Extracting Geospatial Knowledge from Large Language Models | Unknown | N/A | |
| LILO: Learning Interpretable Libraries by Compressing and Documenting Code | Unknown | N/A | |
| NOLA: Compressing LoRA using Linear Combination of Random Basis | Unknown | N/A | |
| Learning Hierarchical World Models with Adaptive Temporal Abstractions from Discrete Latent Dynamics | Unknown | N/A | |
| Beyond Vanilla Variational Autoencoders: Detecting Posterior Collapse in Conditional and Hierarchical Variational Autoencoders | Unknown | N/A | |
| Faithful and Efficient Explanations for Neural Networks via Neural Tangent Kernel Surrogate Models | Unknown | N/A | |
| On Characterizing the Trade-off in Invariant Representation Learning | Unknown | N/A | |
| Diffeomorphic Mesh Deformation via Efficient Optimal Transport for Cortical Surface Reconstruction | Unknown | N/A | |
| Learning Semantic Proxies from Visual Prompts for Parameter-Efficient Fine-Tuning in Deep Metric Learning | Unknown | N/A | |
| Llemma: An Open Language Model for Mathematics | Unknown | N/A | |
| Zeroth-Order Optimization Meets Human Feedback: Provable Learning via Ranking Oracles | Unknown | N/A | |
| Distributionally Robust Optimization with Bias and Variance Reduction | Unknown | N/A | |
| Adapting and Evaluating Influence-Estimation Methods for Gradient-Boosted Decision Trees | Unknown | N/A | |
| Fast Hyperboloid Decision Tree Algorithms | Unknown | N/A | |
| TiC-CLIP: Continual Training of CLIP Models | Unknown | N/A | |
| Neural Ordinary Differential Equations for Modeling Epidemic Spreading | Unknown | N/A | |
| Tree Cross Attention | Unknown | N/A | |
| Proximal Policy Gradient Arborescence for Quality Diversity Reinforcement Learning | Unknown | N/A | |
| Understanding Reconstruction Attacks with the Neural Tangent Kernel and Dataset Distillation | Unknown | N/A | |
| Multi-resolution HuBERT: Multi-resolution Speech Self-Supervised Learning with Masked Unit Prediction | Unknown | N/A | |
| Neur2RO: Neural Two-Stage Robust Optimization | Unknown | N/A | |
| Variational Bayesian Last Layers | Unknown | N/A | |
| Finetuning Text-to-Image Diffusion Models for Fairness | Unknown | N/A | |
| IDEAL: Influence-Driven Selective Annotations Empower In-Context Learners in Large Language Models | Unknown | N/A | |
| Improving Non-Transferable Representation Learning by Harnessing Content and Style | Unknown | N/A | |
| AmortizedPeriod: Attention-based Amortized Inference for Periodicity Identification | Unknown | N/A | |
| Decision ConvFormer: Local Filtering in MetaFormer is Sufficient for Decision Making | Unknown | N/A | |
| CABINET: Content Relevance-based Noise Reduction for Table Question Answering | Unknown | N/A | |
| Efficient local linearity regularization to overcome catastrophic overfitting | Unknown | N/A | |
| TEMPO: Prompt-based Generative Pre-trained Transformer for Time Series Forecasting | Unknown | N/A | |
| Social-Transmotion: Promptable Human Trajectory Prediction | Unknown | N/A | |
| The Devil is in the Neurons: Interpreting and Mitigating Social Biases in Language Models | Unknown | N/A | |
| Controlled Text Generation via Language Model Arithmetic | Unknown | N/A | |
| Neural Fourier Transform: A General Approach to Equivariant Representation Learning | Unknown | N/A | |
| Hybrid Sharing for Multi-Label Image Classification | Unknown | N/A | |
| Learning invariant representations of time-homogeneous stochastic dynamical systems | Unknown | N/A | |
| Label-Agnostic Forgetting: A Supervision-Free Unlearning in Deep Models | Unknown | N/A | |
| On the Fairness ROAD: Robust Optimization for Adversarial Debiasing | Unknown | N/A | |
| Understanding Transferable Representation Learning and Zero-shot Transfer in CLIP | Unknown | N/A | |
| Complex priors and flexible inference in recurrent circuits with dendritic nonlinearities | Unknown | N/A | |
| A Variational Framework for Estimating Continuous Treatment Effects with Measurement Error | Unknown | N/A | |
| SolidGen: An Autoregressive Model for Direct B-rep Synthesis | Unknown | N/A | |
| DEEP NEURAL NETWORK INITIALIZATION WITH SPARSITY INDUCING ACTIVATIONS | Unknown | N/A | |
| Guiding Instruction-based Image Editing via Multimodal Large Language Models | Unknown | N/A | |
| A Foundation Model for Error Correction Codes | Unknown | N/A | |
| Mixed-Type Tabular Data Synthesis with Score-based Diffusion in Latent Space | Unknown | N/A | |
| Learning to Make Adherence-aware Advice | Unknown | N/A | |
| Tree Search-Based Policy Optimization under Stochastic Execution Delay | Unknown | N/A | |
| High Fidelity Neural Audio Compression | Unknown | N/A | |
| Sufficient conditions for offline reactivation in recurrent neural networks | Unknown | N/A | |
| A Generalist Agent | Unknown | N/A | |
| Scalable Real-Time Recurrent Learning Using Columnar-Constructive Networks | Unknown | N/A | |
| Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis | Unknown | N/A | |
| Offline Data Enhanced On-Policy Policy Gradient with Provable Guarantees | Unknown | N/A | |
| InsertNeRF: Instilling Generalizability into NeRF with HyperNet Modules | Unknown | N/A | |
| Towards Optimal Regret in Adversarial Linear MDPs with Bandit Feedback | Unknown | N/A | |
| Quantifying Language Models' Sensitivity to Spurious Features in Prompt Design or: How I learned to start worrying about prompt formatting | Unknown | N/A | |
| Graph Generation with $K^2$-trees | Unknown | N/A | |
| Memory-Consistent Neural Networks for Imitation Learning | Unknown | N/A | |
| PRIME: Prioritizing Interpretability in Failure Mode Extraction | Unknown | N/A | |
| Bayesian Neural Controlled Differential Equations for Treatment Effect Estimation | Unknown | N/A | |
| OpenTab: Advancing Large Language Models as Open-domain Table Reasoners | Unknown | N/A | |
| Lemur: Integrating Large Language Models in Automated Program Verification | Unknown | N/A | |
| A Simple and Effective Pruning Approach for Large Language Models | Unknown | N/A | |
| Safety-Tuned LLaMAs: Lessons From Improving the Safety of Large Language Models that Follow Instructions | Unknown | N/A | |
| Quadratic models for understanding catapult dynamics of neural networks | Unknown | N/A | |
| Geographic Location Encoding with Spherical Harmonics and Sinusoidal Representation Networks | Unknown | N/A | |
| Copilot4D: Learning Unsupervised World Models for Autonomous Driving via Discrete Diffusion | Unknown | N/A | |
| Learning to Solve Bilevel Programs with Binary Tender | Unknown | N/A | |
| PoSE: Efficient Context Window Extension of LLMs via Positional Skip-wise Training | Unknown | N/A | |
| Fusing Models with Complementary Expertise | Unknown | N/A | |
| Making RL with Preference-based Feedback Efficient via Randomization | Unknown | N/A | |
| Local Search GFlowNets | Unknown | N/A | |
| Never Train from Scratch: Fair Comparison of Long-Sequence Models Requires Data-Driven Priors | Unknown | N/A | |
| Data-independent Module-aware Pruning for Hierarchical Vision Transformers | Unknown | N/A | |
| Matryoshka Diffusion Models | Unknown | N/A | |
| Pre-training with Synthetic Data Helps Offline Reinforcement Learning | Unknown | N/A | |
| SOHES: Self-supervised Open-world Hierarchical Entity Segmentation | Unknown | N/A | |
| Understanding the Effects of RLHF on LLM Generalisation and Diversity | Unknown | N/A | |
| Overcoming the Pitfalls of Vision-Language Model Finetuning for OOD Generalization | Unknown | N/A | |
| Addressing Loss of Plasticity and Catastrophic Forgetting in Continual Learning | Unknown | N/A | |
| Reinforcement Symbolic Regression Machine | Unknown | N/A | |
| RTFS-Net: Recurrent Time-Frequency Modelling for Efficient Audio-Visual Speech Separation | Unknown | N/A | |
| BRUSLEATTACK: A QUERY-EFFICIENT SCORE-BASED BLACK-BOX SPARSE ADVERSARIAL ATTACK | Unknown | N/A | |
| Learning Energy Decompositions for Partial Inference in GFlowNets | Unknown | N/A | |
| Unlocking the Power of Representations in Long-term Novelty-based Exploration | Unknown | N/A | |
| Augmented Bayesian Policy Search | Unknown | N/A | |
| ImagenHub: Standardizing the evaluation of conditional image generation models | Unknown | N/A | |
| Universal Humanoid Motion Representations for Physics-Based Control | Unknown | N/A | |
| PILOT: An $\mathcal{O}(1/K)$-Convergent Approach for Policy Evaluation with Nonlinear Function Approximation | Unknown | N/A | |
| Estimating Conditional Mutual Information for Dynamic Feature Selection | Unknown | N/A | |
| ALAM: Averaged Low-Precision Activation for Memory-Efficient Training of Transformer Models | Unknown | N/A | |
| Spectrally Transformed Kernel Regression | Unknown | N/A | |
| Adversarial AutoMixup | Unknown | N/A | |
| First-order ANIL provably learns representations despite overparametrisation | Unknown | N/A | |
| Diffusion Generative Flow Samplers: Improving learning signals through partial trajectory optimization | Unknown | N/A | |
| GenSim: Generating Robotic Simulation Tasks via Large Language Models | Unknown | N/A | |
| Multi-View Causal Representation Learning with Partial Observability | Unknown | N/A | |
| Long-Term Typhoon Trajectory Prediction: A Physics-Conditioned Approach Without Reanalysis Data | Unknown | N/A | |
| DragonDiffusion: Enabling Drag-style Manipulation on Diffusion Models | Unknown | N/A | |
| Leveraging Optimization for Adaptive Attacks on Image Watermarks | Unknown | N/A | |
| CNN Kernels Can Be the Best Shapelets | Unknown | N/A | |
| A Lie Group Approach to Riemannian Batch Normalization | Unknown | N/A | |
| Modelling complex vector drawings with stroke-clouds | Unknown | N/A | |
| WizardLM: Empowering Large Pre-Trained Language Models to Follow Complex Instructions | Unknown | N/A | |
| Exposing Text-Image Inconsistency Using Diffusion Models | Unknown | N/A | |
| Task Planning for Visual Room Rearrangement under Partial Observability | Unknown | N/A | |
| FOSI: Hybrid First and Second Order Optimization | Unknown | N/A | |
| InstructCV: Instruction-Tuned Text-to-Image Diffusion Models as Vision Generalists | Unknown | N/A | |
| 3D Reconstruction with Generalizable Neural Fields using Scene Priors | Unknown | N/A | |
| Lagrangian Flow Networks for Conservation Laws | Unknown | N/A | |
| fairret: a Framework for Differentiable Fairness Regularization Terms | Unknown | N/A | |
| LitCab: Lightweight Language Model Calibration over Short- and Long-form Responses | Unknown | N/A | |
| Massive Editing for Large Language Models via Meta Learning | Unknown | N/A | |
| Fantastic Generalization Measures are Nowhere to be Found | Unknown | N/A | |
| VeRA: Vector-based Random Matrix Adaptation | Unknown | N/A | |
| Towards Imitation Learning to Branch for MIP: A Hybrid Reinforcement Learning based Sample Augmentation Approach | Unknown | N/A | |
| Implicit Gaussian process representation of vector fields over arbitrary latent manifolds | Unknown | N/A | |
| Reclaiming the Source of Programmatic Policies: Programmatic versus Latent Spaces | Unknown | N/A | |
| V-DETR: DETR with Vertex Relative Position Encoding for 3D Object Detection | Unknown | N/A | |
| TESTAM: A Time-Enhanced Spatio-Temporal Attention Model with Mixture of Experts | Unknown | N/A | |
| Learning Multi-Faceted Prototypical User Interests | Unknown | N/A | |
| PeFLL: Personalized Federated Learning by Learning to Learn | Unknown | N/A | |
| Hebbian Learning based Orthogonal Projection for Continual Learning of Spiking Neural Networks | Unknown | N/A | |
| Delta-AI: Local objectives for amortized inference in sparse graphical models | Unknown | N/A | |
| DrM: Mastering Visual Reinforcement Learning through Dormant Ratio Minimization | Unknown | N/A | |
| Effective Structural Encodings via Local Curvature Profiles | Unknown | N/A | |
| Batch Calibration: Rethinking Calibration for In-Context Learning and Prompt Engineering | Unknown | N/A | |
| SyncDreamer: Generating Multiview-consistent Images from a Single-view Image | Unknown | N/A | |
| InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation | Unknown | N/A | |
| Improving Offline RL by Blending Heuristics | Unknown | N/A | |
| Learning dynamic representations of the functional connectome in neurobiological networks | Unknown | N/A | |
| RAPPER: Reinforced Rationale-Prompted Paradigm for Natural Language Explanation in Visual Question Answering | Unknown | N/A | |
| Learning interpretable control inputs and dynamics underlying animal locomotion | Unknown | N/A | |
| The Effectiveness of Random Forgetting for Robust Generalization | Unknown | N/A | |
| AMAGO: Scalable In-Context Reinforcement Learning for Adaptive Agents | Unknown | N/A | |
| Unraveling the Key Components of OOD Generalization via Diversification | Unknown | N/A | |
| Bilevel Optimization under Unbounded Smoothness: A New Algorithm and Convergence Analysis | Unknown | N/A | |
| JoMA: Demystifying Multilayer Transformers via Joint Dynamics of MLP and Attention | Unknown | N/A | |
| ODICE: Revealing the Mystery of Distribution Correction Estimation via Orthogonal-gradient Update | Unknown | N/A | |
| Online GNN Evaluation Under Test-time Graph Distribution Shifts | Unknown | N/A | |
| Particle Guidance: non-I.I.D. Diverse Sampling with Diffusion Models | Unknown | N/A | |
| Cascading Reinforcement Learning | Unknown | N/A | |
| CCIL: Continuity-Based Data Augmentation for Corrective Imitation Learning | Unknown | N/A | |
| A Black-box Approach for Non-stationary Multi-agent Reinforcement Learning | Unknown | N/A | |
| Disentangling Time Series Representations via Contrastive Independence-of-Support on l-Variational Inference | Unknown | N/A | |
| LoftQ: LoRA-Fine-Tuning-aware Quantization for Large Language Models | Unknown | N/A | |
| ReMasker: Imputing Tabular Data with Masked Autoencoding | Unknown | N/A | |
| A Cognitive Model for Learning Abstract Relational Structures from Memory-based Decision-Making Tasks | Unknown | N/A | |
| Diving Segmentation Model into Pixels | Unknown | N/A | |
| Robust Similarity Learning with Difference Alignment Regularization | Unknown | N/A | |
| Exploring Weight Balancing on Long-Tailed Recognition Problem | Unknown | N/A | |
| Towards Enhancing Time Series Contrastive Learning: A Dynamic Bad Pair Mining Approach | Unknown | N/A | |
| GENOME: Generative Neuro-Symbolic Visual Reasoning by Growing and Reusing Modules | Unknown | N/A | |
| Plug-and-Play: An Efficient Post-training Pruning Method for Large Language Models | Unknown | N/A | |
| Rethinking Channel Dimensions to Isolate Outliers for Low-bit Weight Quantization of Large Language Models | Unknown | N/A | |
| LabelDP-Pro: Learning with Label Differential Privacy via Projections | Unknown | N/A | |
| FLATTEN: optical FLow-guided ATTENtion for consistent text-to-video editing | Unknown | N/A | |
| Navigating Text-To-Image Customization: From LyCORIS Fine-Tuning to Model Evaluation | Unknown | N/A | |
| Learning to Jointly Understand Visual and Tactile Signals | Unknown | N/A | |
| AirPhyNet: Harnessing Physics-Guided Neural Networks for Air Quality Prediction | Unknown | N/A | |
| A path-norm toolkit for modern networks: consequences, promises and challenges | Unknown | N/A | |
| Deep Confident Steps to New Pockets: Strategies for Docking Generalization | Unknown | N/A | |
| Training-free Multi-objective Diffusion Model for 3D Molecule Generation | Unknown | N/A | |
| Towards Codable Watermarking for Injecting Multi-Bits Information to LLMs | Unknown | N/A | |
| Scaling Laws of RoPE-based Extrapolation | Unknown | N/A | |
| UniTabE: A Universal Pretraining Protocol for Tabular Foundation Model in Data Science | Unknown | N/A | |
| Rethinking Label Poisoning for GNNs: Pitfalls and Attacks | Unknown | N/A | |
| Efficient Score Matching with Deep Equilibrium Layers | Unknown | N/A | |
| Jointly-Learned Exit and Inference for a Dynamic Neural Network | Unknown | N/A | |
| Private Zeroth-Order Nonsmooth Nonconvex Optimization | Unknown | N/A | |
| Can we get the best of both Binary Neural Networks and Spiking Neural Networks for Efficient Computer Vision? | Unknown | N/A | |
| Deep Neural Networks Tend To Extrapolate Predictably | Unknown | N/A | |
| On the Vulnerability of Adversarially Trained Models Against Two-faced Attacks | Unknown | N/A | |
| Scalable Language Model with Generalized Continual Learning | Unknown | N/A | |
| FedCDA: Federated Learning with Cross-rounds Divergence-aware Aggregation | Unknown | N/A | |
| WebArena: A Realistic Web Environment for Building Autonomous Agents | Unknown | N/A | |
| Fantastic Gains and Where to Find Them: On the Existence and Prospect of General Knowledge Transfer between Any Pretrained Model | Unknown | N/A | |
| Language Model Inversion | Unknown | N/A | |
| A Good Learner can Teach Better: Teacher-Student Collaborative Knowledge Distillation | Unknown | N/A | |
| Consistent Video-to-Video Transfer Using Synthetic Dataset | Unknown | N/A | |
| GraphChef: Decision-Tree Recipes to Explain Graph Neural Networks | Unknown | N/A | |
| Benign Oscillation of Stochastic Gradient Descent with Large Learning Rate | Unknown | N/A | |
| Towards Generative Abstract Reasoning: Completing Raven’s Progressive Matrix via Rule Abstraction and Selection | Unknown | N/A | |
| Adaptive deep spiking neural network with global-local learning via balanced excitatory and inhibitory mechanism | Unknown | N/A | |
| Davidsonian Scene Graph: Improving Reliability in Fine-grained Evaluation for Text-to-Image Generation | Unknown | N/A | |
| Learning Hierarchical Image Segmentation For Recognition and By Recognition | Unknown | N/A | |
| Uncertainty-aware Constraint Inference in Inverse Constrained Reinforcement Learning | Unknown | N/A | |
| DIFFTACTILE: A Physics-based Differentiable Tactile Simulator for Contact-rich Robotic Manipulation | Unknown | N/A | |
| On the Role of Discrete Tokenization in Visual Representation Learning | Unknown | N/A | |
| In-Context Learning Dynamics with Random Binary Sequences | Unknown | N/A | |
| Influencer Backdoor Attack on Semantic Segmentation | Unknown | N/A | |
| NeurRev: Train Better Sparse Neural Network Practically via Neuron Revitalization | Unknown | N/A | |
| Low Rank Matrix Completion via Robust Alternating Minimization in Nearly Linear Time | Unknown | N/A | |
| Whittle Index with Multiple Actions and State Constraint for Inventory Management | Unknown | N/A | |
| ControlVideo: Training-free Controllable Text-to-video Generation | Unknown | N/A | |
| Jointly Training Large Autoregressive Multimodal Models | Unknown | N/A | |
| Time-Efficient Reinforcement Learning with Stochastic Stateful Policies | Unknown | N/A | |
| MiniLLM: Knowledge Distillation of Large Language Models | Unknown | N/A | |
| Learning Adaptive Multiresolution Transforms via Meta-Framelet-based Graph Convolutional Network | Unknown | N/A | |
| Multiscale Positive-Unlabeled Detection of AI-Generated Texts | Unknown | N/A | |
| Implicit Neural Representation Inference for Low-Dimensional Bayesian Deep Learning | Unknown | N/A | |
| When Scaling Meets LLM Finetuning: The Effect of Data, Model and Finetuning Method | Unknown | N/A | |
| The Effective Horizon Explains Deep RL Performance in Stochastic Environments | Unknown | N/A | |
| Interpreting CLIP's Image Representation via Text-Based Decomposition | Unknown | N/A | |
| Demystifying CLIP Data | Unknown | N/A | |
| Optimal Sample Complexity for Average Reward Markov Decision Processes | Unknown | N/A | |
| Space and time continuous physics simulation from partial observations | Unknown | N/A | |
| Learning 3D Particle-based Simulators from RGB-D Videos | Unknown | N/A | |
| Pessimistic Nonlinear Least-Squares Value Iteration for Offline Reinforcement Learning | Unknown | N/A | |
| The Hedgehog & the Porcupine: Expressive Linear Attentions with Softmax Mimicry | Unknown | N/A | |
| EMO: EARTH MOVER DISTANCE OPTIMIZATION FOR AUTO-REGRESSIVE LANGUAGE MODELING | Unknown | N/A | |
| Adaptive Instrument Design for Indirect Experiments | Unknown | N/A | |
| Towards Foundation Models for Knowledge Graph Reasoning | Unknown | N/A | |
| Locality-Aware Graph Rewiring in GNNs | Unknown | N/A | |
| Curiosity-driven Red-teaming for Large Language Models | Unknown | N/A | |
| The Devil is in the Object Boundary: Towards Annotation-free Instance Segmentation using Foundation Models | Unknown | N/A | |
| Chameleon: Increasing Label-Only Membership Leakage with Adaptive Poisoning | Unknown | N/A | |
| LCOT: Linear Circular Optimal Transport | Unknown | N/A | |
| OPTIMAL ROBUST MEMORIZATION WITH RELU NEURAL NETWORKS | Unknown | N/A | |
| Stochastic Controlled Averaging for Federated Learning with Communication Compression | Unknown | N/A | |
| DORSal: Diffusion for Object-centric Representations of Scenes $\textit{et al.}$ | Unknown | N/A | |
| REBAR: Retrieval-Based Reconstruction for Time-series Contrastive Learning | Unknown | N/A | |
| T-Rep: Representation Learning for Time Series using Time-Embeddings | Unknown | N/A | |
| Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training | Unknown | N/A | |
| Sharpness-Aware Minimization Enhances Feature Quality via Balanced Learning | Unknown | N/A | |
| Boosting Vanilla Lightweight Vision Transformers via Re-parameterization | Unknown | N/A | |
| Rethinking the Uniformity Metric in Self-Supervised Learning | Unknown | N/A | |
| Detecting Machine-Generated Texts by Multi-Population Aware Optimization for Maximum Mean Discrepancy | Unknown | N/A | |
| SEGNO: Generalizing Equivariant Graph Neural Networks with Physical Inductive Biases | Unknown | N/A | |
| Identifying Representations for Intervention Extrapolation | Unknown | N/A | |
| Fusion Is Not Enough: Single Modal Attacks on Fusion Models for 3D Object Detection | Unknown | N/A | |
| Generalization error of spectral algorithms | Unknown | N/A | |
| CrIBo: Self-Supervised Learning via Cross-Image Object-Level Bootstrapping | Unknown | N/A | |
| The Reasonableness Behind Unreasonable Translation Capability of Large Language Model | Unknown | N/A | |
| On Representation Complexity of Model-based and Model-free Reinforcement Learning | Unknown | N/A | |
| EX-Graph: A Pioneering Dataset Bridging Ethereum and X | Unknown | N/A | |
| Self-Supervised Heterogeneous Graph Learning: a Homophily and Heterogeneity View | Unknown | N/A | |
| Chain of Thought Empowers Transformers to Solve Inherently Serial Problems | Unknown | N/A | |
| Beating Price of Anarchy and Gradient Descent without Regret in Potential Games | Unknown | N/A | |
| Conformal Risk Control | Unknown | N/A | |
| Fair Classifiers that Abstain without Harm | Unknown | N/A | |
| Repelling Random Walks | Unknown | N/A | |
| Separating common from salient patterns with Contrastive Representation Learning | Unknown | N/A | |
| Instant3D: Fast Text-to-3D with Sparse-view Generation and Large Reconstruction Model | Unknown | N/A | |
| Learning the greatest common divisor: explaining transformer predictions | Unknown | N/A | |
| BroGNet: Momentum-Conserving Graph Neural Stochastic Differential Equation for Learning Brownian Dynamics | Unknown | N/A | |
| Diagnosing Transformers: Illuminating Feature Spaces for Clinical Decision-Making | Unknown | N/A | |
| Vision Transformers Need Registers | Unknown | N/A | |
| Efficient and Scalable Graph Generation through Iterative Local Expansion | Unknown | N/A | |
| PlaSma: Procedural Knowledge Models for Language-based Planning and Re-Planning | Unknown | N/A | |
| Lipsum-FT: Robust Fine-Tuning of Zero-Shot Models Using Random Text Guidance | Unknown | N/A | |
| Neural Spectral Methods: Self-supervised learning in the spectral domain | Unknown | N/A | |
| Zero-Shot Continuous Prompt Transfer: Generalizing Task Semantics Across Language Models | Unknown | N/A | |
| RETSim: Resilient and Efficient Text Similarity | Unknown | N/A | |
| MUFFIN: Curating Multi-Faceted Instructions for Improving Instruction Following | Unknown | N/A | |
| ConjNorm: Tractable Density Estimation for Out-of-Distribution Detection | Unknown | N/A | |
| Convolutional Deep Kernel Machines | Unknown | N/A | |
| Theoretical Analysis of Robust Overfitting for Wide DNNs: An NTK Approach | Unknown | N/A | |
| ReSimAD: Zero-Shot 3D Domain Transfer for Autonomous Driving with Source Reconstruction and Target Simulation | Unknown | N/A | |
| I-PHYRE: Interactive Physical Reasoning | Unknown | N/A | |
| Defining Expertise: Applications to Treatment Effect Estimation | Unknown | N/A | |
| A Variational Perspective on Solving Inverse Problems with Diffusion Models | Unknown | N/A | |
| Beyond IID weights: sparse and low-rank deep Neural Networks are also Gaussian Processes | Unknown | N/A | |
| Be Careful What You Smooth For: Label Smoothing Can Be a Privacy Shield but Also a Catalyst for Model Inversion Attacks | Unknown | N/A | |
| Spike-driven Transformer V2: Meta Spiking Neural Network Architecture Inspiring the Design of Next-generation Neuromorphic Chips | Unknown | N/A | |
| Network Memory Footprint Compression Through Jointly Learnable Codebooks and Mappings | Unknown | N/A | |
| MAMBA: an Effective World Model Approach for Meta-Reinforcement Learning | Unknown | N/A | |
| Data Distillation Can Be Like Vodka: Distilling More Times For Better Quality | Unknown | N/A | |
| LipSim: A Provably Robust Perceptual Similarity Metric | Unknown | N/A | |
| SALMONN: Towards Generic Hearing Abilities for Large Language Models | Unknown | N/A | |
| Distributional Preference Learning: Understanding and Accounting for Hidden Context in RLHF | Unknown | N/A | |
| Project and Probe: Sample-Efficient Adaptation by Interpolating Orthogonal Features | Unknown | N/A | |
| UNR-Explainer: Counterfactual Explanations for Unsupervised Node Representation Learning Models | Unknown | N/A | |
| On Accelerating Diffusion-Based Sampling Processes via Improved Integration Approximation | Unknown | N/A | |
| Q-Bench: A Benchmark for General-Purpose Foundation Models on Low-level Vision | Unknown | N/A | |
| From Latent Graph to Latent Topology Inference: Differentiable Cell Complex Module | Unknown | N/A | |
| AutoLoRa: An Automated Robust Fine-Tuning Framework | Unknown | N/A | |
| Dropout Enhanced Bilevel Training | Unknown | N/A | |
| Predicting Emergent Abilities with Infinite Resolution Evaluation | Unknown | N/A | |
| LightHGNN: Distilling Hypergraph Neural Networks into MLPs for 100x Faster Inference | Unknown | N/A | |
| On Diffusion Modeling for Anomaly Detection | Unknown | N/A | |
| Efficient Modulation for Vision Networks | Unknown | N/A | |
| Finite-State Autoregressive Entropy Coding for Efficient Learned Lossless Compression | Unknown | N/A | |
| G$^2$N$^2$: Weisfeiler and Lehman go grammatical | Unknown | N/A | |
| Time-Varying Propensity Score to Bridge the Gap between the Past and Present | Unknown | N/A | |
| Discovering modular solutions that generalize compositionally | Unknown | N/A | |
| Be More Active! Understanding the Differences Between Mean and Sampled Representations of Variational Autoencoders | Unknown | N/A | |
| Alleviating Exposure Bias in Diffusion Models through Sampling with Shifted Time Steps | Unknown | N/A | |
| Meta Inverse Constrained Reinforcement Learning: Convergence Guarantee and Generalization Analysis | Unknown | N/A | |
| Masked Structural Growth for 2x Faster Language Model Pre-training | Unknown | N/A | |
| On the Generalization and Approximation Capacities of Neural Controlled Differential Equations | Unknown | N/A | |
| Robust Angular Synchronization via Directed Graph Neural Networks | Unknown | N/A | |
| Maximum Entropy Model Correction in Reinforcement Learning | Unknown | N/A | |
| Improving protein optimization with smoothed fitness landscapes | Unknown | N/A | |
| Sample-Efficient Learning of POMDPs with Multiple Observations In Hindsight | Unknown | N/A | |
| Quantifying the Plausibility of Context Reliance in Neural Machine Translation | Unknown | N/A | |
| BrainSCUBA: Fine-Grained Natural Language Captions of Visual Cortex Selectivity | Unknown | N/A | |
| Enabling Efficient Equivariant Operations in the Fourier Basis via Gaunt Tensor Products | Unknown | N/A | |
| Views Can Be Deceiving: Improved SSL Through Feature Space Augmentation | Unknown | N/A | |
| Flat Minima in Linear Estimation and an Extended Gauss Markov Theorem | Unknown | N/A | |
| Sample-Efficient Multi-Agent RL: An Optimization Perspective | Unknown | N/A | |
| Multimodal Molecular Pretraining via Modality Blending | Unknown | N/A | |
| Online Information Acquisition: Hiring Multiple Agents | Unknown | N/A | |
| RAIN: Your Language Models Can Align Themselves without Finetuning | Unknown | N/A | |
| Lie Group Decompositions for Equivariant Neural Networks | Unknown | N/A | |
| Masks, Signs, And Learning Rate Rewinding | Unknown | N/A | |
| Catastrophic Jailbreak of Open-source LLMs via Exploiting Generation | Unknown | N/A | |
| Random Sparse Lifts: Construction, Analysis and Convergence of finite sparse networks | Unknown | N/A | |
| TabR: Tabular Deep Learning Meets Nearest Neighbors | Unknown | N/A | |
| Conformal Prediction via Regression-as-Classification | Unknown | N/A | |
| The Human-AI Substitution game: active learning from a strategic labeler | Unknown | N/A | |
| Probabilistic Self-supervised Representation Learning via Scoring Rules Minimization | Unknown | N/A | |
| Classification with Conceptual Safeguards | Unknown | N/A | |
| $\mathbb{D}^2$ Pruning: Message Passing for Balancing Diversity & Difficulty in Data Pruning | Unknown | N/A | |
| Scaling physics-informed hard constraints with mixture-of-experts | Unknown | N/A | |
| On Stationary Point Convergence of PPO-Clip | Unknown | N/A | |
| How to Capture Higher-order Correlations? Generalizing Matrix Softmax Attention to Kronecker Computation | Unknown | N/A | |
| General Graph Random Features | Unknown | N/A | |
| Are Models Biased on Text without Gender-related Language? | Unknown | N/A | |
| Uni3D: Exploring Unified 3D Representation at Scale | Unknown | N/A | |
| Privacy-Preserving In-Context Learning for Large Language Models | Unknown | N/A | |
| A Discretization Framework for Robust Contextual Stochastic Optimization | Unknown | N/A | |
| Chain of Log-Concave Markov Chains | Unknown | N/A | |
| A Quadratic Synchronization Rule for Distributed Deep Learning | Unknown | N/A | |
| Perceptual Scales Predicted by Fisher Information Metrics | Unknown | N/A | |
| Protein Discovery with Discrete Walk-Jump Sampling | Unknown | N/A | |
| Fast and unified path gradient estimators for normalizing flows | Unknown | N/A | |
| Out-of-Variable Generalisation for Discriminative Models | Unknown | N/A | |
| A Simple and Scalable Representation for Graph Generation | Unknown | N/A | |
| Listen, Think, and Understand | Unknown | N/A | |
| FlashAttention-2: Faster Attention with Better Parallelism and Work Partitioning | Unknown | N/A | |
| TokenFlow: Consistent Diffusion Features for Consistent Video Editing | Unknown | N/A | |
| Language-Informed Visual Concept Learning | Unknown | N/A | |
| Turning large language models into cognitive models | Unknown | N/A | |
| Neural Snowflakes: Universal Latent Graph Inference via Trainable Latent Geometries | Unknown | N/A | |
| Multi-Task Reinforcement Learning with Mixture of Orthogonal Experts | Unknown | N/A | |
| Unveiling Options with Neural Network Decomposition | Unknown | N/A | |
| Towards the Fundamental Limits of Knowledge Transfer over Finite Domains | Unknown | N/A | |
| Active Test-Time Adaptation: Theoretical Analyses and An Algorithm | Unknown | N/A | |
| Quantifying and Enhancing Multi-modal Robustness with Modality Preference | Unknown | N/A | |
| RingAttention with Blockwise Transformers for Near-Infinite Context | Unknown | N/A | |
| A Statistical Analysis of Wasserstein Autoencoders for Intrinsically Low-dimensional Data | Unknown | N/A | |
| Improved Techniques for Training Consistency Models | Unknown | N/A | |
| Modeling Boundedly Rational Agents with Latent Inference Budgets | Unknown | N/A | |
| HYPO: Hyperspherical Out-Of-Distribution Generalization | Unknown | N/A | |
| On the Foundations of Shortcut Learning | Unknown | N/A | |
| Emergent Communication with Conversational Repair | Unknown | N/A | |
| SmartPlay: A Benchmark for LLMs as Intelligent Agents | Unknown | N/A | |
| A General Framework for User-Guided Bayesian Optimization | Unknown | N/A | |
| InstructScene: Instruction-Driven 3D Indoor Scene Synthesis with Semantic Graph Prior | Unknown | N/A | |
| HIFA: High-fidelity Text-to-3D Generation with Advanced Diffusion Guidance | Unknown | N/A | |
| Understanding Domain Generalization: A Noise Robustness Perspective | Unknown | N/A | |
| Can Transformers Capture Spatial Relations between Objects? | Unknown | N/A | |
| The LLM Surgeon | Unknown | N/A | |
| Causal-StoNet: Causal Inference for High-Dimensional Complex Data | Unknown | N/A | |
| Equivariant Scalar Fields for Molecular Docking with Fast Fourier Transforms | Unknown | N/A | |
| Belief-Enriched Pessimistic Q-Learning against Adversarial State Perturbations | Unknown | N/A | |
| Diffusion-TS: Interpretable Diffusion for General Time Series Generation | Unknown | N/A | |
| Why is SAM Robust to Label Noise? | Unknown | N/A | |
| An Efficient Tester-Learner for Halfspaces | Unknown | N/A | |
| Batch normalization is sufficient for universal function approximation in CNNs | Unknown | N/A | |
| Neural structure learning with stochastic differential equations | Unknown | N/A | |
| Predictive, scalable and interpretable knowledge tracing on structured domains | Unknown | N/A | |
| Imitation Learning from Observation with Automatic Discount Scheduling | Unknown | N/A | |
| Towards Seamless Adaptation of Pre-trained Models for Visual Place Recognition | Unknown | N/A | |
| ImageNet-OOD: Deciphering Modern Out-of-Distribution Detection Algorithms | Unknown | N/A | |
| Gen-Z: Generative Zero-Shot Text Classification with Contextualized Label Descriptions | Unknown | N/A | |
| GNNX-BENCH: Unravelling the Utility of Perturbation-based GNN Explainers through In-depth Benchmarking | Unknown | N/A | |
| A Benchmark Study on Calibration | Unknown | N/A | |
| BEND: Benchmarking DNA Language Models on Biologically Meaningful Tasks | Unknown | N/A | |
| Guaranteed Approximation Bounds for Mixed-Precision Neural Operators | Unknown | N/A | |
| Lifting Architectural Constraints of Injective Flows | Unknown | N/A | |
| Aux-NAS: Exploiting Auxiliary Labels with Negligibly Extra Inference Cost | Unknown | N/A | |
| Language Model Self-improvement by Reinforcement Learning Contemplation | Unknown | N/A | |
| Fast Updating Truncated SVD for Representation Learning with Sparse Matrices | Unknown | N/A | |
| SOInter: A Novel Deep Energy-Based Interpretation Method for Explaining Structured Output Models | Unknown | N/A | |
| Sparse Spiking Neural Network: Exploiting Heterogeneity in Timescales for Pruning Recurrent SNN | Unknown | N/A | |
| Rethinking Model Ensemble in Transfer-based Adversarial Attacks | Unknown | N/A | |
| Policy Rehearsing: Training Generalizable Policies for Reinforcement Learning | Unknown | N/A | |
| ModernTCN: A Modern Pure Convolution Structure for General Time Series Analysis | Unknown | N/A | |
| Prototypical Information Bottlenecking and Disentangling for Multimodal Cancer Survival Prediction | Unknown | N/A | |
| Attention-based Iterative Decomposition for Tensor Product Representation | Unknown | N/A | |
| On the Role of General Function Approximation in Offline Reinforcement Learning | Unknown | N/A | |
| Learning Delays in Spiking Neural Networks using Dilated Convolutions with Learnable Spacings | Unknown | N/A | |
| The Generative AI Paradox: “What It Can Create, It May Not Understand” | Unknown | N/A | |
| The Unlocking Spell on Base LLMs: Rethinking Alignment via In-Context Learning | Unknown | N/A | |
| Phenomenal Yet Puzzling: Testing Inductive Reasoning Capabilities of Language Models with Hypothesis Refinement | Unknown | N/A | |
| Evaluating Large Language Models at Evaluating Instruction Following | Unknown | N/A | |
| Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning | Unknown | N/A | |
| The Truth is in There: Improving Reasoning in Language Models with Layer-Selective Rank Reduction | Unknown | N/A | |
| Learning Grounded Action Abstractions from Language | Unknown | N/A | |
| SpikePoint: An Efficient Point-based Spiking Neural Network for Event Cameras Action Recognition | Unknown | N/A | |
| OmniQuant: Omnidirectionally Calibrated Quantization for Large Language Models | Unknown | N/A | |
| Scaling Laws for Sparsely-Connected Foundation Models | Unknown | N/A | |
| From Sparse to Soft Mixtures of Experts | Unknown | N/A | |
| iGraphMix: Input Graph Mixup Method for Node Classification | Unknown | N/A | |
| Retrieval-Enhanced Contrastive Vision-Text Models | Unknown | N/A | |
| Raidar: geneRative AI Detection viA Rewriting | Unknown | N/A | |
| Function Vectors in Large Language Models | Unknown | N/A | |
| Ring-A-Bell! How Reliable are Concept Removal Methods For Diffusion Models? | Unknown | N/A | |
| Fine-Tuning Enhances Existing Mechanisms: A Case Study on Entity Tracking | Unknown | N/A | |
| Unified Language-Vision Pretraining in LLM with Dynamic Discrete Visual Tokenization | Unknown | N/A | |
| A Policy Gradient Method for Confounded POMDPs | Unknown | N/A | |
| MUSTARD: Mastering Uniform Synthesis of Theorem and Proof Data | Unknown | N/A | |
| LEGO-Prover: Neural Theorem Proving with Growing Libraries | Unknown | N/A | |
| Thought Propagation: An Analogical Approach to Complex Reasoning with Large Language Models | Unknown | N/A | |
| GPT-4 Is Too Smart To Be Safe: Stealthy Chat with LLMs via Cipher | Unknown | N/A | |
| On the Humanity of Conversational AI: Evaluating the Psychological Portrayal of LLMs | Unknown | N/A | |
| Accurate Retraining-free Pruning for Pretrained Encoder-based Language Models | Unknown | N/A | |
| INViTE: INterpret and Control Vision-Language Models with Text Explanations | Unknown | N/A | |
| Effective pruning of web-scale datasets based on complexity of concept clusters | Unknown | N/A | |
| Sin3DM: Learning a Diffusion Model from a Single 3D Textured Shape | Unknown | N/A | |
| Remote Sensing Vision-Language Foundation Models without Annotations via Ground Remote Alignment | Unknown | N/A | |
| DiffEnc: Variational Diffusion with a Learned Encoder | Unknown | N/A | |
| GIM: Learning Generalizable Image Matcher From Internet Videos | Unknown | N/A | |
| DyVal: Dynamic Evaluation of Large Language Models for Reasoning Tasks | Unknown | N/A | |
| The Unreasonable Effectiveness of Linear Prediction as a Perceptual Metric | Unknown | N/A | |
| FFB: A Fair Fairness Benchmark for In-Processing Group Fairness Methods | Unknown | N/A | |
| SAN: Inducing Metrizability of GAN with Discriminative Normalized Linear Layer | Unknown | N/A | |
| Large Multilingual Models Pivot Zero-Shot Multimodal Learning across Languages | Unknown | N/A | |
| Learning to Relax: Setting Solver Parameters Across a Sequence of Linear System Instances | Unknown | N/A | |
| Self-supervised Pocket Pretraining via Protein Fragment-Surroundings Alignment | Unknown | N/A | |
| Solving Diffusion ODEs with Optimal Boundary Conditions for Better Image Super-Resolution | Unknown | N/A | |
| Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning | Unknown | N/A | |
| Rayleigh Quotient Graph Neural Networks for Graph-level Anomaly Detection | Unknown | N/A | |
| DQ-LoRe: Dual Queries with Low Rank Approximation Re-ranking for In-Context Learning | Unknown | N/A | |
| Large Language Models Are Not Robust Multiple Choice Selectors | Unknown | N/A | |
| MetaGPT: Meta Programming for A Multi-Agent Collaborative Framework | Unknown | N/A | |
| In-context Autoencoder for Context Compression in a Large Language Model | Unknown | N/A | |
| GenCorres: Consistent Shape Matching via Coupled Implicit-Explicit Shape Generative Models | Unknown | N/A | |
| Hard-Constrained Deep Learning for Climate Downscaling | Unknown | N/A | |
| PB-LLM: Partially Binarized Large Language Models | Unknown | N/A | |
| Information Bottleneck Analysis of Deep Neural Networks via Lossy Compression | Unknown | N/A | |
| ZipIt! Merging Models from Different Tasks without Training | Unknown | N/A | |
| Dynamic Sparse No Training: Training-Free Fine-tuning for Sparse LLMs | Unknown | N/A | |
| MIntRec2.0: A Large-scale Benchmark Dataset for Multimodal Intent Recognition and Out-of-scope Detection in Conversations | Unknown | N/A | |
| RLCD: Reinforcement Learning from Contrastive Distillation for LM Alignment | Unknown | N/A | |
| Localizing and Editing Knowledge In Text-to-Image Generative Models | Unknown | N/A | |
| Efficient Dynamics Modeling in Interactive Environments with Koopman Theory | Unknown | N/A | |
| Generative Learning for Financial Time Series with Irregular and Scale-Invariant Patterns | Unknown | N/A | |
| Linear attention is (maybe) all you need (to understand Transformer optimization) | Unknown | N/A | |
| Scalable Diffusion for Materials Generation | Unknown | N/A | |
| MG-TSD: Multi-Granularity Time Series Diffusion Models with Guided Learning Process | Unknown | N/A | |
| RT-Trajectory: Robotic Task Generalization via Hindsight Trajectory Sketches | Unknown | N/A | |
| Differentiable Euler Characteristic Transforms for Shape Classification | Unknown | N/A | |
| Simplicial Representation Learning with Neural $k$-Forms | Unknown | N/A | |
| HAZARD Challenge: Embodied Decision Making in Dynamically Changing Environments | Unknown | N/A | |
| Efficient Multi-agent Reinforcement Learning by Planning | Unknown | N/A | |
| DSPy: Compiling Declarative Language Model Calls into State-of-the-Art Pipelines | Unknown | N/A | |
| On the Stability of Iterative Retraining of Generative Models on their own Data | Unknown | N/A | |
| A Study of Bayesian Neural Network Surrogates for Bayesian Optimization | Unknown | N/A | |
| Fine-Tuned Language Models Generate Stable Inorganic Materials as Text | Unknown | N/A | |
| BooookScore: A systematic exploration of book-length summarization in the era of LLMs | Unknown | N/A | |
| Prediction Error-based Classification for Class-Incremental Learning | Unknown | N/A | |
| Deep Geodesic Canonical Correlation Analysis for Covariance-Based Neuroimaging Data | Unknown | N/A | |
| Beyond Weisfeiler-Lehman: A Quantitative Framework for GNN Expressiveness | Unknown | N/A | |
| The Need for Speed: Pruning Transformers with One Recipe | Unknown | N/A | |
| Adapting to Distribution Shift by Visual Domain Prompt Generation | Unknown | N/A | |
| ImplicitSLIM and How it Improves Embedding-based Collaborative Filtering | Unknown | N/A |
ICML 2020
| Title | Author | PDF_Link | Code_URL |
|---|---|---|---|
| Confidence Sets and Hypothesis Testing in a Likelihood-Free Inference Setting | Unknown | N/A | |
| Learning for Dose Allocation in Adaptive Clinical Trials with Safety Constraints | Unknown | N/A | |
| Restarted Bayesian Online Change-point Detector achieves Optimal Detection Delay | Unknown | N/A | |
| The Usual Suspects? Reassessing Blame for VAE Posterior Collapse | Unknown | N/A | |
| A Markov Decision Process Model for Socio-Economic Systems Impacted by Climate Change | Unknown | N/A | |
| Low-Variance and Zero-Variance Baselines for Extensive-Form Games | Unknown | N/A | |
| Multi-Task Learning with User Preferences: Gradient Descent with Controlled Ascent in Pareto Optimization | Unknown | N/A | |
| Sequence Generation with Mixed Representations | Unknown | N/A | |
| Rate-distortion optimization guided autoencoder for isometric embedding in Euclidean latent space | Unknown | N/A | |
| Student Specialization in Deep Rectified Networks With Finite Width and Input Dimension | Unknown | N/A | |
| Scalable Deep Generative Modeling for Sparse Graphs | Unknown | N/A | |
| Efficient nonparametric statistical inference on population feature importance using Shapley values | Unknown | N/A | |
| Can Autonomous Vehicles Identify, Recover From, and Adapt to Distribution Shifts? | Unknown | N/A | |
| Efficient and Scalable Bayesian Neural Nets with Rank-1 Factors | Unknown | N/A | |
| A Mean Field Analysis Of Deep ResNet And Beyond: Towards Provably Optimization Via Overparameterization From Depth | Unknown | N/A | |
| PowerNorm: Rethinking Batch Normalization in Transformers | Unknown | N/A | |
| It's Not What Machines Can Learn, It's What We Cannot Teach | Unknown | N/A | |
| Improving generalization by controlling label-noise information in neural network weights | Unknown | N/A | |
| Go Wide, Then Narrow: Efficient Training of Deep Thin Networks | Unknown | N/A | |
| Inverse Active Sensing: Modeling and Understanding Timely Decision-Making | Unknown | N/A | |
| Convergence of a Stochastic Gradient Method with Momentum for Non-Smooth Non-Convex Optimization | Unknown | N/A | |
| Sample Amplification: Increasing Dataset Size even when Learning is Impossible | Unknown | N/A | |
| Hypernetwork approach to generating point clouds | Unknown | N/A | |
| Convex Calibrated Surrogates for the Multi-Label F-Measure | Unknown | N/A | |
| Graph Optimal Transport for Cross-Domain Alignment | Unknown | N/A | |
| History-Gradient Aided Batch Size Adaptation for Variance Reduced Algorithms | Unknown | N/A | |
| Adversarial Filters of Dataset Biases | Unknown | N/A | |
| Unbiased Risk Estimators Can Mislead: A Case Study of Learning with Complementary Labels | Unknown | N/A | |
| BoXHED: Boosted eXact Hazard Estimator with Dynamic covariates | Unknown | N/A | |
| Constrained Markov Decision Processes via Backward Value Functions | Unknown | N/A | |
| Hierarchical Generation of Molecular Graphs using Structural Motifs | Unknown | N/A | |
| Semiparametric Nonlinear Bipartite Graph Representation Learning with Provable Guarantees | Unknown | N/A | |
| Responsive Safety in Reinforcement Learning by PID Lagrangian Methods | Unknown | N/A | |
| Informative Dropout for Robust Representation Learning: A Shape-bias Perspective | Unknown | N/A | |
| Robust One-Bit Recovery via ReLU Generative Networks: Near-Optimal Statistical Rate and Global Landscape Analysis | Unknown | N/A | |
| R2-B2: Recursive Reasoning-Based Bayesian Optimization for No-Regret Learning in Games | Unknown | N/A | |
| ConQUR: Mitigating Delusional Bias in Deep Q-Learning | Unknown | N/A | |
| Confidence-Aware Learning for Deep Neural Networks | Unknown | N/A | |
| Bandits for BMO Functions | Unknown | N/A | |
| Superpolynomial Lower Bounds for Learning One-Layer Neural Networks using Gradient Descent | Unknown | N/A | |
| Scaling up Hybrid Probabilistic Inference with Logical and Arithmetic Constraints via Message Passing | Unknown | N/A | |
| Accelerating Large-Scale Inference with Anisotropic Vector Quantization | Unknown | N/A | |
| Differentiating through the Fréchet Mean | Unknown | N/A | |
| Few-shot Relation Extraction via Bayesian Meta-learning on Relation Graphs | Unknown | N/A | |
| Smaller, more accurate regression forests using tree alternating optimization | Unknown | N/A | |
| Divide and Conquer: Leveraging Intermediate Feature Representations for Quantized Training of Neural Networks | Unknown | N/A | |
| All in the Exponential Family: Bregman Duality in Thermodynamic Variational Inference | Unknown | N/A | |
| Explainable and Discourse Topic-aware Neural Language Understanding | Unknown | N/A | |
| Stochastic Latent Residual Video Prediction | Unknown | N/A | |
| NetGAN without GAN: From Random Walks to Low-Rank Approximations | Unknown | N/A | |
| Extra-gradient with player sampling for faster convergence in n-player games | Unknown | N/A | |
| Generalization via Derandomization | Unknown | N/A | |
| Maximum Entropy Gain Exploration for Long Horizon Multi-goal Reinforcement Learning | Unknown | N/A | |
| Leveraging Frequency Analysis for Deep Fake Image Recognition | Unknown | N/A | |
| Linear bandits with Stochastic Delayed Feedback | Unknown | N/A | |
| Time Series Deconfounder: Estimating Treatment Effects over Time in the Presence of Hidden Confounders | Unknown | N/A | |
| WaveFlow: A Compact Flow-based Model for Raw Audio | Unknown | N/A | |
| Self-supervised Label Augmentation via Input Transformations | Unknown | N/A | |
| Unsupervised Discovery of Interpretable Directions in the GAN Latent Space | Unknown | N/A | |
| Interference and Generalization in Temporal Difference Learning | Unknown | N/A | |
| Invariant Rationalization | Unknown | N/A | |
| Accelerated Stochastic Gradient-free and Projection-free Methods | Unknown | N/A | |
| Bayesian Experimental Design for Implicit Models by Mutual Information Neural Estimation | Unknown | N/A | |
| Tuning-free Plug-and-Play Proximal Algorithm for Inverse Imaging Problems | Unknown | N/A | |
| Robust Learning with the Hilbert-Schmidt Independence Criterion | Unknown | N/A | |
| Can Stochastic Zeroth-Order Frank-Wolfe Method Converge Faster for Non-Convex Problems? | Unknown | N/A | |
| Inexact Tensor Methods with Dynamic Accuracies | Unknown | N/A | |
| Radioactive data: tracing through training | Unknown | N/A | |
| Fast Adaptation to New Environments via Policy-Dynamics Value Functions | Unknown | N/A | |
| Fast and Consistent Learning of Hidden Markov Models by Incorporating Non-Consecutive Correlations | Unknown | N/A | |
| Stochastically Dominant Distributional Reinforcement Learning | Unknown | N/A | |
| Beyond Signal Propagation: Is Feature Diversity Necessary in Deep Neural Network Initialization? | Unknown | N/A | |
| Optimal approximation for unconstrained non-submodular minimization | Unknown | N/A | |
| Nonparametric Score Estimators | Unknown | N/A | |
| Implicit Learning Dynamics in Stackelberg Games: Equilibria Characterization, Convergence Analysis, and Empirical Study | Unknown | N/A | |
| Agent57: Outperforming the Atari Human Benchmark | Unknown | N/A | |
| Monte-Carlo Tree Search as Regularized Policy Optimization | Unknown | N/A | |
| On the (In)tractability of Computing Normalizing Constants for the Product of Determinantal Point Processes | Unknown | N/A | |
| On Coresets for Regularized Regression | Unknown | N/A | |
| On the Expressivity of Neural Networks for Deep Reinforcement Learning | Unknown | N/A | |
| T-GD: Transferable GAN-generated Images Detection Framework | Unknown | N/A | |
| Optimally Solving Two-Agent Decentralized POMDPs Under One-Sided Information Sharing | Unknown | N/A | |
| How Good is the Bayes Posterior in Deep Neural Networks Really? | Unknown | N/A | |
| Fast and Private Submodular and $k$-Submodular Functions Maximization with Matroid Constraints | Unknown | N/A | |
| Learning Flat Latent Manifolds with VAEs | Unknown | N/A | |
| Online Dense Subgraph Discovery via Blurred-Graph Feedback | Unknown | N/A | |
| Linear Lower Bounds and Conditioning of Differentiable Games | Unknown | N/A | |
| Refined bounds for algorithm configuration: The knife-edge of dual class approximability | Unknown | N/A | |
| Optimal Randomized First-Order Methods for Least-Squares Problems | Unknown | N/A | |
| Convex Representation Learning for Generalized Invariance in Semi-Inner-Product Space | Unknown | N/A | |
| Fiedler Regularization: Learning Neural Networks with Graph Sparsity | Unknown | N/A | |
| The Intrinsic Robustness of Stochastic Bandits to Strategic Manipulation | Unknown | N/A | |
| Learning What to Defer for Maximum Independent Sets | Unknown | N/A | |
| NGBoost: Natural Gradient Boosting for Probabilistic Prediction | Unknown | N/A | |
| Perceptual Generative Autoencoders | Unknown | N/A | |
| Closed Loop Neural-Symbolic Learning via Integrating Neural Perception, Grammar Parsing, and Symbolic Reasoning | Unknown | N/A | |
| Private Query Release Assisted by Public Data | Unknown | N/A | |
| On Validation and Planning of An Optimal Decision Rule with Application in Healthcare Studies | Unknown | N/A | |
| Randomized Smoothing of All Shapes and Sizes | Unknown | N/A | |
| Dispersed Exponential Family Mixture VAEs for Interpretable Text Generation | Unknown | N/A | |
| Searching to Exploit Memorization Effect in Learning with Noisy Labels | Unknown | N/A | |
| Safe Deep Semi-Supervised Learning for Unseen-Class Unlabeled Data | Unknown | N/A | |
| Deep Graph Random Process for Relational-Thinking-Based Speech Recognition | Unknown | N/A | |
| Asynchronous Coagent Networks | Unknown | N/A | |
| Generalizing Convolutional Neural Networks for Equivariance to Lie Groups on Arbitrary Continuous Data | Unknown | N/A | |
| Single Point Transductive Prediction | Unknown | N/A | |
| Provable Self-Play Algorithms for Competitive Reinforcement Learning | Unknown | N/A | |
| Lookahead-Bounded Q-learning | Unknown | N/A | |
| Kernelized Stein Discrepancy Tests of Goodness-of-fit for Time-to-Event Data | Unknown | N/A | |
| Provable Representation Learning for Imitation Learning via Bi-level Optimization | Unknown | N/A | |
| What is Local Optimality in Nonconvex-Nonconcave Minimax Optimization? | Unknown | N/A | |
| Bandits with Adversarial Scaling | Unknown | N/A | |
| On the Relation between Quality-Diversity Evaluation and Distribution-Fitting Goal in Text Generation | Unknown | N/A | |
| LEEP: A New Measure to Evaluate Transferability of Learned Representations | Unknown | N/A | |
| Designing Optimal Dynamic Treatment Regimes: A Causal Reinforcement Learning Approach | Unknown | N/A | |
| Expert Learning through Generalized Inverse Multiobjective Optimization: Models, Insights, and Algorithms | Unknown | N/A | |
| Low-Rank Bottleneck in Multi-head Attention Models | Unknown | N/A | |
| Reward-Free Exploration for Reinforcement Learning | Unknown | N/A | |
| Upper bounds for Model-Free Row-Sparse Principal Component Analysis | Unknown | N/A | |
| Accelerating the diffusion-based ensemble sampling by non-reversible dynamics | Unknown | N/A | |
| Learning To Stop While Learning To Predict | Unknown | N/A | |
| p-Norm Flow Diffusion for Local Graph Clustering | Unknown | N/A | |
| Latent Bernoulli Autoencoder | Unknown | N/A | |
| Data Valuation using Reinforcement Learning | Unknown | N/A | |
| Being Bayesian, Even Just a Bit, Fixes Overconfidence in ReLU Networks | Unknown | N/A | |
| Disentangling Trainability and Generalization in Deep Neural Networks | Unknown | N/A | |
| SCAFFOLD: Stochastic Controlled Averaging for Federated Learning | Unknown | N/A | |
| Which Tasks Should Be Learned Together in Multi-task Learning? | Unknown | N/A | |
| Adversarial Risk via Optimal Transport and Optimal Couplings | Unknown | N/A | |
| Boosting for Control of Dynamical Systems | Unknown | N/A | |
| Lifted Disjoint Paths with Application in Multiple Object Tracking | Unknown | N/A | |
| Evolutionary Reinforcement Learning for Sample-Efficient Multiagent Coordination | Unknown | N/A | |
| Decoupled Greedy Learning of CNNs | Unknown | N/A | |
| Overfitting in adversarially robust deep learning | Unknown | N/A | |
| Second-Order Provable Defenses against Adversarial Attacks | Unknown | N/A | |
| Parameterized Rate-Distortion Stochastic Encoder | Unknown | N/A | |
| MoNet3D: Towards Accurate Monocular 3D Object Localization in Real Time | Unknown | N/A | |
| Learning Robot Skills with Temporal Variational Inference | Unknown | N/A | |
| Counterfactual Cross-Validation: Stable Model Selection Procedure for Causal Inference Models | Unknown | N/A | |
| Revisiting Spatial Invariance with Low-Rank Local Connectivity | Unknown | N/A | |
| Deep Reasoning Networks for Unsupervised Pattern De-mixing with Constraint Reasoning | Unknown | N/A | |
| Reinforcement Learning in Feature Space: Matrix Bandit, Kernels, and Regret Bound | Unknown | N/A | |
| Generalized and Scalable Optimal Sparse Decision Trees | Unknown | N/A | |
| SIGUA: Forgetting May Make Learning with Noisy Labels More Robust | Unknown | N/A | |
| Learning Discrete Structured Representations by Adversarially Maximizing Mutual Information | Unknown | N/A | |
| Tensor denoising and completion based on ordinal observations | Unknown | N/A | |
| Video Prediction via Example Guidance | Unknown | N/A | |
| Efficient Continuous Pareto Exploration in Multi-Task Learning | Unknown | N/A | |
| Efficient Policy Learning from Surrogate-Loss Classification Reductions | Unknown | N/A | |
| Minimax Weight and Q-Function Learning for Off-Policy Evaluation | Unknown | N/A | |
| On Efficient Low Distortion Ultrametric Embedding | Unknown | N/A | |
| Data preprocessing to mitigate bias: A maximum entropy based approach | Unknown | N/A | |
| Global Concavity and Optimization in a Class of Dynamic Discrete Choice Models | Unknown | N/A | |
| Can Increasing Input Dimensionality Improve Deep Reinforcement Learning? | Unknown | N/A | |
| Scalable Gaussian Process Separation for Kernels with a Non-Stationary Phase | Unknown | N/A | |
| Streaming k-Submodular Maximization under Noise subject to Size Constraint | Unknown | N/A | |
| CLUB: A Contrastive Log-ratio Upper Bound of Mutual Information | Unknown | N/A | |
| Small Data, Big Decisions: Model Selection in the Small-Data Regime | Unknown | N/A | |
| Distributionally Robust Policy Evaluation and Learning in Offline Contextual Bandits | Unknown | N/A | |
| An Accelerated DFO Algorithm for Finite-sum Convex Functions | Unknown | N/A | |
| Finding trainable sparse networks through Neural Tangent Transfer | Unknown | N/A | |
| Learning Quadratic Games on Networks | Unknown | N/A | |
| Reliable evaluation of adversarial robustness with an ensemble of diverse parameter-free attacks | Unknown | N/A | |
| PoWER-BERT: Accelerating BERT Inference via Progressive Word-vector Elimination | Unknown | N/A | |
| On the Sample Complexity of Adversarial Multi-Source PAC Learning | Unknown | N/A | |
| Super-efficiency of automatic differentiation for functions defined as a minimum | Unknown | N/A | |
| Scalable Identification of Partially Observed Systems with Certainty-Equivalent EM | Unknown | N/A | |
| Learning to Learn Kernels with Variational Random Features | Unknown | N/A | |
| A distributional view on multi-objective policy optimization | Unknown | N/A | |
| Learning Autoencoders with Relational Regularization | Unknown | N/A | |
| Bayesian Sparsification of Deep C-valued Networks | Unknown | N/A | |
| Neural Contextual Bandits with UCB-based Exploration | Unknown | N/A | |
| Train Big, Then Compress: Rethinking Model Size for Efficient Training and Inference of Transformers | Unknown | N/A | |
| Lorentz Group Equivariant Neural Network for Particle Physics | Unknown | N/A | |
| Interpolation between Residual and Non-Residual Networks | Unknown | N/A | |
| Collaborative Machine Learning with Incentive-Aware Model Rewards | Unknown | N/A | |
| Forecasting Sequential Data Using Consistent Koopman Autoencoders | Unknown | N/A | |
| The Performance Analysis of Generalized Margin Maximizers on Separable Data | Unknown | N/A | |
| Adversarial Attacks on Probabilistic Autoregressive Forecasting Models | Unknown | N/A | |
| From Chaos to Order: Symmetry and Conservation Laws in Game Dynamics | Unknown | N/A | |
| Training Binary Neural Networks using the Bayesian Learning Rule | Unknown | N/A | |
| Learning the piece-wise constant graph structure of a varying Ising model | Unknown | N/A | |
| Efficiently Solving MDPs with Stochastic Mirror Descent | Unknown | N/A | |
| Robust Graph Representation Learning via Neural Sparsification | Unknown | N/A | |
| Handling the Positive-Definite Constraint in the Bayesian Learning Rule | Unknown | N/A | |
| Interpretable, Multidimensional, Multimodal Anomaly Detection with Negative Sampling for Detection of Device Failure | Unknown | N/A | |
| Optimal Bounds between f-Divergences and Integral Probability Metrics | Unknown | N/A | |
| Likelihood-free MCMC with Amortized Approximate Ratio Estimators | Unknown | N/A | |
| Provably Convergent Two-Timescale Off-Policy Actor-Critic with Function Approximation | Unknown | N/A | |
| Certified Robustness to Label-Flipping Attacks via Randomized Smoothing | Unknown | N/A | |
| GradientDICE: Rethinking Generalized Offline Estimation of Stationary Values | Unknown | N/A | |
| An Investigation of Why Overparameterization Exacerbates Spurious Correlations | Unknown | N/A | |
| Double-Loop Unadjusted Langevin Algorithm | Unknown | N/A | |
| Provable guarantees for decision tree induction: the agnostic setting | Unknown | N/A | |
| Learning Optimal Tree Models under Beam Search | Unknown | N/A | |
| Attacks Which Do Not Kill Training Make Adversarial Learning Stronger | Unknown | N/A | |
| Towards Adaptive Residual Network Training: A Neural-ODE Perspective | Unknown | N/A | |
| Estimating Model Uncertainty of Neural Networks in Sparse Information Form | Unknown | N/A | |
| Is Local SGD Better than Minibatch SGD? | Unknown | N/A | |
| Estimating the Number and Effect Sizes of Non-null Hypotheses | Unknown | N/A | |
| From ImageNet to Image Classification: Contextualizing Progress on Benchmarks | Unknown | N/A | |
| Scalable and Efficient Comparison-based Search without Features | Unknown | N/A | |
| Identifying Statistical Bias in Dataset Replication | Unknown | N/A | |
| On Unbalanced Optimal Transport: An Analysis of Sinkhorn Algorithm | Unknown | N/A | |
| Learning and Sampling of Atomic Interventions from Observations | Unknown | N/A | |
| Reliable Fidelity and Diversity Metrics for Generative Models | Unknown | N/A | |
| Combining Differentiable PDE Solvers and Graph Neural Networks for Fluid Flow Prediction | Unknown | N/A | |
| Evolutionary Topology Search for Tensor Network Decomposition | Unknown | N/A | |
| Randomization matters. How to defend against strong adversarial attacks | Unknown | N/A | |
| Reinforcement Learning for Non-Stationary Markov Decision Processes: The Blessing of (More) Optimism | Unknown | N/A | |
| Bidirectional Model-based Policy Optimization | Unknown | N/A | |
| Learning to Rank Learning Curves | Unknown | N/A | |
| Understanding and Mitigating the Tradeoff between Robustness and Accuracy | Unknown | N/A | |
| Near Input Sparsity Time Kernel Embeddings via Adaptive Sampling | Unknown | N/A | |
| Non-autoregressive Machine Translation with Disentangled Context Transformer | Unknown | N/A | |
| Estimating Generalization under Distribution Shifts via Domain-Invariant Representations | Unknown | N/A | |
| Error-Bounded Correction of Noisy Labels | Unknown | N/A | |
| Population-Based Black-Box Optimization for Biological Sequence Design | Unknown | N/A | |
| Improved Optimistic Algorithms for Logistic Bandits | Unknown | N/A | |
| Control Frequency Adaptation via Action Persistence in Batch Reinforcement Learning | Unknown | N/A | |
| Schatten Norms in Matrix Streams: Hello Sparsity, Goodbye Dimension | Unknown | N/A | |
| Boosted Histogram Transform for Regression | Unknown | N/A | |
| Sample Complexity Bounds for 1-bit Compressive Sensing and Binary Stable Embeddings with Generative Priors | Unknown | N/A | |
| Knowing The What But Not The Where in Bayesian Optimization | Unknown | N/A | |
| Implicit Euler Skip Connections: Enhancing Adversarial Robustness via Numerical Stability | Unknown | N/A | |
| Black-Box Methods for Restoring Monotonicity | Unknown | N/A | |
| Multi-fidelity Bayesian Optimization with Max-value Entropy Search and its Parallelization | Unknown | N/A | |
| A Flexible Framework for Nonparametric Graphical Modeling that Accommodates Machine Learning | Unknown | N/A | |
| Adversarial Robustness via Runtime Masking and Cleansing | Unknown | N/A | |
| Proving the Lottery Ticket Hypothesis: Pruning is All You Need | Unknown | N/A | |
| Aggregation of Multiple Knockoffs | Unknown | N/A | |
| Multi-objective Bayesian Optimization using Pareto-frontier Entropy | Unknown | N/A | |
| On the Generalization Benefit of Noise in Stochastic Gradient Descent | Unknown | N/A | |
| Optimization Theory for ReLU Neural Networks Trained with Normalization Layers | Unknown | N/A | |
| Does label smoothing mitigate label noise? | Unknown | N/A | |
| Variational Bayesian Quantization | Unknown | N/A | |
| Non-Stationary Delayed Bandits with Intermediate Observations | Unknown | N/A | |
| Evaluating Machine Accuracy on ImageNet | Unknown | N/A | |
| Composable Sketches for Functions of Frequencies: Beyond the Worst Case | Unknown | N/A | |
| The Implicit and Explicit Regularization Effects of Dropout | Unknown | N/A | |
| Decision Trees for Decision-Making under the Predict-then-Optimize Framework | Unknown | N/A | |
| Adaptive Estimator Selection for Off-Policy Evaluation | Unknown | N/A | |
| Two Simple Ways to Learn Individual Fairness Metrics from Data | Unknown | N/A | |
| Kernel Methods for Cooperative Multi-Agent Contextual Bandits | Unknown | N/A | |
| On the Theoretical Properties of the Network Jackknife | Unknown | N/A | |
| Bisection-Based Pricing for Repeated Contextual Auctions against Strategic Buyer | Unknown | N/A | |
| Bayesian Graph Neural Networks with Adaptive Connection Sampling | Unknown | N/A | |
| On Breaking Deep Generative Model-based Defenses and Beyond | Unknown | N/A | |
| Beyond UCB: Optimal and Efficient Contextual Bandits with Regression Oracles | Unknown | N/A | |
| Representation Learning via Adversarially-Contrastive Optimal Transport | Unknown | N/A | |
| Thompson Sampling via Local Uncertainty | Unknown | N/A | |
| Meta Variance Transfer: Learning to Augment from the Others | Unknown | N/A | |
| Abstraction Mechanisms Predict Generalization in Deep Neural Networks | Unknown | N/A | |
| Coresets for Clustering in Graphs of Bounded Treewidth | Unknown | N/A | |
| Learning the Valuations of a $k$-demand Agent | Unknown | N/A | |
| Graph Homomorphism Convolution | Unknown | N/A | |
| Bounding the fairness and accuracy of classifiers from population statistics | Unknown | N/A | |
| Distribution Augmentation for Generative Modeling | Unknown | N/A | |
| Revisiting Fundamentals of Experience Replay | Unknown | N/A | |
| Haar Graph Pooling | Unknown | N/A | |
| Nested Subspace Arrangement for Representation of Relational Data | Unknown | N/A | |
| Deep Molecular Programming: A Natural Implementation of Binary-Weight ReLU Neural Networks | Unknown | N/A | |
| DINO: Distributed Newton-Type Optimization Method | Unknown | N/A | |
| FedBoost: A Communication-Efficient Algorithm for Federated Learning | Unknown | N/A | |
| Healing Products of Gaussian Process Experts | Unknown | N/A | |
| PDO-eConvs: Partial Differential Operator Based Equivariant Convolutions | Unknown | N/A | |
| Online Bayesian Moment Matching based SAT Solver Heuristics | Unknown | N/A | |
| Robust and Stable Black Box Explanations | Unknown | N/A | |
| Interpretable Off-Policy Evaluation in Reinforcement Learning by Highlighting Influential Transitions | Unknown | N/A | |
| Implicit Geometric Regularization for Learning Shapes | Unknown | N/A | |
| On conditional versus marginal bias in multi-armed bandits | Unknown | N/A | |
| Influence Diagram Bandits: Variational Thompson Sampling for Structured Bandit Problems | Unknown | N/A | |
| Loss Function Search for Face Recognition | Unknown | N/A | |
| Circuit-Based Intrinsic Methods to Detect Overfitting | Unknown | N/A | |
| Graph-based, Self-Supervised Program Repair from Diagnostic Feedback | Unknown | N/A | |
| Implicit competitive regularization in GANs | Unknown | N/A | |
| Computational and Statistical Tradeoffs in Inferring Combinatorial Structures of Ising Model | Unknown | N/A | |
| Inter-domain Deep Gaussian Processes | Unknown | N/A | |
| Mapping natural-language problems to formal-language solutions using structured neural representations | Unknown | N/A | |
| Estimating Q(s,s') with Deep Deterministic Dynamics Gradients | Unknown | N/A | |
| Source Separation with Deep Generative Priors | Unknown | N/A | |
| Non-Autoregressive Neural Text-to-Speech | Unknown | N/A | |
| DropNet: Reducing Neural Network Complexity via Iterative Pruning | Unknown | N/A | |
| The Neural Tangent Kernel in High Dimensions: Triple Descent and a Multi-Scale Theory of Generalization | Unknown | N/A | |
| Transformation of ReLU-based recurrent neural networks from discrete-time to continuous-time | Unknown | N/A | |
| Black-box Certification and Learning under Adversarial Perturbations | Unknown | N/A | |
| A Chance-Constrained Generative Framework for Sequence Optimization | Unknown | N/A | |
| On the Number of Linear Regions of Convolutional Neural Networks | Unknown | N/A | |
| Detecting Out-of-Distribution Examples with Gram Matrices | Unknown | N/A | |
| When deep denoising meets iterative phase retrieval | Unknown | N/A | |
| Predicting deliberative outcomes | Unknown | N/A | |
| Contrastive Multi-View Representation Learning on Graphs | Unknown | N/A | |
| On Variational Learning of Controllable Representations for Text without Supervision | Unknown | N/A | |
| Optimistic Bounds for Multi-output Learning | Unknown | N/A | |
| Multi-step Greedy Reinforcement Learning Algorithms | Unknown | N/A | |
| Amortised Learning by Wake-Sleep | Unknown | N/A | |
| Near-optimal sample complexity bounds for learning Latent $k$-polytopes and applications to Ad-Mixtures | Unknown | N/A | |
| Online Learning for Active Cache Synchronization | Unknown | N/A | |
| Time-aware Large Kernel Convolutions | Unknown | N/A | |
| Strength from Weakness: Fast Learning Using Weak Supervision | Unknown | N/A | |
| Gradient Temporal-Difference Learning with Regularized Corrections | Unknown | N/A | |
| Deep Streaming Label Learning | Unknown | N/A | |
| Learning to Branch for Multi-Task Learning | Unknown | N/A | |
| On Gradient Descent Ascent for Nonconvex-Concave Minimax Problems | Unknown | N/A | |
| NADS: Neural Architecture Distribution Search for Uncertainty Awareness | Unknown | N/A | |
| Improving Transformer Optimization Through Better Initialization | Unknown | N/A | |
| Learning and Evaluating Contextual Embedding of Source Code | Unknown | N/A | |
| Complexity of Finding Stationary Points of Nonconvex Nonsmooth Functions | Unknown | N/A | |
| Accountable Off-Policy Evaluation With Kernel Bellman Statistics | Unknown | N/A | |
| Learning Mixtures of Graphs from Epidemic Cascades | Unknown | N/A | |
| Do GANs always have Nash equilibria? | Unknown | N/A | |
| The Impact of Neural Network Overparameterization on Gradient Confusion and Stochastic Gradient Descent | Unknown | N/A | |
| Improving the Gating Mechanism of Recurrent Neural Networks | Unknown | N/A | |
| Parameter-free, Dynamic, and Strongly-Adaptive Online Learning | Unknown | N/A | |
| Hierarchical Verification for Adversarial Robustness | Unknown | N/A | |
| From Sets to Multisets: Provable Variational Inference for Probabilistic Integer Submodular Models | Unknown | N/A | |
| BINOCULARS for efficient, nonmyopic sequential experimental design | Unknown | N/A | |
| Stochastic Frank-Wolfe for Constrained Finite-Sum Minimization | Unknown | N/A | |
| Empirical Study of the Benefits of Overparameterization in Learning Latent Variable Models | Unknown | N/A | |
| Near-linear time Gaussian process optimization with adaptive batching and resparsification | Unknown | N/A | |
| Learning with Bounded Instance- and Label-dependent Label Noise | Unknown | N/A | |
| Temporal Phenotyping using Deep Predictive Clustering of Disease Progression | Unknown | N/A | |
| How recurrent networks implement contextual processing in sentiment analysis | Unknown | N/A | |
| Selective Dyna-style Planning Under Limited Model Capacity | Unknown | N/A | |
| Zeno++: Robust Fully Asynchronous SGD | Unknown | N/A | |
| Time-Consistent Self-Supervision for Semi-Supervised Learning | Unknown | N/A | |
| Spectral Graph Matching and Regularized Quadratic Relaxations: Algorithm and Theory | Unknown | N/A | |
| Training Deep Energy-Based Models with f-Divergence Minimization | Unknown | N/A | |
| On the consistency of top-k surrogate losses | Unknown | N/A | |
| Breaking the Curse of Many Agents: Provable Mean Embedding Q-Iteration for Mean-Field Reinforcement Learning | Unknown | N/A | |
| Representations for Stable Off-Policy Reinforcement Learning | Unknown | N/A | |
| Transparency Promotion with Model-Agnostic Linear Competitors | Unknown | N/A | |
| StochasticRank: Global Optimization of Scale-Free Discrete Functions | Unknown | N/A | |
| Provable Smoothness Guarantees for Black-Box Variational Inference | Unknown | N/A | |
| Boosting Deep Neural Network Efficiency with Dual-Module Inference | Unknown | N/A | |
| Adversarial Attacks on Copyright Detection Systems | Unknown | N/A | |
| Countering Language Drift with Seeded Iterated Learning | Unknown | N/A | |
| Compressive sensing with un-trained neural networks: Gradient descent finds a smooth approximation | Unknown | N/A | |
| Latent Variable Modelling with Hyperbolic Normalizing Flows | Unknown | N/A | |
| Mutual Transfer Learning for Massive Data | Unknown | N/A | |
| A general recurrent state space framework for modeling neural dynamics during decision-making | Unknown | N/A | |
| PackIt: A Virtual Environment for Geometric Planning | Unknown | N/A | |
| Representing Unordered Data Using Complex-Weighted Multiset Automata | Unknown | N/A | |
| The Differentiable Cross-Entropy Method | Unknown | N/A | |
| Domain Adaptive Imitation Learning | Unknown | N/A | |
| Generalization to New Actions in Reinforcement Learning | Unknown | N/A | |
| Better depth-width trade-offs for neural networks through the lens of dynamical systems | Unknown | N/A | |
| Stochastic Coordinate Minimization with Progressive Precision for Stochastic Convex Optimization | Unknown | N/A | |
| Convolutional Kernel Networks for Graph-Structured Data | Unknown | N/A | |
| Learning the Stein Discrepancy for Training and Evaluating Energy-Based Models without Sampling | Unknown | N/A | |
| Bridging the Gap Between f-GANs and Wasserstein GANs | Unknown | N/A | |
| Learning with Good Feature Representations in Bandits and in RL with a Generative Model | Unknown | N/A | |
| Correlation Clustering with Asymmetric Classification Errors | Unknown | N/A | |
| Learning Similarity Metrics for Numerical Simulations | Unknown | N/A | |
| AR-DAE: Towards Unbiased Neural Entropy Gradient Estimation | Unknown | N/A | |
| Sparsified Linear Programming for Zero-Sum Equilibrium Finding | Unknown | N/A | |
| Aligned Cross Entropy for Non-Autoregressive Machine Translation | Unknown | N/A | |
| Supervised Quantile Normalization for Low Rank Matrix Factorization | Unknown | N/A | |
| Adversarial Nonnegative Matrix Factorization | Unknown | N/A | |
| Multigrid Neural Memory | Unknown | N/A | |
| Adaptive Sampling for Estimating Probability Distributions | Unknown | N/A | |
| Cautious Adaptation For Reinforcement Learning in Safety-Critical Settings | Unknown | N/A | |
| Tails of Lipschitz Triangular Flows | Unknown | N/A | |
| Inductive Relation Prediction by Subgraph Reasoning | Unknown | N/A | |
| Thompson Sampling Algorithms for Mean-Variance Bandits | Unknown | N/A | |
| Operation-Aware Soft Channel Pruning using Differentiable Masks | Unknown | N/A | |
| Boosting Frank-Wolfe by Chasing Gradients | Unknown | N/A | |
| Stochastic Regret Minimization in Extensive-Form Games | Unknown | N/A | |
| On hyperparameter tuning in general clustering problems | Unknown | N/A | |
| Simultaneous Inference for Massive Data: Distributed Bootstrap | Unknown | N/A | |
| AutoML-Zero: Evolving Machine Learning Algorithms From Scratch | Unknown | N/A | |
| Continuous Time Bayesian Networks with Clocks | Unknown | N/A | |
| T-Basis: a Compact Representation for Neural Networks | Unknown | N/A | |
| Communication-Efficient Distributed Stochastic AUC Maximization with Deep Neural Networks | Unknown | N/A | |
| Evaluating Lossy Compression Rates of Deep Generative Models | Unknown | N/A | |
| How to Train Your Neural ODE: the World of Jacobian and Kinetic Regularization | Unknown | N/A | |
| Extreme Multi-label Classification from Aggregated Labels | Unknown | N/A | |
| Familywise Error Rate Control by Interactive Unmasking | Unknown | N/A | |
| Optimizer Benchmarking Needs to Account for Hyperparameter Tuning | Unknown | N/A | |
| Unraveling Meta-Learning: Understanding Feature Representations for Few-Shot Tasks | Unknown | N/A | |
| CoMic: Complementary Task Learning & Mimicry for Reusable Skills | Unknown | N/A | |
| Implicit differentiation of Lasso-type models for hyperparameter optimization | Unknown | N/A | |
| Revisiting Training Strategies and Generalization Performance in Deep Metric Learning | Unknown | N/A | |
| On Efficient Constructions of Checkpoints | Unknown | N/A | |
| Random Hypervolume Scalarizations for Provable Multi-Objective Black Box Optimization | Unknown | N/A | |
| Self-Modulating Nonparametric Event-Tensor Factorization | Unknown | N/A | |
| Bio-Inspired Hashing for Unsupervised Similarity Search | Unknown | N/A | |
| Full Law Identification in Graphical Models of Missing Data: Completeness Results | Unknown | N/A | |
| Self-Attentive Associative Memory | Unknown | N/A | |
| Hallucinative Topological Memory for Zero-Shot Visual Planning | Unknown | N/A | |
| MetaFun: Meta-Learning with Iterative Functional Updates | Unknown | N/A | |
| Improving Generative Imagination in Object-Centric World Models | Unknown | N/A | |
| Sequential Transfer in Reinforcement Learning with a Generative Model | Unknown | N/A | |
| VideoOneNet: Bidirectional Convolutional Recurrent OneNet with Trainable Data Steps for Video Processing | Unknown | N/A | |
| Fine-Grained Analysis of Stability and Generalization for Stochastic Gradient Descent | Unknown | N/A | |
| Feature Quantization Improves GAN Training | Unknown | N/A | |
| Amortized Finite Element Analysis for Fast PDE-Constrained Optimization | Unknown | N/A | |
| Peer Loss Functions: Learning from Noisy Labels without Knowing Noise Rates | Unknown | N/A | |
| Temporal Logic Point Processes | Unknown | N/A | |
| Rank Aggregation from Pairwise Comparisons in the Presence of Adversarial Corruptions | Unknown | N/A | |
| Optimizing Data Usage via Differentiable Rewards | Unknown | N/A | |
| Finite-Time Convergence in Continuous-Time Optimization | Unknown | N/A | |
| Estimation of Bounds on Potential Outcomes For Decision Making | Unknown | N/A | |
| Undirected Graphical Models as Approximate Posteriors | Unknown | N/A | |
| Deep Gaussian Markov Random Fields | Unknown | N/A | |
| Adaptive Reward-Poisoning Attacks against Reinforcement Learning | Unknown | N/A | |
| Adversarial Neural Pruning with Latent Vulnerability Suppression | Unknown | N/A | |
| Online Control of the False Coverage Rate and False Sign Rate | Unknown | N/A | |
| Stronger and Faster Wasserstein Adversarial Attacks | Unknown | N/A | |
| Dynamics of Deep Neural Networks and Neural Tangent Hierarchy | Unknown | N/A | |
| Planning to Explore via Self-Supervised World Models | Unknown | N/A | |
| Measuring Non-Expert Comprehension of Machine Learning Fairness Metrics | Unknown | N/A | |
| Multinomial Logit Bandit with Low Switching Cost | Unknown | N/A | |
| Beyond Synthetic Noise: Deep Learning on Controlled Noisy Labels | Unknown | N/A | |
| Task-Oriented Active Perception and Planning in Environments with Partially Known Semantics | Unknown | N/A | |
| Defense Through Diverse Directions | Unknown | N/A | |
| Min-Max Optimization without Gradients: Convergence and Applications to Black-Box Evasion and Poisoning Attacks | Unknown | N/A | |
| Neural Architecture Search in A Proxy Validation Loss Landscape | Unknown | N/A | |
| AutoGAN-Distiller: Searching to Compress Generative Adversarial Networks | Unknown | N/A | |
| Multilinear Latent Conditioning for Generating Unseen Attribute Combinations | Unknown | N/A | |
| One Policy to Control Them All: Shared Modular Policies for Agent-Agnostic Control | Unknown | N/A | |
| Sparse Sinkhorn Attention | Unknown | N/A | |
| Feature Noise Induces Loss Discrepancy Across Groups | Unknown | N/A | |
| Oracle Efficient Private Non-Convex Optimization | Unknown | N/A | |
| Rigging the Lottery: Making All Tickets Winners | Unknown | N/A | |
| Minimax-Optimal Off-Policy Evaluation with Linear Function Approximation | Unknown | N/A | |
| Inducing and Exploiting Activation Sparsity for Fast Inference on Deep Neural Networks | Unknown | N/A | |
| Improving Molecular Design by Stochastic Iterative Target Augmentation | Unknown | N/A | |
| FetchSGD: Communication-Efficient Federated Learning with Sketching | Unknown | N/A | |
| Two Routes to Scalable Credit Assignment without Weight Symmetry | Unknown | N/A | |
| A Pairwise Fair and Community-preserving Approach to k-Center Clustering | Unknown | N/A | |
| Identifying the Reward Function by Anchor Actions | Unknown | N/A | |
| Probing Emergent Semantics in Predictive Agents via Question Answering | Unknown | N/A | |
| Conditional gradient methods for stochastically constrained convex minimization | Unknown | N/A | |
| Infinite attention: NNGP and NTK for deep attention networks | Unknown | N/A | |
| LP-SparseMAP: Differentiable Relaxed Optimization for Sparse Structured Prediction | Unknown | N/A | |
| ControlVAE: Controllable Variational Autoencoder | Unknown | N/A | |
| Accelerated Message Passing for Entropy-Regularized MAP Inference | Unknown | N/A | |
| Generative Teaching Networks: Accelerating Neural Architecture Search by Learning to Generate Synthetic Training Data | Unknown | N/A | |
| Leveraging Procedural Generation to Benchmark Reinforcement Learning | Unknown | N/A | |
| Up or Down? Adaptive Rounding for Post-Training Quantization | Unknown | N/A | |
| Meta-learning for Mixed Linear Regression | Unknown | N/A | |
| A Sample Complexity Separation between Non-Convex and Convex Meta-Learning | Unknown | N/A | |
| Variance Reduction and Quasi-Newton for Particle-Based Variational Inference | Unknown | N/A | |
| Individual Fairness for k-Clustering | Unknown | N/A | |
| Predictive Multiplicity in Classification | Unknown | N/A | |
| A Finite-Time Analysis of Q-Learning with Neural Network Function Approximation | Unknown | N/A | |
| Neuro-Symbolic Visual Reasoning: Disentangling "Visual" from "Reasoning" | Unknown | N/A | |
| Supervised learning: no loss no cry | Unknown | N/A | |
| Analytic Marching: An Analytic Meshing Solution from Deep Implicit Surface Networks | Unknown | N/A | |
| From Local SGD to Local Fixed-Point Methods for Federated Learning | Unknown | N/A | |
| A Simple Framework for Contrastive Learning of Visual Representations | Unknown | N/A | |
| Spectral Subsampling MCMC for Stationary Time Series | Unknown | N/A | |
| Efficient Proximal Mapping of the 1-path-norm of Shallow Networks | Unknown | N/A | |
| Learning with Feature and Distribution Evolvable Streams | Unknown | N/A | |
| State Space Expectation Propagation: Efficient Inference Schemes for Temporal Gaussian Processes | Unknown | N/A | |
| Angular Visual Hardness | Unknown | N/A | |
| Black-Box Variational Inference as a Parametric Approximation to Langevin Dynamics | Unknown | N/A | |
| Reducing Sampling Error in Batch Temporal Difference Learning | Unknown | N/A | |
| “Other-Play” for Zero-Shot Coordination | Unknown | N/A | |
| Efficiently Learning Adversarially Robust Halfspaces with Noise | Unknown | N/A | |
| Progressive Identification of True Labels for Partial-Label Learning | Unknown | N/A | |
| Subspace Fitting Meets Regression: The Effects of Supervision and Orthonormality Constraints on Double Descent of Generalization Errors | Unknown | N/A | |
| An Optimistic Perspective on Offline Deep Reinforcement Learning | Unknown | N/A | |
| Word-Level Speech Recognition With a Letter to Word Encoder | Unknown | N/A | |
| Spectral Frank-Wolfe Algorithm: Strict Complementarity and Linear Convergence | Unknown | N/A | |
| Sharp Composition Bounds for Gaussian Differential Privacy via Edgeworth Expansion | Unknown | N/A | |
| Multi-Agent Routing Value Iteration Network | Unknown | N/A | |
| Inferring DQN structure for high-dimensional continuous control | Unknown | N/A | |
| Distributed Online Optimization over a Heterogeneous Network | Unknown | N/A | |
| Fair Learning with Private Demographic Data | Unknown | N/A | |
| Predictive Sampling with Forecasting Autoregressive Models | Unknown | N/A | |
| On Learning Sets of Symmetric Elements | Unknown | N/A | |
| DessiLBI: Exploring Structural Sparsity of Deep Networks via Differential Inclusion Paths | Unknown | N/A | |
| Differentiable Product Quantization for End-to-End Embedding Compression | Unknown | N/A | |
| Doubly robust off-policy evaluation with shrinkage | Unknown | N/A | |
| Learning to Combine Top-Down and Bottom-Up Signals in Recurrent Neural Networks with Attention over Modules | Unknown | N/A | |
| Description Based Text Classification with Reinforcement Learning | Unknown | N/A | |
| Meta-Learning with Shared Amortized Variational Inference | Unknown | N/A | |
| Neural Clustering Processes | Unknown | N/A | |
| Growing Action Spaces | Unknown | N/A | |
| On a projective ensemble approach to two sample test for equality of distributions | Unknown | N/A | |
| Estimating the Error of Randomized Newton Methods: A Bootstrap Approach | Unknown | N/A | |
| Adding seemingly uninformative labels helps in low data regimes | Unknown | N/A | |
| Low-loss connection of weight vectors: distribution-based approaches | Unknown | N/A | |
| Optimal Differential Privacy Composition for Exponential Mechanisms | Unknown | N/A | |
| A Flexible Latent Space Model for Multilayer Networks | Unknown | N/A | |
| Gradient-free Online Learning in Continuous Games with Delayed Rewards | Unknown | N/A | |
| Working Memory Graphs | Unknown | N/A | |
| Bayesian Optimisation over Multiple Continuous and Categorical Inputs | Unknown | N/A | |
| A Generative Model for Molecular Distance Geometry | Unknown | N/A | |
| Transfer Learning without Knowing: Reprogramming Black-box Machine Learning Models with Scarce Data and Limited Resources | Unknown | N/A | |
| Interpreting Robust Optimization via Adversarial Influence Functions | Unknown | N/A | |
| Lower Complexity Bounds for Finite-Sum Convex-Concave Minimax Optimization Problems | Unknown | N/A | |
| Towards Understanding the Dynamics of the First-Order Adversaries | Unknown | N/A | |
| An Imitation Learning Approach for Cache Replacement | Unknown | N/A | |
| Option Discovery in the Absence of Rewards with Manifold Analysis | Unknown | N/A | |
| Learning Selection Strategies in Buchberger’s Algorithm | Unknown | N/A | |
| Adversarial Robustness Against the Union of Multiple Perturbation Models | Unknown | N/A | |
| Generative Adversarial Imitation Learning with Neural Network Parameterization: Global Optimality and Convergence Rate | Unknown | N/A | |
| (Locally) Differentially Private Combinatorial Semi-Bandits | Unknown | N/A | |
| A Game Theoretic Framework for Model Based Reinforcement Learning | Unknown | N/A | |
| Streaming Coresets for Symmetric Tensor Factorization | Unknown | N/A | |
| Batch Stationary Distribution Estimation | Unknown | N/A | |
| The FAST Algorithm for Submodular Maximization | Unknown | N/A | |
| Context-aware Dynamics Model for Generalization in Model-Based Reinforcement Learning | Unknown | N/A | |
| Stochastic Optimization for Non-convex Inf-Projection Problems | Unknown | N/A | |
| Retrieval Augmented Language Model Pre-Training | Unknown | N/A | |
| When Does Self-Supervision Help Graph Convolutional Networks? | Unknown | N/A | |
| A Tree-Structured Decoder for Image-to-Markup Generation | Unknown | N/A | |
| Neural Network Control Policy Verification With Persistent Adversarial Perturbation | Unknown | N/A | |
| Normalized Loss Functions for Deep Learning with Noisy Labels | Unknown | N/A | |
| k-means++: few more steps yield constant approximation | Unknown | N/A | |
| Polynomial Tensor Sketch for Element-wise Function of Low-Rank Matrix | Unknown | N/A | |
| Let's Agree to Agree: Neural Networks Share Classification Order on Real Datasets | Unknown | N/A | |
| Stochastic Subspace Cubic Newton Method | Unknown | N/A | |
| Bayesian Learning from Sequential Data using Gaussian Processes with Signature Covariances | Unknown | N/A | |
| UniLMv2: Pseudo-Masked Language Models for Unified Language Model Pre-Training | Unknown | N/A | |
| Variational Inference for Sequential Data with Future Likelihood Estimates | Unknown | N/A | |
| On the Noisy Gradient Descent that Generalizes as SGD | Unknown | N/A | |
| Sub-Goal Trees – a Framework for Goal-Based Reinforcement Learning | Unknown | N/A | |
| On Contrastive Learning for Likelihood-free Inference | Unknown | N/A | |
| Variational Autoencoders with Riemannian Brownian Motion Priors | Unknown | N/A | |
| Distinguishing Cause from Effect Using Quantiles: Bivariate Quantile Causal Discovery | Unknown | N/A | |
| How to Solve Fair k-Center in Massive Data Models | Unknown | N/A | |
| Batch Reinforcement Learning with Hyperparameter Gradients | Unknown | N/A | |
| A Geometric Approach to Archetypal Analysis via Sparse Projections | Unknown | N/A | |
| Dynamic Knapsack Optimization Towards Efficient Multi-Channel Sequential Advertising | Unknown | N/A | |
| Sets Clustering | Unknown | N/A | |
| Data-Efficient Image Recognition with Contrastive Predictive Coding | Unknown | N/A | |
| Online Convex Optimization in the Random Order Model | Unknown | N/A | |
| Normalizing Flows on Tori and Spheres | Unknown | N/A | |
| Causal Effect Estimation and Optimal Dose Suggestions in Mobile Health | Unknown | N/A | |
| Dual-Path Distillation: A Unified Framework to Improve Black-Box Attacks | Unknown | N/A | |
| Fairwashing explanations with off-manifold detergent | Unknown | N/A | |
| More Data Can Expand The Generalization Gap Between Adversarially Robust and Standard Models | Unknown | N/A | |
| Implicit Generative Modeling for Efficient Exploration | Unknown | N/A | |
| Double Reinforcement Learning for Efficient and Robust Off-Policy Evaluation | Unknown | N/A | |
| Relaxing Bijectivity Constraints with Continuously Indexed Normalising Flows | Unknown | N/A | |
| Streaming Submodular Maximization under a k-Set System Constraint | Unknown | N/A | |
| Naive Exploration is Optimal for Online LQR | Unknown | N/A | |
| Adversarial Learning Guarantees for Linear Hypotheses and Neural Networks | Unknown | N/A | |
| The Cost-free Nature of Optimally Tuning Tikhonov Regularizers and Other Ordered Smoothers | Unknown | N/A | |
| Inductive-bias-driven Reinforcement Learning For Efficient Schedules in Heterogeneous Clusters | Unknown | N/A | |
| The Boomerang Sampler | Unknown | N/A | |
| Weakly-Supervised Disentanglement Without Compromises | Unknown | N/A | |
| LazyIter: A Fast Algorithm for Counting Markov Equivalent DAGs and Designing Experiments | Unknown | N/A | |
| Goodness-of-Fit Tests for Inhomogeneous Random Graphs | Unknown | N/A | |
| Towards Understanding the Regularization of Adversarial Robustness on Neural Networks | Unknown | N/A | |
| Neural Datalog Through Time: Informed Temporal Modeling via Logical Specification | Unknown | N/A | |
| Safe Reinforcement Learning in Constrained Markov Decision Processes | Unknown | N/A | |
| Reinforcement Learning for Integer Programming: Learning to Cut | Unknown | N/A | |
| Bootstrap Latent-Predictive Representations for Multitask Reinforcement Learning | Unknown | N/A | |
| Fast and Three-rious: Speeding Up Weak Supervision with Triplet Methods | Unknown | N/A | |
| Online Multi-Kernel Learning with Graph-Structured Feedback | Unknown | N/A | |
| A Swiss Army Knife for Minimax Optimal Transport | Unknown | N/A | |
| Deep Reinforcement Learning with Smooth Policy | Unknown | N/A | |
| Tightening Exploration in Upper Confidence Reinforcement Learning | Unknown | N/A | |
| Einsum Networks: Fast and Scalable Learning of Tractable Probabilistic Circuits | Unknown | N/A | |
| Fast computation of Nash Equilibria in Imperfect Information Games | Unknown | N/A | |
| The k-tied Normal Distribution: A Compact Parameterization of Gaussian Mean Field Posteriors in Bayesian Neural Networks | Unknown | N/A | |
| Learning disconnected manifolds: a no GAN's land | Unknown | N/A | |
| Low Bias Low Variance Gradient Estimates for Hierarchical Boolean Stochastic Networks | Unknown | N/A | |
| Optimizing for the Future in Non-Stationary MDPs | Unknown | N/A | |
| Explainable k-Means and k-Medians Clustering | Unknown | N/A | |
| Goal-Aware Prediction: Learning to Model What Matters | Unknown | N/A | |
| Laplacian Regularized Few-Shot Learning | Unknown | N/A | |
| Learning Compound Tasks without Task-specific Knowledge via Imitation and Self-supervised Learning | Unknown | N/A | |
| What can I do here? A Theory of Affordances in Reinforcement Learning | Unknown | N/A | |
| Automatic Shortcut Removal for Self-Supervised Representation Learning | Unknown | N/A | |
| Simple and sharp analysis of k-means\|\| | Unknown | N/A | |
| Sharp Statistical Guarantees for Adversarially Robust Gaussian Classification | Unknown | N/A | |
| Characterizing Distribution Equivalence and Structure Learning for Cyclic and Acyclic Directed Graphs | Unknown | N/A | |
| Discriminative Adversarial Search for Abstractive Summarization | Unknown | N/A | |
| Structured Linear Contextual Bandits: A Sharp and Geometric Smoothed Analysis | Unknown | N/A | |
| Curse of Dimensionality on Randomized Smoothing for Certifiable Robustness | Unknown | N/A | |
| On the Power of Compressed Sensing with Generative Models | Unknown | N/A | |
| Optimal Sequential Maximization: One Interview is Enough! | Unknown | N/A | |
| Invariant Causal Prediction for Block MDPs | Unknown | N/A | |
| Efficiently sampling functions from Gaussian process posteriors | Unknown | N/A | |
| On Second-Order Group Influence Functions for Black-Box Predictions | Unknown | N/A | |
| Randomly Projected Additive Gaussian Processes for Regression | Unknown | N/A | |
| Involutive MCMC: a Unifying Framework | Unknown | N/A | |
| Fair k-Centers via Maximum Matching | Unknown | N/A | |
| Safe Imitation Learning via Fast Bayesian Reward Inference from Preferences | Unknown | N/A | |
| Missing Data Imputation using Optimal Transport | Unknown | N/A | |
| Fast Learning of Graph Neural Networks with Guaranteed Generalizability: One-hidden-layer Case | Unknown | N/A | |
| Online Continual Learning from Imbalanced Data | Unknown | N/A | |
| AdaScale SGD: A User-Friendly Algorithm for Distributed Training | Unknown | N/A | |
| Automated Synthetic-to-Real Generalization | Unknown | N/A | |
| Structure Adaptive Algorithms for Stochastic Bandits | Unknown | N/A | |
| Uncertainty Estimation Using a Single Deep Deterministic Neural Network | Unknown | N/A | |
| Partial Trace Regression and Low-Rank Kraus Decomposition | Unknown | N/A | |
| Implicit Class-Conditioned Domain Alignment for Unsupervised Domain Adaptation | Unknown | N/A | |
| Extrapolation for Large-batch Training in Deep Learning | Unknown | N/A | |
| Preselection Bandits | Unknown | N/A | |
| Provably Efficient Model-based Policy Adaptation | Unknown | N/A | |
| Causal Inference using Gaussian Processes with Structured Latent Confounders | Unknown | N/A | |
| Robustifying Sequential Neural Processes | Unknown | N/A | |
| Optimizing Dynamic Structures with Bayesian Generative Search | Unknown | N/A | |
| Problems with Shapley-value-based explanations as feature importance measures | Unknown | N/A | |
| Mix-n-Match: Ensemble and Compositional Methods for Uncertainty Calibration in Deep Learning | Unknown | N/A | |
| No-Regret and Incentive-Compatible Online Learning | Unknown | N/A | |
| Gamification of Pure Exploration for Linear Bandits | Unknown | N/A | |
| Sparse Gaussian Processes with Spherical Harmonic Features | Unknown | N/A | |
| Stochastic Optimization for Regularized Wasserstein Estimators | Unknown | N/A | |
| Differentially Private Set Union | Unknown | N/A | |
| Understanding the Curse of Horizon in Off-Policy Evaluation via Conditional Importance Sampling | Unknown | N/A | |
| Poisson Learning: Graph Based Semi-Supervised Learning At Very Low Label Rates | Unknown | N/A | |
| Non-separable Non-stationary random fields | Unknown | N/A | |
| Puzzle Mix: Exploiting Saliency and Local Statistics for Optimal Mixup | Unknown | N/A | |
| Associative Memory in Iterated Overparameterized Sigmoid Autoencoders | Unknown | N/A | |
| Ordinal Non-negative Matrix Factorization for Recommendation | Unknown | N/A | |
| Generalization Error of Generalized Linear Models in High Dimensions | Unknown | N/A | |
| Confidence-Calibrated Adversarial Training: Generalizing to Unseen Attacks | Unknown | N/A | |
| Logarithmic Regret for Adversarial Online Control | Unknown | N/A | |
| Causal Modeling for Fairness In Dynamical Systems | Unknown | N/A | |
| Quantum Boosting | Unknown | N/A | |
| Skew-Fit: State-Covering Self-Supervised Reinforcement Learning | Unknown | N/A | |
| The Tree Ensemble Layer: Differentiability meets Conditional Computation | Unknown | N/A | |
| ACFlow: Flow Models for Arbitrary Conditional Likelihoods | Unknown | N/A | |
| Data-Dependent Differentially Private Parameter Learning for Directed Graphical Models | Unknown | N/A | |
| Combinatorial Pure Exploration for Dueling Bandit | Unknown | N/A | |
| Error Estimation for Sketched SVD via the Bootstrap | Unknown | N/A | |
| Generalization and Representational Limits of Graph Neural Networks | Unknown | N/A | |
| Enhanced POET: Open-ended Reinforcement Learning through Unbounded Invention of Learning Challenges and their Solutions | Unknown | N/A | |
| Transformers are RNNs: Fast Autoregressive Transformers with Linear Attention | Unknown | N/A | |
| Deep Isometric Learning for Visual Recognition | Unknown | N/A | |
| Unlabelled Data Improves Bayesian Uncertainty Calibration under Covariate Shift | Unknown | N/A | |
| Dual Mirror Descent for Online Allocation Problems | Unknown | N/A | |
| Model-free Reinforcement Learning in Infinite-horizon Average-reward Markov Decision Processes | Unknown | N/A | |
| Preference Modeling with Context-Dependent Salient Features | Unknown | N/A | |
| Efficient Non-conjugate Gaussian Process Factor Models for Spike Count Data using Polynomial Approximations | Unknown | N/A | |
| Growing Adaptive Multi-hyperplane Machines | Unknown | N/A | |
| Spectrum Dependent Learning Curves in Kernel Regression and Wide Neural Networks | Unknown | N/A | |
| On Semi-parametric Inference for BART | Unknown | N/A | |
| RIFLE: Backpropagation in Depth for Deep Transfer Learning through Re-Initializing the Fully-connected LayEr | Unknown | N/A | |
| Optimization from Structured Samples for Coverage Functions | Unknown | N/A | |
| Online Learning with Imperfect Hints | Unknown | N/A | |
| Alleviating Privacy Attacks via Causal Learning | Unknown | N/A | |
| Efficient Identification in Linear Structural Causal Models with Auxiliary Cutsets | Unknown | N/A | |
| The Effect of Natural Distribution Shift on Question Answering Models | Unknown | N/A | |
| Multiresolution Tensor Learning for Efficient and Interpretable Spatial Analysis | Unknown | N/A | |
| Feature Selection using Stochastic Gates | Unknown | N/A | |
| Information-Theoretic Local Minima Characterization and Regularization | Unknown | N/A | |
| Online Learned Continual Compression with Adaptive Quantization Modules | Unknown | N/A | |
| Semi-Supervised Learning with Normalizing Flows | Unknown | N/A | |
| Fractal Gaussian Networks: A sparse random graph model based on Gaussian Multiplicative Chaos | Unknown | N/A | |
| Educating Text Autoencoders: Latent Representation Guidance via Denoising | Unknown | N/A | |
| Stabilizing Differentiable Architecture Search via Perturbation-based Regularization | Unknown | N/A | |
| Constant Curvature Graph Convolutional Networks | Unknown | N/A | |
| Kernel interpolation with continuous volume sampling | Unknown | N/A | |
| IPBoost – Non-Convex Boosting via Integer Programming | Unknown | N/A | |
| Adaptive Gradient Descent without Descent | Unknown | N/A | |
| Concise Explanations of Neural Networks using Adversarial Training | Unknown | N/A | |
| Federated Learning with Only Positive Labels | Unknown | N/A | |
| The continuous categorical: a novel simplex-valued exponential family | Unknown | N/A | |
| PENNI: Pruned Kernel Sharing for Efficient CNN Inference | Unknown | N/A | |
| A Unified Theory of Decentralized SGD with Changing Topology and Local Updates | Unknown | N/A | |
| Efficient Intervention Design for Causal Discovery with Latents | Unknown | N/A | |
| Online Learning with Dependent Stochastic Feedback Graphs | Unknown | N/A | |
| Frequentist Uncertainty in Recurrent Neural Networks via Blockwise Influence Functions | Unknown | N/A | |
| Finite-Time Last-Iterate Convergence for Multi-Agent Learning in Games | Unknown | N/A | |
| Learning Adversarial Markov Decision Processes with Bandit Feedback and Unknown Transition | Unknown | N/A | |
| Maximum-and-Concatenation Networks | Unknown | N/A | |
| Optimistic Policy Optimization with Bandit Feedback | Unknown | N/A | |
| Moniqua: Modulo Quantized Communication in Decentralized SGD | Unknown | N/A | |
| Why Are Learned Indexes So Effective? | Unknown | N/A | |
| Learning Human Objectives by Evaluating Hypothetical Behavior | Unknown | N/A | |
| The Shapley Taylor Interaction Index | Unknown | N/A | |
| Test-Time Training with Self-Supervision for Generalization under Distribution Shifts | Unknown | N/A | |
| SGD Learns One-Layer Networks in WGANs | Unknown | N/A | |
| Kinematic State Abstraction and Provably Efficient Rich-Observation Reinforcement Learning | Unknown | N/A | |
| Faster Graph Embeddings via Coarsening | Unknown | N/A | |
| CURL: Contrastive Unsupervised Representations for Reinforcement Learning | Unknown | N/A | |
| Generative Pretraining From Pixels | Unknown | N/A | |
| Retro*: Learning Retrosynthetic Planning with Neural Guided A* Search | Unknown | N/A | |
| Amortized Population Gibbs Samplers with Neural Sufficient Statistics | Unknown | N/A | |
| What Can Learned Intrinsic Rewards Capture? | Unknown | N/A | |
| Learning Fair Policies in Multi-Objective (Deep) Reinforcement Learning with Average and Discounted Rewards | Unknown | N/A | |
| Set Functions for Time Series | Unknown | N/A | |
| Variational Imitation Learning with Diverse-quality Demonstrations | Unknown | N/A | |
| The Role of Regularization in Classification of High-dimensional Noisy Gaussian Mixture | Unknown | N/A | |
| A quantile-based approach for hyperparameter transfer learning | Unknown | N/A | |
| DeepCoDA: personalized interpretability for compositional health data | Unknown | N/A | |
| Optimal Non-parametric Learning in Repeated Contextual Auctions with Strategic Buyer | Unknown | N/A | |
| No-Regret Exploration in Goal-Oriented Reinforcement Learning | Unknown | N/A | |
| Unsupervised Speech Decomposition via Triple Information Bottleneck | Unknown | N/A | |
| Efficient Optimistic Exploration in Linear-Quadratic Regulators via Lagrangian Relaxation | Unknown | N/A | |
| Fast Differentiable Sorting and Ranking | Unknown | N/A | |
| Implicit Regularization of Random Feature Models | Unknown | N/A | |
| Continuous Graph Neural Networks | Unknown | N/A | |
| Consistent Estimators for Learning to Defer to an Expert | Unknown | N/A | |
| A Graph to Graphs Framework for Retrosynthesis Prediction | Unknown | N/A | |
| Learning Calibratable Policies using Programmatic Style-Consistency | Unknown | N/A | |
| Distance Metric Learning with Joint Representation Diversification | Unknown | N/A | |
| Near-optimal Regret Bounds for Stochastic Shortest Path | Unknown | N/A | |
| ECLIPSE: An Extreme-Scale Linear Program Solver for Web-Applications | Unknown | N/A | |
| Model-Based Reinforcement Learning with Value-Targeted Regression | Unknown | N/A | |
| A Sequential Self Teaching Approach for Improving Generalization in Sound Event Recognition | Unknown | N/A | |
| On the Global Convergence Rates of Softmax Policy Gradient Methods | Unknown | N/A | |
| Learnable Group Transform For Time-Series | Unknown | N/A | |
| Fair Generative Modeling via Weak Supervision | Unknown | N/A | |
| Obtaining Adjustable Regularization for Free via Iterate Averaging | Unknown | N/A | |
| Invariant Risk Minimization Games | Unknown | N/A | |
| Linear Mode Connectivity and the Lottery Ticket Hypothesis | Unknown | N/A | |
| Input-Sparsity Low Rank Approximation in Schatten Norm | Unknown | N/A | |
| Convolutional dictionary learning based auto-encoders for natural exponential-family distributions | Unknown | N/A | |
| Optimal transport mapping via input convex neural networks | Unknown | N/A | |
| Visual Grounding of Learned Physical Models | Unknown | N/A | |
| GraphOpt: Learning Optimization Models of Graph Formation | Unknown | N/A | |
| An EM Approach to Non-autoregressive Conditional Sequence Generation | Unknown | N/A | |
| Training Neural Networks for and by Interpolation | Unknown | N/A | |
| On the Iteration Complexity of Hypergradient Computation | Unknown | N/A | |
| Balancing Competing Objectives with Noisy Data: Score-Based Classifiers for Welfare-Aware Machine Learning | Unknown | N/A | |
| InstaHide: Instance-hiding Schemes for Private Distributed Learning | Unknown | N/A | |
| Variance Reduction in Stochastic Particle-Optimization Sampling | Unknown | N/A | |
| The Sample Complexity of Best-$k$ Items Selection from Pairwise Comparisons | Unknown | N/A | |
| Efficient Domain Generalization via Common-Specific Low-Rank Decomposition | Unknown | N/A | |
| Approximating Stacked and Bidirectional Recurrent Architectures with the Delayed Recurrent Neural Network | Unknown | N/A | |
| Born-again Tree Ensembles | Unknown | N/A | |
| Learning to Simulate Complex Physics with Graph Networks | Unknown | N/A | |
| Learning to Simulate and Design for Structural Engineering | Unknown | N/A | |
| Understanding Contrastive Representation Learning through Alignment and Uniformity on the Hypersphere | Unknown | N/A | |
| Efficient Robustness Certificates for Discrete Data: Sparsity-Aware Randomized Smoothing for Graphs, Images and More | Unknown | N/A | |
| Towards a General Theory of Infinite-Width Limits of Neural Classifiers | Unknown | N/A | |
| PolyGen: An Autoregressive Generative Model of 3D Meshes | Unknown | N/A | |
| Multiclass Neural Network Minimization via Tropical Newton Polytope Approximation | Unknown | N/A | |
| Structured Prediction with Partial Labelling through the Infimum Loss | Unknown | N/A | |
| XtarNet: Learning to Extract Task-Adaptive Representation for Incremental Few-Shot Learning | Unknown | N/A | |
| Dissecting Non-Vacuous Generalization Bounds based on the Mean-Field Approximation | Unknown | N/A | |
| Controlling Overestimation Bias with Truncated Mixture of Continuous Distributional Quantile Critics | Unknown | N/A | |
| Constructive Universal High-Dimensional Distribution Generation through Deep ReLU Networks | Unknown | N/A | |
| Regularized Optimal Transport is Ground Cost Adversarial | Unknown | N/A | |
| On Convergence-Diagnostic based Step Sizes for Stochastic Gradient Descent | Unknown | N/A | |
| Hierarchically Decoupled Imitation For Morphological Transfer | Unknown | N/A | |
| Minimally distorted Adversarial Examples with a Fast Adaptive Boundary Attack | Unknown | N/A | |
| Equivariant Flows: Exact Likelihood Generative Learning for Symmetric Densities | Unknown | N/A | |
| The Buckley-Osthus model and the block preferential attachment model: statistical analysis and application | Unknown | N/A | |
| Causal Strategic Linear Regression | Unknown | N/A | |
| Online metric algorithms with untrusted predictions | Unknown | N/A | |
| Piecewise Linear Regression via a Difference of Convex Functions | Unknown | N/A | |
| Robustness to Spurious Correlations via Human Annotations | Unknown | N/A | |
| On Learning Language-Invariant Representations for Universal Machine Translation | Unknown | N/A | |
| A simpler approach to accelerated optimization: iterative averaging meets optimism | Unknown | N/A | |
| Sub-linear Memory Sketches for Near Neighbor Search on Streaming Data | Unknown | N/A | |
| Neural Topic Modeling with Continual Lifelong Learning | Unknown | N/A | |
| On Lp-norm Robustness of Ensemble Decision Stumps and Trees | Unknown | N/A | |
| Recht-Ré Noncommutative Arithmetic-Geometric Mean Conjecture is False | Unknown | N/A | |
| High-dimensional Robust Mean Estimation via Gradient Descent | Unknown | N/A | |
| Fundamental Tradeoffs between Invariance and Sensitivity to Adversarial Perturbations | Unknown | N/A | |
| On Implicit Regularization in $\beta$-VAEs | Unknown | N/A | |
| Uncertainty quantification for nonconvex tensor completion: Confidence intervals, heteroscedasticity and optimality | Unknown | N/A | |
| A Nearly-Linear Time Algorithm for Exact Community Recovery in Stochastic Block Model | Unknown | N/A | |
| Concept Bottleneck Models | Unknown | N/A | |
| FACT: A Diagnostic for Group Fairness Trade-offs | Unknown | N/A | |
| Data Amplification: Instance-Optimal Property Estimation | Unknown | N/A | |
| Stochastic Hamiltonian Gradient Methods for Smooth Games | Unknown | N/A | |
| DROCC: Deep Robust One-Class Classification | Unknown | N/A | |
| Predictive Coding for Locally-Linear Control | Unknown | N/A | |
| Understanding Self-Training for Gradual Domain Adaptation | Unknown | N/A | |
| Improved Bounds on Minimax Regret under Logarithmic Loss via Self-Concordance | Unknown | N/A | |
| Do We Need Zero Training Loss After Achieving Zero Training Error? | Unknown | N/A | |
| On Thompson Sampling with Langevin Algorithms | Unknown | N/A | |
| Strategic Classification is Causal Modeling in Disguise | Unknown | N/A | |
| Decentralised Learning with Random Features and Distributed Gradient Descent | Unknown | N/A | |
| Topic Modeling via Full Dependence Mixtures | Unknown | N/A | |
| Transformer Hawkes Process | Unknown | N/A | |
| Adversarial Mutual Information for Text Generation | Unknown | N/A | |
| Optimal Estimator for Unlabeled Linear Regression | Unknown | N/A | |
| Learning Near Optimal Policies with Low Inherent Bellman Error | Unknown | N/A | |
| Margin-aware Adversarial Domain Adaptation with Optimal Transport | Unknown | N/A | |
| Message Passing Least Squares Framework and its Application to Rotation Synchronization | Unknown | N/A | |
| Improving Robustness of Deep-Learning-Based Image Reconstruction | Unknown | N/A | |
| Domain Aggregation Networks for Multi-Source Domain Adaptation | Unknown | N/A | |
| Recurrent Hierarchical Topic-Guided RNN for Language Generation | Unknown | N/A | |
| Closing the convergence gap of SGD without replacement | Unknown | N/A | |
| Learning to Navigate The Synthetically Accessible Chemical Space Using Reinforcement Learning | Unknown | N/A | |
| Emergence of Separable Manifolds in Deep Language Representations | Unknown | N/A | |
| Recovery of Sparse Signals from a Mixture of Linear Samples | Unknown | N/A | |
| Don't Waste Your Bits! Squeeze Activations and Gradients for Deep Neural Networks via TinyScript | Unknown | N/A | |
| Flexible and Efficient Long-Range Planning Through Curious Exploration | Unknown | N/A | |
| Private Outsourced Bayesian Optimization | Unknown | N/A | |
| Sparse Convex Optimization via Adaptively Regularized Hard Thresholding | Unknown | N/A | |
| Orthogonalized SGD and Nested Architectures for Anytime Neural Networks | Unknown | N/A | |
| Discriminative Jackknife: Quantifying Uncertainty in Deep Learning via Higher-Order Influence Functions | Unknown | N/A | |
| Improved Communication Cost in Distributed PageRank Computation – A Theoretical Study | Unknown | N/A | |
| Collapsed Amortized Variational Inference for Switching Nonlinear Dynamical Systems | Unknown | N/A | |
| On the Convergence of Nesterov's Accelerated Gradient Method in Stochastic Settings | Unknown | N/A | |
| Causal Structure Discovery from Distributions Arising from Mixtures of DAGs | Unknown | N/A | |
| A Distributional Framework For Data Valuation | Unknown | N/A | |
| Customizing ML Predictions for Online Algorithms | Unknown | N/A | |
| Learning Adversarially Robust Representations via Worst-Case Mutual Information Maximization | Unknown | N/A | |
| Individual Calibration with Randomized Forecasting | Unknown | N/A | |
| Bayesian Differential Privacy for Machine Learning | Unknown | N/A | |
| Logistic Regression for Massive Data with Rare Events | Unknown | N/A | |
| Meta-learning with Stochastic Linear Bandits | Unknown | N/A | |
| Parallel Algorithm for Non-Monotone DR-Submodular Maximization | Unknown | N/A | |
| Deep Divergence Learning | Unknown | N/A | |
| TrajectoryNet: A Dynamic Optimal Transport Network for Modeling Cellular Dynamics | Unknown | N/A | |
| A new regret analysis for Adam-type algorithms | Unknown | N/A | |
| InfoGAN-CR and ModelCentrality: Self-supervised Model Training and Selection for Disentangling GANs | Unknown | N/A | |
| Adversarial Robustness for Code | Unknown | N/A | |
| Curvature-corrected learning dynamics in deep neural networks | Unknown | N/A | |
| Attentive Group Equivariant Convolutional Networks | Unknown | N/A | |
| XTREME: A Massively Multilingual Multi-task Benchmark for Evaluating Cross-lingual Generalisation | Unknown | N/A | |
| Guided Learning of Nonconvex Models through Successive Functional Gradient Optimization | Unknown | N/A | |
| Causal Effect Identifiability under Partial-Observability | Unknown | N/A | |
| Scalable Exact Inference in Multi-Output Gaussian Processes | Unknown | N/A | |
| Topologically Densified Distributions | Unknown | N/A | |
| Graph Filtration Learning | Unknown | N/A | |
| Student-Teacher Curriculum Learning via Reinforcement Learning: Predicting Hospital Inpatient Admission Location | Unknown | N/A | |
| Scalable Differential Privacy with Certified Robustness in Adversarial Learning | Unknown | N/A | |
| A Free-Energy Principle for Representation Learning | Unknown | N/A | |
| Generalisation error in learning with random features and the hidden manifold model | Unknown | N/A | |
| Why bigger is not always better: on finite and infinite neural networks | Unknown | N/A | |
| Budgeted Online Influence Maximization | Unknown | N/A | |
| Being Bayesian about Categorical Probability | Unknown | N/A | |
| Structured Policy Iteration for Linear Quadratic Regulator | Unknown | N/A | |
| Intrinsic Reward Driven Imitation Learning via Generative Model | Unknown | N/A | |
| Manifold Identification for Ultimately Communication-Efficient Distributed Optimization | Unknown | N/A | |
| When Demands Evolve Larger and Noisier: Learning and Earning in a Growing Environment | Unknown | N/A | |
| Communication-Efficient Distributed PCA by Riemannian Optimization | Unknown | N/A | |
| Learning Reasoning Strategies in End-to-End Differentiable Proving | Unknown | N/A | |
| Learning Algebraic Multigrid Using Graph Neural Networks | Unknown | N/A | |
| Normalized Flat Minima: Exploring Scale Invariant Definition of Flat Minima for Neural Networks Using PAC-Bayesian Analysis | Unknown | N/A | |
| Neural Kernels Without Tangents | Unknown | N/A | |
| Q-value Path Decomposition for Deep Multiagent Reinforcement Learning | Unknown | N/A | |
| Quantized Decentralized Stochastic Learning over Directed Graphs | Unknown | N/A | |
| Continuous-time Lower Bounds for Gradient-based Algorithms | Unknown | N/A | |
| Adaptive Droplet Routing in Digital Microfluidic Biochips Using Deep Reinforcement Learning | Unknown | N/A | |
| Coresets for Data-efficient Training of Machine Learning Models | Unknown | N/A | |
| Sample Factory: Egocentric 3D Control from Pixels at 100000 FPS with Asynchronous Reinforcement Learning | Unknown | N/A | |
| Learning Task-Agnostic Embedding of Multiple Black-Box Experts for Multi-Task Model Fusion | Unknown | N/A | |
| Safe screening rules for L0-regression from Perspective Relaxations | Unknown | N/A | |
| Semi-Supervised StyleGAN for Disentanglement Learning | Unknown | N/A | |
| Variational Label Enhancement | Unknown | N/A | |
| Optimizing Long-term Social Welfare in Recommender Systems: A Constrained Matching Approach | Unknown | N/A | |
| The Non-IID Data Quagmire of Decentralized Machine Learning | Unknown | N/A | |
| Learning from Irregularly-Sampled Time Series: A Missing Data Perspective | Unknown | N/A | |
| Parametric Gaussian Process Regressors | Unknown | N/A | |
| Evaluating the Performance of Reinforcement Learning Algorithms | Unknown | N/A | |
| Eliminating the Invariance on the Loss Landscape of Linear Autoencoders | Unknown | N/A | |
| FormulaZero: Distributionally Robust Online Adaptation via Offline Population Synthesis | Unknown | N/A | |
| An end-to-end approach for the verification problem: learning the right distance | Unknown | N/A | |
| Decentralized Reinforcement Learning: Global Decision-Making via Local Economic Transactions | Unknown | N/A | |
| Near-Tight Margin-Based Generalization Bounds for Support Vector Machines | Unknown | N/A | |
| Frustratingly Simple Few-Shot Object Detection | Unknown | N/A | |
| DeepMatch: Balancing Deep Covariate Representations for Causal Inference Using Adversarial Training | Unknown | N/A | |
| Rethinking Bias-Variance Trade-off for Generalization of Neural Networks | Unknown | N/A | |
| Universal Equivariant Multilayer Perceptrons | Unknown | N/A | |
| Optimal Robust Learning of Discrete Distributions from Batches | Unknown | N/A | |
| Multi-Precision Policy Enforced Training (MuPPET): A Precision-Switching Strategy for Quantised Fixed-Point Training of CNNs | Unknown | N/A | |
| Energy-Based Processes for Exchangeable Data | Unknown | N/A | |
| An end-to-end Differentially Private Latent Dirichlet Allocation Using a Spectral Algorithm | Unknown | N/A | |
| LowFER: Low-rank Bilinear Pooling for Link Prediction | Unknown | N/A | |
| SimGANs: Simulator-Based Generative Adversarial Networks for ECG Synthesis to Improve Deep ECG Classification | Unknown | N/A | |
| Momentum Improves Normalized SGD | Unknown | N/A | |
| When are Non-Parametric Methods Robust? | Unknown | N/A | |
| Is There a Trade-Off Between Fairness and Accuracy? A Perspective Using Mismatched Hypothesis Testing | Unknown | N/A | |
| Multi-Objective Molecule Generation using Interpretable Substructures | Unknown | N/A | |
| The Implicit Regularization of Stochastic Gradient Flow for Least Squares | Unknown | N/A | |
| Learning Representations that Support Extrapolation | Unknown | N/A | |
| Frequency Bias in Neural Networks for Input of Non-Uniform Density | Unknown | N/A | |
| Incremental Sampling Without Replacement for Sequence Models | Unknown | N/A | |
| Neural Networks are Convex Regularizers: Exact Polynomial-time Convex Optimization Formulations for Two-layer Networks | Unknown | N/A | |
| Variable Skipping for Autoregressive Range Density Estimation | Unknown | N/A | |
| TaskNorm: Rethinking Batch Normalization for Meta-Learning | Unknown | N/A | |
| Acceleration for Compressed Gradient Descent in Distributed and Federated Optimization | Unknown | N/A | |
| Learning to Score Behaviors for Guided Policy Optimization | Unknown | N/A | |
| Invertible generative models for inverse problems: mitigating representation error and dataset bias | Unknown | N/A | |
| Anderson Acceleration of Proximal Gradient Methods | Unknown | N/A | |
| Harmonic Decompositions of Convolutional Networks | Unknown | N/A | |
| Deep k-NN for Noisy Labels | Unknown | N/A | |
| An Explicitly Relational Neural Network Architecture | Unknown | N/A | |
| Private Counting from Anonymous Messages: Near-Optimal Accuracy with Vanishing Communication Overhead | Unknown | N/A | |
| Learning with Multiple Complementary Labels | Unknown | N/A | |
| Understanding the Impact of Model Incoherence on Convergence of Incremental SGD with Random Reshuffle | Unknown | N/A | |
| LTF: A Label Transformation Framework for Correcting Label Shift | Unknown | N/A | |
| Quadratically Regularized Subgradient Methods for Weakly Convex Optimization with Weakly Convex Constraints | Unknown | N/A | |
| Learning Portable Representations for High-Level Planning | Unknown | N/A | |
| Choice Set Optimization Under Discrete Choice Models of Group Decisions | Unknown | N/A | |
| Improving the Sample and Communication Complexity for Decentralized Non-Convex Optimization: Joint Gradient Estimation and Tracking | Unknown | N/A | |
| Unique Properties of Flat Minima in Deep Networks | Unknown | N/A | |
| Stochastic Differential Equations with Variational Wishart Diffusions | Unknown | N/A | |
| Explaining Groups of Points in Low-Dimensional Representations | Unknown | N/A | |
| Understanding and Stabilizing GANs' Training Dynamics Using Control Theory | Unknown | N/A | |
| Calibration, Entropy Rates, and Memory in Language Models | Unknown | N/A | |
| Entropy Minimization In Emergent Languages | Unknown | N/A | |
| Model Fusion with Kullback-Leibler Divergence | Unknown | N/A | |
| Momentum-Based Policy Gradient Methods | Unknown | N/A | |
| Scalable Nearest Neighbor Search for Optimal Transport | Unknown | N/A | |
| Discount Factor as a Regularizer in Reinforcement Learning | Unknown | N/A | |
| Small-GAN: Speeding up GAN Training using Core-Sets | Unknown | N/A | |
| Median Matrix Completion: from Embarrassment to Optimality | Unknown | N/A | |
| Exploration Through Reward Biasing: Reward-Biased Maximum Likelihood Estimation for Stochastic Multi-Armed Bandits | Unknown | N/A | |
| Online mirror descent and dual averaging: keeping pace in the dynamic case | Unknown | N/A | |
| Differentiable Likelihoods for Fast Inversion of 'Likelihood-Free' Dynamical Systems | Unknown | N/A | |
| Encoding Musical Style with Transformer Autoencoders | Unknown | N/A | |
| Interferometric Graph Transform: a Deep Unsupervised Graph Representation | Unknown | N/A | |
| Robust Pricing in Dynamic Mechanism Design | Unknown | N/A | |
| SDE-Net: Equipping Deep Neural Networks with Uncertainty Estimates | Unknown | N/A | |
| Spectral Clustering with Graph Neural Networks for Graph Pooling | Unknown | N/A | |
| Fully Parallel Hyperparameter Search: Reshaped Space-Filling | Unknown | N/A | |
| On Relativistic f-Divergences | Unknown | N/A | |
| Projection-free Distributed Online Convex Optimization with $O(\sqrt{T})$ Communication Complexity | Unknown | N/A | |
| Good Subnetworks Provably Exist: Pruning via Greedy Forward Selection | Unknown | N/A | |
| FR-Train: A Mutual Information-Based Approach to Fair and Robust Training | Unknown | N/A | |
| Robust Outlier Arm Identification | Unknown | N/A | |
| Stochastic bandits with arm-dependent delays | Unknown | N/A | |
| Consistent Structured Prediction with Max-Min Margin Markov Networks | Unknown | N/A | |
| Latent Space Factorisation and Manipulation via Matrix Subspace Projection | Unknown | N/A | |
| Modulating Surrogates for Bayesian Optimization | Unknown | N/A | |
| Fast Deterministic CUR Matrix Decomposition with Accuracy Assurance | Unknown | N/A | |
| Random extrapolation for primal-dual coordinate descent | Unknown | N/A | |
| Concentration bounds for CVaR estimation: The cases of light-tailed and heavy-tailed distributions | Unknown | N/A | |
| Active World Model Learning in Agent-rich Environments with Progress Curiosity | Unknown | N/A | |
| Predicting Choice with Set-Dependent Aggregation | Unknown | N/A | |
| Landscape Connectivity and Dropout Stability of SGD Solutions for Over-parameterized Neural Networks | Unknown | N/A | |
| Projective Preferential Bayesian Optimization | Unknown | N/A | |
| Simple and Deep Graph Convolutional Networks | Unknown | N/A | |
| Stochastic Gauss-Newton Algorithms for Nonconvex Compositional Optimization | Unknown | N/A | |
| Real-Time Optimisation for Online Learning in Auctions | Unknown | N/A | |
| Multidimensional Shape Constraints | Unknown | N/A | |
| Robust Bayesian Classification Using An Optimistic Score Ratio | Unknown | N/A | |
| Convergence Rates of Variational Inference in Sparse Deep Learning | Unknown | N/A | |
| Logarithmic Regret for Learning Linear Quadratic Regulators Efficiently | Unknown | N/A | |
| Deep Coordination Graphs | Unknown | N/A | |
| Reinforcement Learning for Molecular Design Guided by Quantum Mechanics | Unknown | N/A | |
| A Natural Lottery Ticket Winner: Reinforcement Learning with Ordinary Neural Circuits | Unknown | N/A | |
| Unsupervised Transfer Learning for Spatiotemporal Predictive Networks | Unknown | N/A | |
| Strategyproof Mean Estimation from Multiple-Choice Questions | Unknown | N/A | |
| Double Trouble in Double Descent: Bias and Variance(s) in the Lazy Regime | Unknown | N/A | |
| Imputer: Sequence Modelling via Imputation and Dynamic Programming | Unknown | N/A | |
| Towards non-parametric drift detection via Dynamic Adapting Window Independence Drift Detection (DAWIDD) | Unknown | N/A | |
| Information Particle Filter Tree: An Online Algorithm for POMDPs with Belief-Based Rewards on Continuous Domains | Unknown | N/A | |
| On the Global Optimality of Model-Agnostic Meta-Learning | Unknown | N/A | |
| Non-convex Learning via Replica Exchange Stochastic Gradient MCMC | Unknown | N/A | |
| Continuously Indexed Domain Adaptation | Unknown | N/A | |
| Minimax Rate for Learning From Pairwise Comparisons in the BTL Model | Unknown | N/A | |
| Cost-effectively Identifying Causal Effects When Only Response Variable is Observable | Unknown | N/A | |
| Sequential Cooperative Bayesian Inference | Unknown | N/A | |
| Converging to Team-Maxmin Equilibria in Zero-Sum Multiplayer Games | Unknown | N/A | |
| Adaptive Region-Based Active Learning | Unknown | N/A | |
| Private Reinforcement Learning with PAC and Regret Guarantees | Unknown | N/A | |
| Sparse Subspace Clustering with Entropy-Norm | Unknown | N/A | |
| Generalization Guarantees for Sparse Kernel Approximation with Entropic Optimal Features | Unknown | N/A | |
| Debiased Sinkhorn barycenters | Unknown | N/A | |
| Principled learning method for Wasserstein distributionally robust optimization with local perturbations | Unknown | N/A | |
| Generative Flows with Matrix Exponential | Unknown | N/A | |
| Equivariant Neural Rendering | Unknown | N/A | |
| Optimizing Black-box Metrics with Adaptive Surrogates | Unknown | N/A | |
| Explore, Discover and Learn: Unsupervised Discovery of State-Covering Skills | Unknown | N/A | |
| Self-Attentive Hawkes Process | Unknown | N/A | |
| Negative Sampling in Semi-Supervised learning | Unknown | N/A | |
| Sparse Shrunk Additive Models | Unknown | N/A | |
| Spread Divergence | Unknown | N/A | |
| VFlow: More Expressive Generative Flows with Variational Data Augmentation | Unknown | N/A | |
| Adaptive Sketching for Fast and Convergent Canonical Polyadic Decomposition | Unknown | N/A | |
| Label-Noise Robust Domain Adaptation | Unknown | N/A | |
| Learning Opinions in Social Networks | Unknown | N/A | |
| Voice Separation with an Unknown Number of Multiple Speakers | Unknown | N/A | |
| Off-Policy Actor-Critic with Shared Experience Replay | Unknown | N/A | |
| Self-Concordant Analysis of Frank-Wolfe Algorithms | Unknown | N/A | |
| On the Generalization Effects of Linear Transformations in Data Augmentation | Unknown | N/A | |
| Graph Random Neural Features for Distance-Preserving Graph Representations | Unknown | N/A | |
| Provably Efficient Exploration in Policy Optimization | Unknown | N/A | |
| Striving for Simplicity and Performance in Off-Policy DRL: Output Normalization and Non-Uniform Sampling | Unknown | N/A | |
| Doubly Stochastic Variational Inference for Neural Processes with Hierarchical Latent Variables | Unknown | N/A | |
| Quantum Expectation-Maximization for Gaussian mixture models | Unknown | N/A | |
| Proper Network Interpretability Helps Adversarial Robustness in Classification | Unknown | N/A | |
| DeBayes: a Bayesian Method for Debiasing Network Embeddings | Unknown | N/A | |
| Learning Deep Kernels for Non-Parametric Two-Sample Tests | Unknown | N/A | |
| Stabilizing Transformers for Reinforcement Learning | Unknown | N/A | |
| Duality in RKHSs with Infinite Dimensional Outputs: Application to Robust Losses | Unknown | N/A | |
| OPtions as REsponses: Grounding behavioural hierarchies in multi-agent reinforcement learning | Unknown | N/A | |
| Divide, Conquer, and Combine: a New Inference Strategy for Probabilistic Programs with Stochastic Support | Unknown | N/A | |
| Influenza Forecasting Framework based on Gaussian Processes | Unknown | N/A | |
| Does the Markov Decision Process Fit the Data: Testing for the Markov Property in Sequential Decision Making | Unknown | N/A | |
| Ready Policy One: World Building Through Active Learning | Unknown | N/A | |
| Semismooth Newton Algorithm for Efficient Projections onto $\ell_{1, \infty}$-norm Ball | Unknown | N/A | |
| Graph-based Nearest Neighbor Search: From Practice to Theory | Unknown | N/A | |
| Structural Language Models of Code | Unknown | N/A | |
| Policy Teaching via Environment Poisoning: Training-time Adversarial Attacks against Reinforcement Learning | Unknown | N/A | |
| PEGASUS: Pre-training with Extracted Gap-sentences for Abstractive Summarization | Unknown | N/A | |
| On Differentially Private Stochastic Convex Optimization with Heavy-tailed Data | Unknown | N/A | |
| Statistically Efficient Off-Policy Policy Gradients | Unknown | N/A | |
| Nearly Linear Row Sampling Algorithm for Quantile Regression | Unknown | N/A | |
| A Generic First-Order Algorithmic Framework for Bi-Level Programming Beyond Lower-Level Singleton | Unknown | N/A | |
| On Leveraging Pretrained GANs for Generation with Limited Data | Unknown | N/A | |
| Few-shot Domain Adaptation by Causal Mechanism Transfer | Unknown | N/A | |
| Adaptive Adversarial Multi-task Representation Learning | Unknown | N/A | |
| Variance Reduced Coordinate Descent with Acceleration: New Method With a Surprising Application to Finite-Sum Problems | Unknown | N/A | |
| Learning Structured Latent Factors from Dependent Data: A Generative Model Framework from Information-Theoretic Perspective | Unknown | N/A | |
| Generating Programmatic Referring Expressions via Program Synthesis | Unknown | N/A | |
| Explicit Gradient Learning for Black-Box Optimization | Unknown | N/A | |
| Optimization and Analysis of the pAp@k Metric for Recommender Systems | Unknown | N/A | |
| Self-PU: Self Boosted and Calibrated Positive-Unlabeled Training | Unknown | N/A | |
| Prediction-Guided Multi-Objective Reinforcement Learning for Continuous Robot Control | Unknown | N/A | |
| Interpretations are Useful: Penalizing Explanations to Align Neural Networks with Prior Knowledge | Unknown | N/A | |
| Training Linear Neural Networks: Non-Local Convergence and Complexity Results | Unknown | N/A | |
| ROMA: Multi-Agent Reinforcement Learning with Emergent Roles | Unknown | N/A | |
| Online Pricing with Offline Data: Phase Transition and Inverse Square Law | Unknown | N/A | |
| Stochastic Gradient and Langevin Processes | Unknown | N/A | |
| Minimax Pareto Fairness: A Multi Objective Perspective | Unknown | N/A | |
| When Explanations Lie: Why Many Modified BP Attributions Fail | Unknown | N/A | |
| Approximation Guarantees of Local Search Algorithms via Localizability of Set Functions | Unknown | N/A | |
| DeltaGrad: Rapid retraining of machine learning models | Unknown | N/A | |
| Teaching with Limited Information on the Learner's Behaviour | Unknown | N/A | |
| Do RNN and LSTM have Long Memory? | Unknown | N/A | |
| On the Unreasonable Effectiveness of the Greedy Algorithm: Greedy Adapts to Sharpness | Unknown | N/A | |
| Taylor Expansion Policy Optimization | Unknown | N/A | |
| Layered Sampling for Robust Optimization Problems | Unknown | N/A | |
| Adaptive Checkpoint Adjoint Method for Gradient Estimation in Neural ODE | Unknown | N/A | |
| One Size Fits All: Can We Train One Denoiser for All Noise Levels? | Unknown | N/A | |
| Learning to Encode Position for Transformer with Continuous Dynamical Model | Unknown | N/A | |
| Certified Data Removal from Machine Learning Models | Unknown | N/A | |
| GNN-FiLM: Graph Neural Networks with Feature-wise Linear Modulation | Unknown | N/A | |
| Approximation Capabilities of Neural ODEs and Invertible Residual Networks | Unknown | N/A | |
| Maximum Likelihood with Bias-Corrected Calibration is Hard-To-Beat at Label Shift Adaptation | Unknown | N/A | |
| Random Matrix Theory Proves that Deep Learning Representations of GAN-data Behave as Gaussian Mixtures | Unknown | N/A | |
| Pretrained Generalized Autoregressive Model with Adaptive Probabilistic Label Clusters for Extreme Multi-label Text Classification | Unknown | N/A | |
| Fast OSCAR and OWL Regression via Safe Screening Rules | Unknown | N/A | |
| Inertial Block Proximal Methods for Non-Convex Non-Smooth Optimization | Unknown | N/A | |
| Statistically Preconditioned Accelerated Gradient Method for Distributed Optimization | Unknown | N/A | |
| Uncertainty-Aware Lookahead Factor Models for Quantitative Investing | Unknown | N/A | |
| Learning Efficient Multi-agent Communication: An Information Bottleneck Approach | Unknown | N/A | |
| Multi-Agent Determinantal Q-Learning | Unknown | N/A | |
| Task Understanding from Confusing Multi-task Data | Unknown | N/A | |
| Privately detecting changes in unknown distributions | Unknown | N/A | |
| Linear Convergence of Randomized Primal-Dual Coordinate Method for Large-scale Linear Constrained Convex Programming | Unknown | N/A | |
| Symbolic Network: Generalized Neural Policies for Relational MDPs | Unknown | N/A | |
| Stochastic Flows and Geometric Optimization on the Orthogonal Group | Unknown | N/A | |
| Topological Autoencoders | Unknown | N/A | |
| On Layer Normalization in the Transformer Architecture | Unknown | N/A | |
| Fiduciary Bandits | Unknown | N/A | |
| One-shot Distributed Ridge Regression in High Dimensions | Unknown | N/A | |
| Robustness to Programmable String Transformations via Augmented Abstract Training | Unknown | N/A | |
| Towards Accurate Post-training Network Quantization via Bit-Split and Stitching | Unknown | N/A | |
| Channel Equilibrium Networks for Learning Deep Representation | Unknown | N/A | |
| Universal Asymptotic Optimality of Polyak Momentum | Unknown | N/A | |
| Progressive Graph Learning for Open-Set Domain Adaptation | Unknown | N/A | |
| SoftSort: A Continuous Relaxation for the argsort Operator | Unknown | N/A | |
| Cooperative Multi-Agent Bandits with Heavy Tails | Unknown | N/A | |
| Enhancing Simple Models by Exploiting What They Already Know | Unknown | N/A | |
| Cost-Effective Interactive Attention Learning with Neural Attention Processes | Unknown | N/A | |
| Active Learning on Attributed Graphs via Graph Cognizant Logistic Regression and Preemptive Query Generation | Unknown | N/A | |
| Learning De-biased Representations with Biased Representations | Unknown | N/A | |
| Too Relaxed to Be Fair | Unknown | N/A | |
| From Importance Sampling to Doubly Robust Policy Gradient | Unknown | N/A | |
| Context Aware Local Differential Privacy | Unknown | N/A | |
| From PAC to Instance-Optimal Sample Complexity in the Plackett-Luce Model | Unknown | N/A | |
| Graph Structure of Neural Networks | Unknown | N/A | |
| Fractional Underdamped Langevin Dynamics: Retargeting SGD with Momentum under Heavy-Tailed Gradient Noise | Unknown | N/A | |
| Graph Convolutional Network for Recommendation with Low-pass Collaborative Filters | Unknown | N/A | |
| Automatic Reparameterisation of Probabilistic Programs | Unknown | N/A | |
| Privately Learning Markov Random Fields | Unknown | N/A | |
| Improved Sleeping Bandits with Stochastic Action Sets and Adversarial Rewards | Unknown | N/A | |
| Clinician-in-the-Loop Decision Making: Reinforcement Learning with Near-Optimal Set-Valued Policies | Unknown | N/A | |
| Learning Factorized Weight Matrix for Joint Filtering | Unknown | N/A | |
| Optimal Continual Learning has Perfect Memory and is NP-hard | Unknown | N/A | |
| More Information Supervised Probabilistic Deep Face Embedding Learning | Unknown | N/A | |
| Acceleration through spectral density estimation | Unknown | N/A | |
| Reverse-engineering deep ReLU networks | Unknown | N/A | |
| Almost Tune-Free Variance Reduction | Unknown | N/A | |
| Breaking the Curse of Space Explosion: Towards Efficient NAS with Curriculum Search | Unknown | N/A | |
| Randomized Block-Diagonal Preconditioning for Parallel Learning | Unknown | N/A | |
| Class-Weighted Classification: Trade-offs and Robust Approaches | Unknown | N/A | |
| Hybrid Stochastic-Deterministic Minibatch Proximal Gradient: Less-Than-Single-Pass Optimization with Nearly Optimal Generalization | Unknown | N/A | |
| New Oracle-Efficient Algorithms for Private Synthetic Data Release | Unknown | N/A | |
| Do We Really Need to Access the Source Data? Source Hypothesis Transfer for Unsupervised Domain Adaptation | Unknown | N/A | |
| Scalable Differentiable Physics for Learning and Control | Unknown | N/A | |
| Best Arm Identification for Cascading Bandits in the Fixed Confidence Setting | Unknown | N/A | |
| Training Binary Neural Networks through Learning with Noisy Supervision | Unknown | N/A | |
| My Fair Bandit: Distributed Learning of Max-Min Fairness with Multi-player Bandits | Unknown | N/A | |
| The Complexity of Finding Stationary Points with Stochastic Gradient Descent | Unknown | N/A | |
| Reserve Pricing in Repeated Second-Price Auctions with Strategic Bidders | Unknown | N/A | |
| Uniform Convergence of Rank-weighted Learning | Unknown | N/A | |
| DRWR: A Differentiable Renderer without Rendering for Unsupervised 3D Structure Learning from Silhouette Images | Unknown | N/A | |
| The Many Shapley Values for Model Explanation | Unknown | N/A | |
| Feature-map-level Online Adversarial Knowledge Distillation | Unknown | N/A | |
| Soft Threshold Weight Reparameterization for Learnable Sparsity | Unknown | N/A | |
| Performative Prediction | Unknown | N/A | |
| CAUSE: Learning Granger Causality from Event Sequences using Attribution Methods | Unknown | N/A | |
ICML 2021
| Title | Author | PDF_Link | Code_URL |
|---|---|---|---|
| Lossless Compression of Efficient Private Local Randomizers | Unknown | N/A | |
| On the Predictability of Pruning Across Scales | Unknown | N/A | |
| Emphatic Algorithms for Deep Reinforcement Learning | Unknown | N/A | |
| Regularizing towards Causal Invariance: Linear Models with Proxies | Unknown | N/A | |
| Efficient Performance Bounds for Primal-Dual Reinforcement Learning from Demonstrations | Unknown | N/A | |
| How Does Loss Function Affect Generalization Performance of Deep Learning? Application to Human Age Estimation | Unknown | N/A | |
| Provably Efficient Reinforcement Learning for Discounted MDPs with Feature Mapping | Unknown | N/A | |
| TempoRL: Learning When to Act | Unknown | N/A | |
| Tighter Bounds on the Log Marginal Likelihood of Gaussian Process Regression Using Conjugate Gradients | Unknown | N/A | |
| Memory Efficient Online Meta Learning | Unknown | N/A | |
| Adversarial Policy Learning in Two-player Competitive Games | Unknown | N/A | |
| Meta-Cal: Well-controlled Post-hoc Calibration by Ranking | Unknown | N/A | |
| Demonstration-Conditioned Reinforcement Learning for Few-Shot Imitation | Unknown | N/A | |
| Beyond Variance Reduction: Understanding the True Impact of Baselines on Policy Optimization | Unknown | N/A | |
| Equivariant Learning of Stochastic Fields: Gaussian Processes and Steerable Conditional Neural Processes | Unknown | N/A | |
| Combining Pessimism with Optimism for Robust and Efficient Model-Based Deep Reinforcement Learning | Unknown | N/A | |
| Deep Learning for Functional Data Analysis with Adaptive Basis Layers | Unknown | N/A | |
| Three Operator Splitting with a Nonconvex Loss Function | Unknown | N/A | |
| Causality-aware counterfactual confounding adjustment as an alternative to linear residualization in anticausal prediction tasks based on linear learners | Unknown | N/A | |
| DeepWalking Backwards: From Embeddings Back to Graphs | Unknown | N/A | |
| What does LIME really see in images? | Unknown | N/A | |
| Learning to Rehearse in Long Sequence Memorization | Unknown | N/A | |
| ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision | Unknown | N/A | |
| Improving Breadth-Wise Backpropagation in Graph Neural Networks Helps Learning Long-Range Dependencies. | Unknown | N/A | |
| Exact Optimization of Conformal Predictors via Incremental and Decremental Learning | Unknown | N/A | |
| Skew Orthogonal Convolutions | Unknown | N/A | |
| Self-supervised and Supervised Joint Training for Resource-rich Machine Translation | Unknown | N/A | |
| Mixed Cross Entropy Loss for Neural Machine Translation | Unknown | N/A | |
| Dissecting Supervised Constrastive Learning | Unknown | N/A | |
| Hierarchical VAEs Know What They Don’t Know | Unknown | N/A | |
| Optimal Off-Policy Evaluation from Multiple Logging Policies | Unknown | N/A | |
| Regret Minimization in Stochastic Non-Convex Learning via a Proximal-Gradient Approach | Unknown | N/A | |
| Federated Deep AUC Maximization for Hetergeneous Data with a Constant Communication Complexity | Unknown | N/A | |
| Active Learning of Continuous-time Bayesian Networks through Interventions | Unknown | N/A | |
| Variance Reduction via Primal-Dual Accelerated Dual Averaging for Nonsmooth Convex Finite-Sums | Unknown | N/A | |
| Class2Simi: A Noise Reduction Perspective on Learning with Noisy Labels | Unknown | N/A | |
| iDARTS: Differentiable Architecture Search with Stochastic Implicit Gradients | Unknown | N/A | |
| Multiscale Invertible Generative Networks for High-Dimensional Bayesian Inference | Unknown | N/A | |
| Adaptive Sampling for Best Policy Identification in Markov Decision Processes | Unknown | N/A | |
| Environment Inference for Invariant Learning | Unknown | N/A | |
| Unsupervised Embedding Adaptation via Early-Stage Feature Reconstruction for Few-Shot Classification | Unknown | N/A | |
| Approximate Group Fairness for Clustering | Unknown | N/A | |
| Few-Shot Neural Architecture Search | Unknown | N/A | |
| Poolingformer: Long Document Modeling with Pooling Attention | Unknown | N/A | |
| CURI: A Benchmark for Productive Concept Learning Under Uncertainty | Unknown | N/A | |
| Data augmentation for deep learning based accelerated MRI reconstruction with limited data | Unknown | N/A | |
| Context-Aware Online Collective Inference for Templated Graphical Models | Unknown | N/A | |
| Parallelizing Legendre Memory Unit Training | Unknown | N/A | |
| Tilting the playing field: Dynamical loss functions for machine learning | Unknown | N/A | |
| Skill Discovery for Exploration and Planning using Deep Skill Graphs | Unknown | N/A | |
| Learner-Private Convex Optimization | Unknown | N/A | |
| Network Inference and Influence Maximization from Samples | Unknown | N/A | |
| Temporally Correlated Task Scheduling for Sequence Learning | Unknown | N/A | |
| Temporal Predictive Coding For Model-Based Planning In Latent Space | Unknown | N/A | |
| Smooth $p$-Wasserstein Distance: Structure, Empirical Approximation, and Statistical Applications | Unknown | N/A | |
| A Theory of Label Propagation for Subpopulation Shift | Unknown | N/A | |
| Poisson-Randomised DirBN: Large Mutation is Needed in Dirichlet Belief Networks | Unknown | N/A | |
| Accelerating Feedforward Computation via Parallel Nonlinear Equation Solving | Unknown | N/A | |
| Conservative Objective Models for Effective Offline Model-Based Optimization | Unknown | N/A | |
| Selfish Sparse RNN Training | Unknown | N/A | |
| How to Learn when Data Reacts to Your Model: Performative Gradient Descent | Unknown | N/A | |
| A Collective Learning Framework to Boost GNN Expressiveness for Node Classification | Unknown | N/A | |
| On Limited-Memory Subsampling Strategies for Bandits | Unknown | N/A | |
| Learning Randomly Perturbed Structured Predictors for Direct Loss Minimization | Unknown | N/A | |
| Optimal Thompson Sampling strategies for support-aware CVaR bandits | Unknown | N/A | |
| Neural Architecture Search without Training | Unknown | N/A | |
| Discriminative Complementary-Label Learning with Weighted Loss | Unknown | N/A | |
| Dense for the Price of Sparse: Improved Performance of Sparsely Initialized Networks via a Subspace Offset | Unknown | N/A | |
| On Lower Bounds for Standard and Robust Gaussian Process Bandit Optimization | Unknown | N/A | |
| A Functional Perspective on Learning Symmetric Functions with Neural Networks | Unknown | N/A | |
| Oblivious Sketching-based Central Path Method for Linear Programming | Unknown | N/A | |
| Federated Learning under Arbitrary Communication Patterns | Unknown | N/A | |
| Unsupervised Part Representation by Flow Capsules | Unknown | N/A | |
| SigGPDE: Scaling Sparse Gaussian Processes on Sequential Data | Unknown | N/A | |
| Learning Deep Neural Networks under Agnostic Corrupted Supervision | Unknown | N/A | |
| Directional Bias Amplification | Unknown | N/A | |
| Fair Classification with Noisy Protected Attributes: A Framework with Provable Guarantees | Unknown | N/A | |
| Connecting Sphere Manifolds Hierarchically for Regularization | Unknown | N/A | |
| Dual Principal Component Pursuit for Robust Subspace Learning: Theory and Algorithms for a Holistic Approach | Unknown | N/A | |
| Bayesian Optimization over Hybrid Spaces | Unknown | N/A | |
| Detection of Signal in the Spiked Rectangular Models | Unknown | N/A | |
| Streaming Bayesian Deep Tensor Factorization | Unknown | N/A | |
| Deep Reinforcement Learning amidst Continual Structured Non-Stationarity | Unknown | N/A | |
| Compositional Video Synthesis with Action Graphs | Unknown | N/A | |
| Directed Graph Embeddings in Pseudo-Riemannian Manifolds | Unknown | N/A | |
| Evaluating Robustness of Predictive Uncertainty Estimation: Are Dirichlet-based Models Reliable? | Unknown | N/A | |
| Implicit-PDF: Non-Parametric Representation of Probability Distributions on the Rotation Manifold | Unknown | N/A | |
| Representation Matters: Offline Pretraining for Sequential Decision Making | Unknown | N/A | |
| Robust Representation Learning via Perceptual Similarity Metrics | Unknown | N/A | |
| On the difficulty of unbiased alpha divergence minimization | Unknown | N/A | |
| What Are Bayesian Neural Network Posteriors Really Like? | Unknown | N/A | |
| Neural Transformation Learning for Deep Anomaly Detection Beyond Images | Unknown | N/A | |
| On Energy-Based Models with Overparametrized Shallow Neural Networks | Unknown | N/A | |
| Preferential Temporal Difference Learning | Unknown | N/A | |
| Exploiting Shared Representations for Personalized Federated Learning | Unknown | N/A | |
| Learning Noise Transition Matrix from Only Noisy Labels via Total Variation Regularization | Unknown | N/A | |
| Dynamic Game Theoretic Neural Optimizer | Unknown | N/A | |
| Testing Group Fairness via Optimal Transport Projections | Unknown | N/A | |
| Goal-Conditioned Reinforcement Learning with Imagined Subgoals | Unknown | N/A | |
| LogME: Practical Assessment of Pre-trained Models for Transfer Learning | Unknown | N/A | |
| Tensor Programs IIb: Architectural Universality Of Neural Tangent Kernel Training Dynamics | Unknown | N/A | |
| Offline Reinforcement Learning with Fisher Divergence Critic Regularization | Unknown | N/A | |
| Correlation Clustering in Constant Many Parallel Rounds | Unknown | N/A | |
| RATT: Leveraging Unlabeled Data to Guarantee Generalization | Unknown | N/A | |
| Distributed Second Order Methods with Fast Rates and Compressed Communication | Unknown | N/A | |
| Adapting to misspecification in contextual bandits with offline regression oracles | Unknown | N/A | |
| Decoupling Representation Learning from Reinforcement Learning | Unknown | N/A | |
| Task-Optimal Exploration in Linear Dynamical Systems | Unknown | N/A | |
| Finding Relevant Information via a Discrete Fourier Expansion | Unknown | N/A | |
| ADOM: Accelerated Decentralized Optimization Method for Time-Varying Networks | Unknown | N/A | |
| Dash: Semi-Supervised Learning with Dynamic Thresholding | Unknown | N/A | |
| Decision-Making Under Selective Labels: Optimal Finite-Domain Policies and Beyond | Unknown | N/A | |
| Connecting Interpretability and Robustness in Decision Trees through Separation | Unknown | N/A | |
| Self-Damaging Contrastive Learning | Unknown | N/A | |
| On Variational Inference in Biclustering Models | Unknown | N/A | |
| Nonmyopic Multifidelity Acitve Search | Unknown | N/A | |
| Directional Graph Networks | Unknown | N/A | |
| First-Order Methods for Wasserstein Distributionally Robust MDP | Unknown | N/A | |
| REPAINT: Knowledge Transfer in Deep Reinforcement Learning | Unknown | N/A | |
| Addressing Catastrophic Forgetting in Few-Shot Problems | Unknown | N/A | |
| Non-Negative Bregman Divergence Minimization for Deep Direct Density Ratio Estimation | Unknown | N/A | |
| Towards Open-World Recommendation: An Inductive Model-based Collaborative Filtering Approach | Unknown | N/A | |
| Graph Neural Networks Inspired by Classical Iterative Algorithms | Unknown | N/A | |
| Uniform Convergence, Adversarial Spheres and a Simple Remedy | Unknown | N/A | |
| Generating images with sparse representations | Unknown | N/A | |
| Implicit Regularization in Tensor Factorization | Unknown | N/A | |
| Autoencoder Image Interpolation by Shaping the Latent Space | Unknown | N/A | |
| Which transformer architecture fits my data? A vocabulary bottleneck in self-attention | Unknown | N/A | |
| Single Pass Entrywise-Transformed Low Rank Approximation | Unknown | N/A | |
| SGA: A Robust Algorithm for Partial Recovery of Tree-Structured Graphical Models with Noisy Samples | Unknown | N/A | |
| Accelerating Gossip SGD with Periodic Global Averaging | Unknown | N/A | |
| SpreadsheetCoder: Formula Prediction from Semi-structured Context | Unknown | N/A | |
| Compressed Maximum Likelihood | Unknown | N/A | |
| Modularity in Reinforcement Learning via Algorithmic Independence in Credit Assignment | Unknown | N/A | |
| Policy Analysis using Synthetic Controls in Continuous-Time | Unknown | N/A | |
| Conditional Distributional Treatment Effect with Kernel Conditional Mean Embeddings and U-Statistic Regression | Unknown | N/A | |
| Representation Subspace Distance for Domain Adaptation Regression | Unknown | N/A | |
| Re-understanding Finite-State Representations of Recurrent Policy Networks | Unknown | N/A | |
| Model Distillation for Revenue Optimization: Interpretable Personalized Pricing | Unknown | N/A | |
| Kernel-Based Reinforcement Learning: A Finite-Time Analysis | Unknown | N/A | |
| Grad-TTS: A Diffusion Probabilistic Model for Text-to-Speech | Unknown | N/A | |
| More Powerful and General Selective Inference for Stepwise Feature Selection using Homotopy Method | Unknown | N/A | |
| Low-Rank Sinkhorn Factorization | Unknown | N/A | |
| Monte Carlo Variational Auto-Encoders | Unknown | N/A | |
| Solving Challenging Dexterous Manipulation Tasks With Trajectory Optimisation and Reinforcement Learning | Unknown | N/A | |
| Locally Private k-Means in One Round | Unknown | N/A | |
| Optimization Planning for 3D ConvNets | Unknown | N/A | |
| What Does Rotation Prediction Tell Us about Classifier Accuracy under Varying Testing Environments? | Unknown | N/A | |
| BANG: Bridging Autoregressive and Non-autoregressive Generation with Large Scale Pretraining | Unknown | N/A | |
| Light RUMs | Unknown | N/A | |
| PODS: Policy Optimization via Differentiable Simulation | Unknown | N/A | |
| Recomposing the Reinforcement Learning Building Blocks with Hypernetworks | Unknown | N/A | |
| Just How Toxic is Data Poisoning? A Unified Benchmark for Backdoor and Data Poisoning Attacks | Unknown | N/A | |
| Characterizing Fairness Over the Set of Good Models Under Selective Labels | Unknown | N/A | |
| A large-scale benchmark for few-shot program induction and synthesis | Unknown | N/A | |
| CLOCS: Contrastive Learning of Cardiac Signals Across Space, Time, and Patients | Unknown | N/A | |
| Ensemble Bootstrapping for Q-Learning | Unknown | N/A | |
| Uncovering the Connections Between Adversarial Transferability and Knowledge Transferability | Unknown | N/A | |
| A Scalable Deterministic Global Optimization Algorithm for Clustering Problems | Unknown | N/A | |
| Regularized Online Allocation Problems: Fairness and Beyond | Unknown | N/A | |
| A Proxy Variable View of Shared Confounding | Unknown | N/A | |
| Provable Lipschitz Certification for Generative Models | Unknown | N/A | |
| HardCoRe-NAS: Hard Constrained diffeRentiable Neural Architecture Search | Unknown | N/A | |
| Neural SDEs as Infinite-Dimensional GANs | Unknown | N/A | |
| Pointwise Binary Classification with Pairwise Confidence Comparisons | Unknown | N/A | |
| Global Prosody Style Transfer Without Text Transcriptions | Unknown | N/A | |
| A Lower Bound for the Sample Complexity of Inverse Reinforcement Learning | Unknown | N/A | |
| MetaCURE: Meta Reinforcement Learning with Empowerment-Driven Exploration | Unknown | N/A | |
| Affine Invariant Analysis of Frank-Wolfe on Strongly Convex Sets | Unknown | N/A | |
| SparseBERT: Rethinking the Importance Analysis in Self-attention | Unknown | N/A | |
| A Regret Minimization Approach to Iterative Learning Control | Unknown | N/A | |
| 1-bit Adam: Communication Efficient Large-Scale Training with Adam's Convergence Speed | Unknown | N/A | |
| Differentiable Sorting Networks for Scalable Sorting and Ranking Supervision | Unknown | N/A | |
| Efficient Statistical Tests: A Neural Tangent Kernel Approach | Unknown | N/A | |
| Knowledge Enhanced Machine Learning Pipeline against Diverse Adversarial Attacks | Unknown | N/A | |
| Expressive 1-Lipschitz Neural Networks for Robust Multiple Graph Learning against Adversarial Attacks | Unknown | N/A | |
| ARMS: Antithetic-REINFORCE-Multi-Sample Gradient for Binary Variables | Unknown | N/A | |
| Posterior Value Functions: Hindsight Baselines for Policy Gradient Methods | Unknown | N/A | |
| Few-shot Language Coordination by Modeling Theory of Mind | Unknown | N/A | |
| On the Optimality of Batch Policy Optimization Algorithms | Unknown | N/A | |
| Actionable Models: Unsupervised Offline Reinforcement Learning of Robotic Skills | Unknown | N/A | |
| Faster Kernel Matrix Algebra via Density Estimation | Unknown | N/A | |
| Meta-Thompson Sampling | Unknown | N/A | |
| Simple and Effective VAE Training with Calibrated Decoders | Unknown | N/A | |
| PC-MLP: Model-based Reinforcement Learning with Policy Cover Guided Exploration | Unknown | N/A | |
| DANCE: Enhancing saliency maps using decoys | Unknown | N/A | |
| Learning Generalized Intersection Over Union for Dense Pixelwise Prediction | Unknown | N/A | |
| Bayesian Quadrature on Riemannian Data Manifolds | Unknown | N/A | |
| Towards Understanding Learning in Neural Networks with Linear Teachers | Unknown | N/A | |
| Learning from Nested Data with Ornstein Auto-Encoders | Unknown | N/A | |
| When Does Data Augmentation Help With Membership Inference Attacks? | Unknown | N/A | |
| Debiasing Model Updates for Improving Personalized Federated Training | Unknown | N/A | |
| Fold2Seq: A Joint Sequence(1D)-Fold(3D) Embedding-based Generative Model for Protein Design | Unknown | N/A | |
| Reinforcement Learning Under Moral Uncertainty | Unknown | N/A | |
| Elastic Graph Neural Networks | Unknown | N/A | |
| Integrated Defense for Resilient Graph Matching | Unknown | N/A | |
| Latent Programmer: Discrete Latent Codes for Program Synthesis | Unknown | N/A | |
| Toward Better Generalization Bounds with Locally Elastic Stability | Unknown | N/A | |
| CombOptNet: Fit the Right NP-Hard Problem by Learning Integer Programming Constraints | Unknown | N/A | |
| Learning Task Informed Abstractions | Unknown | N/A | |
| Image-Level or Object-Level? A Tale of Two Resampling Strategies for Long-Tailed Detection | Unknown | N/A | |
| Deep Adaptive Design: Amortizing Sequential Bayesian Experimental Design | Unknown | N/A | |
| A Unified Generative Adversarial Network Training via Self-Labeling and Self-Attention | Unknown | N/A | |
| Learning Queueing Policies for Organ Transplantation Allocation using Interpretable Counterfactual Survival Analysis | Unknown | N/A | |
| Generalization Error Bound for Hyperbolic Ordinal Embedding | Unknown | N/A | |
| Can Subnetwork Structure Be the Key to Out-of-Distribution Generalization? | Unknown | N/A | |
| An Algorithm for Stochastic and Adversarial Bandits with Switching Costs | Unknown | N/A | |
| Scalable Marginal Likelihood Estimation for Model Selection in Deep Learning | Unknown | N/A | |
| Composing Normalizing Flows for Inverse Problems | Unknown | N/A | |
| ActNN: Reducing Training Memory Footprint via 2-Bit Activation Compressed Training | Unknown | N/A | |
| LIME: Learning Inductive Bias for Primitives of Mathematical Reasoning | Unknown | N/A | |
| Fairness of Exposure in Stochastic Bandits | Unknown | N/A | |
| Non-Autoregressive Electron Redistribution Modeling for Reaction Prediction | Unknown | N/A | |
| A Scalable Second Order Method for Ill-Conditioned Matrix Completion from Few Samples | Unknown | N/A | |
| Testing DNN-based Autonomous Driving Systems under Critical Environmental Conditions | Unknown | N/A | |
| Of Moments and Matching: A Game-Theoretic Framework for Closing the Imitation Gap | Unknown | N/A | |
| Householder Sketch for Accurate and Accelerated Least-Mean-Squares Solvers | Unknown | N/A | |
| On Recovering from Modeling Errors Using Testing Bayesian Networks | Unknown | N/A | |
| Learning While Playing in Mean-Field Games: Convergence and Optimality | Unknown | N/A | |
| Data-driven Prediction of General Hamiltonian Dynamics via Learning Exactly-Symplectic Maps | Unknown | N/A | |
| Fast Projection Onto Convex Smooth Constraints | Unknown | N/A | |
| Markpainting: Adversarial Machine Learning meets Inpainting | Unknown | N/A | |
| Online A-Optimal Design and Active Linear Regression | Unknown | N/A | |
| Learning Routines for Effective Off-Policy Reinforcement Learning | Unknown | N/A | |
| Query Complexity of Adversarial Attacks | Unknown | N/A | |
| On Disentangled Representations Learned from Correlated Data | Unknown | N/A | |
| Oops I Took A Gradient: Scalable Sampling for Discrete Distributions | Unknown | N/A | |
| Online Limited Memory Neural-Linear Bandits with Likelihood Matching | Unknown | N/A | |
| Risk-Sensitive Reinforcement Learning with Function Approximation: A Debiasing Approach | Unknown | N/A | |
| Optimizing Black-box Metrics with Iterative Example Weighting | Unknown | N/A | |
| Mandoline: Model Evaluation under Distribution Shift | Unknown | N/A | |
| Provably Correct Optimization and Exploration with Non-linear Policies | Unknown | N/A | |
| Amortized Conditional Normalized Maximum Likelihood: Reliable Out of Distribution Uncertainty Estimation | Unknown | N/A | |
| OmniNet: Omnidirectional Representations from Transformers | Unknown | N/A | |
| Meta Learning for Support Recovery in High-dimensional Precision Matrix Estimation | Unknown | N/A | |
| DG-LMC: A Turn-key and Scalable Synchronous Distributed MCMC Algorithm via Langevin Monte Carlo within Gibbs | Unknown | N/A | |
| Kernel Stein Discrepancy Descent | Unknown | N/A | |
| A Novel Sequential Coreset Method for Gradient Descent Algorithms | Unknown | N/A | |
| Improving Gradient Regularization using Complex-Valued Neural Networks | Unknown | N/A | |
| Data-efficient Hindsight Off-policy Option Learning | Unknown | N/A | |
| Leveraging Good Representations in Linear Contextual Bandits | Unknown | N/A | |
| Adversarial Purification with Score-based Generative Models | Unknown | N/A | |
| Meta-learning Hyperparameter Performance Prediction with Neural Processes | Unknown | N/A | |
| Sequential Domain Adaptation by Synthesizing Distributionally Robust Experts | Unknown | N/A | |
| Towards Better Robust Generalization with Shift Consistency Regularization | Unknown | N/A | |
| Best Model Identification: A Rested Bandit Formulation | Unknown | N/A | |
| Chebyshev Polynomial Codes: Task Entanglement-based Coding for Distributed Matrix Multiplication | Unknown | N/A | |
| Learning Fair Policies in Decentralized Cooperative Multi-Agent Reinforcement Learning | Unknown | N/A | |
| Fast margin maximization via dual acceleration | Unknown | N/A | |
| From Local to Global Norm Emergence: Dissolving Self-reinforcing Substructures with Incremental Social Instruments | Unknown | N/A | |
| Towards Domain-Agnostic Contrastive Learning | Unknown | N/A | |
| Accuracy on the Line: on the Strong Correlation Between Out-of-Distribution and In-Distribution Generalization | Unknown | N/A | |
| Interpretable Stein Goodness-of-fit Tests on Riemannian Manifold | Unknown | N/A | |
| Tensor Programs IV: Feature Learning in Infinite-Width Neural Networks | Unknown | N/A | |
| Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech | Unknown | N/A | |
| Online Submodular Resource Allocation with Applications to Rebalancing Shared Mobility Systems | Unknown | N/A | |
| Unitary Branching Programs: Learnability and Lower Bounds | Unknown | N/A | |
| Cyclically Equivariant Neural Decoders for Cyclic Codes | Unknown | N/A | |
| Optimal Non-Convex Exact Recovery in Stochastic Block Model via Projected Power Method | Unknown | N/A | |
| Generative Adversarial Networks for Markovian Temporal Dynamics: Stochastic Continuous Data Generation | Unknown | N/A | |
| Estimating $\alpha$-Rank from A Few Entries with Low Rank Matrix Completion | Unknown | N/A | |
| Deep Coherent Exploration for Continuous Control | Unknown | N/A | |
| A Differentiable Point Process with Its Application to Spiking Neural Networks | Unknown | N/A | |
| Overcoming Catastrophic Forgetting by Bayesian Generative Regularization | Unknown | N/A | |
| Lenient Regret and Good-Action Identification in Gaussian Process Bandits | Unknown | N/A | |
| Clusterability as an Alternative to Anchor Points When Learning with Noisy Labels | Unknown | N/A | |
| Fixed-Parameter and Approximation Algorithms for PCA with Outliers | Unknown | N/A | |
| Rate-Distortion Analysis of Minimum Excess Risk in Bayesian Learning | Unknown | N/A | |
| FL-NTK: A Neural Tangent Kernel-based Framework for Federated Learning Analysis | Unknown | N/A | |
| Generalised Lipschitz Regularisation Equals Distributional Robustness | Unknown | N/A | |
| Low-Precision Reinforcement Learning: Running Soft Actor-Critic in Half Precision | Unknown | N/A | |
| Targeted Data Acquisition for Evolving Negotiation Agents | Unknown | N/A | |
| BASE Layers: Simplifying Training of Large, Sparse Models | Unknown | N/A | |
| Instabilities of Offline RL with Pre-Trained Neural Representation | Unknown | N/A | |
| RNN with Particle Flow for Probabilistic Spatio-temporal Forecasting | Unknown | N/A | |
| CRFL: Certifiably Robust Federated Learning against Backdoor Attacks | Unknown | N/A | |
| Sharing Less is More: Lifelong Learning in Deep Networks with Selective Layer Transfer | Unknown | N/A | |
| Local Algorithms for Finding Densely Connected Clusters | Unknown | N/A | |
| Near-Optimal Algorithms for Explainable k-Medians and k-Means | Unknown | N/A | |
| Multi-group Agnostic PAC Learnability | Unknown | N/A | |
| Improving Lossless Compression Rates via Monte Carlo Bits-Back Coding | Unknown | N/A | |
| Improved Regret Bound and Experience Replay in Regularized Policy Iteration | Unknown | N/A | |
| Conditional Temporal Neural Processes with Covariance Loss | Unknown | N/A | |
| Structured Convolutional Kernel Networks for Airline Crew Scheduling | Unknown | N/A | |
| Deep Continuous Networks | Unknown | N/A | |
| Efficient Lottery Ticket Finding: Less Data is More | Unknown | N/A | |
| Putting the "Learning" into Learning-Augmented Algorithms for Frequency Estimation | Unknown | N/A | |
| Classification with Rejection Based on Cost-sensitive Classification | Unknown | N/A | |
| Data Augmentation for Meta-Learning | Unknown | N/A | |
| Signatured Deep Fictitious Play for Mean Field Games with Common Noise | Unknown | N/A | |
| AlphaNet: Improved Training of Supernets with Alpha-Divergence | Unknown | N/A | |
| Sparse within Sparse Gaussian Processes using Neighbor Information | Unknown | N/A | |
| Slot Machines: Discovering Winning Combinations of Random Weights in Neural Networks | Unknown | N/A | |
| Online Unrelated Machine Load Balancing with Predictions Revisited | Unknown | N/A | |
| Connecting Optimal Ex-Ante Collusion in Teams to Extensive-Form Correlation: Faster Algorithms and Positive Complexity Results | Unknown | N/A | |
| An Information-Geometric Distance on the Space of Tasks | Unknown | N/A | |
| Memory-Efficient Pipeline-Parallel DNN Training | Unknown | N/A | |
| Near-Optimal Confidence Sequences for Bounded Random Variables | Unknown | N/A | |
| Personalized Federated Learning using Hypernetworks | Unknown | N/A | |
| GLSearch: Maximum Common Subgraph Detection via Learning to Search | Unknown | N/A | |
| PAC-Learning for Strategic Classification | Unknown | N/A | |
| GP-Tree: A Gaussian Process Classifier for Few-Shot Incremental Learning | Unknown | N/A | |
| GraphDF: A Discrete Flow Model for Molecular Graph Generation | Unknown | N/A | |
| Disentangling Sampling and Labeling Bias for Learning in Large-output Spaces | Unknown | N/A | |
| The Power of Adaptivity for Stochastic Submodular Cover | Unknown | N/A | |
| Decomposable Submodular Function Minimization via Maximum Flow | Unknown | N/A | |
| MC-LSTM: Mass-Conserving LSTM | Unknown | N/A | |
| Sliced Iterative Normalizing Flows | Unknown | N/A | |
| Matrix Sketching for Secure Collaborative Machine Learning | Unknown | N/A | |
| Analysis of stochastic Lanczos quadrature for spectrum approximation | Unknown | N/A | |
| Principal Component Hierarchy for Sparse Quadratic Programs | Unknown | N/A | |
| On the Explicit Role of Initialization on the Convergence and Implicit Bias of Overparametrized Linear Networks | Unknown | N/A | |
| DFAC Framework: Factorizing the Value Function via Quantile Mixture for Multi-Agent Distributional Q-Learning | Unknown | N/A | |
| Multi-Receiver Online Bayesian Persuasion | Unknown | N/A | |
| Equivariant Networks for Pixelized Spheres | Unknown | N/A | |
| On-Off Center-Surround Receptive Fields for Accurate and Robust Image Classification | Unknown | N/A | |
| Provable Generalization of SGD-trained Neural Networks of Any Width in the Presence of Adversarial Label Noise | Unknown | N/A | |
| Training Recurrent Neural Networks via Forward Propagation Through Time | Unknown | N/A | |
| Fast Sketching of Polynomial Kernels of Polynomial Degree | Unknown | N/A | |
| Differentiable Spatial Planning using Transformers | Unknown | N/A | |
| Dropout: Explicit Forms and Capacity Control | Unknown | N/A | |
| Strategic Classification in the Dark | Unknown | N/A | |
| Stability and Convergence of Stochastic Gradient Clipping: Beyond Lipschitz Continuity and Smoothness | Unknown | N/A | |
| Autoencoding Under Normalization Constraints | Unknown | N/A | |
| Differentially Private Correlation Clustering | Unknown | N/A | |
| Variational Empowerment as Representation Learning for Goal-Conditioned Reinforcement Learning | Unknown | N/A | |
| On Signal-to-Noise Ratio Issues in Variational Inference for Deep Gaussian Processes | Unknown | N/A | |
| Consistent Nonparametric Methods for Network Assisted Covariate Estimation | Unknown | N/A | |
| How could Neural Networks understand Programs? | Unknown | N/A | |
| Conjugate Energy-Based Models | Unknown | N/A | |
| Neural-Pull: Learning Signed Distance Function from Point clouds by Learning to Pull Space onto Surface | Unknown | N/A | |
| Incentivizing Compliance with Algorithmic Instruments | Unknown | N/A | |
| Pure Exploration and Regret Minimization in Matching Bandits | Unknown | N/A | |
| Offline Reinforcement Learning with Pseudometric Learning | Unknown | N/A | |
| Besov Function Approximation and Binary Classification on Low-Dimensional Manifolds Using Convolutional Residual Networks | Unknown | N/A | |
| Contrastive Learning Inverts the Data Generating Process | Unknown | N/A | |
| Scalable Normalizing Flows for Permutation Invariant Densities | Unknown | N/A | |
| A statistical perspective on distillation | Unknown | N/A | |
| NeRF-VAE: A Geometry Aware 3D Scene Generative Model | Unknown | N/A | |
| Alternative Microfoundations for Strategic Classification | Unknown | N/A | |
| On-Policy Deep Reinforcement Learning for the Average-Reward Criterion | Unknown | N/A | |
| Two Heads are Better Than One: Hypergraph-Enhanced Graph Reasoning for Visual Event Ratiocination | Unknown | N/A | |
| Two-way kernel matrix puncturing: towards resource-efficient PCA and spectral clustering | Unknown | N/A | |
| KD3A: Unsupervised Multi-Source Decentralized Domain Adaptation via Knowledge Distillation | Unknown | N/A | |
| Asymmetric Heavy Tails and Implicit Bias in Gaussian Noise Injections | Unknown | N/A | |
| One Pass Late Fusion Multi-view Clustering | Unknown | N/A | |
| The Heavy-Tail Phenomenon in SGD | Unknown | N/A | |
| Guided Exploration with Proximal Policy Optimization using a Single Demonstration | Unknown | N/A | |
| Monotonic Robust Policy Optimization with Model Discrepancy | Unknown | N/A | |
| An Identifiable Double VAE For Disentangled Representations | Unknown | N/A | |
| A Framework for Private Matrix Analysis in Sliding Window Model | Unknown | N/A | |
| Unsupervised Representation Learning via Neural Activation Coding | Unknown | N/A | |
| Modeling Hierarchical Structures with Continuous Recursive Neural Networks | Unknown | N/A | |
| Decoupling Exploration and Exploitation for Meta-Reinforcement Learning without Sacrifices | Unknown | N/A | |
| WGAN with an Infinitely Wide Generator Has No Spurious Stationary Points | Unknown | N/A | |
| A Sharp Analysis of Model-based Reinforcement Learning with Self-Play | Unknown | N/A | |
| Interpreting and Disentangling Feature Components of Various Complexity from DNNs | Unknown | N/A | |
| Neural Tangent Generalization Attacks | Unknown | N/A | |
| Provable Robustness of Adversarial Training for Learning Halfspaces with Noise | Unknown | N/A | |
| Model-Free and Model-Based Policy Evaluation when Causality is Uncertain | Unknown | N/A | |
| Efficient Training of Robust Decision Trees Against Adversarial Examples | Unknown | N/A | |
| Distribution-Free Calibration Guarantees for Histogram Binning without Sample Splitting | Unknown | N/A | |
| Quantum algorithms for reinforcement learning with a generative model | Unknown | N/A | |
| Do We Actually Need Dense Over-Parameterization? In-Time Over-Parameterization in Sparse Training | Unknown | N/A | |
| Sharper Generalization Bounds for Clustering | Unknown | N/A | |
| An exact solver for the Weston-Watkins SVM subproblem | Unknown | N/A | |
| Relative Deviation Margin Bounds | Unknown | N/A | |
| 12-Lead ECG Reconstruction via Koopman Operators | Unknown | N/A | |
| Cooperative Exploration for Multi-Agent Deep Reinforcement Learning | Unknown | N/A | |
| Unbiased Gradient Estimation in Unrolled Computation Graphs with Persistent Evolution Strategies | Unknown | N/A | |
| MURAL: Meta-Learning Uncertainty-Aware Rewards for Outcome-Driven Reinforcement Learning | Unknown | N/A | |
| EfficientTTS: An Efficient and High-Quality Text-to-Speech Architecture | Unknown | N/A | |
| Massively Parallel and Asynchronous Tsetlin Machine Architecture Supporting Almost Constant-Time Scaling | Unknown | N/A | |
| A Hybrid Variance-Reduced Method for Decentralized Stochastic Non-Convex Optimization | Unknown | N/A | |
| Projection techniques to update the truncated SVD of evolving matrices with applications | Unknown | N/A | |
| Tightening the Dependence on Horizon in the Sample Complexity of Q-Learning | Unknown | N/A | |
| A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning | Unknown | N/A | |
| LEGO: Latent Execution-Guided Reasoning for Multi-Hop Question Answering on Knowledge Graphs | Unknown | N/A | |
| Interpretable Stability Bounds for Spectral Graph Filters | Unknown | N/A | |
| DORO: Distributional and Outlier Robust Optimization | Unknown | N/A | |
| Quantifying Ignorance in Individual-Level Causal-Effect Estimates under Hidden Confounding | Unknown | N/A | |
| Parallel tempering on optimized paths | Unknown | N/A | |
| Elementary superexpressive activations | Unknown | N/A | |
| Safe Reinforcement Learning Using Advantage-Based Intervention | Unknown | N/A | |
| Train simultaneously, generalize better: Stability of gradient-based minimax learners | Unknown | N/A | |
| Fundamental Tradeoffs in Distributionally Adversarial Training | Unknown | N/A | |
| PsiPhi-Learning: Reinforcement Learning with Demonstrations using Successor Features and Inverse Temporal Difference Learning | Unknown | N/A | |
| Large Scale Private Learning via Low-rank Reparametrization | Unknown | N/A | |
| Learning and Planning in Average-Reward Markov Decision Processes | Unknown | N/A | |
| WILDS: A Benchmark of in-the-Wild Distribution Shifts | Unknown | N/A | |
| Optimal Complexity in Decentralized Training | Unknown | N/A | |
| Privacy-Preserving Video Classification with Convolutional Neural Networks | Unknown | N/A | |
| Best Arm Identification in Graphical Bilinear Bandits | Unknown | N/A | |
| Detecting Rewards Deterioration in Episodic Reinforcement Learning | Unknown | N/A | |
| Data-Free Knowledge Distillation for Heterogeneous Federated Learning | Unknown | N/A | |
| HAWQ-V3: Dyadic Neural Network Quantization | Unknown | N/A | |
| Parameterless Transductive Feature Re-representation for Few-Shot Learning | Unknown | N/A | |
| Momentum Residual Neural Networks | Unknown | N/A | |
| Understanding Invariance via Feedforward Inversion of Discriminatively Trained Classifiers | Unknown | N/A | |
| Evolving Attention with Residual Convolutions | Unknown | N/A | |
| Graph Cuts Always Find a Global Optimum for Potts Models (With a Catch) | Unknown | N/A | |
| Exponentially Many Local Minima in Quantum Neural Networks | Unknown | N/A | |
| Instance-Optimal Compressed Sensing via Posterior Sampling | Unknown | N/A | |
| Integer Programming for Causal Structure Learning in the Presence of Latent Variables | Unknown | N/A | |
| On Estimation in Latent Variable Models | Unknown | N/A | |
| Representation Matters: Assessing the Importance of Subgroup Allocations in Training Data | Unknown | N/A | |
| Learning Online Algorithms with Distributional Advice | Unknown | N/A | |
| World Model as a Graph: Learning Latent Landmarks for Planning | Unknown | N/A | |
| Tight Bounds on the Smallest Eigenvalue of the Neural Tangent Kernel for Deep ReLU Networks | Unknown | N/A | |
| EfficientNetV2: Smaller Models and Faster Training | Unknown | N/A | |
| Flow-based Attribution in Graphical Models: A Recursive Shapley Approach | Unknown | N/A | |
| Enhancing Robustness of Neural Networks through Fourier Stabilization | Unknown | N/A | |
| A Distribution-dependent Analysis of Meta Learning | Unknown | N/A | |
| Non-Exponentially Weighted Aggregation: Regret Bounds for Unbounded Loss Functions | Unknown | N/A | |
| Generalization Bounds in the Presence of Outliers: a Median-of-Means Study | Unknown | N/A | |
| Deep Latent Graph Matching | Unknown | N/A | |
| Differentiable Dynamic Quantization with Mixed Precision and Adaptive Resolution | Unknown | N/A | |
| BASGD: Buffered Asynchronous SGD for Byzantine Learning | Unknown | N/A | |
| Near-Optimal Linear Regression under Distribution Shift | Unknown | N/A | |
| Explaining Time Series Predictions with Dynamic Masks | Unknown | N/A | |
| Exploiting structured data for learning contagious diseases under incomplete testing | Unknown | N/A | |
| Learning de-identified representations of prosody from raw audio | Unknown | N/A | |
| Value-at-Risk Optimization with Gaussian Processes | Unknown | N/A | |
| How rotational invariance of common kernels prevents generalization in high dimensions | Unknown | N/A | |
| Locally Adaptive Label Smoothing Improves Predictive Churn | Unknown | N/A | |
| RNNRepair: Automatic RNN Repair via Model-based Analysis | Unknown | N/A | |
| Multi-layered Network Exploration via Random Walks: From Offline Optimization to Online Learning | Unknown | N/A | |
| Training Data Subset Selection for Regression with Controlled Generalization Error | Unknown | N/A | |
| Beyond $log^2(T)$ regret for decentralized bandits in matching markets | Unknown | N/A | |
| Few-Shot Conformal Prediction with Auxiliary Tasks | Unknown | N/A | |
| Quantization Algorithms for Random Fourier Features | Unknown | N/A | |
| Graph Convolution for Semi-Supervised Classification: Improved Linear Separability and Out-of-Distribution Generalization | Unknown | N/A | |
| PID Accelerated Value Iteration Algorithm | Unknown | N/A | |
| Active Learning for Distributionally Robust Level-Set Estimation | Unknown | N/A | |
| Byzantine-Resilient High-Dimensional SGD with Local Iterations on Heterogeneous Data | Unknown | N/A | |
| Offline Meta-Reinforcement Learning with Advantage Weighting | Unknown | N/A | |
| Whitening and Second Order Optimization Both Make Information in the Dataset Unusable During Training, and Can Reduce or Prevent Generalization | Unknown | N/A | |
| An End-to-End Framework for Molecular Conformation Generation via Bilevel Programming | Unknown | N/A | |
| Tesseract: Tensorised Actors for Multi-Agent Reinforcement Learning | Unknown | N/A | |
| Improved Corruption Robust Algorithms for Episodic Reinforcement Learning | Unknown | N/A | |
| Don’t Just Blame Over-parametrization for Over-confidence: Theoretical Analysis of Calibration in Binary Classification | Unknown | N/A | |
| Matrix Completion with Model-free Weighting | Unknown | N/A | |
| Doubly Robust Off-Policy Actor-Critic: Convergence and Optimality | Unknown | N/A | |
| Factor-analytic inverse regression for high-dimension, small-sample dimensionality reduction | Unknown | N/A | |
| SAINT-ACC: Safety-Aware Intelligent Adaptive Cruise Control for Autonomous Vehicles Using Deep Reinforcement Learning | Unknown | N/A | |
| Quantifying Availability and Discovery in Recommender Systems via Stochastic Reachability | Unknown | N/A | |
| Exponential Reduction in Sample Complexity with Learning of Ising Model Dynamics | Unknown | N/A | |
| Quantifying the Benefit of Using Differentiable Learning over Tangent Kernels | Unknown | N/A | |
| The Lipschitz Constant of Self-Attention | Unknown | N/A | |
| SGLB: Stochastic Gradient Langevin Boosting | Unknown | N/A | |
| Training data-efficient image transformers & distillation through attention | Unknown | N/A | |
| Object Segmentation Without Labels with Large-Scale Generative Models | Unknown | N/A | |
| Regularized Submodular Maximization at Scale | Unknown | N/A | |
| Average-Reward Off-Policy Policy Evaluation with Function Approximation | Unknown | N/A | |
| AutoSampling: Search for Effective Data Sampling Schedules | Unknown | N/A | |
| Quantifying and Reducing Bias in Maximum Likelihood Estimation of Structured Anomalies | Unknown | N/A | |
| Lower-Bounded Proper Losses for Weakly Supervised Classification | Unknown | N/A | |
| Differentially Private Densest Subgraph Detection | Unknown | N/A | |
| Domain Generalization using Causal Matching | Unknown | N/A | |
| Scaling Multi-Agent Reinforcement Learning with Selective Parameter Sharing | Unknown | N/A | |
| Theory of Spectral Method for Union of Subspaces-Based Random Geometry Graph | Unknown | N/A | |
| KO codes: inventing nonlinear encoding and decoding for reliable wireless communication via deep-learning | Unknown | N/A | |
| Probabilistic Programs with Stochastic Conditioning | Unknown | N/A | |
| Inverse Decision Modeling: Learning Interpretable Representations of Behavior | Unknown | N/A | |
| How Do Adam and Training Strategies Help BNNs Optimization | Unknown | N/A | |
| Coded-InvNet for Resilient Prediction Serving Systems | Unknown | N/A | |
| On the Proof of Global Convergence of Gradient Descent for Deep ReLU Networks with Linear Widths | Unknown | N/A | |
| Asymmetric Loss Functions for Learning with Noisy Labels | Unknown | N/A | |
| Model-Free Reinforcement Learning: from Clipped Pseudo-Regret to Sample Complexity | Unknown | N/A | |
| Toward Understanding the Feature Learning Process of Self-supervised Contrastive Learning | Unknown | N/A | |
| Breaking the Limits of Message Passing Graph Neural Networks | Unknown | N/A | |
| Policy Information Capacity: Information-Theoretic Measure for Task Complexity in Deep Reinforcement Learning | Unknown | N/A | |
| Training Adversarially Robust Sparse Networks via Bayesian Connectivity Sampling | Unknown | N/A | |
| Continuous-time Model-based Reinforcement Learning | Unknown | N/A | |
| UCB Momentum Q-learning: Correcting the bias without forgetting | Unknown | N/A | |
| Differentially-Private Clustering of Easy Instances | Unknown | N/A | |
| Online Graph Dictionary Learning | Unknown | N/A | |
| Off-Belief Learning | Unknown | N/A | |
| Approximation Theory Based Methods for RKHS Bandits | Unknown | N/A | |
| Fast active learning for pure exploration in reinforcement learning | Unknown | N/A | |
| Think Global and Act Local: Bayesian Optimisation over High-Dimensional Categorical and Mixed Search Spaces | Unknown | N/A | |
| K-shot NAS: Learnable Weight-Sharing for NAS with K-shot Supernets | Unknown | N/A | |
| Unsupervised Skill Discovery with Bottleneck Option Learning | Unknown | N/A | |
| One-sided Frank-Wolfe algorithms for saddle problems | Unknown | N/A | |
| Meta-Learning Bidirectional Update Rules | Unknown | N/A | |
| Differentially Private Aggregation in the Shuffle Model: Almost Central Accuracy in Almost a Single Message | Unknown | N/A | |
| Concentric mixtures of Mallows models for top-$k$ rankings: sampling and identifiability | Unknown | N/A | |
| Homomorphic Sensing: Sparsity and Noise | Unknown | N/A | |
| Efficient Generative Modelling of Protein Structure Fragments using a Deep Markov Model | Unknown | N/A | |
| Bias-Robust Bayesian Optimization via Dueling Bandits | Unknown | N/A | |
| Distributed Nyström Kernel Learning with Communications | Unknown | N/A | |
| Equivariant message passing for the prediction of tensorial properties and molecular spectra | Unknown | N/A | |
| PEBBLE: Feedback-Efficient Interactive Reinforcement Learning via Relabeling Experience and Unsupervised Pre-training | Unknown | N/A | |
| SagaNet: A Small Sample Gated Network for Pediatric Cancer Diagnosis | Unknown | N/A | |
| GBHT: Gradient Boosting Histogram Transform for Density Estimation | Unknown | N/A | |
| Imitation by Predicting Observations | Unknown | N/A | |
| Optimal Streaming Algorithms for Multi-Armed Bandits | Unknown | N/A | |
| On the Random Conjugate Kernel and Neural Tangent Kernel | Unknown | N/A | |
| Approximating a Distribution Using Weight Queries | Unknown | N/A | |
| PAGE: A Simple and Optimal Probabilistic Gradient Estimator for Nonconvex Optimization | Unknown | N/A | |
| Revisiting Peng's Q($\lambda$) for Modern Reinforcement Learning | Unknown | N/A | |
| DouZero: Mastering DouDizhu with Self-Play Deep Reinforcement Learning | Unknown | N/A | |
| Learning Optimal Auctions with Correlated Valuations from Samples | Unknown | N/A | |
| Soft then Hard: Rethinking the Quantization in Neural Image Compression | Unknown | N/A | |
| Systematic Analysis of Cluster Similarity Indices: How to Validate Validation Measures | Unknown | N/A | |
| Grid-Functioned Neural Networks | Unknown | N/A | |
| SMG: A Shuffling Gradient-Based Method with Momentum | Unknown | N/A | |
| When All We Need is a Piece of the Pie: A Generic Framework for Optimizing Two-way Partial AUC | Unknown | N/A | |
| PAPRIKA: Private Online False Discovery Rate Control | Unknown | N/A | |
| Near-Optimal Model-Free Reinforcement Learning in Non-Stationary Episodic MDPs | Unknown | N/A | |
| Generative Causal Explanations for Graph Neural Networks | Unknown | N/A | |
| DAGs with No Curl: An Efficient DAG Structure Learning Approach | Unknown | N/A | |
| Improved Confidence Bounds for the Linear Logistic Model and Applications to Bandits | Unknown | N/A | |
| Learning Curves for Analysis of Deep Networks | Unknown | N/A | |
| SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning | Unknown | N/A | |
| Boosting the Throughput and Accelerator Utilization of Specialized CNN Inference Beyond Increasing Batch Size | Unknown | N/A | |
| Information Obfuscation of Graph Neural Networks | Unknown | N/A | |
| Beyond the Pareto Efficient Frontier: Constraint Active Search for Multiobjective Experimental Design | Unknown | N/A | |
| LieTransformer: Equivariant Self-Attention for Lie Groups | Unknown | N/A | |
| Decentralized Riemannian Gradient Descent on the Stiefel Manifold | Unknown | N/A | |
| AutoAttend: Automated Attention Representation Search | Unknown | N/A | |
| Robust Asymmetric Learning in POMDPs | Unknown | N/A | |
| High-Dimensional Gaussian Process Inference with Derivatives | Unknown | N/A | |
| Causal Curiosity: RL Agents Discovering Self-supervised Experiments for Causal Representation Learning | Unknown | N/A | |
| Multiplying Matrices Without Multiplying | Unknown | N/A | |
| Asymptotics of Ridge Regression in Convolutional Models | Unknown | N/A | |
| Stochastic Sign Descent Methods: New Algorithms and Better Theory | Unknown | N/A | |
| Learning Binary Decision Trees by Argmin Differentiation | Unknown | N/A | |
| Analyzing the tree-layer structure of Deep Forests | Unknown | N/A | |
| Achieving Near Instance-Optimality and Minimax-Optimality in Stochastic and Adversarial Linear Bandits Simultaneously | Unknown | N/A | |
| "Hey, that's not an ODE": Faster ODE Adjoints via Seminorms | Unknown | N/A | |
| Rethinking Rotated Object Detection with Gaussian Wasserstein Distance Loss | Unknown | N/A | |
| High-Performance Large-Scale Image Recognition Without Normalization | Unknown | N/A | |
| Federated Continual Learning with Weighted Inter-client Transfer | Unknown | N/A | |
| Privacy-Preserving Feature Selection with Secure Multiparty Computation | Unknown | N/A | |
| Learning Bounds for Open-Set Learning | Unknown | N/A | |
| Nonparametric Hamiltonian Monte Carlo | Unknown | N/A | |
| Parameter-free Locally Accelerated Conditional Gradients | Unknown | N/A | |
| SG-PALM: a Fast Physically Interpretable Tensor Graphical Model | Unknown | N/A | |
| Learning Intra-Batch Connections for Deep Metric Learning | Unknown | N/A | |
| Graph Mixture Density Networks | Unknown | N/A | |
| Locally Persistent Exploration in Continuous Control Tasks with Sparse Rewards | Unknown | N/A | |
| Unsupervised Learning of Visual 3D Keypoints for Control | Unknown | N/A | |
| A Representation Learning Perspective on the Importance of Train-Validation Splitting in Meta-Learning | Unknown | N/A | |
| Versatile Verification of Tree Ensembles | Unknown | N/A | |
| Bayesian Optimistic Optimisation with Exponentially Decaying Regret | Unknown | N/A | |
| Guarantees for Tuning the Step Size using a Learning-to-Learn Approach | Unknown | N/A | |
| Robust Learning-Augmented Caching: An Experimental Study | Unknown | N/A | |
| Phase Transitions, Distance Functions, and Implicit Neural Representations | Unknown | N/A | |
| Differentiable Particle Filtering via Entropy-Regularized Optimal Transport | Unknown | N/A | |
| Differentially Private Sliced Wasserstein Distance | Unknown | N/A | |
| Global Convergence of Policy Gradient for Linear-Quadratic Mean-Field Control/Game in Continuous Time | Unknown | N/A | |
| Convex Regularization in Monte-Carlo Tree Search | Unknown | N/A | |
| OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimation | Unknown | N/A | |
| Towards the Unification and Robustness of Perturbation and Gradient Based Explanations | Unknown | N/A | |
| SimAM: A Simple, Parameter-Free Attention Module for Convolutional Neural Networks | Unknown | N/A | |
| Self-Tuning for Data-Efficient Deep Learning | Unknown | N/A | |
| Supervised Tree-Wasserstein Distance | Unknown | N/A | |
| Robust Learning for Data Poisoning Attacks | Unknown | N/A | |
| A Free Lunch From ANN: Towards Efficient, Accurate Spiking Neural Networks Calibration | Unknown | N/A | |
| Practical and Private (Deep) Learning Without Sampling or Shuffling | Unknown | N/A | |
| A New Formalism, Method and Open Issues for Zero-Shot Coordination | Unknown | N/A | |
| Shortest-Path Constrained Reinforcement Learning for Sparse Reward Tasks | Unknown | N/A | |
| Positive-Negative Momentum: Manipulating Stochastic Gradient Noise to Improve Generalization | Unknown | N/A | |
| UnICORNN: A recurrent model for learning very long time dependencies | Unknown | N/A | |
| Evaluating the Implicit Midpoint Integrator for Riemannian Hamiltonian Monte Carlo | Unknown | N/A | |
| Bridging Multi-Task Learning and Meta-Learning: Towards Efficient Training and Effective Adaptation | Unknown | N/A | |
| Leveraging Language to Learn Program Abstractions and Search Heuristics | Unknown | N/A | |
| Descending through a Crowded Valley - Benchmarking Deep Learning Optimizers | Unknown | N/A | |
| Making transport more robust and interpretable by moving data through a small number of anchor points | Unknown | N/A | |
| Geometry of the Loss Landscape in Overparameterized Neural Networks: Symmetries and Invariances | Unknown | N/A | |
| SKIing on Simplices: Kernel Interpolation on the Permutohedral Lattice for Scalable Gaussian Processes | Unknown | N/A | |
| Opening the Blackbox: Accelerating Neural Differential Equations by Regularizing Internal Solver Heuristics | Unknown | N/A | |
| Online Optimization in Games via Control Theory: Connecting Regret, Passivity and Poincaré Recurrence | Unknown | N/A | |
| Strategic Classification Made Practical | Unknown | N/A | |
| Learning to Weight Imperfect Demonstrations | Unknown | N/A | |
| Narrow Margins: Classification, Margins and Fat Tails | Unknown | N/A | |
| Symmetric Spaces for Graph Embeddings: A Finsler-Riemannian Approach | Unknown | N/A | |
| Grey-box Extraction of Natural Language Models | Unknown | N/A | |
| Multi-Agent Training beyond Zero-Sum with Correlated Equilibrium Meta-Solvers | Unknown | N/A | |
| Deep Generative Learning via Schrödinger Bridge | Unknown | N/A | |
| Multiplicative Noise and Heavy Tails in Stochastic Optimization | Unknown | N/A | |
| Not All Memories are Created Equal: Learning to Forget by Expiring | Unknown | N/A | |
| Large-Margin Contrastive Learning with Distance Polarization Regularizer | Unknown | N/A | |
| Variational (Gradient) Estimate of the Score Function in Energy-based Latent Variable Models | Unknown | N/A | |
| Reward Identification in Inverse Reinforcement Learning | Unknown | N/A | |
| Global inducing point variational posteriors for Bayesian neural networks and deep Gaussian processes | Unknown | N/A | |
| PixelTransformer: Sample Conditioned Signal Generation | Unknown | N/A | |
| GNNAutoScale: Scalable and Expressive Graph Neural Networks via Historical Embeddings | Unknown | N/A | |
| Dichotomous Optimistic Search to Quantify Human Perception | Unknown | N/A | |
| Adversarial Option-Aware Hierarchical Imitation Learning | Unknown | N/A | |
| BasisDeVAE: Interpretable Simultaneous Dimensionality Reduction and Feature-Level Clustering with Derivative-Based Variational Autoencoders | Unknown | N/A | |
| Randomized Exploration in Reinforcement Learning with General Value Function Approximation | Unknown | N/A | |
| Adversarial Combinatorial Bandits with General Non-linear Reward Functions | Unknown | N/A | |
| APS: Active Pretraining with Successor Features | Unknown | N/A | |
| Zero-Shot Knowledge Distillation from a Decision-Based Black-Box Model | Unknown | N/A | |
| Functional Space Analysis of Local GAN Convergence | Unknown | N/A | |
| Deciding What to Learn: A Rate-Distortion Approach | Unknown | N/A | |
| Neural Rough Differential Equations for Long Time Series | Unknown | N/A | |
| Cross-domain Imitation from Observations | Unknown | N/A | |
| Adversarial Multi Class Learning under Weak Supervision with Performance Guarantees | Unknown | N/A | |
| Backdoor Scanning for Deep Neural Networks through K-Arm Optimization | Unknown | N/A | |
| Winograd Algorithm for AdderNet | Unknown | N/A | |
| Global Optimality Beyond Two Layers: Training Deep ReLU Networks via Convex Programs | Unknown | N/A | |
| Provably Strict Generalisation Benefit for Equivariant Models | Unknown | N/A | |
| Continuous Coordination As a Realistic Scenario for Lifelong Learning | Unknown | N/A | |
| EMaQ: Expected-Max Q-Learning Operator for Simple Yet Effective Offline and Online RL | Unknown | N/A | |
| Trees with Attention for Set Prediction Tasks | Unknown | N/A | |
| Infinite-Dimensional Optimization for Zero-Sum Games via Variational Transport | Unknown | N/A | |
| Quasi-global Momentum: Accelerating Decentralized Deep Learning on Heterogeneous Data | Unknown | N/A | |
| Barlow Twins: Self-Supervised Learning via Redundancy Reduction | Unknown | N/A | |
| Differentially Private Query Release Through Adaptive Projection | Unknown | N/A | |
| Oblivious Sketching for Logistic Regression | Unknown | N/A | |
| Scalable Variational Gaussian Processes via Harmonic Kernel Decomposition | Unknown | N/A | |
| Learning from Similarity-Confidence Data | Unknown | N/A | |
| Label-Only Membership Inference Attacks | Unknown | N/A | |
| Self-Improved Retrosynthetic Planning | Unknown | N/A | |
| A Second look at Exponential and Cosine Step Sizes: Simplicity, Adaptivity, and Performance | Unknown | N/A | |
| Representational aspects of depth and conditioning in normalizing flows | Unknown | N/A | |
| Maximum Mean Discrepancy Test is Aware of Adversarial Attacks | Unknown | N/A | |
| Probabilistic Generating Circuits | Unknown | N/A | |
| KNAS: Green Neural Architecture Search | Unknown | N/A | |
| Sparse and Imperceptible Adversarial Attack via a Homotopy Algorithm | Unknown | N/A | |
| You Only Sample (Almost) Once: Linear Cost Self-Attention Via Bernoulli Sampling | Unknown | N/A | |
| An Integer Linear Programming Framework for Mining Constraints from Data | Unknown | N/A | |
| Sparsity-Agnostic Lasso Bandit | Unknown | N/A | |
| Optimal Transport Kernels for Sequential and Parallel Neural Architecture Search | Unknown | N/A | |
| Federated Composite Optimization | Unknown | N/A | |
| Randomized Algorithms for Submodular Function Maximization with a $k$-System Constraint | Unknown | N/A | |
| Linear Transformers Are Secretly Fast Weight Programmers | Unknown | N/A | |
| High-dimensional Experimental Design and Kernel Bandits | Unknown | N/A | |
| LAMDA: Label Matching Deep Domain Adaptation | Unknown | N/A | |
| Diffusion Source Identification on Networks with Statistical Confidence | Unknown | N/A | |
| Break-It-Fix-It: Unsupervised Learning for Program Repair | Unknown | N/A | |
| Variational Data Assimilation with a Learned Inverse Observation Operator | Unknown | N/A | |
| A Value-Function-based Interior-point Method for Non-convex Bi-level Optimization | Unknown | N/A | |
| Improved, Deterministic Smoothing for L_1 Certified Robustness | Unknown | N/A | |
| The Power of Log-Sum-Exp: Sequential Density Ratio Matrix Estimation for Speed-Accuracy Optimization | Unknown | N/A | |
| Joining datasets via data augmentation in the label space for neural networks | Unknown | N/A | |
| Explore Visual Concept Formation for Image Classification | Unknown | N/A | |
| Autoregressive Denoising Diffusion Models for Multivariate Probabilistic Time Series Forecasting | Unknown | N/A | |
| Scalable Certified Segmentation via Randomized Smoothing | Unknown | N/A | |
| Prioritized Level Replay | Unknown | N/A | |
| Efficient Differentiable Simulation of Articulated Bodies | Unknown | N/A | |
| The Logical Options Framework | Unknown | N/A | |
| Self-Paced Context Evaluation for Contextual Reinforcement Learning | Unknown | N/A | |
| Decomposed Mutual Information Estimation for Contrastive Representation Learning | Unknown | N/A | |
| Crowdsourcing via Annotator Co-occurrence Imputation and Provable Symmetric Nonnegative Matrix Factorization | Unknown | N/A | |
| Objective Bound Conditional Gaussian Process for Bayesian Optimization | Unknown | N/A | |
| The Distributed Discrete Gaussian Mechanism for Federated Learning with Secure Aggregation | Unknown | N/A | |
| Taylor Expansion of Discount Factors | Unknown | N/A | |
| Zeroth-Order Non-Convex Learning via Hierarchical Dual Averaging | Unknown | N/A | |
| Let's Agree to Degree: Comparing Graph Convolutional Networks in the Message-Passing Framework | Unknown | N/A | |
| Counterfactual Credit Assignment in Model-Free Reinforcement Learning | Unknown | N/A | |
| Temporal Difference Learning as Gradient Splitting | Unknown | N/A | |
| Agnostic Learning of Halfspaces with Gradient Descent via Soft Margins | Unknown | N/A | |
| Spectral vertex sparsifiers and pair-wise spanners over distributed graphs | Unknown | N/A | |
| Multi-Task Reinforcement Learning with Context-based Representations | Unknown | N/A | |
| Muesli: Combining Improvements in Policy Optimization | Unknown | N/A | |
| Feature Clustering for Support Identification in Extreme Regions | Unknown | N/A | |
| A Gradient Based Strategy for Hamiltonian Monte Carlo Hyperparameter Optimization | Unknown | N/A | |
| Newton Method over Networks is Fast up to the Statistical Precision | Unknown | N/A | |
| From Local Structures to Size Generalization in Graph Neural Networks | Unknown | N/A | |
| Estimating Identifiable Causal Effects on Markov Equivalence Class through Double Machine Learning | Unknown | N/A | |
| Towards Open Ad Hoc Teamwork Using Graph-based Policy Learning | Unknown | N/A | |
| HoroPCA: Hyperbolic Dimensionality Reduction via Horospherical Projections | Unknown | N/A | |
| A Novel Method to Solve Neural Knapsack Problems | Unknown | N/A | |
| Self Normalizing Flows | Unknown | N/A | |
| Sparsifying Networks via Subdifferential Inclusion | Unknown | N/A | |
| Label Distribution Learning Machine | Unknown | N/A | |
| Stabilizing Equilibrium Models by Jacobian Regularization | Unknown | N/A | |
| Finding the Stochastic Shortest Path with Low Regret: the Adversarial Cost and Unknown Transition Case | Unknown | N/A | |
| Gaussian Process-Based Real-Time Learning for Safety Critical Applications | Unknown | N/A | |
| Making Paper Reviewing Robust to Bid Manipulation Attacks | Unknown | N/A | |
| Watermarking Deep Neural Networks with Greedy Residuals | Unknown | N/A | |
| Cumulants of Hawkes Processes are Robust to Observation Noise | Unknown | N/A | |
| Composed Fine-Tuning: Freezing Pre-Trained Denoising Autoencoders for Improved Generalization | Unknown | N/A | |
| Submodular Maximization subject to a Knapsack Constraint: Combinatorial Algorithms with Near-optimal Adaptive Complexity | Unknown | N/A | |
| Ditto: Fair and Robust Federated Learning Through Personalization | Unknown | N/A | |
| Run-Sort-ReRun: Escaping Batch Size Limitations in Sliced Wasserstein Generative Models | Unknown | N/A | |
| Multi-Dimensional Classification via Sparse Label Encoding | Unknown | N/A | |
| Clustered Sampling: Low-Variance and Improved Representativity for Clients Selection in Federated Learning | Unknown | N/A | |
| Annealed Flow Transport Monte Carlo | Unknown | N/A | |
| Active Slices for Sliced Stein Discrepancy | Unknown | N/A | |
| Partially Observed Exchangeable Modeling | Unknown | N/A | |
| Finding k in Latent $k$-polytope | Unknown | N/A | |
| Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning | Unknown | N/A | |
| Prediction-Centric Learning of Independent Cascade Dynamics from Partial Observations | Unknown | N/A | |
| UniSpeech: Unified Speech Representation Learning with Labeled and Unlabeled Data | Unknown | N/A | |
| Deeply-Debiased Off-Policy Interval Estimation | Unknown | N/A | |
| On Monotonic Linear Interpolation of Neural Network Parameters | Unknown | N/A | |
| Reinforcement Learning with Prototypical Representations | Unknown | N/A | |
| Rissanen Data Analysis: Examining Dataset Characteristics via Description Length | Unknown | N/A | |
| GeomCA: Geometric Evaluation of Data Representations | Unknown | N/A | |
| HEMET: A Homomorphic-Encryption-Friendly Privacy-Preserving Mobile Neural Network Architecture | Unknown | N/A | |
| Graph Contrastive Learning Automated | Unknown | N/A | |
| CIFS: Improving Adversarial Robustness of CNNs via Channel-wise Importance-based Feature Selection | Unknown | N/A | |
| Understanding the Dynamics of Gradient Flow in Overparameterized Linear models | Unknown | N/A | |
| On Robust Mean Estimation under Coordinate-level Corruption | Unknown | N/A | |
| Improving Molecular Graph Neural Network Explainability with Orthonormalization and Induced Sparsity | Unknown | N/A | |
| T-SCI: A Two-Stage Conformal Inference Algorithm with Guaranteed Coverage for Cox-MLP | Unknown | N/A | |
| Perceiver: General Perception with Iterative Attention | Unknown | N/A | |
| Delving into Deep Imbalanced Regression | Unknown | N/A | |
| Scaling Properties of Deep Residual Networks | Unknown | N/A | |
| On Perceptual Lossy Compression: The Cost of Perceptual Reconstruction and An Optimal Training Framework | Unknown | N/A | |
| Distributionally Robust Optimization with Markovian Data | Unknown | N/A | |
| Understanding Instance-Level Label Noise: Disparate Impacts and Treatments | Unknown | N/A | |
| Intermediate Layer Optimization for Inverse Problems using Deep Generative Models | Unknown | N/A | |
| On Proximal Policy Optimization's Heavy-tailed Gradients | Unknown | N/A | |
| Revisiting Point Cloud Shape Classification with a Simple and Effective Baseline | Unknown | N/A | |
| Model Fusion for Personalized Learning | Unknown | N/A | |
| Density Constrained Reinforcement Learning | Unknown | N/A | |
| Towards Better Laplacian Representation in Reinforcement Learning with Generalized Graph Drawing | Unknown | N/A | |
| Z-GCNETs: Time Zigzags at Graph Convolutional Networks for Time Series Forecasting | Unknown | N/A | |
| Near Optimal Reward-Free Reinforcement Learning | Unknown | N/A | |
| Backpropagated Neighborhood Aggregation for Accurate Training of Spiking Neural Networks | Unknown | N/A | |
| Learning Gradient Fields for Molecular Conformation Generation | Unknown | N/A | |
| Off-Policy Confidence Sequences | Unknown | N/A | |
| Almost Optimal Anytime Algorithm for Batched Multi-Armed Bandits | Unknown | N/A | |
| Finite mixture models do not reliably learn the number of components | Unknown | N/A | |
| Accelerating Safe Reinforcement Learning with Constraint-mismatched Baseline Policies | Unknown | N/A | |
| Learning disentangled representations via product manifold projection | Unknown | N/A | |
| Noise and Fluctuation of Finite Learning Rate Stochastic Gradient Descent | Unknown | N/A | |
| Adapting to Delays and Data in Adversarial Multi-Armed Bandits | Unknown | N/A | |
| ACE: Explaining cluster from an adversarial perspective | Unknown | N/A | |
| Pareto GAN: Extending the Representational Power of GANs to Heavy-Tailed Distributions | Unknown | N/A | |
| Confidence-Budget Matching for Sequential Budgeted Learning | Unknown | N/A | |
| Interactive Learning from Activity Description | Unknown | N/A | |
| Stochastic Multi-Armed Bandits with Unrestricted Delay Distributions | Unknown | N/A | |
| Segmenting Hybrid Trajectories using Latent ODEs | Unknown | N/A | |
| Model-based Reinforcement Learning for Continuous Control with Posterior Sampling | Unknown | N/A | |
| Robust Density Estimation from Batches: The Best Things in Life are (Nearly) Free | Unknown | N/A | |
| Stochastic Iterative Graph Matching | Unknown | N/A | |
| Lipschitz normalization for self-attention layers with application to graph neural networks | Unknown | N/A | |
| Inverse Constrained Reinforcement Learning | Unknown | N/A | |
| Keyframe-Focused Visual Imitation Learning | Unknown | N/A | |
| Learning in Nonzero-Sum Stochastic Games with Potentials | Unknown | N/A | |
| The Hintons in your Neural Network: a Quantum Field Theory View of Deep Learning | Unknown | N/A | |
| Function Contrastive Learning of Transferable Meta-Representations | Unknown | N/A | |
| A New Representation of Successor Features for Transfer across Dissimilar Environments | Unknown | N/A | |
| Generalized Doubly Reparameterized Gradient Estimators | Unknown | N/A | |
| A Nullspace Property for Subspace-Preserving Recovery | Unknown | N/A | |
| Self-supervised Graph-level Representation Learning with Local and Global Structure | Unknown | N/A | |
| Probabilistic Sequential Shrinking: A Best Arm Identification Algorithm for Stochastic Bandits with Corruptions | Unknown | N/A | |
| Efficient Message Passing for 0–1 ILPs with Binary Decision Diagrams | Unknown | N/A | |
| Towards Rigorous Interpretations: a Formalisation of Feature Attribution | Unknown | N/A | |
| Dynamic Planning and Learning under Recovering Rewards | Unknown | N/A | |
| Marginal Contribution Feature Importance - an Axiomatic Approach for Explaining Data | Unknown | N/A | |
| Geometric convergence of elliptical slice sampling | Unknown | N/A | |
| GraphNorm: A Principled Approach to Accelerating Graph Neural Network Training | Unknown | N/A | |
| Bayesian Algorithm Execution: Estimating Computable Properties of Black-box Functions Using Mutual Information | Unknown | N/A | |
| A Bit More Bayesian: Domain-Invariant Learning with Uncertainty | Unknown | N/A | |
| Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision | Unknown | N/A | |
| Modelling Behavioural Diversity for Learning in Open-Ended Games | Unknown | N/A | |
| Active Deep Probabilistic Subsampling | Unknown | N/A | |
| Resource Allocation in Multi-armed Bandit Exploration: Overcoming Sublinear Scaling with Adaptive Parallelism | Unknown | N/A | |
| Valid Causal Inference with (Some) Invalid Instruments | Unknown | N/A | |
| Finite-Sample Analysis of Off-Policy Natural Actor-Critic Algorithm | Unknown | N/A | |
| Risk Bounds and Rademacher Complexity in Batch Reinforcement Learning | Unknown | N/A | |
| Disambiguation of Weak Supervision leading to Exponential Convergence rates | Unknown | N/A | |
| Towards Practical Mean Bounds for Small Samples | Unknown | N/A | |
| Projection Robust Wasserstein Barycenters | Unknown | N/A | |
| Reinforcement Learning for Cost-Aware Markov Decision Processes | Unknown | N/A | |
| Learning a Universal Template for Few-shot Dataset Generalization | Unknown | N/A | |
| Better Training using Weight-Constrained Stochastic Dynamics | Unknown | N/A | |
| Leveraging Sparse Linear Layers for Debuggable Deep Networks | Unknown | N/A | |
| Logarithmic Regret for Reinforcement Learning with Linear Function Approximation | Unknown | N/A | |
| Dataset Condensation with Differentiable Siamese Augmentation | Unknown | N/A | |
| Consistent regression when oblivious outliers overwhelm | Unknown | N/A | |
| Learning Diverse-Structured Networks for Adversarial Robustness | Unknown | N/A | |
| A Unified Lottery Ticket Hypothesis for Graph Neural Networks | Unknown | N/A | |
| Approximation Theory of Convolutional Architectures for Time Series Modelling | Unknown | N/A | |
| Parallel Droplet Control in MEDA Biochips using Multi-Agent Reinforcement Learning | Unknown | N/A | |
| Bias-Free Scalable Gaussian Processes via Randomized Truncations | Unknown | N/A | |
| Provably Efficient Learning of Transferable Rewards | Unknown | N/A | |
| ConvexVST: A Convex Optimization Approach to Variance-stabilizing Transformation | Unknown | N/A | |
| Differentially Private Quantiles | Unknown | N/A | |
| Training Graph Neural Networks with 1000 Layers | Unknown | N/A | |
| Hyperparameter Selection for Imitation Learning | Unknown | N/A | |
| Active Feature Acquisition with Generative Surrogate Models | Unknown | N/A | |
| Machine Unlearning for Random Forests | Unknown | N/A | |
| Improved OOD Generalization via Adversarial Training and Pretraining | Unknown | N/A | |
| Learning from Biased Data: A Semi-Parametric Approach | Unknown | N/A | |
| Provably End-to-end Label-noise Learning without Anchor Points | Unknown | N/A | |
| Implicit Bias of Linear RNNs | Unknown | N/A | |
| Combinatorial Blocking Bandits with Stochastic Delays | Unknown | N/A | |
| Offline Contextual Bandits with Overparameterized Models | Unknown | N/A | |
| Conformal prediction interval for dynamic time-series | Unknown | N/A | |
| Kernel Continual Learning | Unknown | N/A | |
| Optimal regret algorithm for Pseudo-1d Bandit Convex Optimization | Unknown | N/A | |
| Dueling Convex Optimization | Unknown | N/A | |
| Learning and Planning in Complex Action Spaces | Unknown | N/A | |
| SiameseXML: Siamese Networks meet Extreme Classifiers with 100M Labels | Unknown | N/A | |
| Adversarial Dueling Bandits | Unknown | N/A | |
| On the Generalization Power of Overfitted Two-Layer Neural Tangent Kernel Models | Unknown | N/A | |
| Improving Predictors via Combination Across Diverse Task Categories | Unknown | N/A | |
| Is Pessimism Provably Efficient for Offline RL? | Unknown | N/A | |
| A theory of high dimensional regression with arbitrary correlations between input features and target functions: sample complexity, multiple descent curves and a hierarchy of phase transitions | Unknown | N/A | |
| PACOH: Bayes-Optimal Meta-Learning with PAC-Guarantees | Unknown | N/A | |
| Understanding self-supervised learning dynamics without contrastive pairs | Unknown | N/A | |
| On the Implicit Bias of Initialization Shape: Beyond Infinitesimal Mirror Descent | Unknown | N/A | |
| Boosting for Online Convex Optimization | Unknown | N/A | |
| Exploration in Approximate Hyper-State Space for Meta Reinforcement Learning | Unknown | N/A | |
| Model-Targeted Poisoning Attacks with Provable Convergence | Unknown | N/A | |
| Statistical Estimation from Dependent Data | Unknown | N/A | |
| Correcting Exposure Bias for Link Recommendation | Unknown | N/A | |
| A Probabilistic Approach to Neural Network Pruning | Unknown | N/A | |
| Diversity Actor-Critic: Sample-Aware Entropy Regularization for Sample-Efficient Exploration | Unknown | N/A | |
| TeachMyAgent: a Benchmark for Automatic Curriculum Learning in Deep RL | Unknown | N/A | |
| Discretization Drift in Two-Player Games | Unknown | N/A | |
| Optimizing persistent homology based functions | Unknown | N/A | |
| Augmented World Models Facilitate Zero-Shot Dynamics Generalization From a Single Offline Environment | Unknown | N/A | |
| Wasserstein Distributional Normalization For Robust Distributional Certification of Noisy Labeled Data | Unknown | N/A | |
| Whittle Networks: A Deep Likelihood Model for Time Series | Unknown | N/A | |
| AGENT: A Benchmark for Core Psychological Reasoning | Unknown | N/A | |
| Moreau-Yosida $f$-divergences | Unknown | N/A | |
| Accurate Post Training Quantization With Small Calibration Sets | Unknown | N/A | |
| Improved Regret Bounds of Bilinear Bandits using Action Space Analysis | Unknown | N/A | |
| Model-Based Reinforcement Learning via Latent-Space Collocation | Unknown | N/A | |
| Learning Node Representations Using Stationary Flow Prediction on Large Payment and Cash Transaction Networks | Unknown | N/A | |
| ChaCha for Online AutoML | Unknown | N/A | |
| Hierarchical Clustering of Data Streams: Scalable Algorithms and Approximation Guarantees | Unknown | N/A | |
| SinIR: Efficient General Image Manipulation with Single Image Reconstruction | Unknown | N/A | |
| Simultaneous Similarity-based Self-Distillation for Deep Metric Learning | Unknown | N/A | |
| Budgeted Heterogeneous Treatment Effect Estimation | Unknown | N/A | |
| Discrete-Valued Latent Preference Matrix Estimation with Graph Side Information | Unknown | N/A | |
| Confidence Scores Make Instance-dependent Label-noise Learning Possible | Unknown | N/A | |
| Whitening for Self-Supervised Representation Learning | Unknown | N/A | |
| Principled Simplicial Neural Networks for Trajectory Prediction | Unknown | N/A | |
| Fused Acoustic and Text Encoding for Multimodal Bilingual Pretraining and Speech Translation | Unknown | N/A | |
| UneVEn: Universal Value Exploration for Multi-Agent Reinforcement Learning | Unknown | N/A | |
| Accelerated Algorithms for Smooth Convex-Concave Minimax Problems with O(1/k^2) Rate on Squared Gradient Norm | Unknown | N/A | |
| Top-k eXtreme Contextual Bandits with Arm Hierarchy | Unknown | N/A | |
| Inference for Network Regression Models with Community Structure | Unknown | N/A | |
| The Implicit Bias for Adaptive Optimization Algorithms on Homogeneous Neural Networks | Unknown | N/A | |
| BORE: Bayesian Optimization by Density-Ratio Estimation | Unknown | N/A | |
| PHEW: Constructing Sparse Networks that Learn Fast and Generalize Well without Training Data | Unknown | N/A | |
| Weisfeiler and Lehman Go Topological: Message Passing Simplicial Networks | Unknown | N/A | |
| A Discriminative Technique for Multiple-Source Adaptation | Unknown | N/A | |
| Online Selection Problems against Constrained Adversary | Unknown | N/A | |
| Fairness for Image Generation with Uncertain Sensitive Attributes | Unknown | N/A | |
| Robust Policy Gradient against Strong Data Corruption | Unknown | N/A | |
| Robust Pure Exploration in Linear Bandits with Limited Budget | Unknown | N/A | |
| Asynchronous Distributed Learning: Adapting to Gradient Delays without Prior Knowledge | Unknown | N/A | |
| RRL: Resnet as representation for Reinforcement Learning | Unknown | N/A | |
| Post-selection inference with HSIC-Lasso | Unknown | N/A | |
| Learning Interaction Kernels for Agent Systems on Riemannian Manifolds | Unknown | N/A | |
| Aggregating From Multiple Target-Shifted Sources | Unknown | N/A | |
| A General Framework For Detecting Anomalous Inputs to DNN Classifiers | Unknown | N/A | |
| A Modular Analysis of Provable Acceleration via Polyak's Momentum: Training a Wide ReLU Network and a Deep Linear Network | Unknown | N/A | |
| Value Alignment Verification | Unknown | N/A | |
| Towards Distraction-Robust Active Visual Tracking | Unknown | N/A | |
| Private Alternating Least Squares: Practical Private Matrix Completion with Tighter Rates | Unknown | N/A | |
| Fairness and Bias in Online Selection | Unknown | N/A | |
| Model Performance Scaling with Multiple Data Sources | Unknown | N/A | |
| Exact Gap between Generalization Error and Uniform Convergence in Random Feature Models | Unknown | N/A | |
| Online Learning in Unknown Markov Games | Unknown | N/A | |
| Revealing the Structure of Deep Neural Networks via Convex Duality | Unknown | N/A | |
| Uncertainty Principles of Encoding GANs | Unknown | N/A | |
| Navigation Turing Test (NTT): Learning to Evaluate Human-Like Navigation | Unknown | N/A | |
| Understanding and Mitigating Accuracy Disparity in Regression | Unknown | N/A | |
| Megaverse: Simulating Embodied Agents at One Million Experiences per Second | Unknown | N/A | |
| Stability and Generalization of Stochastic Gradient Methods for Minimax Problems | Unknown | N/A | |
| Cross-model Back-translated Distillation for Unsupervised Machine Translation | Unknown | N/A | |
| Revenue-Incentive Tradeoffs in Dynamic Reserve Pricing | Unknown | N/A | |
| End-to-End Learning of Coherent Probabilistic Forecasts for Hierarchical Time Series | Unknown | N/A | |
| Principal Bit Analysis: Autoencoding with Schur-Concave Loss | Unknown | N/A | |
| Improved Algorithms for Agnostic Pool-based Active Classification | Unknown | N/A | |
| Randomized Dimensionality Reduction for Facility Location and Single-Linkage Clustering | Unknown | N/A | |
| Sample-Optimal PAC Learning of Halfspaces with Malicious Noise | Unknown | N/A | |
| Differentially Private Bayesian Inference for Generalized Linear Models | Unknown | N/A | |
| Online Learning for Load Balancing of Unknown Monotone Resource Allocation Games | Unknown | N/A | |
| Generative Video Transformer: Can Objects be the Words? | Unknown | N/A | |
| A Wasserstein Minimax Framework for Mixed Linear Regression | Unknown | N/A | |
| Grounding Language to Entities and Dynamics for Generalization in Reinforcement Learning | Unknown | N/A | |
| On a Combination of Alternating Minimization and Nesterov's Momentum | Unknown | N/A | |
| Latent Space Energy-Based Model of Symbol-Vector Coupling for Text Generation and Classification | Unknown | N/A | |
| Sparse Bayesian Learning via Stepwise Regression | Unknown | N/A | |
| Auto-NBA: Efficient and Effective Search Over the Joint Space of Networks, Bitwidths, and Accelerators | Unknown | N/A | |
| Reasoning Over Virtual Knowledge Bases With Open Predicate Relations | Unknown | N/A | |
| Learning from Noisy Labels with No Change to the Training Process | Unknown | N/A | |
| Bayesian Structural Adaptation for Continual Learning | Unknown | N/A | |
| Scalable Evaluation of Multi-Agent Reinforcement Learning with Melting Pot | Unknown | N/A | |
| DriftSurf: Stable-State / Reactive-State Learning under Concept Drift | Unknown | N/A | |
| Streaming and Distributed Algorithms for Robust Column Subset Selection | Unknown | N/A | |
| Operationalizing Complex Causes: A Pragmatic View of Mediation | Unknown | N/A | |
| Neural Feature Matching in Implicit 3D Representations | Unknown | N/A | |
| From Poincaré Recurrence to Convergence in Imperfect Information Games: Finding Equilibrium via Regularization | Unknown | N/A | |
| MSA Transformer | Unknown | N/A | |
| Heterogeneous Risk Minimization | Unknown | N/A | |
| Variational Auto-Regressive Gaussian Processes for Continual Learning | Unknown | N/A | |
| Group Fisher Pruning for Practical Network Compression | Unknown | N/A | |
| How Important is the Train-Validation Split in Meta-Learning? | Unknown | N/A | |
| Outlier-Robust Optimal Transport | Unknown | N/A | |
| Learn-to-Share: A Hardware-friendly Transfer Learning Framework Exploiting Computation and Parameter Sharing | Unknown | N/A | |
| Consensus Control for Decentralized Deep Learning | Unknown | N/A | |
| Mixed Nash Equilibria in the Adversarial Examples Game | Unknown | N/A | |
| Active Covering | Unknown | N/A | |
| Towards Certifying L-infinity Robustness using Neural Networks with L-inf-dist Neurons | Unknown | N/A | |
| Joint Online Learning and Decision-making via Dual Mirror Descent | Unknown | N/A | |
| Attention is not all you need: pure attention loses rank doubly exponentially with depth | Unknown | N/A | |
| Nonparametric Decomposition of Sparse Tensors | Unknown | N/A | |
| Generalization Guarantees for Neural Architecture Search with Train-Validation Split | Unknown | N/A | |
| A Deep Reinforcement Learning Approach to Marginalized Importance Sampling with the Successor Representation | Unknown | N/A | |
| Calibrate Before Use: Improving Few-shot Performance of Language Models | Unknown | N/A | |
| I-BERT: Integer-only BERT Quantization | Unknown | N/A | |
| A Language for Counterfactual Generative Models | Unknown | N/A | |
| Thinking Like Transformers | Unknown | N/A | |
| Private Stochastic Convex Optimization: Optimal Rates in L1 Geometry | Unknown | N/A | |
| Multidimensional Scaling: Approximation and Complexity | Unknown | N/A | |
| Discovering symbolic policies with deep reinforcement learning | Unknown | N/A | |
| Learning from History for Byzantine Robust Optimization | Unknown | N/A | |
| Loss Surface Simplexes for Mode Connecting Volumes and Fast Ensembling | Unknown | N/A | |
| On-the-fly Rectification for Robust Large-Vocabulary Topic Inference | Unknown | N/A | |
| Additive Error Guarantees for Weighted Low Rank Approximation | Unknown | N/A | |
| Problem Dependent View on Structured Thresholding Bandit Problems | Unknown | N/A | |
| Heterogeneity for the Win: One-Shot Federated Clustering | Unknown | N/A | |
| Characterizing Structural Regularities of Labeled Data in Overparameterized Models | Unknown | N/A | |
| Oneshot Differentially Private Top-k Selection | Unknown | N/A | |
| Online Policy Gradient for Model Free Learning of Linear Quadratic Regulators with √T Regret | Unknown | N/A | |
| Double-Win Quant: Aggressively Winning Robustness of Quantized Deep Neural Networks via Random Precision Training and Inference | Unknown | N/A | |
| Unifying Vision-and-Language Tasks via Text Generation | Unknown | N/A | |
| Explanations for Monotonic Classifiers | Unknown | N/A | |
| Towards Tight Bounds on the Sample Complexity of Average-reward MDPs | Unknown | N/A | |
| Measuring Robustness in Deep Learning Based Compressive Sensing | Unknown | N/A | |
| Recovering AES Keys with a Deep Cold Boot Attack | Unknown | N/A | |
| ASAM: Adaptive Sharpness-Aware Minimization for Scale-Invariant Learning of Deep Neural Networks | Unknown | N/A | |
| Optimal Counterfactual Explanations in Tree Ensembles | Unknown | N/A | |
| Learning Neural Network Subspaces | Unknown | N/A | |
| Predict then Interpolate: A Simple Algorithm to Learn Stable Classifiers | Unknown | N/A | |
| Cross-Gradient Aggregation for Decentralized Learning from Non-IID Data | Unknown | N/A | |
| Bayesian Deep Learning via Subnetwork Inference | Unknown | N/A | |
| Solving high-dimensional parabolic PDEs using the tensor train format | Unknown | N/A | |
| Diffusion Earth Mover's Distance and Distribution Embeddings | Unknown | N/A | |
| Parallel and Flexible Sampling from Autoregressive Models via Langevin Dynamics | Unknown | N/A | |
| A Tale of Two Efficient and Informative Negative Sampling Distributions | Unknown | N/A | |
| Tractable structured natural-gradient descent using local parameterizations | Unknown | N/A | |
| Spectral Normalisation for Deep Reinforcement Learning: An Optimisation Perspective | Unknown | N/A | |
| A Structured Observation Distribution for Generative Biological Sequence Prediction and Forecasting | Unknown | N/A | |
| Proximal Causal Learning with Kernels: Two-Stage Estimation and Moment Restriction | Unknown | N/A | |
| Dynamic Balancing for Model Selection in Bandits and RL | Unknown | N/A | |
| Coach-Player Multi-agent Reinforcement Learning for Dynamic Team Composition | Unknown | N/A | |
| Controlling Graph Dynamics with Reinforcement Learning and Graph Neural Networks | Unknown | N/A | |
| A Practical Method for Constructing Equivariant Multilayer Perceptrons for Arbitrary Matrix Groups | Unknown | N/A | |
| Neural Symbolic Regression that scales | Unknown | N/A | |
| Label Inference Attacks from Log-loss Scores | Unknown | N/A | |
| A Precise Performance Analysis of Support Vector Regression | Unknown | N/A | |
| On Linear Identifiability of Learned Representations | Unknown | N/A | |
| Generalizable Episodic Memory for Deep Reinforcement Learning | Unknown | N/A | |
| Zoo-Tuning: Adaptive Transfer from A Zoo of Models | Unknown | N/A | |
| Characterizing the Gap Between Actor-Critic and Policy Gradient | Unknown | N/A | |
| DeepReDuce: ReLU Reduction for Fast Private Inference | Unknown | N/A | |
| Learning to Generate Noise for Multi-Attack Robustness | Unknown | N/A | |
| Globally-Robust Neural Networks | Unknown | N/A | |
| MorphVAE: Generating Neural Morphologies from 3D-Walks using a Variational Autoencoder with Spherical Latent Space | Unknown | N/A | |
| Binary Classification from Multiple Unlabeled Datasets via Surrogate Set Classification | Unknown | N/A | |
| State Entropy Maximization with Random Encoders for Efficient Exploration | Unknown | N/A | |
| EL-Attention: Memory Efficient Lossless Attention for Generation | Unknown | N/A | |
| The Earth Mover's Pinball Loss: Quantiles for Histogram-Valued Regression | Unknown | N/A | |
| GRAD-MATCH: Gradient Matching based Data Subset Selection for Efficient Deep Model Training | Unknown | N/A | |
| CountSketches, Feature Hashing and the Median of Three | Unknown | N/A | |
| Improved Contrastive Divergence Training of Energy-Based Models | Unknown | N/A | |
| Marginalized Stochastic Natural Gradients for Black-Box Variational Inference | Unknown | N/A | |
| SketchEmbedNet: Learning Novel Concepts by Imitating Drawings | Unknown | N/A | |
| Safe Reinforcement Learning with Linear Function Approximation | Unknown | N/A | |
| Leveraged Weighted Loss for Partial Label Learning | Unknown | N/A | |
| HyperHyperNetwork for the Design of Antenna Arrays | Unknown | N/A | |
| Explainable Automated Graph Representation Learning with Hyperparameter Importance | Unknown | N/A | |
| CARTL: Cooperative Adversarially-Robust Transfer Learning | Unknown | N/A | |
| Meta-StyleSpeech: Multi-Speaker Adaptive Text-to-Speech Generation | Unknown | N/A | |
| MARINA: Faster Non-Convex Distributed Learning with Compression | Unknown | N/A | |
| Parametric Graph for Unimodal Ranking Bandit | Unknown | N/A | |
| No-regret Algorithms for Capturing Events in Poisson Point Processes | Unknown | N/A | |
| Estimation and Quantization of Expected Persistence Diagrams | Unknown | N/A | |
| Follow-the-Regularized-Leader Routes to Chaos in Routing Games | Unknown | N/A | |
| Decentralized Single-Timescale Actor-Critic on Zero-Sum Two-Player Stochastic Games | Unknown | N/A | |
| Towards Understanding and Mitigating Social Biases in Language Models | Unknown | N/A | |
| Unsupervised Co-part Segmentation through Assembly | Unknown | N/A | |
| PipeTransformer: Automated Elastic Pipelining for Distributed Training of Large-scale Models | Unknown | N/A | |
| Implicit rate-constrained optimization of non-decomposable objectives | Unknown | N/A | |
| To be Robust or to be Fair: Towards Fairness in Adversarial Training | Unknown | N/A | |
| Neural Pharmacodynamic State Space Modeling | Unknown | N/A | |
| Adaptive Newton Sketch: Linear-time Optimization with Quadratic Convergence and Effective Hessian Dimensionality | Unknown | N/A | |
| Accuracy, Interpretability, and Differential Privacy via Explainable Boosting | Unknown | N/A | |
| Reinforcement Learning of Implicit and Explicit Control Flow Instructions | Unknown | N/A | |
| SCC: an efficient deep reinforcement learning agent mastering the game of StarCraft II | Unknown | N/A | |
| Policy Gradient Bayesian Robust Optimization for Imitation Learning | Unknown | N/A | |
| Online Learning with Optimism and Delay | Unknown | N/A | |
| Blind Pareto Fairness and Subgroup Robustness | Unknown | N/A | |
| Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games | Unknown | N/A | |
| Optimal Estimation of High Dimensional Smooth Additive Function Based on Noisy Observations | Unknown | N/A | |
| Structured World Belief for Reinforcement Learning in POMDP | Unknown | N/A | |
| Improved Denoising Diffusion Probabilistic Models | Unknown | N/A | |
| Prior Image-Constrained Reconstruction using Style-Based Generative Models | Unknown | N/A | |
| Vector Quantized Models for Planning | Unknown | N/A | |
| On the Convergence of Hamiltonian Monte Carlo with Stochastic Gradients | Unknown | N/A | |
| f-Domain Adversarial Learning: Theory and Algorithms | Unknown | N/A | |
| Bias-Variance Reduced Local SGD for Less Heterogeneous Federated Learning | Unknown | N/A | |
| Automatic variational inference with cascading flows | Unknown | N/A | |
| Mediated Uncoupled Learning: Learning Functions without Direct Input-output Correspondences | Unknown | N/A | |
| Dataset Dynamics via Gradient Flows in Probability Space | Unknown | N/A | |
| Lower Bounds on Cross-Entropy Loss in the Presence of Test-time Adversaries | Unknown | N/A | |
| Fast Stochastic Bregman Gradient Methods: Sharp Analysis and Variance Reduction | Unknown | N/A | |
| Link Prediction with Persistent Homology: An Interactive View | Unknown | N/A | |
| Regret and Cumulative Constraint Violation Analysis for Online Convex Optimization with Long Term Constraints | Unknown | N/A | |
| PopSkipJump: Decision-Based Attack for Probabilistic Classifiers | Unknown | N/A | |
| Black-box density function estimation using recursive partitioning | Unknown | N/A | |
| Transfer-Based Semantic Anomaly Detection | Unknown | N/A | |
| Bilevel Optimization: Convergence Analysis and Enhanced Design | Unknown | N/A | |
| Neuro-algorithmic Policies Enable Fast Combinatorial Generalization | Unknown | N/A | |
| Collaborative Bayesian Optimization with Fair Regret | Unknown | N/A | |
| Leveraging Non-uniformity in First-order Non-convex Optimization | Unknown | N/A | |
| Solving Inverse Problems with a Flow-based Noise Model | Unknown | N/A | |
| Trajectory Diversity for Zero-Shot Coordination | Unknown | N/A | |
| Zero-Shot Text-to-Image Generation | Unknown | N/A | |
| Lottery Ticket Preserves Weight Correlation: Is It Desirable or Not? | Unknown | N/A | |
| Understanding Failures in Out-of-Distribution Detection with Deep Generative Models | Unknown | N/A | |
| One for One, or All for All: Equilibria and Optimality of Collaboration in Federated Learning | Unknown | N/A | |
| On the Inherent Regularization Effects of Noise Injection During Training | Unknown | N/A | |
| Learn2Hop: Learned Optimization on Rough Landscapes | Unknown | N/A | |
| A Zeroth-Order Block Coordinate Descent Algorithm for Huge-Scale Black-Box Optimization | Unknown | N/A | |
| Efficient Iterative Amortized Inference for Learning Symmetric and Disentangled Multi-Object Representations | Unknown | N/A | |
| State Relevance for Off-Policy Evaluation | Unknown | N/A | |
| Leveraging Public Data for Practical Private Query Release | Unknown | N/A | |
| Sample Complexity of Robust Linear Classification on Separated Data | Unknown | N/A | |
| Continual Learning in the Teacher-Student Setup: Impact of Task Similarity | Unknown | N/A | |
| Spectral Smoothing Unveils Phase Transitions in Hierarchical Variational Autoencoders | Unknown | N/A | |
| Robust Testing and Estimation under Manipulation Attacks | Unknown | N/A | |
| Policy Caches with Successor Features | Unknown | N/A | |
| Neighborhood Contrastive Learning Applied to Online Patient Monitoring | Unknown | N/A | |
| Value Iteration in Continuous Actions, States and Time | Unknown | N/A | |
| Provable Meta-Learning of Linear Representations | Unknown | N/A | |
| Provably Efficient Algorithms for Multi-Objective Competitive RL | Unknown | N/A | |
| Decoupling Value and Policy for Generalization in Reinforcement Learning | Unknown | N/A | |
| High Confidence Generalization for Reinforcement Learning | Unknown | N/A | |
| Fair Selective Classification Via Sufficiency | Unknown | N/A | |
| SECANT: Self-Expert Cloning for Zero-Shot Generalization of Visual Policies | Unknown | N/A | |
| Dimensionality Reduction for the Sum-of-Distances Metric | Unknown | N/A | |
| Synthesizer: Rethinking Self-Attention for Transformer Models | Unknown | N/A | |
| Weight-covariance alignment for adversarially robust neural networks | Unknown | N/A | |
| Robust Unsupervised Learning via L-statistic Minimization | Unknown | N/A | |
| Active Testing: Sample-Efficient Model Evaluation | Unknown | N/A | |
| Reserve Price Optimization for First Price Auctions in Display Advertising | Unknown | N/A | |
| E(n) Equivariant Graph Neural Networks | Unknown | N/A | |
| Demystifying Inductive Biases for (Beta-)VAE Based Architectures | Unknown | N/A | |
| Disentangling syntax and semantics in the brain with deep networks | Unknown | N/A | |
| FOP: Factorizing Optimal Joint Policy of Maximum-Entropy Multi-Agent Reinforcement Learning | Unknown | N/A | |
| Training Quantized Neural Networks to Global Optimality via Semidefinite Programming | Unknown | N/A | |
| Generative Particle Variational Inference via Estimation of Functional Gradients | Unknown | N/A | |
| Size-Invariant Graph Representations for Graph Classification Extrapolations | Unknown | N/A | |
| Progressive-Scale Boundary Blackbox Attack via Projective Gradient Estimation | Unknown | N/A | |
| LARNet: Lie Algebra Residual Network for Face Recognition | Unknown | N/A | |
| Sinkhorn Label Allocation: Semi-Supervised Classification via Annealed Self-Training | Unknown | N/A | |
| Inferring Latent Dynamics Underlying Neural Population Activity via Neural Differential Equations | Unknown | N/A | |
| Private Adaptive Gradient Methods for Convex Optimization | Unknown | N/A | |
| Hierarchical Agglomerative Graph Clustering in Nearly-Linear Time | Unknown | N/A | |
| On Learnability via Gradient Method for Two-Layer ReLU Neural Networks in Teacher-Student Setting | Unknown | N/A | |
| LTL2Action: Generalizing LTL Instructions for Multi-Task RL | Unknown | N/A | |
| Learning Transferable Visual Models From Natural Language Supervision | Unknown | N/A | |
| Benchmarks, Algorithms, and Metrics for Hierarchical Disentanglement | Unknown | N/A | |
| Out-of-Distribution Generalization via Risk Extrapolation (REx) | Unknown | N/A | |
| TeraPipe: Token-Level Pipeline Parallelism for Training Large-Scale Language Models | Unknown | N/A | |
| Acceleration via Fractal Learning Rate Schedules | Unknown | N/A | |
| Local Correlation Clustering with Asymmetric Classification Errors | Unknown | N/A | |
| How and Why to Use Experimental Data to Evaluate Methods for Observational Causal Inference | Unknown | N/A | |
| Sample Efficient Reinforcement Learning In Continuous State Spaces: A Perspective Beyond Linearity | Unknown | N/A | |
| The Impact of Record Linkage on Learning from Feature Partitioned Data | Unknown | N/A | |
| The Symmetry between Arms and Knapsacks: A Primal-Dual Approach for Bandits with Knapsacks | Unknown | N/A | |
| Robust Inference for High-Dimensional Linear Models via Residual Randomization | Unknown | N/A | |
| XOR-CD: Linearly Convergent Constrained Structure Generation | Unknown | N/A | |
| Just Train Twice: Improving Group Robustness without Training Group Information | Unknown | N/A | |
| Permutation Weighting | Unknown | N/A | |
| Deep kernel processes | Unknown | N/A | |
| Examining and Combating Spurious Features under Distribution Shift | Unknown | N/A | |
| Learning to Price Against a Moving Target | Unknown | N/A | |
| Bilinear Classes: A Structural Framework for Provable Generalization in RL | Unknown | N/A | |
| Emergent Social Learning via Multi-agent Reinforcement Learning | Unknown | N/A | |
| Improving Generalization in Meta-learning via Task Augmentation | Unknown | N/A | |
| Message Passing Adaptive Resonance Theory for Online Active Semi-supervised Learning | Unknown | N/A | |
| Batch Value-function Approximation with Only Realizability | Unknown | N/A | |
| Accelerate CNNs from Three Dimensions: A Comprehensive Pruning Framework | Unknown | N/A | |
| Revisiting Rainbow: Promoting more insightful and inclusive deep reinforcement learning research | Unknown | N/A | |
| Instance Specific Approximations for Submodular Maximization | Unknown | N/A | |
| Learning Representations by Humans, for Humans | Unknown | N/A | |
| Accumulated Decoupled Learning with Gradient Staleness Mitigation for Convolutional Neural Networks | Unknown | N/A | |
| Voice2Series: Reprogramming Acoustic Models for Time Series Classification | Unknown | N/A | |
| Path Planning using Neural A* Search | Unknown | N/A | |
| CATE: Computation-aware Neural Architecture Encoding with Transformers | Unknown | N/A | |
| Unified Robust Semi-Supervised Variational Autoencoder | Unknown | N/A | |
| Outside the Echo Chamber: Optimizing the Performative Risk | Unknown | N/A | |
| Catformer: Designing Stable Transformers via Sensitivity Analysis | Unknown | N/A | |
| MOTS: Minimax Optimal Thompson Sampling | Unknown | N/A | |
| On the Power of Localized Perceptron for Label-Optimal Learning of Halfspaces with Adversarial Noise | Unknown | N/A | |
| Riemannian Convex Potential Maps | Unknown | N/A | |
| Debiasing a First-order Heuristic for Approximate Bi-level Optimization | Unknown | N/A | |
| Isometric Gaussian Process Latent Variable Model for Dissimilarity Data | Unknown | N/A | |
| Relative Positional Encoding for Transformers with Linear Complexity | Unknown | N/A | |
| GANMEX: One-vs-One Attributions using GAN-based Model Explainability | Unknown | N/A | |
| Defense against backdoor attacks via robust covariance estimation | Unknown | N/A | |
| Bayesian Attention Belief Networks | Unknown | N/A | |
| Nondeterminism and Instability in Neural Network Optimization | Unknown | N/A | |
| Large-Scale Multi-Agent Deep FBSDEs | Unknown | N/A | |
| Generative Adversarial Transformers | Unknown | N/A | |
| CRPO: A New Approach for Safe Reinforcement Learning with Convergence Guarantee | Unknown | N/A | |
| SPADE: A Spectral Method for Black-Box Adversarial Robustness Evaluation | Unknown | N/A | |
| Learning Self-Modulating Attention in Continuous Time Space with Applications to Sequential Recommendation | Unknown | N/A | |
| Straight to the Gradient: Learning to Use Novel Tokens for Neural Text Generation | Unknown | N/A | |
| GMAC: A Distributional Perspective on Actor-Critic Framework | Unknown | N/A | |
| Order Matters: Probabilistic Modeling of Node Sequence for Graph Generation | Unknown | N/A | |
| Asynchronous Decentralized Optimization With Implicit Stochastic Variance Reduction | Unknown | N/A | |
| Learning Stochastic Behaviour from Aggregate Data | Unknown | N/A | |
| Scalable Computations of Wasserstein Barycenter via Input Convex Neural Networks | Unknown | N/A | |
| Is Space-Time Attention All You Need for Video Understanding? | Unknown | N/A | |
| Interaction-Grounded Learning | Unknown | N/A | |
| The Emergence of Individuality | Unknown | N/A | |
| Group-Sparse Matrix Factorization for Transfer Learning of Word Embeddings | Unknown | N/A | |
| Large-Scale Meta-Learning with Continual Trajectory Shifting | Unknown | N/A | |
| Order-Agnostic Cross Entropy for Non-Autoregressive Machine Translation | Unknown | N/A | |
| Sharf: Shape-conditioned Radiance Fields from a Single View | Unknown | N/A | |
| Commutative Lie Group VAE for Disentanglement Learning | Unknown | N/A | |
| A Receptor Skeleton for Capsule Neural Networks | Unknown | N/A | |
| On the price of explainability for some clustering problems | Unknown | N/A | |
| GRAND: Graph Neural Diffusion | Unknown | N/A | |
| Variance Reduced Training with Stratified Sampling for Forecasting Models | Unknown | N/A | |
| Quantitative Understanding of VAE as a Non-linearly Scaled Isometric Embedding | Unknown | N/A | |
| ProGraML: A Graph-based Program Representation for Data Flow Analysis and Compiler Optimizations | Unknown | N/A | |
| Selecting Data Augmentation for Simulating Interventions | Unknown | N/A | |
| On Explainability of Graph Neural Networks via Subgraph Explorations | Unknown | N/A | |
| Event Outlier Detection in Continuous Time | Unknown | N/A | |
| Uncertainty Weighted Actor-Critic for Offline Reinforcement Learning | Unknown | N/A | |
| What Makes for End-to-End Object Detection? | Unknown | N/A | |
| Understanding Noise Injection in GANs | Unknown | N/A | |
| Learning by Turning: Neural Architecture Aware Optimisation | Unknown | N/A | |
| FILTRA: Rethinking Steerable CNN by Filter Transform | Unknown | N/A | |
| Gradient Disaggregation: Breaking Privacy in Federated Learning by Reconstructing the User Participant Matrix | Unknown | N/A | |
| Crystallization Learning with the Delaunay Triangulation | Unknown | N/A | |
| Principled Exploration via Optimistic Bootstrapping and Backward Induction | Unknown | N/A | |
| Near-Optimal Entrywise Anomaly Detection for Low-Rank Matrices with Sub-Exponential Noise | Unknown | N/A | |
| TFix: Learning to Fix Coding Errors with a Text-to-Text Transformer | Unknown | N/A | |
| Mind the Box: $l_1$-APGD for Sparse Adversarial Attacks on Image Classifiers | Unknown | N/A | |
| On the Problem of Underranking in Group-Fair Ranking | Unknown | N/A | |
| On Reward-Free RL with Kernel and Neural Function Approximations: Single-Agent MDP and Markov Game | Unknown | N/A | |
| Improving Ultrametrics Embeddings Through Coresets | Unknown | N/A | |
| Scalable Optimal Transport in High Dimensions for Graph Distances, Embedding Alignment, and More | Unknown | N/A | |
| Provably Efficient Fictitious Play Policy Optimization for Zero-Sum Markov Games with Structured Transitions | Unknown | N/A | |
| Efficient Online Learning for Dynamic k-Clustering | Unknown | N/A | |
| Federated Learning of User Verification Models Without Sharing Embeddings | Unknown | N/A | |
| Rethinking Neural vs. Matrix-Factorization Collaborative Filtering: the Theoretical Perspectives | Unknown | N/A | |
| Incentivized Bandit Learning with Self-Reinforcing User Preferences | Unknown | N/A | |
| Quantile Bandits for Best Arms Identification | Unknown | N/A | |
| Robust Reinforcement Learning using Least Squares Policy Iteration with Provable Performance Guarantees | Unknown | N/A | |
| STRODE: Stochastic Boundary Ordinary Differential Equation | Unknown | N/A | |
| Align, then memorise: the dynamics of learning with feedback alignment | Unknown | N/A | |
| Communication-Efficient Distributed SVD via Local Power Iterations | Unknown | N/A | |
| Breaking the Deadly Triad with a Target Network | Unknown | N/A | |
| Adversarial Robustness Guarantees for Random Deep Neural Networks | Unknown | N/A | |
| In-Database Regression in Input Sparsity Time | Unknown | N/A | |
| Inferring serial correlation with dynamic backgrounds | Unknown | N/A | |
| The Limits of Min-Max Optimization Algorithms: Convergence to Spurious Non-Critical Sets | Unknown | N/A | |
| Towards Defending against Adversarial Examples via Attack-Invariant Features | Unknown | N/A | |
| Asymptotic Normality and Confidence Intervals for Prediction Risk of the Min-Norm Least Squares Estimator | Unknown | N/A | |
| Optimization of Graph Neural Networks: Implicit Acceleration by Skip Connections and More Depth | Unknown | N/A | |
| Communication-Efficient Distributed Optimization with Quantized Preconditioners | Unknown | N/A | |
| AdaXpert: Adapting Neural Architecture for Growing Data | Unknown | N/A | |
| On Reinforcement Learning with Adversarial Corruption and Its Application to Block MDP | Unknown | N/A | |
| SoundDet: Polyphonic Moving Sound Event Detection and Localization from Raw Waveform | Unknown | N/A | |
| On Characterizing GAN Convergence Through Proximal Duality Gap | Unknown | N/A | |
| Unbalanced minibatch Optimal Transport; applications to Domain Adaptation | Unknown | N/A | |
| What's in the Box? Exploring the Inner Life of Neural Networks with Robust Rules | Unknown | N/A | |
| Classifying high-dimensional Gaussian mixtures: Where kernel methods fail and neural networks succeed | Unknown | N/A | |
| ConViT: Improving Vision Transformers with Soft Convolutional Inductive Biases | Unknown | N/A | |
| Sparse Feature Selection Makes Batch Reinforcement Learning More Sample Efficient | Unknown | N/A | |
| Bootstrapping Fitted Q-Evaluation for Off-Policy Inference | Unknown | N/A | |
| Fast Algorithms for Stackelberg Prediction Game with Least Squares Loss | Unknown | N/A | |
| Phasic Policy Gradient | Unknown | N/A | |
| Catastrophic Fisher Explosion: Early Phase Fisher Matrix Impacts Generalization | Unknown | N/A | |
| A Riemannian Block Coordinate Descent Method for Computing the Projection Robust Wasserstein Distance | Unknown | N/A | |
| Near-Optimal Representation Learning for Linear Bandits and Linear RL | Unknown | N/A | |
| How Framelets Enhance Graph Neural Networks | Unknown | N/A | |
| A Sampling-Based Method for Tensor Ring Decomposition | Unknown | N/A | |
| Necessary and sufficient conditions for causal feature selection in time series with latent common causes | Unknown | N/A | |
| Exponential Lower Bounds for Batch Reinforcement Learning: Batch RL can be Exponentially Harder than Online RL | Unknown | N/A | |
| Sawtooth Factorial Topic Embeddings Guided Gamma Belief Network | Unknown | N/A |
ICML 2022
| Title | Author | PDF_Link | Code_URL |
|---|---|---|---|
| Communicating via Markov Decision Processes | Unknown | N/A | |
| Inferring Cause and Effect in the Presence of Heteroscedastic Noise | Unknown | N/A | |
| Simplex Neural Population Learning: Any-Mixture Bayes-Optimality in Symmetric Zero-sum Games | Unknown | N/A | |
| Sample and Communication-Efficient Decentralized Actor-Critic Algorithms with Finite-Time Analysis | Unknown | N/A | |
| Anticorrelated Noise Injection for Improved Generalization | Unknown | N/A | |
| Scalable First-Order Bayesian Optimization via Structured Automatic Differentiation | Unknown | N/A | |
| When AUC meets DRO: Optimizing Partial AUC for Deep Learning with Non-Convex Convergence Guarantee | Unknown | N/A | |
| Regularizing a Model-based Policy Stationary Distribution to Stabilize Offline Reinforcement Learning | Unknown | N/A | |
| The Primacy Bias in Deep Reinforcement Learning | Unknown | N/A | |
| Understanding Gradient Descent on the Edge of Stability in Deep Learning | Unknown | N/A | |
| Connect, Not Collapse: Explaining Contrastive Learning for Unsupervised Domain Adaptation | Unknown | N/A | |
| On Distribution Shift in Learning-based Bug Detectors | Unknown | N/A | |
| SPECTRE: Spectral Conditioning Helps to Overcome the Expressivity Limits of One-shot Graph Generators | Unknown | N/A | |
| Spatial-Channel Token Distillation for Vision MLPs | Unknown | N/A | |
| Understanding The Robustness in Vision Transformers | Unknown | N/A | |
| NeuroFluid: Fluid Dynamics Grounding with Particle-Driven Neural Radiance Fields | Unknown | N/A | |
| Measuring Representational Robustness of Neural Networks Through Shared Invariances | Unknown | N/A | |
| Efficient Approximate Inference for Stationary Kernel on Frequency Domain | Unknown | N/A | |
| Goal Misgeneralization in Deep Reinforcement Learning | Unknown | N/A | |
| Topology-aware Generalization of Decentralized SGD | Unknown | N/A | |
| Human-in-the-loop: Provably Efficient Preference-based Reinforcement Learning with General Function Approximation | Unknown | N/A | |
| Decision-Focused Learning: Through the Lens of Learning to Rank | Unknown | N/A | |
| On Implicit Bias in Overparameterized Bilevel Optimization | Unknown | N/A | |
| Welfare Maximization in Competitive Equilibrium: Reinforcement Learning for Markov Exchange Economy | Unknown | N/A | |
| EquiBind: Geometric Deep Learning for Drug Binding Structure Prediction | Unknown | N/A | |
| Understanding Gradual Domain Adaptation: Improved Analysis, Optimal Path and Beyond | Unknown | N/A | |
| Linear-Time Gromov Wasserstein Distances using Low Rank Couplings and Costs | Unknown | N/A | |
| Input-agnostic Certified Group Fairness via Gaussian Parameter Smoothing | Unknown | N/A | |
| Channel Importance Matters in Few-Shot Image Classification | Unknown | N/A | |
| RECAPP: Crafting a More Efficient Catalyst for Convex Optimization | Unknown | N/A | |
| From Noisy Prediction to True Label: Noisy Prediction Calibration via Generative Model | Unknown | N/A | |
| Supervised Off-Policy Ranking | Unknown | N/A | |
| Steerable 3D Spherical Neurons | Unknown | N/A | |
| Neural Inverse Kinematic | Unknown | N/A | |
| SE(3) Equivariant Graph Neural Networks with Complete Local Frames | Unknown | N/A | |
| DNNR: Differential Nearest Neighbors Regression | Unknown | N/A | |
| Injecting Logical Constraints into Neural Networks via Straight-Through Estimators | Unknown | N/A | |
| Active Sampling for Min-Max Fairness | Unknown | N/A | |
| Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning | Unknown | N/A | |
| Versatile Dueling Bandits: Best-of-both World Analyses for Learning from Relative Preferences | Unknown | N/A | |
| More Than a Toy: Random Matrix Models Predict How Real-World Neural Representations Generalize | Unknown | N/A | |
| The Combinatorial Brain Surgeon: Pruning Weights That Cancel One Another in Neural Networks | Unknown | N/A | |
| Towards Understanding Sharpness-Aware Minimization | Unknown | N/A | |
| AGNAS: Attention-Guided Micro- and Macro-Architecture Search | Unknown | N/A | |
| Improving Policy Optimization with Generalist-Specialist Learning | Unknown | N/A | |
| Short-Term Plasticity Neurons Learning to Learn and Forget | Unknown | N/A | |
| Adaptive Model Design for Markov Decision Process | Unknown | N/A | |
| Unsupervised Ground Metric Learning Using Wasserstein Singular Vectors | Unknown | N/A | |
| Penalizing Gradient Norm for Efficiently Improving Generalization in Deep Learning | Unknown | N/A | |
| Orchestra: Unsupervised Federated Learning via Globally Consistent Clustering | Unknown | N/A | |
| Matching Learned Causal Effects of Neural Networks with Domain Priors | Unknown | N/A | |
| Bayesian Deep Embedding Topic Meta-Learner | Unknown | N/A | |
| REvolveR: Continuous Evolutionary Models for Robot-to-robot Policy Transfer | Unknown | N/A | |
| Exploring the Gap between Collapsed & Whitened Features in Self-Supervised Learning | Unknown | N/A | |
| Guarantees for Epsilon-Greedy Reinforcement Learning with Function Approximation | Unknown | N/A | |
| Accurate Quantization of Measures via Interacting Particle-based Optimization | Unknown | N/A | |
| Planning with Diffusion for Flexible Behavior Synthesis | Unknown | N/A | |
| An Exact Symbolic Reduction of Linear Smart Predict+Optimize to Mixed Integer Linear Programming | Unknown | N/A | |
| Equivariant Priors for compressed sensing with unknown orientation | Unknown | N/A | |
| Neural Language Models are not Born Equal to Fit Brain Data, but Training Helps | Unknown | N/A | |
| On the Role of Discount Factor in Offline Reinforcement Learning | Unknown | N/A | |
| Adapting the Linearised Laplace Model Evidence for Modern Deep Learning | Unknown | N/A | |
| Communication-Efficient Adaptive Federated Learning | Unknown | N/A | |
| Content Addressable Memory Without Catastrophic Forgetting by Heteroassociation with a Fixed Scaffold | Unknown | N/A | |
| Continual Learning via Sequential Function-Space Variational Inference | Unknown | N/A | |
| Bitwidth Heterogeneous Federated Learning with Progressive Weight Dequantization | Unknown | N/A | |
| Do More Negative Samples Necessarily Hurt In Contrastive Learning? | Unknown | N/A | |
| Fast Population-Based Reinforcement Learning on a Single Machine | Unknown | N/A | |
| NeuralEF: Deconstructing Kernels by Deep Neural Networks | Unknown | N/A | |
| Private Streaming SCO in $\ell_p$ geometry with Applications in High Dimensional Online Decision Making | Unknown | N/A | |
| Robin Hood and Matthew Effects: Differential Privacy Has Disparate Impact on Synthetic Data | Unknown | N/A | |
| Robust Training of Neural Networks Using Scale Invariant Architectures | Unknown | N/A | |
| Generalization and Robustness Implications in Object-Centric Learning | Unknown | N/A | |
| Knowledge-Grounded Self-Rationalization via Extractive and Natural Language Explanations | Unknown | N/A | |
| Multicoated Supermasks Enhance Hidden Networks | Unknown | N/A | |
| Time Is MattEr: Temporal Self-supervision for Video Transformers | Unknown | N/A | |
| Privacy for Free: How does Dataset Condensation Help Privacy? | Unknown | N/A | |
| Discovering Generalizable Spatial Goal Representations via Graph-based Active Reward Learning | Unknown | N/A | |
| Stable Conformal Prediction Sets | Unknown | N/A | |
| Correct-N-Contrast: a Contrastive Approach for Improving Robustness to Spurious Correlations | Unknown | N/A | |
| Selective Regression under Fairness Criteria | Unknown | N/A | |
| Towards Scaling Difference Target Propagation by Learning Backprop Targets | Unknown | N/A | |
| Revisiting Label Smoothing and Knowledge Distillation Compatibility: What was Missing? | Unknown | N/A | |
| Personalized Federated Learning through Local Memorization | Unknown | N/A | |
| SkexGen: Autoregressive Generation of CAD Construction Sequences with Disentangled Codebooks | Unknown | N/A | |
| Lyapunov Density Models: Constraining Distribution Shift in Learning-Based Control | Unknown | N/A | |
| Branchformer: Parallel MLP-Attention Architectures to Capture Local and Global Context for Speech Recognition and Understanding | Unknown | N/A | |
| Deep Safe Incomplete Multi-view Clustering: Theorem and Algorithm | Unknown | N/A | |
| On the Adversarial Robustness of Causal Algorithmic Recourse | Unknown | N/A | |
| Virtual Homogeneity Learning: Defending against Data Heterogeneity in Federated Learning | Unknown | N/A | |
| Coin Flipping Neural Networks | Unknown | N/A | |
| Neurotoxin: Durable Backdoors in Federated Learning | Unknown | N/A | |
| Dataset Condensation with Contrastive Signals | Unknown | N/A | |
| How to Stay Curious while avoiding Noisy TVs using Aleatoric Uncertainty Estimation | Unknown | N/A | |
| Deep Network Approximation in Terms of Intrinsic Parameters | Unknown | N/A | |
| A Convergent and Dimension-Independent Min-Max Optimization Algorithm | Unknown | N/A | |
| Adversarially Robust Models may not Transfer Better: Sufficient Conditions for Domain Transferability from the View of Regularization | Unknown | N/A | |
| Implicit Regularization with Polynomial Growth in Deep Tensor Factorization | Unknown | N/A | |
| Demystifying the Adversarial Robustness of Random Transformation Defenses | Unknown | N/A | |
| Marginal Tail-Adaptive Normalizing Flows | Unknown | N/A | |
| Image-to-Image Regression with Distribution-Free Uncertainty Quantification and Applications in Imaging | Unknown | N/A | |
| NP-Match: When Neural Processes meet Semi-Supervised Learning | Unknown | N/A | |
| Representation Topology Divergence: A Method for Comparing Neural Network Representations | Unknown | N/A | |
| Understanding Contrastive Learning Requires Incorporating Inductive Biases | Unknown | N/A | |
| Neural Implicit Dictionary Learning via Mixture-of-Expert Training | Unknown | N/A | |
| POET: Training Neural Networks on Tiny Devices with Integrated Rematerialization and Paging | Unknown | N/A | |
| Score-based Generative Modeling of Graphs via the System of Stochastic Differential Equations | Unknown | N/A | |
| Retroformer: Pushing the Limits of End-to-end Retrosynthesis Transformer | Unknown | N/A | |
| SpaceMAP: Visualizing High-Dimensional Data by Space Expansion | Unknown | N/A | |
| Meta-Learning Hypothesis Spaces for Sequential Decision-making | Unknown | N/A | |
| Lazy Estimation of Variable Importance for Large Neural Networks | Unknown | N/A | |
| A Single-Loop Gradient Descent and Perturbed Ascent Algorithm for Nonconvex Functional Constrained Optimization | Unknown | N/A | |
| Partial disentanglement for domain adaptation | Unknown | N/A | |
| Near-Optimal Algorithms for Autonomous Exploration and Multi-Goal Stochastic Shortest Path | Unknown | N/A | |
| Convergence Rates of Non-Convex Stochastic Gradient Descent Under a Generic Lojasiewicz Condition and Local Smoothness | Unknown | N/A | |
| Langevin Monte Carlo for Contextual Bandits | Unknown | N/A | |
| Influence-Augmented Local Simulators: a Scalable Solution for Fast Deep RL in Large Networked Systems | Unknown | N/A | |
| Tractable Uncertainty for Structure Learning | Unknown | N/A | |
| Multi-slots Online Matching with High Entropy | Unknown | N/A | |
| Data Determines Distributional Robustness in Contrastive Language Image Pre-training (CLIP) | Unknown | N/A | |
| Self-Supervised Models of Audio Effectively Explain Human Cortical Responses to Speech | Unknown | N/A | |
| GNNRank: Learning Global Rankings from Pairwise Comparisons via Directed Graph Neural Networks | Unknown | N/A | |
| Deconfounded Value Decomposition for Multi-Agent Reinforcement Learning | Unknown | N/A | |
| Wide Bayesian neural networks have a simple weight posterior: theory and accelerated sampling | Unknown | N/A | |
| On the Generalization Analysis of Adversarial Learning | Unknown | N/A | |
| Extended Unconstrained Features Model for Exploring Deep Neural Collapse | Unknown | N/A | |
| Proximal Denoiser for Convergent Plug-and-Play Optimization with Nonconvex Regularization | Unknown | N/A | |
| Zero-Shot Reward Specification via Grounded Natural Language | Unknown | N/A | |
| VLMixer: Unpaired Vision-Language Pre-training via Cross-Modal CutMix | Unknown | N/A | |
| Benefits of Overparameterized Convolutional Residual Networks: Function Approximation under Smoothness Constraint | Unknown | N/A | |
| Generative Cooperative Networks for Natural Language Generation | Unknown | N/A | |
| Multi-scale Feature Learning Dynamics: Insights for Double Descent | Unknown | N/A | |
| On Convergence of Gradient Descent Ascent: A Tight Local Analysis | Unknown | N/A | |
| Reachability Constrained Reinforcement Learning | Unknown | N/A | |
| A State-Distribution Matching Approach to Non-Episodic Reinforcement Learning | Unknown | N/A | |
| TAM: Topology-Aware Margin Loss for Class-Imbalanced Node Classification | Unknown | N/A | |
| 3DLinker: An E(3) Equivariant Variational Autoencoder for Molecular Linker Design | Unknown | N/A | |
| Deep Variational Graph Convolutional Recurrent Network for Multivariate Time Series Anomaly Detection | Unknown | N/A | |
| Differentially Private Coordinate Descent for Composite Empirical Risk Minimization | Unknown | N/A | |
| Calibrated Learning to Defer with One-vs-All Classifiers | Unknown | N/A | |
| Multiclass learning with margin: exponential rates with no bias-variance trade-off | Unknown | N/A | |
| Prompting Decision Transformer for Few-Shot Policy Generalization | Unknown | N/A | |
| TSPipe: Learn from Teacher Faster with Pipelines | Unknown | N/A | |
| Self-conditioning Pre-Trained Language Models | Unknown | N/A | |
| Fast Aquatic Swimmer Optimization with Differentiable Projective Dynamics and Neural Network Hydrodynamic Models | Unknown | N/A | |
| Subspace Learning for Effective Meta-Learning | Unknown | N/A | |
| The power of first-order smooth optimization for black-box non-smooth problems | Unknown | N/A | |
| Proving Theorems using Incremental Learning and Hindsight Experience Replay | Unknown | N/A | |
| Rethinking Attention-Model Explainability through Faithfulness Violation Test | Unknown | N/A | |
| Inductive Biases and Variable Creation in Self-Attention Mechanisms | Unknown | N/A | |
| A data-driven approach for learning to control computers | Unknown | N/A | |
| Causal Imitation Learning under Temporally Correlated Noise | Unknown | N/A | |
| Delayed Reinforcement Learning by Imitation | Unknown | N/A | |
| Minimizing Control for Credit Assignment with Strong Feedback | Unknown | N/A | |
| Markov Chain Monte Carlo for Continuous-Time Switching Dynamical Systems | Unknown | N/A | |
| ProxSkip: Yes! Local Gradient Steps Provably Lead to Communication Acceleration! Finally! | Unknown | N/A | |
| Generalizing to Evolving Domains with Latent Structure-Aware Sequential Autoencoder | Unknown | N/A | |
| A Unified Weight Initialization Paradigm for Tensorial Convolutional Neural Networks | Unknown | N/A | |
| CtrlFormer: Learning Transferable State Representation for Visual Control via Transformer | Unknown | N/A | |
| On the Robustness of CountSketch to Adaptive Inputs | Unknown | N/A | |
| An Initial Alignment between Neural Network and Target is Needed for Gradient Descent to Learn | Unknown | N/A | |
| Utilizing Expert Features for Contrastive Learning of Time-Series Representations | Unknown | N/A | |
| Region-Based Semantic Factorization in GANs | Unknown | N/A | |
| Evaluating the Adversarial Robustness of Adaptive Test-time Defenses | Unknown | N/A | |
| The Teaching Dimension of Regularized Kernel Learners | Unknown | N/A | |
| Selling Data To a Machine Learner: Pricing via Costly Signaling | Unknown | N/A | |
| Learning to Incorporate Texture Saliency Adaptive Attention to Image Cartoonization | Unknown | N/A | |
| Hindering Adversarial Attacks with Implicit Neural Representations | Unknown | N/A | |
| Improved Convergence Rates for Sparse Approximation Methods in Kernel-Based Learning | Unknown | N/A | |
| Unsupervised Detection of Contextualized Embedding Bias with Application to Ideology | Unknown | N/A | |
| Context-Aware Drift Detection | Unknown | N/A | |
| Neural Fisher Discriminant Analysis: Optimal Neural Network Embeddings in Polynomial Time | Unknown | N/A | |
| Efficient Learning for AlphaZero via Path Consistency | Unknown | N/A | |
| Zero-shot AutoML with Pretrained Models | Unknown | N/A | |
| Offline RL Policies Should Be Trained to be Adaptive | Unknown | N/A | |
| Improving Out-of-Distribution Robustness via Selective Augmentation | Unknown | N/A | |
| Feature Learning and Signal Propagation in Deep Neural Networks | Unknown | N/A | |
| Learning to Predict Graphs with Fused Gromov-Wasserstein Barycenters | Unknown | N/A | |
| LeNSE: Learning To Navigate Subgraph Embeddings for Large-Scale Combinatorial Optimisation | Unknown | N/A | |
| Continuous-Time Analysis of Accelerated Gradient Methods via Conservation Laws in Dilated Coordinate Systems | Unknown | N/A | |
| Stabilizing Off-Policy Deep Reinforcement Learning from Pixels | Unknown | N/A | |
| DreamerPro: Reconstruction-Free Model-Based Reinforcement Learning with Prototypical Representations | Unknown | N/A | |
| Bayesian Model Selection, the Marginal Likelihood, and Generalization | Unknown | N/A | |
| Structure-Aware Transformer for Graph Representation Learning | Unknown | N/A | |
| Generalized Leverage Scores: Geometric Interpretation and Applications | Unknown | N/A | |
| Flow-Guided Sparse Transformer for Video Deblurring | Unknown | N/A | |
| FEDformer: Frequency Enhanced Decomposed Transformer for Long-term Series Forecasting | Unknown | N/A | |
| StreamingQA: A Benchmark for Adaptation to New Knowledge over Time in Question Answering Models | Unknown | N/A | |
| Learning from a Learning User for Optimal Recommendations | Unknown | N/A | |
| Towards Evaluating Adaptivity of Model-Based Reinforcement Learning Methods | Unknown | N/A | |
| Modular Conformal Calibration | Unknown | N/A | |
| PACE: A Parallelizable Computation Encoder for Directed Acyclic Graphs | Unknown | N/A | |
| Private optimization in the interpolation regime: faster rates and hardness results | Unknown | N/A | |
| Neuron Dependency Graphs: A Causal Abstraction of Neural Networks | Unknown | N/A | |
| Searching for BurgerFormer with Micro-Meso-Macro Space Design | Unknown | N/A | |
| Rethinking Graph Neural Networks for Anomaly Detection | Unknown | N/A | |
| The Geometry of Robust Value Functions | Unknown | N/A | |
| A Tighter Analysis of Spectral Clustering, and Beyond | Unknown | N/A | |
| Directed Acyclic Transformer for Non-Autoregressive Machine Translation | Unknown | N/A | |
| Lagrangian Method for Q-Function Learning (with Applications to Machine Translation) | Unknown | N/A | |
| Model-Value Inconsistency as a Signal for Epistemic Uncertainty | Unknown | N/A | |
| Exact Learning of Preference Structure: Single-peaked Preferences and Beyond | Unknown | N/A | |
| Training Characteristic Functions with Reinforcement Learning: XAI-methods play Connect Four | Unknown | N/A | |
| Boosting Graph Structure Learning with Dummy Nodes | Unknown | N/A | |
| Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied Agents | Unknown | N/A | |
| Learning General Halfspaces with Adversarial Label Noise via Online Gradient Descent | Unknown | N/A | |
| Robust Fine-Tuning of Deep Neural Networks with Hessian-based Generalization Guarantees | Unknown | N/A | |
| Sequential- and Parallel- Constrained Max-value Entropy Search via Information Lower Bound | Unknown | N/A | |
| Approximate Frank-Wolfe Algorithms over Graph-structured Support Sets | Unknown | N/A | |
| A Statistical Manifold Framework for Point Cloud Data | Unknown | N/A | |
| Stochastic smoothing of the top-K calibrated hinge loss for deep imbalanced classification | Unknown | N/A | |
| Iterative Double Sketching for Faster Least-Squares Optimization | Unknown | N/A | |
| Test-Time Training Can Close the Natural Distribution Shift Performance Gap in Deep Learning Based Compressed Sensing | Unknown | N/A | |
| The Poisson Binomial Mechanism for Unbiased Federated Learning with Secure Aggregation | Unknown | N/A | |
| Query-Efficient and Scalable Black-Box Adversarial Attacks on Discrete Sequential Data via Bayesian Optimization | Unknown | N/A | |
| Optimizing Tensor Network Contraction Using Reinforcement Learning | Unknown | N/A | |
| Distribution Regression with Sliced Wasserstein Kernels | Unknown | N/A | |
| Quantification and Analysis of Layer-wise and Pixel-wise Information Discarding | Unknown | N/A | |
| Understanding Doubly Stochastic Clustering | Unknown | N/A | |
| Functional Generalized Empirical Likelihood Estimation for Conditional Moment Restrictions | Unknown | N/A | |
| On the Sample Complexity of Learning Infinite-horizon Discounted Linear Kernel MDPs | Unknown | N/A | |
| Latent Outlier Exposure for Anomaly Detection with Contaminated Data | Unknown | N/A | |
| Simultaneously Learning Stochastic and Adversarial Bandits with General Graph Feedback | Unknown | N/A | |
| Set Based Stochastic Subsampling | Unknown | N/A | |
| Neural Tangent Kernel Analysis of Deep Narrow Neural Networks | Unknown | N/A | |
| Fully-Connected Network on Noncompact Symmetric Space and Ridgelet Transform based on Helgason-Fourier Analysis | Unknown | N/A | |
| Tackling Data Heterogeneity: A New Unified Framework for Decentralized SGD with Sample-induced Topology | Unknown | N/A | |
| Co-training Improves Prompt-based Learning for Large Language Models | Unknown | N/A | |
| Variational Feature Pyramid Networks | Unknown | N/A | |
| Probabilistic Bilevel Coreset Selection | Unknown | N/A | |
| Efficient Distributionally Robust Bayesian Optimization with Worst-case Sensitivity | Unknown | N/A | |
| Perfectly Balanced: Improving Transfer and Robustness of Supervised Contrastive Learning | Unknown | N/A | |
| XAI for Transformers: Better Explanations through Conservative Propagation | Unknown | N/A | |
| General-purpose, long-context autoregressive modeling with Perceiver AR | Unknown | N/A | |
| Equivalence Analysis between Counterfactual Regret Minimization and Online Mirror Descent | Unknown | N/A | |
| TURF: Two-Factor, Universal, Robust, Fast Distribution Learning Algorithm | Unknown | N/A | |
| Flashlight: Enabling Innovation in Tools for Machine Learning | Unknown | N/A | |
| Model Agnostic Sample Reweighting for Out-of-Distribution Learning | Unknown | N/A | |
| Nonparametric Involutive Markov Chain Monte Carlo | Unknown | N/A | |
| Identity-Disentangled Adversarial Augmentation for Self-supervised Learning | Unknown | N/A | |
| Fast and Provable Nonconvex Tensor RPCA | Unknown | N/A | |
| Certified Robustness Against Natural Language Attacks by Causal Intervention | Unknown | N/A | |
| A Simple Unified Framework for High Dimensional Bandit Problems | Unknown | N/A | |
| Interventional Contrastive Learning with Meta Semantic Regularizer | Unknown | N/A | |
| Parametric Visual Program Induction with Function Modularization | Unknown | N/A | |
| Generalized Beliefs for Cooperative AI | Unknown | N/A | |
| Pocket2Mol: Efficient Molecular Sampling Based on 3D Protein Pockets | Unknown | N/A | |
| Unaligned Supervision for Automatic Music Transcription in The Wild | Unknown | N/A | |
| Fisher SAM: Information Geometry and Sharpness Aware Minimisation | Unknown | N/A | |
| The Algebraic Path Problem for Graph Metrics | Unknown | N/A | |
| Overcoming Oscillations in Quantization-Aware Training | Unknown | N/A | |
| Individual Reward Assisted Multi-Agent Reinforcement Learning | Unknown | N/A | |
| Causal Inference Through the Structural Causal Marginal Problem | Unknown | N/A | |
| Improving Ensemble Distillation With Weight Averaging and Diversifying Perturbation | Unknown | N/A | |
| Sparse Mixed Linear Regression with Guarantees: Taming an Intractable Problem with Invex Relaxation | Unknown | N/A | |
| SCHA-VAE: Hierarchical Context Aggregation for Few-Shot Generation | Unknown | N/A | |
| Permutation Search of Tensor Network Structures via Local Sampling | Unknown | N/A | |
| Exploiting Redundancy: Separable Group Convolutional Networks on Lie Groups | Unknown | N/A | |
| AnyMorph: Learning Transferable Polices By Inferring Agent Morphology | Unknown | N/A | |
| Adaptive Conformal Predictions for Time Series | Unknown | N/A | |
| Unified Scaling Laws for Routed Language Models | Unknown | N/A | |
| Bayesian Optimization for Distributionally Robust Chance-constrained Problem | Unknown | N/A | |
| Provable Domain Generalization via Invariant-Feature Subspace Recovery | Unknown | N/A | |
| Revisiting Consistency Regularization for Deep Partial Label Learning | Unknown | N/A | |
| PoF: Post-Training of Feature Extractor for Improving Generalization | Unknown | N/A | |
| A Closer Look at Smoothness in Domain Adversarial Training | Unknown | N/A | |
| SpeqNets: Sparsity-aware permutation-equivariant graph networks | Unknown | N/A | |
| Off-Policy Fitted Q-Evaluation with Differentiable Function Approximators: Z-Estimation and Inference Theory | Unknown | N/A | |
| Fast Lossless Neural Compression with Integer-Only Discrete Flows | Unknown | N/A | |
| How Powerful are Spectral Graph Neural Networks | Unknown | N/A | |
| A Deep Learning Approach for the Segmentation of Electroencephalography Data in Eye Tracking Applications | Unknown | N/A | |
| Align-RUDDER: Learning From Few Demonstrations by Reward Redistribution | Unknown | N/A | |
| Proximal Exploration for Model-guided Protein Sequence Design | Unknown | N/A | |
| Active Learning on a Budget: Opposite Strategies Suit High and Low Budgets | Unknown | N/A | |
| Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations | Unknown | N/A | |
| Data Augmentation as Feature Manipulation | Unknown | N/A | |
| Efficient Low Rank Convex Bounds for Pairwise Discrete Graphical Models | Unknown | N/A | |
| UniRank: Unimodal Bandit Algorithms for Online Ranking | Unknown | N/A | |
| VLUE: A Multi-Task Multi-Dimension Benchmark for Evaluating Vision-Language Pre-training | Unknown | N/A | |
| Label-Descriptive Patterns and Their Application to Characterizing Classification Errors | Unknown | N/A | |
| Online and Consistent Correlation Clustering | Unknown | N/A | |
| FedNest: Federated Bilevel, Minimax, and Compositional Optimization | Unknown | N/A | |
| Near-optimal rate of consistency for linear models with missing values | Unknown | N/A | |
| Optimal and Efficient Dynamic Regret Algorithms for Non-Stationary Dueling Bandits | Unknown | N/A | |
| Bayesian Optimization under Stochastic Delayed Feedback | Unknown | N/A | |
| Online Nonsubmodular Minimization with Delayed Costs: From Full Information to Bandit Feedback | Unknown | N/A | |
| Learning of Cluster-based Feature Importance for Electronic Health Record Time-series | Unknown | N/A | |
| Global Optimization of K-Center Clustering | Unknown | N/A | |
| EDEN: Communication-Efficient and Robust Distributed Mean Estimation for Federated Learning | Unknown | N/A | |
| Variational nearest neighbor Gaussian process | Unknown | N/A | |
| A Completely Tuning-Free and Robust Approach to Sparse Precision Matrix Estimation | Unknown | N/A | |
| Generalized Data Distribution Iteration | Unknown | N/A | |
| Anytime Information Cascade Popularity Prediction via Self-Exciting Processes | Unknown | N/A | |
| Secure Quantized Training for Deep Learning | Unknown | N/A | |
| Iterative Hard Thresholding with Adaptive Regularization: Sparser Solutions Without Sacrificing Runtime | Unknown | N/A | |
| Improving Adversarial Robustness via Mutual Information Estimation | Unknown | N/A | |
| Black-Box Tuning for Language-Model-as-a-Service | Unknown | N/A | |
| A Context-Integrated Transformer-Based Neural Network for Auction Design | Unknown | N/A | |
| Statistical inference with implicit SGD: proximal Robbins-Monro vs. Polyak-Ruppert | Unknown | N/A | |
| Neurocoder: General-Purpose Computation Using Stored Neural Programs | Unknown | N/A | |
| Learning Infinite-horizon Average-reward Markov Decision Process with Constraints | Unknown | N/A | |
| Stochastic Contextual Dueling Bandits under Linear Stochastic Transitivity Models | Unknown | N/A | |
| Sparse Invariant Risk Minimization | Unknown | N/A | |
| Winning the Lottery Ahead of Time: Efficient Early Network Pruning | Unknown | N/A | |
| Implicit Bias of the Step Size in Linear Diagonal Neural Networks | Unknown | N/A | |
| Intriguing Properties of Input-Dependent Randomized Smoothing | Unknown | N/A | |
| Latent Diffusion Energy-Based Model for Interpretable Text Modelling | Unknown | N/A | |
| Scalable MCMC Sampling for Nonsymmetric Determinantal Point Processes | Unknown | N/A | |
| Bregman Neural Networks | Unknown | N/A | |
| ModLaNets: Learning Generalisable Dynamics via Modularity and Physical Inductive Bias | Unknown | N/A | |
| Graph Neural Architecture Search Under Distribution Shifts | Unknown | N/A | |
| Quantifying and Learning Linear Symmetry-Based Disentanglement | Unknown | N/A | |
| Certifying Out-of-Domain Generalization for Blackbox Functions | Unknown | N/A | |
| Constants Matter: The Performance Gains of Active Learning | Unknown | N/A | |
| AutoSNN: Towards Energy-Efficient Spiking Neural Networks | Unknown | N/A | |
| A Modern Self-Referential Weight Matrix That Learns to Modify Itself | Unknown | N/A | |
| The Complexity of k-Means Clustering when Little is Known | Unknown | N/A | |
| Gradient Based Clustering | Unknown | N/A | |
| Modeling Adversarial Noise for Adversarial Training | Unknown | N/A | |
| Balancing Discriminability and Transferability for Source-Free Domain Adaptation | Unknown | N/A | |
| Closed-Form Diffeomorphic Transformations for Time Series Alignment | Unknown | N/A | |
| Dynamic Regret of Online Markov Decision Processes | Unknown | N/A | |
| A Multi-objective / Multi-task Learning Framework Induced by Pareto Stationarity | Unknown | N/A | |
| Spectral Representation of Robustness Measures for Optimization Under Input Uncertainty | Unknown | N/A | |
| Individual Preference Stability for Clustering | Unknown | N/A | |
| Koopman Q-learning: Offline Reinforcement Learning via Symmetries of Dynamics | Unknown | N/A | |
| Importance Weighted Kernel Bayes' Rule | Unknown | N/A | |
| Instrumental Variable Regression with Confounder Balancing | Unknown | N/A | |
| Optimal Clustering with Noisy Queries via Multi-Armed Bandit | Unknown | N/A | |
| Optimization-Derived Learning with Essential Convergence Analysis of Training and Hyper-training | Unknown | N/A | |
| It’s Raw! Audio Generation with State-Space Models | Unknown | N/A | |
| Nyström Kernel Mean Embeddings | Unknown | N/A | |
| Pessimistic Minimax Value Iteration: Provably Efficient Equilibrium Learning from Offline Datasets | Unknown | N/A | |
| An Asymptotic Test for Conditional Independence using Analytic Kernel Embeddings | Unknown | N/A | |
| Causal Transformer for Estimating Counterfactual Outcomes | Unknown | N/A | |
| A Theoretical Analysis on Independence-driven Importance Weighting for Covariate-shift Generalization | Unknown | N/A | |
| Convolutional and Residual Networks Provably Contain Lottery Tickets | Unknown | N/A | |
| Interpretable Off-Policy Learning via Hyperbox Search | Unknown | N/A | |
| Soft Truncation: A Universal Training Technique of Score-based Diffusion Model for High Precision Score Estimation | Unknown | N/A | |
| UNIREX: A Unified Learning Framework for Language Model Rationale Extraction | Unknown | N/A | |
| Self-Organized Polynomial-Time Coordination Graphs | Unknown | N/A | |
| Model-based Meta Reinforcement Learning using Graph Structured Surrogate Models and Amortized Policy Search | Unknown | N/A | |
| HyperTransformer: Model Generation for Supervised and Semi-Supervised Few-Shot Learning | Unknown | N/A | |
| Predicting Out-of-Distribution Error with the Projection Norm | Unknown | N/A | |
| Solving Stackelberg Prediction Game with Least Squares Loss via Spherically Constrained Least Squares Reformulation | Unknown | N/A | |
| Generative Modeling for Multi-task Visual Learning | Unknown | N/A | |
| An Intriguing Property of Geophysics Inversion | Unknown | N/A | |
| Phasic Self-Imitative Reduction for Sparse-Reward Goal-Conditioned Reinforcement Learning | Unknown | N/A | |
| A Marriage between Adversarial Team Games and 2-player Games: Enabling Abstractions, No-regret Learning, and Subgame Solving | Unknown | N/A | |
| Let Invariant Rationale Discovery Inspire Graph Contrastive Learning | Unknown | N/A | |
| Fast rates for noisy interpolation require rethinking the effect of inductive bias | Unknown | N/A | |
| Provably Adversarially Robust Nearest Prototype Classifiers | Unknown | N/A | |
| Continual Learning with Guarantees via Weight Interval Constraints | Unknown | N/A | |
| You Only Cut Once: Boosting Data Augmentation with a Single Cut | Unknown | N/A | |
| A Dynamical System Perspective for Lipschitz Neural Networks | Unknown | N/A | |
| Modality Competition: What Makes Joint Training of Multi-modal Network Fail in Deep Learning? (Provably) | Unknown | N/A | |
| Strategic Representation | Unknown | N/A | |
| A Convergence Theory for SVGD in the Population Limit under Talagrand's Inequality T1 | Unknown | N/A | |
| On the Learning of Non-Autoregressive Transformers | Unknown | N/A | |
| Actor-Critic based Improper Reinforcement Learning | Unknown | N/A | |
| Byzantine Machine Learning Made Easy By Resilient Averaging of Momentums | Unknown | N/A | |
| Reinforcement Learning from Partial Observation: Linear Function Approximation with Provable Sample Efficiency | Unknown | N/A | |
| Understanding and Improving Knowledge Graph Embedding for Entity Alignment | Unknown | N/A | |
| Plan Better Amid Conservatism: Offline Multi-Agent Reinforcement Learning with Actor Rectification | Unknown | N/A | |
| A$^3$T: Alignment-Aware Acoustic and Text Pretraining for Speech Synthesis and Editing | Unknown | N/A | |
| Dialog Inpainting: Turning Documents into Dialogs | Unknown | N/A | |
| How to Fill the Optimum Set? Population Gradient Descent with Harmless Diversity | Unknown | N/A | |
| Measuring dissimilarity with diffeomorphism invariance | Unknown | N/A | |
| Learning Pseudometric-based Action Representations for Offline Reinforcement Learning | Unknown | N/A | |
| Fast Composite Optimization and Statistical Recovery in Federated Learning | Unknown | N/A | |
| Efficiently Learning the Topology and Behavior of a Networked Dynamical System Via Active Queries | Unknown | N/A | |
| MASER: Multi-Agent Reinforcement Learning with Subgoals Generated from Experience Replay Buffer | Unknown | N/A | |
| Fair and Fast k-Center Clustering for Data Summarization | Unknown | N/A | |
| Deep Reference Priors: What is the best way to pretrain a model? | Unknown | N/A | |
| Improving Task-free Continual Learning by Distributionally Robust Memory Evolution | Unknown | N/A | |
| ProGCL: Rethinking Hard Negative Mining in Graph Contrastive Learning | Unknown | N/A | |
| Continual Repeated Annealed Flow Transport Monte Carlo | Unknown | N/A | |
| HyperImpute: Generalized Iterative Imputation with Automatic Model Selection | Unknown | N/A | |
| Revisiting Online Submodular Minimization: Gap-Dependent Regret Bounds, Best of Both Worlds and Adversarial Robustness | Unknown | N/A | |
| Data Scaling Laws in NMT: The Effect of Noise and Architecture | Unknown | N/A | |
| Improving Mini-batch Optimal Transport via Partial Transportation | Unknown | N/A | |
| DepthShrinker: A New Compression Paradigm Towards Boosting Real-Hardware Efficiency of Compact Neural Networks | Unknown | N/A | |
| Learning Iterative Reasoning through Energy Minimization | Unknown | N/A | |
| Efficient Computation of Higher-Order Subgraph Attribution via Message Passing | Unknown | N/A | |
| Linearity Grafting: Relaxed Neuron Pruning Helps Certifiable Robustness | Unknown | N/A | |
| Robust Imitation Learning against Variations in Environment Dynamics | Unknown | N/A | |
| Accelerated Federated Learning with Decoupled Adaptive Optimization | Unknown | N/A | |
| A Consistent and Efficient Evaluation Strategy for Attribution Methods | Unknown | N/A | |
| Private Adaptive Optimization with Side information | Unknown | N/A | |
| A Difference Standardization Method for Mutual Transfer Learning | Unknown | N/A | |
| Memory-Based Model Editing at Scale | Unknown | N/A | |
| Revisiting End-to-End Speech-to-Text Translation From Scratch | Unknown | N/A | |
| Transformer Neural Processes: Uncertainty-Aware Meta Learning Via Sequence Modeling | Unknown | N/A | |
| The Fundamental Price of Secure Aggregation in Differentially Private Federated Learning | Unknown | N/A | |
| Causal structure-based root cause analysis of outliers | Unknown | N/A | |
| Learning to Estimate and Refine Fluid Motion with Physical Dynamics | Unknown | N/A | |
| Comprehensive Analysis of Negative Sampling in Knowledge Graph Representation Learning | Unknown | N/A | |
| Showing Your Offline Reinforcement Learning Work: Online Evaluation Budget Matters | Unknown | N/A | |
| Generalised Policy Improvement with Geometric Policy Composition | Unknown | N/A | |
| Popular decision tree algorithms are provably noise tolerant | Unknown | N/A | |
| Only tails matter: Average-Case Universality and Robustness in the Convex Regime | Unknown | N/A | |
| Revisiting Some Common Practices in Cooperative Multi-Agent Reinforcement Learning | Unknown | N/A | |
| Scalable Deep Reinforcement Learning Algorithms for Mean Field Games | Unknown | N/A | |
| Bisimulation Makes Analogies in Goal-Conditioned Reinforcement Learning | Unknown | N/A | |
| DAVINZ: Data Valuation using Deep Neural Networks at Initialization | Unknown | N/A | |
| What Dense Graph Do You Need for Self-Attention? | Unknown | N/A | |
| Finite-Sum Coupled Compositional Stochastic Optimization: Theory and Applications | Unknown | N/A | |
| Mitigating Modality Collapse in Multimodal VAEs via Impartial Optimization | Unknown | N/A | |
| Inductive Matrix Completion: No Bad Local Minima and a Fast Algorithm | Unknown | N/A | |
| A Unified View on PAC-Bayes Bounds for Meta-Learning | Unknown | N/A | |
| Online Continual Learning through Mutual Information Maximization | Unknown | N/A | |
| Dimension-free Complexity Bounds for High-order Nonconvex Finite-sum Optimization | Unknown | N/A | |
| Learning Dynamics and Generalization in Deep Reinforcement Learning | Unknown | N/A | |
| Certified Adversarial Robustness Under the Bounded Support Set | Unknown | N/A | |
| ASAP.SGD: Instance-based Adaptiveness to Staleness in Asynchronous SGD | Unknown | N/A | |
| Breaking Down Out-of-Distribution Detection: Many Methods Based on OOD Training Data Estimate a Combination of the Same Core Quantities | Unknown | N/A | |
| Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep Networks | Unknown | N/A | |
| Near-Optimal Learning of Extensive-Form Games with Imperfect Information | Unknown | N/A | |
| Contrastive Learning with Boosted Memorization | Unknown | N/A | |
| Improving Language Models by Retrieving from Trillions of Tokens | Unknown | N/A | |
| UAST: Uncertainty-Aware Siamese Tracking | Unknown | N/A | |
| Blurs Behave Like Ensembles: Spatial Smoothings to Improve Accuracy, Uncertainty, and Robustness | Unknown | N/A | |
| Online Learning for Min Sum Set Cover and Pandora’s Box | Unknown | N/A | |
| Faster Privacy Accounting via Evolving Discretization | Unknown | N/A | |
| Gradient-Free Method for Heavily Constrained Nonconvex Optimization | Unknown | N/A | |
| An Equivalence Between Data Poisoning and Byzantine Gradient Attacks | Unknown | N/A | |
| Disentangling Sources of Risk for Distributional Multi-Agent Reinforcement Learning | Unknown | N/A | |
| A Temporal-Difference Approach to Policy Gradient Estimation | Unknown | N/A | |
| Framework for Evaluating Faithfulness of Local Explanations | Unknown | N/A | |
| Learning Multiscale Transformer Models for Sequence Generation | Unknown | N/A | |
| Collaboration of Experts: Achieving 80% Top-1 Accuracy on ImageNet with 100M FLOPs | Unknown | N/A | |
| Mirror Learning: A Unifying Framework of Policy Optimisation | Unknown | N/A | |
| Denoised MDPs: Learning World Models Better Than the World Itself | Unknown | N/A | |
| Identifiability Conditions for Domain Adaptation | Unknown | N/A | |
| Nested Bandits | Unknown | N/A | |
| Federated Minimax Optimization: Improved Convergence Analyses and Algorithms | Unknown | N/A | |
| A Simple yet Universal Strategy for Online Convex Optimization | Unknown | N/A | |
| SPDY: Accurate Pruning with Speedup Guarantees | Unknown | N/A | |
| Interpretable and Generalizable Graph Learning via Stochastic Attention Mechanism | Unknown | N/A | |
| Functional Output Regression with Infimal Convolution: Exploring the Huber and $\epsilon$-insensitive Losses | Unknown | N/A | |
| Tractable Dendritic RNNs for Reconstructing Nonlinear Dynamical Systems | Unknown | N/A | |
| Smoothed Adversarial Linear Contextual Bandits with Knapsacks | Unknown | N/A | |
| A Hierarchical Bayesian Approach to Inverse Reinforcement Learning with Symbolic Reward Machines | Unknown | N/A | |
| On Collective Robustness of Bagging Against Data Poisoning | Unknown | N/A | |
| CITRIS: Causal Identifiability from Temporal Intervened Sequences | Unknown | N/A | |
| Mitigating Gender Bias in Face Recognition using the von Mises-Fisher Mixture Model | Unknown | N/A | |
| Gating Dropout: Communication-efficient Regularization for Sparsely Activated Transformers | Unknown | N/A | |
| Reinforcement Learning with Action-Free Pre-Training from Videos | Unknown | N/A | |
| A new similarity measure for covariate shift with applications to nonparametric regression | Unknown | N/A | |
| State Transition of Dendritic Spines Improves Learning of Sparse Spiking Neural Networks | Unknown | N/A | |
| Self-supervised learning with random-projection quantizer for speech recognition | Unknown | N/A | |
| Learning Symmetric Embeddings for Equivariant World Models | Unknown | N/A | |
| How Tempering Fixes Data Augmentation in Bayesian Neural Networks | Unknown | N/A | |
| Provably Efficient Offline Reinforcement Learning for Partially Observable Markov Decision Processes | Unknown | N/A | |
| Fighting Fire with Fire: Avoiding DNN Shortcuts through Priming | Unknown | N/A | |
| HousE: Knowledge Graph Embedding with Householder Parameterization | Unknown | N/A | |
| Calibrated and Sharp Uncertainties in Deep Learning via Density Estimation | Unknown | N/A | |
| Investigating Why Contrastive Learning Benefits Robustness against Label Noise | Unknown | N/A | |
| Informed Learning by Wide Neural Networks: Convergence, Generalization and Sampling Complexity | Unknown | N/A | |
| Frustratingly Easy Transferability Estimation | Unknown | N/A | |
| Minimum Cost Intervention Design for Causal Effect Identification | Unknown | N/A | |
| Action-Sufficient State Representation Learning for Control with Structural Constraints | Unknown | N/A | |
| Value Function based Difference-of-Convex Algorithm for Bilevel Hyperparameter Selection Problems | Unknown | N/A | |
| GSmooth: Certified Robustness against Semantic Transformations via Generalized Randomized Smoothing | Unknown | N/A | |
| Blocks Assemble! Learning to Assemble with Large-Scale Structured Reinforcement Learning | Unknown | N/A | |
| Leveraging Approximate Symbolic Models for Reinforcement Learning via Skill Diversity | Unknown | N/A | |
| Tackling covariate shift with node-based Bayesian neural networks | Unknown | N/A | |
| DNS: Determinantal Point Process Based Neural Network Sampler for Ensemble Reinforcement Learning | Unknown | N/A | |
| Transfer and Marginalize: Explaining Away Label Noise with Privileged Information | Unknown | N/A | |
| Staged Training for Transformer Language Models | Unknown | N/A | |
| Deletion Robust Submodular Maximization over Matroids | Unknown | N/A | |
| GLaM: Efficient Scaling of Language Models with Mixture-of-Experts | Unknown | N/A | |
| Fictitious Play and Best-Response Dynamics in Identical Interest and Zero-Sum Stochastic Games | Unknown | N/A | |
| Deciphering Lasso-based Classification Through a Large Dimensional Analysis of the Iterative Soft-Thresholding Algorithm | Unknown | N/A | |
| Estimating and Penalizing Induced Preference Shifts in Recommender Systems | Unknown | N/A | |
| The Unsurprising Effectiveness of Pre-Trained Vision Models for Control | Unknown | N/A | |
| Pessimism meets VCG: Learning Dynamic Mechanism Design via Offline Reinforcement Learning | Unknown | N/A | |
| Multi-Grained Vision Language Pre-Training: Aligning Texts with Visual Concepts | Unknown | N/A | |
| Efficient Model-based Multi-agent Reinforcement Learning via Optimistic Equilibrium Computation | Unknown | N/A | |
| Efficient Learning of CNNs using Patch Based Features | Unknown | N/A | |
| Biological Sequence Design with GFlowNets | Unknown | N/A | |
| One-Pass Algorithms for MAP Inference of Nonsymmetric Determinantal Point Processes | Unknown | N/A | |
| Private frequency estimation via projective geometry | Unknown | N/A | |
| Counterfactual Prediction for Outcome-Oriented Treatments | Unknown | N/A | |
| 3D Infomax improves GNNs for Molecular Property Prediction | Unknown | N/A | |
| Generating Distributional Adversarial Examples to Evade Statistical Detectors | Unknown | N/A | |
| How to Leverage Unlabeled Data in Offline Reinforcement Learning | Unknown | N/A | |
| Linear Adversarial Concept Erasure | Unknown | N/A | |
| Optimistic Linear Support and Successor Features as a Basis for Optimal Policy Transfer | Unknown | N/A | |
| MetAug: Contrastive Learning via Meta Feature Augmentation | Unknown | N/A | |
| Lie Point Symmetry Data Augmentation for Neural PDE Solvers | Unknown | N/A | |
| Universal Hopfield Networks: A General Framework for Single-Shot Associative Memory Models | Unknown | N/A | |
| Stochastic Continuous Submodular Maximization: Boosting via Non-oblivious Function | Unknown | N/A | |
| Conformal Prediction Sets with Limited False Positives | Unknown | N/A | |
| Personalized Federated Learning via Variational Bayesian Inference | Unknown | N/A | |
| Path-Gradient Estimators for Continuous Normalizing Flows | Unknown | N/A | |
| Consistent Polyhedral Surrogates for Top-k Classification and Variants | Unknown | N/A | |
| Nearly Minimax Optimal Reinforcement Learning with Linear Function Approximation | Unknown | N/A | |
| Interactive Inverse Reinforcement Learning for Cooperative Games | Unknown | N/A | |
| Multiple-Play Stochastic Bandits with Shareable Finite-Capacity Arms | Unknown | N/A | |
| Be Like Water: Adaptive Floating Point for Machine Learning | Unknown | N/A | |
| Diffusion bridges vector quantized variational autoencoders | Unknown | N/A | |
| Deduplicating Training Data Mitigates Privacy Risks in Language Models | Unknown | N/A | |
| Lightweight Projective Derivative Codes for Compressed Asynchronous Gradient Descent | Unknown | N/A | |
| Learning Efficient and Robust Ordinary Differential Equations via Invertible Neural Networks | Unknown | N/A | |
| Revisiting and Advancing Fast Adversarial Training Through The Lens of Bi-Level Optimization | Unknown | N/A | |
| Scalable Deep Gaussian Markov Random Fields for General Graphs | Unknown | N/A | |
| Transformer Quality in Linear Time | Unknown | N/A | |
| Visual Attention Emerges from Recurrent Sparse Reconstruction | Unknown | N/A | |
| Confidence Score for Source-Free Unsupervised Domain Adaptation | Unknown | N/A | |
| Unsupervised Image Representation Learning with Deep Latent Particles | Unknown | N/A | |
| Marginal Distribution Adaptation for Discrete Sets via Module-Oriented Divergence Minimization | Unknown | N/A | |
| A Rigorous Study of Integrated Gradients Method and Extensions to Internal Neuron Attributions | Unknown | N/A | |
| IDYNO: Learning Nonparametric DAGs from Interventional Dynamic Data | Unknown | N/A | |
| On the Surrogate Gap between Contrastive and Supervised Losses | Unknown | N/A | |
| Understanding Instance-Level Impact of Fairness Constraints | Unknown | N/A | |
| Robust Group Synchronization via Quadratic Programming | Unknown | N/A | |
| Deep Causal Metric Learning | Unknown | N/A | |
| Flowformer: Linearizing Transformers with Conservation Flows | Unknown | N/A | |
| Graph-Coupled Oscillator Networks | Unknown | N/A | |
| Variational Inference for Infinitely Deep Neural Networks | Unknown | N/A | |
| Reverse Engineering the Neural Tangent Kernel | Unknown | N/A | |
| Low-Precision Stochastic Gradient Langevin Dynamics | Unknown | N/A | |
| Matching Normalizing Flows and Probability Paths on Manifolds | Unknown | N/A | |
| Asymptotically-Optimal Gaussian Bandits with Side Observations | Unknown | N/A | |
| Sparse Double Descent: Where Network Pruning Aggravates Overfitting | Unknown | N/A | |
| Contrastive Mixture of Posteriors for Counterfactual Inference, Data Integration and Fairness | Unknown | N/A | |
| Distributionally-Aware Kernelized Bandit Problems for Risk Aversion | Unknown | N/A | |
| The Dual Form of Neural Networks Revisited: Connecting Test Time Predictions to Training Patterns via Spotlights of Attention | Unknown | N/A | |
| AdAUC: End-to-end Adversarial AUC Optimization Against Long-tail Problems | Unknown | N/A | |
| Kernelized Multiplicative Weights for 0/1-Polyhedral Games: Bridging the Gap Between Learning in Extensive-Form and Normal-Form Games | Unknown | N/A | |
| Principled Knowledge Extrapolation with GANs | Unknown | N/A | |
| Not All Poisons are Created Equal: Robust Training against Data Poisoning | Unknown | N/A | |
| DisPFL: Towards Communication-Efficient Personalized Federated Learning via Decentralized Sparse Training | Unknown | N/A | |
| DynaMixer: A Vision MLP Architecture with Dynamic Mixing | Unknown | N/A | |
| Fenrir: Physics-Enhanced Regression for Initial Value Problems | Unknown | N/A | |
| Unified Fourier-based Kernel and Nonlinearity Design for Equivariant Networks on Homogeneous Spaces | Unknown | N/A | |
| LCANets: Lateral Competition Improves Robustness Against Corruption and Attack | Unknown | N/A | |
| PAGE-PG: A Simple and Loopless Variance-Reduced Policy Gradient Method with Probabilistic Gradient Estimation | Unknown | N/A | |
| Prototype-Anchored Learning for Learning with Imperfect Annotations | Unknown | N/A | |
| Improved StyleGAN-v2 based Inversion for Out-of-Distribution Images | Unknown | N/A | |
| Unraveling Attention via Convex Duality: Analysis and Interpretations of Vision Transformers | Unknown | N/A | |
| Adaptive Gaussian Process Change Point Detection | Unknown | N/A | |
| Hardness and Algorithms for Robust and Sparse Optimization | Unknown | N/A | |
| Multi Resolution Analysis (MRA) for Approximate Self-Attention | Unknown | N/A | |
| A Functional Information Perspective on Model Interpretation | Unknown | N/A | |
| PAC-Net: A Model Pruning Approach to Inductive Transfer Learning | Unknown | N/A | |
| Robust Policy Learning over Multiple Uncertainty Sets | Unknown | N/A | |
| Fishr: Invariant Gradient Variances for Out-of-Distribution Generalization | Unknown | N/A | |
| Fast Finite Width Neural Tangent Kernel | Unknown | N/A | |
| Robustness in Multi-Objective Submodular Optimization: a Quantile Approach | Unknown | N/A | |
| Optimal Estimation of Policy Gradient via Double Fitted Iteration | Unknown | N/A | |
| Learning from Demonstration: Provably Efficient Adversarial Policy Imitation with Linear Function Approximation | Unknown | N/A | |
| Rethinking Fano’s Inequality in Ensemble Learning | Unknown | N/A | |
| Convergence of Uncertainty Sampling for Active Learning | Unknown | N/A | |
| The Role of Deconfounding in Meta-learning | Unknown | N/A | |
| Active fairness auditing | Unknown | N/A | |
| FITNESS: (Fine Tune on New and Similar Samples) to detect anomalies in streams with drift and outliers | Unknown | N/A | |
| Deep Hierarchy in Bandits | Unknown | N/A | |
| Adversarially Trained Actor Critic for Offline Reinforcement Learning | Unknown | N/A | |
| Pairwise Conditional Gradients without Swap Steps and Sparser Kernel Herding | Unknown | N/A | |
| Reconstructing Nonlinear Dynamical Systems from Multi-Modal Time Series | Unknown | N/A | |
| GACT: Activation Compressed Training for Generic Network Architectures | Unknown | N/A | |
| Disentangling Disease-related Representation from Obscure for Disease Prediction | Unknown | N/A | |
| Provable Reinforcement Learning with a Short-Term Memory | Unknown | N/A | |
| Constrained Optimization with Dynamic Bound-scaling for Effective NLP Backdoor Defense | Unknown | N/A | |
| Secure Distributed Training at Scale | Unknown | N/A | |
| Guided-TTS: A Diffusion Model for Text-to-Speech via Classifier Guidance | Unknown | N/A | |
| Neuro-Symbolic Hierarchical Rule Induction | Unknown | N/A | |
| pathGCN: Learning General Graph Spatial Operators from Paths | Unknown | N/A | |
| Design-Bench: Benchmarks for Data-Driven Offline Model-Based Optimization | Unknown | N/A | |
| Conditional GANs with Auxiliary Discriminative Classifier | Unknown | N/A | |
| Retrieval-Augmented Reinforcement Learning | Unknown | N/A | |
| Generative Flow Networks for Discrete Probabilistic Modeling | Unknown | N/A | |
| Molecular Representation Learning via Heterogeneous Motif Graph Neural Networks | Unknown | N/A | |
| Adversarially trained neural representations are already as robust as biological neural representations | Unknown | N/A | |
| A Parametric Class of Approximate Gradient Updates for Policy Optimization | Unknown | N/A | |
| Interactively Learning Preference Constraints in Linear Bandits | Unknown | N/A | |
| Robust Training under Label Noise by Over-parameterization | Unknown | N/A | |
| Causal Conceptions of Fairness and their Consequences | Unknown | N/A | |
| Flow-based Recurrent Belief State Learning for POMDPs | Unknown | N/A | |
| Approximate Bayesian Computation with Domain Expert in the Loop | Unknown | N/A | |
| Multi-Task Learning as a Bargaining Game | Unknown | N/A | |
| Investigating Generalization by Controlling Normalized Margin | Unknown | N/A | |
| Continuous-Time Modeling of Counterfactual Outcomes Using Neural Controlled Differential Equations | Unknown | N/A | |
| Learning Markov Games with Adversarial Opponents: Efficient Algorithms and Fundamental Limits | Unknown | N/A | |
| MAML and ANIL Provably Learn Representations | Unknown | N/A | |
| Model-Free Opponent Shaping | Unknown | N/A | |
| Optimizing Sequential Experimental Design with Deep Reinforcement Learning | Unknown | N/A | |
| Quant-BnB: A Scalable Branch-and-Bound Method for Optimal Decision Trees with Continuous Features | Unknown | N/A | |
| The Multivariate Community Hawkes Model for Dependent Relational Events in Continuous-time Networks | Unknown | N/A | |
| Distributional Hamilton-Jacobi-Bellman Equations for Continuous-Time Reinforcement Learning | Unknown | N/A | |
| Last Iterate Risk Bounds of SGD with Decaying Stepsize for Overparameterized Linear Regression | Unknown | N/A | |
| Variational Inference with Locally Enhanced Bounds for Hierarchical Models | Unknown | N/A | |
| Branching Reinforcement Learning | Unknown | N/A | |
| On the Convergence of the Shapley Value in Parametric Bayesian Learning Games | Unknown | N/A | |
| Metric-Fair Classifier Derandomization | Unknown | N/A | |
| Partial Counterfactual Identification from Observational and Experimental Data | Unknown | N/A | |
| Correlated Quantization for Distributed Mean Estimation and Optimization | Unknown | N/A | |
| Transformers are Meta-Reinforcement Learners | Unknown | N/A | |
| Temporal Difference Learning for Model Predictive Control | Unknown | N/A | |
| PLATON: Pruning Large Transformer Models with Upper Confidence Bound of Weight Importance | Unknown | N/A | |
| Fair Generalized Linear Models with a Convex Penalty | Unknown | N/A | |
| A Simple Guard for Learned Optimizers | Unknown | N/A | |
| Federated Learning with Positive and Unlabeled Data | Unknown | N/A | |
| A Langevin-like Sampler for Discrete Distributions | Unknown | N/A | |
| Robust Meta-learning with Sampling Noise and Label Noise via Eigen-Reptile | Unknown | N/A | |
| Saute RL: Almost Surely Safe Reinforcement Learning Using State Augmentation | Unknown | N/A | |
| Data-SUITE: Data-centric identification of in-distribution incongruous examples | Unknown | N/A | |
| LyaNet: A Lyapunov Framework for Training Neural ODEs | Unknown | N/A | |
| The Neural Race Reduction: Dynamics of Abstraction in Gated Networks | Unknown | N/A | |
| Bit Prioritization in Variational Autoencoders via Progressive Coding | Unknown | N/A | |
| Wide Neural Networks Forget Less Catastrophically | Unknown | N/A | |
| Model Selection in Batch Policy Optimization | Unknown | N/A | |
| On the Difficulty of Defending Self-Supervised Learning against Model Extraction | Unknown | N/A | |
| To Smooth or Not? When Label Smoothing Meets Noisy Labels | Unknown | N/A | |
| Improving Screening Processes via Calibrated Subset Selection | Unknown | N/A | |
| Adaptive Inertia: Disentangling the Effects of Adaptive Learning Rate and Momentum | Unknown | N/A | |
| A Regret Minimization Approach to Multi-Agent Control | Unknown | N/A | |
| Asking for Knowledge (AFK): Training RL Agents to Query External Knowledge Using Language | Unknown | N/A | |
| Streaming Inference for Infinite Feature Models | Unknown | N/A | |
| Examining Scaling and Transfer of Language Model Architectures for Machine Translation | Unknown | N/A | |
| On the Impossibility of Learning to Cooperate with Adaptive Partner Strategies in Repeated Games | Unknown | N/A | |
| FedNL: Making Newton-Type Methods Applicable to Federated Learning | Unknown | N/A | |
| Improving Robustness against Real-World and Worst-Case Distribution Shifts through Decision Region Quantification | Unknown | N/A | |
| Exact Optimal Accelerated Complexity for Fixed-Point Iterations | Unknown | N/A | |
| Fairness with Adaptive Weights | Unknown | N/A | |
| Efficient Representation Learning via Adaptive Context Pooling | Unknown | N/A | |
| Anarchic Federated Learning | Unknown | N/A | |
| A Branch and Bound Framework for Stronger Adversarial Attacks of ReLU Networks | Unknown | N/A | |
| On Measuring Causal Contributions via do-interventions | Unknown | N/A | |
| Making Linear MDPs Practical via Contrastive Representation Learning | Unknown | N/A | |
| Building Robust Ensembles via Margin Boosting | Unknown | N/A | |
| On Last-Iterate Convergence Beyond Zero-Sum Games | Unknown | N/A | |
| Learning fair representation with a parametric integral probability metric | Unknown | N/A | |
| Extracting Latent State Representations with Linear Dynamics from Rich Observations | Unknown | N/A | |
| Forward Operator Estimation in Generative Models with Kernel Transfer Operators | Unknown | N/A | |
| Decentralized Online Convex Optimization in Networked Systems | Unknown | N/A | |
| Burst-Dependent Plasticity and Dendritic Amplification Support Target-Based Learning and Hierarchical Imitation Learning | Unknown | N/A | |
| Contextual Bandits with Large Action Spaces: Made Practical | Unknown | N/A | |
| Learning-based Optimisation of Particle Accelerators Under Partial Observability Without Real-World Training | Unknown | N/A | |
| On the Optimization Landscape of Neural Collapse under MSE Loss: Global Optimality with Unconstrained Features | Unknown | N/A | |
| Model soups: averaging weights of multiple fine-tuned models improves accuracy without increasing inference time | Unknown | N/A | |
| Feature Space Particle Inference for Neural Network Ensembles | Unknown | N/A | |
| Linear Complexity Randomized Self-attention Mechanism | Unknown | N/A | |
| Improved No-Regret Algorithms for Stochastic Shortest Path with Linear MDP | Unknown | N/A | |
| Rethinking Image-Scaling Attacks: The Interplay Between Vulnerabilities in Machine Learning Systems | Unknown | N/A | |
| Robust Counterfactual Explanations for Tree-Based Ensembles | Unknown | N/A | |
| A New Perspective on the Effects of Spectrum in Graph Neural Networks | Unknown | N/A | |
| The Power of Exploiter: Provable Multi-Agent RL in Large State Spaces | Unknown | N/A | |
| Contextual Bandits with Smooth Regret: Efficient Learning in Continuous Action Spaces | Unknown | N/A | |
| Datamodels: Understanding Predictions with Data and Data with Predictions | Unknown | N/A | |
| Approximately Equivariant Networks for Imperfectly Symmetric Dynamics | Unknown | N/A | |
| ActiveHedge: Hedge meets Active Learning | Unknown | N/A | |
| Principal Component Flows | Unknown | N/A | |
| Maslow's Hammer in Catastrophic Forgetting: Node Re-Use vs. Node Activation | Unknown | N/A | |
| Learning to Separate Voices by Spatial Regions | Unknown | N/A | |
| On Numerical Integration in Neural Ordinary Differential Equations | Unknown | N/A | |
| Cliff Diving: Exploring Reward Surfaces in Reinforcement Learning Environments | Unknown | N/A | |
| On the Convergence of Local Stochastic Compositional Gradient Descent with Momentum | Unknown | N/A | |
| Domain Adaptation for Time Series Forecasting via Attention Sharing | Unknown | N/A | |
| DNA: Domain Generalization with Diversified Neural Averaging | Unknown | N/A | |
| Architecture Agnostic Federated Learning for Neural Networks | Unknown | N/A | |
| Leverage Score Sampling for Tensor Product Matrices in Input Sparsity Time | Unknown | N/A | |
| Addressing Optimism Bias in Sequence Modeling for Reinforcement Learning | Unknown | N/A | |
| Understanding Robust Generalization in Learning Regular Languages | Unknown | N/A | |
| Minimax M-estimation under Adversarial Contamination | Unknown | N/A | |
| Disentangled Federated Learning for Tackling Attributes Skew via Invariant Aggregation and Diversity Transferring | Unknown | N/A | |
| A Stochastic Multi-Rate Control Framework For Modeling Distributed Optimization Algorithms | Unknown | N/A | |
| Easy Variational Inference for Categorical Models via an Independent Binary Approximation | Unknown | N/A | |
| Training Your Sparse Neural Network Better with Any Mask | Unknown | N/A | |
| How to Steer Your Adversary: Targeted and Efficient Model Stealing Defenses with Gradient Redirection | Unknown | N/A | |
| Learning to Cut by Looking Ahead: Cutting Plane Selection via Imitation Learning | Unknown | N/A | |
| Coordinated Attacks against Contextual Bandits: Fundamental Limits and Defense Mechanisms | Unknown | N/A | |
| On Learning Mixture of Linear Regressions in the Non-Realizable Setting | Unknown | N/A | |
| Sparsity in Partially Controllable Linear Systems | Unknown | N/A | |
| On the Equivalence Between Temporal and Static Equivariant Graph Representations | Unknown | N/A | |
| ButterflyFlow: Building Invertible Layers with Butterfly Matrices | Unknown | N/A | |
| Structured Stochastic Gradient MCMC | Unknown | N/A | |
| Scaling Out-of-Distribution Detection for Real-World Settings | Unknown | N/A | |
| Towards Uniformly Superhuman Autonomy via Subdominance Minimization | Unknown | N/A | |
| RUMs from Head-to-Head Contests | Unknown | N/A | |
| Set Norm and Equivariant Skip Connections: Putting the Deep in Deep Sets | Unknown | N/A | |
| Learning inverse folding from millions of predicted structures | Unknown | N/A | |
| When Are Linear Stochastic Bandits Attackable? | Unknown | N/A | |
| VarScene: A Deep Generative Model for Realistic Scene Graph Synthesis | Unknown | N/A | |
| Streaming Algorithms for High-Dimensional Robust Statistics | Unknown | N/A | |
| Practical Almost-Linear-Time Approximation Algorithms for Hybrid and Overlapping Graph Clustering | Unknown | N/A | |
| Distributionally Robust $Q$-Learning | Unknown | N/A | |
| Optimally Controllable Perceptual Lossy Compression | Unknown | N/A | |
| On Finite-Sample Identifiability of Contrastive Learning-Based Nonlinear Independent Component Analysis | Unknown | N/A | |
| Volatility Based Kernels and Moving Average Means for Accurate Forecasting with Gaussian Processes | Unknown | N/A | |
| Learning Stochastic Shortest Path with Linear Function Approximation | Unknown | N/A | |
| Gaussian Mixture Variational Autoencoder with Contrastive Learning for Multi-Label Classification | Unknown | N/A | |
| Exploiting Independent Instruments: Identification and Distribution Generalization | Unknown | N/A | |
| Thompson Sampling for Robust Transfer in Multi-Task Bandits | Unknown | N/A | |
| Finding Global Homophily in Graph Neural Networks When Meeting Heterophily | Unknown | N/A | |
| A Model-Agnostic Randomized Learning Framework based on Random Hypothesis Subspace Sampling | Unknown | N/A | |
| Cycle Representation Learning for Inductive Relation Prediction | Unknown | N/A | |
| A Simple Reward-free Approach to Constrained Reinforcement Learning | Unknown | N/A | |
| Partial and Asymmetric Contrastive Learning for Out-of-Distribution Detection in Long-Tailed Recognition | Unknown | N/A | |
| Refined Convergence Rates for Maximum Likelihood Estimation under Finite Mixture Models | Unknown | N/A | |
| Coordinated Double Machine Learning | Unknown | N/A | |
| Causal Dynamics Learning for Task-Independent State Abstraction | Unknown | N/A | |
| Identification of Linear Non-Gaussian Latent Hierarchical Structure | Unknown | N/A | |
| Robust Multi-Objective Bayesian Optimization Under Input Noise | Unknown | N/A | |
| FedNew: A Communication-Efficient and Privacy-Preserving Newton-Type Method for Federated Learning | Unknown | N/A | |
| EqR: Equivariant Representations for Data-Efficient Reinforcement Learning | Unknown | N/A | |
| Robustness Verification for Contrastive Learning | Unknown | N/A | |
| Does the Data Induce Capacity Control in Deep Learning? | Unknown | N/A | |
| Fast and Reliable Evaluation of Adversarial Robustness with Minimum-Margin Attack | Unknown | N/A | |
| Training Discrete Deep Generative Models via Gapped Straight-Through Estimator | Unknown | N/A | |
| A Framework for Learning to Request Rich and Contextually Useful Information from Humans | Unknown | N/A | |
| Regret Minimization with Performative Feedback | Unknown | N/A | |
| Hierarchical Shrinkage: Improving the accuracy and interpretability of tree-based models | Unknown | N/A | |
| Variational Wasserstein gradient flow | Unknown | N/A | |
| Hermite Polynomial Features for Private Data Generation | Unknown | N/A | |
| N-Penetrate: Active Learning of Neural Collision Handler for Complex 3D Mesh Deformations | Unknown | N/A | |
| Locally Sparse Neural Networks for Tabular Biomedical Data | Unknown | N/A | |
| Measuring the Effect of Training Data on Deep Learning Predictions via Randomized Experiments | Unknown | N/A | |
| On Non-local Convergence Analysis of Deep Linear Networks | Unknown | N/A | |
| No-Regret Learning in Partially-Informed Auctions | Unknown | N/A | |
| Universal and data-adaptive algorithms for model selection in linear contextual bandits | Unknown | N/A | |
| Understanding Clipping for Federated Learning: Convergence and Client-Level Differential Privacy | Unknown | N/A | |
| Improve Single-Point Zeroth-Order Optimization Using High-Pass and Low-Pass Filters | Unknown | N/A | |
| De novo mass spectrometry peptide sequencing with a transformer model | Unknown | N/A | |
| Learning Augmented Binary Search Trees | Unknown | N/A | |
| Fishing for User Data in Large-Batch Federated Learning via Gradient Magnification | Unknown | N/A | |
| A Self-Play Posterior Sampling Algorithm for Zero-Sum Markov Games | Unknown | N/A | |
| Probabilistically Robust Learning: Balancing Average- and Worst-case Performance | Unknown | N/A | |
| Differentially Private Community Detection for Stochastic Block Models | Unknown | N/A | |
| A Study on the Ramanujan Graph Property of Winning Lottery Tickets | Unknown | N/A | |
| Local Augmentation for Graph Neural Networks | Unknown | N/A | |
| Learning from Counterfactual Links for Link Prediction | Unknown | N/A | |
| H-Consistency Bounds for Surrogate Loss Minimizers | Unknown | N/A | |
| VariGrow: Variational Architecture Growing for Task-Agnostic Continual Learning based on Bayesian Novelty | Unknown | N/A | |
| Topology-Aware Network Pruning using Multi-stage Graph Embedding and Reinforcement Learning | Unknown | N/A | |
| A deep convolutional neural network that is invariant to time rescaling | Unknown | N/A | |
| Improved Regret for Differentially Private Exploration in Linear MDP | Unknown | N/A | |
| Supervised Learning with General Risk Functionals | Unknown | N/A | |
| Combining Diverse Feature Priors | Unknown | N/A | |
| Robust Deep Reinforcement Learning through Bootstrapped Opportunistic Curriculum | Unknown | N/A | |
| Generalization Guarantee of Training Graph Convolutional Networks with Graph Topology Sampling | Unknown | N/A | |
| Do Differentiable Simulators Give Better Policy Gradients? | Unknown | N/A | |
| Entropic Causal Inference: Graph Identifiability | Unknown | N/A | |
| Counterfactual Transportability: A Formal Approach | Unknown | N/A | |
| BAMDT: Bayesian Additive Semi-Multivariate Decision Trees for Nonparametric Regression | Unknown | N/A | |
| Feature selection using e-values | Unknown | N/A | |
| Augment with Care: Contrastive Learning for Combinatorial Problems | Unknown | N/A | |
| LIMO: Latent Inceptionism for Targeted Molecule Generation | Unknown | N/A | |
| Object Permanence Emerges in a Random Walk along Memory | Unknown | N/A | |
| Imitation Learning by Estimating Expertise of Demonstrators | Unknown | N/A | |
| Neural Laplace: Learning diverse classes of differential equations in the Laplace domain | Unknown | N/A | |
| Stabilizing Q-learning with Linear Architectures for Provable Efficient Learning | Unknown | N/A | |
| Random Gegenbauer Features for Scalable Kernel Methods | Unknown | N/A | |
| Proximal and Federated Random Reshuffling | Unknown | N/A | |
| Neural Tangent Kernel Empowered Federated Learning | Unknown | N/A | |
| Sharpened Quasi-Newton Methods: Faster Superlinear Rate and Larger Local Convergence Neighborhood | Unknown | N/A | |
| Sample-Efficient Reinforcement Learning with loglog(T) Switching Cost | Unknown | N/A | |
| An Analytical Update Rule for General Policy Optimization | Unknown | N/A | |
| Constrained Gradient Descent: A Powerful and Principled Evasion Attack Against Neural Networks | Unknown | N/A | |
| Meaningfully debugging model mistakes using conceptual counterfactual explanations | Unknown | N/A | |
| The State of Sparse Training in Deep Reinforcement Learning | Unknown | N/A | |
| Diffusion Models for Adversarial Purification | Unknown | N/A | |
| Multirate Training of Neural Networks | Unknown | N/A | |
| A Natural Actor-Critic Framework for Zero-Sum Markov Games | Unknown | N/A | |
| Synergy and Symmetry in Deep Learning: Interactions between the Data, Model, and Inference Algorithm | Unknown | N/A | |
| Characterizing and Overcoming the Greedy Nature of Learning in Multi-modal Deep Neural Networks | Unknown | N/A | |
| Resilient and Communication Efficient Learning for Heterogeneous Federated Systems | Unknown | N/A | |
| DRAGONN: Distributed Randomized Approximate Gradients of Neural Networks | Unknown | N/A | |
| Bregman Proximal Langevin Monte Carlo via Bregman–Moreau Envelopes | Unknown | N/A | |
| Efficient Online ML API Selection for Multi-Label Classification Tasks | Unknown | N/A | |
| Quantum-Inspired Algorithms from Randomized Numerical Linear Algebra | Unknown | N/A | |
| Variational Mixtures of ODEs for Inferring Cellular Gene Expression Dynamics | Unknown | N/A | |
| Head2Toe: Utilizing Intermediate Representations for Better Transfer Learning | Unknown | N/A | |
| Discrete Tree Flows via Tree-Structured Permutations | Unknown | N/A | |
| On Transportation of Mini-batches: A Hierarchical Approach | Unknown | N/A | |
| Understanding the unstable convergence of gradient descent | Unknown | N/A | |
| Modeling Strong and Human-Like Gameplay with KL-Regularized Search | Unknown | N/A | |
| Federated Reinforcement Learning: Linear Speedup Under Markovian Sampling | Unknown | N/A | |
| Going Deeper into Permutation-Sensitive Graph Neural Networks | Unknown | N/A | |
| Online Decision Transformer | Unknown | N/A | |
| Streaming Algorithm for Monotone k-Submodular Maximization with Cardinality Constraints | Unknown | N/A | |
| LSB: Local Self-Balancing MCMC in Discrete Spaces | Unknown | N/A | |
| Sharp-MAML: Sharpness-Aware Model-Agnostic Meta Learning | Unknown | N/A | |
| Sketching Algorithms and Lower Bounds for Ridge Regression | Unknown | N/A | |
| data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language | Unknown | N/A | |
| Differentially Private Maximal Information Coefficients | Unknown | N/A | |
| Accelerating Shapley Explanation via Contributive Cooperator Selection | Unknown | N/A | |
| More Efficient Sampling for Tensor Decomposition With Worst-Case Guarantees | Unknown | N/A | |
| Antibody-Antigen Docking and Design via Hierarchical Structure Refinement | Unknown | N/A | |
| When and How Mixup Improves Calibration | Unknown | N/A | |
| NLP From Scratch Without Large-Scale Pretraining: A Simple and Efficient Framework | Unknown | N/A | |
| Hessian-Free High-Resolution Nesterov Acceleration For Sampling | Unknown | N/A | |
| DeepSpeed-MoE: Advancing Mixture-of-Experts Inference and Training to Power Next-Generation AI Scale | Unknown | N/A | |
| Neural Inverse Transform Sampler | Unknown | N/A | |
| Reverse Engineering $\ell_p$ attacks: A block-sparse optimization approach with recovery guarantees | Unknown | N/A | |
| Efficient Reinforcement Learning in Block MDPs: A Model-free Representation Learning approach | Unknown | N/A | |
| On Improving Model-Free Algorithms for Decentralized Multi-Agent Reinforcement Learning | Unknown | N/A | |
| Symmetric Machine Theory of Mind | Unknown | N/A | |
| COAT: Measuring Object Compositionality in Emergent Representations | Unknown | N/A | |
| Discrete Probabilistic Inverse Optimal Transport | Unknown | N/A | |
| Stability Based Generalization Bounds for Exponential Family Langevin Dynamics | Unknown | N/A | |
| Role-based Multiplex Network Embedding | Unknown | N/A | |
| Bayesian Imitation Learning for End-to-End Mobile Manipulation | Unknown | N/A | |
| Constrained Variational Policy Optimization for Safe Reinforcement Learning | Unknown | N/A | |
| POEM: Out-of-Distribution Detection with Posterior Sampling | Unknown | N/A | |
| Understanding Dataset Difficulty with $\mathcal{V}$-Usable Information | Unknown | N/A | |
| Adaptive Best-of-Both-Worlds Algorithm for Heavy-Tailed Multi-Armed Bandits | Unknown | N/A | |
| Federated Learning with Partial Model Personalization | Unknown | N/A | |
| Entropic Gromov-Wasserstein between Gaussian Distributions | Unknown | N/A | |
| Debiaser Beware: Pitfalls of Centering Regularized Transport Maps | Unknown | N/A | |
| A Tree-based Model Averaging Approach for Personalized Treatment Effect Estimation from Heterogeneous Data Sources | Unknown | N/A | |
| G-Mixup: Graph Data Augmentation for Graph Classification | Unknown | N/A | |
| Selective Network Linearization for Efficient Private Inference | Unknown | N/A | |
| Dual Perspective of Label-Specific Feature Learning for Multi-Label Classification | Unknown | N/A | |
| Measure Estimation in the Barycentric Coding Model | Unknown | N/A | |
| Finding the Task-Optimal Low-Bit Sub-Distribution in Deep Neural Networks | Unknown | N/A | |
| Distinguishing rule- and exemplar-based generalization in learning systems | Unknown | N/A | |
| Why Should I Trust You, Bellman? The Bellman Error is a Poor Replacement for Value Error | Unknown | N/A | |
| Versatile Offline Imitation from Observations and Examples via Regularized State-Occupancy Matching | Unknown | N/A | |
| Rich Feature Construction for the Optimization-Generalization Dilemma | Unknown | N/A | |
| Task-aware Privacy Preservation for Multi-dimensional Data | Unknown | N/A | |
| TACTiS: Transformer-Attentional Copulas for Time Series | Unknown | N/A | |
| Risk-Averse No-Regret Learning in Online Convex Games | Unknown | N/A | |
| Fourier Learning with Cyclical Data | Unknown | N/A | |
| Adaptive Second Order Coresets for Data-efficient Machine Learning | Unknown | N/A | |
| Describing Differences between Text Distributions with Natural Language | Unknown | N/A | |
| Uncertainty Modeling in Generative Compressed Sensing | Unknown | N/A | |
| Provable Stochastic Optimization for Global Contrastive Learning: Small Batch Does Not Harm Performance | Unknown | N/A | |
| Scaling-up Diverse Orthogonal Convolutional Networks by a Paraunitary Framework | Unknown | N/A | |
| Parsimonious Learning-Augmented Caching | Unknown | N/A | |
| On the Statistical Benefits of Curriculum Learning | Unknown | N/A | |
| GraphFM: Improving Large-Scale GNN Training via Feature Momentum | Unknown | N/A | |
| NysADMM: faster composite convex optimization via low-rank approximation | Unknown | N/A | |
| Beyond Worst-Case Analysis in Stochastic Approximation: Moment Estimation Improves Instance Complexity | Unknown | N/A | |
| Constrained Offline Policy Optimization | Unknown | N/A | |
| End-to-End Balancing for Causal Continuous Treatment-Effect Estimation | Unknown | N/A | |
| Constrained Discrete Black-Box Optimization using Mixed-Integer Programming | Unknown | N/A | |
| Learning to Solve PDE-constrained Inverse Problems with Graph Networks | Unknown | N/A | |
| Feature and Parameter Selection in Stochastic Linear Bandits | Unknown | N/A | |
| Faster Algorithms for Learning Convex Functions | Unknown | N/A | |
| Optimal Algorithms for Mean Estimation under Local Differential Privacy | Unknown | N/A | |
| Removing Batch Normalization Boosts Adversarial Training | Unknown | N/A | |
| Fast Convex Optimization for Two-Layer ReLU Networks: Equivalent Model Classes and Cone Decompositions | Unknown | N/A | |
| GALAXY: Graph-based Active Learning at the Extreme | Unknown | N/A | |
| Preconditioning for Scalable Gaussian Process Hyperparameter Optimization | Unknown | N/A | |
| Three-stage Evolution and Fast Equilibrium for SGD with Non-degerate Critical Points | Unknown | N/A | |
| Implicit Bias of Linear Equivariant Networks | Unknown | N/A | |
| Scaling Gaussian Process Optimization by Evaluating a Few Unique Candidates Multiple Times | Unknown | N/A | |
| ViT-NeT: Interpretable Vision Transformers with Neural Tree Decoder | Unknown | N/A | |
| The CLRS Algorithmic Reasoning Benchmark | Unknown | N/A | |
| Adaptive Accelerated (Extra-)Gradient Methods with Variance Reduction | Unknown | N/A | |
| Training OOD Detectors in their Natural Habitats | Unknown | N/A | |
| Power-Law Escape Rate of SGD | Unknown | N/A | |
| Why the Rich Get Richer? On the Balancedness of Random Partition Models | Unknown | N/A | |
| Validating Causal Inference Methods | Unknown | N/A | |
| Adversarial Vulnerability of Randomized Ensembles | Unknown | N/A | |
| Provable Acceleration of Heavy Ball beyond Quadratics for a Class of Polyak-Lojasiewicz Functions when the Non-Convexity is Averaged-Out | Unknown | N/A | |
| Achieving Minimax Rates in Pool-Based Batch Active Learning | Unknown | N/A | |
| On the Hidden Biases of Policy Mirror Ascent in Continuous Action Spaces | Unknown | N/A | |
| Detached Error Feedback for Distributed SGD with Random Sparsification | Unknown | N/A | |
| RankSim: Ranking Similarity Regularization for Deep Imbalanced Regression | Unknown | N/A | |
| Cascaded Gaps: Towards Logarithmic Regret for Risk-Sensitive Reinforcement Learning | Unknown | N/A | |
| Breaking the $\sqrt{T}$ Barrier: Instance-Independent Logarithmic Regret in Stochastic Contextual Linear Bandits | Unknown | N/A | |
| Weisfeiler-Lehman Meets Gromov-Wasserstein | Unknown | N/A | |
| HyperPrompt: Prompt-based Task-Conditioning of Transformers | Unknown | N/A | |
| Out-of-Distribution Detection with Deep Nearest Neighbors | Unknown | N/A | |
| Bregman Power k-Means for Clustering Exponential Family Data | Unknown | N/A | |
| Bounding the Width of Neural Networks via Coupled Initialization - A Worst Case Analysis | Unknown | N/A | |
| Neural Tangent Kernel Beyond the Infinite-Width Limit: Effects of Depth and Initialization | Unknown | N/A | |
| Information Discrepancy in Strategic Learning | Unknown | N/A | |
| Robust alignment of cross-session recordings of neural population activity by behaviour via unsupervised domain adaptation | Unknown | N/A | |
| Simultaneous Graph Signal Clustering and Graph Learning | Unknown | N/A | |
| Fair Representation Learning through Implicit Path Alignment | Unknown | N/A | |
| A General Recipe for Likelihood-free Bayesian Optimization | Unknown | N/A | |
| ROCK: Causal Inference Principles for Reasoning about Commonsense Causality | Unknown | N/A | |
| Interactive Correlation Clustering with Existential Cluster Constraints | Unknown | N/A | |
| Adaptive Random Walk Gradient Descent for Decentralized Optimization | Unknown | N/A | |
| Differentiable Top-k Classification Learning | Unknown | N/A | |
| Linear Bandit Algorithms with Sublinear Time Complexity | Unknown | N/A | |
| A query-optimal algorithm for finding counterfactuals | Unknown | N/A | |
| Label Ranking through Nonparametric Regression | Unknown | N/A | |
| Agnostic Learnability of Halfspaces via Logistic Loss | Unknown | N/A | |
| First-Order Regret in Reinforcement Learning with Linear Function Approximation: A Robust Estimation Approach | Unknown | N/A | |
| Deploying Convolutional Networks on Untrusted Platforms Using 2D Holographic Reduced Representations | Unknown | N/A | |
| Reward-Free RL is No Harder Than Reward-Aware RL in Linear Markov Decision Processes | Unknown | N/A | |
| Optimal Clipping and Magnitude-aware Differentiation for Improved Quantization-aware Training | Unknown | N/A | |
| DRIBO: Robust Deep Reinforcement Learning via Multi-View Information Bottleneck | Unknown | N/A | |
| Generalizing Gaussian Smoothing for Random Search | Unknown | N/A | |
| A Study of Face Obfuscation in ImageNet | Unknown | N/A | |
| Recurrent Model-Free RL Can Be a Strong Baseline for Many POMDPs | Unknown | N/A | |
| Sanity Simulations for Saliency Methods | Unknown | N/A | |
| Universal Joint Approximation of Manifolds and Densities by Simple Injective Flows | Unknown | N/A | |
| Variational Sparse Coding with Learned Thresholding | Unknown | N/A | |
| Local Linear Convergence of Douglas-Rachford for Linear Programming: a Probabilistic Analysis | Unknown | N/A | |
| Equivariant Diffusion for Molecule Generation in 3D | Unknown | N/A | |
| Loss Function Learning for Domain Generalization by Implicit Gradient | Unknown | N/A | |
| Pure Noise to the Rescue of Insufficient Data: Improving Imbalanced Classification by Training on Random Noise Images | Unknown | N/A | |
| Contextual Information-Directed Sampling | Unknown | N/A | |
| COLA: Consistent Learning with Opponent-Learning Awareness | Unknown | N/A | |
| Adapting to Mixing Time in Stochastic Optimization with Markovian Data | Unknown | N/A | |
| From data to functa: Your data point is a function and you can treat it like one | Unknown | N/A | |
| Scaling Structured Inference with Randomization | Unknown | N/A | |
| Log-Euclidean Signatures for Intrinsic Distances Between Unaligned Datasets | Unknown | N/A | |
| How to Train Your Wide Neural Network Without Backprop: An Input-Weight Alignment Perspective | Unknown | N/A | |
| Learning to Infer Structures of Network Games | Unknown | N/A | |
| Translatotron 2: High-quality direct speech-to-speech translation with voice preservation | Unknown | N/A | |
| Faster Fundamental Graph Algorithms via Learned Predictions | Unknown | N/A | |
| Fat–Tailed Variational Inference with Anisotropic Tail Adaptive Flows | Unknown | N/A | |
| In defense of dual-encoders for neural ranking | Unknown | N/A | |
| MAE-DET: Revisiting Maximum Entropy Principle in Zero-Shot NAS for Efficient Object Detection | Unknown | N/A | |
| Compressed-VFL: Communication-Efficient Learning with Vertically Partitioned Data | Unknown | N/A | |
| Nesterov Accelerated Shuffling Gradient Method for Convex Optimization | Unknown | N/A | |
| What Can Linear Interpolation of Neural Network Loss Landscapes Tell Us? | Unknown | N/A | |
| Algorithms for the Communication of Samples | Unknown | N/A | |
| Strategic Instrumental Variable Regression: Recovering Causal Relationships From Strategic Responses | Unknown | N/A | |
| Nearly Optimal Policy Optimization with Stable at Any Time Guarantee | Unknown | N/A | |
| Analysis of Stochastic Processes through Replay Buffers | Unknown | N/A | |
| Generalized Strategic Classification and the Case of Aligned Incentives | Unknown | N/A | |
| Policy Gradient Method For Robust Reinforcement Learning | Unknown | N/A | |
| Plug-In Inversion: Model-Agnostic Inversion for Vision with Data Augmentations | Unknown | N/A | |
| Structure-preserving GANs | Unknown | N/A | |
| Improved Certified Defenses against Data Poisoning with (Deterministic) Finite Aggregation | Unknown | N/A | |
| Tell me why! Explanations support learning relational and causal structure | Unknown | N/A | |
| A Random Matrix Analysis of Data Stream Clustering: Coping With Limited Memory Resources | Unknown | N/A | |
| A Minimax Learning Approach to Off-Policy Evaluation in Confounded Partially Observable Markov Decision Processes | Unknown | N/A | |
| Balancing Sample Efficiency and Suboptimality in Inverse Reinforcement Learning | Unknown | N/A | |
| Towards Coherent and Consistent Use of Entities in Narrative Generation | Unknown | N/A | |
| Certified Neural Network Watermarks with Randomized Smoothing | Unknown | N/A | |
| On the Practicality of Deterministic Epistemic Uncertainty | Unknown | N/A | |
| Robust SDE-Based Variational Formulations for Solving Linear PDEs via Deep Learning | Unknown | N/A | |
| On the Finite-Time Complexity and Practical Computation of Approximate Stationarity Concepts of Lipschitz Functions | Unknown | N/A | |
| Correlation Clustering via Strong Triadic Closure Labeling: Fast Approximation Algorithms and Practical Lower Bounds | Unknown | N/A | |
| Modeling Structure with Undirected Neural Networks | Unknown | N/A | |
| Convergence of Policy Gradient for Entropy Regularized MDPs with Neural Network Approximation in the Mean-Field Regime | Unknown | N/A | |
| Inducing Causal Structure for Interpretable Neural Networks | Unknown | N/A | |
| Controlling Conditional Language Models without Catastrophic Forgetting | Unknown | N/A | |
| ShiftAddNAS: Hardware-Inspired Search for More Accurate and Efficient Neural Networks | Unknown | N/A | |
| A Differential Entropy Estimator for Training Neural Networks | Unknown | N/A | |
| Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval | Unknown | N/A | |
| Towards Noise-adaptive, Problem-adaptive (Accelerated) Stochastic Gradient Descent | Unknown | N/A | |
| Scalable Computation of Causal Bounds | Unknown | N/A | |
| Detecting Adversarial Examples Is (Nearly) As Hard As Classifying Them | Unknown | N/A | |
| Tight and Robust Private Mean Estimation with Few Users | Unknown | N/A | |
| Strategies for Safe Multi-Armed Bandits with Logarithmic Regret and Risk | Unknown | N/A | |
| Modeling Irregular Time Series with Continuous Recurrent Units | Unknown | N/A | |
| No-Regret Learning in Time-Varying Zero-Sum Games | Unknown | N/A | |
| Curriculum Reinforcement Learning via Constrained Optimal Transport | Unknown | N/A | |
| Score Matching Enables Causal Discovery of Nonlinear Additive Noise Models | Unknown | N/A | |
| PAC-Bayesian Bounds on Rate-Efficient Classifiers | Unknown | N/A | |
| Generalized Results for the Existence and Consistency of the MLE in the Bradley-Terry-Luce Model | Unknown | N/A | |
| FedScale: Benchmarking Model and System Performance of Federated Learning at Scale | Unknown | N/A | |
| Probabilistic ODE Solutions in Millions of Dimensions | Unknown | N/A | |
| Sequential Covariate Shift Detection Using Classifier Two-Sample Tests | Unknown | N/A | |
| Accelerated, Optimal and Parallel: Some results on model-based stochastic optimization | Unknown | N/A | |
| On the Convergence of Inexact Predictor-Corrector Methods for Linear Programming | Unknown | N/A | |
| FriendlyCore: Practical Differentially Private Aggregation | Unknown | N/A | |
| GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models | Unknown | N/A | |
| RieszNet and ForestRiesz: Automatic Debiased Machine Learning with Neural Nets and Random Forests | Unknown | N/A | |
| Robust Kernel Density Estimation with Median-of-Means principle | Unknown | N/A | |
| Evolving Curricula with Regret-Based Environment Design | Unknown | N/A | |
| The Importance of Non-Markovianity in Maximum State Entropy Exploration | Unknown | N/A | |
| Adversarial Masking for Self-Supervised Learning | Unknown | N/A | |
| Fluctuations, Bias, Variance & Ensemble of Learners: Exact Asymptotics for Convex Losses in High-Dimension | Unknown | N/A | |
| Inverse Contextual Bandits: Learning How Behavior Evolves over Time | Unknown | N/A | |
| Adversarial Attacks on Gaussian Process Bandits | Unknown | N/A | |
| Bayesian Nonparametrics for Offline Skill Discovery | Unknown | N/A | |
| PDE-Based Optimal Strategy for Unconstrained Online Learning | Unknown | N/A | |
| Plan Your Target and Learn Your Skills: Transferable State-Only Imitation Learning via Decoupled Policy Optimization | Unknown | N/A | |
| Improved Rates for Differentially Private Stochastic Convex Optimization with Heavy-Tailed Data | Unknown | N/A | |
| Shuffle Private Linear Contextual Bandits | Unknown | N/A | |
| Universality of Winning Tickets: A Renormalization Group Perspective | Unknown | N/A | |
| Plug & Play Attacks: Towards Robust and Flexible Model Inversion Attacks | Unknown | N/A | |
| SDQ: Stochastic Differentiable Quantization with Mixed Precision | Unknown | N/A | |
| Direct Behavior Specification via Constrained Reinforcement Learning | Unknown | N/A | |
| Translating Robot Skills: Learning Unsupervised Skill Correspondences Across Robots | Unknown | N/A | |
| Error-driven Input Modulation: Solving the Credit Assignment Problem without a Backward Pass | Unknown | N/A | |
| Invariant Ancestry Search | Unknown | N/A | |
| Least Squares Estimation using Sketched Data with Heteroskedastic Errors | Unknown | N/A | |
| Robust Task Representations for Offline Meta-Reinforcement Learning via Contrastive Learning | Unknown | N/A | |
| Convergence and Recovery Guarantees of the K-Subspaces Method for Subspace Clustering | Unknown | N/A | |
| A Hierarchical Transitive-Aligned Graph Kernel for Un-attributed Graphs | Unknown | N/A | |
| A Psychological Theory of Explainability | Unknown | N/A | |
| Composing Partial Differential Equations with Physics-Aware Neural Networks | Unknown | N/A | |
| Stochastic Rising Bandits | Unknown | N/A | |
| How Faithful is your Synthetic Data? Sample-level Metrics for Evaluating and Auditing Generative Models | Unknown | N/A | |
| Failure and success of the spectral bias prediction for Laplace Kernel Ridge Regression: the case of low-dimensional data | Unknown | N/A | |
| Geometric Multimodal Contrastive Representation Learning | Unknown | N/A | |
| NOMU: Neural Optimization-based Model Uncertainty | Unknown | N/A | |
| Neural-Symbolic Models for Logical Queries on Knowledge Graphs | Unknown | N/A | |
| The Infinite Contextual Graph Markov Model | Unknown | N/A | |
| Federated Learning with Label Distribution Skew via Logits Calibration | Unknown | N/A | |
| RetrievalGuard: Provably Robust 1-Nearest Neighbor Image Retrieval | Unknown | N/A | |
| Exploring and Exploiting Hubness Priors for High-Quality GAN Latent Sampling | Unknown | N/A | |
| Deep symbolic regression for recurrence prediction | Unknown | N/A | |
| Kernel Methods for Radial Transformed Compositional Data with Many Zeros | Unknown | N/A | |
| Equivariance versus Augmentation for Spherical Images | Unknown | N/A | |
| Adversarial Robustness against Multiple and Single $l_p$-Threat Models via Quick Fine-Tuning of Robust Classifiers | Unknown | N/A | |
| Decomposing Temporal High-Order Interactions via Latent ODEs | Unknown | N/A | |
| Nearly Optimal Catoni’s M-estimator for Infinite Variance | Unknown | N/A | |
| Label-Free Explainability for Unsupervised Models | Unknown | N/A | |
| AutoIP: A United Framework to Integrate Physics into Gaussian Processes | Unknown | N/A | |
| PINs: Progressive Implicit Networks for Multi-Scale Neural Representations | Unknown | N/A | |
| Generalization Bounds using Lower Tail Exponents in Stochastic Optimizers | Unknown | N/A | |
| Style Equalization: Unsupervised Learning of Controllable Generative Sequence Models | Unknown | N/A | |
| Nonlinear Feature Diffusion on Hypergraphs | Unknown | N/A | |
| Interpretable Neural Networks with Frank-Wolfe: Sparse Relevance Maps and Relevance Orderings | Unknown | N/A | |
| Towards Theoretical Analysis of Transformation Complexity of ReLU DNNs | Unknown | N/A | |
| Self-supervised Models are Good Teaching Assistants for Vision Transformers | Unknown | N/A | |
| ME-GAN: Learning Panoptic Electrocardio Representations for Multi-view ECG Synthesis Conditioned on Heart Diseases | Unknown | N/A | |
| Delay-Adaptive Step-sizes for Asynchronous Learning | Unknown | N/A | |
| Transfer Learning In Differential Privacy's Hybrid-Model | Unknown | N/A | |
| On the Finite-Time Performance of the Knowledge Gradient Algorithm | Unknown | N/A | |
| Expression might be enough: representing pressure and demand for reinforcement learning based traffic signal control | Unknown | N/A | |
| Dynamic Topic Models for Temporal Document Networks | Unknown | N/A | |
| Choosing Answers in Epsilon-Best-Answer Identification for Linear Bandits | Unknown | N/A | |
| Variational On-the-Fly Personalization | Unknown | N/A | |
| Re-evaluating Word Mover's Distance | Unknown | N/A | |
| Revisiting the Effects of Stochasticity for Hamiltonian Samplers | Unknown | N/A | |
| Unsupervised Flow-Aligned Sequence-to-Sequence Learning for Video Restoration | Unknown | N/A | |
| Difference Advantage Estimation for Multi-Agent Policy Gradients | Unknown | N/A | |
| A Joint Exponential Mechanism For Differentially Private Top-$k$ | Unknown | N/A | |
| Being Properly Improper | Unknown | N/A | |
| Global Optimization Networks | Unknown | N/A | |
| On the Effects of Artificial Data Modification | Unknown | N/A | |
| Near-Exact Recovery for Tomographic Inverse Problems via Deep Learning | Unknown | N/A | |
| Robustness Implies Generalization via Data-Dependent Generalization Bounds | Unknown | N/A | |
| Order Constraints in Optimal Transport | Unknown | N/A | |
| Estimation in Rotationally Invariant Generalized Linear Models via Approximate Message Passing | Unknown | N/A | |
| Safe Exploration for Efficient Policy Evaluation and Comparison | Unknown | N/A | |
| Omni-Granular Ego-Semantic Propagation for Self-Supervised Graph Representation Learning | Unknown | N/A | |
| Residual-Based Sampling for Online Outlier-Robust PCA | Unknown | N/A | |
| Neural Network Weights Do Not Converge to Stationary Points: An Invariant Measure Perspective | Unknown | N/A | |
| Efficient PAC Learning from the Crowd with Pairwise Comparisons | Unknown | N/A | |
| Generic Coreset for Scalable Learning of Monotonic Kernels: Logistic Regression, Sigmoid and more | Unknown | N/A | |
| Generalized Federated Learning via Sharpness Aware Minimization | Unknown | N/A | |
| NISPA: Neuro-Inspired Stability-Plasticity Adaptation for Continual Learning in Sparse Networks | Unknown | N/A | |
| Off-Policy Reinforcement Learning with Delayed Rewards | Unknown | N/A | |
| EAT-C: Environment-Adversarial sub-Task Curriculum for Efficient Reinforcement Learning | Unknown | N/A | |
| Towards understanding how momentum improves generalization in deep learning | Unknown | N/A | |
| One-Pass Diversified Sampling with Application to Terabyte-Scale Genomic Sequence Streams | Unknown | N/A | |
| Stochastic Reweighted Gradient Descent | Unknown | N/A | |
| Cooperative Online Learning in Stochastic and Adversarial MDPs | Unknown | N/A | |
| Analyzing and Mitigating Interference in Neural Architecture Search | Unknown | N/A | |
| Convergence of Invariant Graph Networks | Unknown | N/A | |
| Minimax Classification under Concept Drift with Multidimensional Adaptation and Performance Guarantees | Unknown | N/A | |
| Partial Label Learning via Label Influence Function | Unknown | N/A | |
| Instance Dependent Regret Analysis of Kernelized Bandits | Unknown | N/A | |
| A Theoretical Comparison of Graph Neural Network Extensions | Unknown | N/A | |
| Ripple Attention for Visual Perception with Sub-quadratic Complexity | Unknown | N/A | |
| Offline Meta-Reinforcement Learning with Online Self-Supervision | Unknown | N/A | |
| Prototype Based Classification from Hierarchy to Fairness | Unknown | N/A | |
| Understanding Robust Overfitting of Adversarial Training and Beyond | Unknown | N/A | |
| Fast Provably Robust Decision Trees and Boosting | Unknown | N/A | |
| MemSR: Training Memory-efficient Lightweight Model for Image Super-Resolution | Unknown | N/A | |
| Pessimistic Q-Learning for Offline Reinforcement Learning: Towards Optimal Sample Complexity | Unknown | N/A | |
| YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for Everyone | Unknown | N/A | |
| Deep Squared Euclidean Approximation to the Levenshtein Distance for DNA Storage | Unknown | N/A | |
| Simple and near-optimal algorithms for hidden stratification and multi-group learning | Unknown | N/A | |
| Function-space Inference with Sparse Implicit Processes | Unknown | N/A | |
| C\*-algebra Net: A New Approach Generalizing Neural Network Parameters to C\*-algebra | Unknown | N/A | |
| Equivariant Quantum Graph Circuits | Unknown | N/A | |
| Attentional Meta-learners for Few-shot Polythetic Classification | Unknown | N/A | |
| Neural Network Pruning Denoises the Features and Makes Local Connectivity Emerge in Visual Tasks | Unknown | N/A | |
| Nonparametric Embeddings of Sparse High-Order Interaction Events | Unknown | N/A | |
| Robust Models Are More Interpretable Because Attributions Look Normal | Unknown | N/A | |
| Divergence-Regularized Multi-Agent Actor-Critic | Unknown | N/A | |
| Class-Imbalanced Semi-Supervised Learning with Adaptive Thresholding | Unknown | N/A | |
| DAdaQuant: Doubly-adaptive quantization for communication-efficient Federated Learning | Unknown | N/A | |
| Generative Trees: Adversarial and Copycat | Unknown | N/A | |
| Nonparametric Sparse Tensor Factorization with Hierarchical Gamma Processes | Unknown | N/A | |
| Learning Domain Adaptive Object Detection with Probabilistic Teacher | Unknown | N/A | |
| Matching Structure for Dual Learning | Unknown | N/A | |
| Large Batch Experience Replay | Unknown | N/A | |
| Coarsening the Granularity: Towards Structurally Sparse Lottery Tickets | Unknown | N/A | |
| Input Dependent Sparse Gaussian Processes | Unknown | N/A | |
| Sublinear-Time Clustering Oracle for Signed Graphs | Unknown | N/A | |
| Gaussian Process Uniform Error Bounds with Unknown Hyperparameters for Safety-Critical Applications | Unknown | N/A | |
| Continuous Control with Action Quantization from Demonstrations | Unknown | N/A | |
| An iterative clustering algorithm for the Contextual Stochastic Block Model with optimality guarantees | Unknown | N/A | |
| Estimating Instance-dependent Bayes-label Transition Matrix using a Deep Neural Network | Unknown | N/A | |
| A Neural Tangent Kernel Perspective of GANs | Unknown | N/A | |
| On Well-posedness and Minimax Optimal Rates of Nonparametric Q-function Estimation in Off-policy Evaluation | Unknown | N/A | |
| Implicit Regularization in Hierarchical Tensor Factorization and Deep Convolutional Neural Networks | Unknown | N/A | |
| Understanding Policy Gradient Algorithms: A Sensitivity-Based Approach | Unknown | N/A | |
| BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation | Unknown | N/A | |
| Differentially Private Approximate Quantiles | Unknown | N/A | |
| Nonparametric Factor Trajectory Learning for Dynamic Tensor Decomposition | Unknown | N/A | |
| Born-Infeld (BI) for AI: Energy-Conserving Descent (ECD) for Optimization | Unknown | N/A | |
| Centroid Approximation for Bootstrap: Improving Particle Quality at Inference | Unknown | N/A | |
| Learning Mixtures of Linear Dynamical Systems | Unknown | N/A | |
| Data-Efficient Double-Win Lottery Tickets from Robust Pre-training | Unknown | N/A | |
| Adaptive Data Analysis with Correlated Observations | Unknown | N/A | |
| $p$-Laplacian Based Graph Neural Networks | Unknown | N/A | |
| Adapting k-means Algorithms for Outliers | Unknown | N/A | |
| Smoothed Adaptive Weighting for Imbalanced Semi-Supervised Learning: Improve Reliability Against Unknown Distribution Data | Unknown | N/A | |
| Surrogate Likelihoods for Variational Annealed Importance Sampling | Unknown | N/A | |
| Learning to Hash Robustly, Guaranteed | Unknown | N/A | |
| Bounding Training Data Reconstruction in Private (Deep) Learning | Unknown | N/A | |
| Massively Parallel $k$-Means Clustering for Perturbation Resilient Instances | Unknown | N/A | |
| Bayesian Continuous-Time Tucker Decomposition | Unknown | N/A | |
| Consensus Multiplicative Weights Update: Learning to Learn using Projector-based Game Signatures | Unknown | N/A | |
| Fairness Interventions as (Dis)Incentives for Strategic Manipulation | Unknown | N/A | |
| Unsupervised Time-Series Representation Learning with Iterative Bilinear Temporal-Spectral Fusion | Unknown | N/A | |
| DSTAGNN: Dynamic Spatial-Temporal Aware Graph Neural Network for Traffic Flow Forecasting | Unknown | N/A | |
| Learning Stable Classifiers by Transferring Unstable Features | Unknown | N/A | |
| Greedy based Value Representation for Optimal Coordination in Multi-agent Reinforcement Learning | Unknown | N/A | |
| Fast-Rate PAC-Bayesian Generalization Bounds for Meta-Learning | Unknown | N/A | |
| Congested Bandits: Optimal Routing via Short-term Resets | Unknown | N/A | |
| For Learning in Symmetric Teams, Local Optima are Global Nash Equilibria | Unknown | N/A | |
| A Resilient Distributed Boosting Algorithm | Unknown | N/A | |
| Structure Preserving Neural Networks: A Case Study in the Entropy Closure of the Boltzmann Equation | Unknown | N/A | |
| QSFL: A Two-Level Uplink Communication Optimization Framework for Federated Learning | Unknown | N/A | |
| Skin Deep Unlearning: Artefact and Instrument Debiasing in the Context of Melanoma Classification | Unknown | N/A | |
| Online Learning and Pricing with Reusable Resources: Linear Bandits with Sub-Exponential Rewards | Unknown | N/A | |
| SoQal: Selective Oracle Questioning for Consistency Based Active Learning of Cardiac Signals | Unknown | N/A | |
| The dynamics of representation learning in shallow, non-linear autoencoders | Unknown | N/A | |
| Bayesian Nonparametric Learning for Point Processes with Spatial Homogeneity: A Spatial Analysis of NBA Shot Locations | Unknown | N/A | |
| Structural Entropy Guided Graph Hierarchical Pooling | Unknown | N/A | |
| Stochastic Deep Networks with Linear Competing Units for Model-Agnostic Meta-Learning | Unknown | N/A | |
| Large-scale Stochastic Optimization of NDCG Surrogates for Deep Learning with Provable Convergence | Unknown | N/A | |
| Deep Probability Estimation | Unknown | N/A | |
| SQ-VAE: Variational Bayes on Discrete Representation with Self-annealed Stochastic Quantization | Unknown | N/A | |
| Beyond Images: Label Noise Transition Matrix Estimation for Tasks with Lower-Quality Features | Unknown | N/A | |
| Detecting Corrupted Labels Without Training a Model to Predict | Unknown | N/A | |
| Accelerated Gradient Methods for Geodesically Convex Optimization: Tractable Algorithms and Convergence Analysis | Unknown | N/A | |
| Personalization Improves Privacy-Accuracy Tradeoffs in Federated Learning | Unknown | N/A | |
| Independent Policy Gradient for Large-Scale Markov Potential Games: Sharper Rates, Function Approximation, and Game-Agnostic Convergence | Unknown | N/A | |
| Sample Efficient Learning of Predictors that Complement Humans | Unknown | N/A | |
| Bayesian Learning with Information Gain Provably Bounds Risk for a Robust Adversarial Defense | Unknown | N/A | |
| From block-Toeplitz matrices to differential equations on graphs: towards a general theory for scalable masked Transformers | Unknown | N/A | |
| Generating 3D Molecules for Target Protein Binding | Unknown | N/A | |
| Deep and Flexible Graph Neural Architecture Search | Unknown | N/A | |
| Streaming Algorithms for Support-Aware Histograms | Unknown | N/A | |
| Policy Diagnosis via Measuring Role Diversity in Cooperative Multi-agent RL | Unknown | N/A | |
| Score-Guided Intermediate Level Optimization: Fast Langevin Mixing for Inverse Problems | Unknown | N/A | |
| Optimal Algorithms for Stochastic Multi-Level Compositional Optimization | Unknown | N/A | |
| Off-Policy Evaluation for Large Action Spaces via Embeddings | Unknown | N/A | |
| ProgFed: Effective, Communication, and Computation Efficient Federated Learning by Progressive Training | Unknown | N/A | |
| Learning Bellman Complete Representations for Offline Policy Evaluation | Unknown | N/A | |
| Doubly Robust Distributionally Robust Off-Policy Evaluation and Learning | Unknown | N/A | |
| GenLabel: Mixup Relabeling using Generative Models | Unknown | N/A | |
| Diversified Adversarial Attacks based on Conjugate Gradient Method | Unknown | N/A | |
| NAFS: A Simple yet Tough-to-beat Baseline for Graph Representation Learning | Unknown | N/A | |
| Estimating the Optimal Covariance with Imperfect Mean in Diffusion Probabilistic Models | Unknown | N/A | |
| Maximum Likelihood Training for Score-based Diffusion ODEs by High Order Denoising Score Matching | Unknown | N/A | |
| Efficient Test-Time Model Adaptation without Forgetting | Unknown | N/A | |
| Restarted Nonconvex Accelerated Gradient Descent: No More Polylogarithmic Factor in the $O(\epsilon^{-7/4})$ Complexity | Unknown | N/A | |
| CerDEQ: Certifiable Deep Equilibrium Model | Unknown | N/A | |
| PDO-s3DCNNs: Partial Differential Operator Based Steerable 3D CNNs | Unknown | N/A | |
| Optimization-Induced Graph Implicit Nonlinear Diffusion | Unknown | N/A | |
| G$^2$CN: Graph Gaussian Convolution Networks with Concentrated Graph Filters | Unknown | N/A | |
| Auxiliary Learning with Joint Task and Data Scheduling | Unknown | N/A | |
| Adversarial Attack and Defense for Non-Parametric Two-Sample Tests | Unknown | N/A | |
| Self-Supervised Representation Learning via Latent Graph Prediction | Unknown | N/A | |
| Mitigating Neural Network Overconfidence with Logit Normalization | Unknown | N/A | |
| Open-Sampling: Exploring Out-of-Distribution data for Re-balancing Long-tailed datasets | Unknown | N/A | |
| Robustness and Accuracy Could Be Reconcilable by (Proper) Definition | Unknown | N/A | |
| Online Learning with Knapsacks: the Best of Both Worlds | Unknown | N/A | |
| Safe Learning in Tree-Form Sequential Decision Making: Handling Hard and Soft Constraints | Unknown | N/A | |
| UnderGrad: A Universal Black-Box Optimization Method with Almost Dimension-Free Convergence Rate Guarantees | Unknown | N/A | |
| AdaGrad Avoids Saddle Points | Unknown | N/A | |
| PLATINUM: Semi-Supervised Model Agnostic Meta-Learning using Submodular Mutual Information | Unknown | N/A | |
| BabelTower: Learning to Auto-parallelized Program Translation | Unknown | N/A | |
| PMIC: Improving Multi-Agent Reinforcement Learning with Progressive Mutual Information Collaboration | Unknown | N/A | |
| Fast Relative Entropy Coding with A* coding | Unknown | N/A | |
| LIDL: Local Intrinsic Dimension Estimation Using Approximate Likelihood | Unknown | N/A | |
| Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval | Unknown | N/A | |
| Deep equilibrium networks are sensitive to initialization statistics | Unknown | N/A | |
| Online Algorithms with Multiple Predictions | Unknown | N/A | |
| Batch Greenkhorn Algorithm for Entropic-Regularized Multimarginal Optimal Transport: Linear Rate of Convergence and Iteration Complexity | Unknown | N/A | |
| Dataset Condensation via Efficient Synthetic-Data Parameterization | Unknown | N/A | |
| Deep Neural Network Fusion via Graph Matching with Applications to Model Ensemble and Federated Learning | Unknown | N/A | |
| Batched Dueling Bandits | Unknown | N/A | |
| Regret Bounds for Stochastic Shortest Path Problems with Linear Function Approximation | Unknown | N/A | |
| High Probability Guarantees for Nonconvex Stochastic Gradient Descent with Heavy Tails | Unknown | N/A | |
| From Dirichlet to Rubin: Optimistic Exploration in RL without Bonuses | Unknown | N/A | |
| Biased Gradient Estimate with Drastic Variance Reduction for Meta Reinforcement Learning | Unknown | N/A | |
| Gradient Descent on Neurons and its Link to Approximate Second-order Optimization | Unknown | N/A | |
| Toward Compositional Generalization in Object-Oriented World Modeling | Unknown | N/A | |
| C-MinHash: Improving Minwise Hashing with Circulant Permutation | Unknown | N/A | |
| FOCUS: Familiar Objects in Common and Uncommon Settings | Unknown | N/A | |
| Deep Networks on Toroids: Removing Symmetries Reveals the Structure of Flat Regions in the Landscape Geometry | Unknown | N/A | |
| IGLUE: A Benchmark for Transfer Learning across Modalities, Tasks, and Languages | Unknown | N/A | |
| (Non-)Convergence Results for Predictive Coding Networks | Unknown | N/A | |
| Monarch: Expressive Structured Matrices for Efficient and Accurate Training | Unknown | N/A | |
| Forget-free Continual Learning with Winning Subnetworks | Unknown | N/A | |
| Constraint-based graph network simulator | Unknown | N/A | |
| OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework | Unknown | N/A | |
| Online Active Regression | Unknown | N/A | |
| Achieving Fairness at No Utility Cost via Data Reweighing with Influence | Unknown | N/A | |
| Additive Gaussian Processes Revisited | Unknown | N/A | |
| Neural Network Poisson Models for Behavioural and Neural Spike Train Data | Unknown | N/A | |
| Rotting Infinitely Many-Armed Bandits | Unknown | N/A | |
| Low-Complexity Deep Convolutional Neural Networks on Fully Homomorphic Encryption Using Multiplexed Parallel Convolutions | Unknown | N/A | |
| Scalable Spike-and-Slab | Unknown | N/A | |
| Active Nearest Neighbor Regression Through Delaunay Refinement | Unknown | N/A | |
| Active Multi-Task Representation Learning | Unknown | N/A | |
| Utility Theory for Sequential Decision Making | Unknown | N/A | |
| Generative Coarse-Graining of Molecular Conformations | Unknown | N/A | |
| Path-Aware and Structure-Preserving Generation of Synthetically Accessible Molecules | Unknown | N/A | |
| Improving Transformers with Probabilistic Attention Keys | Unknown | N/A | |
| Cross-Space Active Learning on Graph Convolutional Networks | Unknown | N/A | |
| Public Data-Assisted Mirror Descent for Private Model Training | Unknown | N/A | |
| Thompson Sampling for (Combinatorial) Pure Exploration | Unknown | N/A | |
| Greedy when Sure and Conservative when Uncertain about the Opponents | Unknown | N/A | |
| Dual Decomposition of Convex Optimization Layers for Consistent Attention in Medical Images | Unknown | N/A | |
| Online Balanced Experimental Design | Unknown | N/A | |
| Prioritized Training on Points that are Learnable, Worth Learning, and not yet Learnt | Unknown | N/A | |
| What Language Model Architecture and Pretraining Objective Works Best for Zero-Shot Generalization? | Unknown | N/A | |
| 3PC: Three Point Compressors for Communication-Efficient Distributed Training and a Better Theory for Lazy Aggregation | Unknown | N/A | |
| TPC: Transformation-Specific Smoothing for Point Cloud Models | Unknown | N/A | |
| Large-Scale Graph Neural Architecture Search | Unknown | N/A | |
| Random Forest Density Estimation | Unknown | N/A | |
| Particle Transformer for Jet Tagging | Unknown | N/A | |
| A Reduction from Linear Contextual Bandits Lower Bounds to Estimations Lower Bounds | Unknown | N/A | |
| Knowledge Base Question Answering by Case-based Reasoning over Subgraphs | Unknown | N/A | |
| Communication-efficient Distributed Learning for Large Batch Optimization | Unknown | N/A | |
| Revisiting Contrastive Learning through the Lens of Neighborhood Component Analysis: an Integrated Framework | Unknown | N/A | |
| Metric-Fair Active Learning | Unknown | N/A | |
| Thresholded Lasso Bandit | Unknown | N/A | |
| Kill a Bird with Two Stones: Closing the Convergence Gaps in Non-Strongly Convex Optimization by Directly Accelerated SVRG with Double Compensation and Snapshots | Unknown | N/A | |
| Benchmarking and Analyzing Point Cloud Classification under Corruptions | Unknown | N/A | |
| Position Prediction as an Effective Pretraining Strategy | Unknown | N/A | |
| Generalizing to New Physical Systems via Context-Informed Dynamics Model | Unknown | N/A | |
| Multi-Level Branched Regularization for Federated Learning | Unknown | N/A | |
| Efficient Variance Reduction for Meta-learning | Unknown | N/A | |
| ContentVec: An Improved Self-Supervised Speech Representation by Disentangling Speakers | Unknown | N/A | |
| Double Sampling Randomized Smoothing | Unknown | N/A | |
| Accelerating Bayesian Optimization for Biological Sequence Design with Denoising Autoencoders | Unknown | N/A | |
| History Compression via Language Models in Reinforcement Learning | Unknown | N/A | |
| Non-Vacuous Generalisation Bounds for Shallow Neural Networks | Unknown | N/A | |
ICML 2023
| Title | Author | PDF_Link | Code_URL |
|---|---|---|---|
| Sample Complexity Bounds for Learning High-dimensional Simplices in Noisy Regimes | Unknown | N/A | |
| Grounding Language Models to Images for Multimodal Inputs and Outputs | Unknown | N/A | |
| Training-Free Neural Active Learning with Initialization-Robustness Guarantees | Unknown | N/A | |
| Scaling Vision Transformers to 22 Billion Parameters | Unknown | N/A | |
| ED-Batch: Efficient Automatic Batching of Dynamic Neural Networks via Learned Finite State Machines | Unknown | N/A | |
| Chameleon: Adapting to Peer Images for Planting Durable Backdoors in Federated Learning | Unknown | N/A | |
| Partial Optimality in Cubic Correlation Clustering | Unknown | N/A | |
| Poisoning Language Models During Instruction Tuning | Unknown | N/A | |
| Learning Distributions over Quantum Measurement Outcomes | Unknown | N/A | |
| Recasting Self-Attention with Holographic Reduced Representations | Unknown | N/A | |
| Wasserstein Barycenter Matching for Graph Size Generalization of Message Passing Neural Networks | Unknown | N/A | |
| Unveiling The Mask of Position-Information Pattern Through the Mist of Image Features | Unknown | N/A | |
| Fast Federated Machine Unlearning with Nonlinear Functional Theory | Unknown | N/A | |
| End-to-End Full-Atom Antibody Design | Unknown | N/A | |
| Dimension-independent Certified Neural Network Watermarks via Mollifier Smoothing | Unknown | N/A | |
| Efficient Personalized Federated Learning via Sparse Model-Adaptation | Unknown | N/A | |
| Byzantine-Robust Learning on Heterogeneous Data via Gradient Splitting | Unknown | N/A | |
| Equivariant Polynomials for Graph Neural Networks | Unknown | N/A | |
| Invariant Slot Attention: Object Discovery with Slot-Centric Reference Frames | Unknown | N/A | |
| Fed-CBS: A Heterogeneity-Aware Client Sampling Mechanism for Federated Learning via Class-Imbalance Reduction | Unknown | N/A | |
| QASA: Advanced Question Answering on Scientific Articles | Unknown | N/A | |
| Set-membership Belief State-based Reinforcement Learning for POMDPs | Unknown | N/A | |
| Towards Unbiased Training in Federated Open-world Semi-supervised Learning | Unknown | N/A | |
| NP-SemiSeg: When Neural Processes meet Semi-Supervised Semantic Segmentation | Unknown | N/A | |
| Temporally Consistent Transformers for Video Generation | Unknown | N/A | |
| Decentralized SGD and Average-direction SAM are Asymptotically Equivalent | Unknown | N/A | |
| Tilted Sparse Additive Models | Unknown | N/A | |
| Test-Time Style Shifting: Handling Arbitrary Styles in Domain Generalization | Unknown | N/A | |
| Which is Better for Learning with Noisy Labels: The Semi-supervised Method or Modeling Label Noise? | Unknown | N/A | |
| A Simple Zero-shot Prompt Weighting Technique to Improve Prompt Ensembling in Text-Image Models | Unknown | N/A | |
| A Coupled Flow Approach to Imitation Learning | Unknown | N/A | |
| On Strengthening and Defending Graph Reconstruction Attack with Markov Chain Approximation | Unknown | N/A | |
| Adaptive Annealed Importance Sampling with Constant Rate Progress | Unknown | N/A | |
| DoCoFL: Downlink Compression for Cross-Device Federated Learning | Unknown | N/A | |
| Topological Point Cloud Clustering | Unknown | N/A | |
| Constant Matters: Fine-grained Error Bound on Differentially Private Continual Observation | Unknown | N/A | |
| PixelAsParam: A Gradient View on Diffusion Sampling with Guidance | Unknown | N/A | |
| Adaptive Whitening in Neural Populations with Gain-modulating Interneurons | Unknown | N/A | |
| The Hessian perspective into the Nature of Convolutional Neural Networks | Unknown | N/A | |
| Reprogramming Pretrained Language Models for Antibody Sequence Infilling | Unknown | N/A | |
| Unifying Nesterov's Accelerated Gradient Methods for Convex and Strongly Convex Objective Functions | Unknown | N/A | |
| Shiftable Context: Addressing Training-Inference Context Mismatch in Simultaneous Speech Translation | Unknown | N/A | |
| Complementary Attention for Multi-Agent Reinforcement Learning | Unknown | N/A | |
| MultiRobustBench: Benchmarking Robustness Against Multiple Attacks | Unknown | N/A | |
| Scaling Laws for Reward Model Overoptimization | Unknown | N/A | |
| Reflected Diffusion Models | Unknown | N/A | |
| LEVER: Learning to Verify Language-to-Code Generation with Execution | Unknown | N/A | |
| Learning to Bid in Repeated First-Price Auctions with Budgets | Unknown | N/A | |
| Semi-Dual Unbalanced Quadratic Optimal Transport: fast statistical rates and convergent algorithm. | Unknown | N/A | |
| Multi-task Representation Learning for Pure Exploration in Linear Bandits | Unknown | N/A | |
| Online Restless Bandits with Unobserved States | Unknown | N/A | |
| Banker Online Mirror Descent: A Universal Approach for Delayed Online Bandit Learning | Unknown | N/A | |
| Out-of-Domain Robustness via Targeted Augmentations | Unknown | N/A | |
| Tuning Language Models as Training Data Generators for Augmentation-Enhanced Few-Shot Learning | Unknown | N/A | |
| Test-time Adaptation with Slot-Centric Models | Unknown | N/A | |
| Meta-SAGE: Scale Meta-Learning Scheduled Adaptation with Guided Exploration for Mitigating Scale Shift on Combinatorial Optimization | Unknown | N/A | |
| The Blessing of Heterogeneity in Federated Q-Learning: Linear Speedup and Beyond | Unknown | N/A | |
| Pre-computed memory or on-the-fly encoding? A hybrid approach to retrieval augmentation makes the most of your compute | Unknown | N/A | |
| Conditions and Assumptions for Constraint-based Causal Structure Learning | Unknown | N/A | |
| Fast Online Node Labeling for Very Large Graphs | Unknown | N/A | |
| Selective Machine Learning of the Average Treatment Effect with an Invalid Instrumental Variable | Unknown | N/A | |
| What Makes Entities Similar? A Similarity Flooding Perspective for Multi-sourced Knowledge Graph Embeddings | Unknown | N/A | |
| Drug Discovery under Covariate Shift with Domain-Informed Prior Distributions over Functions | Unknown | N/A | |
| Raising the Cost of Malicious AI-Powered Image Editing | Unknown | N/A | |
| Competitive Gradient Optimization | Unknown | N/A | |
| Graph Inductive Biases in Transformers without Message Passing | Unknown | N/A | |
| Fast Sampling of Diffusion Models via Operator Learning | Unknown | N/A | |
| Model Ratatouille: Recycling Diverse Models for Out-of-Distribution Generalization | Unknown | N/A | |
| Offline Reinforcement Learning with Closed-Form Policy Improvement Operators | Unknown | N/A | |
| Efficient and Degree-Guided Graph Generation via Discrete Diffusion Modeling | Unknown | N/A | |
| On the convergence of the MLE as an estimator of the learning rate in the Exp3 algorithm | Unknown | N/A | |
| Revisiting Discriminative vs. Generative Classifiers: Theory and Implications | Unknown | N/A | |
| Optimizing Hyperparameters with Conformal Quantile Regression | Unknown | N/A | |
| Optimizing DDPM Sampling with Shortcut Fine-Tuning | Unknown | N/A | |
| Federated Conformal Predictors for Distributed Uncertainty Quantification | Unknown | N/A | |
| Future-conditioned Unsupervised Pretraining for Decision Transformer | Unknown | N/A | |
| Learning Subpocket Prototypes for Generalizable Structure-based Drug Design | Unknown | N/A | |
| Synthetic Data, Real Errors: How (Not) to Publish and Use Synthetic Data | Unknown | N/A | |
| Paging with Succinct Predictions | Unknown | N/A | |
| Optimal Shrinkage for Distributed Second-Order Optimization | Unknown | N/A | |
| A new near-linear time algorithm for k-nearest neighbor search using a compressed cover tree | Unknown | N/A | |
| Analyzing Convergence in Quantum Neural Networks: Deviations from Neural Tangent Kernels | Unknown | N/A | |
| On Distribution Dependent Sub-Logarithmic Query Time of Learned Indexing | Unknown | N/A | |
| Principled Reinforcement Learning with Human Feedback from Pairwise or K-wise Comparisons | Unknown | N/A | |
| Beyond Reward: Offline Preference-guided Policy Optimization | Unknown | N/A | |
| Cold Analysis of Rao-Blackwellized Straight-Through Gumbel-Softmax Gradient Estimator | Unknown | N/A | |
| Correcting discount-factor mismatch in on-policy policy gradient methods | Unknown | N/A | |
| The multimarginal optimal transport formulation of adversarial multiclass classification | Unknown | N/A | |
| CLIPood: Generalizing CLIP to Out-of-Distributions | Unknown | N/A | |
| A Law of Robustness beyond Isoperimetry | Unknown | N/A | |
| Differential Privacy has Bounded Impact on Fairness in Classification | Unknown | N/A | |
| Neural Status Registers | Unknown | N/A | |
| Dimensionality Reduction for General KDE Mode Finding | Unknown | N/A | |
| Which Invariance Should We Transfer? A Causal Minimax Learning Approach | Unknown | N/A | |
| Does a Neural Network Really Encode Symbolic Concepts? | Unknown | N/A | |
| AdaBoost is not an Optimal Weak to Strong Learner | Unknown | N/A | |
| The Fast Johnson-Lindenstrauss Transform Is Even Faster | Unknown | N/A | |
| Nonlinear Advantage: Trained Networks Might Not Be As Complex as You Think | Unknown | N/A | |
| Learning the Dynamics of Sparsely Observed Interacting Systems | Unknown | N/A | |
| Constrained Decision Transformer for Offline Safe Reinforcement Learning | Unknown | N/A | |
| Low-Switching Policy Gradient with Exploration via Online Sensitivity Sampling | Unknown | N/A | |
| LinSATNet: The Positive Linear Satisfiability Neural Networks | Unknown | N/A | |
| GEAR: A GPU-Centric Experience Replay System for Large Reinforcement Learning Models | Unknown | N/A | |
| Consistency Models | Unknown | N/A | |
| Comparison of meta-learners for estimating multi-valued treatment heterogeneous effects | Unknown | N/A | |
| Self-supervised learning of Split Invariant Equivariant representations | Unknown | N/A | |
| Privacy-Aware Compression for Federated Learning Through Numerical Mechanism Design | Unknown | N/A | |
| Gaussian processes at the Helm(holtz): A more fluid model for ocean currents | Unknown | N/A | |
| A Two-Stage Active Learning Algorithm for k-Nearest Neighbors | Unknown | N/A | |
| Data-Copying in Generative Models: A Formal Framework | Unknown | N/A | |
| Generalized Reductions: Making any Hierarchical Clustering Fair and Balanced with Low Cost | Unknown | N/A | |
| Accelerated Primal-Dual Methods for Convex-Strongly-Concave Saddle Point Problems | Unknown | N/A | |
| From Noisy Fixed-Point Iterations to Private ADMM for Centralized and Federated Learning | Unknown | N/A | |
| Multi-Symmetry Ensembles: Improving Diversity and Generalization via Opposing Symmetries | Unknown | N/A | |
| Towards Theoretical Understanding of Inverse Reinforcement Learning | Unknown | N/A | |
| Accelerated Cyclic Coordinate Dual Averaging with Extrapolation for Composite Convex Optimization | Unknown | N/A | |
| Learning Preconditioners for Conjugate Gradient PDE Solvers | Unknown | N/A | |
| Anchor Sampling for Federated Learning with Partial Client Participation | Unknown | N/A | |
| MG-GNN: Multigrid Graph Neural Networks for Learning Multilevel Domain Decomposition Methods | Unknown | N/A | |
| Human-Timescale Adaptation in an Open-Ended Task Space | Unknown | N/A | |
| Improving the Model Consistency of Decentralized Federated Learning | Unknown | N/A | |
| Men Also Do Laundry: Multi-Attribute Bias Amplification | Unknown | N/A | |
| Auxiliary Modality Learning with Generalized Curriculum Distillation | Unknown | N/A | |
| ELSA: Efficient Label Shift Adaptation through the Lens of Semiparametric Models | Unknown | N/A | |
| SMURF-THP: Score Matching-based UnceRtainty quantiFication for Transformer Hawkes Process | Unknown | N/A | |
| Prometheus: Taming Sample and Communication Complexities in Constrained Decentralized Stochastic Bilevel Learning | Unknown | N/A | |
| Live in the Moment: Learning Dynamics Model Adapted to Evolving Policy | Unknown | N/A | |
| A General Representation Learning Framework with Generalization Performance Guarantees | Unknown | N/A | |
| SAAL: Sharpness-Aware Active Learning | Unknown | N/A | |
| Brainformers: Trading Simplicity for Efficiency | Unknown | N/A | |
| Multiple Thinking Achieving Meta-Ability Decoupling for Object Navigation | Unknown | N/A | |
| Normalizing Flows for Interventional Density Estimation | Unknown | N/A | |
| Is Overfitting Necessary for Implicit Video Representation? | Unknown | N/A | |
| Fast Combinatorial Algorithms for Min Max Correlation Clustering | Unknown | N/A | |
| Scaling Laws for Generative Mixed-Modal Language Models | Unknown | N/A | |
| DSGD-CECA: Decentralized SGD with Communication-Optimal Exact Consensus Algorithm | Unknown | N/A | |
| A Unified Audio-Visual Learning Framework for Localization, Separation, and Recognition | Unknown | N/A | |
| Memory-Based Meta-Learning on Non-Stationary Distributions | Unknown | N/A | |
| LipsNet: A Smooth and Robust Neural Network with Adaptive Lipschitz Constant for High Accuracy Optimal Control | Unknown | N/A | |
| Context-Aware Bayesian Network Actor-Critic Methods for Cooperative Multi-Agent Reinforcement Learning | Unknown | N/A | |
| Scalable Set Encoding with Universal Mini-Batch Consistency and Unbiased Full Set Gradient Approximation | Unknown | N/A | |
| Margin-based Neural Network Watermarking | Unknown | N/A | |
| Vector Quantized Wasserstein Auto-Encoder | Unknown | N/A | |
| Unleashing Mask: Explore the Intrinsic Out-of-Distribution Detection Capability | Unknown | N/A | |
| On the Importance of Feature Decorrelation for Unsupervised Representation Learning in Reinforcement Learning | Unknown | N/A | |
| Geometric Autoencoders - What You See is What You Decode | Unknown | N/A | |
| Estimating Heterogeneous Treatment Effects: Mutual Information Bounds and Learning Algorithms | Unknown | N/A | |
| FAIRER: Fairness as Decision Rationale Alignment | Unknown | N/A | |
| SE(3) diffusion model with application to protein backbone generation | Unknown | N/A | |
| Performative Recommendation: Diversifying Content via Strategic Incentives | Unknown | N/A | |
| How Does Information Bottleneck Help Deep Learning? | Unknown | N/A | |
| Momentum Ensures Convergence of SIGNSGD under Weaker Assumptions | Unknown | N/A | |
| Bidirectional Adaptation for Robust Semi-Supervised Learning with Inconsistent Data Distributions | Unknown | N/A | |
| OpenFE: Automated Feature Generation with Expert-level Performance | Unknown | N/A | |
| Model-based Reinforcement Learning with Scalable Composite Policy Gradient Estimators | Unknown | N/A | |
| On the Expressive Power of Geometric Graph Neural Networks | Unknown | N/A | |
| When and How Does Known Class Help Discover Unknown Ones? Provable Understanding Through Spectral Analysis | Unknown | N/A | |
| Eliminating Adversarial Noise via Information Discard and Robust Representation Restoration | Unknown | N/A | |
| Optimal No-Regret Learning for One-Sided Lipschitz Functions | Unknown | N/A | |
| Horizon-Free and Variance-Dependent Reinforcement Learning for Latent Markov Decision Processes | Unknown | N/A | |
| A Nearly-Optimal Bound for Fast Regression with $\ell_\infty$ Guarantee | Unknown | N/A | |
| Simple and Fast Group Robustness by Automatic Feature Reweighting | Unknown | N/A | |
| XTab: Cross-table Pretraining for Tabular Transformers | Unknown | N/A | |
| Controlled Text Generation with Natural Language Instructions | Unknown | N/A | |
| Adversarial robustness of amortized Bayesian inference | Unknown | N/A | |
| Explaining Reinforcement Learning with Shapley Values | Unknown | N/A | |
| Explaining the effects of non-convergent MCMC in the training of Energy-Based Models | Unknown | N/A | |
| Exploring Chemical Space with Score-based Out-of-distribution Generation | Unknown | N/A | |
| Second-Order Optimization with Lazy Hessians | Unknown | N/A | |
| Great Models Think Alike: Improving Model Reliability via Inter-Model Latent Agreement | Unknown | N/A | |
| Diffusion Models as Artists: Are we Closing the Gap between Humans and Machines? | Unknown | N/A | |
| Hindsight Learning for MDPs with Exogenous Inputs | Unknown | N/A | |
| Brauer's Group Equivariant Neural Networks | Unknown | N/A | |
| Constrained Efficient Global Optimization of Expensive Black-box Functions | Unknown | N/A | |
| Auto-Differentiation of Relational Computations for Very Large Scale Machine Learning | Unknown | N/A | |
| A Deep Conjugate Direction Method for Iteratively Solving Linear Systems | Unknown | N/A | |
| How Powerful are Shallow Neural Networks with Bandlimited Random Weights? | Unknown | N/A | |
| Nonparametric Generative Modeling with Conditional Sliced-Wasserstein Flows | Unknown | N/A | |
| The Wisdom of Hindsight Makes Language Models Better Instruction Followers | Unknown | N/A | |
| Exploring Model Dynamics for Accumulative Poisoning Discovery | Unknown | N/A | |
| High-dimensional Clustering onto Hamiltonian Cycle | Unknown | N/A | |
| Bag of Tricks for Training Data Extraction from Language Models | Unknown | N/A | |
| Unearthing InSights into Mars: Unsupervised Source Separation with Limited Data | Unknown | N/A | |
| Faith-Shap: The Faithful Shapley Interaction Index | Unknown | N/A | |
| Curriculum Co-disentangled Representation Learning across Multiple Environments for Social Recommendation | Unknown | N/A | |
| Understand and Modularize Generator Optimization in ELECTRA-style Pretraining | Unknown | N/A | |
| Vertical Federated Graph Neural Network for Recommender System | Unknown | N/A | |
| Practical and Matching Gradient Variance Bounds for Black-Box Variational Bayesian Inference | Unknown | N/A | |
| Concept-based Explanations for Out-of-Distribution Detectors | Unknown | N/A | |
| Understanding Plasticity in Neural Networks | Unknown | N/A | |
| GeCoNeRF: Few-shot Neural Radiance Fields via Geometric Consistency | Unknown | N/A | |
| Improved Learning-Augmented Algorithms for the Multi-Option Ski Rental Problem via Best-Possible Competitive Analysis | Unknown | N/A | |
| The Statistical Benefits of Quantile Temporal-Difference Learning for Value Estimation | Unknown | N/A | |
| Gradient-Free Structured Pruning with Unlabeled Data | Unknown | N/A | |
| Interpretable Neural-Symbolic Concept Reasoning | Unknown | N/A | |
| XAI Beyond Classification: Interpretable Neural Clustering | Unknown | N/A | |
| Blossom: an Anytime Algorithm for Computing Optimal Decision Trees | Unknown | N/A | |
| Less is More: Task-aware Layer-wise Distillation for Language Model Compression | Unknown | N/A | |
| Probabilistic Concept Bottleneck Models | Unknown | N/A | |
| Optimizing Mode Connectivity for Class Incremental Learning | Unknown | N/A | |
| Smart Initial Basis Selection for Linear Programs | Unknown | N/A | |
| Minimum Width of Leaky-ReLU Neural Networks for Uniform Universal Approximation | Unknown | N/A | |
| Rethink DARTS Search Space and Renovate a New Benchmark | Unknown | N/A | |
| Averaged Method of Multipliers for Bi-Level Optimization without Lower-Level Strong Convexity | Unknown | N/A | |
| Understanding Gradient Regularization in Deep Learning: Efficient Finite-Difference Computation and Implicit Bias | Unknown | N/A | |
| Policy Contrastive Imitation Learning | Unknown | N/A | |
| Statistical Inference and A/B Testing for First-Price Pacing Equilibria | Unknown | N/A | |
| Stratified Adversarial Robustness with Rejection | Unknown | N/A | |
| RLang: A Declarative Language for Describing Partial World Knowledge to Reinforcement Learning Agents | Unknown | N/A | |
| Offline Meta Reinforcement Learning with In-Distribution Online Adaptation | Unknown | N/A | |
| Answering Complex Logical Queries on Knowledge Graphs via Query Computation Tree Optimization | Unknown | N/A | |
| Uncertainty Estimation for Molecules: Desiderata and Methods | Unknown | N/A | |
| Transformers Meet Directed Graphs | Unknown | N/A | |
| Modeling Temporal Data as Continuous Functions with Stochastic Process Diffusion | Unknown | N/A | |
| Crafting Training Degradation Distribution for the Accuracy-Generalization Trade-off in Real-World Super-Resolution | Unknown | N/A | |
| SEGA: Structural Entropy Guided Anchor View for Graph Contrastive Learning | Unknown | N/A | |
| Learning Neural Constitutive Laws from Motion Observations for Generalizable PDE Dynamics | Unknown | N/A | |
| Evolving Semantic Prototype Improves Generative Zero-Shot Learning | Unknown | N/A | |
| Enhancing Activity Prediction Models in Drug Discovery with the Ability to Understand Human Language | Unknown | N/A | |
| Contextual Combinatorial Bandits with Probabilistically Triggered Arms | Unknown | N/A | |
| Dataset Distillation with Convexified Implicit Gradients | Unknown | N/A | |
| Structure-informed Language Models Are Protein Designers | Unknown | N/A | |
| Cross-Modal Fine-Tuning: Align then Refine | Unknown | N/A | |
| Prompting Large Language Model for Machine Translation: A Case Study | Unknown | N/A | |
| Patch-level Contrastive Learning via Positional Query for Visual Pre-training | Unknown | N/A | |
| Arithmetic Sampling: Parallel Diverse Decoding for Large Language Models | Unknown | N/A | |
| Linear optimal partial transport embedding | Unknown | N/A | |
| Understanding Int4 Quantization for Language Models: Latency Speedup, Composability, and Failure Cases | Unknown | N/A | |
| Neural Network Approximations of PDEs Beyond Linearity: A Representational Perspective | Unknown | N/A | |
| Counterfactual Identifiability of Bijective Causal Models | Unknown | N/A | |
| Subequivariant Graph Reinforcement Learning in 3D Environments | Unknown | N/A | |
| Generalized Polyak Step Size for First Order Optimization with Momentum | Unknown | N/A | |
| On Investigating the Conservative Property of Score-Based Generative Models | Unknown | N/A | |
| Oscillation-free Quantization for Low-bit Vision Transformers | Unknown | N/A | |
| Leveraging Demonstrations to Improve Online Learning: Quality Matters | Unknown | N/A | |
| Loss-Guided Diffusion Models for Plug-and-Play Controllable Generation | Unknown | N/A | |
| Causal Isotonic Calibration for Heterogeneous Treatment Effects | Unknown | N/A | |
| What is Essential for Unseen Goal Generalization of Offline Goal-conditioned RL? | Unknown | N/A | |
| Towards Robust and Safe Reinforcement Learning with Benign Off-policy Data | Unknown | N/A | |
| Dynamical Linear Bandits | Unknown | N/A | |
| Beyond the Universal Law of Robustness: Sharper Laws for Random Features and Neural Tangent Kernels | Unknown | N/A | |
| Function-Space Regularization in Neural Networks: A Probabilistic Perspective | Unknown | N/A | |
| Feature Directions Matter: Long-Tailed Learning via Rotated Balanced Representation | Unknown | N/A | |
| Interpolation for Robust Learning: Data Augmentation on Wasserstein Geodesics | Unknown | N/A | |
| Streaming Active Learning with Deep Neural Networks | Unknown | N/A | |
| Towards Omni-generalizable Neural Methods for Vehicle Routing Problems | Unknown | N/A | |
| Adversarial Example Does Good: Preventing Painting Imitation from Diffusion Models via Adversarial Examples | Unknown | N/A | |
| Escaping saddle points in zeroth-order optimization: the power of two-point estimators | Unknown | N/A | |
| UMD: Unsupervised Model Detection for X2X Backdoor Attacks | Unknown | N/A | |
| Infusing Lattice Symmetry Priors in Attention Mechanisms for Sample-Efficient Abstract Geometric Reasoning | Unknown | N/A | |
| Kernel QuantTree | Unknown | N/A | |
| Accuracy on the Curve: On the Nonlinear Correlation of ML Performance Between Data Subpopulations | Unknown | N/A | |
| Long-Term Rhythmic Video Soundtracker | Unknown | N/A | |
| Rethinking Warm-Starts with Predictions: Learning Predictions Close to Sets of Optimal Solutions for Faster $\text{L}$-/$\text{L}^\natural$-Convex Function Minimization | Unknown | N/A | |
| Memory-Based Dual Gaussian Processes for Sequential Learning | Unknown | N/A | |
| Semi-Autoregressive Energy Flows: Exploring Likelihood-Free Training of Normalizing Flows | Unknown | N/A | |
| Multi-Task Differential Privacy Under Distribution Skew | Unknown | N/A | |
| Exploiting locality in high-dimensional Factorial hidden Markov models | Unknown | N/A | |
| Inflow, Outflow, and Reciprocity in Machine Learning | Unknown | N/A | |
| End-to-End Learning for Stochastic Optimization: A Bayesian Perspective | Unknown | N/A | |
| Behavior Contrastive Learning for Unsupervised Skill Discovery | Unknown | N/A | |
| Beyond Homophily: Reconstructing Structure for Graph-agnostic Clustering | Unknown | N/A | |
| Project and Forget: Solving Large-Scale Metric Constrained Problems | Unknown | N/A | |
| Target-based Surrogates for Stochastic Optimization | Unknown | N/A | |
| Transformers as Algorithms: Generalization and Stability in In-context Learning | Unknown | N/A | |
| Sequence Modeling with Multiresolution Convolutional Memory | Unknown | N/A | |
| Let's Make Block Coordinate Descent Converge Faster: Faster Greedy Rules, Message-Passing, Active-Set Complexity, and Superlinear Convergence | Unknown | N/A | |
| Near-Optimal Algorithms for Private Online Optimization in the Realizable Regime | Unknown | N/A | |
| Improved Techniques for Maximum Likelihood Estimation for Diffusion ODEs | Unknown | N/A | |
| ILLUME: Rationalizing Vision-Language Models through Human Interactions | Unknown | N/A | |
| On the Convergence Rates of Policy Gradient Methods | Unknown | N/A | |
| Quantifying Human Priors over Social and Navigation Networks | Unknown | N/A | |
| Cluster-Specific Predictions with Multi-Task Gaussian Processes | Unknown | N/A | |
| Disentangled Multiplex Graph Representation Learning | Unknown | N/A | |
| Beyond Lipschitz Smoothness: A Tighter Analysis for Nonconvex Optimization | Unknown | N/A | |
| Graph Generative Model for Benchmarking Graph Neural Networks | Unknown | N/A | |
| A New PHO-rmula for Improved Performance of Semi-Structured Networks | Unknown | N/A | |
| Federated Online and Bandit Convex Optimization | Unknown | N/A | |
| Towards Trustworthy Explanation: On Causal Rationalization | Unknown | N/A | |
| Emergent Agentic Transformer from Chain of Hindsight Experience | Unknown | N/A | |
| Generating Novel, Designable, and Diverse Protein Structures by Equivariantly Diffusing Oriented Residue Clouds | Unknown | N/A | |
| Rigid Body Flows for Sampling Molecular Crystal Structures | Unknown | N/A | |
| Continual Learning in Linear Classification on Separable Data | Unknown | N/A | |
| Hypervolume Knowledge Gradient: A Lookahead Approach for Multi-Objective Bayesian Optimization with Partial Information | Unknown | N/A | |
| Combinatorial Neural Bandits | Unknown | N/A | |
| Distributed Stochastic Gradient Descent: Nonconvexity, Nonsmoothness, and Convergence to Local Minima | Unknown | N/A | |
| Semi-Parametric Contextual Pricing Algorithm using Cox Proportional Hazards Model | Unknown | N/A | |
| When Sparsity Meets Contrastive Models: Less Graph Data Can Bring Better Class-Balanced Representations | Unknown | N/A | |
| Improved Analysis of Score-based Generative Modeling: User-Friendly Bounds under Minimal Smoothness Assumptions | Unknown | N/A | |
| Constrained Optimization via Exact Augmented Lagrangian and Randomized Iterative Sketching | Unknown | N/A | |
| Competing for Shareable Arms in Multi-Player Multi-Armed Bandits | Unknown | N/A | |
| Regularization-free Diffeomorphic Temporal Alignment Nets | Unknown | N/A | |
| CircuitNet: A Generic Neural Network to Realize Universal Circuit Motif Modeling | Unknown | N/A | |
| Learning Physical Models that Can Respect Conservation Laws | Unknown | N/A | |
| Sampling random graph homomorphisms and applications to network data analysis | Unknown | N/A | |
| Cooperative Open-ended Learning Framework for Zero-Shot Coordination | Unknown | N/A | |
| Lazy Agents: A New Perspective on Solving Sparse Reward Problem in Multi-agent Reinforcement Learning | Unknown | N/A | |
| Multi-Modal Classifiers for Open-Vocabulary Object Detection | Unknown | N/A | |
| Optimizing the Collaboration Structure in Cross-Silo Federated Learning | Unknown | N/A | |
| Effective Neural Topic Modeling with Embedding Clustering Regularization | Unknown | N/A | |
| Probably Anytime-Safe Stochastic Combinatorial Semi-Bandits | Unknown | N/A | |
| Shedding a PAC-Bayesian Light on Adaptive Sliced-Wasserstein Distances | Unknown | N/A | |
| LeadFL: Client Self-Defense against Model Poisoning in Federated Learning | Unknown | N/A | |
| Generative Pretraining for Black-Box Optimization | Unknown | N/A | |
| Diffusion Models for Black-Box Optimization | Unknown | N/A | |
| Robust Budget Pacing with a Single Sample | Unknown | N/A | |
| Mu$^2$SLAM: Multitask, Multilingual Speech and Language Models | Unknown | N/A | |
| Thompson Sampling with Diffusion Generative Prior | Unknown | N/A | |
| Revisiting Pseudo-Label for Single-Positive Multi-Label Learning | Unknown | N/A | |
| Progressive Purification for Instance-Dependent Partial Label Learning | Unknown | N/A | |
| Global Convergence of Sub-gradient Method for Robust Matrix Recovery: Small Initialization, Noisy Measurements, and Over-parameterization | Unknown | N/A | |
| Patch-level Routing in Mixture-of-Experts is Provably Sample-efficient for Convolutional Neural Networks | Unknown | N/A | |
| A Complete Expressiveness Hierarchy for Subgraph GNNs via Subgraph Weisfeiler-Lehman Tests | Unknown | N/A | |
| DDGR: Continual Learning with Deep Diffusion-based Generative Replay | Unknown | N/A | |
| Generalized-Smooth Nonconvex Optimization is As Efficient As Smooth Nonconvex Optimization | Unknown | N/A | |
| A General Theory for Federated Optimization with Asynchronous and Heterogeneous Clients Updates | Unknown | N/A | |
| Free-Form Variational Inference for Gaussian Process State-Space Models | Unknown | N/A | |
| Transformed Distribution Matching for Missing Value Imputation | Unknown | N/A | |
| Identifying Useful Learnwares for Heterogeneous Label Spaces | Unknown | N/A | |
| Graph Neural Networks can Recover the Hidden Features Solely from the Graph Structure | Unknown | N/A | |
| Disentangled Generative Models for Robust Prediction of System Dynamics | Unknown | N/A | |
| Simplifying Momentum-based Positive-definite Submanifold Optimization with Applications to Deep Learning | Unknown | N/A | |
| Linearly Constrained Bilevel Optimization: A Smoothed Implicit Gradient Approach | Unknown | N/A | |
| Dirichlet Diffusion Score Model for Biological Sequence Generation | Unknown | N/A | |
| Lifelong Language Pretraining with Distribution-Specialized Experts | Unknown | N/A | |
| Self-Interpretable Time Series Prediction with Counterfactual Explanations | Unknown | N/A | |
| Robust Perception through Equivariance | Unknown | N/A | |
| Is Learning Summary Statistics Necessary for Likelihood-free Inference? | Unknown | N/A | |
| Simplex Random Features | Unknown | N/A | |
| Demonstration-free Autonomous Reinforcement Learning via Implicit and Bidirectional Curriculum | Unknown | N/A | |
| Efficient Bound of Lipschitz Constant for Convolutional Layers by Gram Iteration | Unknown | N/A | |
| Data Poisoning Attacks Against Multimodal Encoders | Unknown | N/A | |
| Properties of the Mallows Model Depending on the Number of Alternatives: A Warning for an Experimentalist | Unknown | N/A | |
| Data-OOB: Out-of-bag Estimate as a Simple and Efficient Data Value | Unknown | N/A | |
| Diagnosis, Feedback, Adaptation: A Human-in-the-Loop Framework for Test-Time Policy Adaptation | Unknown | N/A | |
| Specializing Smaller Language Models towards Multi-Step Reasoning | Unknown | N/A | |
| SeMAIL: Eliminating Distractors in Visual Imitation via Separated Models | Unknown | N/A | |
| Continual Task Allocation in Meta-Policy Network via Sparse Prompting | Unknown | N/A | |
| Structured Cooperative Learning with Graphical Model Priors | Unknown | N/A | |
| Does Continual Learning Equally Forget All Parameters? | Unknown | N/A | |
| InfoDiffusion: Representation Learning Using Information Maximizing Diffusion Models | Unknown | N/A | |
| Learning-Rate-Free Learning by D-Adaptation | Unknown | N/A | |
| EM-Network: Oracle Guided Self-distillation for Sequence Learning | Unknown | N/A | |
| Coordinated Dynamic Bidding in Repeated Second-Price Auctions with Budgets | Unknown | N/A | |
| Implicit Graph Neural Networks: A Monotone Operator Viewpoint | Unknown | N/A | |
| SurProGenes: Survival Risk-Ordered Representation of Cancer Patients and Genes for the Identification of Prognostic Genes | Unknown | N/A | |
| Rockmate: an Efficient, Fast, Automatic and Generic Tool for Re-materialization in PyTorch | Unknown | N/A | |
| Towards Understanding Ensemble Distillation in Federated Learning | Unknown | N/A | |
| On the Convergence of Federated Averaging with Cyclic Client Participation | Unknown | N/A | |
| One-sided Matrix Completion from Two Observations Per Row | Unknown | N/A | |
| Autoregressive Diffusion Model for Graph Generation | Unknown | N/A | |
| Fast Private Kernel Density Estimation via Locality Sensitive Quantization | Unknown | N/A | |
| Statistical Inference on Multi-armed Bandits with Delayed Feedback | Unknown | N/A | |
| Robust Explanation for Free or At the Cost of Faithfulness | Unknown | N/A | |
| Tight Data Access Bounds for Private Top-$k$ Selection | Unknown | N/A | |
| dugMatting: Decomposed-Uncertainty-Guided Matting | Unknown | N/A | |
| R-U-SURE? Uncertainty-Aware Code Suggestions By Maximizing Utility Across Random User Intents | Unknown | N/A | |
| Learning Controllable Degradation for Real-World Super-Resolution via Constrained Flows | Unknown | N/A | |
| Bandit Multi-linear DR-Submodular Maximization and Its Applications on Adversarial Submodular Bandits | Unknown | N/A | |
| Are Random Decompositions all we need in High Dimensional Bayesian Optimisation? | Unknown | N/A | |
| Computational Doob h-transforms for Online Filtering of Discretely Observed Diffusions | Unknown | N/A | |
| Resurrecting Recurrent Neural Networks for Long Sequences | Unknown | N/A | |
| The Value of Out-of-Distribution Data | Unknown | N/A | |
| Meta Optimal Transport | Unknown | N/A | |
| A Picture of the Space of Typical Learnable Tasks | Unknown | N/A | |
| Discover and Cure: Concept-aware Mitigation of Spurious Correlation | Unknown | N/A | |
| Estimating Joint Treatment Effects by Combining Multiple Experiments | Unknown | N/A | |
| Efficient Graph Field Integrators Meet Point Clouds | Unknown | N/A | |
| Towards Reliable Neural Specifications | Unknown | N/A | |
| On Computing Optimal Tree Ensembles | Unknown | N/A | |
| Learning Mixtures of Gaussians with Censored Data | Unknown | N/A | |
| Nonlinear Causal Discovery with Latent Confounders | Unknown | N/A | |
| Functional Neural Networks: Shift invariant models for functional data with applications to EEG classification | Unknown | N/A | |
| When is Realizability Sufficient for Off-Policy Reinforcement Learning? | Unknown | N/A | |
| Provable Benefit of Mixup for Finding Optimal Decision Boundaries | Unknown | N/A | |
| Near-Optimal Cryptographic Hardness of Agnostically Learning Halfspaces and ReLU Regression under Gaussian Marginals | Unknown | N/A | |
| Performative Reinforcement Learning | Unknown | N/A | |
| Neuro-Symbolic Continual Learning: Knowledge, Reasoning Shortcuts and Concept Rehearsal | Unknown | N/A | |
| Learning Control-Oriented Dynamical Structure from Data | Unknown | N/A | |
| Mitigating Propagation Failures in Physics-informed Neural Networks using Retain-Resample-Release (R3) Sampling | Unknown | N/A | |
| Uncertain Evidence in Probabilistic Models and Stochastic Simulators | Unknown | N/A | |
| Open-VCLIP: Transforming CLIP to an Open-vocabulary Video Model via Interpolated Weight Optimization | Unknown | N/A | |
| POUF: Prompt-Oriented Unsupervised Fine-tuning for Large Pre-trained Models | Unknown | N/A | |
| Shape-Guided Dual-Memory Learning for 3D Anomaly Detection | Unknown | N/A | |
| Temporal Label Smoothing for Early Event Prediction | Unknown | N/A | |
| Identifiability and Generalizability in Constrained Inverse Reinforcement Learning | Unknown | N/A | |
| Deep Clustering with Incomplete Noisy Pairwise Annotations: A Geometric Regularization Approach | Unknown | N/A | |
| Learning Rate Schedules in the Presence of Distribution Shift | Unknown | N/A | |
| GOAT: A Global Transformer on Large-scale Graphs | Unknown | N/A | |
| Local Vertex Colouring Graph Neural Networks | Unknown | N/A | |
| A Hybrid Quantum-Classical Approach based on the Hadamard Transform for the Convolutional Layer | Unknown | N/A | |
| PAC-Bayesian Offline Contextual Bandits With Guarantees | Unknown | N/A | |
| On Balancing Bias and Variance in Unsupervised Multi-Source-Free Domain Adaptation | Unknown | N/A | |
| Multi-class Graph Clustering via Approximated Effective $p$-Resistance | Unknown | N/A | |
| A Framework for Adapting Offline Algorithms to Solve Combinatorial Multi-Armed Bandit Problems with Bandit Feedback | Unknown | N/A | |
| Accelerated Stochastic Optimization Methods under Quasar-convexity | Unknown | N/A | |
| Personalized Subgraph Federated Learning | Unknown | N/A | |
| A Kernel Stein Test of Goodness of Fit for Sequential Models | Unknown | N/A | |
| MultiAdam: Parameter-wise Scale-invariant Optimizer for Multiscale Training of Physics-informed Neural Networks | Unknown | N/A | |
| VIMA: Robot Manipulation with Multimodal Prompts | Unknown | N/A | |
| Gradient Descent Finds the Global Optima of Two-Layer Physics-Informed Neural Networks | Unknown | N/A | |
| On User-Level Private Convex Optimization | Unknown | N/A | |
| Tensor Gaussian Process with Contraction for Multi-Channel Imaging Analysis | Unknown | N/A | |
| Learning to Jump: Thinning and Thickening Latent Counts for Generative Modeling | Unknown | N/A | |
| Are labels informative in semi-supervised learning? Estimating and leveraging the missing-data mechanism. | Unknown | N/A | |
| A Neural PDE Solver with Temporal Stencil Modeling | Unknown | N/A | |
| Understanding Oversquashing in GNNs through the Lens of Effective Resistance | Unknown | N/A | |
| The Numerical Stability of Hyperbolic Representation Learning | Unknown | N/A | |
| Interval Bound Interpolation for Few-shot Learning with Few Tasks | Unknown | N/A | |
| A Model-free Closeness-of-influence Test for Features in Supervised Learning | Unknown | N/A | |
| Generalized Disparate Impact for Configurable Fairness Solutions in ML | Unknown | N/A | |
| Truncating Trajectories in Monte Carlo Reinforcement Learning | Unknown | N/A | |
| Trapdoor Normalization with Irreversible Ownership Verification | Unknown | N/A | |
| For Pre-Trained Vision Models in Motor Control, Not All Policy Learning Methods are Created Equal | Unknown | N/A | |
| HyperTuning: Toward Adapting Large Language Models without Back-propagation | Unknown | N/A | |
| Underspecification Presents Challenges for Credibility in Modern Machine Learning | Unknown | N/A | |
| Doubly Optimal No-Regret Learning in Monotone Games | Unknown | N/A | |
| Kernel Sufficient Dimension Reduction and Variable Selection for Compositional Data via Amalgamation | Unknown | N/A | |
| Active Learning based Structural Inference | Unknown | N/A | |
| Curious Replay for Model-based Adaptation | Unknown | N/A | |
| From Temporal to Contemporaneous Iterative Causal Discovery in the Presence of Latent Confounders | Unknown | N/A | |
| The Power of Uniform Sampling for k-Median | Unknown | N/A | |
| Towards Understanding and Reducing Graph Structural Noise for GNNs | Unknown | N/A | |
| BEATs: Audio Pre-Training with Acoustic Tokenizers | Unknown | N/A | |
| Optimistic Planning by Regularized Dynamic Programming | Unknown | N/A | |
| FedAvg Converges to Zero Training Loss Linearly for Overparameterized Multi-Layer Neural Networks | Unknown | N/A | |
| A Closer Look at Self-Supervised Lightweight Vision Transformers | Unknown | N/A | |
| Feed Two Birds with One Scone: Exploiting Wild Data for Both Out-of-Distribution Generalization and Detection | Unknown | N/A | |
| Defects of Convolutional Decoder Networks in Frequency Representation | Unknown | N/A | |
| An Instrumental Variable Approach to Confounded Off-Policy Evaluation | Unknown | N/A | |
| Multi-agent Online Scheduling: MMS Allocations for Indivisible Items | Unknown | N/A | |
| Cooperation in the Latent Space: The Benefits of Adding Mixture Components in Variational Autoencoders | Unknown | N/A | |
| Learning Belief Representations for Partially Observable Deep RL | Unknown | N/A | |
| Discrete Continuous Optimization Framework for Simultaneous Clustering and Training in Mixture Models | Unknown | N/A | |
| Entropy-driven Unsupervised Keypoint Representation Learning in Videos | Unknown | N/A | |
| Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling | Unknown | N/A | |
| Regret Bounds for Markov Decision Processes with Recursive Optimized Certainty Equivalents | Unknown | N/A | |
| A Reinforcement Learning Framework for Dynamic Mediation Analysis | Unknown | N/A | |
| Subset-Based Instance Optimality in Private Estimation | Unknown | N/A | |
| Returning The Favour: When Regression Benefits From Probabilistic Causal Knowledge | Unknown | N/A | |
| Automatic Data Augmentation via Invariance-Constrained Learning | Unknown | N/A | |
| Exponential Smoothing for Off-Policy Learning | Unknown | N/A | |
| Not all Strongly Rayleigh Distributions Have Small Probabilistic Generating Circuits | Unknown | N/A | |
| Adversarial Policies Beat Superhuman Go AIs | Unknown | N/A | |
| On the Impact of Knowledge Distillation for Model Interpretability | Unknown | N/A | |
| Adaptive Barrier Smoothing for First-Order Policy Gradient with Contact Dynamics | Unknown | N/A | |
| On the Correctness of Automatic Differentiation for Neural Networks with Machine-Representable Parameters | Unknown | N/A | |
| Revisiting Simple Regret: Fast Rates for Returning a Good Arm | Unknown | N/A | |
| simple diffusion: End-to-end diffusion for high resolution images | Unknown | N/A | |
| The Optimal Approximation Factors in Misspecified Off-Policy Value Function Estimation | Unknown | N/A | |
| A Fully First-Order Method for Stochastic Bilevel Optimization | Unknown | N/A | |
| Stochastic Gradient Succeeds for Bandits | Unknown | N/A | |
| Regularization and Variance-Weighted Regression Achieves Minimax Optimality in Linear MDPs: Theory and Practice | Unknown | N/A | |
| Towards Learning Geometric Eigen-Lengths Crucial for Fitting Tasks | Unknown | N/A | |
| Parallel Online Clustering of Bandits via Hedonic Game | Unknown | N/A | |
| Long-Tailed Recognition by Mutual Information Maximization between Latent Features and Ground-Truth Labels | Unknown | N/A | |
| Model-Free Robust Average-Reward Reinforcement Learning | Unknown | N/A | |
| Dividing and Conquering a BlackBox to a Mixture of Interpretable Models: Route, Interpret, Repeat | Unknown | N/A | |
| Special Properties of Gradient Descent with Large Learning Rates | Unknown | N/A | |
| Inverse Reinforcement Learning without Reinforcement Learning | Unknown | N/A | |
| Do Embodied Agents Dream of Pixelated Sheep: Embodied Decision Making using Language Guided World Modelling | Unknown | N/A | |
| Regularizing Towards Soft Equivariance Under Mixed Symmetries | Unknown | N/A | |
| Probabilistic Imputation for Time-series Classification with Missing Data | Unknown | N/A | |
| The Virtues of Laziness in Model-based RL: A Unified Objective and Algorithms | Unknown | N/A | |
| Multi-User Reinforcement Learning with Low Rank Rewards | Unknown | N/A | |
| Towards Better Graph Representation Learning with Parameterized Decomposition & Filtering | Unknown | N/A | |
| Short-lived High-volume Bandits | Unknown | N/A | |
| Bidirectional Learning for Offline Model-based Biological Sequence Design | Unknown | N/A | |
| DevFormer: A Symmetric Transformer for Context-Aware Device Placement | Unknown | N/A | |
| Generalization on the Unseen, Logic Reasoning and Degree Curriculum | Unknown | N/A | |
| Accounting For Informative Sampling When Learning to Forecast Treatment Outcomes Over Time | Unknown | N/A | |
| Learning Perturbations to Explain Time Series Predictions | Unknown | N/A | |
| Optimality of Thompson Sampling with Noninformative Priors for Pareto Bandits | Unknown | N/A | |
| GAT: Guided Adversarial Training with Pareto-optimal Auxiliary Tasks | Unknown | N/A | |
| Robust Satisficing MDPs | Unknown | N/A | |
| Robust One-Class Classification with Signed Distance Function using 1-Lipschitz Neural Networks | Unknown | N/A | |
| Weakly Supervised Disentangled Generative Causal Representation Learning | Unknown | N/A | |
| Buying Information for Stochastic Optimization | Unknown | N/A | |
| Neural FIM for learning Fisher information metrics from point cloud data | Unknown | N/A | |
| Lowering the Pre-training Tax for Gradient-based Subset Training: A Lightweight Distributed Pre-Training Toolkit | Unknown | N/A | |
| Learning Dense Correspondences between Photos and Sketches | Unknown | N/A | |
| Synthetic data for model selection | Unknown | N/A | |
| Towards Coherent Image Inpainting Using Denoising Diffusion Implicit Models | Unknown | N/A | |
| Learning to Suggest Breaks: Sustainable Optimization of Long-Term User Engagement | Unknown | N/A | |
| Feature Expansion for Graph Neural Networks | Unknown | N/A | |
| D2Match: Leveraging Deep Learning and Degeneracy for Subgraph Matching | Unknown | N/A | |
| How Do Transformers Learn Topic Structure: Towards a Mechanistic Understanding | Unknown | N/A | |
| Randomized Gaussian Process Upper Confidence Bound with Tighter Bayesian Regret Bounds | Unknown | N/A | |
| Scaling of Class-wise Training Losses for Post-hoc Calibration | Unknown | N/A | |
| Q-Flow: Generative Modeling for Differential Equations of Open Quantum Dynamics with Normalizing Flows | Unknown | N/A | |
| TIPS: Topologically Important Path Sampling for Anytime Neural Networks | Unknown | N/A | |
| On the Effectiveness of Offline RL for Dialogue Response Generation | Unknown | N/A | |
| Learning Antidote Data to Individual Unfairness | Unknown | N/A | |
| On the Stepwise Nature of Self-Supervised Learning | Unknown | N/A | |
| Identification of the Adversary from a Single Adversarial Example | Unknown | N/A | |
| Discover-Then-Rank Unlabeled Support Vectors in the Dual Space for Multi-Class Active Learning | Unknown | N/A | |
| Effectively Using Public Data in Privacy Preserving Machine Learning | Unknown | N/A | |
| Multiplier Bootstrap-based Exploration | Unknown | N/A | |
| Gradient Descent in Neural Networks as Sequential Learning in Reproducing Kernel Banach Space | Unknown | N/A | |
| Pareto Manifold Learning: Tackling multiple tasks via ensembles of single-task models | Unknown | N/A | |
| Uncovering Adversarial Risks of Test-Time Adaptation | Unknown | N/A | |
| Self-supervised Neural Factor Analysis for Disentangling Utterance-level Speech Representations | Unknown | N/A | |
| On Coresets for Clustering in Small Dimensional Euclidean spaces | Unknown | N/A | |
| CLUTR: Curriculum Learning via Unsupervised Task Representation Learning | Unknown | N/A | |
| Feature learning in deep classifiers through Intermediate Neural Collapse | Unknown | N/A | |
| Internet Explorer: Targeted Representation Learning on the Open Web | Unknown | N/A | |
| MetricGAN-OKD: Multi-Metric Optimization of MetricGAN via Online Knowledge Distillation for Speech Enhancement | Unknown | N/A | |
| A Category-theoretical Meta-analysis of Definitions of Disentanglement | Unknown | N/A | |
| Random Classification Noise does not defeat All Convex Potential Boosters Irrespective of Model Choice | Unknown | N/A | |
| Understanding the Role of Feedback in Online Learning with Switching Costs | Unknown | N/A | |
| Differentially Private Episodic Reinforcement Learning with Heavy-tailed Rewards | Unknown | N/A | |
| DIVISION: Memory Efficient Training via Dual Activation Precision | Unknown | N/A | |
| Implicit Regularization Leads to Benign Overfitting for Sparse Linear Regression | Unknown | N/A | |
| Hiding Data Helps: On the Benefits of Masking for Sparse Coding | Unknown | N/A | |
| Improving Statistical Fidelity for Neural Image Compression with Implicit Local Likelihood Models | Unknown | N/A | |
| Provably Learning Diverse Features in Multi-View Data with Midpoint Mixup | Unknown | N/A | |
| Improving Adversarial Robustness by Putting More Regularizations on Less Robust Samples | Unknown | N/A | |
| The Price of Differential Privacy under Continual Observation | Unknown | N/A | |
| Delayed Feedback in Kernel Bandits | Unknown | N/A | |
| Continuous Spatiotemporal Transformer | Unknown | N/A | |
| LSDS++ : Dual Sampling for Accelerated k-means++ | Unknown | N/A | |
| InfoOT: Information Maximizing Optimal Transport | Unknown | N/A | |
| Neural signature kernels as infinite-width-depth-limits of controlled ResNets | Unknown | N/A | |
| Regression with Label Permutation in Generalized Linear Model | Unknown | N/A | |
| Local Optimization Achieves Global Optimality in Multi-Agent Reinforcement Learning | Unknown | N/A | |
| On the Within-Group Fairness of Screening Classifiers | Unknown | N/A | |
| Analysis of Error Feedback in Federated Non-Convex Optimization with Biased Compression: Fast Convergence and Partial Participation | Unknown | N/A | |
| Achieving High Accuracy with PINNs via Energy Natural Gradient Descent | Unknown | N/A | |
| TRAK: Attributing Model Behavior at Scale | Unknown | N/A | |
| Learning to Incentivize Information Acquisition: Proper Scoring Rules Meet Principal-Agent Model | Unknown | N/A | |
| Two-Scale Gradient Descent Ascent Dynamics Finds Mixed Nash Equilibria of Continuous Games: A Mean-Field Perspective | Unknown | N/A | |
| Provably Efficient Representation Learning with Tractable Planning in Low-Rank POMDP | Unknown | N/A | |
| Enforcing Hard Constraints with Soft Barriers: Safe Reinforcement Learning in Unknown Stochastic Environments | Unknown | N/A | |
| Git-Theta: A Git Extension for Collaborative Development of Machine Learning Models | Unknown | N/A | |
| DoG is SGD's Best Friend: A Parameter-Free Dynamic Step Size Schedule | Unknown | N/A | |
| Rethinking Backdoor Attacks | Unknown | N/A | |
| Linear CNNs Discover the Statistical Structure of the Dataset Using Only the Most Dominant Frequencies | Unknown | N/A | |
| Finding the Missing-half: Graph Complementary Learning for Homophily-prone and Heterophily-prone Graphs | Unknown | N/A | |
| $\pi$-Tuning: Transferring Multimodal Foundation Models with Optimal Multi-task Interpolation | Unknown | N/A | |
| Compositional Score Modeling for Simulation-Based Inference | Unknown | N/A | |
| Nearly Optimal Algorithms with Sublinear Computational Complexity for Online Kernel Regression | Unknown | N/A | |
| A Watermark for Large Language Models | Unknown | N/A | |
| Online Prototype Alignment for Few-shot Policy Transfer | Unknown | N/A | |
| MetaModulation: Learning Variational Feature Hierarchies for Few-Shot Learning with Fewer Tasks | Unknown | N/A | |
| Flexible Model Aggregation for Quantile Regression | Unknown | N/A | |
| Looped Transformers as Programmable Computers | Unknown | N/A | |
| Deep Perturbation Learning: Enhancing the Network Performance via Image Perturbations | Unknown | N/A | |
| Improving Adversarial Robustness Through the Contrastive-Guided Diffusion Process | Unknown | N/A | |
| A Gromov–Wasserstein Geometric View of Spectrum-Preserving Graph Coarsening | Unknown | N/A | |
| Spherical Fourier Neural Operators: Learning Stable Dynamics on the Sphere | Unknown | N/A | |
| Double-Weighting for Covariate Shift Adaptation | Unknown | N/A | |
| NTK-approximating MLP Fusion for Efficient Language Model Fine-tuning | Unknown | N/A | |
| How Bad is Top-$K$ Recommendation under Competing Content Creators? | Unknown | N/A | |
| Continuation Path Learning for Homotopy Optimization | Unknown | N/A | |
| Random Matrix Analysis to Balance between Supervised and Unsupervised Learning under the Low Density Separation Assumption | Unknown | N/A | |
| Neural Wasserstein Gradient Flows for Discrepancies with Riesz Kernels | Unknown | N/A | |
| A Critical View of Vision-Based Long-Term Dynamics Prediction Under Environment Misalignment | Unknown | N/A | |
| Aligning Language Models with Preferences through $f$-divergence Minimization | Unknown | N/A | |
| SpotEM: Efficient Video Search for Episodic Memory | Unknown | N/A | |
| Disentangled Multi-Fidelity Deep Bayesian Active Learning | Unknown | N/A | |
| Reinforcement Learning with History Dependent Dynamic Contexts | Unknown | N/A | |
| ModelDiff: A Framework for Comparing Learning Algorithms | Unknown | N/A | |
| MetaDiffuser: Diffusion Model as Conditional Planner for Offline Meta-RL | Unknown | N/A | |
| Randomized Schur Complement Views for Graph Contrastive Learning | Unknown | N/A | |
| AdaptDiffuser: Diffusion Models as Adaptive Self-evolving Planners | Unknown | N/A | |
| A Game-Theoretic Framework for Managing Risk in Multi-Agent Systems | Unknown | N/A | |
| On the Convergence Rate of Gaussianization with Random Rotations | Unknown | N/A | |
| SOM-CPC: Unsupervised Contrastive Learning with Self-Organizing Maps for Structured Representations of High-Rate Time Series | Unknown | N/A | |
| CHiLS: Zero-Shot Image Classification with Hierarchical Label Sets | Unknown | N/A | |
| Revisiting Sampling for Combinatorial Optimization | Unknown | N/A | |
| Nearly Minimax Optimal Reinforcement Learning for Linear Markov Decision Processes | Unknown | N/A | |
| Consistency of Multiple Kernel Clustering | Unknown | N/A | |
| Nearly Minimax Optimal Regret for Learning Linear Mixture Stochastic Shortest Path | Unknown | N/A | |
| Optimal Online Generalized Linear Regression with Stochastic Noise and Its Application to Heteroscedastic Bandits | Unknown | N/A | |
| Non-asymptotic Properties of Individualized Treatment Rules from Sequentially Rule-Adaptive Trials | Unknown | N/A | |
| Long Horizon Temperature Scaling | Unknown | N/A | |
| Improving l1-Certified Robustness via Randomized Smoothing by Leveraging Box Constraints | Unknown | N/A | |
| Using Perturbation to Improve Goodness-of-Fit Tests based on Kernelized Stein Discrepancy | Unknown | N/A | |
| Understanding Self-Predictive Learning for Reinforcement Learning | Unknown | N/A | |
| Online Learning in Stackelberg Games with an Omniscient Follower | Unknown | N/A | |
| Low Complexity Homeomorphic Projection to Ensure Neural-Network Solution Feasibility for Optimization over (Non-)Convex Set | Unknown | N/A | |
| High-dimensional Location Estimation via Norm Concentration for Subgamma Vectors | Unknown | N/A | |
| Fast Inference from Transformers via Speculative Decoding | Unknown | N/A | |
| Semi-Offline Reinforcement Learning for Optimized Text Generation | Unknown | N/A | |
| DRCFS: Doubly Robust Causal Feature Selection | Unknown | N/A | |
| Improving Medical Predictions by Irregular Multimodal Electronic Health Records Modeling | Unknown | N/A | |
| On the Role of Attention in Prompt-tuning | Unknown | N/A | |
| Corruption-Robust Algorithms with Uncertainty Weighting for Nonlinear Contextual Bandits and Markov Decision Processes | Unknown | N/A | |
| Learning to Learn from APIs: Black-Box Data-Free Meta-Learning | Unknown | N/A | |
| Multi-Task Off-Policy Learning from Bandit Feedback | Unknown | N/A | |
| A Statistical Perspective on Retrieval-Based Models | Unknown | N/A | |
| Near-optimal Conservative Exploration in Reinforcement Learning under Episode-wise Constraints | Unknown | N/A | |
| Transformer-based Stagewise Decomposition for Large-Scale Multistage Stochastic Optimization | Unknown | N/A | |
| LegendreTron: Uprising Proper Multiclass Loss Learning | Unknown | N/A | |
| Modality-Agnostic Variational Compression of Implicit Neural Representations | Unknown | N/A | |
| The Persistent Laplacian for Data Science: Evaluating Higher-Order Persistent Spectral Representations of Data | Unknown | N/A | |
| Never mind the metrics---what about the uncertainty? Visualising binary confusion matrix metric distributions to put performance in perspective | Unknown | N/A | |
| SparseGPT: Massive Language Models Can be Accurately Pruned in One-Shot | Unknown | N/A | |
| Off-Policy Average Reward Actor-Critic with Deterministic Policy Search | Unknown | N/A | |
| Quantum Speedups for Zero-Sum Games via Improved Dynamic Gibbs Sampling | Unknown | N/A | |
| Improving Adversarial Robustness of Deep Equilibrium Models with Explicit Regulations Along the Neural Dynamics | Unknown | N/A | |
| Delay-Adapted Policy Optimization and Improved Regret for Adversarial MDP with Delayed Bandit Feedback | Unknown | N/A | |
| Near-Optimal $\Phi$-Regret Learning in Extensive-Form Games | Unknown | N/A | |
| Team Belief DAG: Generalizing the Sequence Form to Team Games for Fast Computation of Correlated Team Max-Min Equilibria via Regret Minimization | Unknown | N/A | |
| Dink-Net: Neural Clustering on Large Graphs | Unknown | N/A | |
| What Can Be Learnt With Wide Convolutional Neural Networks? | Unknown | N/A | |
| Invariance in Policy Optimisation and Partial Identifiability in Reward Learning | Unknown | N/A | |
| SRATTA: Sample Re-ATTribution Attack of Secure Aggregation in Federated Learning. | Unknown | N/A | |
| Theory on Forgetting and Generalization of Continual Learning | Unknown | N/A | |
| Internally Rewarded Reinforcement Learning | Unknown | N/A | |
| Generalization Bounds using Data-Dependent Fractal Dimensions | Unknown | N/A | |
| Flash: Concept Drift Adaptation in Federated Learning | Unknown | N/A | |
| DIFF2: Differential Private Optimization via Gradient Differences for Nonconvex Distributed Learning | Unknown | N/A | |
| Tight and fast generalization error bound of graph embedding in metric space | Unknown | N/A | |
| Primal and Dual Analysis of Entropic Fictitious Play for Finite-sum Problems | Unknown | N/A | |
| Modeling Dynamic Environments with Scene Graph Memory | Unknown | N/A | |
| A Scalable Frank-Wolfe-Based Algorithm for the Max-Cut SDP | Unknown | N/A | |
| High Fidelity Image Counterfactuals with Probabilistic Causal Models | Unknown | N/A | |
| On the Functional Similarity of Robust and Non-Robust Neural Representations | Unknown | N/A | |
| Provably Learning Object-Centric Representations | Unknown | N/A | |
| Open-Vocabulary Universal Image Segmentation with MaskCLIP | Unknown | N/A | |
| Active Policy Improvement from Multiple Black-box Oracles | Unknown | N/A | |
| CAB: Comprehensive Attention Benchmarking on Long Sequence Modeling | Unknown | N/A | |
| Compositional Exemplars for In-context Learning | Unknown | N/A | |
| In Search for a Generalizable Method for Source Free Domain Adaptation | Unknown | N/A | |
| From Relational Pooling to Subgraph GNNs: A Universal Framework for More Expressive Graph Neural Networks | Unknown | N/A | |
| ODS: Test-Time Adaptation in the Presence of Open-World Data Shift | Unknown | N/A | |
| User-level Private Stochastic Convex Optimization with Optimal Rates | Unknown | N/A | |
| NeuralSlice: Neural 3D Triangle Mesh Reconstruction via Slicing 4D Tetrahedral Meshes | Unknown | N/A | |
| Prototype-oriented unsupervised anomaly detection for multivariate time series | Unknown | N/A | |
| Best of Both Worlds Policy Optimization | Unknown | N/A | |
| Bayesian Progressive Deep Topic Model with Knowledge Informed Textual Data Coarsening Process | Unknown | N/A | |
| ESC: Exploration with Soft Commonsense Constraints for Zero-shot Object Navigation | Unknown | N/A | |
| High Probability Convergence of Stochastic Gradient Methods | Unknown | N/A | |
| abess: A Fast Best-Subset Selection Library in Python and R | Unknown | N/A | |
| Covariate balancing using the integral probability metric for causal inference | Unknown | N/A | |
| Scalable Multi-Agent Reinforcement Learning through Intelligent Information Aggregation | Unknown | N/A | |
| Taxonomy-Structured Domain Adaptation | Unknown | N/A | |
| NeuralStagger: Accelerating Physics-constrained Neural PDE Solver with Spatial-temporal Decomposition | Unknown | N/A | |
| Robust Weak Supervision with Variational Auto-Encoders | Unknown | N/A | |
| Delay-agnostic Asynchronous Coordinate Update Algorithm | Unknown | N/A | |
| Task-specific experimental design for treatment effect estimation | Unknown | N/A | |
| Boosting Graph Contrastive Learning via Graph Contrastive Saliency | Unknown | N/A | |
| Explore and Exploit the Diverse Knowledge in Model Zoo for Domain Generalization | Unknown | N/A | |
| Finite-Sample Analysis of Learning High-Dimensional Single ReLU Neuron | Unknown | N/A | |
| Quantized Distributed Training of Large Models with Convergence Guarantees | Unknown | N/A | |
| Cramming: Training a Language Model on a single GPU in one day. | Unknown | N/A | |
| Can We Scale Transformers to Predict Parameters of Diverse ImageNet Models? | Unknown | N/A | |
| Towards Stable and Efficient Adversarial Training against $l_1$ Bounded Adversarial Attacks | Unknown | N/A | |
| Learning Functional Distributions with Private Labels | Unknown | N/A | |
| What do CNNs Learn in the First Layer and Why? A Linear Systems Perspective | Unknown | N/A | |
| Are Equivariant Equilibrium Approximators Beneficial? | Unknown | N/A | |
| Knowledge Hypergraph Embedding Meets Relational Algebra | Unknown | N/A | |
| When do Minimax-fair Learning and Empirical Risk Minimization Coincide? | Unknown | N/A | |
| Random Grid Neural Processes for Parametric Partial Differential Equations | Unknown | N/A | |
| SNeRL: Semantic-aware Neural Radiance Fields for Reinforcement Learning | Unknown | N/A | |
| Weighted Sampling without Replacement for Deep Top-$k$ Classification | Unknown | N/A | |
| Efficient Learning of Mesh-Based Physical Simulation with Bi-Stride Multi-Scale Graph Neural Network | Unknown | N/A | |
| One-vs-the-Rest Loss to Focus on Important Samples in Adversarial Training | Unknown | N/A | |
| Fundamental Tradeoffs in Learning with Prior Information | Unknown | N/A | |
| Model-agnostic Measure of Generalization Difficulty | Unknown | N/A | |
| Scalable Safe Policy Improvement via Monte Carlo Tree Search | Unknown | N/A | |
| Quantum Policy Gradient Algorithm with Optimized Action Decoding | Unknown | N/A | |
| Boosting Offline Reinforcement Learning with Action Preference Query | Unknown | N/A | |
| Tight Certification of Adversarially Trained Neural Networks via Nonconvex Low-Rank Semidefinite Relaxations | Unknown | N/A | |
| Projected Tensor Power Method for Hypergraph Community Recovery | Unknown | N/A | |
| Retrieval-Augmented Multimodal Language Modeling | Unknown | N/A | |
| Neural Network Accelerated Implicit Filtering: Integrating Neural Network Surrogates With Provably Convergent Derivative Free Optimization Methods | Unknown | N/A | |
| BiBench: Benchmarking and Analyzing Network Binarization | Unknown | N/A | |
| On Data Manifolds Entailed by Structural Causal Models | Unknown | N/A | |
| The Acquisition of Physical Knowledge in Generative Neural Networks | Unknown | N/A | |
| Computationally Efficient PAC RL in POMDPs with Latent Determinism and Conditional Embeddings | Unknown | N/A | |
| Federated Heavy Hitter Recovery under Linear Sketching | Unknown | N/A | |
| Make-An-Audio: Text-To-Audio Generation with Prompt-Enhanced Diffusion Models | Unknown | N/A | |
| Equivariance with Learned Canonicalization Functions | Unknown | N/A | |
| FedDisco: Federated Learning with Discrepancy-Aware Collaboration | Unknown | N/A | |
| Gradient-based Wang–Landau Algorithm: A Novel Sampler for Output Distribution of Neural Networks over the Input Space | Unknown | N/A | |
| Federated Adversarial Learning: A Framework with Convergence Analysis | Unknown | N/A | |
| Trajectory-Aware Eligibility Traces for Off-Policy Reinforcement Learning | Unknown | N/A | |
| Towards Learning to Imitate from a Single Video Demonstration | Unknown | N/A | |
| Speeding Up Bellman Ford via Minimum Violation Permutations | Unknown | N/A | |
| Fully Dynamic Submodular Maximization over Matroids | Unknown | N/A | |
| On the Forward Invariance of Neural ODEs | Unknown | N/A | |
| Hybrid Energy Based Model in the Feature Space for Out-of-Distribution Detection | Unknown | N/A | |
| Graph Neural Networks with Learnable and Optimal Polynomial Bases | Unknown | N/A | |
| Masked Bayesian Neural Networks: Theoretical Guarantee and its Posterior Inference | Unknown | N/A | |
| MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation | Unknown | N/A | |
| Multi-View Masked World Models for Visual Robotic Manipulation | Unknown | N/A | |
| Multisample Flow Matching: Straightening Flows with Minibatch Couplings | Unknown | N/A | |
| Policy Mirror Ascent for Efficient and Independent Learning in Mean Field Games | Unknown | N/A | |
| Do Not Train It: A Linear Neural Architecture Search of Graph Neural Networks | Unknown | N/A | |
| Towards Controlled Data Augmentations for Active Learning | Unknown | N/A | |
| Conformal Prediction for Federated Uncertainty Quantification Under Label Shift | Unknown | N/A | |
| Blackout Diffusion: Generative Diffusion Models in Discrete-State Spaces | Unknown | N/A | |
| Differentially Private Stochastic Convex Optimization under a Quantile Loss Function | Unknown | N/A | |
| SDDM: Score-Decomposed Diffusion Models on Manifolds for Unpaired Image-to-Image Translation | Unknown | N/A | |
| In or Out? Fixing ImageNet Out-of-Distribution Detection Evaluation | Unknown | N/A | |
| Over-parametrization via Lifting for Low-rank Matrix Sensing: Conversion of Spurious Solutions to Strict Saddle Points | Unknown | N/A | |
| Diversity-enhancing Generative Network for Few-shot Hypothesis Adaptation | Unknown | N/A | |
| Retrosynthetic Planning with Dual Value Networks | Unknown | N/A | |
| A Critical Revisit of Adversarial Robustness in 3D Point Cloud Recognition with Diffusion-Driven Purification | Unknown | N/A | |
| Cones: Concept Neurons in Diffusion Models for Customized Generation | Unknown | N/A | |
| Finding Generalization Measures by Contrasting Signal and Noise | Unknown | N/A | |
| On Uni-Modal Feature Learning in Supervised Multi-Modal Learning | Unknown | N/A | |
| Personalized Federated Learning under Mixture of Distributions | Unknown | N/A | |
| Evaluating Unsupervised Denoising Requires Unsupervised Metrics | Unknown | N/A | |
| On the Robustness of Randomized Ensembles to Adversarial Perturbations | Unknown | N/A | |
| A Closer Look at the Intervention Procedure of Concept Bottleneck Models | Unknown | N/A | |
| Offline Learning in Markov Games with General Function Approximation | Unknown | N/A | |
| Decoding Layer Saliency in Language Transformers | Unknown | N/A | |
| Nested Elimination: A Simple Algorithm for Best-Item Identification From Choice-Based Feedback | Unknown | N/A | |
| Provable Dynamic Fusion for Low-Quality Multimodal Data | Unknown | N/A | |
| Block Subsampled Randomized Hadamard Transform for Nyström Approximation on Distributed Architectures | Unknown | N/A | |
| One-shot Imitation in a Non-Stationary Environment via Multi-Modal Skill | Unknown | N/A | |
| Conditional Graph Information Bottleneck for Molecular Relational Learning | Unknown | N/A | |
| Domain Adaptation for Time Series Under Feature and Label Shifts | Unknown | N/A | |
| Strategic Classification with Unknown User Manipulations | Unknown | N/A | |
| Simplified Temporal Consistency Reinforcement Learning | Unknown | N/A | |
| Image Restoration with Mean-Reverting Stochastic Differential Equations | Unknown | N/A | |
| Transcendental Idealism of Planner: Evaluating Perception from Planning Perspective for Autonomous Driving | Unknown | N/A | |
| Stochastic Policy Gradient Methods: Improved Sample Complexity for Fisher-non-degenerate Policies | Unknown | N/A | |
| Implicit Jacobian regularization weighted with impurity of probability output | Unknown | N/A | |
| Adversarial Cheap Talk | Unknown | N/A | |
| Slot-VAE: Object-Centric Scene Generation with Slot Attention | Unknown | N/A | |
| Pre-training for Speech Translation: CTC Meets Optimal Transport | Unknown | N/A | |
| H-Likelihood Approach to Deep Neural Networks with Temporal-Spatial Random Effects for High-Cardinality Categorical Features | Unknown | N/A | |
| Fast $(1+\varepsilon)$-Approximation Algorithms for Binary Matrix Factorization | Unknown | N/A | |
| Robust and private stochastic linear bandits | Unknown | N/A | |
| Algorithms for bounding contribution for histogram estimation under user-level privacy | Unknown | N/A | |
| Bidirectional Looking with A Novel Double Exponential Moving Average to Adaptive and Non-adaptive Momentum Optimizers | Unknown | N/A | |
| A Likelihood Approach to Nonparametric Estimation of a Singular Distribution Using Deep Generative Models | Unknown | N/A | |
| IRNeXt: Rethinking Convolutional Network Design for Image Restoration | Unknown | N/A | |
| TabLeak: Tabular Data Leakage in Federated Learning | Unknown | N/A | |
| Efficient Algorithms for Exact Graph Matching on Correlated Stochastic Block Models with Constant Correlation | Unknown | N/A | |
| Bootstrap in High Dimension with Low Computation | Unknown | N/A | |
| Theoretical Guarantees of Learning Ensembling Strategies with Applications to Time Series Forecasting | Unknown | N/A | |
| Simple Disentanglement of Style and Content in Visual Representations | Unknown | N/A | |
| Reparameterized Policy Learning for Multimodal Trajectory Optimization | Unknown | N/A | |
| Variational Curriculum Reinforcement Learning for Unsupervised Discovery of Skills | Unknown | N/A | |
| Universal Physics-Informed Neural Networks: Symbolic Differential Operator Discovery with Sparse Data | Unknown | N/A | |
| Deep Graph Representation Learning and Optimization for Influence Maximization | Unknown | N/A | |
| A Large-Scale Study of Probabilistic Calibration in Neural Network Regression | Unknown | N/A | |
| Global optimality of Elman-type RNNs in the mean-field regime | Unknown | N/A | |
| Proper Losses for Discrete Generative Models | Unknown | N/A | |
| Adversarial Collaborative Learning on Non-IID Features | Unknown | N/A | |
| Multi-Agent Learning from Learners | Unknown | N/A | |
| On Sampling with Approximate Transport Maps | Unknown | N/A | |
| State and parameter learning with PARIS particle Gibbs | Unknown | N/A | |
| Inferring Relational Potentials in Interacting Systems | Unknown | N/A | |
| BiRT: Bio-inspired Replay in Vision Transformers for Continual Learning | Unknown | N/A | |
| The Computational Complexity of Concise Hypersphere Classification | Unknown | N/A | |
| The Test of Tests: A Framework for Differentially Private Hypothesis Testing | Unknown | N/A | |
| CO-BED: Information-Theoretic Contextual Optimization via Bayesian Experimental Design | Unknown | N/A | |
| A Generalization of ViT/MLP-Mixer to Graphs | Unknown | N/A | |
| A Study on Transformer Configuration and Training Objective | Unknown | N/A | |
| Restoration-Degradation Beyond Linear Diffusions: A Non-Asymptotic Analysis For DDIM-type Samplers | Unknown | N/A | |
| Deep Generative Symbolic Regression with Monte-Carlo-Tree-Search | Unknown | N/A | |
| Hyperparameters in Reinforcement Learning and How To Tune Them | Unknown | N/A | |
| Learning Hidden Markov Models When the Locations of Missing Observations are Unknown | Unknown | N/A | |
| Optimizing NOTEARS Objectives via Topological Swaps | Unknown | N/A | |
| Infinite Action Contextual Bandits with Reusable Data Exhaust | Unknown | N/A | |
| Efficient Quantum Algorithms for Quantum Optimal Control | Unknown | N/A | |
| SLAMB: Accelerated Large Batch Training with Sparse Communication | Unknown | N/A | |
| Random Shuffle Transformer for Image Restoration | Unknown | N/A | |
| Conditional Tree Matching for Inference-Time Adaptation of Tree Prediction Models | Unknown | N/A | |
| Robust Situational Reinforcement Learning in Face of Context Disturbances | Unknown | N/A | |
| Can Neural Network Memorization Be Localized? | Unknown | N/A | |
| Exphormer: Sparse Transformers for Graphs | Unknown | N/A | |
| The Unintended Consequences of Discount Regularization: Improving Regularization in Certainty Equivalence Reinforcement Learning | Unknown | N/A | |
| Model-based Offline Reinforcement Learning with Count-based Conservatism | Unknown | N/A | |
| Topologically Faithful Image Segmentation via Induced Matching of Persistence Barcodes | Unknown | N/A | |
| Mastering the Unsupervised Reinforcement Learning Benchmark from Pixels | Unknown | N/A | |
| Fair Neighbor Embedding | Unknown | N/A | |
| Robust Camera Pose Refinement for Multi-Resolution Hash Encoding | Unknown | N/A | |
| Effective and Efficient Structural Inference with Reservoir Computing | Unknown | N/A | |
| Ewald-based Long-Range Message Passing for Molecular Graphs | Unknown | N/A | |
| General Sequential Episodic Memory Model | Unknown | N/A | |
| End-to-end Training of Deep Boltzmann Machines by Unbiased Contrastive Divergence with Local Mode Initialization | Unknown | N/A | |
| Constrained Monotonic Neural Networks | Unknown | N/A | |
| SparseProp: Efficient Sparse Backpropagation for Faster Training of Neural Networks at the Edge | Unknown | N/A | |
| Random Teachers are Good Teachers | Unknown | N/A | |
| Provably Convergent Schrödinger Bridge with Applications to Probabilistic Time Series Imputation | Unknown | N/A | |
| Improving Graph Neural Networks with Learnable Propagation Operators | Unknown | N/A | |
| Nearly-Optimal Hierarchical Clustering for Well-Clustered Graphs | Unknown | N/A | |
| Nonparametric Density Estimation under Distribution Drift | Unknown | N/A | |
| Tensor Decompositions Meet Control Theory: Learning General Mixtures of Linear Dynamical Systems | Unknown | N/A | |
| Exploring the Benefits of Training Expert Language Models over Instruction Tuning | Unknown | N/A | |
| Low-Variance Gradient Estimation in Unrolled Computation Graphs with ES-Single | Unknown | N/A | |
| Difference-in-Differences Meets Tree-based Methods: Heterogeneous Treatment Effects Estimation with Unmeasured Confounding | Unknown | N/A | |
| Scaling Spherical CNNs | Unknown | N/A | |
| AdaNPC: Exploring Non-Parametric Classifier for Test-Time Adaptation | Unknown | N/A | |
| Iterative Approximate Cross-Validation | Unknown | N/A | |
| Conformal Prediction Sets for Graph Neural Networks | Unknown | N/A | |
| Locally Regularized Neural Differential Equations: Some Black Boxes were meant to remain closed! | Unknown | N/A | |
| TabDDPM: Modelling Tabular Data with Diffusion Models | Unknown | N/A | |
| Out-of-Distribution Generalization of Federated Learning via Implicit Invariant Relationships | Unknown | N/A | |
| Distributed Contextual Linear Bandits with Minimax Optimal Communication Cost | Unknown | N/A | |
| Approximate Causal Effect Identification under Weak Confounding | Unknown | N/A | |
| Self-Attention Amortized Distributional Projection Optimization for Sliced Wasserstein Point-Cloud Reconstruction | Unknown | N/A | |
| Probabilistic Categorical Adversarial Attack and Adversarial Training | Unknown | N/A | |
| Alternately Optimized Graph Neural Networks | Unknown | N/A | |
| Complexity of Block Coordinate Descent with Proximal Regularization and Applications to Wasserstein CP-dictionary Learning | Unknown | N/A | |
| Node Embedding from Neural Hamiltonian Orbits in Graph Neural Networks | Unknown | N/A | |
| Robust Speech Recognition via Large-Scale Weak Supervision | Unknown | N/A | |
| A/B Testing in Network Data with Covariate-Adaptive Randomization | Unknown | N/A | |
| Coarse-to-Fine: a Hierarchical Diffusion Model for Molecule Generation in 3D | Unknown | N/A | |
| MEWL: Few-shot multimodal word learning with referential uncertainty | Unknown | N/A | |
| Fair and Robust Estimation of Heterogeneous Treatment Effects for Policy Learning | Unknown | N/A | |
| Forget Unlearning: Towards True Data-Deletion in Machine Learning | Unknown | N/A | |
| Extending Conformal Prediction to Hidden Markov Models with Exact Validity via de Finetti's Theorem for Markov Chains | Unknown | N/A | |
| Gradient Descent Converges Linearly for Logistic Regression on Separable Data | Unknown | N/A | |
| Regions of Reliability in the Evaluation of Multivariate Probabilistic Forecasts | Unknown | N/A | |
| Learning-augmented private algorithms for multiple quantile release | Unknown | N/A | |
| Learning Deep Time-index Models for Time Series Forecasting | Unknown | N/A | |
| Spatial Implicit Neural Representations for Global-Scale Species Mapping | Unknown | N/A | |
| On Preemption and Learning in Stochastic Scheduling | Unknown | N/A | |
| Controlled Differential Equations on Long Sequences via Non-standard Wavelets | Unknown | N/A | |
| LookupFFN: Making Transformers Compute-lite for CPU inference | Unknown | N/A | |
| Optimistic Online Mirror Descent for Bridging Stochastic and Adversarial Online Convex Optimization | Unknown | N/A | |
| Compressing Tabular Data via Latent Variable Estimation | Unknown | N/A | |
| A Connection between One-Step RL and Critic Regularization in Reinforcement Learning | Unknown | N/A | |
| Denoising MCMC for Accelerating Diffusion-Based Generative Models | Unknown | N/A | |
| Linear Time GPs for Inferring Latent Trajectories from Neural Spike Trains | Unknown | N/A | |
| Few-Sample Feature Selection via Feature Manifold Learning | Unknown | N/A | |
| Representation-Driven Reinforcement Learning | Unknown | N/A | |
| How to Trust Your Diffusion Model: A Convex Optimization Approach to Conformal Risk Control | Unknown | N/A | |
| Taming graph kernels with random features | Unknown | N/A | |
| Causal Discovery with Latent Confounders Based on Higher-Order Cumulants | Unknown | N/A | |
| Improved Regret for Efficient Online Reinforcement Learning with Linear Function Approximation | Unknown | N/A | |
| Neural Stochastic Differential Games for Time-series Analysis | Unknown | N/A | |
| Learning Regions of Interest for Bayesian Optimization with Adaptive Level-Set Estimation | Unknown | N/A | |
| Revisiting the Linear-Programming Framework for Offline RL with General Function Approximation | Unknown | N/A | |
| Delayed Bandits: When Do Intermediate Observations Help? | Unknown | N/A | |
| On Over-Squashing in Message Passing Neural Networks: The Impact of Width, Depth, and Topology | Unknown | N/A | |
| Reducing SO(3) Convolutions to SO(2) for Efficient Equivariant GNNs | Unknown | N/A | |
| Q-learning Decision Transformer: Leveraging Dynamic Programming for Conditional Sequence Modelling in Offline RL | Unknown | N/A | |
| PreNAS: Preferred One-Shot Learning Towards Efficient Neural Architecture Search | Unknown | N/A | |
| Adversarially Robust PAC Learnability of Real-Valued Functions | Unknown | N/A | |
| Weighted Flow Diffusion for Local Graph Clustering with Node Attributes: an Algorithm and Statistical Guarantees | Unknown | N/A | |
| Mixing Predictions for Online Metric Algorithms | Unknown | N/A | |
| Reinforcement Learning in Low-rank MDPs with Density Features | Unknown | N/A | |
| The Power of Preconditioning in Overparameterized Low-Rank Matrix Sensing | Unknown | N/A | |
| Sequential Monte Carlo Learning for Time Series Structure Discovery | Unknown | N/A | |
| Multicalibration as Boosting for Regression | Unknown | N/A | |
| Reasons for the Superiority of Stochastic Estimators over Deterministic Ones: Robustness, Consistency and Perceptual Quality | Unknown | N/A | |
| Cut your Losses with Squentropy | Unknown | N/A | |
| Sparse Learning of Dynamical Systems in RKHS: An Operator-Theoretic Approach | Unknown | N/A | |
| K-SHAP: Policy Clustering Algorithm for Anonymous Multi-Agent State-Action Pairs | Unknown | N/A | |
| Towards Understanding Generalization of Macro-AUC in Multi-label Learning | Unknown | N/A | |
| Controllable Neural Symbolic Regression | Unknown | N/A | |
| Stable Estimation of Heterogeneous Treatment Effects | Unknown | N/A | |
| Gradient Descent Monotonically Decreases the Sharpness of Gradient Flow Solutions in Scalar Networks and Beyond | Unknown | N/A | |
| Constraint Reasoning Embedded Structured Prediction | Unknown | N/A | |
| Optimal Convergence Rates for Agnostic Nyström Kernel Learning | Unknown | N/A | |
| Active Ranking of Experts Based on their Performances in Many Tasks | Unknown | N/A | |
| Efficient Parametric Approximations of Neural Network Function Space Distance | Unknown | N/A | |
| Improving Expert Predictions with Conformal Prediction | Unknown | N/A | |
| Sketched Ridgeless Linear Regression: The Role of Downsampling | Unknown | N/A | |
| mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and Video | Unknown | N/A | |
| Shortest Edit Path Crossover: A Theory-driven Solution to the Permutation Problem in Evolutionary Neural Architecture Search | Unknown | N/A | |
| Probabilistic Unrolling: Scalable, Inverse-Free Maximum Likelihood Estimation for Latent Gaussian Models | Unknown | N/A | |
| Fisher Information Embedding for Node and Graph Learning | Unknown | N/A | |
| Efficient Exploration via Epistemic-Risk-Seeking Policy Optimization | Unknown | N/A | |
| Efficient Latency-Aware CNN Depth Compression via Two-Stage Dynamic Programming | Unknown | N/A | |
| Naive imputation implicitly regularizes high-dimensional linear models | Unknown | N/A | |
| Single Point-Based Distributed Zeroth-Order Optimization with a Non-Convex Stochastic Objective Function | Unknown | N/A | |
| Likelihood Adjusted Semidefinite Programs for Clustering Heterogeneous Data | Unknown | N/A | |
| Dual Focal Loss for Calibration | Unknown | N/A | |
| Coin Sampling: Gradient-Based Bayesian Inference without Learning Rates | Unknown | N/A | |
| Distribution Free Domain Generalization | Unknown | N/A | |
| Why Target Networks Stabilise Temporal Difference Methods | Unknown | N/A | |
| Are Gaussian Data All You Need? The Extents and Limits of Universality in High-Dimensional Generalized Linear Estimation | Unknown | N/A | |
| Machine Learning Force Fields with Data Cost Aware Training | Unknown | N/A | |
| Principled Offline RL in the Presence of Rich Exogenous Information | Unknown | N/A | |
| Bayesian online change point detection with Hilbert space approximate Student-t process | Unknown | N/A | |
| BNN-DP: Robustness Certification of Bayesian Neural Networks via Dynamic Programming | Unknown | N/A | |
| Why do Nearest Neighbor Language Models Work? | Unknown | N/A | |
| QAS-Bench: Rethinking Quantum Architecture Search and A Benchmark | Unknown | N/A | |
| Bilevel Optimization with Coupled Decision-Dependent Distributions | Unknown | N/A | |
| Curiosity in Hindsight: Intrinsic Exploration in Stochastic Environments | Unknown | N/A | |
| Variational Mixture of HyperGenerators for Learning Distributions over Functions | Unknown | N/A | |
| Nearly Optimal Competitive Ratio for Online Allocation Problems with Two-sided Resource Constraints and Finite Requests | Unknown | N/A | |
| Statistical Indistinguishability of Learning Algorithms | Unknown | N/A | |
| Graph Mixup with Soft Alignments | Unknown | N/A | |
| Provably Efficient Offline Reinforcement Learning with Perturbed Data Sources | Unknown | N/A | |
| Parallel Neurosymbolic Integration with Concordia | Unknown | N/A | |
| Posterior Sampling for Deep Reinforcement Learning | Unknown | N/A | |
| Reinforcement Learning Can Be More Efficient with Multiple Rewards | Unknown | N/A | |
| 2D-Shapley: A Framework for Fragmented Data Valuation | Unknown | N/A | |
| One-Shot Federated Conformal Prediction | Unknown | N/A | |
| E$(n)$ Equivariant Message Passing Simplicial Networks | Unknown | N/A | |
| Thompson Sampling for High-Dimensional Sparse Linear Contextual Bandits | Unknown | N/A | |
| DeSRA: Detect and Delete the Artifacts of GAN-based Real-World Super-Resolution Models | Unknown | N/A | |
| Recovery Bounds on Class-Based Optimal Transport: A Sum-of-Norms Regularization Framework | Unknown | N/A | |
| From Adaptive Query Release to Machine Unlearning | Unknown | N/A | |
| Recovering Top-Two Answers and Confusion Probability in Multi-Choice Crowdsourcing | Unknown | N/A | |
| Optimal LP Rounding and Linear-Time Approximation Algorithms for Clustering Edge-Colored Hypergraphs | Unknown | N/A | |
| Surrogate Module Learning: Reduce the Gradient Error Accumulation in Training Spiking Neural Networks | Unknown | N/A | |
| Abstract-to-Executable Trajectory Translation for One-Shot Task Generalization | Unknown | N/A | |
| Automatically Auditing Large Language Models via Discrete Optimization | Unknown | N/A | |
| Efficient Transformed Gaussian Processes for Non-Stationary Dependent Multi-class Classification | Unknown | N/A | |
| Learning to Initiate and Reason in Event-Driven Cascading Processes | Unknown | N/A | |
| Robust and Scalable Bayesian Online Changepoint Detection | Unknown | N/A | |
| In Search of Insights, Not Magic Bullets: Towards Demystification of the Model Selection Dilemma in Heterogeneous Treatment Effect Estimation | Unknown | N/A | |
| Graph Positional Encoding via Random Feature Propagation | Unknown | N/A | |
| Estimating Possible Causal Effects with Latent Variables via Adjustment | Unknown | N/A | |
| Benign Overfitting in Deep Neural Networks under Lazy Training | Unknown | N/A | |
| A Near-Optimal Algorithm for Safe Reinforcement Learning Under Instantaneous Hard Constraints | Unknown | N/A | |
| Explainability as statistical inference | Unknown | N/A | |
| Learning Prescriptive ReLU Networks | Unknown | N/A | |
| Learning Dynamic Query Combinations for Transformer-based Object Detection and Segmentation | Unknown | N/A | |
| On the Robustness of Text Vectorizers | Unknown | N/A | |
| Kernel Logistic Regression Approximation of an Understandable ReLU Neural Network | Unknown | N/A | |
| Learning Compiler Pass Orders using Coreset and Normalized Value Prediction | Unknown | N/A | |
| Robustness in Multimodal Learning under Train-Test Modality Mismatch | Unknown | N/A | |
| Achieving Linear Speedup in Non-IID Federated Bilevel Learning | Unknown | N/A | |
| Estimation Beyond Data Reweighting: Kernel Method of Moments | Unknown | N/A | |
| General Covariance Data Augmentation for Neural PDE Solvers | Unknown | N/A | |
| Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles | Unknown | N/A | |
| PASTA: Pessimistic Assortment Optimization | Unknown | N/A | |
| Hypothesis Transfer Learning with Surrogate Classification Losses: Generalization Bounds through Algorithmic Stability | Unknown | N/A | |
| Thompson Sampling with Less Exploration is Fast and Optimal | Unknown | N/A | |
| FARE: Provably Fair Representation Learning with Practical Certificates | Unknown | N/A | |
| Stabilizing Transformer Training by Preventing Attention Entropy Collapse | Unknown | N/A | |
| Optimally-weighted Estimators of the Maximum Mean Discrepancy for Likelihood-Free Inference | Unknown | N/A | |
| Tuning Computer Vision Models With Task Rewards | Unknown | N/A | |
| Quantifying the Variability Collapse of Neural Networks | Unknown | N/A | |
| Polarity Is All You Need to Learn and Transfer Faster | Unknown | N/A | |
| A Unified Optimization Framework of ANN-SNN Conversion: Towards Optimal Mapping from Activation Values to Firing Rates | Unknown | N/A | |
| Faster Gradient-Free Algorithms for Nonsmooth Nonconvex Stochastic Optimization | Unknown | N/A | |
| Learning to Design Analog Circuits to Meet Threshold Specifications | Unknown | N/A | |
| Constrained Causal Bayesian Optimization | Unknown | N/A | |
| Prefer to Classify: Improving Text Classifiers via Auxiliary Preference Learning | Unknown | N/A | |
| A Modern Look at the Relationship between Sharpness and Generalization | Unknown | N/A | |
| Learning Unnormalized Statistical Models via Compositional Optimization | Unknown | N/A | |
| Half-Hop: A graph upsampling approach for slowing down message passing | Unknown | N/A | |
| Probabilistic Contrastive Learning Recovers the Correct Aleatoric Uncertainty of Ambiguous Inputs | Unknown | N/A | |
| Attributing Image Generative Models using Latent Fingerprints | Unknown | N/A | |
| Last Switch Dependent Bandits with Monotone Payoff Functions | Unknown | N/A | |
| Distribution Free Prediction Sets for Node Classification | Unknown | N/A | |
| StriderNet: A Graph Reinforcement Learning Approach to Optimize Atomic Structures on Rough Energy Landscapes | Unknown | N/A | |
| Deterministic equivalent and error universality of deep random features learning | Unknown | N/A | |
| Demystifying Disagreement-on-the-Line in High Dimensions | Unknown | N/A | |
| MixFlows: principled variational inference via mixed flows | Unknown | N/A | |
| GRAFENNE: Learning on Graphs with Heterogeneous and Dynamic Feature Sets | Unknown | N/A | |
| Nesterov Meets Optimism: Rate-Optimal Separable Minimax Optimization | Unknown | N/A | |
| A Kernelized Stein Discrepancy for Biological Sequences | Unknown | N/A | |
| Minimax estimation of discontinuous optimal transport maps: The semi-discrete case | Unknown | N/A | |
| Estimating Causal Effects using a Multi-task Deep Ensemble | Unknown | N/A | |
| Unveiling the Latent Space Geometry of Push-Forward Generative Models | Unknown | N/A | |
| Model-Aware Contrastive Learning: Towards Escaping the Dilemmas | Unknown | N/A | |
| Theoretical Bounds on the Network Community Profile from Low-rank Semi-definite Programming | Unknown | N/A | |
| Conditionally Strongly Log-Concave Generative Models | Unknown | N/A | |
| Nugget: Neural Agglomerative Embeddings of Text | Unknown | N/A | |
| Stein Variational Goal Generation for adaptive Exploration in Multi-Goal Reinforcement Learning | Unknown | N/A | |
| BPipe: Memory-Balanced Pipeline Parallelism for Training Large Language Models | Unknown | N/A | |
| The Role of Entropy and Reconstruction in Multi-View Self-Supervised Learning | Unknown | N/A | |
| PAL: Program-aided Language Models | Unknown | N/A | |
| Causal Proxy Models for Concept-based Model Explanations | Unknown | N/A | |
| Multi-Environment Pretraining Enables Transfer to Action Limited Datasets | Unknown | N/A | |
| Semi-Supervised Offline Reinforcement Learning with Action-Free Trajectories | Unknown | N/A | |
| An Information-Theoretic Analysis of Nonstationary Bandit Learning | Unknown | N/A | |
| GuardHFL: Privacy Guardian for Heterogeneous Federated Learning | Unknown | N/A | |
| Federated Linear Contextual Bandits with User-level Differential Privacy | Unknown | N/A | |
| NerfDiff: Single-image View Synthesis with NeRF-guided Distillation from 3D-aware Diffusion | Unknown | N/A | |
| Context Consistency Regularization for Label Sparsity in Time Series | Unknown | N/A | |
| A Conditional Normalizing Flow for Accelerated Multi-Coil MR Imaging | Unknown | N/A | |
| Evaluating Self-Supervised Learning via Risk Decomposition | Unknown | N/A | |
| Homomorphism AutoEncoder --- Learning Group Structured Representations from Observed Transitions | Unknown | N/A | |
| Global optimality for Euclidean CCCP under Riemannian convexity | Unknown | N/A | |
| Approximation and Estimation Ability of Transformers for Sequence-to-Sequence Functions with Infinite Dimensional Input | Unknown | N/A | |
| Cooperative Multi-Agent Reinforcement Learning: Asynchronous Communication and Linear Function Approximation | Unknown | N/A | |
| PaLM-E: An Embodied Multimodal Language Model | Unknown | N/A | |
| Universal Morphology Control via Contextual Modulation | Unknown | N/A | |
| Dissecting the Effects of SGD Noise in Distinct Regimes of Deep Learning | Unknown | N/A | |
| Learning Control by Iterative Inversion | Unknown | N/A | |
| Uncertainty Estimation by Fisher Information-based Evidential Deep Learning | Unknown | N/A | |
| Do Perceptually Aligned Gradients Imply Robustness? | Unknown | N/A | |
| Analyzing Diffusion as Serial Reproduction | Unknown | N/A | |
| Certified Robust Neural Networks: Generalization and Corruption Resistance | Unknown | N/A | |
| FP-Diffusion: Improving Score-based Diffusion Models by Enforcing the Underlying Score Fokker-Planck Equation | Unknown | N/A | |
| Synthetic Prompting: Generating Chain-of-Thought Demonstrations for Large Language Models | Unknown | N/A | |
| Emergent Asymmetry of Precision and Recall for Measuring Fidelity and Diversity of Generative Models in High Dimensions | Unknown | N/A | |
| MANSA: Learning Fast and Slow in Multi-Agent Systems | Unknown | N/A | |
| Spherical Inducing Features for Orthogonally-Decoupled Gaussian Processes | Unknown | N/A | |
| Generative Decoding of Visual Stimuli | Unknown | N/A | |
| Beyond the Edge of Stability via Two-step Gradient Updates | Unknown | N/A | |
| Few-bit Backward: Quantized Gradients of Activation Functions for Memory Footprint Reduction | Unknown | N/A | |
| Contextual Conservative Interleaving Bandits | Unknown | N/A | |
| Efficient List-Decodable Regression using Batches | Unknown | N/A | |
| Do Machine Learning Models Learn Statistical Rules Inferred from Data? | Unknown | N/A | |
| Emergence of Adaptive Circadian Rhythms in Deep Reinforcement Learning | Unknown | N/A | |
| Deep Temporal Sets with Evidential Reinforced Attentions for Unique Behavioral Pattern Discovery | Unknown | N/A | |
| Learning Optimal Group-structured Individualized Treatment Rules with Many Treatments | Unknown | N/A | |
| Tighter Bounds on the Expressivity of Transformer Encoders | Unknown | N/A | |
| Parameter-Level Soft-Masking for Continual Learning | Unknown | N/A | |
| Learnability and Algorithm for Continual Learning | Unknown | N/A | |
| TGRL: An Algorithm for Teacher Guided Reinforcement Learning | Unknown | N/A | |
| Importance Weighted Expectation-Maximization for Protein Sequence Design | Unknown | N/A | |
| Graph Switching Dynamical Systems | Unknown | N/A | |
| Protecting Language Generation Models via Invisible Watermarking | Unknown | N/A | |
| ReDi: Efficient Learning-Free Diffusion Inference via Trajectory Retrieval | Unknown | N/A | |
| Perturbation Analysis of Neural Collapse | Unknown | N/A | |
| Contrast with Reconstruct: Contrastive 3D Representation Learning Guided by Generative Pretraining | Unknown | N/A | |
| UPop: Unified and Progressive Pruning for Compressing Vision-Language Transformers | Unknown | N/A | |
| Von Mises Mixture Distributions for Molecular Conformation Generation | Unknown | N/A | |
| The Power of Learned Locally Linear Models for Nonlinear Policy Optimization | Unknown | N/A | |
| Straightening Out the Straight-Through Estimator: Overcoming Optimization Challenges in Vector Quantized Networks | Unknown | N/A | |
| System Identification of Neural Systems: If We Got It Right, Would We Know? | Unknown | N/A | |
| Multi-Agent Best Arm Identification with Private Communications | Unknown | N/A | |
| Muse: Text-To-Image Generation via Masked Generative Transformers | Unknown | N/A | |
| Learn to Accumulate Evidence from All Training Samples: Theory and Practice | Unknown | N/A | |
| Pairwise Ranking Losses of Click-Through Rates Prediction for Welfare Maximization in Ad Auctions | Unknown | N/A | |
| Addressing Budget Allocation and Revenue Allocation in Data Market Environments Using an Adaptive Sampling Algorithm | Unknown | N/A | |
| Robust Counterfactual Explanations for Neural Networks With Probabilistic Guarantees | Unknown | N/A | |
| Communication-Constrained Bandits under Additive Gaussian Noise | Unknown | N/A | |
| Everyone's Preference Changes Differently: A Weighted Multi-Interest Model For Retrieval | Unknown | N/A | |
| Poisoning Generative Replay in Continual Learning to Promote Forgetting | Unknown | N/A | |
| Differential Privacy, Linguistic Fairness, and Training Data Influence: Impossibility and Possibility Theorems for Multilingual Language Models | Unknown | N/A | |
| The Regret of Exploration and the Control of Bad Episodes in Reinforcement Learning | Unknown | N/A | |
| STEP: Learning N:M Structured Sparsity Masks from Scratch with Precondition | Unknown | N/A | |
| Supervised Metric Learning to Rank for Retrieval via Contextual Similarity Optimization | Unknown | N/A | |
| Constrained Phi-Equilibria | Unknown | N/A | |
| Optimal Rates and Efficient Algorithms for Online Bayesian Persuasion | Unknown | N/A | |
| Flexible Phase Dynamics for Bio-Plausible Contrastive Learning | Unknown | N/A | |
| How much does Initialization Affect Generalization? | Unknown | N/A | |
| Exploring the Limits of Model-Targeted Indiscriminate Data Poisoning Attacks | Unknown | N/A | |
| ClusterFuG: Clustering Fully connected Graphs by Multicut | Unknown | N/A | |
| Motion Question Answering via Modular Motion Programs | Unknown | N/A | |
| Statistical Foundations of Prior-Data Fitted Networks | Unknown | N/A | |
| RLEG: Vision-Language Representation Learning with Diffusion-based Embedding Generation | Unknown | N/A | |
| Partially Observable Multi-agent RL with (Quasi-)Efficiency: The Blessing of Information Sharing | Unknown | N/A | |
| Linear Causal Disentanglement via Interventions | Unknown | N/A | |
| Neural Algorithmic Reasoning with Causal Regularisation | Unknown | N/A | |
| A theory of representation learning gives a deep generalisation of kernel methods | Unknown | N/A | |
| PromptBoosting: Black-Box Text Classification with Ten Forward Passes | Unknown | N/A | |
| Hierarchies of Reward Machines | Unknown | N/A | |
| Nearly-tight Bounds for Deep Kernel Learning | Unknown | N/A | |
| Simple Embodied Language Learning as a Byproduct of Meta-Reinforcement Learning | Unknown | N/A | |
| Fractional Denoising for 3D Molecular Pre-training | Unknown | N/A | |
| GNN&GBDT-Guided Fast Optimizing Framework for Large-scale Integer Programming | Unknown | N/A | |
| Optimization for Amortized Inverse Problems | Unknown | N/A | |
| Causal Bounds in Quasi-Markovian Graphs | Unknown | N/A | |
| spred: Solving L1 Penalty with SGD | Unknown | N/A | |
| Evidential Interactive Learning for Medical Image Captioning | Unknown | N/A | |
| SegCLIP: Patch Aggregation with Learnable Centers for Open-Vocabulary Semantic Segmentation | Unknown | N/A | |
| RGE: A Repulsive Graph Rectification for Node Classification via Influence | Unknown | N/A | |
| Investigating the Role of Model-Based Learning in Exploration and Transfer | Unknown | N/A | |
| Leveraging Label Non-Uniformity for Node Classification in Graph Neural Networks | Unknown | N/A | |
| Efficient RL via Disentangled Environment and Agent Representations | Unknown | N/A | |
| Divide and Conquer Dynamic Programming: An Almost Linear Time Change Point Detection Methodology in High Dimensions | Unknown | N/A | |
| Online Mechanism Design for Information Acquisition | Unknown | N/A | |
| Optimal Stochastic Non-smooth Non-convex Optimization through Online-to-Non-convex Conversion | Unknown | N/A | |
| Directed Chain Generative Adversarial Networks | Unknown | N/A | |
| Layered State Discovery for Incremental Autonomous Exploration | Unknown | N/A | |
| Bayes-optimal Learning of Deep Random Networks of Extensive-width | Unknown | N/A | |
| CataBEEM: Integrating Latent Interaction Categories in Node-wise Community Detection Models for Network Data | Unknown | N/A | |
| Maximum Optimality Margin: A Unified Approach for Contextual Linear Programming and Inverse Linear Programming | Unknown | N/A | |
| Under-Counted Tensor Completion with Neural Incorporation of Attributes | Unknown | N/A | |
| Polyhedral Complex Extraction from ReLU Networks using Edge Subdivision | Unknown | N/A | |
| Nearly-Linear Time and Streaming Algorithms for Outlier-Robust PCA | Unknown | N/A | |
| A Closer Look at Few-shot Classification Again | Unknown | N/A | |
| QuantumDARTS: Differentiable Quantum Architecture Search for Variational Quantum Algorithms | Unknown | N/A | |
| Sequential Multi-Dimensional Self-Supervised Learning for Clinical Time Series | Unknown | N/A | |
| X-Paste: Revisiting Scalable Copy-Paste for Instance Segmentation using CLIP and StableDiffusion | Unknown | N/A | |
| GibbsDDRM: A Partially Collapsed Gibbs Sampler for Solving Blind Inverse Problems with Denoising Diffusion Restoration | Unknown | N/A | |
| Quantum 3D Graph Learning with Applications to Molecule Embedding | Unknown | N/A | |
| Submodular Order Functions and Assortment Optimization | Unknown | N/A | |
| Go Beyond Imagination: Maximizing Episodic Reachability with World Models | Unknown | N/A | |
| Fascinating Supervisory Signals and Where to Find Them: Deep Anomaly Detection with Scale Learning | Unknown | N/A | |
| Bit Allocation using Optimization | Unknown | N/A | |
| Interventional Causal Representation Learning | Unknown | N/A | |
| Convergence of Proximal Point and Extragradient-Based Methods Beyond Monotonicity: the Case of Negative Comonotonicity | Unknown | N/A | |
| Tied-Augment: Controlling Representation Similarity Improves Data Augmentation | Unknown | N/A | |
| Mimetic Initialization of Self-Attention Layers | Unknown | N/A | |
| Text-To-4D Dynamic Scene Generation | Unknown | N/A | |
| Towards Quantum Machine Learning for Constrained Combinatorial Optimization: a Quantum QAP Solver | Unknown | N/A | |
| Minimalistic Predictions to Schedule Jobs with Online Precedence Constraints | Unknown | N/A | |
| Convex Geometry of ReLU-layers, Injectivity on the Ball and Local Reconstruction | Unknown | N/A | |
| Robust Consensus in Ranking Data Analysis: Definitions, Properties and Computational Issues | Unknown | N/A | |
| Subset Selection Based On Multiple Rankings in the Presence of Bias: Effectiveness of Fairness Constraints for Multiwinner Voting Score Functions | Unknown | N/A | |
| Phase-aware Adversarial Defense for Improving Adversarial Robustness | Unknown | N/A | |
| Bayesian Design Principles for Frequentist Sequential Learning | Unknown | N/A | |
| Weighted Tallying Bandits: Overcoming Intractability via Repeated Exposure Optimality | Unknown | N/A | |
| Language Instructed Reinforcement Learning for Human-AI Coordination | Unknown | N/A | |
| Generated Graph Detection | Unknown | N/A | |
| The Catalog Problem: Clustering and Ordering Variable-Sized Sets | Unknown | N/A | |
| Adaptive Smoothing Gradient Learning for Spiking Neural Networks | Unknown | N/A | |
| A Theoretical Analysis of the Learning Dynamics under Class Imbalance | Unknown | N/A | |
| PCA-based Multi-Task Learning: a Random Matrix Approach | Unknown | N/A | |
| CoCo: A Coupled Contrastive Framework for Unsupervised Domain Adaptive Graph Classification | Unknown | N/A | |
| Efficient preconditioned stochastic gradient descent for estimation in latent variable models | Unknown | N/A | |
| Multi-Layer Neural Networks as Trainable Ladders of Hilbert Spaces | Unknown | N/A | |
| Lower Bounds for Learning in Revealing POMDPs | Unknown | N/A | |
| On the Relationship Between Explanation and Prediction: A Causal View | Unknown | N/A | |
| Revisiting Domain Randomization via Relaxed State-Adversarial Policy Optimization | Unknown | N/A | |
| SemSup-XC: Semantic Supervision for Zero and Few-shot Extreme Classification | Unknown | N/A | |
| Same Pre-training Loss, Better Downstream: Implicit Bias Matters for Language Models | Unknown | N/A | |
| Convergence of First-Order Methods for Constrained Nonconvex Optimization with Dependent Data | Unknown | N/A | |
| Expected Gradients of Maxout Networks and Consequences to Parameter Initialization | Unknown | N/A | |
| Task-Specific Skill Localization in Fine-tuned Language Models | Unknown | N/A | |
| Facial Expression Recognition with Adaptive Frame Rate based on Multiple Testing Correction | Unknown | N/A | |
| Efficient Approximations of Complete Interatomic Potentials for Crystal Property Prediction | Unknown | N/A | |
| Integrating Prior Knowledge in Contrastive Learning with Kernel | Unknown | N/A | |
| Polynomial Preconditioning for Gradient Methods | Unknown | N/A | |
| From Robustness to Privacy and Back | Unknown | N/A | |
| Fast as CHITA: Neural Network Pruning with Combinatorial Optimization | Unknown | N/A | |
| Private Federated Learning with Autotuned Compression | Unknown | N/A | |
| Proper Scoring Rules for Survival Analysis | Unknown | N/A | |
| CRISP: Curriculum based Sequential neural decoders for Polar code family | Unknown | N/A | |
| FLEX: an Adaptive Exploration Algorithm for Nonlinear Systems | Unknown | N/A | |
| Parallel $Q$-Learning: Scaling Off-policy Reinforcement Learning under Massively Parallel Simulation | Unknown | N/A | |
| Graphically Structured Diffusion Models | Unknown | N/A | |
| On Many-Actions Policy Gradient | Unknown | N/A | |
| Monge, Bregman and Occam: Interpretable Optimal Transport in High-Dimensions with Feature-Sparse Maps | Unknown | N/A | |
| The Statistical Scope of Multicalibration | Unknown | N/A | |
| Hardware-Aware Compression with Random Operation Access Specific Tile (ROAST) Hashing | Unknown | N/A | |
| Ske2Grid: Skeleton-to-Grid Representation Learning for Action Recognition | Unknown | N/A | |
| Stabilizing GANs' Training with Brownian Motion Controller | Unknown | N/A | |
| Featured Graph Coarsening with Similarity Guarantees | Unknown | N/A | |
| Biases in Evaluation of Molecular Optimization Methods and Bias Reduction Strategies | Unknown | N/A | |
| Fighting Fire with Fire: Contrastive Debiasing without Bias-free Data via Generative Bias-transformation | Unknown | N/A | |
| Improved Online Learning Algorithms for CTR Prediction in Ad Auctions | Unknown | N/A | |
| Improved Policy Evaluation for Randomized Trials of Algorithmic Resource Allocation | Unknown | N/A | |
| Width and Depth Limits Commute in Residual Networks | Unknown | N/A | |
| Continual Learners are Incremental Model Generalizers | Unknown | N/A | |
| Online Platt Scaling with Calibeating | Unknown | N/A | |
| SpENCNN: Orchestrating Encoding and Sparsity for Fast Homomorphically Encrypted Neural Network Inference | Unknown | N/A | |
| Data Feedback Loops: Model-driven Amplification of Dataset Biases | Unknown | N/A | |
| Variational Autoencoding Neural Operators | Unknown | N/A | |
| Meta-Learning the Inductive Bias of Simple Neural Circuits | Unknown | N/A | |
| Cyclic Block Coordinate Descent With Variance Reduction for Composite Nonconvex Optimization | Unknown | N/A | |
| Predictive Flows for Faster Ford-Fulkerson | Unknown | N/A | |
| Neural Latent Aligner: Cross-trial Alignment for Learning Representations of Complex, Naturalistic Neural Data | Unknown | N/A | |
| Can Large Language Models Reason about Program Invariants? | Unknown | N/A | |
| Provable Reset-free Reinforcement Learning by No-Regret Reduction | Unknown | N/A | |
| On the Impact of Algorithmic Recourse on Social Segregation | Unknown | N/A | |
| Understanding and Generalizing Contrastive Learning from the Inverse Optimal Transport Perspective | Unknown | N/A | |
| Dropout Reduces Underfitting | Unknown | N/A | |
| Towards a Persistence Diagram that is Robust to Noise and Varied Densities | Unknown | N/A | |
| Conformal Prediction with Missing Values | Unknown | N/A | |
| Diffusion Models are Minimax Optimal Distribution Estimators | Unknown | N/A | |
| DualHSIC: HSIC-Bottleneck and Alignment for Continual Learning | Unknown | N/A | |
| MODeL: Memory Optimizations for Deep Learning | Unknown | N/A | |
| Optimal Sets and Solution Paths of ReLU Networks | Unknown | N/A | |
| Traversing Between Modes in Function Space for Fast Ensembling | Unknown | N/A | |
| Fast Rates in Time-Varying Strongly Monotone Games | Unknown | N/A | |
| Tighter Information-Theoretic Generalization Bounds from Supersamples | Unknown | N/A | |
| Learning in POMDPs is Sample-Efficient with Hindsight Observability | Unknown | N/A | |
| Reduce, Reuse, Recycle: Compositional Generation with Energy-Based Diffusion Models and MCMC | Unknown | N/A | |
| Atari-5: Distilling the Arcade Learning Environment down to Five Games | Unknown | N/A | |
| Data-Driven Subgroup Identification for Linear Regression | Unknown | N/A | |
| Supported Trust Region Optimization for Offline Reinforcement Learning | Unknown | N/A | |
| Multi-Objective Population Based Training | Unknown | N/A | |
| Differentially Private Sharpness-Aware Training | Unknown | N/A | |
| Nonparametric Extensions of Randomized Response for Private Confidence Sets | Unknown | N/A | |
| Statistical Learning under Heterogeneous Distribution Shift | Unknown | N/A | |
| Spatial-Temporal Graph Learning with Adversarial Contrastive Adaptation | Unknown | N/A | |
| Lookahead When It Matters: Adaptive Non-causal Transformers for Streaming Neural Transducers | Unknown | N/A | |
| Adaptive Coordination in Social Embodied Rearrangement | Unknown | N/A | |
| Mean-field Analysis of Piecewise Linear Solutions for Wide ReLU Networks | Unknown | N/A | |
| MonoFlow: Rethinking Divergence GANs via the Perspective of Wasserstein Gradient Flows | Unknown | N/A | |
| Fundamental Limits of Two-layer Autoencoders, and Achieving Them with Gradient Methods | Unknown | N/A | |
| Bigger, Better, Faster: Human-level Atari with human-level efficiency | Unknown | N/A | |
| PLay: Parametrically Conditioned Layout Generation using Latent Diffusion | Unknown | N/A | |
| Repository-Level Prompt Generation for Large Language Models of Code | Unknown | N/A | |
| On Pre-Training for Visuo-Motor Control: Revisiting a Learning-from-Scratch Baseline | Unknown | N/A | |
| Bandits with Knapsacks: Advice on Time-Varying Demands | Unknown | N/A | |
| Does Sparsity Help in Learning Misspecified Linear Bandits? | Unknown | N/A | |
| The SSL Interplay: Augmentations, Inductive Bias, and Generalization | Unknown | N/A | |
| Pix2Struct: Screenshot Parsing as Pretraining for Visual Language Understanding | Unknown | N/A | |
| Improved Algorithms for White-Box Adversarial Streams | Unknown | N/A | |
| GREAD: Graph Neural Reaction-Diffusion Networks | Unknown | N/A | |
| Structural Re-weighting Improves Graph Domain Adaptation | Unknown | N/A | |
| One-Step Estimator for Permuted Sparse Recovery | Unknown | N/A | |
| Preprocessors Matter! Realistic Decision-Based Attacks on Machine Learning Systems | Unknown | N/A | |
| PAC Generalization via Invariant Representations | Unknown | N/A | |
| Bayesian Neural Networks Avoid Encoding Complex and Perturbation-Sensitive Concepts | Unknown | N/A | |
| Adaptive IMLE for Few-shot Pretraining-free Generative Modelling | Unknown | N/A | |
| Polynomial Time and Private Learning of Unbounded Gaussian Mixture Models | Unknown | N/A | |
| On the Interplay Between Misspecification and Sub-optimality Gap in Linear Contextual Bandits | Unknown | N/A | |
| On Excess Mass Behavior in Gaussian Mixture Models with Orlicz-Wasserstein Distances | Unknown | N/A | |
| Toward Large Kernel Models | Unknown | N/A | |
| Optimal Horizon-Free Reward-Free Exploration for Linear Mixture MDPs | Unknown | N/A | |
| Mirror Sinkhorn: Fast Online Optimization on Transport Polytopes | Unknown | N/A | |
| Efficient Online Reinforcement Learning with Offline Data | Unknown | N/A | |
| Self-Repellent Random Walks on General Graphs - Achieving Minimal Sampling Variance via Nonlinear Markov Chains | Unknown | N/A | |
| Moccasin: Efficient Tensor Rematerialization for Neural Networks | Unknown | N/A | |
| On the Global Convergence of Risk-Averse Policy Gradient Methods with Expected Conditional Risk Measures | Unknown | N/A | |
| "Why did the Model Fail?": Attributing Model Performance Changes to Distribution Shifts | Unknown | N/A | |
| Wrapped Cauchy Distributed Angular Softmax for Long-Tailed Visual Recognition | Unknown | N/A | |
| Change is Hard: A Closer Look at Subpopulation Shift | Unknown | N/A | |
| Achieving Hierarchy-Free Approximation for Bilevel Programs with Equilibrium Constraints | Unknown | N/A | |
| Analyzing Privacy Leakage in Machine Learning via Multiple Hypothesis Testing: A Lesson From Fano | Unknown | N/A | |
| Cocktail Party Attack: Breaking Aggregation-Based Privacy in Federated Learning Using Independent Component Analysis | Unknown | N/A | |
| On Penalty-based Bilevel Gradient Descent Method | Unknown | N/A | |
| Fairness in Matching under Uncertainty | Unknown | N/A | |
| Meta Learning of Interface Conditions for Multi-Domain Physics-Informed Neural Networks | Unknown | N/A | |
| Entity Divider with Language Grounding in Multi-Agent Reinforcement Learning | Unknown | N/A | |
| Improving Hyperparameter Learning under Approximate Inference in Gaussian Process Models | Unknown | N/A | |
| On Regularization and Inference with Label Constraints | Unknown | N/A | |
| Why does Throwing Away Data Improve Worst-Group Error? | Unknown | N/A | |
| Unsupervised Skill Discovery for Learning Shared Structures across Changing Environments | Unknown | N/A | |
| Leveraging Proxy of Training Data for Test-Time Adaptation | Unknown | N/A | |
| Generating Language Corrections for Teaching Physical Control Tasks | Unknown | N/A | |
| Predictable MDP Abstraction for Unsupervised Model-Based RL | Unknown | N/A | |
| Variance Control for Distributional Reinforcement Learning | Unknown | N/A | |
| Anti-Exploration by Random Network Distillation | Unknown | N/A | |
| Revisiting Bellman Errors for Offline Model Selection | Unknown | N/A | |
| Robust Weight Signatures: Gaining Robustness as Easy as Patching Weights? | Unknown | N/A | |
| Interactive Object Placement with Reinforcement Learning | Unknown | N/A | |
| Compressed Decentralized Proximal Stochastic Gradient Method for Nonconvex Composite Problems with Heterogeneous Data | Unknown | N/A | |
| GC-Flow: A Graph-Based Flow Network for Effective Clustering | Unknown | N/A | |
| Warm-Start Actor-Critic: From Approximation Error to Sub-optimality Gap | Unknown | N/A | |
| Coordinate Descent Methods for Fractional Minimization | Unknown | N/A | |
| Neural Continuous-Discrete State Space Models for Irregularly-Sampled Time Series | Unknown | N/A | |
| Network Effects in Performative Prediction Games | Unknown | N/A | |
| On Enhancing Expressive Power via Compositions of Single Fixed-Size ReLU Network | Unknown | N/A | |
| Best Arm Identification in Multi-Agent Multi-Armed Bandits | Unknown | N/A | |
| AudioLDM: Text-to-Audio Generation with Latent Diffusion Models | Unknown | N/A | |
| Approximation Algorithms for Fair Range Clustering | Unknown | N/A | |
| Probabilistic Attention-to-Influence Neural Models for Event Sequences | Unknown | N/A | |
| RankMe: Assessing the Downstream Performance of Pretrained Self-Supervised Representations by Their Rank | Unknown | N/A | |
| How Many Perturbations Break This Model? Evaluating Robustness Beyond Adversarial Accuracy | Unknown | N/A | |
| Improving Fair Training under Correlation Shifts | Unknown | N/A | |
| ACAT: Adversarial Counterfactual Attention for Classification and Detection in Medical Imaging | Unknown | N/A | |
| Representation Learning with Multi-Step Inverse Kinematics: An Efficient and Optimal Approach to Rich-Observation RL | Unknown | N/A | |
| End-to-End Multi-Object Detection with a Regularized Mixture Model | Unknown | N/A | |
| Confidence and Dispersity Speak: Characterizing Prediction Matrix for Unsupervised Accuracy Estimation | Unknown | N/A | |
| Towards Explaining Distribution Shifts | Unknown | N/A | |
| RSC: Accelerate Graph Neural Networks Training via Randomized Sparse Computations | Unknown | N/A | |
| Delving into Noisy Label Detection with Clean Data | Unknown | N/A | |
| Smooth Non-stationary Bandits | Unknown | N/A | |
| On Kinetic Optimal Probability Paths for Generative Models | Unknown | N/A | |
| Multi-Fidelity Covariance Estimation in the Log-Euclidean Geometry | Unknown | N/A | |
| Controlling Type Confounding in Ad Hoc Teamwork with Instance-wise Teammate Feedback Rectification | Unknown | N/A | |
| Restoration based Generative Models | Unknown | N/A | |
| MAGANet: Achieving Combinatorial Generalization by Modeling a Group Action | Unknown | N/A | |
| Feature Programming for Multivariate Time Series Prediction | Unknown | N/A | |
| Reliable Measures of Spread in High Dimensional Latent Spaces | Unknown | N/A | |
| Bayesian Estimation of Differential Privacy | Unknown | N/A | |
| Learning useful representations for shifting tasks and distributions | Unknown | N/A | |
| Toward Efficient Gradient-Based Value Estimation | Unknown | N/A | |
| All in a Row: Compressed Convolution Networks for Graphs | Unknown | N/A | |
| Dynamics-inspired Neuromorphic Visual Representation Learning | Unknown | N/A | |
| Stable and Consistent Prediction of 3D Characteristic Orientation via Invariant Residual Learning | Unknown | N/A | |
| Neural networks trained with SGD learn distributions of increasing complexity | Unknown | N/A | |
| Rethinking Weak Supervision in Helping Contrastive Learning | Unknown | N/A | |
| Abstracting Imperfect Information Away from Two-Player Zero-Sum Games | Unknown | N/A | |
| Learning Intuitive Policies Using Action Features | Unknown | N/A | |
| DecompDiff: Diffusion Models with Decomposed Priors for Structure-Based Drug Design | Unknown | N/A | |
| Sharper Bounds for $\ell_p$ Sensitivity Sampling | Unknown | N/A | |
| Gaussian Process Priors for Systems of Linear Partial Differential Equations with Constant Coefficients | Unknown | N/A | |
| Formalizing Preferences Over Runtime Distributions | Unknown | N/A | |
| Effective Minkowski Dimension of Deep Nonparametric Regression: Function Approximation and Statistical Theories | Unknown | N/A | |
| Pricing Experimental Design: Causal Effect, Expected Revenue and Tail Risk | Unknown | N/A | |
| Differentiable and Transportable Structure Learning | Unknown | N/A | |
| Proximal Causal Learning of Conditional Average Treatment Effects | Unknown | N/A | |
| Score Approximation, Estimation and Distribution Recovery of Diffusion Models on Low-Dimensional Data | Unknown | N/A | |
| Weakly Supervised Regression with Interval Targets | Unknown | N/A | |
| Deep Latent State Space Models for Time-Series Generation | Unknown | N/A | |
| Implicit Neural Spatial Representations for Time-dependent PDEs | Unknown | N/A | |
| Improving Bi-level Optimization Based Methods with Inspiration from Humans' Classroom Study Techniques | Unknown | N/A | |
| DiscoBAX - Discovery of optimal intervention sets in genomic experiment design | Unknown | N/A | |
| Sample Complexity of Probability Divergences under Group Symmetry | Unknown | N/A | |
| Learning Instance-Specific Augmentations by Capturing Local Invariances | Unknown | N/A | |
| HarsanyiNet: Computing Accurate Shapley Values in a Single Forward Propagation | Unknown | N/A | |
| Learning Temporally Abstract World Models without Online Experimentation | Unknown | N/A | |
| Input uncertainty propagation through trained neural networks | Unknown | N/A | |
| Sequential Counterfactual Risk Minimization | Unknown | N/A | |
| Applied Online Algorithms with Heterogeneous Predictors | Unknown | N/A | |
| Omnipredictors for Constrained Optimization | Unknown | N/A | |
| Semi Bandit dynamics in Congestion Games: Convergence to Nash Equilibrium and No-Regret Guarantees. | Unknown | N/A | |
| What can online reinforcement learning with function approximation benefit from general coverage conditions? | Unknown | N/A | |
| An Effective Meaningful Way to Evaluate Survival Models | Unknown | N/A | |
| The Dormant Neuron Phenomenon in Deep Reinforcement Learning | Unknown | N/A | |
| On the Global Convergence of Fitted Q-Iteration with Two-layer Neural Network Parametrization | Unknown | N/A | |
| Counterfactual Analysis in Dynamic Latent State Models | Unknown | N/A | |
| On Bridging the Gap between Mean Field and Finite Width Deep Random Multilayer Perceptron with Batch Normalization | Unknown | N/A | |
| Fully-Adaptive Composition in Differential Privacy | Unknown | N/A | |
| Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning | Unknown | N/A | |
| Settling the Reward Hypothesis | Unknown | N/A | |
| Learning Lightweight Object Detectors via Multi-Teacher Progressive Distillation | Unknown | N/A | |
| Representations and Exploration for Deep Reinforcement Learning using Singular Value Decomposition | Unknown | N/A | |
| TAN Without a Burn: Scaling Laws of DP-SGD | Unknown | N/A | |
| Quantile Credit Assignment | Unknown | N/A | |
| The Benefits of Model-Based Generalization in Reinforcement Learning | Unknown | N/A | |
| SpeedDETR: Speed-aware Transformers for End-to-end Object Detection | Unknown | N/A | |
| Provably and Practically Efficient Neural Contextual Bandits | Unknown | N/A | |
| Quantum Ridgelet Transform: Winning Lottery Ticket of Neural Networks with Quantum Computation | Unknown | N/A | |
| On the Power of Pre-training for Generalization in RL: Provable Benefits and Hardness | Unknown | N/A | |
| Understanding Incremental Learning of Gradient Descent: A Fine-grained Analysis of Matrix Sensing | Unknown | N/A | |
| Distributed Linear Bandits under Communication Constraints | Unknown | N/A | |
| A Unifying Framework to the Analysis of Interaction Methods using Synergy Functions | Unknown | N/A | |
| Sequential Kernelized Independence Testing | Unknown | N/A | |
| Sequential Strategic Screening | Unknown | N/A | |
| Automated Search for Conjectures on Mathematical Constants using Analysis of Integer Sequences | Unknown | N/A | |
| Provable Multi-instance Deep AUC Maximization with Stochastic Pooling | Unknown | N/A | |
| TIDE: Time Derivative Diffusion for Deep Learning on Graphs | Unknown | N/A | |
| Geometric Latent Diffusion Models for 3D Molecule Generation | Unknown | N/A | |
| On the Statistical Benefits of Temporal Difference Learning | Unknown | N/A | |
| Information-Theoretic State Space Model for Multi-View Reinforcement Learning | Unknown | N/A | |
| Continual Vision-Language Representation Learning with Off-Diagonal Information | Unknown | N/A | |
| Private Statistical Estimation of Many Quantiles | Unknown | N/A | |
| AbODE: Ab initio antibody design using conjoined ODEs | Unknown | N/A | |
| Trustworthy Policy Learning under the Counterfactual No-Harm Criterion | Unknown | N/A | |
| Propensity Matters: Measuring and Enhancing Balancing for Recommendation | Unknown | N/A | |
| Improving Graph Generation by Restricting Graph Bandwidth | Unknown | N/A | |
| Solving Linear Programs with Fast Online Learning Algorithms | Unknown | N/A | |
| LESS-VFL: Communication-Efficient Feature Selection for Vertical Federated Learning | Unknown | N/A | |
| Robust Collaborative Learning with Linear Gradient Overhead | Unknown | N/A | |
| Towards Understanding and Improving GFlowNet Training | Unknown | N/A | |
| MALTS: Matching After Learning to Stretch | Unknown | N/A | |
| PINA: Leveraging Side Information in eXtreme Multi-label Classification via Predicted Instance Neighborhood Aggregation | Unknown | N/A | |
| Efficient Training of Language Models using Few-Shot Learning | Unknown | N/A | |
| A Universal Unbiased Method for Classification from Aggregate Observations | Unknown | N/A | |
| On the Convergence of SARSA with Linear Function Approximation | Unknown | N/A | |
| Mitigating Memorization of Noisy Labels by Clipping the Model Prediction | Unknown | N/A | |
| PAC-Bayesian Generalization Bounds for Adversarial Generative Models | Unknown | N/A | |
| Fairness in Streaming Submodular Maximization over a Matroid Constraint | Unknown | N/A | |
| Optimal randomized multilevel Monte Carlo for repeatedly nested expectations | Unknown | N/A | |
| PAC Prediction Sets for Large Language Models of Code | Unknown | N/A | |
| Scalable Adaptive Computation for Iterative Generation | Unknown | N/A | |
| ConCerNet: A Contrastive Learning Based Framework for Automated Conservation Law Discovery and Trustworthy Dynamical System Prediction | Unknown | N/A | |
| Sequential Changepoint Detection via Backward Confidence Sequences | Unknown | N/A | |
| Architecture-Agnostic Masked Image Modeling -- From ViT back to CNN | Unknown | N/A | |
| Distribution-dependent McDiarmid-type Inequalities for Functions of Unbounded Interaction | Unknown | N/A | |
| On the Privacy-Robustness-Utility Trilemma in Distributed Learning | Unknown | N/A | |
| Identifiability of Label Noise Transition Matrix | Unknown | N/A | |
| Model Transferability with Responsive Decision Subjects | Unknown | N/A | |
| Intrinsic Sliced Wasserstein Distances for Comparing Collections of Probability Distributions on Manifolds and Graphs | Unknown | N/A | |
| Concurrent Shuffle Differential Privacy Under Continual Observation | Unknown | N/A | |
| Weak Proxies are Sufficient and Preferable for Fairness with Missing Sensitive Attributes | Unknown | N/A | |
| Efficient displacement convex optimization with particle gradient descent | Unknown | N/A | |
| PPG Reloaded: An Empirical Study on What Matters in Phasic Policy Gradient | Unknown | N/A | |
| Collaborative Causal Inference with Fair Incentives | Unknown | N/A | |
| Fair yet Asymptotically Equal Collaborative Learning | Unknown | N/A | |
| Global Context Vision Transformers | Unknown | N/A | |
| Distortion and Uncertainty Aware Loss for Panoramic Depth Completion | Unknown | N/A | |
| A Kernel-Based View of Language Model Fine-Tuning | Unknown | N/A | |
| From Perception to Programs: Regularize, Overparameterize, and Amortize | Unknown | N/A | |
| SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models | Unknown | N/A | |
| Geometric Clifford Algebra Networks | Unknown | N/A | |
| Efficiently predicting high resolution mass spectra with graph neural networks | Unknown | N/A | |
| Learning Mixtures of Markov Chains and MDPs | Unknown | N/A | |
| Generalized Implicit Follow-The-Regularized-Leader | Unknown | N/A | |
| Spurious Valleys and Clustering Behavior of Neural Networks | Unknown | N/A | |
| Dynamic Regularized Sharpness Aware Minimization in Federated Learning: Approaching Global Consistency and Smooth Landscape | Unknown | N/A | |
| A Robust Test for the Stationarity Assumption in Sequential Decision Making | Unknown | N/A | |
| Towards a better understanding of representation dynamics under TD-learning | Unknown | N/A | |
| Efficient Rate Optimal Regret for Adversarial Contextual MDPs Using Online Function Approximation | Unknown | N/A | |
| Rethinking Explaining Graph Neural Networks via Non-parametric Subgraph Matching | Unknown | N/A | |
| Label Distributionally Robust Losses for Multi-class Classification: Consistency, Robustness and Adaptivity | Unknown | N/A | |
| Explainable Data-Driven Optimization: From Context to Decision and Back Again | Unknown | N/A | |
| Why Random Pruning Is All We Need to Start Sparse | Unknown | N/A | |
| Direct Parameterization of Lipschitz-Bounded Deep Networks | Unknown | N/A | |
| FREDIS: A Fusion Framework of Refinement and Disambiguation for Unreliable Partial Label Learning | Unknown | N/A | |
| Generalization Analysis for Contrastive Representation Learning | Unknown | N/A | |
| An Investigation into Pre-Training Object-Centric Representations for Reinforcement Learning | Unknown | N/A | |
| Predicting Rare Events by Shrinking Towards Proportional Odds | Unknown | N/A | |
| Online Nonstochastic Control with Adversarial and Static Constraints | Unknown | N/A | |
| Multi-task Hierarchical Adversarial Inverse Reinforcement Learning | Unknown | N/A | |
| Surface Snapping Optimization Layer for Single Image Object Shape Reconstruction | Unknown | N/A | |
| Relevant Walk Search for Explaining Graph Neural Networks | Unknown | N/A | |
| VectorMapNet: End-to-end Vectorized HD Map Learning | Unknown | N/A | |
| Trading-Off Payments and Accuracy in Online Classification with Paid Stochastic Experts | Unknown | N/A | |
| Representer Point Selection for Explaining Regularized High-dimensional Models | Unknown | N/A | |
| Estimating the Contamination Factor's Distribution in Unsupervised Anomaly Detection | Unknown | N/A | |
| Communication-Efficient Federated Hypergradient Computation via Aggregated Iterative Differentiation | Unknown | N/A | |
| Cell-Free Latent Go-Explore | Unknown | N/A | |
| Towards Understanding Generalization of Graph Neural Networks | Unknown | N/A | |
| The Implicit Regularization of Dynamical Stability in Stochastic Gradient Descent | Unknown | N/A | |
| Fair and Optimal Classification via Post-Processing | Unknown | N/A | |
| Stochastic Marginal Likelihood Gradients using Neural Tangent Kernels | Unknown | N/A | |
| Graph Neural Tangent Kernel: Convergence on Large Graphs | Unknown | N/A | |
| Tractable Control for Autoregressive Language Generation | Unknown | N/A | |
| Speed-Oblivious Online Scheduling: Knowing (Precise) Speeds is not Necessary | Unknown | N/A | |
| Graph Contrastive Backdoor Attacks | Unknown | N/A | |
| Jump-Start Reinforcement Learning | Unknown | N/A | |
| COMCAT: Towards Efficient Compression and Customization of Attention-Based Vision Models | Unknown | N/A | |
| The Impact of Exploration on Convergence and Performance of Multi-Agent Q-Learning Dynamics | Unknown | N/A | |
| Vector-Valued Control Variates | Unknown | N/A | |
| Algorithmic Collective Action in Machine Learning | Unknown | N/A | |
| Causal Structure Learning for Latent Intervened Non-stationary Data | Unknown | N/A | |
| Neural Inverse Operators for Solving PDE Inverse Problems | Unknown | N/A | |
| A Distribution Optimization Framework for Confidence Bounds of Risk Measures | Unknown | N/A | |
| Exact Inference in High-order Structured Prediction | Unknown | N/A | |
| On the Complexity of Bayesian Generalization | Unknown | N/A | |
| Attribute-Efficient PAC Learning of Low-Degree Polynomial Threshold Functions with Nasty Noise | Unknown | N/A | |
| SGD with AdaGrad Stepsizes: Full Adaptivity with High Probability to Unknown Parameters, Unbounded Gradients and Affine Variance | Unknown | N/A | |
| On the Convergence of Gradient Flow on Multi-layer Linear Models | Unknown | N/A | |
| Unscented Autoencoder | Unknown | N/A | |
| Individually Fair Learning with One-Sided Feedback | Unknown | N/A | |
| Hierarchical Grammar-Induced Geometry for Data-Efficient Molecular Property Prediction | Unknown | N/A | |
| Training Deep Surrogate Models with Large Scale Online Learning | Unknown | N/A | |
| Quantum Lower Bounds for Finding Stationary Points of Nonconvex Functions | Unknown | N/A | |
| Near-Optimal Quantum Coreset Construction Algorithms for Clustering | Unknown | N/A | |
| CSP: Self-Supervised Contrastive Spatial Pre-Training for Geospatial-Visual Representations | Unknown | N/A | |
| Differentiable Tree Operations Promote Compositional Generalization | Unknown | N/A | |
| CrossSplit: Mitigating Label Noise Memorization through Data Splitting | Unknown | N/A | |
| Generalizing Neural Wave Functions | Unknown | N/A | |
| Deep Laplacian-based Options for Temporally-Extended Exploration | Unknown | N/A | |
| Fourmer: An Efficient Global Modeling Paradigm for Image Restoration | Unknown | N/A | |
| Shapley Based Residual Decomposition for Instance Analysis | Unknown | N/A | |
| A Group Symmetric Stochastic Differential Equation Model for Molecule Multi-modal Pretraining | Unknown | N/A | |
| Reachability-Aware Laplacian Representation in Reinforcement Learning | Unknown | N/A | |
| Provably Invariant Learning without Domain Information | Unknown | N/A | |
| Improved Online Conformal Prediction via Strongly Adaptive Online Learning | Unknown | N/A | |
| Total Variation Graph Neural Networks | Unknown | N/A | |
| ReLOAD: Reinforcement Learning with Optimistic Ascent-Descent for Last-Iterate Convergence in Constrained MDPs | Unknown | N/A | |
| Hidden Symmetries of ReLU Networks | Unknown | N/A | |
| Topological Singularity Detection at Multiple Scales | Unknown | N/A | |
| Better Training of GFlowNets with Local Credit and Incomplete Trajectories | Unknown | N/A | |
| Do You Remember? Overcoming Catastrophic Forgetting for Fake Audio Detection | Unknown | N/A | |
| Symmetry-Aware Robot Design with Structured Subgroups | Unknown | N/A | |
| Automatic Intrinsic Reward Shaping for Exploration in Deep Reinforcement Learning | Unknown | N/A | |
| Provable Data Subset Selection For Efficient Neural Networks Training | Unknown | N/A | |
| GFlowOut: Dropout with Generative Flow Networks | Unknown | N/A | |
| FedCR: Personalized Federated Learning Based on Across-Client Common Representation with Conditional Mutual Information Regularization | Unknown | N/A | |
| Controllability-Aware Unsupervised Skill Discovery | Unknown | N/A | |
| ChiPFormer: Transferable Chip Placement via Offline Decision Transformer | Unknown | N/A | |
| Towards credible visual model interpretation with path attribution | Unknown | N/A | |
| Learning Signed Distance Functions from Noisy 3D Point Clouds via Noise to Noise Mapping | Unknown | N/A | |
| Towards Practical Preferential Bayesian Optimization with Skew Gaussian Processes | Unknown | N/A | |
| The Edge of Orthogonality: A Simple View of What Makes BYOL Tick | Unknown | N/A | |
| Reinforcement Learning with General Utilities: Simpler Variance Reduction and Large State-Action Space | Unknown | N/A | |
| Hyperbolic Image-text Representations | Unknown | N/A | |
| LongCoder: A Long-Range Pre-trained Language Model for Code Completion | Unknown | N/A | |
| WL meet VC | Unknown | N/A | |
| Regret-Minimizing Double Oracle for Extensive-Form Games | Unknown | N/A | |
| Adaptive Identification of Populations with Treatment Benefit in Clinical Trials: Machine Learning Challenges and Solutions | Unknown | N/A | |
| Grounding Large Language Models in Interactive Environments with Online Reinforcement Learning | Unknown | N/A | |
| Personalized Federated Learning with Inferred Collaboration Graphs | Unknown | N/A | |
| Run-off Election: Improved Provable Defense against Data Poisoning Attacks | Unknown | N/A | |
| Regret Minimization and Convergence to Equilibria in General-sum Markov Games | Unknown | N/A | |
| A Fast, Well-Founded Approximation to the Empirical Neural Tangent Kernel | Unknown | N/A | |
| Learning Expressive Priors for Generalization and Uncertainty Estimation in Neural Networks | Unknown | N/A | |
| Hyperbolic Representation Learning: Revisiting and Advancing | Unknown | N/A | |
| RACE: Improve Multi-Agent Reinforcement Learning with Representation Asymmetry and Collaborative Evolution | Unknown | N/A | |
| No One Idles: Efficient Heterogeneous Federated Learning with Parallel Edge and Server Computation | Unknown | N/A | |
| Magneto: A Foundation Transformer | Unknown | N/A | |
| Minimizing Trajectory Curvature of ODE-based Generative Models | Unknown | N/A | |
| How Jellyfish Characterise Alternating Group Equivariant Neural Networks | Unknown | N/A | |
| Hyperbolic Diffusion Embedding and Distance for Hierarchical Representation Learning | Unknown | N/A | |
| Safe Offline Reinforcement Learning with Real-Time Budget Constraints | Unknown | N/A | |
| Improved Active Multi-Task Representation Learning via Lasso | Unknown | N/A | |
| Rethinking Visual Reconstruction: Experience-Based Content Completion Guided by Visual Cues | Unknown | N/A | |
| SlotGAT: Slot-based Message Passing for Heterogeneous Graphs | Unknown | N/A | |
| Hierarchical Diffusion for Offline Decision Making | Unknown | N/A | |
| Stochastic Gradient Descent under Markovian Sampling Schemes | Unknown | N/A | |
| Mitigating the Effects of Non-Identifiability on Inference for Bayesian Neural Networks with Latent Variables | Unknown | N/A | |
| Difference of submodular minimization via DC programming | Unknown | N/A | |
| Variational Sparse Inverse Cholesky Approximation for Latent Gaussian Processes via Double Kullback-Leibler Minimization | Unknown | N/A | |
| Nonparametric Iterative Machine Teaching | Unknown | N/A | |
| A Fast Optimistic Method for Monotone Variational Inequalities | Unknown | N/A | |
| Deep Regression Unlearning | Unknown | N/A | |
| A Robust Optimisation Perspective on Counterexample-Guided Repair of Neural Networks | Unknown | N/A | |
| StyleGAN-T: Unlocking the Power of GANs for Fast Large-Scale Text-to-Image Synthesis | Unknown | N/A | |
| Decentralized Stochastic Bilevel Optimization with Improved per-Iteration Complexity | Unknown | N/A | |
| Existence and Estimation of Critical Batch Size for Training Generative Adversarial Networks with Two Time-Scale Update Rule | Unknown | N/A | |
| Neural Prediction Errors enable Analogical Visual Reasoning in Human Standard Intelligence Tests | Unknown | N/A | |
| Minimal Width for Universal Property of Deep RNN | Unknown | N/A | |
| Mixture Proportion Estimation Beyond Irreducibility | Unknown | N/A | |
| Sliced-Wasserstein on Symmetric Positive Definite Matrices for M/EEG Signals | Unknown | N/A | |
| Reinforcement Learning from Passive Data via Latent Intentions | Unknown | N/A | |
| Refined Regret for Adversarial MDPs with Linear Function Approximation | Unknown | N/A | |
| Label differential privacy and private training data release | Unknown | N/A | |
| Metagenomic Binning using Connectivity-constrained Variational Autoencoders | Unknown | N/A | |
| Automatically marginalized MCMC in probabilistic programming | Unknown | N/A | |
| Calibrating Multimodal Learning | Unknown | N/A | |
| On the Optimality of Misspecified Kernel Ridge Regression | Unknown | N/A | |
| Actor-Critic Alignment for Offline-to-Online Reinforcement Learning | Unknown | N/A | |
| Harmonic Neural Networks | Unknown | N/A | |
| Approximately Optimal Core Shapes for Tensor Decompositions | Unknown | N/A | |
| COLA: Orchestrating Error Coding and Learning for Robust Neural Network Inference Against Hardware Defects | Unknown | N/A | |
| A Flexible Diffusion Model | Unknown | N/A | |
| LazyGNN: Large-Scale Graph Neural Networks via Lazy Propagation | Unknown | N/A | |
| Generative Graph Dictionary Learning | Unknown | N/A | |
| Solving High-Dimensional PDEs with Latent Spectral Models | Unknown | N/A | |
| Distance Weighted Supervised Learning for Offline Interaction Data | Unknown | N/A | |
| Optimal Arms Identification with Knapsacks | Unknown | N/A | |
| Effective Structured Prompting by Meta-Learning and Representative Verbalizer | Unknown | N/A | |
| Markovian Gaussian Process Variational Autoencoders | Unknown | N/A | |
| Active causal structure learning with advice | Unknown | N/A | |
| New metrics and search algorithms for weighted causal DAGs | Unknown | N/A | |
| End-to-end Differentiable Clustering with Associative Memories | Unknown | N/A | |
| Trainability, Expressivity and Interpretability in Gated Neural ODEs | Unknown | N/A | |
| Differentially Private Hierarchical Clustering with Provable Approximation Guarantees | Unknown | N/A | |
| Adapting to game trees in zero-sum imperfect information games | Unknown | N/A | |
| Fast Excess Risk Rates via Offset Rademacher Complexity | Unknown | N/A | |
| Alternating Local Enumeration (TnALE): Solving Tensor Network Structure Search with Fewer Evaluations | Unknown | N/A | |
| Reward-Mixing MDPs with Few Latent Contexts are Learnable | Unknown | N/A | |
| Maximal Initial Learning Rates in Deep ReLU Networks | Unknown | N/A | |
| Improving Visual Prompt Tuning for Self-supervised Vision Transformers | Unknown | N/A | |
| One-Shot Compression of Large Edge-Exchangeable Graphs using Bits-Back Coding | Unknown | N/A | |
| Instrumental Variable Estimation of Average Partial Causal Effects | Unknown | N/A | |
| Path Neural Networks: Expressive and Accurate Graph Neural Networks | Unknown | N/A | |
| Action Matching: Learning Stochastic Dynamics from Samples | Unknown | N/A | |
| How to address monotonicity for model risk management? | Unknown | N/A | |
| IncDSI: Incrementally Updatable Document Retrieval | Unknown | N/A | |
| Computational Asymmetries in Robust Classification | Unknown | N/A | |
| Instant Soup: Cheap Pruning Ensembles in A Single Pass Can Draw Lottery Tickets from Large Models | Unknown | N/A | |
| GNOT: A General Neural Operator Transformer for Operator Learning | Unknown | N/A | |
| NUNO: A General Framework for Learning Parametric PDEs with Non-Uniform Data | Unknown | N/A | |
| Robust Subtask Learning for Compositional Generalization | Unknown | N/A | |
| One Transformer Fits All Distributions in Multi-Modal Diffusion at Scale | Unknown | N/A | |
| DRew: Dynamically Rewired Message Passing with Delay | Unknown | N/A | |
| Graph Ladling: Shockingly Simple Parallel GNN Training without Intermediate Communication | Unknown | N/A | |
| Are Large Kernels Better Teachers than Transformers for ConvNets? | Unknown | N/A | |
| GLOBE-CE: A Translation Based Approach for Global Counterfactual Explanations | Unknown | N/A | |
| The case for 4-bit precision: k-bit Inference Scaling Laws | Unknown | N/A | |
| SWARM Parallelism: Training Large Models Can Be Surprisingly Communication-Efficient | Unknown | N/A | |
| Generating Private Synthetic Data with Genetic Algorithms | Unknown | N/A | |
| The Flan Collection: Designing Data and Methods for Effective Instruction Tuning | Unknown | N/A | |
| Causal Strategic Classification: A Tale of Two Shifts | Unknown | N/A | |
| Revisiting Over-smoothing and Over-squashing Using Ollivier-Ricci Curvature | Unknown | N/A | |
| DoMo-AC: Doubly Multi-step Off-policy Actor-Critic Algorithm | Unknown | N/A | |
| Bootstrapped Representations in Reinforcement Learning | Unknown | N/A | |
| CleanRL: High-quality Single-file Implementations of Deep Reinforcement Learning Algorithms | Unknown | N/A | |
| VA-learning as a more efficient alternative to Q-learning | Unknown | N/A | |
| Robust Non-Linear Feedback Coding via Power-Constrained Deep Learning | Unknown | N/A | |
| Compute-Efficient Deep Learning: Algorithmic Trends and Opportunities | Unknown | N/A | |
| An SDE for Modeling SAM: Theory and Insights | Unknown | N/A | |
| SeedGNN: Graph Neural Network for Supervised Seeded Graph Matching | Unknown | N/A | |
| Adaptively Weighted Data Augmentation Consistency Regularization for Robust Optimization under Concept Shift | Unknown | N/A | |
| Learning Unforeseen Robustness from Out-of-distribution Data Using Equivariant Domain Translator | Unknown | N/A | |
| Learning the Right Layers: a Data-Driven Layer-Aggregation Strategy for Semi-Supervised Learning on Multilayer Graphs | Unknown | N/A | |
| Causal Modeling of Policy Interventions From Treatment–Outcome Sequences | Unknown | N/A | |
| Surrogate Model Extension (SME): A Fast and Accurate Weight Update Attack on Federated Learning | Unknown | N/A | |
| Towards Sustainable Learning: Coresets for Data-efficient Deep Learning | Unknown | N/A | |
| Controlling Posterior Collapse by an Inverse Lipschitz Constraint on the Decoder Network | Unknown | N/A | |
| The Monge Gap: A Regularizer to Learn All Transport Maps | Unknown | N/A | |
| Tighter Lower Bounds for Shuffling SGD: Random Permutations and Beyond | Unknown | N/A | |
| Refining Generative Process with Discriminator Guidance in Score-based Diffusion Models | Unknown | N/A | |
| FlexRound: Learnable Rounding based on Element-wise Division for Post-Training Quantization | Unknown | N/A | |
| Hierarchical Programmatic Reinforcement Learning via Learning to Compose Programs | Unknown | N/A | |
| Marginalization is not Marginal: No Bad VAE Local Minima when Learning Optimal Sparse Representations | Unknown | N/A | |
| Extrapolative Controlled Sequence Generation via Iterative Refinement | Unknown | N/A | |
| Sampling-based Nyström Approximation and Kernel Quadrature | Unknown | N/A | |
| Global Optimization with Parametric Function Approximation | Unknown | N/A | |
| Towards Constituting Mathematical Structures for Learning to Optimize | Unknown | N/A | |
| On the Initialization of Graph Neural Networks | Unknown | N/A | |
| From Hypergraph Energy Functions to Hypergraph Neural Networks | Unknown | N/A | |
| Data Efficient Neural Scaling Law via Model Reusing | Unknown | N/A | |
| Learning to Optimize Differentiable Games | Unknown | N/A | |
| Cluster Explanation via Polyhedral Descriptions | Unknown | N/A | |
| Certifying Ensembles: A General Certification Theory with S-Lipschitzness | Unknown | N/A | |
| SurCo: Learning Linear SURrogates for COmbinatorial Nonlinear Optimization Problems | Unknown | N/A | |
| Rotation and Translation Invariant Representation Learning with Implicit Neural Representations | Unknown | N/A | |
| Searching Large Neighborhoods for Integer Linear Programs with Contrastive Learning | Unknown | N/A | |
| Non-stationary Reinforcement Learning under General Function Approximation | Unknown | N/A | |
| Mechanistic Mode Connectivity | Unknown | N/A | |
| Understanding and Defending Patched-based Adversarial Attacks for Vision Transformer | Unknown | N/A | |
| PFGM++: Unlocking the Potential of Physics-Inspired Generative Models | Unknown | N/A | |
| FusionRetro: Molecule Representation Fusion via In-Context Learning for Retrosynthetic Planning | Unknown | N/A | |
| CodeIPPrompt: Intellectual Property Infringement Assessment of Code Language Models | Unknown | N/A | |
| Differentially Private Distributed Bayesian Linear Regression with MCMC | Unknown | N/A | |
| Whose Opinions Do Language Models Reflect? | Unknown | N/A | |
| Emergence of Sparse Representations from Noise | Unknown | N/A | |
| Pareto Regret Analyses in Multi-objective Multi-armed Bandit | Unknown | N/A | |
| Transformers Learn In-Context by Gradient Descent | Unknown | N/A | |
| Which Tricks are Important for Learning to Rank? | Unknown | N/A | |
| Chemically Transferable Generative Backmapping of Coarse-Grained Proteins | Unknown | N/A | |
| Detecting Adversarial Directions in Deep Reinforcement Learning to Make Robust Decisions | Unknown | N/A | |
| Incentivizing Exploration with Linear Contexts and Combinatorial Actions | Unknown | N/A | |
| JAWS-X: Addressing Efficiency Bottlenecks of Conformal Prediction Under Standard and Feedback Covariate Shift | Unknown | N/A | |
| Multi-Epoch Matrix Factorization Mechanisms for Private Machine Learning | Unknown | N/A | |
| HOPE: High-order Graph ODE For Modeling Interacting Dynamics | Unknown | N/A | |
| Moderately Distributional Exploration for Domain Generalization | Unknown | N/A | |
| Social learning spontaneously emerges by searching optimal heuristics with deep reinforcement learning | Unknown | N/A | |
| Unlocking Slot Attention by Changing Optimal Transport Costs | Unknown | N/A | |
| Efficient Sequence Transduction by Jointly Predicting Tokens and Durations | Unknown | N/A | |
| Fast, Differentiable and Sparse Top-k: a Convex Analysis Perspective | Unknown | N/A | |
| On Heterogeneous Treatment Effects in Heterogeneous Causal Graphs | Unknown | N/A | |
| Faster Rates of Convergence to Stationary Points in Differentially Private Optimization | Unknown | N/A | |
| Adaptive Compositional Continual Meta-Learning | Unknown | N/A | |
| Learning Affinity with Hyperbolic Representation for Spatial Propagation | Unknown | N/A | |
| Dual Propagation: Accelerating Contrastive Hebbian Learning with Dyadic Neurons | Unknown | N/A | |
| Theoretical Behavior of XAI Methods in the Presence of Suppressor Variables | Unknown | N/A | |
| On the Occupancy Measure of Non-Markovian Policies in Continuous MDPs | Unknown | N/A | |
| Loss Balancing for Fair Supervised Learning | Unknown | N/A | |
| Target-Aware Generative Augmentations for Single-Shot Adaptation | Unknown | N/A | |
| Who Needs to Know? Minimal Knowledge for Optimal Coordination | Unknown | N/A | |
| Online Learning with Feedback Graphs: The True Shape of Regret | Unknown | N/A | |
| AutoCoreset: An Automatic Practical Coreset Construction Framework | Unknown | N/A | |
| Image Shortcut Squeezing: Countering Perturbative Availability Poisons with Compression | Unknown | N/A | |
| An Adaptive Entropy-Regularization Framework for Multi-Agent Reinforcement Learning | Unknown | N/A | |
| Eventual Discounting Temporal Logic Counterfactual Experience Replay | Unknown | N/A | |
| Extrapolated Random Tree for Regression | Unknown | N/A | |
| On the Identifiability and Estimation of Causal Location-Scale Noise Models | Unknown | N/A | |
| Orthogonality-Enforced Latent Space in Autoencoders: An Approach to Learning Disentangled Representations | Unknown | N/A | |
| Understanding the Impact of Adversarial Robustness on Accuracy Disparity | Unknown | N/A | |
| MABe22: A Multi-Species Multi-Task Benchmark for Learned Representations of Behavior | Unknown | N/A | |
| Pruning via Sparsity-indexed ODE: a Continuous Sparsity Viewpoint | Unknown | N/A | |
| LESSON: Learning to Integrate Exploration Strategies for Reinforcement Learning via an Option Framework | Unknown | N/A | |
| Adaptive Estimation of Graphical Models under Total Positivity | Unknown | N/A | |
| Algorithmic Stability of Heavy-Tailed SGD with General Loss Functions | Unknown | N/A | |
| Additive Causal Bandits with Unknown Graph | Unknown | N/A | |
| Multi-Task Structural Learning using Local Task Similarity induced Neuron Creation and Removal | Unknown | N/A | |
| FedVS: Straggler-Resilient and Privacy-Preserving Vertical Federated Learning for Split Models | Unknown | N/A | |
| Quantitative Universal Approximation Bounds for Deep Belief Networks | Unknown | N/A | |
| Are Neurons Actually Collapsed? On the Fine-Grained Structure in Neural Representations | Unknown | N/A | |
| Expectation-Complete Graph Representations with Homomorphisms | Unknown | N/A | |
| Group Equivariant Fourier Neural Operators for Partial Differential Equations | Unknown | N/A | |
| SGD with Large Step Sizes Learns Sparse Features | Unknown | N/A | |
| MAHALO: Unifying Offline Reinforcement Learning and Imitation Learning from Observations | Unknown | N/A | |
| STEERING : Stein Information Directed Exploration for Model-Based Reinforcement Learning | Unknown | N/A | |
| Auxiliary Learning as an Asymmetric Bargaining Game | Unknown | N/A | |
| Equivariant Architectures for Learning in Deep Weight Spaces | Unknown | N/A | |
| InGram: Inductive Knowledge Graph Embedding via Relation Graphs | Unknown | N/A | |
| CoDi: Co-evolving Contrastive Diffusion Models for Mixed-type Tabular Synthesis | Unknown | N/A | |
| Reconstructive Neuron Pruning for Backdoor Defense | Unknown | N/A | |
| Learning Deductive Reasoning from Synthetic Corpus based on Formal Logic | Unknown | N/A | |
| Unifying Molecular and Textual Representations via Multi-task Language Modelling | Unknown | N/A | |
| A Toy Model of Universality: Reverse Engineering how Networks Learn Group Operations | Unknown | N/A | |
| Revisiting Data-Free Knowledge Distillation with Poisoned Teachers | Unknown | N/A | |
| Adaptive Computation with Elastic Input Sequence | Unknown | N/A | |
| The Ideal Continual Learner: An Agent That Never Forgets | Unknown | N/A | |
| Collaborative Multi-Agent Heterogeneous Multi-Armed Bandits | Unknown | N/A | |
| Deep Anomaly Detection under Labeling Budget Constraints | Unknown | N/A | |
| Differentiable Simulations for Enhanced Sampling of Rare Events | Unknown | N/A | |
| Coupled Variational Autoencoder | Unknown | N/A | |
| On Second-Order Scoring Rules for Epistemic Uncertainty Quantification | Unknown | N/A | |
| Dynamic Constrained Submodular Optimization with Polylogarithmic Update Time | Unknown | N/A | |
| $H$-Consistency Bounds for Pairwise Misranking Loss Surrogates | Unknown | N/A | |
| Fast Algorithms for Distributed k-Clustering with Outliers | Unknown | N/A | |
| Revisiting Weighted Aggregation in Federated Learning with Neural Networks | Unknown | N/A | |
| FedBR: Improving Federated Learning on Heterogeneous Data via Local Learning Bias Reduction | Unknown | N/A | |
| When does Privileged information Explain Away Label Noise? | Unknown | N/A | |
| On Pitfalls of Test-Time Adaptation | Unknown | N/A | |
| Distributional Offline Policy Evaluation with Predictive Error Guarantees | Unknown | N/A | |
| Distilling Internet-Scale Vision-Language Models into Embodied Agents | Unknown | N/A | |
| Forward-Backward Gaussian Variational Inference via JKO in the Bures-Wasserstein Space | Unknown | N/A | |
| Text Generation with Diffusion Language Models: A Pre-training Approach with Continuous Paragraph Denoise | Unknown | N/A | |
| On the Training Instability of Shuffling SGD with Batch Normalization | Unknown | N/A | |
| Doubly Adversarial Federated Bandits | Unknown | N/A | |
| Measuring the Impact of Programming Language Distribution | Unknown | N/A | |
| Expertise Trees Resolve Knowledge Limitations in Collective Decision-Making | Unknown | N/A | |
| DADAO: Decoupled Accelerated Decentralized Asynchronous Optimization | Unknown | N/A | |
| Why Is Public Pretraining Necessary for Private Model Training? | Unknown | N/A | |
| Prototype-Sample Relation Distillation: Towards Replay-Free Continual Learning | Unknown | N/A | |
| Critical Points and Convergence Analysis of Generative Deep Linear Networks Trained with Bures-Wasserstein Loss | Unknown | N/A | |
| Detecting Adversarial Data by Probing Multiple Perturbations Using Expected Perturbation Score | Unknown | N/A | |
| Detecting Out-of-distribution Data through In-distribution Class Prior | Unknown | N/A | |
| Large Language Models Can Be Easily Distracted by Irrelevant Context | Unknown | N/A | |
| PFNs4BO: In-Context Learning for Bayesian Optimization | Unknown | N/A | |
| Meta-learning Parameterized Skills | Unknown | N/A | |
| Learning Globally Smooth Functions on Manifolds | Unknown | N/A | |
| MyoDex: A Generalizable Prior for Dexterous Manipulation | Unknown | N/A | |
| Diverse and Faithful Knowledge-Grounded Dialogue Generation via Sequential Posterior Inference | Unknown | N/A | |
| FAENet: Frame Averaging Equivariant GNN for Materials Modeling | Unknown | N/A | |
| Beyond In-Domain Scenarios: Robust Density-Aware Calibration | Unknown | N/A | |
| Learning to Decouple Complex Systems | Unknown | N/A | |
| Linkless Link Prediction via Relational Distillation | Unknown | N/A | |
| Cross-Entropy Loss Functions: Theoretical Analysis and Applications | Unknown | N/A | |
| Can Forward Gradient Match Backpropagation? | Unknown | N/A | |
| Identifying Interpretable Subspaces in Image Representations | Unknown | N/A | |
| Global Selection of Contrastive Batches via Optimization on Sample Permutations | Unknown | N/A | |
| Differentiable Multi-Target Causal Bayesian Experimental Design | Unknown | N/A | |
| Quantifying the Knowledge in GNNs for Reliable Distillation into MLPs | Unknown | N/A | |
| Bandit Online Linear Optimization with Hints and Queries | Unknown | N/A | |
| OMS-DPM: Optimizing the Model Schedule for Diffusion Probabilistic Models | Unknown | N/A | |
| LoSparse: Structured Compression of Large Language Models based on Low-Rank and Sparse Approximation | Unknown | N/A | |
| ClimaX: A foundation model for weather and climate | Unknown | N/A | |
| TR0N: Translator Networks for 0-Shot Plug-and-Play Conditional Generation | Unknown | N/A | |
| Generative Adversarial Symmetry Discovery | Unknown | N/A | |
| The Benefits of Mixup for Feature Learning | Unknown | N/A | |
| Towards Robust Graph Incremental Learning on Evolving Graphs | Unknown | N/A | |
| FedHPO-Bench: A Benchmark Suite for Federated Hyperparameter Optimization | Unknown | N/A | |
| FlexGen: High-Throughput Generative Inference of Large Language Models with a Single GPU | Unknown | N/A | |
| Learning GFlowNets From Partial Episodes For Improved Convergence And Stability | Unknown | N/A | |
| A theory of continuous generative flow networks | Unknown | N/A | |
| N$\text{A}^\text{2}$Q: Neural Attention Additive Model for Interpretable Multi-Agent Q-Learning | Unknown | N/A | |
| The Saddle-Point Method in Differential Privacy | Unknown | N/A | |
| Deja Vu: Contextual Sparsity for Efficient LLMs at Inference Time | Unknown | N/A | |
| Contrastive Energy Prediction for Exact Energy-Guided Diffusion Sampling in Offline Reinforcement Learning | Unknown | N/A | |
| Unsupervised Out-of-Distribution Detection with Diffusion Inpainting | Unknown | N/A | |
| SinFusion: Training Diffusion Models on a Single Image or Video | Unknown | N/A | |
| Simple Hardware-Efficient Long Convolutions for Sequence Modeling | Unknown | N/A | |
| LIV: Language-Image Representations and Rewards for Robotic Control | Unknown | N/A | |
| Optimal Goal-Reaching Reinforcement Learning via Quasimetric Learning | Unknown | N/A | |
| Sampling-Based Accuracy Testing of Posterior Estimators for General Inference | Unknown | N/A | |
| Leveraging Offline Data in Online Reinforcement Learning | Unknown | N/A | |
| Learning Noisy OR Bayesian Networks with Max-Product Belief Propagation | Unknown | N/A | |
| RLSbench: Domain Adaptation Under Relaxed Label Shift | Unknown | N/A | |
| Learning to Boost Training by Periodic Nowcasting Near Future Weights | Unknown | N/A | |
| Bayesian Reparameterization of Reward-Conditioned Reinforcement Learning with Energy-based Models | Unknown | N/A | |
| Differentially Private Optimization on Large Model at Small Cost | Unknown | N/A | |
| Do the Rewards Justify the Means? Measuring Trade-Offs Between Rewards and Ethical Behavior in the Machiavelli Benchmark | Unknown | N/A | |
| Predicting Ordinary Differential Equations with Transformers | Unknown | N/A | |
| DP-Fast MH: Private, Fast, and Accurate Metropolis-Hastings for Large-Scale Bayesian Inference | Unknown | N/A | |
| Understanding the Distillation Process from Deep Generative Models to Tractable Probabilistic Circuits | Unknown | N/A | |
| I$^2$SB: Image-to-Image Schrödinger Bridge | Unknown | N/A | |
| GFlowNet-EM for Learning Compositional Latent Variable Models | Unknown | N/A | |
| FeDXL: Provable Federated Learning for Deep X-Risk Optimization | Unknown | N/A | |
| Blockwise Stochastic Variance-Reduced Methods with Parallel Speedup for Multi-Block Bilevel Optimization | Unknown | N/A | |
| Not All Semantics are Created Equal: Contrastive Self-supervised Learning with Automatic Temperature Individualization | Unknown | N/A | |
| Text-To-Concept (and Back) via Cross-Model Alignment | Unknown | N/A | |
| Conformal Inference is (almost) Free for Neural Networks Trained with Early Stopping | Unknown | N/A | |
| Continuously Parameterized Mixture Models | Unknown | N/A | |
| FaDIn: Fast Discretized Inference for Hawkes Processes with General Parametric Kernels | Unknown | N/A | |
| Regression with Sensor Data Containing Incomplete Observations | Unknown | N/A | |
| Superhuman Fairness | Unknown | N/A | |
| Extending Kernel PCA through Dualization: Sparsity, Robustness and Fast Algorithms | Unknown | N/A | |
| PWSHAP: A Path-Wise Explanation Model for Targeted Variables | Unknown | N/A | |
| Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation | Unknown | N/A | |
| Discovering Object-Centric Generalized Value Functions From Pixels | Unknown | N/A | |
| Multi-channel Autobidding with Budget and ROI Constraints | Unknown | N/A | |
| Principled Acceleration of Iterative Numerical Methods Using Machine Learning | Unknown | N/A | |
| Beam Tree Recursive Cells | Unknown | N/A | |
| Monotonic Location Attention for Length Generalization | Unknown | N/A | |
| GraphCleaner: Detecting Mislabelled Samples in Popular Graph Learning Benchmarks | Unknown | N/A | |
| UPSCALE: Unconstrained Channel Pruning | Unknown | N/A | |
| Trompt: Towards a Better Deep Neural Network for Tabular Data | Unknown | N/A | |
| Gibbsian Polar Slice Sampling | Unknown | N/A | |
| Graph Reinforcement Learning for Network Control via Bi-Level Optimization | Unknown | N/A | |
| DS-1000: A Natural and Reliable Benchmark for Data Science Code Generation | Unknown | N/A | |
| On the Estimation of Gaussian Mixture Copula Models | Unknown | N/A | |
| Monotonicity and Double Descent in Uncertainty Estimation with Gaussian Processes | Unknown | N/A | |
| A Study of Global and Episodic Bonuses for Exploration in Contextual MDPs | Unknown | N/A | |
| Fully Bayesian Autoencoders with Latent Sparse Gaussian Processes | Unknown | N/A | |
| B-Learner: Quasi-Oracle Bounds on Heterogeneous Causal Effects Under Hidden Confounding | Unknown | N/A | |
| Efficient and Equivariant Graph Networks for Predicting Quantum Hamiltonian | Unknown | N/A | |
| On the Connection Between MPNN and Graph Transformer | Unknown | N/A | |
| Policy Gradient in Robust MDPs with Global Convergence Guarantee | Unknown | N/A | |
| Unit Scaling: Out-of-the-Box Low-Precision Training | Unknown | N/A | |
| Masked Trajectory Models for Prediction, Representation, and Control | Unknown | N/A | |
| A Three-regime Model of Network Pruning | Unknown | N/A | |
| DetectGPT: Zero-Shot Machine-Generated Text Detection using Probability Curvature | Unknown | N/A | |
| Data Structures for Density Estimation | Unknown | N/A | |
| Training Normalizing Flows from Dependent Data | Unknown | N/A | |
| Scaling Up Dataset Distillation to ImageNet-1K with Constant Memory | Unknown | N/A | |
| Overcoming Simplicity Bias in Deep Networks using a Feature Sieve | Unknown | N/A | |
| Multi-Objective GFlowNets | Unknown | N/A | |
| Discrete Key-Value Bottleneck | Unknown | N/A | |
| Hyena Hierarchy: Towards Larger Convolutional Language Models | Unknown | N/A | |
| EF21-P and Friends: Improved Theoretical Communication Complexity for Distributed Optimization with Bidirectional Compression | Unknown | N/A | |
| Tighter Analysis for ProxSkip | Unknown | N/A | |
| High-Probability Bounds for Stochastic Optimization and Variational Inequalities: the Case of Unbounded Variance | Unknown | N/A | |
| KDEformer: Accelerating Transformers via Kernel Density Estimation | Unknown | N/A | |
| Streaming Submodular Maximization with Differential Privacy | Unknown | N/A | |
| Learning Neural PDE Solvers with Parameter-Guided Channel Attention | Unknown | N/A | |
| Margin-based sampling in high dimensions: When being active is less efficient than staying passive | Unknown | N/A | |
| MonoNeRF: Learning Generalizable NeRFs from Monocular Videos without Camera Poses | Unknown | N/A | |
| Stochastic Gradient Descent-Induced Drift of Representation in a Two-Layer Neural Network | Unknown | N/A | |
| Large Language Models Struggle to Learn Long-Tail Knowledge | Unknown | N/A | |
| Sequential Predictive Conformal Inference for Time Series | Unknown | N/A | |
| BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models | Unknown | N/A | |
| Hierarchical Neural Coding for Controllable CAD Model Generation | Unknown | N/A | |
| Opponent-Limited Online Search for Imperfect Information Games | Unknown | N/A | |
| Fair Densities via Boosting the Sufficient Statistics of Exponential Families | Unknown | N/A | |
| User-defined Event Sampling and Uncertainty Quantification in Diffusion Models for Physical Dynamical Systems | Unknown | N/A | |
| Matrix Estimation for Individual Fairness | Unknown | N/A | |
| Better Diffusion Models Further Improve Adversarial Training | Unknown | N/A | |
| Beyond Exponentially Fast Mixing in Average-Reward Reinforcement Learning via Multi-Level Monte Carlo Actor-Critic | Unknown | N/A | |
| Variational Open-Domain Question Answering | Unknown | N/A | |
| Learning for Edge-Weighted Online Bipartite Matching with Robustness Guarantees | Unknown | N/A | |
| Synergies between Disentanglement and Sparsity: Generalization and Identifiability in Multi-Task Learning | Unknown | N/A | |
| Generalized Teacher Forcing for Learning Chaotic Dynamics | Unknown | N/A | |
| Structure Learning of Latent Factors via Clique Search on Correlation Thresholded Graphs | Unknown | N/A | |
| Understanding Self-Distillation in the Presence of Label Noise | Unknown | N/A | |
| Beyond Uniform Lipschitz Condition in Differentially Private Optimization | Unknown | N/A | |
| Sequential Underspecified Instrument Selection for Cause-Effect Estimation | Unknown | N/A | |
| Diffusion Based Representation Learning | Unknown | N/A | |
| Which Features are Learnt by Contrastive Learning? On the Role of Simplicity Bias in Class Collapse and Feature Suppression | Unknown | N/A | |
| Coder Reviewer Reranking for Code Generation | Unknown | N/A | |
| Data-Efficient Contrastive Self-supervised Learning: Most Beneficial Examples for Supervised Learning Contribute the Least | Unknown | N/A | |
| Mitigating Spurious Correlations in Multi-modal Models during Fine-tuning | Unknown | N/A | |
| Langevin Thompson Sampling with Logarithmic Communication: Bandits and Reinforcement Learning | Unknown | N/A | |
| Neural Collapse in Deep Linear Networks: From Balanced to Imbalanced Data | Unknown | N/A | |
| Conformalization of Sparse Generalized Linear Models | Unknown | N/A | |
| Fast Rates for Maximum Entropy Exploration | Unknown | N/A | |
| The Unreasonable Effectiveness of Few-shot Learning for Machine Translation | Unknown | N/A | |
| Scaling Laws for Multilingual Neural Machine Translation | Unknown | N/A | |
| Secure Federated Correlation Test and Entropy Estimation | Unknown | N/A | |
| Image generation with shortest path diffusion | Unknown | N/A | |
| Multiply Robust Off-policy Evaluation and Learning under Truncation by Death | Unknown | N/A | |
| On Provable Copyright Protection for Generative Models | Unknown | N/A | |
| Hardness of Independent Learning and Sparse Equilibrium Computation in Markov Games | Unknown | N/A | |
| Efficient Self-supervised Learning with Contextualized Target Representations for Vision, Speech and Language | Unknown | N/A | |
| Generative Causal Representation Learning for Out-of-Distribution Motion Forecasting | Unknown | N/A | |
| Input Perturbation Reduces Exposure Bias in Diffusion Models | Unknown | N/A | |
| Demystifying Uneven Vulnerability of Link Stealing Attacks against Graph Neural Networks | Unknown | N/A | |
| Tight Regret Bounds for Single-pass Streaming Multi-armed Bandits | Unknown | N/A | |
| Understanding Backdoor Attacks through the Adaptability Hypothesis | Unknown | N/A | |
| Learning to Maximize Mutual Information for Dynamic Feature Selection | Unknown | N/A | |
| Sketching Meets Differential Privacy: Fast Algorithm for Dynamic Kronecker Projection Maintenance | Unknown | N/A | |
| Sketching for First Order Method: Efficient Algorithm for Low-Bandwidth Channel and Vulnerability | Unknown | N/A | |
| Off-Policy Evaluation for Large Action Spaces via Conjunct Effect Modeling | Unknown | N/A | |
| Accelerated Infeasibility Detection of Constrained Optimization and Fixed-Point Iterations | Unknown | N/A | |
| Pretraining Language Models with Human Preferences | Unknown | N/A | |
| Sketch-Flip-Merge: Mergeable Sketches for Private Distinct Counting | Unknown | N/A | |
| NeRFool: Uncovering the Vulnerability of Generalizable Neural Radiance Fields against Adversarial Perturbations | Unknown | N/A | |
| Master-ASR: Achieving Multilingual Scalability and Low-Resource Adaptation in ASR with Modular Learning | Unknown | N/A | |
| Fast Online Value-Maximizing Prediction Sets with Conformal Cost Control | Unknown | N/A | |
| Unconstrained Online Learning with Unbounded Losses | Unknown | N/A | |
| Sample and Predict Your Latent: Modality-free Sequential Disentanglement via Contrastive Estimation | Unknown | N/A | |
| Improved Algorithms for Multi-period Multi-class Packing Problems with Bandit Feedback | Unknown | N/A | |
| Towards Deep Attention in Graph Neural Networks: Problems and Remedies | Unknown | N/A | |
| Adversarial Parameter Attack on Deep Neural Networks | Unknown | N/A | |
| Phase Transitions in the Detection of Correlated Databases | Unknown | N/A | |
| Understanding the Complexity Gains of Single-Task RL with a Curriculum | Unknown | N/A | |
| When Personalization Harms Performance: Reconsidering the Use of Group Attributes in Prediction | Unknown | N/A | |
| OCD: Learning to Overfit with Conditional Diffusion Models | Unknown | N/A | |
| Towards Bridging the Gaps between the Right to Explanation and the Right to be Forgotten | Unknown | N/A | |
| Neural Wave Machines: Learning Spatiotemporally Structured Representations with Locally Coupled Oscillatory Recurrent Neural Networks | Unknown | N/A | |
| Latent Traversals in Generative Models as Potential Flows | Unknown | N/A | |
| DUET: 2D Structured and Approximately Equivariant Representations | Unknown | N/A | |
| ProtST: Multi-Modality Learning of Protein Sequences and Biomedical Texts | Unknown | N/A | |
| Contextual Reliability: When Different Features Matter in Different Contexts | Unknown | N/A | |
| HETAL: Efficient Privacy-preserving Transfer Learning with Homomorphic Encryption | Unknown | N/A | |
| Hierarchical Imitation Learning with Vector Quantized Models | Unknown | N/A | |
| Using Large Language Models to Simulate Multiple Humans and Replicate Human Subject Studies | Unknown | N/A | |
| Deep linear networks can benignly overfit when shallow ones do | Unknown | N/A | |
| Policy Evaluation and Temporal-Difference Learning in Continuous Time and Space: A Martingale Approach | Unknown | N/A | |
| Multi-Agent Online Optimization with Delays: Asynchronicity, Adaptivity, and Optimism | Unknown | N/A | |
| Data-Derived Weak Universal Consistency | Unknown | N/A | |
| Existence, Stability and Scalability of Orthogonal Convolutional Neural Networks | Unknown | N/A | |
| Adversarial Classification: Necessary Conditions and Geometric Flows | Unknown | N/A | |
| On Generalizations of Some Distance Based Classifiers for HDLSS Data | Unknown | N/A | |
| Model-Bellman Inconsistency for Model-based Offline Reinforcement Learning | Unknown | N/A | |
| CLUSTSEG: Clustering for Universal Segmentation | Unknown | N/A | |
| Bi-directional Masks for Efficient N:M Sparse Training | Unknown | N/A | |
| Composer: Creative and Controllable Image Synthesis with Composable Conditions | Unknown | N/A | |
| Learning to acquire novel cognitive tasks with evolution, plasticity and meta-meta-learning | Unknown | N/A | |
| Semiparametrically Efficient Off-Policy Evaluation in Linear Markov Decision Processes | Unknown | N/A | |
| Enabling First-Order Gradient-Based Learning for Equilibrium Computation in Markets | Unknown | N/A | |
| Non-autoregressive Conditional Diffusion Models for Time Series Prediction | Unknown | N/A | |
| On the Power of Foundation Models | Unknown | N/A | |
| Neural Diffusion Processes | Unknown | N/A | |
| Building Neural Networks on Matrix Manifolds: A Gyrovector Space Approach | Unknown | N/A | |
| Contrastive Learning Meets Homophily: Two Birds with One Stone | Unknown | N/A | |
| Sharp Variance-Dependent Bounds in Reinforcement Learning: Best of Both Worlds in Stochastic and Deterministic Environments | Unknown | N/A | |
| On the Generalization of Multi-modal Contrastive Learning | Unknown | N/A | |
| Near-Minimax-Optimal Risk-Sensitive Reinforcement Learning with CVaR | Unknown | N/A | |
| ContraBAR: Contrastive Bayes-Adaptive Deep RL | Unknown | N/A | |
| Are Diffusion Models Vulnerable to Membership Inference Attacks? | Unknown | N/A | |
| Data Representations' Study of Latent Image Manifolds | Unknown | N/A | |
| Is Consensus Acceleration Possible in Decentralized Optimization over Slowly Time-Varying Networks? | Unknown | N/A | |
| Benign Overfitting in Two-layer ReLU Convolutional Neural Networks | Unknown | N/A | |
| Attention-Based Recurrence for Multi-Agent Reinforcement Learning under Stochastic Partial Observability | Unknown | N/A | |
| Second-order regression models exhibit progressive sharpening to the edge of stability | Unknown | N/A | |
| SAM operates far from home: eigenvalue regularization as a dynamical phenomenon | Unknown | N/A | |
| Subsample Ridge Ensembles: Equivalences and Generalized Cross-Validation | Unknown | N/A | |
| Policy Regularization with Dataset Constraint for Offline Reinforcement Learning | Unknown | N/A | |
| Guiding Pretraining in Reinforcement Learning with Large Language Models | Unknown | N/A | |
| A Mathematical Model for Curriculum Learning for Parities | Unknown | N/A | |
| Revisiting Gradient Clipping: Stochastic bias and tight convergence guarantees | Unknown | N/A | |
| Two Losses Are Better Than One: Faster Optimization Using a Cheaper Proxy | Unknown | N/A | |
| MolDiff: Addressing the Atom-Bond Inconsistency Problem in 3D Molecule Diffusion Generation | Unknown | N/A | |
| NNSplitter: An Active Defense Solution for DNN Model via Automated Weight Obfuscation | Unknown | N/A | |
| Learning Representations without Compositional Assumptions | Unknown | N/A | |
| Fair and Accurate Decision Making through Group-Aware Learning | Unknown | N/A | |
| CocktailSGD: Fine-tuning Foundation Models over 500Mbps Networks | Unknown | N/A | |
| Adversarial Learning of Distributional Reinforcement Learning | Unknown | N/A | |
| Online Local Differential Private Quantile Inference via Self-normalization | Unknown | N/A | |
| Horizon-free Learning for Markov Decision Processes and Games: Stochastically Bounded Rewards and Improved Bounds | Unknown | N/A | |
| Approximate Stein Classes for Truncated Density Estimation | Unknown | N/A | |
| SinDDM: A Single Image Denoising Diffusion Model | Unknown | N/A | |
| Robustly Learning a Single Neuron via Sharpness | Unknown | N/A | |
| Neural Markov Jump Processes | Unknown | N/A | |
| A Model-Based Method for Minimizing CVaR and Beyond | Unknown | N/A | |
| Revisiting Structured Variational Autoencoders | Unknown | N/A | |
| Oracles & Followers: Stackelberg Equilibria in Deep Multi-Agent Reinforcement Learning | Unknown | N/A | |
| Lottery Tickets in Evolutionary Optimization: On Sparse Backpropagation-Free Trainability | Unknown | N/A | |
ICML 2024
| Title | Author | Code URL | |
|---|---|---|---|
| Position: Towards Unified Alignment Between Agents, Humans, and Environment | Unknown | N/A | |
| Navigating Complexity: Toward Lossless Graph Condensation via Expanding Window Matching | Unknown | N/A | |
| Fair Off-Policy Learning from Observational Data | Unknown | N/A | |
| Consistent Submodular Maximization | Unknown | N/A | |
| Relaxing the Accurate Imputation Assumption in Doubly Robust Learning for Debiased Collaborative Filtering | Unknown | N/A | |
| Automated Statistical Model Discovery with Language Models | Unknown | N/A | |
| Model-based Reinforcement Learning for Confounded POMDPs | Unknown | N/A | |
| Position: A Call for Embodied AI | Unknown | N/A | |
| Chatbot Arena: An Open Platform for Evaluating LLMs by Human Preference | Unknown | N/A | |
| WARM: On the Benefits of Weight Averaged Reward Models | Unknown | N/A | |
| MusicRL: Aligning Music Generation to Human Preferences | Unknown | N/A | |
| Nash Learning from Human Feedback | Unknown | N/A | |
| LLM and Simulation as Bilevel Optimizers: A New Paradigm to Advance Physical Scientific Discovery | Unknown | N/A | |
| Kernel-Based Evaluation of Conditional Biological Sequence Models | Unknown | N/A | |
| TinyTrain: Resource-Aware Task-Adaptive Sparse Training of DNNs at the Data-Scarce Edge | Unknown | N/A | |
| Learning Associative Memories with Gradient Descent | Unknown | N/A | |
| InfoNet: Neural Estimation of Mutual Information without Test-Time Optimization | Unknown | N/A | |
| Calibration Bottleneck: Over-compressed Representations are Less Calibratable | Unknown | N/A | |
| Discovering Environments with XRM | Unknown | N/A | |
| Batch and match: black-box variational inference with a score-based divergence | Unknown | N/A | |
| Transformers are SSMs: Generalized Models and Efficient Algorithms Through Structured State Space Duality | Unknown | N/A | |
| Combinatorial Approximations for Cluster Deletion: Simpler, Faster, and Better | Unknown | N/A | |
| MVMoE: Multi-Task Vehicle Routing Solver with Mixture-of-Experts | Unknown | N/A | |
| Hybrid$^2$ Neural ODE Causal Modeling and an Application to Glycemic Response | Unknown | N/A | |
| Stereographic Spherical Sliced Wasserstein Distances | Unknown | N/A | |
| Distribution Alignment Optimization through Neural Collapse for Long-tailed Classification | Unknown | N/A | |
| How Learning by Reconstruction Produces Uninformative Features For Perception | Unknown | N/A | |
| Effects of Exponential Gaussian Distribution on (Double Sampling) Randomized Smoothing | Unknown | N/A | |
| Sparse Inducing Points in Deep Gaussian Processes: Enhancing Modeling with Denoising Diffusion Variational Inference | Unknown | N/A | |
| Fool Your (Vision and) Language Model with Embarrassingly Simple Permutations | Unknown | N/A | |
| DRED: Zero-Shot Transfer in Reinforcement Learning via Data-Regularised Environment Design | Unknown | N/A | |
| Planning, Fast and Slow: Online Reinforcement Learning with Action-Free Offline Data via Multiscale Planners | Unknown | N/A | |
| Balanced Data, Imbalanced Spectra: Unveiling Class Disparities with Spectral Imbalance | Unknown | N/A | |
| The good, the bad and the ugly sides of data augmentation: An implicit spectral regularization perspective | Unknown | N/A | |
| Active Preference Learning for Large Language Models | Unknown | N/A | |
| CodeIt: Self-Improving Language Models with Prioritized Hindsight Replay | Unknown | N/A | |
| Variational Partial Group Convolutions for Input-Aware Partial Equivariance of Rotations and Color-Shifts | Unknown | N/A | |
| Incorporating Information into Shapley Values: Reweighting via a Maximum Entropy Approach | Unknown | N/A | |
| Position: Why We Must Rethink Empirical Research in Machine Learning | Unknown | N/A | |
| HarmoDT: Harmony Multi-Task Decision Transformer for Offline Reinforcement Learning | Unknown | N/A | |
| Q-value Regularized Transformer for Offline Reinforcement Learning | Unknown | N/A | |
| Locally Estimated Global Perturbations are Better than Local Perturbations for Federated Sharpness-aware Minimization | Unknown | N/A | |
| Boundary Exploration for Bayesian Optimization With Unknown Physical Constraints | Unknown | N/A | |
| Understanding Reasoning Ability of Language Models From the Perspective of Reasoning Paths Aggregation | Unknown | N/A | |
| Peeking with PEAK: Sequential, Nonparametric Composite Hypothesis Tests for Means of Multiple Data Streams | Unknown | N/A | |
| From Vision to Audio and Beyond: A Unified Model for Audio-Visual Representation and Generation | Unknown | N/A | |
| Multi-group Learning for Hierarchical Groups | Unknown | N/A | |
| Fast Decision Boundary based Out-of-Distribution Detector | Unknown | N/A | |
| Ensemble Pruning for Out-of-distribution Generalization | Unknown | N/A | |
| Diffuse, Sample, Project: Plug-And-Play Controllable Graph Generation | Unknown | N/A | |
| On the Asymptotic Distribution of the Minimum Empirical Risk | Unknown | N/A | |
| RL-VLM-F: Reinforcement Learning from Vision Language Foundation Model Feedback | Unknown | N/A | |
| Exploiting Code Symmetries for Learning Program Semantics | Unknown | N/A | |
| Identification and Estimation for Nonignorable Missing Data: A Data Fusion Approach | Unknown | N/A | |
| Encodings for Prediction-based Neural Architecture Search | Unknown | N/A | |
| Chain of Code: Reasoning with a Language Model-Augmented Code Emulator | Unknown | N/A | |
| Towards Compositionality in Concept Learning | Unknown | N/A | |
| A Minimaximalist Approach to Reinforcement Learning from Human Feedback | Unknown | N/A | |
| Language Agent Tree Search Unifies Reasoning, Acting, and Planning in Language Models | Unknown | N/A | |
| Image Hijacks: Adversarial Images can Control Generative Models at Runtime | Unknown | N/A | |
| TENG: Time-Evolving Natural Gradient for Solving PDEs With Deep Neural Nets Toward Machine Precision | Unknown | N/A | |
| Decomposed Linear Dynamical Systems (dLDS) for learning the latent components of neural dynamics | Unknown | N/A | |
| Behavior Generation with Latent Actions | Unknown | N/A | |
| Measures of diversity and space-filling designs for categorical data | Unknown | N/A | |
| Pre-Training Protein Bi-level Representation Through Span Mask Strategy On 3D Protein Chains | Unknown | N/A | |
| QuRating: Selecting High-Quality Data for Training Language Models | Unknown | N/A | |
| On the Maximal Local Disparity of Fairness-Aware Classifiers | Unknown | N/A | |
| Conditional Common Entropy for Instrumental Variable Testing and Partial Identification | Unknown | N/A | |
| On PI Controllers for Updating Lagrange Multipliers in Constrained Optimization | Unknown | N/A | |
| Benchmarking Deletion Metrics with the Principled Explanations | Unknown | N/A | |
| Online Matrix Completion: A Collaborative Approach with Hott Items | Unknown | N/A | |
| Scaling Laws for Fine-Grained Mixture of Experts | Unknown | N/A | |
| Benign Overfitting in Adversarial Training of Neural Networks | Unknown | N/A | |
| Survival Kernets: Scalable and Interpretable Deep Kernel Survival Analysis with an Accuracy Guarantee | Unknown | N/A | |
| Sample as you Infer: Predictive Coding with Langevin Dynamics | Unknown | N/A | |
| NeWRF: A Deep Learning Framework for Wireless Radiation Field Reconstruction and Channel Prediction | Unknown | N/A | |
| Efficient PAC Learnability of Dynamical Systems Over Multilayer Networks | Unknown | N/A | |
| How Language Model Hallucinations Can Snowball | Unknown | N/A | |
| APT: Adaptive Pruning and Tuning Pretrained Language Models for Efficient Training and Inference | Unknown | N/A | |
| Extracting Training Data From Document-Based VQA Models | Unknown | N/A | |
| MindEye2: Shared-Subject Models Enable fMRI-To-Image With 1 Hour of Data | Unknown | N/A | |
| Prediction Accuracy of Learning in Games : Follow-the-Regularized-Leader meets Heisenberg | Unknown | N/A | |
| PARCv2: Physics-aware Recurrent Convolutional Neural Networks for Spatiotemporal Dynamics Modeling | Unknown | N/A | |
| Unsupervised Concept Discovery Mitigates Spurious Correlations | Unknown | N/A | |
| Scalable AI Safety via Doubly-Efficient Debate | Unknown | N/A | |
| GeoMFormer: A General Architecture for Geometric Molecular Representation Learning | Unknown | N/A | |
| Model Assessment and Selection under Temporal Distribution Shift | Unknown | N/A | |
| The Fundamental Limits of Least-Privilege Learning | Unknown | N/A | |
| Sequential Disentanglement by Extracting Static Information From A Single Sequence Element | Unknown | N/A | |
| Minimizing $f$-Divergences by Interpolating Velocity Fields | Unknown | N/A | |
| Sequential Neural Score Estimation: Likelihood-Free Inference with Conditional Score Based Diffusion Models | Unknown | N/A | |
| Bridging Mini-Batch and Asymptotic Analysis in Contrastive Learning: From InfoNCE to Kernel-Based Losses | Unknown | N/A | |
| Subgoal-based Demonstration Learning for Formal Theorem Proving | Unknown | N/A | |
| Vector Quantization Pretraining for EEG Time Series with Random Projection and Phase Alignment | Unknown | N/A | |
| Emergence of In-Context Reinforcement Learning from Noise Distillation | Unknown | N/A | |
| Any-Precision LLM: Low-Cost Deployment of Multiple, Different-Sized LLMs | Unknown | N/A | |
| Robust Universal Adversarial Perturbations | Unknown | N/A | |
| Low-Cost High-Power Membership Inference Attacks | Unknown | N/A | |
| Towards Modular LLMs by Building and Reusing a Library of LoRAs | Unknown | N/A | |
| Early Time Classification with Accumulated Accuracy Gap Control | Unknown | N/A | |
| Evaluating Instrument Validity using the Principle of Independent Mechanisms | Unknown | N/A | |
| Unified Generation, Reconstruction, and Representation: Generalized Diffusion with Adaptive Latent Encoding-Decoding | Unknown | N/A | |
| CogBench: a large language model walks into a psychology lab | Unknown | N/A | |
| Human-like Category Learning by Injecting Ecological Priors from Large Language Models into Neural Networks | Unknown | N/A | |
| ConvNet vs Transformer, Supervised vs CLIP: Beyond ImageNet Accuracy | Unknown | N/A | |
| Consistent Diffusion Meets Tweedie: Training Exact Ambient Diffusion Models with Noisy Data | Unknown | N/A | |
| Optimally Improving Cooperative Learning in a Social Setting | Unknown | N/A | |
| $\texttt{MoE-RBench}$: Towards Building Reliable Language Models with Sparse Mixture-of-Experts | Unknown | N/A | |
| Simple linear attention language models balance the recall-throughput tradeoff | Unknown | N/A | |
| Principled Preferential Bayesian Optimization | Unknown | N/A | |
| SiT: Symmetry-invariant Transformers for Generalisation in Reinforcement Learning | Unknown | N/A | |
| On Positivity Condition for Causal Inference | Unknown | N/A | |
| Dynamic Memory Compression: Retrofitting LLMs for Accelerated Inference | Unknown | N/A | |
| Linguistic Calibration of Long-Form Generations | Unknown | N/A | |
| A Unified Framework for Learning with Nonlinear Model Classes from Arbitrary Linear Samples | Unknown | N/A | |
| Slicedit: Zero-Shot Video Editing With Text-to-Image Diffusion Models Using Spatio-Temporal Slices | Unknown | N/A | |
| The Expressive Power of Path-Based Graph Neural Networks | Unknown | N/A | |
| Scalable High-Resolution Pixel-Space Image Synthesis with Hourglass Diffusion Transformers | Unknown | N/A | |
| Environment Design for Inverse Reinforcement Learning | Unknown | N/A | |
| Stable Differentiable Causal Discovery | Unknown | N/A | |
| Towards Optimal Adversarial Robust Q-learning with Bellman Infinity-error | Unknown | N/A | |
| Can We Remove the Square-Root in Adaptive Gradient Methods? A Second-Order Perspective | Unknown | N/A | |
| Noise-Aware Algorithm for Heterogeneous Differentially Private Federated Learning | Unknown | N/A | |
| A Tale of Tails: Model Collapse as a Change of Scaling Laws | Unknown | N/A | |
| Adversarial Attacks on Combinatorial Multi-Armed Bandits | Unknown | N/A | |
| Practical Hamiltonian Monte Carlo on Riemannian Manifolds via Relativity Theory | Unknown | N/A | |
| A Dynamic Algorithm for Weighted Submodular Cover Problem | Unknown | N/A | |
| On The Fairness Impacts of Hardware Selection in Machine Learning | Unknown | N/A | |
| Time-Series Forecasting for Out-of-Distribution Generalization Using Invariant Learning | Unknown | N/A | |
| Graph Attention Retrospective | Unknown | N/A | |
| A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and Toxicity | Unknown | N/A | |
| From Coarse to Fine: Enable Comprehensive Graph Self-supervised Learning with Multi-granular Semantic Ensemble | Unknown | N/A | |
| Parameterized Physics-informed Neural Networks for Parameterized PDEs | Unknown | N/A | |
| Sparser, Better, Deeper, Stronger: Improving Static Sparse Training with Exact Orthogonal Initialization | Unknown | N/A | |
| Safe Exploration in Dose Finding Clinical Trials with Heterogeneous Participants | Unknown | N/A | |
| TIC-TAC: A Framework For Improved Covariance Estimation In Deep Heteroscedastic Regression | Unknown | N/A | |
| Challenges in Training PINNs: A Loss Landscape Perspective | Unknown | N/A | |
| High-Probability Convergence for Composite and Distributed Stochastic Minimization and Variational Inequalities with Heavy-Tailed Noise | Unknown | N/A | |
| Position: The Causal Revolution Needs Scientific Pragmatism | Unknown | N/A | |
| A Unified Recipe for Deriving (Time-Uniform) PAC-Bayes Bounds | Unknown | N/A | |
| RankSEG: A Consistent Ranking-based Framework for Segmentation | Unknown | N/A | |
| PruNeRF: Segment-Centric Dataset Pruning via 3D Spatial Consistency | Unknown | N/A | |
| Bridging discrete and continuous state spaces: Exploring the Ehrenfest process in time-continuous diffusion models | Unknown | N/A | |
| Agnostic Interactive Imitation Learning: New Theory and Practical Algorithms | Unknown | N/A | |
| Coprocessor Actor Critic: A Model-Based Reinforcement Learning Approach For Adaptive Brain Stimulation | Unknown | N/A | |
| Conformal Validity Guarantees Exist for Any Data Distribution (and How to Find Them) | Unknown | N/A | |
| Learning to Continually Learn with the Bayesian Principle | Unknown | N/A | |
| Benign Overfitting in Two-Layer ReLU Convolutional Neural Networks for XOR Data | Unknown | N/A | |
| Repoformer: Selective Retrieval for Repository-Level Code Completion | Unknown | N/A | |
| Zero-Sum Positional Differential Games as a Framework for Robust Reinforcement Learning: Deep Q-Learning Approach | Unknown | N/A | |
| On the Generalization of Stochastic Gradient Descent with Momentum | Unknown | N/A | |
| Q-Star Meets Scalable Posterior Sampling: Bridging Theory and Practice via HyperAgent | Unknown | N/A | |
| A Geometric Explanation of the Likelihood OOD Detection Paradox | Unknown | N/A | |
| Leveraging Attractor Dynamics in Spatial Navigation for Better Language Parsing | Unknown | N/A | |
| Disparate Impact on Group Accuracy of Linearization for Private Inference | Unknown | N/A | |
| SqueezeLLM: Dense-and-Sparse Quantization | Unknown | N/A | |
| Closing the Gap: Achieving Global Convergence (Last Iterate) of Actor-Critic under Markovian Sampling with Neural Network Parametrization | Unknown | N/A | |
| An LLM Compiler for Parallel Function Calling | Unknown | N/A | |
| Unbiased Multi-Label Learning from Crowdsourced Annotations | Unknown | N/A | |
| SleepFM: Multi-modal Representation Learning for Sleep Across Brain Activity, ECG and Respiratory Signals | Unknown | N/A | |
| Position: Evolving AI Collectives Enhance Human Diversity and Enable Self-Regulation | Unknown | N/A | |
| Revisiting Context Aggregation for Image Matting | Unknown | N/A | |
| Learning Scale-Aware Spatio-temporal Implicit Representation for Event-based Motion Deblurring | Unknown | N/A | |
| Mind the Boundary: Coreset Selection via Reconstructing the Decision Boundary | Unknown | N/A | |
| Position: Standardization of Behavioral Use Clauses is Necessary for the Adoption of Responsible Licensing of AI | Unknown | N/A | |
| Infinite-Horizon Distributionally Robust Regret-Optimal Control | Unknown | N/A | |
| Accelerating Heterogeneous Federated Learning with Closed-form Classifiers | Unknown | N/A | |
| Irregular Multivariate Time Series Forecasting: A Transformable Patching Graph Neural Networks Approach | Unknown | N/A | |
| Simplicity Bias via Global Convergence of Sharpness Minimization | Unknown | N/A | |
| Why Do You Grok? A Theoretical Analysis on Grokking Modular Addition | Unknown | N/A | |
| An Intrinsic Vector Heat Network | Unknown | N/A | |
| Mean Field Langevin Actor-Critic: Faster Convergence and Global Optimality beyond Lazy Learning | Unknown | N/A | |
| Towards an Understanding of Stepwise Inference in Transformers: A Synthetic Graph Navigation Model | Unknown | N/A | |
| Neural-Kernel Conditional Mean Embeddings | Unknown | N/A | |
| Stochastic Quantum Sampling for Non-Logconcave Distributions and Estimating Partition Functions | Unknown | N/A | |
| Enabling Uncertainty Estimation in Iterative Neural Networks | Unknown | N/A | |
| MultiMax: Sparse and Multi-Modal Attention Learning | Unknown | N/A | |
| Improving Neural Additive Models with Bayesian Principles | Unknown | N/A | |
| Averaging $n$-step Returns Reduces Variance in Reinforcement Learning | Unknown | N/A | |
| Implicit meta-learning may lead language models to trust more reliable sources | Unknown | N/A | |
| Multi-Track Message Passing: Tackling Oversmoothing and Oversquashing in Graph Learning via Preventing Heterophily Mixing | Unknown | N/A | |
| Overestimation, Overfitting, and Plasticity in Actor-Critic: the Bitter Lesson of Reinforcement Learning | Unknown | N/A | |
| Residual-Conditioned Optimal Transport: Towards Structure-Preserving Unpaired and Paired Image Restoration | Unknown | N/A | |
| Prometheus: Out-of-distribution Fluid Dynamics Modeling with Disentangled Graph ODE | Unknown | N/A | |
| Do Models Explain Themselves? Counterfactual Simulatability of Natural Language Explanations | Unknown | N/A | |
| Total Variation Distance Meets Probabilistic Inference | Unknown | N/A | |
| Improving Adversarial Energy-Based Model via Diffusion Process | Unknown | N/A | |
| A3S: A General Active Clustering Method with Pairwise Constraints | Unknown | N/A | |
| An Online Optimization Perspective on First-Order and Zero-Order Decentralized Nonsmooth Nonconvex Stochastic Optimization | Unknown | N/A | |
| Learning a Diffusion Model Policy from Rewards via Q-Score Matching | Unknown | N/A | |
| Online Cascade Learning for Efficient Inference over Streams | Unknown | N/A | |
| Investigating Pre-Training Objectives for Generalization in Vision-Based Reinforcement Learning | Unknown | N/A | |
| Learning from Streaming Data when Users Choose | Unknown | N/A | |
| Scaling Speech Technology to 1,000+ Languages | Unknown | N/A | |
| Robustness of Nonlinear Representation Learning | Unknown | N/A | |
| Conformal Prediction with Learned Features | Unknown | N/A | |
| Symmetry Induces Structure and Constraint of Learning | Unknown | N/A | |
| Achieving Margin Maximization Exponentially Fast via Progressive Norm Rescaling | Unknown | N/A | |
| T-Cal: An Optimal Test for the Calibration of Predictive Models | Unknown | N/A | |
| Watermark Stealing in Large Language Models | Unknown | N/A | |
| Prior Specification for Bayesian Matrix Factorization via Prior Predictive Matching | Unknown | N/A | |
| Fair Data Representation for Machine Learning at the Pareto Frontier | Unknown | N/A | |
| A Dynamical Model of Neural Scaling Laws | Unknown | N/A | |
| Adaptive Learning of Density Ratios in RKHS | Unknown | N/A | |
| Adaptively Perturbed Mirror Descent for Learning in Games | Unknown | N/A | |
| Bayesian Uncertainty for Gradient Aggregation in Multi-Task Learning | Unknown | N/A | |
| AttnLRP: Attention-Aware Layer-Wise Relevance Propagation for Transformers | Unknown | N/A | |
| FrameQuant: Flexible Low-Bit Quantization for Transformers | Unknown | N/A | |
| BeigeMaps: Behavioral Eigenmaps for Reinforcement Learning from Images | Unknown | N/A | |
| Optimal Coresets for Low-Dimensional Geometric Median | Unknown | N/A | |
| Learning to Play Atari in a World of Tokens | Unknown | N/A | |
| Probabilistic Generating Circuits - Demystified | Unknown | N/A | |
| The Non-linear $F$-Design and Applications to Interactive Learning | Unknown | N/A | |
| LeaPformer: Enabling Linear Transformers for Autoregressive and Simultaneous Tasks via Learned Proportions | Unknown | N/A | |
| Distinguishing the Knowable from the Unknowable with Language Models | Unknown | N/A | |
| How to Escape Sharp Minima with Random Perturbations | Unknown | N/A | |
| Understanding Adam Optimizer via Online Learning of Updates: Adam is FTRL in Disguise | Unknown | N/A | |
| Not all distributional shifts are equal: Fine-grained robust conformal inference | Unknown | N/A | |
| Triple Changes Estimator for Targeted Policies | Unknown | N/A | |
| Learning Mixtures of Gaussian Processes through Random Projection | Unknown | N/A | |
| Nonlinear Filtering with Brenier Optimal Transport Maps | Unknown | N/A | |
| Revisiting Inexact Fixed-Point Iterations for Min-Max Problems: Stochasticity and Structured Nonconvexity | Unknown | N/A | |
| Gaussian Processes on Cellular Complexes | Unknown | N/A | |
| No Dimensional Sampling Coresets for Classification | Unknown | N/A | |
| Physics of Language Models: Part 3.1, Knowledge Storage and Extraction | Unknown | N/A | |
| Byzantine-Robust Federated Learning: Impact of Client Subsampling and Local Updates | Unknown | N/A | |
| The Privacy Power of Correlated Noise in Decentralized Learning | Unknown | N/A | |
| Robust and Conjugate Gaussian Process Regression | Unknown | N/A | |
| Position: Stop Making Unscientific AGI Performance Claims | Unknown | N/A | |
| Hyperbolic Optimizer as a Dynamical System | Unknown | N/A | |
| Stationarity without mean reversion in improper Gaussian processes | Unknown | N/A | |
| Robust Graph Matching when Nodes are Corrupt | Unknown | N/A | |
| Fast Algorithms for Hypergraph PageRank with Applications to Semi-Supervised Learning | Unknown | N/A | |
| Scalable Online Exploration via Coverability | Unknown | N/A | |
| Adaptive Hierarchical Certification for Segmentation using Randomized Smoothing | Unknown | N/A | |
| Online conformal prediction with decaying step sizes | Unknown | N/A | |
| A Rate-Distortion View of Uncertainty Quantification | Unknown | N/A | |
| Practical Performance Guarantees for Pipelined DNN Inference | Unknown | N/A | |
| Causal Action Influence Aware Counterfactual Data Augmentation | Unknown | N/A | |
| An amortized approach to non-linear mixed-effects modeling based on neural posterior estimation | Unknown | N/A | |
| Learning the Target Network in Function Space | Unknown | N/A | |
| Private Vector Mean Estimation in the Shuffle Model: Optimal Rates Require Many Messages | Unknown | N/A | |
| Delaunay Graph: Addressing Over-Squashing and Over-Smoothing Using Delaunay Triangulation | Unknown | N/A | |
| How Free is Parameter-Free Stochastic Optimization? | Unknown | N/A | |
| Random features models: a way to study the success of naive imputation | Unknown | N/A | |
| Bipartite Matching in Massive Graphs: A Tight Analysis of EDCS | Unknown | N/A | |
| Simulation of Graph Algorithms with Looped Transformers | Unknown | N/A | |
| Diffusion Models Demand Contrastive Guidance for Adversarial Purification to Advance | Unknown | N/A | |
| On the Complexity of Finite-Sum Smooth Optimization under the Polyak–Łojasiewicz Condition | Unknown | N/A | |
| Constrained Ensemble Exploration for Unsupervised Skill Discovery | Unknown | N/A | |
| Memory Consolidation Enables Long-Context Video Understanding | Unknown | N/A | |
| On the Identifiability of Switching Dynamical Systems | Unknown | N/A | |
| Analyzing $D^\alpha$ seeding for $k$-means | Unknown | N/A | |
| Relational DNN Verification With Cross Executional Bound Refinement | Unknown | N/A | |
| VNN: Verification-Friendly Neural Networks with Hard Robustness Guarantees | Unknown | N/A | |
| Scale-Free Image Keypoints Using Differentiable Persistent Homology | Unknown | N/A | |
| Adversarial Robustness Limits via Scaling-Law and Human-Alignment Studies | Unknown | N/A | |
| Neural Diffusion Models | Unknown | N/A | |
| Generalization in Kernel Regression Under Realistic Assumptions | Unknown | N/A | |
| On Mechanistic Knowledge Localization in Text-to-Image Generative Models | Unknown | N/A | |
| Monotone Individual Fairness | Unknown | N/A | |
| Vocabulary for Universal Approximation: A Linguistic Perspective of Mapping Compositions | Unknown | N/A | |
| Standardized Interpretable Fairness Measures for Continuous Risk Scores | Unknown | N/A | |
| Neural Networks Learn Statistics of Increasing Complexity | Unknown | N/A | |
| The Role of Learning Algorithms in Collective Action | Unknown | N/A | |
| CoLoRA: Continuous low-rank adaptation for reduced implicit neural modeling of parameterized partial differential equations | Unknown | N/A | |
| By Tying Embeddings You Are Assuming the Distributional Hypothesis | Unknown | N/A | |
| Controlling Behavioral Diversity in Multi-Agent Reinforcement Learning | Unknown | N/A | |
| Refining Minimax Regret for Unsupervised Environment Design | Unknown | N/A | |
| Guiding LLMs The Right Way: Fast, Non-Invasive Constrained Generation | Unknown | N/A | |
| Sarah Frank-Wolfe: Methods for Constrained Optimization with Best Rates and Practical Features | Unknown | N/A | |
| Position: Scaling Simulation is Neither Necessary Nor Sufficient for In-the-Wild Robot Manipulation | Unknown | N/A | |
| Why do Variational Autoencoders Really Promote Disentanglement? | Unknown | N/A | |
| Best of Both Worlds Guarantees for Smoothed Online Quadratic Optimization | Unknown | N/A | |
| Multi-Patch Prediction: Adapting Language Models for Time Series Representation Learning | Unknown | N/A | |
| Naive Bayes Classifiers over Missing Data: Decision and Poisoning | Unknown | N/A | |
| Improving fine-grained understanding in image-text pre-training | Unknown | N/A | |
| Position: Explain to Question not to Justify | Unknown | N/A | |
| Biharmonic Distance of Graphs and its Higher-Order Variants: Theoretical Properties with Applications to Centrality and Clustering | Unknown | N/A | |
| Stability Evaluation through Distributional Perturbation Analysis | Unknown | N/A | |
| Dynamic Survival Analysis with Controlled Latent States | Unknown | N/A | |
| Beyond ELBOs: A Large-Scale Evaluation of Variational Methods for Sampling | Unknown | N/A | |
| Shifted Interpolation for Differential Privacy | Unknown | N/A | |
| How Spurious Features are Memorized: Precise Analysis for Random and NTK Features | Unknown | N/A | |
| Towards Understanding the Word Sensitivity of Attention Layers: A Study via Random Features | Unknown | N/A | |
| Position: Machine Learning-powered Assessments of the EU Digital Services Act Aid Quantify Policy Impacts on Online Harms | Unknown | N/A | |
| Random matrix theory improved Fréchet mean of symmetric positive definite matrices | Unknown | N/A | |
| On dimensionality of feature vectors in MPNNs | Unknown | N/A | |
| Fully-Dynamic Approximate Decision Trees With Worst-Case Update Time Guarantees | Unknown | N/A | |
| Applying language models to algebraic topology: generating simplicial cycles using multi-labeling in Wu's formula | Unknown | N/A | |
| Private Gradient Descent for Linear Regression: Tighter Error Bounds and Instance-Specific Uncertainty Estimation | Unknown | N/A | |
| Provably Neural Active Learning Succeeds via Prioritizing Perplexing Samples | Unknown | N/A | |
| Tackling Prevalent Conditions in Unsupervised Combinatorial Optimization: Cardinality, Minimum, Covering, and More | Unknown | N/A | |
| Differentially Private Bias-Term Fine-tuning of Foundation Models | Unknown | N/A | |
| Langevin Policy for Safe Reinforcement Learning | Unknown | N/A | |
| Semantically-correlated memories in a dense associative model | Unknown | N/A | |
| How Uniform Random Weights Induce Non-uniform Bias: Typical Interpolating Neural Networks Generalize with Narrow Teachers | Unknown | N/A | |
| Medusa: Simple LLM Inference Acceleration Framework with Multiple Decoding Heads | Unknown | N/A | |
| Enhancing Cross-Modal Fine-Tuning with Gradually Intermediate Modality Generation | Unknown | N/A | |
| Accelerated Algorithms for Constrained Nonconvex-Nonconcave Min-Max Optimization and Comonotone Inclusion | Unknown | N/A | |
| Sample-specific Masks for Visual Reprogramming-based Prompting | Unknown | N/A | |
| Generative Flows on Discrete State-Spaces: Enabling Multimodal Flows with Applications to Protein Co-Design | Unknown | N/A | |
| Successor Features for Efficient Multi-Subject Controlled Text Generation | Unknown | N/A | |
| Limited Preference Aided Imitation Learning from Imperfect Demonstrations | Unknown | N/A | |
| Predictive Dynamic Fusion | Unknown | N/A | |
| Can a Few Decide for Many? The Metric Distortion of Sortition | Unknown | N/A | |
| AI Alignment with Changing and Influenceable Reward Functions | Unknown | N/A | |
| Online Learning under Budget and ROI Constraints via Weak Adaptivity | Unknown | N/A | |
| On the Implicit Bias of Adam | Unknown | N/A | |
| Feasibility Consistent Representation Learning for Safe Reinforcement Learning | Unknown | N/A | |
| Simple Ingredients for Offline Reinforcement Learning | Unknown | N/A | |
| Auditing Private Prediction | Unknown | N/A | |
| Scribble-Supervised Semantic Segmentation with Prototype-based Feature Augmentation | Unknown | N/A | |
| Feature Importance Disparities for Data Bias Investigations | Unknown | N/A | |
| Inferring Dynamic Networks from Marginals with Iterative Proportional Fitting | Unknown | N/A | |
| MagicPose: Realistic Human Poses and Facial Expressions Retargeting with Identity-aware Diffusion | Unknown | N/A | |
| How Interpretable Are Interpretable Graph Neural Networks? | Unknown | N/A | |
| Doubly Robust Causal Effect Estimation under Networked Interference via Targeted Learning | Unknown | N/A | |
| Feature Attribution with Necessity and Sufficiency via Dual-stage Perturbation Test for Causal Explanation | Unknown | N/A | |
| InstructZero: Efficient Instruction Optimization for Black-Box Large Language Models | Unknown | N/A | |
| MaSS: Multi-attribute Selective Suppression for Utility-preserving Data Transformation from an Information-theoretic Perspective | Unknown | N/A | |
| Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models | Unknown | N/A | |
| Robust Classification via a Single Diffusion Model | Unknown | N/A | |
| Relational Learning in Pre-Trained Models: A Theory from Hypergraph Recovery Perspective | Unknown | N/A | |
| Towards AutoAI: Optimizing a Machine Learning System with Black-box and Differentiable Components | Unknown | N/A | |
| RigorLLM: Resilient Guardrails for Large Language Models against Undesired Content | Unknown | N/A | |
| Probabilistic Forecasting with Stochastic Interpolants and Föllmer Processes | Unknown | N/A | |
| CogDPM: Diffusion Probabilistic Models via Cognitive Predictive Coding | Unknown | N/A | |
| Bagged Deep Image Prior for Recovering Images in the Presence of Speckle Noise | Unknown | N/A | |
| Accelerated Policy Gradient for s-rectangular Robust MDPs with Large State Spaces | Unknown | N/A | |
| Improved Communication-Privacy Trade-offs in $L_2$ Mean Estimation under Streaming Differential Privacy | Unknown | N/A | |
| Offline Transition Modeling via Contrastive Energy Learning | Unknown | N/A | |
| Efficient Pareto Manifold Learning with Low-Rank Structure | Unknown | N/A | |
| Identifiability Matters: Revealing the Hidden Recoverable Condition in Unbiased Learning to Rank | Unknown | N/A | |
| High-Dimensional Kernel Methods under Covariate Shift: Data-Dependent Implicit Regularization | Unknown | N/A | |
| DiJiang: Efficient Large Language Models through Compact Kernelization | Unknown | N/A | |
| EE-LLM: Large-Scale Training and Inference of Early-Exit Large Language Models with 3D Parallelism | Unknown | N/A | |
| MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models | Unknown | N/A | |
| CaRiNG: Learning Temporal Causal Representation under Non-Invertible Generation Process | Unknown | N/A | |
| GRATH: Gradual Self-Truthifying for Large Language Models | Unknown | N/A | |
| Performative Prediction with Bandit Feedback: Learning through Reparameterization | Unknown | N/A | |
| Self-Attention through Kernel-Eigen Pair Sparse Variational Gaussian Processes | Unknown | N/A | |
| Recovering Labels from Local Updates in Federated Learning | Unknown | N/A | |
| Locally Differentially Private Decentralized Stochastic Bilevel Optimization with Guaranteed Convergence Accuracy | Unknown | N/A | |
| Subequivariant Reinforcement Learning in 3D Multi-Entity Physical Environments | Unknown | N/A | |
| A General Framework for Learning from Weak Supervision | Unknown | N/A | |
| Diffusion Model-Augmented Behavioral Cloning | Unknown | N/A | |
| Positional Knowledge is All You Need: Position-induced Transformer (PiT) for Operator Learning | Unknown | N/A | |
| In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation | Unknown | N/A | |
| Split-Ensemble: Efficient OOD-aware Ensemble via Task and Model Splitting | Unknown | N/A | |
| DRCT: Diffusion Reconstruction Contrastive Training towards Universal Detection of Diffusion Generated Images | Unknown | N/A | |
| FedMBridge: Bridgeable Multimodal Federated Learning | Unknown | N/A | |
| Revealing the Dark Secrets of Extremely Large Kernel ConvNets on Robustness | Unknown | N/A | |
| Diffusive Gibbs Sampling | Unknown | N/A | |
| Provable Risk-Sensitive Distributional Reinforcement Learning with General Function Approximation | Unknown | N/A | |
| LLaGA: Large Language and Graph Assistant | Unknown | N/A | |
| Compact Optimality Verification for Optimization Proxies | Unknown | N/A | |
| Enhancing Implicit Shape Generators Using Topological Regularizations | Unknown | N/A | |
| Stacking Deep Set Networks and Pooling by Quantiles | Unknown | N/A | |
| What Can Transformer Learn with Varying Depth? Case Studies on Sequence Learning Tasks | Unknown | N/A | |
| BadPart: Unified Black-box Adversarial Patch Attacks against Pixel-wise Regression Tasks | Unknown | N/A | |
| GaussianPro: 3D Gaussian Splatting with Progressive Propagation | Unknown | N/A | |
| RICE: Breaking Through the Training Bottlenecks of Reinforcement Learning with Explanation | Unknown | N/A | |
| RIME: Robust Preference-based Reinforcement Learning with Noisy Preferences | Unknown | N/A | |
| Kernel Semi-Implicit Variational Inference | Unknown | N/A | |
| Creative Text-to-Audio Generation via Synthesizer Programming | Unknown | N/A | |
| Leveraging (Biased) Information: Multi-armed Bandits with Offline Data | Unknown | N/A | |
| Expert Proximity as Surrogate Rewards for Single Demonstration Imitation Learning | Unknown | N/A | |
| MS-TIP: Imputation Aware Pedestrian Trajectory Prediction | Unknown | N/A | |
| Enhancing Trajectory Prediction through Self-Supervised Waypoint Distortion Prediction | Unknown | N/A | |
| How Flawed Is ECE? An Analysis via Logit Smoothing | Unknown | N/A | |
| Kernel Debiased Plug-in Estimation: Simultaneous, Automated Debiasing without Influence Functions for Many Target Parameters | Unknown | N/A | |
| Hard Tasks First: Multi-Task Reinforcement Learning Through Task Scheduling | Unknown | N/A | |
| KV-Runahead: Scalable Causal LLM Inference by Parallel Key-Value Cache Generation | Unknown | N/A | |
| Neurodegenerative Brain Network Classification via Adaptive Diffusion with Temporal Regularization | Unknown | N/A | |
| Scalable Wasserstein Gradient Flow for Generative Modeling through Unbalanced Optimal Transport | Unknown | N/A | |
| Listwise Reward Estimation for Offline Preference-based Reinforcement Learning | Unknown | N/A | |
| PICLe: Eliciting Diverse Behaviors from Large Language Models with Persona In-Context Learning | Unknown | N/A | |
| Online bipartite matching with imperfect advice | Unknown | N/A | |
| A connection between Tempering and Entropic Mirror Descent | Unknown | N/A | |
| A Provably Effective Method for Pruning Experts in Fine-tuned Sparse Mixture-of-Experts | Unknown | N/A | |
| How Private are DP-SGD Implementations? | Unknown | N/A | |
| Prompt-tuning Latent Diffusion Models for Inverse Problems | Unknown | N/A | |
| Studying K-FAC Heuristics by Viewing Adam through a Second-Order Lens | Unknown | N/A | |
| $\mathtt{VITS}$: Variational Inference Thompson Sampling for contextual bandits | Unknown | N/A | |
| Improving Token-Based World Models with Parallel Observation Prediction | Unknown | N/A | |
| Multi-View Stochastic Block Models | Unknown | N/A | |
| Weighted distance nearest neighbor condensing | Unknown | N/A | |
| A Near-Linear Time Approximation Algorithm for Beyond-Worst-Case Graph Clustering | Unknown | N/A | |
| Dynamic Correlation Clustering in Sublinear Update Time | Unknown | N/A | |
| A2Q+: Improving Accumulator-Aware Weight Quantization | Unknown | N/A | |
| Statistical Inference Under Constrained Selection Bias | Unknown | N/A | |
| Conformal Prediction Sets Improve Human Decision Making | Unknown | N/A | |
| Generalization Bounds for Causal Regression: Insights, Guarantees and Sensitivity Analysis | Unknown | N/A | |
| Harmonizing Generalization and Personalization in Federated Prompt Learning | Unknown | N/A | |
| ULTRAFEEDBACK: Boosting Language Models with Scaled AI Feedback | Unknown | N/A | |
| Position: Beyond Personhood: Agency, Accountability, and the Limits of Anthropomorphic Ethical Analysis | Unknown | N/A | |
| Multi-View Clustering by Inter-cluster Connectivity Guided Reward | Unknown | N/A | |
| High-Order Contrastive Learning with Fine-grained Comparative Levels for Sparse Ordinal Tensor Completion | Unknown | N/A | |
| Safe Reinforcement Learning using Finite-Horizon Gradient-based Estimation | Unknown | N/A | |
| Neural Collapse for Cross-entropy Class-Imbalanced Learning with Unconstrained ReLU Features Model | Unknown | N/A | |
| Boosting Offline Optimizers with Surrogate Sensitivity | Unknown | N/A | |
| Test-Time Degradation Adaptation for Open-Set Image Restoration | Unknown | N/A | |
| A decoder-only foundation model for time-series forecasting | Unknown | N/A | |
| New Bounds on the Cohesion of Complete-link and Other Linkage Methods for Agglomerative Clustering | Unknown | N/A | |
| Geometric Active Exploration in Markov Decision Processes: the Benefit of Abstraction | Unknown | N/A | |
| Global Reinforcement Learning: Beyond Linear and Convex Rewards via Submodular Semi-gradient Methods | Unknown | N/A | |
| Provably Better Explanations with Optimized Aggregation of Feature Attributions | Unknown | N/A | |
| Learning Cognitive Maps from Transformer Representations for Efficient Planning in Partially Observed Environments | Unknown | N/A | |
| Asymptotically Optimal and Computationally Efficient Average Treatment Effect Estimation in A/B testing | Unknown | N/A | |
| Predicting Lagrangian Multipliers for Mixed Integer Linear Programs | Unknown | N/A | |
| Prediction-powered Generalization of Causal Inferences | Unknown | N/A | |
| An Unsupervised Approach for Periodic Source Detection in Time Series | Unknown | N/A | |
| Collaborative Learning with Different Labeling Functions | Unknown | N/A | |
| Exploring the Low-Pass Filtering Behavior in Image Super-Resolution | Unknown | N/A | |
| Network Tight Community Detection | Unknown | N/A | |
| Going beyond Compositions, DDPMs Can Produce Zero-Shot Interpolations | Unknown | N/A | |
| Multicalibration for Confidence Scoring in LLMs | Unknown | N/A | |
| Contextualized Policy Recovery: Modeling and Interpreting Medical Decisions with Adaptive Imitation Learning | Unknown | N/A | |
| Locally Interdependent Multi-Agent MDP: Theoretical Framework for Decentralized Agents with Dynamic Dependencies | Unknown | N/A | |
| Trust Regions for Explanations via Black-Box Probabilistic Certification | Unknown | N/A | |
| Double Stochasticity Gazes Faster: Snap-Shot Decentralized Stochastic Gradient Tracking Methods | Unknown | N/A | |
| Double Variance Reduction: A Smoothing Trick for Composite Optimization Problems without First-Order Gradient | Unknown | N/A | |
| Robust Sparse Estimation for Gaussians with Optimal Error under Huber Contamination | Unknown | N/A | |
| Fast Co-Training under Weak Dependence via Stream-Based Active Learning | Unknown | N/A | |
| Convex and Bilevel Optimization for Neural-Symbolic Inference and Learning | Unknown | N/A | |
| Efficient Algorithms for Sum-Of-Minimum Optimization | Unknown | N/A | |
| Robust Stable Spiking Neural Networks | Unknown | N/A | |
| Quality Diversity through Human Feedback: Towards Open-Ended Diversity-Driven Optimization | Unknown | N/A | |
| LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens | Unknown | N/A | |
| Learning-Rate-Free Stochastic Optimization over Riemannian Manifolds | Unknown | N/A | |
| Consistent Adversarially Robust Linear Classification: Non-Parametric Setting | Unknown | N/A | |
| Precise Accuracy / Robustness Tradeoffs in Regression: Case of General Norms | Unknown | N/A | |
| Spectral Preconditioning for Gradient Methods on Graded Non-convex Functions | Unknown | N/A | |
| Impact of Decentralized Learning on Player Utilities in Stackelberg Games | Unknown | N/A | |
| Towards Generalization beyond Pointwise Learning: A Unified Information-theoretic Perspective | Unknown | N/A | |
| Pruner-Zero: Evolving Symbolic Pruning Metric From Scratch for Large Language Models | Unknown | N/A | |
| Position: Building Guardrails for Large Language Models Requires Systematic Design | Unknown | N/A | |
| Accelerating PDE Data Generation via Differential Operator Action in Solution Space | Unknown | N/A | |
| TimeSiam: A Pre-Training Framework for Siamese Time-Series Modeling | Unknown | N/A | |
| Get More with LESS: Synthesizing Recurrence with KV Cache Compression for Efficient LLM Inference | Unknown | N/A | |
| Privacy-Preserving Data Release Leveraging Optimal Transport and Particle Gradient Descent | Unknown | N/A | |
| Spike Distance Function as a Learning Objective for Spike Prediction | Unknown | N/A | |
| On the Universality of Volume-Preserving and Coupling-Based Normalizing Flows | Unknown | N/A | |
| WorkArena: How Capable are Web Agents at Solving Common Knowledge Work Tasks? | Unknown | N/A | |
| Principled Gradient-Based MCMC for Conditional Sampling of Text | Unknown | N/A | |
| Position: Compositional Generative Modeling: A Single Model is Not All You Need | Unknown | N/A | |
| Improving Factuality and Reasoning in Language Models through Multiagent Debate | Unknown | N/A | |
| Learning Iterative Reasoning through Energy Diffusion | Unknown | N/A | |
| When and How Does In-Distribution Label Help Out-of-Distribution Detection? | Unknown | N/A | |
| AnyTool: Self-Reflective, Hierarchical Agents for Large-Scale API Calls | Unknown | N/A | |
| MuxServe: Flexible Spatial-Temporal Multiplexing for Multiple LLM Serving | Unknown | N/A | |
| MF-CLR: Multi-Frequency Contrastive Learning Representation for Time Series | Unknown | N/A | |
| DE-COP: Detecting Copyrighted Content in Language Models Training Data | Unknown | N/A | |
| Unveiling the Potential of AI for Nanomaterial Morphology Prediction | Unknown | N/A | |
| Sharpness-Aware Data Generation for Zero-shot Quantization | Unknown | N/A | |
| Making Old Things New: A Unified Algorithm for Differentially Private Clustering | Unknown | N/A | |
| Generalization Bounds for Heavy-Tailed SDEs through the Fractional Fokker-Planck Equation | Unknown | N/A | |
| Outlier-robust Kalman Filtering through Generalised Bayes | Unknown | N/A | |
| Barrier Algorithms for Constrained Non-Convex Optimization | Unknown | N/A | |
| Equivariant Frames and the Impossibility of Continuous Canonicalization | Unknown | N/A | |
| Position: Insights from Survey Methodology can Improve Training Data | Unknown | N/A | |
| Efficient Error Certification for Physics-Informed Neural Networks | Unknown | N/A | |
| Scalable Pre-training of Large Autoregressive Image Models | Unknown | N/A | |
| TSLANet: Rethinking Transformers for Time Series Representation Learning | Unknown | N/A | |
| Revisiting Scalable Hessian Diagonal Approximations for Applications in Reinforcement Learning | Unknown | N/A | |
| Approximate Nearest Neighbor Search with Window Filters | Unknown | N/A | |
| DsDm: Model-Aware Dataset Selection with Datamodels | Unknown | N/A | |
| Compositional Curvature Bounds for Deep Neural Networks | Unknown | N/A | |
| PAC-Bayesian Error Bound, via Rényi Divergence, for a Class of Linear Time-Invariant State-Space Models | Unknown | N/A | |
| Model Alignment as Prospect Theoretic Optimization | Unknown | N/A | |
| Out of the Ordinary: Spectrally Adapting Regression for Covariate Shift | Unknown | N/A | |
| Revisit the Essence of Distilling Knowledge through Calibration | Unknown | N/A | |
| DOGE: Domain Reweighting with Generalization Estimation | Unknown | N/A | |
| Bayesian Knowledge Distillation: A Bayesian Perspective of Distillation with Uncertainty Quantification | Unknown | N/A | |
| Exploring Correlations of Self-Supervised Tasks for Graphs | Unknown | N/A | |
| INViT: A Generalizable Routing Problem Solver with Invariant Nested View Transformer | Unknown | N/A | |
| Stop Regressing: Training Value Functions via Classification for Scalable Deep RL | Unknown | N/A | |
| From Geometry to Causality- Ricci Curvature and the Reliability of Causal Inference on Networks | Unknown | N/A | |
| Video-of-Thought: Step-by-Step Video Reasoning from Perception to Cognition | Unknown | N/A | |
| Resisting Stochastic Risks in Diffusion Planners with the Trajectory Aggregation Tree | Unknown | N/A | |
| Fast White-Box Adversarial Streaming Without a Random Oracle | Unknown | N/A | |
| DSD-DA: Distillation-based Source Debiasing for Domain Adaptive Object Detection | Unknown | N/A | |
| Keypoint-based Progressive Chain-of-Thought Distillation for LLMs | Unknown | N/A | |
| UniCorn: A Unified Contrastive Learning Approach for Multi-view Molecular Representation Learning | Unknown | N/A | |
| Sliced-Wasserstein Estimation with Spherical Harmonics as Control Variates | Unknown | N/A | |
| Privacy Backdoors: Stealing Data with Corrupted Pretrained Models | Unknown | N/A | |
| Improving Sample Efficiency of Model-Free Algorithms for Zero-Sum Markov Games | Unknown | N/A | |
| Reservoir Computing for Short High-Dimensional Time Series: an Application to SARS-CoV-2 Hospitalization Forecast | Unknown | N/A | |
| Position: Relational Deep Learning - Graph Representation Learning on Relational Databases | Unknown | N/A | |
| Critical feature learning in deep neural networks | Unknown | N/A | |
| Neuroexplicit Diffusion Models for Inpainting of Optical Flow Fields | Unknown | N/A | |
| Inverse-Variance Weighting for Estimation of Heterogeneous Treatment Effects | Unknown | N/A | |
| Explaining Probabilistic Models with Distributional Values | Unknown | N/A | |
| Hyperbolic Active Learning for Semantic Segmentation under Domain Shift | Unknown | N/A | |
| Weisfeiler-Leman at the margin: When more expressivity matters | Unknown | N/A | |
| Unsupervised Zero-Shot Reinforcement Learning via Functional Reward Encodings | Unknown | N/A | |
| Trust the Model Where It Trusts Itself - Model-Based Actor-Critic with Uncertainty-Aware Rollout Adaption | Unknown | N/A | |
| Trustworthy Actionable Perturbations | Unknown | N/A | |
| Interpretability Illusions in the Generalization of Simplified Models | Unknown | N/A | |
| Hyperbolic Geometric Latent Diffusion Model for Graph Generation | Unknown | N/A | |
| Language-guided Skill Learning with Temporal Variational Inference | Unknown | N/A | |
| PinNet: Pinpoint Instructive Information for Retrieval Augmented Code-to-Text Generation | Unknown | N/A | |
| Towards Theoretical Understandings of Self-Consuming Generative Models | Unknown | N/A | |
| Positive Concave Deep Equilibrium Models | Unknown | N/A | |
| Let Go of Your Labels with Unsupervised Transfer | Unknown | N/A | |
| Erasing the Bias: Fine-Tuning Foundation Models for Semi-Supervised Learning | Unknown | N/A | |
| Reflective Policy Optimization | Unknown | N/A | |
| Testing the Feasibility of Linear Programs with Bandit Feedback | Unknown | N/A | |
| A Doubly Recursive Stochastic Compositional Gradient Descent Method for Federated Multi-Level Compositional Optimization | Unknown | N/A | |
| Stochastic Weakly Convex Optimization beyond Lipschitz Continuity | Unknown | N/A | |
| A Graph is Worth $K$ Words: Euclideanizing Graph using Pure Transformer | Unknown | N/A | |
| Multi-Agent Reinforcement Learning Meets Leaf Sequencing in Radiotherapy | Unknown | N/A | |
| DMTG: One-Shot Differentiable Multi-Task Grouping | Unknown | N/A | |
| Rethinking Specificity in SBDD: Leveraging Delta Score and Energy-Guided Diffusion | Unknown | N/A | |
| Non-convex Stochastic Composite Optimization with Polyak Momentum | Unknown | N/A | |
| Adaptive-Gradient Policy Optimization: Enhancing Policy Learning in Non-Smooth Differentiable Simulations | Unknown | N/A | |
| Decoupling Learning and Decision-Making: Breaking the $\mathcal{O}(\sqrt{T})$ Barrier in Online Resource Allocation with First-Order Methods | Unknown | N/A | |
| Parameter-Efficient Fine-Tuning with Discrete Fourier Transform | Unknown | N/A | |
| Fast-Slow Test-Time Adaptation for Online Vision-and-Language Navigation | Unknown | N/A | |
| Causal Customer Churn Analysis with Low-rank Tensor Block Hazard Model | Unknown | N/A | |
| Projection-Free Online Convex Optimization with Time-Varying Constraints | Unknown | N/A | |
| LLark: A Multimodal Instruction-Following Language Model for Music | Unknown | N/A | |
| Position: Categorical Deep Learning is an Algebraic Theory of All Architectures | Unknown | N/A | |
| Safe and Robust Subgame Exploitation in Imperfect Information Games | Unknown | N/A | |
| Don't trust your eyes: on the (un)reliability of feature visualizations | Unknown | N/A | |
| Learning with 3D rotations, a hitchhiker's guide to SO(3) | Unknown | N/A | |
| Graph-Triggered Rising Bandits | Unknown | N/A | |
| Reinforcement Learning within Tree Search for Fast Macro Placement | Unknown | N/A | |
| Patchscopes: A Unifying Framework for Inspecting Hidden Representations of Language Models | Unknown | N/A | |
| Individualized Privacy Accounting via Subsampling with Applications in Combinatorial Optimization | Unknown | N/A | |
| State-Constrained Zero-Sum Differential Games with One-Sided Information | Unknown | N/A | |
| Agnostic Learning of Mixed Linear Regressions with EM and AM Algorithms | Unknown | N/A | |
| Optimal Eye Surgeon: Finding image priors through sparse generators at initialization | Unknown | N/A | |
| Self-Correcting Self-Consuming Loops for Generative Model Training | Unknown | N/A | |
| Position: The No Free Lunch Theorem, Kolmogorov Complexity, and the Role of Inductive Biases in Machine Learning | Unknown | N/A | |
| CasCast: Skillful High-resolution Precipitation Nowcasting via Cascaded Modelling | Unknown | N/A | |
| Does Label Smoothing Help Deep Partial Label Learning? | Unknown | N/A | |
| AST-T5: Structure-Aware Pretraining for Code Generation and Understanding | Unknown | N/A | |
| Evolution-Inspired Loss Functions for Protein Representation Learning | Unknown | N/A | |
| Evaluation of LLMs on Syntax-Aware Code Fill-in-the-Middle Tasks | Unknown | N/A | |
| E$^2$GAN: Efficient Training of Efficient GANs for Image-to-Image Translation | Unknown | N/A | |
| Long Range Propagation on Continuous-Time Dynamic Graphs | Unknown | N/A | |
| Nonsmooth Implicit Differentiation: Deterministic and Stochastic Convergence Rates | Unknown | N/A | |
| Fine-grained Classes and How to Find Them | Unknown | N/A | |
| AI Control: Improving Safety Despite Intentional Subversion | Unknown | N/A | |
| Scaling Down Deep Learning with MNIST-1D | Unknown | N/A | |
| A Bias-Variance-Covariance Decomposition of Kernel Scores for Generative Models | Unknown | N/A | |
| EDISON: Enhanced Dictionary-Induced Tensorized Incomplete Multi-View Clustering with Gaussian Error Rank Minimization | Unknown | N/A | |
| CRUXEval: A Benchmark for Code Reasoning, Understanding and Execution | Unknown | N/A | |
| Predictive Performance Comparison of Decision Policies Under Confounding | Unknown | N/A | |
| On the Diminishing Returns of Width for Continual Learning | Unknown | N/A | |
| Automated Evaluation of Retrieval-Augmented Language Models with Task-Specific Exam Generation | Unknown | N/A | |
| DS-Agent: Automated Data Science by Empowering Large Language Models with Case-Based Reasoning | Unknown | N/A | |
| Collaborative Heterogeneous Causal Inference Beyond Meta-analysis | Unknown | N/A | |
| ACM-MILP: Adaptive Constraint Modification via Grouping and Selection for Hardness-Preserving MILP Instance Generation | Unknown | N/A | |
| FedRC: Tackling Diverse Distribution Shifts Challenge in Federated Learning by Robust Clustering | Unknown | N/A | |
| Compressing Large Language Models by Joint Sparsification and Quantization | Unknown | N/A | |
| Automated Loss function Search for Class-imbalanced Node Classification | Unknown | N/A | |
| Temporal Logic Specification-Conditioned Decision Transformer for Offline Safe Reinforcement Learning | Unknown | N/A | |
| GistScore: Learning Better Representations for In-Context Example Selection with Gist Bottlenecks | Unknown | N/A | |
| Vectorized Conditional Neural Fields: A Framework for Solving Time-dependent Parametric Partial Differential Equations | Unknown | N/A | |
| Isometric Representation Learning for Disentangled Latent Space of Diffusion Models | Unknown | N/A | |
| Pursuing Overall Welfare in Federated Learning through Sequential Decision Making | Unknown | N/A | |
| Dr. Strategy: Model-Based Generalist Agents with Strategic Dreaming | Unknown | N/A | |
| Riemannian coordinate descent algorithms on matrix manifolds | Unknown | N/A | |
| Prototypical Transformer As Unified Motion Learners | Unknown | N/A | |
| SIN: Selective and Interpretable Normalization for Long-Term Time Series Forecasting | Unknown | N/A | |
| Large Language Models Can Automatically Engineer Features for Few-Shot Tabular Learning | Unknown | N/A | |
| Binary Decomposition: A Problem Transformation Perspective for Open-Set Semi-Supervised Learning | Unknown | N/A | |
| Data-efficient Large Vision Models through Sequential Autoregression | Unknown | N/A | |
| MGit: A Model Versioning and Management System | Unknown | N/A | |
| DPOT: Auto-Regressive Denoising Operator Transformer for Large-Scale PDE Pre-Training | Unknown | N/A | |
| Convergence Guarantees for the DeepWalk Embedding on Block Models | Unknown | N/A | |
| Estimating the Permanent by Nesting Importance Sampling | Unknown | N/A | |
| Position: $C^*$-Algebraic Machine Learning $-$ Moving in a New Direction | Unknown | N/A | |
| Wasserstein Wormhole: Scalable Optimal Transport Distance with Transformer | Unknown | N/A | |
| GLoRe: When, Where, and How to Improve LLM Reasoning via Global and Local Refinements | Unknown | N/A | |
| MAGNOLIA: Matching Algorithms via GNNs for Online Value-to-go Approximation | Unknown | N/A | |
| LoRA+: Efficient Low Rank Adaptation of Large Models | Unknown | N/A | |
| Deep Neural Room Acoustics Primitive | Unknown | N/A | |
| Two Stones Hit One Bird: Bilevel Positional Encoding for Better Length Extrapolation | Unknown | N/A | |
| ReDiffuser: Reliable Decision-Making Using a Diffuser with Confidence Estimation | Unknown | N/A | |
| Domain-wise Data Acquisition to Improve Performance under Distribution Shift | Unknown | N/A | |
| Quantum Algorithm for Online Exp-concave Optimization | Unknown | N/A | |
| Riemannian Accelerated Zeroth-order Algorithm: Improved Robustness and Lower Query Complexity | Unknown | N/A | |
| Ambiguity-Aware Abductive Learning | Unknown | N/A | |
| Be Your Own Neighborhood: Detecting Adversarial Examples by the Neighborhood Relations Built on Self-Supervised Learning | Unknown | N/A | |
| Robust Multi-Task Learning with Excess Risks | Unknown | N/A | |
| DynSyn: Dynamical Synergistic Representation for Efficient Learning and Control in Overactuated Embodied Systems | Unknown | N/A | |
| Learning Useful Representations of Recurrent Neural Network Weight Matrices | Unknown | N/A | |
| Randomized Confidence Bounds for Stochastic Partial Monitoring | Unknown | N/A | |
| Understanding Diffusion Models by Feynman's Path Integral | Unknown | N/A | |
| Learning Surrogates for Offline Black-Box Optimization via Gradient Matching | Unknown | N/A | |
| Estimating Unknown Population Sizes Using the Hypergeometric Distribution | Unknown | N/A | |
| Two Tales of Single-Phase Contrastive Hebbian Learning | Unknown | N/A | |
| Verifying message-passing neural networks via topology-based bounds tightening | Unknown | N/A | |
| Criterion Collapse and Loss Distribution Control | Unknown | N/A | |
| Removing Spurious Concepts from Neural Network Representations via Joint Subspace Estimation | Unknown | N/A | |
| Decoding Compressed Trust: Scrutinizing the Trustworthiness of Efficient LLMs Under Compression | Unknown | N/A | |
| Enhancing Sufficient Dimension Reduction via Hellinger Correlation | Unknown | N/A | |
| A Primal-Dual Algorithm for Offline Constrained Reinforcement Learning with Linear MDPs | Unknown | N/A | |
| Do Large Code Models Understand Programming Concepts? Counterfactual Analysis for Code Predicates | Unknown | N/A | |
| Graph Neural PDE Solvers with Conservation and Similarity-Equivariance | Unknown | N/A | |
| Maestro: Uncovering Low-Rank Structures via Trainable Decomposition | Unknown | N/A | |
| Equilibrium of Data Markets with Externality | Unknown | N/A | |
| Multi-Sender Persuasion: A Computational Perspective | Unknown | N/A | |
| IBD-PSC: Input-level Backdoor Detection via Parameter-oriented Scaling Consistency | Unknown | N/A | |
| PrE-Text: Training Language Models on Private Federated Data in the Age of LLMs | Unknown | N/A | |
| Loss Shaping Constraints for Long-Term Time Series Forecasting | Unknown | N/A | |
| Careful with that Scalpel: Improving Gradient Surgery with an EMA | Unknown | N/A | |
| Tripod: Three Complementary Inductive Biases for Disentangled Representation Learning | Unknown | N/A | |
| Task-aware Orthogonal Sparse Network for Exploring Shared Knowledge in Continual Learning | Unknown | N/A | |
| An Information Theoretic Approach to Interaction-Grounded Learning | Unknown | N/A | |
| SceneCraft: An LLM Agent for Synthesizing 3D Scenes as Blender Code | Unknown | N/A | |
| Pseudo-Calibration: Improving Predictive Uncertainty Estimation in Unsupervised Domain Adaptation | Unknown | N/A | |
| Improving Interpretation Faithfulness for Vision Transformers | Unknown | N/A | |
| Multigroup Robustness | Unknown | N/A | |
| Provable Privacy with Non-Private Pre-Processing | Unknown | N/A | |
| Case-Based or Rule-Based: How Do Transformers Do the Math? | Unknown | N/A | |
| Sparse Model Inversion: Efficient Inversion of Vision Transformers for Data-Free Applications | Unknown | N/A | |
| High-Performance Temporal Reversible Spiking Neural Networks with $\mathcal{O}(L)$ Training Memory and $\mathcal{O}(1)$ Inference Cost | Unknown | N/A | |
| Accelerating Transformer Pre-training with 2:4 Sparsity | Unknown | N/A | |
| InfiAgent-DABench: Evaluating Agents on Data Analysis Tasks | Unknown | N/A | |
| ReconBoost: Boosting Can Achieve Modality Reconcilement | Unknown | N/A | |
| Auctionformer: A Unified Deep Learning Algorithm for Solving Equilibrium Strategies in Auction Games | Unknown | N/A | |
| In-context Convergence of Transformers | Unknown | N/A | |
| Near-Linear Time Approximation Algorithms for k-means with Outliers | Unknown | N/A | |
| Symbolic Music Generation with Non-Differentiable Rule Guided Diffusion | Unknown | N/A | |
| Model-Based RL for Mean-Field Games is not Statistically Harder than Single-Agent RL | Unknown | N/A | |
| InstructSpeech: Following Speech Editing Instructions via Large Language Models | Unknown | N/A | |
| Bayesian Power Steering: An Effective Approach for Domain Adaptation of Diffusion Models | Unknown | N/A | |
| CLIF: Complementary Leaky Integrate-and-Fire Neuron for Spiking Neural Networks | Unknown | N/A | |
| Machine Vision Therapy: Multimodal Large Language Models Can Enhance Visual Robustness via Denoising In-Context Learning | Unknown | N/A | |
| BiLLM: Pushing the Limit of Post-Training Quantization for LLMs | Unknown | N/A | |
| An Empirical Examination of Balancing Strategy for Counterfactual Estimation on Time Series | Unknown | N/A | |
| MFTN: A Multi-scale Feature Transfer Network Based on IMatchFormer for Hyperspectral Image Super-Resolution | Unknown | N/A | |
| On Which Nodes Does GCN Fail? Enhancing GCN From the Node Perspective | Unknown | N/A | |
| Quasi-Monte Carlo Features for Kernel Approximation | Unknown | N/A | |
| MLAgentBench: Evaluating Language Agents on Machine Learning Experimentation | Unknown | N/A | |
| How Universal Polynomial Bases Enhance Spectral Graph Neural Networks: Heterophily, Over-smoothing, and Over-squashing | Unknown | N/A | |
| Interaction-based Retrieval-augmented Diffusion Models for Protein-specific 3D Molecule Generation | Unknown | N/A | |
| Triadic-OCD: Asynchronous Online Change Detection with Provable Robustness, Optimality, and Convergence | Unknown | N/A | |
| An Embodied Generalist Agent in 3D World | Unknown | N/A | |
| Faster Adaptive Decentralized Learning Algorithms | Unknown | N/A | |
| Faster Sampling via Stochastic Gradient Proximal Sampler | Unknown | N/A | |
| Position: The Platonic Representation Hypothesis | Unknown | N/A | |
| Nash Incentive-compatible Online Mechanism Learning via Weakly Differentially Private Online Learning | Unknown | N/A | |
| Make-A-Shape: a Ten-Million-scale 3D Shape Model | Unknown | N/A | |
| Residual Quantization with Implicit Neural Codebooks | Unknown | N/A | |
| Theoretical Guarantees for Variational Inference with Fixed-Variance Mixture of Gaussians | Unknown | N/A | |
| Vanilla Bayesian Optimization Performs Great in High Dimensions | Unknown | N/A | |
| Fine-Grained Causal Dynamics Learning with Quantization for Improving Robustness in Reinforcement Learning | Unknown | N/A | |
| Adapting Pretrained ViTs with Convolution Injector for Visuo-Motor Control | Unknown | N/A | |
| Smooth Min-Max Monotonic Networks | Unknown | N/A | |
| SAMformer: Unlocking the Potential of Transformers in Time Series Forecasting with Sharpness-Aware Minimization and Channel-Wise Attention | Unknown | N/A | |
| Understanding the Learning Dynamics of Alignment with Human Feedback | Unknown | N/A | |
| Zero-Shot Reinforcement Learning via Function Encoders | Unknown | N/A | |
| PASOA- PArticle baSed Bayesian Optimal Adaptive design | Unknown | N/A | |
| Attribution-based Explanations that Provide Recourse Cannot be Robust | Unknown | N/A | |
| Learning to Reach Goals via Diffusion | Unknown | N/A | |
| Online Non-stochastic Control with Partial Feedback | Unknown | N/A | |
| Rethinking DP-SGD in Discrete Domain: Exploring Logistic Distribution in the Realm of signSGD | Unknown | N/A | |
| An Independence-promoting Loss for Music Generation with Language Models | Unknown | N/A | |
| Gradual Divergence for Seamless Adaptation: A Novel Domain Incremental Learning Method | Unknown | N/A | |
| Repeat After Me: Transformers are Better than State Space Models at Copying | Unknown | N/A | |
| Finite Volume Features, Global Geometry Representations, and Residual Training for Deep Learning-based CFD Simulation | Unknown | N/A | |
| ReLU to the Rescue: Improve Your On-Policy Actor-Critic with Positive Advantages | Unknown | N/A | |
| Advancing Dynamic Sparse Training by Exploring Optimization Opportunities | Unknown | N/A | |
| ACE: Off-Policy Actor-Critic with Causality-Aware Entropy Regularization | Unknown | N/A | |
| Towards Efficient Exact Optimization of Language Model Alignment | Unknown | N/A | |
| Seizing Serendipity: Exploiting the Value of Past Success in Off-Policy Actor-Critic | Unknown | N/A | |
| Discrete Latent Perspective Learning for Segmentation and Detection | Unknown | N/A | |
| Simulation-Based Inference with Quantile Regression | Unknown | N/A | |
| GeminiFusion: Efficient Pixel-wise Multimodal Fusion for Vision Transformer | Unknown | N/A | |
| Chain-of-Thought Predictive Control | Unknown | N/A | |
| NDOT: Neuronal Dynamics-based Online Training for Spiking Neural Networks | Unknown | N/A | |
| On the Origins of Linear Representations in Large Language Models | Unknown | N/A | |
| Federated Optimization with Doubly Regularized Drift Correction | Unknown | N/A | |
| Projection-Free Variance Reduction Methods for Stochastic Constrained Multi-Level Compositional Optimization | Unknown | N/A | |
| Generalized Neural Collapse for a Large Number of Classes | Unknown | N/A | |
| SuDA: Support-based Domain Adaptation for Sim2Real Hinge Joint Tracking with Flexible Sensors | Unknown | N/A | |
| Memory-Space Visual Prompting for Efficient Vision-Language Fine-Tuning | Unknown | N/A | |
| Homomorphism Counts for Graph Neural Networks: All About That Basis | Unknown | N/A | |
| What Will My Model Forget? Forecasting Forgotten Examples in Language Model Refinement | Unknown | N/A | |
| An Image is Worth Multiple Words: Discovering Object Level Concepts using Multi-Concept Prompt Learning | Unknown | N/A | |
| Language Models as Semantic Indexers | Unknown | N/A | |
| Position: What Can Large Language Models Tell Us about Time Series Analysis | Unknown | N/A | |
| FedSC: Provable Federated Self-supervised Learning with Spectral Contrastive Objective over Non-i.i.d. Data | Unknown | N/A | |
| Graph Generation with Diffusion Mixture | Unknown | N/A | |
| Experts Don't Cheat: Learning What You Don't Know By Predicting Pairs | Unknown | N/A | |
| IW-GAE: Importance weighted group accuracy estimation for improved calibration and model selection in unsupervised domain adaptation | Unknown | N/A | |
| Decoupling Feature Extraction and Classification Layers for Calibrated Neural Networks | Unknown | N/A | |
| Position: Benchmarking is Limited in Reinforcement Learning Research | Unknown | N/A | |
| Is Epistemic Uncertainty Faithfully Represented by Evidential Deep Learning Methods? | Unknown | N/A | |
| Unsupervised Episode Generation for Graph Meta-learning | Unknown | N/A | |
| Beyond the Calibration Point: Mechanism Comparison in Differential Privacy | Unknown | N/A | |
| Replicable Learning of Large-Margin Halfspaces | Unknown | N/A | |
| Tell, Don't Show: Language Guidance Eases Transfer Across Domains in Images and Videos | Unknown | N/A | |
| C-RAG: Certified Generation Risks for Retrieval-Augmented Language Models | Unknown | N/A | |
| Think Before You Act: Decision Transformers with Working Memory | Unknown | N/A | |
| Certifiably Byzantine-Robust Federated Conformal Prediction | Unknown | N/A | |
| Neural Tangent Kernels for Axis-Aligned Tree Ensembles | Unknown | N/A | |
| On the Generalization of Equivariant Graph Neural Networks | Unknown | N/A | |
| Challenges and Considerations in the Evaluation of Bayesian Causal Discovery | Unknown | N/A | |
| Progressive Inference: Explaining Decoder-Only Sequence Classification Models Using Intermediate Predictions | Unknown | N/A | |
| Active Adaptive Experimental Design for Treatment Effect Estimation with Covariate Choice | Unknown | N/A | |
| Pluvial Flood Emulation with Hydraulics-informed Message Passing | Unknown | N/A | |
| Accelerating Convergence in Bayesian Few-Shot Classification | Unknown | N/A | |
| An Improved Finite-time Analysis of Temporal Difference Learning with Deep Neural Networks | Unknown | N/A | |
| DUPLEX: Dual GAT for Complex Embedding of Directed Graphs | Unknown | N/A | |
| A Universal Transfer Theorem for Convex Optimization Algorithms Using Inexact First-order Oracles | Unknown | N/A | |
| Fair Classification with Partial Feedback: An Exploration-Based Data Collection Approach | Unknown | N/A | |
| Neural Tangent Kernels Motivate Cross-Covariance Graphs in Neural Networks | Unknown | N/A | |
| Breaking through the learning plateaus of in-context learning in Transformer | Unknown | N/A | |
| Tuning-Free Stochastic Optimization | Unknown | N/A | |
| Off-policy Evaluation Beyond Overlap: Sharp Partial Identification Under Smoothness | Unknown | N/A | |
| Can Machines Learn the True Probabilities? | Unknown | N/A | |
| Gaussian Plane-Wave Neural Operator for Electron Density Estimation | Unknown | N/A | |
| LayerMerge: Neural Network Depth Compression through Layer Pruning and Merging | Unknown | N/A | |
| CARTE: Pretraining and Transfer for Tabular Learning | Unknown | N/A | |
| Achieving Lossless Gradient Sparsification via Mapping to Alternative Space in Federated Learning | Unknown | N/A | |
| Active Label Correction for Semantic Segmentation with Foundation Models | Unknown | N/A | |
| ODIM: Outlier Detection via Likelihood of Under-Fitted Generative Models | Unknown | N/A | |
| Synergistic Integration of Coordinate Network and Tensorial Feature for Improving Neural Radiance Fields from Sparse Inputs | Unknown | N/A | |
| Learning to Explore for Stochastic Gradient MCMC | Unknown | N/A | |
| Improving Robustness to Multiple Spurious Correlations by Multi-Objective Optimization | Unknown | N/A | |
| Clustered Federated Learning via Gradient-based Partitioning | Unknown | N/A | |
| Demystifying SGD with Doubly Stochastic Gradients | Unknown | N/A | |
| Attribute Based Interpretable Evaluation Metrics for Generative Models | Unknown | N/A | |
| EquiAV: Leveraging Equivariance for Audio-Visual Contrastive Learning | Unknown | N/A | |
| Risk-Sensitive Policy Optimization via Predictive CVaR Policy Gradient | Unknown | N/A | |
| Translating Subgraphs to Nodes Makes Simple GNNs Strong and Efficient for Subgraph Representation Learning | Unknown | N/A | |
| Privacy-Preserving Embedding via Look-up Table Evaluation with Fully Homomorphic Encryption | Unknown | N/A | |
| Convex Relaxations of ReLU Neural Networks Approximate Global Optima in Polynomial Time | Unknown | N/A | |
| Polynomial-based Self-Attention for Table Representation Learning | Unknown | N/A | |
| Transformers Learn Nonlinear Features In Context: Nonconvex Mean-field Dynamics on the Attention Landscape | Unknown | N/A | |
| Discovering Features with Synergistic Interactions in Multiple Views | Unknown | N/A | |
| An Infinite-Width Analysis on the Jacobian-Regularised Training of a Neural Network | Unknown | N/A | |
| One Size Fits All for Semantic Shifts: Adaptive Prompt Tuning for Continual Learning | Unknown | N/A | |
| A Unified Linear Programming Framework for Offline Reward Learning from Human Demonstrations and Feedback | Unknown | N/A | |
| Context-Guided Diffusion for Out-of-Distribution Molecular and Protein Design | Unknown | N/A | |
| Universal Consistency of Wide and Deep ReLU Neural Networks and Minimax Optimal Convergence Rates for Kolmogorov-Donoho Optimal Function Classes | Unknown | N/A | |
| DistiLLM: Towards Streamlined Distillation for Large Language Models | Unknown | N/A | |
| Provably Scalable Black-Box Variational Inference with Structured Variational Families | Unknown | N/A | |
| Stochastic Conditional Diffusion Models for Robust Semantic Image Synthesis | Unknown | N/A | |
| Compression of Structured Data with Autoencoders: Provable Benefit of Nonlinearities and Depth | Unknown | N/A | |
| Estimating Barycenters of Distributions with Neural Optimal Transport | Unknown | N/A | |
| AdsorbDiff: Adsorbate Placement via Conditional Denoising Diffusion | Unknown | N/A | |
| On Convergence of Incremental Gradient for Non-convex Smooth Functions | Unknown | N/A | |
| Generalist Equivariant Transformer Towards 3D Molecular Interaction Learning | Unknown | N/A | |
| The Computational Complexity of Finding Second-Order Stationary Points | Unknown | N/A | |
| convSeq: Fast and Scalable Method for Detecting Patterns in Spike Data | Unknown | N/A | |
| A General Online Algorithm for Optimizing Complex Performance Metrics | Unknown | N/A | |
| CLLMs: Consistency Large Language Models | Unknown | N/A | |
| KISA: A Unified Keyframe Identifier and Skill Annotator for Long-Horizon Robotics Demonstrations | Unknown | N/A | |
| PcLast: Discovering Plannable Continuous Latent States | Unknown | N/A | |
| Sobolev Space Regularised Pre Density Models | Unknown | N/A | |
| Geometry-Aware Instrumental Variable Regression | Unknown | N/A | |
| Understanding the Effects of Iterative Prompting on Truthfulness | Unknown | N/A | |
| Mean Estimation in the Add-Remove Model of Differential Privacy | Unknown | N/A | |
| No Free Prune: Information-Theoretic Barriers to Pruning at Initialization | Unknown | N/A | |
| Collective Certified Robustness against Graph Injection Attacks | Unknown | N/A | |
| Privately Learning Smooth Distributions on the Hypercube by Projections | Unknown | N/A | |
| Single-Model Attribution of Generative Models Through Final-Layer Inversion | Unknown | N/A | |
| Towards Understanding Inductive Bias in Transformers: A View From Infinity | Unknown | N/A | |
| Modeling Caption Diversity in Contrastive Vision-Language Pretraining | Unknown | N/A | |
| Offline Inverse RL: New Solution Concepts and Provably Efficient Algorithms | Unknown | N/A | |
| Generalized Sobolev Transport for Probability Measures on a Graph | Unknown | N/A | |
| Robust Inverse Graphics via Probabilistic Inference | Unknown | N/A | |
| Knowledge Graphs Can be Learned with Just Intersection Features | Unknown | N/A | |
| Run-Time Task Composition with Safety Semantics | Unknown | N/A | |
| Chasing Convex Functions with Long-term Constraints | Unknown | N/A | |
| Stationary Latent Weight Inference for Unreliable Observations from Online Test-Time Adaptation | Unknown | N/A | |
| Slow and Steady Wins the Race: Maintaining Plasticity with Hare and Tortoise Networks | Unknown | N/A | |
| Fundamental Benefit of Alternating Updates in Minimax Optimization | Unknown | N/A | |
| DataFreeShield: Defending Adversarial Attacks without Training Data | Unknown | N/A | |
| SelMatch: Effectively Scaling Up Dataset Distillation via Selection-Based Initialization and Partial Updates by Trajectory Matching | Unknown | N/A | |
| Pausing Policy Learning in Non-stationary Reinforcement Learning | Unknown | N/A | |
| Feature Distribution on Graph Topology Mediates the Effect of Graph Convolution: Homophily Perspective | Unknown | N/A | |
| Drug Discovery with Dynamic Goal-aware Fragments | Unknown | N/A | |
| Supervised Matrix Factorization: Local Landscape Analysis and Applications | Unknown | N/A | |
| Defining Neural Network Architecture through Polytope Structures of Datasets | Unknown | N/A | |
| 3D Geometric Shape Assembly via Efficient Point Cloud Matching | Unknown | N/A | |
| StrWAEs to Invariant Representations | Unknown | N/A | |
| Binning as a Pretext Task: Improving Self-Supervised Learning in Tabular Domains | Unknown | N/A | |
| Training Greedy Policy for Proposal Batch Selection in Expensive Multi-Objective Combinatorial Optimization | Unknown | N/A | |
| Robust Optimization in Protein Fitness Landscapes Using Reinforcement Learning in Latent Space | Unknown | N/A | |
| Improving Instruction Following in Language Models through Proxy-Based Uncertainty Estimation | Unknown | N/A | |
| Improving Gradient-Guided Nested Sampling for Posterior Inference | Unknown | N/A | |
| Winner-takes-all learners are geometry-aware conditional density estimators | Unknown | N/A | |
| Eluder-based Regret for Stochastic Contextual MDPs | Unknown | N/A | |
| Convergence and Complexity Guarantee for Inexact First-order Riemannian Optimization Algorithms | Unknown | N/A | |
| DetKDS: Knowledge Distillation Search for Object Detectors | Unknown | N/A | |
| Purifying Quantization-conditioned Backdoors via Layer-wise Activation Correction with Distribution Approximation | Unknown | N/A | |
| Critical windows: non-asymptotic theory for feature emergence in diffusion models | Unknown | N/A | |
| Learning Causal Domain-Invariant Temporal Dynamics for Few-Shot Action Recognition | Unknown | N/A | |
| Completing Visual Objects via Bridging Generation and Segmentation | Unknown | N/A | |
| Evolving Subnetwork Training for Large Language Models | Unknown | N/A | |
| Data Poisoning Attacks against Conformal Prediction | Unknown | N/A | |
| ExCP: Extreme LLM Checkpoint Compression via Weight-Momentum Joint Shrinking | Unknown | N/A | |
| Two-sided Competing Matching Recommendation Markets With Quota and Complementary Preferences Constraints | Unknown | N/A | |
| Full-Atom Peptide Design based on Multi-modal Flow Matching | Unknown | N/A | |
| Positive and Unlabeled Learning with Controlled Probability Boundary Fence | Unknown | N/A | |
| FightLadder: A Benchmark for Competitive Multi-Agent Reinforcement Learning | Unknown | N/A | |
| Debiased Distribution Compression | Unknown | N/A | |
| RL-CFR: Improving Action Abstraction for Imperfect Information Extensive-Form Games with Reinforcement Learning | Unknown | N/A | |
| Vague Prototype-Oriented Diffusion Model for Multi-Class Anomaly Detection | Unknown | N/A | |
| Graph Structure Extrapolation for Out-of-Distribution Generalization | Unknown | N/A | |
| Value-Evolutionary-Based Reinforcement Learning | Unknown | N/A | |
| Image Clustering with External Guidance | Unknown | N/A | |
| VisionGraph: Leveraging Large Multimodal Models for Graph Theory Problems in Visual Context | Unknown | N/A | |
| Accelerating Convergence of Score-Based Diffusion Models, Provably | Unknown | N/A | |
| Q-Probe: A Lightweight Approach to Reward Maximization for Language Models | Unknown | N/A | |
| Promises and Pitfalls of Generative Masked Language Modeling: Theoretical Framework and Practical Guidelines | Unknown | N/A | |
| Neural Collapse in Multi-label Learning with Pick-all-label Loss | Unknown | N/A | |
| A Differentiable Partially Observable Generalized Linear Model with Forward-Backward Message Passing | Unknown | N/A | |
| Multi-Region Markovian Gaussian Process: An Efficient Method to Discover Directional Communications Across Multiple Brain Regions | Unknown | N/A | |
| A Generative Approach for Treatment Effect Estimation under Collider Bias: From an Out-of-Distribution Perspective | Unknown | N/A | |
| Learning Shadow Variable Representation for Treatment Effect Estimation under Collider Bias | Unknown | N/A | |
| Configurable Mirror Descent: Towards a Unification of Decision Making | Unknown | N/A | |
| Enhancing Class-Imbalanced Learning with Pre-Trained Guidance through Class-Conditional Knowledge Distillation | Unknown | N/A | |
| A Neural-Guided Dynamic Symbolic Network for Exploring Mathematical Expressions from Data | Unknown | N/A | |
| Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic Segmentation | Unknown | N/A | |
| Preventing Model Collapse in Gaussian Process Latent Variable Models | Unknown | N/A | |
| Measuring Stochastic Data Complexity with Boltzmann Influence Functions | Unknown | N/A | |
| Concentration Inequalities for General Functions of Heavy-Tailed Random Variables | Unknown | N/A | |
| Sparse Cocktail: Every Sparse Pattern Every Sparse Ratio All At Once | Unknown | N/A | |
| Is Temperature Sample Efficient for Softmax Gaussian Mixture of Experts? | Unknown | N/A | |
| Learning Adaptive and View-Invariant Vision Transformer for Real-Time UAV Tracking | Unknown | N/A | |
| PID: Prompt-Independent Data Protection Against Latent Diffusion Models | Unknown | N/A | |
| A Contextual Combinatorial Bandit Approach to Negotiation | Unknown | N/A | |
| DiffStitch: Boosting Offline Reinforcement Learning with Diffusion-based Trajectory Stitching | Unknown | N/A | |
| Privacy Preserving Adaptive Experiment Design | Unknown | N/A | |
| Combining Experimental and Historical Data for Policy Evaluation | Unknown | N/A | |
| LoRAP: Transformer Sub-Layers Deserve Differentiated Structured Compression for Large Language Models | Unknown | N/A | |
| DiffFPR: Diffusion Prior for Oversampled Fourier Phase Retrieval | Unknown | N/A | |
| Harnessing Neural Unit Dynamics for Effective and Scalable Class-Incremental Learning | Unknown | N/A | |
| Compress Clean Signal from Noisy Raw Image: A Self-Supervised Approach | Unknown | N/A | |
| VQDNA: Unleashing the Power of Vector Quantization for Multi-Species Genomic Sequence Modeling | Unknown | N/A | |
| Emergent Representations of Program Semantics in Language Models Trained on Programs | Unknown | N/A | |
| OODRobustBench: a Benchmark and Large-Scale Analysis of Adversarial Robustness under Distribution Shift | Unknown | N/A | |
| Improved Bounds for Pure Private Agnostic Learning: Item-Level and User-Level Privacy | Unknown | N/A | |
| $\bf{\Phi}_\textrm{Flow}$: Differentiable Simulations for PyTorch, TensorFlow and Jax | Unknown | N/A | |
| The Good, The Bad, and Why: Unveiling Emotions in Generative AI | Unknown | N/A | |
| Two-Stage Shadow Inclusion Estimation: An IV Approach for Causal Inference under Latent Confounding and Collider Bias | Unknown | N/A | |
| FlashST: A Simple and Universal Prompt-Tuning Framework for Traffic Prediction | Unknown | N/A | |
| Towards efficient deep spiking neural networks construction with spiking activity based pruning | Unknown | N/A | |
| FedBAT: Communication-Efficient Federated Learning via Learnable Binarization | Unknown | N/A | |
| Beyond Point Prediction: Score Matching-based Pseudolikelihood Estimation of Neural Marked Spatio-Temporal Point Process | Unknown | N/A | |
| Statistical Properties of Robust Satisficing | Unknown | N/A | |
| ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language Models | Unknown | N/A | |
| IIANet: An Intra- and Inter-Modality Attention Network for Audio-Visual Speech Separation | Unknown | N/A | |
| KernelWarehouse: Rethinking the Design of Dynamic Convolution | Unknown | N/A | |
| GeoReasoner: Geo-localization with Reasoning in Street Views using a Large Vision-Language Model | Unknown | N/A | |
| Learning the Uncertainty Sets of Linear Control Systems via Set Membership: A Non-asymptotic Analysis | Unknown | N/A | |
| Seesaw: Compensating for Nonlinear Reduction with Linear Computations for Private Inference | Unknown | N/A | |
| Predicting and Interpreting Energy Barriers of Metallic Glasses with Graph Neural Networks | Unknown | N/A | |
| From Fourier to Neural ODEs: Flow Matching for Modeling Complex Systems | Unknown | N/A | |
| EvoRainbow: Combining Improvements in Evolutionary Reinforcement Learning for Policy Search | Unknown | N/A | |
| Algorithmic Stability Unleashed: Generalization Bounds with Unbounded Losses | Unknown | N/A | |
| Receptive Fields As Experts in Convolutional Neural Architectures | Unknown | N/A | |
| Diving into Underwater: Segment Anything Model Guided Underwater Salient Instance Segmentation and A Large-scale Dataset | Unknown | N/A | |
| Graph External Attention Enhanced Transformer | Unknown | N/A | |
| Single-Trajectory Distributionally Robust Reinforcement Learning | Unknown | N/A | |
| Realistic Unsupervised CLIP Fine-tuning with Universal Entropy Optimization | Unknown | N/A | |
| On the Error-Propagation of Inexact Hotelling's Deflation for Principal Component Analysis | Unknown | N/A | |
| Bootstrapping Fisher Market Equilibrium and First-Price Pacing Equilibrium | Unknown | N/A | |
| Graph Geometry-Preserving Autoencoders | Unknown | N/A | |
| Momentum Particle Maximum Likelihood | Unknown | N/A | |
| An Effective Dynamic Gradient Calibration Method for Continual Learning | Unknown | N/A | |
| Revisiting the Role of Language Priors in Vision-Language Models | Unknown | N/A | |
| Non-confusing Generation of Customized Concepts in Diffusion Models | Unknown | N/A | |
| Structured Inverse-Free Natural Gradient Descent: Memory-Efficient & Numerically-Stable KFAC | Unknown | N/A | |
| Robustness of Deep Learning for Accelerated MRI: Benefits of Diverse Training Data | Unknown | N/A | |
| Equivariance via Minimal Frame Averaging for More Symmetries and Efficiency | Unknown | N/A | |
| Graph-enhanced Large Language Models in Asynchronous Plan Reasoning | Unknown | N/A | |
| HGAP: Boosting Permutation Invariant and Permutation Equivariant in Multi-Agent Reinforcement Learning via Graph Attention Network | Unknown | N/A | |
| Parsimonious Learning-Augmented Approximations for Dense Instances of $\mathcal{NP}$-hard Problems | Unknown | N/A | |
| SparseTSF: Modeling Long-term Time Series Forecasting with 1k Parameters | Unknown | N/A | |
| On Hypothesis Transfer Learning of Functional Linear Models | Unknown | N/A | |
| GeoAB: Towards Realistic Antibody Design and Reliable Affinity Maturation | Unknown | N/A | |
| A Single-Loop Robust Policy Gradient Method for Robust Markov Decision Processes | Unknown | N/A | |
| A General Theory for Softmax Gating Multinomial Logistic Mixture of Experts | Unknown | N/A | |
| Layer-Aware Analysis of Catastrophic Overfitting: Revealing the Pseudo-Robust Shortcut Dependency | Unknown | N/A | |
| Autonomous Sparse Mean-CVaR Portfolio Optimization | Unknown | N/A | |
| More Flexible PAC-Bayesian Meta-Learning by Learning Learning Algorithms | Unknown | N/A | |
| Graph Neural Stochastic Diffusion for Estimating Uncertainty in Node Classification | Unknown | N/A | |
| PPFLOW: Target-Aware Peptide Design with Torsional Flow Matching | Unknown | N/A | |
| Lie Neurons: Adjoint-Equivariant Neural Networks for Semisimple Lie Algebras | Unknown | N/A | |
| Position: A Call to Action for a Human-Centered AutoML Paradigm | Unknown | N/A | |
| Beyond Regular Grids: Fourier-Based Neural Operators on Arbitrary Domains | Unknown | N/A | |
| Scaling Tractable Probabilistic Circuits: A Systems Perspective | Unknown | N/A | |
| Graph Distillation with Eigenbasis Matching | Unknown | N/A | |
| Adaptive Text Watermark for Large Language Models | Unknown | N/A | |
| Graph Adversarial Diffusion Convolution | Unknown | N/A | |
| Zeroth-Order Methods for Constrained Nonconvex Nonsmooth Stochastic Optimization | Unknown | N/A | |
| ESNet: Evolution and Succession Network for High-Resolution Salient Object Detection | Unknown | N/A | |
| Unifying Image Processing as Visual Prompting Question Answering | Unknown | N/A | |
| The Pitfalls and Promise of Conformal Inference Under Adversarial Attacks | Unknown | N/A | |
| Preference Optimization for Molecule Synthesis with Conditional Residual Energy-based Models | Unknown | N/A | |
| Decoding-time Realignment of Language Models | Unknown | N/A | |
| DIDI: Diffusion-Guided Diversity for Offline Behavioral Generation | Unknown | N/A | |
| Bidirectional Reciprocative Information Communication for Few-Shot Semantic Segmentation | Unknown | N/A | |
| Unlock the Cognitive Generalization of Deep Reinforcement Learning via Granular Ball Representation | Unknown | N/A | |
| PAPM: A Physics-aware Proxy Model for Process Systems | Unknown | N/A | |
| ELTA: An Enhancer against Long-Tail for Aesthetics-oriented Models | Unknown | N/A | |
| On the Feasibility of Single-Pass Full-Capacity Learning in Linear Threshold Neurons with Binary Input Vectors | Unknown | N/A | |
| Online Speculative Decoding | Unknown | N/A | |
| Tuning-free Estimation and Inference of Cumulative Distribution Function under Local Differential Privacy | Unknown | N/A | |
| Referee Can Play: An Alternative Approach to Conditional Generation via Model Inversion | Unknown | N/A | |
| Reason for Future, Act for Now: A Principled Architecture for Autonomous LLM Agents | Unknown | N/A | |
| Stereo Risk: A Continuous Modeling Approach to Stereo Matching | Unknown | N/A | |
| Multi-Source Conformal Inference Under Distribution Shift | Unknown | N/A | |
| From Generalization Analysis to Optimization Designs for State Space Models | Unknown | N/A | |
| Short-Long Convolutions Help Hardware-Efficient Linear Attention to Focus on Long Sequences | Unknown | N/A | |
| Convergence of Online Learning Algorithm for a Mixture of Multiple Linear Regressions | Unknown | N/A | |
| Energy-Guided Diffusion Sampling for Offline-to-Online Reinforcement Learning | Unknown | N/A | |
| Moreau Envelope for Nonconvex Bi-Level Optimization: A Single-Loop and Hessian-Free Solution Strategy | Unknown | N/A | |
| Position: Foundation Agents as the Paradigm Shift for Decision Making | Unknown | N/A | |
| Amortized Equation Discovery in Hybrid Dynamical Systems | Unknown | N/A | |
| Floating Anchor Diffusion Model for Multi-motif Scaffolding | Unknown | N/A | |
| Class-Imbalanced Graph Learning without Class Rebalancing | Unknown | N/A | |
| Generative Marginalization Models | Unknown | N/A | |
| Federated Representation Learning in the Under-Parameterized Regime | Unknown | N/A | |
| Causal Discovery via Conditional Independence Testing with Proxy Variables | Unknown | N/A | |
| Perfect Alignment May be Poisonous to Graph Contrastive Learning | Unknown | N/A | |
| Zero-Shot ECG Classification with Multimodal Learning and Test-time Clinical Knowledge Enhancement | Unknown | N/A | |
| Symmetric Matrix Completion with ReLU Sampling | Unknown | N/A | |
| DNA-SE: Towards Deep Neural-Nets Assisted Semiparametric Estimation | Unknown | N/A | |
| High-Probability Bound for Non-Smooth Non-Convex Stochastic Optimization with Heavy Tails | Unknown | N/A | |
| Language-Driven Cross-Modal Classifier for Zero-Shot Multi-Label Image Recognition | Unknown | N/A | |
| Geometry-Calibrated DRO: Combating Over-Pessimism with Free Energy Implications | Unknown | N/A | |
| Correlation-Induced Label Prior for Semi-Supervised Multi-Label Learning | Unknown | N/A | |
| Causality Based Front-door Defense Against Backdoor Attack on Language Models | Unknown | N/A | |
| Partial Multi-View Multi-Label Classification via Semantic Invariance Learning and Prototype Modeling | Unknown | N/A | |
| Building Socially-Equitable Public Models | Unknown | N/A | |
| KIVI: A Tuning-Free Asymmetric 2bit Quantization for KV Cache | Unknown | N/A | |
| Timer: Generative Pre-trained Transformers Are Large Time Series Models | Unknown | N/A | |
| On the Last-Iterate Convergence of Shuffling Gradient Methods | Unknown | N/A | |
| Pairwise Alignment Improves Graph Domain Adaptation | Unknown | N/A | |
| Neural Operators with Localized Integral and Differential Kernels | Unknown | N/A | |
| A Tensor Decomposition Perspective on Second-order RNNs | Unknown | N/A | |
| Restoring balance: principled under/oversampling of data for optimal classification | Unknown | N/A | |
| Reparameterized Importance Sampling for Robust Variational Bayesian Neural Networks | Unknown | N/A | |
| Attention Meets Post-hoc Interpretability: A Mathematical Perspective | Unknown | N/A | |
| Non-Vacuous Generalization Bounds for Large Language Models | Unknown | N/A | |
| Discrete Diffusion Modeling by Estimating the Ratios of the Data Distribution | Unknown | N/A | |
| Optimal Differentially Private Model Training with Public Data | Unknown | N/A | |
| How to Make the Gradients Small Privately: Improved Rates for Differentially Private Non-Convex Optimization | Unknown | N/A | |
| Beyond Sole Strength: Customized Ensembles for Generalized Vision-Language Models | Unknown | N/A | |
| HumanTOMATO: Text-aligned Whole-body Motion Generation | Unknown | N/A | |
| Characterizing Overfitting in Kernel Ridgeless Regression Through the Eigenspectrum | Unknown | N/A | |
| Position: Exploring the Robustness of Pipeline-Parallelism-Based Decentralized Training | Unknown | N/A | |
| CATS: Enhancing Multivariate Time Series Forecasting by Constructing Auxiliary Time Series as Exogenous Variables | Unknown | N/A | |
| WebLINX: Real-World Website Navigation with Multi-Turn Dialogue | Unknown | N/A | |
| EiG-Search: Generating Edge-Induced Subgraphs for GNN Explanation in Linear Time | Unknown | N/A | |
| CauDiTS: Causal Disentangled Domain Adaptation of Multivariate Time Series | Unknown | N/A | |
| FiT: Flexible Vision Transformer for Diffusion Model | Unknown | N/A | |
| Probabilistic Routing for Graph-Based Approximate Nearest Neighbor Search | Unknown | N/A | |
| OxyGenerator: Reconstructing Global Ocean Deoxygenation Over a Century with Deep Learning | Unknown | N/A | |
| SPP: Sparsity-Preserved Parameter-Efficient Fine-Tuning for Large Language Models | Unknown | N/A | |
| Unveiling the Cycloid Trajectory of EM Iterations in Mixed Linear Regression | Unknown | N/A | |
| OMPO: A Unified Framework for RL under Policy and Dynamics Shifts | Unknown | N/A | |
| OLLIE: Imitation Learning from Offline Pretraining to Online Finetuning | Unknown | N/A | |
| Offline-Boosted Actor-Critic: Adaptively Blending Optimal Historical Behaviors in Deep Off-Policy RL | Unknown | N/A | |
| Potential Based Diffusion Motion Planning | Unknown | N/A | |
| Cluster-Aware Similarity Diffusion for Instance Retrieval | Unknown | N/A | |
| RoboMP$^2$: A Robotic Multimodal Perception-Planning Framework with Multimodal Large Language Models | Unknown | N/A | |
| Contamination-Resilient Anomaly Detection via Adversarial Learning on Partially-Observed Normal and Anomalous Data | Unknown | N/A | |
| Coarse-to-Fine Highlighting: Reducing Knowledge Hallucination in Large Language Models | Unknown | N/A | |
| Efficient and Effective Time-Series Forecasting with Spiking Neural Networks | Unknown | N/A | |
| Cross-Domain Policy Adaptation by Capturing Representation Mismatch | Unknown | N/A | |
| Sampling is as easy as keeping the consistency: convergence guarantee for Consistency Models | Unknown | N/A | |
| Parameter Efficient Quasi-Orthogonal Fine-Tuning via Givens Rotation | Unknown | N/A | |
| Rethinking Decision Transformer via Hierarchical Reinforcement Learning | Unknown | N/A | |
| Better Locally Private Sparse Estimation Given Multiple Samples Per User | Unknown | N/A | |
| Outlier-aware Slicing for Post-Training Quantization in Vision Transformer | Unknown | N/A | |
| X-Oscar: A Progressive Framework for High-quality Text-guided 3D Animatable Avatar Generation | Unknown | N/A | |
| Neighboring Perturbations of Knowledge Editing on Large Language Models | Unknown | N/A | |
| HarmonyDream: Task Harmonization Inside World Models | Unknown | N/A | |
| High-dimensional Linear Bandits with Knapsacks | Unknown | N/A | |
| SyCoCa: Symmetrizing Contrastive Captioners with Attentive Masking for Multimodal Alignment | Unknown | N/A | |
| Correcting Diffusion-Based Perceptual Image Compression with Privileged End-to-End Decoder | Unknown | N/A | |
| A Provable Decision Rule for Out-of-Distribution Detection | Unknown | N/A | |
| Faithfulness Measurable Masked Language Models | Unknown | N/A | |
| On the Hardness of Probabilistic Neurosymbolic Learning | Unknown | N/A | |
| Split-and-Denoise: Protect large language model inference with local differential privacy | Unknown | N/A | |
| tinyBenchmarks: evaluating LLMs with fewer examples | Unknown | N/A | |
| SCoRe: Submodular Combinatorial Representation Learning | Unknown | N/A | |
| LASER: Linear Compression in Wireless Distributed Optimization | Unknown | N/A | |
| Entropy-Reinforced Planning with Large Language Models for Drug Discovery | Unknown | N/A | |
| Auto-Regressive Next-Token Predictors are Universal Learners | Unknown | N/A | |
| Self-Composing Policies for Scalable Continual Reinforcement Learning | Unknown | N/A | |
| Tilting the Odds at the Lottery: the Interplay of Overparameterisation and Curricula in Neural Networks | Unknown | N/A | |
| Submodular framework for structured-sparse optimal transport | Unknown | N/A | |
| Large Language Models are Geographically Biased | Unknown | N/A | |
| Position: Graph Foundation Models Are Already Here | Unknown | N/A | |
| Towards General Neural Surrogate Solvers with Specialized Neural Accelerators | Unknown | N/A | |
| $H$-Consistency Guarantees for Regression | Unknown | N/A | |
| Regression with Multi-Expert Deferral | Unknown | N/A | |
| No-Regret Reinforcement Learning in Smooth MDPs | Unknown | N/A | |
| Keep the Momentum: Conservation Laws beyond Euclidean Gradient Flows | Unknown | N/A | |
| Graph-based Forecasting with Missing Data through Spatiotemporal Downsampling | Unknown | N/A | |
| Convergence and Trade-Offs in Riemannian Gradient Descent and Riemannian Proximal Point | Unknown | N/A | |
| Using AI Uncertainty Quantification to Improve Human Decision-Making | Unknown | N/A | |
| On the Tractability of SHAP Explanations under Markovian Distributions | Unknown | N/A | |
| On the Consistency of Kernel Methods with Dependent Observations | Unknown | N/A | |
| Delving into Differentially Private Transformer | Unknown | N/A | |
| Deep Fusion: Efficient Network Training via Pre-trained Initializations | Unknown | N/A | |
| Roping in Uncertainty: Robustness and Regularization in Markov Games | Unknown | N/A | |
| IM-3D: Iterative Multiview Diffusion and Reconstruction for High-Quality 3D Generation | Unknown | N/A | |
| O$n$ Learning Deep O($n$)-Equivariant Hyperspheres | Unknown | N/A | |
| Position: Tensor Networks are a Valuable Asset for Green AI | Unknown | N/A | |
| OSSCAR: One-Shot Structured Pruning in Vision and Language Models with Combinatorial Optimization | Unknown | N/A | |
| Physics-Informed Neural Network Policy Iteration: Algorithms, Convergence, and Verification | Unknown | N/A | |
| The Illusion of State in State-Space Models | Unknown | N/A | |
| Superposition Prompting: Improving and Accelerating Retrieval-Augmented Generation | Unknown | N/A | |
| Locality-Sensitive Hashing-Based Efficient Point Transformer with Applications in High-Energy Physics | Unknown | N/A | |
| Rethinking Independent Cross-Entropy Loss For Graph-Structured Data | Unknown | N/A | |
| Rethinking Momentum Knowledge Distillation in Online Continual Learning | Unknown | N/A | |
| Efficient World Models with Context-Aware Tokenization | Unknown | N/A | |
| CLIPZyme: Reaction-Conditioned Virtual Screening of Enzymes | Unknown | N/A | |
| Can Implicit Bias Imply Adversarial Robustness? | Unknown | N/A | |
| Understanding Retrieval-Augmented Task Adaptation for Vision-Language Models | Unknown | N/A | |
| RODEO: Robust Outlier Detection via Exposing Adaptive Out-of-Distribution Samples | Unknown | N/A | |
| Prodigy: An Expeditiously Adaptive Parameter-Free Learner | Unknown | N/A | |
| From Inverse Optimization to Feasibility to ERM | Unknown | N/A | |
| Provable Interactive Learning with Hindsight Instruction Feedback | Unknown | N/A | |
| TERD: A Unified Framework for Safeguarding Diffusion Models Against Backdoors | Unknown | N/A | |
| Straight-Through Meets Sparse Recovery: the Support Exploration Algorithm | Unknown | N/A | |
| OAK: Enriching Document Representations using Auxiliary Knowledge for Extreme Classification | Unknown | N/A | |
| Language Models with Conformal Factuality Guarantees | Unknown | N/A | |
| Finding NEM-U: Explaining unsupervised representation learning through neural network generated explanation masks | Unknown | N/A | |
| Slot Abstractors: Toward Scalable Abstract Visual Reasoning | Unknown | N/A | |
| A Theory of Non-Linear Feature Learning with One Gradient Step in Two-Layer Neural Networks | Unknown | N/A | |
| Learning Optimal Deterministic Policies with Stochastic Policy Gradients | Unknown | N/A | |
| Causal Representation Learning Made Identifiable by Grouping of Observational Variables | Unknown | N/A | |
| Position: Levels of AGI for Operationalizing Progress on the Path to AGI | Unknown | N/A | |
| Using Uncertainty Quantification to Characterize and Improve Out-of-Domain Learning for PDEs | Unknown | N/A | |
| SiBBlInGS: Similarity-driven Building-Block Inference using Graphs across States | Unknown | N/A | |
| Truly No-Regret Learning in Constrained MDPs | Unknown | N/A | |
| Optimal bounds for $\ell_p$ sensitivity sampling via $\ell_2$ augmentation | Unknown | N/A | |
| Turnstile $\ell_p$ leverage score sampling with applications | Unknown | N/A | |
| BAGEL: Bootstrapping Agents by Guiding Exploration with Language | Unknown | N/A | |
| Factored-Reward Bandits with Intermediate Observations | Unknown | N/A | |
| Best Arm Identification for Stochastic Rising Bandits | Unknown | N/A | |
| Test-Time Regret Minimization in Meta Reinforcement Learning | Unknown | N/A | |
| Learning in Deep Factor Graphs with Gaussian Belief Propagation | Unknown | N/A | |
| PairNet: Training with Observed Pairs to Estimate Individual Treatment Effect | Unknown | N/A | |
| Density Ratio Estimation with Doubly Strong Robustness | Unknown | N/A | |
| Equivariant Deep Weight Space Alignment | Unknown | N/A | |
| Quality-Weighted Vendi Scores And Their Application To Diverse Experimental Design | Unknown | N/A | |
| On Least Square Estimation in Softmax Gating Mixture of Experts | Unknown | N/A | |
| PIDformer: Transformer Meets Control Theory | Unknown | N/A | |
| Differentially private exact recovery for stochastic block models | Unknown | N/A | |
| Novel Spectral Algorithms for the Partial Credit Model | Unknown | N/A | |
| Sliced Wasserstein with Random-Path Projecting Directions | Unknown | N/A | |
| Risk-Sensitive Reward-Free Reinforcement Learning with CVaR | Unknown | N/A | |
| How Transformers Learn Causal Structure with Gradient Descent | Unknown | N/A | |
| Understanding the Impact of Introducing Constraints at Inference Time on Generalization Error | Unknown | N/A | |
| Test-Time Model Adaptation with Only Forward Passes | Unknown | N/A | |
| Latent Optimal Paths by Gumbel Propagation for Variational Bayesian Dynamic Programming | Unknown | N/A | |
| RNAFlow: RNA Structure & Sequence Design via Inverse Folding-Based Flow Matching | Unknown | N/A | |
| $f$-Divergence Based Classification: Beyond the Use of Cross-Entropy | Unknown | N/A | |
| In value-based deep reinforcement learning, a pruned network is a good network | Unknown | N/A | |
| Mixtures of Experts Unlock Parameter Scaling for Deep RL | Unknown | N/A | |
| The Perception-Robustness Tradeoff in Deterministic Image Restoration | Unknown | N/A | |
| Linear Explanations for Individual Neurons | Unknown | N/A | |
| Adaptive Proximal Gradient Methods Are Universal Without Approximation | Unknown | N/A | |
| Fair Resource Allocation in Multi-Task Learning | Unknown | N/A | |
| Do Language Models Exhibit the Same Cognitive Biases in Problem Solving as Human Learners? | Unknown | N/A | |
| Deep Stochastic Mechanics | Unknown | N/A | |
| Variational Linearized Laplace Approximation for Bayesian Deep Learning | Unknown | N/A | |
| Differentiable Mapper for Topological Optimization of Data Representation | Unknown | N/A | |
| Structured Chemistry Reasoning with Large Language Models | Unknown | N/A | |
| MADA: Meta-Adaptive Optimizers Through Hyper-Gradient Descent | Unknown | N/A | |
| Implicit Representations via Operator Learning | Unknown | N/A | |
| Bayesian Program Learning by Decompiling Amortized Knowledge | Unknown | N/A | |
| $S^2$IP-LLM: Semantic Space Informed Prompt Learning with LLM for Time Series Forecasting | Unknown | N/A | |
| Feedback Loops With Language Models Drive In-Context Reward Hacking | Unknown | N/A | |
| Stability and Generalization for Stochastic Recursive Momentum-based Algorithms for (Strongly-)Convex One to $K$-Level Stochastic Optimizations | Unknown | N/A | |
| RMIB: Representation Matching Information Bottleneck for Matching Text Representations | Unknown | N/A | |
| Auto-Encoding Morph-Tokens for Multimodal LLM | Unknown | N/A | |
| A New Linear Scaling Rule for Private Adaptive Hyperparameter Optimization | Unknown | N/A | |
| Self-Alignment of Large Language Models via Monopolylogue-based Social Scene Simulation | Unknown | N/A | |
| Trainable Transformer in Transformer | Unknown | N/A | |
| Position: Topological Deep Learning is the New Frontier for Relational Learning | Unknown | N/A | |
| Position: Bayesian Deep Learning is Needed in the Age of Large-Scale AI | Unknown | N/A | |
| The Max-Min Formulation of Multi-Objective Reinforcement Learning: From Theory to a Model-Free Algorithm | Unknown | N/A | |
| The Linear Representation Hypothesis and the Geometry of Large Language Models | Unknown | N/A | |
| Mean-field Chaos Diffusion Models | Unknown | N/A | |
| Foundation Policies with Hilbert Representations | Unknown | N/A | |
| SignSGD with Federated Defense: Harnessing Adversarial Attacks through Gradient Sign Decoding | Unknown | N/A | |
| BOtied: Multi-objective Bayesian optimization with tied multivariate ranks | Unknown | N/A | |
| State-Free Inference of State-Space Models: The Transfer Function Approach | Unknown | N/A | |
| Variational Inference with Coverage Guarantees in Simulation-Based Inference | Unknown | N/A | |
| Optimal Ridge Regularization for Out-of-Distribution Prediction | Unknown | N/A | |
| LPGD: A General Framework for Backpropagation through Embedded Optimization Layers | Unknown | N/A | |
| Learning to Stabilize Online Reinforcement Learning in Unbounded State Spaces | Unknown | N/A | |
| Graph Automorphism Group Equivariant Neural Networks | Unknown | N/A | |
| BetterV: Controlled Verilog Generation with Discriminative Guidance | Unknown | N/A | |
| Bias of Stochastic Gradient Descent or the Architecture: Disentangling the Effects of Overparameterization of Neural Networks | Unknown | N/A | |
| Knowledge Distillation with Auxiliary Variable | Unknown | N/A | |
| UPAM: Unified Prompt Attack in Text-to-Image Generation Models Against Both Textual Filters and Visual Checkers | Unknown | N/A | |
| Pragmatic Feature Preferences: Learning Reward-Relevant Preferences from Human Input | Unknown | N/A | |
| UPOCR: Towards Unified Pixel-Level OCR Interface | Unknown | N/A | |
| FedCal: Achieving Local and Global Calibration in Federated Learning via Aggregated Parameterized Scaler | Unknown | N/A | |
| Improving Diffusion Models for Inverse Problems Using Optimal Posterior Covariance | Unknown | N/A | |
| A Subquadratic Time Algorithm for Robust Sparse Mean Estimation | Unknown | N/A | |
| Solving Hierarchical Information-Sharing Dec-POMDPs: An Extensive-Form Game Approach | Unknown | N/A | |
| The Relative Value of Prediction in Algorithmic Decision Making | Unknown | N/A | |
| Interpreting and Improving Diffusion Models from an Optimization Perspective | Unknown | N/A | |
| Mechanistic Neural Networks for Scientific Machine Learning | Unknown | N/A | |
| Bayesian Regret Minimization in Offline Bandits | Unknown | N/A | |
| Prompting a Pretrained Transformer Can Be a Universal Approximator | Unknown | N/A | |
| Transport of Algebraic Structure to Latent Embeddings | Unknown | N/A | |
| Cross-view Masked Diffusion Transformers for Person Image Synthesis | Unknown | N/A | |
| Detecting Influence Structures in Multi-Agent Reinforcement Learning | Unknown | N/A | |
| Contrasting Multiple Representations with the Multi-Marginal Matching Gap | Unknown | N/A | |
| Adaptive Conformal Inference by Betting | Unknown | N/A | |
| Mechanistic Design and Scaling of Hybrid Architectures | Unknown | N/A | |
| Robust Data-driven Prescriptiveness Optimization | Unknown | N/A | |
| Learning Multiple Secrets in Mastermind | Unknown | N/A | |
| The Entropy Enigma: Success and Failure of Entropy Minimization | Unknown | N/A | |
| Efficient Exploration in Average-Reward Constrained Reinforcement Learning: Achieving Near-Optimal Regret With Posterior Sampling | Unknown | N/A | |
| Learning-Efficient Yet Generalizable Collaborative Filtering for Item Recommendation | Unknown | N/A | |
| Unsupervised Domain Adaptation for Anatomical Structure Detection in Ultrasound Images | Unknown | N/A | |
| Learning to Remove Cuts in Integer Linear Programming | Unknown | N/A | |
| Conformalized Survival Distributions: A Generic Post-Process to Increase Calibration | Unknown | N/A | |
| ByMI: Byzantine Machine Identification with False Discovery Rate Control | Unknown | N/A | |
| Efficient Non-stationary Online Learning by Wavelets with Applications to Online Distribution Shift Adaptation | Unknown | N/A | |
| Near-Optimal Reinforcement Learning with Self-Play under Adaptivity Constraints | Unknown | N/A | |
| ULAREF: A Unified Label Refinement Framework for Learning with Inaccurate Supervision | Unknown | N/A | |
| Federated Full-Parameter Tuning of Billion-Sized Language Models with Communication Cost under 18 Kilobytes | Unknown | N/A | |
| Accurate LoRA-Finetuning Quantization of LLMs via Information Retention | Unknown | N/A | |
| Various Lengths, Constant Speed: Efficient Language Modeling with Lightning Attention | Unknown | N/A | |
| Feasible Reachable Policy Iteration | Unknown | N/A | |
| Learning High-Order Relationships of Brain Regions | Unknown | N/A | |
| To Cool or not to Cool? Temperature Network Meets Large Foundation Models via DRO | Unknown | N/A | |
| Transferring Knowledge From Large Foundation Models to Small Downstream Models | Unknown | N/A | |
| Compute Better Spent: Replacing Dense Layers with Structured Matrices | Unknown | N/A | |
| MolCRAFT: Structure-Based Drug Design in Continuous Parameter Space | Unknown | N/A | |
| Connect Later: Improving Fine-tuning for Robustness with Targeted Augmentations | Unknown | N/A | |
| Learning Constraints from Offline Demonstrations via Superior Distribution Correction Estimation | Unknown | N/A | |
| Multiply-Robust Causal Change Attribution | Unknown | N/A | |
| Decomposable Submodular Maximization in Federated Setting | Unknown | N/A | |
| Subsampling is not Magic: Why Large Batch Sizes Work for Differentially Private Stochastic Optimisation | Unknown | N/A | |
| STEER: Assessing the Economic Rationality of Large Language Models | Unknown | N/A | |
| Compositional Capabilities of Autoregressive Transformers: A Study on Synthetic, Interpretable Tasks | Unknown | N/A | |
| Position: The Reasonable Person Standard for AI | Unknown | N/A | |
| Unveiling Privacy, Memorization, and Input Curvature Links | Unknown | N/A | |
| Dissecting Multimodality in VideoQA Transformer Models by Impairing Modality Fusion | Unknown | N/A | |
| Fair Federated Learning via the Proportional Veto Core | Unknown | N/A | |
| Optimal Batched Linear Bandits | Unknown | N/A | |
| TabLog: Test-Time Adaptation for Tabular Data Using Logic Rules | Unknown | N/A | |
| Rejuvenating image-GPT as Strong Visual Representation Learners | Unknown | N/A | |
| CarbonNovo: Joint Design of Protein Structure and Sequence Using a Unified Energy-based Model | Unknown | N/A | |
| Plug-and-Play image restoration with Stochastic deNOising REgularization | Unknown | N/A | |
| Implicit Regularization in Feedback Alignment Learning Mechanisms for Neural Networks | Unknown | N/A | |
| Universal Gradient Methods for Stochastic Convex Optimization | Unknown | N/A | |
| Position: Key Claims in LLM Research Have a Long Tail of Footnotes | Unknown | N/A | |
| Position: Mission Critical – Satellite Data is a Distinct Modality in Machine Learning | Unknown | N/A | |
| Invariant Risk Minimization Is A Total Variation Model | Unknown | N/A | |
| Position: Application-Driven Innovation in Machine Learning | Unknown | N/A | |
| One-Shot Strategic Classification Under Unknown Costs | Unknown | N/A | |
| Modelling Microbial Communities with Graph Neural Networks | Unknown | N/A | |
| Position: Amazing Things Come From Having Many Good Models | Unknown | N/A | |
| Generalizing Orthogonalization for Models with Non-Linearities | Unknown | N/A | |
| Rolling Diffusion Models | Unknown | N/A | |
| Second-Order Uncertainty Quantification: A Distance-Based Approach | Unknown | N/A | |
| Random Exploration in Bayesian Optimization: Order-Optimal Regret and Computational Efficiency | Unknown | N/A | |
| Predictive Coding beyond Correlations | Unknown | N/A | |
| Proactive Detection of Voice Cloning with Localized Watermarking | Unknown | N/A | |
| A Diffusion Model Framework for Unsupervised Neural Combinatorial Optimization | Unknown | N/A | |
| Sparse and Structured Hopfield Networks | Unknown | N/A | |
| A sampling theory perspective on activations for implicit neural representations | Unknown | N/A | |
| A fast algorithm to simulate nonlinear resistive networks | Unknown | N/A | |
| Parallel Affine Transformation Tuning of Markov Chain Monte Carlo | Unknown | N/A | |
| Incentivized Learning in Principal-Agent Bandit Games | Unknown | N/A | |
| Robust CLIP: Unsupervised Adversarial Fine-Tuning of Vision Embeddings for Robust Large Vision-Language Models | Unknown | N/A | |
| Leveraging Self-Consistency for Data-Efficient Amortized Bayesian Inference | Unknown | N/A | |
| Online Learning with Bounded Recall | Unknown | N/A | |
| Asymptotics of Learning with Deep Structured (Random) Features | Unknown | N/A | |
| Simultaneous identification of models and parameters of scientific simulators | Unknown | N/A | |
| Bayesian Adaptation of Network Depth and Width for Continual Learning | Unknown | N/A | |
| Towards Scalable and Versatile Weight Space Learning | Unknown | N/A | |
| Curated LLM: Synergy of LLMs and Data Curation for tabular augmentation in low-data regimes | Unknown | N/A | |
| Lessons from Generalization Error Analysis of Federated Learning: You May Communicate Less Often! | Unknown | N/A | |
| Algorithm of Thoughts: Enhancing Exploration of Ideas in Large Language Models | Unknown | N/A | |
| Prompting is a Double-Edged Sword: Improving Worst-Group Robustness of Foundation Models | Unknown | N/A | |
| A Multimodal Automated Interpretability Agent | Unknown | N/A | |
| The Balanced-Pairwise-Affinities Feature Transform | Unknown | N/A | |
| Improved Generalization of Weight Space Networks via Augmentations | Unknown | N/A | |
| On Multi-Armed Bandit with Impatient Arms | Unknown | N/A | |
| Language Generation with Strictly Proper Scoring Rules | Unknown | N/A | |
| Learning Decision Policies with Instrumental Variables through Double Machine Learning | Unknown | N/A | |
| How Far Can Fairness Constraints Help Recover From Biased Data? | Unknown | N/A | |
| Reducing sequential change detection to sequential estimation | Unknown | N/A | |
| Exploring the Complexity of Deep Neural Networks through Functional Equivalence | Unknown | N/A | |
| Position: Do pretrained Transformers Learn In-Context by Gradient Descent? | Unknown | N/A | |
| Why Do Animals Need Shaping? A Theory of Task Composition and Curriculum Learning | Unknown | N/A | |
| ReLUs Are Sufficient for Learning Implicit Neural Representations | Unknown | N/A | |
| Double Momentum Method for Lower-Level Constrained Bilevel Optimization | Unknown | N/A | |
| LCA-on-the-Line: Benchmarking Out of Distribution Generalization with Class Taxonomies | Unknown | N/A | |
| Sample-Efficient Robust Multi-Agent Reinforcement Learning in the Face of Environmental Uncertainty | Unknown | N/A | |
| CrossGET: Cross-Guided Ensemble of Tokens for Accelerating Vision-Language Transformers | Unknown | N/A | |
| Why Larger Language Models Do In-context Learning Differently? | Unknown | N/A | |
| Long-Tail Learning with Foundation Model: Heavy Fine-Tuning Hurts | Unknown | N/A | |
| Statistical Test for Attention Maps in Vision Transformers | Unknown | N/A | |
| Weakly Convex Regularisers for Inverse Problems: Convergence of Critical Points and Primal-Dual Optimisation | Unknown | N/A | |
| IOI: Invisible One-Iteration Adversarial Attack on No-Reference Image- and Video-Quality Metrics | Unknown | N/A | |
| InterpreTabNet: Distilling Predictive Signals from Tabular Data by Salient Feature Interpretation | Unknown | N/A | |
| Embarrassingly Parallel GFlowNets | Unknown | N/A | |
| Deletion-Anticipative Data Selection with a Limited Budget | Unknown | N/A | |
| Latent variable model for high-dimensional point process with structured missingness | Unknown | N/A | |
| Domain Generalisation via Imprecise Learning | Unknown | N/A | |
| Byzantine Resilient and Fast Federated Few-Shot Learning | Unknown | N/A | |
| Parallelized Spatiotemporal Slot Binding for Videos | Unknown | N/A | |
| In-Context Reinforcement Learning for Variable Action Spaces | Unknown | N/A | |
| Multi-Agent Reinforcement Learning with Hierarchical Coordination for Emergency Responder Stationing | Unknown | N/A | |
| Inexact Newton-type Methods for Optimisation with Nonnegativity Constraints | Unknown | N/A | |
| Probabilistic Modeling of Interpersonal Coordination Processes | Unknown | N/A | |
| Connecting the Dots: Is Mode-Connectedness the Key to Feasible Sample-Based Inference in Bayesian Neural Networks? | Unknown | N/A | |
| Harnessing the Power of Neural Operators with Automatically Encoded Conservation Laws | Unknown | N/A | |
| Hybrid Reinforcement Learning from Offline Observation Alone | Unknown | N/A | |
| SurfPro: Functional Protein Design Based on Continuous Surface | Unknown | N/A | |
| OSN: Infinite Representations of Dynamic 3D Scenes from Monocular Videos | Unknown | N/A | |
| Sparse is Enough in Fine-tuning Pre-trained Large Language Models | Unknown | N/A | |
| Position: Leverage Foundational Models for Black-Box Optimization | Unknown | N/A | |
| Latent Logic Tree Extraction for Event Sequence Explanation from LLMs | Unknown | N/A | |
| Generative Enzyme Design Guided by Functionally Important Sites and Small-Molecule Substrates | Unknown | N/A | |
| Position: A Roadmap to Pluralistic Alignment | Unknown | N/A | |
| CHEMREASONER: Heuristic Search over a Large Language Model’s Knowledge Space using Quantum-Chemical Feedback | Unknown | N/A | |
| Harmonic Self-Conditioned Flow Matching for joint Multi-Ligand Docking and Binding Site Design | Unknown | N/A | |
| Learning to Intervene on Concept Bottlenecks | Unknown | N/A | |
| QORA: Zero-Shot Transfer via Interpretable Object-Relational Model Learning | Unknown | N/A | |
| Private Truly-Everlasting Robust-Prediction | Unknown | N/A | |
| ReGAL: Refactoring Programs to Discover Generalizable Abstractions | Unknown | N/A | |
| RLVF: Learning from Verbal Feedback without Overgeneralization | Unknown | N/A | |
| Online Learning in CMDPs: Handling Stochastic and Adversarial Constraints | Unknown | N/A | |
| Designing Decision Support Systems using Counterfactual Prediction Sets | Unknown | N/A | |
| Whispering Experts: Neural Interventions for Toxicity Mitigation in Language Models | Unknown | N/A | |
| Rényi Pufferfish Privacy: General Additive Noise Mechanisms and Privacy Amplification by Iteration via Shift Reduction Lemmas | Unknown | N/A | |
| Networked Inequality: Preferential Attachment Bias in Graph Neural Network Link Prediction | Unknown | N/A | |
| ED-Copilot: Reduce Emergency Department Wait Time with Language Model Diagnostic Assistance | Unknown | N/A | |
| Constrained Reinforcement Learning Under Model Mismatch | Unknown | N/A | |
| DFA-RAG: Conversational Semantic Router for Large Language Model with Definite Finite Automaton | Unknown | N/A | |
| LSEnet: Lorentz Structural Entropy Neural Network for Deep Graph Clustering | Unknown | N/A | |
| Online Adaptive Anomaly Thresholding with Confidence Sequences | Unknown | N/A | |
| Learning Graph Representation via Graph Entropy Maximization | Unknown | N/A | |
| FedBPT: Efficient Federated Black-box Prompt Tuning for Large Language Models | Unknown | N/A | |
| Regression Learning with Limited Observations of Multivariate Outcomes and Features | Unknown | N/A | |
| Graph Neural Networks with a Distribution of Parametrized Graphs | Unknown | N/A | |
| video-SALMONN: Speech-Enhanced Audio-Visual Large Language Models | Unknown | N/A | |
| BBox-Adapter: Lightweight Adapting for Black-Box Large Language Models | Unknown | N/A | |
| On a Combinatorial Problem Arising in Machine Teaching | Unknown | N/A | |
| Interpretable Deep Clustering for Tabular Data | Unknown | N/A | |
| Reinforcement Learning from Reachability Specifications: PAC Guarantees with Expected Conditional Distance | Unknown | N/A | |
| A Universal Class of Sharpness-Aware Minimization Algorithms | Unknown | N/A | |
| Posterior Sampling-Based Bayesian Optimization with Tighter Bayesian Regret Bounds | Unknown | N/A | |
| Deciphering RNA Secondary Structure Prediction: A Probabilistic K-Rook Matching Perspective | Unknown | N/A | |
| Community-Invariant Graph Contrastive Learning | Unknown | N/A | |
| Memorization Through the Lens of Curvature of Loss Function Around Samples | Unknown | N/A | |
| Fourier Controller Networks for Real-Time Decision-Making in Embodied Learning | Unknown | N/A | |
| Learning Solution-Aware Transformers for Efficiently Solving Quadratic Assignment Problem | Unknown | N/A | |
| OTMatch: Improving Semi-Supervised Learning with Optimal Transport | Unknown | N/A | |
| Post-hoc Part-Prototype Networks | Unknown | N/A | |
| Rethinking Optimization and Architecture for Tiny Language Models | Unknown | N/A | |
| Merging Multi-Task Models via Weight-Ensembling Mixture of Experts | Unknown | N/A | |
| StrokeNUWA—Tokenizing Strokes for Vector Graphic Synthesis | Unknown | N/A | |
| SSL4Q: Semi-Supervised Learning of Quantum Data with Application to Quantum State Classification | Unknown | N/A | |
| Finite Smoothing Algorithm for High-Dimensional Support Vector Machines and Quantile Regression | Unknown | N/A | |
| MathScale: Scaling Instruction Tuning for Mathematical Reasoning | Unknown | N/A | |
| QUEST: Query-Aware Sparsity for Efficient Long-Context LLM Inference | Unknown | N/A | |
| Position: What makes an image realistic? | Unknown | N/A | |
| A New Branch-and-Bound Pruning Framework for $\ell_0$-Regularized Problems | Unknown | N/A | |
| Beyond Individual Input for Deep Anomaly Detection on Tabular Data | Unknown | N/A | |
| Collapse-Aware Triplet Decoupling for Adversarially Robust Image Retrieval | Unknown | N/A | |
| MOKD: Cross-domain Finetuning for Few-shot Classification via Maximizing Optimized Kernel Dependence | Unknown | N/A | |
| Ranking-based Client Imitation Selection for Efficient Federated Learning | Unknown | N/A | |
| OT-CLIP: Understanding and Generalizing CLIP via Optimal Transport | Unknown | N/A | |
| Copula-Nested Spectral Kernel Network | Unknown | N/A | |
| FRAPPÉ: A Group Fairness Framework for Post-Processing Everything | Unknown | N/A | |
| Faster Maximum Inner Product Search in High Dimensions | Unknown | N/A | |
| Position: Enforced Amnesia as a Way to Mitigate the Potential Risk of Silent Suffering in the Conscious AI | Unknown | N/A | |
| How Deep Networks Learn Sparse and Hierarchical Data: the Sparse Random Hierarchy Model | Unknown | N/A | |
| Position: Do Not Explain Vision Models Without Context | Unknown | N/A | |
| Neural SPH: Improved Neural Modeling of Lagrangian Fluid Dynamics | Unknown | N/A | |
| Inferring the Long-Term Causal Effects of Long-Term Treatments from Short-Term Experiments | Unknown | N/A | |
| Simplicity Bias of Two-Layer Networks beyond Linearly Separable Data | Unknown | N/A | |
| Exploration by Optimization with Hybrid Regularizers: Logarithmic Regret with Adversarial Robustness in Partial Monitoring | Unknown | N/A | |
| Coactive Learning for Large Language Models using Implicit User Feedback | Unknown | N/A | |
| An Efficient Self-Learning Framework For Interactive Spoken Dialog Systems | Unknown | N/A | |
| Matroid Semi-Bandits in Sublinear Time | Unknown | N/A | |
| Improving Antibody Humanness Prediction using Patent Data | Unknown | N/A | |
| Feedback Efficient Online Fine-Tuning of Diffusion Models | Unknown | N/A | |
| Do Large Language Models Perform the Way People Expect? Measuring the Human Generalization Function | Unknown | N/A | |
| Reward-Free Kernel-Based Reinforcement Learning | Unknown | N/A | |
| Federated Self-Explaining GNNs with Anti-shortcut Augmentations | Unknown | N/A | |
| How to Leverage Diverse Demonstrations in Offline Imitation Learning | Unknown | N/A | |
| Position: Why Tabular Foundation Models Should Be a Research Priority | Unknown | N/A | |
| Piecewise Constant and Linear Regression Trees: An Optimal Dynamic Programming Approach | Unknown | N/A | |
| Algorithm and Hardness for Dynamic Attention Maintenance in Large Language Models | Unknown | N/A | |
| Proactive DP: A Multiple Target Optimization Framework for DP-SGD | Unknown | N/A | |
| When Representations Align: Universality in Representation Learning Dynamics | Unknown | N/A | |
| Generalized Smooth Variational Inequalities: Methods with Adaptive Stepsizes | Unknown | N/A | |
| Discovering Mixtures of Structural Causal Models from Time Series Data | Unknown | N/A | |
| Statistically Optimal Generative Modeling with Maximum Deviation from the Empirical Distribution | Unknown | N/A | |
| Stochastic Gradient Flow Dynamics of Test Risk and its Exact Solution for Weak Features | Unknown | N/A | |
| Code as Reward: Empowering Reinforcement Learning with VLMs | Unknown | N/A | |
| Topological Neural Networks go Persistent, Equivariant, and Continuous | Unknown | N/A | |
| To the Max: Reinventing Reward in Reinforcement Learning | Unknown | N/A | |
| Imitation Learning in Discounted Linear MDPs without exploration assumptions | Unknown | N/A | |
| Parameter Estimation in DAGs from Incomplete Data via Optimal Transport | Unknown | N/A | |
| Optimal Transport for Structure Learning Under Missing Data | Unknown | N/A | |
| Convergence of Some Convex Message Passing Algorithms to a Fixed Point | Unknown | N/A | |
| Unsupervised Evaluation of Code LLMs with Round-Trip Correctness | Unknown | N/A | |
| Beyond Implicit Bias: The Insignificance of SGD Noise in Online Learning | Unknown | N/A | |
| Trustless Audits without Revealing Data or Models | Unknown | N/A | |
| Implicit Compressibility of Overparametrized Neural Networks Trained with Heavy-Tailed SGD | Unknown | N/A | |
| SeMOPO: Learning High-quality Model and Policy from Low-quality Offline Visual Datasets | Unknown | N/A | |
| VinT-6D: A Large-Scale Object-in-hand Dataset from Vision, Touch and Proprioception | Unknown | N/A | |
| Superpoint Gaussian Splatting for Real-Time High-Fidelity Dynamic Scene Reconstruction | Unknown | N/A | |
| S3GCL: Spectral, Swift, Spatial Graph Contrastive Learning | Unknown | N/A | |
| Non-stationary Online Convex Optimization with Arbitrary Delays | Unknown | N/A | |
| Towards Unified Multi-granularity Text Detection with Interactive Attention | Unknown | N/A | |
| One Prompt is not Enough: Automated Construction of a Mixture-of-Expert Prompts | Unknown | N/A | |
| On Universally Optimal Algorithms for A/B Testing | Unknown | N/A | |
| Adversarially Robust Hypothesis Transfer Learning | Unknown | N/A | |
| Towards Theoretical Understanding of Learning Large-scale Dependent Data via Random Features | Unknown | N/A | |
| A Circuit Domain Generalization Framework for Efficient Logic Synthesis in Chip Design | Unknown | N/A | |
| Revisiting the Power of Prompt for Visual Tuning | Unknown | N/A | |
| TVE: Learning Meta-attribution for Transferable Vision Explainer | Unknown | N/A | |
| Adaptively Learning to Select-Rank in Online Platforms | Unknown | N/A | |
| Imitation Learning from Purified Demonstrations | Unknown | N/A | |
| Image Restoration Through Generalized Ornstein-Uhlenbeck Bridge | Unknown | N/A | |
| Diagnosing the Compositional Knowledge of Vision Language Models from a Game-Theoretic View | Unknown | N/A | |
| An Efficient Maximal Ancestral Graph Listing Algorithm | Unknown | N/A | |
| Monotone, Bi-Lipschitz, and Polyak-Łojasiewicz Networks | Unknown | N/A | |
| Swallowing the Bitter Pill: Simplified Scalable Conformer Generation | Unknown | N/A | |
| Optimal Kernel Quantile Learning with Random Features | Unknown | N/A | |
| MEMORYLLM: Towards Self-Updatable Large Language Models | Unknown | N/A | |
| Momentum for the Win: Collaborative Federated Reinforcement Learning across Heterogeneous Environments | Unknown | N/A | |
| Mollification Effects of Policy Gradient Methods | Unknown | N/A | |
| Optimal Kernel Choice for Score Function-based Causal Discovery | Unknown | N/A | |
| Rapid Learning without Catastrophic Forgetting in the Morris Water Maze | Unknown | N/A | |
| Learning with Complementary Labels Revisited: The Selected-Completely-at-Random Setting Is More Practical | Unknown | N/A | |
| Total Variation Floodgate for Variable Importance Inference in Classification | Unknown | N/A | |
| In-context Learning on Function Classes Unveiled for Transformers | Unknown | N/A | |
| StableSSM: Alleviating the Curse of Memory in State-space Models through Stable Reparameterization | Unknown | N/A | |
| Bootstrap AutoEncoders With Contrastive Paradigm for Self-supervised Gaze Estimation | Unknown | N/A | |
| Highway Value Iteration Networks | Unknown | N/A | |
| Improving Generalization in Offline Reinforcement Learning via Adversarial Data Splitting | Unknown | N/A | |
| Visual Transformer with Differentiable Channel Selection: An Information Bottleneck Inspired Approach | Unknown | N/A | |
| An Iterative Min-Min Optimization Method for Sparse Bayesian Learning | Unknown | N/A | |
| Open Ad Hoc Teamwork with Cooperative Game Theory | Unknown | N/A | |
| Connecting the Dots: Collaborative Fine-tuning for Black-Box Vision-Language Models | Unknown | N/A | |
| Bridging Data Gaps in Diffusion Models with Adversarial Noise-Based Transfer Learning | Unknown | N/A | |
| EfficientZero V2: Mastering Discrete and Continuous Control with Limited Data | Unknown | N/A | |
| A Dual-module Framework for Counterfactual Estimation over Time | Unknown | N/A | |
| Transforming and Combining Rewards for Aligning Large Language Models | Unknown | N/A | |
| TroVE: Inducing Verifiable and Efficient Toolboxes for Solving Programmatic Tasks | Unknown | N/A | |
| Pi-DUAL: Using privileged information to distinguish clean from noisy labels | Unknown | N/A | |
| Sample Average Approximation for Conditional Stochastic Optimization with Dependent Data | Unknown | N/A | |
| InstructRetro: Instruction Tuning post Retrieval-Augmented Pretraining | Unknown | N/A | |
| A Fine-grained Analysis of Fitted Q-evaluation: Beyond Parametric Models | Unknown | N/A | |
| Efficient Online Set-valued Classification with Bandit Feedback | Unknown | N/A | |
| LLM-Empowered State Representation for Reinforcement Learning | Unknown | N/A | |
| Proteus: Exploring Protein Structure Generation for Enhanced Designability and Efficiency | Unknown | N/A | |
| How to Trace Latent Generative Model Generated Images without Artificial Watermark? | Unknown | N/A | |
| Distributed High-Dimensional Quantile Regression: Estimation Efficiency and Support Recovery | Unknown | N/A | |
| Generalization Analysis of Stochastic Weight Averaging with General Sampling | Unknown | N/A | |
| CW Complex Hypothesis for Image Data | Unknown | N/A | |
| Optimal Exact Recovery in Semi-Supervised Learning: A Study of Spectral Methods and Graph Convolutional Networks | Unknown | N/A | |
| Open-Vocabulary Calibration for Fine-tuned CLIP | Unknown | N/A | |
| A Hierarchical Adaptive Multi-Task Reinforcement Learning Framework for Multiplier Circuit Design | Unknown | N/A | |
| Transformers Provably Learn Sparse Token Selection While Fully-Connected Nets Cannot | Unknown | N/A | |
| Defense against Model Extraction Attack by Bayesian Active Watermarking | Unknown | N/A | |
| Autaptic Synaptic Circuit Enhances Spatio-temporal Predictive Learning of Spiking Neural Networks | Unknown | N/A | |
| Learning with Adaptive Resource Allocation | Unknown | N/A | |
| Exploring Intrinsic Dimension for Vision-Language Model Pruning | Unknown | N/A | |
| Boximator: Generating Rich and Controllable Motions for Video Synthesis | Unknown | N/A | |
| Bridging Model Heterogeneity in Federated Learning via Uncertainty-based Asymmetrical Reciprocity Learning | Unknown | N/A | |
| Neural Collapse meets Differential Privacy: Curious behaviors of NoisyGD with Near-Perfect Representation Learning | Unknown | N/A | |
| Batch Singular Value Polarization and Weighted Semantic Augmentation for Universal Domain Adaptation | Unknown | N/A | |
| Mapping the Multiverse of Latent Representations | Unknown | N/A | |
| Exact Soft Analytical Side-Channel Attacks using Tractable Circuits | Unknown | N/A | |
| Learning Pseudo-Contractive Denoisers for Inverse Problems | Unknown | N/A | |
| Rethinking Generative Large Language Model Evaluation for Semantic Comprehension | Unknown | N/A | |
| Task Groupings Regularization: Data-Free Meta-Learning with Heterogeneous Pre-trained Models | Unknown | N/A | |
| Magicoder: Empowering Code Generation with OSS-Instruct | Unknown | N/A | |
| Extending Test-Time Augmentation with Metamorphic Relations for Combinatorial Problems | Unknown | N/A | |
| Position: AI/ML Influencers Have a Place in the Academic Process | Unknown | N/A | |
| Manifold Integrated Gradients: Riemannian Geometry for Feature Attribution | Unknown | N/A | |
| Contrastive Representation for Data Filtering in Cross-Domain Offline Reinforcement Learning | Unknown | N/A | |
| Diffusion-based Missing-view Generation With the Application on Incomplete Multi-view Clustering | Unknown | N/A | |
| Which Frequencies do CNNs Need? Emergent Bottleneck Structure in Feature Learning | Unknown | N/A | |
| Provable Contrastive Continual Learning | Unknown | N/A | |
| Stability-Informed Initialization of Neural Ordinary Differential Equations | Unknown | N/A | |
| Multiply Robust Estimation for Local Distribution Shifts with Multiple Domains | Unknown | N/A | |
| Unified Training of Universal Time Series Forecasting Transformers | Unknown | N/A | |
| Adaptive Accompaniment with ReaLchords | Unknown | N/A | |
| Ditto: Quantization-aware Secure Inference of Transformers upon MPC | Unknown | N/A | |
| NExT-GPT: Any-to-Any Multimodal LLM | Unknown | N/A | |
| Understanding Stochastic Natural Gradient Variational Inference | Unknown | N/A | |
| FAFE: Immune Complex Modeling with Geodesic Distance Loss on Noisy Group Frames | Unknown | N/A | |
| A Resilient and Accessible Distribution-Preserving Watermark for Large Language Models | Unknown | N/A | |
| PointMC: Multi-instance Point Cloud Registration based on Maximal Cliques | Unknown | N/A | |
| Evaluating and Analyzing Relationship Hallucinations in Large Vision-Language Models | Unknown | N/A | |
| Borda Regret Minimization for Generalized Linear Dueling Bandits | Unknown | N/A | |
| DISCRET: Synthesizing Faithful Explanations For Treatment Effect Estimation | Unknown | N/A | |
| Surface-VQMAE: Vector-quantized Masked Auto-encoders on Molecular Surfaces | Unknown | N/A | |
| Learning Causal Relations from Subsampled Time Series with Two Time-Slices | Unknown | N/A | |
| AND: Audio Network Dissection for Interpreting Deep Acoustic Models | Unknown | N/A | |
| Transolver: A Fast Transformer Solver for PDEs on General Geometries | Unknown | N/A | |
| Confidence-aware Contrastive Learning for Selective Classification | Unknown | N/A | |
| Minimally Modifying a Markov Game to Achieve Any Nash Equilibrium and Value | Unknown | N/A | |
| VoroNav: Voronoi-based Zero-shot Object Navigation with Large Language Model | Unknown | N/A | |
| Profile Reconstruction from Private Sketches | Unknown | N/A | |
| Policy Learning for Balancing Short-Term and Long-Term Rewards | Unknown | N/A | |
| A Theory of Fault-Tolerant Learning | Unknown | N/A | |
| How to Explore with Belief: State Entropy Maximization in POMDPs | Unknown | N/A | |
| Mitigating Catastrophic Forgetting in Online Continual Learning by Modeling Previous Task Interrelations via Pareto Optimization | Unknown | N/A | |
| Detecting Any instruction-to-answer interaction relationship: Universal Instruction-to-Answer Navigator for Med-VQA | Unknown | N/A | |
| Unraveling the Impact of Heterophilic Structures on Graph Positive-Unlabeled Learning | Unknown | N/A | |
| Mitigating Label Noise on Graphs via Topological Sample Selection | Unknown | N/A | |
| Boosting Reinforcement Learning with Strongly Delayed Feedback Through Auxiliary Short Delays | Unknown | N/A | |
| HGCN2SP: Hierarchical Graph Convolutional Network for Two-Stage Stochastic Programming | Unknown | N/A | |
| Q-Align: Teaching LMMs for Visual Scoring via Discrete Text-Defined Levels | Unknown | N/A | |
| Refined Coreset Selection: Towards Minimal Coreset Size under Model Performance Constraints | Unknown | N/A | |
| LESS: Selecting Influential Data for Targeted Instruction Tuning | Unknown | N/A | |
| Contrastive Learning for Clinical Outcome Prediction with Partial Data Sources | Unknown | N/A | |
| Position: Rethinking Post-Hoc Search-Based Neural Approaches for Solving Large-Scale Traveling Salesman Problems | Unknown | N/A | |
| Delving into the Convergence of Generalized Smooth Minimax Optimization | Unknown | N/A | |
| Category-Aware Active Domain Adaptation | Unknown | N/A | |
| Improved Operator Learning by Orthogonal Attention | Unknown | N/A | |
| Temporal Spiking Neural Networks with Synaptic Delay for Graph Reasoning | Unknown | N/A | |
| Efficient Contrastive Learning for Fast and Accurate Inference on Graphs | Unknown | N/A | |
| CCM: Real-Time Controllable Visual Content Creation Using Text-to-Image Consistency Models | Unknown | N/A | |
| Intersecting-Boundary-Sensitive Fingerprinting for Tampering Detection of DNN Models | Unknown | N/A | |
| Automating the Selection of Proxy Variables of Unmeasured Confounders | Unknown | N/A | |
| FedREDefense: Defending against Model Poisoning Attacks for Federated Learning using Model Update Reconstruction Error | Unknown | N/A | |
| Improving SAM Requires Rethinking its Optimization Formulation | Unknown | N/A | |
| Implicit Bias of AdamW: $\ell_\infty$-Norm Constrained Optimization | Unknown | N/A | |
| Local Causal Structure Learning in the Presence of Latent Variables | Unknown | N/A | |
| Reflected Flow Matching | Unknown | N/A | |
| Federated Neuro-Symbolic Learning | Unknown | N/A | |
| HelmFluid: Learning Helmholtz Dynamics for Interpretable Fluid Prediction | Unknown | N/A | |
| See More Details: Efficient Image Super-Resolution by Experts Mining | Unknown | N/A | |
| Provably Efficient Reinforcement Learning for Adversarial Restless Multi-Armed Bandits with Unknown Transitions and Bandit Feedback | Unknown | N/A | |
| Distilling Morphology-Conditioned Hypernetworks for Efficient Universal Morphology Control | Unknown | N/A | |
| Stochastic Bandits with ReLU Neural Networks | Unknown | N/A | |
| Intersectional Unfairness Discovery | Unknown | N/A | |
| Semantic-Aware Human Object Interaction Image Generation | Unknown | N/A | |
| Adapting Static Fairness to Sequential Decision-Making: Bias Mitigation Strategies towards Equal Long-term Benefit Rate | Unknown | N/A | |
| Equivariant Graph Neural Operator for Modeling 3D Dynamics | Unknown | N/A | |
| Aligned Objective for Soft-Pseudo-Label Generation in Supervised Learning | Unknown | N/A | |
| Non-clairvoyant Scheduling with Partial Predictions | Unknown | N/A | |
| BiSHop: Bi-Directional Cellular Learning for Tabular Data with Generalized Sparse Modern Hopfield Model | Unknown | N/A | |
| Enhancing Vision Transformer: Amplifying Non-Linearity in Feedforward Network Module | Unknown | N/A | |
| Meta-Reinforcement Learning Robust to Distributional Shift Via Performing Lifelong In-Context Learning | Unknown | N/A | |
| Prompt-guided Precise Audio Editing with Diffusion Models | Unknown | N/A | |
| Robust Inverse Constrained Reinforcement Learning under Model Misspecification | Unknown | N/A | |
| Soft Prompt Recovers Compressed LLMs, Transferably | Unknown | N/A | |
| Contrastive Preference Optimization: Pushing the Boundaries of LLM Performance in Machine Translation | Unknown | N/A | |
| Adaptive Group Personalization for Federated Mutual Transfer Learning | Unknown | N/A | |
| Learning Exceptional Subgroups by End-to-End Maximizing KL-Divergence | Unknown | N/A | |
| Pricing with Contextual Elasticity and Heteroscedastic Valuation | Unknown | N/A | |
| Learning 1-Bit Tiny Object Detector with Discriminative Feature Refinement | Unknown | N/A | |
| SLOG: An Inductive Spectral Graph Neural Network Beyond Polynomial Filter | Unknown | N/A | |
| Libra: Building Decoupled Vision System on Large Language Models | Unknown | N/A | |
| Out-of-Distribution Detection via Deep Multi-Comprehension Ensemble | Unknown | N/A | |
| Random Masking Finds Winning Tickets for Parameter Efficient Fine-tuning | Unknown | N/A | |
| Exponential Spectral Pursuit: An Effective Initialization Method for Sparse Phase Retrieval | Unknown | N/A | |
| Iterative Regularized Policy Optimization with Imperfect Demonstrations | Unknown | N/A | |
| Few-shot Adaptation to Distribution Shifts By Mixing Source and Target Embeddings | Unknown | N/A | |
| Offline Multi-Objective Optimization | Unknown | N/A | |
| FairProof: Confidential and Certifiable Fairness for Neural Networks | Unknown | N/A | |
| Balancing Similarity and Complementarity for Federated Learning | Unknown | N/A | |
| Probabilistic Time Series Modeling with Decomposable Denoising Diffusion Model | Unknown | N/A | |
| Exploring the LLM Journey from Cognition to Expression with Linear Representations | Unknown | N/A | |
| A Space Group Symmetry Informed Network for O(3) Equivariant Crystal Tensor Prediction | Unknown | N/A | |
| Offline Imitation from Observation via Primal Wasserstein State Occupancy Matching | Unknown | N/A | |
| Handling Heterogeneous Curvatures in Bandit LQR Control | Unknown | N/A | |
| Foundations of Testing for Finite-Sample Causal Discovery | Unknown | N/A | |
| Retrieval Across Any Domains via Large-scale Pre-trained Model | Unknown | N/A | |
| Reducing Balancing Error for Causal Inference via Optimal Transport | Unknown | N/A | |
| Sample-Efficient Multiagent Reinforcement Learning with Reset Replay | Unknown | N/A | |
| Guidance with Spherical Gaussian Constraint for Conditional Diffusion | Unknown | N/A | |
| Better Safe than Sorry: Pre-training CLIP against Targeted Data Poisoning and Backdoor Attacks | Unknown | N/A | |
| SAM as the Guide: Mastering Pseudo-Label Refinement in Semi-Supervised Referring Expression Segmentation | Unknown | N/A | |
| Small-loss Adaptive Regret for Online Convex Optimization | Unknown | N/A | |
| Position: Towards Implicit Prompt For Text-To-Image Models | Unknown | N/A | |
| Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment | Unknown | N/A | |
| Understanding Server-Assisted Federated Learning in the Presence of Incomplete Client Participation | Unknown | N/A | |
| Representation Surgery for Multi-Task Model Merging | Unknown | N/A | |
| Reducing Fine-Tuning Memory Overhead by Approximate and Memory-Sharing Backpropagation | Unknown | N/A | |
| Unlocking the Power of Spatial and Temporal Information in Medical Multimodal Pre-training | Unknown | N/A | |
| UniAudio: Towards Universal Audio Generation with Large Language Models | Unknown | N/A | |
| Explain Temporal Black-Box Models via Functional Decomposition | Unknown | N/A | |
| Stability and Generalization of Stochastic Compositional Gradient Descent Algorithms | Unknown | N/A | |
| Neuro-Symbolic Temporal Point Processes | Unknown | N/A | |
| Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs | Unknown | N/A | |
| Empowering Graph Invariance Learning with Deep Spurious Infomax | Unknown | N/A | |
| Human vs. Generative AI in Content Creation Competition: Symbiosis or Conflict? | Unknown | N/A | |
| Mobile Attention: Mobile-Friendly Linear-Attention for Vision Transformers | Unknown | N/A | |
| Socialized Learning: Making Each Other Better Through Multi-Agent Collaboration | Unknown | N/A | |
| Towards Robust Model-Based Reinforcement Learning Against Adversarial Corruption | Unknown | N/A | |
| StableMask: Refining Causal Masking in Decoder-only Transformer | Unknown | N/A | |
| Junk DNA Hypothesis: Pruning Small Pre-Trained Weights *Irreversibly* and *Monotonically* Impairs "Difficult" Downstream Tasks in LLMs | Unknown | N/A | |
| Uncertainty Estimation by Density Aware Evidential Deep Learning | Unknown | N/A | |
| FRAG: Frequency Adapting Group for Diffusion Video Editing | Unknown | N/A | |
| When Linear Attention Meets Autoregressive Decoding: Towards More Effective and Efficient Linearized Large Language Models | Unknown | N/A | |
| EvGGS: A Collaborative Learning Framework for Event-based Generalizable Gaussian Splatting | Unknown | N/A | |
| SpikeZIP-TF: Conversion is All You Need for Transformer-based SNN | Unknown | N/A | |
| Activation-Descent Regularization for Input Optimization of ReLU Networks | Unknown | N/A | |
| Privacy-Preserving Instructions for Aligning Large Language Models | Unknown | N/A | |
| Learning Latent Structures in Network Games via Data-Dependent Gated-Prior Graph Variational Autoencoders | Unknown | N/A | |
| Enabling Few-Shot Learning with PID Control: A Layer Adaptive Optimizer | Unknown | N/A | |
| Generalization Bound and New Algorithm for Clean-Label Backdoor Attack | Unknown | N/A | |
| Learning Causal Dynamics Models in Object-Oriented Environments | Unknown | N/A | |
| ViP: A Differentially Private Foundation Model for Computer Vision | Unknown | N/A | |
| Purify Unlearnable Examples via Rate-Constrained Variational Autoencoders | Unknown | N/A | |
| MM-Vet: Evaluating Large Multimodal Models for Integrated Capabilities | Unknown | N/A | |
| Language Models are Super Mario: Absorbing Abilities from Homologous Models as a Free Lunch | Unknown | N/A | |
| Improving Sharpness-Aware Minimization by Lookahead | Unknown | N/A | |
| SHINE: Shielding Backdoors in Deep Reinforcement Learning | Unknown | N/A | |
| Not Just Pretty Pictures: Toward Interventional Data Augmentation Using Text-to-Image Generators | Unknown | N/A | |
| DiffAug: Enhance Unsupervised Contrastive Learning with Domain-Knowledge-Free Diffusion-based Data Augmentation | Unknown | N/A | |
| Robustly Learning Single-Index Models via Alignment Sharpness | Unknown | N/A | |
| In-Context Decision Transformer: Reinforcement Learning via Hierarchical Chain-of-Thought | Unknown | N/A | |
| tnGPS: Discovering Unknown Tensor Network Structure Search Algorithms via Large Language Models (LLMs) | Unknown | N/A | |
| Token-level Direct Preference Optimization | Unknown | N/A | |
| Learning Reward for Robot Skills Using Large Language Models via Self-Alignment | Unknown | N/A | |
| Graph Mixup on Approximate Gromov–Wasserstein Geodesics | Unknown | N/A | |
| IM-Unpack: Training and Inference with Arbitrarily Low Precision Integers | Unknown | N/A | |
| Differentiable Annealed Importance Sampling Minimizes The Jensen-Shannon Divergence Between Initial and Target Distribution | Unknown | N/A | |
| Robust Learning-Augmented Dictionaries | Unknown | N/A | |
| Tight Partial Identification of Causal Effects with Marginal Distribution of Unmeasured Confounders | Unknown | N/A | |
| DAG-Based Column Generation for Adversarial Team Games | Unknown | N/A | |
| Efficient Stochastic Approximation of Minimax Excess Risk Optimization | Unknown | N/A | |
| Discounted Adaptive Online Learning: Towards Better Regularization | Unknown | N/A | |
| Provably Efficient Partially Observable Risk-sensitive Reinforcement Learning with Hindsight Observation | Unknown | N/A | |
| Self-Supervised Coarsening of Unstructured Grid with Automatic Differentiation | Unknown | N/A | |
| LQER: Low-Rank Quantization Error Reconstruction for LLMs | Unknown | N/A | |
| Random Scaling and Momentum for Non-smooth Non-convex Optimization | Unknown | N/A | |
| Look Ahead or Look Around? A Theoretical Comparison Between Autoregressive and Masked Pretraining | Unknown | N/A | |
| CaM: Cache Merging for Memory-efficient LLMs Inference | Unknown | N/A | |
| Watermarks in the Sand: Impossibility of Strong Watermarking for Language Models | Unknown | N/A | |
| MILP-FBGen: LP/MILP Instance Generation with Feasibility/Boundedness | Unknown | N/A | |
| ILILT: Implicit Learning of Inverse Lithography Technologies | Unknown | N/A | |
| SF-DQN: Provable Knowledge Transfer using Successor Feature for Deep Reinforcement Learning | Unknown | N/A | |
| More Benefits of Being Distributional: Second-Order Bounds for Reinforcement Learning | Unknown | N/A | |
| Advancing DRL Agents in Commercial Fighting Games: Training, Integration, and Agent-Human Alignment | Unknown | N/A | |
| Parameter-Efficient Fine-Tuning with Controls | Unknown | N/A | |
| Deep Regression Representation Learning with Topology | Unknown | N/A | |
| Understanding Unimodal Bias in Multimodal Deep Linear Networks | Unknown | N/A | |
| Revisiting Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning: A Benchmark | Unknown | N/A | |
| S3O: A Dual-Phase Approach for Reconstructing Dynamic Shape and Skeleton of Articulated Objects from Single Monocular Video | Unknown | N/A | |
| Understanding and Diagnosing Deep Reinforcement Learning | Unknown | N/A | |
| Tackling Non-Stationarity in Reinforcement Learning via Causal-Origin Representation | Unknown | N/A | |
| Multi-Factor Adaptive Vision Selection for Egocentric Video Question Answering | Unknown | N/A | |
| UP2ME: Univariate Pre-training to Multivariate Fine-tuning as a General-purpose Framework for Multivariate Time Series Analysis | Unknown | N/A | |
| Improving Accuracy-robustness Trade-off via Pixel Reweighted Adversarial Training | Unknown | N/A | |
| MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions | Unknown | N/A | |
| Wukong: Towards a Scaling Law for Large-Scale Recommendation | Unknown | N/A | |
| Nonparametric Teaching of Implicit Neural Representations | Unknown | N/A | |
| Sparse-to-dense Multimodal Image Registration via Multi-Task Learning | Unknown | N/A | |
| In-Context Principle Learning from Mistakes | Unknown | N/A | |
| A Federated Stochastic Multi-level Compositional Minimax Algorithm for Deep AUC Maximization | Unknown | N/A | |
| Riemannian Preconditioned LoRA for Fine-Tuning Foundation Models | Unknown | N/A | |
| Enhancing Storage and Computational Efficiency in Federated Multimodal Learning for Large-Scale Models | Unknown | N/A | |
| Online Resource Allocation with Non-Stationary Customers | Unknown | N/A | |
| Debiased Offline Representation Learning for Fast Online Adaptation in Non-stationary Dynamics | Unknown | N/A | |
| Fair Risk Control: A Generalized Framework for Calibrating Multi-group Fairness Risks | Unknown | N/A | |
| Online Matching with Stochastic Rewards: Provable Better Bound via Adversarial Reinforcement Learning | Unknown | N/A | |
| Trustworthy Alignment of Retrieval-Augmented Large Language Models via Reinforcement Learning | Unknown | N/A | |
| Switchable Decision: Dynamic Neural Generation Networks | Unknown | N/A | |
| Interpreting and Improving Large Language Models in Arithmetic Calculation | Unknown | N/A | |
| GroupCover: A Secure, Efficient and Scalable Inference Framework for On-device Model Protection based on TEEs | Unknown | N/A | |
| Candidate Pseudolabel Learning: Enhancing Vision-Language Models by Prompt Tuning with Unlabeled Data | Unknown | N/A | |
| Exploring the Benefit of Activation Sparsity in Pre-training | Unknown | N/A | |
| Causal Representation Learning from Multiple Distributions: A General Setting | Unknown | N/A | |
| FESSNC: Fast Exponentially Stable and Safe Neural Controller | Unknown | N/A | |
| Rethinking Guidance Information to Utilize Unlabeled Samples: A Label Encoding Perspective | Unknown | N/A | |
| Minimax Optimality of Score-based Diffusion Models: Beyond the Density Lower Bound Assumptions | Unknown | N/A | |
| Two Heads Are Better Than One: Boosting Graph Sparse Training via Semantic and Topological Awareness | Unknown | N/A | |
| Beyond the ROC Curve: Classification Trees Using Cost-Optimal Curves, with Application to Imbalanced Datasets | Unknown | N/A | |
| Distributionally Robust Data Valuation | Unknown | N/A | |
| MLIP: Efficient Multi-Perspective Language-Image Pretraining with Exhaustive Data Utilization | Unknown | N/A | |
| Efficient Contextual Bandits with Uninformed Feedback Graphs | Unknown | N/A | |
| Efficient Denoising Diffusion via Probabilistic Masking | Unknown | N/A | |
| Confronting Reward Overoptimization for Diffusion Models: A Perspective of Inductive and Primacy Biases | Unknown | N/A | |
| Uncertainty-Aware Reward-Free Exploration with General Function Approximation | Unknown | N/A | |
| Feature Contamination: Neural Networks Learn Uncorrelated Features and Fail to Generalize | Unknown | N/A | |
| On the Expressive Power of Spectral Invariant Graph Neural Networks | Unknown | N/A | |
| Neural Jump-Diffusion Temporal Point Processes | Unknown | N/A | |
| Fast Text-to-3D-Aware Face Generation and Manipulation via Direct Cross-modal Mapping and Geometric Regularization | Unknown | N/A | |
| Accelerating Iterative Retrieval-augmented Language Model Serving with Speculation | Unknown | N/A | |
| Position: Measure Dataset Diversity, Don't Just Claim It | Unknown | N/A | |
| Long Is More for Alignment: A Simple but Tough-to-Beat Baseline for Instruction Fine-Tuning | Unknown | N/A | |
| Absolute Policy Optimization: Enhancing Lower Probability Bound of Performance with High Confidence | Unknown | N/A | |
| Spider: A Unified Framework for Context-dependent Concept Segmentation | Unknown | N/A | |
| Rethinking Adversarial Robustness in the Context of the Right to be Forgotten | Unknown | N/A | |
| Quantum Implicit Neural Representations | Unknown | N/A | |
| Is Inverse Reinforcement Learning Harder than Standard Reinforcement Learning? A Theoretical Perspective | Unknown | N/A | |
| A Statistical Theory of Regularization-Based Continual Learning | Unknown | N/A | |
| Unsupervised Representation Learning of Brain Activity via Bridging Voxel Activity and Functional Connectivity | Unknown | N/A | |
| Double-Step Alternating Extragradient with Increasing Timescale Separation for Finding Local Minimax Points: Provable Improvements | Unknown | N/A | |
| CompeteAI: Understanding the Competition Dynamics of Large Language Model-based Agents | Unknown | N/A | |
| LangCell: Language-Cell Pre-training for Cell Identity Understanding | Unknown | N/A | |
| Graph-based Time Series Clustering for End-to-End Hierarchical Forecasting | Unknown | N/A | |
| Exploiting Negative Samples: A Catalyst for Cohort Discovery in Healthcare Analytics | Unknown | N/A | |
| Characteristic Guidance: Non-linear Correction for Diffusion Model at Large Guidance Scale | Unknown | N/A | |
| Premier-TACO is a Few-Shot Policy Learner: Pretraining Multitask Representation via Temporal Action-Driven Contrastive Loss | Unknown | N/A | |
| Conformal Predictions under Markovian Data | Unknown | N/A | |
| Learning Latent Space Hierarchical EBM Diffusion Models | Unknown | N/A | |
| DPN: Decoupling Partition and Navigation for Neural Solvers of Min-max Vehicle Routing Problems | Unknown | N/A | |
| On Prompt-Driven Safeguarding for Large Language Models | Unknown | N/A | |
| Self-Infilling Code Generation | Unknown | N/A | |
| ERQ: Error Reduction for Post-Training Quantization of Vision Transformers | Unknown | N/A | |
| Provably Efficient Exploration in Quantum Reinforcement Learning with Logarithmic Worst-Case Regret | Unknown | N/A | |
| GNNs Also Deserve Editing, and They Need It More Than Once | Unknown | N/A | |
| Causal-IQA: Towards the Generalization of Image Quality Assessment Based on Causal Inference | Unknown | N/A | |
| On the Emergence of Cross-Task Linearity in Pretraining-Finetuning Paradigm | Unknown | N/A | |
| Finite-Time Convergence and Sample Complexity of Actor-Critic Multi-Objective Reinforcement Learning | Unknown | N/A | |
| Pedestrian Attribute Recognition as Label-balanced Multi-label Learning | Unknown | N/A | |
| Conformalized Adaptive Forecasting of Heterogeneous Trajectories | Unknown | N/A | |
| Sequential Kernel Goodness-of-fit Testing | Unknown | N/A | |
| RAUCA: A Novel Physical Adversarial Attack on Vehicle Detectors via Robust and Accurate Camouflage Generation | Unknown | N/A | |
| CurBench: Curriculum Learning Benchmark | Unknown | N/A | |
| GALA3D: Towards Text-to-3D Complex Scene Generation via Layout-guided Generative Gaussian Splatting | Unknown | N/A | |
| Stabilizing Policy Gradients for Stochastic Differential Equations via Consistency with Perturbation Process | Unknown | N/A | |
| DeCoOp: Robust Prompt Tuning with Out-of-Distribution Detection | Unknown | N/A | |
| ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL | Unknown | N/A | |
| Graphon Mean Field Games with a Representative Player: Analysis and Learning Algorithm | Unknown | N/A | |
| Score identity Distillation: Exponentially Fast Distillation of Pretrained Diffusion Models for One-Step Generation | Unknown | N/A | |
| Exploring Training on Heterogeneous Data with Mixture of Low-rank Adapters | Unknown | N/A | |
| Iterative Search Attribution for Deep Neural Networks | Unknown | N/A | |
| Generative Active Learning for Long-tailed Instance Segmentation | Unknown | N/A | |
| Iterative Data Smoothing: Mitigating Reward Overfitting and Overoptimization in RLHF | Unknown | N/A | |
| Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model | Unknown | N/A | |
| Switched Flow Matching: Eliminating Singularities via Switching ODEs | Unknown | N/A | |
| Toward Availability Attacks in 3D Point Clouds | Unknown | N/A | |
| Antibody Design Using a Score-based Diffusion Model Guided by Evolutionary, Physical and Geometric Constraints | Unknown | N/A | |
| Online Learning in Betting Markets: Profit versus Prediction | Unknown | N/A | |
| Dynamic Evaluation of Large Language Models by Meta Probing Agents | Unknown | N/A | |
| CRoFT: Robust Fine-Tuning with Concurrent Optimization for OOD Generalization and Open-Set OOD Detection | Unknown | N/A | |
| Translation Equivariant Transformer Neural Processes | Unknown | N/A | |
| Language Models Represent Beliefs of Self and Others | Unknown | N/A | |
| Stealthy Imitation: Reward-guided Environment-free Policy Stealing | Unknown | N/A | |
| Reinformer: Max-Return Sequence Modeling for Offline RL | Unknown | N/A | |
| Towards Efficient Spiking Transformer: a Token Sparsification Framework for Training and Inference Acceleration | Unknown | N/A | |
| Converting Transformers to Polynomial Form for Secure Inference Over Homomorphic Encryption | Unknown | N/A | |
| Viewing Transformers Through the Lens of Long Convolutions Layers | Unknown | N/A | |
| Safety Fine-Tuning at (Almost) No Cost: A Baseline for Vision Large Language Models | Unknown | N/A | |
| Compositional Few-Shot Class-Incremental Learning | Unknown | N/A | |
| BiE: Bi-Exponent Block Floating-Point for Large Language Models Quantization | Unknown | N/A | |
| Improving Equivariant Graph Neural Networks on Large Geometric Graphs via Virtual Nodes Learning | Unknown | N/A | |
| REST: Efficient and Accelerated EEG Seizure Analysis through Residual State Updates | Unknown | N/A | |
| Amend to Alignment: Decoupled Prompt Tuning for Mitigating Spurious Correlation in Vision-Language Models | Unknown | N/A | |
| Visual Representation Learning with Stochastic Frame Prediction | Unknown | N/A | |
| Exploration and Anti-Exploration with Distributional Random Network Distillation | Unknown | N/A | |
| Position: Is machine learning good or bad for the natural sciences? | Unknown | N/A | |
| Multi-class Probabilistic Bounds for Majority Vote Classifiers with Partially Labeled Data | Unknown | N/A | |
| Sample Complexity Bounds for Estimating Probability Divergences under Invariances | Unknown | N/A | |
| AquaLoRA: Toward White-box Protection for Customized Stable Diffusion Models via Watermark LoRA | Unknown | N/A | |
| On the Embedding Collapse when Scaling up Recommendation Models | Unknown | N/A | |
| Revisiting Character-level Adversarial Attacks for Language Models | Unknown | N/A | |
| Position: Quo Vadis, Unsupervised Time Series Anomaly Detection? | Unknown | N/A | |
| Scaling Beyond the GPU Memory Limit for Large Mixture-of-Experts Model Training | Unknown | N/A | |
| Language Agents with Reinforcement Learning for Strategic Play in the Werewolf Game | Unknown | N/A | |
| COPAL: Continual Pruning in Large Language Generative Models | Unknown | N/A | |
| Position: Understanding LLMs Requires More Than Statistical Generalization | Unknown | N/A | |
| Understanding Inter-Concept Relationships in Concept-Based Models | Unknown | N/A | |
| Online Isolation Forest | Unknown | N/A | |
| Multimodal Prototyping for cancer survival prediction | Unknown | N/A | |
| Differentially Private Sum-Product Networks | Unknown | N/A | |
| Log Neural Controlled Differential Equations: The Lie Brackets Make A Difference | Unknown | N/A | |
| A Dense Reward View on Aligning Text-to-Image Diffusion with Preference | Unknown | N/A | |
| Improved Differentially Private and Lazy Online Convex Optimization: Lower Regret without Smoothness Requirements | Unknown | N/A | |
| MorphGrower: A Synchronized Layer-by-layer Growing Approach for Plausible Neuronal Morphology Generation | Unknown | N/A | |
| Expand-and-Cluster: Parameter Recovery of Neural Networks | Unknown | N/A | |
| REMEDI: Corrective Transformations for Improved Neural Entropy Estimation | Unknown | N/A | |
| Principled Penalty-based Methods for Bilevel Reinforcement Learning and RLHF | Unknown | N/A | |
| Momentor: Advancing Video Large Language Model with Fine-Grained Temporal Reasoning | Unknown | N/A | |
| Counterfactual Metarules for Local and Global Recourse | Unknown | N/A | |
| UGrid: An Efficient-And-Rigorous Neural Multigrid Solver for Linear PDEs | Unknown | N/A | |
| RLAIF vs. RLHF: Scaling Reinforcement Learning from Human Feedback with AI Feedback | Unknown | N/A | |
| Optimal Recurrent Network Topologies for Dynamical Systems Reconstruction | Unknown | N/A | |
| Unmasking Vulnerabilities: Cardinality Sketches under Adaptive Inputs | Unknown | N/A | |
| StackSight: Unveiling WebAssembly through Large Language Models and Neurosymbolic Chain-of-Thought Decompilation | Unknown | N/A | |
| Disentangled Continual Graph Neural Architecture Search with Invariant Modular Supernet | Unknown | N/A | |
| Disentangled Graph Self-supervised Learning for Out-of-Distribution Generalization | Unknown | N/A | |
| Knowledge-aware Reinforced Language Models for Protein Directed Evolution | Unknown | N/A | |
| Adapt and Diffuse: Sample-adaptive Reconstruction via Latent Diffusion Models | Unknown | N/A | |
| On the Effectiveness of Supervision in Asymmetric Non-Contrastive Learning | Unknown | N/A | |
| DiracDiffusion: Denoising and Incremental Reconstruction with Assured Data-Consistency | Unknown | N/A | |
| DNCs Require More Planning Steps | Unknown | N/A | |
| I/O Complexity of Attention, or How Optimal is FlashAttention? | Unknown | N/A | |
| Position: Data Authenticity, Consent, & Provenance for AI are all broken: what will it take to fix them? | Unknown | N/A | |
| Lookbehind-SAM: k steps back, 1 step forward | Unknown | N/A | |
| A Distributional Analogue to the Successor Representation | Unknown | N/A | |
| From Neurons to Neutrons: A Case Study in Interpretability | Unknown | N/A | |
| A Theoretical Analysis of Backdoor Poisoning Attacks in Convolutional Neural Networks | Unknown | N/A | |
| Do Transformer World Models Give Better Policy Gradients? | Unknown | N/A | |
| Conformal prediction for multi-dimensional time series by ellipsoidal sets | Unknown | N/A | |
| Conditionally-Conjugate Gaussian Process Factor Analysis for Spike Count Data via Data Augmentation | Unknown | N/A | |
| Understanding the Training Speedup from Sampling with Approximate Losses | Unknown | N/A | |
| Adaptive Horizon Actor-Critic for Policy Learning in Contact-Rich Differentiable Simulation | Unknown | N/A | |
| Skill Set Optimization: Reinforcing Language Model Behavior via Transferable Skills | Unknown | N/A | |
| Mathematical Framework for Online Social Media Auditing | Unknown | N/A | |
| Low-Rank Similarity Mining for Multimodal Dataset Distillation | Unknown | N/A | |
| MC-GTA: Metric-Constrained Model-Based Clustering using Goodness-of-fit Tests with Autocorrelations | Unknown | N/A | |
| Enforcing Constraints in RNA Secondary Structure Predictions: A Post-Processing Framework Based on the Assignment Problem | Unknown | N/A | |
| Reshape and Adapt for Output Quantization (RAOQ): Quantization-aware Training for In-memory Computing Systems | Unknown | N/A | |
| What Improves the Generalization of Graph Transformers? A Theoretical Dive into the Self-attention and Positional Encoding | Unknown | N/A | |
| How Do Nonlinear Transformers Learn and Generalize in In-Context Learning? | Unknown | N/A | |
| Catapults in SGD: spikes in the training loss and their impact on generalization through feature learning | Unknown | N/A | |
| ProtoGate: Prototype-based Neural Networks with Global-to-local Feature Selection for Tabular Biomedical Data | Unknown | N/A | |
| Controllable Prompt Tuning For Balancing Group Distributional Robustness | Unknown | N/A | |
| Position: Scarce Resource Allocations That Rely On Machine Learning Should Be Randomized | Unknown | N/A | |
| Compositional Text-to-Image Generation with Dense Blob Representations | Unknown | N/A | |
| Learning High-Frequency Functions Made Easy with Sinusoidal Positional Encoding | Unknown | N/A | |
| Deep Functional Factor Models: Forecasting High-Dimensional Functional Time Series via Bayesian Nonparametric Factorization | Unknown | N/A | |
| A Neural-Preconditioned Poisson Solver for Mixed Dirichlet and Neumann Boundary Conditions | Unknown | N/A | |
| Diffusion Posterior Sampling is Computationally Intractable | Unknown | N/A | |
| PerceptAnon: Exploring the Human Perception of Image Anonymization Beyond Pseudonymization for GDPR | Unknown | N/A | |
| Do Topological Characteristics Help in Knowledge Distillation? | Unknown | N/A | |
| Stochastic Optimization with Arbitrary Recurrent Data Sampling | Unknown | N/A | |
| Partially Stochastic Infinitely Deep Bayesian Neural Networks | Unknown | N/A | |
| Neuro-Visualizer: A Novel Auto-Encoder-Based Loss Landscape Visualization Method With an Application in Knowledge-Guided Machine Learning | Unknown | N/A | |
| Discovering Bias in Latent Space: An Unsupervised Debiasing Approach | Unknown | N/A | |
| Centralized Selection with Preferences in the Presence of Biases | Unknown | N/A | |
| Generalizing Knowledge Graph Embedding with Universal Orthogonal Parameterization | Unknown | N/A | |
| DySLIM: Dynamics Stable Learning by Invariant Measure for Chaotic Systems | Unknown | N/A | |
| Unveiling and Harnessing Hidden Attention Sinks: Enhancing Large Language Models without Training through Attention Calibration | Unknown | N/A | |
| BAT: Learning to Reason about Spatial Sounds with Large Language Models | Unknown | N/A | |
| Rethinking Transformers in Solving POMDPs | Unknown | N/A | |
| Symmetric Replay Training: Enhancing Sample Efficiency in Deep Reinforcement Learning for Combinatorial Optimization | Unknown | N/A | |
| From Biased Selective Labels to Pseudo-Labels: An Expectation-Maximization Framework for Learning from Biased Decisions | Unknown | N/A | |
| Embodied CoT Distillation From LLM To Off-the-shelf Agents | Unknown | N/A | |
| A General Framework for Sequential Decision-Making under Adaptivity Constraints | Unknown | N/A | |
| How Does Goal Relabeling Improve Sample Efficiency? | Unknown | N/A | |
| Theory of Consistency Diffusion Models: Distribution Estimation Meets Fast Sampling | Unknown | N/A | |
| RoboCodeX: Multimodal Code Generation for Robotic Behavior Synthesis | Unknown | N/A | |
| Enhancing Adversarial Robustness in SNNs with Sparse Gradients | Unknown | N/A | |
| Layerwise Change of Knowledge in Neural Networks | Unknown | N/A | |
| Analysis for Abductive Learning and Neural-Symbolic Reasoning Shortcuts | Unknown | N/A | |
| On Computational Limits of Modern Hopfield Models: A Fine-Grained Complexity Analysis | Unknown | N/A | |
| Use Your INSTINCT: INSTruction optimization for LLMs usIng Neural bandits Coupled with Transformers | Unknown | N/A | |
| EMC$^2$: Efficient MCMC Negative Sampling for Contrastive Learning with Global Convergence | Unknown | N/A | |
| Smoothness Adaptive Hypothesis Transfer Learning | Unknown | N/A | |
| Envisioning Outlier Exposure by Large Language Models for Out-of-Distribution Detection | Unknown | N/A | |
| Tilt your Head: Activating the Hidden Spatial-Invariance of Classifiers | Unknown | N/A | |
| WISER: Weak Supervision and Supervised Representation Learning to Improve Drug Response Prediction in Cancer | Unknown | N/A | |
| Interacting Diffusion Processes for Event Sequence Forecasting | Unknown | N/A | |
| Recurrent Early Exits for Federated Learning with Heterogeneous Clients | Unknown | N/A | |
| On Interpolating Experts and Multi-Armed Bandits | Unknown | N/A | |
| Individual Contributions as Intrinsic Exploration Scaffolds for Multi-agent Reinforcement Learning | Unknown | N/A | |
| Revitalizing Multivariate Time Series Forecasting: Learnable Decomposition with Inter-Series Dependencies and Intra-Series Variations Modeling | Unknown | N/A | |
| NeuralIndicator: Implicit Surface Reconstruction from Neural Indicator Priors | Unknown | N/A | |
| On the Calibration of Human Pose Estimation | Unknown | N/A | |
| MMT-Bench: A Comprehensive Multimodal Benchmark for Evaluating Large Vision-Language Models Towards Multitask AGI | Unknown | N/A | |
| Size-invariance Matters: Rethinking Metrics and Losses for Imbalanced Multi-object Salient Object Detection | Unknown | N/A | |
| The Surprising Effectiveness of Skip-Tuning in Diffusion Sampling | Unknown | N/A | |
| RVI-SAC: Average Reward Off-Policy Deep Reinforcement Learning | Unknown | N/A | |
| Smooth Tchebycheff Scalarization for Multi-Objective Optimization | Unknown | N/A | |
| DFD: Distilling the Feature Disparity Differently for Detectors | Unknown | N/A | |
| Evolution of Heuristics: Towards Efficient Automatic Algorithm Design Using Large Language Model | Unknown | N/A | |
| In-Context Unlearning: Language Models as Few-Shot Unlearners | Unknown | N/A | |
| Reward Model Learning vs. Direct Policy Optimization: A Comparative Analysis of Learning from Human Preferences | Unknown | N/A | |
| Agent-Specific Effects: A Causal Effect Propagation Analysis in Multi-Agent MDPs | Unknown | N/A | |
| KernelSHAP-IQ: Weighted Least Square Optimization for Shapley Interactions | Unknown | N/A | |
| Reference Neural Operators: Learning the Smooth Dependence of Solutions of PDEs on Geometric Deformations | Unknown | N/A | |
| Auto-Linear Phenomenon in Subsurface Imaging | Unknown | N/A | |
| Accelerating Parallel Sampling of Diffusion Models | Unknown | N/A | |
| From Words to Actions: Unveiling the Theoretical Underpinnings of LLM-Driven Autonomous Systems | Unknown | N/A | |
| Acquiring Diverse Skills using Curriculum Reinforcement Learning with Mixture of Experts | Unknown | N/A | |
| DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models (Exemplified as A Video Agent) | Unknown | N/A | |
| Distributed Bilevel Optimization with Communication Compression | Unknown | N/A | |
| Inherent Trade-Offs between Diversity and Stability in Multi-Task Benchmarks | Unknown | N/A | |
| On the Minimal Degree Bias in Generalization on the Unseen for non-Boolean Functions | Unknown | N/A | |
| On the Weight Dynamics of Deep Normalized Networks | Unknown | N/A | |
| Jacobian Regularizer-based Neural Granger Causality | Unknown | N/A | |
| Diffusion Tempering Improves Parameter Estimation with Probabilistic Integrators for Ordinary Differential Equations | Unknown | N/A | |
| Projecting Molecules into Synthesizable Chemical Spaces | Unknown | N/A | |
| A Human-Inspired Reading Agent with Gist Memory of Very Long Contexts | Unknown | N/A | |
| Energy-based Backdoor Defense without Task-Specific Samples and Model Retraining | Unknown | N/A | |
| Break the Sequential Dependency of LLM Inference Using Lookahead Decoding | Unknown | N/A | |
| Don’t Label Twice: Quantity Beats Quality when Comparing Binary Classifiers on a Budget | Unknown | N/A | |
| Causal Inference from Competing Treatments | Unknown | N/A | |
| Denoising Autoregressive Representation Learning | Unknown | N/A | |
| On a Neural Implementation of Brenier's Polar Factorization | Unknown | N/A | |
| Causally Motivated Personalized Federated Invariant Learning with Shortcut-Averse Information-Theoretic Regularization | Unknown | N/A | |
| Privacy Attacks in Decentralized Learning | Unknown | N/A | |
| Membership Inference Attacks on Diffusion Models via Quantile Regression | Unknown | N/A | |
| Hybrid Neural Representations for Spherical Data | Unknown | N/A | |
| Premise Order Matters in Reasoning with Large Language Models | Unknown | N/A | |
| Graph As Point Set | Unknown | N/A | |
| Mastering Zero-Shot Interactions in Cooperative and Competitive Simultaneous Games | Unknown | N/A | |
| SparQ Attention: Bandwidth-Efficient LLM Inference | Unknown | N/A | |
| An Analysis of Linear Time Series Forecasting Models | Unknown | N/A | |
| Adaptive Stabilization Based on Machine Learning for Column Generation | Unknown | N/A | |
| PAC-Bayesian Generalization Bounds for Knowledge Graph Representation Learning | Unknown | N/A | |
| Is In-Context Learning in Large Language Models Bayesian? A Martingale Perspective | Unknown | N/A | |
| Accelerating Legacy Numerical Solvers by Non-intrusive Gradient-based Meta-solving | Unknown | N/A | |
| Mastering Robot Manipulation with Multimodal Prompts through Pretraining and Multi-task Fine-tuning | Unknown | N/A | |
| Optimizing Watermarks for Large Language Models | Unknown | N/A | |
| A Probabilistic Approach to Learning the Degree of Equivariance in Steerable CNNs | Unknown | N/A | |
| Selective Mixup Helps with Distribution Shifts, But Not (Only) because of Mixup | Unknown | N/A | |
| Hieros: Hierarchical Imagination on Structured State Space Sequence World Models | Unknown | N/A | |
| Improved Stability and Generalization Guarantees of the Decentralized SGD Algorithm | Unknown | N/A | |
| Equivariant Diffusion for Crystal Structure Prediction | Unknown | N/A | |
| Online Variational Sequential Monte Carlo | Unknown | N/A | |
| Generalized Preference Optimization: A Unified Approach to Offline Alignment | Unknown | N/A | |
| Leveraging VLM-Based Pipelines to Annotate 3D Objects | Unknown | N/A | |
| Sequential Asynchronous Action Coordination in Multi-Agent Systems: A Stackelberg Decision Transformer Approach | Unknown | N/A | |
| MS$^3$D: A RG Flow-Based Regularization for GAN Training with Limited Data | Unknown | N/A | |
| DiffDA: a Diffusion model for weather-scale Data Assimilation | Unknown | N/A | |
| Understanding Heterophily for Graph Neural Networks | Unknown | N/A | |
| The Stronger the Diffusion Model, the Easier the Backdoor: Data Poisoning to Induce Copyright Breaches Without Adjusting Finetuning Pipeline | Unknown | N/A | |
| Dynamic Spectral Clustering with Provable Approximation Guarantee | Unknown | N/A | |
| Adversarially Robust Deep Multi-View Clustering: A Novel Attack and Defense Framework | Unknown | N/A | |
| PEARL: Zero-shot Cross-task Preference Alignment and Robust Reward Learning for Robotic Manipulation | Unknown | N/A | |
| On Stronger Computational Separations Between Multimodal and Unimodal Machine Learning | Unknown | N/A | |
| Rich-Observation Reinforcement Learning with Continuous Latent Dynamics | Unknown | N/A | |
| Ai-sampler: Adversarial Learning of Markov kernels with involutive maps | Unknown | N/A | |
| Unifying Bayesian Flow Networks and Diffusion Models through Stochastic Differential Equations | Unknown | N/A | |
| Sparse Dimensionality Reduction Revisited | Unknown | N/A | |
| A New Theoretical Perspective on Data Heterogeneity in Federated Optimization | Unknown | N/A | |
| Subhomogeneous Deep Equilibrium Models | Unknown | N/A | |
| Speech Self-Supervised Learning Using Diffusion Model Synthetic Data | Unknown | N/A | |
| RoboDreamer: Learning Compositional World Models for Robot Imagination | Unknown | N/A | |
| ContPhy: Continuum Physical Concept Learning and Reasoning from Videos | Unknown | N/A | |
| 3D-VLA: A 3D Vision-Language-Action Generative World Model | Unknown | N/A | |
| RoboGen: Towards Unleashing Infinite Data for Automated Robot Learning via Generative Simulation | Unknown | N/A | |
| Model Tailor: Mitigating Catastrophic Forgetting in Multi-modal Large Language Models | Unknown | N/A | |
| Compositional Image Decomposition with Diffusion Models | Unknown | N/A | |
| Diffusion Rejection Sampling | Unknown | N/A | |
| Information-Directed Pessimism for Offline Reinforcement Learning | Unknown | N/A | |
| Exploring the Enigma of Neural Dynamics Through A Scattering-Transform Mixer Landscape for Riemannian Manifold | Unknown | N/A | |
| Partial Optimality in the Linear Ordering Problem | Unknown | N/A | |
| Improved Modelling of Federated Datasets using Mixtures-of-Dirichlet-Multinomials | Unknown | N/A | |
| Differentially Private Domain Adaptation with Theoretical Guarantees | Unknown | N/A | |
| Differentially Private Worst-group Risk Minimization | Unknown | N/A | |
| Time Series Diffusion in the Frequency Domain | Unknown | N/A | |
| FreeBind: Free Lunch in Unified Multimodal Space via Knowledge Fusion | Unknown | N/A | |
| Position: Optimization in SciML Should Employ the Function Space Geometry | Unknown | N/A | |
| Memory Efficient Neural Processes via Constant Memory Attention Block | Unknown | N/A | |
| Box Facets and Cut Facets of Lifted Multicut Polytopes | Unknown | N/A | |
| Improving Computational Complexity in Statistical Models with Local Curvature Information | Unknown | N/A | |
| Online Algorithms with Uncertainty-Quantified Predictions | Unknown | N/A | |
| A Statistical Framework for Data-dependent Retrieval-Augmented Models | Unknown | N/A | |
| A Fresh Take on Stale Embeddings: Improving Dense Retriever Training with Corrector Networks | Unknown | N/A | |
| Trained Random Forests Completely Reveal your Dataset | Unknown | N/A | |
| Generalization Analysis of Deep Non-linear Matrix Completion | Unknown | N/A | |
| Triplet Interaction Improves Graph Transformers: Accurate Molecular Graph Learning with Triplet Graph Transformers | Unknown | N/A | |
| R2E: Turning any Github Repository into a Programming Agent Environment | Unknown | N/A | |
| Enhancing Value Function Estimation through First-Order State-Action Dynamics in Offline Reinforcement Learning | Unknown | N/A | |
| Accelerated Policy Gradient: On the Convergence Rates of the Nesterov Momentum for Reinforcement Learning | Unknown | N/A | |
| Verification of Machine Unlearning is Fragile | Unknown | N/A | |
| Towards Certified Unlearning for Deep Neural Networks | Unknown | N/A | |
| DisCo-Diff: Enhancing Continuous Diffusion Models with Discrete Latents | Unknown | N/A | |
| Dirichlet Flow Matching with Applications to DNA Sequence Design | Unknown | N/A | |
| Reward Shaping for Reinforcement Learning with An Assistant Reward Agent | Unknown | N/A | |
| Human Alignment of Large Language Models through Online Preference Optimisation | Unknown | N/A | |
| Flora: Low-Rank Adapters Are Secretly Gradient Compressors | Unknown | N/A | |
| Expressivity and Generalization: Fragment-Biases for Molecular GNNs | Unknown | N/A | |
| SelfIE: Self-Interpretation of Large Language Model Embeddings | Unknown | N/A | |
| Listenable Maps for Audio Classifiers | Unknown | N/A | |
| QBMK: Quantum-based Matching Kernels for Un-attributed Graphs | Unknown | N/A | |
| Unsupervised Parameter-free Simplicial Representation Learning with Scattering Transforms | Unknown | N/A | |
| Regularized Q-learning through Robust Averaging | Unknown | N/A | |
| Sparsest Models Elude Pruning: An Exposé of Pruning’s Current Capabilities | Unknown | N/A | |
| Efficient Precision and Recall Metrics for Assessing Generative Models using Hubness-aware Sampling | Unknown | N/A | |
| Reprompting: Automated Chain-of-Thought Prompt Inference Through Gibbs Sampling | Unknown | N/A | |
| Non-Asymptotic Analysis for Single-Loop (Natural) Actor-Critic with Compatible Function Approximation | Unknown | N/A | |
| Position: Considerations for Differentially Private Learning with Large-Scale Public Pretraining | Unknown | N/A | |
| Differentially Private Post-Processing for Fair Regression | Unknown | N/A | |
| An Empirical Study Into What Matters for Calibrating Vision-Language Models | Unknown | N/A | |
| Position: Automatic Environment Shaping is the Next Frontier in RL | Unknown | N/A | |
| LLM Maybe LongLM: SelfExtend LLM Context Window Without Tuning | Unknown | N/A | |
| Evaluating Model Bias Requires Characterizing its Mistakes | Unknown | N/A | |
| Probabilistic Subgoal Representations for Hierarchical Reinforcement Learning | Unknown | N/A | |
| LAGMA: LAtent Goal-guided Multi-Agent Reinforcement Learning | Unknown | N/A | |
| How do Large Language Models Navigate Conflicts between Honesty and Helpfulness? | Unknown | N/A | |
| Image Fusion via Vision-Language Model | Unknown | N/A | |
| Sharp Rates in Dependent Learning Theory: Avoiding Sample Size Deflation for the Square Loss | Unknown | N/A | |
| Knowledge Transfer from Vision Foundation Models for Efficient Training of Small Task-specific Models | Unknown | N/A | |
| A Computational Framework for Solving Wasserstein Lagrangian Flows | Unknown | N/A | |
| Contextual Feature Selection with Conditional Stochastic Gates | Unknown | N/A | |
| Weisfeiler Leman for Euclidean Equivariant Machine Learning | Unknown | N/A | |
| Learning to Scale Logits for Temperature-Conditional GFlowNets | Unknown | N/A | |
| Learning in Feature Spaces via Coupled Covariances: Asymmetric Kernel SVD and Nyström method | Unknown | N/A | |
| Reinforcement Learning and Regret Bounds for Admission Control | Unknown | N/A | |
| Large Scale Dataset Distillation with Domain Shift | Unknown | N/A | |
| OptiMUS: Scalable Optimization Modeling with (MI)LP Solvers and Large Language Models | Unknown | N/A | |
| Autoformalizing Euclidean Geometry | Unknown | N/A | |
| Efficient Policy Evaluation with Offline Data Informed Behavior Policy Design | Unknown | N/A | |
| QuIP$\#$: Even Better LLM Quantization with Hadamard Incoherence and Lattice Codebooks | Unknown | N/A | |
| Harmony in Diversity: Merging Neural Networks with Canonical Correlation Analysis | Unknown | N/A | |
| Probabilistic Inference in Language Models via Twisted Sequential Monte Carlo | Unknown | N/A | |
| Revealing Vision-Language Integration in the Brain with Multimodal Networks | Unknown | N/A | |
| COLD-Attack: Jailbreaking LLMs with Stealthiness and Controllability | Unknown | N/A | |
| Learning Optimal Projection for Forecast Reconciliation of Hierarchical Time Series | Unknown | N/A | |
| Saliency strikes back: How filtering out high frequencies improves white-box explanations | Unknown | N/A | |
| Switching the Loss Reduces the Cost in Batch Reinforcement Learning | Unknown | N/A | |
| Sampling-based Multi-dimensional Recalibration | Unknown | N/A | |
| NExT: Teaching Large Language Models to Reason about Code Execution | Unknown | N/A | |
| Time Weaver: A Conditional Time Series Generation Model | Unknown | N/A | |
| Hierarchical Novelty Detection via Fine-Grained Evidence Allocation | Unknown | N/A | |
| Executable Code Actions Elicit Better LLM Agents | Unknown | N/A | |
| Weak-to-Strong Generalization: Eliciting Strong Capabilities With Weak Supervision | Unknown | N/A | |
| PANDA: Expanded Width-Aware Message Passing Beyond Rewiring | Unknown | N/A | |
| Towards Efficient Training and Evaluation of Robust Models against $l_0$ Bounded Adversarial Perturbations | Unknown | N/A | |
| Position: AI-Powered Autonomous Weapons Risk Geopolitical Instability and Threaten AI Research | Unknown | N/A | |
| Harnessing Hierarchical Label Distribution Variations in Test Agnostic Long-tail Recognition | Unknown | N/A | |
| MMPareto: Boosting Multimodal Learning with Innocent Unimodal Assistance | Unknown | N/A | |
| Uniformly Stable Algorithms for Adversarial Training and Beyond | Unknown | N/A | |
| Exploration-Driven Policy Optimization in RLHF: Theoretical Insights on Efficient Data Utilization | Unknown | N/A | |
| Position: Intent-aligned AI Systems Must Optimize for Agency Preservation | Unknown | N/A | |
| Model-Based Minimum Bayes Risk Decoding for Text Generation | Unknown | N/A | |
| On the Convergence of Projected Bures-Wasserstein Gradient Descent under Euclidean Strong Convexity | Unknown | N/A | |
| Minimum-Norm Interpolation Under Covariate Shift | Unknown | N/A | |
| Variance-reduced Zeroth-Order Methods for Fine-Tuning Language Models | Unknown | N/A | |
| Collage: Light-Weight Low-Precision Strategy for LLM Training | Unknown | N/A | |
| SelfVC: Voice Conversion With Iterative Refinement using Self Transformations | Unknown | N/A | |
| Evaluating Quantized Large Language Models | Unknown | N/A | |
| PhAST: Physics-Aware, Scalable, and Task-Specific GNNs for Accelerated Catalyst Design | Unknown | N/A | |
| First-Order Manifold Data Augmentation for Regression Learning | Unknown | N/A | |
| Recurrent Distance Filtering for Graph Representation Learning | Unknown | N/A | |
| Zero-Shot Unsupervised and Text-Based Audio Editing Using DDPM Inversion | Unknown | N/A | |
| Feel-Good Thompson Sampling for Contextual Dueling Bandits | Unknown | N/A | |
| Causal Discovery with Fewer Conditional Independence Tests | Unknown | N/A | |
| What is the Long-Run Distribution of Stochastic Gradient Descent? A Large Deviations Analysis | Unknown | N/A | |
| Exploiting Human-AI Dependence for Learning to Defer | Unknown | N/A | |
| Scene Graph Generation Strategy with Co-occurrence Knowledge and Learnable Term Frequency | Unknown | N/A | |
| Towards General Algorithm Discovery for Combinatorial Optimization: Learning Symbolic Branching Policy from Bipartite Graph | Unknown | N/A | |
| Gradient Compressed Sensing: A Query-Efficient Gradient Estimator for High-Dimensional Zeroth-Order Optimization | Unknown | N/A | |
| Fundamental Limitations of Alignment in Large Language Models | Unknown | N/A | |
| Flexible Residual Binarization for Image Super-Resolution | Unknown | N/A | |
| SFC: Achieve Accurate Fast Convolution under Low-precision Arithmetic | Unknown | N/A | |
| Predicting Dose-Response Curves with Deep Neural Networks | Unknown | N/A | |
| Provably Efficient Long-Horizon Exploration in Monte Carlo Tree Search through State Occupancy Regularization | Unknown | N/A | |
| Causal Effect Identification in LiNGAM Models with Latent Confounders | Unknown | N/A | |
| Transformers, parallel computation, and logarithmic depth | Unknown | N/A | |
| Extending Adversarial Attacks to Produce Adversarial Class Probability Distributions | Unknown | N/A | |
| Tilt and Average: Geometric Adjustment of the Last Layer for Recalibration | Unknown | N/A | |
| Weakly-Supervised Residual Evidential Learning for Multi-Instance Uncertainty Estimation | Unknown | N/A | |
| Sliding Down the Stairs: How Correlated Latent Variables Accelerate Learning with Neural Networks | Unknown | N/A | |
| Helpful or Harmful Data? Fine-tuning-free Shapley Attribution for Explaining Language Model Predictions | Unknown | N/A | |
| Pessimism Meets Risk: Risk-Sensitive Offline Reinforcement Learning | Unknown | N/A | |
| Rotational Equilibrium: How Weight Decay Balances Learning Across Neural Networks | Unknown | N/A | |
| Implicit Representations for Constrained Image Segmentation | Unknown | N/A | |
| Integrating Multimodal Data for Joint Generative Modeling of Complex Dynamics | Unknown | N/A | |
| When is Transfer Learning Possible? | Unknown | N/A | |
| Improving Context Understanding in Multimodal Large Language Models via Multimodal Composition Learning | Unknown | N/A | |
| Self-Consistency Training for Density-Functional-Theory Hamiltonian Prediction | Unknown | N/A | |
| InferCept: Efficient Intercept Support for Augmented Large Language Model Inference | Unknown | N/A | |
| Scalable Safe Policy Improvement for Factored Multi-Agent MDPs | Unknown | N/A | |
| Local Feature Selection without Label or Feature Leakage for Interpretable Machine Learning Predictions | Unknown | N/A | |
| Generalization Analysis for Multi-Label Learning | Unknown | N/A | |
| On the sample complexity of conditional independence testing with Von Mises estimator with application to causal discovery | Unknown | N/A | |
| Coarse-To-Fine Tensor Trains for Compact Visual Representations | Unknown | N/A | |
| How Smooth Is Attention? | Unknown | N/A | |
| Efficient Low-Rank Matrix Estimation, Experimental Design, and Arm-Set-Dependent Low-Rank Bandits | Unknown | N/A | |
| Lightweight Image Super-Resolution via Flexible Meta Pruning | Unknown | N/A | |
| On the Nonlinearity of Layer Normalization | Unknown | N/A | |
| KnowFormer: Revisiting Transformers for Knowledge Graph Reasoning | Unknown | N/A | |
| Fundamental Limits of Distributed Covariance Matrix Estimation Under Communication Constraints | Unknown | N/A | |
| Sign Rank Limitations for Inner Product Graph Decoders | Unknown | N/A | |
| LoCoCo: Dropping In Convolutions for Long Context Compression | Unknown | N/A | |
| MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases | Unknown | N/A | |
| GenCO: Generating Diverse Designs with Combinatorial Constraints | Unknown | N/A | |
| TravelPlanner: A Benchmark for Real-World Planning with Language Agents | Unknown | N/A | |
| GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection | Unknown | N/A | |
| Dense Reward for Free in Reinforcement Learning from Human Feedback | Unknown | N/A | |
| Training-Free Long-Context Scaling of Large Language Models | Unknown | N/A | |
| MD tree: a model-diagnostic tree grown on loss landscape | Unknown | N/A | |
| ReLU Network with Width $d+\mathcal{O}(1)$ Can Achieve Optimal Approximation Rate | Unknown | N/A | |
| Characterizing ResNet's Universal Approximation Capability | Unknown | N/A | |
| Position: Social Choice Should Guide AI Alignment in Dealing with Diverse Human Feedback | Unknown | N/A | |
| PARDEN, Can You Repeat That? Defending against Jailbreaks via Repetition | Unknown | N/A | |
| Minimum Norm Interpolation Meets The Local Theory of Banach Spaces | Unknown | N/A | |
| Individual Fairness in Graph Decomposition | Unknown | N/A | |
| SciBench: Evaluating College-Level Scientific Problem-Solving Abilities of Large Language Models | Unknown | N/A | |
| Craftax: A Lightning-Fast Benchmark for Open-Ended Reinforcement Learning | Unknown | N/A | |
| Bivariate Causal Discovery using Bayesian Model Selection | Unknown | N/A | |
| BRAIn: Bayesian Reward-conditioned Amortized Inference for natural language generation from feedback | Unknown | N/A | |
| Adaptive Online Experimental Design for Causal Discovery | Unknown | N/A | |
| Joint Composite Latent Space Bayesian Optimization | Unknown | N/A | |
| Breaking the Barrier: Enhanced Utility and Robustness in Smoothed DRL Agents | Unknown | N/A | |
| Spectral Phase Transition and Optimal PCA in Block-Structured Spiked Models | Unknown | N/A | |
| A Bayesian Approach to Online Planning | Unknown | N/A | |
| Theoretical insights for diffusion guidance: A case study for Gaussian mixture models | Unknown | N/A | |
| Uniform Memory Retrieval with Larger Capacity for Modern Hopfield Models | Unknown | N/A | |
| Provable Representation with Efficient Planning for Partially Observable Reinforcement Learning | Unknown | N/A | |
| Position: Video as the New Language for Real-World Decision Making | Unknown | N/A | |
| Agent Instructs Large Language Models to be General Zero-Shot Reasoners | Unknown | N/A | |
| Aligning Transformers with Weisfeiler-Leman | Unknown | N/A | |
| Position: Will we run out of data? Limits of LLM scaling based on human-generated data | Unknown | N/A | |
| Learning Linear Block Error Correction Codes | Unknown | N/A | |
| ACPO: A Policy Optimization Algorithm for Average MDPs with Constraints | Unknown | N/A | |
| Local vs. Global Interpretability: A Computational Complexity Perspective | Unknown | N/A | |
| A Study of First-Order Methods with a Deterministic Relative-Error Gradient Oracle | Unknown | N/A | |
| Fault Tolerant ML: Efficient Meta-Aggregation and Synchronous Training | Unknown | N/A | |
| Private and Federated Stochastic Convex Optimization: Efficient Strategies for Centralized Systems | Unknown | N/A | |
| Efficient Value Iteration for s-rectangular Robust Markov Decision Processes | Unknown | N/A | |
| Dynamic Byzantine-Robust Learning: Adapting to Switching Byzantine Workers | Unknown | N/A | |
| Retrieval-Augmented Score Distillation for Text-to-3D Generation | Unknown | N/A | |
| Parameter-Dependent Competitive Analysis for Online Capacitated Coverage Maximization through Boostings and Attenuations | Unknown | N/A | |
| Promoting External and Internal Equities Under Ex-Ante/Ex-Post Metrics in Online Resource Allocation | Unknown | N/A | |
| Eureka-Moments in Transformers: Multi-Step Tasks Reveal Softmax Induced Optimization Problems | Unknown | N/A | |
| Stochastic Q-learning for Large Discrete Action Spaces | Unknown | N/A | |
| Federated Combinatorial Multi-Agent Multi-Armed Bandits | Unknown | N/A | |
| In-Context Freeze-Thaw Bayesian Optimization for Hyperparameter Optimization | Unknown | N/A | |
| Bayesian Optimization of Function Networks with Partial Evaluations | Unknown | N/A | |
| Autoencoding Conditional Neural Processes for Representation Learning | Unknown | N/A | |
| BLO-SAM: Bi-level Optimization Based Finetuning of the Segment Anything Model for Overfitting-Preventing Semantic Segmentation | Unknown | N/A | |
| A Touch, Vision, and Language Dataset for Multimodal Alignment | Unknown | N/A | |
| Prospective Side Information for Latent MDPs | Unknown | N/A | |
| On The Complexity of First-Order Methods in Stochastic Bilevel Optimization | Unknown | N/A | |
| Neural NeRF Compression | Unknown | N/A | |
| SimPro: A Simple Probabilistic Framework Towards Realistic Long-Tailed Semi-Supervised Learning | Unknown | N/A | |
| Graph Neural Network Explanations are Fragile | Unknown | N/A | |
| AlphaFold Meets Flow Matching for Generating Protein Ensembles | Unknown | N/A | |
| Generalization Error of Graph Neural Networks in the Mean-field Regime | Unknown | N/A | |
| ConTextual: Evaluating Context-Sensitive Text-Rich Visual Reasoning in Large Multimodal Models | Unknown | N/A | |
| Explorations of Self-Repair in Language Models | Unknown | N/A | |
| Regularizing with Pseudo-Negatives for Continual Self-Supervised Learning | Unknown | N/A | |
| Block Acceleration Without Momentum: On Optimal Stepsizes of Block Gradient Descent for Least-Squares | Unknown | N/A | |
| Improving Group Robustness on Spurious Correlation Requires Preciser Group Inference | Unknown | N/A | |
| Effective Federated Graph Matching | Unknown | N/A | |
| Self-cognitive Denoising in the Presence of Multiple Noisy Label Sources | Unknown | N/A | |
| Explaining Graph Neural Networks via Structure-aware Interaction Index | Unknown | N/A | |
| GliDe with a CaPE: A Low-Hassle Method to Accelerate Speculative Decoding | Unknown | N/A | |
| Generative Conditional Distributions by Neural (Entropic) Optimal Transport | Unknown | N/A | |
| Federated Continual Learning via Prompt-based Dual Knowledge Transfer | Unknown | N/A | |
| An Interpretable Evaluation of Entropy-based Novelty of Generative Models | Unknown | N/A | |
| xT: Nested Tokenization for Larger Context in Large Images | Unknown | N/A | |
| Generative Modeling on Manifolds Through Mixture of Riemannian Diffusion Processes | Unknown | N/A | |
| Combinatorial Multivariant Multi-Armed Bandits with Applications to Episodic Reinforcement Learning and Beyond | Unknown | N/A | |
| SAM-E: Leveraging Visual Foundation Model with Sequence Imitation for Embodied Manipulation | Unknown | N/A | |
| Optimal Hessian/Jacobian-Free Nonconvex-PL Bilevel Optimization | Unknown | N/A | |
| Easing Concept Bleeding in Diffusion via Entity Localization and Anchoring | Unknown | N/A | |
| BayOTIDE: Bayesian Online Multivariate Time Series Imputation with Functional Decomposition | Unknown | N/A | |
| Toward Adaptive Reasoning in Large Language Models with Thought Rollback | Unknown | N/A | |
| When Will Gradient Regularization Be Harmful? | Unknown | N/A | |
| Sign is Not a Remedy: Multiset-to-Multiset Message Passing for Learning on Heterophilic Graphs | Unknown | N/A | |
| Iterated Denoising Energy Matching for Sampling from Boltzmann Densities | Unknown | N/A | |
| Privacy Profiles for Private Selection | Unknown | N/A | |
| From Yes-Men to Truth-Tellers: Addressing Sycophancy in Large Language Models with Pinpoint Tuning | Unknown | N/A | |
| Improving Neural Logic Machines via Failure Reflection | Unknown | N/A | |
| Less is More: on the Over-Globalizing Problem in Graph Transformers | Unknown | N/A | |
| Quantum Algorithms and Lower Bounds for Finite-Sum Optimization | Unknown | N/A | |
| Stochastic Localization via Iterative Posterior Sampling | Unknown | N/A | |
| Learning Modality Knowledge Alignment for Cross-Modality Transfer | Unknown | N/A | |
| A Nearly Optimal Single Loop Algorithm for Stochastic Bilevel Optimization under Unbounded Smoothness | Unknown | N/A | |
| Differentiable Model Scaling using Differentiable Topk | Unknown | N/A | |
| Energy-Efficient Gaussian Processes Using Low-Precision Arithmetic | Unknown | N/A | |
| LoRA Training in the NTK Regime has No Spurious Local Minima | Unknown | N/A | |
| Optimal Acceleration for Minimax and Fixed-Point Problems is Not Unique | Unknown | N/A | |
| Mol-AE: Auto-Encoder Based Molecular Representation Learning With 3D Cloze Test Objective | Unknown | N/A | |
| ESM All-Atom: Multi-Scale Protein Language Model for Unified Molecular Modeling | Unknown | N/A | |
| Pruned Pivot: Correlation Clustering Algorithm for Dynamic, Parallel, and Local Computation Models | Unknown | N/A | |
| MH-pFLID: Model Heterogeneous personalized Federated Learning via Injection and Distillation for Medical Data Analysis | Unknown | N/A | |
| Differentially Private Decentralized Learning with Random Walks | Unknown | N/A | |
| Discovering Multiple Solutions from a Single Task in Offline Reinforcement Learning | Unknown | N/A | |
| Towards Resource-friendly, Extensible and Stable Incomplete Multi-view Clustering | Unknown | N/A | |
| Adaptive Robust Learning using Latent Bernoulli Variables | Unknown | N/A | |
| Confidence Aware Inverse Constrained Reinforcement Learning | Unknown | N/A | |
| Scalable Multiple Kernel Clustering: Learning Clustering Structure from Expectation | Unknown | N/A | |
| Decouple then Classify: A Dynamic Multi-view Labeling Strategy with Shared and Specific Information | Unknown | N/A | |
| Solving Poisson Equations using Neural Walk-on-Spheres | Unknown | N/A | |
| Mean-field Underdamped Langevin Dynamics and its Spacetime Discretization | Unknown | N/A | |
| DoRA: Weight-Decomposed Low-Rank Adaptation | Unknown | N/A | |
| Flextron: Many-in-One Flexible Large Language Model | Unknown | N/A | |
| Bayesian Exploration Networks | Unknown | N/A | |
| Exact Conversion of In-Context Learning to Model Weights in Linearized-Attention Transformers | Unknown | N/A | |
| MLLM-as-a-Judge: Assessing Multimodal LLM-as-a-Judge with Vision-Language Benchmark | Unknown | N/A | |
| Prior Mismatch and Adaptation in PnP-ADMM with a Nonconvex Convergence Analysis | Unknown | N/A | |
| DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning | Unknown | N/A | |
| Learning to Compile Programs to Neural Networks | Unknown | N/A | |
| FedLMT: Tackling System Heterogeneity of Federated Learning via Low-Rank Model Training with Theoretical Guarantees | Unknown | N/A | |
| ELF: Encoding Speaker-Specific Latent Speech Feature for Speech Synthesis | Unknown | N/A | |
| Towards the Theory of Unsupervised Federated Learning: Non-asymptotic Analysis of Federated EM Algorithms | Unknown | N/A | |
| SAPG: Split and Aggregate Policy Gradients | Unknown | N/A | |
| Attack-free Evaluating and Enhancing Adversarial Robustness on Categorical Data | Unknown | N/A | |
| Bridging Environments and Language with Rendering Functions and Vision-Language Models | Unknown | N/A | |
| MoMo: Momentum Models for Adaptive Learning Rates | Unknown | N/A | |
| Multi-Fidelity Residual Neural Processes for Scalable Surrogate Modeling | Unknown | N/A | |
| Prompting4Debugging: Red-Teaming Text-to-Image Diffusion Models by Finding Problematic Prompts | Unknown | N/A | |
| Fine-grained Local Sensitivity Analysis of Standard Dot-Product Self-Attention | Unknown | N/A | |
| Gambling-Based Confidence Sequences for Bounded Random Vectors | Unknown | N/A | |
| Operator SVD with Neural Networks via Nested Low-Rank Approximation | Unknown | N/A | |
| Offline Actor-Critic Reinforcement Learning Scales to Large Models | Unknown | N/A | |
| Meta-Learners for Partially-Identified Treatment Effects Across Multiple Environments | Unknown | N/A | |
| Mitigating Oversmoothing Through Reverse Process of GNNs for Heterophilic Graphs | Unknown | N/A | |
| Hypergraph-enhanced Dual Semi-supervised Graph Classification | Unknown | N/A | |
| From Self-Attention to Markov Models: Unveiling the Dynamics of Generative Transformers | Unknown | N/A | |
| Online Linear Regression in Dynamic Environments via Discounting | Unknown | N/A | |
| PGODE: Towards High-quality System Dynamics Modeling | Unknown | N/A | |
| Uncertainty for Active Learning on Graphs | Unknown | N/A | |
| Decomposing and Editing Predictions by Modeling Model Computation | Unknown | N/A | |
| The Benefits of Reusing Batches for Gradient Descent in Two-Layer Networks: Breaking the Curse of Information and Leap Exponents | Unknown | N/A | |
| Online Learning and Information Exponents: The Importance of Batch size & Time/Complexity Tradeoffs | Unknown | N/A | |
| Asymptotics of feature learning in two-layer networks after one gradient-step | Unknown | N/A | |
| Initial Guessing Bias: How Untrained Networks Favor Some Classes | Unknown | N/A | |
| Polygonal Unadjusted Langevin Algorithms: Creating stable and efficient adaptive algorithms for neural networks | Unknown | N/A | |
| Taylor Videos for Action Recognition | Unknown | N/A | |
| On Statistical Learning Theory for Distributional Inputs | Unknown | N/A | |
| Structure-based drug design by denoising voxel grids | Unknown | N/A | |
| Towards Realistic Model Selection for Semi-supervised Learning | Unknown | N/A | |
| On the Second-Order Convergence of Biased Policy Gradient Algorithms | Unknown | N/A | |
| Finite Time Logarithmic Regret Bounds for Self-Tuning Regulation | Unknown | N/A | |
| Differentially Private Synthetic Data via Foundation Model APIs 2: Text | Unknown | N/A | |
| A Unified View of FANOVA: A Comprehensive Bayesian Framework for Component Selection and Estimation | Unknown | N/A | |
| Non-parametric Online Change Point Detection on Riemannian Manifolds | Unknown | N/A | |
| On the Unexpected Effectiveness of Reinforcement Learning for Sequential Recommendation | Unknown | N/A | |
| DFlow: A Generative Model Combining Denoising AutoEncoder and Normalizing Flow for High Fidelity Waveform Generation | Unknown | N/A | |
| On Online Experimentation without Device Identifiers | Unknown | N/A | |
| Video-LaVIT: Unified Video-Language Pre-training with Decoupled Visual-Motional Tokenization | Unknown | N/A | |
| SPABA: A Single-Loop and Probabilistic Stochastic Bilevel Algorithm Achieving Optimal Sample Complexity | Unknown | N/A | |
| Transferable Facial Privacy Protection against Blind Face Restoration via Domain-Consistent Adversarial Obfuscation | Unknown | N/A | |
| HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal | Unknown | N/A | |
| Emergent Equivariance in Deep Ensembles | Unknown | N/A | |
| Do Efficient Transformers Really Save Computation? | Unknown | N/A | |
| Sequence Compression Speeds Up Credit Assignment in Reinforcement Learning | Unknown | N/A | |
| GPTSwarm: Language Agents as Optimizable Graphs | Unknown | N/A | |
| Provably Robust DPO: Aligning Language Models with Noisy Feedback | Unknown | N/A | |
| Balanced Resonate-and-Fire Neurons | Unknown | N/A | |
| Path-Guided Particle-based Sampling | Unknown | N/A | |
| GFlowNet Training by Policy Gradients | Unknown | N/A | |
| Logistic Variational Bayes Revisited | Unknown | N/A | |
| Cross-domain Open-world Discovery | Unknown | N/A | |
| Decentralized Convex Finite-Sum Optimization with Better Dependence on Condition Numbers | Unknown | N/A | |
| Fewer Truncations Improve Language Modeling | Unknown | N/A | |
| Nesting Particle Filters for Experimental Design in Dynamical Systems | Unknown | N/A | |
| An Information-Theoretic Analysis of In-Context Learning | Unknown | N/A | |
| Balancing Feature Similarity and Label Variability for Optimal Size-Aware One-shot Subset Selection | Unknown | N/A | |
| Adaptive Advantage-Guided Policy Regularization for Offline Reinforcement Learning | Unknown | N/A | |
| Classification under Nuisance Parameters and Generalized Label Shift in Likelihood-Free Inference | Unknown | N/A | |
| A Simple Early Exiting Framework for Accelerated Sampling in Diffusion Models | Unknown | N/A | |
| Beyond the Norms: Detecting Prediction Errors in Regression Models | Unknown | N/A | |
| Improving Open-Ended Text Generation via Adaptive Decoding | Unknown | N/A | |
| A New Robust Partial p-Wasserstein-Based Metric for Comparing Distributions | Unknown | N/A | |
| Fast Timing-Conditioned Latent Audio Diffusion | Unknown | N/A | |
| Overcoming the Optimizer's Curse: Obtaining Realistic Prescriptions from Neural Networks | Unknown | N/A | |
| Density-Softmax: Efficient Test-time Model for Uncertainty Estimation and Robustness under Distribution Shifts | Unknown | N/A | |
| Causal Inference out of Control: Estimating Performativity without Treatment Randomization | Unknown | N/A | |
| Differentiability and Optimization of Multiparameter Persistent Homology | Unknown | N/A | |
| Hybrid Inverse Reinforcement Learning | Unknown | N/A | |
| Cooperative Graph Neural Networks | Unknown | N/A | |
| Generalization to New Sequential Decision Making Tasks with In-Context Learning | Unknown | N/A | |
| Allocation Requires Prediction Only if Inequality Is Low | Unknown | N/A | |
| Latent Space Symmetry Discovery | Unknown | N/A | |
| Can Gaussian Sketching Converge Faster on a Preconditioned Landscape? | Unknown | N/A | |
| Leverage Class-Specific Accuracy to Guide Data Generation for Improving Image Classification | Unknown | N/A | |
| Data-free Neural Representation Compression with Riemannian Neural Dynamics | Unknown | N/A | |
| FlowMM: Generating Materials with Riemannian Flow Matching | Unknown | N/A | |
| Bespoke Non-Stationary Solvers for Fast Sampling of Diffusion and Flow Models | Unknown | N/A | |
| Variational Schrödinger Diffusion Models | Unknown | N/A | |
| Improving Transformers with Dynamically Composable Multi-Head Attention | Unknown | N/A | |
| Constrained Exploration via Reflected Replica Exchange Stochastic Gradient Langevin Dynamics | Unknown | N/A | |
| Learning Universal Predictors | Unknown | N/A | |
| Subgraphormer: Unifying Subgraph GNNs and Graph Transformers via Graph Products | Unknown | N/A | |
| Efficient Exploration for LLMs | Unknown | N/A | |
| Overcoming Data and Model heterogeneities in Decentralized Federated Learning via Synthetic Anchors | Unknown | N/A | |
| Amortizing Pragmatic Program Synthesis with Rankings | Unknown | N/A | |
| Memoria: Resolving Fateful Forgetting Problem through Human-Inspired Memory Architecture | Unknown | N/A | |
| Provable Multi-Task Representation Learning by Two-Layer ReLU Neural Networks | Unknown | N/A | |
| Surprisingly Strong Performance Prediction with Neural Graph Features | Unknown | N/A | |
| Stability and Multigroup Fairness in Ranking with Uncertain Predictions | Unknown | N/A | |
| Actions Speak Louder than Words: Trillion-Parameter Sequential Transducers for Generative Recommendations | Unknown | N/A | |
| GiLOT: Interpreting Generative Language Models via Optimal Transport | Unknown | N/A | |
| Transitional Uncertainty with Layered Intermediate Predictions | Unknown | N/A | |
| Prompt Sketching for Large Language Models | Unknown | N/A | |
| Probabilistic Conceptual Explainers: Trustworthy Conceptual Explanations for Vision Foundation Models | Unknown | N/A | |
| Mitigating Privacy Risk in Membership Inference by Convex-Concave Loss | Unknown | N/A | |
| Diversified Batch Selection for Training Acceleration | Unknown | N/A | |
| MALIBO: Meta-learning for Likelihood-free Bayesian Optimization | Unknown | N/A | |
| A Geometric Decomposition of Finite Games: Convergence vs. Recurrence under Exponential Weights | Unknown | N/A | |
| Rethinking Data Shapley for Data Selection Tasks: Misleads and Merits | Unknown | N/A | |
| Scaling Laws for the Value of Individual Data Points in Machine Learning | Unknown | N/A | |
| Learning and Forgetting Unsafe Examples in Large Language Models | Unknown | N/A | |
| In-context Vectors: Making In Context Learning More Effective and Controllable Through Latent Space Steering | Unknown | N/A | |
| Prospector Heads: Generalized Feature Attribution for Large Models & Data | Unknown | N/A | |
| How Well Can LLMs Negotiate? NegotiationArena Platform and Analysis | Unknown | N/A | |
| Selecting Large Language Model to Fine-tune via Rectified Scaling Law | Unknown | N/A | |
| PolySketchFormer: Fast Transformers via Sketching Polynomial Kernels | Unknown | N/A | |
| Lyapunov-stable Neural Control for State and Output Feedback: A Novel Formulation | Unknown | N/A | |
| When Do Skills Help Reinforcement Learning? A Theoretical Analysis of Temporal Abstractions | Unknown | N/A | |
| Assessing Large Language Models on Climate Information | Unknown | N/A | |
| The WMDP Benchmark: Measuring and Reducing Malicious Use with Unlearning | Unknown | N/A | |
| Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities | Unknown | N/A | |
| An Explicit Frame Construction for Normalizing 3D Point Clouds | Unknown | N/A | |
| Align Your Steps: Optimizing Sampling Schedules in Diffusion Models | Unknown | N/A | |
| Distributional Bellman Operators over Mean Embeddings | Unknown | N/A | |
| Comparing Graph Transformers via Positional Encodings | Unknown | N/A | |
| ETHER: Efficient Finetuning of Large-Scale Models with Hyperplane Reflections | Unknown | N/A | |
| DéjàVu: KV-cache Streaming for Fast, Fault-tolerant Generative LLM Serving | Unknown | N/A | |
| Private Heterogeneous Federated Learning Without a Trusted Server Revisited: Error-Optimal and Communication-Efficient Algorithms for Convex Losses | Unknown | N/A | |
| Incremental Topological Ordering and Cycle Detection with Predictions | Unknown | N/A | |
| Reweighted Solutions for Weighted Low Rank Approximation | Unknown | N/A | |
| Coresets for Multiple $\ell_p$ Regression | Unknown | N/A | |
| Fast, Scalable, Warm-Start Semidefinite Programming with Spectral Bundling and Sketching | Unknown | N/A | |
| Learning from Integral Losses in Physics Informed Neural Networks | Unknown | N/A | |
| Model-Free Robust $\phi$-Divergence Reinforcement Learning Using Both Offline and Online Data | Unknown | N/A | |
| A Linear Time and Space Local Point Cloud Geometry Encoder via Vectorized Kernel Mixture (VecKM) | Unknown | N/A | |
| WAVES: Benchmarking the Robustness of Image Watermarks | Unknown | N/A | |
| How Deep Do We Need: Accelerating Training and Inference of Neural ODEs via Control Perspective | Unknown | N/A | |
| GPT-4V(ision) is a Generalist Web Agent, if Grounded | Unknown | N/A | |
| LIDAO: Towards Limited Interventions for Debiasing (Large) Language Models | Unknown | N/A | |
| Scalable and Flexible Causal Discovery with an Efficient Test for Adjacency | Unknown | N/A | |
| Continuous Treatment Effects with Surrogate Outcomes | Unknown | N/A | |
| Editing Partially Observable Networks via Graph Diffusion Models | Unknown | N/A | |
| Meta Evidential Transformer for Few-Shot Open-Set Recognition | Unknown | N/A | |
| Neural Image Compression with Text-guided Encoding for both Pixel-level and Perceptual Fidelity | Unknown | N/A | |
| NExT-Chat: An LMM for Chat, Detection and Segmentation | Unknown | N/A | |
| CKGConv: General Graph Convolution with Continuous Kernels | Unknown | N/A | |
| Few-Shot Character Understanding in Movies as an Assessment to Meta-Learning of Theory-of-Mind | Unknown | N/A | |
| Position: Social Environment Design Should be Further Developed for AI-based Policy-Making | Unknown | N/A | |
| CaPS: Collaborative and Private Synthetic Data Generation from Distributed Sources | Unknown | N/A | |
| ${\rm E}(3)$-Equivariant Actor-Critic Methods for Cooperative Multi-Agent Reinforcement Learning | Unknown | N/A | |
| Grokking Group Multiplication with Cosets | Unknown | N/A | |
| Monitoring AI-Modified Content at Scale: A Case Study on the Impact of ChatGPT on AI Conference Peer Reviews | Unknown | N/A | |
| Cell2Sentence: Teaching Large Language Models the Language of Biology | Unknown | N/A | |
| The Effect of Weight Precision on the Neuron Count in Deep ReLU Networks | Unknown | N/A | |
| Augmenting Decision with Hypothesis in Reinforcement Learning | Unknown | N/A | |
| Outlier-Efficient Hopfield Layers for Large Transformer-Based Models | Unknown | N/A | |
| High-Dimensional Geometric Streaming for Nearly Low Rank Data | Unknown | N/A | |
| PriorBoost: An Adaptive Algorithm for Learning from Aggregate Responses | Unknown | N/A | |
| Perturb-and-Project: Differentially Private Similarities and Marginals | Unknown | N/A | |
| Data-Efficient Learning via Clustering-Based Sensitivity Sampling: Foundation Models and Beyond | Unknown | N/A | |
| A Field Guide for Pacing Budget and ROS Constraints | Unknown | N/A | |
| PRISE: LLM-Style Sequence Compression for Learning Temporal Action Abstractions in Control | Unknown | N/A | |
| Deep Networks Always Grok and Here is Why | Unknown | N/A | |
| Iterative Preference Learning from Human Feedback: Bridging Theory and Practice for RLHF under KL-constraint | Unknown | N/A | |
| Contrastive Predict-and-Search for Mixed Integer Linear Programs | Unknown | N/A | |
| Learning Decision Trees and Forests with Algorithmic Recourse | Unknown | N/A | |
| Deep Equilibrium Models are Almost Equivalent to Not-so-deep Explicit Models for High-dimensional Gaussian Mixtures | Unknown | N/A | |
| AMPA: Adaptive Mixed Precision Allocation for Low-Bit Integer Training | Unknown | N/A | |
| MusicFlow: Cascaded Flow Matching for Text Guided Music Generation | Unknown | N/A | |
| Beyond the Federation: Topology-aware Federated Learning for Generalization to Unseen Clients | Unknown | N/A | |
| Bayesian Design Principles for Offline-to-Online Reinforcement Learning | Unknown | N/A | |
| Differentiable Distributionally Robust Optimization Layers | Unknown | N/A | |
| Generating In-Distribution Proxy Graphs for Explaining Graph Neural Networks | Unknown | N/A | |
| TimeX++: Learning Time-Series Explanations with Information Bottleneck | Unknown | N/A | |
| Should we be going MAD? A Look at Multi-Agent Debate Strategies for LLMs | Unknown | N/A | |
| NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models | Unknown | N/A | |
| Position: Near to Mid-term Risks and Opportunities of Open-Source Generative AI | Unknown | N/A | |
| Is DPO Superior to PPO for LLM Alignment? A Comprehensive Study | Unknown | N/A | |
| Open-Domain Text Evaluation via Contrastive Distribution Methods | Unknown | N/A | |
| DiNADO: Norm-Disentangled Neurally-Decomposed Oracles for Controlling Language Models | Unknown | N/A | |
| Listening to the noise: Blind Denoising with Gibbs Diffusion | Unknown | N/A | |
| GATE: How to Keep Out Intrusive Neighbors | Unknown | N/A | |
| End-to-End Neuro-Symbolic Reinforcement Learning with Textual Explanations | Unknown | N/A | |
| Larimar: Large Language Models with Episodic Memory Control | Unknown | N/A | |
| What Would Gauss Say About Representations? Probing Pretrained Image Models using Synthetic Gaussian Benchmarks | Unknown | N/A | |
| Position: Data-driven Discovery with Large Generative Models | Unknown | N/A | |
| Matrix Information Theory for Self-Supervised Learning | Unknown | N/A | |
| Information Flow in Self-Supervised Learning | Unknown | N/A | |
| Better & Faster Large Language Models via Multi-token Prediction | Unknown | N/A | |
| ALERT-Transformer: Bridging Asynchronous and Synchronous Machine Learning for Real-Time Event-based Spatio-Temporal Data | Unknown | N/A | |
| CosPGD: an efficient white-box adversarial attack for pixel-wise prediction tasks | Unknown | N/A | |
| Promptbreeder: Self-Referential Self-Improvement via Prompt Evolution | Unknown | N/A | |
| Position: Open-Endedness is Essential for Artificial Superhuman Intelligence | Unknown | N/A | |
| Debating with More Persuasive LLMs Leads to More Truthful Answers | Unknown | N/A | |
| Genie: Generative Interactive Environments | Unknown | N/A | |
| HAMLET: Graph Transformer Neural Operator for Partial Differential Equations | Unknown | N/A | |
| ATraDiff: Accelerating Online Reinforcement Learning with Imaginary Trajectories | Unknown | N/A | |
| Data-Efficient Molecular Generation with Hierarchical Textual Inversion | Unknown | N/A | |
| All-in-one simulation-based inference | Unknown | N/A | |
| Learning from Memory: Non-Parametric Memory Augmented Self-Supervised Learning of Visual Features | Unknown | N/A | |
| Position: A Safe Harbor for AI Evaluation and Red Teaming | Unknown | N/A | |
| Evaluation of Trajectory Distribution Predictions with Energy Score | Unknown | N/A | |
| Position: Future Directions in the Theory of Graph Machine Learning | Unknown | N/A | |
| Hierarchical Integral Probability Metrics: A distance on random probability measures with low sample complexity | Unknown | N/A | |
| HexGen: Generative Inference of Large Language Model over Heterogeneous Environment | Unknown | N/A | |
| Position: An Inner Interpretability Framework for AI Inspired by Lessons from Cognitive Neuroscience | Unknown | N/A | |
| Implicit Bias of Policy Gradient in Linear Quadratic Control: Extrapolation to Unseen Initial States | Unknown | N/A | |
| Stochastic positional embeddings improve masked image modeling | Unknown | N/A | |
| Differentially Private Representation Learning via Image Captioning | Unknown | N/A | |
| Unleashing the Power of Meta-tuning for Few-shot Generalization Through Sparse Interpolated Experts | Unknown | N/A | |
| Universality of Linear Recurrences Followed by Non-linear Projections: Finite-Width Guarantees and Benefits of Complex Eigenvalues | Unknown | N/A | |
| PlanDQ: Hierarchical Plan Orchestration via D-Conductor and Q-Performer | Unknown | N/A | |
| On Gradient-like Explanation under a Black-box Setting: When Black-box Explanations Become as Good as White-box | Unknown | N/A | |
| Tabular Insights, Visual Impacts: Transferring Expertise from Tables to Images | Unknown | N/A | |
| Neural operators meet conjugate gradients: The FCG-NO method for efficient PDE solving | Unknown | N/A | |
| No Double Descent in Principal Component Regression: A High-Dimensional Analysis | Unknown | N/A | |
| Dynamic Facility Location in High Dimensional Euclidean Spaces | Unknown | N/A | |
| Defense against Backdoor Attack on Pre-trained Language Models via Head Pruning and Attention Normalization | Unknown | N/A | |
| Rethinking the Flat Minima Searching in Federated Learning | Unknown | N/A | |
| Bounded and Uniform Energy-based Out-of-distribution Detection for Graphs | Unknown | N/A | |
| Deconstructing the Goldilocks Zone of Neural Network Initialization | Unknown | N/A | |
| AutoOS: Make Your OS More Powerful by Exploiting Large Language Models | Unknown | N/A | |
| Prompt-based Visual Alignment for Zero-shot Policy Transfer | Unknown | N/A | |
| Gradient-based Visual Explanation for Transformer-based CLIP | Unknown | N/A | |
| Performance Bounds for Active Binary Testing with Information Maximization | Unknown | N/A | |
| Latent Noise Segmentation: How Neural Noise Leads to the Emergence of Segmentation and Grouping | Unknown | N/A | |
| Provable Benefits of Local Steps in Heterogeneous Federated Learning for Neural Networks: A Feature Learning Perspective | Unknown | N/A | |
| Relaxed Quantile Regression: Prediction Intervals for Asymmetric Noise | Unknown | N/A | |
| A Fixed-Point Approach for Causal Generative Modeling | Unknown | N/A | |
| Towards Causal Foundation Model: on Duality between Optimal Balancing and Attention | Unknown | N/A | |
| Conditional Language Learning with Context | Unknown | N/A | |
| Getting the most out of your tokenizer for pre-training and domain adaptation | Unknown | N/A | |
| Optimization without Retraction on the Random Generalized Stiefel Manifold | Unknown | N/A | |
| Representing Molecules as Random Walks Over Interpretable Grammars | Unknown | N/A | |
| Towards a Better Theoretical Understanding of Independent Subnetwork Training | Unknown | N/A | |
| AegisFL: Efficient and Flexible Privacy-Preserving Byzantine-Robust Cross-silo Federated Learning | Unknown | N/A | |
| One for All: A Universal Generator for Concept Unlearnability via Multi-Modal Alignment | Unknown | N/A | |
| Forget Sharpness: Perturbed Forgetting of Model Biases Within SAM Dynamics | Unknown | N/A | |
| Just Cluster It: An Approach for Exploration in High-Dimensions using Clustering and Pre-Trained Representations | Unknown | N/A | |
| Risk Aware Benchmarking of Large Language Models | Unknown | N/A | |
| Deeper or Wider: A Perspective from Optimal Generalization Error with Sobolev Loss | Unknown | N/A | |
| Towards a Self-contained Data-driven Global Weather Forecasting Framework | Unknown | N/A | |
| Two-timescale Derivative Free Optimization for Performative Prediction with Markovian Data | Unknown | N/A | |
| To Each (Textual Sequence) Its Own: Improving Memorized-Data Unlearning in Large Language Models | Unknown | N/A | |
| S$\Omega$I: Score-based O-INFORMATION Estimation | Unknown | N/A | |
| SLEB: Streamlining LLMs through Redundancy Verification and Elimination of Transformer Blocks | Unknown | N/A | |
| DeepPolar: Inventing Nonlinear Large-Kernel Polar Codes via Deep Learning | Unknown | N/A | |
| Two Heads are Actually Better than One: Towards Better Adversarial Robustness via Transduction and Rejection | Unknown | N/A | |
| On the Sample Complexity and Metastability of Heavy-tailed Policy Search in Continuous Control | Unknown | N/A | |
| Position: On the Possibilities of AI-Generated Text Detection | Unknown | N/A | |
| PIPER: Primitive-Informed Preference-based Hierarchical Reinforcement Learning via Hindsight Relabeling | Unknown | N/A | |
| MaxMin-RLHF: Alignment with Diverse Human Preferences | Unknown | N/A | |
| Towards Global Optimality for Practical Average Reward Reinforcement Learning without Mixing Time Oracles | Unknown | N/A | |
| Don't be so Negative! Score-based Generative Modeling with Oracle-assisted Guidance | Unknown | N/A | |
| Et Tu Certifications: Robustness Certificates Yield Better Adversarial Examples | Unknown | N/A | |
| Quantum Theory and Application of Contextual Optimal Transport | Unknown | N/A | |
| Few-Shot Unsupervised Implicit Neural Shape Representation Learning with Spatial Adversaries | Unknown | N/A | |
| Learning with Partial-Label and Unlabeled Data: A Uniform Treatment for Supervision Redundancy and Insufficiency | Unknown | N/A | |
| HyperFields: Towards Zero-Shot Generation of NeRFs from Text | Unknown | N/A | |
| Guarantees for Nonlinear Representation Learning: Non-identical Covariates, Dependent Data, Fewer Samples | Unknown | N/A | |
| AlphaZero-Like Tree-Search can Guide Large Language Model Decoding and Training | Unknown | N/A | |
| Active Ranking and Matchmaking, with Perfect Matchings | Unknown | N/A | |
| Causal Bandits: The Pareto Optimal Frontier of Adaptivity, a Reduction to Linear Bandits, and Limitations around Unknown Marginals | Unknown | N/A | |
| Towards Neural Architecture Search through Hierarchical Generative Modeling | Unknown | N/A | |
| Learning from Students: Applying t-Distributions to Explore Accurate and Efficient Formats for LLMs | Unknown | N/A | |
| Amortized Variational Deep Kernel Learning | Unknown | N/A | |
| Disentanglement Learning via Topology | Unknown | N/A | |
| SaVeR: Optimal Data Collection Strategy for Safe Policy Evaluation in Tabular MDP | Unknown | N/A | |
| Codebook Features: Sparse and Discrete Interpretability for Neural Networks | Unknown | N/A | |
| Quality-Diversity Actor-Critic: Learning High-Performing and Diverse Behaviors via Value and Successor Features Critics | Unknown | N/A | |
| Position: Embracing Negative Results in Machine Learning | Unknown | N/A | |
| Position: Reinforcement Learning in Dynamic Treatment Regimes Needs Critical Reexamination | Unknown | N/A | |
| Learning Divergence Fields for Shift-Robust Graph Representations | Unknown | N/A | |
| How Graph Neural Networks Learn: Lessons from Training Dynamics | Unknown | N/A | |
| Graph Out-of-Distribution Detection Goes Neighborhood Shaping | Unknown | N/A | |
| Stay on Topic with Classifier-Free Guidance | Unknown | N/A | |
| Position: On the Societal Impact of Open Foundation Models | Unknown | N/A | |
| Learning Label Shift Correction for Test-Agnostic Long-Tailed Recognition | Unknown | N/A | |
| On the Recoverability of Causal Relations from Temporally Aggregated I.I.D. Data | Unknown | N/A | |
| Evaluation of Test-Time Adaptation Under Computational Time Constraints | Unknown | N/A | |
| Towards Interpretable Deep Local Learning with Successive Gradient Reconciliation | Unknown | N/A | |
| An Empirical Study of Realized GNN Expressiveness | Unknown | N/A | |
| Self-Driven Entropy Aggregation for Byzantine-Robust Heterogeneous Federated Learning | Unknown | N/A | |
| Understanding MLP-Mixer as a wide and sparse MLP | Unknown | N/A | |
| Self-attention Networks Localize When QK-eigenspectrum Concentrates | Unknown | N/A | |
| Faster Streaming and Scalable Algorithms for Finding Directed Dense Subgraphs in Large Graphs | Unknown | N/A | |
| Sparse-IFT: Sparse Iso-FLOP Transformations for Maximizing Training Efficiency | Unknown | N/A | |
| Scalable Real-Time Recurrent Learning Using Columnar-Constructive Networks | Unknown | N/A | |
| Effect-Invariant Mechanisms for Policy Generalization | Unknown | N/A | |
| eCeLLM: Generalizing Large Language Models for E-commerce from Large-scale, High-quality Instruction Data | Unknown | N/A | |
| Differentiable Combinatorial Scheduling at Scale | Unknown | N/A | |
| Bottleneck-Minimal Indexing for Generative Document Retrieval | Unknown | N/A | |
| Model-based Reinforcement Learning for Parameterized Action Spaces | Unknown | N/A | |
| Efficient Mixture Learning in Black-Box Variational Inference | Unknown | N/A | |
| Indirectly Parameterized Concrete Autoencoders | Unknown | N/A | |
| EVEREST: Efficient Masked Video Autoencoder by Removing Redundant Spatiotemporal Tokens | Unknown | N/A | |
| STELLA: Continual Audio-Video Pre-training with SpatioTemporal Localized Alignment | Unknown | N/A | |
| BECoTTA: Input-dependent Online Blending of Experts for Continual Test-time Adaptation | Unknown | N/A | |
| Covert Malicious Finetuning: Challenges in Safeguarding LLM Adaptation | Unknown | N/A | |
| What is Dataset Distillation Learning? | Unknown | N/A | |
| DPZero: Private Fine-Tuning of Language Models without Backpropagation | Unknown | N/A | |
| MLI Formula: A Nearly Scale-Invariant Solution with Noise Perturbation | Unknown | N/A | |
| Sampling in Unit Time with Kernel Fisher-Rao Flow | Unknown | N/A | |
| Structure-Aware E(3)-Invariant Molecular Conformer Aggregation Networks | Unknown | N/A | |
| Outlier Weighed Layerwise Sparsity (OWL): A Missing Secret Sauce for Pruning LLMs to High Sparsity | Unknown | N/A | |
| Quantum Positional Encodings for Graph Neural Networks | Unknown | N/A | |
| Hidden Traveling Waves bind Working Memory Variables in Recurrent Neural Networks | Unknown | N/A | |
| Disentangled 3D Scene Generation with Layout Learning | Unknown | N/A | |
| Federated Offline Reinforcement Learning: Collaborative Single-Policy Coverage Suffices | Unknown | N/A | |
| Improved Dimensionality Dependence for Zeroth-Order Optimisation over Cross-Polytopes | Unknown | N/A | |
| A New Computationally Efficient Algorithm to solve Feature Selection for Functional Data Classification in High-dimensional Spaces | Unknown | N/A | |
| A Sparsity Principle for Partially Observable Causal Representation Learning | Unknown | N/A | |
| Conditional Normalizing Flows for Active Learning of Coarse-Grained Molecular Representations | Unknown | N/A | |
| Probability Distribution of Hypervolume Improvement in Bi-objective Bayesian Optimization | Unknown | N/A | |
| Position: Opportunities Exist for Machine Learning in Magnetic Fusion Energy | Unknown | N/A | |
| Bounding the Excess Risk for Linear Models Trained on Marginal-Preserving, Differentially-Private, Synthetic Data | Unknown | N/A | |
| Major-Minor Mean Field Multi-Agent Reinforcement Learning | Unknown | N/A | |
| Understanding Forgetting in Continual Learning with Linear Regression | Unknown | N/A | |
| Quality-Diversity with Limited Resources | Unknown | N/A | |
| Near-Optimal Regret in Linear MDPs with Aggregate Bandit Feedback | Unknown | N/A | |
| On the Independence Assumption in Neurosymbolic Learning | Unknown | N/A | |
| diff History for Neural Language Agents | Unknown | N/A | |
| Hierarchical State Space Models for Continuous Sequence-to-Sequence Modeling | Unknown | N/A | |
| Can Looped Transformers Learn to Implement Multi-step Gradient Descent for In-context Learning? | Unknown | N/A | |
| CuTS: Customizable Tabular Synthetic Data Generation | Unknown | N/A | |
| Instruction Tuning for Secure Code Generation | Unknown | N/A | |
| Mimicking Better by Matching the Approximate Action Distribution | Unknown | N/A | |
| Asymmetry in Low-Rank Adapters of Foundation Models | Unknown | N/A | |
| Slicing Mutual Information Generalization Bounds for Neural Networks | Unknown | N/A | |
| Position: Fundamental Limitations of LLM Censorship Necessitate New Approaches | Unknown | N/A | |
| Information Complexity of Stochastic Convex Optimization: Applications to Generalization, Memorization, and Tracing | Unknown | N/A | |
| Bifurcated Attention for Single-Context Large-Batch Sampling | Unknown | N/A | |
| Breadth-First Exploration on Adaptive Grid for Reinforcement Learning | Unknown | N/A | |
| Smoothing Proximal Gradient Methods for Nonsmooth Sparsity Constrained Optimization: Optimality Conditions and Global Convergence | Unknown | N/A | |
| Counterfactual Reasoning for Multi-Label Image Classification via Patching-Based Training | Unknown | N/A | |
| Learning to Model the World With Language | Unknown | N/A | |
| The Merit of River Network Topology for Neural Flood Forecasting | Unknown | N/A | |
| Target Networks and Over-parameterization Stabilize Off-policy Bootstrapping with Function Approximation | Unknown | N/A | |
| Learning Temporal Distances: Contrastive Successor Features Can Provide a Metric Structure for Decision-Making | Unknown | N/A | |
| Remembering to Be Fair: Non-Markovian Fairness in Sequential Decision Making | Unknown | N/A | |
| Disguised Copyright Infringement of Latent Diffusion Models | Unknown | N/A | |
| Predictive Linear Online Tracking for Unknown Targets | Unknown | N/A | |
| Learning Latent Dynamic Robust Representations for World Models | Unknown | N/A | |
| Stealing part of a production language model | Unknown | N/A | |
| Clifford-Steerable Convolutional Neural Networks | Unknown | N/A | |
| Sub-token ViT Embedding via Stochastic Resonance Transformers | Unknown | N/A | |
| Dynamic Metric Embedding into $\ell_p$ Space | Unknown | N/A | |
| Graph2Tac: Online Representation Learning of Formal Math Concepts | Unknown | N/A | |
| Diffusion Language Models Are Versatile Protein Learners | Unknown | N/A | |
| BWS: Best Window Selection Based on Sample Scores for Data Pruning across Broad Ranges | Unknown | N/A | |
| Localizing Task Information for Improved Model Merging and Compression | Unknown | N/A | |
| Detecting and Identifying Selection Structure in Sequential Data | Unknown | N/A | |
| Data Engineering for Scaling Language Models to 128K Context | Unknown | N/A | |
| OpenMoE: An Early Effort on Open Mixture-of-Experts Language Models | Unknown | N/A | |
| The Emergence of Reproducibility and Consistency in Diffusion Models | Unknown | N/A | |
| Accelerated Speculative Sampling Based on Tree Monte Carlo | Unknown | N/A | |
| Random Latent Exploration for Deep Reinforcement Learning | Unknown | N/A | |
| Enhancing Size Generalization in Graph Neural Networks through Disentangled Representation Learning | Unknown | N/A | |
| EvoluNet: Advancing Dynamic Non-IID Transfer Learning on Graphs | Unknown | N/A | |
| Active Statistical Inference | Unknown | N/A | |
| Protein Conformation Generation via Force-Guided SE(3) Diffusion Models | Unknown | N/A | |
| CHAI: Clustered Head Attention for Efficient LLM Inference | Unknown | N/A | |
| Scaling Rectified Flow Transformers for High-Resolution Image Synthesis | Unknown | N/A | |
| Deep Demonstration Tracing: Learning Generalizable Imitator Policy for Runtime Imitation from a Single Demonstration | Unknown | N/A | |
| Policy-conditioned Environment Models are More Generalizable | Unknown | N/A | |
| SILVER: Single-loop variance reduction and application to federated learning | Unknown | N/A | |
| Hierarchical Neural Operator Transformer with Learnable Frequency-aware Loss Prior for Arbitrary-scale Super-resolution | Unknown | N/A | |
| InterLUDE: Interactions between Labeled and Unlabeled Data to Enhance Semi-Supervised Learning | Unknown | N/A | |
| Using Left and Right Brains Together: Towards Vision and Language Planning | Unknown | N/A | |
| SMaRt: Improving GANs with Score Matching Regularity | Unknown | N/A | |
| ODIN: Disentangled Reward Mitigates Hacking in RLHF | Unknown | N/A | |
| Modular Learning of Deep Causal Generative Models for High-dimensional Causal Inference | Unknown | N/A | |
| Interplay of ROC and Precision-Recall AUCs: Theoretical Limits and Practical Implications in Binary Classification | Unknown | N/A | |
| FuRL: Visual-Language Models as Fuzzy Rewards for Reinforcement Learning | Unknown | N/A | |
| On Discrete Prompt Optimization for Diffusion Models | Unknown | N/A | |
| Prismatic VLMs: Investigating the Design Space of Visually-Conditioned Language Models | Unknown | N/A | |
| Directly Denoising Diffusion Models | Unknown | N/A | |
| Nearest Neighbour Score Estimators for Diffusion Generative Models | Unknown | N/A | |
| What needs to go right for an induction head? A mechanistic study of in-context learning circuits and their formation | Unknown | N/A | |
| Thermometer: Towards Universal Calibration for Large Language Models | Unknown | N/A | |
| Out-of-Domain Generalization in Dynamical Systems Reconstruction | Unknown | N/A | |
| Bring Your Own (Non-Robust) Algorithm to Solve Robust MDPs by Estimating The Worst Kernel | Unknown | N/A | |
| VideoPoet: A Large Language Model for Zero-Shot Video Generation | Unknown | N/A | |
| Assessing the Brittleness of Safety Alignment via Pruning and Low-Rank Modifications | Unknown | N/A | |
| TimeMIL: Advancing Multivariate Time Series Classification via a Time-aware Multiple Instance Learning | Unknown | N/A | |
| Position: LLMs Can’t Plan, But Can Help Planning in LLM-Modulo Frameworks | Unknown | N/A | |
| New Sample Complexity Bounds for Sample Average Approximation in Heavy-Tailed Stochastic Programming | Unknown | N/A | |
| EvTexture: Event-driven Texture Enhancement for Video Super-Resolution | Unknown | N/A | |
| Decomposing Uncertainty for Large Language Models through Input Clarification Ensembling | Unknown | N/A | |
| Feature Reuse and Scaling: Understanding Transfer Learning with Protein Language Models | Unknown | N/A | |
| Learning Coverage Paths in Unknown Environments with Deep Reinforcement Learning | Unknown | N/A | |
| No Wrong Turns: The Simple Geometry Of Neural Networks Optimization Paths | Unknown | N/A | |
| MOMENT: A Family of Open Time-series Foundation Models | Unknown | N/A | |
| SLAB: Efficient Transformers with Simplified Linear Attention and Progressive Re-parameterized Batch Normalization | Unknown | N/A | |
| Counterfactual Image Editing | Unknown | N/A | |
| Learning to Route Among Specialized Experts for Zero-Shot Generalization | Unknown | N/A | |
| Fast Adversarial Attacks on Language Models In One GPU Minute | Unknown | N/A | |
| Orthogonal Bootstrap: Efficient Simulation of Input Uncertainty | Unknown | N/A | |
| EvIL: Evolution Strategies for Generalisable Imitation Learning | Unknown | N/A | |
| Improving Prototypical Visual Explanations with Reward Reweighing, Reselection, and Retraining | Unknown | N/A | |
| Preference Fine-Tuning of LLMs Should Leverage Suboptimal, On-Policy Data | Unknown | N/A | |
| On The Statistical Complexity of Offline Decision-Making | Unknown | N/A | |
| Probabilistic Constrained Reinforcement Learning with Formal Interpretability | Unknown | N/A | |
| Bringing Motion Taxonomies to Continuous Domains via GPLVM on Hyperbolic manifolds | Unknown | N/A | |
| Variational Learning is Effective for Large Deep Networks | Unknown | N/A | |
| Light and Optimal Schrödinger Bridge Matching | Unknown | N/A | |
| Controlled Decoding from Language Models | Unknown | N/A | |
| On the Duality Between Sharpness-Aware Minimization and Adversarial Training | Unknown | N/A | |
| Liouville Flow Importance Sampler | Unknown | N/A | |
| Offline Training of Language Model Agents with Functions as Learnable Weights | Unknown | N/A | |
| Scaling Exponents Across Parameterizations and Optimizers | Unknown | N/A | |
| Discovering Symmetry Breaking in Physical Systems with Relaxed Group Convolution | Unknown | N/A | |
| SPADE: Sparsity-Guided Debugging for Deep Neural Networks | Unknown | N/A | |
| Error Feedback Can Accurately Compress Preconditioners | Unknown | N/A | |
| Extreme Compression of Large Language Models via Additive Quantization | Unknown | N/A | |
| AttNS: Attention-Inspired Numerical Solving For Limited Data Scenarios | Unknown | N/A | |
| LEVI: Generalizable Fine-tuning via Layer-wise Ensemble of Different Views | Unknown | N/A | |
| Characterizing Large Language Model Geometry Helps Solve Toxicity Detection and Generation | Unknown | N/A | |
| Ameliorate Spurious Correlations in Dataset Condensation | Unknown | N/A | |
| VideoPrism: A Foundational Visual Encoder for Video Understanding | Unknown | N/A | |
| Particle Denoising Diffusion Sampler | Unknown | N/A | |
| LaMAGIC: Language-Model-based Topology Generation for Analog Integrated Circuits | Unknown | N/A | |
| Optimistic Multi-Agent Policy Gradient | Unknown | N/A | |
| How do Transformers Perform In-Context Autoregressive Learning? | Unknown | N/A | |
| Vision Transformers as Probabilistic Expansion from Learngene | Unknown | N/A | |
| Compressible Dynamics in Deep Overparameterized Low-Rank Learning & Adaptation | Unknown | N/A | |
| Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast | Unknown | N/A | |
| Graph Positional and Structural Encoder | Unknown | N/A | |
| Modeling Language Tokens as Functionals of Semantic Fields | Unknown | N/A | |
| Inferring Change Points in High-Dimensional Linear Regression via Approximate Message Passing | Unknown | N/A | |
| Score-Based Causal Discovery of Latent Variable Causal Models | Unknown | N/A | |
| DITTO: Diffusion Inference-Time T-Optimization for Music Generation | Unknown | N/A | |
| Language Models as Science Tutors | Unknown | N/A | |
| Acquisition Conditioned Oracle for Nongreedy Active Feature Acquisition | Unknown | N/A | |
| Position: Technical Research and Talent is Needed for Effective AI Governance | Unknown | N/A | |
| Token-Specific Watermarking with Enhanced Detectability and Semantic Coherence for Large Language Models | Unknown | N/A | |
| SpikeLM: Towards General Spike-Driven Language Modeling via Elastic Bi-Spiking Mechanisms | Unknown | N/A | |
| Integrated Hardware Architecture and Device Placement Search | Unknown | N/A | |
| Noise-Adaptive Confidence Sets for Linear Bandits and Application to Bayesian Optimization | Unknown | N/A | |
| Fast Sampling-Based Sketches for Tensors | Unknown | N/A | |
| Benchmarking and Building Long-Context Retrieval Models with LoCo and M2-BERT | Unknown | N/A | |
| Diffusion Models Encode the Intrinsic Dimension of Data Manifolds | Unknown | N/A | |
| Graph Neural Networks Use Graphs When They Shouldn't | Unknown | N/A | |
| Two Fists, One Heart: Multi-Objective Optimization Based Strategy Fusion for Long-tailed Learning | Unknown | N/A | |
| AD3: Implicit Action is the Key for World Models to Distinguish the Diverse Visual Distractors | Unknown | N/A | |
| Data-free Distillation of Diffusion Models with Bootstrapping | Unknown | N/A | |
| What’s the score? Automated Denoising Score Matching for Nonlinear Diffusions | Unknown | N/A | |
| Stochastic Interpolants with Data-Dependent Couplings | Unknown | N/A | |
| Adaptive Sampling of k-Space in Magnetic Resonance for Rapid Pathology Prediction | Unknown | N/A | |
| Efficient Black-box Adversarial Attacks via Bayesian Optimization Guided by a Function Prior | Unknown | N/A | |
| Self-Rewarding Language Models | Unknown | N/A | |
| COALA: A Practical and Vision-Centric Federated Learning Platform | Unknown | N/A | |
| Arrows of Time for Large Language Models | Unknown | N/A | |
| Overcoming Saturation in Density Ratio Estimation by Iterated Regularization | Unknown | N/A | |
| In-Context Learning Agents Are Asymmetric Belief Updaters | Unknown | N/A | |
| Efficient Algorithms for Empirical Group Distributionally Robust Optimization and Beyond | Unknown | N/A | |
| Tag-LLM: Repurposing General-Purpose LLMs for Specialized Domains | Unknown | N/A | |
| FADAS: Towards Federated Adaptive Asynchronous Optimization | Unknown | N/A | |
| Fast and Sample Efficient Multi-Task Representation Learning in Stochastic Contextual Bandits | Unknown | N/A | |
| A Closer Look at the Limitations of Instruction Tuning | Unknown | N/A | |
| Sign Gradient Descent-based Neuronal Dynamics: ANN-to-SNN Conversion Beyond ReLU Network | Unknown | N/A | |
| Theoretical Analysis of Learned Database Operations under Distribution Shift through Distribution Learnability | Unknown | N/A | |
| Linear Alignment: A Closed-form Solution for Aligning Human Preferences without Tuning and Feedback | Unknown | N/A | |
| Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning | Unknown | N/A | |
| EquiPocket: an E(3)-Equivariant Geometric Graph Neural Network for Ligand Binding Site Prediction | Unknown | N/A | |
| Is Kernel Prediction More Powerful than Gating in Convolutional Neural Networks? | Unknown | N/A | |
| Dynamic Anisotropic Smoothing for Noisy Derivative-Free Optimization | Unknown | N/A | |
| Incorporating probabilistic domain knowledge into deep multiple instance learning | Unknown | N/A | |
| Mean-field Analysis on Two-layer Neural Networks from a Kernel Perspective | Unknown | N/A | |
| In-Context Language Learning: Architectures and Algorithms | Unknown | N/A | |
| Gated Linear Attention Transformers with Hardware-Efficient Training | Unknown | N/A | |
| Agnostic Sample Compression Schemes for Regression | Unknown | N/A | |
| ArtWhisperer: A Dataset for Characterizing Human-AI Interactions in Artistic Creations | Unknown | N/A | |
| Dealing With Unbounded Gradients in Stochastic Saddle-point Optimization | Unknown | N/A | |
| Reducing Item Discrepancy via Differentially Private Robust Embedding Alignment for Privacy-Preserving Cross Domain Recommendation | Unknown | N/A | |
| Masked Face Recognition with Generative-to-Discriminative Representations | Unknown | N/A | |
| Recovering the Pre-Fine-Tuning Weights of Generative Models | Unknown | N/A | |
| Plug-in Performative Optimization | Unknown | N/A | |
| Understanding Finetuning for Factual Knowledge Extraction | Unknown | N/A | |
| Re-Dock: Towards Flexible and Realistic Molecular Docking with Diffusion Bridge | Unknown | N/A | |
| Estimating Canopy Height at Scale | Unknown | N/A | |
| Learning to Predict Mutational Effects of Protein-Protein Interactions by Microenvironment-aware Hierarchical Prompt Learning | Unknown | N/A | |
| Position: Cracking the Code of Cascading Disparity Towards Marginalized Communities | Unknown | N/A | |
| A Global Geometric Analysis of Maximal Coding Rate Reduction | Unknown | N/A | |
| The Pitfalls of Next-Token Prediction | Unknown | N/A | |
| Failures Are Fated, But Can Be Faded: Characterizing and Mitigating Unwanted Behaviors in Large-Scale Vision and Language Models | Unknown | N/A | |
| Navigating Scaling Laws: Compute Optimality in Adaptive Model Training | Unknown | N/A | |
| A Language Model’s Guide Through Latent Space | Unknown | N/A | |
| PDHG-Unrolled Learning-to-Optimize Method for Large-Scale Linear Programming | Unknown | N/A | |
| One Meta-tuned Transformer is What You Need for Few-shot Learning | Unknown | N/A | |
| Generating Chain-of-Thoughts with a Pairwise-Comparison Approach to Searching for the Most Promising Intermediate Thought | Unknown | N/A | |
| Conformal Prediction for Deep Classifier via Label Ranking | Unknown | N/A | |
| Position: TrustLLM: Trustworthiness in Large Language Models | Unknown | N/A | |
| Longitudinal Targeted Minimum Loss-based Estimation with Temporal-Difference Heterogeneous Transformer | Unknown | N/A | |
| Multiplicative Weights Update, Area Convexity and Random Coordinate Descent for Densest Subgraph Problems | Unknown | N/A | |
| Representation Surgery: Theory and Practice of Affine Steering | Unknown | N/A | |
| SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models | Unknown | N/A | |
| Accelerating Federated Learning with Quick Distributed Mean Estimation | Unknown | N/A | |
| From Classification Accuracy to Proper Scoring Rules: Elicitability of Probabilistic Top List Predictions | Unknown | N/A | |
| Layerwise Proximal Replay: A Proximal Point Method for Online Continual Learning | Unknown | N/A | |
| A Sober Look at LLMs for Material Discovery: Are They Actually Good for Bayesian Optimization Over Molecules? | Unknown | N/A | |
| Rate-Optimal Policy Optimization for Linear Markov Decision Processes | Unknown | N/A | |
| Fast Peer Adaptation with Context-aware Exploration | Unknown | N/A | |
| Spotting LLMs With Binoculars: Zero-Shot Detection of Machine-Generated Text | Unknown | N/A | |
| Learning to Infer Generative Template Programs for Visual Concepts | Unknown | N/A | |
| Gibbs Sampling of Continuous Potentials on a Quantum Computer | Unknown | N/A | |
| Dual Operating Modes of In-Context Learning | Unknown | N/A | |
| D-Flow: Differentiating through Flows for Controlled Generation | Unknown | N/A | |
| Transformers Implement Functional Gradient Descent to Learn Non-Linear Functions In Context | Unknown | N/A | |
| Unveiling the Dynamics of Information Interplay in Supervised Learning | Unknown | N/A | |
| Caduceus: Bi-Directional Equivariant Long-Range DNA Sequence Modeling | Unknown | N/A | |
| Integrating Global Context Contrast and Local Sensitivity for Blind Image Quality Assessment | Unknown | N/A | |
| Adaptive Feature Selection for No-Reference Image Quality Assessment by Mitigating Semantic Noise Sensitivity | Unknown | N/A | |
| Can AI Assistants Know What They Don't Know? | Unknown | N/A | |
| Low-Rank Bandits via Tight Two-to-Infinity Singular Subspace Recovery | Unknown | N/A | |
| Classification Under Strategic Self-Selection | Unknown | N/A | |
| Degeneration-free Policy Optimization: RL Fine-Tuning for Language Models without Degeneration | Unknown | N/A | |
| Estimating Distributional Treatment Effects in Randomized Experiments: Machine Learning for Variance Reduction | Unknown | N/A | |
| USTAD: Unified Single-model Training Achieving Diverse Scores for Information Retrieval | Unknown | N/A | |
| StyDeSty: Min-Max Stylization and Destylization for Single Domain Generalization | Unknown | N/A | |
| Multi-layer Rehearsal Feature Augmentation for Class-Incremental Learning | Unknown | N/A | |
| On the Role of Edge Dependency in Graph Generative Models | Unknown | N/A | |
| Consistent Long-Term Forecasting of Ergodic Dynamical Systems | Unknown | N/A | |
| Visual-Text Cross Alignment: Refining the Similarity Score in Vision-Language Models | Unknown | N/A | |
| Learning to Explore in POMDPs with Informational Rewards | Unknown | N/A | |
| PIVOT: Iterative Visual Prompting Elicits Actionable Knowledge for VLMs | Unknown | N/A | |
| Can Mamba Learn How To Learn? A Comparative Study on In-Context Learning Tasks | Unknown | N/A | |
| Structure Your Data: Towards Semantic Graph Counterfactuals | Unknown | N/A | |
| Efficient Adaptation in Mixed-Motive Environments via Hierarchical Opponent Modeling and Planning | Unknown | N/A | |
| Beyond Chinchilla-Optimal: Accounting for Inference in Language Model Scaling Laws | Unknown | N/A | |
| Accelerating Look-ahead in Bayesian Optimization: Multilevel Monte Carlo is All you Need | Unknown | N/A | |
| A Persuasive Approach to Combating Misinformation | Unknown | N/A | |
| Fine-tuning Reinforcement Learning Models is Secretly a Forgetting Mitigation Problem | Unknown | N/A | |
| Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models | Unknown | N/A | |
| HALC: Object Hallucination Reduction via Adaptive Focal-Contrast Decoding | Unknown | N/A | |
| Jetfire: Efficient and Accurate Transformer Pretraining with INT8 Data Flow and Per-Block Quantization | Unknown | N/A | |
| Self-Supervised Interpretable End-to-End Learning via Latent Functional Modularity | Unknown | N/A | |
| High-Dimensional Bayesian Optimization via Semi-Supervised Learning with Optimized Unlabeled Data Sampling | Unknown | N/A | |
| A Unified Adaptive Testing System Enabled by Hierarchical Structure Search | Unknown | N/A | |
| Observable Propagation: Uncovering Feature Vectors in Transformers | Unknown | N/A | |
| Complexity Matters: Feature Learning in the Presence of Spurious Correlations | Unknown | N/A | |
| EAGLE: Speculative Sampling Requires Rethinking Feature Uncertainty | Unknown | N/A | |
| Copyright Traps for Large Language Models | Unknown | N/A | |
| PAGER: Accurate Failure Characterization in Deep Regression Models | Unknown | N/A | |
| Policy Evaluation for Variance in Average Reward Reinforcement Learning | Unknown | N/A | |
| Interpreting Equivariant Representations | Unknown | N/A | |
| Risk Estimation in a Markov Cost Process: Lower and Upper Bounds | Unknown | N/A | |
| Physics and Lie symmetry informed Gaussian processes | Unknown | N/A | |
| Characterizing Truthfulness in Large Language Model Generations with Local Intrinsic Dimension | Unknown | N/A | |
| Robust Yet Efficient Conformal Prediction Sets | Unknown | N/A | |
| On the Trajectory Regularity of ODE-based Diffusion Sampling | Unknown | N/A | |
| CF-OPT: Counterfactual Explanations for Structured Prediction | Unknown | N/A | |
| Differentiable Weightless Neural Networks | Unknown | N/A | |
| Adaptive Observation Cost Control for Variational Quantum Eigensolvers | Unknown | N/A | |
| Learning Low-dimensional Latent Dynamics from High-dimensional Observations: Non-asymptotics and Lower Bounds | Unknown | N/A | |
| RoSA: Accurate Parameter-Efficient Fine-Tuning via Robust Adaptation | Unknown | N/A | |
| Kepler codebook | Unknown | N/A | |
| Tandem Transformers for Inference Efficient LLMs | Unknown | N/A | |
NIPS 2020
| Title | Author | PDF_Link | Code_URL |
|---|---|---|---|
| Bongard-LOGO: A New Benchmark for Human-Level Concept Learning and Reasoning | Unknown | N/A | |
| Reinforced Molecular Optimization with Neighborhood-Controlled Grammars | Unknown | N/A | |
| Locally Differentially Private (Contextual) Bandits Learning | Unknown | N/A | |
| Online Structured Meta-learning | Unknown | N/A | |
| Inverse Rational Control with Partially Observable Continuous Nonlinear Dynamics | Unknown | N/A | |
| Differentiable Neural Architecture Search in Equivalent Space with Exploration Enhancement | Unknown | N/A | |
| All Word Embeddings from One Embedding | Unknown | N/A | |
| Multi-label classification: do Hamming loss and subset accuracy really conflict with each other? | Unknown | N/A | |
| Few-Cost Salient Object Detection with Adversarial-Paced Learning | Unknown | N/A | |
| Counterfactual Predictions under Runtime Confounding | Unknown | N/A | |
| BanditPAM: Almost Linear Time k-Medoids Clustering via Multi-Armed Bandits | Unknown | N/A | |
| Multipole Graph Neural Operator for Parametric Partial Differential Equations | Unknown | N/A | |
| Fast Unbalanced Optimal Transport on a Tree | Unknown | N/A | |
| Baxter Permutation Process | Unknown | N/A | |
| Generalized Boosting | Unknown | N/A | |
| Probabilistic Active Meta-Learning | Unknown | N/A | |
| Matrix Completion with Quantified Uncertainty through Low Rank Gaussian Copula | Unknown | N/A | |
| Learning Retrospective Knowledge with Reverse Reinforcement Learning | Unknown | N/A | |
| A Decentralized Parallel Algorithm for Training Generative Adversarial Nets | Unknown | N/A | |
| Bayesian Optimization of Risk Measures | Unknown | N/A | |
| Robust Optimal Transport with Applications in Generative Modeling and Domain Adaptation | Unknown | N/A | |
| Batched Coarse Ranking in Multi-Armed Bandits | Unknown | N/A | |
| Wavelet Flow: Fast Training of High Resolution Normalizing Flows | Unknown | N/A | |
| Minimax Value Interval for Off-Policy Evaluation and Policy Optimization | Unknown | N/A | |
| ShapeFlow: Learnable Deformation Flows Among 3D Shapes | Unknown | N/A | |
| High-Dimensional Sparse Linear Bandits | Unknown | N/A | |
| Simultaneously Learning Stochastic and Adversarial Episodic MDPs with Known Transition | Unknown | N/A | |
| Certified Monotonic Neural Networks | Unknown | N/A | |
| Combining Deep Reinforcement Learning and Search for Imperfect-Information Games | Unknown | N/A | |
| Trade-offs and Guarantees of Adversarial Representation Learning for Information Obfuscation | Unknown | N/A | |
| Towards Interpretable Natural Language Understanding with Explanations as Latent Variables | Unknown | N/A | |
| Denoised Smoothing: A Provable Defense for Pretrained Classifiers | Unknown | N/A | |
| BlockGAN: Learning 3D Object-aware Scene Representations from Unlabelled Images | Unknown | N/A | |
| Faster DBSCAN via subsampled similarity queries | Unknown | N/A | |
| First-Order Methods for Large-Scale Market Equilibrium Computation | Unknown | N/A | |
| Latent Dynamic Factor Analysis of High-Dimensional Neural Recordings | Unknown | N/A | |
| Multiscale Deep Equilibrium Models | Unknown | N/A | |
| Representation Learning for Integrating Multi-domain Outcomes to Optimize Individualized Treatment | Unknown | N/A | |
| On the Similarity between the Laplace and Neural Tangent Kernels | Unknown | N/A | |
| Bias no more: high-probability data-dependent regret bounds for adversarial bandits and MDPs | Unknown | N/A | |
| Matérn Gaussian Processes on Riemannian Manifolds | Unknown | N/A | |
| Adversarially Robust Streaming Algorithms via Differential Privacy | Unknown | N/A | |
| A kernel test for quasi-independence | Unknown | N/A | |
| Hybrid Variance-Reduced SGD Algorithms For Minimax Problems with Nonconvex-Linear Function | Unknown | N/A | |
| Confounding-Robust Policy Evaluation in Infinite-Horizon Reinforcement Learning | Unknown | N/A | |
| Bayesian Pseudocoresets | Unknown | N/A | |
| MPNet: Masked and Permuted Pre-training for Language Understanding | Unknown | N/A | |
| Improving robustness against common corruptions by covariate shift adaptation | Unknown | N/A | |
| Hard Shape-Constrained Kernel Machines | Unknown | N/A | |
| Duality-Induced Regularizer for Tensor Factorization Based Knowledge Graph Completion | Unknown | N/A | |
| A Closer Look at the Training Strategy for Modern Meta-Learning | Unknown | N/A | |
| Set2Graph: Learning Graphs From Sets | Unknown | N/A | |
| Reconstructing Perceptive Images from Brain Activity by Shape-Semantic GAN | Unknown | N/A | |
| Adapting Neural Architectures Between Domains | Unknown | N/A | |
| A mean-field analysis of two-player zero-sum games | Unknown | N/A | |
| Self-Supervised Graph Transformer on Large-Scale Molecular Data | Unknown | N/A | |
| Promoting Coordination through Policy Regularization in Multi-Agent Deep Reinforcement Learning | Unknown | N/A | |
| Learning to search efficiently for causally near-optimal treatments | Unknown | N/A | |
| Supervised Contrastive Learning | Unknown | N/A | |
| Introducing Routing Uncertainty in Capsule Networks | Unknown | N/A | |
| Implicit Regularization in Deep Learning May Not Be Explainable by Norms | Unknown | N/A | |
| No Subclass Left Behind: Fine-Grained Robustness in Coarse-Grained Classification Problems | Unknown | N/A | |
| Boosting First-Order Methods by Shifting Objective: New Schemes with Faster Worst-Case Rates | Unknown | N/A | |
| Deep Energy-based Modeling of Discrete-Time Physics | Unknown | N/A | |
| Learning outside the Black-Box: The pursuit of interpretable models | Unknown | N/A | |
| Faster Wasserstein Distance Estimation with the Sinkhorn Divergence | Unknown | N/A | |
| Scalable Graph Neural Networks via Bidirectional Propagation | Unknown | N/A | |
| Gradient Regularized V-Learning for Dynamic Treatment Regimes | Unknown | N/A | |
| FracTrain: Fractionally Squeezing Bit Savings Both Temporally and Spatially for Efficient DNN Training | Unknown | N/A | |
| KFC: A Scalable Approximation Algorithm for $k$-center Fair Clustering | Unknown | N/A | |
| Deep Multimodal Fusion by Channel Exchanging | Unknown | N/A | |
| Minimax Classification with 0-1 Loss and Performance Guarantees | Unknown | N/A | |
| Self-Distillation Amplifies Regularization in Hilbert Space | Unknown | N/A | |
| Fighting Copycat Agents in Behavioral Cloning from Observation Histories | Unknown | N/A | |
| GreedyFool: Distortion-Aware Sparse Adversarial Attack | Unknown | N/A | |
| An Efficient Adversarial Attack for Tree Ensembles | Unknown | N/A | |
| Not All Unlabeled Data are Equal: Learning to Weight Data in Semi-supervised Learning | Unknown | N/A | |
| Domain Adaptation as a Problem of Inference on Graphical Models | Unknown | N/A | |
| Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction | Unknown | N/A | |
| Online Matrix Completion with Side Information | Unknown | N/A | |
| Cascaded Text Generation with Markov Transformers | Unknown | N/A | |
| Rankmax: An Adaptive Projection Alternative to the Softmax Function | Unknown | N/A | |
| Stochastic Deep Gaussian Processes over Graphs | Unknown | N/A | |
| Understanding Approximate Fisher Information for Fast Convergence of Natural Gradient Descent in Wide Neural Networks | Unknown | N/A | |
| Can the Brain Do Backpropagation? --- Exact Implementation of Backpropagation in Predictive Coding Networks | Unknown | N/A | |
| Temporal Spike Sequence Learning via Backpropagation for Deep Spiking Neural Networks | Unknown | N/A | |
| Parametric Instance Classification for Unsupervised Visual Feature learning | Unknown | N/A | |
| Robustness of Bayesian Neural Networks to Gradient-Based Attacks | Unknown | N/A | |
| Regularized linear autoencoders recover the principal components, eventually | Unknown | N/A | |
| A Closer Look at Accuracy vs. Robustness | Unknown | N/A | |
| Theoretical Insights Into Multiclass Classification: A High-dimensional Asymptotic View | Unknown | N/A | |
| Neural Manifold Ordinary Differential Equations | Unknown | N/A | |
| Robust, Accurate Stochastic Optimization for Variational Inference | Unknown | N/A | |
| On the Optimal Weighted $\ell_2$ Regularization in Overparameterized Linear Regression | Unknown | N/A | |
| HiPPO: Recurrent Memory with Optimal Polynomial Projections | Unknown | N/A | |
| Ultrahyperbolic Representation Learning | Unknown | N/A | |
| Fair Multiple Decision Making Through Soft Interventions | Unknown | N/A | |
| Finding the Homology of Decision Boundaries with Active Learning | Unknown | N/A | |
| Neural Sparse Representation for Image Restoration | Unknown | N/A | |
| Reinforcement Learning in Factored MDPs: Oracle-Efficient Algorithms and Tighter Regret Bounds for the Non-Episodic Setting | Unknown | N/A | |
| Unsupervised Representation Learning by Invariance Propagation | Unknown | N/A | |
| How to Learn a Useful Critic? Model-based Action-Gradient-Estimator Policy Optimization | Unknown | N/A | |
| Learning to Adapt to Evolving Domains | Unknown | N/A | |
| Solver-in-the-Loop: Learning from Differentiable Physics to Interact with Iterative PDE-Solvers | Unknown | N/A | |
| SIRI: Spatial Relation Induced Network For Spatial Description Resolution | Unknown | N/A | |
| Kernel Alignment Risk Estimator: Risk Prediction from Training Data | Unknown | N/A | |
| Model Rubik’s Cube: Twisting Resolution, Depth and Width for TinyNets | Unknown | N/A | |
| Attribute Prototype Network for Zero-Shot Learning | Unknown | N/A | |
| Decisions, Counterfactual Explanations and Strategic Behavior | Unknown | N/A | |
| Synthesize, Execute and Debug: Learning to Repair for Neural Program Synthesis | Unknown | N/A | |
| Learning to Prove Theorems by Learning to Generate Theorems | Unknown | N/A | |
| MeshSDF: Differentiable Iso-Surface Extraction | Unknown | N/A | |
| Error Bounds of Imitating Policies and Environments | Unknown | N/A | |
| Causal Discovery from Soft Interventions with Unknown Targets: Characterization and Learning | Unknown | N/A | |
| BERT Loses Patience: Fast and Robust Inference with Early Exit | Unknown | N/A | |
| How does This Interaction Affect Me? Interpretable Attribution for Feature Interactions | Unknown | N/A | |
| Rethinking Learnable Tree Filter for Generic Feature Transform | Unknown | N/A | |
| SOLOv2: Dynamic and Fast Instance Segmentation | Unknown | N/A | |
| Latent Template Induction with Gumbel-CRFs | Unknown | N/A | |
| Comprehensive Attention Self-Distillation for Weakly-Supervised Object Detection | Unknown | N/A | |
| A random matrix analysis of random Fourier features: beyond the Gaussian kernel, a precise phase transition, and the corresponding double descent | Unknown | N/A | |
| Neural Methods for Point-wise Dependency Estimation | Unknown | N/A | |
| Bayesian Deep Ensembles via the Neural Tangent Kernel | Unknown | N/A | |
| Robust Optimization for Fairness with Noisy Protected Groups | Unknown | N/A | |
| Inference Stage Optimization for Cross-scenario 3D Human Pose Estimation | Unknown | N/A | |
| Towards Playing Full MOBA Games with Deep Reinforcement Learning | Unknown | N/A | |
| Robust compressed sensing using generative models | Unknown | N/A | |
| On the Stability and Convergence of Robust Adversarial Reinforcement Learning: A Case Study on Linear Quadratic Systems | Unknown | N/A | |
| Statistical control for spatio-temporal MEG/EEG source imaging with desparsified multi-task Lasso | Unknown | N/A | |
| Flows for simultaneous manifold learning and density estimation | Unknown | N/A | |
| Noise2Same: Optimizing A Self-Supervised Bound for Image Denoising | Unknown | N/A | |
| A graph similarity for deep learning | Unknown | N/A | |
| How does Weight Correlation Affect Generalisation Ability of Deep Neural Networks? | Unknown | N/A | |
| Understanding Anomaly Detection with Deep Invertible Networks through Hierarchies of Distributions and Features | Unknown | N/A | |
| Minimax Bounds for Generalized Linear Models | Unknown | N/A | |
| Near-Optimal SQ Lower Bounds for Agnostically Learning Halfspaces and ReLUs under Gaussian Marginals | Unknown | N/A | |
| Probabilistic Linear Solvers for Machine Learning | Unknown | N/A | |
| Feature Importance Ranking for Deep Learning | Unknown | N/A | |
| MetaPerturb: Transferable Regularizer for Heterogeneous Tasks and Architectures | Unknown | N/A | |
| HyNet: Learning Local Descriptor with Hybrid Similarity Measure and Triplet Loss | Unknown | N/A | |
| Modeling Shared responses in Neuroimaging Studies through MultiView ICA | Unknown | N/A | |
| Beyond accuracy: quantifying trial-by-trial behaviour of CNNs and humans by measuring error consistency | Unknown | N/A | |
| Semialgebraic Optimization for Lipschitz Constants of ReLU Networks | Unknown | N/A | |
| System Identification with Biophysical Constraints: A Circuit Model of the Inner Retina | Unknown | N/A | |
| Auditing Differentially Private Machine Learning: How Private is Private SGD? | Unknown | N/A | |
| Robust Meta-learning for Mixed Linear Regression with Small Batches | Unknown | N/A | |
| Deep active inference agents using Monte-Carlo methods | Unknown | N/A | |
| Direct Feedback Alignment Scales to Modern Deep Learning Tasks and Architectures | Unknown | N/A | |
| ICAM: Interpretable Classification via Disentangled Representations and Feature Attribution Mapping | Unknown | N/A | |
| Coresets via Bilevel Optimization for Continual Learning and Streaming | Unknown | N/A | |
| Explore Aggressively, Update Conservatively: Stochastic Extragradient Methods with Variable Stepsize Scaling | Unknown | N/A | |
| Self-Adaptive Training: beyond Empirical Risk Minimization | Unknown | N/A | |
| On the distance between two neural networks and the stability of learning | Unknown | N/A | |
| GPS-Net: Graph-based Photometric Stereo Network | Unknown | N/A | |
| Adversarial Self-Supervised Contrastive Learning | Unknown | N/A | |
| Domain Generalization for Medical Imaging Classification with Linear-Dependency Regularization | Unknown | N/A | |
| Passport-aware Normalization for Deep Model Protection | Unknown | N/A | |
| Neural Architecture Generator Optimization | Unknown | N/A | |
| The Power of Predictions in Online Control | Unknown | N/A | |
| Multi-label Contrastive Predictive Coding | Unknown | N/A | |
| One-sample Guided Object Representation Disassembling | Unknown | N/A | |
| Learning Dynamic Belief Graphs to Generalize on Text-Based Games | Unknown | N/A | |
| Hybrid Models for Learning to Branch | Unknown | N/A | |
| Provable Overlapping Community Detection in Weighted Graphs | Unknown | N/A | |
| Calibrating CNNs for Lifelong Learning | Unknown | N/A | |
| Learning Deformable Tetrahedral Meshes for 3D Reconstruction | Unknown | N/A | |
| Generalized Focal Loss: Learning Qualified and Distributed Bounding Boxes for Dense Object Detection | Unknown | N/A | |
| Adaptive Reduced Rank Regression | Unknown | N/A | |
| Permute-and-Flip: A new mechanism for differentially private selection | Unknown | N/A | |
| Knowledge Transfer in Multi-Task Deep Reinforcement Learning for Continuous Control | Unknown | N/A | |
| Latent Dynamic Factor Analysis of High-Dimensional Neural Recordings | Unknown | N/A | |
| SE(3)-Transformers: 3D Roto-Translation Equivariant Attention Networks | Unknown | N/A | |
| Avoiding Side Effects in Complex Environments | Unknown | N/A | |
| Adversarial Weight Perturbation Helps Robust Generalization | Unknown | N/A | |
| Graduated Assignment for Joint Multi-Graph Matching and Clustering with Application to Unsupervised Graph Matching Network Learning | Unknown | N/A | |
| A new convergent variant of Q-learning with linear function approximation | Unknown | N/A | |
| A Boolean Task Algebra for Reinforcement Learning | Unknown | N/A | |
| Continuous Regularized Wasserstein Barycenters | Unknown | N/A | |
| Sharpened Generalization Bounds based on Conditional Mutual Information and an Application to Noisy, Iterative Algorithms | Unknown | N/A | |
| Coherent Hierarchical Multi-Label Classification Networks | Unknown | N/A | |
| Learning Disentangled Representations and Group Structure of Dynamical Environments | Unknown | N/A | |
| Provably Efficient Online Hyperparameter Optimization with Population-Based Bandits | Unknown | N/A | |
| Stochastic Normalization | Unknown | N/A | |
| In search of robust measures of generalization | Unknown | N/A | |
| RELATE: Physically Plausible Multi-Object Scene Synthesis Using Structured Latent Spaces | Unknown | N/A | |
| Incorporating BERT into Parallel Sequence Decoding with Adapters | Unknown | N/A | |
| Sharper Generalization Bounds for Pairwise Learning | Unknown | N/A | |
| Hierarchical Quantized Autoencoders | Unknown | N/A | |
| Learning to Utilize Shaping Rewards: A New Approach of Reward Shaping | Unknown | N/A | |
| Provably Consistent Partial-Label Learning | Unknown | N/A | |
| Transferable Calibration with Lower Bias and Variance in Domain Adaptation | Unknown | N/A | |
| ICNet: Intra-saliency Correlation Network for Co-Saliency Detection | Unknown | N/A | |
| Restless-UCB, an Efficient and Low-complexity Algorithm for Online Restless Bandits | Unknown | N/A | |
| A Matrix Chernoff Bound for Markov Chains and Its Application to Co-occurrence Matrices | Unknown | N/A | |
| SGD with shuffling: optimal rates without component convexity and large epoch requirements | Unknown | N/A | |
| A Dictionary Approach to Domain-Invariant Learning in Deep Networks | Unknown | N/A | |
| A Scalable MIP-based Method for Learning Optimal Multivariate Decision Trees | Unknown | N/A | |
| TSPNet: Hierarchical Feature Learning via Temporal Semantic Pyramid for Sign Language Translation | Unknown | N/A | |
| Adversarial Bandits with Corruptions | Unknown | N/A | |
| Active Invariant Causal Prediction: Experiment Selection through Stability | Unknown | N/A | |
| Adaptive Online Estimation of Piecewise Polynomial Trends | Unknown | N/A | |
| Part-dependent Label Noise: Towards Instance-dependent Label Noise | Unknown | N/A | |
| Neural Unsigned Distance Fields for Implicit Function Learning | Unknown | N/A | |
| Almost Optimal Model-Free Reinforcement Learning via Reference-Advantage Decomposition | Unknown | N/A | |
| Near-Optimal Comparison Based Clustering | Unknown | N/A | |
| Robust large-margin learning in hyperbolic space | Unknown | N/A | |
| LoopReg: Self-supervised Learning of Implicit Surface Correspondences, Pose and Shape for 3D Human Mesh Registration | Unknown | N/A | |
| No-Regret Learning and Mixed Nash Equilibria: They Do Not Mix | Unknown | N/A | |
| Cross-Scale Internal Graph Neural Network for Image Super-Resolution | Unknown | N/A | |
| Learning to Extrapolate Knowledge: Transductive Few-shot Out-of-Graph Link Prediction | Unknown | N/A | |
| Blind Video Temporal Consistency via Deep Video Prior | Unknown | N/A | |
| A mathematical model for automatic differentiation in machine learning | Unknown | N/A | |
| Auxiliary Task Reweighting for Minimum-data Learning | Unknown | N/A | |
| Dual T: Reducing Estimation Error for Transition Matrix in Label-noise Learning | Unknown | N/A | |
| OOD-MAML: Meta-Learning for Few-Shot Out-of-Distribution Detection and Classification | Unknown | N/A | |
| Theory-Inspired Path-Regularized Differential Network Architecture Search | Unknown | N/A | |
| Knowledge Augmented Deep Neural Networks for Joint Facial Expression and Action Unit Recognition | Unknown | N/A | |
| Agnostic $Q$-learning with Function Approximation in Deterministic Systems: Near-Optimal Bounds on Approximation Error and Sample Complexity | Unknown | N/A | |
| Compositional Generalization by Learning Analytical Expressions | Unknown | N/A | |
| Memory Based Trajectory-conditioned Policies for Learning from Sparse Rewards | Unknown | N/A | |
| Approximation Based Variance Reduction for Reparameterization Gradients | Unknown | N/A | |
| Swapping Autoencoder for Deep Image Manipulation | Unknown | N/A | |
| ISTA-NAS: Efficient and Consistent Neural Architecture Search by Sparse Coding | Unknown | N/A | |
| Online Fast Adaptation and Knowledge Accumulation (OSAKA): a New Approach to Continual Learning | Unknown | N/A | |
| A Group-Theoretic Framework for Data Augmentation | Unknown | N/A | |
| Fixed-Support Wasserstein Barycenters: Computational Hardness and Fast Algorithm | Unknown | N/A | |
| Learning Individually Inferred Communication for Multi-Agent Cooperation | Unknown | N/A | |
| Exponential ergodicity of mirror-Langevin diffusions | Unknown | N/A | |
| Lifelong Policy Gradient Learning of Factored Policies for Faster Training Without Forgetting | Unknown | N/A | |
| Deep Reinforcement and InfoMax Learning | Unknown | N/A | |
| DISK: Learning local features with policy gradient | Unknown | N/A | |
| Content Provider Dynamics and Coordination in Recommendation Ecosystems | Unknown | N/A | |
| Projection Robust Wasserstein Distance and Riemannian Optimization | Unknown | N/A | |
| Fast Adversarial Robustness Certification of Nearest Prototype Classifiers for Arbitrary Seminorms | Unknown | N/A | |
| Proximity Operator of the Matrix Perspective Function and its Applications | Unknown | N/A | |
| Robust Quantization: One Model to Rule Them All | Unknown | N/A | |
| Black-Box Certification with Randomized Smoothing: A Functional Optimization Based Framework | Unknown | N/A | |
| Backpropagating Linearly Improves Transferability of Adversarial Examples | Unknown | N/A | |
| Dual-Free Stochastic Decentralized Optimization with Variance Reduction | Unknown | N/A | |
| CodeCMR: Cross-Modal Retrieval For Function-Level Binary Source Code Matching | Unknown | N/A | |
| Compositional Zero-Shot Learning via Fine-Grained Dense Feature Composition | Unknown | N/A | |
| Gradient Surgery for Multi-Task Learning | Unknown | N/A | |
| On the Trade-off between Adversarial and Backdoor Robustness | Unknown | N/A | |
| Graph Cross Networks with Vertex Infomax Pooling | Unknown | N/A | |
| MetaSDF: Meta-Learning Signed Distance Functions | Unknown | N/A | |
| Adaptive Gradient Quantization for Data-Parallel SGD | Unknown | N/A | |
| Coupling-based Invertible Neural Networks Are Universal Diffeomorphism Approximators | Unknown | N/A | |
| Deep reconstruction of strange attractors from time series | Unknown | N/A | |
| SDF-SRN: Learning Signed Distance 3D Object Reconstruction from Static Images | Unknown | N/A | |
| Once-for-All Adversarial Training: In-Situ Tradeoff between Robustness and Accuracy for Free | Unknown | N/A | |
| Evolving Graphical Planner: Contextual Global Planning for Vision-and-Language Navigation | Unknown | N/A | |
| Can Q-Learning with Graph Networks Learn a Generalizable Branching Heuristic for a SAT Solver? | Unknown | N/A | |
| Accelerating Training of Transformer-Based Language Models with Progressive Layer Dropping | Unknown | N/A | |
| Stable and expressive recurrent vision models | Unknown | N/A | |
| Deep Wiener Deconvolution: Wiener Meets Deep Learning for Image Deblurring | Unknown | N/A | |
| Texture Interpolation for Probing Visual Perception | Unknown | N/A | |
| Off-Policy Imitation Learning from Observations | Unknown | N/A | |
| AdaTune: Adaptive Tensor Program Compilation Made Efficient | Unknown | N/A | |
| Neural Non-Rigid Tracking | Unknown | N/A | |
| Least Squares Regression with Markovian Data: Fundamental Limits and Algorithms | Unknown | N/A | |
| The Diversified Ensemble Neural Network | Unknown | N/A | |
| Breaking the Sample Size Barrier in Model-Based Reinforcement Learning with a Generative Model | Unknown | N/A | |
| Residual Force Control for Agile Human Behavior Imitation and Extended Motion Synthesis | Unknown | N/A | |
| Lamina-specific neuronal properties promote robust, stable signal propagation in feedforward networks | Unknown | N/A | |
| The Generalized Lasso with Nonlinear Observations and Generative Priors | Unknown | N/A | |
| Neural Message Passing for Multi-Relational Ordered and Recursive Hypergraphs | Unknown | N/A | |
| Learning Loss for Test-Time Augmentation | Unknown | N/A | |
| Learning to Learn Variational Semantic Memory | Unknown | N/A | |
| Projection Efficient Subgradient Method and Optimal Nonsmooth Frank-Wolfe Method | Unknown | N/A | |
| The Pitfalls of Simplicity Bias in Neural Networks | Unknown | N/A | |
| LAPAR: Linearly-Assembled Pixel-Adaptive Regression Network for Single Image Super-resolution and Beyond | Unknown | N/A | |
| Learning Multi-Agent Coordination for Enhancing Target Coverage in Directional Sensor Networks | Unknown | N/A | |
| Sparse Learning with CART | Unknown | N/A | |
| Learning About Objects by Learning to Interact with Them | Unknown | N/A | |
| Fast and Flexible Temporal Point Processes with Triangular Maps | Unknown | N/A | |
| UWSOD: Toward Fully-Supervised-Level Capacity Weakly Supervised Object Detection | Unknown | N/A | |
| Deep Diffusion-Invariant Wasserstein Distributional Classification | Unknown | N/A | |
| Kernel Based Progressive Distillation for Adder Neural Networks | Unknown | N/A | |
| Finite-Sample Analysis of Contractive Stochastic Approximation Using Smooth Convex Envelopes | Unknown | N/A | |
| Beta R-CNN: Looking into Pedestrian Detection from Another Perspective | Unknown | N/A | |
| HOI Analysis: Integrating and Decomposing Human-Object Interaction | Unknown | N/A | |
| Softmax Deep Double Deterministic Policy Gradients | Unknown | N/A | |
| RANet: Region Attention Network for Semantic Segmentation | Unknown | N/A | |
| Practical Quasi-Newton Methods for Training Deep Neural Networks | Unknown | N/A | |
| Lightweight Generative Adversarial Networks for Text-Guided Image Manipulation | Unknown | N/A | |
| PIE-NET: Parametric Inference of Point Cloud Edges | Unknown | N/A | |
| A Ranking-based, Balanced Loss Function Unifying Classification and Localisation in Object Detection | Unknown | N/A | |
| HM-ANN: Efficient Billion-Point Nearest Neighbor Search on Heterogeneous Memory | Unknown | N/A | |
| Continual Learning with Node-Importance based Adaptive Group Sparse Regularization | Unknown | N/A | |
| SCOP: Scientific Control for Reliable Neural Network Pruning | Unknown | N/A | |
| Learning to Orient Surfaces by Self-supervised Spherical CNNs | Unknown | N/A | |
| Assessing SATNet's Ability to Solve the Symbol Grounding Problem | Unknown | N/A | |
| Unfolding the Alternating Optimization for Blind Super Resolution | Unknown | N/A | |
| StratLearner: Learning a Strategy for Misinformation Prevention in Social Networks | Unknown | N/A | |
| Adversarial Style Mining for One-Shot Unsupervised Domain Adaptation | Unknown | N/A | |
| Group Contextual Encoding for 3D Point Clouds | Unknown | N/A | |
| Pruning Filter in Filter | Unknown | N/A | |
| Towards Crowdsourced Training of Large Neural Networks using Decentralized Mixture-of-Experts | Unknown | N/A | |
| Adversarially Robust Few-Shot Learning: A Meta-Learning Approach | Unknown | N/A | |
| Human Parsing Based Texture Transfer from Single Image to 3D Human via Cross-View Consistency | Unknown | N/A | |
| RNNPool: Efficient Non-linear Pooling for RAM Constrained Inference | Unknown | N/A | |
| Beyond Individualized Recourse: Interpretable and Interactive Summaries of Actionable Recourses | Unknown | N/A | |
| Towards Theoretically Understanding Why SGD Generalizes Better Than Adam in Deep Learning | Unknown | N/A | |
| Kalman Filtering Attention for User Behavior Modeling in CTR Prediction | Unknown | N/A | |
| Learning from Positive and Unlabeled Data with Arbitrary Positive Shift | Unknown | N/A | |
| Self-paced Contrastive Learning with Hybrid Memory for Domain Adaptive Object Re-ID | Unknown | N/A | |
| Sample Complexity of Uniform Convergence for Multicalibration | Unknown | N/A | |
| Diversity-Guided Multi-Objective Bayesian Optimization With Batch Evaluations | Unknown | N/A | |
| A Class of Algorithms for General Instrumental Variable Models | Unknown | N/A | |
| EcoLight: Intersection Control in Developing Regions Under Extreme Budget and Network Constraints | Unknown | N/A | |
| Transfer Learning via $\ell_1$ Regularization | Unknown | N/A | |
| What if Neural Networks had SVDs? | Unknown | N/A | |
| Searching for Low-Bit Weights in Quantized Neural Networks | Unknown | N/A | |
| Information Theoretic Counterfactual Learning from Missing-Not-At-Random Feedback | Unknown | N/A | |
| Uncertainty-Aware Learning for Zero-Shot Semantic Segmentation | Unknown | N/A | |
| AvE: Assistance via Empowerment | Unknown | N/A | |
| Minibatch vs Local SGD for Heterogeneous Distributed Learning | Unknown | N/A | |
| Hierarchically Organized Latent Modules for Exploratory Search in Morphogenetic Systems | Unknown | N/A | |
| DVERGE: Diversifying Vulnerabilities for Enhanced Robust Generation of Ensembles | Unknown | N/A | |
| Accelerating Reinforcement Learning through GPU Atari Emulation | Unknown | N/A | |
| Unsupervised Learning of Lagrangian Dynamics from Images for Prediction and Control | Unknown | N/A | |
| Provably Efficient Exploration for Reinforcement Learning Using Unsupervised Learning | Unknown | N/A | |
| Practical Low-Rank Communication Compression in Decentralized Deep Learning | Unknown | N/A | |
| Wisdom of the Ensemble: Improving Consistency of Deep Learning Models | Unknown | N/A | |
| Every View Counts: Cross-View Consistency in 3D Object Detection with Hybrid-Cylindrical-Spherical Voxelization | Unknown | N/A | |
| A Measure-Theoretic Approach to Kernel Conditional Mean Embeddings | Unknown | N/A | |
| An Unbiased Risk Estimator for Learning with Augmented Classes | Unknown | N/A | |
| Likelihood Regret: An Out-of-Distribution Detection Score For Variational Auto-encoder | Unknown | N/A | |
| A Tight Lower Bound and Efficient Reduction for Swap Regret | Unknown | N/A | |
| Improved Schemes for Episodic Memory-based Lifelong Learning | Unknown | N/A | |
| Knowledge Distillation in Wide Neural Networks: Risk Bound, Data Efficiency and Imperfect Teacher | Unknown | N/A | |
| The Adaptive Complexity of Maximizing a Gross Substitutes Valuation | Unknown | N/A | |
| UnModNet: Learning to Unwrap a Modulo Image for High Dynamic Range Imaging | Unknown | N/A | |
| PLANS: Neuro-Symbolic Program Learning from Videos | Unknown | N/A | |
| Decision-Making with Auto-Encoding Variational Bayes | Unknown | N/A | |
| Weakly Supervised Deep Functional Maps for Shape Matching | Unknown | N/A | |
| Optimal Algorithms for Stochastic Multi-Armed Bandits with Heavy Tailed Rewards | Unknown | N/A | |
| Bayesian Probabilistic Numerical Integration with Tree-Based Models | Unknown | N/A | |
| Fairness constraints can help exact inference in structured prediction | Unknown | N/A | |
| Multiparameter Persistence Image for Topological Machine Learning | Unknown | N/A | |
| Heuristic Domain Adaptation | Unknown | N/A | |
| Probabilistic Time Series Forecasting with Shape and Temporal Diversity | Unknown | N/A | |
| Rel3D: A Minimally Contrastive Benchmark for Grounding Spatial Relations in 3D | Unknown | N/A | |
| Labelling unlabelled videos from scratch with multi-modal self-supervision | Unknown | N/A | |
| On the Convergence of Smooth Regularized Approximate Value Iteration Schemes | Unknown | N/A | |
| Reliable Graph Neural Networks via Robust Aggregation | Unknown | N/A | |
| Random Walk Graph Neural Networks | Unknown | N/A | |
| AOT: Appearance Optimal Transport Based Identity Swapping for Forgery Detection | Unknown | N/A | |
| ConvBERT: Improving BERT with Span-based Dynamic Convolution | Unknown | N/A | |
| On the training dynamics of deep networks with $L_2$ regularization | Unknown | N/A | |
| Iterative Deep Graph Learning for Graph Neural Networks: Better and Robust Node Embeddings | Unknown | N/A | |
| Prophet Attention: Predicting Attention with Future Attention | Unknown | N/A | |
| Unsupervised Learning of Visual Features by Contrasting Cluster Assignments | Unknown | N/A | |
| On the Tightness of Semidefinite Relaxations for Certifying Robustness to Adversarial Examples | Unknown | N/A | |
| 3D Self-Supervised Methods for Medical Imaging | Unknown | N/A | |
| On the Value of Out-of-Distribution Testing: An Example of Goodhart's Law | Unknown | N/A | |
| Modeling Continuous Stochastic Processes with Dynamic Normalizing Flows | Unknown | N/A | |
| SMYRF - Efficient Attention using Asymmetric Clustering | Unknown | N/A | |
| Sanity-Checking Pruning Methods: Random Tickets can Win the Jackpot | Unknown | N/A | |
| Kernelized information bottleneck leads to biologically plausible 3-factor Hebbian learning in deep networks | Unknown | N/A | |
| Self-supervised Co-Training for Video Representation Learning | Unknown | N/A | |
| Further Analysis of Outlier Detection with Deep Generative Models | Unknown | N/A | |
| Hamiltonian Monte Carlo using an adjoint-differentiated Laplace approximation: Bayesian inference for latent Gaussian models and beyond | Unknown | N/A | |
| Bandit Linear Control | Unknown | N/A | |
| Is normalization indispensable for training deep neural network? | Unknown | N/A | |
| Unsupervised Sound Separation Using Mixture Invariant Training | Unknown | N/A | |
| VAEM: a Deep Generative Model for Heterogeneous Mixed Type Data | Unknown | N/A | |
| Space-Time Correspondence as a Contrastive Random Walk | Unknown | N/A | |
| On Warm-Starting Neural Network Training | Unknown | N/A | |
| Generative View Synthesis: From Single-view Semantics to Novel-view Images | Unknown | N/A | |
| Critic Regularized Regression | Unknown | N/A | |
| Tree! I am no Tree! I am a low dimensional Hyperbolic Embedding | Unknown | N/A | |
| Learnability with Indirect Supervision Signals | Unknown | N/A | |
| Adaptive Probing Policies for Shortest Path Routing | Unknown | N/A | |
| CoinPress: Practical Private Mean and Covariance Estimation | Unknown | N/A | |
| Sharp Representation Theorems for ReLU Networks with Precise Dependence on Depth | Unknown | N/A | |
| Private Identity Testing for High-Dimensional Distributions | Unknown | N/A | |
| Maximum-Entropy Adversarial Data Augmentation for Improved Generalization and Robustness | Unknown | N/A | |
| Meta-Gradient Reinforcement Learning with an Objective Discovered Online | Unknown | N/A | |
| Neural Networks with Small Weights and Depth-Separation Barriers | Unknown | N/A | |
| Riemannian Continuous Normalizing Flows | Unknown | N/A | |
| Path Integral Based Convolution and Pooling for Graph Neural Networks | Unknown | N/A | |
| Modeling Task Effects on Meaning Representation in the Brain via Zero-Shot MEG Prediction | Unknown | N/A | |
| Certifying Confidence via Randomized Smoothing | Unknown | N/A | |
| A meta-learning approach to (re)discover plasticity rules that carve a desired function into a neural network | Unknown | N/A | |
| A General Method for Robust Learning from Batches | Unknown | N/A | |
| Few-shot Image Generation with Elastic Weight Consolidation | Unknown | N/A | |
| Dual Instrumental Variable Regression | Unknown | N/A | |
| Learning Kernel Tests Without Data Splitting | Unknown | N/A | |
| Towards More Practical Adversarial Attacks on Graph Neural Networks | Unknown | N/A | |
| Quantifying Learnability and Describability of Visual Concepts Emerging in Representation Learning | Unknown | N/A | |
| f-Divergence Variational Inference | Unknown | N/A | |
| Implicit Graph Neural Networks | Unknown | N/A | |
| Robust Recursive Partitioning for Heterogeneous Treatment Effects with Uncertainty Quantification | Unknown | N/A | |
| Quantitative Propagation of Chaos for SGD in Wide Neural Networks | Unknown | N/A | |
| A Generalized Neural Tangent Kernel Analysis for Two-layer Neural Networks | Unknown | N/A | |
| Learning Some Popular Gaussian Graphical Models without Condition Number Bounds | Unknown | N/A | |
| Neural Networks Fail to Learn Periodic Functions and How to Fix It | Unknown | N/A | |
| Strongly Incremental Constituency Parsing with Graph Neural Networks | Unknown | N/A | |
| Improved Variational Bayesian Phylogenetic Inference with Normalizing Flows | Unknown | N/A | |
| Improving model calibration with accuracy versus uncertainty optimization | Unknown | N/A | |
| Continuous Surface Embeddings | Unknown | N/A | |
| A General Large Neighborhood Search Framework for Solving Integer Linear Programs | Unknown | N/A | |
| High-contrast “gaudy” images improve the training of deep neural network models of visual cortex | Unknown | N/A | |
| Simple and Fast Algorithm for Binary Integer and Online Linear Programming | Unknown | N/A | |
| OTLDA: A Geometry-aware Optimal Transport Approach for Topic Modeling | Unknown | N/A | |
| CO-Optimal Transport | Unknown | N/A | |
| Explicit Regularisation in Gaussian Noise Injections | Unknown | N/A | |
| Assisted Learning: A Framework for Multi-Organization Learning | Unknown | N/A | |
| Deep Smoothing of the Implied Volatility Surface | Unknown | N/A | |
| Limits to Depth Efficiencies of Self-Attention | Unknown | N/A | |
| A Unifying View of Optimism in Episodic Reinforcement Learning | Unknown | N/A | |
| Adversarial Distributional Training for Robust Deep Learning | Unknown | N/A | |
| GPU-Accelerated Primal Learning for Extremely Fast Large-Scale Classification | Unknown | N/A | |
| BoTorch: A Framework for Efficient Monte-Carlo Bayesian Optimization | Unknown | N/A | |
| Multilabel Classification by Hierarchical Partitioning and Data-dependent Grouping | Unknown | N/A | |
| A Study on Encodings for Neural Architecture Search | Unknown | N/A | |
| Task-Robust Model-Agnostic Meta-Learning | Unknown | N/A | |
| On the equivalence of molecular graph convolution and molecular wave function with poor basis set | Unknown | N/A | |
| One-bit Supervision for Image Classification | Unknown | N/A | |
| CompRess: Self-Supervised Learning by Compressing Representations | Unknown | N/A | |
| Learning Global Transparent Models consistent with Local Contrastive Explanations | Unknown | N/A | |
| Extrapolation Towards Imaginary 0-Nearest Neighbour and Its Improved Convergence Rate | Unknown | N/A | |
| Learning discrete distributions: user vs item-level privacy | Unknown | N/A | |
| Self-Learning Transformations for Improving Gaze and Head Redirection | Unknown | N/A | |
| Posterior Network: Uncertainty Estimation without OOD Samples via Density-Based Pseudo-Counts | Unknown | N/A | |
| Robust Density Estimation under Besov IPM Losses | Unknown | N/A | |
| Stochasticity of Deterministic Gradient Descent: Large Learning Rate for Multiscale Objective Function | Unknown | N/A | |
| Object Goal Navigation using Goal-Oriented Semantic Exploration | Unknown | N/A | |
| Hausdorff Dimension, Heavy Tails, and Generalization in Neural Networks | Unknown | N/A | |
| PAC-Bayes Analysis Beyond the Usual Bounds | Unknown | N/A | |
| Dynamical mean-field theory for stochastic gradient descent in Gaussian mixture classification | Unknown | N/A | |
| Rewriting History with Inverse RL: Hindsight Inference for Policy Improvement | Unknown | N/A | |
| Video Frame Interpolation without Temporal Priors | Unknown | N/A | |
| Automatic Perturbation Analysis for Scalable Certified Robustness and Beyond | Unknown | N/A | |
| Provably Efficient Neural Estimation of Structural Equation Models: An Adversarial Approach | Unknown | N/A | |
| Bad Global Minima Exist and SGD Can Reach Them | Unknown | N/A | |
| Efficient Exact Verification of Binarized Neural Networks | Unknown | N/A | |
| Myersonian Regression | Unknown | N/A | |
| Learning under Model Misspecification: Applications to Variational and Ensemble methods | Unknown | N/A | |
| Generating Correct Answers for Progressive Matrices Intelligence Tests | Unknown | N/A | |
| Universally Quantized Neural Compression | Unknown | N/A | |
| The Strong Screening Rule for SLOPE | Unknown | N/A | |
| Displacement-Invariant Matching Cost Learning for Accurate Optical Flow Estimation | Unknown | N/A | |
| Latent Dynamic Factor Analysis of High-Dimensional Neural Recordings | Unknown | N/A | |
| Evolving Normalization-Activation Layers | Unknown | N/A | |
| Curriculum By Smoothing | Unknown | N/A | |
| On Completeness-aware Concept-Based Explanations in Deep Neural Networks | Unknown | N/A | |
| Directional Pruning of Deep Neural Networks | Unknown | N/A | |
| Task-Oriented Feature Distillation | Unknown | N/A | |
| Analytic Characterization of the Hessian in Shallow ReLU Models: A Tale of Symmetry | Unknown | N/A | |
| Minibatch Stochastic Approximate Proximal Point Methods | Unknown | N/A | |
| LoCo: Local Contrastive Representation Learning | Unknown | N/A | |
| Finding Second-Order Stationary Points Efficiently in Smooth Nonconvex Linearly Constrained Optimization Problems | Unknown | N/A | |
| Multi-task Batch Reinforcement Learning with Metric Learning | Unknown | N/A | |
| Evaluating Attribution for Graph Neural Networks | Unknown | N/A | |
| Learning Strategic Network Emergence Games | Unknown | N/A | |
| Detecting Hands and Recognizing Physical Contact in the Wild | Unknown | N/A | |
| Focus of Attention Improves Information Transfer in Visual Features | Unknown | N/A | |
| Adversarial Attacks on Linear Contextual Bandits | Unknown | N/A | |
| Neural Path Features and Neural Path Kernel: Understanding the role of gates in deep learning | Unknown | N/A | |
| Deep Transformation-Invariant Clustering | Unknown | N/A | |
| HYDRA: Pruning Adversarially Robust Neural Networks | Unknown | N/A | |
| Higher-Order Certification For Randomized Smoothing | Unknown | N/A | |
| Exactly Computing the Local Lipschitz Constant of ReLU Networks | Unknown | N/A | |
| A Discrete Variational Recurrent Topic Model without the Reparametrization Trick | Unknown | N/A | |
| Learning to Learn with Feedback and Local Plasticity | Unknown | N/A | |
| Model Interpretability through the Lens of Computational Complexity | Unknown | N/A | |
| GRAF: Generative Radiance Fields for 3D-Aware Image Synthesis | Unknown | N/A | |
| A Unified View of Label Shift Estimation | Unknown | N/A | |
| Distributional Robustness with IPMs and links to Regularization and GANs | Unknown | N/A | |
| On the universality of deep learning | Unknown | N/A | |
| CogLTX: Applying BERT to Long Texts | Unknown | N/A | |
| What shapes feature representations? Exploring datasets, architectures, and training | Unknown | N/A | |
| Better Full-Matrix Regret via Parameter-Free Online Learning | Unknown | N/A | |
| Video Object Segmentation with Adaptive Feature Bank and Uncertain-Region Refinement | Unknown | N/A | |
| On Correctness of Automatic Differentiation for Non-Differentiable Functions | Unknown | N/A | |
| Consistent Estimation of Identifiable Nonparametric Mixture Models from Grouped Observations | Unknown | N/A | |
| GAN Memory with No Forgetting | Unknown | N/A | |
| Approximate Heavily-Constrained Learning with Lagrange Multiplier Models | Unknown | N/A | |
| Denoising Diffusion Probabilistic Models | Unknown | N/A | |
| Variational Bayesian Monte Carlo with Noisy Likelihoods | Unknown | N/A | |
| SVGD as a kernelized Wasserstein gradient flow of the chi-squared divergence | Unknown | N/A | |
| Exchangeable Neural ODE for Set Modeling | Unknown | N/A | |
| Bootstrapping neural processes | Unknown | N/A | |
| Robust and Heavy-Tailed Mean Estimation Made Simple, via Regret Minimization | Unknown | N/A | |
| Is Long Horizon RL More Difficult Than Short Horizon RL? | Unknown | N/A | |
| Graph Information Bottleneck | Unknown | N/A | |
| Inverse Reinforcement Learning from a Gradient-based Learner | Unknown | N/A | |
| Regret Bounds without Lipschitz Continuity: Online Learning with Relative-Lipschitz Losses | Unknown | N/A | |
| Statistical and Topological Properties of Sliced Probability Divergences | Unknown | N/A | |
| Benchmarking Deep Inverse Models over time, and the Neural-Adjoint method | Unknown | N/A | |
| Effective Dimension Adaptive Sketching Methods for Faster Regularized Least-Squares Optimization | Unknown | N/A | |
| SAC: Accelerating and Structuring Self-Attention via Sparse Adaptive Connection | Unknown | N/A | |
| WOR and $p$'s: Sketches for $\ell_p$-Sampling Without Replacement | Unknown | N/A | |
| Robust Persistence Diagrams using Reproducing Kernels | Unknown | N/A | |
| Biologically Inspired Mechanisms for Adversarial Robustness | Unknown | N/A | |
| Variance reduction for Random Coordinate Descent-Langevin Monte Carlo | Unknown | N/A | |
| Predictive inference is free with the jackknife+-after-bootstrap | Unknown | N/A | |
| Online learning with dynamics: A minimax perspective | Unknown | N/A | |
| Spectra of the Conjugate Kernel and Neural Tangent Kernel for linear-width neural networks | Unknown | N/A | |
| Improving Online Rent-or-Buy Algorithms with Sequential Decision Making and ML Predictions | Unknown | N/A | |
| Submodular Maximization Through Barrier Functions | Unknown | N/A | |
| Optimization and Generalization Analysis of Transduction through Gradient Boosting and Application to Multi-scale Graph Neural Networks | Unknown | N/A | |
| How many samples is a good initial point worth in Low-rank Matrix Recovery? | Unknown | N/A | |
| Finite Versus Infinite Neural Networks: an Empirical Study | Unknown | N/A | |
| Online Planning with Lookahead Policies | Unknown | N/A | |
| Non-Convex SGD Learns Halfspaces with Adversarial Label Noise | Unknown | N/A | |
| Improving Sparse Vector Technique with Renyi Differential Privacy | Unknown | N/A | |
| Self-Supervised Learning by Cross-Modal Audio-Video Clustering | Unknown | N/A | |
| Watch out! Motion is Blurring the Vision of Your Deep Neural Networks | Unknown | N/A | |
| Diverse Image Captioning with Context-Object Split Latent Spaces | Unknown | N/A | |
| Sample-Efficient Reinforcement Learning of Undercomplete POMDPs | Unknown | N/A | |
| A Novel Automated Curriculum Strategy to Solve Hard Sokoban Planning Instances | Unknown | N/A | |
| The Discrete Gaussian for Differential Privacy | Unknown | N/A | |
| Relative gradient optimization of the Jacobian term in unsupervised deep learning | Unknown | N/A | |
| Implicit Neural Representations with Periodic Activation Functions | Unknown | N/A | |
| Autoregressive Score Matching | Unknown | N/A | |
| Preference learning along multiple criteria: A game-theoretic perspective | Unknown | N/A | |
| TaylorGAN: Neighbor-Augmented Policy Update Towards Sample-Efficient Natural Language Generation | Unknown | N/A | |
| Estimating the Effects of Continuous-valued Interventions using Generative Adversarial Networks | Unknown | N/A | |
| Towards Learning Convolutions from Scratch | Unknown | N/A | |
| Learning Differential Equations that are Easy to Solve | Unknown | N/A | |
| Online Algorithm for Unsupervised Sequential Selection with Contextual Information | Unknown | N/A | |
| A Simple and Efficient Smoothing Method for Faster Optimization and Local Exploration | Unknown | N/A | |
| Compositional Generalization via Neural-Symbolic Stack Machines | Unknown | N/A | |
| Post-training Iterative Hierarchical Data Augmentation for Deep Networks | Unknown | N/A | |
| AutoPrivacy: Automated Layer-wise Parameter Selection for Secure Neural Network Inference | Unknown | N/A | |
| BOSS: Bayesian Optimization over String Spaces | Unknown | N/A | |
| Adversarial Training is a Form of Data-dependent Operator Norm Regularization | Unknown | N/A | |
| Trust the Model When It Is Confident: Masked Model-based Actor-Critic | Unknown | N/A | |
| Avoiding Side Effects By Considering Future Tasks | Unknown | N/A | |
| Implicit Rank-Minimizing Autoencoder | Unknown | N/A | |
| RetroXpert: Decompose Retrosynthesis Prediction Like A Chemist | Unknown | N/A | |
| Decentralized Langevin Dynamics for Bayesian Learning | Unknown | N/A | |
| Monotone operator equilibrium networks | Unknown | N/A | |
| Sequence to Multi-Sequence Learning via Conditional Chain Mapping for Mixture Signals | Unknown | N/A | |
| Neurosymbolic Transformers for Multi-Agent Communication | Unknown | N/A | |
| Federated Principal Component Analysis | Unknown | N/A | |
| When Do Neural Networks Outperform Kernel Methods? | Unknown | N/A | |
| Universal Domain Adaptation through Self Supervision | Unknown | N/A | |
| Uncertainty-aware Self-training for Few-shot Text Classification | Unknown | N/A | |
| Differentially-Private Federated Linear Bandits | Unknown | N/A | |
| What went wrong and when? Instance-wise feature importance for time-series black-box models | Unknown | N/A | |
| Achieving Equalized Odds by Resampling Sensitive Attributes | Unknown | N/A | |
| Model Agnostic Multilevel Explanations | Unknown | N/A | |
| Online MAP Inference of Determinantal Point Processes | Unknown | N/A | |
| High-Dimensional Bayesian Optimization via Nested Riemannian Manifolds | Unknown | N/A | |
| Multi-agent active perception with prediction rewards | Unknown | N/A | |
| Synbols: Probing Learning Algorithms with Synthetic Datasets | Unknown | N/A | |
| Learning Augmented Energy Minimization via Speed Scaling | Unknown | N/A | |
| Efficient Projection-free Algorithms for Saddle Point Problems | Unknown | N/A | |
| Improved Guarantees for k-means++ and k-means++ Parallel | Unknown | N/A | |
| Dissecting Neural ODEs | Unknown | N/A | |
| Higher-Order Spectral Clustering of Directed Graphs | Unknown | N/A | |
| Ensembling geophysical models with Bayesian Neural Networks | Unknown | N/A | |
| Optimal Private Median Estimation under Minimal Distributional Assumptions | Unknown | N/A | |
| Rethinking Importance Weighting for Deep Learning under Distribution Shift | Unknown | N/A | |
| Removing Bias in Multi-modal Classifiers: Regularization by Maximizing Functional Entropies | Unknown | N/A | |
| Recovery of sparse linear classifiers from mixture of responses | Unknown | N/A | |
| Neural Execution Engines: Learning to Execute Subroutines | Unknown | N/A | |
| The Autoencoding Variational Autoencoder | Unknown | N/A | |
| Generative 3D Part Assembly via Dynamic Graph Learning | Unknown | N/A | |
| Unsupervised Text Generation by Learning from Search | Unknown | N/A | |
| Characterizing Optimal Mixed Policies: Where to Intervene and What to Observe | Unknown | N/A | |
| Logarithmic Pruning is All You Need | Unknown | N/A | |
| Online Adaptation for Consistent Mesh Reconstruction in the Wild | Unknown | N/A | |
| CaSPR: Learning Canonical Spatiotemporal Point Cloud Representations | Unknown | N/A | |
| Meta-Neighborhoods | Unknown | N/A | |
| Fair Performance Metric Elicitation | Unknown | N/A | |
| AI Feynman 2.0: Pareto-optimal symbolic regression exploiting graph modularity | Unknown | N/A | |
| Curriculum learning for multilevel budgeted combinatorial problems | Unknown | N/A | |
| Ratio Trace Formulation of Wasserstein Discriminant Analysis | Unknown | N/A | |
| Self-Supervised Relationship Probing | Unknown | N/A | |
| Hard Negative Mixing for Contrastive Learning | Unknown | N/A | |
| Self-Supervised Generative Adversarial Compression | Unknown | N/A | |
| Election Coding for Distributed Learning: Protecting SignSGD against Byzantine Attacks | Unknown | N/A | |
| Domain Generalization via Entropy Regularization | Unknown | N/A | |
| Multi-Task Temporal Shift Attention Networks for On-Device Contactless Vitals Measurement | Unknown | N/A | |
| Equivariant Networks for Hierarchical Structures | Unknown | N/A | |
| Model Fusion via Optimal Transport | Unknown | N/A | |
| H-Mem: Harnessing synaptic plasticity with Hebbian Memory Networks | Unknown | N/A | |
| Robustness of Community Detection to Random Geometric Perturbations | Unknown | N/A | |
| Co-Tuning for Transfer Learning | Unknown | N/A | |
| A new inference approach for training shallow and deep generalized linear models of noisy interacting neurons | Unknown | N/A | |
| Optimal Adaptive Electrode Selection to Maximize Simultaneously Recorded Neuron Yield | Unknown | N/A | |
| Tight Nonparametric Convergence Rates for Stochastic Gradient Descent under the Noiseless Linear Model | Unknown | N/A | |
| Towards Problem-dependent Optimal Learning Rates | Unknown | N/A | |
| Towards Convergence Rate Analysis of Random Forests for Classification | Unknown | N/A | |
| Neural Controlled Differential Equations for Irregular Time Series | Unknown | N/A | |
| Variance Reduction via Accelerated Dual Averaging for Finite-Sum Optimization | Unknown | N/A | |
| Unsupervised Semantic Aggregation and Deformable Template Matching for Semi-Supervised Learning | Unknown | N/A | |
| High-recall causal discovery for autocorrelated time series with latent confounders | Unknown | N/A | |
| The Smoothed Possibility of Social Choice | Unknown | N/A | |
| R-learning in actor-critic model offers a biologically relevant mechanism for sequential decision-making | Unknown | N/A | |
| Hard Example Generation by Texture Synthesis for Cross-domain Shape Similarity Learning | Unknown | N/A | |
| Uncovering the Topology of Time-Varying fMRI Data using Cubical Persistence | Unknown | N/A | |
| Parameterized Explainer for Graph Neural Network | Unknown | N/A | |
| Finding All $\epsilon$-Good Arms in Stochastic Bandits | Unknown | N/A | |
| Cream of the Crop: Distilling Prioritized Paths For One-Shot Neural Architecture Search | Unknown | N/A | |
| Efficient Online Learning of Optimal Rankings: Dimensionality Reduction via Gradient Descent | Unknown | N/A | |
| AdaBelief Optimizer: Adapting Stepsizes by the Belief in Observed Gradients | Unknown | N/A | |
| Deep Archimedean Copulas | Unknown | N/A | |
| Deep Transformers with Latent Depth | Unknown | N/A | |
| Learning the Linear Quadratic Regulator from Nonlinear Observations | Unknown | N/A | |
| Factor Graph Neural Networks | Unknown | N/A | |
| Teaching a GAN What Not to Learn | Unknown | N/A | |
| RelationNet++: Bridging Visual Representations for Object Detection via Transformer Decoder | Unknown | N/A | |
| Learning Causal Effects via Weighted Empirical Risk Minimization | Unknown | N/A | |
| Audeo: Audio Generation for a Silent Performance Video | Unknown | N/A | |
| Stochastic Normalizing Flows | Unknown | N/A | |
| Learning Bounds for Risk-sensitive Learning | Unknown | N/A | |
| Instance-wise Feature Grouping | Unknown | N/A | |
| On the Power of Louvain in the Stochastic Block Model | Unknown | N/A | |
| Variational Bayesian Unlearning | Unknown | N/A | |
| Consequences of Misaligned AI | Unknown | N/A | |
| Canonical 3D Deformer Maps: Unifying parametric and non-parametric methods for dense weakly-supervised category reconstruction | Unknown | N/A | |
| Fast Adaptive Non-Monotone Submodular Maximization Subject to a Knapsack Constraint | Unknown | N/A | |
| A game-theoretic analysis of networked system control for common-pool resource management using multi-agent reinforcement learning | Unknown | N/A | |
| A Flexible Framework for Designing Trainable Priors with Adaptive Smoothing and Game Encoding | Unknown | N/A | |
| RD$^2$: Reward Decomposition with Representation Decomposition | Unknown | N/A | |
| Modeling Noisy Annotations for Crowd Counting | Unknown | N/A | |
| Robust Correction of Sampling Bias using Cumulative Distribution Functions | Unknown | N/A | |
| Graph Geometry Interaction Learning | Unknown | N/A | |
| Lipschitz Bounds and Provably Robust Training by Laplacian Smoothing | Unknown | N/A | |
| Improving Local Identifiability in Probabilistic Box Embeddings | Unknown | N/A | |
| Multi-Robot Collision Avoidance under Uncertainty with Probabilistic Safety Barrier Certificates | Unknown | N/A | |
| Model Class Reliance for Random Forests | Unknown | N/A | |
| Distribution-free binary classification: prediction sets, confidence intervals and calibration | Unknown | N/A | |
| Agnostic Learning with Multiple Objectives | Unknown | N/A | |
| Improving Natural Language Processing Tasks with Human Gaze-Guided Neural Attention | Unknown | N/A | |
| Numerically Solving Parametric Families of High-Dimensional Kolmogorov Partial Differential Equations via Deep Learning | Unknown | N/A | |
| Pre-training via Paraphrasing | Unknown | N/A | |
| Improving Inference for Neural Image Compression | Unknown | N/A | |
| Learning abstract structure for drawing by efficient motor program induction | Unknown | N/A | |
| Robust Sub-Gaussian Principal Component Analysis and Width-Independent Schatten Packing | Unknown | N/A | |
| Upper Confidence Primal-Dual Reinforcement Learning for CMDP with Adversarial Loss | Unknown | N/A | |
| Certified Defense to Image Transformations via Randomized Smoothing | Unknown | N/A | |
| Partial Optimal Transport with applications on Positive-Unlabeled Learning | Unknown | N/A | |
| How Robust are the Estimated Effects of Nonpharmaceutical Interventions against COVID-19? | Unknown | N/A | |
| Interpretable Sequence Learning for Covid-19 Forecasting | Unknown | N/A | |
| On Function Approximation in Reinforcement Learning: Optimism in the Face of Large State Spaces | Unknown | N/A | |
| High-Dimensional Contextual Policy Search with Unknown Context Rewards using Bayesian Optimization | Unknown | N/A | |
| On Power Laws in Deep Ensembles | Unknown | N/A | |
| Conditioning and Processing: Techniques to Improve Information-Theoretic Generalization Bounds | Unknown | N/A | |
| The Potts-Ising model for discrete multivariate data | Unknown | N/A | |
| Gibbs Sampling with People | Unknown | N/A | |
| PRANK: motion Prediction based on RANKing | Unknown | N/A | |
| Emergent Reciprocity and Team Formation from Randomized Uncertain Social Preferences | Unknown | N/A | |
| On the Error Resistance of Hinge-Loss Minimization | Unknown | N/A | |
| Efficient Planning in Large MDPs with Weak Linear Function Approximation | Unknown | N/A | |
| Coresets for Near-Convex Functions | Unknown | N/A | |
| The Primal-Dual method for Learning Augmented Algorithms | Unknown | N/A | |
| Provably Efficient Reward-Agnostic Navigation with Linear Value Iteration | Unknown | N/A | |
| Network Diffusions via Neural Mean-Field Dynamics | Unknown | N/A | |
| Bayes Consistency vs. H-Consistency: The Interplay between Surrogate Loss Functions and the Scoring Function Class | Unknown | N/A | |
| Incorporating Interpretable Output Constraints in Bayesian Neural Networks | Unknown | N/A | |
| Language-Conditioned Imitation Learning for Robot Manipulation Tasks | Unknown | N/A | |
| Estimating Rank-One Spikes from Heavy-Tailed Noise via Self-Avoiding Walks | Unknown | N/A | |
| Belief Propagation Neural Networks | Unknown | N/A | |
| Pushing the Limits of Narrow Precision Inferencing at Cloud Scale with Microsoft Floating Point | Unknown | N/A | |
| Gradient Boosted Normalizing Flows | Unknown | N/A | |
| Testing Determinantal Point Processes | Unknown | N/A | |
| Rethinking pooling in graph neural networks | Unknown | N/A | |
| Efficient estimation of neural tuning during naturalistic behavior | Unknown | N/A | |
| Sparse Weight Activation Training | Unknown | N/A | |
| Adapting to Misspecification in Contextual Bandits | Unknown | N/A | |
| Conformal Symplectic and Relativistic Optimization | Unknown | N/A | |
| Transferable Graph Optimizers for ML Compilers | Unknown | N/A | |
| Throughput-Optimal Topology Design for Cross-Silo Federated Learning | Unknown | N/A | |
| General Transportability of Soft Interventions: Completeness Results | Unknown | N/A | |
| CoSE: Compositional Stroke Embeddings | Unknown | N/A | |
| Factor Graph Grammars | Unknown | N/A | |
| Beyond Perturbations: Learning Guarantees with Arbitrary Adversarial Test Examples | Unknown | N/A | |
| Online Neural Connectivity Estimation with Noisy Group Testing | Unknown | N/A | |
| Autoencoders that don't overfit towards the Identity | Unknown | N/A | |
| An Asymptotically Optimal Primal-Dual Incremental Algorithm for Contextual Linear Bandits | Unknown | N/A | |
| Understanding Global Feature Contributions With Additive Importance Measures | Unknown | N/A | |
| Non-reversible Gaussian processes for identifying latent dynamical structure in neural data | Unknown | N/A | |
| Enabling certification of verification-agnostic networks via memory-efficient semidefinite programming | Unknown | N/A | |
| Adversarial Attacks on Deep Graph Matching | Unknown | N/A | |
| Contrastive learning of global and local features for medical image segmentation with limited annotations | Unknown | N/A | |
| Provably adaptive reinforcement learning in metric spaces | Unknown | N/A | |
| Towards Safe Policy Improvement for Non-Stationary MDPs | Unknown | N/A | |
| Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model | Unknown | N/A | |
| Generative Neurosymbolic Machines | Unknown | N/A | |
| Non-Euclidean Universal Approximation | Unknown | N/A | |
| Off-Policy Interval Estimation with Lipschitz Value Iteration | Unknown | N/A | |
| Optimal Prediction of the Number of Unseen Species with Multiplicity | Unknown | N/A | |
| A Single Recipe for Online Submodular Maximization with Adversarial or Stochastic Constraints | Unknown | N/A | |
| Adaptation Properties Allow Identification of Optimized Neural Codes | Unknown | N/A | |
| Dense Correspondences between Human Bodies via Learning Transformation Synchronization on Graphs | Unknown | N/A | |
| Skeleton-bridged Point Completion: From Global Inference to Local Adjustment | Unknown | N/A | |
| CryptoNAS: Private Inference on a ReLU Budget | Unknown | N/A | |
| Experimental design for MRI by greedy policy search | Unknown | N/A | |
| CogMol: Target-Specific and Selective Drug Design for COVID-19 Using Deep Generative Models | Unknown | N/A | |
| Training Generative Adversarial Networks by Solving Ordinary Differential Equations | Unknown | N/A | |
| Policy Improvement via Imitation of Multiple Oracles | Unknown | N/A | |
| Outlier Robust Mean Estimation with Subgaussian Rates via Stability | Unknown | N/A | |
| An Analysis of SVD for Deep Rotation Estimation | Unknown | N/A | |
| Learning Physical Constraints with Neural Projections | Unknown | N/A | |
| Characterizing emergent representations in a space of candidate learning rules for deep networks | Unknown | N/A | |
| Information Theoretic Regret Bounds for Online Nonlinear Control | Unknown | N/A | |
| Learning sparse codes from compressed representations with biologically plausible local wiring constraints | Unknown | N/A | |
| Reparameterizing Mirror Descent as Gradient Descent | Unknown | N/A | |
| Few-shot Visual Reasoning with Meta-Analogical Contrastive Learning | Unknown | N/A | |
| Learning efficient task-dependent representations with synaptic plasticity | Unknown | N/A | |
| Primal-Dual Mesh Convolutional Neural Networks | Unknown | N/A | |
| Unreasonable Effectiveness of Greedy Algorithms in Multi-Armed Bandit with Many Arms | Unknown | N/A | |
| Stein Self-Repulsive Dynamics: Benefits From Past Samples | Unknown | N/A | |
| Optimal Approximation - Smoothness Tradeoffs for Soft-Max Functions | Unknown | N/A | |
| Natural Policy Gradient Primal-Dual Method for Constrained Markov Decision Processes | Unknown | N/A | |
| Learning Search Space Partition for Black-box Optimization using Monte Carlo Tree Search | Unknown | N/A | |
| Understanding Double Descent Requires A Fine-Grained Bias-Variance Decomposition | Unknown | N/A | |
| DisARM: An Antithetic Gradient Estimator for Binary Latent Variables | Unknown | N/A | |
| All-or-nothing statistical and computational phase transitions in sparse spiked matrix estimation | Unknown | N/A | |
| Fourier Spectrum Discrepancies in Deep Network Generated Images | Unknown | N/A | |
| Adaptive Shrinkage Estimation for Streaming Graphs | Unknown | N/A | |
| Provably Good Batch Reinforcement Learning Without Great Exploration | Unknown | N/A | |
| A/B Testing in Dense Large-Scale Networks: Design and Inference | Unknown | N/A | |
| Stateful Posted Pricing with Vanishing Regret via Dynamic Deterministic Markov Decision Processes | Unknown | N/A | |
| Replica-Exchange Nosé-Hoover Dynamics for Bayesian Learning on Large Datasets | Unknown | N/A | |
| Learning compositional functions via multiplicative weight updates | Unknown | N/A | |
| Optimal Best-arm Identification in Linear Bandits | Unknown | N/A | |
| Towards Understanding Hierarchical Learning: Benefits of Neural Representations | Unknown | N/A | |
| A Unified Switching System Perspective and Convergence Analysis of Q-Learning Algorithms | Unknown | N/A | |
| Adaptive Discretization for Model-Based Reinforcement Learning | Unknown | N/A | |
| De-Anonymizing Text by Fingerprinting Language Generation | Unknown | N/A | |
| Low Distortion Block-Resampling with Spatially Stochastic Networks | Unknown | N/A | |
| Greedy inference with structure-exploiting lazy maps | Unknown | N/A | |
| Faster Differentially Private Samplers via Rényi Divergence Analysis of Discretized Langevin MCMC | Unknown | N/A | |
| Direct Policy Gradients: Direct Optimization of Policies in Discrete Action Spaces | Unknown | N/A | |
| Learning Continuous System Dynamics from Irregularly-Sampled Partial Observations | Unknown | N/A | |
| Semantic Visual Navigation by Watching YouTube Videos | Unknown | N/A | |
| End-to-End Learning and Intervention in Games | Unknown | N/A | |
| Demystifying Contrastive Self-Supervised Learning: Invariances, Augmentations and Dataset Biases | Unknown | N/A | |
| Compositional Explanations of Neurons | Unknown | N/A | |
| Big Self-Supervised Models are Strong Semi-Supervised Learners | Unknown | N/A | |
| Feature Shift Detection: Localizing Which Features Have Shifted via Conditional Distribution Tests | Unknown | N/A | |
| Fast, Accurate, and Simple Models for Tabular Data via Augmented Distillation | Unknown | N/A | |
| Personalized Federated Learning with Theoretical Guarantees: A Model-Agnostic Meta-Learning Approach | Unknown | N/A | |
| Factorized Neural Processes for Neural Processes: K-Shot Prediction of Neural Responses | Unknown | N/A | |
| The Wasserstein Proximal Gradient Algorithm | Unknown | N/A | |
| Axioms for Learning from Pairwise Comparisons | Unknown | N/A | |
| Generative causal explanations of black-box classifiers | Unknown | N/A | |
| What is being transferred in transfer learning? | Unknown | N/A | |
| Recurrent Quantum Neural Networks | Unknown | N/A | |
| Latent Bandits Revisited | Unknown | N/A | |
| Bridging Imagination and Reality for Model-Based Deep Reinforcement Learning | Unknown | N/A | |
| Fourier-transform-based attribution priors improve the interpretability and stability of deep learning models for genomics | Unknown | N/A | |
| Robust Deep Reinforcement Learning against Adversarial Perturbations on State Observations | Unknown | N/A | |
| Provable Online CP/PARAFAC Decomposition of a Structured Tensor via Dictionary Learning | Unknown | N/A | |
| A convex optimization formulation for multivariate regression | Unknown | N/A | |
| Confidence sequences for sampling without replacement | Unknown | N/A | |
| Adaptive Learning of Rank-One Models for Efficient Pairwise Sequence Alignment | Unknown | N/A | |
| User-Dependent Neural Sequence Models for Continuous-Time Event Data | Unknown | N/A | |
| Big Bird: Transformers for Longer Sequences | Unknown | N/A | |
| PLLay: Efficient Topological Layer based on Persistent Landscapes | Unknown | N/A | |
| wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations | Unknown | N/A | |
| A Single-Loop Smoothed Gradient Descent-Ascent Algorithm for Nonconvex-Concave Min-Max Problems | Unknown | N/A | |
| Mitigating Manipulation in Peer Review via Randomized Reviewer Assignments | Unknown | N/A | |
| JAX MD: A Framework for Differentiable Physics | Unknown | N/A | |
| Label-Aware Neural Tangent Kernel: Toward Better Generalization and Local Elasticity | Unknown | N/A | |
| NeuMiss networks: differentiable programming for supervised learning with missing values | Unknown | N/A | |
| Task-Agnostic Amortized Inference of Gaussian Process Hyperparameters | Unknown | N/A | |
| Self-supervised learning through the eyes of a child | Unknown | N/A | |
| Minimax Lower Bounds for Transfer Learning with Linear and One-hidden Layer Neural Networks | Unknown | N/A | |
| Point process models for sequence detection in high-dimensional neural spike trains | Unknown | N/A | |
| Learning to Approximate a Bregman Divergence | Unknown | N/A | |
| Top-k Training of GANs: Improving GAN Performance by Throwing Away Bad Samples | Unknown | N/A | |
| Probably Approximately Correct Constrained Learning | Unknown | N/A | |
| Beyond Homophily in Graph Neural Networks: Current Limitations and Effective Designs | Unknown | N/A | |
| SEVIR: A Storm Event Imagery Dataset for Deep Learning Applications in Radar and Satellite Meteorology | Unknown | N/A | |
| Deep Subspace Clustering with Data Augmentation | Unknown | N/A | |
| Guided Adversarial Attack for Evaluating and Enhancing Adversarial Defenses | Unknown | N/A | |
| Meta-Learning with Adaptive Hyperparameters | Unknown | N/A | |
| Estimating weighted areas under the ROC curve | Unknown | N/A | |
| Asymptotic Guarantees for Generative Modeling Based on the Smooth Wasserstein Distance | Unknown | N/A | |
| Identifying Causal-Effect Inference Failure with Uncertainty-Aware Models | Unknown | N/A | |
| Certifiably Adversarially Robust Detection of Out-of-Distribution Data | Unknown | N/A | |
| Interpolation Technique to Speed Up Gradients Propagation in Neural ODEs | Unknown | N/A | |
| TinyTL: Reduce Memory, Not Parameters for Efficient On-Device Learning | Unknown | N/A | |
| Projected Stein Variational Gradient Descent | Unknown | N/A | |
| Learning to summarize with human feedback | Unknown | N/A | |
| PEP: Parameter Ensembling by Perturbation | Unknown | N/A | |
| The Surprising Simplicity of the Early-Time Learning Dynamics of Neural Networks | Unknown | N/A | |
| Understanding spiking networks through convex optimization | Unknown | N/A | |
| A Game-Theoretic Analysis of the Empirical Revenue Maximization Algorithm with Endogenous Sampling | Unknown | N/A | |
| Neural FFTs for Universal Texture Image Synthesis | Unknown | N/A | |
| Limits on Testing Structural Changes in Ising Models | Unknown | N/A | |
| A Simple Language Model for Task-Oriented Dialogue | Unknown | N/A | |
| Bayesian Bits: Unifying Quantization and Pruning | Unknown | N/A | |
| Acceleration with a Ball Optimization Oracle | Unknown | N/A | |
| Almost Surely Stable Deep Dynamics | Unknown | N/A | |
| A causal view of compositional zero-shot recognition | Unknown | N/A | |
| Sample complexity and effective dimension for regression on manifolds | Unknown | N/A | |
| An Empirical Process Approach to the Union Bound: Practical Algorithms for Combinatorial and Linear Bandits | Unknown | N/A | |
| Probabilistic Inference with Algebraic Constraints: Theoretical Limits and Practical Approximations | Unknown | N/A | |
| Instance Selection for GANs | Unknown | N/A | |
| Temporal Variability in Implicit Online Learning | Unknown | N/A | |
| Fast geometric learning with symbolic matrices | Unknown | N/A | |
| Neural Topographic Factor Analysis for fMRI Data | Unknown | N/A | |
| Inferring learning rules from animal decision-making | Unknown | N/A | |
| Simultaneous Preference and Metric Learning from Paired Comparisons | Unknown | N/A | |
| Towards practical differentially private causal graph discovery | Unknown | N/A | |
| Continuous Object Representation Networks: Novel View Synthesis without Target View Supervision | Unknown | N/A | |
| Mitigating Forgetting in Online Continual Learning via Instance-Aware Parameterization | Unknown | N/A | |
| The Mean-Squared Error of Double Q-Learning | Unknown | N/A | |
| Convolutional Tensor-Train LSTM for Spatio-Temporal Learning | Unknown | N/A | |
| Improving GAN Training with Probability Ratio Clipping and Sample Reweighting | Unknown | N/A | |
| Top-KAST: Top-K Always Sparse Training | Unknown | N/A | |
| Disentangling Human Error from Ground Truth in Segmentation of Medical Images | Unknown | N/A | |
| Model Selection in Contextual Stochastic Bandit Problems | Unknown | N/A | |
| Modular Meta-Learning with Shrinkage | Unknown | N/A | |
| Calibrating Deep Neural Networks using Focal Loss | Unknown | N/A | |
| Discovering conflicting groups in signed networks | Unknown | N/A | |
| Delta-STN: Efficient Bilevel Optimization for Neural Networks using Structured Response Jacobians | Unknown | N/A | |
| Distribution Matching for Crowd Counting | Unknown | N/A | |
| ScaleCom: Scalable Sparsified Gradient Compression for Communication-Efficient Distributed Training | Unknown | N/A | |
| Fourier Features Let Networks Learn High Frequency Functions in Low Dimensional Domains | Unknown | N/A | |
| Consistent Plug-in Classifiers for Complex Objectives and Constraints | Unknown | N/A | |
| Collapsing Bandits and Their Application to Public Health Intervention | Unknown | N/A | |
| An Optimal Elimination Algorithm for Learning a Best Arm | Unknown | N/A | |
| Adaptive Learned Bloom Filter (Ada-BF): Efficient Utilization of the Classifier with Application to Real-Time Information Filtering on the Web | Unknown | N/A | |
| Information-theoretic Task Selection for Meta-Reinforcement Learning | Unknown | N/A | |
| Learning Multi-Agent Communication through Structured Attentive Reasoning | Unknown | N/A | |
| Causal Shapley Values: Exploiting Causal Knowledge to Explain Individual Predictions of Complex Models | Unknown | N/A | |
| Steady State Analysis of Episodic Reinforcement Learning | Unknown | N/A | |
| Active Structure Learning of Causal DAGs via Directed Clique Trees | Unknown | N/A | |
| Normalizing Kalman Filters for Multivariate Time Series Analysis | Unknown | N/A | |
| Early-Learning Regularization Prevents Memorization of Noisy Labels | Unknown | N/A | |
| Learning Feature Sparse Principal Subspace | Unknown | N/A | |
| Over-parameterized Adversarial Training: An Analysis Overcoming the Curse of Dimensionality | Unknown | N/A | |
| POMDPs in Continuous Time and Discrete Spaces | Unknown | N/A | |
| Goal-directed Generation of Discrete Structures with Conditional Generative Models | Unknown | N/A | |
| Multimodal Generative Learning Utilizing Jensen-Shannon-Divergence | Unknown | N/A | |
| Multi-agent Trajectory Prediction with Fuzzy Query Attention | Unknown | N/A | |
| Linear Dynamical Systems as a Core Computational Primitive | Unknown | N/A | |
| GramGAN: Deep 3D Texture Synthesis From 2D Exemplars | Unknown | N/A | |
| Finer Metagenomic Reconstruction via Biodiversity Optimization | Unknown | N/A | |
| Measuring Systematic Generalization in Neural Proof Generation with Transformers | Unknown | N/A | |
| Directional convergence and alignment in deep learning | Unknown | N/A | |
| COOT: Cooperative Hierarchical Transformer for Video-Text Representation Learning | Unknown | N/A | |
| Efficient semidefinite-programming-based inference for binary and multi-class MRFs | Unknown | N/A | |
| On the linearity of large non-linear models: when and why the tangent kernel is constant | Unknown | N/A | |
| Quantized Variational Inference | Unknown | N/A | |
| Optimal and Practical Algorithms for Smooth and Strongly Convex Decentralized Optimization | Unknown | N/A | |
| The Value Equivalence Principle for Model-Based Reinforcement Learning | Unknown | N/A | |
| Organizing recurrent network dynamics by task-computation to enable continual learning | Unknown | N/A | |
| The Convex Relaxation Barrier, Revisited: Tightened Single-Neuron Relaxations for Neural Network Verification | Unknown | N/A | |
| Deep Graph Pose: a semi-supervised deep graphical model for improved animal pose tracking | Unknown | N/A | |
| Escaping the Gravitational Pull of Softmax | Unknown | N/A | |
| Forethought and Hindsight in Credit Assignment | Unknown | N/A | |
| When Counterpoint Meets Chinese Folk Melodies | Unknown | N/A | |
| The Statistical Complexity of Early-Stopped Mirror Descent | Unknown | N/A | |
| A Local Temporal Difference Code for Distributional Reinforcement Learning | Unknown | N/A | |
| Rescuing neural spike train models from bad MLE | Unknown | N/A | |
| Learning to Play No-Press Diplomacy with Best Response Policy Iteration | Unknown | N/A | |
| MDP Homomorphic Networks: Group Symmetries in Reinforcement Learning | Unknown | N/A | |
| The Complexity of Adversarially Robust Proper Learning of Halfspaces with Agnostic Noise | Unknown | N/A | |
| Penalized Langevin dynamics with vanishing penalty for smooth and log-concave targets | Unknown | N/A | |
| Hypersolvers: Toward Fast Continuous-Depth Models | Unknown | N/A | |
| Comparator-Adaptive Convex Bandits | Unknown | N/A | |
| Path Sample-Analytic Gradient Estimators for Stochastic Binary Networks | Unknown | N/A | |
| STLnet: Signal Temporal Logic Enforced Multivariate Recurrent Neural Networks | Unknown | N/A | |
| Variational Amodal Object Completion | Unknown | N/A | |
| Novelty Search in Representational Space for Sample Efficient Exploration | Unknown | N/A | |
| RL Unplugged: A Suite of Benchmarks for Offline Reinforcement Learning | Unknown | N/A | |
| Deeply Learned Spectral Total Variation Decomposition | Unknown | N/A | |
| Manifold structure in graph embeddings | Unknown | N/A | |
| Exploiting the Surrogate Gap in Online Multiclass Classification | Unknown | N/A | |
| ImpatientCapsAndRuns: Approximately Optimal Algorithm Configuration from an Infinite Pool | Unknown | N/A | |
| Secretary and Online Matching Problems with Machine Learned Advice | Unknown | N/A | |
| Estimating Fluctuations in Neural Representations of Uncertain Environments | Unknown | N/A | |
| Sliding Window Algorithms for k-Clustering Problems | Unknown | N/A | |
| Your Classifier can Secretly Suffice Multi-Source Domain Adaptation | Unknown | N/A | |
| Coresets for Robust Training of Deep Neural Networks against Noisy Labels | Unknown | N/A | |
| Unsupervised Learning of Object Landmarks via Self-Training Correspondence | Unknown | N/A | |
| Elastic-InfoGAN: Unsupervised Disentangled Representation Learning in Class-Imbalanced Data | Unknown | N/A | |
| Beyond Lazy Training for Over-parameterized Tensor Decomposition | Unknown | N/A | |
| Black-Box Optimization with Local Generative Surrogates | Unknown | N/A | |
| Entropic Causal Inference: Identifiability and Finite Sample Results | Unknown | N/A | |
| UDH: Universal Deep Hiding for Steganography, Watermarking, and Light Field Messaging | Unknown | N/A | |
| Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games | Unknown | N/A | |
| Counterfactual Data Augmentation using Locally Factored Dynamics | Unknown | N/A | |
| Convolutional Generation of Textured 3D Meshes | Unknown | N/A | |
| Batch Normalization Biases Residual Blocks Towards the Identity Function in Deep Networks | Unknown | N/A | |
| Joint Policy Search for Multi-agent Collaboration with Imperfect Information | Unknown | N/A | |
| Invertible Gaussian Reparameterization: Revisiting the Gumbel-Softmax | Unknown | N/A | |
| Primal Dual Interpretation of the Proximal Stochastic Gradient Langevin Algorithm | Unknown | N/A | |
| Linear Time Sinkhorn Divergences using Positive Features | Unknown | N/A | |
| Applications of Common Entropy for Causal Inference | Unknown | N/A | |
| SurVAE Flows: Surjections to Bridge the Gap between VAEs and Flows | Unknown | N/A | |
| A Stochastic Path Integral Differential EstimatoR Expectation Maximization Algorithm | Unknown | N/A | |
| Network-to-Network Translation with Conditional Invertible Neural Networks | Unknown | N/A | |
| ColdGANs: Taming Language GANs with Cautious Sampling Strategies | Unknown | N/A | |
| Minimax Estimation of Conditional Moment Models | Unknown | N/A | |
| Second Order PAC-Bayesian Bounds for the Weighted Majority Vote | Unknown | N/A | |
| Kernel Methods Through the Roof: Handling Billions of Points Efficiently | Unknown | N/A | |
| Learning of Discrete Graphical Models with Neural Networks | Unknown | N/A | |
| Spin-Weighted Spherical CNNs | Unknown | N/A | |
| SnapBoost: A Heterogeneous Boosting Machine | Unknown | N/A | |
| Robust Multi-Object Matching via Iterative Reweighting of the Graph Connection Laplacian | Unknown | N/A | |
| Ensemble Distillation for Robust Model Fusion in Federated Learning | Unknown | N/A | |
| PAC-Bayes Learning Bounds for Sample-Dependent Priors | Unknown | N/A | |
| Fair regression via plug-in estimator and recalibration with statistical guarantees | Unknown | N/A | |
| Sampling from a k-DPP without looking at all items | Unknown | N/A | |
| Geometric Dataset Distances via Optimal Transport | Unknown | N/A | |
| Fair regression with Wasserstein barycenters | Unknown | N/A | |
| Generalized Independent Noise Condition for Estimating Latent Variable Causal Graphs | Unknown | N/A | |
| A polynomial-time algorithm for learning nonparametric causal graphs | Unknown | N/A | |
| Convex optimization based on global lower second-order models | Unknown | N/A | |
| Incorporating Pragmatic Reasoning Communication into Emergent Language | Unknown | N/A | |
| Deep Evidential Regression | Unknown | N/A | |
| Learning discrete distributions with infinite support | Unknown | N/A | |
| Curvature Regularization to Prevent Distortion in Graph Embedding | Unknown | N/A | |
| What Do Neural Networks Learn When Trained With Random Labels? | Unknown | N/A | |
| A novel variational form of the Schatten-$p$ quasi-norm | Unknown | N/A | |
| Robust Disentanglement of a Few Factors at a Time using rPU-VAE | Unknown | N/A | |
| Improved Analysis of Clipping Algorithms for Non-convex Optimization | Unknown | N/A | |
| Fictitious Play for Mean Field Games: Continuous Time Analysis and Applications | Unknown | N/A | |
| A Feasible Level Proximal Point Method for Nonconvex Sparse Constrained Optimization | Unknown | N/A | |
| On Infinite-Width Hypernetworks | Unknown | N/A | |
| Discovering Symbolic Models from Deep Learning with Inductive Biases | Unknown | N/A | |
| Online Meta-Critic Learning for Off-Policy Actor-Critic Methods | Unknown | N/A | |
| Learning to solve TV regularised problems with unrolled algorithms | Unknown | N/A | |
| Reasoning about Uncertainties in Discrete-Time Dynamical Systems using Polynomial Forms | Unknown | N/A | |
| Forget About the LiDAR: Self-Supervised Depth Estimators with MED Probability Volumes | Unknown | N/A | |
| Hardness of Learning Neural Networks with Natural Weights | Unknown | N/A | |
| Identifying signal and noise structure in neural population activity with Gaussian process factor models | Unknown | N/A | |
| A Bandit Learning Algorithm and Applications to Auction Design | Unknown | N/A | |
| Joints in Random Forests | Unknown | N/A | |
| Spectral Temporal Graph Neural Network for Multivariate Time-series Forecasting | Unknown | N/A | |
| Deep Metric Learning with Spherical Embedding | Unknown | N/A | |
| Triple descent and the two kinds of overfitting: where & why do they appear? | Unknown | N/A | |
| Neuronal Gaussian Process Regression | Unknown | N/A | |
| Continual Deep Learning by Functional Regularisation of Memorable Past | Unknown | N/A | |
| Weak Form Generalized Hamiltonian Learning | Unknown | N/A | |
| DiffGCN: Graph Convolutional Networks via Differential Operators and Algebraic Multigrid Pooling | Unknown | N/A | |
| HRN: A Holistic Approach to One Class Learning | Unknown | N/A | |
| Train-by-Reconnect: Decoupling Locations of Weights from Their Values | Unknown | N/A | |
| All your loss are belong to Bayes | Unknown | N/A | |
| Optimal Variance Control of the Score-Function Gradient Estimator for Importance-Weighted Bounds | Unknown | N/A | |
| Towards Scale-Invariant Graph-related Problem Solving by Iterative Homogeneous GNNs | Unknown | N/A | |
| AttendLight: Universal Attention-Based Reinforcement Learning Model for Traffic Signal Control | Unknown | N/A | |
| Graph Policy Network for Transferable Active Learning on Graphs | Unknown | N/A | |
| Private Learning of Halfspaces: Simplifying the Construction and Reducing the Sample Complexity | Unknown | N/A | |
| Correspondence learning via linearly-invariant embedding | Unknown | N/A | |
| Optimal visual search based on a model of target detectability in natural images | Unknown | N/A | |
| Deep Rao-Blackwellised Particle Filters for Time Series Forecasting | Unknown | N/A | |
| Expert-Supervised Reinforcement Learning for Offline Policy Learning and Evaluation | Unknown | N/A | |
| Explaining Naive Bayes and Other Linear Classifiers with Polynomial Time and Delay | Unknown | N/A | |
| Coresets for Regressions with Panel Data | Unknown | N/A | |
| Language as a Cognitive Tool to Imagine Goals in Curiosity Driven Exploration | Unknown | N/A | |
| An efficient nonconvex reformulation of stagewise convex optimization problems | Unknown | N/A | |
| Conservative Q-Learning for Offline Reinforcement Learning | Unknown | N/A | |
| Probabilistic Orientation Estimation with Matrix Fisher Distributions | Unknown | N/A | |
| Learning from Failure: De-biasing Classifier from Biased Classifier | Unknown | N/A | |
| Safe Reinforcement Learning via Curriculum Induction | Unknown | N/A | |
| Telescoping Density-Ratio Estimation | Unknown | N/A | |
| GS-WGAN: A Gradient-Sanitized Approach for Learning Differentially Private Generators | Unknown | N/A | |
| Stochastic Optimization with Laggard Data Pipelines | Unknown | N/A | |
| Learning to Detect Objects with a 1 Megapixel Event Camera | Unknown | N/A | |
| Online Learning in Contextual Bandits using Gated Linear Networks | Unknown | N/A | |
| Online Sinkhorn: Optimal Transport distances from sample streams | Unknown | N/A | |
| Interstellar: Searching Recurrent Architecture for Knowledge Graph Embedding | Unknown | N/A | |
| Self-Adaptively Learning to Demoiré from Focused and Defocused Image Pairs | Unknown | N/A | |
| Analysis and Design of Thompson Sampling for Stochastic Partial Monitoring | Unknown | N/A | |
| Non-parametric Models for Non-negative Functions | Unknown | N/A | |
| On Convergence of Nearest Neighbor Classifiers over Feature Transformations | Unknown | N/A | |
| Trading Personalization for Accuracy: Data Debugging in Collaborative Filtering | Unknown | N/A | |
| Meta-learning from Tasks with Heterogeneous Attribute Spaces | Unknown | N/A | |
| Fast and Accurate $k$-means++ via Rejection Sampling | Unknown | N/A | |
| Regularizing Towards Permutation Invariance In Recurrent Models | Unknown | N/A | |
| Nimble: Lightweight and Parallel GPU Task Scheduling for Deep Learning | Unknown | N/A | |
| Crush Optimism with Pessimism: Structured Bandits Beyond Asymptotic Optimality | Unknown | N/A | |
| Risk-Sensitive Reinforcement Learning: Near-Optimal Risk-Sample Tradeoff in Regret | Unknown | N/A | |
| Bayesian Robust Optimization for Imitation Learning | Unknown | N/A | |
| Auto Learning Attention | Unknown | N/A | |
| Benchmarking Deep Learning Interpretability in Time Series Predictions | Unknown | N/A | |
| Timeseries Anomaly Detection using Temporal Hierarchical One-Class Network | Unknown | N/A | |
| Promoting Stochasticity for Expressive Policies via a Simple and Efficient Regularization Method | Unknown | N/A | |
| Contextual Games: Multi-Agent Learning with Side Information | Unknown | N/A | |
| Intra Order-preserving Functions for Calibration of Multi-Class Neural Networks | Unknown | N/A | |
| Semi-Supervised Neural Architecture Search | Unknown | N/A | |
| Continual Learning in Low-rank Orthogonal Subspaces | Unknown | N/A | |
| Learning to Play Sequential Games versus Unknown Opponents | Unknown | N/A | |
| A Universal Approximation Theorem of Deep Neural Networks for Expressing Probability Distributions | Unknown | N/A | |
| Collegial Ensembles | Unknown | N/A | |
| Neural Bridge Sampling for Evaluating Safety-Critical Autonomous Systems | Unknown | N/A | |
| Recursive Inference for Variational Autoencoders | Unknown | N/A | |
| MinMax Methods for Optimal Transport and Beyond: Regularization, Approximation and Numerics | Unknown | N/A | |
| MMA Regularization: Decorrelating Weights of Neural Networks by Maximizing the Minimal Angles | Unknown | N/A | |
| Curriculum Learning by Dynamic Instance Hardness | Unknown | N/A | |
| Inverting Gradients - How easy is it to break privacy in federated learning? | Unknown | N/A | |
| Efficient Clustering for Stretched Mixtures: Landscape and Optimality | Unknown | N/A | |
| Learning Restricted Boltzmann Machines with Sparse Latent Variables | Unknown | N/A | |
| Erdos Goes Neural: an Unsupervised Learning Framework for Combinatorial Optimization on Graphs | Unknown | N/A | |
| Understanding and Improving Fast Adversarial Training | Unknown | N/A | |
| PyGlove: Symbolic Programming for Automated Machine Learning | Unknown | N/A | |
| How do fair decisions fare in long-term qualification? | Unknown | N/A | |
| DynaBERT: Dynamic BERT with Adaptive Width and Depth | Unknown | N/A | |
| On the Expressiveness of Approximate Inference in Bayesian Neural Networks | Unknown | N/A | |
| Reinforcement Learning for Control with Multiple Frequencies | Unknown | N/A | |
| Distribution Aligning Refinery of Pseudo-label for Imbalanced Semi-supervised Learning | Unknown | N/A | |
| A Scalable Approach for Privacy-Preserving Collaborative Machine Learning | Unknown | N/A | |
| CrossTransformers: spatially-aware few-shot transfer | Unknown | N/A | |
| Multimodal Graph Networks for Compositional Generalization in Visual Question Answering | Unknown | N/A | |
| Margins are Insufficient for Explaining Gradient Boosting | Unknown | N/A | |
| Interferobot: aligning an optical interferometer by a reinforcement learning agent | Unknown | N/A | |
| Bootstrap Your Own Latent - A New Approach to Self-Supervised Learning | Unknown | N/A | |
| Meta-Learning Stationary Stochastic Process Prediction with Convolutional Neural Processes | Unknown | N/A | |
| Robust Reinforcement Learning via Adversarial training with Langevin Dynamics | Unknown | N/A | |
| Object-Centric Learning with Slot Attention | Unknown | N/A | |
| 3D Multi-bodies: Fitting Sets of Plausible 3D Human Models to Ambiguous Image Data | Unknown | N/A | |
| An Improved Analysis of Stochastic Gradient Descent with Momentum | Unknown | N/A | |
| Dark Experience for General Continual Learning: a Strong, Simple Baseline | Unknown | N/A | |
| Bidirectional Convolutional Poisson Gamma Dynamical Systems | Unknown | N/A | |
| Language Through a Prism: A Spectral Approach for Multiscale Language Representations | Unknown | N/A | |
| Language Models are Few-Shot Learners | Unknown | N/A | |
| Counterfactual Vision-and-Language Navigation: Unravelling the Unseen | Unknown | N/A | |
| Instance Based Approximations to Profile Maximum Likelihood | Unknown | N/A | |
| CoMIR: Contrastive Multimodal Image Representation for Registration | Unknown | N/A | |
| Latent World Models For Intrinsically Motivated Exploration | Unknown | N/A | |
| Deep Reinforcement Learning with Stacked Hierarchical Attention for Text-based Games | Unknown | N/A | |
| Evaluating and Rewarding Teamwork Using Cooperative Game Abstractions | Unknown | N/A | |
| Deep Variational Instance Segmentation | Unknown | N/A | |
| Bandit Samplers for Training Graph Neural Networks | Unknown | N/A | |
| Cycle-Contrast for Self-Supervised Video Representation Learning | Unknown | N/A | |
| Reconciling Modern Deep Learning with Traditional Optimization Analyses: The Intrinsic Learning Rate | Unknown | N/A | |
| Planning in Markov Decision Processes with Gap-Dependent Sample Complexity | Unknown | N/A | |
| Hierarchical Poset Decoding for Compositional Generalization in Language | Unknown | N/A | |
| What Neural Networks Memorize and Why: Discovering the Long Tail via Influence Estimation | Unknown | N/A | |
| Field-wise Learning for Multi-field Categorical Data | Unknown | N/A | |
| What Did You Think Would Happen? Explaining Agent Behaviour through Intended Outcomes | Unknown | N/A | |
| (De)Randomized Smoothing for Certifiable Defense against Patch Attacks | Unknown | N/A | |
| ContraGAN: Contrastive Learning for Conditional Image Generation | Unknown | N/A | |
| On the Almost Sure Convergence of Stochastic Gradient Descent in Non-Convex Problems | Unknown | N/A | |
| Understanding the Role of Training Regimes in Continual Learning | Unknown | N/A | |
| Optimal Iterative Sketching Methods with the Subsampled Randomized Hadamard Transform | Unknown | N/A | |
| Residual Distillation: Towards Portable Deep Neural Networks without Shortcuts | Unknown | N/A | |
| Exploiting MMD and Sinkhorn Divergences for Fair and Transferable Representation Learning | Unknown | N/A | |
| Neural Anisotropy Directions | Unknown | N/A | |
| Neural Power Units | Unknown | N/A | |
| Gaussian Process Bandit Optimization of the Thermodynamic Variational Objective | Unknown | N/A | |
| Fairness without Demographics through Adversarially Reweighted Learning | Unknown | N/A | |
| Scalable Belief Propagation via Relaxed Scheduling | Unknown | N/A | |
| Computing Valid p-value for Optimal Changepoint by Selective Inference using Dynamic Programming | Unknown | N/A | |
| Fast Epigraphical Projection-based Incremental Algorithms for Wasserstein Distributionally Robust Support Vector Machine | Unknown | N/A | |
| PlanGAN: Model-based Planning With Sparse Rewards and Multiple Goals | Unknown | N/A | |
| Manifold GPLVMs for discovering non-Euclidean latent structure in neural data | Unknown | N/A | |
| Off-Policy Evaluation and Learning for External Validity under a Covariate Shift | Unknown | N/A | |
| Improving Generalization in Reinforcement Learning with Mixture Regularization | Unknown | N/A | |
| Global Convergence and Variance Reduction for a Class of Nonconvex-Nonconcave Minimax Problems | Unknown | N/A | |
| A shooting formulation of deep learning | Unknown | N/A | |
| Breaking the Communication-Privacy-Accuracy Trilemma | Unknown | N/A | |
| Federated Accelerated Stochastic Gradient Descent | Unknown | N/A | |
| Advances in Black-Box VI: Normalizing Flows, Importance Weighting, and Optimization | Unknown | N/A | |
| Learning Manifold Implicitly via Explicit Heat-Kernel Learning | Unknown | N/A | |
| Do Adversarially Robust ImageNet Models Transfer Better? | Unknown | N/A | |
| Neural Sparse Voxel Fields | Unknown | N/A | |
| A Convolutional Auto-Encoder for Haplotype Assembly and Viral Quasispecies Reconstruction | Unknown | N/A | |
| Sparse Spectrum Warped Input Measures for Nonstationary Kernel Learning | Unknown | N/A | |
| Hierarchical nucleation in deep neural networks | Unknown | N/A | |
| Model-Based Multi-Agent RL in Zero-Sum Markov Games with Near-Optimal Sample Complexity | Unknown | N/A | |
| Model Selection for Production System via Automated Online Experiments | Unknown | N/A | |
| Batch normalization provably avoids ranks collapse for randomly initialised deep networks | Unknown | N/A | |
| Generalization bound of globally optimal non-convex neural network training: Transportation map estimation by infinite dimensional Langevin dynamics | Unknown | N/A | |
| SURF: A Simple, Universal, Robust, Fast Distribution Learning Algorithm | Unknown | N/A | |
| Unifying Activation- and Timing-based Learning Rules for Spiking Neural Networks | Unknown | N/A | |
| Adversarial Counterfactual Learning and Evaluation for Recommender System | Unknown | N/A | |
| Long-Horizon Visual Planning with Goal-Conditioned Hierarchical Predictors | Unknown | N/A | |
| Learning with Operator-valued Kernels in Reproducing Kernel Krein Spaces | Unknown | N/A | |
| Differentiable Meta-Learning of Bandit Policies | Unknown | N/A | |
| Independent Policy Gradient Methods for Competitive Reinforcement Learning | Unknown | N/A | |
| Just Pick a Sign: Optimizing Deep Multitask Models with Gradient Sign Dropout | Unknown | N/A | |
| Co-exposure Maximization in Online Social Networks | Unknown | N/A | |
| Predictive Information Accelerates Learning in RL | Unknown | N/A | |
| Group-Fair Online Allocation in Continuous Time | Unknown | N/A | |
| Profile Entropy: A Fundamental Measure for the Learnability and Compressibility of Distributions | Unknown | N/A | |
| Tight last-iterate convergence rates for no-regret learning in multi-player games | Unknown | N/A | |
| Inductive Quantum Embedding | Unknown | N/A | |
| Chaos, Extremism and Optimism: Volume Analysis of Learning in Games | Unknown | N/A | |
| A Statistical Framework for Low-bitwidth Training of Deep Neural Networks | Unknown | N/A | |
| PC-PG: Policy Cover Directed Exploration for Provable Policy Gradient Learning | Unknown | N/A | |
| Learning Guidance Rewards with Trajectory-space Smoothing | Unknown | N/A | |
| Learning Representations from Audio-Visual Spatial Alignment | Unknown | N/A | |
| Efficient Exploration of Reward Functions in Inverse Reinforcement Learning via Bayesian Optimization | Unknown | N/A | |
| Matrix Completion with Hierarchical Graph Side Information | Unknown | N/A | |
| On Regret with Multiple Best Arms | Unknown | N/A | |
| Delay and Cooperation in Nonstochastic Linear Bandits | Unknown | N/A | |
| Self-supervised Auxiliary Learning with Meta-paths for Heterogeneous Graphs | Unknown | N/A | |
| A Limitation of the PAC-Bayes Framework | Unknown | N/A | |
| Implicit Distributional Reinforcement Learning | Unknown | N/A | |
| DeepSVG: A Hierarchical Generative Network for Vector Graphics Animation | Unknown | N/A | |
| Greedy Optimization Provably Wins the Lottery: Logarithmic Number of Winning Tickets is Enough | Unknown | N/A | |
| Attribution Preservation in Network Compression for Reliable Network Interpretation | Unknown | N/A | |
| Neural Complexity Measures | Unknown | N/A | |
| Leap-Of-Thought: Teaching Pre-Trained Models to Systematically Reason Over Implicit Knowledge | Unknown | N/A | |
| Nonconvex Sparse Graph Learning under Laplacian Constrained Graphical Model | Unknown | N/A | |
| Mutual exclusivity as a challenge for deep neural networks | Unknown | N/A | |
| MATE: Plugging in Model Awareness to Task Embedding for Meta Learning | Unknown | N/A | |
| Cross-lingual Retrieval for Iterative Self-Supervised Training | Unknown | N/A | |
| Instance-optimality in differential privacy via approximate inverse sensitivity mechanisms | Unknown | N/A | |
| Continuous Meta-Learning without Tasks | Unknown | N/A | |
| FleXOR: Trainable Fractional Quantization | Unknown | N/A | |
| Adversarially-learned Inference via an Ensemble of Discrete Undirected Graphical Models | Unknown | N/A | |
| O(n) Connections are Expressive Enough: Universal Approximability of Sparse Transformers | Unknown | N/A | |
| Adam with Bandit Sampling for Deep Learning | Unknown | N/A | |
| Using noise to probe recurrent neural network structure and prune synapses | Unknown | N/A | |
| Understanding Deep Architecture with Reasoning Layer | Unknown | N/A | |
| Worst-Case Analysis for Randomly Collected Data | Unknown | N/A | |
| Cooperative Heterogeneous Deep Reinforcement Learning | Unknown | N/A | |
| Random Reshuffling: Simple Analysis with Vast Improvements | Unknown | N/A | |
| A Novel Approach for Constrained Optimization in Graphical Models | Unknown | N/A | |
| Winning the Lottery with Continuous Sparsification | Unknown | N/A | |
| Counterfactual Contrastive Learning for Weakly-Supervised Vision-Language Grounding | Unknown | N/A | |
| Sample Efficient Reinforcement Learning via Low-Rank Matrix Estimation | Unknown | N/A | |
| Node Classification on Graphs with Few-Shot Novel Labels via Meta Transformed Network Embedding | Unknown | N/A | |
| Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning | Unknown | N/A | |
| Proximal Mapping for Deep Regularization | Unknown | N/A | |
| ShiftAddNet: A Hardware-Inspired Deep Network | Unknown | N/A | |
| A Catalyst Framework for Minimax Optimization | Unknown | N/A | |
| A Computational Separation between Private Learning and Online Learning | Unknown | N/A | |
| Explainable Voting | Unknown | N/A | |
| Neural encoding with visual attention | Unknown | N/A | |
| Linear-Sample Learning of Low-Rank Distributions | Unknown | N/A | |
| Certified Robustness of Graph Convolution Networks for Graph Classification under Topological Attacks | Unknown | N/A | |
| Constrained episodic reinforcement learning in concave-convex and knapsack settings | Unknown | N/A | |
| Attack of the Tails: Yes, You Really Can Backdoor Federated Learning | Unknown | N/A | |
| The Complete Lasso Tradeoff Diagram | Unknown | N/A | |
| Variational Inference for Graph Convolutional Networks in the Absence of Graph Data and Adversarial Settings | Unknown | N/A | |
| Continual Learning of a Mixed Sequence of Similar and Dissimilar Tasks | Unknown | N/A | |
| Geo-PIFu: Geometry and Pixel Aligned Implicit Functions for Single-view Human Reconstruction | Unknown | N/A | |
| Graph Contrastive Learning with Augmentations | Unknown | N/A | |
| Revisiting the Sample Complexity of Sparse Spectrum Approximation of Gaussian Processes | Unknown | N/A | |
| ARMA Nets: Expanding Receptive Field for Dense Prediction | Unknown | N/A | |
| HAWQ-V2: Hessian Aware trace-Weighted Quantization of Neural Networks | Unknown | N/A | |
| GCOMB: Learning Budget-constrained Combinatorial Algorithms over Billion-sized Graphs | Unknown | N/A | |
| Scalable Multi-Agent Reinforcement Learning for Networked Systems with Average Reward | Unknown | N/A | |
| RandAugment: Practical Automated Data Augmentation with a Reduced Search Space | Unknown | N/A | |
| Debiasing Distributed Second Order Optimization with Surrogate Sketching and Scaled Regularization | Unknown | N/A | |
| First Order Constrained Optimization in Policy Space | Unknown | N/A | |
| Sinkhorn Barycenter via Functional Gradient Descent | Unknown | N/A | |
| Efficient Nonmyopic Bayesian Optimization via One-Shot Multi-Step Trees | Unknown | N/A | |
| Statistical Optimal Transport posed as Learning Kernel Embedding | Unknown | N/A | |
| Matrix Inference and Estimation in Multi-Layer Models | Unknown | N/A | |
| Pruning neural networks without any data by iteratively conserving synaptic flow | Unknown | N/A | |
| Offline Imitation Learning with a Misspecified Simulator | Unknown | N/A | |
| Training Stronger Baselines for Learning to Optimize | Unknown | N/A | |
| Sinkhorn Natural Gradient for Generative Models | Unknown | N/A | |
| Logarithmic Regret Bound in Partially Observable Linear Dynamical Systems | Unknown | N/A | |
| FedSplit: an algorithmic framework for fast federated optimization | Unknown | N/A | |
| Precise expressions for random projections: Low-rank approximation and randomized Newton | Unknown | N/A | |
| Modern Hopfield Networks and Attention for Immune Repertoire Classification | Unknown | N/A | |
| Towards Minimax Optimal Reinforcement Learning in Factored Markov Decision Processes | Unknown | N/A | |
| The Lottery Ticket Hypothesis for Pre-trained BERT Networks | Unknown | N/A | |
| Stochastic Segmentation Networks: Modelling Spatially Correlated Aleatoric Uncertainty | Unknown | N/A | |
| SLIP: Learning to Predict in Unknown Dynamical Systems with Long-Term Memory | Unknown | N/A | |
| Simplifying Hamiltonian and Lagrangian Neural Networks via Explicit Constraints | Unknown | N/A | |
| Making Non-Stochastic Control (Almost) as Easy as Stochastic | Unknown | N/A | |
| Improved guarantees and a multiple-descent curve for Column Subset Selection and the Nystrom method | Unknown | N/A | |
| An Imitation from Observation Approach to Transfer Learning with Dynamics Mismatch | Unknown | N/A | |
| Robust Multi-Agent Reinforcement Learning with Model Uncertainty | Unknown | N/A | |
| Differentially Private Clustering: Tight Approximation Ratios | Unknown | N/A | |
| FrugalML: How to use ML Prediction APIs more accurately and cheaply | Unknown | N/A | |
| Rethinking the Value of Labels for Improving Class-Imbalanced Learning | Unknown | N/A | |
| On Convergence and Generalization of Dropout Training | Unknown | N/A | |
| Counterfactual Prediction for Bundle Treatment | Unknown | N/A | |
| Fully Convolutional Mesh Autoencoder using Efficient Spatially Varying Kernels | Unknown | N/A | |
| Measuring Robustness to Natural Distribution Shifts in Image Classification | Unknown | N/A | |
| Optimal Lottery Tickets via Subset Sum: Logarithmic Over-Parameterization is Sufficient | Unknown | N/A | |
| Online Linear Optimization with Many Hints | Unknown | N/A | |
| FixMatch: Simplifying Semi-Supervised Learning with Consistency and Confidence | Unknown | N/A | |
| Hitting the High Notes: Subset Selection for Maximizing Expected Order Statistics | Unknown | N/A | |
| Demystifying Orthogonal Monte Carlo and Beyond | Unknown | N/A | |
| Tackling the Objective Inconsistency Problem in Heterogeneous Federated Optimization | Unknown | N/A | |
| A Dynamical Central Limit Theorem for Shallow Neural Networks | Unknown | N/A | |
| Dual-Resolution Correspondence Networks | Unknown | N/A | |
| Near-Optimal Reinforcement Learning with Self-Play | Unknown | N/A | |
| An Unsupervised Information-Theoretic Perceptual Quality Metric | Unknown | N/A | |
| Exact expressions for double descent and implicit regularization via surrogate random design | Unknown | N/A | |
| Can I Trust My Fairness Metric? Assessing Fairness with Unlabeled Data and Bayesian Inference | Unknown | N/A | |
| Bayesian Attention Modules | Unknown | N/A | |
| Uncertainty Quantification for Inferring Hawkes Networks | Unknown | N/A | |
| Position-based Scaled Gradient for Model Quantization and Pruning | Unknown | N/A | |
| A Bayesian Perspective on Training Speed and Model Selection | Unknown | N/A | |
| Meta-Learning Requires Meta-Augmentation | Unknown | N/A | |
| Tensor Completion Made Practical | Unknown | N/A | |
| Group Knowledge Transfer: Federated Learning of Large CNNs at the Edge | Unknown | N/A | |
| Learning Black-Box Attackers with Transferable Priors and Query Feedback | Unknown | N/A | |
| Improved Algorithms for Online Submodular Maximization via First-order Regret Bounds | Unknown | N/A | |
| Estimating decision tree learnability with polylogarithmic sample complexity | Unknown | N/A | |
| Generalized Leverage Score Sampling for Neural Networks | Unknown | N/A | |
| On Reward-Free Reinforcement Learning with Linear Function Approximation | Unknown | N/A | |
| Causal Estimation with Functional Confounders | Unknown | N/A | |
| Re-Examining Linear Embeddings for High-Dimensional Bayesian Optimization | Unknown | N/A | |
| Nonasymptotic Guarantees for Spiked Matrix Recovery with Generative Priors | Unknown | N/A | |
| Funnel-Transformer: Filtering out Sequential Redundancy for Efficient Language Processing | Unknown | N/A | |
| ICE-BeeM: Identifiable Conditional Energy-Based Deep Models Based on Nonlinear ICA | Unknown | N/A | |
| Scattering GCN: Overcoming Oversmoothness in Graph Convolutional Networks | Unknown | N/A | |
| On Adaptive Attacks to Adversarial Example Defenses | Unknown | N/A | |
| Universal guarantees for decision tree induction via a higher-order splitting criterion | Unknown | N/A | |
| Glance and Focus: a Dynamic Approach to Reducing Spatial Redundancy in Image Classification | Unknown | N/A | |
| Learning Structured Distributions From Untrusted Batches: Faster and Simpler | Unknown | N/A | |
| Distributed Newton Can Communicate Less and Resist Byzantine Workers | Unknown | N/A | |
| Classification Under Misspecification: Halfspaces, Generalized Linear Models, and Evolvability | Unknown | N/A | |
| Identifying Learning Rules From Neural Network Observables | Unknown | N/A | |
| Attention-Gated Brain Propagation: How the brain can implement reward-based error backpropagation | Unknown | N/A | |
| Task-Agnostic Online Reinforcement Learning with an Infinite Mixture of Gaussian Processes | Unknown | N/A | |
| Gradient-EM Bayesian Meta-Learning | Unknown | N/A | |
| BAIL: Best-Action Imitation Learning for Batch Deep Reinforcement Learning | Unknown | N/A | |
| Understanding and Exploring the Network with Stochastic Architectures | Unknown | N/A | |
| Perturbing Across the Feature Hierarchy to Improve Standard and Strict Blackbox Attack Transferability | Unknown | N/A | |
| Causal Discovery in Physical Systems from Videos | Unknown | N/A | |
| Learning Compositional Rules via Neural Program Synthesis | Unknown | N/A | |
| Implicit Regularization and Convergence for Weight Normalization | Unknown | N/A | |
| Stage-wise Conservative Linear Bandits | Unknown | N/A | |
| Learning to Decode: Reinforcement Learning for Decoding of Sparse Graph-Based Channel Codes | Unknown | N/A | |
| Consistency Regularization for Certified Robustness of Smoothed Classifiers | Unknown | N/A | |
| Beta Embeddings for Multi-Hop Logical Reasoning in Knowledge Graphs | Unknown | N/A | |
| Open Graph Benchmark: Datasets for Machine Learning on Graphs | Unknown | N/A | |
| Analytical Probability Distributions and Exact Expectation-Maximization for Deep Generative Networks | Unknown | N/A | |
| Preference-based Reinforcement Learning with Finite-Time Guarantees | Unknown | N/A | |
| Predicting Training Time Without Training | Unknown | N/A | |
| Practical No-box Adversarial Attacks against DNNs | Unknown | N/A | |
| Neural Networks with Recurrent Generative Feedback | Unknown | N/A | |
| AViD Dataset: Anonymized Videos from Diverse Countries | Unknown | N/A | |
| How to Characterize The Landscape of Overparameterized Convolutional Neural Networks | Unknown | N/A | |
| Deep Direct Likelihood Knockoffs | Unknown | N/A | |
| Calibrated Reliable Regression using Maximum Mean Discrepancy | Unknown | N/A | |
| Instance-based Generalization in Reinforcement Learning | Unknown | N/A | |
| Tight First- and Second-Order Regret Bounds for Adversarial Linear Bandits | Unknown | N/A | |
| Community detection using fast low-cardinality semidefinite programming | Unknown | N/A | |
| Randomized tests for high-dimensional regression: A more efficient and powerful solution | Unknown | N/A | |
| Is Plug-in Solver Sample-Efficient for Feature-based Reinforcement Learning? | Unknown | N/A | |
| Minimax Regret of Switching-Constrained Online Convex Optimization: No Phase Transition | Unknown | N/A | |
| Robust Pre-Training by Adversarial Contrastive Learning | Unknown | N/A | |
| Asymptotically Optimal Exact Minibatch Metropolis-Hastings | Unknown | N/A | |
| Interpretable and Personalized Apprenticeship Scheduling: Learning Interpretable Scheduling Policies from Heterogeneous User Demonstrations | Unknown | N/A | |
| Towards Better Generalization of Adaptive Gradient Methods | Unknown | N/A | |
| Robust Recovery via Implicit Bias of Discrepant Learning Rates for Double Over-parameterization | Unknown | N/A | |
| Online Convex Optimization Over Erdos-Renyi Random Networks | Unknown | N/A | |
| Efficient Distance Approximation for Structured High-Dimensional Distributions via Learning | Unknown | N/A | |
| Learning to Execute Programs with Instruction Pointer Attention Graph Neural Networks | Unknown | N/A | |
| Sampling-Decomposable Generative Adversarial Recommender | Unknown | N/A | |
| Boundary thickness and robustness in learning models | Unknown | N/A | |
| Consistent Structural Relation Learning for Zero-Shot Segmentation | Unknown | N/A | |
| Statistical-Query Lower Bounds via Functional Gradients | Unknown | N/A | |
| Reward-rational (implicit) choice: A unifying formalism for reward learning | Unknown | N/A | |
| Partially View-aligned Clustering | Unknown | N/A | |
| How Can I Explain This to You? An Empirical Study of Deep Neural Network Explanation Methods | Unknown | N/A | |
| Succinct and Robust Multi-Agent Communication With Temporal Message Control | Unknown | N/A | |
| Learning to Dispatch for Job Shop Scheduling via Deep Reinforcement Learning | Unknown | N/A | |
| Learning Utilities and Equilibria in Non-Truthful Auctions | Unknown | N/A | |
| Weston-Watkins Hinge Loss and Ordered Partitions | Unknown | N/A | |
| DAGs with No Fears: A Closer Look at Continuous Optimization for Learning Bayesian Networks | Unknown | N/A | |
| Personalized Federated Learning with Moreau Envelopes | Unknown | N/A | |
| Reciprocal Adversarial Learning via Characteristic Functions | Unknown | N/A | |
| AutoSync: Learning to Synchronize for Data-Parallel Distributed Deep Learning | Unknown | N/A | |
| Input-Aware Dynamic Backdoor Attack | Unknown | N/A | |
| Uncertainty Aware Semi-Supervised Learning on Graph Data | Unknown | N/A | |
| Generalised Bayesian Filtering via Sequential Monte Carlo | Unknown | N/A | |
| Optimizing Mode Connectivity via Neuron Alignment | Unknown | N/A | |
| Non-Stochastic Control with Bandit Feedback | Unknown | N/A | |
| A Biologically Plausible Neural Network for Slow Feature Analysis | Unknown | N/A | |
| AdvFlow: Inconspicuous Black-box Adversarial Attacks using Normalizing Flows | Unknown | N/A | |
| Adaptive Graph Convolutional Recurrent Network for Traffic Forecasting | Unknown | N/A | |
| A Benchmark for Systematic Generalization in Grounded Language Understanding | Unknown | N/A | |
| A Game Theoretic Analysis of Additive Adversarial Attacks and Defenses | Unknown | N/A | |
| Patch2Self: Denoising Diffusion MRI with Self-Supervised Learning | Unknown | N/A | |
| Marginal Utility for Planning in Continuous or Large Discrete Action Spaces | Unknown | N/A | |
| Learning Linear Programs from Optimal Decisions | Unknown | N/A | |
| Unsupervised Learning of Dense Visual Representations | Unknown | N/A | |
| Implicit Bias in Deep Linear Classification: Initialization Scale vs Training Accuracy | Unknown | N/A | |
| Sparse Symplectically Integrated Neural Networks | Unknown | N/A | |
| Continual Learning of Control Primitives : Skill Discovery via Reset-Games | Unknown | N/A | |
| Distributionally Robust Federated Averaging | Unknown | N/A | |
| Learning by Minimizing the Sum of Ranked Range | Unknown | N/A | |
| From Boltzmann Machines to Neural Networks and Back Again | Unknown | N/A | |
| Efficient Learning of Discrete Graphical Models | Unknown | N/A | |
| MetaPoison: Practical General-purpose Clean-label Data Poisoning | Unknown | N/A | |
| BayReL: Bayesian Relational Learning for Multi-omics Data Integration | Unknown | N/A | |
| Why are Adaptive Methods Good for Attention Models? | Unknown | N/A | |
| Distributed Training with Heterogeneous Data: Bridging Median- and Mean-Based Algorithms | Unknown | N/A | |
| Learning Optimal Representations with the Decodable Information Bottleneck | Unknown | N/A | |
| Dual Manifold Adversarial Robustness: Defense against Lp and non-Lp Adversarial Attacks | Unknown | N/A | |
| Fair Hierarchical Clustering | Unknown | N/A | |
| MultiON: Benchmarking Semantic Map Memory using Multi-Object Navigation | Unknown | N/A | |
| Program Synthesis with Pragmatic Communication | Unknown | N/A | |
| Sparse and Continuous Attention Mechanisms | Unknown | N/A | |
| TorsionNet: A Reinforcement Learning Approach to Sequential Conformer Search | Unknown | N/A | |
| Subgroup-based Rank-1 Lattice Quasi-Monte Carlo | Unknown | N/A | |
| Neural Mesh Flow: 3D Manifold Mesh Generation via Diffeomorphic Flows | Unknown | N/A | |
| Grasp Proposal Networks: An End-to-End Solution for Visual Learning of Robotic Grasps | Unknown | N/A | |
| Finite-Time Analysis for Double Q-learning | Unknown | N/A | |
| Adversarial robustness via robust low rank representations | Unknown | N/A | |
| Understanding Gradient Clipping in Private SGD: A Geometric Perspective | Unknown | N/A | |
| The Flajolet-Martin Sketch Itself Preserves Differential Privacy: Private Counting with Minimal Space | Unknown | N/A | |
| General Control Functions for Causal Effect Estimation from IVs | Unknown | N/A | |
| Large-Scale Methods for Distributionally Robust Optimization | Unknown | N/A | |
| Towards a Better Global Loss Landscape of GANs | Unknown | N/A | |
| Flexible mean field variational inference using mixtures of non-overlapping exponential families | Unknown | N/A | |
| A Fair Classifier Using Kernel Density Estimation | Unknown | N/A | |
| Sequential Bayesian Experimental Design with Variable Cost Structure | Unknown | N/A | |
| Strongly local p-norm-cut algorithms for semi-supervised learning and local graph clustering | Unknown | N/A | |
| Barking up the right tree: an approach to search over molecule synthesis DAGs | Unknown | N/A | |
| Exemplar Guided Active Learning | Unknown | N/A | |
| Subgraph Neural Networks | Unknown | N/A | |
| Multi-Plane Program Induction with 3D Box Priors | Unknown | N/A | |
| X-CAL: Explicit Calibration for Survival Analysis | Unknown | N/A | |
| Smoothed Geometry for Robust Attribution | Unknown | N/A | |
| Interior Point Solving for LP-based prediction+optimisation | Unknown | N/A | |
| Privacy Amplification via Random Check-Ins | Unknown | N/A | |
| FLAMBE: Structural Complexity and Representation Learning of Low Rank MDPs | Unknown | N/A | |
| Dynamic Regret of Policy Optimization in Non-Stationary Environments | Unknown | N/A | |
| Learning Invariances in Neural Networks from Training Data | Unknown | N/A | |
| Geometric All-way Boolean Tensor Decomposition | Unknown | N/A | |
| Generalized Hindsight for Reinforcement Learning | Unknown | N/A | |
| Automatic Curriculum Learning through Value Disagreement | Unknown | N/A | |
| DisCor: Corrective Feedback in Reinforcement Learning via Distribution Correction | Unknown | N/A | |
| Ensuring Fairness Beyond the Training Data | Unknown | N/A | |
| Reward Propagation Using Graph Convolutional Networks | Unknown | N/A | |
| Neurosymbolic Reinforcement Learning with Formally Verified Exploration | Unknown | N/A | |
| Learning to Select Best Forecast Tasks for Clinical Outcome Prediction | Unknown | N/A | |
| Fairness with Overlapping Groups; a Probabilistic Perspective | Unknown | N/A | |
| Generalization Bound of Gradient Descent for Non-Convex Metric Learning | Unknown | N/A | |
| Reconsidering Generative Objectives For Counterfactual Reasoning | Unknown | N/A | |
| Optimizing Neural Networks via Koopman Operator Theory | Unknown | N/A | |
| Predictive coding in balanced neural networks with noise, chaos and delays | Unknown | N/A | |
| Memory-Efficient Learning of Stable Linear Dynamical Systems for Prediction and Control | Unknown | N/A | |
| Adversarial Blocking Bandits | Unknown | N/A | |
| Inference for Batched Bandits | Unknown | N/A | |
| Minimax Optimal Nonparametric Estimation of Heterogeneous Treatment Effects | Unknown | N/A | |
| Markovian Score Climbing: Variational Inference with KL(p\|\|q) | Unknown | N/A | |
| IDEAL: Inexact DEcentralized Accelerated Augmented Lagrangian Method | Unknown | N/A | |
| Rethinking Pre-training and Self-training | Unknown | N/A | |
| Minimax Dynamics of Optimally Balanced Spiking Networks of Excitatory and Inhibitory Neurons | Unknown | N/A | |
| The Generalization-Stability Tradeoff In Neural Network Pruning | Unknown | N/A | |
| Hierarchical Gaussian Process Priors for Bayesian Neural Network Weights | Unknown | N/A | |
| The Cone of Silence: Speech Separation by Localization | Unknown | N/A | |
| Optimization and Generalization of Shallow Neural Networks with Quadratic Activation Functions | Unknown | N/A | |
| Optimal Robustness-Consistency Trade-offs for Learning-Augmented Online Algorithms | Unknown | N/A | |
| Multi-task Additive Models for Robust Estimation and Automatic Structure Discovery | Unknown | N/A | |
| Learning Physical Graph Representations from Visual Scenes | Unknown | N/A | |
| Complex Dynamics in Simple Neural Networks: Understanding Gradient Flow in Phase Retrieval | Unknown | N/A | |
| Why Normalizing Flows Fail to Detect Out-of-Distribution Data | Unknown | N/A | |
| Demixed shared component analysis of neural population data from multiple brain areas | Unknown | N/A | |
| Does Unsupervised Architecture Representation Learning Help Neural Architecture Search? | Unknown | N/A | |
| Constant-Expansion Suffices for Compressed Sensing with Generative Priors | Unknown | N/A | |
| Variance-Reduced Off-Policy TDC Learning: Non-Asymptotic Convergence Analysis | Unknown | N/A | |
| Pointer Graph Networks | Unknown | N/A | |
| Thunder: a Fast Coordinate Selection Solver for Sparse Learning | Unknown | N/A | |
| MOPO: Model-based Offline Policy Optimization | Unknown | N/A | |
| Reinforcement Learning with Combinatorial Actions: An Application to Vehicle Routing | Unknown | N/A | |
| From Predictions to Decisions: Using Lookahead Regularization | Unknown | N/A | |
| Robust Federated Learning: The Case of Affine Distribution Shifts | Unknown | N/A | |
| Learning the Geometry of Wave-Based Imaging | Unknown | N/A | |
| Random Reshuffling is Not Always Better | Unknown | N/A | |
| Learning Composable Energy Surrogates for PDE Order Reduction | Unknown | N/A | |
| COBE: Contextualized Object Embeddings from Narrated Instructional Video | Unknown | N/A | |
| Optimal Learning from Verified Training Data | Unknown | N/A | |
| Counterexample-Guided Learning of Monotonic Neural Networks | Unknown | N/A | |
| Toward the Fundamental Limits of Imitation Learning | Unknown | N/A | |
| Targeted Adversarial Perturbations for Monocular Depth Prediction | Unknown | N/A | |
| Supermasks in Superposition | Unknown | N/A | |
| Online Agnostic Boosting via Regret Minimization | Unknown | N/A | |
| Differentiable Causal Discovery from Interventional Data | Unknown | N/A | |
| Truncated Linear Regression in High Dimensions | Unknown | N/A | |
| Design Space for Graph Neural Networks | Unknown | N/A | |
| Adversarial Example Games | Unknown | N/A | |
| WoodFisher: Efficient Second-Order Approximation for Neural Network Compression | Unknown | N/A | |
| Differentiable Augmentation for Data-Efficient GAN Training | Unknown | N/A | |
| Taming Discrete Integration via the Boon of Dimensionality | Unknown | N/A | |
| On the Theory of Transfer Learning: The Importance of Task Diversity | Unknown | N/A | |
| Distance Encoding: Design Provably More Powerful Neural Networks for Graph Representation Learning | Unknown | N/A | |
| Learning identifiable and interpretable latent models of high-dimensional neural activity using pi-VAE | Unknown | N/A | |
| Agnostic Learning of a Single Neuron with Gradient Descent | Unknown | N/A | |
| Coded Sequential Matrix Multiplication For Straggler Mitigation | Unknown | N/A | |
| Depth Uncertainty in Neural Networks | Unknown | N/A | |
| Fast Matrix Square Roots with Applications to Gaussian Processes and Bayesian Optimization | Unknown | N/A | |
| Interpretable multi-timescale models for predicting fMRI responses to continuous natural speech | Unknown | N/A | |
| Simple and Scalable Sparse k-means Clustering via Feature Ranking | Unknown | N/A | |
| Strictly Batch Imitation Learning by Energy-based Distribution Matching | Unknown | N/A | |
| Off-Policy Evaluation via the Regularized Lagrangian | Unknown | N/A | |
| Online Non-Convex Optimization with Imperfect Feedback | Unknown | N/A | |
| Identifying Mislabeled Data using the Area Under the Margin Ranking | Unknown | N/A | |
| See, Hear, Explore: Curiosity via Audio-Visual Association | Unknown | N/A | |
| Probabilistic Fair Clustering | Unknown | N/A | |
| Byzantine Resilient Distributed Multi-Task Learning | Unknown | N/A | |
| Geometric Exploration for Online Control | Unknown | N/A | |
| Autofocused oracles for model-based design | Unknown | N/A | |
| EvolveGraph: Multi-Agent Trajectory Prediction with Dynamic Relational Reasoning | Unknown | N/A | |
| No-Regret Learning Dynamics for Extensive-Form Correlated Equilibrium | Unknown | N/A | |
| Stability of Stochastic Gradient Descent on Nonsmooth Convex Losses | Unknown | N/A | |
| Improved Sample Complexity for Incremental Autonomous Exploration in MDPs | Unknown | N/A | |
| Differentiable Top-k with Optimal Transport | Unknown | N/A | |
| Reinforcement Learning with Augmented Data | Unknown | N/A | |
| Factorizable Graph Convolutional Networks | Unknown | N/A | |
| Handling Missing Data with Graph Representation Learning | Unknown | N/A | |
| Online Bayesian Persuasion | Unknown | N/A | |
| Graphon Neural Networks and the Transferability of Graph Neural Networks | Unknown | N/A | |
| Linearly Converging Error Compensated SGD | Unknown | N/A | |
| On the Loss Landscape of Adversarial Training: Identifying Challenges and How to Overcome Them | Unknown | N/A | |
| Efficient Contextual Bandits with Continuous Actions | Unknown | N/A | |
| From Trees to Continuous Embeddings and Back: Hyperbolic Hierarchical Clustering | Unknown | N/A | |
| Stochastic Optimization with Heavy-Tailed Noise via Accelerated Gradient Clipping | Unknown | N/A | |
| Smoothed Analysis of Online and Differentially Private Learning | Unknown | N/A | |
| Learning Rich Rankings | Unknown | N/A | |
| Refactoring Policy for Compositional Generalizability using Self-Supervised Object Proposals | Unknown | N/A | |
| The Advantage of Conditional Meta-Learning for Biased Regularization and Fine Tuning | Unknown | N/A | |
| Investigating Gender Bias in Language Models Using Causal Mediation Analysis | Unknown | N/A | |
| Calibration of Shared Equilibria in General Sum Partially Observable Markov Games | Unknown | N/A | |
| Statistical Guarantees of Distributed Nearest Neighbor Classification | Unknown | N/A | |
| Learning Differentiable Programs with Admissible Neural Heuristics | Unknown | N/A | |
| Learning Certified Individually Fair Representations | Unknown | N/A | |
| Breaking Reversibility Accelerates Langevin Dynamics for Non-Convex Optimization | Unknown | N/A | |
| Debiasing Averaged Stochastic Gradient Descent to handle missing values | Unknown | N/A | |
| Large-Scale Adversarial Training for Vision-and-Language Representation Learning | Unknown | N/A | |
| Learning from Mixtures of Private and Public Populations | Unknown | N/A | |
| Recurrent Switching Dynamical Systems Models for Multiple Interacting Neural Populations | Unknown | N/A | |
| Information theoretic limits of learning a sparse rule | Unknown | N/A | |
| Efficient Variational Inference for Sparse Deep Learning with Theoretical Guarantee | Unknown | N/A | |
| Reverse-engineering recurrent neural network solutions to a hierarchical inference task for mice | Unknown | N/A | |
| Principal Neighbourhood Aggregation for Graph Nets | Unknown | N/A | |
| Adaptive Importance Sampling for Finite-Sum Optimization and Sampling with Decreasing Step-Sizes | Unknown | N/A | |
| Look-ahead Meta Learning for Continual Learning | Unknown | N/A | |
| Movement Pruning: Adaptive Sparsity by Fine-Tuning | Unknown | N/A | |
| Fourier Sparse Leverage Scores and Approximate Kernel Learning | Unknown | N/A | |
| Emergent Complexity and Zero-shot Transfer via Unsupervised Environment Design | Unknown | N/A | |
| Sample Complexity of Asynchronous Q-Learning: Sharper Analysis and Variance Reduction | Unknown | N/A | |
| Learning to Incentivize Other Learning Agents | Unknown | N/A | |
| Improved Techniques for Training Score-Based Generative Models | Unknown | N/A | |
| Learning Affordance Landscapes for Interaction Exploration in 3D Environments | Unknown | N/A | |
| Debugging Tests for Model Explanations | Unknown | N/A | |
| Convergence of Meta-Learning with Task-Specific Adaptation over Partial Parameters | Unknown | N/A | |
| UCLID-Net: Single View Reconstruction in Object Space | Unknown | N/A | |
| Exploiting weakly supervised visual patterns to learn from partial annotations | Unknown | N/A | |
| Can Temporal-Difference and Q-Learning Learn Representation? A Mean-Field Theory | Unknown | N/A | |
| GradAug: A New Regularization Method for Deep Neural Networks | Unknown | N/A | |
| Reducing Adversarially Robust Learning to Non-Robust PAC Learning | Unknown | N/A | |
| Online Bayesian Goal Inference for Boundedly Rational Planning Agents | Unknown | N/A | |
| A Robust Functional EM Algorithm for Incomplete Panel Count Data | Unknown | N/A | |
| The Origins and Prevalence of Texture Bias in Convolutional Neural Networks | Unknown | N/A | |
| Closing the Dequantization Gap: PixelCNN as a Single-Layer Flow | Unknown | N/A | |
| RSKDD-Net: Random Sample-based Keypoint Detector and Descriptor | Unknown | N/A | |
| Walking in the Shadow: A New Perspective on Descent Directions for Constrained Minimization | Unknown | N/A | |
| Online Algorithms for Multi-shop Ski Rental with Machine Learned Advice | Unknown | N/A | |
| Sharp uniform convergence bounds through empirical centralization | Unknown | N/A | |
| Efficient Generation of Structured Objects with Constrained Adversarial Networks | Unknown | N/A | |
| Disentangling by Subspace Diffusion | Unknown | N/A | |
| Value-driven Hindsight Modelling | Unknown | N/A | |
| Adversarial Crowdsourcing Through Robust Rank-One Matrix Completion | Unknown | N/A | |
| Regression with reject option and application to kNN | Unknown | N/A | |
| A Finite-Time Analysis of Two Time-Scale Actor-Critic Methods | Unknown | N/A | |
| Deep Statistical Solvers | Unknown | N/A | |
| POLY-HOOT: Monte-Carlo Planning in Continuous Space MDPs with Non-Asymptotic Analysis | Unknown | N/A | |
| Dynamic allocation of limited memory resources in reinforcement learning | Unknown | N/A | |
| Variational Policy Gradient Method for Reinforcement Learning with General Utilities | Unknown | N/A | |
| A Topological Filter for Learning with Label Noise | Unknown | N/A | |
| Node Embeddings and Exact Low-Rank Representations of Complex Networks | Unknown | N/A | |
| Hyperparameter Ensembles for Robustness and Uncertainty Quantification | Unknown | N/A | |
| Log-Likelihood Ratio Minimizing Flows: Towards Robust and Quantifiable Neural Distribution Alignment | Unknown | N/A | |
| Faithful Embeddings for Knowledge Base Queries | Unknown | N/A | |
| The interplay between randomness and structure during learning in RNNs | Unknown | N/A | |
| Optimal Epoch Stochastic Gradient Descent Ascent Methods for Min-Max Optimization | Unknown | N/A | |
| On Efficiency in Hierarchical Reinforcement Learning | Unknown | N/A | |
| Cross-validation Confidence Intervals for Test Error | Unknown | N/A | |
| Learning Graph Structure With A Finite-State Automaton Layer | Unknown | N/A | |
| Adaptive Experimental Design with Temporal Interference: A Maximum Likelihood Approach | Unknown | N/A | |
| Continuous Submodular Maximization: Beyond DR-Submodularity | Unknown | N/A | |
| Convergence and Stability of Graph Convolutional Networks on Large Random Graphs | Unknown | N/A | |
| Joint Contrastive Learning with Infinite Possibilities | Unknown | N/A | |
| Gradient Estimation with Stochastic Softmax Tricks | Unknown | N/A | |
| Leveraging Predictions in Smoothed Online Convex Optimization via Gradient-based Algorithms | Unknown | N/A | |
| The Hateful Memes Challenge: Detecting Hate Speech in Multimodal Memes | Unknown | N/A | |
| Compressing Images by Encoding Their Latent Representations with Relative Entropy Coding | Unknown | N/A | |
| An operator view of policy gradient methods | Unknown | N/A | |
| Contextual Reserve Price Optimization in Auctions via Mixed Integer Programming | Unknown | N/A | |
| Multifaceted Uncertainty Estimation for Label-Efficient Deep Learning | Unknown | N/A | |
| UCSG-NET- Unsupervised Discovering of Constructive Solid Geometry Tree | Unknown | N/A | |
| Ridge Rider: Finding Diverse Solutions by Following Eigenvectors of the Hessian | Unknown | N/A | |
| CoinDICE: Off-Policy Confidence Interval Estimation | Unknown | N/A | |
| Learning Latent Space Energy-Based Prior Model | Unknown | N/A | |
| Temporal Positive-unlabeled Learning for Biomedical Hypothesis Generation via Risk Estimation | Unknown | N/A | |
| On the Modularity of Hypernetworks | Unknown | N/A | |
| On Uniform Convergence and Low-Norm Interpolation Learning | Unknown | N/A | |
| Sufficient dimension reduction for classification using principal optimal transport direction | Unknown | N/A | |
| Faster Randomized Infeasible Interior Point Methods for Tall/Wide Linear Programs | Unknown | N/A | |
| Asymmetric Shapley values: incorporating causal knowledge into model-agnostic explainability | Unknown | N/A | |
| List-Decodable Mean Estimation via Iterative Multi-Filtering | Unknown | N/A | |
| Distributionally Robust Local Non-parametric Conditional Estimation | Unknown | N/A | |
| Optimistic Dual Extrapolation for Coherent Non-monotone Variational Inequalities | Unknown | N/A | |
| Exploiting Higher Order Smoothness in Derivative-free Optimization and Continuous Bandits | Unknown | N/A | |
| Better Set Representations For Relational Reasoning | Unknown | N/A | |
| Decision trees as partitioning machines to characterize their generalization properties | Unknown | N/A | |
| Weighted QMIX: Expanding Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning | Unknown | N/A | |
| Color Visual Illusions: A Statistics-based Computational Model | Unknown | N/A | |
| Online Learning with Primary and Secondary Losses | Unknown | N/A | |
| Learning with Differentiable Pertubed Optimizers | Unknown | N/A | |
| MRI Banding Removal via Adversarial Training | Unknown | N/A | |
| Distributed Distillation for On-Device Learning | Unknown | N/A | |
| Stochastic Gradient Descent in Correlated Settings: A Study on Gaussian Processes | Unknown | N/A | |
| Training Normalizing Flows with the Information Bottleneck for Competitive Generative Classification | Unknown | N/A | |
| Self-Supervised Few-Shot Learning on Point Clouds | Unknown | N/A | |
| Approximate Cross-Validation for Structured Models | Unknown | N/A | |
| An Efficient Framework for Clustered Federated Learning | Unknown | N/A | |
| Gaussian Gated Linear Networks | Unknown | N/A | |
| 3D Shape Reconstruction from Vision and Touch | Unknown | N/A | |
| Soft Contrastive Learning for Visual Localization | Unknown | N/A | |
| Online Robust Regression via SGD on the l1 loss | Unknown | N/A | |
| Domain Adaptation with Conditional Distribution Matching and Generalized Label Shift | Unknown | N/A | |
| A Causal View on Robustness of Neural Networks | Unknown | N/A | |
| Listening to Sounds of Silence for Speech Denoising | Unknown | N/A | |
| The NetHack Learning Environment | Unknown | N/A | |
| Why Do Deep Residual Networks Generalize Better than Deep Feedforward Networks? --- A Neural Tangent Kernel Perspective | Unknown | N/A | |
| Multi-Task Reinforcement Learning with Soft Modularization | Unknown | N/A | |
| Causal Imitation Learning With Unobserved Confounders | Unknown | N/A | |
| CSER: Communication-efficient SGD with Error Reset | Unknown | N/A | |
| The All-or-Nothing Phenomenon in Sparse Tensor PCA | Unknown | N/A | |
| Cooperative Multi-player Bandit Optimization | Unknown | N/A | |
| Model-based Reinforcement Learning for Semi-Markov Decision Processes with Neural ODEs | Unknown | N/A | |
| The phase diagram of approximation rates for deep neural networks | Unknown | N/A | |
| Deep Automodulators | Unknown | N/A | |
| Storage Efficient and Dynamic Flexible Runtime Channel Pruning via Deep Reinforcement Learning | Unknown | N/A | |
| The Statistical Cost of Robust Kernel Hyperparameter Turning | Unknown | N/A | |
| Choice Bandits | Unknown | N/A | |
| Reinforcement Learning with Feedback Graphs | Unknown | N/A | |
| Deep learning versus kernel learning: an empirical study of loss landscape geometry and the time evolution of the Neural Tangent Kernel | Unknown | N/A | |
| Dynamic Fusion of Eye Movement Data and Verbal Narrations in Knowledge-rich Domains | Unknown | N/A | |
| On Second Order Behaviour in Augmented Neural ODEs | Unknown | N/A | |
| Learning Object-Centric Representations of Multi-Object Scenes from Multiple Views | Unknown | N/A | |
| Regret in Online Recommendation Systems | Unknown | N/A | |
| An Equivalence between Loss Functions and Non-Uniform Sampling in Experience Replay | Unknown | N/A | |
| The Devil is in the Detail: A Framework for Macroscopic Prediction via Microscopic Models | Unknown | N/A | |
| Can Graph Neural Networks Count Substructures? | Unknown | N/A | |
| Ode to an ODE | Unknown | N/A | |
| Approximate Cross-Validation with Low-Rank Data in High Dimensions | Unknown | N/A | |
| Online Multitask Learning with Long-Term Memory | Unknown | N/A | |
| Generating Adjacency-Constrained Subgoals in Hierarchical Reinforcement Learning | Unknown | N/A | |
| CHIP: A Hawkes Process Model for Continuous-time Networks with Scalable and Consistent Estimation | Unknown | N/A | |
| Unsupervised Translation of Programming Languages | Unknown | N/A | |
| Conic Descent and its Application to Memory-efficient Optimization over Positive Semidefinite Matrices | Unknown | N/A | |
| STEER: Simple Temporal Regularization For Neural ODE | Unknown | N/A | |
| Certifying Strategyproof Auction Networks | Unknown | N/A | |
| A Spectral Energy Distance for Parallel Speech Synthesis | Unknown | N/A | |
| Distributionally Robust Parametric Maximum Likelihood Estimation | Unknown | N/A | |
| Mix and Match: An Optimistic Tree-Search Approach for Learning Models from Mixture Distributions | Unknown | N/A | |
| Building powerful and equivariant graph neural networks with structural message-passing | Unknown | N/A | |
| Meta-trained agents implement Bayes-optimal agents | Unknown | N/A | |
| Compositional Visual Generation with Energy Based Models | Unknown | N/A | |
| Task-agnostic Exploration in Reinforcement Learning | Unknown | N/A | |
| Empirical Likelihood for Contextual Bandits | Unknown | N/A | |
| Leverage the Average: an Analysis of KL Regularization in Reinforcement Learning | Unknown | N/A | |
| AdaShare: Learning What To Share For Efficient Deep Multi-Task Learning | Unknown | N/A | |
| Metric-Free Individual Fairness in Online Learning | Unknown | N/A | |
| Decentralized TD Tracking with Linear Function Approximation and its Finite-Time Analysis | Unknown | N/A | |
| Deep Inverse Q-learning with Constraints | Unknown | N/A | |
| Compact task representations as a normative model for higher-order brain activity | Unknown | N/A | |
| Fast Transformers with Clustered Attention | Unknown | N/A | |
| Overfitting Can Be Harmless for Basis Pursuit, But Only to a Degree | Unknown | N/A | |
| Fully Dynamic Algorithm for Constrained Submodular Optimization | Unknown | N/A | |
| Stochastic Stein Discrepancies | Unknown | N/A | |
| Unfolding recurrence by Green’s functions for optimized reservoir computing | Unknown | N/A | |
| Guiding Deep Molecular Optimization with Genetic Exploration | Unknown | N/A | |
| Steering Distortions to Preserve Classes and Neighbors in Supervised Dimensionality Reduction | Unknown | N/A | |
| Debiased Contrastive Learning | Unknown | N/A | |
| An analytic theory of shallow networks dynamics for hinge loss classification | Unknown | N/A | |
| Classification with Valid and Adaptive Coverage | Unknown | N/A | |
| High-Throughput Synchronous Deep RL | Unknown | N/A | |
| Consistent feature selection for analytic deep neural networks | Unknown | N/A | |
| OrganITE: Optimal transplant donor organ offering using an individual treatment effect | Unknown | N/A | |
| On 1/n neural representation and robustness | Unknown | N/A | |
| A Statistical Mechanics Framework for Task-Agnostic Sample Design in Machine Learning | Unknown | N/A | |
| Efficient Marginalization of Discrete and Structured Latent Variables via Sparsity | Unknown | N/A | |
| A simple normative network approximates local non-Hebbian learning in the cortex | Unknown | N/A | |
| Boosting Adversarial Training with Hypersphere Embedding | Unknown | N/A | |
| Robust Sequence Submodular Maximization | Unknown | N/A | |
| Improving Policy-Constrained Kidney Exchange via Pre-Screening | Unknown | N/A | |
| Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks | Unknown | N/A | |
| Estimating Training Data Influence by Tracing Gradient Descent | Unknown | N/A | |
| MCUNet: Tiny Deep Learning on IoT Devices | Unknown | N/A | |
| Bi-level Score Matching for Learning Energy-based Latent Variable Models | Unknown | N/A | |
| Training Linear Finite-State Machines | Unknown | N/A | |
| CSI: Novelty Detection via Contrastive Learning on Distributionally Shifted Instances | Unknown | N/A | |
| Robustness Analysis of Non-Convex Stochastic Gradient Descent using Biased Expectations | Unknown | N/A | |
| PAC-Bayesian Bound for the Conditional Value at Risk | Unknown | N/A | |
| Neuron Shapley: Discovering the Responsible Neurons | Unknown | N/A | |
| Sample-Efficient Optimization in the Latent Space of Deep Generative Models via Weighted Retraining | Unknown | N/A | |
| Adversarial Robustness of Supervised Sparse Coding | Unknown | N/A | |
| Efficient Learning of Generative Models via Finite-Difference Score Matching | Unknown | N/A | |
| CLEARER: Multi-Scale Neural Architecture Search for Image Restoration | Unknown | N/A | |
| Graph Meta Learning via Local Subgraphs | Unknown | N/A | |
| Semi-Supervised Partial Label Learning via Confidence-Rated Margin Maximization | Unknown | N/A | |
| Bridging the Gap between Sample-based and One-shot Neural Architecture Search with BONAS | Unknown | N/A | |
| The LoCA Regret: A Consistent Metric to Evaluate Model-Based Behavior in Reinforcement Learning | Unknown | N/A | |
| Firefly Neural Architecture Descent: a General Approach for Growing Neural Networks | Unknown | N/A | |
| Digraph Inception Convolutional Networks | Unknown | N/A | |
| Learning Disentangled Representations of Videos with Missing Data | Unknown | N/A | |
| Graph Random Neural Networks for Semi-Supervised Learning on Graphs | Unknown | N/A | |
| Smoothly Bounding User Contributions in Differential Privacy | Unknown | N/A | |
| Robust-Adaptive Control of Linear Systems: beyond Quadratic Costs | Unknown | N/A | |
| Zero-Resource Knowledge-Grounded Dialogue Generation | Unknown | N/A | |
| Contrastive Learning with Adversarial Examples | Unknown | N/A | |
| Sub-sampling for Efficient Non-Parametric Bandit Exploration | Unknown | N/A | |
| Language and Visual Entity Relationship Graph for Agent Navigation | Unknown | N/A | |
| A Combinatorial Perspective on Transfer Learning | Unknown | N/A | |
| Learning Robust Decision Policies from Observational Data | Unknown | N/A | |
| Neuron Merging: Compensating for Pruned Neurons | Unknown | N/A | |
| Most ReLU Networks Suffer from $\ell^2$ Adversarial Perturbations | Unknown | N/A | |
| Constraining Variational Inference with Geometric Jensen-Shannon Divergence | Unknown | N/A | |
| Adversarial Learning for Robust Deep Clustering | Unknown | N/A | |
| Biological credit assignment through dynamic inversion of feedforward networks | Unknown | N/A | |
| Simple and Principled Uncertainty Estimation with Deterministic Deep Learning via Distance Awareness | Unknown | N/A | |
| Neural Networks Learning and Memorization with (almost) no Over-Parameterization | Unknown | N/A | |
| Phase retrieval in high dimensions: Statistical and computational phase transitions | Unknown | N/A | |
| From Finite to Countable-Armed Bandits | Unknown | N/A | |
| A Variational Approach for Learning from Positive and Unlabeled Data | Unknown | N/A | |
| Evidential Sparsification of Multimodal Latent Spaces in Conditional Variational Autoencoders | Unknown | N/A | |
| GOCor: Bringing Globally Optimized Correspondence Volumes into Your Neural Network | Unknown | N/A | |
| Latent Dynamic Factor Analysis of High-Dimensional Neural Recordings | Unknown | N/A | |
| Balanced Meta-Softmax for Long-Tailed Visual Recognition | Unknown | N/A | |
| Shared Experience Actor-Critic for Multi-Agent Reinforcement Learning | Unknown | N/A | |
| Synthesizing Tasks for Block-based Programming | Unknown | N/A | |
| Neutralizing Self-Selection Bias in Sampling for Sortition | Unknown | N/A | |
| Learning Semantic-aware Normalization for Generative Adversarial Networks | Unknown | N/A | |
| Causal analysis of Covid-19 Spread in Germany | Unknown | N/A | |
| Posterior Re-calibration for Imbalanced Datasets | Unknown | N/A | |
| Optimally Deceiving a Learning Leader in Stackelberg Games | Unknown | N/A | |
| Regularizing Black-box Models for Improved Interpretability | Unknown | N/A | |
| Model-based Policy Optimization with Unsupervised Model Adaptation | Unknown | N/A | |
| GANSpace: Discovering Interpretable GAN Controls | Unknown | N/A | |
| Effective Diversity in Population Based Reinforcement Learning | Unknown | N/A | |
| Learning Invariants through Soft Unification | Unknown | N/A | |
| Multi-Stage Influence Function | Unknown | N/A | |
| On ranking via sorting by estimated expected utility | Unknown | N/A | |
| Algorithmic recourse under imperfect causal knowledge: a probabilistic approach | Unknown | N/A | |
| Locally-Adaptive Nonparametric Online Learning | Unknown | N/A | |
| Small Nash Equilibrium Certificates in Very Large Games | Unknown | N/A | |
| Influence-Augmented Online Planning for Complex Environments | Unknown | N/A | |
| Pontryagin Differentiable Programming: An End-to-End Learning and Control Framework | Unknown | N/A | |
| Community detection in sparse time-evolving graphs with a dynamical Bethe-Hessian | Unknown | N/A | |
| Dynamic Submodular Maximization | Unknown | N/A | |
| Discovering Reinforcement Learning Algorithms | Unknown | N/A | |
| Adversarial Soft Advantage Fitting: Imitation Learning without Policy Optimization | Unknown | N/A | |
| Liberty or Depth: Deep Bayesian Neural Nets Do Not Need Complex Weight Posterior Approximations | Unknown | N/A | |
| VarGrad: A Low-Variance Gradient Estimator for Variational Inference | Unknown | N/A | |
| Online Decision Based Visual Tracking via Reinforcement Learning | Unknown | N/A | |
| Stationary Activations for Uncertainty Calibration in Deep Learning | Unknown | N/A | |
| NanoFlow: Scalable Normalizing Flows with Sublinear Parameter Complexity | Unknown | N/A | |
| How hard is to distinguish graphs with graph neural networks? | Unknown | N/A | |
| On Testing of Samplers | Unknown | N/A | |
| Towards Scalable Bayesian Learning of Causal DAGs | Unknown | N/A | |
| Variational Interaction Information Maximization for Cross-domain Disentanglement | Unknown | N/A | |
| Synthetic Data Generators -- Sequential and Private | Unknown | N/A | |
| The Convolution Exponential and Generalized Sylvester Flows | Unknown | N/A | |
| When and How to Lift the Lockdown? Global COVID-19 Scenario Analysis and Policy Assessment using Compartmental Gaussian Processes | Unknown | N/A | |
| Rational neural networks | Unknown | N/A | |
| Non-Crossing Quantile Regression for Distributional Reinforcement Learning | Unknown | N/A | |
| Learning Implicit Functions for Topology-Varying Dense 3D Shape Correspondence | Unknown | N/A | |
| Learning Implicit Credit Assignment for Cooperative Multi-Agent Reinforcement Learning | Unknown | N/A | |
| Real World Games Look Like Spinning Tops | Unknown | N/A | |
| Fast Fourier Convolution | Unknown | N/A | |
| Agree to Disagree: Adaptive Ensemble Knowledge Distillation in Gradient Space | Unknown | N/A | |
| Stochastic Optimization for Performative Prediction | Unknown | N/A | |
| A Self-Tuning Actor-Critic Algorithm | Unknown | N/A | |
| A Continuous-Time Mirror Descent Approach to Sparse Phase Retrieval | Unknown | N/A | |
| A Non-Asymptotic Analysis for Stein Variational Gradient Descent | Unknown | N/A | |
| Restoring Negative Information in Few-Shot Object Detection | Unknown | N/A | |
| Graph Stochastic Neural Networks for Semi-supervised Learning | Unknown | N/A | |
| BoxE: A Box Embedding Model for Knowledge Base Completion | Unknown | N/A | |
| Decentralized Accelerated Proximal Gradient Descent | Unknown | N/A | |
| Second Order Optimality in Decentralized Non-Convex Optimization via Perturbed Gradient Tracking | Unknown | N/A | |
| Munchausen Reinforcement Learning | Unknown | N/A | |
| Hierarchical Neural Architecture Search for Deep Stereo Matching | Unknown | N/A | |
| Time-Reversal Symmetric ODE Network | Unknown | N/A | |
| Can Implicit Bias Explain Generalization? Stochastic Convex Optimization as a Case Study | Unknown | N/A | |
| CoADNet: Collaborative Aggregation-and-Distribution Networks for Co-Salient Object Detection | Unknown | N/A | |
| Learning Parities with Neural Networks | Unknown | N/A | |
| Quantile Propagation for Wasserstein-Approximate Gaussian Processes | Unknown | N/A | |
| The Implications of Local Correlation on Learning Some Deep Functions | Unknown | N/A | |
| Deep Shells: Unsupervised Shape Correspondence with Optimal Transport | Unknown | N/A | |
| A Theoretical Framework for Target Propagation | Unknown | N/A | |
| Bayesian Meta-Learning for the Few-Shot Setting via Deep Kernels | Unknown | N/A | |
| Statistical Efficiency of Thompson Sampling for Combinatorial Semi-Bandits | Unknown | N/A | |
| Towards Maximizing the Representation Gap between In-Domain & Out-of-Distribution Examples | Unknown | N/A | |
| Discover, Hallucinate, and Adapt: Open Compound Domain Adaptation for Semantic Segmentation | Unknown | N/A | |
| Efficient Clustering Based On A Unified View Of K-means And Ratio-cut | Unknown | N/A | |
| Reservoir Computing meets Recurrent Kernels and Structured Transforms | Unknown | N/A | |
| SuperLoss: A Generic Loss for Robust Curriculum Learning | Unknown | N/A | |
| Auto-Panoptic: Cooperative Multi-Component Architecture Search for Panoptic Segmentation | Unknown | N/A | |
| Lower Bounds and Optimal Algorithms for Personalized Federated Learning | Unknown | N/A | |
| Multiview Neural Surface Reconstruction by Disentangling Geometry and Appearance | Unknown | N/A | |
| Deep Relational Topic Modeling via Graph Poisson Gamma Belief Network | Unknown | N/A | |
| Fast Convergence of Langevin Dynamics on Manifold: Geodesics meet Log-Sobolev | Unknown | N/A | |
| Spike and slab variational Bayes for high dimensional logistic regression | Unknown | N/A | |
| Linear Disentangled Representations and Unsupervised Action Estimation | Unknown | N/A | |
| Interventional Few-Shot Learning | Unknown | N/A | |
| HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis | Unknown | N/A | |
| A Randomized Algorithm to Reduce the Support of Discrete Measures | Unknown | N/A | |
| BRP-NAS: Prediction-based NAS using GCNs | Unknown | N/A | |
| Simplify and Robustify Negative Sampling for Implicit Collaborative Filtering | Unknown | N/A | |
| Meta-Consolidation for Continual Learning | Unknown | N/A | |
| Learning from Aggregate Observations | Unknown | N/A | |
| Smooth And Consistent Probabilistic Regression Trees | Unknown | N/A | |
| Stochastic Recursive Gradient Descent Ascent for Stochastic Nonconvex-Strongly-Concave Minimax Problems | Unknown | N/A | |
| Bayesian filtering unifies adaptive and non-adaptive neural network optimization methods | Unknown | N/A | |
| CASTLE: Regularization via Auxiliary Causal Graph Discovery | Unknown | N/A | |
| Prediction with Corrupted Expert Advice | Unknown | N/A | |
| Quantifying the Empirical Wasserstein Distance to a Set of Measures: Beating the Curse of Dimensionality | Unknown | N/A | |
| Model Inversion Networks for Model-Based Optimization | Unknown | N/A | |
| Universal Function Approximation on Graphs | Unknown | N/A | |
| Efficient active learning of sparse halfspaces with arbitrary bounded noise | Unknown | N/A | |
| On the Ergodicity, Bias and Asymptotic Normality of Randomized Midpoint Sampling Method | Unknown | N/A | |
| MuSCLE: Multi Sweep Compression of LiDAR using Deep Entropy Models | Unknown | N/A | |
| Provably Efficient Neural GTD for Off-Policy Learning | Unknown | N/A | |
| Glow-TTS: A Generative Flow for Text-to-Speech via Monotonic Alignment Search | Unknown | N/A | |
| Probabilistic Circuits for Variational Inference in Discrete Graphical Models | Unknown | N/A | |
| Generalization error in high-dimensional perceptrons: Approaching Bayes error with convex optimization | Unknown | N/A | |
| Falcon: Fast Spectral Inference on Encrypted Data | Unknown | N/A | |
| Exemplar VAE: Linking Generative Models, Nearest Neighbor Retrieval, and Data Augmentation | Unknown | N/A | |
| Self-training Avoids Using Spurious Features Under Domain Shift | Unknown | N/A | |
| Model-based Adversarial Meta-Reinforcement Learning | Unknown | N/A | |
| An implicit function learning approach for parametric modal regression | Unknown | N/A | |
| Online Optimization with Memory and Competitive Control | Unknown | N/A | |
| Automatically Learning Compact Quality-aware Surrogates for Optimization Problems | Unknown | N/A | |
| Hierarchical Granularity Transfer Learning | Unknown | N/A | |
| Dialog without Dialog Data: Learning Visual Dialog Agents from VQA Data | Unknown | N/A | |
| Finite-Time Analysis of Round-Robin Kullback-Leibler Upper Confidence Bounds for Optimal Adaptive Allocation with Multiple Plays and Markovian Rewards | Unknown | N/A | |
| Bayesian Multi-type Mean Field Multi-agent Imitation Learning | Unknown | N/A | |
| MomentumRNN: Integrating Momentum into Recurrent Neural Networks | Unknown | N/A | |
| Optimal Query Complexity of Secure Stochastic Convex Optimization | Unknown | N/A | |
| Self-Distillation as Instance-Specific Label Smoothing | Unknown | N/A | |
| Diversity can be Transferred: Output Diversification for White- and Black-box Attacks | Unknown | N/A | |
| Belief-Dependent Macro-Action Discovery in POMDPs using the Value of Information | Unknown | N/A | |
| On Adaptive Distance Estimation | Unknown | N/A | |
| The Power of Comparisons for Actively Learning Linear Classifiers | Unknown | N/A | |
| Multi-Fidelity Bayesian Optimization via Deep Neural Networks | Unknown | N/A | |
| The route to chaos in routing games: When is price of anarchy too optimistic? | Unknown | N/A | |
| Intra-Processing Methods for Debiasing Neural Networks | Unknown | N/A | |
| Entrywise convergence of iterative methods for eigenproblems | Unknown | N/A | |
| SoftFlow: Probabilistic Framework for Normalizing Flow on Manifolds | Unknown | N/A | |
| Learning Strategy-Aware Linear Classifiers | Unknown | N/A | |
| Functional Regularization for Representation Learning: A Unified Theoretical Perspective | Unknown | N/A | |
| Learning Sparse Prototypes for Text Generation | Unknown | N/A | |
| Towards a Combinatorial Characterization of Bounded-Memory Learning | Unknown | N/A | |
| RepPoints v2: Verification Meets Regression for Object Detection | Unknown | N/A | |
| Submodular Meta-Learning | Unknown | N/A | |
| Fewer is More: A Deep Graph Metric Learning Perspective Using Fewer Proxies | Unknown | N/A | |
| The Dilemma of TriHard Loss and an Element-Weighted TriHard Loss for Person Re-Identification | Unknown | N/A | |
| Detecting Interactions from Neural Networks via Topological Analysis | Unknown | N/A | |
| Deterministic Approximation for Submodular Maximization over a Matroid in Nearly Linear Time | Unknown | N/A | |
| Learning to Mutate with Hypergradient Guided Population | Unknown | N/A | |
| Network size and size of the weights in memorization with two-layers neural networks | Unknown | N/A | |
| COPT: Coordinated Optimal Transport on Graphs | Unknown | N/A | |
| Deep Imitation Learning for Bimanual Robotic Manipulation | Unknown | N/A | |
| Improving Auto-Augment via Augmentation-Wise Weight Sharing | Unknown | N/A | |
| GNNGuard: Defending Graph Neural Networks against Adversarial Attacks | Unknown | N/A | |
| Delving into the Cyclic Mechanism in Semi-supervised Video Object Segmentation | Unknown | N/A | |
| Rotated Binary Neural Network | Unknown | N/A | |
| Sense and Sensitivity Analysis: Simple Post-Hoc Analysis of Bias Due to Unobserved Confounding | Unknown | N/A | |
| Wasserstein Distances for Stereo Disparity Estimation | Unknown | N/A | |
| Rotation-Invariant Local-to-Global Representation Learning for 3D Point Cloud | Unknown | N/A | |
| One Solution is Not All You Need: Few-Shot Extrapolation via Structured MaxEnt RL | Unknown | N/A | |
| Security Analysis of Safe and Seldonian Reinforcement Learning Algorithms | Unknown | N/A | |
| Towards Deeper Graph Neural Networks with Differentiable Group Normalization | Unknown | N/A | |
| Provably Robust Metric Learning | Unknown | N/A | |
| Estimation of Skill Distribution from a Tournament | Unknown | N/A | |
| Ultra-Low Precision 4-bit Training of Deep Neural Networks | Unknown | N/A | |
| Reinforcement Learning with General Value Function Approximation: Provably Efficient Approach via Bounded Eluder Dimension | Unknown | N/A | |
| Shared Space Transfer Learning for analyzing multi-site fMRI data | Unknown | N/A | |
| Planning with General Objective Functions: Going Beyond Total Rewards | Unknown | N/A | |
| Structured Prediction for Conditional Meta-Learning | Unknown | N/A | |
| Correlation Robust Influence Maximization | Unknown | N/A | |
| Robust Gaussian Covariance Estimation in Nearly-Matrix Multiplication Time | Unknown | N/A | |
| MOReL: Model-Based Offline Reinforcement Learning | Unknown | N/A | |
| GCN meets GPU: Decoupling “When to Sample” from “How to Sample” | Unknown | N/A | |
| NVAE: A Deep Hierarchical Variational Autoencoder | Unknown | N/A | |
| VIME: Extending the Success of Self- and Semi-supervised Learning to Tabular Domain | Unknown | N/A | |
| Zap Q-Learning With Nonlinear Function Approximation | Unknown | N/A | |
| Learning Deep Attribution Priors Based On Prior Knowledge | Unknown | N/A | |
| Your GAN is Secretly an Energy-based Model and You Should Use Discriminator Driven Latent Sampling | Unknown | N/A | |
| Polynomial-Time Computation of Optimal Correlated Equilibria in Two-Player Extensive-Form Games with Public Chance Moves and Beyond | Unknown | N/A | |
| Neural Dynamic Policies for End-to-End Sensorimotor Learning | Unknown | N/A | |
| Sparse Graphical Memory for Robust Planning | Unknown | N/A | |
| PGM-Explainer: Probabilistic Graphical Model Explanations for Graph Neural Networks | Unknown | N/A | |
| An Improved Analysis of (Variance-Reduced) Policy Gradient and Natural Policy Gradient Methods | Unknown | N/A | |
| Weakly-Supervised Reinforcement Learning for Controllable Behavior | Unknown | N/A | |
| Modeling and Optimization Trade-off in Meta-learning | Unknown | N/A | |
| Structured Convolutions for Efficient Neural Network Design | Unknown | N/A | |
| Biased Stochastic First-Order Methods for Conditional Stochastic Optimization and Applications in Meta Learning | Unknown | N/A | |
| Walsh-Hadamard Variational Inference for Bayesian Deep Learning | Unknown | N/A | |
| Efficient Algorithms for Device Placement of DNN Graph Operators | Unknown | N/A | |
| Weisfeiler and Leman go sparse: Towards scalable higher-order graph embeddings | Unknown | N/A | |
| Learning Discrete Energy-based Models via Auxiliary-variable Local Exploration | Unknown | N/A | |
| The MAGICAL Benchmark for Robust Imitation | Unknown | N/A | |
| Unsupervised Joint k-node Graph Representations with Compositional Energy-Based Models | Unknown | N/A | |
| No-regret Learning in Price Competitions under Consumer Reference Effects | Unknown | N/A | |
| Self-Imitation Learning via Generalized Lower Bound Q-learning | Unknown | N/A | |
| Off-policy Policy Evaluation For Sequential Decisions Under Unobserved Confounding | Unknown | N/A | |
| Learning Agent Representations for Ice Hockey | Unknown | N/A | |
| Bayesian Causal Structural Learning with Zero-Inflated Poisson Bayesian Networks | Unknown | N/A | |
| On Numerosity of Deep Neural Networks | Unknown | N/A | |
| A mathematical theory of cooperative communication | Unknown | N/A | |
| Detection as Regression: Certified Object Detection with Median Smoothing | Unknown | N/A | |
| Follow the Perturbed Leader: Optimism and Fast Parallel Algorithms for Smooth Minimax Games | Unknown | N/A | |
| Instead of Rewriting Foreign Code for Machine Learning, Automatically Synthesize Fast Gradients | Unknown | N/A | |
| Improving Neural Network Training in Low Dimensional Random Bases | Unknown | N/A | |
| Glyph: Fast and Accurately Training Deep Neural Networks on Encrypted Data | Unknown | N/A | |
| COT-GAN: Generating Sequential Data via Causal Optimal Transport | Unknown | N/A | |
| Learning from Label Proportions: A Mutual Contamination Framework | Unknown | N/A | |
| Truthful Data Acquisition via Peer Prediction | Unknown | N/A | |
| A Contour Stochastic Gradient Langevin Dynamics Algorithm for Simulations of Multi-modal Distributions | Unknown | N/A | |
| On the Equivalence between Online and Private Learnability beyond Binary Classification | Unknown | N/A | |
| Unsupervised Data Augmentation for Consistency Training | Unknown | N/A | |
| Hedging in games: Faster convergence of external and swap regrets | Unknown | N/A | |
| Pixel-Level Cycle Association: A New Perspective for Domain Adaptive Semantic Segmentation | Unknown | N/A | |
| What Makes for Good Views for Contrastive Learning? | Unknown | N/A | |
| Fairness in Streaming Submodular Maximization: Algorithms and Hardness | Unknown | N/A | |
| On Learning Ising Models under Huber's Contamination Model | Unknown | N/A | |
| Make One-Shot Video Object Segmentation Efficient Again | Unknown | N/A | |
| Black-Box Ripper: Copying black-box models using generative evolutionary algorithms | Unknown | N/A | |
| Fine-Grained Dynamic Head for Object Detection | Unknown | N/A | |
| Estimation and Imputation in Probabilistic Principal Component Analysis with Missing Not At Random Data | Unknown | N/A | |
| Meta-Learning through Hebbian Plasticity in Random Networks | Unknown | N/A | |
| Hold me tight! Influence of discriminative features on deep network boundaries | Unknown | N/A | |
| Heavy-tailed Representations, Text Polarity Classification & Data Augmentation | Unknown | N/A | |
| Global Convergence of Deep Networks with One Wide Layer Followed by Pyramidal Topology | Unknown | N/A | |
| A Maximum-Entropy Approach to Off-Policy Evaluation in Average-Reward MDPs | Unknown | N/A | |
| Efficient Low Rank Gaussian Variational Inference for Neural Networks | Unknown | N/A | |
| Learning Mutational Semantics | Unknown | N/A | |
| RATT: Recurrent Attention to Transient Tasks for Continual Image Captioning | Unknown | N/A | |
| Bayesian Optimization for Iterative Learning | Unknown | N/A | |
| Asymptotic normality and confidence intervals for derivatives of 2-layers neural network in the random features model | Unknown | N/A | |
| f-GAIL: Learning f-Divergence for Generative Adversarial Imitation Learning | Unknown | N/A | |
| On the Role of Sparsity and DAG Constraints for Learning Linear DAGs | Unknown | N/A | |
| One Ring to Rule Them All: Certifiably Robust Geometric Perception with Outliers | Unknown | N/A | |
| Revisiting Frank-Wolfe for Polytopes: Strict Complementarity and Sparsity | Unknown | N/A | |
| Towards Neural Programming Interfaces | Unknown | N/A | |
| Transductive Information Maximization for Few-Shot Learning | Unknown | N/A | |
| A Bayesian Nonparametrics View into Deep Representations | Unknown | N/A | |
| Dynamic Regret of Convex and Smooth Functions | Unknown | N/A | |
| Unbalanced Sobolev Descent | Unknown | N/A | |
| Discriminative Sounding Objects Localization via Self-supervised Audiovisual Matching | Unknown | N/A | |
| Untangling tradeoffs between recurrence and self-attention in artificial neural networks | Unknown | N/A | |
| Exact Recovery of Mangled Clusters with Same-Cluster Queries | Unknown | N/A | |
| Woodbury Transformations for Deep Generative Flows | Unknown | N/A | |
| Hierarchical Patch VAE-GAN: Generating Diverse Videos from a Single Sample | Unknown | N/A | |
| Self-Supervised MultiModal Versatile Networks | Unknown | N/A | |
| Improving Sample Complexity Bounds for (Natural) Actor-Critic Algorithms | Unknown | N/A | |
| Neuron-level Structured Pruning using Polarization Regularizer | Unknown | N/A | |
| Bayesian Deep Learning and a Probabilistic Perspective of Generalization | Unknown | N/A | |
| Unsupervised object-centric video generation and decomposition in 3D | Unknown | N/A | |
| DeepI2I: Enabling Deep Hierarchical Image-to-Image Translation by Transferring from GANs | Unknown | N/A | |
| Natural Graph Networks | Unknown | N/A | |
| Causal Intervention for Weakly-Supervised Semantic Segmentation | Unknown | N/A | |
| Doubly Robust Off-Policy Value and Gradient Estimation for Deterministic Policies | Unknown | N/A | |
| Adversarial Sparse Transformer for Time Series Forecasting | Unknown | N/A | |
| Deep Structural Causal Models for Tractable Counterfactual Inference | Unknown | N/A | |
| Improved Algorithms for Convex-Concave Minimax Optimization | Unknown | N/A | |
| A Loss Function for Generative Neural Networks Based on Watson’s Perceptual Model | Unknown | N/A | |
| Efficient Model-Based Reinforcement Learning through Optimistic Policy Search and Planning | Unknown | N/A | |
| Neural Star Domain as Primitive Representation | Unknown | N/A | |
| Dirichlet Graph Variational Autoencoder | Unknown | N/A | |
| Parabolic Approximation Line Search for DNNs | Unknown | N/A | |
| Locally private non-asymptotic testing of discrete distributions is faster using interactive mechanisms | Unknown | N/A | |
| GAIT-prop: A biologically plausible learning rule derived from backpropagation of error | Unknown | N/A | |
| Lipschitz-Certifiable Training with a Tight Outer Bound | Unknown | N/A | |
| Adaptive Sampling for Stochastic Risk-Averse Learning | Unknown | N/A | |
| Learning with Optimized Random Features: Exponential Speedup by Quantum Machine Learning without Sparsity and Low-Rank Assumptions | Unknown | N/A | |
| Federated Bayesian Optimization via Thompson Sampling | Unknown | N/A | |
| Entropic Optimal Transport between Unbalanced Gaussian Measures has a Closed Form | Unknown | N/A | |
| Revisiting Parameter Sharing for Automatic Neural Channel Number Search | Unknown | N/A | |
| Self-Paced Deep Reinforcement Learning | Unknown | N/A | |
| An Efficient Asynchronous Method for Integrating Evolutionary and Gradient-based Policy Search | Unknown | N/A | |
| ExpandNets: Linear Over-parameterization to Train Compact Convolutional Networks | Unknown | N/A | |
| High-Fidelity Generative Image Compression | Unknown | N/A | |
| Online Influence Maximization under Linear Threshold Model | Unknown | N/A | |
| Impossibility Results for Grammar-Compressed Linear Algebra | Unknown | N/A | |
| Inverse Learning of Symmetries | Unknown | N/A | |
| Multi-task Causal Learning with Gaussian Processes | Unknown | N/A | |
| Finite Continuum-Armed Bandits | Unknown | N/A | |
| Beyond the Mean-Field: Structured Deep Gaussian Processes Improve the Predictive Uncertainties | Unknown | N/A | |
| Mixed Hamiltonian Monte Carlo for Mixed Discrete and Continuous Variables | Unknown | N/A | |
| MESA: Boost Ensemble Imbalanced Learning with MEta-SAmpler | Unknown | N/A | |
| Training Generative Adversarial Networks with Limited Data | Unknown | N/A | |
| Simulating a Primary Visual Cortex at the Front of CNNs Improves Robustness to Image Perturbations | Unknown | N/A | |
| Escaping Saddle-Point Faster under Interpolation-like Conditions | Unknown | N/A | |
| Self-Supervised Relational Reasoning for Representation Learning | Unknown | N/A | |
| Noise-Contrastive Estimation for Multivariate Point Processes | Unknown | N/A | |
| AutoBSS: An Efficient Algorithm for Block Stacking Style Search | Unknown | N/A | |
| Data Diversification: A Simple Strategy For Neural Machine Translation | Unknown | N/A | |
| Differentiable Expected Hypervolume Improvement for Parallel Multi-Objective Bayesian Optimization | Unknown | N/A | |
| POMO: Policy Optimization with Multiple Optima for Reinforcement Learning | Unknown | N/A | |
| Sub-linear Regret Bounds for Bayesian Optimisation in Unknown Search Spaces | Unknown | N/A | |
| Energy-based Out-of-distribution Detection | Unknown | N/A | |
| Long-Tailed Classification by Keeping the Good and Removing the Bad Momentum Causal Effect | Unknown | N/A | |
| Learning Diverse and Discriminative Representations via the Principle of Maximal Coding Rate Reduction | Unknown | N/A | |
| MiniLM: Deep Self-Attention Distillation for Task-Agnostic Compression of Pre-Trained Transformers | Unknown | N/A | |
| Self-Supervised Visual Representation Learning from Hierarchical Grouping | Unknown | N/A | |
| CircleGAN: Generative Adversarial Learning across Spherical Circles | Unknown | N/A | |
NIPS 2021
| Title | Author | PDF_Link | Code_URL |
|---|---|---|---|
| Learning Gaussian Mixtures with Generalized Linear Models: Precise Asymptotics in High-dimensions | Unknown | N/A | |
| Mitigating Covariate Shift in Imitation Learning via Offline Data With Partial Coverage | Unknown | N/A | |
| Flexible Option Learning | Unknown | N/A | |
| Landscape analysis of an improved power method for tensor decomposition | Unknown | N/A | |
| Explicit loss asymptotics in the gradient descent training of neural networks | Unknown | N/A | |
| COCO-LM: Correcting and Contrasting Text Sequences for Language Model Pretraining | Unknown | N/A | |
| Compositional Modeling of Nonlinear Dynamical Systems with ODE-based Random Features | Unknown | N/A | |
| $\alpha$-IoU: A Family of Power Intersection over Union Losses for Bounding Box Regression | Unknown | N/A | |
| A Minimalist Approach to Offline Reinforcement Learning | Unknown | N/A | |
| Time Discretization-Invariant Safe Action Repetition for Policy Gradient Methods | Unknown | N/A | |
| Conservative Data Sharing for Multi-Task Offline Reinforcement Learning | Unknown | N/A | |
| Decrypting Cryptic Crosswords: Semantically Complex Wordplay Puzzles as a Target for NLP | Unknown | N/A | |
| Spherical Motion Dynamics: Learning Dynamics of Normalized Neural Network using SGD and Weight Decay | Unknown | N/A | |
| A Gaussian Process-Bayesian Bernoulli Mixture Model for Multi-Label Active Learning | Unknown | N/A | |
| Exploring Social Posterior Collapse in Variational Autoencoder for Interaction Modeling | Unknown | N/A | |
| Sparse Training via Boosting Pruning Plasticity with Neuroregeneration | Unknown | N/A | |
| Large-Scale Unsupervised Object Discovery | Unknown | N/A | |
| Across-animal odor decoding by probabilistic manifold alignment | Unknown | N/A | |
| Score-based Generative Neural Networks for Large-Scale Optimal Transport | Unknown | N/A | |
| On Plasticity, Invariance, and Mutually Frozen Weights in Sequential Task Learning | Unknown | N/A | |
| Statistical Regeneration Guarantees of the Wasserstein Autoencoder with Latent Space Consistency | Unknown | N/A | |
| EIGNN: Efficient Infinite-Depth Graph Neural Networks | Unknown | N/A | |
| Unsupervised Noise Adaptive Speech Enhancement by Discriminator-Constrained Optimal Transport | Unknown | N/A | |
| Credal Self-Supervised Learning | Unknown | N/A | |
| Distributed Deep Learning In Open Collaborations | Unknown | N/A | |
| Skipping the Frame-Level: Event-Based Piano Transcription With Neural Semi-CRFs | Unknown | N/A | |
| Profiling Pareto Front With Multi-Objective Stein Variational Gradient Descent | Unknown | N/A | |
| Sequential Algorithms for Testing Closeness of Distributions | Unknown | N/A | |
| INDIGO: GNN-Based Inductive Knowledge Graph Completion Using Pair-Wise Encoding | Unknown | N/A | |
| Detecting Moments and Highlights in Videos via Natural Language Queries | Unknown | N/A | |
| Joint Inference for Neural Network Depth and Dropout Regularization | Unknown | N/A | |
| Lifelong Domain Adaptation via Consolidated Internal Distribution | Unknown | N/A | |
| Learning latent causal graphs via mixture oracles | Unknown | N/A | |
| Container: Context Aggregation Networks | Unknown | N/A | |
| Semialgebraic Representation of Monotone Deep Equilibrium Models and Applications to Certification | Unknown | N/A | |
| The Limitations of Large Width in Neural Networks: A Deep Gaussian Process Perspective | Unknown | N/A | |
| Temporally Abstract Partial Models | Unknown | N/A | |
| Near-Optimal Offline Reinforcement Learning via Double Variance Reduction | Unknown | N/A | |
| FL-WBC: Enhancing Robustness against Model Poisoning Attacks in Federated Learning from a Client Perspective | Unknown | N/A | |
| One Explanation is Not Enough: Structured Attention Graphs for Image Classification | Unknown | N/A | |
| Overinterpretation reveals image classification model pathologies | Unknown | N/A | |
| Good Classification Measures and How to Find Them | Unknown | N/A | |
| BNS: Building Network Structures Dynamically for Continual Learning | Unknown | N/A | |
| Cockpit: A Practical Debugging Tool for the Training of Deep Neural Networks | Unknown | N/A | |
| TRS: Transferability Reduced Ensemble via Promoting Gradient Diversity and Model Smoothness | Unknown | N/A | |
| Automorphic Equivalence-aware Graph Neural Network | Unknown | N/A | |
| Direct Multi-view Multi-person 3D Pose Estimation | Unknown | N/A | |
| Learnability of Linear Thresholds from Label Proportions | Unknown | N/A | |
| Regret Bounds for Gaussian-Process Optimization in Large Domains | Unknown | N/A | |
| On Episodes, Prototypical Networks, and Few-Shot Learning | Unknown | N/A | |
| Trustworthy Multimodal Regression with Mixture of Normal-inverse Gamma Distributions | Unknown | N/A | |
| CATs: Cost Aggregation Transformers for Visual Correspondence | Unknown | N/A | |
| Efficient Training of Retrieval Models using Negative Cache | Unknown | N/A | |
| Differentiable Multiple Shooting Layers | Unknown | N/A | |
| Deep Explicit Duration Switching Models for Time Series | Unknown | N/A | |
| Offline RL Without Off-Policy Evaluation | Unknown | N/A | |
| A Provably Efficient Model-Free Posterior Sampling Method for Episodic Reinforcement Learning | Unknown | N/A | |
| Towards Open-World Feature Extrapolation: An Inductive Graph Learning Approach | Unknown | N/A | |
| Validation Free and Replication Robust Volume-based Data Valuation | Unknown | N/A | |
| Graph Neural Networks with Adaptive Residual | Unknown | N/A | |
| Efficient Combination of Rematerialization and Offloading for Training DNNs | Unknown | N/A | |
| Conservative Offline Distributional Reinforcement Learning | Unknown | N/A | |
| On Model Calibration for Long-Tailed Object Detection and Instance Segmentation | Unknown | N/A | |
| Generative Occupancy Fields for 3D Surface-Aware Image Synthesis | Unknown | N/A | |
| TNASP: A Transformer-based NAS Predictor with a Self-evolution Framework | Unknown | N/A | |
| Learning Generative Vision Transformer with Energy-Based Latent Space for Saliency Prediction | Unknown | N/A | |
| Learning Compact Representations of Neural Networks using DiscriminAtive Masking (DAM) | Unknown | N/A | |
| Influence Patterns for Explaining Information Flow in BERT | Unknown | N/A | |
| Towards mental time travel: a hierarchical memory for reinforcement learning agents | Unknown | N/A | |
| Explaining heterogeneity in medial entorhinal cortex with task-driven neural networks | Unknown | N/A | |
| Provable Guarantees for Self-Supervised Deep Learning with Spectral Contrastive Loss | Unknown | N/A | |
| Online Knapsack with Frequency Predictions | Unknown | N/A | |
| Neural Regression, Representational Similarity, Model Zoology & Neural Taskonomy at Scale in Rodent Visual Cortex | Unknown | N/A | |
| The future is log-Gaussian: ResNets and their infinite-depth-and-width limit at initialization | Unknown | N/A | |
| A Mathematical Framework for Quantifying Transferability in Multi-source Transfer Learning | Unknown | N/A | |
| Panoptic 3D Scene Reconstruction From a Single RGB Image | Unknown | N/A | |
| PTR: A Benchmark for Part-based Conceptual, Relational, and Physical Reasoning | Unknown | N/A | |
| Improving Coherence and Consistency in Neural Sequence Models with Dual-System, Neuro-Symbolic Reasoning | Unknown | N/A | |
| 3DP3: 3D Scene Perception via Probabilistic Programming | Unknown | N/A | |
| Learning with Holographic Reduced Representations | Unknown | N/A | |
| Convex Polytope Trees | Unknown | N/A | |
| You Are the Best Reviewer of Your Own Papers: An Owner-Assisted Scoring Mechanism | Unknown | N/A | |
| Online Control of Unknown Time-Varying Dynamical Systems | Unknown | N/A | |
| Dynamic Visual Reasoning by Learning Differentiable Physics Models from Video and Language | Unknown | N/A | |
| Counterbalancing Learning and Strategic Incentives in Allocation Markets | Unknown | N/A | |
| Low-dimensional Structure in the Space of Language Representations is Reflected in Brain Responses | Unknown | N/A | |
| Agnostic Reinforcement Learning with Low-Rank MDPs and Rich Observations | Unknown | N/A | |
| Improving Computational Efficiency in Visual Reinforcement Learning via Stored Embeddings | Unknown | N/A | |
| Reinforcement Learning with Latent Flow | Unknown | N/A | |
| Behavior From the Void: Unsupervised Active Pre-Training | Unknown | N/A | |
| Uniform Convergence of Interpolators: Gaussian Width, Norm Bounds and Benign Overfitting | Unknown | N/A | |
| Probing Inter-modality: Visual Parsing with Self-Attention for Vision-and-Language Pre-training | Unknown | N/A | |
| Towards Gradient-based Bilevel Optimization with Non-convex Followers and Beyond | Unknown | N/A | |
| Information is Power: Intrinsic Control via Information Capture | Unknown | N/A | |
| CHIP: CHannel Independence-based Pruning for Compact Neural Networks | Unknown | N/A | |
| Improving Generalization in Meta-RL with Imaginary Tasks from Latent Dynamics Mixture | Unknown | N/A | |
| Instance-optimal Mean Estimation Under Differential Privacy | Unknown | N/A | |
| Weak-shot Fine-grained Classification via Similarity Transfer | Unknown | N/A | |
| A Continuous Mapping For Augmentation Design | Unknown | N/A | |
| Towards robust vision by multi-task learning on monkey visual cortex | Unknown | N/A | |
| SE(3)-equivariant prediction of molecular wavefunctions and electronic densities | Unknown | N/A | |
| Analysis of one-hidden-layer neural networks via the resolvent method | Unknown | N/A | |
| A Probabilistic State Space Model for Joint Inference from Differential Equations and Data | Unknown | N/A | |
| Differentiable Simulation of Soft Multi-body Systems | Unknown | N/A | |
| Hierarchical Skills for Efficient Exploration | Unknown | N/A | |
| Sample-Efficient Learning of Stackelberg Equilibria in General-Sum Games | Unknown | N/A | |
| Improved Transformer for High-Resolution GANs | Unknown | N/A | |
| Stabilizing Deep Q-Learning with ConvNets and Vision Transformers under Data Augmentation | Unknown | N/A | |
| Tractable Regularization of Probabilistic Circuits | Unknown | N/A | |
| Batch Multi-Fidelity Bayesian Optimization with Deep Auto-Regressive Networks | Unknown | N/A | |
| Graph Posterior Network: Bayesian Predictive Uncertainty for Node Classification | Unknown | N/A | |
| Heterogeneous Multi-player Multi-armed Bandits: Closing the Gap and Generalization | Unknown | N/A | |
| Sim and Real: Better Together | Unknown | N/A | |
| Matrix factorisation and the interpretation of geodesic distance | Unknown | N/A | |
| Marginalised Gaussian Processes with Nested Sampling | Unknown | N/A | |
| Grounding Spatio-Temporal Language with Transformers | Unknown | N/A | |
| K-Net: Towards Unified Image Segmentation | Unknown | N/A | |
| Neural Algorithmic Reasoners are Implicit Planners | Unknown | N/A | |
| Active 3D Shape Reconstruction from Vision and Touch | Unknown | N/A | |
| Overcoming the curse of dimensionality with Laplacian regularization in semi-supervised learning | Unknown | N/A | |
| Adversarial Attacks on Black Box Video Classifiers: Leveraging the Power of Geometric Transformations | Unknown | N/A | |
| A self consistent theory of Gaussian Processes captures feature learning effects in finite CNNs | Unknown | N/A | |
| Fast Approximate Dynamic Programming for Infinite-Horizon Markov Decision Processes | Unknown | N/A | |
| Speedy Performance Estimation for Neural Architecture Search | Unknown | N/A | |
| Scalable Thompson Sampling using Sparse Gaussian Process Models | Unknown | N/A | |
| Reliable Decisions with Threshold Calibration | Unknown | N/A | |
| MetaAvatar: Learning Animatable Clothed Human Models from Few Depth Images | Unknown | N/A | |
| Learning Large Neighborhood Search Policy for Integer Programming | Unknown | N/A | |
| Corruption Robust Active Learning | Unknown | N/A | |
| A Critical Look at the Consistency of Causal Estimation with Deep Latent Variable Models | Unknown | N/A | |
| Non-local Latent Relation Distillation for Self-Adaptive 3D Human Pose Estimation | Unknown | N/A | |
| Piper: Multidimensional Planner for DNN Parallelization | Unknown | N/A | |
| Post-Contextual-Bandit Inference | Unknown | N/A | |
| CrypTen: Secure Multi-Party Computation Meets Machine Learning | Unknown | N/A | |
| Continuous Mean-Covariance Bandits | Unknown | N/A | |
| Controlling Neural Networks with Rule Representations | Unknown | N/A | |
| Compressing Neural Networks: Towards Determining the Optimal Layer-wise Decomposition | Unknown | N/A | |
| Hierarchical Clustering: $O(1)$-Approximation for Well-Clustered Graphs | Unknown | N/A | |
| Efficient constrained sampling via the mirror-Langevin algorithm | Unknown | N/A | |
| Towards Robust Bisimulation Metric Learning | Unknown | N/A | |
| Amortized Variational Inference for Simple Hierarchical Models | Unknown | N/A | |
| Repulsive Deep Ensembles are Bayesian | Unknown | N/A | |
| Algorithmic stability and generalization of an unsupervised feature selection algorithm | Unknown | N/A | |
| LSH-SMILE: Locality Sensitive Hashing Accelerated Simulation and Learning | Unknown | N/A | |
| Towards Better Understanding of Training Certifiably Robust Models against Adversarial Examples | Unknown | N/A | |
| Learning MDPs from Features: Predict-Then-Optimize for Sequential Decision Making by Reinforcement Learning | Unknown | N/A | |
| RLlib Flow: Distributed Reinforcement Learning is a Dataflow Problem | Unknown | N/A | |
| AC-GC: Lossy Activation Compression with Guaranteed Convergence | Unknown | N/A | |
| Near-Optimal No-Regret Learning in General Games | Unknown | N/A | |
| It Has Potential: Gradient-Driven Denoisers for Convergent Solutions to Inverse Problems | Unknown | N/A | |
| Shift Invariance Can Reduce Adversarial Robustness | Unknown | N/A | |
| DNN-based Topology Optimisation: Spatial Invariance and Neural Tangent Kernel | Unknown | N/A | |
| Neural Production Systems | Unknown | N/A | |
| Neural Active Learning with Performance Guarantees | Unknown | N/A | |
| Equivariant Manifold Flows | Unknown | N/A | |
| Reinforcement Learning in Newcomblike Environments | Unknown | N/A | |
| Disrupting Deep Uncertainty Estimation Without Harming Accuracy | Unknown | N/A | |
| Fairness in Ranking under Uncertainty | Unknown | N/A | |
| Identifiable Generative models for Missing Not at Random Data Imputation | Unknown | N/A | |
| Multi-view Contrastive Graph Clustering | Unknown | N/A | |
| Unifying lower bounds on prediction dimension of convex surrogates | Unknown | N/A | |
| Learning Stochastic Majority Votes by Minimizing a PAC-Bayes Generalization Bound | Unknown | N/A | |
| Misspecified Gaussian Process Bandit Optimization | Unknown | N/A | |
| Reliable Estimation of KL Divergence using a Discriminator in Reproducing Kernel Hilbert Space | Unknown | N/A | |
| DeepGEM: Generalized Expectation-Maximization for Blind Inversion | Unknown | N/A | |
| Parameter-free HE-friendly Logistic Regression | Unknown | N/A | |
| Imitation with Neural Density Models | Unknown | N/A | |
| SubTab: Subsetting Features of Tabular Data for Self-Supervised Representation Learning | Unknown | N/A | |
| Fair Exploration via Axiomatic Bargaining | Unknown | N/A | |
| No Regrets for Learning the Prior in Bandits | Unknown | N/A | |
| Privately Publishable Per-instance Privacy | Unknown | N/A | |
| Learning the optimal Tikhonov regularizer for inverse problems | Unknown | N/A | |
| Slow Learning and Fast Inference: Efficient Graph Similarity Computation via Knowledge Distillation | Unknown | N/A | |
| Fast Algorithms for $L_\infty$-constrained S-rectangular Robust MDPs | Unknown | N/A | |
| A Trainable Spectral-Spatial Sparse Coding Model for Hyperspectral Image Restoration | Unknown | N/A | |
| Weighted model estimation for offline model-based reinforcement learning | Unknown | N/A | |
| Bellman-consistent Pessimism for Offline Reinforcement Learning | Unknown | N/A | |
| Settling the Variance of Multi-Agent Policy Gradients | Unknown | N/A | |
| Cortico-cerebellar networks as decoupling neural interfaces | Unknown | N/A | |
| Recursive Causal Structure Learning in the Presence of Latent Variables and Selection Bias | Unknown | N/A | |
| A flow-based latent state generative model of neural population responses to natural images | Unknown | N/A | |
| Monte Carlo Tree Search With Iteratively Refining State Abstractions | Unknown | N/A | |
| Editing a classifier by rewriting its prediction rules | Unknown | N/A | |
| Training Neural Networks with Fixed Sparse Masks | Unknown | N/A | |
| Generalization Bounds For Meta-Learning: An Information-Theoretic Analysis | Unknown | N/A | |
| ByPE-VAE: Bayesian Pseudocoresets Exemplar VAE | Unknown | N/A | |
| Particle Cloud Generation with Message Passing Generative Adversarial Networks | Unknown | N/A | |
| Learning to Execute: Efficient Learning of Universal Plan-Conditioned Policies in Robotics | Unknown | N/A | |
| PSD Representations for Effective Probability Models | Unknown | N/A | |
| Breaking the Moments Condition Barrier: No-Regret Algorithm for Bandits with Super Heavy-Tailed Payoffs | Unknown | N/A | |
| Memory-efficient Patch-based Inference for Tiny Deep Learning | Unknown | N/A | |
| Boost Neural Networks by Checkpoints | Unknown | N/A | |
| Explainable Semantic Space by Grounding Language to Vision with Cross-Modal Contrastive Learning | Unknown | N/A | |
| SOAT: A Scene- and Object-Aware Transformer for Vision-and-Language Navigation | Unknown | N/A | |
| Learning 3D Dense Correspondence via Canonical Point Autoencoder | Unknown | N/A | |
| On the interplay between data structure and loss function in classification problems | Unknown | N/A | |
| Robust Compressed Sensing MRI with Deep Generative Priors | Unknown | N/A | |
| Scalable Intervention Target Estimation in Linear Models | Unknown | N/A | |
| Hierarchical Reinforcement Learning with Timed Subgoals | Unknown | N/A | |
| Selective Sampling for Online Best-arm Identification | Unknown | N/A | |
| Excess Capacity and Backdoor Poisoning | Unknown | N/A | |
| Multimodal and Multilingual Embeddings for Large-Scale Speech Mining | Unknown | N/A | |
| Learning Frequency Domain Approximation for Binary Neural Networks | Unknown | N/A | |
| Clustering Effect of Adversarial Robust Models | Unknown | N/A | |
| Flow Network based Generative Models for Non-Iterative Diverse Candidate Generation | Unknown | N/A | |
| Beyond Pinball Loss: Quantile Methods for Calibrated Uncertainty Quantification | Unknown | N/A | |
| Functional Neural Networks for Parametric Image Restoration Problems | Unknown | N/A | |
| Delayed Gradient Averaging: Tolerate the Communication Latency for Federated Learning | Unknown | N/A | |
| Understanding the Generalization Benefit of Model Invariance from a Data Perspective | Unknown | N/A | |
| Predicting Deep Neural Network Generalization with Perturbation Response Curves | Unknown | N/A | |
| Escape saddle points by a simple gradient-descent based algorithm | Unknown | N/A | |
| Your head is there to move you around: Goal-driven models of the primate dorsal pathway | Unknown | N/A | |
| SUPER-ADAM: Faster and Universal Framework of Adaptive Gradients | Unknown | N/A | |
| SGD: The Role of Implicit Regularization, Batch-size and Multiple-epochs | Unknown | N/A | |
| Decentralized Q-learning in Zero-sum Markov Games | Unknown | N/A | |
| Hit and Lead Discovery with Explorative RL and Fragment-based Molecule Generation | Unknown | N/A | |
| Channel Permutations for N:M Sparsity | Unknown | N/A | |
| Dynamic influence maximization | Unknown | N/A | |
| Closing the loop in medical decision support by understanding clinical decision-making: A case study on organ transplantation | Unknown | N/A | |
| Human-Adversarial Visual Question Answering | Unknown | N/A | |
| Non-approximate Inference for Collective Graphical Models on Path Graphs via Discrete Difference of Convex Algorithm | Unknown | N/A | |
| What Matters for Adversarial Imitation Learning? | Unknown | N/A | |
| Accumulative Poisoning Attacks on Real-time Data | Unknown | N/A | |
| IQ-Learn: Inverse soft-Q Learning for Imitation | Unknown | N/A | |
| ParK: Sound and Efficient Kernel Ridge Regression by Feature Space Partitions | Unknown | N/A | |
| MarioNette: Self-Supervised Sprite Learning | Unknown | N/A | |
| Regulating algorithmic filtering on social media | Unknown | N/A | |
| Visualizing the Emergence of Intermediate Visual Patterns in DNNs | Unknown | N/A | |
| CBP: backpropagation with constraint on weight precision using a pseudo-Lagrange multiplier method | Unknown | N/A | |
| Bridging the Gap Between Practice and PAC-Bayes Theory in Few-Shot Meta-Learning | Unknown | N/A | |
| On The Structure of Parametric Tournaments with Application to Ranking from Pairwise Comparisons | Unknown | N/A | |
| A Geometric Perspective towards Neural Calibration via Sensitivity Decomposition | Unknown | N/A | |
| Improving Transferability of Representations via Augmentation-Aware Self-Supervision | Unknown | N/A | |
| Policy Learning Using Weak Supervision | Unknown | N/A | |
| Learning on Random Balls is Sufficient for Estimating (Some) Graph Parameters | Unknown | N/A | |
| Passive attention in artificial neural networks predicts human visual selectivity | Unknown | N/A | |
| Do Vision Transformers See Like Convolutional Neural Networks? | Unknown | N/A | |
| Information-constrained optimization: can adaptive processing of gradients help? | Unknown | N/A | |
| Fast Tucker Rank Reduction for Non-Negative Tensors Using Mean-Field Approximation | Unknown | N/A | |
| Learning Hard Optimization Problems: A Data Generation Perspective | Unknown | N/A | |
| Similarity and Matching of Neural Network Representations | Unknown | N/A | |
| NEO: Non Equilibrium Sampling on the Orbits of a Deterministic Transform | Unknown | N/A | |
| Neural-PIL: Neural Pre-Integrated Lighting for Reflectance Decomposition | Unknown | N/A | |
| Scaling up Continuous-Time Markov Chains Helps Resolve Underspecification | Unknown | N/A | |
| Noisy Recurrent Neural Networks | Unknown | N/A | |
| Multi-modal Dependency Tree for Video Captioning | Unknown | N/A | |
| Deep Reinforcement Learning at the Edge of the Statistical Precipice | Unknown | N/A | |
| Shaping embodied agent behavior with activity-context priors from egocentric video | Unknown | N/A | |
| Representation Learning Beyond Linear Prediction Functions | Unknown | N/A | |
| How Modular should Neural Module Networks Be for Systematic Generalization? | Unknown | N/A | |
| PolarStream: Streaming Object Detection and Segmentation with Polar Pillars | Unknown | N/A | |
| SSMF: Shifting Seasonal Matrix Factorization | Unknown | N/A | |
| Average-Reward Learning and Planning with Options | Unknown | N/A | |
| Nonsmooth Implicit Differentiation for Machine-Learning and Optimization | Unknown | N/A | |
| Numerical influence of ReLU’(0) on backpropagation | Unknown | N/A | |
| Optimal Gradient-based Algorithms for Non-concave Bandit Optimization | Unknown | N/A | |
| Towards Sample-Optimal Compressive Phase Retrieval with Sparse and Generative Priors | Unknown | N/A | |
| PettingZoo: Gym for Multi-Agent Reinforcement Learning | Unknown | N/A | |
| HNPE: Leveraging Global Parameters for Neural Posterior Estimation | Unknown | N/A | |
| Partition-Based Formulations for Mixed-Integer Optimization of Trained ReLU Neural Networks | Unknown | N/A | |
| Gradient-based Hyperparameter Optimization Over Long Horizons | Unknown | N/A | |
| On Pathologies in KL-Regularized Reinforcement Learning from Expert Demonstrations | Unknown | N/A | |
| Multimodal Virtual Point 3D Detection | Unknown | N/A | |
| VidLanKD: Improving Language Understanding via Video-Distilled Knowledge Transfer | Unknown | N/A | |
| Convergence and Alignment of Gradient Descent with Random Backpropagation Weights | Unknown | N/A | |
| Implicit Task-Driven Probability Discrepancy Measure for Unsupervised Domain Adaptation | Unknown | N/A | |
| Referring Transformer: A One-step Approach to Multi-task Visual Grounding | Unknown | N/A | |
| Learning Diverse Policies in MOBA Games via Macro-Goals | Unknown | N/A | |
| Deformable Butterfly: A Highly Structured and Sparse Linear Transform | Unknown | N/A | |
| MAP Propagation Algorithm: Faster Learning with a Team of Reinforcement Learning Agents | Unknown | N/A | |
| An Analysis of Constant Step Size SGD in the Non-convex Regime: Asymptotic Normality and Bias | Unknown | N/A | |
| Understanding the Effect of Stochasticity in Policy Optimization | Unknown | N/A | |
| Deep Learning on a Data Diet: Finding Important Examples Early in Training | Unknown | N/A | |
| Reducing Information Bottleneck for Weakly Supervised Semantic Segmentation | Unknown | N/A | |
| Bayesian Optimization of Function Networks | Unknown | N/A | |
| Prior-independent Dynamic Auctions for a Value-maximizing Buyer | Unknown | N/A | |
| Constrained Robust Submodular Partitioning | Unknown | N/A | |
| Iterative Connecting Probability Estimation for Networks | Unknown | N/A | |
| Forster Decomposition and Learning Halfspaces with Noise | Unknown | N/A | |
| On Joint Learning for Solving Placement and Routing in Chip Design | Unknown | N/A | |
| End-to-End Weak Supervision | Unknown | N/A | |
| A Theory-Driven Self-Labeling Refinement Method for Contrastive Representation Learning | Unknown | N/A | |
| Characterizing the risk of fairwashing | Unknown | N/A | |
| Searching the Search Space of Vision Transformer | Unknown | N/A | |
| On Learning Domain-Invariant Representations for Transfer Learning with Multiple Sources | Unknown | N/A | |
| Perceptual Score: What Data Modalities Does Your Model Perceive? | Unknown | N/A | |
| On UMAP's True Loss Function | Unknown | N/A | |
| Stable, Fast and Accurate: Kernelized Attention with Relative Positional Encoding | Unknown | N/A | |
| Adder Attention for Vision Transformer | Unknown | N/A | |
| Provably Efficient Black-Box Action Poisoning Attacks Against Reinforcement Learning | Unknown | N/A | |
| COMBO: Conservative Offline Model-Based Policy Optimization | Unknown | N/A | |
| Learning to Schedule Heuristics in Branch and Bound | Unknown | N/A | |
| TAAC: Temporally Abstract Actor-Critic for Continuous Control | Unknown | N/A | |
| Spectrum-to-Kernel Translation for Accurate Blind Image Super-Resolution | Unknown | N/A | |
| Unintended Selection: Persistent Qualification Rate Disparities and Interventions | Unknown | N/A | |
| Stable Neural ODE with Lyapunov-Stable Equilibrium Points for Defending Against Adversarial Attacks | Unknown | N/A | |
| Unifying Gradient Estimators for Meta-Reinforcement Learning via Off-Policy Evaluation | Unknown | N/A | |
| Open-set Label Noise Can Improve Robustness Against Inherent Label Noise | Unknown | N/A | |
| Self-Supervised Learning of Event-Based Optical Flow with Spiking Neural Networks | Unknown | N/A | |
| Deeply Shared Filter Bases for Parameter-Efficient Convolutional Neural Networks | Unknown | N/A | |
| Doubly Robust Thompson Sampling with Linear Payoffs | Unknown | N/A | |
| Only Train Once: A One-Shot Neural Network Training And Pruning Framework | Unknown | N/A | |
| Fairness via Representation Neutralization | Unknown | N/A | |
| A No-go Theorem for Robust Acceleration in the Hyperbolic Plane | Unknown | N/A | |
| Probabilistic Transformer For Time Series Analysis | Unknown | N/A | |
| Learning Robust Hierarchical Patterns of Human Brain across Many fMRI Studies | Unknown | N/A | |
| Self-Adaptable Point Processes with Nonparametric Time Decays | Unknown | N/A | |
| RED: Looking for Redundancies for Data-Free Structured Compression of Deep Neural Networks | Unknown | N/A | |
| Adversarially Robust Change Point Detection | Unknown | N/A | |
| The Limits of Optimal Pricing in the Dark | Unknown | N/A | |
| Making the most of your day: online learning for optimal allocation of time | Unknown | N/A | |
| Visual Search Asymmetry: Deep Nets and Humans Share Similar Inherent Biases | Unknown | N/A | |
| Invertible DenseNets with Concatenated LipSwish | Unknown | N/A | |
| Pareto-Optimal Learning-Augmented Algorithms for Online Conversion Problems | Unknown | N/A | |
| Non-asymptotic Error Bounds for Bidirectional GANs | Unknown | N/A | |
| Iterative Teacher-Aware Learning | Unknown | N/A | |
| Stochastic $L^\natural$-convex Function Minimization | Unknown | N/A | |
| BatchQuant: Quantized-for-all Architecture Search with Robust Quantizer | Unknown | N/A | |
| Post-Training Sparsity-Aware Quantization | Unknown | N/A | |
| Accurately Solving Rod Dynamics with Graph Learning | Unknown | N/A | |
| Online and Offline Reinforcement Learning by Planning with a Learned Model | Unknown | N/A | |
| Symplectic Adjoint Method for Exact Gradient of Neural ODE with Minimal Memory | Unknown | N/A | |
| Sample-Efficient Reinforcement Learning Is Feasible for Linearly Realizable MDPs with Limited Revisiting | Unknown | N/A | |
| Greedy and Random Quasi-Newton Methods with Faster Explicit Superlinear Convergence | Unknown | N/A | |
| Heavy Ball Momentum for Conditional Gradient | Unknown | N/A | |
| On the Out-of-distribution Generalization of Probabilistic Image Modelling | Unknown | N/A | |
| Neural Architecture Dilation for Adversarial Robustness | Unknown | N/A | |
| Even your Teacher Needs Guidance: Ground-Truth Targets Dampen Regularization Imposed by Self-Distillation | Unknown | N/A | |
| Revisiting Smoothed Online Learning | Unknown | N/A | |
| Learning interaction rules from multi-animal trajectories via augmented behavioral models | Unknown | N/A | |
| A Constant Approximation Algorithm for Sequential Random-Order No-Substitution k-Median Clustering | Unknown | N/A | |
| Proxy Convexity: A Unified Framework for the Analysis of Neural Networks Trained by Gradient Descent | Unknown | N/A | |
| Dynamic Sasvi: Strong Safe Screening for Norm-Regularized Least Squares | Unknown | N/A | |
| Scalable Inference of Sparsely-changing Gaussian Markov Random Fields | Unknown | N/A | |
| Capturing implicit hierarchical structure in 3D biomedical images with self-supervised hyperbolic representations | Unknown | N/A | |
| Adaptive Risk Minimization: Learning to Adapt to Domain Shift | Unknown | N/A | |
| Analyzing the Confidentiality of Undistillable Teachers in Knowledge Distillation | Unknown | N/A | |
| Scalable Rule-Based Representation Learning for Interpretable Classification | Unknown | N/A | |
| Parallel and Efficient Hierarchical k-Median Clustering | Unknown | N/A | |
| Training Feedback Spiking Neural Networks by Implicit Differentiation on the Equilibrium State | Unknown | N/A | |
| Conic Blackwell Algorithm: Parameter-Free Convex-Concave Saddle-Point Solving | Unknown | N/A | |
| Robust Allocations with Diversity Constraints | Unknown | N/A | |
| Optimality and Stability in Federated Learning: A Game-theoretic Approach | Unknown | N/A | |
| Dynamic Causal Bayesian Optimization | Unknown | N/A | |
| An Efficient Pessimistic-Optimistic Algorithm for Stochastic Linear Bandits with General Constraints | Unknown | N/A | |
| Robustifying Algorithms of Learning Latent Trees with Vector Variables | Unknown | N/A | |
| Generalization Guarantee of SGD for Pairwise Learning | Unknown | N/A | |
| Universal Off-Policy Evaluation | Unknown | N/A | |
| Calibration and Consistency of Adversarial Surrogate Losses | Unknown | N/A | |
| On the Convergence of Step Decay Step-Size for Stochastic Optimization | Unknown | N/A | |
| Unsupervised Speech Recognition | Unknown | N/A | |
| TransMIL: Transformer based Correlated Multiple Instance Learning for Whole Slide Image Classification | Unknown | N/A | |
| DOBF: A Deobfuscation Pre-Training Objective for Programming Languages | Unknown | N/A | |
| On the Expected Complexity of Maxout Networks | Unknown | N/A | |
| Tactical Optimism and Pessimism for Deep Reinforcement Learning | Unknown | N/A | |
| An online passive-aggressive algorithm for difference-of-squares classification | Unknown | N/A | |
| Learning State Representations from Random Deep Action-conditional Predictions | Unknown | N/A | |
| Exact Privacy Guarantees for Markov Chain Implementations of the Exponential Mechanism with Artificial Atoms | Unknown | N/A | |
| Relative stability toward diffeomorphisms indicates performance in deep nets | Unknown | N/A | |
| Sparse Uncertainty Representation in Deep Learning with Inducing Weights | Unknown | N/A | |
| Online Active Learning with Surrogate Loss Functions | Unknown | N/A | |
| Reverse engineering learned optimizers reveals known and novel mechanisms | Unknown | N/A | |
| A Near-Optimal Algorithm for Debiasing Trained Machine Learning Models | Unknown | N/A | |
| Efficient and Accurate Gradients for Neural SDEs | Unknown | N/A | |
| Dynamics of Stochastic Momentum Methods on Large-scale, Quadratic Models | Unknown | N/A | |
| Sparse Spiking Gradient Descent | Unknown | N/A | |
| Convergence Rates of Stochastic Gradient Descent under Infinite Noise Variance | Unknown | N/A | |
| Few-Shot Data-Driven Algorithms for Low Rank Approximation | Unknown | N/A | |
| Privately Learning Mixtures of Axis-Aligned Gaussians | Unknown | N/A | |
| Hash Layers For Large Sparse Models | Unknown | N/A | |
| List-Decodable Mean Estimation in Nearly-PCA Time | Unknown | N/A | |
| Automatic Unsupervised Outlier Model Selection | Unknown | N/A | |
| Why Lottery Ticket Wins? A Theoretical Perspective of Sample Complexity on Sparse Neural Networks | Unknown | N/A | |
| Ising Model Selection Using $\ell_{1}$-Regularized Linear Regression: A Statistical Mechanics Analysis | Unknown | N/A | |
| Dynamic Bottleneck for Robust Self-Supervised Exploration | Unknown | N/A | |
| USCO-Solver: Solving Undetermined Stochastic Combinatorial Optimization Problems | Unknown | N/A | |
| Dynaboard: An Evaluation-As-A-Service Platform for Holistic Next-Generation Benchmarking | Unknown | N/A | |
| Understanding Interlocking Dynamics of Cooperative Rationalization | Unknown | N/A | |
| Differentiable Equilibrium Computation with Decision Diagrams for Stackelberg Models of Combinatorial Congestion Games | Unknown | N/A | |
| Understanding Adaptive, Multiscale Temporal Integration In Deep Speech Recognition Systems | Unknown | N/A | |
| Beyond Value-Function Gaps: Improved Instance-Dependent Regret Bounds for Episodic Reinforcement Learning | Unknown | N/A | |
| Online Market Equilibrium with Application to Fair Division | Unknown | N/A | |
| Process for Adapting Language Models to Society (PALMS) with Values-Targeted Datasets | Unknown | N/A | |
| Bias and variance of the Bayesian-mean decoder | Unknown | N/A | |
| MLP-Mixer: An all-MLP Architecture for Vision | Unknown | N/A | |
| Learning Knowledge Graph-based World Models of Textual Environments | Unknown | N/A | |
| Bridging Non Co-occurrence with Unlabeled In-the-wild Data for Incremental Object Detection | Unknown | N/A | |
| Refined Learning Bounds for Kernel and Approximate $k$-Means | Unknown | N/A | |
| Unleashing the Power of Contrastive Self-Supervised Visual Models via Contrast-Regularized Fine-Tuning | Unknown | N/A | |
| Faster Directional Convergence of Linear Neural Networks under Spherically Symmetric Data | Unknown | N/A | |
| An Exponential Lower Bound for Linearly Realizable MDP with Constant Suboptimality Gap | Unknown | N/A | |
| Coresets for Classification – Simplified and Strengthened | Unknown | N/A | |
| Collaborative Learning in the Jungle (Decentralized, Byzantine, Heterogeneous, Asynchronous and Nonconvex Learning) | Unknown | N/A | |
| Submodular + Concave | Unknown | N/A | |
| Understanding Partial Multi-Label Learning via Mutual Information | Unknown | N/A | |
| Self-Supervised Learning with Data Augmentations Provably Isolates Content from Style | Unknown | N/A | |
| Improved Coresets and Sublinear Algorithms for Power Means in Euclidean Spaces | Unknown | N/A | |
| Multi-Armed Bandits with Bounded Arm-Memory: Near-Optimal Guarantees for Best-Arm Identification and Regret Minimization | Unknown | N/A | |
| Revisiting Contrastive Methods for Unsupervised Learning of Visual Representations | Unknown | N/A | |
| Vector-valued Distance and Gyrocalculus on the Space of Symmetric Positive Definite Matrices | Unknown | N/A | |
| Self-Instantiated Recurrent Units with Dynamic Soft Recursion | Unknown | N/A | |
| Near-Optimal Lower Bounds For Convex Optimization For All Orders of Smoothness | Unknown | N/A | |
| Improved Learning Rates of a Functional Lasso-type SVM with Sparse Multi-Kernel Representation | Unknown | N/A | |
| Exploring Architectural Ingredients of Adversarially Robust Deep Neural Networks | Unknown | N/A | |
| A Hierarchical Reinforcement Learning Based Optimization Framework for Large-scale Dynamic Pickup and Delivery Problems | Unknown | N/A | |
| Moiré Attack (MA): A New Potential Risk of Screen Photos | Unknown | N/A | |
| Towards Understanding Why Lookahead Generalizes Better Than SGD and Beyond | Unknown | N/A | |
| FjORD: Fair and Accurate Federated Learning under heterogeneous targets with Ordered Dropout | Unknown | N/A | |
| Replacing Rewards with Examples: Example-Based Policy Search via Recursive Classification | Unknown | N/A | |
| An Empirical Study of Adder Neural Networks for Object Detection | Unknown | N/A | |
| Medical Dead-ends and Learning to Identify High-Risk States and Treatments | Unknown | N/A | |
| Low-Rank Extragradient Method for Nonsmooth and Low-Rank Matrix Optimization Problems | Unknown | N/A | |
| Do Neural Optimal Transport Solvers Work? A Continuous Wasserstein-2 Benchmark | Unknown | N/A | |
| Local plasticity rules can learn deep representations using self-supervised contrastive predictions | Unknown | N/A | |
| Approximating the Permanent with Deep Rejection Sampling | Unknown | N/A | |
| Exploiting Data Sparsity in Secure Cross-Platform Social Recommendation | Unknown | N/A | |
| How Data Augmentation affects Optimization for Linear Regression | Unknown | N/A | |
| Spot the Difference: Detection of Topological Changes via Geometric Alignment | Unknown | N/A | |
| Deep Extended Hazard Models for Survival Analysis | Unknown | N/A | |
| Scaling Gaussian Processes with Derivative Information Using Variational Inference | Unknown | N/A | |
| On the Expressivity of Markov Reward | Unknown | N/A | |
| Credit Assignment in Neural Networks through Deep Feedback Control | Unknown | N/A | |
| KS-GNN: Keywords Search over Incomplete Graphs via Graphs Neural Network | Unknown | N/A | |
| A novel notion of barycenter for probability distributions based on optimal weak mass transport | Unknown | N/A | |
| Confident Anchor-Induced Multi-Source Free Domain Adaptation | Unknown | N/A | |
| Learning from Inside: Self-driven Siamese Sampling and Reasoning for Video Question Answering | Unknown | N/A | |
| Iterative Amortized Policy Optimization | Unknown | N/A | |
| The Sensory Neuron as a Transformer: Permutation-Invariant Neural Networks for Reinforcement Learning | Unknown | N/A | |
| Encoding Robustness to Image Style via Adversarial Feature Perturbations | Unknown | N/A | |
| Structure-Aware Random Fourier Kernel for Graphs | Unknown | N/A | |
| Risk Monotonicity in Statistical Learning | Unknown | N/A | |
| Recognizing Vector Graphics without Rasterization | Unknown | N/A | |
| Large-Scale Learning with Fourier Features and Tensor Decompositions | Unknown | N/A | |
| Implicit Semantic Response Alignment for Partial Domain Adaptation | Unknown | N/A | |
| Exponential Separation between Two Learning Models and Adversarial Robustness | Unknown | N/A | |
| SBO-RNN: Reformulating Recurrent Neural Networks via Stochastic Bilevel Optimization | Unknown | N/A | |
| Variational Continual Bayesian Meta-Learning | Unknown | N/A | |
| Learning One Representation to Optimize All Rewards | Unknown | N/A | |
| Data-Efficient GAN Training Beyond (Just) Augmentations: A Lottery Ticket Perspective | Unknown | N/A | |
| Linear Convergence of Gradient Methods for Estimating Structured Transition Matrices in High-dimensional Vector Autoregressive Models | Unknown | N/A | |
| Raw Nav-merge Seismic Data to Subsurface Properties with MLP based Multi-Modal Information Unscrambler | Unknown | N/A | |
| On the Convergence and Sample Efficiency of Variance-Reduced Policy Gradient Method | Unknown | N/A | |
| Robust Optimization for Multilingual Translation with Imbalanced Data | Unknown | N/A | |
| SPANN: Highly-efficient Billion-scale Approximate Nearest Neighborhood Search | Unknown | N/A | |
| Class-Incremental Learning via Dual Augmentation | Unknown | N/A | |
| Mitigating Forgetting in Online Continual Learning with Neuron Calibration | Unknown | N/A | |
| Robust Auction Design in the Auto-bidding World | Unknown | N/A | |
| CROCS: Clustering and Retrieval of Cardiac Signals Based on Patient Disease Class, Sex, and Age | Unknown | N/A | |
| Multi-Objective SPIBB: Seldonian Offline Policy Improvement with Safety Constraints in Finite MDPs | Unknown | N/A | |
| The Pareto Frontier of model selection for general Contextual Bandits | Unknown | N/A | |
| Asymptotically Exact Error Characterization of Offline Policy Evaluation with Misspecified Linear Models | Unknown | N/A | |
| Causal Bandits with Unknown Graph Structure | Unknown | N/A | |
| Emergent Discrete Communication in Semantic Spaces | Unknown | N/A | |
| On the Stochastic Stability of Deep Markov Models | Unknown | N/A | |
| Statistical Inference with M-Estimators on Adaptively Collected Data | Unknown | N/A | |
| Reward-Free Model-Based Reinforcement Learning with Linear Function Approximation | Unknown | N/A | |
| Locality Sensitive Teaching | Unknown | N/A | |
| RL for Latent MDPs: Regret Guarantees and a Lower Bound | Unknown | N/A | |
| Distribution-free inference for regression: discrete, continuous, and in between | Unknown | N/A | |
| Lattice partition recovery with dyadic CART | Unknown | N/A | |
| The Flip Side of the Reweighted Coin: Duality of Adaptive Dropout and Regularization | Unknown | N/A | |
| Beyond the Signs: Nonparametric Tensor Completion via Sign Series | Unknown | N/A | |
| Provable Model-based Nonlinear Bandit and Reinforcement Learning: Shelve Optimism, Embrace Virtual Curvature | Unknown | N/A | |
| Tensor decompositions of higher-order correlations by nonlinear Hebbian plasticity | Unknown | N/A | |
| Curriculum Learning for Vision-and-Language Navigation | Unknown | N/A | |
| Information Directed Sampling for Sparse Linear Bandits | Unknown | N/A | |
| A generative nonparametric Bayesian model for whole genomes | Unknown | N/A | |
| Derivative-Free Policy Optimization for Linear Risk-Sensitive and Robust Control Design: Implicit Regularization and Sample Complexity | Unknown | N/A | |
| A Unified View of cGANs with and without Classifiers | Unknown | N/A | |
| Sub-Linear Memory: How to Make Performers SLiM | Unknown | N/A | |
| Challenges and Opportunities in High Dimensional Variational Inference | Unknown | N/A | |
| On the Existence of The Adversarial Bayes Classifier | Unknown | N/A | |
| Cross-modal Domain Adaptation for Cost-Efficient Visual Reinforcement Learning | Unknown | N/A | |
| To Beam Or Not To Beam: That is a Question of Cooperation for Language GANs | Unknown | N/A | |
| True Few-Shot Learning with Language Models | Unknown | N/A | |
| Label Disentanglement in Partition-based Extreme Multilabel Classification | Unknown | N/A | |
| Towards understanding retrosynthesis by energy-based models | Unknown | N/A | |
| Rectangular Flows for Manifold Learning | Unknown | N/A | |
| Two Sides of Meta-Learning Evaluation: In vs. Out of Distribution | Unknown | N/A | |
| Stability & Generalisation of Gradient Descent for Shallow Neural Networks without the Neural Tangent Kernel | Unknown | N/A | |
| DiBS: Differentiable Bayesian Structure Learning | Unknown | N/A | |
| BARTScore: Evaluating Generated Text as Text Generation | Unknown | N/A | |
| Fast and accurate randomized algorithms for low-rank tensor decompositions | Unknown | N/A | |
| Nearly Horizon-Free Offline Reinforcement Learning | Unknown | N/A | |
| CogView: Mastering Text-to-Image Generation via Transformers | Unknown | N/A | |
| Private and Non-private Uniformity Testing for Ranking Data | Unknown | N/A | |
| Universal Graph Convolutional Networks | Unknown | N/A | |
| Causal Inference for Event Pairs in Multivariate Point Processes | Unknown | N/A | |
| Labeling Trick: A Theory of Using Graph Neural Networks for Multi-Node Representation Learning | Unknown | N/A | |
| On the Power of Differentiable Learning versus PAC and SQ Learning | Unknown | N/A | |
| SOLQ: Segmenting Objects by Learning Queries | Unknown | N/A | |
| Bandits with Knapsacks beyond the Worst Case | Unknown | N/A | |
| Counterfactual Invariance to Spurious Correlations in Text Classification | Unknown | N/A | |
| Sample-Efficient Reinforcement Learning for Linearly-Parameterized MDPs with a Generative Model | Unknown | N/A | |
| Identity testing for Mallows model | Unknown | N/A | |
| Generalized and Discriminative Few-Shot Object Detection via SVD-Dictionary Enhancement | Unknown | N/A | |
| Localization with Sampling-Argmax | Unknown | N/A | |
| Probabilistic Entity Representation Model for Reasoning over Knowledge Graphs | Unknown | N/A | |
| Automated Discovery of Adaptive Attacks on Adversarial Defenses | Unknown | N/A | |
| Learning with Labeling Induced Abstentions | Unknown | N/A | |
| Fair Sparse Regression with Clustering: An Invex Relaxation for a Combinatorial Problem | Unknown | N/A | |
| Revisiting Hilbert-Schmidt Information Bottleneck for Adversarial Robustness | Unknown | N/A | |
| Fast Policy Extragradient Methods for Competitive Games with Entropy Regularization | Unknown | N/A | |
| A Regression Approach to Learning-Augmented Online Algorithms | Unknown | N/A | |
| Revenue maximization via machine learning with noisy data | Unknown | N/A | |
| Planning from Pixels in Environments with Combinatorially Hard Search Spaces | Unknown | N/A | |
| Privately Learning Subspaces | Unknown | N/A | |
| Which Mutual-Information Representation Learning Objectives are Sufficient for Control? | Unknown | N/A | |
| Symbolic Regression via Deep Reinforcement Learning Enhanced Genetic Programming Seeding | Unknown | N/A | |
| Risk Bounds for Over-parameterized Maximum Margin Classification on Sub-Gaussian Mixtures | Unknown | N/A | |
| Spatial Ensemble: a Novel Model Smoothing Mechanism for Student-Teacher Framework | Unknown | N/A | |
| Online Multi-Armed Bandits with Adaptive Inference | Unknown | N/A | |
| Equilibrium and non-Equilibrium regimes in the learning of Restricted Boltzmann Machines | Unknown | N/A | |
| Contextual Recommendations and Low-Regret Cutting-Plane Algorithms | Unknown | N/A | |
| UniDoc: Unified Pretraining Framework for Document Understanding | Unknown | N/A | |
| The effectiveness of feature attribution methods and its correlation with automatic evaluation scores | Unknown | N/A | |
| Subquadratic Overparameterization for Shallow Neural Networks | Unknown | N/A | |
| Learning Semantic Representations to Verify Hardware Designs | Unknown | N/A | |
| Constrained Optimization to Train Neural Networks on Critical and Under-Represented Classes | Unknown | N/A | |
| Surrogate Regret Bounds for Polyhedral Losses | Unknown | N/A | |
| A Variational Perspective on Diffusion-Based Generative Models and Score Matching | Unknown | N/A | |
| Unifying Width-Reduced Methods for Quasi-Self-Concordant Optimization | Unknown | N/A | |
| Shapeshifter: a Parameter-efficient Transformer using Factorized Reshaped Matrices | Unknown | N/A | |
| A Bayesian-Symbolic Approach to Reasoning and Learning in Intuitive Physics | Unknown | N/A | |
| Cardinality constrained submodular maximization for random streams | Unknown | N/A | |
| On Calibration and Out-of-Domain Generalization | Unknown | N/A | |
| Nonuniform Negative Sampling and Log Odds Correction with Rare Events Data | Unknown | N/A | |
| For high-dimensional hierarchical models, consider exchangeability of effects across covariates instead of across datasets | Unknown | N/A | |
| Intriguing Properties of Contrastive Losses | Unknown | N/A | |
| Answering Complex Causal Queries With the Maximum Causal Set Effect | Unknown | N/A | |
| Generalizable Multi-linear Attention Network | Unknown | N/A | |
| Co-Adaptation of Algorithmic and Implementational Innovations in Inference-based Deep Reinforcement Learning | Unknown | N/A | |
| Test-time Collective Prediction | Unknown | N/A | |
| Statistical Undecidability in Linear, Non-Gaussian Causal Models in the Presence of Latent Confounders | Unknown | N/A | |
| Are Transformers more robust than CNNs? | Unknown | N/A | |
| Approximate optimization of convex functions with outlier noise | Unknown | N/A | |
| Reliable and Trustworthy Machine Learning for Health Using Dataset Shift Detection | Unknown | N/A | |
| SimiGrad: Fine-Grained Adaptive Batching for Large Scale Training using Gradient Similarity Measurement | Unknown | N/A | |
| Bootstrap Your Object Detector via Mixed Training | Unknown | N/A | |
| Can fMRI reveal the representation of syntactic structure in the brain? | Unknown | N/A | |
| On the Algorithmic Stability of Adversarial Training | Unknown | N/A | |
| Damped Anderson Mixing for Deep Reinforcement Learning: Acceleration, Convergence, and Stabilization | Unknown | N/A | |
| Can multi-label classification networks know what they don’t know? | Unknown | N/A | |
| AFEC: Active Forgetting of Negative Transfer in Continual Learning | Unknown | N/A | |
| Near-Optimal Multi-Perturbation Experimental Design for Causal Structure Learning | Unknown | N/A | |
| Exploring Forensic Dental Identification with Deep Learning | Unknown | N/A | |
| Dissecting the Diffusion Process in Linear Graph Convolutional Networks | Unknown | N/A | |
| Solving Graph-based Public Goods Games with Tree Search and Imitation Learning | Unknown | N/A | |
| NeurWIN: Neural Whittle Index Network For Restless Bandits Via Deep RL | Unknown | N/A | |
| On Riemannian Optimization over Positive Definite Matrices with the Bures-Wasserstein Geometry | Unknown | N/A | |
| Safe Pontryagin Differentiable Programming | Unknown | N/A | |
| Generic Neural Architecture Search via Regression | Unknown | N/A | |
| Graph Differentiable Architecture Search with Structure Learning | Unknown | N/A | |
| Reinforcement Learning in Reward-Mixing MDPs | Unknown | N/A | |
| A Highly-Efficient Group Elastic Net Algorithm with an Application to Function-On-Scalar Regression | Unknown | N/A | |
| Not All Low-Pass Filters are Robust in Graph Convolutional Networks | Unknown | N/A | |
| Implicit Regularization in Matrix Sensing via Mirror Descent | Unknown | N/A | |
| Generalized DataWeighting via Class-Level Gradient Manipulation | Unknown | N/A | |
| Online Robust Reinforcement Learning with Model Uncertainty | Unknown | N/A | |
| Single Layer Predictive Normalized Maximum Likelihood for Out-of-Distribution Detection | Unknown | N/A | |
| Non-Asymptotic Analysis for Two Time-scale TDC with General Smooth Function Approximation | Unknown | N/A | |
| Continuized Accelerations of Deterministic and Stochastic Gradient Descents, and of Gossip Algorithms | Unknown | N/A | |
| Oracle Complexity in Nonsmooth Nonconvex Optimization | Unknown | N/A | |
| Unlabeled Principal Component Analysis | Unknown | N/A | |
| Residual2Vec: Debiasing graph embedding with random graphs | Unknown | N/A | |
| Towards Context-Agnostic Learning Using Synthetic Data | Unknown | N/A | |
| Modality-Agnostic Topology Aware Localization | Unknown | N/A | |
| A Closer Look at the Worst-case Behavior of Multi-armed Bandit Algorithms | Unknown | N/A | |
| Asynchronous Stochastic Optimization Robust to Arbitrary Delays | Unknown | N/A | |
| Graph Neural Networks with Local Graph Parameters | Unknown | N/A | |
| Towards Sharper Generalization Bounds for Structured Prediction | Unknown | N/A | |
| L2ight: Enabling On-Chip Learning for Optical Neural Networks via Efficient in-situ Subspace Optimization | Unknown | N/A | |
| Coresets for Time Series Clustering | Unknown | N/A | |
| MCMC Variational Inference via Uncorrected Hamiltonian Annealing | Unknown | N/A | |
| Adversarial Examples for k-Nearest Neighbor Classifiers Based on Higher-Order Voronoi Diagrams | Unknown | N/A | |
| Implicit Sparse Regularization: The Impact of Depth and Early Stopping | Unknown | N/A | |
| Fixes That Fail: Self-Defeating Improvements in Machine-Learning Systems | Unknown | N/A | |
| Risk-Aware Transfer in Reinforcement Learning using Successor Features | Unknown | N/A | |
| A Biased Graph Neural Network Sampler with Near-Optimal Regret | Unknown | N/A | |
| Coresets for Decision Trees of Signals | Unknown | N/A | |
| Quantifying and Improving Transferability in Domain Generalization | Unknown | N/A | |
| Online Selective Classification with Limited Feedback | Unknown | N/A | |
| Concentration inequalities under sub-Gaussian and sub-exponential conditions | Unknown | N/A | |
| Subgaussian and Differentiable Importance Sampling for Off-Policy Evaluation and Learning | Unknown | N/A | |
| Combining Latent Space and Structured Kernels for Bayesian Optimization over Combinatorial Spaces | Unknown | N/A | |
| MAU: A Motion-Aware Unit for Video Prediction and Beyond | Unknown | N/A | |
| Width-based Lookaheads with Learnt Base Policies and Heuristics Over the Atari-2600 Benchmark | Unknown | N/A | |
| FastCorrect: Fast Error Correction with Edit Alignment for Automatic Speech Recognition | Unknown | N/A | |
| STEP: Out-of-Distribution Detection in the Presence of Limited In-Distribution Labeled Data | Unknown | N/A | |
| The Complexity of Bayesian Network Learning: Revisiting the Superstructure | Unknown | N/A | |
| Tighter Expected Generalization Error Bounds via Wasserstein Distance | Unknown | N/A | |
| Differentiable Learning Under Triage | Unknown | N/A | |
| Analytical Study of Momentum-Based Acceleration Methods in Paradigmatic High-Dimensional Non-Convex Problems | Unknown | N/A | |
| GraphFormers: GNN-nested Transformers for Representation Learning on Textual Graph | Unknown | N/A | |
| Hyperbolic Busemann Learning with Ideal Prototypes | Unknown | N/A | |
| Meta-Learning for Relative Density-Ratio Estimation | Unknown | N/A | |
| TacticZero: Learning to Prove Theorems from Scratch with Deep Reinforcement Learning | Unknown | N/A | |
| ROI Maximization in Stochastic Online Decision-Making | Unknown | N/A | |
| Asymptotics of representation learning in finite Bayesian neural networks | Unknown | N/A | |
| Reverse engineering recurrent neural networks with Jacobian switching linear dynamical systems | Unknown | N/A | |
| Breaking the Sample Complexity Barrier to Regret-Optimal Model-Free Reinforcement Learning | Unknown | N/A | |
| Revisiting ResNets: Improved Training and Scaling Strategies | Unknown | N/A | |
| Communication-efficient SGD: From Local SGD to One-Shot Averaging | Unknown | N/A | |
| DSelect-k: Differentiable Selection in the Mixture of Experts with Applications to Multi-Task Learning | Unknown | N/A | |
| Well-tuned Simple Nets Excel on Tabular Datasets | Unknown | N/A | |
| A Central Limit Theorem for Differentially Private Query Answering | Unknown | N/A | |
| A Little Robustness Goes a Long Way: Leveraging Robust Features for Targeted Transfer Attacks | Unknown | N/A | |
| Learning Dynamic Graph Representation of Brain Connectome with Spatio-Temporal Attention | Unknown | N/A | |
| Diffusion Models Beat GANs on Image Synthesis | Unknown | N/A | |
| Metadata-based Multi-Task Bandits with Bayesian Hierarchical Models | Unknown | N/A | |
| A Contrastive Learning Approach for Training Variational Autoencoder Priors | Unknown | N/A | |
| Are My Deep Learning Systems Fair? An Empirical Study of Fixed-Seed Training | Unknown | N/A | |
| Coresets for Clustering with Missing Values | Unknown | N/A | |
| Representation Learning on Spatial Networks | Unknown | N/A | |
| Chasing Sparsity in Vision Transformers: An End-to-End Exploration | Unknown | N/A | |
| Cycle Self-Training for Domain Adaptation | Unknown | N/A | |
| Self-Supervised Multi-Object Tracking with Cross-input Consistency | Unknown | N/A | |
| Generalizable Imitation Learning from Observation via Inferring Goal Proximity | Unknown | N/A | |
| Information-theoretic generalization bounds for black-box learning algorithms | Unknown | N/A | |
| Disentangled Contrastive Learning on Graphs | Unknown | N/A | |
| Deep Proxy Causal Learning and its Application to Confounded Bandit Policy Evaluation | Unknown | N/A | |
| CANITA: Faster Rates for Distributed Convex Optimization with Communication Compression | Unknown | N/A | |
| Adversarial Regression with Doubly Non-negative Weighting Matrices | Unknown | N/A | |
| ConE: Cone Embeddings for Multi-Hop Reasoning over Knowledge Graphs | Unknown | N/A | |
| Combinatorial Pure Exploration with Bottleneck Reward Function | Unknown | N/A | |
| Few-Shot Object Detection via Association and DIscrimination | Unknown | N/A | |
| Coarse-to-fine Animal Pose and Shape Estimation | Unknown | N/A | |
| Memory-Efficient Approximation Algorithms for Max-k-Cut and Correlation Clustering | Unknown | N/A | |
| An Online Method for A Class of Distributionally Robust Optimization with Non-convex Objectives | Unknown | N/A | |
| Regime Switching Bandits | Unknown | N/A | |
| Conformal Bayesian Computation | Unknown | N/A | |
| Two steps to risk sensitivity | Unknown | N/A | |
| Rebooting ACGAN: Auxiliary Classifier GANs with Stable Training | Unknown | N/A | |
| Causal Influence Detection for Improving Efficiency in Reinforcement Learning | Unknown | N/A | |
| NeuroLKH: Combining Deep Learning Model with Lin-Kernighan-Helsgaun Heuristic for Solving the Traveling Salesman Problem | Unknown | N/A | |
| Learning to Time-Decode in Spiking Neural Networks Through the Information Bottleneck | Unknown | N/A | |
| Uncertainty-Driven Loss for Single Image Super-Resolution | Unknown | N/A | |
| Continual World: A Robotic Benchmark For Continual Reinforcement Learning | Unknown | N/A | |
| Spectral embedding for dynamic networks with stability guarantees | Unknown | N/A | |
| Decentralized Learning in Online Queuing Systems | Unknown | N/A | |
| Framing RNN as a kernel method: A neural ODE approach | Unknown | N/A | |
| Algorithmic Instabilities of Accelerated Gradient Descent | Unknown | N/A | |
| Dual Progressive Prototype Network for Generalized Zero-Shot Learning | Unknown | N/A | |
| Distributed Principal Component Analysis with Limited Communication | Unknown | N/A | |
| Efficient Active Learning for Gaussian Process Classification by Error Reduction | Unknown | N/A | |
| E(n) Equivariant Normalizing Flows | Unknown | N/A | |
| Scalable Bayesian GPFA with automatic relevance determination and discrete noise models | Unknown | N/A | |
| Sharp Impossibility Results for Hyper-graph Testing | Unknown | N/A | |
| Private learning implies quantum stability | Unknown | N/A | |
| IRM—when it works and when it doesn't: A test case of natural language inference | Unknown | N/A | |
| GemNet: Universal Directional Graph Neural Networks for Molecules | Unknown | N/A | |
| Tight High Probability Bounds for Linear Stochastic Approximation with Fixed Stepsize | Unknown | N/A | |
| Learning to Combine Per-Example Solutions for Neural Program Synthesis | Unknown | N/A | |
| Causal Navigation by Continuous-time Neural Networks | Unknown | N/A | |
| Provable Representation Learning for Imitation with Contrastive Fourier Features | Unknown | N/A | |
| Detecting Errors and Estimating Accuracy on Unlabeled Data with Self-training Ensembles | Unknown | N/A | |
| Parametrized Quantum Policies for Reinforcement Learning | Unknown | N/A | |
| Parameter Prediction for Unseen Deep Architectures | Unknown | N/A | |
| A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning | Unknown | N/A | |
| LLC: Accurate, Multi-purpose Learnt Low-dimensional Binary Codes | Unknown | N/A | |
| How can classical multidimensional scaling go wrong? | Unknown | N/A | |
| Deep Extrapolation for Attribute-Enhanced Generation | Unknown | N/A | |
| Separation Results between Fixed-Kernel and Feature-Learning Probability Metrics | Unknown | N/A | |
| Removing Inter-Experimental Variability from Functional Data in Systems Neuroscience | Unknown | N/A | |
| Neural optimal feedback control with local learning rules | Unknown | N/A | |
| HyperSPNs: Compact and Expressive Probabilistic Circuits | Unknown | N/A | |
| Robust Generalization despite Distribution Shift via Minimum Discriminating Information | Unknown | N/A | |
| Tracking Without Re-recognition in Humans and Machines | Unknown | N/A | |
| Luna: Linear Unified Nested Attention | Unknown | N/A | |
| Modified Frank Wolfe in Probability Space | Unknown | N/A | |
| EDGE: Explaining Deep Reinforcement Learning Policies | Unknown | N/A | |
| An Empirical Investigation of Domain Generalization with Empirical Risk Minimizers | Unknown | N/A | |
| Differentially Private n-gram Extraction | Unknown | N/A | |
| Heuristic-Guided Reinforcement Learning | Unknown | N/A | |
| A Note on Sparse Generalized Eigenvalue Problem | Unknown | N/A | |
| On Empirical Risk Minimization with Dependent and Heavy-Tailed Data | Unknown | N/A | |
| Celebrating Diversity in Shared Multi-Agent Reinforcement Learning | Unknown | N/A | |
| Mirror Langevin Monte Carlo: the Case Under Isoperimetry | Unknown | N/A | |
| HSVA: Hierarchical Semantic-Visual Adaptation for Zero-Shot Learning | Unknown | N/A | |
| Exploration-Exploitation in Multi-Agent Competition: Convergence with Bounded Rationality | Unknown | N/A | |
| Finding Regions of Heterogeneity in Decision-Making via Expected Conditional Covariance | Unknown | N/A | |
| Neural Additive Models: Interpretable Machine Learning with Neural Nets | Unknown | N/A | |
| You Never Cluster Alone | Unknown | N/A | |
| Rethinking the Pruning Criteria for Convolutional Neural Network | Unknown | N/A | |
| Bridging Explicit and Implicit Deep Generative Models via Neural Stein Estimators | Unknown | N/A | |
| Distilling Object Detectors with Feature Richness | Unknown | N/A | |
| Support Recovery of Sparse Signals from a Mixture of Linear Measurements | Unknown | N/A | |
| Residual Relaxation for Multi-view Representation Learning | Unknown | N/A | |
| Latent Execution for Neural Program Synthesis Beyond Domain-Specific Languages | Unknown | N/A | |
| Landmark-Guided Subgoal Generation in Hierarchical Reinforcement Learning | Unknown | N/A | |
| Optimal Rates for Random Order Online Optimization | Unknown | N/A | |
| Autonomous Reinforcement Learning via Subgoal Curricula | Unknown | N/A | |
| Tuning Large Neural Networks via Zero-Shot Hyperparameter Transfer | Unknown | N/A | |
| $\texttt{LeadCache}$: Regret-Optimal Caching in Networks | Unknown | N/A | |
| Noise2Score: Tweedie’s Approach to Self-Supervised Image Denoising without Clean Images | Unknown | N/A | |
| Adversarial Robustness with Non-uniform Perturbations | Unknown | N/A | |
| Object-Aware Regularization for Addressing Causal Confusion in Imitation Learning | Unknown | N/A | |
| Optimal Uniform OPE and Model-based Offline Reinforcement Learning in Time-Homogeneous, Reward-Free and Task-Agnostic Settings | Unknown | N/A | |
| Online Matching in Sparse Random Graphs: Non-Asymptotic Performances of Greedy Algorithm | Unknown | N/A | |
| All Tokens Matter: Token Labeling for Training Better Vision Transformers | Unknown | N/A | |
| The decomposition of the higher-order homology embedding constructed from the $k$-Laplacian | Unknown | N/A | |
| Catch-A-Waveform: Learning to Generate Audio from a Single Short Example | Unknown | N/A | |
| Curriculum Disentangled Recommendation with Noisy Multi-feedback | Unknown | N/A | |
| Unsupervised Motion Representation Learning with Capsule Autoencoders | Unknown | N/A | |
| On Margin-Based Cluster Recovery with Oracle Queries | Unknown | N/A | |
| Locally Most Powerful Bayesian Test for Out-of-Distribution Detection using Deep Generative Models | Unknown | N/A | |
| Mixture weights optimisation for Alpha-Divergence Variational Inference | Unknown | N/A | |
| Fast and Memory Efficient Differentially Private-SGD via JL Projections | Unknown | N/A | |
| Conformal Time-series Forecasting | Unknown | N/A | |
| A Max-Min Entropy Framework for Reinforcement Learning | Unknown | N/A | |
| Instance-Dependent Partial Label Learning | Unknown | N/A | |
| Leveraging Distribution Alignment via Stein Path for Cross-Domain Cold-Start Recommendation | Unknown | N/A | |
| Modeling Heterogeneous Hierarchies with Relation-specific Hyperbolic Cones | Unknown | N/A | |
| Adaptive Diffusion in Graph Neural Networks | Unknown | N/A | |
| Explaining Latent Representations with a Corpus of Examples | Unknown | N/A | |
| Sparse Quadratic Optimisation over the Stiefel Manifold with Application to Permutation Synchronisation | Unknown | N/A | |
| Knowledge-inspired 3D Scene Graph Prediction in Point Cloud | Unknown | N/A | |
| Regularization in ResNet with Stochastic Depth | Unknown | N/A | |
| Photonic Differential Privacy with Direct Feedback Alignment | Unknown | N/A | |
| Few-Round Learning for Federated Learning | Unknown | N/A | |
| Multiclass Boosting and the Cost of Weak Learning | Unknown | N/A | |
| On Optimal Robustness to Adversarial Corruption in Online Decision Problems | Unknown | N/A | |
| Self-Diagnosing GAN: Diagnosing Underrepresented Samples in Generative Adversarial Networks | Unknown | N/A | |
| Addressing Algorithmic Disparity and Performance Inconsistency in Federated Learning | Unknown | N/A | |
| ABC: Auxiliary Balanced Classifier for Class-imbalanced Semi-supervised Learning | Unknown | N/A | |
| Rethinking Calibration of Deep Neural Networks: Do Not Be Afraid of Overconfidence | Unknown | N/A | |
| There Is No Turning Back: A Self-Supervised Approach for Reversibility-Aware Reinforcement Learning | Unknown | N/A | |
| Learning Interpretable Decision Rule Sets: A Submodular Optimization Approach | Unknown | N/A | |
| Fault-Tolerant Federated Reinforcement Learning with Theoretical Guarantee | Unknown | N/A | |
| Boosted CVaR Classification | Unknown | N/A | |
| MICo: Improved representations via sampling-based state similarity for Markov decision processes | Unknown | N/A | |
| Federated Hyperparameter Tuning: Challenges, Baselines, and Connections to Weight-Sharing | Unknown | N/A | |
| Fast Minimum-norm Adversarial Attacks through Adaptive Norm Constraints | Unknown | N/A | |
| Dealing With Misspecification In Fixed-Confidence Linear Top-m Identification | Unknown | N/A | |
| BAST: Bayesian Additive Regression Spanning Trees for Complex Constrained Domain | Unknown | N/A | |
| On Memorization in Probabilistic Deep Generative Models | Unknown | N/A | |
| Assessing Fairness in the Presence of Missing Data | Unknown | N/A | |
| Entropy-based adaptive Hamiltonian Monte Carlo | Unknown | N/A | |
| DeepReduce: A Sparse-tensor Communication Framework for Federated Deep Learning | Unknown | N/A | |
| An Improved Analysis and Rates for Variance Reduction under Without-replacement Sampling Orders | Unknown | N/A | |
| Taxonomizing local versus global structure in neural network loss landscapes | Unknown | N/A | |
| Making a (Counterfactual) Difference One Rationale at a Time | Unknown | N/A | |
| RIM: Reliable Influence-based Active Learning on Graphs | Unknown | N/A | |
| SOFT: Softmax-free Transformer with Linear Complexity | Unknown | N/A | |
| Node Dependent Local Smoothing for Scalable Graph Learning | Unknown | N/A | |
| Simple Stochastic and Online Gradient Descent Algorithms for Pairwise Learning | Unknown | N/A | |
| A Geometric Analysis of Neural Collapse with Unconstrained Features | Unknown | N/A | |
| Noisy Adaptation Generates Lévy Flights in Attractor Neural Networks | Unknown | N/A | |
| Reverse-Complement Equivariant Networks for DNA Sequences | Unknown | N/A | |
| Test-Time Classifier Adjustment Module for Model-Agnostic Domain Generalization | Unknown | N/A | |
| Statistical Query Lower Bounds for List-Decodable Linear Regression | Unknown | N/A | |
| The Unbalanced Gromov Wasserstein Distance: Conic Formulation and Relaxation | Unknown | N/A | |
| Can You Learn an Algorithm? Generalizing from Easy to Hard Problems with Recurrent Networks | Unknown | N/A | |
| Adversarial Teacher-Student Representation Learning for Domain Generalization | Unknown | N/A | |
| Neural Bootstrapper | Unknown | N/A | |
| Learning to Draw: Emergent Communication through Sketching | Unknown | N/A | |
| Counterfactual Maximum Likelihood Estimation for Training Deep Networks | Unknown | N/A | |
| Fitting summary statistics of neural data with a differentiable spiking network simulator | Unknown | N/A | |
| Littlestone Classes are Privately Online Learnable | Unknown | N/A | |
| Can contrastive learning avoid shortcut solutions? | Unknown | N/A | |
| Improved Variance-Aware Confidence Sets for Linear Bandits and Linear Mixture MDP | Unknown | N/A | |
| Contrastive Learning for Neural Topic Model | Unknown | N/A | |
| Scallop: From Probabilistic Deductive Databases to Scalable Differentiable Reasoning | Unknown | N/A | |
| Uniform-PAC Bounds for Reinforcement Learning with Linear Function Approximation | Unknown | N/A | |
| Last iterate convergence of SGD for Least-Squares in the Interpolation regime. | Unknown | N/A | |
| Provable Benefits of Actor-Critic Methods for Offline Reinforcement Learning | Unknown | N/A | |
| Necessary and sufficient graphical conditions for optimal adjustment sets in causal graphical models with hidden variables | Unknown | N/A | |
| Differentiable Unsupervised Feature Selection based on a Gated Laplacian | Unknown | N/A | |
| Uniform Concentration Bounds toward a Unified Framework for Robust Clustering | Unknown | N/A | |
| Risk-Averse Bayes-Adaptive Reinforcement Learning | Unknown | N/A | |
| Approximate Decomposable Submodular Function Minimization for Cardinality-Based Components | Unknown | N/A | |
| Lower and Upper Bounds on the Pseudo-Dimension of Tensor Network Models | Unknown | N/A | |
| Permutation-Invariant Variational Autoencoder for Graph-Level Representation Learning | Unknown | N/A | |
| Federated Reconstruction: Partially Local Federated Learning | Unknown | N/A | |
| Provably Efficient Reinforcement Learning with Linear Function Approximation under Adaptivity Constraints | Unknown | N/A | |
| K-level Reasoning for Zero-Shot Coordination in Hanabi | Unknown | N/A | |
| A Theory of the Distortion-Perception Tradeoff in Wasserstein Space | Unknown | N/A | |
| Learning a Single Neuron with Bias Using Gradient Descent | Unknown | N/A | |
| Offline Meta Reinforcement Learning -- Identifiability Challenges and Effective Data Collection Strategies | Unknown | N/A | |
| The Many Faces of Adversarial Risk | Unknown | N/A | |
| Re-ranking for image retrieval and transductive few-shot classification | Unknown | N/A | |
| Impression learning: Online representation learning with synaptic plasticity | Unknown | N/A | |
| Adaptive Conformal Inference Under Distribution Shift | Unknown | N/A | |
| Practical, Provably-Correct Interactive Learning in the Realizable Setting: The Power of True Believers | Unknown | N/A | |
| Neural Distance Embeddings for Biological Sequences | Unknown | N/A | |
| REMIPS: Physically Consistent 3D Reconstruction of Multiple Interacting People under Weak Supervision | Unknown | N/A | |
| Adaptive wavelet distillation from neural networks through interpretations | Unknown | N/A | |
| Credit Assignment Through Broadcasting a Global Error Vector | Unknown | N/A | |
| Robust Online Correlation Clustering | Unknown | N/A | |
| DOCTOR: A Simple Method for Detecting Misclassification Errors | Unknown | N/A | |
| Out-of-Distribution Generalization in Kernel Regression | Unknown | N/A | |
| Functionally Regionalized Knowledge Transfer for Low-resource Drug Discovery | Unknown | N/A | |
| Efficiently Learning One Hidden Layer ReLU Networks From Queries | Unknown | N/A | |
| Truncated Marginal Neural Ratio Estimation | Unknown | N/A | |
| Learning Barrier Certificates: Towards Safe Reinforcement Learning with Zero Training-time Violations | Unknown | N/A | |
| Hyperparameter Optimization Is Deceiving Us, and How to Stop It | Unknown | N/A | |
| Scalable and Stable Surrogates for Flexible Classifiers with Fairness Constraints | Unknown | N/A | |
| Pointwise Bounds for Distribution Estimation under Communication Constraints | Unknown | N/A | |
| Backward-Compatible Prediction Updates: A Probabilistic Approach | Unknown | N/A | |
| Universal Rate-Distortion-Perception Representations for Lossy Compression | Unknown | N/A | |
| Autobahn: Automorphism-based Graph Neural Nets | Unknown | N/A | |
| Fractal Structure and Generalization Properties of Stochastic Optimization Algorithms | Unknown | N/A | |
| Differentiable Annealed Importance Sampling and the Perils of Gradient Noise | Unknown | N/A | |
| Learning to Ground Multi-Agent Communication with Autoencoders | Unknown | N/A | |
| BCD Nets: Scalable Variational Approaches for Bayesian Causal Discovery | Unknown | N/A | |
| Turing Completeness of Bounded-Precision Recurrent Neural Networks | Unknown | N/A | |
| Interpretable agent communication from scratch (with a generic visual processor emerging on the side) | Unknown | N/A | |
| Mean-based Best Arm Identification in Stochastic Bandits under Reward Contamination | Unknown | N/A | |
| A Provably Efficient Sample Collection Strategy for Reinforcement Learning | Unknown | N/A | |
| Searching for Efficient Transformers for Language Modeling | Unknown | N/A | |
| Combining Human Predictions with Model Probabilities via Confusion Matrices and Calibration | Unknown | N/A | |
| On Large-Cohort Training for Federated Learning | Unknown | N/A | |
| A/B Testing for Recommender Systems in a Two-sided Marketplace | Unknown | N/A | |
| Detecting and Adapting to Irregular Distribution Shifts in Bayesian Online Learning | Unknown | N/A | |
| Can we globally optimize cross-validation loss? Quasiconvexity in ridge regression | Unknown | N/A | |
| A Prototype-Oriented Framework for Unsupervised Domain Adaptation | Unknown | N/A | |
| Probabilistic Attention for Interactive Segmentation | Unknown | N/A | |
| Safe Policy Optimization with Local Generalized Linear Function Approximations | Unknown | N/A | |
| Locally Valid and Discriminative Prediction Intervals for Deep Learning Models | Unknown | N/A | |
| Extracting Deformation-Aware Local Features by Learning to Deform | Unknown | N/A | |
| NxMTransformer: Semi-Structured Sparsification for Natural Language Understanding via ADMM | Unknown | N/A | |
| Lip to Speech Synthesis with Visual Context Attentional GAN | Unknown | N/A | |
| Sparse Deep Learning: A New Framework Immune to Local Traps and Miscalibration | Unknown | N/A | |
| RMIX: Learning Risk-Sensitive Policies for Cooperative Reinforcement Learning Agents | Unknown | N/A | |
| Finite-Sample Analysis of Off-Policy TD-Learning via Generalized Bellman Operators | Unknown | N/A | |
| Fine-Grained Zero-Shot Learning with DNA as Side Information | Unknown | N/A | |
| Debiased Visual Question Answering from Feature and Sample Perspectives | Unknown | N/A | |
| Towards a Theoretical Framework of Out-of-Distribution Generalization | Unknown | N/A | |
| Handling Long-tailed Feature Distribution in AdderNets | Unknown | N/A | |
| Gradient-Free Adversarial Training Against Image Corruption for Learning-based Steering | Unknown | N/A | |
| Increasing Liquid State Machine Performance with Edge-of-Chaos Dynamics Organized by Astrocyte-modulated Plasticity | Unknown | N/A | |
| Capacity and Bias of Learned Geometric Embeddings for Directed Graphs | Unknown | N/A | |
| Word2Fun: Modelling Words as Functions for Diachronic Word Representation | Unknown | N/A | |
| Provably Faster Algorithms for Bilevel Optimization | Unknown | N/A | |
| MixSeq: Connecting Macroscopic Time Series Forecasting with Microscopic Time Series Data | Unknown | N/A | |
| Practical Near Neighbor Search via Group Testing | Unknown | N/A | |
| Understanding End-to-End Model-Based Reinforcement Learning Methods as Implicit Parameterization | Unknown | N/A | |
| Fast Abductive Learning by Similarity-based Consistency Optimization | Unknown | N/A | |
| Posterior Collapse and Latent Variable Non-identifiability | Unknown | N/A | |
| See More for Scene: Pairwise Consistency Learning for Scene Classification | Unknown | N/A | |
| Adversarial Attack Generation Empowered by Min-Max Optimization | Unknown | N/A | |
| PARP: Prune, Adjust and Re-Prune for Self-Supervised Speech Recognition | Unknown | N/A | |
| Iterative Methods for Private Synthetic Data: Unifying Framework and New Methods | Unknown | N/A | |
| When Is Unsupervised Disentanglement Possible? | Unknown | N/A | |
| Robust Pose Estimation in Crowded Scenes with Direct Pose-Level Inference | Unknown | N/A | |
| Beyond BatchNorm: Towards a Unified Understanding of Normalization in Deep Learning | Unknown | N/A | |
| You are caught stealing my winning lottery ticket! Making a lottery ticket claim its ownership | Unknown | N/A | |
| Can Less be More? When Increasing-to-Balancing Label Noise Rates Considered Beneficial | Unknown | N/A | |
| Discerning Decision-Making Process of Deep Neural Networks with Hierarchical Voting Transformation | Unknown | N/A | |
| Rethinking and Reweighting the Univariate Losses for Multi-Label Ranking: Consistency and Generalization | Unknown | N/A | |
| Learning to Adapt via Latent Domains for Adaptive Semantic Segmentation | Unknown | N/A | |
| Near Optimal Policy Optimization via REPS | Unknown | N/A | |
| Per-Pixel Classification is Not All You Need for Semantic Segmentation | Unknown | N/A | |
| Variational Automatic Curriculum Learning for Sparse-Reward Cooperative Multi-Agent Problems | Unknown | N/A | |
| Optimal Algorithms for Stochastic Contextual Preference Bandits | Unknown | N/A | |
| Batch Normalization Orthogonalizes Representations in Deep Random Networks | Unknown | N/A | |
| Exploiting Chain Rule and Bayes' Theorem to Compare Probability Distributions | Unknown | N/A | |
| Multi-Objective Meta Learning | Unknown | N/A | |
| Efficiently Identifying Task Groupings for Multi-Task Learning | Unknown | N/A | |
| Continuous Doubly Constrained Batch Reinforcement Learning | Unknown | N/A | |
| ELLA: Exploration through Learned Language Abstraction | Unknown | N/A | |
| PortaSpeech: Portable and High-Quality Generative Text-to-Speech | Unknown | N/A | |
| A mechanistic multi-area recurrent network model of decision-making | Unknown | N/A | |
| Localization, Convexity, and Star Aggregation | Unknown | N/A | |
| Learning to delegate for large-scale vehicle routing | Unknown | N/A | |
| Maximum Likelihood Training of Score-Based Diffusion Models | Unknown | N/A | |
| Graphical Models in Heavy-Tailed Markets | Unknown | N/A | |
| Reliable Post hoc Explanations: Modeling Uncertainty in Explainability | Unknown | N/A | |
| Relaxing Local Robustness | Unknown | N/A | |
| Improving Calibration through the Relationship with Adversarial Robustness | Unknown | N/A | |
| Consistent Non-Parametric Methods for Maximizing Robustness | Unknown | N/A | |
| Representation Learning for Event-based Visuomotor Policies | Unknown | N/A | |
| Towards Understanding Cooperative Multi-Agent Q-Learning with Value Factorization | Unknown | N/A | |
| Off-Policy Risk Assessment in Contextual Bandits | Unknown | N/A | |
| A Bi-Level Framework for Learning to Solve Combinatorial Optimization on Graphs | Unknown | N/A | |
| Joint Semantic Mining for Weakly Supervised RGB-D Salient Object Detection | Unknown | N/A | |
| The Inductive Bias of Quantum Kernels | Unknown | N/A | |
| Hindsight Task Relabelling: Experience Replay for Sparse Reward Meta-RL | Unknown | N/A | |
| Coordinated Proximal Policy Optimization | Unknown | N/A | |
| Estimating High Order Gradients of the Data Distribution by Denoising | Unknown | N/A | |
| Stabilizing Dynamical Systems via Policy Gradient Methods | Unknown | N/A | |
| What Makes Multi-Modal Learning Better than Single (Provably) | Unknown | N/A | |
| Cardinality-Regularized Hawkes-Granger Model | Unknown | N/A | |
| Deep Contextual Video Compression | Unknown | N/A | |
| Designing Counterfactual Generators using Deep Model Inversion | Unknown | N/A | |
| Model Adaptation: Historical Contrastive Learning for Unsupervised Domain Adaptation without Source Data | Unknown | N/A | |
| Offline Reinforcement Learning as One Big Sequence Modeling Problem | Unknown | N/A | |
| G-PATE: Scalable Differentially Private Data Generator via Private Aggregation of Teacher Discriminators | Unknown | N/A | |
| Emergent Communication of Generalizations | Unknown | N/A | |
| Glance-and-Gaze Vision Transformer | Unknown | N/A | |
| On the Sample Complexity of Privately Learning Axis-Aligned Rectangles | Unknown | N/A | |
| Teachable Reinforcement Learning via Advice Distillation | Unknown | N/A | |
| Sampling with Trustworthy Constraints: A Variational Gradient Framework | Unknown | N/A | |
| Anti-Backdoor Learning: Training Clean Models on Poisoned Data | Unknown | N/A | |
| Control Variates for Slate Off-Policy Evaluation | Unknown | N/A | |
| TriBERT: Human-centric Audio-visual Representation Learning | Unknown | N/A | |
| How Powerful are Performance Predictors in Neural Architecture Search? | Unknown | N/A | |
| RoMA: Robust Model Adaptation for Offline Model-based Optimization | Unknown | N/A | |
| Sample Complexity Bounds for Active Ranking from Multi-wise Comparisons | Unknown | N/A | |
| Understanding and Improving Early Stopping for Learning with Noisy Labels | Unknown | N/A | |
| NeRS: Neural Reflectance Surfaces for Sparse-view 3D Reconstruction in the Wild | Unknown | N/A | |
| Sageflow: Robust Federated Learning against Both Stragglers and Adversaries | Unknown | N/A | |
| A Universal Law of Robustness via Isoperimetry | Unknown | N/A | |
| Understanding the Under-Coverage Bias in Uncertainty Estimation | Unknown | N/A | |
| Improving Anytime Prediction with Parallel Cascaded Networks and a Temporal-Difference Loss | Unknown | N/A | |
| Differentially Private Model Personalization | Unknown | N/A | |
| Multi-Agent Reinforcement Learning in Stochastic Networked Systems | Unknown | N/A | |
| BulletTrain: Accelerating Robust Neural Network Training via Boundary Example Mining | Unknown | N/A | |
| Robust Predictable Control | Unknown | N/A | |
| Revisiting Model Stitching to Compare Neural Representations | Unknown | N/A | |
| Widening the Pipeline in Human-Guided Reinforcement Learning with Explanation and Context-Aware Data Augmentation | Unknown | N/A | |
| Iterative Causal Discovery in the Possible Presence of Latent Confounders and Selection Bias | Unknown | N/A | |
| Fast Extra Gradient Methods for Smooth Structured Nonconvex-Nonconcave Minimax Problems | Unknown | N/A | |
| VATT: Transformers for Multimodal Self-Supervised Learning from Raw Video, Audio and Text | Unknown | N/A | |
| Detecting Anomalous Event Sequences with Temporal Point Processes | Unknown | N/A | |
| ViTAE: Vision Transformer Advanced by Exploring Intrinsic Inductive Bias | Unknown | N/A | |
| End-to-end Multi-modal Video Temporal Grounding | Unknown | N/A | |
| Subgroup Generalization and Fairness of Graph Neural Networks | Unknown | N/A | |
| Improving Visual Quality of Image Synthesis by A Token-based Generator with Transformers | Unknown | N/A | |
| Online Convex Optimization with Continuous Switching Constraint | Unknown | N/A | |
| Never Go Full Batch (in Stochastic Convex Optimization) | Unknown | N/A | |
| PCA Initialization for Approximate Message Passing in Rotationally Invariant Models | Unknown | N/A | |
| Evaluating State-of-the-Art Classification Models Against Bayes Optimality | Unknown | N/A | |
| Policy Optimization in Adversarial MDPs: Improved Exploration via Dilated Bonuses | Unknown | N/A | |
| LADA: Look-Ahead Data Acquisition via Augmentation for Deep Active Learning | Unknown | N/A | |
| Searching Parameterized AP Loss for Object Detection | Unknown | N/A | |
| Matrix encoding networks for neural combinatorial optimization | Unknown | N/A | |
| Probabilistic Margins for Instance Reweighting in Adversarial Training | Unknown | N/A | |
| TOHAN: A One-step Approach towards Few-shot Hypothesis Adaptation | Unknown | N/A | |
| Learning Riemannian metric for disease progression modeling | Unknown | N/A | |
| Random Noise Defense Against Query-Based Black-Box Attacks | Unknown | N/A | |
| Exploiting Domain-Specific Features to Enhance Domain Generalization | Unknown | N/A | |
| Unsupervised Domain Adaptation with Dynamics-Aware Rewards in Reinforcement Learning | Unknown | N/A | |
| What’s a good imputation to predict with missing values? | Unknown | N/A | |
| Local policy search with Bayesian optimization | Unknown | N/A | |
| Twice regularized MDPs and the equivalence between robustness and regularization | Unknown | N/A | |
| Supervising the Transfer of Reasoning Patterns in VQA | Unknown | N/A | |
| On Robust Optimal Transport: Computational Complexity and Barycenter Computation | Unknown | N/A | |
| Deconvolutional Networks on Graph Data | Unknown | N/A | |
| Set Prediction in the Latent Space | Unknown | N/A | |
| Learning High-Precision Bounding Box for Rotated Object Detection via Kullback-Leibler Divergence | Unknown | N/A | |
| Ensembling Graph Predictions for AMR Parsing | Unknown | N/A | |
| PartialFed: Cross-Domain Personalized Federated Learning via Partial Initialization | Unknown | N/A | |
| Predicting Event Memorability from Contextual Visual Semantics | Unknown | N/A | |
| Bounds all around: training energy-based models with bidirectional bounds | Unknown | N/A | |
| Online Sign Identification: Minimization of the Number of Errors in Thresholding Bandits | Unknown | N/A | |
| Artistic Style Transfer with Internal-external Learning and Contrastive Learning | Unknown | N/A | |
| Self-Attention Between Datapoints: Going Beyond Individual Input-Output Pairs in Deep Learning | Unknown | N/A | |
| Consistency Regularization for Variational Auto-Encoders | Unknown | N/A | |
| The Implicit Bias of Minima Stability: A View from Function Space | Unknown | N/A | |
| What can linearized neural networks actually say about generalization? | Unknown | N/A | |
| Neighborhood Reconstructing Autoencoders | Unknown | N/A | |
| SLOE: A Faster Method for Statistical Inference in High-Dimensional Logistic Regression | Unknown | N/A | |
| On Interaction Between Augmentations and Corruptions in Natural Corruption Robustness | Unknown | N/A | |
| Variational Multi-Task Learning with Gumbel-Softmax Priors | Unknown | N/A | |
| Noether’s Learning Dynamics: Role of Symmetry Breaking in Neural Networks | Unknown | N/A | |
| Posterior Meta-Replay for Continual Learning | Unknown | N/A | |
| SSUL: Semantic Segmentation with Unknown Label for Exemplar-based Class-Incremental Learning | Unknown | N/A | |
| Variational Diffusion Models | Unknown | N/A | |
| Collaborative Uncertainty in Multi-Agent Trajectory Forecasting | Unknown | N/A | |
| ResT: An Efficient Transformer for Visual Recognition | Unknown | N/A | |
| Unsupervised Object-Level Representation Learning from Scene Images | Unknown | N/A | |
| Locality defeats the curse of dimensionality in convolutional teacher-student scenarios | Unknown | N/A | |
| Learning Theory Can (Sometimes) Explain Generalisation in Graph Neural Networks | Unknown | N/A | |
| Fast rates for prediction with limited expert advice | Unknown | N/A | |
| Unsupervised Representation Transfer for Small Networks: I Believe I Can Distill On-the-Fly | Unknown | N/A | |
| Episodic Multi-agent Reinforcement Learning with Curiosity-driven Exploration | Unknown | N/A | |
| Speech Separation Using an Asynchronous Fully Recurrent Convolutional Neural Network | Unknown | N/A | |
| Rectifying the Shortcut Learning of Background for Few-Shot Learning | Unknown | N/A | |
| To The Point: Correspondence-driven monocular 3D category reconstruction | Unknown | N/A | |
| Robustness via Uncertainty-aware Cycle Consistency | Unknown | N/A | |
| Counterexample Guided RL Policy Refinement Using Bayesian Optimization | Unknown | N/A | |
| Foundations of Symbolic Languages for Model Interpretability | Unknown | N/A | |
| Wisdom of the Crowd Voting: Truthful Aggregation of Voter Information and Preferences | Unknown | N/A | |
| Rate-Optimal Subspace Estimation on Random Graphs | Unknown | N/A | |
| Inverse Optimal Control Adapted to the Noise Characteristics of the Human Sensorimotor System | Unknown | N/A | |
| Convergence of adaptive algorithms for constrained weakly convex optimization | Unknown | N/A | |
| Gradient Driven Rewards to Guarantee Fairness in Collaborative Machine Learning | Unknown | N/A | |
| Model-Based Reinforcement Learning via Imagination with Derived Memory | Unknown | N/A | |
| Pareto Domain Adaptation | Unknown | N/A | |
| Optimal Rates for Nonparametric Density Estimation under Communication Constraints | Unknown | N/A | |
| Distilling Robust and Non-Robust Features in Adversarial Examples by Information Bottleneck | Unknown | N/A | |
| Neo-GNNs: Neighborhood Overlap-aware Graph Neural Networks for Link Prediction | Unknown | N/A | |
| Federated Split Task-Agnostic Vision Transformer for COVID-19 CXR Diagnosis | Unknown | N/A | |
| DRIVE: One-bit Distributed Mean Estimation | Unknown | N/A | |
| MIRACLE: Causally-Aware Imputation via Learning Missing Data Mechanisms | Unknown | N/A | |
| Do Transformers Really Perform Badly for Graph Representation? | Unknown | N/A | |
| Diversity Enhanced Active Learning with Strictly Proper Scoring Rules | Unknown | N/A | |
| Fast Federated Learning in the Presence of Arbitrary Device Unavailability | Unknown | N/A | |
| Clockwork Variational Autoencoders | Unknown | N/A | |
| Learning Domain Invariant Representations in Goal-conditioned Block MDPs | Unknown | N/A | |
| Rethinking conditional GAN training: An approach using geometrically structured latent manifolds | Unknown | N/A | |
| Learning Space Partitions for Path Planning | Unknown | N/A | |
| Independent Prototype Propagation for Zero-Shot Compositionality | Unknown | N/A | |
| A Normative and Biologically Plausible Algorithm for Independent Component Analysis | Unknown | N/A | |
| Representing Hyperbolic Space Accurately using Multi-Component Floats | Unknown | N/A | |
| Compacter: Efficient Low-Rank Hypercomplex Adapter Layers | Unknown | N/A | |
| Proportional Participatory Budgeting with Additive Utilities | Unknown | N/A | |
| Landmark-RxR: Solving Vision-and-Language Navigation with Fine-Grained Alignment Supervision | Unknown | N/A | |
| Streaming Linear System Identification with Reverse Experience Replay | Unknown | N/A | |
| Estimating Multi-cause Treatment Effects via Single-cause Perturbation | Unknown | N/A | |
| DualNet: Continual Learning, Fast and Slow | Unknown | N/A | |
| End-to-end reconstruction meets data-driven regularization for inverse problems | Unknown | N/A | |
| Garment4D: Garment Reconstruction from Point Cloud Sequences | Unknown | N/A | |
| Identification of the Generalized Condorcet Winner in Multi-dueling Bandits | Unknown | N/A | |
| Learning Collaborative Policies to Solve NP-hard Routing Problems | Unknown | N/A | |
| Towards Scalable Unpaired Virtual Try-On via Patch-Routed Spatially-Adaptive GAN | Unknown | N/A | |
| On the Second-order Convergence Properties of Random Search Methods | Unknown | N/A | |
| Combating Noise: Semi-supervised Learning by Region Uncertainty Quantification | Unknown | N/A | |
| Neural Bellman-Ford Networks: A General Graph Neural Network Framework for Link Prediction | Unknown | N/A | |
| Natural continual learning: success is a journey, not (just) a destination | Unknown | N/A | |
| Parameterized Knowledge Transfer for Personalized Federated Learning | Unknown | N/A | |
| ToAlign: Task-Oriented Alignment for Unsupervised Domain Adaptation | Unknown | N/A | |
| Integrated Latent Heterogeneity and Invariance Learning in Kernel Space | Unknown | N/A | |
| Variational Inference for Continuous-Time Switching Dynamical Systems | Unknown | N/A | |
| Improve Agents without Retraining: Parallel Tree Search with Off-Policy Correction | Unknown | N/A | |
| 3D Pose Transfer with Correspondence Learning and Mesh Refinement | Unknown | N/A | |
| Fast Approximation of the Sliced-Wasserstein Distance Using Concentration of Random Projections | Unknown | N/A | |
| Efficient Learning of Discrete-Continuous Computation Graphs | Unknown | N/A | |
| From global to local MDI variable importances for random forests and when they are Shapley values | Unknown | N/A | |
| Limiting fluctuation and trajectorial stability of multilayer neural networks with mean field training | Unknown | N/A | |
| Nearly-Tight and Oblivious Algorithms for Explainable Clustering | Unknown | N/A | |
| Lower Bounds on Metropolized Sampling Methods for Well-Conditioned Distributions | Unknown | N/A | |
| Learning to Generate Realistic Noisy Images via Pixel-level Noise-aware Adversarial Training | Unknown | N/A | |
| Safe Reinforcement Learning by Imagining the Near Future | Unknown | N/A | |
| BernNet: Learning Arbitrary Graph Spectral Filters via Bernstein Approximation | Unknown | N/A | |
| Structured Denoising Diffusion Models in Discrete State-Spaces | Unknown | N/A | |
| An Information-theoretic Approach to Distribution Shifts | Unknown | N/A | |
| Offline Reinforcement Learning with Reverse Model-based Imagination | Unknown | N/A | |
| On learning sparse vectors from mixture of responses | Unknown | N/A | |
| SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers | Unknown | N/A | |
| The Role of Global Labels in Few-Shot Classification and How to Infer Them | Unknown | N/A | |
| Predify: Augmenting deep neural networks with brain-inspired predictive coding dynamics | Unknown | N/A | |
| Meta-Learning Sparse Implicit Neural Representations | Unknown | N/A | |
| Pruning Randomly Initialized Neural Networks with Iterative Randomization | Unknown | N/A | |
| Periodic Activation Functions Induce Stationarity | Unknown | N/A | |
| Stateful Strategic Regression | Unknown | N/A | |
| On the Estimation Bias in Double Q-Learning | Unknown | N/A | |
| Disentangling Identifiable Features from Noisy Data with Structured Nonlinear ICA | Unknown | N/A | |
| A Faster Maximum Cardinality Matching Algorithm with Applications in Machine Learning | Unknown | N/A | |
| Few-Shot Segmentation via Cycle-Consistent Transformer | Unknown | N/A | |
| Augmented Shortcuts for Vision Transformers | Unknown | N/A | |
| Adversarial Reweighting for Partial Domain Adaptation | Unknown | N/A | |
| Instance-dependent Label-noise Learning under a Structural Causal Model | Unknown | N/A | |
| Auto-Encoding Knowledge Graph for Unsupervised Medical Report Generation | Unknown | N/A | |
| Optimizing Reusable Knowledge for Continual Learning via Metalearning | Unknown | N/A | |
| Breaking the centralized barrier for cross-device federated learning | Unknown | N/A | |
| Towards Enabling Meta-Learning from Target Models | Unknown | N/A | |
| Universal Semi-Supervised Learning | Unknown | N/A | |
| The Emergence of Objectness: Learning Zero-shot Segmentation from Videos | Unknown | N/A | |
| CoFrNets: Interpretable Neural Architecture Inspired by Continued Fractions | Unknown | N/A | |
| Confidence-Aware Imitation Learning from Demonstrations with Varying Optimality | Unknown | N/A | |
| Unsupervised Foreground Extraction via Deep Region Competition | Unknown | N/A | |
| DeepSITH: Efficient Learning via Decomposition of What and When Across Time Scales | Unknown | N/A | |
| Factored Policy Gradients: Leveraging Structure for Efficient Learning in MOMDPs | Unknown | N/A | |
| From Canonical Correlation Analysis to Self-supervised Graph Neural Networks | Unknown | N/A | |
| Powerpropagation: A sparsity inducing weight reparameterisation | Unknown | N/A | |
| Towards a Unified Game-Theoretic View of Adversarial Perturbations and Robustness | Unknown | N/A | |
| Deep Residual Learning in Spiking Neural Networks | Unknown | N/A | |
| Semi-Supervised Semantic Segmentation via Adaptive Equalization Learning | Unknown | N/A | |
| Learning curves of generic features maps for realistic datasets with a teacher-student model | Unknown | N/A | |
| Stochastic Shortest Path: Minimax, Parameter-Free and Towards Horizon-Free Regret | Unknown | N/A | |
| SyncTwin: Treatment Effect Estimation with Longitudinal Outcomes | Unknown | N/A | |
| Rethinking Space-Time Networks with Improved Memory Coverage for Efficient Video Object Segmentation | Unknown | N/A | |
| Recurrent Bayesian Classifier Chains for Exact Multi-Label Classification | Unknown | N/A | |
| Meta-Learning the Search Distribution of Black-Box Random Search Based Adversarial Attacks | Unknown | N/A | |
| When Are Solutions Connected in Deep Networks? | Unknown | N/A | |
| SWAD: Domain Generalization by Seeking Flat Minima | Unknown | N/A | |
| Efficient Neural Network Training via Forward and Backward Propagation Sparsification | Unknown | N/A | |
| Least Square Calibration for Peer Reviews | Unknown | N/A | |
| Differentiable Spike: Rethinking Gradient-Descent for Training Spiking Neural Networks | Unknown | N/A | |
| Choose a Transformer: Fourier or Galerkin | Unknown | N/A | |
| MixACM: Mixup-Based Robustness Transfer via Distillation of Activated Channel Maps | Unknown | N/A | |
| Gauge Equivariant Transformer | Unknown | N/A | |
| An Axiomatic Theory of Provably-Fair Welfare-Centric Machine Learning | Unknown | N/A | |
| Mastering Atari Games with Limited Data | Unknown | N/A | |
| Contextual Similarity Aggregation with Self-attention for Visual Re-ranking | Unknown | N/A | |
| Distributed Saddle-Point Problems Under Data Similarity | Unknown | N/A | |
| Online Variational Filtering and Parameter Learning | Unknown | N/A | |
| Support vector machines and linear regression coincide with very high-dimensional features | Unknown | N/A | |
| Information Directed Reward Learning for Reinforcement Learning | Unknown | N/A | |
| Rank Overspecified Robust Matrix Recovery: Subgradient Method and Exact Recovery | Unknown | N/A | |
| Associating Objects with Transformers for Video Object Segmentation | Unknown | N/A | |
| Learning in Non-Cooperative Configurable Markov Decision Processes | Unknown | N/A | |
| Partial success in closing the gap between human and machine vision | Unknown | N/A | |
| No-regret Online Learning over Riemannian Manifolds | Unknown | N/A | |
| On Effective Scheduling of Model-based Reinforcement Learning | Unknown | N/A | |
| Dual Parameterization of Sparse Variational Gaussian Processes | Unknown | N/A | |
| Online Facility Location with Multiple Advice | Unknown | N/A | |
| Agent Modelling under Partial Observability for Deep Reinforcement Learning | Unknown | N/A | |
| Self-Supervised Learning Disentangled Group Representation as Feature | Unknown | N/A | |
| MOMA: Multi-Object Multi-Actor Activity Parsing | Unknown | N/A | |
| Optimizing Information-theoretical Generalization Bound via Anisotropic Noise of SGLD | Unknown | N/A | |
| Batched Thompson Sampling | Unknown | N/A | |
| Shape your Space: A Gaussian Mixture Regularization Approach to Deterministic Autoencoders | Unknown | N/A | |
| On the Bias-Variance-Cost Tradeoff of Stochastic Optimization | Unknown | N/A | |
| R-Drop: Regularized Dropout for Neural Networks | Unknown | N/A | |
| Hard-Attention for Scalable Image Classification | Unknown | N/A | |
| A Faster Decentralized Algorithm for Nonconvex Minimax Problems | Unknown | N/A | |
| Co-evolution Transformer for Protein Contact Prediction | Unknown | N/A | |
| Dynamic COVID risk assessment accounting for community virus exposure from a spatial-temporal transmission model | Unknown | N/A | |
| The balancing principle for parameter choice in distance-regularized domain adaptation | Unknown | N/A | |
| Large-Scale Wasserstein Gradient Flows | Unknown | N/A | |
| Non-Gaussian Gaussian Processes for Few-Shot Regression | Unknown | N/A | |
| Robustness between the worst and average case | Unknown | N/A | |
| Alignment Attention by Matching Key and Query Distributions | Unknown | N/A | |
| Learning Conjoint Attentions for Graph Neural Nets | Unknown | N/A | |
| FedDR – Randomized Douglas-Rachford Splitting Algorithms for Nonconvex Federated Composite Optimization | Unknown | N/A | |
| Neural Symplectic Form: Learning Hamiltonian Equations on General Coordinate Systems | Unknown | N/A | |
| Open Rule Induction | Unknown | N/A | |
| Biological learning in key-value memory networks | Unknown | N/A | |
| Mixed Supervised Object Detection by Transferring Mask Prior and Semantic Similarity | Unknown | N/A | |
| An Improved Analysis of Gradient Tracking for Decentralized Machine Learning | Unknown | N/A | |
| Task-Adaptive Neural Network Search with Meta-Contrastive Learning | Unknown | N/A | |
| Minimax Optimal Quantile and Semi-Adversarial Regret via Root-Logarithmic Regularizers | Unknown | N/A | |
| On Inductive Biases for Heterogeneous Treatment Effect Estimation | Unknown | N/A | |
| Deconditional Downscaling with Gaussian Processes | Unknown | N/A | |
| Understanding Instance-based Interpretability of Variational Auto-Encoders | Unknown | N/A | |
| Self-Supervised GANs with Label Augmentation | Unknown | N/A | |
| Dynamic Distillation Network for Cross-Domain Few-Shot Recognition with Unlabeled Data | Unknown | N/A | |
| Goal-Aware Cross-Entropy for Multi-Target Reinforcement Learning | Unknown | N/A | |
| Fair Scheduling for Time-dependent Resources | Unknown | N/A | |
| Distilling Image Classifiers in Object Detectors | Unknown | N/A | |
| Discovery of Options via Meta-Learned Subgoals | Unknown | N/A | |
| Balanced Chamfer Distance as a Comprehensive Metric for Point Cloud Completion | Unknown | N/A | |
| CO-PILOT: COllaborative Planning and reInforcement Learning On sub-Task curriculum | Unknown | N/A | |
| Topographic VAEs learn Equivariant Capsules | Unknown | N/A | |
| Self-Consistent Models and Values | Unknown | N/A | |
| Implicit MLE: Backpropagating Through Discrete Exponential Family Distributions | Unknown | N/A | |
| On Linear Stability of SGD and Input-Smoothness of Neural Networks | Unknown | N/A | |
| Adversarial Training Helps Transfer Learning via Better Representations | Unknown | N/A | |
| Going Beyond Linear Transformers with Recurrent Fast Weight Programmers | Unknown | N/A | |
| Regret Minimization Experience Replay in Off-Policy Reinforcement Learning | Unknown | N/A | |
| PreferenceNet: Encoding Human Preferences in Auction Design with Deep Learning | Unknown | N/A | |
| Self-Supervised Representation Learning on Neural Network Weights for Model Characteristic Prediction | Unknown | N/A | |
| Learning to Learn Graph Topologies | Unknown | N/A | |
| AutoGEL: An Automated Graph Neural Network with Explicit Link Information | Unknown | N/A | |
| Interpreting Representation Quality of DNNs for 3D Point Cloud Processing | Unknown | N/A | |
| Low-Rank Constraints for Fast Inference in Structured Models | Unknown | N/A | |
| On the Equivalence between Neural Network and Support Vector Machine | Unknown | N/A | |
| Active Assessment of Prediction Services as Accuracy Surface Over Attribute Combinations | Unknown | N/A | |
| Learning Equivariant Energy Based Models with Equivariant Stein Variational Gradient Descent | Unknown | N/A | |
| FlexMatch: Boosting Semi-Supervised Learning with Curriculum Pseudo Labeling | Unknown | N/A | |
| Adversarially robust learning for security-constrained optimal power flow | Unknown | N/A | |
| Stochastic Optimization of Areas Under Precision-Recall Curves with Provable Convergence | Unknown | N/A | |
| MST: Masked Self-Supervised Transformer for Visual Representation | Unknown | N/A | |
| Generalized Shape Metrics on Neural Representations | Unknown | N/A | |
| Faster Neural Network Training with Approximate Tensor Operations | Unknown | N/A | |
| Personalized Federated Learning With Gaussian Processes | Unknown | N/A | |
| ReSSL: Relational Self-Supervised Learning with Weak Augmentation | Unknown | N/A | |
| Aligned Structured Sparsity Learning for Efficient Image Super-Resolution | Unknown | N/A | |
| Differentiable Quality Diversity | Unknown | N/A | |
| Recurrence along Depth: Deep Convolutional Neural Networks with Recurrent Layer Aggregation | Unknown | N/A | |
| Efficient Equivariant Network | Unknown | N/A | |
| The functional specialization of visual cortex emerges from training parallel pathways with self-supervised predictive learning | Unknown | N/A | |
| Alias-Free Generative Adversarial Networks | Unknown | N/A | |
| Statistically and Computationally Efficient Linear Meta-representation Learning | Unknown | N/A | |
| Towards Unifying Behavioral and Response Diversity for Open-ended Learning in Zero-sum Games | Unknown | N/A | |
| Hardware-adaptive Efficient Latency Prediction for NAS via Meta-Learning | Unknown | N/A | |
| Deceive D: Adaptive Pseudo Augmentation for GAN Training with Limited Data | Unknown | N/A | |
| Safe Reinforcement Learning with Natural Language Constraints | Unknown | N/A | |
| Improving Self-supervised Learning with Automated Unsupervised Outlier Arbitration | Unknown | N/A | |
| Dynamical Wasserstein Barycenters for Time-series Modeling | Unknown | N/A | |
| Global-aware Beam Search for Neural Abstractive Summarization | Unknown | N/A | |
| Optimal Order Simple Regret for Gaussian Process Bandits | Unknown | N/A | |
| Invariant Causal Imitation Learning for Generalizable Policies | Unknown | N/A | |
| Directed Probabilistic Watershed | Unknown | N/A | |
| Shape from Blur: Recovering Textured 3D Shape and Motion of Fast Moving Objects | Unknown | N/A | |
| STORM+: Fully Adaptive SGD with Recursive Momentum for Nonconvex Optimization | Unknown | N/A | |
| Counterfactual Explanations in Sequential Decision Making Under Uncertainty | Unknown | N/A | |
| Diversity Matters When Learning From Ensembles | Unknown | N/A | |
| Qimera: Data-free Quantization with Synthetic Boundary Supporting Samples | Unknown | N/A | |
| Make Sure You're Unsure: A Framework for Verifying Probabilistic Specifications | Unknown | N/A | |
| Beyond Bandit Feedback in Online Multiclass Classification | Unknown | N/A | |
| Learning Fast-Inference Bayesian Networks | Unknown | N/A | |
| Physics-Aware Downsampling with Deep Learning for Scalable Flood Modeling | Unknown | N/A | |
| Directed Graph Contrastive Learning | Unknown | N/A | |
| Neural Auto-Curricula in Two-Player Zero-Sum Games | Unknown | N/A | |
| Determinantal point processes based on orthogonal polynomials for sampling minibatches in SGD | Unknown | N/A | |
| Asynchronous Decentralized Online Learning | Unknown | N/A | |
| Diffusion Normalizing Flow | Unknown | N/A | |
| A sampling-based circuit for optimal decision making | Unknown | N/A | |
| Demystifying and Generalizing BinaryConnect | Unknown | N/A | |
| Learning Transferable Features for Point Cloud Detection via 3D Contrastive Co-training | Unknown | N/A | |
| Bayesian Optimization with High-Dimensional Outputs | Unknown | N/A | |
| Using Random Effects to Account for High-Cardinality Categorical Features and Repeated Measures in Deep Neural Networks | Unknown | N/A | |
| HRFormer: High-Resolution Vision Transformer for Dense Predict | Unknown | N/A | |
| Graph Adversarial Self-Supervised Learning | Unknown | N/A | |
| The Image Local Autoregressive Transformer | Unknown | N/A | |
| Fine-grained Generalization Analysis of Inductive Matrix Completion | Unknown | N/A | |
| Canonical Capsules: Self-Supervised Capsules in Canonical Pose | Unknown | N/A | |
| On the Power of Edge Independent Graph Models | Unknown | N/A | |
| On the Theory of Reinforcement Learning with Once-per-Episode Feedback | Unknown | N/A | |
| Conflict-Averse Gradient Descent for Multi-task learning | Unknown | N/A | |
| Near-optimal Offline and Streaming Algorithms for Learning Non-Linear Dynamical Systems | Unknown | N/A | |
| Predicting What You Already Know Helps: Provable Self-Supervised Learning | Unknown | N/A | |
| Fair Sortition Made Transparent | Unknown | N/A | |
| Denoising Normalizing Flow | Unknown | N/A | |
| TopicNet: Semantic Graph-Guided Topic Discovery | Unknown | N/A | |
| Effective Meta-Regularization by Kernelized Proximal Regularization | Unknown | N/A | |
| No RL, No Simulation: Learning to Navigate without Navigating | Unknown | N/A | |
| Knowledge-Adaptation Priors | Unknown | N/A | |
| Universal Approximation Using Well-Conditioned Normalizing Flows | Unknown | N/A | |
| Domain Invariant Representation Learning with Domain Density Transformations | Unknown | N/A | |
| OSOA: One-Shot Online Adaptation of Deep Generative Models for Lossless Compression | Unknown | N/A | |
| Is Automated Topic Model Evaluation Broken? The Incoherence of Coherence | Unknown | N/A | |
| VAST: Value Function Factorization with Variable Agent Sub-Teams | Unknown | N/A | |
| Relaxed Marginal Consistency for Differentially Private Query Answering | Unknown | N/A | |
| Neural Flows: Efficient Alternative to Neural ODEs | Unknown | N/A | |
| Square Root Principal Component Pursuit: Tuning-Free Noisy Robust Matrix Recovery | Unknown | N/A | |
| Fast Training Method for Stochastic Compositional Optimization Problems | Unknown | N/A | |
| Fast Projection onto the Capped Simplex with Applications to Sparse Regression in Bioinformatics | Unknown | N/A | |
| MobTCast: Leveraging Auxiliary Trajectory Forecasting for Human Mobility Prediction | Unknown | N/A | |
| Reliable Causal Discovery with Improved Exact Search and Weaker Assumptions | Unknown | N/A | |
| Adaptive Sampling for Minimax Fair Classification | Unknown | N/A | |
| Relative Flatness and Generalization | Unknown | N/A | |
| Mini-Batch Consistent Slot Set Encoder for Scalable Set Encoding | Unknown | N/A | |
| Stochastic Solutions for Linear Inverse Problems using the Prior Implicit in a Denoiser | Unknown | N/A | |
| RelaySum for Decentralized Deep Learning on Heterogeneous Data | Unknown | N/A | |
| FMMformer: Efficient and Flexible Transformer via Decomposed Near-field and Far-field Attention | Unknown | N/A | |
| Gaussian Kernel Mixture Network for Single Image Defocus Deblurring | Unknown | N/A | |
| Global Filter Networks for Image Classification | Unknown | N/A | |
| No Fear of Heterogeneity: Classifier Calibration for Federated Learning with Non-IID Data | Unknown | N/A | |
| Linear-Time Probabilistic Solution of Boundary Value Problems | Unknown | N/A | |
| Adaptive Online Packing-guided Search for POMDPs | Unknown | N/A | |
| Topological Relational Learning on Graphs | Unknown | N/A | |
| MobILE: Model-Based Imitation Learning From Observation Alone | Unknown | N/A | |
| Multi-Label Learning with Pairwise Relevance Ordering | Unknown | N/A | |
| Volume Rendering of Neural Implicit Surfaces | Unknown | N/A | |
| Loss function based second-order Jensen inequality and its application to particle variational inference | Unknown | N/A | |
| Towards Robust and Reliable Algorithmic Recourse | Unknown | N/A | |
| DROID-SLAM: Deep Visual SLAM for Monocular, Stereo, and RGB-D Cameras | Unknown | N/A | |
| Does Preprocessing Help Training Over-parameterized Neural Networks? | Unknown | N/A | |
| Adversarial Robustness with Semi-Infinite Constrained Learning | Unknown | N/A | |
| Don’t Generate Me: Training Differentially Private Generative Models with Sinkhorn Divergence | Unknown | N/A | |
| Optimizing Conditional Value-At-Risk of Black-Box Functions | Unknown | N/A | |
| Learning to dehaze with polarization | Unknown | N/A | |
| TestRank: Bringing Order into Unlabeled Test Instances for Deep Learning Tasks | Unknown | N/A | |
| Federated Linear Contextual Bandits | Unknown | N/A | |
| Fast Doubly-Adaptive MCMC to Estimate the Gibbs Partition Function with Weak Mixing Time Bounds | Unknown | N/A | |
| An Efficient Transfer Learning Framework for Multiagent Reinforcement Learning | Unknown | N/A | |
| Calibrating Predictions to Decisions: A Novel Approach to Multi-Class Calibration | Unknown | N/A | |
| Implicit Transformer Network for Screen Content Image Continuous Super-Resolution | Unknown | N/A | |
| Do Input Gradients Highlight Discriminative Features? | Unknown | N/A | |
| Three Operator Splitting with Subgradients, Stochastic Gradients, and Adaptive Learning Rates | Unknown | N/A | |
| Linear Convergence in Federated Learning: Tackling Client Heterogeneity and Sparse Gradients | Unknown | N/A | |
| 3D Siamese Voxel-to-BEV Tracker for Sparse Point Clouds | Unknown | N/A | |
| Training for the Future: A Simple Gradient Interpolation Loss to Generalize Along Time | Unknown | N/A | |
| Dense Unsupervised Learning for Video Segmentation | Unknown | N/A | |
| Scalable Quasi-Bayesian Inference for Instrumental Variable Regression | Unknown | N/A | |
| Ultrahyperbolic Neural Networks | Unknown | N/A | |
| Drop, Swap, and Generate: A Self-Supervised Approach for Generating Neural Activity | Unknown | N/A | |
| Evaluating Gradient Inversion Attacks and Defenses in Federated Learning | Unknown | N/A | |
| Neural Hybrid Automata: Learning Dynamics With Multiple Modes and Stochastic Transitions | Unknown | N/A | |
| Rot-Pro: Modeling Transitivity by Projection in Knowledge Graph Embedding | Unknown | N/A | |
| Gradient Inversion with Generative Image Prior | Unknown | N/A | |
| Action-guided 3D Human Motion Prediction | Unknown | N/A | |
| SmoothMix: Training Confidence-calibrated Smoothed Classifiers for Certified Robustness | Unknown | N/A | |
| Higher Order Kernel Mean Embeddings to Capture Filtrations of Stochastic Processes | Unknown | N/A | |
| Learning to Learn Dense Gaussian Processes for Few-Shot Learning | Unknown | N/A | |
| Achieving Rotational Invariance with Bessel-Convolutional Neural Networks | Unknown | N/A | |
| Online Learning and Control of Complex Dynamical Systems from Sensory Input | Unknown | N/A | |
| Intermediate Layers Matter in Momentum Contrastive Self Supervised Learning | Unknown | N/A | |
| End-to-End Training of Multi-Document Reader and Retriever for Open-Domain Question Answering | Unknown | N/A | |
| Reinforcement learning for optimization of variational quantum circuit architectures | Unknown | N/A | |
| Efficient First-Order Contextual Bandits: Prediction, Allocation, and Triangular Discrimination | Unknown | N/A | |
| Active clustering for labeling training data | Unknown | N/A | |
| Dynamic Normalization and Relay for Video Action Recognition | Unknown | N/A | |
| Local Differential Privacy for Regret Minimization in Reinforcement Learning | Unknown | N/A | |
| Predicting Molecular Conformation via Dynamic Graph Score Matching | Unknown | N/A | |
| Identification and Estimation of Joint Probabilities of Potential Outcomes in Observational Studies with Covariate Information | Unknown | N/A | |
| Residual Pathway Priors for Soft Equivariance Constraints | Unknown | N/A | |
| Robust Deep Reinforcement Learning through Adversarial Loss | Unknown | N/A | |
| Accelerating Robotic Reinforcement Learning via Parameterized Action Primitives | Unknown | N/A | |
| Interesting Object, Curious Agent: Learning Task-Agnostic Exploration | Unknown | N/A | |
| ASSANet: An Anisotropic Separable Set Abstraction for Efficient Point Cloud Representation Learning | Unknown | N/A | |
| Smooth Normalizing Flows | Unknown | N/A | |
| Directional Message Passing on Molecular Graphs via Synthetic Coordinates | Unknown | N/A | |
| Lower Bounds and Optimal Algorithms for Smooth and Strongly Convex Decentralized Optimization Over Time-Varying Networks | Unknown | N/A | |
| On Contrastive Representations of Stochastic Processes | Unknown | N/A | |
| Joint inference and input optimization in equilibrium networks | Unknown | N/A | |
| Black Box Probabilistic Numerics | Unknown | N/A | |
| STEM: A Stochastic Two-Sided Momentum Algorithm Achieving Near-Optimal Sample and Communication Complexities for Federated Learning | Unknown | N/A | |
| NTopo: Mesh-free Topology Optimization using Implicit Neural Representations | Unknown | N/A | |
| A 3D Generative Model for Structure-Based Drug Design | Unknown | N/A | |
| Circa: Stochastic ReLUs for Private Deep Learning | Unknown | N/A | |
| Explaining Hyperparameter Optimization via Partial Dependence Plots | Unknown | N/A | |
| Learning Causal Semantic Representation for Out-of-Distribution Prediction | Unknown | N/A | |
| Charting and Navigating the Space of Solutions for Recurrent Neural Networks | Unknown | N/A | |
| Disentangling the Roles of Curation, Data-Augmentation and the Prior in the Cold Posterior Effect | Unknown | N/A | |
| Manipulating SGD with Data Ordering Attacks | Unknown | N/A | |
| Practical Large-Scale Linear Programming using Primal-Dual Hybrid Gradient | Unknown | N/A | |
| Recovery Analysis for Plug-and-Play Priors using the Restricted Eigenvalue Condition | Unknown | N/A | |
| Do Different Tracking Tasks Require Different Appearance Models? | Unknown | N/A | |
| Online Learning in Periodic Zero-Sum Games | Unknown | N/A | |
| CentripetalText: An Efficient Text Instance Representation for Scene Text Detection | Unknown | N/A | |
| Mosaicking to Distill: Knowledge Distillation from Out-of-Domain Data | Unknown | N/A | |
| Image Generation using Continuous Filter Atoms | Unknown | N/A | |
| Beltrami Flow and Neural Diffusion on Graphs | Unknown | N/A | |
| Multimodal Few-Shot Learning with Frozen Language Models | Unknown | N/A | |
| Averaging on the Bures-Wasserstein manifold: dimension-free convergence of gradient descent | Unknown | N/A | |
| Fast Bayesian Inference for Gaussian Cox Processes via Path Integral Formulation | Unknown | N/A | |
| NORESQA: A Framework for Speech Quality Assessment using Non-Matching References | Unknown | N/A | |
| Duplex Sequence-to-Sequence Learning for Reversible Machine Translation | Unknown | N/A | |
| Coupled Gradient Estimators for Discrete Latent Variables | Unknown | N/A | |
| Perturbation-based Regret Analysis of Predictive Control in Linear Time Varying Systems | Unknown | N/A | |
| Metropolis-Hastings Data Augmentation for Graph Neural Networks | Unknown | N/A | |
| Private Non-smooth ERM and SCO in Subquadratic Steps | Unknown | N/A | |
| Automatic Symmetry Discovery with Lie Algebra Convolutional Network | Unknown | N/A | |
| Finding Bipartite Components in Hypergraphs | Unknown | N/A | |
| Gone Fishing: Neural Active Learning with Fisher Embeddings | Unknown | N/A | |
| SketchGen: Generating Constrained CAD Sketches | Unknown | N/A | |
| Dueling Bandits with Team Comparisons | Unknown | N/A | |
| The Effect of the Intrinsic Dimension on the Generalization of Quadratic Classifiers | Unknown | N/A | |
| Exponential Graph is Provably Efficient for Decentralized Deep Training | Unknown | N/A | |
| A Convergence Analysis of Gradient Descent on Graph Neural Networks | Unknown | N/A | |
| Design of Experiments for Stochastic Contextual Linear Bandits | Unknown | N/A | |
| Think Big, Teach Small: Do Language Models Distil Occam’s Razor? | Unknown | N/A | |
| Towards Calibrated Model for Long-Tailed Visual Recognition from Prior Perspective | Unknown | N/A | |
| On the Convergence Theory of Debiased Model-Agnostic Meta-Reinforcement Learning | Unknown | N/A | |
| Solving Min-Max Optimization with Hidden Structure via Gradient Descent Ascent | Unknown | N/A | |
| Cooperative Stochastic Bandits with Asynchronous Agents and Constrained Feedback | Unknown | N/A | |
| Latent Equilibrium: A unified learning theory for arbitrarily fast computation with arbitrarily slow neurons | Unknown | N/A | |
| On Locality of Local Explanation Models | Unknown | N/A | |
| Learning Signal-Agnostic Manifolds of Neural Fields | Unknown | N/A | |
| Convolutional Normalization: Improving Deep Convolutional Network Robustness and Training | Unknown | N/A | |
| Unsupervised Learning of Compositional Energy Concepts | Unknown | N/A | |
| Neural Circuit Synthesis from Specification Patterns | Unknown | N/A | |
| Spatiotemporal Joint Filter Decomposition in 3D Convolutional Neural Networks | Unknown | N/A | |
| Grounding Representation Similarity Through Statistical Testing | Unknown | N/A | |
| Uncertainty Calibration for Ensemble-Based Debiasing Methods | Unknown | N/A | |
| Activation Sharing with Asymmetric Paths Solves Weight Transport Problem without Bidirectional Connection | Unknown | N/A | |
| Neural Population Geometry Reveals the Role of Stochasticity in Robust Perception | Unknown | N/A | |
| Structure learning in polynomial time: Greedy algorithms, Bregman information, and exponential families | Unknown | N/A | |
| Who Leads and Who Follows in Strategic Classification? | Unknown | N/A | |
| Catalytic Role Of Noise And Necessity Of Inductive Biases In The Emergence Of Compositional Communication | Unknown | N/A | |
| Bandits with many optimal arms | Unknown | N/A | |
| Exploiting a Zoo of Checkpoints for Unseen Tasks | Unknown | N/A | |
| Offline Model-based Adaptable Policy Learning | Unknown | N/A | |
| Formalizing the Generalization-Forgetting Trade-off in Continual Learning | Unknown | N/A | |
| (Almost) Free Incentivized Exploration from Decentralized Learning Agents | Unknown | N/A | |
| Emergent Communication under Varying Sizes and Connectivities | Unknown | N/A | |
| Meta Learning Backpropagation And Improving It | Unknown | N/A | |
| Adaptable Agent Populations via a Generative Model of Policies | Unknown | N/A | |
| Faster Algorithms and Constant Lower Bounds for the Worst-Case Expected Error | Unknown | N/A | |
| Preserved central model for faster bidirectional compression in distributed settings | Unknown | N/A | |
| InfoGCL: Information-Aware Graph Contrastive Learning | Unknown | N/A | |
| Reinforced Few-Shot Acquisition Function Learning for Bayesian Optimization | Unknown | N/A | |
| Boosting with Multiple Sources | Unknown | N/A | |
| Dynamic Neural Representational Decoders for High-Resolution Semantic Segmentation | Unknown | N/A | |
| Towards Biologically Plausible Convolutional Networks | Unknown | N/A | |
| Minibatch and Momentum Model-based Methods for Stochastic Weakly Convex Optimization | Unknown | N/A | |
| Actively Identifying Causal Effects with Latent Variables Given Only Response Variable Observable | Unknown | N/A | |
| Gradual Domain Adaptation without Indexed Intermediate Domains | Unknown | N/A | |
| Understanding How Encoder-Decoder Architectures Attend | Unknown | N/A | |
| Moshpit SGD: Communication-Efficient Decentralized Training on Heterogeneous Unreliable Devices | Unknown | N/A | |
| PLUGIn: A simple algorithm for inverting generative models with recovery guarantees | Unknown | N/A | |
| Relative Uncertainty Learning for Facial Expression Recognition | Unknown | N/A | |
| Vector-valued Gaussian Processes on Riemannian Manifolds via Gauge Independent Projected Kernels | Unknown | N/A | |
| Manifold Topology Divergence: a Framework for Comparing Data Manifolds. | Unknown | N/A | |
| Bayesian Bellman Operators | Unknown | N/A | |
| Inverse Reinforcement Learning in a Continuous State Space with Formal Guarantees | Unknown | N/A | |
| Compositional Reinforcement Learning from Logical Specifications | Unknown | N/A | |
| One Question Answering Model for Many Languages with Cross-lingual Dense Passage Retrieval | Unknown | N/A | |
| Imitating Deep Learning Dynamics via Locally Elastic Stochastic Differential Equations | Unknown | N/A | |
| Dynamic population-based meta-learning for multi-agent communication with natural language | Unknown | N/A | |
| SurvITE: Learning Heterogeneous Treatment Effects from Time-to-Event Data | Unknown | N/A | |
| Model, sample, and epoch-wise descents: exact solution of gradient flow in the random feature model | Unknown | N/A | |
| Revitalizing CNN Attention via Transformers in Self-Supervised Visual Representation Learning | Unknown | N/A | |
| Instance-Conditional Knowledge Distillation for Object Detection | Unknown | N/A | |
| Entropic Desired Dynamics for Intrinsic Control | Unknown | N/A | |
| A unified framework for bandit multiple testing | Unknown | N/A | |
| Memory Efficient Meta-Learning with Large Images | Unknown | N/A | |
| A single gradient step finds adversarial examples on random two-layers neural networks | Unknown | N/A | |
| The Out-of-Distribution Problem in Explainability and Search Methods for Feature Importance Explanations | Unknown | N/A | |
| A Unified Approach to Fair Online Learning via Blackwell Approachability | Unknown | N/A | |
| On Component Interactions in Two-Stage Recommender Systems | Unknown | N/A | |
| Wasserstein Flow Meets Replicator Dynamics: A Mean-Field Analysis of Representation Learning in Actor-Critic | Unknown | N/A | |
| CARMS: Categorical-Antithetic-REINFORCE Multi-Sample Gradient Estimator | Unknown | N/A | |
| ReAct: Out-of-distribution Detection With Rectified Activations | Unknown | N/A | |
| Representer Point Selection via Local Jacobian Expansion for Post-hoc Classifier Explanation of Deep Neural Networks and Ensemble Models | Unknown | N/A | |
| Stability and Deviation Optimal Risk Bounds with Convergence Rate $O(1/n)$ | Unknown | N/A | |
| Efficient Training of Visual Transformers with Small Datasets | Unknown | N/A | |
| Combiner: Full Attention Transformer with Sparse Computation Cost | Unknown | N/A | |
| On the Frequency Bias of Generative Models | Unknown | N/A | |
| SSAL: Synergizing between Self-Training and Adversarial Learning for Domain Adaptive Object Detection | Unknown | N/A | |
| High-probability Bounds for Non-Convex Stochastic Optimization with Heavy Tails | Unknown | N/A | |
| Discrete-Valued Neural Communication | Unknown | N/A | |
| Robust Contrastive Learning Using Negative Samples with Diminished Semantics | Unknown | N/A | |
| Chebyshev-Cantelli PAC-Bayes-Bennett Inequality for the Weighted Majority Vote | Unknown | N/A | |
| Keeping Your Eye on the Ball: Trajectory Attention in Video Transformers | Unknown | N/A | |
| XDO: A Double Oracle Algorithm for Extensive-Form Games | Unknown | N/A | |
| From Optimality to Robustness: Adaptive Re-Sampling Strategies in Stochastic Bandits | Unknown | N/A | |
| History Aware Multimodal Transformer for Vision-and-Language Navigation | Unknown | N/A | |
| Reformulating Zero-shot Action Recognition for Multi-label Actions | Unknown | N/A | |
| The Utility of Explainable AI in Ad Hoc Human-Machine Teaming | Unknown | N/A | |
| Understanding Deflation Process in Over-parametrized Tensor Decomposition | Unknown | N/A | |
| Diffusion Schrödinger Bridge with Applications to Score-Based Generative Modeling | Unknown | N/A | |
| Covariance-Aware Private Mean Estimation Without Private Covariance Estimation | Unknown | N/A | |
| MEST: Accurate and Fast Memory-Economic Sparse Training Framework on the Edge | Unknown | N/A | |
| Generalized Linear Bandits with Local Differential Privacy | Unknown | N/A | |
| Scalable Diverse Model Selection for Accessible Transfer Learning | Unknown | N/A | |
| Unbiased Classification through Bias-Contrastive and Bias-Balanced Learning | Unknown | N/A | |
| Snowflake: Scaling GNNs to high-dimensional continuous control via parameter freezing | Unknown | N/A | |
| Distributional Reinforcement Learning for Multi-Dimensional Reward Functions | Unknown | N/A | |
| Learning Nonparametric Volterra Kernels with Gaussian Processes | Unknown | N/A | |
| Estimating the Unique Information of Continuous Variables | Unknown | N/A | |
| Towards Optimal Strategies for Training Self-Driving Perception Models in Simulation | Unknown | N/A | |
| Rethinking Graph Transformers with Spectral Attention | Unknown | N/A | |
| Continual Learning via Local Module Composition | Unknown | N/A | |
| Local Explanation of Dialogue Response Generation | Unknown | N/A | |
| Robust Visual Reasoning via Language Guided Neural Module Networks | Unknown | N/A | |
| Robust and differentially private mean estimation | Unknown | N/A | |
| Differentially Private Stochastic Optimization: New Results in Convex and Non-Convex Settings | Unknown | N/A | |
| Accurate Point Cloud Registration with Robust Optimal Transport | Unknown | N/A | |
| Efficient and Local Parallel Random Walks | Unknown | N/A | |
| RMM: Reinforced Memory Management for Class-Incremental Learning | Unknown | N/A | |
| Why Generalization in RL is Difficult: Epistemic POMDPs and Implicit Partial Observability | Unknown | N/A | |
| Comprehensive Knowledge Distillation with Causal Intervention | Unknown | N/A | |
| How Does it Sound? | Unknown | N/A | |
| Contrastive Laplacian Eigenmaps | Unknown | N/A | |
| Deep learning is adaptive to intrinsic dimensionality of model smoothness in anisotropic Besov space | Unknown | N/A | |
| Online Meta-Learning via Learning with Layer-Distributed Memory | Unknown | N/A | |
| Neural Program Generation Modulo Static Analysis | Unknown | N/A | |
| $(\textrm{Implicit})^2$: Implicit Layers for Implicit Representations | Unknown | N/A | |
| Sparsely Changing Latent States for Prediction and Planning in Partially Observable Domains | Unknown | N/A | |
| Leveraging the Inductive Bias of Large Language Models for Abstract Textual Reasoning | Unknown | N/A | |
| Small random initialization is akin to spectral learning: Optimization and generalization guarantees for overparameterized low-rank matrix reconstruction | Unknown | N/A | |
| Exact marginal prior distributions of finite Bayesian neural networks | Unknown | N/A | |
| Functional Regularization for Reinforcement Learning via Learned Fourier Features | Unknown | N/A | |
| Discovering Dynamic Salient Regions for Spatio-Temporal Graph Neural Networks | Unknown | N/A | |
| Training Over-parameterized Models with Non-decomposable Objectives | Unknown | N/A | |
| Stochastic Multi-Armed Bandits with Control Variates | Unknown | N/A | |
| Implicit Deep Adaptive Design: Policy-Based Experimental Design without Likelihoods | Unknown | N/A | |
| Reinforcement Learning Enhanced Explainer for Graph Neural Networks | Unknown | N/A | |
| RETRIEVE: Coreset Selection for Efficient and Robust Semi-Supervised Learning | Unknown | N/A | |
| Directed Spectrum Measures Improve Latent Network Models Of Neural Populations | Unknown | N/A | |
| Antipodes of Label Differential Privacy: PATE and ALIBI | Unknown | N/A | |
| POODLE: Improving Few-shot Learning via Penalizing Out-of-Distribution Samples | Unknown | N/A | |
| Proper Value Equivalence | Unknown | N/A | |
| Data driven semi-supervised learning | Unknown | N/A | |
| Trash or Treasure? An Interactive Dual-Stream Strategy for Single Image Reflection Separation | Unknown | N/A | |
| CAM-GAN: Continual Adaptation Modules for Generative Adversarial Networks | Unknown | N/A | |
| An Online Riemannian PCA for Stochastic Canonical Correlation Analysis | Unknown | N/A | |
| Deep Learning with Label Differential Privacy | Unknown | N/A | |
| Label Noise SGD Provably Prefers Flat Global Minimizers | Unknown | N/A | |
| Sparse Flows: Pruning Continuous-depth Models | Unknown | N/A | |
| Adversarial Examples in Multi-Layer Random ReLU Networks | Unknown | N/A | |
| Fast Certified Robust Training with Short Warmup | Unknown | N/A | |
| Realistic evaluation of transductive few-shot learning | Unknown | N/A | |
| Active Offline Policy Selection | Unknown | N/A | |
| Finding Discriminative Filters for Specific Degradations in Blind Super-Resolution | Unknown | N/A | |
| Systematic Generalization with Edge Transformers | Unknown | N/A | |
| Contrastive Active Inference | Unknown | N/A | |
| Relational Self-Attention: What's Missing in Attention for Video Understanding | Unknown | N/A | |
| Blending Anti-Aliasing into Vision Transformer | Unknown | N/A | |
| Data-Efficient Instance Generation from Instance Discrimination | Unknown | N/A | |
| Scalable Neural Data Server: A Data Recommender for Transfer Learning | Unknown | N/A | |
| High Probability Complexity Bounds for Line Search Based on Stochastic Oracles | Unknown | N/A | |
| Terra: Imperative-Symbolic Co-Execution of Imperative Deep Learning Programs | Unknown | N/A | |
| Non-convex Distributionally Robust Optimization: Non-asymptotic Analysis | Unknown | N/A | |
| Multi-View Representation Learning via Total Correlation Objective | Unknown | N/A | |
| Bandit Phase Retrieval | Unknown | N/A | |
| Object DGCNN: 3D Object Detection using Dynamic Graphs | Unknown | N/A | |
| IA-RED$^2$: Interpretability-Aware Redundancy Reduction for Vision Transformers | Unknown | N/A | |
| XCiT: Cross-Covariance Image Transformers | Unknown | N/A | |
| Optimal Policies Tend To Seek Power | Unknown | N/A | |
| On the Periodic Behavior of Neural Network Training with Batch Normalization and Weight Decay | Unknown | N/A | |
| SalKG: Learning From Knowledge Graph Explanations for Commonsense Reasoning | Unknown | N/A | |
| Multiclass versus Binary Differentially Private PAC Learning | Unknown | N/A | |
| Partition and Code: learning how to compress graphs | Unknown | N/A | |
| An Image is Worth More Than a Thousand Words: Towards Disentanglement in The Wild | Unknown | N/A | |
| Error Compensated Distributed SGD Can Be Accelerated | Unknown | N/A | |
| Revisiting Deep Learning Models for Tabular Data | Unknown | N/A | |
| Conditioning Sparse Variational Gaussian Processes for Online Decision-making | Unknown | N/A | |
| Spatial-Temporal Super-Resolution of Satellite Imagery via Conditional Pixel Synthesis | Unknown | N/A | |
| Implicit SVD for Graph Representation Learning | Unknown | N/A | |
| Explanation-based Data Augmentation for Image Classification | Unknown | N/A | |
| On the Universality of Graph Neural Networks on Large Random Graphs | Unknown | N/A | |
| Navigating to the Best Policy in Markov Decision Processes | Unknown | N/A | |
| Identifying and Benchmarking Natural Out-of-Context Prediction Problems | Unknown | N/A | |
| Asynchronous Decentralized SGD with Quantized and Local Updates | Unknown | N/A | |
| Analogous to Evolutionary Algorithm: Designing a Unified Sequence Model | Unknown | N/A | |
| Grammar-Based Grounded Lexicon Learning | Unknown | N/A | |
| SIMONe: View-Invariant, Temporally-Abstracted Object Representations via Unsupervised Video Decomposition | Unknown | N/A | |
| Retiring Adult: New Datasets for Fair Machine Learning | Unknown | N/A | |
| Space-time Mixing Attention for Video Transformer | Unknown | N/A | |
| CSDI: Conditional Score-based Diffusion Models for Probabilistic Time Series Imputation | Unknown | N/A | |
| Weisfeiler and Lehman Go Cellular: CW Networks | Unknown | N/A | |
| Iterative Teaching by Label Synthesis | Unknown | N/A | |
| Lossy Compression for Lossless Prediction | Unknown | N/A | |
| T-LoHo: A Bayesian Regularization Model for Structured Sparsity and Smoothness on Graphs | Unknown | N/A | |
| Contrastive Reinforcement Learning of Symbolic Reasoning Domains | Unknown | N/A | |
| How Tight Can PAC-Bayes be in the Small Data Regime? | Unknown | N/A | |
| VQ-GNN: A Universal Framework to Scale up Graph Neural Networks using Vector Quantization | Unknown | N/A | |
| Deep Markov Factor Analysis: Towards Concurrent Temporal and Spatial Analysis of fMRI Data | Unknown | N/A | |
| Is Bang-Bang Control All You Need? Solving Continuous Control with Bernoulli Policies | Unknown | N/A | |
| AC/DC: Alternating Compressed/DeCompressed Training of Deep Neural Networks | Unknown | N/A | |
| Synthetic Design: An Optimization Approach to Experimental Design with Synthetic Controls | Unknown | N/A | |
| Going Beyond Linear RL: Sample Efficient Neural Function Approximation | Unknown | N/A | |
| One Loss for All: Deep Hashing with a Single Cosine Similarity based Learning Objective | Unknown | N/A | |
| De-randomizing MCMC dynamics with the diffusion Stein operator | Unknown | N/A | |
| On the Provable Generalization of Recurrent Neural Networks | Unknown | N/A | |
| A first-order primal-dual method with adaptivity to local smoothness | Unknown | N/A | |
| Encoding Spatial Distribution of Convolutional Features for Texture Representation | Unknown | N/A | |
| Curriculum Offline Imitating Learning | Unknown | N/A | |
| Sparse is Enough in Scaling Transformers | Unknown | N/A | |
| Federated Graph Classification over Non-IID Graphs | Unknown | N/A | |
| Adaptive Denoising via GainTuning | Unknown | N/A | |
| Rates of Estimation of Optimal Transport Maps using Plug-in Estimators via Barycentric Projections | Unknown | N/A | |
| Attention Approximates Sparse Distributed Memory | Unknown | N/A | |
| ViSER: Video-Specific Surface Embeddings for Articulated 3D Shape Reconstruction | Unknown | N/A | |
| Better Safe Than Sorry: Preventing Delusive Adversaries with Adversarial Training | Unknown | N/A | |
| Amortized Synthesis of Constrained Configurations Using a Differentiable Surrogate | Unknown | N/A | |
| NeuroMLR: Robust & Reliable Route Recommendation on Road Networks | Unknown | N/A | |
| ATISS: Autoregressive Transformers for Indoor Scene Synthesis | Unknown | N/A | |
| Distributional Gradient Matching for Learning Uncertain Neural Dynamics Models | Unknown | N/A | |
| Pure Exploration in Kernel and Neural Bandits | Unknown | N/A | |
| On the Cryptographic Hardness of Learning Single Periodic Neurons | Unknown | N/A | |
| Policy Finetuning: Bridging Sample-Efficient Offline and Online Reinforcement Learning | Unknown | N/A | |
| Meta-learning with an Adaptive Task Scheduler | Unknown | N/A | |
| Multi-Facet Clustering Variational Autoencoders | Unknown | N/A | |
| Soft Calibration Objectives for Neural Networks | Unknown | N/A | |
| Complexity Lower Bounds for Nonconvex-Strongly-Concave Min-Max Optimization | Unknown | N/A | |
| Generalization of Model-Agnostic Meta-Learning Algorithms: Recurring and Unseen Tasks | Unknown | N/A | |
| Learning rule influences recurrent network representations but not attractor structure in decision-making tasks | Unknown | N/A | |
| Techniques for Symbol Grounding with SATNet | Unknown | N/A | |
| Improved Guarantees for Offline Stochastic Matching via new Ordered Contention Resolution Schemes | Unknown | N/A | |
| A Domain-Shrinking based Bayesian Optimization Algorithm with Order-Optimal Regret Performance | Unknown | N/A | |
| Continuous Latent Process Flows | Unknown | N/A | |
| How does a Neural Network's Architecture Impact its Robustness to Noisy Labels? | Unknown | N/A | |
| When does Contrastive Learning Preserve Adversarial Robustness from Pretraining to Finetuning? | Unknown | N/A | |
| On the Importance of Gradients for Detecting Distributional Shifts in the Wild | Unknown | N/A | |
| Learning to Simulate Self-driven Particles System with Coordinated Policy Optimization | Unknown | N/A | |
| Evaluating Efficient Performance Estimators of Neural Architectures | Unknown | N/A | |
| Multiwavelet-based Operator Learning for Differential Equations | Unknown | N/A | |
| Bubblewrap: Online tiling and real-time flow prediction on neural manifolds | Unknown | N/A | |
| Dirichlet Energy Constrained Learning for Deep Graph Neural Networks | Unknown | N/A | |
| S$^3$: Sign-Sparse-Shift Reparametrization for Effective Training of Low-bit Shift Networks | Unknown | N/A | |
| The staircase property: How hierarchical structure can guide deep learning | Unknown | N/A | |
| Topological Attention for Time Series Forecasting | Unknown | N/A | |
| Compressive Visual Representations | Unknown | N/A | |
| When False Positive is Intolerant: End-to-End Optimization with Low FPR for Multipartite Ranking | Unknown | N/A | |
| Best-case lower bounds in online learning | Unknown | N/A | |
| Nearly Minimax Optimal Reinforcement Learning for Discounted MDPs | Unknown | N/A | |
| Computer-Aided Design as Language | Unknown | N/A | |
| No-Press Diplomacy from Scratch | Unknown | N/A | |
| Efficient Mirror Descent Ascent Methods for Nonsmooth Minimax Problems | Unknown | N/A | |
| Data Sharing and Compression for Cooperative Networked Control | Unknown | N/A | |
| DIB-R++: Learning to Predict Lighting and Material with a Hybrid Differentiable Renderer | Unknown | N/A | |
| Adaptive First-Order Methods Revisited: Convex Minimization without Lipschitz Requirements | Unknown | N/A | |
| Backdoor Attack with Imperceptible Input and Latent Modification | Unknown | N/A | |
| Teaching an Active Learner with Contrastive Examples | Unknown | N/A | |
| On sensitivity of meta-learning to support data | Unknown | N/A | |
| Contrast and Mix: Temporal Contrastive Video Domain Adaptation with Background Mixing | Unknown | N/A | |
| Can we have it all? On the Trade-off between Spatial and Adversarial Robustness of Neural Networks | Unknown | N/A | |
| Inverse Problems Leveraging Pre-trained Contrastive Representations | Unknown | N/A | |
| H-NeRF: Neural Radiance Fields for Rendering and Temporal Reconstruction of Humans in Motion | Unknown | N/A | |
| Slice Sampling Reparameterization Gradients | Unknown | N/A | |
| Why Do Better Loss Functions Lead to Less Transferable Features? | Unknown | N/A | |
| Meta Two-Sample Testing: Learning Kernels for Testing with Limited Data | Unknown | N/A | |
| Revisit Multimodal Meta-Learning through the Lens of Multi-Task Learning | Unknown | N/A | |
| Transfer Learning of Graph Neural Networks with Ego-graph Information Maximization | Unknown | N/A | |
| Fast Axiomatic Attribution for Neural Networks | Unknown | N/A | |
| Targeted Neural Dynamical Modeling | Unknown | N/A | |
| On the Role of Optimization in Double Descent: A Least Squares Study | Unknown | N/A | |
| Attention Bottlenecks for Multimodal Fusion | Unknown | N/A | |
| Stochastic Bias-Reduced Gradient Methods | Unknown | N/A | |
| Shift-Robust GNNs: Overcoming the Limitations of Localized Graph Training data | Unknown | N/A | |
| Interactive Label Cleaning with Example-based Explanations | Unknown | N/A | |
| Parameter Inference with Bifurcation Diagrams | Unknown | N/A | |
| Logarithmic Regret from Sublinear Hints | Unknown | N/A | |
| Deep Marching Tetrahedra: a Hybrid Representation for High-Resolution 3D Shape Synthesis | Unknown | N/A | |
| Learning to Predict Trustworthiness with Steep Slope Loss | Unknown | N/A | |
| Breaking the Linear Iteration Cost Barrier for Some Well-known Conditional Gradient Methods Using MaxIP Data-structures | Unknown | N/A | |
| Kernel Identification Through Transformers | Unknown | N/A | |
| Convex-Concave Min-Max Stackelberg Games | Unknown | N/A | |
| Three-dimensional spike localization and improved motion correction for Neuropixels recordings | Unknown | N/A | |
| Outcome-Driven Reinforcement Learning via Variational Inference | Unknown | N/A | |
| Transformers Generalize DeepSets and Can be Extended to Graphs & Hypergraphs | Unknown | N/A | |
| Efficient Generalization with Distributionally Robust Learning | Unknown | N/A | |
| How to transfer algorithmic reasoning knowledge to learn new algorithms? | Unknown | N/A | |
| Fast Routing under Uncertainty: Adaptive Learning in Congestion Games via Exponential Weights | Unknown | N/A | |
| Absolute Neighbour Difference based Correlation Test for Detecting Heteroscedastic Relationships | Unknown | N/A | |
| Self-Paced Contrastive Learning for Semi-supervised Medical Image Segmentation with Meta-labels | Unknown | N/A | |
| On Optimal Interpolation in Linear Regression | Unknown | N/A | |
| Towards Sample-efficient Overparameterized Meta-learning | Unknown | N/A | |
| Self-Supervised Learning with Kernel Dependence Maximization | Unknown | N/A | |
| Instance-Conditioned GAN | Unknown | N/A | |
| Optimal prediction of Markov chains with and without spectral gap | Unknown | N/A | |
| Overlapping Spaces for Compact Graph Representations | Unknown | N/A | |
| Long Short-Term Transformer for Online Action Detection | Unknown | N/A | |
| Supercharging Imbalanced Data Learning With Energy-based Contrastive Representation Transfer | Unknown | N/A | |
| Neural Pseudo-Label Optimism for the Bank Loan Problem | Unknown | N/A | |
| Differentially Private Learning with Adaptive Clipping | Unknown | N/A | |
| Nested Counterfactual Identification from Arbitrary Surrogate Experiments | Unknown | N/A | |
| On Provable Benefits of Depth in Training Graph Convolutional Networks | Unknown | N/A | |
| Robust Counterfactual Explanations on Graph Neural Networks | Unknown | N/A | |
| Perturb-and-max-product: Sampling and learning in discrete energy-based models | Unknown | N/A | |
| Class-Disentanglement and Applications in Adversarial Detection and Defense | Unknown | N/A | |
| Hypergraph Propagation and Community Selection for Objects Retrieval | Unknown | N/A | |
| Aligning Silhouette Topology for Self-Adaptive 3D Human Pose Recovery | Unknown | N/A | |
| Robust Implicit Networks via Non-Euclidean Contractions | Unknown | N/A | |
| Successor Feature Landmarks for Long-Horizon Goal-Conditioned Reinforcement Learning | Unknown | N/A | |
| Tailoring: encoding inductive biases by optimizing unsupervised objectives at prediction time | Unknown | N/A | |
| Generative vs. Discriminative: Rethinking The Meta-Continual Learning | Unknown | N/A | |
| Controllable and Compositional Generation with Latent-Space Energy-Based Models | Unknown | N/A | |
| CoFiNet: Reliable Coarse-to-fine Correspondences for Robust PointCloud Registration | Unknown | N/A | |
| Automatic and Harmless Regularization with Constrained and Lexicographic Optimization: A Dynamic Barrier Approach | Unknown | N/A | |
| Joint Modeling of Visual Objects and Relations for Scene Graph Generation | Unknown | N/A | |
| Pessimism Meets Invariance: Provably Efficient Offline Mean-Field Multi-Agent RL | Unknown | N/A | |
| Renyi Differential Privacy of The Subsampled Shuffle Model In Distributed Learning | Unknown | N/A | |
| Visual Adversarial Imitation Learning using Variational Models | Unknown | N/A | |
| Online false discovery rate control for anomaly detection in time series | Unknown | N/A | |
| Double Machine Learning Density Estimation for Local Treatment Effects with Instruments | Unknown | N/A | |
| An analysis of Ermakov-Zolotukhin quadrature using kernels | Unknown | N/A | |
| An Exponential Improvement on the Memorization Capacity of Deep Threshold Networks | Unknown | N/A | |
| NAS-Bench-x11 and the Power of Learning Curves | Unknown | N/A | |
| Reinforcement Learning based Disease Progression Model for Alzheimer’s Disease | Unknown | N/A | |
| Scalable Online Planning via Reinforcement Learning Fine-Tuning | Unknown | N/A | |
| Differentiable Optimization of Generalized Nondecomposable Functions using Linear Programs | Unknown | N/A | |
| Parallel Bayesian Optimization of Multiple Noisy Objectives with Expected Hypervolume Improvement | Unknown | N/A | |
| Explicable Reward Design for Reinforcement Learning Agents | Unknown | N/A | |
| Robust and Fully-Dynamic Coreset for Continuous-and-Bounded Learning (With Outliers) Problems | Unknown | N/A | |
| Remember What You Want to Forget: Algorithms for Machine Unlearning | Unknown | N/A | |
| Faster Matchings via Learned Duals | Unknown | N/A | |
| A Separation Result Between Data-oblivious and Data-aware Poisoning Attacks | Unknown | N/A | |
| Learning to Select Exogenous Events for Marked Temporal Point Process | Unknown | N/A | |
| Score-based Generative Modeling in Latent Space | Unknown | N/A | |
| Reducing Collision Checking for Sampling-Based Motion Planning Using Graph Neural Networks | Unknown | N/A | |
| Center Smoothing: Certified Robustness for Networks with Structured Outputs | Unknown | N/A | |
| Numerical Composition of Differential Privacy | Unknown | N/A | |
| The Semi-Random Satisfaction of Voting Axioms | Unknown | N/A | |
| Better Algorithms for Individually Fair $k$-Clustering | Unknown | N/A | |
| A Near-Optimal Algorithm for Stochastic Bilevel Optimization via Double-Momentum | Unknown | N/A | |
| One More Step Towards Reality: Cooperative Bandits with Imperfect Communication | Unknown | N/A | |
| Discovering and Achieving Goals via World Models | Unknown | N/A | |
| Learning-to-learn non-convex piecewise-Lipschitz functions | Unknown | N/A | |
| Tracking People with 3D Representations | Unknown | N/A | |
| Efficient Truncated Linear Regression with Unknown Noise Variance | Unknown | N/A | |
| Moser Flow: Divergence-based Generative Modeling on Manifolds | Unknown | N/A | |
| Stateful ODE-Nets using Basis Function Expansions | Unknown | N/A | |
| Adversarial Graph Augmentation to Improve Graph Contrastive Learning | Unknown | N/A | |
| Latent Matters: Learning Deep State-Space Models | Unknown | N/A | |
| Permuton-induced Chinese Restaurant Process | Unknown | N/A | |
| Beware of the Simulated DAG! Causal Discovery Benchmarks May Be Easy to Game | Unknown | N/A | |
| A Gang of Adversarial Bandits | Unknown | N/A | |
| Bayesian Adaptation for Covariate Shift | Unknown | N/A | |
| Differentiable Synthesis of Program Architectures | Unknown | N/A | |
| Fair Classification with Adversarial Perturbations | Unknown | N/A | |
| Heavy Tails in SGD and Compressibility of Overparametrized Neural Networks | Unknown | N/A | |
| Stochastic Online Linear Regression: the Forward Algorithm to Replace Ridge | Unknown | N/A | |
| Strategic Behavior is Bliss: Iterative Voting Improves Social Welfare | Unknown | N/A | |
| Sifting through the noise: Universal first-order methods for stochastic variational inequalities | Unknown | N/A | |
| The Complexity of Sparse Tensor PCA | Unknown | N/A | |
| Extending Lagrangian and Hamiltonian Neural Networks with Differentiable Contact Models | Unknown | N/A | |
| CAPE: Encoding Relative Positions with Continuous Augmented Positional Embeddings | Unknown | N/A | |
| Double/Debiased Machine Learning for Dynamic Treatment Effects | Unknown | N/A | |
| Fine-Grained Neural Network Explanation by Identifying Input Features with Predictive Information | Unknown | N/A | |
| Do Wider Neural Networks Really Help Adversarial Robustness? | Unknown | N/A | |
| Hyperparameter Tuning is All You Need for LISTA | Unknown | N/A | |
| Learning Stable Deep Dynamics Models for Partially Observed or Delayed Dynamical Systems | Unknown | N/A | |
| Delayed Propagation Transformer: A Universal Computation Engine towards Practical Control in Cyber-Physical Systems | Unknown | N/A | |
| FLEX: Unifying Evaluation for Few-Shot NLP | Unknown | N/A | |
| TokenLearner: Adaptive Space-Time Tokenization for Videos | Unknown | N/A | |
| Adjusting for Autocorrelated Errors in Neural Networks for Time Series | Unknown | N/A | |
| The Benefits of Implicit Regularization from SGD in Least Squares Problems | Unknown | N/A | |
| Teaching via Best-Case Counterexamples in the Learning-with-Equivalence-Queries Paradigm | Unknown | N/A | |
| SIMILAR: Submodular Information Measures Based Active Learning In Realistic Scenarios | Unknown | N/A | |
| Shapley Residuals: Quantifying the limits of the Shapley value for explanations | Unknown | N/A | |
| Instance-Dependent Bounds for Zeroth-order Lipschitz Optimization with Error Certificates | Unknown | N/A | |
| Robust Regression Revisited: Acceleration and Improved Estimation Rates | Unknown | N/A | |
| Reinforcement Learning with State Observation Costs in Action-Contingent Noiselessly Observable Markov Decision Processes | Unknown | N/A | |
| On the Sample Complexity of Learning under Geometric Stability | Unknown | N/A | |
| Scaling Ensemble Distribution Distillation to Many Classes with Proxy Targets | Unknown | N/A | |
| Subgoal Search For Complex Reasoning Tasks | Unknown | N/A | |
| Consistent Estimation for PCA and Sparse Regression with Oblivious Outliers | Unknown | N/A | |
| Curriculum Design for Teaching via Demonstrations: Theory and Applications | Unknown | N/A | |
| Uncertain Decisions Facilitate Better Preference Learning | Unknown | N/A | |
| Asymptotically Best Causal Effect Identification with Multi-Armed Bandits | Unknown | N/A | |
| Qu-ANTI-zation: Exploiting Quantization Artifacts for Achieving Adversarial Outcomes | Unknown | N/A | |
| DropGNN: Random Dropouts Increase the Expressiveness of Graph Neural Networks | Unknown | N/A | |
| Improving Compositionality of Neural Networks by Decoding Representations to Inputs | Unknown | N/A | |
| Invariance Principle Meets Information Bottleneck for Out-of-Distribution Generalization | Unknown | N/A | |
| Learning to See by Looking at Noise | Unknown | N/A | |
| Parametric Complexity Bounds for Approximating PDEs with Neural Networks | Unknown | N/A | |
| General Nonlinearities in SO(2)-Equivariant CNNs | Unknown | N/A | |
| Representing Long-Range Context for Graph Neural Networks with Global Attention | Unknown | N/A | |
| Subgame solving without common knowledge | Unknown | N/A | |
| Towards a Unified Information-Theoretic Framework for Generalization | Unknown | N/A | |
| Learning to Synthesize Programs as Interpretable and Generalizable Policies | Unknown | N/A | |
| Distributed Estimation with Multiple Samples per User: Sharp Rates and Phase Transition | Unknown | N/A | |
| Structured Dropout Variational Inference for Bayesian Neural Networks | Unknown | N/A | |
| Local Signal Adaptivity: Provable Feature Learning in Neural Networks Beyond Kernels | Unknown | N/A | |
| Newton-LESS: Sparsification without Trade-offs for the Sketched Newton Update | Unknown | N/A | |
| Taming Communication and Sample Complexities in Decentralized Policy Evaluation for Cooperative Multi-Agent Reinforcement Learning | Unknown | N/A | |
| Scatterbrain: Unifying Sparse and Low-rank Attention | Unknown | N/A | |
| Look at the Variance! Efficient Black-box Explanations with Sobol-based Sensitivity Analysis | Unknown | N/A | |
| Minimizing Polarization and Disagreement in Social Networks via Link Recommendation | Unknown | N/A | |
| Adversarial Examples Make Strong Poisons | Unknown | N/A | |
| Laplace Redux - Effortless Bayesian Deep Learning | Unknown | N/A | |
| Enabling Fast Differentially Private SGD via Just-in-Time Compilation and Vectorization | Unknown | N/A | |
| A Multi-Implicit Neural Representation for Fonts | Unknown | N/A | |
| Towards Hyperparameter-free Policy Selection for Offline Reinforcement Learning | Unknown | N/A | |
| Sanity Checks for Lottery Tickets: Does Your Winning Ticket Really Win the Jackpot? | Unknown | N/A | |
| Learning Distilled Collaboration Graph for Multi-Agent Perception | Unknown | N/A | |
| Generalization Bounds for (Wasserstein) Robust Optimization | Unknown | N/A | |
| Achieving Forgetting Prevention and Knowledge Transfer in Continual Learning | Unknown | N/A | |
| Compositional Transformers for Scene Generation | Unknown | N/A | |
| Structural Credit Assignment in Neural Networks using Reinforcement Learning | Unknown | N/A | |
| Reducing the Covariate Shift by Mirror Samples in Cross Domain Alignment | Unknown | N/A | |
| Characterizing possible failure modes in physics-informed neural networks | Unknown | N/A | |
| Fast Training of Neural Lumigraph Representations using Meta Learning | Unknown | N/A | |
| Correlated Stochastic Block Models: Exact Graph Matching with Applications to Recovering Communities | Unknown | N/A | |
| Can Information Flows Suggest Targets for Interventions in Neural Circuits? | Unknown | N/A | |
| Kernel Functional Optimisation | Unknown | N/A | |
| Exploiting the Intrinsic Neighborhood Structure for Source-free Domain Adaptation | Unknown | N/A | |
| Training Certifiably Robust Neural Networks with Efficient Local Lipschitz Bounds | Unknown | N/A | |
| ReLU Regression with Massart Noise | Unknown | N/A | |
| Exponential Bellman Equation and Improved Regret Bounds for Risk-Sensitive Reinforcement Learning | Unknown | N/A | |
| Fair Clustering Under a Bounded Cost | Unknown | N/A | |
| Pragmatic Image Compression for Human-in-the-Loop Decision-Making | Unknown | N/A | |
| Second-Order Neural ODE Optimizer | Unknown | N/A | |
| Early Convolutions Help Transformers See Better | Unknown | N/A | |
| PatchGame: Learning to Signal Mid-level Patches in Referential Games | Unknown | N/A | |
| Structured Reordering for Modeling Latent Alignments in Sequence Transduction | Unknown | N/A | |
| ResNEsts and DenseNEsts: Block-based DNN Models with Improved Representation Guarantees | Unknown | N/A | |
| MERLOT: Multimodal Neural Script Knowledge Models | Unknown | N/A | |
| Novel Upper Bounds for the Constrained Most Probable Explanation Task | Unknown | N/A | |
| Low-Fidelity Video Encoder Optimization for Temporal Action Localization | Unknown | N/A | |
| Replay-Guided Adversarial Environment Design | Unknown | N/A | |
| Voxel-based 3D Detection and Reconstruction of Multiple Objects from a Single Image | Unknown | N/A | |
| Stochastic Gradient Descent-Ascent and Consensus Optimization for Smooth Games: Convergence Analysis under Expected Co-coercivity | Unknown | N/A | |
| Beyond Smoothness: Incorporating Low-Rank Analysis into Nonparametric Density Estimation | Unknown | N/A | |
| A Geometric Structure of Acceleration and Its Role in Making Gradients Small Fast | Unknown | N/A | |
| Differentiable Spline Approximations | Unknown | N/A | |
| Measuring Generalization with Optimal Transport | Unknown | N/A | |
| Baleen: Robust Multi-Hop Reasoning at Scale via Condensed Retrieval | Unknown | N/A | |
| Optimal Sketching for Trace Estimation | Unknown | N/A | |
| Robustness of Graph Neural Networks at Scale | Unknown | N/A | |
| Dynamic Inference with Neural Interpreters | Unknown | N/A | |
| Stochastic bandits with groups of similar arms | Unknown | N/A | |
| Identification of Partially Observed Linear Causal Models: Graphical Conditions for the Non-Gaussian and Heterogeneous Cases | Unknown | N/A | |
| Continual Auxiliary Task Learning | Unknown | N/A | |
| Generalization Bounds for Graph Embedding Using Negative Sampling: Linear vs Hyperbolic | Unknown | N/A | |
| On the Rate of Convergence of Regularized Learning in Games: From Bandits and Uncertainty to Optimism and Beyond | Unknown | N/A | |
| CLDA: Contrastive Learning for Semi-Supervised Domain Adaptation | Unknown | N/A | |
| Causal Abstractions of Neural Networks | Unknown | N/A | |
| Optimal Underdamped Langevin MCMC Method | Unknown | N/A | |
| Greedy Approximation Algorithms for Active Sequential Hypothesis Testing | Unknown | N/A | |
| Towards Deeper Deep Reinforcement Learning with Spectral Normalization | Unknown | N/A | |
| NeuS: Learning Neural Implicit Surfaces by Volume Rendering for Multi-view Reconstruction | Unknown | N/A | |
| Divergence Frontiers for Generative Models: Sample Complexity, Quantization Effects, and Frontier Integrals | Unknown | N/A | |
| When in Doubt: Neural Non-Parametric Uncertainty Quantification for Epidemic Forecasting | Unknown | N/A | |
| KALE Flow: A Relaxed KL Gradient Flow for Probabilities with Disjoint Support | Unknown | N/A | |
| Hyperbolic Procrustes Analysis Using Riemannian Geometry | Unknown | N/A | |
| MADE: Exploration via Maximizing Deviation from Explored Regions | Unknown | N/A | |
| Federated Multi-Task Learning under a Mixture of Distributions | Unknown | N/A | |
| Overcoming Catastrophic Forgetting in Incremental Few-Shot Learning by Finding Flat Minima | Unknown | N/A | |
| Regularized Frank-Wolfe for Dense CRFs: Generalizing Mean Field and Beyond | Unknown | N/A | |
| Collapsed Variational Bounds for Bayesian Neural Networks | Unknown | N/A | |
| Characterizing Generalization under Out-Of-Distribution Shifts in Deep Metric Learning | Unknown | N/A | |
| Improving Deep Learning Interpretability by Saliency Guided Training | Unknown | N/A | |
| Label consistency in overfitted generalized $k$-means | Unknown | N/A | |
| Meta Internal Learning | Unknown | N/A | |
| Analytic Insights into Structure and Rank of Neural Network Hessian Maps | Unknown | N/A | |
| LEADS: Learning Dynamical Systems that Generalize Across Environments | Unknown | N/A | |
| Intrinsic Dimension, Persistent Homology and Generalization in Neural Networks | Unknown | N/A | |
| Multi-Agent Reinforcement Learning for Active Voltage Control on Power Distribution Networks | Unknown | N/A | |
| Uniform Sampling over Episode Difficulty | Unknown | N/A | |
| GeoMol: Torsional Geometric Generation of Molecular 3D Conformer Ensembles | Unknown | N/A | |
| Efficient Algorithms for Learning Depth-2 Neural Networks with General ReLU Activations | Unknown | N/A | |
| A-NeRF: Articulated Neural Radiance Fields for Learning Human Shape, Appearance, and Pose | Unknown | N/A | |
| AugMax: Adversarial Composition of Random Augmentations for Robust Training | Unknown | N/A | |
| Drawing Robust Scratch Tickets: Subnetworks with Inborn Robustness Are Found within Randomly Initialized Networks | Unknown | N/A | |
| Overparameterization Improves Robustness to Covariate Shift in High Dimensions | Unknown | N/A | |
| MagNet: A Neural Network for Directed Graphs | Unknown | N/A | |
| Dr Jekyll & Mr Hyde: the strange case of off-policy policy updates | Unknown | N/A | |
| Stronger NAS with Weaker Predictors | Unknown | N/A | |
| Multi-Step Budgeted Bayesian Optimization with Unknown Evaluation Costs | Unknown | N/A | |
| Adaptive Machine Unlearning | Unknown | N/A | |
| Time-independent Generalization Bounds for SGLD in Non-convex Settings | Unknown | N/A | |
| NeRV: Neural Representations for Videos | Unknown | N/A | |
| Causal Effect Inference for Structured Treatments | Unknown | N/A | |
| Learning-Augmented Dynamic Power Management with Multiple States via New Ski Rental Bounds | Unknown | N/A | |
| Implicit Finite-Horizon Approximation and Efficient Optimal Algorithms for Stochastic Shortest Path | Unknown | N/A | |
| Finite Sample Analysis of Average-Reward TD Learning and $Q$-Learning | Unknown | N/A | |
| Stochastic optimization under time drift: iterate averaging, step-decay schedules, and high probability guarantees | Unknown | N/A | |
| Focal Attention for Long-Range Interactions in Vision Transformers | Unknown | N/A | |
| Iteratively Reweighted Least Squares for Basis Pursuit with Global Linear Convergence Rate | Unknown | N/A | |
| COHESIV: Contrastive Object and Hand Embedding Segmentation In Video | Unknown | N/A | |
| Scalars are universal: Equivariant machine learning, structured like classical physics | Unknown | N/A | |
| Rethinking gradient sparsification as total error minimization | Unknown | N/A | |
| Grad2Task: Improved Few-shot Text Classification Using Gradients for Task Representation | Unknown | N/A | |
| Habitat 2.0: Training Home Assistants to Rearrange their Habitat | Unknown | N/A | |
| Language models enable zero-shot prediction of the effects of mutations on protein function | Unknown | N/A | |
| Label-Imbalanced and Group-Sensitive Classification under Overparameterization | Unknown | N/A | |
| Deep inference of latent dynamics with spatio-temporal super-resolution using selective backpropagation through time | Unknown | N/A | |
| TTT++: When Does Self-Supervised Test-Time Training Fail or Thrive? | Unknown | N/A | |
| Two-sided fairness in rankings via Lorenz dominance | Unknown | N/A | |
| Decoupling the Depth and Scope of Graph Neural Networks | Unknown | N/A | |
| Learning in two-player zero-sum partially observable Markov games with perfect recall | Unknown | N/A | |
| Mixture Proportion Estimation and PU Learning: A Modern Approach | Unknown | N/A | |
| A Law of Iterated Logarithm for Multi-Agent Reinforcement Learning | Unknown | N/A | |
| Refining Language Models with Compositional Explanations | Unknown | N/A | |
| Noether Networks: meta-learning useful conserved quantities | Unknown | N/A | |
| Efficient hierarchical Bayesian inference for spatio-temporal regression models in neuroimaging | Unknown | N/A | |
| Leveraging Spatial and Temporal Correlations in Sparsified Mean Estimation | Unknown | N/A | |
| Stochastic Anderson Mixing for Nonconvex Stochastic Optimization | Unknown | N/A | |
| NovelD: A Simple yet Effective Exploration Criterion | Unknown | N/A | |
| Fast Multi-Resolution Transformer Fine-tuning for Extreme Multi-label Text Classification | Unknown | N/A | |
| Row-clustering of a Point Process-valued Matrix | Unknown | N/A | |
| Optimal Best-Arm Identification Methods for Tail-Risk Measures | Unknown | N/A | |
| Deep Networks Provably Classify Data on Curves | Unknown | N/A | |
| Global Convergence to Local Minmax Equilibrium in Classes of Nonconvex Zero-Sum Games | Unknown | N/A | |
| A Winning Hand: Compressing Deep Networks Can Improve Out-of-Distribution Robustness | Unknown | N/A | |
| EF21: A New, Simpler, Theoretically Better, and Practically Faster Error Feedback | Unknown | N/A | |
| On the Generative Utility of Cyclic Conditionals | Unknown | N/A | |
| CAFE: Catastrophic Data Leakage in Vertical Federated Learning | Unknown | N/A | |
| Topological Detection of Trojaned Neural Networks | Unknown | N/A | |
| Proxy-Normalizing Activations to Match Batch Normalization while Removing Batch Dependence | Unknown | N/A | |
| Be Confident! Towards Trustworthy Graph Neural Networks via Confidence Calibration | Unknown | N/A | |
| The Causal-Neural Connection: Expressiveness, Learnability, and Inference | Unknown | N/A | |
| SEAL: Self-supervised Embodied Active Learning using Exploration and 3D Consistency | Unknown | N/A | |
| Redesigning the Transformer Architecture with Insights from Multi-particle Dynamical Systems | Unknown | N/A | |
| Interpolation can hurt robust generalization even when there is no noise | Unknown | N/A | |
| OctField: Hierarchical Implicit Functions for 3D Modeling | Unknown | N/A | |
| Test-Time Personalization with a Transformer for Human Pose Estimation | Unknown | N/A | |
| Dense Keypoints via Multiview Supervision | Unknown | N/A | |
| Functional Variational Inference based on Stochastic Process Generators | Unknown | N/A | |
| Overcoming the Convex Barrier for Simplex Inputs | Unknown | N/A | |
| Look at What I’m Doing: Self-Supervised Spatial Grounding of Narrations in Instructional Videos | Unknown | N/A | |
| CLIP-It! Language-Guided Video Summarization | Unknown | N/A | |
| The Lazy Online Subgradient Algorithm is Universal on Strongly Convex Domains | Unknown | N/A | |
| Adversarial Robustness without Adversarial Training: A Teacher-Guided Curriculum Learning Approach | Unknown | N/A | |
| An Exact Characterization of the Generalization Error for the Gibbs Algorithm | Unknown | N/A | |
| Evaluating model performance under worst-case subpopulations | Unknown | N/A | |
| DP-SSL: Towards Robust Semi-supervised Learning with A Few Labeled Samples | Unknown | N/A | |
| Risk-averse Heteroscedastic Bayesian Optimization | Unknown | N/A | |
| Mining the Benefits of Two-stage and One-stage HOI Detection | Unknown | N/A | |
| Bias Out-of-the-Box: An Empirical Analysis of Intersectional Occupational Biases in Popular Generative Language Models | Unknown | N/A | |
| Learning Equilibria in Matching Markets from Bandit Feedback | Unknown | N/A | |
| Improving black-box optimization in VAE latent space using decoder uncertainty | Unknown | N/A | |
| On the Convergence of Prior-Guided Zeroth-Order Optimization Algorithms | Unknown | N/A | |
| Validating the Lottery Ticket Hypothesis with Inertial Manifold Theory | Unknown | N/A | |
| TöRF: Time-of-Flight Radiance Fields for Dynamic Scene View Synthesis | Unknown | N/A | |
| Sample Complexity of Tree Search Configuration: Cutting Planes and Beyond | Unknown | N/A | |
| Evidential Softmax for Sparse Multimodal Distributions in Deep Generative Models | Unknown | N/A | |
| Adapting to function difficulty and growth conditions in private optimization | Unknown | N/A | |
| Combining Recurrent, Convolutional, and Continuous-time Models with Linear State Space Layers | Unknown | N/A | |
| Leveraging SE(3) Equivariance for Self-supervised Category-Level Object Pose Estimation from Point Clouds | Unknown | N/A | |
| AutoBalance: Optimized Loss Functions for Imbalanced Data | Unknown | N/A | |
| Gradient Descent on Two-layer Nets: Margin Maximization and Simplicity Bias | Unknown | N/A | |
| TransMatcher: Deep Image Matching Through Transformers for Generalizable Person Re-identification | Unknown | N/A | |
| You Only Look at One Sequence: Rethinking Transformer in Vision through Object Detection | Unknown | N/A | |
| Towards Efficient and Effective Adversarial Training | Unknown | N/A | |
| Neural Dubber: Dubbing for Videos According to Scripts | Unknown | N/A | |
| Revealing and Protecting Labels in Distributed Training | Unknown | N/A | |
| Drop-DTW: Aligning Common Signal Between Sequences While Dropping Outliers | Unknown | N/A | |
| Preconditioned Gradient Descent for Over-Parameterized Nonconvex Matrix Factorization | Unknown | N/A | |
| Scaling Up Exact Neural Network Compression by ReLU Stability | Unknown | N/A | |
| Revisiting 3D Object Detection From an Egocentric Perspective | Unknown | N/A | |
| Learning Debiased Representation via Disentangled Feature Augmentation | Unknown | N/A | |
| ImageBART: Bidirectional Context with Multinomial Diffusion for Autoregressive Image Synthesis | Unknown | N/A | |
| SQALER: Scaling Question Answering by Decoupling Multi-Hop and Logical Reasoning | Unknown | N/A | |
| Bridging Offline Reinforcement Learning and Imitation Learning: A Tale of Pessimism | Unknown | N/A | |
| Object-Centric Representation Learning with Generative Spatial-Temporal Factorization | Unknown | N/A | |
| Post-processing for Individual Fairness | Unknown | N/A | |
| Linear and Kernel Classification in the Streaming Model: Improved Bounds for Heavy Hitters | Unknown | N/A | |
| Bridging the Imitation Gap by Adaptive Insubordination | Unknown | N/A | |
| Particle Dual Averaging: Optimization of Mean Field Neural Network with Global Convergence Rate Analysis | Unknown | N/A | |
| Gradient-based Editing of Memory Examples for Online Task-free Continual Learning | Unknown | N/A | |
| Last-iterate Convergence in Extensive-Form Games | Unknown | N/A | |
| GENESIS-V2: Inferring Unordered Object Representations without Iterative Refinement | Unknown | N/A | |
| On Blame Attribution for Accountable Multi-Agent Sequential Decision Making | Unknown | N/A | |
| Locally private online change point detection | Unknown | N/A | |
| A Causal Lens for Controllable Text Generation | Unknown | N/A | |
| Unsupervised Part Discovery from Contrastive Reconstruction | Unknown | N/A | |
| PlayVirtual: Augmenting Cycle-Consistent Virtual Trajectories for Reinforcement Learning | Unknown | N/A | |
| Beyond Tikhonov: faster learning with self-concordant losses, via iterative regularization | Unknown | N/A | |
| Combinatorial Optimization for Panoptic Segmentation: A Fully Differentiable Approach | Unknown | N/A | |
| Topology-Imbalance Learning for Semi-Supervised Node Classification | Unknown | N/A | |
| FACMAC: Factored Multi-Agent Centralised Policy Gradients | Unknown | N/A | |
| Exploring Cross-Video and Cross-Modality Signals for Weakly-Supervised Audio-Visual Video Parsing | Unknown | N/A | |
| Benign Overfitting in Multiclass Classification: All Roads Lead to Interpolation | Unknown | N/A | |
| Continuous vs. Discrete Optimization of Deep Neural Networks | Unknown | N/A | |
| Post-Training Quantization for Vision Transformer | Unknown | N/A | |
| Edge Representation Learning with Hypergraphs | Unknown | N/A | |
| SILG: The Multi-domain Symbolic Interactive Language Grounding Benchmark | Unknown | N/A | |
| Conditional Generation Using Polynomial Expansions | Unknown | N/A | |
| Model-Based Episodic Memory Induces Dynamic Hybrid Controls | Unknown | N/A | |
| Property-Aware Relation Networks for Few-Shot Molecular Property Prediction | Unknown | N/A | |
| Deep Learning Through the Lens of Example Difficulty | Unknown | N/A | |
| Understanding Bandits with Graph Feedback | Unknown | N/A | |
| Deep Jump Learning for Off-Policy Evaluation in Continuous Treatment Settings | Unknown | N/A | |
| An Infinite-Feature Extension for Bayesian ReLU Nets That Fixes Their Asymptotic Overconfidence | Unknown | N/A | |
| Sparse Steerable Convolutions: An Efficient Learning of SE(3)-Equivariant Features for Estimation and Tracking of Object Poses in 3D Space | Unknown | N/A | |
| Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning | Unknown | N/A | |
| Online Adaptation to Label Distribution Shift | Unknown | N/A | |
| Sample Selection for Fair and Robust Training | Unknown | N/A | |
| Integrating Tree Path in Transformer for Code Representation | Unknown | N/A | |
| Beta-CROWN: Efficient Bound Propagation with Per-neuron Split Constraints for Neural Network Robustness Verification | Unknown | N/A | |
| VigDet: Knowledge Informed Neural Temporal Point Process for Coordination Detection on Social Media | Unknown | N/A | |
| Prototypical Cross-Attention Networks for Multiple Object Tracking and Segmentation | Unknown | N/A | |
| Integrating Expert ODEs into Neural ODEs: Pharmacology and Disease Progression | Unknown | N/A | |
| A Non-commutative Extension of Lee-Seung's Algorithm for Positive Semidefinite Factorizations | Unknown | N/A | |
| The Hardness Analysis of Thompson Sampling for Combinatorial Semi-bandits with Greedy Oracle | Unknown | N/A | |
| Differentially Private Federated Bayesian Optimization with Distributed Exploration | Unknown | N/A | |
| SAPE: Spatially-Adaptive Progressive Encoding for Neural Optimization | Unknown | N/A | |
| Deep Conditional Gaussian Mixture Model for Constrained Clustering | Unknown | N/A | |
| EditGAN: High-Precision Semantic Image Editing | Unknown | N/A | |
| Neural Analysis and Synthesis: Reconstructing Speech from Self-Supervised Representations | Unknown | N/A | |
| VoiceMixer: Adversarial Voice Style Mixup | Unknown | N/A | |
| BCORLE($\lambda$): An Offline Reinforcement Learning and Evaluation Framework for Coupons Allocation in E-commerce Market | Unknown | N/A | |
| Unsupervised Object-Based Transition Models For 3D Partially Observable Environments | Unknown | N/A | |
| Learning Graph Models for Retrosynthesis Prediction | Unknown | N/A | |
| On Success and Simplicity: A Second Look at Transferable Targeted Attacks | Unknown | N/A | |
| Variational Model Inversion Attacks | Unknown | N/A | |
| A Computationally Efficient Method for Learning Exponential Family Distributions | Unknown | N/A | |
| Streaming Belief Propagation for Community Detection | Unknown | N/A | |
| Learned Robust PCA: A Scalable Deep Unfolding Approach for High-Dimensional Outlier Detection | Unknown | N/A | |
| Learning Generalized Gumbel-max Causal Mechanisms | Unknown | N/A | |
| A PAC-Bayes Analysis of Adversarial Robustness | Unknown | N/A | |
| QuPeD: Quantized Personalization via Distillation with Applications to Federated Learning | Unknown | N/A | |
| Recursive Bayesian Networks: Generalising and Unifying Probabilistic Context-Free Grammars and Dynamic Bayesian Networks | Unknown | N/A | |
| Improved Regularization and Robustness for Fine-tuning in Neural Networks | Unknown | N/A | |
| Pay Better Attention to Attention: Head Selection in Multilingual and Multi-Domain Sequence Modeling | Unknown | N/A | |
| Dataset Distillation with Infinitely Wide Convolutional Networks | Unknown | N/A | |
| Multi-Person 3D Motion Prediction with Multi-Range Transformers | Unknown | N/A | |
| Efficient Bayesian network structure learning via local Markov boundary search | Unknown | N/A | |
| SADGA: Structure-Aware Dual Graph Aggregation Network for Text-to-SQL | Unknown | N/A | |
| On the Value of Infinite Gradients in Variational Autoencoder Models | Unknown | N/A | |
| Sequential Causal Imitation Learning with Unobserved Confounders | Unknown | N/A | |
| Faster Non-asymptotic Convergence for Double Q-learning | Unknown | N/A | |
| CorticalFlow: A Diffeomorphic Mesh Transformer Network for Cortical Surface Reconstruction | Unknown | N/A | |
| Provably Strict Generalisation Benefit for Invariance in Kernel Methods | Unknown | N/A | |
| Accelerating Quadratic Optimization with Reinforcement Learning | Unknown | N/A | |
| Multi-armed Bandit Requiring Monotone Arm Sequences | Unknown | N/A | |
| Non-asymptotic convergence bounds for Wasserstein approximation using point clouds | Unknown | N/A | |
| Early-stopped neural networks are consistent | Unknown | N/A | |
| Class-agnostic Reconstruction of Dynamic Objects from Videos | Unknown | N/A | |
| Meta-Adaptive Nonlinear Control: Theory and Algorithms | Unknown | N/A | |
| A nonparametric method for gradual change problems with statistical guarantees | Unknown | N/A | |
| Analyzing the Generalization Capability of SGLD Using Properties of Gaussian Channels | Unknown | N/A | |
| An Uncertainty Principle is a Price of Privacy-Preserving Microdata | Unknown | N/A | |
| Robust Inverse Reinforcement Learning under Transition Dynamics Mismatch | Unknown | N/A | |
| Neural Routing by Memory | Unknown | N/A | |
| Improved Regret Bounds for Tracking Experts with Memory | Unknown | N/A | |
| Adversarial Attacks on Graph Classifiers via Bayesian Optimisation | Unknown | N/A | |
| Learning where to learn: Gradient sparsity in meta and continual learning | Unknown | N/A | |
| Fuzzy Clustering with Similarity Queries | Unknown | N/A | |
| User-Level Differentially Private Learning via Correlated Sampling | Unknown | N/A | |
| Learning to Generate Visual Questions with Noisy Supervision | Unknown | N/A | |
| Scaling Vision with Sparse Mixture of Experts | Unknown | N/A | |
| What training reveals about neural network complexity | Unknown | N/A | |
| Dimensionality Reduction for Wasserstein Barycenter | Unknown | N/A | |
| Gradient Starvation: A Learning Proclivity in Neural Networks | Unknown | N/A | |
| Reusing Combinatorial Structure: Faster Iterative Projections over Submodular Base Polytopes | Unknown | N/A | |
| Reinforcement Learning in Linear MDPs: Constant Regret and Representation Selection | Unknown | N/A | |
| Dynamic Resolution Network | Unknown | N/A | |
| Probabilistic Forecasting: A Level-Set Approach | Unknown | N/A | |
| Learnable Fourier Features for Multi-dimensional Spatial Positional Encoding | Unknown | N/A | |
| Pipeline Combinators for Gradual AutoML | Unknown | N/A | |
| Play to Grade: Testing Coding Games as Classifying Markov Decision Process | Unknown | N/A | |
| Leveraging Recursive Gumbel-Max Trick for Approximate Inference in Combinatorial Spaces | Unknown | N/A | |
| Collaborating with Humans without Human Data | Unknown | N/A | |
| Constrained Two-step Look-Ahead Bayesian Optimization | Unknown | N/A | |
| A Stochastic Newton Algorithm for Distributed Convex Optimization | Unknown | N/A | |
| Evaluation of Human-AI Teams for Learned and Rule-Based Agents in Hanabi | Unknown | N/A | |
| Adversarial Feature Desensitization | Unknown | N/A | |
| Shared Independent Component Analysis for Multi-Subject Neuroimaging | Unknown | N/A | |
| Nested Variational Inference | Unknown | N/A | |
| Generalized Depthwise-Separable Convolutions for Adversarially Robust and Efficient Neural Networks | Unknown | N/A | |
| Risk Minimization from Adaptively Collected Data: Guarantees for Supervised and Policy Learning | Unknown | N/A | |
| Pooling by Sliced-Wasserstein Embedding | Unknown | N/A | |
| Exploiting Local Convergence of Quasi-Newton Methods Globally: Adaptive Sample Size Approach | Unknown | N/A | |
| Meta-learning to Improve Pre-training | Unknown | N/A | |
| Stylized Dialogue Generation with Multi-Pass Dual Learning | Unknown | N/A | |
| Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble | Unknown | N/A | |
| Optimization-Based Algebraic Multigrid Coarsening Using Reinforcement Learning | Unknown | N/A | |
| Object-aware Contrastive Learning for Debiased Scene Representation | Unknown | N/A | |
| Dynamic Grained Encoder for Vision Transformers | Unknown | N/A | |
| Contrastively Disentangled Sequential Variational Autoencoder | Unknown | N/A | |
| A Surrogate Objective Framework for Prediction+Programming with Soft Constraints | Unknown | N/A | |
| The Adaptive Doubly Robust Estimator and a Paradox Concerning Logging Policy | Unknown | N/A | |
| Robust and Decomposable Average Precision for Image Retrieval | Unknown | N/A | |
| Learning Transferable Adversarial Perturbations | Unknown | N/A | |
| Efficient Online Estimation of Causal Effects by Deciding What to Observe | Unknown | N/A | |
| Pay Attention to MLPs | Unknown | N/A | |
| Robust Learning of Optimal Auctions | Unknown | N/A | |
| Asymptotics of the Bootstrap via Stability with Applications to Inference with Model Selection | Unknown | N/A | |
| Locally differentially private estimation of functionals of discrete distributions | Unknown | N/A | |
| Neural View Synthesis and Matching for Semi-Supervised Few-Shot Learning of 3D Pose | Unknown | N/A | |
| Breaking the Dilemma of Medical Image-to-image Translation | Unknown | N/A | |
| PDE-GCN: Novel Architectures for Graph Neural Networks Motivated by Partial Differential Equations | Unknown | N/A | |
| Machine Learning for Variance Reduction in Online Experiments | Unknown | N/A | |
| Learning with Noisy Correspondence for Cross-modal Matching | Unknown | N/A | |
| Indexed Minimum Empirical Divergence for Unimodal Bandits | Unknown | N/A | |
| Learning Graph Cellular Automata | Unknown | N/A | |
| The Skellam Mechanism for Differentially Private Federated Learning | Unknown | N/A | |
| Logarithmic Regret in Feature-based Dynamic Pricing | Unknown | N/A | |
| SNIPS: Solving Noisy Inverse Problems Stochastically | Unknown | N/A | |
| A Shading-Guided Generative Implicit Model for Shape-Accurate 3D-Aware Image Synthesis | Unknown | N/A | |
| Learning to Compose Visual Relations | Unknown | N/A | |
| Skyformer: Remodel Self-Attention with Gaussian Kernel and Nyström Method | Unknown | N/A | |
| Machine versus Human Attention in Deep Reinforcement Learning Tasks | Unknown | N/A | |
| Analytic Study of Families of Spurious Minima in Two-Layer ReLU Neural Networks: A Tale of Symmetry II | Unknown | N/A | |
| Global Convergence of Gradient Descent for Asymmetric Low-Rank Matrix Factorization | Unknown | N/A | |
| Coupled Segmentation and Edge Learning via Dynamic Graph Propagation | Unknown | N/A | |
| Online learning in MDPs with linear function approximation and bandit feedback | Unknown | N/A | |
| Dual Adaptivity: A Universal Algorithm for Minimizing the Adaptive Regret of Convex Functions | Unknown | N/A | |
| Temporal-attentive Covariance Pooling Networks for Video Recognition | Unknown | N/A | |
| Improving Conditional Coverage via Orthogonal Quantile Regression | Unknown | N/A | |
| Speech-T: Transducer for Text to Speech and Beyond | Unknown | N/A | |
| Machine learning structure preserving brackets for forecasting irreversible processes | Unknown | N/A | |
| TransformerFusion: Monocular RGB Scene Reconstruction using Transformers | Unknown | N/A | |
| Group Equivariant Subsampling | Unknown | N/A | |
| GRIN: Generative Relation and Intention Network for Multi-agent Trajectory Prediction | Unknown | N/A | |
| Tree in Tree: from Decision Trees to Decision Graphs | Unknown | N/A | |
| Generalized Proximal Policy Optimization with Sample Reuse | Unknown | N/A | |
| DECAF: Generating Fair Synthetic Data Using Causally-Aware Generative Networks | Unknown | N/A | |
| Diverse Message Passing for Attribute with Heterophily | Unknown | N/A | |
| Matching a Desired Causal State via Shift Interventions | Unknown | N/A | |
| Learning to Assimilate in Chaotic Dynamical Systems | Unknown | N/A | |
| Independent mechanism analysis, a new concept? | Unknown | N/A | |
| Representation Costs of Linear Neural Networks: Analysis and Design | Unknown | N/A | |
| Active Learning of Convex Halfspaces on Graphs | Unknown | N/A | |
| Environment Generation for Zero-Shot Compositional Reinforcement Learning | Unknown | N/A | |
| ScaleCert: Scalable Certified Defense against Adversarial Patches with Sparse Superficial Layers | Unknown | N/A | |
| Grounding inductive biases in natural images: invariance stems from variations in data | Unknown | N/A | |
| Efficient Statistical Assessment of Neural Network Corruption Robustness | Unknown | N/A | |
| Understanding Negative Samples in Instance Discriminative Self-supervised Representation Learning | Unknown | N/A | |
| argmax centroid | Unknown | N/A | |
| Rethinking the Variational Interpretation of Accelerated Optimization Methods | Unknown | N/A | |
| Adaptive Proximal Gradient Methods for Structured Neural Networks | Unknown | N/A | |
| Decision Transformer: Reinforcement Learning via Sequence Modeling | Unknown | N/A | |
| Scaling Neural Tangent Kernels via Sketching and Random Features | Unknown | N/A | |
| Fast Pure Exploration via Frank-Wolfe | Unknown | N/A | |
| Learning with Algorithmic Supervision via Continuous Relaxations | Unknown | N/A | |
| SLAPS: Self-Supervision Improves Structure Learning for Graph Neural Networks | Unknown | N/A | |
| On the Variance of the Fisher Information for Deep Learning | Unknown | N/A | |
| On the Validity of Modeling SGD with Stochastic Differential Equations (SDEs) | Unknown | N/A | |
| How Should Pre-Trained Language Models Be Fine-Tuned Towards Adversarial Robustness? | Unknown | N/A | |
| Twins: Revisiting the Design of Spatial Attention in Vision Transformers | Unknown | N/A | |
| Auditing Black-Box Prediction Models for Data Minimization Compliance | Unknown | N/A | |
| Regularized Softmax Deep Multi-Agent Q-Learning | Unknown | N/A | |
| BlendGAN: Implicitly GAN Blending for Arbitrary Stylized Face Generation | Unknown | N/A | |
| Task-Agnostic Undesirable Feature Deactivation Using Out-of-Distribution Data | Unknown | N/A | |
| When Expressivity Meets Trainability: Fewer than $n$ Neurons Can Work | Unknown | N/A | |
| Detecting Individual Decision-Making Style: Exploring Behavioral Stylometry in Chess | Unknown | N/A | |
| Light Field Networks: Neural Scene Representations with Single-Evaluation Rendering | Unknown | N/A | |
| Contrastive Graph Poisson Networks: Semi-Supervised Learning with Extremely Limited Labels | Unknown | N/A | |
| Evolution Gym: A Large-Scale Benchmark for Evolving Soft Robots | Unknown | N/A | |
| Invertible Tabular GANs: Killing Two Birds with One Stone for Tabular Data Synthesis | Unknown | N/A | |
| Not All Images are Worth 16x16 Words: Dynamic Transformers for Efficient Image Recognition | Unknown | N/A | |
| Learning Debiased and Disentangled Representations for Semantic Segmentation | Unknown | N/A | |
| Stability and Generalization of Bilevel Programming in Hyperparameter Optimization | Unknown | N/A | |
| A Compositional Atlas of Tractable Circuit Operations for Probabilistic Inference | Unknown | N/A | |
| Tractable Density Estimation on Learned Manifolds with Conformal Embedding Flows | Unknown | N/A | |
| Adversarial Robustness of Streaming Algorithms through Importance Sampling | Unknown | N/A | |
| Video Instance Segmentation using Inter-Frame Communication Transformers | Unknown | N/A | |
| Towards Tight Communication Lower Bounds for Distributed Optimisation | Unknown | N/A | |
| Deep Bandits Show-Off: Simple and Efficient Exploration with Deep Networks | Unknown | N/A | |
| Uncertainty Quantification and Deep Ensembles | Unknown | N/A | |
| BooVI: Provably Efficient Bootstrapped Value Iteration | Unknown | N/A | |
| A Framework to Learn with Interpretation | Unknown | N/A | |
| Learning Tree Interpretation from Object Representation for Deep Reinforcement Learning | Unknown | N/A | |
| Align before Fuse: Vision and Language Representation Learning with Momentum Distillation | Unknown | N/A | |
| Improving Robustness using Generated Data | Unknown | N/A | |
| Model Selection for Bayesian Autoencoders | Unknown | N/A | |
| Generalization Error Rates in Kernel Regression: The Crossover from the Noiseless to Noisy Regime | Unknown | N/A | |
| On Path Integration of Grid Cells: Group Representation and Isotropic Scaling | Unknown | N/A | |
| Unfolding Taylor's Approximations for Image Restoration | Unknown | N/A | |
| Towards Lower Bounds on the Depth of ReLU Neural Networks | Unknown | N/A | |
| Deep Self-Dissimilarities as Powerful Visual Fingerprints | Unknown | N/A | |
| UFC-BERT: Unifying Multi-Modal Controls for Conditional Image Synthesis | Unknown | N/A | |
| Conformal Prediction using Conditional Histograms | Unknown | N/A | |
| D2C: Diffusion-Decoding Models for Few-Shot Conditional Generation | Unknown | N/A | |
| Distributed Machine Learning with Sparse Heterogeneous Data | Unknown | N/A | |
| Associative Memories via Predictive Coding | Unknown | N/A | |
| Adaptive Data Augmentation on Temporal Graphs | Unknown | N/A | |
| Novel Visual Category Discovery with Dual Ranking Statistics and Mutual Knowledge Distillation | Unknown | N/A | |
| Finding Optimal Tangent Points for Reducing Distortions of Hard-label Attacks | Unknown | N/A | |
| SyMetric: Measuring the Quality of Learnt Hamiltonian Dynamics Inferred from Vision | Unknown | N/A | |
| Learning to Iteratively Solve Routing Problems with Dual-Aspect Collaborative Transformer | Unknown | N/A | |
| Meta-Learning Reliable Priors in the Function Space | Unknown | N/A | |
| Neural Ensemble Search for Uncertainty Estimation and Dataset Shift | Unknown | N/A | |
| Federated-EM with heterogeneity mitigation and variance reduction | Unknown | N/A | |
| Recovering Latent Causal Factor for Generalization to Distributional Shifts | Unknown | N/A | |
| Dual-stream Network for Visual Recognition | Unknown | N/A | |
| Shape As Points: A Differentiable Poisson Solver | Unknown | N/A | |
| Spatio-Temporal Variational Gaussian Processes | Unknown | N/A | |
| Smoothness Matrices Beat Smoothness Constants: Better Communication Compression Techniques for Distributed Optimization | Unknown | N/A | |
| Long-Short Transformer: Efficient Transformers for Language and Vision | Unknown | N/A | |
| Momentum Centering and Asynchronous Update for Adaptive Gradient Methods | Unknown | N/A | |
| Unadversarial Examples: Designing Objects for Robust Vision | Unknown | N/A | |
| Reward is enough for convex MDPs | Unknown | N/A | |
| Dangers of Bayesian Model Averaging under Covariate Shift | Unknown | N/A | |
| Differentially Private Sampling from Distributions | Unknown | N/A | |
| Provably efficient, succinct, and precise explanations | Unknown | N/A | |
| Storchastic: A Framework for General Stochastic Automatic Differentiation | Unknown | N/A | |
| Differentially Private Multi-Armed Bandits in the Shuffle Model | Unknown | N/A | |
| Program Synthesis Guided Reinforcement Learning for Partially Observed Environments | Unknown | N/A | |
| Offline Constrained Multi-Objective Reinforcement Learning via Pessimistic Dual Value Iteration | Unknown | N/A | |
| Hessian Eigenspectra of More Realistic Nonlinear Models | Unknown | N/A | |
| Motif-based Graph Self-Supervised Learning for Molecular Property Prediction | Unknown | N/A | |
| Causal-BALD: Deep Bayesian Active Learning of Outcomes to Infer Treatment-Effects from Observational Data | Unknown | N/A | |
| Subgraph Federated Learning with Missing Neighbor Generation | Unknown | N/A | |
| Learning Policies with Zero or Bounded Constraint Violation for Constrained MDPs | Unknown | N/A | |
| Provably Efficient Causal Reinforcement Learning with Confounded Observational Data | Unknown | N/A | |
| Simple steps are all you need: Frank-Wolfe and generalized self-concordant functions | Unknown | N/A | |
| A Kernel-based Test of Independence for Cluster-correlated Data | Unknown | N/A | |
| Unique sparse decomposition of low rank matrices | Unknown | N/A | |
| Data Augmentation Can Improve Robustness | Unknown | N/A | |
| Fair Sequential Selection Using Supervised Learning Models | Unknown | N/A | |
| Tuning Mixed Input Hyperparameters on the Fly for Efficient Population Based AutoRL | Unknown | N/A | |
| Escaping Saddle Points with Compressed SGD | Unknown | N/A | |
| The Difficulty of Passive Learning in Deep Reinforcement Learning | Unknown | N/A | |
| Multilingual Pre-training with Universal Dependency Learning | Unknown | N/A | |
| Self-Supervised Bug Detection and Repair | Unknown | N/A | |
| Neural Trees for Learning on Graphs | Unknown | N/A | |
| When Is Generalizable Reinforcement Learning Tractable? | Unknown | N/A | |
| On the Representation of Solutions to Elliptic PDEs in Barron Spaces | Unknown | N/A | |
| Towards optimally abstaining from prediction with OOD test examples | Unknown | N/A | |
| Precise characterization of the prior predictive distribution of deep ReLU networks | Unknown | N/A | |
| Random Shuffling Beats SGD Only After Many Epochs on Ill-Conditioned Problems | Unknown | N/A | |
| Looking Beyond Single Images for Contrastive Semantic Segmentation Learning | Unknown | N/A | |
| Rethinking Neural Operations for Diverse Tasks | Unknown | N/A | |
| Training Neural Networks is ER-complete | Unknown | N/A | |
| ErrorCompensatedX: error compensation for variance reduced algorithms | Unknown | N/A | |
| Densely connected normalizing flows | Unknown | N/A | |
| Collaborative Causal Discovery with Atomic Interventions | Unknown | N/A | |
| An Even More Optimal Stochastic Optimization Algorithm: Minibatching and Interpolation Learning | Unknown | N/A | |
| SOPE: Spectrum of Off-Policy Estimators | Unknown | N/A | |
| Learning with User-Level Privacy | Unknown | N/A | |
| Neural Tangent Kernel Maximum Mean Discrepancy | Unknown | N/A | |
| Estimating the Long-Term Effects of Novel Treatments | Unknown | N/A | |
| Autoformer: Decomposition Transformers with Auto-Correlation for Long-Term Series Forecasting | Unknown | N/A | |
| Bandit Quickest Changepoint Detection | Unknown | N/A | |
| OpenMatch: Open-Set Semi-supervised Learning with Open-set Consistency Regularization | Unknown | N/A | |
| How Well do Feature Visualizations Support Causal Understanding of CNN Activations? | Unknown | N/A | |
| Margin-Independent Online Multiclass Learning via Convex Geometry | Unknown | N/A | |
| Does enforcing fairness mitigate biases caused by subpopulation shift? | Unknown | N/A | |
| Batch Active Learning at Scale | Unknown | N/A | |
| Variational Bayesian Optimistic Sampling | Unknown | N/A | |
| Mind the Gap: Assessing Temporal Generalization in Neural Language Models | Unknown | N/A | |
| Automated Dynamic Mechanism Design | Unknown | N/A | |
| On the Suboptimality of Thompson Sampling in High Dimensions | Unknown | N/A | |
| Interventional Sum-Product Networks: Causal Inference with Tractable Probabilistic Models | Unknown | N/A | |
| Deep Neural Networks as Point Estimates for Deep Gaussian Processes | Unknown | N/A | |
| Learning Treatment Effects in Panels with General Intervention Patterns | Unknown | N/A | |
| PiRank: Scalable Learning To Rank via Differentiable Sorting | Unknown | N/A | |
| Ranking Policy Decisions | Unknown | N/A | |
| Local Disentanglement in Variational Auto-Encoders Using Jacobian $L_1$ Regularization | Unknown | N/A | |
| CoAtNet: Marrying Convolution and Attention for All Data Sizes | Unknown | N/A | |
| Multiple Descent: Design Your Own Generalization Curve | Unknown | N/A | |
| Generating High-Quality Explanations for Navigation in Partially-Revealed Environments | Unknown | N/A | |
| Solving Soft Clustering Ensemble via $k$-Sparse Discrete Wasserstein Barycenter | Unknown | N/A | |
| Learning Models for Actionable Recourse | Unknown | N/A | |
| A variational approximate posterior for the deep Wishart process | Unknown | N/A | |
| Bayesian decision-making under misspecified priors with applications to meta-learning | Unknown | N/A | |
| Infinite Time Horizon Safety of Bayesian Neural Networks | Unknown | N/A | |
| Network-to-Network Regularization: Enforcing Occam's Razor to Improve Generalization | Unknown | N/A | |
| Pretraining Representations for Data-Efficient Reinforcement Learning | Unknown | N/A | |
| Domain Adaptation with Invariant Representation Learning: What Transformations to Learn? | Unknown | N/A | |
| BayesIMP: Uncertainty Quantification for Causal Data Fusion | Unknown | N/A | |
| Self-Interpretable Model with Transformation Equivariant Interpretation | Unknown | N/A | |
| Generalization Bounds for Meta-Learning via PAC-Bayes and Uniform Stability | Unknown | N/A | |
| Roto-translated Local Coordinate Frames For Interacting Dynamical Systems | Unknown | N/A | |
| Distributed Zero-Order Optimization under Adversarial Noise | Unknown | N/A | |
| Scalable Inference in SDEs by Direct Matching of the Fokker–Planck–Kolmogorov Equation | Unknown | N/A | |
| Parallelizing Thompson Sampling | Unknown | N/A | |
| Differential Privacy Over Riemannian Manifolds | Unknown | N/A | |
| GradInit: Learning to Initialize Neural Networks for Stable and Efficient Training | Unknown | N/A | |
| Sliced Mutual Information: A Scalable Measure of Statistical Dependence | Unknown | N/A | |
| Smooth Bilevel Programming for Sparse Regularization | Unknown | N/A | |
| Hamiltonian Dynamics with Non-Newtonian Momentum for Rapid Sampling | Unknown | N/A | |
| Variance-Aware Off-Policy Evaluation with Linear Function Approximation | Unknown | N/A | |
| On the Representation Power of Set Pooling Networks | Unknown | N/A | |
| Dimension-free empirical entropy estimation | Unknown | N/A | |
| Geometry Processing with Neural Fields | Unknown | N/A | |
| Provably efficient multi-task reinforcement learning with model transfer | Unknown | N/A | |
| DominoSearch: Find layer-wise fine-grained N:M sparse schemes from dense neural networks | Unknown | N/A | |
| Deep Synoptic Monte-Carlo Planning in Reconnaissance Blind Chess | Unknown | N/A | |
| Attention over Learned Object Embeddings Enables Complex Visual Reasoning | Unknown | N/A | |
| Unbalanced Optimal Transport through Non-negative Penalized Linear Regression | Unknown | N/A | |
| Closing the Gap: Tighter Analysis of Alternating Stochastic Gradient Methods for Bilevel Problems | Unknown | N/A | |
| A Topological Perspective on Causal Inference | Unknown | N/A | |
| Shifted Chunk Transformer for Spatio-Temporal Representational Learning | Unknown | N/A | |
| Distilling Meta Knowledge on Heterogeneous Graph for Illicit Drug Trafficker Detection on Social Media | Unknown | N/A | |
| Continuous-time edge modelling using non-parametric point processes | Unknown | N/A | |
| Argmax Flows and Multinomial Diffusion: Learning Categorical Distributions | Unknown | N/A | |
| Intriguing Properties of Vision Transformers | Unknown | N/A | |
| Arbitrary Conditional Distributions with Energy | Unknown | N/A | |
| Why Do Pretrained Language Models Help in Downstream Tasks? An Analysis of Head and Prompt Tuning | Unknown | N/A | |
| UCB-based Algorithms for Multinomial Logistic Regression Bandits | Unknown | N/A | |
| BooVAE: Boosting Approach for Continual Learning of VAE | Unknown | N/A | |
| Conditionally Parameterized, Discretization-Aware Neural Networks for Mesh-Based Modeling of Physical Systems | Unknown | N/A | |
| Why Spectral Normalization Stabilizes GANs: Analysis and Improvements | Unknown | N/A | |
| Rebounding Bandits for Modeling Satiation Effects | Unknown | N/A | |
| Efficient methods for Gaussian Markov random fields under sparse linear constraints | Unknown | N/A | |
| Physics-Integrated Variational Autoencoders for Robust and Interpretable Generative Modeling | Unknown | N/A | |
| Adversarially Robust 3D Point Cloud Recognition Using Self-Supervisions | Unknown | N/A | |
| Revisiting the Calibration of Modern Neural Networks | Unknown | N/A | |
| Understanding the Limits of Unsupervised Domain Adaptation via Data Poisoning | Unknown | N/A | |
| Formalizing Generalization and Adversarial Robustness of Neural Networks to Weight Perturbations | Unknown | N/A | |
| Generalized Jensen-Shannon Divergence Loss for Learning with Noisy Labels | Unknown | N/A | |
| Accommodating Picky Customers: Regret Bound and Exploration Complexity for Multi-Objective Reinforcement Learning | Unknown | N/A | |
| NN-Baker: A Neural-network Infused Algorithmic Framework for Optimization Problems on Geometric Intersection Graphs | Unknown | N/A | |
| Bootstrapping the Error of Oja's Algorithm | Unknown | N/A | |
| Towards Stable and Robust AdderNets | Unknown | N/A | |
| Probability Paths and the Structure of Predictions over Time | Unknown | N/A | |
| Brick-by-Brick: Combinatorial Construction with Deep Reinforcement Learning | Unknown | N/A | |
| Global Convergence of Online Optimization for Nonlinear Model Predictive Control | Unknown | N/A | |
| ProTo: Program-Guided Transformer for Program-Guided Tasks | Unknown | N/A | |
| Oracle-Efficient Regret Minimization in Factored MDPs with Unknown Structure | Unknown | N/A | |
| Observation-Free Attacks on Stochastic Bandits | Unknown | N/A | |
| Contrastive Learning of Global and Local Video Representations | Unknown | N/A | |
| A Theoretical Analysis of Fine-tuning with Linear Teachers | Unknown | N/A | |
| On Training Implicit Models | Unknown | N/A | |
| Implicit Bias of SGD for Diagonal Linear Networks: a Provable Benefit of Stochasticity | Unknown | N/A | |
| Reconstruction for Powerful Graph Representations | Unknown | N/A | |
| Deep Molecular Representation Learning via Fusing Physical and Chemical Information | Unknown | N/A | |
| Implicit Generative Copulas | Unknown | N/A | |
| Automatic Data Augmentation for Generalization in Reinforcement Learning | Unknown | N/A | |
| Local Hyper-Flow Diffusion | Unknown | N/A | |
| Analysis of Sensing Spectral for Signal Recovery under a Generalized Linear Model | Unknown | N/A | |
| Differentially Private Empirical Risk Minimization under the Fairness Lens | Unknown | N/A | |
| Adversarial Neuron Pruning Purifies Backdoored Deep Models | Unknown | N/A | |
| Large Scale Learning on Non-Homophilous Graphs: New Benchmarks and Strong Simple Methods | Unknown | N/A | |
| PLUR: A Unifying, Graph-Based View of Program Learning, Understanding, and Repair | Unknown | N/A | |
| Adversarial Intrinsic Motivation for Reinforcement Learning | Unknown | N/A | |
| Embedding Principle of Loss Landscape of Deep Neural Networks | Unknown | N/A | |
| Progressive Feature Interaction Search for Deep Sparse Network | Unknown | N/A | |
| Towards Multi-Grained Explainability for Graph Neural Networks | Unknown | N/A | |
| Multi-task Learning of Order-Consistent Causal Graphs | Unknown | N/A | |
| Sequence-to-Sequence Learning with Latent Neural Grammars | Unknown | N/A | |
| Causal Identification with Matrix Equations | Unknown | N/A | |
| Compressed Video Contrastive Learning | Unknown | N/A | |
| Low-Rank Subspaces in GANs | Unknown | N/A | |
| Accelerated Sparse Neural Training: A Provable and Efficient Method to Find N:M Transposable Masks | Unknown | N/A | |
| Differentiable rendering with perturbed optimizers | Unknown | N/A | |
| iFlow: Numerically Invertible Flows for Efficient Lossless Compression via a Uniform Coder | Unknown | N/A | |
| Controlled Text Generation as Continuous Optimization with Multiple Constraints | Unknown | N/A | |
| Dynamic Analysis of Higher-Order Coordination in Neuronal Assemblies via De-Sparsified Orthogonal Matching Pursuit | Unknown | N/A | |
| Best of Both Worlds: Practical and Theoretically Optimal Submodular Maximization in Parallel | Unknown | N/A | |
| Individual Privacy Accounting via a Rényi Filter | Unknown | N/A | |
| Improving Contrastive Learning on Imbalanced Data via Open-World Sampling | Unknown | N/A | |
| A Comprehensively Tight Analysis of Gradient Descent for PCA | Unknown | N/A | |
| CCVS: Context-aware Controllable Video Synthesis | Unknown | N/A | |
| Adaptive Ensemble Q-learning: Minimizing Estimation Bias via Error Feedback | Unknown | N/A | |
| Multi-Scale Representation Learning on Proteins | Unknown | N/A | |
| Exploring the Limits of Out-of-Distribution Detection | Unknown | N/A | |
| The best of both worlds: stochastic and adversarial episodic MDPs with unknown transition | Unknown | N/A | |
| The Value of Information When Deciding What to Learn | Unknown | N/A | |
| Minimax Regret for Stochastic Shortest Path | Unknown | N/A | |
| Tensor Normal Training for Deep Learning Models | Unknown | N/A | |
| Fair Algorithms for Multi-Agent Multi-Armed Bandits | Unknown | N/A | |
| Nested Graph Neural Networks | Unknown | N/A | |
| General Low-rank Matrix Optimization: Geometric Analysis and Sharper Bounds | Unknown | N/A | |
| Variational Bayesian Reinforcement Learning with Regret Bounds | Unknown | N/A | |
| A Gradient Method for Multilevel Optimization | Unknown | N/A | |
| A universal probabilistic spike count model reveals ongoing modulation of neural variability | Unknown | N/A | |
| Shape Registration in the Time of Transformers | Unknown | N/A | |
| Towards Instance-Optimal Offline Reinforcement Learning with Pessimism | Unknown | N/A | |
| Optimality of variational inference for stochastic block model with missing links | Unknown | N/A | |
| Dynamic Trace Estimation | Unknown | N/A | |
| Zero Time Waste: Recycling Predictions in Early Exit Neural Networks | Unknown | N/A | |
| Revisiting Discriminator in GAN Compression: A Generator-discriminator Cooperative Compression Scheme | Unknown | N/A | |
| Learning Student-Friendly Teacher Networks for Knowledge Distillation | Unknown | N/A | |
| Towards Best-of-All-Worlds Online Learning with Feedback Graphs | Unknown | N/A | |
| A$^2$-Net: Learning Attribute-Aware Hash Codes for Large-Scale Fine-Grained Image Retrieval | Unknown | N/A | |
| Progressive Coordinate Transforms for Monocular 3D Object Detection | Unknown | N/A | |
| Neural Human Performer: Learning Generalizable Radiance Fields for Human Performance Rendering | Unknown | N/A | |
| Learning and Generalization in RNNs | Unknown | N/A | |
| Counterfactual Explanations Can Be Manipulated | Unknown | N/A | |
| Scheduling jobs with stochastic holding costs | Unknown | N/A | |
| On the Value of Interaction and Function Approximation in Imitation Learning | Unknown | N/A | |
| Nonparametric estimation of continuous DPPs with kernel methods | Unknown | N/A | |
| Learning Disentangled Behavior Embeddings | Unknown | N/A | |
| Topic Modeling Revisited: A Document Graph-based Neural Network Perspective | Unknown | N/A | |
| Dueling Bandits with Adversarial Sleeping | Unknown | N/A | |
| Inverse-Weighted Survival Games | Unknown | N/A | |
| Identifiability in inverse reinforcement learning | Unknown | N/A | |
| Modular Gaussian Processes for Transfer Learning | Unknown | N/A | |
| Faster proximal algorithms for matrix optimization using Jacobi-based eigenvalue methods | Unknown | N/A | |
| Neural Relightable Participating Media Rendering | Unknown | N/A | |
| Time-series Generation by Contrastive Imitation | Unknown | N/A | |
| Exploiting Opponents Under Utility Constraints in Sequential Games | Unknown | N/A | |
| Model-Based Domain Generalization | Unknown | N/A | |
| The Elastic Lottery Ticket Hypothesis | Unknown | N/A | |
| Hybrid Regret Bounds for Combinatorial Semi-Bandits and Adversarial Linear Bandits | Unknown | N/A | |
| Learning Optimal Predictive Checklists | Unknown | N/A | |
| Learning Markov State Abstractions for Deep Reinforcement Learning | Unknown | N/A | |
| Learning to Elect | Unknown | N/A | |
| Projected GANs Converge Faster | Unknown | N/A | |
| Certifying Robustness to Programmable Data Bias in Decision Trees | Unknown | N/A | |
| M-FAC: Efficient Matrix-Free Approximations of Second-Order Information | Unknown | N/A | |
| Bellman Eluder Dimension: New Rich Classes of RL Problems, and Sample-Efficient Algorithms | Unknown | N/A | |
| Transformer in Transformer | Unknown | N/A | |
| Neural Scene Flow Prior | Unknown | N/A | |
| MAUVE: Measuring the Gap Between Neural Text and Human Text using Divergence Frontiers | Unknown | N/A | |
| Neural Rule-Execution Tracking Machine For Transformer-Based Text Generation | Unknown | N/A | |
| Dynamics-regulated kinematic policy for egocentric pose estimation | Unknown | N/A | |
| TransGAN: Two Pure Transformers Can Make One Strong GAN, and That Can Scale Up | Unknown | N/A | |
| A/B/n Testing with Control in the Presence of Subpopulations | Unknown | N/A | |
| EvoGrad: Efficient Gradient-Based Meta-Learning and Hyperparameter Optimization | Unknown | N/A | |
| Baby Intuitions Benchmark (BIB): Discerning the goals, preferences, and actions of others | Unknown | N/A | |
| Introspective Distillation for Robust Question Answering | Unknown | N/A | |
| Bandit Learning with Delayed Impact of Actions | Unknown | N/A | |
| DynamicViT: Efficient Vision Transformers with Dynamic Token Sparsification | Unknown | N/A | |
| Heavy Ball Neural Ordinary Differential Equations | Unknown | N/A | |
| Recurrent Submodular Welfare and Matroid Blocking Semi-Bandits | Unknown | N/A | |
| Online Learning Of Neural Computations From Sparse Temporal Feedback | Unknown | N/A | |
| PerSim: Data-Efficient Offline Reinforcement Learning with Heterogeneous Agents via Personalized Simulators | Unknown | N/A | |
| Testing Probabilistic Circuits | Unknown | N/A | |
| Aligning Pretraining for Detection via Object-Level Contrastive Learning | Unknown | N/A | |
| Perturbation Theory for the Information Bottleneck | Unknown | N/A | |
| Equilibrium Refinement for the Age of Machines: The One-Sided Quasi-Perfect Equilibrium | Unknown | N/A | |
| DRONE: Data-aware Low-rank Compression for Large NLP Models | Unknown | N/A | |
| Pseudo-Spherical Contrastive Divergence | Unknown | N/A | |
| How Fine-Tuning Allows for Effective Meta-Learning | Unknown | N/A | |
| Learning in Multi-Stage Decentralized Matching Markets | Unknown | N/A | |
| Structured in Space, Randomized in Time: Leveraging Dropout in RNNs for Efficient Training | Unknown | N/A | |
| Cross-view Geo-localization with Layer-to-Layer Transformer | Unknown | N/A | |
| Differential Privacy Dynamics of Langevin Diffusion and Noisy Gradient Descent | Unknown | N/A | |
| Flattening Sharpness for Dynamic Gradient Projection Memory Benefits Continual Learning | Unknown | N/A | |
| FINE Samples for Learning with Noisy Labels | Unknown | N/A | |
| Distributionally Robust Imitation Learning | Unknown | N/A | |
| Probabilistic Tensor Decomposition of Neural Population Spiking Activity | Unknown | N/A | |
| Change Point Detection via Multivariate Singular Spectrum Analysis | Unknown | N/A | |
| Mixability made efficient: Fast online multiclass logistic regression | Unknown | N/A | |
| Does Knowledge Distillation Really Work? | Unknown | N/A | |
| Risk Bounds and Calibration for a Smart Predict-then-Optimize Method | Unknown | N/A | |
NIPS 2022
| Title | Author | PDF_Link | Code_URL |
|---|---|---|---|
| SurDis: A Surface Discontinuity Dataset for Wearable Technology to Assist Blind Navigation in Urban Environments | Unknown | N/A | |
| MOMA-LRG: Language-Refined Graphs for Multi-Object Multi-Actor Activity Parsing | Unknown | N/A | |
| VLMbench: A Compositional Benchmark for Vision-and-Language Manipulation | Unknown | N/A | |
| AnimeRun: 2D Animation Visual Correspondence from Open Source 3D Movies | Unknown | N/A | |
| Pythae: Unifying Generative Autoencoders in Python - A Benchmarking Use Case | Unknown | N/A | |
| Ambiguous Images With Human Judgments for Robust Visual Event Classification | Unknown | N/A | |
| Towards Better Evaluation for Dynamic Link Prediction | Unknown | N/A | |
| pyKT: A Python Library to Benchmark Deep Learning based Knowledge Tracing Models | Unknown | N/A | |
| EgoTaskQA: Understanding Human Tasks in Egocentric Videos | Unknown | N/A | |
| Finding Naturally Occurring Physical Backdoors in Image Datasets | Unknown | N/A | |
| Characteristics of Harmful Text: Towards Rigorous Benchmarking of Language Models | Unknown | N/A | |
| GLOBEM Dataset: Multi-Year Datasets for Longitudinal Human Behavior Modeling Generalization | Unknown | N/A | |
| K-Radar: 4D Radar Object Detection for Autonomous Driving in Various Weather Conditions | Unknown | N/A | |
| PulseImpute: A Novel Benchmark Task for Pulsative Physiological Signal Imputation | Unknown | N/A | |
| Touch and Go: Learning from Human-Collected Vision and Touch | Unknown | N/A | |
| How Transferable are Video Representations Based on Synthetic Data? | Unknown | N/A | |
| Nocturne: a scalable driving benchmark for bringing multi-agent learning one step closer to the real world | Unknown | N/A | |
| How Would The Viewer Feel? Estimating Wellbeing From Video Scenarios | Unknown | N/A | |
| The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset | Unknown | N/A | |
| A Comprehensive Study on Large-Scale Graph Training: Benchmarking and Rethinking | Unknown | N/A | |
| SMPL: Simulated Industrial Manufacturing and Process Control Learning Environments | Unknown | N/A | |
| Hard ImageNet: Segmentations for Objects with Strong Spurious Cues | Unknown | N/A | |
| SCAMPS: Synthetics for Camera Measurement of Physiological Signals | Unknown | N/A | |
| OpenOOD: Benchmarking Generalized Out-of-Distribution Detection | Unknown | N/A | |
| A Large Scale Search Dataset for Unbiased Learning to Rank | Unknown | N/A | |
| FinRL-Meta: Market Environments and Benchmarks for Data-Driven Financial Reinforcement Learning | Unknown | N/A | |
| Tenrec: A Large-scale Multipurpose Benchmark Dataset for Recommender Systems | Unknown | N/A | |
| MineDojo: Building Open-Ended Embodied Agents with Internet-Scale Knowledge | Unknown | N/A | |
| Robustness Analysis of Video-Language Models Against Visual and Language Perturbations | Unknown | N/A | |
| ComMU: Dataset for Combinatorial Music Generation | Unknown | N/A | |
| BOND: Benchmarking Unsupervised Outlier Node Detection on Static Attributed Graphs | Unknown | N/A | |
| TAP-Vid: A Benchmark for Tracking Any Point in a Video | Unknown | N/A | |
| DABS 2.0: Improved Datasets and Algorithms for Universal Self-Supervision | Unknown | N/A | |
| Communicating Natural Programs to Humans and Machines | Unknown | N/A | |
| Benchmarking Heterogeneous Treatment Effect Models through the Lens of Interpretability | Unknown | N/A | |
| SkinCon: A skin disease dataset densely annotated by domain experts for fine-grained debugging and analysis | Unknown | N/A | |
| PROSPECT: Labeled Tandem Mass Spectrometry Dataset for Machine Learning in Proteomics | Unknown | N/A | |
| FACT: Learning Governing Abstractions Behind Integer Sequences | Unknown | N/A | |
| Turning the Tables: Biased, Imbalanced, Dynamic Tabular Datasets for ML Evaluation | Unknown | N/A | |
| PeRFception: Perception using Radiance Fields | Unknown | N/A | |
| A Greek Parliament Proceedings Dataset for Computational Linguistics and Political Analysis | Unknown | N/A | |
| Addressing Resource Scarcity across Sign Languages with Multilingual Pretraining and Unified-Vocabulary Datasets | Unknown | N/A | |
| TGEA 2.0: A Large-Scale Diagnostically Annotated Dataset with Benchmark Tasks for Text Generation of Pretrained Language Models | Unknown | N/A | |
| BackdoorBench: A Comprehensive Benchmark of Backdoor Learning | Unknown | N/A | |
| 3DOS: Towards 3D Open Set Learning - Benchmarking and Understanding Semantic Novelty Detection on Point Clouds | Unknown | N/A | |
| Flare7K: A Phenomenological Nighttime Flare Removal Dataset | Unknown | N/A | |
| DC-BENCH: Dataset Condensation Benchmark | Unknown | N/A | |
| Change Event Dataset for Discovery from Spatio-temporal Remote Sensing Imagery | Unknown | N/A | |
| CLiMB: A Continual Learning Benchmark for Vision-and-Language Tasks | Unknown | N/A | |
| Understanding Aesthetics with Language: A Photo Critique Dataset for Aesthetic Assessment | Unknown | N/A | |
| IKEA-Manual: Seeing Shape Assembly Step by Step | Unknown | N/A | |
| FLamby: Datasets and Benchmarks for Cross-Silo Federated Learning in Realistic Healthcare Settings | Unknown | N/A | |
| Ontologue: Declarative Benchmark Construction for Ontological Multi-Label Classification | Unknown | N/A | |
| MBW: Multi-view Bootstrapping in the Wild | Unknown | N/A | |
| Enabling Detailed Action Recognition Evaluation Through Video Dataset Augmentation | Unknown | N/A | |
| Sample Efficiency Matters: A Benchmark for Practical Molecular Optimization | Unknown | N/A | |
| NAS-Bench-Suite-Zero: Accelerating Research on Zero Cost Proxies | Unknown | N/A | |
| PEER: A Comprehensive and Multi-Task Benchmark for Protein Sequence Understanding | Unknown | N/A | |
| FlyView: a bio-informed optical flow truth dataset for visual navigation using panoramic stereo vision | Unknown | N/A | |
| Chartalist: Labeled Graph Datasets for UTXO and Account-based Blockchains | Unknown | N/A | |
| A Multi-Task Benchmark for Korean Legal Language Understanding and Judgement Prediction | Unknown | N/A | |
| APT-36K: A Large-scale Benchmark for Animal Pose Estimation and Tracking | Unknown | N/A | |
| OccGen: Selection of Real-world Multilingual Parallel Data Balanced in Gender within Occupations | Unknown | N/A | |
| GriddlyJS: A Web IDE for Reinforcement Learning | Unknown | N/A | |
| Model Zoos: A Dataset of Diverse Populations of Neural Network Models | Unknown | N/A | |
| Meta-Album: Multi-domain Meta-Dataset for Few-Shot Image Classification | Unknown | N/A | |
| Beyond Real-world Benchmark Datasets: An Empirical Study of Node Classification with GNNs | Unknown | N/A | |
| The Surprising Effectiveness of PPO in Cooperative Multi-Agent Games | Unknown | N/A | |
| Breaking Bad: A Dataset for Geometric Fracture and Reassembly | Unknown | N/A | |
| USB: A Unified Semi-supervised Learning Benchmark for Classification | Unknown | N/A | |
| NAS-Bench-Graph: Benchmarking Graph Neural Architecture Search | Unknown | N/A | |
| WinoGAViL: Gamified Association Benchmark to Challenge Vision-and-Language Models | Unknown | N/A | |
| BigBio: A Framework for Data-Centric Biomedical Natural Language Processing | Unknown | N/A | |
| MATE: Benchmarking Multi-Agent Reinforcement Learning in Distributed Target Coverage Control | Unknown | N/A | |
| CLEVRER-Humans: Describing Physical and Causal Events the Human Way | Unknown | N/A | |
| HandMeThat: Human-Robot Communication in Physical and Social Environments | Unknown | N/A | |
| OpenSRH: optimizing brain tumor surgery using intraoperative stimulated Raman histology | Unknown | N/A | |
| Pile of Law: Learning Responsible Data Filtering from the Law and a 256GB Open-Source Legal Dataset | Unknown | N/A | |
| BLOX: Macro Neural Architecture Search Benchmark and Algorithms | Unknown | N/A | |
| CEDe: A collection of expert-curated datasets with atom-level entity annotations for Optical Chemical Structure Recognition | Unknown | N/A | |
| M4Singer: A Multi-Style, Multi-Singer and Musical Score Provided Mandarin Singing Corpus | Unknown | N/A | |
| LAION-5B: An open large-scale dataset for training next generation image-text models | Unknown | N/A | |
| MSDS: A Large-Scale Chinese Signature and Token Digit String Dataset for Handwriting Verification | Unknown | N/A | |
| Benchmarking and Analyzing 3D Human Pose and Shape Estimation Beyond Algorithms | Unknown | N/A | |
| ViSioNS: Visual Search in Natural Scenes Benchmark | Unknown | N/A | |
| mRI: Multi-modal 3D Human Pose Estimation Dataset using mmWave, RGB-D, and Inertial Sensors | Unknown | N/A | |
| ActionSense: A Multimodal Dataset and Recording Framework for Human Activities Using Wearable Sensors in a Kitchen Environment | Unknown | N/A | |
| The Dollar Street Dataset: Images Representing the Geographic and Socioeconomic Diversity of the World | Unknown | N/A | |
| FETA: Towards Specializing Foundational Models for Expert Task Applications | Unknown | N/A | |
| AnoShift: A Distribution Shift Benchmark for Unsupervised Anomaly Detection | Unknown | N/A | |
| Evaluating Out-of-Distribution Performance on Document Image Classifiers | Unknown | N/A | |
| Open High-Resolution Satellite Imagery: The WorldStrat Dataset – With Application to Super-Resolution | Unknown | N/A | |
| OLIVES Dataset: Ophthalmic Labels for Investigating Visual Eye Semantics | Unknown | N/A | |
| A Benchmark for Compositional Visual Reasoning | Unknown | N/A | |
| xView3-SAR: Detecting Dark Fishing Activity Using Synthetic Aperture Radar Imagery | Unknown | N/A | |
| Learning Long-Term Crop Management Strategies with CyclesGym | Unknown | N/A | |
| ETAB: A Benchmark Suite for Visual Representation Learning in Echocardiography | Unknown | N/A | |
| EHRSQL: A Practical Text-to-SQL Benchmark for Electronic Health Records | Unknown | N/A | |
| GOOD: A Graph Out-of-Distribution Benchmark | Unknown | N/A | |
| Is one annotation enough? - A data-centric image classification benchmark for noisy and ambiguous label estimation | Unknown | N/A | |
| MTNeuro: A Benchmark for Evaluating Representations of Brain Structure Across Multiple Levels of Abstraction | Unknown | N/A | |
| CAESAR: An Embodied Simulator for Generating Multimodal Referring Expression Datasets | Unknown | N/A | |
| JAHS-Bench-201: A Foundation For Research On Joint Architecture And Hyperparameter Search | Unknown | N/A | |
| A Dataset for Efforts Towards Achieving the Sustainable Development Goal of Safe Working Environments | Unknown | N/A | |
| Forecasting Future World Events With Neural Networks | Unknown | N/A | |
| TwiBot-22: Towards Graph-Based Twitter Bot Detection | Unknown | N/A | |
| Avalon: A Benchmark for RL Generalization Using Procedurally Generated Worlds | Unknown | N/A | |
| Long Range Graph Benchmark | Unknown | N/A | |
| Geoclidean: Few-Shot Generalization in Euclidean Geometry | Unknown | N/A | |
| CARLANE: A Lane Detection Benchmark for Unsupervised Domain Adaptation from Simulation to multiple Real-World Domains | Unknown | N/A | |
| EnvPool: A Highly Parallel Reinforcement Learning Environment Execution Engine | Unknown | N/A | |
| How Well Do Unsupervised Learning Algorithms Model Human Real-time and Life-long Learning? | Unknown | N/A | |
| OpenFilter: A Framework to Democratize Research Access to Social Media AR Filters | Unknown | N/A | |
| Why do tree-based models still outperform deep learning on typical tabular data? | Unknown | N/A | |
| Multi-LexSum: Real-world Summaries of Civil Rights Lawsuits at Multiple Granularities | Unknown | N/A | |
| Wukong: A 100 Million Large-scale Chinese Cross-modal Pre-training Benchmark | Unknown | N/A | |
| Robustness Disparities in Face Detection | Unknown | N/A | |
| AMOS: A Large-Scale Abdominal Multi-Organ Benchmark for Versatile Medical Image Segmentation | Unknown | N/A | |
| TaiSu: A 166M Large-scale High-Quality Dataset for Chinese Vision-Language Pre-training | Unknown | N/A | |
| Wild-Time: A Benchmark of in-the-Wild Distribution Shift over Time | Unknown | N/A | |
| PDEBench: An Extensive Benchmark for Scientific Machine Learning | Unknown | N/A | |
| LIPS - Learning Industrial Physical Simulation benchmark suite | Unknown | N/A | |
| Towards Video Text Visual Question Answering: Benchmark and Baseline | Unknown | N/A | |
| SoundSpaces 2.0: A Simulation Platform for Visual-Acoustic Learning | Unknown | N/A | |
| NeoRL: A Near Real-World Benchmark for Offline Reinforcement Learning | Unknown | N/A | |
| NAS-Bench-360: Benchmarking Neural Architecture Search on Diverse Tasks | Unknown | N/A | |
| OpenFWI: Large-scale Multi-structural Benchmark Datasets for Full Waveform Inversion | Unknown | N/A | |
| METS-CoV: A Dataset of Medical Entity and Targeted Sentiment on COVID-19 Related Tweets | Unknown | N/A | |
| DGraph: A Large-Scale Financial Dataset for Graph Anomaly Detection | Unknown | N/A | |
| ConfLab: A Data Collection Concept, Dataset, and Benchmark for Machine Analysis of Free-Standing Social Interactions in the Wild | Unknown | N/A | |
| HAPI: A Large-scale Longitudinal Dataset of Commercial ML API Predictions | Unknown | N/A | |
| TempEL: Linking Dynamically Evolving and Newly Emerging Entities | Unknown | N/A | |
| ELEVATER: A Benchmark and Toolkit for Evaluating Language-Augmented Visual Models | Unknown | N/A | |
| A Survey and Datasheet Repository of Publicly Available US Criminal Justice Datasets | Unknown | N/A | |
| Myriad: a real-world testbed to bridge trajectory optimization and deep learning | Unknown | N/A | |
| TweetNERD - End to End Entity Linking Benchmark for Tweets | Unknown | N/A | |
| AutoWS-Bench-101: Benchmarking Automated Weak Supervision with 100 Labels | Unknown | N/A | |
| SafeBench: A Benchmarking Platform for Safety Evaluation of Autonomous Vehicles | Unknown | N/A | |
| This is the way: designing and compiling LEPISZCZE, a comprehensive NLP benchmark for Polish | Unknown | N/A | |
| A Unified Evaluation of Textual Backdoor Learning: Frameworks and Benchmarks | Unknown | N/A | |
| Kantorovich Strikes Back! Wasserstein GANs are not Optimal Transport? | Unknown | N/A | |
| DART: Articulated Hand Model with Diverse Accessories and Rich Textures | Unknown | N/A | |
| Active-Passive SimStereo - Benchmarking the Cross-Generalization Capabilities of Deep Learning-based Stereo Methods | Unknown | N/A | |
| CGLB: Benchmark Tasks for Continual Graph Learning | Unknown | N/A | |
| ADBench: Anomaly Detection Benchmark | Unknown | N/A | |
| A new dataset for multilingual keyphrase generation | Unknown | N/A | |
| Unravelling the Performance of Physics-informed Graph Neural Networks for Dynamical Systems | Unknown | N/A | |
| DDXPlus: A New Dataset For Automatic Medical Diagnosis | Unknown | N/A | |
| Video compression dataset and benchmark of learning-based video-quality metrics | Unknown | N/A | |
| Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning | Unknown | N/A | |
| MVP-N: A Dataset and Benchmark for Real-World Multi-View Object Classification | Unknown | N/A | |
| pFL-Bench: A Comprehensive Benchmark for Personalized Federated Learning | Unknown | N/A | |
| Dungeons and Data: A Large-Scale NetHack Dataset | Unknown | N/A | |
| OpenXAI: Towards a Transparent Evaluation of Model Explanations | Unknown | N/A | |
| Honor of Kings Arena: an Environment for Generalization in Competitive Reinforcement Learning | Unknown | N/A | |
| ENS-10: A Dataset For Post-Processing Ensemble Weather Forecasts | Unknown | N/A | |
| AirfRANS: High Fidelity Computational Fluid Dynamics Dataset for Approximating Reynolds-Averaged Navier–Stokes Solutions | Unknown | N/A | |
| EPIC-KITCHENS VISOR Benchmark: VIdeo Segmentations and Object Relations | Unknown | N/A | |
| Multilingual Abusive Comment Detection at Scale for Indic Languages | Unknown | N/A | |
| MoCapAct: A Multi-Task Dataset for Simulated Humanoid Control | Unknown | N/A | |
| FLAIR: Federated Learning Annotated Image Repository | Unknown | N/A | |
| StrokeRehab: A Benchmark Dataset for Sub-second Action Identification | Unknown | N/A | |
| Training Uncertainty-Aware Classifiers with Conformalized Deep Learning | Unknown | N/A | |
| Optimizing Relevance Maps of Vision Transformers Improves Robustness | Unknown | N/A | |
| Quantum Speedups of Optimizing Approximately Convex Functions with Applications to Logarithmic Regret Stochastic Convex Bandits | Unknown | N/A | |
| Low-rank lottery tickets: finding efficient low-rank neural networks via matrix differential equations | Unknown | N/A | |
| Towards Improving Faithfulness in Abstractive Summarization | Unknown | N/A | |
| SIREN: Shaping Representations for Detecting Out-of-Distribution Objects | Unknown | N/A | |
| Implicit Neural Representations with Levels-of-Experts | Unknown | N/A | |
| Uplifting Bandits | Unknown | N/A | |
| Infinite-Fidelity Coregionalization for Physical Simulation | Unknown | N/A | |
| RSA: Reducing Semantic Shift from Aggressive Augmentations for Self-supervised Learning | Unknown | N/A | |
| On the Effective Number of Linear Regions in Shallow Univariate ReLU Networks: Convergence Guarantees and Implicit Bias | Unknown | N/A | |
| On Infinite Separations Between Simple and Optimal Mechanisms | Unknown | N/A | |
| TANKBind: Trigonometry-Aware Neural NetworKs for Drug-Protein Binding Structure Prediction | Unknown | N/A | |
| Boosting the Transferability of Adversarial Attacks with Reverse Adversarial Perturbation | Unknown | N/A | |
| Automatic differentiation of nonsmooth iterative algorithms | Unknown | N/A | |
| Efficient coding, channel capacity, and the emergence of retinal mosaics | Unknown | N/A | |
| Decentralized Local Stochastic Extra-Gradient for Variational Inequalities | Unknown | N/A | |
| Toward Equation of Motion for Deep Neural Networks: Continuous-time Gradient Descent and Discretization Error Analysis | Unknown | N/A | |
| Synthetic Model Combination: An Instance-wise Approach to Unsupervised Ensemble Learning | Unknown | N/A | |
| Self-supervised Heterogeneous Graph Pre-training Based on Structural Clustering | Unknown | N/A | |
| Tiered Reinforcement Learning: Pessimism in the Face of Uncertainty and Constant Regret | Unknown | N/A | |
| Counterfactual Neural Temporal Point Process for Estimating Causal Influence of Misinformation on Social Media | Unknown | N/A | |
| Globally Gated Deep Linear Networks | Unknown | N/A | |
| Graph Scattering beyond Wavelet Shackles | Unknown | N/A | |
| Aligning individual brains with fused unbalanced Gromov Wasserstein | Unknown | N/A | |
| SoftPatch: Unsupervised Anomaly Detection with Noisy Data | Unknown | N/A | |
| Kernel Interpolation with Sparse Grids | Unknown | N/A | |
| Ask4Help: Learning to Leverage an Expert for Embodied Tasks | Unknown | N/A | |
| TUSK: Task-Agnostic Unsupervised Keypoints | Unknown | N/A | |
| Concept Activation Regions: A Generalized Framework For Concept-Based Explanations | Unknown | N/A | |
| Matrix Multiplicative Weights Updates in Quantum Zero-Sum Games: Conservation Laws & Recurrence | Unknown | N/A | |
| Posted Pricing and Dynamic Prior-independent Mechanisms with Value Maximizers | Unknown | N/A | |
| Training stochastic stabilized supralinear networks by dynamics-neutral growth | Unknown | N/A | |
| Chefs' Random Tables: Non-Trigonometric Random Features | Unknown | N/A | |
| NeuForm: Adaptive Overfitting for Neural Shape Editing | Unknown | N/A | |
| STaR: Bootstrapping Reasoning With Reasoning | Unknown | N/A | |
| A Causal Analysis of Harm | Unknown | N/A | |
| Network change point localisation under local differential privacy | Unknown | N/A | |
| DISCO: Adversarial Defense with Local Implicit Functions | Unknown | N/A | |
| Does GNN Pretraining Help Molecular Representation? | Unknown | N/A | |
| FedAvg with Fine Tuning: Local Updates Lead to Representation Learning | Unknown | N/A | |
| GET3D: A Generative Model of High Quality 3D Textured Shapes Learned from Images | Unknown | N/A | |
| Re-Analyze Gauss: Bounds for Private Matrix Approximation via Dyson Brownian Motion | Unknown | N/A | |
| Locating and Editing Factual Associations in GPT | Unknown | N/A | |
| Faster Linear Algebra for Distance Matrices | Unknown | N/A | |
| Causal Inference with Non-IID Data using Linear Graphical Models | Unknown | N/A | |
| Extra-Newton: A First Approach to Noise-Adaptive Accelerated Second-Order Methods | Unknown | N/A | |
| ALMA: Hierarchical Learning for Composite Multi-Agent Tasks | Unknown | N/A | |
| Diversified Recommendations for Agents with Adaptive Preferences | Unknown | N/A | |
| Optimizing Data Collection for Machine Learning | Unknown | N/A | |
| VeriDark: A Large-Scale Benchmark for Authorship Verification on the Dark Web | Unknown | N/A | |
| CoNT: Contrastive Neural Text Generation | Unknown | N/A | |
| Trajectory-guided Control Prediction for End-to-end Autonomous Driving: A Simple yet Strong Baseline | Unknown | N/A | |
| Zeroth-Order Negative Curvature Finding: Escaping Saddle Points without Gradients | Unknown | N/A | |
| Towards Practical Control of Singular Values of Convolutional Layers | Unknown | N/A | |
| Riemannian Neural SDE: Learning Stochastic Representations on Manifolds | Unknown | N/A | |
| Pragmatically Learning from Pedagogical Demonstrations in Multi-Goal Environments | Unknown | N/A | |
| A Contrastive Framework for Neural Text Generation | Unknown | N/A | |
| AnimeSR: Learning Real-World Super-Resolution Models for Animation Videos | Unknown | N/A | |
| Two-Stream Network for Sign Language Recognition and Translation | Unknown | N/A | |
| Multivariate Time-Series Forecasting with Temporal Polynomial Graph Neural Networks | Unknown | N/A | |
| Towards Out-of-Distribution Sequential Event Prediction: A Causal Treatment | Unknown | N/A | |
| Roadblocks for Temporarily Disabling Shortcuts and Learning New Knowledge | Unknown | N/A | |
| Learning Bipartite Graphs: Heavy Tails and Multiple Components | Unknown | N/A | |
| Vision Transformers provably learn spatial structure | Unknown | N/A | |
| Shape, Light, and Material Decomposition from Images using Monte Carlo Rendering and Denoising | Unknown | N/A | |
| Teach Less, Learn More: On the Undistillable Classes in Knowledge Distillation | Unknown | N/A | |
| Hand-Object Interaction Image Generation | Unknown | N/A | |
| Feature Learning in $L_2$-regularized DNNs: Attraction/Repulsion and Sparsity | Unknown | N/A | |
| Bridging the Gap: Unifying the Training and Evaluation of Neural Network Binary Classifiers | Unknown | N/A | |
| On the Discrimination Risk of Mean Aggregation Feature Imputation in Graphs | Unknown | N/A | |
| Efficient and Modular Implicit Differentiation | Unknown | N/A | |
| NeuroSchedule: A Novel Effective GNN-based Scheduling Method for High-level Synthesis | Unknown | N/A | |
| Recursive Reinforcement Learning | Unknown | N/A | |
| Making Sense of Dependence: Efficient Black-box Explanations Using Dependence Measure | Unknown | N/A | |
| Distribution-Informed Neural Networks for Domain Adaptation Regression | Unknown | N/A | |
| On the Interpretability of Regularisation for Neural Networks Through Model Gradient Similarity | Unknown | N/A | |
| Exploiting Semantic Relations for Glass Surface Detection | Unknown | N/A | |
| Doubly-Asynchronous Value Iteration: Making Value Iteration Asynchronous in Actions | Unknown | N/A | |
| Function Classes for Identifiable Nonlinear Independent Component Analysis | Unknown | N/A | |
| GMMSeg: Gaussian Mixture based Generative Semantic Segmentation Models | Unknown | N/A | |
| Recovering Private Text in Federated Learning of Language Models | Unknown | N/A | |
| Contrastive Language-Image Pre-Training with Knowledge Graphs | Unknown | N/A | |
| Fast Mixing of Stochastic Gradient Descent with Normalization and Weight Decay | Unknown | N/A | |
| Disentangling the Predictive Variance of Deep Ensembles through the Neural Tangent Kernel | Unknown | N/A | |
| Quantile Constrained Reinforcement Learning: A Reinforcement Learning Framework Constraining Outage Probability | Unknown | N/A | |
| Neural Payoff Machines: Predicting Fair and Stable Payoff Allocations Among Team Members | Unknown | N/A | |
| Diversity vs. Recognizability: Human-like generalization in one-shot generative models | Unknown | N/A | |
| SketchBoost: Fast Gradient Boosted Decision Tree for Multioutput Problems | Unknown | N/A | |
| DeVRF: Fast Deformable Voxel Radiance Fields for Dynamic Scenes | Unknown | N/A | |
| Emergence of Hierarchical Layers in a Single Sheet of Self-Organizing Spiking Neurons | Unknown | N/A | |
| Adapting to Online Label Shift with Provable Guarantees | Unknown | N/A | |
| Offline Multi-Agent Reinforcement Learning with Knowledge Distillation | Unknown | N/A | |
| Visual correspondence-based explanations improve AI robustness and human-AI team accuracy | Unknown | N/A | |
| Large-Scale Differentiable Causal Discovery of Factor Graphs | Unknown | N/A | |
| Near Instance-Optimal PAC Reinforcement Learning for Deterministic MDPs | Unknown | N/A | |
| A Conditional Randomization Test for Sparse Logistic Regression in High-Dimension | Unknown | N/A | |
| Learning Distributed and Fair Policies for Network Load Balancing as Markov Potential Game | Unknown | N/A | |
| On the SDEs and Scaling Rules for Adaptive Gradient Algorithms | Unknown | N/A | |
| Data Augmentation MCMC for Bayesian Inference from Privatized Data | Unknown | N/A | |
| Dynamic Tensor Product Regression | Unknown | N/A | |
| Introspective Learning: A Two-Stage approach for Inference in Neural Networks | Unknown | N/A | |
| Score-Based Diffusion meets Annealed Importance Sampling | Unknown | N/A | |
| Local Identifiability of Deep ReLU Neural Networks: the Theory | Unknown | N/A | |
| Deciding What to Model: Value-Equivalent Sampling for Reinforcement Learning | Unknown | N/A | |
| A Continuous Time Framework for Discrete Denoising Models | Unknown | N/A | |
| Are Two Heads the Same as One? Identifying Disparate Treatment in Fair Neural Networks | Unknown | N/A | |
| Infinite Recommendation Networks: A Data-Centric Approach | Unknown | N/A | |
| Neural Sheaf Diffusion: A Topological Perspective on Heterophily and Oversmoothing in GNNs | Unknown | N/A | |
| Learning Distributions Generated by Single-Layer ReLU Networks in the Presence of Arbitrary Outliers | Unknown | N/A | |
| RainNet: A Large-Scale Imagery Dataset and Benchmark for Spatial Precipitation Downscaling | Unknown | N/A | |
| VICRegL: Self-Supervised Learning of Local Visual Features | Unknown | N/A | |
| Learning to Find Proofs and Theorems by Learning to Refine Search Strategies: The Case of Loop Invariant Synthesis | Unknown | N/A | |
| Generalization for multiclass classification with overparameterized linear models | Unknown | N/A | |
| Okapi: Generalising Better by Making Statistical Matches Match | Unknown | N/A | |
| Deterministic Langevin Monte Carlo with Normalizing Flows for Bayesian Inference | Unknown | N/A | |
| Probabilistic Transformer: Modelling Ambiguities and Distributions for RNA Folding and Molecule Design | Unknown | N/A | |
| Adversarial Reprogramming Revisited | Unknown | N/A | |
| A Near-Optimal Best-of-Both-Worlds Algorithm for Online Learning with Feedback Graphs | Unknown | N/A | |
| Uncertainty Estimation Using Riemannian Model Dynamics for Offline Reinforcement Learning | Unknown | N/A | |
| Left Heavy Tails and the Effectiveness of the Policy and Value Networks in DNN-based best-first search for Sokoban Planning | Unknown | N/A | |
| The Pitfalls of Regularization in Off-Policy TD Learning | Unknown | N/A | |
| OmniVL: One Foundation Model for Image-Language and Video-Language Tasks | Unknown | N/A | |
| CCCP is Frank-Wolfe in disguise | Unknown | N/A | |
| Learning-based Motion Planning in Dynamic Environments Using GNNs and Temporal Encoding | Unknown | N/A | |
| Identifiability and generalizability from multiple experts in Inverse Reinforcement Learning | Unknown | N/A | |
| Adam Can Converge Without Any Modification On Update Rules | Unknown | N/A | |
| A Consistent and Differentiable Lp Canonical Calibration Error Estimator | Unknown | N/A | |
| Exploration-Guided Reward Shaping for Reinforcement Learning under Sparse Rewards | Unknown | N/A | |
| Detection and Localization of Changes in Conditional Distributions | Unknown | N/A | |
| TransTab: Learning Transferable Tabular Transformers Across Tables | Unknown | N/A | |
| Spatial Mixture-of-Experts | Unknown | N/A | |
| TransBoost: Improving the Best ImageNet Performance using Deep Transduction | Unknown | N/A | |
| A Multilabel Classification Framework for Approximate Nearest Neighbor Search | Unknown | N/A | |
| On Efficient Online Imitation Learning via Classification | Unknown | N/A | |
| Inherently Explainable Reinforcement Learning in Natural Language | Unknown | N/A | |
| Inverse Game Theory for Stackelberg Games: the Blessing of Bounded Rationality | Unknown | N/A | |
| Incrementality Bidding via Reinforcement Learning under Mixed and Delayed Rewards | Unknown | N/A | |
| $k$-Sliced Mutual Information: A Quantitative Study of Scalability with Dimension | Unknown | N/A | |
| A Direct Approximation of AIXI Using Logical State Abstractions | Unknown | N/A | |
| Transition to Linearity of General Neural Networks with Directed Acyclic Graph Architecture | Unknown | N/A | |
| Towards Efficient Post-training Quantization of Pre-trained Language Models | Unknown | N/A | |
| A Unified Analysis of Federated Learning with Arbitrary Client Participation | Unknown | N/A | |
| Self-supervised surround-view depth estimation with volumetric feature fusion | Unknown | N/A | |
| Robust Bayesian Regression via Hard Thresholding | Unknown | N/A | |
| On the Efficient Implementation of High Accuracy Optimality of Profile Maximum Likelihood | Unknown | N/A | |
| The price of unfairness in linear bandits with biased feedback | Unknown | N/A | |
| Cooperative Distribution Alignment via JSD Upper Bound | Unknown | N/A | |
| Censored Quantile Regression Neural Networks for Distribution-Free Survival Analysis | Unknown | N/A | |
| Dataset Inference for Self-Supervised Models | Unknown | N/A | |
| Active Learning Through a Covering Lens | Unknown | N/A | |
| Adversarially Robust Learning: A Generic Minimax Optimal Learner and Characterization | Unknown | N/A | |
| Wavelet Score-Based Generative Modeling | Unknown | N/A | |
| Efficiently Factorizing Boolean Matrices using Proximal Gradient Descent | Unknown | N/A | |
| The Curse of Unrolling: Rate of Differentiating Through Optimization | Unknown | N/A | |
| ZeroQuant: Efficient and Affordable Post-Training Quantization for Large-Scale Transformers | Unknown | N/A | |
| Dual-discriminative Graph Neural Network for Imbalanced Graph-level Anomaly Detection | Unknown | N/A | |
| Improved Convergence Rate of Stochastic Gradient Langevin Dynamics with Variance Reduction and its Application to Optimization | Unknown | N/A | |
| What You See is What You Classify: Black Box Attributions | Unknown | N/A | |
| A Closer Look at Prototype Classifier for Few-shot Image Classification | Unknown | N/A | |
| Graph Reordering for Cache-Efficient Near Neighbor Search | Unknown | N/A | |
| Trade-off between Payoff and Model Rewards in Shapley-Fair Collaborative Machine Learning | Unknown | N/A | |
| Muffliato: Peer-to-Peer Privacy Amplification for Decentralized Optimization and Averaging | Unknown | N/A | |
| Adaptively Exploiting d-Separators with Causal Bandits | Unknown | N/A | |
| Generative Neural Articulated Radiance Fields | Unknown | N/A | |
| Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos | Unknown | N/A | |
| Identification, Amplification and Measurement: A bridge to Gaussian Differential Privacy | Unknown | N/A | |
| BagFlip: A Certified Defense Against Data Poisoning | Unknown | N/A | |
| On the Convergence Theory for Hessian-Free Bilevel Algorithms | Unknown | N/A | |
| On the Sample Complexity of Stabilizing LTI Systems on a Single Trajectory | Unknown | N/A | |
| Gradient Descent Is Optimal Under Lower Restricted Secant Inequality And Upper Error Bound | Unknown | N/A | |
| Maximum Common Subgraph Guided Graph Retrieval: Late and Early Interaction Networks | Unknown | N/A | |
| Generalized Variational Inference in Function Spaces: Gaussian Measures meet Bayesian Deep Learning | Unknown | N/A | |
| Refining Low-Resource Unsupervised Translation by Language Disentanglement of Multilingual Translation Model | Unknown | N/A | |
| Analyzing Data-Centric Properties for Graph Contrastive Learning | Unknown | N/A | |
| RényiCL: Contrastive Representation Learning with Skew Rényi Divergence | Unknown | N/A | |
| Scalable Neural Video Representations with Learnable Positional Features | Unknown | N/A | |
| Towards Improving Calibration in Object Detection Under Domain Shift | Unknown | N/A | |
| GenSDF: Two-Stage Learning of Generalizable Signed Distance Functions | Unknown | N/A | |
| Approaching Quartic Convergence Rates for Quasi-Stochastic Approximation with Application to Gradient-Free Optimization | Unknown | N/A | |
| Neural Circuit Architectural Priors for Embodied Control | Unknown | N/A | |
| Quality Not Quantity: On the Interaction between Dataset Design and Robustness of CLIP | Unknown | N/A | |
| Understanding Deep Neural Function Approximation in Reinforcement Learning via $\epsilon$-Greedy Exploration | Unknown | N/A | |
| LIFT: Language-Interfaced Fine-Tuning for Non-language Machine Learning Tasks | Unknown | N/A | |
| Does Self-supervised Learning Really Improve Reinforcement Learning from Pixels? | Unknown | N/A | |
| Stochastic Multiple Target Sampling Gradient Descent | Unknown | N/A | |
| If Influence Functions are the Answer, Then What is the Question? | Unknown | N/A | |
| [Re] Replication Study of DECAF: Generating Fair Synthetic Data Using Causally-Aware Generative Networks | Unknown | N/A | |
| A Projection-free Algorithm for Constrained Stochastic Multi-level Composition Optimization | Unknown | N/A | |
| A composable machine-learning approach for steady-state simulations on high-resolution grids | Unknown | N/A | |
| Degradation-Aware Unfolding Half-Shuffle Transformer for Spectral Compressive Imaging | Unknown | N/A | |
| [Re] Background-Aware Pooling and Noise-Aware Loss for Weakly-Supervised Semantic Segmentation | Unknown | N/A | |
| Amortized Projection Optimization for Sliced Wasserstein Generative Models | Unknown | N/A | |
| Trading Off Resource Budgets For Improved Regret Bounds | Unknown | N/A | |
| Robust On-Policy Sampling for Data-Efficient Policy Evaluation in Reinforcement Learning | Unknown | N/A | |
| Matching in Multi-arm Bandit with Collision | Unknown | N/A | |
| Combinatorial Bandits with Linear Constraints: Beyond Knapsacks and Fairness | Unknown | N/A | |
| Evaluating Graph Generative Models with Contrastively Learned Features | Unknown | N/A | |
| Single-pass Streaming Lower Bounds for Multi-armed Bandits Exploration with Instance-sensitive Sample Complexity | Unknown | N/A | |
| The Minority Matters: A Diversity-Promoting Collaborative Metric Learning Algorithm | Unknown | N/A | |
| A Communication-Efficient Distributed Gradient Clipping Algorithm for Training Deep Neural Networks | Unknown | N/A | |
| Rate-Distortion Theoretic Bounds on Generalization Error for Distributed Learning | Unknown | N/A | |
| One-shot Neural Backdoor Erasing via Adversarial Weight Masking | Unknown | N/A | |
| Learning Generalizable Models for Vehicle Routing Problems via Knowledge Distillation | Unknown | N/A | |
| Multimodal Contrastive Learning with LIMoE: the Language-Image Mixture of Experts | Unknown | N/A | |
| Movement Penalized Bayesian Optimization with Application to Wind Energy Systems | Unknown | N/A | |
| Two-layer neural network on infinite dimensional data: global optimization guarantee in the mean-field regime | Unknown | N/A | |
| Efficient Aggregated Kernel Tests using Incomplete $U$-statistics | Unknown | N/A | |
| Recurrent Memory Transformer | Unknown | N/A | |
| Unsupervised Learning From Incomplete Measurements for Inverse Problems | Unknown | N/A | |
| An empirical analysis of compute-optimal large language model training | Unknown | N/A | |
| DIMES: A Differentiable Meta Solver for Combinatorial Optimization Problems | Unknown | N/A | |
| SHAQ: Incorporating Shapley Value Theory into Multi-Agent Q-Learning | Unknown | N/A | |
| House of Cans: Covert Transmission of Internal Datasets via Capacity-Aware Neuron Steganography | Unknown | N/A | |
| A Unifying Framework for Online Optimization with Long-Term Constraints | Unknown | N/A | |
| Better Best of Both Worlds Bounds for Bandits with Switching Costs | Unknown | N/A | |
| Renyi Differential Privacy of Propose-Test-Release and Applications to Private and Robust Machine Learning | Unknown | N/A | |
| Earthformer: Exploring Space-Time Transformers for Earth System Forecasting | Unknown | N/A | |
| Lower Bounds and Nearly Optimal Algorithms in Distributed Learning with Communication Compression | Unknown | N/A | |
| Look Around and Refer: 2D Synthetic Semantics Knowledge Distillation for 3D Visual Grounding | Unknown | N/A | |
| Variational inference via Wasserstein gradient flows | Unknown | N/A | |
| Efficient Risk-Averse Reinforcement Learning | Unknown | N/A | |
| Operator Splitting Value Iteration | Unknown | N/A | |
| Composite Feature Selection Using Deep Ensembles | Unknown | N/A | |
| From Gradient Flow on Population Loss to Learning with Stochastic Gradient Descent | Unknown | N/A | |
| Contrastive Adapters for Foundation Model Group Robustness | Unknown | N/A | |
| Domain Generalization by Learning and Removing Domain-specific Features | Unknown | N/A | |
| On Uncertainty, Tempering, and Data Augmentation in Bayesian Classification | Unknown | N/A | |
| Physics-Embedded Neural Networks: Graph Neural PDE Solvers with Mixed Boundary Conditions | Unknown | N/A | |
| SAVi++: Towards End-to-End Object-Centric Learning from Real-World Videos | Unknown | N/A | |
| Debiased Self-Training for Semi-Supervised Learning | Unknown | N/A | |
| Learning Recourse on Instance Environment to Enhance Prediction Accuracy | Unknown | N/A | |
| Differentially Private Learning with Margin Guarantees | Unknown | N/A | |
| Provable General Function Class Representation Learning in Multitask Bandits and MDP | Unknown | N/A | |
| Characterization of Excess Risk for Locally Strongly Convex Population Risk | Unknown | N/A | |
| Extrapolative Continuous-time Bayesian Neural Network for Fast Training-free Test-time Adaptation | Unknown | N/A | |
| SNAKE: Shape-aware Neural 3D Keypoint Field | Unknown | N/A | |
| SIXO: Smoothing Inference with Twisted Objectives | Unknown | N/A | |
| Learning Articulated Rigid Body Dynamics with Lagrangian Graph Neural Network | Unknown | N/A | |
| Gradient Descent: The Ultimate Optimizer | Unknown | N/A | |
| Self-Consistent Dynamical Field Theory of Kernel Evolution in Wide Neural Networks | Unknown | N/A | |
| Batch size-invariance for policy optimization | Unknown | N/A | |
| Distributionally robust weighted k-nearest neighbors | Unknown | N/A | |
| On the Importance of Gradient Norm in PAC-Bayesian Bounds | Unknown | N/A | |
| Fair Bayes-Optimal Classifiers Under Predictive Parity | Unknown | N/A | |
| Analyzing Lottery Ticket Hypothesis from PAC-Bayesian Theory Perspective | Unknown | N/A | |
| Counterfactual Fairness with Partially Known Causal Graph | Unknown | N/A | |
| When Privacy Meets Partial Information: A Refined Analysis of Differentially Private Bandits | Unknown | N/A | |
| Efficient identification of informative features in simulation-based inference | Unknown | N/A | |
| Transform Once: Efficient Operator Learning in Frequency Domain | Unknown | N/A | |
| Expansion and Shrinkage of Localization for Weakly-Supervised Semantic Segmentation | Unknown | N/A | |
| Deep Active Learning by Leveraging Training Dynamics | Unknown | N/A | |
| Rate-Optimal Online Convex Optimization in Adaptive Linear Control | Unknown | N/A | |
| SAPipe: Staleness-Aware Pipeline for Data Parallel DNN Training | Unknown | N/A | |
| Understanding Programmatic Weak Supervision via Source-aware Influence Function | Unknown | N/A | |
| Mind Reader: Reconstructing complex images from brain activities | Unknown | N/A | |
| A Neural Corpus Indexer for Document Retrieval | Unknown | N/A | |
| CUP: Critic-Guided Policy Reuse | Unknown | N/A | |
| Low-Rank Modular Reinforcement Learning via Muscle Synergy | Unknown | N/A | |
| RORL: Robust Offline Reinforcement Learning via Conservative Smoothing | Unknown | N/A | |
| Safe Opponent-Exploitation Subgame Refinement | Unknown | N/A | |
| LAPO: Latent-Variable Advantage-Weighted Policy Optimization for Offline Reinforcement Learning | Unknown | N/A | |
| A Primer for Neural Arithmetic Logic Modules | Unknown | N/A | |
| Improving Task-Specific Generalization in Few-Shot Learning via Adaptive Vicinal Risk Minimization | Unknown | N/A | |
| Chroma-VAE: Mitigating Shortcut Learning with Generative Classifiers | Unknown | N/A | |
| Look More but Care Less in Video Recognition | Unknown | N/A | |
| Adversarial Task Up-sampling for Meta-learning | Unknown | N/A | |
| Let Images Give You More: Point Cloud Cross-Modal Training for Shape Analysis | Unknown | N/A | |
| Peer Prediction for Learning Agents | Unknown | N/A | |
| Cache-Augmented Inbatch Importance Resampling for Training Recommender Retriever | Unknown | N/A | |
| Interaction Modeling with Multiplex Attention | Unknown | N/A | |
| Learning to Configure Computer Networks with Neural Algorithmic Reasoning | Unknown | N/A | |
| Can Adversarial Training Be Manipulated By Non-Robust Features? | Unknown | N/A | |
| Uncertainty-Aware Hierarchical Refinement for Incremental Implicitly-Refined Classification | Unknown | N/A | |
| MGNNI: Multiscale Graph Neural Networks with Implicit Layers | Unknown | N/A | |
| Discrete Compositional Representations as an Abstraction for Goal Conditioned Reinforcement Learning | Unknown | N/A | |
| MoVQ: Modulating Quantized Vectors for High-Fidelity Image Generation | Unknown | N/A | |
| Learning Mixed Multinomial Logits with Provable Guarantees | Unknown | N/A | |
| Relational Reasoning via Set Transformers: Provable Efficiency and Applications to MARL | Unknown | N/A | |
| Hilbert Distillation for Cross-Dimensionality Networks | Unknown | N/A | |
| Recurrent Video Restoration Transformer with Guided Deformable Attention | Unknown | N/A | |
| Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone | Unknown | N/A | |
| Unified Optimal Transport Framework for Universal Domain Adaptation | Unknown | N/A | |
| Learning Deep Input-Output Stable Dynamics | Unknown | N/A | |
| Batch Bayesian Optimization on Permutations using the Acquisition Weighted Kernel | Unknown | N/A | |
| Neural Topological Ordering for Computation Graphs | Unknown | N/A | |
| Memory Efficient Continual Learning with Transformers | Unknown | N/A | |
| Efficient Knowledge Distillation from Model Checkpoints | Unknown | N/A | |
| EvenNet: Ignoring Odd-Hop Neighbors Improves Robustness of Graph Neural Networks | Unknown | N/A | |
| SelecMix: Debiased Learning by Contradicting-pair Sampling | Unknown | N/A | |
| Coordinate Linear Variance Reduction for Generalized Linear Programming | Unknown | N/A | |
| Local Latent Space Bayesian Optimization over Structured Inputs | Unknown | N/A | |
| Memorization and Optimization in Deep Neural Networks with Minimum Over-parameterization | Unknown | N/A | |
| Debiasing Graph Neural Networks via Learning Disentangled Causal Substructure | Unknown | N/A | |
| Learning Robust Dynamics through Variational Sparse Gating | Unknown | N/A | |
| VER: Scaling On-Policy RL Leads to the Emergence of Navigation in Embodied Rearrangement | Unknown | N/A | |
| A Unified Framework for Deep Symbolic Regression | Unknown | N/A | |
| [Re] A Cluster-based Approach for Improving Isotropy in Contextual Embedding Space | Unknown | N/A | |
| Is Sortition Both Representative and Fair? | Unknown | N/A | |
| All Politics is Local: Redistricting via Local Fairness | Unknown | N/A | |
| Learning Interface Conditions in Domain Decomposition Solvers | Unknown | N/A | |
| Off-Policy Evaluation for Action-Dependent Non-stationary Environments | Unknown | N/A | |
| Factored DRO: Factored Distributionally Robust Policies for Contextual Bandits | Unknown | N/A | |
| Causal Discovery in Linear Latent Variable Models Subject to Measurement Error | Unknown | N/A | |
| Human-AI Collaborative Bayesian Optimisation | Unknown | N/A | |
| SNN-RAT: Robustness-enhanced Spiking Neural Network through Regularized Adversarial Training | Unknown | N/A | |
| OOD Link Prediction Generalization Capabilities of Message-Passing GNNs in Larger Test Graphs | Unknown | N/A | |
| Generalized Delayed Feedback Model with Post-Click Information in Recommender Systems | Unknown | N/A | |
| GAR: Generalized Autoregression for Multi-Fidelity Fusion | Unknown | N/A | |
| Learning Representations via a Robust Behavioral Metric for Deep Reinforcement Learning | Unknown | N/A | |
| Environment Diversification with Multi-head Neural Network for Invariant Learning | Unknown | N/A | |
| MetaTeacher: Coordinating Multi-Model Domain Adaptation for Medical Image Classification | Unknown | N/A | |
| Collaborative Learning by Detecting Collaboration Partners | Unknown | N/A | |
| DetCLIP: Dictionary-Enriched Visual-Concept Paralleled Pre-training for Open-world Detection | Unknown | N/A | |
| Set-based Meta-Interpolation for Few-Task Meta-Learning | Unknown | N/A | |
| Neural Collapse with Normalized Features: A Geometric Analysis over the Riemannian Manifold | Unknown | N/A | |
| Error Analysis of Tensor-Train Cross Approximation | Unknown | N/A | |
| Trading off Utility, Informativeness, and Complexity in Emergent Communication | Unknown | N/A | |
| Hyper-Representations as Generative Models: Sampling Unseen Neural Network Weights | Unknown | N/A | |
| Learning to Break the Loop: Analyzing and Mitigating Repetitions for Neural Text Generation | Unknown | N/A | |
| A Damped Newton Method Achieves Global $\mathcal O \left(\frac{1}{k^2}\right)$ and Local Quadratic Convergence Rate | Unknown | N/A | |
| Finding Second-Order Stationary Points in Nonconvex-Strongly-Concave Minimax Optimization | Unknown | N/A | |
| Private Set Generation with Discriminative Information | Unknown | N/A | |
| Robust Semi-Supervised Learning when Not All Classes have Labels | Unknown | N/A | |
| Bandit Theory and Thompson Sampling-Guided Directed Evolution for Sequence Optimization | Unknown | N/A | |
| "Lossless" Compression of Deep Neural Networks: A High-dimensional Neural Tangent Kernel Approach | Unknown | N/A | |
| GLIF: A Unified Gated Leaky Integrate-and-Fire Neuron for Spiking Neural Networks | Unknown | N/A | |
| Finding and Listing Front-door Adjustment Sets | Unknown | N/A | |
| Bridging the Gap from Asymmetry Tricks to Decorrelation Principles in Non-contrastive Self-supervised Learning | Unknown | N/A | |
| Logical Credal Networks | Unknown | N/A | |
| Sharp Analysis of Stochastic Optimization under Global Kurdyka-Lojasiewicz Inequality | Unknown | N/A | |
| SInGE: Sparsity via Integrated Gradients Estimation of Neuron Relevance | Unknown | N/A | |
| A Robust Phased Elimination Algorithm for Corruption-Tolerant Gaussian Process Bandits | Unknown | N/A | |
| Rethinking the compositionality of point clouds through regularization in the hyperbolic space | Unknown | N/A | |
| Identifiability of deep generative models without auxiliary information | Unknown | N/A | |
| Sub-exponential time Sum-of-Squares lower bounds for Principal Components Analysis | Unknown | N/A | |
| Robust Anytime Learning of Markov Decision Processes | Unknown | N/A | |
| COLD Decoding: Energy-based Constrained Text Generation with Langevin Dynamics | Unknown | N/A | |
| Simultaneous Missing Value Imputation and Structure Learning with Groups | Unknown | N/A | |
| Provably Efficient Model-Free Constrained RL with Linear Function Approximation | Unknown | N/A | |
| Private Estimation with Public Data | Unknown | N/A | |
| Friendly Noise against Adversarial Noise: A Powerful Defense against Data Poisoning Attack | Unknown | N/A | |
| Multi-Fidelity Best-Arm Identification | Unknown | N/A | |
| Off-Policy Evaluation with Deficient Support Using Side Information | Unknown | N/A | |
| Challenging Common Assumptions in Convex Reinforcement Learning | Unknown | N/A | |
| Decision Trees with Short Explainable Rules | Unknown | N/A | |
| List-Decodable Sparse Mean Estimation | Unknown | N/A | |
| Stochastic Adaptive Activation Function | Unknown | N/A | |
| Rethinking Knowledge Graph Evaluation Under the Open-World Assumption | Unknown | N/A | |
| A Theoretical Framework for Inference Learning | Unknown | N/A | |
| OPEN: Orthogonal Propagation with Ego-Network Modeling | Unknown | N/A | |
| On the Frequency-bias of Coordinate-MLPs | Unknown | N/A | |
| Generalization Properties of NAS under Activation and Skip Connection Search | Unknown | N/A | |
| Robustness in deep learning: The good (width), the bad (depth), and the ugly (initialization) | Unknown | N/A | |
| Extrapolation and Spectral Bias of Neural Nets with Hadamard Product: a Polynomial Net Study | Unknown | N/A | |
| A Rotated Hyperbolic Wrapped Normal Distribution for Hierarchical Representation Learning | Unknown | N/A | |
| Controllable 3D Face Synthesis with Conditional Generative Occupancy Fields | Unknown | N/A | |
| A general approximation lower bound in $L^p$ norm, with applications to feed-forward neural networks | Unknown | N/A | |
| CLOOB: Modern Hopfield Networks with InfoLOOB Outperform CLIP | Unknown | N/A | |
| Communication Efficient Distributed Learning for Kernelized Contextual Bandits | Unknown | N/A | |
| Communication Efficient Federated Learning for Generalized Linear Bandits | Unknown | N/A | |
| Versatile Multi-stage Graph Neural Network for Circuit Representation | Unknown | N/A | |
| Cost-Sensitive Self-Training for Optimizing Non-Decomposable Metrics | Unknown | N/A | |
| Understanding Square Loss in Training Overparametrized Neural Network Classifiers | Unknown | N/A | |
| The Gyro-Structure of Some Matrix Manifolds | Unknown | N/A | |
| Multi-view Subspace Clustering on Topological Manifold | Unknown | N/A | |
| HyperDomainNet: Universal Domain Adaptation for Generative Adversarial Networks | Unknown | N/A | |
| S$^3$-NeRF: Neural Reflectance Field from Shading and Shadow under a Single Viewpoint | Unknown | N/A | |
| PaCo: Parameter-Compositional Multi-task Reinforcement Learning | Unknown | N/A | |
| IALE: Imitating Active Learner Ensembles | Unknown | N/A | |
| Score-based Generative Modeling Secretly Minimizes the Wasserstein Distance | Unknown | N/A | |
| Estimating Noise Transition Matrix with Label Correlations for Noisy Multi-Label Learning | Unknown | N/A | |
| Momentum Aggregation for Private Non-convex ERM | Unknown | N/A | |
| SCL-WC: Cross-Slide Contrastive Learning for Weakly-Supervised Whole-Slide Image Classification | Unknown | N/A | |
| Differentially Private Online-to-batch for Smooth Losses | Unknown | N/A | |
| Distributionally Robust Optimization with Data Geometry | Unknown | N/A | |
| Decentralized Training of Foundation Models in Heterogeneous Environments | Unknown | N/A | |
| On the convergence of policy gradient methods to Nash equilibria in general stochastic games | Unknown | N/A | |
| Sample Complexity of Learning Heuristic Functions for Greedy-Best-First and A* Search | Unknown | N/A | |
| Rank Diminishing in Deep Neural Networks | Unknown | N/A | |
| Don't Pour Cereal into Coffee: Differentiable Temporal Logic for Temporal Action Segmentation | Unknown | N/A | |
| Lethal Dose Conjecture on Data Poisoning | Unknown | N/A | |
| Learning Substructure Invariance for Out-of-Distribution Molecular Representations | Unknown | N/A | |
| NeuPhysics: Editable Neural Geometry and Physics from Monocular Videos | Unknown | N/A | |
| Understanding the Evolution of Linear Regions in Deep Reinforcement Learning | Unknown | N/A | |
| RecursiveMix: Mixed Learning with History | Unknown | N/A | |
| DeepTOP: Deep Threshold-Optimal Policy for MDPs and RMABs | Unknown | N/A | |
| Fairness Reprogramming | Unknown | N/A | |
| S-Prompts Learning with Pre-trained Transformers: An Occam’s Razor for Domain Incremental Learning | Unknown | N/A | |
| Coded Residual Transform for Generalizable Deep Metric Learning | Unknown | N/A | |
| Embodied Scene-aware Human Pose Estimation | Unknown | N/A | |
| Generative Status Estimation and Information Decoupling for Image Rain Removal | Unknown | N/A | |
| Subsidiary Prototype Alignment for Universal Domain Adaptation | Unknown | N/A | |
| Align then Fusion: Generalized Large-scale Multi-view Clustering with Anchor Matching Correspondences | Unknown | N/A | |
| DOMINO: Decomposed Mutual Information Optimization for Generalized Context in Meta-Reinforcement Learning | Unknown | N/A | |
| EcoFormer: Energy-Saving Attention with Linear Complexity | Unknown | N/A | |
| Machine Learning on Graphs: A Model and Comprehensive Taxonomy | Unknown | N/A | |
| DAGMA: Learning DAGs via M-matrices and a Log-Determinant Acyclicity Characterization | Unknown | N/A | |
| Self-Supervised Visual Representation Learning with Semantic Grouping | Unknown | N/A | |
| Mind the Gap: Understanding the Modality Gap in Multi-modal Contrastive Representation Learning | Unknown | N/A | |
| Practical Adversarial Attacks on Spatiotemporal Traffic Forecasting Models | Unknown | N/A | |
| Active Labeling: Streaming Stochastic Gradients | Unknown | N/A | |
| SegNeXt: Rethinking Convolutional Attention Design for Semantic Segmentation | Unknown | N/A | |
| Polynomial Neural Fields for Subband Decomposition and Manipulation | Unknown | N/A | |
| Visual Concepts Tokenization | Unknown | N/A | |
| Phase Transition from Clean Training to Adversarial Training | Unknown | N/A | |
| HSurf-Net: Normal Estimation for 3D Point Clouds by Learning Hyper Surfaces | Unknown | N/A | |
| Adversarial Style Augmentation for Domain Generalized Urban-Scene Segmentation | Unknown | N/A | |
| Natural Color Fool: Towards Boosting Black-box Unrestricted Attacks | Unknown | N/A | |
| Cross-Image Context for Single Image Inpainting | Unknown | N/A | |
| TOIST: Task Oriented Instance Segmentation Transformer with Noun-Pronoun Distillation | Unknown | N/A | |
| Is Out-of-Distribution Detection Learnable? | Unknown | N/A | |
| Masked Autoencoders As Spatiotemporal Learners | Unknown | N/A | |
| Trap and Replace: Defending Backdoor Attacks by Trapping Them into an Easy-to-Replace Subnetwork | Unknown | N/A | |
| PolarMix: A General Data Augmentation Technique for LiDAR Point Clouds | Unknown | N/A | |
| Prototypical VoteNet for Few-Shot 3D Point Cloud Object Detection | Unknown | N/A | |
| Optimistic Tree Searches for Combinatorial Black-Box Optimization | Unknown | N/A | |
| Tensor Wheel Decomposition and Its Tensor Completion Application | Unknown | N/A | |
| PALBERT: Teaching ALBERT to Ponder | Unknown | N/A | |
| Towards Efficient 3D Object Detection with Knowledge Distillation | Unknown | N/A | |
| Towards Lightweight Black-Box Attack Against Deep Neural Networks | Unknown | N/A | |
| HumanLiker: A Human-like Object Detector to Model the Manual Labeling Process | Unknown | N/A | |
| Learn what matters: cross-domain imitation learning with task-relevant embeddings | Unknown | N/A | |
| Whitening Convergence Rate of Coupling-based Normalizing Flows | Unknown | N/A | |
| Hierarchical Normalization for Robust Monocular Depth Estimation | Unknown | N/A | |
| Unsupervised Multi-Object Segmentation by Predicting Probable Motion Patterns | Unknown | N/A | |
| On the Strong Correlation Between Model Invariance and Generalization | Unknown | N/A | |
| Q-ViT: Accurate and Fully Quantized Low-bit Vision Transformer | Unknown | N/A | |
| Fully Sparse 3D Object Detection | Unknown | N/A | |
| Learning Multi-resolution Functional Maps with Spectral Attention for Robust Shape Matching | Unknown | N/A | |
| A Coupled Design of Exploiting Record Similarity for Practical Vertical Federated Learning | Unknown | N/A | |
| Towards Robust Blind Face Restoration with Codebook Lookup Transformer | Unknown | N/A | |
| Improved Fine-Tuning by Better Leveraging Pre-Training Data | Unknown | N/A | |
| TotalSelfScan: Learning Full-body Avatars from Self-Portrait Videos of Faces, Hands, and Bodies | Unknown | N/A | |
| Cross Aggregation Transformer for Image Restoration | Unknown | N/A | |
| Behavior Transformers: Cloning $k$ modes with one stone | Unknown | N/A | |
| What Makes a "Good" Data Augmentation in Knowledge Distillation - A Statistical Perspective | Unknown | N/A | |
| Bridging the Gap between Object and Image-level Representations for Open-Vocabulary Detection | Unknown | N/A | |
| Discrete-Convex-Analysis-Based Framework for Warm-Starting Algorithms with Predictions | Unknown | N/A | |
| Divert More Attention to Vision-Language Tracking | Unknown | N/A | |
| Trajectory Inference via Mean-field Langevin in Path Space | Unknown | N/A | |
| ElasticMVS: Learning elastic part representation for self-supervised multi-view stereopsis | Unknown | N/A | |
| A2: Efficient Automated Attacker for Boosting Adversarial Training | Unknown | N/A | |
| PerfectDou: Dominating DouDizhu with Perfect Information Distillation | Unknown | N/A | |
| MsSVT: Mixed-scale Sparse Voxel Transformer for 3D Object Detection on Point Clouds | Unknown | N/A | |
| Towards Versatile Embodied Navigation | Unknown | N/A | |
| Product Ranking for Revenue Maximization with Multiple Purchases | Unknown | N/A | |
| Remember the Past: Distilling Datasets into Addressable Memories for Neural Networks | Unknown | N/A | |
| ResT V2: Simpler, Faster and Stronger | Unknown | N/A | |
| In the Eye of the Beholder: Robust Prediction with Causal User Modeling | Unknown | N/A | |
| Bi-directional Weakly Supervised Knowledge Distillation for Whole Slide Image Classification | Unknown | N/A | |
| Multi-modal Grouping Network for Weakly-Supervised Audio-Visual Video Parsing | Unknown | N/A | |
| Mining Unseen Classes via Regional Objectness: A Simple Baseline for Incremental Segmentation | Unknown | N/A | |
| Panchromatic and Multispectral Image Fusion via Alternating Reverse Filtering Network | Unknown | N/A | |
| Pay attention to your loss: understanding misconceptions about Lipschitz neural networks | Unknown | N/A | |
| End-to-end Symbolic Regression with Transformers | Unknown | N/A | |
| SPoVT: Semantic-Prototype Variational Transformer for Dense Point Cloud Semantic Completion | Unknown | N/A | |
| Unsupervised Representation Learning from Pre-trained Diffusion Probabilistic Models | Unknown | N/A | |
| What I Cannot Predict, I Do Not Understand: A Human-Centered Evaluation Framework for Explainability Methods | Unknown | N/A | |
| Stochastic Window Transformer for Image Restoration | Unknown | N/A | |
| A Closer Look at Weakly-Supervised Audio-Visual Source Localization | Unknown | N/A | |
| Semi-Discrete Normalizing Flows through Differentiable Tessellation | Unknown | N/A | |
| Blackbox Attacks via Surrogate Ensemble Search | Unknown | N/A | |
| Saliency-Aware Neural Architecture Search | Unknown | N/A | |
| ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation | Unknown | N/A | |
| Learning Best Combination for Efficient N:M Sparsity | Unknown | N/A | |
| Predicting Label Distribution from Multi-label Ranking | Unknown | N/A | |
| Instance-Based Uncertainty Estimation for Gradient-Boosted Regression Trees | Unknown | N/A | |
| Semantic Diffusion Network for Semantic Segmentation | Unknown | N/A | |
| Regret Bounds for Information-Directed Reinforcement Learning | Unknown | N/A | |
| A Spectral Approach to Item Response Theory | Unknown | N/A | |
| UDC: Unified DNAS for Compressible TinyML Models for Neural Processing Units | Unknown | N/A | |
| AutoLink: Self-supervised Learning of Human Skeletons and Object Outlines by Linking Keypoints | Unknown | N/A | |
| Optimistic Mirror Descent Either Converges to Nash or to Strong Coarse Correlated Equilibria in Bimatrix Games | Unknown | N/A | |
| Parameter-Efficient Masking Networks | Unknown | N/A | |
| Learning Distinct and Representative Modes for Image Captioning | Unknown | N/A | |
| Fully Convolutional One-Stage 3D Object Detection on LiDAR Range Images | Unknown | N/A | |
| HUMANISE: Language-conditioned Human Motion Generation in 3D Scenes | Unknown | N/A | |
| VCT: A Video Compression Transformer | Unknown | N/A | |
| Non-stationary Transformers: Exploring the Stationarity in Time Series Forecasting | Unknown | N/A | |
| VITA: Video Instance Segmentation via Object Token Association | Unknown | N/A | |
| A Unified Analysis of Mixed Sample Data Augmentation: A Loss Function Perspective | Unknown | N/A | |
| Geometry-aware Two-scale PIFu Representation for Human Reconstruction | Unknown | N/A | |
| Causally motivated multi-shortcut identification and removal | Unknown | N/A | |
| SegViT: Semantic Segmentation with Plain Vision Transformers | Unknown | N/A | |
| Inducing Neural Collapse in Imbalanced Learning: Do We Really Need a Learnable Classifier at the End of Deep Neural Network? | Unknown | N/A | |
| Masked Autoencoders that Listen | Unknown | N/A | |
| Semi-Supervised Semantic Segmentation via Gentle Teaching Assistant | Unknown | N/A | |
| Video-based Human-Object Interaction Detection from Tubelet Tokens | Unknown | N/A | |
| Learning Equivariant Segmentation with Instance-Unique Querying | Unknown | N/A | |
| Enhanced Latent Space Blind Model for Real Image Denoising via Alternative Optimization | Unknown | N/A | |
| High-dimensional Additive Gaussian Processes under Monotonicity Constraints | Unknown | N/A | |
| Learning Generalizable Part-based Feature Representation for 3D Point Clouds | Unknown | N/A | |
| Constants of motion network | Unknown | N/A | |
| Asymptotically Unbiased Instance-wise Regularized Partial AUC Optimization: Theory and Algorithm | Unknown | N/A | |
| Rethinking Alignment in Video Super-Resolution Transformers | Unknown | N/A | |
| Robust Testing in High-Dimensional Sparse Models | Unknown | N/A | |
| INRAS: Implicit Neural Representation for Audio Scenes | Unknown | N/A | |
| BMU-MoCo: Bidirectional Momentum Update for Continual Video-Language Modeling | Unknown | N/A | |
| DropCov: A Simple yet Effective Method for Improving Deep Architectures | Unknown | N/A | |
| Antigen-Specific Antibody Design and Optimization with Diffusion-Based Generative Models for Protein Structures | Unknown | N/A | |
| Monocular Dynamic View Synthesis: A Reality Check | Unknown | N/A | |
| A Mixture Of Surprises for Unsupervised Reinforcement Learning | Unknown | N/A | |
| QueryPose: Sparse Multi-Person Pose Regression via Spatial-Aware Part-Level Query | Unknown | N/A | |
| Decoupling Knowledge from Memorization: Retrieval-augmented Prompt Learning | Unknown | N/A | |
| Misspecified Phase Retrieval with Generative Priors | Unknown | N/A | |
| Watermarking for Out-of-distribution Detection | Unknown | N/A | |
| Error Correction Code Transformer | Unknown | N/A | |
| Maximum Class Separation as Inductive Bias in One Matrix | Unknown | N/A | |
| Sequencer: Deep LSTM for Image Classification | Unknown | N/A | |
| Self-Supervised Learning via Maximum Entropy Coding | Unknown | N/A | |
| Giga-scale Kernel Matrix-Vector Multiplication on GPU | Unknown | N/A | |
| Scalable Infomin Learning | Unknown | N/A | |
| Multi-dataset Training of Transformers for Robust Action Recognition | Unknown | N/A | |
| ZARTS: On Zero-order Optimization for Neural Architecture Search | Unknown | N/A | |
| Online Training Through Time for Spiking Neural Networks | Unknown | N/A | |
| Multi-Instance Causal Representation Learning for Instance Label Prediction and Out-of-Distribution Generalization | Unknown | N/A | |
| P2P: Tuning Pre-trained Image Models for Point Cloud Analysis with Point-to-Pixel Prompting | Unknown | N/A | |
| Towards Theoretically Inspired Neural Initialization Optimization | Unknown | N/A | |
| Vision GNN: An Image is Worth Graph of Nodes | Unknown | N/A | |
| Rotation-Equivariant Conditional Spherical Neural Fields for Learning a Natural Illumination Prior | Unknown | N/A | |
| Supported Policy Optimization for Offline Reinforcement Learning | Unknown | N/A | |
| AutoMS: Automatic Model Selection for Novelty Detection with Error Rate Control | Unknown | N/A | |
| Increasing Confidence in Adversarial Robustness Evaluations | Unknown | N/A | |
| Generalization Bounds for Estimating Causal Effects of Continuous Treatments | Unknown | N/A | |
| Non-Markovian Reward Modelling from Trajectory Labels via Interpretable Multiple Instance Learning | Unknown | N/A | |
| Learning Consistency-Aware Unsigned Distance Functions Progressively from Raw Point Clouds | Unknown | N/A | |
| Why Do Artificially Generated Data Help Adversarial Robustness | Unknown | N/A | |
| Learning Infinite-Horizon Average-Reward Restless Multi-Action Bandits via Index Awareness | Unknown | N/A | |
| Theory and Approximate Solvers for Branched Optimal Transport with Multiple Sources | Unknown | N/A | |
| New Lower Bounds for Private Estimation and a Generalized Fingerprinting Lemma | Unknown | N/A | |
| PointTAD: Multi-Label Temporal Action Detection with Learnable Query Points | Unknown | N/A | |
| On the Generalizability and Predictability of Recommender Systems | Unknown | N/A | |
| Polyhistor: Parameter-Efficient Multi-Task Adaptation for Dense Vision Tasks | Unknown | N/A | |
| Generative Visual Prompt: Unifying Distributional Control of Pre-Trained Generative Models | Unknown | N/A | |
| Stability and Generalization Analysis of Gradient Methods for Shallow Neural Networks | Unknown | N/A | |
| Physically-Based Face Rendering for NIR-VIS Face Recognition | Unknown | N/A | |
| Adversarial Training with Complementary Labels: On the Benefit of Gradually Informative Attacks | Unknown | N/A | |
| Few-Shot Continual Active Learning by a Robot | Unknown | N/A | |
| MultiScan: Scalable RGBD scanning for 3D environments with articulated objects | Unknown | N/A | |
| Transformer-based Working Memory for Multiagent Reinforcement Learning with Action Parsing | Unknown | N/A | |
| Structural Kernel Search via Bayesian Optimization and Symbolical Optimal Transport | Unknown | N/A | |
| Biologically Inspired Dynamic Thresholds for Spiking Neural Networks | Unknown | N/A | |
| Don't Roll the Dice, Ask Twice: The Two-Query Distortion of Matching Problems and Beyond | Unknown | N/A | |
| A Unified Model for Multi-class Anomaly Detection | Unknown | N/A | |
| A framework for bilevel optimization that enables stochastic and global variance reduction algorithms | Unknown | N/A | |
| SAViT: Structure-Aware Vision Transformer Pruning via Collaborative Optimization | Unknown | N/A | |
| Masked Generative Adversarial Networks are Data-Efficient Generation Learners | Unknown | N/A | |
| Training Spiking Neural Networks with Event-driven Backpropagation | Unknown | N/A | |
| MCMAE: Masked Convolution Meets Masked Autoencoders | Unknown | N/A | |
| Learning Physical Dynamics with Subequivariant Graph Neural Networks | Unknown | N/A | |
| Online PAC-Bayes Learning | Unknown | N/A | |
| Implicit Warping for Animation with Image Sets | Unknown | N/A | |
| Rethinking Resolution in the Context of Efficient Video Recognition | Unknown | N/A | |
| RAMBO-RL: Robust Adversarial Model-Based Offline Reinforcement Learning | Unknown | N/A | |
| CEBaB: Estimating the Causal Effects of Real-World Concepts on NLP Model Behavior | Unknown | N/A | |
| Natural gradient enables fast sampling in spiking neural networks | Unknown | N/A | |
| MultiGuard: Provably Robust Multi-label Classification against Adversarial Examples | Unknown | N/A | |
| Efficient and Effective Multi-task Grouping via Meta Learning on Task Combinations | Unknown | N/A | |
| Robust Calibration with Multi-domain Temperature Scaling | Unknown | N/A | |
| Exploration via Planning for Information about the Optimal Trajectory | Unknown | N/A | |
| Mean Estimation in High-Dimensional Binary Markov Gaussian Mixture Models | Unknown | N/A | |
| BiT: Robustly Binarized Multi-distilled Transformer | Unknown | N/A | |
| PopArt: Efficient Sparse Regression and Experimental Design for Optimal Sparse Linear Bandits | Unknown | N/A | |
| On the Effect of Pre-training for Transformer in Different Modality on Offline Reinforcement Learning | Unknown | N/A | |
| On-Device Training Under 256KB Memory | Unknown | N/A | |
| Geo-SIC: Learning Deformable Geometric Shapes in Deep Image Classifiers | Unknown | N/A | |
| An Embarrassingly Simple Approach to Semi-Supervised Few-Shot Learning | Unknown | N/A | |
| Multi-Granularity Cross-modal Alignment for Generalized Medical Visual Representation Learning | Unknown | N/A | |
| Finite-Time Analysis of Adaptive Temporal Difference Learning with Deep Neural Networks | Unknown | N/A | |
| Neural Surface Reconstruction of Dynamic Scenes with Monocular RGB-D Camera | Unknown | N/A | |
| Mutual Information Divergence: A Unified Metric for Multimodal Generative Models | Unknown | N/A | |
| Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding | Unknown | N/A | |
| GLIPv2: Unifying Localization and Vision-Language Understanding | Unknown | N/A | |
| A Unified Diversity Measure for Multiagent Reinforcement Learning | Unknown | N/A | |
| Plan To Predict: Learning an Uncertainty-Foreseeing Model For Model-Based Reinforcement Learning | Unknown | N/A | |
| Learning to Accelerate Partial Differential Equations via Latent Global Evolution | Unknown | N/A | |
| Active Learning for Multiple Target Models | Unknown | N/A | |
| Alignment-guided Temporal Attention for Video Action Recognition | Unknown | N/A | |
| Open-Ended Reinforcement Learning with Neural Reward Functions | Unknown | N/A | |
| On Margins and Generalisation for Voting Classifiers | Unknown | N/A | |
| Contrastive Neural Ratio Estimation | Unknown | N/A | |
| Mildly Conservative Q-Learning for Offline Reinforcement Learning | Unknown | N/A | |
| Self-Supervised Image Restoration with Blurry and Noisy Pairs | Unknown | N/A | |
| Recommender Forest for Efficient Retrieval | Unknown | N/A | |
| Retrieval-Augmented Diffusion Models | Unknown | N/A | |
| PatchComplete: Learning Multi-Resolution Patch Priors for 3D Shape Completion on Unseen Categories | Unknown | N/A | |
| Generalized Laplacian Eigenmaps | Unknown | N/A | |
| SAPA: Similarity-Aware Point Affiliation for Feature Upsampling | Unknown | N/A | |
| Expediting Large-Scale Vision Transformer for Dense Prediction without Fine-tuning | Unknown | N/A | |
| Random Sharpness-Aware Minimization | Unknown | N/A | |
| Generalized One-shot Domain Adaptation of Generative Adversarial Networks | Unknown | N/A | |
| SCONE: Surface Coverage Optimization in Unknown Environments by Volumetric Integration | Unknown | N/A | |
| A Quantitative Geometric Approach to Neural-Network Smoothness | Unknown | N/A | |
| Is this the Right Neighborhood? Accurate and Query Efficient Model Agnostic Explanations | Unknown | N/A | |
| Parametrically Retargetable Decision-Makers Tend To Seek Power | Unknown | N/A | |
| Learning Individualized Treatment Rules with Many Treatments: A Supervised Clustering Approach Using Adaptive Fusion | Unknown | N/A | |
| Differentially Private Model Compression | Unknown | N/A | |
| Is a Modular Architecture Enough? | Unknown | N/A | |
| Learning General World Models in a Handful of Reward-Free Deployments | Unknown | N/A | |
| Revisiting Heterophily For Graph Neural Networks | Unknown | N/A | |
| Recipe for a General, Powerful, Scalable Graph Transformer | Unknown | N/A | |
| CEIP: Combining Explicit and Implicit Priors for Reinforcement Learning with Demonstrations | Unknown | N/A | |
| GhostNetV2: Enhance Cheap Operation with Long-Range Attention | Unknown | N/A | |
| Elucidating the Design Space of Diffusion-Based Generative Models | Unknown | N/A | |
| Intermediate Prototype Mining Transformer for Few-Shot Semantic Segmentation | Unknown | N/A | |
| Robust Models are less Over-Confident | Unknown | N/A | |
| OST: Improving Generalization of DeepFake Detection via One-Shot Test-Time Training | Unknown | N/A | |
| KSD Aggregated Goodness-of-fit Test | Unknown | N/A | |
| Distributional Reward Estimation for Effective Multi-agent Deep Reinforcement Learning | Unknown | N/A | |
| A Near-Optimal Primal-Dual Method for Off-Policy Learning in CMDP | Unknown | N/A | |
| ZIN: When and How to Learn Invariance Without Environment Partition? | Unknown | N/A | |
| Enhance the Visual Representation via Discrete Adversarial Training | Unknown | N/A | |
| Frank-Wolfe-based Algorithms for Approximating Tyler's M-estimator | Unknown | N/A | |
| Spending Thinking Time Wisely: Accelerating MCTS with Virtual Expansions | Unknown | N/A | |
| Efficient and Effective Optimal Transport-Based Biclustering | Unknown | N/A | |
| SageMix: Saliency-Guided Mixup for Point Clouds | Unknown | N/A | |
| Heatmap Distribution Matching for Human Pose Estimation | Unknown | N/A | |
| Autoregressive Search Engines: Generating Substrings as Document Identifiers | Unknown | N/A | |
| Mirror Descent with Relative Smoothness in Measure Spaces, with application to Sinkhorn and EM | Unknown | N/A | |
| Deconfounded Representation Similarity for Comparison of Neural Networks | Unknown | N/A | |
| Rethinking Lipschitz Neural Networks and Certified Robustness: A Boolean Function Perspective | Unknown | N/A | |
| Fine-Grained Analysis of Stability and Generalization for Modern Meta Learning Algorithms | Unknown | N/A | |
| Gold-standard solutions to the Schrödinger equation using deep learning: How much physics do we need? | Unknown | N/A | |
| Adv-Attribute: Inconspicuous and Transferable Adversarial Attack on Face Recognition | Unknown | N/A | |
| Out-of-Distribution Detection with An Adaptive Likelihood Ratio on Informative Hierarchical VAE | Unknown | N/A | |
| Relational Proxies: Emergent Relationships as Fine-Grained Discriminators | Unknown | N/A | |
| Sampling without Replacement Leads to Faster Rates in Finite-Sum Minimax Optimization | Unknown | N/A | |
| Unsupervised Cross-Task Generalization via Retrieval Augmentation | Unknown | N/A | |
| coVariance Neural Networks | Unknown | N/A | |
| On the inability of Gaussian process regression to optimally learn compositional functions | Unknown | N/A | |
| Distributed Methods with Compressed Communication for Solving Variational Inequalities, with Theoretical Guarantees | Unknown | N/A | |
| When to Update Your Model: Constrained Model-based Reinforcement Learning | Unknown | N/A | |
| Bayesian Optimization over Discrete and Mixed Spaces via Probabilistic Reparameterization | Unknown | N/A | |
| Constrained Langevin Algorithms with L-mixing External Random Variables | Unknown | N/A | |
| Practical Adversarial Multivalid Conformal Prediction | Unknown | N/A | |
| Biologically-Plausible Determinant Maximization Neural Networks for Blind Separation of Correlated Sources | Unknown | N/A | |
| Deep Generalized Schrödinger Bridge | Unknown | N/A | |
| Deep Generative Model for Periodic Graphs | Unknown | N/A | |
| Optimal Comparator Adaptive Online Learning with Switching Cost | Unknown | N/A | |
| Enhanced Bilevel Optimization via Bregman Distance | Unknown | N/A | |
| Learning State-Aware Visual Representations from Audible Interactions | Unknown | N/A | |
| Near-Optimal Multi-Agent Learning for Safe Coverage Control | Unknown | N/A | |
| Probabilistic Missing Value Imputation for Mixed Categorical and Ordered Data | Unknown | N/A | |
| Exploration via Elliptical Episodic Bonuses | Unknown | N/A | |
| GAUDI: A Neural Architect for Immersive 3D Scene Generation | Unknown | N/A | |
| Periodic Graph Transformers for Crystal Material Property Prediction | Unknown | N/A | |
| Parallel Tempering With a Variational Reference | Unknown | N/A | |
| On the consistent estimation of optimal Receiver Operating Characteristic (ROC) curve | Unknown | N/A | |
| NS3: Neuro-symbolic Semantic Code Search | Unknown | N/A | |
| A Deep Learning Dataloader with Shared Data Preparation | Unknown | N/A | |
| Deep Multi-Modal Structural Equations For Causal Effect Estimation With Unstructured Proxies | Unknown | N/A | |
| Improving Variational Autoencoders with Density Gap-based Regularization | Unknown | N/A | |
| Fused Orthogonal Alternating Least Squares for Tensor Clustering | Unknown | N/A | |
| Representing Spatial Trajectories as Distributions | Unknown | N/A | |
| Model-Based Offline Reinforcement Learning with Pessimism-Modulated Dynamics Belief | Unknown | N/A | |
| CLEAR: Generative Counterfactual Explanations on Graphs | Unknown | N/A | |
| Wasserstein $K$-means for clustering probability distributions | Unknown | N/A | |
| Biologically-plausible backpropagation through arbitrary timespans via local neuromodulators | Unknown | N/A | |
| Cost-efficient Gaussian tensor network embeddings for tensor-structured inputs | Unknown | N/A | |
| Hub-Pathway: Transfer Learning from A Hub of Pre-trained Models | Unknown | N/A | |
| Green Hierarchical Vision Transformer for Masked Image Modeling | Unknown | N/A | |
| Beyond the Best: Distribution Functional Estimation in Infinite-Armed Bandits | Unknown | N/A | |
| An Investigation into Whitening Loss for Self-supervised Learning | Unknown | N/A | |
| Fixed-Distance Hamiltonian Monte Carlo | Unknown | N/A | |
| SecureFedYJ: a safe feature Gaussianization protocol for Federated Learning | Unknown | N/A | |
| Category-Level 6D Object Pose Estimation in the Wild: A Semi-Supervised Learning Approach and A New Dataset | Unknown | N/A | |
| Deep Attentive Belief Propagation: Integrating Reasoning and Learning for Solving Constraint Optimization Problems | Unknown | N/A | |
| Amortized Mixing Coupling Processes for Clustering | Unknown | N/A | |
| HorNet: Efficient High-Order Spatial Interactions with Recursive Gated Convolutions | Unknown | N/A | |
| Weakly supervised causal representation learning | Unknown | N/A | |
| Less-forgetting Multi-lingual Fine-tuning | Unknown | N/A | |
| Online Convex Optimization with Hard Constraints: Towards the Best of Two Worlds and Beyond | Unknown | N/A | |
| Rethinking Variational Inference for Probabilistic Programs with Stochastic Support | Unknown | N/A | |
| Retrieve, Reason, and Refine: Generating Accurate and Faithful Patient Instructions | Unknown | N/A | |
| Cross-modal Learning for Image-Guided Point Cloud Shape Completion | Unknown | N/A | |
| TCT: Convexifying Federated Learning using Bootstrapped Neural Tangent Kernels | Unknown | N/A | |
| FNeVR: Neural Volume Rendering for Face Animation | Unknown | N/A | |
| Bessel Equivariant Networks for Inversion of Transmission Effects in Multi-Mode Optical Fibres | Unknown | N/A | |
| Bidirectional Learning for Offline Infinite-width Model-based Optimization | Unknown | N/A | |
| TREC: Transient Redundancy Elimination-based Convolution | Unknown | N/A | |
| DivBO: Diversity-aware CASH for Ensemble Learning | Unknown | N/A | |
| Forecasting Human Trajectory from Scene History | Unknown | N/A | |
| Wasserstein Logistic Regression with Mixed Features | Unknown | N/A | |
| Contextual Bandits with Knapsacks for a Conversion Model | Unknown | N/A | |
| Diagnosing failures of fairness transfer across distribution shift in real-world medical settings | Unknown | N/A | |
| Adaptation Accelerating Sampling-based Bayesian Inference in Attractor Neural Networks | Unknown | N/A | |
| ELASTIC: Numerical Reasoning with Adaptive Symbolic Compiler | Unknown | N/A | |
| Oscillatory Tracking of Continuous Attractor Neural Networks Account for Phase Precession and Procession of Hippocampal Place Cells | Unknown | N/A | |
| UQGAN: A Unified Model for Uncertainty Quantification of Deep Classifiers trained via Conditional GANs | Unknown | N/A | |
| Make Sharpness-Aware Minimization Stronger: A Sparsified Perturbation Approach | Unknown | N/A | |
| Contrastive Learning as Goal-Conditioned Reinforcement Learning | Unknown | N/A | |
| Learning Viewpoint-Agnostic Visual Representations by Recovering Tokens in 3D Space | Unknown | N/A | |
| When are Local Queries Useful for Robust Learning? | Unknown | N/A | |
| Shield Decentralization for Safe Multi-Agent Reinforcement Learning | Unknown | N/A | |
| Extracting computational mechanisms from neural data using low-rank RNNs | Unknown | N/A | |
| Data Distributional Properties Drive Emergent In-Context Learning in Transformers | Unknown | N/A | |
| A Quadrature Rule combining Control Variates and Adaptive Importance Sampling | Unknown | N/A | |
| Dynamic Fair Division with Partial Information | Unknown | N/A | |
| Improved Imaging by Invex Regularizers with Global Optima Guarantees | Unknown | N/A | |
| Markov Chain Score Ascent: A Unifying Framework of Variational Inference with Markovian Gradients | Unknown | N/A | |
| Change-point Detection for Sparse and Dense Functional Data in General Dimensions | Unknown | N/A | |
| Sample-Efficient Learning of Correlated Equilibria in Extensive-Form Games | Unknown | N/A | |
| Local Spatiotemporal Representation Learning for Longitudinally-consistent Neuroimage Analysis | Unknown | N/A | |
| Parameter tuning and model selection in Optimal Transport with semi-dual Brenier formulation | Unknown | N/A | |
| Policy Optimization with Advantage Regularization for Long-Term Fairness in Decision Systems | Unknown | N/A | |
| Generating multivariate time series with COmmon Source CoordInated GAN (COSCI-GAN) | Unknown | N/A | |
| Unsupervised Multi-View Object Segmentation Using Radiance Field Propagation | Unknown | N/A | |
| Online Deep Equilibrium Learning for Regularization by Denoising | Unknown | N/A | |
| When does return-conditioned supervised learning work for offline reinforcement learning? | Unknown | N/A | |
| Inductive Logical Query Answering in Knowledge Graphs | Unknown | N/A | |
| The Unreliability of Explanations in Few-shot Prompting for Textual Reasoning | Unknown | N/A | |
| Biological Learning of Irreducible Representations of Commuting Transformations | Unknown | N/A | |
| The price of ignorance: how much does it cost to forget noise structure in low-rank matrix estimation? | Unknown | N/A | |
| MCVD - Masked Conditional Video Diffusion for Prediction, Generation, and Interpolation | Unknown | N/A | |
| Bayesian Clustering of Neural Spiking Activity Using a Mixture of Dynamic Poisson Factor Analyzers | Unknown | N/A | |
| Non-convex online learning via algorithmic equivalence | Unknown | N/A | |
| Decomposing NeRF for Editing via Feature Field Distillation | Unknown | N/A | |
| Approximate Value Equivalence | Unknown | N/A | |
| Neur2SP: Neural Two-Stage Stochastic Programming | Unknown | N/A | |
| Memorization Without Overfitting: Analyzing the Training Dynamics of Large Language Models | Unknown | N/A | |
| SparCL: Sparse Continual Learning on the Edge | Unknown | N/A | |
| Visual Prompting via Image Inpainting | Unknown | N/A | |
| Test-Time Training with Masked Autoencoders | Unknown | N/A | |
| Visual Clues: Bridging Vision and Language Foundations for Image Paragraph Captioning | Unknown | N/A | |
| Gradient-Free Methods for Deterministic and Stochastic Nonsmooth Nonconvex Optimization | Unknown | N/A | |
| BILCO: An Efficient Algorithm for Joint Alignment of Time Series | Unknown | N/A | |
| Hardness of Noise-Free Learning for Two-Hidden-Layer Neural Networks | Unknown | N/A | |
| Falsification before Extrapolation in Causal Effect Estimation | Unknown | N/A | |
| LION: Latent Point Diffusion Models for 3D Shape Generation | Unknown | N/A | |
| FedRolex: Model-Heterogeneous Federated Learning with Rolling Sub-Model Extraction | Unknown | N/A | |
| Implicit Regularization or Implicit Conditioning? Exact Risk Trajectories of SGD in High Dimensions | Unknown | N/A | |
| Sharpness-Aware Training for Free | Unknown | N/A | |
| CHIMLE: Conditional Hierarchical IMLE for Multimodal Conditional Image Synthesis | Unknown | N/A | |
| 3DILG: Irregular Latent Grids for 3D Generative Modeling | Unknown | N/A | |
| Translation-equivariant Representation in Recurrent Networks with a Continuous Manifold of Attractors | Unknown | N/A | |
| Optimal Transport-based Identity Matching for Identity-invariant Facial Expression Recognition | Unknown | N/A | |
| Towards Learning Universal Hyperparameter Optimizers with Transformers | Unknown | N/A | |
| OrdinalCLIP: Learning Rank Prompts for Language-Guided Ordinal Regression | Unknown | N/A | |
| ComGAN: Unsupervised Disentanglement and Segmentation via Image Composition | Unknown | N/A | |
| Non-Linear Coordination Graphs | Unknown | N/A | |
| Towards Hard-pose Virtual Try-on via 3D-aware Global Correspondence Learning | Unknown | N/A | |
| Fast Distance Oracles for Any Symmetric Norm | Unknown | N/A | |
| Low-rank Optimal Transport: Approximation, Statistics and Debiasing | Unknown | N/A | |
| Iterative Scene Graph Generation | Unknown | N/A | |
| Eliciting Thinking Hierarchy without a Prior | Unknown | N/A | |
| Learning Robust Rule Representations for Abstract Reasoning via Internal Inferences | Unknown | N/A | |
| Multi-layer State Evolution Under Random Convolutional Design | Unknown | N/A | |
| Latency-aware Spatial-wise Dynamic Networks | Unknown | N/A | |
| Margin-Based Few-Shot Class-Incremental Learning with Class-Level Overfitting Mitigation | Unknown | N/A | |
| Relation-Constrained Decoding for Text Generation | Unknown | N/A | |
| Searching for Better Spatio-temporal Alignment in Few-Shot Action Recognition | Unknown | N/A | |
| Could Giant Pre-trained Image Models Extract Universal Representations? | Unknown | N/A | |
| IM-Loss: Information Maximization Loss for Spiking Neural Networks | Unknown | N/A | |
| TokenMixup: Efficient Attention-guided Token-level Data Augmentation for Transformers | Unknown | N/A | |
| Hyperbolic Feature Augmentation via Distribution Estimation and Infinite Sampling on Manifolds | Unknown | N/A | |
| Verification and search algorithms for causal DAGs | Unknown | N/A | |
| AD-DROP: Attribution-Driven Dropout for Robust Language Model Fine-Tuning | Unknown | N/A | |
| Where to Pay Attention in Sparse Training for Feature Selection? | Unknown | N/A | |
| TA-MoE: Topology-Aware Large Scale Mixture-of-Expert Training | Unknown | N/A | |
| Understanding the Failure of Batch Normalization for Transformers in NLP | Unknown | N/A | |
| Transformers meet Stochastic Block Models: Attention with Data-Adaptive Sparsity and Cost | Unknown | N/A | |
| Theoretically Provable Spiking Neural Networks | Unknown | N/A | |
| Deep Combinatorial Aggregation | Unknown | N/A | |
| Transcormer: Transformer for Sentence Scoring with Sliding Language Modeling | Unknown | N/A | |
| Self-Supervised Learning with an Information Maximization Criterion | Unknown | N/A | |
| Improved Utility Analysis of Private CountSketch | Unknown | N/A | |
| A Classification of $G$-invariant Shallow Neural Networks | Unknown | N/A | |
| Module-Aware Optimization for Auxiliary Learning | Unknown | N/A | |
| Incorporating Bias-aware Margins into Contrastive Loss for Collaborative Filtering | Unknown | N/A | |
| Riemannian Score-Based Generative Modelling | Unknown | N/A | |
| Out-of-Distribution Detection via Conditional Kernel Independence Model | Unknown | N/A | |
| Towards Effective Multi-Modal Interchanges in Zero-Resource Sounding Object Localization | Unknown | N/A | |
| Policy Gradient With Serial Markov Chain Reasoning | Unknown | N/A | |
| Estimating graphical models for count data with applications to single-cell gene network | Unknown | N/A | |
| Improving 3D-aware Image Synthesis with A Geometry-aware Discriminator | Unknown | N/A | |
| Egocentric Video-Language Pretraining | Unknown | N/A | |
| Efficient Submodular Optimization under Noise: Local Search is Robust | Unknown | N/A | |
| Conservative Dual Policy Optimization for Efficient Model-Based Reinforcement Learning | Unknown | N/A | |
| DualCoOp: Fast Adaptation to Multi-Label Recognition with Limited Annotations | Unknown | N/A | |
| Does Momentum Change the Implicit Regularization on Separable Data? | Unknown | N/A | |
| VRL3: A Data-Driven Framework for Visual Deep Reinforcement Learning | Unknown | N/A | |
| Exact Solutions of a Deep Linear Network | Unknown | N/A | |
| ST-Adapter: Parameter-Efficient Image-to-Video Transfer Learning | Unknown | N/A | |
| Masked Prediction: A Parameter Identifiability View | Unknown | N/A | |
| Direct Advantage Estimation | Unknown | N/A | |
| Depth is More Powerful than Width with Prediction Concatenation in Deep Forest | Unknown | N/A | |
| AniFaceGAN: Animatable 3D-Aware Face Image Generation for Video Avatars | Unknown | N/A | |
| Instance-based Learning for Knowledge Base Completion | Unknown | N/A | |
| Efficient and Effective Augmentation Strategy for Adversarial Training | Unknown | N/A | |
| u-HuBERT: Unified Mixed-Modal Speech Pretraining And Zero-Shot Transfer to Unlabeled Modality | Unknown | N/A | |
| First-Order Algorithms for Min-Max Optimization in Geodesic Metric Spaces | Unknown | N/A | |
| GENIE: Higher-Order Denoising Diffusion Solvers | Unknown | N/A | |
| Scalable and Efficient Training of Large Convolutional Neural Networks with Differential Privacy | Unknown | N/A | |
| Structured Recognition for Generative Models with Explaining Away | Unknown | N/A | |
| UniCLIP: Unified Framework for Contrastive Language-Image Pre-training | Unknown | N/A | |
| InsNet: An Efficient, Flexible, and Performant Insertion-based Text Generation Model | Unknown | N/A | |
| Local-Global MCMC kernels: the best of both worlds | Unknown | N/A | |
| Manifold Interpolating Optimal-Transport Flows for Trajectory Inference | Unknown | N/A | |
| Doubly Robust Counterfactual Classification | Unknown | N/A | |
| Uncertainty-Aware Reinforcement Learning for Risk-Sensitive Player Evaluation in Sports Game | Unknown | N/A | |
| Smooth Fictitious Play in Stochastic Games with Perturbed Payoffs and Unknown Transitions | Unknown | N/A | |
| SKFlow: Learning Optical Flow with Super Kernels | Unknown | N/A | |
| Non-stationary Bandits with Knapsacks | Unknown | N/A | |
| Weighted Mutual Learning with Diversity-Driven Model Compression | Unknown | N/A | |
| Learning to Attack Federated Learning: A Model-based Reinforcement Learning Attack Framework | Unknown | N/A | |
| Improving Zero-Shot Generalization in Offline Reinforcement Learning using Generalized Similarity Functions | Unknown | N/A | |
| Procedural Image Programs for Representation Learning | Unknown | N/A | |
| Bivariate Causal Discovery for Categorical Data via Classification with Optimal Label Permutation | Unknown | N/A | |
| High-Order Pooling for Graph Neural Networks with Tensor Decomposition | Unknown | N/A | |
| TTOpt: A Maximum Volume Quantized Tensor Train-based Optimization and its Application to Reinforcement Learning | Unknown | N/A | |
| SALSA: Attacking Lattice Cryptography with Transformers | Unknown | N/A | |
| Class-Aware Adversarial Transformers for Medical Image Segmentation | Unknown | N/A | |
| A Single-timescale Analysis for Stochastic Approximation with Multiple Coupled Sequences | Unknown | N/A | |
| You Only Live Once: Single-Life Reinforcement Learning | Unknown | N/A | |
| Semi-Supervised Learning with Decision Trees: Graph Laplacian Tree Alternating Optimization | Unknown | N/A | |
| When does dough become a bagel? Analyzing the remaining mistakes on ImageNet | Unknown | N/A | |
| Learning from Stochastically Revealed Preference | Unknown | N/A | |
| A Best-of-Both-Worlds Algorithm for Bandits with Delayed Feedback | Unknown | N/A | |
| Online Minimax Multiobjective Optimization: Multicalibeating and Other Applications | Unknown | N/A | |
| Algorithms that Approximate Data Removal: New Results and Limitations | Unknown | N/A | |
| Annihilation of Spurious Minima in Two-Layer ReLU Networks | Unknown | N/A | |
| Unsupervised Image-to-Image Translation with Density Changing Regularization | Unknown | N/A | |
| Reproducibility in Optimization: Theoretical Framework and Limits | Unknown | N/A | |
| Learn to Explain: Multimodal Reasoning via Thought Chains for Science Question Answering | Unknown | N/A | |
| Systematic improvement of neural network quantum states using Lanczos | Unknown | N/A | |
| Privacy of Noisy Stochastic Gradient Descent: More Iterations without More Privacy Loss | Unknown | N/A | |
| Diagonal State Spaces are as Effective as Structured State Spaces | Unknown | N/A | |
| Why neural networks find simple solutions: The many regularizers of geometric complexity | Unknown | N/A | |
| Zero-Shot 3D Drug Design by Sketching and Generating | Unknown | N/A | |
| Adaptive Oracle-Efficient Online Learning | Unknown | N/A | |
| Brownian Noise Reduction: Maximizing Privacy Subject to Accuracy Constraints | Unknown | N/A | |
| Efficient Active Learning with Abstention | Unknown | N/A | |
| Unsupervised Learning of Shape Programs with Repeatable Implicit Parts | Unknown | N/A | |
| Moderate-fitting as a Natural Backdoor Defender for Pre-trained Language Models | Unknown | N/A | |
| Controllable Text Generation with Neurally-Decomposed Oracle | Unknown | N/A | |
| A Fast Post-Training Pruning Framework for Transformers | Unknown | N/A | |
| ConfounderGAN: Protecting Image Data Privacy with Causal Confounder | Unknown | N/A | |
| Improved Feature Distillation via Projector Ensemble | Unknown | N/A | |
| Neuron with Steady Response Leads to Better Generalization | Unknown | N/A | |
| Mirror Descent Maximizes Generalized Margin and Can Be Implemented Efficiently | Unknown | N/A | |
| Self-Organized Group for Cooperative Multi-agent Reinforcement Learning | Unknown | N/A | |
| APG: Adaptive Parameter Generation Network for Click-Through Rate Prediction | Unknown | N/A | |
| Learning Manifold Dimensions with Conditional Variational Autoencoders | Unknown | N/A | |
| Discovering Design Concepts for CAD Sketches | Unknown | N/A | |
| Reconstruction on Trees and Low-Degree Polynomials | Unknown | N/A | |
| Test Time Adaptation via Conjugate Pseudo-labels | Unknown | N/A | |
| Pre-Trained Image Encoder for Generalizable Visual Reinforcement Learning | Unknown | N/A | |
| GenerSpeech: Towards Style Transfer for Generalizable Out-Of-Domain Text-to-Speech | Unknown | N/A | |
| Momentum Adversarial Distillation: Handling Large Distribution Shifts in Data-Free Knowledge Distillation | Unknown | N/A | |
| FreGAN: Exploiting Frequency Components for Training GANs under Limited Data | Unknown | N/A | |
| FasterRisk: Fast and Accurate Interpretable Risk Scores | Unknown | N/A | |
| When to Trust Your Simulator: Dynamics-Aware Hybrid Offline-and-Online Reinforcement Learning | Unknown | N/A | |
| Generalization Bounds for Stochastic Gradient Descent via Localized $\varepsilon$-Covers | Unknown | N/A | |
| Symbolic Distillation for Learned TCP Congestion Control | Unknown | N/A | |
| Proximal Learning With Opponent-Learning Awareness | Unknown | N/A | |
| Accelerated Linearized Laplace Approximation for Bayesian Deep Learning | Unknown | N/A | |
| GAGA: Deciphering Age-path of Generalized Self-paced Regularizer | Unknown | N/A | |
| Provable Benefit of Multitask Representation Learning in Reinforcement Learning | Unknown | N/A | |
| Follow-the-Perturbed-Leader for Adversarial Markov Decision Processes with Bandit Feedback | Unknown | N/A | |
| Why do We Need Large Batchsizes in Contrastive Learning? A Gradient-Bias Perspective | Unknown | N/A | |
| Globally Convergent Policy Search for Output Estimation | Unknown | N/A | |
| To update or not to update? Neurons at equilibrium in deep models | Unknown | N/A | |
| Grow and Merge: A Unified Framework for Continuous Categories Discovery | Unknown | N/A | |
| OGC: Unsupervised 3D Object Segmentation from Rigid Dynamics of Point Clouds | Unknown | N/A | |
| Learning a Condensed Frame for Memory-Efficient Video Class-Incremental Learning | Unknown | N/A | |
| Factorized-FL: Personalized Federated Learning with Parameter Factorization & Similarity Matching | Unknown | N/A | |
| Autoinverse: Uncertainty Aware Inversion of Neural Networks | Unknown | N/A | |
| Bootstrapped Transformer for Offline Reinforcement Learning | Unknown | N/A | |
| Double Check Your State Before Trusting It: Confidence-Aware Bidirectional Offline Model-Based Imagination | Unknown | N/A | |
| Fair Wrapping for Black-box Predictions | Unknown | N/A | |
| GT-GAN: General Purpose Time Series Synthesis with Generative Adversarial Networks | Unknown | N/A | |
| Generic bounds on the approximation error for physics-informed (and) operator learning | Unknown | N/A | |
| Debiased, Longitudinal and Coordinated Drug Recommendation through Multi-Visit Clinic Records | Unknown | N/A | |
| Most Activation Functions Can Win the Lottery Without Excessive Depth | Unknown | N/A | |
| VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training | Unknown | N/A | |
| Truncated Matrix Power Iteration for Differentiable DAG Learning | Unknown | N/A | |
| Robust Rent Division | Unknown | N/A | |
| Temporally Disentangled Representation Learning | Unknown | N/A | |
| Improving Transformer with an Admixture of Attention Heads | Unknown | N/A | |
| Para-CFlows: $C^k$-universal diffeomorphism approximators as superior neural surrogates | Unknown | N/A | |
| TA-GATES: An Encoding Scheme for Neural Network Architectures | Unknown | N/A | |
| Gradient Methods Provably Converge to Non-Robust Networks | Unknown | N/A | |
| Convolutional Neural Networks on Graphs with Chebyshev Approximation, Revisited | Unknown | N/A | |
| Mask-based Latent Reconstruction for Reinforcement Learning | Unknown | N/A | |
| MaskPlace: Fast Chip Placement via Reinforced Visual Representation Learning | Unknown | N/A | |
| SwinTrack: A Simple and Strong Baseline for Transformer Tracking | Unknown | N/A | |
| Self-supervised Amodal Video Object Segmentation | Unknown | N/A | |
| Improving Generative Adversarial Networks via Adversarial Learning in Latent Space | Unknown | N/A | |
| EF-BV: A Unified Theory of Error Feedback and Variance Reduction Mechanisms for Biased and Unbiased Compression in Distributed Optimization | Unknown | N/A | |
| First is Better Than Last for Language Data Influence | Unknown | N/A | |
| Molecule Generation by Principal Subgraph Mining and Assembling | Unknown | N/A | |
| Conditional Independence Testing with Heteroskedastic Data and Applications to Causal Discovery | Unknown | N/A | |
| Equivariant Graph Hierarchy-Based Neural Networks | Unknown | N/A | |
| Semi-infinitely Constrained Markov Decision Processes | Unknown | N/A | |
| One Positive Label is Sufficient: Single-Positive Multi-Label Learning with Label Enhancement | Unknown | N/A | |
| Bridge the Gap Between Architecture Spaces via A Cross-Domain Predictor | Unknown | N/A | |
| Mingling Foresight with Imagination: Model-Based Cooperative Multi-Agent Reinforcement Learning | Unknown | N/A | |
| Top Two Algorithms Revisited | Unknown | N/A | |
| Revisiting Graph Contrastive Learning from the Perspective of Graph Spectrum | Unknown | N/A | |
| A Probabilistic Graph Coupling View of Dimension Reduction | Unknown | N/A | |
| Knowledge Distillation Improves Graph Structure Augmentation for Graph Neural Networks | Unknown | N/A | |
| LDSA: Learning Dynamic Subtask Assignment in Cooperative Multi-Agent Reinforcement Learning | Unknown | N/A | |
| Stimulative Training of Residual Networks: A Social Psychology Perspective of Loafing | Unknown | N/A | |
| MExMI: Pool-based Active Model Extraction Crossover Membership Inference | Unknown | N/A | |
| S3GC: Scalable Self-Supervised Graph Clustering | Unknown | N/A | |
| Parameter-free Dynamic Graph Embedding for Link Prediction | Unknown | N/A | |
| Federated Submodel Optimization for Hot and Cold Data Features | Unknown | N/A | |
| Picking on the Same Person: Does Algorithmic Monoculture lead to Outcome Homogenization? | Unknown | N/A | |
| Causality Preserving Chaotic Transformation and Classification using Neurochaos Learning | Unknown | N/A | |
| On Margin Maximization in Linear and ReLU Networks | Unknown | N/A | |
| Optimal Binary Classification Beyond Accuracy | Unknown | N/A | |
| Active Learning of Classifiers with Label and Seed Queries | Unknown | N/A | |
| AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition | Unknown | N/A | |
| FedPop: A Bayesian Approach for Personalised Federated Learning | Unknown | N/A | |
| Learning to Drop Out: An Adversarial Approach to Training Sequence VAEs | Unknown | N/A | |
| Escaping Saddle Points with Bias-Variance Reduced Local Perturbed SGD for Communication Efficient Nonconvex Distributed Learning | Unknown | N/A | |
| Effective Adaptation in Multi-Task Co-Training for Unified Autonomous Driving | Unknown | N/A | |
| Large Language Models are Zero-Shot Reasoners | Unknown | N/A | |
| Descent Steps of a Relation-Aware Energy Produce Heterogeneous Graph Neural Networks | Unknown | N/A | |
| Amplifying Membership Exposure via Data Poisoning | Unknown | N/A | |
| Robust Graph Structure Learning via Multiple Statistical Tests | Unknown | N/A | |
| Geometric Knowledge Distillation: Topology Compression for Graph Neural Networks | Unknown | N/A | |
| Learning to Constrain Policy Optimization with Virtual Trust Region | Unknown | N/A | |
| Active Surrogate Estimators: An Active Learning Approach to Label-Efficient Model Evaluation | Unknown | N/A | |
| NodeFormer: A Scalable Graph Structure Learning Transformer for Node Classification | Unknown | N/A | |
| Efficient Architecture Search for Diverse Tasks | Unknown | N/A | |
| GRASP: Navigating Retrosynthetic Planning with Goal-driven Policy | Unknown | N/A | |
| Multi-Agent Reinforcement Learning is a Sequence Modeling Problem | Unknown | N/A | |
| Distributed Learning of Conditional Quantiles in the Reproducing Kernel Hilbert Space | Unknown | N/A | |
| The Policy-gradient Placement and Generative Routing Neural Networks for Chip Design | Unknown | N/A | |
| Pruning Neural Networks via Coresets and Convex Geometry: Towards No Assumptions | Unknown | N/A | |
| Revisiting Injective Attacks on Recommender Systems | Unknown | N/A | |
| On the Convergence of Stochastic Multi-Objective Gradient Manipulation and Beyond | Unknown | N/A | |
| Learning to Generate Inversion-Resistant Model Explanations | Unknown | N/A | |
| Semi-Supervised Generative Models for Multiagent Trajectories | Unknown | N/A | |
| Unknown-Aware Domain Adversarial Learning for Open-Set Domain Adaptation | Unknown | N/A | |
| Distributionally Robust Optimization via Ball Oracle Acceleration | Unknown | N/A | |
| DeepMed: Semiparametric Causal Mediation Analysis with Debiased Deep Learning | Unknown | N/A | |
| Domain Adaptation under Open Set Label Shift | Unknown | N/A | |
| Generalization Bounds with Minimal Dependency on Hypothesis Class via Distributionally Robust Optimization | Unknown | N/A | |
| NeMF: Neural Motion Fields for Kinematic Animation | Unknown | N/A | |
| On Robust Multiclass Learnability | Unknown | N/A | |
| Moment Distributionally Robust Tree Structured Prediction | Unknown | N/A | |
| Alleviating "Posterior Collapse" in Deep Topic Models via Policy Gradient | Unknown | N/A | |
| Grounding Aleatoric Uncertainty for Unsupervised Environment Design | Unknown | N/A | |
| Conditional Meta-Learning of Linear Representations | Unknown | N/A | |
| AZ-whiteness test: a test for signal uncorrelation on spatio-temporal graphs | Unknown | N/A | |
| Sample-Then-Optimize Batch Neural Thompson Sampling | Unknown | N/A | |
| Efficient Adversarial Training without Attacking: Worst-Case-Aware Robust Reinforcement Learning | Unknown | N/A | |
| Fast Bayesian Inference with Batch Bayesian Quadrature via Kernel Recombination | Unknown | N/A | |
| EGSDE: Unpaired Image-to-Image Translation via Energy-Guided Stochastic Differential Equations | Unknown | N/A | |
| Blessing of Depth in Linear Regression: Deeper Models Have Flatter Landscape Around the True Solution | Unknown | N/A | |
| PKD: General Distillation Framework for Object Detectors via Pearson Correlation Coefficient | Unknown | N/A | |
| Star Temporal Classification: Sequence Modeling with Partially Labeled Data | Unknown | N/A | |
| Neural Stochastic Control | Unknown | N/A | |
| Smoothed Online Convex Optimization Based on Discounted-Normal-Predictor | Unknown | N/A | |
| Adaptive Sampling for Discovery | Unknown | N/A | |
| Diverse Weight Averaging for Out-of-Distribution Generalization | Unknown | N/A | |
| Counterfactual Temporal Point Processes | Unknown | N/A | |
| Sparse Winning Tickets are Data-Efficient Image Recognizers | Unknown | N/A | |
| Improved Regret Analysis for Variance-Adaptive Linear Bandits and Horizon-Free Linear Mixture MDPs | Unknown | N/A | |
| Reduction Algorithms for Persistence Diagrams of Networks: CoralTDA and PrunIT | Unknown | N/A | |
| Approximation with CNNs in Sobolev Space: with Applications to Classification | Unknown | N/A | |
| Tracking Functional Changes in Nonstationary Signals with Evolutionary Ensemble Bayesian Model for Robust Neural Decoding | Unknown | N/A | |
| A Unified Convergence Theorem for Stochastic Optimization Methods | Unknown | N/A | |
| On Embeddings for Numerical Features in Tabular Deep Learning | Unknown | N/A | |
| Near-Optimal Collaborative Learning in Bandits | Unknown | N/A | |
| Increasing the Scope as You Learn: Adaptive Bayesian Optimization in Nested Subspaces | Unknown | N/A | |
| Iron: Private Inference on Transformers | Unknown | N/A | |
| Towards Disentangling Information Paths with Coded ResNeXt | Unknown | N/A | |
| Adaptive Multi-stage Density Ratio Estimation for Learning Latent Space Energy-based Model | Unknown | N/A | |
| Flamingo: a Visual Language Model for Few-Shot Learning | Unknown | N/A | |
| Learning to Re-weight Examples with Optimal Transport for Imbalanced Classification | Unknown | N/A | |
| ZooD: Exploiting Model Zoo for Out-of-Distribution Generalization | Unknown | N/A | |
| Torsional Diffusion for Molecular Conformer Generation | Unknown | N/A | |
| Beyond Time-Average Convergence: Near-Optimal Uncoupled Online Learning via Clairvoyant Multiplicative Weights Update | Unknown | N/A | |
| Approximate Euclidean lengths and distances beyond Johnson-Lindenstrauss | Unknown | N/A | |
| A consistently adaptive trust-region method | Unknown | N/A | |
| Order-Invariant Cardinality Estimators Are Differentially Private | Unknown | N/A | |
| Spectral Bias in Practice: The Role of Function Frequency in Generalization | Unknown | N/A | |
| Task-level Differentially Private Meta Learning | Unknown | N/A | |
| Distributed Inverse Constrained Reinforcement Learning for Multi-agent Systems | Unknown | N/A | |
| WaveBound: Dynamic Error Bounds for Stable Time Series Forecasting | Unknown | N/A | |
| Self-Supervised Learning of Brain Dynamics from Broad Neuroimaging Data | Unknown | N/A | |
| Gradient flow dynamics of shallow ReLU networks for square loss and orthogonal inputs | Unknown | N/A | |
| Ensemble of Averages: Improving Model Selection and Boosting Performance in Domain Generalization | Unknown | N/A | |
| Log-Polar Space Convolution Layers | Unknown | N/A | |
| Efficient Training of Low-Curvature Neural Networks | Unknown | N/A | |
| Self-Supervised Fair Representation Learning without Demographics | Unknown | N/A | |
| Nonlinear MCMC for Bayesian Machine Learning | Unknown | N/A | |
| Scale-invariant Learning by Physics Inversion | Unknown | N/A | |
| On Non-Linear operators for Geometric Deep Learning | Unknown | N/A | |
| A Geometric Perspective on Variational Autoencoders | Unknown | N/A | |
| Contrastive Graph Structure Learning via Information Bottleneck for Recommendation | Unknown | N/A | |
| Iterative Structural Inference of Directed Graphs | Unknown | N/A | |
| PDSketch: Integrated Domain Programming, Learning, and Planning | Unknown | N/A | |
| Off-Policy Evaluation with Policy-Dependent Optimization Response | Unknown | N/A | |
| Interpolation and Regularization for Causal Learning | Unknown | N/A | |
| Confidence-based Reliable Learning under Dual Noises | Unknown | N/A | |
| Dynamic Inverse Reinforcement Learning for Characterizing Animal Behavior | Unknown | N/A | |
| DReS-FL: Dropout-Resilient Secure Federated Learning for Non-IID Clients via Secret Data Sharing | Unknown | N/A | |
| Black-Box Generalization: Stability of Zeroth-Order Learning | Unknown | N/A | |
| Label-Aware Global Consistency for Multi-Label Learning with Single Positive Labels | Unknown | N/A | |
| Emergent Communication: Generalization and Overfitting in Lewis Games | Unknown | N/A | |
| Latent Planning via Expansive Tree Search | Unknown | N/A | |
| Near-Optimal Regret for Adversarial MDP with Delayed Bandit Feedback | Unknown | N/A | |
| RTFormer: Efficient Design for Real-Time Semantic Segmentation with Transformer | Unknown | N/A | |
| The Phenomenon of Policy Churn | Unknown | N/A | |
| Optimal-er Auctions through Attention | Unknown | N/A | |
| Sampling with Riemannian Hamiltonian Monte Carlo in a Constrained Space | Unknown | N/A | |
| Defending Against Adversarial Attacks via Neural Dynamic System | Unknown | N/A | |
| Association Graph Learning for Multi-Task Classification with Category Shifts | Unknown | N/A | |
| Weakly Supervised Representation Learning with Sparse Perturbations | Unknown | N/A | |
| Chain-of-Thought Prompting Elicits Reasoning in Large Language Models | Unknown | N/A | |
| Learning Superpoint Graph Cut for 3D Instance Segmentation | Unknown | N/A | |
| First Contact: Unsupervised Human-Machine Co-Adaptation via Mutual Information Maximization | Unknown | N/A | |
| CyCLIP: Cyclic Contrastive Language-Image Pretraining | Unknown | N/A | |
| Amortized Inference for Heterogeneous Reconstruction in Cryo-EM | Unknown | N/A | |
| Neural Stochastic PDEs: Resolution-Invariant Learning of Continuous Spatiotemporal Dynamics | Unknown | N/A | |
| Sobolev Acceleration and Statistical Optimality for Learning Elliptic Equations via Gradient Descent | Unknown | N/A | |
| Listen to Interpret: Post-hoc Interpretability for Audio Networks with NMF | Unknown | N/A | |
| An $\alpha$-No-Regret Algorithm For Graphical Bilinear Bandits | Unknown | N/A | |
| Perfect Sampling from Pairwise Comparisons | Unknown | N/A | |
| Value Function Decomposition for Iterative Design of Reinforcement Learning Agents | Unknown | N/A | |
| Constraining Gaussian Processes to Systems of Linear Ordinary Differential Equations | Unknown | N/A | |
| VAEL: Bridging Variational Autoencoders and Probabilistic Logic Programming | Unknown | N/A | |
| Conformal Off-Policy Prediction in Contextual Bandits | Unknown | N/A | |
| Constrained Update Projection Approach to Safe Policy Optimization | Unknown | N/A | |
| Global Linear and Local Superlinear Convergence of IRLS for Non-Smooth Robust Regression | Unknown | N/A | |
| A Fourier Approach to Mixture Learning | Unknown | N/A | |
| LECO: Learnable Episodic Count for Task-Specific Intrinsic Reward | Unknown | N/A | |
| Domain Generalization without Excess Empirical Risk | Unknown | N/A | |
| Navigating Memory Construction by Global Pseudo-Task Simulation for Continual Learning | Unknown | N/A | |
| Optimal Transport of Classifiers to Fairness | Unknown | N/A | |
| FedSR: A Simple and Effective Domain Generalization Method for Federated Learning | Unknown | N/A | |
| Using Partial Monotonicity in Submodular Maximization | Unknown | N/A | |
| When Do Flat Minima Optimizers Work? | Unknown | N/A | |
| Revisiting Non-Parametric Matching Cost Volumes for Robust and Generalizable Stereo Matching | Unknown | N/A | |
| Large-scale Optimization of Partial AUC in a Range of False Positive Rates | Unknown | N/A | |
| Learning in Congestion Games with Bandit Feedback | Unknown | N/A | |
| TreeMoCo: Contrastive Neuron Morphology Representation Learning | Unknown | N/A | |
| Near-Optimal Sample Complexity Bounds for Constrained MDPs | Unknown | N/A | |
| Fairness Transferability Subject to Bounded Distribution Shift | Unknown | N/A | |
| The Burer-Monteiro SDP method can fail even above the Barvinok-Pataki bound | Unknown | N/A | |
| WeightedSHAP: analyzing and improving Shapley based feature attributions | Unknown | N/A | |
| How to talk so AI will learn: Instructions, descriptions, and autonomy | Unknown | N/A | |
| Improved Algorithms for Neural Active Learning | Unknown | N/A | |
| Global Convergence of Direct Policy Search for State-Feedback $\mathcal{H}_\infty$ Robust Control: A Revisit of Nonsmooth Synthesis with Goldstein Subdifferential | Unknown | N/A | |
| Nonlinear Sufficient Dimension Reduction with a Stochastic Neural Network | Unknown | N/A | |
| Bayesian inference via sparse Hamiltonian flows | Unknown | N/A | |
| On Batch Teaching with Sample Complexity Bounded by VCD | Unknown | N/A | |
| AVLEN: Audio-Visual-Language Embodied Navigation in 3D Environments | Unknown | N/A | |
| Model-based Lifelong Reinforcement Learning with Bayesian Exploration | Unknown | N/A | |
| projUNN: efficient method for training deep networks with unitary matrices | Unknown | N/A | |
| Staggered Rollout Designs Enable Causal Inference Under Interference Without Network Knowledge | Unknown | N/A | |
| KERPLE: Kernelized Relative Positional Embedding for Length Extrapolation | Unknown | N/A | |
| An Information-Theoretic Framework for Deep Learning | Unknown | N/A | |
| ORIENT: Submodular Mutual Information Measures for Data Subset Selection under Distribution Shift | Unknown | N/A | |
| Insights into Pre-training via Simpler Synthetic Tasks | Unknown | N/A | |
| Attracting and Dispersing: A Simple Approach for Source-free Domain Adaptation | Unknown | N/A | |
| Nearly Optimal Algorithms for Linear Contextual Bandits with Adversarial Corruptions | Unknown | N/A | |
| Density-driven Regularization for Out-of-distribution Detection | Unknown | N/A | |
| Rapid Model Architecture Adaption for Meta-Learning | Unknown | N/A | |
| Finding Correlated Equilibrium of Constrained Markov Game: A Primal-Dual Approach | Unknown | N/A | |
| Hyperbolic Embedding Inference for Structured Multi-Label Prediction | Unknown | N/A | |
| AutoMTL: A Programming Framework for Automating Efficient Multi-Task Learning | Unknown | N/A | |
| Wavelet Feature Maps Compression for Image-to-Image CNNs | Unknown | N/A | |
| CoPur: Certifiably Robust Collaborative Inference via Feature Purification | Unknown | N/A | |
| Interventions, Where and How? Experimental Design for Causal Models at Scale | Unknown | N/A | |
| Efficient Non-Parametric Optimizer Search for Diverse Tasks | Unknown | N/A | |
| Seeing the forest and the tree: Building representations of both individual and collective dynamics with transformers | Unknown | N/A | |
| Deep Architecture Connectivity Matters for Its Convergence: A Fine-Grained Analysis | Unknown | N/A | |
| Scaling Multimodal Pre-Training via Cross-Modality Gradient Harmonization | Unknown | N/A | |
| A Character-Level Length-Control Algorithm for Non-Autoregressive Sentence Summarization | Unknown | N/A | |
| The Privacy Onion Effect: Memorization is Relative | Unknown | N/A | |
| Recursive Reasoning in Minimax Games: A Level $k$ Gradient Play Method | Unknown | N/A | |
| Deep Ensembles Work, But Are They Necessary? | Unknown | N/A | |
| Tight Lower Bounds on Worst-Case Guarantees for Zero-Shot Learning with Attributes | Unknown | N/A | |
| Variational Model Perturbation for Source-Free Domain Adaptation | Unknown | N/A | |
| Generative multitask learning mitigates target-causing confounding | Unknown | N/A | |
| Draft-and-Revise: Effective Image Generation with Contextual RQ-Transformer | Unknown | N/A | |
| Acceleration in Distributed Sparse Regression | Unknown | N/A | |
| Learning Two-Player Markov Games: Neural Function Approximation and Correlated Equilibrium | Unknown | N/A | |
| Repairing Neural Networks by Leaving the Right Past Behind | Unknown | N/A | |
| [Re] Privacy-preserving collaborative learning with automatic transformation search | Unknown | N/A | |
| Sequence Model Imitation Learning with Unobserved Contexts | Unknown | N/A | |
| GULP: a prediction-based metric between representations | Unknown | N/A | |
| Efficient Frameworks for Generalized Low-Rank Matrix Bandit Problems | Unknown | N/A | |
| Composition Theorems for Interactive Differential Privacy | Unknown | N/A | |
| On the Global Convergence Rates of Decentralized Softmax Gradient Play in Markov Potential Games | Unknown | N/A | |
| Robust Generalized Method of Moments: A Finite Sample Viewpoint | Unknown | N/A | |
| Boosting Barely Robust Learners: A New Perspective on Adversarial Robustness | Unknown | N/A | |
| Regret Bounds for Risk-Sensitive Reinforcement Learning | Unknown | N/A | |
| Semi-supervised Active Linear Regression | Unknown | N/A | |
| Near-Isometric Properties of Kronecker-Structured Random Tensor Embeddings | Unknown | N/A | |
| Riemannian Diffusion Models | Unknown | N/A | |
| Towards Safe Reinforcement Learning with a Safety Editor Policy | Unknown | N/A | |
| On the Safety of Interpretable Machine Learning: A Maximum Deviation Approach | Unknown | N/A | |
| The Implicit Delta Method | Unknown | N/A | |
| Bellman Residual Orthogonalization for Offline Reinforcement Learning | Unknown | N/A | |
| Meta-Learning Dynamics Forecasting Using Task Inference | Unknown | N/A | |
| On Scrambling Phenomena for Randomly Initialized Recurrent Networks | Unknown | N/A | |
| Data-Efficient Augmentation for Training Neural Networks | Unknown | N/A | |
| Beyond black box densities: Parameter learning for the deviated components | Unknown | N/A | |
| Robust Learning against Relational Adversaries | Unknown | N/A | |
| Policy Optimization for Markov Games: Unified Framework and Faster Convergence | Unknown | N/A | |
| Continuously Tempered PDMP samplers | Unknown | N/A | |
| Uncalibrated Models Can Improve Human-AI Collaboration | Unknown | N/A | |
| Few-Shot Non-Parametric Learning with Deep Latent Variable Model | Unknown | N/A | |
| Emergent Graphical Conventions in a Visual Communication Game | Unknown | N/A | |
| Chain of Thought Imitation with Procedure Cloning | Unknown | N/A | |
| Conformalized Fairness via Quantile Regression | Unknown | N/A | |
| Improving Self-Supervised Learning by Characterizing Idealized Representations | Unknown | N/A | |
| Learning Options via Compression | Unknown | N/A | |
| Rapidly Mixing Multiple-try Metropolis Algorithms for Model Selection Problems | Unknown | N/A | |
| Understanding Hyperdimensional Computing for Parallel Single-Pass Learning | Unknown | N/A | |
| Functional Indirection Neural Estimator for Better Out-of-distribution Generalization | Unknown | N/A | |
| Few-shot Learning for Feature Selection with Hilbert-Schmidt Independence Criterion | Unknown | N/A | |
| Bayesian Spline Learning for Equation Discovery of Nonlinear Dynamics with Quantified Uncertainty | Unknown | N/A | |
| Sketch-GNN: Scalable Graph Neural Networks with Sublinear Training Complexity | Unknown | N/A | |
| Active Learning Polynomial Threshold Functions | Unknown | N/A | |
| Efficient and Near-Optimal Smoothed Online Learning for Generalized Linear Functions | Unknown | N/A | |
| Signal Propagation in Transformers: Theoretical Perspectives and the Role of Rank Collapse | Unknown | N/A | |
| An Analysis of Ensemble Sampling | Unknown | N/A | |
| Towards Understanding the Condensation of Neural Networks at Initial Training | Unknown | N/A | |
| Fair Infinitesimal Jackknife: Mitigating the Influence of Biased Training Data Points Without Refitting | Unknown | N/A | |
| Optimal Scaling for Locally Balanced Proposals in Discrete Spaces | Unknown | N/A | |
| End-to-end Algorithm Synthesis with Recurrent Networks: Extrapolation without Overthinking | Unknown | N/A | |
| Domain Adaptation meets Individual Fairness. And they get along. | Unknown | N/A | |
| Free Probability for predicting the performance of feed-forward fully connected neural networks | Unknown | N/A | |
| Conformal Prediction with Temporal Quantile Adjustments | Unknown | N/A | |
| Using natural language and program abstractions to instill human inductive biases in machines | Unknown | N/A | |
| Generalizing Goal-Conditioned Reinforcement Learning with Variational Causal Reasoning | Unknown | N/A | |
| Neurosymbolic Deep Generative Models for Sequence Data with Relational Constraints | Unknown | N/A | |
| Polynomial time guarantees for the Burer-Monteiro method | Unknown | N/A | |
| Scalable design of Error-Correcting Output Codes using Discrete Optimization with Graph Coloring | Unknown | N/A | |
| On Deep Generative Models for Approximation and Estimation of Distributions on Manifolds | Unknown | N/A | |
| Nest Your Adaptive Algorithm for Parameter-Agnostic Nonconvex Minimax Optimization | Unknown | N/A | |
| Simple Unsupervised Object-Centric Learning for Complex and Naturalistic Videos | Unknown | N/A | |
| Revisiting Optimal Convergence Rate for Smooth and Non-convex Stochastic Decentralized Optimization | Unknown | N/A | |
| Physics-Informed Implicit Representations of Equilibrium Network Flows | Unknown | N/A | |
| Learning Generalized Policy Automata for Relational Stochastic Shortest Path Problems | Unknown | N/A | |
| Simplified Graph Convolution with Heterophily | Unknown | N/A | |
| DMAP: a Distributed Morphological Attention Policy for learning to locomote with a changing body | Unknown | N/A | |
| Byzantine-tolerant federated Gaussian process regression for streaming data | Unknown | N/A | |
| Distributionally Adaptive Meta Reinforcement Learning | Unknown | N/A | |
| Submodular Maximization in Clean Linear Time | Unknown | N/A | |
| Amortized Proximal Optimization | Unknown | N/A | |
| On Learning Fairness and Accuracy on Multiple Subgroups | Unknown | N/A | |
| HUMUS-Net: Hybrid Unrolled Multi-scale Network Architecture for Accelerated MRI Reconstruction | Unknown | N/A | |
| On the Symmetries of Deep Learning Models and their Internal Representations | Unknown | N/A | |
| Calibrated Data-Dependent Constraints with Exact Satisfaction Guarantees | Unknown | N/A | |
| Second Thoughts are Best: Learning to Re-Align With Human Values from Text Edits | Unknown | N/A | |
| Decision-based Black-box Attack Against Vision Transformers via Patch-wise Adversarial Removal | Unknown | N/A | |
| FourierFormer: Transformer Meets Generalized Fourier Integral Theorem | Unknown | N/A | |
| In What Ways Are Deep Neural Networks Invariant and How Should We Measure This? | Unknown | N/A | |
| Faster and Scalable Algorithms for Densest Subgraph and Decomposition | Unknown | N/A | |
| Co-Modality Graph Contrastive Learning for Imbalanced Node Classification | Unknown | N/A | |
| ACIL: Analytic Class-Incremental Learning with Absolute Memorization and Privacy Protection | Unknown | N/A | |
| Zeroth-Order Hard-Thresholding: Gradient Error vs. Expansivity | Unknown | N/A | |
| Non-Linguistic Supervision for Contrastive Learning of Sentence Embeddings | Unknown | N/A | |
| A Non-Asymptotic Moreau Envelope Theory for High-Dimensional Generalized Linear Models | Unknown | N/A | |
| A General Framework for Auditing Differentially Private Machine Learning | Unknown | N/A | |
| Minimax Optimal Algorithms for Fixed-Budget Best Arm Identification | Unknown | N/A | |
| Pruning’s Effect on Generalization Through the Lens of Training and Regularization | Unknown | N/A | |
| Kernel similarity matching with Hebbian networks | Unknown | N/A | |
| QC-StyleGAN - Quality Controllable Image Generation and Manipulation | Unknown | N/A | |
| Human-AI Shared Control via Policy Dissection | Unknown | N/A | |
| Label-invariant Augmentation for Semi-Supervised Graph Classification | Unknown | N/A | |
| Preservation of the Global Knowledge by Not-True Distillation in Federated Learning | Unknown | N/A | |
| Using Embeddings for Causal Estimation of Peer Influence in Social Networks | Unknown | N/A | |
| A Unifying Framework of Off-Policy General Value Function Evaluation | Unknown | N/A | |
| TaSIL: Taylor Series Imitation Learning | Unknown | N/A | |
| Asymptotic Behaviors of Projected Stochastic Approximation: A Jump Diffusion Perspective | Unknown | N/A | |
| VF-PS: How to Select Important Participants in Vertical Federated Learning, Efficiently and Securely? | Unknown | N/A | |
| LISA: Learning Interpretable Skill Abstractions from Language | Unknown | N/A | |
| NSNet: A General Neural Probabilistic Framework for Satisfiability Problems | Unknown | N/A | |
| Model Preserving Compression for Neural Networks | Unknown | N/A | |
| Effects of Data Geometry in Early Deep Learning | Unknown | N/A | |
| Minimax-Optimal Multi-Agent RL in Markov Games With a Generative Model | Unknown | N/A | |
| Tight Mutual Information Estimation With Contrastive Fenchel-Legendre Optimization | Unknown | N/A | |
| Pre-Trained Model Reusability Evaluation for Small-Data Transfer Learning | Unknown | N/A | |
| Graph Few-shot Learning with Task-specific Structures | Unknown | N/A | |
| Offline Goal-Conditioned Reinforcement Learning via $f$-Advantage Regression | Unknown | N/A | |
| Old can be Gold: Better Gradient Flow can Make Vanilla-GCNs Great Again | Unknown | N/A | |
| Faster Deep Reinforcement Learning with Slower Online Network | Unknown | N/A | |
| An Asymptotically Optimal Batched Algorithm for the Dueling Bandit Problem | Unknown | N/A | |
| Beyond Not-Forgetting: Continual Learning with Backward Knowledge Transfer | Unknown | N/A | |
| Distributed Distributionally Robust Optimization with Non-Convex Objectives | Unknown | N/A | |
| Continuous Deep Q-Learning in Optimal Control Problems: Normalized Advantage Functions Analysis | Unknown | N/A | |
| Merging Models with Fisher-Weighted Averaging | Unknown | N/A | |
| Path Independent Equilibrium Models Can Better Exploit Test-Time Computation | Unknown | N/A | |
| Private Graph All-Pairwise-Shortest-Path Distance Release with Improved Error Rate | Unknown | N/A | |
| A Theory of PAC Learnability under Transformation Invariances | Unknown | N/A | |
| Global Convergence of Federated Learning for Mixed Regression | Unknown | N/A | |
| Segmenting Moving Objects via an Object-Centric Layered Representation | Unknown | N/A | |
| Invariance Learning based on Label Hierarchy | Unknown | N/A | |
| Online Algorithms for the Santa Claus Problem | Unknown | N/A | |
| Federated Learning from Pre-Trained Models: A Contrastive Learning Approach | Unknown | N/A | |
| When are Offline Two-Player Zero-Sum Markov Games Solvable? | Unknown | N/A | |
| Tree Mover's Distance: Bridging Graph Metrics and Stability of Graph Neural Networks | Unknown | N/A | |
| Provably Efficient Offline Multi-agent Reinforcement Learning via Strategy-wise Bonus | Unknown | N/A | |
| Implicit Bias of Gradient Descent on Reparametrized Models: On Equivalence to Mirror Descent | Unknown | N/A | |
| Few-Shot Audio-Visual Learning of Environment Acoustics | Unknown | N/A | |
| Redundancy-Free Message Passing for Graph Neural Networks | Unknown | N/A | |
| SemMAE: Semantic-Guided Masking for Learning Masked Autoencoders | Unknown | N/A | |
| Weighted Distillation with Unlabeled Examples | Unknown | N/A | |
| Mixture-of-Experts with Expert Choice Routing | Unknown | N/A | |
| The Stability-Efficiency Dilemma: Investigating Sequence Length Warmup for Training GPT Models | Unknown | N/A | |
| Diffusion-LM Improves Controllable Text Generation | Unknown | N/A | |
| Self-Supervised Pretraining for Large-Scale Point Clouds | Unknown | N/A | |
| Invariant and Transportable Representations for Anti-Causal Domain Shifts | Unknown | N/A | |
| Sparsity in Continuous-Depth Neural Networks | Unknown | N/A | |
| A Variational Edge Partition Model for Supervised Graph Representation Learning | Unknown | N/A | |
| A Simple Approach to Automated Spectral Clustering | Unknown | N/A | |
| Point Transformer V2: Grouped Vector Attention and Partition-based Pooling | Unknown | N/A | |
| Accelerated Training of Physics-Informed Neural Networks (PINNs) using Meshless Discretizations | Unknown | N/A | |
| Fault-Aware Neural Code Rankers | Unknown | N/A | |
| PAC-Bayes Compression Bounds So Tight That They Can Explain Generalization | Unknown | N/A | |
| A simple but strong baseline for online continual learning: Repeated Augmented Rehearsal | Unknown | N/A | |
| Off-Policy Evaluation for Episodic Partially Observable Markov Decision Processes under Non-Parametric Models | Unknown | N/A | |
| Learning Symmetric Rules with SATNet | Unknown | N/A | |
| Consistent Interpolating Ensembles via the Manifold-Hilbert Kernel | Unknown | N/A | |
| Get More at Once: Alternating Sparse Training with Gradient Correction | Unknown | N/A | |
| Learning Fractional White Noises in Neural Stochastic Differential Equations | Unknown | N/A | |
| “Why Not Other Classes?”: Towards Class-Contrastive Back-Propagation Explanations | Unknown | N/A | |
| Why Robust Generalization in Deep Learning is Difficult: Perspective of Expressive Power | Unknown | N/A | |
| Training with More Confidence: Mitigating Injected and Natural Backdoors During Training | Unknown | N/A | |
| Iterative Feature Matching: Toward Provable Domain Generalization with Logarithmic Environments | Unknown | N/A | |
| Batch Multi-Fidelity Active Learning with Budget Constraints | Unknown | N/A | |
| Between Stochastic and Adversarial Online Convex Optimization: Improved Regret Bounds via Smoothness | Unknown | N/A | |
| Time-Conditioned Dances with Simplicial Complexes: Zigzag Filtration Curve based Supra-Hodge Convolution Networks for Time-series Forecasting | Unknown | N/A | |
| Constrained Stochastic Nonconvex Optimization with State-dependent Markov Data | Unknown | N/A | |
| Integral Probability Metrics PAC-Bayes Bounds | Unknown | N/A | |
| Skills Regularized Task Decomposition for Multi-task Offline Reinforcement Learning | Unknown | N/A | |
| M2N: Mesh Movement Networks for PDE Solvers | Unknown | N/A | |
| Exploiting the Relationship Between Kendall's Rank Correlation and Cosine Similarity for Attribution Protection | Unknown | N/A | |
| Understanding and Improving Robustness of Vision Transformers through Patch-based Negative Augmentation | Unknown | N/A | |
| Gaussian Copula Embeddings | Unknown | N/A | |
| Transferring Pre-trained Multimodal Representations with Cross-modal Similarity Matching | Unknown | N/A | |
| CoNSoLe: Convex Neural Symbolic Learning | Unknown | N/A | |
| Maximum-Likelihood Inverse Reinforcement Learning with Finite-Time Guarantees | Unknown | N/A | |
| Meta-Auto-Decoder for Solving Parametric Partial Differential Equations | Unknown | N/A | |
| Non-Stationary Bandits under Recharging Payoffs: Improved Planning with Sublinear Regret | Unknown | N/A | |
| Long-Form Video-Language Pre-Training with Multimodal Temporal Contrastive Learning | Unknown | N/A | |
| PlasticityNet: Learning to Simulate Metal, Sand, and Snow for Optimization Time Integration | Unknown | N/A | |
| [Re] Value Alignment Verification | Unknown | N/A | |
| Nearly-Tight Bounds for Testing Histogram Distributions | Unknown | N/A | |
| Rethinking and Scaling Up Graph Contrastive Learning: An Extremely Efficient Approach with Group Discrimination | Unknown | N/A | |
| Uncertainty Estimation for Multi-view Data: The Power of Seeing the Whole Picture | Unknown | N/A | |
| Reinforcement Learning with Automated Auxiliary Loss Search | Unknown | N/A | |
| Tractable Function-Space Variational Inference in Bayesian Neural Networks | Unknown | N/A | |
| Are all Frames Equal? Active Sparse Labeling for Video Action Detection | Unknown | N/A | |
| Unsupervised Learning under Latent Label Shift | Unknown | N/A | |
| You Can’t Count on Luck: Why Decision Transformers and RvS Fail in Stochastic Environments | Unknown | N/A | |
| Provable Subspace Identification Under Post-Nonlinear Mixtures | Unknown | N/A | |
| Truly Deterministic Policy Optimization | Unknown | N/A | |
| Active Learning Helps Pretrained Models Learn the Intended Task | Unknown | N/A | |
| A Consolidated Cross-Validation Algorithm for Support Vector Machines via Data Reduction | Unknown | N/A | |
| Giving Feedback on Interactive Student Programs with Meta-Exploration | Unknown | N/A | |
| On Leave-One-Out Conditional Mutual Information For Generalization | Unknown | N/A | |
| High-dimensional Asymptotics of Feature Learning: How One Gradient Step Improves the Representation | Unknown | N/A | |
| Global Optimal K-Medoids Clustering of One Million Samples | Unknown | N/A | |
| A Scalable Deterministic Global Optimization Algorithm for Training Optimal Decision Tree | Unknown | N/A | |
| What Can Transformers Learn In-Context? A Case Study of Simple Function Classes | Unknown | N/A | |
| GALOIS: Boosting Deep Reinforcement Learning via Generalizable Logic Synthesis | Unknown | N/A | |
| Provably Feedback-Efficient Reinforcement Learning via Active Reward Learning | Unknown | N/A | |
| Data-Driven Offline Decision-Making via Invariant Representation Learning | Unknown | N/A | |
| When Does Group Invariant Learning Survive Spurious Correlations? | Unknown | N/A | |
| On Elimination Strategies for Bandit Fixed-Confidence Identification | Unknown | N/A | |
| So3krates: Equivariant attention for interactions on arbitrary length-scales in molecular systems | Unknown | N/A | |
| Making Look-Ahead Active Learning Strategies Feasible with Neural Tangent Kernels | Unknown | N/A | |
| DHRL: A Graph-Based Approach for Long-Horizon and Sparse Hierarchical Reinforcement Learning | Unknown | N/A | |
| Improving Diffusion Models for Inverse Problems using Manifold Constraints | Unknown | N/A | |
| DARE: Disentanglement-Augmented Rationale Extraction | Unknown | N/A | |
| Symmetry-induced Disentanglement on Graphs | Unknown | N/A | |
| Learning in Observable POMDPs, without Computationally Intractable Oracles | Unknown | N/A | |
| When to Ask for Help: Proactive Interventions in Autonomous Reinforcement Learning | Unknown | N/A | |
| Spherization Layer: Representation Using Only Angles | Unknown | N/A | |
| Grounded Reinforcement Learning: Learning to Win the Game under Human Commands | Unknown | N/A | |
| How Powerful are K-hop Message Passing Graph Neural Networks | Unknown | N/A | |
| MEMO: Test Time Robustness via Adaptation and Augmentation | Unknown | N/A | |
| Redundant representations help generalization in wide neural networks | Unknown | N/A | |
| Dynamic Learning in Large Matching Markets | Unknown | N/A | |
| Near-Optimal Goal-Oriented Reinforcement Learning in Non-Stationary Environments | Unknown | N/A | |
| Towards Understanding the Mixture-of-Experts Layer in Deep Learning | Unknown | N/A | |
| A time-resolved theory of information encoding in recurrent neural networks | Unknown | N/A | |
| Coresets for Relational Data and The Applications | Unknown | N/A | |
| Coresets for Wasserstein Distributionally Robust Optimization Problems | Unknown | N/A | |
| Lazy and Fast Greedy MAP Inference for Determinantal Point Process | Unknown | N/A | |
| FlowHMM: Flow-based continuous hidden Markov models | Unknown | N/A | |
| Max-Min Off-Policy Actor-Critic Method Focusing on Worst-Case Robustness to Model Misspecification | Unknown | N/A | |
| An Adaptive Deep RL Method for Non-Stationary Environments with Piecewise Stable Context | Unknown | N/A | |
| Experimental Design for Linear Functionals in Reproducing Kernel Hilbert Spaces | Unknown | N/A | |
| Imbalance Trouble: Revisiting Neural-Collapse Geometry | Unknown | N/A | |
| Injecting Domain Knowledge from Empirical Interatomic Potentials to Neural Networks for Predicting Material Properties | Unknown | N/A | |
| How Mask Matters: Towards Theoretical Understandings of Masked Autoencoders | Unknown | N/A | |
| WT-MVSNet: Window-based Transformers for Multi-view Stereo | Unknown | N/A | |
| Models Out of Line: A Fourier Lens on Distribution Shift Robustness | Unknown | N/A | |
| SCINet: Time Series Modeling and Forecasting with Sample Convolution and Interaction | Unknown | N/A | |
| Chromatic Correlation Clustering, Revisited | Unknown | N/A | |
| A Reduction to Binary Approach for Debiasing Multiclass Datasets | Unknown | N/A | |
| MetricFormer: A Unified Perspective of Correlation Exploring in Similarity Learning | Unknown | N/A | |
| Asynchronous SGD Beats Minibatch SGD Under Arbitrary Delays | Unknown | N/A | |
| Revisiting Neural Scaling Laws in Language and Vision | Unknown | N/A | |
| Towards Consistency in Adversarial Classification | Unknown | N/A | |
| Last-Iterate Convergence of Optimistic Gradient Method for Monotone Variational Inequalities | Unknown | N/A | |
| Graph Convolution Network based Recommender Systems: Learning Guarantee and Item Mixture Powered Strategy | Unknown | N/A | |
| Revisit last-iterate convergence of mSGD under milder requirement on step size | Unknown | N/A | |
| Non-Monotonic Latent Alignments for CTC-Based Non-Autoregressive Machine Translation | Unknown | N/A | |
| Joint Learning of 2D-3D Weakly Supervised Semantic Segmentation | Unknown | N/A | |
| Graph Coloring via Neural Networks for Haplotype Assembly and Viral Quasispecies Reconstruction | Unknown | N/A | |
| Optimal Positive Generation via Latent Transformation for Contrastive Learning | Unknown | N/A | |
| Neural-Symbolic Entangled Framework for Complex Query Answering | Unknown | N/A | |
| Multiagent Q-learning with Sub-Team Coordination | Unknown | N/A | |
| Sound and Complete Verification of Polynomial Networks | Unknown | N/A | |
| Laplacian Autoencoders for Learning Stochastic Representations | Unknown | N/A | |
| Oracle Inequalities for Model Selection in Offline Reinforcement Learning | Unknown | N/A | |
| Revisiting Active Sets for Gaussian Process Decoders | Unknown | N/A | |
| Bounding and Approximating Intersectional Fairness through Marginal Fairness | Unknown | N/A | |
| MAtt: A Manifold Attention Network for EEG Decoding | Unknown | N/A | |
| BinauralGrad: A Two-Stage Conditional Diffusion Probabilistic Model for Binaural Audio Synthesis | Unknown | N/A | |
| Collaborative Decision Making Using Action Suggestions | Unknown | N/A | |
| Dynamics of SGD with Stochastic Polyak Stepsizes: Truly Adaptive Variants and Convergence to Exact Solution | Unknown | N/A | |
| A gradient estimator via L1-randomization for online zero-order optimization with two point feedback | Unknown | N/A | |
| Bring Your Own Algorithm for Optimal Differentially Private Stochastic Minimax Optimization | Unknown | N/A | |
| Selective compression learning of latent representations for variable-rate image compression | Unknown | N/A | |
| Neural Network Architecture Beyond Width and Depth | Unknown | N/A | |
| On the relationship between variational inference and auto-associative memory | Unknown | N/A | |
| Sparse Probabilistic Circuits via Pruning and Growing | Unknown | N/A | |
| When to Intervene: Learning Optimal Intervention Policies for Critical Events | Unknown | N/A | |
| Smoothed Embeddings for Certified Few-Shot Learning | Unknown | N/A | |
| An Analytical Theory of Curriculum Learning in Teacher-Student Networks | Unknown | N/A | |
| Black-box coreset variational inference | Unknown | N/A | |
| Distilling Representations from GAN Generator via Squeeze and Span | Unknown | N/A | |
| Generalization Analysis of Message Passing Neural Networks on Large Random Graphs | Unknown | N/A | |
| Meta-Learning with Self-Improving Momentum Target | Unknown | N/A | |
| Reinforcement Learning in a Birth and Death Process: Breaking the Dependence on the State Space | Unknown | N/A | |
| Sequence-to-Set Generative Models | Unknown | N/A | |
| What Makes Graph Neural Networks Miscalibrated? | Unknown | N/A | |
| A Win-win Deal: Towards Sparse and Robust Pre-trained Language Models | Unknown | N/A | |
| Consistency of Constrained Spectral Clustering under Graph Induced Fair Planted Partitions | Unknown | N/A | |
| A Regret-Variance Trade-Off in Online Learning | Unknown | N/A | |
| Optimal Brain Compression: A Framework for Accurate Post-Training Quantization and Pruning | Unknown | N/A | |
| Not too little, not too much: a theoretical analysis of graph (over)smoothing | Unknown | N/A | |
| On A Mallows-type Model For (Ranked) Choices | Unknown | N/A | |
| Inverse Design for Fluid-Structure Interactions using Graph Network Simulators | Unknown | N/A | |
| Towards a Standardised Performance Evaluation Protocol for Cooperative MARL | Unknown | N/A | |
| On the Learning Mechanisms in Physical Reasoning | Unknown | N/A | |
| Joint Entropy Search For Maximally-Informed Bayesian Optimization | Unknown | N/A | |
| Benign Overfitting in Two-layer Convolutional Neural Networks | Unknown | N/A | |
| Proximal Point Imitation Learning | Unknown | N/A | |
| On the Robustness of Graph Neural Diffusion to Topology Perturbations | Unknown | N/A | |
| Power and limitations of single-qubit native quantum neural networks | Unknown | N/A | |
| A Characterization of Semi-Supervised Adversarially Robust PAC Learnability | Unknown | N/A | |
| Accelerating SGD for Highly Ill-Conditioned Huge-Scale Online Matrix Completion | Unknown | N/A | |
| Accelerated Primal-Dual Gradient Method for Smooth and Convex-Concave Saddle-Point Problems with Bilinear Coupling | Unknown | N/A | |
| Distilled Gradient Aggregation: Purify Features for Input Attribution in the Deep Neural Network | Unknown | N/A | |
| Sequential Information Design: Learning to Persuade in the Dark | Unknown | N/A | |
| Optimal Weak to Strong Learning | Unknown | N/A | |
| Unsupervised Learning of Group Invariant and Equivariant Representations | Unknown | N/A | |
| On the Approximation of Cooperative Heterogeneous Multi-Agent Reinforcement Learning (MARL) using Mean Field Control (MFC) | Unknown | N/A | |
| Estimating the Arc Length of the Optimal ROC Curve and Lower Bounding the Maximal AUC | Unknown | N/A | |
| Disentangling Causal Effects from Sets of Interventions in the Presence of Unobserved Confounders | Unknown | N/A | |
| A Reparametrization-Invariant Sharpness Measure Based on Information Geometry | Unknown | N/A | |
| Bayesian Active Learning with Fully Bayesian Gaussian Processes | Unknown | N/A | |
| Log-Concave and Multivariate Canonical Noise Distributions for Differential Privacy | Unknown | N/A | |
| On Measuring Excess Capacity in Neural Networks | Unknown | N/A | |
| General Cutting Planes for Bound-Propagation-Based Neural Network Verification | Unknown | N/A | |
| Unsupervised Adaptation from Repeated Traversals for Autonomous Driving | Unknown | N/A | |
| Fine-Grained Semantically Aligned Vision-Language Pre-Training | Unknown | N/A | |
| On Sample Optimality in Personalized Collaborative and Federated Learning | Unknown | N/A | |
| Using Mixup as a Regularizer Can Surprisingly Improve Accuracy & Out-of-Distribution Robustness | Unknown | N/A | |
| Sharper Convergence Guarantees for Asynchronous SGD for Distributed and Federated Learning | Unknown | N/A | |
| A Variant of Anderson Mixing with Minimal Memory Size | Unknown | N/A | |
| Augmented RBMLE-UCB Approach for Adaptive Control of Linear Quadratic Systems | Unknown | N/A | |
| Improved techniques for deterministic l2 robustness | Unknown | N/A | |
| Real-Valued Backpropagation is Unsuitable for Complex-Valued Neural Networks | Unknown | N/A | |
| Anonymized Histograms in Intermediate Privacy Models | Unknown | N/A | |
| Relaxing Equivariance Constraints with Non-stationary Continuous Filters | Unknown | N/A | |
| MCL-GAN: Generative Adversarial Networks with Multiple Specialized Discriminators | Unknown | N/A | |
| Sparse Gaussian Process Hyperparameters: Optimize or Integrate? | Unknown | N/A | |
| Sleeper Agent: Scalable Hidden Trigger Backdoors for Neural Networks Trained from Scratch | Unknown | N/A | |
| Learning to Branch with Tree MDPs | Unknown | N/A | |
| Fine-tuning Language Models over Slow Networks using Activation Quantization with Guarantees | Unknown | N/A | |
| Probing Classifiers are Unreliable for Concept Removal and Detection | Unknown | N/A | |
| Graph Learning Assisted Multi-Objective Integer Programming | Unknown | N/A | |
| Randomized Sketches for Clustering: Fast and Optimal Kernel $k$-Means | Unknown | N/A | |
| Escaping Saddle Points for Effective Generalization on Class-Imbalanced Data | Unknown | N/A | |
| Certifying Robust Graph Classification under Orthogonal Gromov-Wasserstein Threats | Unknown | N/A | |
| On the Representation Collapse of Sparse Mixture of Experts | Unknown | N/A | |
| Information bottleneck theory of high-dimensional regression: relevancy, efficiency and optimality | Unknown | N/A | |
| Statistically Meaningful Approximation: a Case Study on Approximating Turing Machines with Transformers | Unknown | N/A | |
| A Data-Augmentation Is Worth A Thousand Samples: Analytical Moments And Sampling-Free Training | Unknown | N/A | |
| Partial Identification of Treatment Effects with Implicit Generative Models | Unknown | N/A | |
| Learning Neural Acoustic Fields | Unknown | N/A | |
| Variance Reduced ProxSkip: Algorithm, Theory and Application to Federated Learning | Unknown | N/A | |
| Local Metric Learning for Off-Policy Evaluation in Contextual Bandits with Continuous Actions | Unknown | N/A | |
| A contrastive rule for meta-learning | Unknown | N/A | |
| Meta Reinforcement Learning with Finite Training Tasks - a Density Estimation Approach | Unknown | N/A | |
| A gradient sampling method with complexity guarantees for Lipschitz functions in high and low dimensions | Unknown | N/A | |
| Regularized Molecular Conformation Fields | Unknown | N/A | |
| You Never Stop Dancing: Non-freezing Dance Generation via Bank-constrained Manifold Projection | Unknown | N/A | |
| Risk-Driven Design of Perception Systems | Unknown | N/A | |
| Langevin Autoencoders for Learning Deep Latent Variable Models | Unknown | N/A | |
| Neural Estimation of Submodular Functions with Applications to Differentiable Subset Selection | Unknown | N/A | |
| Learning on Arbitrary Graph Topologies via Predictive Coding | Unknown | N/A | |
| Multi-Lingual Acquisition on Multimodal Pre-training for Cross-modal Retrieval | Unknown | N/A | |
| Semantic Exploration from Language Abstractions and Pretrained Representations | Unknown | N/A | |
| A Unified Sequence Interface for Vision Tasks | Unknown | N/A | |
| Is Integer Arithmetic Enough for Deep Learning Training? | Unknown | N/A | |
| Confident Adaptive Language Modeling | Unknown | N/A | |
| Learning Dynamical Systems via Koopman Operator Regression in Reproducing Kernel Hilbert Spaces | Unknown | N/A | |
| 3D Concept Grounding on Neural Fields | Unknown | N/A | |
| A Solver-free Framework for Scalable Learning in Neural ILP Architectures | Unknown | N/A | |
| Unsupervised Object Representation Learning using Translation and Rotation Group Equivariant VAE | Unknown | N/A | |
| Beyond Rewards: a Hierarchical Perspective on Offline Multiagent Behavioral Analysis | Unknown | N/A | |
| Luckiness in Multiscale Online Learning | Unknown | N/A | |
| Effective Dimension in Bandit Problems under Censorship | Unknown | N/A | |
| In Defense of the Unitary Scalarization for Deep Multi-Task Learning | Unknown | N/A | |
| Beyond IID: data-driven decision-making in heterogeneous environments | Unknown | N/A | |
| Scalable Multi-agent Covering Option Discovery based on Kronecker Graphs | Unknown | N/A | |
| Private Multiparty Perception for Navigation | Unknown | N/A | |
| Group Meritocratic Fairness in Linear Contextual Bandits | Unknown | N/A | |
| Deep Equilibrium Approaches to Diffusion Models | Unknown | N/A | |
| Addressing Leakage in Concept Bottleneck Models | Unknown | N/A | |
| Evolution of Neural Tangent Kernels under Benign and Adversarial Training | Unknown | N/A | |
| The least-control principle for local learning at equilibrium | Unknown | N/A | |
| Where2comm: Communication-Efficient Collaborative Perception via Spatial Confidence Maps | Unknown | N/A | |
| Thor: Wielding Hammers to Integrate Language Models and Automated Theorem Provers | Unknown | N/A | |
| PhysGNN: A Physics-Driven Graph Neural Network Based Model for Predicting Soft Tissue Deformation in Image-Guided Neurosurgery | Unknown | N/A | |
| Archimedes Meets Privacy: On Privately Estimating Quantiles in High Dimensions Under Minimal Assumptions | Unknown | N/A | |
| Better SGD using Second-order Momentum | Unknown | N/A | |
| Learning from Few Samples: Transformation-Invariant SVMs with Composition and Locality at Multiple Scales | Unknown | N/A | |
| DevFly: Bio-Inspired Development of Binary Connections for Locality Preserving Sparse Codes | Unknown | N/A | |
| Multi-agent Dynamic Algorithm Configuration | Unknown | N/A | |
| Predictive Coding beyond Gaussian Distributions | Unknown | N/A | |
| Jump Self-attention: Capturing High-order Statistics in Transformers | Unknown | N/A | |
| Invariance Learning in Deep Neural Networks with Differentiable Laplace Approximations | Unknown | N/A | |
| RISE: Robust Individualized Decision Learning with Sensitive Variables | Unknown | N/A | |
| Efficient and Stable Fully Dynamic Facility Location | Unknown | N/A | |
| Envy-free Policy Teaching to Multiple Agents | Unknown | N/A | |
| VaiPhy: a Variational Inference Based Algorithm for Phylogeny | Unknown | N/A | |
| Active Learning with Safety Constraints | Unknown | N/A | |
| Trustworthy Monte Carlo | Unknown | N/A | |
| Learning-Augmented Algorithms for Online Linear and Semidefinite Programming | Unknown | N/A | |
| Near-Optimal Correlation Clustering with Privacy | Unknown | N/A | |
| Neural Attentive Circuits | Unknown | N/A | |
| Intra-agent speech permits zero-shot task acquisition | Unknown | N/A | |
| MACK: Multimodal Aligned Conceptual Knowledge for Unpaired Image-text Matching | Unknown | N/A | |
| Robustness to Label Noise Depends on the Shape of the Noise Distribution | Unknown | N/A | |
| A Theoretical Study on Solving Continual Learning | Unknown | N/A | |
| Anytime-Valid Inference For Multinomial Count Data | Unknown | N/A | |
| Scalable and Efficient Non-adaptive Deterministic Group Testing | Unknown | N/A | |
| Hierarchical Agglomerative Graph Clustering in Poly-Logarithmic Depth | Unknown | N/A | |
| Variable-rate hierarchical CPC leads to acoustic unit discovery in speech | Unknown | N/A | |
| SoteriaFL: A Unified Framework for Private Federated Learning with Communication Compression | Unknown | N/A | |
| Contextual Dynamic Pricing with Unknown Noise: Explore-then-UCB Strategy and Improved Regrets | Unknown | N/A | |
| Distributed Online Convex Optimization with Compressed Communication | Unknown | N/A | |
| GlanceNets: Interpretable, Leak-proof Concept-based Models | Unknown | N/A | |
| BEER: Fast $O(1/T)$ Rate for Decentralized Nonconvex Optimization with Communication Compression | Unknown | N/A | |
| On the Effectiveness of Persistent Homology | Unknown | N/A | |
| The Effects of Regularization and Data Augmentation are Class Dependent | Unknown | N/A | |
| On the Stability and Scalability of Node Perturbation Learning | Unknown | N/A | |
| Trimmed Maximum Likelihood Estimation for Robust Generalized Linear Model | Unknown | N/A | |
| Benefits of Additive Noise in Composing Classes with Bounded Capacity | Unknown | N/A | |
| EZNAS: Evolving Zero-Cost Proxies For Neural Architecture Scoring | Unknown | N/A | |
| Proppo: a Message Passing Framework for Customizable and Composable Learning Algorithms | Unknown | N/A | |
| Towards a Unified Framework for Uncertainty-aware Nonlinear Variable Selection with Theoretical Guarantees | Unknown | N/A | |
| Tempo: Accelerating Transformer-Based Model Training through Memory Footprint Reduction | Unknown | N/A | |
| CS-Shapley: Class-wise Shapley Values for Data Valuation in Classification | Unknown | N/A | |
| A New Family of Generalization Bounds Using Samplewise Evaluated CMI | Unknown | N/A | |
| Learning to Reconstruct Missing Data from Spatiotemporal Graphs with Sparse Observations | Unknown | N/A | |
| On the Adversarial Robustness of Mixture of Experts | Unknown | N/A | |
| Graph Neural Networks are Dynamic Programmers | Unknown | N/A | |
| K-LITE: Learning Transferable Visual Models with External Knowledge | Unknown | N/A | |
| Mesoscopic modeling of hidden spiking neurons | Unknown | N/A | |
| Self-Supervised Learning Through Efference Copies | Unknown | N/A | |
| Self-Explaining Deviations for Coordination | Unknown | N/A | |
| Multi-Objective Deep Learning with Adaptive Reference Vectors | Unknown | N/A | |
| Overparameterization from Computational Constraints | Unknown | N/A | |
| AUTOMATA: Gradient Based Data Subset Selection for Compute-Efficient Hyper-parameter Tuning | Unknown | N/A | |
| Anchor-Changing Regularized Natural Policy Gradient for Multi-Objective Reinforcement Learning | Unknown | N/A | |
| On the Generalization Power of the Overfitted Three-Layer Neural Tangent Kernel Model | Unknown | N/A | |
| Provably Adversarially Robust Detection of Out-of-Distribution Data (Almost) for Free | Unknown | N/A | |
| Modular Flows: Differential Molecular Generation | Unknown | N/A | |
| Bridging Central and Local Differential Privacy in Data Acquisition Mechanisms | Unknown | N/A | |
| PAC Prediction Sets for Meta-Learning | Unknown | N/A | |
| Diffusion Models as Plug-and-Play Priors | Unknown | N/A | |
| MorphTE: Injecting Morphology in Tensorized Embeddings | Unknown | N/A | |
| Trajectory balance: Improved credit assignment in GFlowNets | Unknown | N/A | |
| On Convergence of FedProx: Local Dissimilarity Invariant Bounds, Non-smoothness and Beyond | Unknown | N/A | |
| Task-Free Continual Learning via Online Discrepancy Distance Learning | Unknown | N/A | |
| Improved Differential Privacy for SGD via Optimal Private Linear Operators on Adaptive Streams | Unknown | N/A | |
| Evaluation beyond Task Performance: Analyzing Concepts in AlphaZero in Hex | Unknown | N/A | |
| Benchopt: Reproducible, efficient and collaborative optimization benchmarks | Unknown | N/A | |
| RNNs of RNNs: Recursive Construction of Stable Assemblies of Recurrent Neural Networks | Unknown | N/A | |
| Nonparametric Uncertainty Quantification for Single Deterministic Neural Network | Unknown | N/A | |
| Controlled Sparsity via Constrained Optimization or: How I Learned to Stop Tuning Penalties and Love Constraints | Unknown | N/A | |
| Discovering and Overcoming Limitations of Noise-engineered Data-free Knowledge Distillation | Unknown | N/A | |
| Object Representations as Fixed Points: Training Iterative Refinement Algorithms with Implicit Differentiation | Unknown | N/A | |
| SQ Lower Bounds for Learning Single Neurons with Massart Noise | Unknown | N/A | |
| Meta-Reward-Net: Implicitly Differentiable Reward Learning for Preference-based Reinforcement Learning | Unknown | N/A | |
| Average Sensitivity of Euclidean k-Clustering | Unknown | N/A | |
| A theory of weight distribution-constrained learning | Unknown | N/A | |
| Data augmentation for efficient learning from parametric experts | Unknown | N/A | |
| Active Bayesian Causal Inference | Unknown | N/A | |
| Template based Graph Neural Network with Optimal Transport Distances | Unknown | N/A | |
| Outlier-Robust Sparse Estimation via Non-Convex Optimization | Unknown | N/A | |
| Toward Understanding Privileged Features Distillation in Learning-to-Rank | Unknown | N/A | |
| The Neural Covariance SDE: Shaped Infinite Depth-and-Width Networks at Initialization | Unknown | N/A | |
| FP8 Quantization: The Power of the Exponent | Unknown | N/A | |
| Maximizing Revenue under Market Shrinkage and Market Uncertainty | Unknown | N/A | |
| UnfoldML: Cost-Aware and Uncertainty-Based Dynamic 2D Prediction for Multi-Stage Classification | Unknown | N/A | |
| Structural Analysis of Branch-and-Cut and the Learnability of Gomory Mixed Integer Cuts | Unknown | N/A | |
| DOPE: Doubly Optimistic and Pessimistic Exploration for Safe Reinforcement Learning | Unknown | N/A | |
| Structure-Aware Image Segmentation with Homotopy Warping | Unknown | N/A | |
| Deep Learning Methods for Proximal Inference via Maximum Moment Restriction | Unknown | N/A | |
| On global convergence of ResNets: From finite to infinite width using linear parameterization | Unknown | N/A | |
| Residual Multiplicative Filter Networks for Multiscale Reconstruction | Unknown | N/A | |
| Reinforcement Learning with Non-Exponential Discounting | Unknown | N/A | |
| Towards Trustworthy Automatic Diagnosis Systems by Emulating Doctors' Reasoning with Deep Reinforcement Learning | Unknown | N/A | |
| On the symmetries of the synchronization problem in Cryo-EM: Multi-Frequency Vector Diffusion Maps on the Projective Plane | Unknown | N/A | |
| A Theoretical View on Sparsely Activated Networks | Unknown | N/A | |
| Pre-Train Your Loss: Easy Bayesian Transfer Learning with Informative Priors | Unknown | N/A | |
| Implications of Model Indeterminacy for Explanations of Automated Decisions | Unknown | N/A | |
| NOMAD: Nonlinear Manifold Decoders for Operator Learning | Unknown | N/A | |
| Characterizing the Ventral Visual Stream with Response-Optimized Neural Encoding Models | Unknown | N/A | |
| How Sampling Impacts the Robustness of Stochastic Neural Networks | Unknown | N/A | |
| Forward-Backward Latent State Inference for Hidden Continuous-Time semi-Markov Chains | Unknown | N/A | |
| Shape And Structure Preserving Differential Privacy | Unknown | N/A | |
| On the Effectiveness of Lipschitz-Driven Rehearsal in Continual Learning | Unknown | N/A | |
| Dynamic Pricing with Monotonicity Constraint under Unknown Parametric Demand Model | Unknown | N/A | |
| Cross-Linked Unified Embedding for cross-modality representation learning | Unknown | N/A | |
| Active Ranking without Strong Stochastic Transitivity | Unknown | N/A | |
| ProtoVAE: A Trustworthy Self-Explainable Prototypical Variational Model | Unknown | N/A | |
| The Mechanism of Prediction Head in Non-contrastive Self-supervised Learning | Unknown | N/A | |
| Task Discovery: Finding the Tasks that Neural Networks Generalize on | Unknown | N/A | |
| Chaotic Regularization and Heavy-Tailed Limits for Deterministic Gradient Descent | Unknown | N/A | |
| LOT: Layer-wise Orthogonal Training on Improving l2 Certified Robustness | Unknown | N/A | |
| Few-Shot Fast-Adaptive Anomaly Detection | Unknown | N/A | |
| Learning dynamics of deep linear networks with multiple pathways | Unknown | N/A | |
| Turbocharging Solution Concepts: Solving NEs, CEs and CCEs with Neural Equilibrium Solvers | Unknown | N/A | |
| Multi-fidelity Monte Carlo: a pseudo-marginal approach | Unknown | N/A | |
| Learning sparse features can lead to overfitting in neural networks | Unknown | N/A | |
| Pushing the limits of fairness impossibility: Who's the fairest of them all? | Unknown | N/A | |
| Neural Set Function Extensions: Learning with Discrete Functions in High Dimensions | Unknown | N/A | |
| Zonotope Domains for Lagrangian Neural Network Verification | Unknown | N/A | |
| Safety Guarantees for Neural Network Dynamic Systems via Stochastic Barrier Functions | Unknown | N/A | |
| Online Bipartite Matching with Advice: Tight Robustness-Consistency Tradeoffs for the Two-Stage Model | Unknown | N/A | |
| Improving Multi-Task Generalization via Regularizing Spurious Correlation | Unknown | N/A | |
| Operative dimensions in unconstrained connectivity of recurrent neural networks | Unknown | N/A | |
| Neural Differential Equations for Learning to Program Neural Nets Through Continuous Learning Rules | Unknown | N/A | |
| Collaborative Linear Bandits with Adversarial Agents: Near-Optimal Regret Bounds | Unknown | N/A | |
| Generating Training Data with Language Models: Towards Zero-Shot Language Understanding | Unknown | N/A | |
| Differentially Private Graph Learning via Sensitivity-Bounded Personalized PageRank | Unknown | N/A | |
| Towards Practical Few-shot Query Sets: Transductive Minimum Description Length Inference | Unknown | N/A | |
| Randomized Channel Shuffling: Minimal-Overhead Backdoor Attack Detection without Clean Datasets | Unknown | N/A | |
| MAgNet: Mesh Agnostic Neural PDE Solver | Unknown | N/A | |
| Online Learning and Pricing for Network Revenue Management with Reusable Resources | Unknown | N/A | |
| Learning Modular Simulations for Homogeneous Systems | Unknown | N/A | |
| Instability and Local Minima in GAN Training with Kernel Discriminators | Unknown | N/A | |
| On Computing Probabilistic Explanations for Decision Trees | Unknown | N/A | |
| Distributed Optimization for Overparameterized Problems: Achieving Optimal Dimension Independent Communication Complexity | Unknown | N/A | |
| Lost in Latent Space: Examining failures of disentangled models at combinatorial generalisation | Unknown | N/A | |
| When Combinatorial Thompson Sampling meets Approximation Regret | Unknown | N/A | |
| Test-Time Prompt Tuning for Zero-Shot Generalization in Vision-Language Models | Unknown | N/A | |
| Detecting Abrupt Changes in Sequential Pairwise Comparison Data | Unknown | N/A | |
| Sparse Fourier Backpropagation in Cryo-EM Reconstruction | Unknown | N/A | |
| When Does Differentially Private Learning Not Suffer in High Dimensions? | Unknown | N/A | |
| A Fast Scale-Invariant Algorithm for Non-negative Least Squares with Non-negative Data | Unknown | N/A | |
| (Optimal) Online Bipartite Matching with Degree Information | Unknown | N/A | |
| Learning from a Sample in Online Algorithms | Unknown | N/A | |
| Data-Driven Conditional Robust Optimization | Unknown | N/A | |
| Linear Label Ranking with Bounded Noise | Unknown | N/A | |
| Estimation of Entropy in Constant Space with Improved Sample Complexity | Unknown | N/A | |
| Escaping from the Barren Plateau via Gaussian Initializations in Deep Variational Quantum Circuits | Unknown | N/A | |
| Expected Frequency Matrices of Elections: Computation, Geometry, and Preference Learning | Unknown | N/A | |
| Robust Neural Posterior Estimation and Statistical Model Criticism | Unknown | N/A | |
| CryptoGCN: Fast and Scalable Homomorphically Encrypted Graph Convolutional Network Inference | Unknown | N/A | |
| The Missing Invariance Principle found -- the Reciprocal Twin of Invariant Risk Minimization | Unknown | N/A | |
| MABSplit: Faster Forest Training Using Multi-Armed Bandits | Unknown | N/A | |
| Marksman Backdoor: Backdoor Attacks with Arbitrary Target Class | Unknown | N/A | |
| Collaborative Learning of Discrete Distributions under Heterogeneity and Communication Constraints | Unknown | N/A | |
| Sample-Efficient Reinforcement Learning of Partially Observable Markov Games | Unknown | N/A | |
| Phase transitions in when feedback is useful | Unknown | N/A | |
| The Role of Baselines in Policy Gradient Optimization | Unknown | N/A | |
| Autoformalization with Large Language Models | Unknown | N/A | |
| Differentially Private Generalized Linear Models Revisited | Unknown | N/A | |
| Learning to Follow Instructions in Text-Based Games | Unknown | N/A | |
| On Learning and Refutation in Noninteractive Local Differential Privacy | Unknown | N/A | |
| Cryptographic Hardness of Learning Halfspaces with Massart Noise | Unknown | N/A | |
| Instance-optimal PAC Algorithms for Contextual Bandits | Unknown | N/A | |
| Do Current Multi-Task Optimization Methods in Deep Learning Even Help? | Unknown | N/A | |
| Prompt Certified Machine Unlearning with Randomized Gradient Smoothing and Quantization | Unknown | N/A | |
| Unsupervised Reinforcement Learning with Contrastive Intrinsic Control | Unknown | N/A | |
| Exact learning dynamics of deep linear networks with prior knowledge | Unknown | N/A | |
| Self-Supervised Contrastive Pre-Training For Time Series via Time-Frequency Consistency | Unknown | N/A | |
| Regret Bounds for Multilabel Classification in Sparse Label Regimes | Unknown | N/A | |
| Characterizing Datapoints via Second-Split Forgetting | Unknown | N/A | |
| Training language models to follow instructions with human feedback | Unknown | N/A | |
| S4ND: Modeling Images and Videos as Multidimensional Signals with State Spaces | Unknown | N/A | |
| Defining and Characterizing Reward Gaming | Unknown | N/A | |
| Adversarial training for high-stakes reliability | Unknown | N/A | |
| Semantic Probabilistic Layers for Neuro-Symbolic Learning | Unknown | N/A | |
| WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents | Unknown | N/A | |
| Maximizing and Satisficing in Multi-armed Bandits with Graph Information | Unknown | N/A | |
| Learning Probabilistic Models from Generator Latent Spaces with Hat EBM | Unknown | N/A | |
| Spherical Channels for Modeling Atomic Interactions | Unknown | N/A | |
| HyperTree Proof Search for Neural Theorem Proving | Unknown | N/A | |
| Exploring the Latent Space of Autoencoders with Interventional Assays | Unknown | N/A | |
| Root Cause Analysis of Failures in Microservices through Causal Discovery | Unknown | N/A | |
| Graphein - a Python Library for Geometric Deep Learning and Network Analysis on Biomolecular Structures and Interaction Networks | Unknown | N/A | |
| Finite-Sample Maximum Likelihood Estimation of Location | Unknown | N/A | |
| BayesPCN: A Continually Learnable Predictive Coding Associative Memory | Unknown | N/A | |
| On the detrimental effect of invariances in the likelihood for variational inference | Unknown | N/A | |
| Learning to Compare Nodes in Branch and Bound with Graph Neural Networks | Unknown | N/A | |
| Parameter-free Regret in High Probability with Heavy Tails | Unknown | N/A | |
| Multi-Game Decision Transformers | Unknown | N/A | |
| Structural Pruning via Latency-Saliency Knapsack | Unknown | N/A | |
| The Query Complexity of Cake Cutting | Unknown | N/A | |
| Best of Both Worlds Model Selection | Unknown | N/A | |
| Pitfalls of Epistemic Uncertainty Quantification through Loss Minimisation | Unknown | N/A | |
| New Definitions and Evaluations for Saliency Methods: Staying Intrinsic, Complete and Sound | Unknown | N/A | |
| Memory safe computations with XLA compiler | Unknown | N/A | |
| Learning NP-Hard Multi-Agent Assignment Planning using GNN: Inference on a Random Graph and Provable Auction-Fitted Q-learning | Unknown | N/A | |
| Fairness in Federated Learning via Core-Stability | Unknown | N/A | |
| Accelerating Certified Robustness Training via Knowledge Transfer | Unknown | N/A | |
| Certifying Some Distributional Fairness with Subpopulation Decomposition | Unknown | N/A | |
| A Few Expert Queries Suffices for Sample-Efficient RL with Resets and Linear Value Approximation | Unknown | N/A | |
| Sublinear Algorithms for Hierarchical Clustering | Unknown | N/A | |
| A Deep Reinforcement Learning Framework for Column Generation | Unknown | N/A | |
| Logical Activation Functions: Logit-space equivalents of Probabilistic Boolean Operators | Unknown | N/A | |
| EAGER: Asking and Answering Questions for Automatic Reward Shaping in Language-guided RL | Unknown | N/A | |
| End-to-end Stochastic Optimization with Energy-based Model | Unknown | N/A | |
| ReCo: Retrieve and Co-segment for Zero-shot Transfer | Unknown | N/A | |
| Human-Robotic Prosthesis as Collaborating Agents for Symmetrical Walking | Unknown | N/A | |
| Adaptive Interest for Emphatic Reinforcement Learning | Unknown | N/A | |
| Chaotic Dynamics are Intrinsic to Neural Network Training with SGD | Unknown | N/A | |
| Local Bayesian optimization via maximizing probability of descent | Unknown | N/A | |
| Learning the Structure of Large Networked Systems Obeying Conservation Laws | Unknown | N/A | |
| Near-Optimal No-Regret Learning Dynamics for General Convex Games | Unknown | N/A | |
| The Impact of Task Underspecification in Evaluating Deep Reinforcement Learning | Unknown | N/A | |
| A Practical, Progressively-Expressive GNN | Unknown | N/A | |
| ELIGN: Expectation Alignment as a Multi-Agent Intrinsic Reward | Unknown | N/A | |
| Provably tuning the ElasticNet across instances | Unknown | N/A | |
| Fast Neural Kernel Embeddings for General Activations | Unknown | N/A | |
| Evaluating Latent Space Robustness and Uncertainty of EEG-ML Models under Realistic Distribution Shifts | Unknown | N/A | |
| Simple and Optimal Greedy Online Contention Resolution Schemes | Unknown | N/A | |
| Modeling Transitivity and Cyclicity in Directed Graphs via Binary Code Box Embeddings | Unknown | N/A | |
| Planning to the Information Horizon of BAMDPs via Epistemic State Abstraction | Unknown | N/A | |
| Decoupled Context Processing for Context Augmented Language Modeling | Unknown | N/A | |
| Efficiency Ordering of Stochastic Gradient Descent | Unknown | N/A | |
| Robust Streaming PCA | Unknown | N/A | |
| Learning Partial Equivariances From Data | Unknown | N/A | |
| [Re] Lifting 2D StyleGAN for 3D-Aware Face Generation | Unknown | N/A | |
| FiLM: Frequency improved Legendre Memory Model for Long-term Time Series Forecasting | Unknown | N/A | |
| Unsupervised Causal Generative Understanding of Images | Unknown | N/A | |
| Curriculum Reinforcement Learning using Optimal Transport via Gradual Domain Adaptation | Unknown | N/A | |
| Fair Ranking with Noisy Protected Attributes | Unknown | N/A | |
| Independence Testing-Based Approach to Causal Discovery under Measurement Error and Linear Non-Gaussian Models | Unknown | N/A | |
| Zero-Sum Stochastic Stackelberg Games | Unknown | N/A | |
| Can Hybrid Geometric Scattering Networks Help Solve the Maximum Clique Problem? | Unknown | N/A | |
| NeurOLight: A Physics-Agnostic Neural Operator Enabling Parametric Photonic Device Simulation | Unknown | N/A | |
| Mining Multi-Label Samples from Single Positive Labels | Unknown | N/A | |
| Why So Pessimistic? Estimating Uncertainties for Offline RL through Ensembles, and Why Their Independence Matters | Unknown | N/A | |
| Efficient Phi-Regret Minimization in Extensive-Form Games via Online Mirror Descent | Unknown | N/A | |
| Exponential Family Model-Based Reinforcement Learning via Score Matching | Unknown | N/A | |
| Object Scene Representation Transformer | Unknown | N/A | |
| Geometric Order Learning for Rank Estimation | Unknown | N/A | |
| Learning with convolution and pooling operations in kernel methods | Unknown | N/A | |
| Dataset Distillation using Neural Feature Regression | Unknown | N/A | |
| Influencing Long-Term Behavior in Multiagent Reinforcement Learning | Unknown | N/A | |
| Model-based Safe Deep Reinforcement Learning via a Constrained Proximal Policy Optimization Algorithm | Unknown | N/A | |
| Unifying and Boosting Gradient-Based Training-Free Neural Architecture Search | Unknown | N/A | |
| Learning Contrastive Embedding in Low-Dimensional Space | Unknown | N/A | |
| Exploring Example Influence in Continual Learning | Unknown | N/A | |
| JAWS: Auditing Predictive Uncertainty Under Covariate Shift | Unknown | N/A | |
| One for All: Simultaneous Metric and Preference Learning over Multiple Users | Unknown | N/A | |
| Paraphrasing Is All You Need for Novel Object Captioning | Unknown | N/A | |
| Augmentations in Hypergraph Contrastive Learning: Fabricated and Generative | Unknown | N/A | |
| Multiview Human Body Reconstruction from Uncalibrated Cameras | Unknown | N/A | |
| FairVFL: A Fair Vertical Federated Learning Framework with Contrastive Adversarial Learning | Unknown | N/A | |
| Empirical Gateaux Derivatives for Causal Inference | Unknown | N/A | |
| AgraSSt: Approximate Graph Stein Statistics for Interpretable Assessment of Implicit Graph Generators | Unknown | N/A | |
| Benefits of Permutation-Equivariance in Auction Mechanisms | Unknown | N/A | |
| Learning Active Camera for Multi-Object Navigation | Unknown | N/A | |
| Toward Efficient Robust Training against Union of $\ell_p$ Threat Models | Unknown | N/A | |
| Mask Matching Transformer for Few-Shot Segmentation | Unknown | N/A | |
| A Unified Hard-Constraint Framework for Solving Geometrically Complex PDEs | Unknown | N/A | |
| Symplectic Spectrum Gaussian Processes: Learning Hamiltonians from Noisy and Sparse Data | Unknown | N/A | |
| GREED: A Neural Framework for Learning Graph Distance Functions | Unknown | N/A | |
| Understanding Cross-Domain Few-Shot Learning Based on Domain Similarity and Few-Shot Difficulty | Unknown | N/A | |
| Consistent Sufficient Explanations and Minimal Local Rules for explaining the decision of any classifier or regressor | Unknown | N/A | |
| DaDA: Distortion-aware Domain Adaptation for Unsupervised Semantic Segmentation | Unknown | N/A | |
| Learning Optical Flow from Continuous Spike Streams | Unknown | N/A | |
| Retrospective Adversarial Replay for Continual Learning | Unknown | N/A | |
| Adversarial Auto-Augment with Label Preservation: A Representation Learning Principle Guided Approach | Unknown | N/A | |
| On Feature Learning in the Presence of Spurious Correlations | Unknown | N/A | |
| Explaining Preferences with Shapley Values | Unknown | N/A | |
| Privacy Induces Robustness: Information-Computation Gaps and Sparse Mean Estimation | Unknown | N/A | |
| ReFactor GNNs: Revisiting Factorisation-based Models from a Message-Passing Perspective | Unknown | N/A | |
| Block-Recurrent Transformers | Unknown | N/A | |
| Hamiltonian Latent Operators for content and motion disentanglement in image sequences | Unknown | N/A | |
| Learning (Very) Simple Generative Models Is Hard | Unknown | N/A | |
| Understanding the Generalization Benefit of Normalization Layers: Sharpness Reduction | Unknown | N/A | |
| A Non-asymptotic Analysis of Non-parametric Temporal-Difference Learning | Unknown | N/A | |
| Rethinking Generalization in Few-Shot Classification | Unknown | N/A | |
| VectorAdam for Rotation Equivariant Geometry Optimization | Unknown | N/A | |
| Keypoint-Guided Optimal Transport with Applications in Heterogeneous Domain Adaptation | Unknown | N/A | |
| Supervising the Multi-Fidelity Race of Hyperparameter Configurations | Unknown | N/A | |
| Trajectory of Mini-Batch Momentum: Batch Size Saturation and Convergence in High Dimensions | Unknown | N/A | |
| Single Model Uncertainty Estimation via Stochastic Data Centering | Unknown | N/A | |
| CogView2: Faster and Better Text-to-Image Generation via Hierarchical Transformers | Unknown | N/A | |
| Graph Neural Networks with Adaptive Readouts | Unknown | N/A | |
| Adaptive Distribution Calibration for Few-Shot Learning with Hierarchical Optimal Transport | Unknown | N/A | |
| Towards Reliable Simulation-Based Inference with Balanced Neural Ratio Estimation | Unknown | N/A | |
| Beyond Mahalanobis Distance for Textual OOD Detection | Unknown | N/A | |
| Tensor Program Optimization with Probabilistic Programs | Unknown | N/A | |
| VICE: Variational Interpretable Concept Embeddings | Unknown | N/A | |
| Learning single-index models with shallow neural networks | Unknown | N/A | |
| Near-Optimal Randomized Exploration for Tabular Markov Decision Processes | Unknown | N/A | |
| Understanding Non-linearity in Graph Neural Networks from the Bayesian-Inference Perspective | Unknown | N/A | |
| LOG: Active Model Adaptation for Label-Efficient OOD Generalization | Unknown | N/A | |
| Structural Knowledge Distillation for Object Detection | Unknown | N/A | |
| Semantic uncertainty intervals for disentangled latent spaces | Unknown | N/A | |
| Uni[MASK]: Unified Inference in Sequential Decision Problems | Unknown | N/A | |
| Bayesian Optimistic Optimization: Optimistic Exploration for Model-based Reinforcement Learning | Unknown | N/A | |
| Invertible Monotone Operators for Normalizing Flows | Unknown | N/A | |
| A Transformer-Based Object Detector with Coarse-Fine Crossing Representations | Unknown | N/A | |
| Distinguishing Learning Rules with Brain Machine Interfaces | Unknown | N/A | |
| Asynchronous Actor-Critic for Multi-Agent Reinforcement Learning | Unknown | N/A | |
| Expected Improvement for Contextual Bandits | Unknown | N/A | |
| BEVFusion: A Simple and Robust LiDAR-Camera Fusion Framework | Unknown | N/A | |
| Graph Neural Network Bandits | Unknown | N/A | |
| Mean Estimation with User-level Privacy under Data Heterogeneity | Unknown | N/A | |
| Precise Regret Bounds for Log-loss via a Truncated Bayesian Algorithm | Unknown | N/A | |
| ViewFool: Evaluating the Robustness of Visual Recognition to Adversarial Viewpoints | Unknown | N/A | |
| LobsDICE: Offline Learning from Observation via Stationary Distribution Correction Estimation | Unknown | N/A | |
| 360-MLC: Multi-view Layout Consistency for Self-training and Hyper-parameter Tuning | Unknown | N/A | |
| Reduced Representation of Deformation Fields for Effective Non-rigid Shape Matching | Unknown | N/A | |
| Explain My Surprise: Learning Efficient Long-Term Memory by predicting uncertain outcomes | Unknown | N/A | |
| A Simple and Optimal Policy Design for Online Learning with Safety against Heavy-tailed Risk | Unknown | N/A | |
| Policy Optimization with Linear Temporal Logic Constraints | Unknown | N/A | |
| Scaling & Shifting Your Features: A New Baseline for Efficient Model Tuning | Unknown | N/A | |
| Decomposed Knowledge Distillation for Class-Incremental Semantic Segmentation | Unknown | N/A | |
| On the Theoretical Properties of Noise Correlation in Stochastic Optimization | Unknown | N/A | |
| NCP: Neural Correspondence Prior for Effective Unsupervised Shape Matching | Unknown | N/A | |
| ComENet: Towards Complete and Efficient Message Passing for 3D Molecular Graphs | Unknown | N/A | |
| On Divergence Measures for Bayesian Pseudocoresets | Unknown | N/A | |
| Alleviating the Sample Selection Bias in Few-shot Learning by Removing Projection to the Centroid | Unknown | N/A | |
| Can Push-forward Generative Models Fit Multimodal Distributions? | Unknown | N/A | |
| Posterior and Computational Uncertainty in Gaussian Processes | Unknown | N/A | |
| MORA: Improving Ensemble Robustness Evaluation with Model Reweighing Attack | Unknown | N/A | |
| Learning to Sample and Aggregate: Few-shot Reasoning over Temporal Knowledge Graphs | Unknown | N/A | |
| Advancing Model Pruning via Bi-level Optimization | Unknown | N/A | |
| An Algorithm for Learning Switched Linear Dynamics from Data | Unknown | N/A | |
| Batch Bayesian optimisation via density-ratio estimation with guarantees | Unknown | N/A | |
| An Adaptive Kernel Approach to Federated Learning of Heterogeneous Causal Effects | Unknown | N/A | |
| Structuring Uncertainty for Fine-Grained Sampling in Stochastic Segmentation Networks | Unknown | N/A | |
| Multi-objective Deep Data Generation with Correlated Property Control | Unknown | N/A | |
| Expectation-Maximization Contrastive Learning for Compact Video-and-Language Representations | Unknown | N/A | |
| Learning Debiased Classifier with Biased Committee | Unknown | N/A | |
| Surprising Instabilities in Training Deep Networks and a Theoretical Analysis | Unknown | N/A | |
| Capturing Failures of Large Language Models via Human Cognitive Biases | Unknown | N/A | |
| Nonnegative Tensor Completion via Integer Optimization | Unknown | N/A | |
| Equivariant Networks for Crystal Structures | Unknown | N/A | |
| LieGG: Studying Learned Lie Group Generators | Unknown | N/A | |
| On-Demand Sampling: Learning Optimally from Multiple Distributions | Unknown | N/A | |
| A Communication-efficient Algorithm with Linear Convergence for Federated Minimax Learning | Unknown | N/A | |
| Robust Model Selection and Nearly-Proper Learning for GMMs | Unknown | N/A | |
| Combining Explicit and Implicit Regularization for Efficient Learning in Deep Networks | Unknown | N/A | |
| A Unified Framework for Alternating Offline Model Training and Policy Learning | Unknown | N/A | |
| Automatic Differentiation of Programs with Discrete Randomness | Unknown | N/A | |
| Thinned random measures for sparse graphs with overlapping communities | Unknown | N/A | |
| Hybrid Neural Autoencoders for Stimulus Encoding in Visual and Other Sensory Neuroprostheses | Unknown | N/A | |
| Tight Analysis of Extra-gradient and Optimistic Gradient Methods For Nonconvex Minimax Problems | Unknown | N/A | |
| Amortized Inference for Causal Structure Learning | Unknown | N/A | |
| Staircase Attention for Recurrent Processing of Sequences | Unknown | N/A | |
| A Multi-Resolution Framework for U-Nets with Applications to Hierarchical VAEs | Unknown | N/A | |
| Concept Embedding Models: Beyond the Accuracy-Explainability Trade-Off | Unknown | N/A | |
| Biologically plausible solutions for spiking networks with efficient coding | Unknown | N/A | |
| Algorithms with Prediction Portfolios | Unknown | N/A | |
| SAGDA: Achieving $\mathcal{O}(\epsilon^{-2})$ Communication Complexity in Federated Min-Max Learning | Unknown | N/A | |
| Deep Compression of Pre-trained Transformer Models | Unknown | N/A | |
| Beyond neural scaling laws: beating power law scaling via data pruning | Unknown | N/A | |
| Subgroup Robustness Grows On Trees: An Empirical Baseline Investigation | Unknown | N/A | |
| Learning Optimal Flows for Non-Equilibrium Importance Sampling | Unknown | N/A | |
| Incentivizing Combinatorial Bandit Exploration | Unknown | N/A | |
| A Simple Decentralized Cross-Entropy Method | Unknown | N/A | |
| Neural Abstractions | Unknown | N/A | |
| Learning Dense Object Descriptors from Multiple Views for Low-shot Category Generalization | Unknown | N/A | |
| Flowification: Everything is a normalizing flow | Unknown | N/A | |
| Non-monotonic Resource Utilization in the Bandits with Knapsacks Problem | Unknown | N/A | |
| Evaluating Robustness to Dataset Shift via Parametric Robustness Sets | Unknown | N/A | |
| Generative Time Series Forecasting with Diffusion, Denoise, and Disentanglement | Unknown | N/A | |
| Private and Communication-Efficient Algorithms for Entropy Estimation | Unknown | N/A | |
| Kernel Multimodal Continuous Attention | Unknown | N/A | |
| Stars: Tera-Scale Graph Building for Clustering and Learning | Unknown | N/A | |
| Anonymous Bandits for Multi-User Systems | Unknown | N/A | |
| Understanding Deep Contrastive Learning via Coordinate-wise Optimization | Unknown | N/A | |
| Stochastic Halpern Iteration with Variance Reduction for Stochastic Monotone Inclusions | Unknown | N/A | |
| PALMER: Perception - Action Loop with Memory for Long-Horizon Planning | Unknown | N/A | |
| Curious Exploration via Structured World Models Yields Zero-Shot Object Manipulation | Unknown | N/A | |
| Finite Sample Analysis Of Dynamic Regression Parameter Learning | Unknown | N/A | |
| Adaptive Stochastic Variance Reduction for Non-convex Finite-Sum Minimization | Unknown | N/A | |
| Improved Coresets for Euclidean $k$-Means | Unknown | N/A | |
| Data-Efficient Pipeline for Offline Reinforcement Learning with Limited Data | Unknown | N/A | |
| Distributed Influence-Augmented Local Simulators for Parallel MARL in Large Networked Systems | Unknown | N/A | |
| Neural Temporal Walks: Motif-Aware Representation Learning on Continuous-Time Dynamic Graphs | Unknown | N/A | |
| Grounded Video Situation Recognition | Unknown | N/A | |
| Learning to Scaffold: Optimizing Model Explanations for Teaching | Unknown | N/A | |
| Public Wisdom Matters! Discourse-Aware Hyperbolic Fourier Co-Attention for Social Text Classification | Unknown | N/A | |
| Efficient Methods for Non-stationary Online Learning | Unknown | N/A | |
| Sustainable Online Reinforcement Learning for Auto-bidding | Unknown | N/A | |
| Effectiveness of Vision Transformer for Fast and Accurate Single-Stage Pedestrian Detection | Unknown | N/A | |
| On Analyzing Generative and Denoising Capabilities of Diffusion-based Deep Generative Models | Unknown | N/A | |
| Isometric 3D Adversarial Examples in the Physical World | Unknown | N/A | |
| The Hessian Screening Rule | Unknown | N/A | |
| Measuring Data Reconstruction Defenses in Collaborative Inference Systems | Unknown | N/A | |
| A Stochastic Linearized Augmented Lagrangian Method for Decentralized Bilevel Optimization | Unknown | N/A | |
| Kernel Memory Networks: A Unifying Framework for Memory Modeling | Unknown | N/A | |
| A Neural Pre-Conditioning Active Learning Algorithm to Reduce Label Complexity | Unknown | N/A | |
| Flexible Diffusion Modeling of Long Videos | Unknown | N/A | |
| Learning Structure from the Ground up---Hierarchical Representation Learning by Chunking | Unknown | N/A | |
| Meta-Complementing the Semantics of Short Texts in Neural Topic Models | Unknown | N/A | |
| Robust Feature-Level Adversaries are Interpretability Tools | Unknown | N/A | |
| Knowledge-Aware Bayesian Deep Topic Model | Unknown | N/A | |
| GStarX: Explaining Graph Neural Networks with Structure-Aware Cooperative Games | Unknown | N/A | |
| Quantum Algorithms for Sampling Log-Concave Distributions and Estimating Normalizing Constants | Unknown | N/A | |
| Nearly Optimal Best-of-Both-Worlds Algorithms for Online Learning with Feedback Graphs | Unknown | N/A | |
| FourierNets enable the design of highly non-local optical encoders for computational imaging | Unknown | N/A | |
| TVLT: Textless Vision-Language Transformer | Unknown | N/A | |
| No Free Lunch from Deep Learning in Neuroscience: A Case Study through Models of the Entorhinal-Hippocampal Circuit | Unknown | N/A | |
| Retaining Knowledge for Learning with Dynamic Definition | Unknown | N/A | |
| XTC: Extreme Compression for Pre-trained Transformers Made Simple and Efficient | Unknown | N/A | |
| PAC: Assisted Value Factorization with Counterfactual Predictions in Multi-Agent Reinforcement Learning | Unknown | N/A | |
| Meta-DMoE: Adapting to Domain Shift by Meta-Distillation from Mixture-of-Experts | Unknown | N/A | |
| Compositional Generalization in Unsupervised Compositional Representation Learning: A Study on Disentanglement and Emergent Language | Unknown | N/A | |
| Fairness without Demographics through Knowledge Distillation | Unknown | N/A | |
| Deep Bidirectional Language-Knowledge Graph Pretraining | Unknown | N/A | |
| Rethinking Value Function Learning for Generalization in Reinforcement Learning | Unknown | N/A | |
| Instance-Dependent Near-Optimal Policy Identification in Linear MDPs via Online Experiment Design | Unknown | N/A | |
| Exposing and Exploiting Fine-Grained Block Structures for Fast and Accurate Sparse Training | Unknown | N/A | |
| Parameters or Privacy: A Provable Tradeoff Between Overparameterization and Membership Inference | Unknown | N/A | |
| Efficient Dataset Distillation using Random Feature Approximation | Unknown | N/A | |
| Locally Hierarchical Auto-Regressive Modeling for Image Generation | Unknown | N/A | |
| Interaction-Grounded Learning with Action-Inclusive Feedback | Unknown | N/A | |
| AdaFocal: Calibration-aware Adaptive Focal Loss | Unknown | N/A | |
| Convergence for score-based generative modeling with polynomial complexity | Unknown | N/A | |
| Toward Robust Spiking Neural Network Against Adversarial Perturbation | Unknown | N/A | |
| Layer Freezing & Data Sieving: Missing Pieces of a Generic Framework for Sparse Training | Unknown | N/A | |
| Efficiently Computing Local Lipschitz Constants of Neural Networks via Bound Propagation | Unknown | N/A | |
| $\alpha$-ReQ: Assessing Representation Quality in Self-Supervised Learning by measuring eigenspectrum decay | Unknown | N/A | |
| Bounded-Regret MPC via Perturbation Analysis: Prediction Error, Constraints, and Nonlinearity | Unknown | N/A | |
| NaturalProver: Grounded Mathematical Proof Generation with Language Models | Unknown | N/A | |
| Predictive Querying for Autoregressive Neural Sequence Models | Unknown | N/A | |
| Differentially Private Linear Sketches: Efficient Implementations and Applications | Unknown | N/A | |
| Probable Domain Generalization via Quantile Risk Minimization | Unknown | N/A | |
| Embed and Emulate: Learning to estimate parameters of dynamical systems with uncertainty quantification | Unknown | N/A | |
| Minimax Optimal Online Imitation Learning via Replay Estimation | Unknown | N/A | |
| Subspace Recovery from Heterogeneous Data with Non-isotropic Noise | Unknown | N/A | |
| Transferring Fairness under Distribution Shifts via Fair Consistency Regularization | Unknown | N/A | |
| Exploring the Whole Rashomon Set of Sparse Decision Trees | Unknown | N/A | |
| On Image Segmentation With Noisy Labels: Characterization and Volume Properties of the Optimal Solutions to Accuracy and Dice | Unknown | N/A | |
| AutoML Two-Sample Test | Unknown | N/A | |
| Efficient Scheduling of Data Augmentation for Deep Reinforcement Learning | Unknown | N/A | |
| Which Explanation Should I Choose? A Function Approximation Perspective to Characterizing Post Hoc Explanations | Unknown | N/A | |
| Sampling from Log-Concave Distributions with Infinity-Distance Guarantees | Unknown | N/A | |
| Decentralized Gossip-Based Stochastic Bilevel Optimization over Communication Networks | Unknown | N/A | |
| Distributional Reinforcement Learning for Risk-Sensitive Policies | Unknown | N/A | |
| Causal Discovery in Heterogeneous Environments Under the Sparse Mechanism Shift Hypothesis | Unknown | N/A | |
| Promising or Elusive? Unsupervised Object Segmentation from Real-world Single Images | Unknown | N/A | |
| Unsupervised Domain Adaptation for Semantic Segmentation using Depth Distribution | Unknown | N/A | |
| Data-Efficient Structured Pruning via Submodular Optimization | Unknown | N/A | |
| Structured Energy Network As a Loss | Unknown | N/A | |
| Iso-Dream: Isolating and Leveraging Noncontrollable Visual Dynamics in World Models | Unknown | N/A | |
| Improving Barely Supervised Learning by Discriminating Unlabeled Samples with Super-Class | Unknown | N/A | |
| Rethinking and Improving Robustness of Convolutional Neural Networks: a Shapley Value-based Approach in Frequency Domain | Unknown | N/A | |
| Exploring evolution-aware & -free protein language models as protein function predictors | Unknown | N/A | |
| Boosting the Performance of Generic Deep Neural Network Frameworks with Log-supermodular CRFs | Unknown | N/A | |
| On the Tradeoff Between Robustness and Fairness | Unknown | N/A | |
| Learning to Reason with Neural Networks: Generalization, Unseen Data and Boolean Measures | Unknown | N/A | |
| Causality-driven Hierarchical Structure Discovery for Reinforcement Learning | Unknown | N/A | |
| Are AlphaZero-like Agents Robust to Adversarial Perturbations? | Unknown | N/A | |
| Thinking Outside the Ball: Optimal Learning with Gradient Descent for Generalized Linear Stochastic Convex Optimization | Unknown | N/A | |
| Pluralistic Image Completion with Gaussian Mixture Models | Unknown | N/A | |
| Generalization Analysis on Learning with a Concurrent Verifier | Unknown | N/A | |
| Receding Horizon Inverse Reinforcement Learning | Unknown | N/A | |
| Learning to Share in Networked Multi-Agent Reinforcement Learning | Unknown | N/A | |
| FIRE: Semantic Field of Words Represented as Non-Linear Functions | Unknown | N/A | |
| Perceptual Attacks of No-Reference Image Quality Models with Human-in-the-Loop | Unknown | N/A | |
| ProtoX: Explaining a Reinforcement Learning Agent via Prototyping | Unknown | N/A | |
| Pyramid Attention For Source Code Summarization | Unknown | N/A | |
| Taming Fat-Tailed (“Heavier-Tailed” with Potentially Infinite Variance) Noise in Federated Learning | Unknown | N/A | |
| Maximum a posteriori natural scene reconstruction from retinal ganglion cells with deep denoiser priors | Unknown | N/A | |
| DigGAN: Discriminator gradIent Gap Regularization for GAN Training with Limited Data | Unknown | N/A | |
| DNA: Proximal Policy Optimization with a Dual Network Architecture | Unknown | N/A | |
| Will Bilevel Optimizers Benefit from Loops | Unknown | N/A | |
| Micro and Macro Level Graph Modeling for Graph Variational Auto-Encoders | Unknown | N/A | |
| Redeeming intrinsic rewards via constrained optimization | Unknown | N/A | |
| Target alignment in truncated kernel ridge regression | Unknown | N/A | |
| Queue Up Your Regrets: Achieving the Dynamic Capacity Region of Multiplayer Bandits | Unknown | N/A | |
| Mismatched No More: Joint Model-Policy Optimization for Model-Based RL | Unknown | N/A | |
| Dynamic Sparse Network for Time Series Classification: Learning What to “See” | Unknown | N/A | |
| Stochastic Second-Order Methods Improve Best-Known Sample Complexity of SGD for Gradient-Dominated Functions | Unknown | N/A | |
| Constrained Predictive Coding as a Biologically Plausible Model of the Cortical Hierarchy | Unknown | N/A | |
| Perturbation Learning Based Anomaly Detection | Unknown | N/A | |
| Hierarchical Graph Transformer with Adaptive Node Sampling | Unknown | N/A | |
| LogiGAN: Learning Logical Reasoning via Adversarial Pre-training | Unknown | N/A | |
| Learning Causally Invariant Representations for Out-of-Distribution Generalization on Graphs | Unknown | N/A | |
| Structure-Preserving 3D Garment Modeling with Neural Sewing Machines | Unknown | N/A | |
| Improved Bounds on Neural Complexity for Representing Piecewise Linear Functions | Unknown | N/A | |
| On the Limitations of Stochastic Pre-processing Defenses | Unknown | N/A | |
| ResQ: A Residual Q Function-based Approach for Multi-Agent Reinforcement Learning Value Factorization | Unknown | N/A | |
| Global Convergence and Stability of Stochastic Gradient Descent | Unknown | N/A | |
| Delving into Out-of-Distribution Detection with Vision-Language Representations | Unknown | N/A | |
| Recruitment Strategies That Take a Chance | Unknown | N/A | |
| Inference and Sampling for Archimax Copulas | Unknown | N/A | |
| Text Classification with Born's Rule | Unknown | N/A | |
| Cluster and Aggregate: Face Recognition with Large Probe Set | Unknown | N/A | |
| VTC-LFC: Vision Transformer Compression with Low-Frequency Components | Unknown | N/A | |
| Lipschitz Bandits with Batched Feedback | Unknown | N/A | |
| Formulating Robustness Against Unforeseen Attacks | Unknown | N/A | |
| Randomized Message-Interception Smoothing: Gray-box Certificates for Graph Neural Networks | Unknown | N/A | |
| Subspace clustering in high-dimensions: Phase transitions & Statistical-to-Computational gap | Unknown | N/A | |
| CageNeRF: Cage-based Neural Radiance Field for Generalized 3D Deformation and Animation | Unknown | N/A | |
| Sparse2Dense: Learning to Densify 3D Features for 3D Object Detection | Unknown | N/A | |
| Non-Gaussian Tensor Programs | Unknown | N/A | |
| Understanding the Eluder Dimension | Unknown | N/A | |
| Semi-supervised Semantic Segmentation with Prototype-based Consistency Regularization | Unknown | N/A | |
| Local Linear Convergence of Gradient Methods for Subspace Optimization via Strict Complementarity | Unknown | N/A | |
| Simulation-guided Beam Search for Neural Combinatorial Optimization | Unknown | N/A | |
| Quo Vadis: Is Trajectory Forecasting the Key Towards Long-Term Multi-Object Tracking? | Unknown | N/A | |
| Meta-Reinforcement Learning with Self-Modifying Networks | Unknown | N/A | |
| Respecting Transfer Gap in Knowledge Distillation | Unknown | N/A | |
| What is Where by Looking: Weakly-Supervised Open-World Phrase-Grounding without Text Inputs | Unknown | N/A | |
| TarGF: Learning Target Gradient Field to Rearrange Objects without Explicit Goal Specification | Unknown | N/A | |
| One-Inlier is First: Towards Efficient Position Encoding for Point Cloud Registration | Unknown | N/A | |
| I2DFormer: Learning Image to Document Attention for Zero-Shot Image Classification | Unknown | N/A | |
| Sharing Knowledge for Meta-learning with Feature Descriptions | Unknown | N/A | |
| Large-batch Optimization for Dense Visual Predictions: Training Faster R-CNN in 4.2 Minutes | Unknown | N/A | |
| Continual Learning with Evolving Class Ontologies | Unknown | N/A | |
| Quasi-Newton Methods for Saddle Point Problems | Unknown | N/A | |
| TANGO: Text-driven Photorealistic and Robust 3D Stylization via Lighting Decomposition | Unknown | N/A | |
| Asymptotic Properties for Bayesian Neural Network in Besov Space | Unknown | N/A | |
| Planning for Sample Efficient Imitation Learning | Unknown | N/A | |
| Peripheral Vision Transformer | Unknown | N/A | |
| Multi-block-Single-probe Variance Reduced Estimator for Coupled Compositional Optimization | Unknown | N/A | |
| HSDF: Hybrid Sign and Distance Field for Modeling Surfaces with Arbitrary Topologies | Unknown | N/A | |
| Approximate Secular Equations for the Cubic Regularization Subproblem | Unknown | N/A | |
| Faster Stochastic Algorithms for Minimax Optimization under Polyak-Łojasiewicz Condition | Unknown | N/A | |
| Unsupervised Learning of Equivariant Structure from Sequences | Unknown | N/A | |
| Inception Transformer | Unknown | N/A | |
| Signal Recovery with Non-Expansive Generative Network Priors | Unknown | N/A | |
| Counterfactual harm | Unknown | N/A | |
| Posterior Collapse of a Linear Latent Variable Model | Unknown | N/A | |
| Harmonizing the object recognition strategies of deep neural networks with humans | Unknown | N/A | |
| When to Make Exceptions: Exploring Language Models as Accounts of Human Moral Judgment | Unknown | N/A | |
| Exploit Reward Shifting in Value-Based Deep-RL: Optimistic Curiosity-Based Exploration and Conservative Exploitation via Linear Reward Shaping | Unknown | N/A | |
| Model-Based Imitation Learning for Urban Driving | Unknown | N/A | |
| OnePose++: Keypoint-Free One-Shot Object Pose Estimation without CAD Models | Unknown | N/A | |
| ELIAS: End-to-End Learning to Index and Search in Large Output Spaces | Unknown | N/A | |
| QUARK: Controllable Text Generation with Reinforced Unlearning | Unknown | N/A | |
| Self-Supervised Aggregation of Diverse Experts for Test-Agnostic Long-Tailed Recognition | Unknown | N/A | |
| Finding Differences Between Transformers and ConvNets Using Counterfactual Simulation Testing | Unknown | N/A | |
| Anticipating Performativity by Predicting from Predictions | Unknown | N/A | |
| Fast Vision Transformers with HiLo Attention | Unknown | N/A | |
| OpenAUC: Towards AUC-Oriented Open-Set Recognition | Unknown | N/A | |
| Exploring the Algorithm-Dependent Generalization of AUPRC Optimization with List Stability | Unknown | N/A | |
| Efficient Spatially Sparse Inference for Conditional GANs and Diffusion Models | Unknown | N/A | |
| Differentiable Analog Quantum Computing for Optimization and Control | Unknown | N/A | |
| Losses Can Be Blessings: Routing Self-Supervised Speech Representations Towards Efficient Multilingual and Multitask Speech Processing | Unknown | N/A | |
| Monte Carlo Tree Descent for Black-Box Optimization | Unknown | N/A | |
| On Reinforcement Learning and Distribution Matching for Fine-Tuning Language Models with no Catastrophic Forgetting | Unknown | N/A | |
| Robust Imitation of a Few Demonstrations with a Backwards Model | Unknown | N/A | |
| AMP: Automatically Finding Model Parallel Strategies with Heterogeneity Awareness | Unknown | N/A | |
| Communication Acceleration of Local Gradient Methods via an Accelerated Primal-Dual Algorithm with an Inexact Prox | Unknown | N/A | |
| Performative Power | Unknown | N/A | |
| SatMAE: Pre-training Transformers for Temporal and Multi-Spectral Satellite Imagery | Unknown | N/A | |
| Benign, Tempered, or Catastrophic: Toward a Refined Taxonomy of Overfitting | Unknown | N/A | |
| The Power and Limitation of Pretraining-Finetuning for Linear Regression under Covariate Shift | Unknown | N/A | |
| Confident Approximate Policy Iteration for Efficient Local Planning in $q^\pi$-realizable MDPs | Unknown | N/A | |
| Unpacking Reward Shaping: Understanding the Benefits of Reward Engineering on Sample Complexity | Unknown | N/A | |
| Society of Agents: Regret Bounds of Concurrent Thompson Sampling | Unknown | N/A | |
| Exploring Length Generalization in Large Language Models | Unknown | N/A | |
| Unsupervised Learning for Combinatorial Optimization with Principled Objective Relaxation | Unknown | N/A | |
| GPT3.int8(): 8-bit Matrix Multiplication for Transformers at Scale | Unknown | N/A | |
| Lottery Tickets on a Data Diet: Finding Initializations with Sparse Trainable Networks | Unknown | N/A | |
| Revisiting Sparse Convolutional Model for Visual Recognition | Unknown | N/A | |
| Temporal Latent Bottleneck: Synthesis of Fast and Slow Processing Mechanisms in Sequence Learning | Unknown | N/A | |
| MoCoDA: Model-based Counterfactual Data Augmentation | Unknown | N/A | |
| Beyond Adult and COMPAS: Fair Multi-Class Prediction via Information Projection | Unknown | N/A | |
| On the generalization of learning algorithms that do not converge | Unknown | N/A | |
| Capturing Graphs with Hypo-Elliptic Diffusions | Unknown | N/A | |
| Hypothesis Testing for Differentially Private Linear Regression | Unknown | N/A | |
| Recurrent Convolutional Neural Networks Learn Succinct Learning Algorithms | Unknown | N/A | |
| AutoST: Towards the Universal Modeling of Spatio-temporal Sequences | Unknown | N/A | |
| SoLar: Sinkhorn Label Refinery for Imbalanced Partial-Label Learning | Unknown | N/A | |
| ESCADA: Efficient Safety and Context Aware Dose Allocation for Precision Medicine | Unknown | N/A | |
| Explicit Tradeoffs between Adversarial and Natural Distributional Robustness | Unknown | N/A | |
| Generalization Bounds for Gradient Methods via Discrete and Continuous Prior | Unknown | N/A | |
| CascadeXML: Rethinking Transformers for End-to-end Multi-resolution Training in Extreme Multi-label Classification | Unknown | N/A | |
| BYOL-Explore: Exploration by Bootstrapped Prediction | Unknown | N/A | |
| Ordered Subgraph Aggregation Networks | Unknown | N/A | |
| Where do Models go Wrong? Parameter-Space Saliency Maps for Explainability | Unknown | N/A | |
| Label Noise in Adversarial Training: A Novel Perspective to Study Robust Overfitting | Unknown | N/A | |
| Risk Bounds of Multi-Pass SGD for Least Squares in the Interpolation Regime | Unknown | N/A | |
| Text-Adaptive Multiple Visual Prototype Matching for Video-Text Retrieval | Unknown | N/A | |
| An In-depth Study of Stochastic Backpropagation | Unknown | N/A | |
| Tractable Optimality in Episodic Latent MABs | Unknown | N/A | |
| Meta-Query-Net: Resolving Purity-Informativeness Dilemma in Open-set Active Learning | Unknown | N/A | |
| Improving Certified Robustness via Statistical Learning with Logical Reasoning | Unknown | N/A | |
| Online Decision Mediation | Unknown | N/A | |
| Deep Differentiable Logic Gate Networks | Unknown | N/A | |
| Double Bubble, Toil and Trouble: Enhancing Certified Robustness through Transitivity | Unknown | N/A | |
| Associating Objects and Their Effects in Video through Coordination Games | Unknown | N/A | |
| Finite-Time Regret of Thompson Sampling Algorithms for Exponential Family Multi-Armed Bandits | Unknown | N/A | |
| Precise Learning Curves and Higher-Order Scalings for Dot-product Kernel Regression | Unknown | N/A | |
| Quantifying Statistical Significance of Neural Network-based Image Segmentation by Selective Inference | Unknown | N/A | |
| Multi-block Min-max Bilevel Optimization with Applications in Multi-task Deep AUC Maximization | Unknown | N/A | |
| Agreement-on-the-line: Predicting the Performance of Neural Networks under Distribution Shift | Unknown | N/A | |
| Neural Conservation Laws: A Divergence-Free Perspective | Unknown | N/A | |
| Sparse Hypergraph Community Detection Thresholds in Stochastic Block Model | Unknown | N/A | |
| Understanding and Extending Subgraph GNNs by Rethinking Their Symmetries | Unknown | N/A | |
| Latent Hierarchical Causal Structure Discovery with Rank Constraints | Unknown | N/A | |
| Task-Agnostic Graph Explanations | Unknown | N/A | |
| ZSON: Zero-Shot Object-Goal Navigation using Multimodal Goal Embeddings | Unknown | N/A | |
| Towards Optimal Communication Complexity in Distributed Non-Convex Optimization | Unknown | N/A | |
| Fast Bayesian Coresets via Subsampling and Quasi-Newton Refinement | Unknown | N/A | |
| Optimal Rates for Regularized Conditional Mean Embedding Learning | Unknown | N/A | |
| Are All Losses Created Equal: A Neural Collapse Perspective | Unknown | N/A | |
| Scalable Representation Learning in Linear Contextual Bandits with Constant Regret Guarantees | Unknown | N/A | |
| What You See is What You Get: Principled Deep Learning via Distributional Generalization | Unknown | N/A | |
| Knowledge Distillation: Bad Models Can Be Good Role Models | Unknown | N/A | |
| Fine-Tuning Pre-Trained Language Models Effectively by Optimizing Subnetworks Adaptively | Unknown | N/A | |
| Lifelong Neural Predictive Coding: Learning Cumulatively Online without Forgetting | Unknown | N/A | |
| Rare Gems: Finding Lottery Tickets at Initialization | Unknown | N/A | |
| Hidden Progress in Deep Learning: SGD Learns Parities Near the Computational Limit | Unknown | N/A | |
| Neural Approximation of Graph Topological Features | Unknown | N/A | |
| Near-Optimal Regret Bounds for Multi-batch Reinforcement Learning | Unknown | N/A | |
| Surprise Minimizing Multi-Agent Learning with Energy-based Models | Unknown | N/A | |
| Sparse Structure Search for Delta Tuning | Unknown | N/A | |
| Stability and Generalization for Markov Chain Stochastic Gradient Methods | Unknown | N/A | |
| Leveraging Factored Action Spaces for Efficient Offline Reinforcement Learning in Healthcare | Unknown | N/A | |
| Discovery of Single Independent Latent Variable | Unknown | N/A | |
| MoGDE: Boosting Mobile Monocular 3D Object Detection with Ground Depth Estimation | Unknown | N/A | |
| Compressible-composable NeRF via Rank-residual Decomposition | Unknown | N/A | |
| Asymmetric Temperature Scaling Makes Larger Networks Teach Well Again | Unknown | N/A | |
| DPM-Solver: A Fast ODE Solver for Diffusion Probabilistic Model Sampling in Around 10 Steps | Unknown | N/A | |
| Uni-Perceiver-MoE: Learning Sparse Generalist Models with Conditional MoEs | Unknown | N/A | |
| Neural Shape Deformation Priors | Unknown | N/A | |
| Hierarchical Channel-spatial Encoding for Communication-efficient Collaborative Learning | Unknown | N/A | |
| Debugging and Explaining Metric Learning Approaches: An Influence Function Based Perspective | Unknown | N/A | |
| Factuality Enhanced Language Models for Open-Ended Text Generation | Unknown | N/A | |
| Learn to Match with No Regret: Reinforcement Learning in Markov Matching Markets | Unknown | N/A | |
| A Simple and Provably Efficient Algorithm for Asynchronous Federated Contextual Linear Bandits | Unknown | N/A | |
| MaskTune: Mitigating Spurious Correlations by Forcing to Explore | Unknown | N/A | |
| Scalable Sensitivity and Uncertainty Analyses for Causal-Effect Estimates of Continuous-Valued Interventions | Unknown | N/A | |
| Additive MIL: Intrinsically Interpretable Multiple Instance Learning for Pathology | Unknown | N/A | |
| Reconstructing Training Data From Trained Neural Networks | Unknown | N/A | |
| Use-Case-Grounded Simulations for Explanation Evaluation | Unknown | N/A | |
| Differentiable hierarchical and surrogate gradient search for spiking neural networks | Unknown | N/A | |
| CalFAT: Calibrated Federated Adversarial Training with Label Skewness | Unknown | N/A | |
| Cluster Randomized Designs for One-Sided Bipartite Experiments | Unknown | N/A | |
| Multi-Sample Training for Neural Image Compression | Unknown | N/A | |
| On the Parameterization and Initialization of Diagonal State Space Models | Unknown | N/A | |
| Solving Quantitative Reasoning Problems with Language Models | Unknown | N/A | |
| Learnable Polyphase Sampling for Shift Invariant and Equivariant Convolutional Networks | Unknown | N/A | |
| D^2NeRF: Self-Supervised Decoupling of Dynamic and Static Objects from a Monocular Video | Unknown | N/A | |
| Semi-Supervised Video Salient Object Detection Based on Uncertainty-Guided Pseudo Labels | Unknown | N/A | |
| C2FAR: Coarse-to-Fine Autoregressive Networks for Precise Probabilistic Forecasting | Unknown | N/A | |
| SizeShiftReg: a Regularization Method for Improving Size-Generalization in Graph Neural Networks | Unknown | N/A | |
| Squeezeformer: An Efficient Transformer for Automatic Speech Recognition | Unknown | N/A | |
| Adversarial Attack on Attackers: Post-Process to Mitigate Black-Box Score-Based Query Attacks | Unknown | N/A | |
| Generalizing Bayesian Optimization with Decision-theoretic Entropies | Unknown | N/A | |
| Dict-TTS: Learning to Pronounce with Prior Dictionary Knowledge for Text-to-Speech | Unknown | N/A | |
| The Unreasonable Effectiveness of Fully-Connected Layers for Low-Data Regimes | Unknown | N/A | |
| Unsupervised Object Detection Pretraining with Joint Object Priors Generation and Detector Learning | Unknown | N/A | |
| Learning Chaotic Dynamics in Dissipative Systems | Unknown | N/A | |
| MonoSDF: Exploring Monocular Geometric Cues for Neural Implicit Surface Reconstruction | Unknown | N/A | |
| SeqPATE: Differentially Private Text Generation via Knowledge Distillation | Unknown | N/A | |
| DENSE: Data-Free One-Shot Federated Learning | Unknown | N/A | |
| Sym-NCO: Leveraging Symmetricity for Neural Combinatorial Optimization | Unknown | N/A | |
| Is $L^2$ Physics Informed Loss Always Suitable for Training Physics Informed Neural Network? | Unknown | N/A | |
| Hiding Images in Deep Probabilistic Models | Unknown | N/A | |
| Factored Adaptation for Non-Stationary Reinforcement Learning | Unknown | N/A | |
| Optimal Algorithms for Decentralized Stochastic Variational Inequalities | Unknown | N/A | |
| Semi-supervised Vision Transformers at Scale | Unknown | N/A | |
| Deep Model Reassembly | Unknown | N/A | |
| Your Transformer May Not be as Powerful as You Expect | Unknown | N/A | |
| InsPro: Propagating Instance Query and Proposal for Online Video Instance Segmentation | Unknown | N/A | |
| Linear tree shap | Unknown | N/A | |
| Delving into Sequential Patches for Deepfake Detection | Unknown | N/A | |
| Untargeted Backdoor Watermark: Towards Harmless and Stealthy Dataset Copyright Protection | Unknown | N/A | |
| ClimbQ: Class Imbalanced Quantization Enabling Robustness on Efficient Inferences | Unknown | N/A | |
| Learning Latent Seasonal-Trend Representations for Time Series Forecasting | Unknown | N/A | |
| Posterior Refinement Improves Sample Efficiency in Bayesian Neural Networks | Unknown | N/A | |
| Back Razor: Memory-Efficient Transfer Learning by Self-Sparsified Backpropagation | Unknown | N/A | |
| DreamShard: Generalizable Embedding Table Placement for Recommender Systems | Unknown | N/A | |
| Dataset Distillation via Factorization | Unknown | N/A | |
| Video Diffusion Models | Unknown | N/A | |
| Theseus: A Library for Differentiable Nonlinear Optimization | Unknown | N/A | |
| Decoupling Features in Hierarchical Propagation for Video Object Segmentation | Unknown | N/A | |
| RankFeat: Rank-1 Feature Removal for Out-of-distribution Detection | Unknown | N/A | |
| Explainable Reinforcement Learning via Model Transforms | Unknown | N/A | |
| Matryoshka Representation Learning | Unknown | N/A | |
| VoxGRAF: Fast 3D-Aware Image Synthesis with Sparse Voxel Grids | Unknown | N/A | |
| Decoupling Classifier for Boosting Few-shot Object Detection and Instance Segmentation | Unknown | N/A | |
| MetaMask: Revisiting Dimensional Confounder for Self-Supervised Learning | Unknown | N/A | |
| LGDN: Language-Guided Denoising Network for Video-Language Modeling | Unknown | N/A | |
| PyramidCLIP: Hierarchical Feature Alignment for Vision-language Model Pretraining | Unknown | N/A | |
| Divide and Contrast: Source-free Domain Adaptation via Adaptive Contrastive Learning | Unknown | N/A | |
| Adapting Self-Supervised Vision Transformers by Probing Attention-Conditioned Masking Consistency | Unknown | N/A | |
| Flexible Neural Image Compression via Code Editing | Unknown | N/A | |
| Learning Physics Constrained Dynamics Using Autoencoders | Unknown | N/A | |
| Active Learning with Neural Networks: Insights from Nonparametric Statistics | Unknown | N/A | |
| Understanding Robust Learning through the Lens of Representation Similarities | Unknown | N/A | |
| Beyond Separability: Analyzing the Linear Transferability of Contrastive Representations to Related Subpopulations | Unknown | N/A | |
| Few-shot Task-agnostic Neural Architecture Search for Distilling Large Language Models | Unknown | N/A | |
| Zero-Shot Video Question Answering via Frozen Bidirectional Language Models | Unknown | N/A | |
| Outsourcing Training without Uploading Data via Efficient Collaborative Open-Source Sampling | Unknown | N/A | |
| Measuring and Reducing Model Update Regression in Structured Prediction for NLP | Unknown | N/A | |
| Coordinates Are NOT Lonely - Codebook Prior Helps Implicit Neural 3D representations | Unknown | N/A | |
| Multitasking Models are Robust to Structural Failure: A Neural Model for Bilingual Cognitive Reserve | Unknown | N/A | |
| A Policy-Guided Imitation Approach for Offline Reinforcement Learning | Unknown | N/A | |
| Asymptotics of smoothed Wasserstein distances in the small noise regime | Unknown | N/A | |
| Finite-Time Last-Iterate Convergence for Learning in Multi-Player Games | Unknown | N/A | |
| CARD: Classification and Regression Diffusion Models | Unknown | N/A | |
| GraphDE: A Generative Framework for Debiased Learning and Out-of-Distribution Detection on Graphs | Unknown | N/A | |
| Unlabelled Sample Compression Schemes for Intersection-Closed Classes and Extremal Classes | Unknown | N/A | |
| Concentration of Data Encoding in Parameterized Quantum Circuits | Unknown | N/A | |
| Learning Efficient Vision Transformers via Fine-Grained Manifold Distillation | Unknown | N/A | |
| M$^4$I: Multi-modal Models Membership Inference | Unknown | N/A | |
| Beyond accuracy: generalization properties of bio-plausible temporal credit assignment rules | Unknown | N/A | |
| VLMo: Unified Vision-Language Pre-Training with Mixture-of-Modality-Experts | Unknown | N/A | |
| Pre-Trained Language Models for Interactive Decision-Making | Unknown | N/A | |
| Learning from Label Proportions by Learning with Label Noise | Unknown | N/A | |
| A Closer Look at Offline RL Agents | Unknown | N/A | |
| Beyond spectral gap: the role of the topology in decentralized learning | Unknown | N/A | |
| A permutation-free kernel two-sample test | Unknown | N/A | |
| C-Mixup: Improving Generalization in Regression | Unknown | N/A | |
| Generalizing Consistent Multi-Class Classification with Rejection to be Compatible with Arbitrary Losses | Unknown | N/A | |
| Efficient Multi-agent Communication via Self-supervised Information Aggregation | Unknown | N/A | |
| EfficientFormer: Vision Transformers at MobileNet Speed | Unknown | N/A | |
| Pseudo-Riemannian Graph Convolutional Networks | Unknown | N/A | |
| Fast Algorithms for Packing Proportional Fairness and its Dual | Unknown | N/A | |
| Optimistic Posterior Sampling for Reinforcement Learning with Few Samples and Tight Guarantees | Unknown | N/A | |
| Training Scale-Invariant Neural Networks on the Sphere Can Happen in Three Regimes | Unknown | N/A | |
| Reincarnating Reinforcement Learning: Reusing Prior Computation to Accelerate Progress | Unknown | N/A | |
| Active Exploration for Inverse Reinforcement Learning | Unknown | N/A | |
| UniGAN: Reducing Mode Collapse in GANs using a Uniform Generator | Unknown | N/A | |
| Diffusion Curvature for Estimating Local Curvature in High Dimensional Data | Unknown | N/A | |
| Batch-Size Independent Regret Bounds for Combinatorial Semi-Bandits with Probabilistically Triggered Arms or Independent Arms | Unknown | N/A | |
| On Enforcing Better Conditioned Meta-Learning for Rapid Few-Shot Adaptation | Unknown | N/A | |
| Efficient learning of nonlinear prediction models with time-series privileged information | Unknown | N/A | |
| Training and Inference on Any-Order Autoregressive Models the Right Way | Unknown | N/A | |
| SPD: Synergy Pattern Diversifying Oriented Unsupervised Multi-agent Reinforcement Learning | Unknown | N/A | |
| GAPX: Generalized Autoregressive Paraphrase-Identification X | Unknown | N/A | |
| CATER: Intellectual Property Protection on Text Generation APIs via Conditional Watermarks | Unknown | N/A | |
| Reinforcement Learning with a Terminator | Unknown | N/A | |
| Bringing Image Scene Structure to Video via Frame-Clip Consistency of Object Tokens | Unknown | N/A | |
| Class-Dependent Label-Noise Learning with Cycle-Consistency Regularization | Unknown | N/A | |
| CAGroup3D: Class-Aware Grouping for 3D Object Detection on Point Clouds | Unknown | N/A | |
| Sparse Interaction Additive Networks via Feature Interaction Detection and Sparse Selection | Unknown | N/A | |
| Object-Category Aware Reinforcement Learning | Unknown | N/A | |
| Decision-Focused Learning without Decision-Making: Learning Locally Optimized Decision Losses | Unknown | N/A | |
| Universally Expressive Communication in Multi-Agent Reinforcement Learning | Unknown | N/A | |
| Are GANs overkill for NLP? | Unknown | N/A | |
| Simple Mechanisms for Welfare Maximization in Rich Advertising Auctions | Unknown | N/A | |
| Scalable Interpretability via Polynomials | Unknown | N/A | |
| NOTE: Robust Continual Test-time Adaptation Against Temporal Correlation | Unknown | N/A | |
| Learning Audio-Visual Dynamics Using Scene Graphs for Audio Source Separation | Unknown | N/A | |
| Symmetry Teleportation for Accelerated Optimization | Unknown | N/A | |
| The Nature of Temporal Difference Errors in Multi-step Distributional Reinforcement Learning | Unknown | N/A | |
| Truncated proposals for scalable and hassle-free simulation-based inference | Unknown | N/A | |
| Large-Scale Retrieval for Reinforcement Learning | Unknown | N/A | |
| Decoupled Self-supervised Learning for Graphs | Unknown | N/A | |
| In Differential Privacy, There is Truth: on Vote-Histogram Leakage in Ensemble Private Learning | Unknown | N/A | |
| Handcrafted Backdoors in Deep Neural Networks | Unknown | N/A | |
| Structuring Representations Using Group Invariants | Unknown | N/A | |
| A sharp NMF result with applications in network modeling | Unknown | N/A | |
| Improving Policy Learning via Language Dynamics Distillation | Unknown | N/A | |
| Pure Transformers are Powerful Graph Learners | Unknown | N/A | |
| Contextual Squeeze-and-Excitation for Efficient Few-Shot Image Classification | Unknown | N/A | |
| Few-shot Image Generation via Adaptation-Aware Kernel Modulation | Unknown | N/A | |
| Towards Understanding Grokking: An Effective Theory of Representation Learning | Unknown | N/A | |
| Online Agnostic Multiclass Boosting | Unknown | N/A | |
| Adversarial Unlearning: Reducing Confidence Along Adversarial Directions | Unknown | N/A | |
| Robust Imitation via Mirror Descent Inverse Reinforcement Learning | Unknown | N/A | |
| HyperMiner: Topic Taxonomy Mining with Hyperbolic Embedding | Unknown | N/A | |
| Oracle-Efficient Online Learning for Smoothed Adversaries | Unknown | N/A | |
| Multiclass Learnability Beyond the PAC Framework: Universal Rates and Partial Concept Classes | Unknown | N/A | |
| Lower Bounds on Randomly Preconditioned Lasso via Robust Sparse Designs | Unknown | N/A | |
| Learning from Distributed Users in Contextual Linear Bandits Without Sharing the Context | Unknown | N/A | |
| Accelerating Sparse Convolution with Column Vector-Wise Sparsity | Unknown | N/A | |
| Fast Instrument Learning with Faster Rates | Unknown | N/A | |
| LTMD: Learning Improvement of Spiking Neural Networks with Learnable Thresholding Neurons and Moderate Dropout | Unknown | N/A | |
| Improving Neural Ordinary Differential Equations with Nesterov's Accelerated Gradient Method | Unknown | N/A | |
| Learning Neural Set Functions Under the Optimal Subset Oracle | Unknown | N/A | |
| Guaranteed Conservation of Momentum for Learning Particle-based Fluid Dynamics | Unknown | N/A | |
| Universality of Group Convolutional Neural Networks Based on Ridgelet Analysis on Groups | Unknown | N/A | |
| On the Spectral Bias of Convolutional Neural Tangent and Gaussian Process Kernels | Unknown | N/A | |
| Beyond L1: Faster and Better Sparse Models with skglm | Unknown | N/A | |
| Improving GANs with A Dynamic Discriminator | Unknown | N/A | |
| Streaming Radiance Fields for 3D Video Synthesis | Unknown | N/A | |
| On the non-universality of deep learning: quantifying the cost of symmetry | Unknown | N/A | |
| GraB: Finding Provably Better Data Permutations than Random Reshuffling | Unknown | N/A | |
| Enhancing Safe Exploration Using Safety State Augmentation | Unknown | N/A | |
| Robust Binary Models by Pruning Randomly-initialized Networks | Unknown | N/A | |
| Optimal and Adaptive Monteiro-Svaiter Acceleration | Unknown | N/A | |
| Reinforcement Learning with Logarithmic Regret and Policy Switches | Unknown | N/A | |
| HYPRO: A Hybridly Normalized Probabilistic Model for Long-Horizon Prediction of Event Sequences | Unknown | N/A | |
| Temporally-Consistent Survival Analysis | Unknown | N/A | |
| Data-IQ: Characterizing subgroups with heterogeneous outcomes in tabular data | Unknown | N/A | |
| Learning and Covering Sums of Independent Random Variables with Unbounded Support | Unknown | N/A | |
| Learning to Discover and Detect Objects | Unknown | N/A | |
| UViM: A Unified Modeling Approach for Vision with Learned Guiding Codes | Unknown | N/A | |
| BOME! Bilevel Optimization Made Easy: A Simple First-Order Approach | Unknown | N/A | |
| DeepFoids: Adaptive Bio-Inspired Fish Simulation with Deep Reinforcement Learning | Unknown | N/A | |
| Improving Intrinsic Exploration with Language Abstractions | Unknown | N/A | |
| MACE: Higher Order Equivariant Message Passing Neural Networks for Fast and Accurate Force Fields | Unknown | N/A | |
| Monte Carlo Augmented Actor-Critic for Sparse Reward Deep Reinforcement Learning from Suboptimal Demonstrations | Unknown | N/A | |
| ShapeCrafter: A Recursive Text-Conditioned 3D Shape Generation Model | Unknown | N/A | |
| Non-identifiability and the Blessings of Misspecification in Models of Molecular Fitness | Unknown | N/A | |
| VoiceBlock: Privacy through Real-Time Adversarial Attacks with Audio-to-Audio Models | Unknown | N/A | |
| Recall Distortion in Neural Network Pruning and the Undecayed Pruning Algorithm | Unknown | N/A | |
| Outlier-Robust Sparse Mean Estimation for Heavy-Tailed Distributions | Unknown | N/A | |
| Invariance-Aware Randomized Smoothing Certificates | Unknown | N/A | |
| Beyond the Return: Off-policy Function Estimation under User-specified Error-measuring Distributions | Unknown | N/A | |
| On the Statistical Efficiency of Reward-Free Exploration in Non-Linear RL | Unknown | N/A | |
| Energy-Based Contrastive Learning of Visual Representations | Unknown | N/A | |
| Identifying good directions to escape the NTK regime and efficiently learn low-degree plus sparse polynomials | Unknown | N/A | |
| Deep Surrogate Assisted Generation of Environments | Unknown | N/A | |
| Hierarchical Lattice Layer for Partially Monotone Neural Networks | Unknown | N/A | |
| SemiFL: Semi-Supervised Federated Learning for Unlabeled Clients with Alternate Training | Unknown | N/A | |
| Self-Similarity Priors: Neural Collages as Differentiable Fractal Representations | Unknown | N/A | |
| Sample Constrained Treatment Effect Estimation | Unknown | N/A | |
| What's the Harm? Sharp Bounds on the Fraction Negatively Affected by Treatment | Unknown | N/A | |
| Empirical Phase Diagram for Three-layer Neural Networks with Infinite Width | Unknown | N/A | |
| FiLM-Ensemble: Probabilistic Deep Learning via Feature-wise Linear Modulation | Unknown | N/A | |
| Maximum Likelihood Training of Implicit Nonlinear Diffusion Model | Unknown | N/A | |
| Single Loop Gaussian Homotopy Method for Non-convex Optimization | Unknown | N/A | |
| GAL: Gradient Assisted Learning for Decentralized Multi-Organization Collaborations | Unknown | N/A | |
| CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning | Unknown | N/A | |
| Inducing Equilibria via Incentives: Simultaneous Design-and-Play Ensures Global Convergence | Unknown | N/A | |
| Reinforcement Learning with Neural Radiance Fields | Unknown | N/A | |
| Multi-agent Performative Prediction with Greedy Deployment and Consensus Seeking Agents | Unknown | N/A | |
| A Differentially Private Linear-Time fPTAS for the Minimum Enclosing Ball Problem | Unknown | N/A | |
| Debiased Causal Tree: Heterogeneous Treatment Effects Estimation with Unmeasured Confounding | Unknown | N/A | |
| Assistive Teaching of Motor Control Tasks to Humans | Unknown | N/A | |
| Learning interacting dynamical systems with latent Gaussian process ODEs | Unknown | N/A | |
| Provably expressive temporal graph networks | Unknown | N/A | |
| A Universal Error Measure for Input Predictions Applied to Online Graph Problems | Unknown | N/A | |
| On the difficulty of learning chaotic dynamics with RNNs | Unknown | N/A | |
| Learning on the Edge: Online Learning with Stochastic Feedback Graphs | Unknown | N/A | |
| Meta-ticket: Finding optimal subnetworks for few-shot learning within randomly initialized neural networks | Unknown | N/A | |
| Adjoint-aided inference of Gaussian process driven differential equations | Unknown | N/A | |
| Subquadratic Kronecker Regression with Applications to Tensor Decomposition | Unknown | N/A | |
| Post-hoc estimators for learning to defer to an expert | Unknown | N/A | |
| Polynomial-Time Optimal Equilibria with a Mediator in Extensive-Form Games | Unknown | N/A | |
| Contrastive and Non-Contrastive Self-Supervised Learning Recover Global and Local Spectral Embedding Methods | Unknown | N/A | |
| Discovered Policy Optimisation | Unknown | N/A | |
| Decomposable Non-Smooth Convex Optimization with Nearly-Linear Gradient Oracle Complexity | Unknown | N/A | |
| SPD domain-specific batch normalization to crack interpretable unsupervised domain adaptation in EEG | Unknown | N/A | |
| Convexity Certificates from Hessians | Unknown | N/A | |
| Holomorphic Equilibrium Propagation Computes Exact Gradients Through Finite Size Oscillations | Unknown | N/A | |
| Log-Linear-Time Gaussian Processes Using Binary Tree Kernels | Unknown | N/A | |
| Indicators of Attack Failure: Debugging and Improving Optimization of Adversarial Examples | Unknown | N/A | |
| Continual Learning In Environments With Polynomial Mixing Times | Unknown | N/A | |
| VisFIS: Visual Feature Importance Supervision with Right-for-the-Right-Reason Objectives | Unknown | N/A | |
| Algorithms and Hardness for Learning Linear Thresholds from Label Proportions | Unknown | N/A | |
| Enhanced Meta Reinforcement Learning via Demonstrations in Sparse Reward Environments | Unknown | N/A | |
| Make Some Noise: Reliable and Efficient Single-Step Adversarial Training | Unknown | N/A | |
| Transformer Memory as a Differentiable Search Index | Unknown | N/A | |
| (De-)Randomized Smoothing for Decision Stump Ensembles | Unknown | N/A | |
| Global Normalization for Streaming Speech Recognition in a Modular Framework | Unknown | N/A | |
| Theoretically Better and Numerically Faster Distributed Optimization with Smoothness-Aware Quantization Techniques | Unknown | N/A | |
| Learning Tractable Probabilistic Models from Inconsistent Local Estimates | Unknown | N/A | |
| List-Decodable Sparse Mean Estimation via Difference-of-Pairs Filtering | Unknown | N/A | |
| Normalizing Flows for Knockoff-free Controlled Feature Selection | Unknown | N/A | |
| Debiased Machine Learning without Sample-Splitting for Stable Estimators | Unknown | N/A | |
| Explicable Policy Search | Unknown | N/A | |
| Robustness to Unbounded Smoothness of Generalized SignSGD | Unknown | N/A | |
| Subgame Solving in Adversarial Team Games | Unknown | N/A | |
| Autoregressive Perturbations for Data Poisoning | Unknown | N/A | |
| Trust Region Policy Optimization with Optimal Transport Discrepancies: Duality and Algorithm for Continuous Actions | Unknown | N/A | |
| Statistical Learning and Inverse Problems: A Stochastic Gradient Approach | Unknown | N/A | |
| TPU-KNN: K Nearest Neighbor Search at Peak FLOP/s | Unknown | N/A | |
| Self-Aware Personalized Federated Learning | Unknown | N/A | |
| Unsupervised Visual Representation Learning via Mutual Information Regularized Assignment | Unknown | N/A | |
| LiteTransformerSearch: Training-free Neural Architecture Search for Efficient Language Models | Unknown | N/A | |
| Nonstationary Dual Averaging and Online Fair Allocation | Unknown | N/A | |
| Leveraging Inter-Layer Dependency for Post-Training Quantization | Unknown | N/A | |
| FOF: Learning Fourier Occupancy Field for Monocular Real-time Human Reconstruction | Unknown | N/A | |
| Learning Expressive Meta-Representations with Mixture of Expert Neural Processes | Unknown | N/A | |
| REVIVE: Regional Visual Representation Matters in Knowledge-Based Visual Question Answering | Unknown | N/A | |
| Online Neural Sequence Detection with Hierarchical Dirichlet Point Process | Unknown | N/A | |
| Exploring Figure-Ground Assignment Mechanism in Perceptual Organization | Unknown | N/A | |
| DTG-SSOD: Dense Teacher Guidance for Semi-Supervised Object Detection | Unknown | N/A | |
| Deliberated Domain Bridging for Domain Adaptive Semantic Segmentation | Unknown | N/A | |
| Dual-Curriculum Contrastive Multi-Instance Learning for Cancer Prognosis Analysis with Whole Slide Images | Unknown | N/A | |
| BadPrompt: Backdoor Attacks on Continuous Prompts | Unknown | N/A | |
| Geodesic Self-Attention for 3D Point Clouds | Unknown | N/A | |
| Learning Enhanced Representation for Tabular Data via Neighborhood Propagation | Unknown | N/A | |
| Spectrum Random Masking for Generalization in Image-based Reinforcement Learning | Unknown | N/A | |
| 3DB: A Framework for Debugging Computer Vision Models | Unknown | N/A | |
| High-dimensional limit theorems for SGD: Effective dynamics and critical scaling | Unknown | N/A | |
| Provable Generalization of Overparameterized Meta-learning Trained with SGD | Unknown | N/A | |
| MinVIS: A Minimal Video Instance Segmentation Framework without Video-based Training | Unknown | N/A | |
| Efficient Meta Reinforcement Learning for Preference-based Fast Adaptation | Unknown | N/A | |
| Reinforced Genetic Algorithm for Structure-based Drug Design | Unknown | N/A | |
| Motion Transformer with Global Intention Localization and Local Movement Refinement | Unknown | N/A | |
| Deep Fourier Up-Sampling | Unknown | N/A | |
| FR: Folded Rationalization with a Unified Encoder | Unknown | N/A | |
| Measures of Information Reflect Memorization Patterns | Unknown | N/A | |
| Trading off Image Quality for Robustness is not Necessary with Regularized Deterministic Autoencoders | Unknown | N/A | |
| CASA: Category-agnostic Skeletal Animal Reconstruction | Unknown | N/A | |
| Learning Energy Networks with Generalized Fenchel-Young Losses | Unknown | N/A | |
| Regularized Gradient Descent Ascent for Two-Player Zero-Sum Markov Games | Unknown | N/A | |
| Rethinking Image Restoration for Object Detection | Unknown | N/A | |
| GBA: A Tuning-free Approach to Switch between Synchronous and Asynchronous Training for Recommendation Models | Unknown | N/A | |
| Modeling Human Exploration Through Resource-Rational Reinforcement Learning | Unknown | N/A | |
| SignRFF: Sign Random Fourier Features | Unknown | N/A | |
| Gradient Estimation with Discrete Stein Operators | Unknown | N/A | |
| Provably Efficient Reinforcement Learning in Partially Observable Dynamical Systems | Unknown | N/A | |
| Single-phase deep learning in cortico-cortical networks | Unknown | N/A | |
| GraphQNTK: Quantum Neural Tangent Kernel for Graph Data | Unknown | N/A | |
| BiMLP: Compact Binary Architectures for Vision Multi-Layer Perceptrons | Unknown | N/A | |
| Sampling in Constrained Domains with Orthogonal-Space Variational Gradient Descent | Unknown | N/A | |
| Exploring the Limits of Domain-Adaptive Training for Detoxifying Large-Scale Language Models | Unknown | N/A | |
| LasUIE: Unifying Information Extraction with Latent Adaptive Structure-aware Generative Language Model | Unknown | N/A | |
| An $\alpha$-regret analysis of Adversarial Bilateral Trade | Unknown | N/A | |
| Intrinsic dimensionality estimation using Normalizing Flows | Unknown | N/A | |
| Supervised Training of Conditional Monge Maps | Unknown | N/A | |
| Drawing out of Distribution with Neuro-Symbolic Generative Models | Unknown | N/A | |
| Sketching based Representations for Robust Image Classification with Provable Guarantees | Unknown | N/A | |
| Learning low-dimensional generalizable natural features from retina using a U-net | Unknown | N/A | |
| Data Augmentation for Compositional Data: Advancing Predictive Models of the Microbiome | Unknown | N/A | |
| VisCo Grids: Surface Reconstruction with Viscosity and Coarea Grids | Unknown | N/A | |
| Synergy-of-Experts: Collaborate to Improve Adversarial Robustness | Unknown | N/A | |
| Neural Matching Fields: Implicit Representation of Matching Fields for Visual Correspondence | Unknown | N/A | |
| Fast Bayesian Estimation of Point Process Intensity as Function of Covariates | Unknown | N/A | |
| MOVE: Unsupervised Movable Object Segmentation and Detection | Unknown | N/A | |
| Not All Bits have Equal Value: Heterogeneous Precisions via Trainable Noise | Unknown | N/A | |
| Differentially Private Learning Needs Hidden State (Or Much Faster Convergence) | Unknown | N/A | |
| Training Spiking Neural Networks with Local Tandem Learning | Unknown | N/A | |
| Unsupervised Skill Discovery via Recurrent Skill Training | Unknown | N/A | |
| Interpreting Operation Selection in Differentiable Architecture Search: A Perspective from Influence-Directed Explanations | Unknown | N/A | |
| Fair Rank Aggregation | Unknown | N/A | |
| Optimal Gradient Sliding and its Application to Optimal Distributed Optimization Under Similarity | Unknown | N/A | |
| Contact-aware Human Motion Forecasting | Unknown | N/A | |
| Non-rigid Point Cloud Registration with Neural Deformation Pyramid | Unknown | N/A | |
| Make an Omelette with Breaking Eggs: Zero-Shot Learning for Novel Attribute Synthesis | Unknown | N/A | |
| The First Optimal Algorithm for Smooth and Strongly-Convex-Strongly-Concave Minimax Optimization | Unknown | N/A | |
| Towards Reasonable Budget Allocation in Untargeted Graph Structure Attacks via Gradient Debias | Unknown | N/A | |
| Stability and Generalization of Kernel Clustering: from Single Kernel to Multiple Kernel | Unknown | N/A | |
| Few-shot Relational Reasoning via Connection Subgraph Pretraining | Unknown | N/A | |
| Alleviating Adversarial Attacks on Variational Autoencoders with MCMC | Unknown | N/A | |
| Coreset for Line-Sets Clustering | Unknown | N/A | |
| Fast Stochastic Composite Minimization and an Accelerated Frank-Wolfe Algorithm under Parallelization | Unknown | N/A | |
| FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness | Unknown | N/A | |
| HF-NeuS: Improved Surface Reconstruction Using High-Frequency Details | Unknown | N/A | |
| On the Effectiveness of Fine-tuning Versus Meta-reinforcement Learning | Unknown | N/A | |
| Spatial Pruned Sparse Convolution for Efficient 3D Object Detection | Unknown | N/A | |
| Byzantine Spectral Ranking | Unknown | N/A | |
| What Can the Neural Tangent Kernel Tell Us About Adversarial Robustness? | Unknown | N/A | |
| On Translation and Reconstruction Guarantees of the Cycle-Consistent Generative Adversarial Networks | Unknown | N/A | |
| Evaluated CMI Bounds for Meta Learning: Tightness and Expressiveness | Unknown | N/A | |
| SnAKe: Bayesian Optimization with Pathwise Exploration | Unknown | N/A | |
| Random Rank: The One and Only Strategyproof and Proportionally Fair Randomized Facility Location Mechanism | Unknown | N/A | |
| Resolving the data ambiguity for periodic crystals | Unknown | N/A | |
| CroCo: Self-Supervised Pre-training for 3D Vision Tasks by Cross-View Completion | Unknown | N/A | |
| Coresets for Vertical Federated Learning: Regularized Linear Regression and $K$-Means Clustering | Unknown | N/A | |
| Learning Predictions for Algorithms with Predictions | Unknown | N/A | |
| Hyperparameter Sensitivity in Deep Outlier Detection: Analysis and a Scalable Hyper-Ensemble Solution | Unknown | N/A | |
| DASCO: Dual-Generator Adversarial Support Constrained Offline Reinforcement Learning | Unknown | N/A | |
| Exploring through Random Curiosity with General Value Functions | Unknown | N/A | |
| Equivariant Networks for Zero-Shot Coordination | Unknown | N/A | |
| A PAC-Bayesian Generalization Bound for Equivariant Networks | Unknown | N/A | |
| Split-kl and PAC-Bayes-split-kl Inequalities for Ternary Random Variables | Unknown | N/A | |
| Pareto Set Learning for Expensive Multi-Objective Optimization | Unknown | N/A | |
| Formalizing Consistency and Coherence of Representation Learning | Unknown | N/A | |
| Compositional generalization through abstract representations in human and artificial neural networks | Unknown | N/A | |
| The Sample Complexity of One-Hidden-Layer Neural Networks | Unknown | N/A | |
| Diffusion Visual Counterfactual Explanations | Unknown | N/A | |
| Finding Optimal Arms in Non-stochastic Combinatorial Bandits with Semi-bandit Feedback and Finite Budget | Unknown | N/A | |
| Pessimism for Offline Linear Contextual Bandits using $\ell_p$ Confidence Sets | Unknown | N/A | |
| Assaying Out-Of-Distribution Generalization in Transfer Learning | Unknown | N/A | |
| What are the best Systems? New Perspectives on NLP Benchmarking | Unknown | N/A | |
| Clipped Stochastic Methods for Variational Inequalities with Heavy-Tailed Noise | Unknown | N/A | |
| Hardness in Markov Decision Processes: Theory and Practice | Unknown | N/A | |
| Generalization Error Bounds on Deep Learning with Markov Datasets | Unknown | N/A | |
| Information-Theoretic Safe Exploration with Gaussian Processes | Unknown | N/A | |
| M³ViT: Mixture-of-Experts Vision Transformer for Efficient Multi-task Learning with Model-Accelerator Co-design | Unknown | N/A | |
| HierSpeech: Bridging the Gap between Text and Speech by Hierarchical Variational Inference using Self-supervised Representations for Speech Synthesis | Unknown | N/A | |
| [Re] Replication Study of "Fairness and Bias in Online Selection" | Unknown | N/A | |
| Triangulation candidates for Bayesian optimization | Unknown | N/A | |
| Non-asymptotic and Accurate Learning of Nonlinear Dynamical Systems | Unknown | N/A | |
| Washing The Unwashable: On The (Im)possibility of Fairwashing Detection | Unknown | N/A | |
| No-regret learning in games with noisy feedback: Faster rates and adaptivity via learning rate separation | Unknown | N/A | |
| Adaptive Data Debiasing through Bounded Exploration | Unknown | N/A | |
| Positive-Unlabeled Learning using Random Forests via Recursive Greedy Risk Minimization | Unknown | N/A | |
| Toward a realistic model of speech processing in the brain with self-supervised learning | Unknown | N/A | |
| TabNAS: Rejection Sampling for Neural Architecture Search on Tabular Datasets | Unknown | N/A | |
| CLIPDraw: Exploring Text-to-Drawing Synthesis through Language-Image Encoders | Unknown | N/A | |
| Attention-based Neural Cellular Automata | Unknown | N/A | |
| Sparse Additive Gaussian Process Regression | Unknown | N/A | |
| Attraction-Repulsion Spectrum in Neighbor Embeddings | Unknown | N/A | |
| Online Nonnegative CP-dictionary Learning for Markovian Data | Unknown | N/A | |
| Decimated Framelet System on Graphs and Fast G-Framelet Transforms | Unknown | N/A | |
| Multi-Agent Multi-Armed Bandits with Limited Communication | Unknown | N/A | |
| Accelerated Zeroth-Order and First-Order Momentum Methods from Mini to Minimax Optimization | Unknown | N/A | |
| Optimality and Stability in Non-Convex Smooth Games | Unknown | N/A | |
| Deep Limits and a Cut-Off Phenomenon for Neural Networks | Unknown | N/A | |
| Robust and scalable manifold learning via landmark diffusion for long-term medical signal processing | Unknown | N/A | |
| [Re] Differentiable Spatial Planning using Transformers | Unknown | N/A | |
| All You Need is a Good Functional Prior for Bayesian Deep Learning | Unknown | N/A | |
| Recovery and Generalization in Over-Realized Dictionary Learning | Unknown | N/A | |
| Truncated Emphatic Temporal Difference Methods for Prediction and Control | Unknown | N/A | |
| [Re] AdaBelief Optimizer: Adapting Stepsizes by the Belief in Observed Gradients | Unknown | N/A | |
| When is the Convergence Time of Langevin Algorithms Dimension Independent? A Composite Optimization Viewpoint | Unknown | N/A | |
| [Re] Replication study of 'Data-Driven Methods for Balancing Fairness and Efficiency in Ride-Pooling' | Unknown | N/A | |
| Learning Operators with Coupled Attention | Unknown | N/A | |
| [Re] Solving Phase Retrieval With a Learned Reference | Unknown | N/A | |
| [Re] Explaining in Style: Training a GAN to explain a classifier in StyleSpace | Unknown | N/A | |
| DeepInteraction: 3D Object Detection via Modality Interaction | Unknown | N/A | |
| Mix and Reason: Reasoning over Semantic Topology with Data Mixing for Domain Generalization | Unknown | N/A | |
| SGAM: Building a Virtual 3D World through Simultaneous Generation and Mapping | Unknown | N/A | |
| RLIP: Relational Language-Image Pre-training for Human-Object Interaction Detection | Unknown | N/A | |
| Dense Interspecies Face Embedding | Unknown | N/A | |
| Orthogonal Transformer: An Efficient Vision Transformer Backbone with Token Orthogonalization | Unknown | N/A | |
| UMIX: Improving Importance Weighting for Subpopulation Shift via Uncertainty-Aware Mixup | Unknown | N/A | |
| Rethinking Individual Global Max in Cooperative Multi-Agent Reinforcement Learning | Unknown | N/A | |
| Resource-Adaptive Federated Learning with All-In-One Neural Composition | Unknown | N/A | |
| One Model to Edit Them All: Free-Form Text-Driven Image Manipulation with Semantic Modulations | Unknown | N/A | |
| Weakly-Supervised Multi-Granularity Map Learning for Vision-and-Language Navigation | Unknown | N/A | |
| On the Robustness of Deep Clustering Models: Adversarial Attacks and Defenses | Unknown | N/A | |
| Uncoupled Learning Dynamics with $O(\log T)$ Swap Regret in Multiplayer Games | Unknown | N/A | |
| Weak-shot Semantic Segmentation via Dual Similarity Transfer | Unknown | N/A | |
| Effective Backdoor Defense by Exploiting Sensitivity of Poisoned Samples | Unknown | N/A | |
| Obj2Seq: Formatting Objects as Sequences with Class Prompt for Visual Tasks | Unknown | N/A | |
| OTKGE: Multi-modal Knowledge Graph Embeddings via Optimal Transport | Unknown | N/A | |
| Positively Weighted Kernel Quadrature via Subsampling | Unknown | N/A | |
| LASSIE: Learning Articulated Shapes from Sparse Image Ensemble via 3D Part Discovery | Unknown | N/A | |
| A Kernelised Stein Statistic for Assessing Implicit Generative Models | Unknown | N/A | |
| E-MAPP: Efficient Multi-Agent Reinforcement Learning with Parallel Program Guidance | Unknown | N/A | |
| EpiGRAF: Rethinking training of 3D GANs | Unknown | N/A | |
| Bridging the Gap Between Vision Transformers and Convolutional Neural Networks on Small Datasets | Unknown | N/A | |
| Optimal Efficiency-Envy Trade-Off via Optimal Transport | Unknown | N/A | |
| Generating Long Videos of Dynamic Scenes | Unknown | N/A | |
| Private Synthetic Data for Multitask Learning and Marginal Queries | Unknown | N/A | |
| Graph Self-supervised Learning with Accurate Discrepancy Learning | Unknown | N/A | |
| Independence Testing for Bounded Degree Bayesian Networks | Unknown | N/A | |
| Tikhonov Regularization is Optimal Transport Robust under Martingale Constraints | Unknown | N/A | |
| ZeroC: A Neuro-Symbolic Model for Zero-shot Concept Recognition and Acquisition at Inference Time | Unknown | N/A | |
| SAMURAI: Shape And Material from Unconstrained Real-world Arbitrary Image collections | Unknown | N/A | |
| Outlier Suppression: Pushing the Limit of Low-bit Transformer Language Models | Unknown | N/A | |
| Bayesian Persuasion for Algorithmic Recourse | Unknown | N/A | |
| Deep Hierarchical Planning from Pixels | Unknown | N/A | |
| Noise Attention Learning: Enhancing Noise Robustness by Gradient Scaling | Unknown | N/A | |
| Neural Basis Models for Interpretability | Unknown | N/A | |
| Hierarchical classification at multiple operating points | Unknown | N/A | |
| Information-Theoretic GAN Compression with Variational Energy-based Model | Unknown | N/A | |
| Redistribution of Weights and Activations for AdderNet Quantization | Unknown | N/A | |
| Deep invariant networks with differentiable augmentation layers | Unknown | N/A | |
| Convergence beyond the over-parameterized regime using Rayleigh quotients | Unknown | N/A | |
| Robust $\phi$-Divergence MDPs | Unknown | N/A | |
| ToDD: Topological Compound Fingerprinting in Computer-Aided Drug Discovery | Unknown | N/A | |
| On Privacy and Personalization in Cross-Silo Federated Learning | Unknown | N/A | |
| Differentially Private Covariance Revisited | Unknown | N/A | |
| Learning Graph-embedded Key-event Back-tracing for Object Tracking in Event Clouds | Unknown | N/A | |
| Distributional Convergence of the Sliced Wasserstein Process | Unknown | N/A | |
| Homomorphic Matrix Completion | Unknown | N/A | |
| Transfer Learning on Heterogeneous Feature Spaces for Treatment Effects Estimation | Unknown | N/A | |
| On the Identifiability of Nonlinear ICA: Sparsity and Beyond | Unknown | N/A | |
| Museformer: Transformer with Fine- and Coarse-Grained Attention for Music Generation | Unknown | N/A | |
| Towards Diverse and Faithful One-shot Adaption of Generative Adversarial Networks | Unknown | N/A | |
| Dance of SNN and ANN: Solving binding problem by combining spike timing and reconstructive attention | Unknown | N/A | |
| Efficient Sampling on Riemannian Manifolds via Langevin MCMC | Unknown | N/A | |
| ATD: Augmenting CP Tensor Decomposition by Self Supervision | Unknown | N/A | |
| Imitating Past Successes can be Very Suboptimal | Unknown | N/A | |
| RKHS-SHAP: Shapley Values for Kernel Methods | Unknown | N/A | |
| SAPD+: An Accelerated Stochastic Method for Nonconvex-Concave Minimax Problems | Unknown | N/A | |
| On Scalable Testing of Samplers | Unknown | N/A | |
| Markovian Interference in Experiments | Unknown | N/A | |
| DP-PCA: Statistically Optimal and Differentially Private PCA | Unknown | N/A | |
| Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning | Unknown | N/A | |
| Continual learning: a feature extraction formalization, an efficient algorithm, and fundamental obstructions | Unknown | N/A | |
| Functional Ensemble Distillation | Unknown | N/A | |
| Self-explaining deep models with logic rule reasoning | Unknown | N/A | |
| Benign Underfitting of Stochastic Gradient Descent | Unknown | N/A | |
| Modeling the Machine Learning Multiverse | Unknown | N/A | |
| Stability Analysis and Generalization Bounds of Adversarial Training | Unknown | N/A | |
| Exact Shape Correspondence via 2D graph convolution | Unknown | N/A | |
| A Theoretical Understanding of Gradient Bias in Meta-Reinforcement Learning | Unknown | N/A | |
| How and Why to Manipulate Your Own Agent: On the Incentives of Users of Learning Agents | Unknown | N/A | |
| MissDAG: Causal Discovery in the Presence of Missing Data with Continuous Additive Noise Models | Unknown | N/A | |
| Phase diagram of Stochastic Gradient Descent in high-dimensional two-layer neural networks | Unknown | N/A | |
| Stochastic Online Learning with Feedback Graphs: Finite-Time and Asymptotic Optimality | Unknown | N/A | |
| Spectral Bias Outside the Training Set for Deep Networks in the Kernel Regime | Unknown | N/A | |
| First Hitting Diffusion Models for Generating Manifold, Graph and Categorical Data | Unknown | N/A | |
| Universal Rates for Interactive Learning | Unknown | N/A | |
| DGD^2: A Linearly Convergent Distributed Algorithm For High-dimensional Statistical Recovery | Unknown | N/A | |
| Single-Stage Visual Relationship Learning using Conditional Queries | Unknown | N/A | |
| Pruning has a disparate impact on model accuracy | Unknown | N/A | |
| Teacher Forcing Recovers Reward Functions for Text Generation | Unknown | N/A | |
| Model-based RL with Optimistic Posterior Sampling: Structural Conditions and Sample Complexity | Unknown | N/A | |
| Optimal Dynamic Regret in LQR Control | Unknown | N/A | |
| Generalization Gap in Amortized Inference | Unknown | N/A | |
| Near-Optimal Private and Scalable $k$-Clustering | Unknown | N/A | |
| Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners | Unknown | N/A | |
| Hedging as Reward Augmentation in Probabilistic Graphical Models | Unknown | N/A | |
| Training Subset Selection for Weak Supervision | Unknown | N/A | |
| Online Reinforcement Learning for Mixed Policy Scopes | Unknown | N/A | |
| Branch & Learn for Recursively and Iteratively Solvable Problems in Predict+Optimize | Unknown | N/A | |
| Your Out-of-Distribution Detection Method is Not Robust! | Unknown | N/A | |
| An efficient graph generative model for navigating ultra-large combinatorial synthesis libraries | Unknown | N/A | |
| Communication-Efficient Topologies for Decentralized Learning with $O(1)$ Consensus Rate | Unknown | N/A | |
| Rethinking the Reverse-engineering of Trojan Triggers | Unknown | N/A | |
| Decentralized, Communication- and Coordination-free Learning in Structured Matching Markets | Unknown | N/A | |
| On the Epistemic Limits of Personalized Prediction | Unknown | N/A | |
| Learning to Mitigate AI Collusion on Economic Platforms | Unknown | N/A | |
| STNDT: Modeling Neural Population Activity with Spatiotemporal Transformers | Unknown | N/A | |
| Masked Autoencoding for Scalable and Generalizable Decision Making | Unknown | N/A | |
| Computationally Efficient Horizon-Free Reinforcement Learning for Linear Mixture MDPs | Unknown | N/A | |
| DiSC: Differential Spectral Clustering of Features | Unknown | N/A | |
| Personalized Online Federated Learning with Multiple Kernels | Unknown | N/A | |
| Patching open-vocabulary models by interpolating weights | Unknown | N/A | |
| Concrete Score Matching: Generalized Score Matching for Discrete Data | Unknown | N/A | |
| LBD: Decouple Relevance and Observation for Individual-Level Unbiased Learning to Rank | Unknown | N/A | |
| Palm up: Playing in the Latent Manifold for Unsupervised Pretraining | Unknown | N/A | |
| Focal Modulation Networks | Unknown | N/A | |
| S2P: State-conditioned Image Synthesis for Data Augmentation in Offline Reinforcement Learning | Unknown | N/A | |
| Exploitability Minimization in Games and Beyond | Unknown | N/A | |
| FeLMi: Few shot Learning with hard Mixup | Unknown | N/A | |
| The First Optimal Acceleration of High-Order Methods in Smooth Convex Optimization | Unknown | N/A | |
| On Optimal Learning Under Targeted Data Poisoning | Unknown | N/A | |
| The computational and learning benefits of Daleian neural networks | Unknown | N/A | |
| Support Recovery in Sparse PCA with Incomplete Data | Unknown | N/A | |
| Missing Data Imputation and Acquisition with Deep Hierarchical Models and Hamiltonian Monte Carlo | Unknown | N/A | |
| Private Isotonic Regression | Unknown | N/A | |
| Do Residual Neural Networks discretize Neural Ordinary Differential Equations? | Unknown | N/A | |
| Continuous MDP Homomorphisms and Homomorphic Policy Gradient | Unknown | N/A | |
| Exponentially Improving the Complexity of Simulating the Weisfeiler-Lehman Test with Graph Neural Networks | Unknown | N/A | |
| Look where you look! Saliency-guided Q-networks for generalization in visual Reinforcement Learning | Unknown | N/A | |
| Constrained GPI for Zero-Shot Transfer in Reinforcement Learning | Unknown | N/A | |
| Predicting Cellular Responses to Novel Drug Perturbations at a Single-Cell Resolution | Unknown | N/A | |
| A Boosting Approach to Reinforcement Learning | Unknown | N/A | |
| DataMUX: Data Multiplexing for Neural Networks | Unknown | N/A | |
| Are Defenses for Graph Neural Networks Robust? | Unknown | N/A | |
| Adversarial Robustness is at Odds with Lazy Training | Unknown | N/A | |
| Robust Reinforcement Learning using Offline Data | Unknown | N/A | |
| Lifting the Information Ratio: An Information-Theoretic Analysis of Thompson Sampling for Contextual Bandits | Unknown | N/A | |
| Fine-tuning language models to find agreement among humans with diverse preferences | Unknown | N/A | |
| Tsetlin Machine for Solving Contextual Bandit Problems | Unknown | N/A | |
| Multi-Class $H$-Consistency Bounds | Unknown | N/A | |
| Statistical, Robustness, and Computational Guarantees for Sliced Wasserstein Distances | Unknown | N/A | |
| Lifting Weak Supervision To Structured Prediction | Unknown | N/A | |
| Learning Concept Credible Models for Mitigating Shortcuts | Unknown | N/A | |
| LST: Ladder Side-Tuning for Parameter and Memory Efficient Transfer Learning | Unknown | N/A | |
| Disentangling Transfer in Continual Reinforcement Learning | Unknown | N/A | |
| Distinguishing discrete and continuous behavioral variability using warped autoregressive HMMs | Unknown | N/A | |
| Off-Team Learning | Unknown | N/A | |
| LAMP: Extracting Text from Gradients with Language Model Priors | Unknown | N/A | |
| 4D Unsupervised Object Discovery | Unknown | N/A | |
| Bayesian subset selection and variable importance for interpretable prediction and classification | Unknown | N/A | |
| Unifying Voxel-based Representation with Transformer for 3D Object Detection | Unknown | N/A | |
| Multi-Scale Adaptive Network for Single Image Denoising | Unknown | N/A | |
| Improving Out-of-Distribution Generalization by Adversarial Training with Structured Priors | Unknown | N/A | |
| Efficient Graph Similarity Computation with Alignment Regularization | Unknown | N/A | |
| Revisiting Realistic Test-Time Training: Sequential Inference and Adaptation by Anchored Clustering | Unknown | N/A | |
| Falconn++: A Locality-sensitive Filtering Approach for Approximate Nearest Neighbor Search | Unknown | N/A | |
| Natural image synthesis for the retina with variational information bottleneck representation | Unknown | N/A | |
| A Lower Bound of Hash Codes' Performance | Unknown | N/A | |
| I2Q: A Fully Decentralized Q-Learning Algorithm | Unknown | N/A | |
| Shadow Knowledge Distillation: Bridging Offline and Online Knowledge Transfer | Unknown | N/A | |
| A Differentiable Semantic Metric Approximation in Probabilistic Embedding for Cross-Modal Retrieval | Unknown | N/A | |
| Heterogeneous Skill Learning for Multi-agent Tasks | Unknown | N/A | |
| Are You Stealing My Model? Sample Correlation for Fingerprinting Deep Neural Networks | Unknown | N/A | |
| Model-Based Opponent Modeling | Unknown | N/A | |
| When Adversarial Training Meets Vision Transformers: Recipes from Training to Architecture | Unknown | N/A | |
| Learning Invariant Graph Representations for Out-of-Distribution Generalization | Unknown | N/A | |
| Learning from Future: A Novel Self-Training Framework for Semantic Segmentation | Unknown | N/A | |
| Geo-Neus: Geometry-Consistent Neural Implicit Surfaces Learning for Multi-view Reconstruction | Unknown | N/A | |
| Wasserstein Iterative Networks for Barycenter Estimation | Unknown | N/A | |
| Personalized Federated Learning towards Communication Efficiency, Robustness and Fairness | Unknown | N/A | |
| Analyzing Sharpness along GD Trajectory: Progressive Sharpening and Edge of Stability | Unknown | N/A | |
| Rashomon Capacity: A Metric for Predictive Multiplicity in Classification | Unknown | N/A | |
| Pre-trained Adversarial Perturbations | Unknown | N/A | |
| Conformal Frequency Estimation with Sketched Data | Unknown | N/A | |
| Convergent Representations of Computer Programs in Human and Artificial Neural Networks | Unknown | N/A | |
| tntorch: Tensor Network Learning with PyTorch | Unknown | N/A | |
| Monte Carlo Tree Search based Variable Selection for High Dimensional Bayesian Optimization | Unknown | N/A | |
| Joint Entropy Search for Multi-Objective Bayesian Optimization | Unknown | N/A | |
| Embracing Consistency: A One-Stage Approach for Spatio-Temporal Video Grounding | Unknown | N/A | |
| A Closer Look at the Adversarial Robustness of Deep Equilibrium Models | Unknown | N/A | |
| Language Conditioned Spatial Relation Reasoning for 3D Object Grounding | Unknown | N/A | |
| [Re] Projection-based Algorithm for Updating the TruncatedSVD of Evolving Matrices | Unknown | N/A | |
| Unsupervised Point Cloud Completion and Segmentation by Generative Adversarial Autoencoding Network | Unknown | N/A | |
| Audio-Driven Co-Speech Gesture Video Generation | Unknown | N/A | |
| Dynamic Graph Neural Networks Under Spatio-Temporal Distribution Shift | Unknown | N/A | |
| On the role of overparameterization in off-policy Temporal Difference learning with linear function approximation | Unknown | N/A | |
| Quantized Training of Gradient Boosting Decision Trees | Unknown | N/A | |
| InterpretDL: Explaining Deep Models in PaddlePaddle | Unknown | N/A | |
| Efficient Change-Point Detection for Tackling Piecewise-Stationary Bandits | Unknown | N/A | |
| A Unified Statistical Learning Model for Rankings and Scores with Application to Grant Panel Review | Unknown | N/A | |
| LSAR: Efficient Leverage Score Sampling Algorithm for the Analysis of Big Time Series Data | Unknown | N/A | |
| D-GCCA: Decomposition-based Generalized Canonical Correlation Analysis for Multi-view High-dimensional Data | Unknown | N/A | |
| Supervised Dimensionality Reduction and Visualization using Centroid-Encoder | Unknown | N/A | |
| Foolish Crowds Support Benign Overfitting | Unknown | N/A | |
| Rethinking Nonlinear Instrumental Variable Models through Prediction Validity | Unknown | N/A | |
| [Re] Reproduction and Extension of "Queens are Powerful too: Mitigating Gender Bias in Dialogue Generation" | Unknown | N/A | |
| [Re] An Implementation of Fair Robust Learning | Unknown | N/A | |
| GAMA: Generative Adversarial Multi-Object Scene Attacks | Unknown | N/A | |
| Point-M2AE: Multi-scale Masked Autoencoders for Hierarchical Point Cloud Pre-training | Unknown | N/A | |
| An Empirical Study on Disentanglement of Negative-free Contrastive Learning | Unknown | N/A | |
| Social-Inverse: Inverse Decision-making of Social Contagion Management with Task Migrations | Unknown | N/A | |
| Bayesian Risk Markov Decision Processes | Unknown | N/A | |
| SHINE: SubHypergraph Inductive Neural nEtwork | Unknown | N/A | |
| Singular Value Fine-tuning: Few-shot Segmentation requires Few-parameters Fine-tuning | Unknown | N/A | |
| Non-deep Networks | Unknown | N/A | |
| Feature-Proxy Transformer for Few-Shot Segmentation | Unknown | N/A | |
| Estimating and Explaining Model Performance When Both Covariates and Labels Shift | Unknown | N/A | |
| The alignment property of SGD noise and how it helps select flat minima: A stability analysis | Unknown | N/A | |
| Thompson Sampling Efficiently Learns to Control Diffusion Processes | Unknown | N/A | |
| Fair and Efficient Allocations Without Obvious Manipulations | Unknown | N/A | |
| Fuzzy Learning Machine | Unknown | N/A | |
| ASPiRe: Adaptive Skill Priors for Reinforcement Learning | Unknown | N/A | |
| Sound and Complete Causal Identification with Latent Variables Given Local Background Knowledge | Unknown | N/A | |
| Explainability Via Causal Self-Talk | Unknown | N/A | |
| ALIFE: Adaptive Logit Regularizer and Feature Replay for Incremental Semantic Segmentation | Unknown | N/A | |
| Causal Identification under Markov equivalence: Calculus, Algorithm, and Completeness | Unknown | N/A | |
| Random Normalization Aggregation for Adversarial Defense | Unknown | N/A | |
| Size and depth of monotone neural networks: interpolation and approximation | Unknown | N/A | |
| Spartan: Differentiable Sparsity via Regularized Transportation | Unknown | N/A | |
| On Gap-dependent Bounds for Offline Reinforcement Learning | Unknown | N/A | |
| Revisiting Sliced Wasserstein on Images: From Vectorization to Convolution | Unknown | N/A | |
| Accelerated Projected Gradient Algorithms for Sparsity Constrained Optimization Problems | Unknown | N/A | |
| Context-Based Dynamic Pricing with Partially Linear Demand Model | Unknown | N/A | |
| S-PIFu: Integrating Parametric Human Models with PIFu for Single-view Clothed Human Reconstruction | Unknown | N/A | |
| Action-modulated midbrain dopamine activity arises from distributed control policies | Unknown | N/A | |
| NUWA-Infinity: Autoregressive over Autoregressive Generation for Infinite Visual Synthesis | Unknown | N/A | |
| Bezier Gaussian Processes for Tall and Wide Data | Unknown | N/A | |
| Minimax Regret for Cascading Bandits | Unknown | N/A | |
| Pre-activation Distributions Expose Backdoor Neurons | Unknown | N/A | |
| Posterior Matching for Arbitrary Conditioning | Unknown | N/A | |
| Alternating Mirror Descent for Constrained Min-Max Games | Unknown | N/A | |
| Early Stage Convergence and Global Convergence of Training Mildly Parameterized Neural Networks | Unknown | N/A | |
| Foundation Posteriors for Approximate Probabilistic Inference | Unknown | N/A | |
| Entropy-Driven Mixed-Precision Quantization for Deep Network Design | Unknown | N/A | |
| Denoising Diffusion Restoration Models | Unknown | N/A | |
| Syndicated Bandits: A Framework for Auto Tuning Hyper-parameters in Contextual Bandit Algorithms | Unknown | N/A | |
| A Combinatorial Perspective on the Optimization of Shallow ReLU Networks | Unknown | N/A | |
| A Lagrangian Duality Approach to Active Learning | Unknown | N/A | |
| A Statistical Online Inference Approach in Averaged Stochastic Approximation | Unknown | N/A | |
| Theoretical analysis of deep neural networks for temporally dependent observations | Unknown | N/A | |
| On the Complexity of Adversarial Decision Making | Unknown | N/A | |
| Scalable Distributional Robustness in a Class of Non-Convex Optimization with Guarantees | Unknown | N/A | |
| Uncovering the Structural Fairness in Graph Contrastive Learning | Unknown | N/A | |
| ULNeF: Untangled Layered Neural Fields for Mix-and-Match Virtual Try-On | Unknown | N/A | |
| Tree ensemble kernels for Bayesian optimization with known constraints over mixed-feature spaces | Unknown | N/A | |
| Better Uncertainty Calibration via Proper Scores for Classification and Beyond | Unknown | N/A | |
| Augmenting Online Algorithms with $\varepsilon$-Accurate Predictions | Unknown | N/A | |
| Asymptotics of $\ell_2$ Regularized Network Embeddings | Unknown | N/A | |
| Deep Counterfactual Estimation with Categorical Background Variables | Unknown | N/A | |
| What is a Good Metric to Study Generalization of Minimax Learners? | Unknown | N/A | |
| Non-Convex Bilevel Games with Critical Point Selection Maps | Unknown | N/A | |
| The Franz-Parisi Criterion and Computational Trade-offs in High Dimensional Statistics | Unknown | N/A | |
| Provable Defense against Backdoor Policies in Reinforcement Learning | Unknown | N/A | |
| IMED-RL: Regret optimal learning of ergodic Markov decision processes | Unknown | N/A | |
| Leveraging the Hints: Adaptive Bidding in Repeated First-Price Auctions | Unknown | N/A | |
| A Closer Look at Learned Optimization: Stability, Robustness, and Inductive Biases | Unknown | N/A | |
| Distributed Learning of Finite Gaussian Mixtures | Unknown | N/A | |
| [Re] Reproducibility Report: Contrastive Learning of Socially-aware Motion Representations | Unknown | N/A | |
| [Re] GANSpace: Discovering Interpretable GAN Controls | Unknown | N/A | |
| [Re] Reproducibility Study of “Counterfactual Generative Networks” | Unknown | N/A | |
| [Re] Does Self-Supervision Always Improve Few-Shot Learning? | Unknown | N/A | |
| On Kernelized Multi-Armed Bandits with Constraints | Unknown | N/A | |
| A Nonconvex Framework for Structured Dynamic Covariance Recovery | Unknown | N/A | |
| A Mean-Field Game Approach to Cloud Resource Management with Function Approximation | Unknown | N/A | |
| CoupAlign: Coupling Word-Pixel with Sentence-Mask Alignments for Referring Image Segmentation | Unknown | N/A | |
| Fast and Robust Rank Aggregation against Model Misspecification | Unknown | N/A | |
| Poisson Flow Generative Models | Unknown | N/A | |
| Boosting Out-of-distribution Detection with Typical Features | Unknown | N/A | |
| Joint Estimation and Inference for Data Integration Problems based on Multiple Multi-layered Gaussian Graphical Models | Unknown | N/A | |
| Conditional Diffusion Process for Inverse Halftoning | Unknown | N/A | |
| Knowledge Distillation from A Stronger Teacher | Unknown | N/A | |
| Dynamic pricing and assortment under a contextual MNL demand | Unknown | N/A | |
| Temporal Effective Batch Normalization in Spiking Neural Networks | Unknown | N/A | |
| [Re] Replication Study of "Fairness and Bias in Online Selection" | Unknown | N/A | |
| The trade-offs of model size in large recommendation models: 100GB to 10MB Criteo-tb DLRM model | Unknown | N/A | |
| Neural Transmitted Radiance Fields | Unknown | N/A | |
| 🏘️ ProcTHOR: Large-Scale Embodied AI Using Procedural Generation | Unknown | N/A | |
| [Re] Exacerbating Algorithmic Bias through Fairness Attacks | Unknown | N/A | |
| Transfer Learning in Information Criteria-based Feature Selection | Unknown | N/A | |
| Learning with little mixing | Unknown | N/A | |
| Fairness-Aware PAC Learning from Corrupted Data | Unknown | N/A | |
| Online Frank-Wolfe with Arbitrary Delays | Unknown | N/A | |
| Online Allocation and Learning in the Presence of Strategic Agents | Unknown | N/A | |
| Weisfeiler and Leman Go Walking: Random Walk Kernels Revisited | Unknown | N/A | |
| Sufficient reductions in regression with mixed predictors | Unknown | N/A | |
| Optimal Query Complexities for Dynamic Trace Estimation | Unknown | N/A | |
| Understanding Benign Overfitting in Gradient-Based Meta Learning | Unknown | N/A | |
| Generalised Mutual Information for Discriminative Clustering | Unknown | N/A | |
| Prune and distill: similar reformatting of image information along rat visual cortex and deep neural networks | Unknown | N/A | |
| ShuffleMixer: An Efficient ConvNet for Image Super-Resolution | Unknown | N/A | |
| Minimax Optimal Fixed-Budget Best Arm Identification in Linear Bandits | Unknown | N/A | |
| [Re] Strategic classification made practical: reproduction | Unknown | N/A | |
| Communication-efficient distributed eigenspace estimation with arbitrary node failures | Unknown | N/A | |
| Generalised Implicit Neural Representations | Unknown | N/A | |
| AttCAT: Explaining Transformers via Attentive Class Activation Tokens | Unknown | N/A | |
| [Re] Explaining in Style: Training a GAN to explain a classifier in StyleSpace | Unknown | N/A | |
| Score-Based Generative Models Detect Manifolds | Unknown | N/A | |
| Geodesic Graph Neural Network for Efficient Graph Representation Learning | Unknown | N/A | |
| [Re] Nondeterminism and Instability in Neural Network Optimization | Unknown | N/A | |
| Diffusion-based Molecule Generation with Informative Prior Bridges | Unknown | N/A | |
| [Re] Learning Unknown from Correlations: Graph Neural Network for Inter-novel-protein Interaction Prediction | Unknown | N/A | |
| The Importance of Being Correlated: Implications of Dependence in Joint Spectral Inference across Multiple Networks | Unknown | N/A | |
| Zero-shot Transfer Learning within a Heterogeneous Graph via Knowledge Transfer Networks | Unknown | N/A | |
| Provably sample-efficient RL with side information about latent dynamics | Unknown | N/A | |
| Signal Processing for Implicit Neural Representations | Unknown | N/A | |
| Embrace the Gap: VAEs Perform Independent Mechanism Analysis | Unknown | N/A | |
| Exponential Separations in Symmetric Neural Networks | Unknown | N/A | |
| [Re] Understanding Self-Supervised Learning Dynamics without Contrastive Pairs | Unknown | N/A | |
| [Re] Exacerbating Algorithmic Bias through Fairness Attacks | Unknown | N/A | |
| Transformers from an Optimization Perspective | Unknown | N/A | |
| (f,Gamma)-Divergences: Interpolating between f-Divergences and Integral Probability Metrics | Unknown | N/A | |
| PointNeXt: Revisiting PointNet++ with Improved Training and Scaling Strategies | Unknown | N/A | |
| [Re] Transparent Object Tracking Benchmark | Unknown | N/A | |
| Neural Lyapunov Control of Unknown Nonlinear Systems with Stability Guarantees | Unknown | N/A | |
| [Re] Graph Edit Networks | Unknown | N/A | |
| [Re] Reproduction Study of Variational Fair Clustering | Unknown | N/A | |
| BR-SNIS: Bias Reduced Self-Normalized Importance Sampling | Unknown | N/A | |
| Fair and Optimal Decision Trees: A Dynamic Programming Approach | Unknown | N/A | |
| Brain Network Transformer | Unknown | N/A | |
| Learning to Navigate Wikipedia by Taking Random Walks | Unknown | N/A | |
| The Neural Testbed: Evaluating Joint Predictions | Unknown | N/A | |
| A Bregman Learning Framework for Sparse Neural Networks | Unknown | N/A | |
| [Re] Learning to count everything | Unknown | N/A | |
| On the Double Descent of Random Features Models Trained with SGD | Unknown | N/A | |
NIPS 2023
| Title | Author | PDF_Link | Code_URL |
|---|---|---|---|
| SEEDS: Exponential SDE Solvers for Fast High-Quality Sampling from Diffusion Models | Unknown | N/A | |
| Gacs-Korner Common Information Variational Autoencoder | Unknown | N/A | |
| Experimental Designs for Heteroskedastic Variance | Unknown | N/A | |
| Estimating Causal Effects Identifiable from a Combination of Observations and Experiments | Unknown | N/A | |
| Equal Opportunity of Coverage in Fair Regression | Unknown | N/A | |
| Last-Iterate Convergent Policy Gradient Primal-Dual Methods for Constrained MDPs | Unknown | N/A | |
| Differentiable sorting for censored time-to-event data. | Unknown | N/A | |
| On the Role of Randomization in Adversarially Robust Classification | Unknown | N/A | |
| Polyhedron Attention Module: Learning Adaptive-order Interactions | Unknown | N/A | |
| On the Robustness of Removal-Based Feature Attributions | Unknown | N/A | |
| Equivariant Single View Pose Prediction Via Induced and Restriction Representations | Unknown | N/A | |
| Is RLHF More Difficult than Standard RL? A Theoretical Perspective | Unknown | N/A | |
| Multimodal Deep Learning Model Unveils Behavioral Dynamics of V1 Activity in Freely Moving Mice | Unknown | N/A | |
| Spuriosity Didn’t Kill the Classifier: Using Invariant Predictions to Harness Spurious Features | Unknown | N/A | |
| A Regularized Conditional GAN for Posterior Sampling in Image Recovery Problems | Unknown | N/A | |
| Resilient Multiple Choice Learning: A learned scoring scheme with application to audio scene analysis | Unknown | N/A | |
| P-Flow: A Fast and Data-Efficient Zero-Shot TTS through Speech Prompting | Unknown | N/A | |
| Label Poisoning is All You Need | Unknown | N/A | |
| Instructing Goal-Conditioned Reinforcement Learning Agents with Temporal Logic Objectives | Unknown | N/A | |
| Equivariant Adaptation of Large Pretrained Models | Unknown | N/A | |
| Exploring Geometry of Blind Spots in Vision models | Unknown | N/A | |
| Are aligned neural networks adversarially aligned? | Unknown | N/A | |
| SaVeNet: A Scalable Vector Network for Enhanced Molecular Representation Learning | Unknown | N/A | |
| End-to-End Meta-Bayesian Optimisation with Transformer Neural Processes | Unknown | N/A | |
| Fast Attention Over Long Sequences With Dynamic Sparse Flash Attention | Unknown | N/A | |
| Improvements on Uncertainty Quantification for Node Classification via Distance Based Regularization | Unknown | N/A | |
| Relax, it doesn’t matter how you get there: A new self-supervised approach for multi-timescale behavior analysis | Unknown | N/A | |
| Spatial-frequency channels, shape bias, and adversarial robustness | Unknown | N/A | |
| $\varepsilon$-fractional core stability in Hedonic Games. | Unknown | N/A | |
| Beyond Invariance: Test-Time Label-Shift Adaptation for Addressing "Spurious" Correlations | Unknown | N/A | |
| Language Model Alignment with Elastic Reset | Unknown | N/A | |
| Diffusion Hyperfeatures: Searching Through Time and Space for Semantic Correspondence | Unknown | N/A | |
| Understanding Deep Gradient Leakage via Inversion Influence Functions | Unknown | N/A | |
| Linguistic Binding in Diffusion Models: Enhancing Attribute Correspondence through Attention Map Alignment | Unknown | N/A | |
| Joint Prompt Optimization of Stacked LLMs using Variational Inference | Unknown | N/A | |
| PROTES: Probabilistic Optimization with Tensor Sampling | Unknown | N/A | |
| A unified framework for information-theoretic generalization bounds | Unknown | N/A | |
| The Simplicity Bias in Multi-Task RNNs: Shared Attractors, Reuse of Dynamics, and Geometric Representation | Unknown | N/A | |
| Variational Gaussian processes for linear inverse problems | Unknown | N/A | |
| Enhancing CLIP with CLIP: Exploring Pseudolabeling for Limited-Label Prompt Tuning | Unknown | N/A | |
| Adaptive SGD with Polyak stepsize and Line-search: Robust Convergence and Variance Reduction | Unknown | N/A | |
| First Order Stochastic Optimization with Oblivious Noise | Unknown | N/A | |
| Direct Preference Optimization: Your Language Model is Secretly a Reward Model | Unknown | N/A | |
| Adaptive recurrent vision performs zero-shot computation scaling to unseen difficulty levels | Unknown | N/A | |
| Intrinsic Dimension Estimation for Robust Detection of AI-Generated Texts | Unknown | N/A | |
| Sharp Recovery Thresholds of Tensor PCA Spectral Algorithms | Unknown | N/A | |
| Robustifying Generalizable Implicit Shape Networks with a Tunable Non-Parametric Model | Unknown | N/A | |
| Convex and Non-convex Optimization Under Generalized Smoothness | Unknown | N/A | |
| Reflexion: language agents with verbal reinforcement learning | Unknown | N/A | |
| Ordering-based Conditions for Global Convergence of Policy Gradient Methods | Unknown | N/A | |
| Topology-Aware Uncertainty for Image Segmentation | Unknown | N/A | |
| Fast and Simple Spectral Clustering in Theory and Practice | Unknown | N/A | |
| Information Geometry of the Retinal Representation Manifold | Unknown | N/A | |
| Smooth, exact rotational symmetrization for deep learning on point clouds | Unknown | N/A | |
| Convergence of Adam Under Relaxed Assumptions | Unknown | N/A | |
| Latent SDEs on Homogeneous Spaces | Unknown | N/A | |
| Align Your Prompts: Test-Time Prompting with Distribution Alignment for Zero-Shot Generalization | Unknown | N/A | |
| Bicriteria Approximation Algorithms for the Submodular Cover Problem | Unknown | N/A | |
| Neural Sampling in Hierarchical Exponential-family Energy-based Models | Unknown | N/A | |
| On Separate Normalization in Self-supervised Transformers | Unknown | N/A | |
| Tester-Learners for Halfspaces: Universal Algorithms | Unknown | N/A | |
| Universal Online Learning with Gradient Variations: A Multi-layer Online Ensemble Approach | Unknown | N/A | |
| Online List Labeling with Predictions | Unknown | N/A | |
| Energy-based learning algorithms for analog computing: a comparative study | Unknown | N/A | |
| Outlier-Robust Wasserstein DRO | Unknown | N/A | |
| MoCa: Measuring Human-Language Model Alignment on Causal and Moral Judgment Tasks | Unknown | N/A | |
| Conformal Prediction Sets for Ordinal Classification | Unknown | N/A | |
| Graph of Circuits with GNN for Exploring the Optimal Design Space | Unknown | N/A | |
| Multi-Objective Intrinsic Reward Learning for Conversational Recommender Systems | Unknown | N/A | |
| Online learning of long-range dependencies | Unknown | N/A | |
| ExPT: Synthetic Pretraining for Few-Shot Experimental Design | Unknown | N/A | |
| Incentivized Communication for Federated Bandits | Unknown | N/A | |
| Learning Provably Robust Estimators for Inverse Problems via Jittering | Unknown | N/A | |
| On the Exploitability of Instruction Tuning | Unknown | N/A | |
| Ensemble-based Deep Reinforcement Learning for Vehicle Routing Problems under Distribution Shift | Unknown | N/A | |
| Precision-Recall Divergence Optimization for Generative Modeling with GANs and Normalizing Flows | Unknown | N/A | |
| Human spatiotemporal pattern learning as probabilistic program synthesis | Unknown | N/A | |
| Neural Multi-Objective Combinatorial Optimization with Diversity Enhancement | Unknown | N/A | |
| Quantification of Uncertainty with Adversarial Models | Unknown | N/A | |
| Are Vision Transformers More Data Hungry Than Newborn Visual Systems? | Unknown | N/A | |
| $\textbf{A}^2\textbf{CiD}^2$: Accelerating Asynchronous Communication in Decentralized Deep Learning | Unknown | N/A | |
| Correlation Aware Sparsified Mean Estimation Using Random Projection | Unknown | N/A | |
| Provably Efficient Offline Reinforcement Learning in Regular Decision Processes | Unknown | N/A | |
| SHAP-IQ: Unified Approximation of any-order Shapley Interactions | Unknown | N/A | |
| Diffusion Model is an Effective Planner and Data Synthesizer for Multi-Task Reinforcement Learning | Unknown | N/A | |
| Time Series Kernels based on Nonlinear Vector AutoRegressive Delay Embeddings | Unknown | N/A | |
| Topological Obstructions and How to Avoid Them | Unknown | N/A | |
| On Transfer of Adversarial Robustness from Pretraining to Downstream Tasks | Unknown | N/A | |
| Generalized test utilities for long-tail performance in extreme multi-label classification | Unknown | N/A | |
| Perceptual adjustment queries and an inverted measurement paradigm for low-rank metric learning | Unknown | N/A | |
| Uniform-in-Time Wasserstein Stability Bounds for (Noisy) Stochastic Gradient Descent | Unknown | N/A | |
| What is the Inductive Bias of Flatness Regularization? A Study of Deep Matrix Factorization Models | Unknown | N/A | |
| Tree-Rings Watermarks: Invisible Fingerprints for Diffusion Images | Unknown | N/A | |
| Optimality of Message-Passing Architectures for Sparse Graphs | Unknown | N/A | |
| Sketching Algorithms for Sparse Dictionary Learning: PTAS and Turnstile Streaming | Unknown | N/A | |
| Alternating Gradient Descent and Mixture-of-Experts for Integrated Multimodal Perception | Unknown | N/A | |
| Accelerating Exploration with Unlabeled Prior Data | Unknown | N/A | |
| Accessing Higher Dimensions for Unsupervised Word Translation | Unknown | N/A | |
| D-CIPHER: Discovery of Closed-form Partial Differential Equations | Unknown | N/A | |
| Hard Prompts Made Easy: Gradient-Based Discrete Optimization for Prompt Tuning and Discovery | Unknown | N/A | |
| DiffAttack: Evasion Attacks Against Diffusion-Based Adversarial Purification | Unknown | N/A | |
| Reversible and irreversible bracket-based dynamics for deep graph neural networks | Unknown | N/A | |
| UP-NeRF: Unconstrained Pose Prior-Free Neural Radiance Field | Unknown | N/A | |
| Stable Vectorization of Multiparameter Persistent Homology using Signed Barcodes as Measures | Unknown | N/A | |
| Training on Foveated Images Improves Robustness to Adversarial Attacks | Unknown | N/A | |
| Towards Combinatorial Generalization for Catalysts: A Kohn-Sham Charge-Density Approach | Unknown | N/A | |
| Wasserstein Quantum Monte Carlo: A Novel Approach for Solving the Quantum Many-Body Schrödinger Equation | Unknown | N/A | |
| A Theory of Link Prediction via Relational Weisfeiler-Leman on Knowledge Graphs | Unknown | N/A | |
| Creating a Public Repository for Joining Private Data | Unknown | N/A | |
| Can semi-supervised learning use all the data effectively? A lower bound perspective | Unknown | N/A | |
| Learning Reliable Logical Rules with SATNet | Unknown | N/A | |
| A Unified Approach for Maximizing Continuous DR-submodular Functions | Unknown | N/A | |
| Hardware Resilience Properties of Text-Guided Image Classifiers | Unknown | N/A | |
| Polynomial-Time Linear-Swap Regret Minimization in Imperfect-Information Sequential Games | Unknown | N/A | |
| Direct Preference-based Policy Optimization without Reward Modeling | Unknown | N/A | |
| Faster Query Times for Fully Dynamic $k$-Center Clustering with Outliers | Unknown | N/A | |
| Physics-Informed Bayesian Optimization of Variational Quantum Circuits | Unknown | N/A | |
| Why Does Sharpness-Aware Minimization Generalize Better Than SGD? | Unknown | N/A | |
| Attention as Implicit Structural Inference | Unknown | N/A | |
| Provable Advantage of Curriculum Learning on Parity Targets with Mixed Inputs | Unknown | N/A | |
| Transportability for Bandits with Data from Different Environments | Unknown | N/A | |
| FIRAL: An Active Learning Algorithm for Multinomial Logistic Regression | Unknown | N/A | |
| Conformal Meta-learners for Predictive Inference of Individual Treatment Effects | Unknown | N/A | |
| A Unified Model and Dimension for Interactive Estimation | Unknown | N/A | |
| Towards In-context Scene Understanding | Unknown | N/A | |
| Should Under-parameterized Student Networks Copy or Average Teacher Weights? | Unknown | N/A | |
| Memory Efficient Optimizers with 4-bit States | Unknown | N/A | |
| A Robust Exact Algorithm for the Euclidean Bipartite Matching Problem | Unknown | N/A | |
| Quantum speedups for stochastic optimization | Unknown | N/A | |
| Certified Robustness via Dynamic Margin Maximization and Improved Lipschitz Regularization | Unknown | N/A | |
| Convergence Analysis of Sequential Federated Learning on Heterogeneous Data | Unknown | N/A | |
| Convergence of Alternating Gradient Descent for Matrix Factorization | Unknown | N/A | |
| Optimal Rates for Bandit Nonstochastic Control | Unknown | N/A | |
| Causal Effect Identification in Uncertain Causal Networks | Unknown | N/A | |
| Robust Bayesian Satisficing | Unknown | N/A | |
| SPA: A Graph Spectral Alignment Perspective for Domain Adaptation | Unknown | N/A | |
| Incentives in Federated Learning: Equilibria, Dynamics, and Mechanisms for Welfare Maximization | Unknown | N/A | |
| Finding Order in Chaos: A Novel Data Augmentation Method for Time Series in Contrastive Learning | Unknown | N/A | |
| CoPriv: Network/Protocol Co-Optimization for Communication-Efficient Private Inference | Unknown | N/A | |
| GLIME: General, Stable and Local LIME Explanation | Unknown | N/A | |
| SatLM: Satisfiability-Aided Language Models Using Declarative Prompting | Unknown | N/A | |
| Robust and Actively Secure Serverless Collaborative Learning | Unknown | N/A | |
| Learning to Influence Human Behavior with Offline Reinforcement Learning | Unknown | N/A | |
| MKOR: Momentum-Enabled Kronecker-Factor-Based Optimizer Using Rank-1 Updates | Unknown | N/A | |
| Detection Based Part-level Articulated Object Reconstruction from Single RGBD Image | Unknown | N/A | |
| Flow-Attention-based Spatio-Temporal Aggregation Network for 3D Mask Detection | Unknown | N/A | |
| Federated Linear Bandits with Finite Adversarial Actions | Unknown | N/A | |
| Recommender Systems with Generative Retrieval | Unknown | N/A | |
| Interaction Measures, Partition Lattices and Kernel Tests for High-Order Interactions | Unknown | N/A | |
| Learning to Modulate pre-trained Models in RL | Unknown | N/A | |
| Learning Efficient Coding of Natural Images with Maximum Manifold Capacity Representations | Unknown | N/A | |
| The Rashomon Importance Distribution: Getting RID of Unstable, Single Model-based Variable Importance | Unknown | N/A | |
| Efficient Bayesian Learning Curve Extrapolation using Prior-Data Fitted Networks | Unknown | N/A | |
| Computing Optimal Equilibria and Mechanisms via Learning in Zero-Sum Extensive-Form Games | Unknown | N/A | |
| Double Randomized Underdamped Langevin with Dimension-Independent Convergence Guarantee | Unknown | N/A | |
| PDP: Parameter-free Differentiable Pruning is All You Need | Unknown | N/A | |
| StEik: Stabilizing the Optimization of Neural Signed Distance Functions and Finer Shape Representation | Unknown | N/A | |
| Efficient Beam Tree Recursion | Unknown | N/A | |
| Projection-Free Online Convex Optimization via Efficient Newton Iterations | Unknown | N/A | |
| Addressing the speed-accuracy simulation trade-off for adaptive spiking neurons | Unknown | N/A | |
| On the Size and Approximation Error of Distilled Datasets | Unknown | N/A | |
| Cascading Bandits: Optimizing Recommendation Frequency in Delayed Feedback Environments | Unknown | N/A | |
| Structured Neural Networks for Density Estimation and Causal Inference | Unknown | N/A | |
| Reusable Slotwise Mechanisms | Unknown | N/A | |
| Posterior Sampling for Competitive RL: Function Approximation and Partial Observation | Unknown | N/A | |
| A generative model of the hippocampal formation trained with theta driven local learning rules | Unknown | N/A | |
| The Bayesian Stability Zoo | Unknown | N/A | |
| Ignorance is Bliss: Robust Control via Information Gating | Unknown | N/A | |
| Double Gumbel Q-Learning | Unknown | N/A | |
| Optimistic Meta-Gradients | Unknown | N/A | |
| Recursion in Recursion: Two-Level Nested Recursion for Length Generalization with Scalability | Unknown | N/A | |
| Learning Mixtures of Gaussians Using the DDPM Objective | Unknown | N/A | |
| Scalable Transformer for PDE Surrogate Modeling | Unknown | N/A | |
| Spatio-Angular Convolutions for Super-resolution in Diffusion MRI | Unknown | N/A | |
| A Privacy-Friendly Approach to Data Valuation | Unknown | N/A | |
| Marginal Density Ratio for Off-Policy Evaluation in Contextual Bandits | Unknown | N/A | |
| Pruning vs Quantization: Which is Better? | Unknown | N/A | |
| Common Ground in Cooperative Communication | Unknown | N/A | |
| Scalable Fair Influence Maximization | Unknown | N/A | |
| Fast and Regret Optimal Best Arm Identification: Fundamental Limits and Low-Complexity Algorithms | Unknown | N/A | |
| Multiclass Boosting: Simple and Intuitive Weak Learning Criteria | Unknown | N/A | |
| Online Label Shift: Optimal Dynamic Regret meets Practical Algorithms | Unknown | N/A | |
| MeGraph: Capturing Long-Range Interactions by Alternating Local and Hierarchical Aggregation on Multi-Scaled Graph Hierarchy | Unknown | N/A | |
| Reverse Engineering Self-Supervised Learning | Unknown | N/A | |
| Model-Free Reinforcement Learning with the Decision-Estimation Coefficient | Unknown | N/A | |
| Accelerating Molecular Graph Neural Networks via Knowledge Distillation | Unknown | N/A | |
| Modality-Agnostic Self-Supervised Learning with Meta-Learned Masked Auto-Encoder | Unknown | N/A | |
| Locality Sensitive Hashing in Fourier Frequency Domain For Soft Set Containment Search | Unknown | N/A | |
| Does Localization Inform Editing? Surprising Differences in Causality-Based Localization vs. Knowledge Editing in Language Models | Unknown | N/A | |
| Partial Matrix Completion | Unknown | N/A | |
| Decision-Aware Actor-Critic with Function Approximation and Theoretical Guarantees | Unknown | N/A | |
| Group Fairness in Peer Review | Unknown | N/A | |
| Improved Frequency Estimation Algorithms with and without Predictions | Unknown | N/A | |
| Strategic Distribution Shift of Interacting Agents via Coupled Gradient Flows | Unknown | N/A | |
| Frequency Domain-Based Dataset Distillation | Unknown | N/A | |
| Similarity-based cooperative equilibrium | Unknown | N/A | |
| Simplifying Neural Network Training Under Class Imbalance | Unknown | N/A | |
| Domain Agnostic Fourier Neural Operators | Unknown | N/A | |
| On the Role of Entanglement and Statistics in Learning | Unknown | N/A | |
| A Partially-Supervised Reinforcement Learning Framework for Visual Active Search | Unknown | N/A | |
| Robust Data Valuation with Weighted Banzhaf Values | Unknown | N/A | |
| Nonparametric Boundary Geometry in Physics Informed Deep Learning | Unknown | N/A | |
| Exact Verification of ReLU Neural Control Barrier Functions | Unknown | N/A | |
| Enhancing Robot Program Synthesis Through Environmental Context | Unknown | N/A | |
| Efficient Data Subset Selection to Generalize Training Across Models: Transductive and Inductive Networks | Unknown | N/A | |
| BQ-NCO: Bisimulation Quotienting for Efficient Neural Combinatorial Optimization | Unknown | N/A | |
| Learning Shared Safety Constraints from Multi-task Demonstrations | Unknown | N/A | |
| Dis-inhibitory neuronal circuits can control the sign of synaptic plasticity | Unknown | N/A | |
| Design from Policies: Conservative Test-Time Adaptation for Offline Policy Optimization | Unknown | N/A | |
| NCDL: A Framework for Deep Learning on non-Cartesian Lattices | Unknown | N/A | |
| A Path to Simpler Models Starts With Noise | Unknown | N/A | |
| SafeDICE: Offline Safe Imitation Learning with Non-Preferred Demonstrations | Unknown | N/A | |
| Batchnorm Allows Unsupervised Radial Attacks | Unknown | N/A | |
| A Framework for Fast and Stable Representations of Multiparameter Persistent Homology Decompositions | Unknown | N/A | |
| Cognitive Model Discovery via Disentangled RNNs | Unknown | N/A | |
| Team-PSRO for Learning Approximate TMECor in Large Team Games via Cooperative Reinforcement Learning | Unknown | N/A | |
| Non-Convex Bilevel Optimization with Time-Varying Objective Functions | Unknown | N/A | |
| Revisiting Visual Model Robustness: A Frequency Long-Tailed Distribution View | Unknown | N/A | |
| Distributional Policy Evaluation: a Maximum Entropy approach to Representation Learning | Unknown | N/A | |
| Constrained Policy Optimization with Explicit Behavior Density For Offline Reinforcement Learning | Unknown | N/A | |
| Implicit variance regularization in non-contrastive SSL | Unknown | N/A | |
| Pseudo-Likelihood Inference | Unknown | N/A | |
| Star-Shaped Denoising Diffusion Probabilistic Models | Unknown | N/A | |
| Hierarchical clustering with dot products recovers hidden tree structure | Unknown | N/A | |
| The Goldilocks of Pragmatic Understanding: Fine-Tuning Strategy Matters for Implicature Resolution by LLMs | Unknown | N/A | |
| PromptIR: Prompting for All-in-One Image Restoration | Unknown | N/A | |
| Kernel-Based Tests for Likelihood-Free Hypothesis Testing | Unknown | N/A | |
| Learning DAGs from Data with Few Root Causes | Unknown | N/A | |
| Randomized and Deterministic Maximin-share Approximations for Fractionally Subadditive Valuations | Unknown | N/A | |
| Initialization Matters: Privacy-Utility Analysis of Overparameterized Neural Networks | Unknown | N/A | |
| A Fast and Accurate Estimator for Large Scale Linear Model via Data Averaging | Unknown | N/A | |
| Implicit Regularization in Over-Parameterized Support Vector Machine | Unknown | N/A | |
| Auxiliary Losses for Learning Generalizable Concept-based Models | Unknown | N/A | |
| GAIA: Delving into Gradient-based Attribution Abnormality for Out-of-distribution Detection | Unknown | N/A | |
| Grassmann Manifold Flows for Stable Shape Generation | Unknown | N/A | |
| Bayesian Optimisation of Functions on Graphs | Unknown | N/A | |
| Learning Probabilistic Symmetrization for Architecture Agnostic Equivariance | Unknown | N/A | |
| Structured State Space Models for In-Context Reinforcement Learning | Unknown | N/A | |
| Normalizing flow neural networks by JKO scheme | Unknown | N/A | |
| On-the-Fly Adapting Code Summarization on Trainable Cost-Effective Language Models | Unknown | N/A | |
| Sample based Explanations via Generalized Representers | Unknown | N/A | |
| Sample Efficient Reinforcement Learning in Mixed Systems through Augmented Samples and Its Applications to Queueing Networks | Unknown | N/A | |
| Feature Likelihood Divergence: Evaluating the Generalization of Generative Models Using Samples | Unknown | N/A | |
| Empowering Convolutional Neural Nets with MetaSin Activation | Unknown | N/A | |
| Bounded rationality in structured density estimation | Unknown | N/A | |
| Riemannian Laplace approximations for Bayesian neural networks | Unknown | N/A | |
| Token-Scaled Logit Distillation for Ternary Weight Generative Language Models | Unknown | N/A | |
| A Bayesian Approach To Analysing Training Data Attribution In Deep Learning | Unknown | N/A | |
| Human-Guided Complexity-Controlled Abstractions | Unknown | N/A | |
| Generalized Bayesian Inference for Scientific Simulators via Amortized Cost Estimation | Unknown | N/A | |
| Optimizing Solution-Samplers for Combinatorial Problems: The Landscape of Policy-Gradient Method | Unknown | N/A | |
| Perturbation Towards Easy Samples Improves Targeted Adversarial Transferability | Unknown | N/A | |
| Variational Annealing on Graphs for Combinatorial Optimization | Unknown | N/A | |
| EICIL: Joint Excitatory Inhibitory Cycle Iteration Learning for Deep Spiking Neural Networks | Unknown | N/A | |
| Res-Tuning: A Flexible and Efficient Tuning Paradigm via Unbinding Tuner from Backbone | Unknown | N/A | |
| An Alternative to Variance: Gini Deviation for Risk-averse Policy Gradient | Unknown | N/A | |
| Kronecker-Factored Approximate Curvature for Modern Neural Network Architectures | Unknown | N/A | |
| Neural Harmonics: Bridging Spectral Embedding and Matrix Completion in Self-Supervised Learning | Unknown | N/A | |
| Exploiting hidden structures in non-convex games for convergence to Nash equilibrium | Unknown | N/A | |
| UniTSFace: Unified Threshold Integrated Sample-to-Sample Loss for Face Recognition | Unknown | N/A | |
| Diffusion Model for Graph Inverse Problems: Towards Effective Source Localization on Complex Networks | Unknown | N/A | |
| $p$-value Adjustment for Monotonous, Unbiased, and Fast Clustering Comparison | Unknown | N/A | |
| Inferring the Future by Imagining the Past | Unknown | N/A | |
| The Graph Pencil Method: Mapping Subgraph Densities to Stochastic Block Models | Unknown | N/A | |
| Improving Self-supervised Molecular Representation Learning using Persistent Homology | Unknown | N/A | |
| Utilitarian Algorithm Configuration | Unknown | N/A | |
| Beta Diffusion | Unknown | N/A | |
| Learning Transformer Programs | Unknown | N/A | |
| Simple and Controllable Music Generation | Unknown | N/A | |
| Mitigating the Effect of Incidental Correlations on Part-based Learning | Unknown | N/A | |
| PLASTIC: Improving Input and Label Plasticity for Sample Efficient Reinforcement Learning | Unknown | N/A | |
| The Equivalence of Dynamic and Strategic Stability under Regularized Learning in Games | Unknown | N/A | |
| Learning to Discover Skills through Guidance | Unknown | N/A | |
| xTrimoGene: An Efficient and Scalable Representation Learner for Single-Cell RNA-Seq Data | Unknown | N/A | |
| Action Inference by Maximising Evidence: Zero-Shot Imitation from Observation with World Models | Unknown | N/A | |
| Risk-Averse Active Sensing for Timely Outcome Prediction under Cost Pressure | Unknown | N/A | |
| Offline Minimax Soft-Q-learning Under Realizability and Partial Coverage | Unknown | N/A | |
| Sample Complexity Bounds for Score-Matching: Causal Discovery and Generative Modeling | Unknown | N/A | |
| Posthoc privacy guarantees for collaborative inference with modified Propose-Test-Release | Unknown | N/A | |
| Future-Dependent Value-Based Off-Policy Evaluation in POMDPs | Unknown | N/A | |
| Accurate Interpolation for Scattered Data through Hierarchical Residual Refinement | Unknown | N/A | |
| Hybrid Policy Optimization from Imperfect Demonstrations | Unknown | N/A | |
| Optimal Preconditioning and Fisher Adaptive Langevin Sampling | Unknown | N/A | |
| Semantic HELM: A Human-Readable Memory for Reinforcement Learning | Unknown | N/A | |
| Analyzing the Sample Complexity of Self-Supervised Image Reconstruction Methods | Unknown | N/A | |
| WITRAN: Water-wave Information Transmission and Recurrent Acceleration Network for Long-range Time Series Forecasting | Unknown | N/A | |
| Efficient Meta Neural Heuristic for Multi-Objective Combinatorial Optimization | Unknown | N/A | |
| AMDP: An Adaptive Detection Procedure for False Discovery Rate Control in High-Dimensional Mediation Analysis | Unknown | N/A | |
| Consistent Aggregation of Objectives with Diverse Time Preferences Requires Non-Markovian Rewards | Unknown | N/A | |
| Human-Aligned Calibration for AI-Assisted Decision Making | Unknown | N/A | |
| Cross-Domain Policy Adaptation via Value-Guided Data Filtering | Unknown | N/A | |
| Fast Projected Newton-like Method for Precision Matrix Estimation under Total Positivity | Unknown | N/A | |
| Stable Nonconvex-Nonconcave Training via Linear Interpolation | Unknown | N/A | |
| FAST: a Fused and Accurate Shrinkage Tree for Heterogeneous Treatment Effects Estimation | Unknown | N/A | |
| Near-optimal learning with average Hölder smoothness | Unknown | N/A | |
| Advancing Bayesian Optimization via Learning Correlated Latent Space | Unknown | N/A | |
| Interpreting Unsupervised Anomaly Detection in Security via Rule Extraction | Unknown | N/A | |
| LVM-Med: Learning Large-Scale Self-Supervised Vision Models for Medical Imaging via Second-order Graph Matching | Unknown | N/A | |
| Decentralized Matrix Sensing: Statistical Guarantees and Fast Convergence | Unknown | N/A | |
| Hyperbolic Space with Hierarchical Margin Boosts Fine-Grained Learning from Coarse Labels | Unknown | N/A | |
| Generalized Information-theoretic Multi-view Clustering | Unknown | N/A | |
| A Definition of Continual Reinforcement Learning | Unknown | N/A | |
| LinkerNet: Fragment Poses and Linker Co-Design with 3D Equivariant Diffusion | Unknown | N/A | |
| DreamSparse: Escaping from Plato’s Cave with 2D Diffusion Model Given Sparse Views | Unknown | N/A | |
| Few-shot Generation via Recalling Brain-Inspired Episodic-Semantic Memory | Unknown | N/A | |
| Latent Space Translation via Semantic Alignment | Unknown | N/A | |
| NAR-Former V2: Rethinking Transformer for Universal Neural Network Representation Learning | Unknown | N/A | |
| Clifford Group Equivariant Neural Networks | Unknown | N/A | |
| NuTrea: Neural Tree Search for Context-guided Multi-hop KGQA | Unknown | N/A | |
| Object-centric Learning with Cyclic Walks between Parts and Whole | Unknown | N/A | |
| Circuit as Set of Points | Unknown | N/A | |
| Energy Guided Diffusion for Generating Neurally Exciting Images | Unknown | N/A | |
| Fast Bellman Updates for Wasserstein Distributionally Robust MDPs | Unknown | N/A | |
| IBA: Towards Irreversible Backdoor Attacks in Federated Learning | Unknown | N/A | |
| SAME: Uncovering GNN Black Box with Structure-aware Shapley-based Multipiece Explanations | Unknown | N/A | |
| Can You Rely on Your Model Evaluation? Improving Model Evaluation with Synthetic Test Data | Unknown | N/A | |
| Meta-learning families of plasticity rules in recurrent spiking networks using simulation-based inference | Unknown | N/A | |
| Certification of Distributional Individual Fairness | Unknown | N/A | |
| Attacks on Online Learners: a Teacher-Student Analysis | Unknown | N/A | |
| Injecting Multimodal Information into Rigid Protein Docking via Bi-level Optimization | Unknown | N/A | |
| An Improved Relaxation for Oracle-Efficient Adversarial Contextual Bandits | Unknown | N/A | |
| Use perturbations when learning from explanations | Unknown | N/A | |
| Train Faster, Perform Better: Modular Adaptive Training in Over-Parameterized Models | Unknown | N/A | |
| Private estimation algorithms for stochastic block models and mixture models | Unknown | N/A | |
| Learning Cuts via Enumeration Oracles | Unknown | N/A | |
| Fair Canonical Correlation Analysis | Unknown | N/A | |
| Towards Test-Time Refusals via Concept Negation | Unknown | N/A | |
| On the Convergence of No-Regret Learning Dynamics in Time-Varying Games | Unknown | N/A | |
| Dynamic Regret of Adversarial Linear Mixture MDPs | Unknown | N/A | |
| Conservative State Value Estimation for Offline Reinforcement Learning | Unknown | N/A | |
| FiGURe: Simple and Efficient Unsupervised Node Representations with Filter Augmentations | Unknown | N/A | |
| On the Interplay between Social Welfare and Tractability of Equilibria | Unknown | N/A | |
| ContinuAR: Continuous Autoregression For Infinite-Fidelity Fusion | Unknown | N/A | |
| Beyond Pretrained Features: Noisy Image Modeling Provides Adversarial Defense | Unknown | N/A | |
| Practical Sharpness-Aware Minimization Cannot Converge All the Way to Optima | Unknown | N/A | |
| Streaming PCA for Markovian Data | Unknown | N/A | |
| Lookaround Optimizer: $k$ steps around, 1 step average | Unknown | N/A | |
| Bringing regularized optimal transport to lightspeed: a splitting method adapted for GPUs | Unknown | N/A | |
| Fully Dynamic $k$-Clustering in $\tilde O(k)$ Update Time | Unknown | N/A | |
| Connecting Certified and Adversarial Training | Unknown | N/A | |
| Quantizable Transformers: Removing Outliers by Helping Attention Heads Do Nothing | Unknown | N/A | |
| Toward Understanding Generative Data Augmentation | Unknown | N/A | |
| Adapting Neural Link Predictors for Data-Efficient Complex Query Answering | Unknown | N/A | |
| Provable Training for Graph Contrastive Learning | Unknown | N/A | |
| Learning Layer-wise Equivariances Automatically using Gradients | Unknown | N/A | |
| Decision Tree for Locally Private Estimation with Public Data | Unknown | N/A | |
| Equivariant flow matching | Unknown | N/A | |
| Implicit Manifold Gaussian Process Regression | Unknown | N/A | |
| ReHLine: Regularized Composite ReLU-ReHU Loss Minimization with Linear Computation and Linear Convergence | Unknown | N/A | |
| Global-correlated 3D-decoupling Transformer for Clothed Avatar Reconstruction | Unknown | N/A | |
| The Contextual Lasso: Sparse Linear Models via Deep Neural Networks | Unknown | N/A | |
| Optimal Block-wise Asymmetric Graph Construction for Graph-based Semi-supervised Learning | Unknown | N/A | |
| Deep Insights into Noisy Pseudo Labeling on Graph Data | Unknown | N/A | |
| Causal Interpretation of Self-Attention in Pre-Trained Transformers | Unknown | N/A | |
| Deep Recurrent Optimal Stopping | Unknown | N/A | |
| Stochastic Approximation Algorithms for Systems of Interacting Particles | Unknown | N/A | |
| Efficient Batched Algorithm for Contextual Linear Bandits with Large Action Space via Soft Elimination | Unknown | N/A | |
| Structured Voronoi Sampling | Unknown | N/A | |
| LD2: Scalable Heterophilous Graph Neural Network with Decoupled Embeddings | Unknown | N/A | |
| Feature learning via mean-field Langevin dynamics: classifying sparse parities and beyond | Unknown | N/A | |
| MMD-Fuse: Learning and Combining Kernels for Two-Sample Testing Without Data Splitting | Unknown | N/A | |
| Convergence of mean-field Langevin dynamics: time-space discretization, stochastic gradient, and variance reduction | Unknown | N/A | |
| On the Last-iterate Convergence in Time-varying Zero-sum Games: Extra Gradient Succeeds where Optimism Fails | Unknown | N/A | |
| DYffusion: A Dynamics-informed Diffusion Model for Spatiotemporal Forecasting | Unknown | N/A | |
| Unified Segment-to-Segment Framework for Simultaneous Sequence Generation | Unknown | N/A | |
| Energy-Based Cross Attention for Bayesian Context Update in Text-to-Image Diffusion Models | Unknown | N/A | |
| Hybrid Search for Efficient Planning with Completeness Guarantees | Unknown | N/A | |
| SLaM: Student-Label Mixing for Distillation with Unlabeled Examples | Unknown | N/A | |
| Towards Robust and Expressive Whole-body Human Pose and Shape Estimation | Unknown | N/A | |
| Correlative Information Maximization: A Biologically Plausible Approach to Supervised Deep Neural Networks without Weight Symmetry | Unknown | N/A | |
| Fast Approximation of Similarity Graphs with Kernel Density Estimation | Unknown | N/A | |
| AI for Interpretable Chemistry: Predicting Radical Mechanistic Pathways via Contrastive Learning | Unknown | N/A | |
| A Unified Framework for U-Net Design and Analysis | Unknown | N/A | |
| Policy Gradient for Rectangular Robust Markov Decision Processes | Unknown | N/A | |
| Discovering Hierarchical Achievements in Reinforcement Learning via Contrastive Learning | Unknown | N/A | |
| D4Explainer: In-distribution Explanations of Graph Neural Network via Discrete Denoising Diffusion | Unknown | N/A | |
| An Adaptive Algorithm for Learning with Unknown Distribution Drift | Unknown | N/A | |
| Truncating Trajectories in Monte Carlo Policy Evaluation: an Adaptive Approach | Unknown | N/A | |
| Provable benefits of annealing for estimating normalizing constants: Importance Sampling, Noise-Contrastive Estimation, and beyond | Unknown | N/A | |
| TempME: Towards the Explainability of Temporal Graph Neural Networks via Motif Discovery | Unknown | N/A | |
| Knowledge Diffusion for Distillation | Unknown | N/A | |
| Towards a Unified Analysis of Kernel-based Methods Under Covariate Shift | Unknown | N/A | |
| Regression with Cost-based Rejection | Unknown | N/A | |
| Predicting Global Label Relationship Matrix for Graph Neural Networks under Heterophily | Unknown | N/A | |
| Automatic Integration for Spatiotemporal Neural Point Processes | Unknown | N/A | |
| Debiasing Scores and Prompts of 2D Diffusion for View-consistent Text-to-3D Generation | Unknown | N/A | |
| Hierarchical Randomized Smoothing | Unknown | N/A | |
| Direct Training of SNN using Local Zeroth Order Method | Unknown | N/A | |
| Disentangling Voice and Content with Self-Supervision for Speaker Recognition | Unknown | N/A | |
| Understanding and Improving Ensemble Adversarial Defense | Unknown | N/A | |
| Is This Loss Informative? Faster Text-to-Image Customization by Tracking Objective Dynamics | Unknown | N/A | |
| Theoretical and Practical Perspectives on what Influence Functions Do | Unknown | N/A | |
| Estimating the Rate-Distortion Function by Wasserstein Gradient Descent | Unknown | N/A | |
| Monitor-Guided Decoding of Code LMs with Static Analysis of Repository Context | Unknown | N/A | |
| TRIAGE: Characterizing and auditing training data for improved regression | Unknown | N/A | |
| Generalization in the Face of Adaptivity: A Bayesian Perspective | Unknown | N/A | |
| ViCA-NeRF: View-Consistency-Aware 3D Editing of Neural Radiance Fields | Unknown | N/A | |
| Efficiently incorporating quintuple interactions into geometric deep learning force fields | Unknown | N/A | |
| Causal de Finetti: On the Identification of Invariant Causal Structure in Exchangeable Data | Unknown | N/A | |
| Adversarial Self-Training Improves Robustness and Generalization for Gradual Domain Adaptation | Unknown | N/A | |
| Trajectory Alignment: Understanding the Edge of Stability Phenomenon via Bifurcation Theory | Unknown | N/A | |
| MomentDiff: Generative Video Moment Retrieval from Random to Real | Unknown | N/A | |
| Temporal Dynamic Quantization for Diffusion Models | Unknown | N/A | |
| Self-Predictive Universal AI | Unknown | N/A | |
| Causal Component Analysis | Unknown | N/A | |
| Nonparametric Identifiability of Causal Representations from Unknown Interventions | Unknown | N/A | |
| CLadder: Assessing Causal Reasoning in Language Models | Unknown | N/A | |
| Improving neural network representations using human similarity judgments | Unknown | N/A | |
| AlberDICE: Addressing Out-Of-Distribution Joint Actions in Offline Multi-Agent RL via Alternating Stationary Distribution Correction Estimation | Unknown | N/A | |
| NeuroGraph: Benchmarks for Graph Machine Learning in Brain Connectomics | Unknown | N/A | |
| Effectively Learning Initiation Sets in Hierarchical Reinforcement Learning | Unknown | N/A | |
| Reinforcement Learning with Simple Sequence Priors | Unknown | N/A | |
| Aleatoric and Epistemic Discrimination: Fundamental Limits of Fairness Interventions | Unknown | N/A | |
| StoryBench: A Multifaceted Benchmark for Continuous Story Visualization | Unknown | N/A | |
| Robust Mean Estimation Without Moments for Symmetric Distributions | Unknown | N/A | |
| Federated Compositional Deep AUC Maximization | Unknown | N/A | |
| Window-Based Distribution Shift Detection for Deep Neural Networks | Unknown | N/A | |
| FreeMask: Synthetic Images with Dense Annotations Make Stronger Segmentation Models | Unknown | N/A | |
| V-InFoR: A Robust Graph Neural Networks Explainer for Structurally Corrupted Graphs | Unknown | N/A | |
| A Comprehensive Study on Text-attributed Graphs: Benchmarking and Rethinking | Unknown | N/A | |
| Bayesian Active Causal Discovery with Multi-Fidelity Experiments | Unknown | N/A | |
| CWCL: Cross-Modal Transfer with Continuously Weighted Contrastive Loss | Unknown | N/A | |
| Online PCA in Converging Self-consistent Field Equations | Unknown | N/A | |
| Don’t blame Dataset Shift! Shortcut Learning due to Gradients and Cross Entropy | Unknown | N/A | |
| On Slicing Optimality for Mutual Information | Unknown | N/A | |
| k-Median Clustering via Metric Embedding: Towards Better Initialization with Differential Privacy | Unknown | N/A | |
| Information Maximization Perspective of Orthogonal Matching Pursuit with Applications to Explainable AI | Unknown | N/A | |
| Conditional Matrix Flows for Gaussian Graphical Models | Unknown | N/A | |
| Two-Stage Learning to Defer with Multiple Experts | Unknown | N/A | |
| Multiply Robust Federated Estimation of Targeted Average Treatment Effects | Unknown | N/A | |
| On the Variance, Admissibility, and Stability of Empirical Risk Minimization | Unknown | N/A | |
| To Stay or Not to Stay in the Pre-train Basin: Insights on Ensembling in Transfer Learning | Unknown | N/A | |
| Compositional Abilities Emerge Multiplicatively: Exploring Diffusion Models on a Synthetic Task | Unknown | N/A | |
| Phase diagram of early training dynamics in deep neural networks: effect of the learning rate, depth, and width | Unknown | N/A | |
| Explaining V1 Properties with a Biologically Constrained Deep Learning Architecture | Unknown | N/A | |
| Adversarial Examples Might be Avoidable: The Role of Data Concentration in Adversarial Robustness | Unknown | N/A | |
| Improving day-ahead Solar Irradiance Time Series Forecasting by Leveraging Spatio-Temporal Context | Unknown | N/A | |
| Red Teaming Deep Neural Networks with Feature Synthesis Tools | Unknown | N/A | |
| From Pixels to UI Actions: Learning to Follow Instructions via Graphical User Interfaces | Unknown | N/A | |
| Human-in-the-Loop Optimization for Deep Stimulus Encoding in Visual Prostheses | Unknown | N/A | |
| Agnostically Learning Single-Index Models using Omnipredictors | Unknown | N/A | |
| Combining Behaviors with the Successor Features Keyboard | Unknown | N/A | |
| Understanding Diffusion Objectives as the ELBO with Simple Data Augmentation | Unknown | N/A | |
| Data Market Design through Deep Learning | Unknown | N/A | |
| Text Alignment Is An Efficient Unified Model for Massive NLP Tasks | Unknown | N/A | |
| Language Models Don't Always Say What They Think: Unfaithful Explanations in Chain-of-Thought Prompting | Unknown | N/A | |
| f-Policy Gradients: A General Framework for Goal-Conditioned RL using f-Divergences | Unknown | N/A | |
| Fine-Grained Human Feedback Gives Better Rewards for Language Model Training | Unknown | N/A | |
| Disentangled Wasserstein Autoencoder for T-Cell Receptor Engineering | Unknown | N/A | |
| A Unifying Perspective on Multi-Calibration: Game Dynamics for Multi-Objective Learning | Unknown | N/A | |
| Data-driven Optimal Filtering for Linear Systems with Unknown Noise Covariances | Unknown | N/A | |
| Hierarchical VAEs provide a normative account of motion processing in the primate brain | Unknown | N/A | |
| Optimal testing using combined test statistics across independent studies | Unknown | N/A | |
| Scale Alone Does not Improve Mechanistic Interpretability in Vision Models | Unknown | N/A | |
| Tracking Most Significant Shifts in Nonparametric Contextual Bandits | Unknown | N/A | |
| SQ Lower Bounds for Non-Gaussian Component Analysis with Weaker Assumptions | Unknown | N/A | |
| Precise asymptotic generalization for multiclass classification with overparameterized linear models | Unknown | N/A | |
| Fair Adaptive Experiments | Unknown | N/A | |
| Diverse Shape Completion via Style Modulated Generative Adversarial Networks | Unknown | N/A | |
| UNSSOR: Unsupervised Neural Speech Separation by Leveraging Over-determined Training Mixtures | Unknown | N/A | |
| Understanding the detrimental class-level effects of data augmentation | Unknown | N/A | |
| Versatile Energy-Based Probabilistic Models for High Energy Physics | Unknown | N/A | |
| Compositional Generalization from First Principles | Unknown | N/A | |
| SpecTr: Fast Speculative Decoding via Optimal Transport | Unknown | N/A | |
| Fair, Polylog-Approximate Low-Cost Hierarchical Clustering | Unknown | N/A | |
| Minimax-Optimal Location Estimation | Unknown | N/A | |
| Posterior Sampling with Delayed Feedback for Reinforcement Learning with Linear Function Approximation | Unknown | N/A | |
| Projection-Free Methods for Solving Nonconvex-Concave Saddle Point Problems | Unknown | N/A | |
| A polar prediction model for learning to represent visual transformations | Unknown | N/A | |
| Kullback-Leibler Maillard Sampling for Multi-armed Bandits with Bounded Rewards | Unknown | N/A | |
| No Train No Gain: Revisiting Efficient Training Algorithms For Transformer-based Language Models | Unknown | N/A | |
| Modelling Cellular Perturbations with the Sparse Additive Mechanism Shift Variational Autoencoder | Unknown | N/A | |
| HiNeRV: Video Compression with Hierarchical Encoding-based Neural Representation | Unknown | N/A | |
| No-Regret Online Prediction with Strategic Experts | Unknown | N/A | |
| Uncovering motifs of concurrent signaling across multiple neuronal populations | Unknown | N/A | |
| ELDEN: Exploration via Local Dependencies | Unknown | N/A | |
| Improved Algorithms for Stochastic Linear Bandits Using Tail Bounds for Martingale Mixtures | Unknown | N/A | |
| How to Scale Your EMA | Unknown | N/A | |
| Single-Pass Pivot Algorithm for Correlation Clustering. Keep it simple! | Unknown | N/A | |
| Model-free Posterior Sampling via Learning Rate Randomization | Unknown | N/A | |
| A Unified, Scalable Framework for Neural Population Decoding | Unknown | N/A | |
| A Trichotomy for Transductive Online Learning | Unknown | N/A | |
| Towards Automated Circuit Discovery for Mechanistic Interpretability | Unknown | N/A | |
| Generating Behaviorally Diverse Policies with Latent Diffusion Models | Unknown | N/A | |
| Distributed Personalized Empirical Risk Minimization | Unknown | N/A | |
| Structured Prediction with Stronger Consistency Guarantees | Unknown | N/A | |
| Feature Learning for Interpretable, Performant Decision Trees | Unknown | N/A | |
| Advice Querying under Budget Constraint for Online Algorithms | Unknown | N/A | |
| Pretraining task diversity and the emergence of non-Bayesian in-context learning for regression | Unknown | N/A | |
| MIMEx: Intrinsic Rewards from Masked Input Modeling | Unknown | N/A | |
| Prioritizing Samples in Reinforcement Learning with Reducible Loss | Unknown | N/A | |
| Spatially Resolved Gene Expression Prediction from Histology Images via Bi-modal Contrastive Learning | Unknown | N/A | |
| Not All Neuro-Symbolic Concepts Are Created Equal: Analysis and Mitigation of Reasoning Shortcuts | Unknown | N/A | |
| Group Robust Classification Without Any Group Information | Unknown | N/A | |
| Egocentric Planning for Scalable Embodied Task Achievement | Unknown | N/A | |
| Lie Point Symmetry and Physics-Informed Networks | Unknown | N/A | |
| PAC-Bayes Generalization Certificates for Learned Inductive Conformal Prediction | Unknown | N/A | |
| Derandomized novelty detection with FDR control via conformal e-values | Unknown | N/A | |
| Adversarial Learning for Feature Shift Detection and Correction | Unknown | N/A | |
| Grammar Prompting for Domain-Specific Language Generation with Large Language Models | Unknown | N/A | |
| Rethinking Gauss-Newton for learning over-parameterized models | Unknown | N/A | |
| SOL: Sampling-based Optimal Linear bounding of arbitrary scalar functions | Unknown | N/A | |
| Compositional Sculpting of Iterative Generative Processes | Unknown | N/A | |
| High-Fidelity Audio Compression with Improved RVQGAN | Unknown | N/A | |
| A State Representation for Diminishing Rewards | Unknown | N/A | |
| Discriminative Calibration: Check Bayesian Computation from Simulations and Flexible Classifier | Unknown | N/A | |
| Online POMDP Planning with Anytime Deterministic Guarantees | Unknown | N/A | |
| Residual Q-Learning: Offline and Online Policy Customization without Value | Unknown | N/A | |
| Near Optimal Reconstruction of Spherical Harmonic Expansions | Unknown | N/A | |
| Automated Classification of Model Errors on ImageNet | Unknown | N/A | |
| Towards robust and generalizable representations of extracellular data using contrastive learning | Unknown | N/A | |
| The Gain from Ordering in Online Learning | Unknown | N/A | |
| Intensity Profile Projection: A Framework for Continuous-Time Representation Learning for Dynamic Networks | Unknown | N/A | |
| Optimistic Natural Policy Gradient: a Simple Efficient Policy Optimization Framework for Online RL | Unknown | N/A | |
| MarioGPT: Open-Ended Text2Level Generation through Large Language Models | Unknown | N/A | |
| Distribution-Free Statistical Dispersion Control for Societal Applications | Unknown | N/A | |
| Small batch deep reinforcement learning | Unknown | N/A | |
| Conditional Adapters: Parameter-efficient Transfer Learning with Fast Inference | Unknown | N/A | |
| Differentiable Neuro-Symbolic Reasoning on Large-Scale Knowledge Graphs | Unknown | N/A | |
| Detecting hidden confounding in observational data using multiple environments | Unknown | N/A | |
| Learning and processing the ordinal information of temporal sequences in recurrent neural circuits | Unknown | N/A | |
| Multi Time Scale World Models | Unknown | N/A | |
| Dual Mean-Teacher: An Unbiased Semi-Supervised Framework for Audio-Visual Source Localization | Unknown | N/A | |
| Model-Free Active Exploration in Reinforcement Learning | Unknown | N/A | |
| Optimal Convergence Rate for Exact Policy Mirror Descent in Discounted Markov Decision Processes | Unknown | N/A | |
| Self-Supervised Reinforcement Learning that Transfers using Random Features | Unknown | N/A | |
| KAKURENBO: Adaptively Hiding Samples in Deep Neural Network Training | Unknown | N/A | |
| List and Certificate Complexities in Replicable Learning | Unknown | N/A | |
| Towards Unbounded Machine Unlearning | Unknown | N/A | |
| $p$-Poisson surface reconstruction in curl-free flow from point clouds | Unknown | N/A | |
| DrugCLIP: Contrastive Protein-Molecule Representation Learning for Virtual Screening | Unknown | N/A | |
| On the Convergence of CART under Sufficient Impurity Decrease Condition | Unknown | N/A | |
| FlowPG: Action-constrained Policy Gradient with Normalizing Flows | Unknown | N/A | |
| Proportional Response: Contextual Bandits for Simple and Cumulative Regret Minimization | Unknown | N/A | |
| Variational Monte Carlo on a Budget — Fine-tuning pre-trained Neural Wavefunctions | Unknown | N/A | |
| Nearly Optimal Bounds for Cyclic Forgetting | Unknown | N/A | |
| SGFormer: Simplifying and Empowering Transformers for Large-Graph Representations | Unknown | N/A | |
| The Shaped Transformer: Attention Models in the Infinite Depth-and-Width Limit | Unknown | N/A | |
| Doubly Robust Augmented Transfer for Meta-Reinforcement Learning | Unknown | N/A | |
| Convergence analysis of ODE models for accelerated first-order methods via positive semidefinite kernels | Unknown | N/A | |
| Anytime Model Selection in Linear Bandits | Unknown | N/A | |
| Provable Adversarial Robustness for Group Equivariant Tasks: Graphs, Point Clouds, Molecules, and More | Unknown | N/A | |
| Learning Linear Causal Representations from Interventions under General Nonlinear Mixing | Unknown | N/A | |
| Towards Optimal Effective Resistance Estimation | Unknown | N/A | |
| Geometric Transformer with Interatomic Positional Encoding | Unknown | N/A | |
| Rotating Features for Object Discovery | Unknown | N/A | |
| Scaling MLPs: A Tale of Inductive Bias | Unknown | N/A | |
| Emergent and Predictable Memorization in Large Language Models | Unknown | N/A | |
| Strategic Behavior in Two-sided Matching Markets with Prediction-enhanced Preference-formation | Unknown | N/A | |
| Bayes beats Cross Validation: Efficient and Accurate Ridge Regression via Expectation Maximization | Unknown | N/A | |
| Counterfactual Evaluation of Peer-Review Assignment Policies | Unknown | N/A | |
| Emergent Communication for Rules Reasoning | Unknown | N/A | |
| What is Flagged in Uncertainty Quantification? Latent Density Models for Uncertainty Categorization | Unknown | N/A | |
| On the Connection between Pre-training Data Diversity and Fine-tuning Robustness | Unknown | N/A | |
| Hyperbolic Graph Neural Networks at Scale: A Meta Learning Approach | Unknown | N/A | |
| The emergence of clusters in self-attention dynamics | Unknown | N/A | |
| An Inductive Bias for Tabular Deep Learning | Unknown | N/A | |
| Fair Streaming Principal Component Analysis: Statistical and Algorithmic Viewpoint | Unknown | N/A | |
| CEIL: Generalized Contextual Imitation Learning | Unknown | N/A | |
| Training Transformers with 4-bit Integers | Unknown | N/A | |
| Estimating Propensity for Causality-based Recommendation without Exposure Data | Unknown | N/A | |
| Blockwise Parallel Transformers for Large Context Models | Unknown | N/A | |
| BiSLS/SPS: Auto-tune Step Sizes for Stable Bi-level Optimization | Unknown | N/A | |
| Matrix Compression via Randomized Low Rank and Low Precision Factorization | Unknown | N/A | |
| Adapting to Continuous Covariate Shift via Online Density Ratio Estimation | Unknown | N/A | |
| From Tempered to Benign Overfitting in ReLU Neural Networks | Unknown | N/A | |
| Large Language Models for Automated Data Science: Introducing CAAFE for Context-Aware Automated Feature Engineering | Unknown | N/A | |
| SPQR: Controlling Q-ensemble Independence with Spiked Random Model for Reinforcement Learning | Unknown | N/A | |
| Facilitating Graph Neural Networks with Random Walk on Simplicial Complexes | Unknown | N/A | |
| Towards Self-Interpretable Graph-Level Anomaly Detection | Unknown | N/A | |
| Optimal Transport for Treatment Effect Estimation | Unknown | N/A | |
| Language Quantized AutoEncoders: Towards Unsupervised Text-Image Alignment | Unknown | N/A | |
| VOCE: Variational Optimization with Conservative Estimation for Offline Safe Reinforcement Learning | Unknown | N/A | |
| Blocked Collaborative Bandits: Online Collaborative Filtering with Per-Item Budget Constraints | Unknown | N/A | |
| Function Space Bayesian Pseudocoreset for Bayesian Neural Networks | Unknown | N/A | |
| CaMP: Causal Multi-policy Planning for Interactive Navigation in Multi-room Scenes | Unknown | N/A | |
| DesCo: Learning Object Recognition with Rich Language Descriptions | Unknown | N/A | |
| Globally solving the Gromov-Wasserstein problem for point clouds in low dimensional Euclidean spaces | Unknown | N/A | |
| Unbiased learning of deep generative models with structured discrete representations | Unknown | N/A | |
| Geometry-Informed Neural Operator for Large-Scale 3D PDEs | Unknown | N/A | |
| Grounding Neural Inference with Satisfiability Modulo Theories | Unknown | N/A | |
| Uncertainty Estimation for Safety-critical Scene Segmentation via Fine-grained Reward Maximization | Unknown | N/A | |
| DPM-Solver-v3: Improved Diffusion ODE Solver with Empirical Model Statistics | Unknown | N/A | |
| Evolving Connectivity for Recurrent Spiking Neural Networks | Unknown | N/A | |
| Counterfactually Fair Representation | Unknown | N/A | |
| Emergent Communication in Interactive Sketch Question Answering | Unknown | N/A | |
| Accelerated Training via Incrementally Growing Neural Networks using Variance Transfer and Learning Rate Adaptation | Unknown | N/A | |
| Identifiable Contrastive Learning with Automatic Feature Importance Discovery | Unknown | N/A | |
| Likelihood-Based Diffusion Language Models | Unknown | N/A | |
| PriorBand: Practical Hyperparameter Optimization in the Age of Deep Learning | Unknown | N/A | |
| Brant: Foundation Model for Intracranial Neural Signal | Unknown | N/A | |
| Learning Topology-Agnostic EEG Representations with Geometry-Aware Modeling | Unknown | N/A | |
| Neural-Logic Human-Object Interaction Detection | Unknown | N/A | |
| Beyond NTK with Vanilla Gradient Descent: A Mean-Field Analysis of Neural Networks with Polynomial Width, Samples, and Time | Unknown | N/A | |
| HIQL: Offline Goal-Conditioned RL with Latent States as Actions | Unknown | N/A | |
| Improved Convergence in High Probability of Clipped Gradient Methods with Heavy Tailed Noise | Unknown | N/A | |
| Residual Alignment: Uncovering the Mechanisms of Residual Networks | Unknown | N/A | |
| Look Beneath the Surface: Exploiting Fundamental Symmetry for Sample-Efficient Offline RL | Unknown | N/A | |
| Consistent Diffusion Models: Mitigating Sampling Drift by Learning to be Consistent | Unknown | N/A | |
| Binary Radiance Fields | Unknown | N/A | |
| Ambient Diffusion: Learning Clean Distributions from Corrupted Data | Unknown | N/A | |
| An Information-Theoretic Evaluation of Generative Models in Learning Multi-modal Distributions | Unknown | N/A | |
| Statistical and Computational Trade-off in Multi-Agent Multi-Armed Bandits | Unknown | N/A | |
| Compressed Video Prompt Tuning | Unknown | N/A | |
| On Convergence of Polynomial Approximations to the Gaussian Mixture Entropy | Unknown | N/A | |
| Diversify Your Vision Datasets with Automatic Diffusion-based Augmentation | Unknown | N/A | |
| DOSE: Diffusion Dropout with Adaptive Prior for Speech Enhancement | Unknown | N/A | |
| No Change, No Gain: Empowering Graph Neural Networks with Expected Model Change Maximization for Active Learning | Unknown | N/A | |
| Better with Less: A Data-Active Perspective on Pre-Training Graph Neural Networks | Unknown | N/A | |
| Doubly-Robust Self-Training | Unknown | N/A | |
| AdaPlanner: Adaptive Planning from Feedback with Language Models | Unknown | N/A | |
| Zero-shot causal learning | Unknown | N/A | |
| DiffuseBot: Breeding Soft Robots With Physics-Augmented Generative Diffusion Models | Unknown | N/A | |
| Uniform Convergence with Square-Root Lipschitz Loss | Unknown | N/A | |
| Object-Centric Slot Diffusion | Unknown | N/A | |
| Concept Distillation: Leveraging Human-Centered Explanations for Model Improvement | Unknown | N/A | |
| Curriculum Learning With Infant Egocentric Videos | Unknown | N/A | |
| Towards Optimal Caching and Model Selection for Large Model Inference | Unknown | N/A | |
| Scan and Snap: Understanding Training Dynamics and Token Composition in 1-layer Transformer | Unknown | N/A | |
| RETVec: Resilient and Efficient Text Vectorizer | Unknown | N/A | |
| On the Planning Abilities of Large Language Models - A Critical Investigation | Unknown | N/A | |
| Long-Term Fairness with Unknown Dynamics | Unknown | N/A | |
| A Finite-Sample Analysis of Payoff-Based Independent Learning in Zero-Sum Stochastic Games | Unknown | N/A | |
| Scaling Riemannian Diffusion Models | Unknown | N/A | |
| Global Convergence Analysis of Local SGD for Two-layer Neural Network without Overparameterization | Unknown | N/A | |
| Plug-and-Play Stability for Intracortical Brain-Computer Interfaces: A One-Year Demonstration of Seamless Brain-to-Text Communication | Unknown | N/A | |
| Diffusion Self-Guidance for Controllable Image Generation | Unknown | N/A | |
| Implicit Bias of Gradient Descent for Two-layer ReLU and Leaky ReLU Networks on Nearly-orthogonal Data | Unknown | N/A | |
| Nearly Optimal VC-Dimension and Pseudo-Dimension Bounds for Deep Neural Network Derivatives | Unknown | N/A | |
| Stability Guarantees for Feature Attributions with Multiplicative Smoothing | Unknown | N/A | |
| Arbitrarily Scalable Environment Generators via Neural Cellular Automata | Unknown | N/A | |
| TopoSRL: Topology preserving self-supervised Simplicial Representation Learning | Unknown | N/A | |
| Chasing Fairness Under Distribution Shift: A Model Weight Perturbation Approach | Unknown | N/A | |
| Characterizing the Impacts of Semi-supervised Learning for Weak Supervision | Unknown | N/A | |
| Maximum Average Randomly Sampled: A Scale Free and Non-parametric Algorithm for Stochastic Bandits | Unknown | N/A | |
| Physics-Driven ML-Based Modelling for Correcting Inverse Estimation | Unknown | N/A | |
| Simplicity Bias in 1-Hidden Layer Neural Networks | Unknown | N/A | |
| $S^3$: Increasing GPU Utilization during Generative Inference for Higher Throughput | Unknown | N/A | |
| Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture | Unknown | N/A | |
| Universal Gradient Descent Ascent Method for Nonconvex-Nonconcave Minimax Optimization | Unknown | N/A | |
| Debias Coarsely, Sample Conditionally: Statistical Downscaling through Optimal Transport and Probabilistic Diffusion Models | Unknown | N/A | |
| Class-Conditional Conformal Prediction with Many Classes | Unknown | N/A | |
| Going Beyond Linear Mode Connectivity: The Layerwise Linear Feature Connectivity | Unknown | N/A | |
| Optimal Unbiased Randomizers for Regression with Label Differential Privacy | Unknown | N/A | |
| Certified Minimax Unlearning with Generalization Rates and Deletion Capacity | Unknown | N/A | |
| Training shallow ReLU networks on noisy data using hinge loss: when do we overfit and is it benign? | Unknown | N/A | |
| UE4-NeRF: Neural Radiance Field for Real-Time Rendering of Large-Scale Scene | Unknown | N/A | |
| On the Generalization Error of Stochastic Mirror Descent for Quadratically-Bounded Losses: an Improved Analysis | Unknown | N/A | |
| Leveraging Pre-trained Large Language Models to Construct and Utilize World Models for Model-based Task Planning | Unknown | N/A | |
| Practical Differentially Private Hyperparameter Tuning with Subsampling | Unknown | N/A | |
| Disambiguated Attention Embedding for Multi-Instance Partial-Label Learning | Unknown | N/A | |
| High dimensional, tabular deep learning with an auxiliary knowledge graph | Unknown | N/A | |
| Not All Out-of-Distribution Data Are Harmful to Open-Set Active Learning | Unknown | N/A | |
| Error Bounds for Learning with Vector-Valued Random Features | Unknown | N/A | |
| Contextual Bandits and Imitation Learning with Preference-Based Active Queries | Unknown | N/A | |
| What Distributions are Robust to Indiscriminate Poisoning Attacks for Linear Learners? | Unknown | N/A | |
| PAC Learning Linear Thresholds from Label Proportions | Unknown | N/A | |
| Payoff-based Learning with Matrix Multiplicative Weights in Quantum Games | Unknown | N/A | |
| POMDP Planning for Object Search in Partially Unknown Environment | Unknown | N/A | |
| MEGABYTE: Predicting Million-byte Sequences with Multiscale Transformers | Unknown | N/A | |
| BasisFormer: Attention-based Time Series Forecasting with Learnable and Interpretable Basis | Unknown | N/A | |
| A case for reframing automated medical image classification as segmentation | Unknown | N/A | |
| Inner Product-based Neural Network Similarity | Unknown | N/A | |
| Rethinking Incentives in Recommender Systems: Are Monotone Rewards Always Beneficial? | Unknown | N/A | |
| Can Language Models Solve Graph Problems in Natural Language? | Unknown | N/A | |
| CLIP-OGD: An Experimental Design for Adaptive Neyman Allocation in Sequential Experiments | Unknown | N/A | |
| Unified Off-Policy Learning to Rank: a Reinforcement Learning Perspective | Unknown | N/A | |
| Supervised Pretraining Can Learn In-Context Reinforcement Learning | Unknown | N/A | |
| Robust Second-Order Nonconvex Optimization and Its Application to Low Rank Matrix Sensing | Unknown | N/A | |
| The Impact of Positional Encoding on Length Generalization in Transformers | Unknown | N/A | |
| Demystifying the Optimal Performance of Multi-Class Classification | Unknown | N/A | |
| Self-Chained Image-Language Model for Video Localization and Question Answering | Unknown | N/A | |
| IMPRESS: Evaluating the Resilience of Imperceptible Perturbations Against Unauthorized Data Usage in Diffusion-Based Generative AI | Unknown | N/A | |
| Natural Actor-Critic for Robust Reinforcement Learning with Function Approximation | Unknown | N/A | |
| ReDS: Offline RL With Heteroskedastic Datasets via Support Constraints | Unknown | N/A | |
| Finite Population Regression Adjustment and Non-asymptotic Guarantees for Treatment Effect Estimation | Unknown | N/A | |
| Laughing Hyena Distillery: Extracting Compact Recurrences From Convolutions | Unknown | N/A | |
| On Sample-Efficient Offline Reinforcement Learning: Data Diversity, Posterior Sampling and Beyond | Unknown | N/A | |
| Non-autoregressive Machine Translation with Probabilistic Context-free Grammar | Unknown | N/A | |
| Decision Stacks: Flexible Reinforcement Learning via Modular Generative Models | Unknown | N/A | |
| A Data-Free Approach to Mitigate Catastrophic Forgetting in Federated Class Incremental Learning for Vision Tasks | Unknown | N/A | |
| New Bounds for Hyperparameter Tuning of Regression Problems Across Instances | Unknown | N/A | |
| A Long $N$-step Surrogate Stage Reward for Deep Reinforcement Learning | Unknown | N/A | |
| Language Models are Weak Learners | Unknown | N/A | |
| Navigating Data Heterogeneity in Federated Learning: A Semi-Supervised Federated Object Detection | Unknown | N/A | |
| PLANNER: Generating Diversified Paragraph via Latent Language Diffusion Model | Unknown | N/A | |
| Demystifying Oversmoothing in Attention-Based Graph Neural Networks | Unknown | N/A | |
| ComSL: A Composite Speech-Language Model for End-to-End Speech-to-Text Translation | Unknown | N/A | |
| User-Level Differential Privacy With Few Examples Per User | Unknown | N/A | |
| Participatory Personalization in Classification | Unknown | N/A | |
| Guarantees for Self-Play in Multiplayer Games via Polymatrix Decomposability | Unknown | N/A | |
| ChatGPT-Powered Hierarchical Comparisons for Image Classification | Unknown | N/A | |
| Slimmed Asymmetrical Contrastive Learning and Cross Distillation for Lightweight Model Training | Unknown | N/A | |
| Enhancing Knowledge Transfer for Task Incremental Learning with Data-free Subnetwork | Unknown | N/A | |
| Evolutionary Neural Architecture Search for Transformer in Knowledge Tracing | Unknown | N/A | |
| Restless Bandits with Average Reward: Breaking the Uniform Global Attractor Assumption | Unknown | N/A | |
| MIM4DD: Mutual Information Maximization for Dataset Distillation | Unknown | N/A | |
| Fine-Grained Theoretical Analysis of Federated Zeroth-Order Optimization | Unknown | N/A | |
| Searching for Optimal Per-Coordinate Step-sizes with Multidimensional Backtracking | Unknown | N/A | |
| Maximization of Average Precision for Deep Learning with Adversarial Ranking Robustness | Unknown | N/A | |
| A new perspective on building efficient and expressive 3D equivariant graph neural networks | Unknown | N/A | |
| CosNet: A Generalized Spectral Kernel Network | Unknown | N/A | |
| Concept Algebra for (Score-Based) Text-Controlled Generative Models | Unknown | N/A | |
| State-Action Similarity-Based Representations for Off-Policy Evaluation | Unknown | N/A | |
| Adaptive Linear Estimating Equations | Unknown | N/A | |
| Why think step by step? Reasoning emerges from the locality of experience | Unknown | N/A | |
| ShiftAddViT: Mixture of Multiplication Primitives Towards Efficient Vision Transformer | Unknown | N/A | |
| Bandit Task Assignment with Unknown Processing Time | Unknown | N/A | |
| DAMEX: Dataset-aware Mixture-of-Experts for visual understanding of mixture-of-datasets | Unknown | N/A | |
| Game Solving with Online Fine-Tuning | Unknown | N/A | |
| Recurrent Temporal Revision Graph Networks | Unknown | N/A | |
| Decompose a Task into Generalizable Subtasks in Multi-Agent Reinforcement Learning | Unknown | N/A | |
| Explain Any Concept: Segment Anything Meets Concept-Based Explanation | Unknown | N/A | |
| Depth-discriminative Metric Learning for Monocular 3D Object Detection | Unknown | N/A | |
| Connecting Pre-trained Language Model and Downstream Task via Properties of Representation | Unknown | N/A | |
| An Exploration-by-Optimization Approach to Best of Both Worlds in Linear Bandits | Unknown | N/A | |
| Score-based Source Separation with Applications to Digital Communication Signals | Unknown | N/A | |
| Generalized Belief Transport | Unknown | N/A | |
| Weakly Coupled Deep Q-Networks | Unknown | N/A | |
| Provable benefits of score matching | Unknown | N/A | |
| Generalized Semi-Supervised Learning via Self-Supervised Feature Adaptation | Unknown | N/A | |
| Weitzman's Rule for Pandora's Box with Correlations | Unknown | N/A | |
| Learning Mask-aware CLIP Representations for Zero-Shot Segmentation | Unknown | N/A | |
| CoDrug: Conformal Drug Property Prediction with Density Estimation under Covariate Shift | Unknown | N/A | |
| Pitfall of Optimism: Distributional Reinforcement Learning by Randomizing Risk Criterion | Unknown | N/A | |
| Learning Rule-Induced Subgraph Representations for Inductive Relation Prediction | Unknown | N/A | |
| HQA-Attack: Toward High Quality Black-Box Hard-Label Adversarial Attack on Text | Unknown | N/A | |
| Discriminative Feature Attributions: Bridging Post Hoc Explainability and Inherent Interpretability | Unknown | N/A | |
| EMMA-X: An EM-like Multilingual Pre-training Algorithm for Cross-lingual Representation Learning | Unknown | N/A | |
| Large Language Models Are Semi-Parametric Reinforcement Learning Agents | Unknown | N/A | |
| HyenaDNA: Long-Range Genomic Sequence Modeling at Single Nucleotide Resolution | Unknown | N/A | |
| Unifying Predictions of Deterministic and Stochastic Physics in Mesh-reduced Space with Sequential Flow Generative Model | Unknown | N/A | |
| Molecule Joint Auto-Encoding: Trajectory Pretraining with 2D and 3D Diffusion | Unknown | N/A | |
| Flow Matching for Scalable Simulation-Based Inference | Unknown | N/A | |
| Optimal and Fair Encouragement Policy Evaluation and Learning | Unknown | N/A | |
| Machine learning detects terminal singularities | Unknown | N/A | |
| Universality laws for Gaussian mixtures in generalized linear models | Unknown | N/A | |
| CoLLAT: On Adding Fine-grained Audio Understanding to Language Models using Token-Level Locked-Language Tuning | Unknown | N/A | |
| Large Language Models Are Zero-Shot Time Series Forecasters | Unknown | N/A | |
| Multi-task Representation Learning for Pure Exploration in Bilinear Bandits | Unknown | N/A | |
| Bottleneck Structure in Learned Features: Low-Dimension vs Regularity Tradeoff | Unknown | N/A | |
| Causal Effect Regularization: Automated Detection and Removal of Spurious Correlations | Unknown | N/A | |
| A Sublinear-Time Spectral Clustering Oracle with Improved Preprocessing Time | Unknown | N/A | |
| Max-Sliced Mutual Information | Unknown | N/A | |
| Probabilistic Inference in Reinforcement Learning Done Right | Unknown | N/A | |
| Training biologically plausible recurrent neural networks on cognitive tasks with long-term dependencies | Unknown | N/A | |
| Active Negative Loss Functions for Learning with Noisy Labels | Unknown | N/A | |
| Transformer-based Planning for Symbolic Regression | Unknown | N/A | |
| Reference-Based POMDPs | Unknown | N/A | |
| Neuro-symbolic Learning Yielding Logical Constraints | Unknown | N/A | |
| Efficient Learning of Linear Graph Neural Networks via Node Subsampling | Unknown | N/A | |
| Transformers learn to implement preconditioned gradient descent for in-context learning | Unknown | N/A | |
| Responsible AI (RAI) Games and Ensembles | Unknown | N/A | |
| GAN You See Me? Enhanced Data Reconstruction Attacks against Split Inference | Unknown | N/A | |
| Tackling Heavy-Tailed Rewards in Reinforcement Learning with Function Approximation: Minimax Optimal and Instance-Dependent Regret Bounds | Unknown | N/A | |
| Learning to Search Feasible and Infeasible Regions of Routing Problems with Flexible Neural k-Opt | Unknown | N/A | |
| Persuading Farsighted Receivers in MDPs: the Power of Honesty | Unknown | N/A | |
| An information-theoretic quantification of the content of communication between brain regions | Unknown | N/A | |
| Modulated Neural ODEs | Unknown | N/A | |
| CSLP-AE: A Contrastive Split-Latent Permutation Autoencoder Framework for Zero-Shot Electroencephalography Signal Conversion | Unknown | N/A | |
| Focused Transformer: Contrastive Training for Context Scaling | Unknown | N/A | |
| Mip-Grid: Anti-aliased Grid Representations for Neural Radiance Fields | Unknown | N/A | |
| EvoFed: Leveraging Evolutionary Strategies for Communication-Efficient Federated Learning | Unknown | N/A | |
| Bicriteria Multidimensional Mechanism Design with Side Information | Unknown | N/A | |
| PERFOGRAPH: A Numerical Aware Program Graph Representation for Performance Optimization and Program Analysis | Unknown | N/A | |
| TriRE: A Multi-Mechanism Learning Paradigm for Continual Knowledge Retention and Promotion | Unknown | N/A | |
| Scalarization for Multi-Task and Multi-Domain Learning at Scale | Unknown | N/A | |
| Paxion: Patching Action Knowledge in Video-Language Foundation Models | Unknown | N/A | |
| Parallel Submodular Function Minimization | Unknown | N/A | |
| Joint Learning of Label and Environment Causal Independence for Graph Out-of-Distribution Generalization | Unknown | N/A | |
| ConDaFormer: Disassembled Transformer with Local Structure Enhancement for 3D Point Cloud Understanding | Unknown | N/A | |
| Minimax Optimal Rate for Parameter Estimation in Multivariate Deviated Models | Unknown | N/A | |
| Mass-Producing Failures of Multimodal Systems with Language Models | Unknown | N/A | |
| Towards Evaluating Transfer-based Attacks Systematically, Practically, and Fairly | Unknown | N/A | |
| Brain-like Flexible Visual Inference by Harnessing Feedback Feedforward Alignment | Unknown | N/A | |
| Policy Space Diversity for Non-Transitive Games | Unknown | N/A | |
| Brain Diffusion for Visual Exploration: Cortical Discovery using Large Scale Generative Models | Unknown | N/A | |
| A Randomized Approach to Tight Privacy Accounting | Unknown | N/A | |
| The Benefits of Being Distributional: Small-Loss Bounds for Reinforcement Learning | Unknown | N/A | |
| Improving Adversarial Transferability via Intermediate-level Perturbation Decay | Unknown | N/A | |
| Sequential Predictive Two-Sample and Independence Testing | Unknown | N/A | |
| Retaining Beneficial Information from Detrimental Data for Neural Network Repair | Unknown | N/A | |
| Self-Supervised Learning of Representations for Space Generates Multi-Modular Grid Cells | Unknown | N/A | |
| Are Emergent Abilities of Large Language Models a Mirage? | Unknown | N/A | |
| SQ Lower Bounds for Learning Mixtures of Linear Classifiers | Unknown | N/A | |
| Sparse Modular Activation for Efficient Sequence Modeling | Unknown | N/A | |
| Rank-1 Matrix Completion with Gradient Descent and Small Random Initialization | Unknown | N/A | |
| Cross-Scale MAE: A Tale of Multiscale Exploitation in Remote Sensing | Unknown | N/A | |
| Approximate Allocation Matching for Structural Causal Bandits with Unobserved Confounders | Unknown | N/A | |
| Module-wise Adaptive Distillation for Multimodality Foundation Models | Unknown | N/A | |
| Make the U in UDA Matter: Invariant Consistency Learning for Unsupervised Domain Adaptation | Unknown | N/A | |
| STREAMER: Streaming Representation Learning and Event Segmentation in a Hierarchical Manner | Unknown | N/A | |
| Provably Fast Convergence of Independent Natural Policy Gradient for Markov Potential Games | Unknown | N/A | |
| Equivariant Neural Simulators for Stochastic Spatiotemporal Dynamics | Unknown | N/A | |
| Personalized Dictionary Learning for Heterogeneous Datasets | Unknown | N/A | |
| Expert load matters: operating networks at high accuracy and low manual effort | Unknown | N/A | |
| Implicit Convolutional Kernels for Steerable CNNs | Unknown | N/A | |
| Online RL in Linearly $q^\pi$-Realizable MDPs Is as Easy as in Linear MDPs If You Learn What to Ignore | Unknown | N/A | |
| Multi-Agent Learning with Heterogeneous Linear Contextual Bandits | Unknown | N/A | |
| Probabilistic Exponential Integrators | Unknown | N/A | |
| Back-Modality: Leveraging Modal Transformation for Data Augmentation | Unknown | N/A | |
| Multi-Swap k-Means++ | Unknown | N/A | |
| Improved Best-of-Both-Worlds Guarantees for Multi-Armed Bandits: FTRL with General Regularizers and Multiple Optimal Arms | Unknown | N/A | |
| DFRD: Data-Free Robustness Distillation for Heterogeneous Federated Learning | Unknown | N/A | |
| Neural Modulation for Flash Memory: An Unsupervised Learning Framework for Improved Reliability | Unknown | N/A | |
| ViSt3D: Video Stylization with 3D CNN | Unknown | N/A | |
| Iterative Reachability Estimation for Safe Reinforcement Learning | Unknown | N/A | |
| Evolving Standardization for Continual Domain Generalization over Temporal Drift | Unknown | N/A | |
| Efficient Neural Music Generation | Unknown | N/A | |
| Bounding training data reconstruction in DP-SGD | Unknown | N/A | |
| Tree of Thoughts: Deliberate Problem Solving with Large Language Models | Unknown | N/A | |
| DeepACO: Neural-enhanced Ant Systems for Combinatorial Optimization | Unknown | N/A | |
| Sequential Memory with Temporal Predictive Coding | Unknown | N/A | |
| PDE-Refiner: Achieving Accurate Long Rollouts with Neural PDE Solvers | Unknown | N/A | |
| Counting Distinct Elements in the Turnstile Model with Differential Privacy under Continual Observation | Unknown | N/A | |
| Color Equivariant Convolutional Networks | Unknown | N/A | |
| Learning Efficient Surrogate Dynamic Models with Graph Spline Networks | Unknown | N/A | |
| Optimization and Bayes: A Trade-off for Overparameterized Neural Networks | Unknown | N/A | |
| A Unified Solution for Privacy and Communication Efficiency in Vertical Federated Learning | Unknown | N/A | |
| Imitation Learning from Vague Feedback | Unknown | N/A | |
| Towards Efficient Image Compression Without Autoregressive Models | Unknown | N/A | |
| Mixture Weight Estimation and Model Prediction in Multi-source Multi-target Domain Adaptation | Unknown | N/A | |
| Sample-Conditioned Hypothesis Stability Sharpens Information-Theoretic Generalization Bounds | Unknown | N/A | |
| Robust Matrix Sensing in the Semi-Random Model | Unknown | N/A | |
| FourierGNN: Rethinking Multivariate Time Series Forecasting from a Pure Graph Perspective | Unknown | N/A | |
| Discovering General Reinforcement Learning Algorithms with Adversarial Environment Design | Unknown | N/A | |
| Differentiable Random Partition Models | Unknown | N/A | |
| Three Iterations of (d − 1)-WL Test Distinguish Non Isometric Clouds of d-dimensional Points | Unknown | N/A | |
| SAMoSSA: Multivariate Singular Spectrum Analysis with Stochastic Autoregressive Noise | Unknown | N/A | |
| Birth of a Transformer: A Memory Viewpoint | Unknown | N/A | |
| Stochastic Optimal Control for Collective Variable Free Sampling of Molecular Transition Paths | Unknown | N/A | |
| Cluster-aware Semi-supervised Learning: Relational Knowledge Distillation Provably Learns Clustering | Unknown | N/A | |
| On kernel-based statistical learning theory in the mean field limit | Unknown | N/A | |
| Adjustable Robust Reinforcement Learning for Online 3D Bin Packing | Unknown | N/A | |
| Partial Counterfactual Identification of Continuous Outcomes with a Curvature Sensitivity Model | Unknown | N/A | |
| Fair Graph Distillation | Unknown | N/A | |
| Where are we in the search for an Artificial Visual Cortex for Embodied Intelligence? | Unknown | N/A | |
| Moral Responsibility for AI Systems | Unknown | N/A | |
| Inverse Dynamics Pretraining Learns Good Representations for Multitask Imitation | Unknown | N/A | |
| Sorting with Predictions | Unknown | N/A | |
| Direct Diffusion Bridge using Data Consistency for Inverse Problems | Unknown | N/A | |
| Optimal Guarantees for Algorithmic Reproducibility and Gradient Complexity in Convex Optimization | Unknown | N/A | |
| No-Regret Online Reinforcement Learning with Adversarial Losses and Transitions | Unknown | N/A | |
| Flow-Based Feature Fusion for Vehicle-Infrastructure Cooperative 3D Object Detection | Unknown | N/A | |
| Adversarial Robustness in Graph Neural Networks: A Hamiltonian Approach | Unknown | N/A | |
| ContiFormer: Continuous-Time Transformer for Irregular Time Series Modeling | Unknown | N/A | |
| Estimating and Controlling for Equalized Odds via Sensitive Attribute Predictors | Unknown | N/A | |
| Sparse Parameterization for Epitomic Dataset Distillation | Unknown | N/A | |
| Deep Neural Collapse Is Provably Optimal for the Deep Unconstrained Features Model | Unknown | N/A | |
| Bayesian Extensive-Rank Matrix Factorization with Rotational Invariant Priors | Unknown | N/A | |
| RECESS Vaccine for Federated Learning: Proactive Defense Against Model Poisoning Attacks | Unknown | N/A | |
| SALSA VERDE: a machine learning attack on LWE with sparse small secrets | Unknown | N/A | |
| Federated Learning with Bilateral Curation for Partially Class-Disjoint Data | Unknown | N/A | |
| On the Identifiability and Interpretability of Gaussian Process Models | Unknown | N/A | |
| Towards Efficient and Accurate Winograd Convolution via Full Quantization | Unknown | N/A | |
| Semantic segmentation of sparse irregular point clouds for leaf/wood discrimination | Unknown | N/A | |
| AmadeusGPT: a natural language interface for interactive animal behavioral analysis | Unknown | N/A | |
| Sampling from Structured Log-Concave Distributions via a Soft-Threshold Dikin Walk | Unknown | N/A | |
| Efficient Exploration in Continuous-time Model-based Reinforcement Learning | Unknown | N/A | |
| On the Power of SVD in the Stochastic Block Model | Unknown | N/A | |
| A Pseudo-Semantic Loss for Autoregressive Models with Logical Constraints | Unknown | N/A | |
| A Fractional Graph Laplacian Approach to Oversmoothing | Unknown | N/A | |
| Learning Regularized Monotone Graphon Mean-Field Games | Unknown | N/A | |
| Learning From Biased Soft Labels | Unknown | N/A | |
| Large language models implicitly learn to straighten neural sentence trajectories to construct a predictive representation of natural language. | Unknown | N/A | |
| DreamWaltz: Make a Scene with Complex 3D Animatable Avatars | Unknown | N/A | |
| Noether Embedding: Efficient Learning of Temporal Regularities | Unknown | N/A | |
| An Optimization-based Approach To Node Role Discovery in Networks: Approximating Equitable Partitions | Unknown | N/A | |
| Understanding and Improving Feature Learning for Out-of-Distribution Generalization | Unknown | N/A | |
| The Tunnel Effect: Building Data Representations in Deep Neural Networks | Unknown | N/A | |
| Structure Learning with Adaptive Random Neighborhood Informed MCMC | Unknown | N/A | |
| FaceDNeRF: Semantics-Driven Face Reconstruction, Prompt Editing and Relighting with Diffusion Models | Unknown | N/A | |
| Deep Fractional Fourier Transform | Unknown | N/A | |
| Higher-Order Uncoupled Dynamics Do Not Lead to Nash Equilibrium - Except When They Do | Unknown | N/A | |
| Fast Conditional Mixing of MCMC Algorithms for Non-log-concave Distributions | Unknown | N/A | |
| Learning Dictionary for Visual Attention | Unknown | N/A | |
| Optimistic Active Exploration of Dynamical Systems | Unknown | N/A | |
| Label Correction of Crowdsourced Noisy Annotations with an Instance-Dependent Noise Transition Model | Unknown | N/A | |
| Cookie Consent Has Disparate Impact on Estimation Accuracy | Unknown | N/A | |
| Learning Large-scale Neural Fields via Context Pruned Meta-Learning | Unknown | N/A | |
| Asymptotics of Bayesian Uncertainty Estimation in Random Features Regression | Unknown | N/A | |
| Wasserstein distributional robustness of neural networks | Unknown | N/A | |
| Recurrent Hypernetworks are Surprisingly Strong in Meta-RL | Unknown | N/A | |
| Faster Margin Maximization Rates for Generic Optimization Methods | Unknown | N/A | |
| Getting ViT in Shape: Scaling Laws for Compute-Optimal Model Design | Unknown | N/A | |
| Transformer as a hippocampal memory consolidation model based on NMDAR-inspired nonlinearity | Unknown | N/A | |
| Outlier-Robust Gromov-Wasserstein for Graph Data | Unknown | N/A | |
| FairLISA: Fair User Modeling with Limited Sensitive Attributes Information | Unknown | N/A | |
| Reusing Pretrained Models by Multi-linear Operators for Efficient Training | Unknown | N/A | |
| Variational Weighting for Kernel Density Ratios | Unknown | N/A | |
| Taming Local Effects in Graph-based Spatiotemporal Forecasting | Unknown | N/A | |
| Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale | Unknown | N/A | |
| A Bayesian Take on Gaussian Process Networks | Unknown | N/A | |
| Granger Components Analysis: Unsupervised learning of latent temporal dependencies | Unknown | N/A | |
| Navigating the Pitfalls of Active Learning Evaluation: A Systematic Framework for Meaningful Performance Assessment | Unknown | N/A | |
| MADG: Margin-based Adversarial Learning for Domain Generalization | Unknown | N/A | |
| Constraint-Conditioned Policy Optimization for Versatile Safe Reinforcement Learning | Unknown | N/A | |
| ASIF: Coupled Data Turns Unimodal Models to Multimodal without Training | Unknown | N/A | |
| MeCo: Zero-Shot NAS with One Data and Single Forward Pass via Minimum Eigenvalue of Correlation | Unknown | N/A | |
| Federated Learning with Client Subsampling, Data Heterogeneity, and Unbounded Smoothness: A New Algorithm and Lower Bounds | Unknown | N/A | |
| Characterization of Overfitting in Robust Multiclass Classification | Unknown | N/A | |
| Interpretable Reward Redistribution in Reinforcement Learning: A Causal Approach | Unknown | N/A | |
| Leveraging sparse and shared feature activations for disentangled representation learning | Unknown | N/A | |
| Entropy-based Training Methods for Scalable Neural Implicit Samplers | Unknown | N/A | |
| Implicit Contrastive Representation Learning with Guided Stop-gradient | Unknown | N/A | |
| A Robust and Opponent-Aware League Training Method for StarCraft II | Unknown | N/A | |
| Score-based Generative Models with Lévy Processes | Unknown | N/A | |
| Task Arithmetic in the Tangent Space: Improved Editing of Pre-Trained Models | Unknown | N/A | |
| Diffusion-Based Adversarial Sample Generation for Improved Stealthiness and Controllability | Unknown | N/A | |
| Balancing Risk and Reward: A Batched-Bandit Strategy for Automated Phased Release | Unknown | N/A | |
| Curve Your Enthusiasm: Concurvity Regularization in Differentiable Generalized Additive Models | Unknown | N/A | |
| Corruption-Robust Offline Reinforcement Learning with General Function Approximation | Unknown | N/A | |
| Gaussian Membership Inference Privacy | Unknown | N/A | |
| Coherent Soft Imitation Learning | Unknown | N/A | |
| An Efficient and Robust Framework for Approximate Nearest Neighbor Search with Attribute Constraint | Unknown | N/A | |
| Multi-task Graph Neural Architecture Search with Task-aware Collaboration and Curriculum | Unknown | N/A | |
| Diff-Instruct: A Universal Approach for Transferring Knowledge From Pre-trained Diffusion Models | Unknown | N/A | |
| Towards a Unified Framework of Contrastive Learning for Disentangled Representations | Unknown | N/A | |
| A Theory of Transfer-Based Black-Box Attacks: Explanation and Implications | Unknown | N/A | |
| Convolution Monge Mapping Normalization for learning on sleep data | Unknown | N/A | |
| VPP: Efficient Conditional 3D Generation via Voxel-Point Progressive Representation | Unknown | N/A | |
| Triangulation Residual Loss for Data-efficient 3D Pose Estimation | Unknown | N/A | |
| Cross-modal Active Complementary Learning with Self-refining Correspondence | Unknown | N/A | |
| Private Distribution Learning with Public Data: The View from Sample Compression | Unknown | N/A | |
| Hierarchical Multi-Agent Skill Discovery | Unknown | N/A | |
| Double Pessimism is Provably Efficient for Distributionally Robust Offline Reinforcement Learning: Generic Algorithm and Robust Partial Coverage | Unknown | N/A | |
| Adaptive Online Replanning with Diffusion Models | Unknown | N/A | |
| Contrastive Modules with Temporal Attention for Multi-Task Reinforcement Learning | Unknown | N/A | |
| Failure-Aware Gaussian Process Optimization with Regret Bounds | Unknown | N/A | |
| Causal normalizing flows: from theory to practice | Unknown | N/A | |
| Multi-Agent First Order Constrained Optimization in Policy Space | Unknown | N/A | |
| Active Bipartite Ranking | Unknown | N/A | |
| Enhancing Adversarial Robustness via Score-Based Optimization | Unknown | N/A | |
| The Pursuit of Human Labeling: A New Perspective on Unsupervised Learning | Unknown | N/A | |
| Class-Distribution-Aware Pseudo-Labeling for Semi-Supervised Multi-Label Learning | Unknown | N/A | |
| Mask Propagation for Efficient Video Semantic Segmentation | Unknown | N/A | |
| Fantastic Robustness Measures: The Secrets of Robust Generalization | Unknown | N/A | |
| Two Heads are Better Than One: A Simple Exploration Framework for Efficient Multi-Agent Reinforcement Learning | Unknown | N/A | |
| Visual Programming for Step-by-Step Text-to-Image Generation and Evaluation | Unknown | N/A | |
| Distributional Pareto-Optimal Multi-Objective Reinforcement Learning | Unknown | N/A | |
| Reliable Off-Policy Learning for Dosage Combinations | Unknown | N/A | |
| Bounce: Reliable High-Dimensional Bayesian Optimization for Combinatorial and Mixed Spaces | Unknown | N/A | |
| Diffusion Representation for Asymmetric Kernels via Magnetic Transform | Unknown | N/A | |
| Context Shift Reduction for Offline Meta-Reinforcement Learning | Unknown | N/A | |
| Add and Thin: Diffusion for Temporal Point Processes | Unknown | N/A | |
| On quantum backpropagation, information reuse, and cheating measurement collapse | Unknown | N/A | |
| Optimal approximation using complex-valued neural networks | Unknown | N/A | |
| Learning Dense Flow Field for Highly-accurate Cross-view Camera Localization | Unknown | N/A | |
| MMGP: a Mesh Morphing Gaussian Process-based machine learning method for regression of physical problems under nonparametrized geometrical variability | Unknown | N/A | |
| InstanT: Semi-supervised Learning with Instance-dependent Thresholds | Unknown | N/A | |
| Complexity Matters: Rethinking the Latent Space for Generative Modeling | Unknown | N/A | |
| Efficient Policy Adaptation with Contrastive Prompt Ensemble for Embodied Agents | Unknown | N/A | |
| Mode Connectivity in Auction Design | Unknown | N/A | |
| Few-Shot Class-Incremental Learning via Training-Free Prototype Calibration | Unknown | N/A | |
| One Risk to Rule Them All: A Risk-Sensitive Perspective on Model-Based Offline Reinforcement Learning | Unknown | N/A | |
| PPi: Pretraining Brain Signal Model for Patient-independent Seizure Detection | Unknown | N/A | |
| Mix-of-Show: Decentralized Low-Rank Adaptation for Multi-Concept Customization of Diffusion Models | Unknown | N/A | |
| A General Framework for Equivariant Neural Networks on Reductive Lie Groups | Unknown | N/A | |
| Nearest Neighbour with Bandit Feedback | Unknown | N/A | |
| Curvature Filtrations for Graph Generative Model Evaluation | Unknown | N/A | |
| Adversarial Attacks on Online Learning to Rank with Click Feedback | Unknown | N/A | |
| Bayesian nonparametric (non-)renewal processes for analyzing neural spike train variability | Unknown | N/A | |
| L-C2ST: Local Diagnostics for Posterior Approximations in Simulation-Based Inference | Unknown | N/A | |
| Volume Feature Rendering for Fast Neural Radiance Field Reconstruction | Unknown | N/A | |
| FedL2P: Federated Learning to Personalize | Unknown | N/A | |
| Continuous Parametric Optical Flow | Unknown | N/A | |
| Extremal Domain Translation with Neural Optimal Transport | Unknown | N/A | |
| A Guide Through the Zoo of Biased SGD | Unknown | N/A | |
| Nearly Tight Bounds For Differentially Private Multiway Cut | Unknown | N/A | |
| CamoPatch: An Evolutionary Strategy for Generating Camoflauged Adversarial Patches | Unknown | N/A | |
| Towards Data-Algorithm Dependent Generalization: a Case Study on Overparameterized Linear Regression | Unknown | N/A | |
| How do Minimum-Norm Shallow Denoisers Look in Function Space? | Unknown | N/A | |
| Training Fully Connected Neural Networks is $\exists\mathbb{R}$-Complete | Unknown | N/A | |
| The Rank-Reduced Kalman Filter: Approximate Dynamical-Low-Rank Filtering In High Dimensions | Unknown | N/A | |
| On the Minimax Regret for Online Learning with Feedback Graphs | Unknown | N/A | |
| Optimal cross-learning for contextual bandits with unknown context distributions | Unknown | N/A | |
| GeoTMI: Predicting Quantum Chemical Property with Easy-to-Obtain Geometry via Positional Denoising | Unknown | N/A | |
| Greedy Pruning with Group Lasso Provably Generalizes for Matrix Sensing | Unknown | N/A | |
| Approximate inference of marginals using the IBIA framework | Unknown | N/A | |
| A Unified Framework for Rank-based Loss Minimization | Unknown | N/A | |
| On Evaluating Adversarial Robustness of Large Vision-Language Models | Unknown | N/A | |
| GUST: Combinatorial Generalization by Unsupervised Grouping with Neuronal Coherence | Unknown | N/A | |
| Boosting Learning for LDPC Codes to Improve the Error-Floor Performance | Unknown | N/A | |
| Combating Representation Learning Disparity with Geometric Harmonization | Unknown | N/A | |
| HAP: Structure-Aware Masked Image Modeling for Human-Centric Perception | Unknown | N/A | |
| Rigorous Runtime Analysis of MOEA/D for Solving Multi-Objective Minimum Weight Base Problems | Unknown | N/A | |
| Learning Sample Difficulty from Pre-trained Models for Reliable Prediction | Unknown | N/A | |
| Revisiting Logistic-softmax Likelihood in Bayesian Meta-Learning for Few-Shot Classification | Unknown | N/A | |
| Computational Guarantees for Doubly Entropic Wasserstein Barycenters | Unknown | N/A | |
| Tailoring Self-Attention for Graph via Rooted Subtrees | Unknown | N/A | |
| Make Pre-trained Model Reversible: From Parameter to Memory Efficient Fine-Tuning | Unknown | N/A | |
| Vocabulary-free Image Classification | Unknown | N/A | |
| Boosting Verification of Deep Reinforcement Learning via Piece-Wise Linear Decision Neural Networks | Unknown | N/A | |
| Training-free Diffusion Model Adaptation for Variable-Sized Text-to-Image Synthesis | Unknown | N/A | |
| Operation-Level Early Stopping for Robustifying Differentiable NAS | Unknown | N/A | |
| Towards Data-Agnostic Pruning At Initialization: What Makes a Good Sparse Mask? | Unknown | N/A | |
| Neural Processes with Stability | Unknown | N/A | |
| Minimax Risks and Optimal Procedures for Estimation under Functional Local Differential Privacy | Unknown | N/A | |
| DeepPCR: Parallelizing Sequential Operations in Neural Networks | Unknown | N/A | |
| Grounded Decoding: Guiding Text Generation with Grounded Models for Embodied Agents | Unknown | N/A | |
| Joint Training of Deep Ensembles Fails Due to Learner Collusion | Unknown | N/A | |
| Statistical Analysis of Quantum State Learning Process in Quantum Neural Networks | Unknown | N/A | |
| How to Turn Your Knowledge Graph Embeddings into Generative Models | Unknown | N/A | |
| Learn to Categorize or Categorize to Learn? Self-Coding for Generalized Category Discovery | Unknown | N/A | |
| TMT-VIS: Taxonomy-aware Multi-dataset Joint Training for Video Instance Segmentation | Unknown | N/A | |
| A Scale-Invariant Sorting Criterion to Find a Causal Order in Additive Noise Models | Unknown | N/A | |
| Delegated Classification | Unknown | N/A | |
| Comparing Causal Frameworks: Potential Outcomes, Structural Models, Graphs, and Abstractions | Unknown | N/A | |
| Learning Invariant Representations of Graph Neural Networks via Cluster Generalization | Unknown | N/A | |
| Computational Complexity of Learning Neural Networks: Smoothness and Degeneracy | Unknown | N/A | |
| Zero-sum Polymatrix Markov Games: Equilibrium Collapse and Efficient Computation of Nash Equilibria | Unknown | N/A | |
| SoTTA: Robust Test-Time Adaptation on Noisy Data Streams | Unknown | N/A | |
| Limits, approximation and size transferability for GNNs on sparse graphs via graphops | Unknown | N/A | |
| Stabilized Neural Differential Equations for Learning Dynamics with Explicit Constraints | Unknown | N/A | |
| Learning from Both Structural and Textual Knowledge for Inductive Knowledge Graph Completion | Unknown | N/A | |
| Facing Off World Model Backbones: RNNs, Transformers, and S4 | Unknown | N/A | |
| Homotopy-based training of NeuralODEs for accurate dynamics discovery | Unknown | N/A | |
| Combinatorial Optimization with Policy Adaptation using Latent Space Search | Unknown | N/A | |
| Quantum Bayesian Optimization | Unknown | N/A | |
| Assumption violations in causal discovery and the robustness of score matching | Unknown | N/A | |
| Minimum Description Length and Generalization Guarantees for Representation Learning | Unknown | N/A | |
| Multi-resolution Spectral Coherence for Graph Generation with Score-based Diffusion | Unknown | N/A | |
| Batch Bayesian Optimization For Replicable Experimental Design | Unknown | N/A | |
| Revisiting Adversarial Training for ImageNet: Architectures, Training and Generalization across Threat Models | Unknown | N/A | |
| Diffused Task-Agnostic Milestone Planner | Unknown | N/A | |
| PICProp: Physics-Informed Confidence Propagation for Uncertainty Quantification | Unknown | N/A | |
| Riemannian SAM: Sharpness-Aware Minimization on Riemannian Manifolds | Unknown | N/A | |
| ODE-based Recurrent Model-free Reinforcement Learning for POMDPs | Unknown | N/A | |
| LEACE: Perfect linear concept erasure in closed form | Unknown | N/A | |
| Intra-Modal Proxy Learning for Zero-Shot Visual Categorization with CLIP | Unknown | N/A | |
| Efficient Hyper-parameter Optimization with Cubic Regularization | Unknown | N/A | |
| Effective Targeted Attacks for Adversarial Self-Supervised Learning | Unknown | N/A | |
| Hypernetwork-based Meta-Learning for Low-Rank Physics-Informed Neural Networks | Unknown | N/A | |
| Debiased and Denoised Entity Recognition from Distant Supervision | Unknown | N/A | |
| Schema-learning and rebinding as mechanisms of in-context learning and emergence | Unknown | N/A | |
| State Sequences Prediction via Fourier Transform for Representation Learning | Unknown | N/A | |
| Analysis of Variance of Multiple Causal Networks | Unknown | N/A | |
| Undirected Probabilistic Model for Tensor Decomposition | Unknown | N/A | |
| Differentiable and Stable Long-Range Tracking of Multiple Posterior Modes | Unknown | N/A | |
| LICO: Explainable Models with Language-Image COnsistency | Unknown | N/A | |
| Softmax Output Approximation for Activation Memory-Efficient Training of Attention-based Networks | Unknown | N/A | |
| Active Learning-Based Species Range Estimation | Unknown | N/A | |
| Generative Modeling through the Semi-dual Formulation of Unbalanced Optimal Transport | Unknown | N/A | |
| Aging with GRACE: Lifelong Model Editing with Discrete Key-Value Adaptors | Unknown | N/A | |
| The Utility of “Even if” Semifactual Explanation to Optimise Positive Outcomes | Unknown | N/A | |
| Beyond Black-Box Advice: Learning-Augmented Algorithms for MDPs with Q-Value Predictions | Unknown | N/A | |
| Dynamics Generalisation in Reinforcement Learning via Adaptive Context-Aware Policies | Unknown | N/A | |
| Necessary and Sufficient Conditions for Optimal Decision Trees using Dynamic Programming | Unknown | N/A | |
| Mind the spikes: Benign overfitting of kernels and neural networks in fixed dimension | Unknown | N/A | |
| Removing Hidden Confounding in Recommendation: A Unified Multi-Task Learning Approach | Unknown | N/A | |
| A Computationally Efficient Sparsified Online Newton Method | Unknown | N/A | |
| Multi-Object Representation Learning via Feature Connectivity and Object-Centric Regularization | Unknown | N/A | |
| Integration-free Training for Spatio-temporal Multimodal Covariate Deep Kernel Point Processes | Unknown | N/A | |
| Efficient Subgame Refinement for Extensive-form Games | Unknown | N/A | |
| Recaptured Raw Screen Image and Video Demoiréing via Channel and Spatial Modulations | Unknown | N/A | |
| Unleashing the Power of Graph Data Augmentation on Covariate Distribution Shift | Unknown | N/A | |
| Eliminating Catastrophic Overfitting Via Abnormal Adversarial Examples Regularization | Unknown | N/A | |
| Balance, Imbalance, and Rebalance: Understanding Robust Overfitting from a Minimax Game Perspective | Unknown | N/A | |
| Contextually Affinitive Neighborhood Refinery for Deep Clustering | Unknown | N/A | |
| A Dynamical System View of Langevin-Based Non-Convex Sampling | Unknown | N/A | |
| Extracting Reward Functions from Diffusion Models | Unknown | N/A | |
| Riemannian stochastic optimization methods avoid strict saddle points | Unknown | N/A | |
| Rewiring Neurons in Non-Stationary Environments | Unknown | N/A | |
| DIFFER: Decomposing Individual Reward for Fair Experience Replay in Multi-Agent Reinforcement Learning | Unknown | N/A | |
| Empowering Collaborative Filtering with Principled Adversarial Contrastive Loss | Unknown | N/A | |
| Provably Robust Temporal Difference Learning for Heavy-Tailed Rewards | Unknown | N/A | |
| Particle-based Variational Inference with Generalized Wasserstein Gradient Flow | Unknown | N/A | |
| Soft-Unification in Deep Probabilistic Logic | Unknown | N/A | |
| Robust covariance estimation with missing values and cell-wise contamination | Unknown | N/A | |
| Tree-Based Diffusion Schrödinger Bridge with Applications to Wasserstein Barycenters | Unknown | N/A | |
| Investigating how ReLU-networks encode symmetries | Unknown | N/A | |
| Enhancing Minority Classes by Mixing: An Adaptative Optimal Transport Approach for Long-tailed Classification | Unknown | N/A | |
| Activity Grammars for Temporal Action Segmentation | Unknown | N/A | |
| Generate What You Prefer: Reshaping Sequential Recommendation via Guided Diffusion | Unknown | N/A | |
| Fine-Grained Cross-View Geo-Localization Using a Correlation-Aware Homography Estimator | Unknown | N/A | |
| DiffTraj: Generating GPS Trajectory with Diffusion Probabilistic Model | Unknown | N/A | |
| SegRefiner: Towards Model-Agnostic Segmentation Refinement with Discrete Diffusion Process | Unknown | N/A | |
| CSOT: Curriculum and Structure-Aware Optimal Transport for Learning with Noisy Labels | Unknown | N/A | |
| Lift Yourself Up: Retrieval-augmented Text Generation with Self-Memory | Unknown | N/A | |
| Prompt Pre-Training with Twenty-Thousand Classes for Open-Vocabulary Visual Recognition | Unknown | N/A | |
| On the Consistency of Maximum Likelihood Estimation of Probabilistic Principal Component Analysis | Unknown | N/A | |
| An active learning framework for multi-group mean estimation | Unknown | N/A | |
| From ViT Features to Training-free Video Object Segmentation via Streaming-data Mixture Models | Unknown | N/A | |
| Energy-Based Models for Anomaly Detection: A Manifold Diffusion Recovery Approach | Unknown | N/A | |
| Unsupervised Image Denoising with Score Function | Unknown | N/A | |
| Reward Scale Robustness for Proximal Policy Optimization via DreamerV3 Tricks | Unknown | N/A | |
| Optimal Transport Model Distributional Robustness | Unknown | N/A | |
| WalkLM: A Uniform Language Model Fine-tuning Framework for Attributed Graph Embedding | Unknown | N/A | |
| On the Asymptotic Learning Curves of Kernel Ridge Regression under Power-law Decay | Unknown | N/A | |
| Information Theoretic Lower Bounds for Information Theoretic Upper Bounds | Unknown | N/A | |
| Incomplete Multimodality-Diffused Emotion Recognition | Unknown | N/A | |
| Smoothing the Landscape Boosts the Signal for SGD: Optimal Sample Complexity for Learning Single Index Models | Unknown | N/A | |
| Toolformer: Language Models Can Teach Themselves to Use Tools | Unknown | N/A | |
| Contrast, Attend and Diffuse to Decode High-Resolution Images from Brain Activities | Unknown | N/A | |
| RiskQ: Risk-sensitive Multi-Agent Reinforcement Learning Value Factorization | Unknown | N/A | |
| Sample-efficient Multi-objective Molecular Optimization with GFlowNets | Unknown | N/A | |
| Diffusion Models and Semi-Supervised Learners Benefit Mutually with Few Labels | Unknown | N/A | |
| Flat Seeking Bayesian Neural Networks | Unknown | N/A | |
| AR-Diffusion: Auto-Regressive Diffusion Model for Text Generation | Unknown | N/A | |
| Generalized Logit Adjustment: Calibrating Fine-tuned Models by Removing Label Bias in Foundation Models | Unknown | N/A | |
| Constructing Non-isotropic Gaussian Diffusion Model Using Isotropic Gaussian Diffusion Model for Image Editing | Unknown | N/A | |
| Contrastive Retrospection: honing in on critical steps for rapid learning and generalization in RL | Unknown | N/A | |
| Optimal Parameter and Neuron Pruning for Out-of-Distribution Detection | Unknown | N/A | |
| Nonparametric Teaching for Multiple Learners | Unknown | N/A | |
| A Hierarchical Spatial Transformer for Massive Point Samples in Continuous Space | Unknown | N/A | |
| Stability and Generalization of the Decentralized Stochastic Gradient Descent Ascent Algorithm | Unknown | N/A | |
| Multi-Modal Inverse Constrained Reinforcement Learning from a Mixture of Demonstrations | Unknown | N/A | |
| REFINE: A Fine-Grained Medication Recommendation System Using Deep Learning and Personalized Drug Interaction Modeling | Unknown | N/A | |
| MG-ViT: A Multi-Granularity Method for Compact and Efficient Vision Transformers | Unknown | N/A | |
| Unsupervised Behavior Extraction via Random Intent Priors | Unknown | N/A | |
| Towards Accelerated Model Training via Bayesian Data Selection | Unknown | N/A | |
| Accelerating Monte Carlo Tree Search with Probability Tree State Abstraction | Unknown | N/A | |
| UniControl: A Unified Diffusion Model for Controllable Visual Generation In the Wild | Unknown | N/A | |
| Mutual Information Regularized Offline Reinforcement Learning | Unknown | N/A | |
| The Exact Sample Complexity Gain from Invariances for Kernel Regression | Unknown | N/A | |
| Exploiting Contextual Objects and Relations for 3D Visual Grounding | Unknown | N/A | |
| Cinematic Mindscapes: High-quality Video Reconstruction from Brain Activity | Unknown | N/A | |
| Zero-Regret Performative Prediction Under Inequality Constraints | Unknown | N/A | |
| Calibration by Distribution Matching: Trainable Kernel Calibration Metrics | Unknown | N/A | |
| Partial Label Learning with Dissimilarity Propagation guided Candidate Label Shrinkage | Unknown | N/A | |
| Bandit Social Learning under Myopic Behavior | Unknown | N/A | |
| An Efficient End-to-End Training Approach for Zero-Shot Human-AI Coordination | Unknown | N/A | |
| SE(3) Diffusion Model-based Point Cloud Registration for Robust 6D Object Pose Estimation | Unknown | N/A | |
| Diversify & Conquer: Outcome-directed Curriculum RL via Out-of-Distribution Disagreement | Unknown | N/A | |
| Projection-Free Methods for Stochastic Simple Bilevel Optimization with Convex Lower-level Problem | Unknown | N/A | |
| Compositional Foundation Models for Hierarchical Planning | Unknown | N/A | |
| NPCL: Neural Processes for Uncertainty-Aware Continual Learning | Unknown | N/A | |
| Accelerated Quasi-Newton Proximal Extragradient: Faster Rate for Smooth Convex Optimization | Unknown | N/A | |
| ARTree: A Deep Autoregressive Model for Phylogenetic Inference | Unknown | N/A | |
| Graph Denoising Diffusion for Inverse Protein Folding | Unknown | N/A | |
| Visual Instruction Inversion: Image Editing via Image Prompting | Unknown | N/A | |
| DiViNeT: 3D Reconstruction from Disparate Views using Neural Template Regularization | Unknown | N/A | |
| Unleash the Potential of Image Branch for Cross-modal 3D Object Detection | Unknown | N/A | |
| Weakly Supervised 3D Open-vocabulary Segmentation | Unknown | N/A | |
| Complex Query Answering on Eventuality Knowledge Graph with Implicit Logical Constraints | Unknown | N/A | |
| Leveraging Early-Stage Robustness in Diffusion Models for Efficient and High-Quality Image Synthesis | Unknown | N/A | |
| A Unified Discretization Framework for Differential Equation Approach with Lyapunov Arguments for Convex Optimization | Unknown | N/A | |
| Parameterizing Non-Parametric Meta-Reinforcement Learning Tasks via Subtask Decomposition | Unknown | N/A | |
| On Differentially Private Sampling from Gaussian and Product Distributions | Unknown | N/A | |
| On the Stability-Plasticity Dilemma in Continual Meta-Learning: Theory and Algorithm | Unknown | N/A | |
| LightSpeed: Light and Fast Neural Light Fields on Mobile Devices | Unknown | N/A | |
| Refining Diffusion Planner for Reliable Behavior Synthesis by Automatic Detection of Infeasible Plans | Unknown | N/A | |
| Laplacian Canonization: A Minimalist Approach to Sign and Basis Invariant Spectral Embedding | Unknown | N/A | |
| Recovering from Out-of-sample States via Inverse Dynamics in Offline Reinforcement Learning | Unknown | N/A | |
| Neural Oscillators are Universal | Unknown | N/A | |
| Online Corrupted User Detection and Regret Minimization | Unknown | N/A | |
| On the Gini-impurity Preservation For Privacy Random Forests | Unknown | N/A | |
| On the Properties of Kullback-Leibler Divergence Between Multivariate Gaussian Distributions | Unknown | N/A | |
| Exploring the Optimal Choice for Generative Processes in Diffusion Models: Ordinary vs Stochastic Differential Equations | Unknown | N/A | |
| Understanding Multi-phase Optimization Dynamics and Rich Nonlinear Behaviors of ReLU Networks | Unknown | N/A | |
| Adaptive Test-Time Personalization for Federated Learning | Unknown | N/A | |
| Errors-in-variables Fréchet Regression with Low-rank Covariate Approximation | Unknown | N/A | |
| Federated Conditional Stochastic Optimization | Unknown | N/A | |
| Offline Multi-Agent Reinforcement Learning with Implicit Global-to-Local Value Regularization | Unknown | N/A | |
| Follow-ups Also Matter: Improving Contextual Bandits via Post-serving Contexts | Unknown | N/A | |
| Hierarchical Semi-Implicit Variational Inference with Application to Diffusion Model Acceleration | Unknown | N/A | |
| Gaussian Mixture Solvers for Diffusion Models | Unknown | N/A | |
| Subclass-Dominant Label Noise: A Counterexample for the Success of Early Stopping | Unknown | N/A | |
| Hyperbolic VAE via Latent Gaussian Distributions | Unknown | N/A | |
| Efficient Low-rank Backpropagation for Vision Transformer Adaptation | Unknown | N/A | |
| Bilevel Coreset Selection in Continual Learning: A New Formulation and Algorithm | Unknown | N/A | |
| Static and Sequential Malicious Attacks in the Context of Selective Forgetting | Unknown | N/A | |
| Online Clustering of Bandits with Misspecified User Models | Unknown | N/A | |
| The probability flow ODE is provably fast | Unknown | N/A | |
| On Calibrating Diffusion Probabilistic Models | Unknown | N/A | |
| Category-Extensible Out-of-Distribution Detection via Hierarchical Context Descriptions | Unknown | N/A | |
| Statistical Insights into HSIC in High Dimensions | Unknown | N/A | |
| Disentangled Counterfactual Learning for Physical Audiovisual Commonsense Reasoning | Unknown | N/A | |
| Self-Weighted Contrastive Learning among Multiple Views for Mitigating Representation Degeneration | Unknown | N/A | |
| AlpacaFarm: A Simulation Framework for Methods that Learn from Human Feedback | Unknown | N/A | |
| Restart Sampling for Improving Generative Processes | Unknown | N/A | |
| Geometry-Aware Adaptation for Pretrained Models | Unknown | N/A | |
| Incentives in Private Collaborative Machine Learning | Unknown | N/A | |
| Bayesian Optimization with Cost-varying Variable Subsets | Unknown | N/A | |
| Conditional Score Guidance for Text-Driven Image-to-Image Translation | Unknown | N/A | |
| Gold-YOLO: Efficient Object Detector via Gather-and-Distribute Mechanism | Unknown | N/A | |
| What Makes Good Examples for Visual In-Context Learning? | Unknown | N/A | |
| Multinomial Logistic Regression: Asymptotic Normality on Null Covariates in High-Dimensions | Unknown | N/A | |
| Accelerating Reinforcement Learning with Value-Conditional State Entropy Exploration | Unknown | N/A | |
| D-Separation for Causal Self-Explanation | Unknown | N/A | |
| Unbiased Compression Saves Communication in Distributed Optimization: When and How Much? | Unknown | N/A | |
| Perceptual Kalman Filters: Online State Estimation under a Perfect Perceptual-Quality Constraint | Unknown | N/A | |
| DSR: Dynamical Surface Representation as Implicit Neural Networks for Protein | Unknown | N/A | |
| Two-Stage Predict+Optimize for MILPs with Unknown Parameters in Constraints | Unknown | N/A | |
| Achieving Cross Modal Generalization with Multimodal Unified Representation | Unknown | N/A | |
| Generator Identification for Linear SDEs with Additive and Multiplicative Noise | Unknown | N/A | |
| On the Overlooked Structure of Stochastic Gradients | Unknown | N/A | |
| Private Everlasting Prediction | Unknown | N/A | |
| Hierarchical Gaussian Mixture based Task Generative Model for Robust Meta-Learning | Unknown | N/A | |
| Patch Diffusion: Faster and More Data-Efficient Training of Diffusion Models | Unknown | N/A | |
| In-Context Learning Unlocked for Diffusion Models | Unknown | N/A | |
| MGDD: A Meta Generator for Fast Dataset Distillation | Unknown | N/A | |
| Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification | Unknown | N/A | |
| Nominality Score Conditioned Time Series Anomaly Detection by Point/Sequential Reconstruction | Unknown | N/A | |
| A Riemannian Exponential Augmented Lagrangian Method for Computing the Projection Robust Wasserstein Distance | Unknown | N/A | |
| Large Language Models as Commonsense Knowledge for Large-Scale Task Planning | Unknown | N/A | |
| FACE: Evaluating Natural Language Generation with Fourier Analysis of Cross-Entropy | Unknown | N/A | |
| $L_2$-Uniform Stability of Randomized Learning Algorithms: Sharper Generalization Bounds and Confidence Boosting | Unknown | N/A | |
| Federated Spectral Clustering via Secure Similarity Reconstruction | Unknown | N/A | |
| Neural Graph Generation from Graph Statistics | Unknown | N/A | |
| Model-Based Control with Sparse Neural Dynamics | Unknown | N/A | |
| Birder: Communication-Efficient 1-bit Adaptive Optimizer for Practical Distributed DNN Training | Unknown | N/A | |
| Fast Rank-1 Lattice Targeted Sampling for Black-box Optimization | Unknown | N/A | |
| ToolkenGPT: Augmenting Frozen Language Models with Massive Tools via Tool Embeddings | Unknown | N/A | |
| Newton–Cotes Graph Neural Networks: On the Time Evolution of Dynamic Systems | Unknown | N/A | |
| Tuning Multi-mode Token-level Prompt Alignment across Modalities | Unknown | N/A | |
| Variance-Reduced Gradient Estimation via Noise-Reuse in Online Evolution Strategies | Unknown | N/A | |
| S-CLIP: Semi-supervised Vision-Language Learning using Few Specialist Captions | Unknown | N/A | |
| Text Promptable Surgical Instrument Segmentation with Vision-Language Models | Unknown | N/A | |
| Towards A Richer 2D Understanding of Hands at Scale | Unknown | N/A | |
| A Finite-Particle Convergence Rate for Stein Variational Gradient Descent | Unknown | N/A | |
| Towards Semi-Structured Automatic ICD Coding via Tree-based Contrastive Learning | Unknown | N/A | |
| A Novel Approach for Effective Multi-View Clustering with Information-Theoretic Perspective | Unknown | N/A | |
| Feature Adaptation for Sparse Linear Regression | Unknown | N/A | |
| Ecosystem-level Analysis of Deployed Machine Learning Reveals Homogeneous Outcomes | Unknown | N/A | |
| Efficient Adaptation of Large Vision Transformer via Adapter Re-Composing | Unknown | N/A | |
| Contrast Everything: A Hierarchical Contrastive Framework for Medical Time-Series | Unknown | N/A | |
| DoWG Unleashed: An Efficient Universal Parameter-Free Gradient Descent Method | Unknown | N/A | |
| Efficient Testable Learning of Halfspaces with Adversarial Label Noise | Unknown | N/A | |
| Critical Initialization of Wide and Deep Neural Networks using Partial Jacobians: General Theory and Applications | Unknown | N/A | |
| Stable and low-precision training for large-scale vision-language models | Unknown | N/A | |
| Provable Guarantees for Generative Behavior Cloning: Bridging Low-Level Stability and High-Level Behavior | Unknown | N/A | |
| Wide Neural Networks as Gaussian Processes: Lessons from Deep Equilibrium Models | Unknown | N/A | |
| TabMT: Generating tabular data with masked transformers | Unknown | N/A | |
| Tools for Verifying Neural Models' Training Data | Unknown | N/A | |
| Provable Guarantees for Nonlinear Feature Learning in Three-Layer Neural Networks | Unknown | N/A | |
| Clustering the Sketch: Dynamic Compression for Embedding Tables | Unknown | N/A | |
| Inconsistency, Instability, and Generalization Gap of Deep Neural Network Training | Unknown | N/A | |
| Finite-Time Logarithmic Bayes Regret Upper Bounds | Unknown | N/A | |
| Adversarial Resilience in Sequential Prediction via Abstention | Unknown | N/A | |
| Front-door Adjustment Beyond Markov Equivalence with Limited Graph Knowledge | Unknown | N/A | |
| Balancing memorization and generalization in RNNs for high performance brain-machine Interfaces | Unknown | N/A | |
| Understanding and Mitigating Copying in Diffusion Models | Unknown | N/A | |
| Bootstrapping Vision-Language Learning with Decoupled Language Pre-training | Unknown | N/A | |
| On Generalization Bounds for Projective Clustering | Unknown | N/A | |
| Rethinking the Role of Token Retrieval in Multi-Vector Retrieval | Unknown | N/A | |
| QuIP: 2-Bit Quantization of Large Language Models With Guarantees | Unknown | N/A | |
| Approximately Equivariant Graph Networks | Unknown | N/A | |
| GNNEvaluator: Evaluating GNN Performance On Unseen Graphs Without Labels | Unknown | N/A | |
| Structure-free Graph Condensation: From Large-scale Graphs to Condensed Graph-free Data | Unknown | N/A | |
| Private (Stochastic) Non-Convex Optimization Revisited: Second-Order Stationary Points and Excess Risks | Unknown | N/A | |
| Where Did I Come From? Origin Attribution of AI-Generated Images | Unknown | N/A | |
| Your representations are in the network: composable and parallel adaptation for large scale models | Unknown | N/A | |
| Faster Relative Entropy Coding with Greedy Rejection Coding | Unknown | N/A | |
| Visual Instruction Tuning | Unknown | N/A | |
| Aiming towards the minimizers: fast convergence of SGD for overparametrized problems | Unknown | N/A | |
| Near-Optimal $k$-Clustering in the Sliding Window Model | Unknown | N/A | |
| Mobilizing Personalized Federated Learning in Infrastructure-Less and Heterogeneous Environments via Random Walk Stochastic ADMM | Unknown | N/A | |
| The Curious Price of Distributional Robustness in Reinforcement Learning with a Generative Model | Unknown | N/A | |
| Reconstructing the Mind's Eye: fMRI-to-Image with Contrastive Learning and Diffusion Priors | Unknown | N/A | |
| Fast Asymptotically Optimal Algorithms for Non-Parametric Stochastic Bandits | Unknown | N/A | |
| Order Matters in the Presence of Dataset Imbalance for Multilingual Learning | Unknown | N/A | |
| Invariant Anomaly Detection under Distribution Shifts: A Causal Perspective | Unknown | N/A | |
| InfoCD: A Contrastive Chamfer Distance Loss for Point Cloud Completion | Unknown | N/A | |
| The Crucial Role of Normalization in Sharpness-Aware Minimization | Unknown | N/A | |
| Causal Imitability Under Context-Specific Independence Relations | Unknown | N/A | |
| Identifiability Guarantees for Causal Disentanglement from Soft Interventions | Unknown | N/A | |
| Anonymous and Copy-Robust Delegations for Liquid Democracy | Unknown | N/A | |
| Boosting with Tempered Exponential Measures | Unknown | N/A | |
| TART: A plug-and-play Transformer module for task-agnostic reasoning | Unknown | N/A | |
| SimFBO: Towards Simple, Flexible and Communication-efficient Federated Bilevel Learning | Unknown | N/A | |
| Human-like Few-Shot Learning via Bayesian Reasoning over Natural Language | Unknown | N/A | |
| Reward-Directed Conditional Diffusion: Provable Distribution Estimation and Reward Improvement | Unknown | N/A | |
| Demo2Code: From Summarizing Demonstrations to Synthesizing Code via Extended Chain-of-Thought | Unknown | N/A | |
| Distribution Learnability and Robustness | Unknown | N/A | |
| Performance Bounds for Policy-Based Average Reward Reinforcement Learning Algorithms | Unknown | N/A | |
| Stochastic Multi-armed Bandits: Optimal Trade-off among Optimality, Consistency, and Tail Risk | Unknown | N/A | |
| Distilling Out-of-Distribution Robustness from Vision-Language Foundation Models | Unknown | N/A | |
| An $\varepsilon$-Best-Arm Identification Algorithm for Fixed-Confidence and Beyond | Unknown | N/A | |
| GloptiNets: Scalable Non-Convex Optimization with Certificates | Unknown | N/A | |
| Time Series as Images: Vision Transformer for Irregularly Sampled Time Series | Unknown | N/A | |
| CELLE-2: Translating Proteins to Pictures and Back with a Bidirectional Text-to-Image Transformer | Unknown | N/A | |
| Explaining Predictive Uncertainty with Information Theoretic Shapley Values | Unknown | N/A | |
| The Behavior and Convergence of Local Bayesian Optimization | Unknown | N/A | |
| The Transient Nature of Emergent In-Context Learning in Transformers | Unknown | N/A | |
| Im-Promptu: In-Context Composition from Image Prompts | Unknown | N/A | |
| Scalable Membership Inference Attacks via Quantile Regression | Unknown | N/A | |
| Augmenting Language Models with Long-Term Memory | Unknown | N/A | |
| Which Models have Perceptually-Aligned Gradients? An Explanation via Off-Manifold Robustness | Unknown | N/A | |
| Fairness Continual Learning Approach to Semantic Scene Understanding in Open-World Environments | Unknown | N/A | |
| Information-guided Planning: An Online Approach for Partially Observable Problems | Unknown | N/A | |
| Energy Discrepancies: A Score-Independent Loss for Energy-Based Models | Unknown | N/A | |
| Fast Optimal Locally Private Mean Estimation via Random Projections | Unknown | N/A | |
| Posterior Contraction Rates for Matérn Gaussian Processes on Riemannian Manifolds | Unknown | N/A | |
| On the impact of activation and normalization in obtaining isometric embeddings at initialization | Unknown | N/A | |
| Zero-One Laws of Graph Neural Networks | Unknown | N/A | |
| Deep Stochastic Processes via Functional Markov Transition Operators | Unknown | N/A | |
| Computing a human-like reaction time metric from stable recurrent vision models | Unknown | N/A | |
| What can a Single Attention Layer Learn? A Study Through the Random Features Lens | Unknown | N/A | |
| Active representation learning for general task space with applications in robotics | Unknown | N/A | |
| Analyzing Vision Transformers for Image Classification in Class Embedding Space | Unknown | N/A | |
| Regularity as Intrinsic Reward for Free Play | Unknown | N/A | |
| Differentially Private Approximate Near Neighbor Counting in High Dimensions | Unknown | N/A | |
| Adaptive Algorithms for Relaxed Pareto Set Identification | Unknown | N/A | |
| Fitting trees to $\ell_1$-hyperbolic distances | Unknown | N/A | |
| Formalizing locality for normative synaptic plasticity models | Unknown | N/A | |
| Thin and deep Gaussian processes | Unknown | N/A | |
| Tempo Adaptation in Non-stationary Reinforcement Learning | Unknown | N/A | |
| Reconciling Competing Sampling Strategies of Network Embedding | Unknown | N/A | |
| Convergence of Actor-Critic with Multi-Layer Neural Networks | Unknown | N/A | |
| An Efficient Doubly-Robust Test for the Kernel Treatment Effect | Unknown | N/A | |
| Incentivizing Honesty among Competitors in Collaborative Learning and Optimization | Unknown | N/A | |
| From Trainable Negative Depth to Edge Heterophily in Graphs | Unknown | N/A | |
| Siamese Masked Autoencoders | Unknown | N/A | |
| PoET: A generative model of protein families as sequences-of-sequences | Unknown | N/A | |
| Conformal Prediction for Uncertainty-Aware Planning with Diffusion Dynamics Model | Unknown | N/A | |
| ConRad: Image Constrained Radiance Fields for 3D Generation from a Single Image | Unknown | N/A | |
| A Unified Fast Gradient Clipping Framework for DP-SGD | Unknown | N/A | |
| LART: Neural Correspondence Learning with Latent Regularization Transformer for 3D Motion Transfer | Unknown | N/A | |
| Continual Learning for Instruction Following from Realtime Feedback | Unknown | N/A | |
| Neural Data Transformer 2: Multi-context Pretraining for Neural Spiking Activity | Unknown | N/A | |
| Winner-Take-All Column Row Sampling for Memory Efficient Adaptation of Language Model | Unknown | N/A | |
| Neural approximation of Wasserstein distance via a universal architecture for symmetric and factorwise group invariant functions | Unknown | N/A | |
| Hyper-HMM: aligning human brains and semantic features in a common latent event space | Unknown | N/A | |
| Learning better with Dale’s Law: A Spectral Perspective | Unknown | N/A | |
| Long Sequence Hopfield Memory | Unknown | N/A | |
| Minimum norm interpolation by perceptra: Explicit regularization and implicit bias | Unknown | N/A | |
| Exposing flaws of generative model evaluation metrics and their unfair treatment of diffusion models | Unknown | N/A | |
| Unified Lower Bounds for Interactive High-dimensional Estimation under Information Constraints | Unknown | N/A | |
| On the Robustness of Mechanism Design under Total Variation Distance | Unknown | N/A | |
| Complexity of Derivative-Free Policy Optimization for Structured $\mathcal{H}_\infty$ Control | Unknown | N/A | |
| Asynchronous Proportional Response Dynamics: Convergence in Markets with Adversarial Scheduling | Unknown | N/A | |
| Learning via Wasserstein-Based High Probability Generalisation Bounds | Unknown | N/A | |
| Swarm Reinforcement Learning for Adaptive Mesh Refinement | Unknown | N/A | |
| Taking the neural sampling code very seriously: A data-driven approach for evaluating generative models of the visual system | Unknown | N/A | |
| Online Pricing for Multi-User Multi-Item Markets | Unknown | N/A | |
| Fixing the NTK: From Neural Network Linearizations to Exact Convex Programs | Unknown | N/A | |
| Systematic Visual Reasoning through Object-Centric Relational Abstraction | Unknown | N/A | |
| On the Statistical Consistency of Risk-Sensitive Bayesian Decision-Making | Unknown | N/A | |
| TACO: Temporal Latent Action-Driven Contrastive Loss for Visual Reinforcement Learning | Unknown | N/A | |
| Break It Down: Evidence for Structural Compositionality in Neural Networks | Unknown | N/A | |
| Latent Diffusion for Language Generation | Unknown | N/A | |
| Causal Fairness for Outcome Control | Unknown | N/A | |
| Privacy Amplification via Compression: Achieving the Optimal Privacy-Accuracy-Communication Trade-off in Distributed Mean Estimation | Unknown | N/A | |
| Representation Equivalent Neural Operators: a Framework for Alias-free Operator Learning | Unknown | N/A | |
| Unsupervised Learning for Solving the Travelling Salesman Problem | Unknown | N/A | |
| PlanE: Representation Learning over Planar Graphs | Unknown | N/A | |
| LinGCN: Structural Linearized Graph Convolutional Network for Homomorphically Encrypted Inference | Unknown | N/A | |
| Collapsed Inference for Bayesian Deep Learning | Unknown | N/A | |
| RegBN: Batch Normalization of Multimodal Data with Regularization | Unknown | N/A | |
| Policy Optimization for Continuous Reinforcement Learning | Unknown | N/A | |
| Video-Mined Task Graphs for Keystep Recognition in Instructional Videos | Unknown | N/A | |
| Algorithm Selection for Deep Active Learning with Imbalanced Datasets | Unknown | N/A | |
| Towards Understanding the Dynamics of Gaussian-Stein Variational Gradient Descent | Unknown | N/A | |
| Loss Dynamics of Temporal Difference Reinforcement Learning | Unknown | N/A | |
| Minimum-Risk Recalibration of Classifiers | Unknown | N/A | |
| When Do Graph Neural Networks Help with Node Classification? Investigating the Homophily Principle on Node Distinguishability | Unknown | N/A | |
| Active Observing in Continuous-time Control | Unknown | N/A | |
| SwiftSage: A Generative Agent with Fast and Slow Thinking for Complex Interactive Tasks | Unknown | N/A | |
| Scaling Open-Vocabulary Object Detection | Unknown | N/A | |
| Neural Image Compression: Generalization, Robustness, and Spectral Biases | Unknown | N/A | |
| Three-Way Trade-Off in Multi-Objective Learning: Optimization, Generalization and Conflict-Avoidance | Unknown | N/A | |
| How Does Adaptive Optimization Impact Local Neural Network Geometry? | Unknown | N/A | |
| In Defense of Softmax Parametrization for Calibrated and Consistent Learning to Defer | Unknown | N/A | |
| Streaming Algorithms and Lower Bounds for Estimating Correlation Clustering Cost | Unknown | N/A | |
| Gaussian Differential Privacy on Riemannian Manifolds | Unknown | N/A | |
| Deep Gaussian Markov Random Fields for Graph-Structured Dynamical Systems | Unknown | N/A | |
| Auditing for Human Expertise | Unknown | N/A | |
| Norm-based Generalization Bounds for Sparse Neural Networks | Unknown | N/A | |
| Replicable Reinforcement Learning | Unknown | N/A | |
| CorresNeRF: Image Correspondence Priors for Neural Radiance Fields | Unknown | N/A | |
| Stability of Random Forests and Coverage of Random-Forest Prediction Intervals | Unknown | N/A | |
| CS4ML: A general framework for active learning with arbitrary data based on Christoffel functions | Unknown | N/A | |
| Curriculum Learning for Graph Neural Networks: Which Edges Should We Learn First | Unknown | N/A | |
| On the Importance of Exploration for Generalization in Reinforcement Learning | Unknown | N/A | |
| 3D-LLM: Injecting the 3D World into Large Language Models | Unknown | N/A | |
| A Unified Approach to Count-Based Weakly Supervised Learning | Unknown | N/A | |
| On Single-Index Models beyond Gaussian Data | Unknown | N/A | |
| Comparing Apples to Oranges: Learning Similarity Functions for Data Produced by Different Distributions | Unknown | N/A | |
| Accelerating Motion Planning via Optimal Transport | Unknown | N/A | |
| A General Framework for Robust G-Invariance in G-Equivariant Networks | Unknown | N/A | |
| Transformers are uninterpretable with myopic methods: a case study with bounded Dyck grammars | Unknown | N/A | |
| Every Parameter Matters: Ensuring the Convergence of Federated Learning with Dynamic Heterogeneous Models Reduction | Unknown | N/A | |
| EgoDistill: Egocentric Head Motion Distillation for Efficient Video Understanding | Unknown | N/A | |
| Distributionally Robust Linear Quadratic Control | Unknown | N/A | |
| Sharpness Minimization Algorithms Do Not Only Minimize Sharpness To Achieve Better Generalization | Unknown | N/A | |
| A Neural Collapse Perspective on Feature Evolution in Graph Neural Networks | Unknown | N/A | |
| Tree Variational Autoencoders | Unknown | N/A | |
| FlowCam: Training Generalizable 3D Radiance Fields without Camera Poses via Pixel-Aligned Scene Flow | Unknown | N/A | |
| EgoEnv: Human-centric environment representations from egocentric video | Unknown | N/A | |
| Debiasing Conditional Stochastic Optimization | Unknown | N/A | |
| A U-turn on Double Descent: Rethinking Parameter Counting in Statistical Learning | Unknown | N/A | |
| Fantastic Weights and How to Find Them: Where to Prune in Dynamic Sparse Training | Unknown | N/A | |
| Algorithmic Regularization in Tensor Optimization: Towards a Lifted Approach in Matrix Sensing | Unknown | N/A | |
| On Computing Pairwise Statistics with Local Differential Privacy | Unknown | N/A | |
| Brain encoding models based on multimodal transformers can transfer across language and vision | Unknown | N/A | |
| Risk-Averse Model Uncertainty for Distributionally Robust Safe Reinforcement Learning | Unknown | N/A | |
| Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models | Unknown | N/A | |
| Honesty Is the Best Policy: Defining and Mitigating AI Deception | Unknown | N/A | |
| Unbalanced Low-rank Optimal Transport Solvers | Unknown | N/A | |
| Auditing Fairness by Betting | Unknown | N/A | |
| Learning from Active Human Involvement through Proxy Value Propagation | Unknown | N/A | |
| Towards Last-layer Retraining for Group Robustness with Fewer Annotations | Unknown | N/A | |
| Differentiable Sampling of Categorical Distributions Using the CatLog-Derivative Trick | Unknown | N/A | |
| Importance Weighted Actor-Critic for Optimal Conservative Offline Reinforcement Learning | Unknown | N/A | |
| Meet in the Middle: A New Pre-training Paradigm | Unknown | N/A | |
| Adversarial Examples Are Not Real Features | Unknown | N/A | |
| Principle-Driven Self-Alignment of Language Models from Scratch with Minimal Human Supervision | Unknown | N/A | |
| A graphon-signal analysis of graph neural networks | Unknown | N/A | |
| Diffusion with Forward Models: Solving Stochastic Inverse Problems Without Direct Supervision | Unknown | N/A | |
| Generative Neural Fields by Mixtures of Neural Implicit Functions | Unknown | N/A | |
| DIFUSCO: Graph-based Diffusion Solvers for Combinatorial Optimization | Unknown | N/A | |
| Real-Time Motion Prediction via Heterogeneous Polyline Transformer with Relative Pose Encoding | Unknown | N/A | |
| SyncTREE: Fast Timing Analysis for Integrated Circuit Design through a Physics-informed Tree-based Graph Neural Network | Unknown | N/A | |
| On the Importance of Feature Separability in Predicting Out-Of-Distribution Error | Unknown | N/A | |
| Monte Carlo Tree Search with Boltzmann Exploration | Unknown | N/A | |
| CycleNet: Rethinking Cycle Consistency in Text-Guided Diffusion for Image Manipulation | Unknown | N/A | |
| Supply-Side Equilibria in Recommender Systems | Unknown | N/A | |
| PETAL: Physics Emulation Through Averaged Linearizations for Solving Inverse Problems | Unknown | N/A | |
| Robust Contrastive Language-Image Pretraining against Data Poisoning and Backdoor Attacks | Unknown | N/A | |
| CBD: A Certified Backdoor Detector Based on Local Dominant Probability | Unknown | N/A | |
| Amortized Reparametrization: Efficient and Scalable Variational Inference for Latent SDEs | Unknown | N/A | |
| Differentiable Clustering with Perturbed Spanning Forests | Unknown | N/A | |
| Alternating Updates for Efficient Transformers | Unknown | N/A | |
| Improved Bayes Risk Can Yield Reduced Social Welfare Under Competition | Unknown | N/A | |
| On the Exploration of Local Significant Differences For Two-Sample Test | Unknown | N/A | |
| Meta-Learning with Neural Bandit Scheduler | Unknown | N/A | |
| Learning Causal Models under Independent Changes | Unknown | N/A | |
| Neural Combinatorial Optimization with Heavy Decoder: Toward Large Scale Generalization | Unknown | N/A | |
| The Clock and the Pizza: Two Stories in Mechanistic Explanation of Neural Networks | Unknown | N/A | |
| Pengi: An Audio Language Model for Audio Tasks | Unknown | N/A | |
| Beyond MLE: Convex Learning for Text Generation | Unknown | N/A | |
| Online Control for Meta-optimization | Unknown | N/A | |
| Cognitive Steering in Deep Neural Networks via Long-Range Modulatory Feedback Connections | Unknown | N/A | |
| Moment Matching Denoising Gibbs Sampling | Unknown | N/A | |
| Optimal Excess Risk Bounds for Empirical Risk Minimization on $p$-Norm Linear Regression | Unknown | N/A | |
| Conditional score-based diffusion models for Bayesian inference in infinite dimensions | Unknown | N/A | |
| Topological Parallax: A Geometric Specification for Deep Perception Models | Unknown | N/A | |
| TIES-Merging: Resolving Interference When Merging Models | Unknown | N/A | |
| Joint processing of linguistic properties in brains and language models | Unknown | N/A | |
| Self-Correcting Bayesian Optimization through Bayesian Active Learning | Unknown | N/A | |
| Sketchy: Memory-efficient Adaptive Regularization with Frequent Directions | Unknown | N/A | |
| GEQ: Gaussian Kernel Inspired Equilibrium Models | Unknown | N/A | |
| Calibrated Stackelberg Games: Learning Optimal Commitments Against Calibrated Agents | Unknown | N/A | |
| $SE(3)$ Equivariant Convolution and Transformer in Ray Space | Unknown | N/A | |
| Oracle Complexity of Single-Loop Switching Subgradient Methods for Non-Smooth Weakly Convex Functional Constrained Optimization | Unknown | N/A | |
| Creating Multi-Level Skill Hierarchies in Reinforcement Learning | Unknown | N/A | |
| Asynchrony-Robust Collaborative Perception via Bird's Eye View Flow | Unknown | N/A | |
| Rethinking Semi-Supervised Imbalanced Node Classification from Bias-Variance Decomposition | Unknown | N/A | |
| Speculative Decoding with Big Little Decoder | Unknown | N/A | |
| ImageReward: Learning and Evaluating Human Preferences for Text-to-Image Generation | Unknown | N/A | |
| Stein $\Pi$-Importance Sampling | Unknown | N/A | |
| Unexpected Improvements to Expected Improvement for Bayesian Optimization | Unknown | N/A | |
| Single-Call Stochastic Extragradient Methods for Structured Non-monotone Variational Inequalities: Improved Analysis under Weaker Conditions | Unknown | N/A | |
| A Unified Detection Framework for Inference-Stage Backdoor Defenses | Unknown | N/A | |
| Finding Local Minima Efficiently in Decentralized Optimization | Unknown | N/A | |
| Demystifying Structural Disparity in Graph Neural Networks: Can One Size Fit All? | Unknown | N/A | |
| Jigsaw: Learning to Assemble Multiple Fractured Objects | Unknown | N/A | |
| Sparsity-Preserving Differentially Private Training of Large Embedding Models | Unknown | N/A | |
| On the choice of Perception Loss Function for Learned Video Compression | Unknown | N/A | |
| PrObeD: Proactive Object Detection Wrapper | Unknown | N/A | |
| Refined Mechanism Design for Approximately Structured Priors via Active Regression | Unknown | N/A | |
| CAT-Walk: Inductive Hypergraph Learning via Set Walks | Unknown | N/A | |
| A Single-Loop Accelerated Extra-Gradient Difference Algorithm with Improved Complexity Bounds for Constrained Minimax Optimization | Unknown | N/A | |
| A Tale of Two Features: Stable Diffusion Complements DINO for Zero-Shot Semantic Correspondence | Unknown | N/A | |
| Block Coordinate Plug-and-Play Methods for Blind Inverse Problems | Unknown | N/A | |
| Optimal Treatment Allocation for Efficient Policy Evaluation in Sequential Decision Making | Unknown | N/A | |
| Self-Consistent Velocity Matching of Probability Flows | Unknown | N/A | |
| An Inverse Scaling Law for CLIP Training | Unknown | N/A | |
| Neural Circuits for Fast Poisson Compressed Sensing in the Olfactory Bulb | Unknown | N/A | |
| Equivariant Flow Matching with Hybrid Probability Transport for 3D Molecule Generation | Unknown | N/A | |
| Revisiting Scalarization in Multi-Task Learning: A Theoretical Perspective | Unknown | N/A | |
| On Proper Learnability between Average- and Worst-case Robustness | Unknown | N/A | |
| Should I Stop or Should I Go: Early Stopping with Heterogeneous Populations | Unknown | N/A | |
| On the Learnability of Multilabel Ranking | Unknown | N/A | |
| Score-based Generative Modeling through Stochastic Evolution Equations in Hilbert Spaces | Unknown | N/A | |
| Global Identifiability of $\ell_1$-based Dictionary Learning via Matrix Volume Optimization | Unknown | N/A | |
| Worst-case Performance of Popular Approximate Nearest Neighbor Search Implementations: Guarantees and Limitations | Unknown | N/A | |
| Learning a 1-layer conditional generative model in total variation | Unknown | N/A | |
| TaskMet: Task-driven Metric Learning for Model Learning | Unknown | N/A | |
| Koopa: Learning Non-stationary Time Series Dynamics with Koopman Predictors | Unknown | N/A | |
| LogSpecT: Feasible Graph Learning Model from Stationary Signals with Recovery Guarantees | Unknown | N/A | |
| Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning | Unknown | N/A | |
| Kissing to Find a Match: Efficient Low-Rank Permutation Representation | Unknown | N/A | |
| Differentially Private Image Classification by Learning Priors from Random Processes | Unknown | N/A | |
| Optimal Regret Is Achievable with Bounded Approximate Inference Error: An Enhanced Bayesian Upper Confidence Bound Framework | Unknown | N/A | |
| CARE: Modeling Interacting Dynamics Under Temporal Environmental Variation | Unknown | N/A | |
| Anytime-Competitive Reinforcement Learning with Policy Prior | Unknown | N/A | |
| Zero-shot Visual Relation Detection via Composite Visual Cues from Large Language Models | Unknown | N/A | |
| Efficient Algorithms for Generalized Linear Bandits with Heavy-tailed Rewards | Unknown | N/A | |
| Revisiting Area Convexity: Faster Box-Simplex Games and Spectrahedral Generalizations | Unknown | N/A | |
| Exponential Lower Bounds for Fictitious Play in Potential Games | Unknown | N/A | |
| NICE: NoIse-modulated Consistency rEgularization for Data-Efficient GANs | Unknown | N/A | |
| NeRF Revisited: Fixing Quadrature Instability in Volume Rendering | Unknown | N/A | |
| Directed Cyclic Graph for Causal Discovery from Multivariate Functional Data | Unknown | N/A | |
| First Order Methods with Markovian Noise: from Acceleration to Variational Inequalities | Unknown | N/A | |
| Data-Informed Geometric Space Selection | Unknown | N/A | |
| PAC-Bayesian Spectrally-Normalized Bounds for Adversarially Robust Generalization | Unknown | N/A | |
| Transformed Low-Rank Parameterization Can Help Robust Generalization for Tensor Neural Networks | Unknown | N/A | |
| Augmentation-Free Dense Contrastive Knowledge Distillation for Efficient Semantic Segmentation | Unknown | N/A | |
| Extraction and Recovery of Spatio-Temporal Structure in Latent Dynamics Alignment with Diffusion Models | Unknown | N/A | |
| Budgeting Counterfactual for Offline RL | Unknown | N/A | |
| T2T: From Distribution Learning in Training to Gradient Search in Testing for Combinatorial Optimization | Unknown | N/A | |
| Towards Free Data Selection with General-Purpose Models | Unknown | N/A | |
| Hierarchical Vector Quantized Transformer for Multi-class Unsupervised Anomaly Detection | Unknown | N/A | |
| Generalizing Importance Weighting to A Universal Solver for Distribution Shift Problems | Unknown | N/A | |
| Provably (More) Sample-Efficient Offline RL with Options | Unknown | N/A | |
| RAPHAEL: Text-to-Image Generation via Large Mixture of Diffusion Paths | Unknown | N/A | |
| Survival Permanental Processes for Survival Analysis with Time-Varying Covariates | Unknown | N/A | |
| Communication-Efficient Federated Bilevel Optimization with Global and Local Lower Level Problems | Unknown | N/A | |
| Dynamo-Depth: Fixing Unsupervised Depth Estimation for Dynamical Scenes | Unknown | N/A | |
| Resolving the Tug-of-War: A Separation of Communication and Learning in Federated Learning | Unknown | N/A | |
| Implicit Bias of (Stochastic) Gradient Descent for Rank-1 Linear Neural Network | Unknown | N/A | |
| Belief Projection-Based Reinforcement Learning for Environments with Delayed Feedback | Unknown | N/A | |
| Maximum State Entropy Exploration using Predecessor and Successor Representations | Unknown | N/A | |
| E2PNet: Event to Point Cloud Registration with Spatio-Temporal Representation Learning | Unknown | N/A | |
| Architecture Matters: Uncovering Implicit Mechanisms in Graph Contrastive Learning | Unknown | N/A | |
| GPT-ST: Generative Pre-Training of Spatio-Temporal Graph Neural Networks | Unknown | N/A | |
| Convolutional State Space Models for Long-Range Spatiotemporal Modeling | Unknown | N/A | |
| A Reduction-based Framework for Sequential Decision Making with Delayed Feedback | Unknown | N/A | |
| Look Ma, No Hands! Agent-Environment Factorization of Egocentric Videos | Unknown | N/A | |
| Is Learning in Games Good for the Learners? | Unknown | N/A | |
| Effective Robustness against Natural Distribution Shifts for Models with Different Training Data | Unknown | N/A | |
| Content-based Unrestricted Adversarial Attack | Unknown | N/A | |
| Adapting Fairness Interventions to Missing Values | Unknown | N/A | |
| Intriguing Properties of Quantization at Scale | Unknown | N/A | |
| ReSync: Riemannian Subgradient-based Robust Rotation Synchronization | Unknown | N/A | |
| Squared Neural Families: A New Class of Tractable Density Models | Unknown | N/A | |
| Towards Symmetry-Aware Generation of Periodic Materials | Unknown | N/A | |
| GraphMP: Graph Neural Network-based Motion Planning with Efficient Graph Search | Unknown | N/A | |
| ResShift: Efficient Diffusion Model for Image Super-resolution by Residual Shifting | Unknown | N/A | |
| Beyond Unimodal: Generalising Neural Processes for Multimodal Uncertainty Estimation | Unknown | N/A | |
| Structure from Duplicates: Neural Inverse Graphics from a Pile of Objects | Unknown | N/A | |
| A General Theory of Correct, Incorrect, and Extrinsic Equivariance | Unknown | N/A | |
| Multi-Fidelity Multi-Armed Bandits Revisited | Unknown | N/A | |
| Rethinking Conditional Diffusion Sampling with Progressive Guidance | Unknown | N/A | |
| Deep Non-line-of-sight Imaging from Under-scanning Measurements | Unknown | N/A | |
| Model Shapley: Equitable Model Valuation with Black-box Access | Unknown | N/A | |
| Black-box Backdoor Defense via Zero-shot Image Purification | Unknown | N/A | |
| Many-body Approximation for Non-negative Tensors | Unknown | N/A | |
| InfoPrompt: Information-Theoretic Soft Prompt Tuning for Natural Language Understanding | Unknown | N/A | |
| Preconditioning Matters: Fast Global Convergence of Non-convex Matrix Factorization via Scaled Gradient Descent | Unknown | N/A | |
| Beyond Geometry: Comparing the Temporal Structure of Computation in Neural Circuits with Dynamical Similarity Analysis | Unknown | N/A | |
| Efficient Sampling of Stochastic Differential Equations with Positive Semi-Definite Models | Unknown | N/A | |
| Invariant Learning via Probability of Sufficient and Necessary Causes | Unknown | N/A | |
| Fractal Landscapes in Policy Optimization | Unknown | N/A | |
| Multi-Agent Meta-Reinforcement Learning: Sharper Convergence Rates with Task Similarity | Unknown | N/A | |
| SnapFusion: Text-to-Image Diffusion Model on Mobile Devices within Two Seconds | Unknown | N/A | |
| Geometric Analysis of Matrix Sensing over Graphs | Unknown | N/A | |
| On Dynamic Programming Decompositions of Static Risk Measures in Markov Decision Processes | Unknown | N/A | |
| HASSOD: Hierarchical Adaptive Self-Supervised Object Detection | Unknown | N/A | |
| Byzantine-Tolerant Methods for Distributed Variational Inequalities | Unknown | N/A | |
| Practical Contextual Bandits with Feedback Graphs | Unknown | N/A | |
| Robust Learning for Smoothed Online Convex Optimization with Feedback Delay | Unknown | N/A | |
| Disentanglement via Latent Quantization | Unknown | N/A | |
| Do SSL Models Have Déjà Vu? A Case of Unintended Memorization in Self-supervised Learning | Unknown | N/A | |
| Smoothed Analysis of Sequential Probability Assignment | Unknown | N/A | |
| Solving a Class of Non-Convex Minimax Optimization in Federated Learning | Unknown | N/A | |
| DinoSR: Self-Distillation and Online Clustering for Self-supervised Speech Representation Learning | Unknown | N/A | |
| AdaVAE: Bayesian Structural Adaptation for Variational Autoencoders | Unknown | N/A | |
| Causal discovery from observational and interventional data across multiple environments | Unknown | N/A | |
| Neural Lyapunov Control for Discrete-Time Systems | Unknown | N/A | |
| Dream the Impossible: Outlier Imagination with Diffusion Models | Unknown | N/A | |
| Neural Algorithmic Reasoning Without Intermediate Supervision | Unknown | N/A | |
| Energy-Efficient Scheduling with Predictions | Unknown | N/A | |
| DiffVL: Scaling Up Soft Body Manipulation using Vision-Language Driven Differentiable Physics | Unknown | N/A | |
| Does Visual Pretraining Help End-to-End Reasoning? | Unknown | N/A | |
| CROMA: Remote Sensing Representations with Contrastive Radar-Optical Masked Autoencoders | Unknown | N/A | |
| For SALE: State-Action Representation Learning for Deep Reinforcement Learning | Unknown | N/A | |
| AutoGO: Automated Computation Graph Optimization for Neural Network Evolution | Unknown | N/A | |
| Primal-Attention: Self-attention through Asymmetric Kernel SVD in Primal Representation | Unknown | N/A | |
| Time-uniform confidence bands for the CDF under nonstationarity | Unknown | N/A | |
| On the spectral bias of two-layer linear networks | Unknown | N/A | |
| Learning to Configure Separators in Branch-and-Cut | Unknown | N/A | |
| Embedding Space Interpolation Beyond Mini-Batch, Beyond Pairs and Beyond Examples | Unknown | N/A | |
| Provable convergence guarantees for black-box variational inference | Unknown | N/A | |
| VaRT: Variational Regression Trees | Unknown | N/A | |
| Unlimiformer: Long-Range Transformers with Unlimited Length Input | Unknown | N/A | |
| Inverse Reinforcement Learning with the Average Reward Criterion | Unknown | N/A | |
| On the Complexity of Differentially Private Best-Arm Identification with Fixed Confidence | Unknown | N/A | |
| Asymmetric Certified Robustness via Feature-Convex Neural Networks | Unknown | N/A | |
| Learning Exponential Families from Truncated Samples | Unknown | N/A | |
| Sounding Bodies: Modeling 3D Spatial Sound of Humans Using Body Pose and Audio | Unknown | N/A | |
| Extending the Design Space of Graph Neural Networks by Rethinking Folklore Weisfeiler-Lehman | Unknown | N/A | |
| MAG-GNN: Reinforcement Learning Boosted Graph Neural Network | Unknown | N/A | |
| Geodesic Multi-Modal Mixup for Robust Fine-Tuning | Unknown | N/A | |
| Conditional independence testing under misspecified inductive biases | Unknown | N/A | |
| What’s Left? Concept Grounding with Logic-Enhanced Foundation Models | Unknown | N/A | |
| Revisiting the Minimalist Approach to Offline Reinforcement Learning | Unknown | N/A | |
| Simultaneous embedding of multiple attractor manifolds in a recurrent neural network using constrained gradient optimization | Unknown | N/A | |
| Coupled Reconstruction of Cortical Surfaces by Diffeomorphic Mesh Deformation | Unknown | N/A | |
| Dense-Exponential Random Features: Sharp Positive Estimators of the Gaussian Kernel | Unknown | N/A | |
| Differentially Private Statistical Inference through $\beta$-Divergence One Posterior Sampling | Unknown | N/A | |
| A Rigorous Link between Deep Ensembles and (Variational) Bayesian Methods | Unknown | N/A | |
| C-Disentanglement: Discovering Causally-Independent Generative Factors under an Inductive Bias of Confounder | Unknown | N/A | |
| HotBEV: Hardware-oriented Transformer-based Multi-View 3D Detector for BEV Perception | Unknown | N/A | |
| Implicit Bias of Gradient Descent for Logistic Regression at the Edge of Stability | Unknown | N/A | |
| Fair Allocation of Indivisible Chores: Beyond Additive Costs | Unknown | N/A | |
| Supported Value Regularization for Offline Reinforcement Learning | Unknown | N/A | |
| Sample Complexity of Forecast Aggregation | Unknown | N/A | |
| Learning Functional Transduction | Unknown | N/A | |
| Label-Only Model Inversion Attacks via Knowledge Transfer | Unknown | N/A | |
| PID-Inspired Inductive Biases for Deep Reinforcement Learning in Partially Observable Control Tasks | Unknown | N/A | |
| Random Cuts are Optimal for Explainable k-Medians | Unknown | N/A | |
| FGPrompt: Fine-grained Goal Prompting for Image-goal Navigation | Unknown | N/A | |
| Fine-grained Expressivity of Graph Neural Networks | Unknown | N/A | |
| Neural Lad: A Neural Latent Dynamics Framework for Times Series Modeling | Unknown | N/A | |
| Data Minimization at Inference Time | Unknown | N/A | |
| Distribution-Free Model-Agnostic Regression Calibration via Nonparametric Methods | Unknown | N/A | |
| Near-Linear Time Algorithm for the Chamfer Distance | Unknown | N/A | |
| Language Model Tokenizers Introduce Unfairness Between Languages | Unknown | N/A | |
| History Filtering in Imperfect Information Games: Algorithms and Complexity | Unknown | N/A | |
| Adversarial Training for Graph Neural Networks: Pitfalls, Solutions, and New Directions | Unknown | N/A | |
| Fast Trainable Projection for Robust Fine-tuning | Unknown | N/A | |
| MultiFusion: Fusing Pre-Trained Models for Multi-Lingual, Multi-Modal Image Generation | Unknown | N/A | |
| Entropic Neural Optimal Transport via Diffusion Processes | Unknown | N/A | |
| Semi-Supervised Domain Generalization with Known and Unknown Classes | Unknown | N/A | |
| Representation Learning via Consistent Assignment of Views over Random Partitions | Unknown | N/A | |
| Test-time Training for Matching-based Video Object Segmentation | Unknown | N/A | |
| Data Augmentations for Improved (Large) Language Model Generalization | Unknown | N/A | |
| Revisiting Implicit Differentiation for Learning Problems in Optimal Control | Unknown | N/A | |
| Learning Invariant Molecular Representation in Latent Discrete Space | Unknown | N/A | |
| High-dimensional Asymptotics of Denoising Autoencoders | Unknown | N/A | |
| Tame a Wild Camera: In-the-Wild Monocular Camera Calibration | Unknown | N/A | |
| 3D molecule generation by denoising voxel grids | Unknown | N/A | |
| ProPILE: Probing Privacy Leakage in Large Language Models | Unknown | N/A | |
| Described Object Detection: Liberating Object Detection with Flexible Expressions | Unknown | N/A | |
| Kernelized Cumulants: Beyond Kernel Mean Embeddings | Unknown | N/A | |
| Learning Re-sampling Methods with Parameter Attribution for Image Super-resolution | Unknown | N/A | |
| Learning from Visual Observation via Offline Pretrained State-to-Go Transformer | Unknown | N/A | |
| DiffPack: A Torsional Diffusion Model for Autoregressive Protein Side-Chain Packing | Unknown | N/A | |
| Inserting Anybody in Diffusion Models via Celeb Basis | Unknown | N/A | |
| Let the Flows Tell: Solving Graph Combinatorial Problems with GFlowNets | Unknown | N/A | |
| DreamHuman: Animatable 3D Avatars from Text | Unknown | N/A | |
| Trade-off Between Efficiency and Consistency for Removal-based Explanations | Unknown | N/A | |
| Knowledge-Augmented Reasoning Distillation for Small Language Models in Knowledge-Intensive Tasks | Unknown | N/A | |
| Fairness Aware Counterfactuals for Subgroups | Unknown | N/A | |
| SyncDiffusion: Coherent Montage via Synchronized Joint Diffusions | Unknown | N/A | |
| Describe, Explain, Plan and Select: Interactive Planning with LLMs Enables Open-World Multi-Task Agents | Unknown | N/A | |
| CrossGNN: Confronting Noisy Multivariate Time Series Via Cross Interaction Refinement | Unknown | N/A | |
| New Complexity-Theoretic Frontiers of Tractability for Neural Network Training | Unknown | N/A | |
| Saddle-to-Saddle Dynamics in Diagonal Linear Networks | Unknown | N/A | |
| Covariance-adaptive best arm identification | Unknown | N/A | |
| Metis: Understanding and Enhancing In-Network Regular Expressions | Unknown | N/A | |
| Interpretable and Explainable Logical Policies via Neurally Guided Symbolic Abstraction | Unknown | N/A | |
| Diffusion-SS3D: Diffusion Model for Semi-supervised 3D Object Detection | Unknown | N/A | |
| Fed-CO$_{2}$: Cooperation of Online and Offline Models for Severe Data Heterogeneity in Federated Learning | Unknown | N/A | |
| SOAR: Improved Indexing for Approximate Nearest Neighbor Search | Unknown | N/A | |
| CAMEL: Communicative Agents for "Mind" Exploration of Large Language Model Society | Unknown | N/A | |
| CommonScenes: Generating Commonsense 3D Indoor Scenes with Scene Graph Diffusion | Unknown | N/A | |
| Fast Model DeBias with Machine Unlearning | Unknown | N/A | |
| “Why Not Looking backward?” A Robust Two-Step Method to Automatically Terminate Bayesian Optimization | Unknown | N/A | |
| Hypervolume Maximization: A Geometric View of Pareto Set Learning | Unknown | N/A | |
| RH-BrainFS: Regional Heterogeneous Multimodal Brain Networks Fusion Strategy | Unknown | N/A | |
| Norm-guided latent space exploration for text-to-image generation | Unknown | N/A | |
| On skip connections and normalisation layers in deep optimisation | Unknown | N/A | |
| Language Is Not All You Need: Aligning Perception with Language Models | Unknown | N/A | |
| Robust Knowledge Transfer in Tiered Reinforcement Learning | Unknown | N/A | |
| Cross-modal Prompts: Adapting Large Pre-trained Models for Audio-Visual Downstream Tasks | Unknown | N/A | |
| On the Pareto Front of Multilingual Neural Machine Translation | Unknown | N/A | |
| DDF-HO: Hand-Held Object Reconstruction via Conditional Directed Distance Field | Unknown | N/A | |
| TexQ: Zero-shot Network Quantization with Texture Feature Distribution Calibration | Unknown | N/A | |
| MCUFormer: Deploying Vision Transformers on Microcontrollers with Limited Memory | Unknown | N/A | |
| Train Hard, Fight Easy: Robust Meta Reinforcement Learning | Unknown | N/A | |
| Greedy Poisson Rejection Sampling | Unknown | N/A | |
| PanoGRF: Generalizable Spherical Radiance Fields for Wide-baseline Panoramas | Unknown | N/A | |
| Time-Independent Information-Theoretic Generalization Bounds for SGLD | Unknown | N/A | |
| Toward Re-Identifying Any Animal | Unknown | N/A | |
| Wasserstein Gradient Flows for Optimizing Gaussian Mixture Policies | Unknown | N/A | |
| AGD: an Auto-switchable Optimizer using Stepwise Gradient Difference for Preconditioning Matrix | Unknown | N/A | |
| Seeing is not Believing: Robust Reinforcement Learning against Spurious Correlation | Unknown | N/A | |
| Sample Complexity for Quadratic Bandits: Hessian Dependent Bounds and Optimal Algorithms | Unknown | N/A | |
| Uncertainty-Aware Instance Reweighting for Off-Policy Learning | Unknown | N/A | |
| Adversarially Robust Distributed Count Tracking via Partial Differential Privacy | Unknown | N/A | |
| Online Performative Gradient Descent for Learning Nash Equilibria in Decision-Dependent Games | Unknown | N/A | |
| Functional Renyi Differential Privacy for Generative Modeling | Unknown | N/A | |
| Simple and Asymmetric Graph Contrastive Learning without Augmentations | Unknown | N/A | |
| Efficient Equivariant Transfer Learning from Pretrained Models | Unknown | N/A | |
| Autonomous Capability Assessment of Sequential Decision-Making Systems in Stochastic Settings | Unknown | N/A | |
| When Does Optimizing a Proper Loss Yield Calibration? | Unknown | N/A | |
| Partial Multi-Label Learning with Probabilistic Graphical Disambiguation | Unknown | N/A | |
| Learning to Group Auxiliary Datasets for Molecule | Unknown | N/A | |
| Scaling Up Differentially Private LASSO Regularized Logistic Regression via Faster Frank-Wolfe Iterations | Unknown | N/A | |
| On the Relationship Between Relevance and Conflict in Online Social Link Recommendations | Unknown | N/A | |
| QuadAttac$K$: A Quadratic Programming Approach to Learning Ordered Top-$K$ Adversarial Attacks | Unknown | N/A | |
| UltraRE: Enhancing RecEraser for Recommendation Unlearning via Error Decomposition | Unknown | N/A | |
| Additive Decoders for Latent Variables Identification and Cartesian-Product Extrapolation | Unknown | N/A | |
| Towards Generic Semi-Supervised Framework for Volumetric Medical Image Segmentation | Unknown | N/A | |
| Online Nonstochastic Model-Free Reinforcement Learning | Unknown | N/A | |
| Computing Approximate $\ell_p$ Sensitivities | Unknown | N/A | |
| Loss Decoupling for Task-Agnostic Continual Learning | Unknown | N/A | |
| Modeling Dynamics over Meshes with Gauge Equivariant Nonlinear Message Passing | Unknown | N/A | |
| Quantifying the Cost of Learning in Queueing Systems | Unknown | N/A | |
| Self-Supervised Visual Acoustic Matching | Unknown | N/A | |
| DynGFN: Towards Bayesian Inference of Gene Regulatory Networks with GFlowNets | Unknown | N/A | |
| Provably Efficient Algorithm for Nonstationary Low-Rank MDPs | Unknown | N/A | |
| Practical and Asymptotically Exact Conditional Sampling in Diffusion Models | Unknown | N/A | |
| OneNet: Enhancing Time Series Forecasting Models under Concept Drift by Online Ensembling | Unknown | N/A | |
| Keypoint-Augmented Self-Supervised Learning for Medical Image Segmentation with Limited Annotation | Unknown | N/A | |
| Behavior Alignment via Reward Function Optimization | Unknown | N/A | |
| Self-Supervised Motion Magnification by Backpropagating Through Optical Flow | Unknown | N/A | |
| Harnessing the power of choices in decision tree learning | Unknown | N/A | |
| Trust Region-Based Safe Distributional Reinforcement Learning for Multiple Constraints | Unknown | N/A | |
| Cross-Episodic Curriculum for Transformer Agents | Unknown | N/A | |
| Hypothesis Selection with Memory Constraints | Unknown | N/A | |
| State2Explanation: Concept-Based Explanations to Benefit Agent Learning and User Understanding | Unknown | N/A | |
| Guiding The Last Layer in Federated Learning with Pre-Trained Models | Unknown | N/A | |
| Statistical Knowledge Assessment for Large Language Models | Unknown | N/A | |
| Training neural operators to preserve invariant measures of chaotic attractors | Unknown | N/A | |
| Projection Regret: Reducing Background Bias for Novelty Detection via Diffusion Models | Unknown | N/A | |
| When is Agnostic Reinforcement Learning Statistically Tractable? | Unknown | N/A | |
| Expressive probabilistic sampling in recurrent neural networks | Unknown | N/A | |
| Gaussian Process Probes (GPP) for Uncertainty-Aware Probing | Unknown | N/A | |
| General Munchausen Reinforcement Learning with Tsallis Kullback-Leibler Divergence | Unknown | N/A | |
| Distributionally Robust Skeleton Learning of Discrete Bayesian Networks | Unknown | N/A | |
| Cross-links Matter for Link Prediction: Rethinking the Debiased GNN from a Data Perspective | Unknown | N/A | |
| Reproducibility in Multiple Instance Learning: A Case For Algorithmic Unit Tests | Unknown | N/A | |
| FLSL: Feature-level Self-supervised Learning | Unknown | N/A | |
| Mechanism Design for Collaborative Normal Mean Estimation | Unknown | N/A | |
| Reward-agnostic Fine-tuning: Provable Statistical Benefits of Hybrid Reinforcement Learning | Unknown | N/A | |
| Uncovering the Hidden Dynamics of Video Self-supervised Learning under Distribution Shifts | Unknown | N/A | |
| The Grand Illusion: The Myth of Software Portability and Implications for ML Progress. | Unknown | N/A | |
| Kernel Quadrature with Randomly Pivoted Cholesky | Unknown | N/A | |
| Bayesian target optimisation for high-precision holographic optogenetics | Unknown | N/A | |
| Multi-task learning with summary statistics | Unknown | N/A | |
| StreamNet: Memory-Efficient Streaming Tiny Deep Learning Inference on the Microcontroller | Unknown | N/A | |
| TD Convergence: An Optimization Perspective | Unknown | N/A | |
| Probabilistic Invariant Learning with Randomized Linear Classifiers | Unknown | N/A | |
| Doubly Constrained Fair Clustering | Unknown | N/A | |
| Bayesian Risk-Averse Q-Learning with Streaming Observations | Unknown | N/A | |
| Block-State Transformers | Unknown | N/A | |
| Localized Symbolic Knowledge Distillation for Visual Commonsense Models | Unknown | N/A | |
| Collaboratively Learning Linear Models with Structured Missing Data | Unknown | N/A | |
| Bi-Level Offline Policy Optimization with Limited Exploration | Unknown | N/A | |
| Learning Nonparametric Latent Causal Graphs with Unknown Interventions | Unknown | N/A | |
| QuACK: Accelerating Gradient-Based Quantum Optimization with Koopman Operator Learning | Unknown | N/A | |
| Learning to Receive Help: Intervention-Aware Concept Embedding Models | Unknown | N/A | |
| Uncovering Meanings of Embeddings via Partial Orthogonality | Unknown | N/A | |
| Riemannian Projection-free Online Learning | Unknown | N/A | |
| Online robust non-stationary estimation | Unknown | N/A | |
| Scaling laws for language encoding models in fMRI | Unknown | N/A | |
| LIMA: Less Is More for Alignment | Unknown | N/A | |
| Gradient-Based Feature Learning under Structured Data | Unknown | N/A | |
| An Alternating Optimization Method for Bilevel Problems under the Polyak-Łojasiewicz Condition | Unknown | N/A | |
| OKRidge: Scalable Optimal k-Sparse Ridge Regression | Unknown | N/A | |
| Efficient Adversarial Attacks on Online Multi-agent Reinforcement Learning | Unknown | N/A | |
| Randomized Sparse Neural Galerkin Schemes for Solving Evolution Equations with Deep Networks | Unknown | N/A | |
| The s-value: evaluating stability with respect to distributional shifts | Unknown | N/A | |
| Generating Images with Multimodal Language Models | Unknown | N/A | |
| Successor-Predecessor Intrinsic Exploration | Unknown | N/A | |
| Blurred-Dilated Method for Adversarial Attacks | Unknown | N/A | |
| CADet: Fully Self-Supervised Out-Of-Distribution Detection With Contrastive Learning | Unknown | N/A | |
| Bypassing spike sorting: Density-based decoding using spike localization from dense multielectrode probes | Unknown | N/A | |
| Any-to-Any Generation via Composable Diffusion | Unknown | N/A | |
| OBJECT 3DIT: Language-guided 3D-aware Image Editing | Unknown | N/A | |
| Binarized Neural Machine Translation | Unknown | N/A | |
| A Competitive Algorithm for Agnostic Active Learning | Unknown | N/A | |
| Fine-Tuning Language Models with Just Forward Passes | Unknown | N/A | |
| Robust Concept Erasure via Kernelized Rate-Distortion Maximization | Unknown | N/A | |
| Double Auctions with Two-sided Bandit Feedback | Unknown | N/A | |
| Lower Bounds on Adaptive Sensing for Matrix Recovery | Unknown | N/A | |
| FedNAR: Federated Optimization with Normalized Annealing Regularization | Unknown | N/A | |
| End-To-End Latent Variational Diffusion Models for Inverse Problems in High Energy Physics | Unknown | N/A | |
| GeoCLIP: Clip-Inspired Alignment between Locations and Images for Effective Worldwide Geo-localization | Unknown | N/A | |
| Distributed Inference and Fine-tuning of Large Language Models Over The Internet | Unknown | N/A | |
| Mechanic: A Learning Rate Tuner | Unknown | N/A | |
| Going beyond persistent homology using persistent homology | Unknown | N/A | |
| Adaptive Privacy Composition for Accuracy-first Mechanisms | Unknown | N/A | |
| Temporally Disentangled Representation Learning under Unknown Nonstationarity | Unknown | N/A | |
| Evaluating the Moral Beliefs Encoded in LLMs | Unknown | N/A | |
| The Adversarial Consistency of Surrogate Risks for Binary Classification | Unknown | N/A | |
| Improving the Privacy and Practicality of Objective Perturbation for Differentially Private Linear Learners | Unknown | N/A | |
| Isometric Quotient Variational Auto-Encoders for Structure-Preserving Representation Learning | Unknown | N/A | |
| Thought Cloning: Learning to Think while Acting by Imitating Human Thinking | Unknown | N/A | |
| Context-lumpable stochastic bandits | Unknown | N/A | |
| Embracing the chaos: analysis and diagnosis of numerical instability in variational flows | Unknown | N/A | |
| Diverse Conventions for Human-AI Collaboration | Unknown | N/A | |
| RECKONING: Reasoning through Dynamic Knowledge Encoding | Unknown | N/A | |
| Optimistic Rates for Multi-Task Representation Learning | Unknown | N/A | |
| Neural Priming for Sample-Efficient Adaptation | Unknown | N/A | |
| Solving Linear Inverse Problems Provably via Posterior Sampling with Latent Diffusion Models | Unknown | N/A | |
| Easy Learning from Label Proportions | Unknown | N/A | |
| Propagating Knowledge Updates to LMs Through Distillation | Unknown | N/A | |
| Learning Curves for Noisy Heterogeneous Feature-Subsampled Ridge Ensembles | Unknown | N/A | |
| Scenario Diffusion: Controllable Driving Scenario Generation With Diffusion | Unknown | N/A | |
| What Planning Problems Can A Relational Neural Network Solve? | Unknown | N/A | |
| Optimal Learners for Realizable Regression: PAC Learning and Online Learning | Unknown | N/A | |
| Fragment-based Pretraining and Finetuning on Molecular Graphs | Unknown | N/A | |
| Non-Stationary Bandits with Auto-Regressive Temporal Dependency | Unknown | N/A | |
| Learning Universal Policies via Text-Guided Video Generation | Unknown | N/A | |
| Resetting the Optimizer in Deep RL: An Empirical Study | Unknown | N/A | |
| When are ensembles really effective? | Unknown | N/A | |
| Learning Repeatable Speech Embeddings Using An Intra-class Correlation Regularizer | Unknown | N/A | |
| Feature Selection in the Contrastive Analysis Setting | Unknown | N/A | |
| GradOrth: A Simple yet Efficient Out-of-Distribution Detection with Orthogonal Projection of Gradients | Unknown | N/A | |
| On the Convergence to a Global Solution of Shuffling-Type Gradient Algorithms | Unknown | N/A | |
| Reliable learning in challenging environments | Unknown | N/A | |
| Learning to Reason and Memorize with Self-Notes | Unknown | N/A | |
| When Demonstrations meet Generative World Models: A Maximum Likelihood Framework for Offline Inverse Reinforcement Learning | Unknown | N/A | |
| Label-Retrieval-Augmented Diffusion Models for Learning from Noisy Labels | Unknown | N/A | |
| Intervention Generalization: A View from Factor Graph Models | Unknown | N/A | |
| DP-Mix: Mixup-based Data Augmentation for Differentially Private Learning | Unknown | N/A | |
| DIN-SQL: Decomposed In-Context Learning of Text-to-SQL with Self-Correction | Unknown | N/A | |
| Exact Representation of Sparse Networks with Symmetric Nonnegative Embeddings | Unknown | N/A | |
| Optimal Exploration for Model-Based RL in Nonlinear Systems | Unknown | N/A | |
| Large language models transition from integrating across position-yoked, exponential windows to structure-yoked, power-law windows | Unknown | N/A | |
| Marich: A Query-efficient Distributionally Equivalent Model Extraction Attack | Unknown | N/A | |
| Are Diffusion Models Vision-And-Language Reasoners? | Unknown | N/A | |
| FAMO: Fast Adaptive Multitask Optimization | Unknown | N/A | |
| Diffusion-TTA: Test-time Adaptation of Discriminative Models via Generative Feedback | Unknown | N/A | |
| Adaptive Contextual Perception: How To Generalize To New Backgrounds and Ambiguous Objects | Unknown | N/A | |
| What Can We Learn from Unlearnable Datasets? | Unknown | N/A | |
| Online Ad Allocation with Predictions | Unknown | N/A | |
| Eliciting User Preferences for Personalized Multi-Objective Decision Making through Comparative Feedback | Unknown | N/A | |
| PAPR: Proximity Attention Point Rendering | Unknown | N/A | |
| On the Sublinear Regret of GP-UCB | Unknown | N/A | |
| On Imitation in Mean-field Games | Unknown | N/A | |
| Optimistic Exploration in Reinforcement Learning Using Symbolic Model Estimates | Unknown | N/A | |
| Strategic Classification under Unknown Personalized Manipulation | Unknown | N/A | |
| $k$-Means Clustering with Distance-Based Privacy | Unknown | N/A | |
| RoboCLIP: One Demonstration is Enough to Learn Robot Policies | Unknown | N/A | |
| CORNN: Convex optimization of recurrent neural networks for rapid inference of neural dynamics | Unknown | N/A | |
| SE(3) Equivariant Augmented Coupling Flows | Unknown | N/A | |
| One-Pass Distribution Sketch for Measuring Data Heterogeneity in Federated Learning | Unknown | N/A | |
| Feature-Learning Networks Are Consistent Across Widths At Realistic Scales | Unknown | N/A | |
| Bridging RL Theory and Practice with the Effective Horizon | Unknown | N/A | |
| Path Regularization: A Convexity and Sparsity Inducing Regularization for Parallel ReLU Networks | Unknown | N/A | |
| Near-Optimal Bounds for Learning Gaussian Halfspaces with Random Classification Noise | Unknown | N/A | |
| Meek Separators and Their Applications in Targeted Causal Discovery | Unknown | N/A | |
| Students Parrot Their Teachers: Membership Inference on Model Distillation | Unknown | N/A | |
| InstructBLIP: Towards General-purpose Vision-Language Models with Instruction Tuning | Unknown | N/A | |
| Causal Context Connects Counterfactual Fairness to Robust Prediction and Group Fairness | Unknown | N/A | |
| Exact recovery and Bregman hard clustering of node-attributed Stochastic Block Model | Unknown | N/A | |
| Evaluating Neuron Interpretation Methods of NLP Models | Unknown | N/A | |
| Composing Parameter-Efficient Modules with Arithmetic Operation | Unknown | N/A | |
| Conformal Prediction for Time Series with Modern Hopfield Networks | Unknown | N/A | |
| Generalized equivalences between subsampling and ridge regularization | Unknown | N/A | |
| Convolutional Neural Operators for robust and accurate learning of PDEs | Unknown | N/A | |
| LambdaBeam: Neural Program Search with Higher-Order Functions and Lambdas | Unknown | N/A | |
| Accelerated On-Device Forward Neural Network Training with Module-Wise Descending Asynchronism | Unknown | N/A | |
| Efficient Potential-based Exploration in Reinforcement Learning using Inverse Dynamic Bisimulation Metric | Unknown | N/A | |
| Triple Eagle: Simple, Fast and Practical Budget-Feasible Mechanisms | Unknown | N/A | |
| Block Low-Rank Preconditioner with Shared Basis for Stochastic Optimization | Unknown | N/A | |
| FOCAL: Contrastive Learning for Multimodal Time-Series Sensing Signals in Factorized Orthogonal Latent Space | Unknown | N/A | |
| A Bounded Ability Estimation for Computerized Adaptive Testing | Unknown | N/A | |
| ATMAN: Understanding Transformer Predictions Through Memory Efficient Attention Manipulation | Unknown | N/A | |
| Mitigating the Popularity Bias of Graph Collaborative Filtering: A Dimensional Collapse Perspective | Unknown | N/A | |
| Task-aware world model learning with meta weighting via bi-level optimization | Unknown | N/A | |
| A Scalable Neural Network for DSIC Affine Maximizer Auction Design | Unknown | N/A | |
| SUBP: Soft Uniform Block Pruning for 1$\times$N Sparse CNNs Multithreading Acceleration | Unknown | N/A | |
| Glance and Focus: Memory Prompting for Multi-Event Video Question Answering | Unknown | N/A | |
| Dynamic Personalized Federated Learning with Adaptive Differential Privacy | Unknown | N/A | |
| A Deep Instance Generative Framework for MILP Solvers Under Limited Data Availability | Unknown | N/A | |
| Learning-to-Rank Meets Language: Boosting Language-Driven Ordering Alignment for Ordinal Classification | Unknown | N/A | |
| Optimal privacy guarantees for a relaxed threat model: Addressing sub-optimal adversaries in differentially private machine learning | Unknown | N/A | |
| Self-supervised Graph Neural Networks via Low-Rank Decomposition | Unknown | N/A | |
| UniPC: A Unified Predictor-Corrector Framework for Fast Sampling of Diffusion Models | Unknown | N/A | |
| Fed-GraB: Federated Long-tailed Learning with Self-Adjusting Gradient Balancer | Unknown | N/A | |
| GALOPA: Graph Transport Learning with Optimal Plan Alignment | Unknown | N/A | |
| Improving Compositional Generalization using Iterated Learning and Simplicial Embeddings | Unknown | N/A | |
| ResMem: Learn what you can and memorize the rest | Unknown | N/A | |
| PDF: Point Diffusion Implicit Function for Large-scale Scene Neural Representation | Unknown | N/A | |
| Punctuation-level Attack: Single-shot and Single Punctuation Can Fool Text Models | Unknown | N/A | |
| Provably Fast Finite Particle Variants of SVGD via Virtual Particle Stochastic Approximation | Unknown | N/A | |
| A Batch-to-Online Transformation under Random-Order Model | Unknown | N/A | |
| A Recurrent Neural Circuit Mechanism of Temporal-scaling Equivariant Representation | Unknown | N/A | |
| Aligning Synthetic Medical Images with Clinical Knowledge using Human Feedback | Unknown | N/A | |
| Thrust: Adaptively Propels Large Language Models with External Knowledge | Unknown | N/A | |
| CluB: Cluster Meets BEV for LiDAR-Based 3D Object Detection | Unknown | N/A | |
| RRHF: Rank Responses to Align Language Models with Human Feedback | Unknown | N/A | |
| Learning Modulated Transformation in GANs | Unknown | N/A | |
| Uncovering Prototypical Knowledge for Weakly Open-Vocabulary Semantic Segmentation | Unknown | N/A | |
| Lightweight Vision Transformer with Bidirectional Interaction | Unknown | N/A | |
| Safe Exploration in Reinforcement Learning: A Generalized Formulation and Algorithms | Unknown | N/A | |
| Language Models can Solve Computer Tasks | Unknown | N/A | |
| Logarithmic-Regret Quantum Learning Algorithms for Zero-Sum Games | Unknown | N/A | |
| Single-Stage Visual Query Localization in Egocentric Videos | Unknown | N/A | |
| Meta-Adapter: An Online Few-shot Learner for Vision-Language Model | Unknown | N/A | |
| Revisit Weakly-Supervised Audio-Visual Video Parsing from the Language Perspective | Unknown | N/A | |
| Efficient Uncertainty Quantification and Reduction for Over-Parameterized Neural Networks | Unknown | N/A | |
| Bayesian Learning of Optimal Policies in Markov Decision Processes with Countably Infinite State-Space | Unknown | N/A | |
| A Unified Approach to Domain Incremental Learning with Memory: Theory and Algorithm | Unknown | N/A | |
| Balanced Training for Sparse GANs | Unknown | N/A | |
| Act As You Wish: Fine-Grained Control of Motion Diffusion Model with Hierarchical Semantic Graphs | Unknown | N/A | |
| ReTR: Modeling Rendering Via Transformer for Generalizable Neural Surface Reconstruction | Unknown | N/A | |
| Towards Efficient Pre-Trained Language Model via Feature Correlation Distillation | Unknown | N/A | |
| Fast Partitioned Learned Bloom Filter | Unknown | N/A | |
| Towards Hybrid-grained Feature Interaction Selection for Deep Sparse Network | Unknown | N/A | |
| Vulnerabilities in Video Quality Assessment Models: The Challenge of Adversarial Attacks | Unknown | N/A | |
| Learning to Parameterize Visual Attributes for Open-set Fine-grained Retrieval | Unknown | N/A | |
| On Robust Streaming for Learning with Experts: Algorithms and Lower Bounds | Unknown | N/A | |
| H3T: Efficient Integration of Memory Optimization and Parallelism for Large-scale Transformer Training | Unknown | N/A | |
| Test-Time Distribution Normalization for Contrastively Learned Visual-language Models | Unknown | N/A | |
| RanPAC: Random Projections and Pre-trained Models for Continual Learning | Unknown | N/A | |
| Learning non-Markovian Decision-Making from State-only Sequences | Unknown | N/A | |
| Layer-Neighbor Sampling --- Defusing Neighborhood Explosion in GNNs | Unknown | N/A | |
| Training Private Models That Know What They Don’t Know | Unknown | N/A | |
| Improving Diffusion-Based Image Synthesis with Context Prediction | Unknown | N/A | |
| HyP-NeRF: Learning Improved NeRF Priors using a HyperNetwork | Unknown | N/A | |
| Deep Momentum Multi-Marginal Schrödinger Bridge | Unknown | N/A | |
| An Empirical Study Towards Prompt-Tuning for Graph Contrastive Pre-Training in Recommendations | Unknown | N/A | |
| False Discovery Proportion control for aggregated Knockoffs | Unknown | N/A | |
| Max-Margin Token Selection in Attention Mechanism | Unknown | N/A | |
| Block-Coordinate Methods and Restarting for Solving Extensive-Form Games | Unknown | N/A | |
| Video Prediction Models as Rewards for Reinforcement Learning | Unknown | N/A | |
| A*Net: A Scalable Path-based Reasoning Approach for Knowledge Graphs | Unknown | N/A | |
| Core-sets for Fair and Diverse Data Summarization | Unknown | N/A | |
| Spectral Evolution and Invariance in Linear-width Neural Networks | Unknown | N/A | |
| Agnostic Multi-Group Active Learning | Unknown | N/A | |
| DropCompute: simple and more robust distributed synchronous training via compute variance reduction | Unknown | N/A | |
| Dynamic Pricing and Learning with Bayesian Persuasion | Unknown | N/A | |
| Transformers as Statisticians: Provable In-Context Learning with In-Context Algorithm Selection | Unknown | N/A | |
| Synthetic Combinations: A Causal Inference Framework for Combinatorial Interventions | Unknown | N/A | |
| A Graph-Theoretic Framework for Understanding Open-World Semi-Supervised Learning | Unknown | N/A | |
| Learning Unseen Modality Interaction | Unknown | N/A | |
| A One-Size-Fits-All Approach to Improving Randomness in Paper Assignment | Unknown | N/A | |
| Synthetic Experience Replay | Unknown | N/A | |
| Policy Finetuning in Reinforcement Learning via Design of Experiments using Offline Data | Unknown | N/A | |
| Symbolic Discovery of Optimization Algorithms | Unknown | N/A | |
| Predict-then-Calibrate: A New Perspective of Robust Contextual LP | Unknown | N/A | |
| Face Reconstruction from Facial Templates by Learning Latent Space of a Generator Network | Unknown | N/A | |
| Implicit Differentiable Outlier Detection Enable Robust Deep Multimodal Analysis | Unknown | N/A | |
| ZoomTrack: Target-aware Non-uniform Resizing for Efficient Visual Tracking | Unknown | N/A | |
| Learning Dynamic Attribute-factored World Models for Efficient Multi-object Reinforcement Learning | Unknown | N/A | |
| Tight Bounds for Volumetric Spanners and Applications | Unknown | N/A | |
| Online Convex Optimization with Unbounded Memory | Unknown | N/A | |
| Tanimoto Random Features for Scalable Molecular Machine Learning | Unknown | N/A | |
| Learning World Models with Identifiable Factorization | Unknown | N/A | |
| SpatialRank: Urban Event Ranking with NDCG Optimization on Spatiotemporal Data | Unknown | N/A | |
| Structured Federated Learning through Clustered Additive Modeling | Unknown | N/A | |
| Learning and Collusion in Multi-unit Auctions | Unknown | N/A | |
| IDEA: An Invariant Perspective for Efficient Domain Adaptive Image Retrieval | Unknown | N/A | |
| Michelangelo: Conditional 3D Shape Generation based on Shape-Image-Text Aligned Latent Representation | Unknown | N/A | |
| Beyond probability partitions: Calibrating neural networks with semantic aware grouping | Unknown | N/A | |
| MixFormerV2: Efficient Fully Transformer Tracking | Unknown | N/A | |
| Tight Risk Bounds for Gradient Descent on Separable Data | Unknown | N/A | |
| Hidden Poison: Machine Unlearning Enables Camouflaged Poisoning Attacks | Unknown | N/A | |
| Complex-valued Neurons Can Learn More but Slower than Real-valued Neurons via Gradient Descent | Unknown | N/A | |
| Model Spider: Learning to Rank Pre-Trained Models Efficiently | Unknown | N/A | |
| HEDNet: A Hierarchical Encoder-Decoder Network for 3D Object Detection in Point Clouds | Unknown | N/A | |
| Federated Learning with Manifold Regularization and Normalized Update Reaggregation | Unknown | N/A | |
| Pre-Training Protein Encoder via Siamese Sequence-Structure Diffusion Trajectory Prediction | Unknown | N/A | |
| Eliminating Domain Bias for Federated Learning in Representation Space | Unknown | N/A | |
| Deconstructing Data Reconstruction: Multiclass, Weight Decay and General Losses | Unknown | N/A | |
| Slot-guided Volumetric Object Radiance Fields | Unknown | N/A | |
| (Almost) Provable Error Bounds Under Distribution Shift via Disagreement Discrepancy | Unknown | N/A | |
| Automatic Grouping for Efficient Cooperative Multi-Agent Reinforcement Learning | Unknown | N/A | |
| How does GPT-2 compute greater-than?: Interpreting mathematical abilities in a pre-trained language model | Unknown | N/A | |
| Trans-Dimensional Generative Modeling via Jump Diffusion Models | Unknown | N/A | |
| Pre-training Contextualized World Models with In-the-wild Videos for Reinforcement Learning | Unknown | N/A | |
| Learning in the Presence of Low-dimensional Structure: A Spiked Random Matrix Perspective | Unknown | N/A | |
| ForkMerge: Mitigating Negative Transfer in Auxiliary-Task Learning | Unknown | N/A | |
| Equivariant Spatio-Temporal Attentive Graph Networks to Simulate Physical Dynamics | Unknown | N/A | |
| Crystal Structure Prediction by Joint Equivariant Diffusion | Unknown | N/A | |
| ReContrast: Domain-Specific Anomaly Detection via Contrastive Reconstruction | Unknown | N/A | |
| Mitigating Over-smoothing in Transformers via Regularized Nonlocal Functionals | Unknown | N/A | |
| Feature Dropout: Revisiting the Role of Augmentations in Contrastive Learning | Unknown | N/A | |
| TensorNet: Cartesian Tensor Representations for Efficient Learning of Molecular Potentials | Unknown | N/A | |
| From Cloze to Comprehension: Retrofitting Pre-trained Masked Language Models to Pre-trained Machine Reader | Unknown | N/A | |
| Pick-a-Pic: An Open Dataset of User Preferences for Text-to-Image Generation | Unknown | N/A | |
| Contrastive Sampling Chains in Diffusion Models | Unknown | N/A | |
| DASpeech: Directed Acyclic Transformer for Fast and High-quality Speech-to-Speech Translation | Unknown | N/A | |
| Towards Distribution-Agnostic Generalized Category Discovery | Unknown | N/A | |
| Temperature Balancing, Layer-wise Weight Analysis, and Neural Network Training | Unknown | N/A | |
| Efficient Test-Time Adaptation for Super-Resolution with Second-Order Degradation and Reconstruction | Unknown | N/A | |
| Neural Frailty Machine: Beyond proportional hazard assumption in neural survival regressions | Unknown | N/A | |
| Spectral Co-Distillation for Personalized Federated Learning | Unknown | N/A | |
| Shared Adversarial Unlearning: Backdoor Mitigation by Unlearning Shared Adversarial Examples | Unknown | N/A | |
| Where2Explore: Few-shot Affordance Learning for Unseen Novel Categories of Articulated Objects | Unknown | N/A | |
| PTQD: Accurate Post-Training Quantization for Diffusion Models | Unknown | N/A | |
| Language Models Can Improve Event Prediction by Few-Shot Abductive Reasoning | Unknown | N/A | |
| Optimal Treatment Regimes for Proximal Causal Learning | Unknown | N/A | |
| Swap Agnostic Learning, or Characterizing Omniprediction via Multicalibration | Unknown | N/A | |
| Regret-Optimal Model-Free Reinforcement Learning for Discounted MDPs with Short Burn-In Time | Unknown | N/A | |
| Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models | Unknown | N/A | |
| Large Language Models can Implement Policy Iteration | Unknown | N/A | |
| iSCAN: Identifying Causal Mechanism Shifts among Nonlinear Additive Noise Models | Unknown | N/A | |
| Global Optimality in Bivariate Gradient-based DAG Learning | Unknown | N/A | |
| A Dual-Stream Neural Network Explains the Functional Segregation of Dorsal and Ventral Visual Pathways in Human Brains | Unknown | N/A | |
| Truly Scale-Equivariant Deep Nets with Fourier Layers | Unknown | N/A | |
| Inverse Preference Learning: Preference-based RL without a Reward Function | Unknown | N/A | |
| Neural Functional Transformers | Unknown | N/A | |
| Permutation Equivariant Neural Functionals | Unknown | N/A | |
| Unsupervised Protein-Ligand Binding Energy Prediction via Neural Euler's Rotation Equation | Unknown | N/A | |
| Learning Invariant Representations with a Nonparametric Nadaraya-Watson Head | Unknown | N/A | |
| Online Constrained Meta-Learning: Provable Guarantees for Generalization | Unknown | N/A | |
| Construction of Hierarchical Neural Architecture Search Spaces based on Context-free Grammars | Unknown | N/A | |
| Adaptive Selective Sampling for Online Prediction with Experts | Unknown | N/A | |
| Coneheads: Hierarchy Aware Attention | Unknown | N/A | |
| A Measure-Theoretic Axiomatisation of Causality | Unknown | N/A | |
| Censored Sampling of Diffusion Models Using 3 Minutes of Human Feedback | Unknown | N/A | |
| Text-to-Image Diffusion Models are Zero Shot Classifiers | Unknown | N/A | |
| REx: Data-Free Residual Quantization Error Expansion | Unknown | N/A | |
| PointGPT: Auto-regressively Generative Pre-training from Point Clouds | Unknown | N/A | |
| Robust Model Reasoning and Fitting via Dual Sparsity Pursuit | Unknown | N/A | |
| Towards Characterizing the First-order Query Complexity of Learning (Approximate) Nash Equilibria in Zero-sum Matrix Games | Unknown | N/A | |
| FourierHandFlow: Neural 4D Hand Representation Using Fourier Query Flow | Unknown | N/A | |
| GlyphControl: Glyph Conditional Control for Visual Text Generation | Unknown | N/A | |
| Generalizable Lightweight Proxy for Robust NAS against Diverse Perturbations | Unknown | N/A | |
| Understanding, Predicting and Better Resolving Q-Value Divergence in Offline-RL | Unknown | N/A | |
| LLM-Pruner: On the Structural Pruning of Large Language Models | Unknown | N/A | |
| Masked Space-Time Hash Encoding for Efficient Dynamic Scene Reconstruction | Unknown | N/A | |
| NeuralGF: Unsupervised Point Normal Estimation by Learning Neural Gradient Function | Unknown | N/A | |
| LuminAIRe: Illumination-Aware Conditional Image Repainting for Lighting-Realistic Generation | Unknown | N/A | |
| Latent exploration for Reinforcement Learning | Unknown | N/A | |
| Discover and Align Taxonomic Context Priors for Open-world Semi-Supervised Learning | Unknown | N/A | |
| Saving 100x Storage: Prototype Replay for Reconstructing Training Sample Distribution in Class-Incremental Semantic Segmentation | Unknown | N/A | |
| Generative Category-level Object Pose Estimation via Diffusion Models | Unknown | N/A | |
| Generalized Weighted Path Consistency for Mastering Atari Games | Unknown | N/A | |
| Towards Better Dynamic Graph Learning: New Architecture and Unified Library | Unknown | N/A | |
| Find What You Want: Learning Demand-conditioned Object Attribute Space for Demand-driven Navigation | Unknown | N/A | |
| Masked Image Residual Learning for Scaling Deeper Vision Transformers | Unknown | N/A | |
| No-Regret Learning in Dynamic Competition with Reference Effects Under Logit Demand | Unknown | N/A | |
| Neural Polarizer: A Lightweight and Effective Backdoor Defense via Purifying Poisoned Features | Unknown | N/A | |
| HubRouter: Learning Global Routing via Hub Generation and Pin-hub Connection | Unknown | N/A | |
| Scalable Primal-Dual Actor-Critic Method for Safe Multi-Agent RL with General Utilities | Unknown | N/A | |
| Demystifying Softmax Gating Function in Gaussian Mixture of Experts | Unknown | N/A | |
| Learning Environment-Aware Affordance for 3D Articulated Object Manipulation under Occlusions | Unknown | N/A | |
| Knowledge Distillation for High Dimensional Search Index | Unknown | N/A | |
| Accelerating Value Iteration with Anchoring | Unknown | N/A | |
| One Fits All: Power General Time Series Analysis by Pretrained LM | Unknown | N/A | |
| Maximize to Explore: One Objective Function Fusing Estimation, Planning, and Exploration | Unknown | N/A | |
| One Less Reason for Filter Pruning: Gaining Free Adversarial Robustness with Structured Grouped Kernel Pruning | Unknown | N/A | |
| Diffusion Probabilistic Models for Structured Node Classification | Unknown | N/A | |
| Overcoming Recency Bias of Normalization Statistics in Continual Learning: Balance and Adaptation | Unknown | N/A | |
| Decompose Novel into Known: Part Concept Learning For 3D Novel Class Discovery | Unknown | N/A | |
| A Single 2D Pose with Context is Worth Hundreds for 3D Human Pose Estimation | Unknown | N/A | |
| Markovian Sliced Wasserstein Distances: Beyond Independent Projections | Unknown | N/A | |
| Controlling Text-to-Image Diffusion by Orthogonal Finetuning | Unknown | N/A | |
| Hierarchical Decomposition of Prompt-Based Continual Learning: Rethinking Obscured Sub-optimality | Unknown | N/A | |
| Interpretability at Scale: Identifying Causal Mechanisms in Alpaca | Unknown | N/A | |
| Robust Lipschitz Bandits to Adversarial Corruptions | Unknown | N/A | |
| Connected Superlevel Set in (Deep) Reinforcement Learning and its Application to Minimax Theorems | Unknown | N/A | |
| Energy-Based Sliced Wasserstein Distance | Unknown | N/A | |
| Riemannian Residual Neural Networks | Unknown | N/A | |
| Estimating Riemannian Metric with Noise-Contaminated Intrinsic Distance | Unknown | N/A | |
| One-Step Diffusion Distillation via Deep Equilibrium Models | Unknown | N/A | |
| Multi-Player Zero-Sum Markov Games with Networked Separable Interactions | Unknown | N/A | |
| Squeeze, Recover and Relabel: Dataset Condensation at ImageNet Scale From A New Perspective | Unknown | N/A | |
| Time-Reversed Dissipation Induces Duality Between Minimizing Gradient Norm and Function Value | Unknown | N/A | |
| Certifiably Robust Graph Contrastive Learning | Unknown | N/A | |
| Decentralized Randomly Distributed Multi-agent Multi-armed Bandit with Heterogeneous Rewards | Unknown | N/A | |
| Latent Graph Inference with Limited Supervision | Unknown | N/A | |
| Latent Field Discovery in Interacting Dynamical Systems with Neural Fields | Unknown | N/A | |
| Interactive Multi-fidelity Learning for Cost-effective Adaptation of Language Model with Sparse Human Supervision | Unknown | N/A | |
| Strategic Data Sharing between Competitors | Unknown | N/A | |
| Two Sides of The Same Coin: Bridging Deep Equilibrium Models and Neural ODEs via Homotopy Continuation | Unknown | N/A | |
| Domain Re-Modulation for Few-Shot Generative Domain Adaptation | Unknown | N/A | |
| Unifying GANs and Score-Based Diffusion as Generative Particle Models | Unknown | N/A | |
| Rank-DETR for High Quality Object Detection | Unknown | N/A | |
| Enhancing Adaptive History Reserving by Spiking Convolutional Block Attention Module in Recurrent Neural Networks | Unknown | N/A | |
| Regularization properties of adversarially-trained linear regression | Unknown | N/A | |
| On Class Distributions Induced by Nearest Neighbor Graphs for Node Classification of Tabular Data | Unknown | N/A | |
| Imagine That! Abstract-to-Intricate Text-to-Image Synthesis with Scene Graph Hallucination Diffusion | Unknown | N/A | |
| Greatness in Simplicity: Unified Self-Cycle Consistency for Parser-Free Virtual Try-On | Unknown | N/A | |
| Exploring Question Decomposition for Zero-Shot VQA | Unknown | N/A | |
| PRIOR: Personalized Prior for Reactivating the Information Overlooked in Federated Learning. | Unknown | N/A | |
| Scale-teaching: Robust Multi-scale Training for Time Series Classification with Noisy Labels | Unknown | N/A | |
| SEGA: Instructing Text-to-Image Models using Semantic Guidance | Unknown | N/A | |
| High-dimensional Contextual Bandit Problem without Sparsity | Unknown | N/A | |
| Uncertainty Quantification over Graph with Conformalized Graph Neural Networks | Unknown | N/A | |
| Bayesian Learning via Q-Exponential Process | Unknown | N/A | |
| The Double-Edged Sword of Implicit Bias: Generalization vs. Robustness in ReLU Networks | Unknown | N/A | |
| Learning Energy-based Model via Dual-MCMC Teaching | Unknown | N/A | |
| Predict, Refine, Synthesize: Self-Guiding Diffusion Models for Probabilistic Time Series Forecasting | Unknown | N/A | |
| Uncertainty Quantification via Neural Posterior Principal Components | Unknown | N/A | |
| FedGCN: Convergence-Communication Tradeoffs in Federated Training of Graph Convolutional Networks | Unknown | N/A | |
| Contextual Stochastic Bilevel Optimization | Unknown | N/A | |
| Multi-Head Adapter Routing for Cross-Task Generalization | Unknown | N/A | |
| Cal-DETR: Calibrated Detection Transformer | Unknown | N/A | |
| Two Sides of One Coin: the Limits of Untuned SGD and the Power of Adaptive Methods | Unknown | N/A | |
| Cheaply Estimating Inference Efficiency Metrics for Autoregressive Transformer Models | Unknown | N/A | |
| PHOTOSWAP: Personalized Subject Swapping in Images | Unknown | N/A | |
| Unpaired Multi-Domain Causal Representation Learning | Unknown | N/A | |
| Characterization and Learning of Causal Graphs with Small Conditioning Sets | Unknown | N/A | |
| Active Learning for Semantic Segmentation with Multi-class Label Query | Unknown | N/A | |
| Thinker: Learning to Plan and Act | Unknown | N/A | |
| Horospherical Decision Boundaries for Large Margin Classification in Hyperbolic Space | Unknown | N/A | |
| Coordinating Distributed Example Orders for Provably Accelerated Training | Unknown | N/A | |
| FLuID: Mitigating Stragglers in Federated Learning using Invariant Dropout | Unknown | N/A | |
| Explainable Brain Age Prediction using coVariance Neural Networks | Unknown | N/A | |
| Learning with Explanation Constraints | Unknown | N/A | |
| Locality-Aware Generalizable Implicit Neural Representation | Unknown | N/A | |
| Language Semantic Graph Guided Data-Efficient Learning | Unknown | N/A | |
| Compression with Bayesian Implicit Neural Representations | Unknown | N/A | |
| ProtoDiff: Learning to Learn Prototypical Networks by Task-Guided Diffusion | Unknown | N/A | |
| Neural Foundations of Mental Simulation: Future Prediction of Latent Representations on Dynamic Scenes | Unknown | N/A | |
| Learning a Neuron by a Shallow ReLU Network: Dynamics and Implicit Bias for Correlated Inputs | Unknown | N/A | |
| Joint Feature and Differentiable $k$-NN Graph Learning using Dirichlet Energy | Unknown | N/A | |
| Uncoupled and Convergent Learning in Two-Player Zero-Sum Markov Games with Bandit Feedback | Unknown | N/A | |
| Transfer learning for atomistic simulations using GNNs and kernel mean embeddings | Unknown | N/A | |
| Adaptive Principal Component Regression with Applications to Panel Data | Unknown | N/A | |
| DatasetDM: Synthesizing Data with Perception Annotations Using Diffusion Models | Unknown | N/A | |
| AV-NeRF: Learning Neural Fields for Real-World Audio-Visual Scene Synthesis | Unknown | N/A | |
| A Metadata-Driven Approach to Understand Graph Neural Networks | Unknown | N/A | |
| Learning Better with Less: Effective Augmentation for Sample-Efficient Visual Reinforcement Learning | Unknown | N/A | |
| Delayed Algorithms for Distributed Stochastic Weakly Convex Optimization | Unknown | N/A | |
| Regret Matching+: (In)Stability and Fast Convergence in Games | Unknown | N/A | |
| SimMMDG: A Simple and Effective Framework for Multi-modal Domain Generalization | Unknown | N/A | |
| Bypassing the Simulator: Near-Optimal Adversarial Linear Contextual Bandits | Unknown | N/A | |
| VRA: Variational Rectified Activation for Out-of-distribution Detection | Unknown | N/A | |
| ALIM: Adjusting Label Importance Mechanism for Noisy Partial Label Learning | Unknown | N/A | |
| Flow: Per-instance Personalized Federated Learning | Unknown | N/A | |
| CP-SLAM: Collaborative Neural Point-based SLAM System | Unknown | N/A | |
| Trading-off price for data quality to achieve fair online allocation | Unknown | N/A | |
| Optimizing over trained GNNs via symmetry breaking | Unknown | N/A | |
| Trust Your $\nabla$: Gradient-based Intervention Targeting for Causal Discovery | Unknown | N/A | |
| Assessor360: Multi-sequence Network for Blind Omnidirectional Image Quality Assessment | Unknown | N/A | |
| Normalization-Equivariant Neural Networks with Application to Image Denoising | Unknown | N/A | |
| Neural Injective Functions for Multisets, Measures and Graphs via a Finite Witness Theorem | Unknown | N/A | |
| Post-processing Private Synthetic Data for Improving Utility on Selected Measures | Unknown | N/A | |
| GEX: A flexible method for approximating influence via Geometric Ensemble | Unknown | N/A | |
| Evaluating Post-hoc Explanations for Graph Neural Networks via Robustness Analysis | Unknown | N/A | |
| Classification of Heavy-tailed Features in High Dimensions: a Superstatistical Approach | Unknown | N/A | |
| Composable Coresets for Determinant Maximization: Greedy is Almost Optimal | Unknown | N/A | |
| Don't be so Monotone: Relaxing Stochastic Line Search in Over-Parameterized Models | Unknown | N/A | |
| Breaking the Communication-Privacy-Accuracy Tradeoff with $f$-Differential Privacy | Unknown | N/A | |
| Fast Optimal Transport through Sliced Generalized Wasserstein Geodesics | Unknown | N/A | |
| GIMLET: A Unified Graph-Text Model for Instruction-Based Molecule Zero-Shot Learning | Unknown | N/A | |
| HyTrel: Hypergraph-enhanced Tabular Data Representation Learning | Unknown | N/A | |
| Bootstrapped Training of Score-Conditioned Generator for Offline Design of Biological Sequences | Unknown | N/A | |
| Online Inventory Problems: Beyond the i.i.d. Setting with Online Convex Optimization | Unknown | N/A | |
| On the Convergence and Sample Complexity Analysis of Deep Q-Networks with $\epsilon$-Greedy Exploration | Unknown | N/A | |
| Evaluating Cognitive Maps and Planning in Large Language Models with CogEval | Unknown | N/A | |
| Parameterizing Context: Unleashing the Power of Parameter-Efficient Fine-Tuning and In-Context Tuning for Continual Table Semantic Parsing | Unknown | N/A | |
| Aggregating Capacity in FL through Successive Layer Training for Computationally-Constrained Devices | Unknown | N/A | |
| Learning Multi-agent Behaviors from Distributed and Streaming Demonstrations | Unknown | N/A | |
| Can Pre-Trained Text-to-Image Models Generate Visual Goals for Reinforcement Learning? | Unknown | N/A | |
| Unsupervised Anomaly Detection with Rejection | Unknown | N/A | |
| Anchor Data Augmentation | Unknown | N/A | |
| Meta-AdaM: An Meta-Learned Adaptive Optimizer with Momentum for Few-Shot Learning | Unknown | N/A | |
| PreDiff: Precipitation Nowcasting with Latent Diffusion Models | Unknown | N/A | |
| A Heat Diffusion Perspective on Geodesic Preserving Dimensionality Reduction | Unknown | N/A | |
| Global Structure-Aware Diffusion Process for Low-light Image Enhancement | Unknown | N/A | |
| How a Student becomes a Teacher: learning and forgetting through Spectral methods | Unknown | N/A | |
| Approximate Heavy Tails in Offline (Multi-Pass) Stochastic Gradient Descent | Unknown | N/A | |
| Learning Rate Free Sampling in Constrained Domains | Unknown | N/A | |
| Safety Verification of Decision-Tree Policies in Continuous Time | Unknown | N/A | |
| Cocktail: Mixing Multi-Modality Control for Text-Conditional Image Generation | Unknown | N/A | |
| Binary Classification with Confidence Difference | Unknown | N/A | |
| A3FL: Adversarially Adaptive Backdoor Attacks to Federated Learning | Unknown | N/A | |
| Efficient Model-Free Exploration in Low-Rank MDPs | Unknown | N/A | |
| GeoPhy: Differentiable Phylogenetic Inference via Geometric Gradients of Tree Topologies | Unknown | N/A | |
| When Does Confidence-Based Cascade Deferral Suffice? | Unknown | N/A | |
| Dynamically Masked Discriminator for GANs | Unknown | N/A | |
| A Theoretical Analysis of the Test Error of Finite-Rank Kernel Ridge Regression | Unknown | N/A | |
| Combating Bilateral Edge Noise for Robust Link Prediction | Unknown | N/A | |
| SLIBO-Net: Floorplan Reconstruction via Slicing Box Representation with Local Geometry Regularization | Unknown | N/A | |
| Globally injective and bijective neural operators | Unknown | N/A | |
| Mutual-Information Regularized Multi-Agent Policy Iteration | Unknown | N/A | |
| Rewrite Caption Semantics: Bridging Semantic Gaps for Language-Supervised Semantic Segmentation | Unknown | N/A | |
| Boosting Spectral Clustering on Incomplete Data via Kernel Correction and Affinity Learning | Unknown | N/A | |
| Understanding Contrastive Learning via Distributionally Robust Optimization | Unknown | N/A | |
| MEMTO: Memory-guided Transformer for Multivariate Time Series Anomaly Detection | Unknown | N/A | |
| Learning Large-Scale MTP$_2$ Gaussian Graphical Models via Bridge-Block Decomposition | Unknown | N/A | |
| Convergent Bregman Plug-and-Play Image Restoration for Poisson Inverse Problems | Unknown | N/A | |
| Black-Box Differential Privacy for Interactive ML | Unknown | N/A | |
| Connecting Multi-modal Contrastive Representations | Unknown | N/A | |
| Top-Ambiguity Samples Matter: Understanding Why Deep Ensemble Works in Selective Classification | Unknown | N/A | |
| A Theory of Unsupervised Translation Motivated by Understanding Animal Communication | Unknown | N/A | |
| Bias in Evaluation Processes: An Optimization-Based Model | Unknown | N/A | |
| SNAP: Self-Supervised Neural Maps for Visual Positioning and Semantic Understanding | Unknown | N/A | |
| Beyond Uniform Sampling: Offline Reinforcement Learning with Imbalanced Datasets | Unknown | N/A | |
| Discrete-Smoothness in Online Algorithms with Predictions | Unknown | N/A | |
| Efficient RL with Impaired Observability: Learning to Act with Delayed and Missing State Observations | Unknown | N/A | |
| Beyond Deep Ensembles: A Large-Scale Evaluation of Bayesian Deep Learning under Distribution Shift | Unknown | N/A | |
| AD-PT: Autonomous Driving Pre-Training with Large-scale Point Cloud Dataset | Unknown | N/A | |
| Calibrating Neural Simulation-Based Inference with Differentiable Coverage Probability | Unknown | N/A | |
| A Unified Framework for Uniform Signal Recovery in Nonlinear Generative Compressed Sensing | Unknown | N/A | |
| Fused Gromov-Wasserstein Graph Mixup for Graph-level Classifications | Unknown | N/A | |
| Relative Entropic Optimal Transport: a (Prior-aware) Matching Perspective to (Unbalanced) Classification | Unknown | N/A | |
| Revisiting Adversarial Robustness Distillation from the Perspective of Robust Fairness | Unknown | N/A | |
| Generalised f-Mean Aggregation for Graph Neural Networks | Unknown | N/A | |
| Self-Adaptive Motion Tracking against On-body Displacement of Flexible Sensors | Unknown | N/A | |
| DELTA: Diverse Client Sampling for Fasting Federated Learning | Unknown | N/A | |
| RS-Del: Edit Distance Robustness Certificates for Sequence Classifiers via Randomized Deletion | Unknown | N/A | |
| Multi-Step Generalized Policy Improvement by Leveraging Approximate Models | Unknown | N/A | |
| Alleviating the Semantic Gap for Generalized fMRI-to-Image Reconstruction | Unknown | N/A | |
| GraphAdapter: Tuning Vision-Language Models With Dual Knowledge Graph | Unknown | N/A | |
| SHOT: Suppressing the Hessian along the Optimization Trajectory for Gradient-Based Meta-Learning | Unknown | N/A | |
| Towards Consistent Video Editing with Text-to-Image Diffusion Models | Unknown | N/A | |
| Opening the Vocabulary of Egocentric Actions | Unknown | N/A | |
| Exact Generalization Guarantees for (Regularized) Wasserstein Distributionally Robust Models | Unknown | N/A | |
| Finite-Time Analysis of Single-Timescale Actor-Critic | Unknown | N/A | |
| A fast heuristic to optimize time-space tradeoff for large models | Unknown | N/A | |
| Geometric Algebra Transformer | Unknown | N/A | |
| STORM: Efficient Stochastic Transformer based World Models for Reinforcement Learning | Unknown | N/A | |
| From Discrete Tokens to High-Fidelity Audio Using Multi-Band Diffusion | Unknown | N/A | |
| VisionLLM: Large Language Model is also an Open-Ended Decoder for Vision-Centric Tasks | Unknown | N/A | |
| Penalising the biases in norm regularisation enforces sparsity | Unknown | N/A | |
| Conservative Offline Policy Adaptation in Multi-Agent Games | Unknown | N/A | |
| On Certified Generalization in Structured Prediction | Unknown | N/A | |
| Is Distance Matrix Enough for Geometric Deep Learning? | Unknown | N/A | |
| What You See is What You Read? Improving Text-Image Alignment Evaluation | Unknown | N/A | |
| Modality-Independent Teachers Meet Weakly-Supervised Audio-Visual Event Parser | Unknown | N/A | |
| Point Cloud Completion with Pretrained Text-to-Image Diffusion Models | Unknown | N/A | |
| CQM: Curriculum Reinforcement Learning with a Quantized World Model | Unknown | N/A | |
| Chatting Makes Perfect: Chat-based Image Retrieval | Unknown | N/A | |
| Idempotent Learned Image Compression with Right-Inverse | Unknown | N/A | |
| Neural Fields with Hard Constraints of Arbitrary Differential Order | Unknown | N/A | |
| Optimize Planning Heuristics to Rank, not to Estimate Cost-to-Goal | Unknown | N/A | |
| LLMScore: Unveiling the Power of Large Language Models in Text-to-Image Synthesis Evaluation | Unknown | N/A | |
| Language-driven Scene Synthesis using Multi-conditional Diffusion Model | Unknown | N/A | |
| PUCA: Patch-Unshuffle and Channel Attention for Enhanced Self-Supervised Image Denoising | Unknown | N/A | |
| Towards Revealing the Mystery behind Chain of Thought: A Theoretical Perspective | Unknown | N/A | |
| VillanDiffusion: A Unified Backdoor Attack Framework for Diffusion Models | Unknown | N/A | |
| Score-based Data Assimilation | Unknown | N/A | |
| SODA: Robust Training of Test-Time Data Adaptors | Unknown | N/A | |
| RL-based Stateful Neural Adaptive Sampling and Denoising for Real-Time Path Tracing | Unknown | N/A | |
| Macro Placement by Wire-Mask-Guided Black-Box Optimization | Unknown | N/A | |
| Deep Optimal Transport: A Practical Algorithm for Photo-realistic Image Restoration | Unknown | N/A | |
| Learning Energy-Based Prior Model with Diffusion-Amortized MCMC | Unknown | N/A | |
| Inner-Outer Aware Reconstruction Model for Monocular 3D Scene Reconstruction | Unknown | N/A | |
| Understanding How Consistency Works in Federated Learning via Stage-wise Relaxed Initialization | Unknown | N/A | |
| DeepSimHO: Stable Pose Estimation for Hand-Object Interaction via Physics Simulation | Unknown | N/A | |
| Selective Amnesia: A Continual Learning Approach to Forgetting in Deep Generative Models | Unknown | N/A | |
| Reward Imputation with Sketching for Contextual Batched Bandits | Unknown | N/A | |
| Rethinking Tokenizer and Decoder in Masked Graph Modeling for Molecules | Unknown | N/A | |
| Prototype-based Aleatoric Uncertainty Quantification for Cross-modal Retrieval | Unknown | N/A | |
| PUe: Biased Positive-Unlabeled Learning Enhancement by Causal Inference | Unknown | N/A | |
| Topological RANSAC for instance verification and retrieval without fine-tuning | Unknown | N/A | |
| Non-Asymptotic Analysis of a UCB-based Top Two Algorithm | Unknown | N/A | |
| Lovász Principle for Unsupervised Graph Representation Learning | Unknown | N/A | |
| MathNAS: If Blocks Have a Role in Mathematical Architecture Design | Unknown | N/A | |
| Orthogonal Non-negative Tensor Factorization based Multi-view Clustering | Unknown | N/A | |
| Understanding the Limitations of Deep Models for Molecular property prediction: Insights and Solutions | Unknown | N/A | |
| Optimized Covariance Design for AB Test on Social Network under Interference | Unknown | N/A | |
| Off-Policy Evaluation for Human Feedback | Unknown | N/A | |
| Non-adversarial training of Neural SDEs with signature kernel scores | Unknown | N/A | |
| Separable Physics-Informed Neural Networks | Unknown | N/A | |
| Pareto Frontiers in Deep Feature Learning: Data, Compute, Width, and Luck | Unknown | N/A | |
| Exposing Attention Glitches with Flip-Flop Language Modeling | Unknown | N/A | |
| Goal-conditioned Offline Planning from Curious Exploration | Unknown | N/A | |
| Offline Reinforcement Learning with Differential Privacy | Unknown | N/A | |
| FaceComposer: A Unified Model for Versatile Facial Content Creation | Unknown | N/A | |
| Neural (Tangent Kernel) Collapse | Unknown | N/A | |
| Harnessing Hard Mixed Samples with Decoupled Regularizer | Unknown | N/A | |
| Collaborative Alignment of NLP Models | Unknown | N/A | |
| IEBins: Iterative Elastic Bins for Monocular Depth Estimation | Unknown | N/A | |
| VanillaNet: the Power of Minimalism in Deep Learning | Unknown | N/A | |
| AbDiffuser: full-atom generation of in-vitro functioning antibodies | Unknown | N/A | |
| Spectral Invariant Learning for Dynamic Graphs under Distribution Shifts | Unknown | N/A | |
| Explaining the Uncertain: Stochastic Shapley Values for Gaussian Process Models | Unknown | N/A | |
| FedFed: Feature Distillation against Data Heterogeneity in Federated Learning | Unknown | N/A | |
| Gradient-Free Kernel Stein Discrepancy | Unknown | N/A | |
| Fairness-guided Few-shot Prompting for Large Language Models | Unknown | N/A | |
| Scattering Vision Transformer: Spectral Mixing Matters | Unknown | N/A | |
| Encoding Human Behavior in Information Design through Deep Learning | Unknown | N/A | |
| Recovering Unbalanced Communities in the Stochastic Block Model with Application to Clustering with a Faulty Oracle | Unknown | N/A | |
| Attentive Transfer Entropy to Exploit Transient Emergence of Coupling Effect | Unknown | N/A | |
| Setting the Trap: Capturing and Defeating Backdoors in Pretrained Language Models through Honeypots | Unknown | N/A | |
| Sequential Subset Matching for Dataset Distillation | Unknown | N/A | |
| SmooSeg: Smoothness Prior for Unsupervised Semantic Segmentation | Unknown | N/A | |
| Performance Scaling via Optimal Transport: Enabling Data Selection from Partially Revealed Sources | Unknown | N/A | |
| FlatMatch: Bridging Labeled Data and Unlabeled Data with Cross-Sharpness for Semi-Supervised Learning | Unknown | N/A | |
| CoDet: Co-occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection | Unknown | N/A | |
| Distributionally Robust Bayesian Optimization with $\varphi$-divergences | Unknown | N/A | |
| Fed-FA: Theoretically Modeling Client Data Divergence for Federated Language Backdoor Defense | Unknown | N/A | |
| Dual Self-Awareness Value Decomposition Framework without Individual Global Max for Cooperative MARL | Unknown | N/A | |
| Initialization-Dependent Sample Complexity of Linear Predictors and Neural Networks | Unknown | N/A | |
| FineMoGen: Fine-Grained Spatio-Temporal Motion Generation and Editing | Unknown | N/A | |
| Timewarp: Transferable Acceleration of Molecular Dynamics by Learning Time-Coarsened Dynamics | Unknown | N/A | |
| Solving Inverse Physics Problems with Score Matching | Unknown | N/A | |
| DDCoT: Duty-Distinct Chain-of-Thought Prompting for Multimodal Reasoning in Language Models | Unknown | N/A | |
| Diffusion-Based Probabilistic Uncertainty Estimation for Active Domain Adaptation | Unknown | N/A | |
| VideoComposer: Compositional Video Synthesis with Motion Controllability | Unknown | N/A | |
| Uncertainty-Aware Alignment Network for Cross-Domain Video-Text Retrieval | Unknown | N/A | |
| K-Nearest-Neighbor Local Sampling Based Conditional Independence Testing | Unknown | N/A | |
| Collaborative Learning via Prediction Consensus | Unknown | N/A | |
| Principled Weight Initialisation for Input-Convex Neural Networks | Unknown | N/A | |
| What Makes Data Suitable for a Locally Connected Neural Network? A Necessary and Sufficient Condition Based on Quantum Entanglement. | Unknown | N/A | |
| Spontaneous symmetry breaking in generative diffusion models | Unknown | N/A | |
| GPT4Tools: Teaching Large Language Model to Use Tools via Self-instruction | Unknown | N/A | |
| Evaluating Robustness and Uncertainty of Graph Models Under Structural Distributional Shifts | Unknown | N/A | |
| One-step differentiation of iterative algorithms | Unknown | N/A | |
| DäRF: Boosting Radiance Fields from Sparse Input Views with Monocular Depth Adaptation | Unknown | N/A | |
| Learning Generalizable Agents via Saliency-guided Features Decorrelation | Unknown | N/A | |
| R-divergence for Estimating Model-oriented Distribution Discrepancy | Unknown | N/A | |
| Kernelized Reinforcement Learning with Order Optimal Regret Bounds | Unknown | N/A | |
| DiffKendall: A Novel Approach for Few-Shot Learning with Differentiable Kendall's Rank Correlation | Unknown | N/A | |
| AIMS: All-Inclusive Multi-Level Segmentation for Anything | Unknown | N/A | |
| Segment Anything in 3D with NeRFs | Unknown | N/A | |
| Optimization or Architecture: How to Hack Kalman Filtering | Unknown | N/A | |
| ASPEN: Breaking Operator Barriers for Efficient Parallelization of Deep Neural Networks | Unknown | N/A | |
| MultiMoDN—Multimodal, Multi-Task, Interpretable Modular Networks | Unknown | N/A | |
| Improving Adversarial Robustness via Information Bottleneck Distillation | Unknown | N/A | |
| Imbalanced Mixed Linear Regression | Unknown | N/A | |
| L-CAD: Language-based Colorization with Any-level Descriptions using Diffusion Priors | Unknown | N/A | |
| BIOT: Biosignal Transformer for Cross-data Learning in the Wild | Unknown | N/A | |
| Formulating Discrete Probability Flow Through Optimal Transport | Unknown | N/A | |
| Universal Prompt Tuning for Graph Neural Networks | Unknown | N/A | |
| Automatic Clipping: Differentially Private Deep Learning Made Easier and Stronger | Unknown | N/A | |
| On Sparse Modern Hopfield Model | Unknown | N/A | |
| Discovering Intrinsic Spatial-Temporal Logic Rules to Explain Human Actions | Unknown | N/A | |
| Transient Neural Radiance Fields for Lidar View Synthesis and 3D Reconstruction | Unknown | N/A | |
| Cause-Effect Inference in Location-Scale Noise Models: Maximum Likelihood vs. Independence Testing | Unknown | N/A | |
| SlotDiffusion: Object-Centric Generative Modeling with Diffusion Models | Unknown | N/A | |
| Pre-RMSNorm and Pre-CRMSNorm Transformers: Equivalent and Efficient Pre-LN Transformers | Unknown | N/A | |
| Closing the gap between the upper bound and lower bound of Adam's iteration complexity | Unknown | N/A | |
| Training Your Image Restoration Network Better with Random Weight Network as Optimization Function | Unknown | N/A | |
| StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models | Unknown | N/A | |
| Conformalized matrix completion | Unknown | N/A | |
| A Heavy-Tailed Algebra for Probabilistic Programming | Unknown | N/A | |
| What Knowledge Gets Distilled in Knowledge Distillation? | Unknown | N/A | |
| Large-Scale Distributed Learning via Private On-Device LSH | Unknown | N/A | |
| Abide by the law and follow the flow: conservation laws for gradient flows | Unknown | N/A | |
| Brain Dissection: fMRI-trained Networks Reveal Spatial Selectivity in the Processing of Natural Images | Unknown | N/A | |
| The Target-Charging Technique for Privacy Analysis across Interactive Computations | Unknown | N/A | |
| SutraNets: Sub-series Autoregressive Networks for Long-Sequence, Probabilistic Forecasting | Unknown | N/A | |
| Efficient Online Clustering with Moving Costs | Unknown | N/A | |
| On the Implicit Bias of Linear Equivariant Steerable Networks | Unknown | N/A | |
| A Unified Algorithm Framework for Unsupervised Discovery of Skills based on Determinantal Point Process | Unknown | N/A | |
| Improving Robustness with Adaptive Weight Decay | Unknown | N/A | |
| Calibrating “Cheap Signals” in Peer Review without a Prior | Unknown | N/A | |
| Adversarial Examples Exist in Two-Layer ReLU Networks for Low Dimensional Linear Subspaces | Unknown | N/A | |
| Towards Label-free Scene Understanding by Vision Foundation Models | Unknown | N/A | |
| ESSEN: Improving Evolution State Estimation for Temporal Networks using Von Neumann Entropy | Unknown | N/A | |
| Learning Robust Statistics for Simulation-based Inference under Model Misspecification | Unknown | N/A | |
| Momentum Provably Improves Error Feedback! | Unknown | N/A | |
| Strategic Apple Tasting | Unknown | N/A | |
| Sharp Bounds for Generalized Causal Sensitivity Analysis | Unknown | N/A | |
| Better Private Linear Regression Through Better Private Feature Selection | Unknown | N/A | |
| Multiplication-Free Transformer Training via Piecewise Affine Operations | Unknown | N/A | |
| Random-Access Infinite Context Length for Transformers | Unknown | N/A | |
| Aligning Gradient and Hessian for Neural Signed Distance Function | Unknown | N/A | |
| Strategyproof Voting under Correlated Beliefs | Unknown | N/A | |
| A Closer Look at the Robustness of Contrastive Language-Image Pre-Training (CLIP) | Unknown | N/A | |
| Distributional Learning of Variational AutoEncoder: Application to Synthetic Data Generation | Unknown | N/A | |
| FeCAM: Exploiting the Heterogeneity of Class Distributions in Exemplar-Free Continual Learning | Unknown | N/A | |
| Evaluating and Inducing Personality in Pre-trained Language Models | Unknown | N/A | |
| Beyond Exponential Graph: Communication-Efficient Topologies for Decentralized Learning via Finite-time Convergence | Unknown | N/A | |
| Data-Dependent Bounds for Online Portfolio Selection Without Lipschitzness and Smoothness | Unknown | N/A | |
| Provable Guarantees for Neural Networks via Gradient Feature Learning | Unknown | N/A | |
| LMC: Large Model Collaboration with Cross-assessment for Training-Free Open-Set Object Recognition | Unknown | N/A | |
| Adaptive Topological Feature via Persistent Homology: Filtration Learning for Point Clouds | Unknown | N/A | |
| Parameter and Computation Efficient Transfer Learning for Vision-Language Pre-trained Models | Unknown | N/A | |
| Q-DM: An Efficient Low-bit Quantized Diffusion Model | Unknown | N/A | |
| Temporal Conditioning Spiking Latent Variable Models of the Neural Response to Natural Visual Scenes | Unknown | N/A | |
| Leave No Stone Unturned: Mine Extra Knowledge for Imbalanced Facial Expression Recognition | Unknown | N/A | |
| Type-to-Track: Retrieve Any Object via Prompt-based Tracking | Unknown | N/A | |
| Jailbroken: How Does LLM Safety Training Fail? | Unknown | N/A | |
| On the Adversarial Robustness of Out-of-distribution Generalization Models | Unknown | N/A | |
| Deductive Verification of Chain-of-Thought Reasoning | Unknown | N/A | |
| A normative theory of social conflict | Unknown | N/A | |
| Selectively Sharing Experiences Improves Multi-Agent Reinforcement Learning | Unknown | N/A | |
| DISCOVER: Making Vision Networks Interpretable via Competition and Dissection | Unknown | N/A | |
| Efficient Activation Function Optimization through Surrogate Modeling | Unknown | N/A | |
| Statistical Guarantees for Variational Autoencoders using PAC-Bayesian Theory | Unknown | N/A | |
| Replicable Clustering | Unknown | N/A | |
| Scale-Space Hypernetworks for Efficient Biomedical Image Analysis | Unknown | N/A | |
| EDGI: Equivariant Diffusion for Planning with Embodied Agents | Unknown | N/A | |
| One-2-3-45: Any Single Image to 3D Mesh in 45 Seconds without Per-Shape Optimization | Unknown | N/A | |
| OpenShape: Scaling Up 3D Shape Representation Towards Open-World Understanding | Unknown | N/A | |
| Smoothed Online Learning for Prediction in Piecewise Affine Systems | Unknown | N/A | |
| Optimization of Inter-group criteria for clustering with minimum size constraints | Unknown | N/A | |
| Kiki or Bouba? Sound Symbolism in Vision-and-Language Models | Unknown | N/A | |
| On the Constrained Time-Series Generation Problem | Unknown | N/A | |
| SparseProp: Efficient Event-Based Simulation and Training of Sparse Recurrent Spiking Neural Networks | Unknown | N/A | |
| Gradient Flossing: Improving Gradient Descent through Dynamic Control of Jacobians | Unknown | N/A | |
| TransHP: Image Classification with Hierarchical Prompting | Unknown | N/A | |
| GlucoSynth: Generating Differentially-Private Synthetic Glucose Traces | Unknown | N/A | |
| Finite-Time Analysis of Whittle Index based Q-Learning for Restless Multi-Armed Bandits with Neural Network Function Approximation | Unknown | N/A | |
| Pointwise uncertainty quantification for sparse variational Gaussian process regression with a Brownian motion prior | Unknown | N/A | |
| Deciphering Spatio-Temporal Graph Forecasting: A Causal Lens and Treatment | Unknown | N/A | |
| Theoretically Guaranteed Bidirectional Data Rectification for Robust Sequential Recommendation | Unknown | N/A | |
| What Truly Matters in Trajectory Prediction for Autonomous Driving? | Unknown | N/A | |
| InsActor: Instruction-driven Physics-based Characters | Unknown | N/A | |
| Differentiable Blocks World: Qualitative 3D Decomposition by Rendering Primitives | Unknown | N/A | |
| Semi-Supervised Contrastive Learning for Deep Regression with Ordinal Rankings from Spectral Seriation | Unknown | N/A | |
| Adaptive Uncertainty Estimation via High-Dimensional Testing on Latent Representations | Unknown | N/A | |
| Parameter-efficient Tuning of Large-scale Multimodal Foundation Model | Unknown | N/A | |
| Dense and Aligned Captions (DAC) Promote Compositional Reasoning in VL Models | Unknown | N/A | |
| PrimDiffusion: Volumetric Primitives Diffusion for 3D Human Generation | Unknown | N/A | |
| Optimal Time Complexities of Parallel Stochastic Optimization Methods Under a Fixed Computation Model | Unknown | N/A | |
| 2Direction: Theoretically Faster Distributed Training with Bidirectional Communication Compression | Unknown | N/A | |
| On student-teacher deviations in distillation: does it pay to disobey? | Unknown | N/A | |
| SPACE: Single-round Participant Amalgamation for Contribution Evaluation in Federated Learning | Unknown | N/A | |
| MVDiffusion: Enabling Holistic Multi-view Image Generation with Correspondence-Aware Diffusion | Unknown | N/A | |
| DaTaSeg: Taming a Universal Multi-Dataset Multi-Task Segmentation Model | Unknown | N/A | |
| Inferring Hybrid Neural Fluid Fields from Videos | Unknown | N/A | |
| On the Trade-off of Intra-/Inter-class Diversity for Supervised Pre-training | Unknown | N/A | |
| Improving Few-Shot Generalization by Exploring and Exploiting Auxiliary Data | Unknown | N/A | |
| Autodecoding Latent 3D Diffusion Models | Unknown | N/A | |
| Emergent Correspondence from Image Diffusion | Unknown | N/A | |
| Convolutional Visual Prompt for Robust Visual Perception | Unknown | N/A | |
| Free-Bloom: Zero-Shot Text-to-Video Generator with LLM Director and LDM Animator | Unknown | N/A | |
| CLIP4HOI: Towards Adapting CLIP for Practical Zero-Shot HOI Detection | Unknown | N/A | |
| Transformers over Directed Acyclic Graphs | Unknown | N/A | |
| Unlocking Feature Visualization for Deep Network with MAgnitude Constrained Optimization | Unknown | N/A | |
| Does Graph Distillation See Like Vision Dataset Counterpart? | Unknown | N/A | |
| Modeling Human Visual Motion Processing with Trainable Motion Energy Sensing and a Self-attention Network | Unknown | N/A | |
| MuSe-GNN: Learning Unified Gene Representation From Multimodal Biological Graph Data | Unknown | N/A | |
| Transferable Adversarial Robustness for Categorical Data via Universal Robust Embeddings | Unknown | N/A | |
| Diversifying Spatial-Temporal Perception for Video Domain Generalization | Unknown | N/A | |
| VCC: Scaling Transformers to 128K Tokens or More by Prioritizing Important Tokens | Unknown | N/A | |
| How many samples are needed to leverage smoothness? | Unknown | N/A | |
| OpenMask3D: Open-Vocabulary 3D Instance Segmentation | Unknown | N/A | |
| Evaluating the Robustness of Interpretability Methods through Explanation Invariance and Equivariance | Unknown | N/A | |
| Reading Relevant Feature from Global Representation Memory for Visual Object Tracking | Unknown | N/A | |
| GMSF: Global Matching Scene Flow | Unknown | N/A | |
| Neural Relation Graph: A Unified Framework for Identifying Label Noise and Outlier Data | Unknown | N/A | |
| Causal Discovery from Subsampled Time Series with Proxy Variables | Unknown | N/A | |
| A Simple Solution for Offline Imitation from Observations and Examples with Possibly Incomplete Trajectories | Unknown | N/A | |
| Demographic Parity Constrained Minimax Optimal Regression under Linear Model | Unknown | N/A | |
| Learning Neural Implicit through Volume Rendering with Attentive Depth Fusion Priors | Unknown | N/A | |
| ClusterFormer: Clustering As A Universal Visual Learner | Unknown | N/A | |
| Data-Centric Learning from Unlabeled Graphs with Diffusion Model | Unknown | N/A | |
| Streaming Factor Trajectory Learning for Temporal Tensor Decomposition | Unknown | N/A | |
| Offline Imitation Learning with Variational Counterfactual Reasoning | Unknown | N/A | |
| BERT Lost Patience Won't Be Robust to Adversarial Slowdown | Unknown | N/A | |
| Label-efficient Segmentation via Affinity Propagation | Unknown | N/A | |
| Unsupervised Video Domain Adaptation for Action Recognition: A Disentanglement Perspective | Unknown | N/A | |
| Segment Any Point Cloud Sequences by Distilling Vision Foundation Models | Unknown | N/A | |
| No-regret Algorithms for Fair Resource Allocation | Unknown | N/A | |
| DiffUTE: Universal Text Editing Diffusion Model | Unknown | N/A | |
| Stable Diffusion is Unstable | Unknown | N/A | |
| Unified 3D Segmenter As Prototypical Classifiers | Unknown | N/A | |
| Learning Motion Refinement for Unsupervised Face Animation | Unknown | N/A | |
| Dynamic Tensor Decomposition via Neural Diffusion-Reaction Processes | Unknown | N/A | |
| Learning Large Graph Property Prediction via Graph Segment Training | Unknown | N/A | |
| Unsupervised Optical Flow Estimation with Dynamic Timing Representation for Spike Camera | Unknown | N/A | |
| Variational Inference with Gaussian Score Matching | Unknown | N/A | |
| Block Broyden's Methods for Solving Nonlinear Equations | Unknown | N/A | |
| Real-World Image Variation by Aligning Diffusion Inversion Chain | Unknown | N/A | |
| Full-Atom Protein Pocket Design via Iterative Refinement | Unknown | N/A | |
| Adversarial Training from Mean Field Perspective | Unknown | N/A | |
| Deep Patch Visual Odometry | Unknown | N/A | |
| You Only Condense Once: Two Rules for Pruning Condensed Datasets | Unknown | N/A | |
| Online Map Vectorization for Autonomous Driving: A Rasterization Perspective | Unknown | N/A | |
| GPEX, A Framework For Interpreting Artificial Neural Networks | Unknown | N/A | |
| Efficient Symbolic Policy Learning with Differentiable Symbolic Expression | Unknown | N/A | |
| Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convolutional CLIP | Unknown | N/A | |
| Why Did This Model Forecast This Future? Information-Theoretic Saliency for Counterfactual Explanations of Probabilistic Regression Models | Unknown | N/A | |
| Epistemic Neural Networks | Unknown | N/A | |
| Self-supervised Object-Centric Learning for Videos | Unknown | N/A | |
| Inference-Time Intervention: Eliciting Truthful Answers from a Language Model | Unknown | N/A | |
| Online Learning under Adversarial Nonlinear Constraints | Unknown | N/A | |
| A Simple Yet Effective Strategy to Robustify the Meta Learning Paradigm | Unknown | N/A | |
| SoundCam: A Dataset for Finding Humans Using Room Acoustics | Unknown | N/A | |
| Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning | Unknown | N/A | |
| SEVA: Leveraging sketches to evaluate alignment between human and machine visual abstraction | Unknown | N/A | |
| A Computation and Communication Efficient Method for Distributed Nonconvex Problems in the Partial Participation Setting | Unknown | N/A | |
| The Harvard USPTO Patent Dataset: A Large-Scale, Well-Structured, and Multi-Purpose Corpus of Patent Applications | Unknown | N/A | |
| Practical Equivariances via Relational Conditional Neural Processes | Unknown | N/A | |
| Classical Simulation of Quantum Circuits: Parallel Environments and Benchmark | Unknown | N/A | |
| BioMassters: A Benchmark Dataset for Forest Biomass Estimation using Multi-modal Satellite Time-series | Unknown | N/A | |
| Paraphrasing evades detectors of AI-generated text, but retrieval is an effective defense | Unknown | N/A | |
| SatBird: a Dataset for Bird Species Distribution Modeling using Remote Sensing and Citizen Science Data | Unknown | N/A | |
| On the Convergence of Encoder-only Shallow Transformers | Unknown | N/A | |
| TopP&R: Robust Support Estimation Approach for Evaluating Fidelity and Diversity in Generative Models | Unknown | N/A | |
| BuildingsBench: A Large-Scale Dataset of 900K Buildings and Benchmark for Short-Term Load Forecasting | Unknown | N/A | |
| Objaverse-XL: A Universe of 10M+ 3D Objects | Unknown | N/A | |
| AND: Adversarial Neural Degradation for Learning Blind Image Super-Resolution | Unknown | N/A | |
| Occ3D: A Large-Scale 3D Occupancy Prediction Benchmark for Autonomous Driving | Unknown | N/A | |
| FIND: A Function Description Benchmark for Evaluating Interpretability Methods | Unknown | N/A | |
| Towards Federated Foundation Models: Scalable Dataset Pipelines for Group-Structured Learning | Unknown | N/A | |
| DiffInfinite: Large Mask-Image Synthesis via Parallel Random Patch Diffusion in Histopathology | Unknown | N/A | |
| Semi-Implicit Denoising Diffusion Models (SIDDMs) | Unknown | N/A | |
| Event Stream GPT: A Data Pre-processing and Modeling Library for Generative, Pre-trained Transformers over Continuous-time Sequences of Complex Events | Unknown | N/A | |
| DecodingTrust: A Comprehensive Assessment of Trustworthiness in GPT Models | Unknown | N/A | |
| The Memory-Perturbation Equation: Understanding Model's Sensitivity to Data | Unknown | N/A | |
| WordScape: a Pipeline to extract multilingual, visually rich Documents with Layout Annotations from Web Crawl Data | Unknown | N/A | |
| The Geometry of Neural Nets' Parameter Spaces Under Reparametrization | Unknown | N/A | |
| ProteinGym: Large-Scale Benchmarks for Protein Fitness Prediction and Design | Unknown | N/A | |
| Aligning Language Models with Human Preferences via a Bayesian Approach | Unknown | N/A | |
| SARAMIS: Simulation Assets for Robotic Assisted and Minimally Invasive Surgery | Unknown | N/A | |
| FLAIR: a Country-Scale Land Cover Semantic Segmentation Dataset From Multi-Source Optical Imagery | Unknown | N/A | |
| Enhancing Sharpness-Aware Optimization Through Variance Suppression | Unknown | N/A | |
| CRoSS: Diffusion Model Makes Controllable, Robust and Secure Image Steganography | Unknown | N/A | |
| FELM: Benchmarking Factuality Evaluation of Large Language Models | Unknown | N/A | |
| TWIGMA: A dataset of AI-Generated Images with Metadata From Twitter | Unknown | N/A | |
| ASL Citizen: A Community-Sourced Dataset for Advancing Isolated Sign Language Recognition | Unknown | N/A | |
| GenS: Generalizable Neural Surface Reconstruction from Multi-View Images | Unknown | N/A | |
| Wyze Rule: Federated Rule Dataset for Rule Recommendation Benchmarking | Unknown | N/A | |
| Data Pruning via Moving-one-Sample-out | Unknown | N/A | |
| Rehearsal Learning for Avoiding Undesired Future | Unknown | N/A | |
| CSMeD: Bridging the Dataset Gap in Automated Citation Screening for Systematic Literature Reviews | Unknown | N/A | |
| Characteristic Circuits | Unknown | N/A | |
| YouTubePD: A Multimodal Benchmark for Parkinson’s Disease Analysis | Unknown | N/A | |
| AiluRus: A Scalable ViT Framework for Dense Prediction | Unknown | N/A | |
| Generator Born from Classifier | Unknown | N/A | |
| BEDD: The MineRL BASALT Evaluation and Demonstrations Dataset for Training and Benchmarking Agents that Solve Fuzzy Tasks | Unknown | N/A | |
| Knowledge-based in silico models and dataset for the comparative evaluation of mammography AI for a range of breast characteristics, lesion conspicuities and doses | Unknown | N/A | |
| $\mathbf{\mathbb{E}^{FWI}}$: Multiparameter Benchmark Datasets for Elastic Full Waveform Inversion of Geophysical Properties | Unknown | N/A | |
| Rewarded soups: towards Pareto-optimal alignment by interpolating weights fine-tuned on diverse rewards | Unknown | N/A | |
| OBELICS: An Open Web-Scale Filtered Dataset of Interleaved Image-Text Documents | Unknown | N/A | |
| BoardgameQA: A Dataset for Natural Language Reasoning with Contradictory Information | Unknown | N/A | |
| MedSat: A Public Health Dataset for England Featuring Medical Prescriptions and Satellite Imagery | Unknown | N/A | |
| RangePerception: Taming LiDAR Range View for Efficient and Accurate 3D Object Detection | Unknown | N/A | |
| Banana: Banach Fixed-Point Network for Pointcloud Segmentation with Inter-Part Equivariance | Unknown | N/A | |
| DisDiff: Unsupervised Disentanglement of Diffusion Probabilistic Models | Unknown | N/A | |
| GenEval: An object-focused framework for evaluating text-to-image alignment | Unknown | N/A | |
| CHAMMI: A benchmark for channel-adaptive models in microscopy imaging | Unknown | N/A | |
| Performance-optimized deep neural networks are evolving into worse models of inferotemporal visual cortex | Unknown | N/A | |
| CAPP-130: A Corpus of Chinese Application Privacy Policy Summarization and Interpretation | Unknown | N/A | |
| Differentiable Registration of Images and LiDAR Point Clouds with VoxelPoint-to-Pixel Matching | Unknown | N/A | |
| SEENN: Towards Temporal Spiking Early Exit Neural Networks | Unknown | N/A | |
| OpenSTL: A Comprehensive Benchmark of Spatio-Temporal Predictive Learning | Unknown | N/A | |
| OpenGSL: A Comprehensive Benchmark for Graph Structure Learning | Unknown | N/A | |
| Consensus and Subjectivity of Skin Tone Annotation for ML Fairness | Unknown | N/A | |
| HyPoradise: An Open Baseline for Generative Speech Recognition with Large Language Models | Unknown | N/A | |
| NAP: Neural 3D Articulated Object Prior | Unknown | N/A | |
| Segment Everything Everywhere All at Once | Unknown | N/A | |
| Module-wise Training of Neural Networks via the Minimizing Movement Scheme | Unknown | N/A | |
| ProteinInvBench: Benchmarking Protein Inverse Folding on Diverse Tasks, Models, and Metrics | Unknown | N/A | |
| MLFMF: Data Sets for Machine Learning for Mathematical Formalization | Unknown | N/A | |
| Efficient Adversarial Contrastive Learning via Robustness-Aware Coreset Selection | Unknown | N/A | |
| Multi-body SE(3) Equivariance for Unsupervised Rigid Segmentation and Motion Estimation | Unknown | N/A | |
| PUG: Photorealistic and Semantically Controllable Synthetic Data for Representation Learning | Unknown | N/A | |
| AllSim: Simulating and Benchmarking Resource Allocation Policies in Multi-User Systems | Unknown | N/A | |
| ProteinShake: Building datasets and benchmarks for deep learning on protein structures | Unknown | N/A | |
| Diplomat: A Dialogue Dataset for Situated PragMATic Reasoning | Unknown | N/A | |
| The expressive power of pooling in Graph Neural Networks | Unknown | N/A | |
| QATCH: Benchmarking SQL-centric tasks with Table Representation Learning Models on Your Data | Unknown | N/A | |
| Framework and Benchmarks for Combinatorial and Mixed-variable Bayesian Optimization | Unknown | N/A | |
| UUKG: Unified Urban Knowledge Graph Dataset for Urban Spatiotemporal Prediction | Unknown | N/A | |
| SUPA: A Lightweight Diagnostic Simulator for Machine Learning in Particle Physics | Unknown | N/A | |
| Beyond Myopia: Learning from Positive and Unlabeled Data through Holistic Predictive Trends | Unknown | N/A | |
| Networks are Slacking Off: Understanding Generalization Problem in Image Deraining | Unknown | N/A | |
| Semantic Image Synthesis with Unconditional Generator | Unknown | N/A | |
| WildfireSpreadTS: A dataset of multi-modal time series for wildfire spread prediction | Unknown | N/A | |
| Massively Multilingual Corpus of Sentiment Datasets and Multi-faceted Sentiment Classification Benchmark | Unknown | N/A | |
| When Visual Prompt Tuning Meets Source-Free Domain Adaptive Semantic Segmentation | Unknown | N/A | |
| HOH: Markerless Multimodal Human-Object-Human Handover Dataset with Large Object Count | Unknown | N/A | |
| Multimodal Clinical Benchmark for Emergency Care (MC-BEC): A Comprehensive Benchmark for Evaluating Foundation Models in Emergency Medicine | Unknown | N/A | |
| Diversified Outlier Exposure for Out-of-Distribution Detection via Informative Extrapolation | Unknown | N/A | |
| Non-Rigid Shape Registration via Deep Functional Maps Prior | Unknown | N/A | |
| Lung250M-4B: A Combined 3D Dataset for CT- and Point Cloud-Based Intra-Patient Lung Registration | Unknown | N/A | |
| Toward Better PAC-Bayes Bounds for Uniformly Stable Algorithms | Unknown | N/A | |
| STARSS23: An Audio-Visual Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events | Unknown | N/A | |
| Mitigating Source Bias for Fairer Weak Supervision | Unknown | N/A | |
| $\mathcal{M}^4$: A Unified XAI Benchmark for Faithfulness Evaluation of Feature Attribution Methods across Metrics, Modalities and Models | Unknown | N/A | |
| Pgx: Hardware-Accelerated Parallel Game Simulators for Reinforcement Learning | Unknown | N/A | |
| Data-Driven Network Neuroscience: On Data Collection and Benchmark | Unknown | N/A | |
| HA-ViD: A Human Assembly Video Dataset for Comprehensive Assembly Knowledge Understanding | Unknown | N/A | |
| On the Convergence of Black-Box Variational Inference | Unknown | N/A | |
| Generating QM1B with PySCF$_{\text{IPU}}$ | Unknown | N/A | |
| Learning to Taste: A Multimodal Wine Dataset | Unknown | N/A | |
| On Private and Robust Bandits | Unknown | N/A | |
| VisAlign: Dataset for Measuring the Alignment between AI and Humans in Visual Perception | Unknown | N/A | |
| WCLD: Curated Large Dataset of Criminal Cases from Wisconsin Circuit Courts | Unknown | N/A | |
| Learning to Augment Distributions for Out-of-distribution Detection | Unknown | N/A | |
| Physion++: Evaluating Physical Scene Understanding that Requires Online Inference of Different Physical Properties | Unknown | N/A | |
| StressID: a Multimodal Dataset for Stress Identification | Unknown | N/A | |
| Quilt-1M: One Million Image-Text Pairs for Histopathology | Unknown | N/A | |
| Weakly-Supervised Concealed Object Segmentation with SAM-based Pseudo Labeling and Multi-scale Feature Grouping | Unknown | N/A | |
| EMBERSim: A Large-Scale Databank for Boosting Similarity Search in Malware Analysis | Unknown | N/A | |
| CoDA: Collaborative Novel Box Discovery and Cross-modal Alignment for Open-vocabulary 3D Object Detection | Unknown | N/A | |
| The Drunkard’s Odometry: Estimating Camera Motion in Deforming Scenes | Unknown | N/A | |
| DynPoint: Dynamic Neural Point For View Synthesis | Unknown | N/A | |
| Into the Single Cell Multiverse: an End-to-End Dataset for Procedural Knowledge Extraction in Biomedical Texts | Unknown | N/A | |
| PlanBench: An Extensible Benchmark for Evaluating Large Language Models on Planning and Reasoning about Change | Unknown | N/A | |
| MVDoppler: Unleashing the Power of Multi-View Doppler for MicroMotion-based Gait Classification | Unknown | N/A | |
| SynMob: Creating High-Fidelity Synthetic GPS Trajectory Dataset for Urban Mobility Analysis | Unknown | N/A | |
| Enhancing Motion Deblurring in High-Speed Scenes with Spike Streams | Unknown | N/A | |
| M$^{2}$SODAI: Multi-Modal Maritime Object Detection Dataset With RGB and Hyperspectral Image Sensors | Unknown | N/A | |
| On the Ability of Graph Neural Networks to Model Interactions Between Vertices | Unknown | N/A | |
| CODA: Generalizing to Open and Unseen Domains with Compaction and Disambiguation | Unknown | N/A | |
| InterCode: Standardizing and Benchmarking Interactive Coding with Execution Feedback | Unknown | N/A | |
| CrossCodeEval: A Diverse and Multilingual Benchmark for Cross-File Code Completion | Unknown | N/A | |
| Train Once, Get a Family: State-Adaptive Balances for Offline-to-Online Reinforcement Learning | Unknown | N/A | |
| ClimateLearn: Benchmarking Machine Learning for Weather and Climate Modeling | Unknown | N/A | |
| SpokenWOZ: A Large-Scale Speech-Text Benchmark for Spoken Task-Oriented Dialogue Agents | Unknown | N/A | |
| Scaling Laws for Hyperparameter Optimization | Unknown | N/A | |
| A Smooth Binary Mechanism for Efficient Private Continual Observation | Unknown | N/A | |
| Minigrid & Miniworld: Modular & Customizable Reinforcement Learning Environments for Goal-Oriented Tasks | Unknown | N/A | |
| DISCS: A Benchmark for Discrete Sampling | Unknown | N/A | |
| MotionGPT: Human Motion as a Foreign Language | Unknown | N/A | |
| SugarCrepe: Fixing Hackable Benchmarks for Vision-Language Compositionality | Unknown | N/A | |
| A Unified Generalization Analysis of Re-Weighting and Logit-Adjustment for Imbalanced Learning | Unknown | N/A | |
| OpenProteinSet: Training data for structural biology at scale | Unknown | N/A | |
| Intelligent Knee Sleeves: A Real-time Multimodal Dataset for 3D Lower Body Motion Estimation Using Smart Textile | Unknown | N/A | |
| DataComp: In search of the next generation of multimodal datasets | Unknown | N/A | |
| How hard are computer vision datasets? Calibrating dataset difficulty to viewing time | Unknown | N/A | |
| M3Exam: A Multilingual, Multimodal, Multilevel Benchmark for Examining Large Language Models | Unknown | N/A | |
| RaLEs: a Benchmark for Radiology Language Evaluations | Unknown | N/A | |
| Unsupervised Graph Neural Architecture Search with Disentangled Self-Supervision | Unknown | N/A | |
| M5HisDoc: A Large-scale Multi-style Chinese Historical Document Analysis Benchmark | Unknown | N/A | |
| A Performance-Driven Benchmark for Feature Selection in Tabular Deep Learning | Unknown | N/A | |
| LOVM: Language-Only Vision Model Selection | Unknown | N/A | |
| TpuGraphs: A Performance Prediction Dataset on Large Tensor Computational Graphs | Unknown | N/A | |
| Holistic Evaluation of Text-to-Image Models | Unknown | N/A | |
| ToolQA: A Dataset for LLM Question Answering with External Tools | Unknown | N/A | |
| ChessGPT: Bridging Policy Learning and Language Modeling | Unknown | N/A | |
| Live Graph Lab: Towards Open, Dynamic and Real Transaction Graphs with NFT | Unknown | N/A | |
| FETV: A Benchmark for Fine-Grained Evaluation of Open-Domain Text-to-Video Generation | Unknown | N/A | |
| Exploring Loss Functions for Time-based Training Strategy in Spiking Neural Networks | Unknown | N/A | |
| Battle of the Backbones: A Large-Scale Comparison of Pretrained Models across Computer Vision Tasks | Unknown | N/A | |
| Generalizing Nonlinear ICA Beyond Structural Sparsity | Unknown | N/A | |
| Lexinvariant Language Models | Unknown | N/A | |
| INSPECT: A Multimodal Dataset for Patient Outcome Prediction of Pulmonary Embolisms | Unknown | N/A | |
| On the Need for a Language Describing Distribution Shifts: Illustrations on Tabular Datasets | Unknown | N/A | |
| AVIDa-hIL6: A Large-Scale VHH Dataset Produced from an Immunized Alpaca for Predicting Antigen-Antibody Interactions | Unknown | N/A | |
| Improving multimodal datasets with image captioning | Unknown | N/A | |
| GeoDE: a Geographically Diverse Evaluation Dataset for Object Recognition | Unknown | N/A | |
| PyNeRF: Pyramidal Neural Radiance Fields | Unknown | N/A | |
| What a MESS: Multi-Domain Evaluation of Zero-Shot Semantic Segmentation | Unknown | N/A | |
| SPRING: Studying Papers and Reasoning to play Games | Unknown | N/A | |
| Read and Reap the Rewards: Learning to Play Atari with the Help of Instruction Manuals | Unknown | N/A | |
| Boundary Guided Learning-Free Semantic Control with Diffusion Models | Unknown | N/A | |
| The ToMCAT Dataset | Unknown | N/A | |
| Parsel🐍: Algorithmic Reasoning with Language Models by Composing Decompositions | Unknown | N/A | |
| SSL4EO-L: Datasets and Foundation Models for Landsat Imagery | Unknown | N/A | |
| VisIT-Bench: A Dynamic Benchmark for Evaluating Instruction-Following Vision-and-Language Models | Unknown | N/A | |
| StyleGAN knows Normal, Depth, Albedo, and More | Unknown | N/A | |
| Exploring Why Object Recognition Performance Degrades Across Income Levels and Geographies with Factor Annotations | Unknown | N/A | |
| Variational Gaussian Processes with Decoupled Conditionals | Unknown | N/A | |
| A Massive Scale Semantic Similarity Dataset of Historical English | Unknown | N/A | |
| Similarity, Compression and Local Steps: Three Pillars of Efficient Communications for Distributed Variational Inequalities | Unknown | N/A | |
| Are These the Same Apple? Comparing Images Based on Object Intrinsics | Unknown | N/A | |
| Generalizable One-shot 3D Neural Head Avatar | Unknown | N/A | |
| American Stories: A Large-Scale Structured Text Dataset of Historical U.S. Newspapers | Unknown | N/A | |
| SustainGym: Reinforcement Learning Environments for Sustainable Energy Systems | Unknown | N/A | |
| Global Update Tracking: A Decentralized Learning Algorithm for Heterogeneous Data | Unknown | N/A | |
| SubseasonalClimateUSA: A Dataset for Subseasonal Forecasting and Benchmarking | Unknown | N/A | |
| Selectivity Drives Productivity: Efficient Dataset Pruning for Enhanced Transfer Learning | Unknown | N/A | |
| Information Design in Multi-Agent Reinforcement Learning | Unknown | N/A | |
| D4: Improving LLM Pretraining via Document De-Duplication and Diversification | Unknown | N/A | |
| A Step Towards Worldwide Biodiversity Assessment: The BIOSCAN-1M Insect Dataset | Unknown | N/A | |
| RoboHive: A Unified Framework for Robot Learning | Unknown | N/A | |
| Mind2Web: Towards a Generalist Agent for the Web | Unknown | N/A | |
| How Far Can Camels Go? Exploring the State of Instruction Tuning on Open Resources | Unknown | N/A | |
| GEO-Bench: Toward Foundation Models for Earth Monitoring | Unknown | N/A | |
| COOM: A Game Benchmark for Continual Reinforcement Learning | Unknown | N/A | |
| HeadSculpt: Crafting 3D Head Avatars with Text | Unknown | N/A | |
| trajdata: A Unified Interface to Multiple Human Trajectory Datasets | Unknown | N/A | |
| DynaDojo: An Extensible Platform for Benchmarking Scaling in Dynamical System Identification | Unknown | N/A | |
| EgoTracks: A Long-term Egocentric Visual Object Tracking Dataset | Unknown | N/A | |
| AirDelhi: Fine-Grained Spatio-Temporal Particulate Matter Dataset From Delhi For ML based Modeling | Unknown | N/A | |
| LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One Day | Unknown | N/A | |
| Benchmarking Robustness to Adversarial Image Obfuscations | Unknown | N/A | |
| DeepfakeBench: A Comprehensive Benchmark of Deepfake Detection | Unknown | N/A | |
| Out-of-distribution Detection Learning with Unreliable Out-of-distribution Sources | Unknown | N/A | |
| Realistic Synthetic Financial Transactions for Anti-Money Laundering Models | Unknown | N/A | |
| Object Reprojection Error (ORE): Camera pose benchmarks from lightweight tracking annotations | Unknown | N/A | |
| Learning Domain-Aware Detection Head with Prompt Tuning | Unknown | N/A | |
| CARE-MI: Chinese Benchmark for Misinformation Evaluation in Maternity and Infant Care | Unknown | N/A | |
| Multi-modal Queried Object Detection in the Wild | Unknown | N/A | |
| OceanBench: The Sea Surface Height Edition | Unknown | N/A | |
| MARBLE: Music Audio Representation Benchmark for Universal Evaluation | Unknown | N/A | |
| GADBench: Revisiting and Benchmarking Supervised Graph Anomaly Detection | Unknown | N/A | |
| Social Motion Prediction with Cognitive Hierarchies | Unknown | N/A | |
| A Dataset for Analyzing Streaming Media Performance over HTTP/3 Browsers | Unknown | N/A | |
| Data Portraits: Recording Foundation Model Training Data | Unknown | N/A | |
| HT-Step: Aligning Instructional Articles with How-To Videos | Unknown | N/A | |
| Renku: a platform for sustainable data science | Unknown | N/A | |
| Med-UniC: Unifying Cross-Lingual Medical Vision-Language Pre-Training by Diminishing Bias | Unknown | N/A | |
| Bitstream-Corrupted Video Recovery: A Novel Benchmark Dataset and Method | Unknown | N/A | |
| Pairwise GUI Dataset Construction Between Android Phones and Tablets | Unknown | N/A | |
| Safety Gymnasium: A Unified Safe Reinforcement Learning Benchmark | Unknown | N/A | |
| A Toolkit for Reliable Benchmarking and Research in Multi-Objective Reinforcement Learning | Unknown | N/A | |
| SMACv2: An Improved Benchmark for Cooperative Multi-Agent Reinforcement Learning | Unknown | N/A | |
| Detecting Any Human-Object Interaction Relationship: Universal HOI Detector with Spatial Prompt Learning on Foundation Models | Unknown | N/A | |
| SituatedGen: Incorporating Geographical and Temporal Contexts into Generative Commonsense Reasoning | Unknown | N/A | |
| GSLB: The Graph Structure Learning Benchmark | Unknown | N/A | |
| RD-Suite: A Benchmark for Ranking Distillation | Unknown | N/A | |
| NetHack is Hard to Hack | Unknown | N/A | |
| DISCO-10M: A Large-Scale Music Dataset | Unknown | N/A | |
| The Rise of AI Language Pathologists: Exploring Two-level Prompt Learning for Few-shot Weakly-supervised Whole Slide Image Classification | Unknown | N/A | |
| Evaluating Open-QA Evaluation | Unknown | N/A | |
| Symmetry-Informed Geometric Representation for Molecules, Proteins, and Crystalline Materials | Unknown | N/A | |
| Diff-Foley: Synchronized Video-to-Audio Synthesis with Latent Diffusion Models | Unknown | N/A | |
| NU-MCC: Multiview Compressive Coding with Neighborhood Decoder and Repulsive UDF | Unknown | N/A | |
| RADAR: Robust AI-Text Detection via Adversarial Learning | Unknown | N/A | |
| Optimal Extragradient-Based Algorithms for Stochastic Variational Inequalities with Separable Structure | Unknown | N/A | |
| Exact Optimality of Communication-Privacy-Utility Tradeoffs in Distributed Mean Estimation | Unknown | N/A | |
| Reducing Blackwell and Average Optimality to Discounted MDPs via the Blackwell Discount Factor | Unknown | N/A | |
| Training Neural Networks is NP-Hard in Fixed Dimension | Unknown | N/A | |
| Fast Exact Leverage Score Sampling from Khatri-Rao Products with Applications to Tensor Decomposition | Unknown | N/A | |
| PCF-GAN: generating sequential data via the characteristic function of measures on the path space | Unknown | N/A | |
| D$^2$CSG: Unsupervised Learning of Compact CSG Trees with Dual Complements and Dropouts | Unknown | N/A | |
| Prefix-Tree Decoding for Predicting Mass Spectra from Molecules | Unknown | N/A | |
| Finding Safe Zones of Markov Decision Processes Policies | Unknown | N/A | |
| Large Language Models Are Latent Variable Models: Explaining and Finding Good Demonstrations for In-Context Learning | Unknown | N/A | |
| SNEkhorn: Dimension Reduction with Symmetric Entropic Affinities | Unknown | N/A | |
| Kernel Stein Discrepancy thinning: a theoretical perspective of pathologies and a practical fix with regularization | Unknown | N/A | |
| VPGTrans: Transfer Visual Prompt Generator across LLMs | Unknown | N/A | |
| Sharp Calibrated Gaussian Processes | Unknown | N/A | |
| AdaptSSR: Pre-training User Model with Augmentation-Adaptive Self-Supervised Ranking | Unknown | N/A | |
| Unlocking Deterministic Robustness Certification on ImageNet | Unknown | N/A | |
| Counterfactual-Augmented Importance Sampling for Semi-Offline Policy Evaluation | Unknown | N/A | |
| 4D Panoptic Scene Graph Generation | Unknown | N/A | |
| Katakomba: Tools and Benchmarks for Data-Driven NetHack | Unknown | N/A | |
| Gigastep - One Billion Steps per Second Multi-agent Reinforcement Learning | Unknown | N/A | |
| No Representation Rules Them All in Category Discovery | Unknown | N/A | |
| PAD: A Dataset and Benchmark for Pose-agnostic Anomaly Detection | Unknown | N/A | |
| Best Arm Identification with Fixed Budget: A Large Deviation Perspective | Unknown | N/A | |
| Bucks for Buckets (B4B): Active Defenses Against Stealing Encoders | Unknown | N/A | |
| VoxDet: Voxel Learning for Novel Instance Detection | Unknown | N/A | |
| L2T-DLN: Learning to Teach with Dynamic Loss Network | Unknown | N/A | |
| A Multi-modal Global Instance Tracking Benchmark (MGIT): Better Locating Target in Complex Spatio-temporal and Causal Relationship | Unknown | N/A | |
| NIS3D: A Completely Annotated Benchmark for Dense 3D Nuclei Image Segmentation | Unknown | N/A | |
| Evaluating and Improving Tool-Augmented Computation-Intensive Math Reasoning | Unknown | N/A | |
| Dynamics of Finite Width Kernel and Prediction Fluctuations in Mean Field Neural Networks | Unknown | N/A | |
| MultiVENT: Multilingual Videos of Events and Aligned Natural Text | Unknown | N/A | |
| STXD: Structural and Temporal Cross-Modal Distillation for Multi-View 3D Object Detection | Unknown | N/A | |
| A Dataset of Relighted 3D Interacting Hands | Unknown | N/A | |
| IMP-MARL: a Suite of Environments for Large-scale Infrastructure Management Planning via MARL | Unknown | N/A | |
| Reimagining Synthetic Tabular Data Generation through Data-Centric AI: A Comprehensive Benchmark | Unknown | N/A | |
| Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena | Unknown | N/A | |
| Language-based Action Concept Spaces Improve Video Self-Supervised Learning | Unknown | N/A | |
| RenderMe-360: A Large Digital Asset Library and Benchmarks Towards High-fidelity Head Avatars | Unknown | N/A | |
| Revisiting the Evaluation of Image Synthesis with GANs | Unknown | N/A | |
| Tanh Works Better with Asymmetry | Unknown | N/A | |
| Active Reasoning in an Open-World Environment | Unknown | N/A | |
| NeuroEvoBench: Benchmarking Evolutionary Optimizers for Deep Learning Applications | Unknown | N/A | |
| EHRSHOT: An EHR Benchmark for Few-Shot Evaluation of Foundation Models | Unknown | N/A | |
| Enhancing Adversarial Contrastive Learning via Adversarial Invariant Regularization | Unknown | N/A | |
| Seeing is not always believing: Benchmarking Human and Model Perception of AI-Generated Images | Unknown | N/A | |
| UDC-SIT: A Real-World Dataset for Under-Display Cameras | Unknown | N/A | |
| MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing | Unknown | N/A | |
| OV-PARTS: Towards Open-Vocabulary Part Segmentation | Unknown | N/A | |
| Benchmark of Machine Learning Force Fields for Semiconductor Simulations: Datasets, Metrics, and Comparative Analysis | Unknown | N/A | |
| Mr. HiSum: A Large-scale Dataset for Video Highlight Detection and Summarization | Unknown | N/A | |
| Stanford-ORB: A Real-World 3D Object Inverse Rendering Benchmark | Unknown | N/A | |
| REASONER: An Explainable Recommendation Dataset with Comprehensive Labeling Ground Truths | Unknown | N/A | |
| Learning Human Action Recognition Representations Without Real Humans | Unknown | N/A | |
| Understanding the Latent Space of Diffusion Models through the Lens of Riemannian Geometry | Unknown | N/A | |
| C-Eval: A Multi-Level Multi-Discipline Chinese Evaluation Suite for Foundation Models | Unknown | N/A | |
| PGDiff: Guiding Diffusion Models for Versatile Face Restoration via Partial Guidance | Unknown | N/A | |
| Cola: A Benchmark for Compositional Text-to-image Retrieval | Unknown | N/A | |
| Divide, Evaluate, and Refine: Evaluating and Improving Text-to-Image Alignment with Iterative VQA Feedback | Unknown | N/A | |
| AQuA: A Benchmarking Tool for Label Quality Assessment | Unknown | N/A | |
| Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias | Unknown | N/A | |
| M$^2$Hub: Unlocking the Potential of Machine Learning for Materials Discovery | Unknown | N/A | |
| DiT-3D: Exploring Plain Diffusion Transformers for 3D Shape Generation | Unknown | N/A | |
| A Novel Framework for Policy Mirror Descent with General Parameterization and Linear Convergence | Unknown | N/A | |
| Improving CLIP Training with Language Rewrites | Unknown | N/A | |
| StableRep: Synthetic Images from Text-to-Image Models Make Strong Visual Representation Learners | Unknown | N/A | |
| Factorized Contrastive Learning: Going Beyond Multi-view Redundancy | Unknown | N/A | |
| OpenDataVal: a Unified Benchmark for Data Valuation | Unknown | N/A | |
| Quantifying & Modeling Multimodal Interactions: An Information Decomposition Framework | Unknown | N/A | |
| Uncovering Neural Scaling Laws in Molecular Representation Learning | Unknown | N/A | |
| Learning Curves for Deep Structured Gaussian Feature Models | Unknown | N/A | |
| Building the Bridge of Schrödinger: A Continuous Entropic Optimal Transport Benchmark | Unknown | N/A | |
| The Cambridge Law Corpus: A Corpus for Legal AI Research | Unknown | N/A | |
| Beyond Confidence: Reliable Models Should Also Consider Atypicality | Unknown | N/A | |
| OFCOURSE: A Multi-Agent Reinforcement Learning Environment for Order Fulfillment | Unknown | N/A | |
| Does progress on ImageNet transfer to real-world datasets? | Unknown | N/A | |
| Ethical Considerations for Responsible Data Curation | Unknown | N/A | |
| Active Vision Reinforcement Learning under Limited Visual Observability | Unknown | N/A | |
| PTADisc: A Cross-Course Dataset Supporting Personalized Learning in Cold-Start Scenarios | Unknown | N/A | |
| Leveraging Vision-Centric Multi-Modal Expertise for 3D Object Detection | Unknown | N/A | |
| Amazon-M2: A Multilingual Multi-locale Shopping Session Dataset for Recommendation and Text Generation | Unknown | N/A | |
| Sequential Preference Ranking for Efficient Reinforcement Learning from Human Feedback | Unknown | N/A | |
| LEPARD: Learning Explicit Part Discovery for 3D Articulated Shape Reconstruction | Unknown | N/A | |
| EPIC Fields: Marrying 3D Geometry and Video Understanding | Unknown | N/A | |
| FORB: A Flat Object Retrieval Benchmark for Universal Image Embedding | Unknown | N/A | |
| Compact Neural Volumetric Video Representations with Dynamic Codebooks | Unknown | N/A | |
| Training Energy-Based Normalizing Flow with Score-Matching Objectives | Unknown | N/A | |
| Metropolis Sampling for Constrained Diffusion Models | Unknown | N/A | |
| Hierarchical Adaptive Value Estimation for Multi-modal Visual Reinforcement Learning | Unknown | N/A | |
| OpenAssistant Conversations - Democratizing Large Language Model Alignment | Unknown | N/A | |
| Learning To Dive In Branch And Bound | Unknown | N/A | |
| RealTime QA: What's the Answer Right Now? | Unknown | N/A | |
| Lookup Table meets Local Laplacian Filter: Pyramid Reconstruction Network for Tone Mapping | Unknown | N/A | |
| Building Socio-culturally Inclusive Stereotype Resources with Community Engagement | Unknown | N/A | |
| Stochastic Distributed Optimization under Average Second-order Similarity: Algorithms and Analysis | Unknown | N/A | |
| Mathematical Capabilities of ChatGPT | Unknown | N/A | |
| DeWave: Discrete Encoding of EEG Waves for EEG to Text Translation | Unknown | N/A | |
| How to Data in Datathons | Unknown | N/A | |
| EV-Eye: Rethinking High-frequency Eye Tracking through the Lenses of Event Cameras | Unknown | N/A | |
| Towards Anytime Classification in Early-Exit Architectures by Enforcing Conditional Monotonicity | Unknown | N/A | |
| Alexa Arena: A User-Centric Interactive Platform for Embodied AI | Unknown | N/A | |
| SG×P: A Sorghum Genotype × Phenotype Prediction Dataset and Benchmark | Unknown | N/A | |
| The Waymo Open Sim Agents Challenge | Unknown | N/A | |
| URL: A Representation Learning Benchmark for Transferable Uncertainty Estimates | Unknown | N/A | |
| Towards Higher Ranks via Adversarial Weight Pruning | Unknown | N/A | |
| A High-Resolution Dataset for Instance Detection with Multi-View Object Capture | Unknown | N/A | |
| Optimal Transport-Guided Conditional Score-Based Diffusion Model | Unknown | N/A | |
| Towards a Comprehensive Benchmark for High-Level Synthesis Targeted to FPGAs | Unknown | N/A | |
| Natural Language Instruction-following with Task-related Language Development and Translation | Unknown | N/A | |
| Revisiting Evaluation Metrics for Semantic Segmentation: Optimization and Evaluation of Fine-grained Intersection over Union | Unknown | N/A | |
| An NLP Benchmark Dataset for Assessing Corporate Climate Policy Engagement | Unknown | N/A | |
| DAC-DETR: Divide the Attention Layers and Conquer | Unknown | N/A | |
| PIXIU: A Comprehensive Benchmark, Instruction Dataset and Large Language Model for Finance | Unknown | N/A | |
| rPPG-Toolbox: Deep Remote PPG Toolbox | Unknown | N/A | |
| IDRNet: Intervention-Driven Relation Network for Semantic Segmentation | Unknown | N/A | |
| Decoding the Enigma: Benchmarking Humans and AIs on the Many Facets of Working Memory | Unknown | N/A | |
| DVSOD: RGB-D Video Salient Object Detection | Unknown | N/A | |
| Aligning Optimization Trajectories with Diffusion Models for Constrained Design Generation | Unknown | N/A | |
| BenchCLAMP: A Benchmark for Evaluating Language Models on Syntactic and Semantic Parsing | Unknown | N/A | |
| Interactive Visual Reasoning under Uncertainty | Unknown | N/A | |
| LIBERO: Benchmarking Knowledge Transfer for Lifelong Robot Learning | Unknown | N/A | |
| Elastic Decision Transformer | Unknown | N/A | |
| NurViD: A Large Expert-Level Video Database for Nursing Procedure Activity Understanding | Unknown | N/A | |
| Benchmarking and Analyzing 3D-aware Image Synthesis with a Modularized Codebase | Unknown | N/A | |
| Temporal Robustness against Data poisoning | Unknown | N/A | |
| Focus Your Attention when Few-Shot Classification | Unknown | N/A | |
| Enhancing User Intent Capture in Session-Based Recommendation with Attribute Patterns | Unknown | N/A | |
| Real3D-AD: A Dataset of Point Cloud Anomaly Detection | Unknown | N/A | |
| Puzzlefusion: Unleashing the Power of Diffusion Models for Spatial Puzzle Solving | Unknown | N/A | |
| Temporal Graph Benchmark for Machine Learning on Temporal Graphs | Unknown | N/A | |
| Hokoff: Real Game Dataset from Honor of Kings and its Offline Reinforcement Learning Benchmarks | Unknown | N/A | |
| An Information Theory Perspective on Variance-Invariance-Covariance Regularization | Unknown | N/A | |
| Learning Time-Invariant Representations for Individual Neurons from Population Dynamics | Unknown | N/A | |
| RGMIL: Guide Your Multiple-Instance Learning Model with Regressor | Unknown | N/A | |
| OpenLane-V2: A Topology Reasoning Benchmark for Unified 3D HD Mapping | Unknown | N/A | |
| A Comprehensive Benchmark for Neural Human Radiance Fields | Unknown | N/A | |
| LAMM: Language-Assisted Multi-Modal Instruction-Tuning Dataset, Framework, and Benchmark | Unknown | N/A | |
| Should We Learn Most Likely Functions or Parameters? | Unknown | N/A | |
| RL-ViGen: A Reinforcement Learning Benchmark for Visual Generalization | Unknown | N/A | |
| Protein Design with Guided Discrete Diffusion | Unknown | N/A | |
| OpenIllumination: A Multi-Illumination Dataset for Inverse Rendering Evaluation on Real Objects | Unknown | N/A | |
| EgoSchema: A Diagnostic Benchmark for Very Long-form Video Language Understanding | Unknown | N/A | |
| XES3G5M: A Knowledge Tracing Benchmark Dataset with Auxiliary Information | Unknown | N/A | |
| MetaBox: A Benchmark Platform for Meta-Black-Box Optimization with Reinforcement Learning | Unknown | N/A | |
| AttrSeg: Open-Vocabulary Semantic Segmentation via Attribute Decomposition-Aggregation | Unknown | N/A | |
| Bullying10K: A Large-Scale Neuromorphic Dataset towards Privacy-Preserving Bullying Recognition | Unknown | N/A | |
| Benchmarking Robustness of Adaptation Methods on Pre-trained Vision-Language Models | Unknown | N/A | |
| ImageNet-Hard: The Hardest Images Remaining from a Study of the Power of Zoom and Spatial Biases in Image Classification | Unknown | N/A | |
| Humans in Kitchens: A Dataset for Multi-Person Human Motion Forecasting with Scene Context | Unknown | N/A | |
| Low-shot Object Learning with Mutual Exclusivity Bias | Unknown | N/A | |
| ForecastPFN: Synthetically-Trained Zero-Shot Forecasting | Unknown | N/A | |
| Rethinking Bias Mitigation: Fairer Architectures Make for Fairer Face Recognition | Unknown | N/A | |
| Neural MMO 2.0: A Massively Multi-task Addition to Massively Multi-agent Learning | Unknown | N/A | |
| How2comm: Communication-Efficient and Collaboration-Pragmatic Multi-Agent Perception | Unknown | N/A | |
| Dynamic Prompt Learning: Addressing Cross-Attention Leakage for Text-Based Image Editing | Unknown | N/A | |
| SMPLer-X: Scaling Up Expressive Human Pose and Shape Estimation | Unknown | N/A | |
| Waymax: An Accelerated, Data-Driven Simulator for Large-Scale Autonomous Driving Research | Unknown | N/A | |
| Parallel-mentoring for Offline Model-based Optimization | Unknown | N/A | |
| RoboDepth: Robust Out-of-Distribution Depth Estimation under Corruptions | Unknown | N/A | |
| Predicting a Protein's Stability under a Million Mutations | Unknown | N/A | |
| LayoutGPT: Compositional Visual Planning and Generation with Large Language Models | Unknown | N/A | |
| GenImage: A Million-Scale Benchmark for Detecting AI-Generated Image | Unknown | N/A | |
| ARTIC3D: Learning Robust Articulated 3D Shapes from Noisy Web Image Collections | Unknown | N/A | |
| ChimpACT: A Longitudinal Dataset for Understanding Chimpanzee Behaviors | Unknown | N/A | |
| Faster Differentially Private Convex Optimization via Second-Order Methods | Unknown | N/A | |
| Counting Distinct Elements Under Person-Level Differential Privacy | Unknown | N/A | |
| Privacy Auditing with One (1) Training Run | Unknown | N/A | |
| LeanDojo: Theorem Proving with Retrieval-Augmented Language Models | Unknown | N/A | |
| Faster Discrete Convex Function Minimization with Predictions: The M-Convex Case | Unknown | N/A | |
| JourneyDB: A Benchmark for Generative Image Understanding | Unknown | N/A | |
| Accountability in Offline Reinforcement Learning: Explaining Decisions with a Corpus of Examples | Unknown | N/A | |
| Multimodal C4: An Open, Billion-scale Corpus of Images Interleaved with Text | Unknown | N/A | |
| VidChapters-7M: Video Chapters at Scale | Unknown | N/A | |
| When Do Neural Nets Outperform Boosted Trees on Tabular Data? | Unknown | N/A | |
| NAVI: Category-Agnostic Image Collections with High-Quality 3D Shape and Pose Annotations | Unknown | N/A | |
| WBCAtt: A White Blood Cell Dataset Annotated with Detailed Morphological Attributes | Unknown | N/A | |
| TFLEX: Temporal Feature-Logic Embedding Framework for Complex Reasoning over Temporal Knowledge Graph | Unknown | N/A | |
| Don’t Stop Pretraining? Make Prompt-based Fine-tuning Powerful Learner | Unknown | N/A | |
| Degraded Polygons Raise Fundamental Questions of Neural Network Perception | Unknown | N/A | |
| Benchmarking Large Language Models on CMExam - A comprehensive Chinese Medical Exam Dataset | Unknown | N/A | |
| Benchmarking Foundation Models with Language-Model-as-an-Examiner | Unknown | N/A | |
| QuantSR: Accurate Low-bit Quantization for Efficient Image Super-Resolution | Unknown | N/A | |
| Graph Neural Networks for Road Safety Modeling: Datasets and Evaluations for Accident Analysis | Unknown | N/A | |
| BiMatting: Efficient Video Matting via Binarization | Unknown | N/A | |
| Segment Anything in High Quality | Unknown | N/A | |
| CityRefer: Geography-aware 3D Visual Grounding Dataset on City-scale Point Cloud Data | Unknown | N/A | |
| Benchmarking Encoder-Decoder Architectures for Biplanar X-ray to 3D Bone Shape Reconstruction | Unknown | N/A | |
| LegalBench: A Collaboratively Built Benchmark for Measuring Legal Reasoning in Large Language Models | Unknown | N/A | |
| Masked Two-channel Decoupling Framework for Incomplete Multi-view Weak Multi-label Learning | Unknown | N/A | |
| Validated Image Caption Rating Dataset | Unknown | N/A | |
| Video Timeline Modeling For News Story Understanding | Unknown | N/A | |
| Learning Trajectories are Generalization Indicators | Unknown | N/A | |
| Echoes Beyond Points: Unleashing the Power of Raw Radar Data in Multi-modality Fusion | Unknown | N/A | |
| 3D-Aware Visual Question Answering about Parts, Poses and Occlusions | Unknown | N/A | |
| Reinforcement Learning with Fast and Forgetful Memory | Unknown | N/A | |
| Benchmarking Distribution Shift in Tabular Data with TableShift | Unknown | N/A | |
| Bridging the Domain Gap: Self-Supervised 3D Scene Understanding with Foundation Models | Unknown | N/A | |
| On Occlusions in Video Action Detection: Benchmark Datasets And Training Recipes | Unknown | N/A | |
| Hierarchical Open-vocabulary Universal Image Segmentation | Unknown | N/A | |
| AircraftVerse: A Large-Scale Multimodal Dataset of Aerial Vehicle Designs | Unknown | N/A | |
| When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment | Unknown | N/A | |
| DAW: Exploring the Better Weighting Function for Semi-supervised Semantic Segmentation | Unknown | N/A | |
| LoRA: A Logical Reasoning Augmented Dataset for Visual Question Answering | Unknown | N/A | |
| ADGym: Design Choices for Deep Anomaly Detection | Unknown | N/A | |
| Digital Typhoon: Long-term Satellite Image Dataset for the Spatio-Temporal Modeling of Tropical Cyclones | Unknown | N/A | |
| LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios | Unknown | N/A | |
| The RefinedWeb Dataset for Falcon LLM: Outperforming Curated Corpora with Web Data Only | Unknown | N/A | |
| Binarized Spectral Compressive Imaging | Unknown | N/A | |
| AVeriTeC: A Dataset for Real-world Claim Verification with Evidence from the Web | Unknown | N/A | |
| SiT Dataset: Socially Interactive Pedestrian Trajectory Dataset for Social Navigation Robots | Unknown | N/A | |
| LithoBench: Benchmarking AI Computational Lithography for Semiconductor Manufacturing | Unknown | N/A | |
| Lo-Hi: Practical ML Drug Discovery Benchmark | Unknown | N/A | |
| Focus on Query: Adversarial Mining Transformer for Few-Shot Segmentation | Unknown | N/A | |
| Sharpness-Aware Minimization Leads to Low-Rank Features | Unknown | N/A | |
| Evaluating Self-Supervised Learning for Molecular Graph Embeddings | Unknown | N/A | |
| CAPro: Webly Supervised Learning with Cross-modality Aligned Prototypes | Unknown | N/A | |
| Revealing the unseen: Benchmarking video action recognition under occlusion | Unknown | N/A | |
| CMMA: Benchmarking Multi-Affection Detection in Chinese Multi-Modal Conversations | Unknown | N/A | |
| Can LLM Already Serve as A Database Interface? A BIg Bench for Large-Scale Database Grounded Text-to-SQLs | Unknown | N/A | |
| Understanding Social Reasoning in Language Models with Language Models | Unknown | N/A | |
| Privacy Assessment on Reconstructed Images: Are Existing Evaluation Metrics Faithful to Human Perception? | Unknown | N/A | |
| Tartarus: A Benchmarking Platform for Realistic And Practical Inverse Molecular Design | Unknown | N/A | |
| Does Continual Learning Meet Compositionality? New Benchmarks and An Evaluation Framework | Unknown | N/A | |
| Understanding Neural Network Binarization with Forward and Backward Proximal Quantizers | Unknown | N/A | |
| Contrastive Training of Complex-Valued Autoencoders for Object Discovery | Unknown | N/A | |
| Towards Stable Backdoor Purification through Feature Shift Tuning | Unknown | N/A | |
| Deep Contract Design via Discontinuous Networks | Unknown | N/A | |
| Environment-Aware Dynamic Graph Learning for Out-of-Distribution Generalization | Unknown | N/A | |
| FD-Align: Feature Discrimination Alignment for Fine-tuning Pre-Trained Models in Few-Shot Learning | Unknown | N/A | |
| On the Generalization Properties of Diffusion Models | Unknown | N/A | |
| Collaborative Score Distillation for Consistent Visual Editing | Unknown | N/A | |
| Multi-scale Diffusion Denoised Smoothing | Unknown | N/A | |
| Improving Language Plasticity via Pretraining with Active Forgetting | Unknown | N/A | |
| Alternation makes the adversary weaker in two-player games | Unknown | N/A | |
| Graph Contrastive Learning with Stable and Scalable Spectral Encoding | Unknown | N/A | |
| Textually Pretrained Speech Language Models | Unknown | N/A | |
| LaFTer: Label-Free Tuning of Zero-shot Classifier using Language and Unlabeled Image Collections | Unknown | N/A | |
| Effective Bayesian Heteroscedastic Regression with Deep Neural Networks | Unknown | N/A | |
| Continuous-Time Functional Diffusion Processes | Unknown | N/A | |
| One-Line-of-Code Data Mollification Improves Optimization of Likelihood-based Generative Models | Unknown | N/A | |
| Learning Adaptive Tensorial Density Fields for Clean Cryo-ET Reconstruction | Unknown | N/A | |
| How to Fine-tune the Model: Unified Model Shift and Model Bias Policy Optimization | Unknown | N/A | |
| On the Powerfulness of Textual Outlier Exposure for Visual OoD Detection | Unknown | N/A | |
| Disentangling Cognitive Diagnosis with Limited Exercise Labels | Unknown | N/A | |
| Model-Based Reparameterization Policy Gradient Methods: Theory and Practical Algorithms | Unknown | N/A | |
| (Amplified) Banded Matrix Factorization: A unified approach to private training | Unknown | N/A | |
| Gradient Descent with Linearly Correlated Noise: Theory and Applications to Differential Privacy | Unknown | N/A | |
| Adversarial Robustness through Random Weight Sampling | Unknown | N/A | |
| Language Models Meet World Models: Embodied Experiences Enhance Language Models | Unknown | N/A | |
| Model-enhanced Vector Index | Unknown | N/A | |
| Beyond Normal: On the Evaluation of Mutual Information Estimators | Unknown | N/A | |
| ISP: Multi-Layered Garment Draping with Implicit Sewing Patterns | Unknown | N/A | |
| Representational Strengths and Limitations of Transformers | Unknown | N/A | |
| Direction-oriented Multi-objective Learning: Simple and Provable Stochastic Algorithms | Unknown | N/A | |
| Recovering Simultaneously Structured Data via Non-Convex Iteratively Reweighted Least Squares | Unknown | N/A | |
| A Spectral Theory of Neural Prediction and Alignment | Unknown | N/A | |
| Adaptive Normalization for Non-stationary Time Series Forecasting: A Temporal Slice Perspective | Unknown | N/A | |
| A Cross-Moment Approach for Causal Effect Estimation | Unknown | N/A | |
| Learning the Efficient Frontier | Unknown | N/A | |
| Defending against Data-Free Model Extraction by Distributionally Robust Defensive Training | Unknown | N/A | |
| Bounding the Invertibility of Privacy-preserving Instance Encoding using Fisher Information | Unknown | N/A | |
| Information Maximizing Curriculum: A Curriculum-Based Approach for Learning Versatile Skills | Unknown | N/A | |
| 3D Indoor Instance Segmentation in an Open-World | Unknown | N/A | |
| DPOK: Reinforcement Learning for Fine-tuning Text-to-Image Diffusion Models | Unknown | N/A | |
| Generalization bounds for neural ordinary differential equations and deep residual networks | Unknown | N/A | |
| Spike-driven Transformer | Unknown | N/A | |
| Learning threshold neurons via edge of stability | Unknown | N/A | |
| Django: Detecting Trojans in Object Detection Models via Gaussian Focus Calibration | Unknown | N/A | |
| NeRF-IBVS: Visual Servo Based on NeRF for Visual Localization and Navigation | Unknown | N/A | |
| Image Captioners Are Scalable Vision Learners Too | Unknown | N/A | |
| Boosting Adversarial Transferability by Achieving Flat Local Maxima | Unknown | N/A | |
| To Repeat or Not To Repeat: Insights from Scaling LLM under Token-Crisis | Unknown | N/A | |
| Calibrate and Boost Logical Expressiveness of GNN Over Multi-Relational and Temporal Graphs | Unknown | N/A | |
| LANCE: Stress-testing Visual Models by Generating Language-guided Counterfactual Images | Unknown | N/A | |
| On the explainable properties of 1-Lipschitz Neural Networks: An Optimal Transport Perspective | Unknown | N/A | |
| Dynamic Sparsity Is Channel-Level Sparsity Learner | Unknown | N/A | |
| Video Dynamics Prior: An Internal Learning Approach for Robust Video Enhancements | Unknown | N/A | |
| SPAE: Semantic Pyramid AutoEncoder for Multimodal Generation with Frozen LLMs | Unknown | N/A | |
| DiffSketcher: Text Guided Vector Sketch Synthesis through Latent Diffusion Models | Unknown | N/A | |
| In-Context Impersonation Reveals Large Language Models' Strengths and Biases | Unknown | N/A | |
| Don’t just prune by magnitude! Your mask topology is a secret weapon | Unknown | N/A | |
| Transfer Learning with Affine Model Transformation | Unknown | N/A | |
| Conditional Mutual Information for Disentangled Representations in Reinforcement Learning | Unknown | N/A | |
| Spiking PointNet: Spiking Neural Networks for Point Clouds | Unknown | N/A | |
| Episodic Multi-Task Learning with Heterogeneous Neural Processes | Unknown | N/A | |
| PaintSeg: Painting Pixels for Training-free Segmentation | Unknown | N/A | |
| Faith and Fate: Limits of Transformers on Compositionality | Unknown | N/A | |
| Faster approximate subgraph counts with privacy | Unknown | N/A | |
| Defending Pre-trained Language Models as Few-shot Learners against Backdoor Attacks | Unknown | N/A | |
| Fine-grained Late-interaction Multi-modal Retrieval for Retrieval Augmented Visual Question Answering | Unknown | N/A | |
| SLM: A Smoothed First-Order Lagrangian Method for Structured Constrained Nonconvex Optimization | Unknown | N/A | |
| Skill-it! A data-driven skills framework for understanding and training language models | Unknown | N/A | |
| Debiasing Pretrained Generative Models by Uniformly Sampling Semantic Attributes | Unknown | N/A | |
| Joint Attribute and Model Generalization Learning for Privacy-Preserving Action Recognition | Unknown | N/A | |
| ALGO: Synthesizing Algorithmic Programs with Generated Oracle Verifiers | Unknown | N/A | |
| Graph Mixture of Experts: Learning on Large-Scale Graphs with Explicit Diversity Modeling | Unknown | N/A | |
| StyleDrop: Text-to-Image Synthesis of Any Style | Unknown | N/A | |
| Three Towers: Flexible Contrastive Learning with Pretrained Image Models | Unknown | N/A | |
| When Can We Track Significant Preference Shifts in Dueling Bandits? | Unknown | N/A | |
| H-nobs: Achieving Certified Fairness and Robustness in Distributed Learning on Heterogeneous Datasets | Unknown | N/A | |
| Nash Regret Guarantees for Linear Bandits | Unknown | N/A | |
| An Efficient Dataset Condensation Plugin and Its Application to Continual Learning | Unknown | N/A | |
| A Holistic Approach to Unifying Automatic Concept Extraction and Concept Importance Estimation | Unknown | N/A | |
| PackQViT: Faster Sub-8-bit Vision Transformers via Full and Packed Quantization on the Mobile | Unknown | N/A | |
| Robust low-rank training via approximate orthonormal constraints | Unknown | N/A | |
| GNeSF: Generalizable Neural Semantic Fields | Unknown | N/A | |
| Augmentation-Aware Self-Supervision for Data-Efficient GAN Training | Unknown | N/A | |
| MAViL: Masked Audio-Video Learners | Unknown | N/A | |
| A Spectral Algorithm for List-Decodable Covariance Estimation in Relative Frobenius Norm | Unknown | N/A | |
| Interpretable Graph Networks Formulate Universal Algebra Conjectures | Unknown | N/A | |
| H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models | Unknown | N/A | |
| Self-Refine: Iterative Refinement with Self-Feedback | Unknown | N/A | |
| CoLA: Exploiting Compositional Structure for Automatic and Efficient Numerical Linear Algebra | Unknown | N/A | |
| Dataset Diffusion: Diffusion-based Synthetic Data Generation for Pixel-Level Semantic Segmentation | Unknown | N/A | |
| White-Box Transformers via Sparse Rate Reduction | Unknown | N/A | |
| 4M: Massively Multimodal Masked Modeling | Unknown | N/A | |
| When can Regression-Adjusted Control Variate Help? Rare Events, Sobolev Embedding and Minimax Optimality | Unknown | N/A | |
| Polynomially Over-Parameterized Convolutional Neural Networks Contain Structured Strong Winning Lottery Tickets | Unknown | N/A | |
| RDumb: A simple approach that questions our progress in continual test-time adaptation | Unknown | N/A | |
| Patch n’ Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution | Unknown | N/A | |
| RevColV2: Exploring Disentangled Representations in Masked Image Modeling | Unknown | N/A | |
| PolyDiffuse: Polygonal Shape Reconstruction via Guided Set Diffusion Models | Unknown | N/A | |
| Making Scalable Meta Learning Practical | Unknown | N/A | |
| Hierarchical Integration Diffusion Model for Realistic Image Deblurring | Unknown | N/A | |
| CAP: Correlation-Aware Pruning for Highly-Accurate Sparse Vision Models | Unknown | N/A | |
| MoVie: Visual Model-Based Policy Adaptation for View Generalization | Unknown | N/A | |
| Compositional Policy Learning in Stochastic Control Systems with Formal Guarantees | Unknown | N/A | |
| Self-Supervised Learning with Lie Symmetries for Partial Differential Equations | Unknown | N/A | |
| Leveraging the two-timescale regime to demonstrate convergence of neural networks | Unknown | N/A | |
| Training Chain-of-Thought via Latent-Variable Inference | Unknown | N/A | |
| Spectral Entry-wise Matrix Estimation for Low-Rank Reinforcement Learning | Unknown | N/A | |
| The Emergence of Essential Sparsity in Large Pre-trained Models: The Weights that Matter | Unknown | N/A | |
| Context-PIPs: Persistent Independent Particles Demands Spatial Context Features | Unknown | N/A | |
| 3D Copy-Paste: Physically Plausible Object Insertion for Monocular 3D Detection | Unknown | N/A | |
| ZipLM: Inference-Aware Structured Pruning of Language Models | Unknown | N/A | |
| Does a sparse ReLU network training problem always admit an optimum? | Unknown | N/A | |
| Large Language Models of Code Fail at Completing Code with Potential Bugs | Unknown | N/A | |
| Response Length Perception and Sequence Scheduling: An LLM-Empowered LLM Inference Pipeline | Unknown | N/A | |
| Train Once and Explain Everywhere: Pre-training Interpretable Graph Neural Networks | Unknown | N/A | |
| BLIP-Diffusion: Pre-trained Subject Representation for Controllable Text-to-Image Generation and Editing | Unknown | N/A | |
| Reducing Shape-Radiance Ambiguity in Radiance Fields with a Closed-Form Color Estimation Method | Unknown | N/A | |
| SimMTM: A Simple Pre-Training Framework for Masked Time-Series Modeling | Unknown | N/A | |
| Online Adaptive Policy Selection in Time-Varying Systems: No-Regret via Contractive Perturbations | Unknown | N/A | |
| Test-Time Amendment with a Coarse Classifier for Fine-Grained Classification | Unknown | N/A | |
| CLeAR: Continual Learning on Algorithmic Reasoning for Human-like Intelligence | Unknown | N/A | |
| EvoPrompting: Language Models for Code-Level Neural Architecture Search | Unknown | N/A | |
| Near-Optimal Algorithms for Gaussians with Huber Contamination: Mean Estimation and Linear Regression | Unknown | N/A | |
| Ess-InfoGAIL: Semi-supervised Imitation Learning from Imbalanced Demonstrations | Unknown | N/A | |
| TextDiffuser: Diffusion Models as Text Painters | Unknown | N/A | |
| On permutation symmetries in Bayesian neural network posteriors: a variational perspective | Unknown | N/A | |
| Computing Optimal Nash Equilibria in Multiplayer Games | Unknown | N/A | |
| BIRD: Generalizable Backdoor Detection and Removal for Deep Reinforcement Learning | Unknown | N/A | |
| Fine-Grained Visual Prompting | Unknown | N/A | |
| Unconstrained Dynamic Regret via Sparse Coding | Unknown | N/A | |
| H-InDex: Visual Reinforcement Learning with Hand-Informed Representations for Dexterous Manipulation | Unknown | N/A | |
| DiffComplete: Diffusion-based Generative 3D Shape Completion | Unknown | N/A | |
| Towards the Difficulty for a Deep Neural Network to Learn Concepts of Different Complexities | Unknown | N/A | |
| Robust Distributed Learning: Tight Error Bounds and Breakdown Point under Data Heterogeneity | Unknown | N/A | |
| Individual Arbitrariness and Group Fairness | Unknown | N/A | |
| UniT: A Unified Look at Certified Robust Training against Text Adversarial Perturbation | Unknown | N/A | |
| Explore In-Context Learning for 3D Point Cloud Understanding | Unknown | N/A | |
| Model and Feature Diversity for Bayesian Neural Networks in Mutual Learning | Unknown | N/A | |
| StateMask: Explaining Deep Reinforcement Learning through State Mask | Unknown | N/A | |
| VLATTACK: Multimodal Adversarial Attacks on Vision-Language Tasks via Pre-trained Models | Unknown | N/A | |
| Transformers learn through gradual rank increase | Unknown | N/A | |
| Landscape Surrogate: Learning Decision Losses for Mathematical Optimization Under Partial Information | Unknown | N/A | |
| Understanding and Addressing the Pitfalls of Bisimulation-based Representations in Offline Reinforcement Learning | Unknown | N/A | |
| Weakly-Supervised Audio-Visual Segmentation | Unknown | N/A | |
| Retrieval-Augmented Multiple Instance Learning | Unknown | N/A | |
| Unsupervised Semantic Correspondence Using Stable Diffusion | Unknown | N/A | |
| Meta-in-context learning in large language models | Unknown | N/A | |
| Offline RL with Discrete Proxy Representations for Generalizability in POMDPs | Unknown | N/A | |
| Knowledge Distillation Performs Partial Variance Reduction | Unknown | N/A | |
| H2RBox-v2: Incorporating Symmetry for Boosting Horizontal Box Supervised Oriented Object Detection | Unknown | N/A | |
| Exploiting Correlated Auxiliary Feedback in Parameterized Bandits | Unknown | N/A | |
| Learning from Rich Semantics and Coarse Locations for Long-tailed Object Detection | Unknown | N/A | |
| GAUCHE: A Library for Gaussian Processes in Chemistry | Unknown | N/A | |
| Transitivity Recovering Decompositions: Interpretable and Robust Fine-Grained Relationships | Unknown | N/A | |
| BayesDAG: Gradient-Based Posterior Inference for Causal Discovery | Unknown | N/A | |
| Optimal Algorithms for the Inhomogeneous Spiked Wigner Model | Unknown | N/A | |
| Error Discovery By Clustering Influence Embeddings | Unknown | N/A | |
| Quasi-Monte Carlo Graph Random Features | Unknown | N/A | |
| Operator Learning with Neural Fields: Tackling PDEs on General Geometries | Unknown | N/A | |
| Non-Smooth Weakly-Convex Finite-sum Coupled Compositional Optimization | Unknown | N/A | |
| AUDIT: Audio Editing by Following Instructions with Latent Diffusion Models | Unknown | N/A | |
| HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in Hugging Face | Unknown | N/A | |
| Federated Multi-Objective Learning | Unknown | N/A | |
| A Combinatorial Algorithm for Approximating the Optimal Transport in the Parallel and MPC Settings | Unknown | N/A | |
| Goal Driven Discovery of Distributional Differences via Language Descriptions | Unknown | N/A | |
| Counterfactual Generation with Identifiability Guarantees | Unknown | N/A | |
| CS-Isolate: Extracting Hard Confident Examples by Content and Style Isolation | Unknown | N/A | |
| ScaleLong: Towards More Stable Training of Diffusion Model via Scaling Network Long Skip Connection | Unknown | N/A | |
| Convex-Concave Zero-Sum Markov Stackelberg Games | Unknown | N/A | |
| Implicit Transfer Operator Learning: Multiple Time-Resolution Models for Molecular Dynamics | Unknown | N/A | |
| Zeroth-Order Methods for Nondifferentiable, Nonconvex, and Hierarchical Federated Optimization | Unknown | N/A | |
| MosaicBERT: A Bidirectional Encoder Optimized for Fast Pretraining | Unknown | N/A | |
| Causes and Effects of Unanticipated Numerical Deviations in Neural Network Inference Frameworks | Unknown | N/A | |
| Real-World Image Super-Resolution as Multi-Task Learning | Unknown | N/A | |
| ProteinNPT: Improving Protein Property Prediction and Design with Non-Parametric Transformers | Unknown | N/A | |
| Counterfactually Comparing Abstaining Classifiers | Unknown | N/A | |
| Koopman Kernel Regression | Unknown | N/A | |
| KD-Zero: Evolving Knowledge Distiller for Any Teacher-Student Pairs | Unknown | N/A | |
| Achieving $\mathcal{O}(\epsilon^{-1.5})$ Complexity in Hessian/Jacobian-free Stochastic Bilevel Optimization | Unknown | N/A | |
| Contrastive Moments: Unsupervised Halfspace Learning in Polynomial Time | Unknown | N/A | |
| Lossy Image Compression with Conditional Diffusion Models | Unknown | N/A | |
| Emergence of Shape Bias in Convolutional Neural Networks through Activation Sparsity | Unknown | N/A | |
| LoCoOp: Few-Shot Out-of-Distribution Detection via Prompt Learning | Unknown | N/A | |
| Regret Minimization via Saddle Point Optimization | Unknown | N/A | |
| STEVE-1: A Generative Model for Text-to-Behavior in Minecraft | Unknown | N/A | |
| Learning Adversarial Low-rank Markov Decision Processes with Unknown Transition and Full-information Feedback | Unknown | N/A | |
| Experiment Planning with Function Approximation | Unknown | N/A | |
| Exploring Diverse In-Context Configurations for Image Captioning | Unknown | N/A | |
| Characterizing Out-of-Distribution Error via Optimal Transport | Unknown | N/A | |
| POP-3D: Open-Vocabulary 3D Occupancy Prediction from Images | Unknown | N/A | |
| Contextual Gaussian Process Bandits with Neural Networks | Unknown | N/A | |
| Labeling Neural Representations with Inverse Recognition | Unknown | N/A | |
| Coop: Memory is not a Commodity | Unknown | N/A | |
| DoReMi: Optimizing Data Mixtures Speeds Up Language Model Pretraining | Unknown | N/A | |
| A Causal Framework for Decomposing Spurious Variations | Unknown | N/A | |
| PromptRestorer: A Prompting Image Restoration Method with Degradation Perception | Unknown | N/A | |
| Adversarial Counterfactual Environment Model Learning | Unknown | N/A | |
| Breadcrumbs to the Goal: Goal-Conditioned Exploration from Human-in-the-Loop Feedback | Unknown | N/A | |
| Data Selection for Language Models via Importance Resampling | Unknown | N/A | |
| Linear Time Algorithms for k-means with Multi-Swap Local Search | Unknown | N/A | |
| Understanding Few-Shot Learning: Measuring Task Relatedness and Adaptation Difficulty via Attributes | Unknown | N/A | |
| Foundation Model is Efficient Multimodal Multitask Model Selector | Unknown | N/A | |
| Network Regression with Graph Laplacians | Unknown | N/A | |
| ClimSim: A large multi-scale dataset for hybrid physics-ML climate emulation | Unknown | N/A | |
| KuaiSim: A Comprehensive Simulator for Recommender Systems | Unknown | N/A | |
| Universality and Limitations of Prompt Tuning | Unknown | N/A | |
| [Re] Pure Noise to the Rescue of Insufficient Data | Unknown | N/A | |
| ResoNet: Noise-Trained Physics-Informed MRI Off-Resonance Correction | Unknown | N/A | |
| Lockdown: Backdoor Defense for Federated Learning with Isolated Subspace Training | Unknown | N/A | |
| SA-Solver: Stochastic Adams Solver for Fast Sampling of Diffusion Models | Unknown | N/A | |
| Scientific Document Retrieval using Multi-level Aspect-based Queries | Unknown | N/A | |
| Model Sparsity Can Simplify Machine Unlearning | Unknown | N/A | |
| Waypoint Transformer: Reinforcement Learning via Supervised Learning with Intermediate Targets | Unknown | N/A | |
| Guide Your Agent with Adaptive Multimodal Rewards | Unknown | N/A | |
| Label Robust and Differentially Private Linear Regression: Computational and Statistical Efficiency | Unknown | N/A | |
| Offline Reinforcement Learning for Mixture-of-Expert Dialogue Management | Unknown | N/A | |
| SwiFT: Swin 4D fMRI Transformer | Unknown | N/A | |
| Object-Centric Learning for Real-World Videos by Predicting Temporal Feature Similarities | Unknown | N/A | |
| Asymptotically Optimal Quantile Pure Exploration for Infinite-Armed Bandits | Unknown | N/A | |
| Sharp Spectral Rates for Koopman Operator Learning | Unknown | N/A | |
| Encoding Time-Series Explanations through Self-Supervised Model Behavior Consistency | Unknown | N/A | |
| MMD Aggregated Two-Sample Test | Unknown | N/A | |
| PopSign ASL v1.0: An Isolated American Sign Language Dataset Collected via Smartphones | Unknown | N/A | |
| Decorate3D: Text-Driven High-Quality Texture Generation for Mesh Decoration in the Wild | Unknown | N/A | |
| Characterizing the Optimal $0-1$ Loss for Multi-class Classification with a Test-time Attacker | Unknown | N/A | |
| A Diffusion-Model of Joint Interactive Navigation | Unknown | N/A | |
| The Pick-to-Learn Algorithm: Empowering Compression for Tight Generalization Bounds and Improved Post-training Performance | Unknown | N/A | |
| High Precision Causal Model Evaluation with Conditional Randomization | Unknown | N/A | |
| AVIS: Autonomous Visual Information Seeking with Large Language Model Agent | Unknown | N/A | |
| Fast Scalable and Accurate Discovery of DAGs Using the Best Order Score Search and Grow Shrink Trees | Unknown | N/A | |
| Distance-Restricted Folklore Weisfeiler-Leman GNNs with Provable Cycle Counting Power | Unknown | N/A | |
| First- and Second-Order Bounds for Adversarial Linear Contextual Bandits | Unknown | N/A | |
| RePo: Resilient Model-Based Reinforcement Learning by Regularizing Posterior Predictability | Unknown | N/A | |
| Distributionally Robust Ensemble of Lottery Tickets Towards Calibrated Sparse Network Training | Unknown | N/A | |
| Accelerated Zeroth-order Method for Non-Smooth Stochastic Convex Optimization Problem with Infinite Variance | Unknown | N/A | |
| Handling Data Heterogeneity via Architectural Design for Federated Visual Recognition | Unknown | N/A | |
| ImageBrush: Learning Visual In-Context Instructions for Exemplar-Based Image Manipulation | Unknown | N/A | |
| Percentile Criterion Optimization in Offline Reinforcement Learning | Unknown | N/A | |
| VisoGender: A dataset for benchmarking gender bias in image-text pronoun resolution | Unknown | N/A | |
| Functional Equivalence and Path Connectivity of Reducible Hyperbolic Tangent Networks | Unknown | N/A | |
| Switching Autoregressive Low-rank Tensor Models | Unknown | N/A | |
| Epidemic Learning: Boosting Decentralized Learning with Randomized Communication | Unknown | N/A | |
| Predicting mutational effects on protein-protein binding via a side-chain diffusion probabilistic model | Unknown | N/A | |
| SANFlow: Semantic-Aware Normalizing Flow for Anomaly Detection | Unknown | N/A | |
| Exact Bayesian Inference on Discrete Models via Probability Generating Functions: A Probabilistic Programming Approach | Unknown | N/A | |
| Structured Neural-PI Control with End-to-End Stability and Output Tracking Guarantees | Unknown | N/A | |
| Testing the General Deductive Reasoning Capacity of Large Language Models Using OOD Examples | Unknown | N/A | |
| Global Optimality and Finite Sample Analysis of Softmax Off-Policy Actor Critic under State Distribution Mismatch | Unknown | N/A | |
| Online (Multinomial) Logistic Bandit: Improved Regret and Constant Computation Cost | Unknown | N/A | |
| Provably Efficient Offline Goal-Conditioned Reinforcement Learning with General Function Approximation and Single-Policy Concentrability | Unknown | N/A | |
| [Re] CrossWalk: Fairness-enhanced Node Representation Learning | Unknown | N/A | |
| Parallel Sampling of Diffusion Models | Unknown | N/A | |
| Variational Imbalanced Regression: Fair Uncertainty Quantification via Probabilistic Smoothing | Unknown | N/A | |
| Temporal Causal Mediation through a Point Process: Direct and Indirect Effects of Healthcare Interventions | Unknown | N/A | |
| Reproducibility Study of “Label-Free Explainability for Unsupervised Models” | Unknown | N/A | |
| Variational Gibbs Inference for Statistical Model Estimation from Incomplete Data | Unknown | N/A | |
| MiliPoint: A Point Cloud Dataset for mmWave Radar | Unknown | N/A | |
| One-for-All: Bridge the Gap Between Heterogeneous Architectures in Knowledge Distillation | Unknown | N/A | |
| Survival Instinct in Offline Reinforcement Learning | Unknown | N/A | |
| Winner Takes It All: Training Performant RL Populations for Combinatorial Optimization | Unknown | N/A | |
| [Re] Exploring the Role of Grammar and Word Choice in Bias Toward African American English (AAE) in Hate Speech Classification | Unknown | N/A | |
| Suggesting Variable Order for Cylindrical Algebraic Decomposition via Reinforcement Learning | Unknown | N/A | |
| [Re] FOCUS: Flexible Optimizable Counterfactual Explanations for Tree Ensembles | Unknown | N/A | |
| Reproducibility study of 'Proto2Proto: Can you recognise the car, the way I do?' | Unknown | N/A | |
| Euler-Lagrange Analysis of Generative Adversarial Networks | Unknown | N/A | |
| Energy Transformer | Unknown | N/A | |
| Neural Lighting Simulation for Urban Scenes | Unknown | N/A | |
| VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset | Unknown | N/A | |
| Prototypical Variational Autoencoder for 3D Few-shot Object Detection | Unknown | N/A | |
| ProlificDreamer: High-Fidelity and Diverse Text-to-3D Generation with Variational Score Distillation | Unknown | N/A | |
| Strong and Precise Modulation of Human Percepts via Robustified ANNs | Unknown | N/A | |
| Learning Descriptive Image Captioning via Semipermeable Maximum Likelihood Estimation | Unknown | N/A | |
| Regularizing Neural Networks with Meta-Learning Generative Models | Unknown | N/A | |
| Locally Invariant Explanations: Towards Stable and Unidirectional Explanations through Local Invariant Learning | Unknown | N/A | |
| Probabilistic inverse optimal control for non-linear partially observable systems disentangles perceptual uncertainty and behavioral costs | Unknown | N/A | |
| [Re] Numerical influence of ReLU'(0) on backpropagation | Unknown | N/A | |
| AMAG: Additive, Multiplicative and Adaptive Graph Neural Network For Forecasting Neuron Activity | Unknown | N/A | |
| Provably Safe Reinforcement Learning with Step-wise Violation Constraints | Unknown | N/A | |
| UP-DP: Unsupervised Prompt Learning for Data Pre-Selection with Vision-Language Models | Unknown | N/A | |
| On Measuring Fairness in Generative Models | Unknown | N/A | |
| Frequency-Enhanced Data Augmentation for Vision-and-Language Navigation | Unknown | N/A | |
| DESSERT: An Efficient Algorithm for Vector Set Search with Vector Set Queries | Unknown | N/A | |
| Large Language Models are Fixated by Red Herrings: Exploring Creative Problem Solving and Einstellung Effect using the Only Connect Wall Dataset | Unknown | N/A | |
| PanoGen: Text-Conditioned Panoramic Environment Generation for Vision-and-Language Navigation | Unknown | N/A | |
| [Re] Hierarchical Shrinkage: Improving the Accuracy and Interpretability of Tree-Based Methods | Unknown | N/A | |
| [Re] VAE Approximation Error: ELBO and Exponential Families | Unknown | N/A | |
| Reduced Policy Optimization for Continuous Control with Hard Constraints | Unknown | N/A | |
| Sampling weights of deep neural networks | Unknown | N/A | |
| Learning Interpretable Low-dimensional Representation via Physical Symmetry | Unknown | N/A | |
| Towards a fuller understanding of neurons with Clustered Compositional Explanations | Unknown | N/A | |
| Augmented Memory Replay-based Continual Learning Approaches for Network Intrusion Detection | Unknown | N/A | |
| Efficient Robust Bayesian Optimization for Arbitrary Uncertain inputs | Unknown | N/A | |
| Resilient Constrained Learning | Unknown | N/A | |
| LargeST: A Benchmark Dataset for Large-Scale Traffic Forecasting | Unknown | N/A | |
| Pairwise Causality Guided Transformers for Event Sequences | Unknown | N/A | |
| An Iterative Self-Learning Framework for Medical Domain Generalization | Unknown | N/A | |
| Learning Fine-grained View-Invariant Representations from Unpaired Ego-Exo Videos via Temporal Alignment | Unknown | N/A | |
| RayDF: Neural Ray-surface Distance Fields with Multi-view Consistency | Unknown | N/A | |
| Causal Discovery in Semi-Stationary Time Series | Unknown | N/A | |
| SmoothHess: ReLU Network Feature Interactions via Stein's Lemma | Unknown | N/A | |
| Learning Score-based Grasping Primitive for Human-assisting Dexterous Grasping | Unknown | N/A | |
| The Surprising Effectiveness of Diffusion Models for Optical Flow and Monocular Depth Estimation | Unknown | N/A | |
| Extensible Prompts for Language Models on Zero-shot Language Style Customization | Unknown | N/A | |
| Switching Temporary Teachers for Semi-Supervised Semantic Segmentation | Unknown | N/A | |
| [Re] $\mathcal{G}$-Mixup: Graph Data Augmentation for Graph Classification | Unknown | N/A | |
| [Re] End-to-end Algorithm Synthesis with Recurrent Networks: Logical Extrapolation Without Overthinking | Unknown | N/A | |
| [Re] Variational Neural Cellular Automata | Unknown | N/A | |
| Imagine the Unseen World: A Benchmark for Systematic Generalization in Visual World Models | Unknown | N/A | |
| Equivariant Neural Operator Learning with Graphon Convolution | Unknown | N/A | |
| Effective Human-AI Teams via Learned Natural Language Rules and Onboarding | Unknown | N/A | |
| Easy Bayesian Transfer Learning with Informative Priors | Unknown | N/A | |
| Statistical Limits of Adaptive Linear Models: Low-Dimensional Estimation and Inference | Unknown | N/A | |
| Stability-penalty-adaptive follow-the-regularized-leader: Sparsity, game-dependency, and best-of-both-worlds | Unknown | N/A | |
| ANPL: Towards Natural Programming with Interactive Decomposition | Unknown | N/A | |
| Using Imperfect Surrogates for Downstream Inference: Design-based Supervised Learning for Social Science Applications of Large Language Models | Unknown | N/A | |
| Uncovering and Quantifying Social Biases in Code Generation | Unknown | N/A | |
| SOC: Semantic-Assisted Object Cluster for Referring Video Object Segmentation | Unknown | N/A | |
| IPMix: Label-Preserving Data Augmentation Method for Training Robust Classifiers | Unknown | N/A | |
| Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer | Unknown | N/A | |
| Passive learning of active causal strategies in agents and language models | Unknown | N/A | |
| Scissorhands: Exploiting the Persistence of Importance Hypothesis for LLM KV Cache Compression at Test Time | Unknown | N/A | |
| All Points Matter: Entropy-Regularized Distribution Alignment for Weakly-supervised 3D Segmentation | Unknown | N/A | |
| SwapPrompt: Test-Time Prompt Adaptation for Vision-Language Models | Unknown | N/A | |
| An Optimal and Scalable Matrix Mechanism for Noisy Marginals under Convex Loss Functions | Unknown | N/A | |
| Addressing Negative Transfer in Diffusion Models | Unknown | N/A | |
| Constant Approximation for Individual Preference Stable Clustering | Unknown | N/A | |
| LayoutPrompter: Awaken the Design Ability of Large Language Models | Unknown | N/A | |
| Structured Semidefinite Programming for Recovering Structured Preconditioners | Unknown | N/A | |
| Stochastic Collapse: How Gradient Noise Attracts SGD Dynamics Towards Simpler Subnetworks | Unknown | N/A | |
| Revisit the Power of Vanilla Knowledge Distillation: from Small Scale to Large Scale | Unknown | N/A | |
| Change point detection and inference in multivariate non-parametric models under mixing conditions | Unknown | N/A | |
| Does Invariant Graph Learning via Environment Augmentation Learn Invariance? | Unknown | N/A | |
| GRAND-SLAMIN’ Interpretable Additive Modeling with Structural Constraints | Unknown | N/A | |
| The geometry of hidden representations of large transformer models | Unknown | N/A | |
| This Looks Like Those: Illuminating Prototypical Concepts Using Multiple Visualizations | Unknown | N/A | |
| Stable Bias: Evaluating Societal Representations in Diffusion Models | Unknown | N/A | |
| Multitask Learning with No Regret: from Improved Confidence Bounds to Active Learning | Unknown | N/A | |
| Distributional Model Equivalence for Risk-Sensitive Reinforcement Learning | Unknown | N/A | |
| Mixed Samples as Probes for Unsupervised Model Selection in Domain Adaptation | Unknown | N/A | |
| Parts of Speech–Grounded Subspaces in Vision-Language Models | Unknown | N/A | |
| Reproducibility Study of “Quantifying Societal Bias Amplification in Image Captioning” | Unknown | N/A | |
| Memory-Constrained Algorithms for Convex Optimization | Unknown | N/A | |
| Quantus: An Explainable AI Toolkit for Responsible Evaluation of Neural Network Explanations and Beyond | Unknown | N/A | |
| Synthetic-to-Real Pose Estimation with Geometric Reconstruction | Unknown | N/A | |
| Policy Optimization in a Noisy Neighborhood: On Return Landscapes in Continuous Control | Unknown | N/A | |
| Implicit Variational Inference for High-Dimensional Posteriors | Unknown | N/A | |
| Analyzing Generalization of Neural Networks through Loss Path Kernels | Unknown | N/A | |
| BeaverTails: Towards Improved Safety Alignment of LLM via a Human-Preference Dataset | Unknown | N/A | |
| Auslan-Daily: Australian Sign Language Translation for Daily Communication and News | Unknown | N/A | |
| Closing the Computational-Statistical Gap in Best Arm Identification for Combinatorial Semi-bandits | Unknown | N/A | |
| De novo Drug Design using Reinforcement Learning with Multiple GPT Agents | Unknown | N/A | |
| Bifurcations and loss jumps in RNN training | Unknown | N/A | |
| ReMaX: Relaxing for Better Training on Efficient Panoptic Segmentation | Unknown | N/A | |
| Taylor TD-learning | Unknown | N/A | |
| State-space models with layer-wise nonlinearity are universal approximators with exponential decaying memory | Unknown | N/A | |
| Estimating Noise Correlations Across Continuous Conditions With Wishart Processes | Unknown | N/A | |
| Deep learning with kernels through RKHM and the Perron-Frobenius operator | Unknown | N/A | |
| Efficient Training of Energy-Based Models Using Jarzynski Equality | Unknown | N/A | |
| FABind: Fast and Accurate Protein-Ligand Binding | Unknown | N/A | |
| A Unified Conditional Framework for Diffusion-based Image Restoration | Unknown | N/A | |
| Drift doesn't Matter: Dynamic Decomposition with Diffusion Reconstruction for Unstable Multivariate Time Series Anomaly Detection | Unknown | N/A | |
| NeuroGF: A Neural Representation for Fast Geodesic Distance and Path Queries | Unknown | N/A | |
| Double and Single Descent in Causal Inference with an Application to High-Dimensional Synthetic Control | Unknown | N/A | |
| Rethinking Semi-Supervised Medical Image Segmentation: A Variance-Reduction Perspective | Unknown | N/A | |
| Query-based Temporal Fusion with Explicit Motion for 3D Object Detection | Unknown | N/A | |
| On Masked Pre-training and the Marginal Likelihood | Unknown | N/A | |
| On Learning Necessary and Sufficient Causal Graphs | Unknown | N/A | |
| Learning List-Level Domain-Invariant Representations for Ranking | Unknown | N/A | |
| (S)GD over Diagonal Linear Networks: Implicit bias, Large Stepsizes and Edge of Stability | Unknown | N/A | |
| Goal-Conditioned Predictive Coding for Offline Reinforcement Learning | Unknown | N/A | |
| Diffused Redundancy in Pre-trained Representations | Unknown | N/A | |
| ECG-QA: A Comprehensive Question Answering Dataset Combined With Electrocardiogram | Unknown | N/A | |
| Adversarially Robust Learning with Uncertain Perturbation Sets | Unknown | N/A | |
| BayesTune: Bayesian Sparse Deep Model Fine-tuning | Unknown | N/A | |
| Mixed-Initiative Multiagent Apprenticeship Learning for Human Training of Robot Teams | Unknown | N/A | |
| Rank-N-Contrast: Learning Continuous Representations for Regression | Unknown | N/A | |
| SheetCopilot: Bringing Software Productivity to the Next Level through Large Language Models | Unknown | N/A | |
| Local Convergence of Gradient Methods for Min-Max Games: Partial Curvature Generically Suffices | Unknown | N/A | |
| How to Select Which Active Learning Strategy is Best Suited for Your Specific Problem and Budget | Unknown | N/A | |
| Rubik's Cube: High-Order Channel Interactions with a Hierarchical Receptive Field | Unknown | N/A | |
| Cheap and Quick: Efficient Vision-Language Instruction Tuning for Large Language Models | Unknown | N/A | |
| FouriDown: Factoring Down-Sampling into Shuffling and Superposing | Unknown | N/A | |
| Recasting Continual Learning as Sequence Modeling | Unknown | N/A | |
| LagrangeBench: A Lagrangian Fluid Mechanics Benchmarking Suite | Unknown | N/A | |
| Unleashing the Full Potential of Product Quantization for Large-Scale Image Retrieval | Unknown | N/A | |
| ParaFuzz: An Interpretability-Driven Technique for Detecting Poisoned Samples in NLP | Unknown | N/A | |
| Regularized Behavior Cloning for Blocking the Leakage of Past Action Information | Unknown | N/A | |
| Expressive Sign Equivariant Networks for Spectral Geometric Learning | Unknown | N/A | |
| AdANNS: A Framework for Adaptive Semantic Search | Unknown | N/A | |
| Continuous-time Analysis of Anchor Acceleration | Unknown | N/A | |
| Private Federated Frequency Estimation: Adapting to the Hardness of the Instance | Unknown | N/A | |
| Credal Marginal MAP | Unknown | N/A | |
| Canonical normalizing flows for manifold learning | Unknown | N/A | |
| QH9: A Quantum Hamiltonian Prediction Benchmark for QM9 Molecules | Unknown | N/A | |
| Trial matching: capturing variability with data-constrained spiking neural networks | Unknown | N/A | |
| ANTN: Bridging Autoregressive Neural Networks and Tensor Networks for Quantum Many-Body Simulation | Unknown | N/A | |
| Learning Visual Prior via Generative Pre-Training | Unknown | N/A | |
| How Re-sampling Helps for Long-Tail Learning? | Unknown | N/A | |
| Public Opinion Field Effect Fusion in Representation Learning for Trending Topics Diffusion | Unknown | N/A | |
| Symbol-LLM: Leverage Language Models for Symbolic System in Visual Human Activity Reasoning | Unknown | N/A | |
| Is Heterogeneity Notorious? Taming Heterogeneity to Handle Test-Time Shift in Federated Learning | Unknown | N/A | |
| Mirror Diffusion Models for Constrained and Watermarked Generation | Unknown | N/A | |
| Template-free Articulated Neural Point Clouds for Reposable View Synthesis | Unknown | N/A | |
| Context-guided Embedding Adaptation for Effective Topic Modeling in Low-Resource Regimes | Unknown | N/A | |
| Actively Testing Your Model While It Learns: Realizing Label-Efficient Learning in Practice | Unknown | N/A | |
| XAGen: 3D Expressive Human Avatars Generation | Unknown | N/A | |
| NVFi: Neural Velocity Fields for 3D Physics Learning from Dynamic Videos | Unknown | N/A | |
| Gradient Informed Proximal Policy Optimization | Unknown | N/A | |
| [Re] On the Reproducibility of CartoonX | Unknown | N/A | |
| [Re] Fairness Guarantees under Demographic Shift | Unknown | N/A | |
| Bilevel Optimization with a Lower-level Contraction: Optimal Sample Complexity without Warm-Start | Unknown | N/A | |
| OpenAGI: When LLM Meets Domain Experts | Unknown | N/A | |
| PRODIGY: Enabling In-context Learning Over Graphs | Unknown | N/A | |
| The Quantization Model of Neural Scaling | Unknown | N/A | |
| Bayesian Metric Learning for Uncertainty Quantification in Image Retrieval | Unknown | N/A | |
| MIMONets: Multiple-Input-Multiple-Output Neural Networks Exploiting Computation in Superposition | Unknown | N/A | |
| Sample-Efficient and Safe Deep Reinforcement Learning via Reset Deep Ensemble Agents | Unknown | N/A | |
| Unbiased constrained sampling with Self-Concordant Barrier Hamiltonian Monte Carlo | Unknown | N/A | |
| Mnemosyne: Learning to Train Transformers with Transformers | Unknown | N/A | |
| EmbodiedGPT: Vision-Language Pre-Training via Embodied Chain of Thought | Unknown | N/A | |
| Flow Factorized Representation Learning | Unknown | N/A | |
| Zero-Shot Anomaly Detection via Batch Normalization | Unknown | N/A | |
| Sensitivity in Translation Averaging | Unknown | N/A | |
| Are GATs Out of Balance? | Unknown | N/A | |
| Towards Foundation Models for Scientific Machine Learning: Characterizing Scaling and Transfer Behavior | Unknown | N/A | |
| A Hierarchical Training Paradigm for Antibody Structure-sequence Co-design | Unknown | N/A | |
| No-Regret Learning with Unbounded Losses: The Case of Logarithmic Pooling | Unknown | N/A | |
| Expanding Small-Scale Datasets with Guided Imagination | Unknown | N/A | |
| Annotator: A Generic Active Learning Baseline for LiDAR Semantic Segmentation | Unknown | N/A | |
| Turbulence in Focus: Benchmarking Scaling Behavior of 3D Volumetric Super-Resolution with BLASTNet 2.0 Data | Unknown | N/A | |
| Improved Bayesian Regret Bounds for Thompson Sampling in Reinforcement Learning | Unknown | N/A | |
| What Do Deep Saliency Models Learn about Visual Attention? | Unknown | N/A | |
| Small Total-Cost Constraints in Contextual Bandits with Knapsacks, with Application to Fairness | Unknown | N/A | |
| Combinatorial Group Testing with Selfish Agents | Unknown | N/A | |
| Domain Adaptive Imitation Learning with Visual Observation | Unknown | N/A | |
| Fast Attention Requires Bounded Entries | Unknown | N/A | |
| Adaptive Data Analysis in a Balanced Adversarial Model | Unknown | N/A | |
| Datasets and Benchmarks for Nanophotonic Structure and Parametric Design Simulations | Unknown | N/A | |
| A benchmark of categorical encoders for binary classification | Unknown | N/A | |
| Improved Communication Efficiency in Federated Natural Policy Gradient via ADMM-based Gradient Updates | Unknown | N/A | |
| Learning to Tokenize for Generative Retrieval | Unknown | N/A | |
| Holistic Transfer: Towards Non-Disruptive Fine-Tuning with Partial Target Data | Unknown | N/A | |
| Multi-Prompt Alignment for Multi-Source Unsupervised Domain Adaptation | Unknown | N/A | |
| NEO-KD: Knowledge-Distillation-Based Adversarial Training for Robust Multi-Exit Neural Networks | Unknown | N/A | |
| The Best of Both Worlds in Network Population Games: Reaching Consensus and Convergence to Equilibrium | Unknown | N/A | |
| [Re] On Explainability of Graph Neural Networks via Subgraph Explorations | Unknown | N/A | |
| Sparse Graph Learning from Spatiotemporal Time Series | Unknown | N/A | |
| EHRXQA: A Multi-Modal Question Answering Dataset for Electronic Health Records with Chest X-ray Images | Unknown | N/A | |
| Synthcity: a benchmark framework for diverse use cases of tabular synthetic data | Unknown | N/A | |
| T2I-CompBench: A Comprehensive Benchmark for Open-world Compositional Text-to-image Generation | Unknown | N/A | |
| Reinforcement-Enhanced Autoregressive Feature Transformation: Gradient-steered Search in Continuous Space for Postfix Expressions | Unknown | N/A | |
| State Regularized Policy Optimization on Data with Dynamics Shift | Unknown | N/A | |
| Leveraging Locality and Robustness to Achieve Massively Scalable Gaussian Process Regression | Unknown | N/A | |
| GLOBER: Coherent Non-autoregressive Video Generation via GLOBal Guided Video DecodER | Unknown | N/A | |
| A Variational Perspective on High-Resolution ODEs | Unknown | N/A | |
| Federated Learning via Meta-Variational Dropout | Unknown | N/A | |
| BadTrack: A Poison-Only Backdoor Attack on Visual Object Tracking | Unknown | N/A | |
| Interpretable Prototype-based Graph Information Bottleneck | Unknown | N/A | |
| Contrastive Lift: 3D Object Instance Segmentation by Slow-Fast Contrastive Fusion | Unknown | N/A | |
| Iteratively Learn Diverse Strategies with State Distance Information | Unknown | N/A | |
| Open Compound Domain Adaptation with Object Style Compensation for Semantic Segmentation | Unknown | N/A | |
| Minimax Forward and Backward Learning of Evolving Tasks with Performance Guarantees | Unknown | N/A | |
| Temporal Continual Learning with Prior Compensation for Human Motion Prediction | Unknown | N/A | |
| [Re] On the Reproducibility of “FairCal: Fairness Calibration for Face Verification” | Unknown | N/A | |
| GLEMOS: Benchmark for Instantaneous Graph Learning Model Selection | Unknown | N/A | |
| Statistically Valid Variable Importance Assessment through Conditional Permutations | Unknown | N/A | |
| Generative Modelling of Stochastic Actions with Arbitrary Constraints in Reinforcement Learning | Unknown | N/A | |
| Prompt-augmented Temporal Point Process for Streaming Event Sequence | Unknown | N/A | |
| Path following algorithms for $\ell_2$-regularized $M$-estimation with approximation guarantee | Unknown | N/A | |
| The CLIP Model is Secretly an Image-to-Prompt Converter | Unknown | N/A | |
| Finding Counterfactually Optimal Action Sequences in Continuous State Spaces | Unknown | N/A | |
| Maximum Independent Set: Self-Training through Dynamic Programming | Unknown | N/A | |
| Beyond Average Return in Markov Decision Processes | Unknown | N/A | |
| Stochastic Approximation Approaches to Group Distributionally Robust Optimization | Unknown | N/A | |
| Hierarchically Gated Recurrent Neural Network for Sequence Modeling | Unknown | N/A | |
| Graph Convolutional Kernel Machine versus Graph Convolutional Networks | Unknown | N/A | |
| Identification of Nonlinear Latent Hierarchical Models | Unknown | N/A | |
| Bypass Exponential Time Preprocessing: Fast Neural Network Training via Weight-Data Correlation Preprocessing | Unknown | N/A | |
| A-NeSI: A Scalable Approximate Method for Probabilistic Neurosymbolic Inference | Unknown | N/A | |
| Truncated Affinity Maximization: One-class Homophily Modeling for Graph Anomaly Detection | Unknown | N/A | |
| Tracr: Compiled Transformers as a Laboratory for Interpretability | Unknown | N/A | |
| Proximity-Informed Calibration for Deep Neural Networks | Unknown | N/A | |
| RELIC: Reproducibility and Extension on LIC metric in quantifying bias in captioning models | Unknown | N/A | |
| [Re] Masked Autoencoders Are Small Scale Vision Learners: A Reproduction Under Resource Constraints | Unknown | N/A | |
| AndroidInTheWild: A Large-Scale Dataset For Android Device Control | Unknown | N/A | |
| Preference-grounded Token-level Guidance for Language Model Fine-tuning | Unknown | N/A | |
| Learning Space-Time Continuous Latent Neural PDEs from Partially Observed States | Unknown | N/A | |
| Designing Robust Transformers using Robust Kernel Density Estimation | Unknown | N/A | |
| Sample Complexity of Goal-Conditioned Hierarchical Reinforcement Learning | Unknown | N/A | |
| Towards Label Position Bias in Graph Neural Networks | Unknown | N/A | |
| Can Language Models Teach? Teacher Explanations Improve Student Performance via Personalization | Unknown | N/A | |
| Guiding Large Language Models via Directional Stimulus Prompting | Unknown | N/A | |
| CAST: Cross-Attention in Space and Time for Video Action Recognition | Unknown | N/A | |
| Complementary Benefits of Contrastive Learning and Self-Training Under Distribution Shift | Unknown | N/A | |
| Do Not Marginalize Mechanisms, Rather Consolidate! | Unknown | N/A | |
| Cascading Contextual Assortment Bandits | Unknown | N/A | |
| Post Hoc Explanations of Language Models Can Improve Language Models | Unknown | N/A | |
| Likelihood Ratio Confidence Sets for Sequential Decision Making | Unknown | N/A | |
| HiBug: On Human-Interpretable Model Debug | Unknown | N/A | |
| DropPos: Pre-Training Vision Transformers by Reconstructing Dropped Positions | Unknown | N/A | |
| Slow and Weak Attractor Computation Embedded in Fast and Strong E-I Balanced Neural Dynamics | Unknown | N/A | |
| Penguin: Parallel-Packed Homomorphic Encryption for Fast Graph Convolutional Network Inference | Unknown | N/A | |
| Training Transitive and Commutative Multimodal Transformers with LoReTTa | Unknown | N/A | |
| QLoRA: Efficient Finetuning of Quantized LLMs | Unknown | N/A | |
| Chanakya: Learning Runtime Decisions for Adaptive Real-Time Perception | Unknown | N/A | |
| Unleashing the Power of Randomization in Auditing Differentially Private ML | Unknown | N/A | |
| Deep Reinforcement Learning with Plasticity Injection | Unknown | N/A | |
| Characterizing Graph Datasets for Node Classification: Homophily-Heterophily Dichotomy and Beyond | Unknown | N/A | |
| [Re] Bandit Theory and Thompson Sampling-guided Directed Evolution for Sequence Optimization | Unknown | N/A | |
| Neural Sculpting: Uncovering hierarchically modular task structure in neural networks through pruning and network analysis | Unknown | N/A | |
| Counterfactual Memorization in Neural Language Models | Unknown | N/A | |
| Importance-aware Co-teaching for Offline Model-based Optimization | Unknown | N/A | |
| Promises and Pitfalls of Threshold-based Auto-labeling | Unknown | N/A | |
| Functional-Group-Based Diffusion for Pocket-Specific Molecule Generation and Elaboration | Unknown | N/A | |
| Graph-Structured Gaussian Processes for Transferable Graph Learning | Unknown | N/A | |
| Normalization Layers Are All That Sharpness-Aware Minimization Needs | Unknown | N/A | |
| Neural Ideal Large Eddy Simulation: Modeling Turbulence with Neural Stochastic Differential Equations | Unknown | N/A | |
| Large sample spectral analysis of graph-based multi-manifold clustering | Unknown | N/A | |
| Motion-X: A Large-scale 3D Expressive Whole-body Human Motion Dataset | Unknown | N/A | |
| RIO: A Benchmark for Reasoning Intention-Oriented Objects in Open Environments | Unknown | N/A | |
| AVOIDDS: Aircraft Vision-based Intruder Detection Dataset and Simulator | Unknown | N/A | |
| RVD: A Handheld Device-Based Fundus Video Dataset for Retinal Vessel Segmentation | Unknown | N/A | |
| Memory-Efficient Fine-Tuning of Compressed Large Language Models via sub-4-bit Integer Quantization | Unknown | N/A | |
| Self-Evaluation Guided Beam Search for Reasoning | Unknown | N/A | |
| On the Identifiability of Sparse ICA without Assuming Non-Gaussianity | Unknown | N/A | |
| Imitation Learning from Imperfection: Theoretical Justifications and Algorithms | Unknown | N/A | |
| StableFDG: Style and Attention Based Learning for Federated Domain Generalization | Unknown | N/A | |
| ATTA: Anomaly-aware Test-Time Adaptation for Out-of-Distribution Detection in Segmentation | Unknown | N/A | |
| Reward Finetuning for Faster and More Accurate Unsupervised Object Discovery | Unknown | N/A | |
| On the Role of Noise in the Sample Complexity of Learning Recurrent Neural Networks: Exponential Gaps for Long Sequences | Unknown | N/A | |
| CL-NeRF: Continual Learning of Neural Radiance Fields for Evolving Scene Representation | Unknown | N/A | |
| COCO-Counterfactuals: Automatically Constructed Counterfactual Examples for Image-Text Pairs | Unknown | N/A | |
| Optimizing Prompts for Text-to-Image Generation | Unknown | N/A | |
| Subject-driven Text-to-Image Generation via Apprenticeship Learning | Unknown | N/A | |
| BCDiff: Bidirectional Consistent Diffusion for Instantaneous Trajectory Prediction | Unknown | N/A | |
| Data Quality in Imitation Learning | Unknown | N/A | |
| Approximation-Generalization Trade-offs under (Approximate) Group Equivariance | Unknown | N/A | |
| Density of States Prediction of Crystalline Materials via Prompt-guided Multi-Modal Transformer | Unknown | N/A | |
| Robust Learning with Progressive Data Expansion Against Spurious Correlation | Unknown | N/A | |
| Exploring and Interacting with the Set of Good Sparse Generalized Additive Models | Unknown | N/A | |
| ClimateSet: A Large-Scale Climate Model Dataset for Machine Learning | Unknown | N/A | |
| Visual Explanations of Image-Text Representations via Multi-Modal Information Bottleneck Attribution | Unknown | N/A | |
| ID and OOD Performance Are Sometimes Inversely Correlated on Real-world Datasets | Unknown | N/A | |
| Provably Bounding Neural Network Preimages | Unknown | N/A | |
| Into the LAION’s Den: Investigating Hate in Multimodal Datasets | Unknown | N/A | |
| ProBio: A Protocol-guided Multimodal Dataset for Molecular Biology Lab | Unknown | N/A | |
| Exploiting Connections between Lipschitz Structures for Certifiably Robust Deep Equilibrium Models | Unknown | N/A | |
| Explainable and Efficient Randomized Voting Rules | Unknown | N/A | |
| Mitigating Test-Time Bias for Fair Image Retrieval | Unknown | N/A | |
| Scaling Data-Constrained Language Models | Unknown | N/A | |
| Parallel Spiking Neurons with High Efficiency and Ability to Learn Long-term Dependencies | Unknown | N/A | |
| Mesogeos: A multi-purpose dataset for data-driven wildfire modeling in the Mediterranean | Unknown | N/A | |
| Dynamic Non-monotone Submodular Maximization | Unknown | N/A | |
| Prediction and Control in Continual Reinforcement Learning | Unknown | N/A | |
| Bridging Discrete and Backpropagation: Straight-Through and Beyond | Unknown | N/A | |
| Diverse Community Data for Benchmarking Data Privacy Algorithms | Unknown | N/A | |
| DataPerf: Benchmarks for Data-Centric AI Development | Unknown | N/A | |
| Efficient Diffusion Policies For Offline Reinforcement Learning | Unknown | N/A | |
| Improving Graph Matching with Positional Reconstruction Encoder-Decoder Network | Unknown | N/A | |
| Explore to Generalize in Zero-Shot RL | Unknown | N/A | |
| Gaussian Partial Information Decomposition: Bias Correction and Application to High-dimensional Data | Unknown | N/A | |
| Unified Embedding: Battle-Tested Feature Representations for Web-Scale ML Systems | Unknown | N/A | |
| Ego4D Goal-Step: Toward Hierarchical Understanding of Procedural Activities | Unknown | N/A | |
| DreamSim: Learning New Dimensions of Human Visual Similarity using Synthetic Data | Unknown | N/A | |
| Secure Out-of-Distribution Task Generalization with Energy-Based Models | Unknown | N/A | |
| Expressivity-Preserving GNN Simulation | Unknown | N/A | |
| An Optimal Structured Zeroth-order Algorithm for Non-smooth Optimization | Unknown | N/A | |
| BanditPAM++: Faster $k$-medoids Clustering | Unknown | N/A | |
| Most Neural Networks Are Almost Learnable | Unknown | N/A | |
| Estimating Generic 3D Room Structures from 2D Annotations | Unknown | N/A | |
| Dissecting Chain-of-Thought: Compositionality through In-Context Filtering and Learning | Unknown | N/A | |
| DRAUC: An Instance-wise Distributionally Robust AUC Optimization Framework | Unknown | N/A | |
| TOA: Task-oriented Active VQA | Unknown | N/A | |
| Flocks of Stochastic Parrots: Differentially Private Prompt Learning for Large Language Models | Unknown | N/A | |
| An Empirical Investigation of the Role of Pre-training in Lifelong Learning | Unknown | N/A | |
| BubbleML: A Multiphase Multiphysics Dataset and Benchmarks for Machine Learning | Unknown | N/A | |
| Task-aware Distributed Source Coding under Dynamic Bandwidth | Unknown | N/A | |
| Flexible Attention-Based Multi-Policy Fusion for Efficient Deep Reinforcement Learning | Unknown | N/A | |
| Deep Equilibrium Based Neural Operators for Steady-State PDEs | Unknown | N/A | |
| Replicability in Reinforcement Learning | Unknown | N/A | |
| Have it your way: Individualized Privacy Assignment for DP-SGD | Unknown | N/A | |
| DICES Dataset: Diversity in Conversational AI Evaluation for Safety | Unknown | N/A | |
| DP-HyPO: An Adaptive Private Framework for Hyperparameter Optimization | Unknown | N/A | |
| Fairly Recommending with Social Attributes: A Flexible and Controllable Optimization Approach | Unknown | N/A | |
| Individualized Dosing Dynamics via Neural Eigen Decomposition | Unknown | N/A | |
| Structural Pruning for Diffusion Models | Unknown | N/A | |
| Joint Data-Task Generation for Auxiliary Learning | Unknown | N/A | |
| Repetition In Repetition Out: Towards Understanding Neural Text Degeneration from the Data Perspective | Unknown | N/A | |
| Meta-Learning Adversarial Bandit Algorithms | Unknown | N/A | |
| What can Large Language Models do in chemistry? A comprehensive benchmark on eight tasks | Unknown | N/A | |
| Exponentially Convergent Algorithms for Supervised Matrix Factorization | Unknown | N/A | |
| Unified Enhancement of Privacy Bounds for Mixture Mechanisms via $f$-Differential Privacy | Unknown | N/A | |
| Dynamic Context Pruning for Efficient and Interpretable Autoregressive Transformers | Unknown | N/A | |
| Fundamental Limits and Tradeoffs in Invariant Representation Learning | Unknown | N/A | |
| FedGame: A Game-Theoretic Defense against Backdoor Attacks in Federated Learning | Unknown | N/A | |
| Hyper-Skin: A Hyperspectral Dataset for Reconstructing Facial Skin-Spectra from RGB Images | Unknown | N/A | |
| Intrinsic Gaussian Process on Unknown Manifolds with Probabilistic Metrics | Unknown | N/A | |
| Perception Test: A Diagnostic Benchmark for Multimodal Video Models | Unknown | N/A | |
| Alpha-divergence Variational Inference Meets Importance Weighted Auto-Encoders: Methodology and Asymptotics | Unknown | N/A | |
| Diffusion Schrödinger Bridge Matching | Unknown | N/A | |
| Conformal PID Control for Time Series Prediction | Unknown | N/A | |
| Concentration analysis of multivariate elliptic diffusions | Unknown | N/A | |
| Cold Diffusion: Inverting Arbitrary Image Transforms Without Noise | Unknown | N/A | |
| Robust Data Pruning under Label Noise via Maximizing Re-labeling Accuracy | Unknown | N/A | |
| Adversarial Model for Offline Reinforcement Learning | Unknown | N/A | |
| Joint Bayesian Inference of Graphical Structure and Parameters with a Single Generative Flow Network | Unknown | N/A | |
| Managing Temporal Resolution in Continuous Value Estimation: A Fundamental Trade-off | Unknown | N/A | |
| AbdomenAtlas-8K: Annotating 8,000 CT Volumes for Multi-Organ Segmentation in Three Weeks | Unknown | N/A | |
| Sparse Deep Learning for Time Series Data: Theory and Applications | Unknown | N/A | |
| Large Language Models are Visual Reasoning Coordinators | Unknown | N/A | |
| Is Your Code Generated by ChatGPT Really Correct? Rigorous Evaluation of Large Language Models for Code Generation | Unknown | N/A | |
| Smooth Flipping Probability for Differential Private Sign Random Projection Methods | Unknown | N/A | |
| MADLAD-400: A Multilingual And Document-Level Large Audited Dataset | Unknown | N/A | |
| ScenarioNet: Open-Source Platform for Large-Scale Traffic Scenario Simulation and Modeling | Unknown | N/A | |
| Self-supervised video pretraining yields robust and more human-aligned visual representations | Unknown | N/A | |
| Affinity-Aware Graph Networks | Unknown | N/A | |
| Towards Personalized Federated Learning via Heterogeneous Model Reassembly | Unknown | N/A | |
| Would I have gotten that reward? Long-term credit assignment by counterfactual contribution analysis | Unknown | N/A | |
| Unsupervised Polychromatic Neural Representation for CT Metal Artifact Reduction | Unknown | N/A | |
| Progressive Ensemble Distillation: Building Ensembles for Efficient Inference | Unknown | N/A | |
| Adaptive whitening with fast gain modulation and slow synaptic plasticity | Unknown | N/A | |
| MonoUNI: A Unified Vehicle and Infrastructure-side Monocular 3D Object Detection Network with Sufficient Depth Clues | Unknown | N/A | |
| A Logic for Expressing Log-Precision Transformers | Unknown | N/A | |
| The noise level in linear regression with dependent data | Unknown | N/A | |
| Lending Interaction Wings to Recommender Systems with Conversational Agents | Unknown | N/A | |
| Directional diffusion models for graph representation learning | Unknown | N/A | |
| Optimality in Mean Estimation: Beyond Worst-Case, Beyond Sub-Gaussian, and Beyond $1+\alpha$ Moments | Unknown | N/A | |
| Reining Generalization in Offline Reinforcement Learning via Representation Distinction | Unknown | N/A | |
| Spuriosity Rankings: Sorting Data to Measure and Mitigate Biases | Unknown | N/A | |
| Structure of universal formulas | Unknown | N/A | |
| The Learnability of In-Context Learning | Unknown | N/A | |
| Shape Non-rigid Kinematics (SNK): A Zero-Shot Method for Non-Rigid Shape Matching via Unsupervised Functional Map Regularized Reconstruction | Unknown | N/A | |
| Differentially Private Decoupled Graph Convolutions for Multigranular Topology Protection | Unknown | N/A | |
| On Learning Latent Models with Multi-Instance Weak Supervision | Unknown | N/A | |
| Open Visual Knowledge Extraction via Relation-Oriented Multimodality Model Prompting | Unknown | N/A | |
| Customizable Image Synthesis with Multiple Subjects | Unknown | N/A | |
| Sampling from Gaussian Process Posteriors using Stochastic Gradient Descent | Unknown | N/A | |
| Rethinking the Backward Propagation for Adversarial Transferability | Unknown | N/A | |
| Geometric Neural Diffusion Processes | Unknown | N/A | |
| SceneScape: Text-Driven Consistent Scene Generation | Unknown | N/A | |
| Task-Robust Pre-Training for Worst-Case Downstream Adaptation | Unknown | N/A | |
| Weighted ROC Curve in Cost Space: Extending AUC to Cost-Sensitive Learning | Unknown | N/A | |
| Low Tensor Rank Learning of Neural Dynamics | Unknown | N/A | |
| Alignment with human representations supports robust few-shot learning | Unknown | N/A | |
| May the Force be with You: Unified Force-Centric Pre-Training for 3D Molecular Conformations | Unknown | N/A | |
| Estimating Koopman operators with sketching to provably learn large scale dynamical systems | Unknown | N/A | |
| Transition-constant Normalization for Image Enhancement | Unknown | N/A | |
| Selective Sampling and Imitation Learning via Online Regression | Unknown | N/A | |
| 3D-IntPhys: Towards More Generalized 3D-grounded Visual Intuitive Physics under Challenging Scenes | Unknown | N/A | |
| Robust Multi-Agent Reinforcement Learning via Adversarial Regularization: Theoretical Foundation and Stable Algorithms | Unknown | N/A | |
| NAS-X: Neural Adaptive Smoothing via Twisting | Unknown | N/A | |
| Hardness of Low Rank Approximation of Entrywise Transformed Matrix Products | Unknown | N/A | |
| Probabilistic Weight Fixing: Large-scale training of neural network weight uncertainties for quantisation. | Unknown | N/A | |
| Better Correlation and Robustness: A Distribution-Balanced Self-Supervised Learning Framework for Automatic Dialogue Evaluation | Unknown | N/A | |
| Learning to Compress Prompts with Gist Tokens | Unknown | N/A | |
| Evaluating Graph Neural Networks for Link Prediction: Current Pitfalls and New Benchmarking | Unknown | N/A | |
| MM-Fi: Multi-Modal Non-Intrusive 4D Human Dataset for Versatile Wireless Sensing | Unknown | N/A | |
| Small Transformers Compute Universal Metric Embeddings | Unknown | N/A | |
| Fast Online Changepoint Detection via Functional Pruning CUSUM Statistics | Unknown | N/A | |
| The Separation Capacity of Random Neural Networks | Unknown | N/A | |
| Conditional Distribution Function Estimation Using Neural Networks for Censored and Uncensored Data | Unknown | N/A | |
| Inference for Gaussian Processes with Matern Covariogram on Compact Riemannian Manifolds | Unknown | N/A | |
| Toolbox for Multimodal Learn (scikit-multimodallearn) | Unknown | N/A | |
| Graph Clustering with Graph Neural Networks | Unknown | N/A | |
| Reproducibility study of the Fairness-enhanced Node Representation Learning | Unknown | N/A | |
| Reproducibility Study of “CartoonX: Cartoon Explanations of Image Classifiers” | Unknown | N/A | |
| A Theoretical Analysis of Optimistic Proximal Policy Optimization in Linear Markov Decision Processes | Unknown | N/A | |
| Online Ad Procurement in Non-stationary Autobidding Worlds | Unknown | N/A | |
| DELIFFAS: Deformable Light Fields for Fast Avatar Synthesis | Unknown | N/A | |
| VTaC: A Benchmark Dataset of Ventricular Tachycardia Alarms from ICU Monitors | Unknown | N/A | |
| TradeMaster: A Holistic Quantitative Trading Platform Empowered by Reinforcement Learning | Unknown | N/A | |
| Simple, Scalable and Effective Clustering via One-Dimensional Projections | Unknown | N/A | |
| The Distortion of Binomial Voting Defies Expectation | Unknown | N/A | |
| A Theory of Multimodal Learning | Unknown | N/A | |
| Robustness Guarantees for Adversarially Trained Neural Networks | Unknown | N/A | |
| TrojLLM: A Black-box Trojan Prompt Attack on Large Language Models | Unknown | N/A | |
| PRED: Pre-training via Semantic Rendering on LiDAR Point Clouds | Unknown | N/A | |
| $H$-Consistency Bounds: Characterization and Extensions | Unknown | N/A | |
| GraphPatcher: Mitigating Degree Bias for Graph Neural Networks via Test-time Augmentation | Unknown | N/A | |
| Neural Latent Geometry Search: Product Manifold Inference via Gromov-Hausdorff-Informed Bayesian Optimization | Unknown | N/A | |
| Keep Various Trajectories: Promoting Exploration of Ensemble Policies in Continuous Control | Unknown | N/A | |
| Frequency-domain MLPs are More Effective Learners in Time Series Forecasting | Unknown | N/A | |
| Langevin Quasi-Monte Carlo | Unknown | N/A | |
| Anonymous Learning via Look-Alike Clustering: A Precise Analysis of Model Generalization | Unknown | N/A | |
| On the Overlooked Pitfalls of Weight Decay and How to Mitigate Them: A Gradient-Norm Perspective | Unknown | N/A | |
| Domain Watermark: Effective and Harmless Dataset Copyright Protection is Closed at Hand | Unknown | N/A | |
| CORL: Research-oriented Deep Offline Reinforcement Learning Library | Unknown | N/A | |
| Train 'n Trade: Foundations of Parameter Markets | Unknown | N/A | |
| Reproducibility Study of "Label-Free Explainability for Unsupervised Models" | Unknown | N/A | |
| What functions can Graph Neural Networks compute on random graphs? The role of Positional Encoding | Unknown | N/A | |
| Entropy-dissipation Informed Neural Network for McKean-Vlasov Type PDEs | Unknown | N/A | |
| Non-stationary Experimental Design under Linear Trends | Unknown | N/A | |
| Unbounded Differentially Private Quantile and Maximum Estimation | Unknown | N/A | |
| Sub-optimality of the Naive Mean Field approximation for proportional high-dimensional Linear Regression | Unknown | N/A | |
| Jaccard Metric Losses: Optimizing the Jaccard Index with Soft Labels | Unknown | N/A | |
| Sheaf Hypergraph Networks | Unknown | N/A | |
| Theoretical Analysis of the Inductive Biases in Deep Convolutional Networks | Unknown | N/A | |
| Improving the Knowledge Gradient Algorithm | Unknown | N/A | |
| Noise-Adaptive Thompson Sampling for Linear Contextual Bandits | Unknown | N/A | |
| Subspace Identification for Multi-Source Domain Adaptation | Unknown | N/A | |
| VeriX: Towards Verified Explainability of Deep Neural Networks | Unknown | N/A | |
| Uni3DETR: Unified 3D Detection Transformer | Unknown | N/A | |
| Re-Think and Re-Design Graph Neural Networks in Spaces of Continuous Graph Diffusion Functionals | Unknown | N/A | |
| Revisiting Out-of-distribution Robustness in NLP: Benchmarks, Analysis, and LLMs Evaluations | Unknown | N/A | |
| Species196: A One-Million Semi-supervised Dataset for Fine-grained Species Recognition | Unknown | N/A | |
| YouTube-ASL: A Large-Scale, Open-Domain American Sign Language-English Parallel Corpus | Unknown | N/A | |
| Scalable 3D Captioning with Pretrained Models | Unknown | N/A | |
| SAMRS: Scaling-up Remote Sensing Segmentation Dataset with Segment Anything Model | Unknown | N/A | |
NIPS 2024
| Title | Author | Code URL | |
|---|---|---|---|
| IRCAN: Mitigating Knowledge Conflicts in LLM Generation via Identifying and Reweighting Context-Aware Neurons | Unknown | N/A | |
| Is Programming by Example Solved by LLMs? | Unknown | N/A | |
| Meteor: Mamba-based Traversal of Rationale for Large Language and Vision Models | Unknown | N/A | |
| Back to the Continuous Attractor | Unknown | N/A | |
| SkipPredict: When to Invest in Predictions for Scheduling | Unknown | N/A | |
| A Flexible, Equivariant Framework for Subgraph GNNs via Graph Products and Graph Coarsening | Unknown | N/A | |
| On the Expressive Power of Tree-Structured Probabilistic Circuits | Unknown | N/A | |
| Invariant Tokenization of Crystalline Materials for Language Model Enabled Generation | Unknown | N/A | |
| Score-based 3D molecule generation with neural fields | Unknown | N/A | |
| Instance-Optimal Private Density Estimation in the Wasserstein Distance | Unknown | N/A | |
| Robust Prompt Optimization for Defending Language Models Against Jailbreaking Attacks | Unknown | N/A | |
| Intervention and Conditioning in Causal Bayesian Networks | Unknown | N/A | |
| SpaFL: Communication-Efficient Federated Learning With Sparse Models And Low Computational Overhead | Unknown | N/A | |
| A Structure-Aware Framework for Learning Device Placements on Computation Graphs | Unknown | N/A | |
| Model Fusion through Bayesian Optimization in Language Model Fine-Tuning | Unknown | N/A | |
| Online Estimation via Offline Estimation: An Information-Theoretic Framework | Unknown | N/A | |
| WAGLE: Strategic Weight Attribution for Effective and Modular Unlearning in Large Language Models | Unknown | N/A | |
| Scaling transformer neural networks for skillful and reliable medium-range weather forecasting | Unknown | N/A | |
| Toward Efficient Inference for Mixture of Experts | Unknown | N/A | |
| Learning Formal Mathematics From Intrinsic Motivation | Unknown | N/A | |
| PRODuctive bandits: Importance Weighting No More | Unknown | N/A | |
| Random Function Descent | Unknown | N/A | |
| Toward Approaches to Scalability in 3D Human Pose Estimation | Unknown | N/A | |
| Fully Distributed, Flexible Compositional Visual Representations via Soft Tensor Products | Unknown | N/A | |
| Private Attribute Inference from Images with Vision-Language Models | Unknown | N/A | |
| Large Stepsize Gradient Descent for Non-Homogeneous Two-Layer Networks: Margin Improvement and Fast Optimization | Unknown | N/A | |
| Erasing Undesirable Concepts in Diffusion Models with Adversarial Preservation | Unknown | N/A | |
| Graph Neural Networks and Arithmetic Circuits | Unknown | N/A | |
| xMIL: Insightful Explanations for Multiple Instance Learning in Histopathology | Unknown | N/A | |
| QUEST: Quality-Aware Metropolis-Hastings Sampling for Machine Translation | Unknown | N/A | |
| Constant Acceleration Flow | Unknown | N/A | |
| Contracting with a Learning Agent | Unknown | N/A | |
| Rethinking Reconstruction-based Graph-Level Anomaly Detection: Limitations and a Simple Remedy | Unknown | N/A | |
| Reinforcement Learning Under Latent Dynamics: Toward Statistical and Algorithmic Modularity | Unknown | N/A | |
| HEPrune: Fast Private Training of Deep Neural Networks With Encrypted Data Pruning | Unknown | N/A | |
| On the Reproducibility of: "Learning Perturbations to Explain Time Series Predictions" | Unknown | N/A | |
| AROMA: Preserving Spatial Structure for Latent PDE Modeling with Local Neural Fields | Unknown | N/A | |
| Robust Sparse Regression with Non-Isotropic Designs | Unknown | N/A | |
| HairDiffusion: Vivid Multi-Colored Hair Editing via Latent Diffusion | Unknown | N/A | |
| Agent Planning with World Knowledge Model | Unknown | N/A | |
| Latent Intrinsics Emerge from Training to Relight | Unknown | N/A | |
| Sketched Lanczos uncertainty score: a low-memory summary of the Fisher information | Unknown | N/A | |
| Learning from higher-order correlations, efficiently: hypothesis tests, random features, and neural networks | Unknown | N/A | |
| QuanTA: Efficient High-Rank Fine-Tuning of LLMs with Quantum-Informed Tensor Adaptation | Unknown | N/A | |
| Geometric Analysis of Nonlinear Manifold Clustering | Unknown | N/A | |
| Generating Highly Designable Proteins with Geometric Algebra Flow Matching | Unknown | N/A | |
| Conditional Synthesis of 3D Molecules with Time Correction Sampler | Unknown | N/A | |
| A General Protocol to Probe Large Vision Models for 3D Physical Understanding | Unknown | N/A | |
| Improving Deep Learning Optimization through Constrained Parameter Regularization | Unknown | N/A | |
| HW-GPT-Bench: Hardware-Aware Architecture Benchmark for Language Models | Unknown | N/A | |
| Flex-MoE: Modeling Arbitrary Modality Combination via the Flexible Mixture-of-Experts | Unknown | N/A | |
| Equivariant Neural Diffusion for Molecule Generation | Unknown | N/A | |
| Transformers need glasses! Information over-squashing in language tasks | Unknown | N/A | |
| Cracking the Code of Juxtaposition: Can AI Models Understand the Humorous Contradictions | Unknown | N/A | |
| Learning on Large Graphs using Intersecting Communities | Unknown | N/A | |
| Graph Edit Distance with General Costs Using Neural Set Divergence | Unknown | N/A | |
| fMRI predictors based on language models of increasing complexity recover brain left lateralization | Unknown | N/A | |
| Private Online Learning via Lazy Algorithms | Unknown | N/A | |
| Autonomous Agents for Collaborative Task under Information Asymmetry | Unknown | N/A | |
| HHD-GP: Incorporating Helmholtz-Hodge Decomposition into Gaussian Processes for Learning Dynamical Systems | Unknown | N/A | |
| Estimating the Hallucination Rate of Generative AI | Unknown | N/A | |
| Assouad, Fano, and Le Cam with Interaction: A Unifying Lower Bound Framework and Characterization for Bandit Learnability | Unknown | N/A | |
| DAGER: Exact Gradient Inversion for Large Language Models | Unknown | N/A | |
| A Theoretical Perspective for Speculative Decoding Algorithm | Unknown | N/A | |
| Generative Modeling of Molecular Dynamics Trajectories | Unknown | N/A | |
| Differentially Private Stochastic Gradient Descent with Fixed-Size Minibatches: Tighter RDP Guarantees with or without Replacement | Unknown | N/A | |
| CoBo: Collaborative Learning via Bilevel Optimization | Unknown | N/A | |
| Controlling Continuous Relaxation for Combinatorial Optimization | Unknown | N/A | |
| Graph Coarsening with Message-Passing Guarantees | Unknown | N/A | |
| [Re] Reproducibility Study of “Explaining Temporal Graph Models Through an Explorer-Navigator Framework” | Unknown | N/A | |
| Schedule Your Edit: A Simple yet Effective Diffusion Noise Schedule for Image Editing | Unknown | N/A | |
| EffiBench: Benchmarking the Efficiency of Automatically Generated Code | Unknown | N/A | |
| Solving Inverse Problems via Diffusion Optimal Control | Unknown | N/A | |
| EffiLearner: Enhancing Efficiency of Generated Code via Self-Optimization | Unknown | N/A | |
| ProTransformer: Robustify Transformers via Plug-and-Play Paradigm | Unknown | N/A | |
| Wasserstein Gradient Boosting: A Framework for Distribution-Valued Supervised Learning | Unknown | N/A | |
| Proportional Fairness in Non-Centroid Clustering | Unknown | N/A | |
| Fairness in Social Influence Maximization via Optimal Transport | Unknown | N/A | |
| Attack-Aware Noise Calibration for Differential Privacy | Unknown | N/A | |
| Accelerating ERM for data-driven algorithm design using output-sensitive techniques | Unknown | N/A | |
| Limits of Transformer Language Models on Learning to Compose Algorithms | Unknown | N/A | |
| MARVEL: Multidimensional Abstraction and Reasoning through Visual Evaluation and Learning | Unknown | N/A | |
| 4Real: Towards Photorealistic 4D Scene Generation via Video Diffusion Models | Unknown | N/A | |
| Nimbus: Secure and Efficient Two-Party Inference for Transformers | Unknown | N/A | |
| Combining Observational Data and Language for Species Range Estimation | Unknown | N/A | |
| Reciprocal Learning | Unknown | N/A | |
| No Filter: Cultural and Socioeconomic Diversity in Contrastive Vision-Language Models | Unknown | N/A | |
| Entity Alignment with Noisy Annotations from Large Language Models | Unknown | N/A | |
| SG-Bench: Evaluating LLM Safety Generalization Across Diverse Tasks and Prompt Types | Unknown | N/A | |
| Regret Minimization in Stackelberg Games with Side Information | Unknown | N/A | |
| Position Coupling: Improving Length Generalization of Arithmetic Transformers Using Task Structure | Unknown | N/A | |
| MiniCache: KV Cache Compression in Depth Dimension for Large Language Models | Unknown | N/A | |
| MM-WLAuslan: Multi-View Multi-Modal Word-Level Australian Sign Language Recognition Dataset | Unknown | N/A | |
| On Causal Discovery in the Presence of Deterministic Relations | Unknown | N/A | |
| Dataset and Lessons Learned from the 2024 SaTML LLM Capture-the-Flag Competition | Unknown | N/A | |
| FairMedFM: Fairness Benchmarking for Medical Imaging Foundation Models | Unknown | N/A | |
| Towards Universal Mesh Movement Networks | Unknown | N/A | |
| Expected Probabilistic Hierarchies | Unknown | N/A | |
| Efficient Recurrent Off-Policy RL Requires a Context-Encoder-Specific Learning Rate | Unknown | N/A | |
| Codec Avatar Studio: Paired Human Captures for Complete, Driveable, and Generalizable Avatars | Unknown | N/A | |
| Beyond Aesthetics: Cultural Competence in Text-to-Image Models | Unknown | N/A | |
| Studying How to Efficiently and Effectively Guide Models with Explanations - A Reproducibility Study | Unknown | N/A | |
| FEEL-SNN: Robust Spiking Neural Networks with Frequency Encoding and Evolutionary Leak Factor | Unknown | N/A | |
| RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization | Unknown | N/A | |
| Neural P$^3$M: A Long-Range Interaction Modeling Enhancer for Geometric GNNs | Unknown | N/A | |
| DRIP: Unleashing Diffusion Priors for Joint Foreground and Alpha Prediction in Image Matting | Unknown | N/A | |
| OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI | Unknown | N/A | |
| MaskFactory: Towards High-quality Synthetic Data Generation for Dichotomous Image Segmentation | Unknown | N/A | |
| Road Network Representation Learning with the Third Law of Geography | Unknown | N/A | |
| EyeGraph: Modularity-aware Spatio Temporal Graph Clustering for Continuous Event-based Eye Tracking | Unknown | N/A | |
| Trace is the Next AutoDiff: Generative Optimization with Rich Feedback, Execution Traces, and LLMs | Unknown | N/A | |
| PINNacle: A Comprehensive Benchmark of Physics-Informed Neural Networks for Solving PDEs | Unknown | N/A | |
| Generic Unsupervised Optimization for a Latent Variable Model With Exponential Family Observables | Unknown | N/A | |
| PrivAuditor: Benchmarking Data Protection Vulnerabilities in LLM Adaptation Techniques | Unknown | N/A | |
| Personalized Adapter for Large Meteorology Model on Devices: Towards Weather Foundation Models | Unknown | N/A | |
| Beware of Road Markings: A New Adversarial Patch Attack to Monocular Depth Estimation | Unknown | N/A | |
| Critically Assessing the State of the Art in Neural Network Verification | Unknown | N/A | |
| How to Solve Contextual Goal-Oriented Problems with Offline Datasets? | Unknown | N/A | |
| Mind the Gap Between Prototypes and Images in Cross-domain Finetuning | Unknown | N/A | |
| Aligner-Encoders: Self-Attention Transformers Can Be Self-Transducers | Unknown | N/A | |
| Reproducibility Study of "Robust Fair Clustering: A Novel Fairness Attack and Defense Framework" | Unknown | N/A | |
| TinyTTA: Efficient Test-time Adaptation via Early-exit Ensembles on Edge Devices | Unknown | N/A | |
| DA-Ada: Learning Domain-Aware Adapter for Domain Adaptive Object Detection | Unknown | N/A | |
| ARC: A Generalist Graph Anomaly Detector with In-Context Learning | Unknown | N/A | |
| RestoreAgent: Autonomous Image Restoration Agent via Multimodal Large Language Models | Unknown | N/A | |
| Understanding the Transferability of Representations via Task-Relatedness | Unknown | N/A | |
| Safe Time-Varying Optimization based on Gaussian Processes with Spatio-Temporal Kernel | Unknown | N/A | |
| MutaPLM: Protein Language Modeling for Mutation Explanation and Engineering | Unknown | N/A | |
| Hollowed Net for On-Device Personalization of Text-to-Image Diffusion Models | Unknown | N/A | |
| QT-ViT: Improving Linear Attention in ViT with Quadratic Taylor Expansion | Unknown | N/A | |
| Reproducibility Study Of Learning Fair Graph Representations Via Automated Data Augmentations | Unknown | N/A | |
| Reproducibility study of "LICO: Explainable Models with Language-Image Consistency" | Unknown | N/A | |
| Reproducibility Study on Adversarial Attacks Against Robust Transformer Trackers | Unknown | N/A | |
| The Iterative Optimal Brain Surgeon: Faster Sparse Recovery by Leveraging Second-Order Information | Unknown | N/A | |
| Reproducibility study of "Robust Fair Clustering: A Novel Fairness Attack and Defense Framework" | Unknown | N/A | |
| Reproducibility Study of "ITI-GEN: Inclusive Text-to-Image Generation" | Unknown | N/A | |
| 'Explaining RL Decisions with Trajectories': A Reproducibility Study | Unknown | N/A | |
| [Re] On the Reproducibility of Post-Hoc Concept Bottleneck Models | Unknown | N/A | |
| [Re] GNNInterpreter: A probabilistic generative model-level explanation for Graph Neural Networks | Unknown | N/A | |
| Metrizing Weak Convergence with Maximum Mean Discrepancies | Unknown | N/A | |
| Variation Spaces for Multi-Output Neural Networks: Insights on Multi-Task Learning and Network Compression | Unknown | N/A | |
| Label Alignment Regularization for Distribution Shift | Unknown | N/A | |
| An Analysis of Robustness of Non-Lipschitz Networks | Unknown | N/A | |
| Topological Hidden Markov Models | Unknown | N/A | |
| Pre-trained Gaussian Processes for Bayesian Optimization | Unknown | N/A | |
| Causal Bandits for Linear Structural Equation Models | Unknown | N/A | |
| Causal-learn: Causal Discovery in Python | Unknown | N/A | |
| Optimization-based Causal Estimation from Heterogeneous Environments | Unknown | N/A | |
| Exploration, Exploitation, and Engagement in Multi-Armed Bandits with Abandonment | Unknown | N/A | |
| Nonparametric Regression for 3D Point Cloud Learning | Unknown | N/A | |
| Inference on the Change Point under a High Dimensional Covariance Shift | Unknown | N/A | |
| Dense Associative Memory Through the Lens of Random Features | Unknown | N/A | |
| Multi-Label Learning with Stronger Consistency Guarantees | Unknown | N/A | |
| Piecewise-Stationary Bandits with Knapsacks | Unknown | N/A | |
| Structural Inference of Dynamical Systems with Conjoined State Space Models | Unknown | N/A | |
| Inexact Augmented Lagrangian Methods for Conic Optimization: Quadratic Growth and Linear Convergence | Unknown | N/A | |
| SF-V: Single Forward Video Generation Model | Unknown | N/A | |
| Generative Retrieval Meets Multi-Graded Relevance | Unknown | N/A | |
| ET-Flow: Equivariant Flow-Matching for Molecular Conformer Generation | Unknown | N/A | |
| Deep Learning for Computing Convergence Rates of Markov Chains | Unknown | N/A | |
| Counterfactual Fairness by Combining Factual and Counterfactual Predictions | Unknown | N/A | |
| FIARSE: Model-Heterogeneous Federated Learning via Importance-Aware Submodel Extraction | Unknown | N/A | |
| Scene Graph Disentanglement and Composition for Generalizable Complex Image Generation | Unknown | N/A | |
| Logarithmic Smoothing for Pessimistic Off-Policy Evaluation, Selection and Learning | Unknown | N/A | |
| ResAD: A Simple Framework for Class Generalizable Anomaly Detection | Unknown | N/A | |
| Improved Few-Shot Jailbreaking Can Circumvent Aligned Language Models and Their Defenses | Unknown | N/A | |
| Non-Euclidean Mixture Model for Social Network Embedding | Unknown | N/A | |
| GenRL: Multimodal-foundation world models for generalization in embodied agents | Unknown | N/A | |
| Regularized Adaptive Momentum Dual Averaging with an Efficient Inexact Subproblem Solver for Training Structured Neural Network | Unknown | N/A | |
| Flatten Anything: Unsupervised Neural Surface Parameterization | Unknown | N/A | |
| Learning Low-Rank Feature for Thorax Disease Classification | Unknown | N/A | |
| Long-range Meta-path Search on Large-scale Heterogeneous Graphs | Unknown | N/A | |
| On the Saturation Effects of Spectral Algorithms in Large Dimensions | Unknown | N/A | |
| LibAMM: Empirical Insights into Approximate Computing for Accelerating Matrix Multiplication | Unknown | N/A | |
| Hierarchical Selective Classification | Unknown | N/A | |
| SRFUND: A Multi-Granularity Hierarchical Structure Reconstruction Benchmark in Form Understanding | Unknown | N/A | |
| Fixed points of nonnegative neural networks | Unknown | N/A | |
| On $f$-Divergence Principled Domain Adaptation: An Improved Framework | Unknown | N/A | |
| 3DGS-Enhancer: Enhancing Unbounded 3D Gaussian Splatting with View-consistent 2D Diffusion Priors | Unknown | N/A | |
| Implicit Optimization Bias of Next-token Prediction in Linear Models | Unknown | N/A | |
| Co-occurrence is not Factual Association in Language Models | Unknown | N/A | |
| TorchOpt: An Efficient Library for Differentiable Optimization | Unknown | N/A | |
| Efficient Convex Algorithms for Universal Kernel Learning | Unknown | N/A | |
| Chain-of-Thought Unfaithfulness as Disguised Accuracy | Unknown | N/A | |
| Reproducibility study of FairAC | Unknown | N/A | |
| Unsupervised Anomaly Detection Algorithms on Real-world Data: How Many Do We Need? | Unknown | N/A | |
| BenchMARL: Benchmarking Multi-Agent Reinforcement Learning | Unknown | N/A | |
| Text to Blind Motion | Unknown | N/A | |
| Supra-Laplacian Encoding for Transformer on Dynamic Graphs | Unknown | N/A | |
| Combining Statistical Depth and Fermat Distance for Uncertainty Quantification | Unknown | N/A | |
| MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions | Unknown | N/A | |
| Diffusion-Inspired Truncated Sampler for Text-Video Retrieval | Unknown | N/A | |
| TuneTables: Context Optimization for Scalable Prior-Data Fitted Networks | Unknown | N/A | |
| Pretraining Codomain Attention Neural Operators for Solving Multiphysics PDEs | Unknown | N/A | |
| Randomized Exploration for Reinforcement Learning with Multinomial Logistic Function Approximation | Unknown | N/A | |
| Curriculum Fine-tuning of Vision Foundation Model for Medical Image Classification Under Label Noise | Unknown | N/A | |
| SustainDC: Benchmarking for Sustainable Data Center Control | Unknown | N/A | |
| MeshXL: Neural Coordinate Field for Generative 3D Foundation Models | Unknown | N/A | |
| No-Regret M${}^{\natural}$-Concave Function Maximization: Stochastic Bandit Algorithms and NP-Hardness of Adversarial Full-Information Setting | Unknown | N/A | |
| 3DET-Mamba: Causal Sequence Modelling for End-to-End 3D Object Detection | Unknown | N/A | |
| Benchmarking Uncertainty Disentanglement: Specialized Uncertainties for Specialized Tasks | Unknown | N/A | |
| Unified Lexical Representation for Interpretable Visual-Language Alignment | Unknown | N/A | |
| UnlearnCanvas: Stylized Image Dataset for Enhanced Machine Unlearning Evaluation in Diffusion Models | Unknown | N/A | |
| Mercury: A Code Efficiency Benchmark for Code Large Language Models | Unknown | N/A | |
| DenseFusion-1M: Merging Vision Experts for Comprehensive Multimodal Perception | Unknown | N/A | |
| ANT: Adaptive Noise Schedule for Time Series Diffusion Models | Unknown | N/A | |
| Principled Bayesian Optimization in Collaboration with Human Experts | Unknown | N/A | |
| DataStealing: Steal Data from Diffusion Models in Federated Learning with Multiple Trojans | Unknown | N/A | |
| STaRK: Benchmarking LLM Retrieval on Textual and Relational Knowledge Bases | Unknown | N/A | |
| CausalChaos! Dataset for Comprehensive Causal Action Question Answering Over Longer Causal Chains Grounded in Dynamic Visual Scenes | Unknown | N/A | |
| VERIFIED: A Video Corpus Moment Retrieval Benchmark for Fine-Grained Video Understanding | Unknown | N/A | |
| A Simulation Benchmark for Autonomous Racing with Large-Scale Human Data | Unknown | N/A | |
| Transferable Adversarial Attacks on SAM and Its Downstream Models | Unknown | N/A | |
| PERIA: Perceive, Reason, Imagine, Act via Holistic Language and Vision Planning for Manipulation | Unknown | N/A | |
| HumanVLA: Towards Vision-Language Directed Object Rearrangement by Physical Humanoid | Unknown | N/A | |
| Maximum Entropy Reinforcement Learning via Energy-Based Normalizing Flow | Unknown | N/A | |
| AgentBoard: An Analytical Evaluation Board of Multi-turn LLM Agents | Unknown | N/A | |
| Intrinsic Self-Supervision for Data Quality Audits | Unknown | N/A | |
| Subsurface Scattering for Gaussian Splatting | Unknown | N/A | |
| DART-Eval: A Comprehensive DNA Language Model Evaluation Benchmark on Regulatory DNA | Unknown | N/A | |
| DiReCT: Diagnostic Reasoning for Clinical Notes via Large Language Models | Unknown | N/A | |
| RSA: Resolving Scale Ambiguities in Monocular Depth Estimators through Language Descriptions | Unknown | N/A | |
| WildGaussians: 3D Gaussian Splatting In the Wild | Unknown | N/A | |
| Dynamic 3D Gaussian Fields for Urban Areas | Unknown | N/A | |
| PACE: Marrying generalization in PArameter-efficient fine-tuning with Consistency rEgularization | Unknown | N/A | |
| Prototypical Hash Encoding for On-the-Fly Fine-Grained Category Discovery | Unknown | N/A | |
| FasterDiT: Towards Faster Diffusion Transformers Training without Architecture Modification | Unknown | N/A | |
| Interpretable Mesomorphic Networks for Tabular Data | Unknown | N/A | |
| Classification Done Right for Vision-Language Pre-Training | Unknown | N/A | |
| SplitNeRF: Split Sum Approximation Neural Field for Joint Geometry, Illumination, and Material Estimation | Unknown | N/A | |
| Implicit Multimodal Alignment: On the Generalization of Frozen LLMs to Multimodal Inputs | Unknown | N/A | |
| Reasoning Multi-Agent Behavioral Topology for Interactive Autonomous Driving | Unknown | N/A | |
| Empowering and Assessing the Utility of Large Language Models in Crop Science | Unknown | N/A | |
| LLM-ESR: Large Language Models Enhancement for Long-tailed Sequential Recommendation | Unknown | N/A | |
| Sparse maximal update parameterization: A holistic approach to sparse training dynamics | Unknown | N/A | |
| HEST-1k: A Dataset For Spatial Transcriptomics and Histology Image Analysis | Unknown | N/A | |
| Performative Control for Linear Dynamical Systems | Unknown | N/A | |
| Elucidating the Design Space of Dataset Condensation | Unknown | N/A | |
| ChatCam: Empowering Camera Control through Conversational AI | Unknown | N/A | |
| VideoGUI: A Benchmark for GUI Automation from Instructional Videos | Unknown | N/A | |
| Diffusion Tuning: Transferring Diffusion Models via Chain of Forgetting | Unknown | N/A | |
| How to Use Diffusion Priors under Sparse Views? | Unknown | N/A | |
| BetterBench: Assessing AI Benchmarks, Uncovering Issues, and Establishing Best Practices | Unknown | N/A | |
| MAN TruckScenes: A multimodal dataset for autonomous trucking in diverse conditions | Unknown | N/A | |
| SlowFocus: Enhancing Fine-grained Temporal Understanding in Video LLM | Unknown | N/A | |
| JaxMARL: Multi-Agent RL Environments and Algorithms in JAX | Unknown | N/A | |
| A Systematic Review of NeurIPS Dataset Management Practices | Unknown | N/A | |
| Flaws can be Applause: Unleashing Potential of Segmenting Ambiguous Objects in SAM | Unknown | N/A | |
| Human-Aware Vision-and-Language Navigation: Bridging Simulation to Reality with Dynamic Human Interactions | Unknown | N/A | |
| Visual Riddles: a Commonsense and World Knowledge Challenge for Large Vision and Language Models | Unknown | N/A | |
| DeMo: Decoupling Motion Forecasting into Directional Intentions and Dynamic States | Unknown | N/A | |
| RoleAgent: Building, Interacting, and Benchmarking High-quality Role-Playing Agents from Scripts | Unknown | N/A | |
| NN4SysBench: Characterizing Neural Network Verification for Computer Systems | Unknown | N/A | |
| TaskBench: Benchmarking Large Language Models for Task Automation | Unknown | N/A | |
| Adapting Diffusion Models for Improved Prompt Compliance and Controllable Image Synthesis | Unknown | N/A | |
| MMM-RS: A Multi-modal, Multi-GSD, Multi-scene Remote Sensing Dataset and Benchmark for Text-to-Image Generation | Unknown | N/A | |
| The Best of Both Worlds: On the Dilemma of Out-of-distribution Detection | Unknown | N/A | |
| Rethinking Misalignment in Vision-Language Model Adaptation from a Causal Perspective | Unknown | N/A | |
| A Cross-Domain Benchmark for Active Learning | Unknown | N/A | |
| HARMONIC: Harnessing LLMs for Tabular Data Synthesis and Privacy Protection | Unknown | N/A | |
| DiffusionFake: Enhancing Generalization in Deepfake Detection via Guided Stable Diffusion | Unknown | N/A | |
| MMLONGBENCH-DOC: Benchmarking Long-context Document Understanding with Visualizations | Unknown | N/A | |
| Federated Model Heterogeneous Matryoshka Representation Learning | Unknown | N/A | |
| JailbreakBench: An Open Robustness Benchmark for Jailbreaking Large Language Models | Unknown | N/A | |
| Mini-Sequence Transformers: Optimizing Intermediate Memory for Long Sequences Training | Unknown | N/A | |
| WFCRL: A Multi-Agent Reinforcement Learning Benchmark for Wind Farm Control | Unknown | N/A | |
| DAT: Improving Adversarial Robustness via Generative Amplitude Mix-up in Frequency Domain | Unknown | N/A | |
| StreamBench: Towards Benchmarking Continuous Improvement of Language Agents | Unknown | N/A | |
| RobIR: Robust Inverse Rendering for High-Illumination Scenes | Unknown | N/A | |
| WikiDBs: A Large-Scale Corpus Of Relational Databases From Wikidata | Unknown | N/A | |
| Neuro-Vision to Language: Enhancing Brain Recording-based Visual Reconstruction and Language Interaction | Unknown | N/A | |
| ChatTracker: Enhancing Visual Tracking Performance via Chatting with Multimodal Large Language Model | Unknown | N/A | |
| Mars: Situated Inductive Reasoning in an Open-World Environment | Unknown | N/A | |
| Efficient Sketches for Training Data Attribution and Studying the Loss Landscape | Unknown | N/A | |
| FEDMEKI: A Benchmark for Scaling Medical Foundation Models via Federated Knowledge Injection | Unknown | N/A | |
| Vision-Language Navigation with Energy-Based Policy | Unknown | N/A | |
| Quantifying and Optimizing Global Faithfulness in Persona-driven Role-playing | Unknown | N/A | |
| DisC-GS: Discontinuity-aware Gaussian Splatting | Unknown | N/A | |
| Benchmarking Structural Inference Methods for Interacting Dynamical Systems with Synthetic Data | Unknown | N/A | |
| Towards Visual Text Design Transfer Across Languages | Unknown | N/A | |
| Membership Inference on Text-to-Image Diffusion Models via Conditional Likelihood Discrepancy | Unknown | N/A | |
| UAV3D: A Large-scale 3D Perception Benchmark for Unmanned Aerial Vehicles | Unknown | N/A | |
| Paloma: A Benchmark for Evaluating Language Model Fit | Unknown | N/A | |
| SeeA*: Efficient Exploration-Enhanced A* Search by Selective Sampling | Unknown | N/A | |
| WISE: Rethinking the Knowledge Memory for Lifelong Model Editing of Large Language Models | Unknown | N/A | |
| DMC-VB: A Benchmark for Representation Learning for Control with Visual Distractors | Unknown | N/A | |
| FlexSBDD: Structure-Based Drug Design with Flexible Protein Modeling | Unknown | N/A | |
| $\texttt{pfl-research}$: simulation framework for accelerating research in Private Federated Learning | Unknown | N/A | |
| Bias and Volatility: A Statistical Framework for Evaluating Large Language Model's Stereotypes and the Associated Generation Inconsistency | Unknown | N/A | |
| Historical Test-time Prompt Tuning for Vision Foundation Models | Unknown | N/A | |
| Instruction Tuning Large Language Models to Understand Electronic Health Records | Unknown | N/A | |
| Contrasting with Symile: Simple Model-Agnostic Representation Learning for Unlimited Modalities | Unknown | N/A | |
| Structured Matrix Basis for Multivariate Time Series Forecasting with Interpretable Dynamics | Unknown | N/A | |
| ODRL: A Benchmark for Off-Dynamics Reinforcement Learning | Unknown | N/A | |
| Sharing Key Semantics in Transformer Makes Efficient Image Restoration | Unknown | N/A | |
| Localizing Memorization in SSL Vision Encoders | Unknown | N/A | |
| A StrongREJECT for Empty Jailbreaks | Unknown | N/A | |
| Marginal Causal Flows for Validation and Inference | Unknown | N/A | |
| Nuclear Fusion Diamond Polishing Dataset | Unknown | N/A | |
| SG-Nav: Online 3D Scene Graph Prompting for LLM-based Zero-shot Object Navigation | Unknown | N/A | |
| E.T. Bench: Towards Open-Ended Event-Level Video-Language Understanding | Unknown | N/A | |
| DINTR: Tracking via Diffusion-based Interpolation | Unknown | N/A | |
| FLAME: Factuality-Aware Alignment for Large Language Models | Unknown | N/A | |
| GS-Blur: A 3D Scene-Based Dataset for Realistic Image Deblurring | Unknown | N/A | |
| Tetrahedron Splatting for 3D Generation | Unknown | N/A | |
| Integrating Suboptimal Human Knowledge with Hierarchical Reinforcement Learning for Large-Scale Multiagent Systems | Unknown | N/A | |
| MultiOrg: A Multi-rater Organoid-detection Dataset | Unknown | N/A | |
| Personalized Instance-based Navigation Toward User-Specific Objects in Realistic Environments | Unknown | N/A | |
| VRSBench: A Versatile Vision-Language Benchmark Dataset for Remote Sensing Image Understanding | Unknown | N/A | |
| MEQA: A Benchmark for Multi-hop Event-centric Question Answering with Explanations | Unknown | N/A | |
| A Generative Model of Symmetry Transformations | Unknown | N/A | |
| PersonalSum: A User-Subjective Guided Personalized Summarization Dataset for Large Language Models | Unknown | N/A | |
| SCube: Instant Large-Scale Scene Reconstruction using VoxSplats | Unknown | N/A | |
| Token Merging for Training-Free Semantic Binding in Text-to-Image Synthesis | Unknown | N/A | |
| HC-GAE: The Hierarchical Cluster-based Graph Auto-Encoder for Graph Representation Learning | Unknown | N/A | |
| PROSPECT PTMs: Rich Labeled Tandem Mass Spectrometry Dataset of Modified Peptides for Machine Learning in Proteomics | Unknown | N/A | |
| Perceptual Fairness in Image Restoration | Unknown | N/A | |
| ABCFair: an Adaptable Benchmark approach for Comparing Fairness Methods | Unknown | N/A | |
| Biomedical Visual Instruction Tuning with Clinician Preference Alignment | Unknown | N/A | |
| DeepLag: Discovering Deep Lagrangian Dynamics for Intuitive Fluid Prediction | Unknown | N/A | |
| UrbanDataLayer: A Unified Data Pipeline for Urban Science | Unknown | N/A | |
| Wasserstein Distance Rivals Kullback-Leibler Divergence for Knowledge Distillation | Unknown | N/A | |
| SuperVLAD: Compact and Robust Image Descriptors for Visual Place Recognition | Unknown | N/A | |
| Persistent Test-time Adaptation in Recurring Testing Scenarios | Unknown | N/A | |
| SurgicAI: A Hierarchical Platform for Fine-Grained Surgical Policy Learning and Benchmarking | Unknown | N/A | |
| Tell What You Hear From What You See - Video to Audio Generation Through Text | Unknown | N/A | |
| A Neuro-Symbolic Benchmark Suite for Concept Quality and Reasoning Shortcuts | Unknown | N/A | |
| Diversity-Driven Synthesis: Enhancing Dataset Distillation through Directed Weight Adjustment | Unknown | N/A | |
| M3LEO: A Multi-Modal, Multi-Label Earth Observation Dataset Integrating Interferometric SAR and Multispectral Data | Unknown | N/A | |
| WorkArena++: Towards Compositional Planning and Reasoning-based Common Knowledge Work Tasks | Unknown | N/A | |
| kGym: A Platform and Dataset to Benchmark Large Language Models on Linux Kernel Crash Resolution | Unknown | N/A | |
| Neural Gaffer: Relighting Any Object via Diffusion | Unknown | N/A | |
| A hierarchical decomposition for explaining ML performance discrepancies | Unknown | N/A | |
| ACFun: Abstract-Concrete Fusion Facial Stylization | Unknown | N/A | |
| PromptFix: You Prompt and We Fix the Photo | Unknown | N/A | |
| WikiContradict: A Benchmark for Evaluating LLMs on Real-World Knowledge Conflicts from Wikipedia | Unknown | N/A | |
| Understanding Bias in Large-Scale Visual Datasets | Unknown | N/A | |
| HAWK: Learning to Understand Open-World Video Anomalies | Unknown | N/A | |
| Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies | Unknown | N/A | |
| Taming Diffusion Prior for Image Super-Resolution with Domain Shift SDEs | Unknown | N/A | |
| The Elephant in the Room: Towards A Reliable Time-Series Anomaly Detection Benchmark | Unknown | N/A | |
| The Art of Saying No: Contextual Noncompliance in Language Models | Unknown | N/A | |
| A Practitioner's Guide to Real-World Continual Multimodal Pretraining | Unknown | N/A | |
| GV-Rep: A Large-Scale Dataset for Genetic Variant Representation Learning | Unknown | N/A | |
| CTIBench: A Benchmark for Evaluating LLMs in Cyber Threat Intelligence | Unknown | N/A | |
| Goal Conditioned Reinforcement Learning for Photo Finishing Tuning | Unknown | N/A | |
| CARE: a Benchmark Suite for the Classification and Retrieval of Enzymes | Unknown | N/A | |
| Using Unity to Help Solve Reinforcement Learning | Unknown | N/A | |
| Off-Policy Selection for Initiating Human-Centric Experimental Design | Unknown | N/A | |
| Rethinking No-reference Image Exposure Assessment from Holism to Pixel: Models, Datasets and Benchmarks | Unknown | N/A | |
| EHRCon: Dataset for Checking Consistency between Unstructured Notes and Structured Tables in Electronic Health Records | Unknown | N/A | |
| GameTraversalBenchmark: Evaluating Planning Abilities Of Large Language Models Through Traversing 2D Game Maps | Unknown | N/A | |
| End-to-end Learnable Clustering for Intent Learning in Recommendation | Unknown | N/A | |
| InfiBench: Evaluating the Question-Answering Capabilities of Code Large Language Models | Unknown | N/A | |
| GLBench: A Comprehensive Benchmark for Graph with Large Language Models | Unknown | N/A | |
| T2Vs Meet VLMs: A Scalable Multimodal Dataset for Visual Harmfulness Recognition | Unknown | N/A | |
| Benchmarking Complex Instruction-Following with Multiple Constraints Composition | Unknown | N/A | |
| OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments | Unknown | N/A | |
| CVQA: Culturally-diverse Multilingual Visual Question Answering Benchmark | Unknown | N/A | |
| Unraveling Molecular Structure: A Multimodal Spectroscopic Dataset for Chemistry | Unknown | N/A | |
| Conjugate Bayesian Two-step Change Point Detection for Hawkes Process | Unknown | N/A | |
| Evaluating language models as risk scores | Unknown | N/A | |
| IndicVoices-R: Unlocking a Massive Multilingual Multi-speaker Speech Corpus for Scaling Indian TTS | Unknown | N/A | |
| Spec-Gaussian: Anisotropic View-Dependent Appearance for 3D Gaussian Splatting | Unknown | N/A | |
| LINGOLY: A Benchmark of Olympiad-Level Linguistic Reasoning Puzzles in Low Resource and Extinct Languages | Unknown | N/A | |
| Near-Optimal Streaming Heavy-Tailed Statistical Estimation with Clipped SGD | Unknown | N/A | |
| Bayesian Optimisation with Unknown Hyperparameters: Regret Bounds Logarithmically Closer to Optimal | Unknown | N/A | |
| Exploring Low-Dimensional Subspace in Diffusion Models for Controllable Image Editing | Unknown | N/A | |
| Approaching Human-Level Forecasting with Language Models | Unknown | N/A | |
| BELM: Bidirectional Explicit Linear Multi-step Sampler for Exact Inversion in Diffusion Models | Unknown | N/A | |
| Shadowheart SGD: Distributed Asynchronous SGD with Optimal Time Complexity Under Arbitrary Computation and Communication Heterogeneity | Unknown | N/A | |
| Learning 3D Equivariant Implicit Function with Patch-Level Pose-Invariant Representation | Unknown | N/A | |
| Interactive Deep Clustering via Value Mining | Unknown | N/A | |
| UltraPixel: Advancing Ultra High-Resolution Image Synthesis to New Peaks | Unknown | N/A | |
| NeuRodin: A Two-stage Framework for High-Fidelity Neural Surface Reconstruction | Unknown | N/A | |
| DU-Shapley: A Shapley Value Proxy for Efficient Dataset Valuation | Unknown | N/A | |
| Reinforcement Learning Gradients as Vitamin for Online Finetuning Decision Transformers | Unknown | N/A | |
| Addressing Bias in Online Selection with Limited Budget of Comparisons | Unknown | N/A | |
| Neural Isometries: Taming Transformations for Equivariant ML | Unknown | N/A | |
| Style Adaptation and Uncertainty Estimation for Multi-Source Blended-Target Domain Adaptation | Unknown | N/A | |
| Robust Fine-tuning of Zero-shot Models via Variance Reduction | Unknown | N/A | |
| Richelieu: Self-Evolving LLM-Based Agents for AI Diplomacy | Unknown | N/A | |
| Improved off-policy training of diffusion samplers | Unknown | N/A | |
| Noisy Dual Mirror Descent: A Near Optimal Algorithm for Jointly-DP Convex Resource Allocation | Unknown | N/A | |
| A2PO: Towards Effective Offline Reinforcement Learning from an Advantage-aware Perspective | Unknown | N/A | |
| Multimodal Task Vectors Enable Many-Shot Multimodal In-Context Learning | Unknown | N/A | |
| Large Scale Transfer Learning for Tabular Data via Language Modeling | Unknown | N/A | |
| Chat-Scene: Bridging 3D Scene and Large Language Models with Object Identifiers | Unknown | N/A | |
| Return of Unconditional Generation: A Self-supervised Representation Generation Method | Unknown | N/A | |
| Interpreting and Analysing CLIP's Zero-Shot Image Classification via Mutual Knowledge | Unknown | N/A | |
| Towards Flexible Visual Relationship Segmentation | Unknown | N/A | |
| Diff-eRank: A Novel Rank-Based Metric for Evaluating Large Language Models | Unknown | N/A | |
| Fourier Neural Operator with Learned Deformations for PDEs on General Geometries | Unknown | N/A | |
| Stepwise Alignment for Constrained Language Model Policy Optimization | Unknown | N/A | |
| Rethinking Transformer for Long Contextual Histopathology Whole Slide Image Analysis | Unknown | N/A | |
| Generate Universal Adversarial Perturbations for Few-Shot Learning | Unknown | N/A | |
| VLMimic: Vision Language Models are Visual Imitation Learner for Fine-grained Actions | Unknown | N/A | |
| Not All Diffusion Model Activations Have Been Evaluated as Discriminative Features | Unknown | N/A | |
| Towards Diverse Device Heterogeneous Federated Learning via Task Arithmetic Knowledge Integration | Unknown | N/A | |
| Stability and Generalizability in SDE Diffusion Models with Measure-Preserving Dynamics | Unknown | N/A | |
| R$^2$-Gaussian: Rectifying Radiative Gaussian Splatting for Tomographic Reconstruction | Unknown | N/A | |
| TFS-NeRF: Template-Free NeRF for Semantic 3D Reconstruction of Dynamic Scene | Unknown | N/A | |
| Amortizing intractable inference in diffusion models for vision, language, and control | Unknown | N/A | |
| GSDF: 3DGS Meets SDF for Improved Neural Rendering and Reconstruction | Unknown | N/A | |
| Multi-Agent Coordination via Multi-Level Communication | Unknown | N/A | |
| UQ-Guided Hyperparameter Optimization for Iterative Learners | Unknown | N/A | |
| ImOV3D: Learning Open Vocabulary Point Clouds 3D Object Detection from Only 2D Images | Unknown | N/A | |
| Compositional PAC-Bayes: Generalization of GNNs with persistence and beyond | Unknown | N/A | |
| SocialGPT: Prompting LLMs for Social Relation Reasoning via Greedy Segment Optimization | Unknown | N/A | |
| Cherry on Top: Parameter Heterogeneity and Quantization in Large Language Models | Unknown | N/A | |
| Closed-Loop Visuomotor Control with Generative Expectation for Robotic Manipulation | Unknown | N/A | |
| Efficient and Sharp Off-Policy Evaluation in Robust Markov Decision Processes | Unknown | N/A | |
| MoMu-Diffusion: On Learning Long-Term Motion-Music Synchronization and Correspondence | Unknown | N/A | |
| Optimal Algorithms for Online Convex Optimization with Adversarial Constraints | Unknown | N/A | |
| DN-4DGS: Denoised Deformable Network with Temporal-Spatial Aggregation for Dynamic Scene Rendering | Unknown | N/A | |
| Video Token Merging for Long Video Understanding | Unknown | N/A | |
| Minimax Optimal and Computationally Efficient Algorithms for Distributionally Robust Offline Reinforcement Learning | Unknown | N/A | |
| Gradient-based Discrete Sampling with Automatic Cyclical Scheduling | Unknown | N/A | |
| PLIP: Language-Image Pre-training for Person Representation Learning | Unknown | N/A | |
| Can Large Language Model Agents Simulate Human Trust Behavior? | Unknown | N/A | |
| Rethinking Out-of-Distribution Detection on Imbalanced Data Distribution | Unknown | N/A | |
| Text-DiFuse: An Interactive Multi-Modal Image Fusion Framework based on Text-modulated Diffusion Model | Unknown | N/A | |
| FuseMoE: Mixture-of-Experts Transformers for Fleximodal Fusion | Unknown | N/A | |
| Octopus: A Multi-modal LLM with Parallel Recognition and Sequential Understanding | Unknown | N/A | |
| Implicit Curriculum in Procgen Made Explicit | Unknown | N/A | |
| Unsupervised Modality Adaptation with Text-to-Image Diffusion Models for Semantic Segmentation | Unknown | N/A | |
| MambaTree: Tree Topology is All You Need in State Space Model | Unknown | N/A | |
| Almost Minimax Optimal Best Arm Identification in Piecewise Stationary Linear Bandits | Unknown | N/A | |
| Boosting Graph Pooling with Persistent Homology | Unknown | N/A | |
| LG-CAV: Train Any Concept Activation Vector with Language Guidance | Unknown | N/A | |
| The Map Equation Goes Neural: Mapping Network Flows with Graph Neural Networks | Unknown | N/A | |
| Instruction Tuning With Loss Over Instructions | Unknown | N/A | |
| NVRC: Neural Video Representation Compression | Unknown | N/A | |
| 4+3 Phases of Compute-Optimal Neural Scaling Laws | Unknown | N/A | |
| Causal Temporal Representation Learning with Nonstationary Sparse Transition | Unknown | N/A | |
| Optimal deep learning of holomorphic operators between Banach spaces | Unknown | N/A | |
| A Globally Optimal Portfolio for m-Sparse Sharpe Ratio Maximization | Unknown | N/A | |
| Fine-grained Control of Generative Data Augmentation in IoT Sensing | Unknown | N/A | |
| BackTime: Backdoor Attacks on Multivariate Time Series Forecasting | Unknown | N/A | |
| On-Road Object Importance Estimation: A New Dataset and A Model with Multi-Fold Top-Down Guidance | Unknown | N/A | |
| Hierarchical Hybrid Sliced Wasserstein: A Scalable Metric for Heterogeneous Joint Distributions | Unknown | N/A | |
| Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models | Unknown | N/A | |
| ActFusion: a Unified Diffusion Model for Action Segmentation and Anticipation | Unknown | N/A | |
| Hyper-opinion Evidential Deep Learning for Out-of-Distribution Detection | Unknown | N/A | |
| BAdam: A Memory Efficient Full Parameter Optimization Method for Large Language Models | Unknown | N/A | |
| CigTime: Corrective Instruction Generation Through Inverse Motion Editing | Unknown | N/A | |
| Event-3DGS: Event-based 3D Reconstruction Using 3D Gaussian Splatting | Unknown | N/A | |
| Efficient LLM Scheduling by Learning to Rank | Unknown | N/A | |
| Dealing with Synthetic Data Contamination in Online Continual Learning | Unknown | N/A | |
| Towards Flexible 3D Perception: Object-Centric Occupancy Completion Augments 3D Object Detection | Unknown | N/A | |
| Not All Tokens Are What You Need for Pretraining | Unknown | N/A | |
| Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning | Unknown | N/A | |
| Opponent Modeling based on Subgoal Inference | Unknown | N/A | |
| Dual Cone Gradient Descent for Training Physics-Informed Neural Networks | Unknown | N/A | |
| OneActor: Consistent Subject Generation via Cluster-Conditioned Guidance | Unknown | N/A | |
| Fourier-enhanced Implicit Neural Fusion Network for Multispectral and Hyperspectral Image Fusion | Unknown | N/A | |
| Multi-times Monte Carlo Rendering for Inter-reflection Reconstruction | Unknown | N/A | |
| FOOGD: Federated Collaboration for Both Out-of-distribution Generalization and Detection | Unknown | N/A | |
| Adapting to Unknown Low-Dimensional Structures in Score-Based Diffusion Models | Unknown | N/A | |
| QTIP: Quantization with Trellises and Incoherence Processing | Unknown | N/A | |
| The Space Complexity of Approximating Logistic Loss | Unknown | N/A | |
| The Mamba in the Llama: Distilling and Accelerating Hybrid Models | Unknown | N/A | |
| Towards Robust Multimodal Sentiment Analysis with Incomplete Data | Unknown | N/A | |
| Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion | Unknown | N/A | |
| HYSYNTH: Context-Free LLM Approximation for Guiding Program Synthesis | Unknown | N/A | |
| Neural Experts: Mixture of Experts for Implicit Neural Representations | Unknown | N/A | |
| Is One GPU Enough? Pushing Image Generation at Higher-Resolutions with Foundation Models. | Unknown | N/A | |
| Gaussian Graph Network: Learning Efficient and Generalizable Gaussian Representations from Multi-view Images | Unknown | N/A | |
| Infinite-Dimensional Feature Interaction | Unknown | N/A | |
| FreqBlender: Enhancing DeepFake Detection by Blending Frequency Knowledge | Unknown | N/A | |
| Trading off Consistency and Dimensionality of Convex Surrogates for Multiclass Classification | Unknown | N/A | |
| Taming Cross-Domain Representation Variance in Federated Prototype Learning with Heterogeneous Data Domains | Unknown | N/A | |
| SAFE: Slow and Fast Parameter-Efficient Tuning for Continual Learning with Pre-Trained Models | Unknown | N/A | |
| Prospective Representation Learning for Non-Exemplar Class-Incremental Learning | Unknown | N/A | |
| Trap-MID: Trapdoor-based Defense against Model Inversion Attacks | Unknown | N/A | |
| Aligning Vision Models with Human Aesthetics in Retrieval: Benchmarks and Algorithms | Unknown | N/A | |
| EAGLE: Efficient Adaptive Geometry-based Learning in Cross-view Understanding | Unknown | N/A | |
| Bifröst: 3D-Aware Image Compositing with Language Instructions | Unknown | N/A | |
| Domain Adaptation for Large-Vocabulary Object Detectors | Unknown | N/A | |
| Interpreting Learned Feedback Patterns in Large Language Models | Unknown | N/A | |
| First-Order Methods for Linearly Constrained Bilevel Optimization | Unknown | N/A | |
| Preferential Normalizing Flows | Unknown | N/A | |
| MomentumSMoE: Integrating Momentum into Sparse Mixture of Experts | Unknown | N/A | |
| Ada-MSHyper: Adaptive Multi-Scale Hypergraph Transformer for Time Series Forecasting | Unknown | N/A | |
| Bridging The Gap between Low-rank and Orthogonal Adaptation via Householder Reflection Adaptation | Unknown | N/A | |
| InfLLM: Training-Free Long-Context Extrapolation for LLMs with an Efficient Context Memory | Unknown | N/A | |
| Logical characterizations of recurrent graph neural networks with reals and floats | Unknown | N/A | |
| AGILE: A Novel Reinforcement Learning Framework of LLM Agents | Unknown | N/A | |
| SearchLVLMs: A Plug-and-Play Framework for Augmenting Large Vision-Language Models by Searching Up-to-Date Internet Knowledge | Unknown | N/A | |
| Resolving Discrepancies in Compute-Optimal Scaling of Language Models | Unknown | N/A | |
| CausalDiff: Causality-Inspired Disentanglement via Diffusion Model for Adversarial Defense | Unknown | N/A | |
| SpikeReveal: Unlocking Temporal Sequences from Real Blurry Inputs with Spike Streams | Unknown | N/A | |
| Constrained Sampling with Primal-Dual Langevin Monte Carlo | Unknown | N/A | |
| Self-playing Adversarial Language Game Enhances LLM Reasoning | Unknown | N/A | |
| S2HPruner: Soft-to-Hard Distillation Bridges the Discretization Gap in Pruning | Unknown | N/A | |
| SDformer: Similarity-driven Discrete Transformer For Time Series Generation | Unknown | N/A | |
| Quantifying Aleatoric Uncertainty of the Treatment Effect: A Novel Orthogonal Learner | Unknown | N/A | |
| SMART: Towards Pre-trained Missing-Aware Model for Patient Health Status Prediction | Unknown | N/A | |
| Team-Fictitious Play for Reaching Team-Nash Equilibrium in Multi-team Games | Unknown | N/A | |
| RL-GPT: Integrating Reinforcement Learning and Code-as-policy | Unknown | N/A | |
| MindMerger: Efficiently Boosting LLM Reasoning in non-English Languages | Unknown | N/A | |
| Rethinking Parity Check Enhanced Symmetry-Preserving Ansatz | Unknown | N/A | |
| Shaping the distribution of neural responses with interneurons in a recurrent circuit model | Unknown | N/A | |
| CLIP in Mirror: Disentangling text from visual images through reflection | Unknown | N/A | |
| Sample-Efficient Constrained Reinforcement Learning with General Parameterization | Unknown | N/A | |
| Improved Regret for Bandit Convex Optimization with Delayed Feedback | Unknown | N/A | |
| GRANOLA: Adaptive Normalization for Graph Neural Networks | Unknown | N/A | |
| Learning the Latent Causal Structure for Modeling Label Noise | Unknown | N/A | |
| Multimodal Large Language Models Make Text-to-Image Generative Models Align Better | Unknown | N/A | |
| SCaR: Refining Skill Chaining for Long-Horizon Robotic Manipulation via Dual Regularization | Unknown | N/A | |
| Streaming Bayes GFlowNets | Unknown | N/A | |
| Ferrari: Federated Feature Unlearning via Optimizing Feature Sensitivity | Unknown | N/A | |
| GIC: Gaussian-Informed Continuum for Physical Property Identification and Simulation | Unknown | N/A | |
| Breaking Semantic Artifacts for Generalized AI-generated Image Detection | Unknown | N/A | |
| Text-Infused Attention and Foreground-Aware Modeling for Zero-Shot Temporal Action Detection | Unknown | N/A | |
| A Simple and Adaptive Learning Rate for FTRL in Online Learning with Minimax Regret of $\Theta(T^{2/3})$ and its Application to Best-of-Both-Worlds | Unknown | N/A | |
| Learning Cortico-Muscular Dependence through Orthonormal Decomposition of Density Ratios | Unknown | N/A | |
| Cascade of phase transitions in the training of energy-based models | Unknown | N/A | |
| Langevin Unlearning: A New Perspective of Noisy Gradient Descent for Machine Unlearning | Unknown | N/A | |
| An Improved Empirical Fisher Approximation for Natural Gradient Descent | Unknown | N/A | |
| SelfCodeAlign: Self-Alignment for Code Generation | Unknown | N/A | |
| ColJailBreak: Collaborative Generation and Editing for Jailbreaking Text-to-Image Deep Generation | Unknown | N/A | |
| Certified Machine Unlearning via Noisy Stochastic Gradient Descent | Unknown | N/A | |
| A Concept-Based Explainability Framework for Large Multimodal Models | Unknown | N/A | |
| Distributional Reinforcement Learning with Regularized Wasserstein Loss | Unknown | N/A | |
| Fair Online Bilateral Trade | Unknown | N/A | |
| Pseudo-Private Data Guided Model Inversion Attacks | Unknown | N/A | |
| Binary Search with Distributional Predictions | Unknown | N/A | |
| Towards Explainable Evaluation Metrics for Machine Translation | Unknown | N/A | |
| Random Cycle Coding: Lossless Compression of Cluster Assignments via Bits-Back Coding | Unknown | N/A | |
| NeuroPath: A Neural Pathway Transformer for Joining the Dots of Human Connectomes | Unknown | N/A | |
| QuaRot: Outlier-Free 4-Bit Inference in Rotated LLMs | Unknown | N/A | |
| Towards the Transferability of Rewards Recovered via Regularized Inverse Reinforcement Learning | Unknown | N/A | |
| Identify Then Recommend: Towards Unsupervised Group Recommendation | Unknown | N/A | |
| Empowering Visible-Infrared Person Re-Identification with Large Foundation Models | Unknown | N/A | |
| Community Detection Guarantees using Embeddings Learned by Node2Vec | Unknown | N/A | |
| Decoupling Semantic Similarity from Spatial Alignment for Neural Networks. | Unknown | N/A | |
| Conditional Density Estimation with Histogram Trees | Unknown | N/A | |
| Rethinking LLM Memorization through the Lens of Adversarial Compression | Unknown | N/A | |
| Almost Free: Self-concordance in Natural Exponential Families and an Application to Bandits | Unknown | N/A | |
| Unveiling LoRA Intrinsic Ranks via Salience Analysis | Unknown | N/A | |
| Annealed Multiple Choice Learning: Overcoming limitations of Winner-takes-all with annealing | Unknown | N/A | |
| LLMDFA: Analyzing Dataflow in Code with Large Language Models | Unknown | N/A | |
| Oracle-Efficient Differentially Private Learning with Public Data | Unknown | N/A | |
| On the Use of Anchoring for Training Vision Models | Unknown | N/A | |
| Probing Social Bias in Labor Market Text Generation by ChatGPT: A Masked Language Model Approach | Unknown | N/A | |
| Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding | Unknown | N/A | |
| Unified Speech Recognition: A Single Model for Auditory, Visual, and Audiovisual Inputs | Unknown | N/A | |
| Fisher Flow Matching for Generative Modeling over Discrete Data | Unknown | N/A | |
| Bisimulation Metrics are Optimal Transport Distances, and Can be Computed Efficiently | Unknown | N/A | |
| Omnigrasp: Grasping Diverse Objects with Simulated Humanoids | Unknown | N/A | |
| Accelerating Augmentation Invariance Pretraining | Unknown | N/A | |
| Training Data Attribution via Approximate Unrolling | Unknown | N/A | |
| Transformers Can Do Arithmetic with the Right Embeddings | Unknown | N/A | |
| DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving | Unknown | N/A | |
| Exploring Consistency in Graph Representations: from Graph Kernels to Graph Neural Networks | Unknown | N/A | |
| Beyond Accuracy: Ensuring Correct Predictions With Correct Rationales | Unknown | N/A | |
| ChatQA: Surpassing GPT-4 on Conversational QA and RAG | Unknown | N/A | |
| Not so griddy: Internal representations of RNNs path integrating more than one agent | Unknown | N/A | |
| The Closeness of In-Context Learning and Weight Shifting for Softmax Regression | Unknown | N/A | |
| Instance-Specific Asymmetric Sensitivity in Differential Privacy | Unknown | N/A | |
| Approximation Rate of the Transformer Architecture for Sequence Modeling | Unknown | N/A | |
| In-Context Learning with Representations: Contextual Generalization of Trained Transformers | Unknown | N/A | |
| Dataset Decomposition: Faster LLM Training with Variable Sequence Length Curriculum | Unknown | N/A | |
| Towards Exact Gradient-based Training on Analog In-memory Computing | Unknown | N/A | |
| Identifiability Analysis of Linear ODE Systems with Hidden Confounders | Unknown | N/A | |
| AHA: Human-Assisted Out-of-Distribution Generalization and Detection | Unknown | N/A | |
| Linear Causal Representation Learning from Unknown Multi-node Interventions | Unknown | N/A | |
| RegExplainer: Generating Explanations for Graph Neural Networks in Regression Tasks | Unknown | N/A | |
| Reconstructing the Image Stitching Pipeline: Integrating Fusion and Rectangling into a Unified Inpainting Model | Unknown | N/A | |
| Fair Kernel K-Means: from Single Kernel to Multiple Kernel | Unknown | N/A | |
| An Autoencoder-Like Nonnegative Matrix Co-Factorization for Improved Student Cognitive Modeling | Unknown | N/A | |
| Active Perception for Grasp Detection via Neural Graspness Field | Unknown | N/A | |
| AlphaPruning: Using Heavy-Tailed Self Regularization Theory for Improved Layer-wise Pruning of Large Language Models | Unknown | N/A | |
| Absorb & Escape: Overcoming Single Model Limitations in Generating Heterogeneous Genomic Sequences | Unknown | N/A | |
| Adaptive Visual Scene Understanding: Incremental Scene Graph Generation | Unknown | N/A | |
| DoFIT: Domain-aware Federated Instruction Tuning with Alleviated Catastrophic Forgetting | Unknown | N/A | |
| Robust Offline Active Learning on Graphs | Unknown | N/A | |
| Local Curvature Smoothing with Stein's Identity for Efficient Score Matching | Unknown | N/A | |
| Unveiling Encoder-Free Vision-Language Models | Unknown | N/A | |
| SlimGPT: Layer-wise Structured Pruning for Large Language Models | Unknown | N/A | |
| Decoding-Time Language Model Alignment with Multiple Objectives | Unknown | N/A | |
| Automated Label Unification for Multi-Dataset Semantic Segmentation with GNNs | Unknown | N/A | |
| Make Your LLM Fully Utilize the Context | Unknown | N/A | |
| On the Curses of Future and History in Future-dependent Value Functions for Off-policy Evaluation | Unknown | N/A | |
| Temporal Graph Neural Tangent Kernel with Graphon-Guaranteed | Unknown | N/A | |
| Faster Local Solvers for Graph Diffusion Equations | Unknown | N/A | |
| Meaningful Learning: Enhancing Abstract Reasoning in Large Language Models via Generic Fact Guidance | Unknown | N/A | |
| Memory-Efficient Gradient Unrolling for Large-Scale Bi-level Optimization | Unknown | N/A | |
| Graph Neural Flows for Unveiling Systemic Interactions Among Irregularly Sampled Time Series | Unknown | N/A | |
| StrategyLLM: Large Language Models as Strategy Generators, Executors, Optimizers, and Evaluators for Problem Solving | Unknown | N/A | |
| MR-Ben: A Meta-Reasoning Benchmark for Evaluating System-2 Thinking in LLMs | Unknown | N/A | |
| MKGL: Mastery of a Three-Word Language | Unknown | N/A | |
| Understanding Representation of Deep Equilibrium Models from Neural Collapse Perspective | Unknown | N/A | |
| Rethinking The Training And Evaluation of Rich-Context Layout-to-Image Generation | Unknown | N/A | |
| OneRef: Unified One-tower Expression Grounding and Segmentation with Mask Referring Modeling | Unknown | N/A | |
| GenRec: Unifying Video Generation and Recognition with Diffusion Models | Unknown | N/A | |
| OMG-LLaVA: Bridging Image-level, Object-level, Pixel-level Reasoning and Understanding | Unknown | N/A | |
| Towards Open-Vocabulary Semantic Segmentation Without Semantic Labels | Unknown | N/A | |
| SemFlow: Binding Semantic Segmentation and Image Synthesis via Rectified Flow | Unknown | N/A | |
| Uniform Last-Iterate Guarantee for Bandits and Reinforcement Learning | Unknown | N/A | |
| B-ary Tree Push-Pull Method is Provably Efficient for Distributed Learning on Heterogeneous Data | Unknown | N/A | |
| SAND: Smooth imputation of sparse and noisy functional data with Transformer networks | Unknown | N/A | |
| How Sparse Can We Prune A Deep Network: A Fundamental Limit Perspective | Unknown | N/A | |
| GVKF: Gaussian Voxel Kernel Functions for Highly Efficient Surface Reconstruction in Open Scenes | Unknown | N/A | |
| ContactField: Implicit Field Representation for Multi-Person Interaction Geometry | Unknown | N/A | |
| Adaptive Domain Learning for Cross-domain Image Denoising | Unknown | N/A | |
| DropBP: Accelerating Fine-Tuning of Large Language Models by Dropping Backward Propagation | Unknown | N/A | |
| Pedestrian-Centric 3D Pre-collision Pose and Shape Estimation from Dashcam Perspective | Unknown | N/A | |
| Kronecker-Factored Approximate Curvature for Physics-Informed Neural Networks | Unknown | N/A | |
| Reference Trustable Decoding: A Training-Free Augmentation Paradigm for Large Language Models | Unknown | N/A | |
| Physics-Constrained Comprehensive Optical Neural Networks | Unknown | N/A | |
| Divide-and-Conquer Posterior Sampling for Denoising Diffusion priors | Unknown | N/A | |
| Unveiling The Matthew Effect Across Channels: Assessing Layer Width Sufficiency via Weight Norm Variance | Unknown | N/A | |
| Can Simple Averaging Defeat Modern Watermarks? | Unknown | N/A | |
| LaSe-E2V: Towards Language-guided Semantic-aware Event-to-Video Reconstruction | Unknown | N/A | |
| AdaPKC: PeakConv with Adaptive Peak Receptive Field for Radar Semantic Segmentation | Unknown | N/A | |
| On Sampling Strategies for Spectral Model Sharding | Unknown | N/A | |
| Aligning Individual and Collective Objectives in Multi-Agent Cooperation | Unknown | N/A | |
| PaDeLLM-NER: Parallel Decoding in Large Language Models for Named Entity Recognition | Unknown | N/A | |
| One-Step Diffusion Distillation through Score Implicit Matching | Unknown | N/A | |
| Penalty-based Methods for Simple Bilevel Optimization under Hölderian Error Bounds | Unknown | N/A | |
| FASTopic: Pretrained Transformer is a Fast, Adaptive, Stable, and Transferable Topic Model | Unknown | N/A | |
| Private Edge Density Estimation for Random Graphs: Optimal, Efficient and Robust | Unknown | N/A | |
| Memory-Efficient LLM Training with Online Subspace Descent | Unknown | N/A | |
| Transformers are Minimax Optimal Nonparametric In-Context Learners | Unknown | N/A | |
| MOTE-NAS: Multi-Objective Training-based Estimate for Efficient Neural Architecture Search | Unknown | N/A | |
| ADOPT: Modified Adam Can Converge with Any $\beta_2$ with the Optimal Rate | Unknown | N/A | |
| The Implicit Bias of Gradient Descent toward Collaboration between Layers: A Dynamic Analysis of Multilayer Perceptions | Unknown | N/A | |
| Measuring Goal-Directedness | Unknown | N/A | |
| Learning to Merge Tokens via Decoupled Embedding for Efficient Vision Transformers | Unknown | N/A | |
| Predicting Future Actions of Reinforcement Learning Agents | Unknown | N/A | |
| Interpretable Concept Bottlenecks to Align Reinforcement Learning Agents | Unknown | N/A | |
| Suitable is the Best: Task-Oriented Knowledge Fusion in Vulnerability Detection | Unknown | N/A | |
| Towards Neuron Attributions in Multi-Modal Large Language Models | Unknown | N/A | |
| Analytically deriving Partial Information Decomposition for affine systems of stable and convolution-closed distributions | Unknown | N/A | |
| Stochastic Newton Proximal Extragradient Method | Unknown | N/A | |
| Neural Concept Binder | Unknown | N/A | |
| EDT: An Efficient Diffusion Transformer Framework Inspired by Human-like Sketching | Unknown | N/A | |
| Parallel Backpropagation for Shared-Feature Visualization | Unknown | N/A | |
| Designs for Enabling Collaboration in Human-Machine Teaming via Interactive and Explainable Systems | Unknown | N/A | |
| The Secretary Problem with Predicted Additive Gap | Unknown | N/A | |
| Self-Taught Recognizer: Toward Unsupervised Adaptation for Speech Foundation Models | Unknown | N/A | |
| Model LEGO: Creating Models Like Disassembling and Assembling Building Blocks | Unknown | N/A | |
| Discrete Modeling via Boundary Conditional Diffusion Processes | Unknown | N/A | |
| LibMOON: A Gradient-based MultiObjective OptimizatioN Library in PyTorch | Unknown | N/A | |
| Using Time-Aware Graph Neural Networks to Predict Temporal Centralities in Dynamic Graphs | Unknown | N/A | |
| Latent Neural Operator for Solving Forward and Inverse PDE Problems | Unknown | N/A | |
| Mechanism design augmented with output advice | Unknown | N/A | |
| HyperPrism: An Adaptive Non-linear Aggregation Framework for Distributed Machine Learning over Non-IID Data and Time-varying Communication Links | Unknown | N/A | |
| Thinking Forward: Memory-Efficient Federated Finetuning of Language Models | Unknown | N/A | |
| Towards Editing Time Series | Unknown | N/A | |
| Geometry Awakening: Cross-Geometry Learning Exhibits Superiority over Individual Structures | Unknown | N/A | |
| Reproducibility Study: Equal Improvability: A New Fairness Notion Considering the Long-Term Impact | Unknown | N/A | |
| Language Without Borders: A Dataset and Benchmark for Code-Switching Lip Reading | Unknown | N/A | |
| The High Line: Exact Risk and Learning Rate Curves of Stochastic Adaptive Learning Rate Algorithms | Unknown | N/A | |
| Smoothed Online Classification can be Harder than Batch Classification | Unknown | N/A | |
| AdaSociety: An Adaptive Environment with Social Structures for Multi-Agent Decision-Making | Unknown | N/A | |
| Q-Distribution guided Q-learning for offline reinforcement learning: Uncertainty penalized Q-value via consistency model | Unknown | N/A | |
| APDDv2: Aesthetics of Paintings and Drawings Dataset with Artist Labeled Scores and Comments | Unknown | N/A | |
| [Re] CUDA: Curriculum of Data Augmentation for Long-tailed Recognition | Unknown | N/A | |
| Better by default: Strong pre-tuned MLPs and boosted trees on tabular data | Unknown | N/A | |
| Learning Action and Reasoning-Centric Image Editing from Videos and Simulation | Unknown | N/A | |
| Lightweight Frequency Masker for Cross-Domain Few-Shot Semantic Segmentation | Unknown | N/A | |
| DAPE: Data-Adaptive Positional Encoding for Length Extrapolation | Unknown | N/A | |
| Scaling Sign Language Translation | Unknown | N/A | |
| Exploitation of a Latent Mechanism in Graph Contrastive Learning: Representation Scattering | Unknown | N/A | |
| UDON: Universal Dynamic Online distillatioN for generic image representations | Unknown | N/A | |
| Key-Grid: Unsupervised 3D Keypoints Detection using Grid Heatmap Features | Unknown | N/A | |
| EGSST: Event-based Graph Spatiotemporal Sensitive Transformer for Object Detection | Unknown | N/A | |
| Warped Diffusion: Solving Video Inverse Problems with Image Diffusion Models | Unknown | N/A | |
| MoE Jetpack: From Dense Checkpoints to Adaptive Mixture of Experts for Vision Tasks | Unknown | N/A | |
| Multi-LLM Debate: Framework, Principals, and Interventions | Unknown | N/A | |
| Adaptive Sampling for Efficient Softmax Approximation | Unknown | N/A | |
| MedSafetyBench: Evaluating and Improving the Medical Safety of Large Language Models | Unknown | N/A | |
| NetworkGym: Reinforcement Learning Environments for Multi-Access Traffic Management in Network Simulation | Unknown | N/A | |
| Demystify Mamba in Vision: A Linear Attention Perspective | Unknown | N/A | |
| Unleashing the Potential of the Diffusion Model in Few-shot Semantic Segmentation | Unknown | N/A | |
| MVGamba: Unify 3D Content Generation as State Space Sequence Modeling | Unknown | N/A | |
| SciCode: A Research Coding Benchmark Curated by Scientists | Unknown | N/A | |
| Dissecting the Interplay of Attention Paths in a Statistical Mechanics Theory of Transformers | Unknown | N/A | |
| A Simple Image Segmentation Framework via In-Context Examples | Unknown | N/A | |
| DOFEN: Deep Oblivious Forest ENsemble | Unknown | N/A | |
| Large Language Models' Expert-level Global History Knowledge Benchmark (HiST-LLM) | Unknown | N/A | |
| Action Gaps and Advantages in Continuous-Time Distributional Reinforcement Learning | Unknown | N/A | |
| B-cosification: Transforming Deep Neural Networks to be Inherently Interpretable | Unknown | N/A | |
| Proving Olympiad Algebraic Inequalities without Human Demonstrations | Unknown | N/A | |
| On the Inductive Bias of Stacking Towards Improving Reasoning | Unknown | N/A | |
| Embodied Agent Interface: Benchmarking LLMs for Embodied Decision Making | Unknown | N/A | |
| Data Acquisition via Experimental Design for Data Markets | Unknown | N/A | |
| QKFormer: Hierarchical Spiking Transformer using Q-K Attention | Unknown | N/A | |
| CycleNet: Enhancing Time Series Forecasting through Modeling Periodic Patterns | Unknown | N/A | |
| DisCEdit: Model Editing by Identifying Discriminative Components | Unknown | N/A | |
| A Continuous-time Stochastic Gradient Descent Method for Continuous Data | Unknown | N/A | |
| Statistical Inference for Fairness Auditing | Unknown | N/A | |
| Spectral Adapter: Fine-Tuning in Spectral Space | Unknown | N/A | |
| Derandomizing Multi-Distribution Learning | Unknown | N/A | |
| RadarOcc: Robust 3D Occupancy Prediction with 4D Imaging Radar | Unknown | N/A | |
| Look, Listen, and Answer: Overcoming Biases for Audio-Visual Question Answering | Unknown | N/A | |
| Unifying Homophily and Heterophily for Spectral Graph Neural Networks via Triple Filter Ensembles | Unknown | N/A | |
| ElasTST: Towards Robust Varied-Horizon Forecasting with Elastic Time-Series Transformer | Unknown | N/A | |
| A Decision-Language Model (DLM) for Dynamic Restless Multi-Armed Bandit Tasks in Public Health | Unknown | N/A | |
| An Adaptive Approach for Infinitely Many-armed Bandits under Generalized Rotting Constraints | Unknown | N/A | |
| Time-Reversal Provides Unsupervised Feedback to LLMs | Unknown | N/A | |
| Making Offline RL Online: Collaborative World Models for Offline Visual Reinforcement Learning | Unknown | N/A | |
| Where Do Large Learning Rates Lead Us? | Unknown | N/A | |
| Du-IN: Discrete units-guided mask modeling for decoding speech from Intracranial Neural signals | Unknown | N/A | |
| Distributed Sparse Regression via Penalization | Unknown | N/A | |
| A Unified Recipe for Deriving (Time-Uniform) PAC-Bayes Bounds | Unknown | N/A | |
| Numerically Stable Sparse Gaussian Processes via Minimum Separation using Cover Trees | Unknown | N/A | |
| Optimal Clustering with Bandit Feedback | Unknown | N/A | |
| Transfer Learning with Informative Priors: Simple Baselines Better than Previously Reported | Unknown | N/A | |
| Towards Global Optimal Visual In-Context Learning Prompt Selection | Unknown | N/A | |
| Unified Generative and Discriminative Training for Multi-modal Large Language Models | Unknown | N/A | |
| CemiFace: Center-based Semi-hard Synthetic Face Generation for Face Recognition | Unknown | N/A | |
| SfPUEL: Shape from Polarization under Unknown Environment Light | Unknown | N/A | |
| LG-VQ: Language-Guided Codebook Learning | Unknown | N/A | |
| OccamLLM: Fast and Exact Language Model Arithmetic in a Single Step | Unknown | N/A | |
| Rethinking the Diffusion Models for Missing Data Imputation: A Gradient Flow Perspective | Unknown | N/A | |
| RefDrop: Controllable Consistency in Image or Video Generation via Reference Feature Guidance | Unknown | N/A | |
| SegVol: Universal and Interactive Volumetric Medical Image Segmentation | Unknown | N/A | |
| Separate and Reconstruct: Asymmetric Encoder-Decoder for Speech Separation | Unknown | N/A | |
| Value Imprint: A Technique for Auditing the Human Values Embedded in RLHF Datasets | Unknown | N/A | |
| Procedure-Aware Surgical Video-language Pretraining with Hierarchical Knowledge Augmentation | Unknown | N/A | |
| SpreadsheetBench: Towards Challenging Real World Spreadsheet Manipulation | Unknown | N/A | |
| BertaQA: How Much Do Language Models Know About Local Culture? | Unknown | N/A | |
| Selective Explanations | Unknown | N/A | |
| Voxel Mamba: Group-Free State Space Models for Point Cloud based 3D Object Detection | Unknown | N/A | |
| Reinforcement Learning with Lookahead Information | Unknown | N/A | |
| Focus On What Matters: Separated Models For Visual-Based RL Generalization | Unknown | N/A | |
| IDGen: Item Discrimination Induced Prompt Generation for LLM Evaluation | Unknown | N/A | |
| Efficient Contextual LLM Cascades through Budget-Constrained Policy Learning | Unknown | N/A | |
| Stochastic Optimal Control and Estimation with Multiplicative and Internal Noise | Unknown | N/A | |
| Understanding and Improving Training-free Loss-based Diffusion Guidance | Unknown | N/A | |
| VidProM: A Million-scale Real Prompt-Gallery Dataset for Text-to-Video Diffusion Models | Unknown | N/A | |
| GlotCC: An Open Broad-Coverage CommonCrawl Corpus and Pipeline for Minority Languages | Unknown | N/A | |
| HEMM: Holistic Evaluation of Multimodal Foundation Models | Unknown | N/A | |
| OpenSatMap: A Fine-grained High-resolution Satellite Dataset for Large-scale Map Construction | Unknown | N/A | |
| WhodunitBench: Evaluating Large Multimodal Agents via Murder Mystery Games | Unknown | N/A | |
| MultiTrust: A Comprehensive Benchmark Towards Trustworthy Multimodal Large Language Models | Unknown | N/A | |
| IaC-Eval: A Code Generation Benchmark for Cloud Infrastructure-as-Code Programs | Unknown | N/A | |
| UKnow: A Unified Knowledge Protocol with Multimodal Knowledge Graph Datasets for Reasoning and Vision-Language Pre-Training | Unknown | N/A | |
| LogiCity: Advancing Neuro-Symbolic AI with Abstract Urban Simulation | Unknown | N/A | |
| ConceptMix: A Compositional Image Generation Benchmark with Controllable Difficulty | Unknown | N/A | |
| Building Timeseries Dataset: Empowering Large-Scale Building Analytics | Unknown | N/A | |
| RWKU: Benchmarking Real-World Knowledge Unlearning for Large Language Models | Unknown | N/A | |
| A Hitchhiker's Guide to Fine-Grained Face Forgery Detection Using Common Sense Reasoning | Unknown | N/A | |
| ImageNet3D: Towards General-Purpose Object-Level 3D Understanding | Unknown | N/A | |
| Bench2Drive: Towards Multi-Ability Benchmarking of Closed-Loop End-To-End Autonomous Driving | Unknown | N/A | |
| Infusing Synthetic Data with Real-World Patterns for Zero-Shot Material State Segmentation | Unknown | N/A | |
| ChronoMagic-Bench: A Benchmark for Metamorphic Evaluation of Text-to-Time-lapse Video Generation | Unknown | N/A | |
| CARES: A Comprehensive Benchmark of Trustworthiness in Medical Vision Language Models | Unknown | N/A | |
| TGB 2.0: A Benchmark for Learning on Temporal Knowledge Graphs and Heterogeneous Graphs | Unknown | N/A | |
| Sim2Real-Fire: A Multi-modal Simulation Dataset for Forecast and Backtracking of Real-world Forest Fire | Unknown | N/A | |
| GC-Bench: An Open and Unified Benchmark for Graph Condensation | Unknown | N/A | |
| LLMCBench: Benchmarking Large Language Model Compression for Efficient Deployment | Unknown | N/A | |
| CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs | Unknown | N/A | |
| AFBench: A Large-scale Benchmark for Airfoil Design | Unknown | N/A | |
| INQUIRE: A Natural World Text-to-Image Retrieval Benchmark | Unknown | N/A | |
| ConMe: Rethinking Evaluation of Compositional Reasoning for Modern VLMs | Unknown | N/A | |
| FUSU: A Multi-temporal-source Land Use Change Segmentation Dataset for Fine-grained Urban Semantic Understanding | Unknown | N/A | |
| Streaming Detection of Queried Event Start | Unknown | N/A | |
| Test-time Adaptation in Non-stationary Environments via Adaptive Representation Alignment | Unknown | N/A | |
| A New Multi-Source Light Detection Benchmark and Semi-Supervised Focal Light Detection | Unknown | N/A | |
| BuckTales: A multi-UAV dataset for multi-object tracking and re-identification of wild antelopes | Unknown | N/A | |
| Touchstone Benchmark: Are We on the Right Way for Evaluating AI Algorithms for Medical Segmentation? | Unknown | N/A | |
| MMScan: A Multi-Modal 3D Scene Dataset with Hierarchical Grounded Language Annotations | Unknown | N/A | |
| Point Cloud Matters: Rethinking the Impact of Different Observation Spaces on Robot Learning | Unknown | N/A | |
| DrivingDojo Dataset: Advancing Interactive and Knowledge-Enriched Driving World Model | Unknown | N/A | |
| FedLLM-Bench: Realistic Benchmarks for Federated Learning of Large Language Models | Unknown | N/A | |
| UniMTS: Unified Pre-training for Motion Time Series | Unknown | N/A | |
| GTSinger: A Global Multi-Technique Singing Corpus with Realistic Music Scores for All Singing Tasks | Unknown | N/A | |
| SciInstruct: a Self-Reflective Instruction Annotated Dataset for Training Scientific Language Models | Unknown | N/A | |
| FindingEmo: An Image Dataset for Emotion Recognition in the Wild | Unknown | N/A | |
| ComBack: A Versatile Dataset for Enhancing Compiler Backend Development Efficiency | Unknown | N/A | |
| Is Your HD Map Constructor Reliable under Sensor Corruptions? | Unknown | N/A | |
| LucidAction: A Hierarchical and Multi-model Dataset for Comprehensive Action Quality Assessment | Unknown | N/A | |
| Model-GLUE: Democratized LLM Scaling for A Large Model Zoo in the Wild | Unknown | N/A | |
| BenchX: A Unified Benchmark Framework for Medical Vision-Language Pretraining on Chest X-Rays | Unknown | N/A | |
| BEACON: Benchmark for Comprehensive RNA Tasks and Language Models | Unknown | N/A | |
| FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models | Unknown | N/A | |
| The iNaturalist Sounds Dataset | Unknown | N/A | |
| A Synthetic Dataset for Personal Attribute Inference | Unknown | N/A | |
| Learning Superconductivity from Ordered and Disordered Material Structures | Unknown | N/A | |
| CoIN: A Benchmark of Continual Instruction Tuning for Multimodel Large Language Models | Unknown | N/A | |
| LongVideoBench: A Benchmark for Long-context Interleaved Video-Language Understanding | Unknown | N/A | |
| Expecting The Unexpected: Towards Broad Out-Of-Distribution Detection | Unknown | N/A | |
| HourVideo: 1-Hour Video-Language Understanding | Unknown | N/A | |
| MARPLE: A Benchmark for Long-Horizon Inference | Unknown | N/A | |
| Scalable Early Childhood Reading Performance Prediction | Unknown | N/A | |
| Infer Induced Sentiment of Comment Response to Video: A New Task, Dataset and Baseline | Unknown | N/A | |
| IncomeSCM: From tabular data set to time-series simulator and causal estimation benchmark | Unknown | N/A | |
| LAVIB: A Large-scale Video Interpolation Benchmark | Unknown | N/A | |
| MMDU: A Multi-Turn Multi-Image Dialog Understanding Benchmark and Instruction-Tuning Dataset for LVLMs | Unknown | N/A | |
| DreamCatcher: A Wearer-aware Multi-modal Sleep Event Dataset Based on Earables in Non-restrictive Environments | Unknown | N/A | |
| NeuralPlane: An Efficiently Parallelizable Platform for Fixed-wing Aircraft Control with Reinforcement Learning | Unknown | N/A | |
| UltraMedical: Building Specialized Generalists in Biomedicine | Unknown | N/A | |
| Benchmarking PtO and PnO Methods in the Predictive Combinatorial Optimization Regime | Unknown | N/A | |
| Task Me Anything | Unknown | N/A | |
| FIRE: A Dataset for Feedback Integration and Refinement Evaluation of Multimodal Models | Unknown | N/A | |
| Semi-Truths: A Large-Scale Dataset of AI-Augmented Images for Evaluating Robustness of AI-Generated Image detectors | Unknown | N/A | |
| VastTrack: Vast Category Visual Object Tracking | Unknown | N/A | |
| GAIA: Rethinking Action Quality Assessment for AI-Generated Videos | Unknown | N/A | |
| Shopping MMLU: A Massive Multi-Task Online Shopping Benchmark for Large Language Models | Unknown | N/A | |
| WildVision: Evaluating Vision-Language Models in the Wild with Human Preferences | Unknown | N/A | |
| Harmony4D: A Video Dataset for In-The-Wild Close Human Interactions | Unknown | N/A | |
| Archaeoscape: Bringing Aerial Laser Scanning Archaeology to the Deep Learning Era | Unknown | N/A | |
| Visual CoT: Advancing Multi-Modal Language Models with a Comprehensive Dataset and Benchmark for Chain-of-Thought Reasoning | Unknown | N/A | |
| OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset | Unknown | N/A | |
| SCRREAM : SCan, Register, REnder And Map: A Framework for Annotating Accurate and Dense 3D Indoor Scenes with a Benchmark | Unknown | N/A | |
| Muharaf: Manuscripts of Handwritten Arabic Dataset for Cursive Text Recognition | Unknown | N/A | |
| IKEA Manuals at Work: 4D Grounding of Assembly Instructions on Internet Videos | Unknown | N/A | |
| Image Textualization: An Automatic Framework for Generating Rich and Detailed Image Descriptions | Unknown | N/A | |
| Measuring Multimodal Mathematical Reasoning with MATH-Vision Dataset | Unknown | N/A | |
| The Scandinavian Embedding Benchmarks: Comprehensive Assessment of Multilingual and Monolingual Text Embedding | Unknown | N/A | |
| FVEL: Interactive Formal Verification Environment with Large Language Models via Theorem Proving | Unknown | N/A | |
| CLAVE: An Adaptive Framework for Evaluating Values of LLM Generated Responses | Unknown | N/A | |
| SciFIBench: Benchmarking Large Multimodal Models for Scientific Figure Interpretation | Unknown | N/A | |
| EpiCare: A Reinforcement Learning Benchmark for Dynamic Treatment Regimes | Unknown | N/A | |
| SS3DM: Benchmarking Street-View Surface Reconstruction with a Synthetic 3D Mesh Dataset | Unknown | N/A | |
| SeafloorAI: A Large-scale Vision-Language Dataset for Seafloor Geological Survey | Unknown | N/A | |
| Multi-modal Situated Reasoning in 3D Scenes | Unknown | N/A | |
| Benchmarking the Attribution Quality of Vision Models | Unknown | N/A | |
| 4DBInfer: A 4D Benchmarking Toolbox for Graph-Centric Predictive Modeling on RDBs | Unknown | N/A | |
| GeoPlant: Spatial Plant Species Prediction Dataset | Unknown | N/A | |
| ShareGPT4Video: Improving Video Understanding and Generation with Better Captions | Unknown | N/A | |
| PEACE: A Dataset of Pharmaceutical Care for Cancer Pain Analgesia Evaluation and Medication Decision | Unknown | N/A | |
| Rethinking the Evaluation of Out-of-Distribution Detection: A Sorites Paradox | Unknown | N/A | |
| Bag of Tricks: Benchmarking of Jailbreak Attacks on LLMs | Unknown | N/A | |
| Automating Dataset Updates Towards Reliable and Timely Evaluation of Large Language Models | Unknown | N/A | |
| WebUOT-1M: Advancing Deep Underwater Object Tracking with A Million-Scale Benchmark | Unknown | N/A | |
| NanoBaseLib: A Multi-Task Benchmark Dataset for Nanopore Sequencing | Unknown | N/A | |
| ZSC-Eval: An Evaluation Toolkit and Benchmark for Multi-agent Zero-shot Coordination | Unknown | N/A | |
| APEBench: A Benchmark for Autoregressive Neural Emulators of PDEs | Unknown | N/A | |
| A Large-Scale Human-Centric Benchmark for Referring Expression Comprehension in the LMM Era | Unknown | N/A | |
| On the Effects of Data Scale on UI Control Agents | Unknown | N/A | |
| TorchSpatial: A Location Encoding Framework and Benchmark for Spatial Representation Learning | Unknown | N/A | |
| GeSS: Benchmarking Geometric Deep Learning under Scientific Applications with Distribution Shifts | Unknown | N/A | |
| Indoor Air Quality Dataset with Activities of Daily Living in Low to Middle-income Communities | Unknown | N/A | |
| EEVR: A Dataset of Paired Physiological Signals and Textual Descriptions for Joint Emotion Representation Learning | Unknown | N/A | |
| Calibrated Self-Rewarding Vision Language Models | Unknown | N/A | |
| LexEval: A Comprehensive Chinese Legal Benchmark for Evaluating Large Language Models | Unknown | N/A | |
| Benchmarking Estimators for Natural Experiments: A Novel Dataset and a Doubly Robust Algorithm | Unknown | N/A | |
| Towards General Loop Invariant Generation: A Benchmark of Programs with Memory Manipulation | Unknown | N/A | |
| Cross-Care: Assessing the Healthcare Implications of Pre-training Data on Language Model Bias | Unknown | N/A | |
| Constrained Human-AI Cooperation: An Inclusive Embodied Social Intelligence Challenge | Unknown | N/A | |
| Terra: A Multimodal Spatio-Temporal Dataset Spanning the Earth | Unknown | N/A | |
| Pedestrian Trajectory Prediction with Missing Data: Datasets, Imputation, and Benchmarking | Unknown | N/A | |
| NovoBench: Benchmarking Deep Learning-based *De Novo* Sequencing Methods in Proteomics | Unknown | N/A | |
| GMAI-MMBench: A Comprehensive Multimodal Evaluation Benchmark Towards General Medical AI | Unknown | N/A | |
| Can Large Language Models Analyze Graphs like Professionals? A Benchmark, Datasets and Models | Unknown | N/A | |
| WONDERBREAD: A Benchmark for Evaluating Multimodal Foundation Models on Business Process Management Tasks | Unknown | N/A | |
| GenAI Arena: An Open Evaluation Platform for Generative Models | Unknown | N/A | |
| MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark | Unknown | N/A | |
| Hints-In-Browser: Benchmarking Language Models for Programming Feedback Generation | Unknown | N/A | |
| Navigating the Maze of Explainable AI: A Systematic Approach to Evaluating Methods and Metrics | Unknown | N/A | |
| NaturalBench: Evaluating Vision-Language Models on Natural Adversarial Samples | Unknown | N/A | |
| When LLMs Meet Cunning Texts: A Fallacy Understanding Benchmark for Large Language Models | Unknown | N/A | |
| Fast and Memory-Efficient Video Diffusion Using Streamlined Inference | Unknown | N/A | |
| PrivacyLens: Evaluating Privacy Norm Awareness of Language Models in Action | Unknown | N/A | |
| Evaluating Large Vision-and-Language Models on Children's Mathematical Olympiads | Unknown | N/A | |
| A Benchmark Suite for Evaluating Neural Mutual Information Estimators on Unstructured Datasets | Unknown | N/A | |
| CoMix: A Comprehensive Benchmark for Multi-Task Comic Understanding | Unknown | N/A | |
| A Benchmark Dataset for Event-Guided Human Pose Estimation and Tracking in Extreme Conditions | Unknown | N/A | |
| HumanVid: Demystifying Training Data for Camera-controllable Human Image Animation | Unknown | N/A | |
| UltraEdit: Instruction-based Fine-Grained Image Editing at Scale | Unknown | N/A | |
| DF40: Toward Next-Generation Deepfake Detection | Unknown | N/A | |
| SR-CACO-2: A Dataset for Confocal Fluorescence Microscopy Image Super-Resolution | Unknown | N/A | |
| Cooperation, Competition, and Maliciousness: LLM-Stakeholders Interactive Negotiation | Unknown | N/A | |
| WenMind: A Comprehensive Benchmark for Evaluating Large Language Models in Chinese Classical Literature and Language Arts | Unknown | N/A | |
| AuctionNet: A Novel Benchmark for Decision-Making in Large-Scale Games | Unknown | N/A | |
| The Multimodal Universe: Enabling Large-Scale Machine Learning with 100 TB of Astronomical Scientific Data | Unknown | N/A | |
| VLKEB: A Large Vision-Language Model Knowledge Editing Benchmark | Unknown | N/A | |
| DrivAerNet++: A Large-Scale Multimodal Car Dataset with Computational Fluid Dynamics Simulations and Deep Learning Benchmarks | Unknown | N/A | |
| Needle In A Multimodal Haystack | Unknown | N/A | |
| GTA: A Benchmark for General Tool Agents | Unknown | N/A | |
| Muscles in Time: Learning to Understand Human Motion In-Depth by Simulating Muscle Activations | Unknown | N/A | |
| Copycats: the many lives of a publicly available medical imaging dataset | Unknown | N/A | |
| LVD-2M: A Long-take Video Dataset with Temporally Dense Captions | Unknown | N/A | |
| ChaosBench: A Multi-Channel, Physics-Based Benchmark for Subseasonal-to-Seasonal Climate Prediction | Unknown | N/A | |
| T2VSafetyBench: Evaluating the Safety of Text-to-Video Generative Models | Unknown | N/A | |
| Benchmarking Generative Models on Computational Thinking Tests in Elementary Visual Programming | Unknown | N/A | |
| BeanCounter: A low-toxicity, large-scale, and open dataset of business-oriented text | Unknown | N/A | |
| A Taxonomy of Challenges to Curating Fair Datasets | Unknown | N/A | |
| Revisiting Few-Shot Object Detection with Vision-Language Models | Unknown | N/A | |
| CaptainCook4D: A Dataset for Understanding Errors in Procedural Activities | Unknown | N/A | |
| PertEval: Unveiling Real Knowledge Capacity of LLMs with Knowledge-Invariant Perturbations | Unknown | N/A | |
| DetectRL: Benchmarking LLM-Generated Text Detection in Real-World Scenarios | Unknown | N/A | |
| FlexMol: A Flexible Toolkit for Benchmarking Molecular Relational Learning | Unknown | N/A | |
| A SARS-CoV-2 Interaction Dataset and VHH Sequence Corpus for Antibody Language Models | Unknown | N/A | |
| FinBen: A Holistic Financial Benchmark for Large Language Models | Unknown | N/A | |
| PURE: Prompt Evolution with Graph ODE for Out-of-distribution Fluid Dynamics Modeling | Unknown | N/A | |
| Brain Treebank: Large-scale intracranial recordings from naturalistic language stimuli | Unknown | N/A | |
| SafeSora: Towards Safety Alignment of Text2Video Generation via a Human Preference Dataset | Unknown | N/A | |
| Hidden in Plain Sight: Evaluating Abstract Shape Recognition in Vision-Language Models | Unknown | N/A | |
| A Retrospective on the Robot Air Hockey Challenge: Benchmarking Robust, Reliable, and Safe Learning Techniques for Real-world Robotics | Unknown | N/A | |
| Einsum Benchmark: Enabling the Development of Next-Generation Tensor Execution Engines | Unknown | N/A | |
| Lean Workbook: A large-scale Lean problem set formalized from natural language math problems | Unknown | N/A | |
| ConceptFactory: Facilitate 3D Object Knowledge Annotation with Object Conceptualization | Unknown | N/A | |
| OVT-B: A New Large-Scale Benchmark for Open-Vocabulary Multi-Object Tracking | Unknown | N/A | |
| AllClear: A Comprehensive Dataset and Benchmark for Cloud Removal in Satellite Imagery | Unknown | N/A | |
| Dispelling the Mirage of Progress in Offline MARL through Standardised Baselines and Evaluation | Unknown | N/A | |
| MMBench-Video: A Long-Form Multi-Shot Benchmark for Holistic Video Understanding | Unknown | N/A | |
| Towards Next-Generation Logic Synthesis: A Scalable Neural Circuit Generation Framework | Unknown | N/A | |
| SHDocs: A dataset, benchmark, and method to efficiently generate high-quality, real-world specular highlight data with near-perfect alignment | Unknown | N/A | |
| II-Bench: An Image Implication Understanding Benchmark for Multimodal Large Language Models | Unknown | N/A | |
| Do Counterfactually Fair Image Classifiers Satisfy Group Fairness? -- A Theoretical and Empirical Study | Unknown | N/A | |
| The Well: a Large-Scale Collection of Diverse Physics Simulations for Machine Learning | Unknown | N/A | |
| Towards Heterogeneous Long-tailed Learning: Benchmarking, Metrics, and Toolbox | Unknown | N/A | |
| SELECT: A Large-Scale Benchmark of Data Curation Strategies for Image Classification | Unknown | N/A | |
| The Fragility of Fairness: Causal Sensitivity Analysis for Fair Machine Learning | Unknown | N/A | |
| JourneyBench: A Challenging One-Stop Vision-Language Understanding Benchmark of Generated Images | Unknown | N/A | |
| Identifying and Solving Conditional Image Leakage in Image-to-Video Diffusion Model | Unknown | N/A | |
| BIGOS V2 Benchmark for Polish ASR: Curated Datasets and Tools for Reproducible Evaluation | Unknown | N/A | |
| Off to new Shores: A Dataset & Benchmark for (near-)coastal Flood Inundation Forecasting | Unknown | N/A | |
| ActionAtlas: A VideoQA Benchmark for Domain-specialized Action Recognition | Unknown | N/A | |
| STimage-1K4M: A histopathology image-gene expression dataset for spatial transcriptomics | Unknown | N/A | |
| The PRISM Alignment Dataset: What Participatory, Representative and Individualised Human Feedback Reveals About the Subjective and Multicultural Alignment of Large Language Models | Unknown | N/A | |
| CableInspect-AD: An Expert-Annotated Anomaly Detection Dataset | Unknown | N/A | |
| Is Function Similarity Over-Engineered? Building a Benchmark | Unknown | N/A | |
| SolarCube: An Integrative Benchmark Dataset Harnessing Satellite and In-situ Observations for Large-scale Solar Energy Forecasting | Unknown | N/A | |
| MoGenTS: Motion Generation based on Spatial-Temporal Joint Modeling | Unknown | N/A | |
| USCILab3D: A Large-scale, Long-term, Semantically Annotated Outdoor Dataset | Unknown | N/A | |
| SubjECTive-QA: Measuring Subjectivity in Earnings Call Transcripts' QA Through Six-Dimensional Feature Analysis | Unknown | N/A | |
| Implicit Zoo: A Large-Scale Dataset of Neural Implicit Functions for 2D Images and 3D Scenes | Unknown | N/A | |
| RelBench: A Benchmark for Deep Learning on Relational Databases | Unknown | N/A | |
| Scribbles for All: Benchmarking Scribble Supervised Segmentation Across Datasets | Unknown | N/A | |
| emg2pose: A Large and Diverse Benchmark for Surface Electromyographic Hand Pose Estimation | Unknown | N/A | |
| NYU CTF Bench: A Scalable Open-Source Benchmark Dataset for Evaluating LLMs in Offensive Security | Unknown | N/A | |
| ReactZyme: A Benchmark for Enzyme-Reaction Prediction | Unknown | N/A | |
| BIOSCAN-5M: A Multimodal Dataset for Insect Biodiversity | Unknown | N/A | |
| MOTIVE: A Drug-Target Interaction Graph For Inductive Link Prediction | Unknown | N/A | |
| IMDL-BenCo: A Comprehensive Benchmark and Codebase for Image Manipulation Detection & Localization | Unknown | N/A | |
| Benchmark Data Repositories for Better Benchmarking | Unknown | N/A | |
| VLM4Bio: A Benchmark Dataset to Evaluate Pretrained Vision-Language Models for Trait Discovery from Biological Images | Unknown | N/A | |
| Easy2Hard-Bench: Standardized Difficulty Labels for Profiling LLM Performance and Generalization | Unknown | N/A | |
| Safetywashing: Do AI Safety Benchmarks Actually Measure Safety Progress? | Unknown | N/A | |
| Can LLMs Solve Molecule Puzzles? A Multimodal Benchmark for Molecular Structure Elucidation | Unknown | N/A | |
| Evaluating Copyright Takedown Methods for Language Models | Unknown | N/A | |
| Towards Comprehensive Detection of Chinese Harmful Memes | Unknown | N/A | |
| Assemblage: Automatic Binary Dataset Construction for Machine Learning | Unknown | N/A | |
| Job-SDF: A Multi-Granularity Dataset for Job Skill Demand Forecasting and Benchmarking | Unknown | N/A | |
| TEG-DB: A Comprehensive Dataset and Benchmark of Textual-Edge Graphs | Unknown | N/A | |
| MLLMGuard: A Multi-dimensional Safety Evaluation Suite for Multimodal Large Language Models | Unknown | N/A | |
| Re-assembling the past: The RePAIR dataset and benchmark for real world 2D and 3D puzzle solving | Unknown | N/A | |
| Instruction Embedding: Latent Representations of Instructions Towards Task Identification | Unknown | N/A | |
| SynRS3D: A Synthetic Dataset for Global 3D Semantic Understanding from Monocular Remote Sensing Imagery | Unknown | N/A | |
| SPIQA: A Dataset for Multimodal Question Answering on Scientific Papers | Unknown | N/A | |
| Text-space Graph Foundation Models: Comprehensive Benchmarks and New Insights | Unknown | N/A | |
| ConvBench: A Multi-Turn Conversation Evaluation Benchmark with Hierarchical Ablation Capability for Large Vision-Language Models | Unknown | N/A | |
| A Careful Examination of Large Language Model Performance on Grade School Arithmetic | Unknown | N/A | |
| SUGARCREPE++ Dataset: Vision-Language Model Sensitivity to Semantic and Lexical Alterations | Unknown | N/A | |
| Vript: A Video Is Worth Thousands of Words | Unknown | N/A | |
| emg2qwerty: A Large Dataset with Baselines for Touch Typing using Surface Electromyography | Unknown | N/A | |
| AudioMarkBench: Benchmarking Robustness of Audio Watermarking | Unknown | N/A | |
| UDA: A Benchmark Suite for Retrieval Augmented Generation in Real-World Document Analysis | Unknown | N/A | |
| $E^3$: Exploring Embodied Emotion Through A Large-Scale Egocentric Video Dataset | Unknown | N/A | |
| ViLCo-Bench: VIdeo Language COntinual learning Benchmark | Unknown | N/A | |
| RAGChecker: A Fine-grained Framework for Diagnosing Retrieval-Augmented Generation | Unknown | N/A | |
| EvoCodeBench: An Evolving Code Generation Benchmark with Domain-Specific Evaluations | Unknown | N/A | |
| ReXTime: A Benchmark Suite for Reasoning-Across-Time in Videos | Unknown | N/A | |
| WildPPG: A Real-World PPG Dataset of Long Continuous Recordings | Unknown | N/A | |
| Benchmarking LLMs via Uncertainty Quantification | Unknown | N/A | |
| XLand-MiniGrid: Scalable Meta-Reinforcement Learning Environments in JAX | Unknown | N/A | |
| OpenDebateEvidence: A Massive-Scale Argument Mining and Summarization Dataset | Unknown | N/A | |
| Are Large Language Models Good Statisticians? | Unknown | N/A | |
| MedJourney: Benchmark and Evaluation of Large Language Models over Patient Clinical Journey | Unknown | N/A | |
| TACT: Advancing Complex Aggregative Reasoning with Information Extraction Tools | Unknown | N/A | |
| PUZZLES: A Benchmark for Neural Algorithmic Reasoning | Unknown | N/A | |
| dopanim: A Dataset of Doppelganger Animals with Noisy Annotations from Multiple Humans | Unknown | N/A | |
| Web2Code: A Large-scale Webpage-to-Code Dataset and Evaluation Framework for Multimodal LLMs | Unknown | N/A | |
| $\nabla^2$DFT: A Universal Quantum Chemistry Dataset of Drug-Like Molecules and a Benchmark for Neural Network Potentials | Unknown | N/A | |
| AMBROSIA: A Benchmark for Parsing Ambiguous Questions into Database Queries | Unknown | N/A | |
| BiVLC: Extending Vision-Language Compositionality Evaluation with Text-to-Image Retrieval | Unknown | N/A | |
| Fit for our purpose, not yours: Benchmark for a low-resource, Indigenous language | Unknown | N/A | |
| Evaluating Multiview Object Consistency in Humans and Image Models | Unknown | N/A | |
| Advancing Video Anomaly Detection: A Concise Review and a New Dataset | Unknown | N/A | |
| A survey and benchmark of high-dimensional Bayesian optimization of discrete sequences | Unknown | N/A | |
| EHRNoteQA: An LLM Benchmark for Real-World Clinical Practice Using Discharge Summaries | Unknown | N/A | |
| A Data-Centric Perspective on Evaluating Machine Learning Models for Tabular Data | Unknown | N/A | |
| CURE4Rec: A Benchmark for Recommendation Unlearning with Deeper Influence | Unknown | N/A | |
| NewTerm: Benchmarking Real-Time New Terms for Large Language Models with Annual Updates | Unknown | N/A | |
| Noisy Ostracods: A Fine-Grained, Imbalanced Real-World Dataset for Benchmarking Robust Machine Learning and Label Correction Methods | Unknown | N/A | |
| MathPile: A Billion-Token-Scale Pretraining Corpus for Math | Unknown | N/A | |
| Revisiting, Benchmarking and Understanding Unsupervised Graph Domain Adaptation | Unknown | N/A | |
| Benchmarking Counterfactual Image Generation | Unknown | N/A | |
| CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision Making | Unknown | N/A | |
| WindsorML: High-Fidelity Computational Fluid Dynamics Dataset For Automotive Aerodynamics | Unknown | N/A | |
| PowerGraph: A power grid benchmark dataset for graph neural networks | Unknown | N/A | |
| ERBench: An Entity-Relationship based Automatically Verifiable Hallucination Benchmark for Large Language Models | Unknown | N/A | |
| TAPVid-3D: A Benchmark for Tracking Any Point in 3D | Unknown | N/A | |
| Newswire: A Large-Scale Structured Database of a Century of Historical News | Unknown | N/A | |
| Few-shot Algorithms for Consistent Neural Decoding (FALCON) Benchmark | Unknown | N/A | |
| AgentDojo: A Dynamic Environment to Evaluate Prompt Injection Attacks and Defenses for LLM Agents | Unknown | N/A | |
| UniTox: Leveraging LLMs to Curate a Unified Dataset of Drug-Induced Toxicity from FDA Labels | Unknown | N/A | |
| EgoSim: An Egocentric Multi-view Simulator and Real Dataset for Body-worn Cameras during Motion and Activity | Unknown | N/A | |
| FT-AED: Benchmark Dataset for Early Freeway Traffic Anomalous Event Detection | Unknown | N/A | |
| ProG: A Graph Prompt Learning Benchmark | Unknown | N/A | |
| dattri: A Library for Efficient Data Attribution | Unknown | N/A | |
| DECO-Bench: Unified Benchmark for Decoupled Task-Agnostic Synthetic Data Release | Unknown | N/A | |
| 3DCoMPaT200: Language Grounded Large-Scale 3D Vision Dataset for Compositional Recognition | Unknown | N/A | |
| Stylebreeder: Exploring and Democratizing Artistic Styles through Text-to-Image Models | Unknown | N/A | |
| OAM-TCD: A globally diverse dataset of high-resolution tree cover maps | Unknown | N/A | |
| RedCode: Risky Code Execution and Generation Benchmark for Code Agents | Unknown | N/A | |
| BioTrove: A Large Curated Image Dataset Enabling AI for Biodiversity | Unknown | N/A | |
| What to Say and When to Say it: Live Fitness Coaching as a Testbed for Situated Interaction | Unknown | N/A | |
| cPAPERS: A Dataset of Situated and Multimodal Interactive Conversations in Scientific Papers | Unknown | N/A | |
| RClicks: Realistic Click Simulation for Benchmarking Interactive Segmentation | Unknown | N/A | |
| Data curation via joint example selection further accelerates multimodal learning | Unknown | N/A | |
| StackEval: Benchmarking LLMs in Coding Assistance | Unknown | N/A | |
| Kuro Siwo: 33 billion $m^2$ under the water. A global multi-temporal satellite dataset for rapid flood mapping | Unknown | N/A | |
| ClashEval: Quantifying the tug-of-war between an LLM’s internal prior and external evidence | Unknown | N/A | |
| Enhancing vision-language models for medical imaging: bridging the 3D gap with innovative slice selection | Unknown | N/A | |
| SETLEXSEM CHALLENGE: Using Set Operations to Evaluate the Lexical and Semantic Robustness of Language Models | Unknown | N/A | |
| AsEP: Benchmarking Deep Learning Methods for Antibody-specific Epitope Prediction | Unknown | N/A | |
| Retrospective for the Dynamic Sensorium Competition for predicting large-scale mouse primary visual cortex activity from videos | Unknown | N/A | |
| IMPACT: A Large-scale Integrated Multimodal Patent Analysis and Creation Dataset for Design Patents | Unknown | N/A | |
| SM3-Text-to-Query: Synthetic Multi-Model Medical Text-to-Query Benchmark | Unknown | N/A | |
| RepLiQA: A Question-Answering Dataset for Benchmarking LLMs on Unseen Reference Content | Unknown | N/A | |
| TabularBench: Benchmarking Adversarial Robustness for Tabular Deep Learning in Real-world Use-cases | Unknown | N/A | |
| shapiq: Shapley Interactions for Machine Learning | Unknown | N/A | |
| Me, Myself, and AI: The Situational Awareness Dataset (SAD) for LLMs | Unknown | N/A | |
| WildGuard: Open One-stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMs | Unknown | N/A | |
| MLLM-CompBench: A Comparative Reasoning Benchmark for Multimodal LLMs | Unknown | N/A | |
| ReMI: A Dataset for Reasoning with Multiple Images | Unknown | N/A | |
| Stronger Than You Think: Benchmarking Weak Supervision on Realistic Tasks | Unknown | N/A | |
| Time-MMD: Multi-Domain Multimodal Dataset for Time Series Analysis | Unknown | N/A | |
| CALE: Continuous Arcade Learning Environment | Unknown | N/A | |
| Arctique: An artificial histopathological dataset unifying realism and controllability for uncertainty quantification | Unknown | N/A | |
| PutnamBench: Evaluating Neural Theorem-Provers on the Putnam Mathematical Competition | Unknown | N/A | |
| Consent in Crisis: The Rapid Decline of the AI Data Commons | Unknown | N/A | |
| Vocal Call Locator Benchmark (VCL) for localizing rodent vocalizations from multi-channel audio | Unknown | N/A | |
| DACO: Towards Application-Driven and Comprehensive Data Analysis via Code Generation | Unknown | N/A | |
| Weight decay induces low-rank attention layers | Unknown | N/A | |
| V-PETL Bench: A Unified Visual Parameter-Efficient Transfer Learning Benchmark | Unknown | N/A | |
| ClevrSkills: Compositional Language And Visual Reasoning in Robotics | Unknown | N/A | |
| Towards Open Respiratory Acoustic Foundation Models: Pretraining and Benchmarking | Unknown | N/A | |
| DiscoveryWorld: A Virtual Environment for Developing and Evaluating Automated Scientific Discovery Agents | Unknown | N/A | |
| RedPajama: an Open Dataset for Training Large Language Models | Unknown | N/A | |
| CiteME: Can Language Models Accurately Cite Scientific Claims? | Unknown | N/A | |
| Benchmarking Out-of-Distribution Generalization Capabilities of DNN-based Encoding Models for the Ventral Visual Cortex | Unknown | N/A | |
| The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale | Unknown | N/A | |
| Croissant: A Metadata Format for ML-Ready Datasets | Unknown | N/A | |
| Evaluating Numerical Reasoning in Text-to-Image Models | Unknown | N/A | |
| CryoBench: Diverse and challenging datasets for the heterogeneity problem in cryo-EM | Unknown | N/A | |
| Topic-Conversation Relevance (TCR) Dataset and Benchmarks | Unknown | N/A | |
| HelpSteer 2: Open-source dataset for training top-performing reward models | Unknown | N/A | |
| Image2Struct: Benchmarking Structure Extraction for Vision-Language Models | Unknown | N/A | |
| UniBench: Visual Reasoning Requires Rethinking Vision-Language Beyond Scaling | Unknown | N/A | |
| MassSpecGym: A benchmark for the discovery and identification of molecules | Unknown | N/A | |
| VHELM: A Holistic Evaluation of Vision Language Models | Unknown | N/A | |
| The State of Data Curation at NeurIPS: An Assessment of Dataset Development Practices in the Datasets and Benchmarks Track | Unknown | N/A | |
| Humor in AI: Massive Scale Crowd-Sourced Preferences and Benchmarks for Cartoon Captioning | Unknown | N/A | |
| MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens | Unknown | N/A | |
| From Trojan Horses to Castle Walls: Unveiling Bilateral Data Poisoning Effects in Diffusion Models | Unknown | N/A | |
| BLURD: Benchmarking and Learning using a Unified Rendering and Diffusion Model | Unknown | N/A | |
| MedCalc-Bench: Evaluating Large Language Models for Medical Calculations | Unknown | N/A | |
| A benchmark for prediction of transcriptomic responses to chemical perturbations across cell types | Unknown | N/A | |
| DevBench: A multimodal developmental benchmark for language learning | Unknown | N/A | |
| InterpBench: Semi-Synthetic Transformers for Evaluating Mechanistic Interpretability Techniques | Unknown | N/A | |
| QGym: Scalable Simulation and Benchmarking of Queuing Network Controllers | Unknown | N/A | |
| DTGB: A Comprehensive Benchmark for Dynamic Text-Attributed Graphs | Unknown | N/A | |
| NoisyGL: A Comprehensive Benchmark for Graph Neural Networks under Label Noise | Unknown | N/A | |
| Slice-100K: A Multimodal Dataset for Extrusion-based 3D Printing | Unknown | N/A | |
| Micro-Bench: A Microscopy Benchmark for Vision-Language Understanding | Unknown | N/A | |
| Melting Pot Contest: Charting the Future of Generalized Cooperative Intelligence | Unknown | N/A | |
| MmCows: A Multimodal Dataset for Dairy Cattle Monitoring | Unknown | N/A | |
| SD-Eval: A Benchmark Dataset for Spoken Dialogue Understanding Beyond Words | Unknown | N/A | |
| Towards Reliable Model Selection for Unsupervised Domain Adaptation: An Empirical Study and A Certified Baseline | Unknown | N/A | |
| The Selective $G$-Bispectrum and its Inversion: Applications to $G$-Invariant Networks | Unknown | N/A | |
| Map It Anywhere: Empowering BEV Map Prediction using Large-scale Public Datasets | Unknown | N/A | |
| WikiDO: A New Benchmark Evaluating Cross-Modal Retrieval for Vision-Language Models | Unknown | N/A | |
| EMGBench: Benchmarking Out-of-Distribution Generalization and Adaptation for Electromyography | Unknown | N/A | |
| Stress-Testing Long-Context Language Models with Lifelong ICL and Task Haystack | Unknown | N/A | |
| ProbTS: Benchmarking Point and Distributional Forecasting across Diverse Prediction Horizons | Unknown | N/A | |
| Beyond Prompts: Dynamic Conversational Benchmarking of Large Language Models | Unknown | N/A | |
| BLEnD: A Benchmark for LLMs on Everyday Knowledge in Diverse Cultures and Languages | Unknown | N/A | |
| ProgressGym: Alignment with a Millennium of Moral Progress | Unknown | N/A | |
| Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows? | Unknown | N/A | |
| TSGM: A Flexible Framework for Generative Modeling of Synthetic Time Series | Unknown | N/A | |
| BABILong: Testing the Limits of LLMs with Long Context Reasoning-in-a-Haystack | Unknown | N/A | |
| $\texttt{ConflictBank}$: A Benchmark for Evaluating the Influence of Knowledge Conflicts in LLMs | Unknown | N/A | |
| Classic GNNs are Strong Baselines: Reassessing GNNs for Node Classification | Unknown | N/A | |
| FairJob: A Real-World Dataset for Fairness in Online Systems | Unknown | N/A | |
| Multi-Chain Graphs of Graphs: A New Approach to Analyzing Blockchain Datasets | Unknown | N/A | |
| CRAG - Comprehensive RAG Benchmark | Unknown | N/A | |
| NAVSIM: Data-Driven Non-Reactive Autonomous Vehicle Simulation and Benchmarking | Unknown | N/A | |
| Learning Structure-Aware Representations of Dependent Types | Unknown | N/A | |
| What Rotary Position Embedding Can Tell Us: Identifying Query and Key Weights Corresponding to Basic Syntactic or High-level Semantic Information | Unknown | N/A | |
| Large Language Model Unlearning via Embedding-Corrupted Prompts | Unknown | N/A | |
| Attack-Resilient Image Watermarking Using Stable Diffusion | Unknown | N/A | |
| Unlock the Intermittent Control Ability of Model Free Reinforcement Learning | Unknown | N/A | |
| Adaptive Passive-Aggressive Framework for Online Regression with Side Information | Unknown | N/A | |
| Accelerating Nash Equilibrium Convergence in Monte Carlo Settings Through Counterfactual Value Based Fictitious Play | Unknown | N/A | |
| Transition Constrained Bayesian Optimization via Markov Decision Processes | Unknown | N/A | |
| Fairness-Aware Meta-Learning via Nash Bargaining | Unknown | N/A | |
| The Group Robustness is in the Details: Revisiting Finetuning under Spurious Correlations | Unknown | N/A | |
| Transcendence: Generative Models Can Outperform The Experts That Train Them | Unknown | N/A | |
| RoME: A Robust Mixed-Effects Bandit Algorithm for Optimizing Mobile Health Interventions | Unknown | N/A | |
| FactorizePhys: Matrix Factorization for Multidimensional Attention in Remote Physiological Sensing | Unknown | N/A | |
| StreamingDialogue: Prolonged Dialogue Learning via Long Context Compression with Minimal Losses | Unknown | N/A | |
| AutoPSV: Automated Process-Supervised Verifier | Unknown | N/A | |
| Disentangled Unsupervised Skill Discovery for Efficient Hierarchical Reinforcement Learning | Unknown | N/A | |
| Frustratingly Easy Test-Time Adaptation of Vision-Language Models | Unknown | N/A | |
| Who's asking? User personas and the mechanics of latent misalignment | Unknown | N/A | |
| SpeAr: A Spectral Approach for Zero-Shot Node Classification | Unknown | N/A | |
| Navigating the Effect of Parametrization for Dimensionality Reduction | Unknown | N/A | |
| EgoChoir: Capturing 3D Human-Object Interaction Regions from Egocentric Views | Unknown | N/A | |
| Confidence Calibration of Classifiers with Many Classes | Unknown | N/A | |
| Image-aware Evaluation of Generated Medical Reports | Unknown | N/A | |
| CryoGEM: Physics-Informed Generative Cryo-Electron Microscopy | Unknown | N/A | |
| Nuclear Norm Regularization for Deep Learning | Unknown | N/A | |
| Adversarial Environment Design via Regret-Guided Diffusion Models | Unknown | N/A | |
| Multiple Physics Pretraining for Spatiotemporal Surrogate Models | Unknown | N/A | |
| TARP-VP: Towards Evaluation of Transferred Adversarial Robustness and Privacy on Label Mapping Visual Prompting Models | Unknown | N/A | |
| Mission Impossible: A Statistical Perspective on Jailbreaking LLMs | Unknown | N/A | |
| Infinite Limits of Multi-head Transformer Dynamics | Unknown | N/A | |
| Frozen-DETR: Enhancing DETR with Image Understanding from Frozen Foundation Models | Unknown | N/A | |
| Discovering Preference Optimization Algorithms with and for Large Language Models | Unknown | N/A | |
| Overcoming Brittleness in Pareto-Optimal Learning Augmented Algorithms | Unknown | N/A | |
| Neur2BiLO: Neural Bilevel Optimization | Unknown | N/A | |
| In-Context Symmetries: Self-Supervised Learning through Contextual World Models | Unknown | N/A | |
| Leveraging Drift to Improve Sample Complexity of Variance Exploding Diffusion Models | Unknown | N/A | |
| Foundation Inference Models for Markov Jump Processes | Unknown | N/A | |
| The Factorization Curse: Which Tokens You Predict Underlie the Reversal Curse and More | Unknown | N/A | |
| Adaptive Important Region Selection with Reinforced Hierarchical Search for Dense Object Detection | Unknown | N/A | |
| A PID Controller Approach for Adaptive Probability-dependent Gradient Decay in Model Calibration | Unknown | N/A | |
| LaKD: Length-agnostic Knowledge Distillation for Trajectory Prediction with Any Length Observations | Unknown | N/A | |
| Metric Flow Matching for Smooth Interpolations on the Data Manifold | Unknown | N/A | |
| The Unmet Promise of Synthetic Training Images: Using Retrieved Real Images Performs Better | Unknown | N/A | |
| SimVG: A Simple Framework for Visual Grounding with Decoupled Multi-modal Fusion | Unknown | N/A | |
| Compositional Generalization Across Distributional Shifts with Sparse Tree Operations | Unknown | N/A | |
| Mesa-Extrapolation: A Weave Position Encoding Method for Enhanced Extrapolation in LLMs | Unknown | N/A | |
| Found in the Middle: How Language Models Use Long Contexts Better via Plug-and-Play Positional Encoding | Unknown | N/A | |
| Label Noise: Ignorance Is Bliss | Unknown | N/A | |
| TrajCLIP: Pedestrian trajectory prediction method using contrastive learning and idempotent networks | Unknown | N/A | |
| Pretrained Optimization Model for Zero-Shot Black Box Optimization | Unknown | N/A | |
| MatFormer: Nested Transformer for Elastic Inference | Unknown | N/A | |
| Approximating the Top Eigenvector in Random Order Streams | Unknown | N/A | |
| Graph Neural Networks Need Cluster-Normalize-Activate Modules | Unknown | N/A | |
| Exploiting Descriptive Completeness Prior for Cross Modal Hashing with Incomplete Labels | Unknown | N/A | |
| Efficient Leverage Score Sampling for Tensor Train Decomposition | Unknown | N/A | |
| Non-Stationary Learning of Neural Networks with Automatic Soft Parameter Reset | Unknown | N/A | |
| Reconstruct and Match: Out-of-Distribution Robustness via Topological Homogeneity | Unknown | N/A | |
| Localized Adaptive Risk Control | Unknown | N/A | |
| Differentiable Modal Synthesis for Physical Modeling of Planar String Sound and Motion Simulation | Unknown | N/A | |
| SpikedAttention: Training-Free and Fully Spike-Driven Transformer-to-SNN Conversion with Winner-Oriented Spike Shift for Softmax Operation | Unknown | N/A | |
| Rethinking Imbalance in Image Super-Resolution for Efficient Inference | Unknown | N/A | |
| ReFT: Representation Finetuning for Language Models | Unknown | N/A | |
| SLIM: Style-Linguistics Mismatch Model for Generalized Audio Deepfake Detection | Unknown | N/A | |
| Speculative Monte-Carlo Tree Search | Unknown | N/A | |
| GFlowNet Assisted Biological Sequence Editing | Unknown | N/A | |
| SPARKLE: A Unified Single-Loop Primal-Dual Framework for Decentralized Bilevel Optimization | Unknown | N/A | |
| CODE: Contrasting Self-generated Description to Combat Hallucination in Large Multi-modal Models | Unknown | N/A | |
| Point-PRC: A Prompt Learning Based Regulation Framework for Generalizable Point Cloud Analysis | Unknown | N/A | |
| An Information Theoretic Perspective on Conformal Prediction | Unknown | N/A | |
| Improving robustness to corruptions with multiplicative weight perturbations | Unknown | N/A | |
| Pandora's Box: Towards Building Universal Attackers against Real-World Large Vision-Language Models | Unknown | N/A | |
| Optimal Multi-Fidelity Best-Arm Identification | Unknown | N/A | |
| Panacea: Pareto Alignment via Preference Adaptation for LLMs | Unknown | N/A | |
| Understanding the Gains from Repeated Self-Distillation | Unknown | N/A | |
| Grid4D: 4D Decomposed Hash Encoding for High-Fidelity Dynamic Gaussian Splatting | Unknown | N/A | |
| Wormhole Loss for Partial Shape Matching | Unknown | N/A | |
| Alias-Free Mamba Neural Operator | Unknown | N/A | |
| Generalizable and Animatable Gaussian Head Avatar | Unknown | N/A | |
| Diffeomorphic interpolation for efficient persistence-based topological optimization | Unknown | N/A | |
| Quasi-Bayes meets Vines | Unknown | N/A | |
| In-N-Out: Lifting 2D Diffusion Prior for 3D Object Removal via Tuning-Free Latents Alignment | Unknown | N/A | |
| Improving the Worst-Case Bidirectional Communication Complexity for Nonconvex Distributed Optimization under Function Similarity | Unknown | N/A | |
| Federated Fine-tuning of Large Language Models under Heterogeneous Tasks and Client Resources | Unknown | N/A | |
| Metric Space Magnitude for Evaluating the Diversity of Latent Representations | Unknown | N/A | |
| SARAD: Spatial Association-Aware Anomaly Detection and Diagnosis for Multivariate Time Series | Unknown | N/A | |
| Boosting Vision-Language Models with Transduction | Unknown | N/A | |
| In-Context Learning State Vector with Inner and Momentum Optimization | Unknown | N/A | |
| Unveiling the Tapestry of Consistency in Large Vision-Language Models | Unknown | N/A | |
| Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction | Unknown | N/A | |
| Diversify, Contextualize, and Adapt: Efficient Entropy Modeling for Neural Image Codec | Unknown | N/A | |
| Image Copy Detection for Diffusion Models | Unknown | N/A | |
| End-To-End Causal Effect Estimation from Unstructured Natural Language Data | Unknown | N/A | |
| Weak-eval-Strong: Evaluating and Eliciting Lateral Thinking of LLMs with Situation Puzzles | Unknown | N/A | |
| Loss Landscape Characterization of Neural Networks without Over-Parametrization | Unknown | N/A | |
| Generalize or Detect? Towards Robust Semantic Segmentation Under Multiple Distribution Shifts | Unknown | N/A | |
| Provable Benefits of Complex Parameterizations for Structured State Space Models | Unknown | N/A | |
| Breaking Determinism: Fuzzy Modeling of Sequential Recommendation Using Discrete State Space Diffusion Model | Unknown | N/A | |
| InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD | Unknown | N/A | |
| PaGoDA: Progressive Growing of a One-Step Generator from a Low-Resolution Diffusion Teacher | Unknown | N/A | |
| TPC: Test-time Procrustes Calibration for Diffusion-based Human Image Animation | Unknown | N/A | |
| Continuous Partitioning for Graph-Based Semi-Supervised Learning | Unknown | N/A | |
| YOLOv10: Real-Time End-to-End Object Detection | Unknown | N/A | |
| Fixed Confidence Best Arm Identification in the Bayesian Setting | Unknown | N/A | |
| SimGen: Simulator-conditioned Driving Scene Generation | Unknown | N/A | |
| Low-Rank Optimal Transport through Factor Relaxation with Latent Coupling | Unknown | N/A | |
| Fast Last-Iterate Convergence of Learning in Games Requires Forgetful Algorithms | Unknown | N/A | |
| An engine not a camera: Measuring performative power of online search | Unknown | N/A | |
| Entropy-regularized Diffusion Policy with Q-Ensembles for Offline Reinforcement Learning | Unknown | N/A | |
| CoMERA: Computing- and Memory-Efficient Training via Rank-Adaptive Tensor Optimization | Unknown | N/A | |
| Is O(log N) practical? Near-Equivalence Between Delay Robustness and Bounded Regret in Bandits and RL | Unknown | N/A | |
| How Does Message Passing Improve Collaborative Filtering? | Unknown | N/A | |
| DRACO: A Denoising-Reconstruction Autoencoder for Cryo-EM | Unknown | N/A | |
| Divide-and-Conquer Predictive Coding: a structured Bayesian inference algorithm | Unknown | N/A | |
| Embedding Trajectory for Out-of-Distribution Detection in Mathematical Reasoning | Unknown | N/A | |
| Nonlocal Attention Operator: Materializing Hidden Knowledge Towards Interpretable Physics Discovery | Unknown | N/A | |
| Mixture of Experts Meets Prompt-Based Continual Learning | Unknown | N/A | |
| BoNBoN Alignment for Large Language Models and the Sweetness of Best-of-n Sampling | Unknown | N/A | |
| Vision-Language Models are Strong Noisy Label Detectors | Unknown | N/A | |
| Policy-shaped prediction: avoiding distractions in model-based reinforcement learning | Unknown | N/A | |
| Inverse M-Kernels for Linear Universal Approximators of Non-Negative Functions | Unknown | N/A | |
| Interpretable Image Classification with Adaptive Prototype-based Vision Transformers | Unknown | N/A | |
| HippoRAG: Neurobiologically Inspired Long-Term Memory for Large Language Models | Unknown | N/A | |
| Convergence of $\text{log}(1/\epsilon)$ for Gradient-Based Algorithms in Zero-Sum Games without the Condition Number: A Smoothed Analysis | Unknown | N/A | |
| RGFN: Synthesizable Molecular Generation Using GFlowNets | Unknown | N/A | |
| CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts | Unknown | N/A | |
| Set-based Neural Network Encoding Without Weight Tying | Unknown | N/A | |
| Pre-trained Large Language Models Use Fourier Features to Compute Addition | Unknown | N/A | |
| Reconstruction Attacks on Machine Unlearning: Simple Models are Vulnerable | Unknown | N/A | |
| Learning to Predict Structural Vibrations | Unknown | N/A | |
| Causal language modeling can elicit search and reasoning capabilities on logic puzzles | Unknown | N/A | |
| Interpretable Lightweight Transformer via Unrolling of Learned Graph Smoothness Priors | Unknown | N/A | |
| SkiLD: Unsupervised Skill Discovery Guided by Factor Interactions | Unknown | N/A | |
| HiCo: Hierarchical Controllable Diffusion Model for Layout-to-image Generation | Unknown | N/A | |
| S-SOS: Stochastic Sum-Of-Squares for Parametric Polynomial Optimization | Unknown | N/A | |
| $C^2M^3$: Cycle-Consistent Multi-Model Merging | Unknown | N/A | |
| ReFIR: Grounding Large Restoration Models with Retrieval Augmentation | Unknown | N/A | |
| Improved Algorithms for Contextual Dynamic Pricing | Unknown | N/A | |
| Are Self-Attentions Effective for Time Series Forecasting? | Unknown | N/A | |
| Epipolar-Free 3D Gaussian Splatting for Generalizable Novel View Synthesis | Unknown | N/A | |
| Expectation Alignment: Handling Reward Misspecification in the Presence of Expectation Mismatch | Unknown | N/A | |
| SubgDiff: A Subgraph Diffusion Model to Improve Molecular Representation Learning | Unknown | N/A | |
| Stochastic Concept Bottleneck Models | Unknown | N/A | |
| Physics-Informed Regularization for Domain-Agnostic Dynamical System Modeling | Unknown | N/A | |
| Theoretical guarantees in KL for Diffusion Flow Matching | Unknown | N/A | |
| DeepDRK: Deep Dependency Regularized Knockoff for Feature Selection | Unknown | N/A | |
| Variational Distillation of Diffusion Policies into Mixture of Experts | Unknown | N/A | |
| Only Strict Saddles in the Energy Landscape of Predictive Coding Networks? | Unknown | N/A | |
| COSMIC: Compress Satellite Image Efficiently via Diffusion Compensation | Unknown | N/A | |
| Reinforcement Learning with LTL and $\omega$-Regular Objectives via Optimality-Preserving Translation to Average Rewards | Unknown | N/A | |
| Accurate and Steady Inertial Pose Estimation through Sequence Structure Learning and Modulation | Unknown | N/A | |
| Does Worst-Performing Agent Lead the Pack? Analyzing Agent Dynamics in Unified Distributed SGD | Unknown | N/A | |
| MV2Cyl: Reconstructing 3D Extrusion Cylinders from Multi-View Images | Unknown | N/A | |
| Multi-Object 3D Grounding with Dynamic Modules and Language-Informed Spatial Attention | Unknown | N/A | |
| Generalization Bound and Learning Methods for Data-Driven Projections in Linear Programming | Unknown | N/A | |
| Sparsity-Agnostic Linear Bandits with Adaptive Adversaries | Unknown | N/A | |
| Autoregressive Image Diffusion: Generation of Image Sequence and Application in MRI | Unknown | N/A | |
| Towards Understanding the Working Mechanism of Text-to-Image Diffusion Model | Unknown | N/A | |
| Exploring Behavior-Relevant and Disentangled Neural Dynamics with Generative Diffusion Models | Unknown | N/A | |
| One Sample Fits All: Approximating All Probabilistic Values Simultaneously and Efficiently | Unknown | N/A | |
| Nonstationary Sparse Spectral Permanental Process | Unknown | N/A | |
| Sketchy Moment Matching: Toward Fast and Provable Data Selection for Finetuning | Unknown | N/A | |
| DARNet: Dual Attention Refinement Network with Spatiotemporal Construction for Auditory Attention Detection | Unknown | N/A | |
| Statistical Multicriteria Benchmarking via the GSD-Front | Unknown | N/A | |
| Data-faithful Feature Attribution: Mitigating Unobservable Confounders via Instrumental Variables | Unknown | N/A | |
| Small coresets via negative dependence: DPPs, linear statistics, and concentration | Unknown | N/A | |
| Advection Augmented Convolutional Neural Networks | Unknown | N/A | |
| Efficient Temporal Action Segmentation via Boundary-aware Query Voting | Unknown | N/A | |
| Towards Croppable Implicit Neural Representations | Unknown | N/A | |
| EGODE: An Event-attended Graph ODE Framework for Modeling Rigid Dynamics | Unknown | N/A | |
| RFLPA: A Robust Federated Learning Framework against Poisoning Attacks with Secure Aggregation | Unknown | N/A | |
| Dual Prototype Evolving for Test-Time Generalization of Vision-Language Models | Unknown | N/A | |
| RL in Latent MDPs is Tractable: Online Guarantees via Off-Policy Evaluation | Unknown | N/A | |
| Solving Minimum-Cost Reach Avoid using Reinforcement Learning | Unknown | N/A | |
| Multi-Label Open Set Recognition | Unknown | N/A | |
| Off-Dynamics Reinforcement Learning via Domain Adaptation and Reward Augmented Imitation | Unknown | N/A | |
| Unveiling the Potential of Robustness in Selecting Conditional Average Treatment Effect Estimators | Unknown | N/A | |
| Beyond Optimism: Exploration With Partially Observable Rewards | Unknown | N/A | |
| Understanding and Improving Adversarial Collaborative Filtering for Robust Recommendation | Unknown | N/A | |
| Diffusion-Reward Adversarial Imitation Learning | Unknown | N/A | |
| Large Language Models Play StarCraft II: Benchmarks and A Chain of Summarization Approach | Unknown | N/A | |
| On the Robustness of Spectral Algorithms for Semirandom Stochastic Block Models | Unknown | N/A | |
| An Analysis of Elo Rating Systems via Markov Chains | Unknown | N/A | |
| Fairness and Efficiency in Online Class Matching | Unknown | N/A | |
| This Too Shall Pass: Removing Stale Observations in Dynamic Bayesian Optimization | Unknown | N/A | |
| Chain of Thoughtlessness? An Analysis of CoT in Planning | Unknown | N/A | |
| Solving Sparse & High-Dimensional-Output Regression via Compression | Unknown | N/A | |
| Dimension-free Private Mean Estimation for Anisotropic Distributions | Unknown | N/A | |
| Data Attribution for Text-to-Image Models by Unlearning Synthesized Images | Unknown | N/A | |
| Repurposing Language Models into Embedding Models: Finding the Compute-Optimal Recipe | Unknown | N/A | |
| Multi-Winner Reconfiguration | Unknown | N/A | |
| Boosting the Transferability of Adversarial Attack on Vision Transformer with Adaptive Token Tuning | Unknown | N/A | |
| Operator World Models for Reinforcement Learning | Unknown | N/A | |
| A generalized neural tangent kernel for surrogate gradient learning | Unknown | N/A | |
| HydraViT: Stacking Heads for a Scalable ViT | Unknown | N/A | |
| Normal-GS: 3D Gaussian Splatting with Normal-Involved Rendering | Unknown | N/A | |
| Group-wise oracle-efficient algorithms for online multi-group learning | Unknown | N/A | |
| Functional Gradient Flows for Constrained Sampling | Unknown | N/A | |
| Optimal Flow Matching: Learning Straight Trajectories in Just One Step | Unknown | N/A | |
| BMRS: Bayesian Model Reduction for Structured Pruning | Unknown | N/A | |
| VB-LoRA: Extreme Parameter Efficient Fine-Tuning with Vector Banks | Unknown | N/A | |
| Acceleration Exists! Optimization Problems When Oracle Can Only Compare Objective Function Values | Unknown | N/A | |
| LP-3DGS: Learning to Prune 3D Gaussian Splatting | Unknown | N/A | |
| Diffusion Actor-Critic with Entropy Regulator | Unknown | N/A | |
| Practical $0.385$-Approximation for Submodular Maximization Subject to a Cardinality Constraint | Unknown | N/A | |
| Toward Conditional Distribution Calibration in Survival Prediction | Unknown | N/A | |
| RMLR: Extending Multinomial Logistic Regression into General Geometries | Unknown | N/A | |
| Replicable Uniformity Testing | Unknown | N/A | |
| Real-time Core-Periphery Guided ViT with Smart Data Layout Selection on Mobile Devices | Unknown | N/A | |
| Unitary Convolutions for Learning on Graphs and Groups | Unknown | N/A | |
| Thought of Search: Planning with Language Models Through The Lens of Efficiency | Unknown | N/A | |
| PaCE: Parsimonious Concept Engineering for Large Language Models | Unknown | N/A | |
| Optimizing over Multiple Distributions under Generalized Quasar-Convexity Condition | Unknown | N/A | |
| Accelerated Regularized Learning in Finite N-Person Games | Unknown | N/A | |
| From Chaos to Clarity: 3DGS in the Dark | Unknown | N/A | |
| Full-Distance Evasion of Pedestrian Detectors in the Physical World | Unknown | N/A | |
| Fine-grained Analysis of In-context Linear Estimation: Data, Architecture, and Beyond | Unknown | N/A | |
| Deep Learning in Medical Image Registration: Magic or Mirage? | Unknown | N/A | |
| Cascade Speculative Drafting for Even Faster LLM Inference | Unknown | N/A | |
| VISA: Variational Inference with Sequential Sample-Average Approximations | Unknown | N/A | |
| OPEL: Optimal Transport Guided ProcedurE Learning | Unknown | N/A | |
| Shuffling Gradient-Based Methods for Nonconvex-Concave Minimax Optimization | Unknown | N/A | |
| (FL)$^2$: Overcoming Few Labels in Federated Semi-Supervised Learning | Unknown | N/A | |
| UniGAD: Unifying Multi-level Graph Anomaly Detection | Unknown | N/A | |
| Clustering with Non-adaptive Subset Queries | Unknown | N/A | |
| Compressing Large Language Models using Low Rank and Low Precision Decomposition | Unknown | N/A | |
| Black-Box Forgetting | Unknown | N/A | |
| CODA: A Correlation-Oriented Disentanglement and Augmentation Modeling Scheme for Better Resisting Subpopulation Shifts | Unknown | N/A | |
| Exploiting the Replay Memory Before Exploring the Environment: Enhancing Reinforcement Learning Through Empirical MDP Iteration | Unknown | N/A | |
| UniBias: Unveiling and Mitigating LLM Bias through Internal Attention and FFN Manipulation | Unknown | N/A | |
| EEGPT: Pretrained Transformer for Universal and Reliable Representation of EEG Signals | Unknown | N/A | |
| MotionCraft: Physics-Based Zero-Shot Video Generation | Unknown | N/A | |
| Controlling Multiple Errors Simultaneously with a PAC-Bayes Bound | Unknown | N/A | |
| Search for Efficient Large Language Models | Unknown | N/A | |
| Predictive Attractor Models | Unknown | N/A | |
| Do LLMs Build World Representations? Probing Through the Lens of State Abstraction | Unknown | N/A | |
| Should We Really Edit Language Models? On the Evaluation of Edited Language Models | Unknown | N/A | |
| GLinSAT: The General Linear Satisfiability Neural Network Layer By Accelerated Gradient Descent | Unknown | N/A | |
| Generalization Error Bounds for Two-stage Recommender Systems with Tree Structure | Unknown | N/A | |
| Practical Bayesian Algorithm Execution via Posterior Sampling | Unknown | N/A | |
| Fetch and Forge: Efficient Dataset Condensation for Object Detection | Unknown | N/A | |
| START: A Generalized State Space Model with Saliency-Driven Token-Aware Transformation | Unknown | N/A | |
| Long-tailed Object Detection Pretraining: Dynamic Rebalancing Contrastive Learning with Dual Reconstruction | Unknown | N/A | |
| MaNo: Exploiting Matrix Norm for Unsupervised Accuracy Estimation Under Distribution Shifts | Unknown | N/A | |
| Enhancing Diversity in Bayesian Deep Learning via Hyperspherical Energy Minimization of CKA | Unknown | N/A | |
| Unsupervised Homography Estimation on Multimodal Image Pair via Alternating Optimization | Unknown | N/A | |
| Adaptive Image Quality Assessment via Teaching Large Multimodal Model to Compare | Unknown | N/A | |
| Quality-Improved and Property-Preserved Polarimetric Imaging via Complementarily Fusing | Unknown | N/A | |
| A Simple yet Scalable Granger Causal Structural Learning Approach for Topological Event Sequences | Unknown | N/A | |
| Improving the Training of Rectified Flows | Unknown | N/A | |
| DDGS-CT: Direction-Disentangled Gaussian Splatting for Realistic Volume Rendering | Unknown | N/A | |
| BrainBits: How Much of the Brain are Generative Reconstruction Methods Using? | Unknown | N/A | |
| Once Read is Enough: Domain-specific Pretraining-free Language Models with Cluster-guided Sparse Experts for Long-tail Domain Knowledge | Unknown | N/A | |
| Learning Cooperative Trajectory Representations for Motion Forecasting | Unknown | N/A | |
| RGMDT: Return-Gap-Minimizing Decision Tree Extraction in Non-Euclidean Metric Space | Unknown | N/A | |
| Semantics and Spatiality of Emergent Communication | Unknown | N/A | |
| Learning-Augmented Approximation Algorithms for Maximum Cut and Related Problems | Unknown | N/A | |
| Non-asymptotic Global Convergence Analysis of BFGS with the Armijo-Wolfe Line Search | Unknown | N/A | |
| FreeSplat: Generalizable 3D Gaussian Splatting Towards Free View Synthesis of Indoor Scenes | Unknown | N/A | |
| Learning De-Biased Representations for Remote-Sensing Imagery | Unknown | N/A | |
| A Novel Unified Architecture for Low-Shot Counting by Detection and Segmentation | Unknown | N/A | |
| BLAST: Block-Level Adaptive Structured Matrices for Efficient Deep Neural Network Inference | Unknown | N/A | |
| Task-oriented Time Series Imputation Evaluation via Generalized Representers | Unknown | N/A | |
| Evidential Mixture Machines: Deciphering Multi-Label Correlations for Active Learning Sensitivity | Unknown | N/A | |
| Optimal Design for Human Preference Elicitation | Unknown | N/A | |
| Recurrent Reinforcement Learning with Memoroids | Unknown | N/A | |
| Applying Guidance in a Limited Interval Improves Sample and Distribution Quality in Diffusion Models | Unknown | N/A | |
| UniTS: A Unified Multi-Task Time Series Model | Unknown | N/A | |
| Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning | Unknown | N/A | |
| FineCLIP: Self-distilled Region-based CLIP for Better Fine-grained Understanding | Unknown | N/A | |
| TinyLUT: Tiny Look-Up Table for Efficient Image Restoration at the Edge | Unknown | N/A | |
| DiffSF: Diffusion Models for Scene Flow Estimation | Unknown | N/A | |
| Shared Autonomy with IDA: Interventional Diffusion Assistance | Unknown | N/A | |
| To Err Like Human: Affective Bias-Inspired Measures for Visual Emotion Recognition Evaluation | Unknown | N/A | |
| UGC: Universal Graph Coarsening | Unknown | N/A | |
| Fight Back Against Jailbreaking via Prompt Adversarial Tuning | Unknown | N/A | |
| Toward Real Ultra Image Segmentation: Leveraging Surrounding Context to Cultivate General Segmentation Model | Unknown | N/A | |
| SEEV: Synthesis with Efficient Exact Verification for ReLU Neural Barrier Functions | Unknown | N/A | |
| Graph Neural Networks Do Not Always Oversmooth | Unknown | N/A | |
| Gradient-free Decoder Inversion in Latent Diffusion Models | Unknown | N/A | |
| A Geometric View of Data Complexity: Efficient Local Intrinsic Dimension Estimation with Diffusion Models | Unknown | N/A | |
| HaloScope: Harnessing Unlabeled LLM Generations for Hallucination Detection | Unknown | N/A | |
| DistrictNet: Decision-aware learning for geographical districting | Unknown | N/A | |
| Visual Fourier Prompt Tuning | Unknown | N/A | |
| NeuralFuse: Learning to Recover the Accuracy of Access-Limited Neural Network Inference in Low-Voltage Regimes | Unknown | N/A | |
| Estimating Generalization Performance Along the Trajectory of Proximal SGD in Robust Regression | Unknown | N/A | |
| Constrained Binary Decision Making | Unknown | N/A | |
| Precise asymptotics of reweighted least-squares algorithms for linear diagonal networks | Unknown | N/A | |
| VisionLLM v2: An End-to-End Generalist Multimodal Large Language Model for Hundreds of Vision-Language Tasks | Unknown | N/A | |
| 3D Equivariant Pose Regression via Direct Wigner-D Harmonics Prediction | Unknown | N/A | |
| If You Want to Be Robust, Be Wary of Initialization | Unknown | N/A | |
| Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing | Unknown | N/A | |
| LookHere: Vision Transformers with Directed Attention Generalize and Extrapolate | Unknown | N/A | |
| NeuralSteiner: Learning Steiner Tree for Overflow-avoiding Global Routing in Chip Design | Unknown | N/A | |
| Masked Pre-training Enables Universal Zero-shot Denoiser | Unknown | N/A | |
| Prompt-Agnostic Adversarial Perturbation for Customized Diffusion Models | Unknown | N/A | |
| Posture-Informed Muscular Force Learning for Robust Hand Pressure Estimation | Unknown | N/A | |
| Intruding with Words: Towards Understanding Graph Injection Attacks at the Text Level | Unknown | N/A | |
| Adversarially Robust Dense-Sparse Tradeoffs via Heavy-Hitters | Unknown | N/A | |
| BiDM: Pushing the Limit of Quantization for Diffusion Models | Unknown | N/A | |
| Private Stochastic Convex Optimization with Heavy Tails: Near-Optimality from Simple Reductions | Unknown | N/A | |
| Boosting the Potential of Large Language Models with an Intelligent Information Assistant | Unknown | N/A | |
| Generalization Bounds via Conditional $f$-Information | Unknown | N/A | |
| Recovering Complete Actions for Cross-dataset Skeleton Action Recognition | Unknown | N/A | |
| Carrot and Stick: Eliciting Comparison Data and Beyond | Unknown | N/A | |
| RAW: A Robust and Agile Plug-and-Play Watermark Framework for AI-Generated Images with Provable Guarantees | Unknown | N/A | |
| Treatment of Statistical Estimation Problems in Randomized Smoothing for Adversarial Robustness | Unknown | N/A | |
| CSPG: Crossing Sparse Proximity Graphs for Approximate Nearest Neighbor Search | Unknown | N/A | |
| Efficient Discrepancy Testing for Learning with Distribution Shift | Unknown | N/A | |
| Learning to Mitigate Externalities: the Coase Theorem with Hindsight Rationality | Unknown | N/A | |
| Forgetting, Ignorance or Myopia: Revisiting Key Challenges in Online Continual Learning | Unknown | N/A | |
| Optimal ablation for interpretability | Unknown | N/A | |
| Getting More Juice Out of the SFT Data: Reward Learning from Human Demonstration Improves SFT for LLM Alignment | Unknown | N/A | |
| Compositional 3D-aware Video Generation with LLM Director | Unknown | N/A | |
| CALANet: Cheap All-Layer Aggregation for Human Activity Recognition | Unknown | N/A | |
| Exploration by Learning Diverse Skills through Successor State Representations | Unknown | N/A | |
| Exact, Tractable Gauss-Newton Optimization in Deep Reversible Architectures Reveal Poor Generalization | Unknown | N/A | |
| Particle Semi-Implicit Variational Inference | Unknown | N/A | |
| A Walsh Hadamard Derived Linear Vector Symbolic Architecture | Unknown | N/A | |
| Learning to Solve Quadratic Unconstrained Binary Optimization in a Classification Way | Unknown | N/A | |
| From Dictionary to Tensor: A Scalable Multi-View Subspace Clustering Framework with Triple Information Enhancement | Unknown | N/A | |
| Dual Risk Minimization: Towards Next-Level Robustness in Fine-tuning Zero-Shot Models | Unknown | N/A | |
| Unlocking the Capabilities of Thought: A Reasoning Boundary Framework to Quantify and Optimize Chain-of-Thought | Unknown | N/A | |
| Constrained Latent Action Policies for Model-Based Offline Reinforcement Learning | Unknown | N/A | |
| Faster Accelerated First-order Methods for Convex Optimization with Strongly Convex Function Constraints | Unknown | N/A | |
| Span-Based Optimal Sample Complexity for Weakly Communicating and General Average Reward MDPs | Unknown | N/A | |
| Mixture of Scales: Memory-Efficient Token-Adaptive Binarization for Large Language Models | Unknown | N/A | |
| Deep Submodular Peripteral Networks | Unknown | N/A | |
| Speculative Decoding with CTC-based Draft Model for LLM Inference Acceleration | Unknown | N/A | |
| Refusal in Language Models Is Mediated by a Single Direction | Unknown | N/A | |
| Scalable Constrained Policy Optimization for Safe Multi-agent Reinforcement Learning | Unknown | N/A | |
| The Limits of Transfer Reinforcement Learning with Latent Low-rank Structure | Unknown | N/A | |
| Artificial Generational Intelligence: Cultural Accumulation in Reinforcement Learning | Unknown | N/A | |
| Unleashing Multispectral Video's Potential in Semantic Segmentation: A Semi-supervised Viewpoint and New UAV-View Benchmark | Unknown | N/A | |
| What Variables Affect Out-of-Distribution Generalization in Pretrained Models? | Unknown | N/A | |
| MSAGPT: Neural Prompting Protein Structure Prediction via MSA Generative Pre-Training | Unknown | N/A | |
| E-Motion: Future Motion Simulation via Event Sequence Diffusion | Unknown | N/A | |
| ReMoDetect: Reward Models Recognize Aligned LLM's Generations | Unknown | N/A | |
| Sequence-Augmented SE(3)-Flow Matching For Conditional Protein Generation | Unknown | N/A | |
| Scaling Laws for Reward Model Overoptimization in Direct Alignment Algorithms | Unknown | N/A | |
| AdaptiveISP: Learning an Adaptive Image Signal Processor for Object Detection | Unknown | N/A | |
| Near-Optimal Distributed Minimax Optimization under the Second-Order Similarity | Unknown | N/A | |
| Wasserstein Distributionally Robust Optimization through the Lens of Structural Causal Models and Individual Fairness | Unknown | N/A | |
| Algorithmic Capabilities of Random Transformers | Unknown | N/A | |
| SuperDeepFool: a new fast and accurate minimal adversarial attack | Unknown | N/A | |
| CAT: Coordinating Anatomical-Textual Prompts for Multi-Organ and Tumor Segmentation | Unknown | N/A | |
| Frieren: Efficient Video-to-Audio Generation Network with Rectified Flow Matching | Unknown | N/A | |
| Spectral Editing of Activations for Large Language Model Alignment | Unknown | N/A | |
| Hydra: Bidirectional State Space Models Through Generalized Matrix Mixers | Unknown | N/A | |
| Stochastic Kernel Regularisation Improves Generalisation in Deep Kernel Machines | Unknown | N/A | |
| Efficient Combinatorial Optimization via Heat Diffusion | Unknown | N/A | |
| Achieving $\tilde{O}(1/\epsilon)$ Sample Complexity for Constrained Markov Decision Process | Unknown | N/A | |
| Abstracted Shapes as Tokens - A Generalizable and Interpretable Model for Time-series Classification | Unknown | N/A | |
| Train-Attention: Meta-Learning Where to Focus in Continual Knowledge Learning | Unknown | N/A | |
| Apathetic or Empathetic? Evaluating LLMs' Emotional Alignments with Humans | Unknown | N/A | |
| Enhancing LLM’s Cognition via Structurization | Unknown | N/A | |
| N-agent Ad Hoc Teamwork | Unknown | N/A | |
| Stealth edits to large language models | Unknown | N/A | |
| Small steps no more: Global convergence of stochastic gradient bandits for arbitrary learning rates | Unknown | N/A | |
| Get Rid of Isolation: A Continuous Multi-task Spatio-Temporal Learning Framework | Unknown | N/A | |
| Dynamic Neural Regeneration: Enhancing Deep Learning Generalization on Small Datasets | Unknown | N/A | |
| Differentially Private Equivalence Testing for Continuous Distributions and Applications | Unknown | N/A | |
| HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuning | Unknown | N/A | |
| Neural network learns low-dimensional polynomials with SGD near the information-theoretic limit | Unknown | N/A | |
| Persistence Homology Distillation for Semi-supervised Continual Learning | Unknown | N/A | |
| Recognize Any Regions | Unknown | N/A | |
| 4M-21: An Any-to-Any Vision Model for Tens of Tasks and Modalities | Unknown | N/A | |
| Emotion-LLaMA: Multimodal Emotion Recognition and Reasoning with Instruction Tuning | Unknown | N/A | |
| Geodesic Optimization for Predictive Shift Adaptation on EEG data | Unknown | N/A | |
| Expert-level protocol translation for self-driving labs | Unknown | N/A | |
| Uncovering, Explaining, and Mitigating the Superficial Safety of Backdoor Defense | Unknown | N/A | |
| Kaleido Diffusion: Improving Conditional Diffusion Models with Autoregressive Latent Modeling | Unknown | N/A | |
| The Evolution of Statistical Induction Heads: In-Context Learning Markov Chains | Unknown | N/A | |
| Protected Test-Time Adaptation via Online Entropy Matching: A Betting Approach | Unknown | N/A | |
| IODA: Instance-Guided One-shot Domain Adaptation for Super-Resolution | Unknown | N/A | |
| MAGIS: LLM-Based Multi-Agent Framework for GitHub Issue Resolution | Unknown | N/A | |
| SocraticLM: Exploring Socratic Personalized Teaching with Large Language Models | Unknown | N/A | |
| Symmetry Discovery Beyond Affine Transformations | Unknown | N/A | |
| Variational Multi-scale Representation for Estimating Uncertainty in 3D Gaussian Splatting | Unknown | N/A | |
| Bridging OOD Detection and Generalization: A Graph-Theoretic View | Unknown | N/A | |
| How Control Information Influences Multilingual Text Image Generation and Editing? | Unknown | N/A | |
| From Causal to Concept-Based Representation Learning | Unknown | N/A | |
| Multi-Scale VMamba: Hierarchy in Hierarchy Visual State Space Model | Unknown | N/A | |
| DOPPLER: Differentially Private Optimizers with Low-pass Filter for Privacy Noise Reduction | Unknown | N/A | |
| Neural Persistence Dynamics | Unknown | N/A | |
| CLAP4CLIP: Continual Learning with Probabilistic Finetuning for Vision-Language Models | Unknown | N/A | |
| Learning Successor Features the Simple Way | Unknown | N/A | |
| Iteratively Refined Early Interaction Alignment for Subgraph Matching based Graph Retrieval | Unknown | N/A | |
| No Free Lunch in LLM Watermarking: Trade-offs in Watermarking Design Choices | Unknown | N/A | |
| Debiasing Synthetic Data Generated by Deep Generative Models | Unknown | N/A | |
| Activating Self-Attention for Multi-Scene Absolute Pose Regression | Unknown | N/A | |
| Interaction-Force Transport Gradient Flows | Unknown | N/A | |
| Learning from Highly Sparse Spatio-temporal Data | Unknown | N/A | |
| Multi-turn Reinforcement Learning with Preference Human Feedback | Unknown | N/A | |
| Everyday Object Meets Vision-and-Language Navigation Agent via Backdoor | Unknown | N/A | |
| 3-in-1: 2D Rotary Adaptation for Efficient Finetuning, Efficient Batching and Composability | Unknown | N/A | |
| DeBaRA: Denoising-Based 3D Room Arrangement Generation | Unknown | N/A | |
| Decision-Making Behavior Evaluation Framework for LLMs under Uncertain Context | Unknown | N/A | |
| MIDGArD: Modular Interpretable Diffusion over Graphs for Articulated Designs | Unknown | N/A | |
| Distributed Least Squares in Small Space via Sketching and Bias Reduction | Unknown | N/A | |
| Meta-Reinforcement Learning with Universal Policy Adaptation: Provable Near-Optimality under All-task Optimum Comparator | Unknown | N/A | |
| FFAM: Feature Factorization Activation Map for Explanation of 3D Detectors | Unknown | N/A | |
| APIGen: Automated PIpeline for Generating Verifiable and Diverse Function-Calling Datasets | Unknown | N/A | |
| Kangaroo: Lossless Self-Speculative Decoding for Accelerating LLMs via Double Early Exiting | Unknown | N/A | |
| OTTER: Effortless Label Distribution Adaptation of Zero-shot Models | Unknown | N/A | |
| T2V-Turbo: Breaking the Quality Bottleneck of Video Consistency Model with Mixed Reward Feedback | Unknown | N/A | |
| Symbolic Regression with a Learned Concept Library | Unknown | N/A | |
| Generative Modelling of Structurally Constrained Graphs | Unknown | N/A | |
| Conformalized Credal Set Predictors | Unknown | N/A | |
| A robust inlier identification algorithm for point cloud registration via $\mathbf{\ell_0}$-minimization | Unknown | N/A | |
| Enhancing Motion in Text-to-Video Generation with Decomposed Encoding and Conditioning | Unknown | N/A | |
| Order-Independence Without Fine Tuning | Unknown | N/A | |
| Expressive Gaussian Human Avatars from Monocular RGB Video | Unknown | N/A | |
| AUCSeg: AUC-oriented Pixel-level Long-tail Semantic Segmentation | Unknown | N/A | |
| OptEx: Expediting First-Order Optimization with Approximately Parallelized Iterations | Unknown | N/A | |
| Learning Versatile Skills with Curriculum Masking | Unknown | N/A | |
| TopoLogic: An Interpretable Pipeline for Lane Topology Reasoning on Driving Scenes | Unknown | N/A | |
| A probability contrastive learning framework for 3D molecular representation learning | Unknown | N/A | |
| RLE: A Unified Perspective of Data Augmentation for Cross-Spectral Re-Identification | Unknown | N/A | |
| Time Makes Space: Emergence of Place Fields in Networks Encoding Temporally Continuous Sensory Experiences | Unknown | N/A | |
| Achieving Constant Regret in Linear Markov Decision Processes | Unknown | N/A | |
| MMSite: A Multi-modal Framework for the Identification of Active Sites in Proteins | Unknown | N/A | |
| Inevitable Trade-off between Watermark Strength and Speculative Sampling Efficiency for Language Models | Unknown | N/A | |
| realSEUDO for real-time calcium imaging analysis | Unknown | N/A | |
| Online Non-convex Learning in Dynamic Environments | Unknown | N/A | |
| Questioning the Survey Responses of Large Language Models | Unknown | N/A | |
| GaussianCube: A Structured and Explicit Radiance Representation for 3D Generative Modeling | Unknown | N/A | |
| Differentially Private Set Representations | Unknown | N/A | |
| Exploratory Retrieval-Augmented Planning For Continual Embodied Instruction Following | Unknown | N/A | |
| How do Large Language Models Handle Multilingualism? | Unknown | N/A | |
| Color-Oriented Redundancy Reduction in Dataset Distillation | Unknown | N/A | |
| Selective Attention: Enhancing Transformer through Principled Context Control | Unknown | N/A | |
| Trade-Offs of Diagonal Fisher Information Matrix Estimators | Unknown | N/A | |
| CONTRAST: Continual Multi-source Adaptation to Dynamic Distributions | Unknown | N/A | |
| OccFusion: Rendering Occluded Humans with Generative Diffusion Priors | Unknown | N/A | |
| Unlocking the Capabilities of Masked Generative Models for Image Synthesis via Self-Guidance | Unknown | N/A | |
| DEX: Data Channel Extension for Efficient CNN Inference on Tiny AI Accelerators | Unknown | N/A | |
| Barely Random Algorithms and Collective Metrical Task Systems | Unknown | N/A | |
| Bias Amplification in Language Model Evolution: An Iterated Learning Perspective | Unknown | N/A | |
| SSA-Seg: Semantic and Spatial Adaptive Pixel-level Classifier for Semantic Segmentation | Unknown | N/A | |
| Even Sparser Graph Transformers | Unknown | N/A | |
| Visual Sketchpad: Sketching as a Visual Chain of Thought for Multimodal Language Models | Unknown | N/A | |
| Improved Sample Complexity for Multiclass PAC Learning | Unknown | N/A | |
| Stochastic Taylor Derivative Estimator: Efficient amortization for arbitrary differential operators | Unknown | N/A | |
| SmallToLarge (S2L): Scalable Data Selection for Fine-tuning Large Language Models by Summarizing Training Trajectories of Small Models | Unknown | N/A | |
| Changing the Training Data Distribution to Reduce Simplicity Bias Improves In-distribution Generalization | Unknown | N/A | |
| Accelerating Greedy Coordinate Gradient and General Prompt Optimization via Probe Sampling | Unknown | N/A | |
| REBORN: Reinforcement-Learned Boundary Segmentation with Iterative Training for Unsupervised ASR | Unknown | N/A | |
| Conditioning non-linear and infinite-dimensional diffusion processes | Unknown | N/A | |
| Long-form factuality in large language models | Unknown | N/A | |
| Scale Equivariant Graph Metanetworks | Unknown | N/A | |
| Shape analysis for time series | Unknown | N/A | |
| Provable Acceleration of Nesterov's Accelerated Gradient for Asymmetric Matrix Factorization and Linear Neural Networks | Unknown | N/A | |
| GraphMorph: Tubular Structure Extraction by Morphing Predicted Graphs | Unknown | N/A | |
| SOFTS: Efficient Multivariate Time Series Forecasting with Series-Core Fusion | Unknown | N/A | |
| Breaking the curse of dimensionality in structured density estimation | Unknown | N/A | |
| Motion Consistency Model: Accelerating Video Diffusion with Disentangled Motion-Appearance Distillation | Unknown | N/A | |
| An Efficient Recipe for Long Context Extension via Middle-Focused Positional Encoding | Unknown | N/A | |
| Analysing Multi-Task Regression via Random Matrix Theory with Application to Time Series Forecasting | Unknown | N/A | |
| BiScope: AI-generated Text Detection by Checking Memorization of Preceding Tokens | Unknown | N/A | |
| Protein-Nucleic Acid Complex Modeling with Frame Averaging Transformer | Unknown | N/A | |
| Contrastive dimension reduction: when and how? | Unknown | N/A | |
| Large Pre-trained time series models for cross-domain Time series analysis tasks | Unknown | N/A | |
| FSP-Laplace: Function-Space Priors for the Laplace Approximation in Bayesian Deep Learning | Unknown | N/A | |
| Improved Particle Approximation Error for Mean Field Neural Networks | Unknown | N/A | |
| Sharpness-diversity tradeoff: improving flat ensembles with SharpBalance | Unknown | N/A | |
| SemCoder: Training Code Language Models with Comprehensive Semantics Reasoning | Unknown | N/A | |
| LLMs Can Evolve Continually on Modality for $\mathbb{X}$-Modal Reasoning | Unknown | N/A | |
| Deep Homomorphism Networks | Unknown | N/A | |
| Bridge-IF: Learning Inverse Protein Folding with Markov Bridges | Unknown | N/A | |
| Bridging Multicalibration and Out-of-distribution Generalization Beyond Covariate Shift | Unknown | N/A | |
| Rejection via Learning Density Ratios | Unknown | N/A | |
| Introducing Spectral Attention for Long-Range Dependency in Time Series Forecasting | Unknown | N/A | |
| Similarity-Navigated Conformal Prediction for Graph Neural Networks | Unknown | N/A | |
| Treeffuser: probabilistic prediction via conditional diffusions with gradient-boosted trees | Unknown | N/A | |
| Discrete Flow Matching | Unknown | N/A | |
| On the Power of Decision Trees in Auto-Regressive Language Modeling | Unknown | N/A | |
| ReVideo: Remake a Video with Motion and Content Control | Unknown | N/A | |
| Navigating Chemical Space with Latent Flows | Unknown | N/A | |
| Improving Viewpoint-Independent Object-Centric Representations through Active Viewpoint Selection | Unknown | N/A | |
| Higher-Rank Irreducible Cartesian Tensors for Equivariant Message Passing | Unknown | N/A | |
| Towards Learning Group-Equivariant Features for Domain Adaptive 3D Detection | Unknown | N/A | |
| Parameter Competition Balancing for Model Merging | Unknown | N/A | |
| Gorilla: Large Language Model Connected with Massive APIs | Unknown | N/A | |
| A Simple yet Universal Framework for Depth Completion | Unknown | N/A | |
| Learnability of high-dimensional targets by two-parameter models and gradient flow | Unknown | N/A | |
| Sparse-view Pose Estimation and Reconstruction via Analysis by Generative Synthesis | Unknown | N/A | |
| Active Sequential Posterior Estimation for Sample-Efficient Simulation-Based Inference | Unknown | N/A | |
| KV Cache is 1 Bit Per Channel: Efficient Large Language Model Inference with Coupled Quantization | Unknown | N/A | |
| The surprising efficiency of temporal difference learning for rare event prediction | Unknown | N/A | |
| Interpret Your Decision: Logical Reasoning Regularization for Generalization in Visual Classification | Unknown | N/A | |
| Human Expertise in Algorithmic Prediction | Unknown | N/A | |
| Self-Play Fine-tuning of Diffusion Models for Text-to-image Generation | Unknown | N/A | |
| Are nuclear masks all you need for improved out-of-domain generalisation? A closer look at cancer classification in histopathology | Unknown | N/A | |
| Bounds for the smallest eigenvalue of the NTK for arbitrary spherical data of arbitrary dimension | Unknown | N/A | |
| NeuralClothSim: Neural Deformation Fields Meet the Thin Shell Theory | Unknown | N/A | |
| SparseLLM: Towards Global Pruning of Pre-trained Language Models | Unknown | N/A | |
| Smoothie: Label Free Language Model Routing | Unknown | N/A | |
| Finding Transformer Circuits With Edge Pruning | Unknown | N/A | |
| Learning Better Representations From Less Data For Propositional Satisfiability | Unknown | N/A | |
| Effective Rank Analysis and Regularization for Enhanced 3D Gaussian Splatting | Unknown | N/A | |
| Semidefinite Relaxations of the Gromov-Wasserstein Distance | Unknown | N/A | |
| Stability and Generalization of Adversarial Training for Shallow Neural Networks with Smooth Activation | Unknown | N/A | |
| Bigger, Regularized, Optimistic: scaling for compute and sample efficient continuous control | Unknown | N/A | |
| ReF-LDM: A Latent Diffusion Model for Reference-based Face Image Restoration | Unknown | N/A | |
| Fast Rates in Stochastic Online Convex Optimization by Exploiting the Curvature of Feasible Sets | Unknown | N/A | |
| Asymptotics of Alpha-Divergence Variational Inference Algorithms with Exponential Families | Unknown | N/A | |
| Probabilistic Conformal Distillation for Enhancing Missing Modality Robustness | Unknown | N/A | |
| Model Based Inference of Synaptic Plasticity Rules | Unknown | N/A | |
| Data Augmentation with Diffusion for Open-Set Semi-Supervised Learning | Unknown | N/A | |
| Most Influential Subset Selection: Challenges, Promises, and Beyond | Unknown | N/A | |
| Score Distillation via Reparametrized DDIM | Unknown | N/A | |
| Direct3D: Scalable Image-to-3D Generation via 3D Latent Diffusion Transformer | Unknown | N/A | |
| FouRA: Fourier Low-Rank Adaptation | Unknown | N/A | |
| The Fairness-Quality Tradeoff in Clustering | Unknown | N/A | |
| Reverse Transition Kernel: A Flexible Framework to Accelerate Diffusion Inference | Unknown | N/A | |
| AnyFit: Controllable Virtual Try-on for Any Combination of Attire Across Any Scenario | Unknown | N/A | |
| MDAgents: An Adaptive Collaboration of LLMs for Medical Decision-Making | Unknown | N/A | |
| Data-Efficient Learning with Neural Programs | Unknown | N/A | |
| Modeling Latent Neural Dynamics with Gaussian Process Switching Linear Dynamical Systems | Unknown | N/A | |
| Equivariant Blurring Diffusion for Hierarchical Molecular Conformer Generation | Unknown | N/A | |
| Language Model as Visual Explainer | Unknown | N/A | |
| No Representation, No Trust: Connecting Representation, Collapse, and Trust Issues in PPO | Unknown | N/A | |
| REBEL: Reinforcement Learning via Regressing Relative Rewards | Unknown | N/A | |
| Synthesize, Partition, then Adapt: Eliciting Diverse Samples from Foundation Models | Unknown | N/A | |
| Efficient multi-prompt evaluation of LLMs | Unknown | N/A | |
| Neural Network Reparametrization for Accelerated Optimization in Molecular Simulations | Unknown | N/A | |
| Decision-Focused Learning with Directional Gradients | Unknown | N/A | |
| Graph Diffusion Policy Optimization | Unknown | N/A | |
| Learning to compute Gröbner bases | Unknown | N/A | |
| Partial Transportability for Domain Generalization | Unknown | N/A | |
| Robust Mixture Learning when Outliers Overwhelm Small Groups | Unknown | N/A | |
| How does Architecture Influence the Base Capabilities of Pre-trained Language Models? A Case Study Based on FFN-Wider and MoE Transformers | Unknown | N/A | |
| Improving self-training under distribution shifts via anchored confidence with theoretical guarantees | Unknown | N/A | |
| Do Finetti: On Causal Effects for Exchangeable Data | Unknown | N/A | |
| DEPrune: Depth-wise Separable Convolution Pruning for Maximizing GPU Parallelism | Unknown | N/A | |
| Class Distribution Shifts in Zero-Shot Learning: Learning Robust Representations | Unknown | N/A | |
| DiffPO: A causal diffusion model for learning distributions of potential outcomes | Unknown | N/A | |
| Teach Better or Show Smarter? On Instructions and Exemplars in Automatic Prompt Optimization | Unknown | N/A | |
| Aligning LLM Agents by Learning Latent Preference from User Edits | Unknown | N/A | |
| Provably Efficient Interactive-Grounded Learning with Personalized Reward | Unknown | N/A | |
| Structured Unrestricted-Rank Matrices for Parameter Efficient Finetuning | Unknown | N/A | |
| The Prevalence of Neural Collapse in Neural Multivariate Regression | Unknown | N/A | |
| In-Context Learning of a Linear Transformer Block: Benefits of the MLP Component and One-Step GD Initialization | Unknown | N/A | |
| GraphTrail: Translating GNN Predictions into Human-Interpretable Logical Rules | Unknown | N/A | |
| S-STE: Continuous Pruning Function for Efficient 2:4 Sparse Pre-training | Unknown | N/A | |
| Slot State Space Models | Unknown | N/A | |
| Online Consistency of the Nearest Neighbor Rule | Unknown | N/A | |
| Metacognitive Capabilities of LLMs: An Exploration in Mathematical Problem Solving | Unknown | N/A | |
| Fully Explicit Dynamic Gaussian Splatting | Unknown | N/A | |
| Interpretable Generalized Additive Models for Datasets with Missing Values | Unknown | N/A | |
| Continual Learning with Global Alignment | Unknown | N/A | |
| MemoryFormer: Minimize Transformer Computation by Removing Fully-Connected Layers | Unknown | N/A | |
| Time-Constrained Robust MDPs | Unknown | N/A | |
| Randomized algorithms and PAC bounds for inverse reinforcement learning in continuous spaces | Unknown | N/A | |
| Input-to-State Stable Coupled Oscillator Networks for Closed-form Model-based Control in Latent Space | Unknown | N/A | |
| Adaptive Randomized Smoothing: Certified Adversarial Robustness for Multi-Step Defences | Unknown | N/A | |
| Diffusion Models are Certifiably Robust Classifiers | Unknown | N/A | |
| Smoothed Energy Guidance: Guiding Diffusion Models with Reduced Energy Curvature of Attention | Unknown | N/A | |
| Causal Imitation for Markov Decision Processes: a Partial Identification Approach | Unknown | N/A | |
| Noisy Label Learning with Instance-Dependent Outliers: Identifiability via Crowd Wisdom | Unknown | N/A | |
| Symmetries in Overparametrized Neural Networks: A Mean Field View | Unknown | N/A | |
| Identifying Causal Effects Under Functional Dependencies | Unknown | N/A | |
| MACM: Utilizing a Multi-Agent System for Condition Mining in Solving Complex Mathematical Problems | Unknown | N/A | |
| Unlocking the Potential of Global Human Expertise | Unknown | N/A | |
| Adaptive Exploration for Data-Efficient General Value Function Evaluations | Unknown | N/A | |
| Recursive Introspection: Teaching Language Model Agents How to Self-Improve | Unknown | N/A | |
| Overcoming the Sim-to-Real Gap: Leveraging Simulation to Learn to Explore for Real-World RL | Unknown | N/A | |
| Accelerating Pre-training of Multimodal LLMs via Chain-of-Sight | Unknown | N/A | |
| Are Large-scale Soft Labels Necessary for Large-scale Dataset Distillation? | Unknown | N/A | |
| Bayesian Adaptive Calibration and Optimal Design | Unknown | N/A | |
| Contrastive losses as generalized models of global epistasis | Unknown | N/A | |
| Learning Linear Causal Representations from General Environments: Identifiability and Intrinsic Ambiguity | Unknown | N/A | |
| Consistency of Neural Causal Partial Identification | Unknown | N/A | |
| Chimera: Effectively Modeling Multivariate Time Series with 2-Dimensional State Space Models | Unknown | N/A | |
| Revisiting Ensembling in One-Shot Federated Learning | Unknown | N/A | |
| Foundations of Multivariate Distributional Reinforcement Learning | Unknown | N/A | |
| Generative Fractional Diffusion Models | Unknown | N/A | |
| Reward Machines for Deep RL in Noisy and Uncertain Environments | Unknown | N/A | |
| Linear Time Approximation Algorithm for Column Subset Selection with Local Search | Unknown | N/A | |
| Activation Map Compression through Tensor Decomposition for Deep Learning | Unknown | N/A | |
| Poisson Variational Autoencoder | Unknown | N/A | |
| SOI: Scaling Down Computational Complexity by Estimating Partial States of the Model | Unknown | N/A | |
| Provably Optimal Memory Capacity for Modern Hopfield Models: Transformer-Compatible Dense Associative Memories as Spherical Codes | Unknown | N/A | |
| To Learn or Not to Learn, That is the Question — A Feature-Task Dual Learning Model of Perceptual Learning | Unknown | N/A | |
| Discovering Creative Behaviors through DUPLEX: Diverse Universal Features for Policy Exploration | Unknown | N/A | |
| Universal Sample Coding | Unknown | N/A | |
| Watermarking Makes Language Models Radioactive | Unknown | N/A | |
| MVSplat360: Feed-Forward 360 Scene Synthesis from Sparse Views | Unknown | N/A | |
| Learnability Matters: Active Learning for Video Captioning | Unknown | N/A | |
| AdvAD: Exploring Non-Parametric Diffusion for Imperceptible Adversarial Attacks | Unknown | N/A | |
| Self-Guiding Exploration for Combinatorial Problems | Unknown | N/A | |
| First-Explore, then Exploit: Meta-Learning to Solve Hard Exploration-Exploitation Trade-Offs | Unknown | N/A | |
| Policy Learning from Tutorial Books via Understanding, Rehearsing and Introspecting | Unknown | N/A | |
| Testing Calibration in Nearly-Linear Time | Unknown | N/A | |
| Learning to Discuss Strategically: A Case Study on One Night Ultimate Werewolf | Unknown | N/A | |
| Learning the Expected Core of Strictly Convex Stochastic Cooperative Games | Unknown | N/A | |
| ChronoEpilogi: Scalable Time Series Selection with Multiple Solutions | Unknown | N/A | |
| Learning Goal-Conditioned Representations for Language Reward Models | Unknown | N/A | |
| Linear Transformers are Versatile In-Context Learners | Unknown | N/A | |
| Efficient Multi-task Reinforcement Learning with Cross-Task Policy Guidance | Unknown | N/A | |
| Are Language Models Actually Useful for Time Series Forecasting? | Unknown | N/A | |
| Stress-Testing Capability Elicitation With Password-Locked Models | Unknown | N/A | |
| Cell ontology guided transcriptome foundation model | Unknown | N/A | |
| Distributionally Robust Performative Prediction | Unknown | N/A | |
| Optimal Multiclass U-Calibration Error and Beyond | Unknown | N/A | |
| Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations | Unknown | N/A | |
| The Implicit Bias of Gradient Descent on Separable Multiclass Data | Unknown | N/A | |
| TrackIME: Enhanced Video Point Tracking via Instance Motion Estimation | Unknown | N/A | |
| Robust Gaussian Processes via Relevance Pursuit | Unknown | N/A | |
| Understanding and Minimising Outlier Features in Transformer Training | Unknown | N/A | |
| Near-Optimal Distributionally Robust Reinforcement Learning with General $L_p$ Norms | Unknown | N/A | |
| COLD: Causal reasOning in cLosed Daily activities | Unknown | N/A | |
| Neuc-MDS: Non-Euclidean Multidimensional Scaling Through Bilinear Forms | Unknown | N/A | |
| Online Budgeted Matching with General Bids | Unknown | N/A | |
| Loki: Low-rank Keys for Efficient Sparse Attention | Unknown | N/A | |
| A Metalearned Neural Circuit for Nonparametric Bayesian Inference | Unknown | N/A | |
| Equivariant Machine Learning on Graphs with Nonlinear Spectral Filters | Unknown | N/A | |
| Unelicitable Backdoors via Cryptographic Transformer Circuits | Unknown | N/A | |
| Secret Collusion among AI Agents: Multi-Agent Deception via Steganography | Unknown | N/A | |
| Computerized Adaptive Testing via Collaborative Ranking | Unknown | N/A | |
| LESS: Label-Efficient and Single-Stage Referring 3D Segmentation | Unknown | N/A | |
| Mixture of Nested Experts: Adaptive Processing of Visual Tokens | Unknown | N/A | |
| Skinned Motion Retargeting with Dense Geometric Interaction Perception | Unknown | N/A | |
| Handling Learnwares from Heterogeneous Feature Spaces with Explicit Label Exploitation | Unknown | N/A | |
| Sample Complexity of Interventional Causal Representation Learning | Unknown | N/A | |
| Learning to Reason Iteratively and Parallelly for Complex Visual Reasoning Scenarios | Unknown | N/A | |
| Categorical Flow Matching on Statistical Manifolds | Unknown | N/A | |
| Queueing Matching Bandits with Preference Feedback | Unknown | N/A | |
| Semantic Density: Uncertainty Quantification for Large Language Models through Confidence Measurement in Semantic Space | Unknown | N/A | |
| Zero-to-Hero: Enhancing Zero-Shot Novel View Synthesis via Attention Map Filtering | Unknown | N/A | |
| DiffuLT: Diffusion for Long-tail Recognition Without External Knowledge | Unknown | N/A | |
| Towards Calibrated Robust Fine-Tuning of Vision-Language Models | Unknown | N/A | |
| SceneDiffuser: Efficient and Controllable Driving Simulation Initialization and Rollout | Unknown | N/A | |
| DarkSAM: Fooling Segment Anything Model to Segment Nothing | Unknown | N/A | |
| Exact Gradients for Stochastic Spiking Neural Networks Driven by Rough Signals | Unknown | N/A | |
| Schrodinger Bridge Flow for Unpaired Data Translation | Unknown | N/A | |
| SimPO: Simple Preference Optimization with a Reference-Free Reward | Unknown | N/A | |
| Unlearnable 3D Point Clouds: Class-wise Transformation Is All You Need | Unknown | N/A | |
| The Representation Landscape of Few-Shot Learning and Fine-Tuning in Large Language Models | Unknown | N/A | |
| Medformer: A Multi-Granularity Patching Transformer for Medical Time-Series Classification | Unknown | N/A | |
| Learning Diffusion Priors from Observations by Expectation Maximization | Unknown | N/A | |
| Thompson Sampling For Combinatorial Bandits: Polynomial Regret and Mismatched Sampling Paradox | Unknown | N/A | |
| Who Evaluates the Evaluations? Objectively Scoring Text-to-Image Prompt Coherence Metrics with T2IScoreScore (TS2) | Unknown | N/A | |
| On Learning Multi-Modal Forgery Representation for Diffusion Generated Video Detection | Unknown | N/A | |
| On the Efficiency of ERM in Feature Learning | Unknown | N/A | |
| Policy Aggregation | Unknown | N/A | |
| Face2QR: A Unified Framework for Aesthetic, Face-Preserving, and Scannable QR Code Generation | Unknown | N/A | |
| Multivariate Stochastic Dominance via Optimal Transport and Applications to Models Benchmarking | Unknown | N/A | |
| Diffusion for World Modeling: Visual Details Matter in Atari | Unknown | N/A | |
| WeiPer: OOD Detection using Weight Perturbations of Class Projections | Unknown | N/A | |
| Facilitating Multimodal Classification via Dynamically Learning Modality Gap | Unknown | N/A | |
| Mining and Transferring Feature-Geometry Coherence for Unsupervised Point Cloud Registration | Unknown | N/A | |
| Universal Physics Transformers: A Framework For Efficiently Scaling Neural Operators | Unknown | N/A | |
| AlterMOMA: Fusion Redundancy Pruning for Camera-LiDAR Fusion Models with Alternative Modality Masking | Unknown | N/A | |
| CountGD: Multi-Modal Open-World Counting | Unknown | N/A | |
| The Dormant Neuron Phenomenon in Multi-Agent Reinforcement Learning Value Factorization | Unknown | N/A | |
| When is an Embedding Model More Promising than Another? | Unknown | N/A | |
| Reinforcing LLM Agents via Policy Optimization with Action Decomposition | Unknown | N/A | |
| UDC: A Unified Neural Divide-and-Conquer Framework for Large-Scale Combinatorial Optimization Problems | Unknown | N/A | |
| Extending Video Masked Autoencoders to 128 frames | Unknown | N/A | |
| Spectral Learning of Shared Dynamics Between Generalized-Linear Processes | Unknown | N/A | |
| Almost-Linear RNNs Yield Highly Interpretable Symbolic Codes in Dynamical Systems Reconstruction | Unknown | N/A | |
| xLSTM: Extended Long Short-Term Memory | Unknown | N/A | |
| An Analytical Study of Utility Functions in Multi-Objective Reinforcement Learning | Unknown | N/A | |
| Over-parameterized Student Model via Tensor Decomposition Boosted Knowledge Distillation | Unknown | N/A | |
| Towards Efficient and Optimal Covariance-Adaptive Algorithms for Combinatorial Semi-Bandits | Unknown | N/A | |
| Kermut: Composite kernel regression for protein variant effects | Unknown | N/A | |
| Factorized Diffusion Architectures for Unsupervised Image Generation and Segmentation | Unknown | N/A | |
| $SE(3)$ Equivariant Ray Embeddings for Implicit Multi-View Depth Estimation | Unknown | N/A | |
| On the Minimax Regret for Contextual Linear Bandits and Multi-Armed Bandits with Expert Advice | Unknown | N/A | |
| P$^2$C$^2$Net: PDE-Preserved Coarse Correction Network for efficient prediction of spatiotemporal dynamics | Unknown | N/A | |
| Accuracy is Not All You Need | Unknown | N/A | |
| CoSW: Conditional Sample Weighting for Smoke Segmentation with Label Noise | Unknown | N/A | |
| SWT-Bench: Testing and Validating Real-World Bug-Fixes with Code Agents | Unknown | N/A | |
| Exploiting LLM Quantization | Unknown | N/A | |
| Are Graph Neural Networks Optimal Approximation Algorithms? | Unknown | N/A | |
| Hierarchical Programmatic Option Framework | Unknown | N/A | |
| Sequential Signal Mixing Aggregation for Message Passing Graph Neural Networks | Unknown | N/A | |
| Gradient Methods for Online DR-Submodular Maximization with Stochastic Long-Term Constraints | Unknown | N/A | |
| SGD vs GD: Rank Deficiency in Linear Networks | Unknown | N/A | |
| Physics-Informed Variational State-Space Gaussian Processes | Unknown | N/A | |
| Coherent 3D Scene Diffusion From a Single RGB Image | Unknown | N/A | |
| Multi-Agent Domain Calibration with a Handful of Offline Data | Unknown | N/A | |
| Probabilistic size-and-shape functional mixed models | Unknown | N/A | |
| Soft Superpixel Neighborhood Attention | Unknown | N/A | |
| FINALLY: fast and universal speech enhancement with studio-like quality | Unknown | N/A | |
| Active preference learning for ordering items in- and out-of-sample | Unknown | N/A | |
| You Don’t Need Domain-Specific Data Augmentations When Scaling Self-Supervised Learning | Unknown | N/A | |
| Principled Probabilistic Imaging using Diffusion Models as Plug-and-Play Priors | Unknown | N/A | |
| Constrained Diffusion Models via Dual Training | Unknown | N/A | |
| The Many Faces of Optimal Weak-to-Strong Learning | Unknown | N/A | |
| Towards Safe Concept Transfer of Multi-Modal Diffusion via Causal Representation Editing | Unknown | N/A | |
| Diffusion of Thought: Chain-of-Thought Reasoning in Diffusion Language Models | Unknown | N/A | |
| HEALNet: Multimodal Fusion for Heterogeneous Biomedical Data | Unknown | N/A | |
| Harnessing Multiple Correlated Networks for Exact Community Recovery | Unknown | N/A | |
| HYDRA: Model Factorization Framework for Black-Box LLM Personalization | Unknown | N/A | |
| Lorentz-Equivariant Geometric Algebra Transformers for High-Energy Physics | Unknown | N/A | |
| MTGS: A Novel Framework for Multi-Person Temporal Gaze Following and Social Gaze Prediction | Unknown | N/A | |
| Be like a Goldfish, Don't Memorize! Mitigating Memorization in Generative LLMs | Unknown | N/A | |
| MetaCURL: Non-stationary Concave Utility Reinforcement Learning | Unknown | N/A | |
| YouDream: Generating Anatomically Controllable Consistent Text-to-3D Animals | Unknown | N/A | |
| Amortized Eigendecomposition for Neural Networks | Unknown | N/A | |
| What do Graph Neural Networks learn? Insights from Tropical Geometry | Unknown | N/A | |
| Diffusion Twigs with Loop Guidance for Conditional Graph Generation | Unknown | N/A | |
| Moving Off-the-Grid: Scene-Grounded Video Representations | Unknown | N/A | |
| Be Confident in What You Know: Bayesian Parameter Efficient Fine-Tuning of Vision Foundation Models | Unknown | N/A | |
| Certified Adversarial Robustness via Randomized $\alpha$-Smoothing for Regression Models | Unknown | N/A | |
| Interventional Causal Discovery in a Mixture of DAGs | Unknown | N/A | |
| Virtual Scanning: Unsupervised Non-line-of-sight Imaging from Irregularly Undersampled Transients | Unknown | N/A | |
| On the Limitations of Fractal Dimension as a Measure of Generalization | Unknown | N/A | |
| Memorize What Matters: Emergent Scene Decomposition from Multitraverse | Unknown | N/A | |
| Unified Covariate Adjustment for Causal Inference | Unknown | N/A | |
| Model-free Low-Rank Reinforcement Learning via Leveraged Entry-wise Matrix Estimation | Unknown | N/A | |
| Hamba: Single-view 3D Hand Reconstruction with Graph-guided Bi-Scanning Mamba | Unknown | N/A | |
| Local Anti-Concentration Class: Logarithmic Regret for Greedy Linear Contextual Bandit | Unknown | N/A | |
| Designing Cell-Type-Specific Promoter Sequences Using Conservative Model-Based Optimization | Unknown | N/A | |
| Flexible task abstractions emerge in linear networks with fast and bounded units | Unknown | N/A | |
| What Makes and Breaks Safety Fine-tuning? A Mechanistic Study | Unknown | N/A | |
| Efficient Policy Evaluation Across Multiple Different Experimental Datasets | Unknown | N/A | |
| Complete Graphical Criterion for Sequential Covariate Adjustment in Causal Inference | Unknown | N/A | |
| On the Adversarial Robustness of Benjamini Hochberg | Unknown | N/A | |
| Optimal Parallelization of Boosting | Unknown | N/A | |
| From Linear to Linearizable Optimization: A Novel Framework with Applications to Stationary and Non-stationary DR-submodular Optimization | Unknown | N/A | |
| Global Lyapunov functions: a long-standing open problem in mathematics, with symbolic transformers | Unknown | N/A | |
| Multilinear Mixture of Experts: Scalable Expert Specialization through Factorization | Unknown | N/A | |
| Active Learning for Derivative-Based Global Sensitivity Analysis with Gaussian Processes | Unknown | N/A | |
| Graphcode: Learning from multiparameter persistent homology using graph neural networks | Unknown | N/A | |
| Sample-efficient Bayesian Optimisation Using Known Invariances | Unknown | N/A | |
| ProtGO: Function-Guided Protein Modeling for Unified Representation Learning | Unknown | N/A | |
| SHED: Shapley-Based Automated Dataset Refinement for Instruction Fine-Tuning | Unknown | N/A | |
| Attention Temperature Matters in ViT-Based Cross-Domain Few-Shot Learning | Unknown | N/A | |
| Improved Regret of Linear Ensemble Sampling | Unknown | N/A | |
| Utilizing Image Transforms and Diffusion Models for Generative Modeling of Short and Long Time Series | Unknown | N/A | |
| TripletCLIP: Improving Compositional Reasoning of CLIP via Synthetic Vision-Language Negatives | Unknown | N/A | |
| DiffLight: A Partial Rewards Conditioned Diffusion Model for Traffic Signal Control with Missing Data | Unknown | N/A | |
| Validating Climate Models with Spherical Convolutional Wasserstein Distance | Unknown | N/A | |
| Unifying Generation and Prediction on Graphs with Latent Graph Diffusion | Unknown | N/A | |
| Provably Transformers Harness Multi-Concept Word Semantics for Efficient In-Context Learning | Unknown | N/A | |
| Transformers on Markov data: Constant depth suffices | Unknown | N/A | |
| Simulation-Free Training of Neural ODEs on Paired Data | Unknown | N/A | |
| PhoCoLens: Photorealistic and Consistent Reconstruction in Lensless Imaging | Unknown | N/A | |
| Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision | Unknown | N/A | |
| Disentangling the Roles of Distinct Cell Classes with Cell-Type Dynamical Systems | Unknown | N/A | |
| Implicitly Guided Design with PropEn: Match your Data to Follow the Gradient | Unknown | N/A | |
| SceneCraft: Layout-Guided 3D Scene Generation | Unknown | N/A | |
| NeuralFluid: Neural Fluidic System Design and Control with Differentiable Simulation | Unknown | N/A | |
| AutoMix: Automatically Mixing Language Models | Unknown | N/A | |
| MeMo: Meaningful, Modular Controllers via Noise Injection | Unknown | N/A | |
| BAKU: An Efficient Transformer for Multi-Task Policy Learning | Unknown | N/A | |
| I Don't Know: Explicit Modeling of Uncertainty with an [IDK] Token | Unknown | N/A | |
| FasMe: Fast and Sample-efficient Meta Estimator for Precision Matrix Learning in Small Sample Settings | Unknown | N/A | |
| Plant-and-Steal: Truthful Fair Allocations via Predictions | Unknown | N/A | |
| Enhancing Efficiency of Safe Reinforcement Learning via Sample Manipulation | Unknown | N/A | |
| MultiOOD: Scaling Out-of-Distribution Detection for Multiple Modalities | Unknown | N/A | |
| Elliptical Attention | Unknown | N/A | |
| Wide Two-Layer Networks can Learn from Adversarial Perturbations | Unknown | N/A | |
| ZOPP: A Framework of Zero-shot Offboard Panoptic Perception for Autonomous Driving | Unknown | N/A | |
| EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models | Unknown | N/A | |
| Deterministic Policies for Constrained Reinforcement Learning in Polynomial Time | Unknown | N/A | |
| SPEAR: Exact Gradient Inversion of Batches in Federated Learning | Unknown | N/A | |
| Doing Experiments and Revising Rules with Natural Language and Probabilistic Reasoning | Unknown | N/A | |
| PPLNs: Parametric Piecewise Linear Networks for Event-Based Temporal Modeling and Beyond | Unknown | N/A | |
| Why Transformers Need Adam: A Hessian Perspective | Unknown | N/A | |
| Learning Neural Contracting Dynamics: Extended Linearization and Global Guarantees | Unknown | N/A | |
| Neural Model Checking | Unknown | N/A | |
| SleeperNets: Universal Backdoor Poisoning Attacks Against Reinforcement Learning Agents | Unknown | N/A | |
| Temporal Sentence Grounding with Relevance Feedback in Videos | Unknown | N/A | |
| Localized Zeroth-Order Prompt Optimization | Unknown | N/A | |
| UV-free Texture Generation with Denoising and Geodesic Heat Diffusion | Unknown | N/A | |
| Language Grounded Multi-agent Reinforcement Learning with Human-interpretable Communication | Unknown | N/A | |
| Utilizing Human Behavior Modeling to Manipulate Explanations in AI-Assisted Decision Making: The Good, the Bad, and the Scary | Unknown | N/A | |
| How to Continually Adapt Text-to-Image Diffusion Models for Flexible Customization? | Unknown | N/A | |
| Learning Transferable Features for Implicit Neural Representations | Unknown | N/A | |
| DiffTORI: Differentiable Trajectory Optimization for Deep Reinforcement and Imitation Learning | Unknown | N/A | |
| MotionTTT: 2D Test-Time-Training Motion Estimation for 3D Motion Corrected MRI | Unknown | N/A | |
| Monoculture in Matching Markets | Unknown | N/A | |
| Improving Environment Novelty Quantification for Effective Unsupervised Environment Design | Unknown | N/A | |
| TALoS: Enhancing Semantic Scene Completion via Test-time Adaptation on the Line of Sight | Unknown | N/A | |
| Consistency Purification: Effective and Efficient Diffusion Purification towards Certified Robustness | Unknown | N/A | |
| BackdoorAlign: Mitigating Fine-tuning based Jailbreak Attack with Backdoor Enhanced Safety Alignment | Unknown | N/A | |
| Euclidean distance compression via deep random features | Unknown | N/A | |
| Tight Bounds for Learning RUMs from Small Slates | Unknown | N/A | |
| Abductive Reasoning in Logical Credal Networks | Unknown | N/A | |
| Jailbreaking Large Language Models Against Moderation Guardrails via Cipher Characters | Unknown | N/A | |
| On Statistical Rates and Provably Efficient Criteria of Latent Diffusion Transformers (DiTs) | Unknown | N/A | |
| DFA-GNN: Forward Learning of Graph Neural Networks by Direct Feedback Alignment | Unknown | N/A | |
| Understanding the Differences in Foundation Models: Attention, State Space Models, and Recurrent Neural Networks | Unknown | N/A | |
| Exploring the Precise Dynamics of Single-Layer GAN Models: Leveraging Multi-Feature Discriminators for High-Dimensional Subspace Learning | Unknown | N/A | |
| Kraken: Inherently Parallel Transformers For Efficient Multi-Device Inference | Unknown | N/A | |
| Structured flexibility in recurrent neural networks via neuromodulation | Unknown | N/A | |
| Scalable DP-SGD: Shuffling vs. Poisson Subsampling | Unknown | N/A | |
| Self-Healing Machine Learning: A Framework for Autonomous Adaptation in Real-World Environments | Unknown | N/A | |
| End-to-End Ontology Learning with Large Language Models | Unknown | N/A | |
| Large Language Models Must Be Taught to Know What They Don’t Know | Unknown | N/A | |
| Stabilizing Linear Passive-Aggressive Online Learning with Weighted Reservoir Sampling | Unknown | N/A | |
| Aligning Diffusion Models by Optimizing Human Utility | Unknown | N/A | |
| SpatialRGPT: Grounded Spatial Reasoning in Vision-Language Models | Unknown | N/A | |
| Distributional Successor Features Enable Zero-Shot Policy Optimization | Unknown | N/A | |
| Sample-Efficient Private Learning of Mixtures of Gaussians | Unknown | N/A | |
| Referring Human Pose and Mask Estimation In the Wild | Unknown | N/A | |
| Unraveling the Gradient Descent Dynamics of Transformers | Unknown | N/A | |
| Improving Deep Reinforcement Learning by Reducing the Chain Effect of Value and Policy Churn | Unknown | N/A | |
| Simplifying Constraint Inference with Inverse Reinforcement Learning | Unknown | N/A | |
| GSGAN: Adversarial Learning for Hierarchical Generation of 3D Gaussian Splats | Unknown | N/A | |
| Rethinking Inverse Reinforcement Learning: from Data Alignment to Task Alignment | Unknown | N/A | |
| MotionGS: Exploring Explicit Motion Guidance for Deformable 3D Gaussian Splatting | Unknown | N/A | |
| Nonparametric Evaluation of Noisy ICA Solutions | Unknown | N/A | |
| Transfer Learning for Latent Variable Network Models | Unknown | N/A | |
| On Differentially Private U Statistics | Unknown | N/A | |
| Your contrastive learning problem is secretly a distribution alignment problem | Unknown | N/A | |
| DiffuBox: Refining 3D Object Detection with Point Diffusion | Unknown | N/A | |
| GOMAA-Geo: GOal Modality Agnostic Active Geo-localization | Unknown | N/A | |
| MILP-StuDio: MILP Instance Generation via Block Structure Decomposition | Unknown | N/A | |
| Uncertainty-based Offline Variational Bayesian Reinforcement Learning for Robustness under Diverse Data Corruptions | Unknown | N/A | |
| No Free Lunch Theorem and Black-Box Complexity Analysis for Adversarial Optimisation | Unknown | N/A | |
| Inferring Neural Signed Distance Functions by Overfitting on Single Noisy Point Clouds through Finetuning Data-Driven based Priors | Unknown | N/A | |
| A Simple Framework for Generalization in Visual RL under Dynamic Scene Perturbations | Unknown | N/A | |
| xRAG: Extreme Context Compression for Retrieval-augmented Generation with One Token | Unknown | N/A | |
| Federated Ensemble-Directed Offline Reinforcement Learning | Unknown | N/A | |
| OSLO: One-Shot Label-Only Membership Inference Attacks | Unknown | N/A | |
| Optimal Top-Two Method for Best Arm Identification and Fluid Analysis | Unknown | N/A | |
| MVSDet: Multi-View Indoor 3D Object Detection via Efficient Plane Sweeps | Unknown | N/A | |
| X-Ray: A Sequential 3D Representation For Generation | Unknown | N/A | |
| Learning to Decouple the Lights for 3D Face Texture Modeling | Unknown | N/A | |
| Learning Group Actions on Latent Representations | Unknown | N/A | |
| Few-Shot Task Learning through Inverse Generative Modeling | Unknown | N/A | |
| Initialization is Critical to Whether Transformers Fit Composite Functions by Reasoning or Memorizing | Unknown | N/A | |
| Physically Compatible 3D Object Modeling from a Single Image | Unknown | N/A | |
| Emergence of Hidden Capabilities: Exploring Learning Dynamics in Concept Space | Unknown | N/A | |
| Perceiving Longer Sequences With Bi-Directional Cross-Attention Transformers | Unknown | N/A | |
| Newton Losses: Using Curvature Information for Learning with Differentiable Algorithms | Unknown | N/A | |
| Learning to Cooperate with Humans using Generative Agents | Unknown | N/A | |
| LLM-Check: Investigating Detection of Hallucinations in Large Language Models | Unknown | N/A | |
| Constrained Synthesis with Projected Diffusion Models | Unknown | N/A | |
| Safe and Efficient: A Primal-Dual Method for Offline Convex CMDPs under Partial Data Coverage | Unknown | N/A | |
| KG-FIT: Knowledge Graph Fine-Tuning Upon Open-World Knowledge | Unknown | N/A | |
| Divergences between Language Models and Human Brains | Unknown | N/A | |
| ECLipsE: Efficient Compositional Lipschitz Constant Estimation for Deep Neural Networks | Unknown | N/A | |
| Exploiting Activation Sparsity with Dense to Dynamic-k Mixture-of-Experts Conversion | Unknown | N/A | |
| Gradient Rewiring for Editable Graph Neural Network Training | Unknown | N/A | |
| Alignment for Honesty | Unknown | N/A | |
| Dissect Black Box: Interpreting for Rule-Based Explanations in Unsupervised Anomaly Detection | Unknown | N/A | |
| MixEval: Deriving Wisdom of the Crowd from LLM Benchmark Mixtures | Unknown | N/A | |
| *Read-ME*: Refactorizing LLMs as Router-Decoupled Mixture of Experts with System Co-Design | Unknown | N/A | |
| Optimization Can Learn Johnson Lindenstrauss Embeddings | Unknown | N/A | |
| Heavy-Tailed Class Imbalance and Why Adam Outperforms Gradient Descent on Language Models | Unknown | N/A | |
| AR-Pro: Counterfactual Explanations for Anomaly Repair with Formal Properties | Unknown | N/A | |
| No Free Delivery Service: Epistemic limits of passive data collection in complex social systems | Unknown | N/A | |
| Optimal-state Dynamics Estimation for Physics-based Human Motion Capture from Videos | Unknown | N/A | |
| Retrieval & Fine-Tuning for In-Context Tabular Models | Unknown | N/A | |
| Synatra: Turning Indirect Knowledge into Direct Demonstrations for Digital Agents at Scale | Unknown | N/A | |
| CodeRosetta: Pushing the Boundaries of Unsupervised Code Translation for Parallel Programming | Unknown | N/A | |
| Disentangled Representation Learning in Non-Markovian Causal Systems | Unknown | N/A | |
| Capturing the denoising effect of PCA via compression ratio | Unknown | N/A | |
| Novel Object Synthesis via Adaptive Text-Image Harmony | Unknown | N/A | |
| Maia-2: A Unified Model for Human-AI Alignment in Chess | Unknown | N/A | |
| FewViewGS: Gaussian Splatting with Few View Matching and Multi-stage Training | Unknown | N/A | |
| Identifying Selections for Unsupervised Subtask Discovery | Unknown | N/A | |
| SAMPa: Sharpness-aware Minimization Parallelized | Unknown | N/A | |
| Normalization and effective learning rates in reinforcement learning | Unknown | N/A | |
| LCM: Locally Constrained Compact Point Cloud Model for Masked Point Modeling | Unknown | N/A | |
| Scale-invariant Optimal Sampling for Rare-events Data and Sparse Models | Unknown | N/A | |
| Gradient-Variation Online Learning under Generalized Smoothness | Unknown | N/A | |
| Off-policy estimation with adaptively collected data: the power of online learning | Unknown | N/A | |
| TrAct: Making First-layer Pre-Activations Trainable | Unknown | N/A | |
| Convolutional Differentiable Logic Gate Networks | Unknown | N/A | |
| PointMamba: A Simple State Space Model for Point Cloud Analysis | Unknown | N/A | |
| Light Unbalanced Optimal Transport | Unknown | N/A | |
| Scaling Law for Time Series Forecasting | Unknown | N/A | |
| DeTikZify: Synthesizing Graphics Programs for Scientific Figures and Sketches with TikZ | Unknown | N/A | |
| Splatter a Video: Video Gaussian Representation for Versatile Processing | Unknown | N/A | |
| Immiscible Diffusion: Accelerating Diffusion Training with Noise Assignment | Unknown | N/A | |
| Selective Generation for Controllable Language Models | Unknown | N/A | |
| Mimicking To Dominate: Imitation Learning Strategies for Success in Multiagent Games | Unknown | N/A | |
| Adaptive Depth Networks with Skippable Sub-Paths | Unknown | N/A | |
| Director3D: Real-world Camera Trajectory and 3D Scene Generation from Text | Unknown | N/A | |
| Fast Iterative Hard Thresholding Methods with Pruning Gradient Computations | Unknown | N/A | |
| Rethinking Human Evaluation Protocol for Text-to-Video Models: Enhancing Reliability, Reproducibility, and Practicality | Unknown | N/A | |
| Promoting Fairness Among Dynamic Agents in Online-Matching Markets under Known Stationary Arrival Distributions | Unknown | N/A | |
| Quantum algorithm for large-scale market equilibrium computation | Unknown | N/A | |
| On conditional diffusion models for PDE simulations | Unknown | N/A | |
| PSL: Rethinking and Improving Softmax Loss from Pairwise Perspective for Recommendation | Unknown | N/A | |
| When to Sense and Control? A Time-adaptive Approach for Continuous-Time RL | Unknown | N/A | |
| Federated Transformer: Multi-Party Vertical Federated Learning on Practical Fuzzily Linked Data | Unknown | N/A | |
| KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization | Unknown | N/A | |
| GFT: Graph Foundation Model with Transferable Tree Vocabulary | Unknown | N/A | |
| IntraMix: Intra-Class Mixup Generation for Accurate Labels and Neighbors | Unknown | N/A | |
| Adversarial Moment-Matching Distillation of Large Language Models | Unknown | N/A | |
| Provable Tempered Overfitting of Minimal Nets and Typical Nets | Unknown | N/A | |
| ScaleKD: Strong Vision Transformers Could Be Excellent Teachers | Unknown | N/A | |
| Can LLMs Learn by Teaching for Better Reasoning? A Preliminary Study | Unknown | N/A | |
| Non-parametric classification via expand-and-sparsify representation | Unknown | N/A | |
| Sourcerer: Sample-based Maximum Entropy Source Distribution Estimation | Unknown | N/A | |
| Learning Complete Protein Representation by Dynamically Coupling of Sequence and Structure | Unknown | N/A | |
| Neural collapse vs. low-rank bias: Is deep neural collapse really optimal? | Unknown | N/A | |
| ALPS: Improved Optimization for Highly Sparse One-Shot Pruning for Large Language Models | Unknown | N/A | |
| Accelerating Non-Maximum Suppression: A Graph Theory Perspective | Unknown | N/A | |
| Crafting Interpretable Embeddings for Language Neuroscience by Asking LLMs Questions | Unknown | N/A | |
| Latent Diffusion for Neural Spiking Data | Unknown | N/A | |
| Private and Personalized Frequency Estimation in a Federated Setting | Unknown | N/A | |
| Understanding the Expressive Power and Mechanisms of Transformer for Sequence Modeling | Unknown | N/A | |
| Confidence Regulation Neurons in Language Models | Unknown | N/A | |
| Accelerating Matroid Optimization through Fast Imprecise Oracles | Unknown | N/A | |
| Score-Optimal Diffusion Schedules | Unknown | N/A | |
| 3D Focusing-and-Matching Network for Multi-Instance Point Cloud Registration | Unknown | N/A | |
| TSDS: Data Selection for Task-Specific Model Finetuning | Unknown | N/A | |
| Maximizing utility in multi-agent environments by anticipating the behavior of other learners | Unknown | N/A | |
| Learning Noisy Halfspaces with a Margin: Massart is No Harder than Random | Unknown | N/A | |
| On the Stability and Generalization of Meta-Learning | Unknown | N/A | |
| Learning in Markov Games with Adaptive Adversaries: Policy Regret, Fundamental Barriers, and Efficient Algorithms | Unknown | N/A | |
| AdaNovo: Towards Robust *De Novo* Peptide Sequencing in Proteomics against Data Biases | Unknown | N/A | |
| Adversarially Robust Multi-task Representation Learning | Unknown | N/A | |
| Offline Multitask Representation Learning for Reinforcement Learning | Unknown | N/A | |
| No-Regret Learning for Fair Multi-Agent Social Welfare Optimization | Unknown | N/A | |
| Risk-Averse Fine-tuning of Large Language Models | Unknown | N/A | |
| Geometry-aware training of factorized layers in tensor Tucker format | Unknown | N/A | |
| Polynomial-Time Computation of Exact $\Phi$-Equilibria in Polyhedral Games | Unknown | N/A | |
| Unleashing Region Understanding in Intermediate Layers for MLLM-based Referring Expression Generation | Unknown | N/A | |
| A Unified Debiasing Approach for Vision-Language Models across Modalities and Tasks | Unknown | N/A | |
| RL on Incorrect Synthetic Data Scales the Efficiency of LLM Math Reasoning by Eight-Fold | Unknown | N/A | |
| Multi-hypotheses Conditioned Point Cloud Diffusion for 3D Human Reconstruction from Occluded Images | Unknown | N/A | |
| Uncertainty-aware Fine-tuning of Segmentation Foundation Models | Unknown | N/A | |
| MultiPull: Detailing Signed Distance Functions by Pulling Multi-Level Queries at Multi-Step | Unknown | N/A | |
| Episodic Future Thinking Mechanism for Multi-agent Reinforcement Learning | Unknown | N/A | |
| TextCtrl: Diffusion-based Scene Text Editing with Prior Guidance Control | Unknown | N/A | |
| Continual Learning in the Frequency Domain | Unknown | N/A | |
| Meta-Controller: Few-Shot Imitation of Unseen Embodiments and Tasks in Continuous Control | Unknown | N/A | |
| GL-NeRF: Gauss-Laguerre Quadrature Enables Training-Free NeRF Acceleration | Unknown | N/A | |
| Metric Transforms and Low Rank Representations of Kernels for Fast Attention | Unknown | N/A | |
| Sequential Decision Making with Expert Demonstrations under Unobserved Heterogeneity | Unknown | N/A | |
| Are Multiple Instance Learning Algorithms Learnable for Instances? | Unknown | N/A | |
| FactorSim: Generative Simulation via Factorized Representation | Unknown | N/A | |
| Autoregressive Image Generation without Vector Quantization | Unknown | N/A | |
| AgentPoison: Red-teaming LLM Agents via Poisoning Memory or Knowledge Bases | Unknown | N/A | |
| Data Free Backdoor Attacks | Unknown | N/A | |
| ConStat: Performance-Based Contamination Detection in Large Language Models | Unknown | N/A | |
| Graph Convolutions Enrich the Self-Attention in Transformers! | Unknown | N/A | |
| PureGen: Universal Data Purification for Train-Time Poison Defense via Generative Model Dynamics | Unknown | N/A | |
| Rainbow Teaming: Open-Ended Generation of Diverse Adversarial Prompts | Unknown | N/A | |
| LT-Defense: Searching-free Backdoor Defense via Exploiting the Long-tailed Effect | Unknown | N/A | |
| A Theoretical Understanding of Self-Correction through In-context Alignment | Unknown | N/A | |
| Training an Open-Vocabulary Monocular 3D Detection Model without 3D Data | Unknown | N/A | |
| Stratified Prediction-Powered Inference for Effective Hybrid Evaluation of Language Models | Unknown | N/A | |
| A Neural Network Approach for Efficiently Answering Most Probable Explanation Queries in Probabilistic Models | Unknown | N/A | |
| Proving Theorems Recursively | Unknown | N/A | |
| No Regrets: Investigating and Improving Regret Approximations for Curriculum Discovery | Unknown | N/A | |
| Probing the Decision Boundaries of In-context Learning in Large Language Models | Unknown | N/A | |
| 2D-OOB: Attributing Data Contribution Through Joint Valuation Framework | Unknown | N/A | |
| Provably Safe Neural Network Controllers via Differential Dynamic Logic | Unknown | N/A | |
| DiffusionBlend: Learning 3D Image Prior through Position-aware Diffusion Score Blending for 3D Computed Tomography Reconstruction | Unknown | N/A | |
| Unsupervised Discovery of Formulas for Mathematical Constants | Unknown | N/A | |
| Towards the Dynamics of a DNN Learning Symbolic Interactions | Unknown | N/A | |
| A Phase Transition between Positional and Semantic Learning in a Solvable Model of Dot-Product Attention | Unknown | N/A | |
| Coded Computing for Resilient Distributed Computing: A Learning-Theoretic Framework | Unknown | N/A | |
| Fast Channel Simulation via Error-Correcting Codes | Unknown | N/A | |
| Alignment at Pre-training! Towards Native Alignment for Arabic LLMs | Unknown | N/A | |
| Few-Shot Diffusion Models Escape the Curse of Dimensionality | Unknown | N/A | |
| The ALCHEmist: Automated Labeling 500x CHEaper than LLM Data Annotators | Unknown | N/A | |
| AMAGO-2: Breaking the Multi-Task Barrier in Meta-Reinforcement Learning with Transformers | Unknown | N/A | |
| Source Code Foundation Models are Transferable Binary Analysis Knowledge Bases | Unknown | N/A | |
| DMesh: A Differentiable Mesh Representation | Unknown | N/A | |
| SDP4Bit: Toward 4-bit Communication Quantization in Sharded Data Parallelism for LLM Training | Unknown | N/A | |
| FinCon: A Synthesized LLM Multi-Agent System with Conceptual Verbal Reinforcement for Enhanced Financial Decision Making | Unknown | N/A | |
| Understanding the Limits of Vision Language Models Through the Lens of the Binding Problem | Unknown | N/A | |
| bit2bit: 1-bit quanta video reconstruction via self-supervised photon prediction | Unknown | N/A | |
| Compact Proofs of Model Performance via Mechanistic Interpretability | Unknown | N/A | |
| Continual learning with the neural tangent ensemble | Unknown | N/A | |
| Do causal predictors generalize better to new domains? | Unknown | N/A | |
| Revisiting Score Propagation in Graph Out-of-Distribution Detection | Unknown | N/A | |
| Unrolled denoising networks provably learn to perform optimal Bayesian inference | Unknown | N/A | |
| DeNetDM: Debiasing by Network Depth Modulation | Unknown | N/A | |
| Ad Auctions for LLMs via Retrieval Augmented Generation | Unknown | N/A | |
| Diffusion4D: Fast Spatial-temporal Consistent 4D generation via Video Diffusion Models | Unknown | N/A | |
| User-Creator Feature Polarization in Recommender Systems with Dual Influence | Unknown | N/A | |
| Pard: Permutation-Invariant Autoregressive Diffusion for Graph Generation | Unknown | N/A | |
| KALM: Knowledgeable Agents by Offline Reinforcement Learning from Large Language Model Rollouts | Unknown | N/A | |
| High-Resolution Image Harmonization with Adaptive-Interval Color Transformation | Unknown | N/A | |
| Generating Origin-Destination Matrices in Neural Spatial Interaction Models | Unknown | N/A | |
| Contextual Active Model Selection | Unknown | N/A | |
| On the Identifiability of Poisson Branching Structural Causal Model Using Probability Generating Function | Unknown | N/A | |
| FedGTST: Boosting Global Transferability of Federated Models via Statistics Tuning | Unknown | N/A | |
| InterControl: Zero-shot Human Interaction Generation by Controlling Every Joint | Unknown | N/A | |
| QGFN: Controllable Greediness with Action Values | Unknown | N/A | |
| Learning Structured Representations with Hyperbolic Embeddings | Unknown | N/A | |
| Cost-aware Bayesian Optimization via the Pandora's Box Gittins Index | Unknown | N/A | |
| Generalization Analysis for Label-Specific Representation Learning | Unknown | N/A | |
| Imprecise Label Learning: A Unified Framework for Learning with Various Imprecise Label Configurations | Unknown | N/A | |
| Probabilistic Graph Rewiring via Virtual Nodes | Unknown | N/A | |
| A Bayesian Approach to Data Point Selection | Unknown | N/A | |
| Reawakening knowledge: Anticipatory recovery from catastrophic interference via structured training | Unknown | N/A | |
| Slight Corruption in Pre-training Data Makes Better Diffusion Models | Unknown | N/A | |
| Are Uncertainty Quantification Capabilities of Evidential Deep Learning a Mirage? | Unknown | N/A | |
| Enhancing Large Vision Language Models with Self-Training on Image Comprehension | Unknown | N/A | |
| A scalable generative model for dynamical system reconstruction from neuroimaging data | Unknown | N/A | |
| UniDSeg: Unified Cross-Domain 3D Semantic Segmentation via Visual Foundation Models Prior | Unknown | N/A | |
| AdjointDEIS: Efficient Gradients for Diffusion Models | Unknown | N/A | |
| MonkeySee: Space-time-resolved reconstructions of natural images from macaque multi-unit activity | Unknown | N/A | |
| Probabilistic Federated Prompt-Tuning with Non-IID and Imbalanced Data | Unknown | N/A | |
| Understanding Transformers via N-Gram Statistics | Unknown | N/A | |
| BricksRL: A Platform for Democratizing Robotics and Reinforcement Learning Research and Education with LEGO | Unknown | N/A | |
| Truth is Universal: Robust Detection of Lies in LLMs | Unknown | N/A | |
| How Far Can Transformers Reason? The Globality Barrier and Inductive Scratchpad | Unknown | N/A | |
| A Global Depth-Range-Free Multi-View Stereo Transformer Network with Pose Embedding | Unknown | N/A | |
| Analysing the Generalisation and Reliability of Steering Vectors | Unknown | N/A | |
| Trajectory Diffusion for ObjectGoal Navigation | Unknown | N/A | |
| Log-concave Sampling from a Convex Body with a Barrier: a Robust and Unified Dikin Walk | Unknown | N/A | |
| Unveiling Causal Reasoning in Large Language Models: Reality or Mirage? | Unknown | N/A | |
| Mixture of Tokens: Continuous MoE through Cross-Example Aggregation | Unknown | N/A | |
| Parametric model reduction of mean-field and stochastic systems via higher-order action matching | Unknown | N/A | |
| Attractor Memory for Long-Term Time Series Forecasting: A Chaos Perspective | Unknown | N/A | |
| Induced Model Matching: Restricted Models Help Train Full-Featured Models | Unknown | N/A | |
| Fast Best-of-N Decoding via Speculative Rejection | Unknown | N/A | |
| Boosting Sample Efficiency and Generalization in Multi-agent Reinforcement Learning via Equivariance | Unknown | N/A | |
| CultureLLM: Incorporating Cultural Differences into Large Language Models | Unknown | N/A | |
| CulturePark: Boosting Cross-cultural Understanding in Large Language Models | Unknown | N/A | |
| No-regret Learning in Harmonic Games: Extrapolation in the Face of Conflicting Interests | Unknown | N/A | |
| Multi-Object Hallucination in Vision Language Models | Unknown | N/A | |
| Bridge the Points: Graph-based Few-shot Segment Anything Semantically | Unknown | N/A | |
| Learning from Snapshots of Discrete and Continuous Data Streams | Unknown | N/A | |
| Structure Consistent Gaussian Splatting with Matching Prior for Few-shot Novel View Synthesis | Unknown | N/A | |
| Tangent Space Causal Inference: Leveraging Vector Fields for Causal Discovery in Dynamical Systems | Unknown | N/A | |
| The Impact of Geometric Complexity on Neural Collapse in Transfer Learning | Unknown | N/A | |
| Polyhedral Complex Derivation from Piecewise Trilinear Networks | Unknown | N/A | |
| The Road Less Scheduled | Unknown | N/A | |
| Directional Smoothness and Gradient Methods: Convergence and Adaptivity | Unknown | N/A | |
| Learning to be Smooth: An End-to-End Differentiable Particle Smoother | Unknown | N/A | |
| Multi-view Masked Contrastive Representation Learning for Endoscopic Video Analysis | Unknown | N/A | |
| CryoSPIN: Improving Ab-Initio Cryo-EM Reconstruction with Semi-Amortized Pose Inference | Unknown | N/A | |
| Catastrophic Goodhart: regularizing RLHF with KL divergence does not mitigate heavy-tailed reward misspecification | Unknown | N/A | |
| Mean-Field Analysis for Learning Subspace-Sparse Polynomials with Gaussian Input | Unknown | N/A | |
| ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search | Unknown | N/A | |
| RankUp: Boosting Semi-Supervised Regression with an Auxiliary Ranking Classifier | Unknown | N/A | |
| A Polar coordinate system represents syntax in large language models | Unknown | N/A | |
| FedNE: Surrogate-Assisted Federated Neighbor Embedding for Dimensionality Reduction | Unknown | N/A | |
| DreamMesh4D: Video-to-4D Generation with Sparse-Controlled Gaussian-Mesh Hybrid Representation | Unknown | N/A | |
| Implicit Regularization of Sharpness-Aware Minimization for Scale-Invariant Problems | Unknown | N/A | |
| Causal Deciphering and Inpainting in Spatio-Temporal Dynamics via Diffusion Model | Unknown | N/A | |
| L-TTA: Lightweight Test-Time Adaptation Using a Versatile Stem Layer | Unknown | N/A | |
| Motif-oriented influence maximization for viral marketing in large-scale social networks | Unknown | N/A | |
| Out-Of-Distribution Detection with Diversification (Provably) | Unknown | N/A | |
| FlashAttention-3: Fast and Accurate Attention with Asynchrony and Low-precision | Unknown | N/A | |
| The Edge-of-Reach Problem in Offline Model-Based Reinforcement Learning | Unknown | N/A | |
| An Efficient High-dimensional Gradient Estimator for Stochastic Differential Equations | Unknown | N/A | |
| SpeechForensics: Audio-Visual Speech Representation Learning for Face Forgery Detection | Unknown | N/A | |
| SongCreator: Lyrics-based Universal Song Generation | Unknown | N/A | |
| Doob's Lagrangian: A Sample-Efficient Variational Approach to Transition Path Sampling | Unknown | N/A | |
| The Star Geometry of Critic-Based Regularizer Learning | Unknown | N/A | |
| A Recipe for Charge Density Prediction | Unknown | N/A | |
| Neural Localizer Fields for Continuous 3D Human Pose and Shape Estimation | Unknown | N/A | |
| Reciprocal Reward Influence Encourages Cooperation From Self-Interested Agents | Unknown | N/A | |
| Unified Guidance for Geometry-Conditioned Molecular Generation | Unknown | N/A | |
| Federated Learning over Connected Modes | Unknown | N/A | |
| Zero-Shot Event-Intensity Asymmetric Stereo via Visual Prompting from Image Domain | Unknown | N/A | |
| B'MOJO: Hybrid State Space Realizations of Foundation Models with Eidetic and Fading Memory | Unknown | N/A | |
| How does PDE order affect the convergence of PINNs? | Unknown | N/A | |
| A Unifying Normative Framework of Decision Confidence | Unknown | N/A | |
| Kaleidoscope: Learnable Masks for Heterogeneous Multi-agent Reinforcement Learning | Unknown | N/A | |
| ODGEN: Domain-specific Object Detection Data Generation with Diffusion Models | Unknown | N/A | |
| DisenGCD: A Meta Multigraph-assisted Disentangled Graph Learning Framework for Cognitive Diagnosis | Unknown | N/A | |
| Sequential Harmful Shift Detection Without Labels | Unknown | N/A | |
| RashomonGB: Analyzing the Rashomon Effect and Mitigating Predictive Multiplicity in Gradient Boosting | Unknown | N/A | |
| CoSy: Evaluating Textual Explanations of Neurons | Unknown | N/A | |
| $\textit{Trans-LoRA}$: towards data-free Transferable Parameter Efficient Finetuning | Unknown | N/A | |
| Efficient Large Multi-modal Models via Visual Context Compression | Unknown | N/A | |
| Mutual Information Estimation via Normalizing Flows | Unknown | N/A | |
| LLM-based Skill Diffusion for Zero-shot Policy Adaptation | Unknown | N/A | |
| A Universal Growth Rate for Learning with Smooth Surrogate Losses | Unknown | N/A | |
| Cardinality-Aware Set Prediction and Top-$k$ Classification | Unknown | N/A | |
| RTify: Aligning Deep Neural Networks with Human Behavioral Decisions | Unknown | N/A | |
| A Simple and Optimal Approach for Universal Online Learning with Gradient Variations | Unknown | N/A | |
| Universal Online Convex Optimization with $1$ Projection per Round | Unknown | N/A | |
| OmniJARVIS: Unified Vision-Language-Action Tokenization Enables Open-World Instruction Following Agents | Unknown | N/A | |
| Exploring Molecular Pretraining Model at Scale | Unknown | N/A | |
| The Limits of Differential Privacy in Online Learning | Unknown | N/A | |
| Reparameterized Multi-Resolution Convolutions for Long Sequence Modelling | Unknown | N/A | |
| Empowering Active Learning for 3D Molecular Graphs with Geometric Graph Isomorphism | Unknown | N/A | |
| Harnessing small projectors and multiple views for efficient vision pretraining | Unknown | N/A | |
| On the Noise Robustness of In-Context Learning for Text Generation | Unknown | N/A | |
| ALPINE: Unveiling The Planning Capability of Autoregressive Learning in Language Models | Unknown | N/A | |
| AutoManual: Constructing Instruction Manuals by LLM Agents via Interactive Environmental Learning | Unknown | N/A | |
| Rethinking Optimal Transport in Offline Reinforcement Learning | Unknown | N/A | |
| Keeping LLMs Aligned After Fine-tuning: The Crucial Role of Prompt Templates | Unknown | N/A | |
| Improving Temporal Link Prediction via Temporal Walk Matrix Projection | Unknown | N/A | |
| Bootstrapping Top-down Information for Self-modulating Slot Attention | Unknown | N/A | |
| Diffusion Policy Attacker: Crafting Adversarial Attacks for Diffusion-based Policies | Unknown | N/A | |
| Stochastic Optimization Algorithms for Instrumental Variable Regression with Streaming Data | Unknown | N/A | |
| Worst-Case Offline Reinforcement Learning with Arbitrary Data Support | Unknown | N/A | |
| SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data | Unknown | N/A | |
| Aligning Target-Aware Molecule Diffusion Models with Exact Energy Optimization | Unknown | N/A | |
| Optimal Hypothesis Selection in (Almost) Linear Time | Unknown | N/A | |
| Strategic Linear Contextual Bandits | Unknown | N/A | |
| Make-An-Agent: A Generalizable Policy Network Generator with Behavior-Prompted Diffusion | Unknown | N/A | |
| Mitigating Biases in Blackbox Feature Extractors for Image Classification Tasks | Unknown | N/A | |
| Self-Distilled Depth Refinement with Noisy Poisson Fusion | Unknown | N/A | |
| Tackling Uncertain Correspondences for Multi-Modal Entity Alignment | Unknown | N/A | |
| Plan-on-Graph: Self-Correcting Adaptive Planning of Large Language Model on Knowledge Graphs | Unknown | N/A | |
| Single Image Reflection Separation via Dual-Stream Interactive Transformers | Unknown | N/A | |
| Looks Too Good To Be True: An Information-Theoretic Analysis of Hallucinations in Generative Restoration Models | Unknown | N/A | |
| GrounDiT: Grounding Diffusion Transformers via Noisy Patch Transplantation | Unknown | N/A | |
| Rethinking Model-based, Policy-based, and Value-based Reinforcement Learning via the Lens of Representation Complexity | Unknown | N/A | |
| Causal vs. Anticausal merging of predictors | Unknown | N/A | |
| Model Collapse Demystified: The Case of Regression | Unknown | N/A | |
| 4D Gaussian Splatting in the Wild with Uncertainty-Aware Regularization | Unknown | N/A | |
| Text-Aware Diffusion for Policy Learning | Unknown | N/A | |
| Multi-scale Consistency for Robust 3D Registration via Hierarchical Sinkhorn Tree | Unknown | N/A | |
| Motion Graph Unleashed: A Novel Approach to Video Prediction | Unknown | N/A | |
| On the Role of Attention Masks and LayerNorm in Transformers | Unknown | N/A | |
| Simplified and Generalized Masked Diffusion for Discrete Data | Unknown | N/A | |
| Cooperate or Collapse: Emergence of Sustainable Cooperation in a Society of LLM Agents | Unknown | N/A | |
| StepbaQ: Stepping backward as Correction for Quantized Diffusion Models | Unknown | N/A | |
| Efficient Minimum Bayes Risk Decoding using Low-Rank Matrix Completion Algorithms | Unknown | N/A | |
| NeuroBOLT: Resting-state EEG-to-fMRI Synthesis with Multi-dimensional Feature Mapping | Unknown | N/A | |
| Tree of Attacks: Jailbreaking Black-Box LLMs Automatically | Unknown | N/A | |
| Communication Efficient Distributed Training with Distributed Lion | Unknown | N/A | |
| ECMamba: Consolidating Selective State Space Model with Retinex Guidance for Efficient Multiple Exposure Correction | Unknown | N/A | |
| MiSO: Optimizing brain stimulation to create neural activity states | Unknown | N/A | |
| Integrating GNN and Neural ODEs for Estimating Non-Reciprocal Two-Body Interactions in Mixed-Species Collective Motion | Unknown | N/A | |
| Lookback Prophet Inequalities | Unknown | N/A | |
| From Transparent to Opaque: Rethinking Neural Implicit Surfaces with $\alpha$-NeuS | Unknown | N/A | |
| Learning-Augmented Priority Queues | Unknown | N/A | |
| Bias in Motion: Theoretical Insights into the Dynamics of Bias in SGD Training | Unknown | N/A | |
| Unconditional stability of a recurrent neural circuit implementing divisive normalization | Unknown | N/A | |
| MotionBooth: Motion-Aware Customized Text-to-Video Generation | Unknown | N/A | |
| Credal Deep Ensembles for Uncertainty Quantification | Unknown | N/A | |
| Embedding Dimension of Contrastive Learning and $k$-Nearest Neighbors | Unknown | N/A | |
| Understanding Linear Probing then Fine-tuning Language Models from NTK Perspective | Unknown | N/A | |
| IQA-EVAL: Automatic Evaluation of Human-Model Interactive Question Answering | Unknown | N/A | |
| GenWarp: Single Image to Novel Views with Semantic-Preserving Generative Warping | Unknown | N/A | |
| FreqMark: Invisible Image Watermarking via Frequency Based Optimization in Latent Space | Unknown | N/A | |
| Learning predictable and robust neural representations by straightening image sequences | Unknown | N/A | |
| SafeWorld: Geo-Diverse Safety Alignment | Unknown | N/A | |
| Taming the Long Tail in Human Mobility Prediction | Unknown | N/A | |
| Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs | Unknown | N/A | |
| Disentangling Interpretable Factors with Supervised Independent Subspace Principal Component Analysis | Unknown | N/A | |
| Fair Wasserstein Coresets | Unknown | N/A | |
| Near-Minimax-Optimal Distributional Reinforcement Learning with a Generative Model | Unknown | N/A | |
| UNIT: Unifying Image and Text Recognition in One Vision Encoder | Unknown | N/A | |
| VidMan: Exploiting Implicit Dynamics from Video Diffusion Model for Effective Robot Manipulation | Unknown | N/A | |
| Enhancing In-Context Learning Performance with just SVD-Based Weight Pruning: A Theoretical Perspective | Unknown | N/A | |
| Analyzing & Reducing the Need for Learning Rate Warmup in GPT Training | Unknown | N/A | |
| BitDelta: Your Fine-Tune May Only Be Worth One Bit | Unknown | N/A | |
| Microstructures and Accuracy of Graph Recall by Large Language Models | Unknown | N/A | |
| Pretrained Transformer Efficiently Learns Low-Dimensional Target Functions In-Context | Unknown | N/A | |
| Mitigating Backdoor Attack by Injecting Proactive Defensive Backdoor | Unknown | N/A | |
| Sample-Efficient Agnostic Boosting | Unknown | N/A | |
| Vision Mamba Mender | Unknown | N/A | |
| Ask, Attend, Attack: An Effective Decision-Based Black-Box Targeted Attack for Image-to-Text Models | Unknown | N/A | |
| Adaptive Proximal Gradient Method for Convex Optimization | Unknown | N/A | |
| EnsIR: An Ensemble Algorithm for Image Restoration via Gaussian Mixture Models | Unknown | N/A | |
| Nesterov acceleration despite very noisy gradients | Unknown | N/A | |
| Graph Structure Inference with BAM: Neural Dependency Processing via Bilinear Attention | Unknown | N/A | |
| Can Transformers Smell Like Humans? | Unknown | N/A | |
| Contrastive-Equivariant Self-Supervised Learning Improves Alignment with Primate Visual Area IT | Unknown | N/A | |
| Adaptive Labeling for Efficient Out-of-distribution Model Evaluation | Unknown | N/A | |
| Training for Stable Explanation for Free | Unknown | N/A | |
| DenseFormer: Enhancing Information Flow in Transformers via Depth Weighted Averaging | Unknown | N/A | |
| Provably and Practically Efficient Adversarial Imitation Learning with General Function Approximation | Unknown | N/A | |
| Data Distribution Valuation | Unknown | N/A | |
| FastDrag: Manipulate Anything in One Step | Unknown | N/A | |
| Online Posterior Sampling with a Diffusion Prior | Unknown | N/A | |
| DiffCut: Catalyzing Zero-Shot Semantic Segmentation with Diffusion Features and Recursive Normalized Cut | Unknown | N/A | |
| $\text{ID}^3$: Identity-Preserving-yet-Diversified Diffusion Models for Synthetic Face Recognition | Unknown | N/A | |
| Quadratic Quantum Variational Monte Carlo | Unknown | N/A | |
| AdaFlow: Imitation Learning with Variance-Adaptive Flow-Based Policies | Unknown | N/A | |
| Scalable and Effective Arithmetic Tree Generation for Adder and Multiplier Designs | Unknown | N/A | |
| LiteVAE: Lightweight and Efficient Variational Autoencoders for Latent Diffusion Models | Unknown | N/A | |
| PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator | Unknown | N/A | |
| Improving Equivariant Model Training via Constraint Relaxation | Unknown | N/A | |
| Evidential Stochastic Differential Equations for Time-Aware Sequential Recommendation | Unknown | N/A | |
| Enhancing Protein Mutation Effect Prediction through a Retrieval-Augmented Framework | Unknown | N/A | |
| Amortized Fourier Neural Operators | Unknown | N/A | |
| Dynamic Model Predictive Shielding for Provably Safe Reinforcement Learning | Unknown | N/A | |
| Advancing Spiking Neural Networks for Sequential Modeling with Central Pattern Generators | Unknown | N/A | |
| Predicting Ground State Properties: Constant Sample Complexity and Deep Learning Algorithms | Unknown | N/A | |
| LRM-Zero: Training Large Reconstruction Models with Synthesized Data | Unknown | N/A | |
| PrefPaint: Aligning Image Inpainting Diffusion Model with Human Preference | Unknown | N/A | |
| Leveraging Tumor Heterogeneity: Heterogeneous Graph Representation Learning for Cancer Survival Prediction in Whole Slide Images | Unknown | N/A | |
| Causal Effect Identification in a Sub-Population with Latent Variables | Unknown | N/A | |
| Flipped Classroom: Aligning Teacher Attention with Student in Generalized Category Discovery | Unknown | N/A | |
| Computation-Aware Gaussian Processes: Model Selection And Linear-Time Inference | Unknown | N/A | |
| Inversion-based Latent Bayesian Optimization | Unknown | N/A | |
| EigenVI: score-based variational inference with orthogonal function expansions | Unknown | N/A | |
| Kernel Language Entropy: Fine-grained Uncertainty Quantification for LLMs from Semantic Similarities | Unknown | N/A | |
| Truthfulness of Calibration Measures | Unknown | N/A | |
| FineStyle: Fine-grained Controllable Style Personalization for Text-to-image Models | Unknown | N/A | |
| OwMatch: Conditional Self-Labeling with Consistency for Open-World Semi-Supervised Learning | Unknown | N/A | |
| Achievable distributional robustness when the robust risk is only partially identified | Unknown | N/A | |
| Optimal Private and Communication Constraint Distributed Goodness-of-Fit Testing for Discrete Distributions in the Large Sample Regime | Unknown | N/A | |
| Bandits with Abstention under Expert Advice | Unknown | N/A | |
| Taming Heavy-Tailed Losses in Adversarial Bandits and the Best-of-Both-Worlds Setting | Unknown | N/A | |
| The Empirical Impact of Neural Parameter Symmetries, or Lack Thereof | Unknown | N/A | |
| Computational Aspects of Bayesian Persuasion under Approximate Best Response | Unknown | N/A | |
| ShiftAddLLM: Accelerating Pretrained LLMs via Post-Training Multiplication-Less Reparameterization | Unknown | N/A | |
| 3D Gaussian Rendering Can Be Sparser: Efficient Rendering via Learned Fragment Pruning | Unknown | N/A | |
| Score-based generative models are provably robust: an uncertainty quantification perspective | Unknown | N/A | |
| AmoebaLLM: Constructing Any-Shape Large Language Models for Efficient and Instant Deployment | Unknown | N/A | |
| A Surprisingly Simple Approach to Generalized Few-Shot Semantic Segmentation | Unknown | N/A | |
| Listenable Maps for Zero-Shot Audio Classifiers | Unknown | N/A | |
| BetterDepth: Plug-and-Play Diffusion Refiner for Zero-Shot Monocular Depth Estimation | Unknown | N/A | |
| Online Iterative Reinforcement Learning from Human Feedback with General Preference Model | Unknown | N/A | |
| Dueling over Dessert, Mastering the Art of Repeated Cake Cutting | Unknown | N/A | |
| Provable Posterior Sampling with Denoising Oracles via Tilted Transport | Unknown | N/A | |
| NoMAD-Attention: Efficient LLM Inference on CPUs Through Multiply-add-free Attention | Unknown | N/A | |
| A Nearly Optimal and Low-Switching Algorithm for Reinforcement Learning with General Function Approximation | Unknown | N/A | |
| Self-Supervised Adversarial Training via Diverse Augmented Queries and Self-Supervised Double Perturbation | Unknown | N/A | |
| Nonparametric Instrumental Variable Regression through Stochastic Approximate Gradients | Unknown | N/A | |
| Conditional Generative Models are Sufficient to Sample from Any Causal Effect Estimand | Unknown | N/A | |
| Partial Structure Discovery is Sufficient for No-regret Learning in Causal Bandits | Unknown | N/A | |
| Sample Efficient Bayesian Learning of Causal Graphs from Interventions | Unknown | N/A | |
| Fast TRAC: A Parameter-Free Optimizer for Lifelong Reinforcement Learning | Unknown | N/A | |
| Efficient and Private Marginal Reconstruction with Local Non-Negativity | Unknown | N/A | |
| MAGNET: Improving the Multilingual Fairness of Language Models with Adaptive Gradient-Based Tokenization | Unknown | N/A | |
| Uni-Med: A Unified Medical Generalist Foundation Model For Multi-Task Learning Via Connector-MoE | Unknown | N/A | |
| Large Language Models as Urban Residents: An LLM Agent Framework for Personal Mobility Generation | Unknown | N/A | |
| Learning Infinitesimal Generators of Continuous Symmetries from Data | Unknown | N/A | |
| CooHOI: Learning Cooperative Human-Object Interaction with Manipulated Object Dynamics | Unknown | N/A | |
| Learning to grok: Emergence of in-context learning and skill composition in modular arithmetic tasks | Unknown | N/A | |
| Personalized Federated Learning with Mixture of Models for Adaptive Prediction and Model Fine-Tuning | Unknown | N/A | |
| On the Identifiability of Hybrid Deep Generative Models: Meta-Learning as a Solution | Unknown | N/A | |
| FairWire: Fair Graph Generation | Unknown | N/A | |
| Does Egalitarian Fairness Lead to Instability? The Fairness Bounds in Stable Federated Learning Under Altruistic Behaviors | Unknown | N/A | |
| Multi-model Ensemble Conformal Prediction in Dynamic Environments | Unknown | N/A | |
| SPO: Sequential Monte Carlo Policy Optimisation | Unknown | N/A | |
| Membership Inference Attacks against Large Vision-Language Models | Unknown | N/A | |
| Sample-Efficient Geometry Reconstruction from Euclidean Distances using Non-Convex Optimization | Unknown | N/A | |
| Causal Dependence Plots | Unknown | N/A | |
| LoCo: Learning 3D Location-Consistent Image Features with a Memory-Efficient Ranking Loss | Unknown | N/A | |
| FlexCap: Describe Anything in Images in Controllable Detail | Unknown | N/A | |
| Sequoia: Scalable and Robust Speculative Decoding | Unknown | N/A | |
| Blind Image Restoration via Fast Diffusion Inversion | Unknown | N/A | |
| A Primal-Dual-Assisted Penalty Approach to Bilevel Optimization with Coupled Constraints | Unknown | N/A | |
| MimicTalk: Mimicking a personalized and expressive 3D talking face in minutes | Unknown | N/A | |
| Diffusion Models With Learned Adaptive Noise | Unknown | N/A | |
| LoRANN: Low-Rank Matrix Factorization for Approximate Nearest Neighbor Search | Unknown | N/A | |
| Generalizing CNNs to graphs with learnable neighborhood quantization | Unknown | N/A | |
| Bayesian Online Natural Gradient (BONG) | Unknown | N/A | |
| Just Add \$100 More: Augmenting Pseudo-LiDAR Point Cloud for Resolving Class-imbalance Problem | Unknown | N/A | |
| Mixture of neural fields for heterogeneous reconstruction in cryo-EM | Unknown | N/A | |
| Automating Data Annotation under Strategic Human Agents: Risks and Potential Solutions | Unknown | N/A | |
| Policy Mirror Descent with Lookahead | Unknown | N/A | |
| AV-Cloud: Spatial Audio Rendering Through Audio-Visual Cloud Splatting | Unknown | N/A | |
| Self-Calibrating Conformal Prediction | Unknown | N/A | |
| Dual Lagrangian Learning for Conic Optimization | Unknown | N/A | |
| ControlSynth Neural ODEs: Modeling Dynamical Systems with Guaranteed Convergence | Unknown | N/A | |
| Theoretical and Empirical Insights into the Origins of Degree Bias in Graph Neural Networks | Unknown | N/A | |
| Fair GLASSO: Estimating Fair Graphical Models with Unbiased Statistical Behavior | Unknown | N/A | |
| State-free Reinforcement Learning | Unknown | N/A | |
| CosAE: Learnable Fourier Series for Image Restoration | Unknown | N/A | |
| Bileve: Securing Text Provenance in Large Language Models Against Spoofing with Bi-level Signature | Unknown | N/A | |
| Practical Shuffle Coding | Unknown | N/A | |
| Statistical Estimation in the Spiked Tensor Model via the Quantum Approximate Optimization Algorithm | Unknown | N/A | |
| A Fast Convoluted Story: Scaling Probabilistic Inference for Integer Arithmetics | Unknown | N/A | |
| Evaluating the design space of diffusion-based generative models | Unknown | N/A | |
| Learn To be Efficient: Build Structured Sparsity in Large Language Models | Unknown | N/A | |
| On the Optimality of Dilated Entropy and Lower Bounds for Online Learning in Extensive-Form Games | Unknown | N/A | |
| Preference Learning of Latent Decision Utilities with a Human-like Model of Preferential Choice | Unknown | N/A | |
| Adaptive and Optimal Second-order Optimistic Methods for Minimax Optimization | Unknown | N/A | |
| Personalizing Reinforcement Learning from Human Feedback with Variational Preference Learning | Unknown | N/A | |
| Diffusing Differentiable Representations | Unknown | N/A | |
| Transformers to SSMs: Distilling Quadratic Knowledge to Subquadratic Models | Unknown | N/A | |
| Understanding Hallucinations in Diffusion Models through Mode Interpolation | Unknown | N/A | |
| Predicting the Performance of Foundation Models via Agreement-on-the-Line | Unknown | N/A | |
| Improving Alignment and Robustness with Circuit Breakers | Unknown | N/A | |
| DiMSUM: Diffusion Mamba - A Scalable and Unified Spatial-Frequency Method for Image Generation | Unknown | N/A | |
| Language-Driven Interactive Traffic Trajectory Generation | Unknown | N/A | |
| Delving into the Reversal Curse: How Far Can Large Language Models Generalize? | Unknown | N/A | |
| Graph Classification via Reference Distribution Learning: Theory and Practice | Unknown | N/A | |
| HOPE: Shape Matching Via Aligning Different K-hop Neighbourhoods | Unknown | N/A | |
| Reparameterization invariance in approximate Bayesian inference | Unknown | N/A | |
| Drones Help Drones: A Collaborative Framework for Multi-Drone Object Trajectory Prediction and Beyond | Unknown | N/A | |
| G3: An Effective and Adaptive Framework for Worldwide Geolocalization Using Large Multi-Modality Models | Unknown | N/A | |
| You Only Cache Once: Decoder-Decoder Architectures for Language Models | Unknown | N/A | |
| Vision Transformer Neural Architecture Search for Out-of-Distribution Generalization: Benchmark and Insights | Unknown | N/A | |
| Differentiable Task Graph Learning: Procedural Activity Representation and Online Mistake Detection from Egocentric Videos | Unknown | N/A | |
| Revisiting Adversarial Patches for Designing Camera-Agnostic Attacks against Person Detection | Unknown | N/A | |
| Not Just Object, But State: Compositional Incremental Learning without Forgetting | Unknown | N/A | |
| Learning from Noisy Labels via Conditional Distributionally Robust Optimization | Unknown | N/A | |
| Globally Q-linear Gauss-Newton Method for Overparameterized Non-convex Matrix Sensing | Unknown | N/A | |
| DiffuserLite: Towards Real-time Diffusion Planning | Unknown | N/A | |
| Overcoming Common Flaws in the Evaluation of Selective Classification Systems | Unknown | N/A | |
| Dendritic Integration Inspired Artificial Neural Networks Capture Data Correlation | Unknown | N/A | |
| Visual Anchors Are Strong Information Aggregators For Multimodal Large Language Model | Unknown | N/A | |
| From Similarity to Superiority: Channel Clustering for Time Series Forecasting | Unknown | N/A | |
| When Is Inductive Inference Possible? | Unknown | N/A | |
| Weight Diffusion for Future: Learn to Generalize in Non-Stationary Environments | Unknown | N/A | |
| Active learning of neural population dynamics using two-photon holographic optogenetics | Unknown | N/A | |
| Unleashing the Denoising Capability of Diffusion Prior for Solving Inverse Problems | Unknown | N/A | |
| Quantitative Convergences of Lie Group Momentum Optimizers | Unknown | N/A | |
| Multi-language Diversity Benefits Autoformalization | Unknown | N/A | |
| DEL: Discrete Element Learner for Learning 3D Particle Dynamics with Neural Rendering | Unknown | N/A | |
| Externally Valid Policy Evaluation from Randomized Trials Using Additional Observational Data | Unknown | N/A | |
| Towards Understanding Extrapolation: a Causal Lens | Unknown | N/A | |
| Last-Iterate Global Convergence of Policy Gradients for Constrained Reinforcement Learning | Unknown | N/A | |
| Automated Efficient Estimation using Monte Carlo Efficient Influence Functions | Unknown | N/A | |
| Multidimensional Fractional Programming for Normalized Cuts | Unknown | N/A | |
| Instance-adaptive Zero-shot Chain-of-Thought Prompting | Unknown | N/A | |
| How Does Variance Shape the Regret in Contextual Bandits? | Unknown | N/A | |
| Aggregating Quantitative Relative Judgments: From Social Choice to Ranking Prediction | Unknown | N/A | |
| Towards Multi-dimensional Explanation Alignment for Medical Classification | Unknown | N/A | |
| On Tractable $\Phi$-Equilibria in Non-Concave Games | Unknown | N/A | |
| Enhancing Large Language Models through Adaptive Tokenizers | Unknown | N/A | |
| On the Impact of Feature Heterophily on Link Prediction with Graph Neural Networks | Unknown | N/A | |
| Stylus: Automatic Adapter Selection for Diffusion Models | Unknown | N/A | |
| Flexible Context-Driven Sensory Processing in Dynamical Vision Models | Unknown | N/A | |
| Generative Adversarial Model-Based Optimization via Source Critic Regularization | Unknown | N/A | |
| GS-Hider: Hiding Messages into 3D Gaussian Splatting | Unknown | N/A | |
| InfoRM: Mitigating Reward Hacking in RLHF via Information-Theoretic Reward Modeling | Unknown | N/A | |
| Global Rewards in Restless Multi-Armed Bandits | Unknown | N/A | |
| EGonc: Energy-based Open-Set Node Classification with substitute Unknowns | Unknown | N/A | |
| Improving the Learning Capability of Small-size Image Restoration Network by Deep Fourier Shifting | Unknown | N/A | |
| SEA: State-Exchange Attention for High-Fidelity Physics Based Transformers | Unknown | N/A | |
| Topological obstruction to the training of shallow ReLU neural networks | Unknown | N/A | |
| Low Degree Hardness for Broadcasting on Trees | Unknown | N/A | |
| Exploring the Edges of Latent State Clusters for Goal-Conditioned Reinforcement Learning | Unknown | N/A | |
| Cluster-wise Graph Transformer with Dual-granularity Kernelized Attention | Unknown | N/A | |
| Fearless Stochasticity in Expectation Propagation | Unknown | N/A | |
| Learning General Parameterized Policies for Infinite Horizon Average Reward Constrained MDPs via Primal-Dual Policy Gradient Algorithm | Unknown | N/A | |
| D-MiSo: Editing Dynamic 3D Scenes using Multi-Gaussians Soup | Unknown | N/A | |
| A theoretical case-study of Scalable Oversight in Hierarchical Reinforcement Learning | Unknown | N/A | |
| SA3DIP: Segment Any 3D Instance with Potential 3D Priors | Unknown | N/A | |
| The Ladder in Chaos: Improving Policy Learning by Harnessing the Parameter Evolving Path in A Low-dimensional Space | Unknown | N/A | |
| Higher-Order Causal Message Passing for Experimentation with Complex Interference | Unknown | N/A | |
| ETO: Efficient Transformer-based Local Feature Matching by Organizing Multiple Homography Hypotheses | Unknown | N/A | |
| Expectile Regularization for Fast and Accurate Training of Neural Optimal Transport | Unknown | N/A | |
| Boundary Matters: A Bi-Level Active Finetuning Method | Unknown | N/A | |
| AP-Adapter: Improving Generalization of Automatic Prompts on Unseen Text-to-Image Diffusion Models | Unknown | N/A | |
| AsyncDiff: Parallelizing Diffusion Models by Asynchronous Denoising | Unknown | N/A | |
| COVE: Unleashing the Diffusion Feature Correspondence for Consistent Video Editing | Unknown | N/A | |
| Slack-Free Spiking Neural Network Formulation for Hypergraph Minimum Vertex Cover | Unknown | N/A | |
| How Does Black-Box Impact the Learning Guarantee of Stochastic Compositional Optimization? | Unknown | N/A | |
| Towards Dynamic Message Passing on Graphs | Unknown | N/A | |
| Bias Detection via Signaling | Unknown | N/A | |
| Linear Regression using Heterogeneous Data Batches | Unknown | N/A | |
| Strategic Littlestone Dimension: Improved Bounds on Online Strategic Classification | Unknown | N/A | |
| Aligning Embeddings and Geometric Random Graphs: Informational Results and Computational Approaches for the Procrustes-Wasserstein Problem | Unknown | N/A | |
| On the Complexity of Teaching a Family of Linear Behavior Cloning Learners | Unknown | N/A | |
| iVideoGPT: Interactive VideoGPTs are Scalable World Models | Unknown | N/A | |
| Introspective Planning: Aligning Robots' Uncertainty with Inherent Task Ambiguity | Unknown | N/A | |
| AV-GS: Learning Material and Geometry Aware Priors for Novel View Acoustic Synthesis | Unknown | N/A | |
| Rethinking Memory and Communication Costs for Efficient Data Parallel Training of Large Language Models | Unknown | N/A | |
| Real-Time Recurrent Learning using Trace Units in Reinforcement Learning | Unknown | N/A | |
| Dynamic Subgroup Identification in Covariate-adjusted Response-adaptive Randomization Experiments | Unknown | N/A | |
| Iterative Reasoning Preference Optimization | Unknown | N/A | |
| DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning | Unknown | N/A | |
| Chain-of-Thought Reasoning Without Prompting | Unknown | N/A | |
| A Near-optimal Algorithm for Learning Margin Halfspaces with Massart Noise | Unknown | N/A | |
| AttnDreamBooth: Towards Text-Aligned Personalized Text-to-Image Generation | Unknown | N/A | |
| What is my quantum computer good for? Quantum capability learning with physics-aware neural networks | Unknown | N/A | |
| Unveiling Induction Heads: Provable Training Dynamics and Feature Learning in Transformers | Unknown | N/A | |
| DETAIL: Task DEmonsTration Attribution for Interpretable In-context Learning | Unknown | N/A | |
| MetaUAS: Universal Anomaly Segmentation with One-Prompt Meta-Learning | Unknown | N/A | |
| Motion Forecasting in Continuous Driving | Unknown | N/A | |
| Magnet: We Never Know How Text-to-Image Diffusion Models Work, Until We Learn How Vision-Language Models Function | Unknown | N/A | |
| Compositional Automata Embeddings for Goal-Conditioned Reinforcement Learning | Unknown | N/A | |
| Seek Commonality but Preserve Differences: Dissected Dynamics Modeling for Multi-modal Visual RL | Unknown | N/A | |
| Verified Code Transpilation with LLMs | Unknown | N/A | |
| Enhancing Reasoning Capabilities of LLMs via Principled Synthetic Logic Corpus | Unknown | N/A | |
| VFIMamba: Video Frame Interpolation with State Space Models | Unknown | N/A | |
| Regularized Q-Learning | Unknown | N/A | |
| DiTFastAttn: Attention Compression for Diffusion Transformer Models | Unknown | N/A | |
| Improving Context-Aware Preference Modeling for Language Models | Unknown | N/A | |
| Graph-enhanced Optimizers for Structure-aware Recommendation Embedding Evolution | Unknown | N/A | |
| TARSS-Net: Temporal-Aware Radar Semantic Segmentation Network | Unknown | N/A | |
| CausalStock: Deep End-to-end Causal Discovery for News-driven Multi-stock Movement Prediction | Unknown | N/A | |
| Federated Behavioural Planes: Explaining the Evolution of Client Behaviour in Federated Learning | Unknown | N/A | |
| VLM Agents Generate Their Own Memories: Distilling Experience into Embodied Programs of Thought | Unknown | N/A | |
| Minimizing UCB: a Better Local Search Strategy in Local Bayesian Optimization | Unknown | N/A | |
| Task-recency bias strikes back: Adapting covariances in Exemplar-Free Class Incremental Learning | Unknown | N/A | |
| MG-Net: Learn to Customize QAOA with Circuit Depth Awareness | Unknown | N/A | |
| FIDE: Frequency-Inflated Conditional Diffusion Model for Extreme-Aware Time Series Generation | Unknown | N/A | |
| DARG: Dynamic Evaluation of Large Language Models via Adaptive Reasoning Graph | Unknown | N/A | |
| TOPA: Extending Large Language Models for Video Understanding via Text-Only Pre-Alignment | Unknown | N/A | |
| Efficient Centroid-Linkage Clustering | Unknown | N/A | |
| Deep Support Vectors | Unknown | N/A | |
| Toxicity Detection for Free | Unknown | N/A | |
| SIRIUS: Contextual Sparsity with Correction for Efficient LLMs | Unknown | N/A | |
| Peri-midFormer: Periodic Pyramid Transformer for Time Series Analysis | Unknown | N/A | |
| Unlocking Tokens as Data Points for Generalization Bounds on Larger Language Models | Unknown | N/A | |
| Verifiably Robust Conformal Prediction | Unknown | N/A | |
| Algorithmic progress in language models | Unknown | N/A | |
| Atlas3D: Physically Constrained Self-Supporting Text-to-3D for Simulation and Fabrication | Unknown | N/A | |
| One-shot Federated Learning via Synthetic Distiller-Distillate Communication | Unknown | N/A | |
| Fine-grained Image-to-LiDAR Contrastive Distillation with Visual Foundation Models | Unknown | N/A | |
| SeTAR: Out-of-Distribution Detection with Selective Low-Rank Approximation | Unknown | N/A | |
| Locating What You Need: Towards Adapting Diffusion Models to OOD Concepts In-the-Wild | Unknown | N/A | |
| LightGaussian: Unbounded 3D Gaussian Compression with 15x Reduction and 200+ FPS | Unknown | N/A | |
| Rethinking the Membrane Dynamics and Optimization Objectives of Spiking Neural Networks | Unknown | N/A | |
| Learning Discrete Latent Variable Structures with Tensor Rank Conditions | Unknown | N/A | |
| Open-Book Neural Algorithmic Reasoning | Unknown | N/A | |
| Separations in the Representational Capabilities of Transformers and Recurrent Architectures | Unknown | N/A | |
| S$^{2}$FT: Efficient, Scalable and Generalizable LLM Fine-tuning by Structured Sparsity | Unknown | N/A | |
| ROIDICE: Offline Return on Investment Maximization for Efficient Decision Making | Unknown | N/A | |
| HYDRA-FL: Hybrid Knowledge Distillation for Robust and Accurate Federated Learning | Unknown | N/A | |
| Why Go Full? Elevating Federated Learning Through Partial Network Updates | Unknown | N/A | |
| Topological Generalization Bounds for Discrete-Time Stochastic Optimization Algorithms | Unknown | N/A | |
| Robust Neural Contextual Bandit against Adversarial Corruptions | Unknown | N/A | |
| The Sample-Communication Complexity Trade-off in Federated Q-Learning | Unknown | N/A | |
| Beating Adversarial Low-Rank MDPs with Unknown Transition and Bandit Feedback | Unknown | N/A | |
| Graph neural networks and non-commuting operators | Unknown | N/A | |
| Diffusion Imitation from Observation | Unknown | N/A | |
| Universal Rates of Empirical Risk Minimization | Unknown | N/A | |
| Classifier Clustering and Feature Alignment for Federated Learning under Distributed Concept Drift | Unknown | N/A | |
| Transfer Learning for Diffusion Models | Unknown | N/A | |
| DreamClear: High-Capacity Real-World Image Restoration with Privacy-Safe Dataset Curation | Unknown | N/A | |
| MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models | Unknown | N/A | |
| Sparse High Rank Adapters | Unknown | N/A | |
| Improving Generalization in Federated Learning with Model-Data Mutual Information Regularization: A Posterior Inference Approach | Unknown | N/A | |
| Block Transformer: Global-to-Local Language Modeling for Fast Inference | Unknown | N/A | |
| Consensus Learning with Deep Sets for Essential Matrix Estimation | Unknown | N/A | |
| Learning symmetries via weight-sharing with doubly stochastic tensors | Unknown | N/A | |
| Towards Next-Level Post-Training Quantization of Hyper-Scale Transformers | Unknown | N/A | |
| Fast Rates for Bandit PAC Multiclass Classification | Unknown | N/A | |
| Zero-Shot Reinforcement Learning from Low Quality Data | Unknown | N/A | |
| Is Your LiDAR Placement Optimized for 3D Scene Understanding? | Unknown | N/A | |
| Right this way: Can VLMs Guide Us to See More to Answer Questions? | Unknown | N/A | |
| Amortized Active Causal Induction with Deep Reinforcement Learning | Unknown | N/A | |
| Group and Shuffle: Efficient Structured Orthogonal Parametrization | Unknown | N/A | |
| Learning Place Cell Representations and Context-Dependent Remapping | Unknown | N/A | |
| Slot-VLM: Object-Event Slots for Video-Language Modeling | Unknown | N/A | |
| CALVIN: Improved Contextual Video Captioning via Instruction Tuning | Unknown | N/A | |
| Error Correction Output Codes for Robust Neural Networks against Weight-errors: A Neural Tangent Kernel Point of View | Unknown | N/A | |
| Boosting Semi-Supervised Scene Text Recognition via Viewing and Summarizing | Unknown | N/A | |
| Learning Social Welfare Functions | Unknown | N/A | |
| Great Minds Think Alike: The Universal Convergence Trend of Input Salience | Unknown | N/A | |
| Meta-Learning Universal Priors Using Non-Injective Change of Variables | Unknown | N/A | |
| Doubly Mild Generalization for Offline Reinforcement Learning | Unknown | N/A | |
| Overfitting Behaviour of Gaussian Kernel Ridgeless Regression: Varying Bandwidth or Dimensionality | Unknown | N/A | |
| Stable Minima Cannot Overfit in Univariate ReLU Networks: Generalization by Large Step Sizes | Unknown | N/A | |
| Randomized Exploration in Cooperative Multi-Agent Reinforcement Learning | Unknown | N/A | |
| Towards Harmless Rawlsian Fairness Regardless of Demographic Prior | Unknown | N/A | |
| Interpreting CLIP with Sparse Linear Concept Embeddings (SpLiCE) | Unknown | N/A | |
| Prediction-Powered Ranking of Large Language Models | Unknown | N/A | |
| Human-3Diffusion: Realistic Avatar Creation via Explicit 3D Consistent Diffusion Models | Unknown | N/A | |
| Derivatives of Stochastic Gradient Descent in parametric optimization | Unknown | N/A | |
| Progressive Entropic Optimal Transport Solvers | Unknown | N/A | |
| Untrained Neural Nets for Snapshot Compressive Imaging: Theory and Algorithms | Unknown | N/A | |
| Ensemble Learning for Heterogeneous Large Language Models with Deep Parallel Collaboration | Unknown | N/A | |
| Vidu4D: Single Generated Video to High-Fidelity 4D Reconstruction with Dynamic Gaussian Surfels | Unknown | N/A | |
| Navigable Graphs for High-Dimensional Nearest Neighbor Search: Constructions and Limits | Unknown | N/A | |
| Invisible Image Watermarks Are Provably Removable Using Generative AI | Unknown | N/A | |
| Initializing Variable-sized Vision Transformers from Learngene with Learnable Transformation | Unknown | N/A | |
| VideoLLM-MoD: Efficient Video-Language Streaming with Mixture-of-Depths Vision Computation | Unknown | N/A | |
| DeltaDEQ: Exploiting Heterogeneous Convergence for Accelerating Deep Equilibrium Iterations | Unknown | N/A | |
| Improving Visual Prompt Tuning by Gaussian Neighborhood Minimization for Long-Tailed Visual Recognition | Unknown | N/A | |
| AlphaTablets: A Generic Plane Representation for 3D Planar Reconstruction from Monocular Videos | Unknown | N/A | |
| LAM3D: Large Image-Point Clouds Alignment Model for 3D Reconstruction from Single Image | Unknown | N/A | |
| The Power of Resets in Online Reinforcement Learning | Unknown | N/A | |
| Flipping-based Policy for Chance-Constrained Markov Decision Processes | Unknown | N/A | |
| Identifying Functionally Important Features with End-to-End Sparse Dictionary Learning | Unknown | N/A | |
| Sub-optimal Experts mitigate Ambiguity in Inverse Reinforcement Learning | Unknown | N/A | |
| Adversarially Trained Weighted Actor-Critic for Safe Offline Reinforcement Learning | Unknown | N/A | |
| Offline Oracle-Efficient Learning for Contextual MDPs via Layerwise Exploration-Exploitation Tradeoff | Unknown | N/A | |
| Spectral Graph Pruning Against Over-Squashing and Over-Smoothing | Unknown | N/A | |
| Approximately Pareto-optimal Solutions for Bi-Objective k-Clustering | Unknown | N/A | |
| Unified Insights: Harnessing Multi-modal Data for Phenotype Imputation via View Decoupling | Unknown | N/A | |
| Bayesian Nonparametrics Meets Data-Driven Distributionally Robust Optimization | Unknown | N/A | |
| Adjust Pearson's $r$ to Measure Arbitrary Monotone Dependence | Unknown | N/A | |
| On the Power of Small-size Graph Neural Networks for Linear Programming | Unknown | N/A | |
| Gaussian Approximation and Multiplier Bootstrap for Polyak-Ruppert Averaged Linear Stochastic Approximation with Applications to TD Learning | Unknown | N/A | |
| Faster Neighborhood Attention: Reducing the O(n^2) Cost of Self Attention at the Threadblock Level | Unknown | N/A | |
| Robust Reinforcement Learning with General Utility | Unknown | N/A | |
| Optimal and Approximate Adaptive Stochastic Quantization | Unknown | N/A | |
| NaRCan: Natural Refined Canonical Image with Integration of Diffusion Prior for Video Editing | Unknown | N/A | |
| Extensive-Form Game Solving via Blackwell Approachability on Treeplexes | Unknown | N/A | |
| EAI: Emotional Decision-Making of LLMs in Strategic Games and Ethical Dilemmas | Unknown | N/A | |
| Evaluating alignment between humans and neural network representations in image-based learning tasks | Unknown | N/A | |
| Autoformalize Mathematical Statements by Symbolic Equivalence and Semantic Consistency | Unknown | N/A | |
| Beyond the Doors of Perception: Vision Transformers Represent Relations Between Objects | Unknown | N/A | |
| Improving Neural Network Surface Processing with Principal Curvatures | Unknown | N/A | |
| Temporal-Difference Learning Using Distributed Error Signals | Unknown | N/A | |
| LLM Evaluators Recognize and Favor Their Own Generations | Unknown | N/A | |
| Conformalized Multiple Testing after Data-dependent Selection | Unknown | N/A | |
| Globally Convergent Variational Inference | Unknown | N/A | |
| Bandit-Feedback Online Multiclass Classification: Variants and Tradeoffs | Unknown | N/A | |
| Pearls from Pebbles: Improved Confidence Functions for Auto-labeling | Unknown | N/A | |
| Boosting Alignment for Post-Unlearning Text-to-Image Generative Models | Unknown | N/A | |
| RandNet-Parareal: a time-parallel PDE solver using Random Neural Networks | Unknown | N/A | |
| Noise-Aware Differentially Private Regression via Meta-Learning | Unknown | N/A | |
| Boosting Text-to-Video Generative Model with MLLMs Feedback | Unknown | N/A | |
| Building on Efficient Foundations: Effective Training of LLMs with Structured Feedforward Layers | Unknown | N/A | |
| Articulate your NeRF: Unsupervised articulated object modeling via conditional view synthesis | Unknown | N/A | |
| Leveraging Hallucinations to Reduce Manual Prompt Dependency in Promptable Segmentation | Unknown | N/A | |
| Adaptive $Q$-Aid for Conditional Supervised Learning in Offline Reinforcement Learning | Unknown | N/A | |
| Optimization Algorithm Design via Electric Circuits | Unknown | N/A | |
| Aligning Model Properties via Conformal Risk Control | Unknown | N/A | |
| UMB: Understanding Model Behavior for Open-World Object Detection | Unknown | N/A | |
| Compact Language Models via Pruning and Knowledge Distillation | Unknown | N/A | |
| Context and Geometry Aware Voxel Transformer for Semantic Scene Completion | Unknown | N/A | |
| IPM-LSTM: A Learning-Based Interior Point Method for Solving Nonlinear Programs | Unknown | N/A | |
| Knowledge-Empowered Dynamic Graph Network for Irregularly Sampled Medical Time Series | Unknown | N/A | |
| Online Learning with Sublinear Best-Action Queries | Unknown | N/A | |
| Knowledge Circuits in Pretrained Transformers | Unknown | N/A | |
| Model Reconstruction Using Counterfactual Explanations: A Perspective From Polytope Theory | Unknown | N/A | |
| UDPM: Upsampling Diffusion Probabilistic Models | Unknown | N/A | |
| Improving Subgroup Robustness via Data Selection | Unknown | N/A | |
| FAST: A Dual-tier Few-Shot Learning Paradigm for Whole Slide Image Classification | Unknown | N/A | |
| Attention boosted Individualized Regression | Unknown | N/A | |
| Mixed Dynamics In Linear Networks: Unifying the Lazy and Active Regimes | Unknown | N/A | |
| Testing Semantic Importance via Betting | Unknown | N/A | |
| Optimal Transport-based Labor-free Text Prompt Modeling for Sketch Re-identification | Unknown | N/A | |
| No "Zero-Shot" Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model Performance | Unknown | N/A | |
| Label Delay in Online Continual Learning | Unknown | N/A | |
| Tensor-Based Synchronization and the Low-Rankness of the Block Trifocal Tensor | Unknown | N/A | |
| Vector Quantization Prompting for Continual Learning | Unknown | N/A | |
| TableRAG: Million-Token Table Understanding with Language Models | Unknown | N/A | |
| Fast Graph Sharpness-Aware Minimization for Enhancing and Accelerating Few-Shot Node Classification | Unknown | N/A | |
| Feint Behaviors and Strategies: Formalization, Implementation and Evaluation | Unknown | N/A | |
| Credal Learning Theory | Unknown | N/A | |
| DEFT: Efficient Fine-tuning of Diffusion Models by Learning the Generalised $h$-transform | Unknown | N/A | |
| Optimized Feature Generation for Tabular Data via LLMs with Decision Tree Reasoning | Unknown | N/A | |
| Improving Neural ODE Training with Temporal Adaptive Batch Normalization | Unknown | N/A | |
| CA-SSLR: Condition-Aware Self-Supervised Learning Representation for Generalized Speech Processing | Unknown | N/A | |
| Optimal Batched Best Arm Identification | Unknown | N/A | |
| Freya PAGE: First Optimal Time Complexity for Large-Scale Nonconvex Finite-Sum Optimization with Heterogeneous Asynchronous Computations | Unknown | N/A | |
| Lambda: Learning Matchable Prior For Entity Alignment with Unlabeled Dangling Cases | Unknown | N/A | |
| Causal Context Adjustment Loss for Learned Image Compression | Unknown | N/A | |
| Covariate Shift Corrected Conditional Randomization Test | Unknown | N/A | |
| Expanding Sparse Tuning for Low Memory Usage | Unknown | N/A | |
| SpelsNet: Surface Primitive Elements Segmentation by B-Rep Graph Structure Supervision | Unknown | N/A | |
| Integrating Deep Metric Learning with Coreset for Active Learning in 3D Segmentation | Unknown | N/A | |
| UQE: A Query Engine for Unstructured Databases | Unknown | N/A | |
| PV-Tuning: Beyond Straight-Through Estimation for Extreme LLM Compression | Unknown | N/A | |
| Learning Identifiable Factorized Causal Representations of Cellular Responses | Unknown | N/A | |
| Multi-Group Proportional Representation in Retrieval | Unknown | N/A | |
| Fair Secretaries with Unfair Predictions | Unknown | N/A | |
| Auditing Privacy Mechanisms via Label Inference Attacks | Unknown | N/A | |
| Robust and Faster Zeroth-Order Minimax Optimization: Complexity and Applications | Unknown | N/A | |
| Simple and Fast Distillation of Diffusion Models | Unknown | N/A | |
| Unsupervised Anomaly Detection in The Presence of Missing Values | Unknown | N/A | |
| Semi-supervised Multi-label Learning with Balanced Binary Angular Margin Loss | Unknown | N/A | |
| NeuMA: Neural Material Adaptor for Visual Grounding of Intrinsic Dynamics | Unknown | N/A | |
| Learning from Teaching Regularization: Generalizable Correlations Should be Easy to Imitate | Unknown | N/A | |
| DeformableTST: Transformer for Time Series Forecasting without Over-reliance on Patching | Unknown | N/A | |
| Matryoshka Query Transformer for Large Vision-Language Models | Unknown | N/A | |
| SLowcalSGD: Slow Query Points Improve Local-SGD for Stochastic Convex Optimization | Unknown | N/A | |
| Differentiable Structure Learning with Partial Orders | Unknown | N/A | |
| Towards Accurate and Fair Cognitive Diagnosis via Monotonic Data Augmentation | Unknown | N/A | |
| Genetic-guided GFlowNets for Sample Efficient Molecular Optimization | Unknown | N/A | |
| Optimizing the coalition gain in Online Auctions with Greedy Structured Bandits | Unknown | N/A | |
| Rethinking Fourier Transform from A Basis Functions Perspective for Long-term Time Series Forecasting | Unknown | N/A | |
| Toward Semantic Gaze Target Detection | Unknown | N/A | |
| BERTs are Generative In-Context Learners | Unknown | N/A | |
| Efficiently Learning Significant Fourier Feature Pairs for Statistical Independence Testing | Unknown | N/A | |
| Model-based Diffusion for Trajectory Optimization | Unknown | N/A | |
| Provably Faster Algorithms for Bilevel Optimization via Without-Replacement Sampling | Unknown | N/A | |
| Locally Private and Robust Multi-Armed Bandits | Unknown | N/A | |
| Efficient Adaptation of Pre-trained Vision Transformer via Householder Transformation | Unknown | N/A | |
| AdanCA: Neural Cellular Automata As Adaptors For More Robust Vision Transformer | Unknown | N/A | |
| InterDreamer: Zero-Shot Text to 3D Dynamic Human-Object Interaction | Unknown | N/A | |
| Causal Inference in the Closed-Loop: Marginal Structural Models for Sequential Excursion Effects | Unknown | N/A | |
| Truncated Variance Reduced Value Iteration | Unknown | N/A | |
| Effective Exploration Based on the Structural Information Principles | Unknown | N/A | |
| QWO: Speeding Up Permutation-Based Causal Discovery in LiGAMs | Unknown | N/A | |
| Learning Equilibria in Adversarial Team Markov Games: A Nonconvex-Hidden-Concave Min-Max Optimization Problem | Unknown | N/A | |
| Learning Interaction-aware 3D Gaussian Splatting for One-shot Hand Avatars | Unknown | N/A | |
| Inferring stochastic low-rank recurrent neural networks from neural data | Unknown | N/A | |
| Unchosen Experts Can Contribute Too: Unleashing MoE Models’ Power by Self-Contrast | Unknown | N/A | |
| Advancing Open-Set Domain Generalization Using Evidential Bi-Level Hardest Domain Scheduler | Unknown | N/A | |
| Diffusion Spectral Representation for Reinforcement Learning | Unknown | N/A | |
| Reshuffling Resampling Splits Can Improve Generalization of Hyperparameter Optimization | Unknown | N/A | |
| Stabilizing Zero-Shot Prediction: A Novel Antidote to Forgetting in Continual Vision-Language Tasks | Unknown | N/A | |
| Multistep Distillation of Diffusion Models via Moment Matching | Unknown | N/A | |
| Towards Multi-Domain Learning for Generalizable Video Anomaly Detection | Unknown | N/A | |
| Convergence of No-Swap-Regret Dynamics in Self-Play | Unknown | N/A | |
| Cross-Scale Self-Supervised Blind Image Deblurring via Implicit Neural Representation | Unknown | N/A | |
| Stochastic Zeroth-Order Optimization under Strongly Convexity and Lipschitz Hessian: Minimax Sample Complexity | Unknown | N/A | |
| Bidirectional Recurrence for Cardiac Motion Tracking with Gaussian Process Latent Coding | Unknown | N/A | |
| Global Convergence in Training Large-Scale Transformers | Unknown | N/A | |
| Uncertainty of Thoughts: Uncertainty-Aware Planning Enhances Information Seeking in LLMs | Unknown | N/A | |
| Exploring and Exploiting the Asymmetric Valley of Deep Neural Networks | Unknown | N/A | |
| Taming "data-hungry" reinforcement learning? Stability in continuous state-action spaces | Unknown | N/A | |
| Spiking Neural Network as Adaptive Event Stream Slicer | Unknown | N/A | |
| Universal Exact Compression of Differentially Private Mechanisms | Unknown | N/A | |
| Enhancing Robustness of Last Layer Two-Stage Fair Model Corrections | Unknown | N/A | |
| Fast Proxy Experiment Design for Causal Effect Identification | Unknown | N/A | |
| Coherence-free Entrywise Estimation of Eigenvectors in Low-rank Signal-plus-noise Matrix Models | Unknown | N/A | |
| From Instance Training to Instruction Learning: Task Adapters Generation from Instructions | Unknown | N/A | |
| ODGS: 3D Scene Reconstruction from Omnidirectional Images with 3D Gaussian Splattings | Unknown | N/A | |
| Large Language Models-guided Dynamic Adaptation for Temporal Knowledge Graph Reasoning | Unknown | N/A | |
| Simplifying Latent Dynamics with Softly State-Invariant World Models | Unknown | N/A | |
| PAC-Bayes-Chernoff bounds for unbounded losses | Unknown | N/A | |
| Don't Compress Gradients in Random Reshuffling: Compress Gradient Differences | Unknown | N/A | |
| LLM Dataset Inference: Did you train on my dataset? | Unknown | N/A | |
| Grokking of Implicit Reasoning in Transformers: A Mechanistic Journey to the Edge of Generalization | Unknown | N/A | |
| DeSparsify: Adversarial Attack Against Token Sparsification Mechanisms | Unknown | N/A | |
| Learning Distinguishable Trajectory Representation with Contrastive Loss | Unknown | N/A | |
| Interpreting the Weight Space of Customized Diffusion Models | Unknown | N/A | |
| Federated Natural Policy Gradient and Actor Critic Methods for Multi-task Reinforcement Learning | Unknown | N/A | |
| Deep Policy Gradient Methods Without Batch Updates, Target Networks, or Replay Buffers | Unknown | N/A | |
| Identifying Spatio-Temporal Drivers of Extreme Events | Unknown | N/A | |
| HiCoM: Hierarchical Coherent Motion for Dynamic Streamable Scenes with 3D Gaussian Splatting | Unknown | N/A | |
| Animal-Bench: Benchmarking Multimodal Video Models for Animal-centric Video Understanding | Unknown | N/A | |
| Contextual Decision-Making with Knapsacks Beyond the Worst Case | Unknown | N/A | |
| Is Knowledge Power? On the (Im)possibility of Learning from Strategic Interactions | Unknown | N/A | |
| Neural Combinatorial Optimization for Robust Routing Problem with Uncertain Travel Times | Unknown | N/A | |
| Make Continual Learning Stronger via C-Flat | Unknown | N/A | |
| Automatic Outlier Rectification via Optimal Transport | Unknown | N/A | |
| Exactly Minimax-Optimal Locally Differentially Private Sampling | Unknown | N/A | |
| On the Benefits of Public Representations for Private Transfer Learning under Distribution Shift | Unknown | N/A | |
| Breaking the False Sense of Security in Backdoor Defense through Re-Activation Attack | Unknown | N/A | |
| PuLID: Pure and Lightning ID Customization via Contrastive Alignment | Unknown | N/A | |
| A Theory of Optimistically Universal Online Learnability for General Concept Classes | Unknown | N/A | |
| Efficient Streaming Algorithms for Graphlet Sampling | Unknown | N/A | |
| Data Mixture Inference Attack: BPE Tokenizers Reveal Training Data Compositions | Unknown | N/A | |
| Nearly Minimax Optimal Regret for Multinomial Logistic Bandit | Unknown | N/A | |
| FM-Delta: Lossless Compression for Storing Massive Fine-tuned Foundation Models | Unknown | N/A | |
| STONE: A Submodular Optimization Framework for Active 3D Object Detection | Unknown | N/A | |
| Data subsampling for Poisson regression with pth-root-link | Unknown | N/A | |
| Dual Defense: Enhancing Privacy and Mitigating Poisoning Attacks in Federated Learning | Unknown | N/A | |
| Rethinking the Power of Timestamps for Robust Time Series Forecasting: A Global-Local Fusion Perspective | Unknown | N/A | |
| Kernel PCA for Out-of-Distribution Detection | Unknown | N/A | |
| Matching the Statistical Query Lower Bound for $k$-Sparse Parity Problems with Sign Stochastic Gradient Descent | Unknown | N/A | |
| Remove that Square Root: A New Efficient Scale-Invariant Version of AdaGrad | Unknown | N/A | |
| Revisiting K-mer Profile for Effective and Scalable Genome Representation Learning | Unknown | N/A | |
| CRT-Fusion: Camera, Radar, Temporal Fusion Using Motion Information for 3D Object Detection | Unknown | N/A | |
| Last-Iterate Convergence for Generalized Frank-Wolfe in Monotone Variational Inequalities | Unknown | N/A | |
| Multiview Scene Graph | Unknown | N/A | |
| Amnesia as a Catalyst for Enhancing Black Box Pixel Attacks in Image Classification and Object Detection | Unknown | N/A | |
| Fast Tree-Field Integrators: From Low Displacement Rank to Topological Transformers | Unknown | N/A | |
| 3D Structure Prediction of Atomic Systems with Flow-based Direct Preference Optimization | Unknown | N/A | |
| Leveraging an ECG Beat Diffusion Model for Morphological Reconstruction from Indirect Signals | Unknown | N/A | |
| LLaNA: Large Language and NeRF Assistant | Unknown | N/A | |
| Predicting Label Distribution from Ternary Labels | Unknown | N/A | |
| Deep linear networks for regression are implicitly regularized towards flat minima | Unknown | N/A | |
| SAM-Guided Masked Token Prediction for 3D Scene Understanding | Unknown | N/A | |
| MECD: Unlocking Multi-Event Causal Discovery in Video Reasoning | Unknown | N/A | |
| Accelerating Diffusion Models with Parallel Sampling: Inference at Sub-Linear Time Complexity | Unknown | N/A | |
| Masked Hard-Attention Transformers Recognize Exactly the Star-Free Languages | Unknown | N/A | |
| Consistency Diffusion Bridge Models | Unknown | N/A | |
| Unveil Benign Overfitting for Transformer in Vision: Training Dynamics, Convergence, and Generalization | Unknown | N/A | |
| ProxyFusion: Face Feature Aggregation Through Sparse Experts | Unknown | N/A | |
| Language Generation in the Limit | Unknown | N/A | |
| GuardT2I: Defending Text-to-Image Models from Adversarial Prompts | Unknown | N/A | |
| Pruning neural network models for gene regulatory dynamics using data and domain knowledge | Unknown | N/A | |
| Space-Time Continuous PDE Forecasting using Equivariant Neural Fields | Unknown | N/A | |
| Policy Improvement using Language Feedback Models | Unknown | N/A | |
| The Challenges of the Nonlinear Regime for Physics-Informed Neural Networks | Unknown | N/A | |
| Edit Distance Robust Watermarks via Indexing Pseudorandom Codes | Unknown | N/A | |
| Artemis: Towards Referential Understanding in Complex Videos | Unknown | N/A | |
| When LLM Meets DRL: Advancing Jailbreaking Efficiency via DRL-guided Search | Unknown | N/A | |
| TabEBM: A Tabular Data Augmentation Method with Distinct Class-Specific Energy-Based Models | Unknown | N/A | |
| Federated Black-Box Adaptation for Semantic Segmentation | Unknown | N/A | |
| Structured Learning of Compositional Sequential Interventions | Unknown | N/A | |
| The Power of Extrapolation in Federated Learning | Unknown | N/A | |
| Scalable Neural Network Verification with Branch-and-bound Inferred Cutting Planes | Unknown | N/A | |
| Learning Plaintext-Ciphertext Cryptographic Problems via ANF-based SAT Instance Representation | Unknown | N/A | |
| Linguistic Collapse: Neural Collapse in (Large) Language Models | Unknown | N/A | |
| Continuous Temporal Domain Generalization | Unknown | N/A | |
| Hybrid Generative AI for De Novo Design of Co-Crystals with Enhanced Tabletability | Unknown | N/A | |
| Relational Concept Bottleneck Models | Unknown | N/A | |
| Byzantine Robustness and Partial Participation Can Be Achieved at Once: Just Clip Gradient Differences | Unknown | N/A | |
| Construction and Application of Materials Knowledge Graph in Multidisciplinary Materials Science via Large Language Model | Unknown | N/A | |
| Robust Conformal Prediction Using Privileged Information | Unknown | N/A | |
| Differentiable Quantum Computing for Large-scale Linear Control | Unknown | N/A | |
| Beyond Efficiency: Molecular Data Pruning for Enhanced Generalization | Unknown | N/A | |
| Language Models as Hierarchy Encoders | Unknown | N/A | |
| On the Convergence of Loss and Uncertainty-based Active Learning Algorithms | Unknown | N/A | |
| Exponential Quantum Communication Advantage in Distributed Inference and Learning | Unknown | N/A | |
| BehaviorGPT: Smart Agent Simulation for Autonomous Driving with Next-Patch Prediction | Unknown | N/A | |
| Bridging Gaps: Federated Multi-View Clustering in Heterogeneous Hybrid Views | Unknown | N/A | |
| Sample Selection via Contrastive Fragmentation for Noisy Label Regression | Unknown | N/A | |
| Navigating the Safety Landscape: Measuring Risks in Finetuning Large Language Models | Unknown | N/A | |
| First-Order Minimax Bilevel Optimization | Unknown | N/A | |
| Latent Learning Progress Drives Autonomous Goal Selection in Human Reinforcement Learning | Unknown | N/A | |
| Neuronal Competition Groups with Supervised STDP for Spike-Based Classification | Unknown | N/A | |
| Continuous Heatmap Regression for Pose Estimation via Implicit Neural Representation | Unknown | N/A | |
| A distributional simplicity bias in the learning dynamics of transformers | Unknown | N/A | |
| Improving Decision Sparsity | Unknown | N/A | |
| HOI-Swap: Swapping Objects in Videos with Hand-Object Interaction Awareness | Unknown | N/A | |
| PiSSA: Principal Singular Values and Singular Vectors Adaptation of Large Language Models | Unknown | N/A | |
| Active Set Ordering | Unknown | N/A | |
| Adaptive Preference Scaling for Reinforcement Learning with Human Feedback | Unknown | N/A | |
| Sparse Bayesian Generative Modeling for Compressive Sensing | Unknown | N/A | |
| Dual Critic Reinforcement Learning under Partial Observability | Unknown | N/A | |
| Achievable Fairness on Your Data With Utility Guarantees | Unknown | N/A | |
| Skill-aware Mutual Information Optimisation for Zero-shot Generalisation in Reinforcement Learning | Unknown | N/A | |
| Boosting Generalization in Parametric PDE Neural Solvers through Adaptive Conditioning | Unknown | N/A | |
| Value-Based Deep Multi-Agent Reinforcement Learning with Dynamic Sparse Training | Unknown | N/A | |
| PhyloGen: Language Model-Enhanced Phylogenetic Inference via Graph Structure Generation | Unknown | N/A | |
| Preference-based Pure Exploration | Unknown | N/A | |
| Propensity Score Alignment of Unpaired Multimodal Data | Unknown | N/A | |
| Retrieval-Retro: Retrieval-based Inorganic Retrosynthesis with Expert Knowledge | Unknown | N/A | |
| ART: Automatic Red-teaming for Text-to-Image Models to Protect Benign Users | Unknown | N/A | |
| Self-Retrieval: End-to-End Information Retrieval with One Large Language Model | Unknown | N/A | |
| Physics-informed Neural Networks for Functional Differential Equations: Cylindrical Approximation and Its Convergence Guarantees | Unknown | N/A | |
| OmniTokenizer: A Joint Image-Video Tokenizer for Visual Generation | Unknown | N/A | |
| Learning the Infinitesimal Generator of Stochastic Diffusion Processes | Unknown | N/A | |
| Achieving Linear Convergence with Parameter-Free Algorithms in Decentralized Optimization | Unknown | N/A | |
| ESPACE: Dimensionality Reduction of Activations for Model Compression | Unknown | N/A | |
| Animate3D: Animating Any 3D Model with Multi-view Video Diffusion | Unknown | N/A | |
| Enhancing Robustness in Deep Reinforcement Learning: A Lyapunov Exponent Approach | Unknown | N/A | |
| Learning Representations for Hierarchies with Minimal Support | Unknown | N/A | |
| Learning Image Priors Through Patch-Based Diffusion Models for Solving Inverse Problems | Unknown | N/A | |
| Generative Semi-supervised Graph Anomaly Detection | Unknown | N/A | |
| Approximating mutual information of high-dimensional variables using learned representations | Unknown | N/A | |
| Online Feature Updates Improve Online (Generalized) Label Shift Adaptation | Unknown | N/A | |
| A Consistency-Aware Spot-Guided Transformer for Versatile and Hierarchical Point Cloud Registration | Unknown | N/A | |
| Finding good policies in average-reward Markov Decision Processes without prior knowledge | Unknown | N/A | |
| Is Score Matching Suitable for Estimating Point Processes? | Unknown | N/A | |
| Initializing Services in Interactive ML Systems for Diverse Users | Unknown | N/A | |
| LLM Processes: Numerical Predictive Distributions Conditioned on Natural Language | Unknown | N/A | |
| Local and Adaptive Mirror Descents in Extensive-Form Games | Unknown | N/A | |
| Strategic Multi-Armed Bandit Problems Under Debt-Free Reporting | Unknown | N/A | |
| Safe LoRA: The Silver Lining of Reducing Safety Risks when Finetuning Large Language Models | Unknown | N/A | |
| Hierarchical Object-Aware Dual-Level Contrastive Learning for Domain Generalized Stereo Matching | Unknown | N/A | |
| Decomposed Prompt Decision Transformer for Efficient Unseen Task Generalization | Unknown | N/A | |
| SCAFFLSA: Taming Heterogeneity in Federated Linear Stochastic Approximation and TD Learning | Unknown | N/A | |
| Replay-and-Forget-Free Graph Class-Incremental Learning: A Task Profiling and Prompting Approach | Unknown | N/A | |
| Transferring disentangled representations: bridging the gap between synthetic and real images | Unknown | N/A | |
| Orchid: Flexible and Data-Dependent Convolution for Sequence Modeling | Unknown | N/A | |
| Heterogeneity-Guided Client Sampling: Towards Fast and Efficient Non-IID Federated Learning | Unknown | N/A | |
| Neural Embeddings Rank: Aligning 3D latent dynamics with movements | Unknown | N/A | |
| Periodic agent-state based Q-learning for POMDPs | Unknown | N/A | |
| Star-Agents: Automatic Data Optimization with LLM Agents for Instruction Tuning | Unknown | N/A | |
| Mixture of Adversarial LoRAs: Boosting Robust Generalization in Meta-Tuning | Unknown | N/A | |
| FedLPA: One-shot Federated Learning with Layer-Wise Posterior Aggregation | Unknown | N/A | |
| Real-world Image Dehazing with Coherence-based Pseudo Labeling and Cooperative Unfolding Network | Unknown | N/A | |
| Proportional Fairness in Clustering: A Social Choice Perspective | Unknown | N/A | |
| FedSSP: Federated Graph Learning with Spectral Knowledge and Personalized Preference | Unknown | N/A | |
| Full-Atom Peptide Design with Geometric Latent Diffusion | Unknown | N/A | |
| Learning via Surrogate PAC-Bayes | Unknown | N/A | |
| OpenDlign: Open-World Point Cloud Understanding with Depth-Aligned Images | Unknown | N/A | |
| Consistency Models for Scalable and Fast Simulation-Based Inference | Unknown | N/A | |
| Doubly Hierarchical Geometric Representations for Strand-based Human Hairstyle Generation | Unknown | N/A | |
| Piecewise deterministic generative models | Unknown | N/A | |
| Inflationary Flows: Calibrated Bayesian Inference with Diffusion-Based Models | Unknown | N/A | |
| A Comprehensive Analysis on the Learning Curve in Kernel Ridge Regression | Unknown | N/A | |
| Target-Guided Adversarial Point Cloud Transformer Towards Recognition Against Real-world Corruptions | Unknown | N/A | |
| Putting Gale & Shapley to Work: Guaranteeing Stability Through Learning | Unknown | N/A | |
| On the Optimal Time Complexities in Decentralized Stochastic Asynchronous Optimization | Unknown | N/A | |
| LM-HT SNN: Enhancing the Performance of SNN to ANN Counterpart through Learnable Multi-hierarchical Threshold Model | Unknown | N/A | |
| Dynamic Rescaling for Training GNNs | Unknown | N/A | |
| DASH: Warm-Starting Neural Network Training in Stationary Settings without Loss of Plasticity | Unknown | N/A | |
| Addressing Spectral Bias of Deep Neural Networks by Multi-Grade Deep Learning | Unknown | N/A | |
| Smoke and Mirrors in Causal Downstream Tasks | Unknown | N/A | |
| Randomized Sparse Matrix Compression for Large-Scale Constrained Optimization in Cancer Radiotherapy | Unknown | N/A | |
| Stable-Pose: Leveraging Transformers for Pose-Guided Text-to-Image Generation | Unknown | N/A | |
| NeuralSolver: Learning Algorithms For Consistent and Efficient Extrapolation Across General Tasks | Unknown | N/A | |
| SSDM: Scalable Speech Dysfluency Modeling | Unknown | N/A | |
| State Chrono Representation for Enhancing Generalization in Reinforcement Learning | Unknown | N/A | |
| WaveAttack: Asymmetric Frequency Obfuscation-based Backdoor Attacks Against Deep Neural Networks | Unknown | N/A | |
| SpecExec: Massively Parallel Speculative Decoding For Interactive LLM Inference on Consumer Devices | Unknown | N/A | |
| Non-convolutional graph neural networks | Unknown | N/A | |
| Architect: Generating Vivid and Interactive 3D Scenes with Hierarchical 2D Inpainting | Unknown | N/A | |
| PowerPM: Foundation Model for Power Systems | Unknown | N/A | |
| Image Reconstruction Via Autoencoding Sequential Deep Image Prior | Unknown | N/A | |
| Hamiltonian Score Matching and Generative Flows | Unknown | N/A | |
| Unpacking DPO and PPO: Disentangling Best Practices for Learning from Preference Feedback | Unknown | N/A | |
| Visual Perception by Large Language Model’s Weights | Unknown | N/A | |
| Energy-Guided Continuous Entropic Barycenter Estimation for General Costs | Unknown | N/A | |
| Shadowcast: Stealthy Data Poisoning Attacks Against Vision-Language Models | Unknown | N/A | |
| Unravelling in Collaborative Learning | Unknown | N/A | |
| FACT or Fiction: Can Truthful Mechanisms Eliminate Federated Free Riding? | Unknown | N/A | |
| Are More LLM Calls All You Need? Towards the Scaling Properties of Compound AI Systems | Unknown | N/A | |
| LiveScene: Language Embedding Interactive Radiance Fields for Physical Scene Control and Rendering | Unknown | N/A | |
| What Makes Partial-Label Learning Algorithms Effective? | Unknown | N/A | |
| Online Control with Adversarial Disturbance for Continuous-time Linear Systems | Unknown | N/A | |
| Semantic Routing via Autoregressive Modeling | Unknown | N/A | |
| Learning to Understand: Identifying Interactions via the Möbius Transform | Unknown | N/A | |
| Collaborative Cognitive Diagnosis with Disentangled Representation Learning for Learner Modeling | Unknown | N/A | |
| REDUCR: Robust Data Downsampling using Class Priority Reweighting | Unknown | N/A | |
| ProvNeRF: Modeling per Point Provenance in NeRFs as a Stochastic Field | Unknown | N/A | |
| SpaceByte: Towards Deleting Tokenization from Large Language Modeling | Unknown | N/A | |
| Oracle-Efficient Reinforcement Learning for Max Value Ensembles | Unknown | N/A | |
| RectifID: Personalizing Rectified Flow with Anchored Classifier Guidance | Unknown | N/A | |
| KOALA: Empirical Lessons Toward Memory-Efficient and Fast Diffusion Models for Text-to-Image Synthesis | Unknown | N/A | |
| Superposed Decoding: Multiple Generations from a Single Autoregressive Inference Pass | Unknown | N/A | |
| Accelerating Blockwise Parallel Language Models with Draft Refinement | Unknown | N/A | |
| Banded Square Root Matrix Factorization for Differentially Private Model Training | Unknown | N/A | |
| Pre-trained Text-to-Image Diffusion Models Are Versatile Representation Learners for Control | Unknown | N/A | |
| DiffuPac: Contextual Mimicry in Adversarial Packets Generation via Diffusion Model | Unknown | N/A | |
| Beyond Concept Bottleneck Models: How to Make Black Boxes Intervenable? | Unknown | N/A | |
| ALI-Agent: Assessing LLMs' Alignment with Human Values via Agent-based Evaluation | Unknown | N/A | |
| On Neural Networks as Infinite Tree-Structured Probabilistic Graphical Models | Unknown | N/A | |
| Information-theoretic Limits of Online Classification with Noisy Labels | Unknown | N/A | |
| Scaling Continuous Latent Variable Models as Probabilistic Integral Circuits | Unknown | N/A | |
| Fast Sampling via Discrete Non-Markov Diffusion Models with Predetermined Transition Time | Unknown | N/A | |
| Privacy Backdoors: Enhancing Membership Inference through Poisoning Pre-trained Models | Unknown | N/A | |
| Dual-Perspective Activation: Efficient Channel Denoising via Joint Forward-Backward Criterion for Artificial Neural Networks | Unknown | N/A | |
| Noise Contrastive Alignment of Language Models with Explicit Rewards | Unknown | N/A | |
| Why the Metric Backbone Preserves Community Structure | Unknown | N/A | |
| Bayesian Optimization of Functions over Node Subsets in Graphs | Unknown | N/A | |
| The Minimax Rate of HSIC Estimation for Translation-Invariant Kernels | Unknown | N/A | |
| Injecting Undetectable Backdoors in Obfuscated Neural Networks and Language Models | Unknown | N/A | |
| CondTSF: One-line Plugin of Dataset Condensation for Time Series Forecasting | Unknown | N/A | |
| ACES: Generating a Diversity of Challenging Programming Puzzles with Autotelic Generative Models | Unknown | N/A | |
| Adversarial Schrödinger Bridge Matching | Unknown | N/A | |
| Counter-Current Learning: A Biologically Plausible Dual Network Approach for Deep Learning | Unknown | N/A | |
| Proximal Causal Inference With Text Data | Unknown | N/A | |
| Transformers Learn to Achieve Second-Order Convergence Rates for In-Context Linear Regression | Unknown | N/A | |
| LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-Tuning | Unknown | N/A | |
| Partial observation can induce mechanistic mismatches in data-constrained models of neural dynamics | Unknown | N/A | |
| A Best-of-both-worlds Algorithm for Bandits with Delayed Feedback with Robustness to Excessive Delays | Unknown | N/A | |
| Robot Policy Learning with Temporal Optimal Transport Reward | Unknown | N/A | |
| Non-geodesically-convex optimization in the Wasserstein space | Unknown | N/A | |
| Where's Waldo: Diffusion Features For Personalized Segmentation and Retrieval | Unknown | N/A | |
| On provable privacy vulnerabilities of graph representations | Unknown | N/A | |
| Seeing Beyond the Crop: Using Language Priors for Out-of-Bounding Box Keypoint Prediction | Unknown | N/A | |
| Association Pattern-aware Fusion for Biological Entity Relationship Prediction | Unknown | N/A | |
| Towards Principled Graph Transformers | Unknown | N/A | |
| Double-Ended Synthesis Planning with Goal-Constrained Bidirectional Search | Unknown | N/A | |
| Cross-Modality Perturbation Synergy Attack for Person Re-identification | Unknown | N/A | |
| MAC Advice for facility location mechanism design | Unknown | N/A | |
| How Molecules Impact Cells: Unlocking Contrastive PhenoMolecular Retrieval | Unknown | N/A | |
| Neural decoding from stereotactic EEG: accounting for electrode variability across subjects | Unknown | N/A | |
| Towards Effective Planning Strategies for Dynamic Opinion Networks | Unknown | N/A | |
| A Kernel Perspective on Distillation-based Collaborative Learning | Unknown | N/A | |
| Beyond Accuracy: Tracking more like Human via Visual Search | Unknown | N/A | |
| Rule Extrapolation in Language Modeling: A Study of Compositional Generalization on OOD Prompts | Unknown | N/A | |
| Perception of Knowledge Boundary for Large Language Models through Semi-open-ended Question Answering | Unknown | N/A | |
| ControlMLLM: Training-Free Visual Prompt Learning for Multimodal Large Language Models | Unknown | N/A | |
| MaskLLM: Learnable Semi-Structured Sparsity for Large Language Models | Unknown | N/A | |
| eXponential FAmily Dynamical Systems (XFADS): Large-scale nonlinear Gaussian state-space modeling | Unknown | N/A | |
| Tolerant Algorithms for Learning with Arbitrary Covariate Shift | Unknown | N/A | |
| Spherical Frustum Sparse Convolution Network for LiDAR Point Cloud Semantic Segmentation | Unknown | N/A | |
| Test-Time Adaptation Induces Stronger Accuracy and Agreement-on-the-Line | Unknown | N/A | |
| Two-way Deconfounder for Off-policy Evaluation in Causal Reinforcement Learning | Unknown | N/A | |
| Chain of Agents: Large Language Models Collaborating on Long-Context Tasks | Unknown | N/A | |
| Are Your Models Still Fair? Fairness Attacks on Graph Neural Networks via Node Injections | Unknown | N/A | |
| Learning 1D Causal Visual Representation with De-focus Attention Networks | Unknown | N/A | |
| The Surprising Ineffectiveness of Pre-Trained Visual Representations for Model-Based Reinforcement Learning | Unknown | N/A | |
| Paths to Equilibrium in Games | Unknown | N/A | |
| PEAC: Unsupervised Pre-training for Cross-Embodiment Reinforcement Learning | Unknown | N/A | |
| Provable Partially Observable Reinforcement Learning with Privileged Information | Unknown | N/A | |
| Are High-Degree Representations Really Unnecessary in Equivariant Graph Neural Networks? | Unknown | N/A | |
| Optimal Classification under Performative Distribution Shift | Unknown | N/A | |
| FedAvP: Augment Local Data via Shared Policy in Federated Learning | Unknown | N/A | |
| Identifiability Guarantees for Causal Disentanglement from Purely Observational Data | Unknown | N/A | |
| Reducing Transformer Key-Value Cache Size with Cross-Layer Attention | Unknown | N/A | |
| Structured Multi-Track Accompaniment Arrangement via Style Prior Modelling | Unknown | N/A | |
| Building a stable classifier with the inflated argmax | Unknown | N/A | |
| A Unified Confidence Sequence for Generalized Linear Models, with Applications to Bandits | Unknown | N/A | |
| A Single-Step, Sharpness-Aware Minimization is All You Need to Achieve Efficient and Accurate Sparse Training | Unknown | N/A | |
| How to Boost Any Loss Function | Unknown | N/A | |
| Advancing Fine-Grained Classification by Structure and Subject Preserving Augmentation | Unknown | N/A | |
| Generalizing Consistency Policy to Visual RL with Prioritized Proximal Experience Regularization | Unknown | N/A | |
| DePLM: Denoising Protein Language Models for Property Optimization | Unknown | N/A | |
| Reinforcement Learning with Adaptive Regularization for Safe Control of Critical Systems | Unknown | N/A | |
| Analysis of Corrected Graph Convolutions | Unknown | N/A | |
| Marrying Causal Representation Learning with Dynamical Systems for Science | Unknown | N/A | |
| ReNO: Enhancing One-step Text-to-Image Models through Reward-based Noise Optimization | Unknown | N/A | |
| Active Classification with Few Queries under Misspecification | Unknown | N/A | |
| BLoB: Bayesian Low-Rank Adaptation by Backpropagation for Large Language Models | Unknown | N/A | |
| Online Composite Optimization Between Stochastic and Adversarial Environments | Unknown | N/A | |
| Theoretical Investigations and Practical Enhancements on Tail Task Risk Minimization in Meta Learning | Unknown | N/A | |
| Differential Privacy in Scalable General Kernel Learning via $K$-means Nyström Random Features | Unknown | N/A | |
| Robust group and simultaneous inferences for high-dimensional single index model | Unknown | N/A | |
| On the Worst Prompt Performance of Large Language Models | Unknown | N/A | |
| Implicit Regularization of Decentralized Gradient Descent for Sparse Regression | Unknown | N/A | |
| Scalable Kernel Inverse Optimization | Unknown | N/A | |
| CLIPCEIL: Domain Generalization through CLIP via Channel rEfinement and Image-text aLignment | Unknown | N/A | |
| Beyond Slow Signs in High-fidelity Model Extraction | Unknown | N/A | |
| A Study of Plasticity Loss in On-Policy Deep Reinforcement Learning | Unknown | N/A | |
| Data-Efficient Operator Learning via Unsupervised Pretraining and In-Context Learning | Unknown | N/A | |
| Quantifying the Gain in Weak-to-Strong Generalization | Unknown | N/A | |
| Mirror and Preconditioned Gradient Descent in Wasserstein Space | Unknown | N/A | |
| Geometry of naturalistic object representations in recurrent neural network models of working memory | Unknown | N/A | |
| AvaTaR: Optimizing LLM Agents for Tool Usage via Contrastive Reasoning | Unknown | N/A | |
| Natural Counterfactuals With Necessary Backtracking | Unknown | N/A | |
| Scaling laws for learning with real and surrogate data | Unknown | N/A | |
| The Power of Hard Attention Transformers on Data Sequences: A formal language theoretic perspective | Unknown | N/A | |
| DiPEx: Dispersing Prompt Expansion for Class-Agnostic Object Detection | Unknown | N/A | |
| Neural Residual Diffusion Models for Deep Scalable Vision Generation | Unknown | N/A | |
| Enhancing LLM Reasoning via Vision-Augmented Prompting | Unknown | N/A | |
| Discrete Dictionary-based Decomposition Layer for Structured Representation Learning | Unknown | N/A | |
| Would I Lie To You? Inference Time Alignment of Language Models using Direct Preference Heads | Unknown | N/A | |
| Parameter-Inverted Image Pyramid Networks | Unknown | N/A | |
| SaulLM-54B & SaulLM-141B: Scaling Up Domain Adaptation for the Legal Domain | Unknown | N/A | |
| Understanding the Role of Equivariance in Self-supervised Learning | Unknown | N/A | |
| Decompose, Analyze and Rethink: Solving Intricate Problems with Human-like Reasoning Cycle | Unknown | N/A | |
| Robustly overfitting latents for flexible neural image compression | Unknown | N/A | |
| Harmonizing Stochasticity and Determinism: Scene-responsive Diverse Human Motion Prediction | Unknown | N/A | |
| DeTrack: In-model Latent Denoising Learning for Visual Object Tracking | Unknown | N/A | |
| Dual-Diffusion for Binocular 3D Human Pose Estimation | Unknown | N/A | |
| Meta-Diffu$B$: A Contextualized Sequence-to-Sequence Text Diffusion Model with Meta-Exploration | Unknown | N/A | |
| Why Warmup the Learning Rate? Underlying Mechanisms and Improvements | Unknown | N/A | |
| Watch Out for Your Agents! Investigating Backdoor Threats to LLM-Based Agents | Unknown | N/A | |
| MeLLoC: Lossless Compression with High-order Mechanism Learning | Unknown | N/A | |
| AID: Attention Interpolation of Text-to-Image Diffusion | Unknown | N/A | |
| Taming Generative Diffusion Prior for Universal Blind Image Restoration | Unknown | N/A | |
| Matrix Denoising with Doubly Heteroscedastic Noise: Fundamental Limits and Optimal Spectral Methods | Unknown | N/A | |
| Single-Loop Stochastic Algorithms for Difference of Max-Structured Weakly Convex Functions | Unknown | N/A | |
| Deep Learning Through A Telescoping Lens: A Simple Model Provides Empirical Insights On Grokking, Gradient Boosting & Beyond | Unknown | N/A | |
| Paralinguistics-Aware Speech-Empowered Large Language Models for Natural Conversation | Unknown | N/A | |
| BECAUSE: Bilinear Causal Representation for Generalizable Offline Model-based Reinforcement Learning | Unknown | N/A | |
| When does perceptual alignment benefit vision representations? | Unknown | N/A | |
| Safe and Sparse Newton Method for Entropic-Regularized Optimal Transport | Unknown | N/A | |
| Adaptive Experimentation When You Can't Experiment | Unknown | N/A | |
| Seeing the Image: Prioritizing Visual Correlation by Contrastive Alignment | Unknown | N/A | |
| Reinforcement Learning with Euclidean Data Augmentation for State-Based Continuous Control | Unknown | N/A | |
| UrbanKGent: A Unified Large Language Model Agent Framework for Urban Knowledge Graph Construction | Unknown | N/A | |
| How Diffusion Models Learn to Factorize and Compose | Unknown | N/A | |
| On scalable oversight with weak LLMs judging strong LLMs | Unknown | N/A | |
| On the Comparison between Multi-modal and Single-modal Contrastive Learning | Unknown | N/A | |
| Pseudo-Siamese Blind-spot Transformers for Self-Supervised Real-World Denoising | Unknown | N/A | |
| Achieving Near-Optimal Convergence for Distributed Minimax Optimization with Adaptive Stepsizes | Unknown | N/A | |
| Abrupt Learning in Transformers: A Case Study on Matrix Completion | Unknown | N/A | |
| Honor Among Bandits: No-Regret Learning for Online Fair Division | Unknown | N/A | |
| M$^3$GPT: An Advanced Multimodal, Multitask Framework for Motion Comprehension and Generation | Unknown | N/A | |
| Scalable Bayesian Optimization via Focalized Sparse Gaussian Processes | Unknown | N/A | |
| Stochastic Extragradient with Flip-Flop Shuffling & Anchoring: Provable Improvements | Unknown | N/A | |
| DynaMITE-RL: A Dynamic Model for Improved Temporal Meta-Reinforcement Learning | Unknown | N/A | |
| Saliency-driven Experience Replay for Continual Learning | Unknown | N/A | |
| Ordering-Based Causal Discovery for Linear and Nonlinear Relations | Unknown | N/A | |
| CLUES: Collaborative Private-domain High-quality Data Selection for LLMs via Training Dynamics | Unknown | N/A | |
| Perplexity-aware Correction for Robust Alignment with Noisy Preferences | Unknown | N/A | |
| HLM-Cite: Hybrid Language Model Workflow for Text-based Scientific Citation Prediction | Unknown | N/A | |
| Can large language models explore in-context? | Unknown | N/A | |
| Geometric Trajectory Diffusion Models | Unknown | N/A | |
| Realizable $H$-Consistent and Bayes-Consistent Loss Functions for Learning to Defer | Unknown | N/A | |
| Mitigating Spurious Correlations via Disagreement Probability | Unknown | N/A | |
| Zero-Shot Transfer of Neural ODEs | Unknown | N/A | |
| Mean-Field Langevin Dynamics for Signed Measures via a Bilevel Approach | Unknown | N/A | |
| The GAN is dead; long live the GAN! A Modern GAN Baseline | Unknown | N/A | |
| Improved Guarantees for Fully Dynamic $k$-Center Clustering with Outliers in General Metric Spaces | Unknown | N/A | |
| HGDL: Heterogeneous Graph Label Distribution Learning | Unknown | N/A | |
| Improved Sample Complexity Bounds for Diffusion Model Training | Unknown | N/A | |
| When is Multicalibration Post-Processing Necessary? | Unknown | N/A | |
| DenoiseRep: Denoising Model for Representation Learning | Unknown | N/A | |
| QueST: Self-Supervised Skill Abstractions for Learning Continuous Control | Unknown | N/A | |
| GACL: Exemplar-Free Generalized Analytic Continual Learning | Unknown | N/A | |
| Membership Inference Attacks against Fine-tuned Large Language Models via Self-prompt Calibration | Unknown | N/A | |
| Linear Causal Bandits: Unknown Graph and Soft Interventions | Unknown | N/A | |
| Scaling Laws in Linear Regression: Compute, Parameters, and Data | Unknown | N/A | |
| Gene-Gene Relationship Modeling Based on Genetic Evidence for Single-Cell RNA-Seq Data Imputation | Unknown | N/A | |
| Relationship Prompt Learning is Enough for Open-Vocabulary Semantic Segmentation | Unknown | N/A | |
| Accelerating Transformers with Spectrum-Preserving Token Merging | Unknown | N/A | |
| Recursive PAC-Bayes: A Frequentist Approach to Sequential Prior Updates with No Information Loss | Unknown | N/A | |
| Group Robust Preference Optimization in Reward-free RLHF | Unknown | N/A | |
| No Train, all Gain: Self-Supervised Gradients Improve Deep Frozen Representations | Unknown | N/A | |
| Reinforcement Learning Guided Semi-Supervised Learning | Unknown | N/A | |
| Infusing Self-Consistency into Density Functional Theory Hamiltonian Prediction via Deep Equilibrium Models | Unknown | N/A | |
| Mutual Information Estimation via $f$-Divergence and Data Derangements | Unknown | N/A | |
| Hierarchical Visual Feature Aggregation for OCR-Free Document Understanding | Unknown | N/A | |
| Continual Audio-Visual Sound Separation | Unknown | N/A | |
| Continuous Contrastive Learning for Long-Tailed Semi-Supervised Recognition | Unknown | N/A | |
| Who’s Gaming the System? A Causally-Motivated Approach for Detecting Strategic Adaptation | Unknown | N/A | |
| Elo Uncovered: Robustness and Best Practices in Language Model Evaluation | Unknown | N/A | |
| Scaling Proprioceptive-Visual Learning with Heterogeneous Pre-trained Transformers | Unknown | N/A | |
| DapperFL: Domain Adaptive Federated Learning with Model Fusion Pruning for Edge Devices | Unknown | N/A | |
| ENAT: Rethinking Spatial-temporal Interactions in Token-based Image Synthesis | Unknown | N/A | |
| LoQT: Low-Rank Adapters for Quantized Pretraining | Unknown | N/A | |
| Object segmentation from common fate: Motion energy processing enables human-like zero-shot generalization to random dot stimuli | Unknown | N/A | |
| Inference via Interpolation: Contrastive Representations Provably Enable Planning and Inference | Unknown | N/A | |
| Gaussian Process Bandits for Top-k Recommendations | Unknown | N/A | |
| Second-order forward-mode optimization of recurrent neural networks for neuroscience | Unknown | N/A | |
| LoD-Loc: Aerial Visual Localization using LoD 3D Map with Neural Wireframe Alignment | Unknown | N/A | |
| Fundamental Convergence Analysis of Sharpness-Aware Minimization | Unknown | N/A | |
| MADiff: Offline Multi-agent Learning with Diffusion Models | Unknown | N/A | |
| Controlling Counterfactual Harm in Decision Support Systems Based on Prediction Sets | Unknown | N/A | |
| On the Complexity of Learning Sparse Functions with Statistical and Gradient Queries | Unknown | N/A | |
| An End-To-End Graph Attention Network Hashing for Cross-Modal Retrieval | Unknown | N/A | |
| Near-Optimality of Contrastive Divergence Algorithms | Unknown | N/A | |
| Learning an Actionable Discrete Diffusion Policy via Large-Scale Actionless Video Pre-Training | Unknown | N/A | |
| FashionR2R: Texture-preserving Rendered-to-Real Image Translation with Diffusion Models | Unknown | N/A | |
| An Offline Adaptation Framework for Constrained Multi-Objective Reinforcement Learning | Unknown | N/A | |
| Iteration Head: A Mechanistic Study of Chain-of-Thought | Unknown | N/A | |
| Leveraging partial stragglers within gradient coding | Unknown | N/A | |
| Learning rigid-body simulators over implicit shapes for large-scale scenes and vision | Unknown | N/A | |
| Learning Optimal Tax Design in Nonatomic Congestion Games | Unknown | N/A | |
| Understanding the Expressivity and Trainability of Fourier Neural Operator: A Mean-Field Perspective | Unknown | N/A | |
| DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution | Unknown | N/A | |
| SSDiff: Spatial-spectral Integrated Diffusion Model for Remote Sensing Pansharpening | Unknown | N/A | |
| SelectIT: Selective Instruction Tuning for LLMs via Uncertainty-Aware Self-Reflection | Unknown | N/A | |
| Imitating Language via Scalable Inverse Reinforcement Learning | Unknown | N/A | |
| Rule Based Rewards for Language Model Safety | Unknown | N/A | |
| A Gradient Accumulation Method for Dense Retriever under Memory Constraint | Unknown | N/A | |
| DuQuant: Distributing Outliers via Dual Transformation Makes Stronger Quantized LLMs | Unknown | N/A | |
| Safe Exploitative Play with Untrusted Type Beliefs | Unknown | N/A | |
| DHA: Learning Decoupled-Head Attention from Transformer Checkpoints via Adaptive Heads Fusion | Unknown | N/A | |
| LIVE: Learnable In-Context Vector for Visual Question Answering | Unknown | N/A | |
| Wasserstein convergence of Čech persistence diagrams for samplings of submanifolds | Unknown | N/A | |
| Toward Dynamic Non-Line-of-Sight Imaging with Mamba Enforced Temporal Consistency | Unknown | N/A | |
| Discovering Sparsity Allocation for Layer-wise Pruning of Large Language Models | Unknown | N/A | |
| Multi-Agent Imitation Learning: Value is Easy, Regret is Hard | Unknown | N/A | |
| Rethinking Exploration in Reinforcement Learning with Effective Metric-Based Exploration Bonus | Unknown | N/A | |
| Cloud Object Detector Adaptation by Integrating Different Source Knowledge | Unknown | N/A | |
| PhyRecon: Physically Plausible Neural Scene Reconstruction | Unknown | N/A | |
| GraphMETRO: Mitigating Complex Graph Distribution Shifts via Mixture of Aligned Experts | Unknown | N/A | |
| Suppress Content Shift: Better Diffusion Features via Off-the-Shelf Generation Techniques | Unknown | N/A | |
| RealCompo: Balancing Realism and Compositionality Improves Text-to-Image Diffusion Models | Unknown | N/A | |
| Understanding Model Selection for Learning in Strategic Environments | Unknown | N/A | |
| Parallelizing Model-based Reinforcement Learning Over the Sequence Length | Unknown | N/A | |
| Bayes-optimal learning of an extensive-width neural network from quadratically many samples | Unknown | N/A | |
| Qualitative Mechanism Independence | Unknown | N/A | |
| Your Diffusion Model is Secretly a Noise Classifier and Benefits from Contrastive Training | Unknown | N/A | |
| Trading Place for Space: Increasing Location Resolution Reduces Contextual Capacity in Hippocampal Codes | Unknown | N/A | |
| What Factors Affect Multi-Modal In-Context Learning? An In-Depth Exploration | Unknown | N/A | |
| FastSurvival: Hidden Computational Blessings in Training Cox Proportional Hazards Models | Unknown | N/A | |
| Gradients of Functions of Large Matrices | Unknown | N/A | |
| Coarse-to-Fine Concept Bottleneck Models | Unknown | N/A | |
| Image Understanding Makes for A Good Tokenizer for Image Generation | Unknown | N/A | |
| Lisa: Lazy Safety Alignment for Large Language Models against Harmful Fine-tuning Attack | Unknown | N/A | |
| Collaborative Refining for Learning from Inaccurate Labels | Unknown | N/A | |
| Bridging the Divide: Reconsidering Softmax and Linear Attention | Unknown | N/A | |
| DDR: Exploiting Deep Degradation Response as Flexible Image Descriptor | Unknown | N/A | |
| HuRef: HUman-REadable Fingerprint for Large Language Models | Unknown | N/A | |
| Sample Complexity Reduction via Policy Difference Estimation in Tabular Reinforcement Learning | Unknown | N/A | |
| Feature-Level Adversarial Attacks and Ranking Disruption for Visible-Infrared Person Re-identification | Unknown | N/A | |
| Iteratively Refined Behavior Regularization for Offline Reinforcement Learning | Unknown | N/A | |
| Scaling the Codebook Size of VQ-GAN to 100,000 with a Utilization Rate of 99% | Unknown | N/A | |
| Mixture of In-Context Experts Enhance LLMs' Long Context Awareness | Unknown | N/A | |
| Incremental Learning of Retrievable Skills For Efficient Continual Task Adaptation | Unknown | N/A | |
| EEG2Video: Towards Decoding Dynamic Visual Perception from EEG Signals | Unknown | N/A | |
| LACIE: Listener-Aware Finetuning for Calibration in Large Language Models | Unknown | N/A | |
| QVAE-Mole: The Quantum VAE with Spherical Latent Variable Learning for 3-D Molecule Generation | Unknown | N/A | |
| Exploring Adversarial Robustness of Deep State Space Models | Unknown | N/A | |
| Learning a Single Neuron Robustly to Distributional Shifts and Adversarial Label Noise | Unknown | N/A | |
| ROBIN: Robust and Invisible Watermarks for Diffusion Models with Adversarial Optimization | Unknown | N/A | |
| Zero-Shot Tokenizer Transfer | Unknown | N/A | |
| Intrinsic Robustness of Prophet Inequality to Strategic Reward Signaling | Unknown | N/A | |
| Statistical and Geometrical properties of the Kernel Kullback-Leibler divergence | Unknown | N/A | |
| Visual Decoding and Reconstruction via EEG Embeddings with Guided Diffusion | Unknown | N/A | |
| Identification and Estimation of the Bi-Directional MR with Some Invalid Instruments | Unknown | N/A | |
| Spatio-Temporal Interactive Learning for Efficient Image Reconstruction of Spiking Cameras | Unknown | N/A | |
| VeXKD: The Versatile Integration of Cross-Modal Fusion and Knowledge Distillation for 3D Perception | Unknown | N/A | |
| A versatile informative diffusion model for single-cell ATAC-seq data generation and analysis | Unknown | N/A | |
| Normalization Layer Per-Example Gradients are Sufficient to Predict Gradient Noise Scale in Transformers | Unknown | N/A | |
| AlchemistCoder: Harmonizing and Eliciting Code Capability by Hindsight Tuning on Multi-source Data | Unknown | N/A | |
| General bounds on the quality of Bayesian coresets | Unknown | N/A | |
| Stepping on the Edge: Curvature Aware Learning Rate Tuners | Unknown | N/A | |
| 4Diffusion: Multi-view Video Diffusion Model for 4D Generation | Unknown | N/A | |
| Parameter-free Clipped Gradient Descent Meets Polyak | Unknown | N/A | |
| ID-to-3D: Expressive ID-guided 3D Heads via Score Distillation Sampling | Unknown | N/A | |
| Automatically Learning Hybrid Digital Twins of Dynamical Systems | Unknown | N/A | |
| U-DiTs: Downsample Tokens in U-Shaped Diffusion Transformers | Unknown | N/A | |
| A Textbook Remedy for Domain Shifts: Knowledge Priors for Medical Image Analysis | Unknown | N/A | |
| Bayesian Strategic Classification | Unknown | N/A | |
| Information Re-Organization Improves Reasoning in Large Language Models | Unknown | N/A | |
| FUGAL: Feature-fortified Unrestricted Graph Alignment | Unknown | N/A | |
| Time-Varying LoRA: Towards Effective Cross-Domain Fine-Tuning of Diffusion Models | Unknown | N/A | |
| Understanding Generalizability of Diffusion Models Requires Rethinking the Hidden Gaussian Structure | Unknown | N/A | |
| Dual Encoder GAN Inversion for High-Fidelity 3D Head Reconstruction from Single Images | Unknown | N/A | |
| Deep Bayesian Active Learning for Preference Modeling in Large Language Models | Unknown | N/A | |
| Unscrambling disease progression at scale: fast inference of event permutations with optimal transport | Unknown | N/A | |
| EnOF-SNN: Training Accurate Spiking Neural Networks via Enhancing the Output Feature | Unknown | N/A | |
| MoGU: A Framework for Enhancing Safety of LLMs While Preserving Their Usability | Unknown | N/A | |
| Wild-GS: Real-Time Novel View Synthesis from Unconstrained Photo Collections | Unknown | N/A | |
| Deep Equilibrium Algorithmic Reasoning | Unknown | N/A | |
| Detecting and Measuring Confounding Using Causal Mechanism Shifts | Unknown | N/A | |
| EM Distillation for One-step Diffusion Models | Unknown | N/A | |
| Prompt Tuning Strikes Back: Customizing Foundation Models with Low-Rank Prompt Adaptation | Unknown | N/A | |
| Differentially Private Reinforcement Learning with Self-Play | Unknown | N/A | |
| Universal Rates for Active Learning | Unknown | N/A | |
| Latent Paraphrasing: Perturbation on Layers Improves Knowledge Injection in Language Models | Unknown | N/A | |
| Monte Carlo Tree Search based Space Transfer for Black Box Optimization | Unknown | N/A | |
| OPERA: Automatic Offline Policy Evaluation with Re-weighted Aggregates of Multiple Estimators | Unknown | N/A | |
| Conformal Prediction for Class-wise Coverage via Augmented Label Rank Calibration | Unknown | N/A | |
| LaSCal: Label-Shift Calibration without target labels | Unknown | N/A | |
| Generated and Pseudo Content guided Prototype Refinement for Few-shot Point Cloud Segmentation | Unknown | N/A | |
| From Biased to Unbiased Dynamics: An Infinitesimal Generator Approach | Unknown | N/A | |
| Confident Natural Policy Gradient for Local Planning in $q_\pi$-realizable Constrained MDPs | Unknown | N/A | |
| What type of inference is planning? | Unknown | N/A | |
| A two-scale Complexity Measure for Deep Learning Models | Unknown | N/A | |
| How Do Large Language Models Acquire Factual Knowledge During Pretraining? | Unknown | N/A | |
| Beyond Euclidean: Dual-Space Representation Learning for Weakly Supervised Video Violence Detection | Unknown | N/A | |
| MicroAdam: Accurate Adaptive Optimization with Low Space Overhead and Provable Convergence | Unknown | N/A | |
| Fundamental Limits of Prompt Compression: A Rate-Distortion Framework for Black-Box Language Models | Unknown | N/A | |
| DiffNorm: Self-Supervised Normalization for Non-autoregressive Speech-to-speech Translation | Unknown | N/A | |
| Dynamic Service Fee Pricing under Strategic Behavior: Actions as Instruments and Phase Transition | Unknown | N/A | |
| DiffAug: A Diffuse-and-Denoise Augmentation for Training Robust Classifiers | Unknown | N/A | |
| High-dimensional (Group) Adversarial Training in Linear Regression | Unknown | N/A | |
| Randomized Truthful Auctions with Learning Agents | Unknown | N/A | |
| QuadMamba: Learning Quadtree-based Selective Scan for Visual State Space Model | Unknown | N/A | |
| Trajectory Data Suffices for Statistically Efficient Learning in Offline RL with Linear $q^\pi$-Realizability and Concentrability | Unknown | N/A | |
| Boosted Conformal Prediction Intervals | Unknown | N/A | |
| Idiographic Personality Gaussian Process for Psychological Assessment | Unknown | N/A | |
| UniSDF: Unifying Neural Representations for High-Fidelity 3D Reconstruction of Complex Scenes with Reflections | Unknown | N/A | |
| Non-asymptotic Analysis of Biased Adaptive Stochastic Approximation | Unknown | N/A | |
| Ordered Momentum for Asynchronous SGD | Unknown | N/A | |
| Targeted Sequential Indirect Experiment Design | Unknown | N/A | |
| An eye for an ear: zero-shot audio description leveraging an image captioner with audio-visual token distribution matching | Unknown | N/A | |
| Meta-DT: Offline Meta-RL as Conditional Sequence Modeling with World Model Disentanglement | Unknown | N/A | |
| Optimal Rates for Vector-Valued Spectral Regularization Learning Algorithms | Unknown | N/A | |
| MagR: Weight Magnitude Reduction for Enhancing Post-Training Quantization | Unknown | N/A | |
| Federated Graph Learning for Cross-Domain Recommendation | Unknown | N/A | |
| Benign overfitting in leaky ReLU networks with moderate input dimension | Unknown | N/A | |
| Exploring the trade-off between deep-learning and explainable models for brain-machine interfaces | Unknown | N/A | |
| Frequency-aware Generative Models for Multivariate Time Series Imputation | Unknown | N/A | |
| RaVL: Discovering and Mitigating Spurious Correlations in Fine-Tuned Vision-Language Models | Unknown | N/A | |
| Distribution-Aware Data Expansion with Diffusion Models | Unknown | N/A | |
| Exocentric-to-Egocentric Video Generation | Unknown | N/A | |
| Rapid Plug-in Defenders | Unknown | N/A | |
| Improved learning rates in multi-unit uniform price auctions | Unknown | N/A | |
| Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image | Unknown | N/A | |
| The Value of Reward Lookahead in Reinforcement Learning | Unknown | N/A | |
| Gradual Domain Adaptation via Manifold-Constrained Distributionally Robust Optimization | Unknown | N/A | |
| Geometry Cloak: Preventing TGS-based 3D Reconstruction from Copyrighted Images | Unknown | N/A | |
| Toward a Well-Calibrated Discrimination via Survival Outcome-Aware Contrastive Learning | Unknown | N/A | |
| Diffusion-based Reinforcement Learning via Q-weighted Variational Policy Optimization | Unknown | N/A | |
| A Tractable Inference Perspective of Offline RL | Unknown | N/A | |
| State Space Models on Temporal Graphs: A First-Principles Study | Unknown | N/A | |
| Variational Flow Matching for Graph Generation | Unknown | N/A | |
| AverNet: All-in-one Video Restoration for Time-varying Unknown Degradations | Unknown | N/A | |
| Hierarchical Uncertainty Exploration via Feedforward Posterior Trees | Unknown | N/A | |
| Direct Unlearning Optimization for Robust and Safe Text-to-Image Models | Unknown | N/A | |
| PANORAMIA: Privacy Auditing of Machine Learning Models without Retraining | Unknown | N/A | |
| DreamSteerer: Enhancing Source Image Conditioned Editability using Personalized Diffusion Models | Unknown | N/A | |
| Faster Repeated Evasion Attacks in Tree Ensembles | Unknown | N/A | |
| Cross-modal Representation Flattening for Multi-modal Domain Generalization | Unknown | N/A | |
| Auditing Local Explanations is Hard | Unknown | N/A | |
| Predictor-Corrector Enhanced Transformers with Exponential Moving Average Coefficient Learning | Unknown | N/A | |
| Interventionally Consistent Surrogates for Complex Simulation Models | Unknown | N/A | |
| Regression under demographic parity constraints via unlabeled post-processing | Unknown | N/A | |
| Self-Supervised Alignment with Mutual Information: Learning to Follow Principles without Preference Labels | Unknown | N/A | |
| Renovating Names in Open-Vocabulary Segmentation Benchmarks | Unknown | N/A | |
| Inductive biases of multi-task learning and finetuning: multiple regimes of feature reuse | Unknown | N/A | |
| Uncovering Safety Risks of Large Language Models through Concept Activation Vector | Unknown | N/A | |
| Maximum Entropy Inverse Reinforcement Learning of Diffusion Models with Energy-Based Models | Unknown | N/A | |
| DECRL: A Deep Evolutionary Clustering Jointed Temporal Knowledge Graph Representation Learning Approach | Unknown | N/A | |
| EMVP: Embracing Visual Foundation Model for Visual Place Recognition with Centroid-Free Probing | Unknown | N/A | |
| Outlier-Robust Distributionally Robust Optimization via Unbalanced Optimal Transport | Unknown | N/A | |
| FUG: Feature-Universal Graph Contrastive Pre-training for Graphs with Diverse Node Features | Unknown | N/A | |
| StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation | Unknown | N/A | |
| Learning Where to Edit Vision Transformers | Unknown | N/A | |
| Connectivity-Driven Pseudo-Labeling Makes Stronger Cross-Domain Segmenters | Unknown | N/A | |
| BPQP: A Differentiable Convex Optimization Framework for Efficient End-to-End Learning | Unknown | N/A | |
| Spiking Graph Neural Network on Riemannian Manifolds | Unknown | N/A | |
| CoVoMix: Advancing Zero-Shot Speech Generation for Human-like Multi-talker Conversations | Unknown | N/A | |
| Weak Supervision Performance Evaluation via Partial Identification | Unknown | N/A | |
| PrivCirNet: Efficient Private Inference via Block Circulant Transformation | Unknown | N/A | |
| A Prompt-Based Knowledge Graph Foundation Model for Universal In-Context Reasoning | Unknown | N/A | |
| PageRank Bandits for Link Prediction | Unknown | N/A | |
| Unveiling the Hidden Structure of Self-Attention via Kernel Principal Component Analysis | Unknown | N/A | |
| Transforming Vision Transformer: Towards Efficient Multi-Task Asynchronous Learner | Unknown | N/A | |
| Reproducibility of predictive networks for mouse visual cortex | Unknown | N/A | |
| Balancing Context Length and Mixing Times for Reinforcement Learning at Scale | Unknown | N/A | |
| LoRA-GA: Low-Rank Adaptation with Gradient Approximation | Unknown | N/A | |
| Direct Consistency Optimization for Robust Customization of Text-to-Image Diffusion models | Unknown | N/A | |
| Grounded Answers for Multi-agent Decision-making Problem through Generative World Model | Unknown | N/A | |
| TFGDA: Exploring Topology and Feature Alignment in Semi-supervised Graph Domain Adaptation through Robust Clustering | Unknown | N/A | |
| Decomposing and Interpreting Image Representations via Text in ViTs Beyond CLIP | Unknown | N/A | |
| Nearly Minimax Optimal Submodular Maximization with Bandit Feedback | Unknown | N/A | |
| Web-Scale Visual Entity Recognition: An LLM-Driven Data Approach | Unknown | N/A | |
| A Topology-aware Graph Coarsening Framework for Continual Graph Learning | Unknown | N/A | |
| SGLang: Efficient Execution of Structured Language Model Programs | Unknown | N/A | |
| Pipeline Parallelism with Controllable Memory | Unknown | N/A | |
| Kernel-Based Function Approximation for Average Reward Reinforcement Learning: An Optimist No-Regret Algorithm | Unknown | N/A | |
| Learning Generalized Linear Programming Value Functions | Unknown | N/A | |
| Depth Anywhere: Enhancing 360 Monocular Depth Estimation via Perspective Distillation and Unlabeled Data Augmentation | Unknown | N/A | |
| ContextGS: Compact 3D Gaussian Splatting with Anchor Level Context Model | Unknown | N/A | |
| Contextual Bilevel Reinforcement Learning for Incentive Alignment | Unknown | N/A | |
| Relational Verification Leaps Forward with RABBit | Unknown | N/A | |
| Derivative-enhanced Deep Operator Network | Unknown | N/A | |
| Fine-Tuning Personalization in Federated Learning to Mitigate Adversarial Clients | Unknown | N/A | |
| One-Layer Transformer Provably Learns One-Nearest Neighbor In Context | Unknown | N/A | |
| Adversarially Robust Decision Transformer | Unknown | N/A | |
| Efficient Reinforcement Learning by Discovering Neural Pathways | Unknown | N/A | |
| Exploiting Representation Curvature for Boundary Detection in Time Series | Unknown | N/A | |
| Interpretable Concept-Based Memory Reasoning | Unknown | N/A | |
| Do LLMs dream of elephants (when told not to)? Latent concept association and associative memory in transformers | Unknown | N/A | |
| IPO: Interpretable Prompt Optimization for Vision-Language Models | Unknown | N/A | |
| Estimating Epistemic and Aleatoric Uncertainty with a Single Model | Unknown | N/A | |
| Query-Efficient Correlation Clustering with Noisy Oracle | Unknown | N/A | |
| Safety through feedback in Constrained RL | Unknown | N/A | |
| Embedding-Aligned Language Models | Unknown | N/A | |
| Hybrid Reinforcement Learning Breaks Sample Size Barriers In Linear MDPs | Unknown | N/A | |
| Disentangling Linear Quadratic Control with Untrusted ML Predictions | Unknown | N/A | |
| Leveraging Visual Tokens for Extended Text Contexts in Multi-Modal Learning | Unknown | N/A | |
| Aligning Diffusion Behaviors with Q-functions for Efficient Continuous Control | Unknown | N/A | |
| CHASE: Learning Convex Hull Adaptive Shift for Skeleton-based Multi-Entity Action Recognition | Unknown | N/A | |
| Subwords as Skills: Tokenization for Sparse-Reward Reinforcement Learning | Unknown | N/A | |
| Federated Learning under Periodic Client Participation and Heterogeneous Data: A New Communication-Efficient Algorithm and Analysis | Unknown | N/A | |
| Personalized Federated Learning via Feature Distribution Adaptation | Unknown | N/A | |
| Latent Plan Transformer for Trajectory Abstraction: Planning as Latent Space Inference | Unknown | N/A | |
| Fairness-Aware Estimation of Graphical Models | Unknown | N/A | |
| PediatricsGPT: Large Language Models as Chinese Medical Assistants for Pediatric Applications | Unknown | N/A | |
| Graph Learning for Numeric Planning | Unknown | N/A | |
| Generalized Protein Pocket Generation with Prior-Informed Flow Matching | Unknown | N/A | |
| Invariant subspaces and PCA in nearly matrix multiplication time | Unknown | N/A | |
| Functional Bilevel Optimization for Machine Learning | Unknown | N/A | |
| FuseAnyPart: Diffusion-Driven Facial Parts Swapping via Multiple Reference Images | Unknown | N/A | |
| Zeroth-Order Sampling Methods for Non-Log-Concave Distributions: Alleviating Metastability by Denoising Diffusion | Unknown | N/A | |
| Mixture of Link Predictors on Graphs | Unknown | N/A | |
| MUVERA: Multi-Vector Retrieval via Fixed Dimensional Encoding | Unknown | N/A | |
| Jointly Modeling Inter- & Intra-Modality Dependencies for Multi-modal Learning | Unknown | N/A | |
| Non-asymptotic Approximation Error Bounds of Parameterized Quantum Circuits | Unknown | N/A | |
| Textual Training for the Hassle-Free Removal of Unwanted Visual Data: Case Studies on OOD and Hateful Image Detection | Unknown | N/A | |
| Self-Refining Diffusion Samplers: Enabling Parallelization via Parareal Iterations | Unknown | N/A | |
| The Collusion of Memory and Nonlinearity in Stochastic Approximation With Constant Stepsize | Unknown | N/A | |
| High-probability complexity bounds for stochastic non-convex minimax optimization | Unknown | N/A | |
| Visual Data Diagnosis and Debiasing with Concept Graphs | Unknown | N/A | |
| Linear Uncertainty Quantification of Graphical Model Inference | Unknown | N/A | |
| Continuous Product Graph Neural Networks | Unknown | N/A | |
| Optimistic Critic Reconstruction and Constrained Fine-Tuning for General Offline-to-Online RL | Unknown | N/A | |
| Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks | Unknown | N/A | |
| Training-Free Open-Ended Object Detection and Segmentation via Attention as Prompts | Unknown | N/A | |
| How does Gradient Descent Learn Features --- A Local Analysis for Regularized Two-Layer Neural Networks | Unknown | N/A | |
| Semi-Random Matrix Completion via Flow-Based Adaptive Reweighting | Unknown | N/A | |
| Pure Message Passing Can Estimate Common Neighbor for Link Prediction | Unknown | N/A | |
| When Your AIs Deceive You: Challenges of Partial Observability in Reinforcement Learning from Human Feedback | Unknown | N/A | |
| What If the Input is Expanded in OOD Detection? | Unknown | N/A | |
| Single Image Unlearning: Efficient Machine Unlearning in Multimodal Large Language Models | Unknown | N/A | |
| Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context Length | Unknown | N/A | |
| E2ENet: Dynamic Sparse Feature Fusion for Accurate and Efficient 3D Medical Image Segmentation | Unknown | N/A | |
| A Combinatorial Algorithm for the Semi-Discrete Optimal Transport Problem | Unknown | N/A | |
| TAIA: Large Language Models are Out-of-Distribution Data Learners | Unknown | N/A | |
| Goal Reduction with Loop-Removal Accelerates RL and Models Human Brain Activity in Goal-Directed Learning | Unknown | N/A | |
| Long-Horizon Planning for Multi-Agent Robots in Partially Observable Environments | Unknown | N/A | |
| Federated Learning from Vision-Language Foundation Models: Theoretical Analysis and Method | Unknown | N/A | |
| Conformal Inverse Optimization | Unknown | N/A | |
| SampDetox: Black-box Backdoor Defense via Perturbation-based Sample Detoxification | Unknown | N/A | |
| MetaLA: Unified Optimal Linear Approximation to Softmax Attention Map | Unknown | N/A | |
| A Cat Is A Cat (Not A Dog!): Unraveling Information Mix-ups in Text-to-Image Encoders through Causal Analysis and Embedding Optimization | Unknown | N/A | |
| Acoustic Volume Rendering for Neural Impulse Response Fields | Unknown | N/A | |
| Synergistic Dual Spatial-aware Generation of Image-to-text and Text-to-image | Unknown | N/A | |
| Cross-model Control: Improving Multiple Large Language Models in One-time Training | Unknown | N/A | |
| SpatialPIN: Enhancing Spatial Reasoning Capabilities of Vision-Language Models through Prompting and Interacting 3D Priors | Unknown | N/A | |
| A Full-duplex Speech Dialogue Scheme Based On Large Language Model | Unknown | N/A | |
| Can Learned Optimization Make Reinforcement Learning Less Difficult? | Unknown | N/A | |
| CRONOS: Enhancing Deep Learning with Scalable GPU Accelerated Convex Neural Networks | Unknown | N/A | |
| Physics-Regularized Multi-Modal Image Assimilation for Brain Tumor Localization | Unknown | N/A | |
| Linearly Decomposing and Recomposing Vision Transformers for Diverse-Scale Models | Unknown | N/A | |
| Preference Learning Algorithms Do Not Learn Preference Rankings | Unknown | N/A | |
| Discrete-state Continuous-time Diffusion for Graph Generation | Unknown | N/A | |
| TreeVI: Reparameterizable Tree-structured Variational Inference for Instance-level Correlation Capturing | Unknown | N/A | |
| DreamScene4D: Dynamic Multi-Object Scene Generation from Monocular Videos | Unknown | N/A | |
| The Poisson Midpoint Method for Langevin Dynamics: Provably Efficient Discretization for Diffusion Models | Unknown | N/A | |
| United We Stand, Divided We Fall: Fingerprinting Deep Neural Networks via Adversarial Trajectories | Unknown | N/A | |
| DISP-LLM: Dimension-Independent Structural Pruning for Large Language Models | Unknown | N/A | |
| The tree autoencoder model, with application to hierarchical data visualization | Unknown | N/A | |
| Conformal Alignment: Knowing When to Trust Foundation Models with Guarantees | Unknown | N/A | |
| pcaGAN: Improving Posterior-Sampling cGANs via Principal Component Regularization | Unknown | N/A | |
| Neural Flow Diffusion Models: Learnable Forward Process for Improved Diffusion Modelling | Unknown | N/A | |
| Unity by Diversity: Improved Representation Learning for Multimodal VAEs | Unknown | N/A | |
| Curvature Clues: Decoding Deep Learning Privacy with Input Loss Curvature | Unknown | N/A | |
| Recurrent Complex-Weighted Autoencoders for Unsupervised Object Discovery | Unknown | N/A | |
| SlimSAM: 0.1% Data Makes Segment Anything Slim | Unknown | N/A | |
| A Pairwise Pseudo-likelihood Approach for Matrix Completion with Informative Missingness | Unknown | N/A | |
| MemVLT: Vision-Language Tracking with Adaptive Memory-based Prompts | Unknown | N/A | |
| DomainGallery: Few-shot Domain-driven Image Generation by Attribute-centric Finetuning | Unknown | N/A | |
| NeoRL: Efficient Exploration for Nonepisodic RL | Unknown | N/A | |
| Recurrent neural network dynamical systems for biological vision | Unknown | N/A | |
| ClavaDDPM: Multi-relational Data Synthesis with Cluster-guided Diffusion Models | Unknown | N/A | |
| Stochastic Amortization: A Unified Approach to Accelerate Feature and Data Attribution | Unknown | N/A | |
| CYCLO: Cyclic Graph Transformer Approach to Multi-Object Relationship Modeling in Aerial Videos | Unknown | N/A | |
| Concentrate Attention: Towards Domain-Generalizable Prompt Optimization for Language Models | Unknown | N/A | |
| How does Inverse RL Scale to Large State Spaces? A Provably Efficient Approach | Unknown | N/A | |
| CriticEval: Evaluating Large-scale Language Model as Critic | Unknown | N/A | |
| Constrained Adaptive Attack: Effective Adversarial Attack Against Deep Neural Networks for Tabular Data | Unknown | N/A | |
| Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance | Unknown | N/A | |
| Learning-to-Cache: Accelerating Diffusion Transformer via Layer Caching | Unknown | N/A | |
| OneBit: Towards Extremely Low-bit Large Language Models | Unknown | N/A | |
| Learn more, but bother less: parameter efficient continual learning | Unknown | N/A | |
| OPUS: Occupancy Prediction Using a Sparse Set | Unknown | N/A | |
| Graph-based Unsupervised Disentangled Representation Learning via Multimodal Large Language Models | Unknown | N/A | |
| Block Sparse Bayesian Learning: A Diversified Scheme | Unknown | N/A | |
| Tri-Level Navigator: LLM-Empowered Tri-Level Learning for Time Series OOD Generalization | Unknown | N/A | |
| Images that Sound: Composing Images and Sounds on a Single Canvas | Unknown | N/A | |
| CNCA: Toward Customizable and Natural Generation of Adversarial Camouflage for Vehicle Detectors | Unknown | N/A | |
| On the Sparsity of the Strong Lottery Ticket Hypothesis | Unknown | N/A | |
| Hamiltonian Monte Carlo on ReLU Neural Networks is Inefficient | Unknown | N/A | |
| Learning Mixtures of Unknown Causal Interventions | Unknown | N/A | |
| Diffusion PID: Interpreting Diffusion via Partial Information Decomposition | Unknown | N/A | |
| Hierarchical Federated Learning with Multi-Timescale Gradient Correction | Unknown | N/A | |
| Bandits with Ranking Feedback | Unknown | N/A | |
| TopoFR: A Closer Look at Topology Alignment on Face Recognition | Unknown | N/A | |
| On Differentially Private Subspace Estimation in a Distribution-Free Setting | Unknown | N/A | |
| An Accelerated Gradient Method for Convex Smooth Simple Bilevel Optimization | Unknown | N/A | |
| On Socially Fair Low-Rank Approximation and Column Subset Selection | Unknown | N/A | |
| Segment Anything without Supervision | Unknown | N/A | |
| Samba: Severity-aware Recurrent Modeling for Cross-domain Medical Image Grading | Unknown | N/A | |
| Symmetric Linear Bandits with Hidden Symmetry | Unknown | N/A | |
| Trajectory Flow Matching with Applications to Clinical Time Series Modelling | Unknown | N/A | |
| Enhancing Multiple Dimensions of Trustworthiness in LLMs via Sparse Activation Control | Unknown | N/A | |
| SureMap: Simultaneous mean estimation for single-task and multi-task disaggregated evaluation | Unknown | N/A | |
| Evaluating the World Model Implicit in a Generative Model | Unknown | N/A | |
| Discovering plasticity rules that organize and maintain neural circuits | Unknown | N/A | |
| Learning World Models for Unconstrained Goal Navigation | Unknown | N/A | |
| Learning Elastic Costs to Shape Monge Displacements | Unknown | N/A | |
| Symmetry-Informed Governing Equation Discovery | Unknown | N/A | |
| Incorporating Surrogate Gradient Norm to Improve Offline Optimization Techniques | Unknown | N/A | |
| Zipper: Addressing Degeneracy in Algorithm-Agnostic Inference | Unknown | N/A | |
| Offline Reinforcement Learning with OOD State Correction and OOD Action Suppression | Unknown | N/A | |
| Differentially Private Graph Diffusion with Applications in Personalized PageRanks | Unknown | N/A | |
| TabPedia: Towards Comprehensive Visual Table Understanding with Concept Synergy | Unknown | N/A | |
| Collaborative Video Diffusion: Consistent Multi-video Generation with Camera Control | Unknown | N/A | |
| Is Multiple Object Tracking a Matter of Specialization? | Unknown | N/A | |
| Streaming Long Video Understanding with Large Language Models | Unknown | N/A | |
| VCR-GauS: View Consistent Depth-Normal Regularizer for Gaussian Surface Reconstruction | Unknown | N/A | |
| Using Noise to Infer Aspects of Simplicity Without Learning | Unknown | N/A | |
| Invertible Consistency Distillation for Text-Guided Image Editing in Around 7 Steps | Unknown | N/A | |
| Don't Look Twice: Faster Video Transformers with Run-Length Tokenization | Unknown | N/A | |
| Does Reasoning Emerge? Examining the Probabilities of Causation in Large Language Models | Unknown | N/A | |
| Learning Frequency-Adapted Vision Foundation Model for Domain Generalized Semantic Segmentation | Unknown | N/A | |
| MALT Powers Up Adversarial Attacks | Unknown | N/A | |
| VeLoRA: Memory Efficient Training using Rank-1 Sub-Token Projections | Unknown | N/A | |
| Fair Allocation in Dynamic Mechanism Design | Unknown | N/A | |
| Opponent Modeling with In-context Search | Unknown | N/A | |
| Stability and Generalization of Asynchronous SGD: Sharper Bounds Beyond Lipschitz and Smoothness | Unknown | N/A | |
| VisMin: Visual Minimal-Change Understanding | Unknown | N/A | |
| Causal Contrastive Learning for Counterfactual Regression Over Time | Unknown | N/A | |
| VQ-Map: Bird's-Eye-View Map Layout Estimation in Tokenized Discrete Space via Vector Quantization | Unknown | N/A | |
| Robust Sleep Staging over Incomplete Multimodal Physiological Signals via Contrastive Imagination | Unknown | N/A | |
| Towards training digitally-tied analog blocks via hybrid gradient computation | Unknown | N/A | |
| On the Complexity of Identification in Linear Structural Causal Models | Unknown | N/A | |
| Learning Discrete Concepts in Latent Hierarchical Models | Unknown | N/A | |
| One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos | Unknown | N/A | |
| Fast yet Safe: Early-Exiting with Risk Control | Unknown | N/A | |
| Challenges of Generating Structurally Diverse Graphs | Unknown | N/A | |
| Guiding a Diffusion Model with a Bad Version of Itself | Unknown | N/A | |
| Multistable Shape from Shading Emerges from Patch Diffusion | Unknown | N/A | |
| OnlineTAS: An Online Baseline for Temporal Action Segmentation | Unknown | N/A | |
| Decoupled Kullback-Leibler Divergence Loss | Unknown | N/A | |
| Advancing Cross-domain Discriminability in Continual Learning of Vision-Language Models | Unknown | N/A | |
| Fast Encoder-Based 3D from Casual Videos via Point Track Processing | Unknown | N/A | |
| Flow Snapshot Neurons in Action: Deep Neural Networks Generalize to Biological Motion Perception | Unknown | N/A | |
| Self-Labeling the Job Shop Scheduling Problem | Unknown | N/A | |
| Large Spatial Model: End-to-end Unposed Images to Semantic 3D | Unknown | N/A | |
| A Non-parametric Direct Learning Approach to Heterogeneous Treatment Effect Estimation under Unmeasured Confounding | Unknown | N/A | |
| Training-Free Adaptive Diffusion with Bounded Difference Approximation Strategy | Unknown | N/A | |
| Estimating Heterogeneous Treatment Effects by Combining Weak Instruments and Observational Data | Unknown | N/A | |
| Efficient $\Phi$-Regret Minimization with Low-Degree Swap Deviations in Extensive-Form Games | Unknown | N/A | |
| Exploring Structured Semantic Priors Underlying Diffusion Score for Test-time Adaptation | Unknown | N/A | |
| Nearly Tight Black-Box Auditing of Differentially Private Machine Learning | Unknown | N/A | |
| ParallelEdits: Efficient Multi-Aspect Text-Driven Image Editing with Attention Grouping | Unknown | N/A | |
| Convolutions and More as Einsum: A Tensor Network Perspective with Advances for Second-Order Methods | Unknown | N/A | |
| Divide-and-Conquer Meets Consensus: Unleashing the Power of Functions in Code Generation | Unknown | N/A | |
| OW-VISCapTor: Abstractors for Open-World Video Instance Segmentation and Captioning | Unknown | N/A | |
| Training Binary Neural Networks via Gaussian Variational Inference and Low-Rank Semidefinite Programming | Unknown | N/A | |
| No-Regret Bandit Exploration based on Soft Tree Ensemble Model | Unknown | N/A | |
| One-to-Multiple: A Progressive Style Transfer Unsupervised Domain-Adaptive Framework for Kidney Tumor Segmentation | Unknown | N/A | |
| Fine-Tuning is Fine, if Calibrated | Unknown | N/A | |
| Text-Guided Attention is All You Need for Zero-Shot Robustness in Vision-Language Models | Unknown | N/A | |
| Private Geometric Median | Unknown | N/A | |
| Credit Attribution and Stable Compression | Unknown | N/A | |
| Generative Forests | Unknown | N/A | |
| Distributional regression: CRPS-error bounds for model fitting, model selection and convex aggregation | Unknown | N/A | |
| Learning Truncated Causal History Model for Video Restoration | Unknown | N/A | |
| Improving Sparse Decomposition of Language Model Activations with Gated Sparse Autoencoders | Unknown | N/A | |
| Abstract Reward Processes: Leveraging State Abstraction for Consistent Off-Policy Evaluation | Unknown | N/A | |
| Faster Diffusion: Rethinking the Role of the Encoder for Diffusion Model Inference | Unknown | N/A | |
| DeTeCtive: Detecting AI-generated Text via Multi-Level Contrastive Learning | Unknown | N/A | |
| Long-Tailed Out-of-Distribution Detection via Normalized Outlier Distribution Adaptation | Unknown | N/A | |
| Discretely beyond $1/e$: Guided Combinatorial Algorithms for Submodular Maximization | Unknown | N/A | |
| IF-Font: Ideographic Description Sequence-Following Font Generation | Unknown | N/A | |
| Learning Macroscopic Dynamics from Partial Microscopic Observations | Unknown | N/A | |
| Axioms for AI Alignment from Human Feedback | Unknown | N/A | |
| InversionView: A General-Purpose Method for Reading Information from Neural Activations | Unknown | N/A | |
| DeiSAM: Segment Anything with Deictic Prompting | Unknown | N/A | |
| Delta-CoMe: Training-Free Delta-Compression with Mixed-Precision for Large Language Models | Unknown | N/A | |
| Customized Subgraph Selection and Encoding for Drug-drug Interaction Prediction | Unknown | N/A | |
| Online Weighted Paging with Unknown Weights | Unknown | N/A | |
| An exactly solvable model for emergence and scaling laws in the multitask sparse parity problem | Unknown | N/A | |
| Many-shot Jailbreaking | Unknown | N/A | |
| Can neural operators always be continuously discretized? | Unknown | N/A | |
| Self-Consuming Generative Models with Curated Data Provably Optimize Human Preferences | Unknown | N/A | |
| LeDex: Training LLMs to Better Self-Debug and Explain Code | Unknown | N/A | |
| EPIC: Effective Prompting for Imbalanced-Class Data Synthesis in Tabular Data Classification via Large Language Models | Unknown | N/A | |
| Context-Aware Testing: A New Paradigm for Model Testing with Large Language Models | Unknown | N/A | |
| Classification Diffusion Models: Revitalizing Density Ratio Estimation | Unknown | N/A | |
| One-Shot Safety Alignment for Large Language Models via Optimal Dualization | Unknown | N/A | |
| Towards Understanding How Transformers Learn In-context Through a Representation Learning Lens | Unknown | N/A | |
| DiGRAF: Diffeomorphic Graph-Adaptive Activation Function | Unknown | N/A | |
| UMFC: Unsupervised Multi-Domain Feature Calibration for Vision-Language Models | Unknown | N/A | |
| MetaAligner: Towards Generalizable Multi-Objective Alignment of Language Models | Unknown | N/A | |
| Dissecting Query-Key Interaction in Vision Transformers | Unknown | N/A | |
| Constrained Diffusion with Trust Sampling | Unknown | N/A | |
| Unified Gradient-Based Machine Unlearning with Remain Geometry Enhancement | Unknown | N/A | |
| Learning Optimal Lattice Vector Quantizers for End-to-end Neural Image Compression | Unknown | N/A | |
| Retrieval-Augmented Diffusion Models for Time Series Forecasting | Unknown | N/A | |
| ReMAP: Neural Model Reprogramming with Network Inversion and Retrieval-Augmented Mapping for Adaptive Motion Forecasting | Unknown | N/A | |
| DeltaDock: A Unified Framework for Accurate, Efficient, and Physically Reliable Molecular Docking | Unknown | N/A | |
| FUSE: Fast Unified Simulation and Estimation for PDEs | Unknown | N/A | |
| Molecule Design by Latent Prompt Transformer | Unknown | N/A | |
| Geometric Exploitation for Indoor Panoramic Semantic Segmentation | Unknown | N/A | |
| Hardness of Learning Neural Networks under the Manifold Hypothesis | Unknown | N/A | |
| Defensive Unlearning with Adversarial Training for Robust Concept Erasure in Diffusion Models | Unknown | N/A | |
| Semi-Supervised Sparse Gaussian Classification: Provable Benefits of Unlabeled Data | Unknown | N/A | |
| Dynamics of Supervised and Reinforcement Learning in the Non-Linear Perceptron | Unknown | N/A | |
| Noether's Razor: Learning Conserved Quantities | Unknown | N/A | |
| Approximately Equivariant Neural Processes | Unknown | N/A | |
| An Efficient Memory Module for Graph Few-Shot Class-Incremental Learning | Unknown | N/A | |
| Free Lunch in Pathology Foundation Model: Task-specific Model Adaptation with Concept-Guided Feature Enhancement | Unknown | N/A | |
| Nature-Inspired Local Propagation | Unknown | N/A | |
| What matters when building vision-language models? | Unknown | N/A | |
| Dynamic Tuning Towards Parameter and Inference Efficiency for ViT Adaptation | Unknown | N/A | |
| HairFastGAN: Realistic and Robust Hair Transfer with a Fast Encoder-Based Approach | Unknown | N/A | |
| Semi-supervised Knowledge Transfer Across Multi-omic Single-cell Data | Unknown | N/A | |
| Learning Human-like Representations to Enable Learning Human Values | Unknown | N/A | |
| C-GAIL: Stabilizing Generative Adversarial Imitation Learning with Control Theory | Unknown | N/A | |
| SLED: Self Logits Evolution Decoding for Improving Factuality in Large Language Models | Unknown | N/A | |
| Stochastic contextual bandits with graph feedback: from independence number to MAS number | Unknown | N/A | |
| Optimal Algorithms for Augmented Testing of Discrete Distributions | Unknown | N/A | |
| Identity Decoupling for Multi-Subject Personalization of Text-to-Image Models | Unknown | N/A | |
| One-to-Normal: Anomaly Personalization for Few-shot Anomaly Detection | Unknown | N/A | |
| What Matters in Graph Class Incremental Learning? An Information Preservation Perspective | Unknown | N/A | |
| SILENCE: Protecting privacy in offloaded speech understanding on resource-constrained devices | Unknown | N/A | |
| Out-of-Distribution Detection with a Single Unconditional Diffusion Model | Unknown | N/A | |
| Reinforced Cross-Domain Knowledge Distillation on Time Series Data | Unknown | N/A | |
| Transductive Active Learning: Theory and Applications | Unknown | N/A | |
| Reimagining Mutual Information for Enhanced Defense against Data Leakage in Collaborative Inference | Unknown | N/A | |
| Adaptive Variance Reduction for Stochastic Optimization under Weaker Assumptions | Unknown | N/A | |
| TransAgent: Transfer Vision-Language Foundation Models with Heterogeneous Agent Collaboration | Unknown | N/A | |
| AirSketch: Generative Motion to Sketch | Unknown | N/A | |
| Revisiting the Integration of Convolution and Attention for Vision Backbone | Unknown | N/A | |
| Conditional Outcome Equivalence: A Quantile Alternative to CATE | Unknown | N/A | |
| RAMP: Boosting Adversarial Robustness Against Multiple $l_p$ Perturbations for Universal Robustness | Unknown | N/A | |
| Leveraging Contrastive Learning for Enhanced Node Representations in Tokenized Graph Transformers | Unknown | N/A | |
| Online Convex Optimisation: The Optimal Switching Regret for all Segmentations Simultaneously | Unknown | N/A | |
| Protecting Your LLMs with Information Bottleneck | Unknown | N/A | |
| On the Necessity of Collaboration for Online Model Selection with Decentralized Data | Unknown | N/A | |
| Training Compute-Optimal Protein Language Models | Unknown | N/A | |
| Addressing Asynchronicity in Clinical Multimodal Fusion via Individualized Chest X-ray Generation | Unknown | N/A | |
| Nonconvex Federated Learning on Compact Smooth Submanifolds With Heterogeneous Data | Unknown | N/A | |
| PACE: Pacing Operator Learning to Accurate Optical Field Simulation for Complicated Photonic Devices | Unknown | N/A | |
| Sample Complexity of Algorithm Selection Using Neural Networks and Its Applications to Branch-and-Cut | Unknown | N/A | |
| Sequential Probability Assignment with Contexts: Minimax Regret, Contextual Shtarkov Sums, and Contextual Normalized Maximum Likelihood | Unknown | N/A | |
| Efficient Sign-Based Optimization: Accelerating Convergence via Variance Reduction | Unknown | N/A | |
| Convergence Analysis of Split Federated Learning on Heterogeneous Data | Unknown | N/A | |
| Incorporating Test-Time Optimization into Training with Dual Networks for Human Mesh Recovery | Unknown | N/A | |
| FIFO-Diffusion: Generating Infinite Videos from Text without Training | Unknown | N/A | |
| Drago: Primal-Dual Coupled Variance Reduction for Faster Distributionally Robust Optimization | Unknown | N/A | |
| The Fine-Grained Complexity of Gradient Computation for Training Large Language Models | Unknown | N/A | |
| Exploring Jacobian Inexactness in Second-Order Methods for Variational Inequalities: Lower Bounds, Optimal Algorithms and Quasi-Newton Approximations | Unknown | N/A | |
| Learning to Edit Visual Programs with Self-Supervision | Unknown | N/A | |
| Achieving Domain-Independent Certified Robustness via Knowledge Continuity | Unknown | N/A | |
| Weight for Robustness: A Comprehensive Approach towards Optimal Fault-Tolerant Asynchronous ML | Unknown | N/A | |
| GeoNLF: Geometry guided Pose-Free Neural LiDAR Fields | Unknown | N/A | |
| CE-NAS: An End-to-End Carbon-Efficient Neural Architecture Search Framework | Unknown | N/A | |
| Measuring Dejavu Memorization Efficiently | Unknown | N/A | |
| Learning from Uncertain Data: From Possible Worlds to Possible Models | Unknown | N/A | |
| Going Beyond Heuristics by Imposing Policy Improvement as a Constraint | Unknown | N/A | |
| Gradient Cuff: Detecting Jailbreak Attacks on Large Language Models by Exploring Refusal Loss Landscapes | Unknown | N/A | |
| Speaking Your Language: Spatial Relationships in Interpretable Emergent Communication | Unknown | N/A | |
| Learning Multimodal Behaviors from Scratch with Diffusion Policy Gradient | Unknown | N/A | |
| DynaMo: In-Domain Dynamics Pretraining for Visuo-Motor Control | Unknown | N/A | |
| Can We Leave Deepfake Data Behind in Training Deepfake Detector? | Unknown | N/A | |
| $\epsilon$-Softmax: Approximating One-Hot Vectors for Mitigating Label Noise | Unknown | N/A | |
| Learning Segmentation from Point Trajectories | Unknown | N/A | |
| Average gradient outer product as a mechanism for deep neural collapse | Unknown | N/A | |
| Revive Re-weighting in Imbalanced Learning by Density Ratio Estimation | Unknown | N/A | |
| High Rank Path Development: an approach to learning the filtration of stochastic processes | Unknown | N/A | |
| Can Language Models Learn to Skip Steps? | Unknown | N/A | |
| Discovery of the Hidden World with Large Language Models | Unknown | N/A | |
| Self-Calibrated Tuning of Vision-Language Models for Out-of-Distribution Detection | Unknown | N/A | |
| Energy-Based Modelling for Discrete and Mixed Data via Heat Equations on Structured Spaces | Unknown | N/A | |
| Learning Spatially-Aware Language and Audio Embeddings | Unknown | N/A | |
| Decision Mamba: Reinforcement Learning via Hybrid Selective Sequence Modeling | Unknown | N/A | |
| Algorithmic Collective Action in Recommender Systems: Promoting Songs by Reordering Playlists | Unknown | N/A | |
| S-MolSearch: 3D Semi-supervised Contrastive Learning for Bioactive Molecule Search | Unknown | N/A | |
| CRAYM: Neural Field Optimization via Camera RAY Matching | Unknown | N/A | |
| Batched Energy-Entropy acquisition for Bayesian Optimization | Unknown | N/A | |
| Neural Collapse Inspired Feature Alignment for Out-of-Distribution Generalization | Unknown | N/A | |
| Iterative Methods via Locally Evolving Set Process | Unknown | N/A | |
| Probabilistic Weather Forecasting with Hierarchical Graph Neural Networks | Unknown | N/A | |
| Reflective Multi-Agent Collaboration based on Large Language Models | Unknown | N/A | |
| A Sober Look at the Robustness of CLIPs to Spurious Features | Unknown | N/A | |
| Incentivizing Quality Text Generation via Statistical Contracts | Unknown | N/A | |
| RoPINN: Region Optimized Physics-Informed Neural Networks | Unknown | N/A | |
| Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective | Unknown | N/A | |
| Real-Time Selection Under General Constraints via Predictive Inference | Unknown | N/A | |
| MonoMAE: Enhancing Monocular 3D Detection through Depth-Aware Masked Autoencoders | Unknown | N/A | |
| Stochastic Optimal Control Matching | Unknown | N/A | |
| Implicit Bias of Mirror Flow on Separable Data | Unknown | N/A | |
| Scaling White-Box Transformers for Vision | Unknown | N/A | |
| Advancing Training Efficiency of Deep Spiking Neural Networks through Rate-based Backpropagation | Unknown | N/A | |
| Corruption-Robust Linear Bandits: Minimax Optimality and Gap-Dependent Misspecification | Unknown | N/A | |
| The Feature Speed Formula: a flexible approach to scale hyper-parameters of deep neural networks | Unknown | N/A | |
| On the Ability of Developers' Training Data Preservation of Learnware | Unknown | N/A | |
| A provable control of sensitivity of neural networks through a direct parameterization of the overall bi-Lipschitzness | Unknown | N/A | |
| Unsupervised Object Detection with Theoretical Guarantees | Unknown | N/A | |
| On Convergence of Adam for Stochastic Optimization under Relaxed Assumptions | Unknown | N/A | |
| From Unstructured Data to In-Context Learning: Exploring What Tasks Can Be Learned and When | Unknown | N/A | |
| Med-Real2Sim: Non-Invasive Medical Digital Twins using Physics-Informed Self-Supervised Learning | Unknown | N/A | |
| Can an AI Agent Safely Run a Government? Existence of Probably Approximately Aligned Policies | Unknown | N/A | |
| Communication-Efficient Federated Group Distributionally Robust Optimization | Unknown | N/A | |
| The Implicit Bias of Adam on Separable Data | Unknown | N/A | |
| Beyond Redundancy: Information-aware Unsupervised Multiplex Graph Structure Learning | Unknown | N/A | |
| Feedback control guides credit assignment in recurrent neural networks | Unknown | N/A | |
| Customized Multiple Clustering via Multi-Modal Subspace Proxy Learning | Unknown | N/A | |
| FedGMark: Certifiably Robust Watermarking for Federated Graph Learning | Unknown | N/A | |
| Reasons and Solutions for the Decline in Model Performance after Editing | Unknown | N/A | |
| Take A Shortcut Back: Mitigating the Gradient Vanishing for Training Spiking Neural Networks | Unknown | N/A | |
| Scene Graph Generation with Role-Playing Large Language Models | Unknown | N/A | |
| Inverse Factorized Soft Q-Learning for Cooperative Multi-agent Imitation Learning | Unknown | N/A | |
| ManiPose: Manifold-Constrained Multi-Hypothesis 3D Human Pose Estimation | Unknown | N/A | |
| Learning diffusion at lightspeed | Unknown | N/A | |
| Continuously Learning, Adapting, and Improving: A Dual-Process Approach to Autonomous Driving | Unknown | N/A | |
| Causal Discovery from Event Sequences by Local Cause-Effect Attribution | Unknown | N/A | |
| In Pursuit of Causal Label Correlations for Multi-label Image Recognition | Unknown | N/A | |
| Stepping Forward on the Last Mile | Unknown | N/A | |
| AUC Maximization under Positive Distribution Shift | Unknown | N/A | |
| Toward a Stable, Fair, and Comprehensive Evaluation of Object Hallucination in Large Vision-Language Models | Unknown | N/A | |
| Diffusion-based Curriculum Reinforcement Learning | Unknown | N/A | |
| Exogenous Matching: Learning Good Proposals for Tractable Counterfactual Estimation | Unknown | N/A | |
| Learning Bregman Divergences with Application to Robustness | Unknown | N/A | |
| Excluding the Irrelevant: Focusing Reinforcement Learning through Continuous Action Masking | Unknown | N/A | |
| Controlled maximal variability along with reliable performance in recurrent neural networks | Unknown | N/A | |
| Binarized Diffusion Model for Image Super-Resolution | Unknown | N/A | |
| Avoiding Undesired Future with Minimal Cost in Non-Stationary Environments | Unknown | N/A | |
| Optimal Algorithms for Learning Partitions with Faulty Oracles | Unknown | N/A | |
| Information-theoretic Generalization Analysis for Expected Calibration Error | Unknown | N/A | |
| ReGS: Reference-based Controllable Scene Stylization with Gaussian Splatting | Unknown | N/A | |
| PGN: The RNN's New Successor is Effective for Long-Range Time Series Forecasting | Unknown | N/A | |
| GTBench: Uncovering the Strategic Reasoning Capabilities of LLMs via Game-Theoretic Evaluations | Unknown | N/A | |
| ReLIZO: Sample Reusable Linear Interpolation-based Zeroth-order Optimization | Unknown | N/A | |
| XMask3D: Cross-modal Mask Reasoning for Open Vocabulary 3D Semantic Segmentation | Unknown | N/A | |
| Guiding Neural Collapse: Optimising Towards the Nearest Simplex Equiangular Tight Frame | Unknown | N/A | |
| Gated Inference Network: Inference and Learning State-Space Models | Unknown | N/A | |
| Unveiling the Hidden: Online Vectorized HD Map Construction with Clip-Level Token Interaction and Propagation | Unknown | N/A | |
| Learning diverse causally emergent representations from time series data | Unknown | N/A | |
| Amortized Bayesian Experimental Design for Decision-Making | Unknown | N/A | |
| A teacher-teacher framework for clinical language representation learning | Unknown | N/A | |
| Deep Correlated Prompting for Visual Recognition with Missing Modalities | Unknown | N/A | |
| Semantic Feature Learning for Universal Unsupervised Cross-Domain Retrieval | Unknown | N/A | |
| Inference of Neural Dynamics Using Switching Recurrent Neural Networks | Unknown | N/A | |
| Exploring Context Window of Large Language Models via Decomposed Positional Vectors | Unknown | N/A | |
| SeeClear: Semantic Distillation Enhances Pixel Condensation for Video Super-Resolution | Unknown | N/A | |
| Enhancing Robustness of Graph Neural Networks on Social Media with Explainable Inverse Reinforcement Learning | Unknown | N/A | |
| TPR: Topology-Preserving Reservoirs for Generalized Zero-Shot Learning | Unknown | N/A | |
| Rethinking Deep Thinking: Stable Learning of Algorithms using Lipschitz Constraints | Unknown | N/A | |
| GraphCroc: Cross-Correlation Autoencoder for Graph Structural Reconstruction | Unknown | N/A | |
| OxonFair: A Flexible Toolkit for Algorithmic Fairness | Unknown | N/A | |
| A Separation in Heavy-Tailed Sampling: Gaussian vs. Stable Oracles for Proximal Samplers | Unknown | N/A | |
| Toward Global Convergence of Gradient EM for Over-Parameterized Gaussian Mixture Models | Unknown | N/A | |
| Learning Distributions on Manifolds with Free-Form Flows | Unknown | N/A | |
| Sigmoid Gating is More Sample Efficient than Softmax Gating in Mixture of Experts | Unknown | N/A | |
| On the Expressivity and Sample Complexity of Node-Individualized Graph Neural Networks | Unknown | N/A | |
| Molecule Generation with Fragment Retrieval Augmentation | Unknown | N/A | |
| Persistent Homology for High-dimensional Data Based on Spectral Methods | Unknown | N/A | |
| Is Cross-validation the Gold Standard to Estimate Out-of-sample Model Performance? | Unknown | N/A | |
| Testably Learning Polynomial Threshold Functions | Unknown | N/A | |
| GAVEL: Generating Games via Evolution and Language Models | Unknown | N/A | |
| Linking In-context Learning in Transformers to Human Episodic Memory | Unknown | N/A | |
| Get rich quick: exact solutions reveal how unbalanced initializations promote rapid feature learning | Unknown | N/A | |
| Non-Asymptotic Uncertainty Quantification in High-Dimensional Learning | Unknown | N/A | |
| Contextual Multinomial Logit Bandits with General Value Functions | Unknown | N/A | |
| How JEPA Avoids Noisy Features: The Implicit Bias of Deep Linear Self Distillation Networks | Unknown | N/A | |
| Transferable Boltzmann Generators | Unknown | N/A | |
| Quantum Deep Equilibrium Models | Unknown | N/A | |
| Navigating Extremes: Dynamic Sparsity in Large Output Spaces | Unknown | N/A | |
| Representation Noising: A Defence Mechanism Against Harmful Finetuning | Unknown | N/A | |
| Communication Bounds for the Distributed Experts Problem | Unknown | N/A | |
| Multi-Reward Best Policy Identification | Unknown | N/A | |
| Parseval Regularization for Continual Reinforcement Learning | Unknown | N/A | |
| Identifiable Shared Component Analysis of Unpaired Multimodal Mixtures | Unknown | N/A | |
| Code Repair with LLMs gives an Exploration-Exploitation Tradeoff | Unknown | N/A | |
| UNION: Unsupervised 3D Object Detection using Object Appearance-based Pseudo-Classes | Unknown | N/A | |
| WorldCoder, a Model-Based LLM Agent: Building World Models by Writing Code and Interacting with the Environment | Unknown | N/A | |
| Aggregate-and-Adapt Natural Language Prompts for Downstream Generalization of CLIP | Unknown | N/A | |
| Zero-shot Image Editing with Reference Imitation | Unknown | N/A | |
| On Divergence Measures for Training GFlowNets | Unknown | N/A | |
| MambaLRP: Explaining Selective State Space Sequence Models | Unknown | N/A | |
| Large language model validity via enhanced conformal prediction methods | Unknown | N/A | |
| Depth Anything V2 | Unknown | N/A | |
| Vaccine: Perturbation-aware Alignment for Large Language Models against Harmful Fine-tuning Attack | Unknown | N/A | |
| Semi-Open 3D Object Retrieval via Hierarchical Equilibrium on Hypergraph | Unknown | N/A | |
| Assembly Fuzzy Representation on Hypergraph for Open-Set 3D Object Retrieval | Unknown | N/A | |
| Warm-starting Push-Relabel | Unknown | N/A | |
| Segment, Shuffle, and Stitch: A Simple Layer for Improving Time-Series Representations | Unknown | N/A | |
| When are dynamical systems learned from time series data statistically accurate? | Unknown | N/A | |
| STL: Still Tricky Logic (for System Validation, Even When Showing Your Work) | Unknown | N/A | |
| FairQueue: Rethinking Prompt Learning for Fair Text-to-Image Generation | Unknown | N/A | |
| Preventing Dimensional Collapse in Self-Supervised Learning via Orthogonality Regularization | Unknown | N/A | |
| Preventing Model Collapse in Deep Canonical Correlation Analysis by Noise Regularization | Unknown | N/A | |
| Measuring Progress in Dictionary Learning for Language Model Interpretability with Board Game Models | Unknown | N/A | |
| Autobidder's Dilemma: Why More Sophisticated Autobidders Lead to Worse Auction Efficiency | Unknown | N/A | |
| Efficiency of the First-Price Auction in the Autobidding World | Unknown | N/A | |
| DiffusionPDE: Generative PDE-Solving under Partial Observation | Unknown | N/A | |
| Towards a Theoretical Understanding of the 'Reversal Curse' via Training Dynamics | Unknown | N/A | |
| Mitigating Covariate Shift in Behavioral Cloning via Robust Stationary Distribution Correction | Unknown | N/A | |
| Learning Cut Generating Functions for Integer Programming | Unknown | N/A | |
| GoMatching: A Simple Baseline for Video Text Spotting via Long and Short Term Matching | Unknown | N/A | |
| Distributed-Order Fractional Graph Operating Network | Unknown | N/A | |
| Rad-NeRF: Ray-decoupled Training of Neural Radiance Field | Unknown | N/A | |
| Enhancing Semi-Supervised Learning via Representative and Diverse Sample Selection | Unknown | N/A | |
| On the Computational Landscape of Replicable Learning | Unknown | N/A | |
| Entrywise error bounds for low-rank approximations of kernel matrices | Unknown | N/A | |
| Cooperative Hardware-Prompt Learning for Snapshot Compressive Imaging | Unknown | N/A | |
| Neural Conditional Probability for Uncertainty Quantification | Unknown | N/A | |
| Learning the Optimal Policy for Balancing Short-Term and Long-Term Rewards | Unknown | N/A | |
| Bridging Model-Based Optimization and Generative Modeling via Conservative Fine-Tuning of Diffusion Models | Unknown | N/A | |
| Is Mamba Compatible with Trajectory Optimization in Offline Reinforcement Learning? | Unknown | N/A | |
| MC-DiT: Contextual Enhancement via Clean-to-Clean Reconstruction for Masked Diffusion Models | Unknown | N/A | |
| Parallelizing Linear Transformers with the Delta Rule over Sequence Length | Unknown | N/A | |
| RCDN: Towards Robust Camera-Insensitivity Collaborative Perception via Dynamic Feature-based 3D Neural Modeling | Unknown | N/A | |
| Fast T2T: Optimization Consistency Speeds Up Diffusion-Based Training-to-Testing Solving for Combinatorial Optimization | Unknown | N/A | |
| KFNN: K-Free Nearest Neighbor For Crowdsourcing | Unknown | N/A | |
| Parsimony or Capability? Decomposition Delivers Both in Long-term Time Series Forecasting | Unknown | N/A | |
| GaussianMarker: Uncertainty-Aware Copyright Protection of 3D Gaussian Splatting | Unknown | N/A | |
| LiT: Unifying LiDAR "Languages" with LiDAR Translator | Unknown | N/A | |
| Training Dynamics of Transformers to Recognize Word Co-occurrence via Gradient Flow Analysis | Unknown | N/A | |
| GREAT Score: Global Robustness Evaluation of Adversarial Perturbation using Generative Models | Unknown | N/A | |
| MoTE: Reconciling Generalization with Specialization for Visual-Language to Video Knowledge Transfer | Unknown | N/A | |
| Homology Consistency Constrained Efficient Tuning for Vision-Language Models | Unknown | N/A | |
| Template-free Articulated Gaussian Splatting for Real-time Reposable Dynamic View Synthesis | Unknown | N/A | |
| AdaNeg: Adaptive Negative Proxy Guided OOD Detection with Vision-Language Models | Unknown | N/A | |
| Lumen: Unleashing Versatile Vision-Centric Capabilities of Large Multimodal Models | Unknown | N/A | |
| BendVLM: Test-Time Debiasing of Vision-Language Embeddings | Unknown | N/A | |
| JiuZhang3.0: Efficiently Improving Mathematical Reasoning by Training Small Data Synthesis Models | Unknown | N/A | |
| Parameter Symmetry and Noise Equilibrium of Stochastic Gradient Descent | Unknown | N/A | |
| Understanding Information Storage and Transfer in Multi-Modal Large Language Models | Unknown | N/A | |
| Enhancing Consistency-Based Image Generation via Adversarially-Trained Classification and Energy-Based Discrimination | Unknown | N/A | |
| Evaluation of Text-to-Video Generation Models: A Dynamics Perspective | Unknown | N/A | |
| Prediction with Action: Visual Policy Learning via Joint Denoising Process | Unknown | N/A | |
| UPS: Unified Projection Sharing for Lightweight Single-Image Super-resolution and Beyond | Unknown | N/A | |
| Reversing the Forget-Retain Objectives: An Efficient LLM Unlearning Framework from Logit Difference | Unknown | N/A | |
| SyncVIS: Synchronized Video Instance Segmentation | Unknown | N/A | |
| An Image is Worth 32 Tokens for Reconstruction and Generation | Unknown | N/A | |
| DG-SLAM: Robust Dynamic Gaussian Splatting SLAM with Hybrid Pose Optimization | Unknown | N/A | |
| Learning-Augmented Dynamic Submodular Maximization | Unknown | N/A | |
| DI-MaskDINO: A Joint Object Detection and Instance Segmentation Model | Unknown | N/A | |
| Association of Objects May Engender Stereotypes: Mitigating Association-Engendered Stereotypes in Text-to-Image Generation | Unknown | N/A | |
| Generalizability of Memorization Neural Network | Unknown | N/A | |
| Universality of AdaGrad Stepsizes for Stochastic Optimization: Inexact Oracle, Acceleration and Variance Reduction | Unknown | N/A | |
| Finding NeMo: Localizing Neurons Responsible For Memorization in Diffusion Models | Unknown | N/A | |
| Neural Signed Distance Function Inference through Splatting 3D Gaussians Pulled on Zero-Level Set | Unknown | N/A | |
| Fast samplers for Inverse Problems in Iterative Refinement models | Unknown | N/A | |
| CLIPLoss and Norm-Based Data Selection Methods for Multimodal Contrastive Learning | Unknown | N/A | |
| On Softmax Direct Preference Optimization for Recommendation | Unknown | N/A | |
| Identifiable Object-Centric Representation Learning via Probabilistic Slot Attention | Unknown | N/A | |
| Tighter Convergence Bounds for Shuffled SGD via Primal-Dual Perspective | Unknown | N/A | |
| What Makes CLIP More Robust to Long-Tailed Pre-Training Data? A Controlled Study for Transferable Insights | Unknown | N/A | |
| A Closer Look at the CLS Token for Cross-Domain Few-Shot Learning | Unknown | N/A | |
| FlexPlanner: Flexible 3D Floorplanning via Deep Reinforcement Learning in Hybrid Action Space with Multi-Modality Representation | Unknown | N/A | |
| Task-Agnostic Machine-Learning-Assisted Inference | Unknown | N/A | |
| End-to-End Video Semantic Segmentation in Adverse Weather using Fusion Blocks and Temporal-Spatial Teacher-Student Learning | Unknown | N/A | |
| OT4P: Unlocking Effective Orthogonal Group Path for Permutation Relaxation | Unknown | N/A | |
| Tight Rates for Bandit Control Beyond Quadratics | Unknown | N/A | |
| Direct Preference-Based Evolutionary Multi-Objective Optimization with Dueling Bandits | Unknown | N/A | |
| Binocular-Guided 3D Gaussian Splatting with View Consistency for Sparse View Synthesis | Unknown | N/A | |
| Referencing Where to Focus: Improving Visual Grounding with Referential Query | Unknown | N/A | |
| Is Value Learning Really the Main Bottleneck in Offline RL? | Unknown | N/A | |
| ActAnywhere: Subject-Aware Video Background Generation | Unknown | N/A | |
| Wings: Learning Multimodal LLMs without Text-only Forgetting | Unknown | N/A | |
| Dual-Personalizing Adapter for Federated Foundation Models | Unknown | N/A | |
| Goal-Conditioned On-Policy Reinforcement Learning | Unknown | N/A | |
| Searching for Efficient Linear Layers over a Continuous Space of Structured Matrices | Unknown | N/A | |
| On the Impacts of the Random Initialization in the Neural Tangent Kernel Theory | Unknown | N/A | |
| Few-Shot Adversarial Prompt Learning on Vision-Language Models | Unknown | N/A | |
| MGF: Mixed Gaussian Flow for Diverse Trajectory Prediction | Unknown | N/A | |
| DMNet: Self-comparison Driven Model for Subject-independent Seizure Detection | Unknown | N/A | |
| Latent Functional Maps: a spectral framework for representation alignment | Unknown | N/A | |
| SWE-agent: Agent-Computer Interfaces Enable Automated Software Engineering | Unknown | N/A | |
| Unified Domain Generalization and Adaptation for Multi-View 3D Object Detection | Unknown | N/A | |
| Human-Object Interaction Detection Collaborated with Large Relation-driven Diffusion Models | Unknown | N/A | |
| Optimal Aggregation of Prediction Intervals under Unsupervised Domain Shift | Unknown | N/A | |
| DC-Gaussian: Improving 3D Gaussian Splatting for Reflective Dash Cam Videos | Unknown | N/A | |
| D2R2: Diffusion-based Representation with Random Distance Matching for Tabular Few-shot Learning | Unknown | N/A | |
| John Ellipsoids via Lazy Updates | Unknown | N/A | |
| QUEST: Quadruple Multimodal Contrastive Learning with Constraints and Self-Penalization | Unknown | N/A | |
| Towards Unified Multimodal Editing with Enhanced Knowledge Collaboration | Unknown | N/A | |
| Localize, Understand, Collaborate: Semantic-Aware Dragging via Intention Reasoner | Unknown | N/A | |
| GTA: Generative Trajectory Augmentation with Guidance for Offline Reinforcement Learning | Unknown | N/A | |
| Multi-Scale Representation Learning for Protein Fitness Prediction | Unknown | N/A | |
| Synthetic Programming Elicitation for Text-to-Code in Very Low-Resource Programming and Formal Languages | Unknown | N/A | |
| Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs | Unknown | N/A | |
| A Canonicalization Perspective on Invariant and Equivariant Learning | Unknown | N/A | |
| Identifying General Mechanism Shifts in Linear Causal Representations | Unknown | N/A | |
| An Expectation-Maximization Algorithm for Training Clean Diffusion Models from Corrupted Observations | Unknown | N/A | |
| Con4m: Context-aware Consistency Learning Framework for Segmented Time Series Classification | Unknown | N/A | |
| Query-Based Adversarial Prompt Generation | Unknown | N/A | |
| Instructor-inspired Machine Learning for Robust Molecular Property Prediction | Unknown | N/A | |
| Customizing Language Models with Instance-wise LoRA for Sequential Recommendation | Unknown | N/A | |
| Guided Trajectory Generation with Diffusion Models for Offline Model-based Optimization | Unknown | N/A | |
| ProEdit: Simple Progression is All You Need for High-Quality 3D Scene Editing | Unknown | N/A | |
| PCP-MAE: Learning to Predict Centers for Point Masked Autoencoders | Unknown | N/A | |
| Flexible mapping of abstract domains by grid cells via self-supervised extraction and projection of generalized velocity signals | Unknown | N/A | |
| A Bayesian Approach for Personalized Federated Learning in Heterogeneous Settings | Unknown | N/A | |
| Optimizing Automatic Differentiation with Deep Reinforcement Learning | Unknown | N/A | |
| Surge Phenomenon in Optimal Learning Rate and Batch Size Scaling | Unknown | N/A | |
| On the Target-kernel Alignment: a Unified Analysis with Kernel Complexity | Unknown | N/A | |
| Online Learning of Delayed Choices | Unknown | N/A | |
| Detecting Bugs with Substantial Monetary Consequences by LLM and Rule-based Reasoning | Unknown | N/A | |
| A-FedPD: Aligning Dual-Drift is All Federated Primal-Dual Learning Needs | Unknown | N/A | |
| The Price of Implicit Bias in Adversarially Robust Generalization | Unknown | N/A | |
| Least Squares Regression Can Exhibit Under-Parameterized Double Descent | Unknown | N/A | |
| KptLLM: Unveiling the Power of Large Language Model for Keypoint Comprehension | Unknown | N/A | |
| Towards Unsupervised Model Selection for Domain Adaptive Object Detection | Unknown | N/A | |
| Fine-Grained Dynamic Framework for Bias-Variance Joint Optimization on Data Missing Not at Random | Unknown | N/A | |
| HyperLogic: Enhancing Diversity and Accuracy in Rule Learning with HyperNets | Unknown | N/A | |
| Parameter Disparities Dissection for Backdoor Defense in Heterogeneous Federated Learning | Unknown | N/A | |
| Long-range Brain Graph Transformer | Unknown | N/A | |
| DeepStack: Deeply Stacking Visual Tokens is Surprisingly Simple and Effective for LMMs | Unknown | N/A | |
| Boundary Decomposition for Nadir Objective Vector Estimation | Unknown | N/A | |
| Active Learning of General Halfspaces: Label Queries vs Membership Queries | Unknown | N/A | |
| SARDet-100K: Towards Open-Source Benchmark and ToolKit for Large-Scale SAR Object Detection | Unknown | N/A | |
| Fair Bilevel Neural Network (FairBiNN): On Balancing fairness and accuracy via Stackelberg Equilibrium | Unknown | N/A | |
| Multi-Head Mixture-of-Experts | Unknown | N/A | |
| Weak-to-Strong Search: Align Large Language Models via Searching over Small Language Models | Unknown | N/A | |
| Knowledge Graph Completion by Intermediate Variables Regularization | Unknown | N/A | |
| Fractal Patterns May Illuminate the Success of Next-Token Prediction | Unknown | N/A | |
| Transformer Doctor: Diagnosing and Treating Vision Transformers | Unknown | N/A | |
| Gliding over the Pareto Front with Uniform Designs | Unknown | N/A | |
| Boosting Transferability and Discriminability for Time Series Domain Adaptation | Unknown | N/A | |
| A Unified Principle of Pessimism for Offline Reinforcement Learning under Model Mismatch | Unknown | N/A | |
| FedGMKD: An Efficient Prototype Federated Learning Framework through Knowledge Distillation and Discrepancy-Aware Aggregation | Unknown | N/A | |
| Revisiting motion information for RGB-Event tracking with MOT philosophy | Unknown | N/A | |
| Long-Range Feedback Spiking Network Captures Dynamic and Static Representations of the Visual Cortex under Movie Stimuli | Unknown | N/A | |
| Adam on Local Time: Addressing Nonstationarity in RL with Relative Adam Timesteps | Unknown | N/A | |
| Optimistic Verifiable Training by Controlling Hardware Nondeterminism | Unknown | N/A | |
| Vivid-ZOO: Multi-View Video Generation with Diffusion Model | Unknown | N/A | |
| Identifying Equivalent Training Dynamics | Unknown | N/A | |
| MatrixNet: Learning over symmetry groups using learned group representations | Unknown | N/A | |
| Markovian Flow Matching: Accelerating MCMC with Continuous Normalizing Flows | Unknown | N/A | |
| Mind the Gap: A Causal Perspective on Bias Amplification in Prediction & Decision-Making | Unknown | N/A | |
| In-and-Out: Algorithmic Diffusion for Sampling Convex Bodies | Unknown | N/A | |
| IWBVT: Instance Weighting-based Bias-Variance Trade-off for Crowdsourcing | Unknown | N/A | |
| Video Diffusion Models are Training-free Motion Interpreter and Controller | Unknown | N/A | |
| TransVIP: Speech to Speech Translation System with Voice and Isochrony Preservation | Unknown | N/A | |
| Truthful High Dimensional Sparse Linear Regression | Unknown | N/A | |
| Generalizable Implicit Motion Modeling for Video Frame Interpolation | Unknown | N/A | |
| Typicalness-Aware Learning for Failure Detection | Unknown | N/A | |
| Implicit Regularization Paths of Weighted Neural Representations | Unknown | N/A | |
| $\beta$-DPO: Direct Preference Optimization with Dynamic $\beta$ | Unknown | N/A | |
| Zero-shot Generalizable Incremental Learning for Vision-Language Object Detection | Unknown | N/A | |
| Collaboration! Towards Robust Neural Methods for Routing Problems | Unknown | N/A | |
| Reprogramming Pretrained Target-Specific Diffusion Models for Dual-Target Drug Design | Unknown | N/A | |
| Tracing Hyperparameter Dependencies for Model Parsing via Learnable Graph Pooling Network | Unknown | N/A | |
| MoME: Mixture of Multimodal Experts for Generalist Multimodal Large Language Models | Unknown | N/A | |
| Data-Driven Discovery of Dynamical Systems in Pharmacology using Large Language Models | Unknown | N/A | |
| FreeLong: Training-Free Long Video Generation with SpectralBlend Temporal Attention | Unknown | N/A | |
| Breaking Long-Tailed Learning Bottlenecks: A Controllable Paradigm with Hypernetwork-Generated Diverse Experts | Unknown | N/A | |
| Spiking Transformer with Experts Mixture | Unknown | N/A | |
| A Functional Extension of Semi-Structured Networks | Unknown | N/A | |
| Universal Neural Functionals | Unknown | N/A | |
| Satformer: Accurate and Robust Traffic Data Estimation for Satellite Networks | Unknown | N/A | |
| LLM-AutoDA: Large Language Model-Driven Automatic Data Augmentation for Long-tailed Problems | Unknown | N/A | |
| A Motion-aware Spatio-temporal Graph for Video Salient Object Ranking | Unknown | N/A | |
| The Bayesian sampling in a canonical recurrent circuit with a diversity of inhibitory interneurons | Unknown | N/A | |
| Energy-based Hopfield Boosting for Out-of-Distribution Detection | Unknown | N/A | |
| Cost-efficient Knowledge-based Question Answering with Large Language Models | Unknown | N/A | |
| Coupled Mamba: Enhanced Multimodal Fusion with Coupled State Space Model | Unknown | N/A | |
| Cryptographic Hardness of Score Estimation | Unknown | N/A | |
| Separation and Bias of Deep Equilibrium Models on Expressivity and Learning Dynamics | Unknown | N/A | |
| Neural Cover Selection for Image Steganography | Unknown | N/A | |
| Task Confusion and Catastrophic Forgetting in Class-Incremental Learning: A Mathematical Framework for Discriminative and Generative Modelings | Unknown | N/A | |
| FLoRA: Federated Fine-Tuning Large Language Models with Heterogeneous Low-Rank Adaptations | Unknown | N/A | |
| Subject-driven Text-to-Image Generation via Preference-based Reinforcement Learning | Unknown | N/A | |
| One-Step Effective Diffusion Network for Real-World Image Super-Resolution | Unknown | N/A | |
| Zero-Shot Scene Reconstruction from Single Images with Deep Prior Assembly | Unknown | N/A | |
| LFME: A Simple Framework for Learning from Multiple Experts in Domain Generalization | Unknown | N/A | |
| Stabilized Proximal-Point Methods for Federated Optimization | Unknown | N/A | |
| Mixtures of Experts for Audio-Visual Learning | Unknown | N/A | |
| Computing the Bias of Constant-step Stochastic Approximation with Markovian Noise | Unknown | N/A | |
| DDN: Dual-domain Dynamic Normalization for Non-stationary Time Series Forecasting | Unknown | N/A | |
| On Giant's Shoulders: Effortless Weak to Strong by Dynamic Logits Fusion | Unknown | N/A | |
| Improving Generalization of Dynamic Graph Learning via Environment Prompt | Unknown | N/A | |
| Parameter Efficient Adaptation for Image Restoration with Heterogeneous Mixture-of-Experts | Unknown | N/A | |
| The motion planning neural circuit in goal-directed navigation as Lie group operator search | Unknown | N/A | |
| Coevolving with the Other You: Fine-Tuning LLM with Sequential Cooperative Multi-Agent Reinforcement Learning | Unknown | N/A | |
| Faster Differentially Private Top-$k$ Selection: A Joint Exponential Mechanism with Pruning | Unknown | N/A | |
| Hybrid Mamba for Few-Shot Segmentation | Unknown | N/A | |
| Graph-based Uncertainty Metrics for Long-form Language Model Generations | Unknown | N/A | |
| Cross-video Identity Correlating for Person Re-identification Pre-training | Unknown | N/A | |
| Statistical-Computational Trade-offs for Density Estimation | Unknown | N/A | |
| Sharpness-Aware Minimization Activates the Interactive Teaching's Understanding and Optimization | Unknown | N/A | |
| KnowGPT: Knowledge Graph based Prompting for Large Language Models | Unknown | N/A | |
| The Reliability of OKRidge Method in Solving Sparse Ridge Regression Problems | Unknown | N/A | |
| Efficient Graph Matching for Correlated Stochastic Block Models | Unknown | N/A | |
| GDeR: Safeguarding Efficiency, Balancing, and Robustness via Prototypical Graph Pruning | Unknown | N/A | |
| Accelerating Relative Entropy Coding with Space Partitioning | Unknown | N/A | |
| Diffusion Priors for Variational Likelihood Estimation and Image Denoising | Unknown | N/A | |
| On the Computational Complexity of Private High-dimensional Model Selection | Unknown | N/A | |
| Observational Scaling Laws and the Predictability of Language Model Performance | Unknown | N/A | |
| Decomposable Transformer Point Processes | Unknown | N/A | |
| ST$_k$: A Scalable Module for Solving Top-k Problems | Unknown | N/A | |
| Enhancing Zero-Shot Vision Models by Label-Free Prompt Distribution Learning and Bias Correcting | Unknown | N/A | |
| ANAH-v2: Scaling Analytical Hallucination Annotation of Large Language Models | Unknown | N/A | |
| Test-Time Dynamic Image Fusion | Unknown | N/A | |
| G2D: From Global to Dense Radiography Representation Learning via Vision-Language Pre-training | Unknown | N/A | |
| Nearest Neighbor Speculative Decoding for LLM Generation and Attribution | Unknown | N/A | |
| A Local Method for Satisfying Interventional Fairness with Partially Known Causal Graphs | Unknown | N/A | |
| UniAudio 1.5: Large Language Model-Driven Audio Codec is A Few-Shot Audio Task Learner | Unknown | N/A | |
| TFG: Unified Training-Free Guidance for Diffusion Models | Unknown | N/A | |
| Understanding Scaling Laws with Statistical and Approximation Theory for Transformer Neural Networks on Intrinsically Low-dimensional Data | Unknown | N/A | |
| Boosting Weakly Supervised Referring Image Segmentation via Progressive Comprehension | Unknown | N/A | |
| You Only Look Around: Learning Illumination-Invariant Feature for Low-light Object Detection | Unknown | N/A | |
| On Sparse Canonical Correlation Analysis | Unknown | N/A | |
| SLTrain: a sparse plus low rank approach for parameter and memory efficient pretraining | Unknown | N/A | |
| Learning Disentangled Representations for Perceptual Point Cloud Quality Assessment via Mutual Information Minimization | Unknown | N/A | |
| G-Retriever: Retrieval-Augmented Generation for Textual Graph Understanding and Question Answering | Unknown | N/A | |
| RanDumb: Random Representations Outperform Online Continually Learned Representations | Unknown | N/A | |
| From Text to Trajectory: Exploring Complex Constraint Representation and Decomposition in Safe Reinforcement Learning | Unknown | N/A | |
| Online Classification with Predictions | Unknown | N/A | |
| A Framework for Bilevel Optimization on Riemannian Manifolds | Unknown | N/A | |
| Multivariate Probabilistic Time Series Forecasting with Correlated Errors | Unknown | N/A | |
| Talking Heads: Understanding Inter-Layer Communication in Transformer Language Models | Unknown | N/A | |
| Learning to Handle Complex Constraints for Vehicle Routing Problems | Unknown | N/A | |
| Near-Optimal Dynamic Regret for Adversarial Linear Mixture MDPs | Unknown | N/A | |
| NeuroGauss4D-PCI: 4D Neural Fields and Gaussian Deformation Fields for Point Cloud Interpolation | Unknown | N/A | |
| Learning to Price Homogeneous Data | Unknown | N/A | |
| Connecting Joint-Embedding Predictive Architecture with Contrastive Self-supervised Learning | Unknown | N/A | |
| Adaptive Layer Sparsity for Large Language Models via Activation Correlation Assessment | Unknown | N/A | |
| Resfusion: Denoising Diffusion Probabilistic Models for Image Restoration Based on Prior Residual Noise | Unknown | N/A | |
| Can Language Models Perform Robust Reasoning in Chain-of-thought Prompting with Noisy Rationales? | Unknown | N/A | |
| Distribution Guidance Network for Weakly Supervised Point Cloud Semantic Segmentation | Unknown | N/A | |
| Transcoders find interpretable LLM feature circuits | Unknown | N/A | |
| Lower Bounds of Uniform Stability in Gradient-Based Bilevel Algorithms for Hyperparameter Optimization | Unknown | N/A | |
| Real-time Stereo-based 3D Object Detection for Streaming Perception | Unknown | N/A | |
| Soft ascent-descent as a stable and flexible alternative to flooding | Unknown | N/A | |
| Probabilistic Emulation of a Global Climate Model with Spherical DYffusion | Unknown | N/A | |
| Lower Bounds and Optimal Algorithms for Non-Smooth Convex Decentralized Optimization over Time-Varying Networks | Unknown | N/A | |
| Rethinking Score Distillation as a Bridge Between Image Distributions | Unknown | N/A | |
| Revisiting Self-Supervised Heterogeneous Graph Learning from Spectral Clustering Perspective | Unknown | N/A | |
| Variance estimation in compound decision theory under boundedness | Unknown | N/A | |
| Time-FFM: Towards LM-Empowered Federated Foundation Model for Time Series Forecasting | Unknown | N/A | |
| SCOREQ: Speech Quality Assessment with Contrastive Regression | Unknown | N/A | |
| DOGS: Distributed-Oriented Gaussian Splatting for Large-Scale 3D Reconstruction Via Gaussian Consensus | Unknown | N/A | |
| What Is Missing For Graph Homophily? Disentangling Graph Homophily For Graph Neural Networks | Unknown | N/A | |
| SpGesture: Source-Free Domain-adaptive sEMG-based Gesture Recognition with Jaccard Attentive Spiking Neural Network | Unknown | N/A | |
| CoLoR-Filter: Conditional Loss Reduction Filtering for Targeted Language Model Pre-training | Unknown | N/A | |
| Pre-training Differentially Private Models with Limited Public Data | Unknown | N/A | |
| Antigen-Specific Antibody Design via Direct Energy-based Preference Optimization | Unknown | N/A | |
| Transferability Bound Theory: Exploring Relationship between Adversarial Transferability and Flatness | Unknown | N/A | |
| Efficient Availability Attacks against Supervised and Contrastive Learning Simultaneously | Unknown | N/A | |
| A Critical Evaluation of AI Feedback for Aligning Large Language Models | Unknown | N/A | |
| StreamFlow: Streamlined Multi-Frame Optical Flow Estimation for Video Sequences | Unknown | N/A | |
| AutoTimes: Autoregressive Time Series Forecasters via Large Language Models | Unknown | N/A | |
| Using Surrogates in Covariate-adjusted Response-adaptive Randomization Experiments with Delayed Outcomes | Unknown | N/A | |
| AutoSurvey: Large Language Models Can Automatically Write Surveys | Unknown | N/A | |
| Dimension-free deterministic equivalents and scaling laws for random feature regression | Unknown | N/A | |
| ZeroMark: Towards Dataset Ownership Verification without Disclosing Watermark | Unknown | N/A | |
| Towards a Scalable Reference-Free Evaluation of Generative Models | Unknown | N/A | |
| Graph Diffusion Transformers for Multi-Conditional Molecular Generation | Unknown | N/A | |
| SE(3)-bi-equivariant Transformers for Point Cloud Assembly | Unknown | N/A | |
| Conformalized Time Series with Semantic Features | Unknown | N/A | |
| Addressing Spatial-Temporal Heterogeneity: General Mixed Time Series Analysis via Latent Continuity Recovery and Alignment | Unknown | N/A | |
| Emergence of heavy tails in homogenized stochastic gradient descent | Unknown | N/A | |
| The Implicit Bias of Heterogeneity towards Invariance: A Study of Multi-Environment Matrix Sensing | Unknown | N/A | |
| Learning 3D Garment Animation from Trajectories of A Piece of Cloth | Unknown | N/A | |
| On Mesa-Optimization in Autoregressively Trained Transformers: Emergence and Capability | Unknown | N/A | |
| Rethinking 3D Convolution in $\ell_p$-norm Space | Unknown | N/A | |
| A Siamese Transformer with Hierarchical Refinement for Lane Detection | Unknown | N/A | |
| HORSE: Hierarchical Representation for Large-Scale Neural Subset Selection | Unknown | N/A | |
| Pre-Trained Multi-Goal Transformers with Prompt Optimization for Efficient Online Adaptation | Unknown | N/A | |
| Segment Any Change | Unknown | N/A | |
| LSH-MoE: Communication-efficient MoE Training via Locality-Sensitive Hashing | Unknown | N/A | |
| Spike-based Neuromorphic Model for Sound Source Localization | Unknown | N/A | |
| Free-Rider and Conflict Aware Collaboration Formation for Cross-Silo Federated Learning | Unknown | N/A | |
| Ex Uno Pluria: Insights on Ensembling in Low Precision Number Systems | Unknown | N/A | |
| Improving Linear System Solvers for Hyperparameter Optimisation in Iterative Gaussian Processes | Unknown | N/A | |
| Fully Unconstrained Online Learning | Unknown | N/A | |
| Clustering then Propagation: Select Better Anchors for Knowledge Graph Embedding | Unknown | N/A | |
| SELF-DISCOVER: Large Language Models Self-Compose Reasoning Structures | Unknown | N/A | |
| Sample Complexity of Posted Pricing for a Single Item | Unknown | N/A | |
| Model Sensitivity Aware Continual Learning | Unknown | N/A | |
| GENOT: Entropic (Gromov) Wasserstein Flow Matching with Applications to Single-Cell Genomics | Unknown | N/A | |
| Learning to Embed Distributions via Maximum Kernel Entropy | Unknown | N/A | |
| From News to Forecast: Integrating Event Analysis in LLM-Based Time Series Forecasting with Reflection | Unknown | N/A | |
| Connectivity Shapes Implicit Regularization in Matrix Factorization Models for Matrix Completion | Unknown | N/A | |
| Cluster-Learngene: Inheriting Adaptive Clusters for Vision Transformers | Unknown | N/A | |
| Evidence of Learned Look-Ahead in a Chess-Playing Neural Network | Unknown | N/A | |
| BoostAdapter: Improving Vision-Language Test-Time Adaptation via Regional Bootstrapping | Unknown | N/A | |
| Provable Benefit of Cutout and CutMix for Feature Learning | Unknown | N/A | |
| MambaAD: Exploring State Space Models for Multi-class Unsupervised Anomaly Detection | Unknown | N/A | |
| Make-it-Real: Unleashing Large Multimodal Model for Painting 3D Objects with Realistic Materials | Unknown | N/A | |
| Pin-Tuning: Parameter-Efficient In-Context Tuning for Few-Shot Molecular Property Prediction | Unknown | N/A | |
| Twin-Merging: Dynamic Integration of Modular Expertise in Model Merging | Unknown | N/A | |
| HENASY: Learning to Assemble Scene-Entities for Interpretable Egocentric Video-Language Model | Unknown | N/A | |
| Dissecting the Failure of Invariant Learning on Graphs | Unknown | N/A | |
| Do's and Don'ts: Learning Desirable Skills with Instruction Videos | Unknown | N/A | |
| RouterDC: Query-Based Router by Dual Contrastive Learning for Assembling Large Language Models | Unknown | N/A | |
| Uncovering the Redundancy in Graph Self-supervised Learning Models | Unknown | N/A | |
| A Boosting-Type Convergence Result for AdaBoost.MH with Factorized Multi-Class Classifiers | Unknown | N/A | |
| Error Analysis of Spherically Constrained Least Squares Reformulation in Solving the Stackelberg Prediction Game | Unknown | N/A | |
| Neural Characteristic Activation Analysis and Geometric Parameterization for ReLU Networks | Unknown | N/A | |
| Diffusion Policies Creating a Trust Region for Offline Reinforcement Learning | Unknown | N/A | |
| DiffGS: Functional Gaussian Splatting Diffusion | Unknown | N/A | |
| Prompt Optimization with EASE? Efficient Ordering-aware Automated Selection of Exemplars | Unknown | N/A | |
| Confusion-Resistant Federated Learning via Diffusion-Based Data Harmonization on Non-IID Data | Unknown | N/A | |
| IMAGPose: A Unified Conditional Framework for Pose-Guided Person Generation | Unknown | N/A | |
| Policy Optimization for Robust Average Reward MDPs | Unknown | N/A | |
| Text2CAD: Generating Sequential CAD Designs from Beginner-to-Expert Level Text Prompts | Unknown | N/A | |
| Hierarchical and Density-based Causal Clustering | Unknown | N/A | |
| Hypothesis Testing the Circuit Hypothesis in LLMs | Unknown | N/A | |
| Transfer Q-star: Principled Decoding for LLM Alignment | Unknown | N/A | |
| e-COP: Episodic Constrained Optimization of Policies | Unknown | N/A | |
| Relating Hopfield Networks to Episodic Control | Unknown | N/A | |
| LCGen: Mining in Low-Certainty Generation for View-consistent Text-to-3D | Unknown | N/A | |
| Alleviate Anchor-Shift: Explore Blind Spots with Cross-View Reconstruction for Incomplete Multi-View Clustering | Unknown | N/A | |
| Improving Robustness of 3D Point Cloud Recognition from a Fourier Perspective | Unknown | N/A | |
| WATT: Weight Average Test Time Adaptation of CLIP | Unknown | N/A | |
| OASIS: Conditional Distribution Shaping for Offline Safe Reinforcement Learning | Unknown | N/A | |
| Nearly Optimal Approximation of Matrix Functions by the Lanczos Method | Unknown | N/A | |
| Conformal Classification with Equalized Coverage for Adaptively Selected Groups | Unknown | N/A | |
| Diffusion-based Layer-wise Semantic Reconstruction for Unsupervised Out-of-Distribution Detection | Unknown | N/A | |
| Theoretical Foundations of Deep Selective State-Space Models | Unknown | N/A | |
| Tiny Time Mixers (TTMs): Fast Pre-trained Models for Enhanced Zero/Few-Shot Forecasting of Multivariate Time Series | Unknown | N/A | |
| LLM Circuit Analyses Are Consistent Across Training and Scale | Unknown | N/A | |
| LLMs as Zero-shot Graph Learners: Alignment of GNN Representations with LLM Token Embeddings | Unknown | N/A | |
| Exploring Fixed Point in Image Editing: Theoretical Support and Convergence Optimization | Unknown | N/A | |
| SMART: Scalable Multi-agent Real-time Motion Generation via Next-token Prediction | Unknown | N/A | |
| Meta-Exploiting Frequency Prior for Cross-Domain Few-Shot Learning | Unknown | N/A | |
| Distributional Preference Alignment of LLMs via Optimal Transport | Unknown | N/A | |
| Provably Mitigating Overoptimization in RLHF: Your SFT Loss is Implicitly an Adversarial Regularizer | Unknown | N/A | |
| Can Models Learn Skill Composition from Examples? | Unknown | N/A | |
| FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner | Unknown | N/A | |
| Warm-up Free Policy Optimization: Improved Regret in Linear Markov Decision Processes | Unknown | N/A | |
| Multilingual Diversity Improves Vision-Language Representations | Unknown | N/A | |
| Diversity Is Not All You Need: Training A Robust Cooperative Agent Needs Specialist Partners | Unknown | N/A | |
| Enhancing Domain Adaptation through Prompt Gradient Alignment | Unknown | N/A | |
| Fourier Amplitude and Correlation Loss: Beyond Using L2 Loss for Skillful Precipitation Nowcasting | Unknown | N/A | |
| Reliable Learning of Halfspaces under Gaussian Marginals | Unknown | N/A | |
| Grounding Multimodal Large Language Models in Actions | Unknown | N/A | |
| Improved Analysis for Bandit Learning in Matching Markets | Unknown | N/A | |
| SyncTweedies: A General Generative Framework Based on Synchronized Diffusions | Unknown | N/A | |
| PointAD: Comprehending 3D Anomalies from Points and Pixels for Zero-shot 3D Anomaly Detection | Unknown | N/A | |
| Unveiling the Bias Impact on Symmetric Moral Consistency of Large Language Models | Unknown | N/A | |
| Mixture of Demonstrations for In-Context Learning | Unknown | N/A | |
| Monomial Matrix Group Equivariant Neural Functional Networks | Unknown | N/A | |
| Vitron: A Unified Pixel-level Vision LLM for Understanding, Generating, Segmenting, Editing | Unknown | N/A | |
| Precipitation Downscaling with Spatiotemporal Video Diffusion | Unknown | N/A | |
| Brain-JEPA: Brain Dynamics Foundation Model with Gradient Positioning and Spatiotemporal Masking | Unknown | N/A | |
| Scalable DBSCAN with Random Projections | Unknown | N/A | |
| Reconstruction of Manipulated Garment with Guided Deformation Prior | Unknown | N/A | |
| A Simple Remedy for Dataset Bias via Self-Influence: A Mislabeled Sample Perspective | Unknown | N/A | |
| User-item fairness tradeoffs in recommendations | Unknown | N/A | |
| Transductive Learning is Compact | Unknown | N/A | |
| On the Scalability of Certified Adversarial Robustness with Generated Data | Unknown | N/A | |
| Provably Robust Score-Based Diffusion Posterior Sampling for Plug-and-Play Image Reconstruction | Unknown | N/A | |
| Learning and Transferring Sparse Contextual Bigrams with Linear Transformers | Unknown | N/A | |
| Resource-Aware Federated Self-Supervised Learning with Global Class Representations | Unknown | N/A | |
| Local to Global: Learning Dynamics and Effect of Initialization for Transformers | Unknown | N/A | |
| Risk-sensitive control as inference with Rényi divergence | Unknown | N/A | |
| GUIDE: Real-Time Human-Shaped Agents | Unknown | N/A | |
| An effective framework for estimating individualized treatment rules | Unknown | N/A | |
| SHMT: Self-supervised Hierarchical Makeup Transfer via Latent Diffusion Models | Unknown | N/A | |
| Enhancing Feature Diversity Boosts Channel-Adaptive Vision Transformers | Unknown | N/A | |
| 2DQuant: Low-bit Post-Training Quantization for Image Super-Resolution | Unknown | N/A | |
| CLIPAway: Harmonizing focused embeddings for removing objects via diffusion models | Unknown | N/A | |
| Supervised Kernel Thinning | Unknown | N/A | |
| AED: Adaptable Error Detection for Few-shot Imitation Policy | Unknown | N/A | |
| Active, anytime-valid risk controlling prediction sets | Unknown | N/A | |
| ReEvo: Large Language Models as Hyper-Heuristics with Reflective Evolution | Unknown | N/A | |
| InstructG2I: Synthesizing Images from Multimodal Attributed Graphs | Unknown | N/A | |
| Lips Are Lying: Spotting the Temporal Inconsistency between Audio and Visual in Lip-Syncing DeepFakes | Unknown | N/A | |
| Vista: A Generalizable Driving World Model with High Fidelity and Versatile Controllability | Unknown | N/A | |
| Towards Human-AI Complementarity with Prediction Sets | Unknown | N/A | |
| Evaluate then Cooperate: Shapley-based View Cooperation Enhancement for Multi-view Clustering | Unknown | N/A | |
| LOVA3: Learning to Visual Question Answering, Asking and Assessment | Unknown | N/A | |
| FilterNet: Harnessing Frequency Filters for Time Series Forecasting | Unknown | N/A | |
| Enriching Disentanglement: From Logical Definitions to Quantitative Metrics | Unknown | N/A | |
| Latent Representation Matters: Human-like Sketches in One-shot Drawing Tasks | Unknown | N/A | |
| UniFL: Improve Latent Diffusion Model via Unified Feedback Learning | Unknown | N/A | |
| Learning to Balance Altruism and Self-interest Based on Empathy in Mixed-Motive Games | Unknown | N/A | |
| Aligning to Thousands of Preferences via System Message Generalization | Unknown | N/A | |
| F-OAL: Forward-only Online Analytic Learning with Fast Training and Low Memory Footprint in Class Incremental Learning | Unknown | N/A | |
| RG-SAN: Rule-Guided Spatial Awareness Network for End-to-End 3D Referring Expression Segmentation | Unknown | N/A | |
| AsCAN: Asymmetric Convolution-Attention Networks for Efficient Recognition and Generation | Unknown | N/A | |
| Prune and Repaint: Content-Aware Image Retargeting for any Ratio | Unknown | N/A | |
| Hallo3D: Multi-Modal Hallucination Detection and Mitigation for Consistent 3D Content Generation | Unknown | N/A | |
| Leveraging Catastrophic Forgetting to Develop Safe Diffusion Models against Malicious Finetuning | Unknown | N/A | |
| From an Image to a Scene: Learning to Imagine the World from a Million 360° Videos | Unknown | N/A | |
| WildTeaming at Scale: From In-the-Wild Jailbreaks to (Adversarially) Safer Language Models | Unknown | N/A | |
| WelQrate: Defining the Gold Standard in Small Molecule Drug Discovery Benchmarking | Unknown | N/A | |
| Prior-itizing Privacy: A Bayesian Approach to Setting the Privacy Budget in Differential Privacy | Unknown | N/A | |
| Improving Adversarial Robust Fairness via Anti-Bias Soft Label Distillation | Unknown | N/A | |
| LocCa: Visual Pretraining with Location-aware Captioners | Unknown | N/A | |
| Soft-Label Integration for Robust Toxicity Classification | Unknown | N/A | |
| Sm: enhanced localization in Multiple Instance Learning for medical imaging classification | Unknown | N/A | |
| GraphVis: Boosting LLMs with Visual Knowledge Graph Integration | Unknown | N/A | |
| Faster Algorithms for User-Level Private Stochastic Convex Optimization | Unknown | N/A | |
| Nonparametric Classification on Low Dimensional Manifolds using Overparameterized Convolutional Residual Networks | Unknown | N/A | |
| Harmonizing Visual Text Comprehension and Generation | Unknown | N/A | |
| MaVEn: An Effective Multi-granularity Hybrid Visual Encoding Framework for Multimodal Large Language Model | Unknown | N/A | |
| LinNet: Linear Network for Efficient Point Cloud Representation Learning | Unknown | N/A | |
| GeoLRM: Geometry-Aware Large Reconstruction Model for High-Quality 3D Gaussian Generation | Unknown | N/A | |
| Exploring DCN-like architecture for fast image generation with arbitrary resolution | Unknown | N/A | |
| A Versatile Diffusion Transformer with Mixture of Noise Levels for Audiovisual Generation | Unknown | N/A | |
| Oja's Algorithm for Streaming Sparse PCA | Unknown | N/A | |
| UnSeg: One Universal Unlearnable Example Generator is Enough against All Image Segmentation | Unknown | N/A | |
| Can Graph Learning Improve Planning in LLM-based Agents? | Unknown | N/A | |
| Efficient LLM Jailbreak via Adaptive Dense-to-sparse Constrained Optimization | Unknown | N/A | |
| GarmentLab: A Unified Simulation and Benchmark for Garment Manipulation | Unknown | N/A | |
| VMamba: Visual State Space Model | Unknown | N/A | |
| Regularized Conditional Diffusion Model for Multi-Task Preference Alignment | Unknown | N/A | |
| SpeedLoader: An I/O efficient scheme for heterogeneous and distributed LLM operation | Unknown | N/A | |
| Amortized Planning with Large-Scale Transformers: A Case Study on Chess | Unknown | N/A | |
| Era3D: High-Resolution Multiview Diffusion using Efficient Row-wise Attention | Unknown | N/A | |
| Probabilistic Decomposed Linear Dynamical Systems for Robust Discovery of Latent Neural Dynamics | Unknown | N/A | |
| MVInpainter: Learning Multi-View Consistent Inpainting to Bridge 2D and 3D Editing | Unknown | N/A | |
| LLaMo: Large Language Model-based Molecular Graph Assistant | Unknown | N/A | |
| Does Video-Text Pretraining Help Open-Vocabulary Online Action Detection? | Unknown | N/A | |
| Deep Graph Neural Networks via Posteriori-Sampling-based Node-Adaptative Residual Module | Unknown | N/A | |
| WizardArena: Post-training Large Language Models via Simulated Offline Chatbot Arena | Unknown | N/A | |
| MambaSCI: Efficient Mamba-UNet for Quad-Bayer Patterned Video Snapshot Compressive Imaging | Unknown | N/A | |
| A Huber Loss Minimization Approach to Mean Estimation under User-level Differential Privacy | Unknown | N/A | |
| Open-Vocabulary Object Detection via Language Hierarchy | Unknown | N/A | |
| GITA: Graph to Visual and Textual Integration for Vision-Language Graph Reasoning | Unknown | N/A | |
| SpeechAlign: Aligning Speech Generation to Human Preferences | Unknown | N/A | |
| Swift Sampler: Efficient Learning of Sampler by 10 Parameters | Unknown | N/A | |
| Scalable Optimization in the Modular Norm | Unknown | N/A | |
| Déjà Vu Memorization in Vision–Language Models | Unknown | N/A | |
| Conditional Controllable Image Fusion | Unknown | N/A | |
| Grasp as You Say: Language-guided Dexterous Grasp Generation | Unknown | N/A | |
| Upping the Game: How 2D U-Net Skip Connections Flip 3D Segmentation | Unknown | N/A | |
| Generative Hierarchical Materials Search | Unknown | N/A | |
| Self-supervised Transformation Learning for Equivariant Representations | Unknown | N/A | |
| Leveraging Separated World Model for Exploration in Visually Distracted Environments | Unknown | N/A | |
| Hyper-SD: Trajectory Segmented Consistency Model for Efficient Image Synthesis | Unknown | N/A | |
| Understanding Visual Feature Reliance through the Lens of Complexity | Unknown | N/A | |
| A Modular Conditional Diffusion Framework for Image Reconstruction | Unknown | N/A | |
| EfficientCAPER: An End-to-End Framework for Fast and Robust Category-Level Articulated Object Pose Estimation | Unknown | N/A | |
| Open LLMs are Necessary for Current Private Adaptations and Outperform their Closed Alternatives | Unknown | N/A | |
| Explicit Eigenvalue Regularization Improves Sharpness-Aware Minimization | Unknown | N/A | |
| GAMap: Zero-Shot Object Goal Navigation with Multi-Scale Geometric-Affordance Guidance | Unknown | N/A | |
| Universal In-Context Approximation By Prompting Fully Recurrent Models | Unknown | N/A | |
| Physical Consistency Bridges Heterogeneous Data in Molecular Multi-Task Learning | Unknown | N/A | |
| Improving Gloss-free Sign Language Translation by Reducing Representation Density | Unknown | N/A | |
| On Affine Homotopy between Language Encoders | Unknown | N/A | |
| HonestLLM: Toward an Honest and Helpful Large Language Model | Unknown | N/A | |
| Generalized Linear Bandits with Limited Adaptivity | Unknown | N/A | |
| Segmenting Watermarked Texts From Language Models | Unknown | N/A | |
| Diffusion-DICE: In-Sample Diffusion Guidance for Offline Reinforcement Learning | Unknown | N/A | |
| FuseFL: One-Shot Federated Learning through the Lens of Causality with Progressive Model Fusion | Unknown | N/A | |
| TAPTRv2: Attention-based Position Update Improves Tracking Any Point | Unknown | N/A | |
| DPIC: Decoupling Prompt and Intrinsic Characteristics for LLM Generated Text Detection | Unknown | N/A | |
| On improved Conditioning Mechanisms and Pre-training Strategies for Diffusion Models | Unknown | N/A | |
| Understanding Transformer Reasoning Capabilities via Graph Algorithms | Unknown | N/A | |
| 4-bit Shampoo for Memory-Efficient Network Training | Unknown | N/A | |
| $\boldsymbol{\mu}\mathbf{P^2}$: Effective Sharpness Aware Minimization Requires Layerwise Perturbation Scaling | Unknown | N/A | |
| Efficient Lifelong Model Evaluation in an Era of Rapid Progress | Unknown | N/A | |
| CV-VAE: A Compatible Video VAE for Latent Generative Video Models | Unknown | N/A | |
| Improved Generation of Adversarial Examples Against Safety-aligned LLMs | Unknown | N/A | |
| Offline Behavior Distillation | Unknown | N/A | |
| Learning from Pattern Completion: Self-supervised Controllable Generation | Unknown | N/A | |
| DMPlug: A Plug-in Method for Solving Inverse Problems with Diffusion Models | Unknown | N/A | |
| ContextCite: Attributing Model Generation to Context | Unknown | N/A | |
| Lighting Every Darkness with 3DGS: Fast Training and Real-Time Rendering for HDR View Synthesis | Unknown | N/A | |
| Robust Contrastive Multi-view Clustering against Dual Noisy Correspondence | Unknown | N/A | |
| VASA-1: Lifelike Audio-Driven Talking Faces Generated in Real Time | Unknown | N/A | |
| ArkVale: Efficient Generative LLM Inference with Recallable Key-Value Eviction | Unknown | N/A | |
| E2E-MFD: Towards End-to-End Synchronous Multimodal Fusion Detection | Unknown | N/A | |
| A Swiss Army Knife for Heterogeneous Federated Learning: Flexible Coupling via Trace Norm | Unknown | N/A | |
| I2EBench: A Comprehensive Benchmark for Instruction-based Image Editing | Unknown | N/A | |
| Flow Priors for Linear Inverse Problems via Iterative Corrupted Trajectory Matching | Unknown | N/A | |
| Learning to Shape In-distribution Feature Space for Out-of-distribution Detection | Unknown | N/A | |
| Voxel Proposal Network via Multi-Frame Knowledge Distillation for Semantic Scene Completion | Unknown | N/A | |
| Bridge the Modality and Capability Gaps in Vision-Language Model Selection | Unknown | N/A | |
| Multiclass Transductive Online Learning | Unknown | N/A | |
| On the cohesion and separability of average-link for hierarchical agglomerative clustering | Unknown | N/A | |
| BAM! Just Like That: Simple and Efficient Parameter Upcycling for Mixture of Experts | Unknown | N/A | |
| A Method for Evaluating Hyperparameter Sensitivity in Reinforcement Learning | Unknown | N/A | |
| Theoretical Analysis of Weak-to-Strong Generalization | Unknown | N/A | |
| A Foundation Model for Zero-shot Logical Query Reasoning | Unknown | N/A | |
| Generalized Fast Exact Conformalization | Unknown | N/A | |
| QBB: Quantization with Binary Bases for LLMs | Unknown | N/A | |
| WaterMax: breaking the LLM watermark detectability-robustness-quality trade-off | Unknown | N/A | |
| Enhancing Chess Reinforcement Learning with Graph Representation | Unknown | N/A | |
| Occupancy-based Policy Gradient: Estimation, Convergence, and Optimality | Unknown | N/A | |
| Towards Stable Representations for Protein Interface Prediction | Unknown | N/A | |
| Almost Surely Asymptotically Constant Graph Neural Networks | Unknown | N/A | |
| Pricing and Competition for Generative AI | Unknown | N/A | |
| Variational Delayed Policy Optimization | Unknown | N/A | |
| GREATS: Online Selection of High-Quality Data for LLM Training in Every Iteration | Unknown | N/A | |
| Neuro-Symbolic Data Generation for Math Reasoning | Unknown | N/A | |
| DropEdge not Foolproof: Effective Augmentation Method for Signed Graph Neural Networks | Unknown | N/A | |
| Local Linearity: the Key for No-regret Reinforcement Learning in Continuous MDPs | Unknown | N/A | |
| What does guidance do? A fine-grained analysis in a simple setting | Unknown | N/A | |
| RankRAG: Unifying Context Ranking with Retrieval-Augmented Generation in LLMs | Unknown | N/A | |
| Self-Guided Masked Autoencoder | Unknown | N/A | |
| Lever LM: Configuring In-Context Sequence to Lever Large Vision Language Models | Unknown | N/A | |
| On the Surprising Effectiveness of Attention Transfer for Vision Transformers | Unknown | N/A | |
| Cal-DPO: Calibrated Direct Preference Optimization for Language Model Alignment | Unknown | N/A | |
| Rethinking Weight Decay for Robust Fine-Tuning of Foundation Models | Unknown | N/A | |
| Gradient-Free Methods for Nonconvex Nonsmooth Stochastic Compositional Optimization | Unknown | N/A | |
| Improving Adaptivity via Over-Parameterization in Sequence Models | Unknown | N/A | |
| GenArtist: Multimodal LLM as an Agent for Unified Image Generation and Editing | Unknown | N/A | |
| Recurrent neural networks: vanishing and exploding gradients are not the end of the story | Unknown | N/A | |
| Revisiting Differentially Private ReLU Regression | Unknown | N/A | |
| Post-Hoc Reversal: Are We Selecting Models Prematurely? | Unknown | N/A | |
| Stochastic Optimal Control for Diffusion Bridges in Function Spaces | Unknown | N/A | |
| Toward Robust Incomplete Multimodal Sentiment Analysis via Hierarchical Representation Learning | Unknown | N/A | |
| Generalization of Hamiltonian algorithms | Unknown | N/A | |
| Online Bayesian Persuasion Without a Clue | Unknown | N/A | |
| Why Do We Need Weight Decay in Modern Deep Learning? | Unknown | N/A | |
| The Sample Complexity of Gradient Descent in Stochastic Convex Optimization | Unknown | N/A | |
| Online Control in Population Dynamics | Unknown | N/A | |
| Multi-Armed Bandits with Network Interference | Unknown | N/A | |
| Distributionally Robust Reinforcement Learning with Interactive Data Collection: Fundamental Hardness and Near-Optimal Algorithms | Unknown | N/A | |
| Disentangling and mitigating the impact of task similarity for continual learning | Unknown | N/A | |
| Denoising Diffusion Path: Attribution Noise Reduction with An Auxiliary Diffusion Model | Unknown | N/A | |
| Instruction-Guided Visual Masking | Unknown | N/A | |
| Stopping Bayesian Optimization with Probabilistic Regret Bounds | Unknown | N/A | |
| Robust Reinforcement Learning from Corrupted Human Feedback | Unknown | N/A | |
| Can LLMs Implicitly Learn Numeric Parameter Constraints in Data Science APIs? | Unknown | N/A | |
| Sample and Computationally Efficient Robust Learning of Gaussian Single-Index Models | Unknown | N/A | |
| UniIF: Unified Molecule Inverse Folding | Unknown | N/A | |
| Easy Regional Contrastive Learning of Expressive Fashion Representations | Unknown | N/A | |
| On Weak Regret Analysis for Dueling Bandits | Unknown | N/A | |
| Statistical Efficiency of Distributional Temporal Difference Learning | Unknown | N/A | |
| How Transformers Utilize Multi-Head Attention in In-Context Learning? A Case Study on Sparse Linear Regression | Unknown | N/A | |
| Neural Collapse To Multiple Centers For Imbalanced Data | Unknown | N/A | |
| Achieving Optimal Clustering in Gaussian Mixture Models with Anisotropic Covariance Structures | Unknown | N/A | |
| Neural Pose Representation Learning for Generating and Transferring Non-Rigid Object Poses | Unknown | N/A | |
| A Unifying Post-Processing Framework for Multi-Objective Learn-to-Defer Problems | Unknown | N/A | |
| Beyond Primal-Dual Methods in Bandits with Stochastic and Adversarial Constraints | Unknown | N/A | |
| CoFie: Learning Compact Neural Surface Representations with Coordinate Fields | Unknown | N/A | |
| Gated Slot Attention for Efficient Linear-Time Sequence Modeling | Unknown | N/A | |
| Reinforcement Learning Policy as Macro Regulator Rather than Macro Placer | Unknown | N/A | |
| Dual-frame Fluid Motion Estimation with Test-time Optimization and Zero-divergence Loss | Unknown | N/A | |
| Nonlinear dynamics of localization in neural receptive fields | Unknown | N/A | |
| Automated Multi-Task Learning for Joint Disease Prediction on Electronic Health Records | Unknown | N/A | |
| Neglected Hessian component explains mysteries in sharpness regularization | Unknown | N/A | |
| AWT: Transferring Vision-Language Models via Augmentation, Weighting, and Transportation | Unknown | N/A | |
| Where does In-context Learning Happen in Large Language Models? | Unknown | N/A | |
| Online Relational Inference for Evolving Multi-agent Interacting Systems | Unknown | N/A | |
| One for All: Multi-Domain Joint Training for Point Cloud Based 3D Object Detection | Unknown | N/A | |
| A theoretical design of concept sets: improving the predictability of concept bottleneck models | Unknown | N/A | |
| Measuring Per-Unit Interpretability at Scale Without Humans | Unknown | N/A | |
| Conjugated Semantic Pool Improves OOD Detection with Pre-trained Vision-Language Models | Unknown | N/A | |
| Super Consistency of Neural Network Landscapes and Learning Rate Transfer | Unknown | N/A | |
| The Impact of Initialization on LoRA Finetuning Dynamics | Unknown | N/A | |
| Approximation-Aware Bayesian Optimization | Unknown | N/A | |
| Dynamic Conditional Optimal Transport through Simulation-Free Flows | Unknown | N/A | |
| Explaining Datasets in Words: Statistical Models with Natural Language Parameters | Unknown | N/A | |
| Adam with model exponential moving average is effective for nonconvex optimization | Unknown | N/A | |
| Remix-DiT: Mixing Diffusion Transformers for Multi-Expert Denoising | Unknown | N/A | |
| An Analysis of Tokenization: Transformers under Markov Data | Unknown | N/A | |
| Approximated Orthogonal Projection Unit: Stabilizing Regression Network Training Using Natural Gradient | Unknown | N/A | |
| MAmmoTH2: Scaling Instructions from the Web | Unknown | N/A | |
| Bridging Geometric States via Geometric Diffusion Bridge | Unknown | N/A | |
| Leveraging Environment Interaction for Automated PDDL Translation and Planning with Large Language Models | Unknown | N/A | |
| CogVLM: Visual Expert for Pretrained Language Models | Unknown | N/A | |
| Enabling Adaptive Agent Training in Open-Ended Simulators by Targeting Diversity | Unknown | N/A | |
| Efficiency for Free: Ideal Data Are Transportable Representations | Unknown | N/A | |
| Aligning Audio-Visual Joint Representations with an Agentic Workflow | Unknown | N/A | |
| On the Role of Information Structure in Reinforcement Learning for Partially-Observable Sequential Teams and Games | Unknown | N/A | |
| Distribution Learning with Valid Outputs Beyond the Worst-Case | Unknown | N/A | |
| Any2Policy: Learning Visuomotor Policy with Any-Modality | Unknown | N/A | |
| Length Optimization in Conformal Prediction | Unknown | N/A | |
| Autonomous Driving with Spiking Neural Networks | Unknown | N/A | |
| Generalized Eigenvalue Problems with Generative Priors | Unknown | N/A | |
| The Intelligible and Effective Graph Neural Additive Network | Unknown | N/A | |
| Reranking Laws for Language Generation: A Communication-Theoretic Perspective | Unknown | N/A | |
| MInference 1.0: Accelerating Pre-filling for Long-Context LLMs via Dynamic Sparse Attention | Unknown | N/A | |
| Optimal Scalarizations for Sublinear Hypervolume Regret | Unknown | N/A | |
| Theoretical Characterisation of the Gauss Newton Conditioning in Neural Networks | Unknown | N/A | |
| Hyperbolic Embeddings of Supervised Models | Unknown | N/A | |
| Bandits with Preference Feedback: A Stackelberg Game Perspective | Unknown | N/A | |
| Generalized Tensor Decomposition for Understanding Multi-Output Regression under Combinatorial Shifts | Unknown | N/A | |
| Clustering in Causal Attention Masking | Unknown | N/A | |
| MSA Generation with Seqs2Seqs Pretraining: Advancing Protein Structure Predictions | Unknown | N/A | |
| Large Language Model Unlearning | Unknown | N/A | |
| Fairness without Harm: An Influence-Guided Active Sampling Approach | Unknown | N/A | |
| Q-VLM: Post-training Quantization for Large Vision-Language Models | Unknown | N/A | |
| Revealing Distribution Discrepancy by Sampling Transfer in Unlabeled Data | Unknown | N/A | |
| Slicing Vision Transformer for Flexible Inference | Unknown | N/A | |
| Provably Efficient Reinforcement Learning with Multinomial Logit Function Approximation | Unknown | N/A | |
| Measuring Mutual Policy Divergence for Multi-Agent Sequential Exploration | Unknown | N/A | |
| Understanding Multi-Granularity for Open-Vocabulary Part Segmentation | Unknown | N/A | |
| Hybrid Top-Down Global Causal Discovery with Local Search for Linear and Nonlinear Additive Noise Models | Unknown | N/A | |
| Solving Zero-Sum Markov Games with Continuous State via Spectral Dynamic Embedding | Unknown | N/A | |
| Constructing Semantics-Aware Adversarial Examples with a Probabilistic Perspective | Unknown | N/A | |
| Low Precision Local Training is Enough for Federated Learning | Unknown | N/A | |
| The Benefits of Balance: From Information Projections to Variance Reduction | Unknown | N/A | |
| An Accelerated Algorithm for Stochastic Bilevel Optimization under Unbounded Smoothness | Unknown | N/A | |
| Calibrating Reasoning in Language Models with Internal Consistency | Unknown | N/A | |
| Verified Safe Reinforcement Learning for Neural Network Dynamic Models | Unknown | N/A | |
| SEL-BALD: Deep Bayesian Active Learning with Selective Labels | Unknown | N/A | |
| LoTLIP: Improving Language-Image Pre-training for Long Text Understanding | Unknown | N/A | |
| Learning Partitions from Context | Unknown | N/A | |
| Identification of Analytic Nonlinear Dynamical Systems with Non-asymptotic Guarantees | Unknown | N/A | |
| Explanations that reveal all through the definition of encoding | Unknown | N/A | |
| A Compositional Atlas for Algebraic Circuits | Unknown | N/A | |
| In-Trajectory Inverse Reinforcement Learning: Learn Incrementally Before an Ongoing Trajectory Terminates | Unknown | N/A | |
| How many classifiers do we need? | Unknown | N/A | |
| TurboHopp: Accelerated Molecule Scaffold Hopping with Consistency Models | Unknown | N/A | |
| Aligner: Efficient Alignment by Learning to Correct | Unknown | N/A | |
| To Believe or Not to Believe Your LLM: Iterative Prompting for Estimating Epistemic Uncertainty | Unknown | N/A | |
| Generalizable Person Re-identification via Balancing Alignment and Uniformity | Unknown | N/A | |
| Spiking Token Mixer: An event-driven friendly Former structure for spiking neural networks | Unknown | N/A | |
| Belief-State Query Policies for User-Aligned POMDPs | Unknown | N/A | |
| DALD: Improving Logits-based Detector without Logits from Black-box LLMs | Unknown | N/A | |
| Towards Scalable and Stable Parallelization of Nonlinear RNNs | Unknown | N/A | |
| Rough Transformers: Lightweight and Continuous Time Series Modelling through Signature Patching | Unknown | N/A | |
| Biologically Inspired Learning Model for Instructed Vision | Unknown | N/A | |
| Tactile DreamFusion: Exploiting Tactile Sensing for 3D Generation | Unknown | N/A | |
| Stochastic Optimization Schemes for Performative Prediction with Nonconvex Loss | Unknown | N/A | |
| Decision Mamba: A Multi-Grained State Space Model with Self-Evolution Regularization for Offline RL | Unknown | N/A | |
| Adversarial Representation Engineering: A General Model Editing Framework for Large Language Models | Unknown | N/A | |
| A Unified Framework for 3D Scene Understanding | Unknown | N/A | |
| Active Learning with LLMs for Partially Observed and Cost-Aware Scenarios | Unknown | N/A | |
| Entropy testing and its application to testing Bayesian networks | Unknown | N/A | |
| BAN: Detecting Backdoors Activated by Adversarial Neuron Noise | Unknown | N/A | |
| Enhancing Preference-based Linear Bandits via Human Response Time | Unknown | N/A | |
| Parameterized Approximation Schemes for Fair-Range Clustering | Unknown | N/A | |
| Advancing Tool-Augmented Large Language Models: Integrating Insights from Errors in Inference Trees | Unknown | N/A | |
| Randomized Strategic Facility Location with Predictions | Unknown | N/A | |
| Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy | Unknown | N/A | |
| The Importance of Being Scalable: Improving the Speed and Accuracy of Neural Network Interatomic Potentials Across Chemical Domains | Unknown | N/A | |
| Prospective Learning: Learning for a Dynamic Future | Unknown | N/A | |
| General Articulated Objects Manipulation in Real Images via Part-Aware Diffusion Process | Unknown | N/A | |
| Provable and Efficient Dataset Distillation for Kernel Ridge Regression | Unknown | N/A | |
| Learning to Assist Humans without Inferring Rewards | Unknown | N/A | |
| NoiseGPT: Label Noise Detection and Rectification through Probability Curvature | Unknown | N/A | |
| Markov Equivalence and Consistency in Differentiable Structure Learning | Unknown | N/A | |
| Layer-Adaptive State Pruning for Deep State Space Models | Unknown | N/A | |
| Diffusion Model with Cross Attention as an Inductive Bias for Disentanglement | Unknown | N/A | |
| Ensemble sampling for linear bandits: small ensembles suffice | Unknown | N/A | |
| Online Adaptation of Language Models with a Memory of Amortized Contexts | Unknown | N/A | |
| Towards an Information Theoretic Framework of Context-Based Offline Meta-Reinforcement Learning | Unknown | N/A | |
| Estimating Ego-Body Pose from Doubly Sparse Egocentric Video Data | Unknown | N/A | |
| Algebraic Positional Encodings | Unknown | N/A | |
| Functionally Constrained Algorithm Solves Convex Simple Bilevel Problem | Unknown | N/A | |
| Deterministic Uncertainty Propagation for Improved Model-Based Offline Reinforcement Learning | Unknown | N/A | |
| Universality in Transfer Learning for Linear Models | Unknown | N/A | |
| Unveiling and Mitigating Backdoor Vulnerabilities based on Unlearning Weight Changes and Backdoor Activeness | Unknown | N/A | |
| RETR: Multi-View Radar Detection Transformer for Indoor Perception | Unknown | N/A | |
| Visual Pinwheel Centers Act as Geometric Saliency Detectors | Unknown | N/A | |
| Model-Based Transfer Learning for Contextual Reinforcement Learning | Unknown | N/A | |
| VLG-CBM: Training Concept Bottleneck Models with Vision-Language Guidance | Unknown | N/A | |
| RA-PbRL: Provably Efficient Risk-Aware Preference-Based Reinforcement Learning | Unknown | N/A | |
| Rethinking Decoders for Transformer-based Semantic Segmentation: A Compression Perspective | Unknown | N/A | |
| Provable Editing of Deep Neural Networks using Parametric Linear Relaxation | Unknown | N/A | |
| Beyond Single Stationary Policies: Meta-Task Players as Naturally Superior Collaborators | Unknown | N/A | |
| Schur Nets: exploiting local structure for equivariance in higher order graph neural networks | Unknown | N/A | |
| Ultrafast classical phylogenetic method beats large protein language models on variant effect prediction | Unknown | N/A | |
| Bayesian Domain Adaptation with Gaussian Mixture Domain-Indexing | Unknown | N/A | |
| Rethinking the Capacity of Graph Neural Networks for Branching Strategy | Unknown | N/A | |
| Newton Informed Neural Operator for Solving Nonlinear Partial Differential Equations | Unknown | N/A | |
| Sketching for Distributed Deep Learning: A Sharper Analysis | Unknown | N/A | |
| Hierarchy-Agnostic Unsupervised Segmentation: Parsing Semantic Image Structure | Unknown | N/A | |
| BOLD: Boolean Logic Deep Learning | Unknown | N/A | |
| Pretraining with Random Noise for Fast and Robust Learning without Weight Transport | Unknown | N/A | |
| FERERO: A Flexible Framework for Preference-Guided Multi-Objective Learning | Unknown | N/A | |
| A New Neural Kernel Regime: The Inductive Bias of Multi-Task Learning | Unknown | N/A | |
| Visual Prompt Tuning in Null Space for Continual Learning | Unknown | N/A | |
| Asynchronous Perception Machine for Efficient Test Time Training | Unknown | N/A | |
| Connecting the Dots: LLMs can Infer and Verbalize Latent Structure from Disparate Training Data | Unknown | N/A | |
| Learning with Fitzpatrick Losses | Unknown | N/A | |
| Language Models as Zero-shot Lossless Gradient Compressors: Towards General Neural Parameter Prior Models | Unknown | N/A | |
| Addressing Hidden Confounding with Heterogeneous Observational Datasets for Recommendation | Unknown | N/A | |
| Disentangled Style Domain for Implicit $z$-Watermark Towards Copyright Protection | Unknown | N/A | |
| Understanding Emergent Abilities of Language Models from the Loss Perspective | Unknown | N/A | |
| Beyond task diversity: provable representation transfer for sequential multitask linear bandits | Unknown | N/A | |
| IR-CM: The Fast and General-purpose Image Restoration Method Based on Consistency Model | Unknown | N/A | |
| Continuous Spatiotemporal Events Decoupling through Spike-based Bayesian Computation | Unknown | N/A | |
| Temporally Consistent Atmospheric Turbulence Mitigation with Neural Representations | Unknown | N/A | |
| CIFD: Controlled Information Flow to Enhance Knowledge Distillation | Unknown | N/A | |
| DDK: Distilling Domain Knowledge for Efficient Large Language Models | Unknown | N/A | |
| pFedClub: Controllable Heterogeneous Model Aggregation for Personalized Federated Learning | Unknown | N/A | |
| Automated Multi-level Preference for MLLMs | Unknown | N/A | |
| Multi-modal Transfer Learning between Biological Foundation Models | Unknown | N/A | |
| MeshFormer: High-Quality Mesh Generation with 3D-Guided Reconstruction Model | Unknown | N/A | |
| Quantum Algorithms for Non-smooth Non-convex Optimization | Unknown | N/A | |
| Hamiltonian Monte Carlo Inference of Marginalized Linear Mixed-Effects Models | Unknown | N/A | |
| SVFT: Parameter-Efficient Fine-Tuning with Singular Vectors | Unknown | N/A | |
| MoVA: Adapting Mixture of Vision Experts to Multimodal Context | Unknown | N/A | |
| SPRINQL: Sub-optimal Demonstrations driven Offline Imitation Learning | Unknown | N/A | |
| Towards Estimating Bounds on the Effect of Policies under Unobserved Confounding | Unknown | N/A | |
| Transformation-Invariant Learning and Theoretical Guarantees for OOD Generalization | Unknown | N/A | |
| Learning to Reason via Program Generation, Emulation, and Search | Unknown | N/A | |
| When to Act and When to Ask: Policy Learning With Deferral Under Hidden Confounding | Unknown | N/A | |
| EASI: Evolutionary Adversarial Simulator Identification for Sim-to-Real Transfer | Unknown | N/A | |
| Improved Distribution Matching Distillation for Fast Image Synthesis | Unknown | N/A | |
| Any2Graph: Deep End-To-End Supervised Graph Prediction With An Optimal Transport Loss | Unknown | N/A | |
| Interpolating Item and User Fairness in Multi-Sided Recommendations | Unknown | N/A | |
| DiP-GO: A Diffusion Pruner via Few-step Gradient Optimization | Unknown | N/A | |
| Prism: A Framework for Decoupling and Assessing the Capabilities of VLMs | Unknown | N/A | |
| HardCore Generation: Generating Hard UNSAT Problems for Data Augmentation | Unknown | N/A | |
| einspace: Searching for Neural Architectures from Fundamental Operations | Unknown | N/A | |
| SnapKV: LLM Knows What You are Looking for Before Generation | Unknown | N/A | |
| Multi-Stage Predict+Optimize for (Mixed Integer) Linear Programs | Unknown | N/A | |
| Transformers as Game Players: Provable In-context Game-playing Capabilities of Pre-trained Models | Unknown | N/A | |
| Zipfian Whitening | Unknown | N/A | |
| Drift-Resilient TabPFN: In-Context Learning Temporal Distribution Shifts on Tabular Data | Unknown | N/A | |
| OctreeOcc: Efficient and Multi-Granularity Occupancy Prediction Using Octree Queries | Unknown | N/A | |
| Classifier-guided Gradient Modulation for Enhanced Multimodal Learning | Unknown | N/A | |
| A Label is Worth A Thousand Images in Dataset Distillation | Unknown | N/A | |
| SS1: Accelerating Inference with Fast and Expressive Sketch Structured Transform | Unknown | N/A | |
| A Layer-Wise Natural Gradient Optimizer for Training Deep Neural Networks | Unknown | N/A | |
| Towards a "Universal Translator" for Neural Dynamics at Single-Cell, Single-Spike Resolution | Unknown | N/A | |
| Phased Consistency Models | Unknown | N/A | |
| Yo'LLaVA: Your Personalized Language and Vision Assistant | Unknown | N/A | |
| Unified Mechanism-Specific Amplification by Subsampling and Group Privacy Amplification | Unknown | N/A | |
| AutoGuide: Automated Generation and Selection of Context-Aware Guidelines for Large Language Model Agents | Unknown | N/A | |
| Deep Graph Mating | Unknown | N/A | |
| In-Context Learning with Transformers: Softmax Attention Adapts to Function Lipschitzness | Unknown | N/A | |
| ShowMaker: Creating High-Fidelity 2D Human Video via Fine-Grained Diffusion Modeling | Unknown | N/A | |
| EMR-Merging: Tuning-Free High-Performance Model Merging | Unknown | N/A | |
| Contextual Linear Optimization with Bandit Feedback | Unknown | N/A | |
| MambaLLIE: Implicit Retinex-Aware Low Light Enhancement with Global-then-Local State Space | Unknown | N/A | |
| On the Scalability of GNNs for Molecular Graphs | Unknown | N/A | |
| CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching | Unknown | N/A | |
| Mitigating Reward Overoptimization via Lightweight Uncertainty Estimation | Unknown | N/A | |
| General Detection-based Text Line Recognition | Unknown | N/A | |
| Identifying Latent State-Transition Processes for Individualized Reinforcement Learning | Unknown | N/A | |
| IllumiNeRF: 3D Relighting Without Inverse Rendering | Unknown | N/A | |
| Improved Bayes Regret Bounds for Multi-Task Hierarchical Bayesian Bandit Algorithms | Unknown | N/A | |
| Unified Graph Augmentations for Generalized Contrastive Learning on Graphs | Unknown | N/A | |
| AMOR: A Recipe for Building Adaptable Modular Knowledge Agents Through Process Feedback | Unknown | N/A | |
| INDICT: Code Generation with Internal Dialogues of Critiques for Both Security and Helpfulness | Unknown | N/A | |
| Generalizing Weather Forecast to Fine-grained Temporal Scales via Physics-AI Hybrid Modeling | Unknown | N/A | |
| Public-data Assisted Private Stochastic Optimization: Power and Limitations | Unknown | N/A | |
| Lumina-Next: Making Lumina-T2X Stronger and Faster with Next-DiT | Unknown | N/A | |
| Scaling Retrieval-Based Language Models with a Trillion-Token Datastore | Unknown | N/A | |
| Happy: A Debiased Learning Framework for Continual Generalized Category Discovery | Unknown | N/A | |
| Is the MMI Criterion Necessary for Interpretability? Degenerating Non-causal Features to Plain Noise for Self-Rationalization | Unknown | N/A | |
| Autoregressive Policy Optimization for Constrained Allocation Tasks | Unknown | N/A | |
| An Equivalence Between Static and Dynamic Regret Minimization | Unknown | N/A | |
| Action Imitation in Common Action Space for Customized Action Image Synthesis | Unknown | N/A | |
| PIVOT-R: Primitive-Driven Waypoint-Aware World Model for Robotic Manipulation | Unknown | N/A | |
| Are We on the Right Way for Evaluating Large Vision-Language Models? | Unknown | N/A | |
| Exploring Token Pruning in Vision State Space Models | Unknown | N/A | |
| The Expressive Capacity of State Space Models: A Formal Language Perspective | Unknown | N/A | |
| Neural Assets: 3D-Aware Multi-Object Scene Synthesis with Image Diffusion Models | Unknown | N/A | |
| Robust Graph Neural Networks via Unbiased Aggregation | Unknown | N/A | |
| LoFiT: Localized Fine-tuning on LLM Representations | Unknown | N/A | |
| Neural Krylov Iteration for Accelerating Linear System Solving | Unknown | N/A | |
| Improving Generalization and Convergence by Enhancing Implicit Regularization | Unknown | N/A | |
| Causal discovery with endogenous context variables | Unknown | N/A | |
| On Feature Learning in Structured State Space Models | Unknown | N/A | |
| MoEUT: Mixture-of-Experts Universal Transformers | Unknown | N/A | |
| Voila-A: Aligning Vision-Language Models with User's Gaze Attention | Unknown | N/A | |
| DiffHammer: Rethinking the Robustness of Diffusion-Based Adversarial Purification | Unknown | N/A | |
| Cross-Device Collaborative Test-Time Adaptation | Unknown | N/A | |
| Minimum Entropy Coupling with Bottleneck | Unknown | N/A | |
| DiffPano: Scalable and Consistent Text to Panorama Generation with Spherical Epipolar-Aware Diffusion | Unknown | N/A | |
| Transformers Represent Belief State Geometry in their Residual Stream | Unknown | N/A | |
| MoLE: Enhancing Human-centric Text-to-image Diffusion via Mixture of Low-rank Experts | Unknown | N/A | |
| Detecting Brittle Decisions for Free: Leveraging Margin Consistency in Deep Robust Classifiers | Unknown | N/A | |
| Certified Robustness for Deep Equilibrium Models via Serialized Random Smoothing | Unknown | N/A | |
| Gradient Guidance for Diffusion Models: An Optimization Perspective | Unknown | N/A | |
| MediQ: Question-Asking LLMs and a Benchmark for Reliable Interactive Clinical Reasoning | Unknown | N/A | |
| Cambrian-1: A Fully Open, Vision-Centric Exploration of Multimodal LLMs | Unknown | N/A | |
| AlphaMath Almost Zero: Process Supervision without Process | Unknown | N/A | |
| Learning from Offline Foundation Features with Tensor Augmentations | Unknown | N/A | |
| Towards Combating Frequency Simplicity-biased Learning for Domain Generalization | Unknown | N/A | |
| Continual Counting with Gradual Privacy Expiration | Unknown | N/A | |
| Text2NKG: Fine-Grained N-ary Relation Extraction for N-ary relational Knowledge Graph Construction | Unknown | N/A | |
| A Closer Look at AUROC and AUPRC under Class Imbalance | Unknown | N/A | |
| D-LLM: A Token Adaptive Computing Resource Allocation Strategy for Large Language Models | Unknown | N/A | |
| 3D Gaussian Splatting as Markov Chain Monte Carlo | Unknown | N/A | |
| Enhancing Graph Transformers with Hierarchical Distance Structural Encoding | Unknown | N/A | |
| Interfacing Foundation Models' Embeddings | Unknown | N/A | |
| Test Where Decisions Matter: Importance-driven Testing for Deep Reinforcement Learning | Unknown | N/A | |
| CAT3D: Create Anything in 3D with Multi-View Diffusion Models | Unknown | N/A | |
| Frequency Adaptive Normalization For Non-stationary Time Series Forecasting | Unknown | N/A | |
| Achieving Tractable Minimax Optimal Regret in Average Reward MDPs | Unknown | N/A | |
| Alleviating Hallucinations in Large Vision-Language Models through Hallucination-Induced Optimization | Unknown | N/A | |
| Shaving Weights with Occam's Razor: Bayesian Sparsification for Neural Networks using the Marginal Likelihood | Unknown | N/A | |
| SymILO: A Symmetry-Aware Learning Framework for Integer Linear Optimization | Unknown | N/A | |
| Optical Diffusion Models for Image Generation | Unknown | N/A | |
| VideoTetris: Towards Compositional Text-to-Video Generation | Unknown | N/A | |
| EZ-HOI: VLM Adaptation via Guided Prompt Learning for Zero-Shot HOI Detection | Unknown | N/A | |
| Learning Commonality, Divergence and Variety for Unsupervised Visible-Infrared Person Re-identification | Unknown | N/A | |
| Density-based User Representation using Gaussian Process Regression for Multi-interest Personalized Retrieval | Unknown | N/A | |
| Extending Multi-modal Contrastive Representations | Unknown | N/A | |
| L4GM: Large 4D Gaussian Reconstruction Model | Unknown | N/A | |
| Mobile-Agent-v2: Mobile Device Operation Assistant with Effective Navigation via Multi-Agent Collaboration | Unknown | N/A | |
| GaussianCut: Interactive segmentation via graph cut for 3D Gaussian Splatting | Unknown | N/A | |
| Multi-Instance Partial-Label Learning with Margin Adjustment | Unknown | N/A | |
| Non-asymptotic Convergence of Training Transformers for Next-token Prediction | Unknown | N/A | |
| Towards a theory of how the structure of language is acquired by deep neural networks | Unknown | N/A | |
| MO-DDN: A Coarse-to-Fine Attribute-based Exploration Agent for Multi-Object Demand-driven Navigation | Unknown | N/A | |
| Equivariant spatio-hemispherical networks for diffusion MRI deconvolution | Unknown | N/A | |
| Generating compositional scenes via Text-to-image RGBA Instance Generation | Unknown | N/A | |
| DiffPhyCon: A Generative Approach to Control Complex Physical Systems | Unknown | N/A | |
| Meta 3D AssetGen: Text-to-Mesh Generation with High-Quality Geometry, Texture, and PBR Materials | Unknown | N/A | |
| Mind the Graph When Balancing Data for Fairness or Robustness | Unknown | N/A | |
| Pessimistic Backward Policy for GFlowNets | Unknown | N/A | |
| Simple and Effective Masked Diffusion Language Models | Unknown | N/A | |
| Binding in hippocampal-entorhinal circuits enables compositionality in cognitive maps | Unknown | N/A | |
| Progressive Exploration-Conformal Learning for Sparsely Annotated Object Detection in Aerial Images | Unknown | N/A | |
| D-CPT Law: Domain-specific Continual Pre-Training Scaling Law for Large Language Models | Unknown | N/A | |
| Fine Tuning Out-of-Vocabulary Item Recommendation with User Sequence Imagination | Unknown | N/A | |
| RoboMamba: Efficient Vision-Language-Action Model for Robotic Reasoning and Manipulation | Unknown | N/A | |
| Poseidon: Efficient Foundation Models for PDEs | Unknown | N/A | |
| HumanSplat: Generalizable Single-Image Human Gaussian Splatting with Structure Priors | Unknown | N/A | |
| Dense Connector for MLLMs | Unknown | N/A | |
| TimeXer: Empowering Transformers for Time Series Forecasting with Exogenous Variables | Unknown | N/A | |
| Unveiling User Satisfaction and Creator Productivity Trade-Offs in Recommendation Platforms | Unknown | N/A | |
| Towards Understanding Evolving Patterns in Sequential Data | Unknown | N/A | |
| HDR-GS: Efficient High Dynamic Range Novel View Synthesis at 1000x Speed via Gaussian Splatting | Unknown | N/A | |
| Efficient Multi-task LLM Quantization and Serving for Multiple LoRA Adapters | Unknown | N/A | |
| Neural Pfaffians: Solving Many Many-Electron Schrödinger Equations | Unknown | N/A | |
| The Importance of Online Data: Understanding Preference Fine-tuning via Coverage | Unknown | N/A | |
| Humanoid Locomotion as Next Token Prediction | Unknown | N/A | |
| CorDA: Context-Oriented Decomposition Adaptation of Large Language Models for Task-Aware Parameter-Efficient Fine-tuning | Unknown | N/A | |
| DeepITE: Designing Variational Graph Autoencoders for Intervention Target Estimation | Unknown | N/A | |
| Metric from Human: Zero-shot Monocular Metric Depth Estimation via Test-time Adaptation | Unknown | N/A | |
| Knowledge Composition using Task Vectors with Learned Anisotropic Scaling | Unknown | N/A | |
| UniAR: A Unified model for predicting human Attention and Responses on visual content | Unknown | N/A | |
| Stacking Your Transformers: A Closer Look at Model Growth for Efficient LLM Pre-Training | Unknown | N/A | |
| Vision Foundation Model Enables Generalizable Object Pose Estimation | Unknown | N/A | |
| Efficient Prompt Optimization Through the Lens of Best Arm Identification | Unknown | N/A | |
| Base of RoPE Bounds Context Length | Unknown | N/A | |
| On the Parameter Identifiability of Partially Observed Linear Causal Models | Unknown | N/A | |
| Preference Alignment with Flow Matching | Unknown | N/A | |
| RAGraph: A General Retrieval-Augmented Graph Learning Framework | Unknown | N/A | |
| Efficient Federated Learning against Heterogeneous and Non-stationary Client Unavailability | Unknown | N/A | |
| LuSh-NeRF: Lighting up and Sharpening NeRFs for Low-light Scenes | Unknown | N/A | |
| Spatio-Spectral Graph Neural Networks | Unknown | N/A | |
| Soft Prompt Threats: Attacking Safety Alignment and Unlearning in Open-Source LLMs through the Embedding Space | Unknown | N/A | |
| The Surprising Effectiveness of SP Voting with Partial Preferences | Unknown | N/A | |
| Mitigating Object Hallucination via Concentric Causal Attention | Unknown | N/A | |
| Mind's Eye of LLMs: Visualization-of-Thought Elicits Spatial Reasoning in Large Language Models | Unknown | N/A | |
| An In-depth Investigation of Sparse Rate Reduction in Transformer-like Models | Unknown | N/A | |
| SequentialAttention++ for Block Sparsification: Differentiable Pruning Meets Combinatorial Optimization | Unknown | N/A | |
| Model Decides How to Tokenize: Adaptive DNA Sequence Tokenization with MxDNA | Unknown | N/A | |
| Many-Shot In-Context Learning | Unknown | N/A | |
| MSPE: Multi-Scale Patch Embedding Prompts Vision Transformers to Any Resolution | Unknown | N/A | |
| Weisfeiler and Leman Go Loopy: A New Hierarchy for Graph Representational Learning | Unknown | N/A | |
| Spectral-Risk Safe Reinforcement Learning with Convergence Guarantees | Unknown | N/A | |
| GO4Align: Group Optimization for Multi-Task Alignment | Unknown | N/A | |
| Private Algorithms for Stochastic Saddle Points and Variational Inequalities: Beyond Euclidean Geometry | Unknown | N/A | |
| NeuroClips: Towards High-fidelity and Smooth fMRI-to-Video Reconstruction | Unknown | N/A | |
| Efficient Adversarial Training in LLMs with Continuous Attacks | Unknown | N/A | |
| Is Behavior Cloning All You Need? Understanding Horizon in Imitation Learning | Unknown | N/A | |
| ReplaceAnything3D: Text-Guided Object Replacement in 3D Scenes with Compositional Scene Representations | Unknown | N/A | |
| SwitchHead: Accelerating Transformers with Mixture-of-Experts Attention | Unknown | N/A | |
| QUEEN: QUantized Efficient ENcoding of Dynamic Gaussians for Streaming Free-viewpoint Videos | Unknown | N/A | |
| All-in-One Image Coding for Joint Human-Machine Vision with Multi-Path Aggregation | Unknown | N/A | |
| Personalized Steering of Large Language Models: Versatile Steering Vectors Through Bi-directional Preference Optimization | Unknown | N/A | |
| DistillNeRF: Perceiving 3D Scenes from Single-Glance Images by Distilling Neural Fields and Foundation Model Features | Unknown | N/A | |
| Exploring the Role of Large Language Models in Prompt Encoding for Diffusion Models | Unknown | N/A | |
| PCoTTA: Continual Test-Time Adaptation for Multi-Task Point Cloud Understanding | Unknown | N/A | |
| Energy-based Epistemic Uncertainty for Graph Neural Networks | Unknown | N/A | |
| Fair and Welfare-Efficient Constrained Multi-Matchings under Uncertainty | Unknown | N/A | |
| MambaTalk: Efficient Holistic Gesture Synthesis with Selective State Space Models | Unknown | N/A | |
| LION: Linear Group RNN for 3D Object Detection in Point Clouds | Unknown | N/A | |
| Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models and Time-Dependent Layer Normalization | Unknown | N/A | |
| ZipCache: Accurate and Efficient KV Cache Quantization with Salient Token Identification | Unknown | N/A | |
| Adaptable Logical Control for Large Language Models | Unknown | N/A | |
| FNP: Fourier Neural Processes for Arbitrary-Resolution Data Assimilation | Unknown | N/A | |
| ProSST: Protein Language Modeling with Quantized Structure and Disentangled Attention | Unknown | N/A | |
| Differentially Private Optimization with Sparse Gradients | Unknown | N/A | |
| DCDepth: Progressive Monocular Depth Estimation in Discrete Cosine Domain | Unknown | N/A | |
| Privacy without Noisy Gradients: Slicing Mechanism for Generative Model Training | Unknown | N/A | |
| OpenGaussian: Towards Point-Level 3D Gaussian-based Open Vocabulary Understanding | Unknown | N/A | |
| Learning-Augmented Algorithms for the Bahncard Problem | Unknown | N/A | |
| Geometric-Averaged Preference Optimization for Soft Preference Labels | Unknown | N/A | |
| Bridging semantics and pragmatics in information-theoretic emergent communication | Unknown | N/A | |
| Replicability in Learning: Geometric Partitions and KKM-Sperner Lemma | Unknown | N/A | |
| Exclusively Penalized Q-learning for Offline Reinforcement Learning | Unknown | N/A | |
| Bayesian-guided Label Mapping for Visual Reprogramming | Unknown | N/A | |
| BitsFusion: 1.99 bits Weight Quantization of Diffusion Model | Unknown | N/A | |
| Mobility-LLM: Learning Visiting Intentions and Travel Preference from Human Mobility Data with Large Language Models | Unknown | N/A | |
| Eye-gaze Guided Multi-modal Alignment for Medical Representation Learning | Unknown | N/A | |
| FlowLLM: Flow Matching for Material Generation with Large Language Models as Base Distributions | Unknown | N/A | |
| Learning-Augmented Algorithms with Explicit Predictors | Unknown | N/A | |
| Local Superior Soups: A Catalyst for Model Merging in Cross-Silo Federated Learning | Unknown | N/A | |
| $\text{Di}^2\text{Pose}$: Discrete Diffusion Model for Occluded 3D Human Pose Estimation | Unknown | N/A | |
| Can Graph Neural Networks Expose Training Data Properties? An Efficient Risk Assessment Approach | Unknown | N/A | |
| Extracting Training Data from Molecular Pre-trained Models | Unknown | N/A | |
| PTQ4DiT: Post-training Quantization for Diffusion Transformers | Unknown | N/A | |
| Global Distortions from Local Rewards: Neural Coding Strategies in Path-Integrating Neural Systems | Unknown | N/A | |
| ActSort: An active-learning accelerated cell sorting algorithm for large-scale calcium imaging datasets | Unknown | N/A | |
| Aligning Large Language Models with Representation Editing: A Control Perspective | Unknown | N/A | |
| Scanning Trojaned Models Using Out-of-Distribution Samples | Unknown | N/A | |
| Why are Visually-Grounded Language Models Bad at Image Classification? | Unknown | N/A | |
| DataComp-LM: In search of the next generation of training sets for language models | Unknown | N/A | |
| Federated Online Prediction from Experts with Differential Privacy: Separations and Regret Speed-ups | Unknown | N/A | |
| Decentralized Noncooperative Games with Coupled Decision-Dependent Distributions | Unknown | N/A | |
| Generating Code World Models with Large Language Models Guided by Monte Carlo Tree Search | Unknown | N/A | |
| What makes unlearning hard and what to do about it | Unknown | N/A | |
| Is A Picture Worth A Thousand Words? Delving Into Spatial Reasoning for Vision Language Models | Unknown | N/A | |
| Grammar-Aligned Decoding | Unknown | N/A |